In this paper, we investigate MiniMax Entropy models, a class of neural-symbolic models in which symbolic and subsymbolic features are seamlessly integrated. We show how these models recover classical algorithms from both deep learning and statistical relational learning. Novel hybrid settings are defined and experimentally explored, achieving state-of-the-art performance on collective classification, knowledge base completion, and graph (molecular) data generation.