Common Sense Knowledge Graphs (CSKGs), ISWC 2020 Tutorial

Tutorial Program

Our tutorial will provide a comprehensive overview of both top-down and bottom-up commonsense knowledge graphs. We will introduce each graph in turn, and contrast their design principles. Next, we will discuss the benefits and challenges for their integration and consolidation in a single CSKG, followed by our approach to perform this consolidation. Finally, we will discuss how to compute embeddings and ground text to such CSKGs, as well as how such CSKGs are used to reason on downstream question answering tasks. The tutorial will include demos based on publicly available resources and Jupyter Notebooks.

Agenda:

All times are given in Pacific Standard Time (PST).

Time	Length	Content	Slides
08:00 PST	1h 50min	Part I - Review of CSKGs	[.pptx] [.pdf]
	15 min	Introduction to commonsense knowledge (slides) - Pedro
	25 min	Review of top-down commonsense knowledge graphs (slides) - Mayank
	70 min	Review of bottom-up commonsense knowledge graphs (slides+demo) - Mayank, Filip, Pedro
	10 min	Break
10:00 PST	45 min	Part II - Integration and analysis	[.pptx] [.pdf]
	35 min	Consolidating commonsense graphs (slides) - Filip
	10 min	Consolidating commonsense graphs (demo) - Pedro
	10 min	Break
10:55 PST	1h 05min	Part III - Downstream use of CSKGs	[.pptx] [.pdf]
	35 min	Answering questions with CSKGs (slides) - Filip
	15 min	CSKG embeddings and grounding (demo) - Filip
	15 min	Wrap-up (slides) - Mayank

Learning Outcomes:

Familiarization with the state-of-the-art knowledge sources that provide commonsense knowledge
Understanding how these sources can be integrated into a single commonsense knowledge graph
Code for analysis of a consolidated CSKG, intrinsic operations (e.g., computation of embeddings), and application of the CSKG on standing commonsense tasks in a natural language form.

Presentation Style style will be informal and very hands-on. We will avoid representation specifics and terminology used by the individual graphs to the best extent possible (i.e. without losing rigor, or over-simplifying). Our slides will focus on visual intuition, use actual examples, present lessons learned from our latest implementations of algorithms and commonsense reasoning frameworks, and will be accompanied by demos and hands-on activities that participants will be able to do without requiring extensive platform-dependent setup. We will permit questions and interactions throughout the tutorial. All three of us will be mostly present throughout the tutorial but will be individually presenting our sections.

Background & Requirements

Capturing, representing, and leveraging commonsense knowledge has been a paramount for AI since its early days, cf. (McCarthy, 1960). In the light of the modern large (commonsense) knowledge graphs and various neural advancements, the recently introduced DARPA Machine Common Sense program represents a new effort to understand commonsense knowledge through question-answering evaluation benchmarks. Intuitively, graphs of (commonsense) knowledge are essential in such tasks in order to inject background knowledge that humans possess and apply, but machines cannot access or distill directly in communication.

Our team has been working on several aspects of commonsense knowledge found in knowledge graphs. Firstly, we have been integrating a number of knowledge graphs in a single commonsense knowledge graph, including: ConceptNet, WordNet, Visual Genome, FrameNet, ATOMIC, WebChild, Wikidata, and Cyc. Secondly, we have been building software to perform intrinsic operations on the consolidated CSKG, such as generating embeddings and performing knowledge graph completion. Thirdly, we have a framework that allows us to integrate (parts of) our CSKG into a reasoning system that answers questions phrased in natural language. These questions come from several commonsense evaluation datasets, focused on social, physical, visual, and situational reasoning.

It is important for the semantic web community to keep pace with this new ‘wave’ of commonsense knowledge representation and reasoning for downstream applications. Aspects that we seek to address during through our presentation and activities are: what is a commonsense knowledge graph? Which kind of knowledge is captured by existing CSKGs? What are their major strengths and weaknesses? How can they be integrated into a graph that is more than ‘a sum of its parts’? How can this graph be refined/enriched with more semantics or missing information? How can it be applied on downstream applications in natural language understanding? Our tutorial will be very practical, and will answer these questions in a focused, foundational manner that all Semantic Web researchers will be able to easily follow.

Prior knowledge expected from participants (beyond fairly basic Python 3 skills, and familiarity with Semantic Web concepts like RDF) will be minimal. Some knowledge of machine learning, including basic concepts like training, testing and validating, feature engineering etc. will be helpful but are not absolute prerequisites, as we will not go into advanced machine learning math or optimization. Additionally, where possible, we will introduce basic machine learning concepts so that everyone has an opportunity to follow along. Participants are not expected to have any knowledge of answering natural language commonsense questions.

We will be using our own computers for presenting demos and PowerPoint slides and only require equipment to facilitate such projection for an extended period of time (e.g., projector, table, power outlet). We (and also the participants) will require an internet/wifi connection to access the tutorial material. There are no audio elements to our presentation. All demos and hands-on activities will be doable on a reasonable laptop by interested participants. We will also bring extra USB storage devices with copies of code, programs and slides in case some participants did not download the material prior to the tutorial.

Expected Coverage

Tools and data:

Our latest commonsense knowledge graph (link to download the graph will be shared with the participants)
Code for analysing the contents of CSKG
Code for computing graph and text embeddings
Code for grounding natural questions to CSKG

Bibliography:

Gunning, D. (2018). Machine common sense concept paper. arXiv preprint arXiv:1810.07528
Speer, R., Chin, J., & Havasi, C. (2017, February). Conceptnet 5.5: An open multilingual graph of general knowledge. In Thirty-First AAAI Conference on Artificial Intelligence.
Krishna, R., Zhu, Y., Groth, O., Johnson, J., Hata, K., Kravitz, J., ... & Bernstein, M. S. (2017). Visual genome: Connecting language and vision using crowdsourced dense image annotations. International Journal of Computer Vision, 123(1), 32-73.
Miller, G. A. (1995). WordNet: a lexical database for English. Communications of the ACM, 38(11), 39-41.
Ilievski, F., Szekely, P., & Schwabe, D. (2020). Commonsense Knowledge in Wikidata. Wikidata workshop at ISWC 2020.
Ilievski, F., Szekely, P., Cheng, J., Zhang, F., & Qasemi, E. (2020). Consolidating Commonsense Knowledge. arXiv preprint arXiv:2006.06114
Speer, R., & Chin, J. (2016). An ensemble method to produce high-quality word embeddings. arXiv preprint arXiv:1604.01692.
Zellers, R., Holtzman, A., Bisk, Y., Farhadi, A., & Choi, Y. (2019). HellaSwag: Can a Machine Really Finish Your Sentence?. arXiv preprint arXiv:1905.07830.
Sap, M., Rashkin, H., Chen, D., LeBras, R., & Choi, Y. (2019). Socialiqa: Commonsense reasoning about social interactions. arXiv preprint arXiv:1904.09728.
Wang, P., Peng, N., Ilievski, F., Szekely, P., & Ren, X. (2020). Connecting the Dots: A Knowledgeable Path Generator for Commonsense Question Answering. EMNLP Findings 2020.
Zellers, R., Bisk, Y., Farhadi, A., & Choi, Y. (2019). From recognition to cognition: Visual commonsense reasoning. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp. 6720-6731).

Organizers

Tutorial Program

Background & Requirements

Expected Coverage