QAWiki:Guide/Mentions

From QAWiki
Revision as of 00:42, 4 December 2022 by Aidan (talk | contribs) (first draft of initial guidelines)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigation Jump to search

Importance of mentions

What is a mention? Take the question "What is the capital of Ireland?". We can identify two types of mentions in this case: (1) the phrase Ireland is an entity mention that refers to the Wikidata entity (aka item) Q27; (2) the phrase capital is a property mention that refers to the Wikidata property P36. Adding mentions relates (sub)phrases of questions and question alises to elements of knowledge-bases such as Wikidata.

If you can add just a question, or a question and a query, then that's very welcome. However, adding mentions can be very useful for Question Answering systems in order to know which parts of the question address which entity or property on knowledge-bases such as Wikidata. This is a non-trivial task to do automatically, and in a highly precise and complete way. For example, it may not be immediately obvious if Ireland refers to Q27, Q22890 or Q1140152. Knowing how entities or properties are mentioned enable Question Answering systems to better generalise to answering similar questions that involve, for example, a different entity and property. Here we will provide guidelines on how to add mentions linking phrases of questions to specific entities and properties on Wikidata.

What about other knowledge-bases?

QAWiki is open to collecting questions, mentions and queries to enable question answering over other open knowledge bases. However, please do keep in mind that Wikidata offers links to a wide range of knowledge bases, including DBpedia, Wikipedia, YAGO, and many, many, more besides. Hence mentions, specifically, can be easily translated via these links to these other knowledge bases. Aside from this, Wikidata offers a wider selection of entities and (curated) properties when compared to these other sources. For this reason, we currently recommend focusing on adding mentions for Wikidata. In future, we may look at ways in which mentions for other knowledge bases can be added automatically. If there are other open knowledge bases without links from Wikidata, we would rather favour adding the links to Wikidata, as they will be much more reusable (not just for QAWiki).

General guidelines for mentions

We start with some general guidelines for mentions:

  • We recommend to be quite liberal when adding mentions. There are in some cases many ways to write the same query that may use the same combinations of entities and mentions. For example, for the question "Which U.S. president had the most spouses?", we can offer entity mentions for U.S. (Q30) and president (Q30461) even if there is a specific entity for U.S. president (Q11696). Likewise, for a mention like wives, you can add the property P26 (spouse), which though not expressing exactly the same property, is likely to be used in such a query given the lack of a specific wife property in Wikidata.
  • The same phrase can have multiple relevant mentions, and can mix entity and property mentions. For example, in the question "What is the capital of Ireland?", the phrase capital corresponds not only to the property P36, but also the entity Q5119 (capital city).

Labels for entities