bokomslag Identity of Long-tail Entities in Text

Identity of Long-tail Entities in Text

Filip Ilievski

Pocket

1309:-

Funktionen begränsas av dina webbläsarinställningar (t.ex. privat läge).

Tillfälligt slut online – klicka på "Bevaka" för att få ett mejl så fort varan går att köpa igen.

  • 220 sidor
  • 2019
The digital era has generated a huge amount of data on the identities (profiles) of people, organizations and other entities in a digital format, largely consisting of textual documents such as news articles, encyclopedias, personal websites, books, and social media. Identity has thus been transformed from a philosophical to a societal issue, one requiring robust computational tools to determine entity identity in text. Computational systems developed to establish identity in text often struggle with long-tail cases. This book investigates how Natural Language Processing (NLP) techniques for establishing the identity of long-tail entities - which are all infrequent in communication, hardly represented in knowledge bases, and potentially very ambiguous - can be improved through the use of background knowledge. Topics covered include: distinguishing tail entities from head entities; assessing whether current evaluation datasets and metrics are representative for long-tail cases; improving evaluation of long-tail cases; accessing and enriching knowledge on long-tail entities in the Linked Open Data cloud; and investigating the added value of background knowledge ("profiling") models for establishing the identity of NIL entities. Providing novel insights into an under-explored and difficult NLP challenge, the book will be of interest to all those working in the field of entity identification in text.
  • Författare: Filip Ilievski
  • Format: Pocket/Paperback
  • ISBN: 9781643680422
  • Språk: Engelska
  • Antal sidor: 220
  • Utgivningsdatum: 2019-11-29
  • Förlag: SAGE Publications