Identity of Long-tail Entities in Text
Filip Ilievski(Editor)
IOS Press,US
1st Edition
Published on 29. November 2019
Book
Paperback/Softback
220 pages
978-1-64368-042-2 (ISBN)
Description
The digital era has generated a huge amount of data on the identities (profiles) of people, organizations and other entities in a digital format, largely consisting of textual documents such as news articles, encyclopedias, personal websites, books, and social media. Identity has thus been transformed from a philosophical to a societal issue, one requiring robust computational tools to determine entity identity in text.
Computational systems developed to establish identity in text often struggle with long-tail cases. This book investigates how Natural Language Processing (NLP) techniques for establishing the identity of long-tail entities - which are all infrequent in communication, hardly represented in knowledge bases, and potentially very ambiguous - can be improved through the use of background knowledge. Topics covered include: distinguishing tail entities from head entities; assessing whether current evaluation datasets and metrics are representative for long-tail cases; improving evaluation of long-tail cases; accessing and enriching knowledge on long-tail entities in the Linked Open Data cloud; and investigating the added value of background knowledge ("profiling") models for establishing the identity of NIL entities.
Providing novel insights into an under-explored and difficult NLP challenge, the book will be of interest to all those working in the field of entity identification in text.
Computational systems developed to establish identity in text often struggle with long-tail cases. This book investigates how Natural Language Processing (NLP) techniques for establishing the identity of long-tail entities - which are all infrequent in communication, hardly represented in knowledge bases, and potentially very ambiguous - can be improved through the use of background knowledge. Topics covered include: distinguishing tail entities from head entities; assessing whether current evaluation datasets and metrics are representative for long-tail cases; improving evaluation of long-tail cases; accessing and enriching knowledge on long-tail entities in the Linked Open Data cloud; and investigating the added value of background knowledge ("profiling") models for establishing the identity of NIL entities.
Providing novel insights into an under-explored and difficult NLP challenge, the book will be of interest to all those working in the field of entity identification in text.
More details
Series
Language
English
Place of publication
London
Netherlands
Publishing group
IOS Press
Target group
Professional and scholarly
ISBN-13
978-1-64368-042-2 (9781643680422)
Copyright in bibliographic data is held by Nielsen Book Services Limited or its licensors: all rights reserved.
Schweitzer Classification
Other editions
Additional editions

Filip Ilievski
Identity of Long-tail Entities in Text
E-Book
11/2019
1st Edition
IOS Press,US
€97.99
Available for download