Named entities (NEs) provide critical information for many NLP applications. Named entity recognition and classification (NERC) in text is recognized as one of the important sub-tasks of Information Extraction (IE). The seven papers in this volume cover a range of aspects of NERC research. Nadeau & Sekine provide an extensive survey of past NERC technologies, which should be a very useful resource for researchers new to the field. Smith & Osborne describe a machine learning model that aims to address the over-fitting problem. Mazur & Dale tackle the common problem of conjunctions in NEs: because conjunctions are often part of NEs or appear close to them, this is an important practical issue. A further three papers describe analyses and implementations of NERC for different languages: Spanish (Galicia-Haro & Gelbukh), Bengali (Ekbal, Naskar & Bandyopadhyay), and Serbian (Vitas, Krstev & Maurel). Finally, Steinberger & Pouliquen report on a real-world Web application in which multilingual NERC technology is used to identify occurrences of people, locations and organizations in newspapers in different languages.
The contributions to this volume were previously published in Lingvisticae Investigationes 30:1 (2007).
Dimensions
Height: 245 mm
Width: 164 mm
ISBN-13
978-90-272-2249-7 (9789027222497)
1. Foreword
2. Articles
3. A survey of named entity recognition and classification (by Nadeau, David)
4. Diversity in logarithmic opinion pools (by Smith, Andrew D.M.)
5. Handling conjunctions in named entities (by Mazur, Pawel)
6. Complex named entities in Spanish texts: Structures and properties (by Galicia-Haro, Sofia N.)
7. Named Entity Recognition and transliteration in Bengali (by Ekbal, Asif)
8. A note on the semantic and morphological properties of proper names in the Prolex project (by Vitas, Dusko)
9. Cross-lingual Named Entity Recognition (by Steinberger, Ralf)
10. Index