
Neural Generation of Textual Summaries from Knowledge Base Triples
Description
Alles über E-Books | Antworten auf Fragen rund um E-Books, Kopierschutz und Dateiformate finden Sie in unserem Info- & Hilfebereich.
An end-to-end trainable architecture is proposed, which encodes the information from a set of knowledge graph triples into a vector of fixed dimensionality, and generates a textual summary by conditioning the output on this encoded vector. Different methodologies for building the required data-to-text corpora are explored to train and evaluate the performance of the approach. Attention is first focused on generating biographies, and the author demonstrates that the technique is capable of scaling to domains with larger and more challenging vocabularies.
The applicability of the technique for the generation of open-domain Wikipedia summaries in Arabic and Esperanto - two under-resourced languages - is then discussed, and a set of community studies, devised to measure the usability of the automatically generated content by Wikipedia readers and editors, is described.
Finally, the book explains an extension of the original model with a pointer mechanism that enables it to learn to verbalise in a different number of ways the content from the triples while retaining the capacity to generate words from a fixed target vocabulary. The evaluation of performance using a dataset encompassing all of English Wikipedia is described, with results from both automatic and human evaluation both of which highlight the superiority of the latter approach as compared to the original architecture.
More details
Other editions
Additional editions
Content
- Intro
- Title Page
- Acknowledgements
- Abstract
- Notation
- Contents
- Introduction
- Aims and Objectives
- Contributions
- Thesis Structure
- Background
- Natural Language Generation
- Neural Networks in Natural Language Processing
- Language Modelling with Neural Networks
- Neural Networks as Generative Models
- Encoder-Decoder Framework
- Summary
- Evaluation Methodology
- Evaluation Methods
- Automatic Evaluation
- Human Evaluation
- Evaluating Multilingual Summaries from the Perspective of Wikipedia Readers and Editors
- Baselines
- Random
- Kneser-Ney (KN) Language Model
- Information Retrieval (IR)
- Machine Translation (MT)
- Summary
- Building Corpora of Natural Language Texts Aligned with Knowledge Base Triples
- Automatically Aligning Texts and Triples
- Wikipedia Summaries
- Knowledge Base Triples
- Aligned Corpora
- Biographies
- The D3 Corpus
- Building Multilingual Corpora
- Discussion
- Summary
- Neural Wikipedian: Generating Biographies from Knowledge Base Triples
- The Model
- Triple Encoder
- Decoder
- Property-Type Placeholders
- Model Training
- Generating Summaries
- Dataset Preparation
- Modelling the Textual Summaries
- Modelling the Input Triples
- Experiments
- Training Details
- Automatic Evaluation
- Human Evaluation
- Discussion
- Conclusion
- Learning to Generate Wikipedia Summaries for Underserved Languages
- Model
- Property Placeholders
- Dataset Preparation
- Experiments
- Training Details
- Automatic Evaluation
- Community Study
- Recruitment
- Readers' Evaluation
- Editors' Evaluation
- Conclusion
- Point at the Triple: Improving Neural Wikipedian with a Pointer Mechanism
- The Model
- Decoder
- Triple Encoder
- Dynamically Expanding the Vocabulary
- Summarising By Pointing and Generating
- Dataset Preparation
- Experiments
- Training Details
- Automatic Evaluation
- Human Evaluation
- Summary and Discussion
- Conclusion and Future Work
- Summary and Conclusions
- Current Limitations and Future Work
- Generation of Multi-Sentence Summaries
- The Main Entity of Interest
- Using the S3 Corpus
- Bibliography
System requirements
File format: PDF
Copy protection: Watermark-DRM (Digital Rights Management)
System requirements:
- Computer (Windows; MacOS X; Linux): Use the free software Adobe Reader, Adobe Digital Editions, or any other PDF viewer of your choice (see eBook Help).
- Tablet/Smartphone (Android; iOS): Install the free app Adobe Digital Editions or another reading app for eBooks, e.g., PocketBook (see eBook Help).
- E-reader: Bookeen, Kobo, Pocketbook, Sony, Tolino and many more (only limited: Kindle).
The file format PDF always displays a book page identically on any hardware. This makes PDF suitable for complex layouts such as those used in textbooks and reference books (images, tables, columns, footnotes). Unfortunately, on the small screens of e-readers or smartphones, PDFs are rather annoying, requiring too much scrolling.
This eBook uses Watermark-DRM, a „soft” copy protection. This means that there are no technical restrictions to prevent illegal distribution. However, there is a personalised watermark embedded in the eBook that can be used to identify the purchaser of the eBook in the event of misuse and to provide evidence for legal purposes.
For more information, see our eBook Help page.