Document Processing and Retrieval: TEXPROS focuses on the design and implementation of TEXPROS (a TEXt PROcessing System), a personal, customizable, intelligent office information and document processing system for text-oriented documents. The system supports the storage, classification, categorization, retrieval and reproduction of documents, as well as extracting, browsing, retrieving and synthesizing information from a variety of documents. In a multi-user or distributed environment, TEXPROS requires specific protocols for extracting, storing, transmitting and exchanging information.
The authors have implemented TEXPROS using a variety of techniques and tools, including object-oriented programming, Tcl/Tk and the X Window System. The system can serve many applications, such as digital libraries, software documentation and information delivery.
Audience: Provides in-depth, state-of-the-art coverage of information processing and retrieval, and of documentation, for professionals such as database specialists, information systems and software developers, and information providers.
Target audience
For professional and research use
Research
Product note
Thread-stitched binding
Cloth binding
Dimensions
Height: 234 mm
Width: 156 mm
Thickness: 19 mm
ISBN-13
978-0-7923-9644-4 (9780792396444)
DOI
10.1007/978-1-4613-1295-6
1 Introduction
  1.1 TEXPROS: An Overall Organization
  1.2 Organization of the Book
2 Data Model and Algebra for Office Document
  2.1 Related Work
  2.2 Formal Framework of the D_model
  2.3 Formalism of the D_algebra
  2.4 Discussion
  2.5 Summary
3 Document Categorization
  3.1 Data Model Concepts
  3.2 The Reconstruction Problem
  3.3 Agent-Based Filing Architecture
  3.4 Summary
4 Document Classification and Information Extraction
  4.1 Document Classification and Information Extraction Techniques
  4.2 Document Structures
  4.3 Organization of Document Classification and Information Extraction Components
  4.4 Document Layout Analysis
  4.5 Conceptual Analysis on Structured Part of Document
  4.6 Content Analysis on Unstructured Part of Document
  4.7 Summary
5 Knowledge-Based Document Classification
  5.1 Architecture of Knowledge-Based Document Classification
  5.2 Knowledge Acquisition Tool (KAT)
  5.3 Document Type Tree Inference Engine
  5.4 Classification Handler (CH)
  5.5 Summary
6 Document Retrieval
  6.1 Document Retrieval Techniques for TEXPROS
  6.2 Current Research on Document Retrieval
  6.3 Overall Architecture of Retrieval System
  6.4 Summary
7 Query Transformation
  7.1 System Catalog - The Representation of Domain Knowledge and Meta-data Knowledge
  7.2 Query Transformation Mechanism
  7.3 Summary
8 Browser
  8.1 Object Network
  8.2 Architecture of Browser
  8.3 Browsing in TEXPROS
  8.4 Topic Interpreter
  8.5 Object Network Constructor
  8.6 Examples
  8.7 Summary
9 Generalizer
  9.1 Introduction to Generalizer
  9.2 Generalization and Substitution Concepts
  9.3 Generalization Algorithm for Detecting Erroneous Presuppositions
  9.4 Giving Cooperative Responses by Substitutions
  9.5 Summary
References