
Text, Speech, and Dialogue
Description
Alles über E-Books | Antworten auf Fragen rund um E-Books, Kopierschutz und Dateiformate finden Sie in unserem Info- & Hilfebereich.
This book constitutes the refereed proceedings of the 18th International Conference on Text, Speech and Dialogue, TSD 2015, held in Pilsen, Czech Republic, in September 2015.
The 67 papers presented together with 3 invited papers were carefully reviewed and selected from 138 submissions. They focus on topics such as corpora and language resources; speech recognition; tagging, classification and parsing of text and speech; speech and spoken language generation; semantic processing of text and speech; integrating applications of text and speech processing; automatic dialogue systems; as well as multimodal techniques and modelling.
More details
Other editions
Additional editions

Content
- Intro
- Preface
- Organization
- About Plzen (Pilsen)
- Contents
- Invited Talks
- Speech Analysis in the Big Data Era
- 1 Introduction
- 2 Data: The Availability-Shock
- 3 On Efficiency: Learning Cooperatively
- 3.1 Transfer Learning
- 3.2 (Dynamic) Active Learning
- 3.3 Semi-supervised Learning
- 3.4 Cooperative Learning
- 4 On Decision-Making: Learning Confidence Measures
- 4.1 Agreement-Based Confidence Measures
- 4.2 Learning Errors
- 5 On Seeing the Larger Picture: Learning Multiple Targets
- 6 On Big Data: Distribution
- 7 Conclusion
- References
- I Conference Papers
- A Multi-criteria Text Selection Approach for Building a Speech Corpus
- 1 Introduction
- 2 Literature Review
- 3 Proposed Approach
- 4 Experimental Results
- 5 Conclusion
- References
- Experiment with GMM-Based Artefact Localization in Czech Synthetic Speech
- 1 Introduction
- 2 Method
- 3 Material, Experiments, and Results
- 4 Discussion and Conclusion
- References
- Tuned and GPU-Accelerated Parallel Data Mining from Comparable Corpora
- 1 Introduction
- 2 State of the Art
- 3 Parallel Data Mining
- 4 Yalign and Improvements
- 5 Evaluation of Obtained Comparable Corpora
- 6 Conclusions
- References
- Investigating Genre and Method Variation in Translation Using Text Classification
- 1 Introduction
- 2 Related Work and Theoretical Background
- 3 Methods
- 3.1 Data
- 3.2 Algorithms
- 4 Results
- 4.1 Genres and Methods
- 4.2 Translation Methods
- 4.3 Different Genres: Different Language?
- 4.4 Human vs. Machine
- 4.5 Feature Analysis
- 5 Conclusion and Outlook
- References
- Extracting Characteristics of Fashion Models from Magazines for Item Recommendation
- 1 Introduction
- 2 Previous Research
- 3 Proposed Method
- 3.1 Acquiring Item Name and Description
- 3.2 Finding the Features of the Model from an Item Name
- 3.3 Finding the Feature of the Model from the Item Description
- 3.4 Creating a Fashion Style Vector
- 4 Experiment
- 4.1 Method 1 Data
- 4.2 Method 2 Data
- 4.3 Image Score
- 4.4 Evaluation
- 4.5 Evaluation Results
- 4.6 Discussion
- 5 Conclusion
- References
- Segment Representations in Named Entity Recognition
- 1 Introduction
- 2 Segment Representations
- 3 Related Work
- 4 NER System
- 5 Corpora
- 6 Experiments
- 6.1 Standard Partitioning
- 6.2 Significance Tests
- 6.3 Discussion
- 7 Conclusion
- References
- Analyzing Text Coherence via Multiple Annotation in the Prague Dependency Treebank
- 1 Introduction
- 2 Aim of Work
- 3 Language Material -- the Prague Dependency Treebank
- 3.1 Annotation of Sentence Information Structure
- 3.2 Annotation of Bridging Anaphora
- 4 Methods
- 5 Results
- 6 Conclusion
- References
- Automatic Detection of Parkinson's Disease in Reverberant Environments
- 1 Introduction
- 2 Experimental Setup
- 2.1 Databases
- 2.2 Speech Tasks
- 2.3 Reverberation
- 2.4 Preprocessing and Characterization of Unvoiced Frames
- 2.5 Classification
- 3 Results and Discussion
- 4 Conclusions
- References
- Automatic Detection of Parkinson's Disease from Compressed Speech Recordings
- 1 Introduction
- 2 Experimental Setup
- 2.1 Speech Recordings
- 2.2 Encoding - Compression
- 2.3 Pre-processing and Voiced/Unvoiced Segmentation
- 2.4 Characterization
- 2.5 Classification
- 3 Results
- 4 Conclusion
- References
- Time Dependent ARMA for Automatic Recognition of Fear-Type Emotions in Speech
- 1 Introduction
- 2 Materials and Methods
- 2.1 Segmentation
- 2.2 SP-TARMA Modeling
- 2.3 Feature Estimation
- 2.4 Classification
- 3 Experimental Framework and Results
- 3.1 Datasets
- 3.2 Experimental Setup
- 3.3 Results and Discussion
- 4 Conclusion
- References
- Using Lexical Stress in Authorship Attribution of Historical Texts
- 1 Introduction
- 2 Lexical Stress
- 2.1 Motivation
- 2.2 Extracting Lexical Stress from Text
- 3 Machine Learning Algorithms
- 3.1 Learning Methods
- 3.2 Features Used
- 3.3 Evaluating Performance
- 3.4 Weighted Voting
- 4 Authorship Attribution with Lexical Stress Pattern Vectors
- 4.1 Experiments with Lexical Stress Patterns Only
- 4.2 Combining Stress with Other Lexical Features
- 5 Conclusion and Future Work
- References
- Eye Gaze Analyses in L1 and L2 Conversations: Difference in Interaction Structures
- 1 Introduction
- 2 Mutimodal Corpus
- 2.1 Participants
- 2.2 Experimental Setup
- 2.3 Procedure
- 2.4 Annotations
- 2.5 Transcription
- 3 Analyses
- 3.1 Analysis 1: Listener's Eye Gaze Activities
- 3.2 Analysis 2: Speaker's Eye Gaze Activities
- 3.3 Analysis 3: Eye Gaze Activities Among the Participants
- 4 Discussion
- 5 Conclusion
- References
- Word Categorization of Corporate Annual Reports for Bankruptcy Prediction by Machine Learning Methods
- 1 Introduction
- 2 Data Preprocessing
- 3 Experimental Results
- 4 Conclusion
- References
- Novel Multi-word Lists for Investors' Decision Making
- 1 Introduction
- 2 Research Methodology
- 3 Data Collection and Description
- 4 Relationship Between Word Lists and Investment Indicators
- 5 Conclusion
- References
- Topic Classifier for Customer Service Dialog Systems
- 1 Introduction
- 2 The Task and Corpus
- 3 Semantic Parsing as a Previous Step to Classification
- 3.1 The Semantic Grammar
- 3.2 Word-Level Grammars
- 4 Entropy Modeling
- 5 Classification
- 6 Experiments and Results
- 7 Conclusions and Future Work
- References
- Defining a Global Adaptive Duration Target Cost for Unit Selection Speech Synthesis
- 1 Introduction
- 2 The TTS System
- 3 An Adaptive Duration Target Cost
- 3.1 Neural Network
- 3.2 Duration Target Cost
- 4 Experiments
- 4.1 Corpus Description
- 4.2 Objective Analysis
- 4.3 Subjective Evaluation
- 5 Conclusion
- References
- Adaptive Speech Synthesis of Albanian Dialects
- 1 Introduction
- 2 The Albanian Language
- 3 Grapheme-to-Phoneme Conversion
- 4 Recording Script
- 5 Recording
- 6 Voice Building
- 7 Evaluation
- 7.1 Design
- 7.2 Results
- 8 Conclusion
- References
- Language-Independent Age Estimation from Speech Using Phonological and Phonemic Features
- 1 Introduction
- 2 Test Data and Subjective Evaluation
- 3 Features Computed from the Speech Data
- 4 Results and Discussion
- References
- LecTrack: Incremental Dialog State Tracking with Long Short-Term Memory Networks
- 1 Introduction
- 2 Dialog State Tracking
- 3 LSTM Dialog State Tracker
- 3.1 Model
- 3.2 Training
- 4 Experiments
- 4.1 Dataset
- 4.2 Baseline
- 4.3 Data Preprocessing
- 4.4 Experimental Methodology
- 4.5 Results
- 5 Discussion
- 6 Related Work
- 7 Conclusion
- References
- Automatic Construction of Domain Specific Sentiment Lexicons for Hungarian
- 1 Introduction
- 2 Sentiment Lexica
- 2.1 Translating a Foreign Lexicon
- 2.2 Bootstrapping Sentiment Lexicon
- 2.3 Extending Lexicons
- 3 Data
- 4 Results
- 5 Conclusions
- References
- Investigation of Word Senses over Time Using Linguistic Corpora
- 1 Introduction
- 2 Related Work
- 3 Topic Models
- 4 Topic Models over Time
- 5 Experiments
- 6 Conclusion and Future Work
- References
- Dependency-Based Problem Phrase Extraction from User Reviews of Products
- 1 Introduction
- 2 Related Work
- 3 Target Phrase Extraction
- 3.1 Dependency Relations for Target Extraction
- 3.2 Calculating Semantic Relatedness of Problem Targets
- 3.3 Dependency-Based Approach
- 4 Evaluation and Experiments
- 4.1 Difficulties in Evaluation
- 5 Conclusion
- References
- Semantic Splitting of German Medical Compounds
- 1 Introduction
- 2 Related Work
- 3 Semantic Compound Splitting Approach
- 3.1 Extract Constituent Candidates from Corpus
- 3.2 Generate Corpus-Based Split Options
- 3.3 Dismiss Split Options Based on POS Tags and Suffixed
- 3.4 Generate Split Options that Include Unknown Constituents
- 3.5 Disambiguation of Split Options
- 4 Evaluation
- 4.1 Evaluation Resources
- 4.2 Evaluation Technique
- 4.3 Evaluation Results
- 4.4 Discussion and Future Work
- 5 Conclusion
- References
- A Comparison of MT Methods for Closely Related Languages: A Case Study on Czech -- Slovak and Croatian -- Slovenian Language Pairs
- 1 Introduction
- 2 State of the Art
- 3 Methodology
- 3.1 Experiment Outline
- 3.2 Test Data
- 4 Results
- 4.1 Czech -- Slovak
- 4.2 Croatian -- Slovenian
- 4.3 Inter-rater Agreement
- 5 Conclusions and Further Work
- References
- Ideas for Clustering of Similar Models of a Speaker in an Online Speaker Diarization System
- 1 Introduction
- 2 The Diarization System
- Feature Extraction and Voice Activity Detection.
- 3 Offline Clustering
- 4 Online Clustering
- 4.1 Merging Multiple GMMs into a Single One
- 4.2 Treating Multiple GMMs as Belonging to a Single Speaker
- 5 Experiments
- 6 Conclusion
- References
- Simultaneously Trained NN-Based Acoustic Model and NN-Based Feature Extractor
- 1 Introduction
- 2 Neural-Network-Based Feature Extraction
- 3 Mean Normalization, Variance Normalization and Delta Coefficients
- 4 Neural-Network-Based Acoustic Model
- 5 Experiments and Results
- 6 Conclusion and Future Work
- References
- Named Entity Recognition for Mongolian Language
- 1 Introduction
- 2 Mongolian Names
- 3 The NER System
- 3.1 Preprocessing
- 3.2 Feature Generation
- 3.3 Classifiers
- 3.4 Ensembling
- 4 Resource Building
- 5 Experimental Results and Discussion
- 5.1 Results in Edit Distance Functions
- 5.2 Results in Feature Selection Experiments
- 5.3 Results in the Classifier Ensemble
- References
- Comparing Semantic Models for Evaluating Automatic Document Summarization
- 1 Introduction
- 2 Dataset
- 2.1 Data Annotation
- 3 Examined Language Models
- 3.1 TfIdf
- 3.2 LSA
- 3.3 LDA
- 3.4 Word2Vec
- 3.5 Doc2Vec
- 4 Evaluation
- 5 Conclusion
- References
- Automatic Labeling of Semantic Roles with a Dependency Parser in Hungarian Economic Texts
- 1 Introduction
- 2 Related Work
- 3 Semantic Frames and Semantic Roles
- 4 Corpus, Programs
- 4.1 The Syntactic Parse Tree
- 5 Classification
- 5.1 Feature Set
- 5.2 Using the Probabilities of the Base Features
- 5.3 Reducing the Number of Feature-occurrences
- 5.4 Grouping Target Words
- 5.5 Baseline Methods
- 5.6 Statistical Data
- 6 Results
- 6.1 Results for Baseline Methods
- 6.2 Results for Grouping the Target Words
- 6.3 Results for Ablation Analysis
- 6.4 Results for the News from Stock Markets Domain
- 6.5 The Comparison of the Results with the Related Works
- 7 Discussion, Conclusions
- References
- Significance of Unvoiced Segments and Fundamental Frequency in Infant Cry Analysis
- 1 Introduction
- 2 Teager Energy Operator (TEO) Based F0 Estimation
- 3 Experimental Results
- 4 Discussion
- 5 Summary and Conclusions
- References
- Speech Corpus Preparation for Voice Banking of Laryngectomised Patients
- 1 Introduction
- 2 Building of the Speech Corpus
- 2.1 Greedy Algorithm for Sentences Selection
- 2.2 Optimal Sentence Length Preselection
- 2.3 Step-by-Step Sentence Selection Procedure
- 2.4 Text Corpus Statistics
- 2.5 Recording
- 3 First Observations
- 4 Conclusions
- References
- An Open Source Speech Synthesis Frontend for HTS
- 1 Introduction
- 2 Voice Model ``Leo''
- 3 Framework Architecture
- 3.1 Manager Module and API
- 3.2 Frontend Interfaces
- 3.3 Text Analysis Modules
- 3.4 Synthesis Modules
- 4 Adding New Languages
- 4.1 Gathering Data
- 4.2 Integration
- 4.3 Training Voice Models
- 5 Conclusion
- References
- Tibetan Linguistic Terminology on the Base of the Tibetan Traditional Grammar Treatises Corpus
- 1 Introduction
- 2 Modern Corpus Linguistics of the Tibetan Language
- 3 Corpus Structure
- 4 Corpus Annotation
- 5 Corpus Search and Usage
- 6 Lexical Database of the Tibetan Grammatical Treatises Corpus
- 6.1 Special Tagging of Grammatical Terminology
- 6.2 Tags for Grammatical Terminology
- 6.3 Structure of Lexical Database
- 6.4 User Interface
- 7 Future Works
- 8 Conclusion
- References
- Improving Multi-label Document Classification of Czech News Articles
- 1 Introduction
- 2 Current System and Baseline Classifier
- 3 Training and Testing Data
- 4 Evaluation Metric
- 5 Vector Space Models
- 5.1 Performance on Our Data
- 6 Thresholding Strategy
- 6.1 TopN
- 6.2 Threshold Selection
- 7 Conclusion and Future Work
- References
- Score Normalization Methods for Relevant Documents Selection for Blind Relevance Feedback in Speech Information Retrieval
- 1 Introduction
- 2 Information Retrieval System
- 2.1 Query Likelihood Model
- 2.2 Blind Relevance Feedback
- 3 Score Normalization for Relevant Documents Selection
- 3.1 Score Normalization Methods
- 4 Experiments
- 4.1 Information Retrieval Collection
- 4.2 Evaluation Metrics
- 4.3 Results
- 5 Conclusions
- References
- Imbalanced Text Categorization Based on Positive and Negative Term Weighting Approach
- 1 Introduction
- 2 Term Weighting Approach
- 3 Proposed Positive and Negative Based Term Weighting Scheme
- 4 Empirical Observation of Term Weighting and Feature Selection Approaches
- 5 Experiments
- 6 Conclusion
- References
- CloudASR: Platform and Service
- 1 Introduction
- 2 CloudASR Platform
- 2.1 Platform Scalability
- 2.2 Platform High Availability
- 2.3 Platform Maintenance and Customizability
- 2.4 Annotation Interface
- 3 Evaluation
- 3.1 Real Time Factor and Word Error Rate
- 3.2 Online vs. Batch Mode Latency
- 3.3 Parallel Requests Benchmark
- 4 CloudASR Web-Service
- 5 Conclusion and Future Work
- References
- Experimental Tagging of the ORAL Series Corpora: Insights on Using a Stochastic Tagger
- 1 Introduction
- 1.1 Speech Transcripts vs. Written-Text-Based NLP Tools
- 1.2 ORAL Series Corpora
- 2 Method
- 3 Results and Discussion
- 3.1 Morphological Dictionary Modifications
- 3.2 Training Set Modifications
- 3.3 Linguistic Concerns
- 4 Conclusion
- References
- Development and Evaluation of the Emotional Slovenian Speech Database - EmoLUKS
- 1 Introduction
- 2 Emotional Speech Database - EmoLUKS
- 2.1 Database Description and Preparation
- 2.2 Defining the Emotional States
- 2.3 Database Annotation Through Crowd-Sourcing
- 2.4 Analysis of the Annotated EmoLUKS Database
- 3 Baseline Emotion Recognition Experiments
- 4 Conclusion and Feature Work
- References
- A Semi-automatic Adjective Mapping Between plWordNet and Princeton WordNet
- 1 Introduction
- 2 Adjective Relation Structure in plWordNet and Princeton WordNet
- 3 Automatic Prompt Algorithms
- 4 Results
- 5 Conclusions
- References
- Toward Exploring the Role of Disfluencies from an Acoustic Point of View: A New Aspect of (Dis)continuous Speech Prosody Modelling
- 1 Introduction
- 2 Material and Methods
- 2.1 Speech Databases
- 2.2 Automatic Segmentation for Phonological Phrases
- 3 Overall vs Partial Interpolation of F0
- 3.1 Post Processing Alternatives for F0
- 3.2 Results
- 4 Conclusions
- References
- Heuristic Algorithm for Zero Subject Detection in Polish
- 1 Introduction
- 2 Related Work
- 3 Problem Definition
- 3.1 Functional Verb Classification
- 3.2 Definition of Verb and Noun
- 4 Data
- 5 Algorithm
- 6 Results
- 7 Conclusions and Future Work
- References
- Vocal Tract Length Normalization Features for Audio Search
- 1 Introduction
- 2 Relation to Prior Work
- 3 Vocal Tract Length Normalization (VTLN)
- 3.1 Scale Transform Cepstral Coefficients (STCC)
- 3.2 Warped Linear Prediction Cepstral Coefficients (WLPCC)
- 4 Experimental Results
- 4.1 Database Used
- 4.2 Architecture of Audio Matching System
- 4.3 Results and Discussion
- 4.4 Evaluation of Class Separability
- 5 Summary and Conclusions
- References
- RENA: A Named Entity Recognition System for Arabic
- 1 Introduction
- 2 Background
- 2.1 The Arabic Language
- 2.2 Challenges in Arabic Named Entity Recognition
- 3 Related Work
- 4 Approach
- 4.1 CRF and MIRA
- 4.2 Features
- 5 Experimental Setup
- 5.1 Datasets
- 5.2 Tools
- 5.3 Evaluation Metrics
- 6 Experimental Results
- 7 Conclusion and Future Work
- References
- Combining Evidences from Mel Cepstral and Cochlear Cepstral Features for Speaker Recognition Using Whispered Speech
- 1 Introduction
- 2 System Features
- 2.1 Cochlear Frequency Cepstral Coefficients (CFCC)
- 3 Experiment Results
- 3.1 Speech Database
- 3.2 Speaker Recognition by Listening Test of Whispered Speech
- 3.3 Speaker Recognition Using Spectral Features
- 4 Summary and Conclusions
- References
- Random Indexing Explained with High Probability
- 1 Introduction
- 2 Random Projections in Euclidean Spaces
- 3 The Significance of the Proposed Mathematical Justification
- 4 Experimental Results
- 5 Discussion
- References
- Hungarian Grammar Writing with LTAG and XMG
- 1 Core Data: Hungarian Sentence Articulation
- 1.1 The Preverbal Field
- 1.2 Verbal Modifiers
- 2 LTAG and XMG in a Nutshell
- 3 Hungarian: Simple Sentences with XMG
- 3.1 The Nucleus and the Postverbal Field
- 3.2 Positions in the Preverbal Field
- 3.3 Basic Sentence Structures
- 4 Summary and Further Work
- 4.1 Verbal Field(s) in Complex Sentences
- References
- Phonetic Segmentation Using KALDI and Reduced Pronunciation Detection in Causal Czech Speech
- 1 Introduction
- 2 Automatic Phonetic Segmentation
- 2.1 KALDI-Based Segmentation
- 2.2 Labtool - Command Line & Praat Version
- 2.3 Labtool Under MS Windows
- 3 Casual Speech Analysis
- 3.1 Phonetic Segmentation of Casual Speech
- 3.2 Detection of Reduced Pronunciation in Casual Speech
- 4 Experimental Part
- 5 Conclusions
- References
- Automatic Robust Rule-Based Phonetization of Standard Arabic
- 1 Introduction
- 2 Pre-transcription
- 3 Arabic Letters to Sound Rules
- 3.1 Rules at the Phonemic Level
- 3.2 Discussion and Proposed Modifications at the Phonemic Level
- 3.3 Rules at the Phonetic Level
- 3.4 Discussion of the Phonetic-Level Rules
- 3.5 Phonetic Level Encoding
- 4 Syllabication
- 5 Training and Testing Sets
- 6 Evaluation
- 7 Conclusion
- References
- Knowledge-Based and Data-Driven Approaches for Georeferencing of Informal Documents
- 1 Introduction
- 2 Related Work
- 3 Geographical Knowledge-Based Heuristics for Georeferencing
- 4 Information Retrieval with Re-ranking for Georeferencing
- 5 GeoFusion: Knowledge-Based and Data-Driven Georeferencing
- 6 Evaluation
- 7 Conclusions
- References
- Automated Mining of Relevant N-grams in Relation to Predominant Topics of Text Documents
- 1 Introduction
- 2 Data and Its Preprocessing
- 3 Searching for N-grams
- 4 Overview of Results and Discussion
- 5 Conclusion
- References
- Incremental Dependency Parsing and Disfluency Detection in Spoken Learner English
- 1 Introduction
- 2 Transition-Based Dependency Parsing
- 3 Experiments
- 3.1 Datasets
- 3.2 Procedure
- 4 Results
- 5 Discussion
- References
- Open Source German Distant Speech Recognition: Corpus and Acoustic Model
- 1 Introduction
- 1.1 Related Work
- 2 Corpus
- 3 Experiments
- 3.1 Phoneme Dictionary
- 3.2 Language Model
- 3.3 CMU Sphinx Acoustic Model
- 3.4 Kaldi Acoustic Models
- 4 Evaluation
- 5 Conclusion and Future Work
- References
- First Steps in Czech Entity Linking
- 1 Introduction
- 2 Related Work
- 3 Corpus Creation
- 4 Similarity Metrics
- 5 Proposed Combination
- 6 Experiments
- 7 Conclusion and Future Work
- References
- Classification of Prosodic Phrases by Using HMMs
- 1 Introduction
- 2 Prosody Model and Prosodemes
- 3 Proposed Approach
- 3.1 Training Stage
- 4 Evaluation and Results
- 4.1 Evaluation of Classification Results
- 4.2 Evaluation of Unit Selection Synthesis
- 5 Conclusion
- 5.1 Future Work
- References
- Adding Multilingual Terminological Resources to Parallel Corpora for Statistical Machine Translation Deteriorates System Performance: A Negative Result from Experiments in the Biomedical Domain
- 1 Introduction
- 2 Experimental Set-Up
- 2.1 Parallel Corpora and Biomedical Terminologies
- 2.2 Configurations of MT Systems
- 3 Evaluation
- 3.1 Automatic Evaluation---System Comparison
- 3.2 Manual Analysis
- 3.3 Out-of-Vocabulary Analysis
- 4 Conclusions
- References
- Derivancze --- Derivational Analyzer of Czech
- 1 Introduction
- 2 Motivation
- 3 Related Work
- 4 Design of Derivancze: in Constrast with DeriNet
- 4.1 Semantically Labelled Relations Instead of Purely Derivational Relations
- 4.2 More than One Base Word and Semantic Equivalence
- 4.3 Overgeneration Followed by Filtering through Language Corpora
- 5 Results
- 6 Conclusions and Future Work
- References
- Detection of Large Segmentation Errors with Score Predictive Model
- 1 Introduction
- 2 Data and Score Predictive Model
- 3 Detection Method
- 4 Experiments and Results
- 4.1 The Most Suitable Statistic Value
- 4.2 Reducing Number of False Detections
- 4.3 Specific Boundaries
- 5 Conclusion
- References
- Identification of Noun-Noun Compounds in the Context of Speech-to-Speech Translation
- 1 Introduction
- 2 Identification of Noun-Noun Compounds
- 2.1 Linguistic Tests for Compound Identification
- 2.2 Lexical Database
- 3 ASR Output Specifities
- 4 Implementation
- 5 Experiments and Related Work
- References
- From Spoken Language to Ontology-Driven Dialogue Management
- 1 Introduction
- 2 Related Work
- 3 Natural Language Processing Engine
- 4 Sentence Semantic Representation
- 4.1 Utterance Semantic Representation in Shallow and Deep MMIL
- 5 Paraphrase Generation
- 5.1 ANTI LF Paraphrase Rule
- 6 Domain Ontology
- 6.1 Lexical Information in the Domain Ontology
- 7 Extracting Facts for the Dialogue Manager from Deep MMIL
- 7.1 Types of Facts in the Knowledge Base of the Dialogue Manager
- 7.2 Fact Extraction Algorithm
- 8 Dialogue Manager
- 8.1 Dialogue Move Engine
- 8.2 Domain Binding
- 8.3 Problem Solving
- 9 Evaluation, Conclusion and Future Work
- References
- Entity-Oriented Sentiment Analysis of Tweets: Results and Problems
- 1 Introduction
- 2 Related Work
- 3 Twitter Entity-Oriented Task at SentiRuEval
- 3.1 Data Collections
- 3.2 Data Annotation and Measures
- 3.3 Participants and Results
- 4 Analysis of Participants' Results in Two Domains
- 4.1 Explaining the Difference in the Perfomance in Two Domains
- 4.2 Analyzing Difficult Tweets
- 4.3 Understanding If Systems Were Really Entity-Oriented
- 5 Conclusion
- References
- Improved Estimation of Articulatory Features Based on Acoustic Features with Temporal Context
- 1 Introduction
- 2 AF Estimation
- 2.1 AF Classes for Czech
- 2.2 Acoustic Features for AF Estimation
- 2.3 MLP-Based AF Classification
- 3 Experiments and Discussion
- 3.1 Experimental Setup
- 3.2 Results
- 4 Conclusions
- References
- Do Important Words in Bag-of-Words Model of Text Relatedness Help?
- 1 Introduction
- 2 Related Work
- 3 Finding Important Words Using Word Relatedness
- 4 Methods Used
- 4.1 Latent Semantic Analysis
- 4.2 The Google Trigram Model
- 5 Evaluation Datasets and Results
- 5.1 ABC1225
- 5.2 OnWN2012
- 5.3 SMTeuroparl2012
- 5.4 SMT2013
- 5.5 HDL2013
- 6 Conclusion
- References
- Increased Recall in Annotation Variance Detection in Treebanks
- 1 Introduction
- 2 Alternative Approach
- 3 Generalization of Variants
- 3.1 Some Minor Changes
- 4 Experiments and Discussion
- 4.1 Effects of the Minor Changes
- 4.2 Effects of Generalization
- 5 Related Work
- 6 Final Remarks
- References
- Using Sociolinguistic Inspired Features for Gender Classification of Web Authors
- 1 Introduction
- 2 Related Work
- 3 Gender Classification Methodology
- 4 Experimental Setup and Results
- 5 Conclusions
- References
- Modified Group Delay Based Features for Asthma and HIE Infant Cries Classification
- 1 Introduction
- 2 Modified Group Delay Function
- 3 Feature Extraction
- 4 Support Vector Machine (SVM) Classifier
- 5 Experimental Results
- 5.1 Database Used
- 5.2 Experimental Results
- 6 Summary and Conclusions
- References
- Self-Enrichment of Normalized LMF Dictionaries Through Syntactic-Behaviors-to-Meanings Links
- 1 Introduction
- 2 Related Works
- 3 Proposed Approach
- 3.1 Basis Concepts
- 3.2 Steps of the Approach
- 4 Experiment and Results
- 4.1 Experiment
- 4.2 Results
- 5 Conclusion and Perspectives
- References
- Author Index
System requirements
File format: PDF
Copy protection: Watermark-DRM (Digital Rights Management)
System requirements:
- Computer (Windows; MacOS X; Linux): Use the free software Adobe Reader, Adobe Digital Editions, or any other PDF viewer of your choice (see eBook Help).
- Tablet/Smartphone (Android; iOS): Install the free app Adobe Digital Editions or another reading app for eBooks, e.g., PocketBook (see eBook Help).
- E-reader: Bookeen, Kobo, Pocketbook, Sony, Tolino and many more (only limited: Kindle).
The file format PDF always displays a book page identically on any hardware. This makes PDF suitable for complex layouts such as those used in textbooks and reference books (images, tables, columns, footnotes). Unfortunately, on the small screens of e-readers or smartphones, PDFs are rather annoying, requiring too much scrolling.
This eBook uses Watermark-DRM, a „soft” copy protection. This means that there are no technical restrictions to prevent illegal distribution. However, there is a personalised watermark embedded in the eBook that can be used to identify the purchaser of the eBook in the event of misuse and to provide evidence for legal purposes.
For more information, see our eBook Help page.