In recent years, online social networking has revolutionized interpersonal communication. The newer research on language analysis in social media has been increasingly focusing on the latter's impact on our daily lives, both on a personal and a professional level. Natural language processing (NLP) is one of the most promising avenues for social media data processing. It is a scientific challenge to develop powerful methods and algorithms which extract relevant information from a large volume of data coming from multiple sources and languages in various formats or in free form. We discuss the challenges in analyzing social media texts in contrast with traditional documents. Research methods in information extraction, automatic categorization and clustering, automatic summarization and indexing, and statistical machine translation need to be adapted to a new kind of data. This book reviews the current research on NLP tools and methods for processing the non-traditional information from social media data that is available in large amounts (big data), and shows how innovative NLP approaches can integrate appropriate linguistic information in various fields such as social media monitoring, healthcare, business intelligence, industry, marketing, and security and defence. We review the existing evaluation metrics for NLP and social media applications, and the new efforts in evaluation campaigns or shared tasks on new datasets collected from social media. Such tasks are organized by the Association for Computational Linguistics (such as SemEval tasks) or by the National Institute of Standards and Technology via the Text REtrieval Conference (TREC) and the Text Analysis Conference (TAC). In the concluding chapter, we discuss the importance of this dynamic discipline and its great potential for NLP in the coming decade, in the context of changes in mobile technology, cloud computing, virtual reality, and social networking. In this second edition, we have added information about recent progress in the tasks and applications presented in the first edition. We discuss new methods and their results. The number of research projects and publications that use social media data is constantly increasing due to continuously growing amounts of social media data and the need to automatically process them. We have added 85 new references to the more than 300 references from the first edition. Besides updating each section, we have added a new application (digital marketing) to the section on media monitoring and we have augmented the section on healthcare applications with an extended discussion of recent research on detecting signs of mental illness from social media.
Dr. Atefeh (Anna) Farzindar is a research associate of the Data Science Institute (DSI) and a faculty member of the Department of Computer Science, Viterbi School of Engineering at the University of Southern California (USC). She received her Ph.D. in Computer Science from the University of Montreal and her Doctorate in automatic summarization of legal documents from Paris-Sorbonne University in 2005.
Dr. Farzindar was founder and CEO of NLP Technologies Inc. specializing in natural language processing, summarization of legal decisions, machine translation, and social media analysis. She has served as Industry Chair of the Canadian Artificial Intelligence Association (2013-2015); General Co-chair of the 2013 AI/GI/CRV Conference, Regina, Canada; General Chair of the 2014 AI/GI/CRV Conference, Montréal, Canada; Program Committee co-chair for the 23rd Canadian Conference on Artificial Intelligence (AI 2010), Ottawa, Canada; Chair of the technology sector of the Language Industry Association Canada (AILIA 2009-2013); Vice President of The Language Technologies Research Centre (LTRC) of Canada (2012-2014); member of the Natural Sciences and Engineering Research Council of Canada (NSERC) Computer Science Liaison Committee (since 2014); and member of the Canadian Advisory Committee of International Organization for Standardization (ISO). She was an Adjunct Professor at the University of Montréal, Canada (2009-2015), Lecturer at Polytechnique Montréal, engineering school (2012-2014), and Visiting Professor and Honorary Research Fellow at the Research Group in Computational Linguistics at the University of Wolverhampton, UK (2010-2012).
Dr. Farzindar won Femmessor-Montréal's contest, "Succeeding with a balanced lifestyle," in the Innovative Technology and Information and Communications Technology category because of her involvement in the arts. Her paintings were published in the book One Thousand and One Nights in 2014. She has published more than 50 conference and journal papers, authored 3 books, and contributed the chapter "Social Network Integration in Document Summarization" to the book Innovative Document Summarization Techniques: Revolutionizing Knowledge Understanding, published by IGI Global.
Table of Contents: Preface to the Second Edition / Acknowledgments / Introduction to Social Media Analysis / Linguistic Pre-processing of Social Media Texts / Semantic Analysis of Social Media Texts / Applications of Social Media Text Analysis / Data Collection, Annotation, and Evaluation / Conclusion and Perspectives / Glossary / Bibliography / Authors' Biographies / Index