Provenance in Data Science

From Data Models to Context-Aware Knowledge Graphs
 
 
Springer (Verlag)
  • erschienen am 17. April 2021
  • |
  • 110 Seiten
 
E-Book | PDF mit Wasserzeichen-DRM | Systemvoraussetzungen
978-3-030-67681-0 (ISBN)
 
RDF-based knowledge graphs require additional formalisms to be fully context-aware, which is presented in this book. This book also provides a collection of provenance techniques and state-of-the-art metadata-enhanced, provenance-aware, knowledge graph-based representations across multiple application domains, in order to demonstrate how to combine graph-based data models and provenance representations. This is important to make statements authoritative, verifiable, and reproducible, such as in biomedical, pharmaceutical, and cybersecurity applications, where the data source and generator can be just as important as the data itself. Capturing provenance is critical to ensure sound experimental results and rigorously designed research studies for patient and drug safety, pathology reports, and medical evidence generation. Similarly, provenance is needed for cyberthreat intelligence dashboards and attack maps that aggregate and/or fuse heterogeneous data from disparate data sources to differentiate between unimportant online events and dangerous cyberattacks, which is demonstrated in this book. Without provenance, data reliability and trustworthiness might be limited, causing data reuse, trust, reproducibility and accountability issues.
This book primarily targets researchers who utilize knowledge graphs in their methods and approaches (this includes researchers from a variety of domains, such as cybersecurity, eHealth, data science, Semantic Web, etc.). This book collects core facts for the state of the art in provenance approaches and techniques, complemented by a critical review of existing approaches. New research directions are also provided that combine data science and knowledge graphs, for an increasingly important research topic.
weitere Ausgaben werden ermittelt
Dr. Leslie F. Sikos is a computer scientist specializing in artificial intelligence and data science, with a focus on cybersecurity applications. He holds two Ph.D. degrees and 20+ industry certificates. He is an active member of the research community as an author, editor, reviewer, conference organizer, and speaker, and a member of industry-leading organizations, such as the ACM and the IEEE. He contributed to international standards and developed state-of-the-art AI systems. Dr. Sikos published more than 20 books, including textbooks, monographs, and edited volumes.
Dr. Oshani W. Seneviratne is the Director of Health Data Research at the Institute for Data Exploration and Applications at the Rensselaer Polytechnic Institute (Rensselaer IDEA). She obtained her Ph.D. in Computer Science from Massachusetts Institute of Technology in 2014 under the supervision of Sir Tim Berners-Lee, the inventor of the World Wide Web. During her Ph.D., Oshani researched Accountable Systems for the Web. She invented a novel web protocol called HTTPA (HyperText Transfer Protocol with Accountability), and a novel provenance tracking mechanism called the Provenance Tracking Network. This work was demonstrated to be effective in several domains including electronic health care records transfer, and intellectual property protection in Web-based decentralized systems. At Rensselaer IDEA, Oshani leads the Smart Contracts Augmented with Analytics Learning and Semantics (SCALeS) project. The goal of this project is to predict, detect, and fix initially unforeseen situations in smart contracts utilizing novel combinations of machine learning, program analysis, and semantic technologies. Oshani is also involved in the Health Empowerment by Analytics, Learning, and Semantics (HEALS) Project. In HEALS she oversees the research operations targeted at the characterization and analysis of computational medical guidelines for chronic diseases such as diabetes, and the modeling of guideline provenance. Before Rensselaer, Oshani worked at Oracle specializing in distributed systems, provenance and healthcare-related research. She is the co-inventor of two enterprise provenance patents.
Prof. Deborah L. McGuinness is the Tetherless World Senior Constellation Chair and Professor of Computer, Cognitive, and Web Sciences at RPI. She is also the founding director of the Web Science Research Center and the CEO of McGuinness Associates Consulting. Deborah has been recognized with awards as a fellow of the American Association for the Advancement of Science (AAAS) for contributions to the Semantic Web, knowledge representation, and reasoning environments and as the recipient of the Robert Engelmore Award from the Association for the Advancement of Artificial Intelligence (AAAI) for leadership in Semantic Web research and in bridging Artificial Intelligence (AI) and eScience, significant contributions to deployed AI applications, and extensive service to the AI community. Deborah leads a number of large diverse data intensive resource efforts and her team is creating next-generation ontology-enabled research infrastructure for work in large interdisciplinary settings. Prior to joining RPI, Deborah was the acting director of the Knowledge Systems, Artificial Intelligence Laboratory and Senior Research Scientist in the Computer Science Department of Stanford University, and previous to that she was at AT\&T Bell Laboratories. Deborah consults with numerous large corporations as well as emerging startup companies wishing to plan, develop, deploy, and maintain semantic web and/or AI applications. Some areas of recent work include data science, next generation health advisors, ontology design and evolution environments, semantically enabled virtual observatories, semantic integration of scientific data, context-aware mobile applications, search, eCommerce, configuration, and supply chain management. Deborah holds a Bachelor of Math and Computer Science from Duke University, a Master of Computer Science from University of California at Berkeley, and a Ph.D. in Computer Science from Rutgers University.
The Evolution of Context-Aware RDF Knowledge Graphs.- Data Provenance and Accountability on the Web.- The Right (Provenance) Hammer for the Job: a Comparison of Data Provenance Instrumentation.- Contextualized Knowledge Graphs in Communication Network and Cyber-Physical System Modeling.- ProvCaRe: A Large-Scale Semantic Provenance Resource for Scientific Reproducibility.- Graph-Based Natural Language Processing for the Pharmaceutical Industry.
DNB DDC Sachgruppen

Dateiformat: PDF
Kopierschutz: Wasserzeichen-DRM (Digital Rights Management)

Systemvoraussetzungen:

Computer (Windows; MacOS X; Linux): Verwenden Sie zum Lesen die kostenlose Software Adobe Reader, Adobe Digital Editions oder einen anderen PDF-Viewer Ihrer Wahl (siehe E-Book Hilfe).

Tablet/Smartphone (Android; iOS): Installieren Sie die kostenlose App Adobe Digital Editions oder eine andere Lese-App für E-Books (siehe E-Book Hilfe).

E-Book-Reader: Bookeen, Kobo, Pocketbook, Sony, Tolino u.v.a.m. (nur bedingt: Kindle)

Das Dateiformat PDF zeigt auf jeder Hardware eine Buchseite stets identisch an. Daher ist eine PDF auch für ein komplexes Layout geeignet, wie es bei Lehr- und Fachbüchern verwendet wird (Bilder, Tabellen, Spalten, Fußnoten). Bei kleinen Displays von E-Readern oder Smartphones sind PDF leider eher nervig, weil zu viel Scrollen notwendig ist. Mit Wasserzeichen-DRM wird hier ein "weicher" Kopierschutz verwendet. Daher ist technisch zwar alles möglich - sogar eine unzulässige Weitergabe. Aber an sichtbaren und unsichtbaren Stellen wird der Käufer des E-Books als Wasserzeichen hinterlegt, sodass im Falle eines Missbrauchs die Spur zurückverfolgt werden kann.

Weitere Informationen finden Sie in unserer E-Book Hilfe.


Download (sofort verfügbar)

106,99 €
inkl. 7% MwSt.
Download / Einzel-Lizenz
PDF mit Wasserzeichen-DRM
siehe Systemvoraussetzungen
E-Book bestellen