Demonstrate your Data Science skills by earning the brand-new CompTIA DataX credential
In CompTIA DataX Study Guide: Exam DY0-001, data scientist and analytics professor Fred Nwanganga delivers a practical, hands-on guide to establishing your credentials as a data science practitioner and succeeding on the CompTIA DataX certification exam. In this book, you'll explore all the domains covered by the new credential, which include key concepts in mathematics and statistics; techniques for modeling, analysis, and evaluating outcomes; foundations of machine learning; data science operations and processes; and specialized applications of data science.
This up-to-date Study Guide walks you through the new, advanced-level data science certification offered by CompTIA and includes hundreds of practice questions and electronic flashcards that help you retain the knowledge you need to succeed on the exam and in your next (or current) professional data science role. You'll find:
Perfect for aspiring and current data science professionals, CompTIA DataX Study Guide is a must-have resource for anyone preparing for the DataX certification exam (DY0-001) and seeking a better, more reliable, and faster way to succeed on the test.
ABOUT THE AUTHOR
FRED NWANGANGA is a technology professional and professor in the IT, Analytics, and Operations Department within the University of Notre Dame - Mendoza College of Business. He teaches undergraduate and graduate courses in Python for Data Analytics, Machine Learning, and Unstructured Data Analytics. He has over 20 years of experience in technology management and analytics. He is the author of several LinkedIn Learning machine learning courses and the founder of the Early Bridges to Data Science Program in the Notre Dame Lucy Family Institute for Data & Society.
Introduction
Chapter 1 What Is Data Science?
Chapter 2 Mathematics and Statistical Methods
Chapter 3 Data Collection and Storage
Chapter 4 Data Exploration and Analysis
Chapter 5 Data Processing and Preparation
Chapter 6 Modeling and Evaluation
Chapter 7 Model Validation and Deployment
Chapter 8 Unsupervised Machine Learning
Chapter 9 Supervised Machine Learning
Chapter 10 Neural Networks and Deep Learning
Chapter 11 Natural Language Processing
Chapter 12 Specialized Applications of Data Science
Appendix Answers to Review Questions
Chapter 1: What Is Data Science?
Chapter 2: Mathematics and Statistical Methods
Chapter 3: Data Collection and Storage
Chapter 4: Data Exploration and Analysis
Chapter 5: Data Processing and Preparation
Chapter 6: Modeling and Evaluation
Chapter 7: Model Validation and Deployment
Chapter 8: Unsupervised Machine Learning
Chapter 9: Supervised Machine Learning
Chapter 10: Neural Networks and Deep Learning
Chapter 11: Natural Language Processing
Chapter 12: Specialized Applications of Data Science
Index
Congratulations on taking the initial step toward achieving your CompTIA DataX certification. The DataX certification, as described by CompTIA, is "the premier skills development program for highly experienced professionals seeking to validate their competency in the rapidly evolving field of data science." This study guide is tailored for data scientists who are in the early to mid-stages of their careers. It is designed to serve as a refresher for some and a source of new insights for others. No matter your level of expertise, this guide aims to solidify your understanding of essential data science tools and concepts necessary to effectively prepare for and pass the DataX certification exam.
In the following pages, you will find essential information about the CompTIA DataX exam, details on the organization and scope of this book, and a sample assessment test. This test is intended to help gauge your initial readiness for the certification exam. The answer key for the assessment questions references which chapter within the book addresses the concepts or exam objective behind the question. I encourage you to concentrate your study efforts on those chapters that cover areas where you feel you need to build your skills and confidence.
The DataX certification is designed to be a vendor-neutral validation of expert-level data science skills. CompTIA recommends the certification for professionals with 5+ years of experience in data science or similar roles. You can find additional information about the certification at:
www.comptia.org/certifications/datax
According to CompTIA, the certification is designed to assess a candidate's ability to:
CompTIA goes to great lengths to ensure that its certifications accurately reflect industry best practices. It works with a team of professionals, training providers, publishers, and subject matter experts (SMEs) to establish baseline competency for each of its exams. Based on this information, CompTIA has published five major domains that the DataX certification exam covers. The following is a list of the domains and the extent to which they are represented on the certification exam:
The DataX exam employs what CompTIA refers to as a "performance-based assessment" format. This approach integrates traditional multiple-choice questions with a variety of interactive question types, including fill-in-the-blank, multiple-response, drag-and-drop, and image-based problems, to create a more dynamic and comprehensive evaluation of a candidate's abilities. For more details about CompTIA's performance exams, visit:
www.comptia.org/testing/testing-options/about-comptia-performance-exams
The exam consists of 90 questions and has a time limit of 165 minutes. The results are provided in a pass/fail format. As you prepare, keep in mind two important aspects regarding the nature of the questions you will encounter.
First, CompTIA exams are known for their occasionally ambiguous questions. You may find yourself faced with multiple answers that seem correct, requiring you to choose the "most correct" one based on your knowledge and sometimes intuition. It's important not to spend too much time on these questions. Make your best choice, and then move on to the next question.
Second, be aware that CompTIA often includes unscored questions in its exams to collect psychometric data, a process known as item seeding. These questions are used to help develop future versions of the exam. Although these questions won't affect your score, you may not be able to distinguish them from scored questions, so you should attempt to answer every question as accurately as possible. Before starting the exam, you'll be informed about the possibility of encountering unscored questions. If you come across a question that doesn't seem related to any of the stated exam objectives, it might be one of these seeded questions, but since you can't be sure, it's best to treat every question as if it counts toward your final score.
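The 90-question, 165-minute format described above implies a rough time budget per question, which can help you judge when to make your best choice and move on. The following is a minimal sketch of that arithmetic. The function names are hypothetical, and the even-pace assumption is a simplification, since interactive question types may take longer than multiple-choice ones.

```python
# Hypothetical pacing helper. Only the 90-question and 165-minute figures
# come from the exam description; everything else is illustrative.

TOTAL_QUESTIONS = 90
TIME_LIMIT_MINUTES = 165


def seconds_per_question(questions: int, minutes: float) -> float:
    """Average time available per question, in seconds."""
    if questions <= 0:
        raise ValueError("questions must be positive")
    return minutes * 60 / questions


def is_behind_pace(answered: int, minutes_elapsed: float) -> bool:
    """True if more time has elapsed than an even pace allows
    for the number of questions answered so far."""
    budget = seconds_per_question(TOTAL_QUESTIONS, TIME_LIMIT_MINUTES)
    return minutes_elapsed * 60 > answered * budget


if __name__ == "__main__":
    pace = seconds_per_question(TOTAL_QUESTIONS, TIME_LIMIT_MINUTES)
    print(f"Average budget: {pace:.0f} seconds per question")
    # Example check: 30 questions answered after 60 minutes.
    print("Behind pace:", is_behind_pace(answered=30, minutes_elapsed=60))
```

At an even pace this works out to a little under two minutes per question, so a question that has already consumed several minutes is a reasonable candidate for a best guess.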
Once you are ready to take the exam, visit the CompTIA store (https://store.comptia.org) to purchase a voucher for the exam. This book also includes a coupon that you may use to save 10 percent on the exam registration. CompTIA offers two options for taking the certification exam. You can either take the exam in person at a Pearson VUE testing center or online. The online exam involves a remote exam proctoring service powered by Pearson OnVUE.
You can find more information about CompTIA testing options at www.comptia.org/testing/testing-options/about-testing-options.
This study guide covers everything you need to prepare for and pass the DataX exam. Each chapter includes several recurring elements to help you prepare. Here's a description of some of those elements:
The chapters in this book are structured to facilitate a smooth flow and deepen your understanding of key concepts. They are not necessarily arranged in alignment with the sequence or structure of the certification exam objectives. To assist you in your exam preparation, the following is a high-level map that shows how the exam objectives correspond to the chapters in this study guide. This mapping will help you navigate the material more effectively and ensure that you cover all necessary topics as you prepare for the exam.
File format: ePUB
Copy protection: Adobe DRM (Digital Rights Management)
System requirements:
The ePUB file format is well suited to novels and non-fiction, that is, to "flowing" text without a complex layout. On e-readers and smartphones, line and page breaks adapt automatically to the small display. Adobe DRM is a "hard" form of copy protection. If the necessary requirements are not met, you will not be able to open the e-book, so you must prepare your reading hardware before downloading. Please note: we strongly recommend authorizing the reading software with your personal Adobe ID after installing it.