Schweitzer Fachinformationen
Wenn es um professionelles Wissen geht, ist Schweitzer Fachinformationen wegweisend. Kunden aus Recht und Beratung sowie Unternehmen, öffentliche Verwaltungen und Bibliotheken erhalten komplette Lösungen zum Beschaffen, Verwalten und Nutzen von digitalen und gedruckten Medien.
The human mind is endowed with a remarkable capacity for creative synthesis between intuition and reason; this mental alchemy is the source of genius. A new synergy is emerging between human ingenuity and the computational capacity of generative AI models.
Automated Data Analytics focuses on this fruitful collaboration between the two to unlock the full potential of data analysis. Together, human ethics and algorithmic productivity have created an alloy stronger than the sum of its parts. The future belongs to this symbiosis between heart and mind, human and machine. If we succeed in harmoniously combining our strengths, it will only be a matter of time before we discover new analytical horizons.
This book sets out the foundations of this promising partnership, in which everyone makes their contribution to a common work of considerable scope. History is being forged before our very eyes. It is our responsibility to write it wisely, and to collectively pursue the ideal of augmented intelligence progress.
Soraya Sedkaoui is a professor at the University of Khemis Miliana, Algeria. She is also a data analyst and strategic consultant. Her research interests include Big Data and the development of algorithms and models for business applications.
Preface ix
Introduction xv
Chapter 1 Artificial Intelligence (AI) and Automated Data Analytics 1
1.1 The emergence of automated data analytics and the potential of generative AI 2
1.1.1 The power unleashed by generative AI 2
1.1.2 Transforming the data analytics process 2
1.1.3 Redefining coding with the intelligent agent 3
1.1.4 Human-AI collaboration 5
1.1.5 The power of prompt engineering 5
1.1.6 An ethical North Star 6
1.2 Revolutionizing the data analytics process with ChatGPT 7
1.2.1 The numbers behind ChatGPT's potential 7
1.2.2 ChatGPT and the future of data analytics 9
1.3 Harmony between human creativity and automated analysis: the winning duo 11
1.3.1 The value of human creativity 11
1.3.2 The complementary power of automated analysis 12
1.3.3 Navigating the partnership responsibly 13
1.4 Unlocking the secrets of prompt engineering for powerful results 15
1.4.1 The art of prompting 15
1.4.2 Optimizing prompts for efficient interaction with ChatGPT: fundamental aspects 17
Chapter 2 ChatGPT for Data Analytics 21
2.1 Exploring the ChatGPT universe: history, presentation and capabilities 21
2.1.1 From GPT-1 to GPT-4: generative pre-trained transformers 21
2.1.2 ChatGPT in practice 24
2.2 Powerful features for intelligent data analytics: natural language at your service 27
2.3 ChatGPT versus data scientists: an intelligence battle that looks like an alliance 30
2.3.1 Seamless communication and key features of an effective partnership 30
2.3.2 ChatGPT, between ally and threat 33
2.3.3 Automating data science workflows: unleashing the potential of ChatGPT plugins 35
2.4 Benefits and challenges of integrating ChatGPT into data analytics workflows 37
2.4.1 Unlocking analytical potential and reducing costs 37
Chapter 3 Data Preparation for Analysis with ChatGPT 41
3.1 ChatGPT in charge of preparing our datasets 41
3.1.1 Data cleaning 42
3.1.2 Data transformation 43
3.1.3 Data formatting 43
3.2 Automated cleaning and pre-processing for optimum results with ChatGPT 45
3.3 Handling missing data, outliers and other common data issues 48
3.4 Using ChatGPT for data transformation, feature engineering and beyond 51
Chapter 4 Intuitive Query Creation with ChatGPT 57
4.1 The discovery of patterns, trends and insights through interactive conversations 57
4.1.1 Key benefits 59
4.1.2 Discovering insights organically 60
4.1.3 Democratizing data analytics 60
4.1.4 The future of business intelligence 61
4.2 Creating natural language queries to analyze your data 62
4.3 The art of transforming analysis questions into SQL queries with ChatGPT 65
4.4 Generating efficient and optimized queries: the key to your success with ChatGPT 69
Chapter 5 ChatGPT: The Advanced Analysis Wizard 73
5.1 Exploring new horizons: ChatGPT for exploratory data analysis 73
5.2 Simplifying your analysis: automation tasks for increased efficiency 80
5.3 From statistics to predictions: ChatGPT as the partner of choice 82
5.3.1 Establishing statistical foundations 84
5.3.2 Building models with ChatGPT 84
5.3.3 Model deployment and monitoring models 85
5.3.4 Business impact 86
5.4 Deciphering feelings: text and sentiment analysis with ChatGPT 87
Chapter 6 Prediction and Modeling with ChatGPT 93
6.1 Automating the data analysis process with ChatGPT 94
6.2 ChatGPT for accurate and reboust modeling 96
6.3 Continuous improvement: optimizing model capabilities through feedback loops 100
6.4 Trend and time series analysis 105
Chapter 7 ChatGPT at the Service of Machine Learning 109
7.1 Machine learning in the functional fabric of ChatGPT 110
7.2 Creating new machine learning approaches with ChatGPT 113
7.3 Boosting machine learning algorithms with ChatGPT 116
7.3.1 Optimizing machine learning engineering 117
7.3.2 Hybrid model innovation 117
7.4 Enhancing the potential of machine learning algorithms with ChatGPT 120
Chapter 8 Narrative Fascination: Data-driven Stories and Reports 127
8.1 ChatGPT for generating data storytelling plans 127
8.2 The bewitchment of words: automating for writing data-driven stories 131
8.3 Interactive dashboards and ChatGPT's ingenuity 134
8.4 Humans at the heart of protocols: the imprint of human ingenuity in generative AI 137
Chapter 9 Power within Hands: Ethics, Orientation and Use 143
9.1 Understanding the limits of AI-generated analysis 144
9.2 Ethical harmony: ChatGPT in data analytics workflows 147
9.3 Providing iterative feedback to improve ChatGPT 151
9.4 Addressing ethical concerns and biases when using ChatGPT in data analytics 155
9.5 Ensuring fairness, transparency and accountability in automated data analytics 157
Conclusion 161
Appendix 1 167
Appendix 2 183
Appendix 3 185
References 191
Index 197
Comparing the capacity of computers to the capacity of the human brain, I've often wondered, where does our success come from? The answer is synthesis, the ability to combine creativity and calculation. into a whole that is much greater than the sum of its parts.
How Life Imitates Chess, Garry Kasparov (2007)
Data analytics is a crucial process in today's data-driven world. It involves collecting, cleaning, transforming and analyzing data to uncover useful information, insights, trends and patterns which inform business strategy, decision-making and process optimization. Traditionally, data analytics was a manual process requiring data scientists and analysts to prepare and process data before analyzing them. This was both tedious and time-consuming. The advent of machine learning and artificial intelligence (AI) has transformed data analytics by automating some parts of the process.
Generative AI models, like ChatGPT, are at the forefront of this automation revolution in data analytics. These large language models can understand human prompts and generate coherent and human-like textual responses. They are trained with massive text datasets, enabling them to perform a variety of language-based tasks. Generative models, such as ChatGPT, can be fine-tuned for specific applications, including data analytics.
We can consider these generative AI models as children with unlimited potential. The data scientist's role is to nurture these models, train them and help them grow - just as parents do with their children. In the beginning, these models are like children - they have immense capabilities, but need guidance to realize their potential.
The data scientist trains them step-by-step, teaching them the different tasks, operations and functionalities required for data analytics. This includes data preprocessing, cleaning, feature engineering, modeling, evaluation and explanation. Models are trained thoroughly, across diverse datasets, learning the nuances of each analytical task.
Gradually, just as a toddler learns to walk, talk and eat solid foods, generative models become capable with regard to various data analytics workflows. With each iteration, their skills improve: they learn to handle diverse datasets, manage missing values, transform features, select optimal models, critically assess performance and generate data-driven insights.
After extensive training on a wide range of datasets and analytical tasks, these generative models evolve from toddlers into mature analytical assistants. They move from simple memorization of problem-solving techniques to the development of true conceptual understanding. The models understand why particular data transformations, models and evaluations are appropriate for given scenarios.
In a way, they develop inductive reasoning and deductive logic, just like humans. They understand the principles and evidence-based principles that underlie data analytics workflows, rather than simply memorizing mechanical instructions. This conceptual understanding is what distinguishes generative AI from prior rule-based expert systems.
Thus, when a data scientist prompts a mature, well-trained model like ChatGPT to perform analysis, it deeply understands the request, rather than simply matching keywords. It draws on its conceptual knowledge to analyze the dataset, select optimal techniques, generate insights and explain the reasoning behind them. And it does so at superhuman speeds, leveraging the computing power of AI.
But does that make these generative models smarter than humans? The answer is no. At least not yet. Although they can outperform humans on narrow tasks within their training domain, these AI models lack generalized intelligence. Human cognition crucially remains far more advanced.
Unlike generative models, humans possess common sense, intuition, imagination, social intelligence, sensitivity and generalized reasoning capacities. We can creatively solve new and open problems that intersect several domains. Humans also have better judgment, wisdom and morality, which temper our technical capabilities with ethics and responsibility.
So, while ChatGPT can analyze datasets and quickly generate information, it lacks the general critical thinking skills to deeply understand implications and assess ethics. Its intelligence is circumscribed by its training data and purpose. It cannot reason its way through completely new scenarios, as humans can through transfer learning.
That said, narrow AI models offer complementary advantages to human intelligence. Their prodigious memory and computational speed enable exhaustive data analytics. Their lack of bias and fatigue ensure consistent performance. In this way, they endow humans with superhuman data processing capabilities.
Rather than competing with AI, we can collaborate with it - combining human wisdom and ethics with AI's productivity and precision. Together, we evolve data analytics and make it more insightful and responsible. But humans must remain in the loop to provide guidance, assess implications and ensure alignment with ethics.
An ideal symbiosis is one where humans manage creative and strategic tasks requiring reason, ethics and imagination, while AI accelerates repetitive analytical tasks requiring memory, computation and precision. Similar to Iron Man deploying the AI assistant JARVIS to enhance his human capabilities.
So, while the gap between human intelligence and AI persists, narrow AI models like ChatGPT are still in their infancy. Their capabilities will continue to grow exponentially thanks to increased data size, computing power and algorithmic advances. One day, they may even cross the threshold into artificial general intelligence (AGI).
But for the time being, generative AI enhances rather than replaces humans when it comes to data analytics process. It takes care of the tedious parts, allowing data scientists to focus on creative, high-added-value tasks. It is becoming an indispensable analytical assistant that continues to learn - like a child who grows into an adult over the years with careful attention.
The key is for humans to guide the development of these generative models in a thoughtful and ethical way. We need to focus on beneficial objectives and monitor for harmful biases or abuse. With judicious care and training, AI can usher in an era of augmented analytics - where human and machine intelligences meet and converge for more powerful, yet ethical, data insights. But the human must remain the supervising parent for the AI child.
Rather than wondering when AI will surpass human intelligence, we should focus on how to cultivate beneficial and ethical AI applications today. Generative models like ChatGPT are impressionable children that will shape the future based on the guidance they receive. Data scientists have a profound opportunity and responsibility: to educate these AI "children", so that they become responsible and collaborative allies instead of impenetrable adversaries.
Just as teaching helps humans to consolidate our own knowledge, the training of AI models requires us to thoroughly evaluate our hypotheses, biases and best practices. AI development is as much about advancing our own intelligence - codifying disciplines into coherent frameworks, evidence-based principles and methodologies.
Collective training of generative models advances human knowledge across domains. This requires distilling nebulous issues into structured frameworks; formalizing messy tasks into step-by-step workflows; and crystallizing weakly defined domains into rigorous first principles. Teaching AI models through examples helps us to better evaluate solutions, generalize insights and formalize ethics for humans as well.
The future of data analytics is humans and AI working together - combining the human's imaginative definition of problems, ethics and strategic judgment with AI's vast memory, exhaustive computation and high-velocity analytical workflows. Neither can match the synergistic value of the two intelligences combined. Data science augments both human and artificial intelligence.
The time has come to actively train this child prodigy: ChatGPT! It holds enormous potential to enhance human capabilities if nurtured properly. We need to nurture it carefully - teaching analytical skills while emphasizing ethics, exposing ChatGPT to a variety of data and scenarios under supervision, so that it moves from mechanical regurgitation to contextual comprehension.
So, let us therefore guide these generative models with wisdom and kindness. Let us instill analytical techniques with values and ethics, guiding them from innocence to maturity. And let us develop artificial intelligence that makes individuals responsible instead of replacing them. Models such as ChatGPT are still in their maturing stages. With the careful supervision of researchers concerned with data ethics, they could evolve into assistants, opening up new horizons of ethically guided discovery, and collaborating more fruitfully with human than they could achieve on their own.
Just as Garry Kasparov's quote states, human success arises from our ability to synthesize the creative and intuitive aspects of cognition with the computational and analytical aspects. When we combine these complementary modes of thinking and reasoning, the result is an emergent intelligence that simply exceeds the linear sum of creativity and calculation. There are synergies and amplifying effects that result from the fusion of different thinking styles, which makes us unique as human...
Dateiformat: ePUBKopierschutz: Adobe-DRM (Digital Rights Management)
Systemvoraussetzungen:
Das Dateiformat ePUB ist sehr gut für Romane und Sachbücher geeignet – also für „fließenden” Text ohne komplexes Layout. Bei E-Readern oder Smartphones passt sich der Zeilen- und Seitenumbruch automatisch den kleinen Displays an. Mit Adobe-DRM wird hier ein „harter” Kopierschutz verwendet. Wenn die notwendigen Voraussetzungen nicht vorliegen, können Sie das E-Book leider nicht öffnen. Daher müssen Sie bereits vor dem Download Ihre Lese-Hardware vorbereiten.Bitte beachten Sie: Wir empfehlen Ihnen unbedingt nach Installation der Lese-Software diese mit Ihrer persönlichen Adobe-ID zu autorisieren!
Weitere Informationen finden Sie in unserer E-Book Hilfe.