
Big Data
Description
Alles über E-Books | Antworten auf Fragen rund um E-Books, Kopierschutz und Dateiformate finden Sie in unserem Info- & Hilfebereich.
Big Data is based on an inquiry of several years within Proxem, a software publisher specializing in big data processing. The book examines how data scientists explore, interpret and visualize our digital traces to make sense of them, and to produce new knowledge. Grounded in epistemology and science and technology studies, Big Data offers a reflection on data in general, and on how they help us to better understand reality and decide on our daily actions.
More details
Other editions
Additional editions


Person
Églantine Schmitt holds a PhD in Philosophy of Science from the University of Technology of Compiègne (Sorbonne Universités), France. After 7 years at Proxem, she has devoted her career to building bridges between humans and technology through product management and design, in a start-up environment.
Content
Introduction vii
Chapter 1. From Trace to Web Data: An Ontology of the Digital Footprint 1
1.1. The epistemology of the cultural sciences 7
1.2. The footprint in evidential sciences 9
1.3. The log or activity history 14
1.4. The digital footprint as a web log 18
1.5. The intentionality of digital footprints 20
1.6. Data as theoretically-loaded footprints 24
Chapter 2. Toward an Epistemic Continuity Anchored in the Cultural Sciences 29
2.1. Digital technology in the cultural sciences 31
2.2. Field and corpus: two modes of access to reality 34
2.3. Virtual methods, a reconstruction of access to the field 38
2.4. The challenges of the technical revolution of the text 48
2.5. From the web as an object to the web as a corpus 59
2.6. Conclusion 69
Chapter 3. The Status of Computation in Data Sciences 71
3.1. Making data computable 73
3.2. The field of computability 77
3.3. Computational thinking 81
3.4. Computation in the natural sciences 87
3.5. From exploratory analysis to data mining 98
3.6. The institutional and theoretical melting pot of data science 107
3.7. The contribution of artificial intelligence 115
3.8. Conclusion 122
Chapter 4. A Practical Big Data Use Case 125
4.1. Presentation of the case study 126
4.2. Customer experience and coding of feedback131
4.3. From the representative approach to the "big data" project 134
4.4. Data preparation 137
4.5. Design of the coding plan 140
4.6. The constitution of linguistic resources 143
4.7. Constituting the coding plan 148
4.8. Visibility of the language activity 153
4.9. Storytelling and interpretation of the data 155
4.10. Conclusion 161
Chapter 5. From Narratives to Systems: How to Shape and Share Data Analysis 165
5.1. Two epistemic configurations 166
5.2. The genesis of systems 172
5.3. Conclusion 183
Chapter 6. The Art of Data Visualization 187
6.1. Graphic semiology 187
6.2. Data cartography 198
6.3. Representation as evidence 203
6.4. The visual language of design in system configuration 207
6.5. Materialization and interpretation of recommendations 214
Chapter 7. Knowledge and Decision 219
7.1. Big data, a pragmatic epistemology? 220
7.2. Toward gradual validity of knowledge 227
7.3. Deciding, knowing and measuring 233
Conclusion 239
References 243
Index 257
Introduction
Our philosophical literature is full of intricate accounts of causal theories of perception, yet they have curiously little do with real life. We have fantastical descriptions of aberrant causal chains which, Gettier-style, call in question this or that conceptual analysis. But the modem microscopist has far more amazing tricks than the most imaginative of armchair students of perception. What we require in philosophy is better awareness of the truths that are stranger than fictions. We ought to have some understanding of those astounding physical systems "by whose augmenting power we now see more/than all the world has ever done before".
Ian Hacking, "Est-ce qu'on voit à travers un microscope?" (1981)
Every innovation in knowledge technologies disrupts our relationship with reality, increasing our perception, memory and reasoning abilities. Scientific measuring instruments dedicated to observation reveal new aspects of reality, while tools dedicated to manipulation give us the ability to intervene in what is no longer immaculate nature, but a system made up of what we have found and what we have brought to it. The telescope has given us access to what is at a distance, the microscope to infinitely small particles and the X-ray has given access to the inner side of the material. Closer to home, the advent of digital technology has reinvented the way we record and share our knowledge. It is a new material for action and knowledge, as well as a new tool for manipulating and constituting this knowledge. The multiplication of the traces that we leave of ourselves on these digital materials now gives us a new access to our own culture.
Each innovation in knowledge technologies calls for a new epistemology, a new reasoned look at the objects we wish to learn about. While already relying on knowledge, these technologies augment our knowledge-producing thinking and our capacity for memory, learning and manipulation. Technology not only equips the scientific mind, but also pushes it beyond its limits, toward new theories of reality and new methods to apprehend it. These new approaches, hesitant and shaky, nevertheless build bridges between what we can see and manipulate, and our need for rationality. As Popper (1985) wrote: "Reason works by trial and error. We invent our myths and theories and we try them out: we try to see how far they take us".
As such, the new approaches brought about by innovations in knowledge technologies are inevitably unsatisfactory, both in the light of our usual standards and because of their nascent character. They are always incomplete, insufficient and unacceptable. They will be criticized, amended, revisited and taken up at the root. Nevertheless, without the imperfection of these pioneering trials, there is nothing more to perfect than the deconstruction of what could have been done.
The multiplication of the traces we leave of ourselves on digital media is no exception to these observations. More or less indiscriminately referred to as big data, data sciences, algorithms or artificial intelligence, the reasoned and technically instrumented study of these traces emerges with its procession of "myths and theories", as Popper says, that we formulate along the way. Similar to the dawn of Plato's logos, the boundary between myth and science is still fragile, and it takes a sharp eye to distinguish between them. The myth tells a story that is more pleasant and easier to understand than the story of trial and error, full of technicalities, of the first achievements, thus spreading faster and further. The experimenter navigates by sight, as much from what they know as from what they would like to know, intertwining the two. They are the hero of the myth that is told to them, and that they tell themselves in order to find their way around. Although they draw inspiration from it, there is, as we shall see, nothing in the study of digital footprints that satisfies the criteria of contemporary sciences, whatever they may be, while having a fundamentally similar mode of emergence.
To understand what is at stake with the multiplication of digital footprints, we need to listen to the pleasant myth as well as to the technicalities, and take them both seriously. To account for these new knowledge technologies, we need to mobilize a benevolent philosophy of science, attentive to detail. We must opt for an attitude that is simultaneously descriptive and normative, because to describe things for the first time is to name them, that is to say, to lay down the terms in which they can be apprehended. Carrying out big data epistemology means building the theoretical apparatus and the conceptual position required to understand and study these large masses of data. It is about formulating an initial methodological paradigm, general enough to apply to any study project of this type, and specific enough to already guide the necessary adjustments to the actual situation of a project.
This simultaneously descriptive - almost historicizing - and normative approach - in the sense that its contribution is a method-prescribing paradigm - allows us to escape from another, less fruitful normativity: that of deconstruction. As we shall see, to brand a phenomenon a social construction is to lock it up and condemn it, as if it had nothing more to say beyond its status as a cultural artifact. To reduce it to a sociologizing object, a matter of power games between actors, is to boil it down to a pure exteriority. To avoid this pitfall, we choose to adopt a philosophical approach that integrates the object's internal and external properties, revealing the complexity of the belief system it constitutes, rather than bringing it back within the boundaries of a simple object, simple and straightforward, which only supports one aspect of analysis.
This stance mobilizes a certain conception of the philosophy of science. A certain conception of philosophy, first of all, conscious of the criticisms addressed to it and of its predilection for "intricate accounts of causal theories [with] curiously little to do with real life", according to the formulation of Ian Hacking, himself a philosopher, and quoted above in this introduction. The philosophy that we practice is constantly nourished by reality, through what the human and social sciences have to say about it, and by first-hand experience as to its purpose, which, as we shall see, benefits the author of these lines. It is a philosophy that wants to be defined by the object it gives itself and the developments it imposes on it, more than by a particular philosophical tradition, a specific school of thought. We will be obsessed with what has actually happened, with what is observable, with the material, practical and empirical conditions of the emergence of our object. We situate ourselves between science and technology as articulated by science and technology studies (STS). It will be as much a philosophy of science as it is a philosophy of technology, adopting a conceptual approach without being analytical, as well as a historicizing approach without being a history of science.
We also mobilize a certain conception of science (or sciences), according to which it is not reduced to a string of disciplines subject to the scientific imperialism (Mäki 2013) of physics, but integrates any simultaneously empiricalizing and reasoned study of an object, whether it comes from nature or culture, from well-established scientific institutions or from novice amateurs. We will also strive to avoid the naïve belief that epistemic practices are pure and disinterested, in the service of an unveiling of truth as correspondence to reality. We will thus consider that the actors of these practices, while adhering to a certain more or less sophisticated scientific realism, also act according to other epistemological and extraepistemological norms.
Our object is therefore not the pure and disinterested knowledge that might emerge from the heap of our activity traces; we consider the question of big data as a system with epistemic but also practical, economic and semiotic components. Innovation in knowledge technologies is accompanied in particular by economic issues that have serious consequences on the effective production of knowledge. These technologies have a cost, and they are sold rather than given. Those who have access are not necessarily those who would benefit most or best from it. Without going into a detailed mapping of the agents and financial flows concerned by this system, we will always bear in mind these practical conditions, which are not only technical but also economic, particularly in that they provide factors that explain the ways in which knowledge is produced.
Like any good philosophical object, the big data phenomenon is vast, rich and complex. On the other hand, little has been said about it, from a philosophical point of view, that is worth retaining: it is more or less virgin ground for epistemology. Our ambition is therefore not to exhaust it and to conceptualize it in its entirety. It is no longer what it was at the beginning of our research, and is probably not about to stabilize. More modestly, our ambition is to provide, like pioneers, the first keys to understanding the object, its complexity and the different angles from which it can be viewed, and to enable others to spare themselves from speculative or sterile explorations. These keys to interpretation are as much conceived as means of understanding as they are remedies for the risk of misunderstanding a subject that is the target of much superficial, emotionally or axiologically charged discourse. We wish,...
System requirements
File format: ePUB
Copy protection: Adobe-DRM (Digital Rights Management)
System requirements:
- Computer (Windows; MacOS X; Linux): Install the free reader Adobe Digital Editions prior to download (see eBook Help).
- Tablet/smartphone (Android; iOS): Install the free app Adobe Digital Editions or the app PocketBook before downloading (see eBook Help).
- E-reader: Bookeen, Kobo, Pocketbook, Sony, Tolino and many more (not Kindle).
The file format ePub works well for novels and non-fiction books – i.e., „flowing” text without complex layout. On an e-reader or smartphone, line and page breaks automatically adjust to fit the small displays.
This eBook uses Adobe-DRM, a „hard” copy protection. If the necessary requirements are not met, unfortunately you will not be able to open the eBook. You will therefore need to prepare your reading hardware before downloading.
Please note: We strongly recommend that you authorise using your personal Adobe ID after installation of any reading software.
For more information, see our ebook Help page.