Artificial Intelligence (AI) in Forensic Sciences


Zeno Geradts, Katrin Franke (Editors)

Wiley-ISTE (Publisher)

1st Edition

Published on 22 August 2023

256 pages

E-Book

ePUB with Adobe-DRM

978-1-119-81334-7 (ISBN)

€93.99 incl. 7% VAT

E-Book Single Licence

Available for download

Description

ARTIFICIAL INTELLIGENCE (AI) IN FORENSIC SCIENCES

Foundational text for teaching and learning within the field of Artificial Intelligence (AI) as it applies to forensic science

Artificial Intelligence (AI) in Forensic Sciences presents an overview of state-of-the-art applications of Artificial Intelligence within Forensic Science. It covers validation issues and new crimes that use AI; issues with triage, preselection, identification, argumentation and explainability; demonstrations of AI in forensic science; and discussions of bias when using AI.

The text discusses the challenges for the legal presentation of AI data and interpretation and offers solutions to this problem while addressing broader practical and emerging issues in a growing area of interest in forensics. It builds on key developing areas of focus in academic and government research, providing an authoritative and well-researched perspective.

Compiled by two highly qualified editors with significant experience in the field, and part of the Wiley-AAFS series 'Forensic Science in Focus', Artificial Intelligence (AI) in Forensic Sciences includes information on:

* Cyber IoT, fundamentals on AI in forensic science, speaker and facial comparison, and deepfake detection

* Digital-based evidence creation, 3D and AI, interoperability of standards, and forensic audio and speech analysis

* Text analysis, video and multimedia analytics, reliability, privacy, network forensics, intelligence operations, argumentation support in court, and case applications

* Identification of genetic markers, current state and federal legislation with regard to AI, and forensics and fingerprint analysis

Providing comprehensive coverage of the subject, Artificial Intelligence (AI) in Forensic Sciences is an essential advanced text for final year undergraduates and master's students in forensic science, as well as universities teaching forensics (police, IT security, digital science and engineering), forensic product vendors and governmental and cyber security agencies.

Content

About the editors

List of Contributors

Series Preface

Preface Book

Acknowledgements

1 Introduction
Zeno Geradts and Katrin Franke

2 AI-based Forensic Evaluation in Court: The Desirability of Explanation and the Necessity of Validation
Rolf J.F. Ypma, Daniel Ramos, and Didier Meuwly

2.1 Introduction

2.1.1 AI for Forensic Evaluation

2.2 The Desirability for Explanation and the Necessity of Validation

2.3 Explainability (and its Validity)

2.3.1 Reasons to Pursue Explanations

2.3.2 Types of Explanations

2.3.3 Limitations of Explanations

2.4 Validation (and its Explanation)

2.4.1 Measure the Method's Performance

2.4.2 Approach in Four Steps

2.4.3 Accountability

2.5 Conclusion

3 Machine Learning for Evidence in Criminal Proceedings: Techno-legal Challenges for Reliability Assurance
Radina Stoykova, Jeanne Mifsud Bonnici, and Katrin Franke

3.1 Introduction: AI in the Intersection of Criminal Procedure and Forensics

3.1.1 Technical Fragmentation in Digital Investigations

3.1.2 Legal and Methodological Fragmentation in Digital Investigations

3.1.3 Specifics of ML-based Investigative Approach

3.1.4 Scope and Definitions

3.2 Legal Framework

3.2.1 The Fair Trial Principle

3.2.2 Necessity and Proportionality of Investigative Measures

3.2.3 The AIA Proposal

3.2.4 AI System Development and Legislative Contradictions

3.3 Machine Learning Pipelines: Techno-legal Challenges

3.3.1 Task + Purpose Limitation and Data Minimization

3.3.2 Dataset Engineering and Data Governance

3.3.3 Pre-processing for Input: Trade-offs between Accuracy and Computational Costs

3.3.4 Modelling

3.4 AI Use in Investigations: AI System Design + Data Protection = Fair Trial?

3.5 Conclusion

4 Formalising Representation and Interpretation of Digital Evidence to Reinforce Reasoning and Automated Analysis
Eoghan Casey and Timothy Bollé

4.1 Introduction

4.2 Background and Related Work

4.3 Method

4.4 Representing Digital Traces

4.5 Representing Computed Similarity

4.6 Representing ML Classification

4.7 Representing Hypothesis Test Results (a.k.a. Inferences)

4.7.1 Location Example

4.7.2 Identification Example

4.8 Effective/Reliable/Responsible Automated Analysis

4.9 Conclusion

5 Servicing Digital Investigations with Artificial Intelligence
Harm van Beek and Hans Henseler

5.1 Introduction

5.2 Introduction to Hansken

5.2.1 Normalized Trace Model

5.2.2 Forensic Tool Application

5.2.3 Hansken's Application Programming Interfaces

5.3 Large Scale Application of AI Techniques

5.3.1 Rule-based AI Techniques Implemented in Hansken

5.3.2 Deep-learning AI Techniques Currently Implemented in Hansken

5.3.3 Deep-learning AI Techniques to be Implemented in Hansken

5.3.4 The Application of Large Language Models in Digital Forensics

5.4 Conclusions and Further Reading

6 On the Feasibility of Social Network Analysis Methods for Investigating Large-scale Criminal Networks
Jan William Johnsen and Katrin Franke

6.1 Introduction

6.2 Previous Work

6.3 Material and Methods

6.3.1 Real-world Underground Forum Database Dumps

6.3.2 Network Centrality Measures

6.3.3 Measuring Association Using Bi-variate Analysis

6.3.4 Topic Modelling Algorithms

6.4 Experimental Setup

6.4.1 Evaluating Network Centrality Measures for Forensics

6.4.2 Our Novel Approach for Analysing Cybercriminal's Technical Skills

6.5 Experimental Results and Discussion

6.5.1 Correlation Testing

6.5.2 Our Newly Proposed Method

6.6 Conclusion

7 Mapping NLP Techniques to Investigations and Investigative Interviews
Kyle Porter and Bente Skattør

7.1 Introduction

7.2 Criminal Investigation

7.2.1 Investigative Interviews

7.3 Assessing the Needs of Investigators in an NLP Context

7.3.1 Mapping Interviewer Needs to Existing NLP Tasks

7.4 Automatic Speech Recognition

7.4.1 ASR Basics

7.4.2 ASR, Digital Investigation, and the State of the Art

7.5 NLP Basics

7.5.1 Common Terminology

7.5.2 Vector Space Models and Embeddings

7.5.3 Modern NLP Models

7.6 Text Extraction

7.6.1 Entity Identification and Named Entity Recognition

7.6.2 Named Entity Recognition Metrics

7.6.3 NER Applied to Investigations

7.6.4 Entity Linking

7.6.5 Limitations of Using NER

7.6.6 Extraction Methods outside NER

7.7 Text Classification

7.7.1 Classification Evaluation Metrics

7.7.2 Text Classification and Digital Investigation

7.7.3 Classification Limitations

7.8 Text Reduction

7.8.1 Thematic Extraction and Topic Modelling

7.8.2 Topic Modelling and Digital Investigations

7.8.3 Limitations of Topic Modelling

7.8.4 Text Summarization

7.8.5 Text Summarization and Digital Investigations

7.8.6 Summarization Limitations

7.9 Discussion and Conclusion

7.9.1 Future Work

8 The Influence of Compression on the Detection of Deepfake Videos
Meike Kombrink and Zeno Geradts

8.1 Introduction

8.2 Method

8.2.1 Dataset

8.2.2 Deepfake Detection

8.3 Results

8.3.1 Compressed Dataset

8.3.2 Algorithms

8.4 Discussion

8.4.1 Deepfake Detection

8.4.2 Compression

8.4.3 Future Work

8.5 Conclusion

9 Event Log Analysis and Correlation: A Digital Forensic Perspective
Neminath Hubballi and Pratibha Khandait

9.1 Introduction

9.2 Sources of Logs

9.2.1 End Host System Logs

9.2.2 Networking Devices and Security Applications

9.2.3 Application Logs

9.3 Need for Correlation

9.4 Correlation Techniques

9.5 Conclusions

10 (Hyper-)graph Analysis and its Application in Forensics
Marcel Worring

10.1 Introduction

10.2 Survey of Methods

10.2.1 Preliminaries

10.2.2 Tasks

10.2.3 Graph Neural Networks

10.3 Explainability and Visualization

10.4 Conclusion

11 Conclusion
Zeno Geradts and Katrin Franke

Index

CHAPTER 2
AI-based Forensic Evaluation in Court: The Desirability of Explanation and the Necessity of Validation

Rolf J.F. Ypma (1), Daniel Ramos (2), and Didier Meuwly (1, 3)

(1) Netherlands Forensic Institute, Netherlands
(2) AUDIAS Lab, Universidad Autónoma de Madrid, Madrid, Spain
(3) University of Twente, Enschede, Netherlands

2.1 Introduction

Artificial intelligence (AI) is based on complex algorithms, methods using them, and systems. In this chapter, we understand algorithms as the core technology (e.g., a deep neural network (DNN)), the methods as the application of that core technology to a particular problem (e.g., the use of a DNN to evaluate fingerprint evidence), and the system as the software tool(s) that implement that method and algorithm (e.g., the GUI software package, or API, that is commercialized). The core technology typically includes pattern recognition and machine learning algorithms, whose complexity and performance have increased dramatically in recent years. These algorithms are often explainable in their inputs and their outputs, but the rationale that governs their internal mechanisms can be very difficult to explain. DNNs, a family of algorithms forming the so-called deep learning field (LeCun et al. 2015), are a prime example. A DNN is a dense and very complex grid of connections between units called neurons. These connections can even be recurrent, or complemented with filters that interrelate different regions in the input features. Originally, a neural network aimed at mimicking the topology of the human brain, hence its name. The input of a DNN is typically a list of numbers representing some textual or sensory raw data (e.g., a fingermark image or an audio file). The output of a DNN is continuous or discrete, depending on the task (e.g., a probability for each of a set of classes in a classification task, or a magnitude of interest to predict in a regression task). Thus, the inputs and outputs are well defined, and even explainable in many cases. However, it is very difficult to know or explain what the activation of any single neuron, or a group of them, means. Although some intuitions exist, interpreting those intermediate outcomes is extremely difficult in general.

Indeed, the explainability of machine learning algorithms is a field in itself, named explainable AI (XAI), which covers explainable algorithms for machine learning and AI (Doshi-Velez and Kim 2017; Guidotti et al. 2018; LeCun et al. 2015; Molnar 2020). XAI is closely related to more philosophical areas such as ethics, fairness, and the risk of applicability of AI to real-world problems.
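
The contrast between well-defined inputs and outputs and hard-to-read intermediate values can be illustrated with a toy network. The sketch below is not taken from the chapter: the layer sizes, the fixed random seed, and the helper names (`make_layer`, `forward`) are all assumptions, and the random weights merely stand in for trained ones.

```python
import math
import random


def relu(values):
    # Rectified linear unit: negative activations are clipped to zero.
    return [max(0.0, v) for v in values]


def dense(weights, biases, values):
    # One fully connected layer: each unit is a weighted sum of all inputs plus a bias.
    return [sum(w * v for w, v in zip(row, values)) + b
            for row, b in zip(weights, biases)]


def softmax(values):
    # Turn raw scores into probabilities that sum to 1.
    largest = max(values)
    exps = [math.exp(v - largest) for v in values]
    total = sum(exps)
    return [e / total for e in exps]


def make_layer(rng, n_out, n_in):
    # Hypothetical weights: random stand-ins for what training would produce.
    weights = [[rng.uniform(-1, 1) for _ in range(n_in)] for _ in range(n_out)]
    biases = [rng.uniform(-1, 1) for _ in range(n_out)]
    return weights, biases


rng = random.Random(0)
w1, b1 = make_layer(rng, 5, 4)  # 4 input features -> 5 hidden neurons
w2, b2 = make_layer(rng, 3, 5)  # 5 hidden neurons -> 3 classes


def forward(features):
    hidden = relu(dense(w1, b1, features))
    probs = softmax(dense(w2, b2, hidden))
    return hidden, probs


features = [0.2, 0.7, 0.1, 0.9]  # "a list of numbers" standing in for raw data
hidden, probs = forward(features)
print("hidden activations:", [round(h, 3) for h in hidden])
print("class probabilities:", [round(p, 3) for p in probs])
```

The final list reads naturally as one probability per class, whereas nothing in the hidden activations says what any single neuron represents. A real DNN has vastly more such intermediate values.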

In the context of forensic evaluation using algorithms, we consider that to interpret means providing a forensic meaning to the results of the computation (e.g., a set of features or a score obtained or extracted from the evidence), and modelling them probabilistically in the form of a likelihood ratio. The likelihood ratio is a numerical expression of the probative value that is meaningful in the judicial context, where the alternative propositions of the defence and the prosecution are disputed (Evett et al. 2000; Jackson et al. 2013). However, the interpretation by humans in understandable terms of the inner components of the "black box" that forms many machine learning algorithms (such as DNNs) is still an issue, particularly for high-impact decisions as in forensic science. At this point it is worth highlighting that we distinguish between the interpretation of the inner components and workings of an AI algorithm, and the interpretation of its results (e.g., outputs such as the strength of evidence expressed as a likelihood ratio (LR) computed by an algorithm, or a class probability of a classifier). In the XAI literature these are often referred to as "global" and "local" explanations. The likelihood ratio framework is considered a logical approach for the forensic interpretation of the results of evidence evaluation, in this case in the context of a Bayesian decision scenario. But we can also think about the interpretation of the rationale of the methods themselves, i.e., how they work internally.
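
As a rough illustration of how a score can be turned into a likelihood ratio, the sketch below divides the probability density of a comparison score under the prosecution proposition by its density under the defence proposition, then applies the odds form of Bayes' theorem. The Gaussian score models, their parameters, and the prior odds are invented for illustration only; a real method would estimate them from validation data.

```python
import math


def gaussian_pdf(x, mean, std):
    # Density of a normal distribution at x.
    z = (x - mean) / std
    return math.exp(-0.5 * z * z) / (std * math.sqrt(2.0 * math.pi))


def likelihood_ratio(score, same_source, different_source):
    # LR = p(score | prosecution proposition) / p(score | defence proposition).
    numerator = gaussian_pdf(score, *same_source)
    denominator = gaussian_pdf(score, *different_source)
    return numerator / denominator


def posterior_odds(prior_odds, lr):
    # Odds form of Bayes' theorem: posterior odds = prior odds * LR.
    return prior_odds * lr


# Hypothetical (mean, std) of scores under each proposition.
SAME_SOURCE = (2.0, 1.0)
DIFFERENT_SOURCE = (-1.0, 1.0)

lr = likelihood_ratio(1.5, SAME_SOURCE, DIFFERENT_SOURCE)
print("LR:", round(lr, 2))
print("posterior odds for prior odds of 1 to 100:", round(posterior_odds(0.01, lr), 3))
```

In this framing the forensic scientist reports only the LR. The prior odds, and therefore the posterior odds, belong to the court, which is why the two functions are kept separate.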

Machine learning algorithms, once training has finished, are completely deterministic and thus reproducible and repeatable. This is good news when trying to characterize the performance and the behavior of a given algorithm in a given experimental set-up. Indeed, as described below, this repeatability and reproducibility make the system testable, which is the basis of a rigorous empirical validation process. However, in recent years the size and complexity of DNNs have steadily increased. As a consequence, creating explanations to provide insight into the rationale of the algorithm has become more challenging. We can, therefore, state that machine learning and pattern recognition algorithms in AI systems can be validated, but their rationale remains hard to explain.

As an example in the context of forensic science, it is common to use complex analytical chemistry methods for forensic examination. We can think about the use of laser ablation with inductively coupled plasma mass spectrometry (LA-ICP-MS) for glass comparative analysis. In court, it is extremely difficult to explain the process of extraction of information from glass with an LA-ICP-MS analytical device. As a result, explaining the technical details of the method is not helpful in general, with lawyers generally lacking the competence to understand such an explanation. However, the results of the method are typically trusted and assumed to be reliable: courts generally accept forensic examination based on these chemical profiles as valuable evidence, as we think they should. We believe that an important reason for this is the fact that the process of producing and comparing analytical results in forensic glass examination with LA-ICP-MS has been tested and validated according to international quality standards and accredited by a national accreditation body. This proves the method's reliability in court, more than any explanations that the forensic scientist could give to the judge or jury. In this case, explainability is welcome if it makes the whole process more understandable and transparent, but is not essential.

Explainability problems pile up if we consider a typical operational scenario, where the data have not been seen before, and may not have been well represented in the controlled dataset that was used to train the algorithms. It is very difficult to predict the behavior of a system trained on a controlled dataset in all situations. This is due to the complexity of the algorithm, but also to the diversity and variability of the possible scenarios. Even in controlled conditions, the complexity of the feature and parameter spaces of modern machine learning algorithms is huge. If the scenario in which the machine learning algorithm is going to operate is not very well characterized and targeted, the results might be unpredictable. The bad news is that, when this degradation of performance happens, it is very difficult to scrutinize the inner behavior of the algorithm in order to solve, or even explain, this lack of performance.

One recent problem that can help to understand this situation is the sensitivity of DNNs to adversarial noise. It was recently discovered that DNNs, although presenting extremely competitive performance in a wide variety of tasks, are also highly sensitive to so-called adversarial examples. Goodfellow et al. (2014) present a simple scenario in an image classification task, where a DNN is used to classify input images into classes (e.g., written digits, or types of objects in the image). The performance of the DNN was excellent in those tasks, where the conditions of the training and the testing datasets were similar. However, when so-called adversarial noise was added, i.e., a controlled degradation of the input image, the performance of the DNN dropped dramatically. The most intriguing fact about the experiment is that the adversarial noise could not be perceived by a human observer: the original and the noisy image looked exactly the same to the human eye. Also, this ability of adversarial noise to fool a DNN seems to manifest in different datasets and DNN architectures, revealing a vulnerability of DNNs for image classification in general. Of course, in forensic science it is difficult to manipulate data in order to add this kind of perturbation. However, adversarial noise is a good example of potential unknown unknowns resulting in unexpected results from an AI-based system, illustrating our lack of understanding of such systems. The main message for forensic examiners is important: even if an algorithm such as a DNN has been tested in controlled conditions, it can fail in unexpected ways when dataset conditions are different.
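
The mechanism can be sketched on a much simpler model than a DNN. The example below applies a gradient-sign perturbation, in the spirit of the fast gradient sign method associated with Goodfellow et al., to a bias-free logistic classifier. The classifier, its 100 hand-picked weights, the input, and the value of `epsilon` are all assumptions chosen so that the arithmetic is easy to follow; they are not the experiment described in the text.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def sign(value):
    # Returns 1, -1 or 0 depending on the sign of value.
    return (value > 0) - (value < 0)


def predict(weights, x):
    # Probability of class 1 under a bias-free logistic model (stand-in for a DNN).
    return sigmoid(sum(w * xi for w, xi in zip(weights, x)))


def gradient_sign_attack(weights, x, label, epsilon):
    # For logistic regression, the gradient of the cross-entropy loss with
    # respect to the input is (p - label) * w. Each feature is moved by
    # epsilon in the direction that increases the loss.
    p = predict(weights, x)
    grad = [(p - label) * w for w in weights]
    return [xi + epsilon * sign(g) for xi, g in zip(x, grad)]


# Hypothetical model and input: 100 features with alternating weights.
weights = [1.0 if i % 2 == 0 else -1.0 for i in range(100)]
x = [0.5 + 0.02 * w for w in weights]  # logit of about 2, so class 1

x_adv = gradient_sign_attack(weights, x, label=1, epsilon=0.05)

p_clean = predict(weights, x)
p_adv = predict(weights, x_adv)
largest_change = max(abs(a - b) for a, b in zip(x_adv, x))

print("P(class 1) on original input: ", round(p_clean, 3))
print("P(class 1) on perturbed input:", round(p_adv, 3))
print("largest change to any feature:", round(largest_change, 3))
```

No single feature moves by more than `epsilon`, yet the shifts add up across all 100 dimensions and move the prediction from class 1 to class 0. With the far higher dimensionality of images, a much smaller per-pixel change can have the same effect.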

Unfortunately, forensic conditions are always very variable, uncontrolled, and uncertain. This is a very challenging and unfavorable situation for any machine learning algorithm, but in particular it threatens the reliability of complex AI algorithms such as DNNs. Under such circumstances, the machine learning field has to continue to improve in order to generate solutions for a variety of scenarios where the operational data can be very variable and different from the training data. This includes strategies such as incorporating uncertainty into models, probabilistic calibration, domain adaptation, or transfer learning. All these approaches are aimed at making the system more robust to variation between datasets and to data scarcity. Yet, safeguards at the operational level, such as a...
