Chapter 2: Neural network (machine learning)
A neural network, also called an artificial neural network (ANN) or neural net (NN), is a model used in machine learning. It is inspired by the structure and function of the biological neural networks found in animal brains.
An ANN consists of connected units or nodes called artificial neurons, loosely modeled on the neurons of the brain, while the edges connecting them represent the brain's synapses. Each artificial neuron receives signals from the neurons connected to it, processes them, and sends a signal on to the neurons it connects to in turn. The "signal" is a real number, and each neuron's output is computed by a non-linear activation function applied to the sum of its inputs. Each connection carries a weight that determines the strength of the signal at that link; these weights are adjusted as learning proceeds.
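As a rough sketch of this description, the Python snippet below computes a neuron's output by applying a non-linear activation function (a sigmoid here) to the weighted sum of its inputs plus a bias. The names, such as `artificial_neuron`, are purely illustrative and not drawn from any particular library.

```python
import math

def sigmoid(z):
    # A common non-linear activation function.
    return 1.0 / (1.0 + math.exp(-z))

def artificial_neuron(inputs, weights, bias):
    # Weighted sum of the incoming real-valued signals, then a non-linear activation.
    total = sum(w * x for w, x in zip(weights, inputs)) + bias
    return sigmoid(total)

# Example: a neuron with three inputs.
print(artificial_neuron([0.5, -1.0, 2.0], weights=[0.8, 0.2, -0.4], bias=0.1))
```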
Neurons are typically organized into layers, and different layers may perform different transformations on their inputs. Signals travel from the first layer (the input layer) to the last layer (the output layer), possibly passing through several intermediate layers known as hidden layers. A network with at least two hidden layers is usually called a deep neural network.
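A minimal sketch of this layered arrangement is shown below, assuming tanh activations, randomly initialized weights, and layer sizes chosen arbitrarily for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)

def layer(x, W, b):
    # One layer: an affine transformation followed by a non-linear activation (tanh here).
    return np.tanh(W @ x + b)

# A small network: 3 inputs -> two hidden layers of 4 units -> 2 outputs.
sizes = [3, 4, 4, 2]
params = [(rng.normal(size=(m, n)), rng.normal(size=m))
          for n, m in zip(sizes[:-1], sizes[1:])]

x = np.array([0.2, -0.7, 1.5])   # signal entering the input layer
for W, b in params:              # the signal passes through the hidden layers to the output layer
    x = layer(x, W, b)
print(x)
```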
Artificial neural networks are used for predictive modeling, adaptive control, and solving problems in artificial intelligence, among many other applications. They can learn from experience and can draw conclusions from a complex and seemingly unrelated collection of information.
Neural networks are usually trained through empirical risk minimization. This approach is based on optimizing the network's parameters to minimize the difference, or empirical risk, between the predicted output and the actual target values in a given dataset. The parameters are typically estimated with gradient-based methods such as backpropagation. During training, an ANN learns from labeled training data by iteratively updating its parameters to minimize a defined loss function. This allows the network to generalize to data it has not previously encountered.
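A minimal sketch of empirical risk minimization by gradient descent is given below, using a toy one-weight linear model and a synthetic dataset; all names and numbers are illustrative, and the gradients are written out by hand rather than obtained via backpropagation.

```python
import numpy as np

rng = np.random.default_rng(1)

# Toy labeled dataset: targets generated by y = 2*x - 1 plus noise.
X = rng.uniform(-1, 1, size=(100, 1))
y = 2 * X[:, 0] - 1 + 0.1 * rng.normal(size=100)

w, b = 0.0, 0.0   # parameters of a one-neuron linear model
lr = 0.1          # learning rate

for epoch in range(200):
    pred = X[:, 0] * w + b
    err = pred - y
    loss = np.mean(err ** 2)             # empirical risk: mean squared loss over the dataset
    grad_w = 2 * np.mean(err * X[:, 0])  # gradient of the loss w.r.t. each parameter
    grad_b = 2 * np.mean(err)
    w -= lr * grad_w                     # iterative update that reduces the loss
    b -= lr * grad_b

print(w, b, loss)  # w and b should approach 2 and -1
```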
Today's deep neural networks build on early work in statistics from more than two centuries ago. The simplest kind of feedforward neural network (FNN) is a linear network, which consists of a single layer of output nodes with linear activation functions; the inputs are fed directly to the outputs via a series of weights. At each node, the sum of the products of the weights and the inputs is computed. The weights are adjusted to minimize the mean squared error between these computed outputs and the given target values. This technique has been known for more than two centuries as the method of least squares or linear regression; Legendre (1805) and Gauss (1795) used it to find a good rough linear fit to a set of points in order to predict planetary motion.
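In code, this amounts to ordinary least squares. The short sketch below, using synthetic data with illustrative values, fits a line to noisy points by minimizing the mean squared error, here via NumPy's closed-form least-squares solver rather than the iterative updates shown earlier.

```python
import numpy as np

# Noisy observations of a roughly linear relationship.
x = np.array([0.0, 1.0, 2.0, 3.0, 4.0])
targets = np.array([1.1, 2.9, 5.2, 6.8, 9.1])

# Design matrix with a column of ones so the fit includes an intercept.
A = np.column_stack([x, np.ones_like(x)])

# Least squares: choose the weights minimizing the mean squared error.
(slope, intercept), *_ = np.linalg.lstsq(A, targets, rcond=None)
print(slope, intercept)
```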
Digital computers such as the von Neumann architecture have traditionally operated by executing explicit instructions, with memory accessed by a number of separate processors. Some neural networks, by contrast, originated from attempts to model the information processing of biological systems within the framework of connectionism. Unlike the von Neumann approach, connectionist computing does not separate memory and processing.
In 1943, Warren McCulloch and Walter Pitts considered a non-learning computational model for neural networks. This model split research into two approaches: one focused on biological processes, the other on the application of neural networks to artificial intelligence.
In the late 1940s, D. O. Hebb proposed a learning hypothesis based on the mechanism of neural plasticity that became known as Hebbian learning. It was used in many early neural networks, such as Rosenblatt's perceptron and the Hopfield network. In 1954, Farley and Clark used computing machines to simulate a Hebbian network. Other neural network computational machines were created by Rochester, Holland, Haibt, and Duda (1956).
The perceptron, one of the first artificial neural networks, was described by psychologist Frank Rosenblatt in 1958; its development was funded by the United States Office of Naval Research.
According to R. D. Joseph (1960), an even earlier perceptron-like device was built by Farley and Clark: "Farley and Clark of MIT Lincoln Laboratory actually preceded Rosenblatt in the development of a perceptron-like device." However, they "dropped the subject."
The perceptron raised public interest in research on artificial neural networks, leading to a large increase in funding from the United States government. Optimistic claims by computer scientists about perceptrons' ability to emulate human intelligence fueled what became known as "the Golden Age of AI" and contributed to the development of artificial intelligence.
The initial perceptrons did not have adaptive hidden units. However, Joseph (1960) also discussed multilayer perceptrons with an adaptive hidden layer. Rosenblatt (1962) cited and adopted these ideas, also crediting work by H. D. Block and B. W. Knight. These early efforts, however, did not produce a working learning algorithm for hidden units, i.e., deep learning.
Fundamental research on ANNs continued in the 1960s and 1970s. The first working deep learning algorithm was the Group method of data handling (GMDH), developed in 1965 in Ukraine by Alexey Ivakhnenko and Lapa as a method for training arbitrarily deep neural networks. They regarded it as a form of polynomial regression, or a generalization of Rosenblatt's perceptron. A 1971 paper described a deep network with eight layers trained by this method, which trains the network layer by layer through regression analysis, with superfluous hidden units pruned using a separate validation set. Because the nodes' activation functions are Kolmogorov-Gabor polynomials, these were also the first deep networks with multiplicative units, or "gates."
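The sketch below is loosely in the spirit of this layer-by-layer procedure rather than Ivakhnenko's exact algorithm: each candidate unit is a quadratic polynomial of two features fitted by least squares on the training set, units are kept or pruned according to their error on a separate validation set, and the surviving outputs feed the next layer. All names (such as `gmdh_layer`) and data are illustrative.

```python
import numpy as np
from itertools import combinations

def fit_unit(f1, f2, target):
    # One GMDH-style unit: a quadratic (Kolmogorov-Gabor style) polynomial of two features,
    # fitted by least squares; the cross term f1*f2 makes the unit multiplicative ("gate"-like).
    A = np.column_stack([np.ones_like(f1), f1, f2, f1 * f2, f1**2, f2**2])
    coef, *_ = np.linalg.lstsq(A, target, rcond=None)
    return coef

def eval_unit(coef, f1, f2):
    A = np.column_stack([np.ones_like(f1), f1, f2, f1 * f2, f1**2, f2**2])
    return A @ coef

def gmdh_layer(train_feats, val_feats, y_train, y_val, keep=4):
    # Fit a candidate unit for every pair of features on the training set,
    # then keep only the units with the lowest error on the separate validation set.
    candidates = []
    for i, j in combinations(range(train_feats.shape[1]), 2):
        coef = fit_unit(train_feats[:, i], train_feats[:, j], y_train)
        val_out = eval_unit(coef, val_feats[:, i], val_feats[:, j])
        err = np.mean((val_out - y_val) ** 2)
        candidates.append((err, i, j, coef))
    best = sorted(candidates, key=lambda c: c[0])[:keep]
    new_train = np.column_stack([eval_unit(c, train_feats[:, i], train_feats[:, j])
                                 for _, i, j, c in best])
    new_val = np.column_stack([eval_unit(c, val_feats[:, i], val_feats[:, j])
                               for _, i, j, c in best])
    return new_train, new_val

rng = np.random.default_rng(2)
X = rng.normal(size=(200, 5))
y = X[:, 0] * X[:, 1] + 0.5 * X[:, 2] ** 2 + 0.05 * rng.normal(size=200)
Xtr, Xva, ytr, yva = X[:150], X[150:], y[:150], y[150:]

# Stack a few layers, training each one by regression and pruning with the validation set.
for _ in range(3):
    Xtr, Xva = gmdh_layer(Xtr, Xva, ytr, yva)
print(np.mean((Xva[:, 0] - yva) ** 2))  # validation error of the best remaining unit
```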
The first deep learning multilayer perceptron (MLP) trained by stochastic gradient descent was published in 1967 by Shun'ichi Amari. In computer experiments conducted by Amari's student Saito, a five-layer MLP with two modifiable layers learned internal representations to classify non-linearly separable pattern classes. Subsequent developments in hardware and hyperparameter tuning have made end-to-end stochastic gradient descent the currently dominant training technique.
In 1969, Kunihiko Fukushima introduced the ReLU (rectified linear unit) activation function. The rectifier has since become the most widely used activation function for deep learning.
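The rectifier itself is simply relu(x) = max(0, x), as the one-line definition below shows.

```python
def relu(x):
    # Rectified linear unit: passes positive inputs through and clips negatives to zero.
    return max(0.0, x)

print([relu(v) for v in (-2.0, -0.5, 0.0, 1.5)])  # [0.0, 0.0, 0.0, 1.5]
```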
Nevertheless, research stagnated in the United States following the work of Minsky and Papert (1969), who pointed out that basic perceptrons were incapable of processing the exclusive-or circuit. This insight was irrelevant to the deep networks of Ivakhnenko (1965) and Amari (1967).
Deep learning architectures for convolutional neural networks (CNNs), with convolutional layers, downsampling layers, and weight replication, began with the Neocognitron, introduced by Kunihiko Fukushima in 1979. The Neocognitron, however, was not trained by backpropagation.
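To illustrate the two building blocks mentioned here, the sketch below (a simplification, not Fukushima's original formulation) slides one shared 3x3 weight kernel across an image, which is the weight replication of a convolutional layer, and then downsamples the resulting feature map over 2x2 blocks.

```python
import numpy as np

rng = np.random.default_rng(3)
image = rng.normal(size=(8, 8))
kernel = rng.normal(size=(3, 3))  # one shared set of weights, replicated across positions

def convolve(img, k):
    # Slide the same kernel over every position: this weight replication is what
    # makes the layer convolutional.
    h, w = img.shape[0] - k.shape[0] + 1, img.shape[1] - k.shape[1] + 1
    out = np.zeros((h, w))
    for i in range(h):
        for j in range(w):
            out[i, j] = np.sum(img[i:i + k.shape[0], j:j + k.shape[1]] * k)
    return out

def downsample(feat, size=2):
    # Downsampling layer: keep the maximum of each non-overlapping block.
    h, w = feat.shape[0] // size, feat.shape[1] // size
    return feat[:h * size, :w * size].reshape(h, size, w, size).max(axis=(1, 3))

print(downsample(convolve(image, kernel)).shape)  # (3, 3)
```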
Backpropagation is an efficient application of the chain rule, derived by Gottfried Wilhelm Leibniz in 1673, to networks of differentiable nodes. Although Rosenblatt popularized the term "back-propagating errors" in 1962, he did not know how to implement it, even though Henry J. Kelley had a continuous precursor of backpropagation in 1960 in the context of control theory. The modern form of backpropagation was first published in Seppo Linnainmaa's master's thesis in 1970; G. M. Ostrovski et al. republished it in 1971.
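As a minimal illustration of backpropagation as a chain-rule computation (a present-day sketch, not any of the historical formulations above), the code below computes the gradients of a squared-error loss for a tiny one-hidden-layer network by propagating derivatives backward through each differentiable node.

```python
import numpy as np

rng = np.random.default_rng(4)

# A tiny network: 2 inputs -> 3 hidden tanh units -> 1 linear output.
W1, b1 = rng.normal(size=(3, 2)), np.zeros(3)
W2, b2 = rng.normal(size=(1, 3)), np.zeros(1)

x = np.array([0.5, -1.0])
target = np.array([0.3])

# Forward pass, keeping intermediate values for the backward pass.
z1 = W1 @ x + b1
h = np.tanh(z1)
y = W2 @ h + b2
loss = 0.5 * np.sum((y - target) ** 2)

# Backward pass: apply the chain rule node by node, from the loss back to the weights.
dy = y - target                    # dL/dy
dW2 = np.outer(dy, h)              # dL/dW2
db2 = dy
dh = W2.T @ dy                     # dL/dh
dz1 = dh * (1 - np.tanh(z1) ** 2)  # through the tanh non-linearity
dW1 = np.outer(dz1, x)             # dL/dW1
db1 = dz1

print(loss, dW1.shape, dW2.shape)  # gradients ready for a gradient-descent update
```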