With examples that cover computer vision, NLP and computer graphics, this book aims to become the practical guide for beginners in machine learning to understand the fundamental principles of GANs and learn the efficient applications of PyTorch library. The explanations of underlying mathematics behind the models also deliver in-depth knowledge to the advanced readers who wish to use GANs and PyTorch professionally.
Key Features
Explain the fundamental structures and principles of more than 22 GAN models
Showcase various image/audio/text synthesis and image-to-image/text-to-image translation tasks with GANs
More than just GAN: enriched with presentation of traditional approaches as well as similar deep learning techniques
Book DescriptionComputer vision is one of the most actively researched fields in deep learning. Aside from image classification and object recognition, the tasks of image synthesis have become an intriguing way to attract the attention of beginners in machine learning. GANs have been proved to be successful in generating realistic images over the last few years. As PyTorch is becoming the most popular open-source library in deep learning, a practical guide for building GAN models with PyTorch can be very useful for readers.
In this book, we start with a quick example of building a simple GAN model with pure Python and thorough demonstrations of key features in PyTorch. We dive into a classical GAN model for image synthesis while giving clear explanation of its mechanism, before we move on to more interesting applications including image-to-image translation, image restoration, text synthesis, text-to-image translation, audio synthesis and 3D model generation. We also give thorough explanations of underlying mathematical principles, related traditional approaches and useful techniques in deep learning along the course, which may help advanced readers professionally.What you will learn
Build a simple GAN model with pure Python
Key features in the latest version of PyTorch 2.x
Understand how computational graphs are built for neural networks
Perform image synthesis, image style transfer and image restoration
Generate images based on label prompts or text descriptions
Handle NLP tasks by generating audio and text with GANs
Synthesize texture image and produce 3D models
Perform adversarial attack and data augmentation with GANs
Useful tricks and techniques in deep learning
Who this book is forThis book is for anyone looking to do creative work in machine learning, especially in deep learning. It is required for the reader to have programming experience in Python. Those who are familiar with the concepts of machine learning, deep learning and computer vision may find it much easier to follow our course. Do not worry if you haven't used PyTorch before, because we will explain everything you need to know about PyTorch. It is highly recommended to have access to medium-to-high end NVIDIA graphics card to save you much time waiting for the model results.
Auflage
Sprache
Verlagsort
Editions-Typ
Maße
Höhe: 235 mm
Breite: 191 mm
ISBN-13
978-1-83763-714-0 (9781837637140)
Copyright in bibliographic data and cover images is held by Nielsen Book Services Limited or by the publishers or by their respective licensors: all rights reserved.
Schweitzer Klassifikation
John Hany received his master's degree and bachelor's degree in calculational mathematics at the University of Electronic Science and Technology of China. He majors in pattern recognition and has years of experience in machine learning and computer vision. He has taken part in several practical projects, including intelligent transport systems and facial recognition systems. His current research interests lie in reducing the computation costs of deep neural networks while improving their performance on image classification and detection tasks. He is enthusiastic about open source projects and has contributed to many of them. Shuai Yan is a Computer Vision engineer. As an engineer in the field of deep learning, he has rich work experience, and he has made contributions to many outstanding enterprises dedicated to deep learning. He has in-depth knowledge and rich engineering experience in related fields.
Table of Contents
Product Information Document
Generative Adversarial Networks Fundamentals
Getting Started with PyTorch
Building your first GAN with PyTorch
Interactively generating images via Conditional GAN
Image-to-image translation and its applications
Image restoration with GANs
Attack and improve classification models with GANs
Image generation from description text
Sequence synthesis with GANs
GANs for computer graphics
VAE, Vision Transformer and Diffusion models