
Mastering New Age Computer Vision
Advanced techniques in computer vision object detection, segmentation, and deep learning
Zonunfeli Ralte(Author)
BPB Publications (Publisher)
Published on 19. February 2025
Book
Paperback/Softback
426 pages
978-93-6589-840-8 (ISBN)
Description
DESCRIPTION
Mastering New Age Computer Vision is a comprehensive guide that explores the latest advancements in computer vision, a field that is enabling machines to not only see but also understand and interpret the visual world in increasingly sophisticated ways, guiding you from foundational concepts to practical applications.
This book explores cutting-edge computer vision techniques, starting with zero-shot and few-shot learning, DETR, and DINO for object detection. It covers advanced segmentation models like Segment Anything and Vision Transformers, along with YOLO and CLIP. Using PyTorch, readers will learn image regression, multi-task learning, multi-instance learning, and deep metric learning. Hands-on coding examples, dataset preparation, and optimization techniques help apply these methods in real-world scenarios. Each chapter tackles key challenges, introduces architectural innovations, and improves performance in object detection, segmentation, and vision-language tasks.
By the time you have turned the final page of this book, you will be a confident computer vision practitioner, armed with a comprehensive grasp of core principles and the ability to apply cutting-edge techniques to solve real-world problems. You will be prepared to develop innovative solutions across a broad spectrum of computer vision challenges, actively contributing to the ongoing advancements in this dynamic field.
WHAT YOU WILL LEARN
¿ Use PyTorch for both basic and advanced image processing.
¿ Build object detection models using CNNs and modern frameworks.
¿ Apply multi-task and multi-instance learning to complex datasets.
¿ Develop segmentation models, including panoptic segmentation.
¿ Improve feature representation with metric learning and bilinear pooling.
¿ Explore transformers and self-supervised learning for computer vision.
WHO THIS BOOK IS FOR
This book is for data scientists, AI practitioners, and researchers with a basic understanding of Python programming and ML concepts. Familiarity with deep learning frameworks like PyTorch and foundational knowledge of computer vision will help readers fully grasp the advanced techniques discussed.
More details
Language
English
Place of publication
New Delhi
India
Dimensions
Height: 235 mm
Width: 191 mm
Thickness: 23 mm
Weight
795 gr
ISBN-13
978-93-6589-840-8 (9789365898408)
Copyright in bibliographic data and cover images is held by Nielsen Book Services Limited or by the publishers or by their respective licensors: all rights reserved.
Schweitzer Classification