
Learning OpenCV 5 Computer Vision with Python
Tackle computer vision and machine learning with the newest tools, techniques and algorithms
Packt Publishing
4th Edition
Will be published approx. on 12 June 2026
Book
Paperback/Softback
978-1-80323-022-1 (ISBN)
Description
Updated for OpenCV 5, this book covers the latest on depth cameras, 3D navigation, deep neural networks, and Cloud computing, helping you solve real-world computer vision problems with practical code
Key Features
Build powerful computer vision applications in concise code with OpenCV 5 and Python 3
Learn the fundamental concepts of image processing, object classification, and 2D and 3D tracking
Train, use, and understand machine learning models, and deploy them in the Cloud
Book Description
Computer vision is a rapidly evolving science in the field of artificial intelligence, encompassing diverse use cases and techniques. This book will not only help those who are getting started with computer vision but also experts in the domain. You'll be able to put theory into practice by building apps with OpenCV 5 and Python 3.
You'll start by setting up OpenCV 5 with Python 3 on various platforms. Next, you'll learn how to perform basic operations such as reading, writing, manipulating, and displaying images, videos, and camera feeds. The book then takes you through image processing, video analysis, depth estimation, and segmentation, with hands-on practice that includes building a GUI app. You'll tackle two popular challenges: face detection and face recognition. You'll also learn about object classification and machine learning, which will enable you to create and use object detectors and even track moving objects in real time. Later, you'll develop your skills in augmented reality and real-world 3D navigation. Finally, you'll cover artificial neural networks (ANNs) and deep neural networks (DNNs), learning how to develop apps for recognizing handwritten digits and classifying a person's gender and age, and you'll deploy your solutions to the Cloud.
By the end of this book, you'll have the skills you need to execute real-world computer vision projects.
What you will learn
Install and familiarize yourself with OpenCV 5's Python 3 bindings
Understand image processing and video analysis
Use a depth camera to distinguish foreground and background regions
Detect and identify objects, and track their motion in videos
Train and use your own models to match images and classify objects
Detect and recognize faces, and classify their gender and age
Build augmented reality applications, and navigate the real 3D world
Train neural networks and deploy them as Cloud-based solutions
Who this book is for
This OpenCV book is a good fit for Python programmers who want to get started with computer vision and machine learning. It will also be useful for computer vision and AI/ML developers who want to expand their OpenCV skills, as well as experts who want to stay up to date with OpenCV 5.
More details
Edition
4th Revised edition
Language
English
Place of publication
Birmingham
United Kingdom
Target group
Professional and scholarly
Edition type
Revised edition
Dimensions
Height: 235 mm
Width: 191 mm
ISBN-13
978-1-80323-022-1 (9781803230221)
Persons
Joseph Howse lives in a Canadian fishing village, where he chats with his cats, crafts his books, and nurtures an orchard of hardy fruit trees. He is President of Nummist Media Corporation, which exists to support his books and to provide mentoring and consulting services, with a specialty in computer vision. On average, from 2015 to 2022, Joseph wrote 1.4 new books or new editions per year for Packt. He also writes fiction, including an upcoming novel about the lives of a group of young people in the last days of the Soviet Union.
Joe Minichino is an R&D labs engineer at Teamwork. He is a passionate programmer who is immensely curious about programming languages and technologies and constantly experiments with them. Born and raised in Varese, Lombardy, Italy, and coming from a humanistic background in philosophy (at Milan's Università Statale), Joe has lived in Cork, Ireland, since 2004. There, he graduated in computer science from the Cork Institute of Technology.
Content
Table of Contents
Setting Up OpenCV
Handling Files, Cameras, and GUIs
Processing Images with OpenCV
Depth Estimation and Segmentation
Detecting and Recognizing Faces
Retrieving Images and Searching Using Image Descriptors
Building Custom Object Detectors
Tracking Objects
Camera Models and Augmented Reality
3D Reconstruction and Navigation
Neural Networks with OpenCV - an Introduction
OpenCV Applications at Scale
Appendix A: Bending Color Space with the Curves Filter