
Deep Learning for Multimedia Processing Applications
Description
The work is divided into two volumes. Volume One begins by introducing the fundamental concepts of deep learning, giving readers a solid foundation for understanding its relevance to multimedia processing. Readers will discover how deep learning techniques enable accurate and efficient image recognition, object detection, semantic segmentation, and image synthesis. The book also covers video analysis techniques, including action recognition, video captioning, and video generation, highlighting the role of deep learning in extracting meaningful information from videos.
Furthermore, the book explores audio processing tasks such as speech recognition, music classification, and sound event detection using deep learning models. It demonstrates how deep learning algorithms can effectively process audio data, opening up new possibilities in multimedia applications. Lastly, the book explores the integration of deep learning with natural language processing techniques, enabling systems to understand, generate, and interpret textual information in multimedia contexts.
Throughout the book, practical examples, code snippets, and real-world case studies are provided to help readers gain hands-on experience in implementing deep learning solutions for multimedia processing. Deep Learning for Multimedia Processing Applications is an essential resource for anyone interested in harnessing the power of deep learning to unlock the vast potential of multimedia data.
Persons
Dr. Jingbing Li is a professor, doctoral supervisor, and vice president of the Hainan Provincial Invention Association. He has been awarded the honorary titles of Leading Talent in Hainan Province, Famous Teacher in Hainan Province, Outstanding Young and Middle-aged Backbone Teacher in Hainan Province, and Baosteel Excellent Teacher. He has also won the second prize of the Hainan Provincial Science and Technology Progress Award three times (twice as first contributor and once as second contributor). He holds 13 granted national invention patents, has published 5 monographs, including one on medical image digital watermarking, and has published more than 80 SCI/EI-indexed academic papers (including 22 SCI-indexed papers) as first author or corresponding author. He has led two projects of the National Natural Science Foundation of China and five projects under Hainan Province's key research and development program and its international scientific and technological cooperation program.
Dr. Huang Mengxing is dean of the School of Information at Hainan University. He has held many roles, including leader of the "Smart Service" talent team, chief scientist of the National Key R&D Program, member of the Expert Committee of Artificial Intelligence and Blockchain of the Science and Technology Committee of the Ministry of Education, executive director of the Postgraduate Education Branch of the China Electronics Education Society, and a position on the Computer Professional Teaching Committee of the Ministry of Education. His main research areas include big data and intelligent information processing, multi-source information perception and fusion, and artificial intelligence and intelligent services. In recent years, he has published more than 230 academic papers as first author or corresponding author, obtained 36 granted national invention patents and 96 software copyrights, published 4 monographs, and translated 2 books. He has won both first prize and second prize in the Hainan Provincial Science and Technology Progress Award as first contributor, as well as two Hainan Provincial Excellent Teaching Achievement Awards and Excellent Teacher Awards. He has led or taken part in more than 30 national, provincial, and ministerial-level projects, including national key research and development plan projects, national science and technology support plans, and National Natural Science Foundation projects.
Sibghat Ullah Bazai completed his undergraduate and graduate studies in computer engineering at the Balochistan University of Information Technology, Engineering, and Management Sciences (BUITEMS) in Quetta, Pakistan. He received his PhD (IT) in cybersecurity from Massey University in Auckland, New Zealand, in 2020. His research interests include applied cybersecurity, disease identification with deep learning, exam automation with natural language processing, the development of local-language sentiment datasets, and smart city planning. Sibghat serves as a guest editor and reviewer for special issues of several journals, including MDPI, Hindawi, CMC, PLoS One, Frontier, and others.
Muhammad Aamir received a bachelor of engineering degree in computer systems engineering from Mehran University of Engineering & Technology, Jamshoro, Sindh, Pakistan, in 2008, a master of engineering degree in software engineering from Chongqing University, China, in 2014, and a PhD in computer science and technology from Sichuan University, Chengdu, China, in 2019. He is currently an associate professor at the Department of Computer, Huanggang Normal University, China. His main research interests include pattern recognition, computer vision, image processing, deep learning, and fractional calculus.
System requirements
File format: PDF
Copy-Protection: Adobe-DRM (Digital Rights Management)
System requirements:
- Computer (Windows; MacOS X; Linux): Install the free reader Adobe Digital Editions prior to download (see eBook Help).
- Tablet/smartphone (Android; iOS): Install the free app Adobe Digital Editions or the app PocketBook before downloading (see eBook Help).
- E-reader: Bookeen, Kobo, Pocketbook, Sony, Tolino and many more (Kindle: limited support only).
The file format PDF always displays a book page identically on any hardware. This makes PDF suitable for complex layouts such as those used in textbooks and reference books (images, tables, columns, footnotes). Unfortunately, on the small screens of e-readers or smartphones, PDFs are rather annoying, requiring too much scrolling.
This eBook uses Adobe-DRM, a "hard" copy protection. If the necessary requirements are not met, unfortunately you will not be able to open the eBook. You will therefore need to prepare your reading hardware before downloading.
Please note: We strongly recommend that you authorise any reading software with your personal Adobe ID after installing it.
For more information, see our eBook Help page.