
Podbor izobrazhenij s pomosch'ü glubokogo obucheniq (Image Selection Using Deep Learning)
Sciencia Scripts (Publisher)
Published on 19 June 2024
Book
Paperback/Softback
64 pages
978-620-7-66102-2 (ISBN)
Description
Generating image captions with the help of audio has become a challenging but promising task in deep learning. This work proposes a new approach to the task that combines convolutional neural networks (CNNs) for image feature extraction with recurrent neural networks (RNNs) for sequential analysis of audio. Specifically, we use pre-trained CNNs such as VGG to extract visual features from images, and spectrogram representations together with RNNs such as LSTM or GRU to process the audio inputs. The proposed model draws not only on the visual content of the images but also on the accompanying audio signals. We evaluate the model on benchmark datasets and demonstrate its effectiveness at generating coherent, contextually relevant captions for images with corresponding audio inputs. We also analyze the contribution of each modality to overall captioning performance. Our results show that combining the visual and auditory modalities significantly improves caption quality compared with using either modality alone.
More details
Language
Other
Product notice
Paperback (trade)
Unsewn / adhesive bound
Dimensions
Height: 220 mm
Width: 150 mm
Thickness: 4 mm
Weight
113 g
ISBN-13
978-620-7-66102-2 (9786207661022)
Copyright in bibliographic data and cover images is held by Nielsen Book Services Limited or by the publishers or by their respective licensors: all rights reserved.
Persons
I, Mrs. K. Kanchana, work as an associate professor in the Department of Computer Science and Engineering at Katir College of Engineering. My interests are machine learning and deep learning.