MultiMedia Modeling

Name: MultiMedia Modeling | 31st International Conference on Multimedia Modeling, MMM 2025, Nara, Japan, January 8-10, 2025, Proceedings, Part II
Brand: Springer
Price: 74.89 EUR
Availability: OnlineOnly

31st International Conference on Multimedia Modeling, MMM 2025, Nara, Japan, January 8-10, 2025, Proceedings, Part II

Ichiro Ide Ioannis Kompatsiaris Changsheng Xu Keiji Yanai Wei-Ta Chu Naoko Nitta Michael Riegler Toshihiko Yamasaki(Editor)

Springer (Publisher)

Published on 30. December 2024

XX, 455 pages

E-Book

PDF with digital watermarking

System requirements

978-981-96-2061-6 (ISBN)

€74.89incl. 7% vat

System requirements

for PDF with digital watermarking

E-Book Single Licence

Available for download

Description

More details

Other editions

Content

Regular Papers.- gFlow: Distributed Real-Time Reverse Remote Rendering System Model.- Grounding Deliberate Reasoning in Multimodal Large Language Models.- GWUNet: A UNet with Gated Attention and Improved Wavelet Transform for Thyroid Nodules Segmentation.- HCV: Lightweight Hybrid CNN-Vision Transformer for Visual Object Tracking.-HierArtEx: Hierarchical Representations and Art Experts Supporting the Retrieval of Museums in the Metaverse.- Hybrid Scalable Video Coding with Neural Compression and Enhancement for Streaming Media.- Hyper-NeuS:Hypernetworks for Neural SDF Implicit Surface Reconstruction by Volume Rendering.- Image-Generation AI Model Retrieval by Contrastive Learning-based Style Distance Calculation.- Improving Singing Voice Transcription Generalization with AI Generated Accompaniments.- Infrared Small Target Detection with Feature Refinement and Context Enhancement.- Innovative Lifelog Visualization and Exploration in Virtual Reality - A Comparative Study.- Integrating S1&S2 Framework for Enhanced Semantic Match in Person Re-identification.- Intra-Class Compact Facial Expression Recognition Based on Amplitude Phase Separation.- Joint Decision Network with Modality-Specific and Dual Interactive Features for Fake News Detection.- Kiite World: Socializing Map-Based Music Exploration Through Playlist Sharing and Synchronized Listening.- KuzushijiDiffuser: Japanese Kuzushiji Font Generation with FontDiffuser.- LIESA: Low-light Image Enhancement with Semantic Awareness.- Lightweight Dual Grouped Large-Kernel Convolutions for Salient Object Detection Network.- Lightweight Motion-Aware Video Super-Resolution for Compressed Videos.- LITA: LMM-guided Image-Text Alignment for Art Assessment.- LLMs-based Augmentation for Domain Adaptation in Long-tailed Food Datasets.- Making strides Security in Multimodal Fake News Detection Models: A Comprehensive Analysis of Adversarial Attacks.- MambaTalk: Speech-driven 3D Facial Animation with Mamba.- MC-YOLO: Multi-scale Transmission Line Defect Target Recognition Network.- MDT-Net: A Mask Decoder Tuning Strategy for CLIP-based Zero-shot 3D Classification.- MICAN: Multi-modal Inconsistency-based Cooperation Attention Network for Fake News Detection.- MineTinyNet-YOLO: An Efficient Small Object Detection Method for Complex Underground Coal Mine Scenarios.- Mix-YOLONet: Deep Image Dehazing for Improving Object Detection.- MKSNet: Advanced Small Object Detection in Remote Sensing Imagery with Multi-Kernel and Dual Attention Mechanisms.- MLP-AMDC: A MLP Architecture for Adaptive-Mask-based Dual-Camera Snapshot Hyperspectral Imaging.- MM-CARP: Multimodal Model with Cross-modal Retrieval-Augmented and Visual Region Perception.- Modality-Specific Hashing: Transform Cross-Modal Retrieval into Single-Modal Retrieval.

System requirements

Save as PDF Copy link into clipboard

Schweitzer Fachinformationen

MultiMedia Modeling

Description

More details

Other editions

Additional editions

Content

System requirements