Computer Vision - ECCV 2024

Name: Computer Vision - ECCV 2024 | 18th European Conference, Milan, Italy, September 29-October 4, 2024, Proceedings, Part XLV
Brand: Springer
Price: 78.1 EUR
Availability: OnlineOnly

18th European Conference, Milan, Italy, September 29-October 4, 2024, Proceedings, Part XLV

Ales Leonardis Elisa Ricci Stefan Roth Olga Russakovsky Torsten Sattler Gül Varol(Herausgeber*in)

Springer (Verlag)

Erschienen am 23. November 2024

LXXXV, 495 Seiten

E-Book

PDF mit Wasserzeichen-DRM

Systemvoraussetzungen

978-3-031-72995-9 (ISBN)

78,10 €inkl. 7% MwSt.

Systemvoraussetzungen

für PDF mit Wasserzeichen-DRM

E-Book Einzellizenz

Als Download verfügbar

Beschreibung

Weitere Details

Weitere Ausgaben

Inhalt

KFD-NeRF: Rethinking Dynamic NeRF with Kalman Filter.- Physical-Based Event Camera Simulator.- V-IRL: Grounding Virtual Intelligence in Real Life.- Adversarial Prompt Tuning for Vision-Language Models.- Relightable 3D Gaussians: Realistic Point Cloud Relighting with BRDF Decomposition and Ray Tracing.- Mono-ViFI: A Unified Learning Framework for Self-supervised Single- and Multi-frame Monocular Depth Estimation.- CC-SAM: Enhancing SAM with Cross-feature Attention and Context for Ultrasound Image Segmentation.- An Efficient and Effective Transformer Decoder-Based Framework for Multi-Task Visual Grounding.- Think2Drive: Efficient Reinforcement Learning by Thinking with Latent World Model for Autonomous Driving (in CARLA-v2).- PanGu-Draw: Advancing Resource-Efficient Text-to-Image Synthesis with Time-Decoupled Training and Reusable Coop-Diffusion.- X-InstructBLIP: A Framework for Aligning Image, 3D, Audio, Video to LLMs and its Emergent Cross-modal Reasoning.- Learning Neural Volumetric Pose Features for Camera Localization.- Betrayed by Attention: A Simple yet Effective Approach for Self-supervised Video Object Segmentation.- REFRAME: Reflective Surface Real-Time Rendering for Mobile Devices.- Self-Training Room Layout via Geometry-aware Ray-casting.- Closed-Loop Unsupervised Representation Disentanglement with $\beta$-VAE Distillation and Diffusion Probabilistic Feedback.- Rethinking Weakly-supervised Video Temporal Grounding From a Game Perspective.- Every Pixel Has its Moments: Ultra-High-Resolution Unpaired Image-to-Image Translation via Dense Normalization.- ZoLA: Zero-Shot Creative Long Animation Generation with Short Video Model.- Parameter-Efficient and Memory-Efficient Tuning for Vision Transformer: A Disentangled Approach.- Restore Anything with Masks: Leveraging Mask Image Modeling for Blind All-in-One Image Restoration.- When Fast Fourier Transform Meets Transformer for Image Restoration.- Dolphins: Multimodal Language Model for Driving.- Rethinking Video Deblurring with Wavelet-Aware Dynamic Transformer and Diffusion Model.- CamoTeacher: Dual-Rotation Consistency Learning for Semi-Supervised Camouflaged Object Detection.- Placing Objects in Context via Inpainting for Out-of-distribution Segmentation.- Textual Grounding for Open-vocabulary Visual Information Extraction in Layout-diversified Documents.

Systemvoraussetzungen

Als PDF speichern Als Link merken