Computer Vision - ECCV 2024 : 18th European Conference, Milan, Italy, September 29-October 4, 2024, Proceedings, Part LVI

Click to enlarge

ISBN No.:	9783031729911
Pages:	lxxxv, 499
Year:	202410
Format:	Trade Paper
Price:	$ 110.39
Dispatch delay:	Dispatched between 7 to 15 days
Status:	Available

Qty: Add to Cart

Synopsis
Table of contents
Full Details

HowToCaption: Prompting LLMs to Transform Video Annotations at Scale.- LabelDistill: Label-guided Cross-modal Knowledge Distillation for Camera-based 3D Object Detection.- Beyond the Data Imbalance: Employing the Heterogeneous Datasets for Vehicle Maneuver Prediction.- On Pretraining Data Diversity for Self-Supervised Learning.- Look Around and Learn: Self-Training Object Detection by Exploration.- Bayesian Self-Training for Semi-Supervised 3D Segmentation.- Motion and Structure from Event-based Normal Flow.- ParCo: Part-Coordinating Text-to-Motion Synthesis.

- Learning to Complement and to Defer to Multiple Users.- Tiny Models are the Computational Saver for Large Models.- DragVideo: Interactive Drag-style Video Editing.- Multi-Sentence Grounding for Long-term Instructional Video.- Do Generalised Classifiers really work on Human Drawn Sketches?.- KMTalk: Speech-Driven 3D Facial Animation with Key Motion Embedding.- Head360: Learning a Parametric 3D Full-Head for Free-View Synthesis in 360°.- MotionDirector: Motion Customization of Text-to-Video Diffusion Models.

- Text2LiDAR: Text-guided LiDAR Point Clouds Generation via Equirectangular Transformer.- Enhanced Motion Forecasting with Visual Relation Reasoning.- Rate-Distortion-Cognition Controllable Versatile Neural Image Compression.- Temporal As a Plugin: Unsupervised Video Denoising with Pre-Trained Image Denoisers.- LiDAR-based All-weather 3D Object Detection via Prompting and Distilling 4D Radar.- MM-SafetyBench: A Benchmark for Safety Evaluation of Multimodal Large Language Models.- Post-training Quantization with Progressive Calibration and Activation Relaxing for Text-to-Image Diffusion Models.- Scene Coordinate Reconstruction: Posing of Image Collections via Incremental Learning of a Relocalizer.

- Diffusion Models are Geometry Critics: Single Image 3D Editing Using Pre-Trained Diffusion Priors.- Weakly Supervised Co-training with Swapping Assignments for Semantic Segmentation.- StoryImager: A Unified and Efficient Framework for Coherent Story Visualization and Completion.

To be able to view the table of contents for this publication then please subscribe by clicking the button below...

To be able to view the full description for this publication then please subscribe by clicking the button below...