Browse Subject Headings
Computer Vision - ECCV 2024 : 18th European Conference, Milan, Italy, September 29-October 4, 2024, Proceedings, Part L
Computer Vision - ECCV 2024 : 18th European Conference, Milan, Italy, September 29-October 4, 2024, Proceedings, Part L
Click to enlarge
ISBN No.: 9783031729720
Pages: lxxxv, 485
Year: 202412
Format: Trade Paper
Price: $ 110.39
Dispatch delay: Dispatched between 7 to 15 days
Status: Available (Forthcoming)

Revisit Human-Scene Interaction via Space Occupancy.- Face Adapter for Pre-Trained Diffusion Models with Fine-Grained ID and Attribute Control.- WeConvene: Learned Image Compression with Wavelet-Domain Convolution and Entropy Model.- Grid-Attention: Enhancing Computational Efficiency of Large Vision Models without Fine-Tuning.- Mitigating Background Shift in Class-Incremental Semantic Segmentation.- Relation DETR: Exploring Explicit Position Relation Prior for Object Detection.- BKDSNN: Enhancing the Performance of Learning-based Spiking Neural Networks Training with Blurred Knowledge Distillation.- Agent Attention: On the Integration of Softmax and Linear Attention.


- Learning by Aligning 2D Skeleton Sequences and Multi-Modality Fusion.- Resolving Scale Ambiguity in Multi-view 3D Reconstruction using Dual-Pixel Sensors.- Object-Oriented Anchoring and Modal Alignment in Multimodal Learning.- Towards Stable 3D Object Detection.- FYI: Flip Your Images for Dataset Distillation.- On-the-fly Category Discovery for LiDAR Semantic Segmentation.- Dual-Camera Smooth Zoom on Mobile Phones.- ProtoComp: Diverse Point Cloud Completion with Controllable Prototype.


- CONDA: Condensed Deep Association Learning for Co-Salient Object Detection.- Cascade Prompt Learning for Visual-Language Model Adaptation.- PolyRoom: Room-aware Transformer for Floorplan Reconstruction.- BenchLMM: Benchmarking Cross-style Visual Capability of Large Multimodal Models.- SMFANet: A Lightweight Self-Modulation Feature Aggregation Network for Efficient Image Super-Resolution.- HENet: Hybrid Encoding for End-to-end Multi-task 3D Perception from Multi-view Cameras.- Hierarchical Unsupervised Relation Distillation for Source Free Domain Adaptation.- Customized Generation Reimagined: Fidelity and Editability Harmonized.


- AUFormer: Vision Transformers are Parameter-Efficient Facial Action Unit Detectors.- Improving Video Segmentation via Dynamic Anchor Queries.- Controllable Contextualized Image Captioning: Directing the Visual Narrative through User-Defined Highlights.


To be able to view the table of contents for this publication then please subscribe by clicking the button below...
To be able to view the full description for this publication then please subscribe by clicking the button below...
Browse Subject Headings