Elevating All Zero-Shot Sketch-Based Image Retrieval Through Multimodal Prompt Learning.- Improving Knowledge Distillation via Regularizing Feature Direction and Norm.- 3DFG-PIFu: 3D Feature Grids for Human Digitization from Sparse Views.- Lazy Diffusion Transformer for Interactive Image Editing.- Non-parametric Sensor Noise Modeling and Synthesis.- Stripe Observation Guided Inference Cost-free Attention Mechanism.- The Nerfect Match: Exploring NeRF Features for Visual Localization.- ComboVerse: Compositional 3D Assets Creation Using Spatially-Aware Diffusion Guidance.
- Robust Calibration of Large Vision-Language Adapters.- Leveraging Hierarchical Feature Sharing for Efficient Dataset Condensation.- Improving Domain Generalization in Self-Supervised Monocular Depth Estimation via Stabilized Adversarial Training.- milliFlow: Scene Flow Estimation on mmWave Radar Point Cloud for Human Motion Sensing.- denoiSplit: a method for joint microscopy image splitting and unsupervised denoising.- AugDETR: Improving Multi-scale Learning for Detection Transformer.- Spherical World-Locking for Audio-Visual Localization in Egocentric Videos.- SPIN: Hierarchical Segmentation with Subpart Granularity in Natural Images.
- SIGMA: Sinkhorn-Guided Masked Video Modeling.- Generative Camera Dolly: Extreme Monocular Dynamic Novel View Synthesis.- Distribution Alignment for Fully Test-Time Adaptation with Dynamic Online Data Streams.- Divide and Fuse: Body Part Mesh Recovery from Partially Visible Human Images.- Understanding Physical Dynamics with Counterfactual World Modeling.- MIGS: Multi-Identity Gaussian Splatting via Tensor Decomposition.- 4Diff: 3D-Aware Diffusion Model for Third-to-First Viewpoint Translation.- Improving Point-based Crowd Counting and Localization Based on Auxiliary Point Guidance.
- Nymeria: A Massive Collection of Egocentric Multi-modal Human Motion in the Wild.- DreamStruct: Understanding Slides and User Interfaces via Synthetic Data Generation.- SemTrack: A Large-scale Dataset for Semantic Tracking in the Wild.