Visual Prompting via Partial Optimal Transport.- Modelling Competitive Behaviors in Autonomous Driving Under Generative World Model.- Tendency-driven Mutual Exclusivity for Weakly Supervised Incremental Semantic Segmentation.- AdaCLIP: Adapting CLIP with Hybrid Learnable Prompts for Zero-Shot Anomaly Detection.- Pathformer3D: A 3D Scanpath Transformer for 360° Images.- TransFusion -- A Transparency-Based Diffusion Model for Anomaly Detection.- SparseLIF: High-Performance Sparse LiDAR-Camera Fusion for 3D Object Detection.- 3D Gaussian Parametric Head Model.
- RING-NeRF : Rethinking Inductive Biases for Versatile and Efficient Neural Fields.- Platypus: A Generalized Specialist Model for Reading Text in Various Forms.- Structured-NeRF: Hierarchical Scene Graph with Neural Representation.- EGIC: Enhanced Low-Bit-Rate Generative Image Compression Guided by Semantic Segmentation.- Plug-and-Play Learned Proximal Trajectory for 3D Sparse-View X-Ray Computed Tomography.- PPAD: Iterative Interactions of Prediction and Planning for End-to-end Autonomous Driving.- Test-Time Stain Adaptation with Diffusion Models for Histopathology Image Classification.- Beyond MOT: Semantic Multi-Object Tracking.
- Temporal Event Stereo via Joint Learning with Stereoscopic Flow.- SAM-COD: SAM-guided Unified Framework for Weakly-Supervised Camouflaged Object Detection.- Just a Hint: Point-Supervised Camouflaged Object Detection.- ManiGaussian: Dynamic Gaussian Splatting for Multi-task Robotic Manipulation.- Global-Local Collaborative Inference with LLM for Lidar-Based Open-Vocabulary Detection.- Learning High-resolution Vector Representation from Multi-Camera Images for 3D Object Detection.- View-Consistent 3D Editing with Gaussian Splatting.- E3V-K5: An Authentic Benchmark for Redefining Video-Based Energy Expenditure Estimation.
- GeoGaussian: Geometry-aware Gaussian Splatting for Scene Rendering.- URS-NeRF: Unordered Rolling Shutter Bundle Adjustment for Neural Radiance Fields.