YOLOv9: Learning What You Want to Learn Using Programmable Gradient Information.- Unsupervised Multi-modal Medical Image Registration via Invertible Translation.- Functional Transform-Based Low-Rank Tensor Factorization for Multi-Dimensional Data Recovery.- CRM: Single Image to 3D Textured Mesh with Convolutional Reconstruction Model.- Domain Reduction Strategy for Non-Line-of-Sight Imaging.- HPE-Li: WiFi-enabled Lightweight Dual Selective Kernel Convolution for Human Pose Estimation.- Cut out the Middleman: Revisiting Pose-based Gait Recognition.- HiEI: A Universal Framework for Generating High-quality Emerging Images from Natural Images.
- High-Precision Self-Supervised Monocular Depth Estimation with Rich-Resource Prior.- SGS-SLAM: Semantic Gaussian Splatting For Neural Dense SLAM.- View Selection for 3D Captioning via Diffusion Ranking.- OmniSSR: Zero-shot Omnidirectional Image Super-Resolution using Stable Diffusion Model.- UDiffText: A Unified Framework for High-quality Text Synthesis in Arbitrary Images via Character-aware Diffusion Models.- Confidence Self-Calibration for Multi-Label Class-Incremental Learning.- OMG: Occlusion-friendly Personalized Multi-concept Generation in Diffusion Models.- Versatile Incremental Learning: Towards Class and Domain-Agnostic Incremental Learning.
- WeCromCL: Weakly Supervised Cross-Modality Contrastive Learning for Transcription-only Supervised Text Spotting.- An Incremental Unified Framework for Small Defect Inspection.- Enhancing Optimization Robustness in 1-bit Neural Networks through Stochastic Sign Descent.- Temporally Consistent Stereo Matching.- A Rotation-invariant Texture ViT for Fine-Grained Recognition of Esophageal Cancer Endoscopic Ultrasound Images.- BI-MDRG: Bridging Image History in Multimodal Dialogue Response Generation.- Adapting Fine-Grained Cross-View Localization to Areas without Fine Ground Truth.- BeNeRF:Neural Radiance Fields from a Single Blurry Image and Event Stream.
- Human Motion Forecasting in Dynamic Domain Shifts: A Homeostatic Continual Test-time Adaptation Framework.- CloudFixer: Test-Time Adaptation for 3D Point Clouds via Diffusion-Guided Geometric Transformation.- DreamDiffusion: High-Quality EEG-to-Image Generation with Temporal Masked Signal Modeling and CLIP Alignment.