    Repositories list

    • speed3r

      Public
      [CVPR 2026 Findings] Speed3R: Sparse Feed-forward 3D Reconstruction Models
      Python
      BSD 3-Clause "New" or "Revised" License
      Updated Mar 5, 2026
    • SPoT

      Public
      Official code for paper "Surgical Post-Training: Cutting Errors, Keeping Knowledge"
      Python
      Updated Mar 5, 2026
    • ICE

      Public
      [CVPR2025 Highlight] ICE: Intrinsic Concept Extraction from a Single Image via Diffusion Models
      Python
      Updated Mar 3, 2026
    • Updated Feb 6, 2026
    • Pancap

      Public
      [NeurIPS 2025] Panoptic Captioning: An Equivalence Bridge for Image and Text
      Python
      MIT License
      Updated Jan 31, 2026
    • LooC

      Public
      LooC: Effective Low-Dimensional Codebook for Compositional Vector Quantization
      Python
      MIT License
      Updated Jan 7, 2026
    • JavaScript
      Updated Jan 4, 2026
    • SEAL

      Public
      [NeurIPS 2025] SEAL: Semantic-Aware Hierarchical Learning for Generalized Category Discovery
      Python
      Updated Dec 27, 2025
    • JoVA

      Public
      JoVA: Unified Multimodal Learning for Joint Video-Audio Generation
      Updated Dec 22, 2025
    • Fin3R

      Public
      [NeurIPS 2025] Fin3R: Fine-tuning Feed-forward 3D Reconstruction Models via Monocular Knowledge Distillation
      Python
      Other
      Updated Dec 18, 2025
    • [TPAMI 2025] Semantic Correspondence: Unified Benchmarking and a Strong Baseline
      Python
      Updated Dec 11, 2025
    • A collection of papers on semantic correspondence, organized by year.
      Updated Dec 10, 2025
    • 3DRS

      Public
      [NeurIPS 2025] 3DRS: MLLMs Need 3D-Aware Representation Supervision for Scene Understanding
      Python
      Apache License 2.0
      Updated Dec 9, 2025
    • [ICCV 2025] Inpaint4Drag: Repurposing Inpainting Models for Drag-Based Image Editing via Bidirectional Warping
      Python
      Apache License 2.0
      Updated Nov 30, 2025
    • HypCD

      Public
      [CVPR 2025] Hyperbolic Category Discovery
      Python
      MIT License
      Updated Nov 7, 2025
    • DebGCD

      Public
      [ICLR 2025] DebGCD: Debiased Learning with Distribution Guidance for Generalized Category Discovery
      Python
      MIT License
      Updated Sep 27, 2025
    • Mr.DETR

      Public
      [CVPR 2025] Mr. DETR: Instructive Multi-Route Training for Detection Transformers
      Python
      MIT License
      Updated Sep 6, 2025
    • GAMEBoT

      Public
      [ACL 2025] GAMEBoT: Transparent Assessment of LLM Reasoning in Games
      Python
      Updated Aug 17, 2025
    • HiLo

      Public
      [ICLR2025] HiLo: A Learning Framework for Generalized Category Discovery Robust to Domain Shifts
      Python
      Updated Aug 1, 2025
    • PruneVid

      Public
      [ACL 2025] PruneVid: Visual Token Pruning for Efficient Video Large Language Models
      Python
      Updated May 15, 2025
    • SPTNet

      Public
      [ICLR2024] SPTNet: An Efficient Alternative Framework for Generalized Category Discovery with Spatial Prompt Tuning
      Python
      Other
      Updated Apr 9, 2025
    • v-CLR

      Public
      [CVPR 2025 Highlight] v-CLR: View-Consistent Learning for Open-World Instance Segmentation
      Python
      MIT License
      Updated Apr 7, 2025
    • PromptCCD

      Public
      [ECCV2024] PromptCCD: Learning Gaussian Mixture Prompt Pool for Continual Category Discovery
      Python
      Updated Apr 3, 2025
    • FROSTER

      Public
      [ICLR 2024] FROSTER: Frozen CLIP is a Strong Teacher for Open-Vocabulary Action Recognition
      Python
      Other
      Updated Jan 14, 2025
    • [ECCV2024] RegionDrag: Fast Region-Based Image Editing with Diffusion Models
      Python
      Updated Oct 9, 2024
    • SCD

      Public
      [CVPRW2024] What’s in a Name? Beyond Class Indices for Image Recognition
      Python
      Updated Aug 30, 2024
    • Dissect-OOD-OSR

      Public
      [IJCV 2024] Dissecting Out-of-Distribution Detection and Open-Set Recognition: A Critical Analysis of Methods and Benchmarks
      Python
      Updated Aug 30, 2024