All repositories Change the repository type filter All Repositories list Collect some papers about transformer for detection and segmentation. Awesome Detection Transformer for Computer Vision (CV)
[ECCV 2024] Official implementation of the paper "UniPose : Detecting Any Keypoints"
• • 11• 267• 11• 1• Updated Jul 2, 2024 Jul 2, 2024 [ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
[Arxiv-2024] MotionLLM: Understanding Human Behaviors from Human Motions and Videos
• • 4• 181• 3• 0• Updated Jun 28, 2024 Jun 28, 2024 detrex is a research platform for DETR-based object detection, segmentation, pose estimation and other visual recognition tasks.
[ECCV2024] API code for T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy
• • 120• 2k• 2• 0• Updated Jun 25, 2024 Jun 25, 2024 [ICML 2024] 🍅HumanTOMATO: Text-aligned Whole-body Motion Generation
• • 6• 243• 11• 0• Updated Jun 19, 2024 Jun 19, 2024 API for Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series
• • 20• 596• 14• 0• Updated Jun 13, 2024 Jun 13, 2024 [NeurIPS 2023] Official implementation of the paper "Motion-X: A Large-scale 3D Expressive Whole-body Human Motion Dataset"
• • 13• 484• 55• 0• Updated Jun 12, 2024 Jun 12, 2024 Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
[NeurIPS 2023] Official implementation of the paper "DreamWaltz: Make a Scene with Complex 3D Animatable Avatars".
• • 8• 169• 1• 0• Updated May 8, 2024 May 8, 2024 [ICLR 2024] Official implementation of the paper "Toss: High-quality text-guided novel view synthesis from a single image"
• • 1• 12• 1• 0• Updated May 5, 2024 May 5, 2024 [ICCV 2023] Official implementation of the paper "Detection Transformer with Stable Matching"
• • 4• 198• 15• 0• Updated Apr 29, 2024 Apr 29, 2024 The Go-To Choice for CV Data Visualization, Annotation, and Model Analysis.
• • 14• 208• 2• 0• Updated Apr 19, 2024 Apr 19, 2024 [ICLR 2023] Official implementation of the paper "DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection"
• 1• 9• 0• 0• Updated Feb 19, 2024 Feb 19, 2024 [ICCV 2023] Official implementation of the paper "A Simple Framework for Open-Vocabulary Segmentation and Detection"
• • 37• 614• 17• 0• Updated Jan 22, 2024 Jan 22, 2024 [CVPR 2022 Oral] Official implementation of DN-DETR
• • 57• 528• 34• 1• Updated Dec 20, 2023 Dec 20, 2023 [CVPR 2023] Official implementation of the paper "Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and Segmentation"
• • 97• 1.1k• 48• 0• Updated Dec 20, 2023 Dec 20, 2023 • 0• 2• 0• 0• Updated Dec 15, 2023 Dec 15, 2023 "Effective Whole-body Pose Estimation with Two-stages Distillation" (ICCV 2023, CV4Metaverse Workshop)
• • 134• 2k• 23• 2• Updated Dec 12, 2023 Dec 12, 2023 [ICCV 2023] The official implementation of paper "HumanSD: A Native Skeleton-Guided Diffusion Model for Human Image Generation"
• • 17• 262• 7• 0• Updated Oct 24, 2023 Oct 24, 2023 [CVPR 2023] Official implementation of the paper: MP-Former: Mask-Piloted Transformer for Image Segmentation
• • 3• 111• 5• 0• Updated Oct 22, 2023 Oct 22, 2023 [CVPR 2023] Official implementation of the paper "One-Stage 3D Whole-Body Mesh Recovery with Component Aware Transformer"
• • 53• 600• 20• 0• Updated Oct 12, 2023 Oct 12, 2023 [ICCV 2023] Official implementation of the paper "Neural Interactive Keypoint Detection"
• • 2• 63• 2• 0• Updated Oct 12, 2023 Oct 12, 2023 [CVPR 2023] The official implementation of CVPR 2023 paper "Human-Art: A Versatile Human-Centric Dataset Bridging Natural and Artificial Scenes"
• 3• 205• 2• 0• Updated Oct 3, 2023 Oct 3, 2023 [ICCV 2023] Official implementation of the paper "DFA3D: 3D Deformable Attention For 2D-to-3D Feature Lifting"
• • 2• 139• 2• 0• Updated Sep 20, 2023 Sep 20, 2023 [ICLR 2023] Official implementation of the paper "Explicit Box Detection Unifies End-to-End Multi-Person Pose Estimation "
• • 9• 145• 17• 0• Updated Sep 20, 2023 Sep 20, 2023 Official implementation of the paper "Boosting Human-Object Interaction Detection with Text-to-Image Diffusion Model"
• • 1• 38• 0• 0• Updated Aug 9, 2023 Aug 9, 2023 Official PyTorch implementation of the paper "DisCo-CLIP: A Distributed Contrastive Loss for Memory Efficient CLIP Training".
• • 3• 42• 5• 0• Updated Aug 2, 2023 Aug 2, 2023
You can’t perform that action at this time.