35 repositories

awesome-detection-transformer
Public
A collection of papers on transformers for detection and segmentation. Awesome Detection Transformer for Computer Vision (CV)
108•1.2k•7•2•Updated Jul 4, 2024
UniPose
Public
[ECCV 2024] Official implementation of the paper "UniPose: Detecting Any Keypoints"
Python
•
Other
•11•267•11•1•Updated Jul 2, 2024
GroundingDINO
Public
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
open-world object-detection vision-language vision-language-transformer open-world-detection
Python
•
Apache License 2.0
•591•5.6k•230•8•Updated Jun 28, 2024
MotionLLM
Public
[Arxiv-2024] MotionLLM: Understanding Human Behaviors from Human Motions and Videos
Python
•
Other
•4•181•3•0•Updated Jun 28, 2024
detrex
Public
detrex is a research platform for DETR-based object detection, segmentation, pose estimation and other visual recognition tasks.
segmentation pose-estimation dino state-of-the-art deta detr deformable-detr dab-detr mask-dino dn-detr
Python
•
Apache License 2.0
•199•1.9k•58•3•Updated Jun 27, 2024
T-Rex
Public
[ECCV 2024] API code for T-Rex2: Towards Generic Object Detection via Text-Visual Prompt Synergy
interactive object-detection open-set object-counting text-prompt visual-prompt
Python
•
Other
•120•2k•2•0•Updated Jun 25, 2024
HumanTOMATO
Public
[ICML 2024] 🍅HumanTOMATO: Text-aligned Whole-body Motion Generation
motion generation gpt whole-body smplx motion-generation whole-body-motion
Python
•
Other
•6•243•11•0•Updated Jun 19, 2024
Grounding-DINO-1.5-API
Public
API for Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series
open-world object-detection open-set zero-shot-object-detection foundation-model open-vocabulary-detection grounding-dino
Python
•
Apache License 2.0
•20•596•14•0•Updated Jun 13, 2024
Motion-X
Public
[NeurIPS 2023] Official implementation of the paper "Motion-X: A Large-scale 3D Expressive Whole-body Human Motion Dataset"
Python
•
Other
•13•484•55•0•Updated Jun 12, 2024
Grounded-Segment-Anything
Public
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything
speech image-editing caption data-generation 3d-whole-body-pose-estimation open-vocabulary-detection open-vocabulary-segmentation automatic-labeling-system
Jupyter Notebook
•
Apache License 2.0
•1.3k•14k•274•1•Updated May 23, 2024
DreamWaltz
Public
[NeurIPS 2023] Official implementation of the paper "DreamWaltz: Make a Scene with Complex 3D Animatable Avatars".
Python
•
Other
•8•169•1•0•Updated May 8, 2024
TOSS
Public
[ICLR 2024] Official implementation of the paper "Toss: High-quality text-guided novel view synthesis from a single image"
open-world 3d-generation novel-view-synthesis
Python
•
Apache License 2.0
•1•12•1•0•Updated May 5, 2024
Stable-DINO
Public
[ICCV 2023] Official implementation of the paper "Detection Transformer with Stable Matching"
transformer object-detection dino detection-transformer detrex
Python
•
Apache License 2.0
•4•198•15•0•Updated Apr 29, 2024
deepdataspace
Public
The Go-To Choice for CV Data Visualization, Annotation, and Model Analysis.
computer-vision model-analysis labeling-tool dataset-visualization intelligent-annotation collaborative-annotation
TypeScript
•
Apache License 2.0
•14•208•2•0•Updated Apr 19, 2024
DINO
Public
[ICLR 2023] Official implementation of the paper "DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection"
computer-vision deep-learning object-detection
Python
•
Apache License 2.0
•228•2.1k•138•2•Updated Apr 7, 2024
IYFC
Public
C++
•1•9•0•0•Updated Feb 19, 2024
OpenSeeD
Public
[ICCV 2023] Official implementation of the paper "A Simple Framework for Open-Vocabulary Segmentation and Detection"
Python
•
Apache License 2.0
•37•614•17•0•Updated Jan 22, 2024
DN-DETR
Public
[CVPR 2022 Oral] Official implementation of DN-DETR
object-detection detr
Python
•
Apache License 2.0
•57•528•34•1•Updated Dec 20, 2023
MaskDINO
Public
[CVPR 2023] Official implementation of the paper "Mask DINO: Towards A Unified Transformer-based Framework for Object Detection and Segmentation"
object-detection semantic-segmentation instance-segmentation panoptic-segmentation
Python
•
Apache License 2.0
•97•1.1k•48•0•Updated Dec 20, 2023
detrex-storage
Public
Apache License 2.0
•0•2•0•0•Updated Dec 15, 2023
DWPose
Public
"Effective Whole-body Pose Estimation with Two-stages Distillation" (ICCV 2023, CV4Metaverse Workshop)
knowledge-distillation pose-estimation stable-diffusion-webui controlnet
Python
•
Apache License 2.0
•134•2k•23•2•Updated Dec 12, 2023
HumanSD
Public
[ICCV 2023] The official implementation of the paper "HumanSD: A Native Skeleton-Guided Diffusion Model for Human Image Generation"
deep-learning image-generation iccv conditional-image-generation iccv2023 pytorch
Python
•
Apache License 2.0
•17•262•7•0•Updated Oct 24, 2023
MP-Former
Public
[CVPR 2023] Official implementation of the paper: MP-Former: Mask-Piloted Transformer for Image Segmentation
Python
•
Other
•3•111•5•0•Updated Oct 22, 2023
OSX
Public
[CVPR 2023] Official implementation of the paper "One-Stage 3D Whole-Body Mesh Recovery with Component Aware Transformer"
human-pose-estimation smpl-model smplx 3d-body-recovery whole-body-pose-estimation cvpr2023
Python
•
MIT License
•53•600•20•0•Updated Oct 12, 2023
Click-Pose
Public
[ICCV 2023] Official implementation of the paper "Neural Interactive Keypoint Detection"
annotation-tool pose-estimation human-in-the-loop iccv2023
Python
•
Other
•2•63•2•0•Updated Oct 12, 2023
HumanArt
Public
[CVPR 2023] The official implementation of CVPR 2023 paper "Human-Art: A Versatile Human-Centric Dataset Bridging Natural and Artificial Scenes"
image-generation human-pose-estimation datasets cvpr pose-estimation multi-scenario multi-scene cvpr2023
Apache License 2.0
•3•205•2•0•Updated Oct 3, 2023
3D-deformable-attention
Public
[ICCV 2023] Official implementation of the paper "DFA3D: 3D Deformable Attention For 2D-to-3D Feature Lifting"
Python
•
Other
•2•139•2•0•Updated Sep 20, 2023
ED-Pose
Public
[ICLR 2023] Official implementation of the paper "Explicit Box Detection Unifies End-to-End Multi-Person Pose Estimation"
end-to-end multi-person-pose-estimation iclr2023
Python
•
Other
•9•145•17•0•Updated Sep 20, 2023
DiffHOI
Public
Official implementation of the paper "Boosting Human-Object Interaction Detection with Text-to-Image Diffusion Model"
Python
•
Other
•1•38•0•0•Updated Aug 9, 2023
DisCo-CLIP
Public
Official PyTorch implementation of the paper "DisCo-CLIP: A Distributed Contrastive Loss for Memory Efficient CLIP Training".
Python
•
Apache License 2.0
•3•42•5•0•Updated Aug 2, 2023

IDEA-Research
