Skip to content
View rentainhe's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report rentainhe

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.

Starred repositories

Showing results

Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.

Python 213 9 Updated Aug 23, 2024

[ICML 2024] Selecting High-Quality Data for Training Language Models

Python 130 9 Updated Jun 20, 2024

VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)

Python 1,506 114 Updated Aug 22, 2024

RLAIF-V: Aligning MLLMs through Open-Source AI Feedback for Super GPT-4V Trustworthiness

Python 190 6 Updated Aug 12, 2024

SD变现宝:一键把comfyui工作流转换成小程序。

Python 881 106 Updated Aug 20, 2024

Run Segment Anything Model 2 on a live video stream

Jupyter Notebook 71 11 Updated Aug 20, 2024

[ICLR 2024] SWE-Bench: Can Language Models Resolve Real-world Github Issues?

Python 1,661 279 Updated Aug 20, 2024

Bring portraits to life!

Python 10,427 1,030 Updated Aug 19, 2024

Python Library to evaluate VLM models' robustness across diverse benchmarks

Jupyter Notebook 146 5 Updated Aug 23, 2024

This repository compiles a list of papers related to the application of video technology in the field of robotics! Star⭐ the repo and follow me if you like what you see🤩.

100 6 Updated Aug 12, 2024

A curated list of video object segmentation (vos) papers, datasets, and projects.

169 4 Updated Aug 23, 2024

[ECCV 2024] Paying More Attention to Image: A Training-Free Method for Alleviating Hallucination in LVLMs

Python 30 Updated Aug 12, 2024

Text-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 5,831 533 Updated Aug 23, 2024

SF3D: Stable Fast 3D Mesh Reconstruction with UV-unwrapping and Illumination Disentanglement

Python 834 65 Updated Aug 14, 2024

A Collection on Large Language Models for Optimization

94 11 Updated Aug 19, 2024

Run PyTorch LLMs locally on servers, desktop and mobile

Python 2,992 183 Updated Aug 24, 2024

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything

Jupyter Notebook 4 Updated Aug 13, 2024

Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO and SAM 2

Jupyter Notebook 506 23 Updated Aug 21, 2024

Official inference repo for FLUX.1 models

Python 11,541 763 Updated Aug 21, 2024

Official implementation of OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion

Python 186 8 Updated Aug 22, 2024

[AAAI2024] FontDiffuser: One-Shot Font Generation via Denoising Diffusion with Multi-Scale Content Aggregation and Style Contrastive Learning

Python 251 22 Updated Mar 14, 2024

MINT-1T: A one trillion token multimodal interleaved dataset.

714 18 Updated Jul 31, 2024
Python 701 42 Updated Aug 13, 2024

PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838

Python 606 29 Updated Aug 20, 2024

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 9,873 724 Updated Aug 21, 2024

DUSt3R: Geometric 3D Vision Made Easy

Python 4,893 539 Updated Aug 10, 2024

[ECCV2024] Adaptive Parametric Activation

Jupyter Notebook 23 2 Updated Jul 26, 2024

The memory layer for Personalized AI

Python 20,091 1,877 Updated Aug 24, 2024

The Autograd Engine

Python 436 33 Updated Aug 24, 2024

DenseFusion-1M: Merging Vision Experts for Comprehensive Multimodal Perception

Python 101 1 Updated Aug 23, 2024
Next
-