Aph-xin's starred repositories
mv-extractor
Extract frames and motion vectors from H.264 and MPEG-4 encoded video.
DragonDiffusion
ICLR 2024 (Spotlight)
EQUI-VOCAL
EQUI-VOCAL: Synthesizing Queries for Compositional Video Events from Limited User Interactions
Awesome-Open-Vocabulary
(TPAMI 2024) A Survey on Open Vocabulary Learning
cityscapesScripts
README and scripts for the Cityscapes Dataset
chatgpt-ui-server
A ChatGPT UI server based on the Django framework.
GroundingDINO
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
Grounding-DINO-1.5-API
API for Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series
InstanceDiffusion
[CVPR 2024] Code release for "InstanceDiffusion: Instance-level Control for Image Generation"
dive-into-llms
The "Dive into LLMs" (动手学大模型) series of hands-on programming tutorials
autodistill-owlv2
OWLv2 base model for use with Autodistill.