Aph-xin's starred repositories

lancedb

Developer-friendly, serverless vector database for AI applications. Easily add long-term memory to your LLM apps!

Language: Rust | License: Apache-2.0 | 4700 | 0 | 0

ReDiffusion

Language: Python | 14 | 0 | 0

mv-extractor

Extract frames and motion vectors from H.264 and MPEG-4 encoded video.

Language: C | License: MIT | 299 | 0 | 0

DragonDiffusion

ICLR 2024 (Spotlight)

Language: Python | License: Apache-2.0 | 724 | 0 | 0

EQUI-VOCAL

EQUI-VOCAL: Synthesizing Queries for Compositional Video Events from Limited User Interactions

Language: Jupyter Notebook | License: MIT | 6 | 0 | 0

SLD

🔥 [CVPR2024] Official implementation of "Self-correcting LLM-controlled Diffusion Models (SLD)"

Language: Python | License: MIT | 154 | 0 | 0

Awesome-Open-Vocabulary

(TPAMI 2024) A Survey on Open Vocabulary Learning

837 | 0 | 0

cityscapesScripts

README and scripts for the Cityscapes Dataset

Language: Python | License: MIT | 2177 | 0 | 0

QD-DETR

Official PyTorch repository for "QD-DETR: Query-Dependent Video Representation for Moment Retrieval and Highlight Detection" (CVPR 2023 Paper)

Language: Python | License: NOASSERTION | 204 | 0 | 0

OpenPSG

Benchmarking Panoptic Scene Graph Generation (PSG), ECCV'22

Language: Python | License: MIT | 422 | 0 | 0

miris

MIRIS: Fast Object Track Queries in Video

Language: Go | License: MIT | 17 | 0 | 0

everest

Top-K Deep Video Analytics: A Probabilistic Approach

Language: Python | License: GPL-3.0 | 12 | 0 | 0

seesaw

(Research) interactive retrieval system+algorithms: find objects of interest within image databases with less human effort

Language: Jupyter Notebook | License: MIT | 6 | 0 | 0

chatgpt-ui-server

A ChatGPT UI server based on the Django framework.

Language: Python | 300 | 0 | 0

ragflow

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

Language: Python | License: Apache-2.0 | 22334 | 0 | 0

GroundingDINO

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Language: Python | License: Apache-2.0 | 6710 | 0 | 0

mmcv

OpenMMLab Computer Vision Foundation

Language: Python | License: Apache-2.0 | 5896 | 0 | 0

GLEE

[CVPR2024 Highlight] GLEE: General Object Foundation Model for Images and Videos at Scale

Language: Python | License: MIT | 1076 | 0 | 0

Grounding-DINO-1.5-API

API for Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series

Language: Python | License: Apache-2.0 | 771 | 0 | 0

BLIP

PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

Language: Jupyter Notebook | License: BSD-3-Clause | 4800 | 0 | 0

qdrant

Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/

Language: Rust | License: Apache-2.0 | 20514 | 0 | 0

d2l-zh

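Dive into Deep Learning: written for Chinese readers, runnable, and open to discussion. The Chinese and English editions are used for teaching at more than 500 universities in over 70 countries.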

Language: Python | License: Apache-2.0 | 63461 | 0 | 0

MIGC

[CVPR 2024 Highlight] "MIGC: Multi-Instance Generation Controller for Text-to-Image Synthesis" (Official Implementation)

Language: Python | License: NOASSERTION | 537 | 0 | 0

InstanceDiffusion

[CVPR 2024] Code release for "InstanceDiffusion: Instance-level Control for Image Generation"

Language: Python | License: Apache-2.0 | 503 | 0 | 0

scenic

Scenic: A Jax Library for Computer Vision Research and Beyond

Language: Python | License: Apache-2.0 | 3324 | 0 | 0

CLIP4Clip

An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"

Language: Python | License: MIT | 877 | 0 | 0

bootcamp

Dealing with all unstructured data, such as reverse image search, audio search, molecular search, video analysis, question and answer systems, NLP, etc.

Language: HTML | License: Apache-2.0 | 1878 | 0 | 0

dive-into-llms

Dive into LLMs: a series of hands-on programming tutorials

3697 | 0 | 0

autodistill-owlv2

OWLv2 base model for use with Autodistill.

Language: Python | License: Apache-2.0 | 5 | 0 | 0

nanoowl

A project that optimizes OWL-ViT for real-time inference with NVIDIA TensorRT.

Language: Python | License: Apache-2.0 | 254 | 0 | 0