Aph-xin

Aph-xin's starred repositories

lancedb

Developer-friendly, serverless vector database for AI applications. Easily add long-term memory to your LLM apps!

Language: Rust | License: Apache-2.0 | Stargazers: 4700 | Issues: 0
Language: Python | Stargazers: 14 | Issues: 0

mv-extractor

Extract frames and motion vectors from H.264 and MPEG-4 encoded video.

Language: C | License: MIT | Stargazers: 299 | Issues: 0

DragonDiffusion

ICLR 2024 (Spotlight)

Language: Python | License: Apache-2.0 | Stargazers: 724 | Issues: 0

EQUI-VOCAL

EQUI-VOCAL: Synthesizing Queries for Compositional Video Events from Limited User Interactions

Language: Jupyter Notebook | License: MIT | Stargazers: 6 | Issues: 0

SLD

🔥 [CVPR 2024] Official implementation of "Self-correcting LLM-controlled Diffusion Models" (SLD)

Language: Python | License: MIT | Stargazers: 154 | Issues: 0

Awesome-Open-Vocabulary

(TPAMI 2024) A Survey on Open Vocabulary Learning

Stargazers: 837 | Issues: 0

cityscapesScripts

README and scripts for the Cityscapes Dataset

Language: Python | License: MIT | Stargazers: 2177 | Issues: 0

QD-DETR

Official PyTorch repository for "QD-DETR: Query-Dependent Video Representation for Moment Retrieval and Highlight Detection" (CVPR 2023 paper)

Language: Python | License: NOASSERTION | Stargazers: 204 | Issues: 0

OpenPSG

Benchmarking Panoptic Scene Graph Generation (PSG), ECCV'22

Language: Python | License: MIT | Stargazers: 422 | Issues: 0

miris

MIRIS: Fast Object Track Queries in Video

Language: Go | License: MIT | Stargazers: 17 | Issues: 0

everest

Top-K Deep Video Analytics: A Probabilistic Approach

Language: Python | License: GPL-3.0 | Stargazers: 12 | Issues: 0

seesaw

(Research) Interactive retrieval system and algorithms: find objects of interest within image databases with less human effort

Language: Jupyter Notebook | License: MIT | Stargazers: 6 | Issues: 0

chatgpt-ui-server

A ChatGPT UI server based on the Django framework.

Language: Python | Stargazers: 300 | Issues: 0

ragflow

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

Language: Python | License: Apache-2.0 | Stargazers: 22334 | Issues: 0

GroundingDINO

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Language: Python | License: Apache-2.0 | Stargazers: 6710 | Issues: 0

mmcv

OpenMMLab Computer Vision Foundation

Language: Python | License: Apache-2.0 | Stargazers: 5896 | Issues: 0

GLEE

[CVPR 2024 Highlight] GLEE: General Object Foundation Model for Images and Videos at Scale

Language: Python | License: MIT | Stargazers: 1076 | Issues: 0

Grounding-DINO-1.5-API

API for Grounding DINO 1.5: IDEA Research's Most Capable Open-World Object Detection Model Series

Language: Python | License: Apache-2.0 | Stargazers: 771 | Issues: 0

BLIP

PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

Language: Jupyter Notebook | License: BSD-3-Clause | Stargazers: 4800 | Issues: 0

qdrant

Qdrant - High-performance, massive-scale Vector Database and Vector Search Engine for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/

Language: Rust | License: Apache-2.0 | Stargazers: 20514 | Issues: 0

d2l-zh

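Dive into Deep Learning (《动手学深度学习》): written for Chinese readers, runnable, and open to discussion. The Chinese and English editions are used for teaching at more than 500 universities in over 70 countries.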

Language: Python | License: Apache-2.0 | Stargazers: 63461 | Issues: 0

MIGC

[CVPR 2024 Highlight] "MIGC: Multi-Instance Generation Controller for Text-to-Image Synthesis" (Official Implementation)

Language: Python | License: NOASSERTION | Stargazers: 537 | Issues: 0

InstanceDiffusion

[CVPR 2024] Code release for "InstanceDiffusion: Instance-level Control for Image Generation"

Language: Python | License: Apache-2.0 | Stargazers: 503 | Issues: 0

scenic

Scenic: A Jax Library for Computer Vision Research and Beyond

Language: Python | License: Apache-2.0 | Stargazers: 3324 | Issues: 0

CLIP4Clip

An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"

Language: Python | License: MIT | Stargazers: 877 | Issues: 0

bootcamp

Examples for working with unstructured data: reverse image search, audio search, molecular search, video analysis, question-answering systems, NLP, and more.

Language: HTML | License: Apache-2.0 | Stargazers: 1878 | Issues: 0

dive-into-llms

Dive into LLMs (《动手学大模型》): a series of hands-on programming tutorials

Stargazers: 3697 | Issues: 0

autodistill-owlv2

OWLv2 base model for use with Autodistill.

Language: Python | License: Apache-2.0 | Stargazers: 5 | Issues: 0

nanoowl

A project that optimizes OWL-ViT for real-time inference with NVIDIA TensorRT.

Language: Python | License: Apache-2.0 | Stargazers: 254 | Issues: 0