Kevin's repositories
awesome-cbir-papers
📝Awesome and classical image retrieval papers
generative_agents
Generative Agents: Interactive Simulacra of Human Behavior
PyTorch-BigGraph
Generate embeddings from large-scale graph-structured data.
StreamDiffusion
StreamDiffusion: A Pipeline-Level Solution for Real-Time Interactive Generation
so-vits-svc
SoftVC VITS Singing Voice Conversion
awesome-vector-search
Collections of vector-search-related libraries, services, and research papers
BLIP
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
camel
🐫 CAMEL: Communicative Agents for “Mind” Exploration of Large Scale Language Model Society
cu
package cu provides an idiomatic interface to the CUDA Driver API.
DB
A PyTorch implementation of "Real-time Scene Text Detection with Differentiable Binarization".
elastiknn
Elasticsearch plugin for nearest neighbor search. Store vectors and run similarity search using exact and approximate algorithms.
embedding_model_test
Tests of Chinese vector quality based on open-source embedding models
faiss
A library for efficient similarity search and clustering of dense vectors.
gans-awesome-applications
Curated list of awesome GAN applications and demos
genworlds
The pod that creates, ensembles, and deploys agents on demand.
GPTeam
GPTeam: An open-source multi-agent simulation
guidance
A guidance language for controlling large language models.
hnswlib
Header-only C++/python library for fast approximate nearest neighbors
langchaingo
LangChain for Go
LaTeX-OCR
pix2tex: Using a ViT to convert images of equations into LaTeX code.
LAVIS
LAVIS - A One-stop Library for Language-Vision Intelligence
llama
Inference code for LLaMA models
LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
mindocr
A toolbox of OCR models, algorithms, and pipelines based on MindSpore
pytorch-image-models
PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNet-V3/V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
SynthText
Code for generating synthetic text images as described in "Synthetic Data for Text Localisation in Natural Images", Ankush Gupta, Andrea Vedaldi, Andrew Zisserman, CVPR 2016.
TextRecognitionDataGenerator
A synthetic data generator for text recognition
unilm
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
webrtc
Pure Go implementation of the WebRTC API