比你笨's repositories
ADAS
Automated Design of Agentic Systems, 使用中文大模型
bgpt
Beyond Language Models: Byte Models are Digital World Simulators
ColBERT
ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22)
ComfyUI
The most powerful and modular stable diffusion GUI with a graph/nodes interface.
Emu3
Next-Token Prediction is All You Need
GeneFace
GeneFace: Generalized and High-Fidelity 3D Talking Face Synthesis; ICLR 2023; Official code
GeneFacePlusPlus
GeneFace++: Generalized and Stable Real-Time 3D Talking Face Generation; Official Code
golc
🚀 Building Go applications with LLMs through composability
jepa
PyTorch code and models for V-JEPA self-supervised learning from video.
LS-LLaMA
A Simple but Powerful SOTA NER Model | Official Code For Label Supervised LLaMA Finetuning
MockingBird
🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time
nano-graphrag
A simple, easy-to-hack GraphRAG implementation
Native-LLM-for-Android
Demonstration of running a native LLM on Android device.
Open-Sora
Open-Sora: Democratizing Efficient Video Production for All
PowerInfer
High-speed Large Language Model Serving on PCs with Consumer-grade GPUs
Qwen-Agent
Agent framework and applications built upon Qwen2, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.
Real3DPortrait
Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis; ICLR 2024 Spotlight; Official code
representation-engineering
Representation Engineering: A Top-Down Approach to AI Transparency
StableCascade
Official Code for Stable Cascade
Time-Series-Library
A Library for Advanced Deep Time Series Models.
TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
WiFiAnalyzer
Android application to analyze WiFi signals.
WikiChat
WikiChat stops the hallucination of large language models by retrieving data from Wikipedia.