Liangyu Liu's starred repositories
open-interpreter
A natural language interface for computers
YOLO-World
[CVPR 2024] Real-Time Open-Vocabulary Object Detection
clash-for-linux
clash-for-linux
flash-attention
Fast and memory-efficient exact attention
duckduckgo_search
Search for words, documents, images, videos, news, maps and text translation using the DuckDuckGo.com search engine. Downloading files and images to a local hard drive.
Medical_NLP
Medical NLP Competition, dataset, large models, paper
gemma_pytorch
The official PyTorch implementation of Google's Gemma models
GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
magic-animate
[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
AnimateAnyone
Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation
self-operating-computer
A framework to enable multimodal models to operate a computer.
Qwen-Agent
Agent framework and applications built upon Qwen2, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.