Beast code in Giters

lujiale621's starred repositories

torchchat

Run PyTorch LLMs locally on servers, desktop and mobile

Language:PythonBSD-3-Clause251000

NapCatQQ

基于NTQQ的无头Bot框架

Language:TypeScriptMPL-2.0143100

pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Language:Jupyter NotebookMIT565000

transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Language:PythonApache-2.013020500

Stable-Hair

Stable-Hair: Real-World Hair Transfer via Diffusion Model

Apache-2.027900

MovieChat

[CVPR 2024] 🎬💭 chat with over 10K frames of video!

Language:PythonBSD-3-Clause47200

CosyVoice

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

Language:PythonApache-2.0342800

How-to-use-Transformers

Transformers 库快速入门教程

Language:PythonApache-2.087700

IDM-VTON

[ECCV2024] IDM-VTON : Improving Diffusion Models for Authentic Virtual Try-on in the Wild

Language:Python331900

Qwen2-Audio

The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.

52000

IMAGDressing

👔IMAGDressing👔: Interactive Modular Apparel Generation for Virtual Dressing

Language:PythonApache-2.083900

Stirling-PDF

#1 Locally hosted web application that allows you to perform various operations on PDF files

Language:JavaGPL-3.03568100

mr-Blip

Official Implementation of "The Surprising Effectiveness of Multimodal Large Language Models for Video Moment Retrieval"

Language:PythonBSD-3-Clause2600

fish-speech

Brand new TTS solution

Language:PythonNOASSERTION692300

SoniTranslate

Synchronized Translation for Videos. Video dubbing

Language:PythonApache-2.039900

StreamSpeech

StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.

Language:PythonMIT78700

video-mamba-suite

The suite of modeling video with Mamba

Language:PythonMIT20200

R2-Tuning

🌀 R^2-Tuning: Efficient Image-to-Video Transfer Learning for Video Temporal Grounding (ECCV 2024)

Language:PythonBSD-3-Clause4200

Speech-to-text, text-to-speech, and speaker recognition using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, Raspberry Pi, RISC-V, x86_64 servers, websocket server/client, C/C++, Python, Kotlin, C#, Go, NodeJS, Java, Swift, Dart, JavaScript, Flutter

Language:C++Apache-2.0267500

lujiale621

lujiale621's starred repositories

torchchat

NapCatQQ

pyannote-audio

transformers

Stable-Hair

MovieChat

CosyVoice

How-to-use-Transformers

IDM-VTON

Qwen2-Audio

IMAGDressing

Stirling-PDF

mr-Blip

fish-speech

SoniTranslate

StreamSpeech

video-mamba-suite

R2-Tuning

sherpa-onnx

wesubtitle

ShareGPT4Video

video-subtitle-extractor

BilibiliSummary

Uni-TTS

GPT-SoVITS-Inference

MiniCPM-V

GPT-SoVITS

RTranslator

whisper

fairseq