KeithZh's starred repositories

Machine-Learning-Yearning-Chinese-ver

(Completed) Translation documents for Andrew Ng's "Machine Learning Yearning" (Chinese translation and original English manuscript)

License: CC-BY-SA-4.0 | Stargazers: 254 | Issues: 0

x-transformers

A simple but complete full-attention transformer with a set of promising experimental features from various papers

Language: Python | License: MIT | Stargazers: 4263 | Issues: 0

piper

A fast, local neural text to speech system

Language: C++ | License: MIT | Stargazers: 4694 | Issues: 0

khoj

Your AI second brain. Get answers to your questions, whether they are online or in your own notes. Use online AI models (e.g. gpt4) or private, local LLMs (e.g. llama3). Self-host locally or use our cloud instance. Access from Obsidian, Emacs, Desktop app, Web or WhatsApp.

Language: Python | License: AGPL-3.0 | Stargazers: 11240 | Issues: 0

SubFix

SubFix: Efficient Web-Based Audio Subtitle Editing and Multilingual Automatic Annotation Tool.

Language: Python | License: Apache-2.0 | Stargazers: 178 | Issues: 0

ChatGLM-finetune-LoRA

Code for fine-tuning ChatGLM-6b using low-rank adaptation (LoRA)

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 716 | Issues: 0

ChatTTS

ChatTTS is a generative speech model for daily dialogue.

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 23148 | Issues: 0

AudioGPT

AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head

Language: Python | License: NOASSERTION | Stargazers: 9846 | Issues: 0

MassTTS

A TTS demo for training new characters.

Language: Python | License: Apache-2.0 | Stargazers: 410 | Issues: 0

bark-voice-cloning-HuBERT-quantizer

The code for the bark-voicecloning model. Training and inference.

Language: Python | License: MIT | Stargazers: 604 | Issues: 0

Language: Python | License: MIT | Stargazers: 9 | Issues: 0

diffusion_models

All about the fundamentals and working of Diffusion Models

Language: HTML | License: MIT | Stargazers: 146 | Issues: 0

vector-quantize-pytorch

Vector (and Scalar) Quantization, in PyTorch

Language: Python | License: MIT | Stargazers: 2033 | Issues: 0

vq-vae-2-pytorch

Implementation of Generating Diverse High-Fidelity Images with VQ-VAE-2 in PyTorch

Language: Python | License: NOASSERTION | Stargazers: 1538 | Issues: 0

MockingBird

🚀 AI voice cloning: clone a voice in 5 seconds to generate arbitrary speech in real time

Language: Python | License: NOASSERTION | Stargazers: 34243 | Issues: 0

AnyGPT

Code for "AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling"

Language: Python | Stargazers: 597 | Issues: 0

qlib

Qlib is an AI-oriented quantitative investment platform that aims to realize the potential, empower research, and create value using AI technologies in quantitative investment, from exploring ideas to implementing productions. Qlib supports diverse machine learning modeling paradigms, including supervised learning, market dynamics modeling, and RL.

Language: Python | License: MIT | Stargazers: 14414 | Issues: 0

AnimatableGaussians

Code of [CVPR 2024] "Animatable Gaussians: Learning Pose-dependent Gaussian Maps for High-fidelity Human Avatar Modeling"

Language: Python | License: NOASSERTION | Stargazers: 788 | Issues: 0

E2FGVI

Official code for "Towards An End-to-End Framework for Flow-Guided Video Inpainting" (CVPR2022)

Language: Python | License: NOASSERTION | Stargazers: 976 | Issues: 0

TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Language: Python | License: MPL-2.0 | Stargazers: 30715 | Issues: 0

devops-api

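Automated operations platform: CMDB, CI/CD, DevOps, asset management, task orchestration, continuous delivery, and operations management; an operations release platform built on Django + REST framework + Vue, plus a UI automated testing platform.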

Language: Python | Stargazers: 348 | Issues: 0

EMO

Emote Portrait Alive: Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions

Stargazers: 7070 | Issues: 0

AnimateAnyone

Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation

License: Apache-2.0 | Stargazers: 14075 | Issues: 0

Chinese-CLIP

Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.

Language: Python | License: MIT | Stargazers: 3829 | Issues: 0

diaspora

A privacy-aware, distributed, open source social network.

Language: Ruby | License: AGPL-3.0 | Stargazers: 13370 | Issues: 0

audio2photoreal

Code and dataset for photorealistic Codec Avatars driven from audio

Language: Python | License: NOASSERTION | Stargazers: 2565 | Issues: 0

havatar

[TOG 2023] HAvatar: High-fidelity Head Avatar via Facial Model Conditioned Neural Radiance Field

Language: Python | Stargazers: 116 | Issues: 0

Awesome-Talking-Head-Synthesis

💬 An extensive collection of exceptional resources dedicated to the captivating world of talking face synthesis! ⭐ If you find this repo useful, please give it a star! 🤩

License: MIT | Stargazers: 522 | Issues: 0

FlagEmbedding

Retrieval and Retrieval-augmented LLMs

Language: Python | License: MIT | Stargazers: 5581 | Issues: 0

SIM

Official repository of Semantic Image Matting

Language: Python | Stargazers: 217 | Issues: 0