taichuai

followers

following

stars

Mars

long_time_no_see's starred repositories

ChatGLM-6B

ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型

Language:PythonApache-2.039627 395 1285

visual-chatgpt

Official repo for the paper: Visual ChatGPT: Talking, Drawing and Editing with Visual Foundation Models

Language:PythonMIT31643 275 282

so-vits-svc

SoftVC VITS Singing Voice Conversion

Language:PythonAGPL-3.024312 172 130

V2rayU

V2rayU,基于v2ray核心的mac版客户端,用于科学上网,使用swift编写,支持trojan,vmess,shadowsocks,socks5等服务协议,支持订阅, 支持二维码,剪贴板导入,手动配置,二维码分享等

voice-changer

リアルタイムボイスチェンジャー Realtime Voice Changer

Language:PythonNOASSERTION14973 108 933

evals

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

Language:PythonNOASSERTION14112 261 199

the-algorithm-ml

Source code for Twitter's Recommendation Algorithm

Language:PythonAGPL-3.09927 99 35

BELLE

BELLE: Be Everyone's Large Language model Engine（开源中文对话大模型）

Language:HTMLApache-2.07626 107 436

PaLM-rlhf-pytorch

Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM

Language:PythonMIT7614 142 46

Text2Video-Zero

[ICCV 2023 Oral] Text-to-Image Diffusion Models are Zero-Shot Video Generators

Language:PythonNOASSERTION3847 64 67

torchscale

Foundation Architecture for (M)LLMs

Language:PythonMIT2946 46 74

DiffSinger

An advanced singing voice synthesis system with high fidelity, expressiveness, controllability and flexibility based on DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism

Language:PythonApache-2.02591 35 91

Alpaca-CoT

We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tuning) together for easy use. We welcome open-source enthusiasts to initiate any meaningful PR on this repo and integrate as many LLM related technologies as possible. 我们打造了方便研究人员上手和使用大模型等微调平台，我们欢迎开源爱好者发起任何有意义的pr！

Language:Jupyter NotebookApache-2.02501 37 97

Deep3DFaceReconstruction

Accurate 3D Face Reconstruction with Weakly-Supervised Learning: From Single Image to Image Set (CVPRW 2019)

Language:PythonMIT2115 70 209

gigagan-pytorch

Implementation of GigaGAN, new SOTA GAN out of Adobe. Culmination of nearly a decade of research into GANs

Language:PythonMIT1632 73 45

unidiffuser

Code and models for the paper "One Transformer Fits All Distributions in Multi-Modal Diffusion"

Language:PythonAGPL-3.01303 17 32

muse-maskgit-pytorch

Implementation of Muse: Text-to-Image Generation via Masked Generative Transformers, in Pytorch

Language:PythonMIT821 34 36

HumanML3D

HumanML3D: A large and diverse 3d human motion-language dataset.

Language:PythonMIT626 8 125

so-vits-svc-Chinese-Detaild-Documents

So-VITS-SVC 中文本地部署/训练/推理/使用帮助文档

Language:Jupyter NotebookAGPL-3.0536 4 11

open-musiclm

Implementation of MusicLM, a text to music model published by Google Research, with a few modifications.

Language:PythonMIT488 17 25

MDT

Masked Diffusion Transformer is the SOTA for image synthesis. (ICCV 2023)

Language:PythonApache-2.0459 18 44

Large-Audio-Models

Keep track of big models in audio domain, including speech, singing, music etc.

SHERF

Code for our ICCV'2023 paper "SHERF: Generalizable Human NeRF from a Single Image"

Language:PythonNOASSERTION293 33 37

ChineseTtsTflite

Android Chinese TTS Engine Base On Tensorflow TTS , use for TfLite Models Test。安卓离线中文TTS引擎，在TensorflowTTS基础上开发，用于TfLite模型测试。

Language:JavaApache-2.0259 6 11

awesome-conditional-content-generation

Update-to-data resources for conditional content generation, including human motion generation, image or video generation and editing.

TranSpeech

PyTorch Implementation of TranSpeech (ICLR'23): Textless NAR Speech-to-Speech Translation with Bilateral Perturbation

Language:PythonMIT158 16 3

TriAAN-VC

TriAAN-VC: Triple Adaptive Attention Normalization for Any-to-Any Voice Conversion

Language:PythonMIT125 7 21

SVCC23_FastSVC

Singing Voice Conversion Challenge 2023 Starter Kit: FastSVC Reimplementation

Language:Python104 7 10

EmoTalk

This is the repository for EmoTalk: Speech-Driven Emotional Disentanglement for 3D Face Animation

LfID

On the Learning Mechanisms in Physical Reasoning, NeurIPS 2022.

Language:Python800