long_time_no_see (taichuai)

taichuai

Location: Mars

long_time_no_see's starred repositories

ChatGLM-6B

ChatGLM-6B: An Open Bilingual Dialogue Language Model

Language: Python, License: Apache-2.0, Stargazers: 39627, Issues: 395, Issues: 1285

visual-chatgpt

Official repo for the paper: Visual ChatGPT: Talking, Drawing and Editing with Visual Foundation Models

Language: Python, License: MIT, Stargazers: 31643, Issues: 275, Issues: 282

so-vits-svc

SoftVC VITS Singing Voice Conversion

Language: Python, License: AGPL-3.0, Stargazers: 24312, Issues: 172, Issues: 130

V2rayU

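V2rayU, a Mac client based on the v2ray core for circumventing internet censorship, written in Swift. It supports the trojan, vmess, shadowsocks, and socks5 protocols, subscriptions, import via QR code or clipboard, manual configuration, and QR code sharing.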

voice-changer

Realtime Voice Changer

Language: Python, License: NOASSERTION, Stargazers: 14973, Issues: 108, Issues: 933

evals

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

Language: Python, License: NOASSERTION, Stargazers: 14112, Issues: 261, Issues: 199

the-algorithm-ml

Source code for Twitter's Recommendation Algorithm

Language: Python, License: AGPL-3.0, Stargazers: 9927, Issues: 99, Issues: 35

BELLE

BELLE: Be Everyone's Large Language model Engine (an open-source Chinese dialogue large language model)

Language: HTML, License: Apache-2.0, Stargazers: 7626, Issues: 107, Issues: 436

PaLM-rlhf-pytorch

Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM

Language: Python, License: MIT, Stargazers: 7614, Issues: 142, Issues: 46

Text2Video-Zero

[ICCV 2023 Oral] Text-to-Image Diffusion Models are Zero-Shot Video Generators

Language: Python, License: NOASSERTION, Stargazers: 3847, Issues: 64, Issues: 67

torchscale

Foundation Architecture for (M)LLMs

Language: Python, License: MIT, Stargazers: 2946, Issues: 46, Issues: 74

DiffSinger

An advanced singing voice synthesis system with high fidelity, expressiveness, controllability and flexibility based on DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism

Language: Python, License: Apache-2.0, Stargazers: 2591, Issues: 35, Issues: 91

Alpaca-CoT

We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs, and parameter-efficient methods (e.g., LoRA, P-tuning) for easy use, building a fine-tuning platform that makes it easy for researchers to get started with and use large models. We welcome open-source enthusiasts to initiate any meaningful PR on this repo and integrate as many LLM-related technologies as possible.

Language: Jupyter Notebook, License: Apache-2.0, Stargazers: 2501, Issues: 37, Issues: 97

Deep3DFaceReconstruction

Accurate 3D Face Reconstruction with Weakly-Supervised Learning: From Single Image to Image Set (CVPRW 2019)

Language: Python, License: MIT, Stargazers: 2115, Issues: 70, Issues: 209

gigagan-pytorch

Implementation of GigaGAN, new SOTA GAN out of Adobe. Culmination of nearly a decade of research into GANs

Language: Python, License: MIT, Stargazers: 1632, Issues: 73, Issues: 45

unidiffuser

Code and models for the paper "One Transformer Fits All Distributions in Multi-Modal Diffusion"

Language: Python, License: AGPL-3.0, Stargazers: 1303, Issues: 17, Issues: 32

muse-maskgit-pytorch

Implementation of Muse: Text-to-Image Generation via Masked Generative Transformers, in Pytorch

Language: Python, License: MIT, Stargazers: 821, Issues: 34, Issues: 36

HumanML3D

HumanML3D: A large and diverse 3D human motion-language dataset.

Language: Python, License: MIT, Stargazers: 626, Issues: 8, Issues: 125

so-vits-svc-Chinese-Detaild-Documents

So-VITS-SVC Chinese-language help documentation for local deployment, training, inference, and usage

Language: Jupyter Notebook, License: AGPL-3.0, Stargazers: 536, Issues: 4, Issues: 11

open-musiclm

Implementation of MusicLM, a text-to-music model published by Google Research, with a few modifications.

Language: Python, License: MIT, Stargazers: 488, Issues: 17, Issues: 25

MDT

Masked Diffusion Transformer is the SOTA for image synthesis. (ICCV 2023)

Language: Python, License: Apache-2.0, Stargazers: 459, Issues: 18, Issues: 44

Large-Audio-Models

Keeps track of large models in the audio domain, including speech, singing, and music.

SHERF

Code for our ICCV'2023 paper "SHERF: Generalizable Human NeRF from a Single Image"

Language: Python, License: NOASSERTION, Stargazers: 293, Issues: 33, Issues: 37

ChineseTtsTflite

Offline Android Chinese TTS engine based on TensorflowTTS, used for testing TfLite models.

Language: Java, License: Apache-2.0, Stargazers: 259, Issues: 6, Issues: 11

awesome-conditional-content-generation

Up-to-date resources for conditional content generation, including human motion generation and image or video generation and editing.

TranSpeech

PyTorch Implementation of TranSpeech (ICLR'23): Textless NAR Speech-to-Speech Translation with Bilateral Perturbation

Language: Python, License: MIT, Stargazers: 158, Issues: 16, Issues: 3

TriAAN-VC

TriAAN-VC: Triple Adaptive Attention Normalization for Any-to-Any Voice Conversion

Language: Python, License: MIT, Stargazers: 125, Issues: 7, Issues: 21

SVCC23_FastSVC

Singing Voice Conversion Challenge 2023 Starter Kit: FastSVC Reimplementation

EmoTalk

This is the repository for EmoTalk: Speech-Driven Emotional Disentanglement for 3D Face Animation

LfID

On the Learning Mechanisms in Physical Reasoning, NeurIPS 2022.

Language: Python, Stargazers: 8, Issues: 0, Issues: 0