Comedian1926's repositories
awesome_LLMs_interview_notes
LLMs interview notes and answers:该仓库主要记录大模型(LLMs)算法工程师相关的面试题和参考答案
Bert-VITS2
vits2 backbone with bert
carefree-creator
AI magics meet Infinite draw board.
Chat-Haruhi-Suzumiya
Chat凉宫春日, 由李鲁鲁, 冷子昂等同学开发的模仿二次元对话的聊天机器人。
Chinese-Text-Classification-Pytorch
中文文本分类,TextCNN,TextRNN,FastText,TextRCNN,BiLSTM_Attention,DPCNN,Transformer,基于pytorch,开箱即用。
DreamSound
Code for Investigating Personalization Methods in Text to Music Generation
facechain
FaceChain is a deep-learning toolchain for generating your Digital-Twin.
Learned-Motion-Matching
A neural-network-based generative model for video-game characters animations
lightweight-human-pose-estimation.pytorch
Fast and accurate human pose estimation in PyTorch. Contains implementation of "Real-time 2D Multi-Person Pose Estimation on CPU: Lightweight OpenPose" paper.
EmotiVoice
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
FaceStudio
Put Your Face Everywhere in Seconds.
generative-models
Generative Models by Stability AI
modelscope-agent
ModelScope-Agent: An agent framework connecting models in ModelScope with the world
neo-ai-dlr
Neo-AI-DLR is a common runtime for machine learning models compiled by AWS SageMaker Neo, TVM, or TreeLite.
openai-quickstart
A comprehensive guide to understanding and implementing large language models with hands-on examples using LangChain for GenAI applications.
openposeFPGA_mobilenet
This is an HLS-based FPGA accelerator implementation for openpose application.
PytorchToCaffe
Pytorch model to caffe model, supported pytorch 0.3, 0.3.1, 0.4, 0.4.1 ,1.0 , 1.0.1 , 1.2 ,1.3 .notice that only pytorch 1.1 have some bugs
stable-diffusion-webui
Stable Diffusion web UI
strongtrack
A python tool with facial landmark annotation and coefficient finder
StyleTTS2
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
talking-head-anime-3-demo
Demo Programs for the "Talking Head(?) Anime from a Single Image 3: Now the Body Too" Project
tango
Codes and Model of the paper "Text-to-Audio Generation using Instruction Tuned LLM and Latent Diffusion Model"
vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech