Tomjackson (ThetaRgo)

ThetaRgo

Geek Repo

Github PK Tool:Github PK Tool

Tomjackson's repositories

annotated_deep_learning_paper_implementations

🧑‍🏫 59 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠

Language:Jupyter NotebookLicense:MITStargazers:1Issues:0Issues:0

ArxivPapers

Code behind Arxiv Papers

Stargazers:0Issues:0Issues:0

audio2photoreal

Code and dataset for photorealistic Codec Avatars driven from audio

License:NOASSERTIONStargazers:0Issues:0Issues:0

Auto-GPT

An experimental open-source attempt to make GPT-4 fully autonomous.

License:MITStargazers:0Issues:0Issues:0

Bark-Voice-Cloning

Bark Voice Cloning and Voice Cloning for Chinese Speech

License:MITStargazers:0Issues:0Issues:0

consistency_models

Official repo for consistency models.

License:MITStargazers:0Issues:0Issues:0

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

DH_live

每个人都能用的数字人

Stargazers:0Issues:0Issues:0

DreamPose

Official implementation of "DreamPose: Fashion Image-to-Video Synthesis via Stable Diffusion"

Stargazers:0Issues:0Issues:0

Dromedary

Dromedary is a helpful, ethical, reliable LLM.

License:Apache-2.0Stargazers:0Issues:0Issues:0

facefusion

Next generation face swapper and enhancer

Language:PythonStargazers:0Issues:0Issues:0

faster-whisper

Faster Whisper transcription with CTranslate2

License:MITStargazers:0Issues:0Issues:0

GeneFace

GeneFace: Generalized and High-Fidelity 3D Talking Face Synthesis; ICLR 2023; Official code

License:MITStargazers:0Issues:0Issues:0

ijkplayer

Android/iOS video player based on FFmpeg n3.4, with MediaCodec, VideoToolbox support.

License:GPL-2.0Stargazers:0Issues:0Issues:0

KAN-TTS

KAN-TTS is a speech-synthesis training framework, please try the demos we have posted at https://modelscope.cn/models?page=1&tasks=text-to-speech

License:MITStargazers:0Issues:0Issues:0

Kolors

Kolors Team

License:Apache-2.0Stargazers:0Issues:0Issues:0

lekshop

B2B2C多语言多商户短视频直播种草阶梯拼团电商系统

License:MITStargazers:0Issues:0Issues:0

LivePortrait

Make one portrait alive!

License:MITStargazers:0Issues:0Issues:0

LLaMA-Efficient-Tuning

Easy-to-use fine-tuning framework using PEFT (PT+SFT+RLHF with QLoRA) (LLaMA-2, BLOOM, Falcon, Baichuan)

License:Apache-2.0Stargazers:0Issues:0Issues:0

metahuman-stream

Real time streaming digital human based on nerf

License:MITStargazers:0Issues:0Issues:0

MiniGPT-4

MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

open-llms

A list of open LLMs available for commercial use

License:Apache-2.0Stargazers:0Issues:0Issues:0

OpenVoice

Instant voice cloning by MyShell.

License:MITStargazers:0Issues:0Issues:0

segment-anything-2

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

License:Apache-2.0Stargazers:0Issues:0Issues:0

SyncTalk

[CVPR 2024] This is the official source for our paper "SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis"

License:NOASSERTIONStargazers:0Issues:0Issues:0

TigerBot

TigerBot: A multi-language multi-task LLM

License:Apache-2.0Stargazers:0Issues:0Issues:0

Vach

Real time streaming talking head

Stargazers:0Issues:0Issues:0

video-parser

Douyin Kuaishou Tiktok live room protocal

Language:JavaScriptLicense:NOASSERTIONStargazers:0Issues:0Issues:0

vits_chinese

Best practice TTS based on BERT and VITS with some Natural Speech Features Of Microsoft; Support streaming out!

License:MITStargazers:0Issues:0Issues:0

Whisper-Finetune

微调Whisper语音识别模型和加速推理,支持Web部署和Android部署

License:Apache-2.0Stargazers:0Issues:0Issues:0