比你笨 (jack139)

jack139

Geek Repo

Location:Amoy, China

Home Page:https://jack139.top

Github PK Tool:Github PK Tool

比你笨's starred repositories

whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Language:PythonLicense:MITStargazers:65498Issues:548Issues:0

ComfyUI

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

Language:PythonLicense:GPL-3.0Stargazers:44099Issues:348Issues:2620

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Language:PythonLicense:Apache-2.0Stargazers:35998Issues:348Issues:1736

MockingBird

🚀AI拟声: 5秒内克隆您的声音并生成任意语音内容 Clone a voice in 5 seconds to generate arbitrary speech in real-time

Language:PythonLicense:NOASSERTIONStargazers:34675Issues:310Issues:877

bark

🔊 Text-Prompted Generative Audio Model

Language:Jupyter NotebookLicense:MITStargazers:34077Issues:318Issues:425

so-vits-svc

SoftVC VITS Singing Voice Conversion

Language:PythonLicense:AGPL-3.0Stargazers:25000Issues:174Issues:130

Wav2Lip

This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020. For HD commercial model, please try out Sync Labs

video-retalking

[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild

Language:PythonLicense:Apache-2.0Stargazers:6202Issues:70Issues:234

LLM-Agent-Paper-List

The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.

Time-Series-Library

A Library for Advanced Deep Time Series Models.

Language:PythonLicense:MITStargazers:5469Issues:63Issues:425

DiffSinger

DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code

Language:PythonLicense:MITStargazers:4208Issues:43Issues:100

wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

Language:PythonLicense:Apache-2.0Stargazers:3979Issues:91Issues:1007

FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Language:PythonLicense:NOASSERTIONStargazers:3955Issues:48Issues:841

GeneFace

GeneFace: Generalized and High-Fidelity 3D Talking Face Synthesis; ICLR 2023; Official code

Language:PythonLicense:MITStargazers:2476Issues:50Issues:281

protocol

Specification of the Farcaster Protocol

Qwen-Audio

The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.

Language:PythonLicense:NOASSERTIONStargazers:1301Issues:25Issues:62

fastsdcpu

Fast stable diffusion on CPU

Language:PythonLicense:MITStargazers:1040Issues:19Issues:138

deep_learning_and_the_game_of_go

Code and other material for the book "Deep Learning and the Game of Go"

ER-NeRF

[ICCV'23] Efficient Region-Aware Neural Radiance Fields for High-Fidelity Talking Portrait Synthesis

Language:PythonLicense:MITStargazers:930Issues:16Issues:152

RAD-NeRF

Real-time Neural Radiance Talking Portrait Synthesis via Audio-spatial Decomposition

Language:PythonLicense:MITStargazers:866Issues:30Issues:94

betago

BetaGo: AlphaGo for the masses, live on GitHub.

Language:PythonLicense:MITStargazers:675Issues:56Issues:28

Yuan-2.0

Yuan 2.0 Large Language Model

Language:PythonLicense:NOASSERTIONStargazers:674Issues:5Issues:91

representation-engineering

Representation Engineering: A Top-Down Approach to AI Transparency

Language:Jupyter NotebookLicense:MITStargazers:661Issues:29Issues:41

Lipreading-DenseNet3D

DenseNet3D Model In "LRW-1000: A Naturally-Distributed Large-Scale Benchmark for Lip Reading in the Wild", https://arxiv.org/abs/1810.06990

copycat

Modern port of Melanie Mitchell's and Douglas Hofstadter's Copycat

Language:PythonLicense:MITStargazers:113Issues:14Issues:0

co.py.cat

co.py.cat extends Hofstadter's, pythonically

Language:PythonLicense:MITStargazers:53Issues:8Issues:7

copycat

A translation of Melanie Mitchell's original Copycat project from Lisp to Python.

Language:PythonLicense:GPL-2.0Stargazers:41Issues:7Issues:0