Hui Wang's starred repositories

gpt_academic

为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, moss等。

Language:PythonLicense:GPL-3.0Stargazers:61298Issues:256Issues:1510

annotated_deep_learning_paper_implementations

🧑‍🏫 60 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠

Language:PythonLicense:MITStargazers:51296Issues:431Issues:128

yolov5

YOLOv5 🚀 in PyTorch > ONNX > CoreML > TFLite

Language:PythonLicense:AGPL-3.0Stargazers:48361Issues:364Issues:9154

handson-ml2

A series of Jupyter notebooks that walk you through the fundamentals of Machine Learning and Deep Learning in Python using Scikit-Learn, Keras and TensorFlow 2.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:27278Issues:656Issues:510

pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Language:Jupyter NotebookLicense:MITStargazers:5464Issues:68Issues:972

ccf-deadlines

⏰ Collaboratively track deadlines of conferences recommended by CCF (Website, Python Cli, Wechat Applet) / If you find it useful, please star this project, thanks~

Language:VueLicense:MITStargazers:5321Issues:22Issues:72

Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

Language:PythonLicense:MITStargazers:4180Issues:56Issues:125

awesome-speech-recognition-speech-synthesis-papers

Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)

LibMTL

A PyTorch Library for Multi-Task Learning

Language:PythonLicense:MITStargazers:1856Issues:18Issues:73

uncertainty-toolbox

Uncertainty Toolbox: a Python toolbox for predictive uncertainty quantification, calibration, metrics, and visualization

Language:PythonLicense:MITStargazers:1761Issues:33Issues:33

Multilingual_Text_to_Speech

An implementation of Tacotron 2 that supports multilingual experiments with parameter-sharing, code-switching, and voice cloning.

Language:PythonLicense:MITStargazers:817Issues:30Issues:79

CSSummerCamp2021

关于2021年CS保研夏令营通知公告的汇总。欢迎大家积极分享夏令营信息,资瓷一下互联网精神吼不吼啊?

book-text-to-speech

A book about Text-to-Speech (TTS) in Chinese.

Language:TeXLicense:Apache-2.0Stargazers:560Issues:7Issues:4

DeepSpeaker-pytorch

Speaker embedding(verification and recognition) using Pytorch

Language:PythonLicense:MITStargazers:361Issues:19Issues:16

calibration-framework

The net:cal calibration framework is a Python 3 library for measuring and mitigating miscalibration of uncertainty estimates, e.g., by a neural network.

Language:PythonLicense:Apache-2.0Stargazers:325Issues:7Issues:47

pytorch_xvectors

Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196

Language:PythonLicense:MITStargazers:302Issues:8Issues:15

movie_recommend_system

:movie_camera: 一个简单的电影推荐系统

Language:Jupyter NotebookLicense:MITStargazers:223Issues:4Issues:12

UTMOS22

UT-Sarulab MOS prediction system using SSL models

Language:PythonLicense:MITStargazers:143Issues:7Issues:10

Audio-Classification

Pytorch code for "Rethinking CNN Models for Audio Classification"

LDNet

Official implementation of the paper: "LDNet: Unified Listener Dependent Modeling in MOS Prediction for Synthetic Speech"

Language:PythonLicense:MITStargazers:59Issues:4Issues:3

Prosody_Prediction

Predict prosody labels for Chinese sentences.

baize

:fire:协程化的轻量级高性能网络库:rocket:

Language:C++License:MITStargazers:39Issues:2Issues:0

cppip

c++ package management tool

License:MITStargazers:1Issues:0Issues:0

codesnippet

code snippets for sharing and learning

Language:C++License:MITStargazers:1Issues:1Issues:0