Xiangrui Liu's starred repositories

WritingAIPaper

Writing AI Conference Papers: A Handbook for Beginners

Stargazers: 973 | Issues: 0

ContextMenuManager

🖱️ A pure Windows right-click context menu manager

Language: C# | License: GPL-3.0 | Stargazers: 12142 | Issues: 0

gpt_academic

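Provides a practical interactive interface for LLMs such as GPT/GLM, specially optimized for reading, polishing, and writing papers. Modular design; supports custom shortcut buttons and function plugins; supports analysis and self-translation of projects in Python, C++, and other languages; PDF/LaTeX paper translation and summarization; supports querying multiple LLMs in parallel; supports local models such as chatglm3. Integrates Tongyi Qianwen, deepseekcoder, iFlytek Spark, ERNIE Bot, llama2, rwkv, claude2, moss, and others.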

Language: Python | License: GPL-3.0 | Stargazers: 64398 | Issues: 0

ModelSoups

ModelSoups for TensorFlow 2 and Torch

Language: Jupyter Notebook | License: MIT | Stargazers: 46 | Issues: 0

FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing, etc.

Language: Python | License: NOASSERTION | Stargazers: 6143 | Issues: 0

awesome-diarization

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

License: Apache-2.0 | Stargazers: 1577 | Issues: 0

WavAugment

A library for speech data augmentation in the time domain

Language: Python | License: MIT | Stargazers: 635 | Issues: 0

knowledge-distillation-pytorch

A PyTorch implementation for exploring deep and shallow knowledge distillation (KD) experiments with flexibility

Language: Python | License: MIT | Stargazers: 1840 | Issues: 0

6DRepNet

Official PyTorch implementation of 6DRepNet: 6D Rotation representation for unconstrained head pose estimation.

Language: Python | License: MIT | Stargazers: 536 | Issues: 0

mt-dnn

Multi-Task Deep Neural Networks for Natural Language Understanding

Language: Python | License: MIT | Stargazers: 2227 | Issues: 0

auto_avsr

Auto-AVSR: Lip-Reading Sentences Project

Language: Python | License: Apache-2.0 | Stargazers: 168 | Issues: 0

INTERSPEECH-2023-24-Papers

INTERSPEECH 2023-2024 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023-24 conference. Explore the latest advances in speech and language processing. Code included. Star the repository to support the advancement of speech technology!

License: MIT | Stargazers: 631 | Issues: 0
Language: Python | Stargazers: 16 | Issues: 0
Language: Python | License: MIT | Stargazers: 240 | Issues: 0

audio-diffusion

Apply diffusion models using the new Hugging Face diffusers package to synthesize music instead of images.

Language: Jupyter Notebook | License: GPL-3.0 | Stargazers: 704 | Issues: 0

diffwave

DiffWave is a fast, high-quality neural vocoder and waveform synthesizer.

Language: Python | License: Apache-2.0 | Stargazers: 756 | Issues: 0

Awesome-Diffusion-Models

A collection of resources and papers on Diffusion Models

Language: HTML | License: MIT | Stargazers: 10827 | Issues: 0

sgmse

Score-based Generative Models (Diffusion Models) for Speech Enhancement and Dereverberation

Language: Python | License: MIT | Stargazers: 463 | Issues: 0

DEEP-FSMN

TensorFlow version of DFSMN

Language: Python | Stargazers: 1 | Issues: 0

espnet

End-to-End Speech Processing Toolkit

Language: Python | License: Apache-2.0 | Stargazers: 8336 | Issues: 0

DeepLearningForAudioWithPython

Code and slides for the "Deep Learning (For Audio) With Python" course on TheSoundOfAI YouTube channel.

Language: Python | License: MIT | Stargazers: 630 | Issues: 0

leetcode_101

LeetCode 101: Work through LeetCode problems with ease, together (C++)

Stargazers: 8202 | Issues: 0

Speech_emotion_recognition_BLSTM

Bidirectional LSTM network for speech emotion recognition.

Language: Python | License: MIT | Stargazers: 260 | Issues: 0

Python-100-Days

Python - 100 Days from Novice to Master

Language: Python | Stargazers: 155353 | Issues: 0