lxrswdd

Xiangrui Liu's starred repositories

WritingAIPaper

Writing AI Conference Papers: A Handbook for Beginners

ContextMenuManager

🖱️ 纯粹的Windows右键菜单管理程序

Language:C#GPL-3.01214200

为GPT/GLM等LLM大语言模型提供实用化交互接口，特别优化论文阅读/润色/写作体验，模块化设计，支持自定义快捷按钮&函数插件，支持Python和C++等项目剖析&自译解功能，PDF/LaTex论文翻译&总结功能，支持并行问询多种LLM模型，支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, moss等。

Language:PythonGPL-3.06439800

ModelSoups

ModelSoups for Tensorflow2 and Torch

Language:Jupyter NotebookMIT4600

FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Language:PythonNOASSERTION614300

awesome-diarization

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

Apache-2.0157700

WavAugment

A library for speech data augmentation in time-domain

Language:PythonMIT63500

knowledge-distillation-pytorch

A PyTorch implementation for exploring deep and shallow knowledge distillation (KD) experiments with flexibility

Language:PythonMIT184000

6DRepNet

Official Pytorch implementation of 6DRepNet: 6D Rotation representation for unconstrained head pose estimation.

Language:PythonMIT53600

mt-dnn

Multi-Task Deep Neural Networks for Natural Language Understanding

Language:PythonMIT222700

auto_avsr

Auto-AVSR: Lip-Reading Sentences Project

Language:PythonApache-2.016800

INTERSPEECH-2023-24-Papers

INTERSPEECH 2023-2024 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023-24 conference. Explore the latest advances in speech and language processing. Code included. Star the repository to support the advancement of speech technology!

MIT63100