HyH-QAQ

followers

following

stars

China University of Mining and Technology

wenchang Campus, China University of Mining and Technology, No. 1 Daxue Road, Xuzhou, Jiangsu Province, China

https://yjsb.cumt.edu.cn/

Hu Hengyu's starred repositories

pyAudioAnalysis

Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications

Language:PythonApache-2.0582400

py-webrtcvad

Python interface to the WebRTC Voice Activity Detector

Language:CNOASSERTION202000

MossFormer2

This is the audio sample repository for speech separation model "MossFormer2".

Language:PythonMIT8100

librosa

Python library for audio and music analysis

Language:PythonISC704300

silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Language:PythonMIT404900

pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Language:Jupyter NotebookMIT597500

NSD-MA-MSE

A pytorch implementation of the paper "ANSD-MA-MSE: Adaptive Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding"

Language:Shell4300

NSD-MS2S

CHIME-7/8 diarization champion system: neural speaker diarization using memory-aware multi-speaker embedding with sequence-to-sequence architecture

Language:Shell6200

LLaMA-Factory

Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)

Language:PythonApache-2.03150100

self-refine

LLMs can generate feedback on their work, use it to improve the output, and repeat this process iteratively.

Language:PythonApache-2.057200

reflexion

[NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learning

Language:PythonMIT231500

git-lfs

Git extension for versioning large files

Language:GoNOASSERTION1285300

FastGPT

FastGPT is a knowledge-based platform built on the LLMs, offers a comprehensive suite of out-of-the-box capabilities such as data processing, RAG retrieval, and visual AI workflow orchestration, letting you easily develop and deploy complex question-answering systems without the need for extensive setup or configuration.

Language:TypeScriptNOASSERTION1714100

chaoxing-sign-cli

超星学习通签到：支持普通签到、拍照签到、手势签到、位置签到、二维码签到，支持自动监测、QQ机器人签到与推送。

Language:TypeScriptMIT117100

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Language:PythonMIT3308500

ChatGLM3

ChatGLM3 series: Open Bilingual Chat LLMs | 开源双语对话语言模型

Language:PythonApache-2.01335500

project-based-learning

Curated list of project-based tutorials