yangwenwen's starred repositories
awesome-asr-contextualization
A curated list of awesome papers on contextualizing E2E ASR outputs
speech-adapters
Codes and datasets for our ICASSP2023 paper, Evaluating parameter-efficient transfer learning approaches on SURE benchmark for speech understanding
Speech-Prompts-Adapters
This Repository surveys the paper focusing on Prompting and Adapters for Speech Processing.
Chinese-LLaMA-Alpaca
中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)
LLMsNineStoryDemonTower
【LLMs九层妖塔】分享 LLMs在自然语言处理(ChatGLM、Chinese-LLaMA-Alpaca、小羊驼 Vicuna、LLaMA、GPT4ALL等)、信息检索(langchain)、语言合成、语言识别、多模态等领域(Stable Diffusion、MiniGPT-4、VisualGLM-6B、Ziya-Visual等)等 实战与经验。
llm-foundry
LLM training code for Databricks foundation models
Leaderboard
SpeechIO Leaderboard: a large, robust, comprehensive, benchmarking platform for Automatic Speech Recognition.
wer_are_we
Attempt at tracking states of the arts and recent results (bibliography) on speech recognition.
audio_visual_speech_enhancement
Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments
SpecAugment
A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain
py-webrtcvad
Python interface to the WebRTC Voice Activity Detector
VisualizeMNIST
This project is real-time visualization of a network recognizing digits from user's input.
Lipreading-DenseNet3D
DenseNet3D Model In "LRW-1000: A Naturally-Distributed Large-Scale Benchmark for Lip Reading in the Wild", https://arxiv.org/abs/1810.06990
Faceswap-Deepfake-Pytorch
Faceswap with Pytorch or DeepFake with Pytorch
speech_separation
Include some core functions and model to handle speech separation
Looking-to-Listen-at-the-Cocktail-Party
Executable code based on Google articles
awesome-Face_Recognition
papers about Face Detection; Face Alignment; Face Recognition && Face Identification && Face Verification && Face Representation; Face Reconstruction; Face Tracking; Face Super-Resolution && Face Deblurring; Face Generation && Face Synthesis; Face Transfer; Face Anti-Spoofing; Face Retrieval;
facenet-pytorch
Pretrained Pytorch face detection (MTCNN) and facial recognition (InceptionResnet) models