pengyizhou's repositories
fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
audiomentations
A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.
espnet
End-to-End Speech Processing Toolkit
CE-OptimizedLoss
Computes the MWER (minimum WER) Loss with beam search and negative sampling strategy.
torch-audiomentations
Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.
CTC-OptimizedLoss
Computes the MWER (minimum WER) Loss with CTC beam search. Knowledge distillation for CTC loss.
rsync
An open source utility that provides fast incremental file transfer. It also has useful features for backup and restore operations among many other use cases.
multinerf
A Code Release for Mip-NeRF 360, Ref-NeRF, and RawNeRF
leetcode
LeetCode Solutions: A Record of My Problem Solving Journey.
HowToCook
A programmer's guide to cooking at home (in Chinese).
Resemblyzer
A python package to analyze and compare voices with deep learning
resume
An elegant \LaTeX\ résumé template. Mainland China mirror: https://gods.coding.net/p/resume/git
kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
frp
A fast reverse proxy to help you expose a local server behind a NAT or firewall to the internet.
GeneratingNoisySpeechData
A repository containing code for generating noisy speech data from clean data using deep learning methods
xlm-t
Repository for XLM-T, a framework for evaluating multilingual language models on Twitter data
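Programming e-books and other technical books, covering C, C#, Docker, Elasticsearch, Git, Hadoop, HeadFirst, Java, Javascript, jvm, Kafka, Linux, Maven, MongoDB, MyBatis, MySQL, Netty, Nginx, Python, RabbitMQ, Redis, Scala, Solr, Spark, Spring, SpringBoot, SpringCloud, TCPIP, Tomcat, Zookeeper, artificial intelligence, big data, concurrent programming, databases, data mining, new interview questions, architecture design, algorithms, computer science, design patterns, software testing, refactoring and optimization, and more categories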
CallBlocker-1
An iOS call blocker sample app in Swift using CallKit.
espresso
Espresso: A Fast End-to-End Neural Speech Recognition Toolkit
ConferencingSpeech2021-1
Conferencing Speech Challenge
asteroid
The PyTorch-based audio source separation toolkit for researchers
P.808
This is an open-source implementation of the ITU P.808 standard for "Subjective evaluation of speech quality with a crowdsourcing approach" (see https://www.itu.int/rec/T-REC-P.808/en). It uses Amazon Mechanical Turk as the crowdsourcing platform. It includes implementations for Absolute Category Rating (ACR), Degradation Category Rating (DCR), and Comparison Category Rating (CCR).
ConferencingSpeech2021
Conferencing Speech Challenge