张大成's repositories
7days-golang
7 days golang apps from scratch (web framework Gee, distributed cache GeeCache, object relational mapping ORM framework GeeORM etc) 7天用Go动手写/从零实现系列
audio-SNR
Mixing an audio file with a noise file at any Signal-to-Noise Ratio (SNR)
awesome-go-cn
一个很棒的Go框架、库和软件的中文收录大全。:alarm_clock:脚本定期与英文文档同步,包含了各工程star数/最近更新时间,助您快速发现优质项目。Awesome Go~
chinese-poetry
最全中华古诗词数据库, 唐宋两朝近一万四千古诗人, 接近5.5万首唐诗加26万宋诗. 两宋时期1564位词人,21050首词。
CPlusPlusThings
C++那些事
DeepLearning-500-questions
深度学习500问,以问答形式对常用的概率知识、线性代数、机器学习、深度学习、计算机视觉等热点问题进行阐述,以帮助自己及有需要的读者。 全书分为18个章节,50余万字。由于水平有限,书中不妥之处恳请广大读者批评指正。 未完待续............ 如有意合作,联系scutjy2015@163.com 版权所有,违权必究 Tan 2018.06
Deformable-3D-Gaussians
[CVPR 2024] Official implementation of "Deformable 3D Gaussians for High-Fidelity Monocular Dynamic Scene Reconstruction"
denoiser
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech Enhancement in the Waveform Domain. In which, we present a causal speech enhancement model working on the raw waveform that runs in real-time on a laptop CPU. The proposed model is based on an encoder-decoder architecture with skip-connections. It is optimized on both time and frequency domains, using multiple loss functions. Empirical evidence shows that it is capable of removing various kinds of background noise including stationary and non-stationary noises, as well as room reverb. Additionally, we suggest a set of data augmentation techniques applied directly on the raw waveform which further improve model performance and its generalization abilities.
espnet
End-to-End Speech Processing Toolkit
HanLP
中文分词 词性标注 命名实体识别 依存句法分析 语义依存分析 新词发现 关键词短语提取 自动摘要 文本分类聚类 拼音简繁 自然语言处理
learn-regex
Learn regex the easy way
lstm-crf-pytorch
LSTM-CRF in PyTorch
mediapipe
MediaPipe is a cross-platform framework for building multimodal applied machine learning pipelines
Montreal-Forced-Aligner
Command line utility for forced alignment using Kaldi
nara_wpe
Different implementations of "Weighted Prediction Error" for speech dereverberation
nonparaSeq2seqVC_code
Implementation code of non-parallel sequence-to-sequence VC
nvidia-benchmarks
NVIDIA Deep Learning Benchmarks
pytorch_lstmcrf
Pytorch implementation of LSTM-CRF for named entity recognition
PyTorch_Speaker_Verification
PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" by Wan, Li et al.
Real-Time-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Resemblyzer
A python package to analyze and compare voices with deep learning
sha-rnn
Single Headed Attention RNN - "Stop thinking with your head"
speechmetrics
A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR
tensorflow-handbook
简单粗暴 TensorFlow 2.0 | A Concise Handbook of TensorFlow 2.0
vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
VOLDOR
VOLDOR-SLAM is a real-time dense-indirect SLAM system takes dense optical flows as input that supports monocular, stereo and RGB-D video sequence.
WaveGrad
Implementation of Google Brain's WaveGrad high-fidelity vocoder (paper: https://arxiv.org/pdf/2009.00713.pdf). First implementation on GitHub.