Qibaba's repositories
Restormer-1
A PyTorch implementation of Restormer based on CVPR 2022 paper "Restormer: Efficient Transformer for High-Resolution Image Restoration"
mathlib
Lean mathematical components library
rocBLAS
Next generation BLAS implementation for ROCm platform
contentvec
speech self-supervised representations
chinese_speech_pretrain
chinese speech pretrained models
jax
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
openspeech
Open-Source Toolkit for End-to-End Speech Recognition leveraging PyTorch-Lightning and Hydra.
sherpa
Streaming and non-streaming ASR server in Python
kaldifeat
Kaldi-compatible online & offline feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd - Provide C++ & Python API
CppCoreGuidelines
The C++ Core Guidelines are a set of tried-and-true guidelines, rules, and best practices about coding in C++
lhotse
Tools for handling speech data in machine learning projects.
kaldilm
Python wrapper for kaldi's arpa2fst
SpeechSplit2
Official implementation of SpeechSplit2
CDFSE_FastSpeech2
The Official Implementation of “Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synthesis”
CLBlast
Tuned OpenCL BLAS
wespeaker
Production First and Production Ready Speaker Recognition Toolkit
3m-asr
3M: Multi-loss, Multi-path and Multi-level Neural Networks for speech recognition
Efficient-Segmentation-Networks
Lightweight models for real-time semantic segmentationon PyTorch (include SQNet, LinkNet, SegNet, UNet, ENet, ERFNet, EDANet, ESPNet, ESPNetv2, LEDNet, ESNet, FSSNet, CGNet, DABNet, Fast-SCNN, ContextNet, FPENet, etc.)
voice_activity_detection
Voice Activity Detection based on Deep Learning & TensorFlow
kaldi-onnx
Kaldi model converter to ONNX
server
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
SimCLUE
大规模语义理解与匹配数据集。可用于无监督对比学习、半监督学习等构建中文领域效果最好的预训练模型
ssr_eval
Evaluation and Benchmarking of Speech Super-resolution Methods
DeepFaceLive
Real-time face swap for PC streaming or video calls
QA-Survey-CN
北京航空航天大学大数据高精尖中心自然语言处理研究团队开展了智能问答的研究与应用总结。包括基于知识图谱的问答(KBQA),基于文本的问答系统(TextQA),基于表格的问答系统(TableQA)、基于视觉的问答系统(VisualQA)和机器阅读理解(MRC)等,每类任务分别对学术界和工业界进行了相关总结。
awesome-semantic-segmentation-pytorch
Semantic Segmentation on PyTorch (include FCN, PSPNet, Deeplabv3, Deeplabv3+, DANet, DenseASPP, BiSeNet, EncNet, DUNet, ICNet, ENet, OCNet, CCNet, PSANet, CGNet, ESPNet, LEDNet, DFANet)
Squeezeformer
PyTorch implementation of "Squeezeformer: An Efficient Transformer for Automatic Speech Recognition"
kaldi_native_io
python wrapper for kaldi's native I/O
External-Attention-pytorch
🍀 Pytorch implementation of various Attention Mechanisms, MLP, Re-parameter, Convolution, which is helpful to further understand papers.⭐⭐⭐