boji123's repositories
pytorch-kaldi-asr
This project aimming to provide a feature inference for kaldi that allows us to train the neural network with pytorch
mnist-with-numpy
building neural network from the buttom using numpy
attention-is-all-you-need-pytorch
A PyTorch implementation of the Transformer model in "Attention is All You Need".
mouse_driver
鼠标驱动PC端,与邱昱的安卓端结合使用
socket-transfer-station
一个基于C++写的linux服务器中转示例,该示例可以将数据通过模拟的client发出,然后由server接收并转发到其他服务器
analytics-zoo
Distributed Tensorflow, Keras and PyTorch on Apache Spark/Flink & Ray
awesome-diarization
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
CosyVoice
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
FunASR
A Fundamental End-to-End Speech Recognition Toolkit
kaldi-io-for-python
Python functions for reading kaldi data formats. Useful for rapid prototyping with python.
pyaec
simple and efficient python implemention of a series of adaptive filters. including time domain adaptive filters(lms、nlms、rls、ap、kalman)、nonlinear adaptive filters(volterra filter、functional link adaptive filters)、frequency domain adaptive filters(frequency domain adaptive filter、frequency domain kalman filter) for acoustic echo cancellation.
pytorch-kaldi
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.
RealChar
🎙️🤖Create, Customize and Talk to your AI Character/Companion in Realtime (All in One Codebase!). Have a natural seamless conversation with AI everywhere (mobile, web and terminal) using LLM OpenAI GPT3.5/4, Anthropic Claude2, Chroma Vector DB, Whisper Speech2Text, ElevenLabs Text2Speech🎙️🤖
server
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
spec_augment
🔦 A Pytorch implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition
SpecAugment
A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain
state-of-the-art-result-for-machine-learning-problems
This repository provides state of the art (SoTA) results for all machine learning problems. We do our best to keep this repository up to date. If you do find a problem's SoTA result is out of date or missing, please raise this as an issue or submit Google form (with this information: research paper name, dataset, metric, source code and year). We will fix it immediately.
tensorflow
An Open Source Machine Learning Framework for Everyone
VITA
✨✨VITA: Towards Open-Source Interactive Omni Multimodal LLM