mtxing69's starred repositories

VoiceprintRecognition-Pytorch

This project uses a variety of advanced voiceprint recognition models such as EcapaTdnn, ResNetSE, ERes2Net, CAM++, etc. It is not excluded that more models will be supported in the future. At the same time, this project also supports MelSpectrogram, Spectrogram data preprocessing methods

Language:PythonLicense:Apache-2.0Stargazers:708Issues:0Issues:0

MASR

Pytorch实现的流式与非流式的自动语音识别框架,同时兼容在线和离线识别,目前支持Conformer、Squeezeformer、DeepSpeech2模型,支持多种数据增强方法。

Language:PythonLicense:Apache-2.0Stargazers:576Issues:0Issues:0

PaddlePaddle-DeepSpeech

基于PaddlePaddle实现的语音识别,中文语音识别。项目完善,识别效果好。支持Windows,Linux下训练和预测,支持Nvidia Jetson开发板预测。

Language:PythonLicense:Apache-2.0Stargazers:658Issues:0Issues:0

PPASR

基于PaddlePaddle实现端到端中文语音识别,从入门到实战,超简单的入门案例,超实用的企业项目。支持当前最流行的DeepSpeech2、Conformer、Squeezeformer模型

Language:PythonLicense:Apache-2.0Stargazers:797Issues:0Issues:0

speech_recognition

中文语音识别

Language:PythonStargazers:790Issues:0Issues:0

torchlm

💎A high level pipeline for face landmarks detection, it supports training, evaluating, exporting, inference(Python/C++) and 100+ data augmentations, can easily install via pip.

Language:PythonLicense:MITStargazers:239Issues:0Issues:0

lite.ai.toolkit

🛠 A lite C++ toolkit of awesome AI models, support ONNXRuntime, MNN, TNN, NCNN and TensorRT.

Language:C++License:GPL-3.0Stargazers:3543Issues:0Issues:0

BullshitGenerator

Needs to generate some texts to test if my GUI rendering codes good or not. so I made this.

Language:JavaScriptLicense:NOASSERTIONStargazers:15699Issues:0Issues:0

ChineseBQB

🇨🇳 Chinese sticker pack,More joy / 表情包的博物馆, Github最有毒的仓库, **表情包大集合, 聚欢乐~

Language:JavaScriptStargazers:12067Issues:0Issues:0

silero-vad

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Language:PythonLicense:MITStargazers:3532Issues:0Issues:0

deep-head-pose

:fire::fire: Deep Learning Head Pose Estimation using PyTorch.

Language:PythonLicense:NOASSERTIONStargazers:1554Issues:0Issues:0

virtual_try_on_use_deep_learning

使用深度学习算法实现虚拟试衣镜,结合了人体姿态估计、人体分割、几何匹配和GAN,四种模型。仅仅只依赖opencv库就能运行

Language:PythonStargazers:238Issues:0Issues:0

Speech-Emotion-Analyzer

The neural network model is capable of detecting five different male/female emotions from audio speeches. (Deep Learning, NLP, Python)

Language:Jupyter NotebookLicense:MITStargazers:1272Issues:0Issues:0

ConvNeXt

Code release for ConvNeXt model

Language:PythonLicense:MITStargazers:5663Issues:0Issues:0

handwriting-synthesis

Handwriting Synthesis with RNNs ✏️

Language:PythonStargazers:4235Issues:0Issues:0

LxgwWenKai

An open-source Chinese font derived from Fontworks' Klee One. 一款开源中文字体,基于 FONTWORKS 出品字体 Klee One 衍生。

Language:BatchfileLicense:OFL-1.1Stargazers:17098Issues:0Issues:0

mandarin-tts

Chinese Mandarin tts text-to-speech 中文 (普通话) 语音 合成 , by fastspeech 2 , implemented in pytorch, using waveglow as vocoder, with biaobei and aishell3 datasets

Language:PythonStargazers:455Issues:0Issues:0

TTS-Clone-Chinese

基于Real-Time-Voice-Cloning语音克隆中文普通话实现

Language:PythonLicense:NOASSERTIONStargazers:210Issues:0Issues:0

awesome-multimodal-ml

Reading list for research topics in multimodal machine learning

License:MITStargazers:5735Issues:0Issues:0

DeepClustering

Methods and Implements of Deep Clustering

Stargazers:2760Issues:0Issues:0

LSTR

This is an official repository of End-to-end Lane Shape Prediction with Transformers.

Language:PythonLicense:BSD-3-ClauseStargazers:638Issues:0Issues:0

Container

Official Code Release for Container : Context Aggregation Network

Stargazers:46Issues:0Issues:0

ml-visuals

🎨 ML Visuals contains figures and templates which you can reuse and customize to improve your scientific writing.

License:MITStargazers:12884Issues:0Issues:0

PyTorchConv3D

I3D and 3D-ResNets in PyTorch

Language:PythonLicense:Apache-2.0Stargazers:184Issues:0Issues:0

pseudo-3d-pytorch

pytorch version of pseudo-3d-residual-networks(P-3D), pretrained model is supported

Language:PythonLicense:MITStargazers:450Issues:0Issues:0

pseudo-3d-residual-networks

Pseudo-3D Convolutional Residual Networks for Video Representation Learning

Language:C++License:MITStargazers:352Issues:0Issues:0

ESC-50

ESC-50: Dataset for Environmental Sound Classification

Language:PythonLicense:NOASSERTIONStargazers:1317Issues:0Issues:0

crnn-audio-classification

UrbanSound classification using Convolutional Recurrent Networks in PyTorch

Language:PythonLicense:MITStargazers:378Issues:0Issues:0

pytorch

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Language:PythonLicense:NOASSERTIONStargazers:80917Issues:0Issues:0

ThreadPool

A simple C++11 Thread Pool implementation

Language:C++License:ZlibStargazers:7730Issues:0Issues:0