Kai's repositories

ISCSLP2022-CSSD-Challenge

ISCSLP2022 CSSD Challenge

Language:PythonStargazers:1Issues:1Issues:0

kli017.github.io

Personal Blog

Language:SCSSStargazers:1Issues:0Issues:0

wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

Language:C++License:Apache-2.0Stargazers:1Issues:1Issues:0

caffe-segnet-cudnn5

This repository was a fork of BVLC/caffe and includes the upsample, bn, dense_image_data and softmax_with_loss (with class weighting) layers of caffe-segnet (https://github.com/alexgkendall/caffe-segnet) to run SegNet with cuDNN version 5.

Language:C++License:NOASSERTIONStargazers:0Issues:2Issues:0

deep-residual-networks

Deep Residual Learning for Image Recognition

License:MITStargazers:0Issues:2Issues:0

EEND-vector-clustering

This repository contains a set of codes to run (i.e., train, perform inference with, evaluate) a diarization method called EEND-vector-clustering.

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

kaldi

This is the official location of the Kaldi project.

Language:ShellLicense:NOASSERTIONStargazers:0Issues:0Issues:0

RNN-for-Human-Activity-Recognition-using-2D-Pose-Input

Activity Recognition from 2D pose using an LSTM RNN

Language:Jupyter NotebookStargazers:0Issues:1Issues:0

deeplab_v2

基于v2版本的deeplab,使用VGG16模型,在VOC2012,Pascal-context,NYU-v2等多个数据集上进行训练

Language:Jupyter NotebookStargazers:0Issues:2Issues:0

espnet

End-to-End Speech Processing Toolkit

Language:ShellLicense:Apache-2.0Stargazers:0Issues:1Issues:0

Fine_Grained_car

web demo of fine grained car model classification

Stargazers:0Issues:2Issues:0

FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models.

License:NOASSERTIONStargazers:0Issues:0Issues:0

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

License:MITStargazers:0Issues:0Issues:0

HyperVID

开源移动端车型识别[Experimental] Mobile Plateform Vehicle Identification Model

Language:C++Stargazers:0Issues:1Issues:0
Language:ShellStargazers:0Issues:2Issues:0

KAN-TTS

KAN-TTS is a speech-synthesis training framework, please try the demos we have posted at https://modelscope.cn/models?page=1&tasks=text-to-speech

License:MITStargazers:0Issues:0Issues:0

kli017

Personal Blog Site

Language:SCSSLicense:MITStargazers:0Issues:0Issues:0

langchain-ChatGLM

langchain-ChatGLM, local knowledge based ChatGLM with langchain | 基于本地知识库的 ChatGLM 问答

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

lexicon

lexicon for word seg and word pronunciation

Stargazers:0Issues:2Issues:0

libhv

🔥 比libevent、libuv更易用的网络库。A c/c++ network library for developing TCP/UDP/SSL/HTTP/WebSocket/MQTT client/server.

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

loguru

Python logging made (stupidly) simple

License:MITStargazers:0Issues:0Issues:0

postgress-boot

Postgress With Spring boot

Language:JavaStargazers:0Issues:0Issues:0

Printed_chinesechar_deeprecog

deep network for common used printed chinese character classification

Stargazers:0Issues:0Issues:0

resnet-protofiles

Caffe Protofiles for MSRA ResNet: train prototxt

Language:PythonStargazers:0Issues:2Issues:0

transducer-tutorial

Example code for a neural transducer model.

Stargazers:0Issues:0Issues:0

VALL-E-X

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io

License:MITStargazers:0Issues:0Issues:0

vosk-server

WebSocket and gRPC server for speech recognition based on Kaldi and Vosk libraries

Language:PythonLicense:Apache-2.0Stargazers:0Issues:1Issues:0

wekws_dev

Production First and Production Ready End-to-End Keyword Spotting Toolkit

License:Apache-2.0Stargazers:0Issues:0Issues:0
License:Apache-2.0Stargazers:0Issues:0Issues:0

wfrest

C++ Web Framework REST API

License:Apache-2.0Stargazers:0Issues:0Issues:0