kli017

Kai's repositories

ISCSLP2022-CSSD-Challenge

ISCSLP2022 CSSD Challenge

Language:Python1 10

wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

Language:C++Apache-2.01 10

This repository was a fork of BVLC/caffe and includes the upsample, bn, dense_image_data and softmax_with_loss (with class weighting) layers of caffe-segnet (https://github.com/alexgkendall/caffe-segnet) to run SegNet with cuDNN version 5.

Language:C++NOASSERTION020

deep-residual-networks

Deep Residual Learning for Image Recognition

MIT020

EEND-vector-clustering

This repository contains a set of codes to run (i.e., train, perform inference with, evaluate) a diarization method called EEND-vector-clustering.

Language:PythonNOASSERTION000

kaldi

This is the official location of the Kaldi project.

Language:ShellNOASSERTION000

RNN-for-Human-Activity-Recognition-using-2D-Pose-Input

Activity Recognition from 2D pose using an LSTM RNN

Language:Jupyter Notebook010

deeplab_v2

基于v2版本的deeplab,使用VGG16模型，在VOC2012，Pascal-context，NYU-v2等多个数据集上进行训练

Language:Jupyter Notebook020

espnet

End-to-End Speech Processing Toolkit

Language:ShellApache-2.0010

Fine_Grained_car

web demo of fine grained car model classification

020

FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models.

NOASSERTION000

GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

MIT000

HyperVID

开源移动端车型识别[Experimental] Mobile Plateform Vehicle Identification Model

Language:C++010

kaldi-trunk

Language:Shell020

KAN-TTS

KAN-TTS is a speech-synthesis training framework, please try the demos we have posted at https://modelscope.cn/models?page=1&tasks=text-to-speech

MIT000

kli017

Personal Blog Site

Language:SCSSMIT000

langchain-ChatGLM

langchain-ChatGLM, local knowledge based ChatGLM with langchain ｜基于本地知识库的 ChatGLM 问答

Language:PythonApache-2.0000

lexicon

lexicon for word seg and word pronunciation

020

libhv

🔥 比libevent、libuv更易用的网络库。A c/c++ network library for developing TCP/UDP/SSL/HTTP/WebSocket/MQTT client/server.

BSD-3-Clause000

loguru

Python logging made (stupidly) simple

MIT000

postgress-boot

Postgress With Spring boot

Language:Java000

Printed_chinesechar_deeprecog

deep network for common used printed chinese character classification

000

resnet-protofiles

Caffe Protofiles for MSRA ResNet: train prototxt

Language:Python020

transducer-tutorial

Example code for a neural transducer model.

000

VALL-E-X

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io

MIT000

vosk-server

WebSocket and gRPC server for speech recognition based on Kaldi and Vosk libraries

Language:PythonApache-2.0010

wekws_dev

Production First and Production Ready End-to-End Keyword Spotting Toolkit

Apache-2.0000

wenet-online-decoder-onnx

Apache-2.0000

wfrest

C++ Web Framework REST API

Apache-2.0000

kli017

Kai's repositories

ISCSLP2022-CSSD-Challenge

kli017.github.io

wenet

caffe-segnet-cudnn5

deep-residual-networks

EEND-vector-clustering

kaldi

RNN-for-Human-Activity-Recognition-using-2D-Pose-Input

deeplab_v2

espnet

Fine_Grained_car

FunASR

GPT-SoVITS

HyperVID

kaldi-trunk

KAN-TTS

kli017

langchain-ChatGLM

lexicon

libhv

loguru

postgress-boot

Printed_chinesechar_deeprecog

resnet-protofiles

transducer-tutorial

VALL-E-X

vosk-server

wekws_dev

wenet-online-decoder-onnx

wfrest