kevin_up (xiexukang)

xiexukang

Geek Repo

Company:Master's student at JiangNan University

Location:china

Github PK Tool:Github PK Tool

kevin_up's repositories

pyannote-audio

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

3D-Speaker

A repository for single- and multi-modal speaker verification, speaker recognition and speaker diarization.

License:Apache-2.0Stargazers:0Issues:0Issues:0

awesome-asr-contextualization

A curated list of awesome papers on contextualizing E2E ASR outputs

License:Apache-2.0Stargazers:0Issues:0Issues:0

awesome-cpp

A curated list of awesome C++ (or C) frameworks, libraries, resources, and shiny things. Inspired by awesome-... stuff.

License:MITStargazers:0Issues:0Issues:0

Awesome-LLM

Awesome-LLM: a curated list of Large Language Model

License:CC0-1.0Stargazers:0Issues:0Issues:0

awesome-multimodal-ml

Reading list for research topics in multimodal machine learning

License:MITStargazers:0Issues:0Issues:0

awesome-ncnn

😎 A Collection of Awesome NCNN-based Projects

Stargazers:0Issues:0Issues:0

Cantonese-learning

粤语学习资料

Stargazers:0Issues:0Issues:0

ChatWaifu_Mobile

移动版二次元 AI 老婆聊天器

License:MITStargazers:0Issues:0Issues:0

code-switching-papers

A curated list of research papers and resources on code-switching

License:Apache-2.0Stargazers:0Issues:0Issues:0

ctc_decoder

A ctc decoder for both online and offline asr model

Stargazers:0Issues:0Issues:0

data2vec-pytorch

PyTorch implementation of "data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language" from Meta AI

License:MITStargazers:0Issues:0Issues:0

espresso

Espresso: A Fast End-to-End Neural Speech Recognition Toolkit

License:NOASSERTIONStargazers:0Issues:0Issues:0

expert_readed_books

2021年最新总结,推荐工程师合适读本,计算机科学,软件技术,创业,**类,数学类,人物传记书籍

Stargazers:0Issues:0Issues:0

FastASR

这是一个用C++实现ASR推理的项目,它依赖很少,安装也很简单,推理速度很快,在树莓派4B等ARM平台也可以流畅的运行。 支持的模型是由Google的Transformer模型中优化而来,数据集是开源wenetspeech(10000+小时)或阿里私有数据集(60000+小时), 所以识别效果也很好,可以媲美许多商用的ASR软件。

License:Apache-2.0Stargazers:0Issues:0Issues:0

FastDeploy

⚡️An Easy-to-use and Fast Deep Learning Model Deployment Toolkit for ☁️Cloud 📱Mobile and 📹Edge. Including Image, Video, Text and Audio 20+ main stream scenarios and 150+ SOTA models with end-to-end optimization, multi-platform and multi-framework support.

License:Apache-2.0Stargazers:0Issues:0Issues:0

findpapers

Findpapers: A tool for helping researchers who are looking for related works

License:MITStargazers:0Issues:0Issues:0
License:Apache-2.0Stargazers:0Issues:0Issues:0

json

JSON for Modern C++

License:MITStargazers:0Issues:0Issues:0

keyword-spot

端到端语音唤醒工具箱,从模型训练到模型推理。

License:MITStargazers:0Issues:0Issues:0

myblog

myblog powered by django,xadmin

Language:PythonStargazers:0Issues:0Issues:0

ncnn

ncnn is a high-performance neural network inference framework optimized for the mobile platform

License:NOASSERTIONStargazers:0Issues:0Issues:0

OpenAI_Whisper_ASR

A minimalistic automatic speech recognition streamlit based webapp powered by OpenAI's Whisper "State of the Art" models

License:MITStargazers:0Issues:0Issues:0

pocolm

Small language toolkit for creation, interpolation and pruning of ARPA language models

License:NOASSERTIONStargazers:0Issues:0Issues:0

sherpa-ncnn

Real-time speech recognition using next-gen Kaldi with ncnn

License:NOASSERTIONStargazers:0Issues:0Issues:0

torchaudio

Data manipulation and transformation for audio signal processing, powered by PyTorch

License:BSD-2-ClauseStargazers:0Issues:0Issues:0
License:Apache-2.0Stargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0
License:Apache-2.0Stargazers:0Issues:0Issues:0
License:MITStargazers:0Issues:0Issues:0