Beast code in Giters

GreedyIsGood's repositories

SpeechAlgorithms

Speech Algorithms Collections

Apache-2.0

speechbrain

A PyTorch-based Speech Toolkit

Apache-2.0

audapolis

An editor for spoken-word audio with automatic transcription

AGPL-3.0

awesome-speech-enhancement

Speech enhancement / speech separation / sound source localization

GPL-2.0

im2latex

PyTorch implementation of Deep CNN Encoder + LSTM Decoder with Attention for Image to LaTeX

MIT

end-to-end-synthetic-speech-detection

Time-domain synthetic speech detection net (TSSDNet), with classic ResNet-style and Inception-style structures (Res-TSSDNet and Inc-TSSDNet), for end-to-end synthetic speech detection. The models achieve state-of-the-art EER on the ASVspoof 2019 challenge and show promising generalization when tested on ASVspoof 2015.

GPL-3.0

wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

Apache-2.0

MDVC

PyTorch implementation of Multi-modal Dense Video Captioning (CVPR 2020 Workshops)

DSTC8-AVSD

We ranked 1st in the DSTC8 Audio-Visual Scene-Aware Dialog competition. This is the source code for our IEEE/ACM TASLP (AAAI2020-DSTC8-AVSD) paper "Bridging Text and Video: A Universal Multimodal Transformer for Video-Audio Scene-Aware Dialog".

MIT

PandaOCR

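PandaOCR - Multi-function OCR text and image recognition + translation + read-aloud + pop-up window + formulas + tables + image hosting + image search + QR codes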

mmocr

OpenMMLab Text Detection, Recognition and Understanding Toolbox

Apache-2.0

hair

Remove image background

GPL-2.0

swapping-autoencoder-pytorch

Official Implementation of Swapping Autoencoder for Deep Image Manipulation (NeurIPS 2020)

SlowFast

PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.

Apache-2.0

qlib

Qlib is an AI-oriented quantitative investment platform that aims to realize the potential, empower the research, and create the value of AI technologies in quantitative investment. With Qlib, you can easily try out your ideas to create better quantitative investment strategies.

MIT

MODNet

A Trimap-Free Solution for Portrait Matting in Real Time under Changing Scenes

mmdetection

OpenMMLab Detection Toolbox and Benchmark

Apache-2.0

pytorch-softdtw-cuda

Fast CUDA implementation of (differentiable) soft dynamic time warping for PyTorch using Numba

MIT

OpenTransformer

A No-Recurrence Sequence-to-Sequence Model for Speech Recognition

MIT

CAT

A CRF-based ASR Toolkit

Apache-2.0

RRPN_plusplus

RRPN++: Guidance Towards More Accurate Scene Text Detection

nanodet

⚡Super fast and lightweight anchor-free object detection model. 🔥Only 1.8 MB and runs at 97 FPS on a cellphone🔥

source_separation

Deep-learning-based speech source separation using PyTorch

Apache-2.0

SoundLocation

Sound source localization system based on the PYNQ-Z2

Ideal-Piano

This is a piano software that analyzes the chords you are playing in real time, using music-theory-based chord type detection algorithms written by me, and displays the chord types on the screen. It supports MIDI keyboard playing, computer keyboard playing, playing and analyzing MIDI files, synchronized display with a DAW project, and so on.

GPL-3.0

caozhengquan

Real-Time-Voice-Cloning

Awesome-CoreML-Models

piano_transcription

asap-dataset

ChineseBQB