GreedyIsGood's repositories

asap-dataset

A dataset of 222 digital musical scores aligned with 1068 performances (more than 92 hours) of Western classical piano music.

License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

audapolis

an editor for spoken-word audio with automatic transcription

Language: TypeScript | License: AGPL-3.0 | Stargazers: 0 | Issues: 1 | Issues: 0

Awesome-CoreML-Models

Largest list of models for Core ML (for iOS 11+)

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

awesome-speech-enhancement

speech enhancement / speech separation / sound source localization

License: GPL-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

CAT

A CRF-based ASR Toolkit

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

ChineseBQB

🇨🇳 Chinese sticker pack, more joy / A museum of sticker packs, the most "toxic" repository on GitHub, a big collection of ** sticker packs, gathering joy~

Stargazers: 0 | Issues: 0 | Issues: 0

DSTC8-AVSD

We ranked 1st in the DSTC8 Audio-Visual Scene-Aware Dialog competition. This is the source code for our IEEE/ACM TASLP (AAAI2020-DSTC8-AVSD) paper "Bridging Text and Video: A Universal Multimodal Transformer for Video-Audio Scene-Aware Dialog".

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

end-to-end-synthetic-speech-detection

Time-domain synthetic speech detection net (TSSDNet), with classic ResNet-style and Inception-style structures (Res-TSSDNet and Inc-TSSDNet), for end-to-end synthetic speech detection. They achieve state-of-the-art performance in terms of EER on the ASVspoof 2019 challenge and show promising generalization capability when tested on ASVspoof 2015.

Language: Python | License: GPL-3.0 | Stargazers: 0 | Issues: 1 | Issues: 0

hair

remove image background

License: GPL-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

Ideal-Piano

This is a smart piano application that analyzes in real time which chords you are playing, using music-theory-based chord type detection algorithms written by me, and displays the chord types on the screen. It supports MIDI keyboard playing, computer keyboard playing, playing and analyzing MIDI files with real-time chord display, synchronized playback with DAW projects, and more.

License: GPL-3.0 | Stargazers: 0 | Issues: 0 | Issues: 0

im2latex

PyTorch implementation of Deep CNN Encoder + LSTM Decoder with Attention for Image to LaTeX

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

MDVC

PyTorch implementation of Multi-modal Dense Video Captioning (CVPR 2020 Workshops)

Stargazers: 0 | Issues: 0 | Issues: 0

mmdetection

OpenMMLab Detection Toolbox and Benchmark

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0

mmocr

OpenMMLab Text Detection, Recognition and Understanding Toolbox

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

MODNet

A Trimap-Free Solution for Portrait Matting in Real Time under Changing Scenes

Stargazers: 0 | Issues: 0 | Issues: 0

nanodet

⚡Super fast and lightweight anchor-free object detection model. 🔥Only 1.8 MB and runs at 97 FPS on a cellphone🔥

Language: Python | Stargazers: 0 | Issues: 1 | Issues: 0

OpenTransformer

A No-Recurrence Sequence-to-Sequence Model for Speech Recognition

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

PandaOCR

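PandaOCR - multifunctional OCR text recognition + translation + read-aloud + pop-ups + formulas + tables + image hosting + image search + QR codes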

Stargazers: 0 | Issues: 0 | Issues: 0
Language: Python | Stargazers: 0 | Issues: 1 | Issues: 0

pytorch-softdtw-cuda

Fast CUDA implementation of (differentiable) soft dynamic time warping for PyTorch using Numba

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

qlib

Qlib is an AI-oriented quantitative investment platform, which aims to realize the potential, empower the research, and create the value of AI technologies in quantitative investment. With Qlib, you can easily try your ideas to create better Quant investment strategies.

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

Real-Time-Voice-Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 1 | Issues: 0

RRPN_plusplus

RRPN++: Guidance Towards More Accurate Scene Text Detection

Language: Python | Stargazers: 0 | Issues: 1 | Issues: 0

SlowFast

PySlowFast: video understanding codebase from FAIR for reproducing state-of-the-art video models.

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

SoundLocation

A sound source localization system based on the PYNQ-Z2

Language: C | Stargazers: 0 | Issues: 1 | Issues: 0

source_separation

Deep-learning-based speech source separation using PyTorch

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

SpeechAlgorithms

Speech Algorithms Collections

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

speechbrain

A PyTorch-based Speech Toolkit

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

swapping-autoencoder-pytorch

Official Implementation of Swapping Autoencoder for Deep Image Manipulation (NeurIPS 2020)

Stargazers: 0 | Issues: 0 | Issues: 0

wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0