Beast code in Giters

GreedyIsGood's repositories

asap-dataset

A dataset of 222 digital musical scores aligned with 1068 performances (more than 92 hours) of Western classical piano music.

NOASSERTION000

audapolis

an editor for spoken-word audio with automatic transcription

Language:TypeScriptAGPL-3.0010

Awesome-CoreML-Models

Largest list of models for Core ML (for iOS 11+)

Language:PythonMIT010

awesome-speech-enhancement

speech enhancement\speech seperation\sound source localization

GPL-2.0000

ChineseBQB

🇨🇳 Chinese sticker pack,More joy / 表情包的博物馆, Github最有毒的仓库, **表情包大集合, 聚欢乐~

000

We rank the 1st in DSTC8 Audio-Visual Scene-Aware Dialog competition. This is the source code for our IEEE/ACM TASLP (AAAI2020-DSTC8-AVSD) paper "Bridging Text and Video: A Universal Multimodal Transformer for Video-Audio Scene-Aware Dialog".

MIT000

end-to-end-synthetic-speech-detection

Time-domain synthetic speech detection net (TSSDNet), having the classic ResNet and Inception Net style structures (Res-TSSDNet and Inc-TSSDNet), for end-to-end synthetic speech detection. They achieve the state-of-the-art performance in terms of EER on ASVspoof 2019 challenge and promising generalization capability tested on ASVspoof 2015.

Language:PythonGPL-3.0010

hair

remove image background

GPL-2.0000

Ideal-Piano

这是一款智能钢琴软件，通过乐理逻辑的算法来判断当前演奏的音组成的是什么和弦，支持midi键盘，电脑键盘，DAW同步播放工程，播放midi文件分析和弦并且实时演示。This is a piano software that analyzes what chords you are playing in real time by music theory based chord types detection algorithms written by me and displays the chord types on the screen. This piano software supports midi keyboard playing, computer keyboard playing, play and analyze midi files, DAW synchronous display and so on.

GPL-3.0000

im2latex

Pytorch implemention of Deep CNN Encoder + LSTM Decoder with Attention for Image to Latex

Language:PythonMIT010

MDVC

PyTorch implementation of Multi-modal Dense Video Captioning (CVPR 2020 Workshops)

000

mmdetection

OpenMMLab Detection Toolbox and Benchmark

Language:PythonApache-2.0010

mmocr

OpenMMLab Text Detection, Recognition and Understanding Toolbox

Apache-2.0000

MODNet

A Trimap-Free Solution for Portrait Matting in Real Time under Changing Scenes

000

nanodet

⚡Super fast and lightweight anchor-free object detection model. 🔥Only 1.8mb and run 97FPS on cellphone🔥

Language:Python010

OpenTransformer

A No-Recurrence Sequence-to-Sequence Model for Speech Recognition

Language:PythonMIT010

PandaOCR

PandaOCR - 多功能OCR图文识别+翻译+朗读+弹窗+公式+表格+图床+搜图+二维码

000

piano_transcription

Language:Python010

pytorch-softdtw-cuda

Fast CUDA implementation of (differentiable) soft dynamic time warping for PyTorch using Numba

Language:PythonMIT010

qlib

Qlib is an AI-oriented quantitative investment platform, which aims to realize the potential, empower the research, and create the value of AI technologies in quantitative investment. With Qlib, you can easily try your ideas to create better Quant investment strategies.

MIT000

caozhengquan

GreedyIsGood's repositories

asap-dataset

audapolis

Awesome-CoreML-Models

awesome-speech-enhancement

CAT

ChineseBQB

DSTC8-AVSD

end-to-end-synthetic-speech-detection

hair

Ideal-Piano

im2latex

MDVC

mmdetection

mmocr

MODNet

nanodet

OpenTransformer

PandaOCR

piano_transcription

pytorch-softdtw-cuda

qlib

Real-Time-Voice-Cloning

RRPN_plusplus

SlowFast

SoundLocation

source_separation

SpeechAlgorithms

speechbrain

swapping-autoencoder-pytorch

wenet