Beast code in Giters

AI Legend's repositories

optimization_learning

newton,quasi-newton,DFP,BFGS,accurate_line_search,Cholesky_decomposition,FR_conjugate_gradient_method,Wolfe,BB_method,armijo,gradient_descent,Goldstein

Language:Jupyter Notebook3 10

HMCN-F

pytorch version HMCN-F

Language:PythonGPL-3.0200

sudo_rm_rf

Code for SuDoRm-Rf networks for efficient audio source separation. Chinese no work

Language:Jupyter NotebookMIT100

AAAI-EEG-To-Text

code for AAAI2022 paper "Open Vocabulary Electroencephalography-To-Text Decoding and Zero-shot Sentiment Classification"

Language:Python000

asteroid_speech_separation

The PyTorch-based audio source separation toolkit for researchers

Language:PythonMIT000

EEG_video_matching

match-mismatch prediction of eeg and video pair

Language:Jupyter Notebook000

audio-diffusion-pytorch

Unconditional audio generation using diffusion models, in PyTorch.

Language:PythonMIT000

AUDIO_GAN

some traditional gans transferred from image to audio

000

benke-NPU-Thesis

西北工业大学本科毕业设计论文模版 | Thesis Template for Northwestern Polytechnical University

GPL-3.0000

easyTorch

no need to calculate in_channels in conv, input neurons in fc layer or parameter_nums in PReLU

010

McNet

The official repo: "McNet: Fuse Multiple Cues for Multichannel Speech Enhancement" submitted to ICASSP 2023

000

NLP_ability

总结梳理自然语言处理工程师(NLP)需要积累的各方面知识，包括面试题，各种基础知识，工程能力等等，提升核心竞争力

Language:Python000

notablog-starter

The official starter project for Notablog.

Language:CSSMIT000

使用 NextJS + Notion API 实现的，支持多种部署方案的静态博客，无需服务器、零门槛搭建网站，为Notion和所有创作者设计。 (A static blog built with NextJS and Notion API, supporting multiple deployment options. No server required, zero threshold to set up a website. Designed for Notion and all creators.)

Language:JavaScriptMIT000

paper

Read papers

GPL-3.0000

PDVC

End-to-End Dense Video Captioning with Parallel Decoding (ICCV 2021)

MIT000

Shuobo-LaTeX-Template-for-NPU-Thesis

西北工业大学硕博学位论文模版 | Yet Another Thesis Template for Northwestern Polytechnical University

GPL-3.0000

speechSeperation

A PyTorch-based Speech Toolkit

Language:PythonApache-2.0000

SpeechTokenizer

This is the code for the SpeechTokenizer presented in the SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models. Samples are presented on

Language:PythonApache-2.0000

streamlit-example

Example Streamlit app that you can fork to test out share.streamlit.io

Language:Python000

StyleT2I

stylegan,text2img

BSD-2-Clause000

vad

coding based on others' research

MIT000

vad_speaker_numbering-speech_separation

010

video-diffusion-pytorch-ddim

Implementation of Video Diffusion Models, Jonathan Ho's new paper extending DDPMs to Video Generation - in Pytorch

Language:PythonMIT000

Whisper-Finetune

Fine-tune the Whisper speech recognition model to support training without timestamp data, training with timestamp data, and training without speech data. Accelerate inference and support Web deployment, Windows desktop deployment, and Android deployment

Language:CApache-2.0000

mixiazhiyang

AI Legend's repositories

optimization_learning

HMCN-F

sudo_rm_rf

vpn

AAAI-EEG-To-Text

asteroid_speech_separation

diffusion_playground

EEG_video_matching

audio-diffusion-pytorch

AUDIO_GAN

awesome-videochatbot

benke-NPU-Thesis

easyTorch

freak.github.io

McNet

mixiazhiyang.github.io

NLP_ability

notablog-starter

NotionNext

paper

PDVC

Shuobo-LaTeX-Template-for-NPU-Thesis

speechSeperation

SpeechTokenizer

streamlit-example

StyleT2I

vad

vad_speaker_numbering-speech_separation

video-diffusion-pytorch-ddim

Whisper-Finetune