Mikezz1

Mikezz1

Geek Repo

Location:Moscow

Github PK Tool:Github PK Tool

Mikezz1's starred repositories

scientific-computing-2024

Bridging the gap between mathematical courses and ML

Language:Jupyter NotebookLicense:MITStargazers:53Issues:0Issues:0

SALMONN

SALMONN: Speech Audio Language Music Open Neural Network

Language:PythonLicense:Apache-2.0Stargazers:1019Issues:0Issues:0

Qwen-Audio

The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.

Language:PythonLicense:NOASSERTIONStargazers:1446Issues:0Issues:0

ichigo

Llama3.1 learns to Listen

Language:PythonStargazers:1039Issues:0Issues:0

Hypo2Trans

Single-blind supplementary materials for NeurIPS 2023 submission

Language:PythonLicense:MITStargazers:91Issues:0Issues:0

Whispering-LLaMA

EMNLP 23 - Integrating Whisper Encoder to LLaMA Decoder for Generative ASR Error Correction

Language:Jupyter NotebookLicense:MITStargazers:228Issues:0Issues:0

fusedswiglu

Fused SwiGLU Triton kernels

Language:PythonLicense:MITStargazers:2Issues:0Issues:0

so-vits-svc

SoftVC VITS Singing Voice Conversion

Language:PythonLicense:AGPL-3.0Stargazers:25676Issues:0Issues:0

so-vits-svc-4.0-v2

SoftVC VITS Singing Voice Conversion

Language:PythonLicense:MITStargazers:556Issues:0Issues:0

DynamiCrafter

[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors

Language:PythonLicense:Apache-2.0Stargazers:2513Issues:0Issues:0

torch-conv-kan

This project is dedicated to the implementation and research of Kolmogorov-Arnold convolutional networks. The repository includes implementations of 1D, 2D, and 3D convolutions with different kernels, ResNet-like and DenseNet-like models, training code based on accelerate/PyTorch, as well as scripts for experiments with CIFAR-10 and Tiny ImageNet.

Language:PythonLicense:MITStargazers:387Issues:0Issues:0

descript-audio-codec

State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.

Language:PythonLicense:MITStargazers:1167Issues:0Issues:0

demucs

Code for the paper Hybrid Spectrogram and Waveform Source Separation

Language:PythonLicense:MITStargazers:909Issues:0Issues:0

lhotse

Tools for handling speech data in machine learning projects.

Language:PythonLicense:Apache-2.0Stargazers:941Issues:0Issues:0

BS-RoFormer

Implementation of Band Split Roformer, SOTA Attention network for music source separation out of ByteDance AI Labs

Language:PythonLicense:MITStargazers:410Issues:0Issues:0

lion-pytorch

🦁 Lion, new optimizer discovered by Google Brain using genetic algorithms that is purportedly better than Adam(w), in Pytorch

Language:PythonLicense:MITStargazers:2021Issues:0Issues:0

rotary-embedding-torch

Implementation of Rotary Embeddings, from the Roformer paper, in Pytorch

Language:PythonLicense:MITStargazers:547Issues:0Issues:0

deepsparse

Sparsity-aware deep learning inference runtime for CPUs

Language:PythonLicense:NOASSERTIONStargazers:2999Issues:0Issues:0

mixture-of-experts

A Pytorch implementation of Sparsely-Gated Mixture of Experts, for massively increasing the parameter count of language models

Language:PythonLicense:MITStargazers:628Issues:0Issues:0

x-transformers

A concise but complete full-attention transformer with a set of promising experimental features from various papers

Language:PythonLicense:MITStargazers:4680Issues:0Issues:0
Language:PythonLicense:NOASSERTIONStargazers:292Issues:0Issues:0

Practical_RL

A course in reinforcement learning in the wild

Language:Jupyter NotebookLicense:UnlicenseStargazers:5893Issues:0Issues:0

MossFormer

This repo provides the processed samples of the manuscript "MossFormer: Pushing the Performance Limit of Monaural Speech Separation using Gated Single-head Transformer with Convolution-augmented Joint Self-Attentions", which was submitted to ICASSP 2023.

License:Apache-2.0Stargazers:82Issues:0Issues:0

LibriMix

An open source dataset for source separation

Language:PythonLicense:MITStargazers:371Issues:0Issues:0

asteroid

The PyTorch-based audio source separation toolkit for researchers

Language:PythonLicense:MITStargazers:2250Issues:0Issues:0

CMGAN

Conformer-based Metric GAN for speech enhancement

Language:PythonLicense:MITStargazers:304Issues:0Issues:0

speech_course

Deep Learning for Speech

Language:Jupyter NotebookStargazers:77Issues:0Issues:0
Language:C++Stargazers:855Issues:0Issues:0

VSCode-LaTeX-Inkscape

✍️ A way to integrate LaTeX, VS Code, and Inkscape in macOS

Language:PythonLicense:MITStargazers:351Issues:0Issues:0

ICASSP-2023-24-Papers

ICASSP 2023-2024 Papers: A complete collection of influential and exciting research papers from the ICASSP 2023-24 conferences. Explore the latest advancements in acoustics, speech and signal processing. Code included. Star the repository to support the advancement of audio and signal processing!

Language:PythonLicense:MITStargazers:385Issues:0Issues:0