Beast code in Giters

yearnyeen ho's starred repositories

ollama

Get up and running with Llama 3.1, Mistral, Gemma 2, and other large language models.

Language:GoMIT86303 513 4087

open-interpreter

A natural language interface for computers

Language:PythonAGPL-3.051662 384 911

llm.c

LLM training in simple, raw C/CUDA

Language:CudaMIT22764 223 129

TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Language:PythonApache-2.07530 109 152

umap

Uniform Manifold Approximation and Projection

Language:PythonBSD-3-Clause7311 127 785

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

Language:PythonMIT4400 58 149

torchinfo

View model summaries in PyTorch!

Language:PythonMIT2461 18 155

visualization-curriculum

A data visualization curriculum of interactive notebooks.

Language:Jupyter NotebookBSD-3-Clause1275 54 13

Hybrid-Net

Real-time audio to chords, lyrics, beat, and melody.

Language:Python652 5 2

speech-trident

Awesome speech/audio LLMs, representation learning, and codec models

552 30 2

StreamMultiDiffusion

Official code for the paper "StreamMultiDiffusion: Real-Time Interactive Generation with Region-Based Semantic Control."

Language:Jupyter NotebookMIT516 10 14

edm2

Analyzing and Improving the Training Dynamics of Diffusion Models (EDM2)

Language:PythonNOASSERTION461 12 5

awesome-audio-plaza

Daily tracking of awesome audio papers, including music generation, zero-shot tts, asr, audio generation

MIT287 30 1

VoiceFlow-TTS

[ICASSP 2024] This is the official code for "VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching"

Language:Python274 16 13

ML-from-scratch-seminar

This repository is part of a "Machine Learning from Scratch" seminar at Harvard Medical School.

Language:Jupyter NotebookMIT257 220

ect

Consistency Models Made Easy

Language:Python185 6 11

gflownet

Generative Flow Networks - GFlowNet

Language:PythonApache-2.0152 7 47

DiffusionRet

[ICCV 2023] DiffusionRet: Generative Text-Video Retrieval with Diffusion Model

Language:PythonApache-2.0107 3 9

EnCLAP

Official Implementation of EnCLAP (ICASSP 2024)

Language:PythonMIT89 6 9

Rank-N-Contrast

[NeurIPS 2023, Spotlight] Rank-N-Contrast: Learning Continuous Representations for Regression

Language:Python73 1 5

pflow-encodec

Implementation of TTS model based on NVIDIA P-Flow TTS Paper

Language:Python64 6 6

mini_edm

Minimum implementation of EDM (Elucidating the Design Space of Diffusion-Based Generative Models) on cifar10 and mnist

Language:Python33 3 2

timbre-trap

Code for the paper "Timbre-Trap: A Low-Resource Framework for Instrument-Agnostic Music Transcription"

Language:PythonMIT32 20

MWAFM

Multi-Scale Attention for Audio Question Answering

Language:Python24 2 4

Cacophony

Inference codebase for "Cacophony: An Improved Contrastive Audio-Text Model". Preprint: https://arxiv.org/abs/2402.06986

Language:PythonMIT24 4 2

music-text-representation-pp

Enriching Music Descriptions with a Finetuned-LLM and Metadata for Text-to-Music Retrieval (TTMR++) [ICASSP24]

Language:Python17 2 2

real-time-lyrics-alignment

Codebase for 'A Real-Time Lyrics Alignment System Using Chroma And Phonetic Features For Classical Vocal Performance', ICASSP 2024

Language:PythonNOASSERTION11 10

audio-representations

JEPAs for audio representation learning

Language:Python11 4 2

ICASSP-2024-BEAFX-using-DDSP

Github repository for the paper accepted in ICASSP 2024 : Blind estimation of audio effects using an auto-encoder approach and differentiable signal processing

Language:Jupyter Notebook10 10

Call-Response

Responding to the Call: Exploring Automatic Music Composition Using a Knowledge-Enhanced Model

Language:Python5 10