Matthew Baas's starred repositories

yt-dlp

A feature-rich command-line audio/video downloader

Language: Python, License: Unlicense, Stargazers: 74164, Issues: 458, Issues: 7111

segment-anything

The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language: Jupyter Notebook, License: Apache-2.0, Stargazers: 44946, Issues: 299, Issues: 646

llama2.c

Inference Llama 2 in one file of pure C

mistral-src

Reference implementation of Mistral AI 7B v0.1 model.

Language: Jupyter Notebook, License: Apache-2.0, Stargazers: 8772, Issues: 116, Issues: 115

llama-recipes

Scripts for fine-tuning Llama2 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization & question answering. Supports a number of candidate inference solutions, such as HF TGI and vLLM, for local or cloud deployment. Demo apps showcase Llama2 for WhatsApp & Messenger.

Language: Jupyter Notebook, License: NOASSERTION, Stargazers: 7850, Issues: 68, Issues: 227

chatgpt_system_prompt

A collection of GPT system prompts and various prompt injection/leaking knowledge.

Language: HTML, License: MIT, Stargazers: 7574, Issues: 83, Issues: 9

VALL-E-X

An open-source implementation of Microsoft's VALL-E X zero-shot TTS model. A demo is available at https://plachtaa.github.io

Language: Python, License: MIT, Stargazers: 7349, Issues: 83, Issues: 148

aim

Aim 💫 — An easy-to-use & supercharged open-source experiment tracker.

Language: Python, License: Apache-2.0, Stargazers: 4888, Issues: 43, Issues: 988

Anima

33B Chinese LLM, DPO QLORA, 100K context, AirLLM 70B inference with single 4GB GPU

Language: Jupyter Notebook, License: Apache-2.0, Stargazers: 3403, Issues: 98, Issues: 131

AudioLDM2

Text-to-Audio/Music Generation

Language: Python, License: NOASSERTION, Stargazers: 2120, Issues: 44, Issues: 64

HierSpeechpp

The official implementation of HierSpeech++

Language: Python, License: MIT, Stargazers: 1110, Issues: 57, Issues: 45

descript-audio-codec

State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.

Language: Python, License: MIT, Stargazers: 964, Issues: 26, Issues: 56

versatile_audio_super_resolution

Versatile audio super resolution (any -> 48kHz) with AudioSR.

Language: Python, License: MIT, Stargazers: 939, Issues: 25, Issues: 47

code-llama-for-vscode

Use Code Llama with Visual Studio Code and the Continue extension. A local LLM alternative to GitHub Copilot.

Language: Python, License: MIT, Stargazers: 526, Issues: 6, Issues: 11

joytag

The JoyTag Image Tagging Model

Language: Python, License: Apache-2.0, Stargazers: 335, Issues: 13, Issues: 10

SpeechTokenizer

This is the code for the SpeechTokenizer presented in the SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models. Samples are presented on

Language: Python, License: Apache-2.0, Stargazers: 320, Issues: 13, Issues: 4

CharsiuG2P

Multilingual G2P in 100 languages

Language: Jupyter Notebook, License: MIT, Stargazers: 256, Issues: 10, Issues: 10

concept-erasure

Erasing concepts from neural representations with provable guarantees

Language: Python, License: MIT, Stargazers: 195, Issues: 9, Issues: 5

bigvsan

PyTorch implementation of BigVSAN

Language: Python, License: MIT, Stargazers: 184, Issues: 28, Issues: 4

easse

Easier Automatic Sentence Simplification Evaluation

Language: Roff, License: GPL-3.0, Stargazers: 153, Issues: 6, Issues: 50

wvmos

MOS score prediction by a fine-tuned wav2vec 2.0 model

USLM

Unified Speech Language Model for the paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models" (ICLR 2024)

pyreaper

A Python wrapper for REAPER

Language: Cython, License: NOASSERTION, Stargazers: 78, Issues: 5, Issues: 15

LPC_for_TTS

Linear prediction coefficient estimation from a mel-spectrogram, implemented in Python and based on the Levinson-Durbin algorithm.

PLC-Challenge

This repo contains the files required for the INTERSPEECH 2022 Audio Deep Packet Loss Concealment (PLC) Challenge.

Language: Python, License: MIT, Stargazers: 67, Issues: 8, Issues: 6

laughter-synthesis

Official implementation of the paper "Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus", accepted at INTERSPEECH 2023.

Language: Python, License: MIT, Stargazers: 63, Issues: 4, Issues: 4

InstaNovo

De novo peptide sequencing with InstaNovo: accurate, database-free peptide identification for large-scale proteomics experiments

Language: Python, License: Apache-2.0, Stargazers: 38, Issues: 8, Issues: 13
Language: Python, License: NOASSERTION, Stargazers: 31, Issues: 1, Issues: 5