RF5 - Giters

Matthew Baas's starred repositories

yt-dlp

A feature-rich command-line audio/video downloader

Language:PythonUnlicense74164 458 7111

segment-anything

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language:Jupyter NotebookApache-2.044946 299 646

llama2.c

Inference Llama 2 in one file of pure C

Language:CMIT16557 189 213

mistral-src

Reference implementation of Mistral AI 7B v0.1 model.

Language:Jupyter NotebookApache-2.08772 116 115

Scripts for fine-tuning Llama2 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization & question answering. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment.Demo apps to showcase Llama2 for WhatsApp & Messenger

Language:Jupyter NotebookNOASSERTION7850 68 227

chatgpt_system_prompt

A collection of GPT system prompts and various prompt injection/leaking knowledge.

Language:HTMLMIT7574 83 9

VALL-E-X

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io

Language:PythonMIT7349 83 148

aim

Aim 💫 — An easy-to-use & supercharged open-source experiment tracker.

Language:PythonApache-2.04888 43 988

Anima

33B Chinese LLM, DPO QLORA, 100K context, AirLLM 70B inference with single 4GB GPU

Language:Jupyter NotebookApache-2.03403 98 131

AudioLDM2

Text-to-Audio/Music Generation

Language:PythonNOASSERTION2120 44 64

Single-GPU-Passthrough

Language:Shell1427 34 88

HierSpeechpp

The official implementation of HierSpeech++

Language:PythonMIT1110 57 45

descript-audio-codec

State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.

Language:PythonMIT964 26 56

versatile_audio_super_resolution

Versatile audio super resolution (any -> 48kHz) with AudioSR.

Language:PythonMIT939 25 47

code-llama-for-vscode

Use Code Llama with Visual Studio Code and the Continue extension. A local LLM alternative to GitHub Copilot.

Language:PythonMIT526 6 11

joytag

The JoyTag Image Tagging Model

Language:PythonApache-2.0335 13 10

SpeechTokenizer

This is the code for the SpeechTokenizer presented in the SpeechTokenizer: Unified Speech Tokenizer for Speech Language Models. Samples are presented on

Language:PythonApache-2.0320 13 4

CharsiuG2P

Multilingual G2P in 100 languages

Language:Jupyter NotebookMIT256 10 10

concept-erasure

Erasing concepts from neural representations with provable guarantees

Language:PythonMIT195 9 5

bigvsan

Pytorch implementation of BigVSAN

Language:PythonMIT184 28 4

easse

Easier Automatic Sentence Simplification Evaluation

Language:RoffGPL-3.0153 6 50

wvmos

MOS score prediction by fine-tuned wav2vec2.0 model

Language:Python124 5 5

USLM

Unified Speech Language Model for paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models"(ICLR 2024)

Language:Python115 8 4

pyreaper

A python wrapper for REAPER

Language:CythonNOASSERTION78 5 15

LPC_for_TTS

Linear Prediction Coefficients estimation from mel-spectrogram implemented in Python based on Levinson-Durbin algorithm.

Language:Python68 6 2

PLC-Challenge

This repo contains required files for the INTERSPEECH 2022 Audio Deep Packet Loss Concealment (PLC) Challenge.

Language:PythonMIT67 8 6

laughter-synthesis

Official implementation of the paper "Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus" accepted by INTERSPEECH 2023.

Language:PythonMIT63 4 4

InstaNovo

De novo peptide sequencing with InstaNovo: Accurate, database-free peptide identification for large scale proteomics experiments

Language:PythonApache-2.038 8 13

Phoneme_Hallucinator

Language:Jupyter Notebook37 2 1

recipe-ai

Language:PythonNOASSERTION31 1 5