Beast code in Giters

João Felipe Santos's starred repositories

segment-anything

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language:Jupyter NotebookApache-2.044915 299 646

insanely-fast-whisper

Language:Jupyter NotebookApache-2.06757 61 172

streaming-llm

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Language:PythonMIT6299 61 76

consistent_depth

We estimate dense, flicker-free, geometrically consistent depth from monocular video, for example hand-held cell phone video.

Language:PythonMIT1590 57 69

soundstorm-pytorch

Implementation of SoundStorm, Efficient Parallel Audio Generation from Google Deepmind, in Pytorch

Language:PythonMIT1137 51 15

InstaFlow

:zap: InstaFlow! One-Step Stable Diffusion with Rectified Flow (ICLR 2024)

Language:PythonMIT1025 43 26

versatile_audio_super_resolution

Versatile audio super resolution (any -> 48kHz) with AudioSR.

Language:PythonMIT937 25 47

SALMONN

SALMONN: Speech Audio Language Music Open Neural Network

Language:PythonApache-2.0845 26 31

ODISE

Official PyTorch implementation of ODISE: Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models [CVPR 2023 Highlight]

Language:PythonNOASSERTION815 40 42

dawproject

Open exchange format for DAWs

Language:HTMLMIT725 23 54

RectifiedFlow

Official Implementation of Rectified Flow (ICLR2023 Spotlight)

Language:Python604 9 19

PedalinoMini

Wireless and Bluetooth MIDI Foot Controller

Language:CGPL-3.0466 37 428

LeanDojo

Tool for data extraction and interacting with Lean programmatically.

Language:PythonMIT462 16 42

awesome-large-audio-models

Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.

418 17 3

Pengi

An Audio Language model for Audio Tasks

Language:PythonMIT255 14 11

lp-music-caps

LP-MusicCaps: LLM-Based Pseudo Music Captioning [ISMIR23]

Language:Python237 8 7

Bayesian-Flow-Networks

A simple implimentation of Bayesian Flow Networks (BFN)

Language:Jupyter NotebookApache-2.0232 8 5

dscore

Diarization scoring tools.

Language:PythonBSD-2-Clause198 8 4

pesto

Self-supervised learning for fast pitch estimation

Language:PythonLGPL-3.0161 8 16

Diff-Foley

Diff-Foley: Synchronized Video-to-Audio Synthesis with Latent Diffusion Models

Language:PythonApache-2.0120 8 25

im2wav

Official implementation of the pipeline presented in I hear your true colors: Image Guided Audio Generation

Language:PythonMIT98 3 11

architecture-objective

Language:PythonApache-2.089 4 11

BMC

BMC the Badass MIDI Controller, all-in-one Scalable MIDI Controller library with a companion Desktop/Browser Editor App for Teensy 3.2, 3.5, 3.6, 4.0, 4.1, Micromod

Language:CNOASSERTION82 13 10

UniAudio

The official source code of UniAudio

Language:Python73 8 1

CondFoleyGen

Official PyTorch implementation of "Conditional Generation of Audio from Video via Foley Analogies".

Language:PythonMIT59 5 11

regnet

Official PyTorch implementation of the TIP paper "Generating Visually Aligned Sound from Videos" and the corresponding Visually Aligned Sound (VAS) dataset.

Language:Python45 1 10

SparseSync

Source code for "Sparse in Space and Time: Audio-visual Synchronisation with Trainable Selectors." (Spotlight at the BMVC 2022)

Language:PythonMIT45 2 3

SLfM

Official code for the paper: [ICCV2023] Sound Localization from Motion: Jointly Learning Sound Direction and Camera Rotation

Language:PythonMIT34 2 6

Evening-Sun-MC6

An Arduino based Midi Controller with 6 Buttons and a 4 line LCD Display

Language:C++700

Fractal-Audio-FM3-Midi-Controller

Fractal Audio FM3 Midi Controller

Language:C++2 10