João Felipe Santos's starred repositories
segment-anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
streaming-llm
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
consistent_depth
We estimate dense, flicker-free, geometrically consistent depth from monocular video, for example hand-held cell phone video.
soundstorm-pytorch
Implementation of SoundStorm, Efficient Parallel Audio Generation from Google Deepmind, in Pytorch
versatile_audio_super_resolution
Versatile audio super resolution (any -> 48kHz) with AudioSR.
dawproject
Open exchange format for DAWs
RectifiedFlow
Official Implementation of Rectified Flow (ICLR2023 Spotlight)
PedalinoMini
Wireless and Bluetooth MIDI Foot Controller
awesome-large-audio-models
Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.
lp-music-caps
LP-MusicCaps: LLM-Based Pseudo Music Captioning [ISMIR23]
Bayesian-Flow-Networks
A simple implimentation of Bayesian Flow Networks (BFN)
Diff-Foley
Diff-Foley: Synchronized Video-to-Audio Synthesis with Latent Diffusion Models
CondFoleyGen
Official PyTorch implementation of "Conditional Generation of Audio from Video via Foley Analogies".
SparseSync
Source code for "Sparse in Space and Time: Audio-visual Synchronisation with Trainable Selectors." (Spotlight at the BMVC 2022)
Evening-Sun-MC6
An Arduino based Midi Controller with 6 Buttons and a 4 line LCD Display
Fractal-Audio-FM3-Midi-Controller
Fractal Audio FM3 Midi Controller