João Felipe Santos (jfsantos)

jfsantos

Geek Repo

Company:@NVIDIA

Location:Vancouver, BC, Canada

Home Page:http://www.seaandsailor.com

Twitter:@seaandsailor

Github PK Tool:Github PK Tool


Organizations
JuliaDSP
MuSAELab

João Felipe Santos's starred repositories

segment-anything

The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:44915Issues:299Issues:646
Language:Jupyter NotebookLicense:Apache-2.0Stargazers:6757Issues:61Issues:172

streaming-llm

[ICLR 2024] Efficient Streaming Language Models with Attention Sinks

Language:PythonLicense:MITStargazers:6299Issues:61Issues:76

consistent_depth

We estimate dense, flicker-free, geometrically consistent depth from monocular video, for example hand-held cell phone video.

Language:PythonLicense:MITStargazers:1590Issues:57Issues:69

soundstorm-pytorch

Implementation of SoundStorm, Efficient Parallel Audio Generation from Google Deepmind, in Pytorch

Language:PythonLicense:MITStargazers:1137Issues:51Issues:15

InstaFlow

:zap: InstaFlow! One-Step Stable Diffusion with Rectified Flow (ICLR 2024)

Language:PythonLicense:MITStargazers:1025Issues:43Issues:26

versatile_audio_super_resolution

Versatile audio super resolution (any -> 48kHz) with AudioSR.

Language:PythonLicense:MITStargazers:937Issues:25Issues:47

SALMONN

SALMONN: Speech Audio Language Music Open Neural Network

Language:PythonLicense:Apache-2.0Stargazers:845Issues:26Issues:31

ODISE

Official PyTorch implementation of ODISE: Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models [CVPR 2023 Highlight]

Language:PythonLicense:NOASSERTIONStargazers:815Issues:40Issues:42

dawproject

Open exchange format for DAWs

Language:HTMLLicense:MITStargazers:725Issues:23Issues:54

RectifiedFlow

Official Implementation of Rectified Flow (ICLR2023 Spotlight)

PedalinoMini

Wireless and Bluetooth MIDI Foot Controller

Language:CLicense:GPL-3.0Stargazers:466Issues:37Issues:428

LeanDojo

Tool for data extraction and interacting with Lean programmatically.

Language:PythonLicense:MITStargazers:462Issues:16Issues:42

awesome-large-audio-models

Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.

Pengi

An Audio Language model for Audio Tasks

Language:PythonLicense:MITStargazers:255Issues:14Issues:11

lp-music-caps

LP-MusicCaps: LLM-Based Pseudo Music Captioning [ISMIR23]

Bayesian-Flow-Networks

A simple implimentation of Bayesian Flow Networks (BFN)

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:232Issues:8Issues:5

dscore

Diarization scoring tools.

Language:PythonLicense:BSD-2-ClauseStargazers:198Issues:8Issues:4

pesto

Self-supervised learning for fast pitch estimation

Language:PythonLicense:LGPL-3.0Stargazers:161Issues:8Issues:16

Diff-Foley

Diff-Foley: Synchronized Video-to-Audio Synthesis with Latent Diffusion Models

Language:PythonLicense:Apache-2.0Stargazers:120Issues:8Issues:25

im2wav

Official implementation of the pipeline presented in I hear your true colors: Image Guided Audio Generation

Language:PythonLicense:MITStargazers:98Issues:3Issues:11

BMC

BMC the Badass MIDI Controller, all-in-one Scalable MIDI Controller library with a companion Desktop/Browser Editor App for Teensy 3.2, 3.5, 3.6, 4.0, 4.1, Micromod

Language:CLicense:NOASSERTIONStargazers:82Issues:13Issues:10

UniAudio

The official source code of UniAudio

CondFoleyGen

Official PyTorch implementation of "Conditional Generation of Audio from Video via Foley Analogies".

Language:PythonLicense:MITStargazers:59Issues:5Issues:11

regnet

Official PyTorch implementation of the TIP paper "Generating Visually Aligned Sound from Videos" and the corresponding Visually Aligned Sound (VAS) dataset.

SparseSync

Source code for "Sparse in Space and Time: Audio-visual Synchronisation with Trainable Selectors." (Spotlight at the BMVC 2022)

Language:PythonLicense:MITStargazers:45Issues:2Issues:3

SLfM

Official code for the paper: [ICCV2023] Sound Localization from Motion: Jointly Learning Sound Direction and Camera Rotation

Language:PythonLicense:MITStargazers:34Issues:2Issues:6

Evening-Sun-MC6

An Arduino based Midi Controller with 6 Buttons and a 4 line LCD Display

Language:C++Stargazers:7Issues:0Issues:0

Fractal-Audio-FM3-Midi-Controller

Fractal Audio FM3 Midi Controller

Language:C++Stargazers:2Issues:1Issues:0