Tinglok

Tingle Li's starred repositories

AutoGPT

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Language:PythonMIT167253 1553 2692

awesome-chatgpt-prompts

This repo includes ChatGPT prompt curation to use ChatGPT better.

Language:HTMLCC0-1.0111351 14400

ImageBind

ImageBind One Embedding Space to Bind Them All

Language:PythonNOASSERTION8255 99 89

guided-diffusion

Language:PythonMIT6111 141 140

AugLy

A data augmentations library for audio, image, text, and video.

Language:PythonNOASSERTION4950 78 74

encodec

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

Language:PythonMIT3449 57 70

stable-audio-tools

Generative models for conditional audio generation

Language:PythonMIT2565 43 92

audio-diffusion-pytorch

Audio generation using diffusion models, in PyTorch.

Language:PythonMIT1934 39 43

audio-ai-timeline

A timeline of the latest AI models for audio generation, starting in 2023!

1880 170 4

audiomentations

A Python library for audio data augmentation. Inspired by albumentations. Useful for machine learning.

Language:PythonMIT1826 20 181

CLAP

Contrastive Language-Audio Pretraining

Language:PythonCC0-1.01361 28 88

descript-audio-codec

State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.

Language:PythonMIT1146 27 75

clean-fid

PyTorch - FID calculation with proper image resizing and quantization steps [CVPR 2022]

Language:PythonMIT944 9 49

lhotse

Tools for handling speech data in machine learning projects.

Language:PythonApache-2.0936 44 410

av_hubert

A self-supervised learning framework for audio-visual speech

Language:PythonNOASSERTION835 15 111

audio-dataset

Audio Dataset for training CLAP and other models

Language:Python619 21 58

textlesslib

Library for Textless Spoken Language Processing

Language:PythonMIT528 16 24

MultiBench

[NeurIPS 2021] Multiscale Benchmarks for Multimodal Representation Learning

Language:HTMLMIT478 16 34

CPC_audio

An implementation of the Contrast Predictive Coding (CPC) method to train audio features in an unsupervised fashion.

Language:PythonMIT347 15 12

lyrebird-wav2clip

Official implementation of the paper WAV2CLIP: LEARNING ROBUST AUDIO REPRESENTATIONS FROM CLIP

Language:PythonMIT324 11 13

frechet-audio-distance

A lightweight library for Frechet Audio Distance calculation.

Language:PythonMIT231 2 13

awesome-audiovisual-learning

A curated list of audio-visual learning methods and datasets.

222 9 2

VisualVoice

Audio-Visual Speech Separation with Cross-Modal Consistency

Language:PythonNOASSERTION219 9 31

audio-data-pytorch

A collection of useful audio datasets and transforms for PyTorch.

Language:PythonMIT130 5 5

selavi

This repo covers the implementation for Labelling unlabelled videos from scratch with multi-modal self-supervision, which learns clusters from multi-modal data in a self-supervised way.

Language:PythonNOASSERTION114 12 4

MixGCF

MixGCF: An Improved Training Method for Graph Neural Network-based Recommender Systems, KDD2021

Language:Python95 1 16

audioscrape

Scrape audio from YouTube and SoundCloud with a simple command-line interface.

Language:PythonAGPL-3.083 3 6

cocktail-fork-separation

Baseline multi-resolution cross network model trained using the Divide and Remaster Dataset

Language:PythonMIT74 4 2

AudioLoader

PyTorch Dataset for Speech and Music audio

Language:Python73 4 4

avstyle

Codebase for the Paper: Learning Visual Styles from Audio-Visual Associations (ECCV 2022, in PyTorch)

Language:PythonMIT14 2 4