diehlj's starred repositories
gpt-engineer
Specify what you want it to build, the AI asks for clarification, and then builds it.
audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
Grounded-Segment-Anything
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
LLaMA-Adapter
[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters
pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Time-Series-Library
A Library for Advanced Deep Time Series Models.
AITemplate
AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.
tree-of-thought-llm
[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models
NeMo-Guardrails
NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.
One-2-3-45
[NeurIPS 2023] Official code of "One-2-3-45: Any Single Image to 3D Mesh in 45 Seconds without Per-Shape Optimization"
S.A.T.U.R.D.A.Y
A toolbox for working with WebRTC, Audio and AI
SocraticAI
Problem solving by engaging multiple AI agents in conversation with each other and the user.
TSInterpret
An Open-Source Library for the interpretability of time series classifiers
MusicGen-Google-Colab
Google colab book for Facebook Research music gen and AudioCraft. Book will save files to colab instance then connect to google drive and save wav sample files to specified folder.