diehlj's starred repositories
gpt-engineer
Specify what you want it to build, the AI asks for clarification, and then builds it.
audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
S.A.T.U.R.D.A.Y
A toolbox for working with WebRTC, Audio and AI
One-2-3-45
[NeurIPS 2023] Official code of "One-2-3-45: Any Single Image to 3D Mesh in 45 Seconds without Per-Shape Optimization"
MusicGen-Google-Colab
Google colab book for Facebook Research music gen and AudioCraft. Book will save files to colab instance then connect to google drive and save wav sample files to specified folder.
Time-Series-Library
A Library for Advanced Deep Time Series Models.
AITemplate
AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.
SocraticAI
Problem solving by engaging multiple AI agents in conversation with each other and the user.
tree-of-thought-llm
[NeurIPS 2023] Tree of Thoughts: Deliberate Problem Solving with Large Language Models
LLaMA-Adapter
[ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters
Grounded-Segment-Anything
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
TSInterpret
An Open-Source Library for the interpretability of time series classifiers
NeMo-Guardrails
NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.
pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding