p4thakur's repositories
build-your-own-x
Master programming by recreating your favorite technologies from scratch.
AnimateDiff
Official implementation of AnimateDiff.
AnyDoor
Official implementations for paper: Anydoor: zero-shot object-level image customization
buzz
Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.
Chat2DB
🔥 🔥 🔥 An intelligent and versatile general-purpose SQL client and reporting tool for databases which integrates ChatGPT capabilities.(智能的通用数据库SQL客户端和报表工具)
ComfyScript
A Python front end and library for ComfyUI
ComfyUI
The most powerful and modular stable diffusion GUI with a graph/nodes interface.
comfyui_controlnet_aux
ComfyUI's ControlNet Auxiliary Preprocessors
elevenlabs-python
The official Python API for ElevenLabs text-to-speech.
fast-stable-diffusion
fast-stable-diffusion + DreamBooth
faster-whisper
Faster Whisper transcription with CTranslate2
gpt-engineer
Specify what you want it to build, the AI asks for clarification, and then builds it.
infinite-zoom-automatic1111-webui
infinite zoom effect extension for AUTOMATIC1111's webui - stable diffusion
instant-ngp
Instant neural graphics primitives: lightning fast NeRF and more
low-level-design-primer
Dedicated Resources for the Low-Level System Design. Learn how to design and implement large-scale systems. Prep for the system design interview.
magic-animate-for-colab
MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
one-click-dense-pose
One Click Dense Pose Video with just One click
PhotoMaker
PhotoMaker
pix2pix3D
pix2pix3D: Generating 3D Objects from 2D User Inputs
Real-ESRGAN
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.
richard-ryan
Richard Ryan is a fully responsive portfolio website, Responsive for all devices, build using HTML, CSS, and JavaScript.
roop
one-click deepfake (face swap)
SadTalker
[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
stable-diffusion-webui-colab
stable diffusion webui colab
stable-ts
ASR with reliable word-level timestamps using OpenAI's Whisper
whisper-timestamped
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)