Beast code in Giters

p4thakur's repositories

build-your-own-x

Master programming by recreating your favorite technologies from scratch.

AnimateDiff

Official implementation of AnimateDiff.

Apache-2.0

AnyDoor

Official implementation of the paper "AnyDoor: Zero-shot Object-level Image Customization"

Language: Python

MIT

buzz

Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.

MIT

Chat2DB

🔥 🔥 🔥 An intelligent and versatile general-purpose SQL client and reporting tool for databases that integrates ChatGPT capabilities.

Apache-2.0

ComfyScript

A Python front end and library for ComfyUI

MIT

ComfyUI

The most powerful and modular stable diffusion GUI with a graph/nodes interface.

GPL-3.0

comfyui_controlnet_aux

ComfyUI's ControlNet Auxiliary Preprocessors

Apache-2.0

ComfyUI_IPAdapter_plus

GPL-3.0

elevenlabs-python

The official Python API for ElevenLabs text-to-speech.

fast-stable-diffusion

fast-stable-diffusion + DreamBooth

MIT

faster-whisper

Faster Whisper transcription with CTranslate2

MIT

gpt-engineer

Specify what you want it to build, the AI asks for clarification, and then builds it.

Language: Python

MIT

infinite-zoom-automatic1111-webui

infinite zoom effect extension for AUTOMATIC1111's webui - stable diffusion

MIT

instant-ngp

Instant neural graphics primitives: lightning fast NeRF and more

NOASSERTION

low-level-design-primer

Dedicated Resources for the Low-Level System Design. Learn how to design and implement large-scale systems. Prep for the system design interview.

magic-animate-for-colab

MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model

BSD-3-Clause

magic-animate-modified

BSD-3-Clause

one-click-dense-pose

Generate a DensePose video with just one click

Language: Python

PhotoMaker

Language: Jupyter Notebook

NOASSERTION

pix2pix3D

pix2pix3D: Generating 3D Objects from 2D User Inputs

MIT

Real-ESRGAN

Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.

BSD-3-Clause

richard-ryan

Richard Ryan is a fully responsive portfolio website that works on all devices, built using HTML, CSS, and JavaScript.

roop

one-click deepfake (face swap)

AGPL-3.0

SadTalker

[CVPR 2023] SadTalker: Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

MIT

stable-diffusion-webui-colab

stable diffusion webui colab

Unlicense

stable-ts

ASR with reliable word-level timestamps using OpenAI's Whisper

MIT

warp-fusion

whisper-timestamped

Multilingual Automatic Speech Recognition with word-level timestamps and confidence

AGPL-3.0

whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Language: Python

BSD-4-Clause