p4thakur's repositories

build-your-own-x

Master programming by recreating your favorite technologies from scratch.

Stargazers:1Issues:0Issues:0

AnimateDiff

Official implementation of AnimateDiff.

License:Apache-2.0Stargazers:0Issues:0Issues:0

AnyDoor

Official implementations for paper: Anydoor: zero-shot object-level image customization

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

buzz

Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.

License:MITStargazers:0Issues:0Issues:0

Chat2DB

🔥 🔥 🔥 An intelligent and versatile general-purpose SQL client and reporting tool for databases which integrates ChatGPT capabilities.(智能的通用数据库SQL客户端和报表工具)

License:Apache-2.0Stargazers:0Issues:0Issues:0

ComfyScript

A Python front end and library for ComfyUI

License:MITStargazers:0Issues:0Issues:0

ComfyUI

The most powerful and modular stable diffusion GUI with a graph/nodes interface.

License:GPL-3.0Stargazers:0Issues:0Issues:0

comfyui_controlnet_aux

ComfyUI's ControlNet Auxiliary Preprocessors

License:Apache-2.0Stargazers:0Issues:0Issues:0
License:GPL-3.0Stargazers:0Issues:0Issues:0

elevenlabs-python

The official Python API for ElevenLabs text-to-speech.

Stargazers:0Issues:0Issues:0

fast-stable-diffusion

fast-stable-diffusion + DreamBooth

License:MITStargazers:0Issues:0Issues:0

faster-whisper

Faster Whisper transcription with CTranslate2

License:MITStargazers:0Issues:0Issues:0

gpt-engineer

Specify what you want it to build, the AI asks for clarification, and then builds it.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

infinite-zoom-automatic1111-webui

infinite zoom effect extension for AUTOMATIC1111's webui - stable diffusion

License:MITStargazers:0Issues:0Issues:0

instant-ngp

Instant neural graphics primitives: lightning fast NeRF and more

License:NOASSERTIONStargazers:0Issues:0Issues:0

low-level-design-primer

Dedicated Resources for the Low-Level System Design. Learn how to design and implement large-scale systems. Prep for the system design interview.

Stargazers:0Issues:0Issues:0

magic-animate-for-colab

MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model

License:BSD-3-ClauseStargazers:0Issues:0Issues:0
License:BSD-3-ClauseStargazers:0Issues:0Issues:0

one-click-dense-pose

One Click Dense Pose Video with just One click

Language:PythonStargazers:0Issues:0Issues:0

PhotoMaker

PhotoMaker

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:0Issues:0Issues:0

pix2pix3D

pix2pix3D: Generating 3D Objects from 2D User Inputs

License:MITStargazers:0Issues:0Issues:0

Real-ESRGAN

Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.

License:BSD-3-ClauseStargazers:0Issues:0Issues:0

richard-ryan

Richard Ryan is a fully responsive portfolio website, Responsive for all devices, build using HTML, CSS, and JavaScript.

Stargazers:0Issues:0Issues:0

roop

one-click deepfake (face swap)

License:AGPL-3.0Stargazers:0Issues:0Issues:0

SadTalker

[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation

License:MITStargazers:0Issues:0Issues:0

stable-diffusion-webui-colab

stable diffusion webui colab

License:UnlicenseStargazers:0Issues:0Issues:0

stable-ts

ASR with reliable word-level timestamps using OpenAI's Whisper

License:MITStargazers:0Issues:0Issues:0
Stargazers:0Issues:0Issues:0

whisper-timestamped

Multilingual Automatic Speech Recognition with word-level timestamps and confidence

License:AGPL-3.0Stargazers:0Issues:0Issues:0

whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Language:PythonLicense:BSD-4-ClauseStargazers:0Issues:0Issues:0