Hyojung Han's repositories
a-person-mask-generator
Extension for Automatic1111 and ComfyUI to automatically create masks for Background/Hair/Body/Face/Clothes in Img2Img
ai-notes
notes for software engineers getting up to speed on new AI developments. Serves as datastore for https://latent.space writing, and product brainstorming, but has cleaned up canonical references under the /Resources folder.
AnimateDiff
Official implementation of AnimateDiff.
animatediff-cli-prompt-travel
animatediff prompt travel
ask-multiple-pdfs
A Langchain app that allows you to chat with multiple PDFs
clone-voice
一个带web界面的声音克隆工具,使用你的音色或任意声音来录制音频
DynamiCrafter
DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
embedchain
Framework to easily create LLM powered bots over any dataset.
facefusion
Next generation face swapper and enhancer
gpt-researcher
GPT based autonomous agent that does online comprehensive research on any given topic
GPTs
leaked prompts of GPTs
IDM-VTON
IDM-VTON : Improving Diffusion Models for Authentic Virtual Try-on in the Wild
infiniteGPT
InfiniteGPT is a Python script that lets you input an unlimited size text into the OpenAI API. No more tedious copy & pasting. Long live multithreading!
MoneyPrinter
Automate Creation of YouTube Shorts using MoviePy.
Mr.-Ranedeer-AI-Tutor
A GPT-4 AI Tutor Prompt for customizable personalized learning experiences.
open-chat-video-editor
Open source short video automatic generation tool
paper-summarizer
A Slack Bot of paper summarization for arXiv papers, powered by OpenAI LLMs.
Personalize-SAM
Personalize Segment Anything Model (SAM) with 1 shot in 10 seconds
rembg
Rembg is a tool to remove images background.
SUPIR
SUPIR aims at developing Practical Algorithms for Photo-Realistic Image Restoration In the Wild
system-design-101
Explain complex systems using visuals and simple terms. Help you prepare for system design interviews.
text-generation-webui-colab
A colab gradio web UI for running Large Language Models
VALL-E-X
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io
Wav2Lip
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Multimedia 2020.
Whisper-WebUI
A Web UI for easy subtitle using whisper model.
whisper.api
This project provides an API with user level access support to transcribe speech to text using a finetuned and processed Whisper ASR model.