MOOJ's starred repositories
llama_index
LlamaIndex is a data framework for your LLM applications
PhotoMaker
PhotoMaker [CVPR 2024]
Make-A-Character
Official repo for Make-A-Character: High Quality Text-to-3D Character Generation within Minutes
awesome-chatgpt-prompts
A curated collection of prompts for getting better results from ChatGPT.
streaming-llm
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
instructor
Structured outputs for LLMs
CodeFormer
[NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer
KwaiAgents
A generalized information-seeking agent system with Large Language Models (LLMs).
video-retalking
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
SadTalker-Video-Lip-Sync
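This project builds on SadTalker to implement Wav2Lip-style lip synthesis for video. It takes a video file as input and generates lip shapes driven by speech, applies a configurable enhancement to the face region to sharpen the synthesized lip (face) area, and uses the DAIN deep-learning frame-interpolation algorithm to fill in frames of the generated video, smoothing the motion between frames so the synthesized lips look more fluid, realistic, and natural.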
sd-webui-controlnet
WebUI extension for ControlNet
PowerInfer
High-speed Large Language Model Serving on PCs with Consumer-grade GPUs
stable-diffusion-webui
Stable Diffusion web UI
magic-animate
[CVPR 2024] MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
so-vits-svc
SoftVC VITS Singing Voice Conversion
pyvideotrans
Translate a video from one language to another and add dubbing.