AliceShen122's starred repositories
Awesome-LLM-Strawberry
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.
SuperPrompt
SuperPrompt is an attempt to engineer prompts that might help us understand AI agents.
VideoCrafter
VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models
text-to-video-synthesis-colab
Text To Video Synthesis Colab
Hotshot-XL
✨ Hotshot-XL: State-of-the-art AI text-to-GIF model trained to work alongside Stable Diffusion XL
generative-models
Generative Models by Stability AI
Open-Sora-Plan
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
Text2Video-Zero
[ICCV 2023 Oral] Text-to-Image Diffusion Models are Zero-Shot Video Generators
SenseVoice
Multilingual Voice Understanding Model
fish-speech
Brand new TTS solution
ailia-models
The collection of pre-trained, state-of-the-art AI models for ailia SDK
GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
descript-audio-codec
State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.
jailbreak_llms
[CCS'24] A dataset consists of 15,140 ChatGPT prompts from Reddit, Discord, websites, and open-source datasets (including 1,405 jailbreak prompts).
RAGChecker
RAGChecker: A Fine-grained Framework For Diagnosing RAG
label-studio
Label Studio is a multi-type data labeling and annotation tool with standardized output format
MMA-Diffusion
[CVPR2024] MMA-Diffusion: MultiModal Attack on Diffusion Models
LAION-SAFETY
An open toolbox for NSFW & toxicity detection