af-74413592's repositories
MixtralKit
A toolkit for inference and evaluation of 'mixtral-8x7b-32kseqlen' from Mistral AI
AnglE
Angle-optimized Text Embeddings | 🔥 SOTA on STS and MTEB Leaderboard
AniPortrait
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
Awesome-Video-Diffusion
A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.
consistency_models
A mini-library for training consistency models.
Diffusion-Tryon-Trainer
Diffusion-Tryon-Trainer
encodec
State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.
EvalCrafter
[CVPR 2024] EvalCrafter: Benchmarking and Evaluating Large Video Generation Models
Everything-of-Thoughts-XoT
An implemtation of Everyting of Thoughts (XoT).
FreeNoise-AnimateDiff
[ICLR 2024] Code for FreeNoise based on AnimateDiff
insightface
State-of-the-art 2D and 3D Face Analysis Project
langchain-ChatGLM
langchain-ChatGLM, local knowledge based ChatGLM with langchain | 基于本地知识库的 ChatGLM 问答
llm-cookbook
面向开发者的 LLM 入门教程,吴恩达大模型系列课程中文版
Megatron-LM
Ongoing research training transformer models at scale
NExT-GPT
Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model
OOTDiffusion
Official implementation of OOTDiffusion
Open-Sora
Building your own video generation model like OpenAI's Sora
Open-Sora-Plan
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
OpenDiT
OpenDiT: An Easy, Fast and Memory-Efficient System for DiT Training and Inference
pgmpy
Python Library for learning (Structure and Parameter), inference (Probabilistic and Causal), and simulations in Bayesian Networks.
PnPInversion
[ICLR2024] Official repo for paper "PnP Inversion: Boosting Diffusion-based Editing with 3 Lines of Code"
qwen-eval
通义千问的ceval打分评测示例
StreamingT2V
StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text
unsloth
2-5X faster 80% less memory QLoRA & LoRA finetuning
upsampling_guidence
an unofficial implementation of https://arxiv.org/pdf/2404.01709
VAR
[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!
Video-LLaVA
Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
VideoMV
VideoMV: Consistent Multi-View Generation Based on Large Video Generative Model