af-74413592

followers

following

stars

af-74413592's repositories

MixtralKit

A toolkit for inference and evaluation of 'mixtral-8x7b-32kseqlen' from Mistral AI

Language:PythonApache-2.0100

AnglE

Angle-optimized Text Embeddings | 🔥 SOTA on STS and MTEB Leaderboard

MIT000

AniPortrait

AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation

Apache-2.0000

Awesome-Video-Diffusion

A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.

000

consistency_models

A mini-library for training consistency models.

MIT000

DeepSeek-MoE

MIT000

Diffusion-Tryon-Trainer

Diffusion-Tryon-Trainer

NOASSERTION000

encodec

State-of-the-art deep learning based audio codec supporting both mono 24 kHz audio and stereo 48 kHz audio.

MIT000

EvalCrafter

[CVPR 2024] EvalCrafter: Benchmarking and Evaluating Large Video Generation Models

000

Everything-of-Thoughts-XoT

An implemtation of Everyting of Thoughts (XoT).

NOASSERTION000

FreeNoise-AnimateDiff

[ICLR 2024] Code for FreeNoise based on AnimateDiff

Apache-2.0000

insightface

State-of-the-art 2D and 3D Face Analysis Project

000

langchain-ChatGLM

langchain-ChatGLM, local knowledge based ChatGLM with langchain ｜基于本地知识库的 ChatGLM 问答

Language:PythonApache-2.0000

llm-cookbook

面向开发者的 LLM 入门教程，吴恩达大模型系列课程中文版

000

longlife-chatglm

Language:Python010

Megatron-LM

Ongoing research training transformer models at scale

NOASSERTION000

NExT-GPT

Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model

BSD-3-Clause000

OOTDiffusion

Official implementation of OOTDiffusion

NOASSERTION000

Open-Sora

Building your own video generation model like OpenAI's Sora

Apache-2.0000

Open-Sora-Plan

This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.

MIT000

OpenDiT

OpenDiT: An Easy, Fast and Memory-Efficient System for DiT Training and Inference

Apache-2.0000

pgmpy

Python Library for learning (Structure and Parameter), inference (Probabilistic and Causal), and simulations in Bayesian Networks.

MIT000

PnPInversion

[ICLR2024] Official repo for paper "PnP Inversion: Boosting Diffusion-based Editing with 3 Lines of Code"

000

qwen-eval

通义千问的ceval打分评测示例

000

StreamingT2V

StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text

000

unsloth

2-5X faster 80% less memory QLoRA & LoRA finetuning

Apache-2.0000

upsampling_guidence

an unofficial implementation of https://arxiv.org/pdf/2404.01709

Language:Python000

VAR

[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!

MIT000

Video-LLaVA

Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

Apache-2.0000

VideoMV

VideoMV: Consistent Multi-View Generation Based on Large Video Generative Model

MIT000