BinZhu-ece

BeiJing

Bin Zhu's starred repositories

gpt4free

The official gpt4free repository | various collection of powerful language models

Language: Python | GPL-3.0 | 59953 465 1320

paper-reading

Paragraph-by-paragraph close reading of classic and new deep learning papers

Apache-2.0 | 26225 7210

llama3

The official Meta Llama 3 GitHub site

Language: Python | NOASSERTION | 26097 215 236

Open-Sora-Plan

This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to this project.

Language: Python | MIT | 11245 160 290

AnimateDiff

Official implementation of AnimateDiff.

Language: Python | Apache-2.0 | 10267 104 346

PhotoMaker

PhotoMaker [CVPR 2024]

Language: Jupyter Notebook | NOASSERTION | 9317 103 155

instruct-pix2pix

Language: Python | NOASSERTION | 6249 70 118

VideoCrafter

VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models

Language: Python | NOASSERTION | 4463 71 81

MoE-LLaVA

Mixture-of-Experts for Large Vision-Language Models

Language: Python | Apache-2.0 | 1908 24 89

Latte

Latte: Latent Diffusion Transformer for Video Generation.

Language: Python | Apache-2.0 | 1636 24 100

lang-segment-anything

SAM with text prompt

Language: Jupyter Notebook | Apache-2.0 | 1539 9 51

fastmoe

A fast MoE impl for PyTorch

Language: Python | Apache-2.0 | 1519 13 118

OpenDiT

OpenDiT: An Easy, Fast and Memory-Efficient System for DiT Training and Inference

Language: Python | Apache-2.0 | 1413 23 60

MagicTime

MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators

Language: Python | Apache-2.0 | 1270 20 29

DiffSynth-Studio

Enjoy the magic of Diffusion models!

Language: Python | Apache-2.0 | 742 20 35

OneLLM

[CVPR 2024] OneLLM: One Framework to Align All Modalities with Language

Language: Python | NOASSERTION | 552 11 24

Multimodal-AND-Large-Language-Models

Paper list about multimodal and large language models, only used to record papers I read in the daily arxiv for personal needs.

Machine-Mindset

An MBTI Exploration of Large Language Models

Language: Python | Apache-2.0 | 446 7 2

LLMGA

This project is the official implementation of 'LLMGA: Multimodal Large Language Model based Generation Assistant', ECCV2024 Oral

Language: Python | Apache-2.0 | 444 13 4

Mini-DALLE3

Mini-DALLE3: Interactive Text to Image by Prompting Large Language Models

Language: Python | 298 4 9

repaint123

Official implementation of Repaint123: Fast and High-quality One Image to 3D Generation with Progressive Controllable 2D Repainting (ECCV 2024)

parameter-efficient-moe

Language: Python | 239 17 3

SoraFlows

The most powerful and modular Sora WebUI, API, and backend with OpenAI's Sora model. Collecting the highest-quality prompts for Sora. Built with Next.js and Tailwind CSS.

Language: TypeScript | NOASSERTION | 191 20

Progressive3D

Official implementation of "Progressive3D: Progressively Local Editing for Text-to-3D Content Creation with Complex Semantic Prompts" [ICLR 2024]

Language: Python | MIT | 101 3 5

Envision3D

Envision3D: One Image to 3D with Anchor Views Interpolation

Language: Python | 101 3 7

TaxDiff

The official code for "TaxDiff: Taxonomic-Guided Diffusion Model for Protein Sequence Generation"

Language: Python | MIT | 49 5 3

EvaGaussians

ECDFormer

The official code for "Deep peak property learning for efficient chiral molecules ECD spectra prediction"

Language: Python | 28 20

web_gpt-on-wechat

With a ChatGPT account, you can use the WeChat bot for free without paying API fees; custom prompts make it easy to set the bot's role attributes and positioning.

Language: Python | MIT | 2500

fid-metrics

A toolkit for computing Fréchet Inception Distance (FID) & Fréchet Video Distance (FVD) metrics.

Language: Python | MIT | 6 20