Bin Zhu (BinZhu-ece)

BinZhu-ece

Geek Repo

Location:BeiJing

Github PK Tool:Github PK Tool

Bin Zhu's starred repositories

gpt4free

The official gpt4free repository | various collection of powerful language models

Language:PythonLicense:GPL-3.0Stargazers:59953Issues:465Issues:1320

paper-reading

深度学习经典、新论文逐段精读

License:Apache-2.0Stargazers:26225Issues:721Issues:0

llama3

The official Meta Llama 3 GitHub site

Language:PythonLicense:NOASSERTIONStargazers:26097Issues:215Issues:236

Open-Sora-Plan

This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.

Language:PythonLicense:MITStargazers:11245Issues:160Issues:290

AnimateDiff

Official implementation of AnimateDiff.

Language:PythonLicense:Apache-2.0Stargazers:10267Issues:104Issues:346

PhotoMaker

PhotoMaker [CVPR 2024]

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:9317Issues:103Issues:155
Language:PythonLicense:NOASSERTIONStargazers:6249Issues:70Issues:118

VideoCrafter

VideoCrafter2: Overcoming Data Limitations for High-Quality Video Diffusion Models

Language:PythonLicense:NOASSERTIONStargazers:4463Issues:71Issues:81

MoE-LLaVA

Mixture-of-Experts for Large Vision-Language Models

Language:PythonLicense:Apache-2.0Stargazers:1908Issues:24Issues:89

Latte

Latte: Latent Diffusion Transformer for Video Generation.

Language:PythonLicense:Apache-2.0Stargazers:1636Issues:24Issues:100

lang-segment-anything

SAM with text prompt

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:1539Issues:9Issues:51

fastmoe

A fast MoE impl for PyTorch

Language:PythonLicense:Apache-2.0Stargazers:1519Issues:13Issues:118

OpenDiT

OpenDiT: An Easy, Fast and Memory-Efficient System for DiT Training and Inference

Language:PythonLicense:Apache-2.0Stargazers:1413Issues:23Issues:60

MagicTime

MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators

Language:PythonLicense:Apache-2.0Stargazers:1270Issues:20Issues:29

DiffSynth-Studio

Enjoy the magic of Diffusion models!

Language:PythonLicense:Apache-2.0Stargazers:742Issues:20Issues:35

OneLLM

[CVPR 2024] OneLLM: One Framework to Align All Modalities with Language

Language:PythonLicense:NOASSERTIONStargazers:552Issues:11Issues:24

Multimodal-AND-Large-Language-Models

Paper list about multimodal and large language models, only used to record papers I read in the daily arxiv for personal needs.

Machine-Mindset

An MBTI Exploration of Large Language Models

Language:PythonLicense:Apache-2.0Stargazers:446Issues:7Issues:2

LLMGA

This project is the official implementation of 'LLMGA: Multimodal Large Language Model based Generation Assistant', ECCV2024 Oral

Language:PythonLicense:Apache-2.0Stargazers:444Issues:13Issues:4

Mini-DALLE3

Mini-DALLE3: Interactive Text to Image by Prompting Large Language Models

repaint123

Official implementation of Repaint123: Fast and High-quality One Image to 3D Generation with Progressive Controllable 2D Repainting (ECCV 2024)

SoraFlows

The most powerful and modular Sora WebUI, api and backend with OpenAI's Sora Model. Collecting the highest quality prompts for Sora. using NextJs and Tailwind CSS

Language:TypeScriptLicense:NOASSERTIONStargazers:191Issues:2Issues:0

Progressive3D

Official implementation of "Progressive3D: Progressively Local Editing for Text-to-3D Content Creation with Complex Semantic Prompts" [ICLR 2024]

Language:PythonLicense:MITStargazers:101Issues:3Issues:5

Envision3D

Envision3D: One Image to 3D with Anchor Views Interpolation

TaxDiff

The official code for "TaxDiff: Taxonomic-Guided Diffusion Model for Protein Sequence Generation"

Language:PythonLicense:MITStargazers:49Issues:5Issues:3

ECDFormer

The official code for "Deep peak property learning for efficient chiral molecules ECD spectra prediction"

Language:PythonStargazers:28Issues:2Issues:0

web_gpt-on-wechat

有chatgpt账户即可白嫖使用微信机器人,无需支付api费用;且通过自定义提示词很方便的为微信机器人设置好角色属性、定位。"With a ChatGPT account, you can easily use the WeChat bot for free without paying API fees; and it's convenient to set up role attributes and positioning for the WeChat bot through custom prompt words."

Language:PythonLicense:MITStargazers:25Issues:0Issues:0

fid-metrics

A toolkit for computing Fréchet Inception Distance (FID) & Fréchet Video Distance (FVD) metrics.

Language:PythonLicense:MITStargazers:6Issues:2Issues:0