Bin Zhu (BinZhu-ece)

Location: Beijing

Bin Zhu's repositories

LLMBind

LLMBind: A Unified Modality-Task Integration Framework

Language: Python | License: Apache-2.0 | Stargazers: 7 | Issues: 0 | Issues: 0
Language: Python | License: MIT | Stargazers: 1 | Issues: 0 | Issues: 0

AnimateDiff

Official implementation of AnimateDiff.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

chatgpt-on-wechat

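WeChat robot based on ChatGPT, using the OpenAI API and the itchat library. Builds a WeChat chatbot with ChatGPT on top of the GPT-3.5/4.0 API; supports deployment to personal WeChat, official accounts, and WeCom (enterprise WeChat); handles text, voice, and images, and can access the operating system and the internet.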

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

consistency_models

Official repo for consistency models.

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

ControlNet

Let us control diffusion models!

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

denoising-diffusion-pytorch

Implementation of Denoising Diffusion Probabilistic Model in Pytorch

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

Dlink_Parse

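Parses download addresses for Youku, Tencent Video, Bilibili, Douyin, Mango TV, iQIYI, PP Video, Migu Video, AcFun, Kuaishou, Sohu Video, QQ Music, NetEase Cloud Music, Kuwo Music, Tencent Classroom, Xigua Video, and others.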

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

fastmoe

A fast MoE impl for PyTorch

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

Grounded-Segment-Anything

Marrying Grounding DINO with Segment Anything & Stable Diffusion & Tag2Text & BLIP & Whisper & ChatBot - Automatically Detect, Segment and Generate Anything with Image, Text, and Audio Inputs

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

hfai-models

HFAI deep learning models

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

Latte

Latte: Latent Diffusion Transformer for Video Generation.

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

LaVIN

[NeurIPS 2023] Official implementations of "Cheap and Quick: Efficient Vision-Language Instruction Tuning for Large Language Models"

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

LISA

Project Page for "LISA: Reasoning Segmentation via Large Language Model"

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning: LLaVA (Large Language-and-Vision Assistant) built towards GPT-4V level capabilities.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

lux

👾 Fast and simple video download library and CLI tool written in Go

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

MagicTime

MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators

License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

minGPT

A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

Open-Sora-Plan

This project aims to reproduce Sora (OpenAI's T2V model), but we have only limited resources. We deeply hope the whole open-source community can contribute to this project.

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

opencompass

OpenCompass is an LLM evaluation platform, supporting a wide range of models (LLaMA, LLaMa2, ChatGLM2, ChatGPT, Claude, etc) over 50+ datasets.

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

SEED-Bench

A benchmark for evaluating Multimodal LLMs using multiple-choice questions.

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

stable-diffusion

A latent text-to-image diffusion model

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

TaiSu

TaiSu (太素): a large-scale Chinese multimodal dataset (a hundred-million-scale Chinese vision-language pre-training dataset)

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

taming-transformers

Taming Transformers for High-Resolution Image Synthesis

Language: Jupyter Notebook | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0
Stargazers: 0 | Issues: 1 | Issues: 0
Stargazers: 0 | Issues: 0 | Issues: 0

VALOR

Codes and Models for VALOR: Vision-Audio-Language Omni-Perception Pretraining Model and Dataset

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

Video-Bench

A Comprehensive Benchmark and Toolkit for Evaluating Video-based Large Language Models!

Stargazers: 0 | Issues: 0 | Issues: 0

Video-LLaMA

Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding

Language: Python | License: BSD-3-Clause | Stargazers: 0 | Issues: 0 | Issues: 0