Yi Wang (shepnerd)

Company:@OpenGVLab

Location:China


Organizations
OpenGVLab

Yi Wang's starred repositories

gpt4free

The official gpt4free repository | various collection of powerful language models

Language:Python | License:GPL-3.0 | Stargazers:59563 | Issues:464 | Issues:1289

bark

🔊 Text-Prompted Generative Audio Model

Language:Jupyter Notebook | License:MIT | Stargazers:34129 | Issues:319 | Issues:427

generative-models

Generative Models by Stability AI

Language:Python | License:MIT | Stargazers:23707 | Issues:252 | Issues:289

LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

Language:Jupyter Notebook | License:BSD-3-Clause | Stargazers:9347 | Issues:97 | Issues:635

consistency_models

Official repo for consistency models.

Language:Python | License:MIT | Stargazers:6047 | Issues:60 | Issues:51

annotated_latex_equations

Examples of how to create colorful, annotated equations in LaTeX using TikZ.

Language:TeX | License:MIT | Stargazers:3738 | Issues:37 | Issues:3

open_flamingo

An open-source framework for training large multimodal models.

Language:Python | License:MIT | Stargazers:3609 | Issues:47 | Issues:173

img2dataset

Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.

Language:Python | License:MIT | Stargazers:3509 | Issues:31 | Issues:253

vault-ai

OP Vault ChatGPT: Give ChatGPT long-term memory using the OP Stack (OpenAI + Pinecone Vector Database). Upload your own custom knowledge base files (PDF, txt, epub, etc) using a simple React frontend.

Language:JavaScript | License:MIT | Stargazers:3252 | Issues:48 | Issues:59

InternGPT

InternGPT (iGPT) is an open-source demo platform where you can easily showcase your AI models. It now supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editing, etc. Try it at igpt.opengvlab.com (an online demo system supporting DragGAN, ChatGPT, ImageBind, and SAM)

Language:Python | License:Apache-2.0 | Stargazers:3175 | Issues:43 | Issues:49

Vim

[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Language:Python | License:Apache-2.0 | Stargazers:2690 | Issues:30 | Issues:103

clip-interrogator

Image to prompt with BLIP and CLIP

Language:Python | License:MIT | Stargazers:2614 | Issues:29 | Issues:94

LongLoRA

Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)

Language:Python | License:Apache-2.0 | Stargazers:2568 | Issues:12 | Issues:170

releasing-research-code

Tips for releasing research code in Machine Learning (with official NeurIPS 2020 recommendations)

mamba-minimal

Simple, minimal implementation of the Mamba SSM in one file of PyTorch.

Language:Python | License:Apache-2.0 | Stargazers:2485 | Issues:23 | Issues:25

awesome-typst

Awesome Typst Links

Language:Earthly | License:CC0-1.0 | Stargazers:2049 | Issues:26 | Issues:26

make-a-video-pytorch

Implementation of Make-A-Video, new SOTA text to video generator from Meta AI, in Pytorch

Language:Python | License:MIT | Stargazers:1884 | Issues:71 | Issues:16

awesome-openai-vision-api-experiments

Must-have resource for anyone who wants to experiment with and build on the OpenAI vision API 🔥

InternVideo

[ECCV2024] Video Foundation Models & Data for Multimodal Understanding

Language:Python | License:Apache-2.0 | Stargazers:1207 | Issues:30 | Issues:138
Language:Python | License:Apache-2.0 | Stargazers:873 | Issues:13 | Issues:30

VideoMamba

[ECCV2024] VideoMamba: State Space Model for Efficient Video Understanding

Language:Python | License:Apache-2.0 | Stargazers:734 | Issues:13 | Issues:74

LucidDreamer

Official implementation of "LucidDreamer: Towards High-Fidelity Text-to-3D Generation via Interval Score Matching"

Language:Python | License:MIT | Stargazers:724 | Issues:23 | Issues:35

CLIP_benchmark

CLIP-like model evaluation

Language:Jupyter Notebook | License:MIT | Stargazers:555 | Issues:12 | Issues:64

InfiniTransformer

Unofficial PyTorch/🤗Transformers(Gemma/Llama3) implementation of Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention

Language:Python | License:MIT | Stargazers:319 | Issues:8 | Issues:24

VPGTrans

Codes for VPGTrans: Transfer Visual Prompt Generator across LLMs. VL-LLaMA, VL-Vicuna.

Language:Python | License:BSD-3-Clause | Stargazers:266 | Issues:6 | Issues:18

ScaleVLN

[ICCV 2023 Oral]: Scaling Data Generation in Vision-and-Language Navigation

EgoVideo

[CVPR 2024 Champions] Solutions for EgoVis Challenges in CVPR 2024

Language:Jupyter Notebook | Stargazers:96 | Issues:1 | Issues:13

LORIS

Long-Term Rhythmic Video Soundtracker, ICML2023

Language:Python | License:MIT | Stargazers:54 | Issues:5 | Issues:6

TMT-VIS

Taxonomy-aware Multi-dataset Joint Training for Video Instance Segmentation (NeurIPS 23)

perception_test_iccv2023

Champion Solutions repository for Perception Test challenges in ICCV2023 workshop.

Language:Python | License:MIT | Stargazers:11 | Issues:1 | Issues:0