Qihang Yu (yucornetto)

yucornetto

Geek Repo

Company:Johns Hopkins University

Location:Baltimore

Home Page:https://yucornetto.github.io/

Github PK Tool:Github PK Tool

Qihang Yu's starred repositories

gpt4free

The official gpt4free repository | various collection of powerful language models

Language:PythonLicense:GPL-3.0Stargazers:58632Issues:456Issues:1224

llama

Inference code for Llama models

Language:PythonLicense:NOASSERTIONStargazers:53812Issues:509Issues:923

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Language:PythonLicense:Apache-2.0Stargazers:33296Issues:337Issues:2585

llama_index

LlamaIndex is a data framework for your LLM applications

Language:PythonLicense:MITStargazers:32489Issues:232Issues:4135

diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.

Language:PythonLicense:Apache-2.0Stargazers:23268Issues:193Issues:3634

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language:PythonLicense:Apache-2.0Stargazers:17330Issues:155Issues:1344

LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

Language:Jupyter NotebookLicense:BSD-3-ClauseStargazers:9023Issues:95Issues:618

mistral-src

Reference implementation of Mistral AI 7B v0.1 model.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:8772Issues:116Issues:115

open_flamingo

An open-source framework for training large multimodal models.

Language:PythonLicense:MITStargazers:3519Issues:47Issues:170

Detic

Code release for "Detecting Twenty-thousand Classes using Image-level Supervision".

Language:PythonLicense:Apache-2.0Stargazers:1800Issues:21Issues:102

Emu

Emu Series: Generative Multimodal Models from BAAI

Language:PythonLicense:Apache-2.0Stargazers:1528Issues:21Issues:84
Language:Jupyter NotebookLicense:MITStargazers:918Issues:23Issues:37

mmc4

MultimodalC4 is a multimodal extension of c4 that interleaves millions of images with text.

Language:PythonLicense:MITStargazers:875Issues:9Issues:17
Language:PythonLicense:NOASSERTIONStargazers:701Issues:8Issues:61

GPT4RoI

GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest

Language:PythonLicense:NOASSERTIONStargazers:469Issues:8Issues:42

lvis-api

Python API for LVIS Dataset

Language:PythonLicense:NOASSERTIONStargazers:399Issues:12Issues:31

all-seeing

[ICLR 2024] This is the official implementation of the paper "The All-Seeing Project: Towards Panoptic Visual Recognition and Understanding of the Open World"

open-muse

Open reproduction of MUSE for fast text2image generation.

Language:PythonLicense:Apache-2.0Stargazers:294Issues:38Issues:27
Language:Jupyter NotebookLicense:BSD-3-ClauseStargazers:275Issues:6Issues:14

fc-clip

[NeurIPS 2023] This repo contains the code for our paper Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen Convolutional CLIP

Language:PythonLicense:Apache-2.0Stargazers:260Issues:16Issues:28

DETA

Detection Transformers with Assignment

Language:PythonLicense:Apache-2.0Stargazers:233Issues:5Issues:25

3D-TransUNet

This is the official repository for the paper "3D TransUNet: Advancing Medical Image Segmentation through Vision Transformers"

Language:PythonLicense:Apache-2.0Stargazers:156Issues:3Issues:30

CLIPSelf

[ICLR2024 Spotlight] Code Release of CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction

Language:PythonLicense:NOASSERTIONStargazers:144Issues:6Issues:21

ViTamin

[CVPR 2024] Official implementation of "ViTamin: Designing Scalable Vision Models in the Vision-language Era"

Language:PythonLicense:Apache-2.0Stargazers:131Issues:5Issues:8

qa-lora

Official PyTorch implementation of QA-LoRA

Language:PythonLicense:MITStargazers:94Issues:4Issues:32

OmniScient-Model

This repo contains the code for our paper Towards Open-Ended Visual Recognition with Large Language Model

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:88Issues:10Issues:4

kmax-deeplab

a PyTorch re-implementation of ECCV 2022 paper based on Detectron2: k-means mask Transformer.

Language:PythonLicense:Apache-2.0Stargazers:64Issues:7Issues:3

MaXTron

This repo contains the code for our paper MaXTron: Mask Transformer with Trajectory Attention for Video Panoptic Segmentation

Language:PythonLicense:Apache-2.0Stargazers:26Issues:6Issues:2