Yura Choi (Yuuraa)


Company: Yonsei University

Home Page: https://velog.io/@yoorachoi

Twitter: @Yura02786865


Yura Choi's starred repositories

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Language: Python | License: Apache-2.0 | Stargazers: 35499 | Issues: 347 | Issues: 1715
Language: Python | License: NOASSERTION | Stargazers: 34523 | Issues: 305 | Issues: 351

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language: Python | License: Apache-2.0 | Stargazers: 17775 | Issues: 157 | Issues: 1370

peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Language: Python | License: Apache-2.0 | Stargazers: 14821 | Issues: 105 | Issues: 943

Awesome-Multimodal-Large-Language-Models

✨✨ Latest Advances on Multimodal Large Language Models

LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

Language: Jupyter Notebook | License: BSD-3-Clause | Stargazers: 9149 | Issues: 96 | Issues: 626

fiftyone

The open-source tool for building high-quality datasets and computer vision models

Language: Python | License: Apache-2.0 | Stargazers: 7840 | Issues: 54 | Issues: 1465

ai-collection

The Generative AI Landscape - A Collection of Awesome Generative AI Applications

lora

Using Low-rank adaptation to quickly fine-tune diffusion models.

Language: Jupyter Notebook | License: Apache-2.0 | Stargazers: 6759 | Issues: 59 | Issues: 137

Voyager

An Open-Ended Embodied Agent with Large Language Models

Language: JavaScript | License: MIT | Stargazers: 5328 | Issues: 62 | Issues: 139

LMOps

General technology for enabling AI capabilities w/ LLMs and MLLMs

Language: Python | License: MIT | Stargazers: 3348 | Issues: 58 | Issues: 94

Ask-Anything

[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.

Language: Python | License: MIT | Stargazers: 2841 | Issues: 37 | Issues: 184

MetaCLIP

ICLR2024 Spotlight: curation/training code, metadata, distribution and pre-trained models for MetaCLIP; CVPR 2024: MoDE: CLIP Data Experts via Clustering

Language: Python | License: NOASSERTION | Stargazers: 1101 | Issues: 13 | Issues: 23

Video-ChatGPT

[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for spatiotemporal video representation. We also introduce a rigorous 'Quantitative Evaluation Benchmarking' for video-based conversational models.

Language: Python | License: CC-BY-4.0 | Stargazers: 1041 | Issues: 14 | Issues: 107

awesome-instruction-dataset

A collection of open-source datasets to train instruction-following LLMs (ChatGPT, LLaMA, Alpaca)

CoCa-pytorch

Implementation of CoCa, Contrastive Captioners are Image-Text Foundation Models, in Pytorch

Language: Python | License: MIT | Stargazers: 1004 | Issues: 14 | Issues: 18

Awesome-LLMs-for-Video-Understanding

🔥🔥🔥 Latest Papers, Codes and Datasets on Vid-LLMs.

Awesome_Prompting_Papers_in_Computer_Vision

A curated list of prompt-based papers in computer vision and vision-language learning.

plug-and-play

Official Pytorch Implementation for “Plug-and-Play Diffusion Features for Text-Driven Image-to-Image Translation” (CVPR 2023)

Everything-LLMs-And-Robotics

The world's largest GitHub Repository for LLMs + Robotics

License: BSD-3-Clause | Stargazers: 706 | Issues: 20 | Issues: 0

x-clip

A concise but complete implementation of CLIP with various experimental improvements from recent papers

Language: Python | License: MIT | Stargazers: 664 | Issues: 25 | Issues: 14

actionformer_release

Code release for ActionFormer (ECCV 2022)

Language: Python | License: MIT | Stargazers: 403 | Issues: 10 | Issues: 131
Language: Python | License: Apache-2.0 | Stargazers: 314 | Issues: 11 | Issues: 5

ALLaVA

Harnessing 1.4M GPT4V-synthesized Data for A Lite Vision-Language Model

Language: Python | License: Apache-2.0 | Stargazers: 200 | Issues: 11 | Issues: 11

TVRetrieval

[ECCV 2020] PyTorch code for XML on TVRetrieval dataset - TVR: A Large-Scale Dataset for Video-Subtitle Moment Retrieval

Language: Python | License: MIT | Stargazers: 151 | Issues: 8 | Issues: 12

BIKE

【CVPR'2023】Bidirectional Cross-Modal Knowledge Exploration for Video Recognition with Pre-trained Vision-Language Models

Language: Python | License: MIT | Stargazers: 151 | Issues: 12 | Issues: 20

VidIL

Pytorch code for Language Models with Image Descriptors are Strong Few-Shot Video-Language Learners

Language: Python | License: MIT | Stargazers: 110 | Issues: 5 | Issues: 11
Language: Python | License: MIT | Stargazers: 95 | Issues: 4 | Issues: 11

GVRT

[ECCV-2022] Grounding Visual Representations with Texts for Domain Generalization