Yura Choi (Yuuraa)

Yuuraa

Geek Repo

Company:Yonsei Univeristy

Home Page:https://velog.io/@yoorachoi

Twitter:@Yura02786865

Github PK Tool:Github PK Tool

Yura Choi's starred repositories

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Language:PythonLicense:Apache-2.0Stargazers:35976Issues:348Issues:1734
Language:PythonLicense:NOASSERTIONStargazers:34541Issues:302Issues:350

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language:PythonLicense:Apache-2.0Stargazers:18472Issues:158Issues:1423

peft

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Language:PythonLicense:Apache-2.0Stargazers:15303Issues:105Issues:991

Awesome-Multimodal-Large-Language-Models

:sparkles::sparkles:Latest Advances on Multimodal Large Language Models

LAVIS

LAVIS - A One-stop Library for Language-Vision Intelligence

Language:Jupyter NotebookLicense:BSD-3-ClauseStargazers:9325Issues:97Issues:635

fiftyone

The open-source tool for building high-quality datasets and computer vision models

Language:PythonLicense:Apache-2.0Stargazers:7953Issues:56Issues:1483

ai-collection

The Generative AI Landscape - A Collection of Awesome Generative AI Applications

lora

Using Low-rank adaptation to quickly fine-tune diffusion models.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:6850Issues:59Issues:137

Voyager

An Open-Ended Embodied Agent with Large Language Models

Language:JavaScriptLicense:MITStargazers:5411Issues:62Issues:143

anthropic-cookbook

A collection of notebooks/recipes showcasing some fun and effective ways of using Claude.

Language:Jupyter NotebookLicense:MITStargazers:3911Issues:111Issues:23

LMOps

General technology for enabling AI capabilities w/ LLMs and MLLMs

Language:PythonLicense:MITStargazers:3465Issues:56Issues:103

Ask-Anything

[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.

Language:PythonLicense:MITStargazers:2907Issues:37Issues:200

MetaCLIP

ICLR2024 Spotlight: curation/training code, metadata, distribution and pre-trained models for MetaCLIP; CVPR 2024: MoDE: CLIP Data Experts via Clustering

Language:PythonLicense:NOASSERTIONStargazers:1133Issues:13Issues:24

Video-ChatGPT

[ACL 2024 🔥] Video-ChatGPT is a video conversation model capable of generating meaningful conversation about videos. It combines the capabilities of LLMs with a pretrained visual encoder adapted for spatiotemporal video representation. We also introduce a rigorous 'Quantitative Evaluation Benchmarking' for video-based conversational models.

Language:PythonLicense:CC-BY-4.0Stargazers:1100Issues:13Issues:113

awesome-instruction-dataset

A collection of open-source dataset to train instruction-following LLMs (ChatGPT,LLaMA,Alpaca)

Awesome-LLMs-for-Video-Understanding

🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.

CoCa-pytorch

Implementation of CoCa, Contrastive Captioners are Image-Text Foundation Models, in Pytorch

Language:PythonLicense:MITStargazers:1016Issues:14Issues:18

Awesome_Prompting_Papers_in_Computer_Vision

A curated list of prompt-based paper in computer vision and vision-language learning.

plug-and-play

Official Pytorch Implementation for “Plug-and-Play Diffusion Features for Text-Driven Image-to-Image Translation” (CVPR 2023)

Everything-LLMs-And-Robotics

The world's largest GitHub Repository for LLMs + Robotics

License:BSD-3-ClauseStargazers:727Issues:20Issues:0

x-clip

A concise but complete implementation of CLIP with various experimental improvements from recent papers

Language:PythonLicense:MITStargazers:673Issues:25Issues:14

actionformer_release

Code release for ActionFormer (ECCV 2022)

Language:PythonLicense:MITStargazers:410Issues:10Issues:132
Language:PythonLicense:Apache-2.0Stargazers:319Issues:10Issues:5

ALLaVA

Harnessing 1.4M GPT4V-synthesized Data for A Lite Vision-Language Model

Language:PythonLicense:Apache-2.0Stargazers:229Issues:11Issues:11

BIKE

【CVPR'2023】Bidirectional Cross-Modal Knowledge Exploration for Video Recognition with Pre-trained Vision-Language Models

Language:PythonLicense:MITStargazers:153Issues:12Issues:22

TVRetrieval

[ECCV 2020] PyTorch code for XML on TVRetrieval dataset - TVR: A Large-Scale Dataset for Video-Subtitle Moment Retrieval

Language:PythonLicense:MITStargazers:151Issues:8Issues:12

VidIL

Pytorch code for Language Models with Image Descriptors are Strong Few-Shot Video-Language Learners

Language:PythonLicense:MITStargazers:110Issues:5Issues:11
Language:PythonLicense:MITStargazers:97Issues:4Issues:11