wilson1yan

Wilson Yan's repositories

VideoGPT

Language: Jupyter Notebook | License: MIT | 914 23 37

teco

Language: Python | 96 5 2

contrastive-forward-model

Language: Python | 30 5 6

povt

Language: Python | 11 3 1

i3d-jax

Language: Python | License: MIT | 2 30

collect-minecraft

Language: Python | 1 30

Video-LLaVA

Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

Language: Python | License: Apache-2.0 | 1 10

video2dataset

Easily create large video datasets from video URLs

Language: Python | License: MIT | 1 10

collect-habitat

Language: Python | 030

cs330

Language: Python | 020

deepul

Language: Jupyter Notebook | 010

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Language: Python | License: Apache-2.0 | 010

flaxmodels

Pretrained models for Jax/Flax: StyleGAN2, GPT2, VGG, ResNet, etc.

Language: Python | 010

habitat-sim

A flexible, high-performance 3D simulator for Embodied AI research.

Language: C++ | License: MIT | 020

htmldate

Fast and robust date extraction from web pages, with Python or on the command-line

Language: Python | License: GPL-3.0 | 020

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning: LLaVA (Large Language-and-Vision Assistant) built towards GPT-4V level capabilities.

Language: Python | License: Apache-2.0 | 010

LLMTest_NeedleInAHaystack

Doing simple retrieval from LLM models at various context lengths to measure accuracy

Language: Python | 000

long-video-gan

Official PyTorch implementation of LongVideoGAN

Language: Python | License: NOASSERTION | 020

LongChat

Official repository for LongChat and LongEval

Language: Python | License: Apache-2.0 | 010

Megatron-LM

Ongoing research training transformer models at scale

Language: Python | License: NOASSERTION | 010

mmc4

MultimodalC4 is a multimodal extension of c4 that interleaves millions of images with text.

Language: Python | License: MIT | 000

MMMU

This repo contains evaluation code for the paper "MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI"

Language: Python | License: Apache-2.0 | 010

RaMViD

Language: Python | License: MIT | 020

shell_scripts

Language: Shell | 030

taming-transformers

Taming Transformers for High-Resolution Image Synthesis

Language: Jupyter Notebook | License: MIT | 020

TATS

Official PyTorch implementation of TATS: A Long Video Generation Framework with Time-Agnostic VQGAN and Time-Sensitive Transformer (ECCV 2022)

Language: Python | License: MIT | 010

trafilatura

Python & command-line tool to gather text on the Web: web crawling/scraping, extraction of text, metadata, comments

Language: Python | License: GPL-3.0 | 020

tux

Tools and Utils for Experiments (TUX). Modified from many others' code to fit my needs.

Language: Python | 010

Valley

The official repository of "Video assistant towards large language model makes everything easy"

Language: Python | 010

wilson1yan.github.io

Language: HTML | 020