Wilson Yan's repositories
Video-LLaVA
Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
video2dataset
Easily create large video datasets from video URLs
flaxmodels
Pretrained models for Jax/Flax: StyleGAN2, GPT2, VGG, ResNet, etc.
habitat-sim
A flexible, high-performance 3D simulator for Embodied AI research.
LLMTest_NeedleInAHaystack
Doing simple retrieval from LLM models at various context lengths to measure accuracy
long-video-gan
Official PyTorch implementation of LongVideoGAN
Megatron-LM
Ongoing research training transformer models at scale
mmc4
MultimodalC4 is a multimodal extension of C4 that interleaves millions of images with text.
taming-transformers
Taming Transformers for High-Resolution Image Synthesis
trafilatura
Python & command-line tool to gather text on the Web: web crawling/scraping, extraction of text, metadata, comments