Shuhuai Ren's starred repositories
modern-unix
A collection of modern/faster/saner alternatives to common unix commands.
llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
ml-engineering
Machine Learning Engineering Open Book
mistral-src
Reference implementation of Mistral AI 7B v0.1 model.
text-generation-inference
Large Language Model Text Generation Inference
LLaMA2-Accessory
An Open-source Toolkit for LLM Development
whisper-timestamped
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
Awesome-Video-Diffusion-Models
[arXiv] A Survey on Video Diffusion Models
bolei_awesome_posters
CVPR and NeurIPS poster examples and templates. May we have in-person poster sessions soon!
CVinW_Readings
A collection of papers on the topic of "Computer Vision in the Wild (CVinW)"
phenaki-pytorch
Implementation of Phenaki Video, which uses MaskGIT to produce text-guided videos of up to 2 minutes in length, in PyTorch
Chat-UniVi
[CVPR 2024 Highlight🔥] Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding
Multimodal-AND-Large-Language-Models
A paper list on multimodal and large language models, kept as a personal record of the papers I read from the daily arXiv listings.
CS-PhD-Application-fee-waivers
A collection of CS PhD application fee waivers for schools in North America
GPT4V-AD-Exploration
On the Road with GPT-4V(ision): Explorations of Utilizing Visual-Language Model as Autonomous Driving Agent
VideoDirectorGPT
Official implementation of VideoDirectorGPT: Consistent Multi-scene Video Generation via LLM-Guided Planning
GPT-4V-API
Self-hosted GPT-4V API