Shuhuai Ren (RenShuhuai-Andy)

Company: Peking University

Location: Beijing, China

Home Page: https://renshuhuai-andy.github.io/

Organizations
lancopku

Shuhuai Ren's starred repositories

modern-unix

A collection of modern/faster/saner alternatives to common unix commands.

llm-course

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Language: Jupyter Notebook · License: Apache-2.0 · Stargazers: 29843 · Issues: 315 · Issues: 53

WeChatMsg

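Extract WeChat chat history and export it to HTML, Word, or Excel documents for permanent storage; analyze the chat history to generate an annual chat report; and use the chat data to train a personal AI chat assistant.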

Language: Python · License: GPL-3.0 · Stargazers: 28634 · Issues: 156 · Issues: 354

DocsGPT

GPT-powered chat for documentation, chat with your documents

Language: Python · License: MIT · Stargazers: 14248 · Issues: 87 · Issues: 339

LazyVim

Neovim config for the lazy

Language: Lua · License: Apache-2.0 · Stargazers: 13176 · Issues: 51 · Issues: 891

ml-engineering

Machine Learning Engineering Open Book

Language: Python · License: CC-BY-SA-4.0 · Stargazers: 9913 · Issues: 99 · Issues: 17

mistral-src

Reference implementation of Mistral AI 7B v0.1 model.

Language: Jupyter Notebook · License: Apache-2.0 · Stargazers: 8759 · Issues: 114 · Issues: 115

trl

Train transformer language models with reinforcement learning.

Language: Python · License: Apache-2.0 · Stargazers: 8275 · Issues: 75 · Issues: 901

text-generation-inference

Large Language Model Text Generation Inference

Language: Python · License: Apache-2.0 · Stargazers: 8041 · Issues: 98 · Issues: 1083

Qwen-VL

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Language: Python · License: NOASSERTION · Stargazers: 3888 · Issues: 44 · Issues: 351

LLaMA2-Accessory

An Open-source Toolkit for LLM Development

Language: Python · License: NOASSERTION · Stargazers: 2544 · Issues: 36 · Issues: 126

whisper-timestamped

Multilingual Automatic Speech Recognition with word-level timestamps and confidence

Language: Python · License: AGPL-3.0 · Stargazers: 1565 · Issues: 24 · Issues: 132

Awesome-Video-Diffusion-Models

[Arxiv] A Survey on Video Diffusion Models

bolei_awesome_posters

CVPR and NeurIPS poster examples and templates. May we have in-person poster session soon!

CVinW_Readings

A collection of papers on the topic of "Computer Vision in the Wild (CVinW)"

phenaki-pytorch

Implementation of Phenaki Video, which uses Mask GIT to produce text guided videos of up to 2 minutes in length, in Pytorch

Language: Python · License: MIT · Stargazers: 719 · Issues: 39 · Issues: 29

Chat-UniVi

[CVPR 2024 Highlight🔥] Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding

Language: Python · License: Apache-2.0 · Stargazers: 647 · Issues: 7 · Issues: 32

Multimodal-AND-Large-Language-Models

Paper list about multimodal and large language models, only used to record papers I read in the daily arxiv for personal needs.

CS-PhD-Application-fee-waivers

Collections of CS PhD Application Fee Waivers of schools in North America

GPT4V-AD-Exploration

On the Road with GPT-4V(ision): Explorations of Utilizing Visual-Language Model as Autonomous Driving Agent

TimeChat

[CVPR 2024] TimeChat: A Time-sensitive Multimodal Large Language Model for Long Video Understanding

Language: Python · License: BSD-3-Clause · Stargazers: 198 · Issues: 5 · Issues: 15

Text4Vis

【AAAI'2023 & IJCV】Transferring Vision-Language Models for Visual Recognition: A Classifier Perspective

Language: Python · License: MIT · Stargazers: 197 · Issues: 6 · Issues: 20

MathVista

MathVista: data, code, and evaluation for Mathematical Reasoning in Visual Contexts

Language: Jupyter Notebook · License: CC-BY-SA-4.0 · Stargazers: 181 · Issues: 4 · Issues: 21

VideoDirectorGPT

Official implementation of VideoDirectorGPT: Consistent Multi-scene Video Generation via LLM-Guided Planning

PCA-EVAL

PCA-Bench: Evaluating Multimodal Large Language Models in Perception-Cognition-Action Chain

Language: Jupyter Notebook · Stargazers: 92 · Issues: 5 · Issues: 3

FETV

[NeurIPS 2023 Datasets and Benchmarks] "FETV: A Benchmark for Fine-Grained Evaluation of Open-Domain Text-to-Video Generation", Yuanxin Liu, Lei Li, Shuhuai Ren, Rundong Gao, Shicheng Li, Sishuo Chen, Xu Sun, Lu Hou

TESTA

[EMNLP 2023] TESTA: Temporal-Spatial Token Aggregation for Long-form Video-Language Understanding

Language: Python · License: MIT · Stargazers: 40 · Issues: 2 · Issues: 0

GPT-4V-API

Self-hosted GPT-4V api

Language: JavaScript · License: MIT · Stargazers: 30 · Issues: 1 · Issues: 1

PDE

Official repo of Progressive Data Expansion: data, code and evaluation

Language: Jupyter Notebook · License: MIT · Stargazers: 23 · Issues: 2 · Issues: 0

Language: Python · Stargazers: 11 · Issues: 2 · Issues: 0