SenZHANG-GitHub

Sen ZHANG's starred repositories

LLaVA-UHD

LLaVA-UHD: an LMM Perceiving Any Aspect Ratio and High-Resolution Images

Language: Python | 21800

xtuner

An efficient, flexible and full-featured toolkit for fine-tuning LLM (InternLM2, Llama3, Phi3, Qwen, Mistral, ...)

Language: Python | Apache-2.0 | 293700

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language: Python | Apache-2.0 | 1717000

X2-VLM

All-In-One VLM: Image + Video + Transfer to Other Languages / Domains (TPAMI 2023)

Language: Python | BSD-3-Clause | 11400

Bunny

A family of lightweight multimodal models.

Language: Python | Apache-2.0 | 71100

self-correction-llm-papers

This is a collection of research papers for Self-Correcting Large Language Models with Automated Feedback.

Apache-2.0 | 33000

SurgicalPart-SAM

Official implementation of SurgicalPart-SAM (SP-SAM)

1100

Awesome-CV-Foundational-Models

42500

stable-baselines3

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Language: Python | MIT | 814600

cleanrl

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Language: Python | NOASSERTION | 464000

funsearch

Language: Jupyter Notebook | Apache-2.0 | 64300

LightZero

[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios

Language: Python | Apache-2.0 | 90300

SurgicalGym

High-performance GPU-based simulation platform for reinforcement learning with surgical robot learning

Language: Python | MIT | 3900

DEX

[ICRA'23] Demonstration-Guided Reinforcement Learning with Efficient Exploration for Task Automation of Surgical Robot

Language: Python | MIT | 2900

alpha-zero-general

A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more

Language: Jupyter Notebook | MIT | 369900

controlgym

Controlgym: Large-Scale Control Environments for Benchmarking Reinforcement Learning Algorithms

Language: Python | MIT | 1500

IROS2023PaperList

IROS2023 Paper List

10100

Reinforcement-Learning-2nd-Edition-by-Sutton-Exercise-Solutions

Solutions of Reinforcement Learning, An Introduction

Language: Jupyter Notebook | MIT | 190600

HyQ

Official code repo for paper: Hybrid RL: Using both offline and online data can make RL efficient.

Language: Python | 2100

SurgicalSAM

Official implementation of SurgicalSAM

Language: Python | MIT | 4900

mup

maximal update parametrization (µP)

Language: Jupyter Notebook | MIT | 120600

Awesome-Healthcare-Foundation-Models

MIT | 34600

prm800k

800,000 step-level correctness labels on LLM solutions to MATH problems

Language: Python | MIT | 130700

Reinforcement-Learning-Papers

Related papers for reinforcement learning, including classic papers and latest papers in top conferences

MIT | 24000

safe-rlhf

Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

Language: Python | Apache-2.0 | 119000

test

Measuring Massive Multitask Language Understanding | ICLR 2021

Language: Python | MIT | 100000

Prompt4ReasoningPapers

[ACL 2023] Reasoning with Language Model Prompting: A Survey

MIT | 81500

stable-baselines

A fork of OpenAI Baselines, implementations of reinforcement learning algorithms

Language: Python | MIT | 406800

PPO

PPO implementation for OpenAI gym environment based on Unity ML Agents

Language: Python | 14300

LLMsPracticalGuide

A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)

882000