xuanlinli17

Xuanlin (Simon) Li's repositories

CS285_Fa19_Deep_Reinforcement_Learning

My solutions to UC Berkeley CS285 (originally CS294-112, deeprlcourse) Fall 2019 assignments

Language:Python117 3 4

large_vlm_distillation_ood

Distilling Large Vision-Language Model with Out-of-Distribution Generalizability (ICCV 2023)

Language:PythonMIT56 1 2

corl_22_frame_mining

[CoRL22] Frame Mining - a Free Lunch for Learning Robotic Manipulation from 3D Point Clouds

Language:PythonApache-2.027 2 2

iclr2021_rlreg

Regularization Matters in Policy Optimization

Language:Python20 40

autoregressive_inference

Code for "Discovering Non-monotonic Autoregressive Orderings with Variational Inference" (paper and code updated from ICLR 2021)

Language:PythonMIT12 20

nanoowl

A project that optimizes OWL-ViT for real-time inference with NVIDIA TensorRT.

Language:PythonApache-2.0400

efficientvit

EfficientViT is a new family of vision models for efficient high-resolution vision.

Language:PythonApache-2.0100

MinkowskiEngine

Minkowski Engine is an auto-diff neural network library for high-dimensional sparse tensors

Language:PythonNOASSERTION000

python-pcl

Python bindings to the pointcloud library (pcl)

Language:PythonNOASSERTION000

graspnetAPI

Toolbox for our GraspNet-1Billion dataset.

Language:Python000

instant-nsr-pl

Neural Surface reconstruction based on Instant-NGP. Efficient and customizable boilerplate for your research projects. Train NeuS in 10min!

Language:PythonMIT000

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language:PythonApache-2.0000

ml-veclip

The official repo for the paper "VeCLIP: Improving CLIP Training via Visual-enriched Captions"

Language:Jupyter NotebookNOASSERTION000

MobileSAM

This is the official code for MobileSAM project that makes SAM lightweight for mobile applications and beyond!

Language:Jupyter NotebookApache-2.0000

MobileVLM

Strong and Open Vision Language Assistant for Mobile Devices

Language:PythonApache-2.0000

rlds_dataset_builder

ManiSkill2 RLDS dataset builder for X-embodiment dataset conversion.

Language:PythonApache-2.0000

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

Language:Jupyter NotebookApache-2.0000

tapnet

Tracking Any Point (TAP)

Language:PythonApache-2.0000

tensor2robot

Distributed machine learning infrastructure for large-scale robotics research

Language:PythonApache-2.0000

TensoRF

[ECCV 2022] Tensorial Radiance Fields, a novel approach to model and reconstruct radiance fields

Language:PythonMIT000

VILA

VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)

Language:PythonApache-2.0000

XMem

Language:Jupyter NotebookMIT010

XMem_fork

[ECCV 2022] XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory Model

Language:PythonMIT000

YOLO-World

[CVPR 2024] Real-Time Open-Vocabulary Object Detection

Language:Jupyter NotebookGPL-3.0000

xuanlinli17

Xuanlin (Simon) Li's repositories

CS285_Fa19_Deep_Reinforcement_Learning

large_vlm_distillation_ood

corl_22_frame_mining

iclr2021_rlreg

autoregressive_inference

nanoowl

efficientvit

xuanlinli17.github.io

MinkowskiEngine

python-pcl

activezero2_official

graspnetAPI

instant-nsr-pl

LLaVA

ml-veclip

MobileSAM

MobileVLM

rlds_dataset_builder

robotics_transformer

sam2

tapnet

tensor2robot

TensoRF

VILA

XMem

XMem_fork

YOLO-World