jianghaoyuan1994

jianghaoyuan1994's starred repositories

Diff4RLSurvey

This repository contains a collection of resources and papers on Diffusion Models for RL, accompanying the paper "Diffusion Models for Reinforcement Learning: A Survey"

Apache-2.031200

diffuser-control-tutorial

Apache-2.05000

GESA

100

X-Light

MIT200

Tutorial-on-PhD-Application

Tutorial on PhD Application

76400

gpt-investor

Language:Jupyter NotebookMIT220300

LLM-Agent-Paper-List

The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.

557800

XAgent

An Autonomous LLM Agent for Complex Task Solving

Language:PythonApache-2.0772300

DiT

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Language:PythonNOASSERTION543000

minbpe

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Language:PythonMIT853100

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language:PythonApache-2.01737500

MiniCPM

MiniCPM-2B: An end-side LLM outperforming Llama2-13B.

Language:Jupyter NotebookApache-2.0424600

waymax

A JAX-based simulator for autonomous driving research.

Language:PythonNOASSERTION78500

llama.cpp

LLM inference in C/C++

Language:C++MIT6003900

Visualizer

assistant tools for attention visualization in deep learning

Language:Jupyter NotebookApache-2.088700

graphsage-simple

Simple reference implementation of GraphSAGE.

Language:Python97700

awesome-ai-agents

A list of AI autonomous agents

NOASSERTION766700

build-your-own-x

Master programming by recreating your favorite technologies from scratch.

27252500

dlpack

common in-memory tensor structure

Language:PythonApache-2.086500

skrl

Modular reinforcement learning library (on PyTorch and JAX) with support for NVIDIA Isaac Gym, Isaac Orbit and Omniverse Isaac Gym

Language:PythonMIT42700

madbg

A fully-featured remote and preemptive debugger for python

Language:PythonMIT23200

PyTorch-RL

PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.

Language:PythonMIT106600

jax

Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more

Language:PythonApache-2.02875000

Eureka

Official Repository for "Eureka: Human-Level Reward Design via Coding Large Language Models" (ICLR 2024)

Language:Jupyter NotebookMIT270300

torchbeast

A PyTorch Platform for Distributed RL

Language:PythonApache-2.073500

End-to-end-Autonomous-Driving

All you need for End-to-end Autonomous Driving

MIT157200

Awesome-Embodied-Agent-with-LLMs

This is a curated list of "Embodied AI or robot with Large Language Models" research. Watch this repository for the latest updates!

64600

gpt-engineer

Specify what you want it to build, the AI asks for clarification, and then builds it.

Language:PythonMIT5102100

babyagi

Language:PythonMIT1951600

ToolLearningPapers

Apache-2.079700