jianghaoyuan1994

jianghaoyuan1994

Geek Repo

Company:Baidu

Location:ShenZhen

Github PK Tool:Github PK Tool

jianghaoyuan1994's starred repositories

Diff4RLSurvey

This repository contains a collection of resources and papers on Diffusion Models for RL, accompanying the paper "Diffusion Models for Reinforcement Learning: A Survey"

License:Apache-2.0Stargazers:312Issues:0Issues:0
License:Apache-2.0Stargazers:50Issues:0Issues:0
Stargazers:1Issues:0Issues:0
License:MITStargazers:2Issues:0Issues:0

Tutorial-on-PhD-Application

Tutorial on PhD Application

Stargazers:764Issues:0Issues:0
Language:Jupyter NotebookLicense:MITStargazers:2203Issues:0Issues:0

LLM-Agent-Paper-List

The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.

Stargazers:5578Issues:0Issues:0

XAgent

An Autonomous LLM Agent for Complex Task Solving

Language:PythonLicense:Apache-2.0Stargazers:7723Issues:0Issues:0

DiT

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Language:PythonLicense:NOASSERTIONStargazers:5430Issues:0Issues:0

minbpe

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Language:PythonLicense:MITStargazers:8531Issues:0Issues:0

LLaVA

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Language:PythonLicense:Apache-2.0Stargazers:17375Issues:0Issues:0

MiniCPM

MiniCPM-2B: An end-side LLM outperforming Llama2-13B.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:4246Issues:0Issues:0

waymax

A JAX-based simulator for autonomous driving research.

Language:PythonLicense:NOASSERTIONStargazers:785Issues:0Issues:0

llama.cpp

LLM inference in C/C++

Language:C++License:MITStargazers:60039Issues:0Issues:0

Visualizer

assistant tools for attention visualization in deep learning

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:887Issues:0Issues:0

graphsage-simple

Simple reference implementation of GraphSAGE.

Language:PythonStargazers:977Issues:0Issues:0

awesome-ai-agents

A list of AI autonomous agents

License:NOASSERTIONStargazers:7667Issues:0Issues:0

build-your-own-x

Master programming by recreating your favorite technologies from scratch.

Stargazers:272525Issues:0Issues:0

dlpack

common in-memory tensor structure

Language:PythonLicense:Apache-2.0Stargazers:865Issues:0Issues:0

skrl

Modular reinforcement learning library (on PyTorch and JAX) with support for NVIDIA Isaac Gym, Isaac Orbit and Omniverse Isaac Gym

Language:PythonLicense:MITStargazers:427Issues:0Issues:0

madbg

A fully-featured remote and preemptive debugger for python

Language:PythonLicense:MITStargazers:232Issues:0Issues:0

PyTorch-RL

PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.

Language:PythonLicense:MITStargazers:1066Issues:0Issues:0

jax

Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more

Language:PythonLicense:Apache-2.0Stargazers:28750Issues:0Issues:0

Eureka

Official Repository for "Eureka: Human-Level Reward Design via Coding Large Language Models" (ICLR 2024)

Language:Jupyter NotebookLicense:MITStargazers:2703Issues:0Issues:0

torchbeast

A PyTorch Platform for Distributed RL

Language:PythonLicense:Apache-2.0Stargazers:735Issues:0Issues:0

End-to-end-Autonomous-Driving

All you need for End-to-end Autonomous Driving

License:MITStargazers:1572Issues:0Issues:0

Awesome-Embodied-Agent-with-LLMs

This is a curated list of "Embodied AI or robot with Large Language Models" research. Watch this repository for the latest updates!

Stargazers:646Issues:0Issues:0

gpt-engineer

Specify what you want it to build, the AI asks for clarification, and then builds it.

Language:PythonLicense:MITStargazers:51021Issues:0Issues:0
Language:PythonLicense:MITStargazers:19516Issues:0Issues:0
License:Apache-2.0Stargazers:797Issues:0Issues:0