SHITIANYU (SHITIANYU-hue)


Geek Repo

Company: University of Toronto

Location: Toronto, Canada

Home Page: https://shitianyu-hue.github.io/


SHITIANYU's repositories

AI-follow

A weekly roundup of the latest papers on multimodal models, LLMs, and embodied AI

Language: Jupyter Notebook | Stargazers: 3 | Issues: 2 | Issues: 0

SUMO-changing-lane-agent

Implementation of a reinforcement learning agent that performs autonomous lane changes in SUMO

Language: Python | Stargazers: 3 | Issues: 1 | Issues: 0

agebias

Processing for an age bias dataset

DRL-robot-navigation

Deep reinforcement learning for mobile robot navigation in the ROS Gazebo simulator. Using a Twin Delayed Deep Deterministic Policy Gradient (TD3) neural network, a robot learns to navigate to a random goal point in a simulated environment while avoiding obstacles.

Language: Python | License: MIT | Stargazers: 2 | Issues: 0 | Issues: 0

sumosim

A SUMO-based simulator that supports both micro-level and macro-level control

Language: Jupyter Notebook | Stargazers: 2 | Issues: 0 | Issues: 0

SHITIANYU-hue.github.io

GitHub Pages template for academic personal websites, forked from mmistakes/minimal-mistakes

Language: JavaScript | License: MIT | Stargazers: 1 | Issues: 1 | Issues: 0

ageism-research

[ICML 2022] RankSim: Ranking Similarity Regularization for Deep Imbalanced Regression

Language: Python | License: NOASSERTION | Stargazers: 0 | Issues: 0 | Issues: 0

AgentTuning

AgentTuning: Enabling Generalized Agent Abilities for LLMs

Stargazers: 0 | Issues: 0 | Issues: 0

ChatGLM-6B

ChatGLM-6B: An Open Bilingual Dialogue Language Model

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 0 | Issues: 0

CityLearn

Official reinforcement learning environment for demand response and load shaping

Language: Python | License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

interview-assistant

Load a PDF file and ask questions via llama_index and GPT

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0

la-mbda

LAMBDA is a model-based reinforcement learning agent that uses Bayesian world models for safe policy optimization

Language: Python | License: MIT | Stargazers: 0 | Issues: 1 | Issues: 0

proximal-exploration

PyTorch implementation for our paper "Proximal Exploration for Model-guided Protein Sequence Design"

Language: Python | License: Apache-2.0 | Stargazers: 0 | Issues: 1 | Issues: 0

RL-for-MSRs

An implementation of reinforcement learning (RL) for controlling magnetic soft robots.

License: MIT | Stargazers: 0 | Issues: 0 | Issues: 0

SUMO-DVSL

A SUMO environment for differential variable speed limit control

Language: Python | Stargazers: 0 | Issues: 0 | Issues: 0