lanseyege's starred repositories

DeepSeek-Coder

DeepSeek Coder: Let the Code Write Itself

Language:PythonLicense:MITStargazers:6403Issues:0Issues:0

minGPT

A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

Language:PythonLicense:MITStargazers:19731Issues:0Issues:0

Quadcopter-Controller

A prototype for a physically based quadcopter controller with an autopilot in Unity3D

Language:C#License:MITStargazers:33Issues:0Issues:0

FedML

FEDML - The unified and scalable ML library for large-scale distributed training, model serving, and federated learning. FEDML Launch, a cross-cloud scheduler, further enables running any AI jobs on any GPU cloud or on-premise cluster. Built on this library, TensorOpera AI (https://TensorOpera.ai) is your generative AI platform at scale.

Language:PythonLicense:Apache-2.0Stargazers:4132Issues:0Issues:0

Federated-Learning

联邦学习

Stargazers:984Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:164Issues:0Issues:0

smac

SMAC: The StarCraft Multi-Agent Challenge

Language:PythonLicense:MITStargazers:1062Issues:0Issues:0

angry-ai

Battle Robots Demo made with Unity Machine Learning Agents

Language:C#License:MITStargazers:124Issues:0Issues:0

ml-agents

The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents using deep reinforcement learning and imitation learning.

Language:C#License:NOASSERTIONStargazers:16813Issues:0Issues:0

GDK

Microsoft Public GDK

Language:PowerShellLicense:NOASSERTIONStargazers:1497Issues:0Issues:0

vehicle-motion-forecasting

A PyTorch-based deep inverse reinforcement learning pipeline for vehicle motion forecasting

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:73Issues:0Issues:0

HRL-for-combinatorial-optimization

Hierarchical deep reinforcement learning for combinatorial optimization problem

Language:Jupyter NotebookStargazers:32Issues:0Issues:0

CQL

Code for conservative Q-learning

Language:PythonStargazers:387Issues:0Issues:0

learning-tsp

Code for the paper 'Learning TSP Requires Rethinking Generalization' (CP 2021)

Language:Jupyter NotebookLicense:MITStargazers:202Issues:0Issues:0

graph_comb_opt

Implementation of "Learning Combinatorial Optimization Algorithms over Graphs"

Language:C++License:MITStargazers:486Issues:0Issues:0

attention-learn-to-route

Attention based model for learning to solve different routing problems

Language:Jupyter NotebookLicense:MITStargazers:1057Issues:0Issues:0

MATH5411

All you need for MATH5411 Advanced Probability, 2020 Fall, HKUST, Lecturer BAO Zhigang.

License:MITStargazers:49Issues:0Issues:0

GRAND

Source code and dataset of the NeurIPS 2020 paper "Graph Random Neural Network for Semi-Supervised Learning on Graphs"

Language:PythonLicense:MITStargazers:201Issues:0Issues:0

awesome-optimal-transport

A list of awesome papers and cool resources on optimal transport and its applications in general! As you will notice, this list is currently mostly focused on optimal transport for machine learning topics.

License:MITStargazers:201Issues:0Issues:0

Java-Interview

Java 面试必会 直通BAT

Stargazers:6037Issues:0Issues:0

gpsresilience

Use of taxi GPS devices as pervasive resilience sensors.

Language:PythonStargazers:29Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:96Issues:0Issues:0

CCL2020-Humor-Computation

CCL2020,“小牛杯”幽默计算任务数据发布

Stargazers:21Issues:0Issues:0

awesome-rl-competitions

List of competitions related to Reinforcement Learning

Stargazers:347Issues:0Issues:0

distribution-is-all-you-need

The basic distribution probability Tutorial for Deep Learning Researchers

Language:PythonLicense:MITStargazers:1619Issues:0Issues:0

holdem

:black_joker: OpenAI Gym No Limit Texas Hold 'em Environment for Reinforcement Learning

Language:PythonStargazers:162Issues:0Issues:0

pytorch-maml

PyTorch implementation of MAML: https://arxiv.org/abs/1703.03400

Language:Jupyter NotebookLicense:MITStargazers:553Issues:0Issues:0

MAML-Pytorch

Elegant PyTorch implementation of paper Model-Agnostic Meta-Learning (MAML)

Language:PythonLicense:MITStargazers:2281Issues:0Issues:0

safety-starter-agents

Basic constrained RL agents used in experiments for the "Benchmarking Safe Exploration in Deep Reinforcement Learning" paper.

Language:PythonLicense:MITStargazers:388Issues:0Issues:0

MAgent

A Platform for Many-Agent Reinforcement Learning

Language:PythonLicense:MITStargazers:1680Issues:0Issues:0