Yawen Duan's repositories

Logistic-Regression-on-MNIST-with-NumPy-from-Scratch

Implementing Logistic Regression on MNIST dataset from scratch

Language:PythonLicense:MITStargazers:23Issues:0Issues:0

TransNASBench

This repository contains the publishable code for CVPR 2021 paper TransNAS-Bench-101: Improving Transferrability and Generalizability of Cross-Task Neural Architecture Search.

Language:PythonLicense:MITStargazers:21Issues:2Issues:6

DQN_Family_PyTorch

This is a repository of DQN and its variants implementation in PyTorch based on the original papar.

Language:PythonStargazers:12Issues:1Issues:0

Neural-Network-on-MNIST-with-NumPy-from-Scratch

Implement and train a neural network from scratch in Python for the MNIST dataset (no PyTorch).

Language:PythonLicense:MITStargazers:12Issues:0Issues:0

dmc2gym

OpenAI Gym wrapper for the DeepMind Control Suite

Language:PythonLicense:MITStargazers:1Issues:0Issues:0

MNIST-with-CNN-from-Scratch

Implement and train a CNN from scratch in Python for the MNIST dataset (no PyTorch).

Language:PythonStargazers:1Issues:0Issues:0

TensorFlow-Core

This is the learning notes and mini projects by TensorFlow learning based on "TensorFlow 1.x Deep Learning Cookbook" by Antonio Gulli & Amita Kapoor.

Language:Jupyter NotebookStargazers:1Issues:1Issues:0
Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

Causality4NLP_Papers

A reading list for papers on causality for natural language processing (NLP)

Stargazers:0Issues:0Issues:0

CIFAR10-Classifier-CNN-in-PyTorch

Implement and train a CNN classifier in PyTorch for the CIFAR10 dataset

Language:Jupyter NotebookStargazers:0Issues:0Issues:0
Language:Jupyter NotebookStargazers:0Issues:0Issues:0

dm_control

DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

flow

Computational framework for reinforcement learning in traffic control

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

hh-rlhf

Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"

License:MITStargazers:0Issues:0Issues:0

KataGo

GTP engine and self-play learning in Go

Language:C++License:NOASSERTIONStargazers:0Issues:0Issues:0

Key-Paper-Summary-in-DRL

This repository contains the summaries of around 100 key papers on deep reinforcement learning listed in on OpenAI Spinning Up.

Stargazers:0Issues:2Issues:0
Language:Jupyter NotebookStargazers:0Issues:0Issues:0

Minigrid

Simple and easily configurable grid world environments for reinforcement learning

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

MLMI

MPhil Machine Learning and Machine Intelligence @ University of Cambridge

Stargazers:0Issues:0Issues:0

mphil-intro-module

Jupyter notebooks on inference, regression and classification for MPhil students

Language:Jupyter NotebookStargazers:0Issues:0Issues:0

OpenAI_DeepRL_Spinning_Up

This is a repository of study materials, implementation codes and notes on OpenAI Spinning Up in Deep Reinforcement Learning.

Language:PythonStargazers:0Issues:0Issues:0

pytorch-tutorial

PyTorch Tutorial for Deep Learning Researchers

Language:PythonLicense:MITStargazers:0Issues:0Issues:0