yawen-d

Yawen Duan's repositories

Logistic-Regression-on-MNIST-with-NumPy-from-Scratch

Implementing Logistic Regression on MNIST dataset from scratch

Language:PythonMIT2300

TransNASBench

This repository contains the publishable code for CVPR 2021 paper TransNAS-Bench-101: Improving Transferrability and Generalizability of Cross-Task Neural Architecture Search.

Language:PythonMIT21 2 6

DQN_Family_PyTorch

This is a repository of DQN and its variants implementation in PyTorch based on the original papar.

Language:Python12 10

Neural-Network-on-MNIST-with-NumPy-from-Scratch

Implement and train a neural network from scratch in Python for the MNIST dataset (no PyTorch).

Language:PythonMIT1200

dmc2gym

OpenAI Gym wrapper for the DeepMind Control Suite

Language:PythonMIT100

Generative_Adversarial_Networks-in-PyTorch

Language:Python100

MNIST-with-CNN-from-Scratch

Implement and train a CNN from scratch in Python for the MNIST dataset (no PyTorch).

Language:Python100

TensorFlow-Core

This is the learning notes and mini projects by TensorFlow learning based on "TensorFlow 1.x Deep Learning Cookbook" by Antonio Gulli & Amita Kapoor.

Language:Jupyter Notebook1 10

academic-website

Language:Jupyter NotebookMIT000

Causality4NLP_Papers

A reading list for papers on causality for natural language processing (NLP)

000

CIFAR10-Classifier-CNN-in-PyTorch

Implement and train a CNN classifier in PyTorch for the CIFAR10 dataset

Language:Jupyter Notebook000

CIFAR100-ResNet-PyTorch

Language:Jupyter Notebook000

dm_control

DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.

Language:PythonApache-2.0000

flow

Computational framework for reinforcement learning in traffic control

Language:PythonMIT000

hh-rlhf

Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"

MIT000

KataGo

GTP engine and self-play learning in Go

Language:C++NOASSERTION000

Key-Paper-Summary-in-DRL

This repository contains the summaries of around 100 key papers on deep reinforcement learning listed in on OpenAI Spinning Up.

020

Logistic-Regression-Explained

Language:Jupyter Notebook000

Minigrid

Simple and easily configurable grid world environments for reinforcement learning

Language:PythonNOASSERTION000

MLMI

MPhil Machine Learning and Machine Intelligence @ University of Cambridge

000

mphil-intro-module

Jupyter notebooks on inference, regression and classification for MPhil students

Language:Jupyter Notebook000

OpenAI_DeepRL_Spinning_Up

This is a repository of study materials, implementation codes and notes on OpenAI Spinning Up in Deep Reinforcement Learning.

Language:Python000

pytorch-tutorial

PyTorch Tutorial for Deep Learning Researchers

Language:PythonMIT000