ZhuoranYang

ZhuoranYang's repositories

Python-for-Signal-Processing

Notebooks for "Python for Signal Processing" book

Language: Python | License: NOASSERTION | 3 20

awesome-courses

List of awesome university courses for learning Computer Science!

2 20

Algorithms-1

Data Structures and Algorithms in Python

Language: Python | License: WTFPL | 1 20

baselines

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

Language: Python | License: MIT | 1 20

soft_dqn

Soft DQN algorithm

Language: Python | 1 20

algforopt-notebooks

Jupyter notebooks associated with the Algorithms for Optimization textbook

Language: Jupyter Notebook | License: NOASSERTION | 000

algorithms

Algorithms & Data Structures in C++

Language: C++ | License: MIT | 020

algorithms-2

Algorithms & Data Structures in Go

Language: Go | License: NOASSERTION | 020

cpo

Constrained Policy Optimization

Language: Python | 020

Dshell

Dshell is a network forensic analysis framework.

Language: Python | License: NOASSERTION | 020

few-shot-cot

Experiments with few-shot CoT and ICL. Modified from "Automatic Chain of Thought Prompting in Large Language Models" (stay tuned; more updates to come)

License: Apache-2.0 | 000

mmp

Implementation of some Reinforcement Learning algorithms

Language: Python | License: MIT | 020

neural-style

Torch implementation of neural style algorithm

Language: Lua | License: MIT | 020

reinforcement-learning

Minimal and Clean Reinforcement Learning Examples

Language: Python | License: MIT | 020

Reinforcement-Learning-Algorithms

These implementations show the convergence and performance of the policy iteration and value iteration algorithms, and how their convergence to the optimal value function depends on the number of iterations used. I have also implemented the on-policy SARSA and off-policy Q-learning algorithms and shown how their performance depends on the exploration-exploitation tradeoff and on the learning rate. The experiments were evaluated on benchmark reinforcement learning tasks (smallworld, gridworld, and cliffworld MDPs) to analyze the performance of the algorithms.

Language: MATLAB | 010

Stein-Variational-Gradient-Descent

Code for the paper "Stein Variational Gradient Descent (SVGD): A General Purpose Bayesian Inference Algorithm"

Language: Python | License: MIT | 020

tdlearn

Some common TD learning algorithms

Language: Python | 020

v120

Proceedings of Learning for Dynamics and Control

Language: TeX | 000

zero_shot_few_shot_cot

Zero-Shot and Few-Shot CoT and ICL

Language: Python | 000

zhuoranyang.github.io

Academic Website of Zhuoran Yang

Language: HTML | License: MIT | 000