ZhuoranYang's repositories

Python-for-Signal-Processing

Notebooks for "Python for Signal Processing" book

Language:PythonLicense:NOASSERTIONStargazers:3Issues:2Issues:0

awesome-courses

List of awesome university courses for learning Computer Science!

Algorithms-1

Data Structures and Algorithms in Python

Language:PythonLicense:WTFPLStargazers:1Issues:2Issues:0

baselines

OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

Language:PythonLicense:MITStargazers:1Issues:2Issues:0

soft_dqn

Soft DQN algorithm

Language:PythonStargazers:1Issues:2Issues:0

algforopt-notebooks

Jupyter notebooks associated with the Algorithms for Optimization textbook

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:0Issues:0Issues:0

algorithms

Algorithms & Data Structures in C++

Language:C++License:MITStargazers:0Issues:2Issues:0

algorithms-2

Algorithms & Data Structures in Go

Language:GoLicense:NOASSERTIONStargazers:0Issues:2Issues:0

cpo

Constrained Policy Optimization

Language:PythonStargazers:0Issues:2Issues:0

Dshell

Dshell is a network forensic analysis framework.

Language:PythonLicense:NOASSERTIONStargazers:0Issues:2Issues:0

few-shot-cot

Try few shot COT and ICL. Modified from "Automatic Chain of Thought Prompting in Large Language Models" (stay tuned & more will be updated)

License:Apache-2.0Stargazers:0Issues:0Issues:0

mmp

Implimentation of some Reinforcement Learning algorithms

Language:PythonLicense:MITStargazers:0Issues:2Issues:0

neural-style

Torch implementation of neural style algorithm

Language:LuaLicense:MITStargazers:0Issues:2Issues:0

reinforcement-learning

Minimal and Clean Reinforcement Learning Examples

Language:PythonLicense:MITStargazers:0Issues:2Issues:0

Reinforcement-Learning-Algorithms

These implementatios shows Convergence and performance of policy and value iteration algorithms, how the convergence of these algorithms to the optimal value function depends on the number of iterations used. Furthermore, I have implemented on-policy SARSA and off-policy Q-learning algorithms and showed how the performance of these algorithms depends on the exploration-exploitation tradeoff, and on learning rates. My experiments were evaluted on benchmark reinforcement learning tasks such as a smallworld, gridworld and a cliffworld MDP to analyze the performance of our algorithms.

Language:MATLABStargazers:0Issues:1Issues:0

Stein-Variational-Gradient-Descent

code for the paper "Stein Variational Gradient Descent (SVGD): A General Purpose Bayesian Inference Algorithm"

Language:PythonLicense:MITStargazers:0Issues:2Issues:0

tdlearn

some common TD Learning algorithms

Language:PythonStargazers:0Issues:2Issues:0

v120

Proceedings of Learning for Dynamics and Control

Language:TeXStargazers:0Issues:0Issues:0

zero_shot_few_shot_cot

Zero-Shot and Few-Shot COT and ICL

Language:PythonStargazers:0Issues:0Issues:0

zhuoranyang.github.io

Academic Website of Zhuoran Yang

Language:HTMLLicense:MITStargazers:0Issues:0Issues:0