Samyak Parajuli's repositories

Language:JavaScriptStargazers:4Issues:0Issues:0
Language:CSSLicense:Apache-2.0Stargazers:1Issues:4Issues:0
Language:PythonLicense:Apache-2.0Stargazers:0Issues:2Issues:0
Language:Jupyter NotebookStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:0Issues:0
Language:JavaStargazers:0Issues:2Issues:0
Language:HTMLStargazers:0Issues:2Issues:0
Language:PythonStargazers:0Issues:2Issues:0
Language:Jupyter NotebookStargazers:0Issues:0Issues:0

emergent-language

An implementation of Emergence of Grounded Compositional Language in Multi-Agent Populations by Igor Mordatch and Pieter Abbeel

Language:PythonStargazers:0Issues:0Issues:0

flow-project.github.io

The website for the flow-project repo

Language:CSSStargazers:0Issues:0Issues:0
Language:JavaStargazers:0Issues:2Issues:0

h-baselines

A repository of high-performing hierarchical reinforcement learning models and algorithms.

License:MITStargazers:0Issues:0Issues:0
Language:JavaStargazers:0Issues:2Issues:0
Language:C++Stargazers:0Issues:2Issues:0
Language:Jupyter NotebookStargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:1Issues:0

nodejs-docs-samples

Node.js samples for Google Cloud Platform products.

Language:JavaScriptLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonStargazers:0Issues:2Issues:0
Language:JavaStargazers:0Issues:0Issues:0
Language:JavaScriptStargazers:0Issues:2Issues:0
Language:PythonStargazers:0Issues:0Issues:0
Language:HTMLStargazers:0Issues:2Issues:0
Language:ShellStargazers:0Issues:0Issues:0

train-procgen

Code for the paper "Leveraging Procedural Generation to Benchmark Reinforcement Learning"

License:MITStargazers:0Issues:0Issues:0

trlx

A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)

Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Language:PythonLicense:MITStargazers:0Issues:0Issues:0
Language:HTMLStargazers:0Issues:0Issues:0