Samyak Parajuli's repositories
Language: Jupyter Notebook
Language: Python
Language: Jupyter Notebook
emergent-language
An implementation of Emergence of Grounded Compositional Language in Multi-Agent Populations by Igor Mordatch and Pieter Abbeel
Language: Python
flow-project.github.io
The website for the flow-project repo
Language: CSS
h-baselines
A repository of high-performing hierarchical reinforcement learning models and algorithms.
License: MIT
Language: Jupyter Notebook
nodejs-docs-samples
Node.js samples for Google Cloud Platform products.
Language: JavaScript, License: Apache-2.0
Language: Java
Language: Python
Language: Shell
train-procgen
Code for the paper "Leveraging Procedural Generation to Benchmark Reinforcement Learning"
License: MIT
trlx
A repo for distributed training of language models with Reinforcement Learning from Human Feedback (RLHF)
Language: Python, License: MIT
Language: Python, License: MIT
Language: HTML