Mykhaïlo Lytvynenko's starred repositories
DRL-Pytorch
Clean, Robust, and Unified PyTorch implementation of popular Deep Reinforcement Learning (DRL) algorithms (Q-learning, Duel DDQN, PER, C51, Noisy DQN, PPO, DDPG, TD3, SAC, ASL)
LLM-Assisted-Light
This repository contains the code for the paper "LLM-Assisted Light: Leveraging Large Language Model Capabilities for Human-Mimetic Traffic Signal Control in Complex Urban Environments".
bootstrap_dqn
Implementation of Bootstrap DQN and Randomized Prior Functions on ALE
MARL-Algorithms
Implementations of IQL, QMIX, VDN, COMA, QTRAN, MAVEN, CommNet, DyMA-CL, and G2ANet on SMAC, the decentralised micromanagement scenario of StarCraft II
bootsrapped-dqn
A PyTorch implementation of Bootstrapped DQN
LLMsPracticalGuide
A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)
LLM-RL-Papers
Monitoring recent cross-research on LLMs & RL on arXiv for control. PRs adding good papers are welcome.
LLM-Prompt-Library
Advanced Code and Text Manipulation Prompts for Various LLMs. Suitable for Siri, GPT-4o, Claude, Llama3, Gemini, and other high-performance open-source LLMs.
langchain-tutorials
Overview and tutorial of the LangChain Library
prompt-engineering
Tips and tricks for working with Large Language Models like OpenAI's GPT-4.
Awesome-LLMOps
An awesome, curated list of the best LLMOps tools for developers
nui_in_madrl
Negative Update Intervals in Multi-Agent Deep Reinforcement Learning
generative-ai
Sample code and notebooks for Generative AI on Google Cloud, with Gemini on Vertex AI
training-data-analyst
Labs and demos for Google Cloud Platform (GCP) training courses (http://cloud.google.com/training)
Machine-Learning-with-Python
Practice and tutorial-style notebooks covering a wide variety of machine learning techniques
pytorch_seed_rl
A PyTorch implementation of SEED, originally created by Google Research for TensorFlow 2.
ConvLSTM_pytorch
Implementation of Convolutional LSTM in PyTorch.
Spiking-Neural-Network-SNN-with-PyTorch-where-Backpropagation-engenders-STDP
What about coding a Spiking Neural Network using an automatic differentiation framework? In SNNs there is a time axis: the network sees data throughout time, and instead of conventional activation functions, neurons emit spikes when a pre-activation value crosses a threshold. Pre-activation values constantly fade unless the neurons are excited enough.
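The dynamics described above (a decaying pre-activation that spikes past a threshold) can be sketched as a minimal leaky integrate-and-fire neuron in plain Python; the function name and constants here are illustrative, not taken from the repository:

```python
def lif_neuron(inputs, decay=0.9, threshold=1.0):
    """Simulate one leaky integrate-and-fire neuron over a sequence of
    input currents. Returns a list of binary spikes (1 = spike).

    `decay` and `threshold` are illustrative hyperparameters.
    """
    potential = 0.0
    spikes = []
    for current in inputs:
        # The pre-activation constantly fades (leak) while integrating input.
        potential = decay * potential + current
        if potential >= threshold:
            spikes.append(1)
            potential = 0.0  # reset after firing
        else:
            spikes.append(0)
    return spikes

# A weak input never crosses the threshold; a final strong input does.
print(lif_neuron([0.3, 0.3, 0.3, 0.9]))  # → [0, 0, 0, 1]
```

In the repository this idea is wrapped in PyTorch tensors so that backpropagation through time (with a surrogate for the non-differentiable spike) can train the network.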
CommNet-Reproduced-for-Levers-Task
A PyTorch implementation of CommNet on the levers task from the paper "Learning Multiagent Communication with Backpropagation". Reproduced from https://github.com/facebookarchive/CommNet