MykhaiIo

Mykhaïlo Lytvynenko's starred repositories

bootstrap_dqn

Implementation of Bootstrap DQN and Randomized Prior Functions on ALE

Language:Python5100

MARL-Algorithms

Implementations of IQL, QMIX, VDN, COMA, QTRAN, MAVEN, CommNet, DyMA-CL, and G2ANet on SMAC, the decentralised micromanagement scenario of StarCraft II

Language:Python136800

bootsrapped-dqn

This is pytorch implmentation project of Bootsrapped DQN

Language:PythonApache-2.0700

coach

Reinforcement Learning Coach by Intel AI Lab enables easy experimentation with state of the art Reinforcement Learning algorithms

Language:PythonApache-2.0231800

LLMsPracticalGuide

A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)

899200

LLM-RL-Papers

Monitoring recent cross-research on LLM & RL on arXiv for control. If there are good papers, PRs are welcome.

4100

LLM-Prompt-Library

Advanced Code and Text Manipulation Prompts for Various LLMs. Suitable for Siri, GPT-4o, Claude, Llama3, Gemini, and other high-performance open-source LLMs.

43500

rl

A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.

Language:PythonMIT200400

langgraph

Build resilient language agents as graphs.

Language:PythonMIT426600

langchain-tutorials

Overview and tutorial of the LangChain Library

Language:Jupyter Notebook649900

prompt-engineering

Tips and tricks for working with Large Language Models like OpenAI's GPT-4.

MIT817600

Awesome-LLMOps

An awesome & curated list of best LLMOps tools for developers

Language:ShellCC0-1.0340200

nui_in_madrl

Negative Update Intervals in Multi-Agent Deep Reinforcement Learning

Language:PythonGPL-3.03200

DI-engine

OpenDILab Decision AI Engine

Language:PythonApache-2.0279000

pymarl

Python Multi-Agent Reinforcement Learning framework

Language:PythonApache-2.0177100

MAProj

Multi-agent project (commnet, bicnet, maddpg) in pytorch for Multi-Agent Particle Environment

Language:Python10800

generative-ai

Sample code and notebooks for Generative AI on Google Cloud, with Gemini on Vertex AI

Language:Jupyter NotebookApache-2.0627600

training-data-analyst

Labs and demos for courses for GCP Training (http://cloud.google.com/training).

Language:Jupyter NotebookApache-2.0764700

Machine-Learning-with-Python

Practice and tutorial-style notebooks covering wide variety of machine learning techniques

Language:Jupyter NotebookBSD-2-Clause303200

CQL

Conservative Q Learning on top of SAC

Language:PythonMIT11600

pytorch_seed_rl

A PyTorch implementation of SEED, originally created by Google Research for TensorFlow 2.

Language:PythonApache-2.01000

ConvLSTM_pytorch

Implementation of Convolutional LSTM in PyTorch.

Language:PythonMIT189500

CommNet

PyTorch implementation of CommNet

Language:Python3600

Spiking-Neural-Network-SNN-with-PyTorch-where-Backpropagation-engenders-STDP

What about coding a Spiking Neural Network using an automatic differentiation framework? In SNNs, there is a time axis and the neural network sees data throughout time, and activation functions are instead spikes that are raised past a certain pre-activation threshold. Pre-activation values constantly fades if neurons aren't excited enough.

Language:Jupyter Notebook25400