Mykhaïlo Lytvynenko's starred repositories

Language:PythonStargazers:87Issues:0Issues:0

bootstrap_dqn

Implementation of Bootstrap DQN and Randomized Prior Functions on ALE

Language:PythonStargazers:51Issues:0Issues:0

MARL-Algorithms

Implementations of IQL, QMIX, VDN, COMA, QTRAN, MAVEN, CommNet, DyMA-CL, and G2ANet on SMAC, the decentralised micromanagement scenario of StarCraft II

Language:PythonStargazers:1368Issues:0Issues:0

bootsrapped-dqn

This is pytorch implmentation project of Bootsrapped DQN

Language:PythonLicense:Apache-2.0Stargazers:7Issues:0Issues:0

coach

Reinforcement Learning Coach by Intel AI Lab enables easy experimentation with state of the art Reinforcement Learning algorithms

Language:PythonLicense:Apache-2.0Stargazers:2318Issues:0Issues:0

LLMsPracticalGuide

A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)

Stargazers:8992Issues:0Issues:0

LLM-RL-Papers

Monitoring recent cross-research on LLM & RL on arXiv for control. If there are good papers, PRs are welcome.

Stargazers:41Issues:0Issues:0

LLM-Prompt-Library

Advanced Code and Text Manipulation Prompts for Various LLMs. Suitable for Siri, GPT-4o, Claude, Llama3, Gemini, and other high-performance open-source LLMs.

Stargazers:435Issues:0Issues:0

rl

A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.

Language:PythonLicense:MITStargazers:2004Issues:0Issues:0

langgraph

Build resilient language agents as graphs.

Language:PythonLicense:MITStargazers:4266Issues:0Issues:0
Language:Jupyter NotebookStargazers:687Issues:0Issues:0

langchain-tutorials

Overview and tutorial of the LangChain Library

Language:Jupyter NotebookStargazers:6499Issues:0Issues:0

prompt-engineering

Tips and tricks for working with Large Language Models like OpenAI's GPT-4.

License:MITStargazers:8176Issues:0Issues:0

Awesome-LLMOps

An awesome & curated list of best LLMOps tools for developers

Language:ShellLicense:CC0-1.0Stargazers:3402Issues:0Issues:0

nui_in_madrl

Negative Update Intervals in Multi-Agent Deep Reinforcement Learning

Language:PythonLicense:GPL-3.0Stargazers:32Issues:0Issues:0

DI-engine

OpenDILab Decision AI Engine

Language:PythonLicense:Apache-2.0Stargazers:2790Issues:0Issues:0

pymarl

Python Multi-Agent Reinforcement Learning framework

Language:PythonLicense:Apache-2.0Stargazers:1771Issues:0Issues:0

MAProj

Multi-agent project (commnet, bicnet, maddpg) in pytorch for Multi-Agent Particle Environment

Language:PythonStargazers:108Issues:0Issues:0

generative-ai

Sample code and notebooks for Generative AI on Google Cloud, with Gemini on Vertex AI

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:6276Issues:0Issues:0

training-data-analyst

Labs and demos for courses for GCP Training (http://cloud.google.com/training).

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:7647Issues:0Issues:0

Machine-Learning-with-Python

Practice and tutorial-style notebooks covering wide variety of machine learning techniques

Language:Jupyter NotebookLicense:BSD-2-ClauseStargazers:3032Issues:0Issues:0

CQL

Conservative Q Learning on top of SAC

Language:PythonLicense:MITStargazers:116Issues:0Issues:0

pytorch_seed_rl

A PyTorch implementation of SEED, originally created by Google Research for TensorFlow 2.

Language:PythonLicense:Apache-2.0Stargazers:10Issues:0Issues:0

ConvLSTM_pytorch

Implementation of Convolutional LSTM in PyTorch.

Language:PythonLicense:MITStargazers:1895Issues:0Issues:0

CommNet

PyTorch implementation of CommNet

Language:PythonStargazers:36Issues:0Issues:0

Spiking-Neural-Network-SNN-with-PyTorch-where-Backpropagation-engenders-STDP

What about coding a Spiking Neural Network using an automatic differentiation framework? In SNNs, there is a time axis and the neural network sees data throughout time, and activation functions are instead spikes that are raised past a certain pre-activation threshold. Pre-activation values constantly fades if neurons aren't excited enough.

Language:Jupyter NotebookStargazers:254Issues:0Issues:0

CommNet-Reproduced-for-Levers-Task

A pytorch implementation of commNet on the levers task from "Learning Multiagent Communication with Backpropagation" paper. Reproduced from https://github.com/facebookarchive/CommNet

Language:PythonStargazers:4Issues:0Issues:0

dqn-multi-agent-rl

Deep Q-learning (DQN) for Multi-agent Reinforcement Learning (RL)

Language:PythonLicense:MITStargazers:293Issues:0Issues:0

Self-Learning-Traffic-Lights

Master thesis Artificial Intelligence at the University of Amsterdam as intern at the municipality of Amsterdam.

Language:PythonStargazers:6Issues:0Issues:0

metalight

MetaLight: a value-based meta-reinforcement learning framework for traffic signal control

Language:PythonStargazers:36Issues:0Issues:0