imaxxs

Mahendra Kutare's starred repositories

CoT-Igniting-Agent

This repository contains the paper list for the paper: Igniting Language Intelligence: The Hitchhiker's Guide From Chain-of-Thought Reasoning to Language Agents

32300

Python script that use AbuseIPDB API to check IP reputation for threats. Supports both command line and GUI interfaces. Input options include single IP, subnet, or file. Generates detailed reports and is configurable via settings file

Language:PythonMIT700

alerting-detection-strategy-framework

A framework for developing alerting and detection strategies for incident response.

MIT64400

hook0

Open-source webhook server that helps you provide webhooks to your users. It handles for you a great amount of features that are usually tedious to (re)implement.

Language:RustNOASSERTION55300

neural_nets_research

A collection of projects for neural nets research in Mehdi's research group.

Language:Jupyter NotebookMIT700

dqn-multi-agent-rl

Deep Q-learning (DQN) for Multi-agent Reinforcement Learning (RL)

Language:PythonMIT29900

Multi-agent-reinforcement-learning

Implementation of Multi-Agent Reinforcement Learning algorithm(s). Currently includes: MADDPG

Language:PythonMIT6300

Multi-Agent-Reinforcement-Learning

PyTorch implements multi-agent reinforcement learning algorithms, including QMIX, Independent PPO, Centralized PPO, Grid Wise Control, Grid Wise Control+PPO, Grid Wise Control+DDPG.

Language:PythonMIT18300

off-policy

PyTorch implementations of popular off-policy multi-agent reinforcement learning algorithms, including QMix, VDN, MADDPG, and MATD3.

Language:PythonMIT38100

MARL-Papers

Paper list of multi-agent reinforcement learning (MARL)

391800

AgileRL

Streamlining reinforcement learning with RLOps. State-of-the-art RL algorithms and tools.

Language:PythonApache-2.056100

Mava

🦁 A research-friendly codebase for fast experimentation of multi-agent reinforcement learning in JAX

Language:PythonApache-2.068200

maro

Multi-Agent Resource Optimization (MARO) platform is an instance of Reinforcement Learning as a Service (RaaS) for real-world resource optimization problems.

Language:PythonMIT83600

chatarena

ChatArena (or Chat Arena) is a Multi-Agent Language Game Environments for LLMs. The goal is to develop communication and collaboration capabilities of AIs.

Language:PythonApache-2.0131300

rl

A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.

Language:PythonMIT214400

MARL-papers-with-code

Multi-Agent Reinforcement Learning (MARL) papers with code

28700

stable-baselines3

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Language:PythonMIT856500

pr-agent

🚀CodiumAI PR-Agent: An AI-Powered 🤖 Tool for Automated Pull Request Analysis, Feedback, Suggestions and More! 💻🔍

Language:PythonApache-2.0530200

ipex-llm

Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, Baichuan, Mixtral, Gemma, Phi, MiniCPM, etc.) on Intel CPU and GPU (e.g., local PC with iGPU, discrete GPU such as Arc, Flex and Max); seamlessly integrate with llama.cpp, Ollama, HuggingFace, LangChain, LlamaIndex, GraphRAG, DeepSpeed, vLLM, FastChat, Axolotl, etc.

Language:PythonApache-2.0639400

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonApache-2.02486200

reflexion

[NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learning

Language:PythonMIT222300

LanguageAgentTreeSearch

[ICML 2024] Official repository for "Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models"

Language:PythonMIT56900

OpenDevin

🐚 OpenDevin: Code Less, Make More

Language:PythonMIT3011600

devika

Devika is an Agentic AI Software Engineer that can understand high-level human instructions, break them down into steps, research relevant information, and write code to achieve the given objective. Devika aims to be a competitive open-source alternative to Devin by Cognition AI.

Language:PythonMIT1812400