Mahendra Kutare (imaxxs)

imaxxs

Geek Repo

Company:DeepTrail

Location:San Francisco

Home Page:https://www.linkedin.com/in/imaxxs/

Twitter:@imaxxs

Github PK Tool:Github PK Tool

Mahendra Kutare's starred repositories

CoT-Igniting-Agent

This repository contains the paper list for the paper: Igniting Language Intelligence: The Hitchhiker's Guide From Chain-of-Thought Reasoning to Language Agents

Stargazers:323Issues:0Issues:0

AbuseIPDB-Checker

Python script that use AbuseIPDB API to check IP reputation for threats. Supports both command line and GUI interfaces. Input options include single IP, subnet, or file. Generates detailed reports and is configurable via settings file

Language:PythonLicense:MITStargazers:7Issues:0Issues:0

alerting-detection-strategy-framework

A framework for developing alerting and detection strategies for incident response.

License:MITStargazers:644Issues:0Issues:0

hook0

Open-source webhook server that helps you provide webhooks to your users. It handles for you a great amount of features that are usually tedious to (re)implement.

Language:RustLicense:NOASSERTIONStargazers:553Issues:0Issues:0

neural_nets_research

A collection of projects for neural nets research in Mehdi's research group.

Language:Jupyter NotebookLicense:MITStargazers:7Issues:0Issues:0

dqn-multi-agent-rl

Deep Q-learning (DQN) for Multi-agent Reinforcement Learning (RL)

Language:PythonLicense:MITStargazers:299Issues:0Issues:0

Multi-agent-reinforcement-learning

Implementation of Multi-Agent Reinforcement Learning algorithm(s). Currently includes: MADDPG

Language:PythonLicense:MITStargazers:63Issues:0Issues:0

Multi-Agent-Reinforcement-Learning

PyTorch implements multi-agent reinforcement learning algorithms, including QMIX, Independent PPO, Centralized PPO, Grid Wise Control, Grid Wise Control+PPO, Grid Wise Control+DDPG.

Language:PythonLicense:MITStargazers:183Issues:0Issues:0

off-policy

PyTorch implementations of popular off-policy multi-agent reinforcement learning algorithms, including QMix, VDN, MADDPG, and MATD3.

Language:PythonLicense:MITStargazers:381Issues:0Issues:0

MARL-Papers

Paper list of multi-agent reinforcement learning (MARL)

Stargazers:3918Issues:0Issues:0

AgileRL

Streamlining reinforcement learning with RLOps. State-of-the-art RL algorithms and tools.

Language:PythonLicense:Apache-2.0Stargazers:561Issues:0Issues:0

Mava

🦁 A research-friendly codebase for fast experimentation of multi-agent reinforcement learning in JAX

Language:PythonLicense:Apache-2.0Stargazers:682Issues:0Issues:0

maro

Multi-Agent Resource Optimization (MARO) platform is an instance of Reinforcement Learning as a Service (RaaS) for real-world resource optimization problems.

Language:PythonLicense:MITStargazers:836Issues:0Issues:0

chatarena

ChatArena (or Chat Arena) is a Multi-Agent Language Game Environments for LLMs. The goal is to develop communication and collaboration capabilities of AIs.

Language:PythonLicense:Apache-2.0Stargazers:1313Issues:0Issues:0

rl

A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.

Language:PythonLicense:MITStargazers:2144Issues:0Issues:0

MARL-papers-with-code

Multi-Agent Reinforcement Learning (MARL) papers with code

Stargazers:287Issues:0Issues:0

stable-baselines3

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Language:PythonLicense:MITStargazers:8565Issues:0Issues:0

pr-agent

🚀CodiumAI PR-Agent: An AI-Powered 🤖 Tool for Automated Pull Request Analysis, Feedback, Suggestions and More! 💻🔍

Language:PythonLicense:Apache-2.0Stargazers:5302Issues:0Issues:0

ipex-llm

Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, Baichuan, Mixtral, Gemma, Phi, MiniCPM, etc.) on Intel CPU and GPU (e.g., local PC with iGPU, discrete GPU such as Arc, Flex and Max); seamlessly integrate with llama.cpp, Ollama, HuggingFace, LangChain, LlamaIndex, GraphRAG, DeepSpeed, vLLM, FastChat, Axolotl, etc.

Language:PythonLicense:Apache-2.0Stargazers:6394Issues:0Issues:0

vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Language:PythonLicense:Apache-2.0Stargazers:24862Issues:0Issues:0

reflexion

[NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learning

Language:PythonLicense:MITStargazers:2223Issues:0Issues:0

LanguageAgentTreeSearch

[ICML 2024] Official repository for "Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models"

Language:PythonLicense:MITStargazers:569Issues:0Issues:0

OpenDevin

🐚 OpenDevin: Code Less, Make More

Language:PythonLicense:MITStargazers:30116Issues:0Issues:0

devika

Devika is an Agentic AI Software Engineer that can understand high-level human instructions, break them down into steps, research relevant information, and write code to achieve the given objective. Devika aims to be a competitive open-source alternative to Devin by Cognition AI.

Language:PythonLicense:MITStargazers:18124Issues:0Issues:0

eyeballer

Convolutional neural network for analyzing pentest screenshots

Language:PythonLicense:GPL-3.0Stargazers:1020Issues:0Issues:0

DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Language:PythonLicense:Apache-2.0Stargazers:34339Issues:0Issues:0

DeepSeek-Coder

DeepSeek Coder: Let the Code Write Itself

Language:PythonLicense:MITStargazers:6341Issues:0Issues:0

kauldron

Modular, scalable codebase to train ML models

Language:PythonLicense:Apache-2.0Stargazers:34Issues:0Issues:0

prompt-tuning

Original Implementation of Prompt Tuning from Lester, et al, 2021

Language:PythonLicense:Apache-2.0Stargazers:639Issues:0Issues:0

cloud

Cloud instance management for deep learning applications.

Language:PythonLicense:GPL-3.0Stargazers:38Issues:0Issues:0