Tokarev Igor's repositories

awesome-llm-rl-agents

List of sources related to llms, transformers and reinforcement learning agents

agents

An Open-source Framework for Autonomous Language Agents

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

ARP

Procgen Experiments of "Guide Your Agent with Adaptive Multimodal Rewards"

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

Auto-GPT

An experimental open-source attempt to make GPT-4 fully autonomous.

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

csle

A research platform to develop automated security policies using quantitative methods, e.g. optimal control, computational game theory, reinforcement learning, optimization, evolutionary methods, and causal inference.

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0
Language:Jupyter NotebookLicense:MITStargazers:0Issues:0Issues:0

DIFUSCO

Code of NeurIPS paper: arxiv.org/abs/2302.08224

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

dynalang

Code for "Learning to Model the World with Language."

Language:PythonStargazers:0Issues:0Issues:0

ember

Elastic Malware Benchmark for Empowering Researchers

Language:Jupyter NotebookLicense:NOASSERTIONStargazers:0Issues:0Issues:0

garak

LLM vulnerability scanner

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

generative_agents

Generative Agents: Interactive Simulacra of Human Behavior

License:Apache-2.0Stargazers:0Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0

lamorel

Lamorel is a Python library designed for RL practitioners eager to use Large Language Models (LLMs).

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

LAPO

Code for the ICLR 2024 spotlight paper: "Learning to Act without Actions" (introducing Latent Action Policies)

Stargazers:0Issues:0Issues:0

llama-recipes

Examples and recipes for Llama 2 model

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

llama_generative_agent

A generative agent implementation for LLaMA based models, derived from langchain's implementation.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:0Issues:0

llm.c

LLM training in simple, raw C/CUDA

Language:CudaLicense:MITStargazers:0Issues:0Issues:0

metasploit-framework

Metasploit Framework

Language:RubyLicense:NOASSERTIONStargazers:0Issues:0Issues:0

Minigrid

Simple and easily configurable grid world environments for reinforcement learning

Language:PythonLicense:NOASSERTIONStargazers:0Issues:0Issues:0

mistral-src

Reference implementation of Mistral AI 7B v0.1 model.

Language:PythonLicense:Apache-2.0Stargazers:0Issues:0Issues:0
Language:Jupyter NotebookStargazers:0Issues:0Issues:0

pipegoose

Megatron-LM 3D parallelism for 🤗 transformers model *(still work in progress)*

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

reflexion

Reflexion: Language Agents with Verbal Reinforcement Learning

Language:PythonLicense:MITStargazers:0Issues:0Issues:0

rulm

Language modeling and instruction tuning for Russian

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:0Issues:0Issues:0

secml_malware

Create adversarial attacks against machine learning Windows malware detectors

Language:PythonLicense:GPL-3.0Stargazers:0Issues:0Issues:0
Language:JavaScriptLicense:MITStargazers:0Issues:1Issues:0
Language:PythonLicense:MITStargazers:0Issues:0Issues:0

wapiti

Web vulnerability scanner written in Python3

Language:PythonLicense:GPL-2.0Stargazers:0Issues:0Issues:0