tokarev-i-v

Tokarev Igor's repositories

awesome-llm-rl-agents

A list of sources related to LLMs, transformers, and reinforcement learning agents

1 20

agents

An Open-source Framework for Autonomous Language Agents

Language: Python | License: Apache-2.0 | 0 0 0

ARP

Procgen Experiments of "Guide Your Agent with Adaptive Multimodal Rewards"

Language: Python | License: MIT | 0 0 0

Auto-GPT

An experimental open-source attempt to make GPT-4 fully autonomous.

Language: Python | License: MIT | 0 0 0

csle

A research platform to develop automated security policies using quantitative methods, e.g. optimal control, computational game theory, reinforcement learning, optimization, evolutionary methods, and causal inference.

Language: Python | License: NOASSERTION | 0 0 0

deep-learning-pytorch-huggingface

Language: Jupyter Notebook | License: MIT | 0 0 0

DIFUSCO

Code of NeurIPS paper: arxiv.org/abs/2302.08224

Language: Python | License: MIT | 0 0 0

dynalang

Code for "Learning to Model the World with Language."

Language: Python | 0 0 0

ember

Elastic Malware Benchmark for Empowering Researchers

Language: Jupyter Notebook | License: NOASSERTION | 0 0 0

garak

LLM vulnerability scanner

Language: Python | License: Apache-2.0 | 0 0 0

generative_agents

Generative Agents: Interactive Simulacra of Human Behavior

License: Apache-2.0 | 0 0 0

JoTR

Language: Python | License: Apache-2.0 | 0 0 0

lamorel

Lamorel is a Python library designed for RL practitioners eager to use Large Language Models (LLMs).

Language: Python | License: MIT | 0 0 0

LAPO

Code for the ICLR 2024 spotlight paper: "Learning to Act without Actions" (introducing Latent Action Policies)

0 0 0

llama-recipes

Examples and recipes for the Llama 2 model

Language: Python | License: NOASSERTION | 0 0 0

llama_generative_agent

A generative agent implementation for LLaMA based models, derived from langchain's implementation.

Language: Jupyter Notebook | License: Apache-2.0 | 0 0 0

llm.c

LLM training in simple, raw C/CUDA

Language: Cuda | License: MIT | 0 0 0

metasploit-framework

Metasploit Framework

Language: Ruby | License: NOASSERTION | 0 0 0

Minigrid

Simple and easily configurable grid world environments for reinforcement learning

Language: Python | License: NOASSERTION | 0 0 0

mistral-src

Reference implementation of the Mistral AI 7B v0.1 model.

Language: Python | License: Apache-2.0 | 0 0 0

mistral_finetune_notebooks

Language: Jupyter Notebook | 0 0 0

pipegoose

Megatron-LM 3D parallelism for 🤗 transformers model *(still work in progress)*

Language: Python | License: MIT | 0 0 0

reflexion

Reflexion: Language Agents with Verbal Reinforcement Learning

Language: Python | License: MIT | 0 0 0

rulm

Language modeling and instruction tuning for Russian

Language: Jupyter Notebook | License: Apache-2.0 | 0 0 0

secml_malware

Create adversarial attacks against machine learning Windows malware detectors

Language: Python | License: GPL-3.0 | 0 0 0

tokarev-i-v.github.io

My site

Language: JavaScript | License: MIT | 0 1 0

TransformerLens

Language: Python | License: MIT | 0 0 0

wapiti

Web vulnerability scanner written in Python3

Language: Python | License: GPL-2.0 | 0 0 0

awesome-llm-rl-agents

agents

awesome-ml-cybersecurity

alphageometry

ARP

Auto-GPT

csle

deep-learning-pytorch-huggingface

DIFUSCO

dynalang

ember

garak

generative_agents

JoTR

lamorel

LAPO

llama-recipes

llama_generative_agent

llm.c

metasploit-framework

Minigrid

mistral-src

mistral_finetune_notebooks

pipegoose

reflexion

rulm

secml_malware

tokarev-i-v.github.io

TransformerLens

wapiti