Beast code in Giters

simonsays1980's starred repositories

noah-pufferlib

Simplifying reinforcement learning for complex game environments

MIT200

fabric

fabric is an open-source framework for augmenting humans using AI. It provides a modular framework for solving specific problems using a crowdsourced set of AI prompts that can be used anywhere.

Language:PythonMIT2001000

morl-baselines

Multi-Objective Reinforcement Learning algorithms implementations.

Language:PythonMIT25500

PufferLib

Simplifying reinforcement learning for complex game environments

Language:PythonMIT71800

multi-agent-emergence-environments

Environment generation code for the paper "Emergent Tool Use From Multi-Agent Autocurricula"

Language:PythonMIT161500

llm-twin-course

🤖 𝗟𝗲𝗮𝗿𝗻 for 𝗳𝗿𝗲𝗲 how to 𝗯𝘂𝗶𝗹𝗱 an end-to-end 𝗽𝗿𝗼𝗱𝘂𝗰𝘁𝗶𝗼𝗻-𝗿𝗲𝗮𝗱𝘆 𝗟𝗟𝗠 & 𝗥𝗔𝗚 𝘀𝘆𝘀𝘁𝗲𝗺 using 𝗟𝗟𝗠𝗢𝗽𝘀 best practices: ~ 𝘴𝘰𝘶𝘳𝘤𝘦 𝘤𝘰𝘥𝘦 + 12 𝘩𝘢𝘯𝘥𝘴-𝘰𝘯 𝘭𝘦𝘴𝘴𝘰𝘯𝘴

Language:PythonMIT209300

neuromancer

Pytorch-based framework for solving parametric constrained optimization problems, physics-informed system identification, and parametric model predictive control.

Language:PythonNOASSERTION83400

lectures

Material for cuda-mode lectures

Language:Jupyter NotebookApache-2.0202500

tox-uv

Use https://github.com/astral-sh/uv with tox

Language:PythonMIT4800

CityLearn

Official reinforcement learning environment for demand response and load shaping

Language:PythonMIT45500

nvtop

GPU & Accelerator process monitoring for AMD, Apple, Huawei, Intel, NVIDIA and Qualcomm

Language:CNOASSERTION782600

minbpe

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Language:PythonMIT883300

trl

Train transformer language models with reinforcement learning.

Language:PythonApache-2.0892500

speedscope

🔬 A fast, interactive web-based viewer for performance profiles.

Language:TypeScriptMIT538600

flow

Computational framework for reinforcement learning in traffic control

Language:PythonMIT104800

xgboost_ray

Distributed XGBoost on Ray

Language:PythonApache-2.013400

DeepNetSlice

Reinforcement Learning tool for Network Slice Placement problems

Language:Python2000

Syllabus

Synchronized Curriculum Learning for RL Agents

Language:PythonMIT2100

OCTIS

OCTIS: Comparing Topic Models is Simple! A python package to optimize and evaluate topic models (accepted at EACL2021 demo track)

Language:PythonMIT70700

CFN

Accompanying Code for "Flipping Coins to Estimate Pseudocounts for Exploration in Reinforcement Learning", ICML 2023

Language:PythonApache-2.01500

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

Language:PythonMIT2034700

simonsays1980

simonsays1980's starred repositories

JaxLife

noah-pufferlib

fabric

rlbase_stable

morl-baselines

PufferLib

multi-agent-emergence-environments

unitree_rl_gym

llm-twin-course

neuromancer

lectures

tox-uv

gigastep

CityLearn

nvtop

minbpe

trl

speedscope

flow

xgboost_ray

DeepNetSlice

Syllabus

OCTIS

CFN

audiocraft

DRL-for-Pick-and-Place-Task-subtasks

pysparklines

ray-llm

sd

the-algorithm