simonsays1980's starred repositories

Language:PythonStargazers:13Issues:0Issues:0

noah-pufferlib

Simplifying reinforcement learning for complex game environments

License:MITStargazers:2Issues:0Issues:0

fabric

fabric is an open-source framework for augmenting humans using AI. It provides a modular framework for solving specific problems using a crowdsourced set of AI prompts that can be used anywhere.

Language:PythonLicense:MITStargazers:20010Issues:0Issues:0
Language:PythonStargazers:37Issues:0Issues:0

morl-baselines

Multi-Objective Reinforcement Learning algorithms implementations.

Language:PythonLicense:MITStargazers:255Issues:0Issues:0

PufferLib

Simplifying reinforcement learning for complex game environments

Language:PythonLicense:MITStargazers:718Issues:0Issues:0

multi-agent-emergence-environments

Environment generation code for the paper "Emergent Tool Use From Multi-Agent Autocurricula"

Language:PythonLicense:MITStargazers:1615Issues:0Issues:0
Language:PythonLicense:BSD-3-ClauseStargazers:126Issues:0Issues:0

llm-twin-course

🤖 𝗟𝗲𝗮𝗿𝗻 for 𝗳𝗿𝗲𝗲 how to 𝗯𝘂𝗶𝗹𝗱 an end-to-end 𝗽𝗿𝗼𝗱𝘂𝗰𝘁𝗶𝗼𝗻-𝗿𝗲𝗮𝗱𝘆 𝗟𝗟𝗠 & 𝗥𝗔𝗚 𝘀𝘆𝘀𝘁𝗲𝗺 using 𝗟𝗟𝗠𝗢𝗽𝘀 best practices: ~ 𝘴𝘰𝘶𝘳𝘤𝘦 𝘤𝘰𝘥𝘦 + 12 𝘩𝘢𝘯𝘥𝘴-𝘰𝘯 𝘭𝘦𝘴𝘴𝘰𝘯𝘴

Language:PythonLicense:MITStargazers:2093Issues:0Issues:0

neuromancer

Pytorch-based framework for solving parametric constrained optimization problems, physics-informed system identification, and parametric model predictive control.

Language:PythonLicense:NOASSERTIONStargazers:834Issues:0Issues:0

lectures

Material for cuda-mode lectures

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:2025Issues:0Issues:0

tox-uv

Use https://github.com/astral-sh/uv with tox

Language:PythonLicense:MITStargazers:48Issues:0Issues:0
Language:PythonLicense:MITStargazers:58Issues:0Issues:0

CityLearn

Official reinforcement learning environment for demand response and load shaping

Language:PythonLicense:MITStargazers:455Issues:0Issues:0

nvtop

GPU & Accelerator process monitoring for AMD, Apple, Huawei, Intel, NVIDIA and Qualcomm

Language:CLicense:NOASSERTIONStargazers:7826Issues:0Issues:0

minbpe

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Language:PythonLicense:MITStargazers:8833Issues:0Issues:0

trl

Train transformer language models with reinforcement learning.

Language:PythonLicense:Apache-2.0Stargazers:8925Issues:0Issues:0

speedscope

🔬 A fast, interactive web-based viewer for performance profiles.

Language:TypeScriptLicense:MITStargazers:5386Issues:0Issues:0

flow

Computational framework for reinforcement learning in traffic control

Language:PythonLicense:MITStargazers:1048Issues:0Issues:0

xgboost_ray

Distributed XGBoost on Ray

Language:PythonLicense:Apache-2.0Stargazers:134Issues:0Issues:0

DeepNetSlice

Reinforcement Learning tool for Network Slice Placement problems

Language:PythonStargazers:20Issues:0Issues:0

Syllabus

Synchronized Curriculum Learning for RL Agents

Language:PythonLicense:MITStargazers:21Issues:0Issues:0

OCTIS

OCTIS: Comparing Topic Models is Simple! A python package to optimize and evaluate topic models (accepted at EACL2021 demo track)

Language:PythonLicense:MITStargazers:707Issues:0Issues:0

CFN

Accompanying Code for "Flipping Coins to Estimate Pseudocounts for Exploration in Reinforcement Learning", ICML 2023

Language:PythonLicense:Apache-2.0Stargazers:15Issues:0Issues:0

audiocraft

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

Language:PythonLicense:MITStargazers:20347Issues:0Issues:0

DRL-for-Pick-and-Place-Task-subtasks

A multi-subtask reinforcement learning method where complex tasks can be decomposed into low-level subtasks.

Language:PythonLicense:MITStargazers:26Issues:0Issues:0

pysparklines

Python clone of @holman's spark

Language:PythonLicense:BSD-2-ClauseStargazers:50Issues:0Issues:0

ray-llm

RayLLM - LLMs on Ray

Language:PythonLicense:Apache-2.0Stargazers:1208Issues:0Issues:0

sd

Intuitive find & replace CLI (sed alternative)

Language:RustLicense:MITStargazers:5626Issues:0Issues:0

the-algorithm

Source code for Twitter's Recommendation Algorithm

Language:ScalaLicense:AGPL-3.0Stargazers:61832Issues:0Issues:0