Nathan Matare (nmatare)

nmatare

Geek Repo

Location:New York City, New York

Home Page:https://nathanmatare.com

Github PK Tool:Github PK Tool

Nathan Matare's starred repositories

dspy

DSPy: The framework for programming—not prompting—foundation models

Language:PythonLicense:MITStargazers:12697Issues:0Issues:0

submitit

Python 3.8+ toolbox for submitting jobs to Slurm

Language:PythonLicense:MITStargazers:1144Issues:0Issues:0

tensordict

TensorDict is a pytorch dedicated tensor container.

Language:PythonLicense:MITStargazers:612Issues:0Issues:0
License:Apache-2.0Stargazers:857Issues:0Issues:0

mamba

Mamba SSM architecture

Language:PythonLicense:Apache-2.0Stargazers:10990Issues:0Issues:0

ml_collections

ML Collections is a library of Python Collections designed for ML use cases.

Language:PythonLicense:Apache-2.0Stargazers:848Issues:0Issues:0

ml-engineering

Machine Learning Engineering Open Book

Language:PythonLicense:CC-BY-SA-4.0Stargazers:10064Issues:0Issues:0

JaxMARL

Multi-Agent Reinforcement Learning with JAX

Language:PythonLicense:Apache-2.0Stargazers:352Issues:0Issues:0

reverb

Reverb is an efficient and easy-to-use data storage and transport system designed for machine learning research

Language:C++License:Apache-2.0Stargazers:693Issues:0Issues:0

awesome-model-based-RL

A curated list of awesome model based RL resources (continually updated)

License:Apache-2.0Stargazers:769Issues:0Issues:0

rliable

[NeurIPS'21 Outstanding Paper] Library for reliable evaluation on RL and ML benchmarks, even with only a handful of seeds.

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:718Issues:0Issues:0

flashbax

⚡ Flashbax: Accelerated Replay Buffers in JAX

Language:PythonLicense:Apache-2.0Stargazers:168Issues:0Issues:0

tree

tree is a library for working with nested data structures

Language:PythonLicense:Apache-2.0Stargazers:919Issues:0Issues:0

Eureka

Official Repository for "Eureka: Human-Level Reward Design via Coding Large Language Models" (ICLR 2024)

Language:Jupyter NotebookLicense:MITStargazers:2703Issues:0Issues:0

mctx

Monte Carlo tree search in JAX

Language:PythonLicense:Apache-2.0Stargazers:2226Issues:0Issues:0

LightZero

[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios

Language:PythonLicense:Apache-2.0Stargazers:909Issues:0Issues:0

jumanji

🕹️ A diverse suite of scalable reinforcement learning environments in JAX

Language:PythonLicense:Apache-2.0Stargazers:549Issues:0Issues:0

open_spiel

OpenSpiel is a collection of environments and algorithms for research in general reinforcement learning and search/planning in games.

Language:C++License:Apache-2.0Stargazers:4060Issues:0Issues:0

alpa

Training and serving large-scale neural networks with auto parallelization.

Language:PythonLicense:Apache-2.0Stargazers:3000Issues:0Issues:0
Language:PythonLicense:Apache-2.0Stargazers:1223Issues:0Issues:0

sagemaker-tensorflow-serving-container

A TensorFlow Serving solution for use in SageMaker. This repo is now deprecated.

Language:PythonLicense:Apache-2.0Stargazers:174Issues:0Issues:0

flight-sql-server-example

An example Flight SQL Server implementation - with DuckDB and SQLite back-ends.

Language:C++License:Apache-2.0Stargazers:165Issues:0Issues:0

go-iex

A Go library for accessing the IEX Developer API.

Language:GoLicense:LGPL-3.0Stargazers:92Issues:0Issues:0

environment

Neural MMO - A Massively Multiagent Environment for Artificial Intelligence Research

Language:PythonLicense:MITStargazers:483Issues:0Issues:0

envlogger

A tool for recording RL trajectories.

Language:PythonLicense:Apache-2.0Stargazers:90Issues:0Issues:0

keras-core

A multi-backend implementation of the Keras API, with support for TensorFlow, JAX, and PyTorch.

Language:PythonLicense:Apache-2.0Stargazers:1268Issues:0Issues:0

Sophia

Effortless plugin and play Optimizer to cut model training costs by 50%. New optimizer that is 2x faster than Adam on LLMs.

Language:PythonLicense:Apache-2.0Stargazers:368Issues:0Issues:0

memory-maze

Evaluating long-term memory of reinforcement learning algorithms

Language:PythonLicense:MITStargazers:125Issues:0Issues:0

online-dt

Online Decision Transformer

Language:PythonLicense:NOASSERTIONStargazers:218Issues:0Issues:0

Temporal-Latent-Bottleneck-TF

An unofficial implementation of the paper "Temporal Latent Bottleneck: Synthesis of Fast and Slow Processing Mechanisms in Sequence Learning"

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:6Issues:0Issues:0