Beast code in Giters

The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents using deep reinforcement learning and imitation learning.

Language:C#NOASSERTION1697800

virtualhome

API to run VirtualHome, a Multi-Agent Household Simulator

Language:PythonMIT45400

implicit_chain_of_thought

Language:Python9100

hfppl

Probabilistic programming with HuggingFace language models

Language:Python8600

ziglings

Learn the Zig programming language by fixing tiny broken programs.

MIT429300

CTranslate2

Fast inference engine for Transformer models

Language:C++MIT325300

LLM_Tree_Search

The official implementation of paper: Alphazero-like Tree-Search can guide large language model decoding and training

200

gpt-fast

Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.

Language:PythonBSD-3-Clause555800

LLM_Tree_Search

(ICML 2024) Alphazero-like Tree-Search can guide large language model decoding and training

Language:Python18700

TaskWeaver

A code-first agent framework for seamlessly planning and executing data analytics tasks.

Language:PythonMIT522900

knowledge_graph_attention_network

KGAT: Knowledge Graph Attention Network for Recommendation, KDD2019

Language:PythonMIT105600

PyTorch-BigGraph

Generate embeddings from large-scale graph-structured data.

Language:PythonNOASSERTION336500

nle

The NetHack Learning Environment

Language:CNOASSERTION93700

worldsense

WorldSense benchmark for grounded reasoning in language models

Language:PythonNOASSERTION1300

diplomacy_cicero

Code for Cicero, an AI agent that plays the game of Diplomacy with open-domain natural language negotiation.

Language:PythonNOASSERTION128400

diplomacy

Diplomacy: DATC-Compliant Game Engine with Web Interface

Language:PythonAGPL-3.010000

DeepDip

DeepDip, a DRL Gym agent that plays no-press Diplomacy in BANDANA

Language:PythonGPL-3.01200

diplomacy

Language:PythonApache-2.04500

muzero

A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and Pit both algorithms against each other, and investigate reliability of learned MuZero MDP models.

Language:Jupyter NotebookMIT15400

muzero-general

MuZero

Language:PythonMIT247400

lark

Lark is a parsing toolkit for Python, built with a focus on ergonomics, performance and modularity.

Language:PythonMIT479800

openhd

Language:C++MIT800

qdrant-azure

Qdrant Vector Database on Azure Cloud

Language:ShellMIT9000