artivus2023

artivus2023's starred repositories

TextWorld

TextWorld is a sandbox learning environment for the training and evaluation of reinforcement learning (RL) agents on text-based games.

Language: Jupyter Notebook | License: NOASSERTION | Stargazers: 1202 | Issues: 0

MoE-LLaVA

Mixture-of-Experts for Large Vision-Language Models

Language: Python | License: Apache-2.0 | Stargazers: 1933 | Issues: 0

LLaMA-Factory

Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)

Language: Python | License: Apache-2.0 | Stargazers: 31656 | Issues: 0

mergekit

Tools for merging pretrained large language models.

Language: Python | License: LGPL-3.0 | Stargazers: 4570 | Issues: 0

mlx-deep-dive

A deep dive into the MLX deep learning framework

Language: Python | License: MIT | Stargazers: 18 | Issues: 0

MineDojo

Building Open-Ended Embodied Agents with Internet-Scale Knowledge

Language: Java | License: MIT | Stargazers: 1762 | Issues: 0

minerl

MineRL Competition for Sample Efficient Reinforcement Learning - Python Package

Language: Java | License: NOASSERTION | Stargazers: 685 | Issues: 0

ml-agents

The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents using deep reinforcement learning and imitation learning.

Language: C# | License: NOASSERTION | Stargazers: 16978 | Issues: 0

virtualhome

API to run VirtualHome, a Multi-Agent Household Simulator

Language: Python | License: MIT | Stargazers: 454 | Issues: 0

Language: Python | Stargazers: 91 | Issues: 0

hfppl

Probabilistic programming with HuggingFace language models

Language: Python | Stargazers: 86 | Issues: 0

ziglings

Learn the Zig programming language by fixing tiny broken programs.

License: MIT | Stargazers: 4293 | Issues: 0

CTranslate2

Fast inference engine for Transformer models

Language: C++ | License: MIT | Stargazers: 3253 | Issues: 0

LLM_Tree_Search

The official implementation of the paper "Alphazero-like Tree-Search can guide large language model decoding and training"

Stargazers: 2 | Issues: 0

gpt-fast

Simple and efficient PyTorch-native transformer text generation in <1000 LOC of Python.

Language: Python | License: BSD-3-Clause | Stargazers: 5558 | Issues: 0

LLM_Tree_Search

(ICML 2024) Alphazero-like Tree-Search can guide large language model decoding and training

Language: Python | Stargazers: 187 | Issues: 0

TaskWeaver

A code-first agent framework for seamlessly planning and executing data analytics tasks.

Language: Python | License: MIT | Stargazers: 5229 | Issues: 0

knowledge_graph_attention_network

KGAT: Knowledge Graph Attention Network for Recommendation, KDD2019

Language: Python | License: MIT | Stargazers: 1056 | Issues: 0

PyTorch-BigGraph

Generate embeddings from large-scale graph-structured data.

Language: Python | License: NOASSERTION | Stargazers: 3365 | Issues: 0

nle

The NetHack Learning Environment

Language: C | License: NOASSERTION | Stargazers: 937 | Issues: 0

worldsense

WorldSense benchmark for grounded reasoning in language models

Language: Python | License: NOASSERTION | Stargazers: 13 | Issues: 0

diplomacy_cicero

Code for Cicero, an AI agent that plays the game of Diplomacy with open-domain natural language negotiation.

Language: Python | License: NOASSERTION | Stargazers: 1284 | Issues: 0

diplomacy

Diplomacy: DATC-Compliant Game Engine with Web Interface

Language: Python | License: AGPL-3.0 | Stargazers: 100 | Issues: 0

DeepDip

DeepDip, a DRL Gym agent that plays no-press Diplomacy in BANDANA

Language: Python | License: GPL-3.0 | Stargazers: 12 | Issues: 0

Language: Python | License: Apache-2.0 | Stargazers: 45 | Issues: 0

muzero

A clean implementation of MuZero and AlphaZero following the AlphaZero General framework. Train and pit both algorithms against each other, and investigate the reliability of learned MuZero MDP models.

Language: Jupyter Notebook | License: MIT | Stargazers: 154 | Issues: 0

Language: Python | License: MIT | Stargazers: 2474 | Issues: 0

lark

Lark is a parsing toolkit for Python, built with a focus on ergonomics, performance and modularity.

Language: Python | License: MIT | Stargazers: 4798 | Issues: 0

Language: C++ | License: MIT | Stargazers: 8 | Issues: 0

qdrant-azure

Qdrant Vector Database on Azure Cloud

Language: Shell | License: MIT | Stargazers: 90 | Issues: 0