Beast code in Giters

Poppy Humanoid is an open-source and 3D printed humanoid robot. Optimized for research and education purposes, its modularity allows for a wide range of applications and experimentations.

Language:Jupyter Notebook65300

pipeline-psro

Official Code Release for Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games

Language:PythonMIT4400

AlphaZero_Gomoku

An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)

Language:PythonMIT328300

Gomoku

iOS五子棋游戏，支持人机对战、双人对战、联机对战。iOS Gomuku game with amazing AI, developed in Objective-C

Language:Objective-C18300

gobang

javascript gobang AI，JS五子棋AI，源码+教程，基于Alpha-Beta剪枝算法（不是神经网络）

Language:JavaScript163800

Dummy-Robot

我的超迷你机械臂机器人项目。

Language:C1204100

rl4rs-papers

A collection of research and survey papers of reforcement learning (RL) based recommender system techniques.

6800

Reinforcement-Learning-Papers

Related papers for reinforcement learning, including classic papers and latest papers in top conferences

MIT29100

FourierDiffusion

This repository implements time series diffusion in the frequency domain.

Language:Jupyter NotebookMIT2400

MOSS-RLHF

Language:PythonApache-2.0127400

Firefly-LLaMA2-Chinese

Firefly中文LLaMA-2大模型，支持增量预训练Baichuan2、Llama2、Llama、Falcon、Qwen、Baichuan、InternLM、Bloom等大模型

Language:Python39600

OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)

Language:PythonApache-2.0210700

Firefly

Firefly: 大模型训练工具，支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型

Language:Python569400