powergiant's starred repositories

Quiet_STaR

This project aims to implements quiet_star algoithm

Language:PythonLicense:MITStargazers:4Issues:0Issues:0

quiet-star

Implementation of the Quiet-STAR paper (https://arxiv.org/pdf/2403.09629.pdf)

Language:PythonLicense:MITStargazers:35Issues:0Issues:0

dlrover

DLRover: An Automatic Distributed Deep Learning System

Language:PythonLicense:NOASSERTIONStargazers:1224Issues:0Issues:0

Awesome-LLM-Strawberry

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.

License:Apache-2.0Stargazers:3979Issues:0Issues:0
Language:TypeScriptLicense:MITStargazers:45Issues:0Issues:0

15_by_15_AlphaGomoku

An implementation of improved AlphaGo algorithm in the game of Gomoku.

Language:PythonLicense:Apache-2.0Stargazers:57Issues:0Issues:0

alpha-zero-gomoku

A Multi-threaded Implementation of AlphaZero (C++)

Language:PythonStargazers:366Issues:0Issues:0

AlphaZero_Gomoku_MPI

An asynchronous/parallel method of AlphaGo Zero algorithm with Gomoku

Language:PythonStargazers:185Issues:0Issues:0

gomoku_rl

train AI agents to master Free-style Gomoku(五子棋)

Language:PythonLicense:MITStargazers:8Issues:0Issues:0

legged_control

NMPC, WBC, state estimation, and sim2real framework for legged robots based on OCS2 and ros-controls

Language:C++License:BSD-3-ClauseStargazers:906Issues:0Issues:0

gz-sim

Open source robotics simulator. The latest version of Gazebo.

Language:C++License:Apache-2.0Stargazers:685Issues:0Issues:0

nimbro-op

Sourcecode & CAD drawings of NimbRo-OP

Language:ShellLicense:NOASSERTIONStargazers:25Issues:0Issues:0

nimbro-op-ros

NimbRo-OP ROS software release

Language:C++License:NOASSERTIONStargazers:38Issues:0Issues:0

ROBOTIS-OP3

ROS packages for the ROBOTIS OP3

Language:C++License:Apache-2.0Stargazers:114Issues:0Issues:0

ROBOTIS-OP2

ROS packages for the ROBOTIS OP2

Language:C++License:Apache-2.0Stargazers:5Issues:0Issues:0
Language:C++License:MITStargazers:2550Issues:0Issues:0

rx1

RX1 humanoid robot ROS1 pacakge

Language:C++License:MITStargazers:75Issues:0Issues:0

poppy-humanoid

Poppy Humanoid is an open-source and 3D printed humanoid robot. Optimized for research and education purposes, its modularity allows for a wide range of applications and experimentations.

Language:Jupyter NotebookStargazers:653Issues:0Issues:0

pipeline-psro

Official Code Release for Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games

Language:PythonLicense:MITStargazers:44Issues:0Issues:0

AlphaZero_Gomoku

An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)

Language:PythonLicense:MITStargazers:3283Issues:0Issues:0

Gomoku

iOS五子棋游戏,支持人机对战、双人对战、联机对战。iOS Gomuku game with amazing AI, developed in Objective-C

Language:Objective-CStargazers:183Issues:0Issues:0

gobang

javascript gobang AI,JS五子棋AI,源码+教程,基于Alpha-Beta剪枝算法(不是神经网络)

Language:JavaScriptStargazers:1638Issues:0Issues:0

Dummy-Robot

我的超迷你机械臂机器人项目。

Language:CStargazers:12041Issues:0Issues:0

rl4rs-papers

A collection of research and survey papers of reforcement learning (RL) based recommender system techniques.

Stargazers:68Issues:0Issues:0

Reinforcement-Learning-Papers

Related papers for reinforcement learning, including classic papers and latest papers in top conferences

License:MITStargazers:291Issues:0Issues:0

FourierDiffusion

This repository implements time series diffusion in the frequency domain.

Language:Jupyter NotebookLicense:MITStargazers:24Issues:0Issues:0

MOSS-RLHF

MOSS-RLHF

Language:PythonLicense:Apache-2.0Stargazers:1274Issues:0Issues:0

Firefly-LLaMA2-Chinese

Firefly中文LLaMA-2大模型,支持增量预训练Baichuan2、Llama2、Llama、Falcon、Qwen、Baichuan、InternLM、Bloom等大模型

Language:PythonStargazers:396Issues:0Issues:0

OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)

Language:PythonLicense:Apache-2.0Stargazers:2107Issues:0Issues:0

Firefly

Firefly: 大模型训练工具,支持训练Qwen2.5、Qwen2、Yi1.5、Phi-3、Llama3、Gemma、MiniCPM、Yi、Deepseek、Orion、Xverse、Mixtral-8x7B、Zephyr、Mistral、Baichuan2、Llma2、Llama、Qwen、Baichuan、ChatGLM2、InternLM、Ziya2、Vicuna、Bloom等大模型

Language:PythonStargazers:5694Issues:0Issues:0