powergiant's starred repositories
Quiet_STaR
This project aims to implements quiet_star algoithm
quiet-star
Implementation of the Quiet-STAR paper (https://arxiv.org/pdf/2403.09629.pdf)
Awesome-LLM-Strawberry
A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 and reasoning techniques.
15_by_15_AlphaGomoku
An implementation of improved AlphaGo algorithm in the game of Gomoku.
alpha-zero-gomoku
A Multi-threaded Implementation of AlphaZero (C++)
AlphaZero_Gomoku_MPI
An asynchronous/parallel method of AlphaGo Zero algorithm with Gomoku
legged_control
NMPC, WBC, state estimation, and sim2real framework for legged robots based on OCS2 and ros-controls
nimbro-op-ros
NimbRo-OP ROS software release
ROBOTIS-OP3
ROS packages for the ROBOTIS OP3
ROBOTIS-OP2
ROS packages for the ROBOTIS OP2
poppy-humanoid
Poppy Humanoid is an open-source and 3D printed humanoid robot. Optimized for research and education purposes, its modularity allows for a wide range of applications and experimentations.
pipeline-psro
Official Code Release for Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games
AlphaZero_Gomoku
An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)
Dummy-Robot
我的超迷你机械臂机器人项目。
rl4rs-papers
A collection of research and survey papers of reforcement learning (RL) based recommender system techniques.
Reinforcement-Learning-Papers
Related papers for reinforcement learning, including classic papers and latest papers in top conferences
FourierDiffusion
This repository implements time series diffusion in the frequency domain.
Firefly-LLaMA2-Chinese
Firefly中文LLaMA-2大模型,支持增量预训练Baichuan2、Llama2、Llama、Falcon、Qwen、Baichuan、InternLM、Bloom等大模型