peter-peng-w

Peng Wang's starred repositories

MARIO_EVAL

Language: Python · 1800

NLProofS

EMNLP 2022: Generating Natural Language Proofs with Verifier-Guided Search https://arxiv.org/abs/2205.12443

Language: Python · MIT · 8000

ReST-MCTS

ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search

Language: Python · 5800

OpenDevin

🐚 OpenDevin: Code Less, Make More

Language: Python · MIT · 2929900

RLBench

A large-scale benchmark and learning environment.

Language: Python · NOASSERTION · 106100

llemma_formal2formal

Llemma formal2formal (tactic prediction) theorem proving experiments

Language: Python · MIT · 1500

llm-continual-learning-survey

Continual Learning of Large Language Models: A Comprehensive Survey

17700

CEB

Language: Python · 400

llmstep

llmstep: [L]LM proofstep suggestions in Lean 4.

Language: Python · MIT · 10400

Multi-Agents-Debate

MAD: The first work to explore Multi-Agent Debate with Large Language Models :D

Language: Python · GPL-3.0 · 22000

set.mm

Metamath source file for logic and set theory

Language: HTML · CC0-1.0 · 23900

draft_sketch_prove

Language: Python · NOASSERTION · 5700

ntptutorial

Tutorial on neural theorem proving

Language: Jupyter Notebook · MIT · 14700

parsel

Code for Parsel 🐍 - generate complex programs with language models

Language: Python · 39900

DotaMath

2000

LightZero

[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)

Language: Python · Apache-2.0 · 99100

MCTS-DPO

This is the repository that contains the source code for the Self-Evaluation Guided MCTS for online DPO.

Language: Python · Apache-2.0 · 4200

lean4-maze

maze game encoded in Lean 4 syntax

Language: Lean · Apache-2.0 · 4300

ChatGLM-Math

Language: Python · MIT · 7100

llm-reasoners

A library for advanced large language model reasoning

Language: Python · Apache-2.0 · 103200

TheoremLlama

This is the official repository for all the code of TheoremLlama

2300

DL4TP

[COLM 2024] A Survey on Deep Learning for Theorem Proving

MIT · 9800

AgentGym

Code and implementations for the paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhiheng Xi et al.

Language: Python · MIT · 27400

automatic-lean4-compilation

Language: Lean · 1000

LLM_Tree_Search

(ICML 2024) Alphazero-like Tree-Search can guide large language model decoding and training

Language: Python · 15400

abel

SOTA Math Opensource LLM

Language: Python · 29200

dl4math

Resources of deep learning for mathematical reasoning (DL4MATH).

MIT · 31900

Step-DPO

Implementation for "Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs"

Language: Python · 17300

InstructRAG

InstructRAG: Instructing Retrieval-Augmented Generation with Explicit Denoising

Language: Python · MIT · 2100

Self-Explore

Self-Explore to avoid the pit! Improving the Reasoning Capabilities of Language Models with Fine-grained Rewards

Language: Python · 3300