Peng Wang (peter-peng-w)


Company: University of Virginia

Location: Charlottesville, VA

Home Page: https://peter-peng-w.github.io/


Peng Wang's starred repositories

Language: Python | Stargazers: 18 | Issues: 0

NLProofS

EMNLP 2022: Generating Natural Language Proofs with Verifier-Guided Search https://arxiv.org/abs/2205.12443

Language: Python | License: MIT | Stargazers: 80 | Issues: 0

ReST-MCTS

ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search

Language: Python | Stargazers: 58 | Issues: 0

OpenDevin

🐚 OpenDevin: Code Less, Make More

Language: Python | License: MIT | Stargazers: 29299 | Issues: 0

RLBench

A large-scale benchmark and learning environment.

Language: Python | License: NOASSERTION | Stargazers: 1061 | Issues: 0

llemma_formal2formal

Llemma formal2formal (tactic prediction) theorem proving experiments

Language: Python | License: MIT | Stargazers: 15 | Issues: 0

llm-continual-learning-survey

Continual Learning of Large Language Models: A Comprehensive Survey

Stargazers: 177 | Issues: 0

Language: Python | Stargazers: 4 | Issues: 0

llmstep

llmstep: [L]LM proofstep suggestions in Lean 4.

Language: Python | License: MIT | Stargazers: 104 | Issues: 0

Multi-Agents-Debate

MAD: The first work to explore Multi-Agent Debate with Large Language Models :D

Language: Python | License: GPL-3.0 | Stargazers: 220 | Issues: 0

set.mm

Metamath source file for logic and set theory

Language: HTML | License: CC0-1.0 | Stargazers: 239 | Issues: 0

Language: Python | License: NOASSERTION | Stargazers: 57 | Issues: 0

ntptutorial

Tutorial on neural theorem proving

Language: Jupyter Notebook | License: MIT | Stargazers: 147 | Issues: 0

parsel

Code for Parsel 🐍 - generate complex programs with language models

Language: Python | Stargazers: 399 | Issues: 0

Stargazers: 20 | Issues: 0

LightZero

[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)

Language: Python | License: Apache-2.0 | Stargazers: 991 | Issues: 0

MCTS-DPO

Source code for Self-Evaluation Guided MCTS for online DPO.

Language: Python | License: Apache-2.0 | Stargazers: 42 | Issues: 0

lean4-maze

maze game encoded in Lean 4 syntax

Language: Lean | License: Apache-2.0 | Stargazers: 43 | Issues: 0

Language: Python | License: MIT | Stargazers: 71 | Issues: 0

llm-reasoners

A library for advanced large language model reasoning

Language: Python | License: Apache-2.0 | Stargazers: 1032 | Issues: 0

TheoremLlama

This is the official repository for all the code of TheoremLlama

Stargazers: 23 | Issues: 0

DL4TP

[COLM 2024] A Survey on Deep Learning for Theorem Proving

License: MIT | Stargazers: 98 | Issues: 0

AgentGym

Code and implementations for the paper "AgentGym: Evolving Large Language Model-based Agents across Diverse Environments" by Zhiheng Xi et al.

Language: Python | License: MIT | Stargazers: 274 | Issues: 0

Language: Lean | Stargazers: 10 | Issues: 0

LLM_Tree_Search

(ICML 2024) Alphazero-like Tree-Search can guide large language model decoding and training

Language: Python | Stargazers: 154 | Issues: 0

abel

SOTA Math Opensource LLM

Language: Python | Stargazers: 292 | Issues: 0

dl4math

Resources of deep learning for mathematical reasoning (DL4MATH).

License: MIT | Stargazers: 319 | Issues: 0

Step-DPO

Implementation for "Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs"

Language: Python | Stargazers: 173 | Issues: 0

InstructRAG

InstructRAG: Instructing Retrieval-Augmented Generation with Explicit Denoising

Language: Python | License: MIT | Stargazers: 21 | Issues: 0

Self-Explore

Self-Explore to Avoid the Pit! Improving the Reasoning Capabilities of Language Models with Fine-grained Rewards

Language: Python | Stargazers: 33 | Issues: 0