Weihua Du (StigLidu)

StigLidu

Geek Repo

Company:CMU

Location:Pittsburgh, US

Home Page:https://stiglidu.github.io/

Github PK Tool:Github PK Tool

Weihua Du's starred repositories

ScienceWorld

ScienceWorld is a text-based virtual environment centered around accomplishing tasks from the standardized elementary science curriculum.

Language:ScalaLicense:Apache-2.0Stargazers:205Issues:0Issues:0

Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Language:PythonLicense:Apache-2.0Stargazers:21733Issues:0Issues:0

Agent-FLAN

[ACL2024 Findings] Agent-FLAN: Designing Data and Methods of Effective Agent Tuning for Large Language Models

License:Apache-2.0Stargazers:321Issues:0Issues:0

video-nonlocal-net

Non-local Neural Networks for Video Classification

Language:PythonLicense:NOASSERTIONStargazers:1971Issues:0Issues:0

mmaction2

OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark

Language:PythonLicense:Apache-2.0Stargazers:4197Issues:0Issues:0
Language:ShellStargazers:752Issues:0Issues:0

VMZ

VMZ: Model Zoo for Video Modeling

Language:PythonLicense:Apache-2.0Stargazers:1037Issues:0Issues:0

training_extensions

Train, Evaluate, Optimize, Deploy Computer Vision Models via OpenVINO™

Language:PythonLicense:Apache-2.0Stargazers:1139Issues:0Issues:0

uoj

Universal Online Judge

Language:JavaScriptLicense:MITStargazers:524Issues:0Issues:0

Voyager

An Open-Ended Embodied Agent with Large Language Models

Language:JavaScriptLicense:MITStargazers:5512Issues:0Issues:0

DeepSeek-Math

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

Language:PythonLicense:MITStargazers:788Issues:0Issues:0

lagent

A lightweight framework for building LLM-based agents

Language:PythonLicense:Apache-2.0Stargazers:1763Issues:0Issues:0

lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Language:PythonLicense:Apache-2.0Stargazers:4298Issues:0Issues:0

on-policy

This is the official implementation of Multi-Agent PPO (MAPPO).

Language:PythonLicense:MITStargazers:1270Issues:0Issues:0

HAZARD

HAZARD challenge

Language:PythonLicense:BSD-2-ClauseStargazers:25Issues:0Issues:0

docs

清华大学飞跃手册

License:NOASSERTIONStargazers:297Issues:0Issues:0

google-research

Google Research

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:33916Issues:0Issues:0

SmartPlay

SmartPlay is a benchmark for Large Language Models (LLMs). Uses a variety of games to test various important LLM capabilities as agents. SmartPlay is designed to be easy to use, and to support future development of LLMs.

Language:PythonLicense:CC-BY-4.0Stargazers:116Issues:0Issues:0

Grounding_LLMs_with_online_RL

We perform functional grounding of LLMs' knowledge in BabyAI-Text

Language:PythonLicense:MITStargazers:214Issues:0Issues:0

llm-reasoners

A library for advanced large language model reasoning

Language:PythonLicense:Apache-2.0Stargazers:1159Issues:0Issues:0

text2reward

[ICLR 2024] Code for the paper "Text2Reward: Automated Dense Reward Function Generation for Reinforcement Learning"

Language:Jupyter NotebookStargazers:114Issues:0Issues:0

rl-prompt

Accompanying repo for the RLPrompt paper

Language:PythonLicense:MITStargazers:294Issues:0Issues:0

grace

[EMNLP 2023, Findings] GRACE: Discriminator-Guided Chain-of-Thought Reasoning

Language:PythonStargazers:42Issues:0Issues:0

T-Eval

[ACL2024] T-Eval: Evaluating Tool Utilization Capability of Large Language Models Step by Step

Language:PythonLicense:Apache-2.0Stargazers:214Issues:0Issues:0

XAgent

An Autonomous LLM Agent for Complex Task Solving

Language:PythonLicense:Apache-2.0Stargazers:8058Issues:0Issues:0

BIG-Bench-Hard

Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them

License:MITStargazers:416Issues:0Issues:0

InternLM

Official release of InternLM2.5 base and chat models. 1M context support

Language:PythonLicense:Apache-2.0Stargazers:6281Issues:0Issues:0

FastChat

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Language:PythonLicense:Apache-2.0Stargazers:36551Issues:0Issues:0

Qwen

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Language:PythonLicense:Apache-2.0Stargazers:13555Issues:0Issues:0

evalplus

Rigourous evaluation of LLM-synthesized code - NeurIPS 2023

Language:PythonLicense:Apache-2.0Stargazers:1160Issues:0Issues:0