Haofei Yu (lwaekfjlk)

Company: CKC@ZJU -> LTI@CMU -> CS@UIUC

Location: Champaign, IL

Home Page: https://haofeiyu.me

Twitter: @haofeiyu44


Organizations
consciousness-lab
hemm-lab
sotopia-lab
ulab-uiuc
web-arena-x
WebPixie

Haofei Yu's starred repositories

grok-1

Grok open release

Language: Python, License: Apache-2.0, Stargazers: 49395, Issues: 561, Issues: 208

gradio

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

Language: Python, License: Apache-2.0, Stargazers: 31880, Issues: 167, Issues: 4662

mamba

Mamba SSM architecture

Language: Python, License: Apache-2.0, Stargazers: 12372, Issues: 101, Issues: 485

espnet

End-to-End Speech Processing Toolkit

Language: Python, License: Apache-2.0, Stargazers: 8248, Issues: 181, Issues: 2346

PowerInfer

High-speed Large Language Model Serving on PCs with Consumer-grade GPUs

Language: C++, License: MIT, Stargazers: 7840, Issues: 75, Issues: 155

clean-fid

PyTorch - FID calculation with proper image resizing and quantization steps [CVPR 2022]

Language: Python, License: MIT, Stargazers: 924, Issues: 9, Issues: 49

Awesome-Language-Model-on-Graphs

A curated list of papers and resources based on "Large Language Models on Graphs: A Comprehensive Survey".

SeeAct

[ICML'24] SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large multimodal models (LMMs) such as GPT-4V(ision).

Language: Python, License: NOASSERTION, Stargazers: 578, Issues: 16, Issues: 41

LaVIN

[NeurIPS 2023] Official implementations of "Cheap and Quick: Efficient Vision-Language Instruction Tuning for Large Language Models"

Pandora

Pandora: Towards General World Model with Natural Language Actions and Video States

model-soups

Model soups: averaging weights of multiple fine-tuned models improves accuracy without increasing inference time

Language: Python, License: MIT, Stargazers: 404, Issues: 10, Issues: 18

miniwob-plusplus

MiniWoB++: a web interaction benchmark for reinforcement learning

Language: HTML, License: MIT, Stargazers: 273, Issues: 15, Issues: 24

visualwebarena

VisualWebArena is a benchmark for multimodal agents.

Language: Python, License: MIT, Stargazers: 206, Issues: 5, Issues: 43

ScienceWorld

ScienceWorld is a text-based virtual environment centered around accomplishing tasks from the standardized elementary science curriculum.

Language: Scala, License: Apache-2.0, Stargazers: 199, Issues: 9, Issues: 33

Language: Jupyter Notebook, Stargazers: 57, Issues: 4, Issues: 1

awesome-social-agents

A collection of works that investigate social agents, simulations and their real-world impact in text, embodied, and robotics contexts.

Language: TypeScript, License: Apache-2.0, Stargazers: 52, Issues: 2, Issues: 8

sotopia-pi

Sotopia-π: Interactive Learning of Socially Intelligent Language Agents (ACL 2024)

Language: Python, License: Apache-2.0, Stargazers: 46, Issues: 3, Issues: 78

GSM-Plus

GSM-Plus: Data, Code, and Evaluation for Enhancing Robust Mathematical Reasoning in Math Word Problems.

HEMM

Holistic evaluation of multimodal foundation models

Language: Python, License: MIT, Stargazers: 35, Issues: 7, Issues: 4

MM-InstructEval

This repository contains code to evaluate various multimodal large language models using different instructions across multiple multimodal content comprehension tasks.

Language: Python, Stargazers: 24, Issues: 3, Issues: 0

Language: Python, License: MIT, Stargazers: 13, Issues: 1, Issues: 0

FeedbackPreference

This is the repo for our proposed Feedback Preference corpus.

Language: Python, License: MIT, Stargazers: 4, Issues: 2, Issues: 0

sotopia-space

A synced repository for https://huggingface.co/spaces/wdplx/sotopia-space/

Language: Python, Stargazers: 1, Issues: 0, Issues: 0