lwaekfjlk

Haofei Yu's starred repositories

grok-1

Grok open release

Language:PythonApache-2.049395 561 208

gradio

Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!

Language:PythonApache-2.031880 167 4662

mamba

Mamba SSM architecture

Language:PythonApache-2.012372 101 485

espnet

End-to-End Speech Processing Toolkit

Language:PythonApache-2.08248 181 2346

PowerInfer

High-speed Large Language Model Serving on PCs with Consumer-grade GPUs

Language:C++MIT7840 75 155

clean-fid

PyTorch - FID calculation with proper image resizing and quantization steps [CVPR 2022]

Language:PythonMIT924 9 49

Awesome-Language-Model-on-Graphs

A curated list of papers and resources based on "Large Language Models on Graphs: A Comprehensive Survey".

MIT682 15 6

SeeAct

[ICML'24] SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large multimodal models (LMMs) such as GPT-4V(ision).

Language:PythonNOASSERTION578 16 41

LaVIN

[NeurIPS 2023] Official implementations of "Cheap and Quick: Efficient Vision-Language Instruction Tuning for Large Language Models"

Language:Python497 6 41

Pandora

Pandora: Towards General World Model with Natural Language Actions and Video States

Language:Python455 17 7

model-soups

Model soups: averaging weights of multiple fine-tuned models improves accuracy without increasing inference time

Language:PythonMIT404 10 18

AGI-survey

MIT309 10 6

miniwob-plusplus

MiniWoB++: a web interaction benchmark for reinforcement learning

Language:HTMLMIT273 15 24

visualwebarena

VisualWebArena is a benchmark for multimodal agents.

Language:PythonMIT206 5 43

ScienceWorld

ScienceWorld is a text-based virtual environment centered around accomplishing tasks from the standardized elementary science curriculum.

Language:ScalaApache-2.0199 9 33

awesome-tool-llm

160 4 2

gemini-benchmark

Language:Jupyter Notebook149 8 14

coach

Language:Jupyter Notebook57 4 1

awesome-social-agents

A collection of works that investigate social agents, simulations and their real-world impact in text, embodied, and robotics contexts.

Language:TypeScriptApache-2.052 2 8

sotopia-pi

Sotopia-π: Interactive Learning of Socially Intelligent Language Agents (ACL 2024)

Language:PythonApache-2.046 3 78

GSM-Plus

GSM-Plus: Data, Code, and Evaluation for Enhancing Robust Mathematical Reasoning in Math Word Problems.

Language:Python38 1 3

HEMM

Holistic evaluation of multimodal foundation models

Language:PythonMIT35 7 4

MM-InstructEval

This repository contains code to evaluate various multimodal large language models using different instructions across multiple multimodal content comprehension tasks.

Language:Python24 30