SivilTaram

[ICML'24] SeeAct is a system for generalist web agents that autonomously carry out tasks on any given website, with a focus on large multimodal models (LMMs) such as GPT-4V(ision).

Language:PythonNOASSERTION510 17 27

gritlm

Generative Representational Instruction Tuning

Language:Jupyter NotebookMIT439 8 32

zero-bubble-pipeline-parallelism

Zero Bubble Pipeline Parallelism

Language:PythonNOASSERTION201 5 12

AgentBoard

An Analytical Evaluation Board of Multi-turn LLM Agents

Language:SAS199 4 7

LongMamba

Some preliminary explorations of Mamba's context scaling.

Language:Python170 15 4

visualwebarena

VisualWebArena is a benchmark for multimodal agents.

Language:PythonMIT157 7 28

sewformer

Language:Python117 19 16

gated_linear_attention

Language:PythonMIT84 6 8

sailor-llm

Sailor: Open Language Models for South-East Asia

Language:PythonMIT78 7 1

Agent-Smith

[ICML2024] Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially Fast

Language:PythonMIT54 6 1

astraios

Astraios: Parameter-Efficient Instruction Tuning Code Language Models

Language:Jupyter NotebookMIT52 4 4

weak-to-strong

Weak-to-Strong Jailbreaking on Large Language Models

Language:PythonMIT4700

autofd

Automatic Functional Differentiation in JAX

Language:PythonApache-2.044 4 3

arks

Language:Python40 5 2

AnyDoor

AnyDoor: Test-Time Backdoor Attacks on Multimodal Large Language Models

Language:Python34 6 1

youtube_subtitle_dataset

YT_subtitles - extracts subtitles from YouTube videos to raw text for Language Model training

Language:Python25 2 1

SciTab

The project page for "SCITAB: A Challenging Benchmark for Compositional Reasoning and Claim Verification on Scientific Tables"

MIT16 2 1

thaimaimee

Scrape, clean and explore ThaiME dataset

Language:Jupyter NotebookNOASSERTION12 20

Bridge_for_Numerical_Reasoning

Language:Python400