XLANG Lab

XLANG Lab's repositories

OpenAgents

[COLM 2024] OpenAgents: An Open Platform for Language Agents in the Wild

Language:PythonApache-2.04605 46 103

OSWorld

[NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments

Language:PythonApache-2.02301 28 177

instructor-embedding

[ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddings

Language:PythonApache-2.02015 18 113

Spider2

[ICLR 2025 Oral] Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows

Language:HTMLMIT630 12 139

UnifiedSKG

[EMNLP 2022] Unifying and multi-tasking structured knowledge grounding with language models

Language:PythonApache-2.0566 12 39

OpenCUA

OpenCUA: Open Foundations for Computer-Use Agents

Language:PythonMIT552 6 34

aguvis

[ICML2025] Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction

Language:Python367 15 30

xlang-paper-reading

Paper collection on building and evaluating language model agents via executable language grounding

362 110

Binder

[ICLR 2023] Code for the paper "Binding Language Models in Symbolic Languages"

Language:PythonApache-2.0324 10 11

DS-1000

[ICML 2023] Data and code release for the paper "DS-1000: A Natural and Reliable Benchmark for Data Science Code Generation".

Language:PythonCC-BY-SA-4.0258 8 21

text2reward

[ICLR 2024 Spotlight] Text2Reward: Reward Shaping with Language Models for Reinforcement Learning

Language:Jupyter Notebook186 8 6

BRIGHT

[ICLR 2025] BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval

Language:PythonCC-BY-4.0176 4 13

Spider2-V

[NeurIPS 2024] Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?

Language:Jupyter NotebookApache-2.0132 4 3

OSWorld-G

[NeurIPS 2025 Spotlight] Scaling Computer-Use Grounding via UI Decomposition and Synthesis

Language:TypeScript126 5 9

icl-selective-annotation

[ICLR 2023] Code for our paper "Selective Annotation Makes Language Models Better Few-Shot Learners"

Language:Python111 4 2

batch-prompting

[EMNLP 2023 Industry Track] A simple prompting approach that enables the LLMs to run inference in batches.

Language:Python76 7 2

EVOR

Language:PythonApache-2.066 5 4

computer-agent-arena

Computer Agent Arena: Test & compare AI agents in real desktop apps & web environments. Code/data coming soon!

Apache-2.050 6 1

AgentTrek

[ICLR2025 Spotlight] Agent Trajectory Synthesis via Guiding Replay with Web Tutorials

Language:Python47 4 4

VideoAgentTrek

The official repo of VideoAgentTrek

Language:PythonMIT2800

AgentNetTool

This is the official code base of AgentNetTool in OpenCUA. Website: https://opencua.xlang.ai/

Language:TypeScriptMIT2401

diagrams_toolkit

Source code for diagrams in the paper of NLPers from HKU.

Language:PythonMIT5 40

xlang-ai.github.io

The official website of xlang.ai

Language:TypeScript4 60

.github

1 30

verl

veRL: Volcano Engine Reinforcement Learning for LLM

Language:PythonApache-2.0100

Pai-Megatron-Patch

The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.

Apache-2.0000