XLANG Lab (xlang-ai)

XLANG Lab

xlang-ai

Organization data from Github https://github.com/xlang-ai

Developing embodied AI agents that empower users to use language to interact with digital and physical environments to carry out real-world tasks.

Home Page:https://xlang.ai

GitHub:@xlang-ai

Twitter:@XLangNLP

XLANG Lab's repositories

OpenAgents

[COLM 2024] OpenAgents: An Open Platform for Language Agents in the Wild

Language:PythonLicense:Apache-2.0Stargazers:4605Issues:46Issues:103

OSWorld

[NeurIPS 2024] OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments

Language:PythonLicense:Apache-2.0Stargazers:2301Issues:28Issues:177

instructor-embedding

[ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddings

Language:PythonLicense:Apache-2.0Stargazers:2015Issues:18Issues:113

Spider2

[ICLR 2025 Oral] Spider 2.0: Evaluating Language Models on Real-World Enterprise Text-to-SQL Workflows

Language:HTMLLicense:MITStargazers:630Issues:12Issues:139

UnifiedSKG

[EMNLP 2022] Unifying and multi-tasking structured knowledge grounding with language models

Language:PythonLicense:Apache-2.0Stargazers:566Issues:12Issues:39

OpenCUA

OpenCUA: Open Foundations for Computer-Use Agents

Language:PythonLicense:MITStargazers:552Issues:6Issues:34

aguvis

[ICML2025] Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction

xlang-paper-reading

Paper collection on building and evaluating language model agents via executable language grounding

Binder

[ICLR 2023] Code for the paper "Binding Language Models in Symbolic Languages"

Language:PythonLicense:Apache-2.0Stargazers:324Issues:10Issues:11

DS-1000

[ICML 2023] Data and code release for the paper "DS-1000: A Natural and Reliable Benchmark for Data Science Code Generation".

Language:PythonLicense:CC-BY-SA-4.0Stargazers:258Issues:8Issues:21

text2reward

[ICLR 2024 Spotlight] Text2Reward: Reward Shaping with Language Models for Reinforcement Learning

Language:Jupyter NotebookStargazers:186Issues:8Issues:6

BRIGHT

[ICLR 2025] BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval

Language:PythonLicense:CC-BY-4.0Stargazers:176Issues:4Issues:13

Spider2-V

[NeurIPS 2024] Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows?

Language:Jupyter NotebookLicense:Apache-2.0Stargazers:132Issues:4Issues:3

OSWorld-G

[NeurIPS 2025 Spotlight] Scaling Computer-Use Grounding via UI Decomposition and Synthesis

icl-selective-annotation

[ICLR 2023] Code for our paper "Selective Annotation Makes Language Models Better Few-Shot Learners"

batch-prompting

[EMNLP 2023 Industry Track] A simple prompting approach that enables the LLMs to run inference in batches.

Language:PythonLicense:Apache-2.0Stargazers:66Issues:5Issues:4

computer-agent-arena

Computer Agent Arena: Test & compare AI agents in real desktop apps & web environments. Code/data coming soon!

AgentTrek

[ICLR2025 Spotlight] Agent Trajectory Synthesis via Guiding Replay with Web Tutorials

VideoAgentTrek

The official repo of VideoAgentTrek

Language:PythonLicense:MITStargazers:28Issues:0Issues:0

AgentNetTool

This is the official code base of AgentNetTool in OpenCUA. Website: https://opencua.xlang.ai/

Language:TypeScriptLicense:MITStargazers:24Issues:0Issues:1

diagrams_toolkit

Source code for diagrams in the paper of NLPers from HKU.

Language:PythonLicense:MITStargazers:5Issues:4Issues:0

xlang-ai.github.io

The official website of xlang.ai

Language:TypeScriptStargazers:4Issues:6Issues:0

verl

veRL: Volcano Engine Reinforcement Learning for LLM

Language:PythonLicense:Apache-2.0Stargazers:1Issues:0Issues:0

Pai-Megatron-Patch

The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.

License:Apache-2.0Stargazers:0Issues:0Issues:0