SignLLM / Prompt2Sign

The Data and Code of Prompt2Sign: The First Comprehensive Multilingual Sign Language Dataset.

Home Page: https://signllm.github.io/Prompt2Sign/

Prompt2Sign

Welcome to Prompt2Sign! This repository stores the preprocessed data for the paper:
SignLLM: Sign Languages Production Large Language Models.

Note: The release of our data is tentatively expected at the end of 2024, so don't rush.

News

[2024.06.30] The Jupyter Notebook and Docker image for data processing have been released.
[2024.05.17] The arXiv version of the paper is now available.
[2024.01.16] The Prompt2Sign homepage is available. The data is expected to be released after the paper is accepted (possibly at the end of 2024, so don't rush).
[2023.12.14] We have made supplementary materials and demo available at this page.
[2023.11.04] We have made Prompt2Sign and Tools available on GitHub.

Dataset Introduction

Prompt2Sign is the first comprehensive multilingual sign language dataset. It uses tools to automate the acquisition and processing of sign language videos from the web. It is an evolving dataset that is efficient and lightweight, and it addresses shortcomings of previous datasets. The details of the dataset are available at https://signllm.github.io/Prompt2Sign/.

Current languages include: American Sign Language (ASL), German Sign Language (GSL, also known as DGS), Swiss German Sign Language (DSGS), French Sign Language of Switzerland (LSF-CH), Italian Sign Language of Switzerland (LIS-CH), Argentine Sign Language (Lengua de Señas Argentina, LSA), Korean Sign Language (KSL), and Turkish Sign Language (TSL).

Dataset Summary
Name Language Vocab. Duration (h) Signers Multiview Transcription Gloss Pose Depth Speech Prompt Compress
Video-Based CSL CSL 178 100 50 ✔️ ✔️ ✔️
SIGNUM GSL 450 55 25 ✔️ ✔️
RWTH-Phoenix-2014T GSL 3k 11 9 ✔️ ✔️
Public DGS Corpus GSL -- 50 327 ✔️ ✔️ ✔️ ✔️
BSL Corpus BSL 5k -- 249 ✔️ ✔️
NCSLGR ASL 1.8k 5.3 4 ✔️ ✔️ ✔️
How2Sign ASL 16k 79 11 ✔️ ✔️ ✔️ ✔️ ✔️ ✔️
Prompt2Sign (ours) Multilingual 40k 200 40 ✔️ ✔️ ✔️ ✔️ ✔️ ✔️ ✔️ ✔️
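
For a quick programmatic comparison, the numeric columns of the table above can be transcribed into a small Python structure. This is only an illustrative sketch: `DatasetRow`, `parse_count`, and `largest_by` are hypothetical names and not part of the Prompt2Sign tools. The values are copied from the table, with `--` treated as missing; the annotation columns (Multiview through Compress) are left out.

```python
from dataclasses import dataclass
from typing import List, Optional


def parse_count(text: str) -> Optional[float]:
    """Convert a table cell such as '16k', '5.3', or '--' to a number."""
    text = text.strip()
    if text == "--":          # '--' marks a value the table does not report
        return None
    if text.endswith("k"):    # '16k' means 16 thousand
        return float(text[:-1]) * 1000
    return float(text)


@dataclass
class DatasetRow:
    # Hypothetical container for one row of the summary table.
    name: str
    language: str
    vocab: Optional[float]
    duration_h: Optional[float]
    signers: Optional[float]


# Name, Language, Vocab., Duration (h), Signers, as listed in the table.
RAW_ROWS = [
    ("Video-Based CSL", "CSL", "178", "100", "50"),
    ("SIGNUM", "GSL", "450", "55", "25"),
    ("RWTH-Phoenix-2014T", "GSL", "3k", "11", "9"),
    ("Public DGS Corpus", "GSL", "--", "50", "327"),
    ("BSL Corpus", "BSL", "5k", "--", "249"),
    ("NCSLGR", "ASL", "1.8k", "5.3", "4"),
    ("How2Sign", "ASL", "16k", "79", "11"),
    ("Prompt2Sign (ours)", "Multilingual", "40k", "200", "40"),
]

ROWS: List[DatasetRow] = [
    DatasetRow(name, lang, parse_count(v), parse_count(d), parse_count(s))
    for name, lang, v, d, s in RAW_ROWS
]


def largest_by(rows: List[DatasetRow], field: str) -> DatasetRow:
    """Return the row with the largest reported value for `field`."""
    known = [r for r in rows if getattr(r, field) is not None]
    return max(known, key=lambda r: getattr(r, field))


if __name__ == "__main__":
    for field in ("vocab", "duration_h", "signers"):
        top = largest_by(ROWS, field)
        print(f"{field}: {top.name} ({getattr(top, field):g})")
```

Run as a script, this prints the dataset with the largest reported value in each numeric column. Rows with a missing value are skipped for that column rather than counted as zero.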

How To Cite

Please cite the following paper when using Prompt2Sign in your research:

@misc{fang2024signllm,
      title={SignLLM: Sign Languages Production Large Language Models}, 
      author={Sen Fang and Lei Wang and Ce Zheng and Yapeng Tian and Chen Chen},
      year={2024},
      eprint={2405.10718},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

@misc{fang2023signdiff,
      title={SignDiff: Learning Diffusion Models for American Sign Language Production}, 
      author={Sen Fang and Chunyu Sui and Xuedong Zhang and Yapeng Tian},
      year={2023},
      eprint={2308.16082},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

About

License: Other

Languages

Python 63.9%, Jupyter Notebook 36.1%