TamaraAtanasoska / Semantic_Relatedness_SemEval2024

SemEval 2024 Task 1 : Textual Semantic Relatedness

Home Page:https://semantic-textual-relatedness.github.io

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

SemEval 2024 Task 1: Semantic Textual Relatedness

This repository contains the data and resources for the SemEval 2024 Task 1: Semantic Textual Relatedness (STR). For more information, please visit the shared task and competition websites.

Dataset | Languages | Shared Task Starter Kit | Citing This Work

Dataset

The STR dataset is available in the data folder or can be downloaded from Hugging Face (coming soon).

Languages

The STR task focuses on the following 14 languages:

  1. Afrikaans (afr released)
  2. Algerian Arabic (arq released)
  3. Amharic (amh released)
  4. English (eng released)
  5. Hausa (hau released)
  6. Indonesian
  7. Hindi
  8. Kinyarwanda
  9. Marathi (mar released)
  10. Modern Standard Arabic (arb released)
  11. Moroccan Arabic (ary released)
  12. Punjabi
  13. Spanish (esp released)
  14. Telugu (tel released)

Shared Task Starter Kit

A starter kit is available to help you create a baseline result. You can open the starter kit in a Colab Notebook and run the baseline system. The resultant experiment can be submitted to Codalab to ensure the submission format is clear.

To run the Colab Notebook, click the badge "Open in Colab".

  • Simple Co-occurrence Baseline for Semantic Relatedness: Open In Colab

Citing This Work

If you use our dataset or participate in the STR task, please cite the following papers:

  • STR dataset paper: coming soon
  • STR SemEval task description paper: coming soon

About

SemEval 2024 Task 1 : Textual Semantic Relatedness

https://semantic-textual-relatedness.github.io


Languages

Language:Jupyter Notebook 93.5%Language:Python 6.5%