nurulchamidah / Semantic_Relatedness_SemEval2024

SemEval 2024 Task 12 : Textual Semantic Relatedness

Home Page:https://semantic-textual-relatedness.github.io

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

SemEval 2024 Task 1: Semantic Textual Relatedness

This repository contains the data and resources for the SemEval 2024 Task 1: Semantic Textual Relatedness (STR). For more information, please visit the shared task and competition websites.

Languages

The STR task focuses on the following 14 languages:

  1. Afrikaans
  2. Algerian Arabic
  3. Amharic (released)
  4. English (released)
  5. Hausa (released)
  6. Indonesian
  7. Hindi
  8. Kinyarwanda
  9. Marathi (released)
  10. Modern Standard Arabic
  11. Moroccan Arabic (released)
  12. Punjabi
  13. Spanish
  14. Telugu (released)

Dataset

The STR dataset is available in the data folder or can be downloaded from Hugging Face.

Subtasks

  • For Subtask A: Check SubtaskA folder
  • For Subtask B: Check SubtaskB folder
  • For Subtask B: Check SubtaskB folder

Shared Task Starter Kit

A starter kit is available to help you create a baseline result. You can open the starter kit in a Colab Notebook and run the baseline system. The resultant experiment can be submitted to Codalab to ensure the submission format is clear.

To run the Colab Notebook, click the badge "Open in Colab".

  • Simple Co-Occurance Baseline for Semantic Relatedness: Open In Colab

Citing This Work

If you use our dataset or participate in the STR task, please cite the following papers:

  • STR dataset paper: coming soon
  • STR SemEval task description paper: coming soon

About

SemEval 2024 Task 12 : Textual Semantic Relatedness

https://semantic-textual-relatedness.github.io


Languages

Language:Jupyter Notebook 93.5%Language:Python 6.5%