ju-resplande / transformers

Transformers for Assin 1 RTE and TweetSentBR

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Transformers

Portuguese BERT base, BERT multilingual base and RoBERTa large evaluation on ASSIN 1 rte and TweetSentBR using Transformers in addition to ASSIN 1 sts and ASSIN2 evaluation.

Original README

TweetSentBR formatted data is not available due to Twitter Policy.

Instructions

  1. Install requirements
pip install -r ./examples/requirements.txt
  1. Update Transformers package to support these tasks

    pip install --upgrade .
  2. Run task

    Replace {TASK_TYPE} for assin and tweesent

    Replace {TASK} for assin-ptbr-rte, assin-ptbr-rte. Leave in blank for tweetsent.

    a) For BERT multilingual

    bash run_{TASK_TYPE}.sh {TASK} PT bert-base-multilingual-cased

    b) For Portuguese BERT

    bash run_{TASK_TYPE}.sh {TASK} PT neuralmind/bert-base-portuguese-cased

    c) For RoBERTa

    bash run_{TASK_TYPE}.sh {TASK} EN

Results

run_{TASK_TYPE}.sh outputs predictions.json in output/{MODEL}/{TASK}.

  • Task evaluations and ASSIN XML in output/{MODEL}.

  • Task evaluation scripts in {TASK_TYPE}_eval.yaml

  • ASSIN xml formatting script in assin_xml.yaml

XML ASSIN Similarity labels were not modified.

About

Transformers for Assin 1 RTE and TweetSentBR

License:Apache License 2.0


Languages

Language:Python 99.3%Language:Jupyter Notebook 0.6%Language:Shell 0.1%