Ella77 / Xpersona

XPersona: Evaluating Multilingual Personalized Chatbot

License: MIT

This is the source code of the paper:

XPersona: Evaluating Multilingual Personalized Chatbot. [PDF]

This code is written in PyTorch. If you use the source code or datasets included in this toolkit in your work, please cite the following papers:

XPersona

@article{lin2020xpersona,
  title={XPersona: Evaluating Multilingual Personalized Chatbot},
  author={Lin, Zhaojiang and Liu, Zihan and Winata, Genta Indra and Cahyawijaya, Samuel and Madotto, Andrea and Bang, Yejin and Ishii, Etsuko and Fung, Pascale},
  journal={arXiv preprint arXiv:2003.07568},
  year={2020}
}

English PersonaChat

@article{zhang2018personalizing,
  title={Personalizing Dialogue Agents: I have a dog, do you have pets too?},
  author={Zhang, Saizheng and Dinan, Emily and Urbanek, Jack and Szlam, Arthur and Kiela, Douwe and Weston, Jason},
  journal={arXiv preprint arXiv:1801.07243},
  year={2018}
}

Dataset

The XPersona dataset is an extension of the Persona-Chat dataset. Specifically, we extend ConvAI2 to six other languages: Chinese, French, Indonesian, Italian, Korean, and Japanese.

Baselines

In this work, we provide baselines trained in both multilingual and cross-lingual settings. See the multilingual and crosslingual folders for more details.

Acknowledgement

This repository is built on the Hugging Face codebase.

Languages

Python 98.6%, Shell 1.1%, Perl 0.2%