sidney1994 / Medical-Dialogue-System

Medical-Dialogue-System

The MedDialog dataset contains Chinese-language conversations between doctors and patients. It has 1.1 million dialogues and 4 million utterances. The dataset is still growing, and more dialogues will be added. The raw dialogues come from haodf.com, which holds all copyrights to the data.

The data can be downloaded from https://drive.google.com/file/d/13-PqKtUZZyV7ElnCAV8Sz0HALnBJxzvt/view?usp=sharing
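
For scripted access, the share link above can be turned into a direct-download URL. The following is a minimal sketch, not part of the original repository: the helper names `drive_file_id` and `direct_download_url` are hypothetical, and the `uc?export=download` URL form is an assumption based on the commonly used Google Drive pattern. Large files may trigger a confirmation page, in which case a dedicated tool such as `gdown` may be needed.

```python
import re

# Share link taken from the README.
SHARE_URL = "https://drive.google.com/file/d/13-PqKtUZZyV7ElnCAV8Sz0HALnBJxzvt/view?usp=sharing"


def drive_file_id(share_url: str) -> str:
    """Extract the file ID from a Google Drive '/file/d/<id>/...' share link."""
    match = re.search(r"/file/d/([A-Za-z0-9_-]+)", share_url)
    if match is None:
        raise ValueError(f"not a Google Drive file link: {share_url}")
    return match.group(1)


def direct_download_url(share_url: str) -> str:
    """Build a direct-download URL (assumed pattern; may change on Google's side)."""
    return f"https://drive.google.com/uc?export=download&id={drive_file_id(share_url)}"


if __name__ == "__main__":
    # Print the URL so it can be passed to a downloader of your choice.
    print(direct_download_url(SHARE_URL))
```

The sketch only builds the URL and performs no network access, so the download step and any confirmation handling are left to the caller.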

If you find this dataset useful, please cite:

@article{chen2020meddiag,
  title={MedDialog: a large-scale medical dialogue dataset},
  author={Chen, Shu and Ju, Zeqian and Dong, Xiangyu and Fang, Hongchao and Wang, Sicheng and Yang, Yue and Zeng, Jiaqi and Zhang, Ruisi and Zhang, Ruoyu and Zhou, Meng and Zhu, Penghui and Xie, Pengtao},
  journal={https://github.com/UCSD-AI4H/Medical-Dialogue-System}, 
  year={2020}
}
