dadelani / MedDG

a large-scale high-quality medical dialogue dataset

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

MedDG

This is the code for the following paper:

MedDG: A Large-scale Medical Consultation Dataset for Building Medical Dialogue System

Wenge Liu, Jianheng Tang, Jinghui Qin, Lin Xu, Zhen Li, Xiaodan Liang; Arxiv

Requirement

pip install allenlp==0.9

Usage

For the task of topic prediction (e.g. run the LSTM baseline):

cd topic_predict
allennlp train ../config/topic_predict/lstm_baseline.json --include-package basline -s tmp/lstm1

For the task of response generation (e.g. run the seq2seq baseline):

cd generation
allennlp train ../config/generation/seq2seqattn.json --include-package seq2seq -s tmp/seq2seq1

Dataset

https://drive.google.com/drive/folders/109WnXlNhmqttxYwb4EEscYGcA_-eq8Eg?usp=sharing

About

a large-scale high-quality medical dialogue dataset

License:GNU General Public License v3.0


Languages

Language:Python 100.0%