Chung-I / mndmv

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

multi-lingual grammar induction

This is PyTorch implementation of the paper:

Multilingual Grammar Induction with Continuous Language Identification
Wenjuan Han, Ge Wang, Yong Jiang, Kewei Tu
EMNLP 2019

The code performs unsupervised grammar learning in the dataset of 15 languages selected from UD treebanks v1.4.

Please concact hanwj@shanghaitech.edu.cn or wangge@shanghaitech.edu.cn if you have any questions.

Environments

  • Python 2.7
  • PyTorch >=1.0
  • Numpy 1.15.4

Data

run:

python ml_dmv_parser.py --cvalency 2 --do_eval --neural_epoch 1 --function_mask --lr 0.001 --epoch 70 --use_neural --embed_languages --em_type em --child_only --sentence_predict

The grammar induction is by default set to be performed on 15 selected languages, you can customize your own language selection by changing the contents in the language_list file

Reference

wait...

About


Languages

Language:Python 100.0%