406345 / ECSpell

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

ECSpell

Code for paper "General and Domain Adaptive Chinese Spelling Check with Error Consistent Pretraining"

Data usage

Path: Data/domains_data

  • For zero-shot tasks, you should combine the *.train file and *.test file.
  • For common tasks, the *.train file is used to do training and do evaluating while *.test is adopted to do predicting.

Usage:

cd glyce
python setup.py develop
pip show glyce   # to ensure the successful installation of glyce lib

Model weights

Citation

@article{lv2023general,
  title={General and Domain-adaptive Chinese Spelling Check with Error-consistent Pretraining},
  author={Lv, Qi and Cao, Ziqiang and Geng, Lei and Ai, Chunhui and Yan, Xu and Fu, Guohong},
  journal={ACM Transactions on Asian and Low-Resource Language Information Processing},
  volume={22},
  number={5},
  pages={1--18},
  year={2023},
  publisher={ACM New York, NY}
}

About

License:MIT License


Languages

Language:Python 99.9%Language:Shell 0.1%