shelleyyyyu's starred repositories
github-typo-corpus
GitHub Typo Corpus: A Large-Scale Multilingual Dataset of Misspellings and Grammatical Errors
Gramformer
A framework for detecting, highlighting and correcting grammatical errors on natural language text. Created by Prithiviraj Damodaran. Open to pull requests and other forms of collaboration.
MiduCTC-competition
Baseline for the Intelligent Text Proofreading Competition (Chinese Text Correction)
reinforcement-learning-an-introduction
Python Implementation of Reinforcement Learning: An Introduction
Paper-Writing-Tips
A repository curated by the MLNLP community to help authors avoid minor mistakes in paper submissions. Paper Writing Tips
soft-masked-bert-for-spelling-error-correction
A third-party implementation of the paper "Spelling Error Correction with Soft-Masked BERT" using tensorflow==1.12.0
google-research
Google Research
EDA_NLP_for_Chinese
An implementation of the EDA paper for Chinese corpora. An EDA data augmentation tool for Chinese corpora. NLP data augmentation. Paper reading notes.
CTC-Report
CTC2021: SOTA solution and online demo for the Chinese Text Correction competition
Automatic-Corpus-Generation
This repository is for the paper "A Hybrid Approach to Automatic Corpus Generation for Chinese Spelling Check"
NLP-progress
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
DataAug4NLP
Collection of papers and resources for data augmentation for NLP.
weixin_public_corpus
WeChat official account corpus
NLPCC2018_GEC
Data for the NLPCC2018 shared task on Grammatical Error Correction (GEC).
pycorrector
pycorrector is a toolkit for text error correction. It applies models such as Kenlm, T5, MacBERT, ChatGLM3, and LLaMA to error-correction scenarios and works out of the box.
One-shot-Relational-Learning
Code for One-shot Relational Learning for Knowledge Graphs (EMNLP18)