ag3713a's repositories
git-filter-repo
Quickly rewrite git repository history (filter-branch replacement)
bert-as-service
Mapping a variable-length sentence to a fixed-length vector using BERT model
awesome-machine-learning
A curated list of awesome Machine Learning frameworks, libraries and software.
pptx2md
a pptx to markdown converter
CASIE
CyberAttack Sensing and Information Extraction
entity-recognition-datasets
A collection of corpora for named entity recognition (NER) and entity recognition tasks. These annotated datasets cover a variety of languages, domains and entity types.
Electra_CRF_NER
We start a company-name recognition task with a small scale and low quality training data, then using skills to enhanced model training speed and predicting performance with least artificial participation. The methods we use involve lite pre-training models such as Albert-small or Electra-small with financial corpus, knowledge of distillation and multi-stage learning. The result is that we improve the recall rate of company names recognition task from 0.73 to 0.92 and get 4 times as fast as BERT-Bilstm-CRF model.
news-graph
Key information extraction from text and graph visualization
PPTX2HTML
Convert pptx file to HTML by using pure javascript