vkocaman / spark-nlp-models

Models and Pipelines for the Spark NLP library

Home Page:https://nlp.johnsnowlabs.com/

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Spark NLP Models

Build Status Maven Central PyPI version Anaconda-Cloud License

We use this repository to maintain our releases of pre-trained pipelines and models for the Spark NLP library. For more info please take a look at our releases.

Project's website

Take a look at our official Spark NLP page: http://nlp.johnsnowlabs.com/ for user documentation and examples

Slack community channel

Join Slack

Table of contents

Pretrained Models

Public Models

pretrained(name, lang) function to use

English - Models

Model Name en
LemmatizerModel (Lemmatizer) lemma_antbnc Download
PerceptronModel (POS) pos_anc Download
NerCrfModel (NER with GloVe) ner_crf Download
NerDLModel (NER with GloVe) ner_dl Download
NerDLModel (NER with GloVe) ner_dl_contrib Download
NerDLModel (NER with BERT) ner_dl_bert_base_cased Download
NerDLModel (OntoNotes with GloVe 100d) onto_100 Download
NerDLModel (OntoNotes with GloVe 300d) onto_300 Download
WordEmbeddings (GloVe) glove_100d Download
BertEmbeddings (base_uncased) bert_base_uncased Download
BertEmbeddings (base_cased) bert_base_cased Download
BertEmbeddings (large_uncased) bert_large_uncased Download
BertEmbeddings (large_cased) bert_large_cased Download
DeepSentenceDetector ner_dl_sentence Download
ContextSpellCheckerModel (Spell Checker) spellcheck_dl Download
SymmetricDeleteModel (Spell Checker) spellcheck_sd Download
NorvigSweetingModel (Spell Checker) spellcheck_norvig Download
ViveknSentimentModel (Sentiment) sentiment_vivekn Download
DependencyParser (Dependency) dependency_conllu Download
TypedDependencyParser (Dependency) dependency_typed_conllu Download

French - Models

Model Name fr
LemmatizerModel (Lemmatizer) lemma Download
PerceptronModel (POS UD) pos_ud_gsd Download
NerDLModel (glove_840B_300) wikiner_840B_300 Download
Feature Description
Lemma Trained by Lemmatizer annotator on lemmatization-lists by Michal Měchura
POS Trained by PerceptronApproach annotator on the Universal Dependencies
NER Trained by NerDLApproach annotator with Char CNNs - BiLSTM - CRF and GloVe Embeddings on the WikiNER corpus and supports the identification of PER, LOC, ORG and MISC entities

German - Models

Model Name de
LemmatizerModel (Lemmatizer) lemma de
PerceptronModel (POS UD) pos_ud_hdt de
NerDLModel (glove_840B_300) wikiner_840B_300 de
Feature Description
Lemma Trained by Lemmatizer annotator on lemmatization-lists by Michal Měchura
POS Trained by PerceptronApproach annotator on the Universal Dependencies
NER Trained by NerDLApproach annotator with Char CNNs - BiLSTM - CRF and GloVe Embeddings on the WikiNER corpus and supports the identification of PER, LOC, ORG and MISC entities

Italian - Models

Model Name it
LemmatizerModel (Lemmatizer) lemma_dxc Download
SentimentDetector (Sentiment) sentiment_dxc Download
PerceptronModel (POS UD) pos_ud_isdt Download
NerDLModel (glove_840B_300) wikiner_840B_300 Download
Feature Description
Lemma Trained by Lemmatizer annotator on DXC Technology dataset
POS Trained by PerceptronApproach annotator on the Universal Dependencies
NER Trained by NerDLApproach annotator with Char CNNs - BiLSTM - CRF and GloVe Embeddings on the WikiNER corpus and supports the identification of PER, LOC, ORG and MISC entities

Multi-language

Model Name xx
WordEmbeddings (GloVe) glove_840B_300 Download
WordEmbeddings (GloVe) glove_6B_300 Download
BertEmbeddings (multi_cased) bert_multi_cased Download

Licensed Enterprise

pretrained(name, lang) function to use

English - Licensed

It is required to specify 3rd argument to pretrained(name, lang, loc) function (location) to add the location of these

Model name language loc
NerDLModel ner_clinical en clinical/models
AssertionLogRegModel assertion_ml en clinical/models
AssertionDLModel assertion_dl en clinical/models
NerDLModel deidentify_dl en clinical/models
DeIdentificationModel deidentify_rb en clinical/models
WordEmbeddingsModel embeddings_clinical en clinical/models
BertEmbeddingsModel biobert_pubmed_cased en clinical/models
BertEmbeddingsModel biobert_pmc_cased en clinical/models
BertEmbeddingsModel biobert_pubmed_pmc_cased en clinical/models
BertEmbeddingsModel biobert_clinical_cased en clinical/models
BertEmbeddingsModel biobert_discharge_cased en clinical/models
PerceptronModel pos_clinical en clinical/models
EntityResolverModel resolve_icd10 en clinical/models
EntityResolverModel resolve_icd10cm_cl_em en clinical/models
EntityResolverModel resolve_icd10pcs_cl_em en clinical/models
ContextSpellCheckerModel context_spell_med en clinical/models

Pretrained Pipelines

English - Pipelines

NOTE: noncontrib pipelines are compatible with Windows operating systems.

Pipelines Name en
Explain Document ML explain_document_ml Download
Explain Document DL explain_document_dl Download
Explain Document DL Win explain_document_dl_noncontrib Download
Explain Document DL Fast explain_document_dl_fast Download
Explain Document DL Fast Win explain_document_dl_fast_noncontrib Download
Recognize Entities DL recognize_entities_dl Download
Recognize Entities DL Win recognize_entities_dl_noncontrib Download
OntoNotes Entities Small onto_recognize_entities_sm Download
OntoNotes Entities Large onto_recognize_entities_lg Download
Match Datetime match_datetime Download
Match Pattern match_pattern Download
Match Chunk match_chunks Download
Match Phrases match_phrases Download
Clean Stop clean_stop Download
Clean Pattern clean_pattern Download
Clean Slang clean_slang Download
Check Spelling check_spelling Download
Analyze Sentiment analyze_sentiment Download
Dependency Parse dependency_parse Download

French - Pipelines

Pipelines Name fr
Explain Document Large explain_document_lg Download
Explain Document Medium explain_document_md Download
Entity Recognizer Large entity_recognizer_lg Download
Entity Recognizer Medium entity_recognizer_md Download

Italian - Pipelines

Pipelines Name it
Explain Document Large explain_document_lg Download
Explain Document Medium explain_document_md Download
Entity Recognizer Large entity_recognizer_lg Download
Entity Recognizer Medium entity_recognizer_md Download

Contact

nlp@johnsnowlabs.com

John Snow Labs

http://johnsnowlabs.com

About

Models and Pipelines for the Spark NLP library

https://nlp.johnsnowlabs.com/

License:Apache License 2.0