Plan de Tecnologías del Lenguaje - Gobierno de España

Plan de Tecnologías del Lenguaje - Gobierno de España's repositories

lm-spanish

Official source for spanish Language Models and resources made @ BSC-TEMU within the "Plan de las Tecnologías del Lenguaje" (Plan-TL).

Language:PythonApache-2.0259 27 5

lm-legal-es

Language Models for the legal domain in Spanish done @ BSC-TEMU within the "Plan de las Tecnologías del Lenguaje" (Plan-TL).

Apache-2.029 7 1

lm-biomedical-clinical-es

Official source for Spanish pretrained biomedical and clinical language models and resources made @ BSC-TEMU within the "Plan de las Tecnologías del Lenguaje" (Plan-TL).

Language:PythonApache-2.026 5 4

Biomedical-Word-Embeddings-for-Spanish

Biomedical Word embeddings generated from Spanish Biomedical corpora.

NOASSERTION10 7 2

SPACCC_MEDDOCAN

MEDDOCAN: Corpus, guidelines, IAA and scripts.

Language:PythonNOASSERTION7 40

NegEx-MES

[PlanTL/medicine/document annotation/negation] Negation detector for Spanish clinical texts based on Wendy Chapman's NegEx algorithm.

Language:JavaMIT6 5 1

corpus-cleaner

Generic toolkit for corpus cleaning

Language:PythonMIT5 2 34

AbreMES-DB

[Plan TL/medicine/lexical/terminological resource] A Spanish Medical Abbreviation DataBase.

NOASSERTION3 30

EHR-normalizer

[PlanTL/medicine/document/NLP preprocessing] Software to convert PDF files into HTML, TXT or XML files and to normalize EHRs.

Language:PerlMIT3 30

Medical-Translator

[PlanTL/medicine/neural machine translation/translation models] Files needed to use the Neural Machine Translation system for the Biomedical Domain.

Language:ShellNOASSERTION3 40

controversy-detection-model

This repository contains the code of the paper "Anticipating the Debate: Predicting Controversy in News with Transformer-based NLP"

Language:PythonApache-2.02 20

covid-predictive-model

A RNN Predictive Model for COVID-19 mortality prediction.

Language:PythonMIT2 40

EHR-TTS

[PlanTL/medicine/document annotation//time] HeidelTime grammar for temporal tagging of Spanish Electronic Health Records (EHR).

GPL-3.02 40

Medical-Translator-WMT19

Language:ShellNOASSERTION2 10

PharmaCoNER-Evaluation-Script

Language:PythonMIT2 40

SPACCC_POS-TAGGER

[PlanTL/medicine/document annotation/NLP preprocessing/part-of-speech] Part-of-Speech Tagger for medical domain corpus in Spanish based on FreeLing.

Language:Python2 3 1

AbreMES-X

[PlanTL/medicine/semantic annotation] Software used to generate the Spanish Medical Abbreviation DataBase (AbreMES-DB).

Language:JavaMIT1 30

BVS-Corpus

Biblioteca Virtual en Salud - Parallel Corpus

NOASSERTION1 10

MEDDOCAN-Format-Converter-Script

Script to convert files between MEDDOCAN-Brat, MEDDOCAN-XML, and i2b2 formats.

Language:PythonMIT1 40

shared-task-resource-example

Example README file for Shared Task submissions

1 40

SPACCC_Sentence-Splitter

[PlanTL/medicine/document annotation/NLP preprocessing/sentence splitter] Sentence splitting model created using the Apache OpenNLP machine learning toolkit

Language:JavaNOASSERTION1 50