Plan de Tecnologías del Lenguaje - Gobierno de España (PlanTL-GOB-ES)

Plan de Tecnologías del Lenguaje - Gobierno de España

PlanTL-GOB-ES

Geek Repo

https://huggingface.co/PlanTL-GOB-ES

Location:Barcelona

Home Page:https://plantl.mineco.gob.es/Paginas/index.aspx

Github PK Tool:Github PK Tool

Plan de Tecnologías del Lenguaje - Gobierno de España's repositories

lm-spanish

Official source for spanish Language Models and resources made @ BSC-TEMU within the "Plan de las Tecnologías del Lenguaje" (Plan-TL).

Language:PythonLicense:Apache-2.0Stargazers:244Issues:27Issues:5

lm-legal-es

Language Models for the legal domain in Spanish done @ BSC-TEMU within the "Plan de las Tecnologías del Lenguaje" (Plan-TL).

License:Apache-2.0Stargazers:25Issues:7Issues:0

lm-biomedical-clinical-es

Official source for Spanish pretrained biomedical and clinical language models and resources made @ BSC-TEMU within the "Plan de las Tecnologías del Lenguaje" (Plan-TL).

Language:PythonLicense:Apache-2.0Stargazers:23Issues:6Issues:4

Biomedical-Word-Embeddings-for-Spanish

Biomedical Word embeddings generated from Spanish Biomedical corpora.

NegEx-MES

[PlanTL/medicine/document annotation/negation] Negation detector for Spanish clinical texts based on Wendy Chapman's NegEx algorithm.

Language:JavaLicense:MITStargazers:6Issues:6Issues:1

SPACCC_MEDDOCAN

MEDDOCAN: Corpus, guidelines, IAA and scripts.

Language:PythonLicense:NOASSERTIONStargazers:6Issues:0Issues:0

corpus-cleaner

Generic toolkit for corpus cleaning

Language:PythonLicense:MITStargazers:5Issues:0Issues:0

AbreMES-DB

[Plan TL/medicine/lexical/terminological resource] A Spanish Medical Abbreviation DataBase.

License:NOASSERTIONStargazers:3Issues:4Issues:0

Medical-Translator

[PlanTL/medicine/neural machine translation/translation models] Files needed to use the Neural Machine Translation system for the Biomedical Domain.

Language:ShellLicense:NOASSERTIONStargazers:3Issues:5Issues:0

covid-predictive-model

A RNN Predictive Model for COVID-19 mortality prediction.

Language:PythonLicense:MITStargazers:2Issues:6Issues:0

EHR-normalizer

[PlanTL/medicine/document/NLP preprocessing] Software to convert PDF files into HTML, TXT or XML files and to normalize EHRs.

Language:PerlLicense:MITStargazers:2Issues:4Issues:0
Language:ShellLicense:NOASSERTIONStargazers:2Issues:2Issues:0
Language:PythonLicense:MITStargazers:2Issues:0Issues:0

AbreMES-X

[PlanTL/medicine/semantic annotation] Software used to generate the Spanish Medical Abbreviation DataBase (AbreMES-DB).

Language:JavaLicense:MITStargazers:1Issues:4Issues:0

BVS-Corpus

Biblioteca Virtual en Salud - Parallel Corpus

License:NOASSERTIONStargazers:1Issues:2Issues:0

controversy-detection-model

This repository contains the code of the paper "Anticipating the Debate: Predicting Controversy in News with Transformer-based NLP"

Language:PythonLicense:Apache-2.0Stargazers:1Issues:3Issues:0

EHR-TTS

[PlanTL/medicine/document annotation//time] HeidelTime grammar for temporal tagging of Spanish Electronic Health Records (EHR).

License:GPL-3.0Stargazers:1Issues:5Issues:0

MEDDOCAN-Format-Converter-Script

Script to convert files between MEDDOCAN-Brat, MEDDOCAN-XML, and i2b2 formats.

Language:PythonLicense:MITStargazers:1Issues:5Issues:0

shared-task-resource-example

Example README file for Shared Task submissions

SPACCC_POS-TAGGER

[PlanTL/medicine/document annotation/NLP preprocessing/part-of-speech] Part-of-Speech Tagger for medical domain corpus in Spanish based on FreeLing.

atc7-es-en

ATC7 (Sistema de Clasificación Anatómica 7) spanish-english translations

License:MITStargazers:0Issues:5Issues:0

MEDDOCAN-Evaluation-Script

Official evaluation script of the Medical Document Anonymization (MEDDOCAN) task.

Language:PythonLicense:MITStargazers:0Issues:5Issues:0
Stargazers:0Issues:5Issues:0
Stargazers:0Issues:5Issues:0
Language:PythonLicense:MITStargazers:0Issues:1Issues:0

SPACCC_Sentence-Splitter

[PlanTL/medicine/document annotation/NLP preprocessing/sentence splitter] Sentence splitting model created using the Apache OpenNLP machine learning toolkit

Language:JavaLicense:NOASSERTIONStargazers:0Issues:0Issues:0

SPACCC_Tokenizer

[PlanTL/medicine/document annotation/NLP preprocessing/tokenization] Tokenization model created using the Apache OpenNLP machine learning toolkit.

Language:JavaLicense:NOASSERTIONStargazers:0Issues:0Issues:0

spanish-benchmark

Spanish Benchmark website

Language:JavaScriptLicense:MITStargazers:0Issues:0Issues:0

TENTE

[PlanTL/medicine/terminological resource retrieval] A medical negated terms extraction tool.

Language:JavaLicense:MITStargazers:0Issues:5Issues:0

utils

Miscellaneous utilities and scripts.

Language:JavaStargazers:0Issues:5Issues:0