José Angel Daza's repositories
abbreviation-detector
Code for evaluating techniques for abbreviation detection and expansion in context
jupyterlite-testing
Tests for running Jupyter notebooks in the browser
jupyterlite-xeus-testing
JupyterLite tests with the xeus kernel
language-detector
A simple n-gram-based algorithm for automatically detecting the language of an input text.
ML-toolbox
Basic steps for building ML pipelines using statistical methods for regression and classification
nlp-data-annotation
Code for preparing data for annotation in NLP tasks (using Label Studio). It also contains scripts to pre-annotate with models or rules, recover the annotations, and compute basic inter-annotator agreement.
SRL-S2S
Encoder-Decoder model for Semantic Role Labeling
timexy
A spaCy custom component that extracts and normalizes temporal expressions
Turku-neural-parser-pipeline
A neural parsing pipeline for segmentation, morphological tagging, dependency parsing and lemmatization with pre-trained models for more than 50 languages. Top ranker in the CoNLL-18 Shared Task.
xsrl_mbert_aligner
The X-SRL dataset, including the code for the SRL annotation projection tool and an out-of-the-box word alignment tool based on Multilingual BERT embeddings.