Nilo Pedrazzini's repositories
OldSlavNet
Bi-LSTM Parser for Early Slavic
ancientgreek-syntactic-embeddings
Ancient Greek Syntactic Word Embeddings
averageReducedFrequency
R script to calculate the Average Reduced Frequency (ARF) of all words in a corpus
oxford-text-mining
Materials for Introduction to Text Mining (MSc Digital Scholarship, University of Oxford)
parallelbibles
Word-alignment models for Bible translations in 100+ historical and contemporary languages
PreModernSlavic-NLP
Mixed drafts, scripts or data useful for NLP tasks on Pre-Modern Slavic
academic
Jekyll theme with a focus on simplicity, typography and flexibility
ADA-DHOxSS
Teaching materials for the Applied Data Analysis course at DHOxSS. Data science methods to analyse humanities data.
best-practices-for-coding-in-dh
Turing RSE-DH Summer School practical
DataPapersAnalysis
Scripts to scrape JOHD's and RDJHSS websites for metrics on data papers and corresponding datasets, and to carry out analyses on them.
DeezyMatch
A Flexible Deep Learning Approach to Fuzzy String Matching
histLM
Neural Language Models for Historical Research
KERMIT
🐸 KERMIT - A lightweight library to encode and interpret Universal Syntactic Embeddings
MapReader
A computer vision pipeline for exploring and analyzing images at scale
nilo-cultural-web
Blog for my students at 7AAVDM14 The Cultural Web: Building a Humanities Website (King's College London 2021-2022)
OCSharmonizeOES
Python script to harmonize Church Slavonic and Old East Slavic (Old Russian) orthographic variants
spacy-lookups-data
📂 Additional lookup tables and data resources for spaCy
spec
Test spec
subsamplr
A tool for representative subsampling
text
Data loaders and abstractions for text and NLP