Petri Savolainen's starred repositories
unstructured
Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
pdfplumber
Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.
svg-spinners
A collection of 24 x 24 dp SVG spinners! (CSS & SMIL)
pdfminer.six
Community maintained fork of pdfminer - we fathom PDF
python-readability
fast python port of arc90's readability tool, updated to match latest readability.js!
searcharray
Full text search in your Pandas dataframe
EDFbrowser
A free, opensource, multiplatform, universal viewer and toolbox intended for, but not limited to, timeseries storage files like EEG, EMG, ECG, BioImpedance, etc.
pymupdf-fonts
Collection of optional fonts for PyMuPDF
polar_accesslink
Python client for Polar AccessLink API.
weighed-levenshtein-substring
Fork of https://github.com/infoscout/weighted-levenshtein