Pavel Soriano's repositories
pseudo-de-pseudonymizer
Code to transform an already pseudonymized text into pseudo (false) non-anonymized data.
beamer_etalab_template
LaTeX template for (my) Etalab presentations. In Beamer (metropolis theme).
airflow-guides
Guides and docs to help you get up and running with Airflow and Astronomer
beta.gouv.fr
Le site public de l'Incubateur de Services Numériques de l'État
cada
A simplistic interface to search and consult CADA advices. This is the engine behind http://cada.data.gouv.fr.
csv_detective
CSV inspection
datagouv-search-indicator
Keep track of udata search results performances
datagouv_dataset_analyzer
Analyzing datagouv datasets
emnlp2017-bilstm-cnn-crf
BiLSTM-CNN-CRF architecture for sequence tagging
explainerdashboard
Quickly build Explainable AI dashboards that show the inner workings of so-called "blackbox" machine learning models.
fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
flair
A very simple framework for state-of-the-art Natural Language Processing (NLP)
github-action-demo
A demonstration repository to build and host a book with GitHub Actions
jupyter-book
Build interactive, publication-quality documents from Jupyter Notebooks
mlearnable-datasets-detective
Try and discover which datasets from data.gouv.fr are useful for a supervised learning task.
piaf-combo-huggingface
Code/info to train combo French SQuADs model
politique-de-contribution-open-source
Politique de contribution open-source interministérielle
pseudonymisation_decisions_ce
Temporary repo to split the pseudo livrable
PythonPerformanceTalk
Python Performance talk for the Parallel Data Science course in Univ Lyon
random_notebooks
Random notebooks although I do not particularly like notebooks
scikit-learn
scikit-learn: machine learning in Python
transformers
🤗Transformers: State-of-the-art Natural Language Processing for Pytorch and TensorFlow 2.0.