Amélie Medem's repositories
word2vec-graph
Exploring word2vec embeddings as a graph of nearest neighbors
ameliemedem
My portfolio
anonymisation
NER on legal cases
big-list-of-naughty-strings
The Big List of Naughty Strings is a list of strings which have a high probability of causing issues when used as user-input data.
cognito-api
An authentication API based on AWS Cognito
Databricks-Certified-Data-Engineer-Associate-Questions
This repo contains "Databricks Certified Data Engineer Associate" Questions and related docs.
docker-airflow
Docker Airflow - Contains a docker compose file for Airflow 2.0
fastai-projects
Jupyter notebooks that use the Fastai library
nlp
Natural Language Processing Best Practices & Examples
NLP-progress
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
PaddleOCR
Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices)
pdfplumber
Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.
pythondataanalysis
Python data repo, jupyter notebook, python scripts and data.
small-file-sharing
A small file sharing API based app
spacy-lefff
Custom French POS and lemmatizer based on Lefff for spacy
spark-standalone-cluster
This repo contains a spark standalone cluster on docker for anyone who wants to play with PySpark by submitting their applications.
Spark_docker_1
Deploying Spark Using Docker
StudyBook
Study E-Book(ComputerVision DeepLearning MachineLearning Math NLP Python ReinforcementLearning)
understrap
Underscores + Bootstrap = Understrap, the renowned open-source WordPress starter theme.
word2vec-1
This tool provides an efficient implementation of the continuous bag-of-words and skip-gram architectures for computing vector representations of words. These representations can be subsequently used in many natural language processing applications and for further research.