Mayer Antoine's repositories
2019-intro-patient-matching
duplicategenerator
Duplicate generator is a Python library that generates duplicate synthetic personal data.
interactive-recordlinkage-tool
A tool for experimenting with data matching algorithms and automatically comparing their quality
matching-comparative-review
A comparative review of matching approaches for HIV surveillance in the absence of a unique identifier
breast-cancer-mlops
MLOps using Azure ML Services and Azure DevOps
injury-narrative-coding-transformers
A natural language processing (NLP) machine learning (ML) algorithm for coding unstructured work-related injury narratives
premier_analysis
A deep learning project predicting hyperinflammatory syndrome among COVID-19 patients using EHR data.
breast_cancer_classification
Udacity Breast Cancer classification
client-registry
Open Client Registry Service
clinical-adapter
GA Tech CS7643 Group Project implementing adapter-transformers for clinical information extraction and classification
dedupe
A Python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.
disaster-damage-assessment-ml
Building an image classifier for natural disaster damage assessment using social media and Google Images
elasticsearch-spark-recommender
Use Jupyter Notebooks to demonstrate how to build a Recommender with Apache Spark & Elasticsearch
elasticsearch-vector-scoring
Score documents with pure dot product / cosine similarity with ES
fsdl-text-recognizer-2021-labs
Complete deep learning project developed in Full Stack Deep Learning, Spring 2021
handson-ml2
A series of Jupyter notebooks that walk you through the fundamentals of Machine Learning and Deep Learning in Python using Scikit-Learn, Keras and TensorFlow 2.
harvest-cdc-journals
Python code to harvest issues of CDC online journals MMWR, EID, and PCD.
kaggler-template
Template for data science competitions. Includes makefiles and Python scripts for feature engineering, cross validation, ensemble, etc.
negex
Automatically exported from code.google.com/p/negex
python-is-cool
Cool Python features for machine learning that I used to be too afraid to use. Will be updated as I have more time / learn more.
recordlinkage
A toolkit for record linkage and deduplication written in Python
similarity-scoring
An Elasticsearch plugin for scoring documents based on string similarity
StatComp2019CDC
Short Course on Statistical Computing at CDC in 2019
udacity_optimizing_an_ml_pipeline
Optimizing an ML Pipeline in Azure