databricks-industry-solutions / ocr-phi-masking

Our joint Solution Accelerator with John Snow Labs automates the detection of sensitive information contained within unstructured data using NLP models for healthcare. Extracted data is stored within the Lakehouse, where teams can use the pre-trained models to easily remove, obfuscate or mask data for downstream analytics at massive scale.
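
To make the masking step concrete, here is a minimal sketch of replacing detected sensitive spans with entity labels. It is a toy stand-in only: the accelerator relies on pre-trained John Snow Labs NLP models, whereas this example uses plain regular expressions. The names `PHI_PATTERNS` and `mask_phi`, and the three patterns, are hypothetical and are not taken from the repository.

```python
import re

# Hypothetical patterns for illustration only; the accelerator itself uses
# pre-trained NLP models rather than regular expressions.
PHI_PATTERNS = {
    "SSN": re.compile(r"\b\d{3}-\d{2}-\d{4}\b"),
    "PHONE": re.compile(r"\b\d{3}-\d{3}-\d{4}\b"),
    "DATE": re.compile(r"\b\d{1,2}/\d{1,2}/\d{4}\b"),
}


def mask_phi(text: str) -> str:
    """Replace every span matched by a pattern with its bracketed label."""
    for label, pattern in PHI_PATTERNS.items():
        text = pattern.sub(f"<{label}>", text)
    return text


note = "Patient seen on 03/14/2021, SSN 123-45-6789, call 555-867-5309."
masked = mask_phi(note)
print(masked)
```

Masking with a label keeps the sentence structure readable for downstream analytics; obfuscation (substituting realistic fake values) or outright removal would follow the same detect-then-replace shape.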

Home Page: https://www.databricks.com/solutions/accelerators/automated-phi-removal
