CogStack / AnonCATDemo

The Deidentify app uses MedCAT to automatically redact or replace sensitive information in documents. Customisable rules and batch processing make it easy to handle large datasets.

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Deidentify app

Demo for AnonCAT. It uses MedCAT, an advanced natural language processing tool, to identify and classify sensitive information, such as names, addresses, and medical terms.

Example

Features

  • Redact sensitive information: The Deidentify app can automatically redact sensitive information from a document, replacing it with a placeholder value, such as "[REDACTED]".
  • Replace sensitive information: Alternatively, the app can replace sensitive information with a different value, such as a random name or address, to maintain the structure and context of the original document.
  • Add customizable rules: The app allows users to create custom rules for identifying and classifying sensitive information based on their specific needs and use cases.
  • Batch processing: The app can process multiple documents at once, making it easy to redact or replace sensitive information in large datasets.

DeID Model

For out of the box models please contact: contact@cogstack.org

Models configured in .env ../models/ mounted into the container under /home/models/

MODEL_NAME = '<NAME OF MODEL HERE.zip>'

Build your own model

To build your own models please follow the tutorials outlined in MedCATtutorials

Note: This is currently under development

Starting the demo service

Start the Docker services by using docker-compose. This will build the necessary Docker images and start the services.

docker-compose up

About

The Deidentify app uses MedCAT to automatically redact or replace sensitive information in documents. Customisable rules and batch processing make it easy to handle large datasets.

License:Apache License 2.0


Languages

Language:Vue 36.2%Language:Python 30.6%Language:CSS 14.6%Language:TypeScript 12.5%Language:Dockerfile 2.9%Language:JavaScript 1.2%Language:HTML 1.2%Language:Shell 0.9%