There are 11 repositories under pii topic.
An open-source framework for detecting, redacting, masking, and anonymizing sensitive data (PII) across text, images, and structured data. Supports NLP, pattern matching, and customizable pipelines.
Document (PDF, Word, PPTX ...) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents. Remove PII. Convert any document or picture to structured JSON or Markdown
What's in your data? Extract schema, statistics and entities from datasets
Secure Vault for Customer PII/PHI/PCI/KYC Records
An AI-powered Personal Identifiable Information (PII) scanner.
A powerful scanner to scan your Filesystem, S3, MySQL, Redis, Google Cloud Storage and Firebase storage for PII and sensitive data.
Scan databases and data warehouses for PII data. Tag tables and columns in data catalogs like Amundsen and Datahub
This package features data-science related tasks for developing new recognizers for Presidio. It is used for the evaluation of the entire system, as well as for evaluating specific PII recognizers or PII detection models.
🚨 slog: Attribute formatting
Multi Cloud Data Tokenization Solution By Using Dataflow and Cloud DLP
KloudDB Shield is a comprehensive Postgres Security Tool - PII Scanner , CIS Benchmarks , SSL audit , 12+ features .. Supports Postgres, RDS ,Aurora, MySQL
A research python package for detecting, categorizing, and assessing the severity of personal identifiable information (PII)
Scala library and compiler plugin that prevent inadvertent leakage of sensitive fields in `case classes` (such as credentials, personal data, and other confidential information)
Open Privacy Vault - Secure, Performant, Open Source PII as a Service.
A Mongoose plugin that lets you transparently cipher stored PII and use securely-hashed passwords
Metadata and data identification tool and Python library. Identifies PII, common identifiers, language specific identifiers. Fully customizable and flexible rules
A package to build an end-to-end pipeline for detecting personally identifiable information from text.
Application and python script to identify, remove, and/or recode personally identifiable information (PII) from field experiment datasets.
Deidentify people's names and gender specific pronouns
Desktop App with Built-In LLM for Removing Personal Identifiable Information in Documents
Lightning-fast PII detection and anonymization library with 190x performance advantage - detect emails, SSNs, names, and more in <2MB package
Hides personal information from pages, similar to Discord's Streamer mode.
Python Data Loss Prevention (DLP) SDK - Nightfall Developer Platform
Personal Identifiable Information (PII) entity detection and performance enhancement with synthetic data generation
Library for identification, anonymization and de-anonymization of PII data
The Security Toolkit for managing Generative AI(especially LLMs) and Supervised Learning processes(Learning and Inference).
Registry of metadata identifier entities like UUID, GUID, person fullname, address and so on. Linked with other sources
Caido's passive workflow to find potential leaked secrets, PII, and sensitive fields.
.NET 8 API for document file format identification, text/metadata/attachment/embedded object/sensitive item (PII/PHI)/entity extraction.
A Keycloak provider that enables encryption of user attributes that contain PII data to be automatically encrypted upon storing to database and then decrypted upon loading from database
Redact PDF/image-based documents, or CSV/XLSX files using a Gradio-based GUI interface