There are 3 repositories under the data-masking topic.
World's most advanced database DevSecOps solution for Developer, Security, DBA and Platform Engineering teams. The GitHub/GitLab for database DevSecOps.
An open-source framework for detecting, redacting, masking, and anonymizing sensitive data (PII) across text, images, and structured data. Supports NLP, pattern matching, and customizable pipelines.
Database anonymization and synthetic data generation tool
BENERATOR is a leading software solution to generate, obfuscate, pseudonymize and migrate data for development, testing, and training purposes with a model-driven approach.
Example scripts that showcase how to use Private AI Text to de-identify, redact, hash, tokenize, mask and synthesize PII in text.
Never give AI companies your secrets! A local LLM-based privacy filter for LLM users. Seamless integration with your existing AI tools as a Python library / OpenAI SDK replacement / API Gateway / Web Server.
Already merged into apache/incubator-kyuubi: ACL Management for Apache Spark SQL with Apache Ranger.
Simple yet powerful tool for identifying and anonymizing personal information in various formats.
M.E.D. - A Rust-powered command-line data masking, encryption, and decryption tool
A lightweight JavaScript library for manual data masking
This framework allows users to configure data masking operations on Salesforce environments.
Supporting utilities to develop web application automation with Selenium.
A Vue 2 component for text annotation and manual data masking
This script generates various types of fake data, such as names, addresses, phone numbers, coordinates, and more, using the Faker library. Users can select the data type and the quantity to generate. The generated data is saved to a JSON file.
A pre-commit hook to check for PII in your code.
DataAnonymizer is an open-source personal data anonymization tool designed for GDPR compliance
Library in the JVM for secure data storage with trusted third-parties
Easy signer server for the Crumbl platform
A masker and wiper for RAM data
Mask production data using Faker so it can be used safely elsewhere
Executable for secure data storage with trusted third-parties
Easy hosting server for the Crumbl platform
Secure data storage with trusted third-parties for use in JavaScript environments
Redacting classified documents
Projects that demonstrate the features and capabilities of IRI Workbench and CoSort.
A simple data minimization and anonymization microservice wrapped around go-minimizer
Data minimization, pseudonymization, and anonymization helpers for Go
GenAI-SQL is a modular, extensible suite of AI-powered tools for automating SQL code improvement, documentation, and validation. Built for developers, analysts, and data engineers, it leverages Azure OpenAI (GPT-4o) to analyze, refactor, comment, explain, test, and audit SQL — all within a secure, asynchronous, and HIPAA-compliant framework.
Adds Dynamic data masking to EF Core
Data De-Identification Tools
anonymaCy is a spaCy extension for anonymizing PII using rule-based recognizers, context-aware processing, conflict resolution and customizable anonymization.
go-logx is a high performance, highly concurrent, memory-efficient, lightweight, and production-grade logging package built on top of Uber's Zap library. It provides structured JSON logging with automatic sensitive data masking, custom sensitive keys, zero-allocation patterns, and robust concurrency safety.