There are 14 repositories under the data-transformation topic.
Logical Replication extension for PostgreSQL 15, 14, 13, 12, 11, 10, 9.6, 9.5, 9.4 (Postgres), providing much faster replication than Slony, Bucardo or Londiste, as well as cross-version upgrades.
A block-based API for NSValueTransformer, with a growing collection of useful examples.
💄 Durable and asynchronous data imports for consuming data at scale and publishing testable SDKs.
Low-code Python library to safely use notebooks in production: schedule workflows, generate assets, trigger webhooks, send notifications, build pipelines, manage secrets (Cloud-only)
O'Reilly Book: [Data Algorithms with Spark] by Mahmoud Parsian
A simple Spark-powered ETL framework that just works 🍺
A curated list of Clojure resources for dealing with domain-specific languages.
Data transformation and utility functions for R
Big Data Modeling, MapReduce, Spark, PySpark @ Santa Clara University
🤖 An automated machine learning framework for audio, text, image, video, or .CSV files (50+ featurizers and 15+ model trainers). Python 3.6 required.
A visual data pipeline builder with various backends
A schema-aware Scala library for data transformation
Wrangler Transform: A DMD system for transforming Big Data
Reference Architectures for Datalakes on AWS
Data transformation toolkit
Examples for working with DataWeave scripts from Apex.
Object flow processing and data transformation
⚡️ Next-generation data transformation framework for TypeScript that puts developer experience first
Serialize PHP variables, including objects, in any format. Unserializing is supported too.
Functional utilities for Common Lisp
Power Query M functions for working with Tabular Data Packages (Frictionless Data) in Power BI and Excel
A PHP serialization component focused on performance