Giters
sdv-dev
/
RDT
A library of Reversible Data Transforms
Geek Repo:
Geek Repo
Github PK Tool:
Github PK Tool
Stargazers:
110
Watchers:
17
Issues:
364
Forks:
25
sdv-dev/RDT Issues
Fix pandas FutureWarning in UniformEncoder
Closed
a month ago
Switch to using ruff for Python linting and code formatting
Closed
a month ago
Only run unit and integration tests on oldest and latest python versions for macos
Closed
a month ago
In `RegexGenerator`, provide the ability to scramble the keys
Closed
2 months ago
Pandas FutureWarnings are disrupting tqdm progress bars
Closed
2 months ago
Cleanup automated PR workflows
Closed
2 months ago
AnonymizedFaker fails when using custom Faker provider
Updated
2 months ago
Providing locales to AnonymizedFaker with a function that uses the BaseProvider crashes
Closed
2 months ago
Add dependency checker
Closed
2 months ago
Move out sdtype validations from multi-column transformers
Closed
2 months ago
Support Python 3.12
Closed
2 months ago
Fix minimum version workflow when pointing to github branch
Closed
2 months ago
Add bandit workflow
Closed
2 months ago
Add build to dev requirements
Closed
3 months ago
Allow `AnonymizedFaker` to learn cardinality from the real data
Closed
3 months ago
Transition from using setup.py to pyproject.toml to specify project metadata
Closed
3 months ago
Remove bumpversion and use bump-my-version
Closed
3 months ago
Move the _learn_rounding_digits of the FloatFormatter into a helper
Closed
4 months ago
OneHotEncoder doesn't support dtype `'category'`
Closed
4 months ago
Add a _update_multi_column_transformer method
Closed
4 months ago
Categorical reverse transform may crash with `ValueError` for certain dtypes (int64)
Closed
5 months ago
RegexGenerator should create unlimited regexes, even if unique enforcement is on
Closed
5 months ago
RegexGenerator gives a confusing message: # of possibilities are shown as an imaginary number
Closed
5 months ago
AnonymizedFaker crashes with `ValueError` for specific provider/function pairs (eg. `currency`)
Closed
6 months ago
Add enforce_min_max_values to datetime transformers
Closed
7 months ago
Support multi-column transformers
Closed
7 months ago
Improve user warnings and logic for update_sdtype
Closed
7 months ago
Improve user warnings and logic for update_transformers and update_transformers_by_sdtype
Closed
7 months ago
Improve user warnings and logic for remove_transformers and remove_transformers_by_sdtype
Closed
7 months ago
Multi column transformers crash when assigned to single column
Closed
7 months ago
Error when columns contains only numbers. PR exists.
Updated
7 months ago
Disable CLA Bot
Updated
7 months ago
The `OrderedLabelEncoder` should not accept duplicate categories
Closed
8 months ago
Make the default missing value imputation `'mean'`
Closed
7 months ago
HyperTransformer transforms while fitting and messes up the random seed
Closed
8 months ago
RDT Uniform Encoder creates nan Value
Closed
8 months ago
Switch default branch from master to main
Closed
9 months ago
When no rounding scheme is detected, log the info instead of showing a warning
Closed
9 months ago
Remove performance tests
Closed
9 months ago
`ClusterBasedNormalizer` code cleanup
Closed
9 months ago
Resolve locales warning for specific sdtype/locale combos (eg. en_US with postcode)
Closed
9 months ago
`ClusterBasedNormalizer` should only select the minimum number of required components
Updated
9 months ago
Remove clipping during `ClusterBasedNormalizer` transform
Updated
9 months ago
Remove unnecessary `if data.ndim > 1` in transformers
Updated
9 months ago
Investigate component probabilities for the `ClusterBasedNormalizer`
Updated
9 months ago
Deprecate get_input_sdtype
Closed
10 months ago
Create IDGenerator transformer
Closed
10 months ago
Add UniformEncoder (and its ordered version)
Closed
10 months ago
Allow me to use `AnonymizedFaker` with sdtype `text` columns
Closed
10 months ago
[Enterprise Usage] Unable to assign generic PII transformers (eg. `AnonymizedFaker`)
Closed
10 months ago
Previous
Next