Tracking and Saving TVAE Loss Values

Question

Tracking and Saving TVAE Loss Values

gjuresic opened this issue a year ago · comments

Environment details

CTGAN version: 0.7.3
Python version: 3.9.6
Operating System: macOS

Problem description

When fitting CTGAN, I can capture the loss values in the output variable. However, the same cannot be implemented when fitting the TVAE. How can I track and store the loss values to see if the model is able to learn the distribution of the data over the selected number of epochs?

Neha Patki · Answer 1 · Tue Aug 22 2023 01:32:12 GMT+0800 (China Standard Time)

Hi @gjuresic, currently it is not possible to track the loss values very easily although we have an outstanding feature request for it at #300. You can review the proposed functionality there and let us know if it will meet your needs.

In the meantime, there are still other options for seeing if the model learned distributions. One possible approach:

Sample synthetic data from the fitted synthesizer
Use the SDMetrics library to compare the real vs. synthetic data. If the synthesizer learned the distributions well, it should report high scores.
You can also run a Quality Report and create visualizations to manually inspect the data.

Hopefully that helps! Let me know if you have any follow ups, but otherwise, I'd defer to issue #300 for the implementation that we want to add.

Neha Patki · Answer 2 · Wed Mar 06 2024 10:25:47 GMT+0800 (China Standard Time)

This feature has now been added. For API, see #300. Thanks.