Skip to content

No label column ('default payment next month') while synthesizing data using diffuser model #4

@pnimeesha

Description

@pnimeesha

Hi,

I ran the sample code from the google colab here . The samples generated from the diffusion model do not have labels. Considering the credit card data in this case (as used in colab code), the label column refers to 'default payment next month'. So how can I run the Machine Learning efficacy evaluation metrics (referred to as utility in the paper for which code not available in colab) when the models you mentioned for the evaluation are supervised models (Random Forest, Decision Trees, Logistic Regression, Ada Boost, and Naive Bayes.). I wrote code for Utility and tried to test it. I realised label column is missing for synthetic data.
Can you please let me know how this can be done without labels in the synthetic data?

Thanks in advance!

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions