Skip to content

Missing Values in txs_features Dataset #9

@sharul-ayub

Description

@sharul-ayub

Hi,

While performing Exploratory Data Analysis (EDA) on the txs_features dataset (downloaded from this repository / Google Drive), I found that several columns contain missing values.

I would like to confirm:

  • Is this expected behaviour in the original dataset?
  • Or could this be caused by incorrect preprocessing on my side?

Details:

  • Dataset source: GitHub repository / Drive version
  • File name: txs_features.csv

Observation:

  • Columns with missing values:
  1. in_txs_degree
  2. out_txs_degree
  3. total_BTC
  4. fees
  5. size
  6. num_input_addresses
  7. num_output_addresses
  8. in_BTC_min
  9. in_BTC_max
  10. in_BTC_mean
  11. in_BTC_median
  12. in_BTC_total
  13. out_BTC_min
  14. out_BTC_max
  15. out_BTC_mean
  16. out_BTC_median
  17. out_BTC_total
  • Number of affected rows: 965

Additionally, I have attached an Excel file listing the transactions with missing values for reference.

rows_with_missing_in_txs_features.csv

Could you kindly confirm whether:

  1. These missing values are part of the dataset design?
  2. Or if there is a preprocessing step I may have missed?

Thank you.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions