more question about dataset

When n = 10，I find more than 10% were removed out of a total of 817,741 adoptions and more than 10K item is remained， which is inconsistent with the data in the paper.Following is how to process the dataset .
     sql = 'SELECT * FROM shopping_processing where product_ID in' \
          '(SELECT product_ID FROM shopping_processing group by product_ID having count(userID)>5)'
     pandasql.sqldf(sql , globals())
Can you tell me where is wrong？Thanks.


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

more question about dataset #3

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

more question about dataset #3

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions