I run a sha256 dedup pass on the dataset and noticed 52 conflicts.
It appears scene 401 and 404 are duplicated. Same hash, same content, exactly the same.
I also noticed that frame 7 in scene 401 and 404 has some objects in it, but the masks are completely empty.