I have a dataset with image-binary mask pairs. Can you write a tutorial that would allow me to learn on such data?