Hi,
First of all, thank you for the incredible work on your project!
I have a question regarding the process described in your paper where masks generated from 2D images are projected into 3D space. I'm curious about how this method handles complex scenarios, specifically if there is a pillar standing in the center of the room. Could you please explain how the system manages such cases, and how it handles the projection of such obstacles?
Thank you!