Question about depth prediction training

Hello, thank you for your excellent work.

I am trying to fine-tune the checkpoint on the nuScenes dataset, following the depth loss implementation in the VGGT repository for depth and camera transformation prediction:

https://github.com/facebookresearch/vggt/blob/main/training/loss.py

However, I observed that the predicted 'loss_conf_depth' becomes very large for some scenes (while loss_reg_depth and loss_grad_depth seems normal), as shown in the figure. Did you encounter a similar issue when training the model? Do you have any suggestions for stabilizing the depth confidence during fine-tuning?

<img width="365" height="283" alt="Image" src="https://github.com/user-attachments/assets/2f4de725-9195-4a18-a9ae-8ac935f9407e" />

<img width="354" height="257" alt="Image" src="https://github.com/user-attachments/assets/72cd3dd2-1d62-41d7-a50a-bde4535e620b" />

<img width="365" height="259" alt="Image" src="https://github.com/user-attachments/assets/ebe32d44-33dc-46b6-bc5e-ce8339491949" />

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Question about depth prediction training #69

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Question about depth prediction training #69

Description

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions