You have explored a very interesting direction!
I have a few questions regarding the model. Why did you choose to use the DiT architecture? Is your proposed method specifically designed for DiT models, or does the DiT architecture provide some fundamental support for your approach?
If we applied LoRA fine-tuning to the SDXL inpainting model, would it be possible to achieve similar functionality?
I look forward to your response.