Hey, thanks for this great project, I'm a bit confused.
For painting style learning, is it necessary to add a description of each picture in the training set? For example, 1) use a unique word to represent the style of painting, such as used in dreambooth lora, and there is no image description file; 2) use image-text pair data, such as in train_txt_image_lora, a picture corresponds to a description file.
What is the difference between them? Which method is more conducive to painting style learning and reasoning use?