Description of the problem
I'm unable to load multilingual models like BAAI/AltDiffusion with diffusers.
The solution I'd like
I would like diffusers to have out-of-the-box support for models like BAAI/AltDiffusion, that use AltCLIP or a different multilingual CLIP model.
Alternatives I've considered
I tried loading BAAI/AltDiffusion, but it told me that I was missing a library.
After loading that library, it produced a different error.
I kinda gave up after that.
Additional context
Since AltCLIP has a max_position_embeddings value of 514 for its text encoder instead of 77, I had hoped I could just replace the text encoder and tokenizer of my models with those of BAAI/AltDiffusion to overcome the 77 token limit.