Heya!
Made a github for this. The script worked fine for 7b and 13bs, but I am getting out of memory errors when trying to load in a 70b model.
I tried using CPU mode, and I tried loading it in with 2 or 3 a100 80gb gpus.
I'm not sure what to do next, hope you can fix it :)
I don't know code, only know how to run scripts, and this and your blockmerge was easy to run.
Heya!
Made a github for this. The script worked fine for 7b and 13bs, but I am getting out of memory errors when trying to load in a 70b model.
I tried using CPU mode, and I tried loading it in with 2 or 3 a100 80gb gpus.
I'm not sure what to do next, hope you can fix it :)
I don't know code, only know how to run scripts, and this and your blockmerge was easy to run.