Bump transformers to 4.25.1 #151

Merged: justheuristic merged 27 commits into main from bump, Dec 13, 2022

Conversation

@justheuristic (Collaborator) commented Dec 12, 2022

Comment thread setup.cfg
huggingface-hub==0.11.1
transformers==4.25.1
protobuf>=3.20.3,<4.0dev
hivemind==1.1.3
@justheuristic (Collaborator, Author):

Also gonna bump it, but that's a separate PR.

@@ -0,0 +1,74 @@
"""
@justheuristic (Collaborator, Author):

This file is not new. It was renamed from model.py, but git does not detect the rename in the diff.

@borzunov (Member) left a comment:

We've found some bugs; merging is pending their resolution.

Comment thread src/petals/bloom/from_pretrained.py
Comment thread tests/test_full_model.py Outdated
Comment thread tests/test_aux_functions.py Outdated
Comment thread src/petals/server/handler.py Outdated
Comment thread src/petals/bloom/modeling_utils.py Outdated

for i in range(0, num_embeddings, self.chunk_size):
chunk = word_embeddings[i : i + self.chunk_size].float()
output[..., i : i + self.chunk_size] = F.linear(hidden_states, chunk)
Member:

Not sure if this is worth doing, but maybe you can do torch.matmul(hidden_states, chunk, out=output[..., i : i + self.chunk_size]) to avoid allocating memory for the intermediate result?

@justheuristic (Collaborator, Author):

Tried to do the same thing, but to no avail. On GPU, F.linear appears to have better support for optimizations such as TF32 (enabled by default). On CPU, this has no effect.
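For context, the chunked projection under discussion can be sketched as a minimal standalone function (names like `chunked_lm_head` and the shapes below are illustrative, not the PR's exact code):

```python
import torch
import torch.nn.functional as F

def chunked_lm_head(hidden_states: torch.Tensor,
                    word_embeddings: torch.Tensor,
                    chunk_size: int) -> torch.Tensor:
    """Project hidden_states onto the vocabulary in chunks to bound peak memory.

    hidden_states:   [..., hidden_dim]
    word_embeddings: [num_embeddings, hidden_dim]
    returns logits:  [..., num_embeddings]
    """
    num_embeddings = word_embeddings.shape[0]
    output = torch.empty(*hidden_states.shape[:-1], num_embeddings,
                         dtype=torch.float32)
    for i in range(0, num_embeddings, chunk_size):
        # Upcast each embedding chunk to float32 before the matmul, as in the PR.
        chunk = word_embeddings[i : i + chunk_size].float()
        # F.linear computes hidden_states @ chunk.T; per the thread, it can pick
        # up GPU fast paths (e.g. TF32) that the out= variant of matmul may miss.
        output[..., i : i + chunk_size] = F.linear(hidden_states, chunk)
    return output
```

Each chunk's result is written into a preallocated output, so only one `chunk_size`-wide intermediate lives at a time; the result matches a single full matmul.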

Comment thread src/petals/server/throughput.py Outdated
Comment thread src/petals/server/server.py Outdated
Comment thread src/petals/server/block_utils.py Outdated
Comment thread src/petals/server/backend.py Outdated
Comment on lines +72 to +73
key_past = key_cache.flatten(0, 1)[:, :, :prefix_length] # [batch * num_heads, head_dim, kv_length]
value_past = value_cache.flatten(0, 1)[:, :prefix_length, :] # [batch * num_heads, kv_length, head_dim]
Member:

Can't you just directly reshape the past tensors to these shapes like you've done in src/petals/server/handler.py?

@justheuristic (Collaborator, Author):

Nope, we cannot:

  • hypo_ids need shape [2, batch_size, ...]
  • training needs key [batch_size * heads, ..., length] and value [..., length, :], making them non-concatenable
  • the handler needs them to be concatenable into a single tensor
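The slicing in the snippet above can be sketched standalone (the cache layouts are assumed from the snippet's own shape comments; dimensions are illustrative):

```python
import torch

def slice_kv_cache(key_cache: torch.Tensor,
                   value_cache: torch.Tensor,
                   prefix_length: int):
    """Merge batch and head dims, then keep only the attended prefix.

    Assumed layouts (mirroring the snippet's comments):
      key_cache:   [batch, num_heads, head_dim, max_length]
      value_cache: [batch, num_heads, max_length, head_dim]
    """
    # flatten(0, 1) merges the batch and head dims; on contiguous tensors it
    # returns a view, so no copy is made before slicing the prefix.
    key_past = key_cache.flatten(0, 1)[:, :, :prefix_length]      # [batch*num_heads, head_dim, prefix_length]
    value_past = value_cache.flatten(0, 1)[:, :prefix_length, :]  # [batch*num_heads, prefix_length, head_dim]
    return key_past, value_past
```

Note the key cache keeps its length dimension last while the value cache keeps it second, which is why the two slices differ and why, per the reply above, a single shared reshape cannot serve all call sites.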

@justheuristic justheuristic merged commit b04982c into main Dec 13, 2022
@justheuristic justheuristic deleted the bump branch December 13, 2022 08:03
3 participants