added attn and mlp bias by JRosenkranz · Pull Request #83 · IBM/text-generation-inference

JRosenkranz · 2024-05-06T14:28:50Z

Motivation

[Describe why this change is needed]

The Calico models set the MLP and attention bias to true, but the Flash and Paged Llama implementations hard-code both to false. This change uses the config parameters from huggingface/transformers#30031 to set those values correctly.

Modifications

[Describe the code changes]

Added attention_bias and mlp_bias to the config for the Flash and Paged Llama implementations (both default to False).
Set the bias in the attention and MLP layers from the corresponding config value.
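A minimal sketch of the idea behind these two changes, not the code in this PR. Only the field names attention_bias and mlp_bias and their False defaults come from the description above. The class LlamaConfigSketch, the helpers config_from_dict and expected_param_names, and the projection names (taken from the unfused Hugging Face Llama layout) are assumptions for illustration; the Flash and Paged implementations may name or fuse their projections differently.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass
class LlamaConfigSketch:
    """Hypothetical stand-in for the model config (not the real class)."""
    attention_bias: bool = False  # default stated in the PR description
    mlp_bias: bool = False        # default stated in the PR description


def config_from_dict(raw: dict) -> LlamaConfigSketch:
    """Read the two flags from a config.json-like dict, falling back to False."""
    return LlamaConfigSketch(
        attention_bias=bool(raw.get("attention_bias", False)),
        mlp_bias=bool(raw.get("mlp_bias", False)),
    )


def expected_param_names(config: LlamaConfigSketch) -> list[str]:
    """List the per-layer parameters a checkpoint must provide.

    Projection names are an assumption (unfused Hugging Face Llama layout).
    """
    names = []
    for proj in ("q_proj", "k_proj", "v_proj", "o_proj"):
        names.append(f"self_attn.{proj}.weight")
        if config.attention_bias:  # previously this was effectively always False
            names.append(f"self_attn.{proj}.bias")
    for proj in ("gate_proj", "up_proj", "down_proj"):
        names.append(f"mlp.{proj}.weight")
        if config.mlp_bias:  # previously this was effectively always False
            names.append(f"mlp.{proj}.bias")
    return names


if __name__ == "__main__":
    # A config without the new keys behaves as before: weights only.
    print(expected_param_names(config_from_dict({})))
    # A config that enables both flags also expects the bias tensors.
    print(expected_param_names(config_from_dict(
        {"attention_bias": True, "mlp_bias": True})))
```

Because both flags default to False, a config that omits the keys describes the same parameters as before, while a config that enables them also requires the bias tensors to be present in the checkpoint.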

Result

[Describe how the changes affect existing behavior and how to test them]

Models whose checkpoints contain attention and MLP bias should now load properly.

Related Issues

NA

Signed-off-by: Joshua Rosenkranz <jmrosenk@us.ibm.com>

JRosenkranz added 3 commits May 6, 2024 10:26

added attn and mlp bias

7c91102

added attn and mlp bias

1b34dbc

Signed-off-by: Joshua Rosenkranz <jmrosenk@us.ibm.com>

Merge branch 'added_attn_mlp_bias' of https://github.com/JRosenkranz/…

cd537e6

…text-generation-inference-server into added_attn_mlp_bias

JRosenkranz closed this May 6, 2024

Xaenalt pushed a commit to Xaenalt/text-generation-inference that referenced this pull request Aug 1, 2024

Merge pull request IBM#83 from IBM/main

e76c9f4

[pull] main from IBM:main

JRosenkranz wants to merge 3 commits into IBM:main from JRosenkranz:added_attn_mlp_bias
