show exactly what framework (vllm,trtllm,sglang) legend

See parent issue for context.

<img width="943" height="707" alt="Image" src="https://github.com/user-attachments/assets/9b0dda71-dc17-4d9f-99f4-eafd9521793e" />

<img width="1077" height="727" alt="Image" src="https://github.com/user-attachments/assets/95904905-1316-4539-862c-6fe999f0a783" />

As shown in the diagram above, a few caveats might make this trickier than originally thought, and some refactoring may be involved.

My initial thinking is this: for each selected combination of Model + ISL/OSL + Precision, you can compute the set of all possible combinations of GPU (the `hw` field, truncated as described [here](https://github.com/InferenceMAX/InferenceMAX/issues/229#:~:text=the%20possible%20hardwares%20is%20simply%20the%20set%20of%20all%20hw%20fields%20with%20any%20%2D*%20suffix%20truncated.)) + Framework (the `framework` field after the mapping is applied, as described [here](https://github.com/InferenceMAX/InferenceMAX/issues/229#:~:text=Note%3A%20the%20(annoying,TRT%2DLLM%0Aetc.)) + additional software settings (such as the `mtp` field)[^1]. Once you have this set for a particular selection, its members become the selectable configurations in the legend and in the "Select a GPU for comparison" dropdown. This means the "Select a GPU for comparison" dropdown can no longer be static: its options will depend on the selected Model + ISL/OSL + Precision.

[^1]: Another annoying detail: PR #251 changed the `mtp: on/off` field to `spec-decoding: [str]`. We will therefore have to support both the `mtp` field and the newer `spec-decoding` field.
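To make the idea concrete, here is a rough, language-agnostic sketch in Python of how the set of legend/dropdown configurations could be derived for one selection. Everything beyond the `hw`, `framework`, `mtp`, and `spec-decoding` field names is an assumption for illustration: the row keys `model`, `isl`, `osl`, and `precision`, the contents of `FRAMEWORK_LABELS`, the rule of cutting `hw` at the first hyphen, and the `"mtp"`/`"none"` values used to fold the old `mtp` field into the new one are all hypothetical. The real mapping and truncation rules are the ones described in #229.

```python
from typing import Iterable, List, Mapping, NamedTuple

# Hypothetical raw-name -> display-name mapping (the real one is described in #229).
FRAMEWORK_LABELS = {"vllm": "vLLM", "trt": "TRT-LLM", "sglang": "SGLang"}


class Config(NamedTuple):
    """One selectable legend/dropdown entry."""
    hw: str
    framework: str
    spec_decoding: str


def base_hw(hw: str) -> str:
    # Assumption: "any -* suffix truncated" means cutting at the first hyphen.
    return hw.split("-", 1)[0]


def spec_decoding_of(row: Mapping[str, object]) -> str:
    # Prefer the newer `spec-decoding` field; fall back to the older `mtp: on/off`.
    if "spec-decoding" in row:
        return str(row["spec-decoding"])
    # Assumption: `mtp: on` corresponds to "mtp" and anything else to "none".
    return "mtp" if row.get("mtp") == "on" else "none"


def available_configs(
    rows: Iterable[Mapping[str, object]],
    model: str,
    isl: int,
    osl: int,
    precision: str,
) -> List[Config]:
    """Return the distinct GPU + framework + spec-decoding combinations
    that have data for the selected Model + ISL/OSL + Precision."""
    configs = {
        Config(
            hw=base_hw(str(row["hw"])),
            framework=FRAMEWORK_LABELS.get(str(row["framework"]), str(row["framework"])),
            spec_decoding=spec_decoding_of(row),
        )
        for row in rows
        if row["model"] == model
        and row["isl"] == isl
        and row["osl"] == osl
        and row["precision"] == precision
    }
    # Sort so the legend and dropdown order is stable between renders.
    return sorted(configs)
```

The returned list would be recomputed whenever the Model, ISL/OSL, or Precision selection changes, and the same list would feed both the legend and the "Select a GPU for comparison" dropdown so the two never disagree.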
