[vLLM-ATOM] Add profile trace parsing tool for vLLM-ATOM #644

Draft
kliuae-amd wants to merge 5 commits into main from kliuae/vllm_atom_profiler

@kliuae-amd
Contributor

Motivation

This PR adds a profile trace parsing tool that breaks down vLLM-ATOM traces for hot kernel analysis.

Technical Details

The tool provides per-kernel runtime breakdowns for mixed/prefill and decode batches.
It relies on vLLM's trace markers, enabled by VLLM_CUSTOM_SCOPES_FOR_PROFILING=1, to group kernels by batch.
Currently it uses input layernorms as boundaries to separate one layer from the next.
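
The batching and layer-splitting steps described above can be sketched roughly as follows. This is a minimal illustration and not the code in this PR: the `Kernel` record, the `kernels_in_range` helper, and the `"input_layernorm"` substring matcher are all hypothetical, and real kernel names depend on the model and backend.

```python
from dataclasses import dataclass


@dataclass
class Kernel:
    name: str
    ts: float   # start timestamp, in microseconds
    dur: float  # duration, in microseconds


def kernels_in_range(kernels, start, end):
    """Return the kernels whose start timestamp lies in [start, end), in time order."""
    return sorted((k for k in kernels if start <= k.ts < end), key=lambda k: k.ts)


def is_input_layernorm(kernel):
    # Hypothetical matcher (assumption): actual kernel names will differ.
    return "input_layernorm" in kernel.name


def split_into_layers(batch_kernels, is_boundary=is_input_layernorm):
    """Start a new layer at every boundary kernel.

    Kernels that run before the first boundary (e.g. embedding lookups)
    are not assigned to any layer.
    """
    layers = []
    current = None
    for k in batch_kernels:
        if is_boundary(k):
            current = [k]
            layers.append(current)
        elif current is not None:
            current.append(k)
    return layers
```

Given the time range of one batch marker, `split_into_layers(kernels_in_range(kernels, start, end))` would yield one kernel list per layer, and `--layer N` would then correspond to indexing into that list.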

Usage:

python parse_vllm_trace.py <trace.json.gz>
    [--layer N]             Target transformer layer index.
    [--percentile P]        Percentile P of batches to extract kernel stats from.
    [--eager-trace <path>]  Companion eager trace for extracting host module attributes.
                            For graph-captured batches, such as decode workloads in FULL_AND_PIECEWISE settings,
                            the GPU kernels in a graph are packed into a single graph replay event.
                            An eager trace can be provided alongside the graphed trace to associate kernel
                            launches with the host-side launch modules from the eager trace.
    [--output-prefix <str>] Prefix for output XLSX filename.
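
For readers unfamiliar with the input format, here is a small sketch of reading a `.json.gz` trace and choosing a batch by percentile. It is not taken from this PR. It assumes the trace is Chrome-trace JSON (either an object with a `traceEvents` list or a bare event list), and it assumes batches are ranked by duration with a nearest-rank rule; how the tool actually interprets `--percentile` is not stated above. The function names are hypothetical.

```python
import gzip
import json
import math


def load_trace_events(path):
    """Read a gzip-compressed Chrome-trace JSON file and return its event list."""
    with gzip.open(path, "rt", encoding="utf-8") as f:
        data = json.load(f)
    # Chrome trace JSON is either an object holding "traceEvents" or a bare list.
    return data["traceEvents"] if isinstance(data, dict) else data


def pick_percentile(batch_durations, percentile):
    """Return the index of the batch at the given duration percentile (nearest rank).

    Ranking by duration is an assumption made for this sketch.
    """
    if not batch_durations:
        raise ValueError("no batches to choose from")
    if not 0 <= percentile <= 100:
        raise ValueError("percentile must be between 0 and 100")
    order = sorted(range(len(batch_durations)), key=lambda i: batch_durations[i])
    rank = max(1, math.ceil(percentile / 100 * len(order)))
    return order[rank - 1]
```

Under these assumptions, `pick_percentile(durations, 50)` selects a median-like batch, so a single slow outlier batch does not dominate the reported kernel stats.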

Test Plan

Test Result

Submission Checklist

kliuae added 4 commits April 24, 2026 11:41
Signed-off-by: kliuae <kuanfu.liu@embeddedllm.com>
Signed-off-by: kliuae <kuanfu.liu@embeddedllm.com>
Signed-off-by: kliuae <kuanfu.liu@embeddedllm.com>
Signed-off-by: kliuae <kuanfu.liu@embeddedllm.com>
@kliuae-amd kliuae-amd marked this pull request as ready for review April 28, 2026 06:01
@kliuae-amd kliuae-amd marked this pull request as draft April 30, 2026 03:58
Signed-off-by: kliuae <kuanfu.liu@embeddedllm.com>
2 participants