Output HTML and JSON from codegen diff tool by jacobhinkle · Pull Request #996 · NVIDIA/Fuser

jacobhinkle · 2023-09-29T19:47:38Z

I have been chasing down codegen changes in #840 and #947 and have needed to dig through a lot of spurious diffs. I decided to extend the codegen diff tool to output HTML, and to also modify the diffing a bit. This PR:

Changes tools/compare_codegen.sh to output env information as well as add ptxas_verbose dump option.
Changes the diffs performed by that tool to ignore both the kernel name and the preamble. The preamble is estimated by skipping the typedef of nvfuser_index_t. If preambles between two runs differ, we report that with a warning and show the diff in the output.
Adds an --html option to tools/diff_codegen_nvfuser_tests.py which will write a self-contained HTML file holding all the differing kernels and diffs. To use this option you must have previously run pip install jinja2.
Adds a --json option to tools/diff_codegen_nvfuser_tests.py which writes a JSON file containing all the information contained in the HTML file in an easier-to-parse format.
Changes the default to not printing the diffs to STDOUT. This can be re-enabled with the --show-diffs argument.

This lets us communicate code differences easily by sharing these files, which could be generated by our CI. An example output is attached.

Github doesn't support uploading html so I have uploaded a zipped example:
codediff_f7786819_feda1e1e_binary_tests.html.zip

Note that this file is probably typical for a medium sized change: it results in a zipped file size of 184KB and unzipped it is 2.1MB.

Some ideas left out of this PR that might be nice in the future:

Handle not just nvfuser_tests output but also nvfuser_bench and pytest output. We could also fall back to arbitrary command output where we just group everything to one big "test" if we can't associate each kernel with a specific test/benchmark.
Show multiple commands in one HTML file. Especially if the first bullet is addressed, then we could have a single summary for our whole suite.
Include benchmark results. This could be done in another hidden div with a "benchmarks" button. It might be tricky especially if the number of benchmark items associated to each kernel is changed between commits, but it might also be handy to refer to benchmark regressions and have the codegen output one click away.

Fixes #1007

xwang233 · 2023-09-29T20:05:43Z

Thanks for adding the HTML output. That looks great!

Does the python script tools/diff_codegen_nvfuser_tests.py compare kernels based on test name but not file name on all test suites? I recall that nvfuser_tests works now but not other test suites like python tests or nvfuser_bench.

Nvfuser_bench is different from all other test suites (nvfuser_tests, python tests) because it prints test name (benchmark name) after kernel dump file (PRINTING: __tmp_kernel ...) while all other test suites print test name before the kernel dump file.

jacobhinkle · 2023-09-29T20:52:33Z

Does the python script tools/diff_codegen_nvfuser_tests.py compare kernels based on test name but not file name on all test suites?

Yes it compares based on test name and it's specific to nvfuser_tests. We could update it pretty easily to also handle pytest since that prints the test name before the kernels

Nvfuser_bench is different from all other test suites (nvfuser_tests, python tests) because it prints test name (benchmark name) after kernel dump file (PRINTING: __tmp_kernel ...) while all other test suites print test name before the kernel dump file.

We could probably enable this too if we're clever, we just need to accept some regexes and plumb them around.

xwang233 · 2023-10-01T21:57:22Z

Can we add an option to only dump limited number (say 200) of kernel comparisons? We can add a prompt at the end of that page saying something like, e.g. "Only dumped 200 of 10086 total mismatches. To dump all the mismatches, please do xxx".

This is to make the generated file not being too huge in size.

jacobhinkle · 2023-10-02T01:43:21Z

Can we add an option to only dump limited number (say 200) of kernel comparisons? We can add a prompt at the end of that page saying something like, e.g. "Only dumped 200 of 10086 total mismatches. To dump all the mismatches, please do xxx".

This is to make the generated file not being too huge in size.

Good idea. There are a few other things I'd like to add before merging too so I'll add this to the list.

jacobhinkle · 2023-10-02T01:46:37Z

There are a few other things I'd like to add before merging

Btw one of those is an option to exclude the preamble which can decrease file size a bit. Still if a change impacts tons of tests we could easily wind up with hundreds of diffs which is probably not ideal for a ci artifact. At least they seem to compress well..

I will use this but not show it directly. Instead, I'll parse it and show the info on each kernel line, along with possible index type change and number of lines added/removed.

This reduces file size considerably. The original 11MB uncompressed file is now 2.0MB.

xwang233

The generated webpage looks really cool! Thanks for adding this.

It was trivial, and might be helpful for CI?

jacobhinkle · 2023-10-03T14:50:50Z

@xwang233 I am not sure what impact this will have on CI when merged. Note that we no longer print diffs to screen by default. We can do that if needed with --show-diffs, but I also included a --json <filename> argument that gives us a structured version instead. The exit code is not the number of diffs (which can easily be over 256) but is 1 if there are differences in either the preamble or kernels and zero if otherwise.

jacobhinkle · 2023-10-03T15:03:04Z

!build

jacobhinkle · 2023-10-03T15:45:39Z

!build

xwang233 · 2023-10-03T16:12:59Z

Should the python script be called after the bash script? The python script hasn't been integrated into the CI yet. I'll check those later today.

jacobhinkle · 2023-10-03T17:13:41Z

Should the python script be called after the bash script? The python script hasn't been integrated into the CI yet. I'll check those later today.

Yes, tools/compare_codegen.sh will create a directory called by default codegen_comparison/. Under that will be two directories, named after the 8-character hash of the git commits on main and the PR branch. Those two directories are the required args to python tools/diff_codegen_nvfuser_tests.py. Until we update CI to upload an HTML report or to parse the JSON, we should probably add the --show-diffs so that this tool will behave the same as before. In fact, I think no changes to CI will be needed right away if I just change it to show the diffs by default and replace that arg with a --hide-diffs arg. I'll give that a go. Note that the codediff command seems to have worked in the previous CI run though; it's nvfuser-ci that says it fails (reference B#8206-J#70698158).

The --show-diffs arg actually had no effect (oops). Fixed that also.

jacobhinkle · 2023-10-03T17:17:47Z

!build

xwang233 · 2023-10-03T17:28:13Z

Thanks for that. The failure in nvfuser-ci with reference numbers doesn't necessarily mean it's the codegen-diff job failure. It could be something else flaky in network. Don't worry about that.

jacobhinkle · 2023-10-03T17:45:32Z

Ah OK. If CI fails again I'll just merge without a !build then.

This will show FAILED -> FAILED as well. The only hidden case is now SUCCESS -> SUCCESS

jacobhinkle added 5 commits September 29, 2023 13:20

First stab at html output for codegen diffs

713d421

Add small test script REMOVE LATER

fc7746d

Still WIP

2994d77

First working version

e845a96

Change widths of buttons. Still ugly but easier to hit

31b9362

jacobhinkle requested a review from xwang233 September 29, 2023 19:47

jacobhinkle added 4 commits September 29, 2023 19:23

Merge remote-tracking branch 'origin/main' into html_codegen_diff

763d107

Add command to html output

7b50828

Show preamble, and diff if they do not match

ec9ca99

Add note to enable skipping preamble in output

f55f19a

jacobhinkle marked this pull request as ready for review September 30, 2023 00:30

jacobhinkle added 6 commits September 29, 2023 20:35

Formatting

6f4373a

Add "Toggle All" buttons

8841b45

Remove test script

9bc0118

Lint template using djlint

18927a7

Use uninitialized members in dataclasses

4a6bbd5

Add PTX

e535015

jacobhinkle added 7 commits October 2, 2023 07:44

Remove ptxas from code diffs.

7a1ddba

I will use this but not show it directly. Instead, I'll parse it and show the info on each kernel line, along with possible index type change and number of lines added/removed.

Don't print diffs to STDOUT by default

f30b119

Add --max-diffs option (default=200)

b098145

Highlight in client

67a2e5f

This reduces file size considerably. The original 11MB uncompressed file is now 2.0MB.

Add --html-omit-preamble option

4ed7817

Load cpp and diff for highlighting

2f69df2

Use asdict() instead of custom to_dict

c2de705

jacobhinkle mentioned this pull request Oct 2, 2023

Should the codegen diff tool only check the kernel itself? #1007

Closed

xwang233 approved these changes Oct 3, 2023

View reviewed changes

jacobhinkle added 11 commits October 3, 2023 07:36

Add arch, clean up info lines

6d6a75c

Add --json argument

ccc0c9b

It was trivial, and might be helpful for CI?

Remove stale pygments ref in template

409fb13

Record env in compare_codegen.sh

a92ed12

Add env, gpu, nvcc version, with --hide-env option

546c540

Add hrs, envs, reformat a bit

38cd3bd

Add footer

327edca

Add license headers

c097ace

Fix footer link

7685536

Fix a few formatting bugs

d648f6f

Clean env by removing $testdir

cd0c50f

jacobhinkle changed the title ~~Output html from codegen diff tool~~ Output HTML and JSON from codegen diff tool Oct 3, 2023

jacobhinkle and others added 2 commits October 3, 2023 11:44

Strip testdir= from env dump

9fb46de

Merge branch 'main' into html_codegen_diff

c7a345b

Change --show-diffs to --hide-diffs

30bfc4c

The --show-diffs arg actually had no effect (oops). Fixed that also.

jacobhinkle added 3 commits October 3, 2023 13:49

Match failing tests not just passed

6e77355

Include fail/pass if different, or if new test fails

4bc4dd6

Change condition for test fail/pass printing

932d065

This will show FAILED -> FAILED as well. The only hidden case is now SUCCESS -> SUCCESS

jacobhinkle merged commit f24e61f into main Oct 4, 2023

jacobhinkle deleted the html_codegen_diff branch October 4, 2023 08:29

jacobhinkle mentioned this pull request Oct 4, 2023

[initial build up] mbarrier: arrive wait barrier on smem #995

Merged

Conversation

jacobhinkle commented Sep 29, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

xwang233 commented Sep 29, 2023

Uh oh!

jacobhinkle commented Sep 29, 2023

Uh oh!

xwang233 commented Oct 1, 2023

Uh oh!

jacobhinkle commented Oct 2, 2023

Uh oh!

jacobhinkle commented Oct 2, 2023

Uh oh!

xwang233 left a comment

Choose a reason for hiding this comment

Uh oh!

jacobhinkle commented Oct 3, 2023

Uh oh!

jacobhinkle commented Oct 3, 2023

Uh oh!

jacobhinkle commented Oct 3, 2023

Uh oh!

xwang233 commented Oct 3, 2023

Uh oh!

jacobhinkle commented Oct 3, 2023

Uh oh!

jacobhinkle commented Oct 3, 2023

Uh oh!

xwang233 commented Oct 3, 2023

Uh oh!

jacobhinkle commented Oct 3, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

jacobhinkle commented Sep 29, 2023 •

edited

Loading