performance: perf cnt bugfix and add more macros for message print #6353

btian1 · 2022-09-29T13:21:55Z

Current cycle is based on 38400000, and perf cnt module did not consider
the clock wrap case, i.e the later cycle maybe small than the previous one.
so add code to handle this situation.

Signed-off-by: Baofeng Tian baofeng.tian@intel.com

lyakh · 2022-09-30T08:23:50Z

xtos/include/sof/lib/perf_cnt.h

is this 64-bit arithmetics? If it's 32-bit then that will overflow: UINT32_MAX + plat_ts will be calculated first and it doesn't fit in 32 bits. Just change that to UINT32_MAX - (pcd)->plat_ts) + plat_ts. And I'm wondering why one would decide to use parentheses in

x = (a + b);

...

Thanks, changed based on your comments, please review again, if no other issues, please +1 for merge.

lyakh · 2022-09-30T08:24:20Z

xtos/include/sof/lib/perf_cnt.h

ditto. Please remove all external parentheses

lyakh

possibly for a separate PR: these are getting rather big for macros, wondering if they could be converted to inline functions

lyakh · 2022-10-10T07:23:36Z

xtos/include/sof/lib/perf_cnt.h

lines 122 and 123 aren't needed? Are .plat_ts and .cpu_ts updated elsewhere now?

line 122 and 123 will be updated with perf_cnt_init before each module beginning, then after each module, will calculate the diff and decide the data.

this already split from original #6322 , how about keep it, too much prs will make upstream more difficult? it is not complicate actually, regarding change to inline, due to still pass function into the macro, how about keep for now?
I even want to rewrite perf_cnt, but due to I am new, and don't know too much about history, so I change based on current, in future, if there is new request, I may rewrite this module.

sorry, I don't understand. perf_cnt_init() initialises .plat_ts and .cpu_ts but when calculating the current deltas, you have to compare to the previous call, not to the initialisation values? What am I missing?

in: perf_cnt_init()
get initial clock for plat_ts and cpu_ts.
then followed by function calling.
....
then get current clock with local plat_ts and cpu_ts
then get the delta and peak for this function calling.
record the delta and accumulated it with 1024 times to get average.

it will loop around per 1 seconds.

Ok, I see now, so it looks like it is redundant in the current implementation? Would be good to have that change - just removing those two lines as a separate commit, but well, at least I know now why they are removed, thanks

Hmm, sorry, I don't understand how this works. At every function we calculate "(pcd)->plat_delta_last = plat_ts - (pcd)->plat_ts;" , but if "(pcd)->plat_ts" is not update (as removed in this patch), the delta is not correctly calculated (it's delta to initial value, not delta to previous call). Did I miss where a new place where "(pcd)->plat_ts" is updated?

Thanks @btian1 , so the old code really was wrong.

xtos/include/sof/lib/perf_cnt.h

lyakh · 2022-10-11T09:03:31Z

xtos/include/sof/lib/perf_cnt.h

Ok, I see now, so it looks like it is redundant in the current implementation? Would be good to have that change - just removing those two lines as a separate commit, but well, at least I know now why they are removed, thanks

kv2019i

Thanks @btian1 . Splitting PRs into simple separate changes is a good way to get changes incrementally done. But in this case, I believe I'm missing something essential as I don't get how the delta calculation works after this PR. Please see my comment inline.

kv2019i · 2022-10-11T10:50:16Z

xtos/include/sof/lib/perf_cnt.h

Hmm, sorry, I don't understand how this works. At every function we calculate "(pcd)->plat_delta_last = plat_ts - (pcd)->plat_ts;" , but if "(pcd)->plat_ts" is not update (as removed in this patch), the delta is not correctly calculated (it's delta to initial value, not delta to previous call). Did I miss where a new place where "(pcd)->plat_ts" is updated?

btian1 · 2022-10-11T12:08:19Z

@kv2019i , (pcd)->plat_ts will be updated in perf_cnt_init before each module running.
You can refer to:
https://github.com/thesofproject/sof/pull/6344/files#diff-1986b2a8b4fbd60e9bdf7b2829d35eee6c9150e6794201f7fff09dc8d54dbcacR248

due to required to split to multiple patches, changes may cause some mis-understanding.
Sorry for that, I will submit PRs with more careful next time.

kv2019i · 2022-10-11T12:21:46Z

xtos/include/sof/lib/perf_cnt.h

Thanks @btian1 , so the old code really was wrong.

RanderWang

LGTM

kv2019i · 2022-10-13T10:36:41Z

And same CI error on this PR as well. FYI @wszypelt

Current hw cycle is based on 38400000, and perf cnt module did not consider the clock wrap case, i.e the later cycle maybe small than the previous one. so add code to handle this situation. Signed-off-by: Baofeng Tian <baofeng.tian@intel.com>

kv2019i · 2022-10-18T10:41:33Z

Known fail with Intel IPC4 test set, otherwise looks good, merging.

btian1 force-pushed the perfcnt branch from 9872339 to 8de3e40 Compare September 29, 2022 13:25

btian1 requested review from andrula-song and singalsu September 29, 2022 13:25

btian1 force-pushed the perfcnt branch 2 times, most recently from dcd2dfb to 8c4a1d5 Compare September 29, 2022 13:39

btian1 marked this pull request as ready for review September 29, 2022 13:52

btian1 requested review from dbaluta, lbetlej, lgirdwood, mmaka1 and plbossart as code owners September 29, 2022 13:52

andrula-song approved these changes Sep 30, 2022

View reviewed changes

lyakh requested changes Sep 30, 2022

View reviewed changes

btian1 force-pushed the perfcnt branch 3 times, most recently from f1cc90b to 2e058da Compare October 8, 2022 02:43

btian1 requested a review from lyakh October 10, 2022 00:57

lyakh reviewed Oct 10, 2022

View reviewed changes

singalsu reviewed Oct 10, 2022

View reviewed changes

xtos/include/sof/lib/perf_cnt.h Outdated Show resolved Hide resolved

lyakh approved these changes Oct 11, 2022

View reviewed changes

kv2019i requested changes Oct 11, 2022

View reviewed changes

kv2019i approved these changes Oct 11, 2022

View reviewed changes

xtos/include/sof/lib/perf_cnt.h Outdated

Copy link

Collaborator

kv2019i Oct 11, 2022

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks @btian1 , so the old code really was wrong.

btian1 force-pushed the perfcnt branch from 2e058da to 3aceca5 Compare October 12, 2022 13:42

btian1 mentioned this pull request Oct 12, 2022

profiling task: modify task profiling with performance counter macro #6407

Merged

btian1 force-pushed the perfcnt branch 2 times, most recently from dadb93f to 00b1e69 Compare October 12, 2022 14:02

RanderWang approved these changes Oct 13, 2022

View reviewed changes

btian1 force-pushed the perfcnt branch from 00b1e69 to d553ab8 Compare October 14, 2022 12:03

performance: perf cnt bugfix for profiling

d553ab8

Current hw cycle is based on 38400000, and perf cnt module did not consider the clock wrap case, i.e the later cycle maybe small than the previous one. so add code to handle this situation. Signed-off-by: Baofeng Tian <baofeng.tian@intel.com>

kv2019i merged commit 177b614 into thesofproject:main Oct 18, 2022

performance: perf cnt bugfix and add more macros for message print #6353

performance: perf cnt bugfix and add more macros for message print #6353

Uh oh!

Conversation

btian1 commented Sep 29, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

lyakh left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

kv2019i left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

btian1 commented Oct 11, 2022

Uh oh!

Choose a reason for hiding this comment

Uh oh!

RanderWang left a comment

Choose a reason for hiding this comment

Uh oh!

kv2019i commented Oct 13, 2022

Uh oh!

kv2019i commented Oct 18, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

btian1 commented Sep 29, 2022 •

edited

Loading