Config file bug cleanup by JosephMoore25 · Pull Request #370 · UoB-HPC/SimEng

JosephMoore25 · 2024-01-16T19:31:17Z

This PR aims to squash the bugs being investigated that appear through certain parameter values in the config file.

The current known issues are:

[FIXED] Vector-Length being greater than Load-Bandwidth or Store-Bandwidth causes progression to halt with no error. A simple check is needed in ModelConfig.cc to ensure this is not the case, and to error out if it is the case.

[FIXED] Fetch-Block-Size being greater than 128 (despite being a valid value) causes a large memory leak. This was due to the precision of bufferedBytes_ and bufferOffset being 8 bit, as was the bytesAvailable parameter of the function predecode. This led to an infinite loop of instruction fetches.

[TODO, not in this PR] Setting GeneralPurpose-Count or FloatingPoint/SVE-Count to a small value above architectural register values in OoO/MicroOp mode causes indefinite stalls due to not being able to rename all destination registers. This is to be handled in a different PR due to this not being a trivial fix.

[DONE] Investigate further parameter values to check for unexpected behaviour. Nothing is noticably wrong at this time.

dANW34V3R

Minor comments, but concepts look good

dANW34V3R · 2024-01-19T14:05:08Z

It could be worth adding tests for the maximum and minimum bound of each config option e.g. just run the default program for these options and make sure it runs to completion. This would be far from covering all potential issues as all options are dependent but would omit basic issues in the future. This would probably take a non-trivial amount of time so something that could be added to the CI pipeline. Thoughts @jj16791 @FinnWilkinson @ABenC377 @JosephMoore25 ?

JosephMoore25 · 2024-01-19T14:37:26Z

@dANW34V3R Ideally this would be tested. Problem is for ~23 parameters that would be 46 tests. In the case that one breaks, bounds would need to be set on the program. A timeout would be needed in the case of broken progression, and I guess memory leaks should eventually crash - these things may be slightly easier to monitor manually.

It may be sufficient to do this once before each release - I can provide an internal tool to run these tests automatically.

Our options are:

Add tests to the pipeline. Pipeline will become noticably longer, but each push we confirm the bounds of all individual parameters (but can not guarantee dependent parameter combinations still)
Use an internal tool to run this test semi-automated, to be run before releases
Manually test bounds when changing code that may affect these (especially precision of variables)

We should take a vote/discussion on this, but the outcome is unlikely to change this PR. @FinnWilkinson @jj16791 @dANW34V3R

I'm under the impression that 2 should be sufficient and 3 should always be done anyway. These limits are often not touched through normal use of SimEng, but instead when designing extreme CPUs, hence not running into an issue until now.

FinnWilkinson · 2024-01-19T17:20:52Z

It could be worth adding tests for the maximum and minimum bound of each config option e.g. just run the default program for these options and make sure it runs to completion. This would be far from covering all potential issues as all options are dependent but would omit basic issues in the future. This would probably take a non-trivial amount of time so something that could be added to the CI pipeline. Thoughts @jj16791 @FinnWilkinson @ABenC377 @JosephMoore25 ?

This would be nice to do yes, but I think we would need to use some other benchmark other than the default one to ensure all potential issues are discovered. For example, without running a code using SVE instructions, one of the bugs fixed by this PR wouldn't have been discovered

FinnWilkinson · 2024-01-19T17:23:38Z

@dANW34V3R Ideally this would be tested. Problem is for ~23 parameters that would be 46 tests. In the case that one breaks, bounds would need to be set on the program. A timeout would be needed in the case of broken progression, and I guess memory leaks should eventually crash - these things may be slightly easier to monitor manually.

It may be sufficient to do this once before each release - I can provide an internal tool to run these tests automatically.

Our options are:

Add tests to the pipeline. Pipeline will become noticably longer, but each push we confirm the bounds of all individual parameters (but can not guarantee dependent parameter combinations still)

Use an internal tool to run this test semi-automated, to be run before releases

Manually test bounds when changing code that may affect these (especially precision of variables)

We should take a vote/discussion on this, but the outcome is unlikely to change this PR. @FinnWilkinson @jj16791 @dANW34V3R

I'm under the impression that 2 should be sufficient and 3 should always be done anyway. These limits are often not touched through normal use of SimEng, but instead when designing extreme CPUs, hence not running into an issue until now.

We run a semi-automated script on the project before release to check that all exisiting benchmarks still functionally work, so doing this for something else wouldn't be too bad (in my opinion). However, upkeeping such a tool would require a bit more work given it is subject to change much more often than the list of supported benchmarks.

An alternative would be to implement unit / integration tests to ensure all the bounds are functioning correctly - this option would need to be a PR in itself though

dANW34V3R · 2024-01-19T21:09:41Z

We run a semi-automated script on the project before release to check that all exisiting benchmarks still functionally work, so doing this for something else wouldn't be too bad (in my opinion). However, upkeeping such a tool would require a bit more work given it is subject to change much more often than the list of supported benchmarks.

Check on release seems like a possible solution to get the tests run but not unnecessarily over checking. I can see a way where it could be automated by checking the values set by functions like setValueBounds and setValueSet which would mean no manual upkeep.

An alternative would be to implement unit / integration tests to ensure all the bounds are functioning correctly - this option would need to be a PR in itself though

This would be option 1 I guess as all GTests are run in the CI pipeline on every push to an open PR. Check on release seems like a better option if these tests take some time to run

Merged dev updates into this branch (attempt 2, oops)

…essages being wrong way around

Merging with latest dev

…tore bandwidths

Fixed config file issue regarding vector length

b072e2d

JosephMoore25 added the bug Something isn't working label Jan 16, 2024

JosephMoore25 requested review from FinnWilkinson, dANW34V3R and jj16791 January 16, 2024 19:31

JosephMoore25 self-assigned this Jan 16, 2024

FinnWilkinson linked an issue Jan 16, 2024 that may be closed by this pull request

Verify Config file parameters #369

Closed

FinnWilkinson added the 0.9.6 Part of SimEng Release 0.9.6 label Jan 16, 2024

FinnWilkinson marked this pull request as draft January 16, 2024 20:14

JosephMoore25 added 2 commits January 18, 2024 17:40

Fixed Fetch-Block-Size memory leak

3e628c9

Clang formatted ModelConfig.cc

092e87d

JosephMoore25 marked this pull request as ready for review January 19, 2024 12:19

dANW34V3R reviewed Jan 19, 2024

View reviewed changes

Comment thread src/lib/config/ModelConfig.cc Outdated

Comment thread src/lib/config/ModelConfig.cc Outdated

Added more informative comments. Separated load and store check

bf85b15

dANW34V3R previously approved these changes Jan 19, 2024

View reviewed changes

FinnWilkinson requested changes Jan 19, 2024

View reviewed changes

Comment thread src/lib/config/ModelConfig.cc Outdated

Comment thread src/lib/config/ModelConfig.cc Outdated

Added test for SVL and made descriptions more informative

f1115e9

JosephMoore25 dismissed dANW34V3R’s stale review via f1115e9 January 22, 2024 15:56

FinnWilkinson previously approved these changes Jan 24, 2024

View reviewed changes

JosephMoore25 mentioned this pull request Jan 24, 2024

RISC-V Compressed Instructions #368

Merged

Fixed bug where 2048 was missed as a valid value for Fetch-Block-Size

2494b39

JosephMoore25 dismissed FinnWilkinson’s stale review via 2494b39 January 24, 2024 17:21

JosephMoore25 force-pushed the config-bug-fixes branch from 3905e01 to 2494b39 Compare February 1, 2024 14:51

Merge branch 'dev' into config-bug-fixes

916d68b

Merged dev updates into this branch (attempt 2, oops)

jj16791 requested changes Feb 2, 2024

View reviewed changes

Comment thread src/lib/config/ModelConfig.cc Outdated

Comment thread src/lib/config/ModelConfig.cc Outdated

JosephMoore25 added 6 commits February 2, 2024 14:48

Merge branch 'dev' into config-bug-fixes

638691e

Added aarch64 if statement around bandwidth checks, and fixed error m…

c62c7f7

…essages being wrong way around

Clang tidy

620e533

Merge remote-tracking branch 'origin/dev' into config-bug-fixes

3982142

Merging with latest dev

Fixed gtests breaking due to using invalid load/store bandwidths

230a61b

Updated value bounds to reflect minimum load/store bandwidths

4f82931

JosephMoore25 requested a review from ABenC377 February 7, 2024 18:25

ABenC377 previously approved these changes Feb 8, 2024

View reviewed changes

jj16791 requested changes Feb 8, 2024

View reviewed changes

Comment thread src/lib/config/ModelConfig.cc Outdated

Increased min register count temporarily. Merged isa check for load/s…

655067c

…tore bandwidths

JosephMoore25 dismissed ABenC377’s stale review via 655067c February 8, 2024 14:25

Fixed mistake with forgetting to also update createExpectation

0066c08

jj16791 approved these changes Feb 8, 2024

View reviewed changes

FinnWilkinson approved these changes Feb 8, 2024

View reviewed changes

dANW34V3R approved these changes Feb 9, 2024

View reviewed changes

JosephMoore25 merged commit 8af924c into dev Feb 9, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Config file bug cleanup#370

Config file bug cleanup#370
JosephMoore25 merged 15 commits intodevfrom
config-bug-fixes

JosephMoore25 commented Jan 16, 2024 •

edited

Loading

Uh oh!

dANW34V3R left a comment

Uh oh!

Uh oh!

Uh oh!

dANW34V3R commented Jan 19, 2024

Uh oh!

JosephMoore25 commented Jan 19, 2024

Uh oh!

FinnWilkinson commented Jan 19, 2024

Uh oh!

FinnWilkinson commented Jan 19, 2024

Uh oh!

Uh oh!

Uh oh!

dANW34V3R commented Jan 19, 2024

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

Conversation

JosephMoore25 commented Jan 16, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

dANW34V3R left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

dANW34V3R commented Jan 19, 2024

Uh oh!

JosephMoore25 commented Jan 19, 2024

Uh oh!

FinnWilkinson commented Jan 19, 2024

Uh oh!

FinnWilkinson commented Jan 19, 2024

Uh oh!

Uh oh!

Uh oh!

dANW34V3R commented Jan 19, 2024

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

JosephMoore25 commented Jan 16, 2024 •

edited

Loading