Add tests for invalid UTF-8 encoding in the text format by sunfishcode · Pull Request #474 · WebAssembly/spec

sunfishcode · 2017-05-15T06:44:16Z

This adds a ".fail.wast" test file for each of the 179 invalid UTF-8 byte sequences created for the other UTF-8 tests. This provides some test coverage for validation on the text parsing side (because it matters for some consumers), however it's an awkwardly large number of files, and I don't know if it's worth the hassle. In any case, here's the PR in case anyone's interested.

rossberg · 2017-05-15T12:01:57Z

I'm torn. The more tests the better, but it is a large number of files. OTOH, once #471 lands, we probably want similar tests checking the UTF-8 encoding of the input itself (i.e., invalid UTF-8 occurring directly, not escaped, in either strings or comments). Perhaps a solution is to introduce a subdirectory for everything Unicode related?

AndrewScheidecker · 2017-05-15T12:17:31Z

Couldn't the need for the .fail.wast files be eliminated by adding an (assert_unparseable "...") that embeds the unparseable modules in strings?

rossberg · 2017-05-15T12:36:53Z

@AndrewScheidecker, ha, interesting idea. Let me think about that.

rossberg · 2017-06-06T15:12:54Z

PR #475 landed, so these tests can now be written in just one file.

rossberg · 2017-09-21T09:36:13Z

@sunfishcode, any chance you can adapt this PR?

sunfishcode · 2017-10-12T18:24:10Z

I still do intend to do this, though it's not a high priority right now.

binji · 2017-10-12T19:06:25Z

Pushed a change to refactor these. Hope you don't mind! :-)

rossberg

Cool, thanks. LGTM (without looking at all the tests individually).

rossberg · 2017-10-13T06:11:14Z

When you merge, can you make sure to update the title/description?

sunfishcode · 2017-10-13T17:25:48Z

@binji That looks like all that needs to be done here then. Thanks!

remove duplicated token definition in parser

rossberg mentioned this pull request May 15, 2017

[interpreter] Support quoted module definitions in .wast #475

Merged

sunfishcode and others added 2 commits October 12, 2017 12:03

Add ".fail.wast" versions of the malformed UTF-8 tests.

e95fb26

Refactor into assert_malformed tests

92c1c91

binji force-pushed the utf8-invalid-text branch from 9998f29 to 92c1c91 Compare October 12, 2017 19:06

rossberg approved these changes Oct 13, 2017

View reviewed changes

binji changed the title ~~Add ".fail.wast" versions of the malformed UTF-8 tests.~~ Add tests for invalid UTF-8 encoding in the text format Oct 13, 2017

binji merged commit b3c6413 into master Oct 13, 2017

binji deleted the utf8-invalid-text branch October 13, 2017 17:54

dhil pushed a commit to dhil/webassembly-spec that referenced this pull request Nov 13, 2023

Merge pull request WebAssembly#474 from zapashcanon/duplicate_refi31

5f25162

remove duplicated token definition in parser

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add tests for invalid UTF-8 encoding in the text format#474

Add tests for invalid UTF-8 encoding in the text format#474
binji merged 2 commits intomasterfrom
utf8-invalid-text

sunfishcode commented May 15, 2017

Uh oh!

rossberg commented May 15, 2017

Uh oh!

AndrewScheidecker commented May 15, 2017

Uh oh!

rossberg commented May 15, 2017

Uh oh!

rossberg commented Jun 6, 2017

Uh oh!

rossberg commented Sep 21, 2017

Uh oh!

sunfishcode commented Oct 12, 2017

Uh oh!

binji commented Oct 12, 2017

Uh oh!

rossberg left a comment

Uh oh!

rossberg commented Oct 13, 2017

Uh oh!

sunfishcode commented Oct 13, 2017

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

sunfishcode commented May 15, 2017

Uh oh!

rossberg commented May 15, 2017

Uh oh!

AndrewScheidecker commented May 15, 2017

Uh oh!

rossberg commented May 15, 2017

Uh oh!

rossberg commented Jun 6, 2017

Uh oh!

rossberg commented Sep 21, 2017

Uh oh!

sunfishcode commented Oct 12, 2017

Uh oh!

binji commented Oct 12, 2017

Uh oh!

rossberg left a comment

Choose a reason for hiding this comment

Uh oh!

rossberg commented Oct 13, 2017

Uh oh!

sunfishcode commented Oct 13, 2017

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants