isbn-verifier: Crafted input to catch more incorrect algorithms by nathanielknight · Pull Request #1255 · exercism/problem-specifications

nathanielknight · 2018-06-25T19:37:48Z

This commit adds an input that will be incorrectly validated by
algorithms that don't explicitly check the length of their inputs, as
discussed in #1199 and #993.

For example, if an algorithm only checks the first (or last) ten
digits of the input, ignoring leftovers, it will pass the current test
suite even though it doesn't correctly implement the spec.

closes #1199

This commit adds an input that will be incorrectly validated by algorithms that don't explicitly check the length of their inputs, as discussed in exercism#1199 and exercism#993. For example, if an algorithm only checks the first (or last) ten digits of the input, ignoring leftovers, it will pass the current test suite even though it doesn't correctly implement the spec. closes exercism#1199

nathanielknight · 2018-06-25T19:38:25Z

Big thanks to @petertseng for helping me with this submission. 👍

petertseng

Thank you, now the two implementations at https://github.com/petertseng/exercism-problem-specifications/blob/verify/exercises/isbn-verifier/verify.rb#L46-L52 and https://github.com/petertseng/exercism-problem-specifications/blob/verify/exercises/isbn-verifier/verify.rb#L62-L69 are covered by this test case, and these were the last two mistaken implementations being tracked!

Question: At this point, the input named "too long isbn" is superseded by the new case added in this PR. The mistaken implementation at https://github.com/petertseng/exercism-problem-specifications/blob/verify/exercises/isbn-verifier/verify.rb#L54-L61 also incorrectly accepts the new case, in addition to the "too long isbn" case.

I would suggest that this means the old "too long isbn" case should be removed, since it does not independently serve any other purpose. There are no other mistaken implementations that incorrectly accept it.

Any disagreements with this suggestion?

(Of course, I Approve this PR to be merged as-is if there are any disagreements, but I do think the now-obsolete case can be removed)

petertseng · 2018-06-25T20:06:01Z

(to avoid any mistakes, the old case I'm referring to that I suggest to remove is 3-598-21507-XX)

nathanielknight · 2018-06-25T20:18:32Z

I think it's worth keeping both cases. Having both "too long" and "too long with valid ISBN" cases highlights that a solution can have incorrect features even if it incidentally passes the cases designed to test those features.

petertseng · 2018-06-25T20:31:39Z

highlights that a solution can have incorrect features even if it incidentally passes the cases designed to test those features.

This is an interesting point! Let's talk about it (to make sure I understand the reasoning).

So I think that means that a student who initially writes a solution that checks all 11 digits (corresponding to https://github.com/petertseng/exercism-problem-specifications/blob/verify/exercises/isbn-verifier/verify.rb#L46-L52 ) would correctly reject the 3-598-21507-XX case, and think "good, I am indeed rejecting ISBNs that are too long". Then when they get to 98245726788 the student sees "ah, in fact I am not rejecting all ISBNs that are too long", and this is a good learning moment.

Okay, I can see this being a good thing. I wish that this could be a part of the experience for everyone as well. Currently a student who writes a solution that checks the left 10 digits (corresponding to https://github.com/petertseng/exercism-problem-specifications/blob/verify/exercises/isbn-verifier/verify.rb#L54-L61) would already incorrectly accept the 3-598-21507-XX case, and therefore they don't get this extra moment of learning. Is it OK that the experience is different for students who write such a solution?

(The answer might be yes, I haven't thought about it. I am just trying to figure out why and explain my approach to deciding whether to keep a case, and seeing if I should change my approach)

nathanielknight · 2018-06-25T21:23:31Z

So I think that means that a student who initially writes a solution that checks all 11 digits (corresponding to https://github.com/petertseng/exercism-problem-specifications/blob/verify/exercises/isbn-verifier/verify.rb#L46-L52 ) would correctly reject the 3-598-21507-XX case, and think "good, I am indeed rejecting ISBNs that are too long". Then when they get to 98245726788 the student sees "ah, in fact I am not rejecting all ISBNs that are too long", and this is a good learning moment.

Yeah, this is exactly what I meant! 👍

I think it's okay for learners to have different experiences as they move through the test suite (indeed, some will have learned this lesson before and be wise to this sort of ploy), but it's worth making sure we're getting the best possible outcome for as many folks as possible.

The cases we're considering are solutions that:

validate length and check exactly 10 digits
don't validate length and:
- validate only the first 10 digits of input
- validate only the last 10 digits of input
- validate all digits of input with a generalized checksum

If the learner's solution validates length, I don't think we can catch them out with the ploy of a "treacherous test" that gives the right result for the wrong reason. If they're in the other category, we want a progression of tests where everybody gets a false positive that they'll catch eventually.

By keeping both exercises in, we get the "right outcome for the wrong reasons" in two out of three of the above cases (incorrect solutions that check 11 digits or the last 10 digits of input will reject 3-598-21507-XX for the wrong reasons). The goal would be to find an input that also gives the "right outcome for the wrong reason" for solutions that only check the first ten digits. I think that might just end up with a test case that's a random string of numbers that any solution would reject.

I have to sit down and game this out in more detail tonight... 😖

nathanielknight · 2018-06-26T04:49:36Z

Upon review, I think we can take out the "too long isbn" test(3-598-21507-XX). If all my checks are correct, the previous "too long isbn and no dashes" test (3598215078X) will spuriously pass for each of the three solutions types we discussed above. The new test case will catch the oversight later, and everybody gets the lesson.

I've updated the PR removing the old test case.

The "too long no dashes" test case will serve the same purpose, and will allow certain solutions that don't check length (e.g. that test the first or last ten digits of the input, or that generalise the checksum to more digits) to look like they're working until they reach the new "too long but contains valid isb" exercise.

ErikSchierboom

I agree with removing the old test case. This seems like a natural progression to me.

petertseng

Hey there, just checking in to confirm (as should be known) that I do agree with things as they currently are (having removed 3-598-21507-XX).

It would have been interesting to get a progression that presents all of the three listed implementations (left 10, right 10, all 11) with first a passing test (as 3-598-21507-XX did for some), but I think logically one can see that is not possible unless all of those implementations would correctly reject the test case, and if we're doing that then I suppose we'd better think of a fourth type of implementation that would incorrectly accept the earlier case.

I believe enough time has passed since the last commit was submitted for review, so I think merging is appropriate, unless someone has any last words in the next 12 hours or so. Since you the submitter can merge, I will not be merging and instead leaving it to you to do so.

rpottsoh · 2018-06-28T18:15:59Z

I don't really have the time right now to do a thorough review. I trust @petertseng and @ErikSchierboom judgment here. Peter and I have worked together on issues pertaining to this particular exercise before.

Please squash when merging.

Handles all three possible implementations: * Truncate left * Truncate right * Consider all 11 exercism/problem-specifications#993 (comment) exercism/problem-specifications#1199 exercism/problem-specifications#1255

petertseng approved these changes Jun 25, 2018

View reviewed changes

ErikSchierboom approved these changes Jun 27, 2018

View reviewed changes

petertseng approved these changes Jun 28, 2018

View reviewed changes

rpottsoh merged commit 3134243 into exercism:master Jul 1, 2018

rpottsoh mentioned this pull request Jul 1, 2018

isbn-verifier: update to 2.7.0 exercism/delphi#239

Merged

petertseng mentioned this pull request Aug 1, 2018

isbn-verifier 2.7.0.8: Replace too long case exercism/haskell#699

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

isbn-verifier: Crafted input to catch more incorrect algorithms#1255

isbn-verifier: Crafted input to catch more incorrect algorithms#1255
rpottsoh merged 2 commits intoexercism:masterfrom
nathanielknight:master

nathanielknight commented Jun 25, 2018

Uh oh!

nathanielknight commented Jun 25, 2018

Uh oh!

petertseng left a comment

Uh oh!

petertseng commented Jun 25, 2018

Uh oh!

nathanielknight commented Jun 25, 2018

Uh oh!

petertseng commented Jun 25, 2018

Uh oh!

nathanielknight commented Jun 25, 2018

Uh oh!

nathanielknight commented Jun 26, 2018

Uh oh!

ErikSchierboom left a comment

Uh oh!

petertseng left a comment

Uh oh!

rpottsoh commented Jun 28, 2018 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Uh oh!

Conversation

nathanielknight commented Jun 25, 2018

Uh oh!

nathanielknight commented Jun 25, 2018

Uh oh!

petertseng left a comment

Choose a reason for hiding this comment

Uh oh!

petertseng commented Jun 25, 2018

Uh oh!

nathanielknight commented Jun 25, 2018

Uh oh!

petertseng commented Jun 25, 2018

Uh oh!

nathanielknight commented Jun 25, 2018

Uh oh!

nathanielknight commented Jun 26, 2018

Uh oh!

ErikSchierboom left a comment

Choose a reason for hiding this comment

Uh oh!

petertseng left a comment

Choose a reason for hiding this comment

Uh oh!

rpottsoh commented Jun 28, 2018 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

rpottsoh commented Jun 28, 2018 •

edited

Loading