Implement representer #15

tehsphinx · 2022-01-08T13:19:56Z

Features

creates representation
creates mapping
docker file created

Representation

based on AST
removes comments / comment groups
removes the file level from AST
top level is package level
currently JSON format -- think about other format that might be easier to read

Mapping

JSON format
simple map of PLACEHOLDER to original name

tehsphinx · 2022-01-08T13:32:29Z

Next steps:

I think JSON AST output is too verbose, so I'm going to improve that. Probably something like in the example here.
reduce AST further to necessary nodes, combining some nodes into one with more information

junedev · 2022-01-08T16:47:40Z

@tehsphinx In the issue and on Slack there was a preference to have human readable "cleaned up code" as representation instead of some AST serialization. It sounds like you decided otherwise, can you share your reasoning?

Here is a link to the Slack discussion: https://exercism-team.slack.com/archives/CAQP7JL3T/p1635175129080800?thread_ts=1635171130.077500&cid=CAQP7JL3T

…re test cases; use native AST

tehsphinx · 2022-01-09T16:14:27Z

Hi @junedev. Didn't see your comment but have come to the same conclusion. I wasn't happy with how hard to read, complicated and also large the representations where. I remembered how easy it would be to turn AST back into Go code after e.g. replacing names and removing comments.

That's why I rewrote the whole thing again. It now parses the AST (drops comments right away), iterates it to replace the names and converts the AST back to Go code. That also removes any differences in whitespace or different formatting, which is nice.

I think these steps are next for me:

some code review / clean-up from the rewrite
add more test cases and improve the output where needed

tehsphinx · 2022-01-09T16:16:46Z

If you have a moment (or anyone else), can you look at the example outputs in the ./representer/tests folders? Let me know if you'd change anything on the output.

iHiD · 2022-01-10T12:37:33Z

(This is exciting!)

junedev · 2022-01-10T20:38:12Z

@iHiD Is it possible to get a larger set of example solutions for some exercises in Go so we can check how many distinct representations our representer would produce and potentially identify additional things that need to be normalized?

junedev · 2022-01-10T20:53:03Z

@tehsphinx I looked through the representations and they looked good to me at first glance but I don't have experience with representers.

One thing that came to mind was whether we need to normalize things like var a string = "" vs var a string vs a := "". Would those produce different representations currently? Should they?

andrerfcsantos · 2022-01-10T22:22:24Z

@tehsphinx Awesome work, thanks for taking on this task!

The representations seem to be exactly what we want, so it looks great. June talked about the different representations variables can have and whether the representation should be the same. Looking at the examples, I'm wondering the same thing and if things like myvar++ should have the same representation as myvar = myvar + 1 or even myvar+=1. For now, I think there's value in keeping these representations separate. Even though they achieve the same thing in the end, we might want to give different feedback. For instance, we might look at var a string = "" and say there is a shorter way to write this. Same with things like myvar = myvar + 1. The worst that can happen with keeping these representations separate would be us having to repeat feedback on different representations, but I think that might be an OK trade-off for the flexibility it gives us.

I also briefly reviewed the code and it looks good, I found it very easy to follow. Since I'm not well versed in AST black magic, I didn't review that part in detail yet, but everything made sense looking at a glance.

I understand this might not be totally finished yet, but I approved the changes to signal how much I like this PR. This is a solid first iteration.

ErikSchierboom

This looks great! Lots of really nice tests too.

ErikSchierboom · 2022-01-11T09:58:52Z

Looking at the examples, I'm wondering the same thing and if things like myvar++ should have the same representation as myvar = myvar + 1 or even myvar+=1. For now, I think there's value in keeping these representations separate.

I'd also argue that they should be separate representations. Now for some concepts in some exercises, you could argue that you don't want to comment on this in a representation. In that case, you could perhaps build a system where certain transformations are only applied to certain exercises.

junedev · 2022-01-11T10:26:56Z

@ErikSchierboom Thanks for the feedback!

tehsphinx · 2022-01-13T09:27:27Z

Thanks for the feedback, everyone! ❤️

From my side we can merge this and then open separate MRs for further iterations, or I can continue to work on this MR, whatever you prefer.

About different variable definition, incrementing a variable, etc:
I think it makes sense to comment on that on earlier exercises and later on not. However my suggestion would be to have the same representer logic for all exercises and rather handle that with the analyzer. In my experience the analyzer is much more exercise specific anyway, so I think it fits better.

If you agree, I'd go on checking on how to generalize these things:

variable declarations
variable increment

(In a new branch/MR if you decide to merge this one)

What do think?

junedev · 2022-01-13T20:26:37Z

@tehsphinx Feel free to merge.

I will follow up with the discussion in the issue.

first implementation; ignores comments and file nodes

d8e0332

tehsphinx requested review from a team and junedev January 8, 2022 13:19

tehsphinx added 2 commits January 8, 2022 21:54

simplify representation; add tests

61b8812

complete rewrite; switch to code built from AST as representation; mo…

e26cd48

…re test cases; use native AST

tehsphinx marked this pull request as ready for review January 9, 2022 21:43

tehsphinx requested a review from a team as a code owner January 9, 2022 21:43

andrerfcsantos approved these changes Jan 10, 2022

View reviewed changes

ErikSchierboom approved these changes Jan 11, 2022

View reviewed changes

junedev mentioned this pull request Jan 13, 2022

Write representer #10

Closed

tehsphinx merged commit 3456757 into main Jan 14, 2022

tehsphinx deleted the representer branch January 14, 2022 20:11

junedev added the x:size/large Large amount of work label Jan 14, 2022

This was referenced Jul 14, 2022

Document reputation label exercism/docs#347

Merged

Consider correct reputation amounts for different labels exercism/exercism#6440

Closed

Consider correct reputation amounts for different labels exercism/exercism#6441

Closed

Uh oh!

Implement representer #15

Implement representer #15

Uh oh!

Conversation

tehsphinx commented Jan 8, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

tehsphinx commented Jan 8, 2022

Uh oh!

junedev commented Jan 8, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

tehsphinx commented Jan 9, 2022

Uh oh!

tehsphinx commented Jan 9, 2022

Uh oh!

iHiD commented Jan 10, 2022

Uh oh!

junedev commented Jan 10, 2022

Uh oh!

junedev commented Jan 10, 2022

Uh oh!

andrerfcsantos commented Jan 10, 2022

Uh oh!

ErikSchierboom left a comment

Choose a reason for hiding this comment

Uh oh!

ErikSchierboom commented Jan 11, 2022

Uh oh!

junedev commented Jan 11, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

tehsphinx commented Jan 13, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

junedev commented Jan 13, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

tehsphinx commented Jan 8, 2022 •

edited

Loading

junedev commented Jan 8, 2022 •

edited

Loading

junedev commented Jan 11, 2022 •

edited

Loading

tehsphinx commented Jan 13, 2022 •

edited

Loading