Uniq: fix -s and implement -f flags by marcospb19 · Pull Request #131 · GrayJack/coreutils

marcospb19 · 2020-10-20T20:11:05Z

There are some decisions I need to ask you about before finishing this and requesting a full review:

It seems that other implementations of uniq ignore the trailing '\n' at the end of each line but do not ignore '\r'. Which behavior should I implement? (I know that this project isn't meant to be used on Windows, but \r\n files are still possible.)
In Rust we usually deal with UTF-8 encoded text. However, in other implementations the -s flag, commonly named --skip-chars, skips bytes rather than UTF-8 chars. See this example:

uniq -s 1
é verdade
ó verdade
a verdade

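Uniq will display all 3 lines received: 'é' and 'ó' take two bytes each in UTF-8, so skipping a single byte leaves a different remainder on every line.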

The problem with slicing strings at byte offsets before comparing them is that:

let text = "não";
let slice = &text[2..];

This panics at runtime, because byte index 2 falls in the middle of the multi-byte char 'ã'.

Should I just work around this by using .as_bytes() instead of string slices?
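A minimal sketch of the byte-based approach asked about here, assuming hypothetical helper names `skip_bytes` and `same_after_skip` (they are illustrative and not taken from this PR). It slices the `&[u8]` view of each line, which cannot panic on a char boundary the way `&text[2..]` does:

```rust
/// Returns what is left of `line` after skipping `n` bytes (not chars).
/// Hypothetical helper: yields an empty slice when the line has fewer than `n` bytes.
fn skip_bytes(line: &str, n: usize) -> &[u8] {
    let bytes = line.as_bytes();
    // Slicing bytes ignores UTF-8 char boundaries, and `get` returns None
    // instead of panicking when `n` is past the end of the line.
    bytes.get(n..).unwrap_or(&[])
}

/// Byte-wise comparison of two lines after skipping `n` bytes in each.
fn same_after_skip(a: &str, b: &str, n: usize) -> bool {
    skip_bytes(a, n) == skip_bytes(b, n)
}
```

With the example input, `same_after_skip("é verdade", "ó verdade", 1)` is false, so both lines are printed, while skipping 2 bytes makes them compare equal. For a non-panicking check on the `str` itself, `"não".get(2..)` returns `None` rather than panicking.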

About the previous -s errors: the output wasn't showing the full line that first matched the "pattern". I'll fix the tests soon.

GrayJack · 2020-10-20T23:52:48Z

Nowadays almost all editors support Windows-style line endings, so I think that if a file has a "\r" followed by a "\n", it should behave the same as a plain "\n".
Is there anything in the specification saying that -s should consider only ASCII? If there isn't, I think it should handle UTF-8 gracefully; otherwise, a new flag that handles UTF-8 would be a cool extension.
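The two suggestions above could look roughly like this. It is only a sketch under assumptions: `trim_line_ending` and `skip_chars` are hypothetical names, and "char" here means a Rust `char` (a Unicode scalar value), not a grapheme cluster:

```rust
/// Strips one trailing "\n" or "\r\n", so both line endings compare equal.
/// A lone "\r" that is not followed by "\n" is kept.
fn trim_line_ending(line: &str) -> &str {
    match line.strip_suffix('\n') {
        Some(rest) => rest.strip_suffix('\r').unwrap_or(rest),
        None => line,
    }
}

/// Skips `n` chars (Unicode scalar values) without ever splitting a
/// multi-byte char. Returns "" when the line has `n` chars or fewer.
fn skip_chars(line: &str, n: usize) -> &str {
    // `char_indices` yields the byte offset where each char starts,
    // so the slice below always begins on a char boundary.
    match line.char_indices().nth(n) {
        Some((idx, _)) => &line[idx..],
        None => "",
    }
}
```

With these, `skip_chars("é verdade", 1)` and `skip_chars("a verdade", 1)` both yield `" verdade"`, so the three example lines from the first comment would collapse into one.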

I haven't had the time to look at the code yet, but I'll check as soon as possible.

marcospb19 · 2020-10-30T01:52:08Z

@GrayJack, I had some time to work on fixing -s and implementing -f, similar to the GNU implementation of uniq.

This means that "\n" is treated differently from "\r\n", and -s|--skip-chars=N skips bytes (a char in the C language) rather than UTF-8 chars.
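For reference, a rough sketch of how a GNU-like comparison key can be derived, based on GNU's documented behavior (fields are runs of non-blank bytes separated by spaces or tabs, and fields are skipped before bytes). The function name `comparison_key` and its parameters are assumptions for illustration, not the code merged in this PR:

```rust
/// Space and tab are the only field separators considered here.
fn is_blank(b: u8) -> bool {
    b == b' ' || b == b'\t'
}

/// Returns the part of `line` that is compared: first skip `skip_fields`
/// fields (each one is optional leading blanks plus a run of non-blanks),
/// then skip `skip_bytes` bytes, clamping at the end of the line.
fn comparison_key(line: &[u8], skip_fields: usize, skip_bytes: usize) -> &[u8] {
    let mut i = 0;
    for _ in 0..skip_fields {
        if i >= line.len() {
            break; // nothing left to skip
        }
        while i < line.len() && is_blank(line[i]) {
            i += 1;
        }
        while i < line.len() && !is_blank(line[i]) {
            i += 1;
        }
    }
    // Saturating add avoids overflow for very large skip counts.
    let start = i.saturating_add(skip_bytes).min(line.len());
    &line[start..]
}
```

Under these assumptions, `comparison_key(b"1 foo", 1, 0)` and `comparison_key(b"2 foo", 1, 0)` both return `b" foo"`, so the two lines count as duplicates when one field is skipped.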

I'm sorry I don't have the time to change this :/ so I suggest you create an issue for each one if you want to.

GrayJack · 2020-10-30T05:39:40Z

bors r+

bors · 2020-10-30T05:46:15Z

Build succeeded:

marcospb19 force-pushed the implement-skip-fields-flag branch from 7b8fd5e to 92c472f Compare October 29, 2020 02:12

Uniq: fix -s and implement -f

c3f5205

marcospb19 force-pushed the implement-skip-fields-flag branch from 92c472f to c3f5205 Compare October 30, 2020 01:41

marcospb19 marked this pull request as ready for review October 30, 2020 01:46

GrayJack added the hacktoberfest-accepted Accepted PR for hacktoberfest label Oct 30, 2020

bors bot merged commit ceb606b into GrayJack:dev Oct 30, 2020

marcospb19 mentioned this pull request Oct 30, 2020

Uniq: Implement -f option #122

Closed

marcospb19 mentioned this pull request Nov 28, 2020

Uniq: Start implementation. #121

Merged

7 tasks

This was referenced Dec 29, 2020

Implement uniq #57

Open

Seq: Implement -f option #127

Open

bors[bot] merged 1 commit into GrayJack:dev from marcospb19:implement-skip-fields-flag
