Skip to content

Conversation

@larrylawl
Copy link

@larrylawl larrylawl commented Dec 29, 2022

Motivation. I want OpusRead.printPairs to be a generator for downstream task. Specifically, I intend to share Opus as a huggingface dataset (see: DatasetBuilder._generate_examples in link).

Change. Added yield_tuple write mode which allows OpusRead.printPairs to be a generator.

P.s. thanks for this amazing package!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant