Exercise 10.1 says that we need to set the character encoding explicitly because Project Gutenberg doesn't do it. In fact, Project Gutenberg does set the encoding, to UTF-8.
We could remove that part of the exercise to simplify it. (Or better: check whether the encoding is set correctly and fix it only if necessary, though that would make the exercise more complex.)
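
For reference, a minimal sketch of what the "check first, fix only if necessary" variant could look like, assuming the exercise is written in Python. `effective_encoding` is a hypothetical helper name, not something from the exercise, and "correctly set" is interpreted here as "present and a codec name Python recognizes":

```python
import codecs


def effective_encoding(declared, fallback="utf-8"):
    """Return the declared encoding if it is usable, otherwise the fallback.

    Hypothetical helper: `declared` is whatever encoding the server reported
    (possibly None or an empty string), `fallback` is what the exercise
    would otherwise set by hand.
    """
    if declared:
        try:
            # codecs.lookup raises LookupError for unknown codec names
            # and returns a CodecInfo whose .name is the normalized name.
            return codecs.lookup(declared).name
        except LookupError:
            pass
    return fallback
```

If the exercise downloads the text with `requests` (an assumption on my part), the fix would then be a single line: `response.encoding = effective_encoding(response.encoding)`.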