Unicode docstrings are not handled correctly on Python 2.x #1

mdboom · 2013-08-13T18:53:38Z

If a docstring is unicode on Python 2.x, numpydoc crashes with a UnicodeEncodeError. The cause is a call to str() on the SphinxDocString object: this invokes SphinxDocString.__str__, which returns unicode when the docstring itself is unicode. Python then tries to convert that unicode object to a bytes (str) object using the default encoding (ASCII unless it has been changed), and fails on any character that encoding cannot represent.

# -*- coding: utf-8 -*-

docstring = u"""
numpydoc הוא מדהים
"""

from numpydoc import docscrape_sphinx

d = docscrape_sphinx.SphinxDocString(docstring)

x = str(d)  # fails
x = unicode(d)  # works, whether docstring is unicode or bytes
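To make the failure mode concrete without needing numpydoc installed, here is a rough sketch of the implicit step Python 2 performs when `__str__` hands back unicode. The helper name `implicit_str` is made up for illustration; it is not part of numpydoc or the standard library, and it assumes the default encoding is ASCII.

```python
# -*- coding: utf-8 -*-

def implicit_str(text, encoding="ascii"):
    """Hypothetical stand-in for what Python 2's str() does to a unicode
    result: encode it to bytes with the default encoding (assumed ASCII)."""
    return text.encode(encoding)

docstring = u"numpydoc הוא מדהים"

try:
    implicit_str(docstring)
    outcome = "ok"
except UnicodeEncodeError:
    # The Hebrew characters have no ASCII representation.
    outcome = "UnicodeEncodeError"

print(outcome)  # prints "UnicodeEncodeError"
```

A docstring containing only ASCII characters passes through the same step without error, which is presumably why the problem only shows up for some projects.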

This PR restores the behavior of the numpydoc that ships with NumPy 1.7.1: unicode is kept as-is, and bytes are converted to unicode using the default encoding.
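As a minimal sketch of that rule (not the actual code in this PR), the conversion could look something like the following. `to_text` and `text_type` are hypothetical names introduced here for illustration only.

```python
# -*- coding: utf-8 -*-
import sys

# The unicode string type: `unicode` on Python 2, `str` on Python 3.
text_type = type(u"")

def to_text(value, encoding=None):
    """Hypothetical helper: return `value` as unicode text.

    Unicode input is returned unchanged; bytes input is decoded with
    the interpreter's default encoding unless one is given.
    """
    if isinstance(value, text_type):
        return value  # already unicode: keep as-is
    if encoding is None:
        encoding = sys.getdefaultencoding()
    return value.decode(encoding)

print(to_text(u"numpydoc הוא מדהים") == u"numpydoc הוא מדהים")
print(isinstance(to_text(b"plain bytes"), text_type))
```

With this shape, callers always get unicode back and can decide for themselves when and how to encode it, rather than relying on an implicit str() conversion.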

stefanv · 2013-08-14T05:43:26Z

+1

jseabold · 2013-10-21T14:00:48Z

+1

mdboom · 2013-10-21T14:10:20Z

Any chance we can have this merged? As matplotlib has docstrings with unicode, there have been a few reports over there about this.


Handle Unicode docstrings on Python 2.x correctly

54cccad

jseabold mentioned this pull request Oct 21, 2013

unable to build docs locally matplotlib/matplotlib#2529

Closed

certik added a commit that referenced this pull request Oct 21, 2013

Merge pull request #1 from mdboom/unicode-docstrings

9fc0e08

Unicode docstrings are not handled correctly on Python 2.x

certik merged commit 9fc0e08 into numpy:master Oct 21, 2013

jorisvandenbossche mentioned this pull request Jan 15, 2014

Rebase ipython_directive on top of recent ipython updated version pandas-dev/pandas#5925

Merged

pv pushed a commit that referenced this pull request Feb 14, 2015

Merge pull request #1 from tacaswell/yield_section

ddd7dc6

ENH : simplify handling of Yield section
