Skip to content

Kerberos Problem with PyArrow #16377

@asfimport

Description

@asfimport

I'm having trouble using pyarrow with kerberos. I'm trying to connect to HDFS with the following signature:

hdfs = HdfsClient(host='ip-172-31-53-87.ec2.internal', port=8020, kerb_ticket='/tmp/krb5cc_1000', driver='libhdfs3', user='centos')
ArrowException                            Traceback (most recent call last)
<ipython-input-2-15087f93c239> in <module>()
----> 1 hdfs = HdfsClient(host='ip-172-31-53-87.ec2.internal', port=8020, kerb_ticket='/tmp/krb5cc_1000', driver='libhdfs3', user='centos')

/home/centos/miniconda3/envs/hdfs_test/lib/python3.5/site-packages/pyarrow/filesystem.py in __init__(self, host, port, user, kerb_ticket, driver)
    168     def __init__(self, host="default", port=0, user=None, kerb_ticket=None,
    169                  driver='libhdfs'):
--> 170         self._connect(host, port, user, kerb_ticket, driver)
    171
    172     @implements(Filesystem.isdir)

/home/centos/miniconda3/envs/hdfs_test/lib/python3.5/site-packages/pyarrow/io.pyx in pyarrow.io._HdfsClient._connect (/feedstock_root/build_artefacts/pyarrow_1488727736041/work/arrow-f6924ad83bc95741f003830892ad4815ca3b70fd/python/build/temp.linux-x86_64-3.5/io.cxx:11090)()

/home/centos/miniconda3/envs/hdfs_test/lib/python3.5/site-packages/pyarrow/error.pyx in pyarrow.error.check_status (/feedstock_root/build_artefacts/pyarrow_1488727736041/work/arrow-f6924ad83bc95741f003830892ad4815ca3b70fd/python/build/temp.linux-x86_64-3.5/error.cxx:1197)()

ArrowException: IOError: HDFS connection failed

Below shows a valid ticket:

[centos@ip-172-31-61-224 usr]$ klist
Ticket cache: FILE:/tmp/krb5cc_1000
Default principal: centos@DOMAIN

Valid starting       Expires              Service principal
04/03/2017 14:36:38  04/04/2017 14:36:38  krbtgt/DOMAIN@DOMAIN
        renew until 04/10/2017 14:36:38

Environment: Centos 7.2, HDP 2.4.3
Reporter: Benjamin Zaitlen
Assignee: Wes McKinney / @wesm

Related issues:

Note: This issue was originally created as ARROW-762. Please see the migration documentation for further details.

Metadata

Metadata

Assignees

Type

No type

Projects

No projects

Milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions