PERF: always slice when indexing on columns #33597

jbrockmendel · 2020-04-16T20:35:08Z

Touched on in #32779, this makes all* DataFrame indexing-on-columns do slicing instead of taking, so doesn't make copies.

* Actually there is a different path in _slice_take_blocks_ax0 that we go down if we self._is_single_block, can update that later if we decide this is something we want to do.

In [3]: dti = pd.date_range("2016-01-01", periods=10**5, freq="S")                                                                                                                                                  
In [4]: df = pd.DataFrame._from_arrays([dti]*10 + [dti - dti] * 10 + [dti.to_period("D")]*10, columns=range(30), index=range(len(dti)))                                                                             

In [8]: arr = np.arange(30)                                                                                                                                                                                         
In [9]: np.random.shuffle(arr)                                                                                                                                                                                      

In [10]: %timeit df[arr]
8.35 ms ± 64.1 µs per loop (mean ± std. dev. of 7 runs, 100 loops each)    # <-- master                                                                                                                                                     
650 µs ± 52.5 µs per loop (mean ± std. dev. of 7 runs, 1000 loops each)   # <-- PR

The tradeoff is that we end up with less-consolidated results. I'm OK with that, but there may be downsides I'm not aware of, will wait for others to weigh in.

…rf-arith-frame

jorisvandenbossche · 2020-04-17T20:38:20Z

@jbrockmendel can you maybe open an issue for this? As your questions seems to be more broad than the actual PR, and to not mix that discussion with code review of the actual PR

…rf-never-take

jbrockmendel · 2020-04-21T01:19:03Z

i cant reproduce the test failures locally

jbrockmendel · 2020-04-24T23:21:45Z

Opened #33780, mothballing this.

jreback · 2021-11-28T21:05:35Z

this is quite old, happen to reopen if actively worked on.

jbrockmendel added 30 commits March 17, 2020 09:59

PERF: block-wise arithmetic for frame-with-frame

1697252

Merge branch 'master' of https://github.com/pandas-dev/pandas into pe…

a7764d6

…rf-arith-frame

lint fixup

30a836d

Merge branch 'master' of https://github.com/pandas-dev/pandas into pe…

3559698

…rf-arith-frame

troubleshoot npdev build

4334353

Merge branch 'master' of https://github.com/pandas-dev/pandas into pe…

cb40b0c

…rf-arith-frame

comment

713a776

checkpoint passing

95ef3ad

checkpoint passing

61e5cd6

refactor

89c3d7b

blackify

e348e46

Merge branch 'master' of https://github.com/pandas-dev/pandas into pe…

519c757

…rf-arith-frame

disable assertions for perf

2b1ba18

Merge branch 'master' of https://github.com/pandas-dev/pandas into pe…

53e93fc

…rf-arith-frame

asv

91c86a3

whatsnew

2034084

Merge branch 'master' of https://github.com/pandas-dev/pandas into pe…

8aedf35

…rf-arith-frame

revert warning suppression

0c12d35

Fixupm indentation

9727562

Merge branch 'master' of https://github.com/pandas-dev/pandas into pe…

6661dd3

…rf-arith-frame

suppress warning

42bbbf3

Merge branch 'master' of https://github.com/pandas-dev/pandas into pe…

65ab023

…rf-arith-frame

update asv

0d958a3

Merge branch 'master' of https://github.com/pandas-dev/pandas into pe…

7f91e74

…rf-arith-frame

_data->_mgr

56eef51

Merge branch 'master' of https://github.com/pandas-dev/pandas into pe…

4baea6f

…rf-arith-frame

Merge branch 'master' of https://github.com/pandas-dev/pandas into pe…

41a4e7a

…rf-arith-frame

Merge branch 'master' of https://github.com/pandas-dev/pandas into pe…

b23144e

…rf-arith-frame

Merge branch 'master' of https://github.com/pandas-dev/pandas into pe…

7f24d57

…rf-arith-frame

update to use faspath constructor

ae744b7

jbrockmendel added 2 commits April 20, 2020 17:50

Merge branch 'master' of https://github.com/pandas-dev/pandas into pe…

e557ace

…rf-never-take

avoid take in one more case

96b7d6a

jbrockmendel mentioned this pull request Apr 24, 2020

API: Should indexing on columns give views or copies? #33780

Closed

jbrockmendel closed this Apr 24, 2020

jbrockmendel added the Mothballed Temporarily-closed PR the author plans to return to label Apr 24, 2020

jbrockmendel mentioned this pull request Sep 4, 2020

DEPR: making copies when indexing along columns #36105

Closed

5 tasks

jbrockmendel mentioned this pull request Jan 20, 2021

BUG: 2D ndarray of dtype 'object' is always copied upon construction #39272

Merged

4 tasks

jbrockmendel mentioned this pull request Jul 16, 2021

Proposal for future copy / view semantics in indexing operations #36195

Closed

jbrockmendel added 6 commits July 16, 2021 14:57

Merge branch 'master' into perf-never-take

2ab3917

Merge branch 'master' into perf-never-take

c63464f

Merge branch 'master' into perf-never-take

6e34e3e

Merge branch 'master' into perf-never-take

314a0f8

Merge branch 'master' into perf-never-take

587ffef

update

7fad752

jbrockmendel reopened this Jul 24, 2021

jbrockmendel added Indexing Related to indexing on series/frames, not to indexes themselves Copy / view semantics and removed Mothballed Temporarily-closed PR the author plans to return to labels Jul 24, 2021

jbrockmendel added 6 commits July 31, 2021 11:20

Merge branch 'master' into perf-never-take

7e8b0bd

Merge branch 'master' into perf-never-take

6d82359

fix last 2 failing tests

201e57d

mypy fixup

69f08cb

Merge branch 'master' into perf-never-take

f65cbf1

Merge branch 'master' into perf-never-take

f337dfe

jreback closed this Nov 28, 2021

jbrockmendel added the Mothballed Temporarily-closed PR the author plans to return to label Dec 6, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

PERF: always slice when indexing on columns #33597

PERF: always slice when indexing on columns #33597

Uh oh!

jbrockmendel commented Apr 16, 2020

Uh oh!

jorisvandenbossche commented Apr 17, 2020

Uh oh!

jbrockmendel commented Apr 21, 2020

Uh oh!

jbrockmendel commented Apr 24, 2020

Uh oh!

jreback commented Nov 28, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Uh oh!

PERF: always slice when indexing on columns #33597

PERF: always slice when indexing on columns #33597

Uh oh!

Conversation

jbrockmendel commented Apr 16, 2020

Uh oh!

jorisvandenbossche commented Apr 17, 2020

Uh oh!

jbrockmendel commented Apr 21, 2020

Uh oh!

jbrockmendel commented Apr 24, 2020

Uh oh!

jreback commented Nov 28, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants