perf: better string building #997

henryiii · 2025-11-27T06:54:16Z

This makes the __str__ method faster, about 10%. This is used quite a bit, so saving some time here is useful.

henryiii · 2025-11-27T16:20:16Z

Almost all the savings here was from avoiding the NamedTuple indirection. This is now only 1% faster total time, probably 5% or something like that faster for the str operation. Saving the intermediate value doesn't have any measurable effect anymore, so I've remove that.

Now it's mostly up to if you think this looks better (and it still is a little faster).

brettcannon · 2025-11-27T18:05:43Z

Now it's mostly up to if you think this looks better

I'm indifferent.

notatallshaw · 2025-11-27T18:56:39Z

"".join(...) reads better to me because I've written that pattern so often in Python, but that's just anecdotal.

I was also under the impression, apparently incorrectly, that join would be faster. Because naively concatenating strings can be O(n^2) with regards to memory allocation operations, and I thought the .join method had some kind of optimization to handle that. Maybe this is just too few concatenations with too small strings where the memory allocations become a dominating factor.

henryiii · 2025-11-27T20:30:29Z

I'm nearly sure it's the fact the strings are generally small. If they were large I'm almost sure it would be the other way around.

I played around with several ways to do this - I thought making all four separately then using an f-string to join them would be fastest, but short circuiting if None was too important. Now that the largest cost (accessing the nested field in the NamedTuple) is gone, it's possible that is faster.

brettcannon · 2025-11-28T17:38:30Z

I was also under the impression, apparently incorrectly, that join would be faster. Because naively concatenating strings can be O(n^2) with regards to memory allocation operations, and I thought the .join method had some kind of optimization to handle that. Maybe this is just too few concatenations with too small strings where the memory allocations become a dominating factor.

There's an optimization in CPython specifically for += in a loop. So you're right that str.join() should be faster, but we cheated in CPython. 😁

henryiii · 2026-01-05T16:51:40Z

Okay, after one more change (inlining base_version), now this is much faster, around 10%.

All benchmarks:

Change	Before [`3803ce3`]	After [`4a4953d`] <henryiii/perf/str>	Ratio	Benchmark (Parameter)
	2.30±0.01ms	2.31±0.01ms	1.01	markers.TimeMarkerSuite.time_constructor
	1.20±0.02ms	1.15±0.05ms	0.95	markers.TimeMarkerSuite.time_evaluate
	9.88±0.4ms	9.68±0.7ms	0.98	requirement.TimeRequirementSuite.time_constructor
	601±8μs	605±70μs	1.01	resolver.TimeResolverSuite.time_resolver_loop
	3.43±0.04ms	3.33±0.03ms	0.97	specifiers.TimeSpecSuite.time_constructor
	4.09±0.02ms	4.00±0.03ms	0.98	specifiers.TimeSpecSuite.time_contains
	61.5±0.3μs	61.2±0.3μs	1	specifiers.TimeSpecSuite.time_filter
	3.99±0.04μs	4.04±0.04μs	1.01	utils.TimeUtils.time_canonicalize_name
	1.97±0.01ms	1.98±0ms	1.01	version.TimeVersionSuite.time_constructor
	1.87±0.01ms	1.87±0.01ms	1	version.TimeVersionSuite.time_sort
-	811±6μs	728±9μs	0.9	version.TimeVersionSuite.time_str

Signed-off-by: Henry Schreiner <henryfs@princeton.edu> chore: remove saving intermediate (no longer much faster) Signed-off-by: Henry Schreiner <henryfs@princeton.edu>

Signed-off-by: Henry Schreiner <henryfs@princeton.edu>

henryiii force-pushed the henryiii/perf/str branch 3 times, most recently from 267b5d9 to e584ec3 Compare November 27, 2025 16:14

henryiii force-pushed the henryiii/perf/str branch 4 times, most recently from 624a237 to 4a4953d Compare January 5, 2026 16:50

henryiii force-pushed the henryiii/perf/str branch from 4a4953d to 4ebd365 Compare January 5, 2026 17:03

henryiii added 2 commits January 5, 2026 12:03

perf: better string building

8b367b1

Signed-off-by: Henry Schreiner <henryfs@princeton.edu> chore: remove saving intermediate (no longer much faster) Signed-off-by: Henry Schreiner <henryfs@princeton.edu>

perf: inline base_version

4ebd365

Signed-off-by: Henry Schreiner <henryfs@princeton.edu>

notatallshaw approved these changes Jan 5, 2026

View reviewed changes

henryiii merged commit 65092ce into pypa:main Jan 6, 2026
40 checks passed

henryiii deleted the henryiii/perf/str branch January 6, 2026 00:00

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

perf: better string building #997

perf: better string building #997

henryiii commented Nov 27, 2025 •

edited

Loading

Uh oh!

henryiii commented Nov 27, 2025 •

edited

Loading

Uh oh!

brettcannon commented Nov 27, 2025

Uh oh!

notatallshaw commented Nov 27, 2025

Uh oh!

henryiii commented Nov 27, 2025

Uh oh!

brettcannon commented Nov 28, 2025

Uh oh!

henryiii commented Jan 5, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

perf: better string building #997

perf: better string building #997

Conversation

henryiii commented Nov 27, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

henryiii commented Nov 27, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

brettcannon commented Nov 27, 2025

Uh oh!

notatallshaw commented Nov 27, 2025

Uh oh!

henryiii commented Nov 27, 2025

Uh oh!

brettcannon commented Nov 28, 2025

Uh oh!

henryiii commented Jan 5, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

henryiii commented Nov 27, 2025 •

edited

Loading

henryiii commented Nov 27, 2025 •

edited

Loading