faster CRC32 implementation by MartinNowak · Pull Request #5023 · dlang/phobos

MartinNowak · 2017-01-07T17:48:29Z

use slicing by 8 algorithm with bigger precomputed tables
roughly 4x faster

WalterWaldron · 2017-01-07T19:01:28Z

This suggests that this implementation is endian sensitive.
The link contains the endian aware modification and mentions that reordering the lookups could yield further performance improvement.

MartinNowak · 2017-01-07T19:16:51Z

Nope, the implementation also works for big endian, b/c it assembles the uint's from byte-wise reads instead of relying on unaligned hardware loads. Still did the reordering of the operations which indeed provided a noticeable speedup.

WalterWaldron · 2017-01-07T23:49:23Z

Ok, I hadn't digested that hasUnalignedReads was only to force that optimization in DMD rather than to provide a per-architecture switch.

LGTM on the basis of matching other slicing by 8 implementations.

MartinNowak · 2017-01-08T02:17:57Z

After your comment, I was actually a bit unsure whether genTables is endian correct, so I build a gdc cross-compiler and tested on my MIPS router. Renamed the enum to make clear that this can only be done on LE architectures.

- use slicing by 8 algorithm with bigger precomputed tables - roughly 4x faster

DmitryOlshansky · 2017-01-09T11:41:48Z

LGTM, also love the CTFE construction of the table.

DmitryOlshansky

LGTM

UplinkCoder · 2017-01-10T01:49:48Z

Auto-merge toggled on

bgaff · 2017-01-11T10:40:57Z

Dumb question here but why can't sse4 crc32 numonics be used when available? I apologize of the answer is obvious

kubo39 · 2017-01-11T17:14:33Z

@bgaff SSE4.2 crc32 instruction is for castagnoli polynomial, not IEEE.

MartinNowak force-pushed the faster_crc branch from 2904621 to 40d7df8 Compare January 7, 2017 18:01

MartinNowak force-pushed the faster_crc branch from 40d7df8 to 86d0185 Compare January 7, 2017 19:15

MartinNowak force-pushed the faster_crc branch from 86d0185 to f4d6ecb Compare January 7, 2017 21:19

faster CRC32 implementation

382f9d2

- use slicing by 8 algorithm with bigger precomputed tables - roughly 4x faster

MartinNowak force-pushed the faster_crc branch from f4d6ecb to 382f9d2 Compare January 8, 2017 02:22

wilzbach added Severity:Enhancement Severity:Optimization labels Jan 8, 2017

DmitryOlshansky approved these changes Jan 9, 2017

View reviewed changes

UplinkCoder merged commit c0b6660 into dlang:master Jan 10, 2017

MartinNowak deleted the faster_crc branch January 10, 2017 11:57

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

faster CRC32 implementation#5023

faster CRC32 implementation#5023
UplinkCoder merged 1 commit intodlang:masterfrom
MartinNowak:faster_crc

MartinNowak commented Jan 7, 2017

Uh oh!

WalterWaldron commented Jan 7, 2017 •

edited

Loading

Uh oh!

MartinNowak commented Jan 7, 2017

Uh oh!

WalterWaldron commented Jan 7, 2017

Uh oh!

MartinNowak commented Jan 8, 2017 •

edited

Loading

Uh oh!

DmitryOlshansky commented Jan 9, 2017

Uh oh!

DmitryOlshansky left a comment

Uh oh!

UplinkCoder commented Jan 10, 2017

Uh oh!

bgaff commented Jan 11, 2017

Uh oh!

kubo39 commented Jan 11, 2017

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

7 participants

Uh oh!

Conversation

MartinNowak commented Jan 7, 2017

Uh oh!

WalterWaldron commented Jan 7, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

MartinNowak commented Jan 7, 2017

Uh oh!

WalterWaldron commented Jan 7, 2017

Uh oh!

MartinNowak commented Jan 8, 2017 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

DmitryOlshansky commented Jan 9, 2017

Uh oh!

DmitryOlshansky left a comment

Choose a reason for hiding this comment

Uh oh!

UplinkCoder commented Jan 10, 2017

Uh oh!

bgaff commented Jan 11, 2017

Uh oh!

kubo39 commented Jan 11, 2017

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

7 participants

WalterWaldron commented Jan 7, 2017 •

edited

Loading

MartinNowak commented Jan 8, 2017 •

edited

Loading