Prevent use of unaligned uint16_t pointers #145

daviesrob · 2026-01-14T12:31:45Z

UBSan on clang complains about dereferencing unaligned uint16_tpointers even when the dereference is being done by memmove(). Strictly it's correct as even setting the pointer to such a value is undefined behaviour according to the standard.

Silence the noise by changing the pointers to uint8_t and adjusting the arithmetic on them as necessary.

Some bounds checks to ensure fast code won't read beyond the end of its input are also adjusted to prevent any possibility of generating an address beyond the limits of the input memory.

(configure CFLAGS='-g -O2 -fsanitize=address,undefined -fno-sanitize-recover=all' LDFLAGS='-g -O2 -fsanitize=address,undefined' to highlight the problem this solves.)

jkbonfield · 2026-02-02T16:37:33Z

htscodecs/rANS_static32x16pr.c

+                ptr16[-2] = ransN[z-k];
+                ptr16[-1] = ransN[z-k]>>8;


This is incorrect.

If you make htscodecs_endian.h a nop so the #ifdef HTSCODECS_LITTLE_ENDIAN checks fail, this no longer builds.

It should be:

ptr[-2] = ransN[z-k]; ptr[-1] = ransN[z-k]>>8;

Should be fixed now.

UBSan on clang complains about dereferencing unaligned uint16_t pointers even when the dereference is being done by memmove(). Strictly it's correct as even setting the pointer to such a value is undefined behaviour according to the standard. Silence the noise by changing the pointers to uint8_t and adjusting the arithmetic on them as necessary. Some bounds checks to ensure fast code won't read beyond the end of its input are also adjusted to prevent any possibility of generating an address beyond the limits of the input memory.

The O0 and O1 encoder had incorrect arguments to *storeu_epi16. The (1<<pc2)-1 is creating a bit-mask of which 16-bit quantities to store (it's the "_mask_" part of the function call), so we can't naively double it. The O1 decoder missed doubling the 'sp' increment when we're decoding a TF_SHIFT_O1_FAST data-stream.

jkbonfield · 2026-02-03T15:11:59Z

If you're happy that my changes to the AVX512 didn't reintroduce any issues (it doesn't look like it would trigger ubsan as the changes are minimal, but I didn't retest it), then I'm happy to accept this.

I noted it has a small performance hit in the scalar encoding function (ie rans_compress_O0_32x16). It's about 7% slower on clang-21 and less with gcc. However this isn't such a commonly targetted platform type as most modern x86_64 systems have AVX2 capabilities and 7% isn't disastrous.

I did try improving it, but everything I tried slowed up down so I'll not spend more time fiddling with it.

daviesrob · 2026-02-03T15:38:17Z

I'm happy with your changes.

daviesrob assigned jkbonfield Jan 22, 2026

jkbonfield reviewed Feb 2, 2026

View reviewed changes

daviesrob and others added 2 commits February 3, 2026 15:07

jkbonfield force-pushed the pointer-alignment branch from fb67d96 to 6e4d12d Compare February 3, 2026 15:08

jkbonfield merged commit 1682b5f into samtools:master Feb 3, 2026
6 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Prevent use of unaligned uint16_t pointers #145

Prevent use of unaligned uint16_t pointers #145

Uh oh!

daviesrob commented Jan 14, 2026

Uh oh!

jkbonfield Feb 2, 2026

Uh oh!

daviesrob Feb 2, 2026

Uh oh!

jkbonfield commented Feb 3, 2026

Uh oh!

daviesrob commented Feb 3, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Prevent use of unaligned uint16_t pointers #145

Prevent use of unaligned uint16_t pointers #145

Uh oh!

Conversation

daviesrob commented Jan 14, 2026

Uh oh!

jkbonfield Feb 2, 2026

Choose a reason for hiding this comment

Uh oh!

daviesrob Feb 2, 2026

Choose a reason for hiding this comment

Uh oh!

jkbonfield commented Feb 3, 2026

Uh oh!

daviesrob commented Feb 3, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants