iframe-proxy

divanik · 2026-03-19T17:05:15Z

Changelog category (leave one):

Bug Fix (user-visible misbehavior in an official stable release)

Changelog entry (a user-readable short description of the changes that goes into CHANGELOG.md):

Make correctly processing negative values inside NumericIndexedVectorDataBSI

Version info

Merged into: 26.4.1.778

…IndexedVector` `UInt64(std::floor(rhs))` is undefined behavior when `rhs` is negative: for integer types, `std::floor` implicitly converts to double first; for float types, casting a negative double to `UInt64` is UB per C++ standard. `checkValidValue` throws for negative values at runtime, but the compiler may not see that line 1281 is unreachable for negatives, and UBSan fires. Fix by using `if constexpr` to skip `std::floor` for integer types and an explicit non-negative check for float types. https://s3.amazonaws.com/clickhouse-test-reports/json.html?PR=99869&sha=9b37a8c67f4b0fceacf60174b3bb77d2b8477431&name_0=PR&name_1=AST%20fuzzer%20%28amd_ubsan%29 Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

clickhouse-gh · 2026-03-19T17:05:59Z

Copilot

Pull request overview

This PR aims to eliminate undefined behavior in BSINumericIndexedVector when pointwise operations (notably ...PointwiseEqual) are invoked with negative scalar values, and adds a stateless regression test for the scenario.

Changes:

Adds a new stateless query test for negative scalar inputs to numericIndexedVectorPointwiseEqual.
Introduces a negative-value rejection in checkValidValue.
Adjusts scalar-to-integer conversion logic in BSINumericIndexedVector::pointwiseEqual(lhs, rhs).

Reviewed changes

Copilot reviewed 2 out of 3 changed files in this pull request and generated 4 comments.

File	Description
`src/AggregateFunctions/AggregateFunctionGroupNumericIndexedVectorDataBSI.h`	Adds validation and changes scalar conversion logic in pointwise equality path.
`tests/queries/0_stateless/04049_numeric_indexed_vector_negative_scalar_ubsan.sql`	New regression test asserting `INCORRECT_DATA` for negative scalar inputs.
`tests/queries/0_stateless/04049_numeric_indexed_vector_negative_scalar_ubsan.reference`	Reference output file for the new stateless test.

You can also share your feedback on Copilot code review. Take the survey.

+        if (value < 0)
+        {
+            throw Exception(ErrorCodes::INCORRECT_DATA, "NumericIndexedVector does not support negative values");
+        }


+        UInt64 long_value;
+        if constexpr (std::is_floating_point_v<ValueType>)
+        {
+            auto floored = std::floor(rhs);
+            long_value = floored >= 0 ? static_cast<UInt64>(floored) : static_cast<UInt64>(static_cast<Int64>(floored));
+        }


+-- Negative scalar values passed to NumericIndexedVector pointwise operations
+-- should throw INCORRECT_DATA, not trigger undefined behavior.
+
+DROP TABLE IF EXISTS t_int;
+CREATE TABLE t_int (ds Date, uin UInt32, value Int64) ENGINE = MergeTree() ORDER BY ds;
+INSERT INTO t_int VALUES ('2023-12-26', 1, 1);
+
+SELECT numericIndexedVectorPointwiseEqual(groupNumericIndexedVectorState(uin, value), -1) FROM t_int; -- { serverError INCORRECT_DATA }
+


rienath · 2026-03-23T12:04:47Z

Please ping me when it's ready for review, I'll take a look

…e` instead of rejecting negative values The blanket negative value check in `checkValidValue` broke all tests using signed types (Int8/16/32/64). BSI vectors support negative values via two's complement. The actual UBSan issue is the unguarded `Int64(value * scaling)` cast in `initializeFromVectorAndValue` when the float value is out of Int64 range. Add an overflow check matching the existing pattern in the `add` method. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

…ric_indexed_vector

clickhouse-gh · 2026-03-24T14:58:36Z

+        if constexpr (std::is_floating_point_v<ValueType>)
+        {
+            auto floored = std::floor(rhs);
+            long_value = floored >= 0 ? static_cast<UInt64>(floored) : static_cast<UInt64>(static_cast<Int64>(floored));


pointwiseEqual still has UB here for out-of-range floating values:

floored >= 0 branch: static_cast<UInt64>(floored) is undefined when floored > UInt64::max().

floored < 0 branch: static_cast<Int64>(floored) is undefined when floored < Int64::lowest().

checkValidValue only rejects NaN/Inf, so finite values like 1e30 / -1e30 can still hit this path.

Please add an explicit bounds check (or reuse float64ToUInt64, which already clamps both sides) before integer casts.

alexey-milovidov · 2026-04-09T20:06:44Z

@rienath, I think it is ready for review.

…ric_indexed_vector

…oat scalars The previous fix addressed the float-to-UInt64 cast for negative values by going through Int64 first, but the `decimal_value` computation was still broken for negative floats (converting the two's complement UInt64 back to float gives a huge positive number, leading to another UB cast). Rewrite `pointwiseEqual` to use the same unified fixed-point conversion as `initializeFromVectorAndValue`: compute `Int64 scaled_value` with an explicit overflow guard, then compare all bits of the two's complement representation in a single loop. For out-of-range scalars, return an empty bitmap (no element can match) instead of triggering UB. Also add test coverage for `pointwiseEqual` with out-of-range and negative float scalars. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

…ric_indexed_vector

…m shift underflow When `total_bit_num` is 0 (possible with `groupNumericIndexedVectorState('BSI', 0, 0)`), the expression `bit_pattern >> (total_bit_num - 1)` causes undefined behavior because `total_bit_num - 1` wraps to `UINT32_MAX`. Since a zero-bit BSI can only represent 0, and the `rhs == 0` case is already handled above, return an empty bitmap immediately. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

…ric_indexed_vector

`static_cast<Float64>(std::numeric_limits<Int64>::max())` rounds up to 2^63 in Float64, so values at exactly 2^63 passed the old `fabs(value) > lim` guard and then hit `static_cast<Int64>(...)` which is undefined behavior. Replace the rounded-Float64 bound with an exact-integer-domain check: compute the scaled value first, then compare against 2^63 (which is exactly representable in Float64) using `>= int64_upper || < -int64_upper`. Applied to all three call sites: `initializeFromVectorAndValue`, `pointwiseEqual`, and `addValue`. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

clickhouse-gh · 2026-04-10T03:46:00Z

LLVM Coverage Report

Metric	Baseline	Current	Δ
Lines	84.00%	84.00%	+0.00%
Functions	90.90%	90.90%	+0.00%
Branches	76.50%	76.50%	+0.00%

Changed lines: 92.54% (62/67) | lost baseline coverage: 9 line(s) · Uncovered code

Full report · Diff report

alexey-milovidov

The code is quite complex, but the changes look good.

clickgapai · 2026-04-28T04:19:05Z

divanik and others added 2 commits March 19, 2026 17:57

Fix undefined behaviour

dfb362f

clickhouse-gh Bot added the pr-bugfix Pull request with bugfix, not backported by default label Mar 19, 2026

Remove neuro-slopped comments

3b78b52

divanik changed the title ~~Fix undefined behaviour~~ Fix UB in BSINumericIndexedVector Mar 19, 2026

divanik requested review from Copilot and rienath March 19, 2026 17:07

divanik assigned rienath Mar 19, 2026

divanik mentioned this pull request Mar 19, 2026

UndefinedBehaviorSanitizer: undefined behavior (STID: 2527-362b) #100052

Closed

Copilot started reviewing on behalf of divanik March 19, 2026 17:08 View session

Copilot AI reviewed Mar 19, 2026

View reviewed changes

rienath removed their request for review March 23, 2026 12:04

alexey-milovidov mentioned this pull request Mar 24, 2026

Fix signed integer overflow in toStartOfInterval for Millisecond/Microsecond intervals #100156

Merged

1 task

alexey-milovidov and others added 2 commits March 24, 2026 15:49

Merge remote-tracking branch 'origin/master' into divanik/fix_UB_nume…

de95d2a

…ric_indexed_vector

clickhouse-gh Bot reviewed Mar 24, 2026

View reviewed changes

groeneai mentioned this pull request Apr 2, 2026

Fix undefined behavior in BSINumericIndexedVector pointwise operations #101614

Closed

alexey-milovidov mentioned this pull request Apr 9, 2026

Do not fail to start server on transient Azure errors during disk initialization #100701

Merged

1 task

alexey-milovidov and others added 2 commits April 9, 2026 20:11

Merge remote-tracking branch 'origin/master' into divanik/fix_UB_nume…

d023702

…ric_indexed_vector

clickhouse-gh Bot reviewed Apr 9, 2026

View reviewed changes

Comment thread src/AggregateFunctions/AggregateFunctionGroupNumericIndexedVectorDataBSI.h

alexey-milovidov mentioned this pull request Apr 9, 2026

Fix UBSan: float-to-Int64 overflow in NumericIndexedVector #102006

Closed

1 task

Merge remote-tracking branch 'origin/master' into divanik/fix_UB_nume…

b389a2d

…ric_indexed_vector

clickhouse-gh Bot reviewed Apr 9, 2026

View reviewed changes

Comment thread src/AggregateFunctions/AggregateFunctionGroupNumericIndexedVectorDataBSI.h Outdated

clickhouse-gh Bot reviewed Apr 9, 2026

View reviewed changes

Comment thread src/AggregateFunctions/AggregateFunctionGroupNumericIndexedVectorDataBSI.h Outdated

alexey-milovidov mentioned this pull request Apr 10, 2026

Fix column resize in play.html after web component refactoring #101295

Merged

1 task

alexey-milovidov and others added 2 commits April 10, 2026 00:31

Merge remote-tracking branch 'origin/master' into divanik/fix_UB_nume…

d221561

…ric_indexed_vector

alexey-milovidov approved these changes Apr 10, 2026

View reviewed changes

alexey-milovidov self-assigned this Apr 10, 2026

alexey-milovidov merged commit 23720c1 into master Apr 10, 2026
162 of 163 checks passed

alexey-milovidov deleted the divanik/fix_UB_numeric_indexed_vector branch April 10, 2026 05:17

robot-clickhouse-ci-2 added the pr-synced-to-cloud The PR is synced to the cloud repo label Apr 10, 2026

clickgapai mentioned this pull request Apr 28, 2026

pointwiseEqual post-loop sign-extension check unconditionally applies signed two's complement logic, returning empty for unsigned values with the high bit set #103640

Open

Sunbelt Computer Software

PL/B Language Development and Support

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix UB in BSINumericIndexedVector#100086

Fix UB in BSINumericIndexedVector#100086
alexey-milovidov merged 11 commits into
masterfrom
divanik/fix_UB_numeric_indexed_vector

divanik commented Mar 19, 2026 •

edited by robot-clickhouse

Loading

Uh oh!

clickhouse-gh Bot commented Mar 19, 2026 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

rienath commented Mar 23, 2026

Uh oh!

clickhouse-gh Bot Mar 24, 2026

Uh oh!

alexey-milovidov commented Apr 9, 2026

Uh oh!

Uh oh!

Uh oh!

Uh oh!

clickhouse-gh Bot commented Apr 10, 2026

Uh oh!

alexey-milovidov left a comment

Uh oh!

Uh oh!

clickgapai commented Apr 28, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

Sunbelt Computer Software

PL/B Language Development and Support

Uh oh!

Conversation

divanik commented Mar 19, 2026 • edited by robot-clickhouse Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Changelog category (leave one):

Changelog entry (a user-readable short description of the changes that goes into CHANGELOG.md):

Version info

Uh oh!

clickhouse-gh Bot commented Mar 19, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

AI Review

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

rienath commented Mar 23, 2026

Uh oh!

clickhouse-gh Bot Mar 24, 2026

Choose a reason for hiding this comment

Uh oh!

alexey-milovidov commented Apr 9, 2026

Uh oh!

Uh oh!

Uh oh!

Uh oh!

clickhouse-gh Bot commented Apr 10, 2026

LLVM Coverage Report

Uh oh!

alexey-milovidov left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

clickgapai commented Apr 28, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

divanik commented Mar 19, 2026 •

edited by robot-clickhouse

Loading

clickhouse-gh Bot commented Mar 19, 2026 •

edited

Loading