iframe-proxy

ibrahimkarimeddin · 2026-03-19T05:32:00Z

Changelog category (leave one):

New Feature

Changelog entry:

Added support for SQL truth-value predicates IS TRUE, IS FALSE, IS UNKNOWN, IS NOT TRUE, IS NOT FALSE, and IS NOT UNKNOWN for nullable boolean expressions.

Documentation entry for user-facing changes

Motivation: Allows unambiguous testing of nullable boolean columns for all three SQL truth values (true, false, null/unknown). Unlike = true, the IS TRUE predicate returns false (not NULL) when the value is NULL.
Example use:

-- Scalar
SELECT 1 WHERE NULL IS UNKNOWN;  -- returns 1
SELECT 1 WHERE NULL IS TRUE;     -- returns nothing
-- On a nullable column
SELECT
    x IS TRUE,
    x IS FALSE,
    x IS UNKNOWN
FROM my_table;

<!-- ch-version-info:start -->
### Version info
- Merged into: `26.5.1.864`
<!-- ch-version-info:end -->

CLAassistant · 2026-03-19T05:32:11Z

clickhouse-gh · 2026-03-19T06:01:18Z

Workflow [PR], commit [8adb439]

Summary: ❌

job_name	test_name	status	info	comment
Stateless tests (amd_llvm_coverage, ParallelReplicas, s3 storage, parallel)		FAIL
	01710_projection_additional_filters	FAIL	cidb

AI Review

Summary

This PR adds parser support for SQL truth-value predicates (IS TRUE, IS FALSE, IS UNKNOWN, and IS NOT variants), rewrites them to existing comparison/null-check functions, documents the new operators, and adds a focused stateless test. I reviewed the current diff and prior discussion threads against the latest code, and I do not see unresolved contract violations in correctness, safety, compatibility, or rollout behavior.

Final Verdict

Status: ✅ Approve

alexey-milovidov

Good change, almost ready!

…m` and `isDistinctFrom` operators.

ibrahimkarimeddin · 2026-03-20T09:07:40Z

@alexey-milovidov I checked the failing tests, and they do not seem to be caused by the new changes in this PR. Please let me know whether I should investigate them or treat them as unrelated CI failures.

alexey-milovidov · 2026-03-24T23:58:58Z

@ibrahimkarimeddin, check this https://s3.amazonaws.com/clickhouse-test-reports/json.html?PR=99997&sha=ee482d21a5ba665f0895314230eee1ca8432f858&name_0=PR&name_1=Finish%20Workflow

ibrahimkarimeddin · 2026-03-30T18:54:06Z

@alexey-milovidov just a kind reminder on this PR. If any further updates are needed from my side, I’m ready to make them.

alexey-milovidov

LGTM. Approved.

ibrahimkarimeddin · 2026-04-06T15:05:47Z

@alexey-milovidov sorry for the ping — quick question as this is my first time. Do I need to do anything after approval for the PR to be merged, or does it just take some time?

alexey-milovidov · 2026-04-06T21:56:52Z

We will take it from here.

alexey-milovidov · 2026-04-07T01:15:33Z

Fix for the integration test: #101897

…alse-unknown

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

alexey-milovidov · 2026-04-08T00:49:02Z

Now we have to fix the hung queries in the Stress test...

alexey-milovidov · 2026-04-10T02:14:43Z

The Hung check failure is fixed by #102008 and #102010, let's update the branch

…mkarimeddin-fix-99597-is-true-false-unknown

alexey-milovidov · 2026-04-10T10:24:41Z

I fixed the hung check, will update the branch.

…alse-unknown

ibrahimkarimeddin · 2026-05-01T07:09:51Z

Hi @alexey-milovidov , gentle follow-up on this PR. Thanks again for the review and help here. Please let me know if anything else is needed from my side.

alexey-milovidov · 2026-05-16T00:52:48Z

I will take it from here. The CI is not green, we have to fix the CI first.

…alse-unknown

alexey-milovidov · 2026-05-16T00:58:08Z

Merged latest master into the branch and pushed (8adb439e4f5d). CI should re-run.

CI failures analysis (from the previous run)

The two failing checks are unrelated to this PR (which only touches the SQL parser):

Stateless tests (amd_binary, flaky check) — timeout in 03207_json_read_subcolumns_2_memory. This is a JSON subcolumns test, no relation to truth-value predicates.
Stress test (arm_msan) — MemorySanitizer: use-of-uninitialized-value inside DB::MergeTreeReadersChain::executeActionsBeforePrewhere, with the uninitialized memory created by an allocation of patch_max_version in src/Storages/MergeTree/MergeTreeReadersChain.cpp:210. Entirely in MergeTree read code.

@groeneai, investigate both unrelated failures:

groeneai · 2026-05-16T01:35:07Z

@alexey-milovidov — investigated both failures. Both are pre-existing trunk issues, unrelated to this PR's parser-only diff.

1. Stress test (arm_msan) — `MemorySanitizer: use-of-uninitialized-value (STID: 1478-2063)`

This is the same chronic MSan trunk bug tracked by #102241 (which was closed as completed on 2026-04-10, but the bug is still active — see below).

The stack shows the uninitialized value is reported through TextLog::appendToBlock → OwnAsyncSplitChannel::runTextLog, and MSan attributes the allocation to patch_max_version in MergeTreeReadersChain::executeActionsBeforePrewhere at src/Storages/MergeTree/MergeTreeReadersChain.cpp:210. The actual variable is the std::optional<UInt64> returned by getMaxPatchVersionForStep (which just forwards prewhere_info->mutation_version). The poison is then propagated through patch-application machinery and eventually surfaces in a Poco::Message constructor when text logs are flushed asynchronously.

Cross-PR distribution (last 30 days, stress-test rows whose test_context_raw mentions patch_max_version or MergeTreeReadersChain): 24+ distinct PRs affected, all sanitizer/architecture variants:

pull_request_number  check_name                              cnt  last_day
100185               Stress test (amd_asan_ubsan)             1   2026-05-05
100185               Stress test (amd_msan)                   1   2026-05-05
100185               Stress test (arm_msan)                   1   2026-05-14
100185               Stress test (arm_tsan)                   1   2026-05-05
100185               Stress test (amd_debug)                  1   2026-05-14
104494               Stress test (arm_asan_ubsan, s3)         1   2026-05-15
104903               Stress test (amd_debug)                  1   2026-05-14
104120               Stress test (amd_msan)                   1   2026-05-06
... and ~16 more PRs across amd/arm × debug/asan/tsan/msan/ubsan

Master hits (pull_request_number = 0) in the last 60 days:

day          failures  checks
2026-05-14   1         Stress test (experimental, serverfuzz, azure, amd_tsan)
2026-05-11   1         Stress test (experimental, serverfuzz, arm_tsan)
2026-05-07   1         Stress test (experimental, serverfuzz, amd_tsan)
2026-04-25   1         Stress test (amd_debug)
2026-04-21   1         Stress test (experimental, serverfuzz, amd_debug)
2026-04-10   4         Stress test (arm_msan)

8 master hits in 60 days, latest 2026-05-14. #102241 should likely be reopened — the bug is still recurring on master (including the serverfuzz nightlies) more than a month after the issue was closed.

PR #102305 (Unpoison fiber stack on AArch64) and PR #102290 (Rollback jemalloc upgrade) were both opened as candidate fixes but closed without merging, so the root cause remains unaddressed.

2. Stateless tests (amd_binary, flaky check) — `03207_json_read_subcolumns_2_memory` timeout

Only relevant to the flaky-check stage (which re-runs new/changed tests 50× under heavy randomization within a 600s budget). The test itself is intrinsically heavy: 800K rows of JSON data with max_dynamic_paths=2 and dozens of SELECT json.… paths-with-types queries.

Cross-PR distribution (last 60 days):

pull_request_number  cnt  first_day    last_day     checks
99513                5    2026-03-25   2026-03-26   amd_llvm_coverage variants
102269               2    2026-04-10   2026-04-13   amd_binary, flaky check
99997                2    2026-04-10   2026-04-10   amd_binary, flaky check
101645               2    2026-04-10   2026-04-10   amd_binary, flaky check
101918               1    2026-04-10   2026-04-10   amd_binary, flaky check
100535               1    2026-04-10   2026-04-10   amd_binary, flaky check
96130                1    2026-03-21   2026-03-21   arm_binary, parallel

Master hits in 60 days: 0.

5 of the 6 amd_binary, flaky check hits cluster on a single day (2026-04-10) across completely unrelated PRs (#102269, #99997, #101645, #101918, #100535) — same day the run we're looking at was performed. That looks like a CI infrastructure or shared-resource event on that runner, not a property of this PR.

Note: I previously merged #101765 for the sibling test 03469_json_read_subcolumns_combined_2_compact_merge_tree, which had a similar randomized-settings-induced timeout. The same approach (capping index_granularity / index_granularity_bytes and pinning max_threads) likely applies to 03207_json_read_subcolumns_2_memory if it surfaces again. I'll open a follow-up for that.

Conclusion

Both failures are pre-existing trunk issues that pre-date this PR. The new CI run on 8adb439e4f5d should be fine for the flaky-check timeout (the cluster of 2026-04-10 hits looks transient) and may also miss the MSan report (it's stochastic at ~one hit per few days per build variant), but the MSan signature itself is master-bug territory and not caused by the parser changes here. I'll file a follow-up to revisit #102241 / extend the timeout fix to 03207_json_read_subcolumns_2_memory.

groeneai · 2026-05-16T01:37:11Z

Correction on the MSan analysis above — I was over-inclusive with my pattern matching.

The test_context_raw LIKE '%patch_max_version%' OR LIKE '%MergeTreeReadersChain%' query I ran above also matches stress runs whose attached server log contains other unrelated failures (Logical errors, Hung check, etc.) — the MergeTreeReadersChain::executeActionsBeforePrewhere frame appears in many normal query stack traces, so the broader query is noisy.

Filtering to the actual STID test_name:

SELECT toStartOfDay(check_start_time) AS day, count() AS failures, any(check_name) AS check
FROM default.checks
WHERE test_name = 'MemorySanitizer: use-of-uninitialized-value (STID: 1478-2063)'
  AND test_status IN ('FAIL', 'ERROR')
  AND pull_request_number = 0
  AND check_start_time > now() - INTERVAL 60 DAY
GROUP BY day ORDER BY day DESC

day          failures  check
2026-04-10   3         Stress test (arm_msan)
2026-04-09   12        Stress test (arm_msan)
2026-04-08   1         Stress test (arm_msan)
2026-04-03   1         Stress test (arm_msan)

Zero master hits since 2026-04-10, so STID 1478-2063 itself does appear to be resolved (likely consistent with #102241 being closed on that same day).

The CI failure linked from this PR is on SHA 1b7c5887c2f7… — checking the failure log timestamps, that run is from 2026-04-10 08:21, i.e. before the fix that closed #102241 landed. So this is a stale CI run, not a fresh MSan recurrence. The new CI on 8adb439e4f5d should not reproduce it.

Same story for the 03207_json_read_subcolumns_2_memory flaky-check timeout — also from the 2026-04-10 run cluster, also stale.

TL;DR: Both failures are stale CI from 2026-04-10. The new run on 8adb439e4f5d should be green; no further investigation needed on my side. Apologies for the noise on #102241 — that one looks legitimately fixed.

clickhouse-gh · 2026-05-16T04:15:57Z

LLVM Coverage Report

Metric	Baseline	Current	Δ
Lines	84.10%	84.10%	+0.00%
Functions	90.80%	90.80%	+0.00%
Branches	76.60%	76.60%	+0.00%

Changed lines: 94.34% (50/53) | lost baseline coverage: 1 line(s) · Uncovered code

Full report · Diff report

PedroTadim · 2026-05-26T14:19:51Z

+    toUInt8(x IS NOT UNKNOWN)
+FROM bool_predicates
+ORDER BY isNull(x), x;
+


@groeneai if there is a set index on a merge tree boolean column and IS TRUE or IS UNKNOWN predicate (if the column is nullable) is used, is the index used? Add a test case, please.

Filed #105865 with a regression test. Short answer: yes, the set skip index is already used in all six forms.

The new predicates lower to functions that the set index recognises in MergeTreeIndexConditionSet::atomFromDAG and then evaluates per-granule in mayBeTrueOnGranule:

b IS TRUE -> isNotDistinctFrom(b, true)

b IS FALSE -> isNotDistinctFrom(b, false)

b IS UNKNOWN -> isNull(b)

b IS NOT TRUE -> isDistinctFrom(b, true)

b IS NOT FALSE -> isDistinctFrom(b, false)

b IS NOT UNKNOWN -> isNotNull(b)

With three granules holding only true, only false, and only NULL respectively, the set index drops the two unrelated granules for each positive form and the matching granule for each IS NOT form. The new test asserts that via ProfileEvents['SelectedMarks'].

One small follow-up not addressed by this PR: EXPLAIN indexes = 1 prints Condition: true for the isNotDistinctFrom / isDistinctFrom forms because KeyCondition only special-cases isNull / isNotNull (for IS UNKNOWN / IS NOT UNKNOWN it prints (b isNull) / (b isNotNull)). The pruning still happens via the actions DAG path, but teaching KeyCondition about the new operators would make the EXPLAIN output more informative. Happy to send a separate PR for that if you'd like.

Sibling of `04290_set_index_is_true_false_unknown` covering the leading primary-key path on a `Nullable(Bool)` column. `KeyCondition` recognises the `isNull` / `isNotNull` lowered forms and drops one granule for `IS UNKNOWN` and `IS NOT UNKNOWN`. The other four predicates lower to `isNotDistinctFrom` / `isDistinctFrom`, which are not recognised by `KeyCondition`, so the test pins this current behaviour. Follow-up to ClickHouse#99997. Requested in ClickHouse#105865.

…NKNOWN Verifies that the set skip index prunes granules for the new truth-value predicates from ClickHouse#99997 on a `Nullable(Bool)` column. The new test populates three granules of eight rows each (only true, only false, only NULL) and checks via `ProfileEvents['SelectedMarks']` that the index drops the two unrelated granules for each positive form and the matching granule for each `IS NOT` form. Follow-up to ClickHouse#99997.

The `swap-clickhouse-jdbc.py` module docstring claimed the NoREC oracle's `IS TRUE` postfix is rewritten to `= TRUE`, but the implementation (and the inline comment, and the PR description) correctly rewrite it to `!= 0`. The `= TRUE` variant was tried and rejected; the docstring was left behind. Also clarify the rationale in both the docstring and the Dockerfile comment. `IS TRUE`/`IS FALSE` are now implemented in ClickHouse's parser (since #99997), but `IS TRUE` parses as `<expr> <=> true`, i.e. strict equality with `1`, which does not match `WHERE <expr>` truthiness (any non-zero numeric). Verified locally that `5 IS TRUE` = 0 while `WHERE 5` matches and `5 != 0` = 1, so the `!= 0` rewrite remains correct and necessary - using native `IS TRUE` would silently produce false-positive oracle mismatches. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

Add support for IS TRUE, IS FALSE, IS UNKNOWN boolean predicates

361733e

alexey-milovidov added the can be tested Allows running workflows for external contributors label Mar 19, 2026

clickhouse-gh Bot added the pr-feature Pull request with new product feature label Mar 19, 2026

alexey-milovidov reviewed Mar 19, 2026

View reviewed changes

Comment thread src/Parsers/ExpressionListParsers.cpp Outdated

alexey-milovidov reviewed Mar 19, 2026

View reviewed changes

Comment thread src/Parsers/ExpressionListParsers.cpp

alexey-milovidov reviewed Mar 19, 2026

View reviewed changes

alexey-milovidov self-assigned this Mar 19, 2026

clickhouse-gh Bot reviewed Mar 19, 2026

View reviewed changes

Comment thread src/Parsers/ExpressionListParsers.cpp

Refactor makeTruthValuePredicateOperator to using `isNotDistinctFro…

ee482d2

…m` and `isDistinctFrom` operators.

ibrahimkarimeddin requested a review from alexey-milovidov March 19, 2026 09:14

ibrahimkarimeddin added 2 commits March 25, 2026 21:08

Add docs for IS TRUE, IS FALSE, and IS UNKNOWN operators

494d7c9

fix relative path for Bool data type link in operators documentation.

8c897dc

clickhouse-gh Bot reviewed Mar 25, 2026

View reviewed changes

Comment thread src/Parsers/ExpressionListParsers.cpp

Remove an extra blank line in the SQL operators documentation.

b637d74

alexey-milovidov approved these changes Mar 30, 2026

View reviewed changes

alexey-milovidov and others added 2 commits April 7, 2026 18:44

Merge remote-tracking branch 'origin/master' into fix-99597-is-true-f…

5cb0526

…alse-unknown

Minor style fixes: add trailing newline to test, remove extra blank line

355f90c

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

alexey-milovidov mentioned this pull request Apr 8, 2026

Do not ignore DROP for refreshable materialized views in stress tests #102008

Merged

1 task

Merge remote-tracking branch 'origin/master' into merge-master-ibrahi…

9fc09f2

…mkarimeddin-fix-99597-is-true-false-unknown

Merge remote-tracking branch 'origin/master' into fix-99597-is-true-f…

1b7c588

…alse-unknown

alexey-milovidov mentioned this pull request May 16, 2026

CI: Add SQLancer++ check #104984

Merged

1 task

Merge remote-tracking branch 'origin/master' into fix-99597-is-true-f…

8adb439

…alse-unknown

alexey-milovidov merged commit 97cbcf5 into ClickHouse:master May 19, 2026
163 of 167 checks passed

robot-ch-test-poll2 added the pr-synced-to-cloud The PR is synced to the cloud repo label May 19, 2026

PedroTadim reviewed May 26, 2026

View reviewed changes

groeneai mentioned this pull request May 26, 2026

Add regression test for set skip index with IS TRUE / IS FALSE / IS UNKNOWN #105865

Merged

1 task

robot-clickhouse-ci-1 mentioned this pull request Jun 27, 2026

Support IS TRUE, IS FALSE, IS UNKNOWN boolean predicates #99597

Closed

Sunbelt Computer Software

PL/B Language Development and Support

Uh oh!

Conversation

ibrahimkarimeddin commented Mar 19, 2026 • edited by robot-clickhouse Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Changelog category (leave one):

Changelog entry:

Documentation entry for user-facing changes

Uh oh!

CLAassistant commented Mar 19, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

clickhouse-gh Bot commented Mar 19, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

AI Review

Summary

Final Verdict

Uh oh!

Uh oh!

Uh oh!

alexey-milovidov left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

ibrahimkarimeddin commented Mar 20, 2026

Uh oh!

alexey-milovidov commented Mar 24, 2026

Uh oh!

Uh oh!

ibrahimkarimeddin commented Mar 30, 2026

Uh oh!

alexey-milovidov left a comment

Choose a reason for hiding this comment

Uh oh!

ibrahimkarimeddin commented Apr 6, 2026

Uh oh!

alexey-milovidov commented Apr 6, 2026

Uh oh!

alexey-milovidov commented Apr 7, 2026

Uh oh!

alexey-milovidov commented Apr 8, 2026

Uh oh!

alexey-milovidov commented Apr 10, 2026

Uh oh!

alexey-milovidov commented Apr 10, 2026

Uh oh!

ibrahimkarimeddin commented May 1, 2026

Uh oh!

alexey-milovidov commented May 16, 2026

Uh oh!

alexey-milovidov commented May 16, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

CI failures analysis (from the previous run)

Uh oh!

groeneai commented May 16, 2026

1. Stress test (arm_msan) — MemorySanitizer: use-of-uninitialized-value (STID: 1478-2063)

2. Stateless tests (amd_binary, flaky check) — 03207_json_read_subcolumns_2_memory timeout

Conclusion

Uh oh!

groeneai commented May 16, 2026

Uh oh!

clickhouse-gh Bot commented May 16, 2026

LLVM Coverage Report

Uh oh!

Uh oh!

PedroTadim May 26, 2026

Choose a reason for hiding this comment

Uh oh!

groeneai May 26, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

ibrahimkarimeddin commented Mar 19, 2026 •

edited by robot-clickhouse

Loading

CLAassistant commented Mar 19, 2026 •

edited

Loading

clickhouse-gh Bot commented Mar 19, 2026 •

edited

Loading

alexey-milovidov commented May 16, 2026 •

edited

Loading

1. Stress test (arm_msan) — `MemorySanitizer: use-of-uninitialized-value (STID: 1478-2063)`

2. Stateless tests (amd_binary, flaky check) — `03207_json_read_subcolumns_2_memory` timeout