yakov-olkhovskiy · 2026-03-17T13:18:06Z

Reverts #99696

Version info

Merged into: 26.5.1.611

clickhouse-gh · 2026-03-17T13:18:49Z

Workflow [PR], commit [fc6afd7]

Summary: ✅

AI Review

Summary

This PR reverts the revert of ClickHouse/ClickHouse#99696, re-enabling libFuzzer execution in PR CI and adding parser-limit plumbing for multiple fuzzers/KQL paths. I found one blocker and several major issues that are still live in the current head, so this is not ready to merge.

Findings

❌ Blockers
- [programs/local/fuzzers/clickhouse_fuzzer.cpp:200-245] fuzzerSigalrmHandler waits via poll inside an async signal handler. poll is not async-signal-safe, so timeout handling can deadlock or stall in libc-internal lock states and lose timeout diagnostics.
  Suggested fix: keep the signal handler strictly async-signal-safe (set a flag / emit minimal diagnostics) and move waiting/coordination out of signal context.
⚠️ Majors
- [src/Parsers/Kusto/KustoFunctions/IParserKQLFunction.cpp:87,212,223,248] KQL expression-size checks are hardcoded to DBMS_DEFAULT_MAX_QUERY_SIZE instead of the effective parse-time limit. If max_query_size is configured above default, KQL can still throw at default threshold, making configured limits ineffective on this path.
  Suggested fix: compare against the active query-size limit for this parse flow, not a compile-time default.
- [ci/jobs/libfuzzer_test_check.py:313-318,360-368] artifact collection is incomplete: minimization still skips mini-oom-*, and trace files (*.trace) are not collected despite being generated by process_error. This drops evidence from reported failures.
  Suggested fix: include mini-oom-* and *.trace in artifact collection for both minimization and main fuzzing branches.
- [src/Formats/fuzzers/format_fuzzer.cpp:108-117, src/Interpreters/fuzzers/execute_query_fuzzer.cpp:118-127] unknown/invalid settings are only logged and then ignored in LLVMFuzzerInitialize, so CI misconfiguration silently falls back to defaults.
  Suggested fix: fail fast on unknown/invalid settings.
- [src/AggregateFunctions/fuzzers/aggregate_function_state_deserialization_fuzzer.cpp:109-135] parseSettingsFromArgs captures all key/value pairs after -ignore_remaining_args, but only three keys are consumed; unknown keys are silently ignored.
  Suggested fix: track consumed keys and fail on unexpected leftovers.
- [src/AggregateFunctions/fuzzers/aggregate_function_state_deserialization_fuzzer.cpp:117, src/DataTypes/fuzzers/data_type_deserialization_fuzzer.cpp:113, src/Parsers/fuzzers/create_parser_fuzzer.cpp:95, src/Parsers/fuzzers/select_parser_fuzzer.cpp:94] std::stoul accepts signed text (e.g. -1) and wraps to huge unsigned values, so invalid negative limits can silently disable intended parser/AST limits.
  Suggested fix: reject signed prefixes explicitly before conversion, then keep strict full-string/range validation.
💡 Nits
- [tests/fuzz/runner.py:193] typo in comment: libFuzer -> libFuzzer.

Final Verdict

Status: ⚠️ Request changes

Minimum required actions:

- Remove non-async-signal-safe waiting from fuzzerSigalrmHandler.
- Fix limit handling consistency (max_query_size, signed numeric parsing, unknown-key handling).
- Complete artifact collection (mini-oom-*, *.trace) for minimization/fuzzing reports.
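
The unknown-key findings above ask for fuzzer settings to fail fast instead of being silently ignored. A minimal sketch of that check follows. `findUnknownKeys` is a hypothetical helper name and the map/set types are assumptions for illustration, not the data structures used by `parseSettingsFromArgs` in this PR.

```cpp
#include <map>
#include <set>
#include <string>
#include <vector>

/// Hypothetical helper (not from the PR): returns every key in `settings`
/// that is not listed in `known_keys`, in sorted order.
/// A caller would abort initialization if the result is non-empty.
std::vector<std::string> findUnknownKeys(
    const std::map<std::string, std::string> & settings,
    const std::set<std::string> & known_keys)
{
    std::vector<std::string> unknown;
    for (const auto & entry : settings)
    {
        if (known_keys.count(entry.first) == 0)
            unknown.push_back(entry.first);
    }
    return unknown;
}
```

With this shape, a misspelled option name in a CI configuration surfaces as a reported leftover key rather than a silent fallback to defaults.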

clickhouse-gh · 2026-03-17T13:21:24Z

-            # arguments as usual, without any special measures, but initialization of libFuzer driver then should take
-            # arguments from FUZZER_ARGS environment variable.
-            use_fuzzer_args = parser.getboolean("CI", "FUZZER_ARGS", fallback=False)
+        # FUZZER_ARGS flag is used to make it deliver libFuzzer arguments throught FUZZER_ARGS environment variable


Minor typo in this comment block: throught → through, and libFuzer → libFuzzer.

through is fixed, but libFuzer is still a typo in the same comment block (tests/fuzz/runner.py, line 193 in the current head). Please rename it to libFuzzer.

CLAassistant · 2026-03-20T13:35:47Z

Thank you for your submission! We really appreciate it. Like many open source projects, we ask that you all sign our Contributor License Agreement before we can accept your contribution.
2 out of 3 committers have signed the CLA.

✅ yakov-olkhovskiy
✅ alexey-milovidov
❌ cursoragent
You have signed the CLA already but the status is still pending? Let us recheck it.

…heriting constructor

Resolve merge conflicts from PR #102159 (KQL conformance tests):
- KQLStringFunctions.cpp: take master's getConvertedArgument rewrite for parse_json(dynamic(...))
- ParserKQLMakeSeries.cpp: take master's timespan.toSeconds() fix

Update 18 sites in new/modified KQL parsers (ParserKQLCount, ParserKQLTop, ParserKQLUnion, ParserKQLJoin, ParserKQLQuery, ParserKQLStatement) to use the Pos(Tokens&, const Pos&) constructor instead of Pos(Tokens&, max_depth, max_backtracks), preserving recursion depth counters across re-tokenized sub-expressions.

Regenerate .github/workflows/pull_request.yml.

Co-authored-by: Yakov Olkhovskiy <yakov-olkhovskiy@users.noreply.github.com>

clickhouse-gh · 2026-05-11T23:19:16Z

+            try
+            {
+                size_t pos = 0;
+                uint64_t val = std::stoul(it->second, &pos);


std::stoul accepts signed text and wraps it to an unsigned value ("-1" -> 18446744073709551615), so invalid negative limits are silently treated as huge parser/AST limits instead of being rejected.

That breaks the contract of these CLI knobs and can disable the intended safety limits in CI by typo. Please reject signed inputs before conversion (or parse as unsigned with strict validation) and fail fast on invalid values.
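
One way to reject signed text before conversion, as requested above, is `std::from_chars`: for unsigned types it accepts neither a sign prefix nor leading whitespace, and it reports overflow instead of wrapping. A minimal sketch follows. `parseUnsignedLimit` is a hypothetical helper name, not code from this PR.

```cpp
#include <charconv>
#include <cstdint>
#include <optional>
#include <string_view>
#include <system_error>

/// Hypothetical helper (not from the PR): parse a non-negative decimal limit.
/// Returns std::nullopt for empty input, a '+' or '-' prefix, whitespace,
/// trailing garbage, or a value that does not fit into uint64_t.
std::optional<uint64_t> parseUnsignedLimit(std::string_view text)
{
    if (text.empty())
        return std::nullopt;

    uint64_t value = 0;
    const char * begin = text.data();
    const char * end = begin + text.size();

    /// For unsigned types std::from_chars does not recognize a sign
    /// and does not skip leading whitespace.
    auto [ptr, ec] = std::from_chars(begin, end, value, 10);

    /// Require success and full consumption of the input.
    if (ec != std::errc() || ptr != end)
        return std::nullopt;

    return value;
}
```

A caller in `LLVMFuzzerInitialize` could then treat an empty result as a fatal configuration error rather than logging and continuing.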

clickhouse-gh · 2026-05-11T23:19:17Z

+            try
+            {
+                size_t pos = 0;
+                uint64_t val = std::stoul(it->second, &pos);


Same issue here: std::stoul accepts signed strings and wraps them to very large unsigned values ("-1" parses successfully), so malformed negative limits are silently accepted.

Please validate and reject signed input explicitly before conversion, then fail fast; otherwise parser/AST safety limits can be unintentionally neutralized.

clickhouse-gh · 2026-05-12T23:02:47Z

            out.append(*argument);
        }

+        if (out.size() > DBMS_DEFAULT_MAX_QUERY_SIZE)


These checks use DBMS_DEFAULT_MAX_QUERY_SIZE instead of the active parser limit.
If max_query_size is configured above the default, KQL parsing can still throw here at the default threshold, which makes max_query_size ineffective for this path.

Please compare against the effective query-size limit used by this parse (instead of the hardcoded default), so KQL behavior stays consistent with normal SQL parsing limits.
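
The request above amounts to passing the limit in effect for the current parse into the size check instead of reading a compile-time constant. A minimal sketch under stated assumptions: `exceedsQuerySizeLimit` is a hypothetical name, and treating `0` as "no limit" is an assumption of this sketch, not something confirmed by the PR.

```cpp
#include <cstddef>

/// Hypothetical helper (not from the PR).
/// `max_query_size` is the limit active for this parse flow.
/// Assumption of this sketch: 0 means "no limit".
constexpr bool exceedsQuerySizeLimit(std::size_t expression_size, std::size_t max_query_size)
{
    return max_query_size != 0 && expression_size > max_query_size;
}
```

With the limit passed in, a KQL expression that is larger than the default but still within a raised `max_query_size` is no longer rejected.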

clickhouse-gh · 2026-05-13T02:07:08Z

LLVM Coverage Report

Changed lines: 76.61% (285/372) · Uncovered code

Full report · Diff report

Avoid calling CLD2 language detection on empty UTF-8 strings in detectLanguage and detectLanguageMixed. This prevents a MemorySanitizer use-of-uninitialized-value in UTF8OneCharLen observed in PR CI run #99740.

Co-authored-by: Yakov Olkhovskiy <yakov-olkhovskiy@users.noreply.github.com>

Commit b5a006c ("add parameters for select_parser_fuzzer, set default threshoulds lower", May 11 2026) made the per-input parser limits in `select_parser_fuzzer` configurable via `-ignore_remaining_args=1` plus `-max_parser_backtracks=N`. As a side effect it changed the in-source default for `max_parser_backtracks` from `5000` to `DBMS_DEFAULT_MAX_PARSER_BACKTRACKS` (= 1,000,000), i.e. a 200x increase.

For sibling `create_parser_fuzzer`, the same author followed up with commit e9453b7 ("tune up create_parser_fuzzer settings") that restored a fuzzer-safe budget by adding

    [fuzzer_arguments]
    max_parser_backtracks = 10000

to `tests/fuzz/create_parser_fuzzer.options`. The matching update to `tests/fuzz/select_parser_fuzzer.options` was missed, so `libFuzzer` runs of `select_parser_fuzzer` parse each input with a 1,000,000-step backtrack budget. On pathological inputs (deeply-nested parentheses with mixed function-call / subquery shapes, exactly the pattern of closed issue ClickHouse#92282) the parser spends 20-26s before bailing, which is above the 20s `libFuzzer` timeout.

CIDB confirms the regression:

    day          select_parser_fuzzer FAIL/ERROR   distinct PRs
    2026-04-19   1                                 1   (PR ClickHouse#99740, author's branch)
    ...
    2026-05-11   1                                 1   (PR ClickHouse#99740)
    2026-05-13   24                                24  (24 unrelated PRs)
    2026-05-14   13                                13  (13 unrelated PRs)

The spike on 2026-05-13 starts exactly the first day master PRs picked up commit b5a006c. Every recent failure is a `timeout-*` artefact of the form `timeout after 20-26 seconds`, matching issue ClickHouse#92282.

The fix is the minimum config change to restore the previously known-good per-input backtrack budget for `select_parser_fuzzer`, mirroring the `create_parser_fuzzer.options` fix.

Local mechanism reproduction (clickhouse-local, debug build):

    SELECT te(((...((SELECT tuple(...   (~470 bytes, malformed)
    --max_parser_depth=300 --max_parser_backtracks=10000   -> 0.16s (TOO_SLOW_PARSING)
    --max_parser_depth=300 --max_parser_backtracks=1000000 -> 0.92s (~6x slower for the same input)

Pre-PR validation gate:
- Deterministic repro: yes (see above).
- Root cause explained: yes (regression date matches commit b5a006c).
- Fix matches root cause: yes (restore 10k backtrack budget that was intentionally set in PR ClickHouse#92325 to fix the timeout class in ClickHouse#92282).
- Test intent preserved: yes; the fuzzer still exercises the SELECT parser with the same depth (300/150), AST element / depth limits, and corpus. Only the per-input backtrack budget changes.
- Both directions demonstrated: yes; the local clickhouse-local repro shows the 6x time difference, and CIDB shows the regression spike exactly aligned with the commit.

Refs:
- Tracker issue ClickHouse#93438 ("libFuzzer tests are flaky").
- Originating closed issue ClickHouse#92282 (`select_parser_fuzzer` timeout on deeply nested parens).
- Sibling fix: commit e9453b7 (`create_parser_fuzzer.options`).
- Regression: commit b5a006c (`select_parser_fuzzer.cpp`).

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

`clickhouse_fuzzer` runs the full `clickhouse-local` binary (see `programs/local/fuzzers/clickhouse_fuzzer.cpp`) which spins up a complete clickhouse-server-grade runtime: JeMalloc arenas, OpenSSL initializer, static initializers, and full query execution. Baseline RSS is close to the libFuzzer default `rss_limit_mb=2048`, so fuzzed inputs that trigger even moderate query-time allocations push the process past the limit.

CIDB shows 7 unrelated PRs in 7 days all hitting OOM at 2049-2056 Mb, just barely over the 2048 Mb default. Failure mode is uniform:

    libFuzzer out-of-memory (used: 2049-2056Mb; limit: 2048Mb)
    oom-<hash>

Bumping to 4096 Mb follows existing precedent for fuzzers with similar workloads:
- `execute_query_fuzzer.options`: rss_limit_mb = 4096 (also runs query execution; same author bumped this in commit bafb7d7)
- `data_type_deserialization_fuzzer.options`: rss_limit_mb = 4096
- `create_parser_fuzzer.options`: rss_limit_mb = 6144

The limit still bounds the target (it does not disable the check via `rss_limit_mb=0`), so genuine memory regressions above 4 GB will still be caught.

CIDB evidence (last 30 days, all 10 hits; 0 on master because libFuzzer does not run on master regularly):
- PR ClickHouse#99740 (4 hits, Apr 19-21)
- PR ClickHouse#104231, ClickHouse#104849, ClickHouse#104956, ClickHouse#96844, ClickHouse#104510, ClickHouse#104492 (1 hit each, May 13-15)

libfuzzer arms `setitimer(ITIMER_REAL)` with an interval of `UnitTimeoutSec / 2 + 1` seconds (11 s for our 20 s timeout), so SIGALRM fires periodically, not only at the actual timeout. The wrapper handler in `clickhouse_fuzzer.cpp` had three issues that together produced spurious "timeout after 21 seconds" failures (seen on PRs ClickHouse#99740 and ClickHouse#104869):

1. SIGALRM has no per-thread routing. The handler is installed process-wide via `sigaction`, so the kernel can deliver it to either the libfuzzer main thread or the runner thread. When it landed on the runner, `fuzzerSigalrmHandler` `pthread_kill`'d SIGUSR1 to itself and raced with a concurrent invocation on the main thread, with two competing `sigaction` restores corrupting handler state.
2. The slow runner-stack dump (shells out to llvm-symbolizer) ran on every periodic SIGALRM, even when the iteration was nowhere near its timeout. The dump alone takes many seconds; combined with the 3-second wait-for-stack window it could push an otherwise-fast iteration past the 20 s limit and trigger a real timeout.
3. The wrapper permanently restored libfuzzer's original handler before `raise(SIGALRM)`, making itself one-shot. A periodic SIGALRM at second ~11 consumed the wrapper, so when the genuine timeout fired at second 20 there was no longer any wrapper to dump the runner stack.

Fix all three:
- Block SIGALRM on the runner thread at the top of `runLibFuzzer` so the wrapper only ever runs on the libfuzzer main thread.
- Track per-iteration start time via `clock_gettime(CLOCK_MONOTONIC)` and skip the SIGUSR1 stack-dump unless elapsed ≥ 15 s, so the 11 s periodic alarms pass through without disturbing the runner. Guard with an atomic to keep the dump single-shot per process.
- Call libfuzzer's original handler directly (via the saved `sigaction.sa_sigaction` / `sa_handler`) instead of permanently restoring it. The wrapper stays installed and keeps intercepting later alarms.

CI report: https://s3.amazonaws.com/clickhouse-test-reports/json.html?PR=104869&sha=4bb6825eb6220feaca31f3d2608dae09d5de7927&name_0=PR&name_1=libFuzzer%20tests

ClickHouse#104869

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
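
The elapsed-time gate and handler chaining described in this commit message can be sketched roughly as follows. This is an illustration under stated assumptions, not the code in `clickhouse_fuzzer.cpp`: every identifier below is hypothetical, the stack-dump request itself is left as a comment, and blocking SIGALRM on the runner thread is not shown.

```cpp
#include <atomic>
#include <signal.h>
#include <time.h>

namespace
{
/// All names in this sketch are hypothetical.
constexpr long stack_dump_threshold_sec = 15;

std::atomic<long> iteration_start_sec{0};  /// set at the start of each fuzz iteration
std::atomic<bool> stack_dump_done{false};  /// keeps the dump single-shot per process
struct sigaction original_action{};        /// handler saved when the wrapper was installed

long monotonicSeconds()
{
    timespec ts{};
    clock_gettime(CLOCK_MONOTONIC, &ts);
    return ts.tv_sec;
}
}

/// Periodic alarm interval quoted above: UnitTimeoutSec / 2 + 1.
constexpr long alarmIntervalSec(long unit_timeout_sec)
{
    return unit_timeout_sec / 2 + 1;
}

/// True only for the first alarm that arrives at least
/// `stack_dump_threshold_sec` seconds into the current iteration.
bool shouldDumpRunnerStack(long now_sec)
{
    if (now_sec - iteration_start_sec.load() < stack_dump_threshold_sec)
        return false;
    return !stack_dump_done.exchange(true);
}

void sigalrmWrapper(int sig, siginfo_t * info, void * context)
{
    if (shouldDumpRunnerStack(monotonicSeconds()))
    {
        /// Request the runner-stack dump here (omitted in this sketch).
    }

    /// Chain to the saved handler instead of restoring it,
    /// so the wrapper stays installed for later alarms.
    if (original_action.sa_flags & SA_SIGINFO)
    {
        if (original_action.sa_sigaction)
            original_action.sa_sigaction(sig, info, context);
    }
    else if (original_action.sa_handler != SIG_DFL && original_action.sa_handler != SIG_IGN)
    {
        original_action.sa_handler(sig);
    }
}
```

For a 20 s unit timeout the periodic alarm arrives at about 11 s, which is below the 15 s threshold and passes straight through to the original handler. Only a later alarm in the same iteration can request the dump, and only once.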

Revert "Revert "CI: run libFuzzers in PRs, w/o corpus upload""

03f852d

clickhouse-gh Bot added the pr-not-for-changelog label (This PR should not be mentioned in the changelog) Mar 17, 2026

clickhouse-gh Bot reviewed Mar 17, 2026

yakov-olkhovskiy added 2 commits March 17, 2026 21:53

add env logging

ab77742

fix env in runner

b88f26f

clickhouse-gh Bot reviewed Mar 17, 2026

Comment thread tests/fuzz/runner.py

yakov-olkhovskiy added 5 commits March 17, 2026 23:37

fix env in runner

ea5d557

fix

d78c763

fix error parser

ff32344

add settings arguments to execute_query_fuzzer, tune up options

4a8c293

fix format_fuzzer, tune up execute_query_fuzzer options

bafb7d7

clickhouse-gh Bot reviewed Mar 18, 2026

Comment thread ci/jobs/libfuzzer_test_check.py

alexey-milovidov and others added 2 commits March 19, 2026 05:20

Merge branch 'master' into revert-99696-revert-99530-ci-libfuzzer-for-prs

f49cf20

refactor aggregate_function_state_deserialization_fuzzer to accept max_parser_depth, max_parser_backtracks, max_query_size as options, set max_parser_depth = 100, max_parser_backtracks = 1000

dbdc3ec

clickhouse-gh Bot reviewed Mar 19, 2026

Comment thread src/AggregateFunctions/fuzzers/aggregate_function_state_deserialization_fuzzer.cpp Outdated

yakov-olkhovskiy and others added 4 commits March 19, 2026 17:17

fix typo

540ed43

fix style

f80e159

add options to data_type_deserialization_fuzzer, tune up

7241c73

Merge remote-tracking branch 'origin/master' into cursor/pull-request-workflow-conflicts-7a3b

c36ae38

# Conflicts:
# .github/workflows/pull_request.yml

Co-authored-by: Yakov Olkhovskiy <yakov-olkhovskiy@users.noreply.github.com>

Merge remote-tracking branch 'origin/master' into revert-99696-revert-99530-ci-libfuzzer-for-prs

a9df352

clickhouse-gh Bot reviewed Mar 20, 2026

Comment thread src/Interpreters/fuzzers/execute_query_fuzzer.cpp

yakov-olkhovskiy added 3 commits March 23, 2026 22:35

fix execute_query_fuzzer

7461326

tune up

fd7a8de

fix fuzzers CI to collect oom- results too

16e277e

clickhouse-gh Bot reviewed Mar 24, 2026

Comment thread tests/fuzz/runner.py

cursoragent and others added 3 commits April 8, 2026 20:22

Merge remote-tracking branch 'origin/master' into cursor/pull-request-workflow-conflicts-7a3b

c29cad3

# Conflicts:
# .github/workflows/pull_request.yml

Co-authored-by: Yakov Olkhovskiy <yakov-olkhovskiy@users.noreply.github.com>

limit KQL parser by DBMS_DEFAULT_MAX_QUERY_SIZE

d9ea851

limit KQL parser by DBMS_DEFAULT_MAX_QUERY_SIZE

c98c8d7

cursoragent and others added 4 commits May 11, 2026 02:03

add parameters for select_parser_fuzzer, set default threshoulds lower

b5a006c

add parameters for create_parser_fuzzer, set default threshoulds lower

35d7483

tune up create_parser_fuzzer settings

e9453b7

clickhouse-gh Bot reviewed May 11, 2026

Merge remote-tracking branch 'origin/master' into cursor/pull-request-workflow-conflicts-7a3b

fc6afd7

# Conflicts:
# .github/workflows/pull_request.yml

Co-authored-by: Yakov Olkhovskiy <yakov-olkhovskiy@users.noreply.github.com>

clickhouse-gh Bot reviewed May 12, 2026

yakov-olkhovskiy requested a review from alexey-milovidov May 13, 2026 10:13

alexey-milovidov approved these changes May 13, 2026

alexey-milovidov self-assigned this May 13, 2026

alexey-milovidov added this pull request to the merge queue May 13, 2026

Merged via the queue into master with commit 3bbd581 May 13, 2026
167 checks passed

alexey-milovidov deleted the revert-99696-revert-99530-ci-libfuzzer-for-prs branch May 13, 2026 12:41

robot-ch-test-poll4 added the pr-synced-to-cloud label (The PR is synced to the cloud repo) May 13, 2026

groeneai mentioned this pull request May 14, 2026

Cap select_parser_fuzzer max_parser_backtracks at 10000 #104922

Merged

1 task

This was referenced May 16, 2026

Enable optimize_or_like_chain by default. #94517

Draft

Add test for #70356 #104551

Merged

CI: bump rss_limit_mb to 4096 for clickhouse_fuzzer to stop chronic OOMs at 2048 Mb default #105134

Merged

alexey-milovidov mentioned this pull request May 17, 2026

Stop clickhouse_fuzzer from self-inflicting timeouts via SIGALRM signals #105176

Merged

1 task

This was referenced May 18, 2026

Support WITH TIES for negative LIMIT #100930

Merged

Fix MSan use-of-uninitialized-value in UTF-8 case-insensitive StringSearcher #105223

Merged

Fix exception on IN tuple() against Distributed sharded table #104966

Merged

alexey-milovidov mentioned this pull request Jun 8, 2026

Support constant haystack with non-constant needle in LIKE/ILIKE/match #100479

Merged

1 task

Revert "Revert "CI: run libFuzzers in PRs, w/o corpus upload"" #99740

alexey-milovidov merged 54 commits into master from revert-99696-revert-99530-ci-libfuzzer-for-prs

5 participants
