iframe-proxy

serxa · 2026-06-24T15:40:58Z

Fixes a ThreadSanitizer data race on master, found by the Stateless tests (amd_tsan) job.

BlockIO::onFinish released the query's workload resources — which resets the MemoryReservation — before finalizing the pipeline. Pipeline executor threads hold raw pointers to that MemoryReservation (WorkloadResources in PipelineExecutor reads QueryStatus::getMemoryReservation() at thread startup and uses it for syncWithMemoryTracker), so resetting it while the pipeline is still being finalized races with — and can outlive — those threads.

TSan trace: unique_ptr<MemoryReservation>::reset (via releaseWorkloadResources from BlockIO::onFinish) vs getMemoryReservation (in a PipelineExecutor::spawnThreads worker).

onException / onCancelOrConnectionLoss already do the right thing (stop the pipeline, then release); onFinish and BlockIO::reset did the opposite.

The query slot, in contrast, is intentionally released early: until it is released the query keeps occupying a concurrency slot even though the client already considers the query finished, which would needlessly block the next query. Pipeline threads do not touch the query slot, so this is safe. Only the memory reservation needs to wait.

So the release is split:

QueryStatus/BlockIO/Context gain releaseQuerySlot(), and QueryStatus/BlockIO also gain releaseMemoryReservation(). releaseWorkloadResources() still releases both and is used by the error/cancel paths (pipeline already stopped).
BlockIO::onFinish: release the query slot early, finalize the pipeline (which joins the executor threads — PipelineExecutor::pool is wait()-ed and joined on destruction), then release the memory reservation.
BlockIO::reset: reset the pipeline before releasing resources.
executeQuery: release only the query slot early; the reservation is released later by streams.onFinish().
Removed the now-unused Context::releaseWorkloadResources.

CI report: https://s3.amazonaws.com/clickhouse-test-reports/json.html?REF=master&sha=cd3c529cd3a8002caa1a4ba585a39da636836784&name_0=MasterCI&name_1=Stateless%20tests%20%28amd_tsan%2C%20parallel%2C%202%2F2%29

Changelog category (leave one):

Not for changelog (changelog entry is not required)

Fixes: #108393

Version info

Merged into: 26.7.1.89

`BlockIO::onFinish` released the memory reservation before finalizing the pipeline, racing with executor threads that hold raw pointers to it. Release the query slot early (as before, for slot reuse) but the memory reservation only after the pipeline is finalized, matching `onException`. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

clickhouse-gh · 2026-06-24T15:41:35Z

Workflow [PR], commit [d869a6d]

Summary: ❌

job_name	test_name	status	info	comment
Integration tests (amd_asan_ubsan, db disk, old analyzer, 2/6)		FAIL
	test_quorum_inserts_parallel/test.py::test_parallel_quorum_actually_quorum	FAIL	cidb	IGNORED
AST fuzzer (amd_debug)		FAIL
	Logical error: Block structure mismatch in A stream: different columns: (STID: 0993-27f0)	FAIL	cidb, issue	ISSUE EXISTS

AI Review

Summary

This PR fixes the MemoryReservation lifetime race by splitting query-slot and reservation release, but the current final release point regresses query-level scheduler accounting: successful queries can log MemoryReservationIncreases without the matching MemoryReservationDecreases. I would request changes until the reservation is released after all pipeline threads are joined but before the query-log profile-event snapshot, or the observable contract is adjusted with replacement coverage.

Findings

⚠️ Majors

[src/QueryPipeline/BlockIO.cpp:86] BlockIO::onFinish now destroys the MemoryReservation only after finalize_query_pipeline and finish_callbacks complete. In the logging path, finalize_query_pipeline calls finalizeQueryPipelineBeforeLogging, which runs CurrentThread::finalizePerformanceCounters before returning, so the destructor's MemoryReservationDecreases event is emitted after the query_log snapshot. Users then see a query with MemoryReservationIncreases = 1 and no matching decrease, and the PR weakens test_reserve_memory to accept that loss. Release all BlockIO::process_list_entries reservations immediately after query_pipeline.reset joins executor threads but before CurrentThread::finalizePerformanceCounters, or preserve another query-scoped accounting path. Existing discussion: Fix data race on MemoryReservation release at query finish #108391 (comment)

Tests

⚠️ [tests/integration/test_scheduler_memory/test.py:127] The changed test removes the only successful-query assertion that caught the lost MemoryReservationDecreases attribution. Keep a focused assertion for the successful BlockIO::onFinish path, or, if per-query decreases are intentionally no longer supported, add replacement coverage that MemoryReservationApproved returns to zero after successful finish so the weaker query_log assertion does not hide a missed release.

Final Verdict

Status: ⚠️ Request changes

Minimum required action: preserve query-level MemoryReservationDecreases attribution, or provide an explicit replacement contract and coverage that keeps memory-reservation release observable after successful query finish.

serxa · 2026-06-24T15:51:02Z

The race was introduced together with the MemoryReservation / reserve_memory feature in #82414.

azat · 2026-06-24T16:22:19Z

-    releaseWorkloadResources();
+    /// Reset the pipeline before releasing workload resources: pipeline threads hold raw pointers
+    /// to `MemoryReservation` (see `WorkloadResources` in `PipelineExecutor`), so the reservation
+    /// must outlive them.
    resetPipeline(/*cancel=*/false);
+    releaseWorkloadResources();


So this is the fix? (plus releasing query slot separatelly earlier)

Exactly. I blindly followed the query slot logic in the original PR, but memory reservations are actually a bit different

We probably can run sync with MemoryTracker here as well when we release the slot to make sure memory consumption is reduced to as little as possible. But I fear we will introduce unnecessary complexity. Not worth it IMO

serxa · 2026-06-25T11:50:01Z

The failed test is indeed related. Working on it. It's pretty obvious that we moved the memory reservation release to a later point, and we now can't rely on the MemoryReservationDecreases profile event to be triggered after the query stops because the memory reservation now exists longer.

E   AssertionError: Profile event MemoryReservationDecreases check failed for query test_production, got 0

The previous commit released the `MemoryReservation` only at the very end of `BlockIO::onFinish`, after `finalize_query_pipeline` (and therefore after `CurrentThread::finalizePerformanceCounters`) had already run. The `MemoryReservationDecreases` profile event emitted by the reservation's destructor on the query thread then landed after the `query_log` snapshot and was lost, so `test_scheduler_memory/test.py::test_reserve_memory` failed with `MemoryReservationDecreases ... got 0`. Release the memory reservation inside `finalizeQueryPipelineBeforeLogging`, between `query_pipeline.reset()` and `finalizePerformanceCounters`. This is the only point that satisfies every constraint at once: the pipeline (and its threads) have already been torn down so there is no data race and the query's real memory is already freed (no under-reporting to the memory scheduler), while the event is still emitted before the `query_log` snapshot so it stays attributed to the query. The reservation also stays alive through `query_finish_callback`, which runs before `streams.onFinish`. `BlockIO::onFinish` keeps releasing the memory reservation only on the non-logging path; the logging path releases it inside the finalize step. Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>

clickhouse-gh · 2026-06-25T12:45:48Z

+    /// `MemoryReservationDecreases` profile event is recorded in `query_log`.
+    context->releaseMemoryReservation();
+
    /// Update performance counters before logging to query_log


This only releases the reservation attached to the outer Context, but BlockIO can also own process-list entries inherited from an inner executeQuery result. For example, rewritten queries such as SHOW TABLES return the internal query's BlockIO, then the outer executeQueryImpl appends its own process_list_entry; the pipeline being reset here belongs to the inner query, while this call releases only the outer query's reservation. Since the logging branch of BlockIO::onFinish no longer calls BlockIO::releaseMemoryReservation, the inner reservation survives until BlockIO is destroyed or cleared, after CurrentThread::finalizePerformanceCounters and after the query_log snapshot. That loses the promised MemoryReservationDecreases attribution for those queries and keeps MemoryReservationApproved charged through finish callbacks/final protocol send. Please release every entry in BlockIO::process_list_entries at this point, e.g. by having finalize_query_pipeline take a release callback and invoking BlockIO::releaseMemoryReservation after query_pipeline.reset but before CurrentThread::finalizePerformanceCounters.

…ery_log" This reverts commit 2db56a1.

…memory The reservation is released at query teardown (BlockIO::onFinish), after the query's ProfileEvents are snapshotted into query_log, so the decrease is not reliably attributed to the query. Asserting it per-query is race-prone; the test now checks only MemoryReservationIncreases/Killed/Failed. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>

clickhouse-gh · 2026-06-25T21:27:21Z

LLVM Coverage Report

Metric	Baseline	Current	Δ
Lines	85.40%	85.40%	+0.00%
Functions	92.60%	92.60%	+0.00%
Branches	77.60%	77.60%	+0.00%

Changed lines: Changed C/C++ lines covered by tests: 24/24 (100.00%) | Lost baseline coverage (was covered on master, now uncovered in this PR): 2 line(s) · Uncovered code

Full report · Diff report

serxa · 2026-06-25T22:16:40Z

@groeneai, investigate the failure: https://github.com/ClickHouse/ClickHouse/actions/runs/28187956117/job/83517213709 and provide a fix in a separate PR. If the fix is already in progress, link it here.

Integration tests (amd_asan_ubsan, db disk, old analyzer, 2/6) → test_quorum_inserts_parallel/test.py::test_parallel_quorum_actually_quorum fails with helpers.client.QueryTimeoutExceedException: Client timed out! on a plain INSERT INTO q VALUES(3, 'Hi'). There is no server crash or logical error, and it is unrelated to this PR: this PR only changes the MemoryReservation release point at query finish (Context/ProcessList/BlockIO), and that path is dormant unless a memory RESOURCE + query-level reserve_memory are configured, which this test does not do. Looks like an infra/timing flake; I couldn't find an existing tracking issue.

For completeness, the other red — AST fuzzer (amd_debug): Block structure mismatch in UnionStep stream (STID 0993-27f0) — was already auto-marked flaky and matched to #108142, so no action is needed there.

groeneai · 2026-06-25T22:51:55Z

…ouse#99475) Master (ClickHouse#108391, "Fix data race on MemoryReservation release at query finish") split BlockIO::releaseWorkloadResources() into releaseQuerySlot() (safe while the pipeline runs) and releaseMemoryReservation() (safe only after the pipeline has been finalized), because pipeline threads hold raw pointers to the MemoryReservation. BlockIO::onFinish() and executeQuery() now release the query slot early and the memory reservation later, inside onFinish, after the pipeline is finalized. The Prometheus query_log fix released full workload resources (including the memory reservation) before query_finish_callback / io.onFinish, which after that change would re-introduce the same data race in the Prometheus path. Switch the early release to io.releaseQuerySlot() to match the updated executeQuery: the slot is still freed before the slow HTTP final flush, while the memory reservation is released by io.onFinish() once the pipeline has been finalized.

clickhouse-gh Bot added the pr-not-for-changelog This PR should not be mentioned in the changelog label Jun 24, 2026

clickhouse-gh Bot reviewed Jun 24, 2026

View reviewed changes

Comment thread src/Interpreters/executeQuery.cpp

azat self-assigned this Jun 24, 2026

azat approved these changes Jun 24, 2026

View reviewed changes

azat mentioned this pull request Jun 24, 2026

Fix data race on query MemoryReservation during normal query finish #108395

Closed

Algunenano added the v26.6-must-backport label Jun 24, 2026

Ergus mentioned this pull request Jun 24, 2026

Text index postprocessor #98939

Merged

This was referenced Jun 25, 2026

Add canonical-format fast path to parseDateTimeBestEffort #108187

Merged

Fix server crash during shutdown when a database shutdown throws #108417

Merged

clickhouse-gh Bot reviewed Jun 25, 2026

View reviewed changes

serxa and others added 2 commits June 25, 2026 17:16

Revert "Keep MemoryReservationDecreases attributed to the query in qu…

fd86b3a

…ery_log" This reverts commit 2db56a1.

This was referenced Jun 25, 2026

Fix LOGICAL_ERROR in MergeTreeIndexConditionSet under MV layer type change #105552

Open

Restrict catboostEvaluate model path to user_files #108463

Open

groeneai mentioned this pull request Jun 25, 2026

Deny mergeTreeProjection and mergeTreeIndex reads that bypass a SELECT row policy #108462

Open

serxa mentioned this pull request Jun 25, 2026

Remove obsolete CustomResourceManager in favor of WorkloadResourceManager #108286

Open

serxa added this pull request to the merge queue Jun 25, 2026

groeneai mentioned this pull request Jun 25, 2026

Restore O(1) name-to-value lookup for String-to-Enum casts #108534

Open

Merged via the queue into master with commit 474a3c1 Jun 25, 2026
164 of 167 checks passed

serxa deleted the fix-memory-reservation-release-race branch June 25, 2026 23:49

robot-ch-test-poll2 added the pr-synced-to-cloud The PR is synced to the cloud repo label Jun 26, 2026

robot-clickhouse-ci-2 added the pr-must-backport-synced The `*-must-backport` labels are synced into the cloud Sync PR label Jun 26, 2026

groeneai mentioned this pull request Jun 26, 2026

Fix SQL injection in PostgreSQL wire protocol parameter binding #108469

Open

robot-ch-test-poll3 mentioned this pull request Jun 26, 2026

Cherry pick #108391 to 26.6: Fix data race on MemoryReservation release at query finish #108562

Open

groeneai mentioned this pull request Jun 26, 2026

Limit concurrent bcrypt authentications to bound CPU under auth floods #108527

Open

alexey-milovidov mentioned this pull request Jun 26, 2026

add option skip_unavailable_shards_mode #79091

Merged

1 task

serxa added v26.6-must-backport and removed v26.6-must-backport labels Jun 27, 2026

robot-ch-test-poll2 mentioned this pull request Jun 27, 2026

ThreadSanitizer: data race in QueryStatus memory reservation (releaseWorkloadResources vs getMemoryReservation) (STID: 4071-3348) #108393

Closed

Sunbelt Computer Software

PL/B Language Development and Support

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix data race on MemoryReservation release at query finish#108391

Fix data race on MemoryReservation release at query finish#108391
serxa merged 4 commits into
masterfrom
fix-memory-reservation-release-race

serxa commented Jun 24, 2026 •

edited by robot-clickhouse-ci-2

Loading

Uh oh!

clickhouse-gh Bot commented Jun 24, 2026 •

edited by serxa

Loading

Uh oh!

Uh oh!

serxa commented Jun 24, 2026

Uh oh!

azat Jun 24, 2026

Uh oh!

serxa Jun 24, 2026

Uh oh!

serxa Jun 24, 2026 •

edited

Loading

Uh oh!

serxa commented Jun 25, 2026 •

edited

Loading

Uh oh!

clickhouse-gh Bot Jun 25, 2026

Uh oh!

clickhouse-gh Bot commented Jun 25, 2026

Uh oh!

serxa commented Jun 25, 2026

Uh oh!

groeneai commented Jun 25, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

Sunbelt Computer Software

PL/B Language Development and Support

Uh oh!

Conversation

serxa commented Jun 24, 2026 • edited by robot-clickhouse-ci-2 Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Changelog category (leave one):

Version info

Uh oh!

clickhouse-gh Bot commented Jun 24, 2026 • edited by serxa Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

AI Review

Summary

Findings

Tests

Final Verdict

Uh oh!

Uh oh!

serxa commented Jun 24, 2026

Uh oh!

azat Jun 24, 2026

Choose a reason for hiding this comment

Uh oh!

serxa Jun 24, 2026

Choose a reason for hiding this comment

Uh oh!

serxa Jun 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

serxa commented Jun 25, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

clickhouse-gh Bot Jun 25, 2026

Choose a reason for hiding this comment

Uh oh!

clickhouse-gh Bot commented Jun 25, 2026

LLVM Coverage Report

Uh oh!

serxa commented Jun 25, 2026

Uh oh!

groeneai commented Jun 25, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

serxa commented Jun 24, 2026 •

edited by robot-clickhouse-ci-2

Loading

clickhouse-gh Bot commented Jun 24, 2026 •

edited by serxa

Loading

serxa Jun 24, 2026 •

edited

Loading

serxa commented Jun 25, 2026 •

edited

Loading