iframe-proxy

antaljanosbenjamin · 2026-04-08T09:58:45Z

Changelog category (leave one):

Bug Fix (user-visible misbehavior in an official stable release)

Changelog entry (a user-readable short description of the changes that goes into CHANGELOG.md):

Fixes incorrect row ordering in queries that use ORDER BY with the grace_hash join algorithm. Affected queries could return results in the wrong order, producing silently incorrect output.

Documentation entry for user-facing changes

Documentation is written (mandatory for new features)

Claude explanation:
Root Cause

The bug is in src/Processors/QueryPlan/Optimizations/optimizeReadInOrder.cpp, in the findReadingStep function (line 150). The optimize_read_in_order optimization propagates the "data is sorted" property through JOIN steps,
allowing the final ORDER BY to skip a full sort and rely on the MergeTree's key ordering. However, this optimization checked only the join kind (Inner/Left) and strictness (Any/All) — it never verified whether the join algorithm
preserves input row order.

Grace hash join destroys input order because it scatters rows into buckets by hash value. Rows from bucket 0 come first, then bucket 1, etc. — this has nothing to do with the original sort key order. When the sorting step relies
on a "prefix sort" (assuming data is already ordered), the output comes out in hash-bucket order rather than key order.

The Fix

One-line change: add && !join_ptr->hasDelayedBlocks() to the condition that allows read-in-order through joins. Joins with delayed blocks (grace hash join) reorder rows, so the optimization must not propagate through them.
Regular HashJoin and ConcurrentHashJoin process rows in a single pass preserving order, so they remain unaffected.

Claude explanation: Root Cause The bug is in src/Processors/QueryPlan/Optimizations/optimizeReadInOrder.cpp, in the findReadingStep function (line 150). The optimize_read_in_order optimization propagates the "data is sorted" property through JOIN steps, allowing the final ORDER BY to skip a full sort and rely on the MergeTree's key ordering. However, this optimization checked only the join kind (Inner/Left) and strictness (Any/All) — it never verified whether the join algorithm preserves input row order. Grace hash join destroys input order because it scatters rows into buckets by hash value. Rows from bucket 0 come first, then bucket 1, etc. — this has nothing to do with the original sort key order. When the sorting step relies on a "prefix sort" (assuming data is already ordered), the output comes out in hash-bucket order rather than key order. The Fix One-line change: add && !join_ptr->hasDelayedBlocks() to the condition that allows read-in-order through joins. Joins with delayed blocks (grace hash join) reorder rows, so the optimization must not propagate through them. Regular HashJoin and ConcurrentHashJoin process rows in a single pass preserving order, so they remain unaffected.

clickhouse-gh · 2026-04-08T09:59:29Z

PedroTadim · 2026-04-09T07:55:41Z

Does it fix #100781 ? If so, mention it to close it.

alexey-milovidov · 2026-04-10T02:27:49Z

The flaky check failure is fixed in #102148, let's update the branch.

clickhouse-gh · 2026-04-10T06:30:25Z

LLVM Coverage Report

Changed lines: 100.00% (8/8) | lost baseline coverage: 1 line(s) · Uncovered code

Full report · Diff report

alexey-milovidov

The changes are very clear to me, thanks!

It was disabled by #102036 and it is possibly the cleaner approach

clickhouse-gh Bot added the pr-bugfix Pull request with bugfix, not backported by default label Apr 8, 2026

antaljanosbenjamin linked an issue Apr 9, 2026 that may be closed by this pull request

Grace hash wrong result with number of buckets #100781

Closed

Merge branch 'master' into do-not-use-read-in-order-with-grace-hash-join

0fb153a

alexey-milovidov approved these changes Apr 10, 2026

View reviewed changes

alexey-milovidov self-assigned this Apr 10, 2026

alexey-milovidov added this pull request to the merge queue Apr 10, 2026

Merged via the queue into master with commit 3412f87 Apr 10, 2026
164 checks passed

alexey-milovidov deleted the do-not-use-read-in-order-with-grace-hash-join branch April 10, 2026 08:46

robot-ch-test-poll4 added the pr-synced-to-cloud The PR is synced to the cloud repo label Apr 10, 2026

antaljanosbenjamin added a commit that referenced this pull request Apr 10, 2026

Remove read-in-order optimization related logic for SpillingHashJoin

e895cc4

It was disabled by #102036 and it is possibly the cleaner approach

Sunbelt Computer Software

PL/B Language Development and Support

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Do not use read-in-order optimization with grace hash join#102036

Do not use read-in-order optimization with grace hash join#102036
alexey-milovidov merged 2 commits into
masterfrom
do-not-use-read-in-order-with-grace-hash-join

antaljanosbenjamin commented Apr 8, 2026 •

edited

Loading

Uh oh!

clickhouse-gh Bot commented Apr 8, 2026 •

edited

Loading

Uh oh!

PedroTadim commented Apr 9, 2026 •

edited

Loading

Uh oh!

alexey-milovidov commented Apr 10, 2026

Uh oh!

clickhouse-gh Bot commented Apr 10, 2026

Uh oh!

alexey-milovidov left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Sunbelt Computer Software

PL/B Language Development and Support

Uh oh!

Conversation

antaljanosbenjamin commented Apr 8, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Changelog category (leave one):

Changelog entry (a user-readable short description of the changes that goes into CHANGELOG.md):

Documentation entry for user-facing changes

Uh oh!

clickhouse-gh Bot commented Apr 8, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

AI Review

Summary

Missing context

ClickHouse Rules

Final Verdict

Uh oh!

PedroTadim commented Apr 9, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

alexey-milovidov commented Apr 10, 2026

Uh oh!

clickhouse-gh Bot commented Apr 10, 2026

LLVM Coverage Report

Uh oh!

alexey-milovidov left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

antaljanosbenjamin commented Apr 8, 2026 •

edited

Loading

clickhouse-gh Bot commented Apr 8, 2026 •

edited

Loading

PedroTadim commented Apr 9, 2026 •

edited

Loading