iframe-proxy

nihalzp · 2026-05-06T14:53:38Z

Simplified version of #99581 that does not have many of the optimizations like no materialization during scatter, single hashing and reuse of hash, single serialization and reuse of serialized bytes.

There are mostly regressions since data is not usually high cardinality and evenly distributed but If enabled for perf tests, some tests have sped up:

Changelog category (leave one):

Performance Improvement

Changelog entry (a user-readable short description of the changes that goes into CHANGELOG.md):

New GROUP BY optimization for high cardinality evenly distributed keys that scatters rows across threads by hashing the grouping key, so each thread aggregates a disjoint subset of keys without a merge phase. Set enable_sharding_aggregator = 1 to enable it.

Documentation entry for user-facing changes

Documentation is written (mandatory for new features)

Version info

Merged into: 26.6.1.128

clickhouse-gh · 2026-05-06T14:54:32Z

clickhouse-gh

Posting inline findings only.

nihalzp · 2026-05-07T13:38:36Z

Similar performance improvement as the optimized sharded aggregation for high cardinality data and aggregate functions with expensive merge.

Some queries are slowed down bit more cases where data copy for scatter is expensive and we rehash. The following query was slowed down by +1.535x only in the optimized version.

clickhouse-gh · 2026-05-07T13:47:13Z

+        params.group_by_two_level_threshold_bytes = 0;
+
+        /// Sharded aggregation does not implement temporary-file spill/merge yet.
+        params.max_bytes_before_external_group_by = 0;


Disabling params.max_bytes_before_external_group_by here means the sharded path ignores external aggregation and always keeps hash tables in memory. With optimize_aggregation_by_sharding enabled by default, this is a behavior change for queries that previously spilled to disk: they can now hit memory-limit exceptions instead.

Please either (1) keep sharded aggregation disabled by default until spill support exists, or (2) do not take the sharded path when external aggregation settings are configured, so existing memory-safety behavior is preserved.

params.max_bytes_before_external_group_by is still unconditionally reset to 0 in the sharded path (AggregatingStep::transformPipeline, line 344 in current head), so the earlier concern is still live.

The regression is now visible in this PR itself: multiple integration tests had to explicitly set enable_sharding_aggregator = 0 to keep external GROUP BY behavior (test_max_bytes_ratio_before_external_order_group_by_for_server, test_temporary_data_in_cache, test_tmp_policy, etc.). That means enabling sharded aggregation by default currently removes an existing spill-to-disk safety mechanism for eligible queries.

Please keep sharded aggregation off unless external aggregation is unsupported for that query shape, or gate this path when external-aggregation settings are active.

egor-click · 2026-05-07T16:30:06Z

Similar performance improvement as the optimized sharded aggregation for high cardinality data and aggregate functions with expensive merge.

Some queries are slowed down bit more cases where data copy for scatter is expensive and we rehash. The following query was slowed down by +1.535x only in the optimized version.

tbh i noticed this pr while working on perf comparison dashboard, and it looks like we have a lot of degradations, especially in quantile and some group by , some by 5-10x

nihalzp · 2026-05-07T17:10:28Z

tbh i noticed this pr while working on perf comparison dashboard, and it looks like we have a lot of degradations, especially in quantile and some group by , some by 5-10x

Yes, this is expected especially for low cardinality and skewed data. My comment was mainly comparing difference between this PR and #99581 (which is this PR + some optimizations). Both have many degradations.

nickitat · 2026-05-15T20:43:07Z

+
+    /// Try to pull a new input chunk.
+    input.setNeeded();
+    if (input.hasData())


We need to limit the max queue length and don't accept new chunks while already at this limit.

Yes, good catch. Actually, the original optimized sharded aggregation had it but later I removed it for this naive version because I thought since we split the chunks into smaller parts the memory will be equivalent whether the content is in hash table or in IColumn chunks. But then realized, for low cardinality case, the memory difference would be quite high between queued chunks in BufferedShardByHashTransform.cpp and their memory in hash tables after aggregation.

I have added a limit of 10 chunks at most. Maybe we can also put a limit in terms of rows or bytes.

nickitat · 2026-05-15T20:50:57Z

+    /// causing them to cluster into a small subset of hash table buckets.
+    /// The golden ratio constant ensures thorough bit mixing with a single multiply.
+    /// Combine the mix with Lemire fastrange to map into [0, num_shards) without a divide.
+    static constexpr size_t fibonacci_hash_multiplier = 0x9e3779b97f4a7c15ULL;


We already have similar logic implemented in scatterBlockByHashGeneric. Maybe we could reuse it.

I could not use scatterBlockByHashGeneric directly because the signature requires Block and but we only have Chunk and there would be conversion overhead if used. I decided to reuse only JoinCommon::hashToSelector to avoid doing hashing manually.

nickitat

All good, modulo a few comments.

clickhouse-gh · 2026-05-25T13:09:15Z

LLVM Coverage Report

Metric	Baseline	Current	Δ
Lines	84.20%	84.10%	-0.10%
Functions	91.40%	91.40%	+0.00%
Branches	76.60%	76.60%	+0.00%

Changed lines: 89.34% (218/244) · Uncovered code

Full report · Diff report

clickgapai · 2026-05-25T17:40:22Z

…gation Naive Sharded Aggregation for high cardinality data

nihalzp added 7 commits May 6, 2026 13:55

Add setting optimize_aggregation_by_sharding

f1239d9

Integrate the setting

442ff9b

Implement BufferedScatterByHashTransform

9ae797b

Add some minor optimizations

c5f79e4

Integrate sharded aggregation

7831c6d

Disable sharded aggregation in some integration tests

e6b7073

Add tests

9140e71

clickhouse-gh Bot added the pr-performance Pull request with some performance improvements label May 6, 2026

clickhouse-gh Bot reviewed May 6, 2026

View reviewed changes

nihalzp requested a review from nickitat May 7, 2026 13:38

Merge branch 'master' into naive-sharded-aggregation

b7f4dca

clickhouse-gh Bot reviewed May 7, 2026

View reviewed changes

Comment thread src/Processors/Transforms/BufferedShardByHashTransform.cpp

nickitat self-assigned this May 15, 2026

nickitat reviewed May 15, 2026

View reviewed changes

nihalzp added 11 commits May 18, 2026 12:23

Merge branch 'master' into naive-sharded-aggregation

0711c41

Change setting name to enable_sharding_aggregator

4f63ff7

Use "shard" instead of "scatter" everywhere

3591d44

Update tests

117f1ce

Remove comment

46385a9

Respect pipeline width

3e4addb

Update tests

92188d5

Make skipping for small types more systematic

426b62f

Make TODOs indexable

be36e96

Add TODO

da5214f

Avoid too much overhead from routing

315c341

nihalzp added 3 commits May 18, 2026 14:30

Add TODO for potential fallback

4549341

Keep the queue bounded

603904f

Reuse hashing logic from JoinUtils

8c82baa

nickitat approved these changes May 18, 2026

View reviewed changes

Comment thread src/Processors/QueryPlan/AggregatingStep.cpp Outdated

Comment thread tests/integration/test_max_bytes_ratio_before_external_order_group_by_for_server/test.py

Comment thread src/Processors/Transforms/BufferedShardByHashTransform.cpp Outdated

nihalzp added 5 commits May 18, 2026 20:20

Use only Lemire sharding

93aa345

Add missing TODO

19ea557

Refactor into a smaller helper

72b0648

Skip parallel replicas for EXPLAIN tests

fd30a9c

Disable the sharding aggregator by default

9c9852e

clickhouse-gh Bot reviewed May 18, 2026

View reviewed changes

Comment thread src/Processors/QueryPlan/AggregatingStep.cpp Outdated

nihalzp mentioned this pull request May 19, 2026

Add test for aggregation in order mismatch scenario #105369

Merged

nihalzp added 5 commits May 24, 2026 21:03

Merge branch 'master' into naive-sharded-aggregation

eca4c21

Move setting to 26.6

b541a2c

Use already computed settings adjusted max threads

d9d60ff

Skip the buggy test

f39d960

Merge branch 'master' into naive-sharded-aggregation

6c94086

nihalzp added this pull request to the merge queue May 25, 2026

Merged via the queue into ClickHouse:master with commit 4775f2d May 25, 2026
326 of 329 checks passed

nihalzp deleted the naive-sharded-aggregation branch May 25, 2026 17:14

robot-ch-test-poll4 added the pr-synced-to-cloud The PR is synced to the cloud repo label May 25, 2026

DavidHe-2008 pushed a commit to DavidHe-2008/ClickHouse that referenced this pull request Jun 1, 2026

Merge pull request ClickHouse#104233 from nihalzp/naive-sharded-aggre…

60a4aa5

…gation Naive Sharded Aggregation for high cardinality data

PedroTadim mentioned this pull request Jun 1, 2026

Logical error: Pipeline stuck still happens #106237

Open

groeneai mentioned this pull request Jun 1, 2026

Fix pipeline-stuck deadlock in BufferedShardByHashTransform #106251

Open

1 task

alexey-milovidov mentioned this pull request Jun 12, 2026

Feature: Paimon minmax index #100160

Open

1 task

groeneai mentioned this pull request Jun 12, 2026

Disable parallel replicas for 01891_not_in_partition_prune #107362

Merged

nihalzp mentioned this pull request Jun 21, 2026

Skew-robust Sharded Aggregation #108067

Open

Sunbelt Computer Software

PL/B Language Development and Support

Uh oh!

Conversation

nihalzp commented May 6, 2026 • edited by robot-clickhouse Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Changelog category (leave one):

Changelog entry (a user-readable short description of the changes that goes into CHANGELOG.md):

Documentation entry for user-facing changes

Version info

Uh oh!

clickhouse-gh Bot commented May 6, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

AI Review

Summary

Findings

Final Verdict

Uh oh!

clickhouse-gh Bot left a comment

Choose a reason for hiding this comment

Uh oh!

nihalzp commented May 7, 2026

Uh oh!

clickhouse-gh Bot May 7, 2026

Choose a reason for hiding this comment

Uh oh!

clickhouse-gh Bot May 18, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

egor-click commented May 7, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

nihalzp commented May 7, 2026

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

nickitat May 15, 2026

Choose a reason for hiding this comment

Uh oh!

nihalzp May 18, 2026

Choose a reason for hiding this comment

Uh oh!

nickitat May 15, 2026

Choose a reason for hiding this comment

Uh oh!

nihalzp May 18, 2026

Choose a reason for hiding this comment

Uh oh!

nickitat left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

clickhouse-gh Bot commented May 25, 2026

LLVM Coverage Report

Uh oh!

Uh oh!

clickgapai commented May 25, 2026

What this does

When to use

Automatically disabled when

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

nihalzp commented May 6, 2026 •

edited by robot-clickhouse

Loading

clickhouse-gh Bot commented May 6, 2026 •

edited

Loading

egor-click commented May 7, 2026 •

edited

Loading