[ISSUE-378][HugePartition][Part-2] Introduce memory usage limit and data flush#471
Conversation
|
I think these two similar configurations will bring more confusion to end user. |
If single buffer flush could be enabled automatically, how to set the flush threshold size? I don't hope the
Emm. Yes. |
|
It seems I got your point. The conf of |
Codecov Report
@@ Coverage Diff @@
## master #471 +/- ##
============================================
+ Coverage 58.78% 58.86% +0.07%
- Complexity 1704 1714 +10
============================================
Files 206 206
Lines 11471 11517 +46
Branches 1024 1033 +9
============================================
+ Hits 6743 6779 +36
- Misses 4317 4324 +7
- Partials 411 414 +3
📣 We’re building smart automated test selection to slash your CI/CD build times. Learn more |
Sorry, I forgot to reply this comment. If I understand the code and design correctly, So, I think we could reuse the |
Memory limit is required. Usually the huge partition writing speed is fast, and the HDFS flushing speed is slower than writing. If missing memory limit, the regular partitions will be affected. By the way, this design has been introduced into our internal version, it works well. Before this PR, sometimes one huge partition will make other regular partitions buffer require fail. |
I'm ok to add |
This suggestion has been accepted 😁 (Here: #471 (comment)). Please review the latest code. |
en. I will take a look in details by tomorrow. But on the surface, the |
|
|
|
PTAL @advancedxy. After this PR is merged, I will introduce some metrics about huge partition. |


What changes were proposed in this pull request?
rss.server.single.buffer.flush.thresholdvalue, single-buffer flush will be triggered whatever the single buffer flush is enabled or notWhy are the changes needed?
Does this PR introduce any user-facing change?
Yes.
How was this patch tested?