perf: eliminate double JSON serialization in cache key hot path #7345
Open
ppraneth wants to merge 5 commits into tensorzero:main from
Conversation
Contributor
All contributors have signed the CLA ✍️ ✅

Author
I have read the Contributor License Agreement (CLA) and hereby sign the CLA.
Author

Previously, `ModelProviderRequest::get_cache_key()` serialized the request twice: once via `serde_json::to_value` to produce a `serde_json::Value`, then again via `.to_string()` to get a JSON string for hashing. Between the two steps it also had to allocate the intermediate `Value`, find and remove the `inference_id` key from it, and then allocate the final `String` buffer.

This PR replaces that with a single streaming serialization pass directly into the `blake3::Hasher` (which implements `std::io::Write`), using `serde_json::to_writer`. To exclude `inference_id` from the hash without an intermediate allocation, it is now marked `#[serde(skip)]` on `ModelInferenceRequest`.

What changed
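The single-pass idea can be sketched with std-only types: any hasher reachable through `std::io::Write` can consume serialized bytes as they are produced, so no intermediate `String` is needed. Below, `DefaultHasher` stands in for `blake3::Hasher`, and the JSON bytes come from `write!` rather than serde — a minimal sketch of the pattern, not the PR's actual code:

```rust
use std::collections::hash_map::DefaultHasher;
use std::hash::Hasher;
use std::io::{self, Write};

// Stand-in for `blake3::Hasher`: an adapter that feeds every byte
// written through `io::Write` straight into the hash state.
struct HashWriter(DefaultHasher);

impl Write for HashWriter {
    fn write(&mut self, buf: &[u8]) -> io::Result<usize> {
        Hasher::write(&mut self.0, buf); // absorb bytes, no buffering
        Ok(buf.len())
    }
    fn flush(&mut self) -> io::Result<()> {
        Ok(())
    }
}

fn main() {
    // One pass: stream the serialized form directly into the hasher.
    let mut hw = HashWriter(DefaultHasher::new());
    write!(hw, "{{\"model\":\"gpt-4\",\"max_tokens\":{}}}", 100).unwrap();
    let streamed = hw.0.finish();

    // Two passes (the old shape): build the String, then hash it.
    let json = format!("{{\"model\":\"gpt-4\",\"max_tokens\":{}}}", 100);
    let mut h = DefaultHasher::new();
    Hasher::write(&mut h, json.as_bytes());
    let buffered = h.finish();

    // Same digest either way; the streaming version skips the String.
    assert_eq!(streamed, buffered);
    println!("digests match");
}
```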
tensorzero-inference-types

- Added `#[serde(skip)]` to the `inference_id` field on `ModelInferenceRequest`. The field is only serialized for cache key purposes and is intentionally excluded from the hash, so skipping it during serialization is correct and safe.

tensorzero-core/src/cache.rs
- Replaced the `to_value` → `remove` → `to_string` sequence with a single streaming `serde_json::to_writer` call into the hasher.
- Added a Criterion benchmark, `benches/cache_key.rs`, for regression tracking.

Benchmark results (release mode, same machine)
✅ All 88 cache unit tests pass.