padmak30 · 2026-06-25T16:10:57Z

Description

Adds end-to-end coverage for agentcore export harness across both source modes — in-project (--name) and out-of-project (--arn) — proving each exported Strands runtime agent actually works at runtime, not just that the spec/wiring is generated.

The source harness attaches the following export surfaces together:

an existing project memory (referenced by name)
an agentcore_code_interpreter tool (managed default)
a public GitHub skill (cloned at runtime, no credential)
an MCP gateway tool (in-project gateway + mcp-server target)

Flow (e2e-tests/export-harness-full.test.ts):

create --no-agent + add memory + add gateway (mcp-server target)
deploy #1: provisions memory + gateway (ARNs now exist)
add harness attaching the memory (by name) + gateway (by --gateway-arn) + code-interpreter tool + git skill
deploy #2: harness with all surfaces
invoke the harness; assert the code interpreter runs
export --name → deploy → verify the in-project agent (memory wired via discovery env var; gateway + CI as connections)
export --arn into a fresh project → deploy → verify the out-of-project agent (every resource external → all wired as connections)
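
The wiring expectations in the last two steps can be written down as a small lookup. This is only a sketch of what the flow describes: the type names and the `expectedWiring` function are hypothetical and not taken from the test file, and the skill is left out because the description does not say how it is wired.

```typescript
// Hypothetical names; this encodes only what the flow above states.
type SourceMode = "name" | "arn";
type Surface = "memory" | "gateway" | "codeInterpreter";
type Wiring = "discovery-env-var" | "connection";

/**
 * Expected wiring of each surface in the exported agent.
 * --name (in-project): memory is found via a discovery env var,
 *   gateway and code interpreter are wired as connections.
 * --arn (out-of-project): every resource is external, so all are connections.
 */
function expectedWiring(mode: SourceMode): Record<Surface, Wiring> {
  return {
    memory: mode === "name" ? "discovery-env-var" : "connection",
    gateway: "connection",
    codeInterpreter: "connection",
  };
}
```

Only the memory surface differs between the two modes, which is why the by-ARN export is the stricter check of connection wiring.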

Each exported agent is behaviorally verified via four invokes:

code interpreter: exact factorial value (the model can't fabricate it)
gateway tool: lists a gateway-prefixed / Exa MCP tool (provider-specific token, not generic prose)
skill: references the cloned returns-policy skill
memory: same-session round-trip recall
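
A minimal sketch of how the four checks could be expressed as predicates over the response text. Everything here is an assumption made for illustration: the `checks` object, its parameter names, and the choice of substring matching are not taken from the test file, and the actual prompts and expected tokens are not shown in this description.

```typescript
// Hypothetical predicates; the real assertions live in the e2e test file.

/** n! for small n; the result stays exact below Number.MAX_SAFE_INTEGER (n <= 18). */
function factorial(n: number): number {
  if (!Number.isInteger(n) || n < 0 || n > 18) {
    throw new RangeError("n must be an integer in [0, 18]");
  }
  let acc = 1;
  for (let i = 2; i <= n; i++) acc *= i;
  return acc;
}

/** Models often print large numbers with thousands separators. */
const stripCommas = (text: string): string => text.replace(/,/g, "");

const checks = {
  // Exact digits must appear; an approximate answer does not pass.
  codeInterpreter: (response: string, n: number): boolean =>
    stripCommas(response).includes(String(factorial(n))),
  // A provider-specific token, compared case-insensitively.
  gatewayTool: (response: string, toolToken: string): boolean =>
    response.toLowerCase().includes(toolToken.toLowerCase()),
  // The cloned skill must be named in the answer.
  skill: (response: string, skillName: string): boolean =>
    response.toLowerCase().includes(skillName.toLowerCase()),
  // A value stored earlier in the same session must come back verbatim.
  memory: (response: string, storedValue: string): boolean =>
    response.includes(storedValue),
};
```

For example, `checks.codeInterpreter("The result is 3,628,800.", 10)` passes, while a reply such as "roughly 3.6 million" does not.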

The test is skipIf(!canRun)-gated (AWS creds + npm + git), tears down both projects in afterAll, and uses E2e-prefixed, per-run-unique project names so a failed teardown is still swept by global-setup's stale-stack GC.
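
The naming scheme can be sketched roughly as follows. Both helpers are hypothetical: the actual name format and the rule that global-setup uses to detect stale stacks are not shown here, so the prefix match below is an assumption.

```typescript
// Hypothetical helpers; the real naming and GC logic live in the repo's e2e utilities.
const E2E_PREFIX = "E2e";

/** Builds an alphanumeric project name that differs from run to run. */
function uniqueProjectName(label: string, now: number = Date.now()): string {
  const stamp = now.toString(36); // timestamp in base 36
  const rand = Math.random().toString(36).slice(2, 6); // short random suffix
  return `${E2E_PREFIX}${label}${stamp}${rand}`;
}

/** A sweeper keyed on the prefix can still find stacks left by a failed teardown. */
function isE2eProject(name: string): boolean {
  return name.startsWith(E2E_PREFIX);
}
```

Because the prefix is fixed and only the suffix varies, a later run can recognise leftovers without knowing the exact name an earlier run generated.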

Type of Change

Other: test-only (adds e2e coverage; no product code changes)

Testing

I ran npm run typecheck (clean)
I ran npm run lint (clean)
Verified live end-to-end against a real AWS account (us-east-1): 7/7 steps pass; both exported agents (in-project + by-ARN) deploy and pass all four capability checks; resources auto-torn-down.

Checklist

I have added any necessary tests that prove the feature works
My changes generate no new warnings

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

End-to-end coverage for `export harness` across both source modes, proving each exported agent works at runtime (not just that the spec/wiring is generated). The source harness attaches every export surface: an existing project memory (by name), an agentcore_code_interpreter tool, a public GitHub skill, and an MCP gateway tool. Flow: deploy memory+gateway → create the harness attaching both → deploy → invoke the harness → export --name (in-project) → deploy → verify → export --arn (new empty project) → deploy → verify. Each exported agent is behaviorally verified via four invokes: code interpreter (exact factorial value), gateway tool (assert the gateway-prefixed/Exa MCP tool is listed), skill (assert the returns-policy skill is referenced), and memory (same-session round-trip recall). Verified live: 7/7 steps pass; both projects torn down in afterAll; project names are E2e-prefixed + per-run-unique so a failed teardown is still swept by global-setup's stale-stack GC.

agentcore-devx-automation · 2026-06-25T16:12:47Z

github-actions · 2026-06-25T16:13:10Z

Package Tarball

aws-agentcore-0.21.0.tgz

How to install

gh release download pr-1641-tarball --repo aws/agentcore-cli --pattern "*.tgz" --dir /tmp/pr-tarball
npm install -g /tmp/pr-tarball/aws-agentcore-0.21.0.tgz

agentcore-cli-automation

Test is well-structured and the per-capability behavioral assertions are a clear improvement over "did it return 200" checks. One concern about flakiness needs to be addressed before merging.

github-actions · 2026-06-25T16:17:03Z

Coverage Report

Status	Category	Percentage	Covered / Total
🔵	Lines	37.42%	13771 / 36800
🔵	Statements	36.69%	14646 / 39910
🔵	Functions	32%	2356 / 7362
🔵	Branches	31.46%	9180 / 29173

Generated in workflow #3839 for commit e9a124e by the Vitest Coverage Report Action

The per-capability checks asserted on a single LLM response outside the retry — only invokeAndExpectSuccess's CLI-`success` check was retried. That risks intermittent CI failures from (a) memory same-session write/read visibility lag on the recall turn and (b) LLM phrasing nondeterminism on the gateway-tool and skill checks (a `success: true` response that happens not to name the expected token). Add an optional `verify` predicate to invokeAndExpectSuccess that runs INSIDE the retried unit, and move every content assertion (factorial value, gateway tool token, skill reference, memory recall) into it — so a flaky sample re-invokes instead of failing. Mirrors the retry pattern already used in harness-e2e-helper.
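
The retry shape described above can be sketched like this. The signature is an assumption: only the helper name `invokeAndExpectSuccess` and the optional `verify` predicate come from the comment, while `InvokeResult`, `RetryOptions`, the `invoke` callback, and the default attempt count are hypothetical.

```typescript
// Hypothetical shapes; the real helper's signature is not shown in this PR text.
interface InvokeResult {
  success: boolean;
  response: string;
}

interface RetryOptions {
  attempts?: number;
  delayMs?: number;
  /** Optional content check, evaluated inside the retried unit. */
  verify?: (result: InvokeResult) => boolean;
}

const sleep = (ms: number): Promise<void> =>
  new Promise((resolve) => setTimeout(resolve, ms));

async function invokeAndExpectSuccess(
  invoke: () => Promise<InvokeResult>,
  { attempts = 3, delayMs = 0, verify }: RetryOptions = {},
): Promise<InvokeResult> {
  let lastFailure = "no attempts made";
  for (let attempt = 1; attempt <= attempts; attempt++) {
    try {
      const result = await invoke();
      if (!result.success) {
        lastFailure = "CLI reported success: false";
      } else if (verify && !verify(result)) {
        // A successful call with unexpected content is retried, not failed.
        lastFailure = "verify predicate rejected the response";
      } else {
        return result;
      }
    } catch (err) {
      lastFailure = err instanceof Error ? err.message : String(err);
    }
    if (attempt < attempts) await sleep(delayMs);
  }
  throw new Error(`invoke failed after ${attempts} attempts: ${lastFailure}`);
}
```

Keeping the content assertion inside the loop means a `success: true` response that omits the expected token triggers another invoke; the test fails only once every attempt has been rejected.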

agentcore-devx-automation · 2026-06-25T16:31:55Z

padmak30 requested a review from a team June 25, 2026 16:10

github-actions Bot added the size/l PR size: L label Jun 25, 2026

padmak30 temporarily deployed to e2e-testing June 25, 2026 16:12 — with GitHub Actions Inactive

agentcore-devx-automation Bot added the claude-security-reviewing Claude Code /security-review in progress label Jun 25, 2026

github-actions Bot added the agentcore-harness-reviewing AgentCore Harness review in progress label Jun 25, 2026

agentcore-devx-automation Bot removed the claude-security-reviewing Claude Code /security-review in progress label Jun 25, 2026

agentcore-cli-automation suggested changes Jun 25, 2026

Comment thread e2e-tests/export-harness-full.test.ts

github-actions Bot removed the agentcore-harness-reviewing AgentCore Harness review in progress label Jun 25, 2026

github-actions Bot added size/l PR size: L and removed size/l PR size: L labels Jun 25, 2026

agentcore-devx-automation Bot added the claude-security-reviewing Claude Code /security-review in progress label Jun 25, 2026

agentcore-devx-automation Bot removed the claude-security-reviewing Claude Code /security-review in progress label Jun 25, 2026

padmak30 temporarily deployed to e2e-testing June 25, 2026 16:42 — with GitHub Actions Inactive

avi-alpert approved these changes Jun 25, 2026

padmak30 merged commit aba397a into main Jun 25, 2026
33 checks passed

padmak30 deleted the test/export-harness-e2e branch June 25, 2026 17:41

test(e2e): export a fully-featured harness in-project and by ARN #1641

padmak30 merged 2 commits into main from test/export-harness-e2e

3 participants
