Peter Steinberger
401106b963
fix: harden flaky tests and cover native google thought signatures ( #23457 ) (thanks @echoVic)
2026-02-22 12:24:53 +01:00
echoVic
9176571ec1
fix(gemini): sanitize thoughtSignatures for native Google provider
...
Native Google Gemini provider was accumulating 2K-8K tokens of Base64
thoughtSignature blobs per turn, causing premature context overflow.
The sanitizer was only enabled for OpenRouter Gemini, not native Google.
Fixes #23392
2026-02-22 12:24:53 +01:00
Peter Steinberger
78c3c2a542
fix: stabilize flaky tests and sanitize directive-only chat tags
2026-02-22 12:19:33 +01:00
Peter Steinberger
97eb4af01e
test: harden models-config env isolation list
2026-02-22 10:34:23 +00:00
Peter Steinberger
744df0fbe7
test: reclassify models-config suites from e2e to unit lane
2026-02-22 10:34:23 +00:00
Peter Steinberger
740fd7ae35
test: reclassify skills suites from e2e to unit lane
2026-02-22 10:34:23 +00:00
Peter Steinberger
c56ab39da5
perf(test): reduce bash e2e wait windows
2026-02-22 10:28:43 +00:00
Peter Steinberger
abff3f0f61
test: reclassify sessions_spawn lifecycle suite as unit test
2026-02-22 10:28:43 +00:00
Peter Steinberger
0b7c7ee1aa
perf(test): speed up sessions_spawn lifecycle suite setup
2026-02-22 10:28:43 +00:00
Peter Steinberger
c962bcba37
test: reclassify sandbox merge and exec path suites as unit tests
2026-02-22 10:28:43 +00:00
Peter Steinberger
9ab7b85a66
perf(test): tighten background abort timing windows
2026-02-22 10:28:43 +00:00
Peter Steinberger
c995f9be07
test: reclassify mocked announce and sandbox suites as unit tests
2026-02-22 10:28:43 +00:00
Peter Steinberger
27f0d7ebcc
test: reclassify auth-profile-rotation suite as unit test
2026-02-22 10:28:43 +00:00
Peter Steinberger
c0b1c10a08
test: reclassify mocked runner/safe-bins suites as unit tests
2026-02-22 10:28:43 +00:00
Peter Steinberger
a9b26d83de
perf(test): narrow pi-embedded runner e2e import path
2026-02-22 10:28:42 +00:00
Peter Steinberger
2b0ca9447c
perf(test): trim bash e2e sleep and poll windows
2026-02-22 10:28:42 +00:00
Peter Steinberger
c348a13640
perf(test): lower subagent fast-mode wait floors
2026-02-22 10:28:42 +00:00
Peter Steinberger
54e0786ba6
perf(test): reduce subagent announce fast-mode polling waits
2026-02-22 10:28:42 +00:00
Peter Steinberger
a96139e18c
perf(test): mock compact module in auth rotation e2e
2026-02-22 10:28:42 +00:00
Peter Steinberger
eda941f395
perf(test): remove flaky transport timeout and dedupe safeBins checks
2026-02-22 10:28:42 +00:00
Peter Steinberger
d72b4ead18
perf(test): lower fast-mode nested output wait floor to 70ms
2026-02-22 10:28:42 +00:00
Peter Steinberger
7ccf62fb4c
test(agents): remove dead shell-timeout override in safeBins suite
2026-02-22 10:28:42 +00:00
Peter Steinberger
60773c124e
perf(test): lower fast-mode nested output wait floor to 80ms
2026-02-22 10:28:42 +00:00
Peter Steinberger
36375f121f
perf(test): trim nested subagent output wait floor in fast mode
2026-02-22 10:28:42 +00:00
Peter Steinberger
2900eb5456
perf(test): trim background abort settle waits and dedupe cmd fixture
2026-02-22 10:28:42 +00:00
Peter Steinberger
7d13227d41
test(agents): dedupe auth profile rotation fixture setup
2026-02-22 10:28:42 +00:00
Peter Steinberger
6b5c20055b
perf(test): speed subagent announce retry polling in fast mode
2026-02-22 10:28:42 +00:00
Peter Steinberger
1b327da6e3
fix: harden exec sandbox fallback semantics ( #23398 ) (thanks @bmendonca3)
2026-02-22 11:12:01 +01:00
Brian Mendonca
c76a47cce2
Exec: fail closed when sandbox host is unavailable
2026-02-22 11:12:01 +01:00
Peter Steinberger
35d5bd4e07
perf(test): shrink subagent announce fast-mode settle waits
2026-02-22 09:29:04 +00:00
Peter Steinberger
703f7213b6
test(agents): simplify subagent announce suite imports and call assertions
2026-02-22 09:29:04 +00:00
Peter Steinberger
6c2e999776
refactor(security): unify secure id paths and guard weak patterns
2026-02-22 10:16:19 +01:00
Peter Steinberger
c3e13175d2
perf(test): bypass queue debounce in fast mode and tighten announce defaults
2026-02-22 09:13:01 +00:00
Peter Steinberger
833d7574e7
test(agents): consolidate repeated announce deferral and fallback matrices
2026-02-22 09:05:56 +00:00
Peter Steinberger
4985fb7f05
test(agents): remove overflow compaction mock reset dependency
2026-02-22 09:02:24 +00:00
Peter Steinberger
d9a7b447f5
test(agents): use lightweight clear for active-run announce mock
2026-02-22 09:01:55 +00:00
Peter Steinberger
15657dd48d
test(agents): collapse repeated announce direct-send scenarios
2026-02-22 08:57:39 +00:00
Peter Steinberger
53a7afe238
test(agents): unify hook thread-target announce assertions
2026-02-22 08:55:11 +00:00
Peter Steinberger
d625f888a9
test(core): dedupe command gating and trim announce reset overhead
2026-02-22 08:54:11 +00:00
Peter Steinberger
cf570d3b44
test(agents): avoid full mock resets in cli credential specs
2026-02-22 08:52:21 +00:00
Peter Steinberger
a1c8525766
test(agents): dedupe subagent announce direct-send variants
2026-02-22 08:49:33 +00:00
Peter Steinberger
cfb3cee7aa
test(core): dedupe auth rotation and credential injection specs
2026-02-22 08:44:40 +00:00
Peter Steinberger
ccc00d874c
test(core): reduce mock reset overhead in targeted suites
2026-02-22 08:40:29 +00:00
Vignesh Natarajan
2a66c8d676
Agents/Subagents: honor subagent alsoAllow grants
2026-02-22 00:39:27 -08:00
Peter Steinberger
2d2e1c2403
test(core): use lightweight clear in cron, claude runner, and telegram delivery specs
2026-02-22 08:35:38 +00:00
Peter Steinberger
c99e7696e6
fix: decouple owner display secret from gateway auth token
2026-02-22 09:35:07 +01:00
Peter Steinberger
1e76ca593e
test(core): tighten reset usage in auth, registry restart, and memory search
2026-02-22 08:34:20 +00:00
Peter Steinberger
1ba1c3f306
test(core): reduce reset overhead in messaging and agent e2e mocks
2026-02-22 08:33:06 +00:00
Peter Steinberger
e67f813b0e
test(core): continue reset-to-clear cleanup in subagent focus and web fetch
2026-02-22 08:30:05 +00:00
Peter Steinberger
c7606e7064
test(subagents): use lightweight clears in sessions spawn suites
2026-02-22 08:27:36 +00:00