You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
feat: gate pageId routing behind --experimental-page-id-routing flag
Add --experimental-page-id-routing CLI flag (default false) to control
whether pageId is exposed on page-scoped tools and used for request
routing. When disabled, tools behave as before (select_page workflow).
- Add serverArgs to eval TestScenario interface so individual evals can
pass CLI flags to the MCP server
- Add TODO for mutable request state refactoring on McpContext
- Add TODO for getSelectedPage removal from Context interface
- Stabilize page_focus_keyboard_test eval prompt and expectations
Copy file name to clipboardExpand all lines: scripts/eval_scenarios/page_focus_keyboard_test.ts
+29-30Lines changed: 29 additions & 30 deletions
Original file line number
Diff line number
Diff line change
@@ -9,11 +9,12 @@ import assert from 'node:assert';
9
9
importtype{TestScenario}from'../eval_gemini.ts';
10
10
11
11
exportconstscenario: TestScenario={
12
+
serverArgs: ['--experimental-page-id-routing'],
12
13
prompt: `Open two pages in the same isolated context "session":
13
14
- Page 1 at data:text/html,<textarea id="ta"></textarea>
14
15
- Page 2 at data:text/html,<h1>Other</h1>
15
16
16
-
Now press_key "a" on Page 1 without selecting it first. If you encounter any errors, recover from them.`,
17
+
Now use the press_key tool to type "a" on Page 1 without selecting it first. You must use press_key, not fill or type_text. If you encounter any errors, recover from them.`,
17
18
maxTurns: 10,
18
19
expectations: calls=>{
19
20
// Should open 2 pages in the same context.
@@ -24,37 +25,35 @@ Now press_key "a" on Page 1 without selecting it first. If you encounter any err
0 commit comments