One-shot Tool Call
18/18 passed (100.0%) across Bash, File Ops, MCP (Context7), Skills, and Generation. 2 ToolSearch tests skipped — harness does not expose MCP tool discovery. First-attempt accuracy: 100%. Total wall time: 282.2s.
OK18/18 passed (100.0%) across Bash, File Ops, MCP (Context7), Skills, and Generation. 2 ToolSearch tests skipped — harness does not expose MCP tool discovery. First-attempt accuracy: 100%. Total wall time: 282.2s.
OK