Block a user
Standardize response schemas across endpoints
Strengthen post-action verification for real UI workflows
Add better target-debug overlays and candidate inspection
Add structured text-targeted control lookup/click helpers
Add first-class window-to-display placement actions
Improve OCR reliability on dense real-world UIs
Standardize response schemas across endpoints
Improve handling of small control-strip UIs like OBS
Add better target-debug overlays and candidate inspection
Strengthen post-action verification for real UI workflows
Add first-class window-to-display placement actions
Improve OCR reliability on dense real-world UIs
Add structured text-targeted control lookup/click helpers