feat(ocr): add /ocr endpoint for on-screen text extraction #1
Reference in New Issue
Block a user
Delete Branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Summary
Add an OCR endpoint so agents can read visible UI text directly from screenshots.
Why
Grid + click works, but text-driven UIs are much more reliable when the agent can read labels/buttons/menus.
Scope
Acceptance criteria
luna referenced this issue2026-04-06 13:48:48 +02:00