ScreenJob Skill (OpenClaw Agents)
What ScreenJob Solves
ScreenJob lets an agent execute tasks that require a real desktop UI plus terminal access, with structured tool calls and job tracking.
Main Features
- Screen perception (see_screen, enhance)
- Mouse/keyboard control (click, type, press_key)
- Terminal execution (execute_command, sleep)
- Structured completion payload (task_complete(return=..., data=...))
- Safety gate, auth, history, and live monitoring
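
To make the tool surface more concrete, here is a minimal Python sketch of how the tool names listed above might be represented as structured calls. Only the tool names and the return/data fields of task_complete come from this document; the ToolCall class, the make_call and make_task_complete helpers, and any argument names are hypothetical and may not match how ScreenJob models calls internally.

```python
from dataclasses import dataclass, field
from typing import Any

# Tool names as listed above; the grouping comments are for illustration only.
KNOWN_TOOLS = {
    "see_screen", "enhance",        # screen perception
    "click", "type", "press_key",   # mouse/keyboard control
    "execute_command", "sleep",     # terminal execution
    "task_complete",                # structured completion payload
}


@dataclass
class ToolCall:
    """Hypothetical container for one structured tool call."""
    name: str
    args: dict[str, Any] = field(default_factory=dict)


def make_call(name: str, **args: Any) -> ToolCall:
    """Build a ToolCall, rejecting names outside the listed tool set."""
    if name not in KNOWN_TOOLS:
        raise ValueError(f"unknown tool: {name}")
    return ToolCall(name, args)


def make_task_complete(return_value: Any, data: Any = None) -> ToolCall:
    """Build the final task_complete call.

    `return` is a reserved word in Python, so it cannot be passed as a
    keyword argument directly; the key is set through a dict instead.
    """
    return ToolCall("task_complete", {"return": return_value, "data": data})
```

One detail worth noting for Python clients: because return is a keyword, the payload is easier to handle as a plain dictionary than as keyword arguments or attributes.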
Important Environment Note
ScreenJob runs on a separate computer (the human/operator machine), not inside the agent's own runtime environment.
Why It Is Useful
Agents can use ScreenJob to launch and control GUI workflows, including orchestrating other GUI agents/tools on a human computer.
Example Tasks
- Open amazon.de and buy a USB-C to USB-C cable for 10 EUR or less.
- Open google.com, go to my account, and change my profile picture to a provided image URL.
- Run ls -a in C:/Users/username/Documents and return the output in data.
Practical Usage
- Submit a job via the CLI or API.
- The agent runs its tool loop until it calls task_complete.
- Read the final response.return and response.data from the job status.
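
The last step can be sketched in Python as follows. This assumes the job status is available as a JSON object with a response field containing return and data, as the names above suggest; the extract_result helper and the sample payload are hypothetical, and a real job status may carry additional or differently named fields.

```python
import json
from typing import Any


def extract_result(job_status: dict[str, Any]) -> tuple[Any, Any]:
    """Return (response.return, response.data) from a job status dict.

    Missing fields yield None instead of raising, so an unfinished job
    can be distinguished by a (None, None) result.
    """
    response = job_status.get("response") or {}
    # "return" is a Python keyword, so use key access rather than attributes.
    return response.get("return"), response.get("data")


# Hypothetical job status; real payloads may differ.
sample_status = json.loads(
    '{"response": {"return": "done", "data": {"output": "file1.txt"}}}'
)

result, data = extract_result(sample_status)
print(result)
print(data)
```

How the status is fetched (CLI output, HTTP call, or polling) is left out here because this document does not specify it.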