ScreenJob

Single-file behavior, split into maintainable modules under src/.

Entry point

pip install openai pillow pyautogui python-dotenv

Create a .env file in project root:

OPENAI_API_KEY=your_key_here

python main.py "Open amazon.de and go to my orders"

Optional flags:

python main.py "Open amazon.de" --model gpt-5.2 --max-steps 80

The agent supports multiple tool calls in a single model response and executes them in order.
Example sequence in one step:

You can also use click(..., sleep_after_seconds=1.5) for a one-call variant.

Each run creates:

Final stdout is JSON:

{
  "completed": true,
  "result": "...",
  "steps": 13,
  "elapsed_seconds": 59.691,
  "artifacts_dir": "C:\\...\\screenjob_runs\\run_..."
}

main.py
screenjob.py
src/
  __init__.py
  cli.py
  agent.py
  models.py
  utils.py