683f2d59612880159e321a66a5cba5c540f024e1
All checks were successful
python-syntax / syntax-check (push) Successful in 4s
Clickthrough
Let an Agent interact with your computer over HTTP, with grid-aware screenshots and precise input actions.
What this provides
- Visual endpoints: full-screen capture with optional grid overlay and labeled cells (
asImage=truecan return raw image bytes) - Zoom endpoint: crop around a point with denser grid for fine targeting (
asImage=truesupported) - Action endpoints: move/click/right-click/double-click/middle-click/scroll/type/hotkey
- Coordinate transform metadata in visual responses so agents can map grid cells to real pixels
- Safety knobs: token auth, dry-run mode, optional allowed-region restriction
Quick start
cd /root/external-projects/clickthrough
python3 -m venv .venv
. .venv/bin/activate
pip install -r requirements.txt
CLICKTHROUGH_TOKEN=change-me python -m server.app
Server defaults to 127.0.0.1:8123.
Minimal API flow
GET /screenwith grid- Decide cell / target
- Optional
POST /zoomfor finer targeting POST /actionto executeGET /screenagain to verify result
See:
docs/API.mddocs/coordinate-system.mdskill/SKILL.md
Configuration
Environment variables:
CLICKTHROUGH_HOST(default127.0.0.1)CLICKTHROUGH_PORT(default8123)CLICKTHROUGH_TOKEN(optional; if set, requirex-clickthrough-tokenheader)CLICKTHROUGH_DRY_RUN(true/false; defaultfalse)CLICKTHROUGH_GRID_ROWS(default12)CLICKTHROUGH_GRID_COLS(default12)CLICKTHROUGH_ALLOWED_REGION(optionalx,y,width,height)
Gitea CI
A Gitea Actions workflow is included at .gitea/workflows/python-syntax.yml.
It runs Python syntax checks (py_compile) on every push and pull request.
Languages
Python
100%