Source linked

Syll Open-Source Agent unifica las APIs, CLI y Control de GUI de escritorio

Syll permite a los usuarios enseñar procedimientos por demostración, compilarlos en habilidades reutilizables y externalizar toda la memoria y rutinas de agentes como archivos locales editables.

syllopen sourcepersonal automationmultimodal agentsgui controlmcp tools

Syll works on Stardew Valley and Adobe Photoshop from the same agent runtime — that's the kind of cross-surface validation most personal automation frameworks never get near.

From Stardew Valley to Photoshop: One Agent to Rule All Surfaces

Syll's developers built an open-source, self-hosted multimodal agent harness that unifies MCP/API tools, CLI execution, and visual GUI control in a modular runtime. Instead of forcing agents to pick one interface, Syll lets the same system coordinate computer use across heterogeneous surfaces — shells, web UIs, desktop apps, programmatic APIs. The validation suite includes real production desktop applications: Adobe Photoshop, Adobe Audition, Stardew Valley, macOS Finder, and others. That's not a toy benchmark; those are apps with complex, non‑standard GUIs.

Teach by Doing: How Syll Compiles User Demonstrations into Reusable Skills

Most automation agents require scripting or natural language prompts for every task. Syll flips that: users teach procedures through direct demonstration — pointing, clicking, typing — and Syll compiles those demonstrations into reusable skills. The core is a bidirectional user‑agent interaction layer. Agent execution translates back into multimodal evidence: logs, keyframes, and approval checkpoints for inspection and control. You don't just tell the agent what to do; you show it, and it remembers.

Memory You Can Edit: Externalizing Agent Internals as Plain Files

Syll externalizes memory, skills, routines, and governance as editable local artifacts. No black boxes, no proprietary databases — just files you can inspect, extend, and version control. The system reports mechanism‑oriented studies that validate multimodal routing, teachable GUI replay, and persistent local artifacts. For a technically literate user, that means you can audit exactly what your agent remembers, rewrite a skill that's misbehaving, or share routines as plain text.

Syll positions itself as a practical open‑source foundation for personal automation that users can teach, inspect, and continuously extend — the kind of tool that makes agentic computing feel less like a black box and more like a programmable assistant you actually trust.


Source: Syll: Open-Source Personal Automation with Cross-Surface Execution
Domain: arxiv.org

Read original source ->

External source stays available while the OJO article and comment thread stay local.

Comments load interactively on the live page.