Source linked

Syll Open-Source Agent vereint APIs, CLI und Desktop GUI Control

Syll ermöglicht es Benutzern, Prozeduren durch Demonstration zu lehren, sie in wiederverwendbare Fähigkeiten zu kompilieren und alle Agentengedächtnisse und Routinen als bearbeitbare lokale Dateien zu externalisieren.

syllopen sourcepersonal automationmultimodal agentsgui controlmcp tools

Syll works on Stardew Valley and Adobe Photoshop from the same agent runtime — that's the kind of cross-surface validation most personal automation frameworks never get near.

From Stardew Valley to Photoshop: One Agent to Rule All Surfaces

Syll's developers built an open-source, self-hosted multimodal agent harness that unifies MCP/API tools, CLI execution, and visual GUI control in a modular runtime. Instead of forcing agents to pick one interface, Syll lets the same system coordinate computer use across heterogeneous surfaces — shells, web UIs, desktop apps, programmatic APIs. The validation suite includes real production desktop applications: Adobe Photoshop, Adobe Audition, Stardew Valley, macOS Finder, and others. That's not a toy benchmark; those are apps with complex, non‑standard GUIs.

Teach by Doing: How Syll Compiles User Demonstrations into Reusable Skills

Most automation agents require scripting or natural language prompts for every task. Syll flips that: users teach procedures through direct demonstration — pointing, clicking, typing — and Syll compiles those demonstrations into reusable skills. The core is a bidirectional user‑agent interaction layer. Agent execution translates back into multimodal evidence: logs, keyframes, and approval checkpoints for inspection and control. You don't just tell the agent what to do; you show it, and it remembers.

Memory You Can Edit: Externalizing Agent Internals as Plain Files

Syll externalizes memory, skills, routines, and governance as editable local artifacts. No black boxes, no proprietary databases — just files you can inspect, extend, and version control. The system reports mechanism‑oriented studies that validate multimodal routing, teachable GUI replay, and persistent local artifacts. For a technically literate user, that means you can audit exactly what your agent remembers, rewrite a skill that's misbehaving, or share routines as plain text.

Syll positions itself as a practical open‑source foundation for personal automation that users can teach, inspect, and continuously extend — the kind of tool that makes agentic computing feel less like a black box and more like a programmable assistant you actually trust.


Source: Syll: Open-Source Personal Automation with Cross-Surface Execution
Domain: arxiv.org

Read original source ->

External source stays available while the OJO article and comment thread stay local.

Comments load interactively on the live page.