Show HN: AI agent that runs real browser workflows

Posted by heavymemory 3 hours ago

Counter4Comment6OpenOriginal

I’ve been experimenting with letting an AI agent execute full workflows in a browser.

In this demo I gave it my CV and asked it to find matching jobs. It scans my inbox, opens the listings, extracts the details and builds a Google Sheet automatically.

Comments

Comment by fidorka 1 hour ago

Cool demo. The tricky bit with browser workflow agents is figuring out which workflows to automate in the first place. Most people don't even realize they're doing the same thing over and over - they just do it.

I've been building MemoryLane (https://github.com/deusXmachina-dev/memorylane) which comes at this from the other side - it records screen activity, spots repeated patterns with AI, and then tells you "hey you keep doing this, want to automate it?" Works as an MCP plugin for Claude/Cursor.

Feels like pattern detection (finding what to automate) + browser agents like yours (actually doing the automation) is the right combo. Are you thinking about the discovery side at all, or mostly focused on execution?

Comment by heavymemory 55 minutes ago

Interesting. Part of why I built this was to avoid screen capture as the control layer. Once you’re taking screenshots, guessing what to click, moving the mouse, and repeating, it gets slow and brittle fast. Here the workflow is just described in text, executed in the browser, and saved for reuse.

Comment by hkonte 2 hours ago

The CV-to-job-search demo is a good showcase. For multi-step workflows like this, one issue I've run into is that the agent instructions degrade across steps. The initial task description is clear, but by step 5 the model is interpolating intent from earlier context rather than following explicit instructions.

Structuring the task prompt into named blocks (objective, constraints, expected output format per step) before the workflow starts makes each step much more reliable. The agent has less to infer.

Built github.com/Nyrok/flompt to help with this, a visual builder that decomposes instructions into semantic blocks and compiles to Claude-optimized XML. Useful for defining the "task shape" before handing it to an agent.

Comment by heavymemory 2 hours ago

Yeah, instruction drift is a real problem in long agent chains. In this case the workflow gets decomposed into steps up front and each step is executed by a separate sub-agent.

So the model isn’t carrying the whole instruction chain across multiple steps, it’s just solving the current task. Similar pattern to what tools like Codex CLI or Claude Code do.

Comment by abraxas 3 hours ago

I was looking for a similar produc/project the other day. Alas my need is a Linux native version. You may want to consider it as Mac seems to be overserved by the agent harness supply while Linux is the opposite

Comment by heavymemory 3 hours ago

linux and windows support is on the way, i’ve designed it in a decoupled way, so should be straight forward.

Just need to see if people find this version useful