Superlabs Inc

Voice and Recording-Led Setup Flow for AI Workflow Automation

Superlabs is a pre-seed startup with a bold idea: non-technical people should be able to automate their own workflows, without IT, without a developer, and without a three-month implementation. We were brought in to figure out what that actually looks like.

Company

Superlabs Inc.

Product

Okto

My role

Product Designer

Team

4 (2 designers, 1 PM, 1 researcher)

Status

Ongoing · Spring 2026

01 · Project Overview

Everything built from scratch

There was no mature product to iterate on—only hypotheses, investor decks, and the founder's proof that bespoke automations could be stitched together manually if someone technical babysat the toolchain long enough.

Our challenge was to translate that founder grit into something repeatable: an experience where operations-minded people could externalize tacit process knowledge without feeling like they had joined an engineering team by accident.

Eight-panel illustrated storyboard of Joey’s daily workflow: juggling SAP, Excel, CRM, PDF sign-off, report builder, colleague pings, fragmented tasks, and the desire for a better way.

Joey's repetitive workflow - the problem

The scope we landed on

After months of research and iteration, we focused on one moment: the first time a user sits down to capture their workflow and hand it to an AI. Get this right, and the rest becomes possible.

User Research

The pattern across six very different jobs.

We paired foundational interviews with diary prompts about “what broke last week” so people weren't performing expertise—they were narrating friction. The same phrases surfaced again and again: visibility, blame, and the quiet shame of not knowing which tool actually held the truth.

Synthesis wasn't a tidy persona exercise; it was an argument with ourselves about what automation replaces (clicks) versus what it must preserve (accountability and narrative).

User interviews

sessions across roles and industries

Manual work

of the workday spent on manual tasks (self-reported bands)

Themes

core themes from affinity clustering

Affinity mapping — secondary data visualizations

Theme 01

Trust is built like you train an intern

People mentally simulate oversight loops—what they check first, what proof counts—and automation has to earn passes through those same gates.

Theme 02

Adoption fails in the handoff not the launch

Excitement evaporates when outputs land in someone else’s inbox with no shared story about how they were produced.

Theme 03

The anxiety is about invisibility

The scary part isn’t “AI”; it’s not being able to point to the moment the system misunderstood you.

Theme 04

Everyone is already a human API

Office managers already translate between tools; we’re designing for people who broker glue work between SaaS islands.

“I need to see with my own eyes that it's doing it the right way before I start trusting it.”
— P3, Project Manager · Construction

03 · Competitive Analysis

Everyone is selling to IT. Nobody is selling to Joey.

We mapped vendors across two axes: how technical the buyer expects to be, and how much of the workflow lives inside a single vendor versus orchestration across tools. The crowded quadrant was “enterprise IT installs this for you”—thin air for the admin lead wiring fourteen SaaS apps.

That gap clarified positioning: Okto isn't permissioned automation inside one suite—it's legible capture across the messy reality of operational glue work.

Gumloop

Flowise

Dify

Power Automate

Palantir Foundry

Vercept

ServiceNow

Tropic

Zylo

Competitive landscape map

Insight 01

Technical complexity remains a barrier. Competitors still require dedicated technical teams. Intuitive, conversational approaches consistently stand out as the differentiator.

Insight 02

Users want to prove value on their own terms. Free trials, self-serve dashboards, and small implementations let users experience results before committing.

Insight 03

ROI needs to be visible and specific. The most effective competitors quantify savings in dollar amounts and surface metrics relevant to each user's role.

Vercept

Vercept sat closest to “ambient monitoring” narratives—we pushed harder on consentful capture and reversible teaching moments rather than silent observation.

Five takeaways anchored downstream decisions: sell outcomes not connectors; pair capture with confirmation UI; avoid surveillance framing; design for partial automation; and assume every workflow has a human escalation path that must remain dignified.

04 · User Journey Explorations

We tried a lot of versions before we knew what we were building.

Journey explorations stayed deliberately messy—whiteboards with parallel flows for billing, onboarding vendors, and exec scheduling—until patterns condensed into three interacting modes instead of three disconnected features.

Record

Ambient capture with explicit start/stop cues so “being watched” transforms into “being taught.”

Prompt

Lightweight clarifying questions that stitch gaps without turning setup into an interrogation.

Conversation

A readable dialogue layer where corrections feel like coaching—not debugging terminal output.

User flow iterations — Joey's billing workflow

Recording mode carried the most baggage culturally—associations with surveillance tooling ran hot—so we prototyped consent rhythms, visible indicators, and granular deletion paths before we chased clever inference.

What consolidated was less a linear funnel than a spiral: capture enough fidelity to replay, pause for human checkpoints at ambiguous forks, and let conversation compress weeks of tacit knowledge into something shareable.

05 · Challenges Around Recording

Recording sounds simple. It isn't.

Each assumption—what counts as a step, whether narration is natural, how collaborators enter frame—had downstream implications for trust and legal comfort. We surfaced disagreements early so engineering didn't optimize the wrong latency bottleneck.

How much is enough?

Define minimum viable fidelity without pushing users into infinite capture anxiety—or brittle, overfitted automations.

Will users actually narrate?

Test prompts, pacing, and recovery when someone forgets mid-flow; narration can't be the only channel for intent.

Real-time feedback: helpful or surveillance?

Balance reassurance cues against the feeling of being scored while doing normal work.

What about collaborative workflows?

Attribute edits without turning coworkers into suspects; shared ownership needs shared safety.

The tension we kept returning to

Visibility heals trust when it's paired with control—otherwise it becomes proof-of-work theater that punishes the people we're trying to empower.

06 · Emerging Questions

The more we designed, the more we didn't know.

Open questions became guardrails: we documented them publicly inside the team so scope debates referenced risks instead of personalities—especially where product metaphors touched enterprise procurement realities we hadn't stress-tested yet.

Parts vs. wholes

Should automation chunks mirror tickets, emails, or human mental models of “the job”?

Controls without overwhelm

Progressive disclosure versus expert surfaces—who earns density, and when?

What happens offline?

Capture degrades gracefully when VPNs fail mid-task—the workflow can't strand Maria.

Resilience over time

Drift detection when apps update underneath recorded flows—how loudly should Okto complain?

The escalation chain

When automation misfires, who sees it first—and what language prevents blame cascades?

07 · Testing adoption signals

We built two prototypes and watched people use them.

We recruited participants with operational breadth—not just software fluency—and ran moderated sessions that emphasized think-aloud during capture and retrospective trust calibration after.

Prototypes diverged on how explicitly Okto announced itself in the OS chrome versus staying whisper-quiet until summoned; reactions split cleanly across risk posture and role power dynamics.

Prototype A

ambient, minimal

Favored by participants who wanted automation to feel like a quiet copilot—until something broke and they hunted for where Okto was “looking.”

Tension · Legibility vs. calm

Prototype B

Zoom-like, explicit control

Preferred when participants wanted theatrical clarity—clear modes, obvious boundaries—at the cost of feeling “always on stage” during sensitive screens.

Tension · Performance anxiety

Adoption testing - A

Adoption testing - B

Results weren't a winner-take-all verdict—they were a calibration curve. Teams with stronger IT partnerships tolerated ambient patterns; smaller shops skewed toward explicit guardrails that doubled as training wheels.

We logged failures as richly as successes: hesitation clicks, repeated pauses at consent screens, and language participants invented to describe “where the agent is” became interaction copy seeds.

The through-line was consent ergonomics: participants forgave imperfection when they felt able to steer, interrupt, and roll back—less so when intelligence felt theatrical or coy.

“Where is Pearl's eye looking right now?”
— Testing participant

08 · Presence Indicators

Making intelligence legible.

Presence isn't ornament—it's the semantic bridge between “something smart is happening” and “I can intervene.” We treated indicators as a system: consistent posture in the menu bar, consistent semantics on-screen, and predictable escalation into explicit controls.

The goal was legibility without spectacle: calm defaults that intensify in informational density only when risk or ambiguity spikes.

The menu bar icon

A quiet heartbeat state when idle; shifts weight subtly during capture without mimicking surveillance recording LEDs.

Composite workflow: Slack DM with TechSupplies invoice PDF highlighted, QuickBooks vendor expense form prefilled for TechSupplies Inc. INV-892 $847.50, and Okto overlay showing recording timer plus recent tracked steps linking Slack to QuickBooks.

Screen-level reading highlights

Transient emphasis tied to comprehension—not decorative gradients—so users map AI attention to concrete UI regions.

Recent steps Okto tracked card: Recording armed with Joey's Slack DM and Vendor Expenses form centered; timestamp 2:08 PM.

Live comprehension feedback

Short, reversible summaries during capture that invite correction (“Did you mean this invoice cycle?”).

Mini preview control panel marked LIVE with preview stage, Mute and Watch actions, green Stop Share button, and red End button.

Controls as indicators

Pause, exclude window, and redaction aren’t buried settings—they’re part of how Okto proves manners.

Okto — final design screens

Recording mode

Conversation mode

Where we landed treats automation less like a black box and more like a teammate on a short leash: visible presence, interruptible actions, and language that mirrors how people already explain their jobs across tools.