NexusGPT

Enhancing first-crack AI agent usability
with guided prompt design

Timeline

2 months (Dec '24 - Jan '25)

My role

Product designer

Team

1 PD, 1 PM, 1 DEV

Status

Under-development

Background

1/7

Problem
3/7

AI agents aren’t finished when they’re deployed.
Just like human employees, they grow through feedback.

As they interact with real users, they encounter edge cases, learn new tasks, and develop blind spots.

But improving an agent isn’t as simple as checking a box. It means updating its brain — the prompt.

Problem
3/7
Problem
3/7

In theory, Nexus supports this loop—but in practice, the system makes this incredibly difficult.

But don't just take my word for it. Here's Justine, a customer support manager, to share what her experience was really like.

Problem
3/7
Justine's experience

Agent is deployed

Week 1

Issues start cropping up

Week 2-3

Fixes attempted

Week 4

Real consequences

Week 5

The agent works well and receives positive feedback.

Confusing responses. Edge cases encountered.

Unnecessary escalations to human support.

Repetitive or robotic tone.

Justine edits the agent’s prompt.

She’s overwhelmed by a wall of text.

She makes a hesitant change.

The agent’s logic breaks the next day.

Justine loses trust and stops editing.

😄

🤔

😥

💀

Justine's feelings throughout

Justine wasn't alone. Countless managers faced this same uncertainty every day.

The gap

Justine's story is far from unique. It exposes the fundamental flaws in how Nexus handles prompt maintenance, making agent improvement unnecessarily complex and risky.

Problem

2/7

Problem
3/7

We studied how users maintained agents and where their understanding fell short.

Problem
3/7

Lack of prompt expertise

Most managers don’t understand prompt engineering, leading to guesswork and ineffective updates.

Feedback isn't actionable

Feedback from conversations and tasks isn’t mapped to the right parts of the prompt, forcing users to hunt for issues blindly.

Fear of breaking things

Even small edits can cause unexpected failures, making users hesitant to improve their agents.

Problem
3/7

Beyond user challenges, the Nexus platform itself created friction that made prompt maintenance slow, risky, and unintuitive.

Problem
3/7

Unstructured prompts

Prompts were presented as dense walls of text, with no hierarchy or modularity, making it difficult to locate or edit the right sections.

Poor interface design

The prompt editor functioned like a plain text field, offering no tools or guidance tailored to prompt maintenance.

No rollbacks or safety nets

There was no version control or rollback option if changes caused performance deterioration, discouraging experimentation.

Our goal

It was clear that Nexus needed a simpler way to maintain and improve agent prompts. Our goal was to build a tool that is structured, easy to use and safe so managers can improve their agents with confidence.

Discovery

3/7

Problem
3/7

We have text editors for communicating with people. Code editors for instructing computers. But nothing yet for AI agents.

Typing started with a simple notepad, but over time it evolved to serve new use cases like writing, messaging, and coding. Now it has evolved to prompt-writing, and the next step in that evolution is here: prompt editors.

Problem
3/7
Problem
3/7

Learning how AI reads its instructions, and how formatting and structure affect it

AI reads many formatting cues the same way humans do — lists suggest order, capitalization signals importance, quotes indicate exact wording. The more intentional the structure of a prompt, the better the AI can follow it.

Problem
3/7

Return home

Back to work

Create a free website with Framer, the website builder loved by startups, designers and agencies.