Before You Build Your Agent Portfolio — Notes from the Jagged Frontier

The most common question I come across these days is: How do I capture the most value from my agentic portfolio?

I would flip the question around and start with first principles. Before any enterprise commits engineering resources to an agentic workflow, there are three questions that need honest answers: Should we build this at all? Can we build this well technically? And if we proceed, what does a responsible design look like?

This is an important distinction. Building a roadmap for a portfolio of agents without a clear prioritization lens wastes something far more valuable than engineering cycles: organizational trust in AI. Start by mapping your business-critical workflows. The ones worth pursuing share a common profile: clear business value, repeatable steps, and data you can actually trust.

Here is the framework I use to prioritize and implement an agentic roadmap.

Business considerations — should we build this at all?

This is the filter most teams skip, usually because they are working in a silo (shadow IT), are disconnected from business priorities (sidecar AI), or have unclear requirements (failed pilots). They see a workflow, imagine an agent, and start building.

Three questions stop bad projects before they start:

What is the business value of automating this workflow? Not in general terms, but in specific metrics mapped to your OKRs. Does automating this reduce cost, increase throughput, reduce error rate, or free up skilled human capacity for higher-value work? If the answer is positive only under significant assumptions, pause here.

What is the cost to execute this at scale? Agentic workflows are not free. LLM inference, orchestration, data retrieval, monitoring, and human review all have costs that compound with volume. A simple and cheap prototype can become expensive at production scale. Model the unit economics before building.
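To make the unit-economics question concrete, here is a minimal sketch of a per-run cost model. Every figure, field name, and the `value_per_run` parameter are my own illustrative assumptions, not benchmarks; swap in your own numbers.

```python
from dataclasses import dataclass


@dataclass
class RunCost:
    """Per-run cost components in dollars (all figures are illustrative)."""
    inference: float      # LLM calls for one run
    orchestration: float  # workflow engine and tool calls
    retrieval: float      # data lookups
    monitoring: float     # logging and tracing
    review_rate: float    # fraction of runs a human reviews (0..1)
    review_cost: float    # cost of one human review

    def per_run(self) -> float:
        machine = self.inference + self.orchestration + self.retrieval + self.monitoring
        return machine + self.review_rate * self.review_cost


def monthly_margin(cost: RunCost, runs_per_month: int, value_per_run: float) -> float:
    """Value created minus cost to execute, at a given monthly volume."""
    return runs_per_month * (value_per_run - cost.per_run())


# Hypothetical numbers for a single workflow.
cost = RunCost(inference=0.04, orchestration=0.01, retrieval=0.01,
               monitoring=0.01, review_rate=0.10, review_cost=2.00)
for volume in (1_000, 100_000):
    print(volume, round(monthly_margin(cost, volume, value_per_run=0.50), 2))
```

With these made-up figures, human review is the largest line item in the cost per run, which is a pattern a prototype rarely reveals because nobody reviews prototype output at volume.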

Is the organization ready, and is the workflow designed for it? The most common reason agentic workflows fail is that the agent is bolted onto an existing workflow rather than built into a redesigned one. Two things have to be true: the people or systems downstream need to trust the outputs enough to act on them, and the agent needs to trigger naturally at the right moment with the right context. Adoption is an architecture and upstream design problem, not a change management issue to be handled as an afterthought.

↗ Pass — all three pass, move to step 02

↘ Rethink — any one fails, fix root cause first, or stop

Architectural filters — can we build this technically?

Assuming the business case holds, three technical filters determine whether the workflow can be transformed with AI:

Can the workflow be decomposed into clear, deterministic steps? Agents work best on tasks that can be broken into discrete, verifiable sub-tasks with clear inputs and outputs. Workflows that require continuous human judgment, contextual improvisation, or ambiguous decision criteria are not agent-ready — they are copilot candidates at best.

Is the data feeding this agent reliable and connected? This is the most underestimated filter. An agent is only as good as the data it reasons over. If the underlying data is siloed, inconsistent, poorly governed, or stale, the agent will produce outputs that are confidently wrong. Data readiness is the prerequisite, not a production-time problem.
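One small piece of the data filter, staleness, can be checked mechanically. The sketch below assumes you can obtain a last-updated timestamp for each source; the source names and the 24-hour limit are hypothetical. It says nothing about consistency or governance, which need their own checks.

```python
from datetime import datetime, timedelta, timezone


def stale_sources(last_updated: dict[str, datetime],
                  max_age: timedelta,
                  now: datetime) -> list[str]:
    """Return the names of sources whose latest update is older than max_age."""
    return sorted(name for name, ts in last_updated.items() if now - ts > max_age)


# Hypothetical snapshot of three systems feeding one agent.
now = datetime(2025, 1, 15, 12, 0, tzinfo=timezone.utc)
sources = {
    "crm": now - timedelta(minutes=5),
    "billing": now - timedelta(hours=30),
    "ticketing": now - timedelta(hours=2),
}
failing = stale_sources(sources, max_age=timedelta(hours=24), now=now)
print("data gate:", "fail" if failing else "pass", failing)
```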

Which specific steps can actually be replaced by an agent — and which require human judgment? Not every step in an automatable workflow should be automated. Identifying the exact boundary between agent execution and human decision is a core design question. Workflows where that execution boundary is unclear should not be automated.

↗ Pass — all three pass, move to step 03

↘ Fail — any one fails, workflow is not agent-ready yet

Architectural design requirements — how do we build it well?

Passing the first two steps means you have workflows worth building. Now I put on my PM hat. This step determines whether you build it in a way that is safe, trustworthy, and improvable over time.

Modularity. Requirements change, models improve, and business logic evolves. A well-designed agentic system allows individual components to be upgraded, replaced, or reconfigured without rebuilding the whole. Design for change from the beginning.

Guardrails: security, governance, explainability. What can the agent access? What is it prohibited from doing? Can a human understand why it took a specific action? These are the conditions under which the organization will trust and scale the system — they should be part of the design from day one.
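As a rough illustration of the guardrail questions, here is a sketch of a policy check that runs before the agent acts. The action names, the `Policy` type, and the auto-pay limit are assumptions for the example; a real system would load them from governed configuration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Policy:
    allowed_actions: frozenset[str]
    auto_pay_limit: float  # hypothetical threshold, in the invoice currency


def check(policy: Policy, action: str, amount: float = 0.0) -> tuple[bool, str]:
    """Return (allowed, reason) so every denial can be explained to a human."""
    if action not in policy.allowed_actions:
        return False, f"action '{action}' is not on the allowlist"
    if action == "pay_invoice" and amount > policy.auto_pay_limit:
        return False, f"amount {amount:.2f} exceeds auto-pay limit {policy.auto_pay_limit:.2f}"
    return True, "within policy"


policy = Policy(allowed_actions=frozenset({"match_invoice", "pay_invoice"}),
                auto_pay_limit=10_000.0)
print(check(policy, "pay_invoice", amount=2_500.0))
print(check(policy, "pay_invoice", amount=25_000.0))
print(check(policy, "delete_vendor"))
```

Returning a reason alongside the verdict is a deliberate choice: it covers the access question and the explainability question in one place.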

Observability as a first-class requirement. You cannot govern what you cannot see. Real-time visibility into what the agent is doing, what data it is accessing, and what decisions it is making is not a monitoring feature added after launch. If you cannot instrument it from the start, you cannot operate it safely at scale.
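A minimal sketch of what instrumenting from the start can look like: one structured record per agent step, capturing the action, the data touched, and the decision. The field names and the example agent are my assumptions, not a standard schema.

```python
import json
from datetime import datetime, timezone


def audit_event(agent: str, action: str, data_accessed: list[str], decision: str) -> str:
    """Serialize one agent step as a JSON line for an append-only audit log."""
    event = {
        "ts": datetime.now(timezone.utc).isoformat(),
        "agent": agent,
        "action": action,
        "data_accessed": data_accessed,
        "decision": decision,
    }
    return json.dumps(event, sort_keys=True)


# Hypothetical step from an invoice-matching agent.
line = audit_event(
    agent="invoice-matcher",
    action="match_invoice",
    data_accessed=["erp.purchase_orders", "erp.invoices"],
    decision="matched",
)
print(line)
```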

Feedback loop and drift detection. Agents degrade over time as the world changes and their training data becomes stale. The system needs a mechanism to detect when outputs are declining in quality and a process to retrain, recalibrate, or escalate when that happens.
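One simple way to detect declining output quality is to compare a rolling mean of some quality score against a baseline. This is a sketch under the assumption that you already have such a score (for example, a reviewer acceptance rate); the baseline, tolerance, and window values are placeholders.

```python
from collections import deque


class DriftMonitor:
    """Flags drift when the rolling mean of a quality score drops below baseline - tolerance."""

    def __init__(self, baseline: float, tolerance: float, window: int):
        self.baseline = baseline
        self.tolerance = tolerance
        self.scores = deque(maxlen=window)

    def record(self, score: float) -> bool:
        """Add one score; return True once a full window shows a quality drop."""
        self.scores.append(score)
        if len(self.scores) < self.scores.maxlen:
            return False  # not enough evidence yet
        rolling_mean = sum(self.scores) / len(self.scores)
        return rolling_mean < self.baseline - self.tolerance


# Hypothetical weekly quality scores for one agent.
monitor = DriftMonitor(baseline=0.90, tolerance=0.05, window=5)
alerts = [monitor.record(s) for s in [0.92, 0.91, 0.89, 0.90, 0.88, 0.80, 0.70, 0.75]]
print(alerts)
```

What happens on an alert (retrain, recalibrate, or escalate) is a process decision; the monitor only tells you when to start that conversation.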

Human handoff protocol. When the agent stops — because it has reached its execution boundary, encountered an exception, or requires authorization — what exactly does it communicate to the human who takes over? In what format? With what context? The handoff is not just a failure state; it needs to be designed explicitly.
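The handoff questions above (what, in what format, with what context) map naturally onto an explicit payload. The sketch below is one possible shape; the field names and the invoice example are hypothetical.

```python
import json
from dataclasses import asdict, dataclass, field
from enum import Enum


class StopReason(Enum):
    BOUNDARY = "execution_boundary"
    EXCEPTION = "exception"
    AUTHORIZATION = "authorization_required"


@dataclass
class Handoff:
    workflow: str
    reason: StopReason
    summary: str
    completed_steps: list[str] = field(default_factory=list)
    pending_decision: str = ""
    evidence: dict[str, str] = field(default_factory=dict)

    def to_json(self) -> str:
        payload = asdict(self)
        payload["reason"] = self.reason.value  # plain string for downstream tools
        return json.dumps(payload, indent=2)


# Hypothetical handoff from an invoice-processing agent.
handoff = Handoff(
    workflow="invoice_processing",
    reason=StopReason.AUTHORIZATION,
    summary="Invoice matched to purchase order; amount is above the auto-pay threshold.",
    completed_steps=["extract_fields", "match_purchase_order"],
    pending_decision="Approve or reject payment",
    evidence={"invoice_id": "INV-EXAMPLE", "match_confidence": "0.97"},
)
print(handoff.to_json())
```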

Rollback design. When an agent makes a mistake — and it will — what is the minimum blast radius? Can the action be reversed? Can the system detect that something went wrong before downstream consequences compound? Systems that cannot be safely reversed should not be fully automated.
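One way to keep the blast radius small is to classify every action by reversibility before the agent runs it, and to treat anything unclassified as irreversible. The action catalog below is hypothetical.

```python
from enum import Enum


class Reversibility(Enum):
    REVERSIBLE = "reversible"
    IRREVERSIBLE = "irreversible"


# Hypothetical action catalog; anything not listed is treated as irreversible.
ACTION_CATALOG = {
    "restart_service": Reversibility.REVERSIBLE,
    "reassign_ticket": Reversibility.REVERSIBLE,
    "delete_mailbox": Reversibility.IRREVERSIBLE,
}


def decide(action: str) -> str:
    """Return 'execute' for reversible actions and 'escalate' for everything else."""
    kind = ACTION_CATALOG.get(action, Reversibility.IRREVERSIBLE)
    return "execute" if kind is Reversibility.REVERSIBLE else "escalate"


for action in ("restart_service", "delete_mailbox", "wipe_disk"):
    print(action, "->", decide(action))
```

Defaulting unknown actions to "escalate" is the conservative choice: a missing catalog entry costs a human a few minutes rather than costing the organization an unrecoverable mistake.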

↗ All six designed — proceed to build

↘ Any gap — address before shipping
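Taken together, the three steps behave like sequential gates: evaluate them in order and stop at the first failure. Here is a minimal sketch; the `Gate` type and the pass/fail values are illustrative assumptions, not an assessment of any real workflow.

```python
from dataclasses import dataclass


@dataclass
class Gate:
    name: str
    checks: dict[str, bool]

    def failed(self) -> list[str]:
        return [check for check, ok in self.checks.items() if not ok]


def evaluate(gates: list[Gate]) -> str:
    """Walk the gates in order and stop at the first one with a failing check."""
    for gate in gates:
        failures = gate.failed()
        if failures:
            return f"stop at {gate.name}: {', '.join(failures)}"
    return "proceed to build"


# Hypothetical assessment of a workflow with fragmented source data.
gates = [
    Gate("step 01", {"business value": True, "unit economics": True, "org readiness": True}),
    Gate("step 02", {"decomposability": True, "data quality gate": False, "replaceability": True}),
    Gate("step 03", {"modularity": True, "guardrails": True, "observability": True,
                     "feedback and drift": True, "human handoff": True, "rollback": True}),
]
print(evaluate(gates))
```

The ordering matters: there is no point scoring step 03 design work for a workflow that has not cleared steps 01 and 02.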

Workflow	Step 01	Step 02	Step 03 focus
Invoice processing (Finance · ERP)	✓ AP cost reduction clear ✓ Cost per run justified ✓ Linear workflow, easy to redesign	✓ Steps are deterministic ✓ Structured ERP data ✓ Matching is replaceable	Guardrails: no auto-pay above threshold. Rollback: payment reversal protocol required.
Sales forecasting (Revenue · GTM)	✓ Strategic value clear ✓ Economics justified ✓ Sales team ready	✓ Pattern steps clear ✓ CRM data reliable ✓ Scoring is replaceable	Handoff: human owns the final number; the agent surfaces, it does not decide. Drift: retrain as pipeline patterns shift quarterly.
IT ticket resolution (IT ops · Service desk)	✓ High volume, cost clear ✓ Volume justifies build ✓ IT team ready	✓ Most steps deterministic ✓ ITSM data reliable ⚠ Some actions irreversible	Rollback: classify reversibility before acting, with a hard stop on irreversible actions. Observability: track resolution quality, not just speed.

Workflow	Stops at	Why	What to do instead
Support ticket routing (CX · Service desk)	Step 02: Data quality gate	Ticket history fragmented across three systems with inconsistent update frequencies. Agent would produce confidently wrong routing decisions.	Consolidate ticket history into a single source of truth. Define a data governance model. Re-run step 02 in 60–90 days.
AI performance reviews (HR · People ops)	Step 01: Org readiness	Employees will not trust AI outputs on career decisions. Even if technically sound, adoption will fail or create organizational backlash.	Introduce AI as a copilot that surfaces data to managers. Build trust over 2–3 review cycles before considering further automation.
Real-time pricing agent (Commerce · Revenue ops)	Step 02: Data quality gate	Pricing data lives in three systems with different refresh rates. The agent would act on stale prices, creating customer trust and margin problems.	Unify pricing data into a single source with real-time refresh. Define data SLAs before re-running step 02.

[Figure: "Before You Build Your Agent Portfolio: A Go/No-Go Framework". Three stages with their checks. Business considerations (should we build this at all?): business value, unit economics, org readiness. Architectural filters (can we build this technically?): decomposability, data quality gate, replaceability. Architectural design requirements (how do we build it well?): human handoff, guardrails, rollback design, observability, feedback + drift, modularity. Includes a panel titled "A note on sequencing".]