Making agents work
Applying Work, Friction, and Waste to our human-AI allocation // officially retiring the DeejDrone 9000 đ˘
This is the last post in a three-part series on which parts of work are valuable. It started with creating a ridiculous biodiesel drone to show how AI can generate perfect-looking garbage, then updating the Work-Friction-Waste framework to spot which activities actually matter.
Now, let's put it all together to figure out how to actually use agents without destroying what makes your team valuable.
If agents can't replace PMs, what can they do? Lots, as it turns out. Just not what you'd expect from following the hype train.
The Work of PMs
So, we know how Work works, and we know the PRD isnât the Work, but the Friction, for the PM. What does that mean for the VP who wants to assign agents as PMs?
Good news and bad news. Bad news first: you still need PMs.
The danger of replacing PMs with agents - at least in this scenario - is threefold:
The agent making PRDs is not doing the Work. Itâs handling the Friction (which - can be a good thing! But more on that in a sec).
The person prompting the agent is imposing all of their biases onto the agent, and disguising those biases as good ideas with rigor and research. (Like I did with the DeejDrone 9000). The key insight here is that none of my ideas were any better after they were put into the PRD than they were before: I didnât do any Work to validate anything. But now that theyâre in PRD format, itâs way harder to tell that theyâre terrible. Friction that doesnât carry any Work is Waste, by definition.
To top it all off, because weâve replaced all our PMs with agents, the only people who can see through my PRDs and tell that they donât have any merit are the people we let go. We lost our BS detectors, so we canât tell when weâre lying to ourselves anymore1.
But letâs say I know PRDs arenât the end goal of a PM. Can I train an agent to be a PM if I open up the scope?
Again, probably not, not yet. Agents arenât great at the Work PMs do. It takes some savvy to handle discovering the Voice of the Customer and reading between the lines, and humans tend to be more effusive and expressive in front of other humans. (Ever âinterviewed yourselfâ for a job with one of those automated video systems, or trialed one you were considering using? Youâre not likely to âwarm up to the interviewerâ during the process; people still like talking to other people).
You need the LLM to have a thesis, interrogate the thesis, and subtly test their thesis while also looking out for âunknown unknownsâ during these conversations⌠but the state-of-the-art realtime voice call agents these days tend to be just a little better than call center phone trees.
PRHs and Product Gap Agents
Good news: agents can still help, a lot. Thereâs a ton of ways to throw agentic automations against Waste and Friction in the average Product org, but for this conversation, we followed the PRD thread specifically.
First: low-hanging fruit. PM comes back from their customer interviews with notes, dumps them into an LLM, and the LLM summarizes them into PRD format. Thatâs not agentic behavior by any means, but itâs still pulling Friction out of the PRD process, assuming you can fact-check in less time than writing the document outright.
But letâs get more creative and find more value. Start from the point where your PMs go talk to customers and work backwards. The output of all of that is going to be a PRD; but is there a utility to having a PRD as an input too?
I could see it - for the sake of argument, letâs call it a PRH - a Product Requirements Hypothesis:
Try this out: take the exact same format, but fill it in based on your initial best guess - what do you think the right answers are going to be? Generate a PRD as though you knew the right answer based on your âday zeroâ knowledge - using an agent to research across the enterpriseâs data sources - and then react to it. Do the details and assumptions and logical conclusions match what you expected?
Iterate on this and expose what would make your hypothesis wrong explicitly in the document. This step is key: otherwise youâre just biasing yourself, poisoning your own well. The end state for a PRH is not a draft PRD - youâre gonna redo all of this - but an experimental survey design that explains exactly what you think the options are for each element of the product design, the sensitivity and impacts of each decision change, and the things you want to listen for and explore in conversations.
The final piece of the PRH is the customer inventory - the conversations that you want to have that could get at the level of detail youâre looking for. Again, an LLM can help you run through CRMs or CX datasets to check for specific info, or at least give you guesses that you can react to and reject or edit. Youâre looking to get more prescriptive on who to talk to - rather than âwhoever my CSMs proffer upâ - so you can test really specific elements of your hypothesis (or, also interestingly, find out that the things you expected to find arenât as easily-discoverable as you hoped)2.
This would result in - in my experience - way more prep for customer conversations than most PMs walk in the door with3. If youâre sitting there testing for both high-level questions and a bunch of little details that you wouldnât have thought of but could materially affect the end picture, the agentâs done a lot of actual Work helping you prime yourself to extract Customer Voice from what could be otherwise less-structured conversations4.
Ok - then letâs go one step earlier. How do I prime a PRH? Again, I really donât want to set up a system where I take my innate biases about the product or the business and polish them up so that no one else can tell that theyâre hollow and backed by nothing. Iâd love to set up chains of events that feed me good triggers and data to inform my hypotheses: my agents should be setting me up for success.
The #1 place that this breaks down in most customer orgs is the product gaps process. Product gaps reporting is often sketchy at best, even among companies that design products for reporting product gaps! The feeds of information are inconsistent, the identity deduplication is rare, the supporting data is scarce, prioritization is done by squeakiest wheel, and at a certain length of backlog no automated gap feed is actually entering the working memory of any team member until a long-overdue-hygiene process goes through and flushes months-old cards off the board.
But an agent, operating in the right environment with the right framework, could do a couple interesting things:
They could (obviously) feed and organize new product gaps into the right streams at the right times. Thatâs easy. Thatâs not even agent work - thatâs API + WHERE statements, maybe embeds to make the sorting less fragile.
They could reference existing projects or PRHs and ask requestors or gap filers targeted questions at the time of filing - what was the source of the bug? Where does the enterprise customer want this API to feed into? You filed 12 P0 gaps - if I can only do 3 this quarter, which ones would you vote for5?
They could build and maintain profiles on the requestors - both the internal parties, and the actual end requestors (the customers, the Twitter personalities, the vendors, the partners) to build context over time, so that gaps get richer over time. Lots of folks have pined wistfully for the opportunity to do this over my career, basically no one has the time to do it. Prime agent work - stuff humans can only conspire but to dream of because they have actual shit to do.
When it comes time for customer interviews or CABs, agents can shortlist the requestors (filtering for things like red accounts and churn) closing the loop on the information pull6.
My prompt to the VP at this point - how do you go one step before this? How do you use agents to prime a product gap process that feeds you great information and returns great sentiment? I wonât go into the answers here: I think the pointâs been made and the ideas here start getting very tactical and very real. Itâs much more exciting and useful than I, PMBot: the energy in the room is different because weâre targeting real problems with precise solutions instead of employing unguided AI for its own sake.
Bias for action
I want to close on a note my cofounder Deji brought up when we were discussing this concept. He mentioned his initial gut reaction: doesnât this push back against conventional startup wisdom to just go do something?
Thereâs the classic cliche âbias for action.â Brian Armstrong famously put this as âaction generates informationâ - when youâre pre-PMF, taking action is the best way to learn, even if youâre not sure itâs the right action7.
I like that. I donât think the frameworks clash. Instead, Iâd say the two mesh with one simple phrase: donât confuse activity for action. Or, if I were to paraphrase Brianâs quote into our framework: âAll Work generates information: any Work, even the wrong Work. Waste does not.â
Itâs easy to run around looking busy, doing productivity theater, making yourself feel busy, doing everything but what matters most. Thatâs common for larger businesses layering on process for the sake of process; thatâs equally common for small startups procrastinating and doing everything but getting in the van, talking to customers, and building what they want. If you spend all your time as a startup on Waste, youâre not taking real action, and youâre not getting real information.
The only difference for pre- and post-PMF companies is that, before you know your customers want to pay you for your Work, you just have to take your best guess. Itâs still critical to minimize things that arenât Work or supporting it - maybe more so - because you need to try so many different types of Work until you find the right one.
Work is the highest revenue-bearing activity. Work is also the highest information-bearing activity. Friction is your Work delivery vehicle.
Everything else is just in your way.
Framing exposes opportunities
Is the point of all this that you should turn your Product team onto making PRHs now? No. Itâs an experiment, we ran it once, the juryâs still out on if itâs worth the effort or not. Feel free to play with it and let me know if it fits with how you build.
The actual point is: see the difference better framing makes? By lining up agents to destroy Waste, improve Work outputs, and cycle out Friction to improve throughput, we get a way different Product org that makes the PMs feel like theyâre omniscient. The alternative - having agents make fake PRDs - doesnât feel remotely as cool in comparison.
It's intellectually easy to swap humans for AI 1-for-1. It feels tempting to take all your Waste and task an LLM to do it for you. But I guarantee it's worse. There are better problems to target, and you can spot them using this framework. Tap your AI to eliminate Waste, minimize Friction, and improve your real Work.
Every company using AI is about to learn this lesson. The question is whether they'll learn it the easy way or the hard way.
Youâll be able to tell by how many DeejDrones they launch.
Coming off of Inbound, there have been a lot of cool conversations about how people in Sales & Marketing roles are using agents productively - and not - and Iâll try to bring this back to earth a little bit and share what Iâm hearing starting tomorrow.
We talked about hypocognition in an earlier post - this is sort of hypocognition-by-design, and itâs pretty dangerous.
We had a saying when I was in the Air Force - ânegative intelligence is still intelligence.â Basically - donât discount when you find out that something you wanted to learn cannot be learned.
Example of a simpler customer interview template. Nothingâs wrong with it, but thereâs nothing setting you up to challenge exactly what you need to evaluate to get your product right; itâs mostly a blank piece of paper, ideal if you have no idea what your next product looks like, but the more you know about what you want to test, the weaker a template like this is. You can fill a template this generic out, exactly as written, without testing a single hypothesis. A PRH done well might guardrail you away from that outcome.
Hereâs the draft PRH we made based on the DeejDrone concept: C003A2 - DD9K PRH. Again - the droneâs still a bad idea. But the template now is designed to expose that in customer interviews and get us detailed information about the viability of going into the commercial drone industry at all, and where the nuances are. This template filled in, plus the interview notes, turned into an actual PRD, will be infinitely better than the result at the beginning of the series.
This is, by the way, the exact reason I dislike the P# system and why we donât use it.
Your product gaps process may be wildly different than these folks, but we set up an agent with fairly straightforward system instructions to run off a Slack webhook and feed disparate gap reports back to the Product team with some basic enrichment, which was enough to get the wheels turning on other long-neglected problems: https://github.com/centaurprise/deej-drone-9000/tree/main/agents
The original interview is here: