
Notes from the field on what actually wins in enterprise AI “The Guide notes that the most expensive failure mode of an enterprise AI company is, against all reasonable expectations, not building the wrong model. It is building the right one, and then being unable to put it anywhere.” \ \ I work as a forward-deployed engineer at a Series B AI company that is building enterprise super intelligence. Over the last eighteen months, I've shipped deployments at large enterprise customers: freight brokers, logistics carriers, and real estate operators, and I've watched, from a privileged seat, how enterprise AI actually gets bought and put to work. \n Here is the thing nobody talks about clearly enough: \n The model is not the moat. \n The model is, increasingly, a commodity. So is the agent framework. So is the eval harness. So is the orchestration layer. Anyone with a credit card and a weekend can stand up a working v0 of an AI agent. The hard part of enterprise AI hasn't been "can you build it" for at least a year now. \n The hard part is everything that happens after the demo and before the contract renewal. \n That's the part nobody writes about, because it isn't sexy. It's the daily 15-minute standup with the ops manager who's testing the bot every morning before his shift. It's the IT lead who got you endpoints in 24 hours instead of three weeks. It's the edge case you caught in week two instead of week six. It's the moment you stop trying to sell the use case the customer signed for and start asking what actually hurts every single day. \n This is the deployment muscle. And it's the thing that decides which AI companies are still here in five years. \n What follows is a phase-by-phase walk through what that work actually looks like, told from inside the role that does it. \ \ The Demo, and Why Slides Are a Cry for Help “The Guide notes that a typical enterprise prospect sees ten demos a week and remembers, on average, one.” \ Here's an uncomfortable thing I've come to believe about enterprise demos: Most slide decks aren't a communication choice. They're a hiding spot. \n When the person doing the demo can't actually customize the product, can't build a working mock with their customer's data, can't stand up an interactive flow in an evening, they reach for slides. Slides are what you bring when you're not confident enough in the product to let it speak for itself. The audience can tell. They've seen ten demos this week. They know what hiding looks like. \ A working demo gets remembered. A slide deck gets nodded at. \n The mechanics of a demo that wins: \n The first five minutes do all the work. Not background, not company history, not "the problem we solve." Open with the most impressive thing the product can do, running live, on something that looks like the customer's world. Generic "Acme Co" placeholder data is the universal tell that you didn't prepare. Their site names. Their teams. Their workflow. Familiarity makes the product feel real before the technology has a chance to. \n Talk less than you think you should. People decide they're bored in under sixty seconds. Once they pick up their phone, you've lost them for the rest of the call. Let the product carry the room. Let silence be the thing that makes them ask the next question. The best demos I've ever run, I barely talked for the back half, the customer was asking questions, walking through their own edge cases, projecting themselves into using the product. By the end of the call they weren't deciding whether to buy. They were deciding which of their teams should pilot it first. \n Have the specifics ready, but don't lead with them. If you say "another customer saw massive value here," be ready to quantify it to two decimal places, whether that is hours saved, headcount avoided or revenue won/saved. But save the depth for when they ask. Going deep before they ask is how you turn a demo into a procurement conversation. And procurement conversations don't win pilots. \n A good demo isn't a presentation. It's a working product the audience can almost reach out and touch. \n If you find yourself reaching for slides, ask why. Most of the time the honest answer is: "Because I'm not sure I could pull off the demo without them." Fix that, not the slide deck. \n \ \ Discovery, and What a Wasted Meeting Looks Like A meeting with a customer is wasted if you walk out without doing one of two things: \n Disproving an assumption you walked in with. Getting measurably closer to going live. \ That's the bar. Everything else is theater. \n Most FDEs, myself included, on plenty of bad days fall into the same trap: the customer dodges a question, we nod, we tell ourselves we'll "follow up by email," and we never re-ask it. We're afraid of seeming like we didn't get it the first time. \n Ask it again anyway. They won't be annoyed. They'll respect that you actually care about getting this right. \ The discipline I've found works: Walk in with a discovery mindset, not a closing mindset. Try to eliminate this customer. If by the end of the call you can't, that's how you know they're real. The framing flips the power dynamic in ways that are hard to overstate. You stop performing. You start diagnosing. \ Ask "so what" until the answer points at money, time, or risk. Not "we'd be more efficient." How many people? How many hours? What does the manager get yelled at for today? Specifics or it didn't happen. \n Write down the assumption you most want to disprove before the meeting starts. If you don't disprove or confirm it, the meeting was wasted, and you shouldn’t schedule another one until you know how to get the most out of a meeting. \n Let them do 75% of the talking. Silence is digestion. Your job isn't to dump information; it's to make their pain obvious to them. \n You are not there to convince anyone of anything. You are there to find out whether there is a real problem, whether the person across from you has the authority and energy to solve it, and whether the solution looks anything like the product you have. Demos sell vision. Discovery sells pilots. \ \ Scoping, and the Door You Come In Through “The Guide observes that the first use case is rarely the use case. It is, however, the door.” \ A story I keep coming back to. A customer signed with us last year for an inbound voice AI use case, a bot that would handle inbound sales calls from leads, qualify them, and then if applicable make the sale. Strong ROI on paper. Their head of sales was clear, though: we don't hand customer relationships to a bot. That's our differentiator. Humans stay on those calls. \ Six months in, they paused the use case they'd originally signed for. \n It would have been easy to call that a loss. \n Instead we asked a different set of questions. What is the most painful, communications-heavy work your team does every single day? Where would adding a hundred more people not fix the problem but a tireless agent would? \n The answer was tracking. Track-and-trace. The unglamorous, repetitive, six-call-types-per-load work that an offshore team had been doing 24/7 and that nobody on the customer side wanted to think about. \ We shipped that workflow in six weeks. Today it tracks 12,000 loads a month, end-to-end. \n Here's the part that's actually interesting: that same customer doesn't think of us as their voice AI vendor anymore. We're their tracking operations engine. We're their exception management layer. We're rolling a real-time dashboard out from 20 to 120 of their users in the next month, and the conversation is no longer about "which workflow next", instead it's about which parts of their business should run on our platform. \n We came in through one door. We're now somewhere closer to the foundation. \n The principle worth taking from this is more useful than the story. The first use case is rarely the use case. It is the foot in the door. Optimize it for getting deployed, not for being important. Pick the workflow that puts you in the room every single day, even if the ROI on paper is less exciting. Daily contact is how trust compounds. Trust is what gets you the next workflow. \n And when a customer kills a use case mid-deployment, don't argue. Ask what hurts. The workflow they cancel and the workflow you end up running are almost never the same one. Be flexible enough to find the second one. \n You don't sell an AI operating system on day one. You come in through one door and you make yourself impossible to remove. \ \ The Pilot, and Why It Takes Longer Than It Should “The Guide notes that a "pilot" is, in theory, a controlled test of whether the technology works. In practice it is far more often a discovery period during which the vendor finds out what they should have known before the kickoff meeting.” \ Most enterprise AI pilots take longer than they need to. \ The reason usually isn't the model. It isn't the integrations. It isn't even the customer's process. The reason is that the team running the pilot is, by week three, still finding out things they should have figured out in week zero. \ A working frame: The pilot is not the place to test whether the product works. The pilot is the place to test whether your deployment works. \n That distinction is, I think, the entire game. \n Two specific things make pilots shorter: \ Pick pilots that test different aspects of your product. Not the same one. If you've already proven the agent can handle inbound calls for one customer, the next pilot should not be another inbound voice deployment. You know that works. Running it again is using six weeks of customer goodwill to confirm something you already learned. \n The valuable next pilot probes a different capability — outbound at scale, a multi-channel handoff, an integration shape you haven't seen, a workflow that combines voice and chat in a way the last one didn't. Each pilot should test a new category of unknown. If you've got three live pilots all testing voice quality, you're running one pilot three times. \n Capture the known edge cases before UAT. All of them. If the customer surfaces an edge case during UAT that you could have predicted and pre-tested, that's not the customer catching a bug. That's you failing to do your homework. \n The edge cases that ship a pilot late are almost never exotic. They're things like what happens when the caller hangs up mid-form, or what the scheduler does on a holiday the customer's region observes but you didn't think to check, or what the transfer step assumes about the customer's CRM field structure that turned out to be different from the last customer's. \n Any of these, run internally before UAT, take a day. Caught in UAT, they cost a week. Caught in production, they cost more than that and they cost trust. \n The discipline is straightforward but underused: before the pilot kicks off, sit down and write every edge case you can think of. Walk through the workflow with someone who's never seen it. Stress-test the inputs you don't normally stress-test. The pilot timeline is for catching the edge cases you couldn't have anticipated. Not the ones you could have. \ A six-week pilot can become a three-week pilot if you do this well. A three-week pilot can become a kickoff-to-go-live. \n The shortest pilots aren't run by the fastest engineers. They're run by the engineers who showed up to day one with the most questions already answered. \ \ The Six-Week Deployment, and Who's Actually in the Room There's a lot of mythology around how fast AI can ship. Most of it is wrong. \n Models don't make a deployment fast. Frameworks don't make a deployment fast. Even the engineer doesn't, really, make a deployment fast. \n The customer makes a deployment fast. \n The best six-week deployment I've ever been part of had four ingredients, and I had nothing to do with three of them. \ Two engineers. No more. Adding people slows you down. The hand-offs alone consume the speed gain. Two people who can hold the entire system in their heads at once will outrun a team of six every time. \ A 15-minute standup every single morning. Not a weekly check-in. Not a Wednesday demo. Fifteen minutes, every day, with the same three people on the customer side: someone from IT, someone from ops, and someone who actually does the work the agent is replacing. Anything that came up the day before gets resolved before lunch. \n A customer who tests, not a customer who reviews. This is the unlock most enterprise AI deployments never get. The head of their ops team called the bot every morning before his own day started. He'd send back voice notes, transcripts, edge cases. We debugged in real time. By the time we shipped, that agent had been stress-tested harder than most new hires ever get before they're put on the phones. \ A clear "what good looks like." A scoped set of call types, a scoped set of integrations, a date on the calendar. Bug fixes always, that's the deal. New use cases later, that's the next deployment. \ The thing that surprised me most isn't on this list, and it's worth saying directly: the customer's people were what made this work. The IT lead who got us endpoints in 24 hours instead of three weeks. The ops manager who showed up at the standup every single day for six weeks straight. The exec who held the line on a clean scope when other parts of the business wanted to bolt on more. \ If you're an FDE about to start a deployment and you're trying to predict how it'll go, don't look at the model. Don't look at the integration list. Look at who's in the daily standup. \n If three of the right people are showing up, you'll ship. If they're not, no model in the world is fast enough to save the timeline. \ \ On the Plane Ticket “The Guide notes that the most underrated piece of equipment in the modern FDE's toolkit is, against all reasonable expectation, a plane ticket.” \ A paradox I keep running into: \ As software gets easier to build, showing up in person matters more, not less. \n The instinct is the opposite. Models are stronger every quarter, agentic tooling is exploding, you can stand up a working v0 of almost anything in a weekend. The natural conclusion is that the FDE role is about to get more remote, more leveraged, more virtualized. \ It's not. \ Here's what actually happens when the software gets easier: the moat shifts. It moves out of "can you build it" and into "do you know what the customer actually needs you to build." And the most important question, what's actually worth building , is essentially impossible to answer from a Zoom call. \n A few things I've only ever learned by being in the room: \n The thing the executive said the team needed, and the thing the operator actually does all day, are different things. Sometimes radically different. You will not find out which one is real until you sit next to the operator for a few hours. \ The friction that nobody mentions in scoping calls is the one that kills the deployment. It's the legacy app nobody complains about because everyone's resigned to it. It's the form the dispatcher fills out on paper before they log it. It's the Slack channel the ops team uses to work around the official system. You only see this stuff in person. \n Conviction transfers through a room. It doesn't transfer through a screen. A customer who's spent two days with you in their office, watching you build a v0 overnight, is a customer who knows you can ship. A customer who's seen you on six Zooms is a customer who's seen you on six Zooms. \n You learn what the company actually feels like. Whether people respect each other. Whether the energy in the room is "we want to do this" or "we have to be seen to do this." None of this shows up in a transcript. \n The deeper version of this is about depth, not just travel. An FDE spread across five accounts can't transform anything in any of them. The math doesn't work, there aren't enough hours, not enough context, not enough time at any one desk to find the use case that matters. The best FDE work I've seen comes from people who go deep on one or two customers and stay there long enough to become honorary employees. \n That's not a model that scales by adding more meetings. It scales by getting on more planes. \n The next few years of enterprise AI will be won by the teams that are willing to do unglamorous, physical, in-person work, sit with the operator, eat lunch with the ops team, show up at the office on a Tuesday while shipping software faster than anyone else. Not despite the software getting easier. Because of it. \ \ The Moat Is Deployment If you've read this far, here's the part that matters. \ The next decade of enterprise AI is going to be ugly for a lot of companies that thought they were in the model business. Every advantage that has historically lived inside a model, capability, latency, cost, reliability, is collapsing toward parity. The frontier moves; everyone gets there eventually. The cycle is measured in months, not years. \n What doesn't collapse toward parity is the muscle to put a model into a customer's actual workflow, automate tasks end to end, integrate with their actual systems, and stay there through reorgs and shifting business priorities. That's not a model capability. That's an organizational capability. \n It looks like: \n A demo team that can build a working v0 of the customer's workflow in an evening, not a slide deck. An FDE org structured for depth, not breadth, people who go on planes, sit at desks, become honorary employees of the customer they're embedded with. A scoping discipline that picks the workflow that gets you in the room daily, not the workflow that gives the AE the biggest commission. A pilot process that catches its own edge cases, so UAT confirms instead of discovers. A pricing model with negotiating surface area, so both sides can win. A six-week deployment muscle built around the right three people in the customer's daily standup. None of that comes from the model. All of it comes from the deployment. The companies that win the next wave of enterprise AI won't be the ones with the best evals. They'll be the ones who figured out, in some uncomfortably unsexy way, how to build the deployment muscle that turns a working agent into a customer's operating system. It will not look glamorous from the outside. It will look like a thirty-year-old engineer on a Tuesday in Chicago, sitting next to an ops user, watching them fill out a form on paper, asking the same question three times until the answer points at money, time, or risk. \n That's the moat. \n Don't panic. Get on the plane. \n \
View original source — Hacker Noon ↗

