Leadership Told You to Look Into AI

A product manager overwhelmed by AI: a faceless figure surrounded by question marks, tangled cables, and scattered papers, with ChatGPT and a PRD on screens behind him.

Your CEO just asked you to look into AI and see how it can assist. As a product manager, you may have already dabbled in AI or you may be highly experienced with AI. Whether you’re AI-fluent or still wondering whether AI can make a difference, your CEO made this request because she hasn’t noticed a difference in your productivity or the quality of your work. This essay is a stepwise guide, starting at the beginning. You can skip ahead to your place in the journey if you’re already navigating this new industrial revolution.

For the beginner, you may wonder where to start. Perhaps you’ve tried ChatGPT once or twice and weren’t impressed. Maybe you watched a colleague type a clever prompt and felt like you missed the workshop. Maybe you’ve been quietly hoping the topic would resolve itself before you had to engage with it.

Here’s the part worth knowing: most PMs are exactly where you are right now. The gap between haven’t started and operating AI-native dailyis smaller than the discourse suggests — a few months of deliberate use, not a graduate degree. The reason most people don’t close it is that they think they need a strategy before they can begin.

What follows is a starter path that takes you from zero to fluent without ceremony.

The starter path

A confident, AI-native product manager smiling at his desk, surrounded by glowing AI icons, lightbulbs, a success flow chart, and a 'User Growth +20%' metric.

Pick one tool and commit to it for two weeks.
Use it on tasks you’re already doing — don’t invent new ones.
Learn iterative prompting, not exhaustive prompting.
Start with synthesis tasks before generation tasks.
Apply senior product judgment to every output.
Build a personal prompt library as you go.
Add stress-testing and pre-mortem work to your kit.
Stay honest about what AI gets wrong.

The details

1. Pick one tool and commit to it for two weeks.

The market has a dozen credible LLM products right now. The differences between major LLM tools are real but smaller than marketing suggests, and they shift constantly as new models release. Pick one and use it for everything for two weeks before you try a second. The compounding benefit of mastery on one tool dominates the marginal benefit of having sampled three.

I used Perplexity for everything until this year. Now I’m using Claude Code. But I recently tried Grok to successfully troubleshoot and adjust an outboard motor on a friend’s sailboat with zero experience in boats or engines. The point is, pick one and use it for everything. Not just work. Recipes, home repair, learning more about a historical character, anything you like.

For most PMs, the right starter tool is Claude (claude.ai) or ChatGPT (chatgpt.com). Both have free tiers sufficient for the first two weeks. Don’t pay for anything until you know what you’d use the paid tier for.

What “commit” means concretely: open the tool when you start your day, leave the tab open, and reach for it first whenever you encounter a task that involves reading, summarizing, drafting, or thinking through tradeoffs. Treat reaching for it as the new default, not the exception.

2. Use it on tasks you’re already doing.

This is the single most important step. The temptation when learning AI tools is to invent new things to do with them — build a custom agent, automate a workflow, prototype a feature. Don’t. Take your existing tasks and run them through the tool first. The reason: you already know what the output should look like for those tasks, so you can evaluate quality cleanly. This novel technology requires your hearty skepticism. Is the tool doing it right?

I wanted to be sure, so I ran the top five LLM products through logic and bias tests a couple years ago. The tools are getting better all the time, but bias is still a major risk to watch for.

Tasks worth trying in your first week:

Enter recent meeting notes or a transcript and ask for a list of action items.
Import a quarter’s worth of customer feedback and instruct AI to surface the major themes and cluster the feedback by theme.
Enter the details you know about a new product or feature and instruct AI to draft a PRD in DOCX format. This will give you a well-formatted doc with some details already filled in. If your chosen LLM tool doesn’t ask you follow-up questions to get the missing details, ask AI for those questions.
Enter a confusing Slack message or email and instruct AI to rewrite it for executive consumption.
Explain a roadmap decision you’re considering and instruct AI to generate three counterarguments.
Tell AI about your company and product. Then ask AI to identify extant competitors. Then instruct AI to research the competition and create a competitive-feature matrix.

For each experiment, note how long it took and compare that to how long it would have taken you without AI. The savings compound quickly.

3. Learn iterative prompting, not exhaustive prompting.

New users typically write huge, detailed prompts hoping for one perfect response. Experienced users write small prompts, evaluate the output, refine the prompt, and iterate three or four times. The conversation is the tool, not the opening message.

Start short. “Cluster these 30 customer interview notes into five to seven themes” is a fine first prompt. Read the output. If a theme is too broad, say so: “Theme 3 is too broad. Split it into two.” If you’d rather have data points than abstractions: “Replace each theme with a verbatim quote that captures it.” Three exchanges produce vastly better output than one exhaustive prompt.

Aim for specificity. If you’re comfortable writing a highly specific user story, then go ahead. But you don’t have to at this stage.

This is also where most people give up too early. After one disappointing response, they conclude the tool can’t do the task. The right response is to refine. Treat the first output as a starting point, not a finished product. When the LLM tool responds incorrectly, you should respond with clarification. This is important: if the LLM tool you’re using continues to apologize and still gets it wrong after a few attempts, make a note and try a different model or a different tool.

4. Start with synthesis tasks before generation tasks.

Synthesis tasks — here’s input, produce a structured summary— are easier to learn on than generation tasks — write me a new thing. The reason is evaluation: you can quickly verify whether a summary captures the source material; you can’t easily verify whether a from-scratch PRD is good without applying senior judgment.

Synthesis tasks worth practicing first:

Customer interview transcripts → themes
Long document → executive summary
Meeting transcript → action items
Competitor blog post → key claims and counter-arguments
Stakeholder feedback → prioritized concerns
Engineering design doc → product-level translation for non-technical stakeholders

Once these feel natural (typically four to five days of daily use), expand to generation tasks: drafting PRDs, writing positioning statements, building roadmap narratives, sketching first-draft launch announcements.

5. Apply senior product judgment to every output.

The single biggest failure mode is treating the model’s output as conclusion rather than input. The model can summarize, cluster, draft, restructure, and surface patterns. It cannot decide which patterns matter for your customer in your market under yourbusiness pressure. PMs who ship product based on unfiltered model output ship product that looks coherent and isn’t.

Treat every AI output as a draft a junior PM handed you. Read it critically. Edit. Push back. Ask what’s missing. Ask what assumption it made that you would not have made. If you wouldn’t ship a junior PM’s first draft without changes, don’t ship the model’s either.

This isn’t about distrust (though you shouldn’t have many reasons to trust the LLM at this stage). It’s about the right division of labor. The model accelerates the work; you provide the judgment. That division is what makes AI productive rather than dangerous.

6. Build a personal prompt library as you go.

When a prompt works well, save it. After two to three weeks you’ll have a working playbook for the dozen tasks that consume most of your time. A Notion page or a Google doc is sufficient — no need to over-engineer.

Also, stick with the thread until you’re finished with a topic or task. AI will benefit from the context, even and especially if you change your mind midstream. AI won’t lose the thread and may prompt you with questions about whether to keep or abandon early instructions.

A starter library structure:

Synthesis prompts— interview clustering, meeting summary, document compression, theme extraction
Drafting prompts— PRD section, stakeholder update, executive narrative, launch announcement
Critique prompts — “argue against this approach,” “find the weakest assumption in this PRD,” “what would a regulator, auditor, or skeptic ask?”
Decision support— pros and cons of a roadmap choice, second-order effects of a feature decision, what would change if we cut scope by 30 percent?

Reusing prompts isn’t lazy. It’s the equivalent of keeping a checklist. The first version of any prompt is almost always the worst version; the third or fourth version is the keeper.

7. Add stress-testing and pre-mortem work to your kit.

Once basic synthesis and generation are comfortable, the highest-leverage use of AI in product work is stress-testing decisions before you ship them. Run a draft PRD through the lenses of an aggressive engineer, a skeptical executive, a worried compliance officer, and a confused customer. The model will surface concerns you might otherwise miss until production.

This is where AI moves from productivity hack to judgment recalibration. You start asking different questions of your own work because the cost of asking dropped to near zero.

Example prompts:

“What’s the most aggressive critique an engineer would make of this approach?”
“What questions would a customer ask after this feature ships that I haven’t accounted for?”
“If this feature is wrong five percent of the time, what is the impact? What’s the customer’s recourse and how do we handle it?”
“What’s the case for doing nothing instead of shipping this?”
“What would a regulator or auditor ask about this decision a year from now?”

You won’t act on every answer. But you’ll notice the ones that matter, and the model will surface them faster than your inner critic would.

8. Stay honest about what AI gets wrong and remain skeptical.

The model will be confidently wrong sometimes. It will hallucinate facts, citations, and product capabilities that don’t exist. It will agree with you too readily when you push back. It will miss context you didn’t give it. It will be worse at math than a calculator, worse at recent facts than a search engine, and worse at your specific company’s product than someone who actually works there.

Know these failure modes and check for them. The skill isn’t trusting the model; it’s calibrating exactly how much to trust which kinds of output. That calibration takes practice. It doesn’t take a PhD.

When you catch a confident hallucination, don’t conclude the tool is useless. Conclude that this is one task or one topic where the model needs more guardrails. Sometimes the fix is feeding it source material instead of asking from memory. Sometimes it’s asking a different question. Sometimes it’s doing the task yourself because the marginal value of AI assistance is too low.

What two weeks of practice will give you

Two weeks from now, you’ll feel different about AI as a working tool. Not because you’ll have learned all of it — nobody has — but because you’ll know what it does well, what it does poorly, and how to use the parts that work for the tasks you have. That’s the threshold leadership was pointing at when they asked you to “look into” the topic. Cross it once, and the rest builds naturally.

The PMs who become AI-native first in any organization aren’t necessarily the most technical ones. They’re the ones who started before they felt ready. Start today.

The Next Stage — once you’ve graduated from web-UI chat

As you become comfortable, you’re ready for the next commitment. Buy a subscription and install your LLM tool of choice on your computer. LLM tools have usage limits for each tier and you won’t want to be limited as you begin to generate docs, models, wire-frames, system diagrams, prototypes, and more.

1. Install your LLM tool natively and pay for it. Run it where it can read and write files on your machine, not just chat through a browser tab. This is the unlock for everything below. Web UI plateaus; an installed agent doesn’t.

2. Documents.Have AI draft, maintain, and revise full PRDs and decision memos as working documents — not just sections you copy into Word. The shift from “AI helps me write the PRD” to “AI keeps the PRD current as the spec evolves” is bigger than it sounds.

3. Planning artifacts.Generate quarterly or annual roadmaps from your contextual documents — strategy memos, customer research, competitive positioning, prior commitments. Break strategic themes into epics, and epics into user stories with explicit acceptance criteria. Draft user journeys that span discovery, activation, value moment, and retention. Each used to take a PM days of solo synthesis; with AI maintaining the working draft against your context, they take hours and stay current as the context shifts. The judgment about whichthemes belong on the roadmap is still yours — but the drafting that used to gate the conversation is now cheap.

4. Models.Build a one-page ROI or cost model for a feature you’re considering, with inputs you can adjust and assumptions called out explicitly. Pricing, adoption curve, support cost, churn impact. AI handles the math; you defend the assumptions.

5. Wire-frames.Produce low-fidelity UI mockups from a written description, then iterate on them with stakeholder feedback before engineering ever sees them. You reach a UX conversation that used to require a designer’s calendar.

6. System diagrams.Generate a data-flow or architecture diagram for a feature — sources, transformations, downstream consumers, retention requirements — and use it to interrogate engineering’s design. Especially load-bearing in regulated categories, where the diagram itself often becomes the audit artifact.

7. Prototypes. Have AI build a clickable prototype in HTML or React and validate UX with customers before you finalize the PRD. The PRD that follows is a different document than one drafted from a hunch.

Stress-testing and judgment — where AI moves from productivity to recalibration

8. Run any non-trivial decision through four critic lenses in a single sitting — aggressive engineer, skeptical executive, worried compliance officer, confused customer. The cost of stress-testing drops to near-zero. You start asking different questions of your own work because the cost of asking drops.

9. Use AI to read your team’s codebase before evaluating a feasibility tradeoff. Most PMs declare technical decisions out of scope because reading code costs them too much. That cost dropped. You can now hold an opinion about whether the proposed architecture matches the product intent — before engineering finishes the design doc.

AI-native — judgment reshaped by the tools at hand

10. Build a custom Project (or GPT) scoped to your product area — loaded with your context documents, brand voice, customer language, recurring patterns, and your team’s vocabulary. The tool becomes calibrated to your work specifically rather than to PM work generically. Output quality jumps and stays jumped.

11. Redesign your PRD template, customer research cadence, and stakeholder-update format on the assumption that AI is part of the workflow. When the tools shift, the rituals around them should shift too. Most don’t, because the rituals predate the tools. This is the work most teams skip.

12. Recalibrate your definition of “ready to ship” to reflect what AI-assisted features can and cannot defensibly do in production. This is the AI-native threshold — distinct from competent AI use. Product judgment reshaped by having LLM-shaped capabilities available, and a definition of done that reflects that shift.