Inconsistent results
Problem: Different output each time
You run the same prompt on the same document but get different results. Common causes:- Prompt is too vague or open-ended
- Missing essential context
- Relying on AI to infer rather than instruct
Problem: Quality varies wildly
Sometimes the AI nails it, sometimes it’s completely off. Common causes:- Prompt works for some document types but not others
- Context requirements vary by situation
- Model updates affecting performance
- Create document-specific prompts rather than universal ones
- Add conditional logic: “If customer is government entity, then…”
- Test prompts regularly and update as needed
Wrong focus
Problem: AI focuses on wrong issues
The AI obsesses over formatting while missing major liability issues. Common causes:- No priority guidance provided
- Equal weight given to all issues
- Missing risk framework
Problem: Too much or too little detail
Getting novels when you need summaries, or single sentences when you need analysis. Common causes:- No length specification
- Audience unclear
- Purpose undefined
Context problems
Problem: AI doesn’t understand the situation
The AI gives generic advice that doesn’t fit your specific circumstances. Common causes:- Insufficient context provided
- Wrong context emphasized
- Contradictory information given
Problem: AI ignores critical context
You mentioned you’re in healthcare but the AI ignores HIPAA requirements. Common causes:- Context buried in long prompt
- Competing contexts confusing priority
- Key context mentioned too late
- Lead with critical context
- Repeat important constraints
- Use headers to organize context clearly
Output format issues
Problem: Wrong format for needs
Getting prose when you need a table, or bullets when you need an email. Common causes:- Format not specified
- Conflicting format instructions
- AI defaulting to its preferences
Problem: Information scattered
Important points buried throughout long responses. Common causes:- No structure requested
- Multiple asks in one prompt
- AI trying to be comprehensive
Accuracy issues
Problem: AI misses important provisions
The AI doesn’t catch critical clauses. Common causes:- Rule/prompt too narrow
- Keywords not matching document language
- Important sections in unexpected places
- Broaden search terms: “audit, inspection, review, examination”
- Check entire document: “Review all sections including exhibits”
- Be explicit: “Check for liability caps anywhere in document”
Problem: False positives
AI flags issues that aren’t really problems. Common causes:- Prompt too broad
- Missing negative prompting
- No threshold for materiality
Performance issues
Problem: AI takes forever
Response times are unusually long. Common causes:- Overly complex prompts
- Multiple large documents
- Requesting too much in one prompt
- Break into smaller requests
- Process documents separately
- Simplify prompt structure
Problem: AI cuts off mid-response
Responses end abruptly without completing the task. Common causes:- Output length limits reached
- Token limits exceeded
- Complex chains causing timeouts
- Request more concise output
- Break into multiple prompts
- Ask for summary first, details separately
Model-specific issues
Problem: Prompt works in one model but not another
Different AI models behave differently. Common causes:- Models trained differently
- Format preferences vary
- Instruction interpretation differs
- Adjust formatting for the specific model you’re using
- Test prompts on your target platform
- Use model-appropriate structure (some models prefer markdown headers, others respond better to XML tags)
Problem: Quality degraded after update
Previously good prompts now failing. Common causes:- Model improvements changing behavior
- Prompt exploiting previous quirks
- Overfitting to old model version
- Simplify and clarify prompts
- Remove workarounds for old issues
- Test and update prompt library
Quick diagnostic checklist
When something’s not working, check:- Context: Is it clear who you are and what you need?
- Specificity: Are instructions precise or vague?
- Format: Did you specify how you want output?
- Scope: Is the AI looking at the right things?
- Constraints: Are there clear boundaries?
- Priority: Does the AI know what matters most?
Iterative refinement
When a prompt isn’t working, resist the urge to rewrite it from scratch. Small, targeted adjustments are faster and more effective.Step 1: Identify the gap
Run your prompt and compare the output to what you wanted. Ask: is the problem with context, scope, format, or specificity?Step 2: Make one change at a time
Adjust a single element per iteration so you can tell what fixed it:Step 3: Save what works
Once you land on a prompt that delivers, save it to your prompt library with notes on what you changed and why.Emergency fixes
-
When nothing works
-
When pressed for time
-
When results are dangerous
Prevention strategies
- Test early and often Don’t wait until the big deal to discover your prompt doesn’t work.
- Document what works When you find a good prompt, save it immediately with notes on why it worked.
- Build gradually Start simple and add complexity only as needed.
- Stay updated AI models evolve. What worked last month might need adjustment today.