import Prerequisites from "/snippets/standard-prerequisites.mdx"
import SkillsCredentials from "/snippets/skills-credentials.mdx"
The SRE skill turns AI agents into expert SRE investigators. It provides:
- Investigation methodology: Hypothesis-driven debugging with systematic triage using Golden Signals and USE/RED methods
- APL query patterns: Reference materials for constructing effective Axiom queries
- Memory system: Persistent storage that learns from debugging sessions, capturing what works and what doesn't
- API helpers: Scripts for querying Axiom directly from the agent
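To make the RED method mentioned above concrete, here is a minimal sketch that summarizes Rate, Errors, and Duration for one time window. It is not the skill's own code: the `Request` type, the `red_summary` function, the choice of 5xx as the error condition, and the nearest-rank p95 are all assumptions made for illustration.

```python
import math
from dataclasses import dataclass


@dataclass
class Request:
    """Hypothetical record of one served request."""
    status: int         # HTTP status code
    duration_ms: float  # time taken to serve the request


def red_summary(requests: list[Request], window_seconds: float) -> dict[str, float]:
    """Summarize Rate, Errors, and Duration for one time window."""
    if not requests:
        return {"rate_per_s": 0.0, "error_ratio": 0.0, "p95_ms": 0.0}
    durations = sorted(r.duration_ms for r in requests)
    # Nearest-rank percentile: the smallest sample with at least
    # 95% of all samples at or below it.
    rank = math.ceil(0.95 * len(durations))
    # Assumption: only 5xx responses count as errors.
    errors = sum(1 for r in requests if r.status >= 500)
    return {
        "rate_per_s": len(requests) / window_seconds,
        "error_ratio": errors / len(requests),
        "p95_ms": durations[rank - 1],
    }
```

A sudden change in any of the three values for a service is a reasonable starting point for a hypothesis, which the agent can then test with more specific queries.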
Use SRE skill
The SRE skill activates automatically when you ask your AI agent about:
- Incident response and debugging
- Root cause analysis
- Log investigation
- Production troubleshooting
Example prompts:
- "Investigate why API latency increased in the last hour"
- "Find the root cause of the 500 errors in production"
- "Analyze error patterns in the logs dataset"
Memory system
The SRE skill learns from every debugging session. The memory system initializes automatically on first use. Memory persists at ~/.config/amp/memory/axiom-sre/ (global) or .agents/memory/axiom-sre/ (project/local).
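If you want to inspect or back up the memory from your own scripts, the two locations above can be resolved with a small helper. This is a sketch under an assumption: the source does not say which location takes precedence, so the helper below prefers the project directory when it exists. The name `resolve_memory_dir` is hypothetical.

```python
from pathlib import Path

# Locations taken from the text above.
GLOBAL_MEMORY = Path("~/.config/amp/memory/axiom-sre").expanduser()
PROJECT_MEMORY = Path(".agents/memory/axiom-sre")


def resolve_memory_dir(project_root: Path) -> Path:
    """Return the project-local memory directory if present, else the global one.

    Assumption: project memory wins over global memory. The skill itself
    may resolve the two locations differently.
    """
    local = project_root / PROJECT_MEMORY
    return local if local.is_dir() else GLOBAL_MEMORY
```

For example, `resolve_memory_dir(Path.cwd())` returns the directory a script would read when run from the project root.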
Tell the agent to remember things
- "Remember this for next time."
- "Save this query, it worked."
- "Add to memory: the orders team uses #orders-oncall."
Learn automatically
The SRE skill automatically captures what works and what doesn't when:
- A query or approach finds the root cause.
- You correct the agent. The skill records both what didn't work and what did.
- A debugging session completes successfully.
Customize with your own knowledge
Customize the SRE skill by adding your own knowledge to the knowledge base files:
- kb/facts.md: Team contacts, Slack channels, conventions.
- kb/integrations.md: Database connections, API endpoints.
- kb/patterns.md: Failure patterns you've seen before.
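You can edit these files by hand, or seed them from a script. The sketch below appends one entry to kb/facts.md. It rests on two assumptions not confirmed by the source: that the kb/ folder lives inside the memory directory, and that entries are plain Markdown bullets. The function name `add_fact` is hypothetical.

```python
from pathlib import Path


def add_fact(memory_dir: Path, fact: str) -> Path:
    """Append one bullet to kb/facts.md, creating the file if needed.

    Assumption: kb/ sits directly under the memory directory and
    entries are written as Markdown list items.
    """
    facts_file = memory_dir / "kb" / "facts.md"
    facts_file.parent.mkdir(parents=True, exist_ok=True)
    with facts_file.open("a", encoding="utf-8") as handle:
        handle.write(f"- {fact.strip()}\n")
    return facts_file
```

For example, `add_fact(Path(".agents/memory/axiom-sre"), "The orders team uses #orders-oncall.")` would record the same fact as the memory prompt shown earlier, under the assumptions stated above.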