What Are the Best AI Skills for “evals”?
6 skills tagged “evals”, each forged from a YouTube creator's methodology.
Nick Nisi Harness Engineering for AI Agents
Ship reliable AI-agent pipelines by replacing trust with cryptographic evidence, state-machine enforcement, and failure-driven memory — so your agents stop lying and start proving.
31 May 2026AI EngineerSchmid Agent-Ready Engineering Framework
Diagnose and fix the five specific mindset and architecture gaps that cause experienced engineers to build unreliable AI agents, then redesign your agent system so it is production-ready.
30 May 2026Hetzel Agent Observability Differentiation Framework
Accurately diagnose whether a given AI agent system requires traditional observability tooling, agent-specific observability, or both — and design the right observability stack accordingly.
29 May 2026AI EngineerHetzel Eval Maturity Phases Framework
Apply a structured, stage-by-stage methodology to design and mature your LLM/agent evaluation system — from first vibes to production flywheels — so your agent reaches production with measurable, defensible quality.
27 May 2026AI EngineerHetzel Agent Team Composition Framework
Design the right cross-functional team structure for building production-grade agentic AI applications by correctly positioning data scientists, engineers, and domain experts.
26 May 2026Hetzel Agent Team Composition Framework
Design the right cross-functional team mix to build production-ready agentic AI systems by applying Phil Hetzel's diagnostic for who should own, build, and evaluate agents in your organisation.
25 May 2026