1 skill tagged “open-source-evals”, each forged from a YouTube creator's methodology.
Design and deploy community-scale agentic evaluation systems that are transparent, unsaturatable, and accessible to non-expert contributors — not just AI labs.