Complete hands-on AI challenges — RAG pipelines, agents, evals, MCP servers. Get AI-scored across 6 dimensions. Earn a verified artifact on your public profile.
Pick the skill set you want to prove. Challenges range from easy to hard across six categories.
RAG Pipelines
Retrieval-augmented generation systems
AI Agents
Autonomous multi-step agent loops
MCP Servers
Model Context Protocol integrations
Coding Agents
AI-assisted code generation workflows
Evals & Testing
LLM evaluation frameworks & harnesses
AI Tool Proficiency
Claude Code, Cursor, Copilot mastery
Four steps. Fully evaluated. Permanently yours.
Browse RAG pipelines, agents, MCP servers, evals, and more. Each challenge ships with a real dataset and a scoped LLM key.
Work in your own environment. Submit a public GitHub repo plus a brief decisions doc. We clone and run it against real test inputs.
Our AI evaluates across 6 dimensions. Scores above 88 get human expert review. Your artifact lives on your public profile permanently.
Companies browse verified profiles and reach out directly. No resume needed — your code speaks for itself.
Every submission passes through a multi-layer verification system before a score is finalised.
Personalised datasets
Each submission receives a unique dataset variant seeded per candidate. No two candidates solve the exact same problem, making copy-paste useless.
Timing analysis
We record the moment an LLM key is issued and compare it against the submission timestamp. Suspiciously fast completions are automatically flagged.
Similarity detection
All submissions are embedded using text-embedding-3-small and compared across the challenge history. High cosine similarity triggers instant review.
Human expert review
Every submission scoring above 88 is reviewed by a domain expert from our reviewer network before the final score is confirmed on your profile.
Every submission is evaluated by our AI scoring system on correctness, architecture, decision quality, LLM usage, robustness, and clarity. Scores above 88 get an additional human expert review.
Start a challenge →// Score breakdown
The public leaderboard shows top scores per category. Earn your place and get noticed by hiring teams browsing verified talent.
For engineers
I'm building with AI
For companies
I'm hiring AI talent
Free for candidates. No resume required — just build something real and let the evaluation speak for itself.
Hiring companies can sign up here