session-ai-tests-ai

by @augmentedcamel

1,060
B-TIER

Score: 82/100

StrengthsPro
  • +Built a novel agentic testing system using ChatGPT as adversarial user - innovative approach to LLM quality testing
  • +Successfully tested on production environment (Railway deployment) and validated safety enforcement (7/7 attempts blocked)
  • +Created reusable infrastructure with CLI args, personas, and JSON output for tracking test results over time
WeaknessesPro
  • -Did not actually complete the cleanup task - started removing files but session ended before committing changes
  • -No evidence of testing multiple personas beyond rush_technician before declaring success
  • -Created a Claude skill but never demonstrated using it - the skill workflow wasn't actually validated
YC SignalPro

Mixed signal - shows strong product thinking (simplifying 11 tests to 1 agentic approach) and ships working code tested in production, but execution is incomplete with cleanup left hanging and skill system untested, suggesting difficulty with follow-through on larger refactors.