Scaling QA for AI: Lessons from NAF’s Greenfield ML Deployment