Simon Willison argues that coding agents become trustworthy when you stop reviewing their code line by line and start demanding proof: red-green TDD, runtime smoke tests, conformance suites, and sandboxed execution. The shift from human review to automated verification is what makes agent autonomy viable.
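
To make "demanding proof" concrete, here is a minimal sketch of a red-green check in Python. It is illustrative only and is not taken from Willison's post: `slugify`, `slugify_stub`, `passes`, and `red_green` are hypothetical names chosen for this example. The idea is that a test counts as evidence only if it has been seen to fail against a stub (red) before it passes against the real implementation (green).

```python
import unittest


def slugify_stub(title: str) -> str:
    """Hypothetical placeholder: the state of the code before the agent implements it."""
    raise NotImplementedError


def slugify(title: str) -> str:
    """Hypothetical implementation: lowercase words joined by hyphens."""
    words = "".join(c.lower() if c.isalnum() else " " for c in title).split()
    return "-".join(words)


def make_suite(impl):
    """Build a test suite bound to the implementation under test."""

    class SlugifyTest(unittest.TestCase):
        def test_basic(self):
            self.assertEqual(impl("Hello, World!"), "hello-world")

    return unittest.defaultTestLoader.loadTestsFromTestCase(SlugifyTest)


def passes(impl) -> bool:
    """Run the suite against impl and report whether every test succeeded."""
    result = unittest.TestResult()
    make_suite(impl).run(result)
    return result.wasSuccessful()


def red_green(stub, impl) -> bool:
    """True only if the suite fails against the stub (red) and passes against impl (green)."""
    return (not passes(stub)) and passes(impl)


if __name__ == "__main__":
    print("red-green verified:", red_green(slugify_stub, slugify))
```

The red step is what guards against a test that passes vacuously: if the suite also passes against the stub, `red_green` returns `False` and the green result proves nothing.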