+ FIELD NOTES
Blog
Findings, methodology, and the occasional teardown. Technical writing on what our agents discover and how we measure them.
Why most security benchmarks lie to you
Leakage, gameable heuristics, and stale targets quietly inflate every number you read. Here's how we design probes that resist all three.
An agent found an SSRF chain we didn't plant
While validating a web target, one of our discovery agents surfaced a server-side request forgery path that wasn't in the ground-truth set. A short teardown.