FIELD NOTES

Blog

Findings, methodology, and the occasional teardown. Technical writing on what our agents discover and how we measure them.

Jun 12, 2026 NOTE-001

Why most security benchmarks lie to you

Leakage, gameable heuristics, and stale targets quietly inflate every number you read. Here's how we design probes that resist all three.

benchmarks, methodology


Jun 5, 2026 FIND-002

An agent found an SSRF chain we didn't plant

While validating a web target, one of our discovery agents surfaced a server-side request forgery path that wasn't in the ground-truth set. A short teardown.

findings, agents, web
