The Bench
The people who sign off on your code.
Fewer than 5% of applicants pass our screen. Every reviewer is a staff-level or principal engineer, vetted on real production code, and matched to your PR by the domain they know cold.
Marcus T. — Principal Security Engineer · ex-Cloudflare · 12y
What I Look For
AuthN/AuthZ boundaries that shift under load. Most teams get the happy path right and miss the failure modes — token expiry races, privilege escalation through chained calls, SSRF via redirect chains. I also look hard at anything touching crypto primitives. Rolling your own, even slightly, is almost always wrong.
A Problem I Caught
A payments API was signing JWTs with HS256 and storing the secret in an env var that was also printed to structured logs. The logs fed into a third-party analytics pipeline. The key had been rotated zero times in two years. We found it on a Tuesday. They rotated everything by Thursday.
12 years in application and infrastructure security. Previously led security review at a major CDN provider before moving to independent consulting across fintech and healthcare.
“The scariest bugs aren't the ones attackers find first. They're the ones sitting in your logs.”
Priya N. — Staff Engineer, Distributed Systems · ex-DoorDash · 10y
What I Look For
Consistency assumptions that don't survive network partitions. Engineers often model distributed systems as slow local ones — they're not. I check for missing idempotency keys, non-atomic cross-service state updates, and Saga patterns that have no compensating transactions. Retry logic with exponential backoff is another perennial miss.
A Problem I Caught
A checkout service was writing order state to Postgres and then publishing to Kafka in the same function, with no outbox pattern. Under normal conditions: fine. When the broker went down for four minutes during a deploy, they created ~6,000 orders that never emitted an event. Fulfillment never started. Support tickets started around minute seven.
10 years across high-throughput consumer platforms. Built and maintained systems processing millions of events per minute. Now consults on distributed system design and failure mode analysis.
“If your system only works when nothing goes wrong, it doesn't work.”
Jordan K. — Principal Engineer, FinTech · ex-Stripe · 14y
What I Look For
Floating point in money calculations. It never ends well. I also look for race conditions in ledger entries, missing idempotency on payment intents, and webhook handlers that don't dequeue atomically. Regulatory edge cases get missed too — transaction limits, AML triggers, reconciliation logic that only runs nightly.
A Problem I Caught
A subscription billing system was calculating prorated amounts using JavaScript floats. The error was fractions of a cent per charge — invisible in testing, but across 200k monthly subscribers it was accumulating roughly $800/month in rounding drift that didn't match their payment processor. Nobody noticed for eight months.
14 years building payment infrastructure at scale. Has worked on acquiring, issuing, and embedded finance products. Deep background in PCI-DSS compliance and financial reconciliation systems.
“Money is the one domain where being almost right is just wrong.”
Simone R. — Staff ML Engineer · ex-Databricks · 9y
What I Look For
Training-serving skew — the silent killer of ML systems in production. Features computed differently at train vs. inference time can tank model quality with zero error logs. I also catch data leakage in feature pipelines, non-reproducible training runs, missing model versioning, and prompt injection vectors in LLM-integrated systems.
A Problem I Caught
A fraud model was trained with normalized transaction amounts using the training set's min/max. In production, the same normalization was recomputed nightly against the last 30 days of live data. As transaction volumes grew, the normalization window shifted, and the model quietly degraded over three months. Fraud rates climbed 18% before anyone correlated it.
9 years in ML platform engineering and applied ML. Previously at a major data lakehouse company. Now focuses on production ML systems, LLM integration patterns, and AI safety for high-stakes applications.
“Your model's accuracy in a notebook means nothing. What does it do on a Tuesday at 3am with stale features?”
David L. — Staff Engineer, Infrastructure · ex-HashiCorp · 13y
What I Look For
IAM policies that are too permissive, secrets in environment variables that propagate to child processes, and Terraform state that's a single point of failure. I also look at blast radius: how bad is the worst-case deploy? Most teams haven't modeled it. Observability gaps are another recurring theme — you can't debug what you can't see.
A Problem I Caught
A team was using a single AWS IAM role across all Lambda functions in their account. One function had a dependency with a known path traversal CVE. An attacker could have read any secret in Secrets Manager, started or stopped any EC2 instance, and enumerated the entire account. The fix was a week of IAM decomposition — the discovery was a fifteen-minute code review.
13 years in infrastructure engineering and DevOps. Previously worked on enterprise infrastructure tooling. Deep expertise in cloud security posture, IaC review, and incident response architecture.
“Principle of least privilege is a principle, not a checkbox. Most systems I review have never actually applied it.”
Anya M. — Principal Engineer, API Design · ex-Twilio · 11y
What I Look For
Pagination that breaks under concurrent writes, rate limiting that doesn't account for distributed callers, and versioning strategies that painted the team into a corner six months ago. I also catch missing idempotency on mutating endpoints, over-fetching that'll become a performance cliff at scale, and error response shapes that leak implementation details.
A Problem I Caught
An API was using cursor-based pagination where the cursor was a base64-encoded Postgres offset. The client was long-polling a high-write table. As rows were inserted, offsets shifted mid-page — callers were silently skipping records and occasionally seeing duplicates. It was catastrophic for the downstream sync job that assumed completeness.
11 years designing and scaling APIs at developer platforms. Built API infrastructure used by hundreds of thousands of developers. Now consults on API design, backwards compatibility strategy, and developer experience.
“An API is a contract. Most teams ship the happy path and call it a contract. It isn't.”