Platform Audit
Infrastructure · Reliability · Performance · Observability
March 2026 · 65 cities · 100+ microservices · 50B+ rows analyzed
Driver Accept Rate
191K accepts out of 7.3M daily offers
97.4% of ride requests go unanswered
The data exists (5.82B rows) but isn't surfaced with diagnostic granularity.
Estimated impact of fixing funnel leaks: 10-15K incremental rides/week
56% of searches drop off — but we have zero stage-level visibility into where or why
Wait time is #1 cancellation reason (126K/week) — but no actual wait time data is measured
341 Kafka topics · 407 consumers · Zero pipeline visibility
Ingesting 450 GB/day — app log retention only 2 days
Filter health checks + sample istio at 10% = 89% storage reduction
One service consumes 86% of all monitoring capacity
Root cause: route_id × num_stops × hour × le histogram buckets
Fix: remove route_id from histogram labels
No driver economics dashboard exists. No automated response to fraud detections. No payment collection pipeline visibility.
+ 30+ alert rules defined · ~4,600 new series (0.05% of TSDB — negligible)
This audit was done by Claude in 3 days. The fixes can be deployed the same way — using Claude Max sessions as force multipliers.
How this audit was done: 403 tasks across 16 manifests, 12 reports, 13 findings, 8 PRs with 14K+ lines of production code — all generated by Claude Max fleet sessions in ~72 hours.
Total: ~26 Claude sessions over 5 days — human effort: ~2-3 hours/day of review and approvals
Proof of concept: This entire audit — 403 tasks, 12 reports, 8 PRs, 14K lines — completed by Claude in 72 hours.
| Session | Task | Parallel? | Human |
|---|---|---|---|
| Fleet ×5 | Infra monitoring restoration | Yes | Approve infra |
| Fleet ×7 | Dashboard import + alert wiring | Yes | Verify accuracy |
| Fleet ×8 | PR review comment resolution | Yes | Merge approval |
| Fleet ×3 | Revenue fixes (FRFS, BAP 4xx, PgBouncer) | Yes | Staging sign-off |
| Single | Cross-PR conflict detection + merge sequencing | Sequential | Deploy ordering |
| Single | OpenSearch log sampling config | Yes | Approve policy |
| Single | Driver acceptance materialized view + dashboard | Yes | Validate logic |
| Overnight | Go-Home failure RCA + fix | Yes | Morning review |
~26 Claude sessions over 5 days, mostly parallel. Human effort: ~2-3 hours/day of review.
Step 1: Assign P0 owners today
Step 2: Spin up Day 1 Claude fleet (5 infra sessions) tomorrow
This audit: 403 tasks · 12 reports · 8 PRs · 14K lines of code — generated by Claude Max in 72 hours