Case study: real-time competitive intelligence for a trading fintech

Challenge

A fintech client (Series B, trading platform for retail crypto and forex traders) competed on one fundamental metric: how quickly their UI displayed current pricing data from competing venues. Their research showed that traders deciding whether to buy or sell watch 3–4 platforms at once, and whichever platform shows pricing first retains the user's attention.

State in 2024: their scraping pipeline ran Selenium behind datacenter proxies and refreshed every 5–10 minutes, with a success rate under 75% because Cloudflare-protected venues blocked its requests. In practice the client displayed prices 4–6 minutes behind its largest competitors. Every minute of delay was a measurable hit to user retention.

The internal team attempted an upgrade twice. The first attempt (residential proxies + Playwright) raised the success rate to 89% but did not change latency, because too many retries ran synchronously. The second attempt (a rewrite on Go workers + Kafka) increased throughput but did not solve anti-bot detection during peak hours.

Approach

We designed a real-time architecture with three layers of parallelism: spatial (14 venues scanned simultaneously), temporal (a dedicated worker pool per venue, refreshing every 25–30s), and redundant (every critical pair scraped from 3 venues and cross-validated).
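To illustrate the cross-validation idea, here is a minimal sketch, not the production code. It assumes the consensus price is the median of the venue quotes and that a quote is suspect when it deviates from that median by more than a tolerance. The function name `cross_validate`, the venue names, and the 0.5% tolerance are all hypothetical.

```python
from statistics import median


def cross_validate(quotes: dict[str, float], max_deviation: float = 0.005) -> tuple[float, list[str]]:
    """Return (consensus price, venues flagged as outliers) for one trading pair.

    quotes maps a venue name to the last price scraped from it.
    max_deviation is an assumed relative tolerance (0.5%), not a project figure.
    """
    if not quotes:
        raise ValueError("need at least one quote")
    # With three venues, the median ignores a single bad reading.
    consensus = median(quotes.values())
    outliers = [
        venue
        for venue, price in quotes.items()
        if abs(price - consensus) / consensus > max_deviation
    ]
    return consensus, sorted(outliers)


if __name__ == "__main__":
    price, suspect = cross_validate({"venue_a": 100.0, "venue_b": 100.1, "venue_c": 93.0})
    print(price, suspect)
```

With three sources per pair, one venue can return a stale or corrupted price without affecting the value that reaches the client.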

Critical decisions: a residential and mobile proxy pool for peak hours (when datacenter proxies stop being effective), a browser farm with 60–80 concurrent sessions per venue, a dedicated fingerprint pool per venue (each venue gets its own anti-bot tuning), and a real-time event stream to the client via Kafka.

Architecture: Temporal as the orchestrator (continuous workflows, not batch), a Playwright pool in Kubernetes that autoscales on a latency metric, ClickHouse for time-series price storage, and a custom WebSocket gateway for real-time delivery to the client UI. Persistence is double-buffered: the primary path writes to Redis (sub-millisecond reads), the secondary to Postgres for historical analytics.
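Autoscaling on a latency metric can be sketched with the proportional rule that the Kubernetes Horizontal Pod Autoscaler documents: desired replicas = ceil(current replicas × observed metric / target metric). The sketch below is a simplification under stated assumptions. It omits the controller's tolerance band and stabilization window, and the 2-second target and the replica bounds are hypothetical values, not the project's configuration.

```python
import math


def desired_replicas(
    current_replicas: int,
    observed_latency_s: float,
    target_latency_s: float,
    min_replicas: int = 1,
    max_replicas: int = 80,
) -> int:
    """Proportional scaling rule, clamped to [min_replicas, max_replicas].

    All default bounds are illustrative assumptions.
    """
    if target_latency_s <= 0:
        raise ValueError("target latency must be positive")
    raw = math.ceil(current_replicas * observed_latency_s / target_latency_s)
    return max(min_replicas, min(max_replicas, raw))


if __name__ == "__main__":
    # Latency is 1.5x the target, so the pool grows by the same factor.
    print(desired_replicas(20, observed_latency_s=3.0, target_latency_s=2.0))
```

Scaling on latency rather than CPU ties pool size to the number the client actually cares about, at the cost of needing a trustworthy latency signal per venue.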

Anti-detection: per-venue persona engineering. Every worker has a stable identity (browser fingerprint, IP geography, user agent, cookies) that is preserved for 6–12 hours and rotated only when the CAPTCHA rate exceeds a threshold. Behavioural simulation is tuned per venue to typical user patterns on that platform.
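One possible reading of this rotation policy, as a hedged sketch: a persona is retired when it reaches its maximum lifetime, and is rotated earlier only if its observed CAPTCHA rate crosses a threshold. The `Persona` class, the 5% threshold, and the minimum sample count are hypothetical; only the 6–12 hour lifetime comes from the text above.

```python
from dataclasses import dataclass


@dataclass
class Persona:
    """Counters for one worker identity on one venue (illustrative only)."""

    venue: str
    created_at: float                # unix seconds
    max_age_s: float = 12 * 3600     # upper end of the 6-12 hour window
    requests: int = 0
    captchas: int = 0

    def record(self, saw_captcha: bool) -> None:
        self.requests += 1
        if saw_captcha:
            self.captchas += 1

    def captcha_rate(self) -> float:
        return self.captchas / self.requests if self.requests else 0.0


def should_rotate(
    persona: Persona,
    now: float,
    captcha_threshold: float = 0.05,  # assumed value
    min_samples: int = 20,            # assumed value; avoids reacting to noise
) -> bool:
    """Rotate at end of life, or early if the CAPTCHA rate is too high."""
    if now - persona.created_at >= persona.max_age_s:
        return True
    return persona.requests >= min_samples and persona.captcha_rate() > captcha_threshold


if __name__ == "__main__":
    p = Persona(venue="venue_a", created_at=0.0)
    for i in range(20):
        p.record(saw_captcha=(i < 2))
    print(should_rotate(p, now=100.0))
```

Requiring a minimum number of requests before acting on the rate keeps a single early CAPTCHA from discarding an otherwise healthy identity.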

Outcome

Pricing lead time vs largest competitors: 7 minutes ahead on average (the client shows prices 7 minutes before competitors do). Measured by a third-party benchmarking service over 6 months.

Success rate aggregated across 14 venues: 97.8% (vs under 75% at baseline). End-to-end P95 latency (from venue source to client UI): 1.8 seconds.

User retention metric (return-within-7-days for active traders): +28% post-deployment. The client attributes this to the lead-time advantage, based on a controlled rollout.

The system has run 24/7 for 18 months at 99.7% uptime. There were two major outages in that period; both recovered in under 30 minutes thanks to multi-venue redundancy (if venue X is down, cross-validation from the 2 others covers the gap).
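The failover behaviour described here can be sketched as follows, under assumptions: a venue counts as "down" when its latest quote is older than a staleness limit, and the served price is the median of whatever fresh quotes remain. The `Quote` type, the function name, and the 60-second limit are hypothetical.

```python
from dataclasses import dataclass
from statistics import median
from typing import Optional


@dataclass(frozen=True)
class Quote:
    venue: str
    price: float
    fetched_at: float  # unix seconds


def price_with_failover(quotes: list[Quote], now: float, max_age_s: float = 60.0) -> Optional[float]:
    """Median of the quotes that are still fresh, or None if every venue is stale.

    max_age_s is an assumed staleness limit, not a figure from the project.
    """
    fresh = [q.price for q in quotes if now - q.fetched_at <= max_age_s]
    return median(fresh) if fresh else None


if __name__ == "__main__":
    now = 1_000.0
    quotes = [
        Quote("venue_a", 100.0, fetched_at=now - 10),
        Quote("venue_b", 102.0, fetched_at=now - 20),
        Quote("venue_x", 50.0, fetched_at=now - 600),  # venue X is down, quote is stale
    ]
    print(price_with_failover(quotes, now))
```

Returning `None` when nothing is fresh lets the caller decide whether to show the last known price or flag the pair as unavailable.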

Stack

Temporal
Playwright
Kubernetes
ClickHouse
Redis
Kafka
Bright Data residential + mobile
Custom WebSocket gateway
Datadog

Metrics

800+ — Trading pairs monitored
14 — Venues
25–30s — Refresh interval
97.8% — Success rate
1.8s — P95 latency end-to-end
99.7% — Uptime
+28% — User retention lift
