How a competitive price intelligence agency migrated to Databay residential proxies to scrape 5 million product SKUs daily across 40 retailers, cutting infrastructure cost 58% while improving success rate to 99.1%. Names anonymized by mutual agreement.
The Customer
Customer name and specific retailer targets are anonymized at the agency's request. Figures below are rounded to the nearest reportable range.
The Challenge
- Success rate collapse. Datacenter IPs were being blocked at increasing rates on the major retail targets, especially on platforms running Akamai Bot Manager and Cloudflare's advanced bot protection. The effective success rate dropped from 92% to roughly 67% over six months.
- Quality degradation on price data. Datacenter IPs frequently got dynamic pricing treatment different from real consumer IPs. They saw generic catalog prices instead of region-specific localised pricing. That corrupted the core data deliverable to clients.
- Cost inflation from retries. High failure rates meant the agency was retrying each target 3-5 times. That tripled effective bandwidth cost and doubled infrastructure spend on worker instances.
The engineering team concluded that ISP-registered residential IPs were the only viable path to sustainable scaling at their target volume. They were still hesitant about the headline per-GB cost of residential compared to their existing datacenter baseline.
The Evaluation
- Success rate on anti-bot protected targets. Tested against five retailer platforms running Akamai, Cloudflare Bot Management, DataDome, and PerimeterX.
- City-level geo-targeting accuracy. Localised pricing needs real consumer IPs in specific metropolitan areas, not just country-level.
- Session continuity. Multi-step checkout scraping (add-to-cart to observe final price with taxes and shipping) needs sticky sessions of 15-30 minutes.
- Per-GB effective cost including retries. Raw per-GB rate matters less than the cost per successfully-collected SKU.
- API and usage analytics. Production monitoring, team access management, and usage forecasting.
Databay was one of two finalists. During pilot testing against the five target retailers, Databay's residential network returned a 99.1% success rate over 50,000 pilot requests. City-level targeting accuracy verified at 97% of requests. Sticky sessions were natively supported up to 120 minutes, comfortably longer than the 30-minute maximum the scraping workflows needed.
The Implementation
- Week 1: Route 10% of production traffic through Databay gateway for two lower-risk retailer targets. Monitor success rate, latency, and data accuracy against the production baseline.
- Week 2: Expand to 50% of production traffic across all 40 retailers. Begin decommissioning previous proxy contracts.
- Week 3: Complete migration to 100% Databay traffic. Full cut-over.
Technical integration was clean because both providers expose a similar backconnect gateway with geo-targeting in the proxy username. The agency's existing worker code (Python Scrapy with custom middleware) needed changes in one configuration file: two environment variable updates. Sticky session logic was adjusted to use Databay's session-id parameter syntax.
The agency opted for the Databay Enterprise tier (1 TB+ commitment) at $0.65/GB, reflecting their projected 4-5 TB monthly residential bandwidth consumption.
The Results
- Success rate improved from 67% to 99.1%. First-attempt success rate on anti-bot protected targets rose sharply. Retry overhead dropped by roughly 80%.
- Infrastructure cost decreased 58% net. The higher per-GB residential cost was more than offset by eliminated retry overhead and reduced worker instance count. Total monthly proxy + compute spend dropped from roughly $42,000 to $17,500.
- Localised pricing accuracy improved from 78% to 94%. City-level residential IPs surfaced region-specific pricing that datacenter IPs were missing, improving core data product quality.
- SKU coverage capacity grew from 5 million to 11 million daily within the same budget. The cost efficiency gain let the agency expand coverage to new retailers without extra spend.
- Client satisfaction score rose from 7.8 to 9.2 (agency's internal quarterly client survey, 10-point scale).
The lead engineer noted that the main driver of savings was not the proxy rate itself. It was the cascading efficiency gains from higher success rates: fewer retries, fewer worker instances, less human QA time catching bad data, and measurably better data quality for clients.
Key Takeaways
- Total cost matters more than per-GB rate. A higher-quality proxy that eliminates retries and improves data accuracy often pays for itself several times over in reduced compute and QA overhead.
- City-level geo-targeting materially affects e-commerce data quality. Many retailers show region-specific pricing that country-level proxies miss. For competitive intelligence specifically, this quality gap matters more than scraping volume.
- Sticky sessions are essential for checkout-flow price observation. Static pricing scraping can use rotating proxies. Observing final prices with taxes, shipping, and regional adjustments needs session continuity.
- Pilot against specific targets. Residential proxy pool quality varies by provider. Always pilot against your actual production targets before committing to a volume tier.
- Measure first-attempt success rate, not just overall success. A 67% first-attempt with 92% eventual success is operationally very different from 99% first-attempt with 99.5% eventual success.
About Databay Residential Proxies
- City-level, ZIP-code, GPS-coordinate, and ASN targeting at no extra charge
- Rotating sessions for broad IP diversity
- Sticky sessions up to 120 minutes for checkout-flow observation
- HTTP, HTTPS, and SOCKS5 protocols
- Unlimited concurrent connections and no bandwidth throttling
- Pay-as-you-go pricing from $2.75/GB with Enterprise tier at $0.65/GB for 1TB+ commitments
Learn more: Residential Proxies · Residential Pricing · Complete E-commerce Price Monitoring Guide
Case study results are specific to this customer's workload, targets, and implementation. Your mileage may vary depending on specific retail targets, scraping architecture, and geo-targeting mix. All customer names and specific retailer targets are anonymized at the customer's request.
