Each day thousands of companies and more than a million developers rely on Scrapinghub tools and services to extract the data they need from the web. To strengthen its position as a market leader, Scrapinghub recently launched a new product, AutoExtract, that provides customers with AI-enabled, automated web data extraction at scale. Scrapinghub built AutoExtract on Confluent Cloud running on Google Cloud Platform (GCP), with an Apache Kafka®-based, event-streaming backbone for its service architecture. These technologies were chosen to shorten time to market, and to ensure reliability and scalability.
Accelerate the delivery of a next-generation web scraping service, capable of handling growing customer demand with no downtime.
Use Confluent Cloud and Apache Kafka to implement a reliable, scalable event-streaming backbone that links web crawlers with AI-enabled data extraction components.
Severstal utilise la plateforme Confluent pour diffuser des données à partir des sites de production, intégrer des microservices et alimenter des modèles d'apprentissage automatique afin de prédire les problèmes avant qu'ils surviennent.
TiVo travaille avec Confluent afin de mieux gérer et exploiter ses données pour perpétuer son héritage : révolutionner la façon dont les gens trouvent et profitent de la TV, des films et de la musique.
Ticketmaster Leverages Confluent to Reduce Development Friction and Boost Machine Learning.