
Tag: data pipeline
From Big Beer to Big Data: Inside AB InBev’s Digital Transformation
With more than 500 beer brands and $55 billion in sales, Anheuser-Busch InBev is already the world's biggest beer company. And if all goes as planned with its digital transformation project, it will be the best beer comp Read more…
Google Doubles Down on Cloud Data Migration
Data integration startups have become prime acquisition targets as cloud analytics vendors look to beef up their migration capabilities. What that in mind, Google Cloud announced this week it intends to acquire data m Read more…
Streamsets Gets $35M for DataOps
StreamSets, which bills itself as the "air traffic control" tasked with preventing collisions from occurring with big data, today announced that it raised $35 million, which it will use to continue building its data oper Read more…
Machine Teaching Will Drive Crowdsourced Cognition into the AI Pipeline
Building high-quality artificial intelligence (AI) is hard work. It’s a specialized discipline that historically has required highly skilled specialists, aka data scientists. Any time you require some highly skilled Read more…
How Disney Built a Pipeline for Streaming Analytics
The explosion of on-demand video content is having a huge impact on how we watch television. You can now binge watch an entire season's worth of Grey's Anatomy at one sitting, if that suits your fancy. For a media giant Read more…
Apache Airflow to Power Google’s New Workflow Service
Apache Airflow, the workload management system developed by Airbnb, will power the new workflow service that Google rolled out today. Called Cloud Composer, the new Airflow-based service allows data analysts and applicat Read more…
How Netflix Optimized Flink for Massive Scale on AWS
When it comes to streaming data, it's tough to find a company operating on a more massive scale than Netflix, which streams more than 125 million hours of TV shows and movies -- per day. Netflix captures billions of pi Read more…
The Cure for Chaos: Automating Your Data Pipeline
The number of SaaS applications companies use has exploded. Most SMBs use at least a dozen, which means customer data are spread across systems and departments. So while customers can engage with businesses through mo Read more…
Orchestrator Emerges to Speed ML Models to Production
As the pace of machine learning model development accelerates, vendors are beginning to offer orchestration tools designed to help data scientists manage the testing, retraining and redeployment of predictive analytics m Read more…
The Top Three Challenges of Moving Data to the Cloud
Most data-driven businesses have already or are looking to move their data from on-premises databases to the cloud in order to take advantage of its unlimited, on-demand storage and compute cycles. Implementing cloud war Read more…
How Pandora Uses Kafka
As a big Hadoop user, Pandora Media is no stranger to distributed processing technologies. But when the music streaming service decided to transition its ad tracking system from a batch-oriented system into a real-time o Read more…