Big Data At Walmart: How The Mind-Blowing 40+ Petabyte Improves Retail Decision-Making

Originally published by Bernard Marr on LinkedIn: Big Data At Walmart: How The Mind-Blowing 40+ Petabyte Improves Retail Decision-Making

Walmart – the world’s biggest retailer with over 20,000 stores in 28 countries, is in the process of building the world’ biggest private cloud, to process 2.5 petabytes of data every hour.

To make sense of all of this information, and put it to work solving problems, the company has recently completed construction of what it calls its Data Café – a state-of-the-art analytics hub located within its Bentonville, Arkansas headquarters.

Here, over 200 streams of internal and external data, including 40 petabytes of recent transactional data, can be modelled, manipulated and visualized. Teams from any part of the business are invited to bring their problems to the analytics experts and then see a solution appear before their eyes on the nerve centre’s touch screen “smart boards”.

This tool has cut down the amount of time it takes to solve complex business questions, which are reliant on multiple external and internal variables, from weeks to minutes.

Senior Statistical Analyst Naveen Peddamail – who won his job with the company through a competition on crowd-sourced data competition website Kaggle – spoke to me about the project.

He said “If you can’t get insights until you’ve analyzed your sales for a week or a month, then you’ve lost sales within that time.

“If you can cut down that time from two or three weeks to 20 or 30 minutes, then that saves a lot of money for Walmart and stopped us losing sales. That’s the real value of what we have built with the data café.”

When bombarded with huge amounts of verified, quantifiable data at high speeds, problems caused by human error or miscalculation at the planning or execution stage of a particular business activity will often simply melt away.

For example Naveen told me about a grocery team who could not understand why sales had suddenly declined in a particular product category. The team came to the café to find out why, and by drilling into the data were quickly able to see that pricing miscalculations had been made, leading to the products being listed at a higher price than they should have been, in some regions.

In another example, during Halloween, sales analysts were able to see in real-time that although a particular novelty cookie was very popular in most stores, there were two stores where it wasn’t selling at all. The alert allowed the situation to be quickly investigated, and it was found that a simple stocking oversight had led to the cookies not being put on the shelves. The company was able to then rectify the situation immediately, avoiding further lost sales.