Big Data

Several definitions exist for Big Data, here are the ones I prefer:

  1. A collection of data sets so large and complex that it becomes difficult to process using on-hand database management tools or traditional data processing applications.
  2. When the data could not fit in Excel (used to be 65,536 lines, now 1,048,577 lines).
  3. When it's cheaper to keep everything than spend the effort to decide what to throw away.

Introduction to Big Data

Let's start by a quick introduction to Big Data.

I tried to summarize the major big data technologies in one slide.

Ready to Go, you can start with installing a platform and start playing:

You can even try this tutorial to predict flights delays.

All you wanted to know about big data!

A more complete and technical big data presentation is available and used for training, see the consulting section.

Case Studies

Panel discussing data as the new oil for business (in French).

In december 2012, I presented at EyeForTravel Smart Travel Analytics conference, some proof of concepts made around data collected in near real time and analysed to provide actionable insights.