The phrase Big Data comes from the computational sciences. Specifically, it describes scenarios where the volume and variety of data overwhelm the existing tools for storing and processing it. Big Data is a term for a collection of data sets so large and complex that it becomes difficult to process using on-hand database management tools or traditional data processing applications.


VOLUME refers to the amount of data being generated. Think in terms of gigabytes, terabytes, and petabytes. Many systems and applications are just not able to store, let alone ingest or process, that much data.

VELOCITY refers to the rate at which new data is generated. Think in terms of megabytes per second or gigabytes per second. Data streams in at unprecedented speed and must be dealt with in a timely manner to extract the maximum value.

VARIETY refers to the number of types of data being generated, from structured records to semi-structured logs to unstructured text, images, and video.
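
To make the link between velocity and volume concrete, the short sketch below converts a sustained ingest rate into the amount of data accumulated per day. The 1 GB/s rate and the 1 PB capacity are hypothetical figures chosen for illustration, the function names are made up for this example, and decimal (SI) units are assumed.

```python
# Decimal (SI) units are assumed: 1 GB = 10**9 bytes, and so on.
GB = 10**9
TB = 10**12
PB = 10**15
SECONDS_PER_DAY = 24 * 60 * 60


def bytes_per_day(rate_bytes_per_second):
    """Volume accumulated in one day at a constant ingest rate."""
    return rate_bytes_per_second * SECONDS_PER_DAY


def days_to_fill(capacity_bytes, rate_bytes_per_second):
    """Days needed to accumulate capacity_bytes at a constant ingest rate."""
    return capacity_bytes / bytes_per_day(rate_bytes_per_second)


if __name__ == "__main__":
    rate = 1 * GB  # hypothetical sustained ingest rate: 1 GB per second
    print(f"{bytes_per_day(rate) / TB:.1f} TB per day")            # 86.4 TB per day
    print(f"{days_to_fill(1 * PB, rate):.1f} days to reach 1 PB")  # 11.6 days
```

Even a modest-sounding rate of 1 GB/s adds up to tens of terabytes a day, which is why velocity and volume are usually discussed together.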

Challenges in Big Data

The main challenges include capture, curation, storage, search, sharing, transfer, analysis, and visualization.

Unstructured Data is Exploding.

A single system or enterprise can generate huge amounts of data, on the order of terabytes and petabytes.