Big data: as the name suggests, it is data that is huge. What we need to know is which data actually reaches that "big" level.
#What is Big Data?
According to IDC (International Data Corporation), data counts as big data based on three parameters: volume, velocity and variety.
Volume refers to the size of the data. E.g.: the millions of tweets posted every day.
Velocity refers to how fast you need the analysis of that data. E.g.: during an IPL (cricket) match you want to display which team has more tweets, so the system has to query by a team hashtag such as #CSK while the match is still on.
Variety refers to the types of data you have: text, image, audio, sensor information, video, or any mixture of those. E.g.: Facebook processes your status messages (text), photos (images) and videos. The data may be structured or unstructured.
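The velocity example (which team has more tweets?) can be sketched in a few lines of Python. This is only an illustration: the sample tweets, the `TEAM_TAGS` set and the `count_team_tags` function are made up for this post, and a real system would read from a live stream instead of a small list.

```python
from collections import Counter

# Hypothetical sample tweets; a real system would read from a live stream.
tweets = [
    "What a catch! #CSK",
    "Great over from the bowlers #MI",
    "Whistle podu #CSK #IPL",
    "Come on #MI",
    "Dhoni finishes it off #CSK",
]

# Assumed set of team hashtags we care about.
TEAM_TAGS = {"#CSK", "#MI"}


def count_team_tags(messages, team_tags):
    """Count how many tweets mention each team hashtag."""
    counts = Counter()
    for text in messages:
        # Collect the hashtags of one tweet in a set, so a tag repeated
        # inside the same tweet is counted only once.
        tags = {
            word.rstrip(".,!?:;").upper()
            for word in text.split()
            if word.startswith("#")
        }
        # Keep only the team hashtags and add them to the totals.
        counts.update(tags & team_tags)
    return counts


team_counts = count_team_tags(tweets, TEAM_TAGS)
print(team_counts.most_common())
```

With real tweet volumes the counting idea stays the same; the hard part is doing it fast enough, which is exactly what velocity is about.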
According to SAS (another company), variability and complexity are two more key terms.
Variability refers to how much the size of the data, and the time at which we receive it, can vary. It is more like trending data.
Complexity is close to the variety term in the IDC definition. Data comes from multiple sources in multiple formats, and we need to process the relevant information rather than blindly joining everything together.
#Where is it used?
It is used by organisations whose data has the properties described above, such as Google, Facebook and Twitter. We can't decide what is big data based on size alone: 1 terabyte is a huge amount of data for one organisation, while for another organisation even 1 GB is huge.
#How is it used?
Just defining big data is not worth much. The point of the definition is to run analytics over that huge amount of data, come out with new information, and put forward new ideas for developing the organisation.
To do this, big data is combined with the Hadoop framework (a post on it is coming soon), which processes the data and yields the results.
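Hadoop processes data in the MapReduce style: a map step emits key/value pairs, the pairs are grouped by key, and a reduce step combines each group. The sketch below imitates those steps in plain Python on a tiny list. It does not use Hadoop itself, and the names `map_phase`, `shuffle`, `reduce_phase` and the sample records are made up for illustration.

```python
from collections import defaultdict


def map_phase(record):
    """Map step: emit a (hashtag, 1) pair for every hashtag in a record."""
    for word in record.split():
        if word.startswith("#"):
            yield word.upper(), 1


def shuffle(pairs):
    """Group all emitted values by their key."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Reduce step: combine all values of one key into a single total."""
    return key, sum(values)


# Hypothetical input records standing in for a huge data set.
records = ["Whistle podu #CSK", "Come on #MI", "#csk all the way"]

pairs = [pair for record in records for pair in map_phase(record)]
result = dict(reduce_phase(key, values) for key, values in shuffle(pairs).items())
print(result)
```

In a real cluster the map and reduce steps run on many machines in parallel, each working on its own slice of the data.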
"Small data is gone. Data is just going to get bigger and bigger and bigger, and people just have to think differently about how they manage it."