5vs of Big data to a non-IT person
The 5vs, nowadays seems so fancy like the big 5 (laughs inside).
But genuinely whoever came up with it, made the definition of big data quite clear.
So here they are…
*Volume, Velocity, Variety, Veracity, Value*
So lets get ahead…
Imagine you getting a bunch of homework all during the same day from different subjects with soo many worksheets in each subject…
Just imagine all of it is happening on the same day. Now think about your stress levels, that irritation and the speed in which the teachers give the homework at, the amount of work load, the variety of subjects, and some teachers might even end up giving you worksheets that are already preexisting in other websites, without them putting the effort for you, and when they say to submit/return all the homework’s and worksheets with a one day deadline, what quality can they expect from you ?.
Seems problematic and tough to your brain right?
Well that’s exactly what’s happening with big data too.
where the homework’s and worksheets are the data, their abundance or too many homework’s indicate the VOLUME, which is actually huge. (Same like how its unacceptable by your brain, the system finds it difficult to hold that much data too)
The homework’s in different subjects indicate the variety. (Huge data coming in different VARIETY-> Too many homework’s from different subjects)
All the teachers gave the homework’s soo quickly on the same day, VELOCITY. (Indicates the speed in which the data is received)
Many teachers copy pasted worksheets from websites, there’s no quality or trust, as they aren’t putting the effort they needed too (this indicates VERACITY, are you actually able to trust and validate the data you received ?)
When all the submissions are put on the same day can you actually do them and finish them within the time frame?, many of us can’t and then we mostly go for copying others works. So the VALUE you created for your work here, is it good?
Note: This situation was totally hypothetical
That’s exactly what the 5vs are trying to explain.
- Volume, the amount of data
- Velocity, how often new data is created and needs to be stored or how fast the data is being updated.
- Variety, how heterogeneous data types are
- Veracity, the “truthiness” or “messiness” of the data
- Value, the significance of data
I hope it helped you. Thanks.