Published in


What Is Big Data?

Understanding Big Data

Photo by Markus Spiske on Unsplash

Big Data is a term used to describe the collection and analysis of large amounts of data in order to provide a better understanding of the world around us. The term is often used in conjunction with other terms such as “big data” and “data analytics”. Big data can be thought of as data that is too large to fit on a single hard drive, but too small to be processed by a traditional data processing system.

In other words, big data is the ability to collect and analyze a large amount of information in a short period of time. This is accomplished through the use of a variety of techniques, including data mining, machine learning, artificial intelligence, and data warehousing.

What are the different types of Big Datasets?

There are a number of different kinds of big datasets, each of which has its own strengths and weaknesses. For example, there are data sets that are very large in size and require a lot of processing power to process.

There are also large datasets that have a high degree of redundancy, meaning that the information is stored in multiple locations, making it more difficult to access. Finally, some datasets are large because they contain a great deal of unstructured data, which makes it difficult for a data scientist to analyze and interpret the data.

How do I know if my data set is big enough?

The first thing you need to do is determine the size of your dataset. To do this, you can use a tool like BigQuery or Hadoop to query your data and find out how big it is. You can also use tools like Databricks to create your own custom datasets. Once you have determined your size, the next step is to determine what kind of analysis you want to perform on it. If you are interested in data science, this is where you should start.

However, if you’re just looking to get a feel for what data analysis is all about, it’s best to start with a small dataset and work your way up to a larger one. It’s also important to note that there is no right or wrong answer when it comes to choosing the right type of dataset for your needs.

As long as you know what your goals are and how you plan to achieve them, then you’ll be able to make an informed decision about which dataset is right for you.




We curate and disseminate outstanding articles from diverse domains and disciplines to create fusion and synergy.

Recommended from Medium

Natasha Caruana in conversation with British Council

Apples and oranges.

Tableau Charts Cheat Sheet for Data Science Part 1

Watermarking in Spark Structured Streaming

Data Wars — Epilogue

READ/DOWNLOAD$! Microsoft Excel 2016 Step by Step

Machine Learning Etc. Starter Guide

Opinion: Automated ML is a key tool for small emerging data science teams

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store
Arda Œ

Arda Œ

More from Medium

So You Want to Work with Data?

Reflecting on the Data Analysis Process

Get the AWS Data Analytics Cert in 3 Weeks