Siladitya GhoshTaming the Data Deluge: How Apache Druid Manages Massive Datasets with SpeedIn today’s data-driven world, organizations are constantly bombarded with information. From website clicks and sensor readings to social…4 min read·1 hour ago--
Apache DorisApache Doris for log and time series data analysis in NetEase, why not Elasticsearch and InfluxDB?NetEase has replaced Elasticsearch and InfluxDB with Apache Doris in its monitoring and time series data analysis platforms, respectively…9 min read·May 24, 2024--1
Ihor LukianovA Brief History of Data Management — From Relational Databases to Data LakehousesHow we evolved to modern data management approaches and what should we know as Data Engineers5 min read·Jun 3, 2024--3--3
Lalitha Mohanasundaram🌟The Role and Impact of AQE in Apache Spark🌟What is AQE?🤔2 min read·1 day ago----
Rindhuja Treesa JohnsoninTowards Data ScienceApache Hadoop and Apache Spark for Big Data AnalysisA complete guide to big data analysis using Apache Hadoop (HDFS) and PySpark library in Python on game reviews on the Steam gaming…14 min read·May 8, 2024--1--1
Siladitya GhoshTaming the Data Deluge: How Apache Druid Manages Massive Datasets with SpeedIn today’s data-driven world, organizations are constantly bombarded with information. From website clicks and sensor readings to social…4 min read·1 hour ago--
Apache DorisApache Doris for log and time series data analysis in NetEase, why not Elasticsearch and InfluxDB?NetEase has replaced Elasticsearch and InfluxDB with Apache Doris in its monitoring and time series data analysis platforms, respectively…9 min read·May 24, 2024--1
Ihor LukianovA Brief History of Data Management — From Relational Databases to Data LakehousesHow we evolved to modern data management approaches and what should we know as Data Engineers5 min read·Jun 3, 2024--3
Lalitha Mohanasundaram🌟The Role and Impact of AQE in Apache Spark🌟What is AQE?🤔2 min read·1 day ago--
Rindhuja Treesa JohnsoninTowards Data ScienceApache Hadoop and Apache Spark for Big Data AnalysisA complete guide to big data analysis using Apache Hadoop (HDFS) and PySpark library in Python on game reviews on the Steam gaming…14 min read·May 8, 2024--1
Abhik DeyApache Hadoop 3.3.6 Installation on Ubuntu 22.04In the ever-expanding world of big data, managing and processing vast amounts of information efficiently has become paramount for…7 min read·Nov 7, 2023--6
Pranav BarathwajBig Data and its impact on Data Curation and Management: Key Practices and ApplicationsThis article explores the significant impact of Big Data on data curation and management practices, highlighting the necessity for new…7 min read·1 day ago--
Andrew TaftinTowards Data ScienceSelf-Service Data Analytics as a Hierarchy of NeedsFrom food and shelter to self-actualization. How to use a scientific approach to create the foundations that support self-service…15 min read·Nov 22, 2023--8