PinnedNicholas PiescoModeling in PySpark Pt. 2A robust data model is crucial for maintaining data integrity and quality, particularly when dealing with complex and hierarchical…3 min read·Feb 6, 2024--1--1
PinnedNicholas PiescoinData Engineer ThingsComplex Data Types & OBT Modeling in PySparkComplex data types are essential when dealing with intricate or semi-structured data in PySpark. This guide provides a comprehensive…3 min read·Jan 9, 2024--1--1
PinnedNicholas PiescoinData Engineer ThingsObject-Oriented Programming in PySpark: Serialization IssuesObject-Oriented Programming (OOP) is a foundational principle for modern software development, providing a clear code structure and design…3 min read·Aug 8, 2023----
PinnedNicholas PiescoinData Engineer ThingsThe Evolution of Big Data: From SQL Databases to Interactive Analytics5 min read·Jun 19, 2023--2--2
Nicholas PiescoinData Engineer ThingsGranular Look at Left, Semi, and Anti Joins in PySparkIn data operations, understanding the inner-working of the various types of joins can optimize query performance and accuracy. Spark…5 min read·May 20, 2024----
Nicholas PiescoinData Engineer ThingsCracking the Code: Tied Scores, a Window Functions PerspectiveIn modern businesses, performance dashboards have become essential tools for recognizing and celebrating top performers. Serving as a…4 min read·Apr 3, 2024--1--1
Nicholas PiescoPerformance Considerations; ORMs, ODBC, and Database DriversInteractions between software and databases are a critical component of system architecture. Engineers can employ various data access…3 min read·Nov 14, 2023----
Nicholas PiescoinData Engineer ThingsTraditional & Modern Data Modeling in Distributed Computing Frameworks: A Guide to Effective…Handling data in distributed environments imposes numerous challenges to organizations related to data management and efficient processing…6 min read·Oct 26, 2023--1--1
Nicholas PiescoDecentralization vs Centralization: Balancing Data Management DynamicsThe continuously evolving nature of industry conditions underscores a discernible shift towards data decentralization. Largely triggered by…3 min read·Jun 16, 2023----