PinnedThibaut GourdelTop Dataframe Libraries in 2024Explore the evolution of Python dataframe libraries in 2024, highlighting trends like Apache Arrow, Apache Iceberg, and GPU acceleration.Jul 26Jul 26
PinnedThibaut GourdelETL Engineering Trends in 2024ETL trends 2024 includes: Python hegemony, small data, unstructured data, GenAI-powered ETL, and lakehouse architecture.Jun 24Jun 24
Thibaut GourdelWhy is Data Engineering being PythonisedJava and Scala ruled data software. Now, Python is the top choice for data experts. Let’s understand why.3d ago3d ago
Thibaut GourdelSmall but Mighty DataWhy It’s Time to Reconsider How We Envision and Process Data Workloads.Jul 16Jul 16
Thibaut GourdelETL in the age of LLMsDiscover how LLMs are revolutionizing ETL with natural language pipelines, smart automation, and enhanced unstructured data processing.Jul 11Jul 11
Thibaut GourdelUnstructured data ETL in 2024Exploring the State of Unstructured Data ETL Amidst the GenAI Wave.Jul 3Jul 3
Thibaut GourdelUnlocking the Power of Pandas on Major Cloud PlatformsIn my previous article, I discussed whether Pandas was suitable for developing ETL pipelines. To summarize, Pandas is a solid choice for…Jun 17Jun 17
Thibaut GourdelShould You Use Pandas for ETL?Evaluating Pandas for ETL: Strengths, Limitations, and AlternativesJun 10Jun 10
Thibaut GourdelMy favorite JupyterLab extensions in 2024JupyterLab probably needs no introduction; it’s the go-to IDE for data science. I’ll share with you my favorites extensions for JupyterLab.Jun 4Jun 4