Vu TrinhinGoogle Cloud - CommunityI spent 6 hours understanding the design principles of BigQuery. Here’s what I foundAll insights from BigQuery academic paper.10 min read·Jan 20, 2024--3--3
Mudra PatelData Engineering concepts: Part 9, Data SecurityThis is Part 9 of my 10 part series of Data Engineering concepts. And in this part, we will discuss about Data Security.7 min read·Apr 8, 2024--1--1
ML-GuyArchitecting a Successful Modern Data Analytics Platform in the CloudAfter we discussed the concepts for Building a Successful Modern Data Analytics Platform in the Cloud, it is time to architect it. This…·12 min read·Feb 5, 2021--2--2
ML-GuyArchitecting a Successful Modern Data Analytics Platform in the CloudAfter we discussed the concepts for Building a Successful Modern Data Analytics Platform in the Cloud, it is time to architect it. This…·12 min read·Feb 5, 2021--2--2
Axel Thevenot 🐣inGoogle Cloud - CommunityEfficient BigQuery Data Modeling: A Storage and Compute ComparisonBigQuery storage and compute comparison for normalized, denormalized, and nested design: an in-depth analysis with actionable optimizations17 min read·Mar 11, 2024--12--12
Marie LefevreinTowards Data ScienceA Guide To Building a Data Department From ScratchSome practical advice on where to start, from someone who’s been there before·7 min read·Mar 12, 2024--5--5
Rashi DesaiinTowards Data ScienceHow To Create A Successful Data PresentationPresent your data like a pro in five easy steps…·6 min read·Mar 11, 2024--1--1
Kasper Groes Albin LudvigseninTowards Data ScienceEnd-to-End NLP Project with Hugging Face, FastAPI, and DockerThis tutorial explains how to build a containerized sentiment analysis API using Hugging Face, FastAPI and Docker·10 min read·Mar 7, 2024--6--6
AvainLevel Up CodingEssential Python Libraries for Web Scraping and Data ExtractionPhoto by Stephen Dawson on Unsplash·3 min read·Sep 11, 2023----
Dave MelilloinTowards Data ScienceBuilding a Data Platform in 2024How to build a modern, scalable data platform to power your analytics and data science projects (updated)9 min read·Feb 5, 2024--42--42
Benjamin EtienneinTowards Data ScienceA Complete Guide to Write your own TransformersAn end-to-end implementation of a Pytorch Transformer, in which we will cover key concepts such as self-attention, encoders, decoders, and…18 min read·Feb 24, 2024--8--8
Vishal BulbuleinGoogle Cloud - CommunityStreamlining an ETL Data Pipeline on Google Cloud with Cloud Data Fusion & AirflowIntroduction5 min read·Feb 23, 2024----
Ryota Kiuchi, Ph.D.inTowards Data ScienceHow OpenAI’s Sora is Changing the Game: An Insight into Its Core TechnologiesA masterpiece of state of the art technologies12 min read·Feb 19, 2024--3--3
krishankant singhalRunning Mistral LLM Locally with OllamaIn this guide, we’ll walk you through the process of downloading Ollama, installing Mistral, and using the Ollama model through LangChain…·3 min read·Feb 17, 2024--1--1
Alexandre Magno Lima MartinsinApache AirflowWhat we learned after running Airflow on Kubernetes for 2 yearsApache Airflow is one of the most important components in our Data Platform, used by different teams inside the business. It powers all of…13 min read·Feb 6, 2024--20--20
Kurt KlingensmithinTowards Data ScienceProfessionally Visualize Data Distributions in PythonLearn seven different methods for visualizing data distributions·12 min read·Feb 18, 2024--6--6
janmeskensinThe Modern ScientistData Ingestion — Part 1: Architectural PatternsOver the course of two articles, I will thoroughly explore data ingestion, a fundamental process that bridges the operational and…11 min read·Nov 27, 2023--24--24
Nikhil AdithyaninDataDrivenInvestorStock Price Prediction with Quantum Machine Learning in PythonAn overview of the challenges and opportunities17 min read·Jan 23, 2024--20--20
Romin IraniinGoogle Cloud - CommunityGetting Started with Gemini AI API via Google Cloud Code Application TemplatesThe Gemini Model is available to all developers and I wanted to share with you one way via which you can test out the API via an…5 min read·Dec 20, 2023--1--1
Nishit KamdarinGoogle Cloud - CommunityDataplex — An intelligent Data Fabric | Data Governance at Scale| Google Cloud | Part — 1 |…Background:6 min read·Sep 24, 2023----