Member-only story
Understanding the GA4 BigQuery Export Schema and Structure
A qualitative investigation into one of the weirdest data structures ever forced upon millions of innocent, unsuspecting analysts
Introduction
Google Analytics 4 is currently estimated to be used by 15.6 million websites in the world, meaning that the BigQuery GA4 Export is possibly one of the most widely exported data schemas of all time. Google Analytics 4 data is accessible through the web user interface or directly via the API into Looker Studio, but if you want to:
- Own your data beyond Google’s retention policy,
- Archive your data to prevent possible data loss,
- Combine your data with other internal data sources,
- Augment your data from additional external sources, APIs or LLMs, or
- Build custom automation workflows
then the recommended approach is to enable the GA4 export to BigQuery, which is very simple to set up and configure.
Great! Job done!
Not quite. Once the data starts appearing in daily BigQuery exports, you may notice that the structure is somewhat irregular, making it very difficult to work with directly. This…