Skip to main content

Google Analytics (Universal Analytics) [ARCHIVED]

This page contains the setup guide and reference information for the Google Analytics (Universal Analytics) source connector.

This connector supports Universal Analytics properties through the Reporting API v4.

danger

Google Analytics Universal Analytics Source Connector will be deprecated due to the deprecation of the Google Analytics Universal Analytics API by Google. This deprecation is scheduled by Google on July 1, 2024 (see Google's Documentation for more details). Transition to the Google Analytics 4 (GA4) Source Connector by July 1, 2024, to continue accessing your analytics data.

caution

The Google Analytics (Universal Analytics) connector will be deprecated soon.

Google is phasing out Universal Analytics in favor of Google Analytics 4 (GA4). In consequence, we are deprecating the Google Analytics (Universal Analytics) connector and recommend that you migrate to the Google Analytics 4 (GA4) connector as soon as possible to ensure your syncs are not affected.

Due to this deprecation, we will not be accepting new contributions for this source.

For more information, see "Universal Analytics is going away".

note

Google Analytics Universal Analytics (UA) connector, uses the older version of Google Analytics, which has been the standard for tracking website and app user behavior since 2012.

Google Analytics 4 (GA4) connector is the latest version of Google Analytics, which was introduced in 2020. It offers a new data model that emphasizes events and user properties, rather than pageviews and sessions. This new model allows for more flexible and customizable reporting, as well as more accurate measurement of user behavior across devices and platforms.

Prerequisites

A Google Cloud account with Viewer permissions and Google Analytics Reporting API and Google Analytics API enabled.

Setup guide

For Airbyte Cloud:

  1. Log into your Airbyte Cloud account.
  2. In the left navigation bar, click Sources. In the top-right corner, click + New source.
  3. On the Set up the source page, select Google Analytics from the Source type dropdown.
  4. For Name, enter a name for the Google Analytics connector.
  5. Authenticate your Google account via OAuth or Service Account Key Authentication.
    • To authenticate your Google account via OAuth, click Sign in with Google and complete the authentication workflow.
    • To authenticate your Google account via Service Account Key Authentication, enter your Google Cloud service account key in JSON format. Make sure the Service Account has the Project Viewer permission.
  6. Enter the Replication Start Date in YYYY-MM-DD format. The data added on and after this date will be replicated. If this field is blank, Airbyte will replicate all data.
  7. Enter the View ID for the Google Analytics View you want to fetch data from.
  8. Leave Data request time increment in days (Optional) blank or set to 1. For faster syncs, set this value to more than 1 but that might result in the Google Analytics API returning sampled data, potentially causing inaccuracies in the returned results. The maximum allowed value is 364.

For Airbyte Open Source:

  1. Navigate to the Airbyte Open Source dashboard.
  2. Go to the Airbyte UI and click Sources and then click + New source.
  3. On the Set up the source page, select Google Analytics from the Source type dropdown.
  4. Enter a name for the Google Analytics connector.
  5. Authenticate your Google account via OAuth or Service Account Key Authentication:
  6. Enter the Replication Start Date in YYYY-MM-DD format. The data added on and after this date will be replicated. If this field is blank, Airbyte will replicate all data.
  1. Enter the View ID for the Google Analytics View you want to fetch data from.
  2. Optionally, enter a JSON object as a string in the Custom Reports field. For details, refer to Requesting custom reports
  3. Leave Data request time increment in days (Optional) blank or set to 1. For faster syncs, set this value to more than 1 but that might result in the Google Analytics API returning sampled data, potentially causing inaccuracies in the returned results. The maximum allowed value is 364.

Supported sync modes

The Google Analytics source connector supports the following sync modes:

caution

You need to add the service account email address on the account level, not the property level. Otherwise, an 403 error will be returned.

Supported streams

The Google Analytics (Universal Analytics) source connector can sync the following tables:

Stream nameSchema
website_overview{"ga_date":"2021-02-11","ga_users":1,"ga_newUsers":0,"ga_sessions":9,"ga_sessionsPerUser":9.0,"ga_avgSessionDuration":28.77777777777778,"ga_pageviews":63,"ga_pageviewsPerSession":7.0,"ga_avgTimeOnPage":4.685185185185185,"ga_bounceRate":0.0,"ga_exitRate":14.285714285714285,"view_id":"211669975"}
traffic_sources{"ga_date":"2021-02-11","ga_source":"(direct)","ga_medium":"(none)","ga_socialNetwork":"(not set)","ga_users":1,"ga_newUsers":0,"ga_sessions":9,"ga_sessionsPerUser":9.0,"ga_avgSessionDuration":28.77777777777778,"ga_pageviews":63,"ga_pageviewsPerSession":7.0,"ga_avgTimeOnPage":4.685185185185185,"ga_bounceRate":0.0,"ga_exitRate":14.285714285714285,"view_id":"211669975"}
pages{"ga_date":"2021-02-11","ga_hostname":"mydemo.com","ga_pagePath":"/home5","ga_pageviews":63,"ga_uniquePageviews":9,"ga_avgTimeOnPage":4.685185185185185,"ga_entrances":9,"ga_entranceRate":14.285714285714285,"ga_bounceRate":0.0,"ga_exits":9,"ga_exitRate":14.285714285714285,"view_id":"211669975"}
locations{"ga_date":"2021-02-11","ga_continent":"Americas","ga_subContinent":"Northern America","ga_country":"United States","ga_region":"Iowa","ga_metro":"Des Moines-Ames IA","ga_city":"Des Moines","ga_users":1,"ga_newUsers":0,"ga_sessions":1,"ga_sessionsPerUser":1.0,"ga_avgSessionDuration":29.0,"ga_pageviews":7,"ga_pageviewsPerSession":7.0,"ga_avgTimeOnPage":4.666666666666667,"ga_bounceRate":0.0,"ga_exitRate":14.285714285714285,"view_id":"211669975"}
monthly_active_users{"ga_date":"2021-02-11","ga_30dayUsers":1,"view_id":"211669975"}
four_weekly_active_users{"ga_date":"2021-02-11","ga_28dayUsers":1,"view_id":"211669975"}
two_weekly_active_users{"ga_date":"2021-02-11","ga_14dayUsers":1,"view_id":"211669975"}
weekly_active_users{"ga_date":"2021-02-11","ga_7dayUsers":1,"view_id":"211669975"}
daily_active_users{"ga_date":"2021-02-11","ga_1dayUsers":1,"view_id":"211669975"}
devices{"ga_date":"2021-02-11","ga_deviceCategory":"desktop","ga_operatingSystem":"Macintosh","ga_browser":"Chrome","ga_users":1,"ga_newUsers":0,"ga_sessions":9,"ga_sessionsPerUser":9.0,"ga_avgSessionDuration":28.77777777777778,"ga_pageviews":63,"ga_pageviewsPerSession":7.0,"ga_avgTimeOnPage":4.685185185185185,"ga_bounceRate":0.0,"ga_exitRate":14.285714285714285,"view_id":"211669975"}
Any custom reportsSee below for details.

Reach out to us on Slack or create an issue if you need to send custom Google Analytics report data with Airbyte.

Rate Limits and Performance Considerations (Airbyte Open Source)

Analytics Reporting API v4

  • Number of requests per day per project: 50,000
  • Number of requests per view (profile) per day: 10,000 (cannot be increased)
  • Number of requests per 100 seconds per project: 2,000
  • Number of requests per 100 seconds per user per project: 100 (can be increased in Google API Console to 1,000).

The Google Analytics connector should not run into the "requests per 100 seconds" limitation under normal usage. Create an issue if you see any rate limit issues that are not automatically retried successfully and try increasing the window_in_days value.

Sampled data in reports

If you are not on the Google Analytics 360 tier, the Google Analytics API may return sampled data if the amount of data in your Google Analytics account exceeds Google's pre-determined compute thresholds. This means the data returned in the report is an estimate which may have some inaccuracy. This Google page provides a comprehensive overview of how Google applies sampling to your data.

In order to minimize the chances of sampling being applied to your data, Airbyte makes data requests to Google in one day increments (the smallest allowed date increment). This reduces the amount of data the Google API processes per request, thus minimizing the chances of sampling being applied. The downside of requesting data in one day increments is that it increases the time it takes to export your Google Analytics data. If sampling is not a concern, you can override this behavior by setting the optional window_in_day parameter to specify the number of days to look back and avoid sampling. When sampling occurs, a warning is logged to the sync log.

Requesting Custom Reports

Custom Reports allow for flexibility in the reporting dimensions and metrics to meet your specific use case. Use the GA4 Query Explorer to help build your report. To ensure your dimensions and metrics are compatible, you can also refer to the GA4 Dimensions & Metrics Explorer.

A custom report is formatted as: [{"name": "<report-name>", "dimensions": ["<dimension-name>", ...], "metrics": ["<metric-name>", ...]}]

Example of a custom report:

[
{
"name": "page_views_and_users",
"dimensions": [
"ga:date",
"ga:pagePath",
"ga:sessionDefaultChannelGrouping"
],
"metrics": ["ga:screenPageViews", "ga:totalUsers"]
}
]

Multiple custom reports should be entered with a comma separator. Each custom report is created as it's own stream. Example of multiple custom reports:

[
{
"name": "page_views_and_users",
"dimensions": ["ga:date", "ga:pagePath"],
"metrics": ["ga:screenPageViews", "ga:totalUsers"]
},
{
"name": "sessions_by_region",
"dimensions": ["ga:date", "ga:region"],
"metrics": ["ga:totalUsers", "ga:sessions"]
}
]

Custom reports can also include segments and filters to pull a subset of your data. The report should be formatted as:

[
{
"name": "<report-name>",
"dimensions": ["<dimension-name>", ...],
"metrics": ["<metric-name>", ...],
"segments": ["<segment-id-or-dynamic-segment-name>", ...],
"filter": "<filter-name>"
}
]
  • When using segments, make sure you also add the ga:segment dimension.

Example of a custom report with segments and/or filters:

[
{
"name": "page_views_and_users",
"dimensions": ["ga:date", "ga:pagePath", "ga:segment"],
"metrics": ["ga:sessions", "ga:totalUsers"],
"segments": ["ga:sessionSource!=(direct)"],
"filter": ["ga:sessionSource!=(direct);ga:sessionSource!=(not set)"]
}
]

To create a list of dimensions, you can use default Google Analytics dimensions (listed below) or custom dimensions if you have some defined. Each report can contain no more than 7 dimensions, and they must all be unique. The default Google Analytics dimensions are:

  • ga:browser
  • ga:city
  • ga:continent
  • ga:country
  • ga:date
  • ga:deviceCategory
  • ga:hostname
  • ga:medium
  • ga:metro
  • ga:operatingSystem
  • ga:pagePath
  • ga:region
  • ga:socialNetwork
  • ga:source
  • ga:subContinent

To create a list of metrics, use a default Google Analytics metric (values from the list below) or custom metrics if you have defined them. A custom report can contain no more than 10 unique metrics. The default available Google Analytics metrics are:

  • ga:14dayUsers
  • ga:1dayUsers
  • ga:28dayUsers
  • ga:30dayUsers
  • ga:7dayUsers
  • ga:avgSessionDuration
  • ga:avgTimeOnPage
  • ga:bounceRate
  • ga:entranceRate
  • ga:entrances
  • ga:exitRate
  • ga:exits
  • ga:newUsers
  • ga:pageviews
  • ga:pageviewsPerSession
  • ga:sessions
  • ga:sessionsPerUser
  • ga:uniquePageviews
  • ga:users

Incremental sync is supported only if you add ga:date dimension to your custom report.

Limitations & Troubleshooting

Expand to see details about Google Analytics v4 connector limitations and troubleshooting.

Connector limitations

Rate limiting

Analytics Reporting API v4

  • Number of requests per day per project: 50,000
  • Number of requests per view (profile) per day: 10,000 (cannot be increased)
  • Number of requests per 100 seconds per project: 2,000
  • Number of requests per 100 seconds per user per project: 100 (can be increased in Google API Console to 1,000).

The Google Analytics connector should not run into the "requests per 100 seconds" limitation under normal usage. Create an issue if you see any rate limit issues that are not automatically retried successfully and try increasing the window_in_days value.

Troubleshooting

  • Check out common troubleshooting issues for the Google Analytics v4 source connector on our Airbyte Forum.

Changelog

Expand to review
VersionDatePull RequestSubject
0.4.02024-07-0140244Deprecate the connector
0.3.32024-06-2139940Update dependencies
0.3.22024-06-0438934[autopull] Upgrade base image to v1.2.1
0.3.12024-04-1937432Fix empty response error for test stream
0.3.02024-03-1936267Pin airbyte-cdk version to ^0
0.2.52024-02-0935101Manage dependencies with Poetry.
0.2.42024-01-2234323Update setup dependencies
0.2.32024-01-1834353Add End date option
0.2.22023-10-1931599Base image migration: remove Dockerfile and use the python-connector-base image
0.2.12023-07-1128149Specify date format to support datepicker in UI
0.2.02023-06-2627738License Update: Elv2
0.1.362023-04-1322223Fix custom report with Segments dimensions
0.1.352023-05-3126885Remove authSpecification from spec in favour of advancedAuth
0.1.342023-01-2722006Set AvailabilityStrategy for streams explicitly to None
0.1.332022-12-2320858Fix check connection
0.1.322022-11-0418965Fix for discovery stage, when custom_reports are provided with single stream as dict
0.1.312022-10-3018670Add Custom Reports schema validation on check connection
0.1.302022-10-1317943Fix pagination
0.1.292022-10-1217905Handle exceeded daily quota gracefully
0.1.282022-09-2416920Added segments and filters to custom reports
0.1.272022-10-0717717Improve CHECK by using ga:hits metric.
0.1.262022-09-2817326Migrate to per-stream states.
0.1.252022-07-2715087Fix documentationUrl
0.1.242022-07-2615042Update additionalProperties field to true from schemas
0.1.232022-07-2214949Add handle request daily quota error
0.1.222022-06-3014298Specify integer type for ga:dateHourMinute dimension
0.1.212022-04-3012500Improve input configuration copy
0.1.202022-04-2812426Expose isDataGOlden field and always resync data two days back to make sure it is golden
0.1.192022-04-1912150Minor changes to documentation
0.1.182022-04-0711803Improved documentation
0.1.172022-03-3111512Improved Unit and Acceptance tests coverage, fixed read with abnormally large state values
0.1.162022-01-269480Reintroduce window_in_days and log warning when sampling occurs
0.1.152021-12-289165Update titles and descriptions
0.1.142021-12-098656Fix date format in schemas
0.1.132021-12-098676Fix window_in_days validation issue
0.1.122021-12-038175Fix validation of unknown metric(s) or dimension(s) error
0.1.112021-11-308264Corrected date range
0.1.102021-11-198087Support start_date before the account has any data
0.1.92021-10-277410Add check for correct permission for requested view_id
0.1.82021-10-137020Add intermediary auth config support
0.1.72021-10-076414Declare OAuth parameters in Google sources
0.1.62021-09-276459Update OAuth Spec File
0.1.32021-09-216357Fix OAuth workflow parameters
0.1.22021-09-206306Support of Airbyte OAuth initialization flow
0.1.12021-08-255655Corrected validation of empty custom report
0.1.02021-08-105290Initial Release