Google Analytics → Google BigQuery: Create a user behavior web streaming

The Google Analytics→Google BigQuery pipeline collects raw non-sampled data about user behavior on your website, both at hit and session levels.

The collected hit data is uploaded to BigQuery tables in real-time. Session data gets to BigQuery within the following day.

Note:When you create a hit and session data pipeline, OWOX BI automatically creates a cost data pipeline from Google Analytics to Google BigQuery.

What you need before creating a pipeline

  1. To use Google BigQuery and create a streaming pipeline in OWOX BI, you need a Google Cloud Platform project with activated billing. Read more about how to set up Google BigQuery in this article.
  2. To create a user behavior data pipeline, you need a subscription to the plan Growth, Professional, Marketing Data, Reports & Attribution, or Enterprise in OWOX BI. Session cost attribution for this pipeline is not available in the Marketing Data subscription.

How to create a pipeline

To create an OWOX BI hit-based web streaming take the following steps:

Step 1. On the Workspace page, click the ‘New’ button and select the Streaming option in a drop-down menu:

01-nnnew.png

Or, if your workspace page is empty, you will see a screen with three major blocks. Click the ‘+ New streaming’ button.

02-nnnew.png

Step 2. As a streaming type, select ‘Hit-based web streaming’:

03-nnnew.png

And click the 'Create & Setup' button to proceed to the next step.

Step 3. Provide access to Google Analytics:Create_a_Pipeline_3._Provide_access_to_GA.png

Step 4. Select the Google Analytics property and view as the data source.

Here, specify the Google Analytics view you want to gather data to. You need to have Edit permission for this Google Analytics property.

Note that you can set up only one pipeline per property.Create_a_Pipeline_4.Select_GA_view.png

Step 5. Provide access to the Google BigQuery account you want to upload Google Analytics data to:Create_a_Pipeline_5._Provide_Access_to_BQ.png

Step 6. Select a Google BigQuery dataset.

First, select the Google BigQuery project that contains the dataset you want to gather data to. Then, select the dataset.

If needed, you can create a new dataset right here.

You can set up only one OWOX BI pipeline for each BigQuery dataset. Creating several pipelines for one dataset may cause the hit data to be overwritten.Create_a_Pipeline_6.Select_BQ_dataset.png

 

Step 7. Integrate the tracking code on your website. 

The method of implementing the tracking code depends on your project. Read about the tracking with Google Tag Manager, analytics.js, Measurement Protocol, Tealium IQ, and AMP pages, Android and iOS apps tracking in the list of the tracking code implementation methods.

Find it on the created pipeline's page: Settings > JavaScript code for hit and session data collection:Pipeline_Tracking_Code_en.png

Done. The Google Analytics Real-Time Data → Google BigQuery streaming pipeline is all set.

A dataset with hit data tables will appear in your BigQuery project in the first minute of the next hour. Session data will get to BigQuery in 12 hours after the beginning of the next day.  

Please note: In OWOX BI pipeline, the maximum hit payload size is 16 KB. The hits exceeding this limit won't be processed and sent to BigQuery.
In Google Analytics, the maximum hit payload size is 8 KB.

Was this article helpful?
5 out of 5 found this helpful
Have more questions? Submit a request

0 Comments

Please sign in to leave a comment.