Google Analytics 4 connector for Zoho DataPrep

Google Analytics 4 connector for Zoho DataPrep [BETA]

Zoho DataPrep allows you to import your website and app analytics data from Google Analytics 4 (GA4) for data preparation. You can also schedule the import of your GA4 data at regular intervals. With this connector, you can streamline your ETL workflows, enable seamless data movement, and simplify data integration between Google Analytics 4 and Zoho DataPrep for deeper insights and reporting.

To import data from Google Analytics 4

1. Open an existing pipeline or create a pipeline from the Home Page, Pipelines tab or Workspaces tab and click the Add data option.

InfoInfo: You can also click the Import data  icon at the top of the pipeline builder to bring data from multiple sources into the pipeline.


2. Choose the Marketing category from the left pane and click Google Analytics 4(GA4). You can also search for Google Analytics 4(GA4) in the search box.


NotesNote: If you had already added a connection earlier, click the required connection and proceed to import. You can also find your saved connections under the Saved connections category from the left pane. To learn more about Saved connections, click here.

3. Select an account from the saved connections or connect a new account using Add new option. 



4. Enter a unique connection name and click Authenticate to continue. You will be prompted to authenticate your Google account. You will need to authorize DataPrep to access your data when you do this for the first time.





Notes
Note: The connection configuration will be saved for importing from the Google Analytics 4 in the future. Credentials are securely encrypted and stored.
5. Choose the required Property, Modules and the corresponding Fields will be displayed. Select the Modules and Fields that you would like to import. 

Here's the list of total modules supported,
  1. Demographics
  2. Geo
  3. EventTracking
  4. PageTracking
  5. Ecommerce
  6. TrafficSources
  7. AdWords
  8. PlatformOrDevice
  9. System



6. Choose one of the options to import data from your Google Analytics 4 account.

All Data - This option imports all data from your account.
From date - This option imports data from a specific date till the current date.

7. Once you have completed importing data, Pipeline builder page opens and you can start applying transforms to your ETL pipeline. You can also right-click the stage and choose the Prepare data option and prepare your data in the DataPrep Studio page. Click here to know more about the transforms.



Note: When you import more than one module from your Google Analytics 4 account, each module will be created as a stage in DataPrep as above.
8. Once you are done creating your data flow and applying necessary transforms in your stages, you can right-click a stage and add a destination to complete your data flow.

Notes
Note: After adding a destination to the pipeline, you can try executing your pipeline using a manual run at first. Once you make sure manual run works, you can set up schedule to automate the pipeline and data movement. Learn about the different types of runs here.

Schedule

You can schedule your pipeline using the Schedule option. 

Schedule configuration

1. Select the Schedule option in the pipeline builder.

2. Select a Repeat method (hourly, daily, weekly, monthly) and set frequency using Perform every dropdown. The options of the Perform every dropdown change with the Repeat method. Click here to know more.




3. Select the timezone at which you want to import new data found in the source. By default, your local time zone will be selected.


4. Pause schedule afterThis option allows you to choose to pause the schedule after n number of failures.
InfoInfo: The range can be between 2-100. The default value is 2.

Import configuration

You can configure how to import and fetch data from your Google Analytics 4 account using the Import configuration option. Refer to the sheet below to view the available import configuration options for each modules and corresponding run types.
Note: The import configuration needs to be mandatorily setup for all the sources in the pipeline. Without setting up the import configuration, the schedule cannot be saved.



5. Select the Click here link to set the import configuration. 

6. Select the required option from the How to import data from source? drop down. You can choose to import all datamodified and new data, or do not import based on the module you imported.

Import all data  

If you want to import all data, select the date from when data needs to be imported in the From field. This option will import all available data from the selected date. 



Notes
Note: The From date option is available for the selected modules only(refer the above sheet). For modules that do not support this option, all data will be imported.

Incremental data fetch

Only modified and new data

To import the modified and new data incrementally from the last imported time, select Only modified and new data option 
from the drop-down.



Use existing data if new data is not available: 

During incremental import,

  1. If the checkbox is checked: When there is no new data in the source, the last fetched data will be imported again.
  2. If the checkbox is unchecked: When there is no new data in the source, the import will fail and no files will be imported. This will, in turn, cause the entire pipeline job to fail.

Do not import data 

The data is imported only once. The second time, the rules get applied to the same data and get exported.



7. Click Save to schedule import for your data.
NotesNote: If you have already configured a schedule from Google Analytics 4, data will be reloaded based on your earlier configuration under the Import configuration section when you click on the Edit schedule option and set a new schedule.

Schedule settings

Stop export if data has invalid values: Enabling this will stop the export when prepared data still has invalid values.



Order exports

You can use this option when you have configured multiple destinations and would like to determine in what order the data has to be exported to destinations.

If not enabled, export will run in the default order.
Note: This option will be visible only if you have added more than one destination in your pipeline.

To rearrange the order of your export destinations

1) Click the Order exports toggle


2) You can drag and drop to change the order of the destinations and then click Save.



Note: Click the Edit order link if you want to rearrange the order again.


8. After you configure the schedule configuration, click 
Save to execute the schedule. This will start the pipeline.



Each scheduled run is saved as a job. When a pipeline is scheduled, the data will be fetched from your data sources, prepared using the series of transforms you have applied in each of the stages, and then data will be exported to your destination through seamless data integration at regular intervals. This complete process is captured in the job history.

9. To go to the jobs list of a particular pipeline, go to the  ellipses icon in the pipeline builder, and click on the Job history menu to check the job status of your pipeline.

10. Click the required job ID in the Jobs history page to navigate to the Job summary of a particular job.

The Job summary shows the history of a job executed in a pipeline flow. Click here to know more.

11. When the schedule is completed, the data prepared in your ETL pipeline will be exported to the configured destinations.
Info: You can also view the status of your schedules later on the Jobs page.
NotesNote: If you make any further changes to the pipeline, the changes are saved as a draft version. Choose the Draft option and mark your pipeline as ready for the changes to reflect in the schedule.




After you set your schedule, you can choose to Pause schedule or Resume scheduleEdit schedule and Remove schedule using the Schedule Active option in the pipeline builder.

When you edit and save a schedule, the next job will be from the last schedule run time to the next scheduled data interval.

Important: Adding Google Analytics 4 as destination and pushing data to Google Analytics 4 from DataPrep is not supported yet.


SEE ALSO