18F / analytics-reporter

Lightweight analytics reporting and publishing tool for Digital Analytics Program's Google Analytics 360 data.

Home Page:https://analytics.usa.gov/

Geek Repo:Geek Repo

Github PK Tool:Github PK Tool

Audit query and report delta between UA -> GA4 application

scottqueen-bixal opened this issue · comments

Background

When Migrating to GA4 there may be some changes in the response or some migrations that can not be complete

User Story

As a product owner I want to be able to see the impact and changes required by updating to the GA4 Client

Acceptance Requirement

A Table that describes "Issues", "Effects", and "Adjustments" based on discovery during upgrade

Issue Effects Adjustment
Page and hostName not part of realTimeReporting top-pages-realtime.json Leverage unifiedScreenName, and loose others
“max-results” is now “limit” Query that includes the key Replace “max-results” with “limit” in query
Some differences in mapping -> https://github.com/18F/analytics-reporter/blob/migrate-ga3-to-ga4/src/process-results/ga-data-processor.js#L131-L162 Some keys on UA client are named or cased differently in GA4 client Update keys based on the mapping provided
Browser version in is not available https://stackoverflow.com/a/68286131, Mostly impacts ie, and windows-ie reports  remove reports
Exit is not available, bounceRate may be closest thing Query that includes exit reporting will no longer have this valueQuery that rely on exit value will no longer be reported Top-exit-pages-30-days won’t be reported
pagePath is said to map to pagePathPlusQueryString, both return values, neither of them has hostname included Query that only include pagePath Use fullPageUrl
hasSocialSourceReferral is not available Was a yes no value ???
download|downloads|(outbound downloads don’t appear to be returned on event names Download api report Use file_download
Sampling level is automatic today, top-pages-7-days We can’t adjust precision in sampling, but we still get a metadata response when sampling has been performed.In some cases sampling is done while processing is still in progress, so !Golden.  When this occurs GA4 buckets values into the “(other)” key.  For reports like today, with date rage “today” -> “today” this makes a significant impact.  By updating the report date range to start from “yesterday”, we get a much more accurate SUM value, but need to handle some data clean-up on FE rendering.

This ticket will close when the following tickets are QA complete