Data Explained: Google Analytics Data Sampling

Sometimes, the reports you create inside Google Analytics won’t match the data you receive from ChannelMix. Why is that?


A lot of times, it’s because of sampling, a technique that Google Analytics may apply when generating on-the-fly reports.

What is Sampling

Google Analytics offers a great example of sampling in its Help section:

For example, if you wanted to estimate the number of trees in a 100-acre area where the distribution of trees was fairly uniform, you could count the number of trees in 1 acre and multiply by 100, or count the trees in a half acre and multiply by 200 to get an accurate representation of the entire 100 acres.

The big benefit of sampling is that it’s much faster and, if done right, still very accurate. (Though not as accurate as if -- in the example above -- you’d gone through and counted every single tree in the whole 100 acres, and somehow didn’t make any mistakes.)

How Google Analytics Uses Sampling

Google Analytics often uses sampling when a user modifies one of GA’s existing reports. Let’s say you apply a different segment, filter or secondary dimension. Google Analytics calls this an “ad hoc report.”

Instead of looking at every single row of data you’ve stored in Google Analytics, GA will take take a percentage of those rows and extrapolate metrics for your whole dataset.

  • If you’re using the standard version of Google Analytics, sampling will kick in for ad hoc reports that encompass 500,000 sessions at the property level for the date range you’re using.
  • When using Google Analytics 360, sampling won’t apply until you’re dealing with 1 million to 100 million sessions at the view level for the data range you’re using. And you also have the ability to request unsampled data as a 360 client. (We’re a GA 360 reseller and can answer other questions you might have.)

How to Tell if your Report is Sampled in Google Analytics

Just look in the top left hand corner of the screen. You’ll see text that reads something like “This report is based on X% of sessions.”

Next to that line, you’ll see a toggle button that gives you the ability to control your report’s sample size. You can choose between “Greater precision” and “Faster response.”

And if you go back and remove that filter, dimension or segment you applied to the default report, you’ll see that your numbers are back to being based on 100 percent of your rows.

We hope that helps. If you’ve got more questions, please open a ticket at help.channelmix.com.

Was this article helpful?
0 out of 0 found this helpful
Have more questions? Submit a request

Comments

0 comments

Please sign in to leave a comment.