Ever wondered what other SaaS companies are doing in LinkedIn and Facebook Ads? We put together an Ad list of over 450 SaaS companies so you stop guessing. Check it out for free on the button below.
Data reflects your efforts. The more you understand it accurately, the better you can improve your processes. That is, of course, when you genuinely care about understanding what’s going on without pushing a particular agenda.
Here, you will find an example of some ideas and methods we use when analyzing data to inform performance marketing strategies. As you will see, beyond shaping data to fit a narrative, we do our best to be impartial and help organizations improve their decision-making capabilities.
Let us know if you find them valuable in the comments.
Data sample
We took a data set of around 500 records that had:
Original Source: How did the prospect first land on the website
Latest Source: What was the source before they converted (i.e., filled out some form to become a known contact)
How Did You Hear About Us (HDY): volunteered data that the prospect provides when they fill out a form (like requesting a demo)
The analysis below covers some high-level insights we found in the data.
Note: This is not an extensive data set (one company) and it must not be extrapolated as a playbook.
Data Distribution
First, here’s the distribution of the data:
The first observation is that Google Search is the biggest driver in both cases. For prospects who said, ‘We heard about you via Google Search (42%) - Hubspot matches by saying that most people came in via Organic Search ( 35% to be exact). The HDY field doesn’t differentiate b/w Paid Search & Organic Search, so we can assume that 15% of Paid Search traffic is included in the Search HDY bucket.
Around 20% of folks mentioned website/ blog - this is a bit misleading since it doesn't answer the Q ‘How did you land on our website blog?’ but we can assume a high % of these might be from Search (paid or organic) because of the high % in Organic / Paid Search Original Source data.
What is interesting is the high % of direct traffic. At the surface, it does not correlate with any other channel. What we would expect is a high % of users mentioning Podcast or Social as under HD which would then correlate with a high % of direct traffic. But only 4% of HD mentions LinkedIn that correlates to Direct Traffic under Original Source. Surprisingly only 0.9% of Podcast mentions correlate - our initial working assumption was that Podcast might be leading to Direct Traffic (heard a podcast - went to the website) but that is not the case according to the data. Its likely Direct Traffic is a mix of search + social + paid + long tail of other sources where the referral data was not retained or sessions expired.
For everyone who mentioned trade show as the HD source, the Offline Sources lines up. So in this case Hubspot (in some cases more) was able to track Trade Shows properly under Offline Sources. Curious how to properly attribute / assign source to Offline Sources? Check out our Hubspot Weekly series for more on that.
There’s a lot of noise in B2B Marketing about podcasts but only 3% of folks came in via “podcast’ as a HDY source which seems low for what might be a high cost / effort channel to produce. We definitely don’t recommend NOT doing podcasts but it might not be as effective as we’re led to believe.
As we already mentioned, this is a single company limited data set, so it’s critical you analyze your particular GTM source data to reach such conclusions (feel free to reply to this email if you need a hand).
What is missing from Original Source is Paid Social - on HDY source it shows as FB / LinkedIn - but its unclear if its Paid or Organic. We will make a reasonable assumption that it is in fact Paid Social & the ad platforms show it as View Through Conversion vs a Click Through one (i.e. the user saw the ad & converted within a 30 day window).
To visualize the relationship b/w Original Source & HD Source we plotted this correlation visual:
The darker the shade of the box - the higher the #s
When comparing Original Source to Latest Source:
Almost 40% of prospects converted via direct traffic. This might imply they auto complete the website on the browser or return to after the session / cookie expires.
Email Marketing & Organic are close seconds. For Organic search we can reasonably assume its branded organic search (Google company name) and email might be because they converted via an email marketing campaign.
What does this all mean?
While its hard to draw absolute conclusions from a small data set there’s some learnings here:
HDY Source can help but you probably are already capturing the data via Referral sources.
In this case because Search (Paid & Organic) contributes to the majority of conversions - having HDY Source on form might actually be just an extra step to complete - at worse it doesn’t add anything new.
Podcasts only drove 1% of conversions & our initial assumption that podcast might drive a lot of direct traffic did not hold true.
Capturing First Touch (Original Source) & Last Touch (Latest Source) is probably sufficient for B2B businesses. The long tail of Reddit / Podcasts, etc., likely does not drive meaningful volume to warrant a deeper analysis. One caveat here - as you scale and go into OOH & Linear TV or other non digital channels - its best to employ lift tests, geo tests and other measurement models to determine their success.
What did you think of this mini?
😍 Loved it
👍 Just ok
👎 Not for me
RevenueHero, our sponsor, keeps 42/ free for you. They help GTM teams increase their pipeline by qualifying and showing the right sales rep’s calendar immediately after a form submit. Companies like Sprinto, Cast, FusionAuth, Ethyca, and many more achieve their pipeline goals with their product.
Thank you for reading.
You’re in good company, folks from these companies read 42/
I have seen similar metrics across a number of SaaS companies. I think what really matters is what actions will be taken with this data to further prove out hypotheses taken from the data. Too often I see people debating a metric number and then the discussion stops there because they are exhausted trying to defend their point. These metrics are a reference point. Your sub head tells it all.
Very interesting! A few points come to mind:
— Low podcast attribution: The fact that the podcast didn't jump out in HDY data & then explain all the "dark" direct traffic seems quite notable, and the folks pushing podcasts + social for e.g. IMO confuse the medium for the message. CW etc use podcasts+social to push a very distinct narrative and then convert people who buy on narrative. If you 'do a podcast' but don't have that kind of message *or* that kind of extremely online market to meme (i.e. other marketers) then you're outta luck in that playbook, and I'm not sure they'd have much more to offer. In this case, it might be interesting to try and plug a podcast-only offer to send folks to a specific page and see if anyone bites as a test of prospect engagement.
— High trade show attribution: The fact trade shows conversely *does* stand out is pretty interesting -- I was surprised by that. Given the effort + expense that goes into trade shows, I wonder if that makes HDY info worthwhile, at least for a period.
— Organic search: This still remains a black box IMO, and on the organic search side I find "organic" as a grouping pretty meaningless when it's brand + SEO, and it'd be interesting to see how much organic search traffic went to non-brand-SERP pages & what GSC looks like for brand search.
— Direct: IMO it's pretty interesting that HDY data *does* shed some light on direct traffic, and if I understand your correlation chart, that what it reveals is as you say -- it's just a mix of the other sources, there isn't a secret hidden pseudo-channel there waiting to be exploited.
Finally, it's interesting to think about what we can infer about awareness ('brand') vs activation ('conversion') from this kind of data too. I wonder what a kind of meta metric that bucketed all 'brand' traffic together (organic brand, paid brand, direct brand) and treated that as function of all other activity (paid + SEO + podcast + trade shows), with a kind of weighted contribution from each. Not sure exactly what it would look like, but I'd like to think one day our marketing tools will better reflect buying reality and not just data collected... we can dream, at least :)