Attribution (marketing)

Last updated

In marketing, attribution, also known as multi-touch attribution (MTA), is the identification of a set of user actions ("events" or "touchpoints") that contribute to a desired outcome, and then the assignment of a value to each of these events. [1] [2] Marketing attribution provides a level of understanding of what combination of events in what particular order influence individuals to engage in a desired behavior, typically referred to as a conversion. [1] [2]

Contents

History

The roots of marketing attribution can be traced to the psychological theory of attribution. [2] [3] By most accounts, the current application of attribution theory in marketing was spurred by the transition of advertising spending from traditional, offline ads to digital media and the expansion of data available through digital channels such as paid and organic search, display, and email marketing. [2] [4]

Concept

The purpose of marketing attribution is to quantify the influence each advertising impression has on a consumer's decision to make a purchase decision, or convert. [4] Visibility into what influences the audience, when and to what extent, allows marketers to optimize media spend for conversions and compare the value of different marketing channels, including paid and organic search, email, affiliate marketing, display ads, social media and more. [2] Understanding the entire conversion path across the whole marketing mix diminishes the accuracy challenge of analyzing data from siloed channels. Typically, attribution data is used by marketers to plan future ad campaigns and inform the performance of previous campaigns by analyzing which media placements (ads) were the most cost-effective and influential as determined by metrics such as return on ad spend (ROAS) or cost per lead (CPL). [2]

Attribution models

Resulting from the disruption created by the rapid growth of online advertising over the last ten years, marketing organizations have access to significantly more data to track effectiveness and ROI. This change has impacted how marketers measure the effectiveness of advertisements, as well as the development of new metrics such as cost per click (CPC), Cost per thousand impressions (CPM), Cost per action/acquisition (CPA) and click-through conversion. Additionally, multiple attribution models have evolved over time as the proliferation of digital devices and tremendous growth in data available have pushed the development of attribution technology.

Constructing an algorithmic attribution model

Binary classification methods from statistics and machine learning can be used to build appropriate models. However, an important element of the models is model interpretability; therefore, logistic regression is often appropriate due to the ease of interpreting model coefficients.

Behavioral model

Suppose observed advertising data are where

  • covariates
  • consumer saw ad or not
  • conversion: binary response to the ad
Consumer choice model

  covariates and ads

Covariates, , generally include different characteristics about the ad served (creative, size, campaign, marketing tactic, etc.) and descriptive data about the consumer who saw the ad (geographic location, device type, OS type, etc.). [8]

Utility theory

  [9]

Counterfactual procedure

An important feature of the modeling approach is estimating the potential outcome of consumers supposing that they were not exposed to an ad. Because marketing is not a controlled experiment, it is helpful to derive potential outcomes in order to understand the true effect of marketing.

Mean outcome if all consumers saw the same advertisement is given by

 

A marketer is often interested in understanding the 'base', or the likelihood that a consumer will convert without being influenced by marketing. This allows the marketer to understand the true effectiveness of the marketing plan. The total number of conversions minus the 'base' conversions will give an accurate view of the number of conversions driven by marketing. The 'base' estimate can be approximated using the derived logistic function and using potential outcomes.

Once the base is derived, the incremental effect of marketing can be understood to be the lift over the 'base' for each ad supposing the others were not seen in the potential outcome. This lift over the base is often used as the weight for that characteristic inside the attribution model.

With the weights constructed, the marketer can know the true proportion of conversions driven by different marketing channels or tactics.

Marketing mix and attribution models

Depending on the company's marketing mix, they may use different types of attribution to track their marketing channels:

Related Research Articles

In economics, utility is a measure of the satisfaction that a certain person has from a certain state of the world. Over time, the term has been used in at least two different meanings.

<span class="mw-page-title-main">Mental accounting</span>

Mental accounting is a model of consumer behaviour developed by Richard Thaler that attempts to describe the process whereby people code, categorize and evaluate economic outcomes. Mental accounting incorporates the economic concepts of prospect theory and transactional utility theory to evaluate how people create distinctions between their financial resources in the form of mental accounts, which in turn impacts the buyer decision process and reaction to economic outcomes. People are presumed to make mental accounts as a self control strategy to manage and keep track of their spending and resources. People budget money into mental accounts for savings or expense categories. People also are assumed to make mental accounts to facilitate savings for larger purposes. Mental accounting can result in people demonstrating greater loss aversion for certain mental accounts, resulting in cognitive bias that incentivizes systematic departures from consumer rationality. Through an increased understanding of mental accounting differences in decision making based on different resources, and different reactions based on similar outcomes can be greater understood.

In statistics, econometrics, epidemiology and related disciplines, the method of instrumental variables (IV) is used to estimate causal relationships when controlled experiments are not feasible or when a treatment is not successfully delivered to every unit in a randomized experiment. Intuitively, IVs are used when an explanatory variable of interest is correlated with the error term (endogenous), in which case ordinary least squares and ANOVA give biased results. A valid instrument induces changes in the explanatory variable but has no independent effect on the dependent variable and is not correlated with the error term, allowing a researcher to uncover the causal effect of the explanatory variable on the dependent variable.

Cost per action (CPA), also sometimes misconstrued in marketing environments as cost per acquisition, is an online advertising measurement and pricing model referring to a specified action, for example, a sale, click, or form submit.

<span class="mw-page-title-main">Regularization (mathematics)</span> Technique to make a model more generalizable and transferable

In mathematics, statistics, finance, and computer science, particularly in machine learning and inverse problems, regularization is a process that converts the answer of a problem to a simpler one. It is often used in solving ill-posed problems or to prevent overfitting.

Functional data analysis (FDA) is a branch of statistics that analyses data providing information about curves, surfaces or anything else varying over a continuum. In its most general form, under an FDA framework, each sample element of functional data is considered to be a random function. The physical continuum over which these functions are defined is often time, but may also be spatial location, wavelength, probability, etc. Intrinsically, functional data are infinite dimensional. The high intrinsic dimensionality of these data brings challenges for theory as well as computation, where these challenges vary with how the functional data were sampled. However, the high or infinite dimensional structure of the data is a rich source of information and there are many interesting challenges for research and data analysis.

In statistics, Poisson regression is a generalized linear model form of regression analysis used to model count data and contingency tables. Poisson regression assumes the response variable Y has a Poisson distribution, and assumes the logarithm of its expected value can be modeled by a linear combination of unknown parameters. A Poisson regression model is sometimes known as a log-linear model, especially when used to model contingency tables.

In statistics, binomial regression is a regression analysis technique in which the response has a binomial distribution: it is the number of successes in a series of independent Bernoulli trials, where each trial has probability of success . In binomial regression, the probability of a success is related to explanatory variables: the corresponding concept in ordinary regression is to relate the mean value of the unobserved response to explanatory variables.

<span class="mw-page-title-main">Digital marketing</span> Marketing of products or services using digital technologies or digital tools

Digital marketing is the component of marketing that uses the Internet and online-based digital technologies such as desktop computers, mobile phones, and other digital media and platforms to promote products and services. It has significantly transformed the way brands and businesses utilize technology for marketing since the 1990s and 2000s. As digital platforms became increasingly incorporated into marketing plans and everyday life, and as people increasingly used digital devices instead of visiting physical shops, digital marketing campaigns have become prevalent, employing combinations of search engine optimization (SEO), search engine marketing (SEM), content marketing, influencer marketing, content automation, campaign marketing, data-driven marketing, e-commerce marketing, social media marketing, social media optimization, e-mail direct marketing, display advertising, e-books, and optical disks and games have become commonplace. Digital marketing extends to non-Internet channels that provide digital media, such as television, mobile phones, callbacks, and on-hold mobile ringtones. The extension to non-Internet channels differentiates digital marketing from online marketing.

In the statistical analysis of observational data, propensity score matching (PSM) is a statistical matching technique that attempts to estimate the effect of a treatment, policy, or other intervention by accounting for the covariates that predict receiving the treatment. PSM attempts to reduce the bias due to confounding variables that could be found in an estimate of the treatment effect obtained from simply comparing outcomes among units that received the treatment versus those that did not. Paul R. Rosenbaum and Donald Rubin introduced the technique in 1983.

In statistics, principal component regression (PCR) is a regression analysis technique that is based on principal component analysis (PCA). More specifically, PCR is used for estimating the unknown regression coefficients in a standard linear regression model.

In statistical theory, the field of high-dimensional statistics studies data whose dimension is larger than typically considered in classical multivariate analysis. The area arose owing to the emergence of many modern data sets in which the dimension of the data vectors may be comparable to, or even larger than, the sample size, so that justification for the use of traditional techniques, often based on asymptotic arguments with the dimension held fixed as the sample size increased, was lacking.

In statistics and machine learning, lasso is a regression analysis method that performs both variable selection and regularization in order to enhance the prediction accuracy and interpretability of the resulting statistical model. The lasso method assumes that the coefficients of the linear model are sparse, meaning that few of them are non-zero. It was originally introduced in geophysics, and later by Robert Tibshirani, who coined the term.

A view-through rate (VTR), measures the number of post-impression response or viewthrough from display media impressions viewed during and following an online advertising campaign. Such post-exposure behavior can be expressed in site visits, on-site events, conversions occurring at one or more Web sites or potentially offline:

Affiliate Tracking Software is used to track the referral, endorsement or recommendation made by one person or company to buy products or services from another person or company. Tracking is necessary to manage and reward or compensate the participants of an affiliate marketing group of participants or affiliate networks.

Inverse probability weighting is a statistical technique for estimating quantities related to a population other than the one from which the data was collected. Study designs with a disparate sampling population and population of target inference are common in application. There may be prohibitive factors barring researchers from directly sampling from the target population such as cost, time, or ethical concerns. A solution to this problem is to use an alternate design strategy, e.g. stratified sampling. Weighting, when correctly applied, can potentially improve the efficiency and reduce the bias of unweighted estimators.

In electronic commerce, conversion marketing is marketing with the intention of increasing conversions—that is, site visitors who are paying customers.

Functional regression is a version of regression analysis when responses or covariates include functional data. Functional regression models can be classified into four types depending on whether the responses or covariates are functional or scalar: (i) scalar responses with functional covariates, (ii) functional responses with scalar covariates, (iii) functional responses with functional covariates, and (iv) scalar or functional responses with functional and scalar covariates. In addition, functional regression models can be linear, partially linear, or nonlinear. In particular, functional polynomial models, functional single and multiple index models and functional additive models are three special cases of functional nonlinear models.

Google Attribution is a monitoring program developed by Internet advertising company Alphabet Inc. launched in 2017. It has to link with a Google Analytics view that is associated with a registered AdWords or DoubleClick Search account. An attribution model is set of rules, that shows how credit for sales and conversions are allocated to touchpoints in conversion paths. For example, the Last Interaction model in Analytics assigns 100% credit to final touchpoints that immediately precede sales or conversion. In contrast, the First Interaction model assigns 100% credit to touchpoints that initiate conversion paths.

Data-driven marketing is a process where marketers employ a process to gain insights into consumer behavior, including purchasing patterns, advert effectiveness, and browsing habits. Contemporary methods utilize big data strategies to collect and analyze information on consumer interactions and engagements, aiming to predict future behaviors. This analysis involves evaluating existing data, acquiring new data and systematically organizing and interpreting it to improve marketing strategies. The primary objective is to better understand and address customer needs. Market research provides a detailed understanding of consumer preferences

References

  1. 1 2 Benjamin Dick (August 1, 2016). "Digital Attribution Primer 2.0" (PDF). IAB.com. Retrieved April 30, 2019.
  2. 1 2 3 4 5 6 7 8 Stephanie Miller (February 6, 2013). "Digital Marketing Attribution.Digital Marketing Attribution". DMNews.com. Retrieved March 25, 2013.
  3. Kartik Hosanagar (July 2012). "Attribution: Who gets the Credit for a New Customer?". The Wharton School. Retrieved March 25, 2013.
  4. 1 2 3 4 Yair Halevi (October 10, 2012). "The problem with click-based attribution". iMediaConnection.com. Retrieved March 25, 2013.
  5. 1 2 3 Tina Moffett (April 30, 2012). "The Forrester Wave: Cross-Channel Attribution Providers". Forrester Research. Archived from the original on April 13, 2013. Retrieved March 22, 2013.
  6. 1 2 3 David Raab (July 1, 2011). "Marketing Attribution Beyond the Last Click". Information-Management.com. Retrieved March 25, 2013.
  7. Broadbent, Andrew J. (1918-01-10). Perfect attribution modeling and how to attain this marketing nirvana. TNW.
  8. Lancaster, Kelvin J. (1966-01-01). "A New Approach to Consumer Theory". Journal of Political Economy. 74 (2): 132–157. doi:10.1086/259131. S2CID   222425622.
  9. McFadden, D. (1972-01-01). "CONDITIONAL LOGIT ANALYSIS OF QUALITATIVE CHOICE BEHAVIOR". Working Paper Institute of Urban and Regional (199/).
  10. "Why Your Demand Team Can't Ignore Account Based Attribution". www.bizible.com. Retrieved 2016-01-11.