I am speaking on my own behalf and not on behalf of Google or Alphabet.
A common perception among economists who act as consultants is that the public is either unwilling or unable to cope with uncertainty. Hence, they argue that pragmatism dictates provision of point predictions and estimates, even though they may not be credible.
— Charles F. Manski, The Lure of Incredible Certitude (2018)
Set up a statistical null hypothesis of “no mean difference” or “zero correlation.” Don’t specify the predictions of your research hypothesis or of any alternative substantive hypotheses.
Use 5% as a convention for rejecting the null. If significant, accept your research hypothesis.
Always perform this procedure.
Scientific conclusions and business or policy decisions should not be based only on whether a p-value passes a specific threshold.
— Wasserstein & Lazar, ASA Statement on Statistical Significance and P-Values (2016)
What p-values DON’T tell us:
Correct Interpretation: > Given that the true effect is zero, what’s the probability of seeing our result?
Common Misinterpretation: > Given that we saw our result, what’s the probability that the true effect is zero?
Most empirical papers in economics focus on two aspects of their results: whether the estimates are statistically significantly different from zero and the interpretation of the point estimates. This focus obscures important information about the implications of the results for economically interesting hypotheses about values of the parameters other than zero, and in some cases, about the strength of the evidence against values of zero. This limitation can be overcome by reporting confidence intervals for papers’ main estimates and discussing their economic interpretation.
A 95% confidence interval is a range generated by a procedure that, if repeated many times, would contain the true parameter value in 95% of cases.
It’s a statement about the long-run success of the method, not a probability statement about your specific interval.
The following are all false statements.
The probability that the true mean is greater than 0 is at least 95%. (Endorsed by 38% of researchers)
The probability that the true mean equals 0 is smaller than 5%. (Endorsed by 47% of researchers)
The “null hypothesis” that the true mean equals 0 is likely to be incorrect. (Endorsed by 86% of researchers)
There is a 95% probability that the true mean lies between 0.1 and 0.4. (Endorsed by 59% of researchers)
We can be 95% confident that the true mean lies between 0.1 and 0.4. (Endorsed by 55% of researchers)
If we were to repeat the experiment, 95% of the time the true mean would fall between 0.1 and 0.4. (Endorsed by 58% of researchers)
The question business leaders actually need answered:
“How likely is it that Option A is better than Option B?”
This is a question about making a decision under uncertainty.
The traditional frequentist framework wasn’t designed to answer this question directly.
Identify Decision(s) – What specific business decision needs to be made?
Craft Business Questions – What questions can inform that decision?
Design, Implement, and Analyze – How do we get reliable answers?
Communicate Findings – How do we present uncertainty clearly?
Bayesian statistics is a framework for reallocating credibility among different possibilities as new evidence emerges.
It naturally answers the questions business leaders ask.
Before making a change, we usually consider three possibilities:
Bayes gives us the probability of each scenario.
A well-structured question can be surprisingly simple, often following the format:
“Does A do B among C compared to D?”
Relying only on the Average Treatment Effect (ATE) means you treat all your users as if they are the same.
This can be a costly mistake.
You risk leaving money on the table by missing opportunities to target the right users.
To make better decisions, we need to understand heterogeneity.
Methods like Bayesian Causal Forests help us find pockets of opportunity where interventions work best.
The goal is not to be right every time, but to make better bets over the long run.
Ignacio Martinez
ignacio.martinez.fyi
Thank you!
Why Bayes? A Decisions First Framework
jsm25.martinez.fyi