class: left, bottom, inverted, title-slide # IOPS Winter 2021 ## Different perspectives on the Bayes factor ### Jorge N. Tendeiro ### Hiroshima University ###
09-Dec-2021
Email:
tendeiro@hiroshima-u.ac.jp
Webpage:
www.jorgetendeiro.com
Photo by
Pixabay
from
Pexels
.
--- background-image: url(data:image/png;base64,#Figures/pexels-terje-sollie-313700.png) background-size: cover ## Today - What is the Bayes factor?<br><br> - What did I think of it?<br> <i>Tendeiro and Kiers (2019)</i> - vRW trying to come to the rescue.<br> <i>van Ravenzwaaij and Wagenmakers (2021)</i> - Nope, I don't think so.<br> <i>Tendeiro and Kiers (2021)</i> .footnote[ <span class="myfoot">Photo by [Terje Sollie](https://www.pexels.com/@solliefoto) from [Pexels](https://www.pexels.com/photo/close-up-of-menu-313700/).</span> ] --- class: inverse, middle # What is the Bayes factor? <style type="text/css"> .confused { position: relative; z-index: 1; } .confused::before { content: ""; background-image: url(Figures/confused2.gif); background-size: cover; position: absolute; top: 0px; right: 0px; bottom: 0px; left: 0px; opacity: 0.06; z-index: -1; } </style> --- background-image: url(data:image/png;base64,#Figures/pexels-olya-kobruseva-5428833.png) background-size: cover ## What is the Bayes factor? .footnote[ <span class="myfoot">Photo by [Olya Kobruseva](https://www.pexels.com/@olyakobruseva) from [Pexels](https://www.pexels.com/photo/question-marks-on-paper-crafts-5428833/).</span> ] To put it lightly: > *The Bayes factor (BF) is the Bayesian way of testing hypotheses.* -- <br> To put it accurately: > *The BF compares the <span style='color: #DCA559;'>predictive ability</span> of two competing models.<br>* --- class: confused ## What is the Bayes factor? .footnote[ <span class="myfoot">GIF by [Achievement Hunter](https://giphy.com/achievementhunter) from [Giphy](https://giphy.com/gifs/achievementhunter-rooster-teeth-achievement-hunter-off-topic-ah-4JVTF9zR9BicshFAb7).</span> ] From Bayes rule we can derive that `$$\boxed{ \underset{\text{prior odds}}{\underbrace{\frac{p(\mathcal{M}_0)}{p(\mathcal{M}_1)}}}\times \underset{\text{Bayes factor, }BF_{01}}{\underbrace{\frac{p(D|\mathcal{M}_0)}{p(D|\mathcal{M}_1)}}} = \underset{\text{posterior odds}}{\underbrace{\frac{p(\mathcal{M}_0|D)}{p(\mathcal{M}_1|D)}}} }$$` -- <br><br> Two possible interpretations for `\(BF_{01}\)`: 1. It conveys the relative probability of the observed data under either model.<br> In other words, it indicates what the relative <span style='color: #DCA559;'>predictive ability</span> of the two models is. -- 2. It indicates how an *a priori* relative model belief should be rationally updated, in light of the observed data. <!-- <br> --> <!-- So, if for instance `\(BF_{01}=5\)`, then we can say that: --> <!-- 1. The observed data are five times more likely under `\(\mathcal{M}_0\)` than under `\(\mathcal{M}_1\)`. --> <!-- 2. Whatever your relative model belief is before observing the data, you should *always* revise your own belief in favor of the null model by 5 times. --> --- class: inverse, middle # What did I think of it?<br><span style="font-size:70%"><i>Tendeiro and Kiers (2019)</i></span> --- background-image: url(data:image/png;base64,#Figures/pexels-photo-1181343.png) background-size: cover ## What did I think of it? <span style="font-size:70%"><i>Tendeiro and Kiers (2019)</i></span> .footnote[ <span class="myfoot"> Photo by [Christina Morillo](https://www.pexels.com/@divinetechygirl) from [Pexels](https://www.pexels.com/photo/man-wearing-blue-dress-shirt-facing-whiteboard-1181343/).</span> ] ### Why not scrutinize the BF? > *Any methodological approach has its advantages <span style='color: #DCA559;'>as well as its drawbacks</span> and NHBT is no different.* (TK2019, p. 775) -- <br> ### Talk about humility in science: > *One year before this writing, <span style='color: #DCA559;'>the authors felt they did not know enough</span> about NHBT to properly understand how it works, what its merits are, and what its potential limitations are. Therefore, we carried out an extensive literature study on NHBT and related topics. This article is the result of putting together a range of discussions on NHBT.* (TK2019, p. 775) --- background-image: url(data:image/png;base64,#Figures/pexels-photo-1181343.png) background-size: cover ## What did I think of it? <span style="font-size:70%"><i>Tendeiro and Kiers (2019)</i></span> .footnote[ <span class="myfoot"> Photo by [Christina Morillo](https://www.pexels.com/@divinetechygirl) from [Pexels](https://www.pexels.com/photo/man-wearing-blue-dress-shirt-facing-whiteboard-1181343/).</span> ] ### The semantics of what's an ''issue'': > *In this article we offer a wide overview of <span style='color: #DCA559;'>issues</span> about NHBT which is currently missing in the literature.* (TK2019, Abstract) -- <br> > *An ''issue'' can either be a <span style='color: #DCA559;'>limitation</span> (according to us) or a <span style='color: #DCA559;'>feature</span> that may (according to us) increase the risk of misuse or misinterpretation of a Bayes factor.* (TK2019, p. 775) --- background-image: url(data:image/png;base64,#Figures/pexels-photo-1181343.png) background-size: cover ## What did I think of it? <span style="font-size:70%"><i>Tendeiro and Kiers (2019)</i></span> ### Our list of issues: 1. Bayes factors can be hard to compute. 2. Bayes factors are sensitive to within-model priors. 3. Use of ''default'' Bayes factors. 4. Bayes factors are not posterior model probabilities. 5. Bayes factors do not imply a model is probably correct. 6. Qualitative interpretation of Bayes factors. 7. Bayes factors test model classes. 8. Mismatch between Bayes factors and parameter estimation. 9. Bayes factors favor the point null model. 10. Bayes factors favor the alternative. 11. Bayes factors often agree with `\(p\)` values. .footnote[ <span class="myfoot"> Photo by [Christina Morillo](https://www.pexels.com/@divinetechygirl) from [Pexels](https://www.pexels.com/photo/man-wearing-blue-dress-shirt-facing-whiteboard-1181343/).</span> ] --- class: inverse, middle # vRW trying to come to the rescue<br><span style="font-size:70%"><i>van Ravenzwaaij and Wagenmakers (2021)</i></span> --- background-image: url(data:image/png;base64,#Figures/pexels-klaus-nielsen-6303535.png) background-size: cover ## vRW trying to come to the rescue<span style="font-size:70%"> <i>van Ravenzwaaij and Wagenmakers (2021)</i></span> .footnote[ <span class="myfoot"> Photo by [Klaus Nielsen](https://www.pexels.com/@klaus-nielsen) from [Pexels](https://www.pexels.com/photo/expressive-doctor-in-superhero-costume-6303535/).</span> ] ### From the Abstract: > *But although we agree with many of their thoughtful recommendations, we believe that Tendeiro and Kiers are <span style='color: #DCA559;'>overly pessimistic</span>, and that several of their 'issues' with NHBT may in fact be conceived as <span style='color: #DCA559;'>pronounced advantages</span>.* -- <div style="margin-bottom:30px;"></div> > *We (...) end with a critical discussion of one of the recommendations by Tendeiro and Kiers, which is that ''estimation of the full posterior distribution offers a more complete picture'' than a Bayes factor hypothesis test.* -- <br> ### On the 11 issues: > *In our opinion, many of the 'issues' listed by TK are a <span style='color: #DCA559;'>blessing</span> rather than a <span style='color: #DCA559;'>curse</span>.*<br> (vRW2021, pp. 2-3) --- background-image: url(data:image/png;base64,#Figures/pexels-cottonbro-4631066.png) background-size: cover ## vRW trying to come to the rescue<span style="font-size:70%"> <i>van Ravenzwaaij and Wagenmakers (2021)</i></span> ### Their take on our 11 issues: <center> <img src="data:image/png;base64,#Figures/vRandW2021.png" title="van Ravenzwaaij and Wagenmakers (2021)." alt="van Ravenzwaaij and Wagenmakers (2021)." width="70%" /> </center> .footnote[ <span class="myfoot"> Photo by [cottonbro](https://www.pexels.com/@cottonbro) from [Pexels](https://www.pexels.com/photo/photo-of-person-s-hand-with-words-4631066/).</span> ] <style type="text/css"> .stressed { position: relative; z-index: 1; } .stressed::before { content: ""; background-image: url(Figures/stressed.gif); background-size: cover; position: absolute; top: 0px; right: 0px; bottom: 0px; left: 0px; opacity: 0.1; z-index: -1; } </style> --- class: inverse, middle # Nope, I don't think so<br><span style="font-size:70%"><i>Tendeiro and Kiers (2021)</i></span> --- class: stressed ## So what did we make of it? .footnote[ <span class="myfoot">GIF by [1091](https://giphy.com/1091/) from [Giphy](https://giphy.com/gifs/orchfilms-f-the-prom-3ohhwphLpCpe7hF2Bq).</span> ] Me and Henk thought long and hard about the reply by vRW to our paper. > *Could it be that they are right?* -- <br><br> But, really, in our very *biased* opinion: - We offered completely unassailable arguments that were not refuted by vRW. - vRW have points of view with which we cannot concur *in the least*. --- background-image: url(data:image/png;base64,#Figures/kostiantyn-li-Fi_nhg5itCw-unsplash.png) background-size: cover ## Bits of what vRW had to say E.g., concerning the sensitivity of the BF to the priors, this is both ideal: > *The reason that <span style='color: #DCA559;'>Bayes factors are sensitive to within-model priors</span> is that Bayes factors evaluate models by the predictions they make, and predictions are determined partly by the prior.* (vRW2021, p. 6), -- something to be careful about: > *(...) <span style='color: #DCA559;'>sensitivity analysis</span>, that is, a comprehensive investigation of the extent to which the Bayes factor differs across alternative specification of the prior distribution.* (vRW2021, p. 13), -- something that may be avoided by means of relying on defaults: > *(...) <span style='color: #DCA559;'>default Bayes factors</span> allow efficient communication of evidence that many researchers may consider less sensitive to human bias than more subjective or 'informed' alternatives.* (vRW2021, p. 13), -- and it's even not that bad to close your eyes to it: > *In our opinion, a default Bayes factor, <span style='color: #DCA559;'>however blindly applied</span>, is usually preferable over no Bayes factor at all.* (vRW2021, p. 13) --- background-image: url(data:image/png;base64,#Figures/kostiantyn-li-Fi_nhg5itCw-unsplash.png) background-size: cover ## Bits of what vRW had to say ### Advising researchers to use BFs, even if only based on a superficial working knowledge: > *We believe that NHBT, <span style='color: #DCA559;'>even if executed as a thoughtless ritual</span>, still markedly improves on the status quo.* (vRW2021, p. 33) -- ### About the truth of a point null hypothesis: It most likely should be false: > *(...) it should be stressed that the Bayes factor is based on a comparison of predictive performance that is independent of the notion of absolute or relative model truth (Wagenmakers, Grünwald, & Steyvers, 2006). The idea that we should not use models that we know to be false can easily result in an inferential impasse, <span style='color: #DCA559;'>because all statistical models are ultimately false</span>.* (vRW2021, p. 26) -- but at the same time it is, purportedly, a common research goal: > *[Estimation] assumes the falsity of the null hypothesis, <span style='color: #DCA559;'>which is often the very target of inference</span>.* (vRW2021, p. 34) <style type="text/css"> .yeah { position: relative; z-index: 1; } .yeah::before { content: ""; background-image: url(Figures/yeah.gif); background-size: cover; position: absolute; top: 0px; right: 0px; bottom: 0px; left: 0px; opacity: 0.12; z-index: -1; } </style> --- class: yeah ## <i>This is us</i> <span style="font-size:70%"><i>(Tendeiro and Kiers, 2021)</i></span> So we still have something to add to the discussion: > *Tendeiro, J. N. and Kiers, H. A. L. (2021). On the white, the black, and the many shades of gray in between: Our Reply to van Ravenzwaaij and Wagenmakers (2021). Preprint: 10.31234/osf.io/tjxvz.* .footnote[ <span class="myfoot">GIF from [Giphy](https://giphy.com/gifs/excited-yes-nicolas-cage-RrVzUOXldFe8M).</span> ] --- background-image: url(data:image/png;base64,#Figures/pexels-caio-65547.png) background-size: cover ## Some give-aways (I) <span style="font-size:70%"><i>(Tendeiro and Kiers, 2021)</i></span> .footnote[ <span class="myfoot">Photo by [Caio](https://www.pexels.com/@caio) from [Pexels](https://www.pexels.com/photo/m-m-s-chocolates-in-bowl-65547/).</span> ] - Bayes factors aren't easy to understand. - Priors, as part of the model being compared, warrant careful attention. - Disentangle BFs from posterior model odds. - Relative model assessment `\(\not=\)` truthfulness of either model. - Conclusions derived from a Bayes factor are *always* relative, thus avoid absolute conclusions.<br> So, no such thing as absolute evidence for the null! --- background-image: url(data:image/png;base64,#Figures/pexels-caio-65547.png) background-size: cover .footnote[ <span class="myfoot">Photo by [Caio](https://www.pexels.com/@caio) from [Pexels](https://www.pexels.com/photo/m-m-s-chocolates-in-bowl-65547/).</span> ] ## Some give-aways (II) <span style="font-size:70%"><i>(Tendeiro and Kiers, 2021)</i></span> - There's no such thing as the `\(\not=0\)` alternative hypothesis.<br> There's a prior that goes with it! - Be aware of relying on qualitative labels for BFs. - Bayes factors can easily point at the wrong model (if you're into that kind of thing), especially for small `\(n\)` and ES. - Bayes factors, in its default shape, favor the point null model.<br> See our worked out example. --- background-image: url(data:image/png;base64,#Figures/pexels-caio-65547.png) background-size: cover .footnote[ <span class="myfoot">Photo by [Caio](https://www.pexels.com/@caio) from [Pexels](https://www.pexels.com/photo/m-m-s-chocolates-in-bowl-65547/).</span> ] ## Some give-aways (III) <span style="font-size:70%"><i>(Tendeiro and Kiers, 2021)</i></span> - No blind endorsing of Bayes factors. Please. -- - Point null hypothesis, really? At least think about it for a bit. -- - Bayes factors *cannot* be used to **establish** the existence of an effect! -- - We clearly reject the "first test, then who knows estimate" pseudo-inferential algorithm. -- - Only reporting Bayes factor is *really* poor practice.<br>Quantify effects sizes as much as you can! Estimate! <br> -- And remember... -- <center> <p style="font-size:150%; color:#EA5455"> The Bayes factor is nothing but a number, guys! </p> </center> --- background-image: url(data:image/png;base64,#Figures/pexels-pixabay-268953.png) background-size: cover ## References .footnote[ <span class="myfoot">Photo by [Pixabay](https://www.pexels.com/@pixabay) from [Pexels](https://www.pexels.com/photo/macro-outdoors-perspective-rocky-268953/).</span> ] Ravenzwaaij, D. van and E. Wagenmakers (2021). _Advantages Masquerading as `Issues' in Bayesian Hypothesis Testing: A Commentary on Tendeiro and Kiers (2019)_. Preprint. PsyArXiv. DOI: [10.31234/osf.io/nf7rp](https://doi.org/10.31234%2Fosf.io%2Fnf7rp). Tendeiro, J. N. and H. A. L. Kiers (2019). "A Review of Issues about Null Hypothesis Bayesian Testing." In: _Psychological Methods_ 24.6, pp. 774-795. DOI: [10.1037/met0000221](https://doi.org/10.1037%2Fmet0000221). Tendeiro, J. N. and H. A. L. Kiers (2021). _On the White, the Black, and the Many Shades of Gray in between: Our Reply to van Ravenzwaaij and Wagenmakers (2021)_. DOI: [10.31234/osf.io/tjxvz](https://doi.org/10.31234%2Fosf.io%2Ftjxvz).