Forum—P Values and Model Selection

In defense of P values

Paul A. Murtaugh

Corresponding Author

E-mail address: murtaugh@science.oregonstate.edu

Department of Statistics, Oregon State University, Corvallis, Oregon 97331 USA

First published: March 2014
Cited by: 87

Corresponding Editor: A. M. Ellison. For reprints of this Forum, see footnote 1, p. 609.

Abstract

Statistical hypothesis testing has been widely criticized by ecologists in recent years. I review some of the more persistent criticisms of P values and argue that most stem from misunderstandings or incorrect interpretations, rather than from intrinsic shortcomings of the P value. I show that P values are intimately linked to confidence intervals and to differences in Akaike's information criterion (ΔAIC), two metrics that have been advocated as replacements for the P value. The choice of a threshold value of ΔAIC that breaks ties among competing models is as arbitrary as the choice of the probability of a Type I error in hypothesis testing, and several other criticisms of the P value apply equally to ΔAIC. Since P values, confidence intervals, and ΔAIC are based on the same statistical information, all have their places in modern statistical practice. The choice of which to use should be stylistic, dictated by details of the application rather than by dogmatic, a priori considerations.
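The link between P values and ΔAIC can be illustrated with a small numerical sketch. It is not taken from the article itself. It assumes two nested models that differ by exactly one parameter, and it uses the standard large-sample chi-square approximation for the likelihood-ratio statistic. The function name `p_value_from_delta_aic` is hypothetical, and ΔAIC is defined here as AIC(reduced model) minus AIC(full model).

```python
import math


def p_value_from_delta_aic(delta_aic: float) -> float:
    """Large-sample P value implied by a given delta AIC.

    Assumptions of this sketch: the two models are nested, they differ
    by exactly one parameter, and
    delta_aic = AIC(reduced model) - AIC(full model).
    """
    # AIC = -2 log L + 2 * (number of parameters), so with one extra
    # parameter the likelihood-ratio statistic is LRT = delta_aic + 2.
    lrt = delta_aic + 2.0
    if lrt < 0:
        raise ValueError("nested models cannot give delta_aic below -2")
    # Upper tail of a chi-square distribution with 1 degree of freedom:
    # Pr(chi2_1 > x) = erfc(sqrt(x / 2)).
    return math.erfc(math.sqrt(lrt / 2.0))


for delta in (0.0, 2.0, 4.0, 7.0):
    p = p_value_from_delta_aic(delta)
    print(f"delta AIC = {delta:4.1f}  ->  P = {p:.4f}")
```

Under these assumptions a ΔAIC of 0 corresponds to P ≈ 0.157 and a ΔAIC of 2 to P ≈ 0.046, so any ΔAIC cutoff is equivalent to some choice of significance level, and neither is less arbitrary than the other.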
