Burkner_Vuorre_2019

Paul-Christian Bürkner and Matti Vuorre. Ordinal Regression Models in Psychology: A Tutorial. Advances in Methods and Practices in Psychological Science 2019 2:1, 77-101

Keywords: Ordinal variables, stats, methods

treating ordinal data as continuous/metric can lead to errors such as low rates of correct detection, distorted effect-size estimates, inflated false alarm (Type I error) rates, and inversions of differences between groups
errors occur because distances between values are not equidistant, distribution of responses is often non-normally distributed, and variances between groups can be unequal
ordinal regression models are recommended in place of other insufficient solutions such as averaging over data
Likert scales are appropriate for cumulative models -> where we know categories have ordering but we don’t know the distance between them or whether the distances between them vary across participants
- each response level is provided with a cutpoint and the model uses these thresholds/cutpoints instead of intercepts
Sequential models - if the response can be understood as being the result of a sequential process, such that a higher response category is possible only after all lower categories are achieved, the sequential model proposed by Tutz (1990) is usually appropriate
Adjacent Category Models - widely used in item response theory and are applied in many large-scale assessment studies. Chosen for its mathematical convenience rather than any quality of interpretation
unequal variances can be modeled with all three classes of ordinal models (brm(bf(Y ∼ X, disc ∼ X), …))
Category-specific effects (modelling the effects a predictor has on K coefficients) is unproblematic in sequential and adjacent-category but can cause problems in cumulative models
Ignoring the possibility of unequal variances can lead to problems such as inflated error rates and distorted effect sizes but incorporating unequal variances is easily done
family = cumulative('probit') changes error term to normally distributed instead of logistically distributed
Estimate column is posterior means of the parameters and Est.Error is posterior std devs -> analogous to Frequentist point estimates
Rhat should not be larger than 1.1 and effective sample is ideally > 1000
Threshold estimates are similar to intercepts - rarely interpreted
categorical explanatories:
- coefficients indicate the extent to which people within each category differ from the level that was eaten by the intercept
switch to adjacent category model (family = acat()) to see how different groups respond to questions differently
unequal variances
- add linear formula based on variances to account for unequal variances across groups
can use model comparison to assess which model fits data best: base, adjacent category, or unequal variances
multiple Likert items: the participants provide responses to multiple items. In such cases, one can fit a multilevel ordinal model that takes the items and participants into account, incorporating all information in the data into the model while controlling for dependencies between ratings from the same person and between ratings of the same item.
- rating ~ 1 + belief + (1|person) + (1|item)
To model threshold-specific variances, we would write (cs(1)|<group>)