Comparative Judgment, Pairwise Comparison, and Bradley–Terry Model

Using pairwise comparison data to assess abilities

Introduction

Comparative judgment involves using pairwise comparisons to access the abilities of the compared objects. This is widely applied in the real world, e.g., sports science, psychology, and education. For example, many sport games (e.g., basketball, football, chess) involve two players/teams competing against each other each time. The resulting data can be used in comparative judgment.

Bradley–Terry Model

The Bradley–Terry model (Bradley & Terry, 1952) is commonly applied in comparative judgment. It assumes a set P={1,,p}\mathcal{P}=\left\{1,\dots,p\right\} in which every component is a "player" compared with another by some judges, with αi/αj\alpha_i/\alpha_j being the probability of ii beating jj. In functional form, the model seeks to estimate

Logit(πij)=λiλj,\text{Logit}(\pi_{ij}) = \lambda_i-\lambda_j,

where λi=log(αi)\lambda_i=\log\left(\alpha_i\right), αi\alpha_i and λi\lambda_i are the ability score and log ability score of player ii in P\mathcal{P} respectively and πij\pi_{ij} is the probability of ii beats jj. However, in this context, none of the above variables are known in practice. By the nature of logistic regression, the total number of times that ii beats jj, YijY_{ij}, is assumed to follow the binomial distribution:

YijBinomial(nij,πij),Y_{ij}\sim\text{Binomial}\left(n_{ij},\pi_{ij}\right),

where nij=njin_{ij}=n_{ji} is the number of times that ii and jj are compared against each other. This unstructured model could be further extended to a structured version, in which

λi=q=1pβqxiq+εi,\lambda_i=\sum^p_{q=1}\beta_qx_{iq}+\varepsilon_i,

where xqx_q's are explanatory variables and εiN(0,σ2)\varepsilon_i\sim\mathcal{N}\left(0,\sigma^2\right) is the random error term. Both fixed and random effects could be used for estimation. The explanatory variables could be attributed to the players being compared (player-specific), to the judge (contest-specific), or to the specific comparison (contest-specific).

Bayesian Spatial Bradley–Terry Model

The Bayesian Spatial Bradley–Terry model is specific to spatial data, allowing the learning of abilities from nearby players via clustering (Rowland et al., 2022). This model is useful for assessing issues related to human right abuses, such as forced marriage, female genital mutilation, human trafficking, and online child sexual exploitation and abuse.

Model fitting

References

Bradley, R. A., & Terry, M. E. (1952). Rank analysis of incomplete block designs: I. The method of paired comparisons. Biometrika, 39(3/4), 324–345.

Firth, D. (2012). BradleyTerry2: Flexible models for paired comparisons in R. Journal of Statistical Software, 48(9), 1–21.

Rowland, S., & others. (2022). A Bayesian spatial Bradley–Terry model for paired comparison data. arXiv preprint.