Bradley-Terry Model

Based on wikipedia. The Bradley-Terry model (1952) is a probability model that allows us to infer scores for individual objects based on a dataset of pairwise comparisons between them. Specifically, it estimates the probability that $i ▹ j$ (i.e. $i$ is preferred to $j$ ) as:

$P (i ▹ j) = \frac{p _{i}}{p _{i} + p _{j}}$

Where $p_{i}$ is a positive real-valued score assigned to object $i$ (not necessarily a probability). Typically, $p_{i}$ is parametrized as an exponential score $p_{i} = e^{β_{i}}$ , and the goal is to learn the parameters $β_{i}$ from pairwise comparisons. This results in:

$P (i ▹ j) = \frac{e ^{β_{i}}}{e ^{β_{i}} + e ^{β_{j}}}$

Parameter Estimation

Parameter estimation is typically done using maximum likelihood. Starting with a set of pairwise comparisons between individual objects, let $w_{ij}$ be the number of times object $i$ beats object $j$ . Then the likelihood of a given set of parameters $p := [p_{1}, ..., p_{n}]$ ( $n$ denotes number of objects) is as follows:

$L (p) = l n ij \prod P (i ▹ j)^{w_{ij}} = i = 1 \sum n j = 1 \sum n l n (\frac{p _{i}}{p _{i} + p _{j}})^{w_{ij}} = i = 1 \sum n j = 1 \sum n w_{ij} [l n p_{i} - l n (p_{i} + p_{j})]$

This likelihood function can then be minimized by differentiating wrt $p_{i}$ and solved by setting to zero.

BT Model as a Sigmoid Function

We can also express the likelihood as a function of the difference in scores $β_{i} - β_{j}$ . Recall that the sigmoid function is $σ (x) = 1/ (1 + e^{- x})$ . Then: $L (p) = i = 1 \sum n j = 1 \sum n w_{ij} \cdot l n [\frac{e ^{β_{i}}}{e ^{β_{i}} + e ^{β_{j}}}] = i = 1 \sum n j = 1 \sum n w_{ij} \cdot l n σ (β_{i} - β_{j})$

The derivation for the second line above is found at /Identities/sigmoid. This re-parametrization shows that the BT-model is really modelling the preference as a difference in scores and then running that through the sigmoid function to convert it into a probability. This means that we are basically running a pairwise logistic regression.

Keyboard shortcuts

Chux's Notebook

Bradley-Terry Model

Parameter Estimation

BT Model as a Sigmoid Function