Pointwise vs. Pairwise vs. Listwise Learning to Rank

Nikhil Dandekar
2 min readSep 29, 2016

--

At a high level, pointwise, pairwise and listwise approaches differ in how many documents you consider at a time in your loss function when training your model.

Pointwise approaches

Pointwise approaches look at a single document at a time in the loss function. They essentially take a single document and train a classifier / regressor on it to predict how relevant it is for the current query. The final ranking is achieved by simply sorting the result list by these document scores. For pointwise approaches, the score for each document is independent of the other documents that are in the result list for the query.

All the standard regression and classification algorithms can be directly used for pointwise learning to rank.

Pairwise approaches

Pairwise approaches look at a pair of documents at a time in the loss function. Given a pair of documents, they try and come up with the optimal ordering for that pair and compare it to the ground truth. The goal for the ranker is to minimize the number of inversions in ranking i.e. cases where the pair of results are in the wrong order relative to the ground truth.

Pairwise approaches work better in practice than pointwise approaches because predicting relative order is closer to the nature of ranking than predicting class label or relevance score. Some of the most popular Learning to Rank algorithms like RankNet, LambdaRank and LambdaMART [1] [2] are pairwise approaches.

Listwise approaches

Listwise approaches directly look at the entire list of documents and try to come up with the optimal ordering for it. There are 2 main sub-techniques for doing listwise Learning to Rank:

  1. Direct optimization of IR measures such as NDCG. E.g. SoftRank [3], AdaRank [4]
  2. Minimize a loss function that is defined based on understanding the unique properties of the kind of ranking you are trying to achieve. E.g. ListNet [5], ListMLE [6]

Listwise approaches can get fairly complex compared to pointwise or pairwise approaches.

Here are some good resources in case you want to learn more about this:

Footnotes

[1] From RankNet to LambdaRank to LambdaMART: An Overview — Microsoft Research

[2] What is the intuitive explanation of Learning to Rank and algorithms like RankNet, LambdaRank and LambdaMART?

[3] SoftRank: Optimising Non-Smooth Rank Metrics — Microsoft Research

[4] AdaRank on Letor

[5] Learning to Rank: From Pairwise Approach to Listwise Approach — Microsoft Research

[6] Position-Aware ListMLE: A Sequential Learning Process for Ranking

Originally published on Quora

--

--

Nikhil Dandekar

Engineering Manager doing Machine Learning @ Google. Previously worked on ML and search at Quora, Foursquare and Bing.