Metrics and Analysis#
Base Interfaces#
Base class for LensKit metrics. |
|
Base class for metrics that measure individual recommendation (or prediction) lists, and whose results may be aggregated to compute overall metrics. |
|
Base class for metrics that measure entire runs at a time. |
|
Interface for per-list metrics implemented as simple functions. |
|
Base class for most ranking metrics, implementing a |
Bulk Analysis#
Compute metrics over a collection of item lists composing a run. |
|
Results of a bulk metric computation. |
Basic Statistics#
Report the length of the output (recommendation list or predictions). |
|
Report the number of test items. |
Top-N Accuracy#
Compute the normalized discounted cumulative gain [JarvelinKekalainen02]. |
|
Evaluate recommendations with rank-biased precision [MZ08]. |
|
Compute recommendation precision. |
|
Compute recommendation recall. |
|
Compute the reciprocal rank [KV97] of the first relevant item in a list of recommendations. |
List and Item Properties#
Compute the _obscurity_ (mean popularity rank) of the recommendations. |
Item Distributions#
Measure exposure distribution of recommendations with the Gini coefficient. |
|
Measure item diversity of recommendations with the Gini coefficient. |
Prediction Accuracy#
Rank Weights#
The rank weighting classes (RankWeight
and descendants) provide
flexible rank weights for use in evaluation metrics. The rank-weighted top-N
metrics (Top-N Accuracy) use these for weighting the recommendations.
Base class for rank weighting models. |
|
Geometric cascade weighting for result ranks. |
|
Logarithmic weighting for result ranks, as used in NDCG. |