lenskit.data.MatrixRelationshipSet#

class lenskit.data.MatrixRelationshipSet(name, vocabularies, schema, table)#

Bases: RelationshipSet

Two-entity relationships without duplicates, accessible in matrix form.

Note

Client code does not need to construct this class; obtain instances from a relationship set’s matrix() method.

Parameters:

name (str)
vocabularies (dict[str, Vocabulary])
schema (RelationshipSchema)
table (pa.Table)

__init__(name, vocabularies, schema, table)#

Parameters:

name (str)
vocabularies (dict[str, Vocabulary])
schema (RelationshipSchema)
table (Table)

Methods

`__init__`(name, vocabularies, schema, table)
`arrow`(*[, attributes, ids])	Get these relationships and their attributes as a PyArrow table.
`col_stats`()
`coo_structure`()	Get the compressed sparse row structure of this relationship matrix.
`count`()
`csr_structure`()	Get the compressed sparse row structure of this relationship matrix.
`matrix`(*[, combine])	Convert this relationship set into a matrix, coalescing duplicate observations.
`pandas`(*[, attributes, ids])	Get these relationship and their attributes as a PyArrow table.
`row_items`([id, number])	Get a single row of this interaction matrix as an item list. Only valid when the column entity class is ``item''.
`row_stats`()
`row_table`([id, number])	Get a single row of this interaction matrix as a table.
`sample_negatives`(rows, *[, weighting, n, ...])	Sample negative columns (columns with no observation recorded) for an array of rows.
`scipy`([attribute, layout, legacy])	Get this relationship matrix as a SciPy sparse matrix.
`to_ilc`()	Get the rows as an item list collection.
`torch`([attribute, layout])	Get this relationship matrix as a PyTorch sparse tensor.

Attributes

`attribute_names`
`col_vocabulary`	The vocabulary for column entities.
`entities`
`is_interaction`	Query whether these relationships represent interactions.
`n_cols`
`n_rows`
`row_vocabulary`	The vocabulary for row entities.
`row_type`
`col_type`
`name`	The name of the relationship class for these relationships.
`schema`

property row_vocabulary#: The vocabulary for row entities.

property col_vocabulary#: The vocabulary for column entities.

matrix(*, combine=None)#

Convert this relationship set into a matrix, coalescing duplicate observations.

Parameters:

row_entity – The specified row entity of the matrix
col_entity – The specified column entity of the matrix
combine (Literal['count', 'sum', 'mean', 'first', 'last'] | dict[str, ~typing.Literal['count', 'sum', 'mean', 'first', 'last']] | None)

Return type:

MatrixRelationshipSet

csr_structure()#

Get the compressed sparse row structure of this relationship matrix.

Return type:: CSRStructure

coo_structure()#

Get the compressed sparse row structure of this relationship matrix.

Return type:: COOStructure

scipy(attribute=None, *, layout='csr', legacy=False)#

Get this relationship matrix as a SciPy sparse matrix.

Parameters:

attribute (str | None) – The attribute to return, or None to return an indicator-only sparse matrix (all observed values are 1).
layout (Literal['csr', 'coo']) – The matrix layout to return.
legacy (bool)

Returns:

The sparse matrix.

Return type:

sparray | spmatrix

torch(attribute=None, *, layout='csr')#

Get this relationship matrix as a PyTorch sparse tensor.

Parameters:

attribute (str | None) – The attribute to return, or None to return an indicator-only sparse matrix (all observed values are 1).
layout (Literal['csr', 'coo']) – The matrix layout to return.

Returns:

The sparse matrix.

Return type:

Tensor

sample_negatives(rows, *, weighting='uniform', n=None, verify=True, max_attempts=10, rng=None)#

Sample negative columns (columns with no observation recorded) for an array of rows. On a normal interaction matrix, this samples negative items for users.

Parameters:

rows (ndarray[tuple[int], dtype[int32]]) – The row numbers. Duplicates are allowed, and negative columns are sampled independently for each row. Must be a 1D array or tensor.
weighting (Literal['uniform', 'popular', 'popularity']) – The weighting for sampled negatives; uniform samples them uniformly at random, while popularity samples them proportional to their popularity (number of occurrences).
n (int | None) – The number of negatives to sample for each user. If None, a single-dimensional vector is returned.
verify (bool) – Whether to verify that the negative items are actually negative. Unverified sampling is much faster but can return false negatives.
max_attempts (int) – When verification is on, the maximum attempts before giving up and returning a possible false negative.
rng (Generator | None) – A random number generator to use.

Return type:

ndarray[tuple[Any, …], dtype[int32]]

row_table(id=None, *, number=None)#

Get a single row of this interaction matrix as a table.

Parameters:

id (int | str | bytes | integer[Any] | str_ | bytes_ | object_ | None)
number (int | None)

Return type:

Table | None

row_items(id=None, *, number=None)#

Get a single row of this interaction matrix as an item list. Only valid when the column entity class is ``item’’.

Parameters:

id (int | str | bytes | integer[Any] | str_ | bytes_ | object_ | None)
number (int | None)

Return type:

ItemList | None

to_ilc()#

Get the rows as an item list collection.

Return type:: ItemListCollection

lenskit.data.MatrixRelationshipSet#

This Page