lenskit.logging#

Logging, progress, and resource records.

class lenskit.logging.LoggingConfig#

Bases: object

Configuration for LensKit logging.

This class is intended as a convenience for LensKit applications to set up a useful logging and progress reporting configuration; if unconfigured, LensKit will emit its logging messages directly to structlog and/or logging, which you can configure in any way you wish.

Stability:: Caller (see Stability Levels).

apply()#: Apply the configuration.

set_log_file(path, level=None, format='json')#

Configure a log file.

Parameters:

path (PathLike[str])
level (int | None)
format (Literal['json', 'logfmt', 'text'])

set_stream_mode(mode)#

Configure the standard error stream mode.

Parameters:: mode (Literal['full', 'simple', 'json'])

set_verbose(verbose=True)#

Enable verbose logging.

Note

It is better to only call this method if your application’s verbose option is provided, rather than passing your verbose option to it, to allow the LK_LOG_LEVEL environment variable to apply in the absence of a configuration option.

Parameters:: verbose (bool | int) – The level of verbosity. Values of True or 1 turn on DEBUG-level logs, and 2 or greater turns on tracing.

lenskit.logging.basic_logging(level=20)#

Simple one-function logging configuration for simple command lines.

Stability:: Caller (see Stability Levels).
Parameters:: level (int)

lenskit.logging.notebook_logging(level=20)#

Simple one-function logging configuration for notebooks and similar.

Stability:: Caller (see Stability Levels).
Parameters:: level (int)

class lenskit.logging.Progress(*args, uuid=None, **kwargs)#

Bases: object

Base class for progress reporting. The default implementations do nothing.

Parameters:

args (Any)
uuid (UUID)
kwargs (Any)

finish()#: Finish and clean up this progress bar. If the progresss bar is used as a context manager, this is automatically called on context exit.

update(advance=1, completed=None, total=None, **kwargs)#

Update the progress bar.

Parameters:

advance (int) – The amount to advance by.
completed (int | None) – The number completed; this overrides advance.
total (int | None) – A new total, to update the progress bar total.
kwargs (float | int | str)

lenskit.logging.item_progress(label, total=None, fields=None)#

Create a progress bar for distinct, counted items.

Parameters:

label (str) – The progress bar label.
total (int | None) – The total number of items.
fields (dict[str, str | None] | None) – Additional fields to report with the progress bar (such as a current loss). These are specified as a dictionary mapping field names to format strings (the pieces inside {...} in str.format()), and the values come from extra kwargs to Progress.update(); mapping to None use default str formatting.

Return type:

Progress

lenskit.logging.set_progress_impl(name, *options)#

Set the progress bar implementation.

Parameters:

name (str | ProgressBackend | None)
options (Any)

Return type:

None

class lenskit.logging.Task(label, *, file=None, parent=None, reset_hwm=None, task_id=<factory>, parent_id=None, subprocess=False, hostname=<factory>, machine=<factory>, status=TaskStatus.PENDING, start_time=None, finish_time=None, duration=None, cpu_time=None, peak_memory=None, peak_gpu_memory=None, system_power=None, cpu_power=None, gpu_power=None, subtasks=<factory>, **data)#

Bases: BaseModel

A task for logging and resource measurement.

A task may be top-level (have no parent), or it may be a subtask. By default, new tasks have the current active task as their parent. Tasks are not active until they are started (using a task as a context manager automatically does this, which is the recommended process).

The task-tracking mechanism is currently designed to support large tasks, like training a model or running batch inference; it is not yet designed for fine-grained spans like you would see in OpenTelemetry or Eliot.

Note

The notion of the “active task” does not yet support multi-threaded tasks.

Stability:

Caller (see Stability Levels).

Parameters:

file (PathLike[str] | None) – A file to save the task when it is finished.
parent (Task | UUID | None) – The parent task. If unspecified, uses the currently-active task.
reset_hwm (bool | None) – Whether to reset the system resource high-water-marks at the start of this task. Only effective on Linux, but allows for measurement of the peak memory use of this task specifically. If unspecified, it resets the HWM if there is no parent.
label (str)
task_id (UUID)
parent_id (UUID | None)
subprocess (bool)
hostname (str)
machine (str | None)
status (TaskStatus)
start_time (float | None)
finish_time (float | None)
duration (float | None)
cpu_time (float | None)
peak_memory (int | None)
peak_gpu_memory (int | None)
system_power (float | None)
cpu_power (float | None)
gpu_power (float | None)
subtasks (Annotated[list[Annotated[Task, SerializeAsAny()]], BeforeValidator(func=~lenskit.logging.tasks._dict_extract_values, json_schema_input_type=PydanticUndefined)])
data (Any)

add_subtask(task)#

Add or update a subtask.

Parameters:: task (Task)

static current()#

Get the currently-active task.

Return type:: Task | None

finish(status=TaskStatus.FINISHED)#

Finish the task.

Parameters:: status (TaskStatus)

model_config: ClassVar[ConfigDict] = {'extra': 'allow'}#: Configuration for the model, should be a dictionary conforming to [ConfigDict][pydantic.config.ConfigDict].

model_post_init(context, /)#

This function is meant to behave like a BaseModel method to initialise private attributes.

It takes context as an argument since that’s what pydantic-core passes when calling it.

Parameters:

self (BaseModel) – The BaseModel instance.
context (Any) – The context.

Return type:

None

monitor_refresh()#: Refresh method called by the monitor backend.

static root()#

Get the root task.

Return type:: Task | None

save_to_file(path, monitor=True)#

Save this task to a file, and re-save it when finished.

Parameters:

path (PathLike[str])
monitor (bool)

start()#: Start the task.

total_cpu()#

Compute the total CPU time (including subprocesses).

Return type:: float | None

update()#: Update the task’s resource measurements and save the file (if one is set).

update_resources()#

Update the resource measurements. Returns the current measurement.

This method is called by update(), with an exclusive lock held.

Return type:: ResourceMeasurement

task_id: UUID#: The task ID.

parent_id: UUID | None#: The parent task ID.

subprocess: bool#: Whether this task is a subprocess of its parent. Subprocess task CPU times are not included in the parent task times.

label: str#: Human-readable task label.

hostname: str#: The hostname on which the task was run.

machine: str | None#

The machine on which the task was run.

Machines are project-meaningful labels for compute machines or clusters, primarily for understanding resource consumption. See settings.

status: TaskStatus#: The task’s current status.

start_time: float | None#: The task start time (UNIX timestamp).

finish_time: float | None#: The task completion time (UNIX timestamp).

duration: float | None#: Task duration in seconds. Measured using time.perf_counter(), so it may disagree slightly with the difference in start and finish times.

cpu_time: float | None#: CPU time consumed in seconds.

peak_memory: int | None#: Peak memory usage (max RSS) in bytes. Only available on Unix; individual task peak memory use is only reliable on Linux (MacOS will report the max memory used since the process was started).

peak_gpu_memory: int | None#: Peak PyTorch GPU memory usage in bytes.

system_power: float | None#: Estimated total system power consumption (in Joules).

cpu_power: float | None#: Estimated CPU power consumption (in Joules).

gpu_power: float | None#: Estimated GPU power consumption (in Joules).

subtasks: Annotated[list[SerializeAsAny[Task]], BeforeValidator(_dict_extract_values)]#: This task’s subtasks.

lenskit.logging.get_logger(name, *, remove_private=True, **init_als)#

Get a logger. This works like structlog.stdlib.get_logger(), except the returned proxy logger is quiet (only WARN and higher messages) if structlog has not been configured. LensKit code should use this instead of obtaining loggers from Structlog directly.

It also suppresses private module name components of the logger name, so e.g. lenskit.pipeline._impl becomes ``lenskit.pipeline`.

Params:

name:: The logger name.
remove_private:: Set to False to keep private module components of the logger name instead of removing them.
init_vals:: Initial values to bind into the logger when crated.

Returns:

A lazy proxy logger. The returned logger is type-compatible with structlib.stdlib.BoundLogger, but is actually an instance of an internal proxy that provies more sensible defaults and handles LensKit’s TRACE-level logging support.

Parameters:

name (str)
remove_private (bool)
init_als (Any)

Return type:

BoundLogger

lenskit.logging.get_tracer(logger, **initial_values)#

Get a tracer for efficient low-level tracing of computations.

Stability:

Experimental

Parameters:

logger (str | BoundLogger)
initial_values (Any)

lenskit.logging.trace(logger, *args, **kwargs)#

Emit a trace-level message, if LensKit tracing is enabled. Trace-level messages are more fine-grained than debug-level messages, and you usually don’t want them.

This function does not work on the lazy proxies returned by get_logger() and similar — it only works on bound loggers.

Stability:

Caller (see Stability Levels).

Parameters:

logger (BoundLogger)
args (Any)
kwargs (Any)

class lenskit.logging.Tracer(logger)#

Bases: object

Logger-like thing that is only for TRACE-level events.

This class is designed to support structured tracing without the overhead of creating and binding new loggers. It is also imperative, rather than functional, so we create fewer objects and so it is a little more ergonomic for common tracing flows.

Note

Don’t create instances of this class directly — use get_tracer() to create a tracer.

Stability:: Experimental
Parameters:: logger (BoundLogger)

add_bindings(**new_values)#

Bind new data in the keys.

Note

Unlike structlog.Logger.bind(), this method is **imperative*: it updates the tracer in-place instead of returning a new tracer. If you need a new, disconnected tracer, use split().

Parameters:: new_values (Any)
Return type:: None

remove_bindings(*keys)#

Unbind keys in the tracer.

Note

Unlike structlog.Logger.bind(), this method is **imperative*: it updates the tracer in-place instead of returning a new tracer. If you need a new, disconnected tracer, use split().

Parameters:: keys (str)
Return type:: None

reset()#

Reset this tracer’s underlying logger to the original logger.

Return type:: None

trace(event, *args, **bindings)#

Emit a TRACE-level event.

Parameters:

event (str)
args (Any)
bindings (Any)

Return type:

None

lenskit.logging.stdout_console()#: Get a console attached to stdout.

lenskit.logging.friendly_duration(elapsed)#

Short, human-friendly representation of a duration.

Parameters:: elapsed (float | timedelta)

class lenskit.logging.Stopwatch(start=True)#

Bases: object

Timer class for recording elapsed wall time in operations.

elapsed(*, accumulated=True)#

Get the elapsed time.

Return type:: float

measure(accumulate=False)#

Context manager to measure an item, optionally accumulating its time.

Parameters:: accumulate (bool)

Modules

`config`	Logging pipeline configuration.
`formats`	Formatting utilities for log and diagnostic output.
`monitor`	Old home of the LensKit logging monitor.
`multiprocess`
`processors`	LensKit logging processors and converters.
`progress`
`resource`	Measure resource consumption.
`stopwatch`	Timing support
`tasks`	Abstraction for recording tasks.
`tracing`	Extended logger providing TRACE support.
`worker`	Old home of the LensKit logging worker logic.

lenskit.logging#

This Page