API Documentation#
This documentation covers PyTensor module by module. It is suited to finding the Types and Ops that you can use to build and compile expression graphs.
Modules#
- compile – Transforming Expression Graphs to Functions
- config – PyTensor Configuration
- d3viz – d3viz: Interactive visualization of PyTensor compute graphs
- graph – PyTensor Graph Internals
- gradient – Symbolic Differentiation
- misc.pkl_utils – Tools for serialization
- printing – Graph Printing and Symbolic Print Statement
- scalar – Symbolic Scalar Types, Ops [doc TODO]
- scan – Looping in PyTensor
- sparse – Symbolic Sparse Matrices
- sparse – Sparse Op
- sparse.sandbox – Sparse Op Sandbox
- tensor – Tensor operations in PyTensor
- typed_list – Typed List
There are also some top-level imports that you might find more convenient:
Graph#
- pytensor.shared(...)[source]#
Alias for
pytensor.compile.sharedvalue.shared()
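For illustration, a minimal sketch of creating and updating a shared variable (the names are illustrative):

    import numpy as np
    import pytensor

    # A shared variable wraps state that persists across compiled-function calls.
    state = pytensor.shared(np.zeros(2), name="state")
    print(state.get_value())               # [0. 0.]
    state.set_value(np.array([1.0, 2.0]))
    print(state.get_value())               # [1. 2.]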
- pytensor.function(...)[source]#
Alias for
pytensor.compile.function.function()
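A minimal sketch of compiling an expression graph into a callable (the variable names are illustrative):

    import pytensor
    import pytensor.tensor as pt

    x = pt.dscalar("x")
    y = pt.dscalar("y")

    # Compile the symbolic expression x + y into a callable Python function.
    f = pytensor.function([x, y], x + y)
    print(f(2.0, 3.0))  # 5.0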
- pytensor.clone_replace(...)[source]#
Clone a graph and replace subgraphs within it.
It returns a copy of the initial subgraph with the corresponding substitutions.
- Parameters:
output – PyTensor expression that represents the computational graph.
replace – Dictionary describing which subgraphs should be replaced by what.
rebuild_kwds – Keywords to rebuild_collect_shared.
Alias for
pytensor.graph.basic.clone_replace()
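A minimal sketch of substituting one input for another in an existing graph (the names are illustrative):

    import pytensor
    import pytensor.tensor as pt

    x = pt.vector("x")
    y = pt.vector("y")
    out = (x + 1) ** 2

    # Build a copy of the graph in which x has been replaced by y;
    # the original graph `out` is left untouched.
    out_on_y = pytensor.clone_replace(out, replace={x: y})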
Control flow#
- pytensor.scan(...)[source]#
This function constructs and applies a Scan Op to the provided arguments.
- Parameters:
- fn –
fn is a function that describes the operations involved in one step of scan. fn should construct variables describing the output of one iteration step. It should expect as input Variables representing all the slices of the input sequences and previous values of the outputs, as well as all other arguments given to scan as non_sequences. The order in which scan passes these variables to fn is the following:

- all time slices of the first sequence
- all time slices of the second sequence
- …
- all time slices of the last sequence
- all past slices of the first output
- all past slices of the second output
- …
- all past slices of the last output
- all other arguments (the list given as non_sequences to scan)

The order of the sequences is the same as the one in the list sequences given to scan. The order of the outputs is the same as the order of outputs_info. For any sequence or output, the order of the time slices is the same as the one in which they have been given as taps. For example, if one writes the following:

    scan(
        fn,
        sequences=[
            dict(input=Sequence1, taps=[-3, 2, -1]),
            Sequence2,
            dict(input=Sequence3, taps=3),
        ],
        outputs_info=[
            dict(initial=Output1, taps=[-3, -5]),
            dict(initial=Output2, taps=None),
            Output3,
        ],
        non_sequences=[Argument1, Argument2],
    )
fn should expect the following arguments in this given order:

- sequence1[t-3]
- sequence1[t+2]
- sequence1[t-1]
- sequence2[t]
- sequence3[t+3]
- output1[t-3]
- output1[t-5]
- output3[t-1]
- argument1
- argument2

The list of non_sequences can also contain shared variables used in the function, though scan is able to figure those out on its own, so they can be skipped. For clarity of the code, we nonetheless recommend providing them to scan. To some extent, scan can also figure out other non-sequences (not shared) even if they are not passed to scan (but used by fn). A simple example of this would be:

    import pytensor.tensor as pt

    W = pt.matrix()
    W_2 = W**2

    def f(x):
        return pt.dot(x, W_2)
The function fn is expected to return two things. One is a list of outputs ordered in the same order as outputs_info, with the difference that there should be only one output variable per output initial state (even if no tap value is used). Secondly, fn should return an update dictionary that tells how to update any shared variable after each iteration step. The dictionary can optionally be given as a list of tuples. There is no constraint on the order of these two; fn can return either (outputs_list, update_dictionary) or (update_dictionary, outputs_list), or just one of the two (in case the other is empty).

To use scan as a while loop, the user needs to change the function fn such that it also returns a stopping condition. To do so, one needs to wrap the condition in an until class. The condition should be returned as a third element, for example:

    ...
    return [y1_t, y2_t], {x: x + 1}, until(x < 50)

Note that a number of steps (considered here as the maximum number of steps) is still required even though a condition is passed; it is used to allocate memory if needed. A complete sketch of this pattern is shown at the end of this entry.
- sequences –
sequences is the list of Variables or dicts describing the sequences scan has to iterate over. If a sequence is given wrapped in a dict, then a set of optional information can be provided about the sequence. The dict should have the following keys:

- input (mandatory) – Variable representing the sequence.
- taps – Temporal taps of the sequence required by fn. They are provided as a list of integers, where a value k implies that at iteration step t scan will pass to fn the slice t+k. The default value is [0].

All Variables in the list sequences are automatically wrapped into a dict where taps is set to [0].
- outputs_info –
outputs_info is the list of Variables or dicts describing the initial state of the outputs computed recurrently. When the initial states are given as dicts, optional information can be provided about the output corresponding to those initial states. The dict should have the following keys:

- initial – A Variable that represents the initial state of a given output. In case the output is not computed recursively (e.g. a map-like function) and does not require an initial state, this field can be skipped. Given that only the previous time step of the output is used by fn, the initial state should have the same shape as the output and should not involve a downcast of the data type of the output. If multiple time taps are used, the initial state should have one extra dimension that covers all the possible taps. For example, if we use -5, -2 and -1 as past taps, at step 0, fn will require (by an abuse of notation) output[-5], output[-2] and output[-1]. These will be given by the initial state, which in this case should have the shape (5,) + output.shape. If the Variable containing the initial state is called init_y, then init_y[0] corresponds to output[-5], init_y[1] corresponds to output[-4], init_y[2] corresponds to output[-3], init_y[3] corresponds to output[-2], and init_y[4] corresponds to output[-1]. While this order might seem strange, it comes naturally from splitting an array at a given point: assume that we have an array x, and we choose k to be time step 0. Then our initial state would be x[:k], while the output will be x[k:]. Looking at this split, elements in x[:k] are ordered exactly like those in init_y.
- taps – Temporal taps of the output that will be passed to fn. They are provided as a list of negative integers, where a value k implies that at iteration step t scan will pass to fn the slice t+k.

scan will follow this logic if partial information is given:

- If an output is not wrapped in a dict, scan will wrap it in one, assuming that you use only the last step of the output (i.e. it makes your tap value list equal to [-1]).
- If you wrap an output in a dict and you do not provide any taps but you do provide an initial state, it will assume that you are using only a tap value of -1.
- If you wrap an output in a dict but you do not provide any initial state, it assumes that you are not using any form of taps.
- If you provide None instead of a Variable or an empty dict, scan assumes that you will not use any taps for this output (as in the case of a map, for example).

If outputs_info is an empty list or None, scan assumes that no tap is used for any of the outputs. If information is provided just for a subset of the outputs, an exception is raised, because there is no convention on how scan should map the provided information to the outputs of fn.

- non_sequences –
non_sequences is the list of arguments that are passed to fn at each step. One can choose to exclude variables used in fn from this list, as long as they are part of the computational graph, although, for clarity, this is not encouraged.

- n_steps –
n_steps is the number of steps to iterate, given as an int or a scalar Variable. If any of the input sequences do not have enough elements, scan will raise an error. If the value is 0, the outputs will have 0 rows. If n_steps is not provided, scan will figure out the number of steps it should run given its input sequences. n_steps < 0 is no longer supported.

- truncate_gradient –
truncate_gradient is the number of steps to use in truncated back-propagation through time (BPTT). If you compute gradients through a Scan Op, they are computed using BPTT. By providing a value other than -1, you choose to use truncated BPTT instead of classical BPTT, where you go back only truncate_gradient steps in time.

- go_backwards –
go_backwards is a flag indicating whether scan should go backwards through the sequences. If you think of each sequence as indexed by time, making this flag True would mean that scan goes back in time, namely that for any sequence it starts from the end and goes towards 0.

- name –
When profiling scan, it is helpful to provide a name for any instance of scan. For example, the profiler will produce an overall profile of your code as well as profiles for the computation of one step of each instance of Scan. The name of the instance appears in those profiles and can greatly help to disambiguate information.

- mode –
The mode used to compile the inner graph. If you prefer the computations of one step of scan to be done differently than the entire function, you can use this parameter to describe how the computations in this loop are done (see pytensor.function for details about possible values and their meaning).

- profile –
If True or a non-empty string, a profile object will be created and attached to the inner graph of Scan. When profile is True, the profiler results will use the name of the Scan instance; otherwise they will use the passed string. The profiler only collects and prints information when running the inner graph with the CVM Linker.

- allow_gc –
Set the value of allow_gc for the internal graph of the Scan. If set to None, this will use the value of pytensor.config.scan__allow_gc.

The full Scan behavior related to allocation is determined by this value and the flag pytensor.config.allow_gc. If the flag allow_gc is True (default) and this allow_gc is False (default), then we let Scan allocate all intermediate memory on the first iteration, and it is not garbage collected after that first iteration; this is determined by allow_gc. This can speed up allocation for the subsequent iterations. All those temporary allocations are freed at the end of all iterations; this is what the flag pytensor.config.allow_gc means.

- strict –
If True, all the shared variables used in fn must be provided as part of non_sequences or sequences.

- return_list –
If True, will always return a list, even if there is only one output.
- Returns:
A tuple of the form (outputs, updates). outputs is either a Variable or a list of Variables representing the outputs, in the same order as in outputs_info. updates is a subclass of dict specifying the update rules for all shared variables used in Scan. This dict should be passed to pytensor.function when you compile your function.
- Return type:
tuple
Alias for
pytensor.scan.basic.scan()
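As a brief illustration, here is a minimal sketch of scan computing A**k by repeated elementwise multiplication; the variable names are illustrative:

    import pytensor
    import pytensor.tensor as pt

    k = pt.iscalar("k")
    A = pt.vector("A")

    # One step of the loop: multiply the running result by A.
    result, updates = pytensor.scan(
        fn=lambda prior_result, A: prior_result * A,
        outputs_info=pt.ones_like(A),  # initial state: a vector of ones
        non_sequences=A,
        n_steps=k,
    )

    # scan returns all intermediate results; keep only the last one.
    power = pytensor.function([A, k], result[-1], updates=updates)
    print(power([1.0, 2.0, 3.0], 3))  # [ 1.  8. 27.]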
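And here is a sketch of the while-loop form described under fn above. The import path for until is an assumption (it is expected under pytensor.scan.utils), and the names are illustrative:

    import pytensor
    import pytensor.tensor as pt
    from pytensor.scan.utils import until  # assumed location of `until`

    max_value = pt.scalar("max_value")

    def power_of_2(previous_power, max_value):
        next_power = previous_power * 2
        # Returning an `until` object as the last element stops the loop
        # once the condition becomes true.
        return next_power, until(next_power > max_value)

    values, updates = pytensor.scan(
        power_of_2,
        outputs_info=pt.constant(1.0),
        non_sequences=max_value,
        n_steps=1024,  # still required: the maximum number of steps
    )

    f = pytensor.function([max_value], values, updates=updates)
    print(f(100))  # [  2.   4.   8.  16.  32.  64. 128.]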
Convert to Variable#
- pytensor.as_symbolic(...)[source]#
Convert x into an equivalent PyTensor Variable.
- Parameters:
x – The object to be converted into a Variable type. A numpy.ndarray argument will not be copied, but a list of numbers will be copied to make a numpy.ndarray.
name – If a new Variable instance is created, it will be named with this string.
kwargs – Options passed to the appropriate sub-dispatch functions. For example, ndim and dtype can be passed when x is a numpy.ndarray or Number type.
- Raises:
TypeError – If x cannot be converted to a Variable.
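A minimal sketch of the conversion behavior described above (the names are illustrative):

    import numpy as np
    import pytensor

    # A list of numbers is copied into a new ndarray-backed Variable;
    # an existing ndarray is wrapped without copying.
    v = pytensor.as_symbolic([1.0, 2.0, 3.0], name="v")
    m = pytensor.as_symbolic(np.eye(2))
    print(v.type, m.type)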
Debug#
- pytensor.dprint(...)[source]#
Print a graph as text.
Each line printed represents a Variable in a graph. The indentation of lines corresponds to its depth in the symbolic graph. The first part of the text identifies whether it is an input or the output of some Apply node. The second part of the text is an identifier of the Variable.

If a Variable is encountered multiple times in the depth-first search, it is only printed recursively the first time. Later, just the Variable identifier is printed.

If an Apply node has multiple outputs, then a .N suffix will be appended to the Apply node’s identifier, indicating to which output a line corresponds.

- Parameters:
graph_like – The object(s) to be printed.
depth – Print the graph to this depth (-1 for unlimited).
print_type – If True, print the Types of each Variable in the graph.
file – When file extends TextIO, print to it; when file is equal to "str", return a string; when file is None, print to sys.stdout.
id_type – Determines the type of identifier used for Variables:
- "id": print the Python id value,
- "int": print an integer character,
- "CHAR": print a capital character,
- "auto": print the Variable.auto_name values,
- "": don't print an identifier.
stop_on_name – When True, if a node in the graph has a name, we don't print anything below it.
done – A dict where we store the ids of printed nodes. Useful to have multiple calls to debugprint share the same ids.
print_storage – If True, print the storage map for PyTensor functions. When combined with allow_gc=False, after the execution of a PyTensor function, the output will show the intermediate results.
used_ids – A map between nodes and their printed ids.
print_op_info – Print extra information provided by the relevant Ops. For example, print the tap information for Scan inputs and outputs.
print_destroy_map – Whether to print the destroy_maps of printed objects.
print_view_map – Whether to print the view_maps of printed objects.
print_fgraph_inputs – Print the inputs of FunctionGraphs.
- Return type:
A string representing the printed graph, if file is a string, else file.
Alias for
pytensor.printing.debugprint()
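A minimal sketch of printing a small graph (the names are illustrative):

    import pytensor
    import pytensor.tensor as pt

    x = pt.dvector("x")
    y = pt.exp(x).sum()

    # Print the computation graph of y to stdout; print_type annotates
    # each Variable with its Type.
    pytensor.dprint(y, print_type=True)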