StructuredArrays
is a Julia package which provides multi-dimensional arrays
behaving like regular arrays but whose elements have the same given value or are lazily
computed by applying a given function to their indices. The main advantage of such arrays
is that they are very light in terms of memory: their storage requirement is O(1)
whatever their size instead of O(n)
for an ordinary array of n
elements.
Note that StructuredArrays
has a different purpose than
StructArrays
which is designed for
arrays whose elements are struct
.
To install the latest stable release with Julia's package manager:
] add StructuredArrays
All elements of a uniform array have the same value. A uniform array thus require to only
store this value and the dimensions (or the axes) of the array. In addition, some
operations (e.g., minimum
, maximum
, extrema
, all
, any
, sum
, prod
, count
,
findmin
, findmax
, reverse
, or unique
) may be implemented so as to be very fast for
uniform arrays.
To build a uniform array, call:
A = UniformArray(val, inds...)
which yields an array A
behaving as a read-only array whose values are all val
and
whose shape is defined by inds...
. Each of inds...
defines an array axis and can be
an integer for a 1-based index or an integer-valued unit step range. It is thus possible
to have offset axes as in the
OffsetArrays
package.
Uniform arrays implement conventional linear indexing: A[i]
yields val
for all linear
indices i
in the range 1:length(A)
.
The statement A[i] = x
is however not implemented because uniform arrays are considered
as read-only. You may call MutableUniformArray(val,dims)
to create a mutable uniform
array. For example:
B = MutableUniformArray(val, inds...)
For B
, the statement B[i] = x
is allowed to change the value of all the elements of
B
provided index i
represents all possible indices in B
. Typically B[:] = x
or
B[1:end] = x
are accepted but not B[1] = x
, unless B
has a single element.
Apart from all values being the same, uniform arrays behave like ordinary Julia arrays.
When calling a uniform array constructor, the element type T
and the number of
dimensions N
may be specified. This is most useful for T
to enforce a given element
type. By default, T = typeof(val)
is assumed. For example:
A = UniformArray{T}(val, inds...)
B = MutableUniformArray{T,N}(val, inds...)
A fast uniform array is like an immutable uniform array but with the value of all elements being part of the signature so that this value is known at compile time. To build such an array, call one of:
A = FastUniformArray(val, inds...)
A = FastUniformArray{T}(val, inds...)
A = FastUniformArray{T,N}(val, inds...)
The values of the elements of a structured array are computed on the fly as a function of their indices. To build such an array, call:
A = StructuredArray(func, inds...)
which yields a read-only array A
whose value at index i
is computed as func(i)
and
whose shape is defined by inds...
. In other words, A[i]
yields func(i)
.
An optional leading argument S
may be used to specify another index style than the
default IndexCartesian
:
A = StructuredArray(S, func, inds...)
where S
may be a sub-type of IndexStyle
or an instance of such a sub-type. If S
is
IndexCartesian
(the default), the function func
will be called with N
integer
arguments, a Vararg{Int,N}
, N
being the number of dimensions. If S
is IndexLinear
,
the function func
will be called with a single integer argument, an Int
.
A structured array can be used to specify the location of structural non-zeros in a sparse
matrix. For instance, the structure of a lower triangular matrix of size m×n
could be
given by:
StructuredArray((i,j) -> (i ≥ j), m, n)
but with a constant small storage requirement whatever the size of the matrix.
Although the callable object func
may not be a pure function, its return type shall be
stable and structured arrays are considered as immutable in the sense that a statement
like A[i] = x
is not implemented. The type, say, T
of the elements of structured array
is inferred by applying func
to the unit index or may be explicitly specified:
StructuredArray{T}(S, func, dims)
where, if omitted, S = IndexCartesian
is assumed. The StructuredArray
constructor also
supports the number of dimensions N
and the indexing style S
as optional type
parameters. The two following examples are equivalent:
A = StructuredArray{T,N}(S, func, inds...)
A = StructuredArray{T,N,S}(func, inds...)
As implemented in StructuredArrays
, N
-dimensional Cartesian meshes have equally spaced
nodes with given step and origin. The mesh step is the spacing between
contiguous nodes, it may be different (in length and units) along the different dimensions
of the mesh. The mesh origin is the index of the node whose coordinates are all equal
to zero, this index has no units and may be fractional.
The coordinates of the node at Cartesian index i = (i1,i2,...)
are formally given by:
step .* i # if `origin` is `nothing`
step .* (i .- origin) # else
which yields a N
-tuple of coordinates. Thanks to broadcasting rules, each of step
and
origin
may be specified as a scalar to assume that this parameter is the same for all
dimensions, or as a N
-tuple otherwise. origin
may also be nothing
(the default), to
assume that the origin of the mesh is at index (0,0,...)
. In the implementation, the
exact formula used to compute the coordinates of the nodes is optimized for the different
possible cases. As a consequence, specifying origin
as 0
or as a N
-tuple of 0
s
yields a mesh with the same coordinates but computed with more overheads than with origin = nothing
.
The values of step
and origin
stored by a mesh object A
may be retrieved by calling
step(A)
or origin(A)
. To retrieve a N
-dimensional step or origin in all cases,
call step(Tuple,A)
or origin(Tuple,A)
instead.
To create a Cartesian mesh as a function, call:
mesh = CartesianMesh{N}(step, origin = nothing)
which yields a callable object such that mesh(i1,i2,...)
generates the coordinates of
the node at N
-dimensional index (i1,i2,...)
according to the above formula. If any of
step
or origin
is a N
-tuple, the parameter N
may be omitted. When calling the
object, the node indices may also be specified as a N
-tuple or as a CartesianIndex
.
The values of step
and origin
stored by a mesh are automatically converted at
construction time to reduce the number of operations when computing coordinates. This
optimization assumes that all indices are specified as Int
s but fractional indices (i.e.
reals) are also accepted.
A typical usage is to wrap a Cartesian mesh function in a StructuredArray
to build an
abstract array whose values are the coordinates of the nodes of a finite size Cartesian
mesh. For example:
A = StructuredArray(CartesianMesh{N}(step, origin), inds...)
with inds...
the shape (indices and/or dimensions) of the mesh. This can also be done
by calling:
A = CartesianMeshArray(inds...; step, origin=nothing)
Then the syntax A[i1,i2,...]
yields the coordinates of the node at index (i1,i2,...)
.
Node indices may also be specified as a tuple of Int
s or as a CartesianIndex
. The
coordinates are lazily computed and the storage is O(1)
.