The offset filtration (also called the "union-of-balls"[1] or "union-of-disks"[2] filtration) is a growing sequence of metric balls used to detect the size and scale of topological features of a data set.
The offset filtration commonly arises in persistent homology and the field of topological data analysis.
Utilizing a union of balls to approximate the shape of geometric objects was first suggested by Frosini in 1992 in the context of submanifolds of Euclidean space.
[3] The construction was independently explored by Robins in 1998, and expanded to considering the collection of offsets indexed over a series of increasing scale parameters (i.e., a growing sequence of balls), in order to observe the stability of topological features with respect to attractors.
[4] Homological persistence as introduced in these papers by Frosini and Robins was subsequently formalized by Edelsbrunner et al. in their seminal 2002 paper Topological Persistence and Simplification.
[5] Since then, the offset filtration has become a primary example in the study of computational topology and data analysis.
be a finite set in a metric space
be the closed ball of radius
is a family of nested topological spaces indexed over
[6] Note that it is also possible to view the offset filtration as a functor
from the poset category of non-negative real numbers to the category of topological spaces and continuous maps.
[7][8] There are some advantages to the categorical viewpoint, as explored by Bubenik and others.
[9] A standard application of the nerve theorem shows that the union of balls has the same homotopy type as its nerve, since closed balls are convex and the intersection of convex sets is convex.
[10] The nerve of the union of balls is also known as the Čech complex,[11] which is a subcomplex of the Vietoris-Rips complex.
[12] Therefore the offset filtration is weakly equivalent to the Čech filtration (defined as the nerve of each offset across all scale parameters), so their homology groups are isomorphic.
[13] Although the Vietoris-Rips filtration is not identical to the Čech filtration in general, it is an approximation in a sense.
we have a chain of inclusions
between the Rips and Čech complexes on
[14] In general metric spaces, we have that
, implying that the Rips and Cech filtrations are 2-interleaved with respect to the interleaving distance as introduced by Chazal et al. in 2009.
[15][16] It is a well-known result of Niyogi, Smale, and Weinberger that given a sufficiently dense random point cloud sample of a smooth submanifold in Euclidean space, the union of balls of a certain radius recovers the homology of the object via a deformation retraction of the Čech complex.
[17] The offset filtration is also known to be stable with respect to perturbations of the underlying data set.
This follows from the fact that the offset filtration can be viewed as a sublevel-set filtration with respect to the distance function of the metric space.
The stability of sublevel-set filtrations can be stated as follows: Given any two real-valued functions
-dimensional homology modules on the sublevel-set filtrations with respect to
are point-wise finite dimensional, we have
denote the bottleneck and sup-norm distances, respectively, and
-dimensional persistent homology barcode.
[18] While first stated in 2005, this sublevel stability result also follows directly from an algebraic stability property sometimes known as the "Isometry Theorem,"[9] which was proved in one direction in 2009,[16] and the other direction in 2011.
[19][20] A multiparameter extension of the offset filtration defined by considering points covered by multiple balls is given by the multicover bifiltration, and has also been an object of interest in persistent homology and computational geometry.