Title: | Fit a Principal Curve in Arbitrary Dimension |
---|---|
Description: | Fitting a principal curve to a data matrix in arbitrary dimensions. Hastie and Stuetzle (1989) <doi:10.2307/2289936>. |
Authors: | Trevor Hastie [aut], Andreas Weingessel [aut], Kurt Hornik [aut] , Henrik Bengtsson [ctb] (HenrikBengtsson), Robrecht Cannoodt [aut, cre] (<https://orcid.org/0000-0003-3641-729X>, rcannood) |
Maintainer: | Robrecht Cannoodt <[email protected]> |
License: | GPL-2 |
Version: | 2.1.6 |
Built: | 2025-01-16 09:47:21 UTC |
Source: | https://github.com/rcannood/princurve |
Fit a principal curve which describes a smooth curve that passes through the middle
of the data x
in an orthogonal sense. This curve is a non-parametric generalization
of a linear principal component. If a closed curve is fit (using smoother = "periodic_lowess"
)
then the starting curve defaults to a circle, and each fit is followed by a bias correction
suggested by Jeff Banfield.
Hastie, T. and Stuetzle, W., Principal Curves, JASA, Vol. 84, No. 406 (Jun., 1989), pp. 502-516, doi:10.2307/2289936 (PDF).
See also Banfield and Raftery (JASA, 1992).
principal_curve
, project_to_curve
Fit a principal curve which describes a smooth curve that passes through the middle
of the data x
in an orthogonal sense. This curve is a non-parametric generalization
of a linear principal component. If a closed curve is fit (using smoother = "periodic_lowess"
)
then the starting curve defaults to a circle, and each fit is followed by a bias correction
suggested by Jeff Banfield.
principal_curve( x, start = NULL, thresh = 0.001, maxit = 10, stretch = 2, smoother = c("smooth_spline", "lowess", "periodic_lowess"), approx_points = FALSE, trace = FALSE, plot_iterations = FALSE, ... ) ## S3 method for class 'principal_curve' lines(x, ...) ## S3 method for class 'principal_curve' plot(x, ...) ## S3 method for class 'principal_curve' points(x, ...) whiskers(x, s, ...)
principal_curve( x, start = NULL, thresh = 0.001, maxit = 10, stretch = 2, smoother = c("smooth_spline", "lowess", "periodic_lowess"), approx_points = FALSE, trace = FALSE, plot_iterations = FALSE, ... ) ## S3 method for class 'principal_curve' lines(x, ...) ## S3 method for class 'principal_curve' plot(x, ...) ## S3 method for class 'principal_curve' points(x, ...) whiskers(x, s, ...)
x |
a matrix of points in arbitrary dimension. |
start |
either a previously fit principal curve, or else a matrix
of points that in row order define a starting curve. If missing or NULL,
then the first principal component is used. If the smoother is
|
thresh |
convergence threshold on shortest distances to the curve. |
maxit |
maximum number of iterations. |
stretch |
A stretch factor for the endpoints of the curve, allowing the curve to grow to avoid bunching at the end. Must be a numeric value between 0 and 2. |
smoother |
choice of smoother. The default is
|
approx_points |
Approximate curve after smoothing to reduce computational time.
If |
trace |
If |
plot_iterations |
If |
... |
additional arguments to the smoothers |
s |
a parametrized curve, represented by a polygon. |
An object of class "principal_curve"
is returned. For this object
the following generic methods a currently available: plot, points, lines
.
It has components:
s |
a matrix corresponding to |
ord |
an index, such that |
lambda |
for each point, its arc-length from the beginning of the
curve. The curve is parametrized approximately by arc-length, and
hence is |
dist |
the sum-of-squared distances from the points to their projections. |
converged |
A logical indicating whether the algorithm converged or not. |
num_iterations |
Number of iterations completed before returning. |
call |
the call that created this object; allows it to be
|
Hastie, T. and Stuetzle, W., Principal Curves, JASA, Vol. 84, No. 406 (Jun., 1989), pp. 502-516, doi:10.2307/2289936 (PDF).
x <- runif(100,-1,1) x <- cbind(x, x ^ 2 + rnorm(100, sd = 0.1)) fit <- principal_curve(x) plot(fit) lines(fit) points(fit) whiskers(x, fit$s)
x <- runif(100,-1,1) x <- cbind(x, x ^ 2 + rnorm(100, sd = 0.1)) fit <- principal_curve(x) plot(fit) lines(fit) points(fit) whiskers(x, fit$s)
This function is deprecated, please use
principal_curve
and project_to_curve
instead.
principal.curve(...) ## S3 method for class 'principal.curve' lines(...) ## S3 method for class 'principal.curve' plot(...) ## S3 method for class 'principal.curve' points(...) get.lam(...)
principal.curve(...) ## S3 method for class 'principal.curve' lines(...) ## S3 method for class 'principal.curve' plot(...) ## S3 method for class 'principal.curve' points(...) get.lam(...)
... |
Catch-all for old parameters. |
Finds the projection index for a matrix of points x
, when
projected onto a curve s
. The curve need not be of the same
length as the number of points.
project_to_curve(x, s, stretch = 2)
project_to_curve(x, s, stretch = 2)
x |
a matrix of data points. |
s |
a parametrized curve, represented by a polygon. |
stretch |
A stretch factor for the endpoints of the curve, allowing the curve to grow to avoid bunching at the end. Must be a numeric value between 0 and 2. |
A structure is returned which represents a fitted curve. It has components
s |
The fitted points on the curve corresponding to each point |
ord |
the order of the fitted points |
lambda |
The projection index for each point |
dist |
The total squared distance from the curve |
dist_ind |
The squared distances from the curve to each of the respective points |
t <- runif(100, -1, 1) x <- cbind(t, t ^ 2) + rnorm(200, sd = 0.05) s <- matrix(c(-1, 0, 1, 1, 0, 1), ncol = 2) proj <- project_to_curve(x, s) plot(x) lines(s) segments(x[, 1], x[, 2], proj$s[, 1], proj$s[, 2])
t <- runif(100, -1, 1) x <- cbind(t, t ^ 2) + rnorm(200, sd = 0.05) s <- matrix(c(-1, 0, 1, 1, 0, 1), ncol = 2) proj <- project_to_curve(x, s) plot(x) lines(s) segments(x[, 1], x[, 2], proj$s[, 1], proj$s[, 2])
Each of these functions have an interface function(lambda, xj, ...)
, and
return smoothed values for xj. The output is expected to be ordered along an ordered lambda.
This means that the following is true:
x <- runif(100) y <- runif(100) ord <- sample.int(100) sfun <- smoother_functions[[1]] all(sfun(x, y) == sfun(x[ord], y[ord]))
smoother_functions
smoother_functions
An object of class list
of length 3.
The starting circle is defined in the first two dimensions, and has zero values in all other dimensions.
start_circle(x)
start_circle(x)
x |
The data for which to generate the initial circle |
## Not run: x <- cbind( rnorm(100, 1, .2), rnorm(100, -5, .2), runif(100, 1.9, 2.1), runif(100, 2.9, 3.1) ) circ <- start_circle(x) plot(x) lines(circ) ## End(Not run)
## Not run: x <- cbind( rnorm(100, 1, .2), rnorm(100, -5, .2), runif(100, 1.9, 2.1), runif(100, 2.9, 3.1) ) circ <- start_circle(x) plot(x) lines(circ) ## End(Not run)