• Keine Ergebnisse gefunden

Seminar on Functional Data Analysis A short summary

N/A
N/A
Protected

Academic year: 2022

Aktie "Seminar on Functional Data Analysis A short summary"

Copied!
6
0
0

Wird geladen.... (Jetzt Volltext ansehen)

Volltext

(1)

Seminar on Functional Data Analysis A short summary

Hansruedi K ¨unsch, ETH Zurich

Spring Semester 2014

(2)

FDA as infinite dimensional statistics

I Functional data analysis is multivariate statistics in infinite dimensions.

I Observations are always finite dimensional, so we need to convert them to functions.

I If observation points are dense, the relevant dimension may be much smaller than number of observations.

I If observation points are sparse and different between subjects, need functions for comparison, alignment or use as explanatory variables in a regression model.

I Principal components analysis shows the directions where most variability in a sample of functions occurs.

I Kernel estimators for regression function and densities exist also for functional data. Semimetrics are a tool to avoid the curse of (infinite) dimensionality.

(3)

Regularization

I Underlying principle for FDA is regularization based on basis expansion and smoothness assumptions.

I Regularization by penalization is preferred over truncation in a basis expansion.

I Lack of smoothness of a functionx is usually quantified by Z

(Lx(t))2dt

whereLis a differential operator. Standard choices are Lx(t) =D2x(t)for splines andLx(t) = (D+ (T )2D3)x(t) forT-periodic functions.

I To estimatem-th derivatives, the penalty should involve derivatives of orderm+2.

I (Generalized) crossvalidation is the preferred method to choose the amount of regularization.

(4)

Statistical methods generalized to FDA

I To generalize a standard statistical method to functional data, turn subscriptsj,k into function argumentss,t, replace sums by integrals and add a penalty term.

I Example: Linear regression with scalar response

Yi0+

p

X

j=1

βjxiji →Yi0+ Z

β(t)xi(t)dt+εi.

Fitting by penalized least squares

arg min X

i

(Yi −β0− Z

β(t)xi(t)dt)2+λ Z

(Lβ(t))2dt

! .

I Basis expansions ofβandxi allow to compute the integrals and lead to linear equations for the unknown coefficients.

I Extensions to linear regression with functional response exist.

(5)

Distinctive features of FDA

I Functional data analysis is multivariate statistics with variables ordered in time or space.

I Important information is contained in derivatives of curves.

I Principal differential analysis allows to study linear relations between functions and their derivatives.

I Registration (alignment) of curves is a tool to study

variation between subjects other than shifts and amplitude variation.

(6)

General remarks

I The main goal of seminars is not to learn a new topic, but to learn how to read a book chapter or a scientific paper and how to present the material in an understandable way.

I The book by Ramsay and Silverman has its emphasis on intuitive introduction of concepts and practical advice. At some places I would prefer more clarity and precision, using mathematical language.

I Talks in last two weeks gave a flavor of asymptotic results based on limit theorems.

Referenzen

ÄHNLICHE DOKUMENTE

In order to begin to create a model, now select the menu item File on the Model Builder and Equation Editor screen, shown in Figure 2.. Then click on the line item Create New

Chapter 13: functional response, multivariate covariates; Chapter 14: functional re- sponse, functional covariate with concurrent model. VII Functional covariates (07.04.2014,

The Microstructure Approach to Exchange Rates (The MIT Press, Cambridge, MA). Naik, Narayan Y., and Pradeep K. Yadav, 2003, Trading costs of public investors with obligatory

These include support of replicated or redundant servers for continuos access to file server data in the face of server or network failures and highly available data service in the

For the experiments we implemented the sparse stress model, Algorithm 1, as well as different sampling techniques in Java using Oracle SDK 1.8 and the yFiles 2.9 graph

Spectral analysis of stationary time series is extended to multivariate nonparametric trend func- tions that are estimated by wavelet thresholding1. A regression cross

In the functional data context, for example, a functional data object, called an fd object, in its simplest version conforms to the functional data class that specifies that the

A convenient approach is to assume a parametric distribution for the heterogeneity component e i (individual parameter) and to estimate the regression coefficients