$$ \DeclareMathOperator*{\argmin}{\arg\!\min} \DeclareMathOperator*{\argmax}{\arg\!\max} \newcommand\independenT{\protect\mathpalette{\protect\independenT}{\perp}} \def\independenT#1{\mathrel{\rlap{#1}\mkern2mu{#1}}} \newcommand{\rv}[2]{ #1_{1},...,#1_{#2} } \newcommand{\conv}[1]{ \overset{#1}{\longrightarrow} } \newcommand{\dnconv}[1]{ \overset{#1}{\centernot\longrightarrow} } \newcommand\smtop{\mkern-2mu\raise.25ex\hbox{$\scriptscriptstyle\top$}\mkern-3mu} \def\ds{\displaystyle} \def\bs{\boldsymbol} \def\bsl{\backslash} \def\ipl{\langle} \def\ipr{\rangle} \def\st{\text{\indent s.t \indent }} \def\bif{\text{\indent if \indent } } \def\a{\alpha} \def\b{\beta} \def\g{\gamma} \def\th{\theta} \def\bth{\boldsymbol \theta} \def\la{\lambda} \def\La{\Lambda} \def\k{\kappa} \def\t{\tau} \def\r{\rho} \def\s{\sigma} \def\ssq{\sigma^2} \def\d{\delta} \def\w{\omega} \def\om{\Omega} \def\vep{\varepsilon} \def\vphi{\varphi} \def\cL{\mathcal{L}} \def\indi{\mathbbm{1}} \def\I{\mathcal{I}} \def\J{\mathcal{J}} \def\es{\emptyset} \def\var{\mathrm{var}} \def\Var{\mathrm{var}} \def\cov{\mathrm{cov}} \def\sd{\mathrm{sd}} \def\y{\textbf{\textit{y}}} \def\R{\mathbb{R}} \def\Q{\mathbb{Q}} \def\Qc{\mathbb{Q}^{c}} \def\bE{\mathbb{E}} \def\bP{\mathbb{P}} \def\Z{\mathbb{Z}} \def\C{\mathbb{C}} \def\N{\mathbb{N}} \def\H{\mathbb{H}} \def\Sbb{\mathbb{S}} \def\sA{\mathscr{A}} \def\cA{\mathcal{A}} \def\sD{\mathscr{D}} \def\cD{\mathcal{D}} \def\cG{\mathcal{G}} \def\G{\mathscr{G}} \def\F{\mathscr{F}} \def\sR{\mathscr{R}} \def\cB{\mathcal{B}} \def\sB{\mathscr{B}} \def\sC{\mathscr{C}} \def\cC{\mathcal{C}} \def\cE{\mathcal{E}} \def\sE{\mathscr{E}} \def\fF{\mathfrak{F}} \def\sF{\mathscr{F}} \def\fG{\mathfrak{G}} \def\bG{\mathbbm{G}} \def\sG{\mathscr{G}} \def\sH{\mathscr{H}} \def\cH{\mathcal{H}} \def\sJ{\mathscr{J}} \def\cJ{\mathcal{J}} \def\I{\mathcal{I}} \def\J{\mathcal{J}} \def\sT{\mathscr{T}} \def\cT{\mathcal{T}} \def\fM{\mathfrak{M}} \def\cM{\mathcal{M}} \def\sM{\mathscr{M}} \def\cN{\mathcal{N}} \def\sN{\mathscr{N}} \def\sP{\mathscr{P}} \def\cP{\mathcal{P}} \def\cW{\mathcal{W}} \def\X{\mathcal{X}} \def\Y{\mathcal{Y}} \def\cP{\mathcal{P}} \def\sP{\mathscr{P}} \def\SS{\mathscr{S}} \def\ker{\mathrm{ker}} \def\ran{\mathrm{ran}} \def\vecv{\mathrm{vec}} \def\wec{\mathrm{wec}} \def\diag{\mathrm{diag}} \def\sspan{\mathrm{span}} % I think this clashes with something; will not work \def\tr{\mathrm{tr}} \def\rank{\mathrm{rank}} \def\pr{\mathrm{pr}} \def\dist{\mathrm{dist}} \def\BIC{\mathrm{BIC}} \def\KC{\mathrm{KC}} \def\inprob{\overset{p}{\longrightarrow}} \def\indist{\overset{d}{\longrightarrow}} \def\eqdist{\overset{d}{=}} \def\as{\overset{a.s}{\longrightarrow}} \def\tends{\longrightarrow} \def\ind{\;\;\;\;\;} \def\etal{\text{et al.} } \def\indep{\independenT{\perp}} \def\iid{\overset{iid}{\sim}} \def\GEP{\mathrm{GEP}} \def\sspan{\mathrm{span}} \def\bnorm{ \bigg \| } $$

Functional Sliced Inverse Regression

Author

Harris Quach

Published

January 1, 2025

Understanding Functional Sliced Inverse Regression (FSIR)

Functional Sliced Inverse Regression (FSIR) is an extension of the classical Sliced Inverse Regression (SIR), tailored for functional data. Functional data arises when the predictors are not scalar or vector values but functions, such as curves or trajectories over a continuous domain. This fundamental difference necessitates specialized methods like FSIR, as functional data requires handling infinite-dimensional spaces and smooth trajectories, unlike finite-dimensional Euclidean data.

Motivation

In regression problems with functional predictors, the goal is to understand how a response variable $Y$ depends on a set of functional predictors $X(t)$, where $t$ represents a continuous domain (e.g., time, wavelength). Unlike Euclidean data, where predictors are represented as points or vectors in finite-dimensional space, functional data consists of entire functions observed over a continuum. Standard dimension reduction techniques may not be directly applicable due to the infinite-dimensional nature of functional predictors. FSIR adapts the principles of SIR to handle this complexity.

FSIR identifies directions in the functional predictor space that are most informative about the response variable. These directions, referred to as effective dimension reduction (EDR) directions, reduce the functional data to a finite-dimensional subspace while retaining the essential relationship between $Y$ and $X(t)$.

The FSIR Algorithm

FSIR builds on the classical SIR framework but incorporates functional data concepts. The algorithm can be summarized as follows:

Steps for Implementation

Center and standardize the functional predictors:
- Compute the mean function of $X(t)$ across all observations.
- Subtract the mean function from each $X(t)$ to center the data.
- Standardize the data to have unit variance over the domain $t$.
Slice the response variable $Y$:
- Divide the range of $Y$ into $H$ non-overlapping slices, typically based on quantiles.
- Assign each observation to a slice based on the value of $Y$.
Compute mean functions for each slice:
- For each slice $h$, calculate the mean function $m_h(t)$ of the functional predictors $X(t)$ belonging to that slice.
Estimate the covariance operator:
- Calculate the covariance operator of the mean functions $m_h(t)$ across slices. This captures the variation in $X(t)$ that corresponds to variations in $Y$.
Perform eigen-decomposition:
- Perform eigen-decomposition on the covariance operator to extract its leading eigenfunctions.
- These eigenfunctions represent the functional EDR directions.
Project the functional data:
- Project each functional predictor $X(t)$ onto the leading eigenfunctions to obtain a finite-dimensional representation.
Model the reduced representation:
- Use the reduced representation to model the relationship between $Y$ and $X(t)$ using appropriate regression or classification techniques.

Key Assumptions

Linearity condition: The marginal distribution of $X(t)$ is such that any linear combination of $X(t)$ is approximately normally distributed. This ensures that slicing captures the relevant information about $Y$.
Sufficient dimension reduction (SDR): The relationship between $Y$ and $X(t)$ is fully captured by a finite-dimensional projection of $X(t)$. This assumption parallels that for Euclidean data but must account for the infinite-dimensional nature of functional predictors.

Applications

FSIR is widely applied in fields such as: - Neuroscience: To analyze brain activity curves and relate them to cognitive outcomes. - Environmental science: To model relationships between environmental variables (e.g., temperature curves) and ecological responses. - Finance: To study the impact of functional predictors like stock price trajectories on financial outcomes.

Example

Consider a dataset with functional predictors $X(t)$ representing temperature curves over a day and a continuous response $Y$ indicating crop yield. Using FSIR, we may find that only a few functional EDR directions explain the majority of the relationship between $Y$ and $X(t)$. This enables us to reduce the problem from an infinite-dimensional regression to a finite-dimensional one, simplifying analysis and interpretation.

Limitations

Dependence on slicing: The choice of slicing method and the number of slices can influence the results.
Linearity assumption: Violations of the linearity condition can lead to inaccurate estimates of functional EDR directions.
Complex interactions: FSIR may struggle with capturing complex nonlinear interactions among functional predictors.