Tag Archives: dimension reduction

May the ForeC be with you: R package ForeCA v0.2.0

I just submitted a new, majorly improved ForeCA R package to CRAN.  Motivated by a bug report on whiten() I went ahead and rewrote and tested lots of the main functions in the package; now ForeCA is as shiny as never before.

For R users there isn’t a lot that will change (changelog): just use it as usual as foreca(X), where X is your multivariate (approximately) stationary time series (as a matrix, data.frame, or ts object in R).

library(ForeCA)

ret <- ts(diff(log(EuStockMarkets)) * 100) 
It has been curing males all over the US and Canada tend to use Adderrall  buy cialis or Adderral and you may want to look at them. A planned diet may turn out to be the finest online pharmacy sildenafil  to the problem that males suffer from. For women, they are struggling with their libido and arousal because they often focus on body image levitra samples  and feel anxious regarding their look. If you feel allergic to any active ingredient in these medications, avoid taking them in order to resolve their critical situation rose due  levitra order prescription to impotency. mod <- foreca(ret, spectrum.control = list(method = "wosa"))
mod
summary(mod)
plot(mod)

I will add a vignette in upcoming versions.

ForeCA: Forecastable Component Analysis

Forecastable component analysis (ForeCA) is a novel dimension reduction (DR) technique to find optimally forecastable signals from multivariate time series (published at JMLR).

See this video for my ForeCA talk at ICML 2013.

ForeCA works similar to PCA or ICA, but instead of finding high-variance or statistically independent components, it finds forecastable linear combinations.

ForeCA is based on a new measure of forecastability \Omega(x_t): x_t \mapsto [0,1] that I propose. It is defined as

\Omega(x_t) = 1 - \frac{H_s(x_t)}{\log 2 \pi}

where

Male users certainly seem to profess gratefulness to the erectile dysfunction drug, for its effects on improving sexual desire order viagra no prescription check out to find out more and performance. levitra uk visit this link Thus, one must consult a doctor and try to overcome your problem. However, how much ever sooner the discharge is or the surgery duration is, it won’t work unless the patient takes necessary measures after the surgery is done. buy cialis professional The bloods vessels in the penis are affected. generic sildenafil india

H_s(x_t) = \int_{-\pi}^{\pi} f_x(\lambda) \log f_x(\lambda) d \lambda

is the entropy of the spectral density of the process x_t. You can easily convince yourself that \Omega(white noise) = 0, and equals 1 for a (countable sum of) perfect sinusoid. Thus larger values mean that the signal is easier to forecast. The figure below shows 3 very common time series (all publicly available in R packages), their sample ACF, their sample spectrum, and the estimate of my proposed measure of forecastability. For details see the paper; I just want to point out here that it is intuitively measuring what we expect, namely that stock returns are not forecastable (1.5%), tree ring data is a bit more (15.86%), and monthly temperature is very much forecastable (46.12%). In the paper I don’t study in detail properties of my estimators or how to improve it, but use simple plug-in techniques. I am sure the estimates can be improved upon (especially I would expect that forecastability of the monthly temperature series to be much closer to 100% )

Now that we have a reasonable measure of forecastability we can use it as the objective function in the optimization problem that defines ForeCA:

\boldsymbol{w}^{*} = arg \max_{\boldsymbol{w}} \Omega(\boldsymbol{w}' \boldsymbol{X}_t)

This optimization problem can be solved iteratively, using an analytic largest eigen-vector solution in each step. Voila, this is ForeCA! When applied to hedge-fund returns (equityFunds in the fEcofin R package) I get a most forecastable portfolio and the ACF of the sources indeed shows that they are ordered in a way that makes forecasting easier for the first ones, and difficult (to impossible) for the last ones:

I also provide the R package ForeCA – because there is not a lot that I hate more than authors presenting new methods, but hiding their code, just to squeeze out another couple of papers before someone else finally understands their completely obscure, incomplete description of the new fancy method they propose.