How to skip NaN values in cross correlation between 2 time series datasets.
Hi. I need to run a cross-correlation analysis between 2 time series datasets. Both datasets contain NaN values. In MATLAB, xcorr does not skip the positions where NaN values occur in the time series. Please help.
~Thanks
My data look like this:
Soil moisture (SM):
SM = [ay1-1, ay1-2, ay1-3, ay1-4, NaN, NaN, NaN, NaN, ay2-1, ay2-2, ay2-3, NaN, NaN, NaN,....]
Plant growth (PG):
PG = [by1-1, by1-2, by1-3, by1-4, NaN, NaN, NaN, NaN, by2-1, by2-2, by2-3, NaN, NaN, NaN....]
NaN values in both vectors occur only outside the growing season, and wherever there is a NaN in SM, there is also a NaN in PG.
I want to see whether PG shows any delayed response to changes in SM within the growing season (lag analysis).
6 Comments
Guillaume
on 1 Aug 2019
What do you mean by skipping? Removing the NaNs altogether, thus shortening the time series? If so, that will completely alter the whole correlation.
Chalita
on 1 Aug 2019
Guillaume
on 1 Aug 2019
I'm not sure what you think a cross correlation analysis is. For discrete vectors, a cross correlation is just a multiplication element by element of the two vectors with a different shift followed by a sum. So, when there is a NaN what should the result of the multiplication be?
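To make the point above concrete, here is a minimal sketch of that shift, multiply, and sum at a single lag. The vectors are made-up toy values (not the poster's data), and the pairing x(k + lag) with y(k) is an assumption about the shift convention. It uses base MATLAB only.

```matlab
% Toy vectors with made-up values (not the poster's data)
x = [1 2 3 NaN 5 6];
y = [2 1 NaN 4 3 2];

lag = 1;                 % assumed convention: pair x(k + lag) with y(k)
n   = numel(x);
k   = 1:(n - lag);       % indices for which both shifted samples exist

products = x(k + lag) .* y(k);   % element-by-element multiplication
r = sum(products);               % a single NaN term makes the whole sum NaN

disp(products)
disp(r)
```

Because any product involving a NaN is itself NaN, the sum at that lag is NaN too, which is why the question of what a NaN product should become has to be answered first.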
Chalita
on 1 Aug 2019
Depends on what you mean by not affecting the correlation. Treating the result of the multiplication as 0 would mean that the matching non-NaN element in the other vector isn't taken into account for that particular shift. That may indeed be useful as long as there aren't too many NaNs, but it will definitely affect the correlation values.
Replacing the NaNs with interpolated values may be more useful though. It's probably what Joe suggests doing with fillmissing (if he meant to use the 'linear' method).
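The first option in this comment (letting any product that involves a NaN contribute 0) could be sketched as below. This is only an illustration under assumptions: the vectors are made-up toy values, the names maxLag and nValid are hypothetical, and the result is a raw, unnormalized sum rather than what xcorr would return with a scaling option. It uses base MATLAB only.

```matlab
% Toy vectors with made-up values (not the poster's data)
x = [1 2 3 NaN 5 6];
y = [2 1 NaN 4 3 2];

maxLag = 2;                      % hypothetical choice of lag range
lags   = -maxLag:maxLag;
n      = numel(x);
r      = zeros(size(lags));      % raw cross-correlation, NaN pairs ignored
nValid = zeros(size(lags));      % number of pairs actually used per lag

for i = 1:numel(lags)
    m = lags(i);
    k = max(1, 1 - m):min(n, n - m);   % indices k with 1 <= k + m <= n
    p = x(k + m) .* y(k);              % products at this lag, NaN where either is NaN
    r(i)      = sum(p, 'omitnan');     % NaN products contribute nothing to the sum
    nValid(i) = nnz(~isnan(p));        % how many products were not NaN
end

disp([lags; r; nValid])
```

Keeping nValid alongside r shows how many pairs each lag is based on, which is one way to see how much the missing values affect a given correlation value.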
Chalita
on 1 Aug 2019
Answers (1)
Joe Vinciguerra
on 1 Aug 2019
Edited: Joe Vinciguerra
on 1 Aug 2019
I encountered this same issue a few weeks ago. My solution was to use fillmissing() to patch up the NaNs.
I also used resample() to make sure my time vector was evenly spaced, but I suppose you could delete your NaNs altogether, then resample.
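A minimal sketch of the fillmissing() route, assuming the 'linear' method and evenly spaced samples (so resample() is left out). The SM and PG values and the maxLag setting are made up for illustration, and xcorr requires the Signal Processing Toolbox.

```matlab
% Toy series with made-up values (not the poster's data)
SM = [0.30 0.28 NaN NaN 0.22 0.25 0.27 NaN 0.31 0.29];
PG = [1.0  1.1  NaN NaN 1.4  1.3  1.2  NaN 1.1  1.2];

% Fill the gaps by linear interpolation between neighbouring valid samples
SMf = fillmissing(SM, 'linear');
PGf = fillmissing(PG, 'linear');

% Remove the means so the result reflects co-variation rather than offsets
SMf = SMf - mean(SMf);
PGf = PGf - mean(PGf);

maxLag = 3;                                     % hypothetical lag range
[r, lags] = xcorr(PGf, SMf, maxLag, 'coeff');   % normalized cross-correlation

[~, iBest] = max(abs(r));                       % lag with the strongest correlation
bestLag = lags(iBest);
disp(bestLag)
```

With PG as the first argument, a positive bestLag should correspond to PG trailing SM under xcorr's documented convention, though that is worth checking against a signal with a known shift. Note also that with gaps spanning whole off-seasons, as in the question, linear interpolation invents values for periods with no growth, so the filled segments influence the result.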