As per title. Our model is a time series of rainfall based on 3 environemntal factors (temperature, wind speed, humidity) within an island.
We have data from a period of 6.5 years.
Our data is in time intervals of every 5 minutes.
However over this period, not all stations are always active (there are about 18-23 stations active at any one datapoint and sometimes this number drops to 8 or even 1). Sometimes no stations are available for data over a period of 15 minutes.
our goal in plotting the time series is to train three models (the initial time series, its first derivative to assess data drive, and then a third model to analyze concept drift)
What can we do to deal with these discrepancies?