rolling-computation – Make Me Engineer

Rolling OLS Regressions and Predictions by Group

June 5, 2023 by Tarik

You should be able to achieve what you want using the groupby / apply pattern. The below code should be helpful. Create example data: from statsmodels.regression.rolling import RollingOLS from statsmodels.tools.tools import add_constant import pandas as pd import numpy as np # make some toy data race_dates = pd.to_datetime([‘2020-06-09’]*3 + [‘2020-12-01’]*4 + [‘2021-01-21’]*4 + [‘2021-05-04’]*5) distance … Read more

Pandas rolling apply using multiple columns

November 20, 2022 by Tarik

How about this: def masscenter(ser): print(df.loc[ser.index]) return 0 rol = df.price.rolling(window=2) rol.apply(masscenter, raw=False) It uses the rolling logic to get subsets from an arbitrary column. The raw=False option provides you with index values for those subsets (which are given to you as Series), then you use those index values to get multi-column slices from your … Read more

Sum values in a rolling/sliding window

August 1, 2022 by Tarik

What you have is a vector, not an array. You can use rollapply function from zoo package to get what you need. > x <- c(1, 2, 3, 10, 20, 30) > #library(zoo) > rollapply(x, 3, sum) [1] 6 15 33 60 Take a look at ?rollapply for further details on what rollapply does and … Read more

Python – rolling functions for GroupBy object

July 6, 2022 by Tarik

For the Googlers who come upon this old question: Regarding @kekert’s comment on @Garrett’s answer to use the new df.groupby(‘id’)[‘x’].rolling(2).mean() rather than the now-deprecated df.groupby(‘id’)[‘x’].apply(pd.rolling_mean, 2, min_periods=1) curiously, it seems that the new .rolling().mean() approach returns a multi-indexed series, indexed by the group_by column first and then the index. Whereas, the old approach would simply … Read more

Consecutive/Rolling sums in a vector in R

June 19, 2022 by Tarik

Pandas: rolling mean by time interval

May 17, 2022 by Tarik

In the meantime, a time-window capability was added. See this link. In [1]: df = DataFrame({‘B’: range(5)}) In [2]: df.index = [Timestamp(‘20130101 09:00:00’), …: Timestamp(‘20130101 09:00:02’), …: Timestamp(‘20130101 09:00:03’), …: Timestamp(‘20130101 09:00:05’), …: Timestamp(‘20130101 09:00:06’)] In [3]: df Out[3]: B 2013-01-01 09:00:00 0 2013-01-01 09:00:02 1 2013-01-01 09:00:03 2 2013-01-01 09:00:05 3 2013-01-01 09:00:06 4 … Read more

How to calculate rolling / moving average using python + NumPy / SciPy?

May 9, 2022 by Tarik

If you just want a straightforward non-weighted moving average, you can easily implement it with np.cumsum, which may be is faster than FFT based methods: EDIT Corrected an off-by-one wrong indexing spotted by Bean in the code. EDIT def moving_average(a, n=3) : ret = np.cumsum(a, dtype=float) ret[n:] = ret[n:] – ret[:-n] return ret[n – 1:] … Read more