[WIP] draft subsampling bootstrap for mcse #1974

OriolAbril · 2022-02-03T13:17:04Z

Description

This adds a draft version of the subsampling bootstrap method (SBM, as defined in https://doi.org/10.1214/14-EJS957), generalized so that it can be used with any arbitrary function.

Checklist

Follows official PR format
Includes a sample plot to visually illustrate the changes (only for plot-related functions)
New features are properly documented (with an example if appropriate)?
Includes new or updated tests to cover the new feature
Code style correct (follows pylint and black guidelines)
Changes are listed in changelog

ahartikainen · 2022-02-08T10:10:00Z

arviz/stats/diagnostics.py

+       https://doi.org/10.1214/14-EJS957
+
+    """
+    flat_ary = np.ravel(ary)


Is this sensitive to the order in which the ravel is done?

OriolAbril · 2022-02-08

I think it technically is, but there should be no difference (hopefully) if the model has converged. It is also not clear to me how multiple chains should be handled when implementing this algorithm. I started with this flatten approach, but I can test a couple of options.
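To make the ordering question concrete, here is a minimal sketch of what the two ravel orders do. It assumes the input has shape (chain, draw); the toy array is made up for illustration and is not taken from the PR.

```python
import numpy as np

# Toy array with shape (chain, draw): 2 chains, 3 draws each (illustrative values).
ary = np.array([[0, 1, 2],
                [10, 11, 12]])

# Default C order: chains are concatenated one after another.
concat = np.ravel(ary)
print(concat.tolist())       # [0, 1, 2, 10, 11, 12]

# Fortran order: draws from the different chains are interleaved.
interleaved = np.ravel(ary, order="F")
print(interleaved.tolist())  # [0, 10, 1, 11, 2, 12]
```

With C order, every sliding window except those that straddle a chain boundary contains consecutive draws from a single chain. With F order, every window mixes draws from all chains.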

sethaxen · 2023-01-16

@OriolAbril have you tested alternative approaches for handling multiple chains yet?

sethaxen · 2023-01-17

I benchmarked the following approaches for estimating the MCSE of the mean:

ess: using ESS to estimate the MCSE, for reference

sbm_stack_chains: estimating the MCSE with SBM by concatenating the chains (as done here)

sbm_stack_draws: estimating the MCSE with SBM by interleaving the chains (i.e. flattening along the other dimension)

sbm_separate_chains: estimating the MCSE of each chain with SBM, then summing the variances and dividing by nchains^2

sbm_shuffle: flattening the chains and shuffling the draws before running SBM

I performed the same benchmark as in https://avehtari.github.io/rhat_ess/ess_comparison.html, additionally transforming the chains to target different stationary distributions. Here's the result:

In all cases SBM underestimates the MCSE; this is particularly severe when autocorrelation is high and sample sizes are low. sbm_stack_chains is consistently better than the alternatives though. I didn't even bother plotting sbm_shuffle, since it was apparent pretty quickly that it was far worse than the others.
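For readers who want to reproduce the comparison, the four SBM variants could be sketched roughly as below. This is not the benchmark code. The helper name `sbm_sd`, the default batch size `b = floor(sqrt(n))`, the use of all `n - b + 1` overlapping windows, and the final division by `sqrt(n)` are assumptions based on my reading of the SBM definition, and the AR(1) test data is made up.

```python
import numpy as np


def sbm_sd(x, func=np.mean, b=None):
    """Subsampling-bootstrap estimate of the asymptotic sd of func (sketch)."""
    x = np.asarray(x, dtype=float)
    n = x.size
    if b is None:
        b = int(np.sqrt(n))  # assumed batch size, not taken from the PR
    # Evaluate func on every overlapping window of length b.
    estimates = np.array([func(x[i : i + b]) for i in range(n - b + 1)])
    return np.sqrt(b * np.var(estimates))


def mcse_stack_chains(ary):
    flat = np.ravel(ary)  # chain 0, then chain 1, ...
    return sbm_sd(flat) / np.sqrt(flat.size)


def mcse_stack_draws(ary):
    flat = np.ravel(ary, order="F")  # interleave the chains
    return sbm_sd(flat) / np.sqrt(flat.size)


def mcse_separate_chains(ary):
    n_chains, n_draws = ary.shape
    # Per-chain MCSE^2, summed and divided by nchains^2 (variance of the mean of chain means).
    per_chain_var = [(sbm_sd(chain) / np.sqrt(n_draws)) ** 2 for chain in ary]
    return np.sqrt(np.sum(per_chain_var)) / n_chains


def mcse_shuffle(ary, seed=0):
    flat = np.random.default_rng(seed).permutation(np.ravel(ary))
    return sbm_sd(flat) / np.sqrt(flat.size)


# Illustrative data: 4 autocorrelated AR(1) chains with 1000 draws each.
rng = np.random.default_rng(0)
ary = np.zeros((4, 1000))
for t in range(1, ary.shape[1]):
    ary[:, t] = 0.9 * ary[:, t - 1] + rng.normal(size=4)

variants = (mcse_stack_chains, mcse_stack_draws, mcse_separate_chains, mcse_shuffle)
results = {f.__name__: f(ary) for f in variants}
for name, value in results.items():
    print(name, value)
```

Shuffling destroys the autocorrelation that the overlapping windows are meant to capture, which would be consistent with sbm_shuffle performing worst.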

arviz/stats/diagnostics.py

OriolAbril · 2022-03-13T19:52:20Z

This func-based method for mcse should also replace mc_error.

ahartikainen · 2022-03-14T09:37:11Z

arviz/stats/diagnostics.py

+    if prob is not None:
+        func_kwargs["prob"] = prob
+    elif func is not None:
+        func_kwargs["func"] = func
+


~~setdefault ?~~

nevermind, bad suggestion

OriolAbril · 2022-08-17T23:01:36Z

arviz/stats/diagnostics.py

+    for i in range(n - b):
+        sub_ary = flat_ary[i : i + b]
+        func_estimates[i] = func(sub_ary, **func_kwargs)
+    func_estimate_sd = np.sqrt(b * var_func(func_estimates))


We should probably decide, API-wise, whether we want to keep var_func or instead move to std_func and multiply its result by the square root of b.
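As a quick sanity check on the two options, the sketch below compares them for the default NumPy functions. The array of per-window estimates and the batch size are stand-ins invented for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)
func_estimates = rng.normal(size=200)  # stand-in for the per-window estimates
b = 25                                 # stand-in batch size

# Current form: variance function, scaled by b inside the square root.
via_var = np.sqrt(b * np.var(func_estimates))

# Alternative form: standard deviation function, scaled by sqrt(b).
via_std = np.sqrt(b) * np.std(func_estimates)

print(np.isclose(via_var, via_std))  # True
```

For np.var and np.std the two forms agree, so the choice is mostly about the API. They would only differ for a user-supplied dispersion function whose "std" is not defined as the square root of its "var".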

OriolAbril · 2022-08-17T23:07:09Z

arviz/stats/diagnostics.py

        Quantile information.
+    func : callable, optional


We could also consider allowing some strings here, e.g. "circmean" would expand to stats.circmean as func and also fill mcse_kwargs with {"var_func": stats.circvar}.
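One possible shape for that string handling is sketched below. `FUNC_ALIASES` and `resolve_func` are hypothetical names used only for illustration and are not part of ArviZ; only `scipy.stats.circmean` and `scipy.stats.circvar` are real functions.

```python
from scipy import stats

# Hypothetical alias table: string -> (func, default mcse_kwargs).
FUNC_ALIASES = {
    "circmean": (stats.circmean, {"var_func": stats.circvar}),
}


def resolve_func(func, mcse_kwargs=None):
    """Expand a string alias into a callable plus default mcse_kwargs (sketch)."""
    mcse_kwargs = dict(mcse_kwargs or {})
    if isinstance(func, str):
        try:
            func, defaults = FUNC_ALIASES[func]
        except KeyError:
            raise ValueError(f"Unknown func alias: {func!r}") from None
        # Explicitly passed kwargs take precedence over the alias defaults.
        mcse_kwargs = {**defaults, **mcse_kwargs}
    return func, mcse_kwargs


func, mcse_kwargs = resolve_func("circmean")
print(func.__name__, mcse_kwargs["var_func"].__name__)
```

Callables pass through unchanged, so existing calls with a function as func would keep working under this scheme.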

codecov · 2022-08-17T23:27:01Z

Codecov Report

Merging #1974 (13dd989) into main (8a2bc39) will decrease coverage by 0.16%.
The diff coverage is 36.84%.

@@            Coverage Diff             @@
##             main    #1974      +/-   ##
==========================================
- Coverage   90.78%   90.62%   -0.17%     
==========================================
  Files         117      117              
  Lines       12484    12518      +34     
==========================================
+ Hits        11334    11344      +10     
- Misses       1150     1174      +24

| Impacted Files | Coverage Δ | |
|---|---|---|
| arviz/stats/diagnostics.py | `93.28% <36.84%> (-5.69%)` | ⬇️ |
| arviz/data/datasets.py | `98.48% <0.00%> (+0.09%)` | ⬆️ |


OriolAbril marked this pull request as draft February 3, 2022 13:32

ahartikainen reviewed Feb 8, 2022


OriolAbril commented Feb 8, 2022



OriolAbril force-pushed the mcse_sbm branch from bcb03c4 to 2a0aabb Compare March 11, 2022 15:15

OriolAbril mentioned this pull request Mar 13, 2022

add mad wrapper arviz-devs/xarray-einstats#4

Merged

ahartikainen reviewed Mar 14, 2022


OriolAbril added 4 commits August 12, 2022 04:38

draft subsampling bootstrap for mcse

f64d942

fix n term in sbm mcse method

7373179

fix behaviour on arrays

f123d25

fix issue with numba version

697d9a5

OriolAbril force-pushed the mcse_sbm branch from ba2809f to 697d9a5 Compare August 15, 2022 09:18

OriolAbril added 2 commits August 15, 2022 12:01

updates to api

00bdf7d

add var_func argument

13dd989

OriolAbril commented Aug 17, 2022


This was referenced Jan 12, 2023

Improved MCSE TuringLang/MCMCDiagnosticTools.jl#39

Closed

Redesign of MCSE TuringLang/MCMCDiagnosticTools.jl#63

Merged

OriolAbril mentioned this pull request Apr 4, 2023

Consider interpolation HDI calculations #2168

Open


OriolAbril commented Feb 3, 2022

ahartikainen Feb 8, 2022

OriolAbril Feb 8, 2022

sethaxen Jan 16, 2023

sethaxen Jan 17, 2023

OriolAbril commented Mar 13, 2022

ahartikainen Mar 14, 2022 (edited)

ahartikainen Mar 14, 2022

OriolAbril Aug 17, 2022

OriolAbril Aug 17, 2022

codecov bot commented Aug 17, 2022
