Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Feature request: Python tools for obs sequences #742

Open
hkershaw-brown opened this issue Sep 26, 2024 · 0 comments
Open

Feature request: Python tools for obs sequences #742

hkershaw-brown opened this issue Sep 26, 2024 · 0 comments

Comments

@hkershaw-brown
Copy link
Member

Use case

For CROCODILE, Python based tools for observation space diagnostics. Might be useful more generally for DART, so adding this issue to track.

Is your feature request related to a problem?

Originally for CROCODILE obs space diaginostic plotting in the python ecosystem, but the ability to examine obs squences in a dataframe in a Jupyter notebook (or Python tool of your choice) is quite helpful, e.g. finding duplicates in obs sequences, looking at output from obs converters, subsetting observations (in space, time, or by X), splitting and joining obs sequences.
No need to run obs_diag to bin observations, you can read the obs_sequence into a dataframe directly.

Example finding duplicates:
Screenshot 2024-09-26 at 3 37 33 PM

Describe your preferred solution

https://github.com/NCAR/pyDARTdiags. See issues for various notes and docs for documentation.
https://pypi.org/project/pydartdiags/ (but recommend you do a local editable pip install if you are developing this or playing with it)
BUYER BEWARE, this is bleeding edge.

Describe any alternatives you have considered

Currently using pandas, which seems ok (tried naively loading 20GB obs sequences one after the other, actually worked on my mac). Probably need to think about big-big data tools for going larger (and maybe faster).
Also keeping notes on other observation tools (NCAR/pyDARTdiags#4).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant