Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Unzip Transformer #341

Open
BenGalewsky opened this issue Sep 1, 2021 · 0 comments · May be fixed by #574
Open

Unzip Transformer #341

BenGalewsky opened this issue Sep 1, 2021 · 0 comments · May be fixed by #574
Labels
PONDD PONDD Grand Challenge

Comments

@BenGalewsky
Copy link
Contributor

BenGalewsky commented Sep 1, 2021

Story

As an Astrophysicist, I want to extract compressed data from the Spitzer Photometry and Accurate Rotation Curves website so I can get accepted rotation curves for my analysis

Description

There are zip files available to download via http. We would like to perform analysis on the data contained within them so we need a way to inject them into the ServiceX object store.

This is the first case of a transformer that doesn't require a code-gen. We will use this opportunity to make the code-gen property of the transform request optional. If the code-gen property is not included, the transform workflow will skip the code generation step. The transformer manager will handle the case that there is no generated code configmap to mount in the transformers.

The user will provide the transformer "science container" in the transform request. It will be sslhep/unzip-transformer:latest

The transformer will download the zip file via HTTP and unzip the results into the usual output minio bucket.

See if we can use a streaming unzip to make this scalable by avoiding keeping the entire zip file and unzipped contents local to the transformer.

Test Case

We specifically want to see how this works with http://astroweb.cwru.edu/SPARC/Rotmod_LTG.zip

@BenGalewsky BenGalewsky moved this to Backlog in ServiceX Jun 2, 2022
@BenGalewsky BenGalewsky moved this to Backlog in ServiceX Sep 16, 2022
@BenGalewsky BenGalewsky added the PONDD PONDD Grand Challenge label Mar 23, 2023
@BenGalewsky BenGalewsky moved this from Backlog to Ready in ServiceX Mar 27, 2023
@shriram192 shriram192 moved this from Ready to In Progress in ServiceX Apr 3, 2023
@shriram192 shriram192 moved this from In Progress to Ready for Review in ServiceX Apr 19, 2023
@BenGalewsky BenGalewsky removed this from ServiceX Sep 26, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
PONDD PONDD Grand Challenge
Projects
None yet
Development

Successfully merging a pull request may close this issue.

1 participant