Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Build DAG outline #37

Merged
merged 1 commit into from
Jun 20, 2024
Merged

Build DAG outline #37

merged 1 commit into from
Jun 20, 2024

Conversation

jacobthill
Copy link
Contributor

No description provided.

Copy link
Contributor

@edsu edsu left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I left some naming suggestions, but approved it, in case you decide you like it better the way it is.

"""
snapshot_dir = create_snapshot_dir(data_dir)
return snapshot_dir

@task()
def fetch_sul_pub(snapshot_dir):
def dimensions_harvest_orcid(orcids):
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I wonder if a name like dimensions_harvest_dois might better describe what is being harvested?

return True

@task()
def openalex_harvest_orcid(orcids):
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Same question here: does openalex_harvest_dois() describe what is being harvested?

"""
return True

@task()
def fetch_openalex(dois):
def dimensions_harvest_doi(dois):
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Here we are harvesting publications?

"""
return True

@task()
def fetch_dimensions(dois):
def openalex_harvest_doi(dois):
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We are harvesting publications instead of dois.

@@ -70,16 +85,16 @@ def merge_publications(sul_pub, openalex, dimensions):
return True

@task()
def merge_contributors(pubs):
def join_org_data(pubs, org_data):
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I wonder if it might be better to refer to them as authors instead of org_data?

join_authors() ?

@jacobthill
Copy link
Contributor Author

I'm merging this and will clean up the names in my next pr

@jacobthill jacobthill merged commit e614313 into main Jun 20, 2024
1 check failed
@jacobthill jacobthill deleted the dag-outline branch June 20, 2024 15:00
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants