Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

DO NOT MERGE: Address qc-duplicate-exact-synonym-no-abbrev failures in mondo - build #755

Closed

Conversation

joeflack4
Copy link
Contributor

- Filters cases where there are multiple Mondo terms with the same exactSynonym, and instead puts them into a special curation TSV.
- Add: reports/review-qc-duplicate-exact-synonym-no-abbrev.tsv
- Add/Update: Make goals & Python scripting to filter files & create that TSV.
- Update: Handle Mondo -unconfirmed cases too: Cases where the synonym sync finds no trace of the synonym in the source, but since we have not yet
- Update: Removed -confirmed deduping. This wasn't the correct way to handle checking against synonyms that already exist in Mondo, and -confirmed does not introduce duplication anyway. It doesn't add synonyms; only adds evidence for existing ones.
- Update: Handling cases now where a new exactSynonym coming in through the sync is equivalent to a label that exists on another Mondo term.
@joeflack4 joeflack4 marked this pull request as draft January 16, 2025 20:30
@joeflack4 joeflack4 self-assigned this Jan 16, 2025
@joeflack4 joeflack4 added the build Mostly for build PRs: when changes only to data files post `build-mondo-ingest`; no code changes label Jan 16, 2025
@joeflack4 joeflack4 force-pushed the qc-duplicate-exact-synonym-no-abbrev--build branch from 145c019 to 7f159e3 Compare January 17, 2025 03:06
@joeflack4 joeflack4 requested a review from twhetzel January 17, 2025 03:53
@joeflack4 joeflack4 force-pushed the qc-duplicate-exact-synonym-no-abbrev--build branch from 7f159e3 to 016ff3e Compare January 17, 2025 03:53
Copy link
Contributor Author

@joeflack4 joeflack4 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Build looks good, and these results were used to get QC to pass in:

Copy link
Contributor Author

@joeflack4 joeflack4 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM! Also what I wrote in the mini-build, confirmed working as attended via QC check in mondo:

Base automatically changed from qc-duplicate-exact-synonym-no-abbrev to develop February 5, 2025 02:13
@joeflack4 joeflack4 closed this Feb 6, 2025
@joeflack4 joeflack4 deleted the qc-duplicate-exact-synonym-no-abbrev--build branch February 6, 2025 21:11
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
build Mostly for build PRs: when changes only to data files post `build-mondo-ingest`; no code changes
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant