Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Implement missing JSON export for OpenSearch version #871

Merged
merged 3 commits into from
Feb 28, 2025

Conversation

lonvia
Copy link
Collaborator

@lonvia lonvia commented Feb 28, 2025

That adds the as-of-yet unimplemented JSON export to OpenSearch.

There is a new option to hand in '-' as the filename to dump the data to standard out. This is needed to pack these files on the fly. To make that correctly work in turn, log information is now directed to standard error.

Note that these export dump the JSON that is used to import documents in the ES database. That means that the dumps are not compatible when done from different versions of Photon. To account for that, I have now added a "header" document which contains a version field and may get other global information in later versions.

The dumped documents now also include the document ID. This should make it possible to recreate a database exactly from the dump. A bzipped dump of the planet has only 11GB. So this gives us an interesting opening for importing custom filtered databases or databases with extra indexes for structured search, for example. Time to dust off PR #438.

@lonvia lonvia merged commit 43ecb5f into komoot:master Feb 28, 2025
4 checks passed
@lonvia lonvia deleted the json-export-opensearch branch February 28, 2025 10:14
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant