Preserve pre-existing index templates (#1900). #1912

fressi-elastic · 2025-02-05T10:59:15Z

Preserve pre-existing index templates (#1900).

When opening an EsMetricsStore, it creates the index template if either of the following is true:

- the index template doesn't exist
- the reporting/datastore.overwrite_existing_templates option is true

It preserves the existing template in all other cases.

gareth-ellis · 2025-02-06T08:37:27Z

Could we get an update to the docs done too, to explain this feature? Obviously we want to have it tested too before we merge

fressi-elastic · 2025-02-06T10:34:22Z

> Could we get an update to the docs done too, to explain this feature? Obviously we want to have it tested too before we merge

I tested it and it works as expected in all cases except one: when overwrite_existing_templates is false, it behaves as if it were true. I guess the code is parsing it as a string instead of a boolean. I am investigating.
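The symptom described above matches a common Python pitfall: a value read from an INI file is a string, and any non-empty string (including "false") is truthy. The following is a minimal sketch of the pitfall and one possible fix. `parse_bool` is a hypothetical helper written for illustration only; it is not Rally's actual code.

```python
def parse_bool(raw: str) -> bool:
    """Hypothetical helper: map common INI spellings to a bool."""
    normalized = raw.strip().lower()
    if normalized in ("true", "yes", "1"):
        return True
    if normalized in ("false", "no", "0"):
        return False
    raise ValueError(f"Cannot interpret {raw!r} as a boolean")


# What an INI parser typically hands back: a string, not a bool.
raw_value = "false"

# Pitfall: a non-empty string is always truthy, so this evaluates to True.
naive = bool(raw_value)

# Explicit parsing yields the intended value (False).
parsed = parse_bool(raw_value)
```

Raising on unknown spellings, rather than silently defaulting, makes a typo in the config file visible at startup.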

tests/config_test.py

gbanasiak · 2025-02-19T14:09:37Z

In addition to documentation, I would also try adding the new modules here to enable the full scope of mypy checks:

rally/pyproject.toml

Lines 200 to 206 in 2ef33eb

    
    [[tool.mypy.overrides]]
    module = [
        "esrally.mechanic.team",
        "esrally.utils.modules",
        "esrally.utils.io",
        "esrally.utils.process",
    ]

Context: Until #1798, so recently on Rally's timescale, we had no annotations. They were introduced very surgically to reduce the scope. Then in #1859 we introduced mypy overrides that restore the full set of checks for selected modules: either new ones, or the ones we are revisiting. As we're planning to broaden annotations, it would be best if all new modules were added there. Ultimately the overrides should be removed.

esrally/metrics.py

gbanasiak · 2025-02-21T13:09:44Z

CI failure will be addressed by elastic/rally-tracks#748.

fressi-elastic · 2025-02-23T05:33:21Z

I enabled type checking in more modules (including config.py) and improved the unit tests. While doing that I cleaned up the Config class to simplify it. Its unit test is still a mess in my opinion, but I would avoid refactoring it in this change because it is out of scope and this PR is getting too big. I am considering splitting this PR into more parts to speed up the review process. Do you think it would help?

.pre-commit-config.yaml

gbanasiak · 2025-02-24T11:57:25Z

> I am considering splitting this PR into more parts to speed up the review process. Do you think it would help?

I think config.py simplification and its merge with types.py should go into a separate PR with a more representative title.

gbanasiak · 2025-02-24T12:24:46Z

For example, it seems we're dropping some useful tests from types_test.py? I think making sure the list of literals is sorted, and making sure that every literal is actually present somewhere in the code, are useful. But I think this discussion should not block the index template change.

fressi-elastic · 2025-02-27T14:39:40Z

> For example, it seems we're dropping some useful tests from types_test.py? I think making sure the list of literals is sorted, and making sure that every literal is actually present somewhere in the code, are useful. But I think this discussion should not block the index template change.

@gbanasiak @gareth-ellis @favilo I think using Literals to define sections and keys in the config file has been a somewhat naive and incomplete solution. A better one could be adopted (for example, using dataclasses and ad hoc section importers on top of a generic INI file parser).

We could have, for example, a loader for every section, implemented near the code that is going to use it (to preserve code modularity). The INI file is first converted to a dict of dicts and then passed to a dataclass before being used in the code. This way regular linters protect us from refactoring errors. There is no strong reason not to have independent conversion (dict to object) functions for every individual section, given a common dict of dicts built from the original file.

As for scopes, it is enough to build the final objects from multiple sets of dictionaries (one for every scope), applied in the right order to preserve scope priority.
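The proposal above could look roughly like the sketch below. It is only an illustration under stated assumptions: `ReportingConfig`, `ini_to_dicts`, `merge_scopes` and `load_reporting` are hypothetical names, the sample INI content is made up, and nothing here reflects Rally's current code. Only the standard-library `configparser` and `dataclasses` modules are used.

```python
import configparser
from dataclasses import dataclass

# Made-up sample content standing in for an INI file.
INI_TEXT = """
[reporting]
datastore.type = elasticsearch
datastore.overwrite_existing_templates = false
"""


@dataclass(frozen=True)
class ReportingConfig:
    """Hypothetical typed view of the [reporting] section."""

    datastore_type: str = "in-memory"
    overwrite_existing_templates: bool = False


def ini_to_dicts(text: str) -> dict[str, dict[str, str]]:
    """Convert INI text into a dict of dicts (section -> key -> raw string)."""
    parser = configparser.ConfigParser()
    parser.read_string(text)
    return {name: dict(parser[name]) for name in parser.sections()}


def merge_scopes(*scopes: dict[str, dict[str, str]]) -> dict[str, dict[str, str]]:
    """Merge per-scope dicts; later arguments take priority over earlier ones."""
    merged: dict[str, dict[str, str]] = {}
    for scope in scopes:
        for section, options in scope.items():
            merged.setdefault(section, {}).update(options)
    return merged


def load_reporting(config: dict[str, dict[str, str]]) -> ReportingConfig:
    """Section loader that would live next to the code consuming it."""
    section = config.get("reporting", {})
    raw = section.get("datastore.overwrite_existing_templates", "false")
    return ReportingConfig(
        datastore_type=section.get("datastore.type", "in-memory"),
        overwrite_existing_templates=raw.strip().lower() in ("true", "yes", "1"),
    )


file_scope = ini_to_dicts(INI_TEXT)
# A narrower scope (for example command-line overrides) is applied last.
override_scope = {"reporting": {"datastore.overwrite_existing_templates": "true"}}
reporting = load_reporting(merge_scopes(file_scope, override_scope))
```

Because consumers read `reporting.overwrite_existing_templates` as an attribute instead of a string key, a type checker or linter can flag a misspelled or renamed option.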

I tried to reduce the complexity of the code, but this is still a naive solution. The tests I removed are very weak and they do not allow easy further refactoring of the code.

Could we please forget about this test module and plan a true refactor of this part of the project after this change?

gbanasiak

This works great, thank you. I left some comments inline.

I think the same logic should apply to remaining templates - rally-results, rally-races, and rally-annotations, for consistency. WDYT?

Regarding the config refactor, my personal preference would be to simply avoid it in this PR due to its size and the fact that the PR title does not match the content. I would simply use convert.to_bool() like in all other places (datastore.overwrite_existing_templates is not the first true/false setting in the INI file) and work on the refactor in another PR. Note we are not quite there yet, as we should now convert all the boolean config reads to not use convert.to_bool(), which will extend the scope further.

I don't have strong feelings about types_test.py. I think its form is due to the context in which it came to be. There were no annotations at all and they were inserted surgically. Some of these tests were just to confirm that code edits were complete.

esrally/config.py

.pre-commit-config.yaml

gbanasiak · 2025-02-28T11:55:13Z

esrally/config.py

+        if v is None or v.scope.value <= scope.value:
+            self._opts[section][key] = _V(value, scope)


That's a nice and important simplification. We lose configuration values with broader scope (lower numerical value), but we don't seem to need them? That's the type of thing I would prefer to see in a PR with a different title. Do we need scopes at all?

I realized the lower-scope values were stored but never read. Therefore I preferred to filter out lower-scope values when storing new values instead of when retrieving them. This should open the door to many more simplifications later on. For example, because we are now using a dictionary of dictionaries, we could probably use existing library code for the inner implementation, as this is how INI files are typically loaded (e.g. https://docs.python.org/3/library/configparser.html).

I can see how lower scope could be useful if we, say, iterated through multiple benchmarks with slightly different settings, and wanted to override settings per-benchmark, then remove this override, then work with another override. But we're not doing this today. I think I'm OK with it unless @favilo has a different opinion? We can return to this discussion in the expected new incoming PR.
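To make the behaviour under discussion concrete, here is a minimal sketch of filtering by scope at write time, mirroring the condition in the quoted diff. The class and member names (`Scope`, `ScopedValue`, `Config.add`, `Config.opts`) and the scope levels are assumptions for illustration and may not match the code in esrally/config.py.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Any


class Scope(Enum):
    # Hypothetical scope levels: a higher value means a narrower scope.
    APPLICATION = 1
    BENCHMARK = 2


@dataclass(frozen=True)
class ScopedValue:
    value: Any
    scope: Scope


class Config:
    def __init__(self) -> None:
        # section -> key -> single stored value (no per-scope history)
        self._opts: dict[str, dict[str, ScopedValue]] = {}

    def add(self, scope: Scope, section: str, key: str, value: Any) -> None:
        current = self._opts.setdefault(section, {}).get(key)
        # Same condition as in the diff: overwrite only when nothing is
        # stored yet, or the new value has an equal or narrower scope.
        if current is None or current.scope.value <= scope.value:
            self._opts[section][key] = ScopedValue(value, scope)

    def opts(self, section: str, key: str) -> Any:
        return self._opts[section][key].value


cfg = Config()
cfg.add(Scope.BENCHMARK, "reporting", "datastore.type", "elasticsearch")
# A broader-scope write arriving later is dropped instead of being stored.
cfg.add(Scope.APPLICATION, "reporting", "datastore.type", "in-memory")
```

The trade-off is the one raised in the review: once a narrower-scope value is stored, the broader-scope value cannot be recovered by removing the override.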

tests/config_test.py

tests/utils/pretty_test.py

esrally/metrics.py

docs/configuration.rst

docs/migrate.rst

fressi-elastic · 2025-03-01T09:17:16Z

I am still processing pending comments. I am going to split this PR.

fressi-elastic · 2025-03-03T14:06:05Z

I took out parts of this PR to some new smaller ones:

When opening an `EsMetricsStore`, it creates the index template if either of the following is true:

- the index template doesn't exist
- the `reporting/datastore.overwrite_existing_templates` option is `true`

It preserves the existing template in all other cases.

It adds a new method for getting boolean configuration options.

It logs a warning when an existing index template is being replaced.

It highlights index template differences between the existing one (if any) and the configured one (according to rally.ini).
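The behaviour described above can be sketched as follows. This is an illustration only, not the code from this PR: `FakeTemplateStore`, `template_diff` and `ensure_template` are hypothetical names, the in-memory store stands in for a real Elasticsearch client, and the diff format (a unified diff of pretty-printed JSON) is an assumption.

```python
import difflib
import json
import logging

logger = logging.getLogger("template-sketch")


class FakeTemplateStore:
    """In-memory stand-in for an Elasticsearch client (hypothetical API)."""

    def __init__(self):
        self.templates = {}

    def get(self, name):
        return self.templates.get(name)

    def put(self, name, body):
        self.templates[name] = body


def template_diff(existing, configured):
    """Return a unified diff between two templates rendered as sorted JSON."""
    old = json.dumps(existing, indent=2, sort_keys=True).splitlines()
    new = json.dumps(configured, indent=2, sort_keys=True).splitlines()
    return "\n".join(
        difflib.unified_diff(old, new, fromfile="existing", tofile="configured", lineterm="")
    )


def ensure_template(store, name, configured, overwrite):
    """Create the template if missing, replace it only when overwrite is set."""
    existing = store.get(name)
    if existing is None:
        store.put(name, configured)
        return "created"
    diff = template_diff(existing, configured)
    if overwrite:
        logger.warning("Replacing existing index template [%s]:\n%s", name, diff)
        store.put(name, configured)
        return "replaced"
    logger.info("Preserving existing index template [%s]:\n%s", name, diff)
    return "preserved"


store = FakeTemplateStore()
first = ensure_template(store, "rally-metrics", {"settings": {"shards": 1}}, overwrite=False)
second = ensure_template(store, "rally-metrics", {"settings": {"shards": 2}}, overwrite=False)
```

With `overwrite=False`, the second call leaves the stored template untouched and only reports the differences.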

fressi-elastic force-pushed the issue/1900 branch from 63a06d5 to 5a38d2e Compare February 5, 2025 13:49

gareth-ellis requested a review from a team February 5, 2025 14:02

fressi-elastic changed the title ~~Issue/1900~~ Preserve pre-existing index templates (#1900). Feb 5, 2025

favilo approved these changes Feb 5, 2025

gareth-ellis added the enhancement and highlight labels Feb 6, 2025

fressi-elastic force-pushed the issue/1900 branch 5 times, most recently from 6e1cffa to 2ed976a Compare February 7, 2025 15:32

favilo reviewed Feb 7, 2025

tests/config_test.py (outdated, resolved)

fressi-elastic force-pushed the issue/1900 branch 4 times, most recently from 9ed48b4 to c0c373a Compare February 13, 2025 09:29

gareth-ellis requested a review from a team February 13, 2025 09:47

fressi-elastic requested review from favilo and gareth-ellis February 13, 2025 10:28

fressi-elastic force-pushed the issue/1900 branch 2 times, most recently from 1fa76d8 to d1e7004 Compare February 13, 2025 14:47

gbanasiak reviewed Feb 20, 2025

esrally/metrics.py (outdated, resolved)

fressi-elastic force-pushed the issue/1900 branch 4 times, most recently from 95de3dd to fef436a Compare February 23, 2025 05:27

fressi-elastic commented Feb 23, 2025

.pre-commit-config.yaml (outdated, resolved)

fressi-elastic force-pushed the issue/1900 branch 2 times, most recently from 3ef0c5d to e07cf14 Compare February 23, 2025 05:58

fressi-elastic force-pushed the issue/1900 branch from e07cf14 to d904b79 Compare February 27, 2025 04:49

fressi-elastic force-pushed the issue/1900 branch 3 times, most recently from dba9e78 to 13a3fa4 Compare February 28, 2025 12:26

gbanasiak reviewed Feb 28, 2025

fressi-elastic force-pushed the issue/1900 branch from 13a3fa4 to b720f0a Compare March 1, 2025 06:36

fressi-elastic force-pushed the issue/1900 branch 8 times, most recently from 5ea8073 to 7eb55d2 Compare March 3, 2025 13:50

fressi-elastic force-pushed the issue/1900 branch 5 times, most recently from 8e8fbb2 to 79b3907 Compare March 4, 2025 14:03

fressi-elastic force-pushed the issue/1900 branch from 79b3907 to 92b190d Compare March 4, 2025 14:04
