Should each algorithm have its own configuration #367

jdeschamps · 2025-01-22T14:14:22Z

Description

We originally split the configuration in algorithm-specific configurations (child classes) to be able to perform validation across sub-configurations:

from pydantic import BaseModel

class Configuration(BaseMode):
    algorithm_config: UNetBasedAlgorithm | LVAEBasedAlgorithm
    data_config: GeneralDataConfig

# N2V
class N2VAlgorithm(UNetBasedAlgorithm):
    ...

class N2VDataConfig(GeneralDataConfig):
    ...

class N2VConfiguration(Configuration):
    algorithm_config: N2VAlgorithm
    data_config: N2VDataConfig
    # validate parameters from N2VAlgorithm and N2VDataConfig against each other


# CARE
class CAREAlgorithm(UNetBasedAlgorithm):
    ...

class CAREConfiguration(Configuration):
    algorithm_config: CAREAlgorithm

@melisande-c correctly pointed out that in #365 we are now removing the need for N2VDataConfig. The only difference between N2VConfiguration and CAREConfiguration are now:

Algorithm sub-configuration parameter
Methods to generate citations, references, summary and friendly name for BMZ export (e.g. here)

There will be no more validation across algorithm and data, at least for now.

Which part of the code?

Algorithm configuration declared in algorithm-specific configuration:

careamics/src/careamics/config/n2v_configuration.py

Line 86 in 9a027a5

algorithm_config: N2VAlgorithm
Methods to generate metadata for BMZ export:

careamics/src/careamics/config/n2v_configuration.py

Line 189 in 9a027a5

def get_algorithm_references(self) -> str:

Potential solutions

Therefore, a few aspects to discuss:

Is having a clean class per algorithm to generate the metadata (references, citations, description) for the BMZ enough to justify the existence of N2VAlgorithm, CAREAlgorithm etc. ?
Where will the noise model end up in the PN2V configuration? Probably in algorithm_config as well isn't it?
Do we foresee any need for specific data or training configuration for the LVAE-based algorithms (HDN, microSplit)?

Solution 1: Remove algorithm-specific configurations

from pydantic import BaseModel

class Configuration(BaseMode):
    algorithm_config: N2VAlgorithm | N2NAlgorithm | CAREAlgorithm | PN2VAlgorithm | MicroSplitAlgorithm | HDNAlgorithm
    data_config: DataConfig

The main advantage is really to have a single Configuration type. Code-base will not change much:

Less classes in the configurations.
We will need a new space for all the BMZ-related metadata creation (which is algorithm specific).
Doc is easier to write.

API will not really be impacted.

Solution 2: Change nothing

We keep a separated structure per algorithm. Advantages:

Clean separation of the BMZ metadata
We still have the possibility to validate across sub-configurations

This comes at the cost of the multiplication of configurations, and a more complex documentation.

Did I miss anything? Opinions?

@CatEek @melisande-c @veegalinova @federico-carrara

The text was updated successfully, but these errors were encountered:

melisande-c · 2025-01-24T14:13:26Z

Ok so I did forget about the citations, refs etc. when I made the comment; but I guess moving them to the algorithm model classes, does make sense? (citations don't relate to the data).

Even if the configurations for LVAE based algorithms end up being split for data + algorithm validation, that doesn't mean the CARE-family configurations also have to be split.

But this is not a massive priority since it works fine as is, but if the number of configuration classes starts to become cumbersome we may want to consider it.

jdeschamps · 2025-01-27T10:18:25Z

That makes sense, I am convinced!

It will be an easy enough PR.

jdeschamps added the refactoring Streamline and improve source code label Jan 22, 2025

jdeschamps mentioned this issue Jan 22, 2025

Pydantic errors are overwhelming in convenience functions #356

Open

jdeschamps added this to the v0.1.0 milestone Jan 27, 2025

jdeschamps self-assigned this Jan 27, 2025

jdeschamps added this to v0.1.0 Jan 27, 2025

jdeschamps moved this to Backlog in v0.1.0 Jan 27, 2025

jdeschamps moved this from Backlog to Todo in v0.1.0 Jan 27, 2025

jdeschamps mentioned this issue Jan 30, 2025

Configuration discrimination fails if Pydantic algorithm models are passed #384

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Should each algorithm have its own configuration #367

Should each algorithm have its own configuration #367

jdeschamps commented Jan 22, 2025

melisande-c commented Jan 24, 2025

jdeschamps commented Jan 27, 2025

Should each algorithm have its own configuration #367

Should each algorithm have its own configuration #367

Comments

jdeschamps commented Jan 22, 2025

Description

Which part of the code?

Potential solutions

Solution 1: Remove algorithm-specific configurations

Solution 2: Change nothing

melisande-c commented Jan 24, 2025

jdeschamps commented Jan 27, 2025