
Mobo enhancement #248

Merged
merged 9 commits into from
Oct 29, 2024

Conversation

roussel-ryan
Collaborator

  • Enhances multi-objective Bayesian optimization by adding a `use_pf_as_initial_points` flag. When enabled, points on the observed Pareto frontier are used to initialize optimization of the EHVI acquisition function, which substantially speeds up convergence to the Pareto front in high-dimensional input spaces.
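A minimal sketch of the idea in plain NumPy, with illustrative helper names (the actual implementation uses BoTorch utilities such as `is_non_dominated`): select the non-dominated observed inputs and, subsampled down to `num_restarts`, hand them to the acquisition optimizer as initial conditions.

```python
import numpy as np

def pareto_mask(obj):
    """Boolean mask of non-dominated rows of `obj` (maximization convention)."""
    n = obj.shape[0]
    mask = np.ones(n, dtype=bool)
    for i in range(n):
        # row i is dominated if some other row is >= in all objectives
        # and strictly > in at least one
        dominated = np.all(obj >= obj[i], axis=1) & np.any(obj > obj[i], axis=1)
        mask[i] = not dominated.any()
    return mask

def pf_initial_points(train_x, train_obj, num_restarts, rng=None):
    """Use observed Pareto-front inputs to seed acquisition optimization."""
    rng = rng or np.random.default_rng(0)
    pf_x = train_x[pareto_mask(train_obj)]
    if len(pf_x) > num_restarts:
        # randomly subsample the frontier down to the restart budget
        idx = rng.choice(len(pf_x), size=num_restarts, replace=False)
        pf_x = pf_x[idx]
    return pf_x
```

If the frontier holds fewer points than `num_restarts`, the sketch simply returns all of them; the remaining restarts would fall back to the usual random initialization.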

@nikitakuklev
Collaborator

nikitakuklev commented Oct 28, 2024

A general comment: I'd suggest we do a more complete implementation with more randomization as a follow-up PR. There is also the question of how to compute feasibility: either drop infeasible points outright, as is done now, or sample each candidate to determine feasibility probabilistically, which could handle borderline candidates more accurately.
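The two feasibility strategies mentioned above can be contrasted in a small sketch (function names are illustrative; constraints are assumed feasible when c(x) <= 0):

```python
import numpy as np

def feasible_hard(constraint_obs, tol=0.0):
    """Drop points whose *observed* constraint value violates c(x) <= tol."""
    return constraint_obs <= tol

def feasible_prob(constraint_samples, tol=0.0, min_prob=0.5):
    """Keep points whose posterior probability of feasibility exceeds
    `min_prob`, estimated from Monte-Carlo constraint samples drawn from
    the model (shape: n_samples x n_points)."""
    p_feas = (constraint_samples <= tol).mean(axis=0)
    return p_feas >= min_prob
```

The probabilistic variant lets a borderline point survive when the model still assigns it a decent chance of being feasible, at the cost of drawing posterior samples.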

More specifically:

  • Botorch does a more involved process of posterior sampling plus Pareto downselection in the pruning function for the qNEHVI family - see here. Note that if any of the samples at a baseline point are infeasible, the point is set equal to the reference point and excluded from the Pareto front. This encodes the model's noise knowledge into the surviving candidates and should in general be better than removing Pareto points based on observed data alone, at the cost of some performance.

  • In fact, since prune_inferior_points_multi_objective will frequently get called in the acquisition function, it might be useful to cache the baseline as part of PF initialization and feed the result in as the new X_baseline with prune_baseline=False, saving the repeated Pareto-front computation.

  • For choosing which points to use when len(initial_points) > num_restarts, it might be good to use stochastic behavior. First, use the "around best" logic (see here) to generate raw_pf_samples points, with raw_pf_samples = num_restarts * factor. Then apply the same stochastic logic as Botorch's raw_samples parameter: evaluate the acquisition function at all points and pick exactly num_restarts of them probabilistically, biased toward higher acquisition values (see here). One can argue that acquisition values will be quite similar around each point unless perturbations are large, so this elaborate procedure may not help much. A simpler alternative is to generate at most num_restarts candidates without downselection, for example by picking num_restarts Pareto points with the weighted procedure above and then generating one nudged candidate per point. This needs benchmarking to see whether it is worth it. The overall goal is to make initialization not use completely random parts of the Pareto front, but be softly biased toward more promising areas.

  • It would be interesting to plot how many points are on the Pareto front vs. dimensionality. I have a feeling num_restarts might need to be scaled up a lot for larger problems if the fully random scheme is kept.
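The caching idea above (reuse the pruned baseline instead of re-pruning inside every acquisition-function construction) can be sketched as a small wrapper; `prune_fn` stands in for BoTorch's `prune_inferior_points_multi_objective`, and the fingerprinting scheme is illustrative:

```python
class BaselineCache:
    """Cache the pruned Pareto baseline once per data set and reuse it.

    Callers pass the cached result as X_baseline with prune_baseline=False,
    so the expensive pruning runs only when the training data changes.
    """

    def __init__(self, prune_fn):
        self._prune_fn = prune_fn
        self._key = None
        self._baseline = None

    def get(self, X):
        # cheap fingerprint of the training inputs (assumes a NumPy-like array)
        key = (X.shape, X.tobytes())
        if key != self._key:
            self._baseline = self._prune_fn(X)  # expensive Pareto pruning
            self._key = key
        return self._baseline
```

The same effect could be achieved by recomputing the baseline once per generation step and threading it through explicitly; the wrapper just makes the invariant (one prune per data set) explicit.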
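The weighted downselection described above (pick num_restarts points with probability biased toward higher acquisition values, in the spirit of BoTorch's raw-samples initialization heuristic) might look like the following; the function name and the Boltzmann weighting with temperature `eta` are illustrative:

```python
import numpy as np

def weighted_restart_selection(candidates, acq_values, num_restarts,
                               eta=1.0, rng=None):
    """Sample `num_restarts` rows of `candidates` without replacement,
    with probability increasing in the acquisition value."""
    rng = rng or np.random.default_rng(0)
    # standardize, then Boltzmann-weight: higher acq -> higher probability
    z = (acq_values - acq_values.mean()) / (acq_values.std() + 1e-12)
    probs = np.exp(eta * z)
    probs /= probs.sum()
    idx = rng.choice(len(candidates), size=num_restarts, replace=False, p=probs)
    return candidates[idx]
```

Setting `eta` large approaches a greedy top-k pick; `eta = 0` recovers uniform sampling, so the "softly biased" behavior argued for above sits between the two.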
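The last point is easy to probe empirically: for i.i.d. random objectives, the fraction of non-dominated points grows quickly with the number of objectives, which supports scaling num_restarts with problem size. A hedged sketch of such an experiment:

```python
import numpy as np

def pareto_fraction(n_points, n_obj, rng):
    """Fraction of i.i.d. uniform points that are non-dominated
    (maximization convention)."""
    y = rng.random((n_points, n_obj))
    nd = 0
    for i in range(n_points):
        dominated = np.all(y >= y[i], axis=1) & np.any(y > y[i], axis=1)
        nd += not dominated.any()
    return nd / n_points
```

Sweeping `n_obj` (and, for real problems, input dimensionality) and plotting the resulting fraction would give the curve suggested above.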

xopt/vocs.py Outdated
observable_data = self.observable_data(data, "")

if return_valid:
feasable_status = self.feasibility_data(data)["feasible"]

typo


@nikitakuklev left a comment


Minor concerns with moving to GPU and a lot of repeated calculations; probably a small cost compared to the main MOBO loop time. Otherwise LGTM.

supports_batch_generation: bool = True
use_pf_as_initial_points: bool = Field(
False,
description="flag to specify if pf front points are to be used during "
typo

use_pf_as_initial_points=True,
)
gen.add_data(test_data)
gen._get_initial_conditions()
verify that the infeasible candidate did not make it into the initial conditions
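The requested check could look like the following helper (names are illustrative, not part of the Xopt API): after generating initial conditions from the Pareto front, assert that no returned point matches the known-infeasible row of the test data.

```python
import numpy as np

def assert_infeasible_excluded(initial_conditions, infeasible_x, atol=1e-8):
    """Fail if any generated initial condition coincides with the
    known-infeasible input of the test data set."""
    for x in np.atleast_2d(initial_conditions):
        assert not np.allclose(x, infeasible_x, atol=atol), \
            "infeasible candidate leaked into initial conditions"
```

In the test above this would run right after `gen._get_initial_conditions()`, with `infeasible_x` taken from the deliberately infeasible row of `test_data`.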

@@ -67,7 +70,7 @@ class GridOptimizer(NumericalOptimizer):
10, description="number of grid points per axis used for optimization"
)

def optimize(self, function, bounds, n_candidates=1):
def optimize(self, function, bounds, n_candidates=1, **kwargs):
assert empty kwargs if none are expected
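One way to implement the suggestion, sketched as a standalone stub (the real `GridOptimizer.optimize` would run the grid search where the placeholder return sits): fail fast on unexpected keyword arguments instead of silently ignoring them.

```python
def optimize(function, bounds, n_candidates=1, **kwargs):
    """Illustrative stub of the reviewer's suggestion: accept **kwargs for
    interface compatibility, but reject any that this optimizer does not use."""
    if kwargs:
        raise TypeError(f"optimize() got unexpected kwargs: {sorted(kwargs)}")
    return n_candidates  # placeholder for the actual grid optimization
```

This keeps the broadened signature compatible with callers that pass extra options to other optimizers, while surfacing typos and unsupported options immediately.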

)
non_dominated = is_non_dominated(obj_data)

weights = set_botorch_weights(self.vocs).to(**self._tkwargs)[
can you reuse weight from _get_scaled_data() and avoid recomputing?

]
variable_data = torch.tensor(var_df[self.vocs.variable_names].to_numpy())
objective_data = torch.tensor(obj_df[self.vocs.objective_names].to_numpy())
weights = set_botorch_weights(self.vocs).to(**self._tkwargs)[
I haven't benchmarked this, but moving to GPU might be quite slow for our small dataset sizes

@roussel-ryan
Collaborator Author

@nikitakuklev for the record here, I'll reiterate that we are happy to incorporate your suggested improvements to this process in a future PR

@roussel-ryan merged commit 8e54995 into main Oct 29, 2024
14 checks passed
@roussel-ryan deleted the mobo-enhancement branch October 29, 2024 21:38