add a validate=False option for graph_objects and px figures #1812

Open
michaelbabyn opened this issue Oct 10, 2019 · 16 comments
Labels
feature something new P3 backlog

Comments

@michaelbabyn (Contributor)

There's already an issue outlining the effect that graph_objects validation has on plot-generation time. Users can bypass this performance hit by replacing graph_objects with plain dicts and displaying the plot with plotly.offline.iplot(fig, validate=False), or, if they are creating graphs in Dash, they can forgo the plotly.py library altogether and pass a dict as their Graph component's figure argument.

This workaround can greatly improve the performance of Dash apps, but it means that Dash users with expensive graphs have to choose between px/plotly.py's update methods and optimally fast code.

I wonder if a way to turn off validation, especially in Dash apps, would help Dash users get the best of both worlds.
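A minimal sketch of the dict-based workaround described above (the trace values are placeholders):

```python
# Build the figure as a plain dict instead of via plotly.graph_objects,
# so no Python-side validation runs while constructing it.
fig = {
    "data": [{"type": "scattergl", "x": [1, 2, 3], "y": [4, 5, 6]}],
    "layout": {"title": {"text": "no validation overhead"}},
}

# In a notebook: plotly.offline.iplot(fig, validate=False)
# In Dash:       dcc.Graph(figure=fig)  # plain dicts are accepted as-is
```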

cc @matthewchan15

@emmanuelle (Contributor)

To be checked: can we do this and still keep the magical underscore methods?

Also possible: a half-way point where we disable validation of the data arrays only.

Note that import time is also a big part of the lag when developing.

@parksj10

Any update on this? You certainly have my +1. I'm using large datasets with Datashader and validation is taking seconds. I'll likely have to retrofit my code with the dict workaround :(

@nicolaskruchten (Contributor)

@parksj10 can you confirm you're seeing performance issues with plotly 4.7 or higher? We made a number of performance improvements in 4.7, so I just want to make sure :)

@parksj10

@nicolaskruchten I'm running plotly 4.8.1. I've attached a cProfile screenshot below; you can see that half the figure-generation time is spent validating. In case you're interested, I've also attached the cProfile .dat file. Let me know if I can do anything else to help or provide other information. I think it would be rather difficult to create a low-complexity working example from my app, but perhaps @michaelbabyn's examples could be useful in this regard.

[Screenshot, 2020-06-25: cProfile output showing roughly half of figure-generation time spent in validation]

temp.dat.zip
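For reference, a profiling run like the one in the screenshot can be reproduced along these lines. The profiled function here is a stand-in for the app's actual figure construction, not the original code:

```python
import cProfile
import io
import pstats

def build_figure():
    # Stand-in for the app's go.Figure(...) construction; substitute your
    # own figure-building call here to get the real validation breakdown.
    return {"data": [{"x": list(range(100_000)), "y": list(range(100_000))}]}

profiler = cProfile.Profile()
profiler.enable()
build_figure()
profiler.disable()

buf = io.StringIO()
pstats.Stats(profiler, stream=buf).sort_stats("cumulative").print_stats(10)
print(buf.getvalue())  # look for validator frames near the top
```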

@nicolaskruchten (Contributor)

Thanks! This is something we should fix, and we’d appreciate any help :)

@ndrezn (Member) commented Dec 5, 2022

I'm running into this a few years later 🙂. This causes major issues when working with, e.g., choropleth maps with large GeoJSON files, where you end up with giant JSON blobs that certainly do not need to be validated.

I imagine this is a pretty common issue for folks working with charts with many points, and I had no idea this was even a thing until today. It'd be great at least to document this behaviour or raise awareness of it until it's possible to disable validation. Maybe even on https://plotly.com/python/webgl-vs-svg/?

@alexcjohnson (Collaborator)

I like the idea of a three-level approach: full validation (current behavior), top-level validation (don’t dig into data arrays or nested objects like GeoJSON), and no validation.
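A toy sketch of what those three levels could look like. The schema, function name, and `level` values are illustrative only, not an actual plotly.py API:

```python
# Toy schema; real plotly validation is driven by generated validator classes.
KNOWN_TRACE_KEYS = {"type", "x", "y", "z", "geojson", "locations"}

def validate_figure(fig, level="full"):
    """level: 'full' (keys + values), 'top' (keys only), or 'none'."""
    if level == "none":
        return fig  # trust the caller entirely
    for trace in fig.get("data", []):
        unknown = set(trace) - KNOWN_TRACE_KEYS
        if unknown:
            raise ValueError(f"unknown trace properties: {sorted(unknown)}")
        if level == "full":
            # The expensive part: walk every element of the data arrays.
            for value in list(trace.get("x", [])) + list(trace.get("y", [])):
                if not isinstance(value, (int, float, str)):
                    raise ValueError(f"bad data value: {value!r}")
    return fig

fig = {"data": [{"type": "scatter", "x": [1, 2], "y": [3, 4]}]}
validate_figure(fig, level="top")  # cheap: never touches the arrays
```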

@ndrezn (Member) commented Dec 6, 2022

(I also want to note that I'm seeing roughly 1 second of validation time per MB of object. With GeoJSON we often see blobs of 60 MB+, which just destroys your app performance.)

Having the top-level validation option seems perfect!

@nicolaskruchten (Contributor)

So, independently of the validation issue: if the GeoJSONs are static, you should always load them from assets in a Dash app, for caching purposes. Basically, pass in the URL rather than the GeoJSON blob.
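A sketch of that pattern. The file name and values are made up; this works because the `geojson` attribute accepts a URL string, which plotly.js fetches in the browser where it can be cached:

```python
# Reference the static GeoJSON by URL (served from the Dash assets/ folder)
# instead of embedding the parsed blob in the figure itself.
fig = {
    "data": [{
        "type": "choropleth",
        "geojson": "/assets/counties.geojson",  # hypothetical file name
        "locations": ["01001", "01003"],
        "z": [12.5, 7.2],
    }],
    "layout": {"geo": {"fitbounds": "locations"}},
}
```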

@nicolaskruchten (Contributor)

> Having the top-level validation option seems perfect!

Yes, of course, although the last time we tried, we were unable to make it work :)

@ndrezn (Member) commented Dec 6, 2022

@nicolaskruchten -- yes, I'm mostly able to get around this issue by using OperatorTransform from dash-extensions, combined with defining my Dash apps with objects. Serving from assets/ would make it even better, though. Great idea.

My main concern is that none of this is intuitive. It's also not intuitive that you can boost the performance of figures with many points in Dash apps just by switching how they are defined, which is why it'd be great to at least see this behaviour documented.

@ndrezn (Member) commented Dec 9, 2022

(cc @red-patience / @LiamConnors on that last point maybe)

@hannahker

Throwing my support behind this one! Even if it takes some time to add a validate=False param, in the meantime it would be really helpful to have documentation alerting people that validation can be a bottleneck in chart performance and that you can work around it by creating the dict directly.

Both this trick and passing data as a static asset URL have massively improved the performance of my graphs, and I wouldn't have known to do either of these things if I hadn't been pointed to this issue.

cc @red-patience

@bmaranville (Contributor) commented Mar 13, 2023

I think I have a related issue affecting subplots.make_subplots, where the execution time increases non-linearly with the number of plots: a 20x20 grid of plots takes 14 seconds, and a 21x21 grid takes 18 seconds, for example. This is for an empty figure created with make_subplots, e.g.

```python
from plotly.subplots import make_subplots

%time fig = make_subplots(rows=20, cols=20)
```

From profiling, the vast majority of the time is spent in the _ret function of basedatatypes.py, and all of the time in that function is spent in find_closest_string. I think this is because an error message for a missing key is pre-calculated, which is related to validation. From what I can see in the profiling, there would be a >90% speedup if validation could be disabled.

EDIT: I think I will make a new issue for this: see #4100

@nicolaskruchten (Contributor)

Thanks for that profiling! We could probably speed things up by only computing error strings when we know there's an error...
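A sketch of that idea: compute the did-you-mean suggestion only on the failure path, so the fast path does no string work at all. The names here are illustrative, not plotly.py internals:

```python
import difflib

def get_prop(props, key):
    try:
        return props[key]  # fast path: no error-string work at all
    except KeyError:
        # Only now pay for the closest-match suggestion.
        matches = difflib.get_close_matches(key, props.keys(), n=1)
        hint = f" Did you mean {matches[0]!r}?" if matches else ""
        raise KeyError(f"invalid property {key!r}.{hint}") from None

props = {"xaxis": 1, "yaxis": 2}
print(get_prop(props, "xaxis"))  # → 1
```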

@gvwilson gvwilson self-assigned this May 23, 2024
@gvwilson gvwilson removed their assignment Aug 2, 2024
@gvwilson gvwilson added feature something new P3 backlog and removed enhancement labels Aug 12, 2024
@gvwilson gvwilson changed the title A validate=False an option for graph_objects and px Figures? add a validate=False option for graph_objects and px figures Aug 12, 2024
9 participants