[Core] Adding option to avoid Plasma Fetch and Deserialisation + e2e benchmarks #6

alindkhare · 2021-02-17T00:07:12Z

Why are these changes needed?

Related issue number

Checks

I've run scripts/format.sh to lint the changes in this PR.
I've included any doc changes needed for https://docs.ray.io/en/master/.
I've made sure the tests are passing. Note that there might be a few flaky tests, see the recent failures at https://flakey-tests.ray.io/
Testing Strategy
- Unit tests
- Release tests
- This PR is not tested :(

Add GPU num_gpus=1 for replicas Intermediate tensor is torch tensor Only 1 iteration is enough for calculating throughput and closed loop latency Image as raw bytes Removed the need to pass args as list for enqueue remote Added server code Error Fix Added server code Fix bug for subprocess submission Few more print fixes Fire more queries to see

Final config Final Config

alindkhare · 2021-02-17T00:09:37Z

src/ray/core_worker/core_worker.cc

+  memory_store_->GetAsync(
+      object_id, [python_future, success_callback, fallback_callback, object_id,
+                  fetch_plasma_data](std::shared_ptr<RayObject> ray_object) {
+        if (ray_object->IsInPlasmaError() && fetch_plasma_data) {
+          fallback_callback(ray_object, object_id, python_future, fetch_plasma_data);
+        } else {
+          success_callback(ray_object, object_id, python_future);
+        }
+      });


@atumanov This is the main change that I made

alindkhare · 2021-02-17T01:22:50Z

Performance Benchmark Results

Latency Table

Implementation	[95th, 99th, 100th] ms latency percentile
Vanilla Ray Serve	[ 58.30999687 91.6196993 135.20691544]
Reference	[44.97625791 51.53332099 88.14558759]
Reference + Pipeline Orch.	[46.11008726 55.16686797 97.10958973]
Ray Hack + Callbacks + Pipeline Orch.	[42.77230315 45.28134219 50.52455887]

alindkhare added 7 commits February 11, 2021 11:55

Added no serialization callbacks + perf benchmarks

fa47f54

Data return

12df662

Added no plasma fetch option

0b6d7ec

Added in-between tensor pipeline

1b886da

Added image prepoc example

24c853d

Added numpy transfer

480869f

Final config Final Config

alindkhare commented Feb 17, 2021

View reviewed changes

Fixed a plasma callback bug

a39cb4a

Added fan-in/out router

53301c4

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Core] Adding option to avoid Plasma Fetch and Deserialisation + e2e benchmarks #6

[Core] Adding option to avoid Plasma Fetch and Deserialisation + e2e benchmarks #6

alindkhare commented Feb 17, 2021

alindkhare Feb 17, 2021

alindkhare commented Feb 17, 2021

[Core] Adding option to avoid Plasma Fetch and Deserialisation + e2e benchmarks #6

Are you sure you want to change the base?

[Core] Adding option to avoid Plasma Fetch and Deserialisation + e2e benchmarks #6

Conversation

alindkhare commented Feb 17, 2021

Why are these changes needed?

Related issue number

Checks

alindkhare Feb 17, 2021

Choose a reason for hiding this comment

alindkhare commented Feb 17, 2021

Performance Benchmark Results

Latency Table