Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Feature/multiprocess #19

Open
wants to merge 33 commits into
base: dev
Choose a base branch
from
Open

Feature/multiprocess #19

wants to merge 33 commits into from

Conversation

jishminor
Copy link
Member

This is a large overhaul of the backend, changing the implementation to be all contained within the tritonserver process, to splitting out and managing a child process per tflite model instance. This PR closes #5 as there will be a copy of the ACL scheduler singelton instance per model now.

Tensorpipe is used to manage the transport of the input tensor data to the respective model instance processes, and as it's written, only one memcpy into a shared memory channel is needed to do this, meaning the overhead is minimal.

@jishminor
Copy link
Member Author

One issue with tensorpipe shared mem channel is that both sides of the pipe busy wait, and repeatedly call sched_yield as mentioned here. The flame graph for one side of the pipe where the TP_SHM_Reactor is eating an entire core busy waiting looks like this:
perf

jishminor added 19 commits May 31, 2023 10:10
Signed-off-by: Josh Minor <[email protected]>
Signed-off-by: Josh Minor <[email protected]>
Signed-off-by: Josh Minor <[email protected]>
Signed-off-by: Josh Minor <[email protected]>
Signed-off-by: Josh Minor <[email protected]>
Signed-off-by: Josh Minor <[email protected]>
@jishminor jishminor force-pushed the feature/multiprocess branch 2 times, most recently from 631e840 to 9efcdfe Compare July 28, 2023 21:19
@jishminor jishminor force-pushed the feature/multiprocess branch from 9efcdfe to 0ba5134 Compare August 2, 2023 19:12
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant