Spmc queue #74

bartoszmodelski · 2023-06-02T19:16:31Z

Motivation

Currently, the blessed data structure for Domainslib is the deque. It's LIFO, and while that's optimal for locality, it's quite easy to shoot oneself in the foot with liveness issues. For example, if a web server is processing a stream of requests and starts some compute in the background, all requests will eventually have to wait for the compute to finish. Even knowing about the issue, there's not much that can be improved here (without re-engineering the workload or creating multiple schedulers) because LIFO, by design, keeps working on the existing subtree of tasks until it is done. FIFO, on the other hand, juggles all subtrees and treats a single task as the unit of work. I believe it's a much safer choice for the default scheduling strategy.

Thus, this PR adds a simple single-producer multi-consumer queue inspired by the work-stealing deque and Golang's scheduler. It's useful as a general structure but has been written mostly with Domainslib in mind.

Design

The design is similar to the deque. The array is not atomic. The writer first inserts the item and then increments the tail index. Stealers first read the item in the array and then try to claim it with a CAS on the head. Thus the writer operates on the region of the array between tail (incl.) and head (excl.), while stealers operate between head (incl.) and tail (excl.). A stealer may do a non-linearizable read of the array, but that value won't be returned to the caller because the CAS fails in such a case.
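As a rough illustration of the enqueue and steal paths described above, here is a minimal sketch. It is not the code from this PR: the record layout, the names `create`, `enqueue` and `steal`, and the power-of-two capacity are all assumptions made for the example.

```ocaml
(* Hypothetical sketch of the design above; not the PR's actual code. *)
type 'a t = {
  head : int Atomic.t;      (* next index to take; stealers advance it with CAS *)
  tail : int Atomic.t;      (* next free index; written only by the owner *)
  mask : int;               (* capacity - 1; capacity is assumed a power of two *)
  array : 'a option array;  (* plain, non-atomic storage *)
}

let create ~size_exponent =
  let size = 1 lsl size_exponent in
  { head = Atomic.make 0; tail = Atomic.make 0; mask = size - 1;
    array = Array.make size None }

(* Owner only: write the slot first, then publish it by bumping tail. *)
let enqueue t v =
  let tail = Atomic.get t.tail in
  let head = Atomic.get t.head in
  if tail - head > t.mask then false (* full *)
  else begin
    t.array.(tail land t.mask) <- Some v;
    Atomic.set t.tail (tail + 1);
    true
  end

(* Any domain: read the slot first, then try to claim it with a CAS on head.
   If the CAS fails, the value read may be stale and is discarded. *)
let rec steal t =
  let head = Atomic.get t.head in
  let tail = Atomic.get t.tail in
  if tail - head <= 0 then None (* empty *)
  else
    let v = t.array.(head land t.mask) in
    if Atomic.compare_and_set t.head head (head + 1) then v
    else steal t

let () =
  let q = create ~size_exponent:2 in
  assert (enqueue q 1);
  assert (enqueue q 2);
  assert (steal q = Some 1);  (* FIFO order *)
  assert (steal q = Some 2);
  assert (steal q = None)
```

For brevity the sketch never clears consumed slots, so old values stay reachable until they are overwritten; a real implementation would likely handle that.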

Local dequeue could be identical to stealing. I've changed it to first update the index and then read the array to ensure wait-freedom (in particular, to eliminate the risk of the local dequeue competing with steals). I've added further explanations in the code.
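One way to read "update the index first, then read the array" is sketched below. This is an assumption-laden illustration rather than the PR's code: the record layout (two atomic indices plus a plain array), the name `local_dequeue`, and the use of `Atomic.fetch_and_add` to claim the slot are all hypothetical choices for the example.

```ocaml
(* Hypothetical sketch; not the PR's actual code. *)
type 'a t = {
  head : int Atomic.t;      (* next index to take *)
  tail : int Atomic.t;      (* next free index; written only by the owner *)
  mask : int;               (* capacity - 1; capacity is assumed a power of two *)
  array : 'a option array;  (* plain, non-atomic storage *)
}

(* Owner only. The slot is claimed unconditionally with fetch-and-add, so the
   owner never loops on a failed CAS; the array is read only afterwards. *)
let local_dequeue t =
  let index = Atomic.fetch_and_add t.head 1 in
  if index = Atomic.get t.tail then begin
    (* The queue was empty, so head overshot tail: undo the increment.
       This assumes stealers treat head > tail as empty and therefore
       do not move head in the meantime. *)
    Atomic.set t.head index;
    None
  end
  else t.array.(index land t.mask)

let () =
  (* Build a queue that already holds two items. *)
  let q = { head = Atomic.make 0; tail = Atomic.make 2; mask = 3;
            array = [| Some "a"; Some "b"; None; None |] } in
  assert (local_dequeue q = Some "a");
  assert (local_dequeue q = Some "b");
  assert (local_dequeue q = None);
  assert (Atomic.get q.head = 2)  (* the overshoot was undone *)
```

The trade-off in this sketch is that the owner may briefly leave head past tail on an empty queue, which any concurrent stealer has to tolerate.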

The structure is wait-free for the owner of the queue and lock-free for stealers. This design should help it keep stable performance as the system becomes loaded and stealing decreases.

Testing

Tests:

DSCheck tests. I've used the granular dependency branch, where they take around 0.05s.
Standard multicore tests with multiple domains hammering the structure.

We can also add a lock-free steal-half function, which would improve work distribution on skewed workloads, but I'm keeping it simple for now.

spmc queue

844456b

bartoszmodelski requested a review from a team June 2, 2023 19:16
