Extreme slow down when using Map#lock/unlock from multiple goroutines #992

Open
BaLiKfromUA opened this issue Feb 5, 2024 · 1 comment


BaLiKfromUA commented Feb 5, 2024

Go version: 1.21
Hazelcast Go Client version: 1.4.1
Hazelcast server version: 5.1.7
Number of clients: 10
Cluster size, i.e. the number of Hazelcast cluster members: 1
OS version: Linux (inside docker-compose)
Docker version: 24.0.5
Docker-compose version: 1.29.2

Context

I am conducting experiments with Hazelcast locking on a distributed map.

In this experiment, I have 10 goroutines, each of which increments the value mapped to the same key 10,000 times.

As a reference, I use the pessimistic locking example from the documentation and the lock example from this repository (with some API changes mentioned in the client docs).

I isolated my experiment in a docker-compose environment, using the official Hazelcast image without any changes to the default Hazelcast configuration.

Code

Link

package main

import (
	"context"
	"log"
	"os"
	"strings"
	"sync"
	"time"

	"github.com/hazelcast/hazelcast-go-client"
)

func lockAndIncrement(ctx context.Context, distMap *hazelcast.Map, key string) {
	intValue := int64(0)
	// Create a new unique lock context.
	// https://pkg.go.dev/github.com/hazelcast/hazelcast-go-client#hdr-Using_Locks
	lockCtx := distMap.NewLockContext(ctx)
	// Lock the key.
	// The key cannot be unlocked without the same lock context.
	if err := distMap.Lock(lockCtx, key); err != nil {
		log.Fatalf("Error on Lock: %v", err)
	}
	// Remember to unlock the key, otherwise it won't be accessible elsewhere.
	defer distMap.Unlock(lockCtx, key)
	// The same lock context, or a derived one from that lock context must be used,
	// otherwise the Get operation below will block.
	v, err := distMap.Get(lockCtx, key)
	if err != nil {
		log.Fatalf("Error on Get: %v", err)
	}
	// If v is not nil, then there's already a value for the key.
	if v != nil {
		intValue = v.(int64)
	}
	// Increment and set the value back.
	intValue++
	// The same lock context, or a derived one from that lock context must be used,
	// otherwise the Set operation below will block.
	if err = distMap.Set(lockCtx, key, intValue); err != nil {
		log.Fatalf("Error on Set: %v", err)
	}
}

func buildConfig() hazelcast.Config {
	config := hazelcast.NewConfig()
	config.Cluster.Name = "test_lock"
	addresses, ok := os.LookupEnv("HAZELCAST_ADDRESSES")

	if !ok {
		log.Fatal("Failed to get addresses of hazecast nodes")
	}

	config.Cluster.Network.Addresses = strings.Split(addresses, ",")

	return config
}

func main() {
	time.Sleep(10 * time.Second) // just wait for node to be ready

	ctx := context.Background()
	client, err := hazelcast.StartNewClientWithConfig(ctx, buildConfig())

	if err != nil {
		log.Fatalf("Failed to create client: %v", err)
	}

	distMap, err := client.GetMap(ctx, "my-test")

	if err != nil {
		log.Fatalf("Failed to get dist. map: %v", err)
	}

	var wg sync.WaitGroup
	wg.Add(10)

	key := "test-key"

	start := time.Now()
	for i := 0; i < 10; i++ {
		go func() {
			defer wg.Done()

			for j := 0; j < 10_000; j++ {
				if j%1000 == 0 {
					log.Printf("[Pessimistic] At %d\n", j)
				}
				// taken from https://github.com/hazelcast/hazelcast-go-client/blob/e7a962174982d98a3e3840ab3bec917bf67596a0/examples/map/lock/main.go
				lockAndIncrement(ctx, distMap, key)
			}
		}()
	}
	wg.Wait()
	elapsed := time.Since(start)

	v, err := distMap.Get(ctx, key)
	if err != nil {
		log.Fatalf("Error on Get: %v", err)
	}

	log.Printf("Pessimistic locking update took %s, value=%d", elapsed, v.(int64))
}

Expected behavior

I have a powerful machine, so I expect this code to run relatively fast (3-5 minutes at most).

Actual behavior

This code runs on my machine for more than 1 hour.

tester-container | 2024/02/04 23:37:01 Pessimistic locking update took 1h12m12.783679237s, value=100000
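That works out to roughly 43 ms per locked increment (about 4,333 seconds for 100,000 increments in total), even though everything runs against a single local member.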

Full logs can be found here

I also tried increasing the number of Hazelcast nodes to 3, but that only reduced the time to 40-50 minutes.
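To help isolate whether the time goes into per-call latency or into lock contention, a single-goroutine baseline along these lines could be timed. This is only a sketch, not part of the original report: it reuses the same client API, the map name is a placeholder, and the default client configuration is assumed (with the setup above, buildConfig() would be reused instead).

package main

import (
	"context"
	"log"
	"time"

	"github.com/hazelcast/hazelcast-go-client"
)

func main() {
	ctx := context.Background()
	// NOTE: the default configuration (local member) is assumed here;
	// with the setup from this report, buildConfig() would be used instead.
	client, err := hazelcast.StartNewClient(ctx)
	if err != nil {
		log.Fatalf("Failed to create client: %v", err)
	}
	defer client.Shutdown(ctx)

	// The map name is a placeholder.
	distMap, err := client.GetMap(ctx, "lock-latency-test")
	if err != nil {
		log.Fatalf("Failed to get dist. map: %v", err)
	}

	const n = 1000
	key := "test-key"
	start := time.Now()
	for i := 0; i < n; i++ {
		// One uncontended Lock/Unlock pair per iteration.
		lockCtx := distMap.NewLockContext(ctx)
		if err := distMap.Lock(lockCtx, key); err != nil {
			log.Fatalf("Error on Lock: %v", err)
		}
		if err := distMap.Unlock(lockCtx, key); err != nil {
			log.Fatalf("Error on Unlock: %v", err)
		}
	}
	log.Printf("Average uncontended Lock/Unlock round trip: %s", time.Since(start)/n)
}

Comparing this uncontended round-trip time against the ~43 ms per increment observed above would show how much of the slowdown comes from contention on the single key.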

Steps to reproduce the behavior

Run this docker-compose file from the /db_environment/hazelcast/ folder.

BaLiKfromUA (Author) commented:

This issue might be, to some extent, similar to hazelcast/hazelcast-python-client#616

@BaLiKfromUA BaLiKfromUA changed the title Extreme slow down when using Map#lock/unlock from multiple threads Extreme slow down when using Map#lock/unlock from multiple goroutines Feb 5, 2024