
Need readiness check endpoint for PD #5658

Open
hanlins opened this issue Nov 1, 2022 · 6 comments · May be fixed by #5685
Labels
type/feature-request Categorizes issue or PR as related to a new feature.

Comments


hanlins commented Nov 1, 2022

Feature Request

Describe your feature request related problem

When doing a rolling update of a PD cluster, we need to bring down a PD instance, bring up a PD instance with the updated config, and repeat this for all PD instances one at a time. Typically we use tidb-operator to automate this, and a PD instance is considered "Running" as long as the pod has been up for some duration. However, the PD instance could still be catching up with the leader and receiving etcd raft logs, and therefore unable to serve requests at that moment. Say we have a PD deployment with 3 replicas, and the first two pods have been updated but are not yet ready to serve new requests: at that point PD is unavailable, since the remaining PD alone cannot form a write quorum (the other two are still syncing etcd data).

Describe the feature you'd like

It would be helpful if a PD instance could expose (maybe via a RESTful endpoint) whether its internal data is synced with the leader and it is ready to serve new requests.

Describe alternatives you've considered

An alternative approach in the tidb-operator scenario is to add an init container to PD's pod spec that sleeps for a certain amount of time. Until the sleep finishes, the Pod stays stuck in the Init state, so the rolling update process waits until it completes. Hopefully, by that time, the etcd data for that PD instance has been synced.

Teachability, Documentation, Adoption, Migration Strategy

The endpoint would be something new, and since no existing system depends on it, we don't need to worry about migration.

hanlins added the type/feature-request label Nov 1, 2022

hanlins commented Nov 7, 2022

Proposing to add a RESTful endpoint, similar to the existing health endpoint, that exposes the instance's etcd readiness information:

healthPrefix = "pd/api/v1/health"

The logic for the readiness check would be to check whether the current PD's corresponding etcd member is still a learner. If an etcd member has caught up with the leader, it should have been promoted to a voting member, i.e. a follower [1]. The IS LEARNER field is exposed in the etcd cluster status response [2], so I suppose this is feasible.
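
A rough sketch of what such a readiness handler could look like, assuming the handler can reach the embedded etcd through a clientv3.Client and knows its own member ID (the function and parameter names here are illustrative assumptions, not existing PD code):

import (
    "context"
    "net/http"
    "time"

    clientv3 "go.etcd.io/etcd/client/v3"
)

// readyHandler returns 200 only when the local etcd member is a voting
// member (i.e. no longer a learner); otherwise it returns 503 so a probe
// would treat this instance as not ready.
func readyHandler(cli *clientv3.Client, localMemberID uint64) http.HandlerFunc {
    return func(w http.ResponseWriter, r *http.Request) {
        ctx, cancel := context.WithTimeout(r.Context(), 3*time.Second)
        defer cancel()
        resp, err := cli.MemberList(ctx)
        if err != nil {
            http.Error(w, err.Error(), http.StatusServiceUnavailable)
            return
        }
        for _, m := range resp.Members {
            if m.ID != localMemberID {
                continue
            }
            if m.IsLearner {
                http.Error(w, "etcd member is still a learner", http.StatusServiceUnavailable)
                return
            }
            w.WriteHeader(http.StatusOK)
            return
        }
        http.Error(w, "local member not found in member list", http.StatusServiceUnavailable)
    }
}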

cc @nolouch

Footnotes

  1. https://etcd.io/docs/v3.3/learning/learner/

  2. https://etcd.io/docs/v3.5/tutorials/how-to-check-cluster-status/

hanlins linked a pull request (#5685) Nov 8, 2022 that will close this issue

rleungx commented Nov 8, 2022

IMO, when we do a rolling update, there could also be a log lag between the followers and the leader, which cannot be solved by only checking whether the member is a learner.


hanlins commented Nov 8, 2022

> IMO, when we do a rolling update, there could also be a log lag between the followers and the leader, which cannot be solved by only checking whether the member is a learner.

Sure, thanks for pointing that out! Any suggestions on what we should check?


rleungx commented Nov 8, 2022

Can we just use the health API?


hanlins commented Nov 8, 2022

> Can we just use the health API?

I was checking CheckHealth; it seems to check all members rather than whether the current PD instance is ready:

pd/server/cluster/cluster.go, lines 2407 to 2432 at 91f1664:

// CheckHealth checks if members are healthy.
func CheckHealth(client *http.Client, members []*pdpb.Member) map[uint64]*pdpb.Member {
    healthMembers := make(map[uint64]*pdpb.Member)
    for _, member := range members {
        for _, cURL := range member.ClientUrls {
            ctx, cancel := context.WithTimeout(context.Background(), clientTimeout)
            req, err := http.NewRequestWithContext(ctx, http.MethodGet, fmt.Sprintf("%s%s", cURL, healthURL), nil)
            if err != nil {
                log.Error("failed to new request", errs.ZapError(errs.ErrNewHTTPRequest, err))
                cancel()
                continue
            }
            resp, err := client.Do(req)
            if resp != nil {
                resp.Body.Close()
            }
            cancel()
            if err == nil && resp.StatusCode == http.StatusOK {
                healthMembers[member.GetMemberId()] = member
                break
            }
        }
    }
    return healthMembers
}

I think in a k8s rolling upgrade, we would be interested in whether the current PD instance has been keeping up with the leader. What about also checking the applied index of the current PD against the leader's committed index?
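
For illustration, a check along those lines could compare the two indexes through etcd's Status API (the endpoint arguments and the maxLag threshold are assumptions for the sketch, not something PD exposes today):

import (
    "context"

    clientv3 "go.etcd.io/etcd/client/v3"
)

// isCaughtUp compares the local member's applied raft index with the
// committed index reported by the leader, tolerating a small lag.
func isCaughtUp(ctx context.Context, cli *clientv3.Client, localEndpoint, leaderEndpoint string, maxLag uint64) (bool, error) {
    local, err := cli.Status(ctx, localEndpoint)
    if err != nil {
        return false, err
    }
    leader, err := cli.Status(ctx, leaderEndpoint)
    if err != nil {
        return false, err
    }
    // A learner is by definition not yet a fully caught-up voting member.
    if local.IsLearner {
        return false, nil
    }
    if local.RaftAppliedIndex >= leader.RaftIndex {
        return true, nil
    }
    return leader.RaftIndex-local.RaftAppliedIndex <= maxLag, nil
}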


hanlins commented Nov 9, 2022

Also, the health endpoint seems to return a 200 status all the time:

h.rd.JSON(w, http.StatusOK, healths)

In k8s we typically need the endpoint to return a status code of 400 or greater when a status check probe fails [1]; quoting the docs:

> Any code greater than or equal to 200 and less than 400 indicates success. Any other code indicates failure.
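
So if the health endpoint (or a new readiness endpoint) were to back a Kubernetes probe, it would need to fail closed. A minimal sketch, using assumed names rather than PD's actual handler wiring:

import (
    "encoding/json"
    "net/http"

    "github.com/pingcap/kvproto/pkg/pdpb"
)

// writeHealthStatus responds with 503 (>= 400, so kubelet marks the probe
// as failed) when any member failed the health check, and 200 otherwise.
func writeHealthStatus(w http.ResponseWriter, members []*pdpb.Member, healthy map[uint64]*pdpb.Member) {
    code := http.StatusOK
    if len(healthy) != len(members) {
        code = http.StatusServiceUnavailable
    }
    w.Header().Set("Content-Type", "application/json")
    w.WriteHeader(code)
    _ = json.NewEncoder(w).Encode(healthy)
}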

Footnotes

  1. https://kubernetes.io/docs/tasks/configure-pod-container/configure-liveness-readiness-startup-probes/#define-a-liveness-http-request
