diff --git a/images/dashboard-otel.png b/images/dashboard-otel.png new file mode 100644 index 0000000..c564fc7 Binary files /dev/null and b/images/dashboard-otel.png differ diff --git a/images/dashboard-redis.png b/images/dashboard-redis.png new file mode 100644 index 0000000..c5b23e8 Binary files /dev/null and b/images/dashboard-redis.png differ diff --git a/index.html b/index.html index d526252..94dfd8d 100644 --- a/index.html +++ b/index.html @@ -15,7 +15,7 @@
LOAD & STREAM
Prometheus metrics are enabled by setting the type of metric (cluster.metrics.type) to ‘OTEL’. This is the default, the other option being ‘REDIS’, which will log metrics as Redis’ TimeSeries objects in the connect configuration database, thus taking advantage of -Redis' time series capabilities, in which case additional data about keys and TTL are available.
+Redis' time series capabilities, in which case additional data about keys and TTL will be available.The port used by OTEL metrics is set by prometheus.port, it is set to 19090 by default.
The OpenTelemetry metrics are reported for Create/Update/Delete operations for every table configured in the payload file. The metrics generated are -the overall throughput, as well as the lag and latency for each operation, which are reported as histograms. They are tagged so that table and -operation information can be used to report individual operations or operations on a specific table, or they can be aggregated.
+OpenTelemetry metrics are reported for Create/Update/Delete operations for every table configured in the payload file.
+The operational metrics generated are lag and latency for each operation, which are reported as histograms. +They are tagged so that table and operation information can be used to report individual operations or operations on a +specific table, or so they can be aggregated.
Tags: job, database, table, partition, operation, stage
+Statistical metrics include total throughput and average elapsed time. +Lastly, metrics are reported for job starts and stops, as well as errors encountered during processing.
Redis Connect provides a dashboard for monitoring the system. After installing Grafana and connecting it to Prometheus, i.e., adding a datasource, you can install the Redis Connect dashboard by navigating to the Grafana dashboard page and clicking New → Import.
+Redis Connect provides a dashboard for monitoring the system. After installing Grafana and connecting it to Prometheus, +i.e., adding a datasource, you can install the Redis Connect dashboard by navigating to the Grafana dashboard page, +clicking New → Import, and then importing the redis-connect-grafana-dashboard.json file found in the +../config/samples/dashboard subdirectory.
The Redis Connect dashboard reports the following metrics:
metric |
-label |
-type |
-description |
+Metric |
+Label |
+Type |
+Description |
+Metadata (see below) |
event_job_starts_total |
job starts |
count |
number of times job has been started |
+host, job |
||||
event_job_stops_total |
job stops |
count |
number of times job has been stopped |
+host, job |
+||||
event_job_errors |
+errors |
+count |
+number of exceptions that have occurred |
+job, operation, schema/table, exception, message |
+||||
event_job_invalid_events |
+invalid |
+count |
+number of events that were rejected |
+[partition], job, database, stage, operation, schema/table |
||||
event_input_buffer_histogram |
buffer |
histogram |
number of events received |
+job, database |
||||
event_input_buffer_count |
buffer count |
count |
number of measurements |
+job, database |
||||
event_input_buffer_sum |
buffer total |
count |
sum of all measured quantities |
+job, database |
||||
event_operation_lag |
lag |
histogram |
time it took connect to receive event |
+[partition], job, database, stage, operation, schema/table |
||||
event_operation_lag_milliseconds_count |
lag count |
count |
number of measurements |
+[partition], job, database, stage, operation, schema/table |
||||
event_operation_lag_milliseconds_sum |
lag total |
count |
sum of all measured quantities |
+[partition], job, database, stage, operation, schema/table |
||||
event_operation_latency |
latency |
histogram |
time it took to process the event |
+[partition], job, database, stage, operation, schema/table |
||||
event_operation_latency_milliseconds_count |
latency count |
count |
number of measurements |
+[partition], job, database, stage, operation, schema/table |
||||
event_operation_latency_milliseconds_sum |
latency total |
count |
sum of all measured quantities |
+[partition], job, database, stage, operation, schema/table |
||||
event_operation_elapsed |
elapsed |
histogram |
time it took to write event to redis |
+job |
||||
event_operation_elapsed_milliseconds_count |
elapsed count |
count |
number of measurements |
+job |
||||
event_operation_elapsed_milliseconds_sum |
elapsed sum |
count |
sum of all measured quantities |
+job |
||||
event_job_operation_throughput_total |
throughput |
count |
total number of events processed |
+[partition], job, database, stage, operation, schema/table |
Tag |
+Description |
+
job |
+the job name |
+
database |
+the job source database |
+
schema/table |
+the job source table |
+
partition |
+the job partition - only reported when there are multiple partitions |
+
operation |
+the job operation: C(reate) / U(pdate) / D(elete) |
+
stage |
+the job stage name |
+
host |
+the job host |
+
exception |
+the job exception class — in the case of an error |
+
message |
+the job error message — if reported |
+
With regards to Redis’ TimeSeries metrics, you can view these metrics in Grafana using the Redis Datasource for Grafana.
@@ -2451,6 +2545,7 @@String
The name of the directory containing the Redis Connect credentials file.
This directory path must include a properties file named redisconnect_credentials_jobmanager.properties
-[1].
../config/ samples/ credentials
mail.smtp.start.tls.enable
Boolean
Set or disable STARTTLS encryption[2].
Set or disable STARTTLS encryption[5].
true
Note: jobName should not be confused with jobId. jobIds are created as part of a job claim. They add-on a namespace to the jobName to identify the jobType and partitionId (if jobType=PARTITIONED_STREAM) -[3].
+[6].n/a
The number of job partitions that can be claimed, and executed, on the same Redis Connect instance (JVM).
If the limit forces partitions to span more instances than are currently deployed, then the job will not be able to start nor migrate -[4].
0
Integer
Redis Connect’s pipeline is powered by the LMAX Disruptor library (High Performance Inter-Thread Messaging).
Must be a power of 2, minimum 1024 -[5]
4096
Although the producer’s polling event loop enqueues changed-data events in batches, each event is processed individually through the pipeline. This is because Redis Connect updates the checkpoint at the changed-data event level and not the batch -[6].
+[9].false
Long
Fixed delay in between sourceConnectionMaxRetryAttempts.
-Measured in seconds; minimum is 0[7].
Measured in seconds; minimum is 0[10].
60
false
credentialsRotationEventListenerEnabled
Boolean
When enabled, the credentialsDirectoryPath will be periodically scanned for changes that are specific to the -property file associated with this database[9].
false
String
Specifies the criteria for running a snapshot when the connector starts. It is not recommended to use this debezium capability for initial load in Production. See Production Readiness for more information. MySQL, Postgres default to 'never'; Oracle, SQL Server default to 'schema_only' -[10].
n/a
Maximum number of connections that the pool can create. If all connections are in use, an operation requiring a client-to server-connection is blocked until a connection is available or the free-connection-timeout is reached. If set to -1, there is no maximum. The setting must indicate a cap greater than min-connections -[11].
-1
max.queue.size
Integer
Specifies the maximum number of records that the blocking queue can hold. -[12].
32768
Boolean
When enabled, batches of changed-data events are persisted to Redis Stream, before they are enqueued within the in-memory queue, which effectively mimics a change-data-capture (CDC) process within Redis Connect. -[13].
True