View a markdown version of this page

使用 Amazon CloudWatch 监控 Aurora DSQL - Amazon Aurora DSQL

使用 Amazon CloudWatch 监控 Aurora DSQL

使用 CloudWatch 监控 Aurora DSQL,CloudWatch 会收集原始数据并将其处理为易读且近乎实时的指标。CloudWatch 将这些统计数据保留 15 个月,有助于您更好地了解 Web 应用程序或服务性能。设置警报以监视特定阈值,并在达到阈值时发送通知或采取行动。查看以下可用于 Aurora DSQL 的使用情况和可观测性指标。

有关更多信息,请参阅《Amazon CloudWatch 用户指南》。

可观测性和性能

此表概述了 Aurora DSQL 的可观测性指标。它包括用于跟踪只读事务数和总事务数的指标,以提供总体工作负载特征。包括查询超时和 OCC 冲突率等可操作指标,有助于识别性能问题和并发冲突。与会话相关的指标,包括有关活动状态和总数方面的指标,可供深入了解系统上的当前负载。

CloudWatch 指标名称 指标 单位 说明
ReadOnlyTransactions Read-only transactions none The number of read-only transactions
TotalTransactions Total transactions none The total number of transactions executed on the system, including read-only transactions.
QueryTimeouts Query timeouts none The number of queries which have timed out due to hitting the maximum transaction time
OccConflicts OCC conflicts none The number of transactions aborted due to key level OCC
CommitLatency Commit Latency milliseconds Time spent by commit phase of query execution (P50)
BytesWritten Bytes Written bytes Bytes written to storage
BytesRead Bytes Read bytes Bytes read from storage
ComputeTime QP compute time milliseconds QP wall clock time
ClusterStorageSize Cluster Storage Size bytes Cluster size

使用情况指标

Aurora DSQL 使用名为分布式处理单元(DPU)的单个标准化计费单位,来衡量所有基于请求的活动,例如查询处理、读取和写入。

CloudWatch 指标名称 指标 维度:ResourceId 单位 说明
WriteDPU Write Units <cluster-id> DPU Approximates the write active-use component of your Aurora DSQL cluster DPU usage.
MultiRegionWriteDPU Multi-Region Write Units <cluster-id> DPU Applicable for Multi-Region clusters: Approximates the multi-Region write active-use component of your Aurora DSQL cluster DPU usage.
ReadDPU Read Units <cluster-id> DPU Approximates the read active-use component of your Aurora DSQL cluster DPU usage.
ComputeDPU Compute Units <cluster-id> DPU Approximates the compute active-use component of your Aurora DSQL cluster DPU usage.
TotalDPU Total Units <cluster-id> DPU Approximates the total active-use component of your Aurora DSQL cluster DPU usage.

CDC 流指标

Aurora DSQL 发布针对更改数据捕获(CDC)流的以下指标。这些指标使用 ClusterIdStreamId 维度,因此您可以独立监控每个 CDC 流。有关 CDC 流的更多信息,请参阅更改数据捕获(CDC)流

CloudWatch 指标名称 指标 单位 说明
IsImpaired Is impaired none Indicates whether the stream is impaired. The value is 1 when the stream is in the IMPAIRED state, and 0 when the stream is healthy. Use this metric to create a CloudWatch alarm that notifies you when a stream becomes impaired.
PublishedBytes Published bytes bytes The total number of bytes that Aurora DSQL wrote to the target Kinesis data stream.
PublishedRecords Published records none The number of CDC records that Aurora DSQL wrote to the target Kinesis data stream.
BehindSourceLag Behind source lag milliseconds The delay, in milliseconds, between when a transaction commits in Aurora DSQL and when the CDC system processes the resulting record. A rising value indicates that the CDC pipeline is falling behind the write workload. If lag grows beyond the failure threshold, the stream transitions to FAILED.
BytesStreamed Bytes streamed bytes The total bytes streamed through the CDC pipeline for billing purposes. This metric reflects the data volume used to calculate streaming charges.
StreamDPU Stream DPU DPU The Distributed Processing Units (DPU) consumed by the CDC stream. This metric reflects the processing cost of streaming change data.