Alert rule: CephOSDNearFull

Please consider opening a PR to improve this runbook if you gain new information about causes of the alert, or how to debug or resolve the alert.

Overview

This alert fires when the utilization of a Ceph storage device (disk) exceeds 75% of the device’s capacity.

This alert may indicate that overall cluster utilization will soon reach problematic levels. If the alert is caused by high utilization across the whole cluster, either delete unused data or increase the cluster size.

Otherwise, investigate why this particular device has higher utilization than the other storage devices in the Ceph cluster.
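
The distinction above (cluster-wide pressure versus a single overfull device) can be checked roughly with a small script. The following is a minimal sketch, not part of the alert rule itself. It assumes the ceph CLI is reachable (for example from a toolbox pod) and that `ceph osd df --format json` returns a "nodes" list with "name" and "utilization" fields plus a "summary" with "average_utilization"; verify these field names against your Ceph release. The 10-point margin and the sample data are hypothetical and only serve as an illustration.

```python
import json
import subprocess

NEARFULL_PERCENT = 75.0  # alert threshold stated in this runbook
MARGIN_PERCENT = 10.0    # hypothetical heuristic, not a Ceph setting


def fetch_osd_df():
    """Run `ceph osd df --format json` and return the parsed output.

    Assumes the ceph CLI is on PATH and can reach the cluster.
    """
    raw = subprocess.run(
        ["ceph", "osd", "df", "--format", "json"],
        check=True, capture_output=True, text=True,
    ).stdout
    return json.loads(raw)


def near_full_osds(osd_df, threshold=NEARFULL_PERCENT):
    """Return the OSD entries above the threshold, fullest first."""
    hot = [n for n in osd_df.get("nodes", []) if n["utilization"] > threshold]
    return sorted(hot, key=lambda n: n["utilization"], reverse=True)


def diagnose(osd_df, threshold=NEARFULL_PERCENT, margin=MARGIN_PERCENT):
    """Classify the situation as 'ok', 'cluster-wide' or 'imbalanced'."""
    if not near_full_osds(osd_df, threshold):
        return "ok"
    average = osd_df["summary"]["average_utilization"]
    # If the cluster average is itself close to the threshold, the whole
    # cluster is filling up; otherwise only individual devices stand out.
    if average >= threshold - margin:
        return "cluster-wide"
    return "imbalanced"


# Made-up sample data in the assumed shape of `ceph osd df --format json`.
SAMPLE = {
    "nodes": [
        {"name": "osd.0", "utilization": 81.2},
        {"name": "osd.1", "utilization": 42.0},
        {"name": "osd.2", "utilization": 40.5},
    ],
    "summary": {"average_utilization": 54.57},
}

if __name__ == "__main__":
    # Replace SAMPLE with fetch_osd_df() to inspect a live cluster.
    for osd in near_full_osds(SAMPLE):
        print(f"{osd['name']}: {osd['utilization']:.1f}% used")
    print("diagnosis:", diagnose(SAMPLE))
```

A result of "cluster-wide" points to deleting data or adding capacity, while "imbalanced" suggests looking at why data is unevenly distributed to the affected device.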

Steps for debugging

See the how-to on scaling a PVC-based Ceph cluster for instructions to resize the cluster.