Project

General

Profile

πŸ“‹ Ceph Operations Checklist

βœ… Daily Tasks

  • Check cluster health
    ceph -s
    
  • Review detailed health report
    ceph health detail
    
  • Verify monitor quorum
    ceph quorum_status --format json-pretty
    
  • Check OSD status
    ceph osd stat
    
  • Scan logs for warnings
    journalctl -u ceph-mon@<hostname> --since today
    journalctl -u ceph-osd@<id> --since today
    

πŸ“… Weekly Tasks

  • Check pool usage and capacity
    ceph df
    
  • Review placement group statistics
    ceph pg stat
    
  • Inspect OSD tree layout
    ceph osd tree
    
  • Run OSD scrub (data consistency)
    ceph osd scrub <id>
    
  • Rebalance cluster if needed
    ceph osd reweight-by-utilization
    

🚨 Watch For

  • OSDs marked down/out β†’ restart or replace disk.
  • PGs stuck/inactive β†’ check network and MON quorum.
  • Slow requests β†’ test Ceph network latency and disk health.
  • Near‑full warnings β†’ add OSDs or clean up unused images/snapshots.
  • MON quorum loss β†’ ensure at least 2 of 3 MONs are running.