full_ratios

* full_ratios - please explain?
@ 2015-02-18 14:39 Wyllys Ingersoll
  2015-02-18 15:05 ` Wido den Hollander
  0 siblings, 1 reply; 5+ messages in thread
From: Wyllys Ingersoll @ 2015-02-18 14:39 UTC (permalink / raw)
  To: ceph-devel

Can someone explain the interaction and effects of all of these
"full_ratio" parameters?  I havent found any real good explanation of how
they affect the distribution of data once the cluster gets above the
"nearfull" and close to the "close" ratios.

mon_osd_full_ratio
mon_osd_nearfull_ratio

osd_backfill_full_ratio
osd_failsafe_full_ratio
osd_failsafe_nearfull_ratio

We have a cluster with about 144 OSDs (518 TB) and trying to get it to a
90% full rate for testing purposes.

We've found that when some of the OSDs get above the mon_osd_full_ratio
value (.95 in our system), then it stops accepting any new data, even
though there is plenty of space left on other OSDs that are not yet even up
to 90%.  Tweaking the osd_failsafe ratios enabled data to move again for a
bit, but eventually it becomes unbalanced and stops working again.

Is there a recommended combination of values to use that will allow the
cluster to continue accepting data and rebalancing correctly above 90%.

thanks,
 Wyllys Ingersoll

^ permalink raw reply	[flat|nested] 5+ messages in thread