All of lore.kernel.org
 help / color / mirror / Atom feed
* ghost degraded objects
@ 2018-01-19 10:25 Ugis
       [not found] ` <CAE63xUN4+fzJFjEOwKU-gXjqk+=8=gugU97V2xA4fx9oBrwvUQ-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
  0 siblings, 1 reply; 3+ messages in thread
From: Ugis @ 2018-01-19 10:25 UTC (permalink / raw)
  To: ceph-devel, ceph-users

Hi,

Running Luminous 12.2.2, noticed strange behavior lately.
When for example setting "ceph osd out X" closer to the reballancing
end "degraded" objects still show up, but in "pgs:" section of ceph -s
no degraded pgs are still recovering, just ramapped and no degraded
pgs can be found in "ceph pg dump"

  health: HEALTH_WARN
            355767/30286841 objects misplaced (1.175%)
            Degraded data redundancy: 28/30286841 objects degraded
(0.000%), 96 pgs unclean

  services:
    ...
    osd: 38 osds: 38 up, 37 in; 96 remapped pgs

  data:
    pools:   19 pools, 4176 pgs
    objects: 9859k objects, 39358 GB
    usage:   114 TB used, 120 TB / 234 TB avail
    pgs:     28/30286841 objects degraded (0.000%)
             355767/30286841 objects misplaced (1.175%)
             4080 active+clean
             81   active+remapped+backfilling
             15   active+remapped+backfill_wait


Where those 28 degraded objects come from?

In such cases usually when backfilling is done degraded objects also
disappear, but normally degraded objects should fix before remapped
ones by priority.

Ugis

^ permalink raw reply	[flat|nested] 3+ messages in thread

* Re: ghost degraded objects
       [not found] ` <CAE63xUN4+fzJFjEOwKU-gXjqk+=8=gugU97V2xA4fx9oBrwvUQ-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
@ 2018-01-19 13:14   ` Sage Weil
  2018-01-23  1:38     ` [ceph-users] " David Zafman
  0 siblings, 1 reply; 3+ messages in thread
From: Sage Weil @ 2018-01-19 13:14 UTC (permalink / raw)
  To: Ugis; +Cc: ceph-devel-u79uwXL29TY76Z2rM5mHXA, ceph-users-Qp0mS5GaXlQ

On Fri, 19 Jan 2018, Ugis wrote:
> Running Luminous 12.2.2, noticed strange behavior lately.
> When for example setting "ceph osd out X" closer to the reballancing
> end "degraded" objects still show up, but in "pgs:" section of ceph -s
> no degraded pgs are still recovering, just ramapped and no degraded
> pgs can be found in "ceph pg dump"
> 
>   health: HEALTH_WARN
>             355767/30286841 objects misplaced (1.175%)
>             Degraded data redundancy: 28/30286841 objects degraded
> (0.000%), 96 pgs unclean
> 
>   services:
>     ...
>     osd: 38 osds: 38 up, 37 in; 96 remapped pgs
> 
>   data:
>     pools:   19 pools, 4176 pgs
>     objects: 9859k objects, 39358 GB
>     usage:   114 TB used, 120 TB / 234 TB avail
>     pgs:     28/30286841 objects degraded (0.000%)
>              355767/30286841 objects misplaced (1.175%)
>              4080 active+clean
>              81   active+remapped+backfilling
>              15   active+remapped+backfill_wait
> 
> 
> Where those 28 degraded objects come from?

There aren't actually degraded objects.. in this case it's just 
misreporting that there are.

This is a known issue in luminous.  Shortly after release we noticed the 
problem and David has been working on several changes to the stats 
calculation to improve the reporting, but those changes have not been 
backported (and aren't quite complete, either--getting a truly accurate 
number there is nontrivial in some cases it turns out).

> In such cases usually when backfilling is done degraded objects also
> disappear, but normally degraded objects should fix before remapped
> ones by priority.

Yes.

It's unfortunately a scary warning (there shouldn't be degraded 
objects... and generally speaking aren't) that understandably alarms 
users.  We hope to have this sorted out soon!

sage

^ permalink raw reply	[flat|nested] 3+ messages in thread

* Re: [ceph-users] ghost degraded objects
  2018-01-19 13:14   ` Sage Weil
@ 2018-01-23  1:38     ` David Zafman
  0 siblings, 0 replies; 3+ messages in thread
From: David Zafman @ 2018-01-23  1:38 UTC (permalink / raw)
  To: Sage Weil, Ugis; +Cc: ceph-devel, ceph-users


Yes, the pending backport for what we have so far is in 
https://github.com/ceph/ceph/pull/20055

With this changes a backfill caused by marking an osd out has the 
results as shown:


     health: HEALTH_WARN
             115/600 objects misplaced (19.167%)

...
   data:
     pools:   1 pools, 1 pgs
     objects: 200 objects, 310 kB
     usage:   173 GB used, 126 GB / 299 GB avail
     pgs:     115/600 objects misplaced (19.167%)
              1 active+remapped+backfilling

David


On 1/19/18 5:14 AM, Sage Weil wrote:
> On Fri, 19 Jan 2018, Ugis wrote:
>> Running Luminous 12.2.2, noticed strange behavior lately.
>> When for example setting "ceph osd out X" closer to the reballancing
>> end "degraded" objects still show up, but in "pgs:" section of ceph -s
>> no degraded pgs are still recovering, just ramapped and no degraded
>> pgs can be found in "ceph pg dump"
>>
>>    health: HEALTH_WARN
>>              355767/30286841 objects misplaced (1.175%)
>>              Degraded data redundancy: 28/30286841 objects degraded
>> (0.000%), 96 pgs unclean
>>
>>    services:
>>      ...
>>      osd: 38 osds: 38 up, 37 in; 96 remapped pgs
>>
>>    data:
>>      pools:   19 pools, 4176 pgs
>>      objects: 9859k objects, 39358 GB
>>      usage:   114 TB used, 120 TB / 234 TB avail
>>      pgs:     28/30286841 objects degraded (0.000%)
>>               355767/30286841 objects misplaced (1.175%)
>>               4080 active+clean
>>               81   active+remapped+backfilling
>>               15   active+remapped+backfill_wait
>>
>>
>> Where those 28 degraded objects come from?
> There aren't actually degraded objects.. in this case it's just
> misreporting that there are.
>
> This is a known issue in luminous.  Shortly after release we noticed the
> problem and David has been working on several changes to the stats
> calculation to improve the reporting, but those changes have not been
> backported (and aren't quite complete, either--getting a truly accurate
> number there is nontrivial in some cases it turns out).
>
>> In such cases usually when backfilling is done degraded objects also
>> disappear, but normally degraded objects should fix before remapped
>> ones by priority.
> Yes.
>
> It's unfortunately a scary warning (there shouldn't be degraded
> objects... and generally speaking aren't) that understandably alarms
> users.  We hope to have this sorted out soon!
>
> sage
> _______________________________________________
> ceph-users mailing list
> ceph-users@lists.ceph.com
> http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com


^ permalink raw reply	[flat|nested] 3+ messages in thread

end of thread, other threads:[~2018-01-23  1:38 UTC | newest]

Thread overview: 3+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2018-01-19 10:25 ghost degraded objects Ugis
     [not found] ` <CAE63xUN4+fzJFjEOwKU-gXjqk+=8=gugU97V2xA4fx9oBrwvUQ-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
2018-01-19 13:14   ` Sage Weil
2018-01-23  1:38     ` [ceph-users] " David Zafman

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.