All of lore.kernel.org
 help / color / mirror / Atom feed
* Re: scubbing for a long time and not finished
       [not found] ` <da34275f64ad3e2960322516f488fd@ip-10-0-3-214>
@ 2015-03-19  7:30   ` Xinze Chi
       [not found]     ` <CANE=7sWbh9aJHp603BuFZivtLJ5a-axDq7CyHCfwaDWMRw7wAg-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
  0 siblings, 1 reply; 3+ messages in thread
From: Xinze Chi @ 2015-03-19  7:30 UTC (permalink / raw)
  To: ceph-users, ceph-devel

Currently, users do not know  when some pg do scrubbing for a long time.
I think whether we could give some warming if it happend (defined as
osd_scrub_max_time).
It would tell the user something may be wrong in cluster.


2015-03-17 21:21 GMT+08:00 池信泽 <xmdxcxz@gmail.com>:
>
> On 周二, 3月 17, 2015 at 10:01 上午, Xinze Chi <xmdxcxz@gmail.com> wrote:
>
> hi,all:
>
>     I find a pg on my test cluster in doing scrubbing for a long time
> and not finish. there are not some useful scrubbing log. scrubs_active
> is 1, so inc_scrubs_pending return false. I think the reason is that
> some scrub message is lost, so primary can not continue chunky_scrub ,
> so it hang up at scrubbing.
>
>    Could anyone give some suggestion?
>
>    Thanks
>
>
> [root@ceph0 ~]# date
> Tue Mar 17 09:54:54 CST 2015
> [root@ceph0 ~]# ceph pg dump | grep scrub
> dumped all in format plain
> pg_stat objects mip degr misp unf bytes log disklog state state_stamp
> v reported up up_primary acting acting_primary last_scrub
> scrub_stamplast_deep_scrub deep_scrub_stamp
> 1.97 30 0 0 0 0 117702656 31 31 active+clean+scrubbing 2015-03-16
> 14:50:02.110796 78'31 78:50 [9,6,1] 9 [9,6,1] 9 0'0 2015-03-15
> 14:49:33.661597 0'0 2015-03-13 14:48:53.341679
>
>    The attachment is the log from primary, the scrubbing pg is 1.97s0.
--
To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
the body of a message to majordomo@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

^ permalink raw reply	[flat|nested] 3+ messages in thread

* Re: scubbing for a long time and not finished
       [not found]     ` <CANE=7sWbh9aJHp603BuFZivtLJ5a-axDq7CyHCfwaDWMRw7wAg-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
@ 2015-03-19 13:33       ` Sage Weil
  2015-03-20  1:23         ` Xinze Chi
  0 siblings, 1 reply; 3+ messages in thread
From: Sage Weil @ 2015-03-19 13:33 UTC (permalink / raw)
  To: Xinze Chi
  Cc: ceph-users-idqoXFIVOFJgJs9I8MT0rw, ceph-devel-u79uwXL29TY76Z2rM5mHXA

On Thu, 19 Mar 2015, Xinze Chi wrote:
> Currently, users do not know  when some pg do scrubbing for a long time.
> I think whether we could give some warming if it happend (defined as
> osd_scrub_max_time).
> It would tell the user something may be wrong in cluster.

This should be pretty straightforward to add along with the other "stuck 
x" warnings based on the pg_stat_t state timestamps.  On the otherhead, 
that may be a somewhat heavyweight approach (each new warning bloats the 
stat structure a bit); open to other ideas!

sage

^ permalink raw reply	[flat|nested] 3+ messages in thread

* Re: scubbing for a long time and not finished
  2015-03-19 13:33       ` Sage Weil
@ 2015-03-20  1:23         ` Xinze Chi
  0 siblings, 0 replies; 3+ messages in thread
From: Xinze Chi @ 2015-03-20  1:23 UTC (permalink / raw)
  To: ceph-devel

I add a issue http://tracker.ceph.com/issues/11162.

2015-03-19 21:33 GMT+08:00 Sage Weil <sage@newdream.net>:
> On Thu, 19 Mar 2015, Xinze Chi wrote:
>> Currently, users do not know  when some pg do scrubbing for a long time.
>> I think whether we could give some warming if it happend (defined as
>> osd_scrub_max_time).
>> It would tell the user something may be wrong in cluster.
>
> This should be pretty straightforward to add along with the other "stuck
> x" warnings based on the pg_stat_t state timestamps.  On the otherhead,
> that may be a somewhat heavyweight approach (each new warning bloats the
> stat structure a bit); open to other ideas!
>
> sage

^ permalink raw reply	[flat|nested] 3+ messages in thread

end of thread, other threads:[~2015-03-20  1:23 UTC | newest]

Thread overview: 3+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
     [not found] <CANE=7sWwheXa3WZb_+b3G57MBuhcdMssJJqVjR9y9yRHu6vFEQ@mail.gmail.com>
     [not found] ` <da34275f64ad3e2960322516f488fd@ip-10-0-3-214>
2015-03-19  7:30   ` scubbing for a long time and not finished Xinze Chi
     [not found]     ` <CANE=7sWbh9aJHp603BuFZivtLJ5a-axDq7CyHCfwaDWMRw7wAg-JsoAwUIsXosN+BqQ9rBEUg@public.gmane.org>
2015-03-19 13:33       ` Sage Weil
2015-03-20  1:23         ` Xinze Chi

This is an external index of several public inboxes,
see mirroring instructions on how to clone and mirror
all data and code used by this external index.