* Re: scrubbing for a long time and not finished
From: Xinze Chi @ 2015-03-19 7:30 UTC
To: ceph-users, ceph-devel

Currently, users have no way of knowing when a PG has been scrubbing for a
long time. Could we raise a warning when that happens (with the threshold
defined as osd_scrub_max_time)? It would tell the user that something may be
wrong in the cluster.

2015-03-17 21:21 GMT+08:00 池信泽 <xmdxcxz@gmail.com>:
>
> On Tue, Mar 17, 2015 at 10:01 AM, Xinze Chi <xmdxcxz@gmail.com> wrote:
>
> Hi all,
>
> I found a PG on my test cluster that has been scrubbing for a long time
> and does not finish. There is no useful scrubbing log. scrubs_active
> is 1, so inc_scrubs_pending returns false. I think the reason is that
> some scrub message was lost, so the primary cannot continue chunky_scrub
> and hangs in the scrubbing state.
>
> Could anyone give some suggestions?
>
> Thanks
>
> [root@ceph0 ~]# date
> Tue Mar 17 09:54:54 CST 2015
> [root@ceph0 ~]# ceph pg dump | grep scrub
> dumped all in format plain
> pg_stat objects mip degr misp unf bytes log disklog state state_stamp v reported up up_primary acting acting_primary last_scrub scrub_stamp last_deep_scrub deep_scrub_stamp
> 1.97 30 0 0 0 0 117702656 31 31 active+clean+scrubbing 2015-03-16 14:50:02.110796 78'31 78:50 [9,6,1] 9 [9,6,1] 9 0'0 2015-03-15 14:49:33.661597 0'0 2015-03-13 14:48:53.341679
>
> The attachment is the log from the primary; the scrubbing PG is 1.97s0.
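Until such a warning exists, the check described above can be approximated from the outside. The following is a minimal sketch, not part of Ceph: it assumes the plain-format column layout shown in the pasted `ceph pg dump` output (state in the 10th column, followed by the date and time of state_stamp), and it assumes the `date` output and state_stamp are in the same timezone. The function names and the `max_time` parameter are hypothetical; `max_time` stands in for the proposed osd_scrub_max_time.

```python
from datetime import datetime, timedelta


def scrubbing_durations(dump_lines, now):
    """Return [(pgid, time spent in current state)] for rows whose state
    contains 'scrubbing'. Column positions are assumed from the pasted
    plain-format dump, not taken from any Ceph specification."""
    result = []
    for line in dump_lines:
        fields = line.split()
        # Skip the "dumped all ..." notice, the header row, and other PG states.
        if len(fields) < 12 or "scrubbing" not in fields[9]:
            continue
        # state_stamp is split over two whitespace-separated fields: date, time.
        stamp = datetime.strptime(fields[10] + " " + fields[11],
                                  "%Y-%m-%d %H:%M:%S.%f")
        result.append((fields[0], now - stamp))
    return result


def long_scrubs(dump_lines, now, max_time):
    """Keep only PGs that have been scrubbing longer than max_time
    (a hypothetical stand-in for the proposed osd_scrub_max_time)."""
    return [(pgid, d) for pgid, d in scrubbing_durations(dump_lines, now)
            if d > max_time]


if __name__ == "__main__":
    sample = [
        "dumped all in format plain",
        "pg_stat objects mip degr misp unf bytes log disklog state state_stamp "
        "v reported up up_primary acting acting_primary last_scrub scrub_stamp "
        "last_deep_scrub deep_scrub_stamp",
        "1.97 30 0 0 0 0 117702656 31 31 active+clean+scrubbing 2015-03-16 "
        "14:50:02.110796 78'31 78:50 [9,6,1] 9 [9,6,1] 9 0'0 2015-03-15 "
        "14:49:33.661597 0'0 2015-03-13 14:48:53.341679",
    ]
    now = datetime(2015, 3, 17, 9, 54, 54)  # the `date` output in the message
    for pgid, duration in long_scrubs(sample, now, timedelta(hours=1)):
        print(pgid, "scrubbing for", duration)
```

With the values from the message, PG 1.97 had been in active+clean+scrubbing for roughly 19 hours when the dump was taken, which is the situation the proposed warning is meant to surface.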
* Re: scrubbing for a long time and not finished
From: Sage Weil @ 2015-03-19 13:33 UTC
To: Xinze Chi
Cc: ceph-users-idqoXFIVOFJgJs9I8MT0rw, ceph-devel-u79uwXL29TY76Z2rM5mHXA

On Thu, 19 Mar 2015, Xinze Chi wrote:
> Currently, users have no way of knowing when a PG has been scrubbing for a
> long time. Could we raise a warning when that happens (with the threshold
> defined as osd_scrub_max_time)? It would tell the user that something may be
> wrong in the cluster.

This should be pretty straightforward to add along with the other "stuck
x" warnings based on the pg_stat_t state timestamps. On the other hand,
that may be a somewhat heavyweight approach (each new warning bloats the
stat structure a bit); open to other ideas!

sage
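To make the trade-off in the reply above concrete, here is a minimal sketch of a timestamp-based "stuck" check. It is an illustration only, not Ceph code: `PgStat` is a simplified stand-in for pg_stat_t, `last_clean` is modeled on the kind of state timestamp the existing warnings rely on, and `last_not_scrubbing` is a hypothetical new field. The sketch shows why each new warning of this kind adds one more timestamp to every PG's stat record.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta


@dataclass
class PgStat:
    """Simplified, illustrative stand-in for pg_stat_t."""
    pgid: str
    last_clean: datetime          # last time the PG was seen in a clean state
    last_not_scrubbing: datetime  # HYPOTHETICAL: last time the PG was not scrubbing


def stuck_pgs(stats, now, threshold):
    """Classify PGs as stuck when a state timestamp is older than threshold.

    Every additional 'stuck x' category needs its own timestamp field,
    which is the per-PG growth of the stat structure mentioned in the reply."""
    stuck = {"unclean": [], "scrubbing": []}
    for s in stats:
        if now - s.last_clean > threshold:
            stuck["unclean"].append(s.pgid)
        if now - s.last_not_scrubbing > threshold:
            stuck["scrubbing"].append(s.pgid)
    return stuck


if __name__ == "__main__":
    now = datetime(2015, 3, 17, 9, 54, 54)
    stats = [
        # Clean, but has not left the scrubbing state for about 19 hours.
        PgStat("1.97", last_clean=now,
               last_not_scrubbing=datetime(2015, 3, 16, 14, 50, 2)),
        # Clean and not scrubbing (made-up PG for contrast).
        PgStat("1.98", last_clean=now, last_not_scrubbing=now),
    ]
    print(stuck_pgs(stats, now, timedelta(hours=1)))
```

A lighter alternative, in the spirit of "open to other ideas", would be to derive the scrubbing duration from the state and state_stamp values already reported per PG, at the cost of only covering the PG's current state.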
* Re: scrubbing for a long time and not finished
From: Xinze Chi @ 2015-03-20 1:23 UTC
To: ceph-devel

I have filed an issue for this: http://tracker.ceph.com/issues/11162.

2015-03-19 21:33 GMT+08:00 Sage Weil <sage@newdream.net>:
> On Thu, 19 Mar 2015, Xinze Chi wrote:
>> Currently, users have no way of knowing when a PG has been scrubbing for a
>> long time. Could we raise a warning when that happens (with the threshold
>> defined as osd_scrub_max_time)? It would tell the user that something may be
>> wrong in the cluster.
>
> This should be pretty straightforward to add along with the other "stuck
> x" warnings based on the pg_stat_t state timestamps. On the other hand,
> that may be a somewhat heavyweight approach (each new warning bloats the
> stat structure a bit); open to other ideas!
>
> sage