Sorry, I just realized this one slipped through the cracks!

On Sat, 4 Sep 2010, FWDF wrote:
> We use 3 servers to build a test Ceph system, configured as below:
>
>     Host        IP
>     client01    192.168.1.10
>     ceph01      192.168.2.50
>     ceph02      192.168.2.51
>
> The OS is Ubuntu 10.04 LTS and the version of Ceph is v0.21.1.
>
> ceph.conf:
> [global]
>         auth supported = cephx
>         pid file = /var/run/ceph/$name.pid
>         debug ms = 0
>         keyring = /etc/ceph/keyring.bin
> [mon]
>         mon data = /mnt/ceph/data/mon$id
>         debug ms = 1
> [mon0]
>         host = ceph01
>         mon addr = 192.168.2.50:6789
> [mds]
>         keyring = /etc/ceph/keyring.$name
>         debug ms = 1
> [mds.ceph01]
>         host = ceph01
> [mds.ceph02]
>         host = ceph02
> [osd]
>         sudo = true
>         osd data = /mnt/ceph/osd$id/data
>         keyring = /etc/ceph/keyring.$name
>         osd journal = /mnt/ceph/osd$id/data/journal
>         osd journal size = 100
> [osd0]
>         host = ceph01
> [osd1]
>         host = ceph01
> [osd2]
>         host = ceph01
> [osd3]
>         host = ceph01
> [osd10]
>         host = ceph02
>
> There are 4 HDDs in ceph01, each with its own OSD, named osd0, osd1,
> osd2, osd3; there is 1 HDD in ceph02, named osd10. All of these HDDs
> are formatted as btrfs and mounted on the mount points listed below:
>
> ceph01
>     /dev/sdc1    /mnt/ceph/osd0/data     btrfs
>     /dev/sdd1    /mnt/ceph/osd1/data     btrfs
>     /dev/sde1    /mnt/ceph/osd2/data     btrfs
>     /dev/sdf1    /mnt/ceph/osd3/data     btrfs
>
> ceph02
>     /dev/sdb1    /mnt/ceph/osd10/data    btrfs
>
> Make the Ceph file system:
> root@ceph01:~# mkcephfs -c /etc/ceph/ceph.conf -a -k /etc/ceph/keyring.bin
>
> Start up Ceph:
> root@ceph01:~# /etc/init.d/ceph -a start
>
> Then
> root@ceph01:~# ceph -w
> 10.09.01_17:56:19.337895 mds e17: 1/1/1 up {0=up:active}, 1 up:standby
> 10.09.01_17:56:19.347184 osd e27: 5 osds: 5 up, 5 in
> 10.09.01_17:56:19.349447 log ...
> 10.09.01_17:56:19.373773 mon e1: 1 mons at 192.168.2.50:6789/0
>
> The Ceph file system is mounted on client01 (192.168.1.10), ceph01
> (192.168.2.50), and ceph02 (192.168.2.51) at /data/ceph. It works fine
> at the beginning: I can use ls, and reading and writing files is ok.
> After some files are written, I find I can't use `ls -l /data/ceph`
> until I umount Ceph from ceph02; but one day later the same problem
> occurred again, so I umounted Ceph from ceph01 and everything is ok.
>
> Q1:
> Can the Ceph file system be mounted on a member of the Ceph cluster?

Technically, yes, but you should be very careful doing so.  The problem
is that when the kernel is low on memory it will force the client to
write out dirty data so that it can reclaim those pages.  If the
writeout depends on then waking up some user process (the cosd daemon),
doing a bunch of random work, and writing the data to disk (dirtying
yet more memory), you can deadlock the system.

> When I follow the instructions at
> http://ceph.newdream.net/wiki/Monitor_cluster_expansion to add a
> monitor on ceph02, the following error occurred:
>
> root@ceph02:~# /etc/init.d/ceph start mon1
> [/etc/ceph/fetch_config/tmp/fetched.ceph.conf.14210] ceph.conf 100% 2565 2.5KB/s 00:00
> === mon.1 ===
> Starting Ceph mon1 on ceph02...
>  ** WARNING: Ceph is still under heavy development, and is only suitable for **
>  ** testing and review.  Do not trust it with important data.                **
> terminate called after throwing an instance of 'std::logic_error'
>   what():  basic_string::_S_construct NULL not valid
> Aborted (core dumped)
> failed: ' /usr/bin/cmon -i 1 -c /tmp/fetched.ceph.conf.14210 '

I haven't seen that crash, but it looks like a std::string constructor
is being passed a NULL pointer.  Do you have a core dump (to get a
backtrace)?  Which version are you running (`cmon -v`)?

> Q2:
> How do you add a monitor to a running Ceph system?

The process in that wiki article can expand the monitor cluster while
it is online.  Note that the monitor identification changed slightly
between v0.21 and the current unstable branch (which will be v0.22),
and the instructions still need to be updated for that.

> Q3:
> Is it possible to add an mds while the Ceph system is running?  How?

Yes.  Add the new mds to ceph.conf and start the daemon.  You should
see it as up:standby in the 'ceph -s' or 'ceph mds dump -o -' output.
Then

        ceph mds setmaxmds 2

changes the size of the 'active' cluster to 2; a rough sketch of the
whole sequence is below.  Please keep in mind the clustered MDS still
has some bugs; we expect v0.22 to be stable.
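For concreteness, the steps might look roughly like this.  In your case
you already have a standby mds (mds.ceph02), so the setmaxmds step
alone would make it active; the fragment below assumes a hypothetical
new third host "ceph03", and the exact daemon argument to the init
script is my guess (keyring setup for the new mds is not shown):

        ; added to ceph.conf on all nodes (hypothetical host name)
        [mds.ceph03]
                host = ceph03

        # on the new host: start just the new daemon
        root@ceph03:~# /etc/init.d/ceph start mds.ceph03

        # verify it shows up as up:standby
        root@ceph01:~# ceph -s
        root@ceph01:~# ceph mds dump -o -

        # grow the active mds cluster from 1 to 2
        root@ceph01:~# ceph mds setmaxmds 2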
> I fdisked an HDD into two partitions, one for the journal and the
> other for data, like this:
>     /dev/sdc1 (180GB) as data
>     /dev/sdc2 (10GB)  as journal
>
> /dev/sdc1 made as btrfs, mounted at /mnt/ceph/osd0/data
> /dev/sdc2 made as btrfs, mounted at /mnt/ceph/osd0/journal
>
> ceph.conf:
> ...
> [osd]
>         osd data = /mnt/ceph/osd$id/data
>         osd journal = /mnt/ceph/osd$id/journal
>         ; osd journal size = 100
> ...
>
> When I use the mkcephfs command, I can't build the OSDs until I edit
> ceph.conf like this:
>
> [osd]
>         osd data = /mnt/ceph/osd$id/data
>         osd journal = /mnt/ceph/osd$id/data/journal
>         osd journal size = 100

If the journal is a file, the system won't create it for you unless you
specify a size.  If it already exists (e.g., you created it via 'dd',
or it's a block device) the journal size isn't needed.

> Q4:
> How do I set the journal path to a device or partition?

        osd journal = /dev/sdc1    ; or whatever

(See the sketch at the end of this message.)

Hope this helps!  Sorry for the slow response.  Let us know if you have
further questions!

sage

> Thanks for all the help and replies, and sorry for my lame English.
>
> Lin
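For reference, here is a minimal sketch of the journal-on-a-raw-
partition variant from Q4, using the partition names from the message
above.  It assumes /dev/sdc2 is handed to cosd directly (no btrfs, no
mount point) and that 'osd journal' is set in the per-OSD section;
treat it as a sketch rather than a tested configuration:

        [osd]
                osd data = /mnt/ceph/osd$id/data
                ; no 'osd journal size' needed when the journal is a
                ; block device
        [osd0]
                host = ceph01
                ; journal goes straight to the raw partition
                osd journal = /dev/sdc2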