Hello,

Here is the result:

fio --ioengine=rbd --name=test --bs=4k --iodepth=1 --rw=randwrite --runtime=60  -pool=kube -rbdname=bench
test: (g=0): rw=randwrite, bs=(R) 4096B-4096B, (W) 4096B-4096B, (T) 4096B-4096B, ioengine=rbd, iodepth=1
fio-3.12
Starting 1 process
Jobs: 1 (f=1): [w(1)][100.0%][w=248KiB/s][w=62 IOPS][eta 00m:00s]
test: (groupid=0, jobs=1): err= 0: pid=1903256: Fri Aug 16 13:22:59 2019
  write: IOPS=58, BW=232KiB/s (238kB/s)(13.6MiB/60011msec); 0 zone resets
    slat (usec): min=9, max=351, avg=52.82, stdev=23.02
    clat (usec): min=1264, max=96970, avg=17156.70, stdev=6713.88
     lat (usec): min=1276, max=97050, avg=17209.52, stdev=6715.06
    clat percentiles (usec):
     |  1.00th=[ 2933],  5.00th=[ 3884], 10.00th=[11863], 20.00th=[13304],
     | 30.00th=[13960], 40.00th=[14484], 50.00th=[15008], 60.00th=[20579],
     | 70.00th=[22152], 80.00th=[23987], 90.00th=[25297], 95.00th=[25822],
     | 99.00th=[26346], 99.50th=[27395], 99.90th=[71828], 99.95th=[82314],
     | 99.99th=[96994]
   bw (  KiB/s): min=  104, max=  272, per=100.00%, avg=232.17, stdev=19.55, samples=120
   iops        : min=   26, max=   68, avg=57.97, stdev= 4.88, samples=120
  lat (msec)   : 2=0.06%, 4=5.51%, 10=3.41%, 20=50.22%, 50=40.69%
  lat (msec)   : 100=0.11%
  cpu          : usr=0.44%, sys=0.27%, ctx=3489, majf=0, minf=3582
  IO depths    : 1=100.0%, 2=0.0%, 4=0.0%, 8=0.0%, 16=0.0%, 32=0.0%, >=64=0.0%
     submit    : 0=0.0%, 4=100.0%, 8=0.0%, 16=0.0%, 32=0.0%, 64=0.0%, >=64=0.0%
     complete  : 0=0.0%, 4=100.0%, 8=0.0%, 16=0.0%, 32=0.0%, 64=0.0%, >=64=0.0%
     issued rwts: total=0,3485,0,0 short=0,0,0,0 dropped=0,0,0,0
     latency   : target=0, window=0, percentile=100.00%, depth=1

Run status group 0 (all jobs):
  WRITE: bw=232KiB/s (238kB/s), 232KiB/s-232KiB/s (238kB/s-238kB/s), io=13.6MiB (14.3MB), run=60011-60011msec

Disk stats (read/write):
    md2: ios=3/4611, merge=0/0, ticks=0/0, in_queue=0, util=0.00%, aggrios=3/8532, aggrmerge=0/2364, aggrticks=0/1346, aggrin_queue=52504, aggrutil=88.03%
  nvme1n1: ios=3/8529, merge=0/2295, ticks=1/1347, in_queue=52932, util=88.03%
  nvme0n1: ios=3/8535, merge=1/2434, ticks=0/1346, in_queue=52076, util=86.52%
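
For completeness, the command above is equivalent to this fio job file (a minimal sketch with the same parameters; the file name is just an example):

# bench-rbd.fio - 4k random writes at queue depth 1 against the "bench" image
[test]
# go through librbd directly, no kernel mapping needed
ioengine=rbd
pool=kube
rbdname=bench
bs=4k
# a single outstanding I/O, so this is effectively a latency test
iodepth=1
rw=randwrite
runtime=60

Run with: fio bench-rbd.fio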

For your information, my disk setup is:

2 × 500G NVMe disks with:

a 10G RAID 1 partition for the OS
a 160G RAID 0 partition for local Docker data
a 387G partition on each disk for the OSD

I have 5 physical nodes, each with a 12-core CPU and 32 GB of RAM. Ceph runs on a dedicated 1 Gbps network.

I had roughly the same hardware setup with Ceph 10.2.11 and got much better performance.


lsblk 
NAME                  MAJ:MIN RM   SIZE RO TYPE  MOUNTPOINT
rbd0                  252:0    0     1G  0 disk  /var/lib/kubelet/pods/db24eb3b-650c-42f3-bdf0-92ea0eaf37d8/volumes/kubernetes.io~csi/pvc-e70f3d74-c7bd-4652-983b-de3874d36117/mou
rbd1                  252:16   0     1G  0 disk  /var/lib/kubelet/pods/a1b0b156-0bcc-4d0c-b2f8-26a74337baed/volumes/kubernetes.io~csi/pvc-14460f8a-da5a-44e3-a033-a3bf5054f967/mou
rbd2                  252:32   0     8G  0 disk  /var/lib/kubelet/pods/0a5e3745-c6c2-49a9-971a-3ddac59af66c/volumes/kubernetes.io~csi/pvc-b8907f44-58d6-4599-b189-fafe65daed09/mou
rbd3                  252:48   0    50G  0 disk  /var/lib/kubelet/pods/84df04af-a7dd-4035-a9a2-22d3d315fa60/volumes/kubernetes.io~csi/pvc-faf5cefa-ecce-450a-9f5c-42e7ca7d7fc2/mou
rbd4                  252:64   0     1G  0 disk  /var/lib/kubelet/pods/872a3d7d-63d5-4567-86f6-bedab3fe0ad3/volumes/kubernetes.io~csi/pvc-83ea4013-b936-469b-b20f-7e703db2c871/mou
nvme0n1               259:0    0   477G  0 disk  
├─nvme0n1p1           259:1    0   511M  0 part  /boot/efi
├─nvme0n1p2           259:2    0   9.8G  0 part  
│ └─md2                 9:2    0   9.8G  0 raid1 /
├─nvme0n1p3           259:3    0  79.5G  0 part  
│ └─md3                 9:3    0 158.9G  0 raid0 
│   └─datavg-dockerlv 253:0    0    30G  0 lvm   /var/lib/docker
└─nvme0n1p4           259:8    0 387.2G  0 part  
  └─ceph--52ce0eb9--9e69--4f29--8b87--9ab3fbb5df3e-osd--block--c26bdb06--2325--4fcd--9b8c--77a93ab46de0
                      253:1    0   387G  0 lvm   
nvme1n1               259:4    0   477G  0 disk  
├─nvme1n1p1           259:5    0   511M  0 part  
├─nvme1n1p2           259:6    0   9.8G  0 part  
│ └─md2                 9:2    0   9.8G  0 raid1 /
├─nvme1n1p3           259:7    0  79.5G  0 part  
│ └─md3                 9:3    0 158.9G  0 raid0 
│   └─datavg-dockerlv 253:0    0    30G  0 lvm   /var/lib/docker
└─nvme1n1p4           259:9    0 387.2G  0 part  
  └─ceph--4ef4eb77--4b23--4ffd--b7d2--0fd2cfa5e568-osd--block--28435fba--91c7--4424--99f2--ea86709a87ca
                      253:2    0   387G  0 lvm   

On Friday, 16 August 2019 at 01:16 +0300, Vitaliy Filippov wrote:
rbd -p kube bench kube/bench --io-type write --io-threads 1 --io-total 10G --io-pattern rand
elapsed:    14  ops:   262144  ops/sec: 17818.16  bytes/sec: 72983201.32

It's a totally unreal number. Something is wrong with the test.

Test it with `fio` please:

fio -ioengine=rbd -name=test -bs=4k -iodepth=1 -rw=randwrite -runtime=60 -pool=kube -rbdname=bench

Reads are very very slow:
elapsed:   445  ops:    81216  ops/sec:   182.37  bytes/sec: 747006.15
elapsed:    14  ops:    14153  ops/sec:   957.57  bytes/sec: 3922192.15

This is closer to the reality.