[Bug 1846427] Re: 4.1.0: qcow2 corruption on savevm/quit/loadvm cycle
From: Laszlo Ersek (Red Hat)
Subject: [Bug 1846427] Re: 4.1.0: qcow2 corruption on savevm/quit/loadvm cycle
Date: Mon, 21 Oct 2019 18:12:25 -0000
In reply to Kevin's comment #13:
> I find Laszlo's case with a preallocated image particularly surprising
> because the behaviour isn't supposed to have changed at all for
> preallocated images, at least if the heuristics still detects them as
> such.
But isn't that "if" at the core of this problem? What happens if the
detection misfires? (This is not a loaded question, I'm not implying any
particular circumstances; I'm just surprised that heuristics could be
considered at all.)
> Once a preallocated image becomes almost fully allocated, it's
> expected that we won't detect it any more. So, Laszlo, do you know how
> much of your images was allocated? 'qemu-img check' prints the
> allocation statistics.
I don't have the images any longer, and since then, I've been running
qemu 4.0 (for my upstream QEMU binaries).
However, I can say some things (with both affected VMs being Fedora
installations):
- As noted earlier, the images were formatted for 100GB, with
preallocation=metadata.
- I always install Fedora from Live ISOs (never starting with
pre-installed images), and right after installation, "du" on the host
side always reports 5-8 GB usage. Definitely never more than 10GB. So
I'd say these images were very sparsely populated.
- I always use qcow2 images like this, in the domain XMLs:
<driver name='qemu' type='qcow2' cache='writeback'
error_policy='enospace' discard='unmap'/>
and I always use virtio-scsi so that discard='unmap' actually has an
effect.
- I occasionally run "fstrim" in the guest, and / or "virsh domfstrim"
on the host. (And re-run "du" on the host side in every such case.)
- Right after installation (with the VM powered down), I might compress
the image with "qemu-img convert -c"; but I don't believe I've done
that recently.
- The general idea on my end is that I'd like to limit guest disk usage
by the *host* disk's free space, and not by an arbitrary pre-set disk
image size. Hence 100GB stands for "infinity" (I might have used 1TB
just as well), and error_policy='enospace' lets me act, should a guest
actually run out of space, on the host disk. Finally, discard='unmap'
prevents waste. I use "preallocation=metadata" because the initial
size cost is negligible, but I perceive writes to be faster.
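For illustration, the image setup described above can be sketched as follows (the file name and size are hypothetical; this is not the exact command history from the report):

```shell
# Create a 100 GB qcow2 image with metadata preallocation:
# the L1/L2 tables are allocated up front, data clusters are not,
# so the initial on-disk size stays small.
qemu-img create -f qcow2 -o preallocation=metadata disk.qcow2 100G

# Inspect allocation statistics; a freshly installed guest would
# typically show only a small percentage of clusters allocated.
qemu-img check disk.qcow2
```

This matches the usage pattern above: a large virtual size as "infinity", with actual host-side usage kept low via discard/fstrim.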
Hopefully this helps at least a tiny bit... Thanks!
https://bugs.launchpad.net/bugs/1846427
Title:
4.1.0: qcow2 corruption on savevm/quit/loadvm cycle
Status in QEMU:
New
Bug description:
I'm seeing massive corruption of qcow2 images with qemu 4.1.0 and git
master as of 7f21573c822805a8e6be379d9bcf3ad9effef3dc after a few
savevm/quit/loadvm cycles. I've narrowed it down to the following
reproducer (further notes below):
# qemu-img check debian.qcow2
No errors were found on the image.
251601/327680 = 76.78% allocated, 1.63% fragmented, 0.00% compressed clusters
Image end offset: 18340446208
# bin/qemu/bin/qemu-system-x86_64 -machine pc-q35-4.0.1,accel=kvm -m 4096
-chardev stdio,id=charmonitor -mon chardev=charmonitor -drive
file=debian.qcow2,id=d -S
qemu-system-x86_64: warning: dbind: Couldn't register with accessibility bus:
Did not receive a reply. Possible causes include: the remote application did
not send a reply, the message bus security policy blocked the reply, the reply
timeout expired, or the network connection was broken.
QEMU 4.1.50 monitor - type 'help' for more information
(qemu) loadvm foo
(qemu) c
(qemu) qcow2_free_clusters failed: Invalid argument
qcow2_free_clusters failed: Invalid argument
qcow2_free_clusters failed: Invalid argument
qcow2_free_clusters failed: Invalid argument
quit
[m@nargothrond:~] qemu-img check debian.qcow2
Leaked cluster 85179 refcount=2 reference=1
Leaked cluster 85180 refcount=2 reference=1
ERROR cluster 266150 refcount=0 reference=2
[...]
ERROR OFLAG_COPIED data cluster: l2_entry=422840000 refcount=1
9493 errors were found on the image.
Data may be corrupted, or further writes to the image may corrupt it.
2 leaked clusters were found on the image.
This means waste of disk space, but no harm to data.
259266/327680 = 79.12% allocated, 1.67% fragmented, 0.00% compressed clusters
Image end offset: 18340446208
This is on an x86_64 Linux 5.3.1 Gentoo host with qemu-system-x86_64
and accel=kvm. The compiler is gcc-9.2.0, with the rest of the system
similarly current.
Reproduced with qemu-4.1.0 from the distribution package as well as a
vanilla git checkout of tag v4.1.0 and commit
7f21573c822805a8e6be379d9bcf3ad9effef3dc (today's master). Does not
happen with qemu compiled from a vanilla checkout of tag v4.0.0. Build
sequence:
./configure --prefix=$HOME/bin/qemu-bisect --target-list=x86_64-softmmu
--disable-werror --disable-docs
[...]
CFLAGS -O2 -U_FORTIFY_SOURCE -D_FORTIFY_SOURCE=2 -g
[...] (can provide full configure output if helpful)
make -j8 install
The kind of guest OS does not matter: seen with Debian testing 64bit,
Windows 7 x86/x64 BIOS and Windows 7 x64 EFI.
The virtual storage controller does not seem to matter: seen with
VirtIO SCSI, emulated SCSI and emulated SATA AHCI.
Caching mode (none, directsync, writeback), aio mode (threads,
native), discard (ignore, unmap) and detect-zeroes (off, unmap) do
not influence occurrence either.
Having more RAM in the guest seems to increase the odds of corruption:
with 512 MB the problem hardly occurs at all in the Debian guest,
while with 4 GB of RAM it happens almost instantly.
An automated reproducer works as follows:
- the guest *does* mount its root fs and swap with option discard and
my testing leaves me with the impression that file deletion rather
than reading is causing the issue
- foo is a snapshot of the running Debian VM which is already running
command
# while true ; do dd if=/dev/zero of=foo bs=10240k count=400 ; done
to produce some I/O to the disk (4GB file with 4GB of RAM).
- on the host, a loop continuously resumes and saves the guest state,
quitting qemu in between:
# while true ; do (echo loadvm foo ; echo c ; sleep 10 ; echo stop ;
echo savevm foo ; echo quit ) | bin/qemu-bisect/bin/qemu-system-x86_64
-machine pc-q35-3.1,accel=kvm -m 4096 -chardev stdio,id=charmonitor
-mon chardev=charmonitor -drive file=debian.qcow2,id=d -S -display
none ; done
- quitting qemu in between saves and loads seems to be necessary for
the problem to occur. Just continuously saving and loading the guest
state in one session does not trigger it.
- For me, after about 2 to 6 iterations of above loop the image is
corrupted.
- corruption manifests with other messages from qemu as well, e.g.:
(qemu) loadvm foo
Error: Device 'd' does not have the requested snapshot 'foo'
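The reproducer steps above can be collected into one host-side script; a sketch, with the qemu path, machine type, snapshot name, and image file taken from the report (the iteration count and the final check are additions for convenience):

```shell
#!/bin/sh
# Reproducer sketch: repeatedly load snapshot 'foo', let the guest's
# dd loop run for 10 s, save the snapshot, and quit qemu.
# Quitting qemu between iterations appears to be required to trigger it.
for i in 1 2 3 4 5 6 7 8 9 10; do
    { echo 'loadvm foo'; echo 'c'; sleep 10
      echo 'stop'; echo 'savevm foo'; echo 'quit'; } |
    bin/qemu-bisect/bin/qemu-system-x86_64 \
        -machine pc-q35-3.1,accel=kvm -m 4096 \
        -chardev stdio,id=charmonitor -mon chardev=charmonitor \
        -drive file=debian.qcow2,id=d -S -display none
    # Stop as soon as the image shows corruption.
    qemu-img check debian.qcow2 || break
done
```

Per the report, corruption typically shows up after about 2 to 6 iterations.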
Using the above reproducer I have, to the best of my ability, bisected
the introduction of the problem to commit
69f47505ee66afaa513305de0c1895a224e52c45 (block: avoid recursive
block_status call if possible). qemu compiled from the commit before
does not exhibit the issue; from that commit on it does, and reverting
the commit off of current master makes it disappear.
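A bisection like the one described above can be automated with git bisect; a rough sketch, where test.sh is a hypothetical wrapper around the configure/make sequence and reproducer loop given earlier:

```shell
cd qemu
git bisect start
# Known-bad and known-good endpoints from the report.
git bisect bad 7f21573c822805a8e6be379d9bcf3ad9effef3dc
git bisect good v4.0.0
# test.sh (placeholder) rebuilds qemu, runs the savevm/quit/loadvm
# loop, and exits non-zero if 'qemu-img check' reports errors.
git bisect run ./test.sh
```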