Re: [PATCH 3/6] parallels: Add checking and repairing duplicate offsets

qemu-block


From:	Alexander Ivanov
Subject:	Re: [PATCH 3/6] parallels: Add checking and repairing duplicate offsets in BAT
Date:	Fri, 9 Sep 2022 09:37:09 +0200
User-agent:	Mozilla/5.0 (X11; Linux x86_64; rv:91.0) Gecko/20100101 Thunderbird/91.11.0


On 08.09.2022 19:45, Denis V. Lunev wrote:

On 9/8/22 19:15, Denis V. Lunev wrote:

On 9/2/22 10:52, Alexander Ivanov wrote:

Cluster offsets must be unique among all BAT entries.
Find duplicate offsets in the BAT.

If a duplicated offset is found fix it by copying the content
of the relevant cluster to a new allocated cluster and
set the new cluster offset to the duplicated entry.

Add host_cluster_index() helper to deduplicate the code.
Add highest_offset() helper. It will be used for code deduplication
in the next patch.

Signed-off-by: Alexander Ivanov <alexander.ivanov@virtuozzo.com>
---

 block/parallels.c | 136 ++++++++++++++++++++++++++++++++++++++++++++++
 1 file changed, 136 insertions(+)

diff --git a/block/parallels.c b/block/parallels.c
index dbcaf5d310..339ce45634 100644
--- a/block/parallels.c
+++ b/block/parallels.c

@@ -136,6 +136,26 @@ static int cluster_remainder(BDRVParallelsState *s, int64_t sector_num,
      return MIN(nb_sectors, ret);
  }

+static uint32_t host_cluster_index(BDRVParallelsState *s, int64_t off)
+{
+    off -= s->header->data_off << BDRV_SECTOR_BITS;
+    return off / s->cluster_size;
+}
+
+static int64_t highest_offset(BDRVParallelsState *s)
+{
+    int64_t off, high_off = 0;
+    int i;
+
+    for (i = 0; i < s->bat_size; i++) {
+        off = bat2sect(s, i) << BDRV_SECTOR_BITS;
+        if (off > high_off) {
+            high_off = off;
+        }
+    }
+    return high_off;
+}
+
  static int64_t block_status(BDRVParallelsState *s, int64_t sector_num,
                                int nb_sectors, int *pnum)
  {

@@ -547,6 +567,114 @@ static int parallels_check_leak(BlockDriverState *bs,
      return 0;
  }

+static int parallels_check_duplicate(BlockDriverState *bs,
+                                     BdrvCheckResult *res,
+                                     BdrvCheckMode fix)
+{
+    BDRVParallelsState *s = bs->opaque;
+    QEMUIOVector qiov;
+    int64_t off, high_off, sector;
+    unsigned long *bitmap;
+    uint32_t i, bitmap_size, cluster_index;
+    int n, ret = 0;
+    uint64_t *buf = NULL;
+    bool new_allocations = false;
+
+    high_off = highest_offset(s);
+    if (high_off == 0) {
+        return 0;
+    }
+
+    /*
+     * Create a bitmap of used clusters.
+     * If a bit is set, there is a BAT entry pointing to this cluster.
+     * Loop through the BAT entries, check bits relevant to an entry offset.
+     * If bit is set, this entry is duplicated. Otherwise set the bit.
+     */
+    bitmap_size = host_cluster_index(s, high_off) + 1;
+    bitmap = bitmap_new(bitmap_size);
+
+    buf = g_malloc(s->cluster_size);
+    qemu_iovec_init(&qiov, 0);
+    qemu_iovec_add(&qiov, buf, s->cluster_size);
+
+    for (i = 0; i < s->bat_size; i++) {
+        off = bat2sect(s, i) << BDRV_SECTOR_BITS;
+        if (off == 0) {
+            continue;
+        }
+
+        cluster_index = host_cluster_index(s, off);
+        if (test_bit(cluster_index, bitmap)) {
+            /* this cluster duplicates another one */
+            fprintf(stderr,
+                    "%s duplicate offset in BAT entry %u\n",
+                    fix & BDRV_FIX_ERRORS ? "Repairing" : "ERROR", i);
+
+            res->corruptions++;
+
+            if (fix & BDRV_FIX_ERRORS) {
+                /*
+                 * Reset the entry and allocate a new cluster
+                 * for the relevant guest offset. In this way we let
+                 * the lower layer to place the new cluster properly.
+                 * Copy the original cluster to the allocated one.
+                 */
+                parallels_set_bat_entry(s, i, 0);
+
+                ret = bdrv_pread(bs->file, off, s->cluster_size, buf, 0);
+                if (ret < 0) {
+                    res->check_errors++;
+                    goto out;
+                }
+
+                sector = (i * s->cluster_size) >> BDRV_SECTOR_BITS;
+                off = allocate_clusters(bs, sector, s->tracks, &n);
+                if (off < 0) {
+                    res->check_errors++;
+                    ret = off;
+                    goto out;
+                }
+                off <<= BDRV_SECTOR_BITS;
+                if (off > high_off) {
+                    high_off = off;
+                }
+
+                ret = bdrv_co_pwritev(bs->file, off, s->cluster_size, &qiov, 0);
+                if (ret < 0) {
+                    res->check_errors++;
+                    goto out;
+                }
+
+                new_allocations = true;
+                res->corruptions_fixed++;
+            }
+
+        } else {
+            bitmap_set(bitmap, cluster_index, 1);
+        }
+    }
+
+    if (new_allocations) {
+        /*
+         * When new clusters are allocated, file size increases
+         * by 128 Mb blocks. We need to truncate the file to the
+         * right size.
+         */
+        ret = parallels_handle_leak(bs, res, high_off, true);
+        if (ret < 0) {
+            res->check_errors++;
+            goto out;
+        }
+    }
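
For readers following along, the bitmap scan in the hunk above can be tried in isolation. This is only a sketch, not the QEMU code: count_duplicate_offsets() and its parameters are hypothetical names, the BAT is modelled as a plain array of host byte offsets (0 meaning "unallocated"), and every non-zero offset is assumed to be at or beyond data_off.

```c
#include <stddef.h>
#include <stdint.h>
#include <stdlib.h>

/*
 * Hypothetical stand-in for the BAT scan (not a QEMU function).
 * offsets[] holds host byte offsets; 0 means "unallocated".
 * Assumes every non-zero offset is >= data_off.
 * Returns the number of entries whose cluster was already claimed
 * by an earlier entry, or -1 if the bitmap cannot be allocated.
 */
static int count_duplicate_offsets(const uint64_t *offsets, size_t n,
                                   uint64_t data_off, uint64_t cluster_size)
{
    uint64_t high = 0;
    unsigned char *seen;
    size_t i, nbits;
    int dups = 0;

    /* Find the highest offset to size the bitmap. */
    for (i = 0; i < n; i++) {
        if (offsets[i] > high) {
            high = offsets[i];
        }
    }
    if (high == 0) {
        return 0;               /* nothing allocated, nothing to check */
    }

    /* One bit per host cluster, up to and including the highest one. */
    nbits = (size_t)((high - data_off) / cluster_size) + 1;
    seen = calloc((nbits + 7) / 8, 1);
    if (!seen) {
        return -1;
    }

    for (i = 0; i < n; i++) {
        size_t idx;

        if (offsets[i] == 0) {
            continue;
        }
        idx = (size_t)((offsets[i] - data_off) / cluster_size);
        if (seen[idx / 8] & (1u << (idx % 8))) {
            dups++;             /* bit already set: duplicated entry */
        } else {
            seen[idx / 8] |= (unsigned char)(1u << (idx % 8));
        }
    }

    free(seen);
    return dups;
}
```

The first entry that points at a cluster keeps it; only later entries with the same offset are counted, which matches the order in which the patch repairs them.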

OK. I have re-read the code with the test case at hand and now
understand the situation completely.

The problem is that the img_check() routine actually calls bdrv_check()
TWICE without reopening the image, so we run into trouble on the
second check because we have preallocated some space inside the
image. This is the root of the problem.

Though this kind of fix seems like overkill, I still do not
like the resulting code. At the very least, it does not scale with
the checks that we will add later.

I think that we could do that in two ways:
* temporarily set prealloc_mode to none at the start of
  parallels_co_check() and restore it at the end
* parallels_check_leak() should just set data_end and do nothing
  more, and we should truncate at the end of
  parallels_co_check() if we have performed ANY fix
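
The second option can be sketched roughly as follows. This is a toy model only: ToyImage, ToyCheck, toy_repair_step() and toy_finish_check() are hypothetical names, not QEMU structures or functions, and sizes are plain byte counts. Each repair step may grow the file by a preallocation chunk and only records that a fix happened; a single truncate runs once at the end of the check.

```c
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical toy model of an image file (not a QEMU structure). */
typedef struct ToyImage {
    uint64_t file_size;   /* current file length, bytes */
    uint64_t data_end;    /* end of the last used cluster, bytes */
} ToyImage;

/* Hypothetical bookkeeping shared by all checks in one run. */
typedef struct ToyCheck {
    bool fixed_anything;  /* set by any check that repaired something */
    int truncations;      /* how many times the file was truncated */
} ToyCheck;

/*
 * A repair that needs one new cluster. If the file is too short,
 * it grows by the cluster plus a preallocation chunk, the way
 * preallocation leaves spare space behind the data.
 */
static void toy_repair_step(ToyImage *img, ToyCheck *chk,
                            uint64_t cluster_size, uint64_t prealloc)
{
    if (img->data_end + cluster_size > img->file_size) {
        img->file_size = img->data_end + cluster_size + prealloc;
    }
    img->data_end += cluster_size;
    chk->fixed_anything = true;
}

/* One truncate at the very end, only if some check fixed something. */
static void toy_finish_check(ToyImage *img, ToyCheck *chk)
{
    if (chk->fixed_anything && img->file_size > img->data_end) {
        img->file_size = img->data_end;
        chk->truncations++;
    }
}
```

With this shape, adding further checks does not add further truncate calls; each check only has to keep data_end correct and set the flag.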


A better way found. We should not check the file length in handle_leak!
We should compare highest_off with data_end, and that is
146% correct.

The file length COULD be greater than the highest possible data cluster
offset, but data_end should point to the correct location.
That is it!

Den


Initially data_end points to the end of the cluster with the highest
offset in the BAT, not to the file end. So we can't detect leaks in
such a way.
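
A small illustration of this point, as a sketch with hypothetical helper names (end_of_highest_cluster() and leaked_bytes() are not QEMU functions) and with all values in bytes for simplicity: if data_end is derived from the highest BAT offset, comparing the two always reports no leak, while comparing against the file length does reveal the excess.

```c
#include <stddef.h>
#include <stdint.h>

/*
 * Hypothetical helper: end of the cluster with the highest offset,
 * in bytes, or 0 if no entry is allocated (offset 0 = unallocated).
 */
static uint64_t end_of_highest_cluster(const uint64_t *offsets, size_t n,
                                       uint64_t cluster_size)
{
    uint64_t high = 0;
    size_t i;

    for (i = 0; i < n; i++) {
        if (offsets[i] > high) {
            high = offsets[i];
        }
    }
    return high ? high + cluster_size : 0;
}

/* Hypothetical helper: bytes between the used end and a given limit. */
static uint64_t leaked_bytes(uint64_t used_end, uint64_t limit)
{
    return limit > used_end ? limit - used_end : 0;
}
```

Passing a data_end that was itself computed by end_of_highest_cluster() as the limit yields 0 by construction; passing the file length yields the size of the tail beyond the last used cluster.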
