Re: [PATCH v2 03/17] hw/block/nvme: handle dma errors

qemu-devel

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [PATCH v2 03/17] hw/block/nvme: handle dma errors

From:	Klaus Jensen
Subject:	Re: [PATCH v2 03/17] hw/block/nvme: handle dma errors
Date:	Mon, 21 Sep 2020 21:47:37 +0200

On Sep 21 09:50, Keith Busch wrote:
> On Fri, Sep 18, 2020 at 10:36:07PM +0200, Klaus Jensen wrote:
> > @@ -466,15 +476,21 @@ static void nvme_post_cqes(void *opaque)
> >              break;
> >          }
> >  
> > -        QTAILQ_REMOVE(&cq->req_list, req, entry);
> >          sq = req->sq;
> >          req->cqe.status = cpu_to_le16((req->status << 1) | cq->phase);
> >          req->cqe.sq_id = cpu_to_le16(sq->sqid);
> >          req->cqe.sq_head = cpu_to_le16(sq->head);
> >          addr = cq->dma_addr + cq->tail * n->cqe_size;
> > +        ret = pci_dma_write(&n->parent_obj, addr, (void *)&req->cqe,
> > +                            sizeof(req->cqe));
> > +        if (ret) {
> > +            trace_pci_nvme_err_addr_write(addr);
> > +            timer_mod(cq->timer, qemu_clock_get_ns(QEMU_CLOCK_VIRTUAL) +
> > +                      500 * SCALE_MS);
> > +            break;
> > +        }
> > +        QTAILQ_REMOVE(&cq->req_list, req, entry);
> 
> Is this error really ever transient such that a retry later may be
> successful? I didn't find a way that appeared that it could be. If not,
> this probably deserves a CFS condition rather than a retry.
> 

I'll admit that I did this patch with the blktests test case in mind,
and that retrying causes the test to pass.

But I think you are right that retrying might not be the right way to
"pass the test".

I tested and setting CFS also passes the test and now actually exercises
the reset path in the kernel. So wins all around.

I am thinking we do the same for a failed read in nvme_process_sq then?

signature.asc
Description: PGP signature

[Prev in Thread]

Current Thread

[Next in Thread]

[PATCH v2 00/17] hw/block/nvme: multiple namespaces support, Klaus Jensen, 2020/09/18
- [PATCH v2 01/17] hw/block/nvme: fix typo in trace event, Klaus Jensen, 2020/09/18
  - Re: [PATCH v2 01/17] hw/block/nvme: fix typo in trace event, Philippe Mathieu-Daudé, 2020/09/21
- [PATCH v2 02/17] pci: pass along the return value of dma_memory_rw, Klaus Jensen, 2020/09/18
- [PATCH v2 03/17] hw/block/nvme: handle dma errors, Klaus Jensen, 2020/09/18
  - Re: [PATCH v2 03/17] hw/block/nvme: handle dma errors, Keith Busch, 2020/09/21
    - Re: [PATCH v2 03/17] hw/block/nvme: handle dma errors, Klaus Jensen <=
- [PATCH v2 04/17] hw/block/nvme: commonize nvme_rw error handling, Klaus Jensen, 2020/09/18
- [PATCH v2 07/17] hw/block/nvme: fix endian conversion, Klaus Jensen, 2020/09/18
  - Re: [PATCH v2 07/17] hw/block/nvme: fix endian conversion, Philippe Mathieu-Daudé, 2020/09/21
- [PATCH v2 06/17] hw/block/nvme: add a lba to bytes helper, Klaus Jensen, 2020/09/18
- [PATCH v2 05/17] hw/block/nvme: alignment style fixes, Klaus Jensen, 2020/09/18
- [PATCH v2 09/17] hw/block/nvme: refactor aio submission, Klaus Jensen, 2020/09/18
  - Re: [PATCH v2 09/17] hw/block/nvme: refactor aio submission, Keith Busch, 2020/09/21
    - Re: [PATCH v2 09/17] hw/block/nvme: refactor aio submission, Klaus Jensen, 2020/09/21
- [PATCH v2 08/17] hw/block/nvme: add symbolic command name to trace events, Klaus Jensen, 2020/09/18
- [PATCH v2 10/17] hw/block/nvme: default request status to success, Klaus Jensen, 2020/09/18

Prev by Date: Re: [PATCH v2 3/5] hw/timer/bcm2835: Rename variable holding CTRL_STATUS register
Next by Date: Re: [PATCH v2 5/5] hw/arm/bcm2835_peripherals: Correctly wire the SYS_timer IRQs
Previous by thread: Re: [PATCH v2 03/17] hw/block/nvme: handle dma errors
Next by thread: [PATCH v2 04/17] hw/block/nvme: commonize nvme_rw error handling
Index(es):
- Date
- Thread