bbrezillon, tfiga_: in case you didn't know, be aware the dmabuf importation for DMABuf that requires pinning, like Intel DRM exported DMABuf don't currently work with V4L2, hverkuil had a branch to try and address this, and the opposite, dmabuf from v4l2 into Intel DRM don't work either, you get bunch of visual corruption due to un-handled caching issues
it works of course if the DMABuf is CMA backed, but it does not work if it's e.g. vmalloc and written by the CPU (UVC and vivid cases)
ndufresne: dmabuf + caching requires quite a bit of work :-S
pinchartl, yes, I'm just doing a reminder that it's not implemented in vb2, or partially I should say
but it's complicated, because e.g. on Intel, dmabuf transfer usually don't need that, except for the DRM side which do dmabuf transfer without handling the cpu cache
I've seen that problem on the ZynqMP with a UVC camera too
What I'm wondering is if the current buffer API has something to do with this, so maybe it's worth if bbrezillon gives a quick look
ndufresne: AFAIR, doing cache maintenance on vmalloc-ed bufs is a pain
bbrezillon: doing cache maintenance is a pain. period
even more if your cache is VIVT
who uses VIVT these days ?
as you might have several entries pointing to the same physical address if the memory region is mapped from different context (userspace+kernel space)
no one
I just remember having this discussion with rmk
But the good side, is that Hans WIP for importing DRM buffer into UVC where relatively simply, it was mostly adding the missing dmabuf_sync calls, so that DRM side can do it's "pinning" thing
the other way around is ... complicated, a capture exporter does not know what the buffers will be used to, and there is no API right now to tell I believe
I wonder what the performance implications are
ndufresne: wasn't x86 coherent?
it is, except for the dma transfer to display
we have an x86 platform which has a v4l2 codec
Most of the time, yes.
I was told it's because of the strict timing requirement that they disable that
we import from V4L2 to i915 and it seems to work fine
tfiga_, codecs are very often fine, it's CPU writes that are problematic
but we probably don't use hardware overlays, so those buffers would never hit the display directly
it breaks when the cpu cache is populated with pages that aren't in physical memory yet
ndufresne: Many GPUs have non-coherent memory access and some ISPs, too. But there are opposite examples as well.
It depends on the device, and some devices do both.
(the platform is still on 3.14, so that's why we don't use overlays)
yes, as sailus underline, this seems to be why it's complicated, you don't know in advance if it's going to requires a flush or not
tfiga_, does the chromeos compositor have seamless support for HW overlays ?
ndufresne: yes, if the display controller exposes proper atomic
nice, I didn't know