[08:27] <hverkuil> ndufresne: does the TRY_FMT issue in libv4l2 occur with a uvc webcam? I see that v4lconvert_do_try_format_uvc() just sets it to 0, but that can easily be fixed in libv4l2.
[10:06] <dafna21> hverkuil: a link from an entity to itself is allowed right?
[10:08] * dafna21 sent a long message:  <  >
[10:11] <hverkuil> dafna21: good question. It doesn't make much sense. sailus?
[10:11] <hverkuil> (make much sense in real HW)
[10:27] <sailus> away
[10:27] <sailus> I don't think anyone has thought about that. :-)
[10:27] <sailus> In general circular graphs may have issues.
[10:29] <sailus> The graph traversal algorithm keeps track of the entities it has seen, so it won't end up in a loop (until the maximum parse depth is reached), but it may not work correctly either.
[10:29] <sailus> It hasn't been tested at least.
[10:29] <sailus> dafna21: All I see after the first message is: * dafna21 sent a long message:  <  >
[10:32] <sailus> Lunch time. I'll be back soon.
[10:52] <sailus> Back.
[10:57] <dafna21> oh,
[10:58] <dafna21> Ill cut it to several: there is a problem with it in vimc, when I add a link from the scaler to itself, like this: http://ix.io/22fQ
[10:58] <dafna21> I get error when printing the topology: `media-ctl -d0 --print-dot`
[10:58] <dafna21> free(): invalid next size (fast)
[10:58] <dafna21> Aborted
[10:59] <dafna21> and a crash when unloading the module: http://ix.io/22fR
[11:22] <sailus> Hmm.
[11:22] <sailus> The code in __media_entity_remove_link() seems to assume there are two entities involved.
[11:23] <sailus> So I presume it removes two entries in the list in a single iteration, including the next entry to be iterated over.
[11:24] <sailus> That should be fixed.
[11:24] <sailus> Why would you like to create a link to the entity itself?
[11:33] <dafna21> I write the configfs feature to vimc which allow userspace to configure topology as they wish
[11:34] <dafna21> and now I write tests for it which includes topologies with entities to themself
[11:35] <dafna21> https://patchwork.kernel.org/cover/11153253/
[12:03] <sailus> Ok.
[12:04] <sailus> I think the kernel side *might* be good with this if __media_entity_remove_link() and other link related code is verified it works.
[12:04] <sailus> When a link is added, a backlink is created to the remote entity. That does not make sense if the remote entity is the same.
[12:05] <sailus> Generally code that is dealing with remote entities needs to be checked, too.
[12:13] <dafna21> in case connecting an entity to itself does not make sense in any hardware then maybe it will be easyer to just not allow it
[12:27] <tfiga> ndufresne: STREAMOFF(CAPTURE) was defined to reset because you may lose some bitstream when you STREAMOFF
[12:27] <tfiga> because STREAMOFF just discards any buffers available to be dequeued
[12:27] <tfiga> and resets them into dequeued state
[12:28] <tfiga> so for reallocation without a reset I'd indeed go with what hverkuil suggested
[12:29] <tfiga> hverkuil: I see in your vb2 cpu access series that you mention that it's possible to defer end_cpu_access call to buf_finish
[12:31] <tfiga> I guess that's because we only allow CPU access to buffers allocated from vb2-vmalloc?
[12:34] <ndufresne> hverkuil, yes, you can reproduce it with a UVC camera
[12:34] <ndufresne> hverkuil, and indeed, it might be fixable, someone need to sit and thing a little how to emulate colorimetry in general, but the CSC don't expose anything atm
[12:35] <ndufresne> (it's trivial to make userspace happy, it's not has trivial to make it work properly, since CSC may change/apply the colorimetry)
[12:36] <ndufresne> tfiga, noted
[12:37] <tfiga> ndufresne: also AFAIR none of the existing encoders actually keep the OUTPUT buffers for reference
[12:37] <tfiga> they have internal buffers
[12:38] <tfiga> not 100% sure about coda
[12:41] <tfiga> looking at coda, it also allocates internal framebuffers
[12:42] <tfiga> I guess most of the encoders may be blitting the source frame to a local buffer in a tiled (or otherwise more efficient) format
[12:43] <ndufresne> I wonder what's the exact lists now
[12:43] <ndufresne> though I doubt the Xilinx encoder have an intermediate
[12:44] <ndufresne> for sure Coda uses it's IPP, since it only deals with some Coda specific tiles
[12:45] <ndufresne> would need to ask svarbanov if that's the case for Venus
[12:45] <tfiga> still, I think that even if we have an encoder that needs to hold to a source buffer, we just wouldn't return it to the userspace before it releases it
[12:45] <ndufresne> for Nvidia, the driver is a bit non-standard, and obviously downstream
[12:46] <tfiga> and if one calls STREAMOFF(CAPTURE) that triggers a reset, such buffers could be released
[12:46] <ndufresne> that's not exactly correct
[12:46] <ndufresne> one raw image can produce multiple output buffers in real life
[12:47] <ndufresne> hence in theory, you could have a backward mode, where the output buffer can be held in order to complete it before pushing it downstream (for AU alignment)
[12:48] <ndufresne> doing streamoff(capture) would break that AU
[12:48] <tfiga> STREAMOFF(CAPTURE) basically means I don't care about what was being encoded anymore
[12:48] <tfiga> which is exactly the definition of STREAMOFF
[12:48] <ndufresne> on m2m, only STREAMOFF(CAPTURE) + STREAMOFF(OUTPUT) has strictly meant that
[12:49] <tfiga> not really
[12:49] <ndufresne> and that's why we do that on both side in user space (ffmpeg/gstreamer at least)
[12:49] <tfiga> in V4L2 in general STREAMOFF discards any buffers on the queue
[12:49] <ndufresne> in dequeue the buffers
[12:49] <tfiga> that makes the whole m2m processing that was happening meaningless
[12:50] <ndufresne> most encoder will have a ringbuffer on the encoded side
[12:50] <ndufresne> so I'm not sure what it would be discarding
[12:50] <tfiga> I think that's only coda :P
[12:50] <ndufresne> did you heard about amlogic ?
[12:50] <tfiga> haven't seen anything :)
[12:50] <tfiga> still, what STREAMOFF discards is the data that is already out of the ringbuffer
[12:51] <tfiga> so you end up with a hole in the stream
[12:51] <ndufresne> it's common for HW with a HW demuxers (aka STB hardware)
[12:51] <tfiga> basically STREAMOFF discards buffers that are past vb2_buffer_done() but before the userspace DQBUF
[12:51] <ndufresne> well, userspace can dequeue them before doing streamoff
[12:52] <tfiga> it needs to do a drain
[12:52] <tfiga> otherwise it doesn't know when to end dequeuing
[12:52] <ndufresne> sure
[12:52] <tfiga> but drain is a special case
[12:53] <tfiga> on the other hand, it probably doesn't matter...
[12:53] <ndufresne> what I'm saying is that associating a full reset with the CAPTURE queue is arbitrary, why not on the OUTPUT Queue ?
[12:53] <tfiga> one needs to do a drain anyway, if they want to reset in a deterministic point
[12:54] <ndufresne> good point
[12:54] <tfiga> it's not arbitrary - it's as I said, STREAMOFF can drop data
[12:54] <ndufresne> but I think it's good if we just go ahead and say you can allocate and S_FMT without streamoff
[12:54] <tfiga> dropping bitstream means the stream isn't valid anymore
[12:54] <hverkuil> tfiga: cpu access is also done for non-vmalloc drivers. But apparently you can call vb2_plane_end_cpu_access in buf_finish only for vmalloc drivers. Sorry, It is too long ago to remember why that is so. I wrote it, so it must be true :-)
[12:55] <ndufresne> sure, but the encoder state is still valid, encoder don't care about the encoded bits
[12:55] <tfiga> but the only way to resume meaningful encoding is to reset
[12:55] <ndufresne> sure, for which you will streamoff the output queue too
[12:55] <ndufresne> cause the cookies/timestamp make no sense anymore if you reset
[12:55] <tfiga> but what should happen if you just streamon the CAPTURE queue again?
[12:56] <tfiga> that would lead to some kind of undefined behavior
[12:56] <ndufresne> sounds like that, it's also undefined if you just cycle the CAPTURE queue imho
[12:56] <ndufresne> (but less dramatic maybe)
[12:56] <tfiga> in current spec it's defined to do a reset :)
[12:57] <tfiga> reset doesn't have to mean a full encoder reset, though
[12:57] <tfiga> reset is defined as producing a stream that is valid on its own
[12:57] <tfiga> without any dependencies on previous bitstream
[12:58] <ndufresne> it's just really bad guideline for userspace dev, since it will cause issue if one didn't "reset" both
[12:58] <tfiga> why it would cause any issue?
[12:59] <tfiga> hverkuil: what was in general the idea behind vb2-vmalloc? was it only for drivers that copy from/to those buffers?
[12:59] <ndufresne> tfiga, it's not very deterministic which frames remains in the queue
[13:00] <ndufresne> userspace would have to query each buffers to check the state, in order to know how many remains
[13:00] <ndufresne> I would just flush them all really
[13:03] <hverkuil> tfiga: originally it was for drivers that do not use direct DMA into the buffer, but instead copy the data to the buffer (or from, depending on the direction).
[13:03] <ndufresne> tfiga, thanks for the chat, have to leave now, I'll try and think about it, I'm still a little worried about "partial" reset of the queues, but in the end, it seems like userspace issue
[13:05] <tfiga> hverkuil: okay, so that's what I was suspecting
[13:05] <tfiga> it's interesting that we can export those buffers as DMA-bufs, though
[13:05] <tfiga> which then makes it possible for them to go to some hardware DMA
[13:05] <hverkuil> tfiga: This doesn't really hold true anymore. Right now it is just memory that can be allocated with vmalloc and doesn't need either contiguous DMA or scatter-gather DMA.
[13:06] <hverkuil> Although if it is imported as a dmabuf to another driver then scatter-gather is required again.
[13:08] <hverkuil> That was a later addition though. And I am not 100% certain this works perfectly in all circumstances.
[13:08] <hverkuil> Specifically it makes me wonder if the actual vmalloc shouldn't at least round up to the next page size.
[13:08] <tfiga> hmm, any reason to use vb2-vmalloc instead of vb2-dma-sg for a driver that does a DMA?
[13:08] <tfiga> the DMA still needs some way to address the buffer
[13:09] <hverkuil> Probably not.
[13:10] <hverkuil> vmalloc is typically used by usb devices (since the usb packets need to be parsed) and of course virtual drivers.
[13:10] <tfiga> if we limit it to the case of CPU access only, we could just make vb2-vmalloc call dma_buf_begin/end_cpu_access() in a generic way instead of the drivers
[13:11] <tfiga> I was also thinking about changing the concept a bit to having a vb2_queue flag that means that the driver needs CPU access in buf_prepare and another flag for buf_finish
[13:12] <tfiga> then the vb2 core could just call the begin and end functions appropriately
[13:12] <tfiga> we would also reuse those flags for allowing cached MMAP buffers
[13:12] <hverkuil> tfiga: just to make sure: you've seen the link to Sakari's version, developed from my original code? Just want to make sure you've looked at that as well.
[13:13] <hverkuil> https://git.linuxtv.org/sailus/media_tree.git/log/?h=vb2-dc-noncoherent
[13:13] <tfiga> hverkuil: I think that reached RFCv4 on the lists in the end
[13:13] <tfiga> which I reviewed
[13:13] <tfiga> and which we took over recently
[13:14] <hverkuil> I'm actually not sure if that's the same as the use of cpu access.
[13:14] <tfiga> I think it's slightly different, but actually related
[13:14] <tfiga> Sakari's series is about handling cached buffers
[13:14] <tfiga> but cpu access is also about cached buffers, but imported ones
[13:15] <hverkuil> Anyway, go wild. It's all too long ago since I worked on this, so my memory of it is rather vague. And things have changed since then, so some things might not make sense anymore, or can be done better.
[13:15] <tfiga> we're going to merge this into one series
[13:15] <hverkuil> sounds good.
[13:15] <tfiga> I hope we can get something posted around next week
[13:16] <tfiga> at least an RFC :)
[13:16] <tfiga> it might take a while to inspect all the drivers for what they're doing in their buf_prepare and buf_finish
[13:17] <tfiga> alternatively we could just assume by default that all drivers do some access in their callbacks
[13:17] <tfiga> I guess that's the existing behavior
[13:17] <tfiga> and gradually change drivers to set the queue flags appropraitely
[13:20] <hverkuil> Actually, outside usb and virtual drivers there aren't many that need cpu access.
[13:20] <hverkuil> In the remaining cases it's most commonly needed for compressed formats where the driver needs to patch a header.
[13:21] <hverkuil> You can make it generic for vmalloc, but for the others I would patch the drivers.
[14:19] <dagmcr> hi guys! I'm trying to solve a problem with a v4l2 platform device which sometimes is not properly initialized in the media controller (about 10% of the times). This means without changing anything, just rebooting the system and check. Any hint?
[14:29] <ezequielg> dagmcr: what do you mean?
[14:37] <dagmcr> The `/dev/mediaX` node is not created even though the rest of the `/dev/videoX` are listed. The weird is that only happens around 10% of the times you boot the system (without changing anything). The rest, works fines (so `/dev/mediaX` is created). And sadly, I have only tested on a 4.14 kernel version
[14:45] <ezequielg> sounds like a pretty regular race somewhere.
[14:45] <ezequielg> not sure it's strictly media related.
[14:45] <ezequielg> have you done any debugging/tracing?
[14:47] <dagmcr> I just enabled dynamic debugging in the drivers/media/* but no tracing yet
[14:47] <ezequielg> which driver is that?
[14:47] <dagmcr> qcam-camss
[14:54] <ezequielg> dagmcr: i suspect sometimes not all your subdevs register, and thus v4l2_async_notifier_operations.complete is not called.
[14:54] <ezequielg> add some prints there.
[14:55] <dagmcr> cool! thanks ezequielg for the tip. I will start debugging that part for all the subdevs.
[16:11] *** benjiG has left