How to use a single pipeline function to decode a video file (ex: .mp4) into video AND audio tensors #5597

zade-twelvelabs · 2024-08-05T20:02:30Z

Describe the question.

fn.readers.video must be used to read and decode video files
fn.readers.file must be used to decode audio files, but does not accept video formats

So if I can't uses fn.readers.file to read a videos audio, and fn.readers.video does not decode video audio, how do I decode a .mp4 files audio?

Check for duplicates

I have searched the open bugs/issues and have found no duplicates for this bug report

JanuszL · 2024-08-05T21:15:56Z

Hi @zade-twelvelabs,

Thank you for reaching out. Currently, DALI doesn't support decoding audio from mp4 files. The current audio decoding capabilities (and the flow) are described here.
What you can do is use the external source operator and utilize FFmpeg to load and decode audio from mp4 containers. As audio decoding is not GPU accelerated in DALI, there shouldn't be a substantial perf overhead due to this.

zade-twelvelabs added the question Further information is requested label Aug 5, 2024

dali-automaton assigned JanuszL Aug 6, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How to use a single pipeline function to decode a video file (ex: .mp4) into video AND audio tensors #5597

How to use a single pipeline function to decode a video file (ex: .mp4) into video AND audio tensors #5597

zade-twelvelabs commented Aug 5, 2024

JanuszL commented Aug 5, 2024

How to use a single pipeline function to decode a video file (ex: .mp4) into video AND audio tensors #5597

How to use a single pipeline function to decode a video file (ex: .mp4) into video AND audio tensors #5597

Comments

zade-twelvelabs commented Aug 5, 2024

Describe the question.

Check for duplicates

JanuszL commented Aug 5, 2024