Unsupported features for 16-bit PCM WAV #55

jasondraether · 2022-02-04T21:03:53Z

Using opensmile to process a wavfile saved as 16-bit PCM at 16000 sampling rate, I'm getting some features as 0.0 (mainly fundamental frequency / F0), which of course cascades to 0.0 for jitter and shimmer as well. Does opensmile not support 16-bit PCM? This problem appears to vanish if I convert the speech signal array to 32-bit floating point before running it through "smile.process_signal()". This has happened for all feature sets for both Functionals and LLDs.

frankenjoe · 2022-02-07T08:27:26Z

opensmile expects 32-bit float as input. You can use process_file(file) to directly process 16-bit PCM, though.

chausner-audeering · 2022-02-07T09:01:56Z

If 32-bit float input is a requirement, I guess we should validate that the input format matches and throw an exception if not. Otherwise the chance that you get bogus results without noticing is high. Or we could convert implicitly to 32-bit float before writing to openSMILE, or have openSMILE do the conversion by correctly setting the format settings of cExternalAudioInput.

frankenjoe · 2022-02-07T11:40:06Z

I checked and actually we forward int16 to opensmile, so we need to bypass the following line:

opensmile-python/opensmile/core/smile.py

Line 272 in 43042a8

signal *= 32768

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Unsupported features for 16-bit PCM WAV #55

Unsupported features for 16-bit PCM WAV #55

jasondraether commented Feb 4, 2022

frankenjoe commented Feb 7, 2022

chausner-audeering commented Feb 7, 2022

frankenjoe commented Feb 7, 2022 •

edited

Loading

Unsupported features for 16-bit PCM WAV #55

Unsupported features for 16-bit PCM WAV #55

Comments

jasondraether commented Feb 4, 2022

frankenjoe commented Feb 7, 2022

chausner-audeering commented Feb 7, 2022

frankenjoe commented Feb 7, 2022 • edited Loading

frankenjoe commented Feb 7, 2022 •

edited

Loading