New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

Is feature extraction possible? #72

Open

aretii opened this issue Sep 23, 2024 · 2 comments

aretii commented Sep 23, 2024

Is it possible to use the model for feature exrtaction from audio? Without having any text as input?

thanhtvt commented Sep 26, 2024

@aretii In that case, why don't you use Whisper instead, since whisper-large-v3 was used as the audio encoder in Qwen2-Audio?

Author

aretii commented Oct 14, 2024

@thanhtvt I thought there were modifications to the whisper-large-v3 used for Qwen.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment