Multimodal classification of digits using image and audio modalities. One-shot learning with a Siamese network predicts whether a given image-audio pair belongs to the same class.
Updated Mar 25, 2023 - Python
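The image-audio pairing described above can be sketched roughly as follows. This is a minimal illustration, not the repository's code: the input sizes, the single-layer encoders, the untrained random weights, the contrastive loss, and the `threshold` value are all assumptions made for the example. Because the two inputs come from different modalities, the sketch uses one encoder per modality that both project into a shared embedding space.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical sizes: a flattened 28x28 image and a 13-dim audio feature vector.
IMG_DIM, AUD_DIM, EMB_DIM = 784, 13, 16

# Untrained placeholder weights (assumption); a real model would learn these.
W_img = rng.normal(scale=0.05, size=(IMG_DIM, EMB_DIM))
W_aud = rng.normal(scale=0.05, size=(AUD_DIM, EMB_DIM))


def embed(x, W):
    """Project one input into the shared space and L2-normalise it."""
    z = np.tanh(x @ W)
    return z / (np.linalg.norm(z) + 1e-12)


def pair_distance(image, audio):
    """Euclidean distance between the two embeddings (0 = identical)."""
    return float(np.linalg.norm(embed(image, W_img) - embed(audio, W_aud)))


def contrastive_loss(distance, same_class, margin=1.0):
    """Pull matching pairs together, push mismatched pairs past the margin."""
    if same_class:
        return distance ** 2
    return max(0.0, margin - distance) ** 2


def is_same_class(image, audio, threshold=0.5):
    """Decide 'same digit' when the embeddings are closer than the threshold."""
    return pair_distance(image, audio) < threshold
```

At inference time only `is_same_class` is needed; `contrastive_loss` shows one common training objective for this kind of pair model, which the source does not confirm is the one used.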
A script for visualizing medical heartbeat audio data using NumPy and Matplotlib. It creates visualizations of heartbeats from .wav files.
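As a rough idea of what such a script involves, the sketch below loads a .wav file into a NumPy array and plots the waveform with Matplotlib. It is not the repository's code: the function names `load_wav_mono` and `plot_waveform`, the 16-bit PCM restriction, and the output file name are assumptions made for illustration.

```python
import wave

import numpy as np


def load_wav_mono(path):
    """Read a 16-bit PCM .wav file; return (sample_rate, samples in [-1, 1])."""
    with wave.open(path, "rb") as wf:
        if wf.getsampwidth() != 2:
            raise ValueError("this sketch only handles 16-bit PCM")
        rate = wf.getframerate()
        channels = wf.getnchannels()
        raw = wf.readframes(wf.getnframes())
    samples = np.frombuffer(raw, dtype="<i2").astype(np.float64) / 32768.0
    if channels > 1:
        # Average interleaved channels down to mono.
        samples = samples.reshape(-1, channels).mean(axis=1)
    return rate, samples


def plot_waveform(rate, samples, out_path="heartbeat.png"):
    """Save a time-domain plot of the recording (out_path is hypothetical)."""
    import matplotlib
    matplotlib.use("Agg")  # render to a file without needing a display
    import matplotlib.pyplot as plt

    t = np.arange(len(samples)) / rate
    fig, ax = plt.subplots(figsize=(10, 3))
    ax.plot(t, samples, linewidth=0.5)
    ax.set_xlabel("Time (s)")
    ax.set_ylabel("Amplitude")
    fig.savefig(out_path)
    plt.close(fig)
```

A typical call would be `rate, samples = load_wav_mono("recording.wav")` followed by `plot_waveform(rate, samples)`, where `recording.wav` stands in for any heartbeat recording.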
Audio-image classification of emotions
Multi-angle Lip Multimodal Video Data