In recent years, machine learning (ML), especially deep neural networks (DNNs) and artificial intelligence (AI) in general, have been utilized broadly and successfully in different multimedia computing tasks, such as audio processing, image classification, computer vision, image retrieval, and healthcare, amongst others [1]-[2]. It is widely acknowledged that such tasks involve the processing of various information streams to gain valuable insights from the input data sources, intermediate decis