For machine learning analysis of sound, I've found nothing that beats a spectrogram.
While writing this post, I wondered: can you turn a spectrogram back into audio? Apparently you can. This Stack Overflow question covers doing it with librosa functions: https://stackoverflow.com/questions/57967487/convert-spectrogram-to-audio-using-librosa-functions