Skip to main content

A no-reference model for detecting audio artifacts using pretrained audio neural networks

This work presents a No-Reference model to detect audio artifacts in video. The model, based upon a Pretrained Audio Neural Network, classifies a 1-second audio segment as either No Defect, Audio Hum, Audio Hiss, Audio Distortion or Audio Clicks. The model achieves a balanced accuracy of 0.986 on our proprietary simulated dataset.

For the full paper, see A no-reference model for detecting audio artifacts using pretrained audio neural networks on the Amazon Science website.

Computer Vision Scientist – Prime Video
Software Development Engineer – Amazon
Senior Machine Learning Engineer – Prime Video