Skip to main content

Multiscale convolutional neural networks for in-loop video restoration

Incorporating neural networks into a video codec as an in-loop filter has been shown to provide significant improvements in coding efficiency. Unfortunately, the computational complexity associated with the neural network, specifically the number of multiply-accumulate (MAC) operations, makes these approaches intractable in practice. In this paper, we consider using a multiscale approach to reduce complexity while maintaining coding efficiency. Experimental results demonstrate a 5.4× reduction in MAC operations while achieving an average bit rate savings of 6.4% and 6.3% for all intra and random access coding, respectively, when compared to the evolving AV2 standard. Ablation studies are also provided and show that the approach achieves all but 0.2% of the coding efficiency of full resolution processing.

For the full paper, see Multiscale convolutional neural networks for in-loop video restoration on the Amazon Science website.

Senior Video Scientist – Prime Video
Senior Principal Scientist – Prime Video
Principal Research Scientist – Prime Video