Vimal Bhat

Graphic with text: "Producing high-quality output with minimal data labeling"

Automatically detecting recaps, introductions, and credits in content at scale

Prime Video uses computer vision and video understanding techniques to detect different video content segments, such as introductions, recaps, and opening or ending credits.

Hooman Mahyar, Vimal Bhat

Apr 25, 2023

Graphic with text: "Achieving a 99.6% reduction in memory footprint."

Machine Learning

Prime Video automatically detects audio-video synchronization defects in dubbed media at scale

Prime Video achieves a 99.4% F1 score in synchronizing dubbed audio to non-dubbed audio using an innovative, fast, and memory-efficient approach.

Avijit Vajpayee, Zhikang Zhang, Vimal Bhat

Mar 29, 2023

Graphic with the text "Creating a faster and more precise CV model."

Computer Vision

Automatically identifying scene boundaries in movies and TV shows

Prime Video beat previous state-of-the-art work on the MovieNet dataset by 13% with a new model that is 90% smaller and 84% faster.

Shixing Chen, Xiaohan Nie, David Fan, Dongqing Zhang, Vimal Bhat, Raffay Hamid

Feb 09, 2023

Computer Vision

LipNeRF: What is the right feature space to lip-sync a NeRF

In this work, we propose LipNeRF, a lip-syncing NeRF that bridges the gap between the accurate lip synchronization of GAN-based methods and the accurate 3D face modeling of NeRFs.

Abhinav Jain, Rohith Mysore Vijaya Kumar, Vimal Bhat

Jan 02, 2023

Machine Learning

A simple and efficient method for dubbed audio sync detection using compressive sensing

In this paper, we present a novel, accurate and efficient method for temporal sync detection between dubbed audio tracks and corresponding non-dubbed original-language audio tracks.

Avijit Vajpayee, Zhikang Zhang, Abhinav Jain, Vimal Bhat

Jan 02, 2023