Computer Vision

Graphic with the text "Prime Video presents at CVPR 2022."

Prime Video presents two papers at CVPR 2022

Science teams presented two state-of-the-art works at the Conference on Computer Vision and Pattern Recognition (CVPR) 2022.

Raffay Hamid, Xiaohan Nie, Shixing Chen

Feb 01, 2023

Graphic with the text "Creating a faster and more precise CV model."

Automatically identifying scene boundaries in movies and TV shows

Prime Video beat previous state-of-the-art work on the MovieNet dataset by 13% with a new model that is 90% smaller and 84% faster.

Shixing Chen, Xiaohan Nie, David Fan, Dongqing Zhang, Vimal Bhat, Raffay Hamid

Feb 09, 2023

Graphic with the text "Prime Video presents at WACV 2021."

Prime Video’s work on sports field registration, recap/intro detection

Two Prime Video papers at the Winter Conference on Applications of Computer Vision (WACV) 2021 proposed neural models for enhancing video-streaming experiences.

Raffay Hamid

Feb 01, 2023

Graphic with the text "Prime Video presents at WACV 2023."

Prime Video presents on Video/Audio Quality in Computer Vision and hosts a Grand Challenge during WACV 2023

During the Winter Conference on Applications of Computer Vision (WACV), Prime Video’s Yongjun Wu and Sriram Sethuraman discussed Video/Audio Quality in Computer Vision, and Hai Wei presented the HDR VQM Grand Challenge awards.

Yongjun Wu, Sriram Sethuraman, Hai Wei

Feb 07, 2023

Graphic with the text "Label-efficient video-content understanding."

How Prime Video uses contrastive learning to accelerate automatic video-understanding at scale

Prime Video invents new state-of-the-art weakly supervised and self-supervised contrastive learning algorithms to reduce its dependence on large amounts of labeled training data.

Raffay Hamid

Feb 16, 2023

Graphic with the text "Using CV to reinvent sports-field tracking."

Prime Video uses automatic field registration to create immersive viewing experiences for live sports

Prime Video used computer vision technology to reinvent sports-field tracking for monocular broadcasting videos.

Raffay Hamid, Xiaohan Nie

Feb 13, 2023

Robust actor recognition in entertainment multimedia at scale

Actor identification and localization in movies and TV series seasons can enable deeper engagement with the content. Manually identifying and tagging actors at every time instance in a video is error-prone because it is a highly repetitive, decision-intensive, and time-consuming task. The goal of this paper is to accurately label as many faces as possible in the video with actor names.

Manivel Sethu

Jan 02, 2023

A robust and efficient framework for sports-field registration

We propose a novel framework to register sports-fields as they appear in broadcast sports videos. Unlike previous approaches, we particularly address the challenge of field registration when: (a) there are not enough distinguishable features on the field, and (b) no prior knowledge is available about the camera.

Xiaohan Nie, Shixing Chen, Raffay Hamid

Jan 02, 2023

Improving compression efficiency using an encoder-aware motion compensated temporal filter

To overcome the drawbacks of prior motion-compensated temporal filter (MCTF) designs, we propose an encoder-aware MCTF (EA-MCTF) that resides within the encoder.

Rahul Vanam, Sriram Sethuraman

Mar 17, 2023

Subjective and objective video quality assessment of high dynamic range sports content

In this paper, we present a large-scale HDR video quality dataset for sports content that includes the above-mentioned important issues in live streaming, and a method of merging multiple datasets using anchor videos.

Yixu Chen, Yongjun Wu, Hai Wei, Sriram Sethuraman

Mar 10, 2023

Assessment of subjective and objective quality of live streaming sports videos

We built a video quality database specifically designed for live streaming VQA research. The new video database is called the Laboratory for Image and Video Engineering (LIVE) Livestream Database. The LIVE Livestream Database includes 315 videos of 45 contents impaired by 6 types of distortions.

Yongjun Wu, Hai Wei, Sriram Sethuraman

Jan 02, 2023

No-reference video quality assessment using space-time chips

We propose a new prototype model for no-reference video quality assessment (VQA) based on the natural statistics of space-time chips of videos. Space-time chips (ST-chips) are a new, quality-aware feature space which we define as space-time localized cuts of video data in directions that are determined by the local motion flow.

Yongjun Wu, Hai Wei

Jan 02, 2023

Computer Vision

Content about computer vision at Prime Video.