Shixing Chen

Automatically identifying scene boundaries in movies and TV shows

Prime Video beat previous state-of-the-art work on the MovieNet dataset by 13% with a new model that is 90% smaller and 84% faster.

Shixing Chen, Xiaohan Nie, David Fan, Dongqing Zhang, Vimal Bhat, Raffay Hamid

Feb 09, 2023

Computer Vision

Prime Video presents two papers at CVPR 2022

Science teams presented two state-of-the-art works at the Conference on Computer Vision and Pattern Recognition (CVPR) 2022.

Raffay Hamid, Xiaohan Nie, Shixing Chen

Feb 01, 2023

Computer Vision

Robust cross-modal representation learning with progressive self-distillation

We introduce a novel training framework based on cross-modal contrastive learning that uses progressive self-distillation and soft image-text alignments to more efficiently learn robust representations from noisy data.

Shixing Chen, Raffay Hamid

Jan 02, 2023

Computer Vision

A robust and efficient framework for sports-field registration

We propose a novel framework to register sports fields as they appear in broadcast sports videos. Unlike previous approaches, we specifically address the challenge of field registration when (a) there are not enough distinguishable features on the field, and (b) no prior knowledge is available about the camera.

Xiaohan Nie, Shixing Chen, Raffay Hamid

Jan 02, 2023