Image generative models have seen significant progress, but video generative models face challenges in generating short video clips. Current evaluation metrics like FVD may not adequately capture the unique characteristics of videos. STREAM proposes a new metric that can evaluate spatial and temporal aspects separately, offering insights into improving video generative models. By independently assessing temporal naturalness (STREAM-T) and realism/diversity (STREAM-S), STREAM provides a comprehensive analysis tool for video quality. The proposed metric addresses limitations in existing metrics and offers a versatile solution for evaluating various types of videos.
Sang ngôn ngữ khác
từ nội dung nguồn
arxiv.org
Thông tin chi tiết chính được chắt lọc từ
by Pum Jun Kim,... lúc arxiv.org 03-18-2024
https://arxiv.org/pdf/2403.09669.pdfYêu cầu sâu hơn