The 32nd British Machine Vision (Virtual) Conference 2021 : Weakly-Supervised Dense Action Anticipation

Weakly-Supervised Dense Action Anticipation

Haotong Zhang, Fuhai Chen and Angela Yao

Keywords: dense anticipation video understanding weak supervision

Abstract: Dense anticipation aims to forecast future actions and their durations for long horizons. Existing approaches rely on fully-labelled data, i.e. sequences labelled with all future actions and their durations. We present a (semi-) weakly supervised method using only a small number of fully-labelled sequences and predominantly sequences in which only the (one) upcoming action is labelled. To this end, we propose a framework that generates pseudo-labels for future actions and their durations and adaptively refines them through a refinement module. Given only the upcoming action label as input, these pseudo-labels guide action/duration prediction for the future. We further design an attention mechanism to predict context-aware durations. Experiments on the Breakfast and 50Salads benchmarks verify our method’s effectiveness; we are competitive even when compared to fully supervised state-of-the-art models. We will make our code available at: https://github.com/zhanghaotong1/WSLVideoDenseAnticipation.

Paper Supplemental Code Poster Session 3

Weakly-Supervised Dense Action Anticipation

Haotong Zhang, Fuhai Chen and Angela Yao

Keywords: dense anticipation video understanding weak supervision

Video

Discussion