A framework for online segmentation and classification of modeled actions performed in the context of unmodeled ones

TitleA framework for online segmentation and classification of modeled actions performed in the context of unmodeled ones
Publication TypeJournal Article
Year of Publication2017
AuthorsKosmopoulos, D, Papoutsakis, K, Argyros, A
JournalIEEE Transactions on Circuits and Systems for Video Technology
Volume27
Issue12
Pagination2578-2590
Abstract

In this work, we propose a discriminative framework for online simultaneous segmentation and classification of modeled visual actions that can be performed in the context of other, unknown actions.To this end, we employ Hough transform to vote in a 3D space for the begin point, the end point and the label of the segmented part of the input stream. An SVM is used to model each class and to suggest putative labeled segments on the timeline.To identify the most plausible segments among the putative ones we apply a dynamic programming algorithm, which maximizes the likelihood for label assignment in linear time. The performance of our method is evaluated on synthetic, as well as on real data (Weizmann, TUM Kitchen, UTKAD and Berkeley multimodal human action databases). Extensive quantitative results obtained on a number of standard datasets demonstrate that the proposed approach is of comparable accuracy to the state of the art for online stream segmentation and classification when all performed actions are known and performs considerably better in the presence of unmodeled actions.