Tejo Chalasani, Jan Ondrej, and Aljosa Smolic.
Egocentric gesture recognition for head-mounted AR devices.
In Adjunct Proceedings of the IEEE International Symposium for Mixed and Augmented Reality 2018 (To appear). 2018.
Natural interaction with virtual objects in AR/VR environments makes for a smooth user experience. Gestures are a natural extension from the real world to augmented space for achieving these interactions. Finding discriminating spatio-temporal features relevant to gestures and hands in ego-view is the primary challenge in recognising egocentric gestures. In this work we propose a data-driven, end-to-end deep learning approach to egocentric gesture recognition, which combines an ego-hand encoder network to find ego-hand features and a recurrent neural network to discern temporally discriminating features. Since deep learning networks are data intensive, we propose a novel data augmentation technique using green screen capture to alleviate the problem of ground truth annotation. In addition, we publish a dataset of 10 gestures performed in a natural fashion in front of a green screen for training, and the same 10 gestures performed in different natural scenes without a green screen for validation. These gesture performances are recorded using the egocentric RGB camera in a HoloLens, with corresponding head transformations. We evaluate the performance of our network using our natural dataset. In addition, we present the results of our network's performance in comparison to the state of the art using the AirGest dataset.
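
The green screen augmentation mentioned in the abstract can be illustrated with a generic chroma-key composite. The sketch below is not the authors' pipeline: it assumes a simple green-dominance threshold, and the function name `chroma_key_composite` and the `dominance` parameter are hypothetical names chosen for illustration. It replaces green-screen pixels in a frame with pixels from an arbitrary background image and returns a foreground mask, which is the kind of label that would otherwise have to be annotated by hand.

```python
import numpy as np


def chroma_key_composite(frame, background, dominance=40):
    """Replace green-screen pixels in `frame` with pixels from `background`.

    frame, background: uint8 arrays of shape (H, W, 3) in RGB order.
    dominance: assumed threshold (hypothetical value); a pixel is treated as
    green screen when its green channel exceeds both red and blue by more
    than this amount.
    Returns the composited frame and a boolean foreground mask.
    """
    if frame.shape != background.shape:
        raise ValueError("frame and background must have the same shape")

    # Widen to a signed type so channel differences do not wrap around.
    rgb = frame.astype(np.int16)
    r, g, b = rgb[..., 0], rgb[..., 1], rgb[..., 2]

    # True where the pixel looks like green screen.
    screen = (g - r > dominance) & (g - b > dominance)

    # Copy the frame, then overwrite only the green-screen pixels.
    out = frame.copy()
    out[screen] = background[screen]
    return out, ~screen
```

Calling this function on each frame of a green-screen recording with a different background image each time would yield several training variants of the same gesture. A fixed threshold is a simplification; real footage usually needs handling for colour spill and uneven lighting.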