Han, Ting: Learning to Interpret and Apply Multimodal Descriptions. 2018

Inhalt

Introduction

Related work

Relations between speech and co-verbal hand gestures

Multimodal human-computer interfaces

Multimodal corpora

Multimodal spatial scene description corpus

Multimodal object description corpus

A system of understanding multimodal spatial descriptions

Learning knowledge from prior experience

Summary

Towards real-time understanding of multimodal spatial descriptions

System evaluation

Summary

Investigate symbolic and iconic modes in object descriptions

Experiments

Learning semantic categories of multimodal descriptions

Summary

Conclusion and future work

References