Machine Learning for Multimodal Interaction

Popescu-Belis, Andrei.

Machine Learning for Multimodal Interaction 4th International Workshop, MLMI 2007, Brno, Czech Republic, June 28-30, 2007, Revised Selected Papers / [electronic resource] : edited by Andrei Popescu-Belis, Steve Renals, Hervé Bourlard. - XI, 308 p. online resource. - Lecture Notes in Computer Science, 4892 0302-9743 ; . - Lecture Notes in Computer Science, 4892 .

Invited Paper -- Robust Real Time Face Tracking for the Analysis of Human Behaviour -- Multimodal Processing -- Conditional Sequence Model for Context-Based Recognition of Gaze Aversion -- Meeting State Recognition from Visual and Aural Labels -- Object Category Recognition Using Probabilistic Fusion of Speech and Image Classifiers -- HCI, User Studies and Applications -- Automatic Annotation of Dialogue Structure from Simple User Interaction -- Interactive Pattern Recognition -- User Specific Training of a Music Search Engine -- An Ego-Centric and Tangible Approach to Meeting Indexing and Browsing -- Integrating Semantics into Multimodal Interaction Patterns -- Towards an Objective Test for Meeting Browsers: The BET4TQB Pilot Experiment -- Image and Video Processing -- Face Recognition in Smart Rooms -- Gaussian Process Latent Variable Models for Human Pose Estimation -- Discourse and Dialogue Processing -- Automatic Labeling Inconsistencies Detection and Correction for Sentence Unit Segmentation in Conversational Speech -- Term-Weighting for Summarization of Multi-party Spoken Dialogues -- Automatic Decision Detection in Meeting Speech -- Czech Text-to-Sign Speech Synthesizer -- Speech and Audio Processing -- Using Prosodic Features in Language Models for Meetings -- Posterior-Based Features and Distances in Template Matching for Speech Recognition -- A Study of Phoneme and Grapheme Based Context-Dependent ASR Systems -- Transfer Learning for Tandem ASR Feature Extraction -- Spoken Term Detection System Based on Combination of LVCSR and Phonetic Search -- Frequency Domain Linear Prediction for QMF Sub-bands and Applications to Audio Coding -- Modeling Vocal Interaction for Segmentation in Meeting Recognition -- Binaural Speech Separation Using Recurrent Timing Neural Networks for Joint F0-Localisation Estimation -- PASCAL Speech Separation Challenge II -- To Separate Speech -- Microphone Array Beamforming Approach to Blind Speech Separation.

9783540781554

10.1007/978-3-540-78155-4 doi


Computer science.
Artificial intelligence.
Translators (Computer programs).
Computer vision.
Computer Science.
User Interfaces and Human Computer Interaction.
Artificial Intelligence (incl. Robotics).
Language Translation and Linguistics.
Computers and Society.
Image Processing and Computer Vision.

QA76.9.U83 QA76.9.H85

005.437 4.019

Maintained by VTU Library