|
![]() |
|||
|
||||
OverviewThis book constitutes the thoroughly refereed post-proceedings of the First International Workshop on Machine Learning for Multimodal Interaction, MLMI 2004, held in Martigny, Switzerland in June 2004.The 30 revised full papers presented were carefully selected during two rounds of reviewing and revision. The papers are organized in topical sections on HCI and applications, structuring and interaction, multimodal processing, speech processing, dialogue management, and vision and emotion. Full Product DetailsAuthor: Samy Bengio , Hervé BourlardPublisher: Springer-Verlag Berlin and Heidelberg GmbH & Co. KG Imprint: Springer-Verlag Berlin and Heidelberg GmbH & Co. K Edition: 2005 ed. Volume: 3361 Dimensions: Width: 15.50cm , Height: 1.90cm , Length: 23.50cm Weight: 1.170kg ISBN: 9783540245094ISBN 10: 354024509 Pages: 362 Publication Date: 31 January 2005 Audience: Professional and scholarly , Professional & Vocational Format: Paperback Publisher's Status: Active Availability: In Print ![]() This item will be ordered in for you from one of our suppliers. Upon receipt, we will promptly dispatch it out to you. For in store availability, please contact us. Table of ContentsMLMI 2004.- Accessing Multimodal Meeting Data: Systems, Problems and Possibilities.- Browsing Recorded Meetings with Ferret.- Meeting Modelling in the Context of Multimodal Research.- Artificial Companions.- Zakim – A Multimodal Software System for Large-Scale Teleconferencing.- Towards Computer Understanding of Human Interactions.- Multistream Dynamic Bayesian Network for Meeting Segmentation.- Using Static Documents as Structured and Thematic Interfaces to Multimedia Meeting Archives.- An Integrated Framework for the Management of Video Collection.- The NITE XML Toolkit Meets the ICSI Meeting Corpus: Import, Annotation, and Browsing.- S-SEER: Selective Perception in a Multimodal Office Activity Recognition System.- Mapping from Speech to Images Using Continuous State Space Models.- An Online Algorithm for Hierarchical Phoneme Classification.- Towards Predicting Optimal Fusion Candidates: A Case Study on Biometric Authentication Tasks.- Mixture of SVMs for Face Class Modeling.- AV16.3: An Audio-Visual Corpus for Speaker Localization and Tracking.- The 2004 ICSI-SRI-UW Meeting Recognition System.- On the Adequacy of Baseform Pronunciations and Pronunciation Variants.- Tandem Connectionist Feature Extraction for Conversational Speech Recognition.- Long-Term Temporal Features for Conversational Speech Recognition.- Speaker Indexing in Audio Archives Using Gaussian Mixture Scoring Simulation.- Speech Transcription and Spoken Document Retrieval in Finnish.- A Mixed-Lingual Phonological Component Which Drives the Statistical Prosody Control of a Polyglot TTS Synthesis System.- Shallow Dialogue Processing Using Machine Learning Algorithms (or Not).- ARCHIVUS: A System for Accessing the Content of Recorded Multimodal Meetings.- Piecing Together the Emotion Jigsaw.- EmotionAnalysis in Man-Machine Interaction Systems.- A Hierarchical System for Recognition, Tracking and Pose Estimation.- Automatic Pedestrian Tracking Using Discrete Choice Models and Image Correlation Techniques.- A Shape Based, Viewpoint Invariant Local Descriptor.ReviewsAuthor InformationTab Content 6Author Website:Countries AvailableAll regions |