Overview

The universal adoption of the Internet and the WWW has created an enormous, multilingual virtual textual database. Rather than looking upon foreign-language documents as distracting noise, one can consider these documents as untapped sources of information. This book addresses the problem of accessing multilingual information through a single-language query, a research problem that is receiving growing attention from US and foreign governments. It describes the problem, highlighting the differences between this field and the related areas of machine translation and information retrieval. Researchers from Europe, Japan and America present a wide variety of techniques and experimental results. The life-size experiments are run on modern large-scale retrieval testbeds containing up to hundreds of megabytes of text. The techniques involve bilingual dictionaries, machine translation systems, parallel text corpora, comparable but non-parallel text corpora, latent semantic indexing, and weighted Boolean interrogation. This text should be suitable as a secondary text for a graduate-level course on cross-language information retrieval, and as a reference for researchers and practitioners in industry.

Full Product Details

Author: Gregory Grefenstette
Publisher: Springer
Imprint: Springer
Edition: 1998 ed.
Volume: 2
Dimensions: Width: 15.50cm, Height: 1.20cm, Length: 23.50cm
Weight: 1.030kg
ISBN: 9780792381228
ISBN 10: 079238122
Pages: 182
Publication Date: 31 March 1998
Audience: College/higher education, Professional and scholarly, Postgraduate, Research & Scholarly, Professional & Vocational
Format: Hardback
Publisher's Status: Active
Availability: In Print
Table of Contents

1 The Problem of Cross-Language Information Retrieval
  1 Introduction
  2 The Three Problems of CLIR
  3 Finding Translations
  4 Pruning Translation Alternatives
  5 Weighting Translation Alternatives
  6 Conclusion

2 On The Effective use of Large Parallel Corpora in Cross-Language Text Retrieval
  1 Introduction
  2 Recuerdo: A Spanish Retrieval Engine
  3 Collins Bilingual English-Spanish Dictionary
  4 The UN Parallel Corpus
  5 The Disambiguation Engine
  6 Monolingual and Cross-Language Retrieval Results
  7 The Role of Disambiguation
  8 Further Improving Disambiguation Performance
  9 Conclusions

3 Statistical Methods for Cross-Language Information Retrieval
  1 Introduction
  2 Previous Work
  3 Dictionary Translation Using Expansion
  4 Experiments
  5 Conclusions and Future Work

4 Distributed Cross-Lingual Information Retrieval
  1 The EMIR project approach
  2 The distributed multilingual client-server architecture
  3 Experimentation on a library catalog
  4 Conclusion

5 Automatic Cross-Language Information Retrieval Using Latent Semantic Indexing
  1 Introduction
  2 Overview of Latent Semantic Indexing (LSI)
  3 Cross-Language Retrieval Using LSI
  4 Conclusion

6 Mapping Vocabularies Using Latent Semantics
  1 The Problem: Managing Semantics in Medical Vocabularies
  2 The Basis of a Solution: Latent Semantic Indexing
  3 The Hypothesis: LSI Applied to Terms
  4 Experiments: LSI Mappings of Medical ‘Term Space’
  5 Conclusion: Developing General, Robust Methods for Medical Semantics

7 Cross-Language Information Retrieval: A System for Comparable Corpus Querying
  1 Introduction
  2 Corpus Linguistics and Bilingual Reference Corpora
  3 Procedures for Comparable Corpus Processing
  4 First Results
  5 Application to Cross Language Information Retrieval

8 A Language Conversion Front-End for Cross-Language Information Retrieval
  1 Introduction
  2 Problems in Translation Equivalent Selection
  3 Description of the Method
  4 Implementation
  5 Experimental Results
  6 Discussion
  7 Conclusion

9 The Systran NLP Browser: An Application of Machine Translation Technology in Cross-Language Information Retrieval
  1 Introduction
  2 SYSTRAN NLP Browser
  3 Discussion
  4 Conclusion

10 A Weighted Boolean Model for Cross-Language Text Retrieval
  1 Introduction
  2 Boolean vs. Vector Space Retrieval
  3 Background: Extended Boolean Models
  4 A New Weighted Boolean Model
  5 Background: Cross-Language Text Retrieval
  6 Testing the Weighted Boolean Model
  7 Commentary

11 Building a Large Multilingual Test Collection from Comparable News Documents
  1 Introduction
  2 Evaluation using Seed Documents
  3 The ETH Multilingual Test Collection
  4 Summary and Conclusions

12 Evaluating Cross-Language Text Filtering Effectiveness
  1 Introduction
  2 Adaptive Multilingual Text Filtering
  3 Ideal Experiment Design
  4 Use of Available Corpora
  5 Results
  6 Conclusions

References

Countries Available: All regions