|
|
|||
|
||||
OverviewLinguists and engineers in natural language processing tend to use electronic corpora more and more. Most research has long been limited to raw (unannotated) texts or to tagged texts (annotated with parts of speech only), but these approaches suffer from a word by word perspective. A new line of research involves corpora with richer annotations such as clauses and major constituents, grammatical functions and dependency links. The first parsed corpora were the English Lancaster treebank and Penn treebank. New ones have recently been developed for other languages. This text provides an update on work being done with parsed corpora. It presents 21 papers on building and using parsed corpora raising many relevant questions, and deals with a variety of languages and a variety of corpora. It is intended for those working in linguistics, computational linguistics, natural language, syntax, and grammar. Full Product DetailsAuthor: A. AbeilléPublisher: Springer-Verlag New York Inc. Imprint: Springer-Verlag New York Inc. Edition: Softcover reprint of the original 1st ed. 2003 Volume: 20 Dimensions: Width: 15.60cm , Height: 2.20cm , Length: 23.40cm Weight: 1.350kg ISBN: 9781402013355ISBN 10: 1402013353 Pages: 407 Publication Date: 30 September 2003 Audience: College/higher education , Professional and scholarly , Postgraduate, Research & Scholarly , Professional & Vocational Format: Paperback Publisher's Status: Active Availability: In Print This item will be ordered in for you from one of our suppliers. Upon receipt, we will promptly dispatch it out to you. For in store availability, please contact us. Table of Contents1 The Penn Treebank: An Overview.- 2 Thoughts on Two Decades of Drawing Trees.- 3 Bank of English and Beyond.- 4 Completing Parsed Corpora from Correction to Evolution.- 5 Syntactic Annotation of a German Newspaper Corpus.- 6 Annotation of Error Types for a German Newsgroup Corpus.- 7 The PDT: A 3-level Annotation Scenario.- 8 An HPSG-Annotated Test Suite for Polish.- 9 Developing a Spanish Treebank.- 10 Building a Treebank for French.- 11 Building the Italian Syntactic-Semantic Treebank.- 12 Automated Creation of a Medieval Portuguese Treebank.- 13 Sinica Treebank.- 14 Building a Japanese Parsed Corpus.- 15 Building a Turkish Treebank.- 16 Encoding Syntactic Annotation.- 17 Parser Evaluation.- 18 Dependency-based Evaluation of Minipar.- 19 Extracting Stochastic Grammars from Treebanks.- 20 Stochastic Lexicalized Tree Grammars.- 21 From Treebank Resources To LFG f-Structures.- Contributing Authors.ReviewsFrom the reviews: <p> Anne AbeillA(c) draws together a collection of fifteen short pieces focused primarily on the issues that come up in creating treebanks, demonstrated across an impressive variety of languages, along with six chapters on how treebanks are used. a ] For computational linguists working on automatic parsing, a pass through this book should be required a ] . The reader a ] will be rewarded with a clear sense of the challenge and the promise of systematically applying theoretically motivated linguistic representations to a ~language in the largea (TM). (Philip Resnik, Language, Vol. 83 (4), 2007) From the reviews: Anne Abeille draws together a collection of fifteen short pieces focused primarily on the issues that come up in creating treebanks, demonstrated across an impressive variety of languages, along with six chapters on how treebanks are used. ... For computational linguists working on automatic parsing, a pass through this book should be required ... . The reader ... will be rewarded with a clear sense of the challenge and the promise of systematically applying theoretically motivated linguistic representations to `language in the large'. (Philip Resnik, Language, Vol. 83 (4), 2007) Author InformationTab Content 6Author Website:Countries AvailableAll regions |