Applied linguistics / Discourse analysis / Semantics / Text corpus / Word-sense disambiguation / International Corpus of English / Perplexity / Natural language processing / Linguistics / Computational linguistics / Corpus linguistics


langid.py for better language modelling
Paul Cook♥ and Marco Lui♥♣
♥ Department of Computing and Information Systems, The University of Melbourne, Victoria 3010, Australia
♣ NICTA Victoria Research Laboratory

Document Date: 2012-11-29 20:03:05


File Size: 1,04 MB

City

Trento / Istanbul / Las Vegas / Marrakech / Lyon / Kampala / Denver / Jeju / Dublin / Chiang Mai / Valletta / Ngram / SRILM / Prague / Bologna / Lisbon / Sebastopol / Toulouse / Edinburgh / /

Company

O’Reilly Media Inc. / NICTA Victoria Research Laboratory / PPL / Google / Stanford Natural Language Processing Group / Vinci / /

Country

Thailand / France / Uganda / Australia / Portugal / Malta / Scotland / Italy / Turkey / United States / Morocco / South Korea / Ireland / Czech Republic / /

Facility

Masaryk University / Information Systems The University of Melbourne Victoria / Brown University / /

IndustryTerm

Web corpus / Web text corpus / larger Web corpora / language technology / body text extraction algorithm / Web Track / automated search engine queries / Web corpus construction methods / large Web corpora / Web crawl / readily-available tools / document post-processing / post-processing / efficient Web crawling / compare Web corpora / Web corpus construction projects / Web corpus construction / Web corpora / corpora using tools / larger Web crawl / statistical language technology systems / readily-available Web crawl / language identification tool / minimally-supervised Web-corpus / Web crawl consisting / natural language processing / language identification tools / search engine / large Web crawls / /

Organization

University of Melbourne Victoria / ICT Centre of Excellence / Australian Government / Department of Computing / European parliament / Oxford University / Department of Broadband / Communications and the Digital Economy / Brown University / European Chapter / Australian Research Council / Masaryk University / Association for Computational Linguistics / /

Person

Amit Dubey / Nick Craswell / Edward Loper / Eric Brill / Marco Lui / Ellen M. Voorhees / Henry Kucera / James Curran / Marco Baroni / Brian Murphy / Paul Cook / Nicholas Kushmerick / Serge Sharoff / Michele Banko / Barry Smyth / William B. Cavnar / T. Sue Atkins / Adriano Ferraresi / Beatrice Alex / Ewan Klein / Ian Soboroff / Yannick Versley / Charles L. A. Clarke / W. Nelson Francis / Egon Stemle / Aidan Finn / Pavel Rychlý / Yana Panchenko / Wacky / Silvia Bernardini / Liu / Jan Pomikálek / Adam Kilgarriff / Eros Zanchetta / John M. Trenkle / Francis Chantree / Steven Bird / Lou Burnard / Timothy Baldwin / Vit Suchomel / Andreas Stolcke / Patrick Hanks / Frank Keller / /

Position

editor / /

ProgrammingLanguage

php / Python / /

PublishedMedium

Computational Linguistics / /

Technology

body text extraction algorithm / HTML / language technology / Broadband / php / search engine / Natural Language Processing / /
