Back to Results
First PageMeta Content
Corpus linguistics / Computational linguistics / Online databases / Text corpus / International Corpus of English / Bank of English / Parallel text / Natural language processing / British National Corpus / Linguistics / Applied linguistics / Corpora


Balancing SoNaR: IPR versus Processing Issues in a 500-Million-Word Written Dutch Reference Corpus Martin Reynaert1 , Nelleke Oostdijk2 , Orph´ee De Clercq3 , Henk van den Heuvel2 , Franciska de Jong4 ILK, Tilburg Unive
Add to Reading List

Document Date: 2010-06-11 04:03:06


Open Document

File Size: 308,46 KB

Share Result on Facebook

City

Lancaster / Valetta / Oslo / Valletta / Morristown / Marrakech / Berlin / /

Company

COCA / Twitter / Bank of English / MW Press / Bank of English2 / Sixth International Language Resources / NRC Handelsblad / /

Country

Netherlands / Belgium / United States / Malta / Morocco / /

Event

FDA Phase / Product Issues / Business Partnership / /

Facility

University College Ghent/Ghent University3 / University of Twente4 Abstract In The Low Countries / building There / building When / /

IndustryTerm

media texts / media text types / web-crawled corpora / database systems / web corpora / web-based approach / copyright law / web harvesting approach / web harvesting approaches / web-as-corpus approach / end product / language technology / corpus-based natural language processing / wacky wide web / cleaner web corpora / web download / citation law / data processing / internet fora / web-as-corpus corpora / web-as-corpus / internet forum / web harvesters / Internet Archive / online database / language technology developments / Flemish internet forum text / web-as-corpus setting / /

OperatingSystem

Unix / /

Organization

European Language Resources Association / Dutch Human Language Technology Agency / European Chapter / Institute for Dutch Lexicology / Dutch Language Union / The Dutch-Flemish / Nuclear Regulatory Commission / Dutch HTL Agency / HLT Agency / Association for Computational Linguistics / /

Person

M. Reynaert / I. Schuurman / V / V. Vandeghinste / V. Hoste / R. Ordelman / I. Schuurman / P. Monachesi / G. Van Noord / /

Position

editor / researcher / /

Product

2.5 MW Advantages Greater availability / IPR / /

ProgrammingLanguage

XML / php / /

PublishedMedium

Computational Linguistics / the Computational Linguistics / the NRC Handelsblad / /

Technology

XML / SMS / Unix / Optical Character Recognition / html / OCR / natural language processing / language technology / /

URL

http /

SocialTag