<--- Back to Details
First PageDocument Content
Linguistics / Computational linguistics / Statistical natural language processing / Natural language processing / Corpus linguistics / Applied linguistics / Speech recognition / Topic model / Latent Dirichlet allocation / N-gram / Stemming / Text corpus
Date: 2017-07-19 14:45:03
Linguistics
Computational linguistics
Statistical natural language processing
Natural language processing
Corpus linguistics
Applied linguistics
Speech recognition
Topic model
Latent Dirichlet allocation
N-gram
Stemming
Text corpus

Understanding Text Pre-Processing for Latent Dirichlet Allocation Alexandra Schofield1 M˚ans Magnusson2 Laure Thompson1 David Mimno3 1 Department of Computer Science, Cornell University, Ithaca, NY {xanda, laurejt}@cs.c

Add to Reading List

Source URL: www.cs.cornell.edu

Download Document from Source Website

File Size: 94,08 KB

Share Document on Facebook

Similar Documents

Cognitive Linguistics, Corpus Linguistics, ChildDirected Speech and Developmental Robots Kerstin Fischer University of Southern Denmark Kilian Foth University of Hamburg

Cognitive Linguistics, Corpus Linguistics, ChildDirected Speech and Developmental Robots Kerstin Fischer University of Southern Denmark Kilian Foth University of Hamburg

DocID: 1vaXS - View Document

The ACL Anthology Reference Corpus: A Reference Dataset for Bibliographic Research in Computational Linguistics Steven Bird1 , Robert Dale2 , Bonnie J. Dorr3 , Bryan Gibson4 , Mark T. Joseph4 , Min-Yen Kan5† , Dongwon

The ACL Anthology Reference Corpus: A Reference Dataset for Bibliographic Research in Computational Linguistics Steven Bird1 , Robert Dale2 , Bonnie J. Dorr3 , Bryan Gibson4 , Mark T. Joseph4 , Min-Yen Kan5† , Dongwon

DocID: 1v63z - View Document

Building the Uppsala Hindi Corpus  Anju Saxena, Pranava Swaroop Madhyasta and Joakim Nivre Uppsala University, Department of Linguistics and Philology {anju.saxena,joakim.nivre}@lingfil.uu.se

Building the Uppsala Hindi Corpus Anju Saxena, Pranava Swaroop Madhyasta and Joakim Nivre Uppsala University, Department of Linguistics and Philology {anju.saxena,joakim.nivre}@lingfil.uu.se

DocID: 1uLAh - View Document

Predicting Second Language Learner Successes and Mistakes by Means of Conjunctive Features Yves Bestgen Centre for English Corpus Linguistics Universit´e catholique de Louvain Place Cardinal Mercier, Louvain-la-

Predicting Second Language Learner Successes and Mistakes by Means of Conjunctive Features Yves Bestgen Centre for English Corpus Linguistics Universit´e catholique de Louvain Place Cardinal Mercier, Louvain-la-

DocID: 1uJLl - View Document

Panel on Corpus Linguistics and Information Retrieval Robert Krovetz Computer Science Department University of Massachusetts, Amherst, MACorpus Linguistics is becoming increasingly important. The most recent confe

Panel on Corpus Linguistics and Information Retrieval Robert Krovetz Computer Science Department University of Massachusetts, Amherst, MACorpus Linguistics is becoming increasingly important. The most recent confe

DocID: 1uHyP - View Document