<--- Back to Details
First PageDocument Content
Linguistics / Computational linguistics / Statistical natural language processing / Natural language processing / Corpus linguistics / Applied linguistics / Speech recognition / Topic model / Latent Dirichlet allocation / N-gram / Stemming / Text corpus
Date: 2017-07-19 14:45:03
Linguistics
Computational linguistics
Statistical natural language processing
Natural language processing
Corpus linguistics
Applied linguistics
Speech recognition
Topic model
Latent Dirichlet allocation
N-gram
Stemming
Text corpus

Understanding Text Pre-Processing for Latent Dirichlet Allocation Alexandra Schofield1 M˚ans Magnusson2 Laure Thompson1 David Mimno3 1 Department of Computer Science, Cornell University, Ithaca, NY {xanda, laurejt}@cs.c

Add to Reading List

Source URL: www.cs.cornell.edu

Download Document from Source Website

File Size: 94,08 KB

Share Document on Facebook

Similar Documents

Multi-source annotation projection of coreference chains: assessing strategies and testing opportunities Yulia Grishina and Manfred Stede Applied Computational Linguistics FSP Cognitive Science University of Potsdam

Multi-source annotation projection of coreference chains: assessing strategies and testing opportunities Yulia Grishina and Manfred Stede Applied Computational Linguistics FSP Cognitive Science University of Potsdam

DocID: 1vrep - View Document

Modelling context within a constraint-based account of quantifier usage Chris Cummins1, 2 and Napoleon Katsos1 1  Department of Theoretical and Applied Linguistics, University of Cambridge

Modelling context within a constraint-based account of quantifier usage Chris Cummins1, 2 and Napoleon Katsos1 1 Department of Theoretical and Applied Linguistics, University of Cambridge

DocID: 1uKOl - View Document

Surfaces and Depths in Text Understanding: The Case of Newspaper Commentary Manfred Stede University of Potsdam Dept. of Linguistics Applied Computational Linguistics

Surfaces and Depths in Text Understanding: The Case of Newspaper Commentary Manfred Stede University of Potsdam Dept. of Linguistics Applied Computational Linguistics

DocID: 1uATj - View Document

Medical-domain Machine Translation in KConnect  Pavel Pecina Charles University, Prague Faculty of Mathematics and Physics Institute of Formal and Applied Linguistics

Medical-domain Machine Translation in KConnect Pavel Pecina Charles University, Prague Faculty of Mathematics and Physics Institute of Formal and Applied Linguistics

DocID: 1uxga - View Document

Scrambled Word Recognition: Implications for Position Coding Chris Cummins Research Centre for English and Applied Linguistics (RCEAL) University of Cambridge Position coding:

Scrambled Word Recognition: Implications for Position Coding Chris Cummins Research Centre for English and Applied Linguistics (RCEAL) University of Cambridge Position coding:

DocID: 1u6Ph - View Document