Pairwise Document Similarity in Large Collections with MapReduce Tamer Elsayed,∗ Jimmy Lin,† and Douglas W. Oard† Human Language Technology Center of Excellence and UMIACS Laboratory for Computational Linguistics a - Document processing - Document - PDFSEARCH.IO - Document Search Engine

Back to Results

First Page	Meta Content
	Pairwise Document Similarity in Large Collections with MapReduce Tamer Elsayed,∗ Jimmy Lin,† and Douglas W. Oard† Human Language Technology Center of Excellence and UMIACS Laboratory for Computational Linguistics a Add to Reading List Document Date: 2008-06-22 13:17:18 Open Document File Size: 177,58 KB Share Result on Facebook City Columbus / / Company UMIACS Laboratory / / Country United States / / / Facility College of Information Studies / Information Processing University of Maryland / College Park / / IndustryTerm data processing / inner product / similarity algorithm / inner products / pairs similarity search / final inner product / attractive solution / Near-optimal hashing algorithms / parallel processing architectures / / Organization National Institute of Health / Cornell / University of Maryland / College Park / College of Information Studies / Association for Computational Linguistics / / Person Douglas W. Oard / Jimmy Lin / / Position head / Dean / programmer / / Product Hadoop version 0.16.0 / Hadoop 0.16.0 / / ProgrammingLanguage Java / / ProvinceOrState Ohio / / PublishedMedium Computational Linguistics / / Technology functional programming / load balancing / Java / map input input map Algorithm / search engine / similarity algorithm / MapReduce algorithm / parallel processing / / URL http / SocialTag Information retrieval Distributed computing architecture MapReduce Parallel computing Natural language processing Apache Hadoop Search engine indexing Inverted index Information science Computing