Back to Results
First PageMeta Content
Information retrieval / Distributed computing architecture / MapReduce / Parallel computing / Natural language processing / Apache Hadoop / Search engine indexing / Inverted index / Information science / Computing / Concurrent computing


Pairwise Document Similarity in Large Collections with MapReduce Tamer Elsayed,∗ Jimmy Lin,† and Douglas W. Oard† Human Language Technology Center of Excellence and UMIACS Laboratory for Computational Linguistics a
Add to Reading List

Document Date: 2008-06-22 13:17:18


Open Document

File Size: 177,58 KB

Share Result on Facebook

City

Columbus / /

Company

UMIACS Laboratory / /

Country

United States / /

/

Facility

College of Information Studies / Information Processing University of Maryland / College Park / /

IndustryTerm

data processing / inner product / similarity algorithm / inner products / pairs similarity search / final inner product / attractive solution / Near-optimal hashing algorithms / parallel processing architectures / /

Organization

National Institute of Health / Cornell / University of Maryland / College Park / College of Information Studies / Association for Computational Linguistics / /

Person

Douglas W. Oard / Jimmy Lin / /

Position

head / Dean / programmer / /

Product

Hadoop version 0.16.0 / Hadoop 0.16.0 / /

ProgrammingLanguage

Java / /

ProvinceOrState

Ohio / /

PublishedMedium

Computational Linguistics / /

Technology

functional programming / load balancing / Java / map input input map Algorithm / search engine / similarity algorithm / MapReduce algorithm / parallel processing / /

URL

http /

SocialTag