Computing / Concurrent computing / Parallel computing / Workflow technology / Distributed computing architecture / Apache Software Foundation / Cloud infrastructure / MapReduce / Apache Hadoop / Record linkage / Workflow / Data cleansing
Date: 2012-08-30 08:45:36

Dedoop: Efficient Deduplication with Hadoop
Lars Kolb, Andreas Thor, Erhard Rahm

Source URL: dbs.uni-leipzig.de

File Size: 1.05 MB

Similar Documents

CVA Memo #138 Sikker: A High-Performance Distributed System Architecture for Secure Service-Oriented Computing Nicholas McDonald and William J. Dally

DocID: 1uuaH - View Document

CVA Memo #137 Sikker: A Distributed System Architecture for Secure High Performance Computing Nicholas McDonald

DocID: 1uivM - View Document

Hudson Execution and Scheduling Architecture

DocID: 1rtYD - View Document

Microsoft Word - HotPower_camera_ready_ACM_format

DocID: 1rtKl - View Document

SplitStream: High-bandwidth content distribution in cooperative environments. Miguel Castro, Peter Druschel, Antony Rowstron

DocID: 1rry7 - View Document