Spark Infrastructure

Results: 136



#Item
1Computing / Data / Hadoop / Apache Software Foundation / Cask / Teradata / Data management / Cloud infrastructure / Big data / Apache Hadoop / Apache Spark / Extract /  transform /  load

Cask Data Application Platform (CDAP) Extensions CDAP Extensions provide additional capabilities and user interfaces to CDAP. They are use-case specific applications designed to solve common and critical big data challe

Add to Reading List

Source URL: customers.cask.co

Language: English - Date: 2016-08-02 06:10:32
2Computing / Hadoop / Apache Software Foundation / Cloud infrastructure / Java platform / Inter-process communication / Apache Hadoop / Apache Spark / MapR FS / MapR / Pipeline / Franz Kafka

    StreamSets Data CollectorRelease Notes August 4, 2016

Add to Reading List

Source URL: streamsets.com

Language: English - Date: 2016-08-04 21:52:48
3Computing / Concurrent computing / Parallel computing / Hadoop / Distributed computing architecture / Cloud infrastructure / Apache Software Foundation / MapReduce / Apache Spark / MapR / Data-intensive computing / Apache Hadoop

Large-Scale Numerical Computation Using a Data Flow Engine Matei Zaharia Outline

Add to Reading List

Source URL: mmds-data.org

Language: English - Date: 2014-06-24 03:07:59
4Computing / Apache Software Foundation / Cloud infrastructure / NoSQL / Parallel computing / Structured storage / Apache Cassandra / Garbage collection / Apache Spark / Apache Hadoop / Computer cluster / Distributed computing

Trash Day: Coordinating Garbage Collection in Distributed Systems Martin Maas? † ∗ Tim Harris† Krste Asanovi´c? John Kubiatowicz? † Oracle Labs, Cambridge

Add to Reading List

Source URL: www.usenix.org

Language: English - Date: 2016-02-21 01:58:31
5Computing / Hadoop / Apache Software Foundation / Cloud infrastructure / Parallel computing / Distributed computing architecture / Apache Hadoop / MapReduce / Suffix array / Apache Spark / Sanders / Big data

Massive Suffix Array Construction with Thrill Michael Axtmann, Timo Bingmann, Peter Sanders, Sebastian Schlag, and 6 Students | @ SPP 1736 I NSTITUTE OF

Add to Reading List

Source URL: panthema.net

Language: English - Date: 2015-10-13 13:22:55
6Parallel computing / Apache Software Foundation / Cloud infrastructure / Hadoop / Apache Spark / Cluster computing / Apache Hadoop / MapReduce / Scheduling / Data-intensive computing / Computer cluster / Fold

GoSpark: An In-Memory Distributed Computation Platform in Go Kuan-Ting Yu CSAIL MIT

Add to Reading List

Source URL: css.csail.mit.edu

Language: English - Date: 2014-12-08 14:33:02
7Hadoop / Cloud infrastructure / Apache Software Foundation / Cask / Apache Hadoop / Apache HBase / MapReduce / Big data / Cloudera / Hortonworks / Apache Spark / Microsoft Azure

Cask Data Application Platform (CDAP) CDAP is an open source, Apache 2.0 licensed, distributed, application framework for delivering Hadoop solutions. It integrates and abstracts the underlying Hadoop technologies to pr

Add to Reading List

Source URL: customers.cask.co

Language: English - Date: 2016-07-31 17:55:06
8Java platform / Apache Software Foundation / Cloud infrastructure / Parallel computing / Distributed computing architecture / Akka / Apache Hadoop / Actor model / Storm / Apache Spark / Scala / Transmission Control Protocol

  GearPump  –  Real-­‐time  Streaming  Engine   Using  Akka*       Sean  Zhong,  Kam  Kasravi,  Huafeng  Wang,  Manu  Zhang,  Weihua  Jiang  

Add to Reading List

Source URL: downloads.typesafe.com

Language: English - Date: 2014-12-15 11:49:41
9Hadoop / Cloud infrastructure / Apache Software Foundation / Parallel computing / Apache Hadoop / Apache HBase / Data-intensive computing / Apache Spark / MapR / GraphLab / Big data / Solid-state drive

Big Data Meets HPC – Exploiting HPC Technologies for Accelerating Big Data Processing Keynote Talk at HPCAC-Switzerland (Marby Dhabaleswar K. (DK) Panda

Add to Reading List

Source URL: www.hpcadvisorycouncil.com

Language: English - Date: 2016-04-06 01:18:06
10Apache Software Foundation / Cloud infrastructure / Hadoop / Data management / Cluster computing / Apache Spark / Apache Hadoop / Apache Flink / OpenStack / NoSQL / Apache Cassandra / Big data

NSF14start October 1, 2014 Datanet: CIF21 DIBBs: Middleware and High Performance Analytics Libraries for Scalable Data Science • 

Add to Reading List

Source URL: www.nationaldataservice.org

Language: English - Date: 2016-04-06 11:00:39
UPDATE