Spark Infrastructure - PDFSEARCH.IO - Document Search Engine

Spark Infrastructure
Results: 136

#	Item
1	Cask Data Application Platform (CDAP) Extensions CDAP Extensions provide additional capabilities and user interfaces to CDAP. They are use-case specific applications designed to solve common and critical big data challe Add to Reading List Source URL: customers.cask.co Language: English - Date: 2016-08-02 06:10:32 Computing Data Hadoop Apache Software Foundation Cask Teradata Data management Cloud infrastructure Big data Apache Hadoop Apache Spark Extract transform load
2	StreamSets Data CollectorRelease Notes August 4, 2016 Add to Reading List Source URL: streamsets.com Language: English - Date: 2016-08-04 21:52:48 Computing Hadoop Apache Software Foundation Cloud infrastructure Java platform Inter-process communication Apache Hadoop Apache Spark MapR FS MapR Pipeline Franz Kafka
3	Large-Scale Numerical Computation Using a Data Flow Engine Matei Zaharia Outline Add to Reading List Source URL: mmds-data.org Language: English - Date: 2014-06-24 03:07:59 Computing Concurrent computing Parallel computing Hadoop Distributed computing architecture Cloud infrastructure Apache Software Foundation MapReduce Apache Spark MapR Data-intensive computing Apache Hadoop
4	Trash Day: Coordinating Garbage Collection in Distributed Systems Martin Maas? † ∗ Tim Harris† Krste Asanovi´c? John Kubiatowicz? † Oracle Labs, Cambridge Add to Reading List Source URL: www.usenix.org Language: English - Date: 2016-02-21 01:58:31 Computing Apache Software Foundation Cloud infrastructure NoSQL Parallel computing Structured storage Apache Cassandra Garbage collection Apache Spark Apache Hadoop Computer cluster Distributed computing
5	Massive Suffix Array Construction with Thrill Michael Axtmann, Timo Bingmann, Peter Sanders, Sebastian Schlag, and 6 Students \| @ SPP 1736 I NSTITUTE OF Add to Reading List Source URL: panthema.net Language: English - Date: 2015-10-13 13:22:55 Computing Hadoop Apache Software Foundation Cloud infrastructure Parallel computing Distributed computing architecture Apache Hadoop MapReduce Suffix array Apache Spark Sanders Big data
6	GoSpark: An In-Memory Distributed Computation Platform in Go Kuan-Ting Yu CSAIL MIT Add to Reading List Source URL: css.csail.mit.edu Language: English - Date: 2014-12-08 14:33:02 Parallel computing Apache Software Foundation Cloud infrastructure Hadoop Apache Spark Cluster computing Apache Hadoop MapReduce Scheduling Data-intensive computing Computer cluster Fold
7	Cask Data Application Platform (CDAP) CDAP is an open source, Apache 2.0 licensed, distributed, application framework for delivering Hadoop solutions. It integrates and abstracts the underlying Hadoop technologies to pr Add to Reading List Source URL: customers.cask.co Language: English - Date: 2016-07-31 17:55:06 Hadoop Cloud infrastructure Apache Software Foundation Cask Apache Hadoop Apache HBase MapReduce Big data Cloudera Hortonworks Apache Spark Microsoft Azure
8	GearPump – Real-‐time Streaming Engine Using Akka* Sean Zhong, Kam Kasravi, Huafeng Wang, Manu Zhang, Weihua Jiang Add to Reading List Source URL: downloads.typesafe.com Language: English - Date: 2014-12-15 11:49:41 Java platform Apache Software Foundation Cloud infrastructure Parallel computing Distributed computing architecture Akka Apache Hadoop Actor model Storm Apache Spark Scala Transmission Control Protocol
9	Big Data Meets HPC – Exploiting HPC Technologies for Accelerating Big Data Processing Keynote Talk at HPCAC-Switzerland (Marby Dhabaleswar K. (DK) Panda Add to Reading List Source URL: www.hpcadvisorycouncil.com Language: English - Date: 2016-04-06 01:18:06 Hadoop Cloud infrastructure Apache Software Foundation Parallel computing Apache Hadoop Apache HBase Data-intensive computing Apache Spark MapR GraphLab Big data Solid-state drive
10	NSF14start October 1, 2014 Datanet: CIF21 DIBBs: Middleware and High Performance Analytics Libraries for Scalable Data Science •  Add to Reading List Source URL: www.nationaldataservice.org Language: English - Date: 2016-04-06 11:00:39 Apache Software Foundation Cloud infrastructure Hadoop Data management Cluster computing Apache Spark Apache Hadoop Apache Flink OpenStack NoSQL Apache Cassandra Big data

UPDATE