Back to Results
First PageMeta Content
Hadoop / Cloud infrastructure / Cloud computing / Structured storage / Apache Hadoop / Emerging technologies / MapReduce / Cascading / Apache Cassandra / Computing / Concurrent computing / Data management


Summingbird: A Framework for Integrating Batch and Online MapReduce Computations Oscar Boykin, Sam Ritchie, Ian O’Connell, and Jimmy Lin Twitter, Inc. San Francisco, California @posco @sritchie @0x138 @lintool
Add to Reading List

Document Date: 2014-08-04 14:41:22


Open Document

File Size: 363,19 KB

Share Result on Facebook

City

San Francisco / Hadoop / Hangzhou / /

Company

Twitter / Section 3 / Jimmy Lin Twitter Inc. / IBM / Amazon / Creative Commons / MySQL / /

Country

China / /

/

Event

Natural Disaster / /

Facility

store Cassandra / /

IndustryTerm

analytical processing / internet scale / online execution framework / online results / comparable analytical tool / stream processing middleware / online production setting / probabilistic algorithm / distributed processing / online processing / distributed online processing / data stream processing / online system / batch processing / real time / data management applications / online processing pipeline / later processing / Online/Batch Processing Summingbird / online computations / open-source software / hybrid processing / high-throughput batch processing / query-suggestion algorithm / algebraic / disparate online analytics systems / social media / scale-out distributed processing systems / online processing capabilities / stream processing engines / continuous bulk processing / streaming algorithms / batch algorithm / trivial solution / online results coverage / online training / open-source stream processing framework / hybrid online/batch processing / real-time web / online partial results / online learner / online processing framework / data processing framework / online analytics needs / online retailing / analytics infrastructure / stream processing / Online MapReduce Computations Oscar Boykin / batch/online hybrids / online analytics processing / analytics solutions / online case / online learning / online dashboards / batch processing delays / data mining / hybrid processing model / downstream systems / imperfect solution / batch processing case / online setting / Batch processing frameworks / online store / online analytics / Online data processing frameworks / online execution / online environment / incremental batch processing / obvious solution / near-optimal cardinality estimation algorithm / data management / Online Prototype / tolerant systems / online processing case / na¨ıve exact solution / search queries / offline data products / online analytics capabilities / data processing systems / online mode / declarative stream processing engine / hybrid processing mode / analytical processing tasks / queries involving products / data scientists with powerful tools / /

Organization

Technical Committee on Data Engineering / VLDB Endowment / /

Person

Brian Wallerstein / R. Ananthanarayanan / V / Alex Roetter / Min Sketches / Wen-Hao Lue / Sam Ritchie / Joe Nievelt / Bill Darrow / Oscar Boykin / Dmitriy Ryaboy / /

Position

data scientist / producer / algorithm designer / /

Product

Summingbird / Storm platform / Storm / /

ProgrammingLanguage

Java / SQL / Scala / /

ProvinceOrState

Hadoop-based / /

PublishedMedium

Communications of the ACM / /

Technology

functional programming / data warehouse / API / batch algorithm / cardinality estimation algorithm / Java / sliding windows / machine learning / DSL / data mining / query-suggestion algorithm / /

URL

http /

SocialTag