Results 1 -
3 of
3
Large-Scale Social-Media Analytics on Stratosphere
"... The importance of social-media platforms and online communities – in business as well as public context – is more and more acknowledged and appreciated by industry and researchers alike. Consequently, a wide range of analytics has been proposed to understand, steer, and exploit the mechanics and law ..."
Abstract
-
Cited by 3 (0 self)
- Add to MetaCart
(Show Context)
The importance of social-media platforms and online communities – in business as well as public context – is more and more acknowledged and appreciated by industry and researchers alike. Consequently, a wide range of analytics has been proposed to understand, steer, and exploit the mechanics and laws driving their functionality and creating the resulting benefits. However, analysts usually face significant problems in scaling existing and novel approaches to match the data volume and size of modern online communities. In this work, we propose and demonstrate the usage of the massively parallel data processing system Stratosphere, based on second order functions as an extended notion of the MapReduce paradigm, to provide a new level of scalability to such social-media analytics. Based on the popular example of role analysis, we present and illustrate how this massively parallel approach can be leveraged to scale out complex data-mining tasks, while providing a programming approach that eases the formulation of complete analytical workflows.
Large-Scale Social-Media Analytics on Stratosphere
"... and other research outputs Large-scale social-media analytics on stratosphere ..."
Abstract
- Add to MetaCart
(Show Context)
and other research outputs Large-scale social-media analytics on stratosphere
under grant FOR 1306.
"... Abstract We present Stratosphere, an open-source soft-ware stack for parallel data analysis. Stratosphere brings together a unique set of features that allow the expressive, easy, and efficient programming of analytical applications at very large scale. Stratosphere’s features include “in situ” data ..."
Abstract
- Add to MetaCart
(Show Context)
Abstract We present Stratosphere, an open-source soft-ware stack for parallel data analysis. Stratosphere brings together a unique set of features that allow the expressive, easy, and efficient programming of analytical applications at very large scale. Stratosphere’s features include “in situ” data processing, a declarative query language, treatment of user-defined functions as first-class citizens, automatic pro-Stratosphere is funded by the German Research Foundation (DFG)