• Documents
  • Authors
  • Tables
  • Log in
  • Sign up
  • MetaCart
  • DMCA
  • Donate

CiteSeerX logo

Advanced Search Include Citations
Advanced Search Include Citations

Generalized low rank models (2015)

by Madeleine Udell
Add To MetaCart

Tools

Sorted by:
Results 1 - 1 of 1

Factorbird — a parameter server approach to distributed matrix factorization

by Sebastian Schelter, Technische Universität Berlin, Venu Satuluri, Reza Bosagh Zadeh - NIPS 2014 Workshop on Distributed Machine Learning and Matrix Computations , 2014
"... We present ‘Factorbird’, a prototype of a parameter server approach for factor-izing large matrices with Stochastic Gradient Descent-based algorithms. We de-signed Factorbird to meet the following desiderata: (a) scalability to tall and wide matrices with dozens of billions of non-zeros, (b) extensi ..."
Abstract - Cited by 1 (1 self) - Add to MetaCart
We present ‘Factorbird’, a prototype of a parameter server approach for factor-izing large matrices with Stochastic Gradient Descent-based algorithms. We de-signed Factorbird to meet the following desiderata: (a) scalability to tall and wide matrices with dozens of billions of non-zeros, (b) extensibility to different kinds of models and loss functions as long as they can be optimized using Stochastic Gradient Descent (SGD), and (c) adaptability to both batch and streaming scenar-ios. Factorbird uses a parameter server in order to scale to models that exceed the memory of an individual machine, and employs lock-free Hogwild!-style learning with a special partitioning scheme to drastically reduce conflicting updates. We also discuss other aspects of the design of our system such as how to efficiently grid search for hyperparameters at scale. We present experiments of Factorbird on a matrix built from a subset of Twitter’s interaction graph, consisting of more than 38 billion non-zeros and about 200 million rows and columns, which is to the best of our knowledge the largest matrix on which factorization results have been reported in the literature. 1
(Show Context)

Citation Context

...e prototype of a system that elegantly solves these challenges. (1) The resulting factor matrices for a huge network quickly become larger than the memory available on an individual commodity machine =-=[22, 23]-=-. For example, U and V with k = 100 and a single precision factor representation for a graph with 250 million vertices already have a combined size of about 200 GB. This estimation does not even take ...

Powered by: Apache Solr
  • About CiteSeerX
  • Submit and Index Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2019 The Pennsylvania State University