A hypergraph-partitioned vertex programming approach for large-scale consensus optimization
2013 IEEE International Conference on Big Data, 2013•ieeexplore.ieee.org
In modern data science problems, techniques for extracting value from big data require
performing large-scale optimization over heterogenous, irregularly structured data. Much of
this data is best represented as multi-relational graphs, making vertex-programming
abstractions such as those of Pregel and GraphLab ideal fits for modern large-scale data
analysis. In this paper, we describe a vertex-programming implementation of a popular
consensus optimization technique known as the alternating direction method of multipliers …
performing large-scale optimization over heterogenous, irregularly structured data. Much of
this data is best represented as multi-relational graphs, making vertex-programming
abstractions such as those of Pregel and GraphLab ideal fits for modern large-scale data
analysis. In this paper, we describe a vertex-programming implementation of a popular
consensus optimization technique known as the alternating direction method of multipliers …
In modern data science problems, techniques for extracting value from big data require performing large-scale optimization over heterogenous, irregularly structured data. Much of this data is best represented as multi-relational graphs, making vertex-programming abstractions such as those of Pregel and GraphLab ideal fits for modern large-scale data analysis. In this paper, we describe a vertex-programming implementation of a popular consensus optimization technique known as the alternating direction method of multipliers (ADMM) [1]. ADMM consensus optimization allows the elegant solution of complex objectives such as inference in rich probabilistic models. We also introduce a novel hypergraph partitioning technique that improves over the state-of-the-art vertex programming framework and significantly reduces the communication cost by reducing the number of replicated nodes by an order of magnitude. We implement our algorithm in GraphLab and measure scaling performance on a variety of realistic bipartite graphs and a large synthetic voter-opinion analysis application. We show a 50% improvement in running time over the current GraphLab partitioning scheme.
ieeexplore.ieee.org
Showing the best result for this search. See all results