Results 1 - 10
of
48
Combining labeled and unlabeled data with co-training
, 1998
"... We consider the problem of using a large unlabeled sample to boost performance of a learning algorithm when only a small set of labeled examples is available. In particular, we consider a setting in which the description of each example can be partitioned into two distinct views, motivated by the ta ..."
Abstract
-
Cited by 947 (27 self)
- Add to MetaCart
We consider the problem of using a large unlabeled sample to boost performance of a learning algorithm when only a small set of labeled examples is available. In particular, we consider a setting in which the description of each example can be partitioned into two distinct views, motivated by the task of learning to classify web pages. For example, the description of a web page can be partitioned into the words occurring on that page, and the words occurring in hyperlinks that point to that page. We assume that either view of the example would be su cient for learning if we had enough labeled data, but our goal is to use both views together to allow inexpensive unlabeled data to augment amuch smaller set of labeled examples. Speci cally, the presence of two distinct views of each example suggests strategies in which two learning algorithms are trained separately on each view, and then each algorithm's predictions on new unlabeled examples are used to enlarge the training set of the other. Our goal in this paper is to provide a PAC-style analysis for this setting, and, more broadly, a PAC-style framework for the general problem of learning from both labeled and unlabeled data. We also provide empirical results on real web-page data indicating that this use of unlabeled examples can lead to signi cant improvement of hypotheses in practice. As part of our analysis, we provide new re-
An Experimental Comparison of Min-Cut/Max-Flow Algorithms for Energy Minimization in Vision
- IEEE Transactions on Pattern Analysis and Machine Intelligence
, 2001
"... After [10, 15, 12, 2, 4] minimum cut/maximum ow algorithms on graphs emerged as an increasingly useful tool for exact or approximate energy minimization in low-level vision. The combinatorial optimization literature provides many min-cut/max-ow algorithms with dierent polynomial time complexity. ..."
Abstract
-
Cited by 471 (36 self)
- Add to MetaCart
After [10, 15, 12, 2, 4] minimum cut/maximum ow algorithms on graphs emerged as an increasingly useful tool for exact or approximate energy minimization in low-level vision. The combinatorial optimization literature provides many min-cut/max-ow algorithms with dierent polynomial time complexity. Their practical eciency, however, has to date been studied mainly outside the scope of computer vision.
A new approach to the minimum cut problem
- Journal of the ACM
, 1996
"... Abstract. This paper presents a new approach to finding minimum cuts in undirected graphs. The fundamental principle is simple: the edges in a graph’s minimum cut form an extremely small fraction of the graph’s edges. Using this idea, we give a randomized, strongly polynomial algorithm that finds th ..."
Abstract
-
Cited by 83 (8 self)
- Add to MetaCart
Abstract. This paper presents a new approach to finding minimum cuts in undirected graphs. The fundamental principle is simple: the edges in a graph’s minimum cut form an extremely small fraction of the graph’s edges. Using this idea, we give a randomized, strongly polynomial algorithm that finds the minimum cut in an arbitrarily weighted undirected graph with high probability. The algorithm runs in O(n 2 log 3 n) time, a significant improvement over the previous Õ(mn) time bounds based on maximum flows. It is simple and intuitive and uses no complex data structures. Our algorithm can be parallelized to run in �� � with n 2 processors; this gives the first proof that the minimum cut problem can be solved in ���. The algorithm does more than find a single minimum cut; it finds all of them. With minor modifications, our algorithm solves two other problems of interest. Our algorithm finds all cuts with value within a multiplicative factor of � of the minimum cut’s in expected Õ(n 2 � ) time, or in �� � with n 2 � processors. The problem of finding a minimum multiway cut of a graph into r pieces is solved in expected Õ(n 2(r�1) ) time, or in �� � with n 2(r�1) processors. The “trace ” of the algorithm’s execution on these two problems forms a new compact data structure for representing all small cuts and all multiway cuts in a graph. This data structure can be efficiently transformed into the
Minimum Cuts in Near-Linear Time
- In Proceedings of the 28 th ACM Symposium on Theory of Computing [ACM96
, 1996
"... We significantly improve known time bounds for solving the minimum cut problem on undirected graphs. We use a "semi-duality" between minimum cuts and maximum spanning tree packings combined with our previously developed random sampling techniques. We give a randomized (Monte Carlo) algorithm that fi ..."
Abstract
-
Cited by 63 (11 self)
- Add to MetaCart
We significantly improve known time bounds for solving the minimum cut problem on undirected graphs. We use a "semi-duality" between minimum cuts and maximum spanning tree packings combined with our previously developed random sampling techniques. We give a randomized (Monte Carlo) algorithm that finds a minimum cut in an m-edge, n-vertex graph with high probability in O(m log 3 n) time. We also give a simpler randomized algorithm that finds all minimum cuts with high probability in O(n 2 log n) time. This variant has an optimal RNC parallelization. Both variants improve on the previous best time bound of O(n 2 log 3 n). Other applications of the tree-packing approach are new, nearly tight bounds on the number of near minimum cuts a graph may have and a new data structure for representing them in a space-efficient manner. 1 Introduction The minimum cut problem has been studied for many years as a fundamental graph optimization problem with numerous applications. Initially, th...
Improved Approximation Algorithms for Uniform Connectivity Problems
- J. Algorithms
"... The problem of finding minimum weight spanning subgraphs with a given connectivity requirement is considered. The problem is NP-hard when the connectivity requirement is greater than one. Polynomial time approximation algorithms for various weighted and unweighted connectivity problems are given. Th ..."
Abstract
-
Cited by 57 (2 self)
- Add to MetaCart
The problem of finding minimum weight spanning subgraphs with a given connectivity requirement is considered. The problem is NP-hard when the connectivity requirement is greater than one. Polynomial time approximation algorithms for various weighted and unweighted connectivity problems are given. The following results are presented: 1. For the unweighted k-edge-connectivity problem an approximation algorithm that achieves a performance ratio of 1.85 is described. This is the first polynomial-time algorithm that achieves a constant less than 2, for all k. 2. For the weighted k-vertex-connectivity problem, a constant factor approximation algorithm is given assuming that the edge-weights satisfy the triangle inequality. This is the first constant factor approximation algorithm for this problem. 3. For the case of biconnectivity, with no assumptions about the weights of the edges, an algorithm that achieves a factor asymptotically approaching 2 is described. This matches the previous best...
A Randomized Fully Polynomial Time Approximation Scheme for the All Terminal Network Reliability Problem
- SIAM J. Comput
, 1996
"... The classic all-terminal network reliability problem posits a graph, each of whose edges fails independently with some given probability. The goal is to determine the probability that the network becomes disconnected due to edge failures. This problem has obvious applications in the design of commun ..."
Abstract
-
Cited by 43 (2 self)
- Add to MetaCart
The classic all-terminal network reliability problem posits a graph, each of whose edges fails independently with some given probability. The goal is to determine the probability that the network becomes disconnected due to edge failures. This problem has obvious applications in the design of communication networks. Since the problem is ]P-complete, and thus believed hard to solve exactly, a great deal of research has been devoted to estimating the failure probability. In this paper, we give a fully polynomial randomized approximation scheme that, given any n-vertex graph with specified failure probabilities, computes in time polynomial in n and 1=ffl an estimate for the failure probability that is accurate to within a relative error of 1 \Sigma ffl with high probability. We also give a deterministic polynomial approximation scheme for the case of small failure probabilities. Some extensions to evaluating probabilities of k-connectivity, strong connectivity in directed Eulerian graph...
Experimental Study of Minimum Cut Algorithms
- PROCEEDINGS OF THE EIGHTH ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS (SODA)
, 1997
"... Recently, several new algorithms have been developed for the minimum cut problem. These algorithms are very different from the earlier ones and from each other and substantially improve worst-case time bounds for the problem. We conduct experimental evaluation the relative performance of these algor ..."
Abstract
-
Cited by 36 (3 self)
- Add to MetaCart
Recently, several new algorithms have been developed for the minimum cut problem. These algorithms are very different from the earlier ones and from each other and substantially improve worst-case time bounds for the problem. We conduct experimental evaluation the relative performance of these algorithms. In the process, we develop heuristics and data structures that substantially improve practical performance of the algorithms. We also develop problem families for testing minimum cut algorithms. Our work leads to a better understanding of practical performance of the minimum cut algorithms and produces very efficient codes for the problem.
Approximating Minimum-Size k-Connected Spanning Subgraphs via Matching
- SIAM J. Comput
, 1998
"... Abstract: An efficient heuristic is presented for the problem of finding a minimum-size k- connected spanning subgraph of an (undirected or directed) simple graph G =(V#E). There are four versions of the problem, and the approximation guarantees are as follows: minimum-size k-node connected spann ..."
Abstract
-
Cited by 31 (3 self)
- Add to MetaCart
Abstract: An efficient heuristic is presented for the problem of finding a minimum-size k- connected spanning subgraph of an (undirected or directed) simple graph G =(V#E). There are four versions of the problem, and the approximation guarantees are as follows: minimum-size k-node connected spanning subgraph of an undirected graph 1+[1=k], minimum-size k-node connected spanning subgraph of a directed graph 1+[1=k], minimum-size k-edge connected spanning subgraph of an undirected graph 1+[2=(k + 1)], and minimum-size k-edge connected spanning subgraph of a directed graph 1+[4= p k].
Building chain and cactus representations of all minimum cuts from Hao-Orlin in the same asymptotic run time
, 1998
"... A cactus tree is a simple data structure that represents all minimum cuts of a weighted graph in linear space. We describe the first algorithm that can build a cactus tree from the asymptotically fastest deterministic algorithm that finds all minimum cuts in a weighted graph --- the Hao-Orlin min ..."
Abstract
-
Cited by 21 (1 self)
- Add to MetaCart
A cactus tree is a simple data structure that represents all minimum cuts of a weighted graph in linear space. We describe the first algorithm that can build a cactus tree from the asymptotically fastest deterministic algorithm that finds all minimum cuts in a weighted graph --- the Hao-Orlin minimum cut algorithm. This improves the time to construct the cactus in graphs with n vertices and m edges from O(n 3 ) to O(nm log n 2 =m).
Recent Developments in Maximum Flow Algorithms
- in Proceedings of Scandinavian Workshop on Algorithm Theory (SWAT
, 1998
"... Introduction The maximum flow problem is a classical optimization problem with many applications; see e.g. [1, 18, 39]. Algorithms for this problem have been studied for over four decades. Recently, significant improvements have been made in theoretical performance of maximum flow algorithms. In t ..."
Abstract
-
Cited by 17 (1 self)
- Add to MetaCart
Introduction The maximum flow problem is a classical optimization problem with many applications; see e.g. [1, 18, 39]. Algorithms for this problem have been studied for over four decades. Recently, significant improvements have been made in theoretical performance of maximum flow algorithms. In this survey we put these results in perspective and provide pointers to the literature. We assume that the reader is familiar with basic flow algorithms, including Dinitz' blocking flow algorithm [13]. 2 Preliminaries The maximum flow problem is to find a flow of the maximum value given a graph G with arc capacities, a source s, and a sink t, Here a flow is a function on arcs that satisfies capacity constraints for all arcs and conservation constraints for all vertices except the source and the sink. For more details, see [1, 18, 39]. We distinguish between directed

