Results 1 - 10
of
30
On the limited memory BFGS method for large scale optimization
- Mathematical Programming
, 1989
"... this paper has appeared in ..."
First and Second-Order Methods for Learning: between Steepest Descent and Newton's Method
- Neural Computation
, 1992
"... On-line first order backpropagation is sufficiently fast and effective for many large-scale classification problems but for very high precision mappings, batch processing may be the method of choice. This paper reviews first- and second-order optimization methods for learning in feedforward neura ..."
Abstract
-
Cited by 108 (6 self)
- Add to MetaCart
On-line first order backpropagation is sufficiently fast and effective for many large-scale classification problems but for very high precision mappings, batch processing may be the method of choice. This paper reviews first- and second-order optimization methods for learning in feedforward neural networks. The viewpoint is that of optimization: many methods can be cast in the language of optimization techniques, allowing the transfer to neural nets of detailed results about computational complexity and safety procedures to ensure convergence and to avoid numerical problems. The review is not intended to deliver detailed prescriptions for the most appropriate methods in specific applications, but to illustrate the main characteristics of the different methods and their mutual relations.
Theory of Algorithms for Unconstrained Optimization
, 1992
"... this article I will attempt to review the most recent advances in the theory of unconstrained optimization, and will also describe some important open questions. Before doing so, I should point out that the value of the theory of optimization is not limited to its capacity for explaining the behavio ..."
Abstract
-
Cited by 67 (1 self)
- Add to MetaCart
this article I will attempt to review the most recent advances in the theory of unconstrained optimization, and will also describe some important open questions. Before doing so, I should point out that the value of the theory of optimization is not limited to its capacity for explaining the behavior of the most widely used techniques. The question
Comparative study of stock trend prediction using time delay, recurrent and probabilistic neural networks
- IEEE Transactions on Neural Networks
, 1998
"... Abstract — Three networks are compared for low false alarm stock trend predictions. Short-term trends, particularly attractive for neural network analysis, can be used profitably in scenarios such as option trading, but only with significant risk. Therefore, we focus on limiting false alarms, which ..."
Abstract
-
Cited by 29 (0 self)
- Add to MetaCart
Abstract — Three networks are compared for low false alarm stock trend predictions. Short-term trends, particularly attractive for neural network analysis, can be used profitably in scenarios such as option trading, but only with significant risk. Therefore, we focus on limiting false alarms, which improves the risk/reward ratio by preventing losses. To predict stock trends, we exploit time delay, recurrent, and probabilistic neural networks (TDNN, RNN, and PNN, respectively), utilizing conjugate gradient and multistream extended Kalman filter training for TDNN and RNN. We also discuss different predictability analysis techniques and perform an analysis of predictability based on a history of daily closing price. Our results indicate that all the networks are feasible, the primary preference being one of convenience. Index Terms—Conjugate gradient, extended Kalman filter, financial engineering, financial forecasting, predictability analysis, probablistic neural network, recurrent neural network, stock market forecasting, time delay neural network, time series analysis, time series prediction, trend prediction. I.
Fast Training Algorithms For Multi-Layer Neural Nets
, 1993
"... Training a multilayer neural net by back-propagation is slow and requires arbitrary choices regarding the number of hidden units and layers. This paper describes an algorithm which is much faster than back-propagation and for which it is not necessary to specify the number of hidden units in advance ..."
Abstract
-
Cited by 25 (0 self)
- Add to MetaCart
Training a multilayer neural net by back-propagation is slow and requires arbitrary choices regarding the number of hidden units and layers. This paper describes an algorithm which is much faster than back-propagation and for which it is not necessary to specify the number of hidden units in advance. The relationship with other fast pattern recognition algorithms, such as algorithms based on k-d trees, is mentioned. The algorithm has been implemented and tested on articial problems such as the parity problem and on real problems arising in speech recognition. Experimental results, including training times and recognition accuracy, are given. Generally, the algorithm achieves accuracy as good as or better than nets trained using back-propagation, and the training process is much faster than back-propagation. Accuracy is comparable to that for the \nearest neighbour" algorithm, which is slower and requires more storage space. Comments Only the Abstract is given here. The full paper ap...
Location-Aware Computing: A Neural Network Model For Determining Location In Wireless LANs
, 2002
"... The strengths of the RF signals arriving from more access points in a wireless LANs are related to the position of the mobile terminal and can be used to derive the location of the user. In a ..."
Abstract
-
Cited by 18 (1 self)
- Add to MetaCart
The strengths of the RF signals arriving from more access points in a wireless LANs are related to the position of the mobile terminal and can be used to derive the location of the user. In a
A survey of nonlinear conjugate gradient methods
- Pacific Journal of Optimization
, 2006
"... Abstract. This paper reviews the development of different versions of nonlinear conjugate gradient methods, with special attention given to global convergence properties. ..."
Abstract
-
Cited by 12 (1 self)
- Add to MetaCart
Abstract. This paper reviews the development of different versions of nonlinear conjugate gradient methods, with special attention given to global convergence properties.
A Spectral Conjugate Gradient Method for Unconstrained Optimization
, 1999
"... A family of scaled conjugate-gradient algorithms for large-scale unconstrained minimization is dened. The Perry, the Polak-Ribiere and the Fletcher-Reeves formulae are compared using a spectral scaling derived from Raydan's spectral gradient optimization method. The best combination of formula, ..."
Abstract
-
Cited by 11 (1 self)
- Add to MetaCart
A family of scaled conjugate-gradient algorithms for large-scale unconstrained minimization is dened. The Perry, the Polak-Ribiere and the Fletcher-Reeves formulae are compared using a spectral scaling derived from Raydan's spectral gradient optimization method. The best combination of formula, scaling and initial choice of step-length is compared against well known algorithms using a classical set of problems. An additional comparison involving an ill-conditioned estimation problem in Optics is presented. Keywords. Unconstrained minimization, spectral gradient method, conjugate gradients. AMS: 49M07, 49M10, 90C06, 65K. 1
BFGS with update skipping and varying memory
- SIAM J. Optim
, 1998
"... Abstract. We give conditions under which limited-memory quasi-Newton methods with exact line searches will terminate in n steps when minimizing n-dimensional quadratic functions. We show that although all Broyden family methods terminate in n steps in their full-memory versions, only BFGS does so wi ..."
Abstract
-
Cited by 9 (2 self)
- Add to MetaCart
Abstract. We give conditions under which limited-memory quasi-Newton methods with exact line searches will terminate in n steps when minimizing n-dimensional quadratic functions. We show that although all Broyden family methods terminate in n steps in their full-memory versions, only BFGS does so with limited-memory. Additionally, we show that full-memory Broyden family methods with exact line searches terminate in at most n + p steps when p matrix updates are skipped. We introduce new limited-memory BFGS variants and test them on nonquadratic minimization problems.
A Globally Convergent Version of the Polak-Ribière Conjugate Gradient Method
, 1995
"... In this paper we propose a new line search algorithm that ensures global convergence of the PolakRibi `ere conjugate gradient method for the unconstrained minimization of nonconvex differentiable functions. In particular, we show that every limit point produced by the Polak-Ribi`ere iteration is a s ..."
Abstract
-
Cited by 8 (0 self)
- Add to MetaCart
In this paper we propose a new line search algorithm that ensures global convergence of the PolakRibi `ere conjugate gradient method for the unconstrained minimization of nonconvex differentiable functions. In particular, we show that every limit point produced by the Polak-Ribi`ere iteration is a stationary point of the objective function. Moreover, we prove that, asymptotically, the first stationary point along the search direction can be accepted and that, under strong convexity assumptions, the known global convergence results can be reobtained as a special case. From a computational point of view, we may expect that an algorithm incorporating the stepsize acceptance rules proposed here will retain the same good features of the Polak-Ribi`ere method, while avoiding pathological situations. 1 Introduction The objective of this paper is that of defining a new globally convergent implementation of the PolakRibi `ere conjugate gradient method for the unconstrained minimization of a ...

