• Documents
  • Authors
  • Tables
  • Log in
  • Sign up
  • MetaCart
  • DMCA
  • Donate

CiteSeerX logo

Advanced Search Include Citations

Tools

Sorted by:
Try your query at:
Semantic Scholar Scholar Academic
Google Bing DBLP
Results 1 - 10 of 925
Next 10 →

Coverage Control for Mobile Sensing Networks

by Jorge Cortes, Sonia Martínez, Timur Karatas, Francesco Bullo , 2002
"... This paper presents control and coordination algorithms for groups of vehicles. The focus is on autonomous vehicle networks performing distributed sensing tasks where each vehicle plays the role of a mobile tunable sensor. The paper proposes gradient descent algorithms for a class of utility functio ..."
Abstract - Cited by 582 (49 self) - Add to MetaCart
functions which encode optimal coverage and sensing policies. The resulting closed-loop behavior is adaptive, distributed, asynchronous, and verifiably correct.

Between MDPs and Semi-MDPs: A Framework for Temporal Abstraction in Reinforcement Learning

by Richard S. Sutton , Doina Precup , Satinder Singh , 1999
"... Learning, planning, and representing knowledge at multiple levels of temporal abstraction are key, longstanding challenges for AI. In this paper we consider how these challenges can be addressed within the mathematical framework of reinforcement learning and Markov decision processes (MDPs). We exte ..."
Abstract - Cited by 569 (38 self) - Add to MetaCart
extend the usual notion of action in this framework to include options|closed-loop policies for taking action over a period of time. Examples of options include picking up an object, going to lunch, and traveling to a distant city, as well as primitive actions such as muscle twitches and joint knowledge

Stochastic

by S Av
"... tim ue he f inc rta the 2009 Elsevier Ltd. All rights reserved. ndustri evolv duct d n orde are d as a closed-loop optimizer that rejects disturbances affecting profit [45]. Nevertheless, an important limitation of RTO is that it is en-tirely reactive, in the sense that only current disturbances ar ..."
Abstract - Add to MetaCart
tim ue he f inc rta the 2009 Elsevier Ltd. All rights reserved. ndustri evolv duct d n orde are d as a closed-loop optimizer that rejects disturbances affecting profit [45]. Nevertheless, an important limitation of RTO is that it is en-tirely reactive, in the sense that only current disturbances

Robust mapping and localization in indoor environments using sonar data

by Juan D. Tardós, José Neira, Paul M. Newman, John J. Leonard - INT. J. ROBOTICS RESEARCH , 2002
"... In this paper we describe a new technique for the creation of featurebased stochastic maps using standard Polaroid sonar sensors. The fundamental contributions of our proposal are: (1) a perceptual grouping process that permits the robust identification and localization of environmental features, su ..."
Abstract - Cited by 179 (30 self) - Add to MetaCart
correspond to the same environment feature, allowing the system to update the stochastic map accordingly, and perform tasks such as revisiting and loop closing. We demonstrate the practicality of this approach by building a geometric map of a medium size, real indoor environment, with several people moving

LARGE POPULATION STOCHASTIC DYNAMIC GAMES: CLOSED-LOOP MCKEAN-VLASOV SYSTEMS AND THE NASH Certainty Equivalence Principle

by Minyi Huang, Roland P. Malhamé, Peter E. Caines , 2006
"... We consider stochastic dynamic games in large population conditions where multiclass agents are weakly coupled via their individual dynamics and costs. We approach this large population game problem by the so-called Nash Certainty Equivalence (NCE) Principle which leads to a decentralized control ..."
Abstract - Cited by 74 (9 self) - Add to MetaCart
depend upon this measure. We designate the NCE Principle as the property that the resulting scheme is consistent (or soluble), i.e. the prescribed control laws produce sample paths which produce the mass effect measure. By construction, the overall closed-loop behaviour is such that each agent’s

Stabilizability of stochastic linear systems with finite feedback data rates

by Girish N. Nair, Robin J. Evans - SIAM JOUR. CONTR. OPTIM , 2004
"... Feedback control with limited data rates is an emerging area which incorporates ideas from both control and information theory. A fundamental question it poses is how low the closed loop data rate can be made before a given dynamical system is impossible to stabilize by any coding and control law. ..."
Abstract - Cited by 136 (8 self) - Add to MetaCart
Feedback control with limited data rates is an emerging area which incorporates ideas from both control and information theory. A fundamental question it poses is how low the closed loop data rate can be made before a given dynamical system is impossible to stabilize by any coding and control law

Optimal monetary policy in an economy with inflation persistence

by Jón Steinsson, Jón Steinsson - Journal of Monetary Economics , 2003
"... This paper presents a closed economy dynamic stochastic general equilibrium model with mo-nopolistic competition and sticky prices. Two types of price setters are assumed to exist. One acts rationally given Calvo-type constraints on price setting. The other type sets prices accord-ing to a rule-of-t ..."
Abstract - Cited by 147 (2 self) - Add to MetaCart
This paper presents a closed economy dynamic stochastic general equilibrium model with mo-nopolistic competition and sticky prices. Two types of price setters are assumed to exist. One acts rationally given Calvo-type constraints on price setting. The other type sets prices accord-ing to a rule

Stochastic Analysis and Control of Real-Time Systems with Random Time Delays

by Johan Nilsson, Bo Bernhardsson, Björn Wittenmark - Automatica
"... The paper discusses modeling and analysis of real-time systems subject to random time delays in the communication network. A new method for analysis of different control schemes is presented. The method is used to evaluate different suggested schemes from the literature. A new scheme, using so calle ..."
Abstract - Cited by 120 (4 self) - Add to MetaCart
systems, real-time systems, stochastic control, stochastic parameters, timing jitter. 1 Introduction Many real-time systems are implemented as distributed control systems, where the control loops are closed over a communication network or a field bus. There will inevitably be time delays

Planning with Closed-Loop Macro Actions

by Doina Precup, Richard S. Sutton, Satinder Singh , 1997
"... Planning and learning at multiple levels of temporal abstraction is a key problem for artificial intelligence. In this paper we summarize an approach to this problem based on the mathematical framework of Markov decision processes and reinforcement learning. Conventional model-based reinforcement le ..."
Abstract - Cited by 21 (4 self) - Add to MetaCart
in that they are closed loop, uncertain, and of variable duration. Macro actions are needed to represent common-sense higher-level actions such as going to lunch, grasping an object, or traveling to a distant city. This paper generalizes prior work on temporally abstract models (Sutton 1995) and extends it from

Closed-Loop Learning of Visual Control Policies

by Sébastien Jodogne, Justus H. Piater , 2007
"... In this paper we present a general, flexible framework for learning mappings from images to actions by interacting with the environment. The basic idea is to introduce a feature-based image classifier in front of a reinforcement learning algorithm. The classifier partitions the visual space accordin ..."
Abstract - Cited by 12 (1 self) - Add to MetaCart
In this paper we present a general, flexible framework for learning mappings from images to actions by interacting with the environment. The basic idea is to introduce a feature-based image classifier in front of a reinforcement learning algorithm. The classifier partitions the visual space according to the presence or absence of few highly informative local descriptors that are incrementally selected in a sequence of attempts to remove perceptual aliasing. We also address the problem of fighting overfitting in such a greedy algorithm. Finally, we show how high-level visual features can be generated when the power of local descriptors is insufficient for completely disambiguating the aliased states. This is done by building a hierarchy of composite features that consist of recursive spatial combinations of visual features. We demonstrate the efficacy of our algorithms by solving three visual navigation tasks and a visual version of the classical “Car on the Hill” control problem.
Next 10 →
Results 1 - 10 of 925
Powered by: Apache Solr
  • About CiteSeerX
  • Submit and Index Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2019 The Pennsylvania State University