• Documents
  • Authors
  • Tables
  • Log in
  • Sign up
  • MetaCart
  • DMCA
  • Donate

CiteSeerX logo

Advanced Search Include Citations

Tools

Sorted by:
Try your query at:
Semantic Scholar Scholar Academic
Google Bing DBLP
Results 1 - 10 of 3,259
Next 10 →

ASAP-UCT: Abstraction of State-Action Pairs in UCT

by Ankit An, Aditya Grover, Parag Singla
"... Monte-Carlo Tree Search (MCTS) algorithms such as UCT are an attractive online framework for solv-ing planning under uncertainty problems modeled as a Markov Decision Process. However, MCTS search trees are constructed in flat state and action spaces, which can lead to poor policies for large proble ..."
Abstract - Add to MetaCart
(Abstraction of State-Action Pairs) frame-work, which extends and unifies past work on do-main abstractions by holistically aggregating both states and state-action pairs – ASAP uncovers a much larger number of symmetries in a given do-main. Second, we propose ASAP-UCT, which im-plements ASAP

A Control-Theoretic Approach to Flow Control

by Srinivasan Keshav , 1991
"... This paper presents a control-theoretic approach to reactive flow control in networks that do not reserve bandwidth. We assume a round-robin-like queue service discipline in the output queues of the network’s switches, and propose deterministic and stochastic models for a single conversation in a ne ..."
Abstract - Cited by 454 (8 self) - Add to MetaCart
network of such switches. These models motivate the Packet-Pair rate probing technique, and a provably stable rate-based flow control scheme. A Kalman state estimator is derived from discrete-time state space analysis, but there are difficulties in using the estimator in practice. These difficulties

Policy gradient methods for reinforcement learning with function approximation.

by Richard S Sutton , David Mcallester , Satinder Singh , Yishay Mansour - In NIPS, , 1999
"... Abstract Function approximation is essential to reinforcement learning, but the standard approach of approximating a value function and determining a policy from it has so far proven theoretically intractable. In this paper we explore an alternative approach in which the policy is explicitly repres ..."
Abstract - Cited by 439 (20 self) - Add to MetaCart
under π, which we assume exists and is independent of s 0 for all policies. In the average reward formulation, the value of a state-action pair given a policy is defined as The second formulation we cover is that in which there is a designated start state s 0 , and we care only about the long

Evolving 3D morphology and behavior by competition

by Karl Sims - Proceedings of Artificial Life IV , 1994
"... This paper describes a system for the evolution and co-evolution of virtual creatures that compete in physically simulated three-dimensional worlds. Pairs of individuals enter one-on-one contests in which they contend to gain control of a common resource. The winners receive higher relative fitness ..."
Abstract - Cited by 439 (0 self) - Add to MetaCart
This paper describes a system for the evolution and co-evolution of virtual creatures that compete in physically simulated three-dimensional worlds. Pairs of individuals enter one-on-one contests in which they contend to gain control of a common resource. The winners receive higher relative fitness

Future Motion Decisions using State-action Pair Predictions

by Masashi Sugimoto, Kentarou Kurashige
"... Robots that works in a dynamic environment must possess, the ability to autonomously cope with the changes in the environment. This paper proposes an approach to predict changes in the state and actions of robots. Further, this approach attempts to ap-ply predicted future actions to current actions. ..."
Abstract - Add to MetaCart
Robots that works in a dynamic environment must possess, the ability to autonomously cope with the changes in the environment. This paper proposes an approach to predict changes in the state and actions of robots. Further, this approach attempts to ap-ply predicted future actions to current actions

Evolutionary On-line Learning of Cooperative Behaviour with Situation-Action-Pairs”, IcMs2000.2000

by Jörg Denzinger, Michael Kordt, Fachbereich Informatik Universität Kaiserslautern
"... We present a concept to use off-line learning approaches to achieve on-line learning of cooperative behavior of agents and instantiate this concept for evolutionary learning with agents based on prototype situation-action-pairs and the nearest-neighbor rule. For such an agent model also modeling of ..."
Abstract - Cited by 19 (9 self) - Add to MetaCart
We present a concept to use off-line learning approaches to achieve on-line learning of cooperative behavior of agents and instantiate this concept for evolutionary learning with agents based on prototype situation-action-pairs and the nearest-neighbor rule. For such an agent model also modeling

From frequency to meaning : Vector space models of semantics

by Peter D. Turney, Patrick Pantel - Journal of Artificial Intelligence Research , 2010
"... Computers understand very little of the meaning of human language. This profoundly limits our ability to give instructions to computers, the ability of computers to explain their actions to us, and the ability of computers to analyse and process text. Vector space models (VSMs) of semantics are begi ..."
Abstract - Cited by 347 (3 self) - Add to MetaCart
Computers understand very little of the meaning of human language. This profoundly limits our ability to give instructions to computers, the ability of computers to explain their actions to us, and the ability of computers to analyse and process text. Vector space models (VSMs) of semantics

The symplectic nature of fundamental groups of surfaces

by William M. Goldman - ADV. MATH , 1984
"... A symplectic structure on a manifold is a closed nondegenerate exterior 2-form. The most common type of symplectic structure arises on a complex manifold as the imaginary part of a Hermitian metric which is Klhler. Many moduli spaces associated with Riemann surfaces have such Kahler structures: the ..."
Abstract - Cited by 223 (8 self) - Add to MetaCart
of the above examples and interprets their symplectic structures in terms of the inter-section pairing on the surface. Our setup is as follows. Consider a closed oriented topological surface S with fundamental group rt and let G be a connected Lie group. The space Hom(n, G) consisting of representations rr + G

Action Bank: A High-Level Representation of Activity in Video

by Sreemanananth Sadan, Jason J. Corso
"... Activity recognition in video is dominated by low- and mid-level features, and while demonstrably capable, by nature, these features carry little semantic meaning. Inspired by the recent object bank approach to image representation, we present Action Bank, a new high-level representation of video. A ..."
Abstract - Cited by 170 (8 self) - Add to MetaCart
. Action bank is comprised of many individual action detectors sampled broadly in semantic space as well as viewpoint space. Our representation is constructed to be semantically rich and even when paired with simple linear SVM classifiers is capable of highly discriminative performance. We have tested

An Architecture for Autonomy

by R. Alami, R. Chatila, S. Fleury, M. Ghallab, F. Ingrand - INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH , 1998
"... An autonomous robot o ers a challenging and ideal field for the study of intelligent architectures. Autonomy within a rational behavior could be evaluated by the robot's effectiveness and robustness in carrying out tasks in different and ill-known environments. It raises major requirements on t ..."
Abstract - Cited by 205 (42 self) - Add to MetaCart
constraints, to perform corresponding actions and to control their execution in real-time, while being reactive to possible events. The general architecture is composed of three levels: a decision level, an execution level and a functional level. The later is composed of modules that embed the functions
Next 10 →
Results 1 - 10 of 3,259
Powered by: Apache Solr
  • About CiteSeerX
  • Submit and Index Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2019 The Pennsylvania State University