Results 11 - 20
of
804
Multi-objective Monte-Carlo Tree Search
- in "Asian Conferenc on Machine Learning (ACML 2012
"... Concerned with multi-objective reinforcement learning (MORL), this paper presents MO-MCTS, an extension of Monte-Carlo Tree Search to multi-objective sequential decision making. The known multi-objective indicator referred to as hyper-volume indicator is used to define an action selection criterion, ..."
Abstract
-
Cited by 2 (1 self)
- Add to MetaCart
Concerned with multi-objective reinforcement learning (MORL), this paper presents MO-MCTS, an extension of Monte-Carlo Tree Search to multi-objective sequential decision making. The known multi-objective indicator referred to as hyper-volume indicator is used to define an action selection criterion
AVoCS 2006 Multi-Object Checking Counterexamples
"... Model checking has become the most widely used technique for the verification of state based systems. In addition to its automation in checking whether a system model satisfies a specification, model checking provides a counterexample trace if the checking condition does not hold. Such a sequence of ..."
Abstract
- Add to MetaCart
to multi-object checking – the automated generation of a global system trace, if a specification in multi-object logic does not hold.
Multi-Table Reinforcement Learning for Visual Object Recognition
"... Abstract This paper presents a bag of feature based method for visual object recognition. Our contribution is focussed on the selection of the best feature descriptor. It is implemented by using a novel multi-table reinforcement learning method that selects among five of classical descriptors (i.e., ..."
Abstract
- Add to MetaCart
Abstract This paper presents a bag of feature based method for visual object recognition. Our contribution is focussed on the selection of the best feature descriptor. It is implemented by using a novel multi-table reinforcement learning method that selects among five of classical descriptors (i
Asian Conference on Machine Learning Multi-objective Monte-Carlo Tree Search
, 2012
"... Concerned with multi-objective reinforcement learning (MORL), this paper presents MO-MCTS, an extension of Monte-Carlo Tree Search to multi-objective sequential decision making. The known multi-objective indicator referred to as hyper-volume indicator is used to define an action selection criterion, ..."
Abstract
- Add to MetaCart
Concerned with multi-objective reinforcement learning (MORL), this paper presents MO-MCTS, an extension of Monte-Carlo Tree Search to multi-objective sequential decision making. The known multi-objective indicator referred to as hyper-volume indicator is used to define an action selection criterion
A Novel Adaptive Weight Selection Algorithm for Multi-Objective Multi-Agent Reinforcement Learning
"... Abstract — To solve multi-objective problems, multiple re-ward signals are often scalarized into a single value and further processed using established single-objective problem solving techniques. While the field of multi-objective optimization has made many advances in applying scalarization techni ..."
Abstract
-
Cited by 1 (1 self)
- Add to MetaCart
the multi-objective multi-agent reinforcement learning setup. The analysed approaches intel-ligently explore the weight-space in order to find a wider range of system trade-offs. In our second contribution, we propose a novel adaptive weight algorithm which interacts with the underlying local multi-objective
Combined Real and Reactive Power Economic Dispatch using Multi-Objective Reinforced Learning with Optimized Losses
"... Abstract- Most of the economic dispatch (ED) works so far deal with real power dispatch only. With the integration of renewable energy into the grid, reactive power dispatch cannot be ignored any longer due to its importance in providing security and reliability in power system planning, operation a ..."
Abstract
- Add to MetaCart
objective reinforcement learning (MORL) is proposed in this paper. The IEEE 14 Bus was used to validate the effectiveness of the proposed CRRED formulation and Hybrid method.The numerical results obtained show that combining real and reactive power results in a 0.95 % decrease in the overall generation cost
LOY et al.: MODELLING MULTI-OBJECT ACTIVITY BY GAUSSIAN PROCESSES 1 Modelling Multi-object Activity by Gaussian Processes
"... We present a new approach for activity modelling and anomaly detection based on non-parametric Gaussian Process (GP) models. Specifically, GP regression models are formulated to learn non-linear relationships between multi-object activity patterns observed from semantically decomposed regions in com ..."
Abstract
- Add to MetaCart
We present a new approach for activity modelling and anomaly detection based on non-parametric Gaussian Process (GP) models. Specifically, GP regression models are formulated to learn non-linear relationships between multi-object activity patterns observed from semantically decomposed regions
ReCoM: reinforcement clustering of multi-type interrelated data objects
- In Proceedings of the ACM SIGIR Conference on Research and Development in Information Retrieval
, 2003
"... Most existing clustering algorithms cluster highly related data objects such as Web pages and Web users separately. The interrelation among different types of data objects is either not considered, or represented by a static feature space and treated in the same ways as other attributes of the objec ..."
Abstract
-
Cited by 39 (11 self)
- Add to MetaCart
of the objects. In this paper, we propose a novel clustering approach for clustering multi-type interrelated data objects, ReCoM (Reinforcement Clustering of Multi-type Interrelated data objects). Under this approach, relationships among data objects are used to improve the cluster quality of interrelated data
An Adaptive Multi-Objective Scheduling Selection Framework for Continuous Query Processing ∗
"... Adaptive operator scheduling algorithms for continuous query processing are usually designed to serve a single performance objective, such as minimizing memory usage or maximizing query throughput. We observe that different performance objectives may sometimes conflict with each other. Also due to t ..."
Abstract
- Add to MetaCart
to the dynamic nature of streaming environments, the performance objective may need to change dynamically. Furthermore, the performance specification defined by users may itself be multi-dimensional. Therefore, utilizing a single scheduling algorithm optimized for a single objective is no longer sufficient
CONTEXT MEMORY NETWORKS FOR MULTI-OBJECTIVE SEMANTIC PARSING IN CONVERSATIONAL UNDERSTANDING
"... ABSTRACT The end-to-end multi-domain and multi-task learning of the full semantic frame of user utterances (i.e., domain and intent classes and slots in utterances) have recently emerged as a new paradigm in spoken language understanding. An advantage of the joint optimization of these semantic fra ..."
Abstract
- Add to MetaCart
are then provided to a new multi-objective long short term memory network (LSTM) to infer the intent class and slot tags. Our empirical investigations on CMN show impressive gains over the end-to-end LSTM baselines on ATIS dataset as well as two other humanto-machine conversational datasets. Index Terms
Results 11 - 20
of
804