• Documents
  • Authors
  • Tables
  • Log in
  • Sign up
  • MetaCart
  • DMCA
  • Donate

CiteSeerX logo

Advanced Search Include Citations
Advanced Search Include Citations

Automatic annotation of spoken language using out-ofdomain resources and domain adaptation (2011)

by A Margolis
Add To MetaCart

Tools

Sorted by:
Results 1 - 1 of 1

COMPOSE: A Semi-Supervised Learning Framework for Initially Labeled Nonstationary Streaming Data

by Karl B. Dyer, Robert Capo, Robi Polikar
"... Abstract – An increasing number of real-world applications are associated with streaming data drawn from drifting and nonstationary distributions that change over time. These applications demand new algorithms that can learn and adapt to such changes, also known as concept drift. Proper characteriza ..."
Abstract - Cited by 3 (0 self) - Add to MetaCart
Abstract – An increasing number of real-world applications are associated with streaming data drawn from drifting and nonstationary distributions that change over time. These applications demand new algorithms that can learn and adapt to such changes, also known as concept drift. Proper characterization of such data with existing approaches typically requires substantial amount of labeled instances, which may be difficult, expensive or even impractical to obtain. In this contribution, we introduce COMPOSE, a computational geometry based framework to learn from nonstationary streaming data, where labels are unavailable (or presented very sporadically) after initialization. We introduce the algorithm in detail, and discuss its results and performances on several synthetic and real-world datasets, which demonstrate the ability of the algorithm to learn under several different scenarios of initially labeled streaming environments (ILSE). On carefully designed synthetic datasets, we compare the performance of COMPOSE against the optimal Bayes classifier, as well as the APT algorithm, which addresses a similar environment referred to as extreme verification latency. Furthermore, using the real-world NOAA Weather Dataset, we demonstrate that COMPOSE is competitive even with a well-established, fully supervised, nonstationary learning algorithm that receives labeled data in every batch.
(Show Context)

Citation Context

...often provide theoretical guarantees and bounds on performance, computational complexity or the number of labeled instances required. A good review of the domain adaptation approaches can be found in =-=[7;24]-=-. B. Semi-supervised Learning Semi-supervised learning uses limited labeled data to transfer their class information to unlabeled data following one or more of four general assumptions [25;26]; i) the...

Powered by: Apache Solr
  • About CiteSeerX
  • Submit and Index Documents
  • Privacy Policy
  • Help
  • Data
  • Source
  • Contact Us

Developed at and hosted by The College of Information Sciences and Technology

© 2007-2019 The Pennsylvania State University