MetaCart Sign in to MyCiteSeerX

Include Citations | Advanced Search | Help

Disambiguated Search | Include Citations | Advanced Search | Help

Mining of Concurrent Text and Time Series (0) [17 citations — 0 self]

by Victor Lavrenko ,  Matt Schmill ,  Dawn Lawrie ,  Paul Ogilvie ,  David Jensen ,  James Allan
In proceedings of the 6 th ACM SIGKDD Int'l Conference on Knowledge Discovery and Data Mining Workshop on Text Mining
Add To MetaCart

Abstract:

We present a unique approach to identifying news stories that influence the behavior of financial markets. We describe the design and implementation of AEnalyst, a system for predicting trends in stock prices based on the content of news stories that precede the trends. We identify trends in time series using piecewise linear fitting and then assign labels to the trends according to an automated binning procedure. We use language models to represent patterns of language that are highly associated with particular labeled trends. AEnalyst can then identify news stories that are highly indicative of future trends. We evaluate the system in terms of its ability to predict forthcoming trends in the stock prices. We perform a market simulation, demonstrate that AEnalyst is capable of producing profits that are significantly highly than random.

Citations

490 A language modeling approach to information retrieval – Ponte, Croft - 1998
370 On the Optimality of Simple Bayesian Classifier under Zero-one Loss – Domingos, P, et al. - 1997
299 Cluster analysis – Everitt - 1993
155 The Zero-Frequency Problem: Estimating the Probabilities of Novel Events in Adaptive Text Compression – Witten, Bell - 1991
131 The DET Curve in Assessment of Detection Task Performance," presented at Eurospeech – Martin, Doddington, et al. - 1997
101 An enhanced representation of time series which allows fast and accurate classification, clustering and relevance feedback – Keogh, Pazzani - 1998
91 A theoretical basis for the use of co-occurrence data in information retrieval – Rijsbergen - 1977
81 Activity monitoring: Noticing interesting changes in behavior – Fawcett, Provost - 1999
72 Segmentation of plane curves – Pavlidis, Horowitz - 1974
38 Encyclopaedia of Statistical Sciences – Kotz, Johnson, et al. - 1982
25 On the Optimality of the Simple Bayesian Classi under Zero{One Loss – Domingos, Pazzani - 1997
19 Topic detection in broadcast news – Walls, Jin, et al. - 1999
14 Learned models for continuous planning – Schmill, Oates, et al. - 1999
6 Text Processing for Classification – Cho, Wüthrich, et al. - 1999
4 Feedforward and recurrent neural networks and genetic programs for stock market and time series forecasting – McCluskey - 1993
3 Randomization Tests, Third Edition – Edgington - 1995
1 Text processing for classi – Cho, Wutrich, et al. - 1998