Mining of Concurrent Text and Time Series (0) [17 citations — 0 self]
http://www.cse.cuhk.edu.hk/~mzhou/Classified Refer
http://www.cs.cmu.edu/~dunja/KDDpapers/Lavrenko_TM
http://ciir.cs.umass.edu/pubfiles/ir-203.pdf
http://cobar.cs.umass.edu/pubfiles/ir-203.ps
http://www-2.cs.cmu.edu/People/pto/papers/ciir/ir-
http://ciir.cs.umass.edu/~lavrenko/aenalyst/kdd2k.
http://www-2.cs.cmu.edu/People/pto/papers/ciir/ir-
Abstract:
We present a unique approach to identifying news stories that influence the behavior of financial markets. We describe the design and implementation of AEnalyst, a system for predicting trends in stock prices based on the content of news stories that precede the trends. We identify trends in time series using piecewise linear fitting and then assign labels to the trends according to an automated binning procedure. We use language models to represent patterns of language that are highly associated with particular labeled trends. AEnalyst can then identify news stories that are highly indicative of future trends. We evaluate the system in terms of its ability to predict forthcoming trends in stock prices. We perform a market simulation and demonstrate that AEnalyst is capable of producing profits that are significantly higher than random.
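
The trend-identification step mentioned in the abstract (piecewise linear fitting followed by binning) might be sketched as below. This is only an illustration under stated assumptions, not the AEnalyst implementation: the top-down splitting rule, the `max_err` and `flat` thresholds, and the three labels "rise", "flat", and "fall" are all hypothetical choices made for the example.

```python
from typing import List, Tuple

def fit_line(ys: List[float], lo: int, hi: int) -> Tuple[float, float]:
    """Least-squares line through the points (i, ys[i]) for lo <= i <= hi.
    Returns (slope, intercept)."""
    n = hi - lo + 1
    mean_x = (lo + hi) / 2
    mean_y = sum(ys[lo:hi + 1]) / n
    sxx = sum((x - mean_x) ** 2 for x in range(lo, hi + 1))
    sxy = sum((x - mean_x) * (ys[x] - mean_y) for x in range(lo, hi + 1))
    slope = sxy / sxx if sxx else 0.0
    return slope, mean_y - slope * mean_x

def segment(ys: List[float], lo: int, hi: int,
            max_err: float) -> List[Tuple[int, int, float]]:
    """Top-down segmentation (an assumed rule): fit one line, and if the
    largest residual exceeds max_err, split at that point and recurse.
    Returns a list of (start, end, slope); neighbours share an endpoint."""
    slope, intercept = fit_line(ys, lo, hi)
    errs = [abs(ys[i] - (slope * i + intercept)) for i in range(lo, hi + 1)]
    worst = max(range(len(errs)), key=errs.__getitem__)
    if errs[worst] <= max_err or hi - lo < 2:
        return [(lo, hi, slope)]
    # Keep the split strictly inside the interval so both halves shrink.
    split = lo + min(max(worst, 1), hi - lo - 1)
    return segment(ys, lo, split, max_err) + segment(ys, split, hi, max_err)

def label(slope: float, flat: float = 0.05) -> str:
    """Bin a segment slope into a hypothetical three-way trend label."""
    if slope > flat:
        return "rise"
    if slope < -flat:
        return "fall"
    return "flat"

# Toy price series: a steady climb followed by a steady decline.
prices = [1.0, 2.0, 3.0, 4.0, 5.0, 4.0, 3.0, 2.0, 1.0]
trends = segment(prices, 0, len(prices) - 1, max_err=0.5)
labels = [label(slope) for _, _, slope in trends]
```

In a pipeline of the kind the abstract describes, labels like these would then be paired with the news stories that precede each segment in order to train one language model per label.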

