‹‹ Go Back

Seunghye J. Wilson

George Mason University



‹‹ Go Back

James Gentle

George Mason University



‹‹ Go Back

Please enter your access key

The asset you are trying to access is locked for premium users. Please enter your access key to unlock.


Email This Presentation:

From:

To:

Subject:

Body:

←Back IconGems-Print

249 – Regularization and Prediction in Time Series and Longitudinal Models

Data Representation and Pattern Recognition in Financial Time Series

Sponsor: Section on Statistical Learning and Data Science
Keywords: Data reduction, Financial time series, Piecewise linear approximation, Online method, Classification, Similarity measure

Seunghye J. Wilson

George Mason University

James Gentle

George Mason University

One major interest of financial time series analysis is to identify changepoints of trends and recognize patterns that can be used for classification and clustering of time series. Because of the large amounts of data, nonlinear relationship of the data elements and the presence of random noise, some method of data reduction is necessary. The data reduction, however, must preserve the important characteristics of the original data. Many representation methods in the time domain or frequency domain have been suggested to accomplish efficient extraction of information. These include, for example, piecewise linear approximation, symbolic representation, and discrete wavelet transformation (DWT). However, most of the existing methods do not take into consideration time information of trends and/or depend on user-defined parameters, for example the number of segments for piecewise approximation. We introduce piecewise banding approximation (PBA) for data representation based on linear regression using small sets of current data points. The proposed method is flexible and interpretable in the sense that it allows the acquisition and addition of new data points (online method) to detect meaningful trends and changepoints. Changepoints are confirmed once new data points stray far enough outside of the band, creating a reduced dataset of changepoints to utilize. Next, we define patterns from the reduced data which preserve trends and the length of a trends duration. Finally, a distance metric is suggested as a similarity measure to classify the present application example of classification.

"eventScribe", the eventScribe logo, "CadmiumCD", and the CadmiumCD logo are trademarks of CadmiumCD LLC, and may not be copied, imitated or used, in whole or in part, without prior written permission from CadmiumCD. The appearance of these proceedings, customized graphics that are unique to these proceedings, and customized scripts are the service mark, trademark and/or trade dress of CadmiumCD and may not be copied, imitated or used, in whole or in part, without prior written notification. All other trademarks, slogans, company names or logos are the property of their respective owners. Reference to any products, services, processes or other information, by trade name, trademark, manufacturer, owner, or otherwise does not constitute or imply endorsement, sponsorship, or recommendation thereof by CadmiumCD.

As a user you may provide CadmiumCD with feedback. Any ideas or suggestions you provide through any feedback mechanisms on these proceedings may be used by CadmiumCD, at our sole discretion, including future modifications to the eventScribe product. You hereby grant to CadmiumCD and our assigns a perpetual, worldwide, fully transferable, sublicensable, irrevocable, royalty free license to use, reproduce, modify, create derivative works from, distribute, and display the feedback in any manner and for any purpose.

© 2016 CadmiumCD