
Discovering Significant Sequential Patterns in Data Stream by an Efficient Two-Phase Procedure

Author

Listed:
  • Huijun Tang
  • Le Wang
  • Yangguang Liu
  • Jiangbo Qian
  • Ghous Ali

Abstract

An essential topic in mining sequential patterns from a data stream is optimizing time and space costs. More importantly, however, attention should be paid to the significance of the mining results, because a large portion of them satisfy the user-defined constraints purely by accident and may have no statistical significance. In this paper, we propose FSSPDS, an efficient two-phase algorithm for discovering significant sequential patterns (SSPs) in a data stream with typical sliding windows, a setting that has not been considered in existing work. In the first phase, to generate high-quality SSP candidates, FSSPDS takes testable support and pattern-length constraints into account, and insignificant patterns are removed promptly by a pattern-growth method. In the second phase, appropriate permutation testing is used to test the significance of the SSP candidates. Exact permutation p-values are obtained through a novel combination approach based on the unconditional Barnard's test statistic, which better reflects how the data are generated and collected. Experimental evaluations show that FSSPDS discovers SSPs in the data stream while controlling the family-wise error rate (FWER) and compares favorably with state-of-the-art approaches, especially in time efficiency, where it is approximately an order of magnitude better.
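To make the second phase described in the abstract more concrete, the following is a minimal sketch of permutation testing for one candidate pattern inside a sliding window. It is not the FSSPDS algorithm. It assumes each stream element is a (sequence, binary class label) pair, uses the pooled two-proportion Wald statistic often used with Barnard's test as a stand-in, and approximates the p-value by Monte Carlo label shuffling rather than computing the exact permutation p-values the paper reports. All names (`contains`, `wald_statistic`, `permutation_p_value`, `WINDOW_SIZE`) are hypothetical and chosen for illustration only.

```python
import random
from collections import deque
from math import sqrt

WINDOW_SIZE = 20  # hypothetical sliding-window length (assumption, not from the paper)


def contains(sequence, pattern):
    """True if `pattern` occurs in `sequence` as an ordered, possibly gapped subsequence."""
    it = iter(sequence)
    return all(item in it for item in pattern)


def wald_statistic(hits, labels):
    """Pooled two-proportion Wald statistic for the 2x2 table (pattern present x class)."""
    n1 = sum(labels)
    n0 = len(labels) - n1
    if n1 == 0 or n0 == 0:
        return 0.0
    a1 = sum(1 for h, y in zip(hits, labels) if h and y == 1)
    a0 = sum(1 for h, y in zip(hits, labels) if h and y == 0)
    pooled = (a1 + a0) / (n1 + n0)
    var = pooled * (1 - pooled) * (1 / n1 + 1 / n0)
    if var == 0:
        return 0.0
    return (a1 / n1 - a0 / n0) / sqrt(var)


def permutation_p_value(window, pattern, n_perm=999, seed=0):
    """Monte Carlo permutation p-value: shuffle class labels, keep pattern occurrences fixed."""
    rng = random.Random(seed)
    hits = [contains(seq, pattern) for seq, _ in window]
    labels = [y for _, y in window]
    observed = abs(wald_statistic(hits, labels))
    shuffled = labels[:]
    extreme = 0
    for _ in range(n_perm):
        rng.shuffle(shuffled)
        if abs(wald_statistic(hits, shuffled)) >= observed - 1e-12:
            extreme += 1
    # The +1 terms count the observed labelling itself, so p is never exactly 0.
    return (extreme + 1) / (n_perm + 1)


# Toy stream: class-1 sequences contain a ... b, class-0 sequences do not.
window = deque(maxlen=WINDOW_SIZE)  # old items fall out automatically
for i in range(40):
    label = i % 2
    seq = ["a", "x", "b"] if label == 1 else ["b", "x", "c"]
    window.append((seq, label))

p_assoc = permutation_p_value(window, ["a", "b"])  # pattern tied to class 1
p_null = permutation_p_value(window, ["x"])        # pattern present everywhere
```

In a multiple-testing setting, p-values such as these would still need an FWER correction across all candidates; that step is omitted here.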

Suggested Citation

  • Huijun Tang & Le Wang & Yangguang Liu & Jiangbo Qian & Ghous Ali, 2022. "Discovering Significant Sequential Patterns in Data Stream by an Efficient Two-Phase Procedure," Mathematical Problems in Engineering, Hindawi, vol. 2022, pages 1-23, December.
  • Handle: RePEc:hin:jnlmpe:5379086
    DOI: 10.1155/2022/5379086

    Download full text from publisher

    File URL: http://downloads.hindawi.com/journals/mpe/2022/5379086.pdf
    Download Restriction: no

    File URL: http://downloads.hindawi.com/journals/mpe/2022/5379086.xml
    Download Restriction: no

    File URL: https://libkey.io/10.1155/2022/5379086?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
