IDEAS home Printed from https://ideas.repec.org/a/eee/csdana/v52y2008i6p3269-3281.html
   My bibliography  Save this article

Exploratory data analysis leading towards the most interesting simple association rules

Author

Listed:
  • Iodice D'Enza, Alfonso
  • Palumbo, Francesco
  • Greenacre, Michael

Abstract

Association rules (AR) represent one of the most powerful and largely used approaches to detect the presence of regularities and paths in large databases. Rules express the relations (in terms of co-occurrence) between pairs of items and are defined in two measures: support and confidence. Most techniques for finding AR scan the whole data set, evaluate all possible rules and retain only rules that have support and confidence greater than thresholds, which should be fixed in order to avoid both that only trivial rules are retained and also that interesting rules are not discarded. A multistep approach aims to the identification of potentially interesting items exploiting well-known techniques of multidimensional data analysis. In particular, interesting pairs of items have a well-defined degree of association: an item pair is well defined if its degree of co-occurrence is very high with respect to one or more subsets of the considered set of transactions.

Suggested Citation

  • Iodice D'Enza, Alfonso & Palumbo, Francesco & Greenacre, Michael, 2008. "Exploratory data analysis leading towards the most interesting simple association rules," Computational Statistics & Data Analysis, Elsevier, vol. 52(6), pages 3269-3281, February.
  • Handle: RePEc:eee:csdana:v:52:y:2008:i:6:p:3269-3281
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0167-9473(07)00387-8
    Download Restriction: Full text for ScienceDirect subscribers only.
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Michael Greenacre, 2000. "Correspondence analysis of square asymmetric matrices," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 49(3), pages 297-310.
    2. Plasse, Marie & Niang, Ndeye & Saporta, Gilbert & Villeminot, Alexandre & Leblond, Laurent, 2007. "Combined use of association rules mining and clustering methods to find relevant links between binary rare attributes in a large data set," Computational Statistics & Data Analysis, Elsevier, vol. 52(1), pages 596-613, September.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Cristina Tortora & Mireille Gettler Summa & Marina Marino & Francesco Palumbo, 2016. "Factor probabilistic distance clustering (FPDC): a new clustering method," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 10(4), pages 441-464, December.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Jérome SARACCO & Marie CHAVENT & Vanessa KUENTZ, 2010. "Clustering of categorical variables around latent variables," Cahiers du GREThA (2007-2019) 2010-02, Groupe de Recherche en Economie Théorique et Appliquée (GREThA).
    2. Giuseppe Bove & Akinori Okada, 2018. "Methods for the analysis of asymmetric pairwise relationships," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 12(1), pages 5-31, March.
    3. Abdorrahman Haeri, 2020. "Analyzing safety level and recognizing flaws of commercial centers through data mining approach," Journal of Risk and Reliability, , vol. 234(3), pages 512-526, June.
    4. Wenge Rong & Baolin Peng & Yuanxin Ouyang & Kecheng Liu & Zhang Xiong, 2015. "Collaborative personal profiling for web service ranking and recommendation," Information Systems Frontiers, Springer, vol. 17(6), pages 1265-1282, December.
    5. Kojadinovic, Ivan, 2010. "Hierarchical clustering of continuous variables based on the empirical copula process and permutation linkages," Computational Statistics & Data Analysis, Elsevier, vol. 54(1), pages 90-108, January.
    6. Michael Greenacre, 2003. "Singular value decomposition of matched matrices," Journal of Applied Statistics, Taylor & Francis Journals, vol. 30(10), pages 1101-1113.
    7. Michael Greenacre, 2001. "Analysis of matched matrices," Economics Working Papers 539, Department of Economics and Business, Universitat Pompeu Fabra.
    8. Nora Lado & Anna Torres & Oscar Licandro, 2006. "Changes in the Importance of Bank Attributes Provoked by a Financial Crisis: A Dynamic Analysis of the Uruguayan Case," Economics Working Papers ECO2006/4, European University Institute.
    9. Tiziano CAMPORESI & Gelsomina CATALANO & Massimo FLORIO & Francesco GIFFONI, 2016. "A Professional Premium for LHC Students: Perceptions from within," Departmental Working Papers 2016-07, Department of Economics, Management and Quantitative Methods at Università degli Studi di Milano, revised 08 Jun 2016.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:csdana:v:52:y:2008:i:6:p:3269-3281. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/csda .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.