Estimating mutual information for feature selection in the presence of label noise
Author
Abstract
Suggested Citation
DOI: 10.1016/j.csda.2013.05.001
Download full text from publisher
As the access to this document is restricted, you may want to search for a different version of it.
References listed on IDEAS
- Xu, Ping & Brock, Guy N. & Parrish, Rudolph S., 2009. "Modified linear discriminant analysis approaches for classification of high-dimensional microarray data," Computational Statistics & Data Analysis, Elsevier, vol. 53(5), pages 1674-1687, March.
- Lee, Jae Won & Lee, Jung Bok & Park, Mira & Song, Seuck Heun, 2005. "An extensive comparison of recent classification tools applied to microarray data," Computational Statistics & Data Analysis, Elsevier, vol. 48(4), pages 869-885, April.
- Hall, Peter & Xue, Jing-Hao, 2014. "On selecting interacting features from high-dimensional data," Computational Statistics & Data Analysis, Elsevier, vol. 71(C), pages 694-708.
- Paulino, Carlos Daniel & Silva, Giovani & Alberto Achcar, Jorge, 2005. "Bayesian analysis of correlated misclassified binary data," Computational Statistics & Data Analysis, Elsevier, vol. 49(4), pages 1120-1131, June.
- Li, Chin-Shang & Cheng, Cheng, 2004. "Stable classification with applications to microarray data," Computational Statistics & Data Analysis, Elsevier, vol. 47(3), pages 599-609, October.
- Wang, Xiaoming & Park, Taesung & Carriere, K.C., 2010. "Variable selection via combined penalization for high-dimensional data analysis," Computational Statistics & Data Analysis, Elsevier, vol. 54(10), pages 2230-2243, October.
Citations
Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
Cited by:
- Abpeykar, Shadi & Ghatee, Mehdi & Zare, Hadi, 2019. "Ensemble decision forest of RBF networks via hybrid feature clustering approach for high-dimensional data classification," Computational Statistics & Data Analysis, Elsevier, vol. 131(C), pages 12-36.
Most related items
These are the items that most often cite the same works as this one and are cited by the same works as this one.- Pires, Ana M. & Branco, João A., 2010. "Projection-pursuit approach to robust linear discriminant analysis," Journal of Multivariate Analysis, Elsevier, vol. 101(10), pages 2464-2485, November.
- Parrish, Rudolph S. & Spencer III, Horace J. & Xu, Ping, 2009. "Distribution modeling and simulation of gene expression data," Computational Statistics & Data Analysis, Elsevier, vol. 53(5), pages 1650-1660, March.
- Ivana Krtolica & Dragan Savić & Bojana Bajić & Snežana Radulović, 2022. "Machine Learning for Water Quality Assessment Based on Macrophyte Presence," Sustainability, MDPI, vol. 15(1), pages 1-13, December.
- Alan R Dabney & John D Storey, 2007. "Optimality Driven Nearest Centroid Classification from Genomic Data," PLOS ONE, Public Library of Science, vol. 2(10), pages 1-7, October.
- Jun Lu & Dan Wang & Qinqin Hu, 2022. "Interaction screening via canonical correlation," Computational Statistics, Springer, vol. 37(5), pages 2637-2670, November.
- Dong, Kai & Pang, Herbert & Tong, Tiejun & Genton, Marc G., 2016. "Shrinkage-based diagonal Hotelling’s tests for high-dimensional small sample size data," Journal of Multivariate Analysis, Elsevier, vol. 143(C), pages 127-142.
- Sajad Shojaee & Nastaran Hajizadeh & Hadis Najafimehr & Luca Busani & Mohamad Amin Pourhoseingholi & Ahmad Reza Baghestani & Maryam Nasserinejad & Sara Ashtari & Mohammad Reza Zali, 2018. "Bayesian adjustment for trend of colorectal cancer incidence in misclassified registering across Iranian provinces," PLOS ONE, Public Library of Science, vol. 13(12), pages 1-10, December.
- Wenya Liu & Qi Li, 2017. "An Efficient Elastic Net with Regression Coefficients Method for Variable Selection of Spectrum Data," PLOS ONE, Public Library of Science, vol. 12(2), pages 1-13, February.
- A. Poterie & J.-F. Dupuy & V. Monbet & L. Rouvière, 2019. "Classification tree algorithm for grouped variables," Computational Statistics, Springer, vol. 34(4), pages 1613-1648, December.
- Brendan P. W. Ames & Mingyi Hong, 2016. "Alternating direction method of multipliers for penalized zero-variance discriminant analysis," Computational Optimization and Applications, Springer, vol. 64(3), pages 725-754, July.
- Xuewei Cheng & Gang Li & Hong Wang, 2024. "The concordance filter: an adaptive model-free feature screening procedure," Computational Statistics, Springer, vol. 39(5), pages 2413-2436, July.
- Herbert Pang & Tiejun Tong & Hongyu Zhao, 2009. "Shrinkage-based Diagonal Discriminant Analysis and Its Applications in High-Dimensional Data," Biometrics, The International Biometric Society, vol. 65(4), pages 1021-1029, December.
- Shieh Albert D & Hung Yeung Sam, 2009. "Detecting Outlier Samples in Microarray Data," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 8(1), pages 1-26, February.
- Valkenborg Dirk & Van Sanden Suzy & Lin Dan & Kasim Adetayo & Zhu Qi & Haldermans Philippe & Jansen Ivy & Shkedy Ziv & Burzykowski Tomasz, 2008. "A Cross-Validation Study to Select a Classification Procedure for Clinical Diagnosis Based on Proteomic Mass Spectrometry," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 7(2), pages 1-22, March.
- Lambert-Lacroix, Sophie & Peyre, Julie, 2006. "Local likelihood regression in generalized linear single-index models with applications to microarray data," Computational Statistics & Data Analysis, Elsevier, vol. 51(3), pages 2091-2113, December.
- Kubokawa, Tatsuya & Hyodo, Masashi & Srivastava, Muni S., 2013. "Asymptotic expansion and estimation of EPMC for linear classification rules in high dimension," Journal of Multivariate Analysis, Elsevier, vol. 115(C), pages 496-515.
- Irina Gaynanova & James G. Booth & Martin T. Wells, 2016. "Simultaneous Sparse Estimation of Canonical Vectors in the ≫ Setting," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 111(514), pages 696-706, April.
- Bang, Sungwan & Jhun, Myoungshic, 2012. "Simultaneous estimation and factor selection in quantile regression via adaptive sup-norm regularization," Computational Statistics & Data Analysis, Elsevier, vol. 56(4), pages 813-826.
- Jong Victor L. & Novianti Putri W. & Roes Kit C.B. & Eijkemans Marinus J.C., 2014. "Exploring homogeneity of correlation structures of gene expression datasets within and between etiological disease categories," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 13(6), pages 717-732, December.
- Yang, Tae Young, 2009. "Efficient multi-class cancer diagnosis algorithm, using a global similarity pattern," Computational Statistics & Data Analysis, Elsevier, vol. 53(3), pages 756-765, January.
More about this item
Keywords
Label noise; Mutual information; Entropy estimation; Feature selection;All these keywords.
Statistics
Access and download statisticsCorrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:csdana:v:71:y:2014:i:c:p:832-848. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/csda .
Please note that corrections may take a couple of weeks to filter through the various RePEc services.