The Synergy Between PAV and AdaBoost
- PMID: 29456289
- PMCID: PMC5815843
- DOI: 10.1007/s10994-005-1123-6
Abstract
Schapire and Singer's improved version of AdaBoost for handling weak hypotheses with confidence-rated predictions represents an important advance in the theory and practice of boosting. Its success results from a more efficient use of the information in weak hypotheses during updating. Instead of simple binary voting, a weak hypothesis is allowed to vote for or against a classification with a variable strength or confidence. The Pool Adjacent Violators (PAV) algorithm is a method for converting a score into a probability. We show how PAV may be applied to a weak hypothesis to yield a new weak hypothesis which is, in a sense, an ideal confidence-rated prediction, and that this leads to an optimal updating for AdaBoost. The result is a new algorithm, which we term PAV-AdaBoost. We give several examples illustrating problems for which this new algorithm provides advantages in performance.
Keywords: boosting; convergence; document classification; isotonic regression; k nearest neighbors.
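
The abstract describes PAV as a method for converting a score into a probability. A minimal sketch of that idea is shown here, assuming binary labels in {0, 1} and unweighted examples; it is not taken from the paper. The function names `pav_calibrate` and `log_odds_confidence`, the clipping parameter `eps`, and the half log-odds mapping (one common choice in the confidence-rated boosting literature) are illustrative assumptions and may differ from what PAV-AdaBoost actually uses.

```python
import math
from typing import List, Sequence


def pav_calibrate(scores: Sequence[float], labels: Sequence[int]) -> List[float]:
    """Fit a non-decreasing probability estimate to (score, label) pairs.

    Hypothetical helper: returns one estimate per example, in input order.
    """
    order = sorted(range(len(scores)), key=lambda i: scores[i])

    # Pool examples with identical scores first: they must share one estimate.
    groups = []  # each entry: [label_sum, count]
    prev_score = None
    for i in order:
        if groups and scores[i] == prev_score:
            groups[-1][0] += labels[i]
            groups[-1][1] += 1
        else:
            groups.append([float(labels[i]), 1])
        prev_score = scores[i]

    # Pool adjacent violators: merge blocks while the previous block's mean
    # exceeds the newest block's mean (compared without division).
    blocks = []
    for label_sum, count in groups:
        blocks.append([label_sum, count])
        while (len(blocks) > 1
               and blocks[-2][0] * blocks[-1][1] > blocks[-1][0] * blocks[-2][1]):
            s, n = blocks.pop()
            blocks[-1][0] += s
            blocks[-1][1] += n

    # Expand block means back to one value per example, in sorted order.
    fitted = []
    for label_sum, count in blocks:
        fitted.extend([label_sum / count] * count)

    # Restore the original input order.
    result = [0.0] * len(scores)
    for pos, i in enumerate(order):
        result[i] = fitted[pos]
    return result


def log_odds_confidence(p: float, eps: float = 1e-6) -> float:
    """Map a probability to a signed confidence via half log-odds.

    Assumption: eps clips p away from 0 and 1 so the logarithm stays finite.
    """
    p = min(max(p, eps), 1.0 - eps)
    return 0.5 * math.log(p / (1.0 - p))


probs = pav_calibrate([0.1, 0.2, 0.3, 0.4, 0.5], [0, 1, 0, 1, 1])
print(probs)  # [0.0, 0.5, 0.5, 1.0, 1.0]
```

In this example the second and third examples violate monotonicity (a positive label at a lower score than a negative one), so they are pooled to a shared estimate of 0.5. The resulting estimates never decrease as the score increases.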
Figures
![Figure 1](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/5815843/bin/nihms939020f1.gif)
![Figure 2](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/5815843/bin/nihms939020f2.gif)
![Figure 3](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/5815843/bin/nihms939020f3.gif)
![Figure 4](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/5815843/bin/nihms939020f4.gif)