The Synergy Between PAV and AdaBoost

W John Wilbur et al. Mach Learn. 2005 Nov;61(1-3):71-103.
doi: 10.1007/s10994-005-1123-6. Epub 2005 Jun 8.

Abstract

Schapire and Singer's improved version of AdaBoost for handling weak hypotheses with confidence-rated predictions represents an important advance in the theory and practice of boosting. Its success results from a more efficient use of the information in weak hypotheses during updating. Instead of simple binary voting, a weak hypothesis is allowed to vote for or against a classification with a variable strength or confidence. The Pool Adjacent Violators (PAV) algorithm is a method for converting a score into a probability. We show how PAV may be applied to a weak hypothesis to yield a new weak hypothesis which is, in a sense, an ideal confidence-rated prediction, and that this leads to an optimal updating for AdaBoost. The result is a new algorithm which we term PAV-AdaBoost. We give several examples illustrating problems for which this new algorithm provides advantages in performance.
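The PAV step the abstract relies on is ordinary isotonic regression by pooling. Below is a minimal NumPy sketch, assuming binary labels in {+1, -1} and a single real-valued score per example; the function names pav_fit and pav_probability are illustrative, not from the paper, and the paper's weighted version (using the boosting distribution D_t) would pool weighted counts rather than raw ones.

    import numpy as np

    def pav_fit(scores, labels):
        # Sort examples by weak-hypothesis score.
        order = np.argsort(scores)
        s = np.asarray(scores, dtype=float)[order]
        y = (np.asarray(labels)[order] == 1).astype(float)  # targets in {0, 1}

        # Each block stores [sum of targets, count, last score in block].
        # Merge adjacent blocks while an earlier block's mean exceeds the mean
        # of the block after it (the "adjacent violators" of monotonicity).
        blocks = []
        for target, score in zip(y, s):
            blocks.append([target, 1.0, score])
            while len(blocks) > 1 and \
                    blocks[-2][0] / blocks[-2][1] > blocks[-1][0] / blocks[-1][1]:
                t, n, last = blocks.pop()
                blocks[-1][0] += t
                blocks[-1][1] += n
                blocks[-1][2] = last

        cut_scores = np.array([b[2] for b in blocks])    # right edge of each block
        probs = np.array([b[0] / b[1] for b in blocks])  # fitted P(y = +1) per block
        return cut_scores, probs

    def pav_probability(score, cut_scores, probs):
        # Step-function lookup: a new score gets the probability of the block
        # whose right edge is the first one at or above it.
        idx = np.searchsorted(cut_scores, score, side="left")
        return probs[min(idx, len(probs) - 1)]

Under Schapire and Singer's analysis, the best confidence-rated value to output on a block with estimated probability p is (1/2) ln(p / (1 - p)); composing the PAV estimate with that transform is, roughly, the construction the abstract refers to, though blocks where p is exactly 0 or 1 need some smoothing in practice to keep the confidences finite.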

Keywords: boosting; convergence; document classification; isotonic regression; k nearest neighbors.


Figures

Figure 1. PAV-AdaBoost with Cmass as the weak learner. Precision is 11-point average precision.

Figure 2. Boosting Nscore with PAV-AdaBoost compared with several different attempts to boost Nscore with linear AdaBoost.

Figure 3. Probability of label class +1 as a function of score, as estimated by PAV and by sigmoid curves. The curve marked Sigmoid is the estimate implicitly used by linear AdaBoost and is clearly not the optimal sigmoid minimizing Z_t. SigmoidOpt is the result of optimization over both ω and b in definition (52).

Figure 4. Boosting naïve Bayes (binary form) with three different algorithms. PAV-AdaBoost gives the lowest error on the training space (lower panel), but optimal linear AdaBoost gives the best generalizability (upper panel).
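A note on the notation in the Figure 3 caption: in Schapire and Singer's confidence-rated framework, Z_t denotes the normalization factor that each boosting round seeks to minimize,

    Z_t = \sum_i D_t(i) \exp(-y_i h_t(x_i)),

where D_t is the current distribution over training examples and h_t is the real-valued weak hypothesis. Definition (52) itself is not reproduced on this page; a standard two-parameter sigmoid of the kind the caption suggests would be P(y = +1 | s) = 1 / (1 + exp(-(ω s + b))), but that exact form is an assumption here, not taken from the paper.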
