Understanding PubMed user search behavior through log analysis
- PMID: 20157491
- PMCID: PMC2797455
- DOI: 10.1093/database/bap018
Understanding PubMed user search behavior through log analysis
Abstract
This article reports on a detailed investigation of PubMed users' needs and behavior as a step toward improving biomedical information retrieval. PubMed is providing free service to researchers with access to more than 19 million citations for biomedical articles from MEDLINE and life science journals. It is accessed by millions of users each day. Efficient search tools are crucial for biomedical researchers to keep abreast of the biomedical literature relating to their own research. This study provides insight into PubMed users' needs and their behavior. This investigation was conducted through the analysis of one month of log data, consisting of more than 23 million user sessions and more than 58 million user queries. Multiple aspects of users' interactions with PubMed are characterized in detail with evidence from these logs. Despite having many features in common with general Web searches, biomedical information searches have unique characteristics that are made evident in this study. PubMed users are more persistent in seeking information and they reformulate queries often. The three most frequent types of search are search by author name, search by gene/protein, and search by disease. Use of abbreviation in queries is very frequent. Factors such as result set size influence users' decisions. Analysis of characteristics such as these plays a critical role in identifying users' information needs and their search habits. In turn, such an analysis also provides useful insight for improving biomedical information retrieval.Database URL:http://www.ncbi.nlm.nih.gov/PubMed.
Figures
![Figure 1.](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/2797455/bin/bap018f1.gif)
![Figure 2.](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/2797455/bin/bap018f2.gif)
![Figure 3.](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/2797455/bin/bap018f3.gif)
![Figure 4.](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/2797455/bin/bap018f4.gif)
![Figure 5.](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/2797455/bin/bap018f5.gif)
![Figure 6.](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/2797455/bin/bap018f6.gif)
![Figure 7.](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/2797455/bin/bap018f7.gif)
![Figure 8.](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/2797455/bin/bap018f8.gif)
![Figure 9.](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/2797455/bin/bap018f9.gif)
![Figure 10.](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/2797455/bin/bap018f10.gif)
![Figure 11.](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/2797455/bin/bap018f11.gif)
![Figure 12.](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/2797455/bin/bap018f12.gif)
![Figure 13.](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/2797455/bin/bap018f13.gif)
![Figure 14.](https://www.ncbi.nlm.nih.gov/pmc/articles/instance/2797455/bin/bap018f14.gif)
Similar articles
-
Analysis of PubMed User Sessions Using a Full-Day PubMed Query Log: A Comparison of Experienced and Nonexperienced PubMed Users.JMIR Med Inform. 2015 Jul 2;3(3):e25. doi: 10.2196/medinform.3740. JMIR Med Inform. 2015. PMID: 26139516 Free PMC article.
-
G-Bean: an ontology-graph based web tool for biomedical literature retrieval.BMC Bioinformatics. 2014;15 Suppl 12(Suppl 12):S1. doi: 10.1186/1471-2105-15-S12-S1. Epub 2014 Nov 6. BMC Bioinformatics. 2014. PMID: 25474588 Free PMC article.
-
Developing topic-specific search filters for PubMed with click-through data.Methods Inf Med. 2013;52(5):395-402. doi: 10.3414/ME12-01-0054. Epub 2013 May 13. Methods Inf Med. 2013. PMID: 23666447 Free PMC article.
-
PubMed and beyond: a survey of web tools for searching biomedical literature.Database (Oxford). 2011 Jan 18;2011:baq036. doi: 10.1093/database/baq036. Print 2011. Database (Oxford). 2011. PMID: 21245076 Free PMC article. Review.
-
Searching the MEDLINE literature database through PubMed: a short guide.Onkologie. 2005 Oct;28(10):517-22. doi: 10.1159/000087186. Epub 2005 Aug 19. Onkologie. 2005. PMID: 16186693 Review.
Cited by
-
On the additive artificial intelligence-based discovery of nanoparticle neurodegenerative disease drug delivery systems.Beilstein J Nanotechnol. 2024 May 15;15:535-555. doi: 10.3762/bjnano.15.47. eCollection 2024. Beilstein J Nanotechnol. 2024. PMID: 38774585 Free PMC article.
-
PubMed features to save your time.J Hosp Librariansh. 2024;24(1):1-9. doi: 10.1080/15323269.2023.2291284. Epub 2023 Dec 7. J Hosp Librariansh. 2024. PMID: 38645880
-
GNorm2: an improved gene name recognition and normalization system.Bioinformatics. 2023 Oct 3;39(10):btad599. doi: 10.1093/bioinformatics/btad599. Bioinformatics. 2023. PMID: 37878810 Free PMC article.
-
Spin in Abstracts of Systematic Reviews and Meta-analyses of Melanoma Therapies: Cross-sectional Analysis.JMIR Dermatol. 2022 Feb 24;5(1):e33996. doi: 10.2196/33996. JMIR Dermatol. 2022. PMID: 37632865 Free PMC article.
-
Chemical identification and indexing in full-text articles: an overview of the NLM-Chem track at BioCreative VII.Database (Oxford). 2023 Mar 7;2023:baad005. doi: 10.1093/database/baad005. Database (Oxford). 2023. PMID: 36882099 Free PMC article.
References
-
- Tenopir C. Online databases: are e-journals good for science? Library J. 2008;133:24.
-
- Taylor R. Question negotiation and information seeking in libraries. College Res. Libraries. 1968;29:178–194.
-
- Murray GC, Teevan J. Query log analysis: social and technological challenges (WWW 2007 Workshop Report) ACM SIGIR Forum. 2007;41:112–120.
-
- Spink A, Jansen BJ, editors. Web Search: Public Searching of the Web. Kluwer, Dordrecht: 2004.
LinkOut - more resources
Full Text Sources
Other Literature Sources