Skip to content
Surf Wiki
Save to docs
technology/databases

From Surf Wiki (app.surf) — the open knowledge base

PubMed

Online biomedical database

PubMed

Online biomedical database

FieldValue
titlePubMed
logo[[File:PubMed logo blue.svg250px]]
centerUnited States National Library of Medicine (NLM)
released
url
screenshot[[File:PubMed Home Page 2025-11-29.png250px]]

From 1971 to 1997, online access to the MEDLINE database was provided via computer, using phone lines primarily through institutional facilities, such as university libraries. PubMed, first released in January 1996, ushered in the era of private, free, home- and office-based MEDLINE searching. It was released alongside "Internet Grateful Med" (web-version of Grateful Med). In 2001 Grateful Med was deleted and entirely replaced by PubMed. The PubMed system was offered free to the public starting in June 1997.

Content

In addition to MEDLINE, PubMed provides access to:

  • older references from the print version of Index Medicus, back to 1951 and earlier
  • references to some journals before they were indexed in Index Medicus and MEDLINE, for instance Science, BMJ, and Annals of Surgery
  • very recent entries to records for an article before it is indexed with Medical Subject Headings (MeSH) and added to MEDLINE
  • a collection of books available full-text and other subsets of NLM records
  • PMC citations
  • NCBI Bookshelf

Many PubMed records contain links to full text articles, some of which are freely available, often in PubMed Central and local mirrors, such as Europe PubMed Central.

Information about the journals indexed in MEDLINE, and available through PubMed, is found in the NLM Catalog.

, PubMed has more than 35 million citations and abstracts dating back to 1966, selectively to the year 1865, and very selectively to 1809. , 24.6 million of PubMed's records are listed with their abstracts, and 26.8 million records have links to full-text versions (of which 10.9 million articles are available, full-text for free). Over the last 10 years (ending 31 December 2019), an average of nearly one million new records were added each year.

In 2016, NLM changed the indexing system so that publishers are able to directly correct typos and errors in PubMed indexed articles.

PubMed has been reported to include some articles published in predatory journals. MEDLINE and PubMed policies for the selection of journals for database inclusion are slightly different. Weaknesses in the criteria and procedures for indexing journals in PubMed Central may allow publications from predatory journals to leak into PubMed.

On March 4, 2025, there was a temporary global outage of the PubMed database. The outage lasted about a day and ‘stoked fears about the database’s future’ because researchers worldwide depend on PubMed to access peer-reviewed scientific literature. The disruption was apparently not deliberate, and services were restored, but the event highlighted concerns about the reliability and contingency planning for essential research infrastructure

Characteristics

Website design

A new PubMed interface was launched in October 2009 and encouraged the use of such quick, Google-like search formulations; they have also been described as 'telegram' searches. By default the results are sorted by Most Recent, but this can be changed to Best Match, Publication Date, First Author, Last Author, Journal, or Title.

The PubMed website design and domain was updated in January 2020 and became default on 15 May 2020, with the updated and new features. There was a critical reaction from many researchers who frequently use the site.

PubMed for handhelds/mobiles

PubMed/MEDLINE can be accessed via handheld devices, using for instance the "PICO" option (for focused clinical questions) created by the NLM. A "PubMed Mobile" option, providing access to a mobile friendly, simplified PubMed version, is also available.

Standard search

Simple searches on PubMed can be carried out by entering key aspects of a subject into PubMed's search window.

PubMed translates this initial search formulation and automatically adds field names, relevant MeSH (Medical Subject Headings) terms, synonyms, Boolean operators, and 'nests' the resulting terms appropriately, enhancing the search formulation significantly, in particular by routinely combining (using the OR operator) textwords and MeSH terms.

Comprehensive search

For optimal searches in PubMed, it is necessary to understand its core component, MEDLINE, and especially of the MeSH (Medical Subject Headings) controlled vocabulary used to index MEDLINE articles. They may also require complex search strategies, use of field names (tags), proper use of limits and other features; reference librarians and search specialists offer search services.

The search into PubMed's search window is only recommended for the search of unequivocal topics or new interventions that do not yet have a MeSH heading created, as well as for the search for commercial brands of medicines and proper nouns. It is also useful when there is no suitable heading or the descriptor represents a partial aspect. The search using the thesaurus MeSH is more accurate and will give fewer irrelevant results. In addition, it saves the disadvantage of the free text search in which the spelling, singular/plural or abbreviated differences have to be taken into consideration. On the other side, articles more recently incorporated into the database to which descriptors have not yet been assigned will not be found. Therefore, to guarantee an exhaustive search, a combination of controlled language headings and free text terms must be used.

Journal article parameters

When a journal article is indexed, numerous article parameters are extracted and stored as structured information. Such parameters are: Article Type (MeSH terms, e.g., "Clinical Trial"), Secondary identifiers, (MeSH terms), Language, Country of the Journal or publication history (e-publication date, print journal publication date).

Publication Type: Clinical queries/systematic reviews

Publication type parameter allows searching by the type of publication, including reports of various kinds of clinical research.

Secondary ID

Since July 2005, the MEDLINE article indexing process extracts identifiers from the article abstract and puts those in a field called Secondary Identifier (SI). The secondary identifier field is to store accession numbers to various databases of molecular sequence data, gene expression or chemical compounds and clinical trial IDs. For clinical trials, PubMed extracts trial IDs for the two largest trial registries: ClinicalTrials.gov (NCT identifier) and the International Standard Randomized Controlled Trial Number Register (IRCTN identifier).

Alternative interfaces

MEDLINE is one of the databases which are accessible via PubMed. Several companies provide access to MEDLINE through their platforms.

The National Library of Medicine leases the MEDLINE information to a number of private vendors such as Embase, Ovid, Dialog, EBSCO, Knowledge Finder and many other commercial, non-commercial, and academic providers. , more than 500 licenses had been issued, more than 200 of them to providers outside the United States. As licenses to use MEDLINE data are available for free, the NLM in effect provides a free testing ground for a wide range of alternative interfaces and 3rd party additions to PubMed, one of a very few large, professionally curated databases which offers this option.

Lu identifies a sample of 28 current and free Web-based PubMed versions, requiring no installation or registration, which are grouped into four categories:

  1. Ranking search results, for instance: eTBLAST; MedlineRanker; MiSearch;
  2. Clustering results by topics, authors, journals etc., for instance: Anne O'Tate; ClusterMed;
  3. Enhancing semantics and visualization, for instance: EBIMed; MedEvi.
  4. Improved search interface and retrieval experience, for instance, askMEDLINE BabelMeSH; and PubCrawler.

As most of these and other alternatives rely essentially on PubMed/MEDLINE data leased under license from the NLM/PubMed, the term "PubMed derivatives" has been suggested. Without the need to store about 90 GB of original PubMed Datasets, anybody can write PubMed applications using the eutils-application program interface as described in "The E-utilities In-Depth: Parameters, Syntax and More", by Eric Sayers, PhD. Various citation format generators, taking PMID numbers as input, are examples of web applications making use of the eutils-application program interface. Sample web pages include Citation Generator – Mick Schroeder, Pubmed Citation Generator – Ultrasound of the Week, and Cite this for me.

In 2025, the ZBMed, the German National Library of Medicine, announced plans to develop an internationally supported open source version of PubMed.

Data mining of PubMed

Code can be automated to systematically query with different keywords such as disease, year, organs, etc.

For bulk processing, the full PubMed database is available as XML which can be downloaded from an FTP server. The annual baseline is released in December, followed by daily update files.

In addition to its traditional role as a biomedical database, PubMed has become common resource for training biomedical language models.

The data accessible by PubMed can be mirrored locally using an unofficial tool such as MEDOC.

Millions of PubMed records augment various open data datasets about open access, like Unpaywall. Data analysis tools like Unpaywall Journals are used by libraries to assist with big deal cancellations: libraries can avoid subscriptions for materials already served by instant open access via open archives like PubMed Central.

References

References

  1. "PubMed".
  2. (2000). "Internet access to the National Library of Medicine". Effective Clinical Practice.
  3. (2006-10-05). "PubMed Celebrates its 10th Anniversary". [[United States National Library of Medicine]].
  4. (2002-06-07). "PubMed: MEDLINE Retrieval on the World Wide Web". United States National Library of Medicine.
  5. (January 2001). "PubMed Central: The GenBank of the published literature". Proceedings of the National Academy of Sciences of the United States of America.
  6. (January 2011). "UKPMC: a full text article resource for the life sciences". Nucleic Acids Research.
  7. (2011). "NLM Catalogue: Journals referenced in the NCBI Databases". NCBI.
  8. "PubMed".
  9. (July–August 2016). "MEDLINE/PubMed Production Improvements Underway". NLM Technical Bulletin.
  10. (September 2018). "How predatory journals leak into PubMed". CMAJ.
  11. Mallapaty, Smriti. (2025-03-04). "‘Omg, did PubMed go dark?’ Blackout stokes fears about database’s future". Nature.
  12. (September 2000). "Pragmatic approach is effective in evidence based health care". BMJ.
  13. (January 2014). "How to improve your PubMed/MEDLINE searches: 2. display settings, complex search queries and topic searching". Journal of Telemedicine and Telecare.
  14. Trawick, Bart. (21 January 2020). "A New and Improved PubMed®". NLM Musings From the Mezzanine.
  15. Price, Michael. (22 May 2020). "They redesigned PubMed, a beloved website. It hasn't gone over well". Science.
  16. (2004). "PubMed via handhelds (PICO)". United States National Library of Medicine.
  17. (2011). "PubMed Mobile Beta". United States National Library of Medicine.
  18. (July 1993). "Searching the literature. Be systematic in your searching". BMJ.
  19. (Spring 1999). "The art and science of searching MEDLINE to answer clinical questions. Finding the right number of articles". International Journal of Technology Assessment in Health Care.
  20. (2018). "Cómo elaborar una estrategia de búsqueda bibliográfica". Enfermería Intensiva.
  21. (2010). "Clinical Queries Filter Terms explained". NCBI.
  22. (June 2013). "Evaluating adherence to the International Committee of Medical Journal Editors' policy of mandatory, timely clinical trial registration". Journal of the American Medical Informatics Association.
  23. "Computation of Related Articles explained". NCBI.
  24. (February 2006). "Searching the literature using medical subject headings versus text word with PubMed". The Laryngoscope.
  25. (March 2014). "How to improve your PubMed/MEDLINE searches: 3. advanced searching, MeSH and My NCBI". Journal of Telemedicine and Telecare.
  26. (13 December 2010). "My NCBI explained". NCBI.
  27. (2000). "PubMed Cubby". United States National Library of Medicine.
  28. (2010). "LinkOut Overview". NCBI.
  29. (2011). "LinkOut Participants 2011". NCBI.
  30. "An Updated PubMed is on its Way".
  31. PubMed Commons Team. (17 December 2015). "Commenting on PubMed: A Successful Pilot".
  32. (2018-02-01). "PubMed Commons to be Discontinued". NCBI Insights.
  33. (2018-02-02). "PubMed shuts down its comments feature, PubMed Commons".
  34. (2005). "askMedline". NCBI.
  35. "Search Field Descriptions and Tags". National Center for Biotechnology Information.
  36. Keener, Molly. "PMID vs. PMCID: What's the difference?". University of Chicago.
  37. (2011). "Leasing journal citations from PubMed/Medline". NLM.
  38. (2011). "PubMed and beyond: a survey of web tools for searching biomedical literature". Database.
  39. (July 2009). "MedlineRanker: flexible ranking of biomedical literature". Nucleic Acids Research.
  40. (April 2009). "MiSearch adaptive pubMed search tool". Bioinformatics.
  41. (February 2008). "Anne O'Tate: A tool to support user-driven summarization, drill-down and browsing of PubMed search results". Journal of Biomedical Discovery and Collaboration.
  42. (2011). "ClusterMed". Vivisimo Clustering Engine.
  43. (January 2007). "EBIMed--text crunching to gather facts for proteins from Medline". Bioinformatics.
  44. (June 2008). "MedEvi: retrieving textual evidence of relations between biomedical concepts from Medline". Bioinformatics.
  45. (2006). "askMEDLINE: a report on a year-long experience". AMIA ... Annual Symposium Proceedings. AMIA Symposium.
  46. (2005). "MeSH Speller + askMEDLINE: auto-completes MeSH terms then searches MEDLINE/PubMed via free-text, natural language queries". AMIA ... Annual Symposium Proceedings. AMIA Symposium.
  47. (2007). "PICO Linguist and BabelMeSH: development and partial evaluation of evidence-based multilanguage search tools for MEDLINE/PubMed". Studies in Health Technology and Informatics.
  48. (July 2004). "PubCrawler: keeping up comfortably with PubMed and GenBank". Nucleic Acids Research.
  49. Eric Sayers, PhD. (24 October 2018). "The E-utilities In-Depth: Parameters, Syntax and More". NCBI.
  50. (28 May 2025). "Germany Plans Global Alternative to PubMed". MedPage Today.
  51. "OLSPub: Life sciences research needs a solid infrastructure".
  52. (2023-11-04). "Notes on the data quality of bibliographic records from the MEDLINE database". Database.
  53. (3 August 2023). "Large language models encode clinical knowledge". Nature.
  54. {{GitHub. MrMimic/MEDOC
  55. Denise Wolfe. (2020-04-07). "SUNY Negotiates New, Modified Agreement with Elsevier – Libraries News Center University at Buffalo Libraries". [[University at Buffalo]].
Info: Wikipedia Source

This article was imported from Wikipedia and is available under the Creative Commons Attribution-ShareAlike 4.0 License. Content has been adapted to SurfDoc format. Original contributors can be found on the article history page.

Want to explore this topic further?

Ask Mako anything about PubMed — get instant answers, deeper analysis, and related topics.

Research with Mako

Free with your Surf account

Content sourced from Wikipedia, available under CC BY-SA 4.0.

This content may have been generated or modified by AI. CloudSurf Software LLC is not responsible for the accuracy, completeness, or reliability of AI-generated content. Always verify important information from primary sources.

Report