Tag: RefSeq

New! RefSeq Release 224

New! RefSeq Release 224

Check out RefSeq release 224, now available online and from the FTP site. You can access RefSeq data through NCBI Datasets.

What’s included in this release?

As of May 6, 2024, this full release incorporates genomic, transcript, and protein data containing:

  • 435,879,646 records
  • 324,246,652 proteins
  • 62,348,147 RNAs
  • Sequences from 150,742 organisms

The release is provided in several directories as a complete dataset and also as divided by logical groupings. Continue reading “New! RefSeq Release 224”

Now Available! Updated Bacterial and Archaeal Reference Genomes Collection

Now Available! Updated Bacterial and Archaeal Reference Genomes Collection

Download the updated bacterial and archaeal reference genome collection! We built this collection of 19,328 genomes by selecting the “best” genome assembly for each species among the 350,000+ prokaryotic genomes in RefSeq (except for E. coli for which two assemblies were selected as reference).

What’s New?
  • 413 species are represented in this collection for the first time
  • 198 species are represented by a better assembly
  • 27 species were removed because of changes in NCBI Taxonomy or uncertainty in their species assignment 

Continue reading “Now Available! Updated Bacterial and Archaeal Reference Genomes Collection”

NCBI Hidden Markov Models (HMM) Release 15.0 Now Available!

NCBI Hidden Markov Models (HMM) Release 15.0 Now Available!

Download release 15.0 of the NCBI protein profile Hidden Markov models (HMMs) used by the Prokaryotic Genome Annotation Pipeline (PGAP)! Search this collection against your favorite prokaryotic proteins to identify their function using the HMMER sequence analysis package.

What’s New?

Release 15.0 contains:

  • 16,667 HMMs maintained by NCBI
  • 279 new HMMs since release 14.0
  • Several hundreds HMMs with better names, EC numbers, Gene Ontology (GO) terms, gene symbols, or publications. 

Continue reading “NCBI Hidden Markov Models (HMM) Release 15.0 Now Available!”

New RefSeq Annotations Now Available!

New RefSeq Annotations Now Available!

In February and March, the NCBI Eukaryotic Genome Annotation Pipeline released forty-six new annotations in RefSeq!

New Annotations
  • Aedes albopictus (Asian tiger mosquito)
  • Anolis carolinensis (green anole)
  • Armigeres subalbatus (mosquito)
  • Bacillus rossius redtenbacheri (walking stick)
  • Bolinopsis microptera (comb jelly)
  • Bombyx mori (domestic silkworm)
  • Bubalus kerabau (carabao)
  • Candoia aspera (snake)
  • Cavia porcellus (domestic guinea pig) 
  • Continue reading “New RefSeq Annotations Now Available!”
Now Available: RefSeq Release 223

Now Available: RefSeq Release 223

Check out RefSeq release 223, now available online and from the FTP site. You can access RefSeq data through NCBI Datasets.

What’s included in this release?

As of March 4, 2024, this full release incorporates genomic, transcript, and protein data containing:

  • 425,594,654 records
  • 316,329,937 proteins
  • 60,886,133 RNAs
  • sequences from 147,591 organisms 

Continue reading “Now Available: RefSeq Release 223”

Join NCBI at TAGC 2024

Join NCBI at TAGC 2024

March 6-10 in Washington, D.C. 

We look forward to seeing you in person at The Allied Genetics Conference (TAGC), March 6-10, 2024, in the Washington D.C. metro area. NCBI staff will participate in a variety of activities and events, including hosting a hands-on workshop: Exploring and downloading NCBI data with NCBI Datasets. We’re also excited to share our recent efforts on the NIH Comparative Genomics Resource (CGR) in a talk during Sunday’s Technology, Tools, and Resources session. 

Check out NCBI’s schedule of activities and events:

Continue reading “Join NCBI at TAGC 2024”

New RefSeq Annotations Now Available!

New RefSeq Annotations Now Available!

During October to January, the NCBI Eukaryotic Genome Annotation Pipeline released seventy new annotations in RefSeq!

New Annotations
  • Alnus glutinosa (eudicot)
  • Amyelois transitella (moth)
  • Anolis sagrei ordinatus (Brown anole)
  • Apis cerana (Asiatic honeybee)
  • Balaenoptera ricei (Rice’s whale)
  • Bombus pascuorum (bee)
  • Bos javanicus (banteng)
  • Bos taurus (cattle) 

Continue reading “New RefSeq Annotations Now Available!”

Updated Bacterial and Archaeal Reference Genome Collection is Available!

Updated Bacterial and Archaeal Reference Genome Collection is Available!

Download the updated bacterial and archaeal reference genome collection! This collection (18,941 genomes as of Jan 18, 2024) was built by selecting the “best” genome assembly for each species among the 330,000+ prokaryotic genomes in RefSeq (except for E. coli for which two assemblies were selected as reference). You can speed up your sequence searches by running them against these high-quality genomes instead of the entire nucleotide or protein database.

The criteria for selecting the reference assembly for a given species include assembly contiguity and completeness and quality of the RefSeq annotation. Continue reading “Updated Bacterial and Archaeal Reference Genome Collection is Available!”

RefSeq Release 222 Now Available!

RefSeq Release 222 Now Available!

Check out RefSeq release 222, now available online and from the FTP site. You can access RefSeq data through NCBI Datasets.

What’s included in this release?

As of January 8, 2024, this full release incorporates genomic, transcript, and protein data containing:

  • 411,137,832 records
  • 304,562,770 proteins
  • 59,343,570 RNAs
  • sequences from 145,371 organisms 

Continue reading “RefSeq Release 222 Now Available!”

Now Available: NCBI Hidden Markov Models (HMM) Release 14.0!

Now Available: NCBI Hidden Markov Models (HMM) Release 14.0!

Download release 14.0 of the NCBI protein profile Hidden Markov models (HMMs) used by the Prokaryotic Genome Annotation Pipeline (PGAP)! Search this collection against your favorite prokaryotic proteins to identify their function using the HMMER sequence analysis package. Continue reading “Now Available: NCBI Hidden Markov Models (HMM) Release 14.0!”