Data from: Dataset of de novo assembly and functional annotation of the transcriptome of blueberry (Vaccinium spp.)

Metadata Updated: November 10, 2020

Blueberry is an economically important berry crop. Both production and consumption of blueberries have increased sharply worldwide in recent years at least partly due to their known health benefits. The development of improved genomic resources for blueberry, such as a well-assembled genome and transcriptome, could accelerate breeding through genomic-assisted approaches. To enrich available transcriptome data and identify genes potentially involved in fruit quality, RNA sequencing was performed on fruit tissue from two northern-adapted hybrid blueberry breeding populations. RNA-seq was carried out using the Illumina HiSeqTM 2500 platform. Because of the absence of a reference-grade genome for blueberry, a transcriptome was de novo assembled from this RNA-seq data and other publicly available transcriptome data from blueberry downloaded from the National Center for Biotechnology Information (NCBI) Short Read Archive (SRA) using Trinity. After removing redundancy, this resulted in a dataset of 91,861 blueberry unigenes. This unigene dataset was functionally annotated using the NCBI-Nr protein database. All raw reads from the breeding populations were deposited in the NCBI SRA with accession numbers SRR6281886, SRR6281887, SRR6281888, and SRR6281889. The de novo transcriptome assembly was deposited at NCBI Transcriptome Shotgun Assembly (TSA) database with accession number GGAB00000000. These data will provide real expression evidence for the blueberry genome gene prediction and gene functional annotation and a reference transcriptome for future gene expression studies involving blueberry fruit.

Access & Use Information

Public: This dataset is intended for public access and use. License: Creative Commons CCZero

Downloads & Resources


Metadata Created Date November 10, 2020
Metadata Updated Date November 10, 2020

Metadata Source

Harvested from USDA JSON

Additional Metadata

Resource Type Dataset
Metadata Created Date November 10, 2020
Metadata Updated Date November 10, 2020
Publisher Agricultural Research Service
Unique Identifier Unknown
Identifier cdf5e7a9-6528-4dc7-9d50-67331cd79728
Data Last Modified 2019-08-23
Public Access Level public
Bureau Code 005:18
Metadata Context
Schema Version
Catalog Describedby
Program Code 005:040
Source Datajson Identifier True
Source Hash 91fb8958e1088bb99aff4ebfa5bafef9b0d8c020
Source Schema Version 1.1

Didn't find what you're looking for? Suggest a dataset here.