Title
Dataset of de novo assembly and functional annotation of the transcriptomes of three native oleaginous microalgae from the Peruvian Amazon
Date Issued
01 August 2020
Access level
open access
Resource Type
data paper
Publisher(s)
Elsevier Inc.
Abstract
Microalgae are photosynthetic organisms with cosmopolitan distribution (i.e., marine, freshwater and terrestrial habitats) and possess a great diversity of species [1] and consequently an immense variation in biochemical compositions [2]. To date genomic information is available mainly from the model green microalga Chlamydomonas reinhardtii [3]. Here we provide the dataset of a de novo assembly and functional annotation of the transcriptomes of three native oleaginous microalgae from the Peruvian Amazon. Native oleaginous microalgae species Ankistrodesmus sp., Chlorella sp., and Scenedesmus sp. were cultured in triplicate using Chu-10 medium with or without a source of nitrate (NaNO3). Total RNA was purified, the cDNA libraries were constructed and sequenced as paired-end reads on an Illumina HiSeq™2500 platform. Transcriptomes were de novo assembled using Trinity v2.9.1. A total of 48,554 transcripts (range from 250 to 7966 bp; N50 = 1047) for Ankistrodesmus sp., 108,126 transcripts (range from 250 to 8160 bp; N50 = 1090) for Chlorella sp., and 77,689 transcripts (range from 250 to 8481 bp; N50 = 1281) for Scenedesmus sp. were de novo assembled. Completeness of the assembled transcriptomes were evaluated with the Benchmarking Universal Single-Copy Orthologs (BUSCO) software v2/v3. Functional annotation of the assembled transcriptomes was conducted with TransDecoder v3.0.1 and the web-based platforms Kyoto Encyclopedia of Genes and Genomes (KEGG) Automatic Annotation Server (KAAS) and FunctionAnnotator. The raw reads were deposited into NCBI and are accessible via BioProject accession number PRJNA628966 (https://www.ncbi.nlm.nih.gov/bioproject/PRJNA628966) and Sequence Read Archive (SRA) with accession numbers: SRX8295665 (https://www.ncbi.nlm.nih.gov/sra/SRX8295665), SRX8295666 (https://www.ncbi.nlm.nih.gov/sra/SRX8295666), SRX8295667 (https://www.ncbi.nlm.nih.gov/sra/SRX8295667), SRX8295668 (https://www.ncbi.nlm.nih.gov/sra/SRX8295668), SRX8295669 (https://www.ncbi.nlm.nih.gov/sra/SRX8295669), and SRX8295670 (https://www.ncbi.nlm.nih.gov/sra/SRX8295670). Additionally, transcriptome shotgun assembly sequences and functional annotations are available via Discover Mendeley Data (https://data.mendeley.com/datasets/47wdjmw9xr/1).
Volume
31
Language
English
OCDE Knowledge area
Biotecnología ambiental
Scopus EID
2-s2.0-85086940187
Source
Data in Brief
ISSN of the container
23523409
Sponsor(s)
Authors received financial support from the Peruvian funding agency Programa Nacional de Innovación para la Competitividad y Productividad (PNICP) with grant contract No. 383-PNICP-PIBA-2-P-031–14. JDM was supported, in part, by a grant from El Fondo Nacional de Desarrollo Científico, Tecnológico y de Innovación Tecnológica (44–2019-FONDECYT) during a portion of this research. Additionally, we thank the Laboratorio de Biotecnología y Bioenergética, Universidad Científica del Perú (UCP) and the Pritzker Laboratory for Molecular Systematics and Evolution, Field Museum of Natural History for providing necessary infrastructure and equipment.
Sources of information: Directorio de Producción Científica Scopus