Released
Dataset

pmoA gene reference database (fasta-formatted sequences and taxonomy)

Cite as:

Yang, Sizhong; Wen, Xi; Liebner, Susanne (2016): pmoA gene reference database (fasta-formatted sequences and taxonomy). GFZ Data Services. https://doi.org/10.5880/GFZ.5.3.2016.001

Status

I   N       R   E   V   I   E   W : Yang, Sizhong; Wen, Xi; Liebner, Susanne (2016): pmoA gene reference database (fasta-formatted sequences and taxonomy). GFZ Data Services. https://doi.org/10.5880/GFZ.5.3.2016.001

Abstract

This data set is a part of result affiliated to our manuscript about pmoA gene (encoding the alpha subunit of the enzyme of particular methane monooxygenase). The taxonomy database consists of 7809 unaligned pmoA nucleotide sequences in fasta format and a corresponding taxonomy file, according the format specified by the software platforms of Mothur and QIIME. The taxonomy file is a two column text file where the first column is the accession number of the sequence and the second column is a string of taxonomic information separated by semicolons. We created a comprehensive taxonomy database for the pmoA nucleotide sequences which could be probed by the primer set combination of A189f and A682r. Sequences in this database were firstly retrieved from the NCBI database and progressively screened by Biopython or R scripts. The corresponding taxonomy was generally referred to the NCBI taxonomy if the explicit taxonomic ranks from phylum to species are available. For those with ambiguous taxonomies given by the NCBI database, taxonomic classification was improved as possible by referring to the Dumont’s database (Frontier in Microbiology, 2014, 5: 34. doi: 10.3389/fmicb.2014.00034).

Authors

  • Yang, Sizhong;GFZ German Research Centre for Geosciences, Potsdam, Germany
  • Wen, Xi;GFZ German Research Centre for Geosciences, Potsdam, Germany
  • Liebner, Susanne;GFZ German Research Centre for Geosciences, Potsdam, Germany

Contact

  • Yang, Sizhong (Postdoc) ; GFZ German Research Centre for Geosciences, Potsdam, Germany;

Keywords

pmoA gene, reference database, microbiology

Files

License: CC BY 4.0

Dataset Description

Supplement to