NMPFamsDB

A database of Novel Metagenome Protein Families

Sample 3300002654

3300002654: Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaT HF113 (Metagenome Metatranscriptome, Counting Only)



Overview

Basic Information
IMG/M Taxon OID: 3300002654
GOLD Reference (Study | Sequencing Project | Analysis Project): Gs0085736 | Gp0056644 | Ga0005464
Sample Name: Forest soil microbial communities from Harvard Forest Long Term Ecological Research site in Petersham, Massachusetts, USA - MetaT HF113 (Metagenome Metatranscriptome, Counting Only)
Sequencing Status: Permanent Draft
Sequencing Center: DOE Joint Genome Institute (JGI)
Published?: N
Use Policy: Open

Dataset Contents
Total Genome Size: 6259931
Sequencing Scaffolds: 25
Novel Protein Genes: 27
Associated Families: 26

Dataset Phylogeny
Taxonomy Groups | Number of Scaffolds
Not Available | 20
All Organisms → cellular organisms → Eukaryota → Opisthokonta → Fungi → Dikarya → Basidiomycota → Agaricomycotina → Agaricomycetes → Agaricomycetes incertae sedis → Cantharellales → Botryobasidiaceae → Botryobasidium → Botryobasidium botryosum | 1
All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Panarthropoda → Arthropoda → Mandibulata → Pancrustacea → Hexapoda → Insecta → Dicondylia → Pterygota → Neoptera → Endopterygota → Diptera → Brachycera → Muscomorpha → Eremoneura → Cyclorrhapha → Schizophora → Acalyptratae → Ephydroidea → Drosophilidae → Drosophilinae → Drosophilini → Drosophila → Sophophora → melanogaster group → melanogaster subgroup → Drosophila melanogaster | 1
All Organisms → cellular organisms → Eukaryota → Viridiplantae → Streptophyta → Streptophytina → Embryophyta → Tracheophyta → Euphyllophyta → Spermatophyta → Magnoliopsida → Mesangiospermae → eudicotyledons → Gunneridae → Pentapetalae → rosids → fabids → Malpighiales → Rhizophoraceae → Rhizophora → Rhizophora mucronata | 1
All Organisms → cellular organisms → Eukaryota → Opisthokonta → Fungi → Dikarya → Basidiomycota → Agaricomycotina → Agaricomycetes → Agaricomycetidae → Agaricales → Tricholomatineae → Lyophyllaceae → Hypsizygus → Hypsizygus marmoreus | 1
All Organisms → cellular organisms → Eukaryota → Opisthokonta → Fungi → Dikarya → Basidiomycota → Agaricomycotina → Agaricomycetes → Agaricomycetes incertae sedis → Polyporales → Fibroporiaceae → Fibroporia → Fibroporia radiculosa | 1
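
The lineage strings in the phylogeny table above separate ranks with " → ", so scaffold counts can be rolled up to any depth of the lineage. The following is a minimal Python sketch of that roll-up, not part of NMPFamsDB itself: the names `ROWS`, `SEPARATOR`, and `clade_counts` are hypothetical, and the lineages are truncated to their first five ranks purely to keep the example short.

```python
from collections import Counter

SEPARATOR = " → "

# (lineage, number of scaffolds) pairs taken from the table above.
# Assumption for brevity: each lineage is cut off after its fifth rank.
ROWS = [
    ("Not Available", 20),
    # Botryobasidium botryosum
    ("All Organisms → cellular organisms → Eukaryota → Opisthokonta → Fungi", 1),
    # Drosophila melanogaster
    ("All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa", 1),
    # Rhizophora mucronata
    ("All Organisms → cellular organisms → Eukaryota → Viridiplantae → Streptophyta", 1),
    # Hypsizygus marmoreus
    ("All Organisms → cellular organisms → Eukaryota → Opisthokonta → Fungi", 1),
    # Fibroporia radiculosa
    ("All Organisms → cellular organisms → Eukaryota → Opisthokonta → Fungi", 1),
]


def clade_counts(rows, depth):
    """Sum scaffold counts per lineage element at position `depth` (0-based)."""
    counts = Counter()
    for lineage, n_scaffolds in rows:
        ranks = lineage.split(SEPARATOR)
        # Rows with fewer ranks (such as "Not Available") keep their last element.
        key = ranks[depth] if depth < len(ranks) else ranks[-1]
        counts[key] += n_scaffolds
    return counts


if __name__ == "__main__":
    for clade, n_scaffolds in clade_counts(ROWS, depth=4).most_common():
        print(f"{clade}\t{n_scaffolds}")
```

Grouping at a fixed depth is a simplification: the lineages do not carry rank labels, so the same position can hold different kinds of clade in different rows.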

Ecosystem and Geography

Ecosystem Assignment (GOLD)
Name: Forest Soil Microbial Communities From Harvard Forest Long Term Ecological Research (Lter) Site In Petersham, Ma, For Long-Term Soil Warming Studies
Type: Environmental
Taxonomy: Environmental → Terrestrial → Soil → Loam → Forest Soil → Forest Soil → Forest Soil Microbial Communities From Harvard Forest Long Term Ecological Research (Lter) Site In Petersham, Ma, For Long-Term Soil Warming Studies

Alternative Ecosystem Assignments
Environment Ontology (ENVO): forest biome | land | forest soil
Earth Microbiome Project Ontology (EMPO): Free-living → Non-saline → Soil (non-saline)

Location Information
Location: Harvard Forest LTER, Petersham, MA, USA
Coordinates: Lat. (°) 42.532967 | Long. (°) -72.180244 | Alt. (m) N/A | Depth (m) 0 to 0.1


Associated Families

Family | Category | Number of Sequences | 3D Structure?
F001380 | Metagenome / Metatranscriptome | 709 | Y
F002654 | Metagenome / Metatranscriptome | 539 | Y
F004183 | Metagenome / Metatranscriptome | 449 | Y
F005001 | Metagenome / Metatranscriptome | 415 | Y
F016923 | Metagenome / Metatranscriptome | 243 | Y
F016972 | Metagenome / Metatranscriptome | 243 | Y
F018933 | Metagenome / Metatranscriptome | 232 | N
F019794 | Metagenome / Metatranscriptome | 227 | N
F021919 | Metagenome / Metatranscriptome | 216 | Y
F023505 | Metagenome / Metatranscriptome | 209 | Y
F024050 | Metagenome / Metatranscriptome | 207 | Y
F024792 | Metagenome / Metatranscriptome | 204 | Y
F030305 | Metagenome / Metatranscriptome | 185 | N
F035144 | Metagenome / Metatranscriptome | 172 | Y
F036049 | Metagenome / Metatranscriptome | 170 | Y
F038248 | Metagenome / Metatranscriptome | 166 | Y
F038563 | Metagenome / Metatranscriptome | 165 | Y
F045560 | Metagenome / Metatranscriptome | 152 | Y
F050386 | Metagenome / Metatranscriptome | 145 | Y
F051291 | Metagenome / Metatranscriptome | 144 | Y
F059015 | Metagenome / Metatranscriptome | 134 | Y
F060406 | Metagenome / Metatranscriptome | 133 | N
F062337 | Metagenome / Metatranscriptome | 130 | N
F063716 | Metagenome / Metatranscriptome | 129 | Y
F076700 | Metagenome / Metatranscriptome | 117 | N
F089618 | Metagenome / Metatranscriptome | 108 | N

Associated Scaffolds

Scaffold | Taxonomy | Length
Ga0005464J37254_100140 | Not Available | 818
Ga0005464J37254_100172 | Not Available | 598
Ga0005464J37254_100236 | Not Available | 578
Ga0005464J37254_100447 | All Organisms → cellular organisms → Eukaryota → Opisthokonta → Fungi → Dikarya → Basidiomycota → Agaricomycotina → Agaricomycetes → Agaricomycetes incertae sedis → Cantharellales → Botryobasidiaceae → Botryobasidium → Botryobasidium botryosum | 576
Ga0005464J37254_100474 | Not Available | 507
Ga0005464J37254_100495 | Not Available | 862
Ga0005464J37254_100523 | Not Available | 581
Ga0005464J37254_100569 | All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Panarthropoda → Arthropoda → Mandibulata → Pancrustacea → Hexapoda → Insecta → Dicondylia → Pterygota → Neoptera → Endopterygota → Diptera → Brachycera → Muscomorpha → Eremoneura → Cyclorrhapha → Schizophora → Acalyptratae → Ephydroidea → Drosophilidae → Drosophilinae → Drosophilini → Drosophila → Sophophora → melanogaster group → melanogaster subgroup → Drosophila melanogaster | 533
Ga0005464J37254_100796 | All Organisms → cellular organisms → Eukaryota → Viridiplantae → Streptophyta → Streptophytina → Embryophyta → Tracheophyta → Euphyllophyta → Spermatophyta → Magnoliopsida → Mesangiospermae → eudicotyledons → Gunneridae → Pentapetalae → rosids → fabids → Malpighiales → Rhizophoraceae → Rhizophora → Rhizophora mucronata | 1406
Ga0005464J37254_100969 | Not Available | 822
Ga0005464J37254_101363 | Not Available | 695
Ga0005464J37254_101409 | Not Available | 697
Ga0005464J37254_101744 | Not Available | 543
Ga0005464J37254_102255 | Not Available | 810
Ga0005464J37254_102373 | All Organisms → cellular organisms → Eukaryota → Opisthokonta → Fungi → Dikarya → Basidiomycota → Agaricomycotina → Agaricomycetes → Agaricomycetidae → Agaricales → Tricholomatineae → Lyophyllaceae → Hypsizygus → Hypsizygus marmoreus | 2311
Ga0005464J37254_102408 | Not Available | 578
Ga0005464J37254_102448 | All Organisms → cellular organisms → Eukaryota → Opisthokonta → Fungi → Dikarya → Basidiomycota → Agaricomycotina → Agaricomycetes → Agaricomycetes incertae sedis → Polyporales → Fibroporiaceae → Fibroporia → Fibroporia radiculosa | 1839
Ga0005464J37254_103488 | Not Available | 543
Ga0005464J37254_103986 | Not Available | 806
Ga0005464J37254_104781 | Not Available | 516
Ga0005464J37254_105389 | Not Available | 567
Ga0005464J37254_105828 | Not Available | 718
Ga0005464J37254_106060 | Not Available | 588
Ga0005464J37254_106262 | Not Available | 716
Ga0005464J37254_107267 | Not Available | 520

Sequences

Scaffold ID | Protein ID | Family | Sequence
Ga0005464J37254_100140 | Ga0005464J37254_1001401 | F024792 | VTGERLSQAGWLNPSKSASQGAETGGRIHQFLWRRSHAVSDAKQELSGKGLN*
Ga0005464J37254_100172 | Ga0005464J37254_1001721 | F004183 | MERLRGANEDRANSEGGPNGCEAYQRVDPEGAKTPEGERRQAAQPAEQAGTECNGLEAWMQPEAGANQQLAAESKSRTSRKTGRQVSEVAGQEL*
Ga0005464J37254_100236 | Ga0005464J37254_1002361 | F089618 | LPLPPLAAPVSNIRLASAALPPARPRANPPARIGVFSPGSTGGKHPACTVCYALPIDWLLTFQLALASGLQLGLRLLPTHIWCRPSARQVLRLPVLTGFRRNLRLAPAAAAAPTRAGGCPLLPHRPRTSDSHRLLFCRLCRSRLTQLALRALTSGWAFDAPLASTEPCIAG*
Ga0005464J37254_100447 | Ga0005464J37254_1004471 | F001380 | PGFTVRFSDPSACGRSLCNKMLRINSSSLSAAAFQHAAGHSNQQIFVGLSTLPEPESRYGLSLAHNDAFATIARSMFLACTFVSTSKSFANPFDSRLFRSVRFRGRTGATSMPGTRFPLHPSILRIHPQSPLPFRPSFENPSDQSVRPVQNPEARLTRRPIASSYSPPLPLAIPLRISARNSLRLSLTYRS
Ga0005464J37254_100474 | Ga0005464J37254_1004741 | F004183 | GANEDRALPEGGPNGCEASQGFDPEGVKTPRGELRQAAQPAKQAGKECNGFEAWMQPEAGANQPLVAESKSRTGRKTGKQVSEVAGQDL*
Ga0005464J37254_100495 | Ga0005464J37254_1004951 | F030305 | VFRMPLQFIGSCIVPFGFLVPVPSFLFPALSDLARRCSSGRPVPRVSDRTGDEAPSCPGSSVFSAVPADGSSSRPDSRILQLGSPQIARLPRLSTSCLAVDERPGCPVRSIIWLYRRRRFRVAPNLTSFGGTVSNSPSRPGSSLLQPLPLMVLRVAPGAPSSGFAGGDSSGCPEALIPRLCRLVAFRVSPDLPPSDICRFRFSGLPQIGFLGGSMMNPRFARTLHPRSIQLTSLQVAPKFPLPAAPRMNLQTHSGLAFLPTLRCSLNLYPLFARRRTGLLRTTINQF
Ga0005464J37254_100523 | Ga0005464J37254_1005231 | F059015 | SRVHGTIQRPVCMRTFALQPDVRDKTTHRSPSPLFSMRQVIALSRLQHASLSD*HPAGAGSSIRPFARSQRRFRHHCEVNVPGLHLRFHIENLCESVRLQALSLRSVSRPNRGAVNAQNPLSAPISNTPDLSPISTPLQVLLRKPSGSKRSTGSISGNPSYQTFDCPLLPATSSFDSATDQRLKLASFGLTYR
Ga0005464J37254_100569 | Ga0005464J37254_1005691 | F021919 | FQVTRSRSQTPRRHATAQSFASRIAGRHPFGSPPRFFKRNGSIPRGSPASFPSWNRHLEAAFHSPETTARFQATISRSKLPTYSFDTLPCIRPARSDPDSPTRPGSPRRAQDHYRNPVA*LLPGTSNPSSDLHSPSGPFEPLRIKAFNPIPGRKVHLPSAPDCPSLPNIESILLVRC
Ga0005464J37254_100796 | Ga0005464J37254_1007961 | F002654 | MLNNQTFNDKVVCLKGNSPEQVFKVPKLLLSESKEVFKSYNQEIGLEAAIF*
Ga0005464J37254_100969 | Ga0005464J37254_1009691 | F045560 | PRRAKSDERKMAQVSGIIPGDWGKVRPGWLAGLLLKEAARHISGGRIHQFLWQRSRAVSKREKGTER*
Ga0005464J37254_101363 | Ga0005464J37254_1013632 | F024050 | PVAPASSCSARDEGLELPLVPHLRLHRQWIVESPRLSHLSAVPTGQSSSRPESRPFGIADDSSPRLPQTPNPPVPADRYPSYLGSRTIRFALVESPGCPGHSPLATAIDQFPGCPKSRVFRRSPILLNSSRPEPWFLG*
Ga0005464J37254_101409 | Ga0005464J37254_1014093 | F063716 | VRNSNLRVKRRDPWQGANALPKAAADPVLMVKTRTRIAGVS*
Ga0005464J37254_101744 | Ga0005464J37254_1017441 | F076700 | LITRSTAGVASSSYPSGRSPAFAGAGSSGCAGTVALTCVNALHSGSTGGQPPGSDRFRVLRLDRLQTSDSHRLFRASDRPVVTRSTCVEPSFLRLGRQPTSDSHRCRPLARLAASSGLRLMLPLPLEWLRFQFAPAASPFAFTGREPLGLRLVAPSPAEPLMHSLFPPNLASPAKPSMSI
Ga0005464J37254_102124 | Ga0005464J37254_1021241 | F016923 | FQVTRRSARSPAGMHGQNLASGNGVT*ALCSPRPILFSYGSMLWCASAFTRSGRSSILDTAFCSPAATADLSVRPRSRVNAPGLYLRTDPKIYARPVRLRAPAPVVAFLSPSGARSSHVARCQVRNQNSLSVLKPPLPSRTSQSFGIVALNLIPTREAYPCELPDFLSLPAAL*
Ga0005464J37254_102255 | Ga0005464J37254_1022551 | F018933 | SRVAGLSLCCSRRSRIAPRSTLDSLYRPRIAPLPVLIDRCCPPIARRSTLNALPRSQIAPRSILCARGDLGLLLVHHLEPSPRLRIAPYSTLCFPLRSQIALRSVLPALCRSRIAPRSTIDAQCRSTDISVRLHSAPRAVHGSLRAQHFMLRAVHRLLCVRRLHFARFADCSALLASCLVSHADCSVLDT*
Ga0005464J37254_102373 | Ga0005464J37254_1023732 | F050386 | LRGPRKSYQWKRISEVHKLMVDIQTAHAKLGSQDGNSREHEIKVPNDY*
Ga0005464J37254_102408 | Ga0005464J37254_1024081 | F005001 | QVT*FPAWLTTGMHGIELREQERRTIRLSAPRWPFSPAAGSMLPGSPLAASCPEPVARNGFSLARNSCRLSATSIPGSKLPACYFASFQVGFRARSTFRLHYRVPDCAGCGGFIACGPLHCHHSVRPAAPAISTPLRDFCLPRDQSVQPRLLPAGPPGESARFPFAPRRPS*LKFGLRIIVPGPLRFRRLAV
Ga0005464J37254_102448 | Ga0005464J37254_1024482 | F016972 | VRLTPLTDEIPVVNLYLTINVTDRVSFGAKTSRYGPYELGYRRATKSFTK*
Ga0005464J37254_103488 | Ga0005464J37254_1034881 | F038563 | TSCANTYFSWKITSIDHRSMANNNRRRRRNRGAGASTLPGPSRFIAPVQAKDGQSLIVKVTKPEEGVPKQVSATHYSVTLKGTQGQMILHGSRCANTWTTGKIPVGDWKDVYVQVSSDDTVALTSVIWYSAQ*
Ga0005464J37254_103535 | Ga0005464J37254_1035351 | F062337 | PTSGSTAGQPSGSDRCCLAWLGQWQVLSFRCVLRAADRLAADLPTCVGVRPPARPAITTDSHLALSFSSAGLSASGSHRLPLQQLACAGCCCNPQLALAVATFRHTGGVLPTRIGCSALRLYRFRITRLAPCASTSGWAFDAPLTSTEPCIAGKPSMSIPYPPVRASSGSASLTTFDLRRLLQSLAQPAIPLRL
Ga0005464J37254_103986 | Ga0005464J37254_1039861 | F019794 | SVVQPVLRPPATPETSLQLALPLPPLAAPASNLRLASAALRPARPGANPPARIGVLSFGSTGGKRRAFAVCIALPFVRWLTFQLALASFLRLGRRPTADSHLVLILQLGSCPTSGSHRLLLQPSACASCCYDSPACAGRRPFAIPAANFRLASDVTPSSFTGFDSPDLRRMFLPPVGPLMHPLLQPNLASPAEPSMSIQSPPVLAPSGSASFNNLRLASVFAMSGATSDPSAAFASGFTLWLGLRRFSDSRQLFVPPAIPATNSQCPT
Ga0005464J37254_104781 | Ga0005464J37254_1047811 | F036049 | ISGGCDDRRLPVVALGTPLLAMVLFVGGSPRGPTYTVRSGCSVEIDAACPSGPGNPTDFIVRNFTDVGLAGLEIPASQLLLPTDPSEMAIHHGFLPWKEASWRLQQRSDTRLVTQTCVGEVAKLHLSSALTDLNWLKVQLWQIPSLQPQRCAIAPTVCLTDC*
Ga0005464J37254_105389 | Ga0005464J37254_1053891 | F060406 | LGELNSTTGCRVLRLPQRPTPSFRLRSRPLAAPARCFRLASTASHSGSTGGRLSRLGSAFCRSARSVANLRFASALRTAVRPLADLPACAGVLPQARPRTNFRLTSDLDSSARLVSNFRFAPVVVATSACAFCCCGLWLAPVPPVCQTGGEPPTRIGCPSSGFTGFDSLGLRLVLLPPAGLLMHP*
Ga0005464J37254_105828 | Ga0005464J37254_1058281 | F038248 | SALLPTDLRLASPINLPACLRPTSDSHRNLCLPASPSDQPPTCAGCRLFGLCLPACLQLAPSTCLRALPSDLPLARAADPSSGSAFQLNFRLSIGYCIFQLRLFEAASGLRRTLRSVALPSGLLPACALCRPSGFTFRLTVDSRRPSVFQICLPSNFRLSSKSGSQRSLRWTFDLRRRSFFRPAFEPISDFCRISNSWTSPSDSSPACASYQSSKLCLPTYLWLAPPICLPALPSCLP
Ga0005464J37254_106060 | Ga0005464J37254_1060601 | F023505 | LDNGGGRKKVVGLPGIIPGDWGKVEPGWLVEPLEARFARSGNRWHNSPVPLAAFVRRQ*
Ga0005464J37254_106262 | Ga0005464J37254_1062621 | F051291 | MNSKQRVLSNRDVFPPAVSWRARRGTEPSRAPVVLIVDEDLGFVWWLGQMFSEAGCQVVPALNPEQTDSFAQNLNVDVIVVNPELAGVPELIRGMSSPRPPKIVAIRNHNSGVRSGVRADATLERPSGWGNVSQNEWLGCVRRLVRDVQATRVAYPQCS*
Ga0005464J37254_107267 | Ga0005464J37254_1072671 | F035144 | VTHKFSRLVLFSASAMVISMFTLAAAPCTPGTLASYIALGSTGCTVGNDTFFNFQLINDNASGGATMVTAADINVQGMGPAGTMGASSQNSFLPQDIGVDFDTALWAVTAGQSQDDDISFDVSVGTGAVDITDAGVDQISNTVPNGTASVTEKGCSGLVFPCAS
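
For downstream tools, rows of the sequence table above are often easier to use as FASTA records. The following is a minimal Python sketch of one way to do that, using three of the shorter rows. The function name `to_fasta`, the header layout, and the option to drop a trailing `*` are illustrative assumptions; the `*` is treated here as a conventional stop-codon marker, which this page does not state explicitly.

```python
# (scaffold ID, protein ID, family, sequence) copied from three rows above.
ROWS = [
    ("Ga0005464J37254_100140", "Ga0005464J37254_1001401", "F024792",
     "VTGERLSQAGWLNPSKSASQGAETGGRIHQFLWRRSHAVSDAKQELSGKGLN*"),
    ("Ga0005464J37254_100796", "Ga0005464J37254_1007961", "F002654",
     "MLNNQTFNDKVVCLKGNSPEQVFKVPKLLLSESKEVFKSYNQEIGLEAAIF*"),
    ("Ga0005464J37254_101409", "Ga0005464J37254_1014093", "F063716",
     "VRNSNLRVKRRDPWQGANALPKAAADPVLMVKTRTRIAGVS*"),
]


def to_fasta(rows, width=60, strip_terminal_stop=True):
    """Render rows as FASTA text, wrapping sequence lines at `width` residues."""
    lines = []
    for scaffold, protein, family, seq in rows:
        # Assumption: a trailing '*' marks a stop codon and can be dropped.
        if strip_terminal_stop and seq.endswith("*"):
            seq = seq[:-1]
        # Hypothetical header layout: protein ID first, then key=value tags.
        lines.append(f">{protein} scaffold={scaffold} family={family}")
        lines.extend(seq[i:i + width] for i in range(0, len(seq), width))
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    print(to_fasta(ROWS), end="")
```

Only a terminal `*` is removed; several sequences in the table contain an internal `*`, and this sketch leaves those untouched rather than guessing how they should be handled.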




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice