


| Basic Information | |
|---|---|
| IMG/M Taxon OID | 3300006705 |
| GOLD Reference (Study / Sequencing Project / Analysis Project) | Gs0053074 / Gp0092418 / Ga0031684 |
| Sample Name | Metatranscriptome of deep ocean microbial communities from Atlantic Ocean - MP547 (Metagenome Metatranscriptome) |
| Sequencing Status | Permanent Draft |
| Sequencing Center | DOE Joint Genome Institute (JGI) |
| Published? | N |
| Use Policy | Open |

| Dataset Contents | |
|---|---|
| Total Genome Size | 75206620 |
| Sequencing Scaffolds | 16 |
| Novel Protein Genes | 21 |
| Associated Families | 20 |

| Dataset Phylogeny | |
|---|---|
| Taxonomy Groups | Number of Scaffolds |
| All Organisms → cellular organisms → Eukaryota | 3 |
| Not Available | 8 |
| All Organisms → cellular organisms → Eukaryota → Sar | 1 |
| All Organisms → cellular organisms → Bacteria | 1 |
| All Organisms → cellular organisms → Eukaryota → Amoebozoa → Discosea → Flabellinia → Vannellidae → Vannella → Vannella robusta | 1 |
| All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Dinophyceae → Syndiniales → Amoebophryaceae → Amoebophrya | 1 |
| All Organisms → cellular organisms → Eukaryota → Opisthokonta → Fungi → Dikarya → Ascomycota → saccharomyceta → Saccharomycotina → Saccharomycetes → Saccharomycetales → Debaryomycetaceae → Meyerozyma → Meyerozyma guilliermondii | 1 |

| Ecosystem Assignment (GOLD) | |
|---|---|
| Name | Deep Ocean Microbial Communities From The Global Malaspina Expedition |
| Type | Environmental |
| Taxonomy | Environmental → Aquatic → Marine → Oceanic → Unclassified → Deep Ocean → Deep Ocean Microbial Communities From The Global Malaspina Expedition |

| Alternative Ecosystem Assignments | |
|---|---|
| Environment Ontology (ENVO) | marine biome → marine water body → sea water |
| Earth Microbiome Project Ontology (EMPO) | Free-living → Saline → Water (saline) |

| Location Information | |
|---|---|
| Location | South Atlantic Ocean |
| Latitude (°) | -26.95 |
| Longitude (°) | -21.4 |
| Altitude (m) | N/A |
| Depth (m) | N/A |

| Family | Category | Number of Sequences | 3D Structure? |
|---|---|---|---|
| F000049 | Metagenome / Metatranscriptome | 3277 | Y |
| F000052 | Metagenome / Metatranscriptome | 3223 | Y |
| F000075 | Metagenome / Metatranscriptome | 2622 | Y |
| F001716 | Metatranscriptome | 647 | Y |
| F006501 | Metagenome / Metatranscriptome | 371 | N |
| F010461 | Metatranscriptome | 303 | Y |
| F010686 | Metagenome / Metatranscriptome | 300 | Y |
| F010768 | Metagenome / Metatranscriptome | 299 | Y |
| F014836 | Metagenome / Metatranscriptome | 259 | Y |
| F019484 | Metagenome / Metatranscriptome | 229 | Y |
| F027881 | Metagenome / Metatranscriptome | 193 | Y |
| F030135 | Metagenome / Metatranscriptome | 186 | Y |
| F046996 | Metagenome / Metatranscriptome | 150 | Y |
| F051870 | Metatranscriptome | 143 | N |
| F061786 | Metagenome / Metatranscriptome | 131 | Y |
| F063613 | Metatranscriptome | 129 | N |
| F070259 | Metagenome / Metatranscriptome | 123 | Y |
| F090278 | Metatranscriptome | 108 | N |
| F097173 | Metatranscriptome | 104 | Y |
| F101008 | Metatranscriptome | 102 | N |

| Scaffold | Taxonomy | Length |
|---|---|---|
| Ga0031684_1000698 | All Organisms → cellular organisms → Eukaryota | 518 |
| Ga0031684_1013363 | Not Available | 776 |
| Ga0031684_1016401 | All Organisms → cellular organisms → Eukaryota → Sar | 602 |
| Ga0031684_1024000 | All Organisms → cellular organisms → Bacteria | 2068 |
| Ga0031684_1155457 | All Organisms → cellular organisms → Eukaryota → Amoebozoa → Discosea → Flabellinia → Vannellidae → Vannella → Vannella robusta | 642 |
| Ga0031684_1162320 | All Organisms → cellular organisms → Eukaryota | 572 |
| Ga0031684_1182008 | Not Available | 653 |
| Ga0031684_1192364 | Not Available | 516 |
| Ga0031684_1193763 | All Organisms → cellular organisms → Eukaryota | 600 |
| Ga0031684_1197521 | Not Available | 541 |
| Ga0031684_1206314 | All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Dinophyceae → Syndiniales → Amoebophryaceae → Amoebophrya | 752 |
| Ga0031684_1208891 | All Organisms → cellular organisms → Eukaryota → Opisthokonta → Fungi → Dikarya → Ascomycota → saccharomyceta → Saccharomycotina → Saccharomycetes → Saccharomycetales → Debaryomycetaceae → Meyerozyma → Meyerozyma guilliermondii | 625 |
| Ga0031684_1210502 | Not Available | 1182 |
| Ga0031684_1212634 | Not Available | 733 |
| Ga0031684_1214563 | Not Available | 673 |
| Ga0031684_1232933 | Not Available | 946 |

| Scaffold ID | Protein ID | Family | Sequence |
|---|---|---|---|
| Ga0031684_1000698 | Ga0031684_10006982 | F070259 | MKLESLVIANQHVTVNDLSNFAHTAHHARKVVLAGNYLGLLIFELFIKK* |
| Ga0031684_1002867 | Ga0031684_10028672 | F027881 | MSLTVARSERVRDVLTHASCTEIFILNFSSARVSNTLICFQ* |
| Ga0031684_1004153 | Ga0031684_10041531 | F027881 | IMSLTVARSERVRSVLTHASCTEISINFSSVRVSNIQNCC* |
| Ga0031684_1009265 | Ga0031684_10092651 | F000049 | MRHSKMAEWMENVEKAIARIMADKVYTSAEFKRERDNFQALCKDLERTEVKKWLNQILEILMAERAKSQQETENEKLNVLIKKHEELIPSVQKTSVMVDLYWKCYAYGDELKPHIEFLDGIMLSSTRDIAPSCVENVDELIERQEKSLVQLDSKKNIVVDLIAKGKVILEHPD |
| Ga0031684_1013363 | Ga0031684_10133632 | F019484 | ILGGSLKKIAKKITTSVPISKKVYTNASIIYLSIY* |
| Ga0031684_1016401 | Ga0031684_10164011 | F010768 | MKEVLNGGYNVHPVPPPNSEIKERIRRKYERKRIKIERLFTLGYTTSGDP* |
| Ga0031684_1024000 | Ga0031684_10240002 | F010686 | LDLCGQAIKGTWGMSWRQEALKGVEDCEKLGEAVKRALIPRFLN* |
| Ga0031684_1155457 | Ga0031684_11554571 | F061786 | NNTKGLYLCETIYPPDNGVSCIPPEKLEKAIKKGEIPPQVQADEEDLDGNMDEVIDNCIREVWQYYDPKGVGFLNKKQTQTFFKDALNLVALRKNCKSKDLFQGRKEGAALEEAFRQVNTSGDGRVTFEQFEEFINAYDLDEAVDLLTGGNSEHQINTQGVQMVDNSSLPSGGGPVKGKIEYRDYGALED* |
| Ga0031684_1162320 | Ga0031684_11623201 | F090278 | SDLQSLWRSKREASEGLVDVDEDDFYEFLQQVNDNRHFRHNAMGNLTCVLHKCGQLTEDMEVNMDFYTTALRAEEPETGFTWDVEGSAAKDPEWREKIATAYEDCHDLAESWPATSLNRRPMTRMFGRQMIFFKCADRTERRVCTEAQLLENLETFYGSESDEETAERIAAIGLPEDKYDAAAISIAVIQ |
| Ga0031684_1182008 | Ga0031684_11820081 | F001716 | HTDMLSIAAIGALSNTAEISSTNLRAAPAFDNIKASWNQALKLGDFNTNLKCNYDHSANRDFLKEVSLSGDLVEGGADDVSVSYEVSHDFSDKNTNVKLTANTQGTTLSAEYDRNNGIKEVSAERDVSIADQNVNVQPSWLVKAQTARVKLMSKLGDSKDSVSAQIDYDTNGKSSSYEVGYDHNIDAGRDVSATFNPDKKQLDVDYVDNKFESGATW |
| Ga0031684_1192364 | Ga0031684_11923641 | F010461 | LTVQFFVIYVMIWVAVTVKEFTGWEWHFITNTMENAKGTLAFCPMLCILFVATRMRAMTLTQWKGAPQGWAQDGMYMATWSILLQFLMVLLIPLCTLVMEGKATQPELDEDGNVKWKPSGKIALICVQVVRWLGFVLLYVGANCVMVGAMTLTPETANGRGSVPLVRQTPF |
| Ga0031684_1193763 | Ga0031684_11937631 | F101008 | KDDRMALEMSVNTMVTPYTFHMNAPYFLPRFFNDINRKTIDATIMHEMGSKLEVKSNCPEFETFIITTTGNKRSVVLNGKELTVVDFQRGARRISQTTELPSGEHLTTTVEWTEDSLKKNQATITVEVTPNRKFEGIFGWDFQTLSTGNFHFDVKGENPWVGNYAIDRHANWEMNKPRYMFNWVGKSEFKTGPFSSFSP |
| Ga0031684_1197521 | Ga0031684_11975211 | F046996 | DEFSGFVDYFKAVREKIFGGYLDGIDFDWEGYCDAGCLKGTCICAWNDKICGTVTPEELAAGVFWDAPPVPGQKPMKHQCWIMPTTSTFQVMSGITHAMKREGFVVTLVPMSTSMYSGEADTSPKQVMRNEYVKYRMQTSFGQKVDLLDMADGILLQWYSGFDAALCVNTDVSSKACTCD |
| Ga0031684_1200074 | Ga0031684_12000741 | F000052 | CFIEYEQIAFNETAEICRTPLVKDCDVQGPEICRTEYESECWTKQEVHDVEDDVVECTTEVEEKCEDETSGYTTNTKCSKWPREVCSVSKKPVKKYTPITGCTKEPRELCAPAGCGFKEGREECYEKTQTVVQDAPKEQCTLEPQRTCKHVTKLVPKLEPTEECVDVPKEVCTRSRTNPRKVKKPVVKKWCYVPTEESGLA* |
| Ga0031684_1206314 | Ga0031684_12063141 | F097173 | SMQCVISLTIQYFLVYTALALVRTAADTFNVKYDNLPIQDILKTATYTVAYAPMLAVLFLGCRMRVTWLTQGKGNPPEYVQAAMYCSTYAVLAMTLCVCVIPVFTGKVFAVDPKTGDIPMDAKPFSNAALGVGFTVLKYLIMLGLYAGALVVVYGIINFEPPKGTWPGDKIPPVSPAVQCVMILSCQYFLVYGGIQVCKSVIEFAGDLFGYTHTLQEALNTATLSINFCPMLAVLFIGARMRALQMDPVN |
| Ga0031684_1208891 | Ga0031684_12088911 | F063613 | FILLTMNLITSLLKKRGGERGSGLLVGLALKGSGGLLALVVGGSTNLSLLLQSLDGILILPSDLVGQTTEASEVAARSESQHTEGLRANHTVDLVVRRGDTLEDLEVSQSGGTTGGLVGDHSTDGSPENLRGSTEVERTTSGVDVASLSKEVLVLELVSEVRTRDVDVLATNNNDLLSGQELLGNSGGKTAQKVTLSVNHDSLLKHA |
| Ga0031684_1210502 | Ga0031684_12105021 | F051870 | MNTMLSLSFLVVLLGQLTETMATTEVNASYNFLGYGSCRSADVSRYPNHYLKIGNVNREECVAQCTADQGCTHMEWGLNGHGQNQCVIFSPQQQQGIAPRGWIFVHGNGGQSITQANGWSSVSTCSAKVHPDPANYRFLGTGSCRSDNIFTYPNHYLKTDPTNALTRFACVNQCNSEVGCTHMEWGLNGNGEKQCVIFCPQCDPSMVPNGWAFVAGNGGTAITQGNSWSDVSKCFAKSMSCQDAQDGYLYTKKTNPKICYKDHTWRDYLTGYWGYEAVCGGAICEMEPGLALALTPQPYRSWWTPHCKVVQCKFGPRRLQEAKSGQLRGTFV* |
| Ga0031684_1212634 | Ga0031684_12126342 | F030135 | ELATLSSFTDIGHSTFVGCQFPDAILPGEEAIDLVAGPLN* |
| Ga0031684_1214563 | Ga0031684_12145631 | F014836 | KLVVVFLLAALSSLALAQCLLERNTCYEEEIYNGCFCFAEWNDANDVGNWTVNENWLQLMEPSWVHFVSISGDNTVTVDEERRINELYVGPNRWDTTRLVIDEDLTIVYDDVPVISRIQAYRLPTCQVRLIIQGKGFGFVSEDIAVVAEDFYEVDNDSNIDDMEPFTYVCNNPTLTYRDAKIECNLTAANIMPSALKVQVRANGYTTDFVLLSEYVQ* |
| Ga0031684_1229161 | Ga0031684_12291611 | F000075 | FFALVAAVSATQYDSMTEDELLVNLESTLNSAQRSEARGDGDAAVAKTAAIKNIQKALTARILKRLDDGQPLVEVARKMKAIEGMQPQINDMERRLGIMQSVEPVLENAIKTLQKVVDVRGMGKK* |
| Ga0031684_1232933 | Ga0031684_12329331 | F006501 | LMESHKIASDDIKARLDAIAKCEQQSREQDKGIQNHTATVVNNARVIHRDCRDKELRMTKDANEKCQFLDFLTVPAALPSESADRKAKLAYGNTMMGYWCNKDDQMKACAAATDALAPVVKECNKKQTQFESEFCAMAIVYHAQCQDLNDVCYTATRAAYDTSVASTTKLVAKWKVEYSALKKINCFLDVWLSDGDANTVSSEKLAACKATDADASVMNIDFGTPVKEFVCADAGFGTLPDYPGTPDFVTKEYGAWPELVQDVIHCHIEDPVAVSTTAGANTPAYR* |