


| Basic Information | |
|---|---|
| IMG/M Taxon OID | 3300009350 Open in IMG/M |
| GOLD Reference (Study | Sequencing Project | Analysis Project) | Gs0117984 | Gp0126421 | Ga0103832 |
| Sample Name | Microbial communities of water from the North Atlantic ocean - ACM35 |
| Sequencing Status | Permanent Draft |
| Sequencing Center | University of Georgia |
| Published? | N |
| Use Policy | Open |
| Dataset Contents | |
|---|---|
| Total Genome Size | 48631543 |
| Sequencing Scaffolds | 20 |
| Novel Protein Genes | 26 |
| Associated Families | 23 |
| Dataset Phylogeny | |
|---|---|
| Taxonomy Groups | Number of Scaffolds |
| All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Ciliophora → Intramacronucleata → Spirotrichea | 2 |
| All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Dinophyceae → Gonyaulacales → Lingulodiniaceae → Lingulodinium → Lingulodinium polyedra | 3 |
| All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Ciliophora → Intramacronucleata → Spirotrichea → Stichotrichia → Urostylida → Pseudourostylidae → Pseudourostyla → Pseudourostyla cristata | 1 |
| All Organisms → cellular organisms → Eukaryota → Sar | 1 |
| Not Available | 5 |
| All Organisms → cellular organisms → Eukaryota | 6 |
| All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Dinophyceae → Noctilucales → Noctilucaceae → Noctiluca → Noctiluca scintillans | 1 |
| All Organisms → cellular organisms → Eukaryota → Haptista → Haptophyta | 1 |
| Ecosystem Assignment (GOLD) | |
|---|---|
| Name | Aquatic Microbial Communities From Amazon River, Brazil And North Atlantic Ocean |
| Type | Environmental |
| Taxonomy | Environmental → Aquatic → Freshwater → Unclassified → Unclassified → River Water → Aquatic Microbial Communities From Amazon River, Brazil And North Atlantic Ocean |
| Alternative Ecosystem Assignments | |
|---|---|
| Environment Ontology (ENVO) | marine biome → marine water body → surface water |
| Earth Microbiome Project Ontology (EMPO) | Free-living → Saline → Water (saline) |
| Location Information | ||||||||
|---|---|---|---|---|---|---|---|---|
| Location | North Pacific Ocean | |||||||
| Coordinates | Lat. (o) | N/A | Long. (o) | N/A | Alt. (m) | N/A | Depth (m) | N/A | Location on Map |
| Zoom: | Powered by OpenStreetMap © | |||||||
| Family | Category | Number of Sequences | 3D Structure? |
|---|---|---|---|
| F000070 | Metagenome / Metatranscriptome | 2710 | N |
| F000491 | Metatranscriptome | 1079 | Y |
| F002457 | Metagenome / Metatranscriptome | 557 | Y |
| F002556 | Metagenome / Metatranscriptome | 548 | Y |
| F003081 | Metagenome / Metatranscriptome | 508 | Y |
| F003808 | Metatranscriptome | 467 | Y |
| F005505 | Metagenome / Metatranscriptome | 398 | Y |
| F006501 | Metagenome / Metatranscriptome | 371 | N |
| F009426 | Metagenome / Metatranscriptome | 318 | Y |
| F011139 | Metagenome / Metatranscriptome | 294 | Y |
| F014625 | Metatranscriptome | 261 | Y |
| F018667 | Metatranscriptome | 233 | Y |
| F019484 | Metagenome / Metatranscriptome | 229 | Y |
| F020014 | Metagenome / Metatranscriptome | 226 | Y |
| F023858 | Metatranscriptome | 208 | Y |
| F024323 | Metagenome / Metatranscriptome | 206 | Y |
| F035177 | Metatranscriptome | 172 | Y |
| F040511 | Metatranscriptome | 161 | Y |
| F040635 | Metagenome / Metatranscriptome | 161 | Y |
| F047469 | Metatranscriptome | 149 | Y |
| F048725 | Metatranscriptome | 147 | Y |
| F071939 | Metatranscriptome | 121 | N |
| F074356 | Metatranscriptome | 119 | N |
| Scaffold | Taxonomy | Length | IMG/M Link |
|---|---|---|---|
| Ga0103832_1000289 | All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Ciliophora → Intramacronucleata → Spirotrichea | 1481 | Open in IMG/M |
| Ga0103832_1000373 | All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Dinophyceae → Gonyaulacales → Lingulodiniaceae → Lingulodinium → Lingulodinium polyedra | 1357 | Open in IMG/M |
| Ga0103832_1000489 | All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Dinophyceae → Gonyaulacales → Lingulodiniaceae → Lingulodinium → Lingulodinium polyedra | 1230 | Open in IMG/M |
| Ga0103832_1000528 | All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Ciliophora → Intramacronucleata → Spirotrichea → Stichotrichia → Urostylida → Pseudourostylidae → Pseudourostyla → Pseudourostyla cristata | 1202 | Open in IMG/M |
| Ga0103832_1000559 | All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Dinophyceae → Gonyaulacales → Lingulodiniaceae → Lingulodinium → Lingulodinium polyedra | 1178 | Open in IMG/M |
| Ga0103832_1000737 | All Organisms → cellular organisms → Eukaryota → Sar | 1087 | Open in IMG/M |
| Ga0103832_1000827 | Not Available | 1044 | Open in IMG/M |
| Ga0103832_1001365 | Not Available | 872 | Open in IMG/M |
| Ga0103832_1001539 | All Organisms → cellular organisms → Eukaryota | 840 | Open in IMG/M |
| Ga0103832_1001960 | All Organisms → cellular organisms → Eukaryota | 770 | Open in IMG/M |
| Ga0103832_1002042 | All Organisms → cellular organisms → Eukaryota | 760 | Open in IMG/M |
| Ga0103832_1002893 | Not Available | 672 | Open in IMG/M |
| Ga0103832_1003436 | Not Available | 632 | Open in IMG/M |
| Ga0103832_1004091 | All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Ciliophora → Intramacronucleata → Spirotrichea | 595 | Open in IMG/M |
| Ga0103832_1005136 | All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Dinophyceae → Noctilucales → Noctilucaceae → Noctiluca → Noctiluca scintillans | 550 | Open in IMG/M |
| Ga0103832_1005139 | All Organisms → cellular organisms → Eukaryota → Haptista → Haptophyta | 550 | Open in IMG/M |
| Ga0103832_1005224 | All Organisms → cellular organisms → Eukaryota | 547 | Open in IMG/M |
| Ga0103832_1005804 | All Organisms → cellular organisms → Eukaryota | 527 | Open in IMG/M |
| Ga0103832_1005863 | All Organisms → cellular organisms → Eukaryota | 525 | Open in IMG/M |
| Ga0103832_1006330 | Not Available | 511 | Open in IMG/M |
| Scaffold ID | Protein ID | Family | Sequence |
|---|---|---|---|
| Ga0103832_1000289 | Ga0103832_10002892 | F005505 | LDEIRFYGVAPH*YFRPYMGILVISPTHYEGLMRMGLFLGSLACLPLIYNVYNTFNKYVSTIPMQNSILQTTTFTLFMMSLYCANSMLPCGRYYYEPEGGYVGNP*VKFSYQYMYLYLC*LLHHLDLIDHYIFQFSQTFLRKINPNLLKSASGKKTNY* |
| Ga0103832_1000289 | Ga0103832_10002893 | F003081 | MPEPGLVIELREEMFNDTRFGSEVFYMHVRGVDTLMLLSYIHILKKIFLKNYVTAESDG* |
| Ga0103832_1000373 | Ga0103832_10003732 | F003808 | MSPAVEEPGLSLFCFAVYTKNTGSPKPSQELELFRMQRENSWSLFSCAEWAVYSDVVEDLGGGVKTIEVRDVKGDFNILKRKETGCWVNTGMFVQVWSAIRDAGHATNHNWVIKVDADAVFFPSKLVRALSDYTVPQEGVYMENCKYVDWGYFGNLEVFSKQAFITLVDNLETCYTSIPWKDGVLGGKYGPMGEDLFAQKCMDMLGVGRQENWMLTTDGACQADRPEEEKHNKKYVPPCEGVSTPTIHPYKKPEMYRTCWQQAVDA* |
| Ga0103832_1000489 | Ga0103832_10004891 | F018667 | MEISRTANTDVDEMSDSLVMQTEVPRSHSTQKLLGAMAASLLVGAFAGSRLAYHEQPLVSASGDLQELAQIIAKPKRGECSSVKEDCASTGCCDIVGYTCFQTKPGAAKCMKTCTPSATQLCTQPQSIMEPVLQDAVPVGTSMYCFEVYTKDTGTTKKSEELETIQYQYSKGLSIFACDAQDVFADVEVEVGPGLSTISVVDAENDFHFAKRKETGAWVNTGMFTQVWRAIAIGGKYQSADWVVKVDADAVFVPSRLRSKLGAQLVPPSGIYLENCKYVEYGYFGNLEVFSQAAWSTLVDKIDDCKADSQINWKVGVHDGKYGPMGEDLFAQACLDKFGVRRVEAFDITTDGACPADRPIDQQKNKKWKPTCAWTATPAMHPFKKVADWIQCHDATV |
| Ga0103832_1000528 | Ga0103832_10005282 | F011139 | MGLFLGSLAFLPLIYNAYNSFNRYVSTIPMQNSILQTTMFILFMLSLFCANSMLPCGRYYYEPEGGYVGNP* |
| Ga0103832_1000559 | Ga0103832_10005591 | F003808 | MKKVVASPGGSLFCFSCYTANTGSEKPSHELELLQMQHENAWNIFSCAEWAVYSDVVAPLGGGDMTIKVDDVKGDFHFAKRKEAKTWINTGMFVQIWTALRDAGHATNHDWVIKADADAVFFPWKLVDALRSATVPVEGLYMENCKFVEWGYFGNLEVFSKQAFTTLVNNLDTCYTSLPWKVGVHGGKHGPMGEDLFAQKCMDLMGVAKQENFGLTTDGACEADRPEGQKKNKKFVPTCAGVSTPSIHPFKKPEAYRECWAQAASVQP* |
| Ga0103832_1000737 | Ga0103832_10007372 | F040635 | VRHEDIPETNNPKIRFPVLEGDKLMKEVLNGGYNVHPVPPPNSEIKER |
| Ga0103832_1000827 | Ga0103832_10008272 | F074356 | FSPKMAPQCEFGEDVFGAYEAIIGETVIGLWVMSTWSWRITLEIPSFRSGKAKNLVQEYALYAKQLGYDPITFATIVGWMNGIVAAGLVWAIVNPNFQLQSTCGGVMLLLTGFSIYCRRAVGDGWEKCYDAIVLFLMALAITVSSATALSKGCYAYAINGVDHGFRLSCGYTVVSAIVFWGIKSLQAGDLTEWEKFLDAEDEAPAEPTLGEFFFGASKKTDQEEALLA* |
| Ga0103832_1001365 | Ga0103832_10013651 | F006501 | KIASDDIKARLDAIKECEQKSREQDKGVQEDIARKVHAVRVIHSDCRDKELGLTKDANEKCQFLDFLTEPAALPKESADKKTKLAYGETMMGYWCNKDEQFKACAAATDALEPVVKECNKKQTQFESEFCAMAIVYHAQCQDLNDVCYTETRAAYDSSVASTSKLLGKWKIEYQALKKINCFLDVWMENGDANTVSSEKLAACKATEADASILNIDFGTPVKEFVCADAGFGTLPDYPGTPDFVTKEYGAWPDLVQDVIHCHIEDPVAVSTTLGANHPDWEHGDHSDPFA |
| Ga0103832_1001539 | Ga0103832_10015391 | F000491 | GAKEAAEPGSSKAKRELYGFLALSFGDVDTDKDGLINAEQFDQLLAEVAALPRRYGLAPLDVGDAVSRAVNHKILFDTLDTKNGPARGVLGLDQFIEWAYDHVVTHVPKVPAKDVGLYHVEDYSEEEYIGFVEKAVNNPGSYEHASFYNFILNCFVEADEQCQGRITYDQFDKLLSRAATVPRHFGLAPPESSTEARKKMFDELELKRGGKGTGYVTARTFWEWTVVHVRGMIELQKAGKGWRENH* |
| Ga0103832_1001960 | Ga0103832_10019601 | F035177 | PHLAQAWTAESSGDGLPGQVGSESYYVSADKKFKAHKFEYPEQSCTKISLHDPTQLHHIAGGERNYYVGCDSVNCCYSDFQMKSWDIEKSGLFNKVEFVGYEDTTELNDNPVTGAEHWRQATKIPFANVSVGYDYFLHRTDAGDVISHRIDYTADGAQIPGGSILYGDFEVQHDIDTFKQVFTPPAECLKSNVLKCPNQKVSSWEAQFFTRDMASMLV* |
| Ga0103832_1002042 | Ga0103832_10020421 | F002556 | HLITKVATIDIHADVDYYHIEQYGEHQYLTHLEEAVTNPNSRAHASLYEFLLAIFTECDTRSTGVLTFAEFDQLLSRAAEVPRTFGLAPPEASKETRKKFFDSMEDKQMGGVTFRLLLAWTIEHSKGKIAAQKAGKGYKK* |
| Ga0103832_1002893 | Ga0103832_10028931 | F019484 | IFYFLTILGGSLKKIAKKITISVPISKRVYTNASIF* |
| Ga0103832_1003436 | Ga0103832_10034362 | F040511 | MFAGSKSNPYNGAAYLYNADETKCCKTQPKGFGAEKLSVAQGNFYNTLEYVDERDFNGVYYQGKAKYYKLTGVNEPVREFWYFTDQDGKPVQQGEAGTGPTDQGYPTSIGHTIWHDYDQSTFDTSAIDSSVFAVPEACKTTTLKCNFP* |
| Ga0103832_1004091 | Ga0103832_10040911 | F003081 | ELREEMFNDTRYGAEVYYMHVRGVDTLMVLSYVHILKKIFLKNYVTAESDG* |
| Ga0103832_1004379 | Ga0103832_10043791 | F047469 | KTMKASAMKAMKAKTMKASAMKAMKVSVIAKGPRAKAVVFLGLGNKSKTLSGLKKSDLMKSKSGKIVSKALSARGKQLFAQSALKKWSVALQQARKELGITGFCAVNGKTPQGKALYAKVKAILGK* |
| Ga0103832_1005136 | Ga0103832_10051361 | F014625 | QLNDLYHDMDDKRKIEKDLHELVKEWDVLNEVARTDPDLARAHRDGHCHEAVMWYSHHLPEGMKKLLKDKISLPLLSSMKHSMKDVEHGPRVHRAYEEKVTCASCHSFEYPSATVV* |
| Ga0103832_1005139 | Ga0103832_10051391 | F024323 | MDVLESSWFLVSSLIIGIVLLVDPKSSLTGSNTNAVLGLFSSPSSGQQFIYNFSAIPILSFFLLTIVLSLNN* |
| Ga0103832_1005177 | Ga0103832_10051771 | F002457 | QFTDWATTHIAGKIAEIDTSSEVDFYHVSNYSEAEFLKAIEVAVTNKNSREYASLYEFLLTAFVETDATCRGEITYAEFNKLIERAAAVPRTFGLAPPDGTVEARKAIFESMDDTKTGLITFRKFLEWTVTHTAGKVEAHKAGKGYKK* |
| Ga0103832_1005194 | Ga0103832_10051941 | F071939 | WTSMWEPHLIHYEPKKVLIKEWVTSDSFADDISSVFYEVDRDGNHMLEWNNGEIRNFINKVYQMKGLATPCESTMYDMYRIFDEDNNGGLDAVEAQHLAQAHVMSLVTALHL* |
| Ga0103832_1005224 | Ga0103832_10052241 | F023858 | YVKLNGPNCCYCDNVDKPKMWDIADSGLFTKVGFVAYEDTTELNDNPVKGAEHWATSSVLPKVLTVTYDYFLHREDNGDVVSHRINFNTSVEQSGEILYGNFAVQHDLDAHRERFAVPQECKGNILSCCDDMDKVDAKWFRHDFAVRQAEKTVV* |
| Ga0103832_1005343 | Ga0103832_10053431 | F000070 | KQQEAFEKIGAVEQQAELRSAEASRRLGALDLRMSGVQGGLGEHKRDILKLREEVNGLTVKSASHEVDIQKNSDATRKLEKQRNMDEQNWKAQMDAVHDVLDTKVNEKPFEDLKHCVASLTKGVVKFAQVVGVFPGPRFDDAEGVDQSEADVELLGWEECAENMSFRVDKAWRQRCSQRF |
| Ga0103832_1005804 | Ga0103832_10058041 | F020014 | FCPFYRDEPNPEYAPKKKSVNKPRAARPPIVATIQRGIRPISLISIIIKD* |
| Ga0103832_1005863 | Ga0103832_10058631 | F023858 | DSPKQWDIPKSGLFTKVKFNGFEDTTELNDNPVQGAEHWFTNSVLPKVLTVSYDYFLHREDSGDVISHRINFNTSVGQEGSILYGGFQVAHDLDAHRAKFDVPQQCKGNILDCCDNREETMATWFKHDHAVEQATKAEVAV* |
| Ga0103832_1006307 | Ga0103832_10063071 | F048725 | LDDTMATCTQKANDFEARQQLRAEEIQAIEKAIEIISRNAVSGAAGKHLPSMIQQKTTSLAQFRSGSSSPSQFKVAIYLQDKARQLNSRILSALADRVEKDPFKKVKKMIKDLIVKLMEEANEEVEHKGYCDKELATNEHTRKEKTEAVVMLTAEIDELTASIAALTEQI |
| Ga0103832_1006330 | Ga0103832_10063301 | F009426 | ESFSTAAKVNKNVKLLLNMTLDEILYTIIFFMTTSPCVISAGAARETLFEGKADYCMIVCE* |
| ⦗Top⦘ |