


| Basic Information | |
|---|---|
| IMG/M Taxon OID | 3300009796 Open in IMG/M |
| GOLD Reference (Study | Sequencing Project | Analysis Project) | Gs0114663 | Gp0127653 | Ga0105086 |
| Sample Name | Groundwater microbial communities from the Columbia River, Washington, USA - GW-RW S2_0_10 |
| Sequencing Status | Permanent Draft |
| Sequencing Center | DOE Joint Genome Institute (JGI) |
| Published? | N |
| Use Policy | Open |
| Dataset Contents | |
|---|---|
| Total Genome Size | 38093636 |
| Sequencing Scaffolds | 24 |
| Novel Protein Genes | 26 |
| Associated Families | 25 |
| Dataset Phylogeny | |
|---|---|
| Taxonomy Groups | Number of Scaffolds |
| All Organisms → Viruses → Predicted Viral | 1 |
| All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium | 1 |
| All Organisms → cellular organisms → Bacteria → Nitrospirae | 5 |
| All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Nitrosomonadales → Gallionellaceae → Candidatus Nitrotoga → unclassified Candidatus Nitrotoga → Candidatus Nitrotoga sp. CP45 | 1 |
| All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium | 1 |
| Not Available | 7 |
| All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → Nitrospirales → Nitrospiraceae → Nitrospira → unclassified Nitrospira → Nitrospira sp. | 1 |
| All Organisms → cellular organisms → Archaea | 3 |
| All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → Nitrospirales → Nitrospiraceae | 1 |
| All Organisms → cellular organisms → Bacteria | 1 |
| All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium | 1 |
| All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia → Blastocatellales → Pyrinomonadaceae → Pyrinomonas → Pyrinomonas methylaliphatogenes | 1 |
| Ecosystem Assignment (GOLD) | |
|---|---|
| Name | Groundwater Microbial Communities From The Columbia River, Washington, Usa |
| Type | Environmental |
| Taxonomy | Environmental → Terrestrial → Soil → Sand → Unclassified → Groundwater Sand → Groundwater Microbial Communities From The Columbia River, Washington, Usa |
| Alternative Ecosystem Assignments | |
|---|---|
| Environment Ontology (ENVO) | freshwater river biome → river bed → sediment |
| Earth Microbiome Project Ontology (EMPO) | Free-living → Non-saline → Subsurface (non-saline) |
| Location Information | ||||||||
|---|---|---|---|---|---|---|---|---|
| Location | USA: Columbia River, Washington | |||||||
| Coordinates | Lat. (o) | 46.372 | Long. (o) | -119.272 | Alt. (m) | N/A | Depth (m) | N/A | Location on Map |
| Zoom: | Powered by OpenStreetMap © | |||||||
| Family | Category | Number of Sequences | 3D Structure? |
|---|---|---|---|
| F004137 | Metagenome / Metatranscriptome | 451 | Y |
| F005305 | Metagenome / Metatranscriptome | 405 | N |
| F007831 | Metagenome / Metatranscriptome | 344 | Y |
| F007886 | Metagenome / Metatranscriptome | 343 | Y |
| F008161 | Metagenome / Metatranscriptome | 338 | Y |
| F009436 | Metagenome | 318 | Y |
| F009467 | Metagenome | 317 | Y |
| F012359 | Metagenome | 281 | Y |
| F012915 | Metagenome / Metatranscriptome | 276 | Y |
| F013894 | Metagenome | 267 | N |
| F015807 | Metagenome / Metatranscriptome | 252 | Y |
| F018577 | Metagenome | 234 | Y |
| F021258 | Metagenome / Metatranscriptome | 219 | Y |
| F021408 | Metagenome / Metatranscriptome | 219 | Y |
| F025366 | Metagenome / Metatranscriptome | 202 | Y |
| F029045 | Metagenome | 189 | Y |
| F037643 | Metagenome / Metatranscriptome | 167 | Y |
| F055103 | Metagenome / Metatranscriptome | 139 | Y |
| F055512 | Metagenome / Metatranscriptome | 138 | Y |
| F067500 | Metagenome | 125 | Y |
| F072619 | Metagenome / Metatranscriptome | 121 | Y |
| F079840 | Metagenome / Metatranscriptome | 115 | Y |
| F084286 | Metagenome / Metatranscriptome | 112 | Y |
| F091213 | Metagenome / Metatranscriptome | 107 | Y |
| F093898 | Metagenome / Metatranscriptome | 106 | Y |
| Scaffold | Taxonomy | Length | IMG/M Link |
|---|---|---|---|
| Ga0105086_100019 | All Organisms → Viruses → Predicted Viral | 4646 | Open in IMG/M |
| Ga0105086_100725 | All Organisms → cellular organisms → Bacteria → Nitrospirae → unclassified Nitrospirae → Nitrospirae bacterium | 1167 | Open in IMG/M |
| Ga0105086_100766 | All Organisms → cellular organisms → Bacteria → Nitrospirae | 1153 | Open in IMG/M |
| Ga0105086_100785 | All Organisms → cellular organisms → Bacteria → Nitrospirae | 1148 | Open in IMG/M |
| Ga0105086_101103 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Nitrosomonadales → Gallionellaceae → Candidatus Nitrotoga → unclassified Candidatus Nitrotoga → Candidatus Nitrotoga sp. CP45 | 1050 | Open in IMG/M |
| Ga0105086_103320 | All Organisms → cellular organisms → Bacteria → Nitrospirae | 791 | Open in IMG/M |
| Ga0105086_104419 | All Organisms → cellular organisms → Bacteria → Nitrospirae | 740 | Open in IMG/M |
| Ga0105086_104501 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Bradyrhizobiaceae → Bradyrhizobium | 737 | Open in IMG/M |
| Ga0105086_104935 | All Organisms → cellular organisms → Bacteria → Nitrospirae | 720 | Open in IMG/M |
| Ga0105086_105898 | Not Available | 690 | Open in IMG/M |
| Ga0105086_108439 | Not Available | 632 | Open in IMG/M |
| Ga0105086_108616 | Not Available | 629 | Open in IMG/M |
| Ga0105086_109989 | Not Available | 608 | Open in IMG/M |
| Ga0105086_110602 | All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → Nitrospirales → Nitrospiraceae → Nitrospira → unclassified Nitrospira → Nitrospira sp. | 600 | Open in IMG/M |
| Ga0105086_110727 | Not Available | 598 | Open in IMG/M |
| Ga0105086_114734 | All Organisms → cellular organisms → Archaea | 555 | Open in IMG/M |
| Ga0105086_115802 | All Organisms → cellular organisms → Archaea | 546 | Open in IMG/M |
| Ga0105086_116772 | All Organisms → cellular organisms → Archaea | 539 | Open in IMG/M |
| Ga0105086_117098 | All Organisms → cellular organisms → Bacteria → Nitrospirae → Nitrospira → Nitrospirales → Nitrospiraceae | 536 | Open in IMG/M |
| Ga0105086_118016 | All Organisms → cellular organisms → Bacteria | 530 | Open in IMG/M |
| Ga0105086_118542 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium | 527 | Open in IMG/M |
| Ga0105086_118796 | Not Available | 525 | Open in IMG/M |
| Ga0105086_119223 | Not Available | 523 | Open in IMG/M |
| Ga0105086_119865 | All Organisms → cellular organisms → Bacteria → Acidobacteria → Blastocatellia → Blastocatellales → Pyrinomonadaceae → Pyrinomonas → Pyrinomonas methylaliphatogenes | 519 | Open in IMG/M |
| Scaffold ID | Protein ID | Family | Sequence |
|---|---|---|---|
| Ga0105086_100019 | Ga0105086_1000195 | F084286 | MAFNCGVSFGACMVRITKVDENGNVIAGDNSYVTDKPISISVNPNIETGNAFSVRNGCGCSISRRKFPDTFNWWELSLQTATLEPEMIAFMLGADTIDDGADVVGVAFPSALACEDANPAVAFEFWSEHVVGSGLDATYPYFHWVFPSSVWQIGDNTFEEGPAQPTLNAFTQTNGNWGDGPYGDGPPDSQDISEGGFWATADALPTAECAAQAVTATS* |
| Ga0105086_100725 | Ga0105086_1007252 | F055512 | MKEINRSQSMALDDVQRGKLFGGSPVLRDLVGPRCAHCGSPRYFLVFRISGDGRNGILTGRCSRCRNPRELKADEIEQGYHV* |
| Ga0105086_100766 | Ga0105086_1007662 | F025366 | LLMLLSTVTPRQARSNVEAWLIYFGIENIPPWLADKYTDTWVSWIAFVGFCVWAIHLYLRPNLRNGKLSVMIRAGEPWVQVDPGIDQSQASHTSGKLYTYRIALINSADSTLRNVEVKLMSLEKKPQNFHAMGSHLKLRHDQVGTTNFNVHPTKDSQFMDAVFVDVFKFFSGPEGSSFLHVMSLPEDMNRRIPVDKYDVKIIATSERGEMAVAEIAFVPRPDHIPDFRLLNMQSFPGATRSSS* |
| Ga0105086_100785 | Ga0105086_1007852 | F009436 | MDLAHISSAAELLTIEHPHFGRIQPPGFLKTTIGTVTGTSRKAVVIADYSSGNRQTVAVQIDPRNFMITIASSFVLVTLKEGESRHSVIHKSWRLVKQRKGEGVEDFFGTVSMYVDSDGLKWEHRFDRTPDEDELIGFGSNGEEVLIITLESFYPTPAPGVSGVIISPTS* |
| Ga0105086_100785 | Ga0105086_1007853 | F007831 | MNIFATIMFLLPLVLVTLYLYVSGVFTNRRAFQFTFYTLLTFFALIIGIGGVQLSSQGYDLPATAPVSLSEVLPEP* |
| Ga0105086_101103 | Ga0105086_1011031 | F093898 | MMRFLLARAVNPQLSLLDGLRNWNHVASGLREGRRKRTYQH |
| Ga0105086_103320 | Ga0105086_1033202 | F008161 | GFHALLIWMGAWHFLSKEAGEGSLLASDLFAGGLLGATALSLLRLSVGRHLLRFVALIGILLVLYHVWQGKAEWGSLTDVAIWAYTFYSFGPSSWRHALYSPLGVFEKLKT* |
| Ga0105086_104419 | Ga0105086_1044191 | F009467 | RSKIFEALGASFFEAFEEEDLYTRIDLFQKLTQLSHGVTASWNTEDIVDKALDELLSEILTSKVALREFS* |
| Ga0105086_104501 | Ga0105086_1045011 | F015807 | MERLMRHRGESRETDRPTWSHHRRFFGGQGACRDVGCERVEENHRVVINGSYPDR |
| Ga0105086_104935 | Ga0105086_1049351 | F021408 | MKKTSIAIATICLLWAAVATVPTTEAATTFAGAIVGIDQTQRTITYQTEDGRTWTLPVTDSNILKQEQIAKGDRVRVEVEVSDDLSQRITKITKIPDQPRTKPTQSLNDVRP* |
| Ga0105086_105898 | Ga0105086_1058981 | F004137 | MSASLILLGLVGWALGLLFVLALFRMSGEQDRAARHAQQRLHPYSDVTITRTATMRLR* |
| Ga0105086_107130 | Ga0105086_1071302 | F021258 | FHQRRMQAPLGVTRLFDRMDFRAQIVRAQEIVGDPQSSRGVSF* |
| Ga0105086_108439 | Ga0105086_1084391 | F079840 | KSTFKVKFTGIQEKVVFRCDVFLLEKIKNTLASQHLAGNYEYPNLSSFFRSALHACQHGMPLTYQRELSNPRKEVSFRLTGELMTFYGSLPLSSRTGIMERALGSYYYRVFG* |
| Ga0105086_108616 | Ga0105086_1086161 | F029045 | MALLKFEPPKELAPLRVKIARPVHERVERYRQFCGALALADLVERALVYVVDHGPDFRAHEKGSATIRAKVEPRPNASRVSPVDQADGAP* |
| Ga0105086_109989 | Ga0105086_1099891 | F055103 | MLAGRTTLASGLRGFWPVLAGHLGHARLANRMIYWLRRSSPTWLAQNDPENEGAKILLSRRVSSLRPFRVLMKYPG* |
| Ga0105086_110602 | Ga0105086_1106021 | F072619 | MNLLSSLMFLMGVWLGENDHTRRITVQEIAPSDRTNLALGKKSRGRDGAEPLLHGSAIVIGVAEESLSTPATAEQECSEWRVFVLRSLCSQAKVQVVACRLCIPKVELHGLAFLNHVSDRDGSGLLIRSNEVPNEEVAPLEMTPMLIDHNAQ |
| Ga0105086_110727 | Ga0105086_1107272 | F091213 | MGRETLGIESVGLSTPREKLRSGAGQNELAVDQVLANFVAELFATLHPATLPMLERMFEDVKQDAETDGEWELKRRAEIGIAAARIAAREQEAIEN* |
| Ga0105086_114734 | Ga0105086_1147341 | F005305 | GPLIENFQNLVKDAKAKKLDSLREDEDFKKEFNQLSKDVIEPMMRKFESYLKCKDVNSSVNIQSQIVAGKNPSIEFSLHFKLTHESRYPNIKFSSSGKKISIQEERLMAKGEVRQDMMPEYYDNEQITEEFVKERLIRLIKSCFDKNWQSFYP* |
| Ga0105086_115802 | Ga0105086_1158021 | F067500 | MVFQHKCYYCTTNGFNSTDEYEPHVVTRHPNLSGYQCPSDTTFYGLEKQGMPWERKMKNDIEWNQL |
| Ga0105086_116772 | Ga0105086_1167721 | F012359 | LSIILDSLLLKTKEITKSKRGEPKTSSILLISAALIMAIILLMYAGISYTEEMMKQYSMCRDKIVGYGQHGMYPSAEEFKLALSFCD* |
| Ga0105086_117098 | Ga0105086_1170981 | F007886 | AVLEVLEVVLDGVLAGADEGGAVGVVGVVDVVWSDALGAVSFFSVVAGVGASLPEEGFILSE* |
| Ga0105086_118016 | Ga0105086_1180162 | F037643 | MGAAEIISLEEVHARKQWDTLRHQLHERFDQWLDTLETQWPEPPASLPEMTATVWDLRQQLTGGITEAIVAHVHRGEHDRTQGNCPRCDGVLRAQ |
| Ga0105086_118542 | Ga0105086_1185421 | F012915 | VGVPPRGVMTIQRYGELGGRVGCIPTVLQVAMLHGMRQCLEELKKNGTEAAYFKNTPGIEDTRKWYANMGNVELKELEKKYGY* |
| Ga0105086_118796 | Ga0105086_1187961 | F013894 | MTYQLEEPFFEDKGKITVQKEIGDNKTQMTFSSNGTFKGNIEVTNSGDLVSLSKGNKGTSAQGQGVVTTKDGSEEANYTFLQVGKAITKDGNPVLRGVGSAVWSTDSTDKLAF |
| Ga0105086_119223 | Ga0105086_1192232 | F055103 | MLAAQDTLASGLRGFWLVVDGHLGHARLANRMIYWLRRSSPTWLAHNDPEIRVLMKYPG* |
| Ga0105086_119865 | Ga0105086_1198651 | F018577 | CRRKMITILWTGVLAIGTIILIYKEMTALLYILATLGVTALLLVVAMADLSHAERLTNDPRRSEDSAAIGSGITSTFGAKQP* |
| ⦗Top⦘ |