


| Basic Information | |
|---|---|
| IMG/M Taxon OID | 3300025142 Open in IMG/M |
| GOLD Reference (Study | Sequencing Project | Analysis Project) | Gs0111384 | Gp0110936 | Ga0210019 |
| Sample Name | Groundwater microbial communities from aquifer - Crystal Geyser CG11_big_fil_rev_8/21/14_0.20 (SPAdes) |
| Sequencing Status | Permanent Draft |
| Sequencing Center | DOE Joint Genome Institute (JGI) |
| Published? | Y |
| Use Policy | Open |
| Dataset Contents | |
|---|---|
| Total Genome Size | 791621780 |
| Sequencing Scaffolds | 21 |
| Novel Protein Genes | 25 |
| Associated Families | 21 |
| Dataset Phylogeny | |
|---|---|
| Taxonomy Groups | Number of Scaffolds |
| All Organisms → cellular organisms → Bacteria → Nitrospinae/Tectomicrobia group → Nitrospinae → Nitrospinia → Nitrospinales → Nitrospinaceae → Candidatus Nitromaritima → Candidatus Nitromaritima sp. SCGC AAA799-C22 | 1 |
| All Organisms → cellular organisms → Bacteria | 4 |
| All Organisms → Viruses → Predicted Viral | 1 |
| All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium | 1 |
| All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Nitrosopumilales → Nitrosopumilaceae → Nitrosopumilus | 1 |
| Not Available | 10 |
| All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium RIFCSPLOWO2_12_FULL_66_21 | 1 |
| All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 1 |
| All Organisms → cellular organisms → Bacteria → Acidobacteria | 1 |
| Ecosystem Assignment (GOLD) | |
|---|---|
| Name | Development Of A Pipeline For High-Throughput Recovery Of Near-Complete And Complete Microbial Genomes From Complex Metagenomic Datasets |
| Type | Environmental |
| Taxonomy | Environmental → Aquatic → Freshwater → Groundwater → Unclassified → Groundwater → Development Of A Pipeline For High-Throughput Recovery Of Near-Complete And Complete Microbial Genomes From Complex Metagenomic Datasets |
| Alternative Ecosystem Assignments | |
|---|---|
| Environment Ontology (ENVO) | freshwater biome → aquifer → groundwater |
| Earth Microbiome Project Ontology (EMPO) | Free-living → Non-saline → Subsurface (non-saline) |
| Location Information | ||||||||
|---|---|---|---|---|---|---|---|---|
| Location | USA: Utah: Grand County | |||||||
| Coordinates | Lat. (o) | 38.9383 | Long. (o) | -110.1342 | Alt. (m) | N/A | Depth (m) | N/A | Location on Map |
| Zoom: | Powered by OpenStreetMap © | |||||||
| Family | Category | Number of Sequences | 3D Structure? |
|---|---|---|---|
| F016496 | Metagenome / Metatranscriptome | 246 | N |
| F025056 | Metagenome / Metatranscriptome | 203 | Y |
| F025862 | Metagenome | 200 | Y |
| F037250 | Metagenome / Metatranscriptome | 168 | Y |
| F039194 | Metagenome | 164 | Y |
| F043249 | Metagenome / Metatranscriptome | 156 | Y |
| F043771 | Metagenome | 155 | Y |
| F047487 | Metagenome / Metatranscriptome | 149 | N |
| F071959 | Metagenome / Metatranscriptome | 121 | Y |
| F071971 | Metagenome | 121 | N |
| F075683 | Metagenome | 118 | N |
| F075685 | Metagenome | 118 | Y |
| F076867 | Metagenome | 117 | N |
| F079604 | Metagenome / Metatranscriptome | 115 | N |
| F083231 | Metagenome / Metatranscriptome | 113 | Y |
| F091390 | Metagenome | 107 | Y |
| F094905 | Metagenome | 105 | N |
| F094906 | Metagenome | 105 | N |
| F096615 | Metagenome / Metatranscriptome | 104 | N |
| F099328 | Metagenome / Metatranscriptome | 103 | Y |
| F102531 | Metagenome | 101 | N |
| Scaffold | Taxonomy | Length | IMG/M Link |
|---|---|---|---|
| Ga0210019_1005994 | All Organisms → cellular organisms → Bacteria → Nitrospinae/Tectomicrobia group → Nitrospinae → Nitrospinia → Nitrospinales → Nitrospinaceae → Candidatus Nitromaritima → Candidatus Nitromaritima sp. SCGC AAA799-C22 | 9948 | Open in IMG/M |
| Ga0210019_1009172 | All Organisms → cellular organisms → Bacteria | 7424 | Open in IMG/M |
| Ga0210019_1022542 | All Organisms → cellular organisms → Bacteria | 3980 | Open in IMG/M |
| Ga0210019_1033154 | All Organisms → cellular organisms → Bacteria | 3061 | Open in IMG/M |
| Ga0210019_1045071 | All Organisms → Viruses → Predicted Viral | 2482 | Open in IMG/M |
| Ga0210019_1046634 | All Organisms → cellular organisms → Bacteria → Proteobacteria → delta/epsilon subdivisions → Deltaproteobacteria → unclassified Deltaproteobacteria → Deltaproteobacteria bacterium | 2427 | Open in IMG/M |
| Ga0210019_1053012 | All Organisms → cellular organisms → Bacteria | 2218 | Open in IMG/M |
| Ga0210019_1053226 | All Organisms → cellular organisms → Archaea → TACK group → Thaumarchaeota → Nitrosopumilales → Nitrosopumilaceae → Nitrosopumilus | 2213 | Open in IMG/M |
| Ga0210019_1108780 | Not Available | 1356 | Open in IMG/M |
| Ga0210019_1142019 | Not Available | 1125 | Open in IMG/M |
| Ga0210019_1160498 | Not Available | 1032 | Open in IMG/M |
| Ga0210019_1188853 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium RIFCSPLOWO2_12_FULL_66_21 | 919 | Open in IMG/M |
| Ga0210019_1218928 | Not Available | 828 | Open in IMG/M |
| Ga0210019_1220928 | All Organisms → cellular organisms → Bacteria → Acidobacteria → unclassified Acidobacteria → Acidobacteria bacterium | 823 | Open in IMG/M |
| Ga0210019_1239677 | Not Available | 777 | Open in IMG/M |
| Ga0210019_1257479 | Not Available | 738 | Open in IMG/M |
| Ga0210019_1264980 | Not Available | 723 | Open in IMG/M |
| Ga0210019_1316769 | Not Available | 636 | Open in IMG/M |
| Ga0210019_1323959 | All Organisms → cellular organisms → Bacteria → Acidobacteria | 626 | Open in IMG/M |
| Ga0210019_1392871 | Not Available | 546 | Open in IMG/M |
| Ga0210019_1409904 | Not Available | 529 | Open in IMG/M |
| Scaffold ID | Protein ID | Family | Sequence |
|---|---|---|---|
| Ga0210019_1005994 | Ga0210019_10059948 | F083231 | MSSKDGKIFDTIVMIGLGINGLTAVYLLLMFFEII |
| Ga0210019_1009172 | Ga0210019_10091722 | F076867 | MDEKNRNLVEIAKKKRYIAIVEKLGRGSLSSKELKELEEFEKAEQRPLDGVIDGTVALPILCVYLEKSPRMIRRYVQQGMPVIRDAAGEIARFKVGEVFKWFYGKQGSEEDNGKDYWDKEYRKNRAKLSEIELKQKEGEVIPFEDHVSIVKNQIRGIKAGFLRLPKHVAPKLYQQDPKVICEMLDQEIRYIIEQFAGKQNVNKAGKRNS |
| Ga0210019_1022542 | Ga0210019_10225425 | F102531 | LFADPIGTLQGLQASAQQVQSVAQQGLTELSFASSILNLPNADNSAGWTLGTYGRQTNVIQPQKTVAGINQEIDKLVANGVDKLEAIRQVGSVSVPDYAATPEQIAQLQLADKARTADEILQCPYTWCRHNSTVSDAILESRGYQPYRAAMMMQTTEGLSAGGHQVSAIIVNGEPVFIDLTNNLIIPGQQALEQILLNSGKQLTALEMIRLTTNNVWDVINLIPK |
| Ga0210019_1033154 | Ga0210019_10331542 | F076867 | MDEKNQNLVELAKKKRYIALVEKLGRGSLGPKELKELEEFEKVEQAKETGVIDGAVDLGTISIYLEKSSRMVRRYVSQGMPVIRDSSGELSRFKVNDVFKWVYGNKGKDTEDKDYWENEYRKNRAKLSELELKQKEGELIPFADHVSVVKNQIRGIRAGLLRLPKHVAPKLYQQDPKLICEMLDQEIRYIINQFAGVKSNDKANKRGA |
| Ga0210019_1045071 | Ga0210019_10450712 | F075683 | MVSIYTTHMQPATGWIDQCGEVEIRGGFGVGEMIRCDCCGKKRPAEDCVVQCYYDGMSVWCAEGKGCKSPEEIEQKRLIAHENRSRGQRARRAKEKND |
| Ga0210019_1046634 | Ga0210019_10466341 | F043249 | LIRVDGSGDLRQGNVGVFGAVQSRSQLFVRQGADVAVSNLTTGGFFPSDQTFVTLAVRVWTYFRFNVESQRTDAQNTTGPVASLAGVTADRIQRVHKLYHQAENQLFWQFIAGDKPQLTTFTAYTPAAGGLDGFFSDTRLPRANNGVPTSAALMRLARPILVPPRQGFQVVAIASPIGQAQGASIIEQLNGAVPNNDPWGASSTGGTVTGTVGTNGLTTTGRDDIEKDIKYLIDGIHSRDVL |
| Ga0210019_1053012 | Ga0210019_10530124 | F102531 | QGTDSPACQYGAMGITTGFFADPIGTLQGLQASAQQVQSVAQQGLTELSFASSILNLPNADNSAGWTLGTYGGQTIPIQPQKTIAQINQEIDNLVARGVDELEAIRQVGSISIPNYATTPEQIAALRLADQARTADEILQCPYTWCRHNSAISDAILESRDYQPYRAAMTMQTTEGLSAGGHQTSAIIINGEPVFIDLTNNLIITGQQALEQVLINSEKQLTALEMIRLTTNNVWDVINLIPK |
| Ga0210019_1053226 | Ga0210019_10532264 | F071971 | MIREVFYNAESTISAMNNTAKYGLENLNNVHEAFSIGFAITHADMAIMTAPETIAVHRYTGPTMGSPFSVVKKNGAIENTTITKVEIAMVLADVFSDWTAVSIGTFTLL |
| Ga0210019_1058654 | Ga0210019_10586541 | F099328 | IHAGTAEVCLVFFNRRIFLGEDLRAAIDFQARRALFKSCVRIGRTQ |
| Ga0210019_1108780 | Ga0210019_11087802 | F071959 | MASRNYLVMSNDMTLSDKKEYRLNALSAGLERCGLRGIGDIKADIPGLAGIPDANKVARVKLIHNYLITGQWPRSIDQRELTTGTDLVVAPAVDSWLTAPMAAVGNIVSCFQGVAAPQLVQGKLMVCYAVSVESSAVPMPVSRLIFRRGAAGNVQAQFDMEAMGVRWEVDAFFSEVVVIDPQDVFAIQVRCRNATAVAEIVHIHNFLFESAGLVVA |
| Ga0210019_1142019 | Ga0210019_11420192 | F043771 | MDEIPLKCECECGCGEDATTSDEGEALCEACAEYTVDPESGEVVCSRDPRAEEVTESCGAGGQTHSYWRLRPPESPAPSPNGEWACYWDTAGDGSRVVSRHETEAEAAQAVAAQDWPAPGDHTHYLCGFEVRKWDGEAWIEPGALTP |
| Ga0210019_1148984 | Ga0210019_11489842 | F094906 | NTSGTSHILPIKYLKPNTTTYFVIKAYTATENVQSSQYSVAVPNGDSKVYDALLSVSSGQTKISWYYFVASPPDNFSARYRKNGGSWNPWSISRTGQRYYTDWVNTGWTTGDQLDIDIYNPSNRSETNIQDSILYGNDCF |
| Ga0210019_1149202 | Ga0210019_11492023 | F094905 | MTTLYGYIAAIVLIVFLGAGWAHEHDKRIVFEAQVEQAGKDAAKHTAEIDAKHREEMQNAEQNTIIATNSIADWYRAHPAVRVRYANTDCSAVPSTDNNPGIPDDSTT |
| Ga0210019_1160498 | Ga0210019_11604983 | F047487 | MTTIYGYIAAAVLIIFLGAGWAHEHDKRIVFEAQVEQAGKDAAKHTAEINAKHQEEMQNAEQNTIIATNSISDWYRAHPVVRVRYTNTDCSTMPGTANNPPIPDDTSPSGYVSPYNPEDTEQVANRLDQLQKLLRADGVRIE |
| Ga0210019_1188853 | Ga0210019_11888532 | F025862 | MPRIARPRVQEEGSRPAAEKRFVSRKNAAAEPLDDVERQFLEPVARYLTILREWSLERLSDGTGTVPPEQP |
| Ga0210019_1218928 | Ga0210019_12189281 | F047487 | MNALYGYIAVFVVIVFLGAGWAVEHDKRITYQAKVEQAGADALAQTEKINAKHREEMQNAEHNTIIATNSIADWYRAHPAVRVRYANTDCSAMPSTDNNPGIPDDSTASGYASPYSPESTEQVASRLDQLQKLLRADGVRVE |
| Ga0210019_1220928 | Ga0210019_12209282 | F037250 | MAAFLDFGSGRSLKTGLLRLTKSRNLSASTCFSRNALSGGSLLMSICVTSIPCVSRKLLAFLQVVQPGFV |
| Ga0210019_1239677 | Ga0210019_12396772 | F075685 | MSSTTTRTLYDPNTHPVAAAYVDANAAERHAARETWLASDEERGYRFAVYRTVGLRHCDSVPRRRESPHLCRPSELRHRVDLWARAIASALESQTATYWLICHAYPVHPRTGRADHGLLAEYILAVHPDVPSCIGYL |
| Ga0210019_1257479 | Ga0210019_12574791 | F079604 | MPTARRGITFSLNRKKFKLNTFLQDIRKSPIAFMEICQYQGRQVQKFKQTKRILRLISENNKSIAVLPRGASKSFSLAIIALWYFYTRENFRVAIFSRSHRQSKAVLEICSDIIDSSPLLKTSRQSFQIDQKQRLKSHINSEIIAHPFDASTVLGEHPDIVLADECAFFGDDSFFRMVVLPMQSGVRTIEKIPKISLVSTIDQQEGFFYDVWKNPEKYGYTKLKMTWQECDGYTK |
| Ga0210019_1259992 | Ga0210019_12599922 | F094906 | NTSGTSHILPIKYLKPNTTTYFVIKAYTATENVQSSQYSVAVSNADSKVYDVYMNVNSGQVRLRWHYFVASPPNNFYAYYRKTAWYNWNAHYSSGDEYVTDYANIGWTTGDQLDVDFYNPSNRSETNIQDSILYGNQCF |
| Ga0210019_1264980 | Ga0210019_12649802 | F096615 | MAGIYGNNPEDRARQQELNKYLDGANRQDNDDEHDERIKELARDKFNALPNFYSPKDERYRYTYMDDAMGSLKQEELIALARLLRDGNYMQAGKLLEASLMRILTAEAEEEI |
| Ga0210019_1316769 | Ga0210019_13167691 | F091390 | MKSTESTTIRINKSTKGLLDNLDFVRKDTYDDIIIKLMDYYKRKKG |
| Ga0210019_1323959 | Ga0210019_13239592 | F025056 | MKIVKTAVLAGVLAASGALVAAQEKPVPKDSMRIFVPGCTKGQIFTAGPKTEDQPGRSDIPEGMHLRMNGPRKLMAEIKAHEGSMIEI |
| Ga0210019_1392871 | Ga0210019_13928711 | F016496 | GNKYQQKGSAGLRYGGTTPVAKDKEKNLERLRRGWKPKTLICKDITISDDEEAFANEFSKKVSEENDTYVDTFLIELAIAQILQVHRVYVYAKEKKISRDASRMIGTVLSTLREMNATKNARKEDNIKVTVNSDIMTLIQQNLNLISNDESKKGNNIDKK |
| Ga0210019_1409904 | Ga0210019_14099041 | F039194 | RKVDFLEGIPAFQCIDITAGAGLAALTVTGRVNVTNLEMADNEFGLWRWYPIDDAQVRLYHPTGIAKYQLRNLQVPVDMNIVLRDPNLVSTEIAVWQNNRPGVEAINGHAFALGAVRLIAIGYRFHSVDLESGKNADPMLVKAIKEGKAPCTDIWCSGRGTGD |
| ⦗Top⦘ |