


| Basic Information | |
|---|---|
| IMG/M Taxon OID | 3300026939 Open in IMG/M |
| GOLD Reference (Study | Sequencing Project | Analysis Project) | Gs0095510 | Gp0055684 | Ga0207542 |
| Sample Name | Soil microbial communities from Arlington Agricultural Research Station in Wisconsin, USA - Nitrogen cycling UWRJ-G10A5-12 (SPAdes) |
| Sequencing Status | Permanent Draft |
| Sequencing Center | DOE Joint Genome Institute (JGI) |
| Published? | N |
| Use Policy | Open |
| Dataset Contents | |
|---|---|
| Total Genome Size | 26580529 |
| Sequencing Scaffolds | 23 |
| Novel Protein Genes | 24 |
| Associated Families | 23 |
| Dataset Phylogeny | |
|---|---|
| Taxonomy Groups | Number of Scaffolds |
| All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 3 |
| All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 2 |
| All Organisms → cellular organisms → Bacteria → Proteobacteria | 1 |
| Not Available | 11 |
| All Organisms → cellular organisms → Bacteria | 2 |
| All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodobacterales → Rhodobacteraceae → Rhodovulum → unclassified Rhodovulum → Rhodovulum sp. PH10 | 1 |
| All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 1 |
| All Organisms → cellular organisms → Bacteria → Terrabacteria group | 1 |
| All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Hyphomicrobiaceae → Methyloceanibacter → Methyloceanibacter methanicus | 1 |
| Ecosystem Assignment (GOLD) | |
|---|---|
| Name | Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts) |
| Type | Environmental |
| Taxonomy | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Soil → Soil Microbial Communities From Arlington Agricultural Research Station In Wisconsin And Kellogg Biological Station In Michigan, Replicating The Bioenergy Cropping Systems Trials (Bcsts) |
| Alternative Ecosystem Assignments | |
|---|---|
| Environment Ontology (ENVO) | terrestrial biome → agricultural field → agricultural soil |
| Earth Microbiome Project Ontology (EMPO) | Free-living → Non-saline → Soil (non-saline) |
| Location Information | ||||||||
|---|---|---|---|---|---|---|---|---|
| Location | Wisconsin, United States | |||||||
| Coordinates | Lat. (o) | 43.2958 | Long. (o) | -89.3799 | Alt. (m) | N/A | Depth (m) | N/A | Location on Map |
| Zoom: | Powered by OpenStreetMap © | |||||||
| Family | Category | Number of Sequences | 3D Structure? |
|---|---|---|---|
| F000268 | Metagenome / Metatranscriptome | 1411 | Y |
| F001033 | Metagenome / Metatranscriptome | 799 | Y |
| F002616 | Metagenome / Metatranscriptome | 543 | Y |
| F003758 | Metagenome / Metatranscriptome | 470 | Y |
| F005366 | Metagenome / Metatranscriptome | 403 | Y |
| F006477 | Metagenome / Metatranscriptome | 372 | Y |
| F012929 | Metagenome / Metatranscriptome | 276 | Y |
| F015863 | Metagenome / Metatranscriptome | 251 | Y |
| F017759 | Metagenome | 239 | N |
| F020986 | Metagenome / Metatranscriptome | 221 | Y |
| F024822 | Metagenome / Metatranscriptome | 204 | N |
| F031563 | Metagenome | 182 | Y |
| F046968 | Metagenome / Metatranscriptome | 150 | Y |
| F054151 | Metagenome / Metatranscriptome | 140 | N |
| F056406 | Metagenome | 137 | N |
| F068085 | Metagenome | 125 | Y |
| F068641 | Metagenome / Metatranscriptome | 124 | Y |
| F075090 | Metagenome / Metatranscriptome | 119 | N |
| F080568 | Metagenome / Metatranscriptome | 115 | Y |
| F081321 | Metagenome / Metatranscriptome | 114 | N |
| F085779 | Metagenome / Metatranscriptome | 111 | N |
| F087420 | Metagenome / Metatranscriptome | 110 | Y |
| F102094 | Metagenome / Metatranscriptome | 102 | Y |
| Scaffold | Taxonomy | Length | IMG/M Link |
|---|---|---|---|
| Ga0207542_100164 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 1373 | Open in IMG/M |
| Ga0207542_100314 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 1142 | Open in IMG/M |
| Ga0207542_100489 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 1022 | Open in IMG/M |
| Ga0207542_100541 | Not Available | 989 | Open in IMG/M |
| Ga0207542_100737 | All Organisms → cellular organisms → Bacteria | 913 | Open in IMG/M |
| Ga0207542_100756 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Rhodobacterales → Rhodobacteraceae → Rhodovulum → unclassified Rhodovulum → Rhodovulum sp. PH10 | 907 | Open in IMG/M |
| Ga0207542_100998 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria | 835 | Open in IMG/M |
| Ga0207542_101227 | Not Available | 786 | Open in IMG/M |
| Ga0207542_101275 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 777 | Open in IMG/M |
| Ga0207542_101757 | Not Available | 699 | Open in IMG/M |
| Ga0207542_101942 | All Organisms → cellular organisms → Bacteria | 678 | Open in IMG/M |
| Ga0207542_102236 | Not Available | 642 | Open in IMG/M |
| Ga0207542_102272 | Not Available | 639 | Open in IMG/M |
| Ga0207542_102487 | Not Available | 619 | Open in IMG/M |
| Ga0207542_102493 | Not Available | 619 | Open in IMG/M |
| Ga0207542_102634 | Not Available | 608 | Open in IMG/M |
| Ga0207542_102803 | Not Available | 596 | Open in IMG/M |
| Ga0207542_103060 | Not Available | 581 | Open in IMG/M |
| Ga0207542_103347 | All Organisms → cellular organisms → Bacteria → PVC group → Verrucomicrobia → unclassified Verrucomicrobia → Verrucomicrobia bacterium | 564 | Open in IMG/M |
| Ga0207542_103558 | Not Available | 553 | Open in IMG/M |
| Ga0207542_104003 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales | 532 | Open in IMG/M |
| Ga0207542_104314 | All Organisms → cellular organisms → Bacteria → Terrabacteria group | 521 | Open in IMG/M |
| Ga0207542_104481 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Alphaproteobacteria → Hyphomicrobiales → Hyphomicrobiaceae → Methyloceanibacter → Methyloceanibacter methanicus | 515 | Open in IMG/M |
| Scaffold ID | Protein ID | Family | Sequence |
|---|---|---|---|
| Ga0207542_100164 | Ga0207542_1001641 | F001033 | MHLTKMVPRSARIWTRQPKIERRFMKFVSLTKSTAVPHQILGTSTNEKRELLMCGHSLIARLTIAVFVFQMLGVTSVVHAETPDSTAGTSSAGTRKLFIDPSSTSVALRGKASLIVSPLTHRDGNYVGDYQLKVRPYFFKSEKGSLLLAASDDAVRKLQAGTAINFTGQAVTHKDGRTHIVLGRATPSSRDRGSVTFSIVTDDARIVFNTSYHFPAPRP |
| Ga0207542_100314 | Ga0207542_1003143 | F081321 | VMCTVLRAAALISLALVLVSPTHAACRGNCEPNVEVARAAMQQIFKQTFLSPYTLVSFERLDGRSGERYGGVFYEMRIRAVLHYDGVRLRCRRPSCPELHHYLLENDAASKKATVAGWLFLANDGDGWKTVPLTLQSPQ |
| Ga0207542_100384 | Ga0207542_1003841 | F102094 | MYMLIVVIGVLSQGASVLPVGVTSQIVGKFKNLDECKAAAKQPHA |
| Ga0207542_100489 | Ga0207542_1004891 | F003758 | DPRMSPKVLRAFENAEDEVLTNTLKGAPRLNVPSLFRPQFMEAVQKEHPEYFADLPPLK |
| Ga0207542_100541 | Ga0207542_1005412 | F024822 | VREPFIRIAGAILCALALSGCVDSSGPLLSDAQPVLGEQLRLQFYSLRKGTADEPEQATYKWDRGAYQRTGGGMTDIGSFSVHPLARDIFVVQSAAAKRPGVFEYAIARRLVDGVYQVIAIDEADAGRVTRARFCKRASDSSCRIETRNQLYAFARATAERKKGQGGLVLRLADGVAESS |
| Ga0207542_100737 | Ga0207542_1007372 | F000268 | MRVVAVMLLLSAGIAAEAVSYSFVSKASGRLGGPIRFEFYHDSTTRPKTDIKSFTVSMRTADDRWKAMWSILSGRGLTQPIEYGVTPPGFTTMIQPQKLIPGRVYAGFATDGHGGTSGVTFGFDKNGRMTFPDSFDQ |
| Ga0207542_100756 | Ga0207542_1007561 | F075090 | PVIVQEAFSLAGAIVAEGEIAKAAALKPDEIPNGVSGDTVWKVADNLFTISVLPKDAVPAEIGDLNDLIVGGDAQKCRGDFFAGAMLDVVESLTVARAYTNCRTQQAETSTYYFAMPRKSGGLYLLKTIATGVEVTPVADRTIKELEARVRSVITAALAKL |
| Ga0207542_100998 | Ga0207542_1009981 | F085779 | MKNSHCLLGTSGVAAAVLLYSLGAAVFAQDEPRRPLNPGLVPNTGQINTGAAPQPWSKSAEVDNIPAPAEAWAALVQPISRQPSAGGGATTTGTGGQQPPASGTINASSEPPPSGPIGSFGQTIPAKFSKRNDILDHLPTMAIPLPLTQEQRKQIYDAVMAEKSQPVVGADALEL |
| Ga0207542_101227 | Ga0207542_1012272 | F002616 | RPSYDLLYSYAHLTPRMQDTPMPNSFPNWTSRIVIAGRIAEYRRPENWNESTMPGDYHLPFGRVEAQAKAARWLWQSYII |
| Ga0207542_101275 | Ga0207542_1012752 | F006477 | SGLAFVAYKHPNAYRVMFIFAVPVLVMGGLIVLAIKIGDLNGSIKSIYHELPNIRKYALSDQLPYQIRRLYEVGQFLKVFVIYYISGFAYLVFLLVLGGFLDLARDRHLSLRDMERK |
| Ga0207542_101757 | Ga0207542_1017571 | F087420 | QMGPTLGIRAKRSAAGGAIVLVLCALPHAPASAAKPPYAGCVVVTKQEYDSAKKQHMLRTRYTEYVRTGLPGRRQYWYCR |
| Ga0207542_101942 | Ga0207542_1019422 | F046968 | MRIISFGTQDSKFDTPRMGIILDTNGRDSRYRLDCEKLFESADRPSNPLAWFDMDGRW |
| Ga0207542_102236 | Ga0207542_1022362 | F054151 | MRKVDQYTLADHFRALAEGLSLRAVSERAPAQRAELQRLAECYAELAKQQSPADHF |
| Ga0207542_102272 | Ga0207542_1022721 | F075090 | AKAAALKPDEIPDGVSGDTVWKVADNLFTISVLPKDAVPAEIGDLNDLIVGGNAQKCRGDFFAGAMLDVVESTTIARAYTTCQTQQAATSTYYFAMPRKQGGGLYLTKIIATGVEVPPTIERAIKELDAKVRGVITAALARL |
| Ga0207542_102487 | Ga0207542_1024872 | F020986 | MKTRDIATELDRAFAAARVKGRMGGVVLCDTIAPLYAIHKALKATHASGELTDQQYTEKGRELLEILGNAVVSLLIQQAMSKHNH |
| Ga0207542_102493 | Ga0207542_1024931 | F015863 | LVFGASYASALINRVMGPWSSTAIHQDGSLSHMQFGVDLPRPEWVPVYPGAWVVGGSKITSVEHPAGFHGLDLGTRASLDEVKRFYTEQLTAAGFEVSDLGLMGLNPMTAAYLGVDGMLSAKRHATDDAIDVQIRTPDGIIPSRLLQIHWRKISATLAPPVAAHPRASEGPAAN |
| Ga0207542_102634 | Ga0207542_1026342 | F068085 | MIVLKYILLFTCLGIGLALCVGVISILRSPPDNGPPAWFAAAFGAMFFWG |
| Ga0207542_102803 | Ga0207542_1028032 | F017759 | IEAPESSLSGAVTVPGSSPVLLFGVVVGLLNSECPQQDRAYESKYGADSQHIELQGKVHGSASLVDALRLARNDPAPKAPVTRPAFPAGGIAYRTCAIDNRLIERLKKSEGPKILIVQNSCDTEFFMANLRQIPSILRGNDSTGFVAKEI |
| Ga0207542_103060 | Ga0207542_1030601 | F080568 | IKVVKSFTYRGQTRLFSNRYHFNGGLPPDSAHWTTLSDAIVTAEKAIYFAPQIVHTYGYAAGSEVPIFSKAYTTAGTLALGTQERCPGDCAGLIRYATTARTSKNHPVYLFNYYHGVVAVGASFDDVGAVQASAYSTYAGLWIAGFSDGATTYNRAGPNGASAVGSLVEPYITHRDLPR |
| Ga0207542_103347 | Ga0207542_1033471 | F068641 | MKSAVIELIQIALISATTLEVSAADTNYELKPISLPGATGTIALDYFAYDHATGKVWVPASNTGRVDVIDDATDAVSQVTGFATGEIERR |
| Ga0207542_103558 | Ga0207542_1035582 | F056406 | APKPPVSVTEQYTGSIIIVPTRGEDCRQMMLDNRTGRMWDKGIVNCYEAVSRPEKGQRGGMSSLRMNAIGKAFNRRDE |
| Ga0207542_104003 | Ga0207542_1040031 | F012929 | FAAVLVGAAVLFGLEQQFGVKLYLAIPAAIAVYFATLIVLTLAFGSGNQTK |
| Ga0207542_104314 | Ga0207542_1043142 | F031563 | MCGHRHHHRGLGRRGRRFPNREEWIRRLEERQRDLEQEIADLADVIKHLKSGETPEG |
| Ga0207542_104481 | Ga0207542_1044811 | F005366 | MYPLTLQAQIMRHMLNLAKDMASGDLLESRKKLDEYPLCGDPSVFDQIMRARAVQDRMWGREVDDTKNDPWRWSTYISQCAVRWLRDPHKWTREDTDDFYDAMIETAAICAAAAESVVRQRNTNGRTFYEPGSGKGK |
| ⦗Top⦘ |