


| Basic Information | |
|---|---|
| IMG/M Taxon OID | 3300010237 Open in IMG/M |
| GOLD Reference (Study | Sequencing Project | Analysis Project) | Gs0121435 | Gp0154128 | Ga0136250 |
| Sample Name | Terrestrial oil reservoir microbial community from Ivishak Formation, Alaska - I2 |
| Sequencing Status | Permanent Draft |
| Sequencing Center | Yale Center for Genome Analysis |
| Published? | Y |
| Use Policy | Open |
| Dataset Contents | |
|---|---|
| Total Genome Size | 1260996402 |
| Sequencing Scaffolds | 6 |
| Novel Protein Genes | 12 |
| Associated Families | 10 |
| Dataset Phylogeny | |
|---|---|
| Taxonomy Groups | Number of Scaffolds |
| All Organisms → Viruses → Predicted Viral | 1 |
| Not Available | 3 |
| All Organisms → cellular organisms → Archaea → Candidatus Thermoplasmatota → Thermoplasmata → unclassified Thermoplasmata → Thermoplasmata archaeon | 1 |
| All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Deuterostomia → Chordata → Craniata → Vertebrata → Gnathostomata → Teleostomi → Euteleostomi → Sarcopterygii → Dipnotetrapodomorpha → Tetrapoda → Amniota → Mammalia → Theria → Eutheria → Boreoeutheria → Euarchontoglires → Primates → Haplorrhini → Simiiformes | 1 |
| Ecosystem Assignment (GOLD) | |
|---|---|
| Name | Terrestrial Oil Reservoir Microbial Communities From Alaska |
| Type | Environmental |
| Taxonomy | Environmental → Terrestrial → Oil Reservoir → Unclassified → Unclassified → Produced Fluid → Terrestrial Oil Reservoir Microbial Communities From Alaska |
| Alternative Ecosystem Assignments | |
|---|---|
| Environment Ontology (ENVO) | Unclassified |
| Earth Microbiome Project Ontology (EMPO) | Free-living → Non-saline → Subsurface (non-saline) |
| Location Information | ||||||||
|---|---|---|---|---|---|---|---|---|
| Location | Alaska, USA | |||||||
| Coordinates | Lat. (o) | 70.4 | Long. (o) | -148.7 | Alt. (m) | N/A | Depth (m) | N/A | Location on Map |
| Zoom: | Powered by OpenStreetMap © | |||||||
| Family | Category | Number of Sequences | 3D Structure? |
|---|---|---|---|
| F004971 | Metagenome | 416 | Y |
| F024691 | Metagenome | 204 | Y |
| F041530 | Metagenome | 159 | N |
| F043719 | Metagenome | 155 | N |
| F045470 | Metagenome | 152 | N |
| F059535 | Metagenome | 133 | N |
| F066220 | Metagenome | 127 | N |
| F076711 | Metagenome | 117 | Y |
| F077781 | Metagenome / Metatranscriptome | 117 | N |
| F079441 | Metagenome | 115 | N |
| Scaffold | Taxonomy | Length | IMG/M Link |
|---|---|---|---|
| Ga0136250_100000730 | All Organisms → Viruses → Predicted Viral | 4242 | Open in IMG/M |
| Ga0136250_100001505 | Not Available | 2252 | Open in IMG/M |
| Ga0136250_100001743 | All Organisms → cellular organisms → Archaea → Candidatus Thermoplasmatota → Thermoplasmata → unclassified Thermoplasmata → Thermoplasmata archaeon | 1992 | Open in IMG/M |
| Ga0136250_100002654 | All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Deuterostomia → Chordata → Craniata → Vertebrata → Gnathostomata → Teleostomi → Euteleostomi → Sarcopterygii → Dipnotetrapodomorpha → Tetrapoda → Amniota → Mammalia → Theria → Eutheria → Boreoeutheria → Euarchontoglires → Primates → Haplorrhini → Simiiformes | 1435 | Open in IMG/M |
| Ga0136250_100004520 | Not Available | 959 | Open in IMG/M |
| Ga0136250_100004559 | Not Available | 954 | Open in IMG/M |
| Scaffold ID | Protein ID | Family | Sequence |
|---|---|---|---|
| Ga0136250_100000730 | Ga0136250_1000007301 | F059535 | AEFDGFTIQYDEAIGWYDYVETGDTSKYGKDFGYYASLIVRVRNEDGVCRSLLTAIYGKENGKVDITDEGIEALYGIASFVITSIRVKGTAVKIPVKEEERVIVAPTKEIPELSAVVGLIGYIAKNYNMSVDEVIELVRKSFKRK* |
| Ga0136250_100000730 | Ga0136250_1000007302 | F066220 | MGLFVNVRELSVRELMDLARSNEYSVAYTPFGRFGLRTYIRRAAPWKGLKGDAFWDEAKKHGNLANGLRSAIDISRRFAGNTGVALVKYMDGSYGIMPYKSALQGQGKTIADIIATAPNYFRLITRPEASELIKAAKLQPVVY* |
| Ga0136250_100000730 | Ga0136250_1000007306 | F045470 | MVFEEFKPVVSPRMNVRVLPDYGFYRVLSVDPVGVLIKDFGKTLAADAESKNNELEEVYMQDNEMAQWRIWVLDPVGIKLKVVGQEKLRGATKYSTSEIRSFSPETNDDDGISQFFSFEDEKIYVDIVNLTKDTLTSQRIKLIGYKYVLEKLDHAPEIYTDIPFGLVR* |
| Ga0136250_100000730 | Ga0136250_1000007307 | F043719 | MLRNISKFAEFTAAATDTIVHTVSTNRKAVITNIIVTNKESADILVTLYDGASTGEDKKLEIKVATGDTKVISLEDEYGIEFRTSVVAQVDSYTSGSSIWIGGYEA* |
| Ga0136250_100000730 | Ga0136250_1000007308 | F079441 | MRLESYELVEQLKTLNENIRTLNSILKDWANITNRLDEQIRLLETQRYSSKYWHFEPIEHYDLETLQPGEVVNLLSTTESGAFVLFCSFTTSDKVYWEVRADDLVIKGTIEDLYNYGLIGYNPRTLWISAGNGGYAVWATPNPYIEYRNKIQVYGKNVGDTEIQYNYIIFRYKYIGGD* |
| Ga0136250_100001505 | Ga0136250_1000015055 | F004971 | MGEEMEIKVGNLSEFEEGWFEALRCLAIQKISEATGEPAKLATGFCNDDEALHLLTASGRVFHVCYSDPLCVFVVEEVRCPRCGQWVPANILEEHLEEHEEFCVWEEEVM* |
| Ga0136250_100001743 | Ga0136250_1000017432 | F024691 | MRYHTEGQRDKRKLKTHIPLLEIKASKRKSTKHDRVLVILSTLVRNFSERVNVDHVKPPIGLDKTLKLTDERGRKYDIAFIYEGELFLVEVKRCGKWQ* |
| Ga0136250_100001743 | Ga0136250_1000017433 | F041530 | MAIEKMERKQKGVITKISGKLKNVEFRTRRDGRIELGARRIPKQPLSPAQLSVRQAYGKIYQKWTQLSEEEKAYWNELGKPHKLSGWNVFLRENMKAQTFKVVLDCSGMSTTYENYTLIMFLADFPDLYNALNDDLTNIQVLDSDKTTELQFFRLITKRESTDTVLLINIPSLPPGGYYEIYIKTSMRFNNPGTPGSDFFSFFDDF |
| Ga0136250_100002654 | Ga0136250_1000026542 | F077781 | PAPAHLKAPAAVTGGWGSPRQNDFFILLTLESPAPCDPLQTESHLVFRTRIHSQVQWGDVRGVAGRTKDSLPSDSVCTVGPRAGALSVRTADSLYLGFPRPHPGTPGLGRFWPFLALQSLSETPSHARMPRVTVARTSPETLEISPLRAAT* |
| Ga0136250_100004520 | Ga0136250_1000045201 | F076711 | MGVGWAICPVCGKKFEGSYFPDTGVNTAVHMVQRHMVLRHSKKITDSGIVVAKLPDDYIRASELKTFVAGGTGNTRLQGRAHIVCVCGRYFDITLEEHYDEGKKIEVVVREYDDVGRYKKISERRVVERVGHIDVSKFAELWERAWERIKEDIIREHLMEHGIHL* |
| Ga0136250_100004520 | Ga0136250_1000045202 | F004971 | MEIKIENLDLFEDGWFEALKCLAIQKISEATGELCRLATGFCNDDEAFHLTAGGRVFHVCYSDPLLVFVVEEVRCPKCGQWVPGNVLEEHLEEHEVM* |
| Ga0136250_100004559 | Ga0136250_1000045592 | F076711 | MMGIGWATCPVCGKRFEGSYFPDTGVNTAVHMVQRHTILRHGKKITDSGIVDAKLPDDFIRASELKTFVAGGTGNTRLQGRAYIICVCGRYFDITLTEIYDEGKKIEVVVREYDDVGRYKKISERRVVERVGHVDVSKFERLWEQAWERIKEDIIREHLEEHGIHL* |
| ⦗Top⦘ |