


| Basic Information | |
|---|---|
| IMG/M Taxon OID | 3300006475 Open in IMG/M |
| GOLD Reference (Study | Sequencing Project | Analysis Project) | Gs0063646 | Gp0052508 | Ga0100234 |
| Sample Name | Human buccal mucosa microbial communities from NIH, USA - visit 2 of subject 158883629 |
| Sequencing Status | Permanent Draft |
| Sequencing Center | Baylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis |
| Published? | N |
| Use Policy | Open |
| Dataset Contents | |
|---|---|
| Total Genome Size | 21424045 |
| Sequencing Scaffolds | 5 |
| Novel Protein Genes | 5 |
| Associated Families | 5 |
| Dataset Phylogeny | |
|---|---|
| Taxonomy Groups | Number of Scaffolds |
| All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 2 |
| All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Pasteurellales → Pasteurellaceae → Haemophilus → Haemophilus parainfluenzae | 1 |
| All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Neisseriales → Neisseriaceae → Neisseria | 1 |
| All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Deuterostomia → Chordata → Craniata → Vertebrata → Gnathostomata → Teleostomi → Euteleostomi → Sarcopterygii → Dipnotetrapodomorpha → Tetrapoda → Amniota → Mammalia → Theria → Eutheria → Boreoeutheria → Euarchontoglires → Primates → Haplorrhini → Simiiformes → Catarrhini → Hominoidea → Hominidae → Homininae → Pan | 1 |
| Ecosystem Assignment (GOLD) | |
|---|---|
| Name | Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase |
| Type | Host-Associated |
| Taxonomy | Host-Associated → Human → Digestive System → Oral Cavity → Buccal Mucosa → Human → Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase |
| Alternative Ecosystem Assignments | |
|---|---|
| Environment Ontology (ENVO) | Unclassified |
| Earth Microbiome Project Ontology (EMPO) | Host-associated → Animal → Animal surface |
| Location Information | ||||||||
|---|---|---|---|---|---|---|---|---|
| Location | USA: Maryland: Natonal Institute of Health | |||||||
| Coordinates | Lat. (o) | 39.0042816 | Long. (o) | -77.1012173 | Alt. (m) | N/A | Depth (m) | N/A | Location on Map |
| Zoom: | Powered by OpenStreetMap © | |||||||
| Family | Category | Number of Sequences | 3D Structure? |
|---|---|---|---|
| F077405 | Metagenome | 117 | N |
| F077781 | Metagenome / Metatranscriptome | 117 | N |
| F101360 | Metagenome | 102 | N |
| F103431 | Metagenome | 101 | N |
| F103434 | Metagenome | 101 | N |
| Scaffold | Taxonomy | Length | IMG/M Link |
|---|---|---|---|
| Ga0100234_100208 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 7052 | Open in IMG/M |
| Ga0100234_100358 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 4750 | Open in IMG/M |
| Ga0100234_101021 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Pasteurellales → Pasteurellaceae → Haemophilus → Haemophilus parainfluenzae | 2170 | Open in IMG/M |
| Ga0100234_104662 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Betaproteobacteria → Neisseriales → Neisseriaceae → Neisseria | 718 | Open in IMG/M |
| Ga0100234_105002 | All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Deuterostomia → Chordata → Craniata → Vertebrata → Gnathostomata → Teleostomi → Euteleostomi → Sarcopterygii → Dipnotetrapodomorpha → Tetrapoda → Amniota → Mammalia → Theria → Eutheria → Boreoeutheria → Euarchontoglires → Primates → Haplorrhini → Simiiformes → Catarrhini → Hominoidea → Hominidae → Homininae → Pan | 689 | Open in IMG/M |
| Scaffold ID | Protein ID | Family | Sequence |
|---|---|---|---|
| Ga0100234_100208 | Ga0100234_1002083 | F103434 | VVGFRPRRGRYLENGPHVVEVTLAVVKEGRTGRRFERGETFVVDKVLVQPSAGNALKATENRVIRGDLTDETTLKVFGTGRKWPGGPHSWVKIIKGPESLVGKTFQQAGEPLTYDASPMTRHWSVRCDTLGTESR* |
| Ga0100234_100358 | Ga0100234_1003582 | F101360 | VSKESALRRAAIAAHIAKVASQEKKKALKELEEYMAPGDTSKPQDDGLQVGTVSVSAPQPRYQVVDENALVTWLEWNKPDAVHKVPAPWFVATAALEGFIKQTGEVPDGVEVVQGDPRISVRVSGAQEEAIRELISTGDISLIEIEGGDA* |
| Ga0100234_101021 | Ga0100234_1010214 | F077405 | SNSRPRPWQGRALPTELFPRLLVAKQRGVFYGFILLCQIKFVKNFFDWLKIIQK* |
| Ga0100234_104662 | Ga0100234_1046623 | F103431 | MINLDGLIVGMLFFIQLFLQSIAWGVAIAHFLHAERGNAAAAAFDGAFGENIADCHAEDDNDKNAESQKEGFHVCIPEG* |
| Ga0100234_105002 | Ga0100234_1050021 | F077781 | PIAAPARPASAHLKAPAAVTGGWGSPRQNDFFILLTLESPAPCDPLQTESHLVFRTRIHSQVQWGDVRGVAGRTKDSLPSDSVCTVGPRAGALSVRTADSLYLGFPRPHPGTPGLGRFWNFLALQSLSETPSHARMPRVTVARTSPETLEISPLRAAT* |
| ⦗Top⦘ |