


| Basic Information | |
|---|---|
| IMG/M Taxon OID | 3300006545 Open in IMG/M |
| GOLD Reference (Study | Sequencing Project | Analysis Project) | Gs0063646 | Gp0052548 | Ga0101078 |
| Sample Name | Human buccal mucosa microbial communities from NIH, USA - visit 1, subject 823052294 |
| Sequencing Status | Permanent Draft |
| Sequencing Center | Baylor College of Medicine, J. Craig Venter Institute (JCVI), Washington University in St. Louis |
| Published? | N |
| Use Policy | Open |
| Dataset Contents | |
|---|---|
| Total Genome Size | 34501236 |
| Sequencing Scaffolds | 5 |
| Novel Protein Genes | 5 |
| Associated Families | 5 |
| Dataset Phylogeny | |
|---|---|
| Taxonomy Groups | Number of Scaffolds |
| All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Pasteurellales → Pasteurellaceae → Haemophilus → Haemophilus parainfluenzae | 1 |
| All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Siphoviridae sp. ctYBm1 | 1 |
| All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Siphoviridae sp. ctSdk10 | 1 |
| All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales | 1 |
| All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 1 |
| Ecosystem Assignment (GOLD) | |
|---|---|
| Name | Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase |
| Type | Host-Associated |
| Taxonomy | Host-Associated → Human → Digestive System → Oral Cavity → Buccal Mucosa → Human → Human Microbial Communities From The National Institute Of Health, Usa, Hmp Production Phase |
| Alternative Ecosystem Assignments | |
|---|---|
| Environment Ontology (ENVO) | Unclassified |
| Earth Microbiome Project Ontology (EMPO) | Host-associated → Animal → Animal surface |
| Location Information | ||||||||
|---|---|---|---|---|---|---|---|---|
| Location | USA: Maryland: Natonal Institute of Health | |||||||
| Coordinates | Lat. (o) | 39.0042816 | Long. (o) | -77.1012173 | Alt. (m) | N/A | Depth (m) | N/A | Location on Map |
| Zoom: | Powered by OpenStreetMap © | |||||||
| Family | Category | Number of Sequences | 3D Structure? |
|---|---|---|---|
| F027205 | Metagenome | 195 | N |
| F043991 | Metagenome | 155 | N |
| F051212 | Metagenome | 144 | N |
| F077405 | Metagenome | 117 | N |
| F081510 | Metagenome | 114 | N |
| Scaffold | Taxonomy | Length | IMG/M Link |
|---|---|---|---|
| Ga0101078_100377 | All Organisms → cellular organisms → Bacteria → Proteobacteria → Gammaproteobacteria → Pasteurellales → Pasteurellaceae → Haemophilus → Haemophilus parainfluenzae | 8409 | Open in IMG/M |
| Ga0101078_100411 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Siphoviridae sp. ctYBm1 | 7650 | Open in IMG/M |
| Ga0101078_107739 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → unclassified Caudoviricetes → Siphoviridae sp. ctSdk10 | 788 | Open in IMG/M |
| Ga0101078_109877 | All Organisms → cellular organisms → Bacteria → FCB group → Bacteroidetes/Chlorobi group → Bacteroidetes → Flavobacteriia → Flavobacteriales | 660 | Open in IMG/M |
| Ga0101078_113213 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes | 533 | Open in IMG/M |
| Scaffold ID | Protein ID | Family | Sequence |
|---|---|---|---|
| Ga0101078_100377 | Ga0101078_10037710 | F077405 | WQGRALPTELFPRLLVAKQRGVFYGFILLCQIKFVKKFFDWLKIIQKQKVRLK* |
| Ga0101078_100411 | Ga0101078_1004118 | F081510 | MKLPKLPNMQTIKSTAKSAMVTTKILGKKYAPFVLLGVGLVGYGYSVYAGVKSGKKLEATKAKYEAKDAAGEEYTRMEVVKDVAKDVAIPVAVATASTAAIVLGFAIQTNRLKAVSSALAIVTEEHARYRLRAKEVLDEATFKKIDAPLETKTVELDGQEVEVESIVPNEGDFYGQWFKYSSNYVSDDPDYNESYIKEAETYLVNRMMKKGVLTFGEVLDKLGFDVPRAALPFGWTDTDDFYIEWDAHEVFDDVKQEYDLQFYVRWKTPRNLYATTSFKDFVPKKTRKELN* |
| Ga0101078_107739 | Ga0101078_1077391 | F027205 | VASRLIVSADDILKAVKESEEFERKALKEAKKRDRAEGKEPRETLYPNPDLKPGREIVLDYIKNPERRRTPRCSVHLEKRTANNSYRFIVDVSQVRNRELADEIEKDLFAFMDYLLDEYDIPQRIKRSTK* |
| Ga0101078_109877 | Ga0101078_1098771 | F051212 | EKQNTEKIERIIYSQTGGDTGGKNVYLVITKDSIIYRLTEGVTDEKTIANLSLNNNNKDWEAFIDKIDLEDFEKGKPSKELIMDLPTTKIIIKTDKKEYSKTNIQNNKTWDYITKQIIDIKFSQLYNHLNLEK* |
| Ga0101078_113213 | Ga0101078_1132131 | F043991 | MSKKTPSVIDYFSLNGDVVEEANEFDGISLEDWIDKRSSIKPSWVGQYSQQMHFDLADDTEVSFYKTPNVIYADILFAGGVRTILFKCRQKKNLTRFISRVLELANLGPKHVHPDFRA* |
| ⦗Top⦘ |