NMPFamsDB

A database of Novel Metagenome Protein Families

Sample 3300007605

3300007605: Marine microbial communities from the Southern Atlantic ocean - KN S15 Surf_B metaT (Metagenome Metatranscriptome)



Overview

Basic Information
IMG/M Taxon OID: 3300007605
GOLD Reference (Study | Sequencing Project | Analysis Project): Gs0114292 | Gp0125887 | Ga0102779
Sample Name: Marine microbial communities from the Southern Atlantic ocean - KN S15 Surf_B metaT (Metagenome Metatranscriptome)
Sequencing Status: Permanent Draft
Sequencing Center: DOE Joint Genome Institute (JGI)
Published?: N
Use Policy: Open

Dataset Contents
Total Genome Size: 92608742
Sequencing Scaffolds: 28
Novel Protein Genes: 30
Associated Families: 24

Dataset Phylogeny
Taxonomy Groups | Number of Scaffolds
All Organisms → cellular organisms → Bacteria → Terrabacteria group → Cyanobacteria/Melainabacteria group → Cyanobacteria → Synechococcales → Prochlorococcaceae → Prochlorococcus | 1
Not Available | 18
All Organisms → cellular organisms → Eukaryota → Sar | 1
All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Panarthropoda → Arthropoda → Mandibulata → Pancrustacea → Crustacea → Multicrustacea → Hexanauplia → Copepoda → Neocopepoda | 1
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Kyanoviridae → Haifavirus → Haifavirus tim68 | 1
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Caudovirales → Myoviridae | 2
All Organisms → Viruses → Predicted Viral | 3
All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Caudovirales → Myoviridae → Kanaloavirus → unclassified Kanaloavirus → Kanaloavirus sp. | 1
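
The per-group scaffold counts in the phylogeny table can be cross-checked against the "Sequencing Scaffolds" figure reported under Dataset Contents. The following is a minimal sketch of that check. The dictionary keys are shorthand labels (the last taxon of each lineage) chosen here for illustration, and the variable names are hypothetical rather than part of any NMPFamsDB interface.

```python
# Scaffold counts per taxonomy group, copied from the Dataset Phylogeny table.
# Keys are shorthand labels (last taxon of each lineage), chosen for this sketch.
phylogeny_counts = {
    "Prochlorococcus": 1,
    "Not Available": 18,
    "Sar": 1,
    "Neocopepoda": 1,
    "Haifavirus tim68": 1,
    "Myoviridae": 2,
    "Predicted Viral": 3,
    "Kanaloavirus sp.": 1,
}

# Value reported as "Sequencing Scaffolds" under Dataset Contents.
reported_scaffolds = 28

# Sum the per-group counts and compare with the reported total.
total = sum(phylogeny_counts.values())
is_consistent = total == reported_scaffolds
print(f"summed={total} reported={reported_scaffolds} consistent={is_consistent}")
```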

Ecosystem and Geography

Ecosystem Assignment (GOLD)
Name: Marine Microbial Communities From The Southern Atlantic Ocean Transect To Study Dissolved Organic Matter And Carbon Cycling
Type: Environmental
Taxonomy: Environmental → Aquatic → Marine → Oceanic → Unclassified → Marine → Marine Microbial Communities From The Southern Atlantic Ocean Transect To Study Dissolved Organic Matter And Carbon Cycling

Alternative Ecosystem Assignments
Environment Ontology (ENVO): marine biome | marine water body | sea water
Earth Microbiome Project Ontology (EMPO): Free-living → Saline → Water (saline)

Location Information
Location: Southern Atlantic Ocean
Coordinates: Lat. (°) -28.2362 | Long. (°) -38.4949 | Alt. (m) N/A | Depth (m) N/A


Associated Families

Family | Category | Number of Sequences | 3D Structure?
F001145 | Metagenome / Metatranscriptome | 765 | Y
F003068 | Metagenome / Metatranscriptome | 509 | Y
F004869 | Metagenome / Metatranscriptome | 420 | Y
F005911 | Metagenome / Metatranscriptome | 386 | Y
F006348 | Metagenome / Metatranscriptome | 375 | Y
F007173 | Metagenome / Metatranscriptome | 356 | Y
F007756 | Metagenome / Metatranscriptome | 345 | Y
F008624 | Metagenome / Metatranscriptome | 330 | Y
F008889 | Metagenome / Metatranscriptome | 326 | Y
F010476 | Metagenome / Metatranscriptome | 303 | Y
F013897 | Metagenome / Metatranscriptome | 267 | Y
F016979 | Metagenome / Metatranscriptome | 243 | Y
F018809 | Metagenome / Metatranscriptome | 233 | Y
F025306 | Metagenome / Metatranscriptome | 202 | N
F028201 | Metagenome / Metatranscriptome | 192 | Y
F029129 | Metagenome / Metatranscriptome | 189 | Y
F034213 | Metagenome / Metatranscriptome | 175 | Y
F046429 | Metagenome / Metatranscriptome | 151 | N
F049701 | Metagenome / Metatranscriptome | 146 | N
F049702 | Metagenome / Metatranscriptome | 146 | Y
F059045 | Metagenome / Metatranscriptome | 134 | Y
F071278 | Metagenome / Metatranscriptome | 122 | N
F090441 | Metatranscriptome | 108 | N
F097482 | Metagenome / Metatranscriptome | 104 | N

Associated Scaffolds

Scaffold | Taxonomy | Length
Ga0102779_1000542 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Cyanobacteria/Melainabacteria group → Cyanobacteria → Synechococcales → Prochlorococcaceae → Prochlorococcus | 512
Ga0102779_1004681 | Not Available | 631
Ga0102779_1125649 | Not Available | 763
Ga0102779_1144663 | Not Available | 531
Ga0102779_1149673 | Not Available | 603
Ga0102779_1174719 | All Organisms → cellular organisms → Eukaryota → Sar | 525
Ga0102779_1182525 | Not Available | 748
Ga0102779_1201889 | Not Available | 558
Ga0102779_1209115 | Not Available | 511
Ga0102779_1220444 | All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Panarthropoda → Arthropoda → Mandibulata → Pancrustacea → Crustacea → Multicrustacea → Hexanauplia → Copepoda → Neocopepoda | 599
Ga0102779_1222901 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Kyanoviridae → Haifavirus → Haifavirus tim68 | 690
Ga0102779_1226815 | Not Available | 731
Ga0102779_1243709 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Caudovirales → Myoviridae | 544
Ga0102779_1249106 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Caudovirales → Myoviridae | 643
Ga0102779_1249930 | All Organisms → Viruses → Predicted Viral | 1028
Ga0102779_1252595 | All Organisms → Viruses → Predicted Viral | 1064
Ga0102779_1260835 | Not Available | 559
Ga0102779_1263361 | Not Available | 625
Ga0102779_1263622 | Not Available | 759
Ga0102779_1264371 | Not Available | 550
Ga0102779_1268683 | Not Available | 558
Ga0102779_1271440 | Not Available | 757
Ga0102779_1275133 | All Organisms → Viruses → Predicted Viral | 1193
Ga0102779_1275756 | Not Available | 696
Ga0102779_1276088 | Not Available | 829
Ga0102779_1277552 | Not Available | 619
Ga0102779_1277865 | Not Available | 954
Ga0102779_1280700 | All Organisms → Viruses → Duplodnaviria → Heunggongvirae → Uroviricota → Caudoviricetes → Caudovirales → Myoviridae → Kanaloavirus → unclassified Kanaloavirus → Kanaloavirus sp. | 945

Sequences

Scaffold ID | Protein ID | Family | Sequence
Ga0102779_1000542 | Ga0102779_10005421 | F016979 | MAENTAPLEGVMEQNVIYVALFIPKEKRNLYVKSLKKFI*
Ga0102779_1004681 | Ga0102779_10046811 | F005911 | VSLTTLSEDSKQRVAHLYNAYQMGQSIKGKIDGADINQARLGGSTEAELEAPIDDLFHNAQFVTLKEMVKKAVIIVSDDPTATTRVGTDTGYDLDNSGVLIKFADVDGTVIDDGTGGDTLVADVHLFVNLAGTADATTNAPYTAGEPFYYILMDDVTAGFNDASKRTIDLTQFPTGILATNDGGPQAEVSVVLPQDSEE*
Ga0102779_1010940 | Ga0102779_10109401 | F097482 | LFSSQNYISYNEVNLANYAQACFDKAMAEIVSKAITTNLLLQLIMNELILLKETLLD
Ga0102779_1125649 | Ga0102779_11256491 | F005911 | MSVKTKLLNILNSKKGNALLLATAGTIAATFGVYFFVTLTTLSEDSKQRVAHLYNAYQMGQAIKAKIDGADINQARLGSGTENDIEAPIDDLFHNGNFVTLAQMVKKAVIIVADDPTATARSGSDMAYDLTNSGALIKYADAAGNVIEADSDDGSDSVPIVTDVQVFVNLAGTADATTNSPYTAGEPFYYILMDADTAGLTASDITVDATQFPTGILATNDGGPQAEVSVIL
Ga0102779_1144663 | Ga0102779_11446632 | F018809 | MGGTMIKNLIIIGLFTIVVTQTDIGITDVFNYVEIGLDKLQELVYTMKRSV*
Ga0102779_1149673 | Ga0102779_11496731 | F003068 | IMRKILTLISVLSLVSFNAYAVDVSQLSVTAGVAHNSAVYGASAKETNRNESNVVKTVDKESGVFTESHQSYFMELNAGEFISLGFEHTPDSITTPENQRITNTNATTKVKVDFNDLNIAYVKLNVPGGLYVKAGVVETDLDIKESMASGSTYNNVSTEGTLLGVGYSRDLGDSPFSIRVEGSYMELDDVTTSNGVSATGG
Ga0102779_1174719 | Ga0102779_11747191 | F001145 | ETNIQNLIILIGILIYANNVSFSVSLENRQKEIIQTIENAQKDVLNASNYYYLAEKGFTQSLFWLQSWKVLYEKDKIDLVTNKYNLVKNGLLETFLTTENLIANFEKKAFITLQRYIIFVTASRILRKFLFLSDEEQSKLIEVTISKLGGV*
Ga0102779_1182525 | Ga0102779_11825251 | F005911 | MLKNKILNFLNSKKGNALLLATAGAIAATFSVYFFVSLTTLSEDSKQRVAHLYNAYQMGQSIKAKIDGADINQARLGSGDEDQIEAPINELFHNGNFIKLSEMVKKAVIIVSDDPTATARKGYDIGYDTENSGVLIKFAAADGNVIQPDQGDADDTIVADVHLFVNLAGTADTDANAPYVDGTPFYYILMDATTAGLADSLETINSTIYPTGILATNGGGPQAEVSVVLPQ
Ga0102779_1201889 | Ga0102779_12018892 | F046429 | IIEGDLIVAKNVVTEARRIIGSVGDILVESTKRQVLKG*
Ga0102779_1209115 | Ga0102779_12091152 | F007756 | MTKLNTLKRFYVNVKFEKYGTYTIEARSKEHALEIYNDGNYGWSDYSEDFGEFNEVVEDVEEEVFADTQLTLSGVFS*
Ga0102779_1220444 | Ga0102779_12204441 | F059045 | LGDEIPKEDVQKMFAEICDPEDEDGLFPYMSFLDRLTGKA*
Ga0102779_1222901 | Ga0102779_12229012 | F007173 | MQTTTTKAVISRDKLMEYIHEERDLLMGLQDDLSDMLSATGKFSITLDEIVQSFMPYIPLYLIENEDEIKQAFPDRITDDEYIFIYDKDMTPNEITLNVEWRD*
Ga0102779_1226815 | Ga0102779_12268151 | F003068 | INIRITTMRKILTIISMVTLVSFPVKAVDFDFGSVSVTGGIALNSAVYGASAKETNRDESNVIRTVNKESGVFTEDHQSVFGEVNLGEFVSLGFEHTPDSITTPENRRTVQADGNSTAGTTTVSVDFNDLNVAYLKFNLPGGMYLKYGYVDVNLDIKESMASGSTYANVGTEGTLAGIGYSRPLGDAGLALRVEGSYMSLDDVTTSNGVSASGATVANGGRNQVDASNLEGLNGKIALTYTFG
Ga0102779_1243709 | Ga0102779_12437092 | F049701 | VRPWDKSEYTELSDLMEQAKRYEVIEDDDFKMTIDYQNMTSIFEVK*
Ga0102779_1249106 | Ga0102779_12491062 | F025306 | MKPILFTEAELETIERAMDDYVSYADPDTPASDLIGGLPIMDRYNSIMEKITTAYCDL*
Ga0102779_1249930 | Ga0102779_12499301 | F049702 | IDFDKKELHDIYSALQYSRLEIGFESKSEEELYNRLTKLMDKVAKLRQVCDCQQNVSK*
Ga0102779_1252595 | Ga0102779_12525951 | F025306 | TEAELETIERAMDDYACYDDPDTPASDLIGGLPVMDRINSIMEKITTAYCDL*
Ga0102779_1260835 | Ga0102779_12608352 | F006348 | MEVSKMQKFYIVENLEFDSKFKTPEEIEDLKWKQDQAIGVWSSVGKTEDERINDLFNKVQDYMGVYLTSLSYCNNRPHPLTAFK*
Ga0102779_1263361 | Ga0102779_12633611 | F006348 | MNKKYIVENLEFDKKFKNEKEIEDLNFKRVNAIGIWDIEGKTEDERVSKLFDKVQDYMGVYLCSLSYCNNRPHPLTAFK*
Ga0102779_1263361 | Ga0102779_12633612 | F008889 | MTYTTEQFEKDVAGLRALIKMCDDLEKENDRKTNALIDQINGENPFAWRAN*
Ga0102779_1263622 | Ga0102779_12636221 | F013897 | LMNTIDFATDNDASYEEYTIIKSGTSDLELIKDILYNEYIHQTQ*
Ga0102779_1264371 | Ga0102779_12643712 | F028201 | LPPFKMINYDQKQSKEIYDALLDSKVDLLEYFLGPDPKKSSYYRKHVKRYRVQSILNDY*
Ga0102779_1268683 | Ga0102779_12686832 | F010476 | FDYKIIAYNKLGKVQETENLFCAPDEIDDVMYTMSEQYGYAEALDTMDTHCGEYGERPLSLGERRYF*
Ga0102779_1271440 | Ga0102779_12714401 | F090441 | ELATLSFPDSLALGALITRRVIVGCQFPDAALNRKNDCDTFRFHSGT*
Ga0102779_1275133 | Ga0102779_12751332 | F004869 | MTCDTEHYYALHTFLEDDELHEIWNIVEKAMNREGYDVSNSELSMRLYDSELEENIEHDMENLLSSLSEGKDQPVEKAAHGHVRKDLDTL*
Ga0102779_1275756 | Ga0102779_12757561 | F071278 | GAIAATFSVYFFVSISTLSDDSKQRVTHLYNAYQMALSVKGKINGDSMQANKLDGTNVEDDIEDSLDPLFHNGSFITLREMVIASIIIVQNDPTTTSERGKKIPYDVDNSGVLIKFANSTDAVIAPSDTNRAIEDRANLAKVHDLHLFVNLAGTTDIDSRPNGPYAVGDPFFYVVMASDSDTGRTDGLTDALKTVNLTQFPTGMLATNQGGPQAETSVILPQDFD*
Ga0102779_1276088 | Ga0102779_12760881 | F034213 | MTIKTLKNFASMDQSFKEWLTTCPKEYIWQINEVTEDLEGNFTFR
Ga0102779_1277552 | Ga0102779_12775523 | F029129 | MTNRTFIIEVKEDGVSKVNKYGDKLHSLIDDLGWEYQRMGRSGRETYDEICHLLGMIPEDEVYME
Ga0102779_1277865 | Ga0102779_12778652 | F008624 | MNPQAHRSTEELKTIVKALSKLRLLNTPEEDQRLFECEQELRKRKREQDFIDAHFHVVVYN*
Ga0102779_1280700 | Ga0102779_12807001 | F008624 | HRYYLFPMNPQAHRSTEELKTIVKALSKLRLLNTPEEDQRLFECEQELRKRKREDDFINAHFQVITY*
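
Each row of the Sequences table pairs a scaffold ID with a protein ID, a family accession, and a protein sequence. In every row of this table the protein ID reads as the scaffold ID followed by a short numeric suffix, and many sequences end in `*`. The sketch below assumes the suffix is a gene ordinal on the scaffold and that `*` marks a translated stop codon. Both are interpretations of the pattern seen here, not documented NMPFamsDB behaviour, and the function name `describe_protein` is hypothetical.

```python
def describe_protein(scaffold_id: str, protein_id: str, sequence: str) -> dict:
    """Summarise one row of the Sequences table.

    Assumption: protein_id == scaffold_id + gene ordinal (pattern observed
    in this table only). Assumption: a trailing '*' denotes a stop codon.
    """
    if not protein_id.startswith(scaffold_id):
        raise ValueError("protein ID does not extend the scaffold ID")

    # The part left over after the scaffold ID is read as the gene ordinal.
    gene_ordinal = int(protein_id[len(scaffold_id):])

    # Strip the trailing '*' (if any) before measuring the residue count.
    has_stop = sequence.endswith("*")
    residues = sequence[:-1] if has_stop else sequence

    return {
        "gene_ordinal": gene_ordinal,
        "has_stop": has_stop,
        "residues": residues,
        "length": len(residues),
    }


# First row of the Sequences table, used as a worked example.
row = describe_protein(
    "Ga0102779_1000542",
    "Ga0102779_10005421",
    "MAENTAPLEGVMEQNVIYVALFIPKEKRNLYVKSLKKFI*",
)
print(row["gene_ordinal"], row["has_stop"], row["length"])
```

Rows whose sequence lacks the trailing `*`, such as Ga0102779_10109401, would report `has_stop` as false under the same assumption. That may indicate a gene truncated at the scaffold edge, though this page does not confirm it.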

© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice