NMPFamsDB

A database of Novel Metagenome Protein Families

Sample 3300009341

3300009341: Microbial communities of water from the North Atlantic ocean - ACM17



Overview

Basic Information
IMG/M Taxon OID: 3300009341
GOLD Reference (Study | Sequencing Project | Analysis Project): Gs0117984 | Gp0126425 | Ga0103836
Sample Name: Microbial communities of water from the North Atlantic ocean - ACM17
Sequencing Status: Permanent Draft
Sequencing Center: University of Georgia
Published: N
Use Policy: Open

Dataset Contents
Total Genome Size: 42220838
Sequencing Scaffolds: 12
Novel Protein Genes: 19
Associated Families: 19

Dataset Phylogeny
Taxonomy Groups | Number of Scaffolds
All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Apicomplexa → Aconoidasida → Haemosporida → Plasmodiidae → Plasmodium | 1
All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Panarthropoda → Arthropoda → Mandibulata → Pancrustacea → Crustacea → Multicrustacea → Hexanauplia → Copepoda → Neocopepoda → Gymnoplea → Calanoida → Temoridae → Eurytemora → Eurytemora affinis | 2
All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Ciliophora → Intramacronucleata → Spirotrichea → Stichotrichia → Urostylida → Pseudourostylidae → Pseudourostyla → Pseudourostyla cristata | 1
All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Panarthropoda → Arthropoda → Mandibulata → Pancrustacea → Crustacea → Multicrustacea → Hexanauplia → Copepoda → Neocopepoda → Podoplea → Harpacticoida → Harpacticidae → Tigriopus → Tigriopus californicus | 2
All Organisms → cellular organisms → Eukaryota | 1
Not Available | 2
All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Dinophyceae | 1
All Organisms → cellular organisms → Eukaryota → Opisthokonta | 1
All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Ciliophora → Intramacronucleata → Spirotrichea | 1
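
The scaffold counts per lineage can be rolled up to a coarser level by splitting each lineage on the arrow separator. The following is a minimal sketch, not part of NMPFamsDB itself: the names `ROWS`, `group_at_depth`, and `tally_by_depth` are hypothetical, the data is copied by hand from the Dataset Phylogeny listing, and lineages shorter than the requested depth are assumed to fall back to their deepest available name.

```python
from collections import Counter

SEP = " → "

# Shared lineage prefixes, copied from the Dataset Phylogeny listing.
EUK = "All Organisms → cellular organisms → Eukaryota"
ALVEOLATA = EUK + " → Sar → Alveolata"
COPEPODA = (
    EUK + " → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia"
    " → Ecdysozoa → Panarthropoda → Arthropoda → Mandibulata → Pancrustacea"
    " → Crustacea → Multicrustacea → Hexanauplia → Copepoda → Neocopepoda"
)

# (lineage, number of scaffolds) pairs for this sample.
ROWS = [
    (ALVEOLATA + " → Apicomplexa → Aconoidasida → Haemosporida → Plasmodiidae → Plasmodium", 1),
    (COPEPODA + " → Gymnoplea → Calanoida → Temoridae → Eurytemora → Eurytemora affinis", 2),
    (ALVEOLATA + " → Ciliophora → Intramacronucleata → Spirotrichea → Stichotrichia"
     " → Urostylida → Pseudourostylidae → Pseudourostyla → Pseudourostyla cristata", 1),
    (COPEPODA + " → Podoplea → Harpacticoida → Harpacticidae → Tigriopus → Tigriopus californicus", 2),
    (EUK, 1),
    ("Not Available", 2),
    (ALVEOLATA + " → Dinophyceae", 1),
    (EUK + " → Opisthokonta", 1),
    (ALVEOLATA + " → Ciliophora → Intramacronucleata → Spirotrichea", 1),
]


def group_at_depth(lineage: str, depth: int) -> str:
    """Return the name at the given 0-based depth, or the deepest name if the lineage is shorter."""
    ranks = lineage.split(SEP)
    return ranks[depth] if len(ranks) > depth else ranks[-1]


def tally_by_depth(rows, depth: int) -> Counter:
    """Sum scaffold counts per group at the chosen lineage depth."""
    tally = Counter()
    for lineage, n_scaffolds in rows:
        tally[group_at_depth(lineage, depth)] += n_scaffolds
    return tally


# Depth 3 is the level directly below Eukaryota (for example Sar or Opisthokonta).
tally = tally_by_depth(ROWS, depth=3)
for group, count in tally.most_common():
    print(f"{group}: {count}")
```

Summing the tallied counts gives a quick consistency check against the Sequencing Scaffolds figure reported under Dataset Contents.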

Ecosystem and Geography

Ecosystem Assignment (GOLD)
Name: Aquatic Microbial Communities From Amazon River, Brazil And North Atlantic Ocean
Type: Environmental
Taxonomy: Environmental → Aquatic → Freshwater → Unclassified → Unclassified → River Water → Aquatic Microbial Communities From Amazon River, Brazil And North Atlantic Ocean

Alternative Ecosystem Assignments
Environment Ontology (ENVO): marine biome | marine water body | surface water
Earth Microbiome Project Ontology (EMPO): Free-living → Saline → Water (saline)

Location Information
Location: North Pacific Ocean
Coordinates: Lat. (°) N/A | Long. (°) N/A | Alt. (m) N/A | Depth (m) N/A


Associated Families

Family | Category | Number of Sequences | 3D Structure?
F000021 | Metagenome / Metatranscriptome | 6082 | Y
F000048 | Metagenome / Metatranscriptome | 3365 | Y
F000052 | Metagenome / Metatranscriptome | 3223 | Y
F000073 | Metagenome / Metatranscriptome | 2639 | Y
F000076 | Metatranscriptome | 2620 | Y
F000147 | Metagenome / Metatranscriptome | 1917 | Y
F000235 | Metatranscriptome | 1500 | Y
F000287 | Metatranscriptome | 1370 | Y
F001926 | Metagenome / Metatranscriptome | 616 | Y
F003081 | Metagenome / Metatranscriptome | 508 | Y
F003495 | Metagenome / Metatranscriptome | 483 | Y
F004957 | Metagenome / Metatranscriptome | 417 | Y
F005202 | Metatranscriptome | 408 | N
F011139 | Metagenome / Metatranscriptome | 294 | Y
F012568 | Metatranscriptome | 279 | Y
F014224 | Metagenome / Metatranscriptome | 264 | N
F041786 | Metagenome / Metatranscriptome | 159 | Y
F062416 | Metatranscriptome | 130 | Y
F085736 | Metagenome / Metatranscriptome | 111 | Y

Associated Scaffolds

Scaffold | Taxonomy | Length
Ga0103836_1000290 | All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Apicomplexa → Aconoidasida → Haemosporida → Plasmodiidae → Plasmodium | 1409
Ga0103836_1001663 | All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Panarthropoda → Arthropoda → Mandibulata → Pancrustacea → Crustacea → Multicrustacea → Hexanauplia → Copepoda → Neocopepoda → Gymnoplea → Calanoida → Temoridae → Eurytemora → Eurytemora affinis | 893
Ga0103836_1001674 | All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Ciliophora → Intramacronucleata → Spirotrichea → Stichotrichia → Urostylida → Pseudourostylidae → Pseudourostyla → Pseudourostyla cristata | 891
Ga0103836_1002276 | All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Panarthropoda → Arthropoda → Mandibulata → Pancrustacea → Crustacea → Multicrustacea → Hexanauplia → Copepoda → Neocopepoda → Podoplea → Harpacticoida → Harpacticidae → Tigriopus → Tigriopus californicus | 822
Ga0103836_1002387 | All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Panarthropoda → Arthropoda → Mandibulata → Pancrustacea → Crustacea → Multicrustacea → Hexanauplia → Copepoda → Neocopepoda → Podoplea → Harpacticoida → Harpacticidae → Tigriopus → Tigriopus californicus | 810
Ga0103836_1004379 | All Organisms → cellular organisms → Eukaryota | 688
Ga0103836_1004719 | Not Available | 675
Ga0103836_1006528 | All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Dinophyceae | 617
Ga0103836_1007214 | Not Available | 598
Ga0103836_1008697 | All Organisms → cellular organisms → Eukaryota → Opisthokonta | 566
Ga0103836_1009122 | All Organisms → cellular organisms → Eukaryota → Sar → Alveolata → Ciliophora → Intramacronucleata → Spirotrichea | 558
Ga0103836_1011347 | All Organisms → cellular organisms → Eukaryota → Opisthokonta → Metazoa → Eumetazoa → Bilateria → Protostomia → Ecdysozoa → Panarthropoda → Arthropoda → Mandibulata → Pancrustacea → Crustacea → Multicrustacea → Hexanauplia → Copepoda → Neocopepoda → Gymnoplea → Calanoida → Temoridae → Eurytemora → Eurytemora affinis | 523

Sequences

Scaffold ID | Protein ID | Family | Sequence
Ga0103836_1000290 | Ga0103836_10002901 | F001926 | VRHEDIPDMSNPKTTKAVLEGDKLIMEVLKGGYNVHPVPPPNSVHKESMTKRYETGRMRIDKLLTLGYTTSADPRKIGTK*
Ga0103836_1001663 | Ga0103836_10016631 | F014224 | MHLNILFSSTLLCIANGAPQYGGVQPSPAAAPLRGQGTVKCVTQYATIWDTEYTETETEVCTTEYEKVCRTETQRLCQPTTRQECSTVYEKQCQTVYRNECVEQFKTEYEPYTETECSTEYKEDCEYQWEGHGNDKVWAPIAGTCKNNPYETCSDVTKSKAVQVPYPVCRDIPEQKCVDVPRQECVQVPDQVCTNQPLQKCQDVPRQACQQIHKKVPNRVSRKVPKKVCQDTGSHLNGVAVGNGPVVIDARKNEKSVQFEEGVETPNSIVFGD*
Ga0103836_1001674 | Ga0103836_10016741 | F011139 | MGLFLGLLAFLPLVYNFYNTFSKYVATIPMQNSVLQTTTFIIFMLSLYCANSMLPCGRYYYEPEGGYVGNP*
Ga0103836_1002243 | Ga0103836_10022431 | F000052 | GPGADGKRCIDKVEMVEETEYDDVVQCDHSYDKRCHTTYVTNYESQQEEECEENFRKSCFIEYEQIAFNETVAVCRTPLVKDCDVQGPEICRTEYESECWTKQEVHDVEDDVVSCTTEVEEKCEDETSGYTTNTKCSKWPKEVCSVEKKAVKKYTPITGCTKEPREICAPAGCGFKEGAEECYDKTQTVVQDAPKEQCSLEPQRTCKHVTKLVPKLEPSEECVDVPKEVCTRSRTNPRKVKKPVVKKWCYVPTEESGLA*
Ga0103836_1002276 | Ga0103836_10022761 | F041786 | QDGKIDAETFRAQLMTFGNKFSASEVDDAFAEFQIDGGMIDAAHLKGLMVSK*
Ga0103836_1002387 | Ga0103836_10023871 | F000287 | GGAWRRPRMKVYDYNQEFGGNYYQPMIQYINKKDIYGPFHKRTEVYLPHSAEVTSDKYTNMRYQDKSSAKHNLDEFLKNAYSKQIKELNGTTAMARVNLMKNIVSNRRQPHSPLDNVNTPYNPIRLLKGAPPGQEAVNHYISELSIEKKHNTKLSDKQRKHLVMLEACDDEYNYHYKLFGQGAVDRDMKFFAPQLVQDYVNQIRRF*
Ga0103836_1004243 | Ga0103836_10042431 | F000048 | NKMTTHAEADAEVAALKARFDAVKAISETWVKKCDVLVKEWVLLDNTVTELNSWVAKDKSAEGENQFSLEKMESTLGELKNIFKQKEKLVEGL*
Ga0103836_1004379 | Ga0103836_10043791 | F005202 | KELNVLDASNNIDVQAMKRDIQKYTMPSEWFKNKYEHLLDTCYEMATNLPADIAENSVVTGDSFGTVKLGEVKMFTKCCEKAKTKLCMHQDIKKKVESNFGPMEDILQETQLTEHEFFPLVMQLLHGKEMDYMTGGM*
Ga0103836_1004719 | Ga0103836_10047191 | F012568 | DGLIEVTEEDFAEFLEDFDEFKDHVASKMSALTCVLAKMNMLDSNLQVNLKAYTEDVWNDIDLSQTLAGEDPVWRKMMVDGYNDCYDIARAFPQSALNKNPIMKVFGRHMVFFKCSKKVEAHNCGMAQANHMIETLYGSNDGGYDWSQHGLPNNKYERAAMYMKVQYGTATPEEKFVHNFFYMDPAM*
Ga0103836_1004971 | Ga0103836_10049711 | F000021 | DNIHKYYYLIAFTAYMREAADAARNLVPDDKKAELTLTGGKCAIPANQLKLSRKFAAFMEENKKLIDLCETGKGNLQWERDIPPAALKNLEDLAKGDFKANLGKIIHDIYQTAHIMFSDMPQGDHKKRAKYRFASKTLMRILPAKEKAEVEGLIEKKTITLDLYEILGKCTWTPV*
Ga0103836_1006528 | Ga0103836_10065281 | F003495 | MKEINENNIPDEHEISKDTNAPPKDTAALDKRSVLANVNSTGSGI*
Ga0103836_1007214 | Ga0103836_10072142 | F062416 | TGAASEEVWIEPVEMDLADDADQIEIAIAFEQRALYSPKLVHRKTQDGEYSEVTDAKYETQTDANGNDQTVAMALVNEGGAYVVEDEVNVGAVVAIVFAGLVFIGAIMFIVWWKFFKSPPAADEYSVNYASGTTTA*
Ga0103836_1007335 | Ga0103836_10073351 | F085736 | Y*TLYVFVYF*LHHFTPSTVNYFFFER*NIAELDEIRFYAVAPH*YFRPLMGLLVVSPSHYEGLM*MGL*FVLLSALPIIYNWYNSNNNYLPIIPMQSSLLQTSGFILFMLSMYCVSSMLPCGRYYYEPEGGYVGNP*VKFSYQYMYLYLG*FLHHLDLIEHYSFQYAQHILRRHSNLRKVARARLNTPILDRVAES
Ga0103836_1008697 | Ga0103836_10086971 | F004957 | IMNVLAVFLSLLVSVTCLKPCPGDTRGDRRCNHDSTHRVCAKIGVEGTSFWEFTGQRSWCGTIGNYGGPYGSLPRCPPEEPTWCICKWATARWIAGEGCGDAVEFDCDATDVCDLKASYLDFNVDLKPAHDCMEKKCKKQWDACPDNRVSNGV*
Ga0103836_1008715 | Ga0103836_10087151 | F000076 | GQEVIFTENTAPDREIWDEKGGLIKKGAAEIAERDQDHVPEVKVLTTLGMFAVPCTAETTVGEIRQKVQEWKPEYSLEQIELVCKDKDAWGPLFNKTSRPRDDDETVESLWSDWDIPRRMMQVALWIKEKNEETGEWESTFWKMLEESKCDDWSFQGHTI*
Ga0103836_1009122 | Ga0103836_10091221 | F003081 | MPEPGLVIELREEMFNDTRYGAEVYYMHVRGVDTLMVLSYVHILKKIFLKNYVTSESDG*
Ga0103836_1011183 | Ga0103836_10111831 | F000073 | VFAGYMREQAAAAREAVEADKKADVALPASGKCSIPATDLKVSRTFVQFMEENASLRTLIDTGKGNLQWERDIPPEALANLEGLAAKDFAGNLGKIIHDIYQTAHVMFGDMPQGDHKKRAKYRFASKTLMRILPADKKTEVEGLIAKSEMTMDLYEILGKCPWTQPK*
Ga0103836_1011347 | Ga0103836_10113471 | F000235 | VAAAVLASAVSAATVLQGREQLVKAFGTKNIKFDPKTKTNFCNQCHEITVSSTGGTLEHQPQRLGKYVVDGSIWEDMIPFWKSANNQYITPDPNSNPIMYYIKWVVSESVGGFNAGLMNDAYTDGYNCPYEIADQWQYEYQRQWFVDPTLKFTCTKTRDD*
Ga0103836_1011476 | Ga0103836_10114761 | F000147 | GTWINSNMFIATWKKIQEEGLWQDKDWTVKVDVDAIFLPSRLRTYVQNKEVTPNGMYFENCKYVNFGFFGSLEVLSHDAAATYMENLDDCKTSLNYLGREKLYGNEPWGEDLFAQRCMDLHGVDKVDAFDLNTDAACAAWRPEGQKKNMKWLPDCATVKTPAMHHFKTPKLYF
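
Records like those in the Sequences listing are often easier to reuse in FASTA form. The sketch below is illustrative only and is not an NMPFamsDB export format: `ProteinRecord` and `to_fasta` are hypothetical names, the header layout is an arbitrary choice, and the trailing `*` is assumed to be a stop-codon marker that downstream tools do not need. Only the trailing `*` is stripped; any internal `*` characters are left untouched.

```python
from typing import Iterable, NamedTuple


class ProteinRecord(NamedTuple):
    scaffold_id: str
    protein_id: str
    family: str
    sequence: str


def to_fasta(records: Iterable[ProteinRecord], width: int = 60) -> str:
    """Render records as FASTA text, wrapping sequence lines at `width` residues."""
    lines = []
    for rec in records:
        # Assumption: a trailing '*' marks the translation stop and can be dropped.
        seq = rec.sequence.rstrip("*")
        lines.append(f">{rec.protein_id} scaffold={rec.scaffold_id} family={rec.family}")
        lines.extend(seq[i:i + width] for i in range(0, len(seq), width))
    return "\n".join(lines) + "\n"


# Two short records copied from the Sequences listing.
records = [
    ProteinRecord(
        "Ga0103836_1002276", "Ga0103836_10022761", "F041786",
        "QDGKIDAETFRAQLMTFGNKFSASEVDDAFAEFQIDGGMIDAAHLKGLMVSK*",
    ),
    ProteinRecord(
        "Ga0103836_1006528", "Ga0103836_10065281", "F003495",
        "MKEINENNIPDEHEISKDTNAPPKDTAALDKRSVLANVNSTGSGI*",
    ),
]

fasta = to_fasta(records)
print(fasta)
```

Keeping the scaffold and family identifiers in the header line makes it possible to trace each sequence back to its rows in the Associated Scaffolds and Associated Families listings.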




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming"