NMPFamsDB

A database of Novel Metagenome Protein Families

Metagenome Family F068646

Overview

Basic Information
Family ID F068646
Family Type Metagenome
Number of Sequences 124
Average Sequence Length 171 residues
Representative Sequence LSEGAASGLVIVKALPMLVPESVVQQSAEKKRVKKGILGGSEERFVFGKTLYLPYLEFSYQYSTAKGFLSKQSVLAQGRSAVMALREVNLGFYPELISLLPQLADVESDSGSVVQGVDSTILVSERLDELRTMLSDYDKQLLELSKQHDSLTKTDRAKRE
Number of Associated Samples 87
Number of Associated Scaffolds 124

Quality Assessment
Transcriptomic Evidence No
Most common taxonomic group Archaea
% of genes with valid RBS motifs 69.35 %
% of genes near scaffold ends (potentially truncated) 98.39 %
% of genes from short scaffolds (< 2000 bps) 91.94 %
Associated GOLD sequencing projects 80
AlphaFold2 3D model prediction Yes
3D model pTM-score 0.26
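
The percentages in this block can be read back as approximate member counts. The sketch below assumes each percentage is taken over the 124 family sequences reported under Basic Information; the names `FAMILY_SIZE` and `percent_to_count` are illustrative and not part of NMPFamsDB.

```python
# Sketch: convert reported percentages back to approximate gene counts.
# Assumption: every percentage uses the 124 family sequences as denominator.
FAMILY_SIZE = 124  # "Number of Sequences" from Basic Information

reported_percentages = {
    "valid RBS motif": 69.35,
    "near scaffold end": 98.39,
    "short scaffold (< 2000 bps)": 91.94,
}


def percent_to_count(percent: float, total: int = FAMILY_SIZE) -> int:
    """Return the nearest whole number of members for a percentage."""
    return round(percent / 100 * total)


counts = {label: percent_to_count(p) for label, p in reported_percentages.items()}

for label, count in counts.items():
    print(f"{label}: about {count} of {FAMILY_SIZE} genes")
```

Read this way, nearly every member gene sits close to a scaffold end, which is why the page flags the family as potentially truncated.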


Most Common Taxonomy
Group Archaea (95.161 % of family members)
NCBI Taxonomy ID 2157
Taxonomy All Organisms → cellular organisms → Archaea

Most Common Ecosystem
GOLD Ecosystem Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil (37.097 % of family members)
Environment Ontology (ENVO) Unclassified (52.419 % of family members)
Earth Microbiome Project Ontology (EMPO) Free-living → Non-saline → Soil (non-saline) (58.065 % of family members)




Structure & Topology

Predicted Secondary Structure and Topology

Classification: Globular
Signal Peptide: No
Secondary Structure distribution: α-helix: 29.79%, β-sheet: 24.47%, Coil/Unstructured: 45.74%

Predicted 3D Structure

pTM-score: 0.26

Low Quality Model:

This family has a low confidence model (pTM < 0.7) and has not been screened against SCOPe or PDB.
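
The low-confidence label above follows from a simple cutoff. The sketch below assumes the 0.7 pTM threshold quoted on this page is the only criterion; the constant and function names are hypothetical.

```python
# Sketch: flag a predicted model as low confidence from its pTM score.
# Assumption: the only rule is the "pTM < 0.7" cutoff quoted on this page.
PTM_CUTOFF = 0.7


def is_low_confidence(ptm_score: float, cutoff: float = PTM_CUTOFF) -> bool:
    """Return True when the pTM score falls below the cutoff."""
    return ptm_score < cutoff


family_ptm = 0.26  # pTM-score reported for family F068646
low = is_low_confidence(family_ptm)
print(f"pTM {family_ptm}: low confidence = {low}")
```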



Gene Neighborhood

Neighboring Pfam domains

Pfam ID | Name | % Frequency in 124 Family Scaffolds
PF00892 | EamA | 10.48
PF06195 | DUF996 | 2.42
PF02803 | Thiolase_C | 0.81
PF00583 | Acetyltransf_1 | 0.81
PF02518 | HATPase_c | 0.81

Neighboring Clusters of Orthologous Genes (COGs)

COG ID | Name | Functional Category | % Frequency in 124 Family Scaffolds
COG2245 | Uncharacterized membrane protein, contains DUF996 domain | Function unknown [S] | 2.42
COG0183 | Acetyl-CoA acetyltransferase | Lipid transport and metabolism [I] | 0.81



Phylogeny

NCBI Taxonomy

Name | Rank | Taxonomy | Distribution
All Organisms | root | All Organisms | 100.00 %
Unclassified | root | N/A | 0.00 %


Associated Scaffolds


Scaffold | Taxonomy | Length
3300002558|JGI25385J37094_10044115 | All Organisms → cellular organisms → Archaea | 1521
3300002560|JGI25383J37093_10024884 | All Organisms → cellular organisms → Archaea | 2009
3300002561|JGI25384J37096_10227136 | All Organisms → cellular organisms → Archaea | 546
3300002561|JGI25384J37096_10263405 | All Organisms → cellular organisms → Archaea | 502
3300002562|JGI25382J37095_10038222 | All Organisms → cellular organisms → Archaea | 1877
3300002562|JGI25382J37095_10058293 | All Organisms → cellular organisms → Archaea | 1469
3300002562|JGI25382J37095_10203651 | All Organisms → cellular organisms → Archaea | 598
3300002911|JGI25390J43892_10109025 | All Organisms → cellular organisms → Archaea | 625
3300002916|JGI25389J43894_1014870 | All Organisms → cellular organisms → Archaea | 1331
3300005166|Ga0066674_10283685 | All Organisms → cellular organisms → Archaea | 780
3300005166|Ga0066674_10296098 | All Organisms → cellular organisms → Archaea | 760
3300005171|Ga0066677_10737697 | All Organisms → cellular organisms → Archaea | 548
3300005172|Ga0066683_10397123 | All Organisms → cellular organisms → Archaea | 852
3300005172|Ga0066683_10883530 | All Organisms → cellular organisms → Archaea | 513
3300005174|Ga0066680_10176058 | All Organisms → cellular organisms → Archaea | 1347
3300005180|Ga0066685_10265047 | All Organisms → cellular organisms → Archaea | 1186
3300005181|Ga0066678_10970194 | All Organisms → cellular organisms → Archaea | 553
3300005446|Ga0066686_10167119 | All Organisms → cellular organisms → Archaea | 1462
3300005450|Ga0066682_10347586 | All Organisms → cellular organisms → Archaea | 955
3300005450|Ga0066682_10403395 | All Organisms → cellular organisms → Archaea | 875
3300005468|Ga0070707_102142029 | All Organisms → cellular organisms → Bacteria → Terrabacteria group → Actinobacteria → Actinomycetia → Corynebacteriales → Nocardiaceae → Williamsia → unclassified Williamsia → Williamsia sp. D3 | 527
3300005552|Ga0066701_10619321 | All Organisms → cellular organisms → Archaea | 657
3300005553|Ga0066695_10257782 | All Organisms → cellular organisms → Archaea | 1101
3300005555|Ga0066692_10498086 | All Organisms → cellular organisms → Archaea | 775
3300005555|Ga0066692_10704356 | All Organisms → cellular organisms → Archaea | 626
3300005555|Ga0066692_11022270 | All Organisms → cellular organisms → Archaea | 505
3300005598|Ga0066706_10825988 | All Organisms → cellular organisms → Archaea | 728
3300006032|Ga0066696_10006438 | All Organisms → cellular organisms → Archaea | 5486
3300006034|Ga0066656_11133967 | All Organisms → cellular organisms → Archaea | 503
3300006796|Ga0066665_10271260 | All Organisms → cellular organisms → Archaea | 1351
3300006797|Ga0066659_10133911 | All Organisms → cellular organisms → Archaea | 1738
3300006797|Ga0066659_11658500 | All Organisms → cellular organisms → Archaea | 538
3300006800|Ga0066660_11417150 | All Organisms → cellular organisms → Archaea | 545
3300006806|Ga0079220_11836980 | All Organisms → cellular organisms → Archaea | 535
3300007258|Ga0099793_10003321 | All Organisms → cellular organisms → Bacteria | 5552
3300007258|Ga0099793_10125808 | All Organisms → cellular organisms → Archaea | 1202
3300007258|Ga0099793_10329648 | All Organisms → cellular organisms → Archaea | 744
3300007265|Ga0099794_10349750 | All Organisms → cellular organisms → Archaea | 769
3300009038|Ga0099829_10065680 | All Organisms → cellular organisms → Archaea | 2737
3300009038|Ga0099829_10125684 | All Organisms → cellular organisms → Bacteria → Proteobacteria | 2024
3300009038|Ga0099829_10864428 | All Organisms → cellular organisms → Archaea | 751
3300009137|Ga0066709_104356854 | All Organisms → cellular organisms → Archaea | 516
3300010304|Ga0134088_10115498 | All Organisms → cellular organisms → Archaea | 1268
3300010325|Ga0134064_10003245 | All Organisms → cellular organisms → Bacteria | 3913
3300010326|Ga0134065_10232452 | All Organisms → cellular organisms → Archaea | 680
3300010329|Ga0134111_10489811 | All Organisms → cellular organisms → Archaea | 538
3300010329|Ga0134111_10516486 | All Organisms → cellular organisms → Archaea | 526
3300010333|Ga0134080_10068515 | All Organisms → cellular organisms → Archaea | 1420
3300010333|Ga0134080_10618622 | All Organisms → cellular organisms → Archaea | 529
3300010336|Ga0134071_10055461 | All Organisms → cellular organisms → Archaea | 1807
3300011269|Ga0137392_11539052 | All Organisms → cellular organisms → Archaea | 524
3300011271|Ga0137393_10501320 | All Organisms → cellular organisms → Archaea | 1042
3300011271|Ga0137393_11031186 | All Organisms → cellular organisms → Archaea | 700
3300012189|Ga0137388_10478623 | All Organisms → cellular organisms → Archaea | 1156
3300012189|Ga0137388_11961438 | All Organisms → cellular organisms → Archaea | 514
3300012198|Ga0137364_10422606 | All Organisms → cellular organisms → Archaea | 998
3300012199|Ga0137383_11326251 | All Organisms → cellular organisms → Archaea | 512
3300012200|Ga0137382_10080129 | All Organisms → cellular organisms → Archaea | 2115
3300012200|Ga0137382_10657009 | All Organisms → cellular organisms → Archaea | 750
3300012200|Ga0137382_11011678 | All Organisms → cellular organisms → Archaea | 596
3300012201|Ga0137365_10169785 | All Organisms → cellular organisms → Archaea | 1635
3300012201|Ga0137365_10859576 | All Organisms → cellular organisms → Archaea | 662
3300012203|Ga0137399_11525363 | All Organisms → cellular organisms → Archaea | 556
3300012205|Ga0137362_10421229 | All Organisms → cellular organisms → Archaea | 1156
3300012206|Ga0137380_10552128 | All Organisms → cellular organisms → Archaea | 1011
3300012208|Ga0137376_10211475 | All Organisms → cellular organisms → Archaea | 1679
3300012208|Ga0137376_11420888 | All Organisms → cellular organisms → Archaea | 585
3300012209|Ga0137379_10687589 | All Organisms → cellular organisms → Archaea | 929
3300012209|Ga0137379_11833521 | All Organisms → cellular organisms → Archaea | 500
3300012210|Ga0137378_10247845 | All Organisms → cellular organisms → Archaea | 1658
3300012210|Ga0137378_10683272 | All Organisms → cellular organisms → Archaea | 937
3300012210|Ga0137378_10817460 | All Organisms → cellular organisms → Archaea | 844
3300012210|Ga0137378_11267348 | All Organisms → cellular organisms → Archaea | 654
3300012210|Ga0137378_11440495 | All Organisms → cellular organisms → Archaea | 601
3300012211|Ga0137377_11476442 | All Organisms → cellular organisms → Archaea | 606
3300012356|Ga0137371_10875319 | All Organisms → cellular organisms → Archaea | 683
3300012357|Ga0137384_11443715 | All Organisms → cellular organisms → Archaea | 537
3300012361|Ga0137360_10609282 | All Organisms → cellular organisms → Archaea | 935
3300012363|Ga0137390_11163107 | All Organisms → cellular organisms → Archaea | 719
3300012918|Ga0137396_10144041 | All Organisms → cellular organisms → Archaea | 1732
3300012918|Ga0137396_10425693 | All Organisms → cellular organisms → Archaea | 984
3300012918|Ga0137396_10672100 | All Organisms → cellular organisms → Archaea | 765
3300012927|Ga0137416_12139665 | All Organisms → cellular organisms → Archaea | 514
3300012972|Ga0134077_10274538 | All Organisms → cellular organisms → Archaea | 703
3300012975|Ga0134110_10422830 | All Organisms → cellular organisms → Archaea | 595
3300012976|Ga0134076_10214847 | All Organisms → cellular organisms → Archaea | 810
3300014150|Ga0134081_10033372 | All Organisms → cellular organisms → Archaea | 1498
3300014157|Ga0134078_10295292 | All Organisms → cellular organisms → Archaea | 695
3300015054|Ga0137420_1146309 | All Organisms → cellular organisms → Archaea | 675
3300015357|Ga0134072_10144470 | All Organisms → cellular organisms → Archaea | 777
3300017657|Ga0134074_1006178 | All Organisms → cellular organisms → Bacteria | 3833
3300017659|Ga0134083_10459056 | All Organisms → cellular organisms → Archaea | 564
3300017934|Ga0187803_10332854 | All Organisms → cellular organisms → Archaea | 609
3300018431|Ga0066655_10776505 | All Organisms → cellular organisms → Archaea | 651
3300018433|Ga0066667_10129941 | All Organisms → cellular organisms → Archaea | 1738
3300018433|Ga0066667_11203709 | All Organisms → cellular organisms → Archaea | 659
3300018468|Ga0066662_10033068 | All Organisms → cellular organisms → Archaea | 3098
3300018468|Ga0066662_10209560 | All Organisms → cellular organisms → Archaea | 1550
3300021046|Ga0215015_10175893 | All Organisms → cellular organisms → Archaea | 797
3300021046|Ga0215015_10442518 | All Organisms → cellular organisms → Archaea | 1240
3300021046|Ga0215015_10689225 | All Organisms → cellular organisms → Archaea | 1808
3300021088|Ga0210404_10094661 | All Organisms → cellular organisms → Archaea | 1495
3300026297|Ga0209237_1075894 | All Organisms → cellular organisms → Archaea | 1560
3300026297|Ga0209237_1082713 | All Organisms → cellular organisms → Bacteria | 1465
3300026313|Ga0209761_1289196 | All Organisms → cellular organisms → Archaea | 579
3300026317|Ga0209154_1233214 | All Organisms → cellular organisms → Archaea | 669
3300026318|Ga0209471_1076311 | All Organisms → cellular organisms → Archaea | 1488
3300026327|Ga0209266_1286373 | All Organisms → cellular organisms → Archaea | 519
3300026328|Ga0209802_1242059 | All Organisms → cellular organisms → Archaea | 630
3300026334|Ga0209377_1154646 | All Organisms → cellular organisms → Archaea | 862
3300026334|Ga0209377_1303094 | All Organisms → cellular organisms → Archaea | 532
3300026480|Ga0257177_1002455 | All Organisms → cellular organisms → Archaea | 1991
3300026532|Ga0209160_1173055 | All Organisms → cellular organisms → Archaea | 925
3300026537|Ga0209157_1349208 | All Organisms → cellular organisms → Archaea | 533
3300026551|Ga0209648_10335470 | All Organisms → cellular organisms → Archaea | 1053
3300027655|Ga0209388_1082413 | All Organisms → cellular organisms → Archaea | 926
3300027846|Ga0209180_10110573 | All Organisms → cellular organisms → Archaea | 1570
3300027862|Ga0209701_10685518 | All Organisms → cellular organisms → Archaea | 530
3300027882|Ga0209590_10053785 | All Organisms → cellular organisms → Archaea | 2255
3300028536|Ga0137415_11303576 | All Organisms → cellular organisms → Archaea | 544
3300032180|Ga0307471_100302596 | All Organisms → cellular organisms → Archaea | 1687
3300032180|Ga0307471_100392086 | All Organisms → cellular organisms → Archaea | 1513
3300032180|Ga0307471_100924692 | All Organisms → cellular organisms → Archaea | 1039
3300032180|Ga0307471_103318312 | All Organisms → cellular organisms → Archaea | 570
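
Each scaffold identifier in the list above appears to join a 10-digit sample Taxon OID and a scaffold name with a `|`; the prefixes match the Taxon OIDs in the Associated Samples list. The sketch below, under that assumption, splits identifiers and counts scaffolds per sample. The helper name `split_scaffold_id` is hypothetical and the three identifiers are copied from the list.

```python
# Sketch: split "TaxonOID|ScaffoldName" identifiers and count scaffolds per sample.
# Assumption: the text before the first "|" is the sample Taxon OID.
from collections import Counter
from typing import Tuple


def split_scaffold_id(scaffold_id: str) -> Tuple[str, str]:
    """Split an identifier into (taxon_oid, scaffold_name) at the first '|'."""
    taxon_oid, scaffold_name = scaffold_id.split("|", 1)
    return taxon_oid, scaffold_name


example_ids = [
    "3300002558|JGI25385J37094_10044115",
    "3300002561|JGI25384J37096_10227136",
    "3300002561|JGI25384J37096_10263405",
]

scaffolds_per_sample = Counter(split_scaffold_id(s)[0] for s in example_ids)

for taxon_oid, n in scaffolds_per_sample.items():
    print(f"{taxon_oid}: {n} scaffold(s)")
```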




Environmental Properties

Associated Habitat Types

Habitat | Taxonomy | Distribution
Vadose Zone Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Vadose Zone Soil | 37.10%
Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Soil | 24.19%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Grasslands → Grasslands Soil | 15.32%
Grasslands Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Grasslands Soil | 12.90%
Hardwood Forest Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Hardwood Forest Soil | 3.23%
Soil | Environmental → Terrestrial → Soil → Unclassified → Unclassified → Soil | 2.42%
Soil | Environmental → Terrestrial → Soil → Unclassified → Forest Soil → Soil | 1.61%
Freshwater Sediment | Environmental → Aquatic → Freshwater → Wetlands → Sediment → Freshwater Sediment | 0.81%
Agricultural Soil | Environmental → Terrestrial → Soil → Unclassified → Agricultural Land → Agricultural Soil | 0.81%
Soil | Environmental → Terrestrial → Soil → Loam → Grasslands → Soil | 0.81%
Corn, Switchgrass And Miscanthus Rhizosphere | Environmental → Terrestrial → Soil → Loam → Agricultural Soil → Corn, Switchgrass And Miscanthus Rhizosphere | 0.81%




Associated Samples

Taxon OID | Sample Name | Habitat Type
3300002558 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_27_2013_1_40cm | Environmental
3300002560 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_20cm | Environmental
3300002561 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cm | Environmental
3300002562 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 08_20_2013_1_40cm | Environmental
3300002911 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_2_20cm | Environmental
3300002916 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_20cm | Environmental
3300005166 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_123 | Environmental
3300005171 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_126 | Environmental
3300005172 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132 | Environmental
3300005174 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129 | Environmental
3300005180 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_134 | Environmental
3300005181 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_127 | Environmental
3300005446 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135 | Environmental
3300005450 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_131 | Environmental
3300005468 | Corn, switchgrass and miscanthus rhizosphere microbial communities from Kellogg Biological Station, Michigan, USA - KBS K5-50-2 metaG | Environmental
3300005552 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_150 | Environmental
3300005553 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_144 | Environmental
3300005555 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141 | Environmental
3300005598 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_155 | Environmental
3300006032 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_145 | Environmental
3300006034 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_105 | Environmental
3300006796 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_114 | Environmental
3300006797 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_108 | Environmental
3300006800 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_109 | Environmental
3300006806 | Agricultural soil microbial communities from Georgia to study Nitrogen management - GA AS100 | Environmental
3300007258 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_3 | Environmental
3300007265 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - Rivendell_Oct2014_Saprolite_2_DNA_Rhizosphere_1 | Environmental
3300009038 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG | Environmental
3300009137 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_158 | Environmental
3300010304 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_09182015 | Environmental
3300010325 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_2_0_1 metaG | Environmental
3300010326 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_2_24_1 metaG | Environmental
3300010329 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_11112015 | Environmental
3300010333 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_2_24_1 metaG | Environmental
3300010336 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_5_09082015 | Environmental
3300011269 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4A metaG | Environmental
3300011271 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h3.4B metaG | Environmental
3300012189 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h1.4A metaG | Environmental
3300012198 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_20_16 metaG | Environmental
3300012199 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_40_16 metaG | Environmental
3300012200 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_20_16 metaG | Environmental
3300012201 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_L_40_16 metaG | Environmental
3300012203 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czorhiz3.16 metaG | Environmental
3300012205 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_100_16 metaG | Environmental
3300012206 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_100_16 metaG | Environmental
3300012208 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_20_16 metaG | Environmental
3300012209 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_80_16 metaG | Environmental
3300012210 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_60_16 metaG | Environmental
3300012211 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_L_40_16 metaG | Environmental
3300012356 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage1_R_40_16 metaG | Environmental
3300012357 | Vadose zone soil microbial communities from Sagehorn Ranch, Mendocino, California, USA - Sage2_R_60_16 metaG | Environmental
3300012361 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - Mad1_60_16 metaG | Environmental
3300012363 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - 15con2h2.4A metaG | Environmental
3300012918 | Vadose zone soil microbial communities from Angelo Coast Range Reserve, California, USA - czcobulk3.16 metaG | Environmental
3300012927 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Illumina Assembly) | Environmental
3300012972 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_24_1 metaG | Environmental
3300012975 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_20cm_5_11112015 | Environmental
3300012976 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_40cm_5_0_1 metaG | Environmental
3300014150 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_20cm_5_24_1 metaG | Environmental
3300014157 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_20cm_2_24_1 metaG | Environmental
3300015054 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_2_16fungal (PacBio error correction) | Environmental
3300015357 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Wat_20cm_5_0_1 metaG | Environmental
3300017657 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Rain_40cm_2_09212015 | Environmental
3300017659 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - 15_D_Met_40cm_5_24_1 metaG | Environmental
3300017934 | Wetland sediment microbial communities from Neuse River Estuary, North Carolina, USA - Control_3 | Environmental
3300018431 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_104 | Environmental
3300018433 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_116 | Environmental
3300018468 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_111 | Environmental
3300021046 | Soil microbial communities from Shale Hills CZO, Pennsylvania, United States - 90cm depth | Environmental
3300021088 | Forest soil microbial communities from Barre Woods Harvard Forest LTER site, Petersham, Massachusetts, United States - Inc-BW-H-28-M | Environmental
3300026297 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 10_02_2013_1_40cm (SPAdes) | Environmental
3300026313 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 09_25_2013_1_40cm (SPAdes) | Environmental
3300026317 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_121 (SPAdes) | Environmental
3300026318 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_128 (SPAdes) | Environmental
3300026327 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_132 (SPAdes) | Environmental
3300026328 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_129 (SPAdes) | Environmental
3300026334 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_141 (SPAdes) | Environmental
3300026480 | Soil microbial communities from H.J. Andrews Experimental Forest, Oregon, United States - NL-07-B | Environmental
3300026532 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_153 (SPAdes) | Environmental
3300026537 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample Angelo_135 (SPAdes) | Environmental
3300026551 | Grasslands soil microbial communities from the Angelo Coastal Reserve, California, USA - Sample 9_17_2013_115cm (SPAdes) | Environmental
3300027655 | Vadose zone soil and rhizosphere microbial communities from the Eel River Critical Zone Observatory, Northern California to study diel carbon cycling - Rivendell_Oct2014_Saprolite_2_DNA_Bulk_1 (SPAdes) | Environmental
3300027846 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H2.8 metaG (SPAdes) | Environmental
3300027862 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con2H3.8 metaG (SPAdes) | Environmental
3300027882 | Vadose zone soil microbial communities from the Eel River Critical Zone Observatory, Northern California, USA - CZOApr15con1.8 metaG (SPAdes) | Environmental
3300028536 | Vadose zone soil fungal communities from Angelo Coast Range Reserve, California, USA - CZODoug3_1_1_16fungal (Hybrid Assembly) | Environmental
3300032180 | Hardwood forest soil microbial communities from Morgan-Monroe State Forest, Indiana, United States - atmos_gasesAM3C_515 | Environmental


Family Sequences

Protein ID | Sample Taxon ID | Habitat | Sequence
JGI25385J37094_100441152 | 3300002558 | Grasslands Soil | LNQGVAPERVIVKALPMLVPESAVQHYAEKKRVKRGILGGSEERFVFGKTLYLPYLEFSYQYSTAKGFLSKQSVLAHGRSTVMALREVNLGFYPELISLLPQLADVEPDSGSVVRGVESTILVSERLDELRGMLSDFDHQLQELSKQYDSLTKTDRAIQ
JGI25383J37093_100248844 | 3300002560 | Grasslands Soil | MLVPESVLQQNAEKKRVKKGILGGSEERFVFGKTLYLPYLEFTYQYSTEKGFRSKQSVLGQGRSVVMALREVNLGFYPELVSVVPQLVDMEPDLGSMVQGVDSTVLVSERLDELRKTLSDYDYQLQELSKQHDSLTKTDSARQEIKENIDHLKKTRETRWKMFADGLKLPSKI
JGI25384J37096_102271361 | 3300002561 | Grasslands Soil | LNEGSAPGQIIVKALPLLIPESVVQQDAEKKRVKRGILGGSEERFVFGKTLYLPYLEFSYQYSTAKGFLSKQSVLVQGRSAVMALREVNLGFYPELISLLPQLADVEPDSGSVLQGVDSTILVSQRLDELRGMLSDFDHKLREL
JGI25384J37096_102634051 | 3300002561 | Grasslands Soil | RFVFGKTLYLPYLEFTYQYSTEKGFLSKRSILGQGRSVVMALREVNLGFDPDLISLLPQLADIEPDSGSVVQGVDSTILVSERLDELKKTLSDYDHQLQELSKKHDSLTKMDSARREVKENIDHLKKTRETRWKMFADGLKLPSKVDLNKFELLDGNLFYMPYFVAR
JGI25382J37095_100382222 | 3300002562 | Grasslands Soil | MGQFYYQAFDRAEAGILSPRWLMSYHQERLEPEEIKRMEEGPAPEQITVKALSMLVPESVVQQNAEKKRVKRGILGGSEERVVFGRTLYLPYLDFAYQYSTEKGFLSKQSVLSQGRSVFMALREVNMGFDPELVSIAPQLADLEIDAGSVVQGVDSTILVSERLDELRGMLSDFDHQLQELSKRYDSLTKTDRAR
JGI25382J37095_100582931 | 3300002562 | Grasslands Soil | MAYQLSNGDTRIQEVRTLNQGVAPERVIVKALPMLVPESAVQHYAEKKRVKRGILGGSEERFVFGKTLYLPYLEFSYQYSTAKGFLSKQSVLAHGRSTVMALREVNLGFYPELISLLPQLADVEPDSGSVVRGVESTILVSERLDELRGMLSDFDRQLQELSKQYDSLTKTDRARQEIKENIDHLKKTRETR
JGI25382J37095_102036511 | 3300002562 | Grasslands Soil | VPESVVQQDAEKKRVKKGILGGSEERFVFGKTLYLPYLEFSYQYSTAKGFLSKQSVLVQGRSVVMALREVNLGFYPELISLLPQLADVEPDYASLVRGVESTILVSERLDELRGMLSEFDHQLQELSKQYDSLTKTDRAKQEIKENIDHL
JGI25390J43892_101090251 | 3300002911 | Grasslands Soil | MLVPESLVQQDAEKKRVKRGMLGGSEERFVFGKILYLPYLEFTYQYATEKGFLSKQSLLAQGRSVVMALRQVNLGFYPELISIMHQLTDVEAESGSVVQGVDSTILVGERLDELRGMLSDYDHQLEELSNQGDSLTKRDSAKQGIKE
JGI25389J43894_10148701 | 3300002916 | Grasslands Soil | LSEGAASGLAIVKALPMXMPESVVQQSAEKKRVKKGILGGSEERFVFGKTLYLPYLEFSYQYSTAKGFLSKQSVLAQGRSAVMALREVNLGFYPELISFLPQLVDVEPDSGSVVRGVDSTILVSERLDELRRMLSDFDHRLQELSKQYDSLTKTDRAKQEIKENLDHLKKTRD
Ga0066674_102836851 | 3300005166 | Soil | MLMPESVVQQSAEKKRVKKGILGGSEERFVFGKTLYLPYLEFSYQYSTAKGFLSKQSVLAQGRSAVMALREVNLGFYPELISFLPQLVDVEPDSGSVVRGVDSTILVSERLDELRRMLSDFDHQLQELSKQYDSLTKTDRAKQEIKENLDHLKKTRDTRWKMFADGLKLPAKI
Ga0066674_102960981 | 3300005166 | Soil | MLVPESVVQQDAEKKRVKKGILGGSEERFVFGKILYLPYLDFTYQYSTEKGFLSKRSIIGQGRSVVIALREVNLGFYPELVSLAPQQAEMESDSGSVIQGVDSTILVSERLDELKKMLSDYDKELLELSKQHDSLTKTDP
Ga0066677_107376971 | 3300005171 | Soil | MLMPESVVQQSAEKKRVKKGILGGSEERFVFGKTLYLPYLEFSYQYSTAKGFLSKQSVLAQGRSAVMALREVNLGFYPELISFLPQLVDVEPDSGSVVRGVDSTILVSERLDELRRMLSDFDHRLQELSKQYDSLTKTDRAKQE
Ga0066683_103971231 | 3300005172 | Soil | MLVPESVVQQDAEKKRVKKGILGGSEERFVFGKILYLPYLDFTYQYSTEKGFLSKRSIIGQGRSVVIALREVNLGFYPELVSLAPQQAEMESDSGSVIQGVDSTILVSERLDELKKMLSDYDKELLELSKQHDSLTKTDPAKQ
Ga0066683_108835301 | 3300005172 | Soil | WLISYHRESWEHIESKKLSEAAVPGQTIVKGLSMMVPESIVQQNAERKRVKKGILGGSEERFVFGKTLYLPYLEFTYQYSTEKGFLSKRSILGQGRSVVMALREVNLGFDPDLISLLPQLADIEPDSGSVVQGVDSTILVSERLDELKKTLSDYDHQLQELSKKHDSLTKM
Ga0066680_101760582 | 3300005174 | Soil | LNEGSAPGQIIVKALPLLIPESVVQQDAEKKRVKRGILGGSEERFVFGKTLYLPYLEFSYQYSTAKGFLSKQSVLVQGRSAVMALREVNLGFYPELISLLPQLADVEPDSGSVLQGVDSTTLVSQRLDELRGMLSDFDHKLRELSKLYDSLTK
Ga0066685_102650471 | 3300005180 | Soil | MLVPESLVQQDAEKKRVKRGMLGGSEERFVFGKILYLPYLEFTYQYATEKGFLSKQSLLAQGRSVVMALRQVNLGFYPELISIMHQLTDVEAESGSVVQGVDSTILVGERLDELRGMLSDYDHQLEELSNQGDSLTKRDSAKQGIKENIDHLKKTRETRWKMFADG
Ga0066678_109701941 | 3300005181 | Soil | MLVPESVVQHDAEKKRVKKGILGGSEEKFVFGKTLYLPYLDFTYQYSTEKGFLSKQSVLAQGRSAVMALREVNLGFYPELISLLPQLADVEPDSGSVVRGVESTILVSERLDELKRMLSDFDHRLQELSKQYDSLTKTDRAKQEIKENID
Ga0066686_101671192 | 3300005446 | Soil | LSEGAASGLAIVKALPMLVPESVVQQSAEKKRVKKGILGGSEERFVFGKMLYLPYLEFSYQYSTAKGFLSKQSVLAQGRSAVLALREVNLGFYPELISFLPQLVDVEPDSGSVVRGVDSTILVSERLDELRRMLSDFDHQLQELSKQYD
Ga0066682_103475861 | 3300005450 | Soil | MLVPESVVQQSAEKKRVKKGILGGSEERFVLGKTLYLPYLDFTYRYSTEKGFLSKQSVLTQGRSAVMALREVNLGFYPELISLLPQLADIELDSGSVVQGVDSTILVSERLDELKKMLSDYDKELLELSKQHDSLTKTDPAKQEIKENIDHLKKTRDTRWKMFADGLKLPSKVDLEKF
Ga0066682_104033951 | 3300005450 | Soil | MLVPESVVQQDAEKKRVKKGILGGSEERFVFGKILYLPYLDFTYQYSTEKGFLSKRSIIGQGRSVVIALREVNLGFYPELVSLAPQQAEMESDSGSVIQGVDSTILVSERLDELKKMLSDYDKELLELSKQHDSFTKTGPAKQEIKENIDHLKKTRDTRWKMFADGLKLPSKVDLEKF
Ga0070707_1021420291 | 3300005468 | Corn, Switchgrass And Miscanthus Rhizosphere | EERFVFGKTLYLPYLEFSYQYPTVKGIFSKQTVLAQGRSAVMALREVDLGFYPELVSLTSQLTEEEADYNSVVPGIDSTTLVRERLDELRRTLSDYDNQLKELSKRYDSLTKTDRARQDLKENIDHLKETRESRWKIFADGLKLPSKIEMEGFECLEANLFYMPYFVARFSRGGE
Ga0066701_106193212 | 3300005552 | Soil | MSYHPDPYSGEGQKVNEGAAPEQIVVKALPLLVPESVVQQDAEKKRVKKGILGGSEERFVFGKILYLPYLDFTYQYSTEKGFLSKRSIIGQGRSVVIALREVNLGFYPELVSLAPQQAEMESDSGSVIQGVDSTILVSERLDELKKMLSDYDKELLELSKQHDSLTKTDRA
Ga0066695_102577822 | 3300005553 | Soil | MMVPESIVQQNAERKRVKKGILGGSEERFVFGKTLYLPYLEFTYQYSTEKGFLSKQSILGQGRSVVMALREVNLGFDPDLISLLPQLADIEPDSGSVVQGVDSTILVSERLDELKKTLSDYDHQLQELSKKHDSLTKMDSARREVKENIDHLKKTRETRWKMFADGLK
Ga0066692_1049808613300005555SoilLKAEGQKMDEGTAPEHIIVRALPMLVPESVVQQDAEKKRVKRGILGGSEERFVFGKTLYLPYLEFTYQYTTEKGFLSKQSILAQGRSVVMALREVNLGFYPELVSLAPQLAEMESDHGSVIQGVDSTILVGERLDELKKMLSDYDKQSLELSKQHDSLTKTDRA
Ga0066692_1070435613300005555SoilMLVPESIVQQDAEKKRVKKGILGGSEERFIFGKTLYLPYLEFSYQYSTAKGFLSKQSVLAQGRSAVMALREVNLGFYPELISLLPQLADVESDSGSVMQGVDSTILVSERLDELRAMLSDYDRQLLELSKQHDSLTKTDRA
Ga0066692_1102227013300005555SoilMSYHPDPYSGEGQKVNEGAAPEQIVVKALPLLVPESVVQQDAEKKRVKKGILGGSEERFVFGKILYLPYLDFTYQYSTEKGFLSKRSIIGQGRSVVIALREVNLGFYPELVSLAPQQAEMESDSGSVIQGVDSTILVSERLDELKKMLSDYDKELLELSKQHDSLTK
Ga0066706_1082598813300005598SoilLSEGAASGLVIVKALPMLVPESVVQHDAEKKRVKKGILGGSEERFVFGKTLYLPYLDFTYQYSTEKGFLSKQSVLAQGRSAVMALREVNLGFYPELISLLPQLADVESDSGSVVQGVDSTILVSERLDELRGMLSDFDHELQELSKQFDSLTKT
Ga0066696_1000643873300006032SoilLSEGAASGLAIVKALPMLVPESVVQQSAEKKRVKKGILGGSEERFVFGKTLYLPYLEFSYQYSTAKGFLSKQSVLAQGRSAVMALREVNLGFYPELISFLPQLVDVEPDSGSVVRGVDSTILVSERLDELRRMLSDFDHQLQELSKQYDSLTKTDRA
Ga0066656_1113396713300006034SoilGILGGSEERFVFGKTLYLPYLEFSYQYSTAKGFLSKQSVLAQGRSAVMALREVNLGFYPELISFLPQLVDVEPDSGSVVRGVDSTILVSERLDELRRMLSDFDHQLQELSKQYDSLTKTDRAKQEIKENLDHLKKTRDTRWKMFADGLKLPAKIDLEKIEFLEGNLF
Ga0066665_1027126013300006796SoilMSYHPDPYSGEGQKVNEGAAPEQIVVKALPLLVPESVVQQDAEKKRVKKGILGGSEERFVFGKILYLPYLDFTYQYSTEKGFLSKRSIIGQGRSVVIALREVNLGFYPELVSLAPQQAEMESDSGSVIQGVDSTILVSERLDELKKMLSDYDKELLELSKQHDSLQRLTLQNRRSRKTLII*
Ga0066659_1013391133300006797SoilLNEGVAPGQVIIKALPLLVPESIVQQDAEKRRVKRGILGGSEEQFVFGKTLYLPYLEFSYQYSTAKGFISKQSVLAQGRSVVMALREVNFGFYPELISLLPQLADVESDSGSVVKGVDSTILVSERLDELKTMLSDYDKQFLELSEQHDSLTKIDRAKREIEENIDHLKKTREMR
Ga0066659_1165850013300006797SoilKRVKKGILGGSEERFVFGKTLYLPYLDFTYQYSTEKGLLSKQSVLAQGRSTVMALREVNLGFYPELISLLPQLADVESDSGSVVQGVDSTILVSERLDELRTMLSDYDKQLLELSKQHDSLTKTDRAKREIEENIDHLKKTREMRWKMFADGLKLPSKIDPEKFELLEGNLFHMPYFFA
Ga0066660_1141715013300006800SoilAPLQRGEIRRLSEGAASGLAIVKALPMLVPESVVQHDAEKKRVKKGILGGSEERFVFGKTLYLPYLDFTYQYSTEKGFLSKQSVLAQGRSAVMALREVNLGFYPELISLLPHLADVEPDSGSVVRGVESTILVSERLDELRRMLSDFDHQLQELSKQYDSLTKTDRAKQEIKENIDHLKK
Ga0079220_1183698013300006806Agricultural SoilMLLPESAIQEDAEKKRVKRGMLGGSEEKFVFGKILYLPYLEFTYQYSTEKGFLSKQNALAQGRSAVMALREVNLGFYPELVSLLPQLAEVDANPGSVVQGVDSTILVNERLEELKTMLSDYDGKLRELSKQHDSL
Ga0099793_1000332173300007258Vadose Zone SoilMDEGDVPEHIIIRALPLLVPESVVQEDAEKKRVKKGILGGSEERFVFGKTLHLPYLEFTYQYSTQKVFRSKQSILAQGRSVVLALREVNLGFYPELISIIPQLADAEPEPGSVLQGVDSTILVSERLDELRRMLSDYDHQLQELSKQRDSLTKNDSAKQEIKE
Ga0099793_1012580813300007258Vadose Zone SoilMIVRALPMLVPESIVQQDAEKKRVKKGILGGSEERFVFGKTLYLPYLEFTYQYSTEKGFLSKQSVLAQGRSVVMALREVNLGFYPEIISIMPQLADAEPESGSVLQGVDSTILVSERLDELRRMLSDYDHQLQELSKQRDSLTKNDSAKQEIKENIDHIRKTREMRWKMFADGLKLPSKVDLEKFELL
Ga0099793_1032964813300007258Vadose Zone SoilLTEDSPPGQIIIRALPMLVPESLVQQDAEKKRVKRGMLGGSEERFVFGKILYLPYLEFTYQYTTEKGFLSKQSLLAQGRSVVMALRQVNLGFYPELISIMPQLADAKPEPGSVLQGVDSTNLVSERLDELRRMLSDYDHQLQELSKQRDSLTKNDSAKQEIKE
Ga0099794_1034975013300007265Vadose Zone SoilLTEDFASGQILVKALPMLLPESVVQGDAEKKRVKRGMLGGSEERFVFGKILYLPYLEFTYQYSIEKGFLSRQSVLAQGRSTVMALREVNLGFYSELIALLPQVVDIEADSSSVVQGVDSTILLSERLEELKTMLSDYDSQLQELSKQYDSLTKTDRARQEIKENIDH
Ga0099829_1006568013300009038Vadose Zone SoilMEESTAPVHITIRALPMLVPESVVQRDAEKKRVKRGILGGSEERFVFGKTLYLPYLEFTYQYSSPKGFLSKQSVLAQGRSVVMALREVNLGFYPELISIMHQLADVEAESGSVVQGVDSTILVGERLDELKGMLSDYDHQLQELSKQRDSLTKRDSAKQEIAENIEHMKKTREMRWKMFADGLKLPSKVDLEKFEL
Ga0099829_1012568413300009038Vadose Zone SoilLELWGKGKSLNEGAAPEKDIVRALPMLVPESVVQQNSEKKRVKKGILGGSEERTVFGKTLYLPYLDFSYQYSTERGFLSKQSVLAQGRSVVMGLREVNLGFYPELVSIMHQLTDVEPESGSVVQGVDSTILVSERLDELKKMLSDYDHQLQELSKQRESLTKTDSARQEIKENIDHMKKTRETRWKMFSDGLRLPSKIDLETFELLEAN
Ga0099829_1086442813300009038Vadose Zone SoilMIVRALPMLVPESLVQRDAEKKRVKKGILGGSEERFVFGKTIYLPYLEFTYQYSTEKGFLSKQNVLAQGRSAVMALREVNLGFYPELVSLLPQLIDIEPDPGSVVHGVDSTTLVSERLEELKTMLSDYDSQLQELSKQYDSLTKTDRARQEIKENIDHLKQTREARWKMFADGLKL
Ga0066709_10435685413300009137Grasslands SoilGSEERVVFGRTLYLPYLDFAYQYSTEKGFLSKQSVLSQGRSVFMALREVNMGFDPELVSIAPQLADLEIDAGSVVQGVDSTILVSERLDELRGMLSDFDHQLQELSKRYDSLTKTDRARKEIKENIDHLRKTRETRWKMFADGLKLPSKVDLEKFELLEGNLFYMPYFVAR
Ga0134088_1011549823300010304Grasslands SoilLSEGAASGLVIVKALPMLVPESVVQQSAEKKRVKKGILGGSEERFVFGKTLYLPYLEFSYQYSTAKGFLSKQSVLAQGRSAVMALREVNLGFYPELISLLPQLADVESDSGSVVQGVDSTILVSERLDELRTMLSDYDKQLLELSKQHDSLTKTDRAKRE
Ga0134064_1000324563300010325Grasslands SoilMPESVVQQSAEKKRVKKGILGGSEERFVFGKTLYLPYLEFSYQYSTAKGFLSKQSVLAQGRSAVMALREVNLGFYPELISFLPQLVDVEPDSGSVVRGVDSTILVSERLDELRRMLSDFDHQLQELSKQYDSLTKTDRAKQEIKENLDHLKKTRDTRWKMFADGLKL
Ga0134065_1023245213300010326Grasslands SoilEFSLKSVQFYYQISNQGEKLEFSYLVWLMSYHKGPLQQGDVERLSEGDTSGQVTVKALPMLVPESVVQQSAEKKRVKKGILGGSEERFVFGKTLYLPYLEFSYQYSTAKGFLSKQSVLAQGRSAVMALREVNLGFYPELISFLPQLVDVEPDSGSVVRGVDSTILVSERLDELRGILSDFDHQLQELSKQYDSLTKTDRARQEIKENIDHLKKTRDMRWKMFGEGL
Ga0134111_1048981113300010329Grasslands SoilMPESVVQQSAEKKRVKKGILGGSEERFVFGKTLYLPYLEFSYQYSTAKGFLSKQSVLAQGRSAVMALREVNLGFYPELISFLPQLVDVEPDSGSVVRGVDSTILVSERLDELRGMLSDFDHQLQELSKQYDSLTKTDRAR
Ga0134111_1051648613300010329Grasslands SoilVEIKRLNEEAARGQIIVKALPMLVPESVVQQDAEKKRVKKGILGGSEEHFVFGKTLYLPYLDFTYQYSTEKGLLSKQSVLAQGRSAVMALREVNLGFYPELISLLPQLADVESDSGSVVQGVDSTILVSERLDELRTMLSDYDKQLLELSKQHDSLTKTDRAKREIEENI
Ga0134080_1006851523300010333Grasslands SoilLSEGAASGLVNVKALPMLVPESVVQQSAEKKRVKKGILGGSEERFVFGKTLYLPYLDFTYQYSTEKGLLSKQSVLAQGRSTVMALREVNLGFYPELISLLPQLADVESDSGSVVQGVDSTILVSERLDELKTMLSDYDKQLLELSKQHDSLTKTDRAKREIEENIDHLKKTREMRWKMFADGLKLPSKIDPEK
Ga0134080_1061862213300010333Grasslands SoilRVKKGILGGSEEHFVFGKTLYLPYLDFTYRYSTEKGFLSKRSIIGQGRSVVIALREVNLGFYPELVSLAPQQAEMESDSGSVIQGVDSTILVSERLDELKKMLSDYDKELLELSKQHDSLTKTDPAKQEIKENIDHLKKTRDTRWKMFADGLKLPSKVDLEKFEFLEGNLFYMPYF
Ga0134071_1005546133300010336Grasslands SoilLSEGAASGLVIVKALPMLMPESVVQQSAEKKRVKKGILGGSEERFAFGKTLYLPYLEFSYQYSTAKGFLSKQSVLAQGRSAVLALREVNLGFYPELISFLPQLVDVEPDSGSVVRGVDSTILVSERLDELRRMLSDFDHQLQELSKQYDSLTKTDRAKQEIKENIDHLKKTRDTRWKMFA
Ga0137392_1153905213300011269Vadose Zone SoilMAYQPAKRTLGIGEGLIVVAGDAPEHIIVKALPMLVPESLVQQDAEKKRVKKGILGGSEERFVFGKTLYLPYLDFSYQYSTERGFLSKQSVLAQGRSVVMGLREVNLGFYPELISIMHQLTDVEPESGSVVQGIDSTILVSERLDELKKMLSEYDHQL
Ga0137393_1050132013300011271Vadose Zone SoilLEPGEVNRLNEEAAPGQIIVKALQMLVPESVVQQDAEKKRLKKGILGGSEERFVFGKILYLPYLDFAYQYSSEKGFLSKQSVLSQGRSILMALREVNLGFYPELFSVVPQLVEMEPDPGSVVQGVDSTILVSERLNELKRMLSDYDHQLQELLKQRDSLTKRD
Ga0137393_1103118613300011271Vadose Zone SoilMEESTAPVHITIRALPMLVPESVVQRDAEKKRVKRGILGGSEERFVFGKTLYLPYLEFTYQYSSPKGFLSKQSVLAQVMSVVMAHREVNLGFYPELISIMHQLADVEAESGSVVQGVDSTILVGERLDELKGMLSDYDHQLQELSKQRDSLTKRDSAKQEIAENIEHMKKTRETRWKMFADGLK
Ga0137388_1047862313300012189Vadose Zone SoilMEESTAPVHITIRALPMLVPESVVQRDAEKKRVKRGILGGSEERFVFGKTLYLPYLEFTYQYSSQKGFLSKQSVLAQGRSVVMALREVNLGFYPEIISIMHQLADVEAESGSVVQGVDSTILVGERLDELKGMLSDYDQQLQELSKQRDSLTKRDSAKQEIAENIEHMKKTREMRWKMFADGLKLPSKVDLEKF
Ga0137388_1196143813300012189Vadose Zone SoilSEERVLFAKTFYLPYLDFTYQYSTVKGFLSKQTILGQGRSVVMALREVNFGFYPEMATLAPQLADMESESNSVVRGVDSTVLVSERLDELKQMLSGYDKQLEDLSQQYNSLLKTDNAKQDVKENIDSLRKTRESRWKMFAEGLKLPSKIDMDKFEFLEGNLFYVPYFMAK
Ga0137364_1042260613300012198Vadose Zone SoilLAYELSYGGPLQRGEIERLSEGTAPGLVIVKALPMLVPESVVQHDAEKKRVKKGILGGSEERFVFGKTLYLPYLEFNYQYSTAKGLLSKQSILAQGRSVVMALREVNLGFFPELISLLPQLADVESDSGSVVQGVDSTILVRERLDELKIMLSDYDKQLLELSKQHDSLTKTDRAKREIEENIDHLKKTREMRWK
Ga0137383_1132625113300012199Vadose Zone SoilRVKRGILGGSEERFVFGKTLYLPYLEFSYQYSTAKGFLSKQNVLVQGRSVVMALREVNLGFYPELISLLPQLADVEHDSGSVVRGVESTILVSERLDELKKMLSDYDKQLLELSKQHDSLTKIDRAKREIKENIDHLKKTREMRWKMFADGLKLPSKIDLETFEFLEGSV
Ga0137382_1008012913300012200Vadose Zone SoilLNEGAALGQRIVKALPMLVPESVVQHDAEKKRVKKGILGGSEERFVFGKILYLPYLDFTYQYSTEKGFLSKQSVLAQGRSVVMALREVNLGFYPELISLLPKLADVESDSGSVVQGVDSTILISERLDELRGMLSDFDRQLQELSKQYDSLTKTDHAKQEIKENLDHLKKTRDTRWKMFADGLKLPSKIDLETFEFLEGSV
Ga0137382_1065700913300012200Vadose Zone SoilLSEEGAASGLVIVKALPMLVPESVVQQDAEKKRVKKGILGGSEERFVFGKILYLPYLDFTYQYSTEKGFLSKRSIIGQGRSVVIALREVNLGFYPELVSLAPQQAEMESDSGSVIQGVDSTILVSQRLDELKKMLSDYDKELLELSKQHDSLT
Ga0137382_1101167813300012200Vadose Zone SoilLSEGAASGLVIVKALPMLVPESVVQQSAEKKRVKKGILGGSEERFVFGKTLYLPYLDFTYQYSTEKGLLSKQSVLAQGRSTVMALREVNLGFYPELISLLPQLADVESDSGSVVQGVDSTILVSERLDELKTMLSDYDKQLLELSKQHDSLTKTDRAK
Ga0137365_1016978533300012201Vadose Zone SoilLNEGSAPGQIIIKALPLLVPESVVQQDAEKKRVKKGILGGSEERFVFGKTLYLPYLEFSYQYSTAKGFLSKQSVLAQGRSTVMALREVNLGFYPEVISLLPQLANLEPDSGSVVRGVESTILVSERLDELRGMLSDFDHQLQELSKQYDSLTKTDRAKQEIKENIDHLKKTRDMRWKMFADGLKLP
Ga0137365_1085957613300012201Vadose Zone SoilLSEEGAASGLVIVKALPMLVPESVVQQDAEKKRVKKGILGGSEERFVFGKILYLPYLDFTYQYSTEKGFLSKRSIIGQGRSVVIALREVNLGFYPELVSLAPQQAEMESDSGSVIQGVDSTILVSERLDELKKMLSDYDKE
Ga0137399_1152536313300012203Vadose Zone SoilEKKRVKKGILGGSEERFIFGKTLYLPYLEFTYQYSTEKGFLSKQSILAQGRSFLMALREVNLGFYPELISVVPQLADVEPDSGSVVEGVNSTILVSERLDELRRILSDYDYQLQELSKRYDSLAKTDPAKGEIKENIEHLRKTRETRWKMFADGLKLPSKIDLEKFELLEGGLFYIPFFVARFSR
Ga0137362_1042122913300012205Vadose Zone SoilMLVPESLVQQDAEKKRVKRGMLGGSEERFVFGKILYLPYLEFTYQYSTQKGFLSKQSVLAQGRSVVMALREVNLGFYPELISIMHQLTDVEAESGSVVQGVDSTILVGERLDELRGMLSDYDHQLQELSNQGDSLTKRDSAKQ
Ga0137380_1055212813300012206Vadose Zone SoilLSEGAASGLVIVKALPMLVPESVVQQSAEKKRVKKGILGGSEERFVFGKTLYLPYLDFTYRYSTEKGFLSKQSVLTQGRSAVMALREVNLGFYPELISLLPQLADIGPDSSSVVQGVDSTILVSERLDELKKMLSDYDKKLLELSKQHDSLTKTDPAKQEIKENIDHLKKTRDT
Ga0137376_1021147513300012208Vadose Zone SoilVEIKRLNEEAARGQIIVKALPMLVPESVVQQDAEKKRVKKGILGGSEEHFVFGKTLYLPYLDFTYQYSTEKGLLSKQSVLAQGRSAVMALREVNLGFYPELISLLPQLADVESDSGSVVQGVDSTILVSERLDELRTMLSDYDKQLLELSKQHDSLTKTDRAKREIEENIDHLKKT
Ga0137376_1142088813300012208Vadose Zone SoilLSEGAASGLVNVKALPMLVPESVVQQSAEKKRVKKGILGGSEERFVFGKTLYLPYLDFTYQYSTEKGLLSKQSVLAQGRSTVMALREVNLGFYPELISLLPQLADVESDSGSVVQGVDSTILVSERLDELKTMLSDYDKQLLELSKQHDSLTKTDRAKREIEE
Ga0137379_1068758913300012209Vadose Zone SoilLSEGAASGLVIVKALPMLMPESVVQQSAEKKRVKKGILGGSEERFVFGKTLYLPYLEFSYQYSTAKGFLSKQSVLAQGRSAVLALREVNLGFYPELISFLPQLVDVEPDSGSVVRGVDSTILVSERLDELRRMLSDFDHQLQELSKQYDSLTKTDRAKQ
Ga0137379_1183352113300012209Vadose Zone SoilVQHYAEKKRVKRGILGGSEERFVFGKTLYLPYLEFSYQYSTAKGFLSKQSVLAHGRSTVMALREVNLGFYPELISLLPQLADVEPDSGSVVRGVESTILVSERLDELRGMLSDFDRQLQELSKQYDSLTKTDRARQEIKENIDHLKKTRETRWKMFADGLKLPAKI
Ga0137378_1024784513300012210Vadose Zone SoilLNEGAALGQRIVKALPMLVPESVVQHDAEKKRVKKGILGGSEERFVFGKILYLPYLDFTYQYSTEKGFLSKQSVLAQGRSVVMALREVNLGFYPELISLLPKLADVESDSGSVVQGVDSTILISERLDELRGMLSDFDRQLQELSKQYDSLTKTDRAKQEIKENLDHLKKTRDTRWKMFADGLKLPSKIDLETFEFLEGSV
Ga0137378_1068327213300012210Vadose Zone SoilLAYELSYGGPLQRGEIERLSEGTAPGLVIVKALPMLVPESVVQHDAEKKRVKKGILGGSEERFVFGKTLYLPYLEFNYQYSTAKGLLSKQSILAQGRSVVMALREVNLGFFPELISLLPQLADVESDSGSVVQGVDSTILVRERLDELKIMLSDYDKQLLELSKQHDSLTKTDRAKREIE
Ga0137378_1081746013300012210Vadose Zone SoilLSEGAASGLVIVKALPMLVPESVVQQSAEKKRVKKGILGGSEERFVFGKTLYLPYLDFTYRYSTEKGFLSKQSVLTQGRSAVMALREVNLGFYPELISLLPQLADIELDSGSVVQGVDSTILVSERLDELKKMLSDYDKELLELSKQHDSLTKTDPAKQ
Ga0137378_1126734813300012210Vadose Zone SoilLSEEGAASGLVIVKALPMLVPESVVQQDAEKKRVKKGILGGSEERFVFGKILYLPYLDFTYQYSTEKGFLSKRSIIGQGRSVVIALREVNLGFYPELVSLAPQQAEMESDSGSVIQGVDSTILVSERLDELKKMLSDYDKELLELSKQHDSLTK
Ga0137378_1144049513300012210Vadose Zone SoilIRALPLLVPESIVQQDAEKKRVKRGILGGSEERFVFGKTLYLPYLEFSYQYSTAKGFLSKQSVLAQGRSVVMALREVNLGFYPELISLLPKLADVESDSGSVVQGVESTILVSERLDELRRILSDFDRQLQELSKQYDSLTKTDHSKREIKENLDHMKGTRDTRWKMFAEGLKLPSRIDLETFEFLEGSVFYMPYFLAR
Ga0137377_1147644213300012211Vadose Zone SoilAPLQRGEIERLSEGAASGLVIVKALPMLVPESVVQQDAEKKRVKKGILGGSEERFVFGKILYLPYLDFTYQYSTEKGFLSKRSIIGQGRSVVIALREVNLGFYPELVSLAPQQAEMESDSGSVIQGVDSTILVSERLDELKKMLSDYDKELLGLSKQHDSLTKTDRAKREIKENIDHLKKTRDTRWKMFADGLKLPSKIDL
Ga0137371_1087531913300012356Vadose Zone SoilVIKRLNEGAVPGQITVKALPMLVPESNVQQDAEKKRVKKGILGGSEERFIFGKILYLPYIDFTYQYSTEKGLLSKQTAIGRGRSVVMALREVNLGFYPELVSLVPQLSEIESDSGSVIQGVDSTILVSERLEELKTMLSDYDKQLLELSKQHDSLTKIDQAKREI
Ga0137384_1144371513300012357Vadose Zone SoilVQQDAEKKRVKRGILGGSEERFVFGKTLYLPYLEFSYQYSTAKGFLSKQNVLVQGRSVVMALREVNLGFYPELISLLPQLADVEHDSGSVVRGVESTILVSERLDELKKMLSDYDKQLLELSKQHDSLTKIDRAKREIKENIDHLKKTREMRWKMFADGLKLPSKIDLETFEFLEGSV
Ga0137360_1060928213300012361Vadose Zone SoilLNEGSAPGQIIVKALPLLIPESVVQQDAEKKRVKRGILGGSEERFVFGKTLYLPYLEFSYQYSTAKGFLSKQSVLVQGRSAVMALREVNLGFYPELISLLPQLADVEPDSGSVLQGVDSTTLVSQRLDELRGMLSDFDHK
Ga0137390_1116310713300012363Vadose Zone SoilMIVRALPMLVPESLVQQDAEKKRVKKGILGGSEERFVFGKTIYLPYLEFTYQYSTEKGFLSKQNVLAQGRSAVMALREVNLGFYPELVSLLPQLIDIEPDSGSVVHGVDSTILVSERLEELKTMLSDYDSQLQELSKQYDSLTKTDRARQEIKENIDHLK
Ga0137396_1014404113300012918Vadose Zone SoilMDEGDVAEHIIIRALPLLVPESVVQEDAEKKRVKKGILGGSEERFVFGKTLYLPYLEFTYQYSTQKGFLSKQSILAQGRSVVLALREVNLGFYPELISIIPQLADAEPEPGSVLQGVDSTILVSERLDELRRMLSDYDHQLQELSKQRDSLTKND
Ga0137396_1042569313300012918Vadose Zone SoilMLLPESVAQEDAEKKRVKRGMLGGSEERFVFGKILYLPYLEFTYQYSTEKGFFSKQMFLAQGRSAVMALREVNLGFYPELVSLLPQLIDIEADSGSVVQGVDSTILVSERLEGLRTMLSDYDGQLQDLSKQYDSLTKTDRARQEIKENIDHLKRTR
Ga0137396_1067210023300012918Vadose Zone SoilMIVRALPMLVPESLVQQDAEKKRVKKGILGGSEERFVFGKTLYLPYLEFTYQYSTEKGFLSKQSVLAQGRSVVMALREVNLGFYPELISIMPQLADAEPESGSVLQGVDSTILVSERLDELRRMLSDYDHQLQELSKQRDSLTKNDSAKQEIKENIDHI
Ga0137416_1213966513300012927Vadose Zone SoilSGRMIVRALPMLVPESLVQQDAEKKRVKRGMLGGSEERFVFGKILYLPYLEFTYQYSIEKGFLSKQSVLAQGKSTVMALREVNLGFYSELIALLPLVVDIEADSASVVQGVDSTILVSERLKELKTMLSDYDSQLQELSKQYDSLTKTDRARHEIKENLDHLKRTRQTRWK
Ga0134077_1027453813300012972Grasslands SoilVEIKRLNEEAARGQIIVKALPMLVPESVVQQDAEKKRVKKGILGGSEEHFVFGKTLYLPYLDFTYRYSTEKGLLSKQSVLAQGRSAVMALREVNLGFYPELISLLPQLADVESDSGSVVQGVDSTILVSERLDELRTMLSDYDKQLLEL
Ga0134110_1042283013300012975Grasslands SoilVEIKRLNEEAARGQIIVKALPMLVPESVVQQDAEKKRVKKGILGGSEEHFVFGKTLYLPYLDFTYQYSTEKGLLSKQSVLAQGRSAVMALREVNLGFYPELISLLPQLADVESDSGSVVQGVDSTILVSERLDELRTMLSDYDKQLLELSKQHDSLTKTDRAKREIEENIDHLKKTREMRWKM
Ga0134076_1021484713300012976Grasslands SoilVEIKRLNEEAARGQIIVKALPMLVPESVVQQDAEKKRVKKGILGGSEEHFVFGKTLYLPYLDFTYQYSTEKGLLSKQSVLAQGRSAVMALREVNLGFYPELISLLPQLADVESDSGSVVQGVDPTILVSERLDELRGMLSDFDHELQELSKQFDS
Ga0134081_1003337213300014150Grasslands SoilMSYHKGPLQQGDVERLSEGDTSGQVTVKALPMLVPESVVQQDAEKKRVKRGILGGSEERFVFGKILYLPYLDFTYQYSTEKGFLSKQSVLAQGKSTVMALREVNLGFYPELISLLPQLADVEPDSGSVVRGVESTILVSERLDELRRMLSDFDHQLQELSKQY
Ga0134078_1029529213300014157Grasslands SoilMSYHKGPLQQGDVERLSEGDTSGQVTVKALPMLVPESVVQQDAEKKRVKRGILGGSEERFVFGKILYLPYLDFTYQYSTEKGFLSKQSVLAQGKSTVMALREVNLGFYPELISLLPQLADVEPDSDSVMRGVDSTILVSERLDELRGMLSDFDHQLQ
Ga0137420_114630913300015054Vadose Zone SoilLLEAREGQEMDEGDVAEHIIIRALPLLVPESVVQEDAEKKRVKKGILGGSEERFVFGKTLYLPYLEFTYQYSTQKGFLSKQSILAQGRSVVLALREVNLGFYPELISIIPQLADAEPEPGSVLQGVDSTILVSERLDELRRMLSDYDHQLQELSKQRDSLTKNDSAKQESKENIDHIRKTREMRWKMFADGLTETAL*
Ga0134072_1014447013300015357Grasslands SoilLSEGAASGLAIVKALPMLMPESVVQQSAEKKRVKKGILGGSEERFVFGKTLYLPYLEFSYQYSTAKGFLSKQSVLAQGRSAVMALREVNLGFYPELISLLPQLADVESDSGSVVQGVDSTILVSERLDELRRMLSAFDHQLQELAKQYDSLTKTDRAKQEIKENLD
Ga0134074_100617853300017657Grasslands SoilLNEEAARGQIIVKALPMLVPESVVQQDAEKKRVKKGILGGSEEHFVFGKTLYLPYLDFTYRYSTEKGLLSKQSVLAQGRSAVMALREVNLGFYPELISLLPQLADVESDSGSVVQGVDSTILVSERLDELKTMLSDYDKQLLELSKQHDSLTKTDRAKREIEENIDHLKKTREMRWKMF
Ga0134083_1045905613300017659Grasslands SoilLTEDSLPGQIIIRALQMLVPESLVQQDAEKKRVKRGMLGGSEERFVFGKILYLPYLEFTYQYTTEKGFLSKQSLLAQGRSVVMALRQVNLGFYPELISIVPQLADMEPESGSVVQGVDSTILVSERLDELRRMLSDYDHQLRELSKQRDSLTKR
Ga0187803_1033285413300017934Freshwater SedimentKTSDTKMVDQGQPNPDFQMAYQLSRGIVGACRSIALNEGAASDQIIVKALPMMVPETVVQHDAEKKRVKKGILGGSEERYVFGKTFYLPYLDFTYQYLSEKGFLSKQSVLGQGRSVAMSLREVNLGFYPELISLMPQLTDIEPESGSLVQGVDSTILVSERLEELKKMLSDYDNKLRELLNQHDILTKTDPARQEIRENIEH
Ga0066655_1077650513300018431Grasslands SoilMLMPESVVQQSAEKKRVKKGILGGSEERFAFGKTLYLPYLEFSYQYSTAKGFLSKQSVLAQGRSAVMALREVNLGFYPELISFLPQLVDVEPDSGSVVRGVDSTILVSERLDELRRMLSDFDHRLQELSKQYDSL
Ga0066667_1012994113300018433Grasslands SoilVEIKRLNEEAARGQIIVKALPMLVPESVVQQDAEKKRVKKGILGGSEEHFVFGKTLYLPYLDFTYQYSTEKGLLSKQSVLAQGRSAVMALREVNLGFYPELISLLPQLADVESDSGSVVQGVDSTILVSERLDELRTMLSDYDKQLLELSKQHDSLTKTDRAKREIEENIDHLKKTREMRWK
Ga0066667_1120370913300018433Grasslands SoilMLVPESLVQQSAEKKRVKKGILGGSEERFVFGKTLYLPYLEFSYQYSTAKGFLSKQSVLAQGRSAVMALREVNLGFYPELISLLPQLADVEPDSDSVVRGVDSTILVSERLDELRGMLSDFDHQLQELSKQYDSLTKTDRARQEIKENIDHLKKTRDMRWKMFGDRLNLPSKVDLETFEFLEGRLFY
Ga0066662_1003306813300018468Grasslands SoilMLVPESVVQQDAEKKRVKRGILGGSEERFVFGKTLYLPYLEFTYQYTTEKGFLSKQSILAQGRSVVMALREVNLGFYPELVSLAPQLAEMESDHGSVIQGVDSTILVGERLDELKKMLSDYDKQSLELSKQHDSLTKTDRAKREIKENIDHLKKTREMRWKMFTDGLKLPSKIDPEKFELLEGNLFHMP
Ga0066662_1020956013300018468Grasslands SoilLNEGSAPGQIIVKALPLLIPESVVQQDAEKKRVKRGILGGSEERFVFGKTLYLPYLEFSYQYSTAKGFLSKQSVLVQGRSAVMALREVNLGFYPELISLLPQLADVEPDSGSVLQGVDSTILVSQRLDELRGMLSDFDHKLRELSKLYDSLTKADRAKQEIKENIDHLKKTRDMRWKMFADGLKLPSKTDLEKFE
Ga0215015_1017589313300021046SoilMSTTTPDIIQTALFTTFRRTMALISMRINRIQNIKSNQEQTFEPDGQGKGWDLTEDFAPGQIIVKALPMLVPESIVQQDAEKKRVKRGMLGGSEERFVFGKILYLPYLEFTYQYSADKGFLSKKSVLAQGRSVVMALREVNLGFYPELISLLSQLVDIETDSSSVVQGVDSTILVGERLDELRRILSDYDHQLQELSKQYDSLTKTDYAKEGIKENIDHMKKTRETRWKMFTHGLKLPSNMDLEKLELVEGN
Ga0215015_1044251813300021046SoilMLVPESILQEDAEKKRVKRGMLGGSEERFVFGKILFLPYLDFAYQYSAEKGFFSKQSVLSQGRSVVMALREVNLGFYPELTSLLPQLVDIQANSGSVLQGVDSTIIVSERLEDLRTILSDYDSRLQDLSNQYDSLTKTDPTKEEIKENIDHLKNNREARWKMFADGLKLPSKIDLEKFELLEGNLFYMPYFV
Ga0215015_1068922523300021046SoilMDEGTAPEHIIIRALPMLVPVSVVQQDAEKRRVKKGILGGSEERFVFGKTLCLPYLEFTYQYSTEKGFLSKRSILAQGRSVVMALREVNLGFYPELISIMPQLADVESESGSVVPGVDSTILVSERLDELKRMLSDYDHQLHELSKQRDSLTKKASAKQEINENIDHMKKTRETVSYTHLTLPTICSV
Ga0210404_1009466113300021088SoilMLQPESFIQQDAEKKRVKRGMLGGSEERFVFGKILYLPYLEFTYQYSSEKGFFSKQSILAQGRSAVMALREVNLGFYPELISLFPQLVDIEADSGSVVQGVDSTILVNERLEELRAILSDYDGRLQELSRQYDSLTKTDRAKQDIKENIDHLKKTRETRWKIFADGLKLPSKIDLEKFELLEGNLF
Ga0209237_107589413300026297Grasslands SoilLNQGVAPERVIVKALPMLVPESAVQHYAEKKRVKRGILGGSEERFVFGKTLYLPYLEFSYQYSTAKGFLSKQSVLAHGRSTVMALREVNLGFYPELISLLPQLADVEPDSGSVVRGVESTILVSERLDELRGMLSDFDRQLQELSKQYDSLTKTDRARQEIKENIDHLKKTRETRWKMFADGLKLPAKIDL
Ga0209237_108271313300026297Grasslands SoilVEIKRLNEEAAQGQIIVKALPMLVPESVVQQDAEKKRVKKGILGGSEEHFVFGKTLYLPYLDFTYQYSTEKGLLSKQSVLGQGRSTVMALREVNLGFYPELISLLPQLADVEPDSDSVVRGVDSTILVSERLDELRGMLSDFDHQLQELSKQYDSLTKTDRARQEIKENIDHLKKT
Ga0209761_128919613300026313Grasslands SoilYQLSNGDTRIQEVRTLNQGVAPERVIVKALPMLVPESAVQHYAEKKRVKRGILGGSEERFVFGKTLYLPYLEFSYQYSTAKGFLSKQSVLAHGRSTVMALREVNLGFYPELISLLPQLADVEPDSGSVVRGVESTILVSERLDELRGMLSDFDRQLQELSKQYDSLTKTDRARQEIKENIDHLKKTRETRWK
Ga0209154_123321413300026317SoilLSEGAASGLVIVKALPMLVPESVVQQSAEKKRVKKGILGGSEERFVFGKTLYLPYLDFTYQYSTEKGFLSKQSVLAQGRSAVMALREVNLGFYPELISLLPHLADVEPDSGSVVRGVDSTILVSERLDELRRMLSDFDHQLQELSKQYDSLTKTNRAKQEIKENIDHLKKTRDTRWKMFSDGLKLPSKVDLETFEFLE
Ga0209471_107631123300026318SoilLKAEGQKMDEGTAPEHIIVRALPMLVPESVVQQDAEKKRVKRGILGGSEERFVFGKTLYLPYLEFTYQYTTEKGFLSKQSILAQGRSVVMALREVNLGFYPELVSLAPQLAEMESDHGSVIQGVDSTILVGERLDELKKMLSDYDKQSLELSKQHDSLTKTDRAKREIKENIDHLKKTREMRWKMFTDGLKLPSKID
Ga0209266_128637313300026327SoilMLVPESVVQQSAEKKRVKKGILGGSEERFVFGKTLYLPYLDFTYRYSTEKGFLSKQSVLTQGRSAVMALREVNLGFYPELISLLPQLADIGPDSSSVVQGVDSTILVSERLDELKKMLSDYDKELLELSKQHDSLT
Ga0209802_124205913300026328SoilLNEGSAPGQIIVKALPLLIPESVVQQDAEKKRVKRGILGGSEERFVFGKTLYLPYLEFSYQYSTAKGFLSKQSVLVQGRSAVMALREVNLGFYPELISLLPQLADVEPDSGSVLQGVDSTILVSQRLDELRGMLSDFDHKLRELSKLYDSLTKADRAKQEIKENIDHLKKTRDMRWKMFADGLKL
Ga0209377_115464623300026334SoilLKAEGQKMDEGTAPEHIIVRALPMLVPESVVQQDAEKKRVKRGILGGSEERFVFGKTLYLPYLEFTYQYTTEKGFLSKQSILAQGRSVVMALREVNLGFYPELVSLAPQLAEMESDHGSVIQGVDSTILVGERLDELKKMLSDYDKQSLELSKQHDSLTKTDRAKREIKENID
Ga0209377_130309413300026334SoilMSYHPDPYSGEGQKVNEGAAPEQIVVKALPLLVPESVVQQDAEKKRVKKGILGGSEERFVFGKILYLPYLDFTYQYSTEKGFLSKRSIIGQGRSVVIALREVNLGFYPELVSLAPQQAEMESDSGSVIQGVDSTILVSERLDELKKMLSDYDKELLELSKQHDSLTKTDRARREIK
Ga0257177_100245523300026480SoilMIVRALPMLVPESLVQQDAEKKRVKKGILGGSEERFVFGKTIYLPYLEFTYQYSTEKGFLSKQNVLAQGRSAVMALREVNLGFYPELVSLLPQLIDIEPDSGSLVHGVDSTILVSERLEELKTMLSDYDSQLQELSKQYDSLTKTDRARQEIKENIDHLKQTREARW
Ga0209160_117305513300026532SoilMLVPESIVQQDAEKKRVKKGILGGSEERFIFGKTLYLPYLEFSYQYSTAKGFLSKQSVLAQGRSAVMALREVNLGFYPELISLLPQLADVESDSGSVMQGVDSTILVSERLDELRTMLSDYDKQLLELSKQHDSLTKTDRAKREIEENIDHLKKTREMRWKMFAD
Ga0209157_134920813300026537SoilQDAEKKRVKKGILGGSEEHFVFGKTLYLPYLDFTYRYSTEKGFLSKQSVLTQGRSAVMALREVNLGFYPELISLLPQLADIEPDSGSVVQGVDSTILVSERLDELKKMLSDYDKELLELSKQHDSLTKTDPAKQEIKENIDHLKKTRDTRWKMFADGLKLPSKVDLEKFEFLEGNLF
Ga0209648_1033547013300026551Grasslands SoilMIVRALPMLLPESLVQQDAEKKRVKKGILGGSEERFVFGKTIYLPYLEFTYQYSTEKGFLSKQNVLAQGRSAVMALREVNLGFYPELVSLLPQLIDIEPDSGSVVRGVDSTILVSERLEELKTMLSDYDGQLQELSKQYDSLTKTDRARQEIKENIDH
Ga0209388_108241313300027655Vadose Zone SoilMIVRALPMLVPESLVQQDAEKKRVKKGILGGSEERFVFGKTIYLPYLEFTYQYSTEKGFLSKQNVLVQGRSAVMALREVNLGFYPELVSLVPQLIDIEPDAGSVVRGVDSTILVTERLEELKTMLSDYDSQLQELSKQYESL
Ga0209180_1011057313300027846Vadose Zone SoilMSYQRDHYSEEIYRLNEGAAPRQVTIKALPMLVPESVVQQDAEKKRLKRGILGGSEERFVFGKILYLPYLDFTYQYSTERGFLSKQRIIGQGRSVVMALREVNLGFYPELTSLLPQLADVEPDSGSVVLGVDSTILVSERLDELRRMLSDFDHQLH
Ga0209701_1068551813300027862Vadose Zone SoilMLVPESVVQQDAEKKRLKKGILGGSEERFVFGKILYLPYLDFAYQYSSEKGFLSKQSVLSQGRSILMALREVNLGFYPELFSVVPQLVEMEPDPGSVVQGVDSTILVSERLNELKRMLSDYDHQLQELLKQRDSL
Ga0209590_1005378513300027882Vadose Zone SoilMEESTAPVHITIRALPMLVPESVVQRDAEKKRVKRGILGGSEERFVFGKTLYLPYLEFTYQYSSQKGFLSKQSVLAQGRSVVMALREVNLGFYPELISIMHQLADVEAESGSVVQGVDSTILVGERLDELKGMLSDYDHQLQELSKQRDSLTKRDSAKQEIAENIEHMKKTREMR
Ga0137415_1130357613300028536Vadose Zone SoilMLVPESLVRQDAEKKRVKRGMLGGSEERFVFGKILYLPYLEFTYQYSIEKGFLSKQSVLAQGKSTVMALREVNLGFYSELIALLPLVVDIEADSASVVQGVDSTILVSERLKELKTMLSDYDSQLQELSKQYDSLTKTDRARHEIKENLDHLKRTRQTRWKMFADGLKLPSKVNLEKFEL
Ga0307471_10030259613300032180Hardwood Forest SoilLIQVSALPMLVPEAVIQHEAEKKRIKKGILGGVEERFVFGKTVYLPYLDFTYQYTAEKGFLSKQSVLSQGRSVFMALREVNLGFYPEMISLLPQLAEIEPDPGSVVQGVDSTVLVSERLNELKTMLSDYDRQLGELSRQYDLLTKTDRARQEVKENIDHLKKTRETRWKMFADGLKLPSKMDLAKFELLEGNLFYIPF
Ga0307471_10039208613300032180Hardwood Forest SoilMLLPESVVQEDAEKKRVKRGMLGGSEERFVFGKILYLPYLEFTYQYSTEKGFLSKQNVLAQGRSAVMALREVNLGFYPELVSLLPQLAEVDANSGSVVQGVDSTILVSERLEELKTMLSDYDGQLRELSKQYDSLTKTDRARQEIKENIDHLKRTRETRWKMFADGLKLPSRIDLGKFEFLEGRLF
Ga0307471_10092469213300032180Hardwood Forest SoilMAEGAAPEHVIIRALPMLVPESVVQHDAEKKRVKKGILGGSEERFVFGKTLYLPHLEFTYQYSTEKGFLSKQSILAQGRSVVIALREVNLGFYPEMISLLPQLAEVEPDPDSVVQGVDSTVLVSERLNELKTILSDYDRQLGELSKQHDSFTKTDRAKQEVKENIDHLKKTRETRWKIFADGLKLPSKIDLKKF
Ga0307471_10331831213300032180Hardwood Forest SoilVKKGILGGSEERFVFGKTLYLPYLEFTYQYCNEKGFLSKQSVLAQGRSVVMALREINLGFYPELISIMPQLADVEPESGSVVQGVDSTILVSERLGELNQMLSDYDDQLQELSKQHDSFMKTDGARREIKENI




© Pavlopoulos Lab, Bioinformatics & Integrative Biology | B.S.R.C. "Alexander Fleming" | Privacy Notice