# Readme for files at http://interactions.gersteinlab.org/membranemetagenomics/DATA/

# These two files contain the results of the CCA analysis. Using the 0.3 cutoff, we have separated the 151 COGs into the variable fraction (structural correlations in dimensions 1 and 2 that fall outside the 0.3 circle) and the invariable fraction (inside the 0.3 circle).
cca_invariable_cog.txt
cca_variable_cogs.txt

# These two files contain the results of the PEN analysis. Using a dot product greater than 0.5, we list the positive relationships between the environmental features and the membrane protein families. Using a dot product less than -0.5, we list the negative relationships between the environmental features and the membrane protein families.
cca_negative_relationships.txt
cca_positive_relationships.txt

# This file contains the raw values at each site for dust, as described in the text.
dust_values.txt

# This file contains the raw values at each site for all 15 of the environmental features used.
environmental_features.txt

# This file contains the percentage values used for the 16S diversity at each site, as described in the text.
sites_16S_diversity.txt

# This file contains the normalized (by total protein content at a site) values for each of the 151 membrane protein families (COGs) at each of the 19 sites.
membrane_protein_families.txt

# This file contains the list of the 151 membrane protein families used and their COG descriptions.
membrane_protein_family_info.txt
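
# The two cutoffs described above can be illustrated with a short Python sketch. This is not the code used to produce these files: the function names, the handling of values that lie exactly on a cutoff, and the example numbers are all assumptions made for illustration, and nothing is assumed about the column layout of the data files.

```python
import math

CCA_CUTOFF = 0.3  # radius of the circle in dimensions 1 and 2
PEN_CUTOFF = 0.5  # absolute dot-product threshold


def classify_cog(dim1, dim2, cutoff=CCA_CUTOFF):
    """Label a COG by where its (dim1, dim2) correlations fall.

    Points strictly outside the circle are 'variable'; points inside
    (and, by assumption, exactly on the circle) are 'invariable'.
    """
    radius = math.hypot(dim1, dim2)  # distance from the origin
    return "variable" if radius > cutoff else "invariable"


def classify_relationship(dot_product, cutoff=PEN_CUTOFF):
    """Label a feature/family pair by its dot product.

    Returns 'positive' above +cutoff, 'negative' below -cutoff,
    and None for pairs that would not be listed in either file.
    """
    if dot_product > cutoff:
        return "positive"
    if dot_product < -cutoff:
        return "negative"
    return None


if __name__ == "__main__":
    # Made-up example values, not taken from the data files.
    print(classify_cog(0.4, 0.1))
    print(classify_cog(0.1, 0.2))
    print(classify_relationship(0.7))
    print(classify_relationship(-0.6))
    print(classify_relationship(0.2))
```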