Overall Pfam domain distribution:
Cumulative frequency of Pfam domains across the genetic neighborhoods. Frequency is expressed as number of Pfam domains per ECF sigma factor. Only domains present in more than 75% of the neighborhoods are shown. Genetic neighborhoods contain the proteins encoded in ±10 from the ECF coding sequence. Only the non-overlapping, highest scoring domains are considered positive. If a protein contains several copies of a domain, only one instance is further considered. In order to avoid sequence bias, only proteins from assemblies defined as "representative" or "reference" by NCBI are included (see
https://www.ncbi.nlm.nih.gov/assembly/help/).