ECF105 ECF proteins

General description: Members of ECF105 are homologous to proteins from the original ECF105 (12.33%) and are present in Firmicutes from the order Bacillales (100%).

Anti-σ factors: As in the original group ECF105 (Staroń et al., 2009), proteins from ECF105 are regulated by TM-bound ZAS factors encoded in position +1. These anti-σ factors contain a single transmembrane helix (85.11%).

Genomic context conservation: The multiple-transmembrane proteins encoded in positions +2 and +3 of the original ECF105 (Staroń et al., 2009) were also found in the new ECF105 (usually 9 (34.04%) and 3 (57.46%) transmembrane helices, respectively). Other proteins conserved in the genetic context of members of ECF105 are enzymes in ECF105s1 involved in translation and amino acid metabolism: acetokinase, pyrroline-5-carboxylate reductase, tRNA synthetase class II and 3-hydroxyacyl-CoA dehydrogenase.

Promoter motif conservation: Predicted target promoter motifs are conserved. Two -35 elements contain TG(A/T)AGGG and the -10 element has CGTCTAT. The predicted target promoter motif extends what is known about the original ECF105 (Staroń et al., 2009), which lacks a conserved target promoter motif.
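
As an illustration of how the consensus elements above could be used to look for candidate target promoters, the following is a minimal Python sketch. The -35 pattern TG(A/T)AGGG and the -10 pattern CGTCTAT are taken from the text; the spacer length range (14 to 19 nt) and the function name are assumptions made for this example only and are not stated on this page.

```python
import re

# Consensus elements from the text above.
MINUS_35 = "TG[AT]AGGG"   # TG(A/T)AGGG
MINUS_10 = "CGTCTAT"


def find_promoter_candidates(seq, min_spacer=14, max_spacer=19):
    """Return (start, matched_sequence) for each -35/-10 pair found in seq.

    The spacer range is an assumption for illustration; it is not given
    in the group description.
    """
    # A lookahead is used so that overlapping candidates are all reported.
    pattern = re.compile(
        f"(?=({MINUS_35}[ACGT]{{{min_spacer},{max_spacer}}}{MINUS_10}))"
    )
    return [(m.start(), m.group(1)) for m in pattern.finditer(seq.upper())]
```

A call such as find_promoter_candidates(upstream_sequence) returns an empty list when no -35/-10 pair with a spacer in the assumed range is present. An exact-match scan like this is stricter than a position weight matrix search and will miss promoters that deviate from the consensus.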

Summary: ECF105 expands the number of proteins (from 4 in the original ECF105 (Staroń et al., 2009) to 312 in the new ECF105) and predicts a target promoter motif, but keeps the same genetic neighborhood as the original group ECF105.


Basic information

Number of representative ECFs: 382

Number of non-redundant ECFs: 527

Sequences with C-terminal extension: 0.00%

Sequences with N-terminal extension: 3.42%

Overrepresented class: Bacilli [67.80%]

Sample Neighborhood

Protein SDL63637.1 of Assembly GCA_900102965.1 (Paenibacillus jilunlii)

Promoter Motif


Protein sequence length distribution

Gene neighbourhood conservation analysis

Overall Pfam domain distribution: Cumulative frequency of Pfam domains across the genetic neighborhoods. Frequency is expressed as the number of Pfam domains per ECF sigma factor. Only domains present in more than 75% of the neighborhoods are shown. Genetic neighborhoods contain the proteins encoded within ±10 positions of the ECF coding sequence. Only the non-overlapping, highest-scoring domains are considered positive. If a protein contains several copies of a domain, only one instance is considered.

Pfam domain distribution per position: Frequency of Pfam domain architectures in the proteins encoded within ±10 positions (x-axis) of the ECF coding sequences. Frequency is expressed as the number of times a given domain architecture appears per ECF sigma factor. Only the highest-scoring domains with no positional overlap are considered in the domain architectures. Note that the order of the Pfam domains in a domain architecture may differ from the order in its name. When a protein contains several copies of a domain, only one instance is considered. Only domain architectures present in more than 20% of the proteins encoded in any position are shown.
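
The two frequency definitions above can be sketched in Python as follows. This is only one reading of the captions, not the pipeline that produced the plots. The input format is an assumption: each neighborhood is a dict mapping a relative position (-10 to +10) to the list of non-overlapping, highest-scoring Pfam domain names of the protein encoded there. Function names and the choice to order an architecture by first appearance of each domain are likewise hypothetical.

```python
from collections import Counter, defaultdict


def overall_domain_frequency(neighborhoods, min_fraction=0.75):
    """Domains per ECF, keeping domains found in > min_fraction of neighborhoods."""
    n_ecf = len(neighborhoods)
    total = Counter()     # proteins carrying the domain, summed over all neighborhoods
    present = Counter()   # neighborhoods in which the domain occurs at least once
    for hood in neighborhoods:
        seen = set()
        for domains in hood.values():
            unique = set(domains)   # several copies in one protein count once
            total.update(unique)
            seen |= unique
        present.update(seen)
    return {d: total[d] / n_ecf for d in total if present[d] / n_ecf > min_fraction}


def architecture_frequency_by_position(neighborhoods, min_fraction=0.20):
    """Architectures per ECF at each position, keeping those found in
    > min_fraction of the proteins encoded at that position."""
    n_ecf = len(neighborhoods)
    arch_counts = defaultdict(Counter)
    protein_counts = Counter()
    for hood in neighborhoods:
        for pos, domains in hood.items():
            protein_counts[pos] += 1
            arch = tuple(dict.fromkeys(domains))   # drop repeats, keep first-seen order
            if arch:
                arch_counts[pos][arch] += 1
    result = {}
    for pos, archs in arch_counts.items():
        kept = {a: c / n_ecf for a, c in archs.items()
                if c / protein_counts[pos] > min_fraction}
        if kept:
            result[pos] = kept
    return result
```

Both thresholds are applied as strict inequalities ("more than 75%", "more than 20%"), so a domain present in exactly three of four neighborhoods would not be reported by the first function.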

