SCREENING OF NUCLEOTIDE VARIATIONS IN GENOMIC SEQUENCES ENCODING CHARGED
PROTEIN REGIONS IN THE HUMAN GENOME
Najla Kharrat1*, Sabrine Belmabrouk1, Rania Abdelhedi1, Amine Ben Ayed1, Riadh Benmarzoug1 and Ahmed Rebai1
1 : Centre de Biotechnologie de Sfax, laboratoire des procédés de criblage moléculaire er cellulaires, POBOX « 1177 » 3018, Sfax, Tunisie.
* : Correspondance : firstname.lastname@example.org
Background: Studying genetic variation distribution in proteins containing charged regions, called charge clusters (CCs), is of great interest to unravel their functional role. Charge clusters are 20 to 75 residue segments with high net positive charge, high net negative charge, or high total charge relative to the overall charge composition of the protein. We previously developed a bioinformatics tool (FCCP) to detect charge clusters in proteomes and scanned the human proteome for the occurrence of CCs. In this paper we investigate the genetic variations in the human proteins harbouring CCs.
Results: We studied the coding regions of 317 positively charged clusters and 1020 negatively charged ones previously detected in human proteins. Results revealed that coding parts of CCs are richer in sequence variants than their corresponding genes, full mRNAs, and exonic + intronic sequences and that these variants are predominately rare (Minor allele frequency < 0.005). Furthermore, variants occurring in the coding parts of positively charged regions of proteins are more often pathogenic than those occurring in negatively charged ones. Classification of variants according to their types showed that substitution is the major type followed by Indels (Insertions-deletions). Concerning substitutions, it was found that within clusters of both charges, the charged amino acids were the greatest loser groups whereas polar residues were the greatest gainers.
Conclusions: Our findings highlight the prominent features of the human charged regions from the DNA up to the protein sequence which might provide potential clues to improve the current understanding of those charged regions and their implication in the emergence of diseases.