In silico Analysis of Human Telomerase Reverse Transcriptase ( hTERT ) Gene: Identification of a Distant Homolog of Melanoma Antigen Family Gene (MAGE)
Melanoma antigen family (MAGE) genes are widely expressed in various tumor types but silent in normal cells except germ-line cells lacking human leukocyte antigen (HLA) expression. Over 25 MAGE genes have been identified in different tissues, mostly located in Xq28 of human chromosome and some of them in chromosome 3 and 15, containing either single or multiple-exons. This in silico study predicted the genes on hTERT location and identified a distant relative of MAGE gene located on chromosome 5. The study identified a single exon coding ∼850 residues polypeptide sharing ∼30% homology with Macfa-MAGE E1 and hMAGE-E1. dbEST search of the predicted transcript matches 5′ and 3′ flanking ESTs. The predicted protein showed sequence homology within the MAGE homology domain 2 (MHD2). UCSC genome annotation of CpG Island around the coding region reveals that this gene could be silent by methylation. Affymetrix all-exon track indicates the gene could be expressed in different tissues particularly in cancer cells as they widely undergo a genome wide demethylation process.
Human telomerase, a cellular reverse transcriptase, is a ribonucleoprotein enzyme that catalyzes the synthesis and extension of telomeric DNA.1 Telomerase activity appears to be associated with cell immortalization and malignant progression.2 Usually human telomerase is found in hemopoietic and in germ cells but not in normal somatic cells. Active telomerase is one of the key factors that enable malignant cells to proliferate indefinitely.3 However, molecular mechanisms triggering various telomerase activities still remain elusive.
In this post-genomic era, in silico techniques for gene finding process or identifying the location of the protein-coding regions (ORF), within uncharacterized genomic DNA sequences, constitute a central issue in the field of bioinformatics4 and are of much interest to biologists. A number of computational techniques for the prediction of distinctive features of protein-coding regions have been proposed along with the standard molecular methods. In general, the two main approaches of structural gene prediction are intrinsic (based on statistical properties of exons, splice sites, and other signals) and extrinsic (based on homology with known genes).5
In this study, the human chromosome 5p13.1–p15.33 region containing the telomerase (hTERT) gene was investigated by using both intrinsic and extrinsic method of gene prediction. In addition to hTERT ORF, two additional ORFs named gene2 and gene3 were identified. Interestingly, predicted gene2 revealed significant sequence homology with human tumor specific antigen, melanoma antigen family gene (MAGE), E1.
Materials and Methods
Complete sequence of human telomerase reverse transcriptase (hTERT) was retrieved from NCBI (gi:82399156, Accession no. DQ264729.1).
In Silico Identification of ORFs
The coding sequences of hTERT were identified using NCBI's ORF Finder (http://www.ncbi.nlm.nih.gov/gorf). The sequence (DQ264729.1) was further analyzed using various ab initio gene finding programs (GENSCAN, FGENESH and AUGUSTUS) and by comparative gene prediction (TWINSCAN) methods. The genomic location of the hTERT was studied using the UCSC Genome browser (http://genome.ucsc.edu/cgibin/hgGateway). This browser was also used to identify the CpG Island track and EST's around the predicted ORF. The 5′-UTR of the predicted transcription start site (TSS), the start codon and the 3′-UTR, 1000 bp downstream from the stop codon were searched using BLASTn, against the EST database (dbEST).
Homology Study of the Predicted ORFs
The homologous sequences of the predicted genes were identified from successive iterations using PSI-BLAST. Multiple sequences were aligned using ClustalW (1.83) (http://www.ebi.ac.uk/tools/clustalw). Secondary structure of the predicted gene2 was analyzed using the program Hierarchical Neural Network (HNN: http://www.expasy.org/tools/). The repeated pattern motifs were analyzed using the program Rapid Automatic Detection and Alignment of Repeats RADAR (http://ebi.ac.uk/radar/).
Comparative Genomics Analysis of the Predicted Gene2
Global alignment of the coding sequence of gene2 with Chimpanzee (Pan troglodytes) and Orangutan (Pongo pygmaeus-abelii) genomic sequence was performed with the program AVID using a window size of 100 bp and a conservation level of 70%. Results were viewed with the program VISTA.6 Finally, the nucleotide sequence of the Chimpanzee TERT (Pan troglodytes chromosome 5 genomic contig, reference assembly, Accession no. NW_001235370, region: 211936–253254) was further analysed by using GENSCAN to confirm the presence of conserved gene2 in Chimpanzee genome.
Prediction of the Function of the Gene2
The function of gene2 was predicted using two protein function prediction program PFP7 and SVMProt.8 For comparative analysis, function of MAGE-E1 was also predicted using these two programs.
Result and Discussion
Table 1 showed the gene prediction analysis of hTERT by different programs. GENSCAN9 had predicted 4 genes within the same hTERT location. Interestingly, apart from the known hTERT splice variant (gene1 and gene4 on the +strand), this program also predicted two additional genes in the reverse strand, namely gene2 (9155–5064 bp) and gene3 (16386–12458 bp) both of which consists of 3 exons. AUGUSTUS10 also identified two genes in the reverse strand at a slightly different location- gene2 (7869–5617 bp) consists of a single exon whereas gene3 (14225–12469 bp) consists of two exons. The predicted genes in the reverse strand were confirmed by the FGENESH and ORF Finder. Homology-based program TWINSCAN11 also predicted two reverse strand genes in the same way as FGENESH and ORF Finder (Table 1).
Summary of the gene prediction analysis result of gene2 and gene3.
Gene prediction Program
Predicted Genes
Strand
Features
Start
End
GENSCAN
gene2
Complement
Promoter
9515
9476
Initial exon
9155
9102
Internal exon
7861
5671
Terminal exon
5518
5064
Poly A site
4718
4713
gene3
Complement
promoter
16734
16116
Initial exon
16386
16116
Internal exon
14465
13849
Terminal exon
13831
12458
Poly A site
9749
9744
AUGUSTUS
gene2
Complement
CDS
7869
5617
gene3
Complement
CDS-1
14225
13860
CDS-2
13689
12469
FGENESH
gene2
Complement
TSS
9618
9618
Exon-1
7858
5606
Poly A site
4723
4723
gene3
Complement
TSS
16699
16699
Exon-1
16195
16116
Exon-2
13907
12458
Poly A site
11428
11428
TWINSCAN
gene2
Complement
Exon-1
7858
5606
gene3
Complement
Initial exon
16386
16116
Internal exon
14465
13849
Terminal exon
13831
12458
ORF Finder
gene2
Complement
ORF in Frame-2
7858
5606
gene3
Complement
ORF in Frame-3
13122
12106
In UCSC genome browser, two different gene prediction tracks NSCAN and GeneID also predicted the location of the gene2 and gene3 (Fig. 1). Tissue-specific expression pattern of the predicted genes was hypothesized by observing Affymetrix all-exon track. The genome browser also predicted the CpG island around the predicted genes. A good number of ESTs were identified at the 5′ and 3′ flanking regions for both the predicted gene2 and gene3. The predicted ESTs showed different expression profiles in different tissue types. Interestingly for the predicted gene2 it has been found that in some tissue types the gene is expressed in developmental stages and in others expressed in different cancer cell lines (Fig. 1). Similar patterns were also observed for the predicted gene3 and revealed that it shares good 5′ flanking region (∼99% identity) with the clone collection from IMAGE cDNA, mRNA database sequences (Supplementary data).
UCSC Genome Browser analysis showing the location of the predicted genes. Affymetrix All Exon Chip-Array tracks are used for tissue specific gene expression. CpG Island is also observed around the predicted gene.
VISTA plot of the AVID alignment (Fig. 2) indicated that majority of the region in gene2 is highly conserved (>92% identity) in Chimpanzee and Orangutan genome. No larger repeats (LINE, SINE, and LTR) were observed within that region. Further GENSCAN analysis of the corresponding TERT region on Chimpanzee chromosome 5 indicated that this region also contain a putative gene encoding 748 amino acids protein which is very much similar (82% identity and 84% similarity) to the predicted gene2 of hTERT region (Supplementary data). Identification of the blocks of genes with evolutionary conserved order in multiple genomes is an important issue in comparative genomics. These synteny blocks help in tracing back the evolution of genomes in terms of rearrangement event. Presence of conserved blocks of genes in multiple genomes may indicate functional relatedness of their products or presence of functionally important conserved non-coding regions.12 Through comparative genomics analysis of our predicted gene2, we have found that this gene is conserved in different genomes in terms of gene order.
Conservation of gene2 in Chimpanzee (Pan troglodytes) and orangutan (Pongo pygmaeus-abelii) on VISTA browser.
BLAST results indicated that gene2 shared some homology (∼38%—41%) with the MAGE E1, a member of type II melanoma antigen family (Table 2). MAGE family is a large family which comprises over 25 members identified in human. Most members of the MAGE family are clustered at the Xq28 region of human chromosome.13 The overall structure of MAGE-E1, and Macfa MAGE-E1 are larger proteins with extended N or C-termini. The N-terminal domain of MAGE-E1, contains a loosely conserved region of ∼220 amino acids, termed MHD2 domain.14 Global alignment in ClustalW showed loosely conserved regions for both amino-terminal and carboxy-terminal domain of gene2 with human MAGE family E1 (accession no.AAH50588.1) and MAGE1_Macfa, Melanoma associated antigenE1 (accession no.Q9BE18) (Fig. 3).
Summary of the homologous sequences of predicted gene2.
GI no.
Accession No.
Name of the protein
Length (aa)
Identity (%)
Similarity (%)
E-value
67604778
XP_666642.1
Cell surface protein that may regulate cell wall beta-glucan synthesis and bud site selection [Cryptosporidium hominis TU502]
RTX toxins and related Ca2+-binding protein [Bacillus sp. NRRL B-14911]
1415
23.33
38.0
1e–04
29792032
AAH50588.1
Melanoma antigen family E, 1 [Homo sapiens]
957
28.0
41.67
6e–04
118716717
ZP_01569254.1
Hemagluttinin domain protein [Burkholderia multivorans ATCC 17616]
1487
25.4
50.0
0.004
Global alignment of gene2 with MAGE homology domain (MHD).
Identification of the novel members of a large protein family is very difficult as the similarity searching programs are designed to highlight the most similar sequences. As a result, about 5% of the novel protein family members may remain unrecognized.15 In an example by Retief et al,15 the large family of known glutathione transferase proteins was first subjected to multiple sequence alignment, and a phylogenetic tree was made by distance methods to identify classes of proteins within the family. These proteins represented a broad range of phylogenetic context and included classes with sometimes less than 20% identity.16 Thus, in spite of having ∼40% sequence homology with MAGE-E1, predicted gene2 may be a novel member of melanoma antigen family (MAGE).
Expression pattern of different members of the MAGE family genes are different and tissue specific, some encodes tumor specific antigens and some are expressed in normal cells.17,18 Besides, co-expression pattern of some tumour associated antigen (TAA) including 5 MAGE-A genes with human telomerase reverse transcriptase was observed in non-small cell lung carcinoma (NSCLC) such as adenocarcinoma, squamous cell carcinoma and bronchiolocarinomas.19 5′ flanking EST analysis of predicted gene2 revealed that most of the ESTs are expressed in several cancer cells. As human telomerase activity is observed only in germline and cancer cells2 and its co-expression patterns was observed with some MAGE gene, our predicted gene2 may be co-expressed with telomerase in cancer cells.
Transcriptional regulation of MAGE gene family is dependent on various factors. Promoter methylation is one of these factors. Methylated CpG point in the promoter region is responsible for transcriptional silencing of the MAGE gene.20 Demethylation of the MAGE1 promoter appears to be sufficient to activate this gene in tumor cell lines.21 From UCSC genome browser, we have observed the presence of CpG island around our predicted gene2. That's why, it can be inferred that the hypermethylation of this CpG island is responsible for silencing of gene2 in normal cells and this gene may be activated through transformation dependent loss of DNA methylation in cancer cells.22–24
Repeated insertion appears to have played a major role during evolution of MAGE family. For instance, the long C-terminal domain of MAGE-D3 was most probably formed by serial duplications of decapeptide repeats. Also the N-terminal domains of MAGE-C1 and MAGE-D1, are highly repetitive, must have undergone sequential duplication events.13,25 29 repeated short peptides containing ∼12 amino acids were identified in gene2 protein by using the program RADAR26 (Table 3). Although, function of these repeats is unknown, it revealed that gene2 may be evolved from repeated insertion as found in MAGE family. To compare the repetitive pattern of gene2 with MAGE family protein, we have analyzed the repeats of MAGE-E1, MAGE-D1 and MAGE-D3 proteins (supplementary data). It was found that the repeats in these MAGE family members are not identical and in some cases several variations in amino acids are found. The pattern of MAGE-D1 repeat is WQX-PXX14 which is completely different from MAGE-D and MAGE-E1. However, these MAGE family members are univocal in that sense that they all contain repeats (may be identical or different). On the other hand, our predicted gene2 contains TPG repeat which is found at several position of MAGE-E1.
Repeated sequence motif of the predicted gene2 analyzed by RADAR.
Location of Repeats
Score
Repeat Sequence
36–
63
39.31/ 9.54
ASSSEQTPRTPGDRasLTLHETGTPRDS
64–
91
59.78/18.17
ASLTLHLRGTPGGRnsLTLHVTGTPGDG
92–
119
58.16/17.49
ASLTLHVTGTPGDGasLTLHLRGTPRDG
120–
147
59.97/18.25
ASLTLHVTETPGDSasLTLHVTGTPGDG
162–
189
65.43/20.55
ASLTLHVTGTPGDGasLTLHVTGTPGDG
190–
217
65.43/20.55
ASLTLHVTGTPGDGasLTLHVTGTPGDG
218–
245
58.50/17.63
ASLTLHVTGTRGDGasLTLHVTGTPGGL
246–
273
62.53/19.33
ASITLHVTGTPGDGasLTLHVTGTPGDS
274–
301
60.22/18.35
ASFTLHVTGTPGDRasLTLNVTGTPGDS
302–
329
60.27/18.38
ASLTLQVTGTPGGRasLTLHVTGTPGGR
330–
357
63.51/19.74
ASLTLHVTGTPGGRasLTLHVTGTPGGR
358–
385
63.51/19.74
ASLTLHVTGTPGGRasLTLHVTGTPGGR
386–
413
64.64/20.22
ASLTLHVTGTPGGRasLTLHVTGTPGDS
428–
455
62.51/19.32
ASLTLHVTGTPGGRasLTLNVTGTPGDG
456–
483
65.43/20.55
ASLTLHVTGTPGDGtsLTLHVTGTPGDG
484–
511
58.50/17.63
ASLTLHVTGTRGDGasLTLHVTGTPGGL
512–
539
61.23/18.78
ASITLHVTGTPGDSasLTLHVTGTPGDG
540–
567
62.44/19.29
ASLTLHVTGTPGDRasLTLNVTGTPGDS
568–
595
63.82/19.87
ASLTLHGTGTPGDRasLTLHVTGTPGDR
596–
623
65.09/20.41
ASLTLHVTGTPGDRasLTLHVTGTPGDR
624–
651
63.51/19.74
ASLTLHVTGTPGGRasLTLHVTGTPGGR
652–
679
65.25/20.48
ASLTLHVTGTPGGRasLTLHVTGTPGDG
680–
707
62.29/19.23
ASLTLHGTGTPGDRasLTLHGTGTPGDS
708–
735
65.18/20.45
ASLTLHVTGTPGDRasLTLHVTGTPGDS
750–
768
34.16/ 7.36
………TPGDRasLTLHVTGTPGDS
783–
810
60.00/18.26
ASLTLHVTGTPGDGasLTLHLRGTPGDS
811–
838
65.18/20.45
ASLTLHVTGTPGDRasLTLHVTGTPGDS
839–
866
55.89/16.53
ASLTLHGTGTPGGHtsITLHGTGTPGGR
867–
894
61.63/18.95
ASLTLHVTGTPGGNasLTLHVTGTPGDH
The predicted secondary structure of gene2 showed more extensive β-strands (∼44%) and coil region (∼55%) distributed along the sequence (Fig. 4). From the secondary structure of entire MAGE-E1, it was found that N-terminal MHD2 domain contains extensive coil and some β-stranded region but C-terminal MHD1 domain contains more α-helical regions along with some β-stranded regions. We have further analyzed the secondary structure pattern of repeats on MAGE-E1, MAGE-D1 and MAGE-D3 (supplementary data). Overall secondary structure pattern within the repeat region of these three family members was coil followed by β-strands but no α-helices which are very much likely to the secondary structure pattern of gene2 repeats although the amino acid composition of the repeats is slightly different.
Secondary Structure Prediction of gene2 using HNN.
To understand the functional association between gene2 and MAGE-E1, we have analyzed the protein function prediction result using two programs SVMProt and PFP. SVMProt results indicated that both gene2 and MAGE-E1 belong to Zinc-binding protein family (Table 4). PFP result classified the function of protein according to the GO annotation categories. In the biological process categories, MAGE-E1 belongs to glia cell migration, nerve growth factor receptor signaling pathway, neuronal migration, brain development etc (Table 5) and gene2 belongs to neurogenesis whose specific outcome is the progression of nervous tissue over time, from its formation to its mature state which is very much likely to MAGE-E1. In the molecular function categories, gene2 matches with MAGE-E1 in exo-alpha-sialidase activity and inositol-polyphosphate 5-phosphatase activity.
Comparative function prediction of MAGE-E1 and gene2 using SVMprot.
MAGE-E1
Gene2
Function
P Value (%)
Function
P Value (%)
Zinc-binding
73.8
Zinc-binding
58.6
Nuclear receptors
65.4
Metal-binding
58.6
EC 3.4 hydrolases—Acting on peptide bonds (peptidases)
58.6
Calcium-binding
58.6
TC 1.B.
Channels/pores—Beta-Barrel
porins
58.6
Comparative function prediction of MAGE-E1 and Gene2 using PFP.
MAGE-E1
Gene2
Biological process
Score
Biological process
score
GO.0008347 glia cell migration
2907203.66
GO.0009405 pathogenesis
760364.22
GO.0048011 nerve growth factor receptor
1164048.92
GO.0008380 RNA splicing
539421.38
signaling pathway
GO.0019233 perception of pain
961041.64
GO.0000398 nuclear mRNA splicing, via spliceosome
380866.18
GO.0001764 neuronal migration
868699.27
GO.0007520 myoblast fusion
365931.67
GO.0042060 wound healing
500313.41
GO.0007399 neurogenesis
222373.08
GO.0000074 regulation of cell cycle
473884.91
GO.0006512 ubiquitin cycle
205861.13
GO.0001558 regulation of cell growth
367704.56
GO.0016574 histone ubiquitination
205248.68
GO.0007585 respiratory gaseous exchange
316119.23
GO.0006816 calcium ion transport
187647.36
GO.0009062 fatty acid catabolism
263724.05
GO.0008104 protein localization
177700.20
GO.0007420 brain development
263045.24
GO.0042110 T-cell activation
171905.85
Molecular function
score
Molecular function
score
GO.0043015 gamma-tubulin binding
550081.62
GO.0004308 exo-alpha-sialidase activity
285616.06
GO.0016798 hydrolase activity, acting on glycosyl bonds
In the cellular component categories, both gene2 and MAGE-E1 belong to dendrite, dystrophin-associated glycoprotein complex, actin cytoskeleton and nuclear region. Some differences in the prediction categories between gene2 and MAGE-E1 are observed. This may be due to the limited functional analysis on MAGE family protein and still now we don't know enough about the expression and function of MAGE family proteins.14 From these analyses, it can be inferred that as a distant member, there are some functional alliance of gene2 with MAGE family protein.
However, low amino acid sequence homology was found for gene3 protein. But it showed high homology (99% identity) with some human ESTs (Supplementary data). For this reason, we are predicting that this gene is a novel one or it may have functions in regulation rather than coding.
Conclusion
Even though these predicted genes should be further characterized by laboratory means before their existence can be conclusively affirmed, the results presented in this study suggested and identified the location of a structurally similar gene of MAGE family on human chromosome 5. The findings can provide new insight in transcriptional activation of novel genes during malignant melanoma.
Disclosures
The authors report no conflicts of interest.
References
1.
AutexierC., LueN.F.The structure and function of telomerase reverse transcriptase.Annu Rev Biochem.2006; 75: 493–517.
2.
LeeB.J., WangS.G., ChoiJ.S., LeeJ.C., GohE.K., KimM.G.The prognostic value of telomerase expression in peripheral blood mononuclear cells of head and neck cancer patients.Am J Clin Oncol.2006; 29: 163–7.
3.
KimN.W., PiatyszekM.A., ProwseK.R.Specific association of human telomerase activity with immortal cells and cancer.Science.1994; 266: 2011–5.
4.
MatheC., SagotM.F., SchiexT., RouzeP.Current methods of gene prediction, their strengths and weaknesses.Nucleic Acids Res.2002; 30: 4103–17.
5.
BaxevanisA.D., OuelletteB.F.F.Bioinformatics: a practical guide to the analysis of genes and proteins.3rd edition, John Wiley & Sons, Inc.2005.
6.
MayorC., BrudnoM., SchwartzJ.R.VISTA: visualizing global DNA sequence alignments of arbitrary length.Bioinformatics.2000; 16: 1046–7.
7.
HawkinsT., LubanS., KiharaD.Enhanced automated function prediction using distantly related sequences and contextual association by PFP.Protein Sci.2006; 15(6): 1550–6.
8.
CaiC.Z., HanL.Y., JiZ.L., ChenX., ChenY.Z.SVMProt: Web-based support vector machine software for functional classification of a protein from its primary sequence.Nucleic Acids Res.2003; 1;31(13): 392–7.
9.
BurgeC., KarlinS.Prediction of complete gene structures in human genomic DNA.J Mol Biol.1997; 268: 78–94.
10.
StankeM., SteinkampR., WaackS., MorgensternB.AUGUSTUS: a web server for gene finding in eukaryotes.Nucleic Acids Res.2004; 32: W309–12.
FrazerK.A., ElnitskiL., ChurchD.M., DubchakI., HardisonR.C.Cross-species comparisons: a review of methods and available resources.Genome Res.2003; 13: 1–12.
13.
RognerU.C., WilkeK., SteckE., KornB., PoustkaA.The Melanoma antigen gene (MAGE) family is clustered in the chromosomal band Xq28.Genomics.1995; 29: 725–31.
14.
BarkerP.A., SalehiA.The MAGE proteins: emerging roles in cell cycle progression, apoptosis, and neurogenetic disease.J Neurosci Res.2002; 15; 67(6): 705–12.
15.
RetiefJ.D., LynchR.K., PearsonW.R.Panning for genes-A visual strategy for identifying novel gene orthologs and paralogs.Genome Res.1999; 9: 373–82.
16.
MountD.W.Bioinformatics: Sequence and Genome Analysis.2nd Edition.Cold Spring Harbor Laboratory Press; 2004. p. 506.
17.
ChomezP., De BackerO., BertrandM., De PlaenF., BoonT., LucasS.An overview of the MAGE gene family with the identification of all human members of the family.Cancer Res.2001; 61: 5544–51.
18.
LucasS., BrasseurF., BoonT.A new MAGE gene with ubiquitous expression does not code for known MAGE antigens recognized by T cells.Cancer Res.1999; 59: 4100–3.
19.
KaranikasV., TsochasS., BoukasK.Co-expression patterns of tumor-associated antigen genes by non-small cell lung carcinomas: implications for immunotherapy.Cancer Biol Ther.2008; 7(3): 345–52.
20.
WischnewskiF., FrieseO., PantelK., SchwarzenbachH.Methyl-CpG binding domain proteins and their involvement in the regulation of the MAGE-A1, MAGE-A2, MAGE-A3, and MAGE-A12 gene promoters.Mol Cancer Res.2007; 5(7): 749–59.
21.
De SmetC., De BackerO., FaraoniI., LurquinC., BrasseurF., BoonT.The activation of human gene MAGE-1 in tumor cells is correlated with genome-wide demethylation.Proc Natl Acad Sci U S A.1996; 93: 7149–53.
22.
WeberJ., SalgallerM., SamidD.Expression of the MAGE-1 tumor antigen is up-regulated by the demethylating agent 5-aza-2_-deoxycytidine.Cancer Res.1994; 54: 1766–71.
23.
SerranoA., GarciaA., AbrilE., GarridoF., Ruiz-CabelloF.Methylated CpG points identified within MAGE-1 promoter are involved in gene repression.Int J Cancer.1996; 68: 464–70.
24.
De SmetC., LurquinC., LetheB., MartelangeV., BoonT.DNA methylation is the primary silencing mechanism for a set of germ line- and tumor-specific genes with a CpG-rich promoter.Mol Cell Biol.1999; 19: 7327–35.
25.
SuA.I., WiltshireT., BatalovS.A gene atlas of the mouse and human protein encoding transcriptomes.PNAS.2004; 101(16): 6062–7.
26.
HegerA., HolmL.Rapid automatic detection and alignment of repeats in protein sequences.Proteins.2000; 41(2): 224–37.