A new GBSC method for identifying, clustering and searching for similar short tandem repeats in protein sequences is now available. Its distinctive features are clustering and searching since to the date scientists mainly focused only on identifying these motifs. Unlike other methods for protein sequence clustering, it focuses on sharing repetitive patterns rather than statistical similarity. For more information see its corresponding page and GitHub. The problems of state-of-the-art clustering and searching method are detailed in this article.
For irregular regions with biased compositions, see also CB-Search method.
Leave a Reply