Genomics
Sequence Alignment: Complete Coverage
S. Prasanth Kumar
Dept. of Bioinformatics, Applied Botany Centre (ABC), Gujarat University, Ahmedabad, INDIA
Contact: prasanthbioinformatics@gmail.com
Suppose there are two sequences X and Z to be aligned, where |X| = m and |Z| = n. If gaps are allowed in the sequences, then each gapped sequence can be up to m+n characters long.
2^(m+n) subsequences with spaces for the sequence X
2^(m+n) subsequences with spaces for the sequence Z
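To get a feel for how quickly this search space grows, here is a minimal Python sketch. The function name is hypothetical, and pairing every gapped version of X with every gapped version of Z is an assumption about how a brute-force search would proceed, not something stated on the slide.

```python
def gapped_candidates(m: int, n: int) -> int:
    """Number of gapped versions of one sequence, using the 2^(m+n) count."""
    return 2 ** (m + n)


for m, n in [(5, 5), (10, 10), (50, 50)]:
    per_sequence = gapped_candidates(m, n)
    # Assumption: brute force compares every gapped X with every gapped Z,
    # so the number of comparisons is the square of the per-sequence count.
    print(m, n, per_sequence, per_sequence ** 2)
```

Even for two sequences of length 50 the count is far beyond what can be enumerated, which is the motivation for dynamic programming.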
Dynamic programming (DP) aligns two sequences by beginning at the ends of the two sequences and attempting to align all possible pairs of characters (one from each sequence) using a scoring scheme for matches, mismatches, and gaps. The highest-scoring set of aligned pairs defines the optimal alignment between the two sequences.
DP algorithms solve optimization problems by dividing the problem into smaller, overlapping subproblems, solving each subproblem once, and reusing the stored results.
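The benefit of solving each subproblem only once can be sketched by counting visits. In the sketch below, a subproblem is a pair (i, j) of prefix lengths, and each one depends on (i-1, j-1), (i-1, j), and (i, j-1). That dependency pattern is the standard one for pairwise alignment and is an assumption here, as is the function name.

```python
def count_subproblem_visits(m: int, n: int, reuse: bool) -> int:
    """Count how many times prefix-pair subproblems are visited."""
    visits = 0
    solved = set()

    def visit(i: int, j: int) -> None:
        nonlocal visits
        if reuse and (i, j) in solved:
            return  # result already stored, nothing to recompute
        visits += 1
        solved.add((i, j))
        if i == 0 or j == 0:
            return  # border cells have no dependencies
        visit(i - 1, j - 1)
        visit(i - 1, j)
        visit(i, j - 1)

    visit(m, n)
    return visits


print(count_subproblem_visits(3, 3, reuse=False))  # plain recursion
print(count_subproblem_visits(3, 3, reuse=True))   # each cell solved once
```

With reuse, the number of visits equals the number of cells in an (m+1) by (n+1) matrix, which is why the DP matrix is the natural data structure.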
Optimal alignment of two sequences
Dynamic Programming Matrix
s(ai, bj) = +5 if ai = bj (match score)
s(ai, bj) = -3 if ai ≠ bj (mismatch score)
w = -4 (gap penalty)
• Initialization
• Matrix Fill (scoring)
• Traceback (alignment)
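The scoring scheme can be written down directly in Python. This is a minimal sketch; the constant names are illustrative, while the values are the ones given in this deck.

```python
MATCH = 5      # s(ai, bj) when ai == bj
MISMATCH = -3  # s(ai, bj) when ai != bj
GAP = -4       # w, the penalty for aligning a character with a gap


def s(a: str, b: str) -> int:
    """Score for aligning character a with character b."""
    return MATCH if a == b else MISMATCH


print(s("A", "A"), s("A", "G"), GAP)
```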
Global Alignment: Needleman-Wunsch Algorithm
Initialization Step
Each row: S(i,0) is set to w * i
Each column: S(0,j) is set to w * j
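A minimal sketch of this initialization, assuming the matrix is stored as a list of lists with one extra row and column for the empty prefix (the function name is hypothetical):

```python
GAP = -4  # w, the gap penalty used in this deck


def init_global(m: int, n: int, w: int = GAP) -> list:
    """Build an (m+1) x (n+1) matrix with the global-alignment borders."""
    S = [[0] * (n + 1) for _ in range(m + 1)]
    for i in range(m + 1):
        S[i][0] = w * i  # first column: i leading gaps
    for j in range(n + 1):
        S[0][j] = w * j  # first row: j leading gaps
    return S


for row in init_global(2, 3):
    print(row)
```

The interior cells are left at 0 here only as placeholders; they are overwritten in the matrix fill step.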
Traceback is easy: find the lowermost right corner and follow the arrows back toward the top-left corner.
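The three steps (initialization, matrix fill, traceback) can be put together as follows. This is a sketch under assumptions: the fill rule is the standard Needleman-Wunsch recurrence (best of diagonal, up, and left), which the text does not spell out, and ties in the traceback are broken in favor of the diagonal. The example sequences are made up for illustration.

```python
MATCH, MISMATCH, GAP = 5, -3, -4  # scoring scheme from this deck


def needleman_wunsch(x: str, z: str):
    """Return (score, aligned_x, aligned_z) for one optimal global alignment."""
    m, n = len(x), len(z)
    # Initialization: borders hold cumulative gap penalties.
    S = [[0] * (n + 1) for _ in range(m + 1)]
    for i in range(1, m + 1):
        S[i][0] = GAP * i
    for j in range(1, n + 1):
        S[0][j] = GAP * j
    # Matrix fill: best of diagonal (match/mismatch), up (gap), left (gap).
    for i in range(1, m + 1):
        for j in range(1, n + 1):
            pair = MATCH if x[i - 1] == z[j - 1] else MISMATCH
            S[i][j] = max(S[i - 1][j - 1] + pair,
                          S[i - 1][j] + GAP,
                          S[i][j - 1] + GAP)
    # Traceback: start at the bottom-right cell and walk back to (0, 0).
    ax, az = [], []
    i, j = m, n
    while i > 0 or j > 0:
        if i > 0 and j > 0:
            pair = MATCH if x[i - 1] == z[j - 1] else MISMATCH
            if S[i][j] == S[i - 1][j - 1] + pair:
                ax.append(x[i - 1])
                az.append(z[j - 1])
                i, j = i - 1, j - 1
                continue
        if i > 0 and S[i][j] == S[i - 1][j] + GAP:
            ax.append(x[i - 1])  # character of x against a gap
            az.append("-")
            i -= 1
        else:
            ax.append("-")       # gap against a character of z
            az.append(z[j - 1])
            j -= 1
    return S[m][n], "".join(reversed(ax)), "".join(reversed(az))


print(needleman_wunsch("ACTG", "ACG"))
```

Other tie-breaking orders can return a different alignment with the same optimal score.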
5 – 3 + 5 – 4 + 5 + 5 – 4 + 5 – 4 – 4 + 5 = 11
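The sum above adds one score per alignment column. A minimal sketch of that bookkeeping follows; the list of column scores is copied from the line above, while the function name and the short gapped strings are hypothetical, since the slide's actual sequences are not part of this text.

```python
MATCH, MISMATCH, GAP = 5, -3, -4  # scoring scheme from this deck


def alignment_score(row_x: str, row_z: str) -> int:
    """Sum the column scores of two equal-length gapped strings."""
    if len(row_x) != len(row_z):
        raise ValueError("aligned rows must have the same length")
    total = 0
    for a, b in zip(row_x, row_z):
        if a == "-" or b == "-":
            total += GAP
        elif a == b:
            total += MATCH
        else:
            total += MISMATCH
    return total


# Column scores exactly as listed on the slide.
slide_columns = [5, -3, 5, -4, 5, 5, -4, 5, -4, -4, 5]
print(sum(slide_columns))

# Hypothetical short alignment, scored column by column.
print(alignment_score("ACTG", "AC-G"))
```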
Local Alignment: Smith-Waterman Algorithm
Initialization Step
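Each row: S(i,0) is set to 0
Each column: S(0,j) is set to 0
The matrix fill uses the same scoring rule, except that Smith-Waterman resets any negative cell score to 0. The initialization is different, and the traceback needs attention.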
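A minimal sketch of the local-alignment fill, assuming the standard Smith-Waterman recurrence in which each cell takes the best of 0, diagonal, up, and left. The zero floor is the standard rule rather than something stated on the slide, and the function name and sequences are illustrative.

```python
MATCH, MISMATCH, GAP = 5, -3, -4  # scoring scheme from this deck


def smith_waterman_fill(x: str, z: str) -> list:
    """Fill the local-alignment matrix; first row and column stay 0."""
    m, n = len(x), len(z)
    S = [[0] * (n + 1) for _ in range(m + 1)]
    for i in range(1, m + 1):
        for j in range(1, n + 1):
            pair = MATCH if x[i - 1] == z[j - 1] else MISMATCH
            S[i][j] = max(0,                       # start a new local alignment
                          S[i - 1][j - 1] + pair,  # diagonal
                          S[i - 1][j] + GAP,       # up
                          S[i][j - 1] + GAP)       # left
    return S


S = smith_waterman_fill("ACG", "TACGT")
for row in S:
    print(row)
```

Because no cell can go below 0, a poorly matching region never drags down the score of a good region that follows it.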
Two cells hold the maximum score of 14, so more than one alignment produces the maximal alignment score. Which one should be considered? A value in the last row means the alignment runs to the end of a sequence (aligned fully).
Two traceback pathways
The two local alignments resulting in a score of 14
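Handling several maximal cells can be sketched by running one traceback from each of them. This is an illustration under assumptions: the standard Smith-Waterman recurrence with a zero floor, traceback that stops at the first 0 cell, and ties broken in favor of the diagonal. The sequences are made up and are not the ones from the slide.

```python
MATCH, MISMATCH, GAP = 5, -3, -4  # scoring scheme from this deck


def local_alignments(x: str, z: str):
    """Return (best_score, [(aligned_x, aligned_z), ...]), one per maximal cell."""
    m, n = len(x), len(z)
    S = [[0] * (n + 1) for _ in range(m + 1)]
    for i in range(1, m + 1):
        for j in range(1, n + 1):
            pair = MATCH if x[i - 1] == z[j - 1] else MISMATCH
            S[i][j] = max(0, S[i - 1][j - 1] + pair,
                          S[i - 1][j] + GAP, S[i][j - 1] + GAP)
    best = max(max(row) for row in S)
    results = []
    if best == 0:
        return best, results  # nothing aligns with a positive score
    for i0 in range(1, m + 1):
        for j0 in range(1, n + 1):
            if S[i0][j0] != best:
                continue
            # Trace back from this maximal cell until a 0 cell is reached.
            i, j = i0, j0
            ax, az = [], []
            while i > 0 and j > 0 and S[i][j] > 0:
                pair = MATCH if x[i - 1] == z[j - 1] else MISMATCH
                if S[i][j] == S[i - 1][j - 1] + pair:
                    ax.append(x[i - 1])
                    az.append(z[j - 1])
                    i, j = i - 1, j - 1
                elif S[i][j] == S[i - 1][j] + GAP:
                    ax.append(x[i - 1])
                    az.append("-")
                    i -= 1
                else:
                    ax.append("-")
                    az.append(z[j - 1])
                    j -= 1
            results.append(("".join(reversed(ax)), "".join(reversed(az))))
    return best, results


print(local_alignments("ACG", "ACGTACG"))
```

Here the motif occurs twice in the second sequence, so two cells share the maximum and two local alignments are reported.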
5 matches, 1 mismatch, and 2 gaps
score = 5 * 5 – 1 * 3 – 2 * 4 = 25 – 3 – 8 = 14
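The same arithmetic can be checked from the counts alone. A minimal sketch with a hypothetical helper name; the default weights are the scoring scheme used in this deck.

```python
def score_from_counts(matches: int, mismatches: int, gaps: int,
                      match: int = 5, mismatch: int = -3, gap: int = -4) -> int:
    """Alignment score from the number of matches, mismatches, and gaps."""
    return matches * match + mismatches * mismatch + gaps * gap


# Local alignment above: 5 matches, 1 mismatch, 2 gaps.
print(score_from_counts(5, 1, 2))
```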
What is in the next coverage?
Scoring Matrices: PAM & BLOSUM
Assessing the significance of sequence alignments