How to compare two sequences?
To compare 2 sequences, you need to determine the sequence alignment. They are either
homologous or not. Homologous is are the same and sequences of ancestors look the same
as well. Paralogues are homologues between species, orthologues are homologues within a
species.
Identity: A residue is exactly the same (either nucleotide or amino acid)
Conservation: An amino acid is replaced by another amino acid that preserves the
physicochemical properties of the original residue (for nucleotides, conservation has
no meaning, as two residues are only considered equal or not)
Similarity: The extent to which sequences are related, based on identity and
conservation.
Local alignment: Only take a small portion into account when finding alignments
Global alignment: Take the entirety of both sequences into consideration when finding
alignments.
o DNA
Alignment of nucleotide sequences is useful, for example to:
Study noncoding regions of DNA (promoter regions, intergenic regions)
Confirm the identity of a cDNA
Study DNA polymorphisms
Study genomes (compare genomes, create genome annotations)
o Proteins
Alignment of protein sequences can be more informative than alignment of nucleotide
sequences. It makes it more useful to use them when there is a large evolutionary
distance. This has several reasons:
There are 20 different amino acids, but only 4 different nucleotide bases
For amino acids we can take their biophysical similarities into account
Codons are degenerate: changes in the third position of coding sequences
often do not alter the amino acid that is specified
Query NLYENFVQATFNALTAEKV
NY ENF+Q+ + + +
Subject NYAENTIQSIISTVEPAQR
The centerline provides the following information: A letter designates an identity (or
high similarity) between the two sequences. A “+” means the two sequences are
similar but not highly similar. If no symbol is given between the two sequences, then a
non-similar substitution has occurred
Substitution matrix: Putting in the number of mutations for each pair of amino acids.
From there on, a scoring matrix can be formed.
Positive values for combinations that are more likely than random chance,
Negative values for combinations that are less likely than random chance,
Zero values for combinations that are as likely as expected from random
chance
- BLOSUM matrix
A substitution scoring matrix in which scores for each position are derived
from observations of the frequencies of substitutions in blocks of local
alignments in related proteins. Each matrix is tailored to a particular
evolutionary distance. In the BLOSUM62 matrix, for example, the alignment
Voordelen van het kopen van samenvattingen bij Stuvia op een rij:
Verzekerd van kwaliteit door reviews
Stuvia-klanten hebben meer dan 700.000 samenvattingen beoordeeld. Zo weet je zeker dat je de beste documenten koopt!
Snel en makkelijk kopen
Je betaalt supersnel en eenmalig met iDeal, creditcard of Stuvia-tegoed voor de samenvatting. Zonder lidmaatschap.
Focus op de essentie
Samenvattingen worden geschreven voor en door anderen. Daarom zijn de samenvattingen altijd betrouwbaar en actueel. Zo kom je snel tot de kern!
Veelgestelde vragen
Wat krijg ik als ik dit document koop?
Je krijgt een PDF, die direct beschikbaar is na je aankoop. Het gekochte document is altijd, overal en oneindig toegankelijk via je profiel.
Tevredenheidsgarantie: hoe werkt dat?
Onze tevredenheidsgarantie zorgt ervoor dat je altijd een studiedocument vindt dat goed bij je past. Je vult een formulier in en onze klantenservice regelt de rest.
Van wie koop ik deze samenvatting?
Stuvia is een marktplaats, je koop dit document dus niet van ons, maar van verkoper emmavandergaag. Stuvia faciliteert de betaling aan de verkoper.
Zit ik meteen vast aan een abonnement?
Nee, je koopt alleen deze samenvatting voor €3,49. Je zit daarna nergens aan vast.