Protein Sequencing vs. DNA sequencing * We compare protein sequences, not DNA, because protein is more conserved in evolution than DNA * The organism’s survival depends on the protein being functional, which means having the proper amino acids sequence * Since the genetic code is degenerate, many different DNA sequences will give identical proteins. * The protein 3-dimensional structure is even more conserved, because it is more closely related to enzyme activity than the amino acid sequence is.
BLAST
* standard sequence alignment tool (BLAST = Basic Local Alignment Search Tool) * BLAST is based on the concept that if you compare the same (that is, homologous) protein from many different species, you can see that some amino acids readily substitute for each other and others almost never do. * Results are arranged with the best ones on top * The most important score is the Expect value, or E-value, which can be defined as the number of hits any random sequence (with the same length as yours) would have in the database. * E-values for good hits are usually written something like: 3e-42, which is the same as 3 x 10-42 , a very small number * Bad hits are very common, and they have e-values in a more familiar form: for example, 0.004 or 1.2 * A really good e-values is less than 1e-180, which underflows the computer’s processing capabilities, so it written as 0.0 * E-values are affected by the length of the query sequence as well as the size of the database, so even perfect matches with short sequences give poor e-values * Before we can conclude that our protein is a homologue of the proteins BLAST matches it with, we would like them to have roughly the same length and have a high percentage of identical amino acids. * the lengths of the query and subject sequences should be within 20% of each other * There should be at least 30% identical amino acids * In this case we can be quite sure we have a good match
You May Also Find These Documents Helpful
-
This new RNA strand is called messenger RNA (mRNA)A start codon, and a stop codon determine the length of the gene-STAGE TWO - Translation:The mRNA strand binds to a ribosome in the cytoplasm, with the start codon being AUG…
- 7073 Words
- 23 Pages
Good Essays -
p.13 About the calculation method of KaKs in A.halleri genes. The authors add extra codons when they found more than one variable sites in a codon. This 'concatenated codons method' increases the number of comparison and may cause underestiamtion. Please discuss on this issue.…
- 248 Words
- 1 Page
Satisfactory Essays -
b) Sequence B was the ending fragment since it ends with the stop codon, UGA.…
- 366 Words
- 2 Pages
Satisfactory Essays -
The -10 sequence for the lux operon is tgttata. 2 nucleotides are different when compared to the ideal sequence.…
- 439 Words
- 2 Pages
Satisfactory Essays -
enter the ribosome. Antibiotic Using these instructions, a new protein chain is formed. Functional protein O C…
- 7229 Words
- 29 Pages
Powerful Essays -
Likelihood of Occurrence: This will measure the likelihood a failure mode is to occur. The score range will be 1-10 with 1 meaning it is very unlikely to occur and 10 meaning very likely to occur.…
- 2877 Words
- 12 Pages
Better Essays -
o The number of nucleotide substitutions in orthologous genes is proportional to the time since the genes became duplicated…
- 4658 Words
- 19 Pages
Good Essays -
There are multiple ways your cells can read a sequence of DNA and build slightly different proteins from the same strand. We will not go through the process here, but as an illustration of this “alternate splicing”, remove codons (beads) 52 - 66 from your sentence above.…
- 624 Words
- 3 Pages
Satisfactory Essays -
The flow of information from gene to protein is based on the triplet code. The genetic instructions for the amino acid sequence of a polypeptide chain are written in DNA and RNA as a series of three-base words called codons. The three-base codons in DNA are transcribed into complementary three-base codons in RNA, and then the RNA codons are translated into amino…
- 459 Words
- 2 Pages
Good Essays -
Complete the following and place the Word document in the Homework Dropbox by midnight Sunday. Remember to include complete citations for all sources used to answer each question.…
- 532 Words
- 2 Pages
Good Essays -
3’-T A A C C T T T A C T A G G C A C T-5’…
- 438 Words
- 2 Pages
Satisfactory Essays -
The second most familiar protein was also found in Canis lupis familiaris. This genome is significantly less similar. It is 47% identical, 57% similar, and has an E Value of 0.0.…
- 1682 Words
- 7 Pages
Better Essays -
The Q meter is an instrument used to measure electrical properties of coils and capacitors. Mostly used in testing the radio frequency circuits. A Q-meter basically measures the Q (Quality Factor) of the circuit. It is also used to test the inductors, capacitors for their high frequency inductance, Q , capacitance as well as ∑, tg δ value.…
- 951 Words
- 4 Pages
Good Essays -
Its standard deviation was at 28.72 very much comparable to that of the benchmark at 29.35 has higher…
- 432 Words
- 2 Pages
Satisfactory Essays -
it is a large negative number, it is often a subject of concern because it indicates an…
- 1821 Words
- 8 Pages
Good Essays