Multiple Sequence Alignment (MSA) - Solution
- Go to ClustalW, T-Coffee or MAFFT in MyHits, paste the sequences and run the
multiple alignment.
- Have a closer look at position 97 (green arrow): you can see a "U" residue there, except in A. gambiae.
- at least 46
- There is a stop codon in the corresponding DNA sequence (mRNA) of Q58DU4_BOVIN (BT021503)
- When you translate the mRNA in all reading frames, you can see
that the rest of the protein (light blue) is still there after the first stop.
That first stop is in fact the special amino acid "U": selenocysteine.
>bovin_1
KAAVMAARRDGWLGPAFGLRLLLATVLQTVSALGAEFSSESCRELGFSSNLLCSSCDLLG
QFNLLQLDPDCRGCCQEEAQFETKKLYAGAILEVCG*KLGRFPQVQAFVRSDKPKLFKGL
QIKYVRGSDPVLKLLDDSGNIAEELSILKWNTDSVEEFLSEKLERI*ILNFVLSFCYLVQ
*NTIAPRK*FSFAFFH*SVFYCEALNI*LKVQAAAQPMIGKKLTKPFFISFHPSFVDTTS
NRMPSNRLVVNYANDSL**LVSFMNNRFLN*GG**GR*LLCLVCCVLFESNNNKLESK*D
IPSLKRLPVPQILSYFCTPSLPFNRNV*FIMNALHKDFMAALL*NRFKIYSKSEIFTQRF
ALMKTTQKTFLRICVDLILSKFLCFTFLWKSQFKNDHL*DQNINKKFQKS
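The translation step above can be sketched in a few lines of Python. This is only a minimal illustration, not the tool used to produce bovin_1: it assumes the standard genetic code, covers the three forward frames only, and uses a made-up toy sequence rather than the real BT021503 record. The function names (`translate`, `three_frames`) and the `sec_codon` parameter are hypothetical helpers introduced for this example.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, with codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq, frame=0, sec_codon=None):
    """Translate a nucleotide sequence starting at offset `frame` (0, 1 or 2).

    Stop codons are written as '*'. If `sec_codon` is given (for example
    "TGA"), that codon is written as 'U' (selenocysteine) instead.
    Codons that are not in the table (e.g. containing N) become 'X'.
    """
    seq = seq.upper().replace("U", "T")  # accept RNA as well as DNA letters
    protein = []
    for i in range(frame, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        if codon == sec_codon:
            protein.append("U")
        else:
            protein.append(CODON_TABLE.get(codon, "X"))
    return "".join(protein)


def three_frames(seq):
    """Return the translations of the three forward reading frames."""
    return [translate(seq, frame) for frame in range(3)]


# Toy sequence (hypothetical, not taken from BT021503): ATG TGC GGA TGA AAG CTT
toy = "ATGTGCGGATGAAAGCTT"
print(translate(toy))                   # in-frame TGA shown as a stop: MCG*KL
print(translate(toy, sec_codon="TGA"))  # same frame, TGA recoded as U: MCGUKL
```

Reading past the first `*` instead of truncating there is what makes the rest of the protein visible, as in bovin_1 above.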
- Yes, there are also longer versions of Q5ZM93_CHICK (AJ719491) and Q5XG54_XENLA (BC084609),
and one in Ensembl for zebrafish (annotation).
>Q58DU4_BOVIN
MAARRDGWLGPAFGLRLLLATVLQTVSALGAEFSSESCRELGFSSNLLCSSCDLLG
QFNLLQLDPDCRGCCQEEAQFETKKLYAGAILEVCGXKLGRFPQVQAFVRSDKPKLFKGL
QIKYVRGSDPVLKLLDDSGNIAEELSILKWNTDSVEEFLSEKLERI
>Q5ZM93_CHICK
MAAAAELAALVRCWLCLLLGLPAINVYGAQLSSEACRELGFSSNLLCSSCNLLGQFSL
NQLDPFCRQCCQEEAQLETRKLYAGAVLEVCGXKLGRFPQVQAFVRSDKPKLFRGLQIKY
VRGSDPVLKLLDDSGNIAEELSILKWNTDSVEEFLSEKLERL
>Q5XG54_XENLA
MAAERMLLWLVAVLQALASYGAELSSEACRDLGFSSNLLCSSCDLLGQFGLNEINSFCRQ
CCQEEVHLESKKRYPGAVLEICGXKLGRFPQVQAFVRSEKPKLFKGLQIKYVRGSDPVLK
LLDENGNISEELSILKWNTDSVEEFLSEKLDRV
>ENSDARP056978_BRARE
MAGEVYLLWLLPLLQGLASYGAELSSEACRELGFSSNLLCSSCELLGQFSL
NQLDLPCRQCCQEEAQLENRKLYPGAILEVCGXKLGRFPQVQAFVRSDKPKLFRGLQIKY
VRGSDPVLKLLDDNGNIAEELSILKWNTDSVEEFLSEKLER
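In the four sequences above, an "X" sits in the conserved CGXKLGRFPQ motif, at the place where the bovin_1 translation shows its first stop. Assuming that "X" marks the selenocysteine here, a small Python sketch can report where it falls in each record. The helper names (`read_fasta`, `residue_positions`) are hypothetical, and the two toy records are invented for the example; the reported positions are 1-based and refer to the unaligned sequences, so they will differ from the alignment column.

```python
def read_fasta(text):
    """Parse FASTA-formatted text into a dict mapping record name to sequence."""
    records = {}
    name = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            name = line[1:].split()[0]  # keep the first word of the header
            records[name] = []
        elif name is not None:
            records[name].append(line)
    return {n: "".join(parts) for n, parts in records.items()}


def residue_positions(seq, residue="X"):
    """Return the 1-based positions of `residue` in `seq`."""
    return [i + 1 for i, aa in enumerate(seq) if aa == residue]


# Toy input (hypothetical records, not the real sequences)
toy_fasta = """>toy_1
MAAC
GXKL
>toy_2
MCGKL
"""

for name, seq in read_fasta(toy_fasta).items():
    print(name, residue_positions(seq))
```

Pasting the four real records into `toy_fasta` would list the position of the "X" in each of them.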
- When you align these sequences with the sequences in Swiss-Prot, you obtain this
alignment and this tree.
- This shows that, in this protein family, selenocysteine is found only in vertebrates.