• Keine Ergebnisse gefunden

SequenceAnalysis:PairwiseAlignments ExerciseSheet2 SoftwarewerkzeugederBioinformatik

N/A
N/A
Protected

Academic year: 2022

Aktie "SequenceAnalysis:PairwiseAlignments ExerciseSheet2 SoftwarewerkzeugederBioinformatik"

Copied!
2
0
0

Wird geladen.... (Jetzt Volltext ansehen)

Volltext

(1)

Softwarewerkzeuge der Bioinformatik

Prof. Dr. Volkhard Helms

PD Dr. Michael Hutter, Markus Hollander, Marie Detzler

Winter Semester 2020/2021

Saarland University Center for Bioinformatics

Exercise Sheet 2

Sequence Analysis: Pairwise Alignments

Learning objective: The goal is to learn when to use which BLAST–search (ProteinBLAST, NucleotideBLAST, MegaBLAST, PSI–BLAST), and which parameters (E–value, matrix, query database etc.) are useful depending on the search. Additionally, you are going to compute a pairwise sequence alignment with the Needlemann–Wunsch algorithm and answer some theoretical questions.

Exercise 2.1: Dynamic Alignment

Compute a global alignment of the sequences ACDEFAFGHI and KDELAFG using the Needlemann–Wunschalgorithm.

A C D E F A F G H I

K

D

E

L

A

F

G

Global alignment:

(2)

Exercise 2.2: ProteinBLAST

The lecture slides could be useful for answering the following questions.

a) What is the definition of theexpected threshold (E–value)? Why is an E–value threshold of 10 not particularly useful? What are sensible E–values?

b) How does theword size affect run time and accuracy?

c) What is special about the first hit of a BLAST search against a normal database?

d) Run aProteinBLASTsearch (http://blast.ncbi.nlm.nih.gov/Blast.cgi) for the pro- teinP00042

i. against theUniProtKB/Swiss–Protdatabase with default parameters. Find the 10 proteins with the highest homology to P00042 and display their sequences. What kind of proteins are we dealing with?

ii. against thenon–redundantdatabase with an E–value threshold of 0.001. What are the differences to the previous search? For which types of organisms were results found?

Exercise 2.3: MegaBLAST

Select humanas the genome on the BLAST main page. Search for the mRNANM 175054of the human geneHIST4H4 withmegaBLASTin the databaseGenome (GRCh38.p13).

a) On which chromsome isHIST4H4 located?

b) Is there a paralogue?

c) Find two or three directly neighbouring genes of HIST4H4.

Exercise 2.4: PSI–BLAST

a) Use ProteinBLAST to search for many very distantly related homologues of the protein Q57997in thenon–redundantdatabase with an E–value threshold of 0.02.

What are suitable substitution matrices?

b) Run the same search withPSI–BLAST and a threshold of 0.001 for the maximal E–value of the sequences used for constructing the PSSM.

c) What are the differences between the results of a) and the 1. iteration in b)?

d) How do the results of part b) change with further iterations?

Have fun!

Referenzen

ÄHNLICHE DOKUMENTE

leida, millised on vajalikud sammud üleminekuks nukleotiidsete järjestuste homoloogiaotsinguprogrammilt megablast BLAST paketis programmile blastn BLAST+ paketis, ja

Under the new conditions of Draconian economic and fiscal measures imposed by supranational bodies, the Irish people no longer have any illusions about what the independence of

Overall, 77.4% of 1919 current smokers reported not to have changed their smoking behaviour, 19.1% to have reduced, and 3.5% to have increased their smoking intensity as a

When verbs like want are followed by another verb, they are usually followed by to...  I want to see

B) Put the words in the correct order to make questions. Then answer the questions. C) Match the comments on the left with the responses on the right. Bridget lives in the flat.

Annie forbidden Hector to marry has Hector's family. __Has Hector’s family forbidden Hector to

__Did Bridget advise Annie to buy some new clothes______ ? ___Yes, she did. C) Match the comments on the left with the responses on the right. Bridget lives in the flat. The girl

ii. Log into your account and create your first notebook. On the left side of the website is the button + Create. Click on it and select New Notebook. New notebooks already use