• Keine Ergebnisse gefunden

Invited talk: phylogenetic networks with constrained and unconstrained recombination

N/A
N/A
Protected

Academic year: 2022

Aktie "Invited talk: phylogenetic networks with constrained and unconstrained recombination"

Copied!
1
0
0

Wird geladen.... (Jetzt Volltext ansehen)

Volltext

(1)

Phylogenetic Networks with Constrained and Unconstrained Recombination

Dan Gusfield

Abstract:A phylogenetic network is a generalization of a phylogenetic tree, allowing structural properties that are not tree-like. With the growth of genomic data, much of which does notfit ideal tree models, and the increasing appreciation of the genomic role of such phenomena as recombination, recurrent and back mutation, horizontal gene transfer, gene conversion, and mobile genetic elements, there is greater need to understand the algorithmics and combinatorics of phylogenetic networks.

Wang et al. studied the problem of constructing a phylogenetic network for a set ofnn binary sequences derived from a known ancestral sequence, when each site in the sequence can change state at most once in the network, and recombination bet- ween sequences is allowed. They showed that the problem offinding a phylogenetic network that minimizes the number of recombination events is NP-hard, but gave a polynomial-time algorithm (O(nm+n4)-time, fornsequences of lengthmeach) that was intended to determine whether the sequences could be derived on a phylo- genetic network in which the recombination cycles are node disjoint. We call such a network a “galled-tree”. The paper by Wang et al. is seminal in defining this pro- blem and asserting that it has an efficient solution. Unfortunately, the algorithm given there is incomplete and only constitutes a sufficient, but not a necessary, test for the existence of a galled-tree for the data.

In this talk we do several things. By more deeply analyzing the combinatorial constraints on galled-trees, we obtain an algorithm that runs inO(nm+n3)-time and is guaranteed to be both a necessary and sufficient test for the existence of a galled- tree for the data. We show how to relax the assumption that we know the ancestral sequence. We show that when there is a galled-tree, the algorithm constructs a “redu- ced” galled-tree that minimizes the number of mutations occurring on recombination cycles. We prove that the algorithm produces a reduced galled-tree that minimizes the number of recombinations needed for the data, over all possible phylogenetic networks for the data, even if multiple crossovers, or different ancestral sequences are allowed.

Hence, when a galled-tree exists, the problem of minimizing the number of recom- binations can be solved efficiently. The effect is that the galled-tree is a phylogenetic network that explains the input sequences with the “littlest deviation” from a true tree model. We show that the reduced galled-tree is “nearly-unique”, but when it is not un- ique, the algorithm also obtains a count of the number of galled-trees that exist for the input data, and can create these in linear time for each one, starting from the canoni- cal galled-tree. Finally, we consider phylogenetic networks where the recombination networks are not constrained in any way. We discuss new, efficiently computed lower bounds on the number of recombination events needed.

Joint work with Satish Eddhu, Chuck Langley, and Dean Hickerson

1

Referenzen

ÄHNLICHE DOKUMENTE

(CNC). Terminalia of Ceratomerus, lateral view. hibbsi, hypandrium and phallus.. tae near mid-length; 1 erect, ventral bristle at apical third; 1 short, preapical dorsal bristle.

The excurrent ostia open either above the dorsal diaphragm (when present) or below it, or they open into the space between the two layers of the diaphragm (and hemolymph is

As examples, metabolite profiling and enzyme activity data sets have been visualized in different metabolic maps, correlation networks have been generated and similar time

Zardoya R, Meyer A (1996a) The complete nucleotide sequence of the mitochondrial genome of the lungfish (Protopterus dolloi) supports its phylogenetic position as a close relative

This directly implies two important features for mathemat- ical modeling: Firstly, it is a metric space containing all possible phylogenetic trees on n leaves, i.e., it is a

We present a Lagrangian decomposition approach for the Knapsack Constrained Maximum Spanning Tree problem yielding up- per bounds as well as heuristic solutions.. This method is

That is, we emphasized power projection forces that appeared to be the most viable in denied areas to be able to hold out the prospect of prompt, high-volume

In the aftermath of any agreement, the United States (and the international community) must also maintain the will and capability to take effec- tive action, including the use