• Keine Ergebnisse gefunden

The chloroplast genome of a Symbiodinium sp. clade C3 isolate

N/A
N/A
Protected

Academic year: 2022

Aktie "The chloroplast genome of a Symbiodinium sp. clade C3 isolate"

Copied!
13
0
0

Wird geladen.... (Jetzt Volltext ansehen)

Volltext

(1)

http://www.elsevier.de/protis

Publishedonlinedate17October2013

ORIGINALPAPER

The Chloroplast Genome of a

Symbiodinium sp. Clade C3 Isolate

AdrianC.Barbrooka,1, ChristianR.Voolstrab,and ChristopherJ.Howea

aDepartmentofBiochemistry,UniversityofCambridge,BuildingO,DowningSite, TennisCourtRoad,Cambridge,CB21QW,UK

bRedSeaResearchCenter,KingAbdullahUniversityofScienceandTechnology (KAUST),Thuwal,SaudiArabia

SubmittedJune6,2013;AcceptedSeptember17,2013 MonitoringEditor:MichaelMelkonian

DinoflagellatealgaeofthegenusSymbiodiniumformimportantsymbioseswithincoralsandotherben- thicmarineanimals.Dinoflagellates possessanextremely reducedplastidgenome relative tothose examined in plants and other algae. In dinoflagellates the plastid genes are located on small plas- mids,commonlyreferredtoas‘minicircles’.However,thechloroplastgenomesofdinoflagellateshave onlybeenextensivelycharacterisedfromahandfulofspecies.Thereisalsoevidenceofconsiderable variationinthechloroplastgenomeorganisation acrossthosespeciesthathavebeenexamined.We thereforecharacterisedthechloroplastgenomefromanenvironmentalcoralisolate,inthiscasecon- tainingasymbiontbelongingtotheSymbiodiniumsp.cladeC3.Thegenecontentofthegenomeiswell conservedwithrespecttopreviouslycharacterisedgenomes.However,unlikepreviouslycharacterised dinoflagellatechloroplastgenomeswedidnotidentifyany‘empty’minicircles.Thesequencesofthis chloroplastgenomeshowahighrate ofevolutionrelativetootheralgalspecies.Particularlynotable wasasurprisinglyhighlevelofsequencedivergencewithinthecorepolypeptidesofphotosystemI,the reasonsforwhicharecurrentlyunknown.Thischloroplastgenomealsopossessesdistinctivecodon usageandGCcontent.ThesefeaturessuggestthatchloroplastgenomesinSymbiodiniumarehighly plastic.

©2013AdrianC.Barbrook.PublishedbyElsevierGmbH.Allrightsreserved.

Keywords:Symbiodinium;minicircle;dinoflagellate;chloroplastgenome;diversity.

Introduction

Dinoflagellates arean important groupof eukary- otic algae. They have a widespread global distribution and are found in both freshwater andmarine environments.Dinoflagellatesaresig- nificant primary producers, ranking second in importancebehinddiatomsinoceanicecosystems (Field et al. 1998). Several species are remark- ableinhavingprofoundenvironmentaleffectssuch

1Correspondingauthor;fax+441223333345 e-mail acb18@cam.ac.uk(A.C.Barbrook).

as the ability to produce extensive algal blooms, manyofwhicharecapableofproducingpotenttox- ins with correspondingly devastating effects (Lee 2008).Incontrastmembers ofthegenusSymbio- dinium are renowned for forming symbioses with corals and other benthic marine animals (Rowan and Powers 1991). The symbioses which they with corals are the basis of exceptionally diverse ecosystems.However,thereisincreasingconcern aboutthestabilityoftheseecosystemsandwhether theyareabletowithstandprojectedchangesofcli- mateandtheimpact ofotherhumaninterventions (Hughesetal.2003).

©2013AdrianC.Barbrook.PublishedbyElsevierGmbH.Allrightsreserved.

http://dx.doi.org/10.1016/j.protis.2013.09.006

Konstanzer Online-Publikations-System (KOPS)

URL: http://nbn-resolving.de/urn:nbn:de:bsz:352-2-ml72dg7lawup7

(2)

Dinoflagellates show a surprising litany of unusual or even unique biological properties (Bayer et al. 2012; Lin 2011). Focusing solely on genetic aspects, all three of the genomes found within dinoflagellates can be said to be exceptional. For example, the nuclear genome of dinoflagellates variesenormouslyin sizebetween species and possesses permanently condensed chromosomes, whilst containing relatively few nucleosomes despite possessing histone genes (Bayer et al. 2012; Lin 2011; Roy and Morse 2012). The mitochondrial genome, despite con- taining very few genes, achieves remarkable complexity arising from the presence of exten- sive non-coding inverted repeats and numerous geneisoforms,manyofwhichrequireconsiderable post-transcriptionalmodification(Nashetal.2008;

WallerandJackson2009).Thechloroplastgenome of dinoflagellatesis alsohighly anomalous(Howe etal.2008).Indinoflagellateswithchloroplastscon- taining the carotenoid peridinin as an accessory pigment, which are believed to be the ancestral form,thechloroplast genomeishighly reducedin genecontentandfragmentedintoseveralplasmid- like DNA minicircles. The genes that remain part of thechloroplastgenomeandarelocatedon the minicircles mostly encode key components of the major photosynthetic complexes. The minicircles areusuallybetween2-3kbp insizeand containa single gene (Howeet al.2008). However, excep- tions to this standard form exist. In Adenoides eludens there is evidence for larger single gene minicircles of 5 kbp as well as two gene minicir- cles ofup to 12 kbp (Nelsonand Green2005).A two gene minicircle has also been reported from Ceratium horridum (Laatsch et al. 2004).Unusu- ally, in this case the two genes, ycf16 and ycf24, arenotdirectlyrelatedtophotosyntheticreactions;

in fact they encode ABC transporter family pro- teins. Additionally, a recent study has suggested that these genesfrom Ceratium horridum and as well as the genes rpl28 and rpl33, putative mini- circlesequencesfromPyrocystis lunula, arelikely tohave beenacquiredbyhorizontalgenetransfer (HGT)(Moszczynskietal.2012).Thesegenesare doubly unusualasthey havebeen foundon mini- circles only in these speciesand not inany other dinoflagellates, andwhether thesefourgenesare genuinely of dinoflagellate origin remains contro- versial. In Amphidinium species minicircles have been characterised that carry as many as four genes(Barbrooketal.2006a).However,thesizeof theseminicirclesneverthelessconformstothe2-3 kbpnorm. InAlexandriumtamarenseandAlexan- drium catenellamultiplevariantformsofthepsbA

andpsbDgenesexist,inadditiontoastandardcopy thatispresumedtobefunctional,witheachvariant foundonitsownminicircle(Iidaetal.2009,2010).

ThevariantpsbAgenesofA.tamarensehavebeen showntobetranscribedandthetranscriptsedited despitenoneofthembeingabletoencodeafunc- tional proteinbecause of thepresence of various insertions and deletions (Iida et al.2009). In Lin- gulodinium polyedrum chloroplast genes may not be on minicircles at all, since at least the psbA gene hybridises to DNA of much higher molecu- lar weight (Wang and Morse 2006a). It therefore appears that the organisation of genes within theperidinin-containingdinoflagellatesshowscon- siderable diversity. Whilst minicircles have been isolatedfromarelativelylargenumberofdinoflag- ellate species (Howe et al. 2008), for most species only one or two minicircles have been characterised. This has left a largely incomplete pictureoftheoverallgenomeorganisationineach case.

Thecoding content of thedinoflagellate chloro- plastgenomehaslargelybeenestimatedfollowing extensive minicircle characterisation from just a handful of species, including Amphidinium (two strains) and Heterocapsa (Barbrook and Howe 2000; Barbrook et al. 2001, 2006a; Hiller 2001;

Nelsonetal.2007;Nisbetetal.2004;Zhangetal.

1999, 2001).These datahave been usedin con- junctionwithESTdatafromseveralspecies,which indicatetransfertothenucleusofmanygenesthat arenormallychloroplast-locatedinotheralgaeand plants(Bachvaroffetal.2004;Hackettetal.2004), to confirmwhichgeneshave beenretainedinthe dinoflagellatechloroplast.

Inspiteof theecologicalimportance ofsymbio- ses betweendinoflagellates andcorals, therehas been no systematic characterisation of a Sym- biodinium chloroplast genome, although partial sequencesofindividual chloroplastgenesormini- circles have been generated from a variety of Symbiodiniumstrainsandisolates(Barbrooketal.

2006b;LaJeunesseandThornhill2011;McGinley etal.2012;Mooreetal.2003;Santosetal.2002).A systematicsurveyoftheSymbiodiniumchloroplast genecomplementwouldhaveanumberofbenefits.

Earlierstudieshave shown thatthe sequencesof the psbAand psbD genesshow significant varia- tions at theamino acid level which arenot found in other algal species (Iida et al. 2008). These variationscouldhaveconsequencesforphysiolog- icalfunctionandultimatelytosusceptibilitytocoral bleaching (Warner et al. 1999).By characterising anessentiallycompletesetofminicircleswehope to find outwhetherpotentially significantvariation

(3)

exists in other key genes and provide a starting pointforfurtherstudies.

An extensive characterisation of minicircle sequences would also be of value in the further development of molecular markers for different Symbiodinium strains as has been suggested by Sampayo and co-authors (Sampayo et al. 2009).

We have previously analysed the psbAminicircle in Symbiodinium with regard to its utility as a molecular marker (Barbrook et al. 2006b). We foundthatitscodingregiongaveresultsconsistent with those generated in rDNA studies. However, we found the non-coding region to be highly variable, and subsequent studies showed the non-coding region to be a useful marker for the fine-scale differentiation of Symbiodinium strains (e.g. LaJeunesse and Thornhill 2011). Given the valueofthepsbAminicircleasataxonomicmarker it seemed important to find out whether other gene minicircles could be employed in this way as well. Some of the other minicircles may have characteristics that are even more suitable than the psbA minicircle. For example, in previously studied dinoflagellates the non-coding region of thepsbAminicircleisoftenlongerthanthoseofthe other minicircles and less suitable for easy PCR amplification and sequencing. For instance, the psbA minicircle characterised by Barbrook et al.

from a clade B Symbiodinium had a non-coding regioncloseto 1.5kbp. InbothAmphidinium and H.triquetrathenon-codingregionsofthepsaAand psaB minicircles were considerably shorter than thepsbAminicircle(Barbrooketal.2006a;Zhang et al. 1999). Although theshorter lengthof these non-codingregionsmayresultintheirofferingless taxonomicresolution,theeasierPCRamplification andsequencingmaymakethemmoreuseful.

In this paper we present the first extensive set of minicircle sequences from a Symbiodiniumsp.

isolate belonging to clade C3. We find that the gene content is similar to that found in other peridinin-containing dinoflagellates. Although the minicircles from this Symbiodinium isolate share many features in common with other dinoflagel- lates, numerousnovelfeatures arealsoapparent.

Theresultsindicatethatminicirclesotherthanthe psbAminicirclemaybe usefulforassessingSym- biodiniumdiversity.

Results

PCRAmplificationofMinicircles

We used three strategies to characterise gene- containing minicircles from a DNA extract of in

hospiteSymbiodinium.Firstly degenerateprimers were designed based on conserved regions of PsbA,PsbC,PsbD,PetBandchloroplast23Sribo- somal DNA (cp23S rDNA) as determined from alignmentsbasedon algalsequenceswith partic- ular reference made to dinoflagellate sequences (Fig. 1a, primers labelled 1). PCR using these primers was carried out using the sample from Agaricia sp. DNA products of the expected size for the various primer pairs were generated and sequenced. BLAST searches indicated that the products were derived from the expected genes.

Usingthesesequencesnew‘outward’primerswere designedthatwereexpectedtoamplifytheremain- der of the minicircle sequences (Fig. 1a, primers labelled 2). In PCRs which used these ‘outward’

primersproductswereagainsuccessfullyobtained.

Sequencingoftheseproductsshowedthatthenon- codingregionsofthepsbA,psbC,psbD,petBand cp23S rDNA minicircles contained sequences of approximately 500bp in lengththat shared >70%

identity with the equivalent non-coding regions reported previously for psbA minicircles of phylo- typeC(Barbrooketal.2006b).Thus,ashasbeen reported forotherdinoflagellate species, thenon- coding regions of minicircles containing different geneshaveextensiveregions,whichhavehighlev- elsofidentitywithinaspecies,whichwedesignate asthe‘core’region(Howeetal.2008).Outsideof the ‘core’ region only limited identity is observed between minicircles and is typically only shared betweenpairsofthem.

Inoursecondapproach,primersweredesigned based on stretches of the non-coding region that had high percentage identity across the differ- ent gene-encoding minicircles (Fig. 1b, primers labelled 1). In PCRs with these primers, multi- ple products weregeneratedranging in sizefrom 1.4-2.8kbp.Theseproductsweregelpurifiedand subsequently cloned. The DNAsequence of indi- vidual cloned PCR products was determined. In additiontosequencescorrespondingtopreviously characterised genes for proteins involved in pho- tosynthesis (petB,psbA,psbC,psbD),from which theprimersequenceshad beendesigned,further sequencesforsuchgeneswereobtained,namely atpA, atpB, psaA, psbBand chloroplast16Sribo- somal DNA (cp16S rDNA). To obtain complete minicirclesequencesofthenewlyacquiredgenes, outwardfacingprimersweredesignedcorrespond- ing to the 5 and 3 coding regions of the genes (Fig. 1b, primers labelled 2). These primerswere used successfully to amplify the complete non- coding regionsof theminicirclesof therespective genes,whichweresubsequentlysequenced.

(4)

Figure1. PCRamplificationof minicircles.a)initialamplificationofcodingsequence(primerslabelled1)fol- lowedby ‘outward’amplification(primerslabelled 2),b) initialamplificationusingnon-codingregionprimers (primers labelled 1)followed by ‘outward’amplificationfrom 5 and 3 termini of genes(primers labelled 2).

Greyshadedthickstretcheswitharrowdesignategene-codingregions,blackthickstretchesdesignatehighly conservednon-codingregions(‘core’region).

Minicircle sequences from other dinoflagellate speciessuggestedthatthepsaB,psbE,petD,psbI, rpl28,rpl33, ycf16and ycf24genesmight alsobe present on minicirclesin Symbiodinium. However, thesegeneshadnotbeenidentifiedthusfar.Inthe thirdstrategywemade useoftranscriptomedata, whichhadbeengeneratedfromtwoSymbiodinium isolates(Bayeretal.2012).WesearchedtheEST data and the NCBI TSA database for sequences corresponding to psaB, psbE, petD, psbI, rpl28, rpl33,ycf16andycf24.Weidentifiedsequencesfor psaB,psbE,petD,rpl28,rpl33,ycf16andycf24but notpsbI.ThepsbIsequencehasonlybeenidenti- fiedinonedinoflagellatespecies,A.carterae,and as well asbeing short is highly divergent (Nisbet et al. 2004). It is therefore unsurprising that we failed to identify sequences corresponding to it.

NumerousfeaturesoftheESTsequencesforrpl28, rpl33, ycf16 and ycf24 indicated that in Symbio- dinium they were located in the nucleus rather than the chloroplast (codon usage, presence of organellar targeting sequencesin inferred protein sequence, polyAtail addition,low sequenceiden- tityto putativeminicircular homologues)andwere not consequently further investigated. Using the SymbiodiniumESTsequencesforpsaB,psbEand petD,aswellasanyotherdinoflagellatesequences for these genes that were available,we designed primersthatwouldamplifycodingregionsofthese genes. PCRs were carried out that successfully amplified coding regions of all three genes (as Fig.1a,primerslabelled1).Thesesequenceswere thenusedtodesign‘outward’primersforthegenes (as Fig. 1a, primerslabelled 2). PCRs with these primerswerethenabletoamplifytheremainderof

thethreeminicircles,whichincludedtheconserved non-codingregions.

MinicircleGenomeCharacteristics

TheSymbiodiniumminicircleshadameanGCcon- tentof41.5%(Table1).Thisismid-waybetweenthe percentageGC contents ofthe plastid minicircles of the two other well characterised dinoflagel- late species. However, unlike both H. triquetra andtheAmphidiniumspecies,thepercentageGC content of the coding regions (36.83%) is much lower thanthenon-codingregions (48.24%).This appearstobeaconsequenceoftheveryhighinci- dence of short inverted repeat sequences in the Symbiodiniumnon-codingregion,whicharehighly GC-rich.Theorganisationanddiversityofthenon- coding region will be described in further detail below.

The total length of the identified genome is 27,293 base pairs (Table 1). This is shorter than the totallengthsof the characterised genomesof A. carterae (both) and H. triquetra. However, the largersizesoftheothergenomesareprimarilydue totheexistenceofnumerous‘empty’andchimeric minicircles from these species (Howeet al.2008;

Zhang etal.2001).Wedidnotidentifyanyof this type of minicircle in Symbiodinium. The first and third strategiesemployed for determination of the minicirclesequencesfromSymbiodiniumwouldnot beexpectedtoleadtothediscoveryofany‘empty’

minicircles. However, the second strategy should have amplified ‘empty’ minicircles assuming their non-codingregionswereofsufficientidentityatthe primer binding sites. Chimeric minicircles should

(5)

Table1. Typeandsize(bp)ofminicirclesequencesinthechloroplastgenomesstudied(*partialcp16SrRNA sequence).

A.carteraeCS21 A.carterae CCAP1102/6

H.triquetra Symbiodinium sp.cladeC3

totallength(bp) 45,815 34,186 42,769 27,293

meanGCcontent 45.27% 46.46% 37.02% 41.50%

geneminicircles

psaA 2,558 2,443 3,005 2,788

psaB 2,366 2,363 3,121 2,741

psbA 2,520 2,311 2,151 1,914

psbB 2,327 2,282 2,286 2,232

psbC 2,477 2,341 2,330 2,135

psbD/E/I 2,358 2,369

psbD 2,644 1,880

psbE 2,214 1,309

petB/atpA 2,606 2,713

petB 2,204 1,722

atpA 2,444 2,213

petD 2,563 2,416 2,177 1,419

atpB 2,587 2,483 - 2,691

cp23SrRNA 2,713 2,655 3,027 2,839

cp16SrRNA 2,553 2,458 2,563 1,420*

‘empty’minicircles

No. 10 5 1 0

totallengthofemptyminicircles 18,196 7,367 2,012 0

chimericminicircles 0 0 5 0

totallengthofchimericminicircles 0 0 10,625 0

totallengthexcludingemptyand chimericminicircles

27,619 26,819 30,132 27,293

have been amplified by boththe first and second strategies,butwouldhavebeenunlikelyinthethird strategy as theprimer binding sites were located closetoboththe5and3endsofthegenes,oneor otherofwhichareusuallyabsentfromthechimeric minicircles.Overallourresultssuggestthatempty or chimericminicircles areless prevalentin Sym- biodinium, buttheirpresencecannotbeexcluded.

When the empty minicircles are excluded from comparisonsofgenomelength,Symbiodiniumand the A. carterae strains are remarkably similar (Table1),despitetherebeingsignificantvariationin thesizesoftheindividual minicircles. TheAmphi- dinium minicircles are considerably more uniform in length than those of Symbiodinium (Table 1).

There is only a 431bp difference between the largestandsmallestgene-containingminicirclesin the Amphidinium strains, whilst the correspond- ing figure is 1,530bp in Symbiodinium. This is also a much wider range than found in H. tri- quetra. The wider range is primarily due to the

existence of short minicircles in Symbiodinium (namely those encoding PsbE, PetD and cp16S rRNA). The shortest minicircle in Symbiodinium containspsbE.Onpreviouslydescribedminicircles psbE occurs alongside other genes, either other photosystem II genes in Amphidinium or tRNA genesinHeterocapsa. However,inSymbiodinium wefound noevidence forany co-localisedgenes.

If one subtracts the core region sequences this meansthereisconsiderablymorenon-codingthan coding DNA on the minicircle (234 coding base pairs versus 596 non-coding base pairs). In pre- viousstudieswehave suggestedthatgenessuch aspsaA might be shortened by internaldeletions toallowthemtobeaccommodatedwithina‘typical’

sized minicircle (Barbrookand Howe2000; Howe etal.2008).InSymbiodiniumwheretheminicircle sizes aremuch more variable,similar patterns of deletionareobserved,suggestingthatgenelength alteration isunlikelyto beaconsequence ofmini- circlesizeconstraints.

(6)

MinicircleGeneContent

All the Symbiodinium minicircles appear to con- tain either a single protein-encoding gene or a ribosomal RNA gene. This is similar to the majority of minicircles that have been charac- terisedinotherperidinin-containingdinoflagellates.

In Amphidinium carterae and Adenoides eludens minicircles havebeen identifiedthatcontainmore than one protein-coding gene. We found no evi- dence for such minicircles in Symbiodinium. In otherspeciestRNAgeneshavebeenfoundinclose associationwithproteinorribosomalgenes.Again we did not find this arrangement on the Symbio- dinium minicircles. In fact somewhat surprisingly, no tRNA genesweredetectedon any minicircles.

We were able to amplify sequences correspond- ingtoallthegenesthatweexpectedtobepresent on the chloroplast genome including atpB, which wasnotreportedinH.triquetra.Weanticipatethat oneormorefurthercp16SrRNA minicirclesexist, forreasonswediscusslater.Thecodingcontentof theSymbiodiniumminicirclesthereforeappearsto belargelyconsistentwithotherstudies.Wedidnot detectanyORFssimilartothosediscoveredinthe genomesofA.carterae(Barbrooketal.2006a).For most minicircles the obvious gene coding region pluscoreregioncoversvirtuallyallofthesequence.

However,asimpliedabove,someoftheshortmini- circles,whichcontainsmallgenes,havestretches ofsequencewithnoapparentfunction.Nosizeable ORFs weredeterminedin theseregions.RT-PCR experimentsmight revealifsomeoftheseregions are transcribed. RT-PCR experiments employing polyAcDNAprimershavepreviouslybeencarried outinAmphidiniumtoverifytheexistenceofsuch ORFs(Barbrooketal.2012).

MinicircleCodonUsage

The low GC content of the coding regions is reflected in the Symbiodinium codon usage (see SupplementaryTableS2).Incaseswhereanamino acidisencodedbymultiplepossiblecodonsthose withahigherATcontentareusedmorefrequently.

We have previously noted the biased codon pre- ferences of minicircle genes in A. carterae and H.triquetra(Howeetal.2003).However,in these speciesaninconsistentpatternwasobserved,with some codons which are more GC rich preferred over AT rich ones. The Symbiodinium preferen- ces areactuallymuchmoresimilartothosefound in more typical plastid genomes. The fact that in the three well characterised dinoflagellate mini- circular genomes three differing patterns have

beenobservedseemstoindicateminicirclecodon preferenceishighlyplastic amongsttheperidinin- containingdinoflagellates.

Therearealso relativelyfewrare codonsinthe Symbiodinium minicircles studies. Just 8 codons are observed10 times orfewer (out of 4512total codons). This is comparable to the Amphidinium species in which 10 codons are observed to be correspondinglyrare,whereasHeterocapsahas20 instances ofrarecodons.Thesetsofrarecodons arenothighlyoverlappingamongthethreespecies (Fig.2).Interestingly,SymbiodiniumusesTGAand TAA exclusivelyas Stop codonswhereas bothH.

triquetra and A. carterae use TAA and TAG. In mostalgal chloroplastgenomesalowpercentage ofStopcodonsareTGA,andtheyarenotusedat allinToxoplasmagondii(Meureretal.2002). The patternofStopcodonsobservedondinoflagellate minicirclesisconsistentwitharequirementforter- minationfactorpRF1inH.triquetraandA.carterae, but pRF2 in Symbiodinium (Duarte et al. 2012).

Againwhilsttranslationterminationappearstohave been simplified in dinoflagellate chloroplasts an inconsistentordiversepatternisobservedamong differenttaxa.

Intwocases,forPsaAandPetD,alternativeStart codons [Ile (ATT) and Ile (ATA) respectively] are predictedtobeused(seeSupplementaryFigsS1, S2).AlternativeStartcodons,includingthese,have beeninvokedfornumerousminicirclegenesfroma rangeofspecies.

AGA AGC TTA TCA

AAA ATA GGG CGA

TGA

ACC TTG AGG TCC TGC GGA CTA GCC CTG GTG ACG CCC

CGC CGG TCG GCG

CCG TAG Amphidinium carterae

Heterocapsa triquetra

Symbiodiniumsp.

Figure2. VennDiagram showingthe distribution of rare codons between dinoflagellate species. These codonsarefound10orfewertimesincombinedmini- circlecodingregionswithinaspecies.STOPcodons whichareneverusedwithinaspeciesarealsoshown.

STOPcodonsareshowninbold.

(7)

PhylogeneticsandSequenceDiversity Wegeneratedsimplephylogeneticnetworksusing each of the inferred Symbiodinium minicircle proteinsequencesalongwithallavailablehomolo- gousdinoflagellate sequencesand representative sequences from a wide range of algal lin- eagesincludingcyanobacteria.Thenetworkswere assessedforanyunexpectedphylogeneticassoci- ations amongstthe minicircle derived sequences.

For all protein networks all of the peridinin- containingdinoflagellate sequencesgroupedasa monophyleticgroup, albeitwithlongedgelengths incomparisontoothereukaryoticalgalsequences.

None of the dinoflagellate sequences grouped closelywiththeothereukaryoticalgalorcyanobac- terial sequences.A strikingfeature apparentfrom these networks is the increased rate of evolution thathasoccurredwithintheplastidsofthedinoflag- ellate lineage (Fig. 3). This accelerated evolution has been noted in previous studies (Bachvaroff et al. 2006; Zhang et al. 2000). It is also appar- ent that the minicircle proteins are not evolving

at a uniform rate. Whilst the core proteins of the PSII complex, D1 and D2, show the highest degree of protein conservation relative to other algalsequences,thisisstillmuchlowerthanistyp- ically found in all other eukaryotic algae (Fig. 3).

Forexample,thereismoreproteinsequencediver- sityamongstknowndinoflagellateminicirclePsbA sequencesthanbetweenasimilarsizedsetoftaxa comprising a rhodophyte, a cryptophyte, a hap- tophyte, adiatom, a euglenophyte, a chlorophyte and a glaucophyte, as shown by the differences in the number of constant amino acids in these groups (see Supplementary Table S3). This high level of diversity is also seen in components of other photosynthetic complexes, even though rel- atively few examples of dinoflagellate sequences are known (see Supplementary Table S3). The core proteins of PSI are highly divergent, which is surprising given their crucial role in the photo- syntheticelectrontransferchain.Closeranalysisof the protein alignments of PsaA and PsaB shows that this diversity is not equally spread over the lengthoftheproteins.WhilsttheC-terminalportion

Figure3. Phylogeneticnetwork constructedfrom simpleproteinsequencedistancesfor thePhotosystem II D1protein(PsbA).Taxonomicgroupingsofdinoflagellatesandotherphotosyntheticprotistsandcyanobacteria areshown.TaxonomicassignmentsandnomenclatureaccordingtoNCBITaxonomydatabase.

(8)

of both proteins is conserved to a similar extent to D1 and D2 the N-terminus is highly divergent.

ThispatternofevolutionofthecoreproteinsofPSI has previously been notedin certain cyanobacte- ria,notablymembersofthegenusProchlorococcus (Tingetal.2009;vanderStaayetal.2000).Asin somecyanobacteria,indinoflagellatestheregions whichshowhighestdivergencecomprisethelight- harvesting domains of the PSI. It is possible that this may indicate a change in thelight-harvesting properties of this complex. In consequence, dif- ferences in light-harvesting properties between different Symbiodinium species might comprise part of the physiological differences that these dinoflagellatesexertoncoralholobiontphysiology.

Overallthedinoflagellatesequencesshowalarge decreaseinthepercentageofconstantsites(Sup- plementaryTableS3),implyingalossoffunctional constraintsontheproteinstructure.Thesearekey hallmarks ofcovariotideevolutionofsequencesin which some positions in sequences are invariant in certain phylogeneticlineages but not in others (Lockhartetal.1998).

InferredProteinSequences

Lookingindetailatthealignmentsofthedinoflag- ellate proteins compared with representatives of themajoralgalgroups,somenotablefeaturesare apparent. The ATP synthase ␣ and ␤ subunits showconsiderablevariationattheN-terminalends.

These sequences correspond to the HAS bar- rel domain of the subunits, which is a ␤ –barrel domain that forms the ‘crown’of the F1 complex.

Whilst the domain does not appear to be criti- cal for the catalytic function of the ATPase, it is postulated tobe importantintheformationof crit- icalinteractionsthatarerequiredfortheassembly of the complex (Bakhtiari et al. 1999). We have previously noted the existence of a large inser- tion (41 amino acids) in the A. carterae AtpB sequence within this region (Barbrook and Howe 2000). We find that the Symbiodinium sequence has an even larger insertion (82 amino acids) at precisely thesame location, whichis shortlyafter the poorly conserved HAS-barrel domain. Other Symbiodinium sequences have indels relative to other typical algal sequences. For example, the chloroplast cytochrome b (PetB)sequence has a 4 amino acid insertion in a luminal loop region, which would be predicted to lie close to the Qo

site.ThesequencesofthecoreproteinsofPSI in Symbiodiniumpossess severalindels in compari- sontotypicaleukaryoticalgalsequences(Table2).

Alltheseindelsarelocatedinloopregionsbetween

Table2. Size and position of indels relative to typi- calalgaesequenceswithinPsaAandPsaB.Stromal loopsshadedgrey.“+”indicatesinsertionand“-”indi- cates deletion, numericalvalue indicatesnumberof aminoacidsinsertedordeleted.

loop PsaA PsaB

N-terminal -9 +17

I-II

II-III +20

III-IV -19 -41

IV-V -19 -12

V-VI

VI-VII -9

VII-VIII -17 -18

VIII-IX

IX-X -5

X-XI +2

C-terminal -4

the transmembranehelicesandare foundinboth stromalandluminalloops.Themajorityoftheindels aredeletionsandthemostextensivedeletions,>10 amino acids, are found between the same trans- membrane helices in both PsaA and PsaB (see Table 2). Sequence length variation in the core components of PSI has been previously reported in a number of cases, notably in cyanobacteria (Vanselow et al. 2009) and most extensively in several species of Prochlorococcus (Ting et al.

2009; van der Staay et al. 2000), as well as in dinoflagellates(BarbrookandHowe2000).Inother dinoflagellates PsaA and PsaB sequences have frequentindelswhichoftenmaptothesamepos- itionsastheonesfoundinSymbiodinium.However, thelengthsoftheindelsandthesequencesimilarity surrounding them are highly variable, suggest- ing that these may be mutational hotspots within the coding region. This suggests that sequences containingtheseregionsmightconstitutegoodcan- didatesformolecularmarkers.

Astrikingfeatureofthecodingsequencesofthe twocomponents ofthecytochromeb6fcomplexis the presence of an in-frame Stop codon in each.

ForthePetBpolypeptidethisisataminoacidposi- tion 136 out of 218 and in the PetD polypeptide atposition 32outof158(seeSupplementaryFig.

S2). In both cases this is a TGA codon. TGA is used as a Stop codon at 6 out of 11 predicted translation termination sites fortheSymbiodinium minicirclegenes,soitseemsunlikelythattheinter- nalStop codonswouldbe readassensecodons.

We assume that the Stop codons are edited to sensecodonstoallowexpressionoftheseproteins.

In support of this, the published Symbiodinium

(9)

transcriptomedatalackstopcodonsatthesepos- itions(althoughthesequenceswerederivedfromA andBphylotypestrains).Additionally,theLingulo- diniumpolyedrumpetBgenealsohasastopcodon (although in a different position in the sequence) that is modified to a sense codon in the mRNA (WangandMorse2006b,SupplementaryData).

RibosomalRNAGenes

In addition to the protein coding genes we iden- tified two rRNA genes on minicircles. The cp16S rRNAgeneisapparentlynotafull-lengthsequence.

Thesectionweidentifiedisapproximately300base pairsin lengthand shares70% sequenceidentity with the H. triquetra cp16S rRNA sequence. The H. triquetracp16S rRNA gene is highly divergent and has been shown to be fragmented into five pieces(DangandGreen2009).TheSymbiodinium sequencecorrespondstothefirsttwofragmentsof theH.triquetracp16SrRNA.WhereasinH.trique- traallthepiecesthatcomprisethecp16SrRNAare foundadjacenttooneanother,andintheexpected order,itappearsinSymbiodiniumfragmentsofthe cp16S rRNAarelocalisedtoseparateminicircles.

Incontrast,thecp23SrRNAappearstobeencoded in a continuous sequence on a single minicircle.

The SymbiodiniumandH.triquetra cp23S rRNAs show a similar percentage identity to the cp16S rRNAs (≈70%). This high level of identity was observedoverastretchof1800basepairsonthe minicircle. A somewhat lower percentage identity wasobservedbeyondthisregion.Giventhesehigh levelsofsequenceidentityandothersimilaritiesto therRNAsfoundinH.triquetrawebelievetheSym- biodiniumsequencesrepresentfunctionalcopiesof thechloroplastrRNAs,albeitweanticipatefurther fragmentsofthecp16SrRNAtobefound.

MinicircleNon-codingRegions

Thenon-codingregionsoftheminicircleseachcon- tain conserved ‘core’ sequencesof approximately 500 base pairs in this Symbiodinium strain. Con- servednon-codingregionsofpsbAminicirclesfrom phylotype C have previously been described in some detail (Barbrook et al. 2006b; Moore et al.

2003;Thornhilletal.2013).Theconservedregions in this strain are broadly similar to these. As in otherdinoflagellatespeciestheyshowhighlevelsof identitybetweenthedifferentminicircles, although this varies according to individual minicircles with somepairsofminicirclesdisplayinghigherpairwise identitythanothers.Comparisonofthenon-coding region between minicircles from this isolate and alsowiththepreviouslyidentifiedpsbAminicircles

further highlights themost conserved sequences.

The regionpreviously described asC4 (Barbrook etal.2006b;Moore etal.2003)isthemosthighly conserved (Fig. 4a). The region contains three inverted repeats followed by a A5-6-N-A4-6 motif (with the exception of the psbE sequence where the second run of As is interrupted by a C). The low level of nucleotide variation observed invari- ably maintains theintegrity of the invertedrepeat sequences.Additionally,itisnoticeablethatwithin the C4sequence identityis maintainedwithin the unpaired regions in the centre of the inverted repeats.Typicallyinthemanyinvertedrepeats(>8) thatarefoundinnon-codingregionstheseunpaired nucleotides arethemost variable(Barbrooketal.

2006b).Afeature commonto themajorityofnon- coding sequences between the end of the C4 region and the start of the coding regions is a stretchofaround24basepairsthatishighlyC-rich (typically75%)(Fig.4b).Thisisfoundonallminicir- clesapartfromthepetDandpsbEones.Thehighly conservedPregionthatimmediatelyprecededthe coding region on psbA minicircles (Moore et al.

2003) is not conserved among other minicircles anditseemsthatthismaybe agenespecificfea- ture,possiblyinvolvedinpsbAtranscriptregulation.

Givenitscloseproximitytothecodingregionitmay beincludedinthepsbAtranscriptsandformpartof the5untranslatedregion.

Since dinoflagellates are believed to harbour multiple copies of a given minicircle (Koumandou and Howe 2007), it is possible that single Sym- biodiniumcells couldharbournumerousdivergent copies. The study of LaJeunesse and Thornhill (2011) suggests that, at least for the non-coding region of psbA minicircles, intragenomic varia- tion does exist albeit at low levels, although the extent of this may vary according to lineage. In ourPCRreactionswedidnotencountersequence heterogeneity on gene specific minicircles, with oneexception,eventhoughtheinitialDNAprepa- ration was from an environmental sample. For example, multiple bands were not visible in gene specific minicircle amplifications, either of cod- ing or non-coding sequences. In cases where we sequenced multiple independent clones of the same minicircle we found sequences to be identical, excepting occasional single base differ- ences. Such levels of variation have been shown to be readily generated by methodological arte- facts associated with PCR and bacterial cloning (LaJeunesseandThornhill2011).Ourresultsindi- cateeitheralowleveloranabsenceofintra-colony andintra-genomicsequencevariationforallbutone type of minicircle in this isolate. The exception to

(10)

a)

psbA ATGGGCTGGGTGCCCTACCCAGCCCATATGGGCCCACACTTCGTGGGGCCCATAACGCCC petB ATGGGCTGGGTGCCCTACCCAGCCCATATGGGCCCACACTTCGTGGGGCCCATAACGCCC cp16S rRNA ATGGGCTGGGTGCCCTACCCAGCCCATATGGGCCCACACTTCGTGGGGCCCATAACGCCC atpA ATGGGGTGGGT-CCCTACCCACCCCATATGGGCCCACACTTCGTGGGGCCCATAACGCCC psaA ATGGGCTGGGTGCCCTACCCAGCCCATATGGGCCCACACTTCGTGGGGCCCATAACGCCC psbD ATGGGCTGGGTGCCCTACCCAGCCCATATGGGCCCACACTTCGTGGGGCCCATAACGCCC psaB ATGGGCTGGGTGCCCTACCCAGCCCATATGGGCCCACACTTCGTGGGGCCCATAACGCCC atpB ATGGGCTGGGTGCCCTACCCAGCCCATATGGGCCCACACTTCGTGGGGCCCATAACGCCC psbC ATGGGCTGGGTGCCCTACCCAGCCCATATGGGCCCACACTTCGTGGGGCCCATAACGCCC psbB ATGGGGTGGGTGCCCTACCCACCCCATATGGGCCCACACTTCGTGGGGCCCATAACGCCC cp23S rRNA ATGGGGTGGGTGCCCTACCCACCCCATATGGGCCCACACTTCGTGGGGCCCATAACGCCC petD ATGGGCTGGGTGCCCTACCCAGCCCATATGGGCCCACACTTCGTGGGGCCCATAACGCCC psbE ATGGGGTGGGTGCCCTACCCACCCCATATGGTCCCACACTTCGTGGGGACCATAACGCCC

***** ***** ********* ********* **************** ***********

psbA TTCGGGCTTCAAAAAATAAAA--GGT petB TTCGGGCTTCAAAAAATAAAAA---T cp16S rRNA TTCGGGCTTC-AAAAATAAAAA---T atpA TTCGGGCTTCAAAAAATAAAA---GT psaA TTCGGGCTTCAAAAAATAAAA--GGT psbD TTCGGGCTTCAAAAAATAAAA--GGT psaB TTCGGGCTTCAAAAAATAAAA--GTT atpB TTCGGGCTTCAAAAAATAAAA--GGT psbC TTCGGGCTTCAAAAAATAAAA--GGT psbB TTCGGGCTTC-AAAAACAAAAA--GT cp23S rRNA TTCGGGCTTCAAAAAATAAAA--GGT petD TTCGGGCTTC-AAAAATAAAAA--GT psbE TTCGGGCTTCAAAAAATAACATGAGT

********** ***** ** * *

b)

psbA -ATGTGCCCTCTCCCCAGCCCC-TCTCCCACCAGGCCAAAATTATTATCACAATC--TTT petB -ATATGCCCTCCACCCCTACCC-CTCCTC-CCAACAGAACATTCAGGCCAAAATTAGGTG cp16S rRNA ----TCCTGTATGCCCCTACCC-CCACCT-CCAACCTC-CTCCCCCACTGAAAT---GTG atpA -ACCTGCC-TCCACCCCTACCC-CTCCTC-CAAACGAG-AAATCAGGCCAAAATTAAGTG psaA -AAGTGCCTCCACCCCCTACCC-CCTCCT-CCAAATGGGAATTCAAGCCAAAATT--ATG psbD -ACCTGCCCTCCACCCTACCCC-CCTCCCAGCAA--TAGAATTAG-GCCAGAATT--ATG psaB -GCATACCCCTCCCCCAACCCC-CTCCCCAT---CCTGGCCAAAATT--A-- atpB -ATGTGCCCCTCCCCCTACCCC-CTCCCCACGAAG---TGAATCTGGCCAAAATT--AAA psbC -AT---CCCCTACCCCTTCCCC-CAGCAGTTAAA---TCAGGCCAAAATT--AAG psbB -ATGTGCCTCCACCCCTACCCC-TCCTCCACCAG---AAACAGGCCAAAATTACTTT cp23S rRNA -ATGTGCCCCTCACCCTACTCCTCTCCTCTCGAAGTGAAGATTGTGGCCAAAATTA-ACT petD -AAATATCGAAATTACGCCTAAAATGTGAAGTAAGTACACGGATACCCCCACCCAAAAA- psbE AAATTAAGAAATTTTCGCCTTTTTTACGAAGTAAGTAGACGGATACCTCAAAATTGTTAC

C-rich region

Figure4. SelectedregionsofanalignmentofSymbiodiniumexAgariciasp.non-codingminicirclesequences.

a)C4regionof‘core’sequences.Arrowsindicateinvertedrepeats.Starshowsasinglenucleotidewithinarepeat thatdoes nothaveapotential basepair.Boldbasesindicate unpairednucleotides betweencomplementary repeatsequences.b)C-richnucleotidesequence.Sequenceisfoundtypically<100bppriortopredictedcoding startsites.

this observation was the cp16S rRNA minicircle.

We cloned numerous fragments of this minicircle following PCR amplification using primers com- plementary to the ‘core’ region. The sequences were identical except for the presence of vari- ablenumbersofcopies,rangingfromsixtotwelve in different clones, of a repetitive sequence, 5- TGTGCCCACATGCCTGCCC-3,whichwasfound outsideofthe‘core’inthenon-codingregionofthe

minicircle.Wedidnotdetectthisrepeatinanyother minicirclesequences.

Discussion

We found that the chloroplast genome of our Symbiodinium sp. clade C3 isolate is constituted by single-gene minicircles. The size of these

(11)

minicircles is largely similar to the majority of minicircles sofar identified,although a somewhat wider range of sizes is observed. We did not detect numerous variant or chimaeric forms of minicircles as have been identified in some other species (Howe et al. 2008; Zhang et al. 2001).

This Symbiodinium chloroplast genome appears to possess a number of distinctive features; for example,theabsenceoftheStopcodonTAG,the inclusionofStopcodonswithinthegenesequences of the cytochrome b6f complex components, and the complex architecture of the minicircle core region withits many inverted repeats. There also appears to be a low incidence of ‘empty’ mini- circles. In other respects the genome conforms to characters found in other peridinin-containing dinoflagellates. Characterisation of further Sym- biodinium isolates from other phylotypes will be necessary to confirm that these are general fea- turesfoundthroughoutthegenus.Webelievethat sequenceswehave obtained willbeof usein the furtherstudyofSymbiodinium.Forexample,some oftheunusualproteinsequencesmaybeofsignif- icanceindeterminingsusceptibilitytohighthermal or light exposure. Perhaps more significantly, the relativelyshortnon-codingsequencesofthepsaA and psaB minicircles along with the presence of possible mutational ‘hotspots’ within their coding regionssuggesttheymaybe excellentcandidates asmolecularmarkersforanalysingSymbiodinium diversityandpossibletargetsofpositiveselection.

This may be aided by low levels of intragenomic variation betweenminicircles coding forthesame gene,althoughfurthersamplingofarangeofSym- biodiniumphylotypeswillbeneededtoverifythis.

Methods

SampleoriginandDNAextraction:Thematerialforthisstudy wasprovidedasagiftbyProfessorAngelaDouglas.Thesingle sampleofAgariciasp.usedwascollectedfromBermudaand wasanalysedaspartofpreviousresearchontherDNAdiversity ofSymbiodinium(Savageetal.2002).TheDNAwasextracted asdescribedbySavageetal.(2002).Thisstudyshowedthe zooxanthellaefromthesamplebelongedtoPhylotypeC.PCR amplificationandsequencingoftheITS2regionofthenuclear rDNAverifiedthis,yieldingaBLASTNhitwitha1bpdifference totheC1orC3subphylotype,(Acc.no.HG515026).ThisPCR was performedaccording toSampayoetal. (2009)withthe primersITSintfor25GAATTGCAGAACTCCGTG3andITS2- reverse 5 GGGATCCATATGCTTAAGTTCAGCGGGT 3. We subsequentlyconductedaphylogeneticanalysisbasedonthe psbAnon-codingregion,asthismarkerhasbeenshownprevi- ouslytoprovideimprovedresolutionofSymbiodiniumspecies diversity(LaJeunesseandThornhill2011).Comparisonofthe psbAnon-codingportionofourisolatetoC1andC3sequences taken from (Thornhill et al. 2013) with MAFFT (Katoh and

Standley2013)revealedamuchbetteralignmenttoC3.Aphy- logenetictreeconstructedfromthesedata(NeighborJoining, 1,000bootstraps)showedclosestcommonancestrywithstrong supportofourisolatetoC3.

PCRamplification ofminicircles: PrimersusedinPCR reactionsaredescribedinSupplementaryTableS1.Standard PCR conditions were an initial cycle of 95C for 3minutes 15seconds followed by 35 cycles of 95C for 45seconds, an annealing temperature appropriate to primer pair for 45seconds,72Cfor1to3minutesandafinalstepof72Cfor 10minutes.PCRproductswerepurifiedusingeitheraQIAquick PCRPurification Kit(QIAGEN,Germany)or aMinElute Gel ExtractionKit(QIAGEN,Germany)followingagarosegelelec- trophoresis.PCRproductswere eithersequenceddirectly or clonedintopGEM-Tplasmidvector(Promega,USA)andtrans- formedintoEscherichiacolipriortosequencing.

DNA sequencing and computational analysis of sequences: Sequencing was carried out using an Applied Biosystems 3730xl DNA Analyser. The identity of PCR products was confirmed by using BLASTN or BLASTX analysis (www.ncbi.nlm.nih.gov/BLAST). Minicircle contigs wereassembledandeditedusingChromasPro(Technelysium Pty Ltd, Australia). Sequenceshave been deposited in the EMBL database under the following accession numbers:

HG515015-HG515025,HG515027andHG515028.

MultiplealignmentsweregeneratedusingClustalX2(Larkin et al. 2007) and adjusted manually. Artemis and Artemis ComparisonTool(ACT) wereusedforwholegenomeanaly- sesofminicirclesequences(BerrimanandRutherford2003;

Carveretal.2005).Fortheseanalysesminicirclesequences were concatenated as linear DNA sequences. The circular sequenceswerelinearisedbybreakingimmediately5 ofthe codingregions.ACTwasusedtovisualiseregionsofidentity betweenspecies.Regionsofidentityweredeterminedbypair- wiseBLAST.TheoutputofthepairwiseBLASTwasthenused asaninputintoACT.SearchesfortRNAswerecarriedoutusing tRNAscan-SE1.21(Schattneretal.2005).Phylogeneticanal- ysisofsequenceswascarriedoutwithSplitstree(Husonand Bryant2006).Forphylogeneticanalysesofproteinsequences regionsincluding gapswere excluded.Distances usedwere simpleproteinsequencedistances(uncorrected(P)distances) andnetworksweregeneratedusingNeighborNet(Husonand Bryant2006).

Acknowledgements

TheauthorswishtothankProfessorAngelaDoug- lasforthegiftoftheDNAextractusedinthisstudy.

Theauthors alsowishtothank DanielJ.Thornhill andToddJ.LaJeunesseforprovidingassistancein phylogeneticassignmentoftheSymbiodiniumiso- late.Thisworkwasfundedthroughagrantfromthe Cambridge-KAUSTAcademicExcellenceAlliance (AEA) program (award number 7000000056) as wellasbytheLeverhulmeTrust.

AppendixA. SupplementaryData

Supplementary material related to this arti- cle can be found, in the online version, at http://dx.doi.org/10.1016/j.protis.2013.09.006.

Referenzen

ÄHNLICHE DOKUMENTE

But it does imply that the focus of the surveying profession is changing from being very much related to doing measurements to now being increasingly related to management of

ACGCGGGGACACTTCGAATTAGAACTCTAATAATGTATTTCGTAACTCCCGGTCTCATTTTTACGGTAATAGCACTTTGGTTGTTGTGGT

Approaches using relational database systems (DBSs) were already proposed to facilitate data integration and enable declarative access to genome data for analysis applications [15,

In conclusion, this project allowed to identify several candidate genes using microdissected tissues to not only provide global information on gene expression

In this case, one primer (penps-.. Figure 2: MEME alignment for motif #6 for the sequence u65082, as an example of alignments with taxonomic patterns. In green rectangles, the

1 College of life science / Shaanxi Key Laboratory of Ecological Restoration in Northern Shaanxi Mining Area/, Yulin University, Yulin, China,.. 2 School of ecology and environment

The aim of this work was to create a contribution to the comparative physical gene map, especially to the cytogenetic gene map of the horse by mapping candidate genes

The preceding reasoning leads me to conclude that for i = 1,..., n , the w i ′ income is the lowest possible level of income which guarantees, first, that the relative