• Keine Ergebnisse gefunden

Coprinopsis cinerea (Coprinus cinereus) has multiple hydrophobin genes

3.4.2 Phylogenetic analysis of hydrophobin genes of C. cinerea

Hydrophobin genes of C. cinerea are distributed on contigs of the genome of monokaryon Okayama 7 in one cluster of 7 genes (including the already known genes coH1 and coH2), one cluster of 6 genes, 2 clusters of 3 genes and 4 clusters of 2 genes (including the cluster of coH8 and coH9). In seven contigs, single genes were found (Fig.

2). Those genes clustering closely together are in most cases also more closely related in protein sequence (Fig. 3), suggesting these genes were generated from each other by duplication events. The genes have between 1 to 5 introns (Fig. 3 and see also Fig. 2 in

34

Figure 2. Localization of hydrophobin genes on different contigs in the genome of C. cinerea strain Okayama 7. The distance between the neighboring genes were given in between the genes and the start and stop positions of the genes on the top and below of the broad arrows representing the genes, respectively.

12kb 274246 16kb 1kb 2kb 2kb 1kb

274696

16kb 23kb 3kb 1kb 55kb

103115 81kb coH 17 29kb coH 16

CONTIG 1.108

12kb 274246 16kb 1kb 2kb 2kb 1kb

274696

16kb 23kb 3kb 1kb 55kb

103115

*20*40*60*80*100*120*140* CoH1:......MQFKFLSTV.ALATLAVA........................APAPTDPT.PIPPS.QCNTGP.IQCCNTVTQAS..NPVAG.LLLGLLGIVLQDLNVLVGLTCSPIS.IIGLPG.NSCN..AQ.PVCCQNNN...FNGLIAIGCTPININL:113 CoH2:......MQFKFLTTV.ALATLAVA..........................VPTD...P.PPTNQCNAPNNLECCNSVQAPT..NSGLIGTLLGLLNISVGDITGLVGLTCNPIS.LIG.GG.NSCN..AQ.TVCCQNNH...FGGLISIGCTPIIIDV:110 CoH3:......MQFKFLSTL.ALASLAVA.........................APTGGDPAPIPPS.QCNTGP.IQCCNAVTKAS..DPAAG.VLLGLLGIVLQDLNVLVGLTCSPIS.IIGLPG.NSCN..AQ.PVCCQNNS...FNGLIAIGCTPININL:113 CoH4:......MQFKALISL.ALAAVSAA..........................VPNG.......GGQCNTGP.IQCCESVQRAD..SVAAS.TLLALLGVVVQDLSIPIGITCSPIS.VIGLPG.NSCS..SQ.PVCCEDNS...FKGVIAIGCTPVNINV:106 CoH5:......MQFKVLSTL.ALLAATAAA..........................GP..LEARQ..GQCNTGP.VQCCNSISTAK..DPATS.LLLGLLGIVVQDLNIPIGLTCSPIS.VIGLPG.NSCS..TQ.PVCCEDNS...FNGVVAIGCTPININV:109 CoH6:......MQFKALVAL.TLATVAIA..........................APSN.LEARQ..GQCNTGP.VQCCNSVQRAD..SEAAS.KLLGLLGIVVQDVSIPIGITCTPIT.VIGLPG.NSCS..TQ.PVCCKDNS...YKGVVAIGCTPININV:110 CoH7:......MQFKTLSTL.ALLAATAAA..........................GP..LEARQSGGQCNTGP.VQCCNSISSA...KDPSTALLLGLV...VQDVNIPIGIICSPIT.IIGLPG.NSCS..TQ.PVCCQNNN...FNGIVAIGCTPINVNA:108 CoH8:MKPFAFFVALFVLSS.SLFMHVTAIPRVHRGPNAARLARGLGPLPPTRRSPTLAPRPSS.LPQQCPAGKRVQCCEEMTTAG..NP.SLALILRLYNLNLPPQT.PVGKECTA.....GSNQG.QCTGGAQLKLCCDDIAGLPVNGRVAVSCTAFPS..:144 CoH9:MKATTFFVTLF..TL..LFTLVLA..VPQGGPNAQRLARGLPPLPPVRRSPTFGPAPSPVPTNQCSAGAKVQCCYEVTTAR..NPLVSILLEAL.GIDIPADT.PIGKGCKA.....GSDRT.QCFLPGMKKLCC.NATG.PAFGLFAVGCTPGSN..:138 CoH10:...MFAKTLFALTSISAIFVSVAA..........................IPSG......APT.CATGP.IQCCERVYESQ..TTETS.LLTDLLGLNLDGLLGGIATGCSPLS.VVGIGGGNKCA..HR.PVCCTDNK...FNGLVNVGCVPVNVNL:111 CoH11:...MFNKSVLALTVA.TLA.TLVAA.......................NPVANAQ.......GSCNTGD.IQCCQKLYPSQ..SSEAS.ILGSLLGIDLGSVLGDIGSGCSPVTALLGIGGGTKCT..AA.PVCCSHNT...FNGLINIGCTPINIGL:113 CoH12:...MLNKTLFTLTTA.LLA.ITVSVTA.....................NPIPNS.......EGVCNTGP.VQCCETRFSAQ..SREAN.LLTSLLGLDLGGILGDIASGCSPLS.VVGVGGGTRCS..SA.PVCCTDNK...FNGLINVGCVPVTVGL:114 CoH13:...MFKKAILALTAA.AVA.VSVTA.......................NPLPSTQ.......GTCNTGH.IQCCESRLPP...SPTPG.LLGSLLGINLGNLLGDIGSGCSPIS.VIGIGGGNKCT..AA.PVCCTDNK...FNGLINLGCTPINIGL:111 CoH14:...MFARFNAGLLAV.ALALPAV.VSA............................TPVARTENACNTGS.LHCCESTFSSN..HPSVST.LAGLFG.FVGNLGNSIGISCSALN.IGSLGGAPNCN..QQ.TVCCTGNQ...YNGLIAFGCTPFNFGF:113 CoH15:...MRVFLNLAYGPI.AFLLLALA..................................YMNFTQCSTGD.VQCCNLLTFSN..DSVLGPLLN.LASSTVTGVVELIGIQCTPIN.ILGLTQTAQCQ..SQ.PLCCSNKNKGIL.GVMAVGCVPINISV:111 CoH16:...MRLTSAIF..AA.SLAVSALA........................APAPGLGIANFQTTY.CGNGGQTVCCNKLEQVTSLDTGVGKIL.GLLNVSLSQVTGLVGLQCTSINA.LGLGGAVSCT..QQ.TACCSNNS...YNGVVAIGCVPINISI:119 CoH17:...MFARASTLLTAA.LLASTALA..........................APS....AVYDYSQCNGGE.IQCCNKAQSTKALEWTTTRLI.GLLGLDLKGITGLVGTECTAIN.VAGVGGGSSCT..QQ.KVCCTNNS...FNGVIALGCTPINVGV:115 CoH18:...MRVSALF.VSAA..LFAGALA........................APMPGG..TE.YEFEQCNGGE.IQCCNSTKSVQSLEWTTKSLL.GLLGIDLKQITGLVGTECTSLN.VLGIGGGSKCT..QQ.KVCCNNNS...FNGVIALGCSPINIGL:116 CoH19:MKLAFIAATLL.SFF...FALVAA..VPAGGPNAERLARGLPPLPPVRRHATPAHRE.....SQCNGGT.IKCCNSVASS...NDAVPKLLSSILNLGLGLNT.IVGMQCTNLNA.LGVGGGSSCT..GQ.TVCCSGND...FNGVITAGCTPISIGA:135 CoH20:...MMARFTSALFAF.ALFAAAVA.............................VPTPQDVEYVQCNGGQ.VQCCNDVKETNQLDAPYNQLLS.VFDVDVKQLTGKVGLTCNTVN.VLGIG.SNSCD..AQ.TVCCTDNS...FDGIIALGCTPININL:115 CoH21:...MFARLTSTLFAL.AAVSAVFA..........................APG......ATTEQCNGGE.VQCCNSVQDANNLDSSVKKIITGLLHLDLKQITGQVGVTCTSVN.VLGIGGGSSCT..QQ.KVCCTNNS...FHGLIALGCTPINVSV:114 CoH22:...MFARLTTALVAF.TLVSAVVA........................NPAPT....E.IEYEQCEGGT.VQCCASYQKATDLNAEWTKWL.GFLNINARQVDANVGFTCTGVKA.GGIGGAASCT..QQ.KLCCTNSN...FNGVVAIGCTPILSAL:116 CoH23:...MFSKVITTISLC.ALFLGVSA..........................APS.......DSNQCNGGQ.VQCCNKVQDSKSLDAGVKGLL.GVLNIDLSQLTGQVGVTCTAVN.VVGVGGGSHCS..NQ.AVCCNNNN...FNGVVALGCTPINVSV:112 CoH24:...MFARLSSALVAF.TLAASAMA..........................APTS..G..V.VAQCNGGV.VQCCNEMQSSTSLEAPIAGLL.SLLGIDLSGLTGQVGLTCTDVT.VIGVGGGSSCN..NQ.QVCCNNNN...FNGVLALGCTPINVSI:114 CoH25:...MFARLSSVIVVC.TLAASAMA..........................APSSQTGD..AIA.CGNGGTLQCCNTVESSNNLSGALAGLL.TLLGVDISKLTGQVGASCTGIN.VIGVGGGTSCS..NQ.PVCCTGNN...FSGVVAIGCTPINISL:117 CoH26:...MFARLSAAFVAF.TLATAVIA..........................APGGRPSE.VEYEQCNGGT.VQCCNSYQKADSIDHSASKLL.NLLNIDVKQVAAGLGLSCTGVN.VVGIGGGSSCT..QQ.KVCCNNN.....NGVVALGCTPINASL:116 CoH27:...MFARLSTALLAF.TLATAVVA..........................APGGRPSE.VEYEQCNGGE.IQCCNSYQKADSLDYNTSKLL.GLLNIDVKQITAGVGLTCTGINA.VGIGGGSSCT..QQ.KVCCTNNK...FNGVVALGCSPINVSL:118 CoH28:...MFARLTSALLAF.TLVSAVVA............................GG.HKN.VDYQQCNGGK.IQCCNSVQDSKSLDQSLTKLL.GLLKVDVKQITGQVGTGCTSLN.VLGVGGGSSCT..QQ.KVCCTNNS...FNGVVALGCSPINVSV:115 CoH29:...MFARLTSALLAF.TLVSAVVA............................GG.HKN.VDYQQCNGGE.IQCCNSVQDSKSLDNSLTKLL.GLLKVDVKQITGQVGTGCTSLN.VLGVGGGSSCT..QQ.KVCCTNNS...FNGVVALGCSPINVSA:115 CoH30:...MFARLTSALLAF.TLVSAVVA............................GG.HKN.VDYQQCNGGE.IQCCNSVQDSKSLDQSLTKLL.GLLKVDVKQITGQVGMGCTSLN.VLGVGGGSSCT..QQ.KVCCTNNS...FNGVVALGCSPINVSV:115 CoH31:...MFARLTSALLAF.TLVSAVVA............................GG.QKN.VDYQQCNGGE.IQCCNSVQDSKSLDQSLTKLL.GLLKVDVKQITGQVGMGCTSLN.VLGVGGGSSCT..QQ.KVCCNNNS...FNGVVALGCSPINVSA:115 CoH32:...MFARLTSALLAF.TLVSAVVA............................GG.HKN.VDYQQCNGGE.IQCCNSVQDSKSLDQSLTKLL.GLLKVDVKQITGQVGMGCTSLN.VLGVGGGSSCT..QQ.KVCCSNNS...FNGVVALGCSPINVSA:115 CoH33:...MFARLSTALLAL.TLVTSAIA............................GSHHKSRVEAEQCNGGE.IQCCYGVQDSKSLDNEVTKLL.GHLKIDAKQVTGQVGVGCTALNA.LGAGGGSSCT..EQ.KVCCTNNS...FNGVVALGCSPANVSA:117 CoH34:...MFARLTSALLAF.TLVSAVVA............................GG.PKS.VEYEQCNGGE.IQCCNSVQQSNSLDSSLTKLL.GLLKVDIKQITGQVGVGCTAVN.VLGVGGGSSCT..QQ.KVCCSNNS...FNGVVALGCSPINVSA:115

Signal peptide 5/344/344/3422/3429/343/34

*20*40*60*80*100*120*140* CoH1:......MQFKFLSTV.ALATLAVA........................APAPTDPT.PIPPS.QCNTGP.IQCCNTVTQAS..NPVAG.LLLGLLGIVLQDLNVLVGLTCSPIS.IIGLPG.NSCN..AQ.PVCCQNNN...FNGLIAIGCTPININL:113 CoH2:......MQFKFLTTV.ALATLAVA..........................VPTD...P.PPTNQCNAPNNLECCNSVQAPT..NSGLIGTLLGLLNISVGDITGLVGLTCNPIS.LIG.GG.NSCN..AQ.TVCCQNNH...FGGLISIGCTPIIIDV:110 CoH3:......MQFKFLSTL.ALASLAVA.........................APTGGDPAPIPPS.QCNTGP.IQCCNAVTKAS..DPAAG.VLLGLLGIVLQDLNVLVGLTCSPIS.IIGLPG.NSCN..AQ.PVCCQNNS...FNGLIAIGCTPININL:113 CoH4:......MQFKALISL.ALAAVSAA..........................VPNG.......GGQCNTGP.IQCCESVQRAD..SVAAS.TLLALLGVVVQDLSIPIGITCSPIS.VIGLPG.NSCS..SQ.PVCCEDNS...FKGVIAIGCTPVNINV:106 CoH5:......MQFKVLSTL.ALLAATAAA..........................GP..LEARQ..GQCNTGP.VQCCNSISTAK..DPATS.LLLGLLGIVVQDLNIPIGLTCSPIS.VIGLPG.NSCS..TQ.PVCCEDNS...FNGVVAIGCTPININV:109 CoH6:......MQFKALVAL.TLATVAIA..........................APSN.LEARQ..GQCNTGP.VQCCNSVQRAD..SEAAS.KLLGLLGIVVQDVSIPIGITCTPIT.VIGLPG.NSCS..TQ.PVCCKDNS...YKGVVAIGCTPININV:110 CoH7:......MQFKTLSTL.ALLAATAAA..........................GP..LEARQSGGQCNTGP.VQCCNSISSA...KDPSTALLLGLV...VQDVNIPIGIICSPIT.IIGLPG.NSCS..TQ.PVCCQNNN...FNGIVAIGCTPINVNA:108 CoH8:MKPFAFFVALFVLSS.SLFMHVTAIPRVHRGPNAARLARGLGPLPPTRRSPTLAPRPSS.LPQQCPAGKRVQCCEEMTTAG..NP.SLALILRLYNLNLPPQT.PVGKECTA.....GSNQG.QCTGGAQLKLCCDDIAGLPVNGRVAVSCTAFPS..:144 CoH9:MKATTFFVTLF..TL..LFTLVLA..VPQGGPNAQRLARGLPPLPPVRRSPTFGPAPSPVPTNQCSAGAKVQCCYEVTTAR..NPLVSILLEAL.GIDIPADT.PIGKGCKA.....GSDRT.QCFLPGMKKLCC.NATG.PAFGLFAVGCTPGSN..:138 CoH10:...MFAKTLFALTSISAIFVSVAA..........................IPSG......APT.CATGP.IQCCERVYESQ..TTETS.LLTDLLGLNLDGLLGGIATGCSPLS.VVGIGGGNKCA..HR.PVCCTDNK...FNGLVNVGCVPVNVNL:111 CoH11:...MFNKSVLALTVA.TLA.TLVAA.......................NPVANAQ.......GSCNTGD.IQCCQKLYPSQ..SSEAS.ILGSLLGIDLGSVLGDIGSGCSPVTALLGIGGGTKCT..AA.PVCCSHNT...FNGLINIGCTPINIGL:113 CoH12:...MLNKTLFTLTTA.LLA.ITVSVTA.....................NPIPNS.......EGVCNTGP.VQCCETRFSAQ..SREAN.LLTSLLGLDLGGILGDIASGCSPLS.VVGVGGGTRCS..SA.PVCCTDNK...FNGLINVGCVPVTVGL:114 CoH13:...MFKKAILALTAA.AVA.VSVTA.......................NPLPSTQ.......GTCNTGH.IQCCESRLPP...SPTPG.LLGSLLGINLGNLLGDIGSGCSPIS.VIGIGGGNKCT..AA.PVCCTDNK...FNGLINLGCTPINIGL:111 CoH14:...MFARFNAGLLAV.ALALPAV.VSA............................TPVARTENACNTGS.LHCCESTFSSN..HPSVST.LAGLFG.FVGNLGNSIGISCSALN.IGSLGGAPNCN..QQ.TVCCTGNQ...YNGLIAFGCTPFNFGF:113 CoH15:...MRVFLNLAYGPI.AFLLLALA..................................YMNFTQCSTGD.VQCCNLLTFSN..DSVLGPLLN.LASSTVTGVVELIGIQCTPIN.ILGLTQTAQCQ..SQ.PLCCSNKNKGIL.GVMAVGCVPINISV:111 CoH16:...MRLTSAIF..AA.SLAVSALA........................APAPGLGIANFQTTY.CGNGGQTVCCNKLEQVTSLDTGVGKIL.GLLNVSLSQVTGLVGLQCTSINA.LGLGGAVSCT..QQ.TACCSNNS...YNGVVAIGCVPINISI:119 CoH17:...MFARASTLLTAA.LLASTALA..........................APS....AVYDYSQCNGGE.IQCCNKAQSTKALEWTTTRLI.GLLGLDLKGITGLVGTECTAIN.VAGVGGGSSCT..QQ.KVCCTNNS...FNGVIALGCTPINVGV:115 CoH18:...MRVSALF.VSAA..LFAGALA........................APMPGG..TE.YEFEQCNGGE.IQCCNSTKSVQSLEWTTKSLL.GLLGIDLKQITGLVGTECTSLN.VLGIGGGSKCT..QQ.KVCCNNNS...FNGVIALGCSPINIGL:116 CoH19:MKLAFIAATLL.SFF...FALVAA..VPAGGPNAERLARGLPPLPPVRRHATPAHRE.....SQCNGGT.IKCCNSVASS...NDAVPKLLSSILNLGLGLNT.IVGMQCTNLNA.LGVGGGSSCT..GQ.TVCCSGND...FNGVITAGCTPISIGA:135 CoH20:...MMARFTSALFAF.ALFAAAVA.............................VPTPQDVEYVQCNGGQ.VQCCNDVKETNQLDAPYNQLLS.VFDVDVKQLTGKVGLTCNTVN.VLGIG.SNSCD..AQ.TVCCTDNS...FDGIIALGCTPININL:115 CoH21:...MFARLTSTLFAL.AAVSAVFA..........................APG......ATTEQCNGGE.VQCCNSVQDANNLDSSVKKIITGLLHLDLKQITGQVGVTCTSVN.VLGIGGGSSCT..QQ.KVCCTNNS...FHGLIALGCTPINVSV:114 CoH22:...MFARLTTALVAF.TLVSAVVA........................NPAPT....E.IEYEQCEGGT.VQCCASYQKATDLNAEWTKWL.GFLNINARQVDANVGFTCTGVKA.GGIGGAASCT..QQ.KLCCTNSN...FNGVVAIGCTPILSAL:116 CoH23:...MFSKVITTISLC.ALFLGVSA..........................APS.......DSNQCNGGQ.VQCCNKVQDSKSLDAGVKGLL.GVLNIDLSQLTGQVGVTCTAVN.VVGVGGGSHCS..NQ.AVCCNNNN...FNGVVALGCTPINVSV:112 CoH24:...MFARLSSALVAF.TLAASAMA..........................APTS..G..V.VAQCNGGV.VQCCNEMQSSTSLEAPIAGLL.SLLGIDLSGLTGQVGLTCTDVT.VIGVGGGSSCN..NQ.QVCCNNNN...FNGVLALGCTPINVSI:114 CoH25:...MFARLSSVIVVC.TLAASAMA..........................APSSQTGD..AIA.CGNGGTLQCCNTVESSNNLSGALAGLL.TLLGVDISKLTGQVGASCTGIN.VIGVGGGTSCS..NQ.PVCCTGNN...FSGVVAIGCTPINISL:117 CoH26:...MFARLSAAFVAF.TLATAVIA..........................APGGRPSE.VEYEQCNGGT.VQCCNSYQKADSIDHSASKLL.NLLNIDVKQVAAGLGLSCTGVN.VVGIGGGSSCT..QQ.KVCCNNN.....NGVVALGCTPINASL:116 CoH27:...MFARLSTALLAF.TLATAVVA..........................APGGRPSE.VEYEQCNGGE.IQCCNSYQKADSLDYNTSKLL.GLLNIDVKQITAGVGLTCTGINA.VGIGGGSSCT..QQ.KVCCTNNK...FNGVVALGCSPINVSL:118 CoH28:...MFARLTSALLAF.TLVSAVVA............................GG.HKN.VDYQQCNGGK.IQCCNSVQDSKSLDQSLTKLL.GLLKVDVKQITGQVGTGCTSLN.VLGVGGGSSCT..QQ.KVCCTNNS...FNGVVALGCSPINVSV:115 CoH29:...MFARLTSALLAF.TLVSAVVA............................GG.HKN.VDYQQCNGGE.IQCCNSVQDSKSLDNSLTKLL.GLLKVDVKQITGQVGTGCTSLN.VLGVGGGSSCT..QQ.KVCCTNNS...FNGVVALGCSPINVSA:115 CoH30:...MFARLTSALLAF.TLVSAVVA............................GG.HKN.VDYQQCNGGE.IQCCNSVQDSKSLDQSLTKLL.GLLKVDVKQITGQVGMGCTSLN.VLGVGGGSSCT..QQ.KVCCTNNS...FNGVVALGCSPINVSV:115 CoH31:...MFARLTSALLAF.TLVSAVVA............................GG.QKN.VDYQQCNGGE.IQCCNSVQDSKSLDQSLTKLL.GLLKVDVKQITGQVGMGCTSLN.VLGVGGGSSCT..QQ.KVCCNNNS...FNGVVALGCSPINVSA:115 CoH32:...MFARLTSALLAF.TLVSAVVA............................GG.HKN.VDYQQCNGGE.IQCCNSVQDSKSLDQSLTKLL.GLLKVDVKQITGQVGMGCTSLN.VLGVGGGSSCT..QQ.KVCCSNNS...FNGVVALGCSPINVSA:115 CoH33:...MFARLSTALLAL.TLVTSAIA............................GSHHKSRVEAEQCNGGE.IQCCYGVQDSKSLDNEVTKLL.GHLKIDAKQVTGQVGVGCTALNA.LGAGGGSSCT..EQ.KVCCTNNS...FNGVVALGCSPANVSA:117 CoH34:...MFARLTSALLAF.TLVSAVVA............................GG.PKS.VEYEQCNGGE.IQCCNSVQQSNSLDSSLTKLL.GLLKVDIKQITGQVGVGCTAVN.VLGVGGGSSCT..QQ.KVCCSNNS...FNGVVALGCSPINVSA:115

Signal peptide *20*40*60*80*100*120*140* CoH1:......MQFKFLSTV.ALATLAVA........................APAPTDPT.PIPPS.QCNTGP.IQCCNTVTQAS..NPVAG.LLLGLLGIVLQDLNVLVGLTCSPIS.IIGLPG.NSCN..AQ.PVCCQNNN...FNGLIAIGCTPININL:113 CoH2:......MQFKFLTTV.ALATLAVA..........................VPTD...P.PPTNQCNAPNNLECCNSVQAPT..NSGLIGTLLGLLNISVGDITGLVGLTCNPIS.LIG.GG.NSCN..AQ.TVCCQNNH...FGGLISIGCTPIIIDV:110 CoH3:......MQFKFLSTL.ALASLAVA.........................APTGGDPAPIPPS.QCNTGP.IQCCNAVTKAS..DPAAG.VLLGLLGIVLQDLNVLVGLTCSPIS.IIGLPG.NSCN..AQ.PVCCQNNS...FNGLIAIGCTPININL:113 CoH4:......MQFKALISL.ALAAVSAA..........................VPNG.......GGQCNTGP.IQCCESVQRAD..SVAAS.TLLALLGVVVQDLSIPIGITCSPIS.VIGLPG.NSCS..SQ.PVCCEDNS...FKGVIAIGCTPVNINV:106 CoH5:......MQFKVLSTL.ALLAATAAA..........................GP..LEARQ..GQCNTGP.VQCCNSISTAK..DPATS.LLLGLLGIVVQDLNIPIGLTCSPIS.VIGLPG.NSCS..TQ.PVCCEDNS...FNGVVAIGCTPININV:109 CoH6:......MQFKALVAL.TLATVAIA..........................APSN.LEARQ..GQCNTGP.VQCCNSVQRAD..SEAAS.KLLGLLGIVVQDVSIPIGITCTPIT.VIGLPG.NSCS..TQ.PVCCKDNS...YKGVVAIGCTPININV:110 CoH7:......MQFKTLSTL.ALLAATAAA..........................GP..LEARQSGGQCNTGP.VQCCNSISSA...KDPSTALLLGLV...VQDVNIPIGIICSPIT.IIGLPG.NSCS..TQ.PVCCQNNN...FNGIVAIGCTPINVNA:108 CoH8:MKPFAFFVALFVLSS.SLFMHVTAIPRVHRGPNAARLARGLGPLPPTRRSPTLAPRPSS.LPQQCPAGKRVQCCEEMTTAG..NP.SLALILRLYNLNLPPQT.PVGKECTA.....GSNQG.QCTGGAQLKLCCDDIAGLPVNGRVAVSCTAFPS..:144 CoH9:MKATTFFVTLF..TL..LFTLVLA..VPQGGPNAQRLARGLPPLPPVRRSPTFGPAPSPVPTNQCSAGAKVQCCYEVTTAR..NPLVSILLEAL.GIDIPADT.PIGKGCKA.....GSDRT.QCFLPGMKKLCC.NATG.PAFGLFAVGCTPGSN..:138 CoH10:...MFAKTLFALTSISAIFVSVAA..........................IPSG......APT.CATGP.IQCCERVYESQ..TTETS.LLTDLLGLNLDGLLGGIATGCSPLS.VVGIGGGNKCA..HR.PVCCTDNK...FNGLVNVGCVPVNVNL:111 CoH11:...MFNKSVLALTVA.TLA.TLVAA.......................NPVANAQ.......GSCNTGD.IQCCQKLYPSQ..SSEAS.ILGSLLGIDLGSVLGDIGSGCSPVTALLGIGGGTKCT..AA.PVCCSHNT...FNGLINIGCTPINIGL:113 CoH12:...MLNKTLFTLTTA.LLA.ITVSVTA.....................NPIPNS.......EGVCNTGP.VQCCETRFSAQ..SREAN.LLTSLLGLDLGGILGDIASGCSPLS.VVGVGGGTRCS..SA.PVCCTDNK...FNGLINVGCVPVTVGL:114 CoH13:...MFKKAILALTAA.AVA.VSVTA.......................NPLPSTQ.......GTCNTGH.IQCCESRLPP...SPTPG.LLGSLLGINLGNLLGDIGSGCSPIS.VIGIGGGNKCT..AA.PVCCTDNK...FNGLINLGCTPINIGL:111 CoH14:...MFARFNAGLLAV.ALALPAV.VSA............................TPVARTENACNTGS.LHCCESTFSSN..HPSVST.LAGLFG.FVGNLGNSIGISCSALN.IGSLGGAPNCN..QQ.TVCCTGNQ...YNGLIAFGCTPFNFGF:113 CoH15:...MRVFLNLAYGPI.AFLLLALA..................................YMNFTQCSTGD.VQCCNLLTFSN..DSVLGPLLN.LASSTVTGVVELIGIQCTPIN.ILGLTQTAQCQ..SQ.PLCCSNKNKGIL.GVMAVGCVPINISV:111 CoH16:...MRLTSAIF..AA.SLAVSALA........................APAPGLGIANFQTTY.CGNGGQTVCCNKLEQVTSLDTGVGKIL.GLLNVSLSQVTGLVGLQCTSINA.LGLGGAVSCT..QQ.TACCSNNS...YNGVVAIGCVPINISI:119 CoH17:...MFARASTLLTAA.LLASTALA..........................APS....AVYDYSQCNGGE.IQCCNKAQSTKALEWTTTRLI.GLLGLDLKGITGLVGTECTAIN.VAGVGGGSSCT..QQ.KVCCTNNS...FNGVIALGCTPINVGV:115 CoH18:...MRVSALF.VSAA..LFAGALA........................APMPGG..TE.YEFEQCNGGE.IQCCNSTKSVQSLEWTTKSLL.GLLGIDLKQITGLVGTECTSLN.VLGIGGGSKCT..QQ.KVCCNNNS...FNGVIALGCSPINIGL:116 CoH19:MKLAFIAATLL.SFF...FALVAA..VPAGGPNAERLARGLPPLPPVRRHATPAHRE.....SQCNGGT.IKCCNSVASS...NDAVPKLLSSILNLGLGLNT.IVGMQCTNLNA.LGVGGGSSCT..GQ.TVCCSGND...FNGVITAGCTPISIGA:135 CoH20:...MMARFTSALFAF.ALFAAAVA.............................VPTPQDVEYVQCNGGQ.VQCCNDVKETNQLDAPYNQLLS.VFDVDVKQLTGKVGLTCNTVN.VLGIG.SNSCD..AQ.TVCCTDNS...FDGIIALGCTPININL:115 CoH21:...MFARLTSTLFAL.AAVSAVFA..........................APG......ATTEQCNGGE.VQCCNSVQDANNLDSSVKKIITGLLHLDLKQITGQVGVTCTSVN.VLGIGGGSSCT..QQ.KVCCTNNS...FHGLIALGCTPINVSV:114 CoH22:...MFARLTTALVAF.TLVSAVVA........................NPAPT....E.IEYEQCEGGT.VQCCASYQKATDLNAEWTKWL.GFLNINARQVDANVGFTCTGVKA.GGIGGAASCT..QQ.KLCCTNSN...FNGVVAIGCTPILSAL:116 CoH23:...MFSKVITTISLC.ALFLGVSA..........................APS.......DSNQCNGGQ.VQCCNKVQDSKSLDAGVKGLL.GVLNIDLSQLTGQVGVTCTAVN.VVGVGGGSHCS..NQ.AVCCNNNN...FNGVVALGCTPINVSV:112 CoH24:...MFARLSSALVAF.TLAASAMA..........................APTS..G..V.VAQCNGGV.VQCCNEMQSSTSLEAPIAGLL.SLLGIDLSGLTGQVGLTCTDVT.VIGVGGGSSCN..NQ.QVCCNNNN...FNGVLALGCTPINVSI:114 CoH25:...MFARLSSVIVVC.TLAASAMA..........................APSSQTGD..AIA.CGNGGTLQCCNTVESSNNLSGALAGLL.TLLGVDISKLTGQVGASCTGIN.VIGVGGGTSCS..NQ.PVCCTGNN...FSGVVAIGCTPINISL:117 CoH26:...MFARLSAAFVAF.TLATAVIA..........................APGGRPSE.VEYEQCNGGT.VQCCNSYQKADSIDHSASKLL.NLLNIDVKQVAAGLGLSCTGVN.VVGIGGGSSCT..QQ.KVCCNNN.....NGVVALGCTPINASL:116 CoH27:...MFARLSTALLAF.TLATAVVA..........................APGGRPSE.VEYEQCNGGE.IQCCNSYQKADSLDYNTSKLL.GLLNIDVKQITAGVGLTCTGINA.VGIGGGSSCT..QQ.KVCCTNNK...FNGVVALGCSPINVSL:118 CoH28:...MFARLTSALLAF.TLVSAVVA............................GG.HKN.VDYQQCNGGK.IQCCNSVQDSKSLDQSLTKLL.GLLKVDVKQITGQVGTGCTSLN.VLGVGGGSSCT..QQ.KVCCTNNS...FNGVVALGCSPINVSV:115 CoH29:...MFARLTSALLAF.TLVSAVVA............................GG.HKN.VDYQQCNGGE.IQCCNSVQDSKSLDNSLTKLL.GLLKVDVKQITGQVGTGCTSLN.VLGVGGGSSCT..QQ.KVCCTNNS...FNGVVALGCSPINVSA:115 CoH30:...MFARLTSALLAF.TLVSAVVA............................GG.HKN.VDYQQCNGGE.IQCCNSVQDSKSLDQSLTKLL.GLLKVDVKQITGQVGMGCTSLN.VLGVGGGSSCT..QQ.KVCCTNNS...FNGVVALGCSPINVSV:115 CoH31:...MFARLTSALLAF.TLVSAVVA............................GG.QKN.VDYQQCNGGE.IQCCNSVQDSKSLDQSLTKLL.GLLKVDVKQITGQVGMGCTSLN.VLGVGGGSSCT..QQ.KVCCNNNS...FNGVVALGCSPINVSA:115 CoH32:...MFARLTSALLAF.TLVSAVVA............................GG.HKN.VDYQQCNGGE.IQCCNSVQDSKSLDQSLTKLL.GLLKVDVKQITGQVGMGCTSLN.VLGVGGGSSCT..QQ.KVCCSNNS...FNGVVALGCSPINVSA:115 CoH33:...MFARLSTALLAL.TLVTSAIA............................GSHHKSRVEAEQCNGGE.IQCCYGVQDSKSLDNEVTKLL.GHLKIDAKQVTGQVGVGCTALNA.LGAGGGSSCT..EQ.KVCCTNNS...FNGVVALGCSPANVSA:117 CoH34:...MFARLTSALLAF.TLVSAVVA............................GG.PKS.VEYEQCNGGE.IQCCNSVQQSNSLDSSLTKLL.GLLKVDIKQITGQVGVGCTAVN.VLGVGGGSSCT..QQ.KVCCSNNS...FNGVVALGCSPINVSA:115

Signal peptide 5/344/344/3422/3429/343/34 Figure 3. Alignment of the deduced hydrophobin sequences from the C. cinerea genome. (http://www.broad.mit.edu /annotation/ fungi/coprinus_cinereus/) Filled triangles indicate the eight cysteine residues at conserved positions, empty triangles show the intron positions in the corresponding genes and the numbers underneath the triangles indicate the number of genes that have an intron at the corresponding positions. The vertical arrow indicates the position of a frameshift in gene CoH7. The alignment was done by using Clustal X (http://www-igbmc.u-strasbg.fr/BioInfo/ClustalX/Top.html) and the manual adjustments of the alignment were done using Gene doc software (http://www.psc.edu/biomed/genedoc/).

F indu ced

Figure 4. Phylogenetic analysis of all known hydrophobins from basidiomycetes, including the deduced hydrophobins from C. cinerea (CoH1-CoH34, this study). Clustal alignment was done using program Clustal X and the tree was constructed using the program MEGA version 2.1. Amino acid residues in front of the first cysteine were omitted in all proteins in order to obtain for sequence comparison the hydrophobin core regions as the N-terminal amino acid regions differ vary much in amino acid number and overall sequence (Wösten 2001). The dark grey color code refers to proteins deduced from C. cinerea. For source of proteins from other species see Figure 1(a). Stages of expression: C = mycelial cords, V = vegetative mycelium (unspecified); M = vegetative monokaryotic mycelium; D = vegetative dikaryotic mycelium; H = vegetative mycelium of self-compatible homokaryon; F = fruiting body; EM = ectomycorrhiza (for references on expression, see Walser et al. 2003).

All but coH7 translates into a complete hydrophobin sequence (Fig. 3). For phylogenetic analysis of hydrophobin genes, this frame shift was ignored and a complete protein assembled by deleting 2 base pairs (as indicated in Fig. 3). The deduced 34 hydrophobins contain the complete sequence of hydrophobin class I proteins with an N-terminal leader

peptide and the typical 8 cysteine residues being distributed at conserved positions in an otherwise variably conserved sequence (Fig. 3).

When comparing the C. cinerea hydrophobin sequences with those of other basidiomycetes (Fig. 4), the C. cinerea proteins group at six different places in the phylogenetic tree. Three of these clusters contain only C. cinerea proteins whilst the others contain hydrophobins from other species.