# STOCKHOLM 1.0
#=GF ID CPSF100_C
#=GF AC PF13299.1
#=GF DE Cleavage and polyadenylation factor 2 C-terminal
#=GF AU Aldam G
#=GF SE Pfam-B_2065 (release 24.0)
#=GF GA 22.30 22.30;
#=GF TC 23.40 22.50;
#=GF NC 20.80 20.60;
#=GF BM hmmbuild HMM.ann SEED.ann
#=GF SM hmmsearch -Z 23193494 -E 1000 --cpu 4 HMM pfamseq
#=GF TP Family
#=GF RN [1]
#=GF RM 19748916
#=GF RT Unique features of plant cleavage and polyadenylation
#=GF RT specificity factor revealed by proteomic studies.
#=GF RA Zhao H, Xing D, Li QQ;
#=GF RL Plant Physiol. 2009;151:1546-1556.
#=GF DR INTERPRO; IPR025069;
#=GF CC This family lies at the C-terminus of many fungal and plant
#=GF CC cleavage and polyadenylation specificity factor subunit 2
#=GF CC proteins. The exact function of the domain is not known, but is
#=GF CC likely to function as a binding domain for the protein within
#=GF CC the overall CPSF complex [1].
#=GF SQ 58
#=GS A2R7F5_ASPNC/679-859 AC A2R7F5.1
#=GS A1CK68_ASPCL/834-1008 AC A1CK68.1
#=GS B6GY49_PENCW/714-887 AC B6GY49.1
#=GS B8MZM1_ASPFN/355-526 AC B8MZM1.1
#=GS B6QRB9_PENMQ/837-1009 AC B6QRB9.1
#=GS Q5B8P8_EMENI/820-999 AC Q5B8P8.1
#=GS B6K3N6_SCHJY/646-783 AC B6K3N6.1
#=GS CFT2_SCHPO/645-794 AC O74740.1
#=GS Q5KIP3_CRYNJ/724-893 AC Q5KIP3.1
#=GS A8NYN1_COPC7/770-920 AC A8NYN1.1
#=GS B0CXU7_LACBS/743-895 AC B0CXU7.1
#=GS A8PTN4_MALGO/730-857 AC A8PTN4.1
#=GS CPSF2_DICDI/629-781 AC Q55BS1.1
#=GS B3S6C6_TRIAD/599-742 AC B3S6C6.1
#=GS B0WQG5_CULQU/605-744 AC B0WQG5.1
#=GS B3LXN9_DROAN/617-753 AC B3LXN9.1
#=GS B1H337_XENTR/608-780 AC B1H337.1
#=GS B5X4U8_SALSA/609-793 AC B5X4U8.1
#=GS B8JLG9_DANRE/1-179 AC B8JLG9.1
#=GS A7RPX1_NEMVE/593-734 AC A7RPX1.1
#=GS C3YA87_BRAFL/430-604 AC C3YA87.1
#=GS A8PAE9_BRUMA/664-828 AC A8PAE9.1
#=GS CPSF2_CAEEL/642-840 AC O17403.1
#=GS Q8WPK6_OIKDI/624-762 AC Q8WPK6.1
#=GS A4HDW4_LEIBR/671-825 AC A4HDW4.2
#=GS D0A5E4_TRYB9/642-814 AC D0A5E4.1
#=GS Q6BCB1_TRYCR/634-798 AC Q6BCB1.1
#=GS C1FDL7_MICSR/630-806 AC C1FDL7.1
#=GS A4RR19_OSTLU/575-712 AC A4RR19.1
#=GS Q01GI5_OSTTA/663-804 AC Q01GI5.1
#=GS C4Y7S0_CLAL4/779-937 AC C4Y7S0.1
#=GS A3GHD1_PICST/771-931 AC A3GHD1.2
#=GS A5DGP1_PICGU/659-818 AC A5DGP1.2
#=GS B5RTE7_DEBHA/793-956 AC B5RTE7.1
#=GS A5DXK9_LODEL/862-1064 AC A5DXK9.1
#=GS B9WDT3_CANDC/764-927 AC B9WDT3.1
#=GS C4QZW3_PICPG/709-851 AC C4QZW3.1
#=GS Q750X1_ASHGO/674-800 AC Q750X1.2
#=GS Q6FW78_CANGA/704-840 AC Q6FW78.1
#=GS A7A117_YEAS7/725-856 AC A7A117.1
#=GS C5DVE5_ZYGRC/694-832 AC C5DVE5.1
#=GS A7TEK7_VANPO/685-818 AC A7TEK7.1
#=GS C5DMY3_LACTC/685-813 AC C5DMY3.1
#=GS Q4N5Y7_THEPA/224-377 AC Q4N5Y7.1
#=GS Q4UDL0_THEAN/669-822 AC Q4UDL0.1
#=GS Q6CAZ0_YARLI/666-795 AC Q6CAZ0.1
#=GS B2VU84_PYRTR/782-948 AC B2VU84.1
#=GS Q0UZX3_PHANO/789-951 AC Q0UZX3.2
#=GS C7Z120_NECH7/795-952 AC C7Z120.1
#=GS Q7S0J8_NEUCR/807-980 AC Q7S0J8.2
#=GS B2AL57_PODAN/808-961 AC B2AL57.1
#=GS Q2GQR7_CHAGB/795-950 AC Q2GQR7.1
#=GS C9SFK8_VERA1/564-733 AC C9SFK8.1
#=GS A7F0N0_SCLS1/770-930 AC A7F0N0.1
#=GS C5FLN4_ARTOC/820-992 AC C5FLN4.1
#=GS C4JWC1_UNCRE/634-809 AC C4JWC1.1
#=GS C5PF10_COCP7/845-1020 AC C5PF10.1
#=GS A6R733_AJECN/799-971 AC A6R733.1
A2R7F5_ASPNC/679-859 VKLSNNLVRRL..KWQHV......RSLGVVTLTAQLR....GPEQAVL.........E...DSTEENPS..............KKP.KLL........EEEKKEEGGSTEVATNAPPEGAKP.S.ADKSEVYPLLDVLPV.....NMA..AGTRSM.TRPLHVGDLRLADLRKIMQG.....AG..HTAEFR.GEGTLLIDGM...........VAV..RKSAT...............GRIEIEASAQSAAAT..........NLGRGGGSFLAVKQKIYEGL
A1CK68_ASPCL/834-1008 VKLSTNLVRRL..KWQHV......RSLGVVTLTAQLR....GPELSVS.........E...EDSDESA.................S.K............KQKLLMEEASSVATSTLGDTKP.A.ADQSDVFPMLDILPA.....NMA..AGTRSM.TRPLHVGDLRLADLRKIMQA.....AG..HKAEFR.GEGTLLIDSL...........VAV..RKSAT...............GKIEIEASAQSAAANH.........GMGRAAGSFLAVKRKIYEGL
B6GY49_PENCW/714-887 
VKLSNTLVRRL..NWQHV......RSLGVVALTAQLR....GPEPA...........E...IGDVETS.................G.KK...........MKQLKDEAASSAVAPELGQADT.KIIDKVEVYPLLDTLPA.....SMA..AGTRSM.ARPLHVGDLRLADLRKLMQS.....AG..HTAEFR.GEGTLLIDKS...........VAV..RKSGT...............GKIEIEATAQSSLGR..........PSGRGLGSFLAVKKKIYEGL B8MZM1_ASPFN/355-526 VKLSNSLVRRL..KWQHV......RSLGVVTLTAQLR....GPELNP..........P...EDAADSP.................S.KK...........QKLLQEETSSPATAPTVDGTKP.T.ADKSDVYPVLDILPA.....NMA..AGTRSM.TRPLHVGDLRLADLRKIMQG.....AG..HTAEFR.GEGTLLIDRM...........VAV..RKSGT...............GKIEIEATAQSAAA............VGRGAGSFLDVKRKIYEGL B6QRB9_PENMQ/837-1009 VKLSNNLVRRL..KWQHV......RSLGVVALTAQLR....PPEIVSV.........E...DEVTES..................I.S............KKQKLIETEPDAVSTPQDGVHD.SSISKADAYPILDVLPA.....SIA..AGTRSM.ARPLHVGDLRLADLRKLMIA.....AG..YKAEFR.GEGTLLIDGM...........VAV..RKSST...............GTIEVEAAVQSSASD...........RRGQAGSFLAVKRKIYEGL Q5B8P8_EMENI/820-999 VKLSNNLVRRL..KWQHV......RTLGVVTLTGQLKAPEPVSTDED...........A..INSPNKKQ...............KLV...............EETSTPEQPTPTFQPQPTEPQQTTDKPDRYPVLDILPP.....NMA..SGTRSM.TRPLHVGDLRLADLRKIMQN.....AG..HKAEFR.GEGTLLIDGF...........VAV..RKSGT...............GKIEIEAAAYQAGPSA.........GFAQGAGSFLAVKQKIYEGL B6K3N6_SCHJY/646-783 MKLSDELVKSL..IWKKL......GNYEVAHLMAKIRM..P..........................................................................ENVDEEAEESKEPVDPKDNLPILDS.....LKTQQDFALAPRAAPIFVGNVRLAALRKTLMD.....QG..ISVELK.GEGVLLCGGI...........VAI..RKLDN...............GRIVIEGGIS.....................NRFFEIRKTIYDTL CFT2_SCHPO/645-794 LKLADDLIKNL..IWTKV......GNCEVSHMLAKVEI.SKPSE...............................................................EEDKKEEVEKKDGDKERNEEKKEEKETLPVLNA.....LTLRSDLARAPRAAPLLVGNIRLAYLRKALLD.....QG..ISAELK.GEGVLLCGGA...........VAV..RKLSG...............GKISVEGSLS.....................NRFFEIRKLVYDAL Q5KIP3_CRYNJ/724-893 
LTLGDSISSALAKKWSDF......EGYEVTFVDGKIVL.PAGSTIPIL.........E.................................................TPSLVGPLVKTEAEGDDADDEAKPSAEELAAAS.....APPISSSAPLPLPTSTFIGDLRLARLKHRLSL.....LNPPIPAEFA.GEGVLVCGPGIAQEAQGAASVVSV..RKIGE...............GKIVLEGCIG.....................RVYVEVRKALYGGL A8NYN1_COPC7/770-920 ISISDEMLASL..RMSRF......EDNEIGYVRGRVVM.HSNSIIPILE.......PA.................................................................SSAFPSSQTPTTKQVLN.....KRKLGSRPQVALPHSTMIGELKLTALKARLAK.....VG..IQAELV.GEGVLICGAGVGSL...DNLAETVAVRKVAS...............GRVELEGNVS.....................DVYYTVRKEIYQLH B0CXU7_LACBS/743-895 ISISDELLASL..KMSSF......EDNQIAYVRGRIVA.HATSTIPTLE.......PV.................................................................SSSTLSEDPVDSKVTVK.....RRTLGSRQQVALPHSTMIGELKLTALKARLAS.....IG..VQAELI.GEGVLICGAGAKRNASSDTLGESVSVRKLAR...............GTVELEGNVS.....................EVYYMVRREIYS.L A8PTN4_MALGO/730-857 VRLGDALMGSL..RWHPM......QDYNIVHLH..........................................................................................VSPDFASDSDTPTLVPV.....NDAATVHTAQA.PSTLYIGDLRLPALKAYLAR.....QH.RIRADFA.GEGVLVCGDRDE.......RNVTV..TKQGT...............GRIVVEGSLS.....................TNLARVRQSIYQLV CPSF2_DICDI/629-781 LLLKDSLVNTL..KTSKI......LDYEVSYIQGKVDI.LDGSNVPV..........L.................................................DLIQSIPINNNNNNNNNNNNNNNNNNNNTTMMT.........TTTTTTNGHDESFIGDIKLSDLKQVLVN.....AG..IQVQFD..QGILNCGGL...........VYI..WRDEDHG...........GNSIINVDGIIS.....................DEYYLIKELLYKQF B3S6C6_TRIAD/599-742 VRLRDSLVSSL..YYCNA......KDAELAWVDGRVTV.TAKGHERL..........L.........................................................DKNNKNEDEAMDTDNTSITEAVVPI.....LEPLLQSEIPG.HKSVFINDPRLSDLKQTLTK.....AG..IQAEFV..GGVIVCNDK...........IAV..RRTET...............GKITLEGAIC.....................NDYYTVRDILYQQY B0WQG5_CULQU/605-744 
VRLTEALVSQL..EFQKG......KDAEVAWVDAQIVI.RNKQFTSD..........Q..............................................................PMDVDQVEITEDKSDKQILT.....LDPLLNDQLPA.HNSVFINELKLIDFKQVLMK.....AN..IASEFS..GGVLWCSNG..........TLAL..RRIDT...............GKVTIEGCLS.....................EDYYRIRELLYEQY B3LXN9_DROAN/617-753 VRLTEGLVSQL..QFQKG......KDAEVAWVDGRLGM.RLKAIDA...........A................................................................MDVTAEQDNSAQEAKTLT.....LETLAEDEIPV.HNSVLINELKLSDFKQILMR.....NN..INSEFS..GGVLWCSNG..........TLAL..RRVDA...............GKVAMEGCLS.....................EEYYKIRELLYEQY B1H337_XENTR/608-780 VRLKDSLVSSL..KFCKA......KDTELAWIDGVLDM.RVSKVDTGVILE...EGEL...KDEGED..................S.E.............MQVDTQALDASAIAQQKAIKSLFGDDDKEFSEESEIIPT.....LEPLPSNEVPG.HQSVFMNEPRLSDFKQVLLR.....EG..IQAEFV..GGVLVCNNM...........VAV..RRTET...............GRIGLEGCLC.....................EDFFKIRELLYEQY B5X4U8_SALSA/609-793 VRLKDSLVSSL..QFCRA......KDTELAWIDGVLDM.RVVKVDTGVLPEEGVVKGEKGAGEEAVEDG...............EL.AM..........DVTPADDGTTDHSVVAQQRTMKTLFGEDVREPSEESDVIPT.....LEPLPAHEIPG.HQSVFINEPRLSDFKQVLLR.....EG..IQAEFV..GGVLVCNNI...........VAV..RRTEA...............GRIGLEGCLC.....................DDYYKIRELLYQQY B8JLG9_DANRE/1-179 VRLKDSLVSSL..QFCKA......RDTELAWIDGVLDM.RVEKVDTGVIVE..LGEAK...DEAEEGG................EQ.GM..........EVTEELNTEPSTAAAANQRAMKTLFGEDEKEISEESDVIPT.....LEPLPAHEVPG.HQSVFINEPRLSDFKQVLLR.....EG..IQAEFV..GGVLVCNNL...........VAV..RRTEA...............GRICLEGCHC.....................DDYYRIRELLYEQY A7RPX1_NEMVE/593-734 VKLRDALVSSL..QFAQA......RDAELAWIDGQLDM.KLAPANQDLM.......GD..............................................................KPGEEKMETDQDEALDTVPV.....LEQNTSSKIAG.HVSVFINEPRLSDFKQVLNK.....AG..IQAEFA..GGVLICNNV...........VCV..RRNET...............GRVGLEGTVC.....................EDYYTIRDLLYSQY C3YA87_BRAFL/430-604 
VKLKDSLVSSL..QFYKA......RDTELAWVDGQLDL.TTPTTDTSALLE...EGEV...QEMEDLE.................E.E............QFFKARDTELAWVDGPLLTLPFTCKSAKAAAEESRETVPT.....LEALPISQIPG.HEAVFINKPRLSDIKQVLQK.....EG..IQAEFS..GGVLICNNV...........VAL..KRNES...............GRIGMEGCIC.....................EDYYKVRKLLYEQY A8PAE9_BRUMA/664-828 VTLSDAVMSSL..IFQTV......KDAELSWLDARIVR.RKTVTPGQTRNT...AEEN................................................LETNGNKEEEVEEMEQDDSDQVEGKRLSNLKVAAADTFCLEPMLSANIPP.HQAVFVNDPKLSDMKQLLAS.....NG..FRAEFS..SGVLYINNI...........ASI..RRNEA...............GRFHVEGCAC.....................EDYYKIRDIVYAQF CPSF2_CAEEL/642-840 VALSDALLADI..QFKEV.....SEGNSLAWIDARVME.KEAIDNMLAVGT...SNLM...IDDKNREEDVN.........DQEEN.GATEGE...GNAEPMEIGENGSQESLAISESGKEVENGHTNDSRTKKGTKGKIRGNLILDPLPKRLIPI.HQAVFVNDPKLSDFKNLLTD.....KG..YKAEFL..SGTLLINGG..........NCSI..RRNDT...............GVFQMEGAFT.....................KDYYKLRRLFYDQF Q8WPK6_OIKDI/624-762 LKLKDSLLSNL..NFVRVG....SKDIEVARIRGRVD...........................................................................YFGGRLELEAENGENDEPKKLEIDDIPT.....LQPVTNNYSSG.HDSIFINDTKLTELKSNLID.....CG..MQAEFI..GGNLVCNNK...........VSI..KRSAN...............GVIQVEGTLS.....................EDYFIVRKMVYDNY A4HDW4_LEIBR/671-825 VQLESSLARSLSRGLRRVRETKSKSTWEVGWVNGELS....G.................................................................VRAEADLVEPEAQRRRSDRASYFLTAVTADKSQ.....ACAAQREQQGLQSGSFFVGDMDLRRLREVSRQ....ESG..LYSEFHKKAPLLVFDEG...........VCV..RKGAD...............GTVTISSIAT.....................PALFDLRRTVYRQY D0A5E4_TRYB9/642-814 VQLDPQLANALPSALRRVKETRSNGFWDVGWVDGSLVR.AVVYKEKDEEKE...EDDH...E............................................ELHPSQRRRTEGGTSTEMRDSVYTLTALSSDKAQ.....QCAREREMRGLQRGLYYLGETDLHKLRDAARN....EQG..IRGEFHKSAPMLIFDNG...........VCL..RKSVS...............GNVSLSSMVS.....................PSVFGLRKTVYKHF Q6BCB1_TRYCR/634-798 
VQLDPTLANALPSALRRVKESRSSGFWDVGWVDGALES.SFVSLTPED.........D.................................................ERQSVKRLRAEGNDGEIKDGVFTLVPLFGERAQ.....QCARERELRGMQRGLFYVGDVDLHRLRDVARS....EMG..LRGEFHKNAPMLVFDAG...........LCV..RKSAN...............GNFSLSSMIS.....................PSVFALRKTVYGQF C1FDL7_MICSR/630-806 LELSQDLLSHT..HMRDV......AGYQVGWVEGNVLI.SRGGGDPAATLV...PAKS...GMICEA..................................QRTGLQPNTGASQTATRETRTQDARVGLDFSREIDEQST.....ASELFLDELVVKKPAALVGSLKLSDSRLALAA.....AG..CATEFR..GGALMCTGD..........KVRV..RKTVNVM...........GAENLLLEGNLC.....................DTFFSVRSTLYHHH A4RR19_OSTLU/575-712 IRVSDALFQKA..NMRDM......AGYKVGWVNGVVG.............................................................................KALEEGGAPMLLPVSALNSNADGMAL....APSNATMTKVSAQPGSVFLGDLRLSDFRQALAQ.....EG..IIAEFA..DGVLVCANG..........RVTV..RKDGD...............EKLVVEGALS.....................QDYFEVRQILYSQY Q01GI5_OSTTA/663-804 IRLSDALFQKA..KMRDM......SGYRVGWVNGIVG.............................................................................KALEEGGAPMLLPMSTLSTKADAGALVTTTSNEMAIMKRAAAQPGSVFLGDLRLVDFRQALAQ.....EG..ITAEFS..GGVLVCADG..........RVTI..RKDSD...............EKLVIEGALS.....................QDFFEIRQILYSQY C4Y7S0_CLAL4/779-937 VKLDEQLDASL..VWQKI.....DGGYKVSQIQGELE........................IYQPEGV................QN.DS..........VDKIINSATQFVLKPVSN......PVFESLKNANTEDSLQG.....SRG..DF.....GPALAIGDIRLTELKKKLLS.....RD..LNAEFK.SEGTLVVNNA...........IAI..KKISVDNYQ.......GDDTGDIAIEGQIG.....................PLYYEVKNCIREML A3GHD1_PICST/771-931 VNLDDNIIEDL..KWQSI.....DGNYRVAQLYGELE........................IHNQDLSK...............KR.HRE.........VGDYINSSTLFTLKKVKK......EDFIR.RQAAVAEDVKN..........SLLLSS.GPKLAIGNIALPDLKKKLVS.....KN..LNAEFK.SEGTLVVNDK...........LAI..RKVAYGAVD.......TDDTGDIVIEGNVG.....................PLYYVVKDCIREML A5DGP1_PICGU/659-818 
VVLEDEIVDTL..KWQKV.....DGNYRVAPAYGELE........................LHNPHMPR...............KK.AKT.........VPDYINPSTQFSLKYISK......EEFMK.RQTEAGQAIVQ.....QEG......SS.GPKFAIGNIRLPELKRKLIA.....KE..MNAEFK.GEGTLVVNGK...........IAI..RKVTYGSID.......GDDTGDIEIDGTVG.....................PLYYKVKACIKEML B5RTE7_DEBHA/793-956 IKLDDSIIDSL..KWQNI.....DGSYRVAQVYGELE........................IHNQDLPN...............KK.QKT.........ISDYMNSSTQFTLKHISN......QDFLK.QQQALVDAHAA.....TTG..QLLNNN.GPKLAIGNIRLPELKKKLIS.....RN..MNAEFK.SEGTLVVNNS...........LAI..RKVTYSNVE.......GEDTGDIVIDGAMG.....................PLYYEIKDCIREML A5DXK9_LODEL/862-1064 VNLDDELVSTL..KWKKV.....GDNYKVAKIYGELE........................INNQSPFMESQNSATGQTVTESTDDMADVPPTKKQKKLFADYVNSSTQFSLKPVDP......NSTLRNSQNSIINRIQD.....PKL..RAMISNASPKLAIGNVRLPDLKNKLLALTVNNQP..LKVEFK.SEGTLVVNGQ...........IAI..RKISYSGVGMGDAGDGDESGGDIVIDGNIG.....................PLYYRIKEVIREML B9WDT3_CANDC/764-927 VNLDDSIVKDL..KWQKI.....GDDYKVAKLYGELE........................LQNQFPVT...............KR.IRT.........VQDYINSNTHFSLRKLDN......TTAVK.RQETIANQVQD.....PKI..RALITN.GPKLAIGNIRLPDLKKRLQK.....LN..MTAEFK.SEGTLVVNDI...........LAV..RKIAYGLVE.......SDESGDIVIDGNVG.....................PLYYKVKECIREML C4QZW3_PICPG/709-851 VKLADTLESEL..QWQNI.....AGGYSVAYVNGVLE.......................................................................TITDKKIESQTTENEDEGDKNKDESHYQELVL.....NPLDQLSTLKS.TAPLAIGDIRLSDLKTRLLG.....LQ..LKAEFK.GKGTLVINDE...........IMI..KKLND...............GEIMIDGTCN.....................ELFYVIRSAVQGML Q750X1_ASHGO/674-800 IFIDAEMDQML..NWQRI.....SEVYTVAHVVGRLT........................................................................................KEKDTKVSHRDKWVL.....KPLPNASARMQTTDSLRIGDVRLAELKRKLTA.....AS..HVAEFR.GEGTLVVDGR...........VIV..RKISE...............SETVVDGTPS.....................DLFYKVKSAVADML Q6FW78_CANGA/704-840 
ITISPELDALL..KWQRI.....SNDYTLAHVTGRLV.............................................................................KESAHQSSAVPVTDNTTSSGREKYVL.....KPLNGNVG.VQTNGSLAIGDVRLIKLKQNLNA.....TN..HTAEFK.GEGILVVDDK...........VII..RKISD...............SETIIDGPPS.....................ALFYSVKKLVMDML A7A117_YEAS7/725-856 ISIDSNLDNLL..KWQRI.....SDSYTVATVVGRLV..................................................................................KESLPQVNNHQKTASRSKLVL.....KPLHGSSR.SHKTGALSIGDVRLAQLKKLLTE.....KN..YIAEFK.GEGTLVINEK...........VAV..RKIND...............AETIIDGTPS.....................ELFDTVKKLVTDML C5DVE5_ZYGRC/694-832 ISVDPELDQLL..KWQSI.....SDGYTVAHVIGKLV............................................................................KEKPQAGKSQQQAQEQKQQLHRTRLVL.....EPLKTTSRHHHKSGSLSIGDVRLAELKRVLTA.....QR..HRAEFK.GEGTLVVDGQ...........VAV..RKIND...............GETVVDGAPS.....................ELFYLVRKSITDML A7TEK7_VANPO/685-818 ISIDPELDQML..RWQKI.....GYGHTVAHVIGRLV................................................................................KEKVQNSKLQDDDKEPLRTKMVL.....KPMENRTK.VHTGISLSIGDIRLAEVKRKLTD.....QK..HIAEFK.GEGTLVVDGQ...........VSI..RKIND...............GETIIDGSPS.....................ELYDIVKKAVVEML C5DMY3_LACTC/685-813 VSIDPDLDQHI..KWQSV.....SDGYTIAHVVGRLV.....................................................................................RDATQVAENQQQRIKWAL.....KPLSNNSK.FHPKTSLAIGDVKLGELKRKLTH.....KN..HVAEFK.GEGSLVVDGK...........VVV..RKISD...............GETVVDGNPS.....................ELFYEVKALVADML Q4N5Y7_THEPA/224-377 INSVLNSIEQW..KYKN.......NRVKSCQIVAKLTC.VEE...................SQ.........................................QDPPKTNVLGCHTFWSDDSSRKSKLELVVEDAENSE.....TAGKTVKVKKSVDSTLLVGDVSMSDYAKFLDD.....CI..PNSVSM.VAGSAVINDK...........VIV..SKYSDA............SSNQWVIEGTLD.....................PSYYLARKLL.RKL Q4UDL0_THEAN/669-822 
INTVLNSLEQW..KYKN.......NRVKSCQILGKLTC.AEE...................SQ.........................................QDPPKTNLVSCHTFWSDECNKKSKLEFVIEDKENND.....PETENYQVKNSIKSTLLIGNVSMANYTKFLDD.....CI..PNAISM.ISGSAVINDK...........VIV..SKHSNT............YSNQWVIEGTLD.....................PSYYLARKLL.RKL Q6CAZ0_YARLI/666-795 IQLTPELSRLL..NWQQL.....SGGLSLAHVVGKVAK..N..................................................................................EDKSEDTPLAALALQPI.....VDAADLAVAPR.IEPLRVGDIRLAELKQALGK.....LG..FRAVFQ.AGGVLVVDGK...........VSI..RKVDE...............SNLVVDGGIG.....................SDFYAIKEVVRAQL B2VU84_PYRTR/782-948 VKLSRNMVRNL..RWQNV......RGMGVVAITGRLA....AARLEPHSS....STTT...EEADTPA.................K.K............KARLDAPAIPVSS.............DKNDNTPVLDVVPT.....NMA..TAVRSV.AQPFHVGDLRLADLRRLMTA.....NG..MQAEYR.GDGILVINGS...........VAV..RKTAT...............GQIEIDGGAYGNLDP...........RNNDAATFLRVRRQIYEGL Q0UZX3_PHANO/789-951 VKLSRTMVRNL..HWQNV......RGMGVVAITGRLA....AATLDAP.........P...KEEEGSA.................K.K............KARLDAPAVPVSS............LLESSSTPILDVVPA.....NMA..TAVRSV.AQPFHVGDLRLADLRKLMKS.....NG..MEAEFR.GEGVLVINGT...........VAV..RKTAT...............GQIEVDGGAYGNTDA...........RNNDAATFFRVKRQIYDGL C7Z120_NECH7/795-952 VKLADSLVKKI..KWQNV......RGLGIVTITGQLLA.TKLDDAP...............AGDQDA....................A.............NKRQKTEESSTTALSTVVASP.........MPTLDVLPA.....NLV..SAVRSA.AQPLHVGDLRLADLRRAMQS.....AG..HTAEFR.GEGTLVVDGT...........VAV..RKTAA...............GRVEVESVGMPT................ARRSTFYEVRKVIYDNL Q7S0J8_NEUCR/807-980 LKLADPLVKGL..KWQNV......RGLGIVTVTGLLL....PGGEFQP........IE...VGDGDGD.................A.AK...........RQKLEDSSETPTTSTALVKTGT........KHLSHHRLPP.....HTG.........PRPANPGL..LVTFTSRPTS.....TR..HKAEFR.GEGTLLIDDV...........VVV..RKSTAQ.............GGRIELESVGLPSDTMPGTTSGGLLDAAMKVGGTFYAVKKKIYEGL B2AL57_PODAN/808-961 
VKLSDYLAKKL..KWQDV......NGLGIATITGVLL....PGGGFIP.........S...DDPNDEG.................N.K............RQKTEEGGSPSSSMALTTVNND.A...NPRTLPTVDVLPV.....NLAATATVKAA.SQPLHAGDLRLADLRRAMLH.....AG..HKAEFR.GEGTLLIDET...........VAV..RKSAA...............GR..............................TFYAVREKIYDVL Q2GQR7_CHAGB/795-950 VKLADPFVKRL..KWQNR......QKLEGTPATE.........................................................................TPAAATDGTLTTTNS.SPNNNKPTLPTLDVLPP.....TLA..SAVRSA.AQPLHVGDLRLADLRRAMLG.....AG..HRAEFR.GEGTLLIDGT...........VAV..RKTAT...............GRIEIESVGLPLVVGGGGV..GGGRGGVGGMGTFYEVRRKIYEGL C9SFK8_VERA1/564-733 VKLGDSLVKKL..KWQNL......RGLGIVTITGQLL....GESHA...............ISESTG..................S.N.............KRLKTASNDDSAAFKGEEGGEDSDNRAVEVVPVLDTLPL.....SMV..SAVRSV.AQPLHVGDLRLTDLRRAMQS.....TG..YTAEFR.GEGTLVINGA...........VAV..RKTNM...............GRIEVESVGVADPSA...........MMQQRSTFYEVKRMIYDGL A7F0N0_SCLS1/770-930 VKLTDSLVKQL..RWQNV......KGLGIVTLTGRLET.TNIDTDS...............HDSEGAN.................K.K............QKMLTGESEETPTQAALDSAKAVV......EMPILDVLPS.....NMA..SATRSV.AQPLHVGDLRLTDLRKIMQS.....SG..LTAELR.GEGTLLIDGS...........VIV..RKTGT...............GRIEVESVGVT...................TSSFYAVKGKIYEGL C5FLN4_ARTOC/820-992 VRLSRPLVRRL..KWQNV......SNLGVVALVGNLQ....SSQAISL.........Q...EEVLEQ....................S.............KSKGKGEAWKATGPVESQANQ.S.LIKNEKIPVLDILPA.....SLV..AATRSV.TKPLHVGDLRLSDLRKLMQS.....SG..HSAEFR.GEGTLLVDGF...........VAV..RKAGA...............GKIEVEGAARPSPSNP........TTLKQSTGSFLAVKQKIYESL C4JWC1_UNCRE/634-809 VKLSRALVRRL..KWQNV......RSLGVVALTAHLR....GPETAIE.........A...EKTEESS.................N.K............GATVQKSVENQPSGVVESRANE.S.LVKKEIYPLLDVLPP.....NLA..AATRSL.SKPLHVGDLRLADLRKLMQT.....SG..HSAEFR.GEGTLLIDGF...........VVV..KKSGA...............GRIEIEGSARAPPVNP........RAPGRDEGTFLAVKRKIYDCL C5PF10_COCP7/845-1020 
VKLSRALVRRL..RWQNV......RSLGVVALTANLQ....GPDAATQ.........N...DDVEEPS.................K.K............KAMLQKGADIQGPNVVESRANE.T.LIKKEVFPLLDVLPP.....NLA..AATRSL.SKPLHVGDLRLADLRKLMQA.....SG..HSAEFR.GDGTLLIDGF...........VVV..RKSGA...............GKIEIESSARAATVNP........KASKGGEGTFLAVKRKIYDCL A6R733_AJECN/799-971 VKLSSTLVKRL..KWQSV......RSLGVVALTGELR....GPEPMAA.........D...EDGPGM....................S.............QKKQRTFSENASSSEGNEKKQ.L.VPRKHSFPLLDVLPV.....NMA..AATRSV.TRPLHVGDLRLADLRKLMQS.....SG..HTAEFR.GEGTLLIDGF...........VAV..RKSGT...............GKIEIEGAAQSALSNR........SALKRDEGSFLAVKRKIYEGL #=GC seq_cons V+LscsLlppL..+Wppl......cshpVutlsGpLt.............................................................................ht..t.p..pttsptptpsthchlss.....p.s..ssthss.spslalGDlRLuDL+chLts.....pG..hpAEF+.u-GsLllsst...........VuV..RKsus...............GcltlEGshs.....................ssaatV+chlY-tL //