>AGAP011789-PA gnl|CDD|63977 pfam00089, Trypsin, Trypsin.. 156 2e-039 gnl|CDD|65482 pfam01690, PLRV_ORF5, Potato leaf roll virus readt... 39 8e-004 gnl|CDD|68206 pfam04625, DEC-1_N, DEC-1 protein, N terminal regi... 37 0.003 gnl|CDD|69062 pfam05518, Totivirus_coat, Totivirus coat protein.. 36 0.005 gnl|CDD|72470 pfam09052, SipA, Salmonella invasion protein A. Sa... 34 0.021 gnl|CDD|69154 pfam05616, Neisseria_TspB, Neisseria meningitidis ... 34 0.028 gnl|CDD|68125 pfam04540, Herpes_UL51, Herpesvirus UL51 protein. ... 33 0.028 gnl|CDD|69414 pfam05887, Trypan_PARP, Procyclic acidic repetitiv... 33 0.032 gnl|CDD|65906 pfam02165, WT1, Wilm's tumour protein.. 31 0.20 gnl|CDD|70630 pfam07174, FAP, Fibronectin-attachment protein (FA... 29 0.42 ==> gnl|CDD|63977 pfam00089, Trypsin, Trypsin.. Length = 216 Score = 156 bits (396), Expect = 2e-039 Identities = 83/244 (34%), Positives = 118/244 (48%), Gaps = 33/244 (13%) Query: 187 SEAEYGEFPWMVAILKTEEVLGQLRENVYTCGGSLIHRQVVLTGAHCVQNKQPSQLKVRV 246 EA+ G FPW V++ + CGGSLI VLT AHCV S ++V + Sbjct: 5 DEAQPGSFPWQVSLQ---------VSGGHFCGGSLISENWVLTAAHCVSG--ASSVRVVL 53 Query: 247 GEWDTQTKNEIYPHQDRSVVEIVVHPDYYKGGLHNDVALLFLNAPVEPNESIQTVCLPPQ 306 GE D + Q V +++VHP+Y ND+ALL L +PV ++++ +CLP Sbjct: 54 GEHDLVLREGGE--QKFDVEKVIVHPNY--NPDTNDIALLKLKSPVTLGDTVRPICLPAA 109 Query: 307 DMAFN-HETCFASGWGKDVFGKAGTYQVILKKIDLPVVPNDQCQTALRTTRLGPKFNLHK 365 TC SGWG K L+++ +P+V + C++A T + Sbjct: 110 SSDLPVGTTCTVSGWGNT---KTLGTPDTLQEVTVPIVSRETCRSAYGGT-------VTD 159 Query: 366 SFICAGGVPGKDTCKGDGGSPLVCPIPNSPHHYYQTGLVAWGIGCGENGIPGVYANVAKF 425 + ICAG GKD C+GD G PLVC G+V+WG GC PGVY V+++ Sbjct: 160 NMICAGA-GGKDACQGDSGGPLVC------SDGELVGIVSWGYGCASGNYPGVYTRVSRY 212 Query: 426 RGWI 429 WI Sbjct: 213 LDWI 216 ==> gnl|CDD|65482 pfam01690, PLRV_ORF5, Potato leaf roll virus readthrough protein. This family consists mainly of the potato leaf roll virus readthrough protein. This is generated via a readthrough of open reading frame 3 a coat protein allowing transcription of open reading frame 5 to give an extended coat protein with a large c-terminal addition or read through domain. The readthrough protein is thought to play a role in the circulative aphid transmission of potato leaf roll virus. Also in the family is open reading frame 6 from beet western yellows virus and potato leaf roll virus both luteovirus and an unknown protein from cucurbit aphid-borne yellows virus a closterovirus.. Length = 486 Score = 38.5 bits (89), Expect = 8e-004 Identities = 14/28 (50%), Positives = 16/28 (57%) Query: 139 VPPAPGPNPGPGPSPGPGPAPIPPPMPE 166 V +P P PGP P+P P P P P P P Sbjct: 1 VDGSPPPEPGPSPTPTPTPQPTPQPQPC 28 Score = 38.2 bits (88), Expect = 0.001 Identities = 14/31 (45%), Positives = 15/31 (48%) Query: 141 PAPGPNPGPGPSPGPGPAPIPPPMPESRCGR 171 P P PGPSP P P P P P P+ R Sbjct: 1 VDGSPPPEPGPSPTPTPTPQPTPQPQPCAER 31 Score = 37.4 bits (86), Expect = 0.002 Identities = 13/29 (44%), Positives = 15/29 (51%) Query: 135 PPPPVPPAPGPNPGPGPSPGPGPAPIPPP 163 PP PGP+P P P+P P P P P Sbjct: 1 VDGSPPPEPGPSPTPTPTPQPTPQPQPCA 29 Score = 37.4 bits (86), Expect = 0.002 Identities = 12/30 (40%), Positives = 13/30 (43%) Query: 133 IQPPPPVPPAPGPNPGPGPSPGPGPAPIPP 162 + PP P P P P P P P P P P Sbjct: 1 VDGSPPPEPGPSPTPTPTPQPTPQPQPCAE 30 ==> gnl|CDD|68206 pfam04625, DEC-1_N, DEC-1 protein, N terminal region. The defective chorion-1 gene (dec-1) in Drosophila encodes follicle cell proteins necessary for proper eggshell assembly. Multiple products of the dec-1 gene are formed by alternative RNA splicing and proteolytic processing. Cleavage products include S80 (80 kDa) which is incorporated into the eggshell, and further proteolysis of S80 gives S60 (60 kDa).. Length = 407 Score = 36.8 bits (84), Expect = 0.003 Identities = 12/37 (32%), Positives = 16/37 (43%), Gaps = 6/37 (16%) Query: 137 PPVPPAPG------PNPGPGPSPGPGPAPIPPPMPES 167 P +P PG P P P P+P P P P ++ Sbjct: 94 PAMPSMPGLLGAAAPVPAPAPAPAAAPPAAPAPAADT 130 Score = 35.3 bits (80), Expect = 0.008 Identities = 12/31 (38%), Positives = 16/31 (51%), Gaps = 1/31 (3%) Query: 138 PVP-PAPGPNPGPGPSPGPGPAPIPPPMPES 167 PVP PAP P P +P P P+P++ Sbjct: 108 PVPAPAPAPAAAPPAAPAPAADTPAAPIPDA 138 Score = 29.5 bits (65), Expect = 0.47 Identities = 11/27 (40%), Positives = 12/27 (44%) Query: 135 PPPPVPPAPGPNPGPGPSPGPGPAPIP 161 P P PA P P P+ APIP Sbjct: 110 PAPAPAPAAAPPAAPAPAADTPAAPIP 136 ==> gnl|CDD|69062 pfam05518, Totivirus_coat, Totivirus coat protein.. Length = 753 Score = 36.1 bits (83), Expect = 0.005 Identities = 11/40 (27%), Positives = 14/40 (35%) Query: 133 IQPPPPVPPAPGPNPGPGPSPGPGPAPIPPPMPESRCGRR 172 + PPP + A GP P AP P P + Sbjct: 710 LPPPPDLGAAAGPAPCGSSLIASPTAPPEPEPPGAEQADG 749 Score = 31.9 bits (72), Expect = 0.080 Identities = 12/39 (30%), Positives = 13/39 (33%) Query: 130 RLKIQPPPPVPPAPGPNPGPGPSPGPGPAPIPPPMPESR 168 L+ P PG G P P G A P P S Sbjct: 691 ALRAPQAPRPGGPPGGGGGLPPPPDLGAAAGPAPCGSSL 729 Score = 29.2 bits (65), Expect = 0.60 Identities = 12/36 (33%), Positives = 13/36 (36%) Query: 128 SLRLKIQPPPPVPPAPGPNPGPGPSPGPGPAPIPPP 163 +LR P P PP G P P G P P Sbjct: 691 ALRAPQAPRPGGPPGGGGGLPPPPDLGAAAGPAPCG 726 Score = 28.8 bits (64), Expect = 0.74 Identities = 8/33 (24%), Positives = 10/33 (30%) Query: 134 QPPPPVPPAPGPNPGPGPSPGPGPAPIPPPMPE 166 +PP P GP+P P P Sbjct: 705 GGGGGLPPPPDLGAAAGPAPCGSSLIASPTAPP 737 ==> gnl|CDD|72470 pfam09052, SipA, Salmonella invasion protein A. Salmonella invasion protein A is an actin-binding protein that contributes to host cytoskeletal rearrangements by stimulating actin polymerisation and counteracting F-actin destabilising proteins. Members of this family possess an all-helical fold consisting of eight alpha-helices arranged so that six long, amphipathic helices form a compact fold that surrounds a final, predominantly hydrophobic helix in the middle of the molecule.. Length = 674 Score = 34.0 bits (77), Expect = 0.021 Identities = 13/32 (40%), Positives = 14/32 (43%), Gaps = 1/32 (3%) Query: 130 RLKIQPPPPVPPAPGPNPGPGPSPGPGPAPIP 161 IQ P PP P P+ GP P G G P Sbjct: 267 SHPIQDGLPTPPEPMPDGGPTPG-GNGKTSQP 297 ==> gnl|CDD|69154 pfam05616, Neisseria_TspB, Neisseria meningitidis TspB protein. This family consists of several Neisseria meningitidis TspB virulence factor proteins.. Length = 510 Score = 33.6 bits (76), Expect = 0.028 Identities = 15/33 (45%), Positives = 18/33 (54%) Query: 134 QPPPPVPPAPGPNPGPGPSPGPGPAPIPPPMPE 166 QP P V PA P P P+ PG +P P P P+ Sbjct: 338 QPLPEVSPAENPANNPNPNENPGTSPNPEPDPD 370 Score = 30.5 bits (68), Expect = 0.20 Identities = 12/34 (35%), Positives = 15/34 (44%) Query: 134 QPPPPVPPAPGPNPGPGPSPGPGPAPIPPPMPES 167 + P PA PNP P P P P P P++ Sbjct: 342 EVSPAENPANNPNPNENPGTSPNPEPDPDLNPDA 375 ==> gnl|CDD|68125 pfam04540, Herpes_UL51, Herpesvirus UL51 protein. UL51 protein is a virion protein. In pseudorabies virus, UL51 was identified as a component of the capsid. In herpes simplex virus type 1 there is evidence for post-translational modification of UL51.. Length = 239 Score = 33.4 bits (76), Expect = 0.028 Identities = 14/47 (29%), Positives = 19/47 (40%), Gaps = 2/47 (4%) Query: 128 SLRLKIQPPPPVPPAPGPNP--GPGPSPGPGPAPIPPPMPESRCGRR 172 LR+ + PPP + P P P P+PP P + RR Sbjct: 179 LLRMGLVPPPDLKDPPALVEVIDVLPEKPLPPDPVPPLKPTNPPMRR 225 ==> gnl|CDD|69414 pfam05887, Trypan_PARP, Procyclic acidic repetitive protein (PARP). This family consists of several Trypanosoma brucei procyclic acidic repetitive protein (PARP) like sequences. The procyclic acidic repetitive protein (parp) genes of Trypanosoma brucei encode a small family of abundant surface proteins whose expression is restricted to the procyclic form of the parasite. They are found at two unlinked loci, parpA and parpB; transcription of both loci is developmentally regulated.. Length = 145 Score = 33.4 bits (75), Expect = 0.032 Identities = 8/33 (24%), Positives = 8/33 (24%) Query: 134 QPPPPVPPAPGPNPGPGPSPGPGPAPIPPPMPE 166 P P P P P P PE Sbjct: 61 DDEPEEEEEPEPEEEGEEEPEPEEEGEEEPEPE 93 Score = 31.1 bits (69), Expect = 0.14 Identities = 8/33 (24%), Positives = 9/33 (27%) Query: 134 QPPPPVPPAPGPNPGPGPSPGPGPAPIPPPMPE 166 +P P P P P P PE Sbjct: 71 EPEEEGEEEPEPEEEGEEEPEPEETGEEEPEPE 103 Score = 31.1 bits (69), Expect = 0.14 Identities = 13/32 (40%), Positives = 14/32 (43%) Query: 134 QPPPPVPPAPGPNPGPGPSPGPGPAPIPPPMP 165 +P P P P P P P P P P P P P Sbjct: 91 EPEETGEEEPEPEPEPEPEPEPEPEPEPEPEP 122 Score = 30.7 bits (68), Expect = 0.18 Identities = 11/33 (33%), Positives = 12/33 (36%) Query: 134 QPPPPVPPAPGPNPGPGPSPGPGPAPIPPPMPE 166 +P P P P P P P P P PE Sbjct: 81 EPEEEGEEEPEPEETGEEEPEPEPEPEPEPEPE 113 Score = 29.9 bits (66), Expect = 0.33 Identities = 8/33 (24%), Positives = 9/33 (27%) Query: 134 QPPPPVPPAPGPNPGPGPSPGPGPAPIPPPMPE 166 + P P P P P P PE Sbjct: 75 EGEEEPEPEEEGEEEPEPEETGEEEPEPEPEPE 107 Score = 29.6 bits (65), Expect = 0.37 Identities = 7/33 (21%), Positives = 7/33 (21%) Query: 134 QPPPPVPPAPGPNPGPGPSPGPGPAPIPPPMPE 166 P P P P P PE Sbjct: 59 DPDDEPEEEEEPEPEEEGEEEPEPEEEGEEEPE 91 Score = 29.2 bits (64), Expect = 0.55 Identities = 11/33 (33%), Positives = 12/33 (36%) Query: 134 QPPPPVPPAPGPNPGPGPSPGPGPAPIPPPMPE 166 + P P P P P P P P P PE Sbjct: 85 EGEEEPEPEETGEEEPEPEPEPEPEPEPEPEPE 117 Score = 29.2 bits (64), Expect = 0.57 Identities = 6/33 (18%), Positives = 7/33 (21%) Query: 134 QPPPPVPPAPGPNPGPGPSPGPGPAPIPPPMPE 166 + P P P P P E Sbjct: 65 EEEEEPEPEEEGEEEPEPEEEGEEEPEPEETGE 97 Score = 28.8 bits (63), Expect = 0.81 Identities = 8/34 (23%), Positives = 9/34 (26%) Query: 134 QPPPPVPPAPGPNPGPGPSPGPGPAPIPPPMPES 167 + P P P P P P PE Sbjct: 73 EEEGEEEPEPEEEGEEEPEPEETGEEEPEPEPEP 106 ==> gnl|CDD|65906 pfam02165, WT1, Wilm's tumour protein.. Length = 322 Score = 30.6 bits (68), Expect = 0.20 Identities = 14/27 (51%), Positives = 14/27 (51%) Query: 137 PPVPPAPGPNPGPGPSPGPGPAPIPPP 163 PP A G GP P P P P P PPP Sbjct: 43 PPGASAYGSLGGPAPPPAPPPPPPPPP 69 ==> gnl|CDD|70630 pfam07174, FAP, Fibronectin-attachment protein (FAP). This family contains bacterial fibronectin-attachment proteins (FAP). Family members are rich in alanine and proline, are approximately 300 long, and seem to be restricted to mycobacteria. These proteins contain a fibronectin-binding motif that allows mycobacteria to bind to fibronectin in the extracellular matrix.. Length = 296 Score = 29.4 bits (65), Expect = 0.42 Identities = 16/49 (32%), Positives = 21/49 (42%) Query: 117 SASVIVIAFFLSLRLKIQPPPPVPPAPGPNPGPGPSPGPGPAPIPPPMP 165 SAS + IA + +PPPPVPP+ P + P P P Sbjct: 25 SASAVTIALPATANADPEPPPPVPPSTATTPSTAAAAPAPAPPTRAPPP 73