/seqprg/slib/bin/fasta3_t -w 80 -m 6 -q @ %p
FASTA searches a protein or DNA sequence data bank
version 3.1t02 March, 1998
Please cite:
W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448

@: 333 aa
gi|115741|sp|P07711|CATL_HUMAN CATHEPSIN L PRECURSOR (MAJOR EXCRETED PROTEIN) (MEP)
vs NBRF Protein database (complete) library
searching /seqlib/lib/pir1.seq 5 library
searching /seqlib/lib/pir2.seq 5 library
searching /seqlib/lib/pir3.seq 5 library
searching /seqlib/lib/pir4.seq 5 library

       opt   E()
< 20   865     0:======
  22     1     0:=          one = represents 166 library sequences
  24     2     0:=
  26     6     2:*
  28    33    24:*
  30   131   146:*
  32   537   563:===*
  34  1595  1528:=========*
  36  3395  3138:==================*==
  38  5790  5185:===============================*===
  40  8163  7233:===========================================*======
  42  9373  8842:=====================================================*===
  44  9618  9753:==========================================================*
  46  9955  9934:===========================================================*
  48  8978  9510:======================================================= *
  50  8189  8678:================================================== *
  52  7127  7630:=========================================== *
  54  6222  6517:====================================== *
  56  5193  5444:================================*
  58  4183  4469:==========================*
  60  3440  3620:=====================*
  62  2847  2902:=================*
  64  2188  2308:=============*
  66  1842  1824:==========*=
  68  1442  1435:========*
  70  1110  1125:======*
  72   847   879:=====*
  74   655   685:====*
  76   506   533:===*
  78   397   414:==*
  80   275   322:=*
  82   220   246:=*
  84   198   195:=*
  86   119   151:*
  88    97   117:*          inset = represents 4 library sequences
  90    75    90:*
  92    59    70:*          :=============== *
  94    37    54:*          :========== *
  96    34    42:*          :========= *
  98    28    32:*          :=======*
 100    17    25:*          :===== *
 102    15    19:*          :====*
 104    10    15:*          :===*
 106     6    12:*          :==*
 108     8     9:*          :==*
 110     3     7:*          :=*
 112     1     5:*          :=*
 114     2     4:*          :*
 116     1     3:*          :*
 118     3     2:*          :*
>120   160     2:*          :*=======================================

33852246 residues in 105998 sequences
statistics extrapolated from 50000 to 105752 sequences
Expectation_n fit: rho(ln(x))= 5.7125+/-0.000472; mu= 4.6298+/- 0.026;
mean_var=76.4472+/-13.971, Z-trim: 96 B-trim: 0 in 0/64
Kolmogorov-Smirnov statistic: 0.0225 (N=29) at 42

FASTA (3.14 April, 1998) function (optimized, BL50 matrix) ktup: 2
join: 37, opt: 25, gap-pen: -12/ -2, width: 16 reg.-scaled
Scan time: 75.017
---------------------------------------------------------------------------
The best scores are: initn init1 opt z-sc E(105752)
KHHUL cathepsin L (EC 3.4.22.15) precursor - human ( 333) 2321 2321 2321 2661.3 2.1e-141
A58195 cathepsin L (EC 3.4.22.15) precursor - pig ( 334) 1844 1161 1883 2160.4 1.7e-113
KHRTL cathepsin L (EC 3.4.22.15) precursor - rat ( 334) 1771 1771 1798 2063.1 4.4e-108
KHMSL cathepsin L (EC 3.4.22.15) precursor - mouse ( 334) 1764 1764 1769 2030.0 3.1e-106
I52525 testin precursor - rat ( 333) 1448 1448 1460 1676.6 1.5e-86
KHCHL cathepsin L (EC 3.4.22.15) - chicken ( 218) 1180 595 1238 1425.4 1.4e-72
A53810 cathepsin L (EC 3.4.22.15) precursor - flesh fly (Sarc ( 339) 921 593 1169 1343.6 5.2e-68
S53027 cathepsin L (EC 3.4.22.15) precursor - penaeid shrimp ( 326) 1097 689 1143 1314.2 2.3e-66
S47433 cathepsin L (EC 3.4.22.15) - Norway lobster ( 313) 1035 841 1141 1312.1 3e-66
JC5443 cathepsin L-like cysteine proteinase (EC 3.4.-.-) c1 - ( 338) 891 590 1139 1309.4 4.2e-66
JC2476 cathepsin K (EC 3.4.22.-) precursor - human ( 329) 871 321 1131 1300.4 1.3e-65
S19651 cysteine proteinase (EC 3.4.22.-) precursor (clone LCP ( 320) 892 892 1121 1289.1 5.7e-65
A49868 probable cysteine proteinase OC-2 precursor, osteoclas ( 329) 841 280 1120 1287.8 6.7e-65
A42482 cathepsin S (EC 3.4.22.27) - human ( 331) 890 282 1090 1253.5 5.5e-63
S19650 cysteine proteinase (EC 3.4.22.-) precursor (clone LCP ( 323) 1037 499 1083 1245.6 1.5e-62
S19649 cysteine proteinase (EC 3.4.22.-) precursor (clone LCP ( 322) 827 327 1081 1243.3 2e-62
A45087 cathepsin S (EC 3.4.22.27) - rat ( 330) 831 295 1060 1219.2 4.5e-61
JX0366 cysteine endopeptidase (EC 3.4.22.-) precursor - silkw ( 344) 1115 585 1057 1215.5 7.2e-61
JC5442 cathepsin L-like cysteine proteinase (EC 3.4.-.-) g3 - ( 331) 758 581 1053 1211.1 1.3e-60
JC5441 cathepsin L-like cysteine proteinase (EC 3.4.-.-) g2 - ( 331) 759 582 1050 1207.7 1.9e-60
S47432 cathepsin L (EC 3.4.22.15) - Norway lobster ( 324) 871 294 1045 1202.1 4e-60
S67481 cysteine proteinase CP1 - fruit fly (Drosophila melano ( 218) 975 576 1011 1165.8 4.2e-58
S43991 cathepsin L-like proteinases - liver fluke ( 326) 704 371 941 1083.1 1.7e-53
S44151 cathepsin L (EC 3.4.22.15) - fluke (Schistosoma manson ( 317) 666 390 901 1037.6 5.8e-51
I58002 cathepsin-related protein - rat (fragment) ( 236) 868 623 898 1036.1 7.1e-51
S57777 cysteine proteinase (EC 3.4.22.-) precursor - Hemeroca ( 360) 653 245 894 1028.7 1.8e-50
KHRZOA oryzain (EC 3.4.22.-) alpha precursor - rice ( 458) 546 270 888 1020.3 5.3e-50
S15844 cathepsin S (EC 3.4.22.27) - bovine ( 217) 691 276 873 1008.0 2.6e-49
S57776 cysteine proteinase - clove pink (fragment) ( 427) 388 294 874 1004.7 3.9e-49
S41427 cysteine proteinase (EC 3.4.22.-) CP1 precursor - Tric ( 309) 523 331 867 998.9 8.4e-49
JC4848 cysteine proteinase (EC 3.4.22.-) - Douglas fir ( 454) 396 298 864 992.9 1.8e-48
KHRZOB oryzain (EC 3.4.22.-) beta precursor - rice ( 471) 713 393 860 988.1 3.3e-48
S22502 endopeptidase - kidney bean ( 362) 420 298 841 968.1 4.3e-47
S12581 cysteine proteinase (EC 3.4.22.-) - black gram ( 362) 424 302 841 968.1 4.3e-47
JQ1111 cysteine proteinase (EC 3.4.22.-) EP-B 1 precursor - b ( 371) 573 246 827 951.9 3.4e-46
JQ1110 cysteine proteinase (EC 3.4.22.-) EP-B 4 precursor - b ( 373) 573 246 821 945.0 8.3e-46
JQ1121 cysteine proteinase homolog COT44 - rape ( 328) 498 318 807 929.8 5.8e-45
S47312 cysteine proteinase (EC 3.4.22.-) precursor - spring v ( 368) 374 270 802 923.4 1.3e-44
S49451 cysteine proteinase - chickpea ( 325) 331 233 797 918.5 2.5e-44
KHRTH cathepsin H (EC 3.4.22.16) precursor - rat ( 333) 580 389 797 918.3 2.6e-44
KHHUH cathepsin H (EC 3.4.22.16) precursor - human ( 335) 596 364 797 918.3 2.6e-44
JN0719 drought-inducible cysteine proteinase (EC 3.4.22.-) RD ( 462) 427 285 796 915.0 3.9e-44
S49166 cysteine proteinase precursor - spring vetch ( 357) 659 243 789 908.7 8.8e-44
KHDOP prestalk cathepsin (EC 3.4.22.-) precursor - slime mold ( 376) 815 420 779 896.9 4e-43
S66348 senescence-associated cysteine proteinase precursor (c ( 356) 600 447 768 884.7 1.9e-42
S47434 cysteine proteinase - rice ( 378) 490 310 764 879.7 3.6e-42
S59598 cysteine proteinase 2 precursor - maize ( 360) 631 456 754 868.6 1.5e-41
S24602 cysteine proteinase tpp (EC 3.4.22.-) - garden pea ( 464) 467 242 745 856.7 7e-41
S11862 cysteine proteinase homolog - garden pea ( 363) 606 287 742 854.8 8.8e-41
S07051 cysteine proteinase (EC 3.4.22.-) precursor - Trypanos ( 450) 461 277 743 854.6 9.1e-41
S42882 cysteine proteinase (EC 3.4.22.-) precursor - spring v ( 358) 606 287 737 849.2 1.8e-40
S12099 cysteine proteinase (EC 3.4.22.-) precursor - Trypanos ( 450) 470 286 737 847.7 2.2e-40
TAGB actinidain (EC 3.4.22.14) precursor - kiwi fruit ( 380) 536 353 735 846.5 2.6e-40
KHRZOG oryzain (EC 3.4.22.-) gamma precursor - rice ( 362) 590 449 731 842.3 4.4e-40
A45629 cruzipain - Trypanosoma cruzi ( 467) 444 283 726 834.9 1.1e-39
S41428 cysteine proteinase (EC 3.4.22.-) CP2 precursor - Tric ( 314) 437 277 723 834.0 1.3e-39
A60667 cysteine proteinase cruzain (EC 3.4.22.-) - Trypanosom ( 467) 434 275 723 831.5 1.8e-39
KHBH aleurain (EC 3.4.22.-) precursor - barley ( 361) 494 335 719 828.6 2.6e-39
JN0718 drought-inducible cysteine proteinase (EC 3.4.22.-) RD ( 368) 521 286 719 828.4 2.6e-39
S55923 cysteine proteinase (EC 3.4.22.-) precursor - soybean ( 380) 584 273 709 816.8 1.2e-38
S46535 probable cysteine proteinase (EC 3.4.22.-) (clone A149 ( 313) 517 285 706 814.6 1.5e-38
B23705 cysteine proteinase (EC 3.4.22.-) - Entamoeba histolyt ( 312) 503 187 692 798.6 1.2e-37
A41404 cathepsin L (EC 3.4.22.15) - cat (fragment) ( 139) 687 687 687 798.2 1.3e-37
S02728 actinidain (EC 3.4.22.14) precursor (clone pAC.1) - ki ( 302) 527 346 688 794.3 2.1e-37
S37048 cysteine proteinase - Trypanosoma congolense ( 447) 431 259 679 781.4 1.1e-36
KHDO cysteine proteinase 1 (EC 3.4.22.-) precursor - slime mo ( 343) 506 280 669 771.7 3.7e-36
JA0159 cysteine proteinase (EC 3.4.22.-) precursor - tomato ( ( 346) 435 298 669 771.7 3.8e-36
S24988 cysteine proteinase (EC 3.4.22.-) precursor - tomato ( 361) 497 274 668 770.2 4.5e-36
A23705 cysteine proteinase (EC 3.4.22.-) - Entamoeba histolyt ( 312) 440 193 665 767.8 6.2e-36
S30150 probable cysteine proteinase precursor (clone CYP-8) - ( 365) 512 263 663 764.4 9.5e-36
JN0633 caricain (EC 3.4.22.30) I precursor - papaya ( 348) 448 252 656 756.7 2.6e-35
JN0634 caricain (EC 3.4.22.30) II precursor - papaya ( 367) 443 252 653 753.0 4.1e-35
S25267 cysteine proteinase (EC 3.4.22.-) precursor - Leishman ( 354) 464 275 647 746.3 9.7e-35
S30149 probable cysteine proteinase precursor (clone CYP-7) - ( 363) 495 263 641 739.3 2.4e-34
S59597 cysteine proteinase 1 precursor - maize ( 371) 526 286 636 733.5 5.1e-34
A48566 cysteine proteinase Lpcys2 (EC 3.4.22.-) - Leishmania ( 444) 481 300 637 733.4 5.1e-34
KHSYO4 oil bodies-associated protein P34 precursor - soybean ( 379) 506 224 635 732.2 6e-34
S29245 cysteine proteinase (EC 3.4.22.-) precursor - Leishman ( 443) 411 317 633 728.9 9.1e-34
S04222 chymopapain (EC 3.4.22.6) - papaya ( 218) 473 235 589 683.2 3.2e-31
PPPA papain (EC 3.4.22.2) precursor - papaya ( 345) 394 242 559 645.9 3.8e-29
S06837 glycyl endopeptidase (EC 3.4.22.25) - papaya ( 216) 420 240 552 640.9 7.2e-29
S68783 cathepsin L (EC 3.4.22.15) precursor - Paramecium tetr ( 314) 430 313 545 630.5 2.8e-28
S41425 cysteine proteinase (EC 3.4.22.-) CP3 precursor - Tric ( 278) 391 262 528 611.8 3e-27
A44938 cysteine proteinase (EC 3.4.22.-) - Trypanosoma cruzi ( 183) 332 283 523 608.8 4.4e-27
S46476 cysteine proteinase III - mountain papaya ( 214) 468 234 488 567.8 8.6e-25
S62736 cathepsin-like cysteine proteinase (EC 3.4.22.-) - Aut ( 323) 432 209 485 561.7 1.9e-24
A55090 cathepsin O (EC 3.4.-.-) precursor - human ( 321) 406 188 484 560.6 2.2e-24
S62735 cathepsin - Choristoneura fumiferana nuclear polyhedro ( 324) 471 215 482 558.2 2.9e-24
JC5691 cysteine proteinase (EC 3.4.-.-) - Bombyx mori nuclear ( 323) 439 216 479 554.8 4.5e-24
C44938 cysteine proteinase (EC 3.4.22.-) - Entamoeba histolyt ( 165) 347 176 465 543.2 2e-23
A47306 cysteine proteinase - Tetrahymena thermophila (SGC5) ( 336) 393 221 458 530.5 1e-22
B44938 cysteine proteinase (EC 3.4.22.-) - Trypanosoma brucei ( 166) 272 165 446 521.4 3.3e-22
S03964 stem bromelain (EC 3.4.22.32) - pineapple ( 212) 431 188 436 508.4 1.7e-21
S27044 papain-like protein - Autographa californica nuclear p ( 208) 424 201 433 505.1 2.7e-21
A61500 allergen Der f I precursor - house-dust mite (Dermatop ( 319) 260 182 429 497.7 6.9e-21
S57422 cysteine proteinase (EC 3.4.22.-) 8 - Tritrichomonas f ( 152) 288 153 412 483.1 4.5e-20
A45624 trophozoite cysteine proteinase - Plasmodium falciparu ( 569) 378 226 413 475.6 1.2e-19
S02729 actinidain (EC 3.4.22.14) precursor (clone pAC.7) - ki ( 184) 292 131 400 468.1 3e-19
PQ0650 senescence-associated protein SAG2 - Arabidopsis thali ( 95) 353 353 385 455.3 1.6e-18
A41158 dipeptidyl-peptidase I (EC 3.4.14.1) precursor - rat ( 462) 334 147 393 454.1 1.8e-18
S66504 dipeptidyl-peptidase I (EC 3.4.14.1) precursor - human ( 463) 313 154 386 446.1 5.1e-18
S57425 cysteine proteinase (EC 3.4.22.-) 7 - Tritrichomonas f ( 152) 275 167 378 444.2 6.5e-18
S57421 cysteine proteinase (EC 3.4.22.-) 6 - Tritrichomonas f ( 152) 225 122 370 435.1 2.1e-17
S57427 cysteine proteinase (EC 3.4.22.-) 4 - Tritrichomonas f ( 152) 290 155 370 435.1 2.1e-17
S21864 probable cysteine proteinase (EC 3.4.22.-) - Euroglyph ( 211) 289 172 370 432.9 2.8e-17
S46265 cysteine proteinase - Plasmodium vivax ( 583) 346 113 374 430.8 3.6e-17
S68784 cathepsin L - Paramecium tetraurelia (SGC5) (fragment) ( 294) 279 125 361 420.5 1.4e-16
JQ0337 allergen Der p 1 - house-dust mite (Dermatophagoides p ( 245) 285 206 349 407.9 6.9e-16
S57423 cysteine proteinase (EC 3.4.22.-) 9 - Tritrichomonas f ( 152) 239 129 336 396.2 3.1e-15
S57451 cysteine proteinase (EC 3.4.22.-) 3 - Tritrichomonas f ( 157) 227 147 322 379.9 2.5e-14
B48566 cysteine proteinase Lpcys1 (EC 3.4.22.-) - Leishmania ( 149) 190 116 313 370.0 8.9e-14
S31914 cysteine proteinase - chickpea (fragment) ( 111) 173 173 310 368.5 1.1e-13
S46541 cysteine proteinase - chickpea (fragment) ( 111) 173 173 308 366.2 1.4e-13
S57426 cysteine proteinase (EC 3.4.22.-) 5 - Tritrichomonas f ( 155) 186 113 305 360.6 3e-13
S04924 CTLA-2-alpha protein precursor - mouse ( 136) 271 271 292 346.6 1.8e-12
S60456 cysteine proteinase (EC 3.4.22.-), glucose starvation- ( 145) 179 179 291 345.0 2.2e-12
S04925 CTLA-2-beta protein precursor - mouse (fragment) ( 141) 271 271 281 333.8 9.3e-12
KHQBTT cysteine proteinase (EC 3.4.22.-) precursor - Theileri ( 439) 344 182 287 333.2 1e-11
A69493 cysteine proteinase homolog - Archaeoglobus fulgidus (1088) 234 136 258 294.1 1.5e-09
S57624 cysteine proteinase LmCPb19 - Leishmania mexicana (fra ( 136) 195 108 238 284.8 4.9e-09
B26074 cysteine proteinase (EC 3.4.22.-) 13 - papaya (fragmen ( 96) 116 116 227 274.5 1.9e-08
A45565 cysteine proteinase - Theileria annulata ( 441) 232 101 235 273.7 2.1e-08
S41426 cysteine proteinase (EC 3.4.22.-) CP4 precursor - Tric ( 100) 204 146 224 270.8 3e-08
S60479 cathepsin B-like cysteine proteinase (EC 3.4.22.-) - A ( 356) 236 169 231 270.5 3.1e-08
S31907 cathepsin B (EC 3.4.22.1) - fluke (Schistosoma japonic ( 342) 194 89 225 263.9 7.2e-08
KHHUB cathepsin B (EC 3.4.22.1) precursor - human ( 339) 181 83 223 261.7 9.6e-08
S38939 probable cathepsin B-like cysteine proteinase (EC 3.4. ( 344) 233 104 223 261.6 9.7e-08
S31909 cathepsin B-like cysteine proteinase (EC 3.4.22.-) - f ( 316) 176 94 221 259.9 1.2e-07
KHRTB cathepsin B (EC 3.4.22.1) precursor - rat ( 339) 208 82 221 259.4 1.3e-07
D48435 cysteine proteinase AC-3 - nematode (Haemonchus contor ( 341) 214 97 216 253.6 2.7e-07
KHMSB cathepsin B (EC 3.4.22.1) precursor - mouse ( 339) 203 82 210 246.8 6.5e-07
KHBOB cathepsin B (EC 3.4.22.1) precursor - bovine ( 335) 206 82 209 245.8 7.4e-07
A61061 actinidain (EC 3.4.22.14) - kiwi fruit (cv. Hayward) ( ( 110) 241 136 200 242.7 1.1e-06
C48435 cysteine proteinase AC-4 - nematode (Haemonchus contor ( 342) 180 101 194 228.5 6.8e-06
A48454 cathepsin B-like cysteine proteinase (EC 3.4.-.-) - ne ( 342) 151 81 188 221.6 1.6e-05
A57480 tubulointerstitial nephritis antigen precursor - rabbi ( 474) 117 72 189 220.6 1.9e-05
B48454 cathepsin B-like cysteine proteinase (EC 3.4.-.-) CP-3 ( 174) 142 83 183 220.3 1.9e-05
S58770 cathepsin B (EC 3.4.22.1) precursor - chicken ( 340) 176 82 186 219.4 2.2e-05
B48435 cysteine proteinase AC-5 - nematode (Haemonchus contor ( 348) 128 83 186 219.2 2.2e-05
A44965 cysteine proteinase (EC 3.4.22.-) AC-2 precursor - nem ( 342) 204 85 185 218.2 2.5e-05
A45524 cysteine proteinase (EC 3.4.22.-) AC-1 precursor - nem ( 342) 204 85 184 217.0 3e-05
A54505 serine-repeat antigen precursor - Plasmodium falciparu ( 989) 151 113 188 214.7 4e-05
S35580 proteinase IV - mountain papaya (fragment) ( 43) 151 151 170 214.6 4e-05
S35577 cysteine proteinase I - mountain papaya (fragment) ( 43) 152 152 161 204.3 0.00015
S35578 cysteine proteinase II - mountain papaya (fragment) ( 43) 150 150 157 199.7 0.00027
S32561 cysteine proteinase - Plasmodium vinckei ( 506) 353 114 158 184.7 0.0019
A29172 cathepsin B-like cysteine proteinase (EC 3.4.22.-) - b ( 73) 108 108 136 172.2 0.0092
F32946 cysteine proteinase (EC 3.4.22.-) - Caenorhabditis ele ( 53) 130 94 132 169.7 0.013
S16162 cruzipain (EC 3.4.22.-) - Trypanosoma cruzi (fragment) ( 173) 117 117 133 163.2 0.03
A30043 trophoblast-specific protein precursor - mouse ( 124) 156 88 124 155.0 0.084
S23941 dipeptidyl-peptidase I (EC 3.4.14.1) - human (fragment ( 119) 130 104 115 145.0 0.3
S15845 cathepsin L (EC 3.4.22.15) - bovine (fragments) ( 38) 178 100 107 143.3 0.38
A31657 major fecal allergen Der p 1 - house-dust mite (Dermat ( 92) 101 101 109 139.8 0.59
S46204 ananain (EC 3.4.22.31) - pineapple (fragment) ( 20) 80 80 100 139.5 0.61
S14329 thaumatopain - miracle fruit (fragment) ( 35) 92 92 102 138.1 0.73
S46205 comosain - pineapple (fragment) ( 20) 76 76 96 134.9 1.1
S39367 proteinase omega - papaya (fragments) ( 37) 86 86 97 132.1 1.6
S03380 major fecal allergen Der p I - house-dust mite (Dermat ( 94) 82 52 95 123.7 4.7
A35417 28K serine proteinase homolog - bovine (fragment) ( 15) 82 82 82 120.8 6.8
LUBO11 annexin XI form A - bovine ( 503) 47 47 101 119.6 7.9
S23447 annexin XI form B - bovine ( 505) 47 47 101 119.6 7.9
B45658 pancreatic lipase (EC 3.1.1.3) - sheep (fragment) ( 86) 33 33 90 118.5 9
---------------------------------------------------------------------------
>>>@, 333 aa vs %p library

>>KHHUL cathepsin L (EC 3.4.22.15) precursor - human (333 aa)
initn: 2321 init1: 2321 opt: 2321 Z-score: 2661.3 expect() 2.1e-141
Smith-Waterman score: 2321; 100.000% identity in 333 aa overlap
>KHHUL 1- 333:---------------------------------------------------------------------:
10 20 30 40 50 60 70 80
gi|115 MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGMNEEGWRRAVWEKNMKMIELHNQEYREGKHSFTMAMNAF
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
KHHUL MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGMNEEGWRRAVWEKNMKMIELHNQEYREGKHSFTMAMNAF
10 20 30 40 50 60 70 80
90 100 110 120 130 140 150 160
gi|115 GDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLIS
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
KHHUL GDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLIS
90 100 110 120 130 140 150 160
170 180 190 200 210 220 230 240
gi|115 LSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPKQEKALMKAVA
::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::
KHHUL 
LSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPKQEKALMKAVA 170 180 190 200 210 220 230 240 250 260 270 280 290 300 310 320 gi|115 TVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRN :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: KHHUL TVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRN 250 260 270 280 290 300 310 320 330 gi|115 HCGIASAASYPTV ::::::::::::: KHHUL HCGIASAASYPTV 330 --------------------------------------------------------------------------- >>A58195 cathepsin L (EC 3.4.22.15) precursor - pig (334 aa) initn: 1844 init1: 1161 opt: 1883 Z-score: 2160.4 expect() 1.7e-113 Smith-Waterman score: 1883; 78.743% identity in 334 aa overlap Entrez lookup Re-search database >A58195 1- 333:---------------------------------------------------------------------: 10 20 30 40 50 60 70 80 gi|115 MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGMNEEGWRRAVWEKNMKMIELHNQEYREGKHSFTMAMNAF :.:.:.:.:.:::::::. .:..:.:.: :::: :.::::::::::::::::::::::::::::: .:::.:.:::::: A58195 MKPSLFLTALCLGIASAAPKLDQNLDADWYKWKATHGRLYGMNEEGWRRAVWEKNMKMIELHNQEYSQGKHGFSMAMNAF 10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 160 gi|115 GDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLIS ::::.::::::::::::.: .:::::.: : :.:.::::::::::: ::::::::::::::::::::::::::::.:.: A58195 GDMTNEEFRQVMNGFQNQKHKKGKVFHESLVLEVPKSVDWREKGYVTAVKNQGQCGSCWAFSATGALEGQMFRKTGKLVS 90 100 110 120 130 140 150 160 170 180 190 200 210 220 230 gi|115 LSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATE-ESCKYNPKYSVANDTGFVDIPKQEKALMKAV :::::::::: ::::.:::::::: :::::.::::::.:::::: . : .:: :.:. 
:.::::::::::..:::::::: A58195 LSEQNLVDCSRPQGNQGCNGGLMDNAFQYVKDNGGLDTEESYPYLGRETNSCTYKPECSAANDTGFVDIPQREKALMKAV 170 180 190 200 210 220 230 240 240 250 260 270 280 290 300 310 gi|115 ATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDRR :::::::::::::: :: ::: :::..:::::.:.::::::::::::.:.:...:.:.:::::: ::: .::::::::. A58195 ATVGPISVAIDAGHSSFQFYKSGIYYDPDCSSKDLDHGVLVVGYGFEGTDSNSSKFWIVKNSWGPEWGWNGYVKMAKDQN 250 260 270 280 290 300 310 320 320 330 gi|115 NHCGIASAASYPTV :::::..::::::: A58195 NHCGISTAASYPTV 330 --------------------------------------------------------------------------- >>KHRTL cathepsin L (EC 3.4.22.15) precursor - rat (334 aa) initn: 1771 init1: 1771 opt: 1798 Z-score: 2063.1 expect() 4.4e-108 Smith-Waterman score: 1798; 73.273% identity in 333 aa overlap Entrez lookup Re-search database >KHRTL 1- 333:---------------------------------------------------------------------: 10 20 30 40 50 60 70 80 gi|115 MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGMNEEGWRRAVWEKNMKMIELHNQEYREGKHSFTMAMNAF :.: :.::..::: : :: ::....::: .::. : :::: ::: ::::::::::.::.::: :: .:::.::: :::: KHRTL MTPLLLLAVLCLGTALATPKFDQTFNAQWHHWKSTHRRLYGTNEEEWRRAVWEKNMRMIQLHNGEYSNGKHGFTMEMNAF 10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 160 gi|115 GDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLIS ::::.:::::..::....: .::..:::::. . :..::::::: ::::::::::::::::::.: :::::: :::.::: KHRTL GDMTNEEFRQIVNGYRHQKHKKGRLFQEPLMLQIPKTVDWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLIS 90 100 110 120 130 140 150 160 170 180 190 200 210 220 230 240 gi|115 LSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPKQEKALMKAVA :::::::::: :::.::::::::.::::...:::::::::::::: . :::: .:.:::::::::::.:::::::::: KHRTL LSEQNLVDCSHDQGNQGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDGSCKYRAEYAVANDTGFVDIPQQEKALMKAVA 170 180 190 200 210 220 230 240 250 260 270 280 290 300 310 320 gi|115 TVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRN ::::::::.::.: :. ::. 
:::.::.:::.:.::::::::::.:.:.:...::::::::::.:::: ::.:.:::: : KHRTL TVGPISVAMDASHPSLQFYSSGIYYEPNCSSKDLDHGVLVVGYGYEGTDSNKDKYWLVKNSWGKEWGMDGYIKIAKDRNN 250 260 270 280 290 300 310 320 330 gi|115 HCGIASAASYPTV :::.:.::::: : KHRTL HCGLATAASYPIVN 330 --------------------------------------------------------------------------- >>KHMSL cathepsin L (EC 3.4.22.15) precursor - mouse (334 aa) initn: 1764 init1: 1764 opt: 1769 Z-score: 2030.0 expect() 3.1e-106 Smith-Waterman score: 1769; 72.072% identity in 333 aa overlap Entrez lookup Re-search database >KHMSL 1- 333:---------------------------------------------------------------------: 10 20 30 40 50 60 70 80 gi|115 MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGMNEEGWRRAVWEKNMKMIELHNQEYREGKHSFTMAMNAF :: :.::..::: : :: ::... :.: .::. : :::: ::: ::::.:::::.::.::: :: .:.:.:.: :::: KHMSL MNLLLLLAVLCLGTALATPKFDQTFSAEWHQWKSTHRRLYGTNEEEWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMNAF 10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 160 gi|115 GDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLIS ::::.::::::.::....: .::..:::::. . :.:::::::: ::::::::::::::::::.: :::::: :::.::: KHMSL GDMTNEEFRQVVNGYRHQKHKKGRLFQEPLMLKIPKSVDWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLIS 90 100 110 120 130 140 150 160 170 180 190 200 210 220 230 240 gi|115 LSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPKQEKALMKAVA :::::::::: :::.::::::::.::::...:::::::::::::: . :::: ...:::::::::::.:::::::::: KHMSL LSEQNLVDCSHAQGNQGCNGGLMDFAFQYIKENGGLDSEESYPYEAKDGSCKYRAEFAVANDTGFVDIPQQEKALMKAVA 170 180 190 200 210 220 230 240 250 260 270 280 290 300 310 320 gi|115 TVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRN ::::::::.::.: :. ::. 
:::.::.:::...:::::.::::.:.:.:..::::::::::: :::: ::.:.:::: : KHMSL TVGPISVAMDASHPSLQFYSSGIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEWGMEGYIKIAKDRDN 250 260 270 280 290 300 310 320 330 gi|115 HCGIASAASYPTV :::.:.:::::.: KHMSL HCGLATAASYPVVN 330 --------------------------------------------------------------------------- >>I52525 testin precursor - rat (333 aa) initn: 1448 init1: 1448 opt: 1460 Z-score: 1676.6 expect() 1.5e-86 Smith-Waterman score: 1460; 61.862% identity in 333 aa overlap Entrez lookup Re-search database >I52525 1- 333:---------------------------------------------------------------------: 10 20 30 40 50 60 70 80 gi|115 MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGMNEEGWRRAVWEKNMKMIELHNQEYREGKHSFTMAMNAF : .:.:: .:: . :.. : : ::...:..:.. :.. :.:::: .:::::::.::::::: :: ::.:.:::::::: I52525 MIAVLFLAILCLEVDSTAPTPDPSLDVEWNEWRTKHGKTYNMNEERLKRAVWEKNFKMIELHNWEYLEGRHDFTMAMNAF 10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 160 gi|115 GDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLIS ::.:. :: ..:.::: .: .: ..::. : .:. ::::. ::::::::::.:.: :::::::.::::::::: ::: I52525 GDLTNIEFVKMMTGFQRQKIKKTHIFQDHQFLYVPKRVDWRQLGYVTPVKNQGHCASSWAFSATGSLEGQMFRKTERLIP 90 100 110 120 130 140 150 160 170 180 190 200 210 220 230 240 gi|115 LSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPKQEKALMKAVA ::::::.:: : . ..::.::.:.::::::.::::: .::::::.. . :.:. . :.:: ::.:: .:.::::::: I52525 LSEQNLLDCMGSNVTHGCSGGFMQYAFQYVKDNGGLATEESYPYRGQGRECRYHAENSAANVRDFVQIPGSEEALMKAVA 170 180 190 200 210 220 230 240 250 260 270 280 290 300 310 320 gi|115 TVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRN :::::::.::.: :: :: :::.::.:. ..:.:::::::::. :::.:..::::::::::::: ::.:.::: : I52525 KVGPISVAVDASHGSFQFYGSGIYYEPQCKRVHLNHAVLVVGYGFEGEESDGNSFWLVKNSWGEEWGMKGYMKLAKDWSN 250 260 270 280 290 300 310 320 330 gi|115 HCGIASAASYPTV :::::. 
..:: : I52525 HCGIATYSTYPIV 330 --------------------------------------------------------------------------- >>KHCHL cathepsin L (EC 3.4.22.15) - chicken (218 aa) initn: 1180 init1: 595 opt: 1238 Z-score: 1425.4 expect() 1.4e-72 Smith-Waterman score: 1238; 77.928% identity in 222 aa overlap Entrez lookup Re-search database >KHCHL 114- 333: ---------------------------------------------: 80 90 100 110 120 130 140 150 gi|115 TMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFR :::::::::::::::::.:::::::::::.::::::: :: KHCHL APRSVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQHFR 10 20 30 40 160 170 180 190 200 210 220 230 gi|115 KTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATE-ESCKYNPKYSVANDTGFVDIPK-Q :::.:.::::::::::: :.::.:::::::: ::::::::::.:::::::: : . :.:.:. .:..::::::::::. . KHCHL KTGKLVSLSEQNLVDCSRPEGNQGCNGGLMDQAFQYVQDNGGIDSEESYPYTAKDDEDCRYKAEYNAANDTGFVDIPQGH 50 60 70 80 90 100 110 120 240 250 260 270 280 290 300 310 gi|115 EKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGY :.:::::::.:::.:::::::: :: ::. :::.::::::::.::::::::::::. ..:::.:::::::.:: :: KHCHL ERALMKAVASVGPVSVAIDAGHSSFQFYQSGIYYEPDCSSEDLDHGVLVVGYGFEG----GKKYWIVKNSWGEKWGDKGY 130 140 150 160 170 180 190 320 330 gi|115 VKMAKDRRNHCGIASAASYPTV . :::::.::::::.::::: : KHCHL IYMAKDRKNHCGIATAASYPLV 200 210 --------------------------------------------------------------------------- >>A53810 cathepsin L (EC 3.4.22.15) precursor - flesh fly (Sarcophaga peregri (339 aa) initn: 921 init1: 593 opt: 1169 Z-score: 1343.6 expect() 5.2e-68 Smith-Waterman score: 1196; 51.471% identity in 340 aa overlap Entrez lookup Re-search database >A53810 4- 333: --------------------------------------------------------------------: 10 20 30 40 50 60 70 gi|115 MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGMN-EEGWRRAVWEKNMKMIELHNQEYREGKHSFTMAMNA :...: . : . ... .. .: .: .: . :. . :: .: ....: . : ::: . .:: :. 
...: A53810 MRTVLVALLALVALTQAISPLDLIKEEWHTYKLQHRKNYANEVEERFRMKIFNENRHKIAKHNQLFAQGKVSYKLGLNK 10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 gi|115 FGDMTSEEFRQVMNGF-----QNRKPRKGKV---FQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQM ..:: .::...:::. : . : : : . : .:.::::::.: :: ::.::.::::::::.::::::: A53810 YADMLHHEFKETMNGYNHTLRQLMRERTGLVGATYIPPAHVTVPKSVDWREHGAVTGVKDQGHCGSCWAFSSTGALEGQH 80 90 100 110 120 130 140 150 160 170 180 190 200 210 220 230 gi|115 FRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPK- :::.: :.::::::::::: ::.:::::::: ::.:..::::.:.:.:::::. ..::..: :.::::::::. A53810 FRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEGIDDSCHFNKATIGATDTGFVDIPEG 160 170 180 190 200 210 220 230 240 250 260 270 280 290 300 310 gi|115 QEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGG .:. . :::::.::.::::::.:::: .:.::.: ::.:. ...:::::::::: . . : ::::::::: :: : A53810 DEEKMKKAVATMGPVSVAIDASHESFQLYSEGVYNEPECDEQNLDHGVLVVGYGTDESGMD---YWLVKNSWGTTWGEQG 240 250 260 270 280 290 300 310 320 330 gi|115 YVKMAKDRRNHCGIASAASYPTV :.:::... :.::::.:.::::: A53810 YIKMARNQNNQCGIATASSYPTV 320 330 --------------------------------------------------------------------------- >>S53027 cathepsin L (EC 3.4.22.15) precursor - penaeid shrimp (Penaeus vanam (326 aa) initn: 1097 init1: 689 opt: 1143 Z-score: 1314.2 expect() 2.3e-66 Smith-Waterman score: 1143; 51.506% identity in 332 aa overlap Entrez lookup Re-search database >S53027 4- 333: --------------------------------------------------------------------: 10 20 30 40 50 60 70 gi|115 MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYG-MNEEGWRRAVWEKNMKMIELHNQEYREGKHSFTMAMNA .: . : ...: :: ::. :: ..:: :.: :. ..:: .: .:.:.:...:. :: ....:. .::. :: S53027 KSLAVLACVVAVAVAT----PSLRQQWQNFKAEHGRRYASVQEERYRLSVFEQNQQFIDDHNARFENGEVTFTLQMNQ 10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 gi|115 FGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLI ::::::::. .:::: . :. . . 
:..:::: :: :::::.: :::::::::.::.:::: : : :.:. S53027 FGDMTSEEIVATMNGFLGAPTRRPAAVLKADDETLPEKVDWRTKGAVTPVKDQKQCGSCWAFSTTGSLEGQHFLKDGKLV 80 90 100 110 120 130 140 150 160 170 180 190 200 210 220 230 gi|115 SLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPK-QEKALMKA ::::::::::: :: :: ::::: ::.:.. : :.:.:.:::::: . .:... . :.:::.::. . .:.:: :: S53027 SLSEQNLVDCSDKFGNMGCMGGLMDQAFRYIKANKGIDTEDSYPYEAQDGKCRFDASNVGATDTGYVDVEHGSESALKKA 160 170 180 190 200 210 220 230 240 250 260 270 280 290 300 310 gi|115 VATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDR :::.:::::.:::.. .: ::. :.: . ::: .:::::.:::: .. ... .:::::::. :: ::.::...: S53027 VATIGPISVGIDASQSTFHFYHTGVYHDDHCSSTMLDHGVLAVGYG---SDENGGDFWLVKNSWNTSWGDKGYIKMSRNR 240 250 260 270 280 290 300 310 320 330 gi|115 RNHCGIASAASYPTV :.::::: :::: : S53027 NNNCGIASQASYPLV 320 --------------------------------------------------------------------------- >>S47433 cathepsin L (EC 3.4.22.15) - Norway lobster (313 aa) initn: 1035 init1: 841 opt: 1141 Z-score: 1312.1 expect() 3e-66 Smith-Waterman score: 1141; 53.871% identity in 310 aa overlap Entrez lookup Re-search database >S47433 29- 333: ---------------------------------------------------------------: 10 20 30 40 50 60 70 gi|115 MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYG-MNEEGWRRAVWEKNMKMIELHNQEYREGKHSFTMAMNA : ..:....: :: .:: .:. :...: ...: :.....:. .: .::: S47433 CGLALATASPSWEHFKTQYGRKYGDAKEELYRQRVFQQNEQLVEAFNKKFENGEVTFKVAMNQ 10 20 30 40 50 60 80 90 100 110 120 130 140 150 gi|115 FGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEAPR----SVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKT :::::.::: ::.:. .: .:. .: : .:::: :: :::::.::::::::::::::.:::: : :. S47433 FGDMTNEEFNAVMKGY--KKGSRGE--PTTVFTAEGRPMAADVDWRTKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKN 70 80 90 100 110 120 130 160 170 180 190 200 210 220 230 gi|115 GRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPKQEKAL ..:.:::::.::::: ::.::.:: : ::.:..::::.:.: :::::: ..::... . :. ::::.. . 
:.:: S47433 NELVSLSEQELVDCSTEYGNDGCGGGWMTSAFDYIKDNGGIDTESSYPYEAQDRSCRFDANSIGATCTGFVEVQHTEEAL 140 150 160 170 180 190 200 210 240 250 260 270 280 290 300 310 gi|115 MKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMA .::. .:::::::::.: :: ::. :.:.: :: ..:::::.:::: ::::. ::::::::: :: .::.::. S47433 HEAVSDIGPISVAIDASHFSFQFYSSGVYYEKKCSPTNLDHGVLAVGYGTESTED----YWLVKNSWGSGWGDAGYIKMS 220 230 240 250 260 270 280 290 320 330 gi|115 KDRRNHCGIASAASYPTV ..: :.::::: ::::: S47433 RNRDNNCGIASEPSYPTV 300 310 --------------------------------------------------------------------------- >>JC5443 cathepsin L-like cysteine proteinase (EC 3.4.-.-) c1 - Maize weevil (338 aa) initn: 891 init1: 590 opt: 1139 Z-score: 1309.4 expect() 4.2e-66 Smith-Waterman score: 1190; 51.312% identity in 343 aa overlap Entrez lookup Re-search database >JC5443 1- 333:---------------------------------------------------------------------: 10 20 30 40 50 60 70 gi|115 MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLY-GMNEEGWRRAVWEKNMKMIELHNQEYREGKHSFTMAMNA :. ::::: :. ...: .. ::...: .:.. : . .:: .: .. .: . . ::. . .: .: ...: JC5443 MKLFLILAAVV--ISCQAVSFYDLVQEQWSSFKMQHSKNYDSETEERFRMKIFMENAHKVAKHNKLFSQGFVKFKLGLNK 10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 gi|115 FGDMTSEEFRQVMNGFQNRKPR--KGK------VFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQM ..:: .:: ...:::.. : ::. : : . : .::::.:: :: ::.::.:::::.:::::.:::: JC5443 YADMLHHEFVSTLNGFNKTKNNILKGSDLNDAVRFISPANVKLPDTVDWRDKGAVTEVKDQGHCGSCWSFSATGSLEGQH 80 90 100 110 120 130 140 150 160 170 180 190 200 210 220 230 gi|115 FRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPK- :::::.:.:::::::::::: ::.:::::::: ::.:..::::.:.:.:::: : .:.:.:. . : :.: ::::: . 
JC5443 FRKTGKLVSLSEQNLVDCSGRYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYLAEDEKCHYKAQNSGATDKGFVDIEEA 160 170 180 190 200 210 220 230 240 250 260 270 280 290 300 310 gi|115 QEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGG .: : :::::::.:.::::.::.: .:..:.: .:.:::...:::::::::: : .:.. ::::::::: ::..: JC5443 NEDDLKAAVATVGPVSIAIDASHETFQLYSDGVYSDPECSSQELDHGVLVVGYG---TSDDGQDYWLVKNSWGPSWGLNG 240 250 260 270 280 290 300 310 320 330 gi|115 YVKMAKDRRNHCGIASAASYPTV :.:::... : ::.:: :::: : JC5443 YIKMARNQDNMCGVASQASYPLV 320 330 --------------------------------------------------------------------------- >>JC2476 cathepsin K (EC 3.4.22.-) precursor - human (329 aa) initn: 871 init1: 321 opt: 1131 Z-score: 1300.4 expect() 1.3e-65 Smith-Waterman score: 1131; 50.909% identity in 330 aa overlap Entrez lookup Re-search database >JC2476 7- 331: --------------------------------------------------------------------: 10 20 30 40 50 60 70 gi|115 MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGMN-EEGWRRAVWEKNMKMIELHNQEYREGKHSFTMAMNA : .. : ..: .: .. :...: :: : . :. . .: :: .::::.:.: .:: : : :.. .::: JC2476 MWGLKVLLLPVVSFALYPEEILDTHWELWKKTHRKQYNNKVDEISRRLIWEKNLKYISIHNLEASLGVHTYELAMNH 10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 gi|115 FGDMTSEEFRQVMNGFQN--RKPRKGKVFQEPLFY-EAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTG .::::::: : :.:.. . :.. .. : . .:: :::.:.::::::::::::::::::::..::::::. .::: JC2476 LGDMTSEEVVQKMTGLKVPLSHSRSNDTLYIPEWEGRAPDSVDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTG 80 90 100 110 120 130 140 150 160 170 180 190 200 210 220 230 gi|115 RLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPK-QEKAL .:..:: :::::: . :.::.:: : :::::: : :.:::..::: . :::: ::: ..:. :. .::. .:::: JC2476 KLLNLSPQNLVDCVSE--NDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQEESCMYNPTGKAAKCRGYREIPEGNEKAL 160 170 180 190 200 210 220 230 240 250 260 270 280 290 300 310 gi|115 MKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMA .::: :::.::::::. :: ::..:.:.. 
.:.:....:.::.::::... .::.:..::::::.:: ::. :: JC2476 KRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNSDNLNHAVLAVGYGIQK----GNKHWIIKNSWGENWGNKGYILMA 240 250 260 270 280 290 300 310 320 330 gi|115 KDRRNHCGIASAASYPTV ... : ::::. ::.: JC2476 RNKNNACGIANLASFPKM 320 --------------------------------------------------------------------------- >>S19651 cysteine proteinase (EC 3.4.22.-) precursor (clone LCP3) - American (320 aa) initn: 892 init1: 892 opt: 1121 Z-score: 1289.1 expect() 5.7e-65 Smith-Waterman score: 1121; 51.672% identity in 329 aa overlap Entrez lookup Re-search database >S19651 6- 333: --------------------------------------------------------------------: 10 20 30 40 50 60 70 gi|115 MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYG-MNEEGWRRAVWEKNMKMIELHNQEYREGKHSFTMAMNA . : : :.: :: . .: ..:....: :: .:: .:. :...: ..:: :.....:. .: .::: S19651 KVAALFLCGLALATAS------PSWDHFKTQYGRKYGDAKEELYRQRVFQQNEQLIEDFNKKFENGEVTFKVAMNQ 10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 gi|115 FGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLI :::::.::: ::.:... . . :. :.:::: :. :::::.: ::::::::::::::::: : :. .:. S19651 FGDMTNEEFNAVMKGYKKGSRGEPKAVFTAEGRPMARDVDWRTKALVTPVKDQEQCGSCWAFSATGALEGQHFLKNDELV 80 90 100 110 120 130 140 150 160 170 180 190 200 210 220 230 gi|115 SLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPKQEKALMKAV :::::.::::: ::.::.:: : ::.:..::::.:.: :::::: ..::... . : :: :.. . :.::..:: S19651 SLSEQQLVDCSTDYGNDGCGGGWMTSAFDYIKDNGGIDTESSYPYEAEDRSCRFDANSIGAICTGSVEVQHTEEALQEAV 160 170 180 190 200 210 220 230 240 250 260 270 280 290 300 310 gi|115 ATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDRR . ::::::::::.: :: ::. :.:.: .:: .:::::.:::: :::.. 
::::::::: :: .::.::...: S19651 SGVGPISVAIDASHFSFQFYSSGVYYEQNCSPTFLDHGVLAVGYGTESTKD----YWLVKNSWGSSWGDAGYIKMSRNRD 240 250 260 270 280 290 300 320 330 gi|115 NHCGIASAASYPTV :.::::: ::::: S19651 NNCGIASEPSYPTV 310 320 --------------------------------------------------------------------------- >>A49868 probable cysteine proteinase OC-2 precursor, osteoclast - rabbit (329 aa) initn: 841 init1: 280 opt: 1120 Z-score: 1287.8 expect() 6.7e-65 Smith-Waterman score: 1120; 50.000% identity in 330 aa overlap Entrez lookup Re-search database >A49868 7- 331: --------------------------------------------------------------------: 10 20 30 40 50 60 70 gi|115 MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGMN-EEGWRRAVWEKNMKMIELHNQEYREGKHSFTMAMNA : .. : ..: .: .. :..:: :: ... :. . .: :: .::::.: : .:: : : :.. .::: A49868 MWGLKVLLLPVVSFALHPEEILDTQWELWKKTYSKQYNSKVDEISRRLIWEKNLKHISIHNLEASLGVHTYELAMNH 10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 gi|115 FGDMTSEEFRQVMNGFQ---NRKPRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTG .::::::: : :.:.. .:. . .. ..: :.:.:.::::::::::::::::::::..::::::. .::: A49868 LGDMTSEEVVQKMTGLKVPPSRSHSNDTLYIPDWEGRTPDSIDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTG 80 90 100 110 120 130 140 150 160 170 180 190 200 210 220 230 gi|115 RLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPK-QEKAL .:..:: :::::: . : ::.:: : :::::: : :.:::..::: . .::: ::: ..:. :. .::. .:::: A49868 KLLNLSPQNLVDCVSE--NYGCGGGYMTNAFQYVQRNRGIDSEDAYPYVGQDESCMYNPTGKAAKCRGYREIPEGNEKAL 160 170 180 190 200 210 220 230 240 250 260 270 280 290 300 310 gi|115 MKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMA .::: :::.::::::. :: ::..:.:.. .:::....:.::.::::... .::.:..:::::: :: ::. :: A49868 KRAVARVGPVSVAIDASLTSFQFYSKGVYYDENCSSDNVNHAVLAVGYGIQK----GNKHWIIKNSWGESWGNKGYILMA 240 250 260 270 280 290 300 310 320 330 gi|115 KDRRNHCGIASAASYPTV ... : ::::. 
::.: A49868 RNKNNACGIANLASFPKM 320 --------------------------------------------------------------------------- >>A42482 cathepsin S (EC 3.4.22.27) - human (331 aa) initn: 890 init1: 282 opt: 1090 Z-score: 1253.5 expect() 5.5e-63 Smith-Waterman score: 1090; 49.085% identity in 328 aa overlap Entrez lookup Re-search database >A42482 15- 333: ------------------------------------------------------------------: 10 20 30 40 50 60 70 gi|115 MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGM-NEEGWRRAVWEKNMKMIELHNQEYREGKHSFTMAMNA : : : : .:. .: :: ... : :::. :: .::::.:.. ::: :. : ::. ..:: A42482 MKRLVCVLLVCSSAVAQLHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNH 10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 gi|115 FGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEA------PRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFR .::::::: .. ... : : . .:. . :.. : :::::::: :: :: ::.::.::::::.::::.:. A42482 LGDMTSEEVMSLTSSL--RVPSQ---WQRNITYKSNPNRILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKL 80 90 100 110 120 130 140 150 160 170 180 190 200 210 220 230 gi|115 KTGRLISLSEQNLVDCSGPQ-GNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPK-Q :::.:..:: ::::::: . ::.:::::.: ::::. :: :.::. ::::.: ...:.:. :: .:. . ....: . A42482 KTGKLVTLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMDQKCQYDSKYRAATCSKYTELPYGR 160 170 180 190 200 210 220 230 240 250 260 270 280 290 300 310 gi|115 EKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGY : .: .:::. ::.::..:: : ::..:. :.:.::.:. ....::::::::: . ....:::::::::...: :: A42482 EDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCT-QNVNHGVLVVGYG----DLNGKEYWLVKNSWGHNFGEEGY 240 250 260 270 280 290 300 320 330 gi|115 VKMAKDRRNHCGIASAASYPTV ..::... ::::::: ::: . 
A42482 IRMARNKGNHCGIASFPSYPEI 310 320 330 --------------------------------------------------------------------------- >>S19650 cysteine proteinase (EC 3.4.22.-) precursor (clone LCP2) - American (323 aa) initn: 1037 init1: 499 opt: 1083 Z-score: 1245.6 expect() 1.5e-62 Smith-Waterman score: 1083; 51.456% identity in 309 aa overlap Entrez lookup Re-search database >S19650 29- 333: ---------------------------------------------------------------: 10 20 30 40 50 60 70 gi|115 MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLY-GMNEEGWRRAVWEKNMKMIELHNQEYREGKHSFTMAMNA : ..:. ..: : .:...::...:.:.:.:: :..:..:. .:..::: S19650 MKVAVLFLCGVALAAASPSWEHFKGKYGRQYVDAEEDSYRRVIFEQNQKYIEEFNKKYENGEVTFNLAMNK 10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 gi|115 FGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEAPRS--VDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGR ::::: ::: ::.: :. .:: : .:.. :::: :: :::::.:::::::::::.::.:::: : ::: S19650 FGDMTLEEFNAVMKGNIPRRSAPVSVFY-PKKETGPQATEVDWRTKGAVTPVKDQGQCGSCWAFSTTGSLEGQHFLKTGS 80 90 100 110 120 130 140 150 160 170 180 190 200 210 220 230 gi|115 LISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPK-QEKALM ::::.::.::::: : : .::::: :. ::.:.. :.:.:.: .::::: . ::... . .:. .: ..: . .: .:. S19650 LISLAEQQLVDCSRPYGPQGCNGGWMNDAFDYIKANNGIDTEAAYPYEARDGSCRFDSNSVAATCSGHTNIASGSETGLQ 160 170 180 190 200 210 220 230 240 250 260 270 280 290 300 310 gi|115 KAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMAK .:: .:::::.:::.: :: ::. :.:.::.:: .::.::.:::: :. .. .:::::::. :: .::.::.. 
S19650 QAVRDIGPISVTIDAAHSSFQFYSSGVYYEPSCSPSYLDHAVLAVGYGSEGGQD----FWLVKNSWATSWGDAGYIKMSR 240 250 260 270 280 290 300 320 330 gi|115 DRRNHCGIASAASYPTV .: :.::::..:::: : S19650 NRNNNCGIATVASYPLV 310 320 --------------------------------------------------------------------------- >>S19649 cysteine proteinase (EC 3.4.22.-) precursor (clone LCP1) - American (322 aa) initn: 827 init1: 327 opt: 1081 Z-score: 1243.3 expect() 2e-62 Smith-Waterman score: 1081; 49.850% identity in 333 aa overlap Entrez lookup Re-search database >S19649 6- 333: --------------------------------------------------------------------: 10 20 30 40 50 60 70 gi|115 MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLY-GMNEEGWRRAVWEKNMKMIELHNQEYREGKHSFTMAMNA ..: : .:.: : . . .: ..:. .: : ..:: .: :. :...:: :..:..:. ....:.: S19649 MKVVALFLFGLALA------AANPSWEEFKGKFGRKYVDLEEERYRLNVFLDNLQYIEEFNKKYERGEVTYNLAINQ 10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 gi|115 FGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEAPRS--VDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGR :.:::.:.: ::.:.. . :: . :: ::.: :::: :: :::::.:::::::::::.::..::: : :::: S19649 FSDMTNEKFNAVMKGYK-KGPRPAAVFTST--DAAPESTEVDWRTKGAVTPVKDQGQCGSCWAFSTTGGIEGQHFLKTGR 80 90 100 110 120 130 140 160 170 180 190 200 210 220 230 gi|115 LISLSEQNLVDCSGPQG-NEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPK-QEKAL :.:::::.::::.: . :.::::: .. :..::.::::.:.: :::::: ...:..: . :. ::.: : . .:.:: S19649 LVSLSEQQLVDCAGGSYYNQGCNGGWVERAIMYVRDNGGVDTESSYPYEARDNTCRFNSNTIGATCTGYVGIAQGSESAL 150 160 170 180 190 200 210 220 240 250 260 270 280 290 300 310 gi|115 MKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMA :. .:::::::::.:.:: : :.:.::.::: ..::.::.:::: :. .. .:::::::. :: .::.::: S19649 KTATRDIGPISVAIDASHRSFQSYYTGVYYEPSCSSSQLDHAVLAVGYGSEGGQD----FWLVKNSWATSWGESGYIKMA 230 240 250 260 270 280 290 300 320 330 gi|115 KDRRNHCGIASAASYPTV ..: :.::::. 
: :::: S19649 RNRNNNCGIATDACYPTV 310 320 --------------------------------------------------------------------------- >>A45087 cathepsin S (EC 3.4.22.27) - rat (330 aa) initn: 831 init1: 295 opt: 1060 Z-score: 1219.2 expect() 4.5e-61 Smith-Waterman score: 1060; 50.915% identity in 328 aa overlap Entrez lookup Re-search database >A45087 13- 333: ------------------------------------------------------------------: 10 20 30 40 50 60 70 gi|115 MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLY-GMNEEGWRRAVWEKNMKMIELHNQEYREGKHSFTMAMN : .. :.:: .: :: . : .::: :: .::::.:.: ::: :. : ::....:: A45087 MAVLGAPGVLCDNGATAERPTLDH----HWDLWKKTRMRRNTDQNEEDVRRLIWEKNLKFIMLHNLEHSMGMHSYSVGMN 10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 gi|115 AFGDMTSEEFRQVMNGFQNRKP--RKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTG .:::: :: :.... .: :.: ... : :::::::: :: :: ::.::::::::: ::::::. ::: A45087 HMGDMTPEEVIGYMGSLRIPRPWNRSG-TLKSSSNQTLPDSVDWREKGCVTNVKYQGSCGSCWAFSAEGALEGQLKLKTG 80 90 100 110 120 130 140 150 160 170 180 190 200 210 220 230 gi|115 RLISLSEQNLVDCSGPQ--GNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIP-KQEK .:.::: ::::::: . ::.::.::.: ::::. :.. .::: ::::.: .:.: :.:: .:. . ....: .:. A45087 KLVSLSAQNLVDCSTEEKYGNKGCGGGFMTEAFQYIIDTS-IDSEASYPYKAMDEKCLYDPKNRAATCSRYIELPFGDEE 160 170 180 190 200 210 220 230 240 250 260 270 280 290 300 310 gi|115 ALMKAVATVGPISVAID-AGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYV :: .:::: ::.::.:: :.: ::..:. :.: .:.:. :.:.::::::::: :.. ::::::::: ..: ::. A45087 ALKEAVATKGPVSVGIDDASHSSFFLYQSGVYDDPSCT-ENMNHGVLVVGYG----TLDGKDYWLVKNSWGLHFGDQGYI 240 250 260 270 280 290 300 320 330 gi|115 KMAKDRRNHCGIASAASYPTV .::.. .::::::: ::: . 
A45087 RMARNNKNHCGIASYCSYPEI 310 320 330 --------------------------------------------------------------------------- >>JX0366 cysteine endopeptidase (EC 3.4.22.-) precursor - silkworm (344 aa) initn: 1115 init1: 585 opt: 1057 Z-score: 1215.5 expect() 7.2e-61 Smith-Waterman score: 1162; 50.435% identity in 345 aa overlap Entrez lookup Re-search database >JX0366 5- 333: --------------------------------------------------------------------: 10 20 30 40 50 60 70 gi|115 MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGMN-EEGWRRAVWEKNMKMIELHNQEYREGKHSFTMAMNA :.: .. .::. :: : .:. .: .: : . :...: .. .. ..: :::.:. : :. ..::. JX0366 MKCLVLLLCAVAAVSAVQFFDLVKE-EWSAFKLQHRLNYKSEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNS 10 20 30 40 50 60 70 80 90 100 110 120 130 140 gi|115 F---GDMTSEEFRQVMNGF-------QNRKPRKGKV----FQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATG . ::: .:: ..:::: .: . :.: : : . :..::::..: :: .:.::.:::::.::.:: JX0366 WWEHGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQGKCGSCWSFSTTG 80 90 100 110 120 130 140 150 150 160 170 180 190 200 210 220 gi|115 ALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGF ::::: ::..: :.:::::::.::: ::.:::::::: ::.:..::::.:.:..::::.....:.:::: . :.:.:: JX0366 ALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKDNGGIDTEQAYPYEGVDDKCRYNPKNTGAEDVGF 160 170 180 190 200 210 220 230 230 240 250 260 270 280 290 300 gi|115 VDIPK-QEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGE ::::. .:. ::.:::::::.::::::.: : .:. :.: : .::: :.:::::::::: :. .. :::::::::. JX0366 VDIPEGDEQKLMEAVATVGPVSVAIDASHTHFQLYSSGVYNEEECSSTDLDHGVLVVGYG---TDEQGVDYWLVKNSWGR 240 250 260 270 280 290 300 310 310 320 330 gi|115 EWGMGGYVKMAKDRRNHCGIASAASYPTV :: ::.:: ... 
:.:::::.:::: : JX0366 SWGELGYIKMIRNKNNRCGIASSASYPLV 320 330 340 --------------------------------------------------------------------------- >>JC5442 cathepsin L-like cysteine proteinase (EC 3.4.-.-) g3 - Maize weevil (331 aa) initn: 758 init1: 581 opt: 1053 Z-score: 1211.1 expect() 1.3e-60 Smith-Waterman score: 1104; 49.107% identity in 336 aa overlap Entrez lookup Re-search database >JC5442 1- 326:---------------------------------------------------------------------: 10 20 30 40 50 60 70 gi|115 MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLY-GMNEEGWRRAVWEKNMKMIELHNQEYREGKHSFTMAMNA :. ::::: :. ...: .. ::...: .:.. : . .:: .: .. .: . . :.. . .: .: ...: JC5442 MKLLLILAAVV--ISCQAVSFYDLVQEQWSSFKMQHSKNYDSETEERFRMKIFMENDHKVAKHSKLFSQGFVKFKLGLNK 10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 gi|115 FGDMTSEEFRQVMNGFQNRKPR--KGK------VFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQM ..:: .:: ...:::.. : ::. : : . : .::::.:: :: ::.::.:::::.::..:.:::: JC5442 YADMLHHEFVSTLNGFNKTKNNILKGSDLNDAVRFISPANVKLPDTVDWRDKGAVTKVKDQGHCGSCWSFSGSGSLEGQH 80 90 100 110 120 130 140 150 160 170 180 190 200 210 220 230 gi|115 FRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPK- :::::.:.:::::::::::: :: :::::::: ::.:..::::.:.:.:::: : .:.:.:. . : :.: ::::: . JC5442 FRKTGKLVSLSEQNLVDCSGRYGNTGCNGGLMDNAFRYIKDNGGIDTEQSYPYLAEDEKCHYKTQNSGATDKGFVDIEEG 160 170 180 190 200 210 220 230 240 250 260 270 280 290 300 310 gi|115 QEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGG .: : :::::::.:.::::..:.: .:..:.: .:.:::...:::::::::: : .:.. :::::::: :..: JC5442 NEDDLKAAVATVGPVSIAIDASYETFQLYSDGVYSDPECSSQELDHGVLVVGYG---TSDDGQDYWLVKNSWRPSCGLNG 240 250 260 270 280 290 300 310 320 330 gi|115 YVKMAKDRRNHCGIASAASYPTV :.:::... 
: ::.:: JC5442 YIKMARNQDNMCGVAS 320 330 --------------------------------------------------------------------------- >>JC5441 cathepsin L-like cysteine proteinase (EC 3.4.-.-) g2 - Maize weevil (331 aa) initn: 759 init1: 582 opt: 1050 Z-score: 1207.7 expect() 1.9e-60 Smith-Waterman score: 1101; 49.107% identity in 336 aa overlap Entrez lookup Re-search database >JC5441 1- 326:---------------------------------------------------------------------: 10 20 30 40 50 60 70 gi|115 MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLY-GMNEEGWRRAVWEKNMKMIELHNQEYREGKHSFTMAMNA :. ::::: :. ...: .. ::...: .:.. : . .:: .: .. .: . . :.. . .: .: ...: JC5441 MKLLLILAAVV--ISCQAVSFYDLVQEQWSSFKMQHSKNYDSETEERFRMKIFMENAHKVAKHSKLFSQGFVKFKLGLNK 10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 gi|115 FGDMTSEEFRQVMNGFQNRKPR--KGK------VFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQM ..:: .:: ...:::.. : ::. : : . : .::::.:: :: ::.::.:::::.::..:.:::: JC5441 YADMLHHEFVSTLNGFNKTKNNILKGSDLNDAVRFISPANVKLPDTVDWRDKGAVTKVKDQGHCGSCWSFSGSGSLEGQH 80 90 100 110 120 130 140 150 160 170 180 190 200 210 220 230 gi|115 FRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPK- :::::.:.:::::::::::: ::.:::::::: ::.:..::::.:.:.:::: : .:.:.:. . : :.: ::::: . JC5441 FRKTGKLVSLSEQNLVDCSGRYGNNGCNGGLMDNAFRYIKDNGGIDTEQSYPYLAEDEKCHYKTQNSGATDKGFVDIEEG 160 170 180 190 200 210 220 230 240 250 260 270 280 290 300 310 gi|115 QEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGG .: : :::::::::.::::..:.: .:..:.: .:.: :...:::::::::: : .:.. :::::::: :..: JC5441 NEDDLKAAVATVGPISIAIDASYETFQLYSDGVYSDPECISQELDHGVLVVGYG---TSDDGQDYWLVKNSWRPSCGLNG 240 250 260 270 280 290 300 310 320 330 gi|115 YVKMAKDRRNHCGIASAASYPTV :.:::... 
: ::.:: JC5441 YIKMARNQDNMCGVAS 320 330 --------------------------------------------------------------------------- >>S47432 cathepsin L (EC 3.4.22.15) - Norway lobster (324 aa) initn: 871 init1: 294 opt: 1045 Z-score: 1202.1 expect() 4e-60 Smith-Waterman score: 1045; 46.687% identity in 332 aa overlap Entrez lookup Re-search database >S47432 6- 333: --------------------------------------------------------------------: 10 20 30 40 50 60 70 gi|115 MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLY-GMNEEGWRRAVWEKNMKMIELHNQEYREGKHSFTMAMNA ..: : .:.: : . . .: ..:. .: : ..:: .: :. :...:: :..:. :. ....:.: S47432 MKVVALFLFGLALA------AANPSWEEFKGKFGRKYVDLEEERYRLNVFLDNLQYIEEFNKKYESGEVTYNLAINQ 10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 gi|115 FGDMTSEEFRQVMNGFQNR-KPRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRL :.:.:..:: ..:.:... .:. :: :::: :: :: ::.::::::::::::::.:::: : : :.: S47432 FSDLTNDEFNSMMKGYKTSLRPKPVAVFTSTDAAPETTEVDWRTKGCVTHVKDQGQCGSCWAFSATGSLEGQHFLKYGEL 80 90 100 110 120 130 140 150 160 170 180 190 200 210 220 230 gi|115 ISLSEQNLVDCSGP-QGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPK-QEKALM .::.::.::::.: :.::::: .. ::.:.. :::.:.: :::::: ...:..: . .:. .:::.: . .:. . S47432 VSLAEQQLVDCAGGIYYNQGCNGGWVNQAFKYIKANGGIDTESSYPYEARDNTCRFNSNSVAATCSGFVSIAQGSESPEV 160 170 180 190 200 210 220 230 240 250 260 270 280 290 300 310 gi|115 KAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMAK . ....:::::::::.:.:: :. :.:.::.::: ..::.::.:::: :. .. .:::::::: :: .::..::. S47432 RRTTNTGPISVAIDAAHRSFQSYSSGVYYEPSCSSSQLDHAVLAVGYGSEGGQD----FWLVKNSWGTSWGSAGYINMAR 240 250 260 270 280 290 300 320 330 gi|115 DRRNHCGIASAASYPTV .: :.::::. 
:::::: S47432 NRNNNCGIATDASYPTV 310 320 --------------------------------------------------------------------------- >>S67481 cysteine proteinase CP1 - fruit fly (Drosophila melanogaster) (fragm (218 aa) initn: 975 init1: 576 opt: 1011 Z-score: 1165.8 expect() 4.2e-58 Smith-Waterman score: 1011; 63.182% identity in 220 aa overlap Entrez lookup Re-search database >S67481 115- 333: ---------------------------------------------: 80 90 100 110 120 130 140 150 gi|115 MAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRK :.::::: :: :: ::.::.::::::::.::::::: ::: S67481 LPKSVDWRTKGAVTAVKDQGHCGSCWAFSSTGALEGQHFRK 10 20 30 40 160 170 180 190 200 210 220 230 gi|115 TGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPK-QEK .: :.::::::::::: ::.:::::::: :: :..::::.:.:.:::::: ..::..: :.: ::.:::. .:: S67481 SGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFPYIKDNGGIDTEKSYPYEAIDDSCHFNRAQVGATDRGFTDIPQGDEK 50 60 70 80 90 100 110 120 240 250 260 270 280 290 300 310 gi|115 ALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVK . . : ::::.::::::.:::: ::.::.: ::.:.....::::::::.: . . : ::::::::: :: :..: S67481 KMPEPVPTVGPVSVAIDASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGED---YWLVKNSWGTTWGDKGFIK 130 140 150 160 170 180 190 320 330 gi|115 MAKDRRNHCGIASAASYPTV : ....:.::::: .::: : S67481 MLRNKENQCGIASPSSYPLV 200 210 --------------------------------------------------------------------------- >>S43991 cathepsin L-like proteinases - liver fluke (326 aa) initn: 704 init1: 371 opt: 941 Z-score: 1083.1 expect() 1.7e-53 Smith-Waterman score: 941; 43.284% identity in 335 aa overlap Entrez lookup Re-search database >S43991 5- 333: --------------------------------------------------------------------: 10 20 30 40 50 60 70 80 gi|115 MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGMNEEGWRRAVWEKNMKMIELHNQEYREGKHSFTMAMNAF .:::.. .:. ... . : .:: :.:. :. .. :: .::::.: :. :: .. 
: ..:...: : S43991 MRLFILAVLTVGVLGSN-------DDLWHQWKRMYNKEYNGADDQHRRNIWEKNVKHIQEHNLRHDLGLVTYTLGLNQF 10 20 30 40 50 60 70 90 100 110 120 130 140 150 gi|115 GDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEA-----PRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKT ::: :::. . .. : . .... . ::: : ..::::.:::: ::.::.::::::::.::..:::.... S43991 TDMTFEEFKA---KYLTEMSRASDILSHGVPYEANNRAVPDKIDWRESGYVTEVKDQGNCGSCWAFSTTGTMEGQYMKNE 80 90 100 110 120 130 140 160 170 180 190 200 210 220 230 gi|115 GRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPK-QEKA ::.:::.::::::: ::.::.::::. :.::... : :..: :::: :.: .:.:: . .::. ::. . . .: S43991 RTSISFSEQQLVDCSGPWGNNGCSGGLMENAYQYLKQFG-LETESSYPYTAVEGQCRYNKQLGVAKVTGYYTVHSGSEVE 150 160 170 180 190 200 210 220 240 250 260 270 280 290 300 310 gi|115 LMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKM : . :.. : .::.:. . .:..:. ::: :: ..:.::.:::: .. .. ::.:::::: :: ::..: S43991 LKNLVGARRPAAVAVDV-ESDFMMYRSGIYQSQTCSPLRVNHAVLAVGYGTQG----GTDYWIVKNSWGTYWGERGYIRM 230 240 250 260 270 280 290 300 320 330 gi|115 AKDRRNHCGIASAASYPTV :..: : ::::: :: : : S43991 ARNRGNMCGIASLASLPMVARFP 310 320 --------------------------------------------------------------------------- >>S44151 cathepsin L (EC 3.4.22.15) - fluke (Schistosoma mansoni) (317 aa) initn: 666 init1: 390 opt: 901 Z-score: 1037.6 expect() 5.8e-51 Smith-Waterman score: 901; 44.051% identity in 311 aa overlap Entrez lookup Re-search database >S44151 29- 331: ---------------------------------------------------------------: 10 20 30 40 50 60 70 80 gi|115 MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGMNEEGWRRAVWEKNMKMIELHNQEYREGKHSFTMAMNAF : .:: .:. :. ..: :.:.. . .. :. :: .. : ...::..: : S44151 VAIAQHLSLQYDDIWKQWKLKYNKTYSDSNEIRRKAIFMRYVEKIQQHNLRHDLGLEGYTMGLNQF 10 20 30 40 50 60 90 100 110 120 130 140 150 gi|115 GDMTSEEFRQVM-------NGFQNRKPRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFR :: ::.. .: . . . : .. .. ..:: : . :::..: :::::::: :::::::::.::.:::. . 
S44151 CDMDWEEIKTIMLSKVFGNSPLWDDKKEELELSNDPL----PSKWDWRDHGAVTPVKNQGLCGSCWAFSAAGAVEGQLVK 70 80 90 100 110 120 130 140 160 170 180 190 200 210 220 230 gi|115 KTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIP-KQE : .:::::::.::::: ::.::.:: :: .: :.. ..::..: : . . ::.. . .:.. :::.: ..: S44151 KHKKLISLSEQQLVDCSYKYGNDGCQGGTMDQSFAYLEKYP-IESEKDYKYIGHDSSCHFRKSKGVVKVKKFVDLPARDE 150 160 170 180 190 200 210 220 240 250 260 270 280 290 300 310 gi|115 KALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYV . :.::. ::::::::: .....:: ::: .::: ..::::.:::: :. .. :::.::::: :::.:: S44151 EKLQKALYHYGPISVAIDA-LDDLILYKSGIYESKQCSSFLLNHGVLAVGYGRENRKD----YWLIKNSWGTTWGMNGYF 230 240 250 260 270 280 290 320 330 gi|115 KMAKDRRNHCGIASAASYPTV :. ....: ::::. ::.: S44151 KLRRNKHNMCGIATNASFPLL 300 310 --------------------------------------------------------------------------- >>I58002 cathepsin-related protein - rat (fragment) (236 aa) initn: 868 init1: 623 opt: 898 Z-score: 1036.1 expect() 7.1e-51 Smith-Waterman score: 898; 57.534% identity in 219 aa overlap Entrez lookup Re-search database >I58002 115- 333: ---------------------------------------------: 80 90 100 110 120 130 140 150 gi|115 MAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRK : :::..::::::.:::.:::::::.:.::.:::: : I58002 NPAAVTNPSAQKQVSIGLPNFKDWRKEGYVTPVRNQGKCGSCWAFAAVGAIEGQMSLK 10 20 30 40 50 160 170 180 190 200 210 220 230 gi|115 TGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPKQEKA :: : :: :::.: .. .: : : ::.:: : ::..: .::::. . :.:. . . :: ::::..: .: I58002 TGNLTPLSAQNLLDTKS-EGI-GLPWGTAHQAFNYVLKNKGLEAEATYPYEGKDGPCRYHSENASANITGFVNLPPNELY 60 70 80 90 100 110 120 130 240 250 260 270 280 290 300 310 gi|115 LMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKM : :::..::.:.::::.:.:: ::. :.: ::.::: ..:.:::::::::..:.:.:.:::.:::::::::..:..:. 
I58002 LWVAVASIGPVSAAIDASHDSFRFYSGGVYHEPNCSSYVVNHAVLVVGYGFEGNETDGNNYWLIKNSWGEEWGINGFMKI 140 150 160 170 180 190 200 210 320 330 gi|115 AKDRRNHCGIASAASYPTV :::: ::::::: ::.: . I58002 AKDRNNHCGIASQASFPDIF 220 230 --------------------------------------------------------------------------- >>S57777 cysteine proteinase (EC 3.4.22.-) precursor - Hemerocallis x hybrida (360 aa) initn: 653 init1: 245 opt: 894 Z-score: 1028.7 expect() 1.8e-50 Smith-Waterman score: 894; 46.012% identity in 326 aa overlap Entrez lookup Re-search database >S57777 19- 331: -----------------------------------------------------------------: 10 20 30 40 50 60 gi|115 MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGMNEEGWRRAVWEKNMKMIELHNQEYREG :. . :: . ::.. :. ..:.. : :...:.:.:. ::. S57777 MAKPKFIALALVALSFLSIAQSIPFTEKDLASEDSLWNLYEKWRTHHTVARDLDEKNRRFNVFKENVKFIHEFNQKKDA- 10 20 30 40 50 60 70 70 80 90 100 110 120 130 140 gi|115 KHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQE---PLFYEA----PR-SVDWREKGYVTPVKNQGQCGSCWAF . .:.: :::::..:::. . : . .. :. . .:. ..:: : :.::: :: :: ::.:::::::::: S57777 --PYKLALNKFGDMTNQEFRSKYAGSKIQHHRSQRGIQKNTGSFMYENVGSLPAASIDWRAKGAVTGVKDQGQCGSCWAF 80 90 100 110 120 130 140 150 150 160 170 180 190 200 210 220 gi|115 SATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYS-VA :. ...:: :::.:.:::::.::::. :::::::::::::...: :: . .:.:::: . .: : : :. S57777 STIASVEGINQIKTGELVSLSEQELVDCDTSY-NEGCNGGLMDYAFEFIQKNG-ITTEDSYPYAEQDGTCASNLLNSPVV 160 170 180 190 200 210 220 230 230 240 250 260 270 280 290 gi|115 NDTGFVDIP-KQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVK . : :.: ..:.:::.:::. ::::.:.:. .: ::.::. : :..: .:::: .:::: . :..:::.:: S57777 SIDGHQDVPANNENALMQAVAN-QPISVSIEASGYGFQFYSEGV-FTGRCGTE-LDHGVAIVGYG---ATRDGTKYWIVK 240 250 260 270 280 290 300 300 310 320 330 gi|115 NSWGEEWGMGGYVKMAK---DRRNHCGIASAASYPTV :::::::: .::..: . 
:.:..:::: :::: S57777 NSWGEEWGESGYIRMQRGISDKRGKCGIAMEASYPIKTSANPKNSSTRDEL 310 320 330 340 350 360 --------------------------------------------------------------------------- >>KHRZOA oryzain (EC 3.4.22.-) alpha precursor - rice (458 aa) initn: 546 init1: 270 opt: 888 Z-score: 1020.3 expect() 5.3e-50 Smith-Waterman score: 888; 44.345% identity in 336 aa overlap Entrez lookup Re-search database >KHRZOA 5- 331: --------------------------------------------------------------------: 10 20 30 40 50 60 gi|115 MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYG-MNEEGWRRAVWEKNMKMIELHNQEYRE : ::: ..:.: .. . ...::: :.. :. ..:: : :... :...:. :: KHRZOA MRISMALAAAALLLLLSLAAADMSIVSYGERSEEEARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADA 10 20 30 40 50 60 70 80 70 80 90 100 110 120 130 140 gi|115 GKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFY--EA-PRSVDWREKGYVTPVKNQGQCGSCWAFSATG : ::: ...: :.:.:.::.:... :..:. :. :: .. : :: :.::::: :: :. .:.:: ::::::::: . KHRZOA GVHSFRLGLNRFADLTNEEYRDTYLGLRNKPRRERKVSDRYLAADNEALPESVDWRTKGAVAEIKDQGGCGSCWAFSAIA 90 100 110 120 130 140 150 160 150 160 170 180 190 200 210 220 gi|115 ALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYS-VANDTG :.: :: :::::::.::::. :::::::::::::... .:::.:.:..:::.. .: : : : . :.. . KHRZOA AVEDINQIVTGDLISLSEQELVDCDTSY-NEGCNGGLMDYAFDFIINNGGIDTEDDYPYKGKDERCDVNRKNAKVVTIDS 170 180 190 200 210 220 230 230 240 250 260 270 280 290 300 gi|115 FVDI-PKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWG . :. :..: .:.::: . :.::::.:: ..: .:. :: : :.. .:::: .:::: : ... ::.:.:::: KHRZOA YEDVTPNSETSLQKAVRN-QPVSVAIEAGGRAFQLYSSGI-FTGKCGTA-LDHGVAAVGYGTE----NGKDYWIVRNSWG 240 250 260 270 280 290 300 310 310 320 330 gi|115 EEWGMGGYVKMAKDRR---NHCGIASAASYPTV . :: .:::.: .. . 
..:::: ::: KHRZOA KSWGESGYVRMERNIKASSGKCGIAVEPSYPLKKGENPPNPGPTPPSPTPPPTVCDNYYTCPDSTTCCCIYEYGKYCYAW 320 330 340 350 360 370 380 390 --------------------------------------------------------------------------- >>S15844 cathepsin S (EC 3.4.22.27) - bovine (217 aa) initn: 691 init1: 276 opt: 873 Z-score: 1008.0 expect() 2.6e-49 Smith-Waterman score: 873; 56.561% identity in 221 aa overlap Entrez lookup Re-search database >S15844 115- 333: ---------------------------------------------: 80 90 100 110 120 130 140 150 gi|115 MAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRK : :.:::::: :: :: :: :::::::::.::::.:. : S15844 LPDSMDWREKGCVTEVKYQGACGSCWAFSAVGALEAQVKLK 10 20 30 40 160 170 180 190 200 210 220 230 gi|115 TGRLISLSEQNLVDCSGPQ-GNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIP-KQE ::.:.::: ::::::: . ::.:::::.: ::::. ::.:.::: ::::.: . .:.:. : .:. . ....: .: S15844 TGKLVSLSAQNLVDCSTAKYGNKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAMDGKCQYDVKNRAATCSRYIELPFGSE 50 60 70 80 90 100 110 120 240 250 260 270 280 290 300 310 gi|115 KALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYV .:: .:::. ::.::.:::.: ::..:: :.:..:.:. ....::::::::: . :.. ::::::::: ..: ::. S15844 EALKEAVANKGPVSVGIDASHSSFFLYKTGVYYDPSCT-QNVNHGVLVVGYG----NLDGKDYWLVKNSWGLHFGDQGYI 130 140 150 160 170 180 190 320 330 gi|115 KMAKDRRNHCGIASAASYPTV .::.. ::::::. ::: . S15844 RMARNSGNHCGIANYPSYPEI 200 210 --------------------------------------------------------------------------- >>S57776 cysteine proteinase - clove pink (fragment) (427 aa) initn: 388 init1: 294 opt: 874 Z-score: 1004.7 expect() 3.9e-49 Smith-Waterman score: 874; 44.231% identity in 312 aa overlap Entrez lookup Re-search database >S57776 32- 331: --------------------------------------------------------------: 10 20 30 40 50 60 70 gi|115 MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYG-MNEEGWRRAVWEKNMKMIELHNQEYREGKHS-FTMAMN : . : . :. ..:. : :... :...:. ::.. : . 
: ...: S57776 QAYHLQSWLVKHRKNYNALGEKEKRFAIFRDNLEFIDQHNNNNNGGGGGEFELGLN 10 20 30 40 50 80 90 100 110 120 130 140 150 gi|115 AFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFY-----EAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFR :.:.:..:::... : ..:.:.. . . : :.:::::.:: :. ::.:::::::::::: ::.:: S57776 KFADLTNDEFRRIYFGV--KRPEKAESVKSDRYAVKEGDELPESVDWRKKGAVSHVKDQGQCGSCWAFSAIGAVEGINKI 60 70 80 90 100 110 120 130 160 170 180 190 200 210 220 230 gi|115 KTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYS-VANDTGFVDIP-KQ :: ::.::::.::::. : ::.::::::::... .:::.:....:::.::. :: : : . :.. :. :.: .. S57776 VTGDLITLSEQELVDCDTSY-NSGCDGGLMDYAFRFIINNGGIDTDKDYPYKATDGSCDSNRKNAKVVTIDGLEDVPANN 140 150 160 170 180 190 200 210 240 250 260 270 280 290 300 310 gi|115 EKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGY ::::.:::: :. .::.:: ..: .:: :. : .:.. ..::::..:::: : .:.. ::.:.::::..:: :: S57776 EKALQKAVAH-QPVRLAIEAGGRDFQLYKSGV-FTGSCGT-SLDHGVVAVGYG---TTDDGKDYWIVRNSWGDDWGEDGY 220 230 240 250 260 270 280 320 330 gi|115 VKMAKD---RRNHCGIASAASYPTV ..: .. . ..:::: :::. S57776 IRMERNTESKSGKCGIAIEPSYPVKTSPNPPNPGPSPPSPPPAPKVVCDSYSSCPSATTCCCVYEYGPYCYMWGCCPLEA 290 300 310 320 330 340 350 360 --------------------------------------------------------------------------- >>S41427 cysteine proteinase (EC 3.4.22.-) CP1 precursor - Trichomonas vagina (309 aa) initn: 523 init1: 331 opt: 867 Z-score: 998.9 expect() 8.4e-49 Smith-Waterman score: 867; 43.791% identity in 306 aa overlap Entrez lookup Re-search database >S41427 32- 333: --------------------------------------------------------------: 10 20 30 40 50 60 70 80 gi|115 MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGMNEEGWRRAVWEKNMKMIELHNQEYREGKHSFTMAMNAF : . .. .: : ..: .: .... :: .. .:..::: . S41427 MMYQAHEQKSFLGWMRETGNMFTGDEYHQRFGIWLSNKRLVQQHNA----ANGGFVLAMNKL 10 20 30 40 50 90 100 110 120 130 140 150 160 gi|115 GDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLIS . .. :.. .. ::.:.: :. 
.: : :: :.:::::: :.:.:.::::::::.::. :.:.: : .: : S41427 AHLSPSEYKALL-GFKNEK-RSDRVKPIASNYVAPASIDWREKGVVNPIKDQGQCGSCWTFSTIQAMESQWAVKHTKLYS 60 70 80 90 100 110 120 130 170 180 190 200 210 220 230 gi|115 LSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQ--DNGGLDSEESYPYEATEESCKYNP-KYSVANDTGFVDIPK-QEKALM ::::::::: :::::::. :..::. ..: . .: .:::.: ..:::.: : . . ::.. . . .:: :: S41427 LSEQNLVDCVTT--CYGCNGGLMELAYDYVKTYQKGKFMTEADYPYKAIDQSCKFNAAKVAEPTVTGYITVTEGDEKDLM 140 150 160 170 180 190 200 210 240 250 260 270 280 290 300 310 gi|115 KAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMAK . :: :: ..::::.: :: .:. ::: : .:: : .::.: :::: :.... ::.:.:::: :: ::..: : S41427 NKVAQYGPAAIAIDASHYSFQLYSSGIYDESSCSPEGLDHAVGCVGYGSEGSKN----YWIVRNSWGVSWGEKGYIRMIK 220 230 240 250 260 270 280 290 320 330 gi|115 DRRNHCGIASAASYPTV :. :.:: :::: ::: S41427 DKNNQCGEASAACIPTVSA 300 --------------------------------------------------------------------------- >>JC4848 cysteine proteinase (EC 3.4.22.-) - Douglas fir (454 aa) initn: 396 init1: 298 opt: 864 Z-score: 992.9 expect() 1.8e-48 Smith-Waterman score: 864; 42.813% identity in 327 aa overlap Entrez lookup Re-search database >JC4848 16- 331: ------------------------------------------------------------------: 10 20 30 40 50 60 gi|115 MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLY-GMNEEGWRRAVWEKNMKMIELHNQ : : : .. . : :.:.. : :..:. . .:.. :. .:. ::. JC4848 MGILLLFAVLALSAMAGSASRADFSIISYDSQDLIGDDAIMELYELWLAQHKKAYNGLDEKQKKFSVFKDNFLYIHQHNN 10 20 30 40 50 60 70 80 70 80 90 100 110 120 130 gi|115 EYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQ-NRKPRKGKV----FQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCW . :. :. ...: :.:.. :::. .. : . . : : .. .: . . :.:.:::::: :: :::::.::::: JC4848 Q---GNPSYKLGLNQFADLSHEEFKAAYLGTKLDAKKRLSRSPSPRYQYSVGEDLPESIDWREKGAVTAVKNQGSCGSCW 90 100 110 120 130 140 150 140 150 160 170 180 190 200 210 gi|115 AFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCK-YNPKYS :::...:.:: :: : :::::.::::. :.::::::::::::.. .:::::::..:::.:.. :: : . 
JC4848 AFSTVAAVEGINQIVTGNLTSLSEQELVDCDTSY-NQGCNGGLMDYAFQFIISNGGLDSEDDYPYKANNGSCDAYRKNAH 160 170 180 190 200 210 220 230 220 230 240 250 260 270 280 290 gi|115 VANDTGFVDIPKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLV :.. . :.:.... .: .:. ::::::.:. ..: ::. :. : .:... .:::: .:::: :: . :::: JC4848 VVTIDDYEDVPENDEKSLKKAAANQPISVAIEASGRAFQFYESGV-FTSNCGTQ-LDHGVTLVGYGSES----GIDYWLV 240 250 260 270 280 290 300 310 300 310 320 330 gi|115 KNSWGEEWGMGGYVKMAKD----RRNHCGIASAASYPTV :::::. :: :..:. .. . :::: ::::. JC4848 KNSWGNSWGEKGFIKLQRNLEGASTGMCGIAMEASYPVKKGANPPNPGPSPPSPVKPPTVCDNYYSCPESNTCCCMYDFG 320 330 340 350 360 370 380 390 --------------------------------------------------------------------------- >>KHRZOB oryzain (EC 3.4.22.-) beta precursor - rice (471 aa) initn: 713 init1: 393 opt: 860 Z-score: 988.1 expect() 3.3e-48 Smith-Waterman score: 860; 46.711% identity in 304 aa overlap Entrez lookup Re-search database >KHRZOB 37- 332: -------------------------------------------------------------: 10 20 30 40 50 60 70 gi|115 MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGMNEEGWRRAVWEKNMKMIELHNQEYREGKHSFTMA : : : .:. : :. :.:... :: . :: .: .. KHRZOB MSIISYNAEHGARGLEEGPTEAEARAAYDLWLAENGGGSPNALGGEHER--RFLVFWDNLKFVDAHNARADEGG-GFRLG 30 40 50 60 70 80 90 100 80 90 100 110 120 130 140 150 gi|115 MNAFGDMTSEEFRQVMNGFQ--NRKPRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRK :: :.:.:.:::: .. : . .:. :. ... : :.:::::::: :.::::::::::::::::....:. KHRZOB MNRFADLTNEEFRATFLGAKVAERSRAAGERYRHDGVEELPESVDWREKGAVAPVKNQGQCGSCWAFSAVSTVESINQLV 110 120 130 140 150 160 170 180 160 170 180 190 200 210 220 230 gi|115 TGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYN-PKYSVANDTGFVDIPKQ-E ::..:.::::.::.:: : ::::::: ::... :::.:.:..:::.:.. .: : . .:.. :: :.:.. 
: KHRZOB TGEMITLSEQELVECSTNGQNSGCNGGLMADAFDFIIKNGGIDTEDDYPYKAVDGKCDINRENAKVVSIDGFEDVPQNDE 190 200 210 220 230 240 250 260 240 250 260 270 280 290 300 310 gi|115 KALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNK-YWLVKNSWGEEWGMGGY :.:.:::: :.::::.:: . : .:. :. : :.. ..::::..:::: .::.: ::.:.:::: .:: .:: KHRZOB KSLQKAVAH-QPVSVAIEAGGREFQLYHSGV-FSGRCGT-SLDHGVVAVGYG-----TDNGKDYWIVRNSWGPKWGESGY 270 280 290 300 310 320 330 320 330 gi|115 VKMAKD---RRNHCGIASAASYPTV :.: .. ..:::: ::::: KHRZOB VRMERNINVTTGKCGIAMMASYPTKSGANPPKPSPTPPTPPTPPPPSAPDHVCDDNFSCPAGSTCCCAFGFRNLCLVWGC 340 350 360 370 380 390 400 410 --------------------------------------------------------------------------- >>S22502 endopeptidase - kidney bean (362 aa) initn: 420 init1: 298 opt: 841 Z-score: 968.1 expect() 4.3e-47 Smith-Waterman score: 849; 42.274% identity in 343 aa overlap Entrez lookup Re-search database >S22502 9- 331: -------------------------------------------------------------------: 10 20 30 40 50 60 gi|115 MNPTLILAAFCLGIASATLTFDHSL---EAQWT---KWKAMHNRLYGMNEEGWRRAVWEKNMKMIELHNQEYREG .. ::.:.. :..: :. : .:.. :. ...:. : :.. :. ...:: . . S22502 MATKKLLWVVLSFSLVLGVANSFDFHDKDLASEESLWDLYERWRSHHTVSRSLGEKHKRFNVFKANL--MHVHNTNKMD- 10 20 30 40 50 60 70 70 80 90 100 110 120 130 140 gi|115 KHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPR--KGKVFQEPLF-YE----APRSVDWREKGYVTPVKNQGQCGSCWAFS . . . .: :.:::..:::... : . . : .: .. : :: .: :::::.:: :: ::.::::::::::: S22502 -KPYKLKLNKFADMTNHEFRSTYAGSKVNHHRMFRGTPHENGAFMYEKVVSVPPSVDWRKKGAVTDVKDQGQCGSCWAFS 80 90 100 110 120 130 140 150 150 160 170 180 190 200 210 gi|115 ATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESC---KYNPKYSV .. :.:: ::..:..::::.::::. . :.:::::::. ::......::. .: .:::.: : .: : : .: S22502 TVVAVEGINQIKTNKLVALSEQELVDCDKEE-NQGCNGGLMESAFEFIKQKGGITTESNYPYKAQEGTCDASKVND-LAV 160 170 180 190 200 210 220 230 220 230 240 250 260 270 280 290 gi|115 ANDTGFVDIP-KQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLV . : : ..: ..: ::.::::. 
:.::::::: .: ::.::. : :::. :..::: .:::: : :...::.: S22502 SID-GHENVPANDEDALLKAVAN-QPVSVAIDAGGSDFQFYSEGV-FTGDCST-DLNHGVAIVGYG---TTVDGTNYWIV 240 250 260 270 280 290 300 300 310 320 330 gi|115 KNSWGEEWGMGGYVKMAKD---RRNHCGIASAASYPTV .:::: ::: ::..: .. ... :::: ::: S22502 RNSWGPEWGEHGYIRMQRNISKKEGLCGIAMLPSYPIKNSSDNPTGSFSSPKDEL 310 320 330 340 350 360 --------------------------------------------------------------------------- >>S12581 cysteine proteinase (EC 3.4.22.-) - black gram (362 aa) initn: 424 init1: 302 opt: 841 Z-score: 968.1 expect() 4.3e-47 Smith-Waterman score: 848; 42.274% identity in 343 aa overlap Entrez lookup Re-search database >S12581 9- 331: -------------------------------------------------------------------: 10 20 30 40 50 60 gi|115 MNPTLILAAFCLGIASATLTFDHSLEAQ---WT---KWKAMHNRLYGMNEEGWRRAVWEKNMKMIELHNQEYREG .. ::.:.. ...::.. : .:.. :. ...:. : :.. :. ...:: . . S12581 MAMKKLLWVVLSLSLVLGVANSFDFHEKDLESEESLWDLYERWRSHHTVSRSLGEKHKRFNVFKANV--MHVHNTNKMD- 10 20 30 40 50 60 70 70 80 90 100 110 120 130 140 gi|115 KHSFTMAMNAFGDMTSEEFRQVMNGFQ--NRKPRKGKVFQEPLF-YE----APRSVDWREKGYVTPVKNQGQCGSCWAFS . . . .: :.:::..:::... : . ..: .:. : :: .: :::::.:: :: ::.::::::::::: S12581 -KPYKLKLNKFADMTNHEFRSTYAGSKVNHHKMFRGSQHGSGTFMYEKVGSVPASVDWRKKGAVTDVKDQGQCGSCWAFS 80 90 100 110 120 130 140 150 150 160 170 180 190 200 210 gi|115 ATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESC---KYNPKYSV . :.:: ::..:.:::::.::::. . :.:::::::. ::......::. .: .::: : : .: : : .: S12581 TIVAVEGINQIKTNKLVSLSEQELVDCDKEE-NQGCNGGLMESAFEFIKQKGGITTESNYPYTAQEGTCDESKVND-LAV 160 170 180 190 200 210 220 230 220 230 240 250 260 270 280 290 gi|115 ANDTGFVDIP-KQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLV . : : ..: ..:.::.::::. :.::::::: .: ::.::. : ::.. :..::: .:::: : :...::.: S12581 SID-GHENVPVNDENALLKAVAN-QPVSVAIDAGGSDFQFYSEGV-FTGDCNT-DLNHGVAIVGYG---TTVDGTNYWIV 240 250 260 270 280 290 300 300 310 320 330 gi|115 KNSWGEEWGMGGYVKMAKD---RRNHCGIASAASYPTV .:::: ::: ::..: .. 
... :::: :::: S12581 RNSWGPEWGEQGYIRMQRNISKKEGLCGIAMMASYPIKNSSDNPTGSLSSPKDEL 310 320 330 340 350 360 --------------------------------------------------------------------------- >>JQ1111 cysteine proteinase (EC 3.4.22.-) EP-B 1 precursor - barley (371 aa) initn: 573 init1: 246 opt: 827 Z-score: 951.9 expect() 3.4e-46 Smith-Waterman score: 832; 41.667% identity in 348 aa overlap Entrez lookup Re-search database >JQ1111 6- 331: --------------------------------------------------------------------: 10 20 30 40 50 60 gi|115 MNPTLILAAFCLGIASATLTFDHSLEAQ---WT---KWKAMHNRLYGMNEEGWRRAVWEKNMKMIELHN .::. . . :: :..::.. : .:.. : :. : .....: ..:. :: JQ1111 MGLLSKKLLVASMVAAVLAVAAVELCSAIPMEDKDLESEEALWDLYERWQSAHRVRRHHAEKHRRFGTFKSNAHFIHSHN 10 20 30 40 50 60 70 80 70 80 90 100 110 120 130 gi|115 QEYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEA-------PRSVDWREKGYVTPVKNQGQCG ..: : . . .: :::: . ::: .. : . :. .: . : :. : : :::::.:: :: ::.::.:: JQ1111 ---KRGDHPYRLHLNRFGDMDQAEFRATFVG-DLRRDTPAKPPSVPGFMYAALNVSDLPPSVDWRQKGAVTGVKDQGKCG 90 100 110 120 130 140 150 140 150 160 170 180 190 200 210 gi|115 SCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCK---- ::::::.. ..:: .:: :.:::::.:.::. . :.::.::::: ::.:...:::: .: .:::.:.. .:. JQ1111 SCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDTAD-NDGCQGGLMDNAFEYIKNNGGLITEAAYPYRAARGTCNVARA 160 170 180 190 200 210 220 230 220 230 240 250 260 270 280 290 gi|115 -YNPKYSVANDTGFVDIP-KQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTES : : : : :.: ..:. : .:::. :.:::..:. ..:.::.::. : ::..: .:::: ::::: . JQ1111 AQNSPVVVHID-GHQDVPANSEEDLARAVAN-QPVSVAVEASGKAFMFYSEGV-FTGDCGTE-LDHGVAVVGYG---VAE 240 250 260 270 280 290 300 300 310 320 330 gi|115 DNNKYWLVKNSWGEEWGMGGYVKMAKDRRNH---CGIASAASYPTV :.. :: :::::: :: ::... :: :::: ::::. 
JQ1111 DGKAYWTVKNSWGPSWGEQGYIRVEKDSGASGGLCGIAMEASYPVKTYNKPMPRRALGAWESQ 310 320 330 340 350 360 370 --------------------------------------------------------------------------- >>JQ1110 cysteine proteinase (EC 3.4.22.-) EP-B 4 precursor - barley (373 aa) initn: 573 init1: 246 opt: 821 Z-score: 945.0 expect() 8.3e-46 Smith-Waterman score: 826; 41.379% identity in 348 aa overlap Entrez lookup Re-search database >JQ1110 6- 331: --------------------------------------------------------------------: 10 20 30 40 50 60 gi|115 MNPTLILAAFCLGIASATLTFDHSLEAQ---WT---KWKAMHNRLYGMNEEGWRRAVWEKNMKMIELHN .::. . . :: :..::.. : .:.. : :. : .....: ..:. :: JQ1110 MGLLSKKLLVASMVAAVLAVAAVELCSAIPMEDKDLESEEALWDLYERWQSAHRVRRHHAEKHRRFGTFKSNAHFIHSHN 10 20 30 40 50 60 70 80 70 80 90 100 110 120 130 gi|115 QEYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEA-------PRSVDWREKGYVTPVKNQGQCG ..: : . . .: :::: . ::: .. : . :. .: . : :. : : :::::.:: :: ::.::.:: JQ1110 ---KRGDHPYRLHLNRFGDMDQAEFRATFVG-DLRRDTPSKPPSVPGFMYAALNVSDLPPSVDWRQKGAVTGVKDQGKCG 90 100 110 120 130 140 150 140 150 160 170 180 190 200 210 gi|115 SCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCK---- ::::::.. ..:: .:: :.:::::.:.::. . :.::.::::: ::.:...:::: .: .:::.:.. .:. JQ1110 SCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDTAD-NDGCQGGLMDNAFEYIKNNGGLITEAAYPYRAARGTCNVARA 160 170 180 190 200 210 220 230 220 230 240 250 260 270 280 290 gi|115 -YNPKYSVANDTGFVDIP-KQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTES : : : : :.: ..:. : .:::. :.:::..:. ..:.::.::. : .:..: .:::: ::::: . JQ1110 AQNSPVVVHID-GHQDVPANSEEDLARAVAN-QPVSVAVEASGKAFMFYSEGV-FTGECGTE-LDHGVAVVGYG---VAE 240 250 260 270 280 290 300 300 310 320 330 gi|115 DNNKYWLVKNSWGEEWGMGGYVKMAKDRRNH---CGIASAASYPTV :.. :: :::::: :: ::... :: :::: ::::. 
JQ1110 DGKAYWTVKNSWGPSWGEQGYIRVEKDSGASGGLCGIAMEASYPVKTYSKPKPTPRRALGARESL 310 320 330 340 350 360 370 --------------------------------------------------------------------------- >>JQ1121 cysteine proteinase homolog COT44 - rape (328 aa) initn: 498 init1: 318 opt: 807 Z-score: 929.8 expect() 5.8e-45 Smith-Waterman score: 807; 39.063% identity in 320 aa overlap Entrez lookup Re-search database >JQ1121 29- 331: ---------------------------------------------------------------: 10 20 30 40 50 60 70 gi|115 MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYG-----MNEEGWRRAVWEKNMKMIELHNQEYREGKHSFTM . .:. :.. . .:.. : ... :...:.:::.. ... .. . JQ1121 MSIYLRWSLEHGKSNSNSNGIINQQDERFNIFKDNLRFIDLHNENNKNA--TYKL 10 20 30 40 50 80 90 100 110 120 130 140 gi|115 AMNAFGDMTSEEFRQVMNGFQNRKPRK-GKVFQEPLFY-------EAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGAL ... :...:..:.:... : ... :. :. . . : :.: .::::.:: :. .:.:: ::::::::...:. JQ1121 GLTIFANLTNDEYRSLYLGARTEPVRRITKAKNVNMKYSAAVNVDEVPVTVDWRQKGAVNAIKDQGTCGSCWAFSTAAAV 60 70 80 90 100 110 120 130 150 160 170 180 190 200 210 220 gi|115 EGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYS-VANDTGFV :: ::.:.:::::.::::. :.::::::::::::... ::::..:..:::..:. .:. : : :.. :. JQ1121 EGINKIVTGELVSLSEQELVDCDKSY-NQGCNGGLMDYAFQFIMKNGGLNTEKDYPYHGTNGKCNSLLKNSRVVTIDGYE 140 150 160 170 180 190 200 210 230 240 250 260 270 280 290 300 gi|115 DIPKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEW :.:..... .: ... :.::::::: ..: :. :: : :.. .:::.:..:::: : .. ::.:.:::: .: JQ1121 DVPSKDETALKRAVSYQPVSVAIDAGGRAFQHYQSGI-FTGKCGT-NMDHAVVAVGYGSE----NGVDYWIVRNSWGTRW 220 230 240 250 260 270 280 310 320 330 gi|115 GMGGYVKMAKD---RRNHCGIASAASYPTV : ::..: .. . ..:::: ::::. 
JQ1121 GEDGYIRMERNVASKSGKCGIAIEASYPVKYSPNPVRGTSSV 290 300 310 320 --------------------------------------------------------------------------- >>S47312 cysteine proteinase (EC 3.4.22.-) precursor - spring vetch (368 aa) initn: 374 init1: 270 opt: 802 Z-score: 923.4 expect() 1.3e-44 Smith-Waterman score: 802; 38.889% identity in 342 aa overlap Entrez lookup Re-search database >S47312 4- 331: --------------------------------------------------------------------: 10 20 30 40 50 60 gi|115 MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLY-GMNEEGWRRAVWEKNMKMIELHNQEYREG .:: .. : : : . . ... .: . :...: :. :. : ... :...:. :: . S47312 MASMTILPFFLFFSLITFSLALDIQLPTGRSNDEVMTMYEEWLVKHQKVYNGLREKDQRFQIFKDNLNFIDEHNAQ---- 10 20 30 40 50 60 70 70 80 90 100 110 120 130 140 gi|115 KHSFTMAMNAFGDMTSEEFRQVMNGFQN---RKPRKGKVFQEPLFYEA----PRSVDWREKGYVTPVKNQGQCGSCWAFS .... ...: :.:::.::.:... : .. :. :.:. . :.. : :::: :: .: .:.::.:::::::: S47312 NYTYIVGLNKFADMTNEEYRDMYLGTRSDIKRRIMKNKITGHRYAYNSGDRLPVHVDWRLKGAITHIKDQGSCGSCWAFS 80 90 100 110 120 130 140 150 150 160 170 180 190 200 210 220 gi|115 ATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCK-YNPKYSVAN . ...:. ::.:.:::::.::::. :::::::::::::... :::.:... :::.. : : : .... S47312 TIATVEAINKIVTGKLVSLSEQELVDCDRAF-NEGCNGGLMDYAFEFIIGNGGIDTDQHYPYKGFEGRCDPTRKKAKIVS 160 170 180 190 200 210 220 230 230 240 250 260 270 280 290 300 gi|115 DTGFVDIPKQ-EKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKN :. :.:.. :.:: :::: :.::::.:. ... .:. :. : :.. ..::.:..:::: : .. ::::.: S47312 IDGYEDVPSNNENALKKAVAH-QPVSVAIEASGRALQLYQSGV-FTGKCGT-SLDHAVVIVGYGSE----NGLDYWLVRN 240 250 260 270 280 290 300 310 320 330 gi|115 SWGEEWGMGGYVKMAKDRRN----HCGIASAASYPTV ::: .:: :: :: .. .. .:::: ::::. 
S47312 SWGTNWGEDGYFKMERNVKGTHTGKCGIAVEASYPVKYGKNSAVTTNSAYEKTEVLVSSA 310 320 330 340 350 360 --------------------------------------------------------------------------- >>S49451 cysteine proteinase - chickpea (325 aa) initn: 331 init1: 233 opt: 797 Z-score: 918.5 expect() 2.5e-44 Smith-Waterman score: 797; 41.270% identity in 315 aa overlap Entrez lookup Re-search database >S49451 29- 331: ---------------------------------------------------------------: 10 20 30 40 50 60 70 gi|115 MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLY-GMNEEGWRRAVWEKNMKMIELHNQEYREGKHSFTMAMNA . :: . :...: :..:. : ... :...:. :: . ..:. ...: S49451 MTMYEKWLVKHQKMYNGLGEKDTRFQIFKDNLRFIDEHNAQ----NYSYKVGLNK 10 20 30 40 50 80 90 100 110 120 130 140 150 gi|115 FGDMTSEEFRQVMNGFQN---RKPRKGKVFQEPLFYEA---PRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFR :.:...::.:... : .. :. : :. . . :.. .:::: :: :: .:.::.::::::::. ...:. S49451 FADINNEEYRDMYLGTKSDAKRRVMKTKITGHRITYNSVIVTVKVDWRLKGAVTHIKDQGSCGSCWAFSTIATVEAINKI 60 70 80 90 100 110 120 130 160 170 180 190 200 210 220 230 gi|115 KTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYS-VANDTGFVDIPKQE ::...:::::.::::. :::::::::::::... :::.:....:::.. :..: . : . :.. :. :.:. S49451 VTGKFVSLSEQELVDCDRAF-NEGCNGGLMDYAFEFIIRNGGIDTDQDYPYNGFERKCDPTKKNAKVVSIDGYEDVPSYM 140 150 160 170 180 190 200 210 240 250 260 270 280 290 300 310 gi|115 KALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYV .:: :::: :.:::: . ... .:. :. : :.. :.::::.::::: : .. ::::.:::: .:: :: S49451 NALKKAVAH-QPVSVAIAGLGRALQLYQSGV-FTGKCGT-DLDHGVVVVGYGSE----NGVDYWLVRNSWGTNWGEDGYF 220 230 240 250 260 270 280 320 330 gi|115 KMA----KDRRNHCGIASAASYPTV :.: :. .:::: ::::. 
S49451 KIASRNVKSLYRKCGIAMEASYPVKYGQNTNSAAPQLYVTSA 290 300 310 320 --------------------------------------------------------------------------- >>KHRTH cathepsin H (EC 3.4.22.16) precursor - rat (333 aa) initn: 580 init1: 389 opt: 797 Z-score: 918.3 expect() 2.6e-44 Smith-Waterman score: 797; 39.583% identity in 336 aa overlap Entrez lookup Re-search database >KHRTH 3- 331:---------------------------------------------------------------------: 10 20 30 40 50 60 70 gi|115 MNPTLILAAFCLGI-ASATLTFDHSLEAQWTKWKAMHNRLYGMNEEGWRRAVWEKNMKMIELHNQEYREGKHSFTMA : : .:. :. :.: :: . . ..:.: .:.. :. : . : :. .: . :. :::. .:.: :. KHRTH MWTALPLLCAGAWLLSAGATAELTVNAIEKFHFTSWMKQHQKTYSSREYSHRLQVFANNWRKIQAHNQR----NHTFKMG 10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 gi|115 MNAFGDMTSEEFRQ--VMNGFQNRKPRKGKVFQEPLFYEAPRSVDWREKG-YVTPVKNQGQCGSCWAFSATGALEGQMFR .: :.::. :... . . :: . :.. .. : : :.:::.:: :.:::::: :::::.::.:::::. . KHRTH LNQFSDMSFAEIKHKYLWSEPQNCSATKSNYLRGTGPY--PSSMDWRKKGNVVSPVKNQGACGSCWTFSTTGALESAVAI 80 90 100 110 120 130 140 150 160 170 180 190 200 210 220 230 gi|115 KTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIP-KQE .:....:.::.::::. .:.::.::: . ::.:. : :. .:.:::: . . .::.::. .:: . :.: ..: KHRTH ASGKMMTLAEQQLVDCAQNFNNHGCQGGLPSQAFEYILYNKGIMGEDSYPYIGKNGQCKFNPEKAVAFVKNVVNITLNDE 160 170 180 190 200 210 220 230 240 250 260 270 280 290 300 310 gi|115 KALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDC--SSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGG :...::: .:.: :... :.:..:: :.: .: . . ..:.::.:::: :... ::.:::::: .:: .: KHRTH AAMVEAVALYNPVSFAFEVT-EDFMMYKSGVYSSNSCHKTPDKVNHAVLAVGYG----EQNGLLYWIVKNSWGSNWGNNG 240 250 260 270 280 290 300 320 330 gi|115 YVKMAKDRRNHCGIASAASYPTV : . . . : ::.:. 
:::: KHRTH YFLIERGK-NMCGLAACASYPIPQV 310 320 330 --------------------------------------------------------------------------- >>KHHUH cathepsin H (EC 3.4.22.16) precursor - human (335 aa) initn: 596 init1: 364 opt: 797 Z-score: 918.3 expect() 2.6e-44 Smith-Waterman score: 797; 38.348% identity in 339 aa overlap Entrez lookup Re-search database >KHHUH 3- 331:---------------------------------------------------------------------: 10 20 30 40 50 60 70 gi|115 MNPTLILAAFCLGI---ASATLTFDHSLEAQWTKWKAMHNRLYGMNEEGWRRAVWEKNMKMIELHNQEYREGKHSFT : : .:. ::. ..: :. . . .. .: . : . :. .: : .. .: . :. ::. :.:.: KHHUH MWATLPLLCAGAWLLGVPVCGAAELSVNSLEKFHFKSWMSKHRKTYSTEEYHHRLQTFASNWRKINAHNN----GNHTFK 10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 gi|115 MAMNAFGDMTSEEFRQ--VMNGFQNRKPRKGKVFQEPLFYEAPRSVDWREKG-YVTPVKNQGQCGSCWAFSATGALEGQM ::.: :.::. :... . . :: . :.. .. : : :::::.:: .:.:::::: :::::.::.:::::. . KHHUH MALNQFSDMSFAEIKHKYLWSEPQNCSATKSNYLRGTGPY--PPSVDWRKKGNFVSPVKNQGACGSCWTFSTTGALESAI 80 90 100 110 120 130 140 150 160 170 180 190 200 210 220 gi|115 FRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVA--NDTGFVDIP ::...::.::.::::. .: ::.::: . ::.:. : :. .:..:::.. . ::..: ... .:.. . : KHHUH AIATGKMLSLAEQQLVDCAQDFNNYGCQGGLPSQAFEYILYNKGIMGEDTYPYQGKDGYCKFQPGKAIGFVKDVANITI- 160 170 180 190 200 210 220 230 230 240 250 260 270 280 290 300 gi|115 KQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDC--SSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWG .:.:...::: .:.: :... .. :..:. ::: .: . . ..:.::.:::: :... ::.:::::: .:: KHHUH YDEEAMVEAVALYNPVSFAFEVTQD-FMMYRTGIYSSTSCHKTPDKVNHAVLAVGYG----EKNGIPYWIVKNSWGPQWG 240 250 260 270 280 290 300 310 320 330 gi|115 MGGYVKMAKDRRNHCGIASAASYPTV :.:: . . . : ::.:. 
:::: KHHUH MNGYFLIERGK-NMCGLAACASYPIPLV 310 320 330 --------------------------------------------------------------------------- >>JN0719 drought-inducible cysteine proteinase (EC 3.4.22.-) RD21A precursor (462 aa) initn: 427 init1: 285 opt: 796 Z-score: 915.0 expect() 3.9e-44 Smith-Waterman score: 796; 39.640% identity in 333 aa overlap Entrez lookup Re-search database >JN0719 13- 331: ------------------------------------------------------------------: 10 20 30 40 50 gi|115 MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGMN---EEGWRRAVWEKNM :.... . . . . : . :.. ..: :. : ... :. JN0719 MGFLKPTMAILFLAMVAVSSAVDMSIISYDEKHGVSTTGGRSEAEVMSIYEAWLVKHGKAQSQNSLVEKDRRFEIFKDNL 10 20 30 40 50 60 70 80 60 70 80 90 100 110 120 130 gi|115 KMIELHNQEYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEA------PRSVDWREKGYVTPVK .... :: : . :. .... :.:.:..:.:. . : . .: ::. . : ::: :.:.:::.:: :. :: JN0719 RFVDEHN----EKNLSYRLGLTRFADLTNDEYRSKYLGAKMEK--KGER-RTSLRYEARVGDELPESIDWRKKGAVAEVK 90 100 110 120 130 140 150 140 150 160 170 180 190 200 210 gi|115 NQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEES .:: ::::::::. ::.:: :: ::.::::.::::. :::::::::::::... :::.:....:::.... . JN0719 DQGGCGSCWAFSTIGAVEGINQIVTGDLITLSEQELVDCDTSY-NEGCNGGLMDYAFEFIIKNGGIDTDKDYPYKGVDGT 160 170 180 190 200 210 220 230 220 230 240 250 260 270 280 gi|115 C-KYNPKYSVANDTGFVDIPK-QEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFEST : . . .:.. .. :.: .:..: :::: :::.::.:: ..: .: :: :. .:... .::::..:::: : JN0719 CDQIRKNAKVVTIDSYEDVPTYSEESLKKAVAH-QPISIAIEAGGRAFQLYDSGI-FDGSCGTQ-LDHGVVAVGYGTE-- 240 250 260 270 280 290 300 290 300 310 320 330 gi|115 ESDNNKYWLVKNSWGEEWGMGGYVKMAKD---RRNHCGIASAASYPTV ... ::.:.::::. :: .::..::.. 
..:::: ::: JN0719 --NGKDYWIVRNSWGKSWGESGYLRMARNIASSSGKCGIAIEPSYPIKNGENPPNPGPSPPSPIKPPTQCDSYYTCPESN 310 320 330 340 350 360 370 380 JN0719 TCCCLFEYGKYCFAWGCCPLEAATCCDDNYSCCPHEYPVCDLDQGTCLLSKNSPFSVKALKRKPATPFWSQGRKNIA 390 400 410 420 430 440 450 460 --------------------------------------------------------------------------- >>S49166 cysteine proteinase precursor - spring vetch (357 aa) initn: 659 init1: 243 opt: 789 Z-score: 908.7 expect() 8.8e-44 Smith-Waterman score: 794; 40.805% identity in 348 aa overlap Entrez lookup Re-search database >S49166 5- 331: --------------------------------------------------------------------: 10 20 30 40 50 60 gi|115 MNPTLILAAFCLG-IASATLTFD---HSLEAQ---WT---KWKAMHNRLYGMNEEGWRRAVWEKNMKMIELHNQEYREG :.. .. :. : ... ::: :.::.. :. .:.. :. ...:. : :.. :. ...:: . . S49166 MEMKKLLFISLSLALIFTVANTFDFNEHDLESEKSLWNLYERWRSHHTVTRNLDEKHNRFNVFKANV--MHVHNTNKLD- 10 20 30 40 50 60 70 70 80 90 100 110 120 130 140 gi|115 KHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPR--KGKVFQEPLF-YE----APRSVDWREKGYVTPVKNQGQCGSCWAFS . . . .: :::::. :::... . . : .: .. : :: .: :.:::.:: :: ::.::::::::::: S49166 -KPYKLKLNKFGDMTNYEFRRIYADSKISHHRMFRGMSHENGTFMYENAVDVPSSIDWRNKGAVTGVKDQGQCGSCWAFS 80 90 100 110 120 130 140 150 150 160 170 180 190 200 210 220 gi|115 ATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVAND . .:.:: :: .:.:::::.::::. . :::::::::.:::.....:: . .: .::: : . .: . . .... S49166 TIAAVEGINQIKTQKLVSLSEQQLVDCDTEE-NEGCNGGLMEYAFEFIKQNG-ITTESNYPYAAKDGTCDVEKEDKAVSI 160 170 180 190 200 210 220 230 230 240 250 260 270 280 290 300 gi|115 TGFVDIP-KQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNS : ..: ..: ::.::.: :.::::::: .: ::.::. : :.. :..::: .:::: . .: .:::..::: S49166 DGHENVPINNEAALLKAAAK-QPVSVAIDAGGYNFQFYSEGV-FTGHCDT-DLNHGVAIVGYG---VTQDRTKYWIMKNS 240 250 260 270 280 290 300 310 320 330 gi|115 WGEEWGMG---GYVKMAKDRRNHCGIASAASYPTV :: :. :: : . ... :.. 
:::: :::: S49166 WGLEF-MGPRMGRTGISS-REGLCGIAMEASYPIKKSSTKPTESSILKDEL 310 320 330 340 350 --------------------------------------------------------------------------- >>KHDOP prestalk cathepsin (EC 3.4.22.-) precursor - slime mold (Dictyosteliu (376 aa) initn: 815 init1: 420 opt: 779 Z-score: 896.9 expect() 4e-43 Smith-Waterman score: 923; 43.860% identity in 342 aa overlap Entrez lookup Re-search database >KHDOP 29- 331: ---------------------------------------------------------------: 10 20 30 40 50 60 70 gi|115 MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGMNEEGWRRAVWEKNMKMIELHNQEYREGKHSF .:.: :: :. .: . : .....:: ... :.. : . KHDOP MRLLVFLILLIFVNFSFANVRPNGRRFSESQYRTAFTEWTLKFNRQYSSSEFSNRYSIFKSNMDYVDNWNSK---GDSQT 10 20 30 40 50 60 70 80 90 100 110 120 130 140 gi|115 TMAMNAFGDMTSEEFRQVMNGFQ-NRKPRKGKVFQEPLFYE----APRSVDWREKGYVTPVKNQGQCGSCWAFSATGALE ....: :.:.:.::.:... : . : . .: .: : : :.:.::: :. :::.:.::::::::.::.::. : KHDOP VLGLNNFADITNEEYRKTYLGTRVNAHSYNGYDGREVLNVEDLQTNPKSIDWRTKNAVTPIKDQGQCGSCWSFSTTGSTE 80 90 100 110 120 130 140 150 150 160 170 180 190 200 210 220 gi|115 GQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEA-TEESCKYNPKYSVANDTGFVD : :: .:.:::::::::::::. : ::.::::. ::.:. : :.:.: :::: : : .: .: . :. :.:. KHDOP GAHALKTKKLVSLSEQNLVDCSGPEENFGCDGGLMNNAFDYIIKNKGIDTESSYPYTAETGSTCLFNKSDIGATIKGYVN 160 170 180 190 200 210 220 230 230 240 250 260 270 280 gi|115 IPKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGF---------------------- : . .. : ::.::::::.:.:: .: :::.:: :: ..:::::::::: KHDOP ITAGSEISLENGAQHGPVSVAIDASHNSFQLYTSGIYYEPKCSPTELDHGVLVVGYGVQGKDDEGPVLNRKQTIVIHKNE 240 250 260 270 280 290 300 310 290 300 310 320 330 gi|115 ----ESTE--SDN-----NKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV ::.. ::. :.::.:::::: ::. ::. 
:.:::.:.:::::..::: KHDOP DNKVESSDDSSDSVRPKANNYWIVKNSWGTSWGIKGYILMSKDRKNNCGIASVSSYPLA 320 330 340 350 360 370 --------------------------------------------------------------------------- >>S66348 senescence-associated cysteine proteinase precursor (clone SENU3) - (356 aa) initn: 600 init1: 447 opt: 768 Z-score: 884.7 expect() 1.9e-42 Smith-Waterman score: 768; 42.765% identity in 311 aa overlap Entrez lookup Re-search database >S66348 29- 333: ---------------------------------------------------------------: 10 20 30 40 50 60 gi|115 MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGMNEE-GWRRAVWEKNMKMIELHNQEYR .... : . : :: : .. :.:::. :: : S66348 TALAGPATFADKNPIRQVVFPDELENGILQVVGQTRSALSFARFAIRHRKRYDSVEEIKQRFEIFLDNLKMIRSHN---R 20 30 40 50 60 70 80 90 70 80 90 100 110 120 130 140 gi|115 EGKHSFTMAMNAFGDMTSEEFRQ-VMNGFQN-RKPRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATG .: :. ...: : :.: .:::. ... :: ::.. . :.. :::. : :.::: ::.:::::.::.:: S66348 KGL-SYKLGINEFTDLTWDEFRKHKLGASQNCSATTKGNLKLTNVVL--PETKDWRKDGIVSPVKAQGKCGSCWTFSTTG 100 110 120 130 140 150 160 170 150 160 170 180 190 200 210 220 gi|115 ALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGF :::. . . :. ::::::.::::.: .: :::::: . ::.:.. :::::.::.::: . . ::.. .. . S66348 ALEAAYAQAFGKGISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYIKFNGGLDTEEAYPYTGKNGICKFSQANIGVKVISS 180 190 200 210 220 230 240 250 230 240 250 260 270 280 290 300 gi|115 VDIP-KQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMD--HGVLVVGYGFESTESDNNKYWLVKNSW :.: : : ::: : :.:::... ..: :: :.: .:.. :: :.::.:::: : ... :::.:::: S66348 VNITLGAEYELKYAVALVRPVSVAFEVV-KGFKQYKSGVYASTECGDTPMDVNHAVLAVGYGVE----NGTPYWLIKNSW 260 270 280 290 300 310 320 310 320 330 gi|115 GEEWGMGGYVKMAKDRRNHCGIASAASYPTV : .:: :: :: . : ::.:. 
:::: : S66348 GADWGEDGYFKMEMGK-NMCGVATCASYPIVA 330 340 350 --------------------------------------------------------------------------- >>S47434 cysteine proteinase - rice (378 aa) initn: 490 init1: 310 opt: 764 Z-score: 879.7 expect() 3.6e-42 Smith-Waterman score: 852; 43.363% identity in 339 aa overlap Entrez lookup Re-search database >S47434 19- 331: -----------------------------------------------------------------: 10 20 30 40 50 gi|115 MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLY-----GM-NEEGWRR---AVWEKNMKM :. ..::.: . .:.. .. :. :..: : :. .: .. S47434 MLRCFLVAAAAVALAAAAAAPARAIPFTESDLSSEESLRALYERWRSRYTVSRPAASGGVGNDDGEARRRFNVFVENARY 10 20 30 40 50 60 70 80 60 70 80 90 100 110 120 gi|115 IELHNQEYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPR--------KGKVFQEPLFYEA--PRSVDWREKGYVTP :. : :.: . : .:.: :.:::..:::... : . :. : .: :. : : .:::::.: :: S47434 IHEAN---RRGGRPFRLALNKFADMTTDEFRRTYAGSRARHHRSLSGGRGGEGGSFRYGGDDEDNLPPAVDWRERGAVTG 90 100 110 120 130 140 150 130 140 150 160 170 180 190 200 gi|115 VKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATE .:.:::::::::::...:.:: :::::..::::.::::. . :.::.:::::::::... :::. .: .:::.: . S47434 IKDQGQCGSCWAFSTVAAVEGVNKIKTGRLVTLSEQELVDCDTGD-NQGCDGGLMDYAFQFIKRNGGITTESNYPYRAEQ 160 170 180 190 200 210 220 230 210 220 230 240 250 260 270 280 gi|115 ESCKYNPKYS--VANDTGFVDIP-KQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGF :. : :. : :. :.: ..:.::.::::. :..::..:. ..: ::.::. : .:.. :.:::: .::::. S47434 GRCNKAKASSHDVTID-GYEDVPANDESALQKAVAN-QPVAVAVEASGQDFQFYSEGV-FTGECGT-DLDHGVAAVGYGI 240 250 260 270 280 290 300 310 290 300 310 320 330 gi|115 ESTESDNNKYWLVKNSWGEEWGMGGYVKMAK----DRRNHCGIASAASYPTV :..:::.:::::::.:: ::..: . : . :::: ::::. 
S47434 TR---DGTKYWIVKNSWGEDWGERGYIRMQRGVSSDSNGLCGIAMEASYPVKSGARNAAASNRVVKDEM 320 330 340 350 360 370 --------------------------------------------------------------------------- >>S59598 cysteine proteinase 2 precursor - maize (360 aa) initn: 631 init1: 456 opt: 754 Z-score: 868.6 expect() 1.5e-41 Smith-Waterman score: 754; 43.643% identity in 291 aa overlap Entrez lookup Re-search database >S59598 48- 333: -----------------------------------------------------------: 10 20 30 40 50 60 70 80 gi|115 AAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGMNEEGWRRAVWEKNMKMIELHNQEYREGKHSFTMAMNAFGDMTSEE : .. ....... : :.: :. ...: :.::. :: S59598 ASALESTVFAALGRTRDALRFARFAVRYGKSYESAAEVHKRFRIFSESLQLVRSTN---RKGL-SYRLGINRFADMSWEE 40 50 60 70 80 90 100 110 90 100 110 120 130 140 150 160 gi|115 FRQV-MNGFQN-RKPRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQN :: . ... :: :. .. :.. :::: : :.::::::.:::::.::.:::::. . . ::. ::::::. S59598 FRATRLGAAQNCSATLTGNHRMRAAAVALPETKDWREDGIVSPVKNQGHCGSCWTFSTTGALEAAYTQATGKPISLSEQQ 120 130 140 150 160 170 180 190 170 180 190 200 210 220 230 240 gi|115 LVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIP-KQEKALMKAVATVGP ::::. .: :::::: . ::.:.. :::::.::::::.... ::.. . .. :.: : : ::. : : S59598 LVDCGFAFNNFGCNGGLPSQAFEYIKYNGGLDTEESYPYQGVNGICKFKNENVGVKVLDSVNITLGAEDELKDAVGLVRP 200 210 220 230 240 250 260 270 250 260 270 280 290 300 310 320 gi|115 ISVAIDAGHESFLFYKEGIYFEPDCSSEDMD--HGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHC .:::... .: .:: :.: :.. :: :.::.:::: : :. :::.::::: .:: :: :: . : : S59598 VSVAFEVI-TGFRLYKSGVYTSDHCGTTPMDVNHAVLAVGYGVE----DGVPYWLIKNSWGADWGDEGYFKMEMGK-NMC 280 290 300 310 320 330 340 330 gi|115 GIASAASYPTV :.:. 
:::: : S59598 GVATCASYPIVA 350 360 --------------------------------------------------------------------------- >>S24602 cysteine proteinase tpp (EC 3.4.22.-) - garden pea (464 aa) initn: 467 init1: 242 opt: 745 Z-score: 856.7 expect() 7e-41 Smith-Waterman score: 745; 40.303% identity in 330 aa overlap Entrez lookup Re-search database >S24602 16- 331: ------------------------------------------------------------------: 10 20 30 40 50 60 gi|115 MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYG-MNEEGWRRAVWEKNMKMIE :. : :. : ... .: . :.. :. ..:. : ... :. .:. S24602 MLSKLTILFITLTFTLSLALDMCIISYDKTHPDKSTPRTNDQVL-TMYEEWLVKHGKNYNALGEKEKRFEIFKDNLGFID 10 20 30 40 50 60 70 70 80 90 100 110 120 130 gi|115 LHNQEYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQ---NRKPRK----GKVFQEPLFYEAPRSVDWREKGYVTPVKNQG ::.. . :: ...: :.:.:.::.: . : . ::. :: . . . . :.:::::..: :. ::.:: S24602 EHNSK----NLSFRLGLNRFADLTNEEYRTRFLGTRINPNRRNRKVNSQTNRYATRVGDKLPESVDWRKEGAVVGVKDQG 80 90 100 110 120 130 140 150 140 150 160 170 180 190 200 210 gi|115 QCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKY .::::::::: .:.:: :: :::::::.::::. :::::::::::::... . .: ::.:::.: . : S24602 SCGSCWAFSAIAAVEGVNKLATGDLISLSEQELVDCDTSY-NEGCNGGLMDYAFEFIINMVALTPEEDYPYRAIDGRCDQ 160 170 180 190 200 210 220 230 220 230 240 250 260 270 280 290 gi|115 NPKYS-VANDTGFVDIPK-QEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESD : : . :.. . :.: .: :: ::::. :.::...: . : .: :. : :.. .:::: .:::: : . S24602 NRKNAKVVSIDQYEDVPAYDEGALKKAVAN-QVIAVAVEGGGREFQLYDSGV-FTGRCGTA-LDHGVAAVGYGTE----N 240 250 260 270 280 290 300 300 310 320 330 gi|115 NNKYWLVKNSWGEEWGMGGYVKMAKD----RRNHCGIASAASYPTV .. ::.:.:::: :: .::... .. . 
..:::: ::: S24602 GKDYWIVRNSWGGSWGEAGYIRLERNLATSKSGKCGIAIEPSYPIKNGLNPPKPAPSPPSPVKPPSVCDSYSCAEGSTCC 310 320 330 340 350 360 370 380 S24602 CIFDYGGSCFEWGCCPLESATCCDDHYSCCPHEYPVCDTYAGLCRKNKNNPLGVKSFKRTPAKPHFAIEGKNKMGSV 390 400 410 420 430 440 450 460 --------------------------------------------------------------------------- >>S11862 cysteine proteinase homolog - garden pea (363 aa) initn: 606 init1: 287 opt: 742 Z-score: 854.8 expect() 8.8e-41 Smith-Waterman score: 742; 38.390% identity in 323 aa overlap Entrez lookup Re-search database >S11862 22- 329: -----------------------------------------------------------------: 10 20 30 40 50 60 gi|115 MNPTLILAAFCLGIASATLTFDHSLEAQ--WTKWKAMHNRLYGMNEE-GWRRAVWEKNMKMIE :: :.:. .:..:. .. :. .:: .: .:...:. . S11862 MDRRFLFALFLFAAVATAVTDDTNNDDFIIRQVVDNEEDHLLNAEHHFTSFKSKFSKSYATKEEHDYRFGVFKSNLIKAK 10 20 30 40 50 60 70 80 70 80 90 100 110 120 130 gi|115 LHNQEYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLF--YEAPRSVDWREKGYVTPVKNQGQCGSC ::... ..:..: :.:.:. :::. . :...: ... . :.. . :.. :::::: :::::.::.:::: S11862 LHQNRDPTAEHGIT----KFSDLTASEFRRQFLGLKKRLRLPAHAQKAPILPTTNLPEDFDWREKGAVTPVKDQGSCGSC 90 100 110 120 130 140 150 140 150 160 170 180 190 200 210 gi|115 WAFSATGALEGQMFRKTGRLISLSEQNLVDCS---GPQ--G--NEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESC ::::.:::::: . ::.:.:::::.::::. :. : . :::::::. ::.:. ..::. .:..: : . . :: S11862 WAFSTTGALEGAHYLATGKLVSLSEQQLVDCDHVCDPEQAGSCDSGCNGGLMNNAFEYLLESGGVVQEKDYAYTGRDGSC 160 170 180 190 200 210 220 230 220 230 240 250 260 270 280 gi|115 KYNPKYSVANDTGFVDIPKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYG---FEST :.. . ::. ..: . .: . .. ::..:::.:. . : :. :.. .:::::.::.: . S11862 KFDKSKVVASVSNFSVVTLDEDQIAANLVKNGPLAVAINAAWMQT--YMSGVSCPYVCAKSRLDHGVLLVGFGKGAYAPI 240 250 260 270 280 290 300 310 290 300 310 320 330 gi|115 ESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV . .. ::..:::::..:: :: :. . : : ::. 
: .: S11862 RLKEKPYWIIKNSWGQNWGEQGYYKICRGR-NVCGVDSMVSTVAAAQSNH 320 330 340 350 360 --------------------------------------------------------------------------- >>S07051 cysteine proteinase (EC 3.4.22.-) precursor - Trypanosoma brucei (450 aa) initn: 461 init1: 277 opt: 743 Z-score: 854.6 expect() 9.1e-41 Smith-Waterman score: 743; 40.233% identity in 343 aa overlap Entrez lookup Re-search database >S07051 3- 333:---------------------------------------------------------------------: 10 20 30 40 50 60 gi|115 MNPTLILA-AFCLG-IASATLTFDHSLEAQWTKWKAMHNRLY-GMNEEGWRRAVWEKNMKMIELHNQEYR :...:: : ::. .: ..: ..::: ... .: ....: .::..: ..:.::.. ... S07051 MPRTEMVRFVRLPVVLLAMAACLASVALGSLHVEESLEMRFAAFKKKYGKVYKDAKEEAFRFRAFEENMEQAKIQAA--- 10 20 30 40 50 60 70 70 80 90 100 110 120 130 140 gi|115 EGKHSFTMAMNAFGDMTSEEFR-QVMNG---FQNRKPRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSA .. :.... :.::: :::: . :: : . : :. . .:: .::::::: ::::: :::::::::::. S07051 -ANPYATFGVTPFSDMTREEFRARYRNGASYFAAAQKRLRKTVNVTT-GRAPAAVDWREKGAVTPVKVQGQCGSCWAFST 80 90 100 110 120 130 140 150 150 160 170 180 190 200 210 gi|115 TGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQY-VQDNGG-LDSEESYPY---EATEESCKYNPKYS : .::: . :.::::: ::.:. . :::::::: ::.. :..::: . .: :::: .. . .:..: . S07051 IGNIEGQWQVAGNPLVSLSEQMLVSCDTI--DSGCNGGLMDNAFNWIVNSNGGNVFTEASYPYVSGNGEQPQCQMNGHEI 160 170 180 190 200 210 220 230 220 230 240 250 260 270 280 290 gi|115 VANDTGFVDIPKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLV : : ::.:..: :. .: ::...:.:: :::. :. :: .:.:...:::::.:::. ...: ::.. S07051 GAAITDHVDLPQDEDAIAAYLAENGPLAIAVDA--ESFMDYNGGIL--TSCTSKQLDHGVLLVGYN----DNSNPPYWII 240 250 260 270 280 290 300 300 310 320 330 gi|115 KNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV ::::.. :: ::... : :.: . 
.:.: .: S07051 KNSWSNMWGEDGYIRIEKGT-NQCLMNQAVSSAVVGGPTPPPPPPPPPSATFTQDFCEGKGCTKGCSHATFPTGECVQTT 310 320 330 340 350 360 370 380 --------------------------------------------------------------------------- >>S42882 cysteine proteinase (EC 3.4.22.-) precursor - spring vetch (358 aa) initn: 606 init1: 287 opt: 737 Z-score: 849.2 expect() 1.8e-40 Smith-Waterman score: 737; 38.390% identity in 323 aa overlap Entrez lookup Re-search database >S42882 22- 329: -----------------------------------------------------------------: 10 20 30 40 50 60 gi|115 MNPTLILAAFCLGIASATLTFDHSLEAQ--WTKWKAMHNRLYGMNEE-GWRRAVWEKNMKMIELHNQE :: :.:. .:..:. .. :. .:: .: .:.. :. .::.. S42882 MDRRFIFALFLFAATATAATDDFLIRQVVDNEEDHLLNAEHHFTSFKSKFSKSYATKEEHDYRFGVFKANLIKAKLHQKL 10 20 30 40 50 60 70 80 70 80 90 100 110 120 130 140 gi|115 YREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLF--YEAPRSVDWREKGYVTPVKNQGQCGSCWAFSA ..:..: :.:.:. :::. . :...: ... . :.. . :.. :::::: :::::.::.::::::::. S42882 DPTAEHGIT----KFSDLTASEFRRQFLGLNKRLRLPAHAQKAPILPTTNLPEDFDWREKGAVTPVKDQGSCGSCWAFST 90 100 110 120 130 140 150 150 160 170 180 190 200 210 gi|115 TGALEGQMFRKTGRLISLSEQNLVDCS---GPQ--G--NEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPK :::::: . ::.:.:::::.::::. :. : . :::::::. ::.:. ..::. .:..: : . . :::.. . S42882 TGALEGAHYLATGKLVSLSEQQLVDCDHVCDPEEAGSCDSGCNGGLMNNAFEYLLQSGGVVQEKDYAYTGRDGSCKFDKS 160 170 180 190 200 210 220 230 220 230 240 250 260 270 280 290 gi|115 YSVANDTGFVDIPKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYG---FESTESDNN ::. ..: . .:. . .. ::..:::.:. . : :. :.. .:::::.::.: . . .. S42882 KVVASVSNFSVVSLDEEQIAANLVKNGPLAVAINAAWMQA--YMSGVSCPYVCAKARLDHGVLLVGFGKGAYAPIRLKEK 240 250 260 270 280 290 300 310 300 310 320 330 gi|115 KYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV ::..:::::..:: :: :. . : : ::. 
: .: S42882 PYWIIKNSWGQNWGEQGYYKICRGR-NVCGVDSMVSTVAAAQSNN 320 330 340 350 --------------------------------------------------------------------------- >>S12099 cysteine proteinase (EC 3.4.22.-) precursor - Trypanosoma brucei (450 aa) initn: 470 init1: 286 opt: 737 Z-score: 847.7 expect() 2.2e-40 Smith-Waterman score: 737; 39.942% identity in 343 aa overlap Entrez lookup Re-search database >S12099 3- 333:---------------------------------------------------------------------: 10 20 30 40 50 60 gi|115 MNPTLILA-AFCLG-IASATLTFDHSLEAQWTKWKAMHNRLY-GMNEEGWRRAVWEKNMKMIELHNQEYR :...:: : ::. .: ..: ..::: ... .: ....: .::..: ..:.::.. ... S12099 MPRTEMVRFVRLPVVLLAMAACLASVALGSLHVEESLEMRFAAFKKKYGKVYKDAKEEAFRFRAFEENMEQAKIQAA--- 10 20 30 40 50 60 70 70 80 90 100 110 120 130 140 gi|115 EGKHSFTMAMNAFGDMTSEEFR-QVMNG---FQNRKPRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSA .. :.... :.::: :::: . :: : . : :. . .:: .::::::: :::::.:::::::::::. S12099 -ANPYATFGVTPFSDMTREEFRARYRNGASYFAAAQKRVRKTVNVTT-GRAPAAVDWREKGAVTPVKDQGQCGSCWAFST 80 90 100 110 120 130 140 150 150 160 170 180 190 200 210 gi|115 TGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQY-VQDNGG-LDSEESYPY---EATEESCKYNPKYS : .::: . :.::::: ::.:. . ::.::::: ::.. :..::: . .: :::: .. . .:..: . S12099 IGNIEGQWQVAGNPLVSLSEQMLVSCDTI--DFGCGGGLMDNAFNWIVNSNGGNVFTEASYPYVSGNGEQPQCQMNGHEI 160 170 180 190 200 210 220 230 220 230 240 250 260 270 280 290 gi|115 VANDTGFVDIPKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLV : : ::.:..: :. .: ::...:.:: ::. :. :: .:.::..:::::.:::. ...: ::.. S12099 GAAITDHVDLPQDEDAIAAYLAENGPLAIAVDA--TSFMDYNGGIL--TSCTSEQLDHGVLLVGYN----DNSNPPYWII 240 250 260 270 280 290 300 300 310 320 330 gi|115 KNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV ::::.. :: ::... : :.: . 
.:.: .: S12099 KNSWSNMWGEDGYIRIEKGT-NQCLMNQAVSSAVVGGPTPPPPPPPPPSATFTQDFCEGKGCTKGCSHATFPTGECVQTT 310 320 330 340 350 360 370 380 --------------------------------------------------------------------------- >>TAGB actinidain (EC 3.4.22.14) precursor - kiwi fruit (380 aa) initn: 536 init1: 353 opt: 735 Z-score: 846.5 expect() 2.6e-40 Smith-Waterman score: 735; 36.686% identity in 338 aa overlap Entrez lookup Re-search database >TAGB 4- 331: --------------------------------------------------------------------: 10 20 30 40 50 60 gi|115 MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGMNEEGWRR--AVWEKNMKMIELHNQE ::.. .. .. . : . ..:.. .: ... :. : :.: ........:. :: . TAGB MGLPKSFVSMSLLFFSTLLILSLAFNAKNLTQRTNDEVKAMYESWLIKYGKSYNSLGE-WERRFEIFKETLRFIDEHNAD 10 20 30 40 50 60 70 70 80 90 100 110 120 130 140 gi|115 YREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQ--EPLFYEA-PRSVDWREKGYVTPVKNQGQCGSCWAFS ..:. ...: :.:.:.::::... :: . . : :: . :: .. : :::: : :. .:.::.::.::::: TAGB T---NRSYKVGLNQFADLTDEEFRSTYLGFTSGS-NKTKVSNRYEPRVGQVLPSYVDWRSAGAVVDIKSQGECGGCWAFS 80 90 100 110 120 130 140 150 150 160 170 180 190 200 210 gi|115 ATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCK---YNPKYSV : ...:: :: :::::::.:.::. :...::::: . .::.. .:::...::.::: : . :. : :: : TAGB AIATVEGINKIVTGVLISLSEQELIDCGRTQNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNVELQNEKY-V 160 170 180 190 200 210 220 230 220 230 240 250 260 270 280 290 gi|115 ANDTGFVDIPKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVK . :: . ..: ... .....: :.:::.::. ..: :. ::. : :.. .::.: .:::: :. . ::.:: TAGB TIDT-YENVPYNNEWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGP-CGTA-IDHAVTIVGYGTEG----GIDYWIVK 240 250 260 270 280 290 300 300 310 320 330 gi|115 NSWGEEWGMGGYVKMAKD--RRNHCGIASAASYPTV ::: :: ::... .. . ::::. :::. 
TAGB NSWDTTWGEEGYMRILRNVGGAGTCGIATMPSYPVKYNNQNYPEPYSSLINPPAFSMSKDGPVGVEDGQRYSA 310 320 330 340 350 360 370 380 --------------------------------------------------------------------------- >>KHRZOG oryzain (EC 3.4.22.-) gamma precursor - rice (362 aa) initn: 590 init1: 449 opt: 731 Z-score: 842.3 expect() 4.4e-40 Smith-Waterman score: 731; 39.490% identity in 314 aa overlap Entrez lookup Re-search database >KHRZOG 28- 333: ---------------------------------------------------------------: 10 20 30 40 50 60 gi|115 MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGMNEEGWRR-AVWEKNMKMIELHNQEY ..... . :.. :: : :: .. ....... : KHRZOG VAAASSGFDDSNPIRSVTDHAASALESTVIAALGRTRGALRFARFAVRHGKRYGDAAEVQRRFRIFSESLELVRSTN--- 30 40 50 60 70 80 90 70 80 90 100 110 120 130 140 gi|115 REGKHSFTMAMNAFGDMTSEEFR-QVMNGFQNRKPR---KGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFS :.: . ...: :.::. :::. . ... :: . . .. . : . :.. :::: : :.:::.::.::::: :: KHRZOG RRGL-PYRLGINRFADMSWEEFQASRLGAAQNCSATLAGNHRMRDAPAL---PETKDWREDGIVSPVKDQGHCGSCWPFS 100 110 120 130 140 150 160 170 150 160 170 180 190 200 210 220 gi|115 ATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVAND .::.::... . :: .:::::.:.::. .: ::.::: . ::.:.. :::::.::.::: ... :.:.:. . .. KHRZOG TTGSLEARYTQATGPPVSLSEQQLADCATRYNNFGCSGGLPSQAFEYIKYNGGLDTEEAYPYTGVNGICHYKPENAGVKV 180 190 200 210 220 230 240 250 230 240 250 260 270 280 290 gi|115 TGFVDIPK-QEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMD--HGVLVVGYGFESTESDNNKYWLVK :.: : : .::. : :.:::... . .: .:: :.: :.. :: :.::.:::: : .. :::.: KHRZOG LDSVNITLVAEDELKNAVGLVRPVSVAFQVIN-GFRMYKSGVYTSDHCGTSPMDVNHAVLAVGYGVE----NGVPYWLIK 260 270 280 290 300 310 320 300 310 320 330 gi|115 NSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV :::: .:: .:: : . : ::::. 
:::: : KHRZOG NSWGADWGDNGYFTMEMGK-NMCGIATCASYPIVA 330 340 350 360 --------------------------------------------------------------------------- >>A45629 cruzipain - Trypanosoma cruzi (467 aa) initn: 444 init1: 283 opt: 726 Z-score: 834.9 expect() 1.1e-39 Smith-Waterman score: 726; 37.901% identity in 343 aa overlap Entrez lookup Re-search database >A45629 1- 333:---------------------------------------------------------------------: 10 20 30 40 50 60 70 gi|115 MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLY-GMNEEGWRRAVWEKNMKMIELHNQEYREGK . .:.. : . :.:.: ...: .:....: :.:.: . :: .: .:...:. . .:: . A45629 MSGWARALSLAAVLVVMACLVPAATASLHAEETLASQFVEFKQKHGRVYESAAEERFRLSVFRENLFLARLHAAA---NP 10 20 30 40 50 60 70 80 90 100 110 120 130 140 gi|115 HSFTMAMNAFGDMTSEEFR-QVMNG---FQNRKPRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGA :. :.... :.:.: :::: . :: : . : ..: . :: .:::: .: :: ::.:::::::::::: : A45629 HA-TFGVTPFSDLTREEFRSRYHNGAAHFAAAQER-ARVPVNVEVVGAPAAVDWRARGAVTAVKDQGQCGSCWAFSAIGN 80 90 100 110 120 130 140 150 150 160 170 180 190 200 210 220 gi|115 LEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYV--QDNGGLDSEESYPY---EATEESCKYNPKYSVAN .: : : : .:::: ::.:. . . ::.::::. ::... ..::.. .:.:::: :. : . . :. A45629 VECQWFLAGHPLTNLSEQMLVSCD--KTDSGCGGGLMNNAFEWIVQENNGAVYTEDSYPYASGEGISPPCTTSGHTVGAT 160 170 180 190 200 210 220 230 230 240 250 260 270 280 290 300 gi|115 DTGFVDIPKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNS :: :..:..: . ::. ::..::.::. :.. : :.. .: ::..:::::.:::. .: ::..::: A45629 ITGHVELPQDEAQIAACVAVNGPVAVAVDAS--SWMTYTGGVM--TSCVSEQLDHGVLLVGYN----DSAAVPYWIIKNS 240 250 260 270 280 290 300 310 320 330 gi|115 WGEEWGMGGYVKMAKDRRNHCGIASAASYPTV : .:: ::...:: :.: . 
:: .: A45629 WTAQWGEDGYIRIAKGS-NQCLVKEEASSAVVGGPGPTPEPTTTTTTSAPGPSPSYFVQMSCTDAACIVGCENVTLPTGQ 310 320 330 340 350 360 370 380 --------------------------------------------------------------------------- >>S41428 cysteine proteinase (EC 3.4.22.-) CP2 precursor - Trichomonas vagina (314 aa) initn: 437 init1: 277 opt: 723 Z-score: 834.0 expect() 1.3e-39 Smith-Waterman score: 730; 37.615% identity in 327 aa overlap Entrez lookup Re-search database >S41428 9- 331: -------------------------------------------------------------------: 10 20 30 40 50 60 70 80 gi|115 MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGMNEEGWRRAVWEKNMKMIELHNQEYREGKHSFTMAMNAF :: :. :.... . : .: . . : . : .: .: ... : .... :: . .: ...: . S41428 MFAFLLSGATSNV-LKHEEKAFLAYMRETGNFFTG-DEYHFRLGIYLANKRLVQEHNAANK----GFKLGLNKL 10 20 30 40 50 60 90 100 110 120 130 140 150 gi|115 GDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEAPRS--VDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRL . .:. :.:... : . ..:. :. .:: . ::::.:: :. .:.:::::::::::: : :... . . .: S41428 AHLTQSEYRSLL-GAKRLGQKSGNFFKC----DAPANDAVDWRDKGIVNKIKDQGQCGSCWAFSAIQASESRYAQANKQL 70 80 90 100 110 120 130 140 160 170 180 190 200 210 220 230 gi|115 ISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYV--QDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPKQEKALM ..:.:::.::: . ::::: . :..:: .. : . .::: : . .::.. . ::. :. .. : : S41428 LDLAEQNIVDCV--TSCYGCNGGWPSKAIDYVVKHQAGKFMLTADYPYTARDGTCKFHASKSVGLTKGYDEVKDTEAELA 150 160 170 180 190 200 210 220 240 250 260 270 280 290 300 310 gi|115 KAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMAK :: :. : .:: :::.: :: .: ::: ::.::. ..::.: .:::: :.... ::.:.:::: :: ::..: : S41428 KA-ASKGVVSVCIDASHYSFQLYTSGIYDEPSCSAWNLDHAVGLVGYGTEGSKN----YWIVRNSWGTSWGEQGYIRMIK 230 240 250 260 270 280 290 320 330 gi|115 DRRNHCGIASAASYPTV :. 
:.::::: : : S41428 DKSNQCGIASEAILPKAL 300 310 --------------------------------------------------------------------------- >>A60667 cysteine proteinase cruzain (EC 3.4.22.-) - Trypanosoma cruzi (467 aa) initn: 434 init1: 275 opt: 723 Z-score: 831.5 expect() 1.8e-39 Smith-Waterman score: 723; 37.609% identity in 343 aa overlap Entrez lookup Re-search database >A60667 1- 333:---------------------------------------------------------------------: 10 20 30 40 50 60 70 gi|115 MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLY-GMNEEGWRRAVWEKNMKMIELHNQEYREGK . .:.. : . :.:.: ...: .:....: :.:.: . ::..: .:...:. . .:: . A60667 MSGWARALLLAAVLVVMACLVPAATASLHAEETLTSQFAEFKQKHGRVYESAAEEAFRLSVFRENLFLARLHAAA---NP 10 20 30 40 50 60 70 80 90 100 110 120 130 140 gi|115 HSFTMAMNAFGDMTSEEFR-QVMNG---FQNRKPRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGA :. :.... :.:.: :::: . :: : . : ..: . :: .:::: .: :: ::.:::::::::::: : A60667 HA-TFGVTPFSDLTREEFRSRYHNGAAHFAAAQER-ARVPVKVEVVGAPAAVDWRARGAVTAVKDQGQCGSCWAFSAIGN 80 90 100 110 120 130 140 150 150 160 170 180 190 200 210 220 gi|115 LEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYV--QDNGGLDSEESYPY---EATEESCKYNPKYSVAN .: : : : .:::: ::.:. . . ::.::::. ::... ..::.. .:.:::: :. : . . :. A60667 VECQWFLAGHPLTNLSEQMLVSCD--KTDSGCSGGLMNNAFEWIVQENNGAVYTEDSYPYASGEGISPPCTTSGHTVGAT 160 170 180 190 200 210 220 230 230 240 250 260 270 280 290 300 gi|115 DTGFVDIPKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNS :: :..:..: . .:. ::..::.::. :.. : :.. .: ::..:::::.:::. .: ::..::: A60667 ITGHVELPQDEAQIAAWLAVNGPVAVAVDAS--SWMTYTGGVM--TSCVSEQLDHGVLLVGYN----DSAAVPYWIIKNS 240 250 260 270 280 290 300 310 320 330 gi|115 WGEEWGMGGYVKMAKDRRNHCGIASAASYPTV : .:: ::...:: :.: . 
:: .: A60667 WTTQWGEEGYIRIAKGS-NQCLVKEEASSAVVGGPGPTPEPTTTTTTSAPGPSPSYFVQMSCTDAACIVGCENVTLPTGQ 310 320 330 340 350 360 370 380 --------------------------------------------------------------------------- >>KHBH aleurain (EC 3.4.22.-) precursor - barley (361 aa) initn: 494 init1: 335 opt: 719 Z-score: 828.6 expect() 2.6e-39 Smith-Waterman score: 719; 40.252% identity in 318 aa overlap Entrez lookup Re-search database >KHBH 23- 333: ----------------------------------------------------------------: 10 20 30 40 50 60 gi|115 MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGMNEEGWRR-AVWEKNMKMIEL :.: ..... . ... : : :: .. .... .. KHBH AAVAVASSSSFADSNPIRPVTDRAASTLESAVLGALGRTRHAL--RFARFAVRYGKSYESAAEVRRRFRIFSESLEEVRS 20 30 40 50 60 70 80 90 70 80 90 100 110 120 130 gi|115 HNQEYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEA---PRSVDWREKGYVTPVKNQGQCGSC : :.: . ...: :.::. :::. . : . .. . . :. .: :.. :::: : :.:::::..:::: KHBH TN---RKGL-PYRLGINRFSDMSWEEFQATRLG--AAQTCSATLAGNHLMRDAAALPETKDWREDGIVSPVKNQAHCGSC 100 110 120 130 140 150 160 140 150 160 170 180 190 200 210 gi|115 WAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYS :.::.:::::. . . ::. ::::::.::::.: .: :::::: . ::.: : :::.:.::::::.... :.:. . . KHBH WTFSTTGALEAAYTQATGKNISLSEQQLVDCAGGFNNFGCNGGLPSQAFEY-QYNGGIDTEESYPYKGVNGVCHYKAENA 170 180 190 200 210 220 230 240 220 230 240 250 260 270 280 290 gi|115 VANDTGFVDIP-KQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSS--EDMDHGVLVVGYGFESTESDNNKY ... :.: . : : .::. : :.:::... ..: :: :.: :.. .:..:.::.:::: : .. : KHBH AVQVLDSVNITLNAEDELKNAVGLVRPVSVAFQVI-DGFRQYKSGVYTSDHCGTTPDDVNHAVLAVGYGVE----NGVPY 250 260 270 280 290 300 310 320 300 310 320 330 gi|115 WLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV ::.::::: .:: .:: :: . : :.::. 
::::.: KHBH WLTKNSWGADWGDNGYFKMEMGK-NMCAIATCASYPVVAA 330 340 350 360 --------------------------------------------------------------------------- >>JN0718 drought-inducible cysteine proteinase (EC 3.4.22.-) RD19A precursor (368 aa) initn: 521 init1: 286 opt: 719 Z-score: 828.4 expect() 2.6e-39 Smith-Waterman score: 719; 39.130% identity in 322 aa overlap Entrez lookup Re-search database >JN0718 26- 329: ----------------------------------------------------------------: 10 20 30 40 50 60 gi|115 MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGMNEE-GWRRAVWEKNMKMIELHNQ : ... .: ...:. ::: .: .:.. :.. . :.. JN0718 FSVFVLSFFIVSVSSSDVNDGDDLVIRQVVGGAEPQVLTSEDHFSLFKRKFGKVYASNEEHDYRFSVFKANLRRARRHQK 10 20 30 40 50 60 70 80 70 80 90 100 110 120 130 gi|115 EYREGKHSFTMAMNAFGDMTSEEFRQ----VMNGFQNRKPRKGKVFQEPLF--YEAPRSVDWREKGYVTPVKNQGQCGSC . :. :. :.:.: :::. : .:: . :. .. . :.. . :.. :::..: ::::::::.:::: JN0718 LDPSATHGVTQ----FSDLTRSEFRKKHLGVRSGF--KLPKDAN--KAPILPTENLPEDFDWRDHGAVTPVKNQGSCGSC 90 100 110 120 130 140 150 140 150 160 170 180 190 200 210 gi|115 WAFSATGALEGQMFRKTGRLISLSEQNLVDCS---GPQG----NEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATE-ES :.::::::::: : ::.:.:::::.::::. :. . :::::::. ::.:. .::: .::.::: . . .. JN0718 WSFSATGALEGANFLATGKLVSLSEQQLVDCDHECDPEEADSCDSGCNGGLMNSAFEYTLKTGGLMKEEDYPYTGKDGKT 160 170 180 190 200 210 220 230 220 230 240 250 260 270 280 gi|115 CKYNPKYSVANDTGFVDIPKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYG---FES :: . . ::. ..: : .:. . .. ::..:::.::. . : :. :. . ..::::.:::: . JN0718 CKLDKSKIVASVSNFSVISIDEEQIAANLVKNGPLAVAINAGYMQT--YIGGVSCPYICTRR-LNHGVLLVGYGAAGYAP 240 250 260 270 280 290 300 310 290 300 310 320 330 gi|115 TESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV .. .. ::..:::::: :: .:. :. : : : ::. 
: .: JN0718 ARFKEKPYWIIKNSWGETWGENGFYKICKGR-NICGVDSMVSTVAATVSTTAH 320 330 340 350 360 --------------------------------------------------------------------------- >>S55923 cysteine proteinase (EC 3.4.22.-) precursor - soybean (380 aa) initn: 584 init1: 273 opt: 709 Z-score: 816.8 expect() 1.2e-38 Smith-Waterman score: 709; 37.462% identity in 331 aa overlap Entrez lookup Re-search database >S55923 22- 333: -----------------------------------------------------------------: 10 20 30 40 50 gi|115 MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKA-MHN--RLYGMNEEGWRR-AVWEKNMK :. : :.:. :.: : :. .:: :: ... .:: S55923 KRGHALMCLARVSLFLCALTLSAAHGSTTVQDIARKLKLGDNELLRTEKKFKVFMENYGRSYSTEEEYLRRLGIFAQNMV 10 20 30 40 50 60 70 80 60 70 80 90 100 110 120 130 gi|115 MIELHNQEYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKV---FQEPLFYEA-PRSVDWREKGYVTPVKNQG :. . :. :. :.:.: .::.....: .. : .... . :: .. :.. :::::: :: :: :: S55923 RAAEHQALDPTAVHGVTQ----FSDLTEDEFEKLYTGVNGGFPSSNNAAGGIAPPLEVDGLPENFDWREKGAVTEVKLQG 90 100 110 120 130 140 150 140 150 160 170 180 190 200 gi|115 QCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSG-------PQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEA .::::::::.::..:: : ::.:.:::::.:.::.. . ..::::::: :..:. ..:::. : :::: . S55923 RCGSCWAFSTTGSIEGANFLATGKLVSLSEQQLLDCDNKCDITEKTSCDNGCNGGLMTNAYNYLLESGGLEEESSYPYTG 160 170 180 190 200 210 220 230 210 220 230 240 250 260 270 280 gi|115 TEESCKYNPKYSVANDTGFVDIPKQEKALMKAVATVGPISVAIDAGHESFL-FYKEGIYFEPDCSSEDMDHGVLVVGYG- . ::..:. ... :.:..:: .:. . .. ::......: :. : :. ::.. ..::::.:::: S55923 ERGECKFDPEKIAVKITNFTNIPADENQIAAYLVKNGPLAMGVNA---IFMQTYIGGVSCPLICSKKRLNHGVLLVGYGA 240 250 260 270 280 290 300 310 290 300 310 320 330 gi|115 --FESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV : . :. ::..::::::.:: :: :. . . . ::: . 
.: : S55923 KGFSILRLGNKPYWIIKNSWGEKWGEDGYYKLCRGH-GMCGINTMVSAAMVPQPQTTPTKNYASY 320 330 340 350 360 370 380 --------------------------------------------------------------------------- >>S46535 probable cysteine proteinase (EC 3.4.22.-) (clone A1494) - Arabidops (313 aa) initn: 517 init1: 285 opt: 706 Z-score: 814.6 expect() 1.5e-38 Smith-Waterman score: 706; 39.809% identity in 314 aa overlap Entrez lookup Re-search database >S46535 32- 329: --------------------------------------------------------------: 10 20 30 40 50 60 70 gi|115 MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGMNEEGWRR-AVWEKNMKMIELHNQEYREGKHSFTMAMNA .: ...:: :: . : .:.. :. :.. ..:. :. S46535 ALFKKKFGKVYGSIEEHYYRFSVFKANLLRAMRHQKMDPSARHGVTQ---- 10 20 30 40 80 90 100 110 120 130 140 150 gi|115 FGDMTSEEFRQVMNGFQN--RKPRKGKVFQEPLF--YEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKT :.:.: :::. : .. . :. .. : :.. . :. :::..: ::::::::.:::::.::.:::::: : : S46535 FSDLTRSEFRRKHLGVKGGFKLPKDAN--QAPILPTQNLPEEFDWRDRGAVTPVKNQGSCGSCWSFSTTGALEGAHFLAT 50 60 70 80 90 100 110 120 160 170 180 190 200 210 220 gi|115 GRLISLSEQNLVDCS---GPQ--G--NEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEE-SCKYNPKYSVANDTGFVD :.:.:::::.::::. :. : . :::::::. ::.:. .::: :..::: .:. ::: . . ::. ..: S46535 GKLVSLSEQQLVDCDHECDPEEEGSCDSGCNGGLMNSAFEYTLKTGGLMREKDYPYTGTDGGSCKLDRSKIVASVSNFSV 130 140 150 160 170 180 190 200 230 240 250 260 270 280 290 300 gi|115 IPKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYG---FESTESDNNKYWLVKNSWGE . .: . . ::..:::.:.. . : :. :: . ..::::.:::: : ... .. ::..:::::: S46535 VSINEDQIAANLIKNGPLAVAINAAYMQT--YIGGVSCPYICSRR-LNHGVLLVGYGSAGFSQARLKEKPYWIIKNSWGE 210 220 230 240 250 260 270 280 310 320 330 gi|115 EWGMGGYVKMAKDRRNHCGIASAASYPTV :: .:. :. : : : ::. 
: .: S46535 SWGENGFYKICKGR-NICGVDSLVSTVAATTS 290 300 310 --------------------------------------------------------------------------- >>B23705 cysteine proteinase (EC 3.4.22.-) - Entamoeba histolytica (strain SA (312 aa) initn: 503 init1: 187 opt: 692 Z-score: 798.6 expect() 1.2e-37 Smith-Waterman score: 692; 36.808% identity in 307 aa overlap Entrez lookup Re-search database >B23705 32- 332: --------------------------------------------------------------: 10 20 30 40 50 60 70 gi|115 MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGMNEEGWRRAVWEKNMKMIELHNQEYREGKHSFTMAMNA- : : .:. . : :::... : ... :. : :: ..... B23705 VILMLAIANAIDFNTWAANNNKHFTAVEALRRRAIFNMNARFVAEFNK-----KGSFKLSVDGP 10 20 30 40 50 80 90 100 110 120 130 140 150 gi|115 FGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTG--- :. ::.::.: .... . ..::: : .::.::::: .: :::...:.:::::..:.. .::::... . : B23705 FAAMTNEEYRTLLKS-KRTVEENGKVTY--LNIQAPESVDWRAQGKVTPIRDQAQCGSCYTFGSLAALEGRLLIEKGGNA 60 70 80 90 100 110 120 130 160 170 180 190 200 210 220 230 gi|115 RLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPKQEKALM ..:::...:.:. .::.:::::: . ...:. .:: . .: .::: .:. .:: : : . :. ::. .:....: . B23705 NTLDLSEEHMVQCTRDNGNNGCNGGLGSNVYDYIIQNG-VAKESDYPYTGTDSTCKTNVK-AFAKITGYNKVPRNNEAEL 140 150 160 170 180 190 200 210 240 250 260 270 280 290 300 310 gi|115 KAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSE--DMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKM ::. . : ..:.:::. .: .:: : : . :... ..: : .:::: :... :.:.:::: :: ::..: B23705 KAALSQGLVDVSIDASSAKFQLYKSGAYSDTKCKNNFFALNHEVCAVGYGV----VDGKECWIVRNSWGTGWGDKGYINM 220 230 240 250 260 270 280 290 320 330 gi|115 AKDRRNHCGIASAASYPTV . . : ::.:. 
::: B23705 VIEG-NTCGVATDPLYPTGVQYL 300 310 --------------------------------------------------------------------------- >>A41404 cathepsin L (EC 3.4.22.15) - cat (fragment) (139 aa) initn: 687 init1: 687 opt: 687 Z-score: 798.2 expect() 1.3e-37 Smith-Waterman score: 687; 68.345% identity in 139 aa overlap Entrez lookup Re-search database >A41404 180- 318: -----------------------------: 140 150 160 170 180 190 200 210 gi|115 AFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSV :::.: :::::.:::::::::::::.: .:::: :. :: A41404 GGLIDDAFQYVKDNGGLDSEESYPYHAQGDSCKYRPENSV 10 20 30 40 220 230 240 250 260 270 280 290 gi|115 ANDTGFVDIPKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVK :: : . :::..:. :: ..:.:::::.::::. ..: :::::::..:.:::::.:::::::::: ..::..:.:::..: A41404 ANVTDYWDIPSKENELMITLAAVGPISAAIDASLDTFRFYKEGIYYDPSCSSEDVDHGVLVVGYGADGTETENKKYWIIK 50 60 70 80 90 100 110 120 300 310 320 330 gi|115 NSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV :::: .::: ::.:::::: A41404 NSWGTDWGMDGYIKMAKDR 130 --------------------------------------------------------------------------- >>S02728 actinidain (EC 3.4.22.14) precursor (clone pAC.1) - kiwi fruit (frag (302 aa) initn: 527 init1: 346 opt: 688 Z-score: 794.3 expect() 2.1e-37 Smith-Waterman score: 688; 38.516% identity in 283 aa overlap Entrez lookup Re-search database >S02728 56- 331: ---------------------------------------------------------: 20 30 40 50 60 70 80 90 gi|115 SATLTFDHSLEAQWTKWKAMHNRLYGMNEEGWRRAVWEKNMKMIELHNQEYREGKHSFTMAMNAFGDMTSEEFRQVMNGF ...:. :: . ..:. ...: :.:.:.::::... :: S02728 LRFIDEHNADT---NRSYKVGLNQFADLTGEEFRSTYLGF 10 20 30 100 110 120 130 140 150 160 170 gi|115 Q--NRKPRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQ . : . .. .. . : :::: : :. .:.::.::.:::::: ...:: :: :::::::.:. 
:.: : S02728 TGGSNKTKVSNRYEPRVSQVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIGCGGTQ 40 50 60 70 80 90 100 110 180 190 200 210 220 230 240 250 gi|115 GNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKY---NPKYSVANDTGFVDIPKQEKALMKAVATVGPISVAID ...::::: . .::.. .:::... :.::: : . :. : :: :. :: . ..: ... .....: :.:::.: S02728 NTRGCNGGYITDGFQFIINNGGINTGENYPYTAQDGECNLDLQNEKY-VTIDT-YGNVPYNNEWALQTAVTYQPVSVALD 120 130 140 150 160 170 180 190 260 270 280 290 300 310 320 gi|115 AGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDR--RNHCGIASAA :. ..: :. ::. : :.. .::.: .:::: :. . ::.:.::: :: ::... .. . ::::. S02728 AAGDAFKHYSSGIFTGP-CGTA-IDHAVTIVGYGTEG----GIDYWIVENSWDTTWGEEGYMRILRNVGGAGTCGIATMP 200 210 220 230 240 250 260 330 gi|115 SYPTV :::. S02728 SYPVKYNNQNYPKPYSSLINPSAFSMSKDGPVE 270 280 290 300 --------------------------------------------------------------------------- >>S37048 cysteine proteinase - Trypanosoma congolense (447 aa) initn: 431 init1: 259 opt: 679 Z-score: 781.4 expect() 1.1e-36 Smith-Waterman score: 679; 36.950% identity in 341 aa overlap Entrez lookup Re-search database >S37048 5- 333: --------------------------------------------------------------------: 10 20 30 40 50 60 gi|115 MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLY-GMNEEGWRRAVWEKNMKMIELHNQEYR : .:: . .: ..: ..::. :.. .: ..: : .::..: :...:: : ..: S37048 MPRSEMTRTLRFSVGLLAVAACFVPVALGVLHAEQSLQQQFAAFKQKYSRSYKDATEEAFRFRVFKQNM---ERAKEEAA 10 20 30 40 50 60 70 70 80 90 100 110 120 130 140 gi|115 EGKHSFTMAMNAFGDMTSEEFRQVM-NGFQN-----RKPRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAF . .. :.... :.::. :::: .. :: . ..::: . .:: .::::.:: :::::.:: ::::::: S37048 ANPYA-TFGVTRFSDMSPEEFRATYHNGAEYYAAALKRPRKVVNVSTG---KAPPAVDWRKKGAVTPVKDQGACGSCWAF 80 90 100 110 120 130 140 150 150 160 170 180 190 200 210 gi|115 SATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYV--QDNGGLDSEESYPYEA---TEESCKYNPK :: : .::: .: ::::: ::.:. . :: ::::: ..:.. ...:.. . .:::: . :. . 
: S37048 SAIGNIEGQWKVAGHELTSLSEQMLVSCDTT--DYGCRGGLMDKSLQWIVSSNKGNVFTAQSYPYASGGGKMPPCNKSGK 160 170 180 190 200 210 220 230 220 230 240 250 260 270 280 290 gi|115 YSVANDTGFVDIPKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYW :. .: ...::.:.:. . .: ::...:.:: ::: :: :. .: :. .:: ::.::: .... :: S37048 VVGAKISGHINLPKDENAIAEWLAKNGPVAIAVDA--TSFLGYKGGVL--TSCISKGLDHDVLLVGYD----DTSKPPYW 240 250 260 270 280 290 300 300 310 320 330 gi|115 LVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV ..::::.. :: ::... : :.: . . : .: S37048 IIKNSWSKGWGEEGYIRIEKGT-NQCLMKNYARSAVVSGPPPPPPPPASTFTQEFCEGAECQSGCTKATFPTGKCVQFGG 310 320 330 340 350 360 370 380 --------------------------------------------------------------------------- >>KHDO cysteine proteinase 1 (EC 3.4.22.-) precursor - slime mold (Dictyostel (343 aa) initn: 506 init1: 280 opt: 669 Z-score: 771.7 expect() 3.7e-36 Smith-Waterman score: 753; 36.471% identity in 340 aa overlap Entrez lookup Re-search database >KHDO 5- 329: --------------------------------------------------------------------: 10 20 30 40 50 60 70 gi|115 MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGMNEEGWRRAVWEKNMKMIELHNQEYREGKHSFTMAMN ..::.: . ..: . .. ..:. ... :. :. .: : ....:. :: : . : . ...: KHDO MKVILLFVLAVFTVFVSSRGIPPEE--QSQFLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEELNLIAINHKADTKFGVN 10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 gi|115 AFGDMTSEEFRQV-MNG----FQNRKPRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFR :.:..:.::.. .:. : . : . ... .. : . ::: .: ::::::::::::::.::.:: .::: : KHDO KFADLSSDEFKNYYLNNKEAIFTDDLPV-ADYLDDEFINSIPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFI 80 90 100 110 120 130 140 150 160 170 180 190 200 210 220 gi|115 KTGRLISLSEQNLVDCS-------GPQG-NEGCNGGLMDYAFQYVQDNGGLDSEESYPYEA-TEESCKYNPKYSVANDTG . ..:.::::::::::. : .. .:::::::. :..:. :::...: :::: : : .:..: :. .. 
KHDO SQNKLVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIKNGGIQTESSYPYTAETGTQCNFNSANIGAKISN 160 170 180 190 200 210 220 230 230 240 250 260 270 280 290 300 gi|115 FVDIPKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFEST-ESDNNKYWLVKNSWG :. :::.: .. ....::...: :: . . :: :. :. :. ...:::.:.:::. ..: : ::.:::::: KHDO FTMIPKNETVMAGYIVSTGPLAIAADAVE--WQFYIGGV-FDIPCNPNSLDHGILIVGYSAKNTIFRKNMPYWIVKNSWG 240 250 260 270 280 290 300 310 310 320 330 gi|115 EEWGMGGYVKMAKDRRNHCGIASAASYPTV .:: ::. . . . : ::... .: KHDO ADWGEQGYIYLRRGK-NTCGVSNFVSTSII 320 330 340 --------------------------------------------------------------------------- >>JA0159 cysteine proteinase (EC 3.4.22.-) precursor - tomato (fragment) (346 aa) initn: 435 init1: 298 opt: 669 Z-score: 771.7 expect() 3.8e-36 Smith-Waterman score: 669; 47.748% identity in 222 aa overlap Entrez lookup Re-search database >JA0159 115- 331: ---------------------------------------------: 80 90 100 110 120 130 140 150 gi|115 MAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRK :.:.:::::: .. ::.::.:::::::::..:.:. JA0159 KLSKNKSDRYLPKVGDSLPESIDWREKGVLVGVKDQGSCGSCWAFSAVAAMESINAIV 10 20 30 40 50 160 170 180 190 200 210 220 230 gi|115 TGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESC-KYNPKYSVANDTGFVDIP-KQE :: :::::::.::::. ::::.::::::::..: :::.:.::.:::. . : .: . .:.. .. :.: ..: JA0159 TGNLISLSEQELVDCDRSY-NEGCDGGLMDYAFEFVIKNGGIDTEEDYPYKERNGVCDQYRKNAKVVKIDSYEDVPVNNE 60 70 80 90 100 110 120 130 240 250 260 270 280 290 300 310 gi|115 KALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYV :::.:::: :.:.:..:: ..: :: :: : :.. .::::...::: : .. ::.:.:::: . .::. JA0159 KALQKAVAH-QPVSIALEAGGRDFQHYKSGI-FTGKCGTA-VDHGVVIAGYGTE----NGMDYWIVRNSWGANCRENGYL 140 150 160 170 180 190 200 210 320 330 gi|115 KMAKDRRNH---CGIASAASYPTV .. .. . ::.: :::. 
JA0159 RVQRNVSSSSGLCGLAIEPSYPVKTGPNPPKPAPSPPSPVKPPTECDEYSQCAVGTTCCCILQFRRSCFSWGCCPLEGAT 220 230 240 250 260 270 280 290 --------------------------------------------------------------------------- >>S24988 cysteine proteinase (EC 3.4.22.-) precursor - tomato (361 aa) initn: 497 init1: 274 opt: 668 Z-score: 770.2 expect() 4.5e-36 Smith-Waterman score: 668; 37.846% identity in 325 aa overlap Entrez lookup Re-search database >S24988 22- 329: -----------------------------------------------------------------: 10 20 30 40 50 60 gi|115 MNPTLILAAFCLGIASATLTFDHSLEAQ--WTKWKAMHNRLYGMNEE-GWRRAVWEKNMKMIELHN .: :.:. .. .:: ...:. .:: : :.. :.. . :. S24988 RLFLLSFLAFALFSSAIAFSDDDPLIRQVVSGNDDNHMLNAEHHFSLFKAKFGKIYASQEEHDHRLKVFKANLHRAKRHQ 10 20 30 40 50 60 70 80 70 80 90 100 110 120 130 140 gi|115 QEYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKG-KVFQEPLF--YEAPRSVDWREKGYVTPVKNQGQCGSCWA ..:..:. :.:.: :::... :.. ::: . .. . :.. . : . :::::: :: :::::.:::::. S24988 LLDPSAEHGITQ----FSDLTPSEFRRTYLGLN--KPRPNLNAEKAPILPTKDLPSDFDWREKGAVTDVKNQGSCGSCWS 90 100 110 120 130 140 150 150 160 170 180 190 200 210 gi|115 FSATGALEGQMFRKTGRLISLSEQNLVDCS---GP-QGNE---GCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKY ::.:::.:: : ::.:.:::::.::::. : . :. ::::::: ::.:. :::. :..::: . . .:.. S24988 FSTTGAVEGAHFLATGELVSLSEQQLVDCDHECDPVEKNDCDAGCNGGLMTTAFEYTLKAGGLQLEKDYPYTGRNGKCHF 160 170 180 190 200 210 220 230 220 230 240 250 260 270 280 290 gi|115 NPKYSVANDTGFVDIPKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGY---GFESTES . . .:. ..: . .: . . ::..:.:.:. . : .:. : ... :::::.::: :: . S24988 DKSRIAASVSNFSVVGLDEDQIAANLLKHGPLAVGINAAWMQT--YVRGVSCPLICFKRQ-DHGVLLVGYGSEGFAPIRL 240 250 260 270 280 290 300 310 300 310 320 330 gi|115 DNNKYWLVKNSWGEEWGMGGYVKMAKDRRNH-CGIASAASYPTV :. ::..:::::. :: :: :. : .: ::. . 
.: S24988 KNKPYWIIKNSWGKTWGEHGYYKIC--RGHHICGVDAMVSTVTATHTTNPNL 320 330 340 350 360 --------------------------------------------------------------------------- >>A23705 cysteine proteinase (EC 3.4.22.-) - Entamoeba histolytica (strain HM (312 aa) initn: 440 init1: 193 opt: 665 Z-score: 767.8 expect() 6.2e-36 Smith-Waterman score: 665; 33.876% identity in 307 aa overlap Entrez lookup Re-search database >A23705 32- 332: --------------------------------------------------------------: 10 20 30 40 50 60 70 gi|115 MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGMNEEGWRRAVWEKNMKMIELHNQEYREGKHSFTMAMNA- : : .:. . : :::... : ... .:. :..: ..... A23705 VILMFYIGYGIDFNTWVANNNKHFTAVESLRRRAIFNMNARIVAENNR-----KETFKLSVDGP 10 20 30 40 50 80 90 100 110 120 130 140 150 gi|115 FGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTG--- :. ::.::. .... .. .::.: . : .::..::::.:: :::...::.::::..:.. .::::... . : A23705 FAAMTNEEYNSLLK-LKRSGEEKGEV--RYLNIQAPKAVDWRKKGKVTPIRDQGNCGSCYTFGSIAALEGRLLIEKGGDS 60 70 80 90 100 110 120 130 160 170 180 190 200 210 220 230 gi|115 RLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPKQEKALM . ..:::...:.:. .::.:::::: . ...:...:: . .: .::: ... .:. . : . :. .. . ..... . A23705 ETLDLSEEHMVQCTREDGNNGCNGGLGSNVYNYIMENG-IAKESDYPYTGSDSTCRSDVK-AFAKIKSYNRVARNNEVEL 140 150 160 170 180 190 200 210 240 250 260 270 280 290 300 310 gi|115 KAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSE--DMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKM ::. . : ..:.:::. .: .:: : : . .:... ..: : .:::: .:... :.:.:::: :: ::..: A23705 KAAISQGLVDVSIDASSVQFQLYKSGAYTDTQCKNNYFALNHEVCAVGYGV----ADGKECWIVRNSWGTGWGEKGYINM 220 230 240 250 260 270 280 290 320 330 gi|115 AKDRRNHCGIASAASYPTV . . : ::.:. 
::: A23705 VIEG-NTCGVATDPLYPTGVEYL 300 310 --------------------------------------------------------------------------- >>S30150 probable cysteine proteinase precursor (clone CYP-8) - common tobacc (365 aa) initn: 512 init1: 263 opt: 663 Z-score: 764.4 expect() 9.5e-36 Smith-Waterman score: 663; 37.048% identity in 332 aa overlap Entrez lookup Re-search database >S30150 14- 329: ------------------------------------------------------------------: 10 20 30 40 50 gi|115 MNPTLILAAFCLGIASATLTFD-HSLEAQ--WTKWKAMHNRLYGMNEE-GWRRAVWEKNMKMI ..: : : : : :.:. .. .:. ...:. .:: : :.. :.. S30150 MDRLFLLSLPRFALFSSAIAFPDEDPLIRQVVSETETDDSHLLNAEHHFSLFKSKFGKIYASEEEHDHRFKVFKANLRRA 10 20 30 40 50 60 70 80 60 70 80 90 100 110 120 130 gi|115 ELHNQEYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLF--YEAPRSVDWREKGYVTPVKNQGQCGS .:.. ..:..: :.:.: :::... :... :: : .. . :.. . : . :::..: :: :::::.::: S30150 RLNQLLDPSAEHGIT----KFSDLTPSEFRRTYLGLHKPKP-KVNAEKAPILPTSDLPADYDWRDHGAVTGVKNQGSCGS 90 100 110 120 130 140 150 140 150 160 170 180 190 200 210 gi|115 CWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNE-------GCNGGLMDYAFQYVQDNGGLDSEESYPYEATEES ::.::.:::.:: : ::.:.:::::.::::. .: ::.:::: ::.:. :::. :..::: . . . S30150 CWSFSTTGAVEGAHFLATGELVSLSEQQLVDCDHECDSEQQDSCDAGCGGGLMTTAFEYTLKAGGLQLEKDYPYTGKDGK 160 170 180 190 200 210 220 230 220 230 240 250 260 270 280 gi|115 CKYNPKYSVANDTGFVDIPKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYG---FES :... . .: :.: : .: . .. ::..:.:.:. . : :. : ... :::::.:::: : S30150 CHFDKSKIAAAVTNFSVIGLDEDQIAANLVKHGPLAVGINAAWMQT--YVGGVSCPLICFKRQ-DHGVLLVGYGSHGFAP 240 250 260 270 280 290 300 310 290 300 310 320 330 gi|115 TESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV . .. ::..::::::.:: :: :. . . : ::. . 
.: S30150 IRLKEKAYWIIKNSWGENWGEHGYYKICRGH-NICGVDAMVSTVTAAHTTNPNL 320 330 340 350 360 --------------------------------------------------------------------------- >>JN0633 caricain (EC 3.4.22.30) I precursor - papaya (348 aa) initn: 448 init1: 252 opt: 656 Z-score: 756.7 expect() 2.6e-35 Smith-Waterman score: 656; 34.441% identity in 331 aa overlap Entrez lookup Re-search database >JN0633 12- 332: -------------------------------------------------------------------: 10 20 30 40 50 60 gi|115 MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLY-GMNEEGWRRAVWEKNMKMIE .: .. :: . : ...: ::..: ...:. .: ... :...:. JN0633 MAMIPSISKLLFVAICLFVHMSVSFGDFSIVGYSQDDLTSTERLIQLFNSWMLNHNKFYENVDEKLYRFEIFKDNLNYID 10 20 30 40 50 60 70 80 70 80 90 100 110 120 130 gi|115 LHNQEYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEA----PRSVDWREKGYVTPVKNQGQCG :.. ..:. ...: :.:....:: . . : . . ..: .. : :..::::.:: ::::..::.:: JN0633 ETNKK----NNSYWLGLNEFADLSNDEFNEKYVGSLIDATIE-QSYDEEFINEDTVNLPENVDWRKKGAVTPVRHQGSCG 90 100 110 120 130 140 150 140 150 160 170 180 190 200 210 gi|115 SCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNP- :::::::....:: .::.:. ::::.:::: . ..::.:: ::..:: :: . . .:::.: . .:. . JN0633 SCWAFSAVATVEGINKIRTGKLVELSEQELVDCE--RRSHGCKGGYPPYALEYVAKNG-IHLRSKYPYKAKQGTCRAKQV 160 170 180 190 200 210 220 230 220 230 240 250 260 270 280 290 gi|115 KYSVANDTGFVDI-PKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNK ... .: . :..: :..:.: :.::.... . : .:: :: :: :... .::.: .:::: .: .. JN0633 GGPIVKTSGVGRVQPNNEGNLLNAIAK-QPVSVVVESKGRPFQLYKGGI-FEGPCGTK-VDHAVTAVGYG----KSGGKG 240 250 260 270 280 290 300 300 310 320 330 gi|115 YWLVKNSWGEEWGMGGYVKMAKDRRNH---CGIASAASYPTV : :.::::: :: ::... . : ::. ... 
::: JN0633 YILIKNSWGTAWGEKGYIRIKRAPGNSPGVCGLYKSSYYPTKN 310 320 330 340 --------------------------------------------------------------------------- >>JN0634 caricain (EC 3.4.22.30) II precursor - papaya (367 aa) initn: 443 init1: 252 opt: 653 Z-score: 753.0 expect() 4.1e-35 Smith-Waterman score: 653; 34.043% identity in 329 aa overlap Entrez lookup Re-search database >JN0634 12- 331: -------------------------------------------------------------------: 10 20 30 40 50 60 gi|115 MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLY-GMNEEGWRRAVWEKNMKMIE .: .. :: . : ...: ::..: ...:. .: ... :...:. JN0634 MAMIPSISKLLFVAICLFVHMSVSFGDFSIVGYSQDDLTSTERLIQLFNSWMLNHNKFYENVDEKLYRFEIFKDNLNYID 10 20 30 40 50 60 70 80 70 80 90 100 110 120 130 gi|115 LHNQEYREGKHSFTMAMNAFGDMTSEEFRQVMNGF---QNRKPRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGS :.. ..:. ...: :.:....:: . . : . . . : . . . :..::::.:: ::::..::.::: JN0634 ETNKK----NNSYRLGLNEFADLSNDEFNEKYVGSLIDATIEQSYDEEFINEDIVNLPENVDWRKKGAVTPVRHQGSCGS 90 100 110 120 130 140 150 140 150 160 170 180 190 200 210 gi|115 CWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNP-K ::::::....:: .::.:. ::::.:::: . ..::.:: ::..:: :: . . .:::.: . .:. . JN0634 CWAFSAVATVEGINKIRTGKLVELSEQELVDCE--RRSHGCKGGYPPYALEYVAKNG-IHLRSKYPYKAKQGTCRAKQVG 160 170 180 190 200 210 220 230 220 230 240 250 260 270 280 290 gi|115 YSVANDTGFVDI-PKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKY ... .: . :..: :..:.: :.::.... . : .:: :: :: :... .::.: .:::: .: .. : JN0634 GPIVKTSGVGRVQPNNEGNLLNAIAK-QPVSVVVESKGRPFQLYKGGI-FEGPCGTK-VDHAVTAVGYG----KSGGKGY 240 250 260 270 280 290 300 300 310 320 330 gi|115 WLVKNSWGEEWGMGGYVKMAKDRRNH---CGIASAASYPTV :.::::: :: ::... . : ::. ... 
:: JN0634 ILIKNSWGTAWGEKGYIRIKRAPGNSPGVCGLYKSSYYPIKNRDNGRIQIRPSSQHLTSHE 310 320 330 340 350 360 --------------------------------------------------------------------------- >>S25267 cysteine proteinase (EC 3.4.22.-) precursor - Leishmania mexicana (354 aa) initn: 464 init1: 275 opt: 647 Z-score: 746.3 expect() 9.7e-35 Smith-Waterman score: 647; 36.550% identity in 342 aa overlap Entrez lookup Re-search database >S25267 6- 331: --------------------------------------------------------------------: 10 20 30 40 50 60 gi|115 MNPTLILAAFCLG---IASATLTFDHSLE-AQWTKWKAMHNRLYGMN-EEGWRRAVWEKNMKMIELHNQEY :: . : : ::.. :. . :.. ..: :.. .: . ::: : ....::. . : . S25267 MARRNPLLFAIVVTILFVVCYGSALIAQTPPPVDNFVASAHYGSFKKRHGKAFGGDAEEGHRFNAFKQNMQTAYFLNTQ- 10 20 30 40 50 60 70 70 80 90 100 110 120 130 140 gi|115 REGKHSFTMAMNAFGDMTSEEFRQV-MNGFQNRKPRKGKVFQEPLFYE--APR---SVDWREKGYVTPVKNQGQCGSCWA . :. . . :.:.: .:: .. .: . :. : .: . . :: :::::.:: :::::::: :::::: S25267 --NPHAHYDVSGKFADLTPQEFAKLYLN--PDYYARHLKNHKEDVHVDDSAPSGVMSVDWRDKGAVTPVKNQGLCGSCWA 80 90 100 110 120 130 140 150 150 160 170 180 190 200 210 gi|115 FSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQD--NGGLDSEESYPYEA---TEESCKYNP ::: : .::: . :.::::: ::.:.. .::::::::: :...... ::.. .: :::: . :. :. . S25267 FSAIGNIEGQWAASGHSLVSLSEQMLVSCDNI--DEGCNGGLMDQAMNWIMQSHNGSVFTEASYPYTSGGGTRPPCHDEG 160 170 180 190 200 210 220 230 220 230 240 250 260 270 280 290 gi|115 KYSVANDTGFVDIPKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKY . . :. :::...:..:. . . : ::..::.:: .. .: :. : . ...::::.::.. .. . : S25267 EVG-AKITGFLSLPHDEERIAEWVEKRGPVAVAVDA--TTWQLYFGGVV--SLCLAWSLNHGVLIVGFN----KNAKPPY 240 250 260 270 280 290 300 300 310 320 330 gi|115 WLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV :.:::::: :: ::...: :.: . . ::. 
S25267 WIVKNSWGSSWGEKGYIRLAMGS-NQCMLKN---YPVSATVESPHTPHVPTTTA 310 320 330 340 350 --------------------------------------------------------------------------- >>S30149 probable cysteine proteinase precursor (clone CYP-7) - common tobacc (363 aa) initn: 495 init1: 263 opt: 641 Z-score: 739.3 expect() 2.4e-34 Smith-Waterman score: 641; 36.278% identity in 317 aa overlap Entrez lookup Re-search database >S30149 26- 329: ----------------------------------------------------------------: 10 20 30 40 50 60 gi|115 MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGMNEE-GWRRAVWEKNMKMIELHNQ : ... .:. ...:. .:: : :.. :.. .:.. S30149 LFLLSLLAFVLFSSAIAFSDEDPLIRQVVSETDDSHLLNAEHHFSLFKSKFGKIYASEEEHDHRFKVFKANLRRARLNQL 10 20 30 40 50 60 70 80 70 80 90 100 110 120 130 140 gi|115 EYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLF--YEAPRSVDWREKGYVTPVKNQGQCGSCWAFS ..:..: :.:.: :::... :... :: : .. . :.. . : . :::..: :: :::::.:::::.:: S30149 LDPSAEHGIT----KFSDLTPSEFRRTYLGLHKPKP-KLNAEKAPILPTSDLPADFDWRDHGAVTGVKNQGSCGSCWSFS 90 100 110 120 130 140 150 150 160 170 180 190 200 210 gi|115 ATGALEGQMFRKTGRLISLSEQNLVDCS---GPQGNE----GCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNP .:::.:: : ::.:.:::::.::::. :. .. ::.:: . ::.:. :::. :..::: . . .:... S30149 TTGAVEGAHFLATGELVSLSEQQLVDCDHECDPEQQDACDAGCGGGHYATAFEYTLKAGGLQLEKDYPYTGKDGKCHFDK 160 170 180 190 200 210 220 230 220 230 240 250 260 270 280 290 gi|115 KYSVANDTGFVDIPKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYG---FESTESDN . : :.: : .: . .. ::..:.:.:. . : :. : ... :::::.:::: : . . S30149 SKICAAVTNFSVIGLDEDQIAANLVKHGPLAVGINAAWMQT--YVGGVSCPLICFKRQ-DHGVLLVGYGSHGFAPIRLKE 240 250 260 270 280 290 300 310 300 310 320 330 gi|115 NKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV . ::..::::::.:: :: :. . . : ::. . 
.: S30149 KAYWIIKNSWGENWGEHGYYKICRGH-NICGVDAMVSTVTAAHTTNPNL 320 330 340 350 360 --------------------------------------------------------------------------- >>S59597 cysteine proteinase 1 precursor - maize (371 aa) initn: 526 init1: 286 opt: 636 Z-score: 733.5 expect() 5.1e-34 Smith-Waterman score: 684; 38.312% identity in 308 aa overlap Entrez lookup Re-search database >S59597 43- 329: ------------------------------------------------------------: 10 20 30 40 50 60 70 80 gi|115 PTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGMNEEGWRRAVWEKNMKMIELHNQEYREGKHSFTMAMNAFGD .:...: .:.. :.. . :. ..:. : :.: S59597 AEDPLIRQVVPGGDDNDLELNAESHFLSFVQRFGKSYKDADEHAYRLSVFKDNLRRARRHQLLDPSAEHGVT----KFSD 30 40 50 60 70 80 90 90 100 110 120 130 140 150 gi|115 MTSEEFRQVMNGFQNRKPRK------GKVFQE-PLFYE--APRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFR .: :::... :. :: :. :. .: :.. : . :::..: : ::::::.:::::.:::.::::: . S59597 LTPAEFRRTYLGL--RKSRRALLRELGESAHEAPVLPTDGLPDDFDWRDHGAVGPVKNQGSCGSCWSFSASGALEGAHYL 100 110 120 130 140 150 160 170 160 170 180 190 200 210 220 gi|115 KTGRLISLSEQNLVDC------SGPQG-NEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFV ::.: ::::..::: : :.. . ::::::: ::.:.: :::.::..::: ... .::.. . ::. .: S59597 ATGKLEVLSEQQFVDCDHECDSSEPDSCDSGCNGGLMTTAFSYLQKAGGLESEKDYPYTGSDGKCKFDKSKIVASVQNFS 180 190 200 210 220 230 240 250 230 240 250 260 270 280 290 300 gi|115 DIPKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGY---GFESTESDNNKYWLVKNSWG . .: . . ::....:.:.. . : :. :. . .:::::.::: :: . .. ::..::::: S59597 VVSVDEAQISANLIKHGPLAIGINAAYMQT--YIGGVSCPYICGRH-LDHGVLLVGYGASGFAPIRLKDKPYWIIKNSWG 260 270 280 290 300 310 320 330 310 320 330 gi|115 EEWGMGGYVKMAK--DRRNHCGIASAASYPTV :.:: .:: :. . . ::.::. 
: .: S59597 ENWGENGYYKICRGSNVRNKCGVDSMVSTVSAVHASKE 340 350 360 370 --------------------------------------------------------------------------- >>A48566 cysteine proteinase Lpcys2 (EC 3.4.22.-) - Leishmania pifanoi (444 aa) initn: 481 init1: 300 opt: 637 Z-score: 733.4 expect() 5.1e-34 Smith-Waterman score: 637; 37.037% identity in 324 aa overlap Entrez lookup Re-search database >A48566 5- 314: ----------------------------------------------------------------- : 10 20 30 40 50 60 gi|115 MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLY-GMNEEGWRRAVWEKNMKMIELHNQEYRE ..::: : . .. : . ..: ..: : . :: : : .:.:..... : . :. A48566 MATSRAALCAVAVVCVVLAAAC--APARAIHVGTPAAALFEEFKRTYGRAYETLAEEQQRLANFERNLELMREH--QARN 10 20 30 40 50 60 70 70 80 90 100 110 120 130 140 gi|115 GKHSFTMAMNAFGDMTSEEFR-QVMNG---FQNRKPRKGKVFQEPL--FYEAPRSVDWREKGYVTPVKNQGQCGSCWAFS . .: ... : :.. :: . .:: : : . .. ... . .: .::::::: :::::.:: :::::::: A48566 PHAQF--GITKFFDLSEAEFAARYLNGAAYFAAAKRHAAQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAFS 80 90 100 110 120 130 140 150 150 160 170 180 190 200 210 gi|115 ATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYV--QDNGGLDSEESYPYEATE---ESCKYNPKY :.: .::: . .:.:::::.::.:. . :.::.:::: ::... . :: : .:.:::: . . :. . . A48566 AVGNIEGQWYLAGHELVSLSEQQLVSCD--DMNDGCDGGLMLQAFDWLLQNTNGHLHTEDSYPYVSGNGYVPECSNSSEE 160 170 180 190 200 210 220 230 220 230 240 250 260 270 280 290 gi|115 SV--ANDTGFVDIPKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKY : :. : : : ..:::. .: :::..:.::. ::. :: :. : .....::::.::: . . . : A48566 LVVGAQIDGHVLIGSSEKAMAAWLAKNGPIAIALDAS--SFMSYKSGVLTA--CIGKQLNHGVLLVGYDM----TGEVPY 240 250 260 270 280 290 300 300 310 320 330 gi|115 WLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV :..::::: .:: :::... 
A48566 WVIKNSWGGDWGEQGYVRVVMGVNACLLSEYPVSAHVRESAAPGTSTSSETPAPRPVMVEQVICFDKNCTQGCRKTLIKA 310 320 330 340 350 360 370 380 --------------------------------------------------------------------------- >>KHSYO4 oil bodies-associated protein P34 precursor - soybean (379 aa) initn: 506 init1: 224 opt: 635 Z-score: 732.2 expect() 6e-34 Smith-Waterman score: 635; 37.188% identity in 320 aa overlap Entrez lookup Re-search database >KHSYO4 32- 332: --------------------------------------------------------------: 10 20 30 40 50 60 70 gi|115 MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGMNEEGWRR-AVWEKNMKMIELHNQEYREGK ::. :.:.: .:: .: ....: ..:. : . :.. KHSYO4 LLFSLLGLSSSSSISTHRSILDLDLTKFTTQKQVSSLFQLWKSEHGRVYHNHEEEAKRLEIFKNNSNYIRDMNAN-RKSP 10 20 30 40 50 60 70 80 80 90 100 110 120 130 140 gi|115 HSFTMAMNAFGDMTSEEF-RQVMNG----FQNRKPRKGKVFQEPLFYE-APRSVDWREKGYVTPVKNQGQCGSCWAFSAT :: ...: :.:.: .:: .. ... :. : . :. .: . : : :::.:: .: :: :: :: :::::: KHSYO4 HSHRLGLNKFADITPQEFSKKYLQAPKDVSQQIKMANKKMKKEQYSCDHPPASWDWRKKGVITQVKYQGGCGRGWAFSAT 90 100 110 120 130 140 150 160 150 160 170 180 190 200 210 220 gi|115 GALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTG ::.:. :: :.:::::.:::: . .:: .: . .:..: ..::. ....:::.: : :: : . .. : KHSYO4 GAIEAAHAIATGDLVSLSEQELVDCV--EESEGSYNGWQYQSFEWVLEHGGIATDDDYPYRAKEGRCKANKIQDKVTIDG 170 180 190 200 210 220 230 240 230 240 250 260 270 280 290 gi|115 FVDI--------PKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSE-DMDHGVLVVGYGFESTESDNNKY . . . :.:...:. ::::.::: ..: .: ::: .:.: ..: ::.:::: .:. : KHSYO4 YETLIMSDESTESETEQAFLSAILE-QPISVSIDA--KDFHLYTGGIYDGENCTSPYGINHFVLLVGYG----SADGVDY 250 260 270 280 290 300 310 300 310 320 330 gi|115 WLVKNSWGEEWGMGGYVKMAKDRRNH---CGIASAASYPTV :..::::: .:: ::. . .. : ::. 
::::: KHSYO4 WIAKNSWGFDWGEDGYIWIQRNTGNLLGVCGMNYFASYPTKEESETLVSARVKGHRRVDHSPL 320 330 340 350 360 370 --------------------------------------------------------------------------- >>S29245 cysteine proteinase (EC 3.4.22.-) precursor - Leishmania mexicana (443 aa) initn: 411 init1: 317 opt: 633 Z-score: 728.9 expect() 9.1e-34 Smith-Waterman score: 633; 37.152% identity in 323 aa overlap Entrez lookup Re-search database >S29245 5- 314: ----------------------------------------------------------------- : 10 20 30 40 50 60 gi|115 MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLY-GMNEEGWRRAVWEKNMKMIELHNQEYRE ..::: : . .. : . ..: ..: : . :: : : .:.:..... : . :. S29245 MATSRAALCAVAVVCVVLAAAC--APARAIHVGTPAAALFEEFKRTYGRAYETLAEEQQRLANFERNLELMREH--QARN 10 20 30 40 50 60 70 70 80 90 100 110 120 130 140 gi|115 GKHSFTMAMNAFGDMTSEEFR-QVMNG---FQNRKPRKGKVFQEPL--FYEAPRSVDWREKGYVTPVKNQGQCGSCWAFS . .: ... : :.. :: . .:: : : . .. ... . .: .::::::: :::::.:: :::::::: S29245 PHAQF--GITKFFDLSEAEFAARYLNGAAYFAAAKRHAAQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAFS 80 90 100 110 120 130 140 150 150 160 170 180 190 200 210 gi|115 ATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYV-QD-NGGLDSEESYPYEATE---ESCKYNPKY :.: .::: . .:.:::::.::.:. . ..::.:::: ::... :. :: : .:.:::: . . :. . . S29245 AVGNIEGQWYLAGHELVSLSEQQLVSCD--DMDNGCSGGLMLQAFDWLLQNTNGHLHTEDSYPYVSGNGYVPECSNSSEL 160 170 180 190 200 210 220 230 220 230 240 250 260 270 280 290 gi|115 SV-ANDTGFVDIPKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYW : :. : : : ..:::. .: :::..:.::. ::. :: :. : .....::::.::: . . . :: S29245 VVGAQIDGHVLIGSSEKAMAAWLAKNGPIAIALDAS--SFMSYKSGVLTA--CIGKQLNHGVLLVGYDM----TGEVPYW 240 250 260 270 280 290 300 300 310 320 330 gi|115 LVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV ..::::: .:: :::... 
S29245 VIKNSWGGDWGEQGYVRVVMGVNACLLSEYPVSAHVRESAAPGTSTSSETPAPRPVVVEQVICFDKNCRRGCRKTLIKAN 310 320 330 340 350 360 370 380 --------------------------------------------------------------------------- >>S04222 chymopapain (EC 3.4.22.6) - papaya (218 aa) initn: 473 init1: 235 opt: 589 Z-score: 683.2 expect() 3.2e-31 Smith-Waterman score: 589; 41.778% identity in 225 aa overlap Entrez lookup Re-search database >S04222 115- 331: ---------------------------------------------: 80 90 100 110 120 130 140 150 gi|115 MAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRK :.:.::: :: :::::::: ::::::::. ...:: S04222 YPQSIDWRAKGAVTPVKNQGACGSCWAFSTIATVEGINKIV 10 20 30 40 160 170 180 190 200 210 220 230 gi|115 TGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESC----KYNPKYSVANDTGFVDIPK :: :. ::::.::::. . . ::.:: . ..::: .:: . . . :::.: . .: : .:: .. ::. .:. S04222 TGNLLELSEQELVDCD--KHSYGCKGGYQTTSLQYVANNG-VHTSKVYPYQAKQYKCRATDKPGPKVKI---TGYKRVPS 50 60 70 80 90 100 110 240 250 260 270 280 290 300 gi|115 Q-EKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMG . : ... :.:. :.:: ..:: . : .:: :. :. :... .::.: .:::: ::...: ..::::: .:: S04222 NCETSFLGALAN-QPLSVLVEAGGKPFQLYKSGV-FDGPCGTK-LDHAVTAVGYG----TSDGKNYIIIKNSWGPNWGEK 120 130 140 150 160 170 180 310 320 330 gi|115 GYVKMAKDRRNH---CGIASAASYPTV ::... .. : ::. ... :: S04222 GYMRLKRQSGNSQGTCGVYKSSYYPFKGFA 190 200 210 --------------------------------------------------------------------------- >>PPPA papain (EC 3.4.22.2) precursor - papaya (345 aa) initn: 394 init1: 242 opt: 559 Z-score: 645.9 expect() 3.8e-29 Smith-Waterman score: 647; 34.545% identity in 330 aa overlap Entrez lookup Re-search database >PPPA 12- 331: -------------------------------------------------------------------: 10 20 30 40 50 60 gi|115 MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLY-GMNEEGWRRAVWEKNMKMIE .: .. :: . : . .: ::..: ...:. .: ... :.:.:. 
PPPA MAMIPSISKLLFVAICLFVYMGLSFGDFSIVGYSQNDLTSTERLIQLFESWMLKHNKIYKNIDEKIYRFEIFKDNLKYID 10 20 30 40 50 60 70 80 70 80 90 100 110 120 130 gi|115 LHNQEYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEA----PRSVDWREKGYVTPVKNQGQCG :.. ..:. ...:.:.::...::.. ..: . .. : .. .. :. ::::.:: ::::::::.:: PPPA ETNKK----NNSYWLGLNVFADMSNDEFKEKYTGSIAGNYTTTELSYEEVLNDGDVNIPEYVDWRQKGAVTPVKNQGSCG 90 100 110 120 130 140 150 140 150 160 170 180 190 200 210 gi|115 SCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPK :::::::. ..:: . .:: : :::.:.::. . . ::::: :.: : . : . ...::::.... :. : PPPA SCWAFSAVVTIEGIIKIRTGNLNEYSEQELLDCD--RRSYGCNGGYPWSALQLVAQYG-IHYRNTYPYEGVQRYCRSREK 160 170 180 190 200 210 220 230 220 230 240 250 260 270 280 290 gi|115 YSVANDT-GFVDI-PKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNK : : : .. : .: ::. ..:. :.::...:. ..: .:. ::. : :... .::.: .:::: . PPPA GPYAAKTDGVRQVQPYNEGALLYSIAN-QPVSVVLEAAGKDFQLYRGGIFVGP-CGNK-VDHAVAAVGYG--------PN 240 250 260 270 280 290 300 300 310 320 330 gi|115 YWLVKNSWGEEWGMGGYVKMAKDRRNH---CGIASAASYPTV : :.::::: :: .::... . : ::. ... ::. PPPA YILIKNSWGTGWGENGYIRIKRGTGNSYGVCGLYTSSFYPVKN 310 320 330 340 --------------------------------------------------------------------------- >>S06837 glycyl endopeptidase (EC 3.4.22.25) - papaya (216 aa) initn: 420 init1: 240 opt: 552 Z-score: 640.9 expect() 7.2e-29 Smith-Waterman score: 552; 42.411% identity in 224 aa overlap Entrez lookup Re-search database >S06837 115- 331: ---------------------------------------------: 80 90 100 110 120 130 140 150 gi|115 MAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRK :.::::: :: :::::.:: : ::::::.....:: : S06837 LPESVDWRAKGAVTPVKHQGYCESCWAFSTVATVEGINKIK 10 20 30 40 160 170 180 190 200 210 220 230 gi|115 TGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYN----PKYSVANDTGFVDIPK :: :. ::::.::::. . ::: : .. ..::: .:: . . .::: : ...:. : :: .. : .: :. . 
S06837 TGNLVELSEQELVDCD--LQSYGCNRGYQSTSLQYVAQNG-IHLRAKYPYIAKQQTCRANQVGGPKVKT-NGVGRVQ-SN 50 60 70 80 90 100 110 240 250 260 270 280 290 300 310 gi|115 QEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGG .: .:..:.: :.::..... ..: :: :: :: .:... .::.: .:::: .: .. : :.::::: :: .: S06837 NEGSLLNAIAH-QPVSVVVESAGRDFQNYKGGI-FEGSCGTK-VDHAVTAVGYG----KSGGKGYILIKNSWGPGWGENG 120 130 140 150 160 170 180 320 330 gi|115 YVKMAKDRRNH---CGIASAASYPTV :... . : ::. .. :: S06837 YIRIRRASGNSPGVCGVYRSSYYPIKN 190 200 210 --------------------------------------------------------------------------- >>S68783 cathepsin L (EC 3.4.22.15) precursor - Paramecium tetraurelia (SGC5) (314 aa) initn: 430 init1: 313 opt: 545 Z-score: 630.5 expect() 2.8e-28 Smith-Waterman score: 651; 37.785% identity in 307 aa overlap Entrez lookup Re-search database >S68783 29- 331: ---------------------------------------------------------------: 10 20 30 40 50 60 70 gi|115 MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLY-GMNEEGWRRAVWEKNMKMIELHNQEYREGKHSFTMAMNA ...:: .:: : .. .: .: :. :...:. . .:. .::. .: S68783 MMLLGASLYLNNTQEVSDEIDTANLYANWKMKYNRRYTNQRDEMYRYKVFTDNLNYIRAFYESPEEA--TFTLELNQ 10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 gi|115 FGDMTSEEFRQVMNGFQNRKPRKGKV-FQEPLFYEAPRSVDWREKGYVT-P-VKNQGQCGSCWAFSATGALEGQMFRKTG :.::...:: :.. .. . :: .:. . : ::: .. : : :::::.:::::::::.:::: . . . S68783 FADMSQQEFAQTYLSL--KVPRTAKLNAANSNFQYKGAEVDWTDNKKVKYPAVKNQGSCGSCWAFSAVGALEINTDIELN 80 90 100 110 120 130 140 150 160 170 180 190 200 210 220 230 gi|115 RLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPKQEKALM : ::::.::::::: :.::::: :: ::.:: ::: : ..::: : . .:: . : .. :: :: . .. : S68783 RKYELSEQDLVDCSGPYDNDGCNGGWMDSAFEYVADNG-LAEAKDYPYTAKDGTCKTSVKRPYTHVQGFKDIDSCDE-LA 160 170 180 190 200 210 220 230 240 250 260 270 280 290 300 310 gi|115 KAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMAK ... ..::.::. . ::. :. . :. ....:::..:: ...:. 
: ..:::: :: .:....: S68783 QTIQER-TVAVAVDAN--PWQFYRSGVLSK--CT-KNLNHGVVLVG-----VQADGA--WKIRNSWGSSWGEAGHIRLAG 240 250 260 270 280 290 320 330 gi|115 DRRNHCGIASAASYPTV . ::: .: :.: S68783 G--DTCGICAAPSFPILG 300 310 --------------------------------------------------------------------------- >>S41425 cysteine proteinase (EC 3.4.22.-) CP3 precursor - Trichomonas vagina (278 aa) initn: 391 init1: 262 opt: 528 Z-score: 611.8 expect() 3e-27 Smith-Waterman score: 533; 34.146% identity in 287 aa overlap Entrez lookup Re-search database >S41425 6- 287: -----------------------------------------------------------: 10 20 30 40 50 60 70 80 gi|115 MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGMNEEGWRRAVWEKNMKMIELHNQEYREGKHSFTMAMNAF ...:: .. ::. : ..: : . : : .. .: .: .:.. : . .. ::. .. .. ..:: . S41425 MFSAF-FATASSKLFLQHE-EKAFLDWMRSTNNMFVGDEYHFRLGVYNTNKRRVQEHNR----ANSGYQLTMNHL 10 20 30 40 50 60 90 100 110 120 130 140 150 gi|115 GDMTSEEFRQVMNGFQNRKPR-KGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLI . :: :.. ... :..: . ..:.:. ..: .::::. :.:.:.:.:::::::::.. . :.: : :.:. S41425 SCMTPSEYKVLLGHKQTKKIEGEAKIFKG----DVPDAVDWRNAKIVNPIKDQAQCGSCWAFSVVQVQESQWALKKGQLL 70 80 90 100 110 120 130 140 160 170 180 190 200 210 220 230 gi|115 SLSEQNLVDCSGPQGNEGCNGGLMDYAFQYV--QDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFV--DIPKQEKAL ::.:::.::: ::.:: :..:: ...: : .::: : . :::.. .:. ..: ..: : S41425 SLAEQNMVDCVDT--CYGCDGGDEYLAYDYVIKHQKGLWMLETDYPYTARDGSCKFKAAKGVTLTKSYVRPTTTQNEDEL 150 160 170 180 190 200 210 220 240 250 260 270 280 290 300 310 gi|115 MKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMA . : : .:.::::. .: .:. ::: .::: .::.: .:::: :. 
S41425 KAGCAKGGVVSIAIDASGYDFQLYSSGIYNPKSCSSTFLDHAVGLVGYGTENKVD 230 240 250 260 270 320 330 gi|115 KDRRNHCGIASAASYPTV --------------------------------------------------------------------------- >>A44938 cysteine proteinase (EC 3.4.22.-) - Trypanosoma cruzi (fragment) (183 aa) initn: 332 init1: 283 opt: 523 Z-score: 608.8 expect() 4.4e-27 Smith-Waterman score: 523; 44.103% identity in 195 aa overlap Entrez lookup Re-search database >A44938 114- 302: ---------------------------------------: 80 90 100 110 120 130 140 150 gi|115 TMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFR :: .:::: .: :: ::.:::::::::::: : . :: : A44938 APAAVDWRARGAVTAVKDQGQCGSCWAFSAIGNVSGQWFL 10 20 30 40 160 170 180 190 200 210 220 gi|115 KTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYV--QDNGGLDSEESYPY---EATEESCKYNPKYSVANDTGFVDI : .:::: ::.:. . . ::.::::. ::... ..:::. .:.:::: :. : . . :. :: :.. A44938 AGHPLTNLSEQMLVSCD--KTDSGCSGGLMNNAFEWIVQENNGGVYTEDSYPYASGEGISPPCTTSGHTVGATITGHVEL 50 60 70 80 90 100 110 230 240 250 260 270 280 290 300 gi|115 PKQEKALMKAVATVGPISVAIDAGHES-FLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWG :..: . .:. ::..:: : : .. : :.. .: ::..:::.:.:::. .: ::.::::: A44938 PQDEAQIAAWLAVNGPVAVA----HASSWMTYTGGVM--TSCVSEQLDHGLLLVGYN----DSAAVPYWIVKNSW 120 130 140 150 160 170 180 310 320 330 gi|115 MGGYVKMAKDRRNHCGIASAASYPTV --------------------------------------------------------------------------- >>S46476 cysteine proteinase III - mountain papaya (214 aa) initn: 468 init1: 234 opt: 488 Z-score: 567.8 expect() 8.6e-25 Smith-Waterman score: 573; 41.704% identity in 223 aa overlap Entrez lookup Re-search database >S46476 115- 331: ---------------------------------------------: 80 90 100 110 120 130 140 150 gi|115 MAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRK :.:.:::.:: ::::::::.::::::::. 
...:: S46476 YPESIDWRKKGAVTPVKNQGSCGSCWAFSTIATVEGINKIV 10 20 30 40 160 170 180 190 200 210 220 230 gi|115 TGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKY-NPKYSVANDTGFVDIPKQEK : : :::::.::::. . ..::.:: . ...:: :.: . .:. :::: . .:. . : ... .:. .:.... S46476 HGNLTSLSEQELVDCD--RRSHGCKGGYQTTSLKYVVDHG-VHTEKEYPYEEKQYKCRAKDKKPPIVKISGYKKVPSNDE 50 60 70 80 90 100 110 240 250 260 270 280 290 300 310 gi|115 -ALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYV .:.::.: :.:: ... ..: :::.::. : :... .::.: .:::: . : :.::::: :: ::. S46476 ISLIKAIAK-QPVSVLVESKGKAFQFYKKGIFGGP-CGTK-VDHAVTAVGYG--------KDYILIKNSWGPXWGEXGYI 120 130 140 150 160 170 180 320 330 gi|115 KMAKDRRNHC----GIASAASYPTV :. : .:: :: ... .:. S46476 KI-KRASGHCEGICGIYKSSYFPAEGYR 190 200 210 --------------------------------------------------------------------------- >>S62736 cathepsin-like cysteine proteinase (EC 3.4.22.-) - Autographa califo (323 aa) initn: 432 init1: 209 opt: 485 Z-score: 561.7 expect() 1.9e-24 Smith-Waterman score: 577; 36.054% identity in 294 aa overlap Entrez lookup Re-search database >S62736 37- 324: ------------------------------------------------------------ : 10 20 30 40 50 60 70 gi|115 MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGMNEEGWRR-AVWEKNMKMIELHNQEYREGKHSFTMAMNA :. :: . : :: ....:.. : .::. : . .: S62736 MNKILFYLFVYGVVNSAAYDLLKAPNYFEEFVHRFNKDYGSEVEKLRRFKIFQHNLNEIINKNQN-----DSAKYEINK 10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 gi|115 FGDMTSEEFRQVMNGF----QNRKPRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKT :.:....: ..:. :... : :...: ..: :::. . :: :::::.::.::::.. ..::.:. : S62736 FSDLSKDETIAKYTGLSLPIQTQNFCKVIVLDQPP-GKGPLEFDWRRLNKVTSVKNQGMCGACWAFATLASLESQFAIKH 80 90 100 110 120 130 140 150 160 170 180 190 200 210 220 230 gi|115 GRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNP-KYSVANDTGFVDIPKQEKA ..::.::::...::. . ::::::. ::. . ::.. : .::::: ...:..: :. : . : :. 
S62736 NQLINLSEQQMIDCDFV--DAGCNGGLLHTAFEAIIKMGGVQLESDYPYEADNNNCRMNSNKFLVQVKDCYRYITVYEEK 160 170 180 190 200 210 220 230 240 250 260 270 280 290 300 310 gi|115 LMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKM : . :::: .::::. ... ::.:: : . ..:.::.:::: : .: :: ::.:: .:: :. .. S62736 LKDLLRLVGPIPMAIDAA--DIVNYKQGII--KYCFNSGLNHAVLLVGYGVE----NNIPYWTFKNTWGTDWGEDGFFRV 240 250 260 270 280 290 300 320 330 gi|115 AKDRRNHCGIASAASYPTV .. : ::. S62736 QQNI-NACGMRNELASTAVIY 310 320 --------------------------------------------------------------------------- >>A55090 cathepsin O (EC 3.4.-.-) precursor - human (321 aa) initn: 406 init1: 188 opt: 484 Z-score: 560.6 expect() 2.2e-24 Smith-Waterman score: 484; 35.793% identity in 271 aa overlap Entrez lookup Re-search database >A55090 68- 329: -------------------------------------------------------: 30 40 50 60 70 80 90 100 gi|115 QWTKWKAMHNRLYGMNEEGWRRAVWEKNMKMIELHNQEYREGKHSFTMAMNAFGDMTSEEFRQV-MNGFQNRKPR-KGKV :.. .: ...: :. . :::. . . . .. :: ...: A55090 GDADSRAPFTPTWPRSREREAAAFRESLNRHRYLNSLFPSENSTAF-YGINQFSYLFPEEFKAIYLRSKPSKFPRYSAEV 30 40 50 60 70 80 90 110 120 130 140 150 160 170 180 gi|115 FQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDY . : :::.: :: :.:: .::.:::::..::.:. . : : .:: :...::: .: ::::: A55090 HMSIPNVSLPLRFDWRDKQVVTQVRNQQMCGGCWAFSVVGAVESAYAIKGKPLEDLSVQQVIDCS--YNNYGCNGGSTLN 100 110 120 130 140 150 160 170 190 200 210 220 230 240 250 260 gi|115 AFQYVQDNG-GLDSEESYPYEATEESCKY----NPKYSVANDTGFVDIPKQEKALMKAVATVGPISVAIDAGHESFLFYK :..... : .. ::..: . :.: . .:. . ... :. :: . ::. : ::. : .:: :. : A55090 ALNWLNKMQVKLVKDSEYPFKAQNGLCHYFSGSHSGFSIKGYSAY-DFSDQEDEMAKALLTFGPLVVIVDA--VSWQDYL 180 190 200 210 220 230 240 250 270 280 290 300 310 320 330 gi|115 EGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGY--VKMAKDRRNHCGIASAASYPTV :: .. ::: . .:.::..: :..: : . ::.:.:::: ::. :: :::.. 
: ::::...: A55090 GGI-IQHHCSSGEANHAVLITG--FDKTGS--TPYWIVRNSWGSSWGVDGYAHVKMGS---NVCGIADSVSSIFV 260 270 280 290 300 310 320 --------------------------------------------------------------------------- >>S62735 cathepsin - Choristoneura fumiferana nuclear polyhedrosis virus (324 aa) initn: 471 init1: 215 opt: 482 Z-score: 558.2 expect() 2.9e-24 Smith-Waterman score: 578; 34.915% identity in 295 aa overlap Entrez lookup Re-search database >S62735 37- 324: ------------------------------------------------------------ : 10 20 30 40 50 60 70 gi|115 MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGMNEEGWRR-AVWEKNMKMIELHNQEYREGKHSFTMAMNA :. :. . : :: ....:.. :. :... .. .. . : S62735 MNKIVLYLLVYGAVQCAAYDVLKAPNYFEDFLHKFNKSYSSESEKLRRFQIFRHNLE--EIINKNHNDSTAQYEI--NK 10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 gi|115 FGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYE-----APRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRK :.:....: . ..:.. : . . : : . . .: :::. . :: :::::.::.::::.. :.::.:. : S62735 FADLSKDETISKYTGLS--LPLQTQNFCEVVVLDRPPDKGPLEFDWRRLNKVTSVKNQGMCGACWAFATLGSLESQFAIK 80 90 100 110 120 130 140 150 160 170 180 190 200 210 220 230 gi|115 TGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNP-KYSVANDTGFVDIPKQEK ...:.::::.:.::. . ::.:::. ::. :.. ::...: .:::::.. .:. : :. : . : :. S62735 HNQFINLSEQQLIDCDFV--DAGCDGGLLHTAFEAVMNMGGIQAESDYPYEANNGDCRANAAKFVVKVKKCYRYITVFEE 160 170 180 190 200 210 220 230 240 250 260 270 280 290 300 310 gi|115 ALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVK : . .:::: :::::. ... ::.::. :... ..:.::.:::. : .. .:..::.:: .:: :: . S62735 KLKDLLRSVGPIPVAIDAS--DIVNYKRGIM--KYCANHGLNHAVLLVGYAVE----NGVPFWILKNTWGADWGEQGYFR 240 250 260 270 280 290 300 320 330 gi|115 MAKDRRNHCGIASAASYPTV . .. 
: ::: S62735 VQQNI-NACGIQNELPSSAEIY 310 320 --------------------------------------------------------------------------- >>JC5691 cysteine proteinase (EC 3.4.-.-) - Bombyx mori nuclear polyhedrosis (323 aa) initn: 439 init1: 216 opt: 479 Z-score: 554.8 expect() 4.5e-24 Smith-Waterman score: 571; 35.254% identity in 295 aa overlap Entrez lookup Re-search database >JC5691 37- 324: ------------------------------------------------------------ : 10 20 30 40 50 60 70 gi|115 MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGMNEEGWRR-AVWEKNMKMIELHNQEYREGKHSFTMAMNA :. :. . : :: ....:.. : .::. : . .: JC5691 MNKILFYLFVYAVVKSAAYDPLKAPNYFEEFVHRFNKNYSSEVEKLRRFKIFQHNLNEIINKNQN-----DSAKYEINK 10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 gi|115 FGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYE-----APRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRK :.:....: ..:.. : . . : . .. . .: :::. . :: :::::.::.::::.. :.::.:. : JC5691 FSDLSKDETIAKYTGLS--LPTQTQNFCKVILLDQPPGKGPLEFDWRRLNKVTSVKNQGMCGACWAFATLGSLESQFAIK 80 90 100 110 120 130 140 150 160 170 180 190 200 210 220 230 gi|115 TGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNP-KYSVANDTGFVDIPKQEK ..::.::::...::. . ::::::. ::. . ::.. : .::::: ...:..: :. : . : :. JC5691 HNELINLSEQQMIDCDFV--DAGCNGGLLHTAFEAIIKMGGVQLESDYPYEADNNNCRMNSNKFLVQVKDCYRYIIVYEE 160 170 180 190 200 210 220 230 240 250 260 270 280 290 300 310 gi|115 ALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVK : . :::: .::::. ... ::.:: : . ..:.::.:::: : .: :: ::.:: .:: :. . JC5691 KLKDLLPLVGPIPMAIDAA--DIVNYKQGII--KYCFDSGLNHAVLLVGYGVE----NNIPYWTFKNTWGTDWGEDGFFR 240 250 260 270 280 290 300 320 330 gi|115 MAKDRRNHCGIASAASYPTV . .. : ::. 
JC5691 VQQNI-NACGMRNELASTAVIY 310 320 --------------------------------------------------------------------------- >>C44938 cysteine proteinase (EC 3.4.22.-) - Entamoeba histolytica (fragment) (165 aa) initn: 347 init1: 176 opt: 465 Z-score: 543.2 expect() 2e-23 Smith-Waterman score: 465; 43.353% identity in 173 aa overlap Entrez lookup Re-search database >C44938 132- 302: -----------------------------------: 100 110 120 130 140 150 160 170 gi|115 MNGFQNRKPRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSG ::::::::.: .:..:::.. . :.: :.:::.::::.. C44938 QGQCGSCWTFCTTAVLEGRVNKDLGKLYSFSEQQLVDCDA 10 20 30 40 180 190 200 210 220 230 240 250 gi|115 PQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPK-QEKALMKAVATVGPISVAID ...::. : . .....:.:.:: : .:::.:. .:: .::. :: . .: .:. .: ::..:..: C44938 --SDNGCERGHPSNSLKFIQENNGLGLESDYPYKAVAGTCKKVK--NVATVTGSRRVTDGSETGLQTIIAENGPVAVGMD 50 60 70 80 90 100 110 260 270 280 290 300 310 320 gi|115 AGHESFLFYKEG-IYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAAS :.. :: .::.: :: . : :. :.: : .:::: .: :.:::.::::: C44938 ASRPSFQLYKKGTIYSDTKCRSRMMNHCVTAVGYGSNS----NGKYWIVKNSW 120 130 140 150 160 330 gi|115 YPTV --------------------------------------------------------------------------- >>A47306 cysteine proteinase - Tetrahymena thermophila (SGC5) (336 aa) initn: 393 init1: 221 opt: 458 Z-score: 530.5 expect() 1e-22 Smith-Waterman score: 556; 31.124% identity in 347 aa overlap Entrez lookup Re-search database >A47306 1- 327:---------------------------------------------------------------------: 10 20 30 40 50 60 70 gi|115 MNPTLILAAFCLGIASATLTFDHSLEA--QWTKWKAMHNRLY-GMNEEGWRRAVWEKNMKMIELHNQEYREGKHSFTMAM :: .:. .. . . :. : :.: ..::.....: : . .:. .:. :. .:.. :. ::.. ...... . 
A47306 MNKKFIILSIIM-LMPLCLAQDISVEKLLAYNKWSSQNQRAYLNEDEKLYRQIVFFENLQKIKEHNSN---PNNTYSIHL 10 20 30 40 50 60 70 80 90 100 110 120 130 140 gi|115 NAFGDMTSEEF-------RQVMNGFQN---RKPRKGKVFQEPLF----YEAPRSVDWREKGYVTPVKNQGQCGSCWAFSA : :.::: ::: ....: ... .. .... .: . . :.::: :: :: ::.::::::::.::: A47306 NQFSDMTREEFAEKILMKQDLINDYMKGIGQQATHNNANNETQMNSQNHTLAASIDWRTKGAVTSVKDQGQCGSCWSFSA 80 90 100 110 120 130 140 150 150 160 170 180 190 200 210 220 gi|115 TGALEGQMFRKTGRLISLSEQNLVDCSGPQG---NEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVA .. .:. : .. :...:::.:::: :.. . ::.:: ..:.. : . . ..::: :....: . . A47306 AALMESFNFIQNKALVNFSEQQLVDCVTPENGYPSYGCKGGWPATCLDYASKVG-ITTLDKYPYVAVQKNCTVTGTNNGF 160 170 180 190 200 210 220 230 230 240 250 260 270 280 290 300 gi|115 NDTGFVDIPKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKN . .. ::. . : :.. . .:.:: .:: .. .:. ::. . .. ...:.::.::: :.:: :.::: A47306 KLKKWIVIPNTSNDL-KSALNFSPVSVLVDA--TNWDYYSSGIFNGCNQTNINLNHAVLAVGYD----EKDN---WIVKN 240 250 260 270 280 290 300 310 320 330 gi|115 SWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV ::. :: ::...: . : ::: :. A47306 SWSAGWGEHGYIRLAPN--NTCGILSSNIQVTA 310 320 330 --------------------------------------------------------------------------- >>B44938 cysteine proteinase (EC 3.4.22.-) - Trypanosoma brucei (fragment) (166 aa) initn: 272 init1: 165 opt: 446 Z-score: 521.4 expect() 3.3e-22 Smith-Waterman score: 446; 44.633% identity in 177 aa overlap Entrez lookup Re-search database >B44938 132- 302: -----------------------------------: 100 110 120 130 140 150 160 170 gi|115 MNGFQNRKPRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSG :::::::::::. : .::: . :.::::: :: :. B44938 QGQCGSCWAFSTIGNIEGQWQVAGNPLVSLSEQILVYCD- 10 20 30 180 190 200 210 220 230 240 gi|115 PQGNEGCNGGLMDYAFQY-VQDNGG-LDSEESYPY---EATEESCKYNPKYSVANDTGFVDIPKQEKALMKAVATVGPIS : ::.::::: ::.. :..::: . .: :::: .. . .:..: . : : ::.:..: :. .: :.. 
B44938 PLI--GCGGGLMDNAFNWIVNSNGGNVFTEASYPYVSGNGEQPQCQMNGHEIGAAITDHVDLPQDEDAIAAYLAENRPLA 40 50 60 70 80 90 100 110 250 260 270 280 290 300 310 320 gi|115 VAIDAGHESFLFY-KEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIA .:..: . :: ..: :. .:.::..:::::.:::. ...: ::.::::: B44938 IAVEAPQ----FYGHNGGYILTSCTSEQLDHGVLLVGYN----DNSNPPYWIVKNSW 120 130 140 150 160 330 gi|115 SAASYPTV --------------------------------------------------------------------------- >>S03964 stem bromelain (EC 3.4.22.32) - pineapple (212 aa) initn: 431 init1: 188 opt: 436 Z-score: 508.4 expect() 1.7e-21 Smith-Waterman score: 525; 39.013% identity in 223 aa overlap Entrez lookup Re-search database >S03964 115- 333: ---------------------------------------------: 80 90 100 110 120 130 140 150 gi|115 MAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRK :.:.:::. : :: ::::. ::.::::.: ...:. . : S03964 AVPQSIDWRDYGAVTSVKNQNPCGACWAFAAIATVESIYKIK 10 20 30 40 160 170 180 190 200 210 220 230 gi|115 TGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPKQ-EK : : ::::...::. .: ::.:: ::... .: :. : :::.:.. .:: . . : ::.. .:.. :. S03964 KGILEPLSEQQVLDCA--KGY-GCKGGWEFRAFEFIISNKGVASGAIYPYKAAKGTCKTDGVPNSAYITGYARVPRNNES 50 60 70 80 90 100 110 240 250 260 270 280 290 300 310 gi|115 ALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVK ..: ::. ::.::.:: . .: .:: :.. : :.. ...:.: ..::: .: . ..:: .:: .::.. S03964 SMMYAVSK-QPITVAVDA-NANFQYYKSGVFNGP-CGT-SLNHAVTAIGYGQDSI--------IYPKKWGAKWGEAGYIR 120 130 140 150 160 170 180 320 330 gi|115 MAKDRRNH---CGIASAASYPTV ::.: . :::: :::. 
S03964 MARDVSSSSGICGIAIDPLYPTLEE 190 200 210 --------------------------------------------------------------------------- >>S27044 papain-like protein - Autographa californica nuclear polyhedrosis vi (208 aa) initn: 424 init1: 201 opt: 433 Z-score: 505.1 expect() 2.7e-21 Smith-Waterman score: 525; 40.385% identity in 208 aa overlap Entrez lookup Re-search database >S27044 118- 324: ------------------------------------------- : 80 90 100 110 120 130 140 150 gi|115 NAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGR . ::. . :: :::::.::.::::.. ..::.:. : .. S27044 IHWRRLNKVTSVKNQGMCGACWAFATLASLESQFAIKHNQ 10 20 30 40 160 170 180 190 200 210 220 230 gi|115 LISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNP-KYSVANDTGFVDIPKQEKALM ::.::::...::. . ::::::. ::. . ::.. : .::::: ...:..: :. : . : :. : S27044 LINLSEQQMIDCDFV--DAGCNGGLLHTAFEAIIKMGGVQLESDYPYEADNNNCRMNSNKFLVQVKDCYRYITVYEEKLK 50 60 70 80 90 100 110 240 250 260 270 280 290 300 310 gi|115 KAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMAK . :::: .::::. ... ::.:: : . ..:.::.:::: : .: :: ::.:: .:: :. .. . S27044 DLLRLVGPIPMAIDAA--DIVNYKQGII--KYCFNSGLNHAVLLVGYGVE----NNIPYWTFKNTWGTDWGEDGFFRVQQ 120 130 140 150 160 170 180 190 320 330 gi|115 DRRNHCGIASAASYPTV . : ::. S27044 NI-NACGMRNELASTAVIY 200 --------------------------------------------------------------------------- >>A61500 allergen Der f I precursor - house-dust mite (Dermatophagoides farin (319 aa) initn: 260 init1: 182 opt: 429 Z-score: 497.7 expect() 6.9e-21 Smith-Waterman score: 429; 29.393% identity in 313 aa overlap Entrez lookup Re-search database >A61500 10- 311: --------------------------------------------------------------- : 10 20 30 40 50 60 70 gi|115 MNPTLILAAFCLGIASATLTFDHSLEA--QWTKWKAMHNRLYG-MNEEGWRRAVWEKNMKMIELHNQEYREGKHSFTMAM : :.::: . .. : . ..: :. :. ..:: : . ...:..: .. .: ... 
A61500 MKFVLAIASLLVLTVYARPASIKTFEFKKAFNKNYATVEEEEVARKNFLESLKYVEANKGAI---NHLSDLSL 10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 gi|115 NAFGD---MTSEEFRQVMNGFQ-NRKPRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFR . : . :..: :.:. . :. : . .. . ..: .: : :::.. :: ::::::::...: :. .. A61500 DEFKNRYLMSAEAFEQLKTQFDLNAETSACRINS----VNVPSELDLRSLRTVTPIRMQGGCGSCWAFSGVAATESAYLA 80 90 100 110 120 130 140 160 170 180 190 200 210 220 gi|115 KTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKY--NPKYSVANDTGFV--DIP . ..::::.::::.. .: :.: . ...:.:.:: .. :.:::: : :. :. . .:...: . :. A61500 YRNTSLDLSEQELVDCASQHG---CHGDTIPRGIEYIQQNGVVE-ERSYPYVAREQRCRRPNSQHYGISNYCQIYPPDVK 150 160 170 180 190 200 210 220 230 240 250 260 270 280 290 300 gi|115 KQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMG . ..:: .. .... : : ..: : .. : . . :.: .:::: ::..:. ::.:.::: :: . A61500 QIREALTQTHTAIAVIIGIKDL--RAFQHYDGRTIIQHDNGYQPNYHAVNIVGYG--STQGDD--YWIVRNSWDTTWGDS 230 240 250 260 270 280 290 310 320 330 gi|115 GYVKMAKDRRNHCGIASAASYPTV :: A61500 GYGYFQAGNNLMMIEQYPYVVIM 300 310 --------------------------------------------------------------------------- >>S57422 cysteine proteinase (EC 3.4.22.-) 8 - Tritrichomonas foetus (fragmen (152 aa) initn: 288 init1: 153 opt: 412 Z-score: 483.1 expect() 4.5e-20 Smith-Waterman score: 412; 46.053% identity in 152 aa overlap Entrez lookup Re-search database >S57422 140- 288: --------------------------------: 100 110 120 130 140 150 160 170 gi|115 PRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCN :::: : :. :.:.: :::::::: . ::: S57422 AFSAIQAAESVNCIKSGKLERYSEQNLVDCV--TACYGCN 10 20 30 180 190 200 210 220 230 240 250 gi|115 GGLMDYAFQYVQD--NGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPKQ-EKALMKAVATVGPISVAIDAGHESF ::::: ...:. : :: :. : .::: :.. .::: ::. : .:.. .. : : : : ::..:::::.. 
:: S57422 GGLMDASYEYIIDSQNGHLNLEADYPYTAVDGTCKYAQYTPVASITKYVNVNQNDEDDLAAKVETYGPVAVAIDASNWSF 40 50 60 70 80 90 100 110 260 270 280 290 300 310 320 330 gi|115 LFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV .: :.: ::.:: ..:::: ::.: :.. S57422 QLYTGGVYDEPSCSPYSLDHGVGCVGFGAEGSTK 120 130 140 150 --------------------------------------------------------------------------- >>A45624 trophozoite cysteine proteinase - Plasmodium falciparum (569 aa) initn: 378 init1: 226 opt: 413 Z-score: 475.6 expect() 1.2e-19 Smith-Waterman score: 517; 31.373% identity in 357 aa overlap Entrez lookup Re-search database >A45624 27- 331: ----------------------------------------------------------------: 10 20 30 40 50 60 gi|115 MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGMNEEGWRR-AVWEKNMKMIELHNQE ... :. ::..: .: :. ... :. :. ::. A45624 DDKKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHNKL 190 200 210 220 230 240 250 260 70 80 90 100 110 120 gi|115 YREGKHSFTMAMNAFGDMTSEEFRQ-------VMNGFQNR--KP---------------RKGKVFQEPLFYEAPRSVDWR ... . .: :.:.. ::... : : . .. :: .:: .. .: ..:. .:.: A45624 NKNAM--YKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYR 270 280 290 300 310 320 330 340 130 140 150 160 170 180 190 200 gi|115 EKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEES ::: : :.:: :::::::...: .:. . .:. ..:.:::..:::: . : ::.:: :.: :: .: ..: A45624 EKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCS--KDNFGCDGGHPFYSFLYVLQNELCLGDE- 350 360 370 380 390 400 410 210 220 230 240 250 260 270 gi|115 YPYEATEES-C-KYNPKYSVA-NDTGFVDIPKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGV : :.: .. : .: : .:. .. : : .:. :. :. :::.:: . .... :. :.::.: . :: :...:.: A45624 YKYKAKDDMFCLNYRCKRKVSLSSIGAV----KENQLILALNEVGPLSVNVGVNND-FVAYSEGVY-NGTCS-EELNHSV 420 430 440 450 460 470 480 490 280 290 300 310 320 330 gi|115 LVVGYG-FESTESD-NNK-------------------YWLVKNSWGEEWGMGGYVKMAKDRRN---HCGIASAASYPTV :.:::: :.:. . 
::: ::..::::...:: .:........ . :::. . :: A45624 LLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPIL 500 510 520 530 540 550 560 --------------------------------------------------------------------------- >>S02729 actinidain (EC 3.4.22.14) precursor (clone pAC.7) - kiwi fruit (frag (184 aa) initn: 292 init1: 131 opt: 400 Z-score: 468.1 expect() 3e-19 Smith-Waterman score: 400; 39.375% identity in 160 aa overlap Entrez lookup Re-search database >S02729 177- 331: --------------------------------: 140 150 160 170 180 190 200 210 gi|115 SCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKY--- ::::: . .::.. .:::...::.::: : . :. S02729 TRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNLDLQ 10 20 30 40 220 230 240 250 260 270 280 290 gi|115 NPKYSVANDTGFVDIPKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNN : :: :. :: . ..: ... .....: :.:::.::. ..: :. ::. : :.. .::.: .:::: :. . S02729 NEKY-VTIDT-YENVPYNNEWALQTAVTYQPVSVALDAAGDAFKHYSSGIFTGP-CGTA-IDHAVTIVGYGTEG----GI 50 60 70 80 90 100 110 300 310 320 330 gi|115 KYWLVKNSWGEEWGMGGYVKMAKDR--RNHCGIASAASYPTV ::.::::: :: ::... .. . ::::. :::. S02729 DYWIVKNSWDTTWGEEGYMRILRNVGGAGTCGIATMPSYPVKYNNQNYPKPYSSLINPSAFSMSKDGPVE 120 130 140 150 160 170 180 --------------------------------------------------------------------------- >>PQ0650 senescence-associated protein SAG2 - Arabidopsis thaliana (fragment) (95 aa) initn: 353 init1: 353 opt: 385 Z-score: 455.3 expect() 1.6e-18 Smith-Waterman score: 385; 58.889% identity in 90 aa overlap Entrez lookup Re-search database >PQ0650 115- 204: -------------------: 80 90 100 110 120 130 140 150 gi|115 MAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRK :.. :::: : :.:::.:: :::::.::.:::::. . . PQ0650 TEAALPETKDWREDGIVSPVKDQGGCGSCWTFSTTGALEAAYHQA 10 20 30 40 160 170 180 190 200 210 220 230 gi|115 TGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPKQEKA :. ::::::.::::.: .: : :::: . 
::.:...:::::.:..: : PQ0650 FGKGISLSEQQLVDCAGAFNNYGSNGGLPSQAFEYIKSNGGLDTEKAYRY 50 60 70 80 90 240 250 260 270 280 290 300 310 gi|115 LMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKM --------------------------------------------------------------------------- >>A41158 dipeptidyl-peptidase I (EC 3.4.14.1) precursor - rat (462 aa) initn: 334 init1: 147 opt: 393 Z-score: 454.1 expect() 1.8e-18 Smith-Waterman score: 444; 32.237% identity in 304 aa overlap Entrez lookup Re-search database >A41158 60- 328: ---------------------------------------------------------: 20 30 40 50 60 70 80 90 gi|115 TFDHSLEAQWTKWKAMHNRLYGMNEEGWRRAVWEKNMKMIELHNQEYREGKHSFTMAMNA----FGDMTSEEFRQV-MNG : .... .:.:. :.:. . : ::.... . A41158 TMTGWVHDVLGRNWACFVGKKMANHSEKVYVNVAHLGGLQEKYSERLYSHNHNFVKAINSVQKSWTATTYEEYEKLSIRD 130 140 150 160 170 180 190 200 100 110 120 130 140 150 gi|115 FQNRKPRKGKVFQE---PLFYEA-------PRSVDWRE-KG--YVTPVKNQGQCGSCWAFSATGALEGQ--MFRKTGRLI . :. ..:.... :. : :.: :::. .: .:.::.:: .::::..:.. : ::.. .. .... A41158 LIRRSGHSGRILRPKPAPITDEIQQQILSLPESWDWRNVRGINFVSPVRNQESCGSCYSFASLGMLEARIRILTNNSQTP 210 220 230 240 250 260 270 280 160 170 180 190 200 210 220 230 gi|115 SLSEQNLVDCSGPQGNEGCNGGL-MDYAFQYVQDNGGLDSEESYPYEATEESCK--------YNPKYSVANDTGFVDIPK :: :..:.:: : . .::.::. . : .:.:: : .. :. .:: ::. :: :. .: .. :: A41158 ILSPQEVVSCS-PYA-QGCDGGFPYLIAGKYAQDFGVVE-ENCFPYTATDAPCKPKENCLRYYSSEYYYVG--GFYG--G 290 300 310 320 330 340 350 240 250 260 270 280 290 300 gi|115 QEKALMK-AVATVGPISVAIDAGHESFLFYKEGIYF-----EPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGE ..:::: .. ::..::... :..:: :. ::: .: : .:.::.:::: . . . . ::.:::::: A41158 CNEALMKLELVKHGPMAVAFEV-HDDFLHYHSGIYHHTGLSDPFNPFELTNHAVLLVGYGKDPVTGLD--YWIVKNSWGS 360 370 380 390 400 410 420 430 310 320 330 gi|115 EWGMGGYVKMAKDRRNHCGIASAASYPTV .:: .:: .. . 
..:.: : : A41158 QWGESGYFRIRRGT-DECAIESIAMAAIPIPKL 440 450 460 --------------------------------------------------------------------------- >>S66504 dipeptidyl-peptidase I (EC 3.4.14.1) precursor - human (463 aa) initn: 313 init1: 154 opt: 386 Z-score: 446.1 expect() 5.1e-18 Smith-Waterman score: 432; 36.145% identity in 249 aa overlap Entrez lookup Re-search database >S66504 100- 328: ------------------------------------------------: 60 70 80 90 100 110 120 130 gi|115 ELHNQEYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEAPRSVDWRE-KG--YVTPVKNQGQCG : ... :. : : : :::. .: .:.::.::..:: S66504 NAIQKSWTATTYMEYETLTLGDMIRRSGGHSRKIPRPKPAPLTAEIQQKILHL--PTSWDWRNVHGINFVSPVRNQASCG 180 190 200 210 220 230 240 250 140 150 160 170 180 190 200 210 gi|115 SCWAFSATGALEGQ--MFRKTGRLISLSEQNLVDCSGPQGNEGCNGGL-MDYAFQYVQDNGGLDSEESYPYEATEESCK- ::..:.. : ::.. .. .... :: :..:.:: : .::.::. . : .:.:: : : : .:: .:. :: S66504 SCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCS--QYAQGCEGGFPYLIAGKYAQDFG-LVEEACFPYTGTDSPCKM 260 270 280 290 300 310 320 330 220 230 240 250 260 270 gi|115 -------YNPKYSVANDTGFVDIPKQEKALMK-AVATVGPISVAIDAGHESFLFYKEGIYF-----EPDCSSEDMDHGVL :. .: .. :: ..:::: .. ::..::... ...:: ::.::: .: : .:.:: S66504 KEDCFRYYSSEYHYVG--GFYG--GCNEALMKLELVHHGPMAVAFEV-YDDFLHYKKGIYHHTGLRDPFNPFELTNHAVL 340 350 360 370 380 390 400 280 290 300 310 320 330 gi|115 VVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV .:::: .: ... ::.:::::: :: .:: .. . 
..:.: : : S66504 LVGYGTDS--ASGMDYWIVKNSWGTGWGENGYFRIRRGT-DECAIESIAVAATPIPKL 410 420 430 440 450 460 --------------------------------------------------------------------------- >>S57425 cysteine proteinase (EC 3.4.22.-) 7 - Tritrichomonas foetus (fragmen (152 aa) initn: 275 init1: 167 opt: 378 Z-score: 444.2 expect() 6.5e-18 Smith-Waterman score: 378; 43.421% identity in 152 aa overlap Entrez lookup Re-search database >S57425 140- 288: --------------------------------: 100 110 120 130 140 150 160 170 gi|115 PRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCN :::: : :. .:: : : :::::::: . ::: S57425 AFSAIQAAESANAISTGTLESYSEQNLVDCV--TACYGCN 10 20 30 180 190 200 210 220 230 240 250 gi|115 GGLMDYAFQYV--QDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPK-QEKALMKAVATVGPISVAIDAGHESF ::::: ...:. ...: .. : .: : : . .::.. .:.. . .:.. . .: : . : :::.:::::.. :: S57425 GGLMDASYEYIVAKQGGKMNYESDYVYTALDGTCKFTQYTAVGSVSKYVNVAQGDEDDLASKCETYGPIAVAIDASNWSF 40 50 60 70 80 90 100 110 260 270 280 290 300 310 320 330 gi|115 LFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV .:. ::: : .::: ..:::: :::: :.. S57425 QLYSGGIYDEKSCSSYSLDHGVGCVGYGVEGSTK 120 130 140 150 --------------------------------------------------------------------------- >>S57421 cysteine proteinase (EC 3.4.22.-) 6 - Tritrichomonas foetus (fragmen (152 aa) initn: 225 init1: 122 opt: 370 Z-score: 435.1 expect() 2.1e-17 Smith-Waterman score: 370; 44.000% identity in 150 aa overlap Entrez lookup Re-search database >S57421 140- 286: -------------------------------: 100 110 120 130 140 150 160 170 gi|115 PRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCN :::: :.:. . :: :.:::::::::: :::: S57421 AFSAIQAIESVYAIGTGTLLSLSEQNLVDCVDT--CEGCN 10 20 30 180 190 200 210 220 230 240 250 gi|115 GGLMDYAFQYV--QDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIP-KQEKALMKAVATVGPISVAIDAGHESF ::::: :..:: ..:: ...: :: : . .:.: .. .... .:. .. ..: : : :: .:::::. 
.: S57421 GGLMDAAYDYVIEKQNGQFNTEASYWYIGIDETCMFDKYEKAGSISGYYNVAASSEDDLAAKVEQYGPAAVAIDASAVGF 40 50 60 70 80 90 100 110 260 270 280 290 300 310 320 330 gi|115 LFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV .: ::: . ::: .:::: ::.: :. S57421 QLYWGGIYDNSGCSSVMLDHGVGCVGFGVEGGTQ 120 130 140 150 --------------------------------------------------------------------------- >>S57427 cysteine proteinase (EC 3.4.22.-) 4 - Tritrichomonas foetus (fragmen (152 aa) initn: 290 init1: 155 opt: 370 Z-score: 435.1 expect() 2.1e-17 Smith-Waterman score: 370; 43.709% identity in 151 aa overlap Entrez lookup Re-search database >S57427 140- 286: -------------------------------: 100 110 120 130 140 150 160 170 gi|115 PRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCN ::..: .:. . :.:.::::::::. ::.: :: S57427 AFATTQCMESINALRFKSLFSFSEQNLVDCD-PQSN-GCA 10 20 30 180 190 200 210 220 230 240 250 gi|115 GGLMDYAFQYVQ--DNGGLDSEESYPYEATEES-CKYNPKYSVANDTGFVDIPKQ-EKALMKAVATVGPISVAIDAGHES :: ::.... .:: .. :..::: .:. . ::..:. . . :::... : :. :.: ::.::::.: :::. : S57427 GGSPFSAFMFISRTQNGQINLEDDYPYTGTDTNDCKFDPSKGYGRITGFMSVQAQSEEDLFKCVASVGPIAVCIDASLAS 40 50 60 70 80 90 100 110 260 270 280 290 300 310 320 330 gi|115 FLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV : :. ::: . .::: .::.: .::: :. S57427 FNSYSSGIYNDRQCSSTVLDHAVGCIGYGAEGGA 120 130 140 150 --------------------------------------------------------------------------- >>S21864 probable cysteine proteinase (EC 3.4.22.-) - Euroglyphus maynei (211 aa) initn: 289 init1: 172 opt: 370 Z-score: 432.9 expect() 2.8e-17 Smith-Waterman score: 370; 32.000% identity in 200 aa overlap Entrez lookup Re-search database >S21864 115- 311: ----------------------------------------- : 80 90 100 110 120 130 140 150 gi|115 MAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRK : .: : :::.. :: ::::::::.... :. .. 
S21864 TYACSINSVSLPSELDLRSLRTVTPIRMQGGCGSCWAFSGVASTESAYLAY 10 20 30 40 50 160 170 180 190 200 210 220 230 gi|115 TGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPKQEKA . ..:.::.::::.. ..::.: . ...:.:.:: .. :. ::: : :.:: . :. . . .. .: .. S21864 RNMSLDLAEQELVDCAS---QNGCHGDTIPRGIEYIQQNGVVQ-EHYYPYVAREQSC-HRPNAQRYGLKNYCQISPPDSN 60 70 80 90 100 110 120 240 250 260 270 280 290 300 310 gi|115 LMKAVATVGPISVAIDAG---HESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGY .. . : .::. : ..: : .. : . . :.: .:::: .... ::.:.::: :: .:: S21864 KIRQALTQTHTAVAVIIGIKDLNAFRHYDGRTIMQHDNGYQPNYHAVNIVGYG----NTQGVDYWIVRNSWDTTWGDNGY 130 140 150 160 170 180 190 200 320 330 gi|115 VKMAKDRRNHCGIASAASYPTV S21864 GYFAANINL 210 --------------------------------------------------------------------------- >>S46265 cysteine proteinase - Plasmodium vivax (583 aa) initn: 346 init1: 113 opt: 374 Z-score: 430.8 expect() 3.6e-17 Smith-Waterman score: 468; 29.909% identity in 331 aa overlap Entrez lookup Re-search database >S46265 54- 331: ----------------------------------------------------------: 20 30 40 50 60 70 80 gi|115 IASATLTFDHSLEAQWTKWKAMHNRLYGMNEEGWRRAVWEKNMKMIELHNQEYREGKHSFTMAMNAFGDMTSEEF----- ::.:: :. ... : .. . : .: :.:.....: S46265 VSVAQIEGLFVNLKYASKFFNFMNKYKRSYKDINEQMEKYKNFKMNYLKIKKHNETNQMYKMKVNQFSDYSKKDFESYFR 220 230 240 250 260 270 280 290 90 100 110 120 130 140 150 gi|115 ----------RQVMNGFQNRKPRKGK--VFQEP---LFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFR .. . :.. . ::: : . :. ..:. .:.:::: : :.:: :::::::...: .: .. . S46265 KLVPIPDHLKKKYVVPFSSMNNGKGKNVVTSSSGANLLADVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNVECMYAK 300 310 320 330 340 350 360 370 160 170 180 190 200 210 220 230 gi|115 KTGR-LISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPK-Q . .. ...::::..:::: . : ::.:: :.: :. .:: . ..: :.: .. : : : . . .. . 
S46265 EHNKTILTLSEQEVVDCS--KLNFGCDGGHPFYSFIYAIENG-ICMGDDYKYKAMDNLFCLN--YRCKNKVTLSSVGGVK 380 390 400 410 420 430 440 450 240 250 260 270 280 gi|115 EKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYG--------------------------- :. :..:. :::.:: . . ..: :: :: :. :. :...:.::.:::: S46265 ENELIRALNEVGPVSVNVGVT-DDFSFYGGGI-FNGTCT-EELNHSVLLVGYGQVQSSKIFQEKNAYDDASGVTKKGALS 460 470 480 490 500 510 520 530 290 300 310 320 330 gi|115 FESTESDNNKY-WLVKNSWGEEWGMGGYVKMAKDRRN---HCGIASAASYPTV . : .:. .: :..::::.. :: .:.......... :::. . :: S46265 YPSKADDGIQYYWIIKNSWSKFWGENGFMRISRNKEGDNVFCGIGVEVFYPIL 540 550 560 570 580 --------------------------------------------------------------------------- >>S68784 cathepsin L - Paramecium tetraurelia (SGC5) (fragment) (294 aa) initn: 279 init1: 125 opt: 361 Z-score: 420.5 expect() 1.4e-16 Smith-Waterman score: 498; 34.211% identity in 304 aa overlap Entrez lookup Re-search database >S68784 31- 331: ---------------------------------------------------------------: 10 20 30 40 50 60 70 80 gi|115 MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGMNEEGWRRAVWEKNMKMIELHNQEYREGKHSFTMAMNAF .: .:..: .:. .: ....: .::: ::: :: .. :. : : S68784 TAGYYHLQEDDTNDFERWALKNNKFYTESEKLYRMEIYNSNKRMIEEHNQ--REDV-TYQMGENQF 10 20 30 40 50 60 90 100 110 120 130 140 150 gi|115 GDMTSEEFRQV-MNGFQNRKPRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLI .. ::: .. .. .. : . : . :. .::::. :.: ::.::::.: ::::....::. . . . : S68784 MTLSHEEFVDLYLQKSDSSVNIMGASLPEVQL-EGLGAVDWRN--YTT-VKEQGQCASGWAFSVSNSLEAWYAIRGFQKI 70 80 90 100 110 120 130 160 170 180 190 200 210 220 230 gi|115 SLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCK--YNPKYSVANDTGFVDIPKQEKALMK . : :..:::. .: ::.:: ::..:: : : : .::: : ...:: : : . : .:: ... .. S68784 NASTQQIVDCD--YNNTGCSGGYNAYAMEYVLRVG-LVSSTNYPYVAKNQTCKQSRNGTYFI-NGYSFVGGSQSN---LQ 140 150 160 170 180 190 200 210 240 250 260 270 280 290 300 310 gi|115 AVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMAKD . ::::...:.. . ::. :.. .:::. 
.: .:.::. .: :: :.:.:::: .:: .: ... S68784 YYLNNYPISVGVEASN--WQFYRSGLF--SNCSSNGTNHYALAVGF-----DSANN--WIVQNSWGTQWGESGNIRLYP- 220 230 240 250 260 270 280 320 330 gi|115 RRNHCGIASAASYPTV .: ::: . :: S68784 -QNTCGILN---YPYQVY 290 --------------------------------------------------------------------------- >>JQ0337 allergen Der p 1 - house-dust mite (Dermatophagoides pteronyssinus) (245 aa) initn: 285 init1: 206 opt: 349 Z-score: 407.9 expect() 6.9e-16 Smith-Waterman score: 349; 31.188% identity in 202 aa overlap Entrez lookup Re-search database >JQ0337 114- 311: ----------------------------------------- : 80 90 100 110 120 130 140 150 gi|115 TMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFR :: .: :. :::.. :: ::::::::...: :. .. JQ0337 KNRFLMSAEAFEHLKTQFDLNAETNACSINGNAPAEIDLRQMRTVTPIRMQGGCGSCWAFSGVAATESAYLA 10 20 30 40 50 60 70 160 170 180 190 200 210 220 gi|115 KTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKY--NPKYSVANDTGFV--DIP . .. ..:.::.::::.. .: :.: . ...:.: :: .. : : : : :.::. .....: . . JQ0337 HRNQSLDLAEQELVDCASQHG---CHGDTIPRGIEYIQHNGVVQ-ESYYRYVAREQSCRRPNAQRFGISNYCQIYPPNAN 80 90 100 110 120 130 140 230 240 250 260 270 280 290 300 gi|115 KQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMG : ..:: . . : ..: : .. : . . :.: .::: ..... ::.:.::: .:: . 
JQ0337 KIREALAQPQRYCRHYWTIKDL--DAFRHYDGRTIIQRDNGYQPNYHAVNIVGY----SNAQGVDYWIVRNSWDTNWGDN 150 160 170 180 190 200 210 220 310 320 330 gi|115 GYVKMAKDRRNHCGIASAASYPTV :: JQ0337 GYGYFAANIDLMMIEEYPYVVIL 230 240 --------------------------------------------------------------------------- >>S57423 cysteine proteinase (EC 3.4.22.-) 9 - Tritrichomonas foetus (fragmen (152 aa) initn: 239 init1: 129 opt: 336 Z-score: 396.2 expect() 3.1e-15 Smith-Waterman score: 336; 38.411% identity in 151 aa overlap Entrez lookup Re-search database >S57423 140- 287: -------------------------------: 100 110 120 130 140 150 160 170 gi|115 PRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCN ::::. : ::: : :.:.::: :::.::. . .:::. S57423 AFSAVCAQEGQWARTKGELLSLSVQNLLDCD--DDSEGCG 10 20 30 180 190 200 210 220 230 240 250 gi|115 GGL-MDYAFQYVQD-NGGLDSEESYPYEA-TEESCKYNPKYSVANDTGFVDIPKQEKALMKAVATVGPISVAIDAGHESF :: .. :. ... :: :..::: . . ..: .. . .:.. : .:..: .:. .. : : : :: ::.. .: S57423 GGWPFSGIFHVISEQNGEWMLENDYPYTSHSSNQCYFDASKGVSKTTKIVQLPINEEKILAACAEYGVISCCIDSSPIDF 40 50 60 70 80 90 100 110 260 270 280 290 300 310 320 330 gi|115 LFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV ..:.:::. .:.. ..::.: .:::: :. S57423 MYYSEGIFDTDQCNAWELDHAVNIVGYGAEAGTK 120 130 140 150 --------------------------------------------------------------------------- >>S57451 cysteine proteinase (EC 3.4.22.-) 3 - Tritrichomonas foetus (fragmen (157 aa) initn: 227 init1: 147 opt: 322 Z-score: 379.9 expect() 2.5e-14 Smith-Waterman score: 322; 33.766% identity in 154 aa overlap Entrez lookup Re-search database >S57451 140- 286: ------------------------------- : 100 110 120 130 140 150 160 170 gi|115 PRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCN .:.: .:.:: : ..:.:...::: .::: . 
:: S57451 SFAACAAFEGAWFASSGKLVKISEQLFVDCC--KYCFGCY 10 20 30 180 190 200 210 220 230 240 250 gi|115 GGLMDYAFQYV--QDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDI-----PKQEKALMKAVATVGPISVAIDAG :: : :.... ...: . .:.::: .:. :.:. ... .. . .: . ..: . ... .::..::::: S57451 GGSADAAYNWAIHENDGKVCLHEDYPYTGTQGVCRYKSSMAYGHVSQYVRVFSLSEISDEDLMCQTLEEIGPLTVAIDAD 40 50 60 70 80 90 100 110 260 270 280 290 300 310 320 330 gi|115 HESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPT .: .: :::.. : . : .:.: ::::: :. S57451 GAKFRLYDSGIYYDDTCVQGDANHAVAVVGYGEEDNGEQ 120 130 140 150 --------------------------------------------------------------------------- >>B48566 cysteine proteinase Lpcys1 (EC 3.4.22.-) - Leishmania pifanoi (149 aa) initn: 190 init1: 116 opt: 313 Z-score: 370.0 expect() 8.9e-14 Smith-Waterman score: 313; 38.926% identity in 149 aa overlap Entrez lookup Re-search database >B48566 140- 283: ------------------------------ : 100 110 120 130 140 150 160 170 gi|115 PRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCN :::: : .::: . :.::::: ::.:.. .:::: B48566 AFSAIGNIEGQWAASGHSLVSLSEQMLVSCDNI--DEGCN 10 20 30 180 190 200 210 220 230 240 250 gi|115 GGLMDYAFQYVQD--NGGLDSEESYPYEA---TEESCKYNPKYSVANDTGFVDIPKQEKALMKAVATVGPISVAIDAGHE ::::: :...... ::.. .: :::: . :. :. . . . :. :::...:..:. . . : ::..::.:: B48566 GGLMDQAMNWIMQSHNGSVFTEASYPYTSGGGTRPPCHDEGEVG-AKITGFLSLPHDEERIAEWVEKRGPVAVAVDA--T 40 50 60 70 80 90 100 110 260 270 280 290 300 310 320 330 gi|115 SFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV .. .: :. : . ...::::.::.. 
B48566 TWQLYFGGVV--SLCLAWSLNHGVLIVGFNKNAKPP 120 130 140 --------------------------------------------------------------------------- >>S31914 cysteine proteinase - chickpea (fragment) (111 aa) initn: 173 init1: 173 opt: 310 Z-score: 368.5 expect() 1.1e-13 Smith-Waterman score: 310; 46.491% identity in 114 aa overlap Entrez lookup Re-search database >S31914 137- 246: -----------------------: 100 110 120 130 140 150 160 170 gi|115 NRKPRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRK--TGRLISLSEQNLVDCSGPQG :::::: . . .... : ::...:::::.: ::. S31914 SCWAFSDVQPV-SEFINKIVTGKFVSLSEQELGDCDRAF- 10 20 30 180 190 200 210 220 230 240 250 gi|115 NEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYS-VANDTGFVDIPK-QEKALMKAVATVGPISVAIDAG :::::::::::::... :::.:....:::.. :..: . : . :.. :. :.:. .:.:: :::: :.: S31914 NEGCNGGLMDYAFEFIIRNGGIDTDQDYPYNGFERKCDPTKKNAKVVSIDGYEDVPSYNENALKKAVAH-QPVS 40 50 60 70 80 90 100 110 260 270 280 290 300 310 320 330 gi|115 HESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPT --------------------------------------------------------------------------- >>S46541 cysteine proteinase - chickpea (fragment) (111 aa) initn: 173 init1: 173 opt: 308 Z-score: 366.2 expect() 1.4e-13 Smith-Waterman score: 308; 46.491% identity in 114 aa overlap Entrez lookup Re-search database >S46541 137- 246: -----------------------: 100 110 120 130 140 150 160 170 gi|115 NRKPRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRK--TGRLISLSEQNLVDCSGPQG :::::: . . .... : ::...:::::.: ::. S46541 SCWAFSDVQPV-SEFINKIVTGKFVSLSEQELGDCDRAF- 10 20 30 180 190 200 210 220 230 240 250 gi|115 NEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYS-VANDTGFVDIPK-QEKALMKAVATVGPISVAIDAG :::::::::::::... :::.:....:::.. :..: . : . :.. :. :.:. 
.: :: :::: :.: S46541 NEGCNGGLMDYAFEFIIRNGGIDTDQDYPYNGFERKCDPTKKNAKVVSIDGYEDVPSYMEMALKKAVAH-QPVS 40 50 60 70 80 90 100 110 260 270 280 290 300 310 320 330 gi|115 HESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPT --------------------------------------------------------------------------- >>S57426 cysteine proteinase (EC 3.4.22.-) 5 - Tritrichomonas foetus (fragmen (155 aa) initn: 186 init1: 113 opt: 305 Z-score: 360.6 expect() 3e-13 Smith-Waterman score: 305; 37.748% identity in 151 aa overlap Entrez lookup Re-search database >S57426 140- 284: ------------------------------- : 100 110 120 130 140 150 160 170 gi|115 PRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCN :::. : :: .::.:. :::::::::. .. .::. S57426 AFSTIVAQEGCHQIETGELLRLSEQNLVDCA--DNCHGCD 10 20 30 180 190 200 210 220 230 240 250 gi|115 GGLMDYAFQYV--QDNGGLDSEESYPYEATEES-CK-YNPKYSVANDTGFVDIPK-QEKALMKAVATVGPISVAIDAGHE :: ::.:: ...: ....::: : . : : . :.: .. .::. .:.:. ..::. ::... .:... S57426 GGWPIEAFNYVLNKQGGKYCTDDDYPYTAEQALLCYFYRVQQPVSNIASVYQIPQGDEEAMKEVVANWGPVAINVDSNYG 40 50 60 70 80 90 100 110 260 270 280 290 300 310 320 330 gi|115 SFLFYKEGIYFEPDCSSEDM-DHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV :: :: ::: : .:. . . .:.. ..::: S57426 SFNFYDGGIYVEESCQVKYVYSHAMGIIGYGSAEGQD 120 130 140 150 --------------------------------------------------------------------------- >>S04924 CTLA-2-alpha protein precursor - mouse (136 aa) initn: 271 init1: 271 opt: 292 Z-score: 346.6 expect() 1.8e-12 Smith-Waterman score: 292; 39.844% identity in 128 aa overlap Entrez lookup Re-search database >S04924 5- 132: ---------------------------: 10 20 30 40 50 60 70 gi|115 MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGMNEEGWRRAVWEKNMKMIELHNQEYREGK ..: .:::. ::. : ::. .: .::. . 
:..::: :: :::.: : :: :: .:..:: S04924 MVSICEQKLQHFSAVFLLILCLGMMSAAPPPDPSLDNEWKEWKTKFAKAYNLNEERHRRLVWEENKKKIEAHNADYEQGK 10 20 30 40 50 60 70 80 80 90 100 110 120 130 140 150 gi|115 HSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQ :: :..: :.:.: :::. : . : : :.. : . :. : ...:.:: . : S04924 TSFYMGLNQFSDLTPEEFKT--NCYGNSLNR-GEM--AP---DLPEYEDLGKNSYLTPGRAQPE 90 100 110 120 130 160 170 180 190 200 210 220 230 gi|115 MFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPK --------------------------------------------------------------------------- >>S60456 cysteine proteinase (EC 3.4.22.-), glucose starvation-induced - maiz (145 aa) initn: 179 init1: 179 opt: 291 Z-score: 345.0 expect() 2.2e-12 Smith-Waterman score: 291; 34.058% identity in 138 aa overlap Entrez lookup Re-search database >S60456 197- 329: ----------------------------: 160 170 180 190 200 210 220 230 gi|115 RLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPKQEKALM .::..::: ... .::.. . ::. .: . .: . S60456 ESEKDYPYTGSDGKCKFDKSKIVASVQNFSVVSVDEAQIS 10 20 30 40 240 250 260 270 280 290 300 310 gi|115 KAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYG---FESTESDNNKYWLVKNSWGEEWGMGGYVK ::....:.:.. . : :. :. . .:::::.:::: : . .. ::..::::::.:: .:: : S60456 ANRIKHGPLAIGINAAYMQT--YIGGVSCPYICGRH-LDHGVLLVGYGASGFAPMRLKDKPYWIIKNSWGENWGENGYYK 50 60 70 80 90 100 110 320 330 gi|115 MAK--DRRNHCGIASAASYPTV . . . ::.::. : .: S60456 ICRGSNVRNKCGVDSMVSTVSAVHASKE 120 130 140 --------------------------------------------------------------------------- >>S04925 CTLA-2-beta protein precursor - mouse (fragment) (141 aa) initn: 271 init1: 271 opt: 281 Z-score: 333.8 expect() 9.3e-12 Smith-Waterman score: 281; 37.500% identity in 128 aa overlap Entrez lookup Re-search database >S04925 5- 132: ---------------------------: 10 20 30 40 50 60 gi|115 MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGMNEEGWRRAVWEKNMKMIELHNQE ..: .:::. ::. . : ::. .: .::. . 
:...:: :: .::.: : :: :: . S04925 LDNKVLVSICEQKLQHFSAVFLLILCLGMMSAAPSPDPSLDNEWKEWKTTFAKAYSLDEERHRRLMWEENKKKIEAHNAD 10 20 30 40 50 60 70 80 70 80 90 100 110 120 130 140 gi|115 YREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATG :..:: :: :..: :.:.: :::: : . .:.. : . :. : ...:.:: . : S04925 YERGKTSFYMGLNQFSDLTPEEFRTNCCG---SSMCRGEM--AP---DLPEYEDLGKNSYLTPGRAQPE 90 100 110 120 130 140 150 160 170 180 190 200 210 220 gi|115 ALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGF --------------------------------------------------------------------------- >>KHQBTT cysteine proteinase (EC 3.4.22.-) precursor - Theileria parva (439 aa) initn: 344 init1: 182 opt: 287 Z-score: 333.2 expect() 1e-11 Smith-Waterman score: 451; 27.059% identity in 340 aa overlap Entrez lookup Re-search database >KHQBTT 10- 324: ------------------------------------------------------------------ : 10 20 30 40 gi|115 MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGMNEEGWRR : :. : ... . .. .... .:: .. ..: : KHQBTT FSKYKMLNKFKRELDDHLTKDFPNLERSKRDTCFDELTRLFGDGFLSDDPKLEYEVYREFEEFNSKYNRRHATQQERLNR 70 80 90 100 110 120 130 140 50 60 70 80 90 100 gi|115 AVWEKNMKMIELHNQEYREGKHSFTMAMNAFGDMTSEEFRQ---VM--------NGFQNRKPRKGKVFQE---------- : .. ...:...:. : . .. ..: :.:.: .:: . :: ::. . .:.. . KHQBTT LVTFRS-NYLEVKEQK---GDEPYVKGINRFSDLTEREFYKLFPVMKPPKATYSNGYYLLSHMANKTYLKNLKKALNTDE 150 160 170 180 190 200 210 220 110 120 130 140 150 160 170 180 gi|115 --PLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYA : . ...:::... :: ::.:..::.:::::..:..:: .. . . :: :.:.::.. ..::.:::.. : KHQBTT DVDLAKLTGENLDWRRSSSVTSVKDQSNCGGCWAFSTVGSVEGYYMSHFDKSYELSVQELLDCDSF--SNGCQGGLLESA 230 240 250 260 270 280 290 190 200 210 220 230 240 250 260 gi|115 FQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFE ..::. : : : .. :. . :. :: . .. .. . : .... ... : .: :: .... : . :: :. 
: KHQBTT YEYVRKYG-LVSAKDLPFVDKARRCSV-PKAKKVSVPSY-HVFKGKEVMTRSL-TSSPCSVYLSVSPE-LAKYKSGV-FT 300 310 320 330 340 350 360 370 270 280 290 300 310 320 330 gi|115 PDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDR--RNHCGIASAASYPTV .:. ....:.:..:: :.. : ...::.:.:::: .:: .::... . ..::. KHQBTT GECG-KSLNHAVVLVGEGYD--EVTKKRYWVVQNSWGTDWGENGYMRLERTNMGTDKCGVLDTSMSAFEL 380 390 400 410 420 430 --------------------------------------------------------------------------- >>A69493 cysteine proteinase homolog - Archaeoglobus fulgidus (1088 aa) initn: 234 init1: 136 opt: 258 Z-score: 294.1 expect() 1.5e-09 Smith-Waterman score: 289; 30.864% identity in 243 aa overlap Entrez lookup Re-search database >A69493 115- 330: ---------------------------------------------: 80 90 100 110 120 130 140 150 gi|115 MAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRK : :::. .. :..::.:::::: ::..:::. .. . A69493 ASRILTFTSTCSDGVQNGDEEGIDCGGSCLPCNRCDMASLPSRFDWRDYTGLSAVRDQGSCGSCWAHSAVAALESALIVE 560 570 580 590 600 610 620 630 160 170 180 190 200 210 220 gi|115 TG--RLISLSEQNLV----DCSGPQGN-----EG-CNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVAND .: :.::::.:. :: :. : :.:: :.... .:: : : .:: ::. .: . A69493 SGASSSIDLSEQHLLSCEQDCEVGIGDWCWASSGDCDGGWPHKALNFIINNGVPD-ESCFPYTATNGNCGSKCGDWEDRT 640 650 660 670 680 690 700 710 230 240 250 260 270 280 290 gi|115 TGFV---DIPKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKY---- : : . .. .:: .:. ::.::: ::. .:..:.::: :: ..:: A69493 EGAVYRGKVSSNVEALKRALICHGPLSVA----------------------SENWEHALLLVGYDDLSTIC-TQKYGKSG 720 730 740 750 760 770 300 310 320 330 gi|115 -WLVKNSWG-------EEWGMGGYVKMAKDRRNHCGIASAASYPTV :..::::: . : ::. . . .. 
: ..: : A69493 CWILKNSWGVFSGFSHDVWHEYGYAYIPYSGFKYSDIKNGAYYVIPSDYTLHADFEMMDGLAAGDLDGDGMAEIVHADRG 780 790 800 810 820 830 840 850 A69493 DLLQIFNLGGLQSSEQMDFEEGDRIATGDVDGDGRMDVIHADRGDEVSIHFQPPVGVVSSFYLDFEEGDDIAAGDVNGDG 860 870 880 890 900 910 920 930 --------------------------------------------------------------------------- >>S57624 cysteine proteinase LmCPb19 - Leishmania mexicana (fragment) (136 aa) initn: 195 init1: 108 opt: 238 Z-score: 284.8 expect() 4.9e-09 Smith-Waterman score: 238; 41.758% identity in 91 aa overlap Entrez lookup Re-search database >S57624 224- 314: ------------------- : 190 200 210 220 230 240 250 260 gi|115 DYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPKQEKALMKAVATVGPISVAIDAGHESFLFYKEGI : : : ..:::. .: :::..:.::. ::. :: :. S57624 ELVVGAQIDGHVLIGSSEKAMAAWLAKNGPIAIALDAS--SFMSYKSGV 10 20 30 40 270 280 290 300 310 320 330 gi|115 YFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV : .....::::.::: . . . ::..::::: .:: :::... S57624 LTA--CIGKQLNHGVLLVGYDM----TGEVPYWVIKNSWGGDWGEQGYVRVVMGVNACLLSEYPVSAHVRESAAPGTSTS 50 60 70 80 90 100 110 120 S57624 SETPAHNSVMVEQVY 130 --------------------------------------------------------------------------- >>B26074 cysteine proteinase (EC 3.4.22.-) 13 - papaya (fragment) (96 aa) initn: 116 init1: 116 opt: 227 Z-score: 274.5 expect() 1.9e-08 Smith-Waterman score: 227; 41.111% identity in 90 aa overlap Entrez lookup Re-search database >B26074 243- 329: ------------------: 210 220 230 240 250 260 270 280 gi|115 PYEATEESCKYNPKYSVANDTGFVDIPKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVG ::..:::.:.. . : :. :: . ..::::.:: B26074 GPLAVAINAAYMQT--YIGGVSCPYICSRR-LNHGVLLVG 10 20 30 290 300 310 320 330 gi|115 YG---FESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV :: . . .. ::..::::::.:: .:: :. . : : ::. 
: .: B26074 YGSAGYAPIRLKEKPYWVIKNSWGENWGENGYYKICRGR-NICGVDSMVSTVAAVHTTSQ 40 50 60 70 80 90 --------------------------------------------------------------------------- >>A45565 cysteine proteinase - Theileria annulata (441 aa) initn: 232 init1: 101 opt: 235 Z-score: 273.7 expect() 2.1e-08 Smith-Waterman score: 437; 27.193% identity in 342 aa overlap Entrez lookup Re-search database >A45565 19- 331: -----------------------------------------------------------------: 10 20 30 40 50 gi|115 MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGMNEEGWRRAVWEKNMKM . :: .: :.: .: . ... : ...::... A45565 IESHYPSMDPSKRAGFVEEIVKIRQTGKITSDAESELDMLIEFDAFVE----KYKKVHRSF---DQRVQRFLTFRKNYHI 80 90 100 110 120 130 140 150 60 70 80 90 100 110 gi|115 IELHNQEYREGKHSFTMAMNAFGDMTSEEFRQV------------------MNGFQNRKP------RKGKVFQE--PLFY .. :. . ... .: :.:...:::. . .. .....: .:.: ..: : A45565 VKTHKPT-----EPYSLDLNKFSDLSDEEFKALYPVITPPKTYTSLSKHLEFKKMSHKNPIYISKLKKAKGIEEIKDLSL 160 170 180 190 200 210 220 120 130 140 150 160 170 180 190 gi|115 EAPRSVDWREKGYVTPVKNQG-QCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQ . ....: . :.:.:.:: .::::::::. ...:. . .. ::::.::.:. ... :: ::: :..:.. A45565 ITGENLNWARTDAVSPTKDQGDHCGSCWAFSSIASVESLYRLYKNKSYFLSEQELVNCD--KSSMGCAGGLPITALEYIH 230 240 250 260 270 280 290 300 200 210 220 230 240 250 260 270 gi|115 DNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSS ..: .. : :: . :: . : .: :. ..: : . .. :... ..: :.: . .: . .:. :: : :.. A45565 SKG-VSFESEVPYTGIVSPCKPSIKNKVFIDS--ISILKGNDVVNKSLV-ISPTVVGIAVTKE-LKLYSGGI-FTGKCGG 310 320 330 340 350 360 370 280 290 300 310 320 330 gi|115 EDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDRR--NHCGIASAASYPTV : ..:.::.:: : . . . .::..::::::.:: .:.... . .. ..::: . . 
: A45565 E-LNHAVLLVGEGVD--HETGMRYWIIKNSWGEDWGENGFLRLQRTKKGLDKCGILTFGLNPILYSS 380 390 400 410 420 430 440 --------------------------------------------------------------------------- >>S41426 cysteine proteinase (EC 3.4.22.-) CP4 precursor - Trichomonas vagina (100 aa) initn: 204 init1: 146 opt: 224 Z-score: 270.8 expect() 3e-08 Smith-Waterman score: 224; 31.481% identity in 108 aa overlap Entrez lookup Re-search database >S41426 32- 139: ----------------------: 10 20 30 40 50 60 70 80 gi|115 MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGMNEEGWRRAVWEKNMKMIELHNQEYREGKHSFTMAMNAF : . .. .: : ..: .: .... ::. .. .::.:.: . S41426 WMRETGNMFTGEEYQTRLGIWLSNKRLVQEHNR----ANLGFTVALNKL 10 20 30 40 90 100 110 120 130 140 150 160 gi|115 GDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLIS . .: :. ... ::. : .. : .. . : . :::.:: :.:.:.:::::::: S41426 AHLTPAEYNSLL-GFRMNKAERKAVKSNAI---ANADCDWRKKGAVNPIKDQGQCGSCW 50 60 70 80 90 100 170 180 190 200 210 220 230 240 gi|115 LSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPKQEKALMKAVA --------------------------------------------------------------------------- >>S60479 cathepsin B-like cysteine proteinase (EC 3.4.22.-) - Aztec tobacco (356 aa) initn: 236 init1: 169 opt: 231 Z-score: 270.5 expect() 3.1e-08 Smith-Waterman score: 361; 29.452% identity in 292 aa overlap Entrez lookup Re-search database >S60479 70- 332: ------------------------------------------------------: 30 40 50 60 70 80 90 100 gi|115 TKWKAMHNRLYGMNEEGWRRAVWEKNMKMIELHNQEYREGKHSFTMAMNA-FGDMTSEEFRQVMNGFQNRK-PRKG-KVF : .. :.: :...: .:..... .:: :: .. S60479 LLIGASIIVLQVVAEQPISQAKAESAILQDSIVKQVNENEKAGWKAALNPRFSNFTVSQFKRLLGVKPTRKGDLKGIPIL 20 30 40 50 60 70 80 90 110 120 130 140 150 160 170 180 gi|115 QEPLFYEAPRSVD----WREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGL .: . : :. : : . . . . .::.:::::::.:. .: .. . 
: :::: ..: : : ..::.:: S60479 THPKLLELPQEFDARVAWSNCSTIGRILDQGHCGSCWAFGAVESLSDRFCIHYGLNISLSANDLYACCGFLCGDGCDGGY 100 110 120 130 140 150 160 170 190 200 210 220 230 240 gi|115 MDYAFQY-VQ------------DNGGLDS---EESYPYEATEESC-KYNPKYSVANDTGFVD--IPKQEKALMKAVATVG :..: :. :: : . : .:: ...: : : .: .. : : .. ..: : : S60479 PLQAWKYFVRKGVVTDECDPYFDNEGCSHPGCEPAYPTPKCHRKCVKQNLLWSRSKHFGVNAYMITSDPLSIMTEVYKNG 180 190 200 210 220 230 240 250 250 260 270 280 290 300 310 320 gi|115 PISVAIDAGHESFLFYKEGIYFEPDCSSEDMD-HGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHC :. :.. . .:.: :: :.: ... : :.: ..:.: : :.. :::. :.:.. :: :: :. . :.: S60479 PVEVSFTV-YEDFAHYKSGVY--KHVTGDVMGGHAVKLIGWG---TSEDGEDYWLLANQWNRGWGDDGYFKIRRGT-NEC 260 270 280 290 300 310 320 330 gi|115 GIAS--AASYPTV : . .:. :.. S60479 EIEDEVVAGLPSARNLNVELDVSDAYLDAAM 330 340 350 --------------------------------------------------------------------------- >>S31907 cathepsin B (EC 3.4.22.1) - fluke (Schistosoma japonicum) (342 aa) initn: 194 init1: 89 opt: 225 Z-score: 263.9 expect() 7.2e-08 Smith-Waterman score: 346; 28.148% identity in 270 aa overlap Entrez lookup Re-search database >S31907 96- 326: -------------------------------------------------: 60 70 80 90 100 110 120 130 gi|115 MKMIELHNQEYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEAPRSVDWREK----GYVTPVKN .::.: : .. : : : . : :.: .. ... S31907 ISFINEHPDAGWKADKSDRFHSLDDARILMGARKEDAEMKRNRRPT---VDHHDLNVEIPSQFDSRKKWPHCKSISQIRD 40 50 60 70 80 90 100 110 140 150 160 170 180 190 200 gi|115 QGQCGSCWAFSATGALEGQMFRKTG--RLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNG---GLDSE-----ES :..:::::::.:. :. .. ..: . :: .:..: :. ::.::. :..: : : ..: . S31907 QSRCGSCWAFGAVEAMTDRICIQSGGGQSAELSALDLISCCKDCGD-GCQGGFPGVAWDYWVKRGIVTGGSKENHTGCQP 120 130 140 150 160 170 180 190 210 220 230 240 250 gi|115 YPYEATE---------------------ESCKYNPKYSVANDTGFVD----IPKQEKALMKAVATVGPISVAIDAGHESF ::. : ..:. . : .: . : . ..::.... . ::. .:.:. 
.:.: S31907 YPFPKCEHHTKGKYPACGTKIYKTPQCKQTCQKGYKTPYEQDKHYGDESYNVQNNEKVIQRDIMMYGPVEAAFDV-YEDF 200 210 220 230 240 250 260 260 270 280 290 300 310 320 330 gi|115 LFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV : :: ::: . : :.. ..:.: :. . :::. :::.:.:: : .:.. : ..:.: : S31907 LNYKSGIYRHVTGSIVG-GHAIRIIGWGVEKR----TPYWLIANSWNEDWGEKGLFRMVRGR-DECSIESDVVAGLIKT 270 280 290 300 310 320 330 340 --------------------------------------------------------------------------- >>KHHUB cathepsin B (EC 3.4.22.1) precursor - human (339 aa) initn: 181 init1: 83 opt: 223 Z-score: 261.7 expect() 9.6e-08 Smith-Waterman score: 362; 27.575% identity in 301 aa overlap Entrez lookup Re-search database >KHHUB 66- 326: -------------------------------------------------------: 30 40 50 60 70 80 90 100 gi|115 EAQWTKWKAMHNRLYGMNEEGWRRAVWEKNMKMIELHNQEYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNR-KPRKGK : . ... .: . : .. .... . : . :: . KHHUB MWQLWASLCCLLVLANARSRPSFHPVSDELVNYVNKRNTTWQAGHNFYNVDMSYLKRLCGTFLGGPKPPQRV 10 20 30 40 50 60 70 110 120 130 140 150 160 170 gi|115 VFQEPLFYEAPRSVDWREKGYVTP----VKNQGQCGSCWAFSATGALEGQMFRKTGRLISL--SEQNLVDCSGPQGNEGC .: : : . : : : ::. : ...::.:::::::.:. :. .. .:. .:. : ..:. : : . ..:: KHHUB MFTEDL--KLPASFDAREQWPQCPTIKEIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGC 80 90 100 110 120 130 140 150 180 190 200 210 220 gi|115 NGGLMDYAFQYVQDNGGLDS--EESY----PY------------------EATEESCK------YNPKYSVANDTGF--V ::: :... .: ... ::. :: :. .:. :.: :. . :. KHHUB NGGYPAEAWNFWTRKGLVSGGLYESHVGCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSY 160 170 180 190 200 210 220 230 230 240 250 260 270 280 290 300 gi|115 DIPKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMD-HGVLVVGYGFESTESDNNKYWLVKNSWGEE .. ..:: .: . ::. :... . .::.:: :.: ..: : :.. ..:.: : ... :::: :::. . KHHUB SVSNSEKDIMAEIYKNGPVEGAFSV-YSDFLLYKSGVY--QHVTGEMMGGHAIRILGWGVE----NGTPYWLVANSWNTD 240 250 260 270 280 290 300 310 320 330 gi|115 WGMGGYVKMAKDRRNHCGIASAASYPTV :: .:. :. . . 
.:::: : KHHUB WGDNGFFKILRGQ-DHCGIESEVVAGIPRTDQYWEKI 310 320 330 --------------------------------------------------------------------------- >>S38939 probable cathepsin B-like cysteine proteinase (EC 3.4.22.-) 29K, pre (344 aa) initn: 233 init1: 104 opt: 223 Z-score: 261.6 expect() 9.7e-08 Smith-Waterman score: 327; 28.626% identity in 262 aa overlap Entrez lookup Re-search database >S38939 109- 333: ----------------------------------------------: 70 80 90 100 110 120 130 140 gi|115 GKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALE : ..: .. : . . ...::.:::::::.:. :. S38939 SVPRSHFRRLMGVHPDAHKFTLHEKSLVLGEEVGLADSDVPEEFDARKA--WPNCPTIGEIRDQGSCGSCWAFGAVEAMS 60 70 80 90 100 110 120 130 150 160 170 180 190 200 gi|115 GQMFRKTGRLISL--SEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLD------SEESYPYE-ATEE---------- .. ... : . : ..::.: : :::::. :. : .: .. :. ::: : : S38939 DRLCIHSNATIHFHFSADDLVSCCHTCG-FGCNGGFPGAAWAYWTRKGIVSGGPYGSSQGCRPYEIAPCEHHVNGTRPPC 140 150 160 170 180 190 200 210 220 230 240 250 260 270 gi|115 --------SCKY--NPKYSV--ANDTGF----VDIPKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSED ::.. . .:.: .: : .. .. : ..: . ::. :. . .:....::.:.: . . : S38939 DGEHGKTPSCRHECQKSYDVDYKTDKHFGSKSYSVKRNVKDIQKEIMQNGPVEGAFTV-YEDLILYKDGVY-QHVHGREL 210 220 230 240 250 260 270 280 280 290 300 310 320 330 gi|115 MDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASA--ASYPTV :.. ..:.: : ... :::. :::. .:: .:. :: . . .:::: :: :. 
: : S38939 GGHAIRILGWGVE----NKTPYWLIANSWNTDWGNNGFFKMLRGE-DHCGIESAIAAGLPKV 290 300 310 320 330 340 --------------------------------------------------------------------------- >>S31909 cathepsin B-like cysteine proteinase (EC 3.4.22.-) - fluke (Schistos (316 aa) initn: 176 init1: 94 opt: 221 Z-score: 259.9 expect() 1.2e-07 Smith-Waterman score: 331; 27.239% identity in 268 aa overlap Entrez lookup Re-search database >S31909 98- 326: -------------------------------------------------: 60 70 80 90 100 110 120 130 gi|115 MIELHNQEYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEAPRSVDWREK----GYVTPVKNQG :. :. : .. : : : : :.: .. ...:. S31909 MISFINKHPNAGWKADKSDRFHSVDDARILLGGRREDPNLRQKRRPTVDHHDLKVEIPSHFDSRKKWPRCKSISQIRDQS 10 20 30 40 50 60 70 80 140 150 160 170 180 190 200 gi|115 QCGSCWAFSATGALEGQMFRKTG--RLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNG---GLDSE-----ESYP .:.: :: ::.::. .. ..: . . :: .:..: :. ::.::. :..: ..: : ..: . :: S31909 RCASSWAVSAVGAMSDRICIQSGGKQSVELSAIDLISCCENCGS-GCDGGFPGPAWDYWVSHGIVTGGSKENHTGCQPYP 90 100 110 120 130 140 150 160 210 220 230 240 250 gi|115 YEATEE---------------------SCKYNPKYSVANDTGF----VDIPKQEKALMKAVATVGPISVAIDAGHESFLF . :. .:. . : .: . ... :.:.:..: . ::. . . :.:: S31909 FPKCEHHSKGKYPSCGDKMYKTPQCKRKCQKGYKTPYEHDKHYGGISINVIKNESAIQKEIMMYGPVEAYLLI-FEDFLN 170 180 190 200 210 220 230 240 260 270 280 290 300 310 320 330 gi|115 YKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV :: ::: . .: .: : ..:.:.: ... :::. :.:.:.:: :: .... : :.:.. :. 
S31909 YKSGIY-RYTTGSFVGEHYVRIIGWGIE----NGTAYWLAANTWNEDWGEKGYFRIVRGR-NECSVESVVVAGRLKS 250 260 270 280 290 300 310 --------------------------------------------------------------------------- >>KHRTB cathepsin B (EC 3.4.22.1) precursor - rat (339 aa) initn: 208 init1: 82 opt: 221 Z-score: 259.4 expect() 1.3e-07 Smith-Waterman score: 349; 26.861% identity in 309 aa overlap Entrez lookup Re-search database >KHRTB 56- 326: ---------------------------------------------------------: 20 30 40 50 60 70 80 90 gi|115 SATLTFDHSLEAQWTKWKAMHNRLYGMNEEGWRRAVWEKNMKMIELHNQEYREGKHSFTMAMNAFGDMTSEEFRQVMNGF ...:. .: .. :.. ... .. . . . :..: KHRTB MWWSLIPLSCLLALTSAHDKPSSHPLSDDMINYINKQNTTWQAGRNFYNVDISYLKKLCGT----VLGG- 10 20 30 40 50 60 100 110 120 130 140 150 160 gi|115 QNRKPRKGKVFQEPLFYEAPRSVDWREKGYVTP----VKNQGQCGSCWAFSATGALEGQMFRKT-GRL-ISLSEQNLVDC : : : :.: . . :.: : ::. : ...::.:::::::.:. :. .. .: ::. . .: ..:. : KHRTB PNLPERVG--FSEDI--NLPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSDRICIHTNGRVNVEVSAEDLLTC 70 80 90 100 110 120 130 140 170 180 190 200 210 gi|115 SGPQGNEGCNGGLMDYAFQYVQD----NGGLDSEE--SYPY------------------EATEESCK------YNPKYSV : : ..::::: . :... .::. . . :: :. .:. :. .:. KHRTB CGIQCGDGCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEHHVNGSRPPCTGEGDTPKCNKMCEAGYSTSYKE 150 160 170 180 190 200 210 220 220 230 240 250 260 270 280 290 gi|115 ANDTGFVD--IPKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWL . :... . .:: .: . ::. :. . .:: :: :.: . . .. :.. ..:.:.: .. ::: KHRTB DKHYGYTSYSVSDSEKEIMAEIYKNGPVEGAFTV-FSDFLTYKSGVY-KHEAGDVMGGHAIRILGWGIE----NGVPYWL 230 240 250 260 270 280 290 300 310 320 330 gi|115 VKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV : :::. .:: .:. :. . . 
::::: : KHRTB VANSWNVDWGDNGFFKILRGE-NHCGIESEIVAGIPRTQQYWGRF 300 310 320 330 --------------------------------------------------------------------------- >>D48435 cysteine proteinase AC-3 - nematode (Haemonchus contortus) (341 aa) initn: 214 init1: 97 opt: 216 Z-score: 253.6 expect() 2.7e-07 Smith-Waterman score: 322; 28.571% identity in 280 aa overlap Entrez lookup Re-search database >D48435 88- 324: -------------------------------------------------- : 50 60 70 80 90 100 110 gi|115 RRAVWEKNMKMIELHNQEYREGKHSFTMAMNAFGDMTSEEFRQ-VMN-GFQNRKPR---KGKVFQE---PLFYEAPRSVD :.: .:. :.:..: : : : :. ::.. D48435 DVNAAQEIPLEAQTLSGEPLVAYLRKNQNLFEVNSTPTPGFKQKIMDIKFRNQNPNLIVKDDPEPEDDIPEEYD-PRKI- 20 30 40 50 60 70 80 90 120 130 140 150 160 170 180 190 gi|115 WREKGYVTPVKNQGQCGSCWAFSATGALEGQMF--RKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNG--- : . ...:..:::::: :...:. .. :. . ...: .:: : : . ::.:: :..: : D48435 WSNCTSFY-IRDQANCGSCWAVSTAAAISDRICIATKARKQVNISATDLVTCCTPTCGFGCDGGWSIKAWEYFTYAGLVS 100 110 120 130 140 150 160 170 200 210 220 230 240 gi|115 ------------------GLDSEESY----PYEATEESCK------YNPKYSVANDTGF--VDIPKQEKALMKAVATVGP : ....: : ::. ::: : : . . : ..::. .:..: . :: D48435 GGEYRSKRCCRPYPIHPCGHHGNDTYYGECPEEASTPSCKKKCQPGYRKLYRMDKRYGTDAFQLPKSVEAIQKELLKNGP 180 190 200 210 220 230 240 250 250 260 270 280 290 300 310 320 gi|115 ISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGI ..... : .:.: .:: ::: . . . :.: ..:.: :. ..: :::. ::: ..:: .:: .. . 
: ::: D48435 VTASF-AVYEDFSLYKSGIYRHTAGELRGY-HAVKMIGWGTEN-RTD---YWLIANSWHDDWGENGYFRIIRGI-NDCGI 260 270 280 290 300 310 320 330 gi|115 ASAASYPTV D48435 EENVAAGLIDVESL 330 340 --------------------------------------------------------------------------- >>KHMSB cathepsin B (EC 3.4.22.1) precursor - mouse (339 aa) initn: 203 init1: 82 opt: 210 Z-score: 246.8 expect() 6.5e-07 Smith-Waterman score: 339; 26.129% identity in 310 aa overlap Entrez lookup Re-search database >KHMSB 56- 326: ---------------------------------------------------------: 20 30 40 50 60 70 80 90 gi|115 SATLTFDHSLEAQWTKWKAMHNRLYGMNEEGWRRAVWEKNMKMIELHNQEYREGKHSFTMAMNAFGDMTSEEFRQVMNGF ...:. .: .. :.. ... .. . . . :..: KHMSB MWWSLILLSCLLALTSAHDKPSFHPLSDDLINYINKQNTTWQAGRNFYNVDISYLKKLCGT----VLGG- 10 20 30 40 50 60 100 110 120 130 140 150 160 gi|115 QNRKPRKGKV-FQEPLFYEAPRSVDWREKGYVTP----VKNQGQCGSCWAFSATGALEGQMFRKT-GRL-ISLSEQNLVD . : :.: : : . . :.. : ::. : ...::.:::::::.:. :. . .: ::. . .: ..:. KHMSB -PKLP--GRVAFGEDI--DLPETFDAREQWSNCPTIGQIRDQGSCGSCWAFGAVEAISDRTCIHTNGRVNVEVSAEDLLT 70 80 90 100 110 120 130 140 170 180 190 200 210 gi|115 CSGPQGNEGCNGGLMDYAFQYVQD----NGGLDSEE--SYPYEAT----------------------EESCK--YNPKYS : : : ..::::: . :... .::. . . :: ..::. :.:.:. KHMSB CCGIQCGDGCNGGYPSGAWSFWTKKGLVSGGVYNSHVGCLPYTIPPCEHHVNGSRPPCTGEGDTPRCNKSCEAGYSPSYK 150 160 170 180 190 200 210 220 220 230 240 250 260 270 280 290 gi|115 VANDTGFV--DIPKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYW . :.. .. .. : .: . ::. :. . .:: :: :.: . . .. :.. ..:.: : .. :: KHMSB EDKHFGYTSYSVSNSVKEIMAEIYKNGPVEGAFTV-FSDFLTYKSGVY-KHEAGDMMGGHAIRILGWGVE----NGVPYW 230 240 250 260 270 280 290 300 310 320 330 gi|115 LVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV :. :::. .:: .:. :. . . 
::::: : KHMSB LAANSWNLDWGDNGFFKILRGE-NHCGIESEIVAGIPRTDQYWGRF 300 310 320 330 --------------------------------------------------------------------------- >>KHBOB cathepsin B (EC 3.4.22.1) precursor - bovine (335 aa) initn: 206 init1: 82 opt: 209 Z-score: 245.8 expect() 7.4e-07 Smith-Waterman score: 360; 29.457% identity in 258 aa overlap Entrez lookup Re-search database >KHBOB 115- 331: ---------------------------------------------: 80 90 100 110 120 130 140 150 gi|115 MAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEAPRSVDWREKGYVTP----VKNQGQCGSCWAFSATGALEGQ :.: : ::. : ...::.:::::::.:. :. . KHBOB WKAGHNFYNVDLSYVKKLCGAILGGPKLPQRDAFAADVVLPESFDAREQWPNCPTIKEIRDQGSCGSCWAFGAVEAISDR 50 60 70 80 90 100 110 120 160 170 180 190 200 gi|115 M-FRKTGRL-ISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQD----NGGLDSEE--SYPY------------------ . ....::. . .: .... : : . ..:::::. . :... .::: . . :: KHBOB ICIHSNGRVNVEVSAEDMLTCCGGECGDGCNGGFPSGAWNFWTKKGLVSGGLYNSHVGCRPYSIPPCEHHVNGSRPPCTG 130 140 150 160 170 180 190 200 210 220 230 240 250 260 270 gi|115 EATEESCK------YNPKYSVANDTGF--VDIPKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMD- :. .:. :.:.:. . : .. ..:: .: . ::. :... . .::.:: :.: :.: : KHBOB EGDTPKCSKTCEPGYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGPVEGAFSV-YSDFLLYKSGVY--QHVSGEIMGG 210 220 230 240 250 260 270 280 290 300 310 320 330 gi|115 HGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIAS--AASYPTV :.. ..:.: : ... :::: :::. .:: .:. :. . . .:::: : .:..: KHBOB HAIRILGWGVE----NGTPYWLVGNSWNTDWGDNGFFKILRGQ-DHCGIESEIVAGMPCTHQY 280 290 300 310 320 330 --------------------------------------------------------------------------- >>A61061 actinidain (EC 3.4.22.14) - kiwi fruit (cv. 
Hayward) (fragments) (110 aa) initn: 241 init1: 136 opt: 200 Z-score: 242.7 expect() 1.1e-06 Smith-Waterman score: 212; 33.103% identity in 145 aa overlap Entrez lookup Re-search database >A61061 155- 299: -------------------------------: 120 130 140 150 160 170 180 190 gi|115 PRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNG :: :::::::.:.::. .::.:: . .::.. ..: A61061 SAGAVVDIKIVTGVLISLSEQELIDCG-----RGCDGGYITDGFQFIINDG 10 20 30 40 200 210 220 230 240 250 260 270 gi|115 GLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDM :...::.::: : . .: :. : : : :. ::. : :.. . A61061 GINTEENYPYTAQDGDC---------------DVALQ------------------DQKH-----YSSGIFTGP-CGTA-I 50 60 70 80 280 290 300 310 320 330 gi|115 DHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV ::.: .:::: :. . ::.:: A61061 DHAVTIVGYGTEG----GIDYWIVKYNN 90 100 110 --------------------------------------------------------------------------- >>C48435 cysteine proteinase AC-4 - nematode (Haemonchus contortus) (342 aa) initn: 180 init1: 101 opt: 194 Z-score: 228.5 expect() 6.8e-06 Smith-Waterman score: 307; 27.500% identity in 280 aa overlap Entrez lookup Re-search database >C48435 85- 324: -------------------------------------------------- : 50 60 70 80 90 100 110 120 gi|115 EGWRRAVWEKNMKMIELHNQEYREGKHSFTMAMNAFGDMTSEEFRQ-VMN-GFQNRKPRKGKVFQEPLFYE-APRSVDWR . .:.: .:. :.:.: . : ..: : :. : : C48435 SGASINAAQEIPLEAQTLTGEPLVAYLRKNQNLFEVNSEPTPNFEQKIMDIKFKNQK-LNFVVKNDPEPNEDIPEEYDPR 20 30 40 50 60 70 80 90 130 140 150 160 170 180 190 gi|115 EKGYVTP--VKNQGQCGSCWAFSATGALEGQMFRKTG--RLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNG--- :: . ...:..:::::: :...:. .. :. . ...: ... : .:: . ::.:: :..: .: C48435 EKFKCSTFYIRDQANCGSCWAVSTAAAISDRICIATNGEKQVNISSTDILTCCNPQCGFGCGGGWSIRAWEYFVYEGVVS 100 110 120 130 140 150 160 170 200 210 220 230 240 gi|115 ------------------GLDSEESY----PYEATEESCK------YNPKYSVANDTGFV--DIPKQEKALMKAVATVGP : ....: : ::. :: :. . . . : : . .:.:... . 
:: C48435 GGEYLTKGVCRPYPIHPCGHHGNDTYYGECPREAATPPCKKKCQPGYKKIFRMDKRQGKVAYGVEPKEEAIQREILRHGP 180 190 200 210 220 230 240 250 250 260 270 280 290 300 310 320 gi|115 ISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGI . :: : .:.: .:: :.: . . . . :.: ..:.: .: . . ::::. ::: ..:: .:: .. . : : : C48435 V-VASFAVYEDFSLYKTGVYKHTAGALRGY-HAVKMMGWGVDS--KTKAKYWLIANSWHNDWGENGYFRFIRGI-NDCEI 260 270 280 290 300 310 320 330 gi|115 ASAASYPTV C48435 EDTVAAGIVDVDSL 330 340 --------------------------------------------------------------------------- >>A48454 cathepsin B-like cysteine proteinase (EC 3.4.-.-) - nematode (Ostert (342 aa) initn: 151 init1: 81 opt: 188 Z-score: 221.6 expect() 1.6e-05 Smith-Waterman score: 300; 28.685% identity in 251 aa overlap Entrez lookup Re-search database >A48454 109- 323: --------------------------------------------- : 70 80 90 100 110 120 130 140 gi|115 GKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALE : :. :: ..: . . . . .:..:::::: :...:. A48454 TATPVPYFKQRLMDLKYIDQNNIPDEEVEDEELEENNDDIPESYD-PR-IQWANCSSLFHIPDQANCGSCWAVSSAAAMS 60 70 80 90 100 110 120 150 160 170 180 190 200 gi|115 GQMF--RKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNG---GLDSEES---YPYE--------------- .. : .. . .: :..:.: :. ::.:: ::.. :.: : : . . ::: A48454 DRICIASKGAKQVLISAQDVVSCCTWCGD-GCEGGWPISAFRFHADEGVVTGGDYNTKGSCRPYEIHPCGHHGNETYYGE 130 140 150 160 170 180 190 200 210 220 230 240 250 260 270 gi|115 ----ATEESCKYN-----PKYSVANDTGF----VDIPKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSE : :: :: : .: . .. .. ::..: . ::. :: . .:.: :. ::: . . A48454 CVGMADTPRCKRRCLLGYPK-SYPSDRYYGKKAYQLKNSVKAIQKDIMKNGPV-VATYTVYEDFAHYRSGIYKHKAGRKT 210 220 230 240 250 260 270 280 280 290 300 310 320 330 gi|115 DMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV . :.: :.:.: : .. ::.: ::: ..:: .:. .: . : ::. 
A48454 GL-HAVKVIGWG----EEKGTPYWIVANSWHDDWGENGFFRMHRGS-NDCGFEERMAAGSVQ 290 300 310 320 330 340 --------------------------------------------------------------------------- >>A57480 tubulointerstitial nephritis antigen precursor - rabbit (474 aa) initn: 117 init1: 72 opt: 189 Z-score: 220.6 expect() 1.9e-05 Smith-Waterman score: 282; 26.033% identity in 242 aa overlap Entrez lookup Re-search database >A57480 109- 316: ------------------------------------------- : 70 80 90 100 110 120 130 140 gi|115 GKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFS-ATGAL : :. : .. : .. .. :: :.. :::: :. : A57480 QFWGMTLEEGFRFRLGTLPPSPVLLSMNEMRATLPETTDLPEFFIAFLQMAWMDS-WAIGSKN---CAASWAFSTASVAA 180 190 200 210 220 230 240 250 150 160 170 180 190 200 210 gi|115 EGQMFRKTGRLIS-LSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPY----EATEESCKYNPKYS---- . ....:: . :: :::..: . .. .:::.: .: :. :.. : : :. :: . ....: .. : . A57480 DRIAIQSNGRYTANLSPQNLISCCA-KNRHGCNSGSIDRAWWYLRKRG-LVSHACYPLFKDQNISNNTCAMTSKADGRGK 260 270 280 290 300 310 320 220 230 240 250 260 270 gi|115 ----------------VANDTGFVDIPKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMD------- . . . . ..: .:: . ::... ... ::.:. :: ::: . ..:. . A57480 RHATRPCPNNIEKSNRIYQCSPPYRVSSNETEIMKEIMQNGPVQAIMQV-HEDFFHYKTGIYRHVISTNEESEKYRKLQT 330 340 350 360 370 380 390 400 280 290 300 310 320 330 gi|115 HGVLVVGYG-FESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV :.: ..:.: .........:.:.. ::::. :: .:: .. . 
A57480 HAVKLTGWGTLKGARGQKEKFWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAAWGQLTSSDEP 410 420 430 440 450 460 470 --------------------------------------------------------------------------- >>B48454 cathepsin B-like cysteine proteinase (EC 3.4.-.-) CP-3 - nematode (O (174 aa) initn: 142 init1: 83 opt: 183 Z-score: 220.3 expect() 1.9e-05 Smith-Waterman score: 184; 26.178% identity in 191 aa overlap Entrez lookup Re-search database >B48454 139- 324: --------------------------------------- : 100 110 120 130 140 150 160 170 gi|115 KPRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWA-FSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEG : :. :.. : .:: : .. : : .:.: B48454 AWQYFALEGVVTGGNYRKQG---CCRPYEFPPC-GRHGKEP 10 20 30 180 190 200 210 220 230 240 250 gi|115 CNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGF----VDIPKQEKALMKAVATVGPISVAIDAGH : .: : . : ...:. . . .: : .:.. ::... . ::. :: . B48454 YYGECYDTA--------------KTP--KCQKTCQRGYLKAYKEDKHFGKSAYRLPNNVKAIQRDIMKNGPV-VAGFIVY 40 50 60 70 80 90 100 260 270 280 290 300 310 320 330 gi|115 ESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV :.: :: ::: . . :.: ..:.: :. .. :::. ::: ..:: :. .: . :.: : B48454 EDFAHYKSGIY-KHTAGRMTGGHAVKIIGWGKEK----GTPYWLIANSWHDDWGEKGFYRMIRGI-NNCRIEEMVFAGIV 110 120 130 140 150 160 170 --------------------------------------------------------------------------- >>S58770 cathepsin B (EC 3.4.22.1) precursor - chicken (340 aa) initn: 176 init1: 82 opt: 186 Z-score: 219.4 expect() 2.2e-05 Smith-Waterman score: 363; 30.041% identity in 243 aa overlap Entrez lookup Re-search database >S58770 120- 326: --------------------------------------------: 80 90 100 110 120 130 140 150 gi|115 FGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLI : . .. ...::.:::::::.:. :. .. .:. . 
S58770 TDMSYVKKLCGTFLGGPKAPERVDFAEDMDLPDTFDTRKQWPNCPTISEIRDQGSCGSCWAFGAVEAISDRICVHTNAKV 50 60 70 80 90 100 110 120 160 170 180 190 200 210 gi|115 S--LSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQD----NGGL-DSE---ESY---PYE----ATEESCK---------- : .: ..:..: : . . ::::: . :..: . .::: ::. ..: : : ... : S58770 SVEVSAEDLLSCCGFECGMGCNGGYPSGAWRYWTERGLVSGGLYDSHVGCRAYTIPPCEHHVNGSRPPCTGEGGETPRCS 130 140 150 160 170 180 190 200 220 230 240 250 260 270 280 gi|115 ------YNPKYSVANDTGFVD--IPKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMD-HGVLVVGY :.:.:. . :... .:..:: .: . ::. :. . .:.::.:: :.: :.:.. :.. ..:. S58770 RHCEPGYSPSYKEDKHYGITSYGVPRSEKEIMAEIYKNGPVEGAFIV-YEDFLMYKSGVY--QHVSGEQVGGHAIRILGW 210 220 230 240 250 260 270 280 290 300 310 320 330 gi|115 GFESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV : : ... :::. :::. .::. :. :. . . .:::: : S58770 GVE----NGTPYWLAANSWNTDWGITGFFKILRGE-DHCGIESEIVAGVPRMEQYWTRV 290 300 310 320 330 340 --------------------------------------------------------------------------- >>B48435 cysteine proteinase AC-5 - nematode (Haemonchus contortus) (348 aa) initn: 128 init1: 83 opt: 186 Z-score: 219.2 expect() 2.2e-05 Smith-Waterman score: 279; 25.296% identity in 253 aa overlap Entrez lookup Re-search database >B48435 109- 324: --------------------------------------------- : 70 80 90 100 110 120 130 140 gi|115 GKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALE : :. :: : :.. . ...:..:::::: :...:. B48435 EVRTTPTPGFKYKLMDKAFANANQNLNPVVNDDNDTGADLPENYD-PRIV-WKNCSSFHTIRDQANCGSCWAVSTAAAIS 50 60 70 80 90 100 110 120 150 160 170 180 190 200 gi|115 GQMF--RKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNG--------GLDSEESYPYE------------- .. : . . :. ... : : . . :: :: :... . .: : :: . B48435 DRICIATKGKKQVYASDTDILTCCGARCGLGCRGGWPIEAWKFFEYDGVVSGGPYLGKGCCSPYPLHPCGRHGNDTFYGN 130 140 150 160 170 180 190 200 210 220 230 240 250 260 270 gi|115 ----ATEESCK------YNPKYSVANDTG----FVDIPKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSS : :: . : : . : .:..: . . . : . ::. : .:.: :. 
::: . B48435 CVGMAPTPPCKRKCQPGFRGMYRVDKRYGEPGRTYTLPRSEVKIRRDIKERGSV-VAVFAVYEDFSHYQSGIYKHTAGRF 210 220 230 240 250 260 270 280 280 290 300 310 320 330 gi|115 EDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV :.: ..:.: ..... :::. ::: ..:: .:. .: . :.::: B48435 TGGYHAVKMIGWG----KDNGTDYWLIANSWHDDWGENGFFRMIRGI-NNCGIEEQVDAGIVDVESL 290 300 310 320 330 340 --------------------------------------------------------------------------- >>A44965 cysteine proteinase (EC 3.4.22.-) AC-2 precursor - nematode (Haemonc (342 aa) initn: 204 init1: 85 opt: 185 Z-score: 218.2 expect() 2.5e-05 Smith-Waterman score: 293; 29.084% identity in 251 aa overlap Entrez lookup Re-search database >A44965 109- 324: --------------------------------------------- : 70 80 90 100 110 120 130 140 gi|115 GKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALE : :. ::.: :.. ...:..:::::: :...:. A44965 LFEVNSDPTPDFEQKIMSIKYKHQKLNLMVKEDPDPEVDIPPSYD-PRDV-WKNCTTFY-IRDQANCGSCWAVSTAAAIS 50 60 70 80 90 100 110 120 150 160 170 180 190 200 gi|115 GQMF--RKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQY-VQD---NGG--LDSEESYPY---------------- .. :. . ...: ... : :: ..::.:: :..: . : .:: : .. :: A44965 DRICIASKAEKQVNISATDIMTCCRPQCGDGCEGGWPIEAWKYFIYDGVVSGGEYLTKDVCRPYPIHPCGHHGNDTYYGE 130 140 150 160 170 180 190 200 210 220 230 240 250 260 270 gi|115 ---EATEESCKYNPK------YSVANDTGF-VDIPKQE-KALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSED : :: . . : . . : . : :: ::... . ::. :: : .:.: :: ::: . . A44965 CRGTAPTPPCKRKCRPGVRKMYRIDKRYGKDAYIVKQSVKAIQSEILKNGPV-VASFAVYEDFRHYKSGIYKHTAGELRG 210 220 230 240 250 260 270 280 280 290 300 310 320 330 gi|115 MDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV . :.: ..:.: : .:. .::. ::: ..:: :: .... 
: ::: A44965 Y-HAVKMIGWGNE----NNTDFWLIANSWHNDWGEKGYFRIVRGS-NDCGIEGTIAAGIVDTESL 290 300 310 320 330 340 --------------------------------------------------------------------------- >>A45524 cysteine proteinase (EC 3.4.22.-) AC-1 precursor - nematode (Haemonc (342 aa) initn: 204 init1: 85 opt: 184 Z-score: 217.0 expect() 3e-05 Smith-Waterman score: 292; 29.084% identity in 251 aa overlap Entrez lookup Re-search database >A45524 109- 324: --------------------------------------------- : 70 80 90 100 110 120 130 140 gi|115 GKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALE : :. ::.: :.. ...:..:::::: :...:. A45524 LFEVNSAPTPNFEQKIMDIKYKHQKLNLMVKEDPDPEVDIPPSYD-PRDV-WKNCTTFY-IRDQANCGSCWAVSTAAAIS 50 60 70 80 90 100 110 120 150 160 170 180 190 200 gi|115 GQMF--RKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQY-VQD---NGG--LDSEESYPY---------------- .. :. . ...: ... : :: ..::.:: :..: . : .:: : .. :: A45524 DRICIASKAEKQVNISATDIMTCCRPQCGDGCEGGWPIEAWKYFIYDGVVSGGEYLTKDVCRPYPIHPCGHHGNDTYYGE 130 140 150 160 170 180 190 200 210 220 230 240 250 260 270 gi|115 ---EATEESCKYNPK------YSVANDTGF-VDIPKQE-KALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSED : :: . . : . . : . : :: ::... . ::. :: : .:.: :: ::: . . A45524 CRGTAPTPPCKRKCRPGVRKMYRIDKRYGKDAYIVKQSVKAIQSEILRNGPV-VASFAVYEDFRHYKSGIYKHTAGELRG 210 220 230 240 250 260 270 280 280 290 300 310 320 330 gi|115 MDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV . :.: ..:.: : .:. .::. ::: ..:: :: .. . 
: ::: A45524 Y-HAVKMIGWGNE----NNTDFWLIANSWHNDWGEKGYFRIIRGT-NDCGIEGTIAAGIVDTESL 290 300 310 320 330 340 --------------------------------------------------------------------------- >>A54505 serine-repeat antigen precursor - Plasmodium falciparum (strain FCR3 (989 aa) initn: 151 init1: 113 opt: 188 Z-score: 214.7 expect() 4e-05 Smith-Waterman score: 260; 26.906% identity in 223 aa overlap Entrez lookup Re-search database >A54505 129- 322: ----------------------------------------- : 90 100 110 120 130 140 150 160 gi|115 RQVMNGFQNRKPRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVD :..::.: . : :.. :: : . ..: ... A54505 VDTTLEKEDTLSYDNSDNMFCNKEYCNRLKDENNCISNLQVEDQGNCDTSWIFASKYHLETIRCMKGYEPTKISALYVAN 540 550 560 570 580 590 600 610 170 180 190 200 210 220 230 gi|115 CSGPQGNEGCNGGLMDYAF-QYVQDNGGLDSEESYPYEATE--ESC-KYNPKYSVANDTGFVDIPKQEK----------- : . .. :. : . : : ..: : : .: .:::. .. :.: : . .. :.: . :.: A54505 CYKGEHKDRCDEGSSPMEFLQIIEDYGFLPAESNYPYNYVKVGEQCPKVEDHWMNLWDNGKILHNKNEPNSLDGKGYTAY 620 630 640 650 660 670 680 690 240 250 260 270 280 290 gi|115 ----------ALMKAVAT--VGPISVAIDAGHESFLFYK-EGIYFEPDCSSEDMDHGVLVVGYG-FESTESDNNKYWLVK :..: . : .. :: :. . :. : . :... ::.: .:::: . ..:.....::.:. A54505 ESERFHDNMDAFVKIIKTEVMNKGSVIAYIKAENVMGYEFSGKKVQNLCGDDTADHAVNIVGYGNYVNSEGEKKSYWIVR 700 710 720 730 740 750 760 770 300 310 320 330 gi|115 NSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV :::: :: :: :. .:: A54505 NSWGPYWGDEGYFKVDMYGPTHCHFNFIHSVVIFNVDLPMNNKTTKKESKIYDYYLKASPEFYHNLYFKNFNVGKKNLFS 780 790 800 810 820 830 840 850 --------------------------------------------------------------------------- >>S35580 proteinase IV - mountain papaya (fragment) (43 aa) initn: 151 init1: 151 opt: 170 Z-score: 214.6 expect() 4e-05 Smith-Waterman score: 170; 59.524% identity in 42 aa overlap Entrez lookup Re-search database >S35580 115- 156: ---------: 80 90 100 110 120 130 140 150 gi|115 MAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRK :.:.:::.:: ::::::::. 
:: ::::. ..:: . S35580 YPESIDWRKKGAVTPVKNQGSXGSXWAFSTIVTVEGINKIR 10 20 30 40 160 170 180 190 200 210 220 230 gi|115 TGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPKQEKA :: S35580 TG --------------------------------------------------------------------------- >>S35577 cysteine proteinase I - mountain papaya (fragment) (43 aa) initn: 152 init1: 152 opt: 161 Z-score: 204.3 expect() 0.00015 Smith-Waterman score: 161; 63.636% identity in 33 aa overlap Entrez lookup Re-search database >S35577 117- 149: ------- : 80 90 100 110 120 130 140 150 gi|115 MNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTG :.:::.:: ::::.:::. :: :.::...:.:: S35577 IVASIDWRQKGAVTPVRNQGSXGSXWTFSSVAAVEGIIKIRGT 10 20 30 40 160 170 180 190 200 210 220 230 gi|115 RLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPKQEKALM --------------------------------------------------------------------------- >>S35578 cysteine proteinase II - mountain papaya (fragment) (43 aa) initn: 150 init1: 150 opt: 157 Z-score: 199.7 expect() 0.00027 Smith-Waterman score: 157; 62.857% identity in 35 aa overlap Entrez lookup Re-search database >S35578 115- 149: ------- : 80 90 100 110 120 130 140 150 gi|115 MAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRK : :::::.:: :::::.:. 
:: ::::.....:: S35578 YPGSVDWRQKGAVTPVKDQNPXGSXWAFSTVATVEGINKIV 10 20 30 40 160 170 180 190 200 210 220 230 gi|115 TGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPKQEKA S35578 TG --------------------------------------------------------------------------- >>S32561 cysteine proteinase - Plasmodium vinckei (506 aa) initn: 353 init1: 114 opt: 158 Z-score: 184.7 expect() 0.0019 Smith-Waterman score: 515; 31.653% identity in 357 aa overlap Entrez lookup Re-search database >S32561 27- 331: ----------------------------------------------------------------: 10 20 30 40 50 60 gi|115 MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLY-GMNEEGWRRAVWEKNMKMIELHNQE ... :. .:. : .:.:. : .:.:. ...:. S32561 KNNANKVNTFDVNNESNKNIDPTYIFRQKLESMQDNIKYASKFFKYMKENNKKYENMDEQLQRF----ENFKIRYMKTQK 120 130 140 150 160 170 180 190 70 80 90 100 110 120 gi|115 YRE--GKHSFTMA--MNAFGDMTSEEF----RQVMNGFQNRK-----PRKGKVFQEPLFY------EAPRSVDWREKGYV . : ::...:.. .: ..:...::: ..... .. : : : .. . :. . : : :.: : S32561 HNEMVGKNGLTYVQKVNQYSDFSKEEFDNYFKKLLSVPMDLKSKYIVPLKKHLANTNLISVDNKSKDFPDSRDYRSKFNF 200 210 220 230 240 250 260 270 130 140 150 160 170 180 190 200 gi|115 TPVKNQGQCGSCWAFSATGALEGQMFRKTGRL-ISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYE : :.::.:::::::.: : .: . . .. ::.:::..:::: : ::.:: ::: :. .:: ..: :::. S32561 LPPKDQGNCGSCWAFAAIGNFEYLYVHTRHEMPISFSEQQMVDCSTE--NYGCDGGNPFYAFLYMINNGVCLGDE-YPYK 280 290 300 310 320 330 340 350 210 220 230 240 250 260 270 280 gi|115 ATEESCKYNPKYSVANDTGFV-DIPKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYG . :. : . :. . . :. :. .: :. :. :::...:. :. :.:..:. :. :. .:. : ..:.::.:::: S32561 GHEDFFCLNYRCSLLGRVHFIGDVKPNE--LIMALNYVGPVTIAVGAS-EDFVLYSGGV-FDGECNPE-LNHSVLLVGYG 360 370 380 390 400 410 420 290 300 310 320 330 gi|115 -------FESTES--DNN------------------KYWLVKNSWGEEWGMGGYVKMAKDRRN---HCGIASAASYPTV ::...: :.: ::.:.:::: .:: :::... ... . ::..: . 
.: S32561 QVKKSLAFEDSHSNVDSNLIKKYKENIKGDDDDDIIYYWIVRNSWGPNWGEGGYIRIKRNKAGDDGFCGVGSDVFFPIY 430 440 450 460 470 480 490 500 --------------------------------------------------------------------------- >>A29172 cathepsin B-like cysteine proteinase (EC 3.4.22.-) - bovine (fragmen (73 aa) initn: 108 init1: 108 opt: 136 Z-score: 172.2 expect() 0.0092 Smith-Waterman score: 136; 43.902% identity in 41 aa overlap Entrez lookup Re-search database >A29172 274- 314: --------- : 240 250 260 270 280 290 300 310 gi|115 ALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVK ..: : :.:.: ::. .::.:.::::: :: :... A29172 SEYNDQAFINHIVSVAGWGV----SDGMEYWIVRNSWGEPWGEHGWMR 10 20 30 40 320 330 gi|115 MAKDRRNHCGIASAASYPTV .. A29172 IVTSTYKGGEGARYNLAIEESCTFGDPIV 50 60 70 --------------------------------------------------------------------------- >>F32946 cysteine proteinase (EC 3.4.22.-) - Caenorhabditis elegans (fragment (53 aa) initn: 130 init1: 94 opt: 132 Z-score: 169.7 expect() 0.013 Smith-Waterman score: 132; 40.385% identity in 52 aa overlap Entrez lookup Re-search database >F32946 132- 181: ----------: 100 110 120 130 140 150 160 gi|115 MNGFQNRKPRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGAL-EGQMFRKTG-RLISLSEQNLVDC :::::::::::.. .. .: . ..: . . .:. : F32946 QGQCGSCWAFSTAEVISDGTCMASNGTQQPIICPTDLLTC 10 20 30 40 170 180 190 200 210 220 230 240 gi|115 SGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPKQEKALMKAVATVGPISVAI .:::::: F32946 CWNVCGEGCNGGY 50 --------------------------------------------------------------------------- >>S16162 cruzipain (EC 3.4.22.-) - Trypanosoma cruzi (fragment) (173 aa) initn: 117 init1: 117 opt: 133 Z-score: 163.2 expect() 0.03 Smith-Waterman score: 133; 43.590% identity in 39 aa overlap Entrez lookup Re-search database >S16162 295- 333: -------: 260 270 280 290 300 310 320 330 gi|115 SFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTV ::..:::: .:: ::...:: :.: . 
:: .: S16162 AAVPYWIIKNSWTAQWGEDGYIRIAKGS-NQCLVKEEASSAVVG 10 20 30 40 S16162 GPGPTPEPTTTTTTSAPGPSPSYFVQMSCTDAACIVGCENVTLPTGQCLLTTSGVSAIVTCGAETLTEEVFFTSTHCSGP 50 60 70 80 90 100 110 120 --------------------------------------------------------------------------- >>A30043 trophoblast-specific protein precursor - mouse (124 aa) initn: 156 init1: 88 opt: 124 Z-score: 155.0 expect() 0.084 Smith-Waterman score: 184; 29.496% identity in 139 aa overlap Entrez lookup Re-search database >A30043 1- 134:----------------------------: 10 20 30 40 50 60 70 80 gi|115 MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGMNEEGWRRAVWEKNMKMIELHNQEYREGKHSFTMAMNAF :.::..:. .:::.:::... . .:.:. . : ..: .::: : :: .::..: . .. .. :.: A30043 MTPTIFLVILCLGVASAVIVPEAQLDAELQEQK---------DKEVLIKAVWSKFMKTNKLHSSENDQETEGSNIEMSAS 10 20 30 40 50 60 70 90 100 110 120 130 140 150 gi|115 GDMTSEEFRQVMNG-----FQNRKPRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKT :..:.::. ..:. :.... . : ..: : . .: : :. .: :: : A30043 GQLTDEELMKIMTTVLHPMFEEEENKPQPVVDDPEFEDYTESGD----GFFVP--NQPQ 80 90 100 110 120 160 170 180 190 200 210 220 230 gi|115 GRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPKQEKAL --------------------------------------------------------------------------- >>S23941 dipeptidyl-peptidase I (EC 3.4.14.1) - human (fragments) (119 aa) initn: 130 init1: 104 opt: 115 Z-score: 145.0 expect() 0.3 Smith-Waterman score: 145; 30.147% identity in 136 aa overlap Entrez lookup Re-search database >S23941 125- 259: ----------------------------: 90 100 110 120 130 140 150 160 gi|115 SEEFRQVMNGFQNRKPRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQ .:.::.::..::::..:.. : ::... :... :. S23941 LPTSDVRNVHGINFVSPVRNQASCGSCYSFASMGMLEARI-----RILTNSQT 10 20 30 40 170 180 190 200 210 220 230 240 gi|115 NLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPKQEKALMK-AVATVG ... :: . .:.:: : : : :.:: . :. .: .. :: ...:::: .. 
: S23941 PILS---PQ-----------EVVSYAQDFG-LVEEASFPY-----TDYYSSEYHYVG--GFYG--GMNEALMKLELVRHG 50 60 70 80 90 100 250 260 270 280 290 300 310 320 gi|115 PISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCG :..::.. .. :: : S23941 PMAVAFEYVYD-FLHY 110 --------------------------------------------------------------------------- >>S15845 cathepsin L (EC 3.4.22.15) - bovine (fragments) (38 aa) initn: 178 init1: 100 opt: 107 Z-score: 143.3 expect() 0.38 Smith-Waterman score: 107; 83.333% identity in 18 aa overlap Entrez lookup Re-search database >S15845 115- 132: ---- : 80 90 100 110 120 130 140 150 gi|115 MAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRK : :::: .::::::::::. S15845 VPWSVDWTKKGYVTPVKNQNNKFWIVKNSXGGEXGXGG 10 20 30 160 170 180 190 200 210 220 230 gi|115 TGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPKQEKA --------------------------------------------------------------------------- >>A31657 major fecal allergen Der p 1 - house-dust mite (Dermatophagoides pte (92 aa) initn: 101 init1: 101 opt: 109 Z-score: 139.8 expect() 0.59 Smith-Waterman score: 109; 45.714% identity in 35 aa overlap Entrez lookup Re-search database >A31657 114- 148: ------- : 80 90 100 110 120 130 140 150 gi|115 TMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFR :: .: :. :::.. 
:: ::: :::.....: A31657 TNACSINGNAPAEIDLRQMRTVTPIRMQGGCGSXXAFSGVAGIEYIQHN 10 20 30 40 160 170 180 190 200 210 220 230 gi|115 KTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPKQEK A31657 GVVQESYYRFGISNYCQIYPPNANKDNGYQPNYXAVNIVGYXN 50 60 70 80 90 --------------------------------------------------------------------------- >>S46204 ananain (EC 3.4.22.31) - pineapple (fragment) (20 aa) initn: 80 init1: 80 opt: 100 Z-score: 139.5 expect() 0.61 Smith-Waterman score: 100; 68.421% identity in 19 aa overlap Entrez lookup Re-search database >S46204 115- 133: ----: 80 90 100 110 120 130 140 150 gi|115 MAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRK :.:.:::..: :: ::::: S46204 VPQSIDWRDSGAVTSVKNQG 10 20 160 170 180 190 200 210 220 230 gi|115 TGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPKQEKA --------------------------------------------------------------------------- >>S14329 thaumatopain - miracle fruit (fragment) (35 aa) initn: 92 init1: 92 opt: 102 Z-score: 138.1 expect() 0.73 Smith-Waterman score: 102; 58.621% identity in 29 aa overlap Entrez lookup Re-search database >S14329 115- 143: ------: 80 90 100 110 120 130 140 150 gi|115 MAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRK : :::: .:: :. :::: :: :::. S14329 NLPNSVDWWKKGAVAAVKNQRXXGSXXAFSSIKTS 10 20 30 160 170 180 190 200 210 220 230 gi|115 TGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPKQEKA --------------------------------------------------------------------------- >>S46205 comosain - pineapple (fragment) (20 aa) initn: 76 init1: 76 opt: 96 Z-score: 134.9 expect() 1.1 Smith-Waterman score: 96; 68.421% identity in 19 aa overlap Entrez lookup Re-search database >S46205 115- 133: ----: 80 90 100 110 120 130 140 150 gi|115 MAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRK :.:.:::. 
: :: ::::: S46205 VPQSIDWRNYGAVTSVKNQG 10 20 160 170 180 190 200 210 220 230 gi|115 TGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPKQEKA --------------------------------------------------------------------------- >>S39367 proteinase omega - papaya (fragments) (37 aa) initn: 86 init1: 86 opt: 97 Z-score: 132.1 expect() 1.6 Smith-Waterman score: 97; 68.421% identity in 19 aa overlap Entrez lookup Re-search database >S39367 115- 133: ---- : 80 90 100 110 120 130 140 150 gi|115 MAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRK :..::::.:: :::: :: S39367 LPENVDWRKKGAVTPVXXQGYGKSGGKGYILIKNSSG 10 20 30 160 170 180 190 200 210 220 230 gi|115 TGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPKQEKA --------------------------------------------------------------------------- >>S03380 major fecal allergen Der p I - house-dust mite (Dermatophagoides pte (94 aa) initn: 82 init1: 52 opt: 95 Z-score: 123.7 expect() 4.7 Smith-Waterman score: 95; 43.243% identity in 37 aa overlap Entrez lookup Re-search database >S03380 114- 148: ------- : 80 90 100 110 120 130 140 150 gi|115 TMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEAPRSVDWREKGYVTPVKN--QGQCGSCWAFSATGALEGQM :: .: :. :::.. 
:: ::: :::.....: S03380 TNACSINGNAPAEIDLRQMRTVTPIRMQMQGGCGSXXAFSGVAGIEYIQ 10 20 30 40 160 170 180 190 200 210 220 230 gi|115 FRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPKQ S03380 HNGVVQESYYRFGISNYCQIYPPNANKDNGYQPNYXAVNIVGYXN 50 60 70 80 90 --------------------------------------------------------------------------- >>A35417 28K serine proteinase homolog - bovine (fragment) (15 aa) initn: 82 init1: 82 opt: 82 Z-score: 120.8 expect() 6.8 Smith-Waterman score: 82; 73.333% identity in 15 aa overlap Entrez lookup Re-search database >A35417 114- 128: ---: 80 90 100 110 120 130 140 150 gi|115 TMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFR :: :.:.:.:::::: A35417 APDSIDYRKKGYVTP 10 160 170 180 190 200 210 220 230 gi|115 KTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPKQEK --------------------------------------------------------------------------- >>LUBO11 annexin XI form A - bovine (503 aa) initn: 47 init1: 47 opt: 101 Z-score: 119.6 expect() 7.9 Smith-Waterman score: 101; 26.415% identity in 106 aa overlap Entrez lookup Re-search database >LUBO11 107- 207: ---------------------- : 70 80 90 100 110 120 130 140 gi|115 REGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFS-ATG :.:. : . .: :::. . .: :. ... :.: LUBO11 YPGAPVPGQPMLPPGQQPPGVYPGQPPMTYPGQSPVPPPGQQPV----PSYPGYSGSGTVTPAVSPAQFGNRGTITDASG 130 140 150 160 170 180 190 150 160 170 180 190 200 210 220 gi|115 ---ALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQD-NGGLDSEESYPYEATEESCKYNPKYSVAN .....::. . .. .:: ..:: : ..:. . :... 
: .: : :: : .: : LUBO11 FDPLRDAEVLRKAMKGFGTDEQAIIDCLGSRSNKQRQQILLSFKTAYGKDLIKDLKSELSGNFEKTILALMKTPVLFDAY 200 210 220 230 240 250 260 270 230 240 250 260 270 280 290 300 gi|115 DTGFVDIPKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNS LUBO11 EIKEAIKGAGTDEACLIEILASRSNEHIRELNRVYKTEFKKTLEEAIRSDTSGHFQRLLISLSQGNRDESTNVDMTLVQR 280 290 300 310 320 330 340 350 --------------------------------------------------------------------------- >>S23447 annexin XI form B - bovine (505 aa) initn: 47 init1: 47 opt: 101 Z-score: 119.6 expect() 7.9 Smith-Waterman score: 101; 26.415% identity in 106 aa overlap Entrez lookup Re-search database >S23447 107- 207: ---------------------- : 70 80 90 100 110 120 130 140 gi|115 REGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFS-ATG :.:. : . .: :::. . .: :. ... :.: S23447 YPGAPVPGQPMLPPGQQPPGVYPGQPPMTYPGQSPVPPPGQQPV----PSYPGYSGSGTVTPAVSPAQFGNRGTITDASG 130 140 150 160 170 180 190 150 160 170 180 190 200 210 220 gi|115 ---ALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQD-NGGLDSEESYPYEATEESCKYNPKYSVAN .....::. . .. .:: ..:: : ..:. . :... : .: : :: : .: : S23447 FDPLRDAEVLRKAMKGFGTDEQAIIDCLGSRSNKQRQQILLSFKTAYGKDLIKDLKSELSGNFEKTILALMKTPVLFDAY 200 210 220 230 240 250 260 270 230 240 250 260 270 280 290 300 gi|115 DTGFVDIPKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNS S23447 EIKEAIKGAGTDEACLIEILASRSNEHIRELNRVYKTEFKKTLEEAIRSDTSGHFQRLLISLSQGNRDESTNVDMTLVQR 280 290 300 310 320 330 340 350 --------------------------------------------------------------------------- >>B45658 pancreatic lipase (EC 3.1.1.3) - sheep (fragment) (86 aa) initn: 33 init1: 33 opt: 90 Z-score: 118.5 expect() 9 Smith-Waterman score: 90; 27.273% identity in 88 aa overlap Entrez lookup Re-search database >B45658 195- 282: ------------------ : 160 170 180 190 200 210 220 230 gi|115 TGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSVANDTGFVDIPKQEKA ::: : : ...: : . .: .:. :::. . . 
:
B45658 ITGLDPAEPY-FQGTPELVRLDP-----SDAQFVDVIHTDAA
10 20 30
240 250 260 270 280 290 300 310
gi|115 LMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEEWGMGGYVKM
: : .: .. :: .: . .: : :... ... : . :
B45658 PMIPNLGFGTMSQVV--GHLDF--FPNGGKEMPGCQKNALSQIVDIDGIWEGTR
40 50 60 70 80
320 330
gi|115 AKDRRNHCGIASAASYPTV
---------------------------------------------------------------------------
333 residues in 1 query sequences
33852246 residues in 105998 library sequences
Tcomplib (4 proc)[version 3.1t02 March, 1998]
start: Sun Apr 26 17:06:07 1998 done: Sun Apr 26 17:07:51 1998
Scan time: 75.017 Display time: 6.533
Function used was FASTA