/seqprg/slib/bin/fasta3_t -w 80 -m 6 -q @ %p FASTA searches a protein or DNA sequence data bank version 3.1t02 March, 1998 Please cite: W.R. Pearson & D.J. Lipman PNAS (1988) 85:2444-2448 @: 569 aa QUERY sequence vs NBRF Protein database (complete) library searching /seqlib/lib/pir1.seq 5 library searching /seqlib/lib/pir2.seq 5 library searching /seqlib/lib/pir3.seq 5 library searching /seqlib/lib/pir4.seq 5 library opt E() < 20 867 0:====== 22 2 0:= one = represents 172 library sequences 24 6 0:= 26 13 2:* 28 35 24:* 30 120 146:* 32 407 563:===* 34 1187 1527:======= * 36 2872 3136:================= * 38 5169 5183:==============================* 40 7414 7230:==========================================*= 42 8922 8837:===================================================* 44 10313 9748:========================================================*=== 46 10259 9929:=========================================================*== 48 9593 9506:=======================================================* 50 8347 8674:================================================= * 52 7393 7626:=========================================== * 54 6240 6514:=====================================* 56 5045 5441:============================== * 58 4267 4467:=========================* 60 3431 3619:==================== * 62 2783 2901:================* 64 2172 2307:=============* 66 1787 1824:==========* 68 1447 1434:========* 70 1149 1124:======* 72 932 878:=====* 74 667 685:===* 76 584 533:===* 78 433 414:==* 80 336 322:=* 82 292 246:=* 84 254 195:=* 86 182 151:*= 88 149 117:* inset = represents 5 library sequences 90 129 90:* 92 105 70:* :=============*======= 94 82 54:* :==========*====== 96 76 42:* :========*======= 98 57 32:* :======*===== 100 53 25:* :====*====== 102 38 19:* :===*==== 104 40 15:* :==*===== 106 35 12:* :==*==== 108 22 9:* :=*=== 110 25 7:* :=*=== 112 11 5:* :*== 114 17 4:* :*=== 116 9 3:* :*= 118 15 2:* :*== >120 215 2:*= :*======================================= 33852246 residues in 105998 sequences statistics extrapolated from 50000 to 105703 sequences Expectation_n fit: rho(ln(x))= 6.3449+/-0.000483; mu= 3.5262+/- 0.026; mean_var=95.7100+/-17.634, Z-trim: 103 B-trim: 992 in 1/63 Kolmogorov-Smirnov statistic: 0.0136 (N=29) at 66 FASTA (3.14 April, 1998) function (optimized, BL50 matrix) ktup: 2 join: 37, opt: 25, gap-pen: -12/ -2, width: 16 reg.-scaled Scan time: 104.117 --------------------------------------------------------------------------- The best scores are: initn init1 opt z-sc E(105703) A45624 trophozoite cysteine proteinase - Plasmodium falciparu ( 569) 3739 3739 3739 3827.1 2.4e-206 align S46265 cysteine proteinase - Plasmodium vivax ( 583) 1326 680 1574 1614.0 4.6e-83 align S32561 cysteine proteinase - Plasmodium vinckei ( 506) 852 555 1317 1352.2 1.7e-68 align A45565 cysteine proteinase - Theileria annulata ( 441) 494 192 488 505.7 2.5e-21 align S49166 cysteine proteinase precursor - spring vetch ( 357) 532 211 420 437.6 1.5e-17 align KHQBTT cysteine proteinase (EC 3.4.22.-) precursor - Theileri ( 439) 425 187 417 433.2 2.7e-17 align KHHUL cathepsin L (EC 3.4.22.15) precursor - human ( 333) 378 226 413 430.9 3.6e-17 align JN0633 caricain (EC 3.4.22.30) I precursor - papaya ( 348) 508 302 413 430.6 3.8e-17 align A49868 probable cysteine proteinase OC-2 precursor, osteoclas ( 329) 425 274 412 429.9 4.1e-17 align JN0634 caricain (EC 3.4.22.30) II precursor - papaya ( 367) 513 302 410 427.2 5.8e-17 align S22502 endopeptidase - kidney bean ( 362) 478 218 407 424.2 8.5e-17 align S06837 glycyl endopeptidase (EC 3.4.22.25) - papaya ( 216) 504 291 394 414.3 3e-16 align S57777 cysteine proteinase (EC 3.4.22.-) precursor - Hemeroca ( 360) 500 182 397 414.0 3.1e-16 align JA0159 cysteine proteinase (EC 3.4.22.-) precursor - tomato ( ( 346) 492 224 395 412.2 4e-16 align S12581 cysteine proteinase (EC 3.4.22.-) - black gram ( 362) 481 221 395 411.9 4.1e-16 align S44151 cathepsin L (EC 3.4.22.15) - fluke (Schistosoma manson ( 317) 441 185 391 408.7 6.2e-16 align JC2476 cathepsin K (EC 3.4.22.-) precursor - human ( 329) 420 264 391 408.5 6.4e-16 align KHRTL cathepsin L (EC 3.4.22.15) precursor - rat ( 334) 320 189 390 407.4 7.4e-16 align KHMSL cathepsin L (EC 3.4.22.15) precursor - mouse ( 334) 283 181 389 406.3 8.4e-16 align JN0719 drought-inducible cysteine proteinase (EC 3.4.22.-) RD ( 462) 459 219 391 406.3 8.5e-16 align JC5443 cathepsin L-like cysteine proteinase (EC 3.4.-.-) c1 - ( 338) 463 224 389 406.3 8.5e-16 align S57776 cysteine proteinase - clove pink (fragment) ( 427) 537 214 390 405.8 9.1e-16 align S03964 stem bromelain (EC 3.4.22.32) - pineapple ( 212) 373 196 384 404.2 1.1e-15 align JX0366 cysteine endopeptidase (EC 3.4.22.-) precursor - silkw ( 344) 427 202 387 404.1 1.1e-15 align KHRZOB oryzain (EC 3.4.22.-) beta precursor - rice ( 471) 463 208 386 401.0 1.7e-15 align PPPA papain (EC 3.4.22.2) precursor - papaya ( 345) 553 249 383 400.0 1.9e-15 align KHRZOA oryzain (EC 3.4.22.-) alpha precursor - rice ( 458) 484 210 383 398.1 2.4e-15 align A53810 cathepsin L (EC 3.4.22.15) precursor - flesh fly (Sarc ( 339) 430 210 380 397.0 2.8e-15 align A58195 cathepsin L (EC 3.4.22.15) precursor - pig ( 334) 350 221 379 396.1 3.1e-15 align KHDOP prestalk cathepsin (EC 3.4.22.-) precursor - slime mold ( 376) 301 179 378 394.3 3.9e-15 align JC5442 cathepsin L-like cysteine proteinase (EC 3.4.-.-) g3 - ( 331) 383 216 377 394.1 4e-15 align JC5441 cathepsin L-like cysteine proteinase (EC 3.4.-.-) g2 - ( 331) 381 216 376 393.1 4.6e-15 align A42482 cathepsin S (EC 3.4.22.27) - human ( 331) 382 179 376 393.1 4.6e-15 align S41428 cysteine proteinase (EC 3.4.22.-) CP2 precursor - Tric ( 314) 358 250 373 390.4 6.5e-15 align S15844 cathepsin S (EC 3.4.22.27) - bovine ( 217) 337 199 370 389.7 7.1e-15 align S04222 chymopapain (EC 3.4.22.6) - papaya ( 218) 461 256 370 389.7 7.1e-15 align S43991 cathepsin L-like proteinases - liver fluke ( 326) 454 236 372 389.1 7.7e-15 align S47434 cysteine proteinase - rice ( 378) 458 188 372 388.2 8.7e-15 align S46476 cysteine proteinase III - mountain papaya ( 214) 353 262 368 387.8 9.1e-15 align S47312 cysteine proteinase (EC 3.4.22.-) precursor - spring v ( 368) 470 174 369 385.3 1.3e-14 align JC4848 cysteine proteinase (EC 3.4.22.-) - Douglas fir ( 454) 584 210 370 384.9 1.3e-14 align KHHUH cathepsin H (EC 3.4.22.16) precursor - human ( 335) 364 164 364 380.8 2.2e-14 align JQ1111 cysteine proteinase (EC 3.4.22.-) EP-B 1 precursor - b ( 371) 476 181 364 380.1 2.4e-14 align JQ1110 cysteine proteinase (EC 3.4.22.-) EP-B 4 precursor - b ( 373) 476 181 364 380.1 2.4e-14 align S24602 cysteine proteinase tpp (EC 3.4.22.-) - garden pea ( 464) 486 200 364 378.6 2.9e-14 align KHBH aleurain (EC 3.4.22.-) precursor - barley ( 361) 422 196 361 377.2 3.5e-14 align S67481 cysteine proteinase CP1 - fruit fly (Drosophila melano ( 218) 379 183 355 374.3 5.1e-14 align S47433 cathepsin L (EC 3.4.22.15) - Norway lobster ( 313) 385 194 355 372.0 6.9e-14 align S29245 cysteine proteinase (EC 3.4.22.-) precursor - Leishman ( 443) 403 270 357 371.8 7.1e-14 align A48566 cysteine proteinase Lpcys2 (EC 3.4.22.-) - Leishmania ( 444) 415 282 353 367.7 1.2e-13 align S19651 cysteine proteinase (EC 3.4.22.-) precursor (clone LCP ( 320) 333 170 350 366.7 1.4e-13 align JQ1121 cysteine proteinase homolog COT44 - rape ( 328) 430 204 347 363.5 2e-13 align KHCHL cathepsin L (EC 3.4.22.15) - chicken ( 218) 361 199 344 363.1 2.2e-13 align A55090 cathepsin O (EC 3.4.-.-) precursor - human ( 321) 297 200 344 360.6 3e-13 align S19650 cysteine proteinase (EC 3.4.22.-) precursor (clone LCP ( 323) 332 179 342 358.5 3.9e-13 align KHRTH cathepsin H (EC 3.4.22.16) precursor - rat ( 333) 358 153 342 358.3 4e-13 align I58002 cathepsin-related protein - rat (fragment) ( 236) 320 176 338 356.5 5.1e-13 align TAGB actinidain (EC 3.4.22.14) precursor - kiwi fruit ( 380) 407 138 341 356.4 5.1e-13 align S53027 cathepsin L (EC 3.4.22.15) precursor - penaeid shrimp ( 326) 390 201 340 356.4 5.1e-13 align JC5691 cysteine proteinase (EC 3.4.-.-) - Bombyx mori nuclear ( 323) 398 207 331 347.3 1.6e-12 align S12099 cysteine proteinase (EC 3.4.22.-) precursor - Trypanos ( 450) 364 228 332 346.1 1.9e-12 align S62736 cathepsin-like cysteine proteinase (EC 3.4.22.-) - Aut ( 323) 373 199 326 342.2 3.2e-12 align S62735 cathepsin - Choristoneura fumiferana nuclear polyhedro ( 324) 321 211 325 341.1 3.6e-12 align S49451 cysteine proteinase - chickpea ( 325) 429 176 324 340.1 4.1e-12 align S41425 cysteine proteinase (EC 3.4.22.-) CP3 precursor - Tric ( 278) 283 247 323 340.1 4.1e-12 align S37048 cysteine proteinase - Trypanosoma congolense ( 447) 380 212 325 339.0 4.7e-12 align S41427 cysteine proteinase (EC 3.4.22.-) CP1 precursor - Tric ( 309) 383 238 322 338.3 5.2e-12 align I52525 testin precursor - rat ( 333) 291 170 319 334.8 8.1e-12 align A60667 cysteine proteinase cruzain (EC 3.4.22.-) - Trypanosom ( 467) 369 192 320 333.6 9.4e-12 align S27044 papain-like protein - Autographa californica nuclear p ( 208) 335 199 313 331.7 1.2e-11 align KHRZOG oryzain (EC 3.4.22.-) gamma precursor - rice ( 362) 420 200 316 331.2 1.3e-11 align A47306 cysteine proteinase - Tetrahymena thermophila (SGC5) ( 336) 384 175 315 330.7 1.4e-11 align A45629 cruzipain - Trypanosoma cruzi ( 467) 368 191 317 330.6 1.4e-11 align S19649 cysteine proteinase (EC 3.4.22.-) precursor (clone LCP ( 322) 352 183 313 328.9 1.7e-11 align S66348 senescence-associated cysteine proteinase precursor (c ( 356) 417 195 310 325.2 2.8e-11 align S59598 cysteine proteinase 2 precursor - maize ( 360) 422 203 307 322.0 4.2e-11 align A44938 cysteine proteinase (EC 3.4.22.-) - Trypanosoma cruzi ( 183) 296 186 301 320.3 5.2e-11 align S07051 cysteine proteinase (EC 3.4.22.-) precursor - Trypanos ( 450) 292 208 306 319.6 5.7e-11 align S47432 cathepsin L (EC 3.4.22.15) - Norway lobster ( 324) 327 171 299 314.5 1.1e-10 align S46535 probable cysteine proteinase (EC 3.4.22.-) (clone A149 ( 313) 459 188 294 309.6 2e-10 align A45087 cathepsin S (EC 3.4.22.27) - rat ( 330) 390 172 291 306.2 3.2e-10 align KHDO cysteine proteinase 1 (EC 3.4.22.-) precursor - slime mo ( 343) 288 148 290 305.0 3.7e-10 align KHSYO4 oil bodies-associated protein P34 precursor - soybean ( 379) 442 218 290 304.3 4.1e-10 align S42882 cysteine proteinase (EC 3.4.22.-) precursor - spring v ( 358) 329 206 283 297.5 9.7e-10 align C44938 cysteine proteinase (EC 3.4.22.-) - Entamoeba histolyt ( 165) 265 181 278 297.4 9.8e-10 align S11862 cysteine proteinase homolog - garden pea ( 363) 314 206 283 297.4 9.8e-10 align S55923 cysteine proteinase (EC 3.4.22.-) precursor - soybean ( 380) 448 199 280 294.1 1.5e-09 align S24988 cysteine proteinase (EC 3.4.22.-) precursor - tomato ( 361) 357 177 272 286.2 4.1e-09 align S68783 cathepsin L (EC 3.4.22.15) precursor - Paramecium tetr ( 314) 329 132 270 285.1 4.8e-09 align S25267 cysteine proteinase (EC 3.4.22.-) precursor - Leishman ( 354) 370 220 268 282.3 6.9e-09 align B23705 cysteine proteinase (EC 3.4.22.-) - Entamoeba histolyt ( 312) 260 91 264 279.0 1e-08 align JN0718 drought-inducible cysteine proteinase (EC 3.4.22.-) RD ( 368) 465 181 262 275.9 1.6e-08 align S59597 cysteine proteinase 1 precursor - maize ( 371) 341 124 261 274.8 1.8e-08 align S57427 cysteine proteinase (EC 3.4.22.-) 4 - Tritrichomonas f ( 152) 175 146 254 273.4 2.1e-08 align PQ0650 senescence-associated protein SAG2 - Arabidopsis thali ( 95) 201 201 250 272.4 2.4e-08 align S68784 cathepsin L - Paramecium tetraurelia (SGC5) (fragment) ( 294) 300 188 252 267.1 4.8e-08 align S57423 cysteine proteinase (EC 3.4.22.-) 9 - Tritrichomonas f ( 152) 197 123 243 262.2 9e-08 align S30149 probable cysteine proteinase precursor (clone CYP-7) - ( 363) 319 141 247 260.6 1.1e-07 align S30150 probable cysteine proteinase precursor (clone CYP-8) - ( 365) 274 141 247 260.6 1.1e-07 align S57426 cysteine proteinase (EC 3.4.22.-) 5 - Tritrichomonas f ( 155) 135 135 236 254.9 2.3e-07 align A61500 allergen Der f I precursor - house-dust mite (Dermatop ( 319) 284 166 236 250.2 4.2e-07 align S21864 probable cysteine proteinase (EC 3.4.22.-) - Euroglyph ( 211) 268 158 224 240.7 1.4e-06 align JQ0337 allergen Der p 1 - house-dust mite (Dermatophagoides p ( 245) 232 145 220 235.6 2.7e-06 align S60479 cathepsin B-like cysteine proteinase (EC 3.4.22.-) - A ( 356) 210 107 220 233.2 3.7e-06 align S57422 cysteine proteinase (EC 3.4.22.-) 8 - Tritrichomonas f ( 152) 121 121 212 230.5 5.2e-06 align S57421 cysteine proteinase (EC 3.4.22.-) 6 - Tritrichomonas f ( 152) 128 128 211 229.5 6e-06 align S57425 cysteine proteinase (EC 3.4.22.-) 7 - Tritrichomonas f ( 152) 111 111 210 228.5 6.8e-06 align B44938 cysteine proteinase (EC 3.4.22.-) - Trypanosoma brucei ( 166) 253 143 209 226.9 8.3e-06 align A41158 dipeptidyl-peptidase I (EC 3.4.14.1) precursor - rat ( 462) 301 96 173 183.4 0.0022 align S02729 actinidain (EC 3.4.22.14) precursor (clone pAC.7) - ki ( 184) 238 131 163 179.2 0.0038 align S58770 cathepsin B (EC 3.4.22.1) precursor - chicken ( 340) 229 87 164 176.2 0.0055 align S02728 actinidain (EC 3.4.22.14) precursor (clone pAC.1) - ki ( 302) 371 126 158 170.9 0.011 align B48566 cysteine proteinase Lpcys1 (EC 3.4.22.-) - Leishmania ( 149) 162 126 152 169.3 0.013 align S66504 dipeptidyl-peptidase I (EC 3.4.14.1) precursor - human ( 463) 339 106 159 169.1 0.014 align A45524 cysteine proteinase (EC 3.4.22.-) AC-1 precursor - nem ( 342) 164 70 152 163.9 0.027 align A44965 cysteine proteinase (EC 3.4.22.-) AC-2 precursor - nem ( 342) 164 70 152 163.9 0.027 align B26074 cysteine proteinase (EC 3.4.22.-) 13 - papaya (fragmen ( 96) 180 110 143 163.0 0.03 align S60456 cysteine proteinase (EC 3.4.22.-), glucose starvation- ( 145) 171 111 145 162.3 0.033 align A23770 asparagine-rich protein - Plasmodium falciparum ( 537) 64 64 152 161.0 0.039 align S31914 cysteine proteinase - chickpea (fragment) ( 111) 120 60 141 160.0 0.044 align S46541 cysteine proteinase - chickpea (fragment) ( 111) 120 60 141 160.0 0.044 align B48435 cysteine proteinase AC-5 - nematode (Haemonchus contor ( 348) 168 112 148 159.7 0.046 align JC6009 surface-located membrane protein lmp3 - Mycoplasma hom (1302) 66 41 153 156.3 0.071 align S23207 DNA-directed RNA polymerase (EC 2.7.7.6) chain a - eug ( 528) 92 69 147 156.0 0.074 align A69493 cysteine proteinase homolog - Archaeoglobus fulgidus (1088) 199 97 151 155.4 0.08 align S58729 hypothetical protein N2485 - yeast (Saccharomyces cere ( 237) 34 34 141 155.1 0.083 align S16162 cruzipain (EC 3.4.22.-) - Trypanosoma cruzi (fragment) ( 173) 98 98 138 154.0 0.095 align S62150 hypothetical protein YNL050c - yeast (Saccharomyces ce ( 270) 34 34 140 153.2 0.11 align S57451 cysteine proteinase (EC 3.4.22.-) 3 - Tritrichomonas f ( 157) 136 99 136 152.6 0.11 align KHRTB cathepsin B (EC 3.4.22.1) precursor - rat ( 339) 247 100 136 147.6 0.22 align A48454 cathepsin B-like cysteine proteinase (EC 3.4.-.-) - ne ( 342) 217 107 136 147.6 0.22 align S14535 asparagine-rich protein (clone 25C4) - Plasmodium falc ( 669) 77 63 139 146.3 0.26 align H64709 hypothetical protein HP1520 - Helicobacter pylori (str ( 430) 58 58 136 146.1 0.26 align D48435 cysteine proteinase AC-3 - nematode (Haemonchus contor ( 341) 234 97 134 145.5 0.28 align B48454 cathepsin B-like cysteine proteinase (EC 3.4.-.-) CP-3 ( 174) 143 98 129 144.8 0.31 align S35580 proteinase IV - mountain papaya (fragment) ( 43) 102 102 119 143.6 0.36 align KHMSB cathepsin B (EC 3.4.22.1) precursor - mouse ( 339) 243 95 131 142.5 0.42 align A29172 cathepsin B-like cysteine proteinase (EC 3.4.22.-) - b ( 73) 115 115 121 142.3 0.43 align C64246 hypothetical protein MG419 - Mycoplasma genitalium (SG ( 287) 46 46 129 141.6 0.47 align KHHUB cathepsin B (EC 3.4.22.1) precursor - human ( 339) 256 102 130 141.5 0.48 align C64409 hypothetical protein MJ0875 - Methanococcus jannaschii ( 748) 66 40 134 140.4 0.54 align A56677 neuronal cell cycle withdrawal protein QN1 - quail (fr (1251) 51 51 137 140.2 0.56 align KHBOB cathepsin B (EC 3.4.22.1) precursor - bovine ( 335) 254 101 128 139.5 0.61 align A38194 desmoplakin I - human (2677) 89 53 140 138.3 0.72 align A42771 reticulocyte-binding protein 1 - Plasmodium vivax (2829) 62 62 140 138.0 0.75 align A38748 3-phosphatidylinositol kinase (EC 2.7.1.-) 85K chain - ( 724) 71 71 131 137.6 0.78 align S41720 intermediate filament - goldfish ( 472) 77 53 128 137.3 0.81 align S49394 HsdR1 protein - Mycoplasma pulmonis (SGC3) ( 986) 84 54 132 136.6 0.89 align S35578 cysteine proteinase II - mountain papaya (fragment) ( 43) 66 66 112 136.5 0.9 align S42488 heat shock protein 70 - Pyrenomonas salina nucleomorph ( 649) 78 78 129 136.3 0.93 align S28104 probable DNA-directed RNA polymerase (EC 2.7.7.6) - gi (1102) 48 48 132 135.9 0.98 align S64493 hypothetical protein YGR179c - yeast (Saccharomyces ce ( 406) 107 53 125 135.2 1.1 align S57624 cysteine proteinase LmCPb19 - Leishmania mexicana (fra ( 136) 147 91 118 135.1 1.1 align S64439 hypothetical protein YGR130c - yeast (Saccharomyces ce ( 816) 58 58 129 134.8 1.1 align A64224 hypothetical protein MG218 - Mycoplasma genitalium (SG (1805) 41 41 134 134.7 1.1 align S38939 probable cathepsin B-like cysteine proteinase (EC 3.4. ( 344) 260 101 123 134.2 1.2 align S41426 cysteine proteinase (EC 3.4.22.-) CP4 precursor - Tric ( 100) 96 96 114 133.1 1.4 align A41404 cathepsin L (EC 3.4.22.15) - cat (fragment) ( 139) 195 87 116 133.0 1.4 align A28121 major merozoite surface antigen - Plasmodium yoelii (f ( 680) 61 61 126 132.9 1.4 align A38747 phosphatidlyinositol 3-kinase (EC 2.7.1.-) 85K chain - ( 724) 66 66 126 132.5 1.5 align SAZQK1 major merozoite surface antigen precursor - Plasmodium (1631) 107 55 131 132.3 1.5 align S05603 major merozoite surface antigen precursor - Plasmodium (1639) 128 51 131 132.3 1.5 align C48435 cysteine proteinase AC-4 - nematode (Haemonchus contor ( 342) 276 98 121 132.2 1.6 align A57480 tubulointerstitial nephritis antigen precursor - rabbi ( 474) 118 95 123 132.2 1.6 align H64387 hypothetical protein MJ0704 - Methanococcus jannaschii ( 377) 78 78 120 130.6 1.9 align A37488 Ras guanine nucleotide exchange factor son-of-sevenles (1333) 33 33 128 130.6 1.9 align S35577 cysteine proteinase I - mountain papaya (fragment) ( 43) 72 72 106 130.4 2 align A24594 probable major surface antigen (83K, 19K, 42K) precurs (1640) 127 51 129 130.2 2 align A38749 3-phosphatidylinositol kinase (EC 2.7.1.-) 85K chain a ( 724) 70 70 123 129.4 2.2 align G64245 hypothetical protein homolog MG413 - Mycoplasma genita ( 728) 135 105 123 129.4 2.2 align A64505 P115 homolog - Methanococcus jannaschii (1169) 129 68 126 129.4 2.2 align S54052 DOS1 protein - yeast (Saccharomyces cerevisiae) ( 310) 53 53 117 128.8 2.4 align A23705 cysteine proteinase (EC 3.4.22.-) - Entamoeba histolyt ( 312) 202 80 117 128.7 2.4 align A39340 neurofilament protein 60K splice form NF60 - longfin s ( 511) 64 64 120 128.6 2.5 align S23941 dipeptidyl-peptidase I (EC 3.4.14.1) - human (fragment ( 119) 76 76 110 127.8 2.7 align H64474 hypothetical protein MJ1401 - Methanococcus jannaschii ( 808) 76 76 122 127.7 2.8 align B39340 neurofilament protein 70K splice form NF70 - longfin s ( 615) 64 64 120 127.4 2.9 align S67069 hypothetical protein YOR177c - yeast (Saccharomyces ce ( 464) 48 48 118 127.2 3 align H64245 hypothetical protein MG414 - Mycoplasma genitalium (SG (1036) 135 105 123 127.1 3 align S31909 cathepsin B-like cysteine proteinase (EC 3.4.22.-) - f ( 316) 183 77 115 126.6 3.2 align S57751 protein phosphatase 2A chain B - yeast (Candida tropic ( 508) 64 39 118 126.6 3.2 align S31907 cathepsin B (EC 3.4.22.1) - fluke (Schistosoma japonic ( 342) 227 86 115 126.1 3.4 align S55940 telomerase component p95 - Tetrahymena thermophila (SG ( 872) 44 44 120 125.1 3.9 align JQ1515 heat-shock protein HSP70 - Chlamydomonas reinhardtii ( 649) 36 36 118 125.0 3.9 align S67600 hypothetical protein YDL065c - yeast (Saccharomyces ce ( 350) 77 46 114 124.9 4 align B33501 myosin heavy chain 2, smooth muscle - rabbit (fragment ( 484) 47 47 116 124.9 4 align F64055 prrD protein homolog - Haemophilus influenzae (strain ( 163) 35 35 109 124.8 4.1 align A64238 lipase-esterase lip1 homolog - Mycoplasma genitalium ( ( 273) 35 35 112 124.5 4.2 align S04511 keratin 3, type I, cytoskeletal (clone pUF451) - Afric ( 327) 50 50 113 124.3 4.3 align S07533 puff II/9A-2 protein precursor - fungus gnat (Sciara c ( 286) 52 52 112 124.2 4.4 align S46426 probable botulinum neurotoxin regulator protein 22 - C ( 179) 39 39 109 124.2 4.4 align S58691 kinesin-related polypeptides SpKRP95 - sea urchin (Str ( 742) 42 42 118 124.1 4.4 align C64439 asparagine synthetase (EC 6.3.-.-) - Methanococcus jan ( 544) 58 58 116 124.1 4.4 align S21175 heat shock protein 71 - rainbow trout ( 651) 92 92 117 124.0 4.5 align F69723 trigger factor (prolyl isomerase) tig - Bacillus subti ( 424) 83 50 113 122.7 5.3 align S40460 ribosomal protein S3 - Chlamydomonas frankii chloropla ( 809) 68 41 117 122.6 5.4 align S67593 transport protein USO1 - yeast (Saccharomyces cerevisi (1790) 51 51 122 122.5 5.4 align S49369 mob protein - Campylobacter coli ( 321) 64 41 111 122.4 5.5 align S05362 probable DNA-directed DNA polymerase (EC 2.7.7.7) - fu (1202) 83 52 119 122.0 5.8 align A49464 chromosome segregation protein SMC1 - yeast (Saccharom (1225) 92 62 119 121.9 5.9 align S20614 hypothetical protein 1738 - beechdrops plastid (1738) 62 54 121 121.7 6 align A54639 parasitophorous vacuole antigen p126 - Plasmodium falc ( 427) 129 64 112 121.6 6.1 align C56657 PfEMP2/MESA (clone 9025/60) - Plasmodium falciparum (f ( 230) 88 61 108 121.5 6.2 align D64332 hypothetical protein MJ0259 - Methanococcus jannaschii ( 202) 59 59 107 121.3 6.3 align D64245 peripheral membrane protein B homolog - Mycoplasma gen ( 329) 88 61 110 121.2 6.4 align F64639 hypothetical protein HP0958 - Helicobacter pylori (str ( 254) 68 68 108 120.9 6.7 align A57681 hypothetical protein - Mycoplasma capricolum (SGC3) ( 655) 64 64 114 120.9 6.7 align S12319 pre-mRNA splicing factor PRP6 - yeast (Saccharomyces c ( 899) 55 55 116 120.9 6.7 align S70790 lipA protein - Mycoplasma pulmonis (SGC3) (fragment) ( 261) 80 53 108 120.7 6.8 align JQ0647 Div protein - Bacillus subtilis ( 841) 100 47 115 120.3 7.2 align F69704 preprotein translocase subunit secA - Bacillus subtili ( 841) 100 47 115 120.3 7.2 align A56157 chromosome segregation protein SMC2 - yeast (Saccharom (1170) 69 46 117 120.2 7.3 align B64136 molybdenum cofactor biosynthesis protein A - Haemophil ( 337) 67 44 109 120.1 7.4 align A46194 high-molecular-weight neurofilament protein NF-220 - S (1200) 94 64 117 120.0 7.5 align F64501 hypothetical protein MJ1615 - Methanococcus jannaschii ( 255) 67 50 107 119.8 7.7 align F32946 cysteine proteinase (EC 3.4.22.-) - Caenorhabditis ele ( 53) 105 80 97 119.8 7.7 align S77691 probable finger protein YBR267w - yeast (Saccharomyces ( 393) 76 52 109 119.1 8.4 align S41415 heat shock protein 70 - rat ( 641) 76 76 112 119.0 8.6 align I49761 heat shock protein 70 - mouse ( 641) 78 78 112 119.0 8.6 align A27077 heat shock cognate protein 70 - human ( 646) 85 85 112 118.9 8.6 align JC4853 heat-shock protein 73 - mouse ( 646) 85 85 112 118.9 8.6 align A35922 heat shock cognate protein 70 - Chinese hamster ( 646) 85 85 112 118.9 8.6 align A45935 heat shock cognate protein 70 - mouse ( 646) 85 85 112 118.9 8.6 align S07197 heat shock cognate protein hsc73 - rat ( 646) 85 85 112 118.9 8.6 align S31716 hsp72-ps1 protein - rat ( 646) 85 85 112 118.9 8.6 align D64467 hypothetical protein MJ1341 - Methanococcus jannaschii ( 312) 55 55 107 118.5 9.1 align A61061 actinidain (EC 3.4.22.14) - kiwi fruit (cv. Hayward) ( ( 110) 163 96 100 118.1 9.5 align PC4035 cell-cycle-dependent 350K nuclear protein - human (fra (1017) 49 49 114 118.0 9.7 align PWSP1 H+-transporting ATP synthase (EC 3.6.1.34) chain I - sp ( 184) 65 65 103 117.9 9.9 align S60818 M protein precursor - Streptococcus pyogenes (serotype ( 116) 41 41 100 117.8 10 align --------------------------------------------------------------------------- >>>@, 569 aa vs %p library >>A45624 trophozoite cysteine proteinase - Plasmodium falciparum (569 aa) initn: 3739 init1: 3739 opt: 3739 Z-score: 3827.1 expect() 2.4e-206 Smith-Waterman score: 3739; 100.000% identity in 569 aa overlap Entrez lookup Re-search database >A45624 1- 569:---------------------------------------------------------------------: 10 20 30 40 50 60 70 80 QUERY MVAIKEMKELAFARPSLVETLNKKKKFLKKKEKRTFVLSIYAFITFIIFCIGILYFTNKSSAHNNNNNKNEHSLKKEEIE :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: A45624 MVAIKEMKELAFARPSLVETLNKKKKFLKKKEKRTFVLSIYAFITFIIFCIGILYFTNKSSAHNNNNNKNEHSLKKEEIE 10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 160 QUERY LLRVLLEKYKKQKDGILNESSNEEDEEKYTLNSETYNNKNNVSNIKNDSIKSKKEEYINLERILLEKYKKFINENNEENR :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: A45624 LLRVLLEKYKKQKDGILNESSNEEDEEKYTLNSETYNNKNNVSNIKNDSIKSKKEEYINLERILLEKYKKFINENNEENR 90 100 110 120 130 140 150 160 170 180 190 200 210 220 230 240 QUERY KELSNILHKLLEINKLILREEKDDKKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNID :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: A45624 KELSNILHKLLEINKLILREEKDDKKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNID 170 180 190 200 210 220 230 240 250 260 270 280 290 300 310 320 QUERY EQMRKFEIFKINYISIKNHNKLNKNAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYT :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: A45624 EQMRKFEIFKINYISIKNHNKLNKNAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYT 250 260 270 280 290 300 310 320 330 340 350 360 370 380 390 400 QUERY NGKRNEKDIFSKVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKDNFGCDGGHP :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: A45624 NGKRNEKDIFSKVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKDNFGCDGGHP 330 340 350 360 370 380 390 400 410 420 430 440 450 460 470 480 QUERY FYSFLYVLQNELCLGDEYKYKAKDDMFCLNYRCKRKVSLSSIGAVKENQLILALNEVGPLSVNVGVNNDFVAYSEGVYNG :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: A45624 FYSFLYVLQNELCLGDEYKYKAKDDMFCLNYRCKRKVSLSSIGAVKENQLILALNEVGPLSVNVGVNNDFVAYSEGVYNG 410 420 430 440 450 460 470 480 490 500 510 520 530 540 550 560 QUERY TCSEELNHSVLLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGI :::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::::: A45624 TCSEELNHSVLLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGI 490 500 510 520 530 540 550 560 QUERY GEEVFYPIL ::::::::: A45624 GEEVFYPIL --------------------------------------------------------------------------- >>S46265 cysteine proteinase - Plasmodium vivax (583 aa) initn: 1326 init1: 680 opt: 1574 Z-score: 1614.0 expect() 4.6e-83 Smith-Waterman score: 1650; 46.293% identity in 607 aa overlap Entrez lookup Re-search database >S46265 5- 569:---------------------------------------------------------------------: 10 20 30 40 50 60 70 QUERY MVAIKEMKELAFARPSLVETLNKKKKFLKKKEKRTFVLSIYAFITF-----IIFCIGILYFTN----KSSAHNNNNN-KN ...: . ... :: :.::... . ::. .. . . .::..:: ...:. . .. .:..::.... :. S46265 MAQDIKIMNLTKSSL-EALNRNQMLSKKSSRKILKICMYAILTFAMCGVVLICLTAMSNSDGSLTQSGSHNQSGSLKG 10 20 30 40 50 60 70 80 90 100 110 120 130 140 QUERY EHS-------LKKEEIELLRVLLEKYKKQKDGILNESSNEEDEEKYTLNSETYNNKNNVSNIKNDSIKSKKEEYINLERI : :.: ::: :: .. .: . :.. . .: :: . .. :.... .: :. : : S46265 LSSTPGDGEILNKAEIETLRFIFSNYPHG-----NRDPTGDDVEKPA-DAALPNEEDQKVKIA-DAGKHIK--------- 80 90 100 110 120 130 140 150 160 170 180 190 200 210 QUERY LLEKYKKFINENNEENRKELSNILHKLLE--INKLILREE------KDDKKVYLIND---NYDEKGALEIGMNEEMKY-- :...:.... . .:.:...:...:..::. ::. ..: .. :.: :. :: . .: . ..: . S46265 LMKQYNEIVADMSEDNKEQLAKMLRELLKKKINERKKKREDPNGNNEEGKEVINISVPSFNYKRVSANQDDSDDEEEVSV 150 160 170 180 190 200 210 220 220 230 240 250 260 270 280 QUERY -KKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHNKLNKNAMYKKKVNQFSDYSEEELKEYFKT . : . :.:::::::.::..... ::.:.:::.:.. ::.::..::.::. :. ::: :::::::::..... ::. S46265 AQIEGLFVNLKYASKFFNFMNKYKRSYKDINEQMEKYKNFKMNYLKIKKHNETNQ--MYKMKVNQFSDYSKKDFESYFRK 230 240 250 260 270 280 290 290 300 310 320 330 340 350 360 QUERY LLHVPNHMIEKYSKPF---ENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNI :. .:.:. .:: :: .: :.. : ..: .... :::::::::::::::::::::::::::::::::. S46265 LVPIPDHLKKKYVVPFSSMNNGKGKNVVTS---SSGA----NLLADVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNV 300 310 320 330 340 350 360 370 370 380 390 400 410 420 430 440 QUERY ESVFAKK-NKNILSFSEQEVVDCSKDNFGCDGGHPFYSFLYVLQNELCLGDEYKYKAKDDMFCLNYRCKRKVSLSSIGAV : ..::. ::.::..:::::::::: :::::::::::::.:...: .:.::.::::: :..:::::::: ::.:::.:.: S46265 ECMYAKEHNKTILTLSEQEVVDCSKLNFGCDGGHPFYSFIYAIENGICMGDDYKYKAMDNLFCLNYRCKNKVTLSSVGGV 380 390 400 410 420 430 440 450 450 460 470 480 490 500 510 520 QUERY KENQLILALNEVGPLSVNVGVNNDFVAYSEGVYNGTCSEELNHSVLLVGYGQVEKTKL----NYNNKIQTYNTKENSNQP :::.:: :::::::.::::::..:: :. :..::::.:::::::::::::::...:. : . . . : . : S46265 KENELIRALNEVGPVSVNVGVTDDFSFYGGGIFNGTCTEELNHSVLLVGYGQVQSSKIFQEKNAYDDASGVTKKGALSYP 460 470 480 490 500 510 520 530 530 540 550 560 QUERY ---DDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPIL ::.: ::::::::::: ::::::::.::::.::::::::: ::::::: S46265 SKADDGIQYYWIIKNSWSKFWGENGFMRISRNKEGDNVFCGIGVEVFYPIL 540 550 560 570 580 --------------------------------------------------------------------------- >>S32561 cysteine proteinase - Plasmodium vinckei (506 aa) initn: 852 init1: 555 opt: 1317 Z-score: 1352.2 expect() 1.7e-68 Smith-Waterman score: 1364; 42.207% identity in 571 aa overlap Entrez lookup Re-search database >S32561 9- 568: --------------------------------------------------------------------: 10 20 30 40 50 60 70 80 QUERY MVAIKEMKELAFARPSLVETLNKKKKFLKKKEKRTFVLSIYAFITFIIFCIGILYFTNKSSAHNNNNNKNEHSLKKEEIE .. :. :. ...:... .:: ..:.:. . ::. .. .: :: ..: :... : . .: : ..:: S32561 MSDNIGQINFTIPG-IQSLDENDTYLKINHKKTIKICAYAITAIALFFIGGVFFKNQAKI-NALDAIDEAVLMNKEIA 10 20 30 40 50 60 70 90 100 110 120 130 140 150 160 QUERY LLRVLLEKYKKQKDGILNESSNEEDEEKYTLNSETYNNKNNVSNIKNDSIKSKKEEYINLERILLEKYKKFINENNEENR :: .:.::: . ::.:: : ..:.:::. :. ...:: .: . :. :. : S32561 HLREILNKYKA--------TINEDDEFVY----QAYDNKNGDSE----------------NQLLLMLHKLLKNNANKVNT 80 90 100 110 120 170 180 190 200 210 220 230 240 QUERY KELSNILHKLLEINKLILREEKDDKKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNID ...: : :: : : .. . : :. .::::::::::.:::.:: :.:.: S32561 FDVNN------ESNKNI-----DPTYIF--------------------RQKLESMQDNIKYASKFFKYMKENNKKYENMD 130 140 150 160 170 250 260 270 280 290 300 310 QUERY EQMRKFEIFKINYISIKNHNKL-NKNAM-YKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEF ::...:: ::: :.. ..::.. .::.. : .::::.::.:.::. .::: :: :: . :: :...:: .. ::: S32561 EQLQRFENFKIRYMKTQKHNEMVGKNGLTYVQKVNQYSDFSKEEFDNYFKKLLSVPMDLKSKYIVPLKKHLANTNLIS-- 180 190 200 210 220 230 240 250 320 330 340 350 360 370 380 390 QUERY YTNGKRNEKDIFSKVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAK-KNKNILSFSEQEVVDCSKDNFGCDG ...: ::. :. ::: : ::::: ::::::::..::.: .... ... .:::::..:::: .:.:::: S32561 -VDNKS--KDF----PDSRDYRSKFNFLPPKDQGNCGSCWAFAAIGNFEYLYVHTRHEMPISFSEQQMVDCSTENYGCDG 260 270 280 290 300 310 320 400 410 420 430 440 450 460 470 QUERY GHPFYSFLYVLQNELCLGDEYKYKAKDDMFCLNYRCKRKVSLSSIGAVKENQLILALNEVGPLSVNVGVNNDFVAYSEGV :.:::.:::...: .:::::: ::...:.:::::::. . :: :: :.::.::: :::... ::...::: :: :: S32561 GNPFYAFLYMINNGVCLGDEYPYKGHEDFFCLNYRCSLLGRVHFIGDVKPNELIMALNYVGPVTIAVGASEDFVLYSGGV 330 340 350 360 370 380 390 400 480 490 500 510 520 530 540 QUERY YNGTCSEELNHSVLLVGYGQVEKT------KLNYN-NKIQTYNTKEN-SNQPDDNIIYYWIIKNSWSKKWGENGFMRLSR ..: :. ::::::::::::::.:. . : . : :. : ::: ... ::.::::::..:::. .:::.:..:..: S32561 FDGECNPELNHSVLLVGYGQVKKSLAFEDSHSNVDSNLIKKY--KENIKGDDDDDIIYYWIVRNSWGPNWGEGGYIRIKR 410 420 430 440 450 460 470 480 550 560 QUERY NKNGDNVFCGIGEEVFYPIL :: ::. :::.: .::.:: S32561 NKAGDDGFCGVGSDVFFPIY 490 500 --------------------------------------------------------------------------- >>A45565 cysteine proteinase - Theileria annulata (441 aa) initn: 494 init1: 192 opt: 488 Z-score: 505.7 expect() 2.5e-21 Smith-Waterman score: 594; 33.711% identity in 353 aa overlap Entrez lookup Re-search database >A45565 224- 569: ------------------------------------------: 190 200 210 220 230 240 250 260 QUERY DKKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHNKLN .: :.....::....:.....: :. :: .:.:. . A45565 ESHYPSMDPSKRAGFVEEIVKIRQTGKITSDAESELDMLIEFDAFVEKYKKVHRSFDQRVQRFLTFRKNYHIVKTHKPTE 80 90 100 110 120 130 140 150 270 280 290 300 310 320 330 QUERY KNAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFE----NHLKDNILISEFY-TNGKRNEKDIFSKVPEILD :. .:.::: :.::.: . .. .: . . :: .: .: :. : ::.. ..: .. ::. . : :. A45565 P---YSLDLNKFSDLSDEEFKALYPVI--TPPKTYTSLSKHLEFKKMSH-KNPIYISKLKKAKGIEEIKDLSLITGENLN 160 170 180 190 200 210 220 230 340 350 360 370 380 390 400 410 QUERY YREKGIVHEPKDQG-LCGSCWAFASVGNIESVFA-KKNKNILSFSEQEVVDCSKDNFGCDGGHPFYSFLYVLQNELCLGD . . : :::: :::::::.:....::.. :::. . .::::.:.:.:...:: :: :. .. :. .. . . . A45565 WARTDAVSPTKDQGDHCGSCWAFSSIASVESLYRLYKNKSYF-LSEQELVNCDKSSMGCAGGLPITALEYIHSKGVSFES 240 250 260 270 280 290 300 310 420 430 440 450 460 470 480 490 QUERY EYKYKAKDDMFCLNYRCKRKVSLSSIGAVKENQLILALNEVGPLSVNVGVNNDFVAYSEGVYNGTCSEELNHSVLLVGYG : : . . : . : :: ..::. .: :... ..: :...:.... :: :...: :. ::::.::::: : A45565 EVPYTGIVSP-C-KPSIKNKVFIDSISILKGNDVVNKSLVISPTVVGIAVTKELKLYSGGIFTGKCGGELNHAVLLVGEG 320 330 340 350 360 370 380 500 510 520 530 540 550 560 QUERY QVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPIL ..: . : :::::::.. ::::::.::.:.:.: . ::: . ::: A45565 VDHETGMRY-----------------------WIIKNSWGEDWGENGFLRLQRTKKGLDK-CGILTFGLNPILYSS 390 400 410 420 430 440 --------------------------------------------------------------------------- >>S49166 cysteine proteinase precursor - spring vetch (357 aa) initn: 532 init1: 211 opt: 420 Z-score: 437.6 expect() 1.5e-17 Smith-Waterman score: 548; 31.319% identity in 364 aa overlap Entrez lookup Re-search database >S49166 213- 568: -------------------------------------------: 180 190 200 210 220 230 240 250 QUERY INKLILREEKDDKKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKIN : ... : .... . :. : .:.::. .:..:: : S49166 MEMKKLLFISLSLALIFTVANTFDFNEHDLESEKSLWNLYERWRSHHTVTRNLDEKHNRFNVFKAN 10 20 30 40 50 60 260 270 280 290 300 310 320 330 QUERY YISIKNHNKLNKNAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSK . ..: :::.: :: :.:.:.:... :... . .. .: . :.. ..: . .: :. S49166 VMHVHNTNKLDKP--YKLKLNKFGDMTNYEFRRIYADS-KISHHRM------FRGMSHENGTF--MYENA--------VD 70 80 90 100 110 120 340 350 360 370 380 390 400 410 QUERY VPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDC-SKDNFGCDGGHPFYSFLYVLQNE :: .:.:.:: : :::: :::::::.... .:.. :.....:.:::..::: ...: ::.:: :.: .. :: S49166 VPSSIDWRNKGAVTGVKDQGQCGSCWAFSTIAAVEGINQIKTQKLVSLSEQQLVDCDTEENEGCNGGLMEYAFEFIKQNG 130 140 150 160 170 180 190 200 420 430 440 450 460 470 480 QUERY LCLGDEYKYKAKDDMFCLNYRCKRKVSLSSIGAVKENQLILALNEVG--PLSVNVGVNN-DFVAYSEGVYNGTCSEELNH . ..: : ::: : . . ::... : :. :. .. :.:: . ... .: :::::..: :. .::: S49166 ITTESNYPYAAKDGT-CDVEKEDKAVSIDGHENVPINNEAALLKAAAKQPVSVAIDAGGYNFQFYSEGVFTGHCDTDLNH 210 220 230 240 250 260 270 280 490 500 510 520 530 540 550 560 QUERY SVLLVGYGQVE-KTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFM--RLSRNK-NGDNVFCGIGEEV .: .:::: .. .:: :::.::::. . :: :..:. .. . .:::. :. S49166 GVAIVGYGVTQDRTK-------------------------YWIMKNSWGLE-----FMGPRMGRTGISSREGLCGIAMEA 290 300 310 320 330 QUERY FYPIL ::: S49166 SYPIKKSSTKPTESSILKDEL 340 350 --------------------------------------------------------------------------- >>KHQBTT cysteine proteinase (EC 3.4.22.-) precursor - Theileria parva (439 aa) initn: 425 init1: 187 opt: 417 Z-score: 433.2 expect() 2.7e-17 Smith-Waterman score: 501; 27.557% identity in 352 aa overlap Entrez lookup Re-search database >KHQBTT 210- 560: -------------------------------------------: 170 180 190 200 210 220 230 240 QUERY LLEINKLILREEKDDKKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIF . ..:: . . .: .: ...:. . . .:.. .. : KHQBTT KMLNKFKRELDDHLTKDFPNLERSKRDTCFDELTRLFGDGFLSDDPKLEYEVYREFEEFNSKYNRRHATQQERLNRLVTF 70 80 90 100 110 120 130 140 250 260 270 280 290 300 310 320 QUERY KINYISIKNHNKLNKNAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEK-D . ::. .:... . : : .:.::: .:.:. . : .. : . . . . .:. .. .... . .: : KHQBTT RSNYLEVKEQKG---DEPYVKGINRFSDLTEREFYKLFPVMK--PPKATYSNGYYLLSHMANKTYLKNLKKALNTDEDVD 150 160 170 180 190 200 210 220 330 340 350 360 370 380 390 400 QUERY IFSKVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKDNFGCDGGHPFYSFLYVL . . . : ::.:... : :::. ::.::::..::..:. . .. . .: ::..::.. . ::.:: .. :: KHQBTT LAKLTGENLDWRRSSSVTSVKDQSNCGGCWAFSTVGSVEGYYMSHFDKSYELSVQELLDCDSFSNGCQGGLLESAYEYVR 230 240 250 260 270 280 290 300 410 420 430 440 450 460 470 480 QUERY QNELCLGDEYKYKAKDDMFCLNYRCKRKVSLSSIGAVKENQLILALNEVGPLSVNVGVNNDFVAYSEGVYNGTCSEELNH . : . . . : : . : :::. : . : .... .: :: ..:. ... :. ::..: :.. ::: KHQBTT KYGLVSAKDLPFVDKARR-CSVPKAK-KVSVPSYHVFKGKEVMTRSLTSSPCSVYLSVSPELAKYKSGVFTGECGKSLNH 310 320 330 340 350 360 370 380 490 500 510 520 530 540 550 560 QUERY SVLLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPI .:.::: : : :: : :...:::. :::::.::: :.. : . ::. KHQBTT AVVLVGEGYDEVTKKRY-----------------------WVVQNSWGTDWGENGYMRLERTNMGTDK-CGVLDTSMSAF 390 400 410 420 430 QUERY L KHQBTT EL --------------------------------------------------------------------------- >>KHHUL cathepsin L (EC 3.4.22.15) precursor - human (333 aa) initn: 378 init1: 226 opt: 413 Z-score: 430.9 expect() 3.6e-17 Smith-Waterman score: 517; 30.812% identity in 357 aa overlap Entrez lookup Re-search database >KHHUL 223- 567: ------------------------------------------: 190 200 210 220 230 240 250 260 QUERY DDKKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHNKL ... :. ::..: .: :. ... :. :. ::. KHHUL MNPTLILAAFCLGIASATLTFDHSLEAQWTKWKAMHNRLYGMNEEGWRR-AVWEKNMKMIELHNQE 10 20 30 40 50 60 270 280 290 300 310 320 330 340 QUERY NKNAM--YKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYR ... . .: :.:.. ::... .. . . :: .:: .. .: ..:. .:.: KHHUL YREGKHSFTMAMNAFGDMTSEEFRQVMNGF---------QNRKP---------------RKGKVFQEPLFYEAPRSVDWR 70 80 90 100 110 120 350 360 370 380 390 400 410 QUERY EKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCS--KDNFGCDGGHPFYSFLYVLQNELCLGDE- ::: : :.:: :::::::...: .:. . .:. ..:.:::..:::: . : ::.:: :.: :: .: ..: KHHUL EKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAFQYVQDNGGLDSEES 130 140 150 160 170 180 190 200 420 430 440 450 460 470 480 490 QUERY YKYKAKDDMFCLNYRCKRKVSLSSIGAV----KENQLILALNEVGPLSVNVGVNND-FVAYSEGVY-NGTCS-EELNHSV : :.: .. : .: : .:. .. : : .:. :. :. :::.:: . .... :. :.::.: . :: :...:.: KHHUL YPYEATEES-C-KYNPKYSVA-NDTGFVDIPKQEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGV 210 220 230 240 250 260 270 500 510 520 530 540 550 560 QUERY LLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPIL :.:::: :.:. . ::: ::..::::...:: .:........ . :::. . :: KHHUL LVVGYG-FESTESD-NNK-------------------YWLVKNSWGEEWGMGGYVKMAKDRRNH---CGIASAASYPTV 280 290 300 310 320 330 --------------------------------------------------------------------------- >>JN0633 caricain (EC 3.4.22.30) I precursor - papaya (348 aa) initn: 508 init1: 302 opt: 413 Z-score: 430.6 expect() 3.8e-17 Smith-Waterman score: 580; 31.302% identity in 361 aa overlap Entrez lookup Re-search database >JN0633 210- 567: -------------------------------------------: 170 180 190 200 210 220 230 240 QUERY LLEINKLILREEKDDKKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIF :...: .. . . : ..: .::: :.:.::.. .:::: JN0633 MAMIPSISKLLFVAICLFVHMSVSFGDFSIVGYSQDDLTSTERLIQLFNSWMLNHNKFYENVDEKLYRFEIF 10 20 30 40 50 60 70 250 260 270 280 290 300 310 320 QUERY KINYISIKNHNKLNKNAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDI : : : . :: :: : .:.:.: :..:..: . : . . ..:.. : : .:.. JN0633 KDNLNYIDETNK--KNNSYWLGLNEFADLSNDEFNEKYVGSL-IDATIEQSYDEEFIN--EDTV---------------- 80 90 100 110 120 130 330 340 350 360 370 380 390 400 QUERY FSKVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKDNFGCDGGHPFYSFLYVLQ ..:: .:.:.:: : . :: :::::::..:...:.. .. ... .::::.::: . . :: ::.: :.. :: . JN0633 --NLPENVDWRKKGAVTPVRHQGSCGSCWAFSAVATVEGINKIRTGKLVELSEQELVDCERRSHGCKGGYPPYALEYVAK 140 150 160 170 180 190 200 410 420 430 440 450 460 470 480 QUERY NELCLGDEYKYKAKDDMFCLNYRCKRKVSLSSIGAVKENQLILALNEVG--PLSVNV-GVNNDFVAYSEGVYNGTCSEEL : . : ..: ::::. . :. :..: :. :. :: .. :.:: : . . : :. :...: :. .. JN0633 NGIHLRSKYPYKAKQGTCRAKQVGGPIVKTSGVGRVQPNNEGNLLNAIAKQPVSVVVESKGRPFQLYKGGIFEGPCGTKV 210 220 230 240 250 260 270 280 490 500 510 520 530 540 550 560 QUERY NHSVLLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFY .:.: :::: : ... : .:::::. :::.:..:..: ... ::. . .: JN0633 DHAVTAVGYG------------------KSGGKG-------YILIKNSWGTAWGEKGYIRIKRAPGNSPGVCGLYKSSYY 290 300 310 320 330 340 QUERY PIL : JN0633 PTKN --------------------------------------------------------------------------- >>A49868 probable cysteine proteinase OC-2 precursor, osteoclast - rabbit (329 aa) initn: 425 init1: 274 opt: 412 Z-score: 429.9 expect() 4.1e-17 Smith-Waterman score: 523; 32.948% identity in 346 aa overlap Entrez lookup Re-search database >A49868 230- 567: -----------------------------------------: 190 200 210 220 230 240 250 260 QUERY INDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKN-IDEQMRKFEIFK-INYISIKNHNKLNKNAM : ..: :.. .:: :.. : ...:::.: . A49868 MWGLKVLLLPVVSFALHPEEILDTQWELWKKTYSKQYNSKVDEISRRLIWEKNLKHISIHNLEASLGVHT 10 20 30 40 50 60 70 270 280 290 300 310 320 330 340 QUERY YKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYREKGIVHE :. .:...:.. ::. . . : :.:: : ..: .:.. : : ...:. .:::.:: : A49868 YELAMNHLGDMTSEEVVQKM-TGLKVP---------PSRSHSNDTLYIP-----------DWEGRTPDSIDYRKKGYVTP 80 90 100 110 120 350 360 370 380 390 400 410 420 QUERY PKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKDNFGCDGGHPFYSFLYVLQNE-LCLGDEYKYKAKDD- :.:: :::::::.::: .:. . ::. ..:..: :..::: ..:.:: ::. .: :: .:. . : : : ..:. A49868 VKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSENYGCGGGYMTNAFQYVQRNRGIDSEDAYPYVGQDES 130 140 150 160 170 180 190 200 430 440 450 460 470 480 490 500 QUERY -MFCLNYRCKRKVSLSSIGAVKENQLILALNEVGPLSVNVGVN-NDFVAYSEGVY-NGTCS-EELNHSVLLVGYGQVEKT :. . . . . : .:. : :. .:::.:: . .. ..: ::.::: . .:: ...::.:: :::: ..: A49868 CMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVYYDENCSSDNVNHAVLAVGYG-IQK- 210 220 230 240 250 260 270 280 510 520 530 540 550 560 QUERY KLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPIL .:: .:::::::...::..:.. ..::::. :::.. . .: A49868 ----GNK-------------------HWIIKNSWGESWGNKGYILMARNKNNA---CGIANLASFPKM 290 300 310 320 --------------------------------------------------------------------------- >>JN0634 caricain (EC 3.4.22.30) II precursor - papaya (367 aa) initn: 513 init1: 302 opt: 410 Z-score: 427.2 expect() 5.8e-17 Smith-Waterman score: 592; 31.768% identity in 362 aa overlap Entrez lookup Re-search database >JN0634 210- 568: -------------------------------------------: 170 180 190 200 210 220 230 240 QUERY LLEINKLILREEKDDKKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIF :...: .. . . : ..: .::: :.:.::.. .:::: JN0634 MAMIPSISKLLFVAICLFVHMSVSFGDFSIVGYSQDDLTSTERLIQLFNSWMLNHNKFYENVDEKLYRFEIF 10 20 30 40 50 60 70 250 260 270 280 290 300 310 320 QUERY KINYISIKNHNKLNKNAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDI : : : . :: :: :. .:.:.: :..:..: . : . . ..:.. : : .:: JN0634 KDNLNYIDETNK--KNNSYRLGLNEFADLSNDEFNEKYVGSL-IDATIEQSYDEEFIN-------------------EDI 80 90 100 110 120 130 330 340 350 360 370 380 390 400 QUERY FSKVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKDNFGCDGGHPFYSFLYVLQ . .:: .:.:.:: : . :: :::::::..:...:.. .. ... .::::.::: . . :: ::.: :.. :: . JN0634 VN-LPENVDWRKKGAVTPVRHQGSCGSCWAFSAVATVEGINKIRTGKLVELSEQELVDCERRSHGCKGGYPPYALEYVAK 140 150 160 170 180 190 200 410 420 430 440 450 460 470 480 QUERY NELCLGDEYKYKAKDDMFCLNYRCKRKVSLSSIGAVKENQLILALNEVG--PLSVNV-GVNNDFVAYSEGVYNGTCSEEL : . : ..: ::::. . :. :..: :. :. :: .. :.:: : . . : :. :...: :. .. JN0634 NGIHLRSKYPYKAKQGTCRAKQVGGPIVKTSGVGRVQPNNEGNLLNAIAKQPVSVVVESKGRPFQLYKGGIFEGPCGTKV 210 220 230 240 250 260 270 280 490 500 510 520 530 540 550 560 QUERY NHSVLLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFY .:.: ::::. ..: : .:::::. :::.:..:..: ... ::. . .: JN0634 DHAVTAVGYGKS-------GGK------------------GYILIKNSWGTAWGEKGYIRIKRAPGNSPGVCGLYKSSYY 290 300 310 320 330 340 QUERY PIL :: JN0634 PIKNRDNGRIQIRPSSQHLTSHE 350 360 --------------------------------------------------------------------------- >>S22502 endopeptidase - kidney bean (362 aa) initn: 478 init1: 218 opt: 407 Z-score: 424.2 expect() 8.5e-17 Smith-Waterman score: 598; 30.851% identity in 376 aa overlap Entrez lookup Re-search database >S22502 199- 568: ---------------------------------------------: 160 170 180 190 200 210 220 230 QUERY NRKELSNILHKLLEINKLILREEKDDKKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKN .: .:. . . .. .: . . . ... . :. : .. S22502 MATKKLLWVVLSFSLVLGVANSFDFHDKD-LASEESLWDLYERWRSHHTVSRS 10 20 30 40 50 240 250 260 270 280 290 300 310 QUERY IDEQMRKFEIFKINYISIKNHNKLNKNAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEF . :. ..:..:: : . ..: ::..: :: :.:.:.:....:.. . .: .: . . . : :: S22502 LGEKHKRFNVFKANLMHVHNTNKMDKP--YKLKLNKFADMTNHEFRSTYAGS-KVNHHRMFRGT-PHEN----------- 60 70 80 90 100 110 320 330 340 350 360 370 380 390 QUERY YTNGKRNEKDIFSKVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKD-NFGCDG : . . : :: .:.:.:: : . :::: :::::::..: .:.. :.......::::.:::.:. : ::.: S22502 ---GAFMYEKVVS-VPPSVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTNKLVALSEQELVDCDKEENQGCNG 120 130 140 150 160 170 180 190 400 410 420 430 440 450 460 470 QUERY GHPFYSFLYVLQNE-LCLGDEYKYKAKDDMFCLNYRCKRKVSLS---SIGAVKENQLILALNEVGPLSVNVGVN-NDFVA : .: .. :. . ..: :::.. . ::.. .. : :. :. :. . :.:: . .. .:: S22502 GLMESAFEFIKQKGGITTESNYPYKAQEGTCDASKVNDLAVSIDGHENVPANDEDALLKAVAN-QPVSVAIDAGGSDFQF 200 210 220 230 240 250 260 270 480 490 500 510 520 530 540 550 QUERY YSEGVYNGTCSEELNHSVLLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKN :::::..: :: .:::.: .:::: .: ...: :::..:::. .:::.:..:..:: . S22502 YSEGVFTGDCSTDLNHGVAIVGYG----------------TTVDGTN--------YWIVRNSWGPEWGEHGYIRMQRNIS 280 290 300 310 320 560 QUERY GDNVFCGIGEEVFYPIL . .:::. ::: S22502 KKEGLCGIAMLPSYPIKNSSDNPTGSFSSPKDEL 330 340 350 360 --------------------------------------------------------------------------- >>S06837 glycyl endopeptidase (EC 3.4.22.25) - papaya (216 aa) initn: 504 init1: 291 opt: 394 Z-score: 414.3 expect() 3e-16 Smith-Waterman score: 470; 34.728% identity in 239 aa overlap Entrez lookup Re-search database >S06837 333- 568: ----------------------------: 300 310 320 330 340 350 360 370 QUERY VPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAK .:: .:.: :: : : :: : :::::..:...:.. S06837 LPESVDWRAKGAVTPVKHQGYCESCWAFSTVATVEGINKI 10 20 30 40 380 390 400 410 420 430 440 450 QUERY KNKNILSFSEQEVVDCSKDNFGCDGGHPFYSFLYVLQNELCLGDEYKYKAKDDMFCLNYRCKRKVSLSSIGAVKENQLIL :. :.. .::::.:::. ...::. :. :. :: :: . : .: : ::.. : ::. ...: :. :. S06837 KTGNLVELSEQELVDCDLQSYGCNRGYQSTSLQYVAQNGIHLRAKYPYIAKQQTCRANQVGGPKVKTNGVGRVQSNNEGS 50 60 70 80 90 100 110 120 460 470 480 490 500 510 520 QUERY ALNEVG--PLSVNV-GVNNDFVAYSEGVYNGTCSEELNHSVLLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYW :: .. :.:: : ... :: :. :...:.:. ...:.: :::: : ... : S06837 LLNAIAHQPVSVVVESAGRDFQNYKGGIFEGSCGTKVDHAVTAVGYG------------------KSGGKG-------YI 130 140 150 160 170 530 540 550 560 QUERY IIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPIL .:::::. :::::..:. : .... ::. . .::: S06837 LIKNSWGPGWGENGYIRIRRASGNSPGVCGVYRSSYYPIKN 180 190 200 210 --------------------------------------------------------------------------- >>S57777 cysteine proteinase (EC 3.4.22.-) precursor - Hemerocallis x hybrida (360 aa) initn: 500 init1: 182 opt: 397 Z-score: 414.0 expect() 3.1e-16 Smith-Waterman score: 588; 31.728% identity in 353 aa overlap Entrez lookup Re-search database >S57777 225- 568: -----------------------------------------: 190 200 210 220 230 240 250 260 QUERY KKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHNKLNK ... . :. : ...::. :.:..:: : :.. :. .: S57777 MAKPKFIALALVALSFLSIAQSIPFTEKDLASEDSLWNLYEKWRTHHTVARDLDEKNRRFNVFKENVKFIHEFNQ-KK 10 20 30 40 50 60 70 270 280 290 300 310 320 330 340 QUERY NAMYKKKVNQFSDYSEEELKEYF---KTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEI-LDYR .: :: .:.:.:....:.. . : : .. :.: . : . .:. ...: .:.: S57777 DAPYKLALNKFGDMTNQEFRSKYAGSKIQHHRSQRGIQKNTGSF---MYENV-----------------GSLPAASIDWR 80 90 100 110 120 130 350 360 370 380 390 400 410 QUERY EKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDC-SKDNFGCDGGHPFYSFLYVLQNELCLGDEYK :: : :::: :::::::......:.. :. ...:.::::.::: .. : ::.:: :.: .. .: . : : S57777 AKGAVTGVKDQGQCGSCWAFSTIASVEGINQIKTGELVSLSEQELVDCDTSYNEGCNGGLMDYAFEFIQKNGITTEDSYP 140 150 160 170 180 190 200 210 420 430 440 450 460 470 480 490 QUERY YKAKDDMFCLNYRCKRKVSLSS---IGAVKENQLILALNEVGPLSVNVGVNN-DFVAYSEGVYNGTCSEELNHSVLLVGY : .: : . ::... . : .:: :. :. . :.::.. ... : :::::..: :. ::.:.: .::: S57777 YAEQDGTCASNLLNSPVVSIDGHQDVPANNENALMQAVAN-QPISVSIEASGYGFQFYSEGVFTGRCGTELDHGVAIVGY 220 230 240 250 260 270 280 290 500 510 520 530 540 550 560 QUERY GQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPIL : :..... :::.::::...:::.:..:..:. . :::. :. ::: S57777 GA----------------TRDGTK--------YWIVKNSWGEEWGESGYIRMQRGISDKRGKCGIAMEASYPIKTSANPK 300 310 320 330 340 350 S57777 NSSTRDEL 360 --------------------------------------------------------------------------- >>JA0159 cysteine proteinase (EC 3.4.22.-) precursor - tomato (fragment) (346 aa) initn: 492 init1: 224 opt: 395 Z-score: 412.2 expect() 4e-16 Smith-Waterman score: 478; 34.568% identity in 243 aa overlap Entrez lookup Re-search database >JA0159 333- 568: ----------------------------: 300 310 320 330 340 350 360 370 QUERY VPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAK .:: .:.::::.. :::: :::::::..:. .::. : JA0159 KLSKNKSDRYLPKVGDSLPESIDWREKGVLVGVKDQGSCGSCWAFSAVAAMESINAI 10 20 30 40 50 380 390 400 410 420 430 440 QUERY KNKNILSFSEQEVVDCSKD-NFGCDGGHPFYSFLYVLQNE-LCLGDEYKYKAKDDMFCLNYRCKRKV----SLSSIGAVK . :..:.::::.:::... : ::::: :.: .:..: . ..: :: .. . : .:: . :: : .. . . JA0159 VTGNLISLSEQELVDCDRSYNEGCDGGLMDYAFEFVIKNGGIDTEEDYPYKERNGV-CDQYRKNAKVVKIDSYEDVPVNN 60 70 80 90 100 110 120 130 450 460 470 480 490 500 510 520 QUERY ENQLILALNEVGPLSVNVGVNN-DFVAYSEGVYNGTCSEELNHSVLLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNI :. : :. . :.:. . ... :: :. :...: :. ..:.:...::: .: JA0159 EKALQKAVAH-QPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVVIAGYGT-------------------------ENG 140 150 160 170 180 190 530 540 550 560 QUERY IYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPIL . :::..:::. . :::..:..:: .... .::.. : ::. JA0159 MDYWIVRNSWGANCRENGYLRVQRNVSSSSGLCGLAIEPSYPVKTGPNPPKPAPSPPSPVKPPTECDEYSQCAVGTTCCC 200 210 220 230 240 250 260 270 JA0159 ILQFRRSCFSWGCCPLEGATCCEDHYSCCPHDYPICNVRQGTCSMSKGNPLGVKAMKRILAQPIGAFGNGGKKSSS 280 290 300 310 320 330 340 --------------------------------------------------------------------------- >>S12581 cysteine proteinase (EC 3.4.22.-) - black gram (362 aa) initn: 481 init1: 221 opt: 395 Z-score: 411.9 expect() 4.1e-16 Smith-Waterman score: 590; 30.239% identity in 377 aa overlap Entrez lookup Re-search database >S12581 199- 568: ---------------------------------------------: 160 170 180 190 200 210 220 230 QUERY NRKELSNILHKLLEINKLILREEKDDKKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKN .: .:. . . ....: ... . ... . :. : .. S12581 MAMKKLLWVVLSLSLVLGVANSFDFHEKD-LESEESLWDLYERWRSHHTVSRS 10 20 30 40 50 240 250 260 270 280 290 300 310 QUERY IDEQMRKFEIFKINYISIKNHNKLNKNAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEF . :. ..:..:: : . ..: ::..: :: :.:.:.:....:.. . .: .: . . :. : . ... S12581 LGEKHKRFNVFKANVMHVHNTNKMDKP--YKLKLNKFADMTNHEFRSTYAGS-KVNHHKMFRGSQ----HGSGTFMY--- 60 70 80 90 100 110 120 320 330 340 350 360 370 380 390 QUERY YTNGKRNEKDIFSKVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKD-NFGCDG :: ..:: .:.:.:: : . :::: :::::::... .:.. :.....:.::::.:::.:. : ::.: S12581 -------EK--VGSVPASVDWRKKGAVTDVKDQGQCGSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDKEENQGCNG 130 140 150 160 170 180 190 400 410 420 430 440 450 460 470 QUERY GHPFYSFLYVLQNE-LCLGDEYKYKAKDDMFCLNYRCKR-KVSLS---SIGAVKENQLILALNEVGPLSVNVGVN-NDFV : .: .. :. . ..: : :.. : . . . ::.. .. . :: :. :. . :.:: . .. .:: S12581 GLMESAFEFIKQKGGITTESNYPYTAQEGT-CDESKVNDLAVSIDGHENVPVNDENALLKAVAN-QPVSVAIDAGGSDFQ 200 210 220 230 240 250 260 270 480 490 500 510 520 530 540 550 QUERY AYSEGVYNGTCSEELNHSVLLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNK :::::..: :. .:::.: .:::: .: ...: :::..:::. .:::.:..:..:: S12581 FYSEGVFTGDCNTDLNHGVAIVGYG----------------TTVDGTN--------YWIVRNSWGPEWGEQGYIRMQRNI 280 290 300 310 320 560 QUERY NGDNVFCGIGEEVFYPIL . . .:::. . ::: S12581 SKKEGLCGIAMMASYPIKNSSDNPTGSLSSPKDEL 330 340 350 360 --------------------------------------------------------------------------- >>S44151 cathepsin L (EC 3.4.22.15) - fluke (Schistosoma mansoni) (317 aa) initn: 441 init1: 185 opt: 391 Z-score: 408.7 expect() 6.2e-16 Smith-Waterman score: 525; 32.597% identity in 362 aa overlap Entrez lookup Re-search database >S44151 218- 569: ------------------------------------------: 180 190 200 210 220 230 240 250 QUERY LREEKDDKKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYI-SI ...: . . .. ..::.:.. .: :: :: . :. .: S44151 VAIAQHLSLQYDDIWKQWKLKYNKTYSDSNEIRRK-AIF-MRYVEKI 10 20 30 40 260 270 280 290 300 310 320 330 QUERY KNHNKLNKNAM--YKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVP ..:: . .. : .::: :.. ::.: :.. :: : : : .. . :.. . . .: S44151 QQHNLRHDLGLEGYTMGLNQFCDMDWEEIK----TIM---------LSKVFGN--------SPLWDDKKEELELSNDPLP 50 60 70 80 90 100 340 350 360 370 380 390 400 410 QUERY EILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSK--DNFGCDGGHPFYSFLYVLQNEL :.:..: : :.::::::::::...: .:. ..::.:...:.:::..:::: : ::.:: :: :. . . S44151 SKWDWRDHGAVTPVKNQGLCGSCWAFSAAGAVEGQLVKKHKKLISLSEQQLVDCSYKYGNDGCQGGTMDQSFAYLEKYPI 110 120 130 140 150 160 170 180 420 430 440 450 460 470 480 QUERY CLGDEYKYKAKDDMFCLNYRCKRKVSLS---SIGAVKENQLILALNEVGPLSVNVGVNNDFVAYSEGVYNGT-CSEEL-N .::: ..:. : . : :... .. : :..: :: . ::.:: . . .:.. :. :.:.. :: : : S44151 ESEKDYKYIGHDSS-CHFRKSKGVVKVKKFVDLPARDEEKLQKALYHYGPISVAIDALDDLILYKSGIYESKQCSSFLLN 190 200 210 220 230 240 250 260 490 500 510 520 530 540 550 560 QUERY HSVLLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYP :.:: :::: .:: .. ::.:::::. :: ::...: :::.. .:::. .. .: S44151 HGVLAVGYG------------------RENRKD-------YWLIKNSWGTTWGMNGYFKLRRNKHN---MCGIATNASFP 270 280 290 300 310 QUERY IL .: S44151 LL --------------------------------------------------------------------------- >>JC2476 cathepsin K (EC 3.4.22.-) precursor - human (329 aa) initn: 420 init1: 264 opt: 391 Z-score: 408.5 expect() 6.4e-16 Smith-Waterman score: 516; 33.815% identity in 346 aa overlap Entrez lookup Re-search database >JC2476 230- 567: -----------------------------------------: 190 200 210 220 230 240 250 260 QUERY INDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKN-IDEQMRKFEIFK-INYISIKNHNKLNKNAM : : : :.: .:: :.. : ..::::.: . JC2476 MWGLKVLLLPVVSFALYPEEILDTHWELWKKTHRKQYNNKVDEISRRLIWEKNLKYISIHNLEASLGVHT 10 20 30 40 50 60 70 270 280 290 300 310 320 330 340 QUERY YKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYREKGIVHE :. .:...:.. ::. . . : :.:: ..:. .:.. : :. .:. .:. .:::.:: : JC2476 YELAMNHLGDMTSEEVVQKM-TGLKVP----LSHSRS-----NDTLYIPEW--EGR---------APDSVDYRKKGYVTP 80 90 100 110 120 350 360 370 380 390 400 410 420 QUERY PKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKDNFGCDGGHPFYSFLYVLQNE-LCLGDEYKYKAKDD- :.:: :::::::.::: .:. . ::. ..:..: :..::: ..: :: ::. .: :: .:. . : : : .... JC2476 VKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQEES 130 140 150 160 170 180 190 200 430 440 450 460 470 480 490 500 QUERY -MFCLNYRCKRKVSLSSIGAVKENQLILALNEVGPLSVNVGVN-NDFVAYSEGVY-NGTC-SEELNHSVLLVGYGQVEKT :. . . . . : .:. : :. .:::.:: . .. ..: ::.::: . .: :..:::.:: :::: ..: JC2476 CMYNPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNSDNLNHAVLAVGYG-IQK- 210 220 230 240 250 260 270 280 510 520 530 540 550 560 QUERY KLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPIL .:: .:::::::...::..:.. ..::::. :::.. . .: JC2476 ----GNK-------------------HWIIKNSWGENWGNKGYILMARNKNNA---CGIANLASFPKM 290 300 310 320 --------------------------------------------------------------------------- >>KHRTL cathepsin L (EC 3.4.22.15) precursor - rat (334 aa) initn: 320 init1: 189 opt: 390 Z-score: 407.4 expect() 7.4e-16 Smith-Waterman score: 523; 29.428% identity in 367 aa overlap Entrez lookup Re-search database >KHRTL 215- 569: -------------------------------------------: 180 190 200 210 220 230 240 250 QUERY KLILREEKDDKKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYI : . . ... .. . : ..: . .:. :. ... :. KHRTL MTPLLLLAVLCLGTALATPKFDQTFNAQWHHWKSTHRRLYGTNEEEWRR-AVWEKNMR 10 20 30 40 50 260 270 280 290 300 310 320 330 QUERY SIKNHNKLNKNAM--YKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSK :. :: .:. . ..: :.:...::... . : .:..: :. .. .. . KHRTL MIQLHNGEYSNGKHGFTMEMNAFGDMTNEEFRQIVNGYRH------QKHKK------------------GRLFQEPLMLQ 60 70 80 90 100 110 340 350 360 370 380 390 400 410 QUERY VPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKD--NFGCDGGHPFYSFLYVLQN .:. .:.:::: : :.:: :::::::.. : .:. . :. ...:.:::..::::.: : ::.:: ..: :. .: KHRTL IPKTVDWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHDQGNQGCNGGLMDFAFQYIKEN 120 130 140 150 160 170 180 190 420 430 440 450 460 470 480 QUERY ELCLGDE-YKYKAKDDMFCLNYRCKRKVSLSSIGAV----KENQLILALNEVGPLSVNVGVNNDFVA-YSEGVY-NGTCS ..: : :.::: : .:: . :. .. : : .:. :. :. :::.:: . ... . :: :.: . .:: KHRTL GGLDSEESYPYEAKDGS-C-KYRAEYAVA-NDTGFVDIPQQEKALMKAVATVGPISVAMDASHPSLQFYSSGIYYEPNCS 200 210 220 230 240 250 260 270 490 500 510 520 530 540 550 560 QUERY -EELNHSVLLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGE ..:.:.::.:::: :.... :. ::..::::.:.:: .:........:. ::.. KHRTL SKDLDHGVLVVGYGY------------------EGTDSNKDK---YWLVKNSWGKEWGMDGYIKIAKDRNNH---CGLAT 280 290 300 310 320 QUERY EVFYPIL . :::. KHRTL AASYPIVN 330 --------------------------------------------------------------------------- >>KHMSL cathepsin L (EC 3.4.22.15) precursor - mouse (334 aa) initn: 283 init1: 181 opt: 389 Z-score: 406.3 expect() 8.4e-16 Smith-Waterman score: 517; 30.245% identity in 367 aa overlap Entrez lookup Re-search database >KHMSL 215- 569: -------------------------------------------: 180 190 200 210 220 230 240 250 QUERY KLILREEKDDKKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYI : . ..... .. . : ..: . .:. :. :.. :. KHMSL MNLLLLLAVLCLGTALATPKFDQTFSAEWHQWKSTHRRLYGTNEEEWRR-AIWEKNMR 10 20 30 40 50 260 270 280 290 300 310 320 330 QUERY SIKNHNKLNKNAM--YKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSK :. :: .:.. .. ..: :.:...::... . : .:..: :. .. .. : KHMSL MIQLHNGEYSNGQHGFSMEMNAFGDMTNEEFRQVVNGYRH------QKHKK------------------GRLFQEPLMLK 60 70 80 90 100 110 340 350 360 370 380 390 400 410 QUERY VPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCS--KDNFGCDGGHPFYSFLYVLQN .:. .:.:::: : :.:: :::::::.. : .:. . :. ...:.:::..:::: . : ::.:: ..: :. .: KHMSL IPKSVDWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYIKEN 120 130 140 150 160 170 180 190 420 430 440 450 460 470 480 QUERY ELCLGDE-YKYKAKDDMFCLNYRCKRKVSLSSIGAV----KENQLILALNEVGPLSVNVGVNNDFVA-YSEGVY-NGTCS ..: : :.::: : .:: . :. .. : : .:. :. :. :::.:: . ... . :: :.: . .:: KHMSL GGLDSEESYPYEAKDGS-C-KYRAEFAVA-NDTGFVDIPQQEKALMKAVATVGPISVAMDASHPSLQFYSSGIYYEPNCS 200 210 220 230 240 250 260 270 490 500 510 520 530 540 550 560 QUERY -EELNHSVLLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGE ..:.:.::::::: :.. : :.:. ::..::::...:: .:........ :: ::.. KHMSL SKNLDHGVLLVGYG--------YEG------TDSNKNK-------YWLVKNSWGSEWGMEGYIKIAKDR--DN-HCGLAT 280 290 300 310 320 QUERY EVFYPIL . ::.. KHMSL AASYPVVN 330 --------------------------------------------------------------------------- >>JN0719 drought-inducible cysteine proteinase (EC 3.4.22.-) RD21A precursor (462 aa) initn: 459 init1: 219 opt: 391 Z-score: 406.3 expect() 8.5e-16 Smith-Waterman score: 551; 30.990% identity in 384 aa overlap Entrez lookup Re-search database >JN0719 193- 568: ---------------------------------------------: 160 170 180 190 200 210 220 230 QUERY NENNEENRKELSNILHKLLEINKLILREEKDDKKVYLINDNYDEK-GALEIGMNEEMKYKKEDPINNIKYASKFFKFMKE .:::: :. : ..: . .: : . . : : JN0719 MGFLKPTMAILFLAMVAVSSAVDMSIISYDEKHGVSTTGG------RSEAEVMSI-YEAWLVKHGKA 10 20 30 40 50 60 240 250 260 270 280 290 300 310 QUERY HNKVYKNIDEQMRKFEIFKINYISIKNHNKLNKNAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKD ... ... :. :.::::: : . .::. :: :. ...:.: ...: . . . :. : : . JN0719 QSQ--NSLVEKDRRFEIFKDNLRFVDEHNE--KNLSYRLGLTRFADLTNDEYRSKY---------LGAKMEKKGERR--- 70 80 90 100 110 120 320 330 340 350 360 370 380 390 QUERY NILISEFYTNGKRNEKDIFSKVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDC-SK .. : : . ...:: .:.:.:: : : :::: :::::::...: .:.. . .....::::.::: .. JN0719 ---------TSLRYEARVGDELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGAVEGINQIVTGDLITLSEQELVDCDTS 130 140 150 160 170 180 190 400 410 420 430 440 450 460 QUERY DNFGCDGGHPFYSFLYVLQNELCLGD-EYKYKAKDDMFCLNYRCKRKV----SLSSIGAVKENQLILALNEVGPLSVNVG : ::.:: :.: ....: : .: ::. : : . : . :: : .. . .:..: :. . :.:. . JN0719 YNEGCNGGLMDYAFEFIIKNGGIDTDKDYPYKGVDGT-CDQIRKNAKVVTIDSYEDVPTYSEESLKKAVAH-QPISIAIE 200 210 220 230 240 250 260 270 470 480 490 500 510 520 530 540 QUERY VNND-FVAYSEGVYNGTCSEELNHSVLLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGF ... : :. :...:.:. .:.:.:. :::: : : :. :::..:::.:.:::.:. JN0719 AGGRAFQLYDSGIFDGSCGTQLDHGVVAVGYG--------------TENGKD-----------YWIVRNSWGKSWGESGY 280 290 300 310 320 550 560 QUERY MRLSRNKNGDNVFCGIGEEVFYPIL .:..:: ... :::. : ::: JN0719 LRMARNIASSSGKCGIAIEPSYPIKNGENPPNPGPSPPSPIKPPTQCDSYYTCPESNTCCCLFEYGKYCFAWGCCPLEAA 330 340 350 360 370 380 390 400 --------------------------------------------------------------------------- >>JC5443 cathepsin L-like cysteine proteinase (EC 3.4.-.-) c1 - Maize weevil (338 aa) initn: 463 init1: 224 opt: 389 Z-score: 406.3 expect() 8.5e-16 Smith-Waterman score: 536; 32.102% identity in 352 aa overlap Entrez lookup Re-search database >JC5443 228- 569: -----------------------------------------: 190 200 210 220 230 240 250 260 QUERY YLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHNKLNKNAM : .:.: : . :. ...:: : .. .:::: .... JC5443 MKLFLILAAVVISCQAVSFYDLVQEQWSSFKMQHSKNYDSETEERFRMKIFMENAHKVAKHNKLFSQGF 10 20 30 40 50 60 270 280 290 300 310 320 330 340 QUERY YKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYREKGIVHE : :.. .. :.. .:. .:: . : :. :.::: . ... : . :.:. .:.:.:: : : JC5443 VKFKLG-LNKYADMLHHEFVSTL------------NGF-NKTKNNILKGSDLNDAVRFISPANVKLPDTVDWRDKGAVTE 70 80 90 100 110 120 130 350 360 370 380 390 400 410 420 QUERY PKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSK--DNFGCDGGHPFYSFLYVLQNE-LCLGDEYKYKAKD :::: :::::.:...:..:. .:. ...:.:::..:::: : ::.:: .: :. .: . : : :.: JC5443 VKDQGHCGSCWSFSATGSLEGQHFRKTGKLVSLSEQNLVDCSGRYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYLAED 140 150 160 170 180 190 200 210 430 440 450 460 470 480 490 QUERY DMFCLNYRCKRKVSLSS----IGAVKENQLILALNEVGPLSVNVGVNND-FVAYSEGVYNGT-CS-EELNHSVLLVGYGQ . : .:. . . . .. : ..:..: :. :::.:. . .... : ::.:::. :: .::.:.::.:::: JC5443 EK-C-HYKAQNSGATDKGFVDIEEANEDDLKAAVATVGPVSIAIDASHETFQLYSDGVYSDPECSSQELDHGVLVVGYG- 220 230 240 250 260 270 280 290 500 510 520 530 540 550 560 QUERY VEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPIL .:.. .: ::..::::. .:: ::.....::. ::. ::.. .. ::.. JC5443 -------------------TSDDGQD----YWLVKNSWGPSWGLNGYIKMARNQ--DNM-CGVASQASYPLV 300 310 320 330 --------------------------------------------------------------------------- >>S57776 cysteine proteinase - clove pink (fragment) (427 aa) initn: 537 init1: 214 opt: 390 Z-score: 405.8 expect() 9.1e-16 Smith-Waterman score: 542; 31.728% identity in 353 aa overlap Entrez lookup Re-search database >S57776 228- 568: -----------------------------------------: 190 200 210 220 230 240 250 260 QUERY YLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHNKLNKNA- .. .: : :. . :. ..: ::. : : .::. :... S57776 QAYHLQSWLVKHRKNYNALGEKEKRFAIFRDNLEFIDQHNNNNNGGG 10 20 30 40 270 280 290 300 310 320 330 340 QUERY --MYKKKVNQFSDYSEEELKE-YFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYREKG .. .:.:.: ...:... :: . : . :. .: ..: : ...:: .:.:.:: S57776 GGEFELGLNKFADLTNDEFRRIYFGV----------KRPEKAESVKSDRYAVKE----G--------DELPESVDWRKKG 50 60 70 80 90 100 350 360 370 380 390 400 410 420 QUERY IVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDC-SKDNFGCDGGHPFYSFLYVLQNELCLGD-EYKYK : . :::: :::::::...: .:.. . .....::::.::: .. : ::::: :.: ....: : .: :: S57776 AVSHVKDQGQCGSCWAFSAIGAVEGINKIVTGDLITLSEQELVDCDTSYNSGCDGGLMDYAFRFIINNGGIDTDKDYPYK 110 120 130 140 150 160 170 180 430 440 450 460 470 480 490 QUERY AKDDMFCLNYRCKRKV----SLSSIGAVKENQLILALNEVGP--LSVNVGVNNDFVAYSEGVYNGTCSEELNHSVLLVGY : : : . : . :: .: .. : .:. : :. . : :....: . :: :. ::..:.:. :.:.:. ::: S57776 ATDGS-CDSNRKNAKVVTIDGLEDVPANNEKALQKAVAH-QPVRLAIEAG-GRDFQLYKSGVFTGSCGTSLDHGVVAVGY 190 200 210 220 230 240 250 260 500 510 520 530 540 550 560 QUERY GQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPIL : .. : : :::..:::. :::.:..:. :: .. . :::. : ::. S57776 GTTDDGK-------------------D-----YWIVRNSWGDDWGEDGYIRMERNTESKSGKCGIAIEPSYPVKTSPNPP 270 280 290 300 310 S57776 NPGPSPPSPPPAPKVVCDSYSSCPSATTCCCVYEYGPYCYMWGCCPLEAASCCDDDSSCCPHDYPVCNTQQGTCSKSKNN 320 330 340 350 360 370 380 390 --------------------------------------------------------------------------- >>S03964 stem bromelain (EC 3.4.22.32) - pineapple (212 aa) initn: 373 init1: 196 opt: 384 Z-score: 404.2 expect() 1.1e-15 Smith-Waterman score: 453; 32.365% identity in 241 aa overlap Entrez lookup Re-search database >S03964 333- 569: ----------------------------: 300 310 320 330 340 350 360 370 QUERY VPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAK ::. .:.:. : : :.:. ::.:::::.....::.. S03964 AVPQSIDWRDYGAVTSVKNQNPCGACWAFAAIATVESIYKI 10 20 30 40 380 390 400 410 420 430 440 QUERY KNKNILSFSEQEVVDCSKDNFGCDGGHPFYSFLYVLQNE-LCLGDEYKYKAKDDMFCLNYRCKRKVSLSSIGAV---KEN :. . .:::.:.::.: ..:: :: : .: ....:. . : : ::: : . .. ... . : .:. S03964 KKGILEPLSEQQVLDCAK-GYGCKGGWEFRAFEFIISNKGVASGAIYPYKAAKGT-CKTDGVPNSAYITGYARVPRNNES 50 60 70 80 90 100 110 450 460 470 480 490 500 510 520 QUERY QLILALNEVGPLSVNVGVNNDFVAYSEGVYNGTCSEELNHSVLLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYY ... :... :..: : .: .: :. ::.:: :. :::.: .:::: :.::: S03964 SMMYAVSK-QPITVAVDANANFQYYKSGVFNGPCGTSLNHAVTAIGYGQ-------------------------DSIIY- 120 130 140 150 160 170 530 540 550 560 QUERY WIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPIL ..:. :::: :..:..:. .... .:::. . .:: : S03964 ---PKKWGAKWGEAGYIRMARDVSSSSGICGIAIDPLYPTLEE 180 190 200 210 --------------------------------------------------------------------------- >>JX0366 cysteine endopeptidase (EC 3.4.22.-) precursor - silkworm (344 aa) initn: 427 init1: 202 opt: 387 Z-score: 404.1 expect() 1.1e-15 Smith-Waterman score: 497; 30.328% identity in 366 aa overlap Entrez lookup Re-search database >JX0366 222- 569: ------------------------------------------: 190 200 210 220 230 240 250 QUERY KDDKKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKE--------HNKVYKNIDEQMRKFEIFKINY : .:: ..:: : ::. :. ...:. . JX0366 MKCLVLLLCAVAAVSAVQFFDLVKEEWSAFKLQHRLNYKSEVEDNFRMKIYAEHK 10 20 30 40 50 260 270 280 290 300 310 320 330 QUERY ISIKNHNKLNKNAM--YKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFS : .::. . .. :: .:.. .... .:. ::. . ..: .: : : :. .. . : . . JX0366 HIIAKHNQKYEMGLVSYKLGMNSWWEHGDMLHHEFVKTM-----NGFNKTAK----HNK-NLYMKGGSVRGAKFISPANV 60 70 80 90 100 110 120 340 350 360 370 380 390 400 QUERY KVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKD--NFGCDGGHPFYSFLYVLQ :.:: .:.:..: : . :::: :::::.:...: .:. ... ..:.:::...:::.. : ::.:: .: :. . JX0366 KLPEQVDWRKHGAVTDIKDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKD 130 140 150 160 170 180 190 200 410 420 430 440 450 460 470 480 QUERY NE-LCLGDEYKYKAKDDMFCLNYRC--KRKVSLSSIGAVKENQLILALNEVGPLSVNVGVNND-FVAYSEGVYNGT-CSE : . . : :.. :: : . . :.. .: :..:. :. :::.:: . ... : :: :::: :: JX0366 NGGIDTEQAYPYEGVDDKCRYNPKNTGAEDVGFVDIPEGDEQKLMEAVATVGPVSVAIDASHTHFQLYSSGVYNEEECSS 210 220 230 240 250 260 270 280 490 500 510 520 530 540 550 560 QUERY -ELNHSVLLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEE .:.:.::.:::: :.. . ::..::::...::: :.... ::::. :::. JX0366 TDLDHGVLVVGYGT------------------------DEQGVDYWLVKNSWGRSWGELGYIKMIRNKNNR---CGIASS 290 300 310 320 330 QUERY VFYPIL . ::.. JX0366 ASYPLV 340 --------------------------------------------------------------------------- >>KHRZOB oryzain (EC 3.4.22.-) beta precursor - rice (471 aa) initn: 463 init1: 208 opt: 386 Z-score: 401.0 expect() 1.7e-15 Smith-Waterman score: 515; 31.138% identity in 334 aa overlap Entrez lookup Re-search database >KHRZOB 241- 567: ---------------------------------------: 210 220 230 240 250 260 270 QUERY EIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHN-KLNKNAMYKKKVNQFSDYS :. :.: .: : . :: . .... .. .:.:.: . KHRZOB YNAEHGARGLEEGPTEAEARAAYDLWLAENGGGSPNALGGEHERRFLVFWDNLKFVDAHNARADEGGGFRLGMNRFADLT 30 40 50 60 70 80 90 100 280 290 300 310 320 330 340 350 QUERY EEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYREKGIVHEPKDQGLCGSCWA .::.. : . .. :. :. . :.: ..: ..:: .:.:::: : :.:: :::::: KHRZOB NEEFRATF-----LGAKVAER-SR----------------AAGERYRHDGVEELPESVDWREKGAVAPVKNQGQCGSCWA 110 120 130 140 150 160 360 370 380 390 400 410 420 430 QUERY FASVGNIESVFAKKNKNILSFSEQEVVDCSKD--NFGCDGGHPFYSFLYVLQNE-LCLGDEYKYKAKDDMFCLNYRCKRK :..:...::. . .....::::.:.:: . : ::.:: .: ....: . :.: ::: : .: . . KHRZOB FSAVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMADAFDFIIKNGGIDTEDDYPYKAVDGKCDINRENAKV 170 180 190 200 210 220 230 240 440 450 460 470 480 490 500 510 QUERY VSLSSIGAVKENQLILALNEVG--PLSVNVGVNN-DFVAYSEGVYNGTCSEELNHSVLLVGYGQVEKTKLNYNNKIQTYN ::.... : .:. . :. :.:: . ... .: : ::..: :. :.:.:. :::: : : KHRZOB VSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLYHSGVFSGRCGTSLDHGVVAVGYG--------------TDN 250 260 270 280 290 300 310 520 530 540 550 560 QUERY TKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPIL :. :::..:::. ::::.:..:. :: : . :::. . :: KHRZOB GKD-----------YWIVRNSWGPKWGESGYVRMERNINVTTGKCGIAMMASYPTKSGANPPKPSPTPPTPPTPPPPSAP 320 330 340 350 360 370 380 KHRZOB DHVCDDNFSCPAGSTCCCAFGFRNLCLVWGCCPVEGATCCKDHASCCPPDYPVCNTRAGTCSASKNSPLSVKALKRTLAK 390 400 410 420 430 440 450 460 --------------------------------------------------------------------------- >>PPPA papain (EC 3.4.22.2) precursor - papaya (345 aa) initn: 553 init1: 249 opt: 383 Z-score: 400.0 expect() 1.9e-15 Smith-Waterman score: 579; 31.233% identity in 365 aa overlap Entrez lookup Re-search database >PPPA 210- 568: -------------------------------------------: 170 180 190 200 210 220 230 240 QUERY LLEINKLILREEKDDKKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIF :...: .. . . : ..: .:::.::::::.. .:::: PPPA MAMIPSISKLLFVAICLFVYMGLSFGDFSIVGYSQNDLTSTERLIQLFESWMLKHNKIYKNIDEKIYRFEIF 10 20 30 40 50 60 70 250 260 270 280 290 300 310 320 QUERY KINYISIKNHNKLNKNAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDI : : : . :: :: : .: :.:.:..:.:: ::. . . : .:. . :. :. PPPA KDNLKYIDETNK--KNNSYWLGLNVFADMSNDEFKE--------------KYT----GSIAGNYTTTELSYEEVLNDGDV 80 90 100 110 120 130 330 340 350 360 370 380 390 400 QUERY FSKVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKDNFGCDGGHPFYSFLYVLQ ..:: .:.:.:: : :.:: :::::::..: .::... .. :. .::::..::.. ..::.::.:. .. : : PPPA --NIPEYVDWRQKGAVTPVKNQGSCGSCWAFSAVVTIEGIIKIRTGNLNEYSEQELLDCDRRSYGCNGGYPWSALQLVAQ 140 150 160 170 180 190 200 210 410 420 430 440 450 460 470 480 QUERY NELCLGDEYKYKAKDDMFCLNYRCKRKVSLSSIGAVK-----ENQLILALNEVGPLSVNV-GVNNDFVAYSEGVYNGTCS . . : :.. . .: . : : . .. :. . :. :. .. . :.:: . ....:: : :.. : :. PPPA YGIHYRNTYPYEGVQR-YCRS-REKGPYAAKTDGVRQVQPYNEGALLYSIAN-QPVSVVLEAAGKDFQLYRGGIFVGPCG 220 230 240 250 260 270 280 490 500 510 520 530 540 550 560 QUERY EELNHSVLLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEE ....:.: :::: :. : .:::::. :::::..:..:. ... ::. PPPA NKVDHAVAAVGYG------------------------PN-----YILIKNSWGTGWGENGYIRIKRGTGNSYGVCGLYTS 290 300 310 320 330 QUERY VFYPIL :::. PPPA SFYPVKN 340 --------------------------------------------------------------------------- >>KHRZOA oryzain (EC 3.4.22.-) alpha precursor - rice (458 aa) initn: 484 init1: 210 opt: 383 Z-score: 398.1 expect() 2.4e-15 Smith-Waterman score: 564; 31.549% identity in 355 aa overlap Entrez lookup Re-search database >KHRZOA 222- 568: ------------------------------------------: 190 200 210 220 230 240 250 260 QUERY KDDKKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMK-EHNKVYKNIDEQMRKFEIFKINYISIKNHN : ... : ::.: :. . :. :.. :. : : .:: KHRZOA MRISMALAAAALLLLLSLAAADMSIVSYGERSEEEARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHN 10 20 30 40 50 60 70 270 280 290 300 310 320 330 QUERY KLNKNAM--YKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILD .. .. .:.:.: ..:: .. . : . : . : ...: : .. :: .:: .: KHRZOA AAADAGVHSFRLGLNRFADLTNEEYRDTYLGLRNKPRR---------ERKVSDRYLAAD-------NEA-----LPESVD 80 90 100 110 120 130 340 350 360 370 380 390 400 410 QUERY YREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDC-SKDNFGCDGGHPFYSFLYVLQNE-LCLGD .: :: : : :::: :::::::.... .:.. . ...:.::::.::: .. : ::.:: :.: ....: . : KHRZOA WRTKGAVAEIKDQGGCGSCWAFSAIAAVEDINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAFDFIINNGGIDTED 140 150 160 170 180 190 200 210 420 430 440 450 460 470 480 490 QUERY EYKYKAKDDMFCLNYRCKRKVSLSSIGAVKENQLILALNEV--GPLSVNVGVNND-FVAYSEGVYNGTCSEELNHSVLLV .: ::.::. .: . . :...: : :. . : :.:: . ... : :: :...: :. :.:.: : KHRZOA DYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVRNQPVSVAIEAGGRAFQLYSSGIFTGKCGTALDHGVAAV 220 230 240 250 260 270 280 290 500 510 520 530 540 550 560 QUERY GYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPIL ::: : : :. :::..:::.:.:::.:..:. :: .... :::. : ::. KHRZOA GYG--------------TENGKD-----------YWIVRNSWGKSWGESGYVRMERNIKASSGKCGIAVEPSYPLKKGEN 300 310 320 330 340 KHRZOA PPNPGPTPPSPTPPPTVCDNYYTCPDSTTCCCIYEYGKYCYAWGCCPLEGATCCDDHYSCCPHEYPICNVQQGTCLMAKD 350 360 370 380 390 400 410 420 --------------------------------------------------------------------------- >>A53810 cathepsin L (EC 3.4.22.15) precursor - flesh fly (Sarcophaga peregri (339 aa) initn: 430 init1: 210 opt: 380 Z-score: 397.0 expect() 2.8e-15 Smith-Waterman score: 527; 31.421% identity in 366 aa overlap Entrez lookup Re-search database >A53810 215- 567: -------------------------------------------: 180 190 200 210 220 230 240 250 QUERY KLILREEKDDKKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYI :.. :: . .:. .: : : : :. ...::. : A53810 MRTVLVALLALVALTQAISPLDLIKEEWHTYKL--QHRKNYANEVEERFRMKIFNENRH 10 20 30 40 50 260 270 280 290 300 310 320 330 QUERY SIKNHNKL--NKNAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSK .: .::.: . .. :: .:...:. ..:.:: .. :. ..... . :.. : A53810 KIAKHNQLFAQGKVSYKLGLNKYADMLHHEFKETMNGYNHTLRQLMRERTG----------LVGATYI------PPAHVT 60 70 80 90 100 110 120 340 350 360 370 380 390 400 410 QUERY VPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKD--NFGCDGGHPFYSFLYVLQN ::. .:.::.: : :::: :::::::.:.: .:. .: ..:.:::..:::: : ::.:: .: :. .: A53810 VPKSVDWREHGAVTGVKDQGHCGSCWAFSSTGALEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDN 130 140 150 160 170 180 190 200 420 430 440 450 460 470 480 QUERY E-LCLGDEYKYKAKDDMFCLNYRCKRKVSLSSIGAVK-----ENQLILALNEVGPLSVNVGVNND-FVAYSEGVYN-GTC . : :.. :: .: : .. .. : : :... :. .::.:: . .... : ::::::: : A53810 GGIDTEKSYPYEGIDDSCHFN---KATIGATDTGFVDIPEGDEEKMKKAVATMGPVSVAIDASHESFQLYSEGVYNEPEC 210 220 230 240 250 260 270 490 500 510 520 530 540 550 560 QUERY SEE-LNHSVLLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIG .:. :.:.::.:::: :.. . ::..::::. :::.:.....::.:.. :::. A53810 DEQNLDHGVLVVGYGT------------------------DESGMDYWLVKNSWGTTWGEQGYIKMARNQNNQ---CGIA 280 290 300 310 320 330 QUERY EEVFYPIL :: A53810 TASSYPTV --------------------------------------------------------------------------- >>A58195 cathepsin L (EC 3.4.22.15) precursor - pig (334 aa) initn: 350 init1: 221 opt: 379 Z-score: 396.1 expect() 3.1e-15 Smith-Waterman score: 485; 29.379% identity in 354 aa overlap Entrez lookup Re-search database >A58195 225- 567: -----------------------------------------: 190 200 210 220 230 240 250 260 QUERY KKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHNKLNK ..:. :...: .: :. ... :. :. ::. . A58195 MKPSLFLTALCLGIASAAPKLDQNLDADWYKWKATHGRLYGMNEEGWRR-AVWEKNMKMIELHNQEYS 10 20 30 40 50 60 270 280 290 300 310 320 330 340 QUERY NAM--YKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYREK .. .. .: :.:...::... .. . . .:..: :: .... .::. .:.::: A58195 QGKHGFSMAMNAFGDMTNEEFRQVMNGFQN------QKHKK------------------GKVFHESLVLEVPKSVDWREK 70 80 90 100 110 120 350 360 370 380 390 400 410 QUERY GIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKD--NFGCDGGHPFYSFLYVLQNE-LCLGDEYK : : :.:: :::::::...: .:. . .:. ...:.:::..::::. : ::.:: .: :: .: : . : A58195 GYVTAVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGLMDNAFQYVKDNGGLDTEESYP 130 140 150 160 170 180 190 200 420 430 440 450 460 470 480 490 QUERY YKAKDDMFCLNYR--CKRKVSLSSIGAV-KENQLILALNEVGPLSVNVGV-NNDFVAYSEGVY-NGTCS-EELNHSVLLV : ... : .:. :. . . . .:. :. :. :::.:: . . ...: :. :.: . :: ..:.:.::.: A58195 YLGRETNSC-TYKPECSAANDTGFVDIPQREKALMKAVATVGPISVAIDAGHSSFQFYKSGIYYDPDCSSKDLDHGVLVV 210 220 230 240 250 260 270 280 500 510 520 530 540 550 560 QUERY GYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPIL ::: : : :.: .::.::::. .:: ::........:. :::. . :: A58195 GYG-FEGT--------------------DSNSSKFWIVKNSWGPEWGWNGYVKMAKDQNNH---CGISTAASYPTV 290 300 310 320 330 --------------------------------------------------------------------------- >>KHDOP prestalk cathepsin (EC 3.4.22.-) precursor - slime mold (Dictyosteliu (376 aa) initn: 301 init1: 179 opt: 378 Z-score: 394.3 expect() 3.9e-15 Smith-Waterman score: 528; 30.108% identity in 372 aa overlap Entrez lookup Re-search database >KHDOP 220- 568: ------------------------------------------: 180 190 200 210 220 230 240 250 QUERY EEKDDKKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNH .: . : .. . :. :.. .: .. ::: :. . : KHDOP MRLLVFLILLIFVNFSFANVRPNGRRFSESQYRTAFTEWTLKFNRQYSS-SEFSNRYSIFKSNMDYVDNW 10 20 30 40 50 60 260 270 280 290 300 310 320 330 QUERY NKLNKNAMYKKKVNQFSDYSEEEL-KEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILD :. . ... .:.:.: ..:: : :. : .: : . :. .. . . .. :: :. .: KHDOP NS-KGDSQTVLGLNNFADITNEEYRKTYLGT--RVNAHSYNGYDG------REVLNVEDLQTN------------PKSID 70 80 90 100 110 120 340 350 360 370 380 390 400 410 QUERY YREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCS--KDNFGCDGGHPFYSFLYVLQNE-LCLG .: :. : :::: :::::.:...:. :.. : :.:...:.:::..:::: ..::::::: .: :...:. . KHDOP WRTKNAVTPIKDQGQCGSCWSFSTTGSTEGAHALKTKKLVSLSEQNLVDCSGPEENFGCDGGLMNNAFDYIIKNKGIDTE 130 140 150 160 170 180 190 200 420 430 440 450 460 470 480 490 QUERY DEYKYKAKDDMFCLNYRCKRKVSLSSIGAVKENQLILALN--EVGPLSVNVGVN-NDFVAYSEGVY-NGTCSE-ELNHSV . : : :. :: . ..... . .. : : . ::.:: . .. :.: :. :.: . :: ::.:.: KHDOP SSYPYTAETGSTCLFNKSDIGATIKGYVNITAGSEISLENGAQHGPVSVAIDASHNSFQLYTSGIYYEPKCSPTELDHGV 210 220 230 240 250 260 270 280 500 510 520 530 540 550 QUERY LLVGYG--------------QVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNV :.:::: :. . : .::... . . .: .: : :::.::::. .:: .:.. .:...... KHDOP LVVGYGVQGKDDEGPVLNRKQTIVIHKNEDNKVESSDDSSDSVRPKAN--NYWIVKNSWGTSWGIKGYILMSKDRKNN-- 290 300 310 320 330 340 350 360 560 QUERY FCGIGEEVFYPIL :::. ::. KHDOP -CGIASVSSYPLA 370 --------------------------------------------------------------------------- >>JC5442 cathepsin L-like cysteine proteinase (EC 3.4.-.-) g3 - Maize weevil (331 aa) initn: 383 init1: 216 opt: 377 Z-score: 394.1 expect() 4e-15 Smith-Waterman score: 477; 31.487% identity in 343 aa overlap Entrez lookup Re-search database >JC5442 228- 560: -----------------------------------------: 190 200 210 220 230 240 250 260 QUERY YLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHNKLNKNAM : .:.: : . :. ...:: : .. .:.:: .... JC5442 MKLLLILAAVVISCQAVSFYDLVQEQWSSFKMQHSKNYDSETEERFRMKIFMENDHKVAKHSKLFSQGF 10 20 30 40 50 60 270 280 290 300 310 320 330 340 QUERY YKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYREKGIVHE : :.. .. :.. .:. .:: . : :. :.::: . ... : . :.:. .:.:.:: : . JC5442 VKFKLG-LNKYADMLHHEFVSTL------------NGF-NKTKNNILKGSDLNDAVRFISPANVKLPDTVDWRDKGAVTK 70 80 90 100 110 120 130 350 360 370 380 390 400 410 420 QUERY PKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCS--KDNFGCDGGHPFYSFLYVLQNE-LCLGDEYKYKAKD :::: :::::.:.. :..:. .:. ...:.:::..:::: : ::.:: .: :. .: . . : : :.: JC5442 VKDQGHCGSCWSFSGSGSLEGQHFRKTGKLVSLSEQNLVDCSGRYGNTGCNGGLMDNAFRYIKDNGGIDTEQSYPYLAED 140 150 160 170 180 190 200 210 430 440 450 460 470 480 490 QUERY DMFCLNYRCKRKVSLSS----IGAVKENQLILALNEVGPLSVNVGVNND-FVAYSEGVYNGT-CS-EELNHSVLLVGYGQ . : .:. . . . .. : .:..: :. :::.:. . .. . : ::.:::. :: .::.:.::.:::: JC5442 EK-C-HYKTQNSGATDKGFVDIEEGNEDDLKAAVATVGPVSIAIDASYETFQLYSDGVYSDPECSSQELDHGVLVVGYG- 220 230 240 250 260 270 280 290 500 510 520 530 540 550 560 QUERY VEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPIL .:.. .: ::..:::: . : ::.....::. ::. ::.. JC5442 -------------------TSDDGQD----YWLVKNSWRPSCGLNGYIKMARNQ--DNM-CGVAS 300 310 320 330 --------------------------------------------------------------------------- >>JC5441 cathepsin L-like cysteine proteinase (EC 3.4.-.-) g2 - Maize weevil (331 aa) initn: 381 init1: 216 opt: 376 Z-score: 393.1 expect() 4.6e-15 Smith-Waterman score: 477; 31.487% identity in 343 aa overlap Entrez lookup Re-search database >JC5441 228- 560: -----------------------------------------: 190 200 210 220 230 240 250 260 QUERY YLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHNKLNKNAM : .:.: : . :. ...:: : .. .:.:: .... JC5441 MKLLLILAAVVISCQAVSFYDLVQEQWSSFKMQHSKNYDSETEERFRMKIFMENAHKVAKHSKLFSQGF 10 20 30 40 50 60 270 280 290 300 310 320 330 340 QUERY YKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYREKGIVHE : :.. .. :.. .:. .:: . : :. :.::: . ... : . :.:. .:.:.:: : . JC5441 VKFKLG-LNKYADMLHHEFVSTL------------NGF-NKTKNNILKGSDLNDAVRFISPANVKLPDTVDWRDKGAVTK 70 80 90 100 110 120 130 350 360 370 380 390 400 410 420 QUERY PKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCS--KDNFGCDGGHPFYSFLYVLQNE-LCLGDEYKYKAKD :::: :::::.:.. :..:. .:. ...:.:::..:::: : ::.:: .: :. .: . . : : :.: JC5441 VKDQGHCGSCWSFSGSGSLEGQHFRKTGKLVSLSEQNLVDCSGRYGNNGCNGGLMDNAFRYIKDNGGIDTEQSYPYLAED 140 150 160 170 180 190 200 210 430 440 450 460 470 480 490 QUERY DMFCLNYRCKRKVSLSS----IGAVKENQLILALNEVGPLSVNVGVNND-FVAYSEGVYNGT-C-SEELNHSVLLVGYGQ . : .:. . . . .. : .:..: :. :::.:. . .. . : ::.:::. : :.::.:.::.:::: JC5441 EK-C-HYKTQNSGATDKGFVDIEEGNEDDLKAAVATVGPISIAIDASYETFQLYSDGVYSDPECISQELDHGVLVVGYG- 220 230 240 250 260 270 280 290 500 510 520 530 540 550 560 QUERY VEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPIL .:.. .: ::..:::: . : ::.....::. ::. ::.. JC5441 -------------------TSDDGQD----YWLVKNSWRPSCGLNGYIKMARNQ--DNM-CGVAS 300 310 320 330 --------------------------------------------------------------------------- >>A42482 cathepsin S (EC 3.4.22.27) - human (331 aa) initn: 382 init1: 179 opt: 376 Z-score: 393.1 expect() 4.6e-15 Smith-Waterman score: 482; 31.322% identity in 348 aa overlap Entrez lookup Re-search database >A42482 230- 567: -----------------------------------------: 190 200 210 220 230 240 250 260 QUERY INDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHNKLNKNAM-- : ..: ::. .:. . :.. : . :: .. .: A42482 MKRLVCVLLVCSSAVAQLHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLKFVMLHNLEHSMGMHS 10 20 30 40 50 60 70 270 280 290 300 310 320 330 340 QUERY YKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYREKGIVHE : .:...:.. ::. ..: .::.. . :: : : . :. :. .:.:::: : : A42482 YDLGMNHLGDMTSEEVMSLTSSL-RVPSQW------------QRNITY-------KSNPNRIL---PDSVDWREKGCVTE 80 90 100 110 120 350 360 370 380 390 400 410 420 QUERY PKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKDNFG---CDGGHPFYSFLYVLQNELCLGDE-YKYKAK : :: ::.::::..:: .:. . :. .....: :..:::: ...: :.:: .: :...:. .: : ::: A42482 VKYQGSCGACWAFSAVGALEAQLKLKTGKLVTLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAM 130 140 150 160 170 180 190 200 430 440 450 460 470 480 490 QUERY DD--MFCLNYRCKRKVSLSSIGAVKENQLILALNEVGPLSVNVGVNN-DFVAYSEGVY-NGTCSEELNHSVLLVGYGQVE :. .. .:: . . . .:. : :. . ::.::.: . . .: : ::: . .:....::.::.::::.. A42482 DQKCQYDSKYRAATCSKYTELPYGREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYGDL- 210 220 230 240 250 260 270 280 500 510 520 530 540 550 560 QUERY KTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPIL : :: ::..::::....::.:..:..::: :.. :::. :: A42482 -------------NGKE-----------YWLVKNSWGHNFGEEGYIRMARNK-GNH--CGIASFPSYPEI 290 300 310 320 330 --------------------------------------------------------------------------- >>S41428 cysteine proteinase (EC 3.4.22.-) CP2 precursor - Trichomonas vagina (314 aa) initn: 358 init1: 250 opt: 373 Z-score: 390.4 expect() 6.5e-15 Smith-Waterman score: 478; 29.167% identity in 360 aa overlap Entrez lookup Re-search database >S41428 217- 567: ------------------------------------------: 180 190 200 210 220 230 240 250 QUERY ILREEKDDKKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASK-FFKFMKEHNKVYKNIDEQMRKFEIFKINYIS : .:. : :. .:.: .. . . :: .. :. : S41428 MFAFLLSGATSNVLKHEEKAFLAYMRETGNFFTG-DEYHFRLGIYLANKRL 10 20 30 40 50 260 270 280 290 300 310 320 330 QUERY IKNHNKLNKNAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPE ...:: ::. .: .:... .. : ...:: . ... .: .. :. : . S41428 VQEHNAANKG--FKLGLNKLAHLTQSE----YRSLLGA-KRLGQKSGNFFKCDAPAN----------------------D 60 70 80 90 100 340 350 360 370 380 390 400 410 QUERY ILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKDNFGCDGGHPFYSFLYVLQNE---L .:.:.::::.. :::: :::::::... :: .:. ::..:...::..::: . .::.:: : .. ::.... . S41428 AVDWRDKGIVNKIKDQGQCGSCWAFSAIQASESRYAQANKQLLDLAEQNIVDCVTSCYGCNGGWPSKAIDYVVKHQAGKF 110 120 130 140 150 160 170 180 420 430 440 450 460 470 480 QUERY CLGDEYKYKAKDDMFCLNYRCKRKVSLSS-IGAVKENQLILA-LNEVGPLSVNVGVNN-DFVAYSEGVYN-GTCSE-ELN : .: : :.: : ... ...:.:.. ::... :: : .:: . ... .: :. :.:. .:: .:. S41428 MLTADYPYTARDGT-C-KFHASKSVGLTKGYDEVKDTEAELAKAASKGVVSVCIDASHYSFQLYTSGIYDEPSCSAWNLD 190 200 210 220 230 240 250 490 500 510 520 530 540 550 560 QUERY HSVLLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYP :.: ::::: : ..:. :::..:::. .:::.:..:. ..:... :::. :.. : S41428 HAVGLVGYG--------------TEGSKN-----------YWIVRNSWGTSWGEQGYIRMIKDKSNQ---CGIASEAILP 260 270 280 290 300 310 QUERY IL S41428 KAL --------------------------------------------------------------------------- >>S15844 cathepsin S (EC 3.4.22.27) - bovine (217 aa) initn: 337 init1: 199 opt: 370 Z-score: 389.7 expect() 7.1e-15 Smith-Waterman score: 446; 35.366% identity in 246 aa overlap Entrez lookup Re-search database >S15844 333- 567: ----------------------------: 300 310 320 330 340 350 360 370 QUERY VPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAK .:. .:.:::: : : : :: :::::::..:: .:. S15844 LPDSMDWREKGCVTEVKYQGACGSCWAFSAVGALEAQVKL 10 20 30 40 380 390 400 410 420 430 440 QUERY KNKNILSFSEQEVVDCSKDNFG---CDGGHPFYSFLYVLQNELCLGDE-YKYKAKD-----DMFCLNYRCKRKVSLSSIG :. ...:.: :..:::: ..: :.:: .: :...:. .. : ::: : :. :.: . : .: S15844 KTGKLVSLSAQNLVDCSTAKYGNKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAMDGKCQYDVKNRAATCSRYIELP-FG 50 60 70 80 90 100 110 450 460 470 480 490 500 510 520 QUERY AVKENQLILALNEVGPLSVNVGVNND-FVAYSEGVY-NGTCSEELNHSVLLVGYGQVEKTKLNYNNKIQTYNTKENSNQP . :. : :. . ::.::.. .... : :. ::: . .:....::.::.:::: : ..: S15844 S--EEALKEAVANKGPVSVGIDASHSSFFLYKTGVYYDPSCTQNVNHGVLVVGYG-------NLDGK------------- 120 130 140 150 160 170 530 540 550 560 QUERY DDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPIL : ::..::::. ..:..:..:..:: .:.. :::.. :: S15844 D-----YWLVKNSWGLHFGDQGYIRMARN-SGNH--CGIANYPSYPEI 180 190 200 210 --------------------------------------------------------------------------- >>S04222 chymopapain (EC 3.4.22.6) - papaya (218 aa) initn: 461 init1: 256 opt: 370 Z-score: 389.7 expect() 7.1e-15 Smith-Waterman score: 464; 35.246% identity in 244 aa overlap Entrez lookup Re-search database >S04222 334- 567: ----------------------------: 300 310 320 330 340 350 360 370 QUERY PNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKK :. .:.: :: : :.:: :::::::......:.. S04222 YPQSIDWRAKGAVTPVKNQGACGSCWAFSTIATVEGINKIV 10 20 30 40 380 390 400 410 420 430 440 QUERY NKNILSFSEQEVVDCSKDNFGCDGGHPFYSFLYVLQNELCLGDEYKYKAKDDMFCLNYRCKR------KVSLSSIGAVKE . :.: .::::.:::.: ..:: ::. :. :: .: . . : :.::. :.:. ::.... : S04222 TGNLLELSEQELVDCDKHSYGCKGGYQTTSLQYVANNGVHTSKVYPYQAKQ------YKCRATDKPGPKVKITGYKRVPS 50 60 70 80 90 100 110 450 460 470 480 490 500 510 520 QUERY N---QLILALNEVGPLSVNVGVNND-FVAYSEGVYNGTCSEELNHSVLLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDD : ... :: . :::: : ... : :. ::..: :. .:.:.: :::: :....: S04222 NCETSFLGALAN-QPLSVLVEAGGKPFQLYKSGVFDGPCGTKLDHAVTAVGYG-----------------TSDGKN---- 120 130 140 150 160 170 530 540 550 560 QUERY NIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPIL : ::::::. .:::.:.:::.:...... ::. . .::. S04222 ----YIIIKNSWGPNWGEKGYMRLKRQSGNSQGTCGVYKSSYYPFKGFA 180 190 200 210 --------------------------------------------------------------------------- >>S43991 cathepsin L-like proteinases - liver fluke (326 aa) initn: 454 init1: 236 opt: 372 Z-score: 389.1 expect() 7.7e-15 Smith-Waterman score: 510; 30.966% identity in 352 aa overlap Entrez lookup Re-search database >S43991 227- 569: -----------------------------------------: 190 200 210 220 230 240 250 260 QUERY VYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHNKLNKNA .. . .:: :.. :.: :. .:.. : :..:: . . S43991 MRLFILAVLTVGVLGSNDDLWHQWKRMYNKEYNGADDQHRR-NIWEKNVKHIQEHNLRHDLG 10 20 30 40 50 60 270 280 290 300 310 320 330 340 QUERY M--YKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYREKGI . : .:::.:.. ::.: . : . . .. ... :.: .:.. ::. .:.::.: S43991 LVTYTLGLNQFTDMTFEEFKAKYLTEMSRASDIL-SHGVPYE-------------ANNR--------AVPDKIDWRESGY 70 80 90 100 110 350 360 370 380 390 400 410 420 QUERY VHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSK--DNFGCDGGHPFYSFLYVLQNELCLGDEYKYKA : : :::: :::::::...:..:. . :.... .:::::..:::: : ::.:: .. :. : : . : : : S43991 VTEVKDQGNCGSCWAFSTTGTMEGQYMKNERTSISFSEQQLVDCSGPWGNNGCSGGLMENAYQYLKQFGLETESSYPYTA 120 130 140 150 160 170 180 190 430 440 450 460 470 480 490 QUERY KDDMFCLNYRCKRKVSLSSIGAVKENQLILALNEVG---PLSVNVGVNNDFVAYSEGVYNG-TCSE-ELNHSVLLVGYGQ . . : . ..... .:. .. . : :: : .: : :..::. : :.:.. ::: ..::.:: :::: S43991 VEGQ-CRYNKQLGVAKVTGYYTVHSGSEVELKNLVGARRPAAVAVDVESDFMMYRSGIYQSQTCSPLRVNHAVLAVGYG- 200 210 220 230 240 250 260 270 500 510 520 530 540 550 560 QUERY VEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPIL :. ... :::.::::. ::: :..:..::... .:::. . :.. S43991 ----------------TQGGTD--------YWIVKNSWGTYWGERGYIRMARNRGN---MCGIASLASLPMVARFP 280 290 300 310 320 --------------------------------------------------------------------------- >>S47434 cysteine proteinase - rice (378 aa) initn: 458 init1: 188 opt: 372 Z-score: 388.2 expect() 8.7e-15 Smith-Waterman score: 500; 31.343% identity in 335 aa overlap Entrez lookup Re-search database >S47434 241- 568: ---------------------------------------: 210 220 230 240 250 260 270 280 QUERY EIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHNKLNKNAMYKKKVNQFSDYSE : :.:..: : :.. :. . .. .:.:.:.. S47434 PFTESDLSSEESLRALYERWRSRYTVSRPAASGGVGNDDGEARRRFNVFVENARYIHEANRRGGRP-FRLALNKFADMTT 30 40 50 60 70 80 90 100 290 300 310 320 330 340 350 360 QUERY EELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYREKGIVHEPKDQGLCGSCWAF .:... . .. .: . .. :. . : .: ...: ..: .:.::.: : :::: ::::::: S47434 DEFRRTYAGS-RARHHRSLSGGRGGEG--------GSFRYGG--DDED---NLPPAVDWRERGAVTGIKDQGQCGSCWAF 110 120 130 140 150 160 170 370 380 390 400 410 420 430 QUERY ASVGNIESVFAKKNKNILSFSEQEVVDC-SKDNFGCDGGHPFYSFLYVLQNE-LCLGDEYKYKAKDDMFCLNYRCKRKVS ..:. .:.: :. ....::::.::: . :: ::::: :.: .. .: . ..: :.:.. .. :. S47434 STVAAVEGVNKIKTGRLVTLSEQELVDCDTGDNQGCDGGLMDYAFQFIKRNGGITTESNYPYRAEQGRCNKAKASSHDVT 180 190 200 210 220 230 240 250 440 450 460 470 480 490 500 510 QUERY LSS---IGAVKENQLILALNEVGPLSVNVGVNN-DFVAYSEGVYNGTCSEELNHSVLLVGYGQVEKTKLNYNNKIQTYNT ... . : :. : :. . :..: : ... :: :::::..: :. .:.:.: :::: : .: S47434 IDGYEDVPANDESALQKAVAN-QPVAVAVEASGQDFQFYSEGVFTGECGTDLDHGVAAVGYG------------ITRDGT 260 270 280 290 300 310 520 530 540 550 560 QUERY KENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGD-NVFCGIGEEVFYPIL : :::.::::.. ::: :..:..:. ..: : .:::. :. ::. S47434 K------------YWIVKNSWGEDWGERGYIRMQRGVSSDSNGLCGIAMEASYPVKSGARNAAASNRVVKDEM 320 330 340 350 360 370 --------------------------------------------------------------------------- >>S46476 cysteine proteinase III - mountain papaya (214 aa) initn: 353 init1: 262 opt: 368 Z-score: 387.8 expect() 9.1e-15 Smith-Waterman score: 423; 31.276% identity in 243 aa overlap Entrez lookup Re-search database >S46476 334- 567: ----------------------------: 300 310 320 330 340 350 360 370 QUERY PNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKK :: .:.:.:: : :.:: :::::::......:.. S46476 YPESIDWRKKGAVTPVKNQGSCGSCWAFSTIATVEGINKIV 10 20 30 40 380 390 400 410 420 430 440 QUERY NKNILSFSEQEVVDCSKDNFGCDGGHPFYSFLYVLQNELCLGDEYKYKAKDDMFCLNYRCKRK------VSLSSIGAVKE . :. :.::::.:::.. . :: ::. :. ::... . :: :. :. :.:. : :..:. : S46476 HGNLTSLSEQELVDCDRRSHGCKGGYQTTSLKYVVDHGVHTEKEYPYEEKQ------YKCRAKDKKPPIVKISGYKKVPS 50 60 70 80 90 100 110 450 460 470 480 490 500 510 520 QUERY NQLILALNEVGPLSVNVGVNND---FVAYSEGVYNGTCSEELNHSVLLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDN :. : .. .. :.: :.. : :..:...: :. ...:.: ::::. : S46476 NDEISLIKAIAKQPVSVLVESKGKAFQFYKKGIFGGPCGTKVDHAVTAVGYGK------------------------D-- 120 130 140 150 160 530 540 550 560 QUERY IIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPIL : .:::::. ::: :.....: .. . .::: . ..: S46476 ---YILIKNSWGPXWGEXGYIKIKRASGHCEGICGIYKSSYFPAEGYR 170 180 190 200 210 --------------------------------------------------------------------------- >>S47312 cysteine proteinase (EC 3.4.22.-) precursor - spring vetch (368 aa) initn: 470 init1: 174 opt: 369 Z-score: 385.3 expect() 1.3e-14 Smith-Waterman score: 529; 29.917% identity in 361 aa overlap Entrez lookup Re-search database >S47312 217- 568: ------------------------------------------: 180 190 200 210 220 230 240 250 QUERY ILREEKDDKKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISI .: . . . ... .:.:::... :. ..:.::: : : S47312 MASMTILPFFLFFSLITFSLALDIQLPTGRSNDEVMTMYEEWLVKHQKVYNGLREKDQRFQIFKDNLNFI 10 20 30 40 50 60 70 260 270 280 290 300 310 320 330 QUERY KNHNKLNKNAMYKKKVNQFSDYSEEELKE-YFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPE .:: .: : .:.:.:...:: .. :. : . .. .:..: .:.: . ...: S47312 DEHNA--QNYTYIVGLNKFADMTNEEYRDMYLGTRSDIKRRI-----------MKNKI-------TGHRYAYNSGDRLPV 80 90 100 110 120 130 340 350 360 370 380 390 400 410 QUERY ILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKD-NFGCDGGHPFYSFLYVLQNELCL .:.: :: . . :::: :::::::......:.. . ...:.::::.:::.. : ::.:: :.: ... : S47312 HVDWRLKGAITHIKDQGSCGSCWAFSTIATVEAINKIVTGKLVSLSEQELVDCDRAFNEGCNGGLMDYAFEFIIGNGGID 140 150 160 170 180 190 200 210 420 430 440 450 460 470 480 QUERY GDE-YKYKAKDDMFCLNYRCKRK-VSLS---SIGAVKENQLILALNEVGPLSVNVGVNNDFVA-YSEGVYNGTCSEELNH :. : ::. . : : : : ::.. .. . .:: : :. . :.:: . ... . :. ::..: :. :.: S47312 TDQHYPYKGFEGR-CDPTRKKAKIVSIDGYEDVPSNNENALKKAVAH-QPVSVAIEASGRALQLYQSGVFTGKCGTSLDH 220 230 240 250 260 270 280 490 500 510 520 530 540 550 560 QUERY SVLLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVF-CGIGEEVFYP .:..::::. .: . ::...:::. .:::.:.... :: .: .. :::. :. :: S47312 AVVIVGYGS-------------------------ENGLDYWLVRNSWGTNWGEDGYFKMERNVKGTHTGKCGIAVEASYP 290 300 310 320 330 340 QUERY IL . S47312 VKYGKNSAVTTNSAYEKTEVLVSSA 350 360 --------------------------------------------------------------------------- >>JC4848 cysteine proteinase (EC 3.4.22.-) - Douglas fir (454 aa) initn: 584 init1: 210 opt: 370 Z-score: 384.9 expect() 1.3e-14 Smith-Waterman score: 611; 33.062% identity in 369 aa overlap Entrez lookup Re-search database >JC4848 208- 568: --------------------------------------------: 170 180 190 200 210 220 230 240 QUERY HKLLEINKLILREEKDDKKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFE ..: ..: :.. . .. .:.:.:...::...:: JC4848 MGILLLFAVLALSAMAGSASRADFSIISYDSQDLIGDDAIMELYELWLAQHKKAYNGLDEKQKKFS 10 20 30 40 50 60 250 260 270 280 290 300 310 320 QUERY IFKINYISIKNHNKLNKNAMYKKKVNQFSDYSEEELKE-YFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNE .:: :.. :..::. . : :: .:::.: :.::.: :. : : . ... .. : .. :. :. JC4848 VFKDNFLYIHQHNNQG-NPSYKLGLNQFADLSHEEFKAAYLGTKLDAKKRLSRSPSPRYQ------------YSVGE--- 70 80 90 100 110 120 130 330 340 350 360 370 380 390 400 QUERY KDIFSKVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDC-SKDNFGCDGGHPFYSFL :. :: .:.:::: : :.:: :::::::..:. .:.. . :. :.::::.::: .. : ::.:: :.: JC4848 -DL----PESIDWREKGAVTAVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELVDCDTSYNQGCNGGLMDYAFQ 140 150 160 170 180 190 200 410 420 430 440 450 460 470 480 QUERY YVLQNE-LCLGDEYKYKAKDDMFCLNYRCKRKV-SLSSIGAVKEN--QLILALNEVGPLSVNVGVNND-FVAYSEGVYNG ....: : :.: :::.. : :: . .: .... : :: . . :.:: . ... : : ::... JC4848 FIISNGGLDSEDDYPYKANNGS-CDAYRKNAHVVTIDDYEDVPENDEKSLKKAAANQPISVAIEASGRAFQFYESGVFTS 210 220 230 240 250 260 270 280 490 500 510 520 530 540 550 QUERY TCSEELNHSVLLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNV-FCG .:. .:.:.: :::::. .. : ::..::::...:::.::..:.:: .: .. .:: JC4848 NCGTQLDHGVTLVGYGS-------------------------ESGIDYWLVKNSWGNSWGEKGFIKLQRNLEGASTGMCG 290 300 310 320 330 560 QUERY IGEEVFYPIL :. :. ::. JC4848 IAMEASYPVKKGANPPNPGPSPPSPVKPPTVCDNYYSCPESNTCCCMYDFGGYCYAWGCCPLNSATCCDDHYSCCPSDHP 340 350 360 370 380 390 400 410 --------------------------------------------------------------------------- >>KHHUH cathepsin H (EC 3.4.22.16) precursor - human (335 aa) initn: 364 init1: 164 opt: 364 Z-score: 380.8 expect() 2.2e-14 Smith-Waterman score: 552; 32.584% identity in 356 aa overlap Entrez lookup Re-search database >KHHUH 225- 568: -----------------------------------------: 190 200 210 220 230 240 250 260 QUERY KKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHNKLNK : ..:..: :.:.. .: .... : :. .:. :: : KHHUH MWATLPLLCAGAWLLGVPVCGAAELSVNSLEKFHFKSWMSKHRKTYST-EEYHHRLQTFASNWRKINAHN--NG 10 20 30 40 50 60 70 270 280 290 300 310 320 330 340 QUERY NAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYREKGI : .: .:::::.: :.:. : : :.. : :.:. . . : .:.:.:: KHHUH NHTFKMALNQFSDMSFAEIKH--KYLWSEPQNCSATKS----NYLRGT------------------GPYPPSVDWRKKGN 80 90 100 110 120 350 360 370 380 390 400 410 420 QUERY VHEP-KDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKD--NFGCDGGHPFYSFLYVLQNELCLG-DEYKY : :.:: :::::.:...: .::..: . ..::..::..:::..: :.::.:: : .: :.: :. .: : : : KHHUH FVSPVKNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNYGCQGGLPSQAFEYILYNKGIMGEDTYPY 130 140 150 160 170 180 190 200 430 440 450 460 470 480 490 QUERY KAKDDMFCLNYRCKRKVSL----SSIGAVKENQLILALNEVGPLSVNVGVNNDFVAYSEGVYNGT-C---SEELNHSVLL ..:: .: ... . ... ..: :. .. :. .:.: :..::. : :.:..: : ...::.:: KHHUH QGKDG-YC-KFQPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFAFEVTQDFMMYRTGIYSSTSCHKTPDKVNHAVLA 210 220 230 240 250 260 270 280 500 510 520 530 540 550 560 QUERY VGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPIL :::: :: : : :::.::::. .:: ::.. . :.:: .::.. . ::: KHHUH VGYG--EK-----------------------NGIPYWIVKNSWGPQWGMNGYFLIERGKN----MCGLAACASYPIPLV 290 300 310 320 330 --------------------------------------------------------------------------- >>JQ1111 cysteine proteinase (EC 3.4.22.-) EP-B 1 precursor - barley (371 aa) initn: 476 init1: 181 opt: 364 Z-score: 380.1 expect() 2.4e-14 Smith-Waterman score: 502; 30.791% identity in 354 aa overlap Entrez lookup Re-search database >JQ1111 225- 568: -----------------------------------------: 190 200 210 220 230 240 250 260 QUERY KKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHNKLNK . .... : .: .. :. :.: :: : :..::: . JQ1111 KKLLVASMVAAVLAVAAVELCSAIPMEDKDLESEEALWDLYERWQSAH-RVRRHHAEKHRRFGTFKSNAHFIHSHNKRGD 10 20 30 40 50 60 70 80 270 280 290 300 310 320 330 340 QUERY NAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYREKGI . :. ..:.:.:... :.. : : . . .:: . :. . : .:. : .:.:.:: JQ1111 HP-YRLHLNRFGDMDQAEFRATF-----VGDLRRDTPAKPPS--------VPGFMYAAL-NVSDL----PPSVDWRQKGA 90 100 110 120 130 140 350 360 370 380 390 400 410 420 QUERY VHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDC-SKDNFGCDGGHPFYSFLYVLQNE-LCLGDEYKYKA : :::: :::::::..: ..:.. : .. ...:.::::..:: . :: ::.:: .: :. .: : : :.: JQ1111 VTGVKDQGKCGSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDTADNDGCQGGLMDNAFEYIKNNGGLITEAAYPYRA 150 160 170 180 190 200 210 220 430 440 450 460 470 480 490 QUERY KDDMFCLNYRCKRKVSL-------SSIGAVKENQLILALNEVGPLSVNVGVNND-FVAYSEGVYNGTCSEELNHSVLLVG : : .. . ... : .:..: :. . :.:: : ... :. :::::..: :. ::.:.: .:: JQ1111 ARGT-CNVARAAQNSPVVVHIDGHQDVPANSEEDLARAVAN-QPVSVAVEASGKAFMFYSEGVFTGDCGTELDHGVAVVG 230 240 250 260 270 280 290 300 500 510 520 530 540 550 560 QUERY YGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPIL :: .: : :: .::::. .:::.:..:. ....... .:::. :. ::. JQ1111 YGVAEDGKA------------------------YWTVKNSWGPSWGEQGYIRVEKDSGASGGLCGIAMEASYPVKTYNKP 310 320 330 340 350 JQ1111 MPRRALGAWESQ 360 370 --------------------------------------------------------------------------- >>JQ1110 cysteine proteinase (EC 3.4.22.-) EP-B 4 precursor - barley (373 aa) initn: 476 init1: 181 opt: 364 Z-score: 380.1 expect() 2.4e-14 Smith-Waterman score: 506; 31.073% identity in 354 aa overlap Entrez lookup Re-search database >JQ1110 225- 568: -----------------------------------------: 190 200 210 220 230 240 250 260 QUERY KKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHNKLNK . .... : .: .. :. :.: :: : :..::: . JQ1110 KKLLVASMVAAVLAVAAVELCSAIPMEDKDLESEEALWDLYERWQSAH-RVRRHHAEKHRRFGTFKSNAHFIHSHNKRGD 10 20 30 40 50 60 70 80 270 280 290 300 310 320 330 340 QUERY NAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYREKGI . :. ..:.:.:... :.. : : . . ::: . :. . : .:. : .:.:.:: JQ1110 HP-YRLHLNRFGDMDQAEFRATF-----VGDLRRDTPSKPPS--------VPGFMYAAL-NVSDL----PPSVDWRQKGA 90 100 110 120 130 140 350 360 370 380 390 400 410 420 QUERY VHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDC-SKDNFGCDGGHPFYSFLYVLQNE-LCLGDEYKYKA : :::: :::::::..: ..:.. : .. ...:.::::..:: . :: ::.:: .: :. .: : : :.: JQ1110 VTGVKDQGKCGSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDTADNDGCQGGLMDNAFEYIKNNGGLITEAAYPYRA 150 160 170 180 190 200 210 220 430 440 450 460 470 480 490 QUERY KDDMFCLNYRCKRKVSL-------SSIGAVKENQLILALNEVGPLSVNVGVNND-FVAYSEGVYNGTCSEELNHSVLLVG : : .. . ... : .:..: :. . :.:: : ... :. :::::..: :. ::.:.: .:: JQ1110 ARGT-CNVARAAQNSPVVVHIDGHQDVPANSEEDLARAVAN-QPVSVAVEASGKAFMFYSEGVFTGECGTELDHGVAVVG 230 240 250 260 270 280 290 300 500 510 520 530 540 550 560 QUERY YGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPIL :: .: : :: .::::. .:::.:..:. ....... .:::. :. ::. JQ1110 YGVAEDGKA------------------------YWTVKNSWGPSWGEQGYIRVEKDSGASGGLCGIAMEASYPVKTYSKP 310 320 330 340 350 JQ1110 KPTPRRALGARESL 360 370 --------------------------------------------------------------------------- >>S24602 cysteine proteinase tpp (EC 3.4.22.-) - garden pea (464 aa) initn: 486 init1: 200 opt: 364 Z-score: 378.6 expect() 2.9e-14 Smith-Waterman score: 561; 33.152% identity in 368 aa overlap Entrez lookup Re-search database >S24602 212- 568: -------------------------------------------: 180 190 200 210 220 230 240 250 QUERY EINKLILREEKDDKKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKI : : .: . . . ... .:.: :. . :. ..::::: S24602 MLSKLTILFITLTFTLSLALDMCIISYDKTHPDKSTPRTNDQVLTMYEEWLVKHGKNYNALGEKEKRFEIFKD 10 20 30 40 50 60 70 260 270 280 290 300 310 320 330 QUERY NYISIKNHNKLNKNAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFS : : .:: .:: .. .:.:.: ..:: . : ::. .: .. :: : . . S24602 NLGFIDEHN--SKNLSFRLGLNRFADLTNEEYRTRFLGTRINPNRRNRKVNSQ---------------TN--RYATRVGD 80 90 100 110 120 130 340 350 360 370 380 390 400 410 QUERY KVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDC-SKDNFGCDGGHPFYSFLYVLQN :.:: .:.:..: : :::: :::::::.... .:.: . ...:.::::.::: .. : ::.:: :.: ... : S24602 KLPESVDWRKEGAVVGVKDQGSCGSCWAFSAIAAVEGVNKLATGDLISLSEQELVDCDTSYNEGCNGGLMDYAFEFII-N 140 150 160 170 180 190 200 210 420 430 440 450 460 470 480 QUERY ELCLGDE--YKYKAKDDMFCLNYRCKRKVSLSS---IGAVKENQLILAL-NEVGPLSVNVGVNNDFVAYSEGVYNGTCSE . : : : :.: : : . . ::... . : :. : :. :.: ..:. : . .: :. ::..: :. S24602 MVALTPEEDYPYRAIDGRCDQNRKNAKVVSIDQYEDVPAYDEGALKKAVANQVIAVAVEGG-GREFQLYDSGVFTGRCGT 220 230 240 250 260 270 280 290 490 500 510 520 530 540 550 560 QUERY ELNHSVLLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRN----KNGDNVFCGI :.:.: :::: : : :. :::..:::. .::: :..:: :: :.: ::: S24602 ALDHGVAAVGYG--------------TENGKD-----------YWIVRNSWGGSWGEAGYIRLERNLATSKSGK---CGI 300 310 320 330 340 QUERY GEEVFYPIL . : ::: S24602 AIEPSYPIKNGLNPPKPAPSPPSPVKPPSVCDSYSCAEGSTCCCIFDYGGSCFEWGCCPLESATCCDDHYSCCPHEYPVC 350 360 370 380 390 400 410 420 --------------------------------------------------------------------------- >>KHBH aleurain (EC 3.4.22.-) precursor - barley (361 aa) initn: 422 init1: 196 opt: 361 Z-score: 377.2 expect() 3.5e-14 Smith-Waterman score: 491; 30.278% identity in 360 aa overlap Entrez lookup Re-search database >KHBH 220- 569: ------------------------------------------: 180 190 200 210 220 230 240 250 QUERY EEKDDKKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNH ..: .: .: ...: :.. : :.:.::. . ... KHBH TAAVAVASSSSFADSNPIRPVTDRAASTLESAVLGALGRTRHALRFARFAVRYGKSYESAAEVRRRFRIFSESLEEVRST 20 30 40 50 60 70 80 90 260 270 280 290 300 310 320 330 QUERY NKLNKNAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDY :. :. :. .:.:::.: ::.. : : . .. : : :. .: . .:: :. KHBH NR--KGLPYRLGINRFSDMSWEEFQA---TRLGA--------AQTCSATLAGNHLM-----------RDA-AALPETKDW 100 110 120 130 140 150 340 350 360 370 380 390 400 410 QUERY REKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSK--DNFGCDGGHPFYSFLYVLQNELCLGDE :: ::: :.:. :::::.:...: .:..... . . .:.:::..:::. .::::.:: : .: : .. . . KHBH REDGIVSPVKNQAHCGSCWTFSTTGALEAAYTQATGKNISLSEQQLVDCAGGFNNFGCNGGLPSQAFEYQYNGGIDTEES 160 170 180 190 200 210 220 230 420 430 440 450 460 470 480 QUERY YKYKAKDDMFCLNYRCKRKVS--LSSIGAV--KENQLILALNEVGPLSVNVGVNNDFVAYSEGVYN----GTCSEELNHS : ::. . . : .:. . . :.:.. . :..: :.. : :.:: : . : :. :::. :: ...::. KHBH YPYKGVNGV-C-HYKAENAAVQVLDSVNITLNAEDELKNAVGLVRPVSVAFQVIDGFRQYKSGVYTSDHCGTTPDDVNHA 240 250 260 270 280 290 300 490 500 510 520 530 540 550 560 QUERY VLLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPIL :: :::: :: : . ::. ::::. ::.::.... .:: .:.:. . ::.. KHBH VLAVGYG-VE------------------------NGVPYWLTKNSWGADWGDNGYFKMEMGKN----MCAIATCASYPVV 310 320 330 340 350 KHBH AA 360 --------------------------------------------------------------------------- >>S67481 cysteine proteinase CP1 - fruit fly (Drosophila melanogaster) (fragm (218 aa) initn: 379 init1: 183 opt: 355 Z-score: 374.3 expect() 5.1e-14 Smith-Waterman score: 447; 35.366% identity in 246 aa overlap Entrez lookup Re-search database >S67481 333- 569: ----------------------------: 300 310 320 330 340 350 360 370 QUERY VPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAK .:. .:.: :: : :::: :::::::.:.: .:. . S67481 LPKSVDWRTKGAVTAVKDQGHCGSCWAFSSTGALEGQHFR 10 20 30 40 380 390 400 410 420 430 440 QUERY KNKNILSFSEQEVVDCSK--DNFGCDGGHPFYSFLYVLQNE-LCLGDEYKYKAKDDMFCLNYRCKRKVS---LSSIGAVK :. ..:.:::..:::: : ::.:: .: :. .: . : :.: :: : : . .. ...: S67481 KSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFPYIKDNGGIDTEKSYPYEAIDDS-CHFNRAQVGATDRGFTDIPQGD 50 60 70 80 90 100 110 450 460 470 480 490 500 510 520 QUERY ENQLILALNEVGPLSVNVGVNND-FVAYSEGVYN-GTC-SEELNHSVLLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDD :... . :::.:: . .... : ::::::: : ...:.:.::.::.: : :.... S67481 EKKMPEPVPTVGPVSVAIDASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGFG-----------------TDESGED--- 120 130 140 150 160 170 530 540 550 560 QUERY NIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPIL ::..::::. ::..::... :::... :::. ::.. S67481 ----YWLVKNSWGTTWGDKGFIKMLRNKENQ---CGIASPSSYPLV 180 190 200 210 --------------------------------------------------------------------------- >>S47433 cathepsin L (EC 3.4.22.15) - Norway lobster (313 aa) initn: 385 init1: 194 opt: 355 Z-score: 372.0 expect() 6.9e-14 Smith-Waterman score: 484; 30.595% identity in 353 aa overlap Entrez lookup Re-search database >S47433 228- 567: -----------------------------------------: 190 200 210 220 230 240 250 260 QUERY YLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHNKL--NKN : .... : . :.. . ..:. : .. :: : . S47433 CGLALATASPSWEHFKTQYGRKYGDAKEELYRQRVFQQNEQLVEAFNKKFENGE 10 20 30 40 50 270 280 290 300 310 320 330 340 QUERY AMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYREKGIV . .: .:::.:...::. : ... :.: ... . : ..:. :. :.: :: : S47433 VTFKVAMNQFGDMTNEEF-----------NAVMKGYKKGSRGEPT-----TVFTAEGRPMAADV--------DWRTKGAV 60 70 80 90 100 110 350 360 370 380 390 400 410 420 QUERY HEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKD--NFGCDGGHPFYSFLYVLQNE-LCLGDEYKYKA :::: :::::::...:..:. ::....:.::::.:::: . : :: :: .: :. .: . . : :.: S47433 TPVKDQGQCGSCWAFSATGSLEGQHFLKNNELVSLSEQELVDCSTEYGNDGCGGGWMTSAFDYIKDNGGIDTESSYPYEA 120 130 140 150 160 170 180 190 430 440 450 460 470 480 490 QUERY KD-----DMFCLNYRCKRKVSLSSIGAVKENQLILALNEVGPLSVNVGVNN-DFVAYSEGVY-NGTCSE-ELNHSVLLVG .: : .. : : .. :. : :....::.:: . ... .: :: ::: . :: .:.:.:: :: S47433 QDRSCRFDANSIGATCTGFVEVQHT----EEALHEAVSDIGPISVAIDASHFSFQFYSSGVYYEKKCSPTNLDHGVLAVG 200 210 220 230 240 250 260 500 510 520 530 540 550 560 QUERY YGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPIL :: : .:.. ::..::::.. ::. :....:::.... :::. : :: S47433 YG--------------TESTED-----------YWLVKNSWGSGWGDAGYIKMSRNRDNN---CGIASEPSYPTV 270 280 290 300 310 --------------------------------------------------------------------------- >>S29245 cysteine proteinase (EC 3.4.22.-) precursor - Leishmania mexicana (443 aa) initn: 403 init1: 270 opt: 357 Z-score: 371.8 expect() 7.1e-14 Smith-Waterman score: 448; 28.169% identity in 355 aa overlap Entrez lookup Re-search database >S29245 222- 568: ------------------------------------------: 190 200 210 220 230 240 250 260 QUERY KDDKKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHNK :. : .: . ....:... :..... :. : ...:. S29245 MATSRAALCAVAVVCVVLAAACAPARAIHVGTPAAALFEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQA 10 20 30 40 50 60 70 270 280 290 300 310 320 330 340 QUERY LNKNAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYRE : .:.. ...: : :: :. . . : . : .. : .. . :. : ::. .:.:: S29245 RNPHAQFG--ITKFFDLSEAEFAARYLNG--------AAYFAAAKRH------AAQHY---RKARADL-SAVPDAVDWRE 80 90 100 110 120 130 350 360 370 380 390 400 410 QUERY KGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKDNFGCDGGHPFYSFLYVLQN---ELCLGDEY :: : :::: :::::::..:::::. . .....:.:::..:.:. . ::.:: . .: ..::: .: : : S29245 KGAVTPVKDQGACGSCWAFSAVGNIEGQWYLAGHELVSLSEQQLVSCDDMDNGCSGGLMLQAFDWLLQNTNGHLHTEDSY 140 150 160 170 180 190 200 210 420 430 440 450 460 470 480 490 QUERY KYKAKDDMF--CLNYR---CKRKVSLSSIGAVKENQLILALNEVGPLSVNVGVNNDFVAYSEGVYNGTCSEELNHSVLLV : . . . : : ... . . .:. . : . ::... . ... :..:. :: .. ...:::.:::: S29245 PYVSGNGYVPECSNSSELVVGAQIDGHVLIGSSEKAMAAWLAKNGPIAIALDASS-FMSYKSGVLTACIGKQLNHGVLLV 220 230 240 250 260 270 280 290 500 510 520 530 540 550 560 QUERY GYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPIL :: . : : . ::.:::::. :::.:..:. . :. : ..: ::. S29245 GYDM----------------TGE---------VPYWVIKNSWGGDWGEQGYVRVVMGVNA----CLLSE---YPVSAHVR 300 310 320 330 340 S29245 ESAAPGTSTSSETPAPRPVVVEQVICFDKNCRRGCRKTLIKANECHKNGGGGASMIKCSPQKVTMCTYSNEFCVGGGLCF 350 360 370 380 390 400 410 420 --------------------------------------------------------------------------- >>A48566 cysteine proteinase Lpcys2 (EC 3.4.22.-) - Leishmania pifanoi (444 aa) initn: 415 init1: 282 opt: 353 Z-score: 367.7 expect() 1.2e-13 Smith-Waterman score: 467; 29.494% identity in 356 aa overlap Entrez lookup Re-search database >A48566 222- 568: ------------------------------------------: 190 200 210 220 230 240 250 260 QUERY KDDKKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHNK :. : .: . ....:... :..... :. : ...:. A48566 MATSRAALCAVAVVCVVLAAACAPARAIHVGTPAAALFEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQA 10 20 30 40 50 60 70 270 280 290 300 310 320 330 340 QUERY LNKNAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYRE : .:.. ...: : :: :. . . : . : .. : .. . :. : ::. .:.:: A48566 RNPHAQFG--ITKFFDLSEAEFAARYLNG--------AAYFAAAKRH------AAQHY---RKARADL-SAVPDAVDWRE 80 90 100 110 120 130 350 360 370 380 390 400 410 QUERY KGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKDNFGCDGGHPFYSFLYVLQN---ELCLGDEY :: : :::: :::::::..:::::. . .....:.:::..:.:. : ::::: . .: ..::: .: : : A48566 KGAVTPVKDQGACGSCWAFSAVGNIEGQWYLAGHELVSLSEQQLVSCDDMNDGCDGGLMLQAFDWLLQNTNGHLHTEDSY 140 150 160 170 180 190 200 210 420 430 440 450 460 470 480 490 QUERY KYKAKDDMF--CLNYRCKRKVSLSSIGAV----KENQLILALNEVGPLSVNVGVNNDFVAYSEGVYNGTCSEELNHSVLL : . . . : : . :. . : : .:. . : . ::... . ... :..:. :: .. ...:::.::: A48566 PYVSGNGYVPECSNSSEELVVGAQIDGHVLIGSSEKAMAAWLAKNGPIAIALDASS-FMSYKSGVLTACIGKQLNHGVLL 220 230 240 250 260 270 280 290 500 510 520 530 540 550 560 QUERY VGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPIL ::: . : : . ::.:::::. :::.:..:. . :. : ..: ::. A48566 VGYDM----------------TGE---------VPYWVIKNSWGGDWGEQGYVRVVMGVNA----CLLSE---YPVSAHV 300 310 320 330 340 A48566 RESAAPGTSTSSETPAPRPVMVEQVICFDKNCTQGCRKTLIKANECHKNGGGGASMIKCSPQKVTMCTYSNEFCVGGGLC 350 360 370 380 390 400 410 420 --------------------------------------------------------------------------- >>S19651 cysteine proteinase (EC 3.4.22.-) precursor (clone LCP3) - American (320 aa) initn: 333 init1: 170 opt: 350 Z-score: 366.7 expect() 1.4e-13 Smith-Waterman score: 488; 30.946% identity in 349 aa overlap Entrez lookup Re-search database >S19651 228- 567: -----------------------------------------: 190 200 210 220 230 240 250 260 QUERY YLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHNKL--NKN : .... : . :.. . ..:. : :.. :: : . S19651 KVAALFLCGLALATASPSWDHFKTQYGRKYGDAKEELYRQRVFQQNEQLIEDFNKKFENGE 10 20 30 40 50 60 270 280 290 300 310 320 330 340 QUERY AMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYREKGIV . .: .:::.:...::. : ... :.: ... : . : ..:. .:. :.: :..: S19651 VTFKVAMNQFGDMTNEEF-----------NAVMKGYKKGSRGEPK-----AVFTAEGRPMARDV--------DWRTKALV 70 80 90 100 110 350 360 370 380 390 400 410 420 QUERY HEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKD--NFGCDGGHPFYSFLYVLQNE-LCLGDEYKYKA ::: :::::::...: .:. :: ...:.:::..:::: : : :: :: .: :. .: . . : :.: S19651 TPVKDQEQCGSCWAFSATGALEGQHFLKNDELVSLSEQQLVDCSTDYGNDGCGGGWMTSAFDYIKDNGGIDTESSYPYEA 120 130 140 150 160 170 180 190 430 440 450 460 470 480 490 QUERY KDDMFCLNYRCKRKVSLSSIGAVK-ENQLILALNEVGPLSVNVGVNN-DFVAYSEGVY-NGTCSEE-LNHSVLLVGYGQV .: .. . .:. . . :. : :.. :::.:: . ... .: :: ::: . .:: :.:.:: :::: S19651 EDRSCRFDANSIGAICTGSVEVQHTEEALQEAVSGVGPISVAIDASHFSFQFYSSGVYYEQNCSPTFLDHGVLAVGYG-- 200 210 220 230 240 250 260 270 500 510 520 530 540 550 560 QUERY EKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPIL : .::. ::..::::...::. :....:::.... :::. : :: S19651 ------------TESTKD-----------YWLVKNSWGSSWGDAGYIKMSRNRDNN---CGIASEPSYPTV 280 290 300 310 320 --------------------------------------------------------------------------- >>JQ1121 cysteine proteinase homolog COT44 - rape (328 aa) initn: 430 init1: 204 opt: 347 Z-score: 363.5 expect() 2e-13 Smith-Waterman score: 539; 29.213% identity in 356 aa overlap Entrez lookup Re-search database >JQ1121 223- 568: ------------------------------------------: 190 200 210 220 230 240 250 QUERY DDKKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKN----IDEQMRKFEIFKINYISIKN : .... ::.: .: :..: ..:.::: : : JQ1121 MSIYLRWSLEHGKSNSNSNGIINQQDERFNIFKDNLRFIDL 10 20 30 40 260 270 280 290 300 310 320 330 QUERY HNKLNKNAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILD ::. :::: :: .. :.. ...: . . : . .. ... .: . ..: . .. ..:: .: JQ1121 HNENNKNATYKLGLTIFANLTNDEYRSLY---LGARTEPVRRITKAKNVNMKYSAAVN-------------VDEVPVTVD 50 60 70 80 90 100 340 350 360 370 380 390 400 410 QUERY YREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKD-NFGCDGGHPFYSFLYVLQNE-LCLGD .:.:: :. :::: :::::::.... .:.. . ...:.::::.:::.:. : ::.:: :.: ....: : JQ1121 WRQKGAVNAIKDQGTCGSCWAFSTAAAVEGINKIVTGELVSLSEQELVDCDKSYNQGCNGGLMDYAFQFIMKNGGLNTEK 110 120 130 140 150 160 170 180 420 430 440 450 460 470 480 490 QUERY EYKYKAKDDMFCLNYRCKRKVSLSS---IGAVKENQLILALNEVGPLSVNVGVNND-FVAYSEGVYNGTCSEELNHSVLL .: :.. . . .: :.... . . :. : :.. :.:: . ... : :. :...: :. ...:.:. JQ1121 DYPYHGTNGKCNSLLKNSRVVTIDGYEDVPSKDETALKRAVS-YQPVSVAIDAGGRAFQHYQSGIFTGKCGTNMDHAVVA 190 200 210 220 230 240 250 260 500 510 520 530 540 550 560 QUERY VGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPIL ::::. .: . :::..:::. .:::.:..:. :: . . :::. :. ::. JQ1121 VGYGS-------------------------ENGVDYWIVRNSWGTRWGEDGYIRMERNVASKSGKCGIAIEASYPVKYSP 270 280 290 300 310 JQ1121 NPVRGTSSV 320 --------------------------------------------------------------------------- >>KHCHL cathepsin L (EC 3.4.22.15) - chicken (218 aa) initn: 361 init1: 199 opt: 344 Z-score: 363.1 expect() 2.2e-13 Smith-Waterman score: 474; 36.327% identity in 245 aa overlap Entrez lookup Re-search database >KHCHL 334- 569: ----------------------------: 300 310 320 330 340 350 360 370 QUERY PNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKK :. .:.:::: : :::: :::::::...: .:. .: KHCHL APRSVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQHFRK 10 20 30 40 380 390 400 410 420 430 440 QUERY NKNILSFSEQEVVDCSKD--NFGCDGGHPFYSFLYVLQNELCLGDE-YKYKAKDDMFC---LNYRCKRKVSLSSIGAVKE . ...:.:::..::::. : ::.:: .: :: .: ..: : : :::: : .: ... .: .: KHCHL TGKLVSLSEQNLVDCSRPEGNQGCNGGLMDQAFQYVQDNGGIDSEESYPYTAKDDEDCRYKAEYNAANDTGFVDIPQGHE 50 60 70 80 90 100 110 120 450 460 470 480 490 500 510 520 QUERY NQLILALNEVGPLSVNVGV-NNDFVAYSEGVY-NGTCS-EELNHSVLLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDN :. :. :::.:: . . ...: :. :.: . :: :.:.:.::.:::: : : KHCHL RALMKAVASVGPVSVAIDAGHSSFQFYQSGIYYEPDCSSEDLDHGVLVVGYG-FEGGK---------------------- 130 140 150 160 170 530 540 550 560 QUERY IIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPIL :::.::::..:::..:.. ....... :::. . ::.. KHCHL --KYWIVKNSWGEKWGDKGYIYMAKDRKNH---CGIATAASYPLV 180 190 200 210 --------------------------------------------------------------------------- >>A55090 cathepsin O (EC 3.4.-.-) precursor - human (321 aa) initn: 297 init1: 200 opt: 344 Z-score: 360.6 expect() 3e-13 Smith-Waterman score: 408; 30.000% identity in 280 aa overlap Entrez lookup Re-search database >A55090 298- 564: --------------------------------: 260 270 280 290 300 310 320 330 QUERY NHNKLNKNAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEI- :...: : ...: : :. .. : .. ..:.. A55090 FTPTWPRSREREAAAFRESLNRHRYLNSLFPSENSTAFYGINQFSYLFPEEFKAIYLRSK-PSKFPRYSAEVHMSIPNVS 30 40 50 60 70 80 90 100 340 350 360 370 380 390 400 410 QUERY ----LDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKDNFGCDGGHPF--YSFLYVLQN .:.:.: .: . ..: .::.::::. :: .::..: :.: . ..: :.:.::: .:.::.:: . ..: .: A55090 LPLRFDWRDKQVVTQVRNQQMCGGCWAFSVVGAVESAYAIKGKPLEDLSVQQVIDCSYNNYGCNGGSTLNALNWLNKMQV 110 120 130 140 150 160 170 180 420 430 440 450 460 470 480 QUERY ELCLGDEYKYKAKDDMFCLNYRCKRK-VSLSSIGAV----KENQLILALNEVGPLSVNVGVNNDFVAYSEGVYNGTCSE- .: .:: .::.. . : . ... :... .: .:... :: ::: : : . . . : :. . :: A55090 KLVKDSEYPFKAQNGL-CHYFSGSHSGFSIKGYSAYDFSDQEDEMAKALLTFGPLVVIVDAVS-WQDYLGGIIQHHCSSG 190 200 210 220 230 240 250 260 490 500 510 520 530 540 550 560 QUERY ELNHSVLLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEV : ::.::..:. .. .: : :::..:::...:: .:. .. : :.:: :::.. : A55090 EANHAVLITGFDKTGST-------------------P------YWIVRNSWGSSWGVDGYAHV---KMGSNV-CGIADSV 270 280 290 300 310 QUERY FYPIL A55090 SSIFV 320 --------------------------------------------------------------------------- >>S19650 cysteine proteinase (EC 3.4.22.-) precursor (clone LCP2) - American (323 aa) initn: 332 init1: 179 opt: 342 Z-score: 358.5 expect() 3.9e-13 Smith-Waterman score: 474; 30.141% identity in 355 aa overlap Entrez lookup Re-search database >S19650 228- 569: -----------------------------------------: 190 200 210 220 230 240 250 260 QUERY YLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHNKLNKNA- : .... : . .:. . ::. : :.. :: .:. S19650 MKVAVLFLCGVALAAASPSWEHFKGKYGRQYVDAEEDSYRRVIFEQNQKYIEEFNKKYENGE 10 20 30 40 50 60 270 280 290 300 310 320 330 340 QUERY -MYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYREKGIV .. .:.:.:.. ::.. .: ..: . : : .: :: .:. .. :. :.: :: : S19650 VTFNLAMNKFGDMTLEEFNAVMKG--NIP-----RRSAP----------VSVFYP-----KKETGPQATEV-DWRTKGAV 70 80 90 100 110 350 360 370 380 390 400 410 420 QUERY HEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKDNF--GCDGGHPFYSFLYVLQNE-LCLGDEYKYKA :::: :::::::...:..:. :. ...:..::..::::. ::.:: .: :. :. . : :.: S19650 TPVKDQGQCGSCWAFSTTGSLEGQHFLKTGSLISLAEQQLVDCSRPYGPQGCNGGWMNDAFDYIKANNGIDTEAAYPYEA 120 130 140 150 160 170 180 190 430 440 450 460 470 480 490 QUERY KD-----DMFCLNYRCKRKVSLSSIGAVKENQLILALNEVGPLSVNV-GVNNDFVAYSEGVY-NGTCSEE-LNHSVLLVG .: : . :. .....: :. :. : :. ..::.::.. .....: :: ::: . .:: :.:.:: :: S19650 RDGSCRFDSNSVAATCSGHTNIAS-GS--ETGLQQAVRDIGPISVTIDAAHSSFQFYSSGVYYEPSCSPSYLDHAVLAVG 200 210 220 230 240 250 260 270 500 510 520 530 540 550 560 QUERY YGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPIL :: :. .: .:..::::. .::. :....:::.:.. :::. . ::.. S19650 YG---------------------SEGGQD----FWLVKNSWATSWGDAGYIKMSRNRNNN---CGIATVASYPLV 280 290 300 310 320 --------------------------------------------------------------------------- >>KHRTH cathepsin H (EC 3.4.22.16) precursor - rat (333 aa) initn: 358 init1: 153 opt: 342 Z-score: 358.3 expect() 4e-13 Smith-Waterman score: 537; 30.978% identity in 368 aa overlap Entrez lookup Re-search database >KHRTH 213- 568: -------------------------------------------: 180 190 200 210 220 230 240 250 QUERY INKLILREEKDDKKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKIN : .: :. .: ..::.:.:.:.. : .....: : KHRTH MWTALPLLCAGAWLLSAGATAELTVNAIE-KFHFTSWMKQHQKTYSS-REYSHRLQVFANN 10 20 30 40 50 260 270 280 290 300 310 320 330 QUERY YISIKNHNKLNKNAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSK . .:. ::. .: .: .:::::.: :.:. : : :.. : :.:. . . KHRTH WRKIQAHNQ--RNHTFKMGLNQFSDMSFAEIKH--KYLWSEPQNCSATKS----NYLRGT------------------GP 60 70 80 90 100 110 340 350 360 370 380 390 400 QUERY VPEILDYREKGIVHEP-KDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKD--NFGCDGGHPFYSFLYVLQ : .:.:.:: : : :.:: :::::.:...: .::. : . ......::..:::... : ::.:: : .: :.: KHRTH YPSSMDWRKKGNVVSPVKNQGACGSCWTFSTTGALESAVAIASGKMMTLAEQQLVDCAQNFNNHGCQGGLPSQAFEYILY 120 130 140 150 160 170 180 190 410 420 430 440 450 460 470 480 QUERY NELCLG-DEYKYKAKDDMFCLNYRCKRKVSLS----SIGAVKENQLILALNEVGPLSVNVGVNNDFVAYSEGVYNG-TCS :. .: : : : .:. . : .. .. :.. .: : .. :. .:.: :..::. :. :::.. .: KHRTH NKGIMGEDSYPYIGKNGQ-C-KFNPEKAVAFVKNVVNITLNDEAAMVEAVALYNPVSFAFEVTEDFMMYKSGVYSSNSCH 200 210 220 230 240 250 260 270 490 500 510 520 530 540 550 560 QUERY ---EELNHSVLLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGI ...::.:: ::::. .: . :::.::::...::.::.. . :.:: .::. KHRTH KTPDKVNHAVLAVGYGE-------------------------QNGLLYWIVKNSWGSNWGNNGYFLIERGKN----MCGL 280 290 300 310 320 QUERY GEEVFYPIL . . ::: KHRTH AACASYPIPQV 330 --------------------------------------------------------------------------- >>I58002 cathepsin-related protein - rat (fragment) (236 aa) initn: 320 init1: 176 opt: 338 Z-score: 356.5 expect() 5.1e-13 Smith-Waterman score: 445; 33.468% identity in 248 aa overlap Entrez lookup Re-search database >I58002 326- 567: -----------------------------: 290 300 310 320 330 340 350 360 QUERY YFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYREKGIVHEPKDQGLCGSCWAFASVGN .:.. .:.. :.:..: : ..:: ::::::::.:: I58002 NPAAVTNPSAQKQVSIGLPNFKDWRKEGYVTPVRNQGKCGSCWAFAAVGA 10 20 30 40 50 370 380 390 400 410 420 430 440 QUERY IESVFAKKNKNILSFSEQEVVDCSKDNFGCDGGHPFYSFLYVLQNE-LCLGDEYKYKAKDDMFCLNYRCKRKVSLSSIGA ::. .. :. :. .: :...: .....: : .: :::.:. : : :..:: : . . ....... I58002 IEGQMSLKTGNLTPLSAQNLLDTKSEGIGLPWGTAHQAFNYVLKNKGLEAEATYPYEGKDGP-CRYHSENASANITGFVN 60 70 80 90 100 110 120 450 460 470 480 490 500 510 QUERY VKENQLIL--ALNEVGPLSVNVGVNND-FVAYSEGVYN-GTCSEEL-NHSVLLVGYGQVEKTKLNYNNKIQTYNTKENSN . :.: : :. .::.:. . ...: : :: :::. .:: . ::.::.:::: ..: I58002 LPPNELYLWVAVASIGPVSAAIDASHDSFRFYSGGVYHEPNCSSYVVNHAVLVVGYG-------------------FEGN 130 140 150 160 170 180 190 520 530 540 550 560 QUERY QPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPIL . : : ::.:::::...:: ::::......:. :::. .. .: I58002 ETDGN--NYWLIKNSWGEEWGINGFMKIAKDRNNH---CGIASQASFPDIF 200 210 220 230 --------------------------------------------------------------------------- >>TAGB actinidain (EC 3.4.22.14) precursor - kiwi fruit (380 aa) initn: 407 init1: 138 opt: 341 Z-score: 356.4 expect() 5.1e-13 Smith-Waterman score: 507; 28.382% identity in 377 aa overlap Entrez lookup Re-search database >TAGB 200- 568: ---------------------------------------------: 160 170 180 190 200 210 220 230 QUERY RKELSNILHKLLEINKLILREEKDDKKVYLINDNYDEKGALEIGMN-EEMKYKKEDPINNIKYASKFFKFMKEHNKVYKN : ...: ... . .: .. . : : ..:. .: :.. TAGB MGLPKSFVSMSLLFFSTLLILSLAFNAKNLTQRTNDEVKAM-YESWLIKY----GKSYNS 10 20 30 40 50 240 250 260 270 280 290 300 310 QUERY IDEQMRKFEIFKINYISIKNHNKLNKNAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEF . : :.::::: . : .:: . : :: .:::.: ..::.. . . :. : :. TAGB LGEWERRFEIFKETLRFIDEHNA-DTNRSYKVGLNQFADLTDEEFRSTYLGFTSGSNKT--KVSN--------------- 60 70 80 90 100 110 320 330 340 350 360 370 380 390 QUERY YTNGKRNEKDIFSKVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDC--SKDNFGCD : : . . .: .:.: : : . :.:: ::.::::......:.. . ..:.::::..:: .... ::. TAGB -----RYEPRVGQVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCGRTQNTRGCN 120 130 140 150 160 170 180 190 400 410 420 430 440 450 460 470 QUERY GGHPFYSFLYVLQNE-LCLGDEYKYKAKDDMFCLNYRCKRKVSLSSIGAVKENQ---LILALNEVGPLSVNVGVNND-FV ::. .: ....: . ..: : :.: .. . .. :.... : :. : :.. :.:: . . .: : TAGB GGYITDGFQFIINNGGINTEENYPYTAQDGECNVELQNEKYVTIDTYENVPYNNEWALQTAVT-YQPVSVALDAAGDAFK 200 210 220 230 240 250 260 270 480 490 500 510 520 530 540 550 QUERY AYSEGVYNGTCSEELNHSVLLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNK :: :...: :. ..:.: .:::: .. : :::.::::. :::.:.::. :: TAGB QYSSGIFTGPCGTAIDHAVTIVGYGT-------------------------EGGIDYWIVKNSWDTTWGEEGYMRILRNV 280 290 300 310 320 560 QUERY NGDNVFCGIGEEVFYPIL .: .. :::. ::. TAGB GGAGT-CGIATMPSYPVKYNNQNYPEPYSSLINPPAFSMSKDGPVGVEDGQRYSA 330 340 350 360 370 380 --------------------------------------------------------------------------- >>S53027 cathepsin L (EC 3.4.22.15) precursor - penaeid shrimp (Penaeus vanam (326 aa) initn: 390 init1: 201 opt: 340 Z-score: 356.4 expect() 5.1e-13 Smith-Waterman score: 503; 30.168% identity in 358 aa overlap Entrez lookup Re-search database >S53027 228- 569: -----------------------------------------: 190 200 210 220 230 240 250 260 QUERY YLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHNKL--NKN : ::.. : ...:. .. .:. : : .:: : . S53027 KSLAVLACVVAVAVATPSLRQQWQNFKAEHGRRYASVQEERYRLSVFEQNQQFIDDHNARFENGE 10 20 30 40 50 60 270 280 290 300 310 320 330 340 QUERY AMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYREKGIV . . ..:::.:.. ::. .. .: .:.. .: .: : ... . :: .:.: :: : S53027 VTFTLQMNQFGDMTSEEIVATMNGFLGAPTR------RP------AAVL--------KADDETL----PEKVDWRTKGAV 70 80 90 100 110 120 350 360 370 380 390 400 410 420 QUERY HEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKD--NFGCDGGHPFYSFLYVLQNE-LCLGDEYKYKA ::: :::::::...:..:. :. ...:.:::..:::: :.:: :: .: :. :. . : : :.: S53027 TPVKDQKQCGSCWAFSTTGSLEGQHFLKDGKLVSLSEQNLVDCSDKFGNMGCMGGLMDQAFRYIKANKGIDTEDSYPYEA 130 140 150 160 170 180 190 200 430 440 450 460 470 480 490 QUERY KDDMFCLNYRCK---RKVSLSSIGAV-----KENQLILALNEVGPLSVNVGVNND-FVAYSEGVY-NGTCSEE-LNHSVL .: .:. .:. .. : : .:. : :. .::.::.. .... : : ::: . :: :.:.:: S53027 QDG------KCRFDASNVGATDTGYVDVEHGSESALKKAVATIGPISVGIDASQSTFHFYHTGVYHDDHCSSTMLDHGVL 210 220 230 240 250 260 270 500 510 520 530 540 550 560 QUERY LVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPIL ::::. :.: .:..::::. .::..:....:::.:.. :::. .. ::.. S53027 AVGYGS------------------------DENGGDFWLVKNSWNTSWGDKGYIKMSRNRNNN---CGIASQASYPLV 280 290 300 310 320 --------------------------------------------------------------------------- >>JC5691 cysteine proteinase (EC 3.4.-.-) - Bombyx mori nuclear polyhedrosis (323 aa) initn: 398 init1: 207 opt: 331 Z-score: 347.3 expect() 1.6e-12 Smith-Waterman score: 490; 31.856% identity in 361 aa overlap Entrez lookup Re-search database >JC5691 208- 564: --------------------------------------------: 170 180 190 200 210 220 230 240 QUERY HKLLEINKLILREEKDDKKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFE .: ::.. .: : .:... :: :.. :..:.:. JC5691 MNKILFYLFVYAVVKSAAYDPLKAPNY---FEEFVHRFNKNYSSEVEKLRRFK 10 20 30 40 50 250 260 270 280 290 300 310 320 QUERY IFKINYISIKNHNKLNKNAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEK ::. : : :.:. : .: :. .:.::: :..: . : : .:.. .: : ::... .: JC5691 IFQHNLNEIINKNQ-NDSAKYE--INKFSDLSKDETIAKY-TGLSLPTQT--------QNFCK-VILLDQPPGKG----- 60 70 80 90 100 110 330 340 350 360 370 380 390 400 QUERY DIFSKVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKDNFGCDGGHPFYSFLYV : .:.:. . : :.::.::.:::::..:..:: :: :.......:::...::. . ::.:: .: . JC5691 ------PLEFDWRRLNKVTSVKNQGMCGACWAFATLGSLESQFAIKHNELINLSEQQMIDCDFVDAGCNGGLLHTAFEAI 120 130 140 150 160 170 180 410 420 430 440 450 460 470 480 QUERY LQ-NELCLGDEYKYKAKDDMFCLNYRCKRKVSLSSIG---AVKENQLILALNEVGPLSVNVGVNNDFVAYSEGVYNGTCS .. . . : ..: :.: :. : : :.... : :..: : :::. . . . :.: :..:. . . JC5691 IKMGGVQLESDYPYEA-DNNNCRMNSNKFLVQVKDCYRYIIVYEEKLKDLLPLVGPIPMAIDAA-DIVNYKQGIIKYCFD 190 200 210 220 230 240 250 260 490 500 510 520 530 540 550 560 QUERY EELNHSVLLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEE :::.::::::: :: :: : :: .::.:. :::.::.:...: :. ::. .: JC5691 SGLNHAVLLVGYG-VE------NN------------------IPYWTFKNTWGTDWGEDGFFRVQQNINA----CGMRNE 270 280 290 300 310 QUERY VFYPIL . JC5691 LASTAVIY 320 --------------------------------------------------------------------------- >>S12099 cysteine proteinase (EC 3.4.22.-) precursor - Trypanosoma brucei (450 aa) initn: 364 init1: 228 opt: 332 Z-score: 346.1 expect() 1.9e-12 Smith-Waterman score: 473; 30.088% identity in 339 aa overlap Entrez lookup Re-search database >S12099 224- 552: ----------------------------------------- : 190 200 210 220 230 240 250 260 QUERY DKKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHNKLN .: : :...::::. :. .:. :. :. . : . : S12099 MPRTEMVRFVRLPVVLLAMAACLASVALGSLHVEESLEMRFAAFKKKYGKVYKDAKEEAFRFRAFEENMEQAKIQAAAN 10 20 30 40 50 60 70 270 280 290 300 310 320 330 340 QUERY KNAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYREKG : . :. :::...::.. ... .: ..... .. . :.:. .: .:.:::: S12099 PYATFG--VTPFSDMTREEFRARYRNG--------ASYFAAAQKRVRKTVNV----TTGR---------APAAVDWREKG 80 90 100 110 120 130 350 360 370 380 390 400 410 420 QUERY IVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKDNFGCDGGHPFYSFLYVLQNELCLGDEYKYKAK : :::: :::::::...::::. . .. ..:.::: .:.:. .::: :: .: ...... : . .:. S12099 AVTPVKDQGQCGSCWAFSTIGNIEGQWQVAGNPLVSLSEQMLVSCDTIDFGCGGGLMDNAFNWIVNSN---GGNVFTEAS 140 150 160 170 180 190 200 210 430 440 450 460 470 480 490 QUERY DDMFCLNYRCKR-KVSLSSIGAV---------KENQLILALNEVGPLSVNVGVNNDFVAYSEGVYNGTCSEELNHSVLLV . : . . ... :::. :. . : : :::.. : ... :. :. :. .. ::.:.:.:::: S12099 YPYVSGNGEQPQCQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLAIAVDATS-FMDYNGGILTSCTSEQLDHGVLLV 220 230 240 250 260 270 280 290 500 510 520 530 540 550 560 QUERY GYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPIL ::. .::: : :::::::::. :::.:..:. .. : S12099 GYN-------------------DNSNPP------YWIIKNSWSNMWGEDGYIRIEKGTNQCLMNQAVSSAVVGGPTPPPP 300 310 320 330 340 S12099 PPPPPSATFTQDFCEGKGCTKGCSHATFPTGECVQTTGVGSVIATCGASNLTQIIYPLSRSCSGPSVPITVPLDKCIPIL 350 360 370 380 390 400 410 420 --------------------------------------------------------------------------- >>S62736 cathepsin-like cysteine proteinase (EC 3.4.22.-) - Autographa califo (323 aa) initn: 373 init1: 199 opt: 326 Z-score: 342.2 expect() 3.2e-12 Smith-Waterman score: 470; 31.429% identity in 350 aa overlap Entrez lookup Re-search database >S62736 219- 564: ------------------------------------------: 180 190 200 210 220 230 240 250 QUERY REEKDDKKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKN .: . : .:... :: : . :..:.:.::. : : : S62736 MNKILFYLFVYGVVNSAAYDLLKAPNYFEEFVHRFNKDYGSEVEKLRRFKIFQHNLNEIIN 10 20 30 40 50 60 260 270 280 290 300 310 320 330 QUERY HNKLNKNAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILD .:. : .: :. .:.::: :..: . : : .: . .: : :.... .: : .: S62736 KNQ-NDSAKYE--INKFSDLSKDETIAKY-TGLSLPIQT--------QNFCK-VIVLDQPPGKG-----------PLEFD 70 80 90 100 110 340 350 360 370 380 390 400 410 QUERY YREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKDNFGCDGGHPFYSFLYVLQ-NELCLGDE .:. . : :.::.::.:::::.....:: :: :.......:::...::. . ::.:: .: ... . . : .. S62736 WRRLNKVTSVKNQGMCGACWAFATLASLESQFAIKHNQLINLSEQQMIDCDFVDAGCNGGLLHTAFEAIIKMGGVQLESD 120 130 140 150 160 170 180 190 420 430 440 450 460 470 480 490 QUERY YKYKAKDDMFCLNYRCKRKVSLSS---IGAVKENQLILALNEVGPLSVNVGVNNDFVAYSEGVYNGTCSEELNHSVLLVG : :.: :. : : :.... .: :..: : :::. . . . :.: :..:. . . :::.::::: S62736 YPYEA-DNNNCRMNSNKFLVQVKDCYRYITVYEEKLKDLLRLVGPIPMAIDAA-DIVNYKQGIIKYCFNSGLNHAVLLVG 200 210 220 230 240 250 260 270 500 510 520 530 540 550 560 QUERY YGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPIL :: :: :: : :: .::.:. :::.::.:...: :. ::. .:. S62736 YG-VE------NN------------------IPYWTFKNTWGTDWGEDGFFRVQQNINA----CGMRNELASTAVIY 280 290 300 310 320 --------------------------------------------------------------------------- >>S62735 cathepsin - Choristoneura fumiferana nuclear polyhedrosis virus (324 aa) initn: 321 init1: 211 opt: 325 Z-score: 341.1 expect() 3.6e-12 Smith-Waterman score: 478; 30.000% identity in 350 aa overlap Entrez lookup Re-search database >S62735 219- 564: ------------------------------------------: 180 190 200 210 220 230 240 250 QUERY REEKDDKKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKN .: . : :... :: :.. .:..:.:.::. : : : S62735 MNKIVLYLLVYGAVQCAAYDVLKAPNYFEDFLHKFNKSYSSESEKLRRFQIFRHNLEEIIN 10 20 30 40 50 60 260 270 280 290 300 310 320 330 QUERY HNKLNKNAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILD .:. ...:.:. .:.:.: :..: . : : :.... .... . . : : .: S62735 KNHNDSTAQYE--INKFADLSKDETISKYTGL-----------SLPLQTQNFCEVVVLDRPPD----------KGPLEFD 70 80 90 100 110 340 350 360 370 380 390 400 410 QUERY YREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKDNFGCDGGHPFYSFLYVLQ-NELCLGDE .:. . : :.::.::.:::::..:..:: :: :.......:::...::. . ::::: .: :.. . . .. S62735 WRRLNKVTSVKNQGMCGACWAFATLGSLESQFAIKHNQFINLSEQQLIDCDFVDAGCDGGLLHTAFEAVMNMGGIQAESD 120 130 140 150 160 170 180 190 420 430 440 450 460 470 480 490 QUERY YKYKAKDDMFCLN---YRCKRKVSLSSIGAVKENQLILALNEVGPLSVNVGVNNDFVAYSEGVYNGTCSEELNHSVLLVG : :.:.. : . : : : .: :..: : :::. : . .. :.: :..:... .. :::.::::: S62735 YPYEANNGDCRANAAKFVVKVKKCYRYI-TVFEEKLKDLLRSVGPIPVAIDAS-DIVNYKRGIMKYCANHGLNHAVLLVG 200 210 220 230 240 250 260 270 500 510 520 530 540 550 560 QUERY YGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPIL :. :: : . .::.::.:. :::.:..:...: :. ::: .:. S62735 YA-VE------------------------NGVPFWILKNTWGADWGEQGYFRVQQNINA----CGIQNELPSSAEIY 280 290 300 310 320 --------------------------------------------------------------------------- >>S49451 cysteine proteinase - chickpea (325 aa) initn: 429 init1: 176 opt: 324 Z-score: 340.1 expect() 4.1e-12 Smith-Waterman score: 492; 29.429% identity in 350 aa overlap Entrez lookup Re-search database >S49451 225- 568: -----------------------------------------: 190 200 210 220 230 240 250 260 QUERY KKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHNKLNK . :.. .:.:.:... :. .:.::: : : .:: . S49451 MTMYEKWLVKHQKMYNGLGEKDTRFQIFKDNLRFIDEHNA--Q 10 20 30 40 270 280 290 300 310 320 330 340 QUERY NAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYREKGI : :: .:.:.: ..:: .... .. . : .: ... : .: . :: :.: :: S49451 NYSYKVGLNKFADINNEEYRDMYLGTKSDAKRRVMK-TKITGHRITYNSVI-------------VTVKV----DWRLKGA 50 60 70 80 90 100 350 360 370 380 390 400 410 420 QUERY VHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKD-NFGCDGGHPFYSFLYVLQNE-LCLGDEYKYKA : . :::: :::::::......:.. . ...:.::::.:::.. : ::.:: :.: ....: . ..: :.. S49451 VTHIKDQGSCGSCWAFSTIATVEAINKIVTGKFVSLSEQELVDCDRAFNEGCNGGLMDYAFEFIIRNGGIDTDQDYPYNG 110 120 130 140 150 160 170 180 430 440 450 460 470 480 490 QUERY KDDMFCLNYRCKRKVSLSSIGAVKE--NQLILALNEVGPLSVNV-GVNNDFVAYSEGVYNGTCSEELNHSVLLVGYGQVE . . . . ::... : : : :. . :.:: . :.. . :. ::..: :. .:.:.:..::::. S49451 FERKCDPTKKNAKVVSIDGYEDVPSYMNALKKAVAH-QPVSVAIAGLGRALQLYQSGVFTGKCGTDLDHGVVVVGYGS-- 190 200 210 220 230 240 250 260 500 510 520 530 540 550 560 QUERY KTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRL-SRNKNGDNVFCGIGEEVFYPIL .: . ::...:::. .:::.:.... ::: .. :::. :. ::. S49451 -----------------------ENGVDYWLVRNSWGTNWGEDGYFKIASRNVKSLYRKCGIAMEASYPVKYGQNTNSAA 270 280 290 300 310 S49451 PQLYVTSA 320 --------------------------------------------------------------------------- >>S41425 cysteine proteinase (EC 3.4.22.-) CP3 precursor - Trichomonas vagina (278 aa) initn: 283 init1: 247 opt: 323 Z-score: 340.1 expect() 4.1e-12 Smith-Waterman score: 380; 30.556% identity in 288 aa overlap Entrez lookup Re-search database >S41425 225- 500: ----------------------------------: 190 200 210 220 230 240 250 260 QUERY KKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHNKLNK :. .:. :... . :: .. ... : ...::. :. S41425 MFSAFFATASSKLFLQHEEKAFLDWMRSTNNMFVG-DEYHFRLGVYNTNKRRVQEHNRANS 10 20 30 40 50 60 270 280 290 300 310 320 330 340 QUERY NAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFS-KVPEILDYREKG . :. .:..: .. : .:.:: .: : .: .:.. : ::. ::. .:.:. S41425 G--YQLTMNHLSCMTPSE----YKVLL---GH---KQTKKIEGEAK------------------IFKGDVPDAVDWRNAK 70 80 90 100 110 350 360 370 380 390 400 410 420 QUERY IVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKDNFGCDGGHPFYSFLYVLQNE---LCLGDEYKY ::. :::. :::::::. : :: .: :. ..::..::..::: .::::: . .. ::.... : .: : S41425 IVNPIKDQAQCGSCWAFSVVQVQESQWALKKGQLLSLAEQNMVDCVDTCYGCDGGDEYLAYDYVIKHQKGLWMLETDYPY 120 130 140 150 160 170 180 190 430 440 450 460 470 480 490 QUERY KAKDDMFCLNYRCKRKVSLSS-----IGAVKENQLILALNEVGPLSVNVGVNN-DFVAYSEGVYNG-TCSEE-LNHSVLL :.: : ... . :.:.. . .:..: . . : .:. . ... :: :: :.:: .:: :.:.: : S41425 TARDGS-C-KFKAAKGVTLTKSYVRPTTTQNEDELKAGCAKGGVVSIAIDASGYDFQLYSSGIYNPKSCSSTFLDHAVGL 200 210 220 230 240 250 260 500 510 520 530 540 550 560 QUERY VGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPIL :::: .:. S41425 VGYGTENKVD 270 --------------------------------------------------------------------------- >>S37048 cysteine proteinase - Trypanosoma congolense (447 aa) initn: 380 init1: 212 opt: 325 Z-score: 339.0 expect() 4.7e-12 Smith-Waterman score: 456; 29.167% identity in 336 aa overlap Entrez lookup Re-search database >S37048 224- 552: ----------------------------------------- : 190 200 210 220 230 240 250 260 QUERY DKKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHNKLN .: : ..... ::. :. .:..:: :. :.. : S37048 MPRSEMTRTLRFSVGLLAVAACFVPVALGVLHAEQSLQQQFAAFKQKYSRSYKDATEEAFRFRVFKQNMERAKEEAAAN 10 20 30 40 50 60 70 270 280 290 300 310 320 330 340 QUERY KNAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYREKG : . :..:::.: :: :.. : . : :. .. : . .: .:.: .:.:.:: S37048 PYATFG--VTRFSDMSPEE----FRATYH---NGAEYYAAALKRPRK-VVNVST-------------GKAPPAVDWRKKG 80 90 100 110 120 130 350 360 370 380 390 400 410 420 QUERY IVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKDNFGCDGGHPFYSFLYVL---QNELCLGDEYKY : :::: :::::::...::::. . .... :.::: .:.:. ..:: :: :. ... .... .. : : S37048 AVTPVKDQGACGSCWAFSAIGNIEGQWKVAGHELTSLSEQMLVSCDTTDYGCRGGLMDKSLQWIVSSNKGNVFTAQSYPY 140 150 160 170 180 190 200 210 430 440 450 460 470 480 490 QUERY KAKDDMFCLNYRCKRKVSLSSIGAVK----ENQLILALNEVGPLSVNVGVNNDFVAYSEGVYNGTCSEELNHSVLLVGYG . . . . :. . : .. :: . : . ::... : ... :..:. :: .. :. :.:.:::::: S37048 ASGGGKMPPCNKSGKVVGAKISGHINLPKDENAIAEWLAKNGPVAIAVDATS-FLGYKGGVLTSCISKGLDHDVLLVGY- 220 230 240 250 260 270 280 290 500 510 520 530 540 550 560 QUERY QVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPIL .....: :::::::::: :::.:..:. .. : S37048 -------------------DDTSKPP-----YWIIKNSWSKGWGEEGYIRIEKGTNQCLMKNYARSAVVSGPPPPPPPPA 300 310 320 330 340 350 S37048 STFTQEFCEGAECQSGCTKATFPTGKCVQFGGAGSVIASCGSNNLTQIVYPLSSSCSGFSIPLTVPLDKCLPIVVGSVMY 360 370 380 390 400 410 420 430 --------------------------------------------------------------------------- >>S41427 cysteine proteinase (EC 3.4.22.-) CP1 precursor - Trichomonas vagina (309 aa) initn: 383 init1: 238 opt: 322 Z-score: 338.3 expect() 5.2e-12 Smith-Waterman score: 438; 28.409% identity in 352 aa overlap Entrez lookup Re-search database >S41427 225- 567: -----------------------------------------: 190 200 210 220 230 240 250 260 QUERY KKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHNKLNK :. .:.: .... . :: ..: :. : ...:: : S41427 MMYQAHEQKSFLGWMRETGNMFTG-DEYHQRFGIWLSNKRLVQQHNAANG 10 20 30 40 270 280 290 300 310 320 330 340 QUERY NAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYREKGI . . .:... : : .:.:: :. ::. .. :. .: .:.::::. S41427 G--FVLAMNKLAHLSPSE----YKALLGFKNEKRSDRVKPIASN----------YV------------APASIDWREKGV 50 60 70 80 90 100 350 360 370 380 390 400 410 420 QUERY VHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKDNFGCDGGHPFYSFLYV---LQNELCLGDEYKYK :. :::: :::::.:... .:: .: :. .. :.:::..::: .::.:: .. :: .... .: :: S41427 VNPIKDQGQCGSCWTFSTIQAMESQWAVKHTKLYSLSEQNLVDCVTTCYGCNGGLMELAYDYVKTYQKGKFMTEADYPYK 110 120 130 140 150 160 170 180 430 440 450 460 470 480 490 QUERY AKDDMFCLNYRCKRKVSLSSIGAVKENQLILALNEV---GPLSVNVGVNN-DFVAYSEGVYN-GTCSEE-LNHSVLLVGY : :. .: . .... .: :.. .:.: :: .. . ... .: :: :.:. ..:: : :.:.: ::: S41427 AIDQSCKFNAAKVAEPTVTGYITVTEGDEKDLMNKVAQYGPAAIAIDASHYSFQLYSSGIYDESSCSPEGLDHAVGCVGY 190 200 210 220 230 240 250 260 500 510 520 530 540 550 560 QUERY GQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPIL :. :.:.. :::..:::. .:::.:..:. ..::.. :: . . : S41427 GS------------------EGSKN-------YWIVRNSWGVSWGEKGYIRMIKDKNNQ---CGEASAACIPTVSA 270 280 290 300 --------------------------------------------------------------------------- >>I52525 testin precursor - rat (333 aa) initn: 291 init1: 170 opt: 319 Z-score: 334.8 expect() 8.1e-12 Smith-Waterman score: 427; 28.367% identity in 349 aa overlap Entrez lookup Re-search database >I52525 231- 569: -----------------------------------------: 200 210 220 230 240 250 260 QUERY NDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHN--KLNKNAMY .:.:.: :..:. : ... :. :. :: :. . I52525 MIAVLFLAILCLEVDSTAPTPDPSLDVEWNEWRTKHGKTY-NMNEERLKRAVWEKNFKMIELHNWEYLEGRHDF 10 20 30 40 50 60 70 270 280 290 300 310 320 330 340 QUERY KKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYREKGIVHEP .: :.: .. :. : . . :.: .. :..: : ::. .:.:. : : I52525 TMAMNAFGDLTN---IEFVKMMTGFQRQKIKK-THIFQDHQ--------------------FLYVPKRVDWRQLGYVTPV 80 90 100 110 120 350 360 370 380 390 400 410 420 QUERY KDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKDN--FGCDGGHPFYSFLYVLQNE-LCLGDEYKYKAKDD :.:: :.: :::...:..:. . .:.. .. .:::...:: .: ::.:: :.: :: .: : . : :... I52525 KNQGHCASSWAFSATGSLEGQMFRKTERLIPLSEQNLLDCMGSNVTHGCSGGFMQYAFQYVKDNGGLATEESYPYRGQGR 130 140 150 160 170 180 190 200 430 440 450 460 470 480 490 500 QUERY MFCLNYRCKRKVSLSSIGAV--KENQLILALNEVGPLSVNVGVNN-DFVAYSEGVY-NGTCSE-ELNHSVLLVGYGQVEK : . . ... .. . .:. :. :. .:::.:: : ... .: :. :.: . :.. .:::.::.:::: I52525 E-CRYHAENSAANVRDFVQIPGSEEALMKAVAKVGPISVAVDASHGSFQFYGSGIYYEPQCKRVHLNHAVLVVGYG---- 210 220 230 240 250 260 270 280 510 520 530 540 550 560 QUERY TKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPIL .. .:. : : .:..::::...:: .:.:.:... .. :::. :::. I52525 -----------FEGEES----DGN--SFWLVKNSWGEEWGMKGYMKLAKDWSNH---CGIATYSTYPIV 290 300 310 320 330 --------------------------------------------------------------------------- >>A60667 cysteine proteinase cruzain (EC 3.4.22.-) - Trypanosoma cruzi (467 aa) initn: 369 init1: 192 opt: 320 Z-score: 333.6 expect() 9.4e-12 Smith-Waterman score: 454; 28.367% identity in 349 aa overlap Entrez lookup Re-search database >A60667 223- 563: ------------------------------------------: 190 200 210 220 230 240 250 260 QUERY DDKKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHNKL :.: .: ..:..::.. :. .. .:. : . . : A60667 MSGWARALLLAAVLVVMACLVPAATASLHAEETLTSQFAEFKQKHGRVYESAAEEAFRLSVFRENLFLARLHAAA 10 20 30 40 50 60 70 270 280 290 300 310 320 330 340 QUERY NKNAMYKKKVNQFSDYSEEELK-EYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYRE : .: . :. ::: ..::.. .: . : . :. : .: ... : .: .:.: A60667 NPHATFG--VTPFSDLTREEFRSRYHNGAAHFAAAQ-ERARVP----VKVEVV-------G----------APAAVDWRA 80 90 100 110 120 130 350 360 370 380 390 400 410 QUERY KGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKDNFGCDGGHPFYSFLYVLQNE---LCLGDEY .: : :::: :::::::...::.: . .. . ..::: .:.:.: . ::.:: .: ...:.. . : : A60667 RGAVTAVKDQGQCGSCWAFSAIGNVECQWFLAGHPLTNLSEQMLVSCDKTDSGCSGGLMNNAFEWIVQENNGAVYTEDSY 140 150 160 170 180 190 200 210 420 430 440 450 460 470 480 490 QUERY KYKAKDDMFCLNYRCKRKVSLSSIGAVK----ENQLILALNEVGPLSVNVGVNNDFVAYSEGVYNGTCSEELNHSVLLVG : . . . . :. . : :. : :. : ::..: : ... ...:. ::... ::.:.:.::::: A60667 PYASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVAVAVDASS-WMTYTGGVMTSCVSEQLDHGVLLVG 220 230 240 250 260 270 280 290 500 510 520 530 540 550 560 QUERY YGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPIL :. :. . ::::::::. .:::.:..:.....: : . ::. A60667 YN-------------------------DSAAVPYWIIKNSWTTQWGEEGYIRIAKGSNQ----CLVKEEASSAVVGGPGP 300 310 320 330 340 A60667 TPEPTTTTTTSAPGPSPSYFVQMSCTDAACIVGCENVTLPTGQCLLTTSGVSAIVTCGAETLTEEVFLTSTHCSGPSVRS 350 360 370 380 390 400 410 420 --------------------------------------------------------------------------- >>S27044 papain-like protein - Autographa californica nuclear polyhedrosis vi (208 aa) initn: 335 init1: 199 opt: 313 Z-score: 331.7 expect() 1.2e-11 Smith-Waterman score: 386; 33.190% identity in 232 aa overlap Entrez lookup Re-search database >S27044 337- 564: ----------------------------: 300 310 320 330 340 350 360 370 QUERY MIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKN . .:. . : :.::.::.:::::.....:: :: :... S27044 IHWRRLNKVTSVKNQGMCGACWAFATLASLESQFAIKHNQ 10 20 30 40 380 390 400 410 420 430 440 450 QUERY ILSFSEQEVVDCSKDNFGCDGGHPFYSFLYVLQ-NELCLGDEYKYKAKDDMFCLNYRCKRKVSLSSIG---AVKENQLIL ....:::...::. . ::.:: .: ... . . : ..: :.: :. : : :.... .: :..: S27044 LINLSEQQMIDCDFVDAGCNGGLLHTAFEAIIKMGGVQLESDYPYEA-DNNNCRMNSNKFLVQVKDCYRYITVYEEKLKD 50 60 70 80 90 100 110 460 470 480 490 500 510 520 530 QUERY ALNEVGPLSVNVGVNNDFVAYSEGVYNGTCSEELNHSVLLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIK : :::. . . . :.: :..:. . . :::.::::::: :: :: : :: .: S27044 LLRLVGPIPMAIDAA-DIVNYKQGIIKYCFNSGLNHAVLLVGYG-VE------NN------------------IPYWTFK 120 130 140 150 160 170 540 550 560 QUERY NSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPIL :.:. :::.::.:...: :. ::. .:. S27044 NTWGTDWGEDGFFRVQQNINA----CGMRNELASTAVIY 180 190 200 --------------------------------------------------------------------------- >>KHRZOG oryzain (EC 3.4.22.-) gamma precursor - rice (362 aa) initn: 420 init1: 200 opt: 316 Z-score: 331.2 expect() 1.3e-11 Smith-Waterman score: 492; 31.755% identity in 359 aa overlap Entrez lookup Re-search database >KHRZOG 222- 569: ------------------------------------------: 190 200 210 220 230 240 250 260 QUERY KDDKKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHNK : .: .: .:.: : . : .:.:.::. . ... :. KHRZOG SAVAAASSGFDDSNPIRSVTDHAASALESTVIAALGRTRGALRFARFAVRHGKRYGDAAEVQRRFRIFSESLELVRSTNR 20 30 40 50 60 70 80 90 270 280 290 300 310 320 330 340 QUERY LNKNAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYRE .. :. .:.:.:.: ::.. . : . .. . :.. .: .:: :.:: KHRZOG --RGLPYRLGINRFADMSWEEFQA---SRLGAAQNCSATLA-------------------GNHRMRDA-PALPETKDWRE 100 110 120 130 140 150 350 360 370 380 390 400 410 QUERY KGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSK--DNFGCDGGHPFYSFLYVLQNE-LCLGDEY ::: :::: ::::: :...:..:. ... . .:.:::...::. .::::.:: : .: :. : : . : KHRZOG DGIVSPVKDQGHCGSCWPFSTTGSLEARYTQATGPPVSLSEQQLADCATRYNNFGCSGGLPSQAFEYIKYNGGLDTEEAY 160 170 180 190 200 210 220 230 420 430 440 450 460 470 480 490 QUERY KYKAKDDMFCLNYRCKR---KVSLS-SIGAVKENQLILALNEVGPLSVNVGVNNDFVAYSEGVYN----GTCSEELNHSV : . . . : .:. . :: : .: : :..: :.. : :.:: : : : :. :::. :: ..::.: KHRZOG PYTGVNGI-C-HYKPENAGVKVLDSVNITLVAEDELKNAVGLVRPVSVAFQVINGFRMYKSGVYTSDHCGTSPMDVNHAV 240 250 260 270 280 290 300 310 500 510 520 530 540 550 560 QUERY LLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPIL : :::: :: : . ::.:::::. ::.::.. . .:: .:::. . :::. KHRZOG LAVGYG-VE------------------------NGVPYWLIKNSWGADWGDNGYFTMEMGKN----MCGIATCASYPIVA 320 330 340 350 360 --------------------------------------------------------------------------- >>A47306 cysteine proteinase - Tetrahymena thermophila (SGC5) (336 aa) initn: 384 init1: 175 opt: 315 Z-score: 330.7 expect() 1.4e-11 Smith-Waterman score: 481; 32.184% identity in 348 aa overlap Entrez lookup Re-search database >A47306 227- 560: -----------------------------------------: 190 200 210 220 230 240 250 260 QUERY VYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHNKLNKNA :. ......: : ::.. . .: : .::.::. : : A47306 MNKKFIILSIIMLMPLCLAQDISVEKLLAYNKWSSQNQRAYLNEDEKLYRQIVFFENLQKIKEHNS-NPNN 10 20 30 40 50 60 70 270 280 290 300 310 320 330 340 QUERY MYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYREKGIVH :. ..:::::...::. : : :.. . .:. : : . .. : .: :.. . . .:.: :: : A47306 TYSIHLNQFSDMTREEFAE--KILMK--QDLINDYMKGIGQQATHNNANNETQMNSQNHT------LAASIDWRTKGAVT 80 90 100 110 120 130 140 350 360 370 380 390 400 410 420 QUERY EPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDC-SKDN----FGCDGGHPFYSFLYVLQNELCLGDEYKYK :::: :::::.:.... .:: .:: ...::::..::: . .: .:: :: : . :. . . :.: : A47306 SVKDQGQCGSCWSFSAAALMESFNFIQNKALVNFSEQQLVDCVTPENGYPSYGCKGGWPATCLDYASKVGITTLDKYPYV 150 160 170 180 190 200 210 220 430 440 450 460 470 480 490 QUERY AKDDMFCLN-----YRCKRKVSLSSIGAVKENQLILALNEVGPLSVNVGVNN-DFVAYSEGVYNGTCSE---ELNHSVLL : . .. .. :. . . . . :.: ::: .:.:: : ..: :. :: :..:: :.. .:::.:: A47306 AVQKNCTVTGTNNGFKLKKWIVIPNTS----NDLKSALN-FSPVSVLVDATNWDY--YSSGIFNG-CNQTNINLNHAVLA 230 240 250 260 270 280 290 500 510 520 530 540 550 560 QUERY VGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPIL ::: . :.: ::.::::: :::.:..::. :.. ::: A47306 VGYDE-----------------KDN-----------WIVKNSWSAGWGEHGYIRLAPNNT-----CGILSSNIQVTA 300 310 320 330 --------------------------------------------------------------------------- >>A45629 cruzipain - Trypanosoma cruzi (467 aa) initn: 368 init1: 191 opt: 317 Z-score: 330.6 expect() 1.4e-11 Smith-Waterman score: 460; 28.286% identity in 350 aa overlap Entrez lookup Re-search database >A45629 222- 563: ------------------------------------------: 190 200 210 220 230 240 250 260 QUERY KDDKKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHNK ::.: .: ..:..::.. :. .. .:. : . . : A45629 MSGWARALSLAAVLVVMACLVPAATASLHAEETLASQFVEFKQKHGRVYESAAEERFRLSVFRENLFLARLHAA 10 20 30 40 50 60 70 270 280 290 300 310 320 330 340 QUERY LNKNAMYKKKVNQFSDYSEEELK-EYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYR : .: . :. ::: ..::.. .: . : . :. : :. . : .: .:.: A45629 ANPHATFG--VTPFSDLTREEFRSRYHNGAAHFAAAQ-ERARVPV------NVEVV-----G----------APAAVDWR 80 90 100 110 120 130 350 360 370 380 390 400 410 QUERY EKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKDNFGCDGGHPFYSFLYVLQNE---LCLGDE .: : :::: :::::::...::.: . .. . ..::: .:.:.: . :: :: .: ...:.. . : A45629 ARGAVTAVKDQGQCGSCWAFSAIGNVECQWFLAGHPLTNLSEQMLVSCDKTDSGCGGGLMNNAFEWIVQENNGAVYTEDS 140 150 160 170 180 190 200 210 420 430 440 450 460 470 480 490 QUERY YKYKAKDDMFCLNYRCKRKVSLSSIGAVK----ENQLILALNEVGPLSVNVGVNNDFVAYSEGVYNGTCSEELNHSVLLV : : . . . . :. . : :. : :. . ::..: : ... ...:. ::... ::.:.:.:::: A45629 YPYASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAACVAVNGPVAVAVDASS-WMTYTGGVMTSCVSEQLDHGVLLV 220 230 240 250 260 270 280 500 510 520 530 540 550 560 QUERY GYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPIL ::. :. . ::::::::. .:::.:..:.....: : . ::. A45629 GYN-------------------------DSAAVPYWIIKNSWTAQWGEDGYIRIAKGSNQ----CLVKEEASSAVVGGPG 290 300 310 320 330 340 A45629 PTPEPTTTTTTSAPGPSPSYFVQMSCTDAACIVGCENVTLPTGQCLLTTSGVSAIVTCGAETLTEEVFFTSTHCSGPSVR 350 360 370 380 390 400 410 420 --------------------------------------------------------------------------- >>S19649 cysteine proteinase (EC 3.4.22.-) precursor (clone LCP1) - American (322 aa) initn: 352 init1: 183 opt: 313 Z-score: 328.9 expect() 1.7e-11 Smith-Waterman score: 476; 28.977% identity in 352 aa overlap Entrez lookup Re-search database >S19649 227- 567: -----------------------------------------: 190 200 210 220 230 240 250 260 QUERY VYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHNKLNK-- .: . .. : ...:. ....: : :.. :: . S19649 MKVVALFLFGLALAAANPSWEEFKGKFGRKYVDLEEERYRLNVFLDNLQYIEEFNKKYERG 10 20 30 40 50 60 270 280 290 300 310 320 330 340 QUERY NAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYREKGI .. :. .:::::...:... .: . : .: . .:. : . :. :.: :: S19649 EVTYNLAINQFSDMTNEKFNAVMKGYKKGP--------RP-----------AAVFTS-----TDAAPESTEV-DWRTKGA 70 80 90 100 110 350 360 370 380 390 400 410 420 QUERY VHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKDNF---GCDGGHPFYSFLYVLQNE-LCLGDEYKY : :::: :::::::...:.::. :. ..:.:::..:::. .. ::.:: ...:: .: . . : : S19649 VTPVKDQGQCGSCWAFSTTGGIEGQHFLKTGRLVSLSEQQLVDCAGGSYYNQGCNGGWVERAIMYVRDNGGVDTESSYPY 120 130 140 150 160 170 180 190 430 440 450 460 470 480 490 QUERY KAKDDMFCLNYRCKRKVSLSSIGAVK--ENQLILALNEVGPLSVNVGVNN-DFVAYSEGVY-NGTCSE-ELNHSVLLVGY .:.:. .: . . .: .. :. : : ..::.:: . ... .: .: ::: . .:: .:.:.:: ::: S19649 EARDNTCRFNSNTIGATCTGYVGIAQGSESALKTATRDIGPISVAIDASHRSFQSYYTGVYYEPSCSSSQLDHAVLAVGY 200 210 220 230 240 250 260 270 500 510 520 530 540 550 560 QUERY GQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPIL : :. .: .:..::::. .:::.:.....::.:.. :::. .. :: S19649 G---------------------SEGGQD----FWLVKNSWATSWGESGYIKMARNRNNN---CGIATDACYPTV 280 290 300 310 320 --------------------------------------------------------------------------- >>S66348 senescence-associated cysteine proteinase precursor (clone SENU3) - (356 aa) initn: 417 init1: 195 opt: 310 Z-score: 325.2 expect() 2.8e-11 Smith-Waterman score: 516; 30.233% identity in 387 aa overlap Entrez lookup Re-search database >S66348 195- 569: ---------------------------------------------: 160 170 180 190 200 210 220 230 QUERY NNEENRKELSNILHKLLEINKLILREEKDDKKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNK :.. .. . .:.. . ... . : .: .: .: : S66348 MSRLSLVLILVAGLFATALAGPATFADKNPIRQVVFPDELENGILQVVGQTRSALSFARFAIRHRK 10 20 30 40 50 60 240 250 260 270 280 290 300 310 QUERY VYKNIDEQMRKFEIFKINYISIKNHNKLNKNAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNIL : ...: ..:::: : :..::. :. :: .:.:.: . .:... : . :. : :. S66348 RYDSVEEIKQRFEIFLDNLKMIRSHNR--KGLSYKLGINEFTDLTWDEFRK----------HKLGA-SQNCSATTKGNLK 70 80 90 100 110 120 130 320 330 340 350 360 370 380 390 QUERY ISEFYTNGKRNEKDIFSKVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSK--DN ... .:: :.:. ::: : :: :::::.:...: .:...:. . .:.:::..:::. .: S66348 LTNVV-------------LPETKDWRKDGIVSPVKAQGKCGSCWTFSTTGALEAAYAQAFGKGISLSEQQLVDCAGAFNN 140 150 160 170 180 190 200 400 410 420 430 440 450 460 QUERY FGCDGGHPFYSFLYV-LQNELCLGDEYKYKAKDDMFC----LNYRCKRKVSLS-SIGAVKENQLILALNEVGPLSVNVGV :::.:: : .: :. ... : . : : .:. . : : : :.. ..:: : .: :. : :.:: : S66348 FGCNGGLPSQAFEYIKFNGGLDTEEAYPYTGKNGI-CKFSQANIGVKVISSVNITLGA--EYELKYAVALVRPVSVAFEV 210 220 230 240 250 260 270 470 480 490 500 510 520 530 540 QUERY NNDFVAYSEGVYNGT-CSE---ELNHSVLLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGEN . : :. ::: .: :.. ..::.:: :::: :: .. : ::.:::::. :::. S66348 VKGFKQYKSGVYASTECGDTPMDVNHAVLAVGYG-VE------------------NGTP------YWLIKNSWGADWGED 280 290 300 310 320 330 550 560 QUERY GFMRLSRNKNGDNVFCGIGEEVFYPIL :.... .:: .::.. . :::. S66348 GYFKMEMGKN----MCGVATCASYPIVA 340 350 --------------------------------------------------------------------------- >>S59598 cysteine proteinase 2 precursor - maize (360 aa) initn: 422 init1: 203 opt: 307 Z-score: 322.0 expect() 4.2e-11 Smith-Waterman score: 475; 30.919% identity in 359 aa overlap Entrez lookup Re-search database >S59598 222- 569: ------------------------------------------: 190 200 210 220 230 240 250 260 QUERY KDDKKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHNK : .: .: ...: :.. : ..:.::. . ... :. S59598 DTAAVVNSGFADSNPIRPVTDRAASALESTVFAALGRTRDALRFARFAVRYGKSYESAAEVHKRFRIFSESLQLVRSTNR 20 30 40 50 60 70 80 90 270 280 290 300 310 320 330 340 QUERY LNKNAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYRE :. :. .:.:.:.: ::.. : : . .. : :...: . . .:: :.:: S59598 --KGLSYRLGINRFADMSWEEFRA---TRLGAAQNC------------------SATLTGNHRMRAAAVA-LPETKDWRE 100 110 120 130 140 150 350 360 370 380 390 400 410 QUERY KGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDC--SKDNFGCDGGHPFYSFLYVLQNE-LCLGDEY ::: :.:: :::::.:...: .:..... . . .:.:::..::: . .::::.:: : .: :. : : . : S59598 DGIVSPVKNQGHCGSCWTFSTTGALEAAYTQATGKPISLSEQQLVDCGFAFNNFGCNGGLPSQAFEYIKYNGGLDTEESY 160 170 180 190 200 210 220 230 420 430 440 450 460 470 480 490 QUERY KYKAKDDMFC--LNYRCKRKVSLSSIGAV--KENQLILALNEVGPLSVNVGVNNDFVAYSEGVYN----GTCSEELNHSV :.. . . : : :: :.:.. . :..: :.. : :.:: : . : :. :::. :: ..::.: S59598 PYQGVNGI-CKFKNENVGVKV-LDSVNITLGAEDELKDAVGLVRPVSVAFEVITGFRLYKSGVYTSDHCGTTPMDVNHAV 240 250 260 270 280 290 300 500 510 520 530 540 550 560 QUERY LLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPIL : :::: :: :.. : :.:::::. ::..:.... .:: .::.. . :::. S59598 LAVGYG-VE-----------------------DGVPY-WLIKNSWGADWGDEGYFKMEMGKN----MCGVATCASYPIVA 310 320 330 340 350 360 --------------------------------------------------------------------------- >>A44938 cysteine proteinase (EC 3.4.22.-) - Trypanosoma cruzi (fragment) (183 aa) initn: 296 init1: 186 opt: 301 Z-score: 320.3 expect() 5.2e-11 Smith-Waterman score: 313; 30.144% identity in 209 aa overlap Entrez lookup Re-search database >A44938 334- 535: -------------------------: 300 310 320 330 340 350 360 370 QUERY PNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKK : .:.: .: : :::: :::::::...::. . . A44938 APAAVDWRARGAVTAVKDQGQCGSCWAFSAIGNVSGQWFLA 10 20 30 40 380 390 400 410 420 430 440 QUERY NKNILSFSEQEVVDCSKDNFGCDGGHPFYSFLYVLQNE---LCLGDEYKYKAKDDMFCLNYRCKRKVSLSSIGAVK---- .. . ..::: .:.:.: . ::.:: .: ...:.. . : : : . . . . :. . : :. A44938 GHPLTNLSEQMLVSCDKTDSGCSGGLMNNAFEWIVQENNGGVYTEDSYPYASGEGISPPCTTSGHTVGATITGHVELPQD 50 60 70 80 90 100 110 120 450 460 470 480 490 500 510 520 QUERY ENQLILALNEVGPLSVNVGVNNDFVAYSEGVYNGTCSEELNHSVLLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNII : :. : :: : :. .....:. ::... ::.:.:..:::::. :. . A44938 EAQIAAWLAVNGP--VAVAHASSWMTYTGGVMTSCVSEQLDHGLLLVGYN-------------------------DSAAV 130 140 150 160 170 530 540 550 560 QUERY YYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPIL :::.:::: A44938 PYWIVKNSW 180 --------------------------------------------------------------------------- >>S07051 cysteine proteinase (EC 3.4.22.-) precursor - Trypanosoma brucei (450 aa) initn: 292 init1: 208 opt: 306 Z-score: 319.6 expect() 5.7e-11 Smith-Waterman score: 452; 29.499% identity in 339 aa overlap Entrez lookup Re-search database >S07051 224- 552: ----------------------------------------- : 190 200 210 220 230 240 250 260 QUERY DKKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHNKLN .: : :...::::. :. .:. :. :. . : . : S07051 MPRTEMVRFVRLPVVLLAMAACLASVALGSLHVEESLEMRFAAFKKKYGKVYKDAKEEAFRFRAFEENMEQAKIQAAAN 10 20 30 40 50 60 70 270 280 290 300 310 320 330 340 QUERY KNAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYREKG : . :. :::...::.. ... .: ...:. .. . :.:. .: .:.:::: S07051 PYATFG--VTPFSDMTREEFRARYRNG--------ASYFAAAQKRLRKTVNV----TTGR---------APAAVDWREKG 80 90 100 110 120 130 350 360 370 380 390 400 410 420 QUERY IVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKDNFGCDGGHPFYSFLYVLQNELCLGDEYKYKAK : : :: :::::::...::::. . .. ..:.::: .:.:. . ::.:: .: ...... : . .:. S07051 AVTPVKVQGQCGSCWAFSTIGNIEGQWQVAGNPLVSLSEQMLVSCDTIDSGCNGGLMDNAFNWIVNSN---GGNVFTEAS 140 150 160 170 180 190 200 210 430 440 450 460 470 480 490 QUERY DDMFCLNYRCKR-KVSLSSIGAV---------KENQLILALNEVGPLSVNVGVNNDFVAYSEGVYNGTCSEELNHSVLLV . : . . ... :::. :. . : : :::.. : ... :. :. :. .. :..:.:.:::: S07051 YPYVSGNGEQPQCQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLAIAVDAES-FMDYNGGILTSCTSKQLDHGVLLV 220 230 240 250 260 270 280 290 500 510 520 530 540 550 560 QUERY GYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPIL ::. .::: : :::::::::. :::.:..:. .. : S07051 GYN-------------------DNSNPP------YWIIKNSWSNMWGEDGYIRIEKGTNQCLMNQAVSSAVVGGPTPPPP 300 310 320 330 340 S07051 PPPPPSATFTQDFCEGKGCTKGCSHATFPTGECVQTTGVGSVIATCGASNLTQIIYPLSRSCSGPSVPITVPLDKCIPIL 350 360 370 380 390 400 410 420 --------------------------------------------------------------------------- >>S47432 cathepsin L (EC 3.4.22.15) - Norway lobster (324 aa) initn: 327 init1: 171 opt: 299 Z-score: 314.5 expect() 1.1e-10 Smith-Waterman score: 452; 28.693% identity in 352 aa overlap Entrez lookup Re-search database >S47432 227- 567: -----------------------------------------: 190 200 210 220 230 240 250 260 QUERY VYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHNKLNKN- .: . .. : ...:. ....: : :.. :: .. S47432 MKVVALFLFGLALAAANPSWEEFKGKFGRKYVDLEEERYRLNVFLDNLQYIEEFNKKYESG 10 20 30 40 50 60 270 280 290 300 310 320 330 340 QUERY -AMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYREKGI . :. .::::: ...: :.... : ... :. . .. : .. : .. :. :.: :: S47432 EVTYNLAINQFSDLTNDE----FNSMM-----------KGYKTSLRPKP-VAVFTST------DAAPETTEV-DWRTKGC 70 80 90 100 110 350 360 370 380 390 400 410 420 QUERY VHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKD---NFGCDGGHPFYSFLYVLQNE-LCLGDEYKY : . :::: :::::::...:..:. : ...:..::..:::. : ::.:: .: :. : . . : : S47432 VTHVKDQGQCGSCWAFSATGSLEGQHFLKYGELVSLAEQQLVDCAGGIYYNQGCNGGWVNQAFKYIKANGGIDTESSYPY 120 130 140 150 160 170 180 190 430 440 450 460 470 480 490 QUERY KAKDDM--FCLNYRCKRKVSLSSIGAVKENQLILALNEVGPLSVNV-GVNNDFVAYSEGVY-NGTCSE-ELNHSVLLVGY .:.:. : : .. ::. .:. . ...::.:: . ... .: .:: ::: . .:: .:.:.:: ::: S47432 EARDNTCRFNSNSVAATCSGFVSIAQGSESPEVRRTTNTGPISVAIDAAHRSFQSYSSGVYYEPSCSSSQLDHAVLAVGY 200 210 220 230 240 250 260 270 500 510 520 530 540 550 560 QUERY GQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPIL : :. .: .:..::::. .:: :.. ..::.:.. :::. .. :: S47432 G---------------------SEGGQD----FWLVKNSWGTSWGSAGYINMARNRNNN---CGIATDASYPTV 280 290 300 310 320 --------------------------------------------------------------------------- >>S46535 probable cysteine proteinase (EC 3.4.22.-) (clone A1494) - Arabidops (313 aa) initn: 459 init1: 188 opt: 294 Z-score: 309.6 expect() 2e-10 Smith-Waterman score: 495; 31.519% identity in 349 aa overlap Entrez lookup Re-search database >S46535 228- 560: -----------------------------------------: 190 200 210 220 230 240 250 260 QUERY YLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHNKLNKNAM : :. .::: .:.:.. .: .:: : . :.:.. .: S46535 ALFKKKFGKVYGSIEEHYYRFSVFKANLLRAMRHQKMDPSA- 10 20 30 40 270 280 290 300 310 320 330 340 QUERY YKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIF--SKVPEILDYREKGIV .. :.:::: .. :.. . : : . .. : :: :. :. ...:: .:.:..: : S46535 -RHGVTQFSDLTRSEFR---RKHLGVKGG----FKLP-----KDA------------NQAPILPTQNLPEEFDWRDRGAV 50 60 70 80 90 350 360 370 380 390 400 410 QUERY HEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKD---------NFGCDGGHPFYSFLYVLQNE-LCLG :.:: :::::.:...: .:.. . ...:.:::..:::... . ::.:: .: :.:.. : S46535 TPVKNQGSCGSCWSFSTTGALEGAHFLATGKLVSLSEQQLVDCDHECDPEEEGSCDSGCNGGLMNSAFEYTLKTGGLMRE 100 110 120 130 140 150 160 170 420 430 440 450 460 470 480 490 QUERY DEYKYKAKDDMFCLNYRCKRKVSLS--SIGAVKENQLILALNEVGPLSVNVGVNNDFV-AYSEGVYNG-TCSEELNHSVL .: : . : : : : .:.: :. ...:.:. : . :::.: ..: .. .: :: ::..:::.:: S46535 KDYPYTGTDGGSCKLDRSKIVASVSNFSVVSINEDQIAANLIKNGPLAV--AINAAYMQTYIGGVSCPYICSRRLNHGVL 180 190 200 210 220 230 240 250 500 510 520 530 540 550 560 QUERY LVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPIL :::::.. .. ..: : ::::::::...::::::... ...: .::. S46535 LVGYGSAGFSQARLKEK------------P------YWIIKNSWGESWGENGFYKICKGRN----ICGVDSLVSTVAATT 260 270 280 290 300 310 S46535 S --------------------------------------------------------------------------- >>A45087 cathepsin S (EC 3.4.22.27) - rat (330 aa) initn: 390 init1: 172 opt: 291 Z-score: 306.2 expect() 3.2e-10 Smith-Waterman score: 440; 30.141% identity in 355 aa overlap Entrez lookup Re-search database >A45087 231- 567: -----------------------------------------: 200 210 220 230 240 250 260 QUERY NDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNID---EQMRKFEIFKINYISIKNHNKLNKNAM ..... .: : :..:.. :.. : : :: .. .: A45087 MAVLGAPGVLCDNGATAERPTLDHHWDLWKKTRMRRNTDQNEEDVRRL-IWEKNLKFIMLHNLEHSMGM 10 20 30 40 50 60 270 280 290 300 310 320 330 340 QUERY YKKKV--NQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYREKGIV .. .: :...:.. ::. :. .: ..: .:.. ... : : . :. .:. .:.:::: : A45087 HSYSVGMNHMGDMTPEEVIGYMGSL-RIP--------RPWN---RSGTLKS------SSNQT-----LPDSVDWREKGCV 70 80 90 100 110 120 350 360 370 380 390 400 410 420 QUERY HEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKD----NFGCDGGHPFYSFLYVLQNELCLGDEYKYK . : :: :::::::.. : .:. . :. ...:.: :..:::: . : :: :: .: :.... . : :: A45087 TNVKYQGSCGSCWAFSAEGALEGQLKLKTGKLVSLSAQNLVDCSTEEKYGNKGCGGGFMTEAFQYIIDTSIDSEASYPYK 130 140 150 160 170 180 190 200 430 440 450 460 470 480 490 QUERY AKDDMFCL------NYRCKRKVSLSSIGAVKENQLILALNEVGPLSVNV--GVNNDFVAYSEGVYNG-TCSEELNHSVLL : :. :: :.: . : .: :. : :. ::.::.. . ...: :. :::. .:.:..::.::. A45087 AMDEK-CLYDPKNRAATCSRYIELP-FG--DEEALKEAVATKGPVSVGIDDASHSSFFLYQSGVYDDPSCTENMNHGVLV 210 220 230 240 250 260 270 280 500 510 520 530 540 550 560 QUERY VGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPIL :::: : . :. ::..::::. ..:..:..:..::... :::. :: A45087 VGYG--------------TLDGKD-----------YWLVKNSWGLHFGDQGYIRMARNNKNH---CGIASYCSYPEI 290 300 310 320 330 --------------------------------------------------------------------------- >>KHDO cysteine proteinase 1 (EC 3.4.22.-) precursor - slime mold (Dictyostel (343 aa) initn: 288 init1: 148 opt: 290 Z-score: 305.0 expect() 3.7e-10 Smith-Waterman score: 495; 29.315% identity in 365 aa overlap Entrez lookup Re-search database >KHDO 223- 569: ------------------------------------------: 190 200 210 220 230 240 250 260 QUERY DDKKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHN-- :.:..:. . :: :.. .: ...::::: : .:.. : KHDO MKVILLFVLAVFTVFVSSRGIPPEEQSQFLEFQDKFNKKYSH-EEYLERFEIFKSNLGKIEELNLI 10 20 30 40 50 60 270 280 290 300 310 320 330 340 QUERY KLNKNAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYR .:..: : ::.:.: : .:.:.:. . :. : . :.. .... . ......: .:.: KHDO AINHKADTKFGVNKFADLSSDEFKNYY-----LNNK---------EAIFTDDLPVADYLDD------EFINSIPTAFDWR 70 80 90 100 110 120 350 360 370 380 390 400 QUERY EKGIVHEPKDQGLCGSCWAFASVGNIESV-FAKKNKNILSFSEQEVVDCSKD----------NFGCDGGHPFYSFLYVLQ .: : :.:: :::::.:...::.:. : ..:: ..:.:::..:::... . ::.:: .. :... KHDO TRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNK-LVSLSEQNLVDCDHECMEYEGEEACDEGCNGGLQPNAYNYIIK 130 140 150 160 170 180 190 200 410 420 430 440 450 460 470 480 QUERY NE-LCLGDEYKYKAKDDMFCLNYRCKRKVSLSSIGAVKENQLILA--LNEVGPLSVNV-GVNNDFVAYSEGVYNGTCS-E : . . : : :. : . ...:.. . .:. ..: . .:::.. . .:. .: : ::.. :. . KHDO NGGIQTESSYPYTAETGTQCNFNSANIGAKISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQF--YIGGVFDIPCNPN 210 220 230 240 250 260 270 280 490 500 510 520 530 540 550 560 QUERY ELNHSVLLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEV :.:..:.:::. . :: .:.: :::.::::. :::.:.. : :.:: ::... : KHDO SLDHGILIVGYS--------------AKNTIFRKNMP------YWIVKNSWGADWGEQGYIYLRRGKNT----CGVSNFV 290 300 310 320 330 QUERY FYPIL :. KHDO STSII 340 --------------------------------------------------------------------------- >>KHSYO4 oil bodies-associated protein P34 precursor - soybean (379 aa) initn: 442 init1: 218 opt: 290 Z-score: 304.3 expect() 4.1e-10 Smith-Waterman score: 490; 29.781% identity in 366 aa overlap Entrez lookup Re-search database >KHSYO4 220- 567: ------------------------------------------: 180 190 200 210 220 230 240 250 QUERY EEKDDKKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMK-EHNKVYKNIDEQMRKFEIFKINYISIKN : .:..:.. : ::..::.: .:. ...:::: : :.. KHSYO4 MGFLVLLLFSLLGLSSSSSISTHRSILDLDLTKFTTQKQVSSLFQLWKSEHGRVYHNHEEEAKRLEIFKNNSNYIRD 10 20 30 40 50 60 70 260 270 280 290 300 310 320 330 QUERY HNKLNKNAM-YKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFS--KVPE : :. .. .:.:.: . .:.. : :..: : ...: .:. .:. .: . : KHSYO4 MNANRKSPHSHRLGLNKFADITPQEFS---KKYLQAP--------KDVSQQIK---------MANKKMKKEQYSCDHPPA 80 90 100 110 120 130 340 350 360 370 380 390 400 410 QUERY ILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKDNFGCDGGHPFYSFLYVLQNE-LCL :.:.::.. . : :: :: :::...: ::.. : . ...:.::::.::: ... : .: . :: .::.. . KHSYO4 SWDWRKKGVITQVKYQGGCGRGWAFSATGAIEAAHAIATGDLVSLSEQELVDCVEESEGSYNGWQYQSFEWVLEHGGIAT 140 150 160 170 180 190 200 210 420 430 440 450 460 470 480 QUERY GDEYKYKAKDDMFCLNYRCKRKVSLS----------SIGAVKENQLILALNEVGPLSVNVGVNNDFVAYSEGVYNG-TCS :.: :.::. : . . ::... : . :. .. :. : :.::.. .. :: :. :.:.: .:. KHSYO4 DDDYPYRAKEGR-CKANKIQDKVTIDGYETLIMSDESTESETEQAFLSAILE-QPISVSIDAK-DFHLYTGGIYDGENCT 220 230 240 250 260 270 280 290 490 500 510 520 530 540 550 560 QUERY EE--LNHSVLLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIG .:: :::::::... . ::: ::::. :::.:.. ..:: .. ::.. KHSYO4 SPYGINHFVLLVGYGSADG-------------------------VDYWIAKNSWGFDWGEDGYIWIQRNTGNLLGVCGMN 300 310 320 330 340 QUERY EEVFYPIL . :: KHSYO4 YFASYPTKEESETLVSARVKGHRRVDHSPL 350 360 370 --------------------------------------------------------------------------- >>S42882 cysteine proteinase (EC 3.4.22.-) precursor - spring vetch (358 aa) initn: 329 init1: 206 opt: 283 Z-score: 297.5 expect() 9.7e-10 Smith-Waterman score: 486; 30.728% identity in 371 aa overlap Entrez lookup Re-search database >S42882 212- 564: -------------------------------------------: 180 190 200 210 220 230 240 250 QUERY EINKLILREEKDDKKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKI .:: . : .. .: .: .. .: : . .:. .: .:: S42882 MDRRFIFALFLFAATATAATDDFLIRQVVDNEEDHLLNAEH--HFTSFKSKFSKSYATKEEHDYRFGVFKA 10 20 30 40 50 60 260 270 280 290 300 310 320 QUERY NYISIKNHNKLNKNAMYKKKVNQFSDYSEEELKEYFKTL---LHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKD : :. : :.::. .: .. ...::: . :... : : :..: : . : : S42882 NLIKAKLHQKLDPTA--EHGITKFSDLTASEFRRQFLGLNKRLRLPAHAQKAPILPTTN--------------------- 70 80 90 100 110 120 330 340 350 360 370 380 390 QUERY IFSKVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSK--D-------NFGCDGGH .:: .:.:::: : :::: :::::::...: .:.. . ...:.:::..:::.. : . ::.:: S42882 ----LPEDFDWREKGAVTPVKDQGSCGSCWAFSTTGALEGAHYLATGKLVSLSEQQLVDCDHVCDPEEAGSCDSGCNGGL 130 140 150 160 170 180 190 200 400 410 420 430 440 450 460 470 QUERY PFYSFLYVLQNELCLGD-EYKYKAKDDMFCLNYRCKRKVSLSSIGAVK--ENQLILALNEVGPLSVNVGVNNDFV-AYSE .: :.::. . . .: : ..: : . : .:.:....:. :.:. : . :::.: ..: .. :: S42882 MNNAFEYLLQSGGVVQEKDYAYTGRDGS-CKFDKSKVVASVSNFSVVSLDEEQIAANLVKNGPLAV--AINAAWMQAYMS 210 220 230 240 250 260 270 480 490 500 510 520 530 540 550 QUERY GVYNG-TCSE-ELNHSVLLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNG :: .:.. .:.:.:::::.:. .: . ...: ::::::::...:::.:.... :..: S42882 GVSCPYVCAKARLDHGVLLVGFGK------------GAYAPIRLKEKP------YWIIKNSWGQNWGEQGYYKICRGRN- 280 290 300 310 320 330 340 560 QUERY DNVFCGIGEEVFYPIL ::. : S42882 ---VCGVDSMVSTVAAAQSNN 350 --------------------------------------------------------------------------- >>C44938 cysteine proteinase (EC 3.4.22.-) - Entamoeba histolytica (fragment) (165 aa) initn: 265 init1: 181 opt: 278 Z-score: 297.4 expect() 9.8e-10 Smith-Waterman score: 291; 33.508% identity in 191 aa overlap Entrez lookup Re-search database >C44938 351- 535: -----------------------: 320 330 340 350 360 370 380 390 QUERY DNILISEFYTNGKRNEKDIFSKVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSK :: :::::.: ... .:. : .. :::::..:::. C44938 QGQCGSCWTFCTTAVLEGRVNKDLGKLYSFSEQQLVDCDA 10 20 30 40 400 410 420 430 440 450 460 QUERY DNFGCDGGHPFYSFLYVLQNE-LCLGDEYKYKAKDDMFCLNYRCKRKVSLSS-IGAVKENQLILALNEVGPLSVNVGVNN .. ::. ::: :. .. .:. : : ..: ::: : . . :. : . .:. : . : ::..:.. .. C44938 SDNGCERGHPSNSLKFIQENNGLGLESDYPYKAVAGT-CKKVKNVATVTGSRRVTDGSETGLQTIIAENGPVAVGMDASR 50 60 70 80 90 100 110 470 480 490 500 510 520 530 540 QUERY -DFVAYSEG-VYNGT-C-SEELNHSVLLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGF .: :..: .:. : : :. .:: : ::::. : :.: :::.:::: C44938 PSFQLYKKGTIYSDTKCRSRMMNHCVTAVGYGS------NSNGK-------------------YWIVKNSW 120 130 140 150 160 550 560 QUERY MRLSRNKNGDNVFCGIGEEVFYPIL --------------------------------------------------------------------------- >>S11862 cysteine proteinase homolog - garden pea (363 aa) initn: 314 init1: 206 opt: 283 Z-score: 297.4 expect() 9.8e-10 Smith-Waterman score: 468; 29.650% identity in 371 aa overlap Entrez lookup Re-search database >S11862 212- 564: -------------------------------------------: 180 190 200 210 220 230 240 250 QUERY EINKLILREEKDDKKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKI .:: . : .. .: .: .. .: : . .:. .: .:: S11862 MDRRFLFALFLFAAVATAVTDDTNNDDFIIRQVVDNEEDHLLNAEH--HFTSFKSKFSKSYATKEEHDYRFGVFKS 10 20 30 40 50 60 70 260 270 280 290 300 310 320 QUERY NYISIKNHNKLNKNAMYKKKVNQFSDYSEEELKEYF---KTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKD : :. : :. :.. .. ...::: . :... : : :..: : . : : S11862 NLIKAKLHQ--NRDPTAEHGITKFSDLTASEFRRQFLGLKKRLRLPAHAQKAPILPTTN--------------------- 80 90 100 110 120 130 330 340 350 360 370 380 390 QUERY IFSKVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSK--D-------NFGCDGGH .:: .:.:::: : :::: :::::::...: .:.. . ...:.:::..:::.. : . ::.:: S11862 ----LPEDFDWREKGAVTPVKDQGSCGSCWAFSTTGALEGAHYLATGKLVSLSEQQLVDCDHVCDPEQAGSCDSGCNGGL 140 150 160 170 180 190 200 400 410 420 430 440 450 460 470 QUERY PFYSFLYVLQNELCLGD-EYKYKAKDDMFCLNYRCKRKVSLSSIGAVK--ENQLILALNEVGPLSVNVGVNNDFV-AYSE .: :.:.. . . .: : ..: : . : .:.:....: :.:. : . :::.: ..: .. .: S11862 MNNAFEYLLESGGVVQEKDYAYTGRDGS-CKFDKSKVVASVSNFSVVTLDEDQIAANLVKNGPLAV--AINAAWMQTYMS 210 220 230 240 250 260 270 280 480 490 500 510 520 530 540 550 QUERY GVYNG-TCSE-ELNHSVLLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNG :: .:.. .:.:.:::::.:. .: . ...: ::::::::...:::.:.... :..: S11862 GVSCPYVCAKSRLDHGVLLVGFGK------------GAYAPIRLKEKP------YWIIKNSWGQNWGEQGYYKICRGRN- 290 300 310 320 330 340 560 QUERY DNVFCGIGEEVFYPIL ::. : S11862 ---VCGVDSMVSTVAAAQSNH 350 360 --------------------------------------------------------------------------- >>S55923 cysteine proteinase (EC 3.4.22.-) precursor - soybean (380 aa) initn: 448 init1: 199 opt: 280 Z-score: 294.1 expect() 1.5e-09 Smith-Waterman score: 473; 31.492% identity in 362 aa overlap Entrez lookup Re-search database >S55923 219- 564: ------------------------------------------: 180 190 200 210 220 230 240 250 QUERY REEKDDKKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKN .. .:: ::..... :.. .: .:.. :: :.. . S55923 ALMCLARVSLFLCALTLSAAHGSTTVQDIARKLKLGDNELLRTEKKFKVFMENYGRSYSTEEEYLRRLGIFAQNMVRAAE 10 20 30 40 50 60 70 80 260 270 280 290 300 310 320 330 QUERY HNKLNKNAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILD :. :. .:.. :.:::: .:.:... . . . . : :. .: . ..: .:: .: S55923 HQALDPTAVHG--VTQFSDLTEDEFEKLYTGV---------NGGFPSSNNAAGGI-APPLEVDG----------LPENFD 90 100 110 120 130 140 340 350 360 370 380 390 400 QUERY YREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDC----------SKDNFGCDGGHPFYSFLYVL .:::: : : : :: :::::::...:.::.. . ...:.:::...:: : :: ::.:: .. :.: S55923 WREKGAVTEVKLQGRCGSCWAFSTTGSIEGANFLATGKLVSLSEQQLLDCDNKCDITEKTSCDN-GCNGGLMTNAYNYLL 150 160 170 180 190 200 210 220 410 420 430 440 450 460 470 480 QUERY QNE-LCLGDEYKYKAKDDMFCLNYRCKRKVSLSSIGAV--KENQLILALNEVGPLSVNVGVNNDFV-AYSEGVYNGT-CS .. : . : : .. : : :..... . :::. : . :::.. ::: :. .: :: :: S55923 ESGGLEEESSYPYTGERGE-CKFDPEKIAVKITNFTNIPADENQIAAYLVKNGPLAM--GVNAIFMQTYIGGVSCPLICS 230 240 250 260 270 280 290 300 490 500 510 520 530 540 550 560 QUERY EE-LNHSVLLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGE .. :::.::::::: . .. . .:.: ::::::::..::::.:...: :... .:::. S55923 KKRLNHGVLLVGYGA------------KGFSILRLGNKP------YWIIKNSWGEKWGEDGYYKLCRGHG----MCGINT 310 320 330 340 350 QUERY EVFYPIL : S55923 MVSAAMVPQPQTTPTKNYASY 360 370 380 --------------------------------------------------------------------------- >>S24988 cysteine proteinase (EC 3.4.22.-) precursor - tomato (361 aa) initn: 357 init1: 177 opt: 272 Z-score: 286.2 expect() 4.1e-09 Smith-Waterman score: 467; 30.110% identity in 362 aa overlap Entrez lookup Re-search database >S24988 217- 564: ------------------------------------------: 180 190 200 210 220 230 240 250 QUERY ILREEKDDKKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMK-EHNKVYKNIDEQMRKFEIFKINYIS :.. : . :...: . .:.: . .:. .....:: : S24988 RLFLLSFLAFALFSSAIAFSDDDPLIRQVVSGNDDNHMLNAEHHFSLFKAKFGKIYASQEEHDHRLKVFKANLHR 10 20 30 40 50 60 70 260 270 280 290 300 310 320 330 QUERY IKNHNKLNKNAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPE : :. :. .: .. ..:::: . :... . : . : .: : : :: . ::. : S24988 AKRHQLLDPSA--EHGITQFSDLTPSEFRRTYLGL-NKP--------RPNLNAEKAPILPT----------KDL----PS 80 90 100 110 120 130 340 350 360 370 380 390 400 QUERY ILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKD---------NFGCDGGHPFYSFLY .:.:::: : . :.:: :::::.:...: .:.. . ...:.:::..:::... . ::.:: .: : S24988 DFDWREKGAVTDVKNQGSCGSCWSFSTTGAVEGAHFLATGELVSLSEQQLVDCDHECDPVEKNDCDAGCNGGLMTTAFEY 140 150 160 170 180 190 200 210 410 420 430 440 450 460 470 480 QUERY VLQ-NELCLGDEYKYKAKDDMFCLNY-RCKRKVSLSSIGAVKENQLILALNEVGPLSVNVGVNNDFV-AYSEGVYNGT-C .:. . : : .: : ... .. : .:: :. .. :.:. : . :::.: :.: .. .: .:: : S24988 TLKAGGLQLEKDYPYTGRNGKCHFDKSRIAASVSNFSVVGLDEDQIAANLLKHGPLAV--GINAAWMQTYVRGVSCPLIC 220 230 240 250 260 270 280 490 500 510 520 530 540 550 560 QUERY SEELNHSVLLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGE .. .:.:::::::. . . . .:.: ::::::::.: :::.:.... : : .. ::. S24988 FKRQDHGVLLVGYGS------------EGFAPIRLKNKP------YWIIKNSWGKTWGEHGYYKICR---GHHI-CGVDA 290 300 310 320 330 340 QUERY EVFYPIL : S24988 MVSTVTATHTTNPNL 350 360 --------------------------------------------------------------------------- >>S68783 cathepsin L (EC 3.4.22.15) precursor - Paramecium tetraurelia (SGC5) (314 aa) initn: 329 init1: 132 opt: 270 Z-score: 285.1 expect() 4.8e-09 Smith-Waterman score: 389; 28.809% identity in 361 aa overlap Entrez lookup Re-search database >S68783 217- 569: ------------------------------------------: 180 190 200 210 220 230 240 250 QUERY ILREEKDDKKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISI ..: :. . .. ..:. : : ..: ....: : : S68783 MMLLGASLYLNNTQEVSDEIDTANLYANWKMKYNRRYTNQRDEMYRYKVFTDNLNYI 10 20 30 40 50 260 270 280 290 300 310 320 330 QUERY KNHNKLNKNAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEI . . ..: . ..:::.:.:..:. . . .: .:: : : :.: .: . S68783 RAFYESPEEATFTLELNQFADMSQQEFAQTYLSL-KVPRTA------------KLNAANSNFQYKGAE------------ 60 70 80 90 100 110 340 350 360 370 380 390 400 410 QUERY LDYREKGIVHEP--KDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCS--KDNFGCDGGHPFYSFLYVLQNEL .:. .. :. : :.:: :::::::..:: .: . . .:::..:::: :: ::.:: .: :: .: : S68783 VDWTDNKKVKYPAVKNQGSCGSCWAFSAVGALEINTDIELNRKYELSEQDLVDCSGPYDNDGCNGGWMDSAFEYVADNGL 120 130 140 150 160 170 180 190 420 430 440 450 460 470 480 QUERY CLGDEYKYKAKDDMFCLNYRCKRKVS--LSSIGAVKENQLILALNE-VGPLSVNVGVN-NDFVAYSEGVYNGTCSEELNH . .: : ::: :: .:. . . . :. . : . . .: :.:. : . : :: . :...::: S68783 AEAKDYPYTAKDGT------CKTSVKRPYTHVQGFKDIDSCDELAQTIQERTVAVAVDANPWQFYRSGVLS-KCTKNLNH 200 210 220 230 240 250 260 490 500 510 520 530 540 550 560 QUERY SVLLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPI .:.::: : : : :.:::...::: : .::. .::. ::: .:: S68783 GVVLVGV------------------------QADGA----WKIRNSWGSSWGEAGHIRLA---GGDT--CGICAAPSFPI 270 280 290 300 310 QUERY L : S68783 LG --------------------------------------------------------------------------- >>S25267 cysteine proteinase (EC 3.4.22.-) precursor - Leishmania mexicana (354 aa) initn: 370 init1: 220 opt: 268 Z-score: 282.3 expect() 6.9e-09 Smith-Waterman score: 424; 27.536% identity in 345 aa overlap Entrez lookup Re-search database >S25267 215- 552: ------------------------------------------ : 180 190 200 210 220 230 240 250 QUERY KLILREEKDDKKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYI :..:. .... .: :.:.:.. . :. ..:. :: :. S25267 MARRNPLLFAIVVTILFVVCYGSALIAQTPPPVDNFVASAHYGSFKKRHGKAFGGDAEEGHRFNAFKQNMQ 10 20 30 40 50 60 70 260 270 280 290 300 310 320 330 QUERY SIKNHNKLNKNAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVP . : : .: : . ..:.: . .: : : :.. :.. ..:: :... ... .: ..: S25267 TAYFLNTQNPHAHYDVS-GKFADLTPQE----FAKLYLNPDY----YARHLKNH-KEDVHVDDSAPSG------VMS--- 80 90 100 110 120 130 340 350 360 370 380 390 400 410 QUERY EILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKDNFGCDGGHPFYSFLYVLQNE--- .:.:.:: : :.::::::::::...::::. .: ......:.::: .:.:.. . ::.:: .. ...:.. S25267 --VDWRDKGAVTPVKNQGLCGSCWAFSAIGNIEGQWAASGHSLVSLSEQMLVSCDNIDEGCNGGLMDQAMNWIMQSHNGS 140 150 160 170 180 190 200 210 420 430 440 450 460 470 480 QUERY LCLGDEYKYKAKDDMF--CLNYRCKRKVSLSSIGAVKENQLILA--LNEVGPLSVNVGVNNDFVAYSEGVYNGTCSEELN . : : . : . . . ...... .. ... .: ... ::..: : ... . : :: . . :: S25267 VFTEASYPYTSGGGTRPPCHD-EGEVGAKITGFLSLPHDEERIAEWVEKRGPVAVAVDATT-WQLYFGGVVSLCLAWSLN 220 230 240 250 260 270 280 490 500 510 520 530 540 550 560 QUERY HSVLLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYP :.::.::. :: :.. : :::.::::...:::.:..::. ..: S25267 HGVLIVGF-----------NK--------NAKPP------YWIVKNSWGSSWGEKGYIRLAMGSNQCMLKNYPVSATVES 290 300 310 320 330 340 QUERY IL S25267 PHTPHVPTTTA 350 --------------------------------------------------------------------------- >>B23705 cysteine proteinase (EC 3.4.22.-) - Entamoeba histolytica (strain SA (312 aa) initn: 260 init1: 91 opt: 264 Z-score: 279.0 expect() 1e-08 Smith-Waterman score: 361; 27.424% identity in 361 aa overlap Entrez lookup Re-search database >B23705 219- 567: ------------------------------------------: 180 190 200 210 220 230 240 250 QUERY REEKDDKKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKN : : : . ..:: . . : .:. ::..: . B23705 VILMLAIANAIDFNTWAANNNKHFTAV-EALRRRAIFNMNARFVA- 10 20 30 40 260 270 280 290 300 310 320 330 QUERY HNKLNKNAMYKKKVN-QFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEIL ..::.. .: .:. :. ...:: ..::: : .. :. ::: . .: ..:: . B23705 --EFNKKGSFKLSVDGPFAAMTNEE----YRTLL--------KSKRTVEE-------------NGKVTYLNI--QAPESV 50 60 70 80 90 340 350 360 370 380 390 400 410 QUERY DYREKGIVHEPKDQGLCGSCWAFASVGNIES-VFAKK--NKNILSFSEQEVVDCSKDNF--GCDGGHPFYSFLYVLQNEL :.: .: : .::. ::::..:.:.. .:. .. .: : : :..::...:.:..:: ::.:: . :..:: . B23705 DWRAQGKVTPIRDQAQCGSCYTFGSLAALEGRLLIEKGGNANTLDLSEEHMVQCTRDNGNNGCNGGLGSNVYDYIIQNGV 100 110 120 130 140 150 160 170 420 430 440 450 460 470 480 QUERY CLGDEYKYKAKDDMFCLNYRCKRKVS-LSSIGAVKENQLILALNEVGPLSVNVGVNN-DFVAYSEGVYNGT-CSEE---L ..: : . :. : . :.. ... .: .: ::.. : ..:.. ... : :. :.:. : :... : B23705 AKESDYPYTGTDSTCKTNVKAFAKITGYNKVPRNNEAELKAALSQ-GLVDVSIDASSAKFQLYKSGAYSDTKCKNNFFAL 180 190 200 210 220 230 240 250 490 500 510 520 530 540 550 560 QUERY NHSVLLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFY :: : :::: :. :: ::..:::. ::..:.. . . : ::.. . .: B23705 NHEVCAVGYGVVDG--------------KEC-----------WIVRNSWGTGWGDKGYINMVIEGNT----CGVATDPLY 260 270 280 290 300 QUERY PIL : B23705 PTGVQYL 310 --------------------------------------------------------------------------- >>JN0718 drought-inducible cysteine proteinase (EC 3.4.22.-) RD19A precursor (368 aa) initn: 465 init1: 181 opt: 262 Z-score: 275.9 expect() 1.6e-08 Smith-Waterman score: 493; 30.077% identity in 389 aa overlap Entrez lookup Re-search database >JN0718 190- 564: ----------------------------------------------: 150 160 170 180 190 200 210 220 QUERY KFINENNEENRKELSNILHKLLEINKLILREEKDDKKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFM .::. : .: : . .:: .: : JN0718 MDRLKLYFSVFVLSFFIVSVSSSDVNDGDDLVIRQVVGGAEPQVLTSED---------HFSLFK 10 20 30 40 50 230 240 250 260 270 280 290 300 QUERY KEHNKVYKNIDEQMRKFEIFKINYISIKNHNKLNKNAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHL .. .::: . .:. .: .:: : . :.::. .: . :.:::: .. :.. : : : . .. : . . JN0718 RKFGKVYASNEEHDYRFSVFKANLRRARRHQKLDPSATHG--VTQFSDLTRSEFR---KKHLGVRSG----FKLPKDAN- 60 70 80 90 100 110 120 310 320 330 340 350 360 370 380 QUERY KDNILISEFYTNGKRNEKDIFSKVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCS : :: .: ..:: .:.:..: : :.:: :::::.:...: .:.. . ...:.:::..:::. JN0718 KAPILPTE--------------NLPEDFDWRDHGAVTPVKNQGSCGSCWSFSATGALEGANFLATGKLVSLSEQQLVDCD 130 140 150 160 170 180 190 390 400 410 420 430 440 450 QUERY KD---------NFGCDGGHPFYSFLYVLQNELCLGDE-YKYKAKDDMFCLNYRCKRKVSLS--SIGAVKENQLILALNEV .. . ::.:: .: :.:.. . .: : : .:: : . : .:.: :. .. :.:. : . JN0718 HECDPEEADSCDSGCNGGLMNSAFEYTLKTGGLMKEEDYPYTGKDGKTCKLDKSKIVASVSNFSVISIDEEQIAANLVKN 200 210 220 230 240 250 260 270 460 470 480 490 500 510 520 530 QUERY GPLSV--NVGVNNDFVAYSEGVYNGTCSEELNHSVLLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSW :::.: :.: . ... : :...:::.::::::: . . ...: : :::::::: JN0718 GPLAVAINAGYMQTYIGGVSCPY--ICTRRLNHGVLLVGYGAAGYAPARFKEK------------P------YWIIKNSW 280 290 300 310 320 330 540 550 560 QUERY SKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPIL .. ::::::... ...: .::. : JN0718 GETWGENGFYKICKGRN----ICGVDSMVSTVAATVSTTAH 340 350 360 --------------------------------------------------------------------------- >>S59597 cysteine proteinase 1 precursor - maize (371 aa) initn: 341 init1: 124 opt: 261 Z-score: 274.8 expect() 1.8e-08 Smith-Waterman score: 446; 29.692% identity in 357 aa overlap Entrez lookup Re-search database >S59597 223- 564: ------------------------------------------: 190 200 210 220 230 240 250 260 QUERY DDKKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHNKL :.:..:... .: ::. ::. .. .:: : . :. : S59597 LLLLSLASAAAVAAAVDAEDPLIRQVVPGGDDNDLELNAESHFLSFVQRFGKSYKDADEHAYRLSVFKDNLRRARRHQLL 10 20 30 40 50 60 70 80 270 280 290 300 310 320 330 340 QUERY NKNAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYREK . .: .. :..::: . :... . : . .... . :. . .: :.: .:. .:.:.. S59597 DPSA--EHGVTKFSDLTPAEFRRTYLGLRKSRRALLRELG---ESAHEAPVLP----TDG----------LPDDFDWRDH 90 100 110 120 130 140 350 360 370 380 390 400 410 QUERY GIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKD---------NFGCDGGHPFYSFLYVLQNELC : : :.:: :::::.:.. : .:.. . .. .:::. :::... . ::.:: .: : ::. S59597 GAVGPVKNQGSCGSCWSFSASGALEGAHYLATGKLEVLSEQQFVDCDHECDSSEPDSCDSGCNGGLMTTAFSY-LQKAGG 150 160 170 180 190 200 210 220 420 430 440 450 460 470 480 QUERY LGDE--YKYKAKDDMFCLNYRCKRKVSLS--SIGAVKENQLILALNEVGPLSVNVGVNNDFV-AYSEGVYNG-TCSEELN : .: : : ..: : . : .:.. :. .: : :. : . :::.. :.: .. .: :: :...:. S59597 LESEKDYPYTGSDGK-CKFDKSKIVASVQNFSVVSVDEAQISANLIKHGPLAI--GINAAYMQTYIGGVSCPYICGRHLD 230 240 250 260 270 280 290 300 490 500 510 520 530 540 550 560 QUERY HSVLLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYP :.::::::: . . ..: : ::::::::...:::::.... :..: : ::. : S59597 HGVLLVGYGASGFAPIRLKDK------------P------YWIIKNSWGENWGENGYYKICRGSNVRNK-CGVDSMVSTV 310 320 330 340 350 360 QUERY IL S59597 SAVHASKE 370 --------------------------------------------------------------------------- >>S57427 cysteine proteinase (EC 3.4.22.-) 4 - Tritrichomonas foetus (fragmen (152 aa) initn: 175 init1: 146 opt: 254 Z-score: 273.4 expect() 2.1e-08 Smith-Waterman score: 285; 36.735% identity in 147 aa overlap Entrez lookup Re-search database >S57427 359- 496: -----------------: 320 330 340 350 360 370 380 390 QUERY YTNGKRNEKDIFSKVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKDNFGCDGG :::.. .::. : . :...:::::..:::. .. :: :: S57427 AFATTQCMESINALRFKSLFSFSEQNLVDCDPQSNGCAGG 10 20 30 40 400 410 420 430 440 450 460 470 QUERY HPFYSFLYV--LQN-ELCLGDEYKYKAKDDMFCLNYRCK---RKVSLSSIGAVKENQLILALNEVGPLSVNVGVN-NDFV :: .:... :: .. : :.: : . : : : : ... :. : .:..:. . :::..: . .. .: S57427 SPFSAFMFISRTQNGQINLEDDYPYTGTDTNDCKFDPSKGYGRITGFMSVQAQSEEDLFKCVASVGPIAVCIDASLASFN 50 60 70 80 90 100 110 120 480 490 500 510 520 530 540 QUERY AYSEGVYNGT-CSEE-LNHSVLLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSR .:: :.:: :: :.:.: .::: S57427 SYSSGIYNDRQCSSTVLDHAVGCIGYGAEGGA 130 140 150 --------------------------------------------------------------------------- >>PQ0650 senescence-associated protein SAG2 - Arabidopsis thaliana (fragment) (95 aa) initn: 201 init1: 201 opt: 250 Z-score: 272.4 expect() 2.4e-08 Smith-Waterman score: 250; 43.750% identity in 80 aa overlap Entrez lookup Re-search database >PQ0650 333- 410: --------- : 300 310 320 330 340 350 360 370 QUERY VPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAK .:: :.:: ::: :::: :::::.:...: .:... . PQ0650 TEAALPETKDWREDGIVSPVKDQGGCGSCWTFSTTGALEAAYHQ 10 20 30 40 380 390 400 410 420 430 440 450 QUERY KNKNILSFSEQEVVDCSK--DNFGCDGGHPFYSFLYVLQNELCLGDEYKYKAKDDMFCLNYRCKRKVSLSSIGAVKENQL . .:.:::..:::. .:.: .:: : .: :. .: PQ0650 AFGKGISLSEQQLVDCAGAFNNYGSNGGLPSQAFEYIKSNGGLDTEKAYRY 50 60 70 80 90 --------------------------------------------------------------------------- >>S68784 cathepsin L - Paramecium tetraurelia (SGC5) (fragment) (294 aa) initn: 300 init1: 188 opt: 252 Z-score: 267.1 expect() 4.8e-08 Smith-Waterman score: 314; 26.781% identity in 351 aa overlap Entrez lookup Re-search database >S68784 212- 560: -------------------------------------------: 180 190 200 210 220 230 240 250 QUERY EINKLILREEKDDKKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKI .:: :. : .. ..:: : . .:.. ..::.. S68784 TAGYYHLQEDDTND------FERWALKNNKFYTE-SEKLYRMEIYNS 10 20 30 40 260 270 280 290 300 310 320 330 QUERY NYISIKNHNKLNKNAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFS : :..::. ... :. ::: :.::. . . :. . .. .. . : .: .. S68784 NKRMIEEHNQ-REDVTYQMGENQFMTLSHEEFVDLY---LQKSDSSVNIMGAS----------LPEVQLEG-------LG 50 60 70 80 90 340 350 360 370 380 390 400 410 QUERY KVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKDNFGCDGGHPFYSFLYVLQNE : :.:. : :.:: :.: :::. ...:. .: .. . .. : :..:::. .: ::.::. :.. :::. S68784 AV----DWRNYTTV---KEQGQCASGWAFSVSNSLEAWYAIRGFQKINASTQQIVDCDYNNTGCSGGYNAYAMEYVLRVG 100 110 120 130 140 150 160 170 420 430 440 450 460 470 480 QUERY LCLGDEYKYKAKDDMFCLNYRCKRKV--SLSSIGAVKENQLILALNEVGPLSVNVGVNNDFVAYSEGVYNGTCSEELNHS : . .: : ::.. : . : . : .:. . : : ::. :.::.: ..: . : :.... :. :: S68784 LVSSTNYPYVAKNQT-CKQSRNGTYFINGYSFVGGSQSN-LQYYLNNY-PISVGVEASN-WQFYRSGLFSNCSSNGTNHY 180 190 200 210 220 230 240 490 500 510 520 530 540 550 560 QUERY VLLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPIL .: ::. .. :: ::..:::. .:::.: .:: ... ::: S68784 ALAVGFDSA-------NN---------------------WIVQNSWGTQWGESGNIRLYPQNT-----CGILNYPYQVY 250 260 270 280 290 --------------------------------------------------------------------------- >>S57423 cysteine proteinase (EC 3.4.22.-) 9 - Tritrichomonas foetus (fragmen (152 aa) initn: 197 init1: 123 opt: 243 Z-score: 262.2 expect() 9e-08 Smith-Waterman score: 243; 32.680% identity in 153 aa overlap Entrez lookup Re-search database >S57423 359- 502: ------------------: 320 330 340 350 360 370 380 390 QUERY YTNGKRNEKDIFSKVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKDNFGCDGG ::..: :. .:. . ..::.: :...::. :. :: :: S57423 AFSAVCAQEGQWARTKGELLSLSVQNLLDCDDDSEGCGGG 10 20 30 40 400 410 420 430 440 450 460 470 QUERY HPFYSFLYVL--QN-ELCLGDEYKYKAKDDMFCLNYRCKRKVSLSSIGA---VKENQLILALNEVGPLSVNVGVNN-DFV :: ....:. :: : : ..: : .... : . .. :: .. . ..:.... : : : .: . . ::. S57423 WPFSGIFHVISEQNGEWMLENDYPYTSHSSNQCY-FDASKGVSKTTKIVQLPINEEKILAACAEYGVISCCIDSSPIDFM 50 60 70 80 90 100 110 480 490 500 510 520 530 540 QUERY AYSEGVYN-GTCSE-ELNHSVLLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSR ::::... :. ::.:.: .:::: :: S57423 YYSEGIFDTDQCNAWELDHAVNIVGYGAEAGTK 120 130 140 150 --------------------------------------------------------------------------- >>S30149 probable cysteine proteinase precursor (clone CYP-7) - common tobacc (363 aa) initn: 319 init1: 141 opt: 247 Z-score: 260.6 expect() 1.1e-07 Smith-Waterman score: 460; 30.508% identity in 354 aa overlap Entrez lookup Re-search database >S30149 225- 564: -----------------------------------------: 190 200 210 220 230 240 250 260 QUERY KKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHNKLNK : : .. .:.: . .:. ..:..:: : . .. :. S30149 LSLLAFVLFSSAIAFSDEDPLIRQVVSETDDSHLLNAEHHFSLFKSKFGKIYASEEEHDHRFKVFKANLRRARLNQLLDP 10 20 30 40 50 60 70 80 270 280 290 300 310 320 330 340 QUERY NAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYREKGI .: .. ...::: . :... . : : : :: : : :: . : .: .:.:..: S30149 SA--EHGITKFSDLTPSEFRRTYLGL-HKP--------KPKLNAEKAPILPT--------------SDLPADFDWRDHGA 90 100 110 120 130 140 350 360 370 380 390 400 410 QUERY VHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKD---------NFGCDGGHPFYSFLYVLQ-NELCL : :.:: :::::.:...: .:.. . ...:.:::..:::... . :: ::: .: :.:. . : : S30149 VTGVKNQGSCGSCWSFSTTGAVEGAHFLATGELVSLSEQQLVDCDHECDPEQQDACDAGCGGGHYATAFEYTLKAGGLQL 150 160 170 180 190 200 210 220 420 430 440 450 460 470 480 490 QUERY GDEYKYKAKDD--MFCLNYRCKRKVSLSSIGAVKENQLILALNEVGPLSVNVGVNNDFV-AYSEGVYNGT-CSEELNHSV .: : .:: : . : ...: :: . :.:. : . :::.: :.: .. .: :: : .. .:.: S30149 EKDYPYTGKDGKCHFDKSKICAAVTNFSVIG-LDEDQIAANLVKHGPLAV--GINAAWMQTYVGGVSCPLICFKRQDHGV 230 240 250 260 270 280 290 500 510 520 530 540 550 560 QUERY LLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPIL :::::: .. . ::.. ::::::::...:::.:.... : : :. ::. : S30149 LLVGYG---------SHGFAPIRLKEKA---------YWIIKNSWGENWGEHGYYKICR---GHNI-CGVDAMVSTVTAA 300 310 320 330 340 350 S30149 HTTNPNL 360 --------------------------------------------------------------------------- >>S30150 probable cysteine proteinase precursor (clone CYP-8) - common tobacc (365 aa) initn: 274 init1: 141 opt: 247 Z-score: 260.6 expect() 1.1e-07 Smith-Waterman score: 439; 29.661% identity in 354 aa overlap Entrez lookup Re-search database >S30150 225- 564: -----------------------------------------: 190 200 210 220 230 240 250 260 QUERY KKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHNKLNK : : .. .:.: . .:. ..:..:: : . .. :. S30150 LPRFALFSSAIAFPDEDPLIRQVVSETETDDSHLLNAEHHFSLFKSKFGKIYASEEEHDHRFKVFKANLRRARLNQLLDP 10 20 30 40 50 60 70 80 270 280 290 300 310 320 330 340 QUERY NAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYREKGI .: .. ...::: . :... . : : : :: : : :: . : .: :.:..: S30150 SA--EHGITKFSDLTPSEFRRTYLGL-HKP--------KPKVNAEKAPILPT--------------SDLPADYDWRDHGA 90 100 110 120 130 140 350 360 370 380 390 400 410 QUERY VHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKD---------NFGCDGGHPFYSFLYVLQ-NELCL : :.:: :::::.:...: .:.. . ...:.:::..:::... . :: :: .: :.:. . : : S30150 VTGVKNQGSCGSCWSFSTTGAVEGAHFLATGELVSLSEQQLVDCDHECDSEQQDSCDAGCGGGLMTTAFEYTLKAGGLQL 150 160 170 180 190 200 210 220 420 430 440 450 460 470 480 490 QUERY GDEYKYKAKDDMFCLNYRCKRKVSLS--SIGAVKENQLILALNEVGPLSVNVGVNNDFV-AYSEGVYNGT-CSEELNHSV .: : .:: : . : .... :. .. :.:. : . :::.: :.: .. .: :: : .. .:.: S30150 EKDYPYTGKDGK-CHFDKSKIAAAVTNFSVIGLDEDQIAANLVKHGPLAV--GINAAWMQTYVGGVSCPLICFKRQDHGV 230 240 250 260 270 280 290 300 500 510 520 530 540 550 560 QUERY LLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPIL :::::: .. . ::.. ::::::::...:::.:.... : : :. ::. : S30150 LLVGYG---------SHGFAPIRLKEKA---------YWIIKNSWGENWGEHGYYKICR---GHNI-CGVDAMVSTVTAA 310 320 330 340 350 S30150 HTTNPNL 360 --------------------------------------------------------------------------- >>S57426 cysteine proteinase (EC 3.4.22.-) 5 - Tritrichomonas foetus (fragmen (155 aa) initn: 135 init1: 135 opt: 236 Z-score: 254.9 expect() 2.3e-07 Smith-Waterman score: 264; 32.026% identity in 153 aa overlap Entrez lookup Re-search database >S57426 359- 499: -----------------: 320 330 340 350 360 370 380 390 QUERY YTNGKRNEKDIFSKVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKDNFGCDGG ::... :. .. ..: .:::..:::. . ::::: S57426 AFSTIVAQEGCHQIETGELLRLSEQNLVDCADNCHGCDGG 10 20 30 40 400 410 420 430 440 450 460 QUERY HPFYSFLYVLQNE---LCLGDEYKYKAKDDMFCLNYRCKRKVS-LSSIGAVKENQLILALNEV----GPLSVNVGVN-ND :. .: :::... : :.: : :.. ..: :: .. :: ..:. . ... :..:: ::...:: : .. S57426 WPIEAFNYVLNKQGGKYCTDDDYPYTAEQALLCYFYRVQQPVSNIASVYQIPQGDE-EAMKEVVANWGPVAINVDSNYGS 50 60 70 80 90 100 110 470 480 490 500 510 520 530 540 QUERY FVAYSEGVY-NGTCSEEL--NHSVLLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMR : :. :.: . .:. . .:.. ..:::..: S57426 FNFYDGGIYVEESCQVKYVYSHAMGIIGYGSAEGQD 120 130 140 150 --------------------------------------------------------------------------- >>A61500 allergen Der f I precursor - house-dust mite (Dermatophagoides farin (319 aa) initn: 284 init1: 166 opt: 236 Z-score: 250.2 expect() 4.2e-07 Smith-Waterman score: 332; 28.696% identity in 345 aa overlap Entrez lookup Re-search database >A61500 224- 552: ----------------------------------------- : 190 200 210 220 230 240 250 260 QUERY DKKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHNKLN : :.: : :: : ...:. :. . :.. .. . : A61500 MKFVLAIASLLVLTVYARPASIKTFEFKKAFNKNYATVEEE----EVARKNFLESLKYVEAN 10 20 30 40 50 270 280 290 300 310 320 330 340 QUERY KNAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFS-KVPEILDYREK :.: .:..:: : .:.:. . :. .. :: .:: ..: :.. . : : .:: :: : A61500 KGA-----INHLSDLSLDEFKNRY--LMS---------AEAFE-QLK-----TQFDLNAETSACRINSVNVPSELDLRSL 60 70 80 90 100 110 350 360 370 380 390 400 410 420 QUERY GIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKDNFGCDGGHPFYSFLYVLQNELCLGDEYKYKA : . :: :::::::..:. ::.. .. :..::::.:::.... :: : .. :. :: . : : : A61500 RTVTPIRMQGGCGSCWAFSGVAATESAYLAYRNTSLDLSEQELVDCASQH-GCHGDTIPRGIEYIQQNGVVEERSYPYVA 120 130 140 150 160 170 180 190 430 440 450 460 470 480 QUERY KDDMFCLNYRCKRKVS----LSSIGAVKE---NQLILALNEVGP-LSVNVGVNNDFVAYSEGVYNG-TCSEELN------ ... ::.: : .:. . .:. ::... ..: .:.. :. :... :.: : .. : A61500 REQ------RCRRPNSQHYGISNYCQIYPPDVKQIREALTQTHTAIAVIIGIK-DLRAFQH--YDGRTIIQHDNGYQPNY 200 210 220 230 240 250 260 490 500 510 520 530 540 550 560 QUERY HSVLLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYP :.: .:::: :.: :: :::..:::. ::..:. .. ..: A61500 HAVNIVGYG---------------------STQGDD----YWIVRNSWDTTWGDSGYGYFQAGNNLMMIEQYPYVVIM 270 280 290 300 310 QUERY IL --------------------------------------------------------------------------- >>S21864 probable cysteine proteinase (EC 3.4.22.-) - Euroglyphus maynei (211 aa) initn: 268 init1: 158 opt: 224 Z-score: 240.7 expect() 1.4e-06 Smith-Waterman score: 277; 27.679% identity in 224 aa overlap Entrez lookup Re-search database >S21864 333- 552: ---------------------------: 300 310 320 330 340 350 360 370 QUERY VPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAK .: :: : : . :: :::::::..:.. ::.. S21864 TYACSINSVSLPSELDLRSLRTVTPIRMQGGCGSCWAFSGVASTESAYLA 10 20 30 40 50 380 390 400 410 420 430 440 450 QUERY KNKNILSFSEQEVVDCSKDNFGCDGGHPFYSFLYVLQNELCLGDEYKYKAKDDMFCLNYRCKRK--VSLSSIGAVKENQL . :...:::.:::...: :: : .. :. :: . : : :... : .: . .:. :.. S21864 YRNMSLDLAEQELVDCASQN-GCHGDTIPRGIEYIQQNGVVQEHYYPYVAREQS-CHRPNAQRYGLKNYCQISPPDSNKI 60 70 80 90 100 110 120 460 470 480 490 500 510 520 QUERY ILALNEVGP-LSVNVGVNNDFVAYSEGVYNGTCSEELNHSVLLVGYGQVEKTKLNYNNKIQ-TYNTKENSNQPDDNIIYY ::... ..: .:.. :. :. . :.: .: ....: : .:.. . . . . . : S21864 RQALTQTHTAVAVIIGIK-DLNAFRH--YDG-------------------RTIMQHDNGYQPNYHAVNIVGYGNTQGVDY 130 140 150 160 170 180 530 540 550 560 QUERY WIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPIL ::..:::. ::.::. .. : : S21864 WIVRNSWDTTWGDNGYGYFAANINL 190 200 210 --------------------------------------------------------------------------- >>JQ0337 allergen Der p 1 - house-dust mite (Dermatophagoides pteronyssinus) (245 aa) initn: 232 init1: 145 opt: 220 Z-score: 235.6 expect() 2.7e-06 Smith-Waterman score: 273; 27.381% identity in 252 aa overlap Entrez lookup Re-search database >JQ0337 302- 550: ------------------------------- : 270 280 290 300 310 320 330 340 QUERY LNKNAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYRE .. :: ::: ..: :.. : .: ...: .: :. JQ0337 KNRFLMSAEAFE-HLK-----TQFDLNAETNACSINGNAPAEIDLRQ 10 20 30 40 350 360 370 380 390 400 410 420 QUERY KGIVHEPKDQGLCGSCWAFASVGNIESVF-AKKNKNILSFSEQEVVDCSKDNFGCDGGHPFYSFLYVLQNELCLGDEYKY : . :: :::::::..:. ::.. :..:.. :...:::.:::.... :: : .. :. .: . . :.: JQ0337 MRTVTPIRMQGGCGSCWAFSGVAATESAYLAHRNQS-LDLAEQELVDCASQH-GCHGDTIPRGIEYIQHNGVVQESYYRY 50 60 70 80 90 100 110 430 440 450 460 470 480 490 QUERY KAKDDMFCLNYRCKR-KVS-LSSIGAVKENQLILALNEVGPLSVNVGVNNDFVAYSEGVYNGTCSEELNHSVLLVGYGQV :... : .: .: .: . :.. :: . . . .:. :. . :.: .... : JQ0337 VAREQS-CRRPNAQRFGISNYCQIYPPNANKIREALAQPQRYCRHYWTIKDLDAFRH--YDG-------RTIIQRDNG-- 120 130 140 150 160 170 180 500 510 520 530 540 550 560 QUERY EKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPIL :. . .. : :: . :::..:::. .::.::. .. : JQ0337 ------YQPNYHAVNIVGYSNAQG---VDYWIVRNSWDTNWGDNGYGYFAANIDLMMIEEYPYVVIL 190 200 210 220 230 240 --------------------------------------------------------------------------- >>S60479 cathepsin B-like cysteine proteinase (EC 3.4.22.-) - Aztec tobacco (356 aa) initn: 210 init1: 107 opt: 220 Z-score: 233.2 expect() 3.7e-06 Smith-Waterman score: 255; 25.369% identity in 339 aa overlap Entrez lookup Re-search database >S60479 256- 564: --------------------------------------: 220 230 240 250 260 270 280 290 QUERY INNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHNKLNKNAMYKKKVN-QFSDYSEEELKEYFKTLLHV- .:. :. :..: .: .: .::... . :: :: : S60479 HMSLTTLFLLIGASIIVLQVVAEQPISQAKAESAILQDSIVKQVNE-NEKAGWKAALNPRFSNFTVSQ----FKRLLGVK 10 20 30 40 50 60 70 300 310 320 330 340 350 360 370 QUERY PNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKK :.. . . :. .: : : .:: : . : . .::: :: :::::::..: .. . : . S60479 PTRKGDLKGIPILTHPKLLELPQEF---DARVAWSNCSTIGRILD------------QGHCGSCWAFGAVESLSDRFCIH 80 90 100 110 120 130 140 380 390 400 410 420 430 440 QUERY NKNILSFSEQEVVDCSKDNF----GCDGGHPFYSFLYVLQNELCLGDEYKYKAKDDMFCLN------Y---RCKRKVSLS .:.: ... : .: :::::.:. .. : ... . . : :. : . : .:.:: . S60479 YGLNISLSANDLYACC--GFLCGDGCDGGYPLQAWKYFVRKGVVTDECDPYF--DNEGCSHPGCEPAYPTPKCHRKCVKQ 150 160 170 180 190 200 210 220 450 460 470 480 490 500 QUERY SIGAVKENQL-----------ILALNEV---GPLSVNVGVNNDFVAYSEGVYNGTCSEELN-HSVLLVGYGQVEKTKLNY .. . ... . ..:: ::. :. : .::. :. :::. . .. .. :.: :.:.: S60479 NLLWSRSKHFGVNAYMITSDPLSIMTEVYKNGPVEVSFTVYEDFAHYKSGVYKHVTGDVMGGHAVKLIGWG--------- 230 240 250 260 270 280 290 510 520 530 540 550 560 QUERY NNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPIL :.:.... ::.. :.:.. ::..:.... :. : : : .:: S60479 --------TSEDGED-------YWLLANQWNRGWGDDGYFKIRRGTNE----CEIEDEVVAGLPSARNLNVELDVSDAYL 300 310 320 330 340 350 S60479 DAAM --------------------------------------------------------------------------- >>S57422 cysteine proteinase (EC 3.4.22.-) 8 - Tritrichomonas foetus (fragmen (152 aa) initn: 121 init1: 121 opt: 212 Z-score: 230.5 expect() 5.2e-06 Smith-Waterman score: 212; 32.680% identity in 153 aa overlap Entrez lookup Re-search database >S57422 359- 502: ------------------: 320 330 340 350 360 370 380 390 QUERY YTNGKRNEKDIFSKVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKDNFGCDGG ::... ::: :. .. .:::..::: .::.:: S57422 AFSAIQAAESVNCIKSGKLERYSEQNLVDCVTACYGCNGG 10 20 30 40 400 410 420 430 440 450 460 470 QUERY HPFYSFLYVL--QN-ELCLGDEYKYKAKDDMFCLNYRCKRKVSLSSIGAVKEN---QLILALNEVGPLSVNVGVNN-DFV :. :.. :: .: : .: : : : : . .:... :..: .: .. ::..: . ..: .: S57422 LMDASYEYIIDSQNGHLNLEADYPYTAVDGT-CKYAQYTPVASITKYVNVNQNDEDDLAAKVETYGPVAVAIDASNWSFQ 50 60 70 80 90 100 110 480 490 500 510 520 530 540 QUERY AYSEGVYN-GTCSE-ELNHSVLLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSR :. :::. .:: :.:.: ::.: .:: S57422 LYTGGVYDEPSCSPYSLDHGVGCVGFGAEGSTK 120 130 140 150 --------------------------------------------------------------------------- >>S57421 cysteine proteinase (EC 3.4.22.-) 6 - Tritrichomonas foetus (fragmen (152 aa) initn: 128 init1: 128 opt: 211 Z-score: 229.5 expect() 6e-06 Smith-Waterman score: 211; 32.653% identity in 147 aa overlap Entrez lookup Re-search database >S57421 359- 496: -----------------: 320 330 340 350 360 370 380 390 QUERY YTNGKRNEKDIFSKVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKDNFGCDGG ::... ::::.: . ..::.:::..::: ::.:: S57421 AFSAIQAIESVYAIGTGTLLSLSEQNLVDCVDTCEGCNGG 10 20 30 40 400 410 420 430 440 450 460 470 QUERY HPFYSFLYVL--QN-ELCLGDEYKYKAKDDMFCLNYRCKRKVSLS---SIGAVKENQLILALNEVGPLSVNVGVNN-DFV .. ::. :: .. : : . :. :. . .. :.: ...: .:..: ... :: .: . .. : S57421 LMDAAYDYVIEKQNGQFNTEASYWYIGIDET-CMFDKYEKAGSISGYYNVAASSEDDLAAKVEQYGPAAVAIDASAVGFQ 50 60 70 80 90 100 110 480 490 500 510 520 530 540 QUERY AYSEGVY-NGTCSE-ELNHSVLLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSR : :.: :. :: :.:.: ::.: S57421 LYWGGIYDNSGCSSVMLDHGVGCVGFGVEGGTQ 120 130 140 150 --------------------------------------------------------------------------- >>S57425 cysteine proteinase (EC 3.4.22.-) 7 - Tritrichomonas foetus (fragmen (152 aa) initn: 111 init1: 111 opt: 210 Z-score: 228.5 expect() 6.8e-06 Smith-Waterman score: 210; 30.128% identity in 156 aa overlap Entrez lookup Re-search database >S57425 359- 502: ------------------: 320 330 340 350 360 370 380 390 QUERY YTNGKRNEKDIFSKVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKDNFGCDGG ::... ::. : .. .. :.:::..::: .::.:: S57425 AFSAIQAAESANAISTGTLESYSEQNLVDCVTACYGCNGG 10 20 30 40 400 410 420 430 440 450 460 QUERY HPFYSFLYVLQNELCLGDEYKYKAKDDMFCLNYRCKRKVSLSSIGAVK---------ENQLILALNEVGPLSVNVGVNN- :. :.. .. : ...:.. . :. :: .. ...:.:. :..: . ::..: . ..: S57425 LMDASYEYIVAKQ---GGKMNYESDYVYTALDGTCKF-TQYTAVGSVSKYVNVAQGDEDDLASKCETYGPIAVAIDASNW 50 60 70 80 90 100 110 470 480 490 500 510 520 530 540 QUERY DFVAYSEGVYN-GTCSE-ELNHSVLLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMR .: :: :.:. .:: :.:.: :::: .:: S57425 SFQLYSGGIYDEKSCSSYSLDHGVGCVGYGVEGSTK 120 130 140 150 --------------------------------------------------------------------------- >>B44938 cysteine proteinase (EC 3.4.22.-) - Trypanosoma brucei (fragment) (166 aa) initn: 253 init1: 143 opt: 209 Z-score: 226.9 expect() 8.3e-06 Smith-Waterman score: 234; 30.612% identity in 196 aa overlap Entrez lookup Re-search database >B44938 351- 535: -----------------------: 320 330 340 350 360 370 380 390 QUERY DNILISEFYTNGKRNEKDIFSKVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSK :: :::::::...::::. . .. ..:.::: .: :. B44938 QGQCGSCWAFSTIGNIEGQWQVAGNPLVSLSEQILVYCDP 10 20 30 40 400 410 420 430 440 450 460 QUERY DNFGCDGGHPFYSFLYVLQNELCLGDEYKYKAKDDMFCLNYRCKR-KVSLSSIGAV---------KENQLILALNEVGPL .:: :: .: ...... : . .:. . : . . ... :::. :. . : : :: B44938 L-IGCGGGLMDNAFNWIVNSN---GGNVFTEASYPYVSGNGEQPQCQMNGHEIGAAITDHVDLPQDEDAIAAYLAENRPL 50 60 70 80 90 100 110 470 480 490 500 510 520 530 QUERY SVNVGVNNDFVAYSEGVYNGTC-SEELNHSVLLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKW .. : . . : ... : .: ::.:.:.::::::. .::: : :::.:::: B44938 AIAVEAPQ-FYGHNGGYILTSCTSEQLDHGVLLVGYN-------------------DNSNPP------YWIVKNSW 120 130 140 150 160 540 550 560 QUERY GENGFMRLSRNKNGDNVFCGIGEEVFYPIL --------------------------------------------------------------------------- >>A41158 dipeptidyl-peptidase I (EC 3.4.14.1) precursor - rat (462 aa) initn: 301 init1: 96 opt: 173 Z-score: 183.4 expect() 0.0022 Smith-Waterman score: 364; 28.460% identity in 383 aa overlap Entrez lookup Re-search database >A41158 194- 552: -------------------------------------------- : 160 170 180 190 200 210 220 230 QUERY ENNEENRKELSNILHKLLEINKLILREEKDDKKVYLINDNYDEKGALEIGM-NEEMKYKKEDPINNIKYASKFFKFMKEH :. ::. :.. .: : .: .. ..: : : .: A41158 HLKKLDTAYDEVGNSGYFTLIYNQGFEIVLNDYKWFAFFKYEVKGSRAISYCHETMTGWVHDVLGR-NWACFVGKKMANH 70 80 90 100 110 120 130 140 240 250 260 270 280 290 300 310 QUERY N-KVYKNIDEQMRKFEIFKINYISIKNHNKLNKNAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKD . ::: :. . : .. : .::: .. .:. . . : : : : .. :.. .: . .: . : A41158 SEKVYVNVAHLGGLQEKYSERLYS-HNHNFVKAINSVQKSWTA-TTYEEYE-KLSIRDLIRRSGHS-GRILRPKPAPITD 150 160 170 180 190 200 210 220 320 330 340 350 360 370 380 QUERY NILISEFYTNGKRNEKDIFSKVPEILDYRE-KGI--VHEPKDQGLCGSCWAFASVGNIES---VFAKKNKNILSFSEQEV .: ...:.: .:: :.:. .:: : ..: ::::..:::.: .:. ........ . .: ::: A41158 EI------------QQQILS-LPESWDWRNVRGINFVSPVRNQESCGSCYSFASLGMLEARIRILTNNSQTPI-LSPQEV 230 240 250 260 270 280 390 400 410 420 430 440 450 QUERY VDCSKDNFGCDGGHPF-----YSFLYVLQNELCL---GDEYKYKAKDDMFCLNYRCKRKVSLSSI-GAVKENQLILALNE :.:: ::::: :. :. . . .: :. . . : :.. :: : .. .... :. .: . : : . A41158 VSCSPYAQGCDGGFPYLIAGKYAQDFGVVEENCFPYTATDAPCKPKEN--CLRYYSSEYYYVGGFYGGCNEALMKLELVK 290 300 310 320 330 340 350 360 460 470 480 490 500 510 520 QUERY VGPLSVNVGVNNDFVAYSEGVYNGTCS-------EELNHSVLLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYW ::..: :..::. : :.:. : : ::.:::::::. : :.: : A41158 HGPMAVAFEVHDDFLHYHSGIYHHTGLSDPFNPFELTNHAVLLVGYGKDPVTGLDY-----------------------W 370 380 390 400 410 420 530 540 550 560 QUERY IIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPIL :.::::...:::.:..:. :. . A41158 IVKNSWGSQWGESGYFRIRRGTDECAIESIAMAAIPIPKL 430 440 450 460 --------------------------------------------------------------------------- >>S02729 actinidain (EC 3.4.22.14) precursor (clone pAC.7) - kiwi fruit (frag (184 aa) initn: 238 init1: 131 opt: 163 Z-score: 179.2 expect() 0.0038 Smith-Waterman score: 273; 29.609% identity in 179 aa overlap Entrez lookup Re-search database >S02729 394- 568: ---------------------: 360 370 380 390 400 410 420 430 QUERY CGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKDNFGCDGGHPFYSFLYVLQNE-LCLGDEYKYKAKDDMFCLNYR ::.::. .: ....: . ..: : :.: :. . S02729 TRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNLDLQ 10 20 30 40 440 450 460 470 480 490 500 QUERY CKRKVSLSSIGAVKENQLILALNEVG--PLSVNVGVNND-FVAYSEGVYNGTCSEELNHSVLLVGYGQVEKTKLNYNNKI .. :.... : :. . : :.:: . . .: : :: :...: :. ..:.: .:::: S02729 NEKYVTIDTYENVPYNNEWALQTAVTYQPVSVALDAAGDAFKHYSSGIFTGPCGTAIDHAVTIVGYGT------------ 50 60 70 80 90 100 110 510 520 530 540 550 560 QUERY QTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPIL .. : :::.::::. :::.:.::. :: .: .. :::. ::. S02729 -------------EGGIDYWIVKNSWDTTWGEEGYMRILRNVGGAGT-CGIATMPSYPVKYNNQNYPKPYSSLINPSAFS 120 130 140 150 160 170 S02729 MSKDGPVE 180 --------------------------------------------------------------------------- >>S58770 cathepsin B (EC 3.4.22.1) precursor - chicken (340 aa) initn: 229 init1: 87 opt: 164 Z-score: 176.2 expect() 0.0055 Smith-Waterman score: 242; 23.826% identity in 298 aa overlap Entrez lookup Re-search database >S58770 314- 564: ------------------------------: 280 290 300 310 320 330 340 QUERY QFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSK---VPEILDYREK----GIVH : . : . : :. :.. .:. .: :.. . S58770 RSIPYYPPLSSDLVNHINKLNTTGRAGHNFHNTDMSYVKKLCGTFLGGPKAPERVDFAEDMDLPDTFDTRKQWPNCPTIS 20 30 40 50 60 70 80 90 350 360 370 380 390 400 410 420 QUERY EPKDQGLCGSCWAFASVGNI-ESVFAKKNKNI-LSFSEQEVVDCS--KDNFGCDGGHPFYSFLYVLQNELCLGDEYKYKA : .::: :::::::..: : . . .. : .. . : .....: . ..::.::.: .. : . : : : .. S58770 EIRDQGSCGSCWAFGAVEAISDRICVHTNAKVSVEVSAEDLLSCCGFECGMGCNGGYPSGAWRYWTERGLVSGGLYDSHV 100 110 120 130 140 150 160 170 430 440 450 460 QUERY KDDMF----CLNY----------------RCKRK--------------VSLSSIGAVKENQLILA-LNEVGPLSVNVGVN . : .. ::.:. ...: :. . .. :.: . . ::. : S58770 GCRAYTIPPCEHHVNGSRPPCTGEGGETPRCSRHCEPGYSPSYKEDKHYGITSYGVPRSEKEIMAEIYKNGPVEGAFIVY 180 190 200 210 220 230 240 250 470 480 490 500 510 520 530 540 QUERY NDFVAYSEGVYNGTCSEELN-HSVLLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMR .::. :. :::. . .:... :.. ..:.: :: .. : ::. :::. :: .::.. S58770 EDFLMYKSGVYQHVSGEQVGGHAIRILGWG-VE------------------NGTP------YWLAANSWNTDWGITGFFK 260 270 280 290 300 310 550 560 QUERY LSRNKNGDNVFCGIGEEVFYPIL . :... ::: :. S58770 ILRGED----HCGIESEIVAGVPRMEQYWTRV 320 330 340 --------------------------------------------------------------------------- >>S02728 actinidain (EC 3.4.22.14) precursor (clone pAC.1) - kiwi fruit (frag (302 aa) initn: 371 init1: 126 opt: 158 Z-score: 170.9 expect() 0.011 Smith-Waterman score: 460; 28.840% identity in 319 aa overlap Entrez lookup Re-search database >S02728 256- 568: --------------------------------------: 220 230 240 250 260 270 280 290 QUERY INNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHNKLNKNAMYKKKVNQFSDYSEEELKEYFKTLLHVPN : .:: . : :: .:::.: . ::.. . . : S02728 LRFIDEHNA-DTNRSYKVGLNQFADLTGEEFRSTYLGFTGGSN 10 20 30 40 300 310 320 330 340 350 360 370 QUERY HMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNK . : :. : : . . .: .:.: : : . :.:: ::.::::......:.. . S02728 KT--KVSN--------------------RYEPRVSQVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTG 50 60 70 80 90 100 380 390 400 410 420 430 440 450 QUERY NILSFSEQEVVDC--SKDNFGCDGGHPFYSFLYVLQNE-LCLGDEYKYKAKDDMFCLNYRCKRKVSLSSIGAVKENQLIL ..:.::::.. : .... ::.::. .: ....: . :..: : :.: :. . .. :.... : : :. S02728 VLISLSEQELIGCGGTQNTRGCNGGYITDGFQFIINNGGINTGENYPYTAQDGECNLDLQNEKYVTIDTYGNVPYNNEWA 110 120 130 140 150 160 170 180 460 470 480 490 500 510 520 QUERY ALNEVG--PLSVNVGVNND-FVAYSEGVYNGTCSEELNHSVLLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYW . : :.:: . . .: : :: :...: :. ..:.: .:::: .. : :: S02728 LQTAVTYQPVSVALDAAGDAFKHYSSGIFTGPCGTAIDHAVTIVGYGT-------------------------EGGIDYW 190 200 210 220 230 530 540 550 560 QUERY IIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPIL :..:::. :::.:.::. :: .: .. :::. ::. S02728 IVENSWDTTWGEEGYMRILRNVGGAGT-CGIATMPSYPVKYNNQNYPKPYSSLINPSAFSMSKDGPVE 240 250 260 270 280 290 300 --------------------------------------------------------------------------- >>B48566 cysteine proteinase Lpcys1 (EC 3.4.22.-) - Leishmania pifanoi (149 aa) initn: 162 init1: 126 opt: 152 Z-score: 169.3 expect() 0.013 Smith-Waterman score: 152; 25.676% identity in 148 aa overlap Entrez lookup Re-search database >B48566 359- 500: ------------------: 320 330 340 350 360 370 380 390 QUERY YTNGKRNEKDIFSKVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKDNFGCDGG ::...::::. .: ......:.::: .:.:.. . ::.:: B48566 AFSAIGNIEGQWAASGHSLVSLSEQMLVSCDNIDEGCNGG 10 20 30 40 400 410 420 430 440 450 460 470 QUERY HPFYSFLYVLQNE---LCLGDEYKYKAKDDMF--CLNY-RCKRKVSLSSIGAVKENQLILALNEVGPLSVNVGVNNDFVA .. ...:.. . : : . : . . :.. :... ... ::..: : ... . B48566 LMDQAMNWIMQSHNGSVFTEASYPYTSGGGTRPPCHDEGEVGAKITGFLSLPHDEERIAEWVEKRGPVAVAVDATT-WQL 50 60 70 80 90 100 110 480 490 500 510 520 530 540 550 QUERY YSEGVYNGTCSEELNHSVLLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKN : :: . . :::.::.::... : B48566 YFGGVVSLCLAWSLNHGVLIVGFNKNAKPP 120 130 140 --------------------------------------------------------------------------- >>S66504 dipeptidyl-peptidase I (EC 3.4.14.1) precursor - human (463 aa) initn: 339 init1: 106 opt: 159 Z-score: 169.1 expect() 0.014 Smith-Waterman score: 367; 26.291% identity in 426 aa overlap Entrez lookup Re-search database >S66504 172- 552: ----------------------------------------------- : 140 150 160 170 180 190 200 QUERY SKKEEYINLERILLEKYKKFINENNEENRKELSNILHKLLEINKLILREEKDDKKVYL--INDNYDE---KGALEIGMNE ..: .. .. ::: .. ::. .: . : .:. S66504 ALLLLLSGDGAVRCDTPANCTYLDLLGTWVFQVGSSGSQRDVNCSVMGPQEKKVVVYLQKLDTAYDDLGNSGHFTIIYNQ 20 30 40 50 60 70 80 90 210 220 230 240 250 260 270 QUERY EMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQM---------RKFEIFKINYISIKNHNKLNKNAMYKKKVNQFSD . : .:. :. . :::. .: .:: .: : :.. : . .. ..: ..: :.. ...:. S66504 GF----EIVLNDYKWFA-FFKYKEEGSKVTTYCNETMTGWVHDVLGRNWACFTGKKVGTASENVYVNTAHLKNSQEKYSN 100 110 120 130 140 150 160 280 290 300 310 320 330 340 QUERY YSEEELKEYFKTLLHVPNHMIEKYSKPFEN-HLKDNILISEFYTNGKRNEK------DIFSKV---PEILDYRE-KGI-- . ... :.. . . .:. : : : : .. : .: .:. : :.:. .:: S66504 RLYKYDHNFVKAINAIQKSWTATTYMEYETLTLGDMIRRSGGHSRKIPRPKPAPLTAEIQQKILHLPTSWDWRNVHGINF 170 180 190 200 210 220 230 240 350 360 370 380 390 400 410 QUERY VHEPKDQGLCGSCWAFASVGNIESVFA--KKNKNILSFSEQEVVDCSKDNFGCDGGHPF-----YSFLYVLQNELCL--- : ..:. ::::..:::.: .:. . .:.. .: ::::.::. ::.:: :. :. . : .: :. S66504 VSPVRNQASCGSCYSFASMGMLEARIRILTNNSQTPILSPQEVVSCSQYAQGCEGGFPYLIAGKYAQDFGLVEEACFPYT 250 260 270 280 290 300 310 320 420 430 440 450 460 470 480 QUERY GDEYKYKAKDDMFCLNYRCKRKVSLSSI-GAVKENQLILALNEVGPLSVNVGVNNDFVAYSEGVYNGTCS-------EEL : . : :.: :. : .. .... :. .: . : : . ::..: : .::. :..:.:. : : S66504 GTDSPCKMKED--CFRYYSSEYHYVGGFYGGCNEALMKLELVHHGPMAVAFEVYDDFLHYKKGIYHHTGLRDPFNPFELT 330 340 350 360 370 380 390 400 490 500 510 520 530 540 550 560 QUERY NHSVLLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFY ::.::::::: . ..: ::.::::. :::::..:. :. . S66504 NHAVLLVGYGTDSASGMDY-----------------------WIVKNSWGTGWGENGYFRIRRGTDECAIESIAVAATPI 410 420 430 440 450 460 QUERY PIL S66504 PKL --------------------------------------------------------------------------- >>A45524 cysteine proteinase (EC 3.4.22.-) AC-1 precursor - nematode (Haemonc (342 aa) initn: 164 init1: 70 opt: 152 Z-score: 163.9 expect() 0.027 Smith-Waterman score: 238; 25.000% identity in 296 aa overlap Entrez lookup Re-search database >A45524 308- 560: -------------------------------: 270 280 290 300 310 320 330 340 QUERY YKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYRE---KGI : : :....: . : .: : :. . A45524 AQRLTGEPLVAYLRRSQNLFEVNSAPTPNFEQKIMDIKYKHQKLNLMVKE--------DPDPEVDIPPSYDPRDVWKNCT 30 40 50 60 70 80 90 100 350 360 370 380 390 400 410 420 QUERY VHEPKDQGLCGSCWAFASVGNIES--VFAKKNKNILSFSEQEVVDCSKDNFG--CDGGHPFYSFLYVLQNELCLGDEYKY . .::. :::::: .... : . .:.: .. ...: ... : . . : :.:: :. .. : . . . : :: A45524 TFYIRDQANCGSCWAVSTAAAISDRICIASKAEKQVNISATDIMTCCRPQCGDGCEGGWPIEAWKYFIYDGVVSGGEYLT 110 120 130 140 150 160 170 180 430 440 450 460 QUERY K--------------AKDDMF--CLNYR----CKRKV-----SLSSIG------AVKENQLILAL-NEV---GPLSVNVG : ..: .. : . :::: .. : : .: . :. .:. ::. .. . A45524 KDVCRPYPIHPCGHHGNDTYYGECRGTAPTPPCKRKCRPGVRKMYRIDKRYGKDAYIVKQSVKAIQSEILRNGPVVASFA 190 200 210 220 230 240 250 260 470 480 490 500 510 520 530 540 QUERY VNNDFVAYSEGVYNGTCSEELN-HSVLLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGF : .:: :. :.:. : .: . :.: ..:.: : :. : .:.: ::: . :::.:. A45524 VYEDFRHYKSGIYKHTAGELRGYHAVKMIGWG--------------------NENNTD-----FWLIANSWHNDWGEKGY 270 280 290 300 310 550 560 QUERY MRLSRNKNGDNVFCGIGEEVFYPIL .:. :. : ::: A45524 FRIIRGTND----CGIEGTIAAGIVDTESL 320 330 340 --------------------------------------------------------------------------- >>A44965 cysteine proteinase (EC 3.4.22.-) AC-2 precursor - nematode (Haemonc (342 aa) initn: 164 init1: 70 opt: 152 Z-score: 163.9 expect() 0.027 Smith-Waterman score: 240; 25.000% identity in 296 aa overlap Entrez lookup Re-search database >A44965 308- 560: -------------------------------: 270 280 290 300 310 320 330 340 QUERY YKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYRE---KGI : : :....: . : .: : :. . A44965 AQRLTGEPLVAYLRRSQNLFEVNSDPTPDFEQKIMSIKYKHQKLNLMVKE--------DPDPEVDIPPSYDPRDVWKNCT 30 40 50 60 70 80 90 100 350 360 370 380 390 400 410 420 QUERY VHEPKDQGLCGSCWAFASVGNIES--VFAKKNKNILSFSEQEVVDCSKDNFG--CDGGHPFYSFLYVLQNELCLGDEYKY . .::. :::::: .... : . .:.: .. ...: ... : . . : :.:: :. .. : . . . : :: A44965 TFYIRDQANCGSCWAVSTAAAISDRICIASKAEKQVNISATDIMTCCRPQCGDGCEGGWPIEAWKYFIYDGVVSGGEYLT 110 120 130 140 150 160 170 180 430 440 450 460 QUERY K--------------AKDDMF--CLNYR----CKRKV-----SLSSIG------AVKENQLILAL-NEV---GPLSVNVG : ..: .. : . :::: .. : : .: . :. .:. ::. .. . A44965 KDVCRPYPIHPCGHHGNDTYYGECRGTAPTPPCKRKCRPGVRKMYRIDKRYGKDAYIVKQSVKAIQSEILKNGPVVASFA 190 200 210 220 230 240 250 260 470 480 490 500 510 520 530 540 QUERY VNNDFVAYSEGVYNGTCSEELN-HSVLLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGF : .:: :. :.:. : .: . :.: ..:.: : :. : .:.: ::: . :::.:. A44965 VYEDFRHYKSGIYKHTAGELRGYHAVKMIGWG--------------------NENNTD-----FWLIANSWHNDWGEKGY 270 280 290 300 310 550 560 QUERY MRLSRNKNGDNVFCGIGEEVFYPIL .:. :..: ::: A44965 FRIVRGSND----CGIEGTIAAGIVDTESL 320 330 340 --------------------------------------------------------------------------- >>B26074 cysteine proteinase (EC 3.4.22.-) 13 - papaya (fragment) (96 aa) initn: 180 init1: 110 opt: 143 Z-score: 163.0 expect() 0.03 Smith-Waterman score: 193; 35.780% identity in 109 aa overlap Entrez lookup Re-search database >B26074 458- 564: -------------: 420 430 440 450 460 470 480 490 QUERY YKYKAKDDMFCLNYRCKRKVSLSSIGAVKENQLILALNEVGPLSVNVGVNNDFV-AYSEGVYNG-TCSEELNHSVLLVGY :::.: ..: .. .: :: ::..:::.:::::: B26074 GPLAV--AINAAYMQTYIGGVSCPYICSRRLNHGVLLVGY 10 20 30 500 510 520 530 540 550 560 QUERY GQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPIL :.. . . ..: : ::.:::::...:::::.... :..: .::. : B26074 GSAGYAPIRLKEK------------P------YWVIKNSWGENWGENGYYKICRGRN----ICGVDSMVSTVAAVHTTSQ 40 50 60 70 80 90 --------------------------------------------------------------------------- >>S60456 cysteine proteinase (EC 3.4.22.-), glucose starvation-induced - maiz (145 aa) initn: 171 init1: 111 opt: 145 Z-score: 162.3 expect() 0.033 Smith-Waterman score: 209; 30.921% identity in 152 aa overlap Entrez lookup Re-search database >S60456 417- 564: ------------------: 380 390 400 410 420 430 440 450 QUERY ILSFSEQEVVDCSKDNFGCDGGHPFYSFLYVLQNELCLGDEYKYKAKDDMFCLNYRCKRKVSLSSIGAVKENQLILALNE .: : ..: : . : .:......:. .. .. :. S60456 ESEKDYPYTGSDGK-CKFDKSKIVASVQNFSVVSVDEAQISANR 10 20 30 40 460 470 480 490 500 510 520 530 QUERY V--GPLSVNVGVNNDFV-AYSEGVYNG-TCSEELNHSVLLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIK . :::.. :.: .. .: :: :...:.:.::::::: . . ..: : ::::: S60456 IKHGPLAI--GINAAYMQTYIGGVSCPYICGRHLDHGVLLVGYGASGFAPMRLKDK------------P------YWIIK 50 60 70 80 90 100 540 550 560 QUERY NSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPIL :::...:::::.... :..: : ::. : S60456 NSWGENWGENGYYKICRGSNVRNK-CGVDSMVSTVSAVHASKE 110 120 130 140 --------------------------------------------------------------------------- >>A23770 asparagine-rich protein - Plasmodium falciparum (537 aa) initn: 64 init1: 64 opt: 152 Z-score: 161.0 expect() 0.039 Smith-Waterman score: 152; 23.105% identity in 277 aa overlap Entrez lookup Re-search database >A23770 65- 323: -------------------------------- : 30 40 50 60 70 80 90 100 QUERY KKFLKKKEKRTFVLSIYAFITFIIFCIGILYFTNKSSAHNNNNNKNEHSLKKEEIELLRVLLEKYKKQKDGILNESSNEE ::::::... .:. . : .:.. .. :. .:. A23770 YNNNNKNNNNNDDGNINYQNTNEFKDNKKNMNFKNQYNNNY 10 20 30 40 110 120 130 140 150 160 170 QUERY DEEKYTLNSETYNNKN-NVS-NIKNDSI---KSKKEEYINLERILLEKYKKFI-NENNEENRKE----LSNILH--KLLE .. ::.:....: :: ...:.:: .:. ..: : . ....:. :::..::.. ..: .: . . A23770 KFDENMNNSNTMHSRNSNVEEHLRNNSIDMNNSNINNYTNQQT----RFSSFMENENENENKNYHTGGMNNNIHFKNKYD 50 60 70 80 90 100 110 180 190 200 210 220 230 240 QUERY INKLILREEKDDKKVYLINDNYDEKGALEIGMNEEMKYKKE-DPINNIKYASK--FF-KFMKEHN-KVYKNIDEQMRKFE :. ... ..: . .:. ::... :..: : .. . ::. : ..: :. ..:...: : .: ...: : A23770 NNNSSMKNTDNNKT----DTSYNMKGTINND-NNNMDYLRNINNINEYKGSAKNKFYTNYMNKNNLKFTQNNNDNMNINE 120 130 140 150 160 170 180 190 250 260 270 280 290 300 310 320 QUERY IFKINYISIKNHNKLNKNAMYKK-KVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNE . : :.:. :.:..... . :... . ..:. ... .. :.: . :. ... ..:. .. : : : A23770 DNNNN-----NNNNNNNNGVFSNYQNNNMNRNNSINIKRNLNNNNNINNNMNKMGSQDKNQNSNNNFYMNYNYQNRKNSM 200 210 220 230 240 250 260 330 340 350 360 370 380 390 400 QUERY KDIFSKVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKDNFGCDGGHPFYSFLY A23770 NNNMNNNMNNNMNHNMNNNMNHNMNNNMNHNMNNNMNHNMNNNMNNINSLDSDMSPNYHAHVKMSMMNYNNNESNTANPN 270 280 290 300 310 320 330 340 --------------------------------------------------------------------------- >>S31914 cysteine proteinase - chickpea (fragment) (111 aa) initn: 120 init1: 60 opt: 141 Z-score: 160.0 expect() 0.044 Smith-Waterman score: 141; 41.379% identity in 58 aa overlap Entrez lookup Re-search database >S31914 356- 410: ------ : 320 330 340 350 360 370 380 390 QUERY SEFYTNGKRNEKDIFSKVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKK--NKNILSFSEQEVVDCSKD-N :::::..: . : : .: . ...:.::::. ::.. : S31914 SCWAFSDVQPV-SEFINKIVTGKFVSLSEQELGDCDRAFN 10 20 30 400 410 420 430 440 450 460 470 QUERY FGCDGGHPFYSFLYVLQNELCLGDEYKYKAKDDMFCLNYRCKRKVSLSSIGAVKENQLILALNEVGPLSVNVGVNNDFVA ::.:: :.: ....: S31914 EGCNGGLMDYAFEFIIRNGGIDTDQDYPYNGFERKCDPTKKNAKVVSIDGYEDVPSYNENALKKAVAHQPVS 40 50 60 70 80 90 100 110 --------------------------------------------------------------------------- >>S46541 cysteine proteinase - chickpea (fragment) (111 aa) initn: 120 init1: 60 opt: 141 Z-score: 160.0 expect() 0.044 Smith-Waterman score: 141; 41.379% identity in 58 aa overlap Entrez lookup Re-search database >S46541 356- 410: ------ : 320 330 340 350 360 370 380 390 QUERY SEFYTNGKRNEKDIFSKVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKK--NKNILSFSEQEVVDCSKD-N :::::..: . : : .: . ...:.::::. ::.. : S46541 SCWAFSDVQPV-SEFINKIVTGKFVSLSEQELGDCDRAFN 10 20 30 400 410 420 430 440 450 460 470 QUERY FGCDGGHPFYSFLYVLQNELCLGDEYKYKAKDDMFCLNYRCKRKVSLSSIGAVKENQLILALNEVGPLSVNVGVNNDFVA ::.:: :.: ....: S46541 EGCNGGLMDYAFEFIIRNGGIDTDQDYPYNGFERKCDPTKKNAKVVSIDGYEDVPSYMEMALKKAVAHQPVS 40 50 60 70 80 90 100 110 --------------------------------------------------------------------------- >>B48435 cysteine proteinase AC-5 - nematode (Haemonchus contortus) (348 aa) initn: 168 init1: 112 opt: 148 Z-score: 159.7 expect() 0.046 Smith-Waterman score: 247; 24.739% identity in 287 aa overlap Entrez lookup Re-search database >B48435 325- 564: -----------------------------: 290 300 310 320 330 340 350 360 QUERY EYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILD----YREKGIVHEPKDQGLCGSCWAF ...: . .:: : ... . : .::. :::::: B48435 YLQKNQDLFEVRTTPTPGFKYKLMDKAFANANQNLNPVVNDDNDTGADLPENYDPRIVWKNCSSFHTIRDQANCGSCWAV 50 60 70 80 90 100 110 120 370 380 390 400 410 420 QUERY ASVGNIES--VFAKKNKNILSFSEQEVVDC--SKDNFGCDGGHPFYSFLYVLQNELCLGDEYKYKA-------------K .... : . .: :.:. . :. ... : .. ..:: :: :. .. . . . : : :. B48435 STAAAISDRICIATKGKKQVYASDTDILTCCGARCGLGCRGGWPIEAWKFFEYDGVVSGGPYLGKGCCSPYPLHPCGRHG 130 140 150 160 170 180 190 200 430 440 450 460 470 QUERY DDMF---CLNYR----CKRKVSLSSIGAVK-----------------ENQLILALNEVGPLSVNVGVNNDFVAYSEGVYN .: : :... :::: . . : . : .. ..: : . . .: .:: :. :.:. B48435 NDTFYGNCVGMAPTPPCKRKCQPGFRGMYRVDKRYGEPGRTYTLPRSEVKIRRDIKERGSVVAVFAVYEDFSHYQSGIYK 210 220 230 240 250 260 270 280 480 490 500 510 520 530 540 550 QUERY GTCSEELN--HSVLLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVF : .. . :.: ..:.:. :: ::.: ::: ::::::.:. :. :. B48435 HTAGRFTGGYHAVKMIGWGK-------------------------DNGTDYWLIANSWHDDWGENGFFRMIRGINN---- 290 300 310 320 330 560 QUERY CGIGEEVFYPIL ::: :.: B48435 CGIEEQVDAGIVDVESL 340 --------------------------------------------------------------------------- >>JC6009 surface-located membrane protein lmp3 - Mycoplasma hominis (SGC3) (1302 aa) initn: 66 init1: 41 opt: 153 Z-score: 156.3 expect() 0.071 Smith-Waterman score: 170; 23.295% identity in 352 aa overlap Entrez lookup Re-search database >JC6009 55- 384: ---------------------------------------- : 20 30 40 50 60 70 80 90 QUERY PSLVETLNKKKKFLKKKEKRTFVLSIYAFITFIIFCIGILYFTNKSSAHNNNNNKNEHSLKK--EEIELL-RVLLEKYKK ... ... :.. ::. . .: .:::.. . .::. .: JC6009 AIAAMCSISNNSTKKYNEAKSRLLNLIKKLDNEGQKKANDFIARQDKKFNSTAFKNHSNTSKLIDEIEIFSKKILENLQK 30 40 50 60 70 80 90 100 100 110 120 130 140 150 160 170 QUERY QKDGILNESSNEEDEEKYTLNSETYNNKNNVSNI-KNDSIKSKKEEYINLERILLEKYKKFINENNEENRKELSNILHKL .. :.: .. . : :. .. :.:. .:. ..:. :. .:. :. . . ..: :: ::: :. .::: : .. JC6009 DEKQRLEELNKLN---KLRLDLQNLINSNDGQNVDSSDAKKALNENQID-DSLPIDKIKK-TNENLENAKKELLNKINAE 110 120 130 140 150 160 170 180 190 200 210 220 230 QUERY LEINKLILREEKDD-KKVYLINDNYD------EKGALEIGMNEEMKYKK-EDPINNIKYA-----SKFFKFMKEHNKVYK :... :. :.:.. :.: ..:. . .: .: ..:: . . .. : ... : ::... ... . .. JC6009 RELQSKIFNEKKQELKRVLDLEDTKEVDFTKEQKVFIETNINETSSIEDIKNKIIEVEKATSSLTSKILNTKQQELQEFE 180 190 200 210 220 230 240 250 240 250 260 270 280 290 300 310 QUERY NIDEQMRKFEIFKIN---YISIKNHNKLNKNAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHM-IEKYSKPFENHLKDNI :: .... : :.: : :::.. . :.. . .:. : .:.: ..:... .. .:: . .: .::.. JC6009 NIKKDLQDFINTKLNDAKYQSIKQKALDKINSL--NGINKNSTI--KEIKAGQNALIKAKEEAGLEKEKLDGQN-IKDTL 260 270 280 290 300 310 320 330 320 330 340 350 360 370 380 390 QUERY LISEFYTNGKRNEKDIFSKVPEILDYREKGIVHEPK-DQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKDN .: .:.:. .: .... .:.: . . . : .:.: . .. :.... .. . :.::. .:: JC6009 --KETINNAKEFKKLLIDNDQKIVDLKSNLDNEISKAEQSLSKDKESMESANDLLNTKLIEYKEILNKFNQEKEAKFNEL 340 350 360 370 380 390 400 400 410 420 430 440 450 460 470 QUERY FGCDGGHPFYSFLYVLQNELCLGDEYKYKAKDDMFCLNYRCKRKVSLSSIGAVKENQLILALNEVGPLSVNVGVNNDFVA JC6009 EQTRKNIENFLTDEVKNNPNYATLVKDLTNAKDAKKSVTNSSNKSDIIAANEALIQALADANKAKDQVDEANKSIKEQLN 410 420 430 440 450 460 470 480 --------------------------------------------------------------------------- >>S23207 DNA-directed RNA polymerase (EC 2.7.7.6) chain a - euglenid (Astasia (528 aa) initn: 92 init1: 69 opt: 147 Z-score: 156.0 expect() 0.074 Smith-Waterman score: 147; 23.022% identity in 417 aa overlap Entrez lookup Re-search database >S23207 2- 397:-------------------------------------------------: 10 20 30 QUERY MVAIKEMKE--LAFARPSLVETLNKKKKFLKKKEKRTFVLS . ::. :. :: :. .. :.. ::::. . . S23207 IIFKKKYISFCKIPFMTEKGTFICNGNTRIIINQLIRSPGIYIKKKKNCLLATLIPKTGSWITIKRN--KKKENFIKIDK 90 100 110 120 130 140 150 160 40 50 60 70 80 90 100 110 QUERY IYAFITFIIFCIGILYFTNKS---SAHNNNNNKNEHSLKKEEIELLRVLLEKYKKQKDGILNESSNEEDEEKYTLNSETY : : .. : ..:: ...:. : ..:: .:: . ::..::. . : . .:... . .. ... . :::..: S23207 INNSIPLFTF-LNILGLSKKKITLSLNKNNYSKNFKISKKKKIEIPTINLTQLSKENNIMRLLKTIRNNLYNRFLNSDNY 170 180 190 200 210 220 230 240 120 130 140 150 160 170 180 190 QUERY NNKNNVSNIKNDSIKSKKEEYINLERILLEKYKKFINENNE-ENRKELSNILHKLLEINKLILR--EEKDDKKVYLINDN : .... .: .. : : . : . .. : . ..: . .: . :: . : ::..: : ..: .:.: S23207 N-LGDTGRFKINKKIYKTEFFTNKKILMPEDFLGIFNYMIKIKNINIKSNKIDDLK--NKIVLSVGELMQNKFNNIIKDI 250 260 270 280 290 300 310 200 210 220 230 240 250 260 270 QUERY YDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYIS--IKNHNKLNKNAMYKKK : . .: . :.: :.:. :. : :. :. : . ::. ....:: . : .: ... : :.. .:.: S23207 YTK--IIEKINKFEQKKKQEEKYNKEKKEEKI-KINKTYFINSKNLTDNIKKF--ITTNPLSQLLNDLNPLSE-LTHKRK 320 330 340 350 360 370 380 390 280 290 300 310 320 330 340 QUERY VNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFE-NHLKDNILISEFYTNGKRNEKDI----FSKVPEILDYREKGI-- .. .. . :. : : . .. : . :.: .. :. :. . . . :.. . : :: . ..::: S23207 ISTLGIGGIEKNKASTK-IREIHNSHYGRIC-PIETSEGKNAGLVLSLAKDIRINKHGFIESPFYKVIKGNIKKNKGIFF 400 410 420 430 440 450 460 470 350 360 370 380 390 400 410 420 QUERY VHEPKDQGL-CGSCWAFASVGNIESVFAKKNKNIL---SFSEQEVVDCSKDNFGCDGGHPFYSFLYVLQNELCLGDEYKY . ..... . : . . .... .. :::: . :.. . .. : :.:. : S23207 ISSENEKNIKIAPCDILKNF-KLNKNYGVKNKNEFYYDSYKSINFISTSTDQFNSIGT 480 490 500 510 520 430 440 450 460 470 480 490 500 QUERY KAKDDMFCLNYRCKRKVSLSSIGAVKENQLILALNEVGPLSVNVGVNNDFVAYSEGVYNGTCSEELNHSVLLVGYGQVEK --------------------------------------------------------------------------- >>A69493 cysteine proteinase homolog - Archaeoglobus fulgidus (1088 aa) initn: 199 init1: 97 opt: 151 Z-score: 155.4 expect() 0.08 Smith-Waterman score: 186; 25.610% identity in 246 aa overlap Entrez lookup Re-search database >A69493 325- 544: --------------------------- : 290 300 310 320 330 340 350 360 QUERY EYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYREKGIVHEPKDQGLCGSCWAFASVG :. :. : .: .:.:. . .::: :::::: ..:. A69493 EEREDNNCASRILTFTSTCSDGVQNGDEEGIDCGGSCLPCNRCDMAS-LPSRFDWRDYTGLSAVRDQGSCGSCWAHSAVA 550 560 570 580 590 600 610 620 370 380 390 400 410 420 QUERY NIES--VFAKKNKNILSFSEQEVVDCSKD---NFG---------CDGGHPFYSFLYVLQNELCLGDEYKYKAKDDMF--- .:: . . .. ...:::....: .: ..: :::: : .. ....: . . . : : . A69493 ALESALIVESGASSSIDLSEQHLLSCEQDCEVGIGDWCWASSGDCDGGWPHKALNFIINNGVPDESCFPYTATNGNCGSK 630 640 650 660 670 680 690 700 430 440 450 460 470 480 490 500 QUERY CLNYRCKRKVSLSSIGAVKENQLIL--ALNEVGPLSVNVGVNNDFVAYSEGVYNGTCSEELNHSVLLVGYGQVEKTKLNY : ... . . .. : :. : : :: ::::: ::. .:..::::: A69493 CGDWEDRTEGAVYR-GKVSSNVEALKRALICHGPLSVA-------------------SENWEHALLLVGY---------- 710 720 730 740 750 510 520 530 540 550 560 QUERY NNKIQTYNTKENSNQPDDNIIYYWIIKNSWS-------KKWGENGFMRLSRNKNGDNVFCGIGEEVFYPIL . ..: :.. ... ::.::::. : : :. A69493 -DDLSTICTQKYGKSG------CWILKNSWGVFSGFSHDVWHEYGYAYIPYSGFKYSDIKNGAYYVIPSDYTLHADFEMM 760 770 780 790 800 810 820 A69493 DGLAAGDLDGDGMAEIVHADRGDLLQIFNLGGLQSSEQMDFEEGDRIATGDVDGDGRMDVIHADRGDEVSIHFQPPVGVV 830 840 850 860 870 880 890 900 --------------------------------------------------------------------------- >>S58729 hypothetical protein N2485 - yeast (Saccharomyces cerevisiae) (237 aa) initn: 34 init1: 34 opt: 141 Z-score: 155.1 expect() 0.083 Smith-Waterman score: 141; 25.333% identity in 225 aa overlap Entrez lookup Re-search database >S58729 55- 269: -------------------------- : 20 30 40 50 60 70 80 90 QUERY PSLVETLNKKKKFLKKKEKRTFVLSIYAFITFIIFCIGILYF--TNKSSAHNNNNNKNEHSLKKEEIELLRVLLEKYKKQ :: .. :: ...:.. : :.:. : .. : .. S58729 MNKRKKNKKNSSSRYFRLVTDSSLDLKESNNSAHEQKEEKQEEFEFPLFSF--- 10 20 30 40 50 100 110 120 130 140 150 160 170 QUERY KDGILNESSNEEDEEKYTLNSETYNNKNNVSNIKNDSIKSKKEEYINLERILLEKYKKFINENNEENRK-ELSNILHKLL :... :.. .::. . ..: . ...:: .: :.: .:: :: :: : : . . ... . . :.: . .. S58729 --GVVEASTSPAQEEQGSSTQEKDTPQTEVSLMKI-SLKEPEEEIINQER---PKDYYFASYSADQKLQFQQSSIDYDVI 60 70 80 90 100 110 120 180 190 200 210 220 230 240 QUERY --EINKLILREEK-DDKKVYL---INDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKF-FKFMKEHNKVYKNIDEQMR : .:.. . . :: : : : : ... .:. ...:.: ::. : .. . :.:. .. ::.. ..: .:.. S58729 IQESTKILEDDLRIRDKWPYCQGRIIDLYKHNARIELEQQKELKIKKRRPGQKQRAAKKLALERTKERDTKAREIKKQLK 130 140 150 160 170 180 190 200 250 260 270 280 290 300 310 320 QUERY KFEIFKINYISIKNHNKLNKNAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKR : .. : . . ::..:. : . : S58729 K-KFHKRG--GKKNKKKVPLNPLAKAGSTPKFRTE 210 220 230 --------------------------------------------------------------------------- >>S16162 cruzipain (EC 3.4.22.-) - Trypanosoma cruzi (fragment) (173 aa) initn: 98 init1: 98 opt: 138 Z-score: 154.0 expect() 0.095 Smith-Waterman score: 138; 44.737% identity in 38 aa overlap Entrez lookup Re-search database >S16162 526- 563: ----: 490 500 510 520 530 540 550 560 QUERY LNHSVLLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVF . ::::::::. .:::.:..:.....: : . ::. S16162 AAVPYWIIKNSWTAQWGEDGYIRIAKGSNQ----CLVKEEAS 10 20 30 QUERY YPIL S16162 SAVVGGPGPTPEPTTTTTTSAPGPSPSYFVQMSCTDAACIVGCENVTLPTGQCLLTTSGVSAIVTCGAETLTEEVFFTST 40 50 60 70 80 90 100 110 --------------------------------------------------------------------------- >>S62150 hypothetical protein YNL050c - yeast (Saccharomyces cerevisiae) (270 aa) initn: 34 init1: 34 opt: 140 Z-score: 153.2 expect() 0.11 Smith-Waterman score: 140; 25.229% identity in 218 aa overlap Entrez lookup Re-search database >S62150 60- 269: -------------------------- : 20 30 40 50 60 70 80 90 QUERY TLNKKKKFLKKKEKRTFVLSIYAFITFIIFCIGILYFTNKSSAHNNNNNKNEHSLKKEEIELLRVLLEKYKKQKDGILNE :: ...:.. : :.:. : .. : .. :... S62150 EGEGLGEDYDSNSSSKNNSEHVEVLVPPTEFEFVEVERTDSSLDLKESNNSAHEQKEEKQEEFEFPLFSF-----GVVEA 20 30 40 50 60 70 80 100 110 120 130 140 150 160 170 QUERY SSNEEDEEKYTLNSETYNNKNNVSNIKNDSIKSKKEEYINLERILLEKYKKFINENNEENRK-ELSNILHKLL--EINKL :.. .::. . ..: . ...:: .: :.: .:: :: :: : : . . ... . . :.: . .. : .:. S62150 STSPAQEEQGSSTQEKDTPQTEVSLMKI-SLKEPEEEIINQER---PKDYYFASYSADQKLQFQQSSIDYDVIIQESTKI 90 100 110 120 130 140 150 160 180 190 200 210 220 230 240 250 QUERY ILREEK-DDKKVYL---INDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKF-FKFMKEHNKVYKNIDEQMRKFEIFKI . . . :: : : : : ... .:. ...:.: ::. : .. . :.:. .. ::.. ..: .:..: .. : S62150 LEDDLRIRDKWPYCQGRIIDLYKHNARIELEQQKELKIKKRRPGQKQRAAKKLALERTKERDTKAREIKKQLKK-KFHKR 170 180 190 200 210 220 230 240 260 270 280 290 300 310 320 330 QUERY NYISIKNHNKLNKNAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFS . . ::..:. : . : S62150 G--GKKNKKKVPLNPLAKAGSTPKFRTE 250 260 270 --------------------------------------------------------------------------- >>S57451 cysteine proteinase (EC 3.4.22.-) 3 - Tritrichomonas foetus (fragmen (157 aa) initn: 136 init1: 99 opt: 136 Z-score: 152.6 expect() 0.11 Smith-Waterman score: 214; 28.289% identity in 152 aa overlap Entrez lookup Re-search database >S57451 359- 497: -----------------: 320 330 340 350 360 370 380 390 QUERY YTNGKRNEKDIFSKVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKDNFGCDGG .::. . .:... .. .....::: ::: : ::: :: S57451 SFAACAAFEGAWFASSGKLVKISEQLFVDCCKYCFGCYGG 10 20 30 40 400 410 420 430 440 450 460 QUERY HPFYSFLYVLQN---ELCLGDEYKYKAKDDMFC-----LNY-RCKRKVSLSSIGAVKENQLIL-ALNEVGPLSVNVGVNN .. ..... ..:: ..: : . . . : . : . .. : . :.. .....:. .:.:.:::.: . ... S57451 SADAAYNWAIHENDGKVCLHEDYPYTGTQGV-CRYKSSMAYGHVSQYVRVFSLSEISDEDLMCQTLEEIGPLTVAIDADG 50 60 70 80 90 100 110 470 480 490 500 510 520 530 540 QUERY -DFVAYSEGVY-NGTCSE-ELNHSVLLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFM : :. :.: . :: . . ::.: .::::. S57451 AKFRLYDSGIYYDDTCVQGDANHAVAVVGYGEEDNGEQ 120 130 140 150 --------------------------------------------------------------------------- >>KHRTB cathepsin B (EC 3.4.22.1) precursor - rat (339 aa) initn: 247 init1: 100 opt: 136 Z-score: 147.6 expect() 0.22 Smith-Waterman score: 238; 23.636% identity in 275 aa overlap Entrez lookup Re-search database >KHRTB 333- 564: ----------------------------: 300 310 320 330 340 350 360 QUERY VPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYREK----GIVHEPKDQGLCGSCWAFASVGNI-E .:: .: ::. . . .::: :::::::..: . . KHRTB TWQAGRNFYNVDISYLKKLCGTVLGGPNLPERVGFSEDINLPESFDAREQWSNCPTIAQIRDQGSCGSCWAFGAVEAMSD 40 50 60 70 80 90 100 110 370 380 390 400 410 QUERY SVFAKKNKNI-LSFSEQEVVDCS--KDNFGCDGGHPFYSFLYVLQNEL-----------CL------------------- . . : . . : .... : . . ::.::.: .. . .. : :: KHRTB RICIHTNGRVNVEVSAEDLLTCCGIQCGDGCNGGYPSGAWNFWTRKGLVSGGVYNSHIGCLPYTIPPCEHHVNGSRPPCT 120 130 140 150 160 170 180 190 420 430 440 450 460 470 480 QUERY --GDEYK-YKAKDDMFCLNYRCKRKVSLSSIGAVKENQLILA-LNEVGPLSVNVGVNNDFVAYSEGVYNGTCSEELN-HS :: : : . . .:. .. . .: .. .. :.: . . ::. : .::..:. :::. .. .. :. KHRTB GEGDTPKCNKMCEAGYSTSYKEDKHYGYTSYSVSDSEKEIMAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDVMGGHA 200 210 220 230 240 250 260 270 490 500 510 520 530 540 550 560 QUERY VLLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPIL . ..:.: .: : . ::.. :::. ::.:::... :..: ::: :. KHRTB IRILGWG-IE------------------------NGVPYWLVANSWNVDWGDNGFFKILRGEN----HCGIESEIVAGIP 280 290 300 310 320 330 KHRTB RTQQYWGRF --------------------------------------------------------------------------- >>A48454 cathepsin B-like cysteine proteinase (EC 3.4.-.-) - nematode (Ostert (342 aa) initn: 217 init1: 107 opt: 136 Z-score: 147.6 expect() 0.22 Smith-Waterman score: 259; 26.354% identity in 277 aa overlap Entrez lookup Re-search database >A48454 333- 564: ----------------------------: 300 310 320 330 340 350 360 QUERY VPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYREK-----GIVHEPKDQGLCGSCWAFASVGNIE .:: : : . .. : : ::. :::::: .:.. . A48454 VTATPVPYFKQRLMDLKYIDQNNIPDEEVEDEELEENNDDIPESYDPRIQWANCSSLFHIP-DQANCGSCWAVSSAAAMS 60 70 80 90 100 110 120 370 380 390 400 410 420 QUERY S---VFAKKNKNILSFSEQEVVDC-SKDNFGCDGGHPFYSFLYVLQNELCLGDEYKYKAK-------------DDMF--- . . .: :..: .: :.::.: . . ::.:: :. .: . .. . : .:. :.. .. . A48454 DRICIASKGAKQVL-ISAQDVVSCCTWCGDGCEGGWPISAFRFHADEGVVTGGDYNTKGSCRPYEIHPCGHHGNETYYGE 130 140 150 160 170 180 190 200 430 440 450 460 470 480 QUERY CLNY----RCKRKVSLSSIGAV--------KENQLILALNEV-------GPLSVNVGVNNDFVAYSEGVYNGTCSEELN- :... ::::. :. . : :: ... . ::. .. : .::. : :.:. ... . A48454 CVGMADTPRCKRRCLLGYPKSYPSDRYYGKKAYQLKNSVKAIQKDIMKNGPVVATYTVYEDFAHYRSGIYKHKAGRKTGL 210 220 230 240 250 260 270 280 490 500 510 520 530 540 550 560 QUERY HSVLLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYP :.: ..:.: :... : :::. ::: ::::::.:. :..: ::. :.. A48454 HAVKVIGWG-------------------EEKGTP------YWIVANSWHDDWGENGFFRMHRGSND----CGFEERMAAG 290 300 310 320 330 QUERY IL A48454 SVQ 340 --------------------------------------------------------------------------- >>S14535 asparagine-rich protein (clone 25C4) - Plasmodium falciparum (669 aa) initn: 77 init1: 63 opt: 139 Z-score: 146.3 expect() 0.26 Smith-Waterman score: 177; 21.127% identity in 284 aa overlap Entrez lookup Re-search database >S14535 57- 321: -------------------------------- : 20 30 40 50 60 70 80 90 QUERY LVETLNKKKKFLKKKEKRTFVLSIYAFITFIIFCIGILYFTNKSSAHNNNNNKNEHSLKK--EEIELLRVLLEKYKKQKD .: : :::::... ::. : .. ... .... S14535 QNNHVLSNDTNVSLESNASNKTNQINKLNLNKQISNCFSKDNNNNNETQFSLSTCLTESSFMDKKMKENSSKET 10 20 30 40 50 60 70 100 110 120 130 140 150 160 170 QUERY GILNESSNEEDEEKYTLNSETYNNKNNVSNIKNDSIKSKKEEYINLERILLEKYKKFINENNEENRKELSNILHKLLE-- .: ::..: :: :. .. ::..:..:.. :. : .: . . ..: ::.::..: .. .:. ... . S14535 NITNENQNGED---YS--NDILNNNDNMNNVHCDNSTFMKIHYNDQ---FNNHYPITINNNNNNNDSNNNNMYNNMSNNI 80 90 100 110 120 130 140 180 190 200 210 220 230 240 250 QUERY INKLILREEKDDKKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKIN ... ...... : : . .. . .::..:. . .. .:: . .:.. .:....: . : S14535 YHNIYHNISNNNNNISNNNMNNNMNNISNNNMNNNMNNNMNNNMNN-----------NMNNNMNNNMNNNMNN------N 150 160 170 180 190 200 260 270 280 290 300 310 QUERY YISIKNHNKLNKNAMYKKKVNQFSDYSEEELKEYFKTLL------HVPNHMIEKYSKPFENHLK---DNI------LISE .:.:.:..:.. . .. .::..: . :...... : .. :.. .. :. :.:..: .:: :..: S14535 TTNINNNNQFNHTHFNNNLINQYNDQYNIPLNNHLNNQLTTQGNCNMNNELSKQLSNQFNNQVKSPLNNIVGSANNLVNE 210 220 230 240 250 260 270 280 320 330 340 350 360 370 380 390 QUERY FYTNGKRNEKDIFSKVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKDNFGCDG .... S14535 YFSDMMVYISWVPKSARCQYPRELPKSAAERNKLSEYIVAKEQMLYILRVMGYDEIDNIYFHPPKGSHIKIKFKSMICMN 290 300 310 320 330 340 350 360 --------------------------------------------------------------------------- >>H64709 hypothetical protein HP1520 - Helicobacter pylori (strain 26695) (430 aa) initn: 58 init1: 58 opt: 136 Z-score: 146.1 expect() 0.26 Smith-Waterman score: 136; 24.752% identity in 303 aa overlap Entrez lookup Re-search database >H64709 64- 342: ---------------------------------- : 30 40 50 60 70 80 90 100 QUERY KKKFLKKKEKRTFVLSIYAFITFIIFCIGILYFTNKSSAHNNNNNK-NEHSLKKEEIELLRVLLEKYKKQKDGILNESSN ::. : : . .: .::. . :: :.. .... : . H64709 YGIGKIQKTSLDFSKSNSYLLYAQNGVFKTSFAKSLTDLINNEMPKDNFYPNRKSKIEI-EFNGEKILKENVAVFH-SYD 20 30 40 50 60 70 80 110 120 130 140 150 160 170 QUERY EE--DEEKYTLNSETYNNKNNVSNIKNDSIKSKKEEYINLERILLE-KYKKFINENNEENRKELSNILHKLLEINKLILR :: .:.. : . :.. .:: . : :: .:. : :.. :. ..:. : . .:: :.: H64709 EEFSSEDSVTTFMAKSDLKQQYDNILLELEKEKKALLKSLRDIASGFDYEEEIKTIKNEKNKSFYEILD-----NHLTEI 90 100 110 120 130 140 150 160 180 190 200 210 220 230 240 QUERY EEKDDKKVYLINDNYDEKGALE--IGMNEEMKYKKEDPINNIKYASKFFKFMK------EHNKVYKNIDEQMRKFEI--- : .. . . : .: . .. .. .... . . ... ::.:: :. .: :. :. : :. H64709 ESSEKHYSFKYRDIFDGSKKVKDFVNKHHDLIEQYFNKYQELLSQSKIFKHMNSGDFGTNHADDLKKALENNRFFKANHS 170 180 190 200 210 220 230 240 250 260 270 280 290 300 310 320 QUERY FKINYISIKNHNKLNKNAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHL-KDNILISEFYTNGKRNEK .:: : :..::. ..... :.. ..::::: : . .: : .: : :.. . ::: :..:: . .: H64709 LKIAGEEITNYQKLSD--IFENEKNRI--LNNEELKESFDKIEKVIN--ANKELKAFKDAISKDNTLLTEFLDYDSFRKK 250 260 270 280 290 300 310 330 340 350 360 370 380 390 QUERY DIFSKVPEILD--------YREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKDNFGCDGGH .:: . .... :::: H64709 VLFSYLKQVIQNVKSLVNLYREKKPEIEEIIKQASKDQKEWESVIEIFNQRFLVPFKVELQNQKDILLNKDAAQFRFIFS 320 330 340 350 360 370 380 390 --------------------------------------------------------------------------- >>D48435 cysteine proteinase AC-3 - nematode (Haemonchus contortus) (341 aa) initn: 234 init1: 97 opt: 134 Z-score: 145.5 expect() 0.28 Smith-Waterman score: 250; 24.684% identity in 316 aa overlap Entrez lookup Re-search database >D48435 296- 564: ---------------------------------: 260 270 280 290 300 310 320 330 QUERY IKNHNKLNKNAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGK----RNEKDIFS ...: : : . .:..:. .: ... ... . . D48435 TLCAYLYVASGADVNAAQEIPLEAQTLSGEPLVAYLRKNQNLFEVNSTPTPG-FKQKIMDIKFRNQNPNLIVKDDPEPED 10 20 30 40 50 60 70 80 340 350 360 370 380 390 400 QUERY KVPEILDYRE---KGIVHEPKDQGLCGSCWAFASVGNIES--VFAKKNKNILSFSEQEVVDCSKD--NFGCDGGHPFYSF .:: : :. . .::. :::::: .... : . .: : .. ...: ..: : .:::::: . .. D48435 DIPEEYDPRKIWSNCTSFYIRDQANCGSCWAVSTAAAISDRICIATKARKQVNISATDLVTCCTPTCGFGCDGGWSIKAW 90 100 110 120 130 140 150 160 410 420 430 440 450 QUERY LYVLQNELCLGDEYKYK--------------AKDDMF--CLNY----RCKRKVS--------LSSIGAVKENQLILALNE : : : ::. : ..: .. : . ::.: . ... .. :: ... D48435 EYFTYAGLVSGGEYRSKRCCRPYPIHPCGHHGNDTYYGECPEEASTPSCKKKCQPGYRKLYRMDKRYGTDAFQLPKSVEA 170 180 190 200 210 220 230 240 460 470 480 490 500 510 520 QUERY V-------GPLSVNVGVNNDFVAYSEGVYNGTCSE-ELNHSVLLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYY . ::.... .: .:: :. :.: : .: . :.: ..:.: . :. : : D48435 IQKELLKNGPVTASFAVYEDFSLYKSGIYRHTAGELRGYHAVKMIGWG--------------------TENRTD-----Y 250 260 270 280 290 530 540 550 560 QUERY WIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPIL :.: ::: :::::..:. :. : ::: :.: D48435 WLIANSWHDDWGENGYFRIIRGIND----CGIEENVAAGLIDVESL 300 310 320 330 340 --------------------------------------------------------------------------- >>B48454 cathepsin B-like cysteine proteinase (EC 3.4.-.-) CP-3 - nematode (O (174 aa) initn: 143 init1: 98 opt: 129 Z-score: 144.8 expect() 0.31 Smith-Waterman score: 152; 30.973% identity in 113 aa overlap Entrez lookup Re-search database >B48454 458- 569: -------------: 420 430 440 450 460 470 480 490 QUERY YKYKAKDDMFCLNYRCKRKVSLSSIGAVKENQLILALNEVGPLSVNVGVNNDFVAYSEGVYNGTCSEELN-HSVLLVGYG ::. .. : .::. :. :.:. : .. . :.: ..:.: B48454 CQKTCQRGYLKAYKEDKHFGKSAYRLPNNVKAIQRDIMKNGPVVAGFIVYEDFAHYKSGIYKHTAGRMTGGHAVKIIGWG 60 70 80 90 100 110 120 130 500 510 520 530 540 550 560 QUERY QVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPIL . . : : ::.: ::: :::.::.:. :. :. : : : :: :. B48454 KEKGT-------------------P------YWLIANSWHDDWGEKGFYRMIRGINN----CRIEEMVFAGIV 140 150 160 170 --------------------------------------------------------------------------- >>S35580 proteinase IV - mountain papaya (fragment) (43 aa) initn: 102 init1: 102 opt: 119 Z-score: 143.6 expect() 0.36 Smith-Waterman score: 119; 44.444% identity in 36 aa overlap Entrez lookup Re-search database >S35580 334- 369: ----: 300 310 320 330 340 350 360 370 QUERY PNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKK :: .:.:.:: : :.:: :: :::... ..:.. S35580 YPESIDWRKKGAVTPVKNQGSXGSXWAFSTIVTVEGINKIR 10 20 30 40 380 390 400 410 420 430 440 450 QUERY NKNILSFSEQEVVDCSKDNFGCDGGHPFYSFLYVLQNELCLGDEYKYKAKDDMFCLNYRCKRKVSLSSIGAVKENQLILA S35580 TG --------------------------------------------------------------------------- >>KHMSB cathepsin B (EC 3.4.22.1) precursor - mouse (339 aa) initn: 243 init1: 95 opt: 131 Z-score: 142.5 expect() 0.42 Smith-Waterman score: 232; 23.636% identity in 275 aa overlap Entrez lookup Re-search database >KHMSB 333- 564: ----------------------------: 300 310 320 330 340 350 360 QUERY VPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYREK----GIVHEPKDQGLCGSCWAFASVGNI-- .:: .: ::. . . .::: :::::::..: : KHMSB TWQAGRNFYNVDISYLKKLCGTVLGGPKLPGRVAFGEDIDLPETFDAREQWSNCPTIGQIRDQGSCGSCWAFGAVEAISD 40 50 60 70 80 90 100 110 370 380 390 400 410 QUERY ESVFAKKNKNILSFSEQEVVDCS--KDNFGCDGGHPFYSFLYVLQNEL-----------CL------------------- .. . ... . : .... : . . ::.::.: .. . .. : :: KHMSB RTCIHTNGRVNVEVSAEDLLTCCGIQCGDGCNGGYPSGAWSFWTKKGLVSGGVYNSHVGCLPYTIPPCEHHVNGSRPPCT 120 130 140 150 160 170 180 190 420 430 440 450 460 470 480 QUERY --GDEYK-YKAKDDMFCLNYRCKRKVSLSSIGAVKENQLILA-LNEVGPLSVNVGVNNDFVAYSEGVYNGTCSEELN-HS :: . :. . . .:. .. . .: .. . . :.: . . ::. : .::..:. :::. .. .. :. KHMSB GEGDTPRCNKSCEAGYSPSYKEDKHFGYTSYSVSNSVKEIMAEIYKNGPVEGAFTVFSDFLTYKSGVYKHEAGDMMGGHA 200 210 220 230 240 250 260 270 490 500 510 520 530 540 550 560 QUERY VLLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPIL . ..:.: :: : . ::. :::. ::.:::... :..: ::: :. KHMSB IRILGWG-VE------------------------NGVPYWLAANSWNLDWGDNGFFKILRGEN----HCGIESEIVAGIP 280 290 300 310 320 330 KHMSB RTDQYWGRF --------------------------------------------------------------------------- >>A29172 cathepsin B-like cysteine proteinase (EC 3.4.22.-) - bovine (fragmen (73 aa) initn: 115 init1: 115 opt: 121 Z-score: 142.3 expect() 0.43 Smith-Waterman score: 121; 51.724% identity in 29 aa overlap Entrez lookup Re-search database >A29172 528- 554: --- : 490 500 510 520 530 540 550 560 QUERY HSVLLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRL--SRNKNGDNVFCGIGEEVF :::..:::.. :::.:.::. : :.:.. A29172 SEYNDQAFINHIVSVAGWGVSDGMEYWIVRNSWGEPWGEHGWMRIVTSTYKGGEGARYNLAIEES 10 20 30 40 50 60 QUERY YPIL A29172 CTFGDPIV 70 --------------------------------------------------------------------------- >>C64246 hypothetical protein MG419 - Mycoplasma genitalium (SGC3) (287 aa) initn: 46 init1: 46 opt: 129 Z-score: 141.6 expect() 0.47 Smith-Waterman score: 138; 26.744% identity in 258 aa overlap Entrez lookup Re-search database >C64246 136- 384: ------------------------------ : 100 110 120 130 140 150 160 170 QUERY ILNESSNEEDEEKYTLNSETYNNKNNVSNIKNDSIKSKKEEYINLERILLEKYKKFINENNEENRKELSNILHKLLEINK : .:.:. : :.::.. . ..:. .. .: : C64246 MIETLNFEKQHYAFLIKAIEENTNFGLSQLT-LIDRLKAI-- 10 20 30 180 190 200 210 220 230 240 250 QUERY LILREEKDDKKVYLINDNYDEKGA-LEIGMNEEMKYKKEDPINNIKYASKFFK-FMKEHNKVYKNIDEQMRKF-EIFKIN .: .: ..: :.. .::. :: . :. : :: . .. :. : : : :. . .:. :: . :: . :.: C64246 VISYNEFFNQKPLTISSPSNEKSLHLETEYLEKKKIKKSNHKQDQKHFSLFEKSFIDKSEKTPKNDEVTNNKFLDTSKLN 40 50 60 70 80 90 100 110 260 270 280 290 300 310 320 330 QUERY YISIKNHNKLNKNAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTN--GKRNEKDIF .: :. .: :.: .. ..:..:: : .. :... :.: : ...: : . .:: C64246 LANI---------AL---AINAFND---NKWINHFQNLLSV-------FQTKFNDKDKQNNL--SYFNNFIDKYSARDIV 120 130 140 150 160 170 340 350 360 370 380 390 400 QUERY SKVPEILDYREKGIVHEPKDQGLCGSCWAFA-SVGNIE-SVFAKKNKNIL--SFSEQEVVDCSKDNFGCDGGHPFYSFLY :. .:. ::: .:: . : : ::.. ..: :.:.. ::::.. C64246 -KATKIVKASSFGIVILFEDQKIAMRLWKEAIEEGNVQATIFQIFNQNLFLASFSEHQYKTTITEETKNQKYQTEVLNLT 180 190 200 210 220 230 240 250 410 420 430 440 450 460 470 480 QUERY VLQNELCLGDEYKYKAKDDMFCLNYRCKRKVSLSSIGAVKENQLILALNEVGPLSVNVGVNNDFVAYSEGVYNGTCSEEL C64246 QLENLAKPFLKEKKRSLSQKMVDKYFKGLFEEK 260 270 280 --------------------------------------------------------------------------- >>KHHUB cathepsin B (EC 3.4.22.1) precursor - human (339 aa) initn: 256 init1: 102 opt: 130 Z-score: 141.5 expect() 0.48 Smith-Waterman score: 250; 24.916% identity in 297 aa overlap Entrez lookup Re-search database >KHHUB 314- 564: ------------------------------: 280 290 300 310 320 330 340 QUERY QFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFS---KVPEILDYREK----GIVH : . : . : .. .:. :.: .: ::. .. KHHUB RSRPSFHPVSDELVNYVNKRNTTWQAGHNFYNVDMSYLKRLCGTFLGGPKPPQRVMFTEDLKLPASFDAREQWPQCPTIK 20 30 40 50 60 70 80 90 350 360 370 380 390 400 410 QUERY EPKDQGLCGSCWAFASVGNI-ESVFAKKNKNI-LSFSEQEVVDC--SKDNFGCDGGHP-----FYSFLYVLQNEL----- : .::: :::::::..: : . . . : .. . : .... : : . ::.::.: :.. .... : KHHUB EIRDQGSCGSCWAFGAVEAISDRICIHTNAHVSVEVSAEDLLTCCGSMCGDGCNGGYPAEAWNFWTRKGLVSGGLYESHV 100 110 120 130 140 150 160 170 420 430 440 450 460 QUERY --------------------CLG--DEYK-YKAKDDMFCLNYRCKRKVSLSSIGAVKENQLILA-LNEVGPLSVNVGVNN : : : : : . . .:. .. . .: .. . .. :.: . . ::. .: . KHHUB GCRPYSIPPCEHHVNGSRPPCTGEGDTPKCSKICEPGYSPTYKQDKHYGYNSYSVSNSEKDIMAEIYKNGPVEGAFSVYS 180 190 200 210 220 230 240 250 470 480 490 500 510 520 530 540 QUERY DFVAYSEGVYNGTCSEELN-HSVLLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRL ::. :. :::. . .: .. :.. ..:.: :: .. : ::.. :::. ::.:::... KHHUB DFLLYKSGVYQHVTGEMMGGHAIRILGWG-VE------------------NGTP------YWLVANSWNTDWGDNGFFKI 260 270 280 290 300 310 550 560 QUERY SRNKNGDNVFCGIGEEVFYPIL :... ::: :: KHHUB LRGQD----HCGIESEVVAGIPRTDQYWEKI 320 330 --------------------------------------------------------------------------- >>C64409 hypothetical protein MJ0875 - Methanococcus jannaschii (748 aa) initn: 66 init1: 40 opt: 134 Z-score: 140.4 expect() 0.54 Smith-Waterman score: 165; 26.423% identity in 246 aa overlap Entrez lookup Re-search database >C64409 109- 339: ----------------------------- : 70 80 90 100 110 120 130 140 QUERY KNEHSLKKEEIELLRVLLEKYKKQKDGILNESSNEEDEEKYTLNSETYNNKNNVSNIKNDSIKSKKEEYINLERILLEKY : :.. :..:..: . ::. ..:: . .:::: C64409 IMAKEYSFSNEFDKAREFYKKAEELFLELGIKKSAMYCFYYYLKTYIYEEKEKVEKKDNDKYLELLNKYIIEAEQFLEKY 70 80 90 100 110 120 130 140 150 160 170 180 190 200 210 QUERY KKFIN-----ENNEENRKELSNILHKLLE--INKLILREEK----DDKKVYLIND-NYDEKGALEIGMNEEMKYKKEDPI :.: . . . :.:: : :. .: ..: : :: ... .:: :: . .. . .. .: . C64409 KEFSDIWMYFDIKIYYYKKLS-IKHRKFEGNLDKAIELTEKCYKLAEESYNKFNDKNYKKAEIFNKHFYYNLMAQKFESE 150 160 170 180 190 200 210 220 220 230 240 250 260 270 280 290 QUERY NNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHNKLNKNAM--YKKKVNQFSDYSEEELKEYFKTLLHVP ..: :....: : . . :.:::.. .. . .: . .:: ::. . : .:. .::. ..:.::. :. C64409 RKFKEAAEYYK--KSGDTI-KEIDEKI-AYDEYANSYKWLAIENKYNKEKFEEYINKAIEFSEKRGDKLQEYYYLGLKY- 230 240 250 260 270 280 290 300 310 320 330 340 350 360 370 QUERY NHMIEKYSKPFENHLKDNILIS-EFYTNGKRNEKDIFSKVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKK .:.. .... .:... : : : :.: .: : :.: : :.: C64409 DHLV-RFANDLEEKI-DYIKKSKEYYYRSKSFE---FAKYMEYLEYYYQFKYELLNGNYEKALNFLQKAKKSLKNVKISN 300 310 320 330 340 350 360 370 380 390 400 410 420 430 440 450 QUERY NKNILSFSEQEVVDCSKDNFGCDGGHPFYSFLYVLQNELCLGDEYKYKAKDDMFCLNYRCKRKVSLSSIGAVKENQLILA C64409 IIFSKYTLECDELICRFYLSISQGEFKKSVELLDEYLEISLKILSDWKNTRKYKFYEYLKPCVEILSKESFTKDDLFLLE 380 390 400 410 420 430 440 450 --------------------------------------------------------------------------- >>A56677 neuronal cell cycle withdrawal protein QN1 - quail (fragment) (1251 aa) initn: 51 init1: 51 opt: 137 Z-score: 140.2 expect() 0.56 Smith-Waterman score: 139; 21.512% identity in 344 aa overlap Entrez lookup Re-search database >A56677 5- 328:---------------------------------------- : 10 20 30 40 QUERY MVAIKEMKELAFARPSLVETLNKKKKFLKKKEKRTFVLSIYAFI :.... : : . .. :... : .. :: . :. : A56677 DLLDKDAARLKEAREEIEKLKQEVKKLRAEAGDHQCVQQKKRLRDRA-ADAKRIQDLERQIKEMEGILKRRYPNSLPA-- 820 830 840 850 860 870 880 890 50 60 70 80 90 100 110 QUERY TFIIFCIGILYFTNKSSAHNNNNNKNEHSLKKEEIEL----------LRVLLEKYKKQKDGILNESSNEEDEEKYTLNSE .:. . :: ::..:... :. .:: : :: ::.. ....: : . :. : :. . A56677 --LIYAAAAAEKTNDLSAKTNTTDFLERRIKKLETELEGKDDEAKTSLRAMEQQFQKIK--MQYEQRLAELEQLLA---- 900 910 920 930 940 950 960 120 130 140 150 160 170 180 190 QUERY TYNNKNNVSNIKNDSIKSKKEEYINLERILLEKYKKFINENNEENRKELSNILHKLLEINKLILREEKDDKKVYLINDNY :. :.. ....: : . :.:: . :.. :: . . :. . :. :. . ..: :: .::.: . : . .. A56677 -YKWKSESPKLNGD-----KANCIELE-LQLQNLKKTHQITVENLQTEIENLKSQ---NSQLKLRSKKDNKDLQLADWQM 970 980 990 1000 1010 1020 1030 200 210 220 230 240 250 260 QUERY DEKGALE--IGMNEEMKYKKEDPINNIKYASKFFKF---MKEHNKVYKNIDEQMRKFEIFKINYISIKNHNKLNKNAMY- . .. : . .:.:. :... . : . :. : : :.. .. :.. . : .: : .. ...:. :.. . A56677 KQGNTKEKLLKLNQELITKNREIQDLTKTVEKLQKERMAMLSDNNLRNKTDNKENRQESLKNNTVATEKRNSCNSEPLIG 1040 1050 1060 1070 1080 1090 1100 1110 270 280 290 300 310 320 330 340 QUERY ---KKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPF-ENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYREKGI . :. : ..:. .. : .. .. .. .:: : . ....:.. .. .: .: ..: A56677 IFNNDKIYQPHNFSDSNVLEVLQENARLKEE-VEKLSLEMNQQRVKSQATLAYSENNIRRIQEDTAEYVAALKASHQREV 1120 1130 1140 1150 1160 1170 1180 1190 350 360 370 380 390 400 410 420 QUERY VHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKDNFGCDGGHPFYSFLYVLQNELCLGDEYKYKAKD A56677 EKILSQYTKDDSASKVAELNGRISTQEILIKHLQEQISEHQRHQEALLVSQMREEF 1200 1210 1220 1230 1240 1250 --------------------------------------------------------------------------- >>KHBOB cathepsin B (EC 3.4.22.1) precursor - bovine (335 aa) initn: 254 init1: 101 opt: 128 Z-score: 139.5 expect() 0.61 Smith-Waterman score: 252; 25.000% identity in 288 aa overlap Entrez lookup Re-search database >KHBOB 323- 564: -----------------------------: 290 300 310 320 330 340 350 QUERY LKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSK---VPEILDYREK----GIVHEPKDQGLCG : ..: :. .:: .: ::. ..: .::: :: KHBOB SDELVNFVNKQNTTWKAGHNFYNVDLSYVKKLCGAILGGPKLPQRDAFAADVVLPESFDAREQWPNCPTIKEIRDQGSCG 30 40 50 60 70 80 90 100 360 370 380 390 400 410 QUERY SCWAFASVGNI-ESVFAKKNKNI-LSFSEQEVVDCSKDNF--GCDGGHP-----FYSFLYVLQNEL-------------- :::::..: : . . ..: . . : .... : . ::.:: : :.. .... : KHBOB SCWAFGAVEAISDRICIHSNGRVNVEVSAEDMLTCCGGECGDGCNGGFPSGAWNFWTKKGLVSGGLYNSHVGCRPYSIPP 110 120 130 140 150 160 170 180 420 430 440 450 460 470 QUERY -----------CLG--DEYK-YKAKDDMFCLNYRCKRKVSLSSIGAVKENQLILA-LNEVGPLSVNVGVNNDFVAYSEGV : : : : :. . . .:. .. . :: ....... :.: . . ::. .: .::. :. :: KHBOB CEHHVNGSRPPCTGEGDTPKCSKTCEPGYSPSYKEDKHFGCSSYSVANNEKEIMAEIYKNGPVEGAFSVYSDFLLYKSGV 190 200 210 220 230 240 250 260 480 490 500 510 520 530 540 550 QUERY YNGTCSEELN-HSVLLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNV :. . .: .. :.. ..:.: :: .. : ::.. :::. ::.:::... :... KHBOB YQHVSGEIMGGHAIRILGWG-VE------------------NGTP------YWLVGNSWNTDWGDNGFFKILRGQD---- 270 280 290 300 310 560 QUERY FCGIGEEVFYPIL ::: :. KHBOB HCGIESEIVAGMPCTHQY 320 330 --------------------------------------------------------------------------- >>A38194 desmoplakin I - human (2677 aa) initn: 89 init1: 53 opt: 140 Z-score: 138.3 expect() 0.72 Smith-Waterman score: 150; 23.636% identity in 275 aa overlap Entrez lookup Re-search database >A38194 75- 329: ------------------------------- : 40 50 60 70 80 90 100 110 QUERY TFVLSIYAFITFIIFCIGILYFTNKSSAHNNNNNKNEHSLKKEEIELLRVLLEK---YKKQKDGILNESSNEEDEEKYTL :. ::: :::::.. :.. .. : . :. .:: .: A38194 SVEDRFDQQKNDYDQLQKARQCEKENLGWQKLESEKAIKEKEYEIERLRVLLQEEGTRKREYENELAKVRNHYNEEMSNL 940 950 960 970 980 990 1000 1010 120 130 140 150 160 170 180 190 QUERY NSETYNNKNNVSNIKNDSIKSKKEEYINLERILLEKYKKFINENNEENRKELSNILHKLLEINKLILR-EEKDDKKVYLI .. :... :... :. .::. . : :.. ....:.. . :. . ..:. .. : ::. .. A38194 RNK-YETEINITKTTIKEISMQKEDDSKNLRNQLDR----LSRENRDLKDEIVRLNDSILQATEQRRRAEENALQQKACG 1020 1030 1040 1050 1060 1070 1080 200 210 220 230 240 250 QUERY NDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNI-----DEQMRKFE----IFKI--NY----IS .. ...: ::: ... :. ..:: . . . : ....:: . . .: :..: . :. :: :: A38194 SEIMQKKQHLEIELKQVMQQRSEDNARHKQSLEEAAKTIQDKNKEIERLKAEFQEEAKRRWEYENELSKVRNNYDEEIIS 1090 1100 1110 1120 1130 1140 1150 1160 260 270 280 290 300 310 320 330 QUERY IKNHNKLNKNAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFE-NHLKDNILISEFYTNGKRNEKDIFSKVP .::. . . : . : ..:.. .::. . : .. : :. : : ..::... .. : .: :.:: A38194 LKNQFETEIN-ITKTTIHQLTMQKEEDTSGYRA---QIDNLTRENRSLSEEIKRLKNTL--TQTTENLRRVEEDIQQQKA 1170 1180 1190 1200 1210 1220 1230 1240 340 350 360 370 380 390 400 410 QUERY EILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKDNFGCDGGHPFYSFLYVLQNELCL A38194 TGSEVSQRKQQLEVELRQVTQMRTEESVRYKQSLDDAAKTIQDKNKEIERLKQLIDKETNDRKCLEDENARLQRVQYDLQ 1250 1260 1270 1280 1290 1300 1310 1320 --------------------------------------------------------------------------- >>A42771 reticulocyte-binding protein 1 - Plasmodium vivax (2829 aa) initn: 62 init1: 62 opt: 140 Z-score: 138.0 expect() 0.75 Smith-Waterman score: 170; 24.149% identity in 323 aa overlap Entrez lookup Re-search database >A42771 56- 355: ------------------------------------- : 20 30 40 50 60 70 80 90 QUERY SLVETLNKKKKFLKKKEKRTFVLSIYAFITFIIFCIGILYFTNKSSAHNNNNNKNEHSLKKEEIELLRVLLEKY-KKQKD :. :. .:. : .:. .:.:. : .::. ..:. A42771 KMAKKVHYLKELLSLKGKSSVYFTEMNELLNTASYDNMEGFSAKKEKADNDINALYNSVYREDINALIEEVEKFVTENKE 650 660 670 680 690 700 710 720 100 110 120 130 140 150 160 170 QUERY GILNESSNEEDEEKYTLNSETYNNKNNVSNIKNDSIKSK-KEEYINLERILLE-KYKKFINENNEENRKELSNILHKLLE . :. ..:: ::: .::. . : ::. : .. .: . : : : : : :.: :: ... ::.:. . .: A42771 STLEMLKDEEMEEKLQDAKETFAKLNFVSDDKLTDVYTKMSAEVTNAEGIKKEIAQKQF--ENVHKKMKEFSDAFSTKFE 730 740 750 760 770 780 790 800 180 190 200 210 220 230 QUERY I--NKLILREEKDD------------KKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNI-KYASKFFKFMKEHNKVYK :.. ... : .. :. :.. .: . : .::..: :. ::. . ... . . .: . A42771 ALQNSMQQYNQEGDAIEKHKQNRSEKEEEYFKNESVEEDLSRE--ETEEQEYTKHK--NNFSRRKGEISAEITNMREVIN 810 820 830 840 850 860 870 240 250 260 270 280 290 300 310 QUERY NIDEQMRKFEIFKINYISIKNHNKLNKNAMYKKKVNQFS-----DYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDN .:. :. . ... . : ..:... :.:. . : : : :.:: ... .. . :.. :: ... . : A42771 KIESQLNYYGVIEKYFSLIGDQNEVSTAKALKEKIVSDSLRDKIDQYETEFKEKTSAVENTVS-TIQSLSKAIDSLKRLN 880 890 900 910 920 930 940 950 320 330 340 350 360 370 380 390 QUERY ILISEFYTNGKRNEKDIFSKVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKDN :. : :. . :: .: ::. . :: :: A42771 GSIN----NCKKYNTDIDLLRSKIKTLREEVQKEMPKRGDKCGENTTALLLKSLRDKMGKINEKLNDGRLNSLDTKKEDL 960 970 980 990 1000 1010 1020 1030 400 410 420 430 440 450 460 470 QUERY FGCDGGHPFYSFLYVLQNELCLGDEYKYKAKDDMFCLNYRCKRKVSLSSIGAVKENQLILALNEVGPLSVNVGVNNDFVA A42771 LKFYSESKSKIHLSKDQKGPQDPLNRIDEWEDIKRDVDELNVNYQVISENKVTLFKNNSVTYIEAMHSHINTVAHGITSN 1040 1050 1060 1070 1080 1090 1100 1110 --------------------------------------------------------------------------- >>A38748 3-phosphatidylinositol kinase (EC 2.7.1.-) 85K chain - human (724 aa) initn: 71 init1: 71 opt: 131 Z-score: 137.6 expect() 0.78 Smith-Waterman score: 131; 23.729% identity in 177 aa overlap Entrez lookup Re-search database >A38748 69- 243: ---------------------- : 30 40 50 60 70 80 90 100 QUERY KKKEKRTFVLSIYAFITFIIFCIGILYFTNKSSAHNNNNNKNEHSLKKEEIELLRVLLEKYKKQKDGILNESSNEEDE-- .... .:...:: . :..:. : ..:.: : :. A38748 FSDPLTFSSVVELINHYRNESLAQYNPKLDVKLLYPVSKYQQDQVVKEDNIEAVGKKLHEYNTQ----FQEKSREYDRLY 400 410 420 430 440 450 460 110 120 130 140 150 160 170 180 QUERY EKYTLNSETYNNKNNVSNIKNDSIKSKKEEYINLERILLEKYKKFINENNEENRKELSNILHKLLEINKLILREEKDDKK :.:: .:. . : .. . :..:: .:. . :: : .:: :.:: ::.. :.:. .... : : :... A38748 EEYTRTSQEIQMKRTAIEAFNETIKIFEEQCQTQERYSKEYIEKFKREGNE---KEIQRIMHNYDKLKSRI-SEIIDSRR 470 480 490 500 510 520 530 540 190 200 210 220 230 240 250 260 QUERY VYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHNKLNKNA .... ...: ....:. : : :. : .... .. ... :...: . A38748 R--LEEDLKKQAAEYREIDKRMNSIKPDLIQLRKTRDQYLMWLTQKGVRQKKLNEWLGNENTEDQYSLVEDDEDLPHHDE 550 560 570 580 590 600 610 620 270 280 290 300 310 320 330 340 QUERY MYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYREKGIVH A38748 KTWNVGSSNRNKAENLLRGKRDGTFLVRESSKQGCYACSVVVDGEVKHCVINKTATGYGFAEPYNLYSSLKELVLHYQHT 630 640 650 660 670 680 690 700 --------------------------------------------------------------------------- >>S41720 intermediate filament - goldfish (472 aa) initn: 77 init1: 53 opt: 128 Z-score: 137.3 expect() 0.81 Smith-Waterman score: 128; 20.816% identity in 245 aa overlap Entrez lookup Re-search database >S41720 74- 311: ----------------------------- : 40 50 60 70 80 90 100 110 QUERY RTFVLSIYAFITFIIFCIGILYFTNKSSAHNNNNNKNEHSLKKEEIELLRVLLEKYKKQKDGILNESSN-EEDEEKYTLN : ..::. :: ::. . .:. .. : .: ::: .: S41720 NDRFAMFIDKVRNLEQHNKVLEAELVTLRQRQTEPSRLAELYQQEIRELRSQLEELNAEKNQMMFERDNIEEDLQKL--- 100 110 120 130 140 150 160 170 120 130 140 150 160 170 180 QUERY SETYNNKNNVSNIKNDSIKSKKEEYIN--LERILLEK-YKKFINENNEENRKELSNILHKLLEINKLILREEKDDKKVYL .: .... . . ....:. :.. : . :. ::: . ...: : . . .... . :. . : . : : S41720 QEKFEEEMRIREEAEQTLKAFKKDVDNATMVRLDLEKKVEALLDEINFIRKVHEEEVIELMNMIQAAQVSVEMEVAKPDL 180 190 200 210 220 230 240 250 190 200 210 220 230 240 250 260 QUERY INDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNK---VYKNIDEQMRKFEIFKINYISIKNHNKLNKNA . . .: : :.... .: : ::: . .. :: : . :.. .:. ... .:. .. . : S41720 TSALKEIRGQYEAMANKNLHSAEE------WYKSKFTDLSEQANKSNEVIRASREELNEFR-RQLQSKTIEIESLRGTNE 260 270 280 290 300 310 320 270 280 290 300 310 320 330 340 QUERY MYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYREKGIVH ....... : . :. : :. .. :.. :. . ::.. S41720 SLERQIHEMEDTHNAEVMGYQDTIGQLDNELRTTKSE-MARHLREYQDLLNVKMALDIEIAAYRKLLEGEETRISTGITY 330 340 350 360 370 380 390 400 350 360 370 380 390 400 410 420 QUERY EPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKDNFGCDGGHPFYSFLYVLQNELCLGDEYKYKAKDDM S41720 PTPTSGSSYSYQSRMYSSSSVSGKKEVKDDDDKHQQSSKPGKGSSQSDDYKKSDKIDSGDVNPTNQKN 410 420 430 440 450 460 470 --------------------------------------------------------------------------- >>S49394 HsdR1 protein - Mycoplasma pulmonis (SGC3) (986 aa) initn: 84 init1: 54 opt: 132 Z-score: 136.6 expect() 0.89 Smith-Waterman score: 149; 27.181% identity in 298 aa overlap Entrez lookup Re-search database >S49394 58- 333: ---------------------------------- : 20 30 40 50 60 70 80 90 QUERY VETLNKKKKFLKKKEKRTFVLSIYAFITFIIFCIGILYFTNKSSAHNNNNNKNEHSLKKEEIELLRVLLEKYKKQKDGIL : .: . .: ...: ::. : . : :. .. : S49394 KEYNSENNSIDIVIVVDMLLTGFDSPRTNTLYINKELKNHNLIQAFSRTNRLSDYS-KKRGIIVNFSLEEQSINDAFKIY 590 600 610 620 630 640 650 660 100 110 120 130 140 150 160 170 QUERY NESSNEEDE-----EKYTLNSETYNNKNNVSNIKNDSIKSKKEEYINLERILLEKYKKFINENNEENRKELSNILHKLLE .::..: . ::: : . : : .:. ..: ..:.. : .. : ::. ::.. .: ...:::. .: S49394 ANSSDKEIQQLVYGEKYEQVVEDFINFWNSLKISFSNIYDEKNNEI-FRNISLENKKKYL-----KNLSQVSNIFSSLKT 670 680 690 700 710 720 730 180 190 200 210 220 230 240 QUERY INKLILREEKDDKKVYLINDNYDEKGALEIGMN------EEMKYKKEDPIN--NIKYASK--FFKFMKEHNKVYKNIDEQ ... :. .: .. .:. .: : :: : :...:. . :. :::.: : .. . .: .. : .. S49394 FKEYGKNEKISDFSLEQLNQY--QKWANEIKKNLSTNEKEKISYEVLNSIDISNIKFAYKEMIIDEIYLENLLFFN--KK 740 750 760 770 780 790 800 810 250 260 270 280 290 300 310 QUERY MRKFEIFKINY---IS-IKNHNKLNKNAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHL--KDNILI- . :. ...: .: : .: .: :: . . :.:: ::: :: ...:... ..: . ::. : S49394 ISKYPNNRLTYEDTLSEIDKHIQLIKNNYNQGKINQ---------KEYEIFLL-----LVQKWKNEIKNFFIKKDKSLDE 820 830 840 850 860 870 320 330 340 350 360 370 380 390 QUERY SEFYTNGKRNEKDIFSKVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKDNFGC .:: ::: :..:.:: S49394 KEFIDYGKRILKSVFQKVKNQIEAMWLEKILKEYHGINNDQIRKDWKKRINDKDLDDIEKSEFIKKWSRRSKEVDKDIID 880 890 900 910 920 930 940 950 --------------------------------------------------------------------------- >>S35578 cysteine proteinase II - mountain papaya (fragment) (43 aa) initn: 66 init1: 66 opt: 112 Z-score: 136.5 expect() 0.9 Smith-Waterman score: 112; 44.444% identity in 36 aa overlap Entrez lookup Re-search database >S35578 334- 369: ----: 300 310 320 330 340 350 360 370 QUERY PNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKK : .:.:.:: : :::. :: :::..:...:.. S35578 YPGSVDWRQKGAVTPVKDQNPXGSXWAFSTVATVEGINKIV 10 20 30 40 380 390 400 410 420 430 440 450 QUERY NKNILSFSEQEVVDCSKDNFGCDGGHPFYSFLYVLQNELCLGDEYKYKAKDDMFCLNYRCKRKVSLSSIGAVKENQLILA S35578 TG --------------------------------------------------------------------------- >>S42488 heat shock protein 70 - Pyrenomonas salina nucleomorph (649 aa) initn: 78 init1: 78 opt: 129 Z-score: 136.3 expect() 0.93 Smith-Waterman score: 129; 28.144% identity in 167 aa overlap Entrez lookup Re-search database >S42488 44- 199: ------------------- : 10 20 30 40 50 60 70 QUERY IKEMKELAFARPSLVETLNKKKKFLKKKEKRTFVLSIYAFITFIIFCIGILYFT--NKSSAHNNN----NNKNEHSLKKE .:: : ::: . .::....:. :.:.. :.:: S42488 PGVLIQVFEGERSRTKDNNILGKFELTGIPPAPRGVPQIEVTFDIDANGILNVSASDKSTGKSNKITITNDKGR--LSKE 450 460 470 480 490 500 510 80 90 100 110 120 130 140 150 QUERY EIELLRVLLEKYKKQKDGILNESSNEEDEEKYTLNSETYNNKNNVSNIK-NDSIKSKKEEYIN--LERIL--LEKYKKFI ::: . :::: . .:. ... : : .:.. .:: .:.: . : ...:. . .. :. ....: .: . . S42488 EIERMVEEAEKYKTE-----DEKLDKKLEAKNSLENYAYNIRNTVRDEKLKEKIQEEDKKSIEEKVKEVLEFIETNEDLE 520 530 540 550 560 570 580 590 160 170 180 190 200 210 220 230 QUERY NENNEENRKELSNILHKLLEINKLILREEKDDKKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEH .:. ::..:::.:. . . :.:: . : . ..: .. .: S42488 KEEYEEKEKELKNFANPI--ISKLYQQGSVPDMGNFSTGQNEQDDNANMGPKIEEVG 600 610 620 630 640 240 250 260 270 280 290 300 310 QUERY NKVYKNIDEQMRKFEIFKINYISIKNHNKLNKNAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDN --------------------------------------------------------------------------- >>S28104 probable DNA-directed RNA polymerase (EC 2.7.7.6) - gill mushroom (A (1102 aa) initn: 48 init1: 48 opt: 132 Z-score: 135.9 expect() 0.98 Smith-Waterman score: 134; 25.769% identity in 260 aa overlap Entrez lookup Re-search database >S28104 53- 291: ------------------------------ : 20 30 40 50 60 70 80 90 QUERY ARPSLVETLNKKKKFLKKKEKRTFVLSIYAFITFIIFCIGILYFTNKSSAHNNNNNKNEHSLKKEEIELLRVLLEKYKKQ .:: . : ..: . . : : : . :. .:: . . S28104 VEKLLNLICNVLDRNVNELNDSVFLIIKDIEKECKYYVSNVLYRSVGS--RKNRGREVEWSSYKYNKEFNKVLDKGIISI 60 70 80 90 100 110 120 130 100 110 120 130 140 150 QUERY KDGILNESSNEEDE--EKYTLNSETYNNK-----NNVS----NIKNDSIKSKKEEYI-NLERILLEKYKKFINENNEEN- .. .:. :.:.. :. . : .:: ::.. .::: : .:: . ..:.: : ..:::. :.. : S28104 NNEVLKFISKEREGYIERVESIAVTVKNKILELNNNIAEVLLSIKNKVIVLNKESVVAKVEEINYEVHNKFIKGNGNTNF 140 150 160 170 180 190 200 210 160 170 180 190 200 210 220 230 QUERY -RKELSNILHKLLEINKLILREEKDDKKVYLINDNYDE-KGALEIGMN-EEMKYKKEDPINNIKYASKFFKF-MKEH--- ..:..: : :.::. . ... .: .: : :. .. :. . : . ..: ::. : : S28104 SNRNLTEIKSILKELNKMEILDNRINKLSTKESDLLKVIKEILDSNLIIEDKQLAIEKTV--VEYELTFFRHNMDTHETR 220 230 240 250 260 270 280 240 250 260 270 280 290 300 310 QUERY NKVYKNIDEQMRKFEIFKINYISIKNHNKLNKNAMYKKKVNQFSDYSE-EELKEYFKTLLHVPNHMIEKYSKPFENHLKD ::. .:: .. : : . . :::. . ::... .:. :: . ::..: .. S28104 NKIIHNIYPKLNK------AYTELLANYKLNRYSKIKKSIHLISNKSEGTKSKEMIKLIVVLVILYIGIDKCISYSFYQI 290 300 310 320 330 340 350 360 320 330 340 350 360 370 380 390 QUERY NILISEFYTNGKRNEKDIFSKVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKD S28104 INLLTNARDGTSRTNIAINLGFRIIKVLKYIKLDENPSLNALYPINKLKDEISKLDNEGIYWIGDTLLGLITANCDIVVE 370 380 390 400 410 420 430 440 --------------------------------------------------------------------------- >>S64493 hypothetical protein YGR179c - yeast (Saccharomyces cerevisiae) (406 aa) initn: 107 init1: 53 opt: 125 Z-score: 135.2 expect() 1.1 Smith-Waterman score: 130; 24.895% identity in 237 aa overlap Entrez lookup Re-search database >S64493 63- 296: ---------------------------- : 30 40 50 60 70 80 90 100 QUERY KKKKFLKKKEKRTFVLSIYAFITFIIFCIGILYFTNKSSAHNNNNNKNEHSLKKEEIELLRVLLEKYKKQ-KDGILNESS :..::. .. :::. ::...: : ::. ... . S64493 HWKKPSKIMIGSILRLLETNTVSALDSVFEKYEKEMNQMTHGDNNEVKRIYSKKER--LLEIILTKIKKKLRQAKFPSRI 150 160 170 180 190 200 210 220 110 120 130 140 150 160 170 180 QUERY NEEDEE-KYTLNSETYNNKNNVSNIKNDSIKSKKEEYINLERILLEKYKKFINENNEENRKELSNILHKLLEINKLILRE .:.: . .: ... . .. ....:. . : .. :. :::. .:. . . .:.:.:.. ::: S64493 SERDLDIEYIYSKRQFIQNRYSQELQNNE---RLEAILSREQNLLEETRKLCMNLKTNNKKRLTE---------KLI--- 230 240 250 260 270 280 290 190 200 210 220 230 240 250 260 QUERY EKDDKKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHN .:: . : .: .. .:: . : :. . :.. .. :. ...: .. . .. : .. : :.... :.:... S64493 QKDLHPV--LNKAMEYTYGLE-STNGFMH--PDGPVT-FRNDSHELNLML-NDPIKSTADVRLDKEEVLSL-LPSLKEYT 300 310 320 330 340 350 360 270 280 290 300 310 320 330 QUERY KLNKNAMYKKKVNQF-SDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDY : .:. :. ..:. :: :::.:: : ::.: S64493 KKSKE--LKETMGQMISDSHEEEIKEVF-----VPHHESHQDKTEEDIH 370 380 390 400 --------------------------------------------------------------------------- >>S57624 cysteine proteinase LmCPb19 - Leishmania mexicana (fragment) (136 aa) initn: 147 init1: 91 opt: 118 Z-score: 135.1 expect() 1.1 Smith-Waterman score: 163; 29.921% identity in 127 aa overlap Entrez lookup Re-search database >S57624 442- 568: ---------------: 410 420 430 440 450 460 470 480 QUERY YSFLYVLQNELCLGDEYKYKAKDDMFCLNYRCKRKVSLSSIGAVKENQLILALNEVGPLSVNVGVNNDFVAYSEGVYNGT ::. .:. . : . ::... . ... :..:. :: .. S57624 ELVVGAQIDGHVLIGS-SEKAMAAWLAKNGPIAIALDASS-FMSYKSGVLTAC 10 20 30 40 50 490 500 510 520 530 540 550 560 QUERY CSEELNHSVLLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIG ...:::.:::::: . : : . ::.:::::. :::.:..:. . :. : .. S57624 IGKQLNHGVLLVGYDM----------------TGE---------VPYWVIKNSWGGDWGEQGYVRVVMGVNA----CLLS 60 70 80 90 100 QUERY EEVFYPIL : ::. S57624 E---YPVSAHVRESAAPGTSTSSETPAHNSVMVEQVY 110 120 130 --------------------------------------------------------------------------- >>S64439 hypothetical protein YGR130c - yeast (Saccharomyces cerevisiae) (816 aa) initn: 58 init1: 58 opt: 129 Z-score: 134.8 expect() 1.1 Smith-Waterman score: 143; 25.097% identity in 259 aa overlap Entrez lookup Re-search database >S64439 104- 349: ------------------------------ : 70 80 90 100 110 120 130 QUERY NNNNNKNEHSLKKEEIELLRVLLEKYKKQKDGILNESSNEEDEEKYTLNSETYNNK-----NNVSNIKNDSIKSKKEEYI ....:: ... :..: :....:.: :.:. .:: S64439 PENPELIVKTKEHGYLSKAVYDKINYDEKIHQAWLADLRAKEKDKYDAKNKEYKEKLQDLQNQIDEIEN-SMKAMREE-- 410 420 430 440 450 460 470 140 150 160 170 180 190 200 210 QUERY NLERILLEK---YKKFINENNEENRKELSNILHKLLEINKLILREEKDDKKVYLINDNYDEKGALEIGMNEEMKYKKE-D . :.: . : ::.:. : :.: :.: .::.. .. : :.. .::. . .... . :.: : S64439 TSEKIEVSKNRLVKKIIDVNAEHNNKKL------------MILKDTENMK-----NQKLQEKNEV---LDKQTNVKSEID 480 490 500 510 520 530 220 230 240 250 260 270 280 290 QUERY PINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHNKLNKNAMYKKKVNQFSDYSEEELKEYFKTLLHVP .:: : . : ... . .:...:. .::::: :..:. . :. .:: ... .::. : . :.. .: S64439 DLNNEKTNVQ--KEFNDWTTNLSNLSQQLDA-QIFKINQINLKQGKVQNEIDNLEKKKEDLVTQTEENKKLHEKNV-QVL 540 550 560 570 580 590 600 610 300 310 320 330 340 350 360 370 QUERY NHMIEKYSKPFENHLKDNI--LISE--FYTNGKRNEKDIFSKVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVF . . .: : : . ..: :..: . . . ::: .: . . :. .... :: . S64439 ESVENKEYLPQINDIDNQISSLLNEVTIIKQENANEKTQLSAITKRLEDERRA--HEEQLKLEAEERKRKEENLLEKQRQ 620 630 640 650 660 670 680 690 380 390 400 410 420 430 440 450 QUERY AKKNKNILSFSEQEVVDCSKDNFGCDGGHPFYSFLYVLQNELCLGDEYKYKAKDDMFCLNYRCKRKVSLSSIGAVKENQL S64439 ELEEQAHQAQLDHEQQITQVKQTYNDQLTELQDKLATEEKELEAVKRERTRLQAEKAIEEQTRQKNADEALKQEILSRQH 700 710 720 730 740 750 760 770 --------------------------------------------------------------------------- >>A64224 hypothetical protein MG218 - Mycoplasma genitalium (SGC3) (1805 aa) initn: 41 init1: 41 opt: 134 Z-score: 134.7 expect() 1.1 Smith-Waterman score: 150; 22.759% identity in 290 aa overlap Entrez lookup Re-search database >A64224 75- 338: --------------------------------- : 40 50 60 70 80 90 100 110 QUERY TFVLSIYAFITFIIFCIGILYFTNKSSAHNNNNNKNEHSLKKEEIELLRVLLEKYK--KQKDGILNESSNEEDEEKYTLN ::. .. :.. .: . :... :. ::: .... .:. A64224 KYTNLLDLKENLERTKDQLDKKHRSIFARLTKFANDLRFEKKQLLKAQRIVDDKNRLLKENERNLHFLSNETERKRAVLE 1260 1270 1280 1290 1300 1310 1320 1330 120 130 140 150 160 170 180 QUERY SE-TY--NNKNNVSNIKNDSIKSKKEEYINLERILLEKYKKFINENNE-----ENRKELSNILHKLLEINKLILREEKDD .. .: ........ : : :.. .:...:.: . . ::. ..:.:. : ::::..: . . ... A64224 DQISYFEKQRKQATDAILASHKEVKKKEGELQKLLVELETRKTKLNNDFAKFSRQREEFENQRLKLLELQKTLQTQTNSN 1340 1350 1360 1370 1380 1390 1400 1410 190 200 210 220 230 240 250 QUERY ----KKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEH-----NKVYKNIDEQMRKFEIF--KINY : . :...: . :: ::....:.. .: . ..:. :... ..: . : .:: ... . : A64224 NFKTKAIQEIENSYKR------GM-EELNFQKKEFDKNKSRLYEYFRKMRDEIERKESQVKLVLKETQRKANLLEAQANK 1420 1430 1440 1450 1460 1470 1480 260 270 280 290 300 310 320 QUERY ISI-KNHNKLNKNAM--YKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHL--KDNILISEFYTNGKRNEKD ..: :: .... . .: ::.: : .... :: .. ::. :..... : . ::..: ... : ::. .: A64224 LNIEKNTIDFKEKELKAFKDKVDQDIDSTNKQRKE-LNELLN-ENKLLQQSLIERERAINSKDSLLNKKIETI-KRQLHD 1490 1500 1510 1520 1530 1540 1550 1560 330 340 350 360 370 380 390 400 QUERY IFSKVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKDNFGCDGGHPFYSFLYVL .: ...: A64224 KEMRVLRLVDRMKLAEQKYQTEINRLRTQTFDSEKQDIKNFFPPLFKINGNDMAFPYLYPWLYPQQKQDDNTLQIRQLFE 1570 1580 1590 1600 1610 1620 1630 1640 --------------------------------------------------------------------------- >>S38939 probable cathepsin B-like cysteine proteinase (EC 3.4.22.-) 29K, pre (344 aa) initn: 260 init1: 101 opt: 123 Z-score: 134.2 expect() 1.2 Smith-Waterman score: 255; 26.007% identity in 273 aa overlap Entrez lookup Re-search database >S38939 331- 560: ----------------------------: 300 310 320 330 340 350 360 QUERY LHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYREK----GIVHEPKDQGLCGSCWAFASVGNI : ::: .: :. . : .::: :::::::..: . S38939 YDKSVPRSHFRRLMGVHPDAHKFTLHEKSLVLGEEVGLADSDVPEEFDARKAWPNCPTIGEIRDQGSCGSCWAFGAVEAM 50 60 70 80 90 100 110 120 370 380 390 400 410 420 430 QUERY -ESVFAKKNKNI-LSFSEQEVVDCSKD-NFGCDGGHPFYSFLYVLQNELCLGDEY-------KYKAKDDMFCLN------ . . ..: .: . :: ...:.: . .:::.:: : .. : .. . : : :. .: S38939 SDRLCIHSNATIHFHFSADDLVSCCHTCGFGCNGGFPGAAWAYWTRKGIVSGGPYGSSQGCRPYEIAPCEHHVNGTRPPC 130 140 150 160 170 180 190 200 440 450 460 470 480 QUERY -----------YRCKRKVSL---------SSIGAVKEN--QLILALNEVGPLSVNVGVNNDFVAYSEGVYNGTCSEELN- ..:... .. :. .::.: .. . . ::. : .:.. :..:::. . ..::. S38939 DGEHGKTPSCRHECQKSYDVDYKTDKHFGSKSYSVKRNVKDIQKEIMQNGPVEGAFTVYEDLILYKDGVYQHVHGRELGG 210 220 230 240 250 260 270 280 490 500 510 520 530 540 550 560 QUERY HSVLLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYP :.. ..:.: .:: : ::.: :::. ::.:::... :... ::: S38939 HAIRILGWGVENKT-------------------P------YWLIANSWNTDWGNNGFFKMLRGED----HCGIESAIAAG 290 300 310 320 330 340 QUERY IL S38939 LPKV --------------------------------------------------------------------------- >>S41426 cysteine proteinase (EC 3.4.22.-) CP4 precursor - Trichomonas vagina (100 aa) initn: 96 init1: 96 opt: 114 Z-score: 133.1 expect() 1.4 Smith-Waterman score: 114; 66.667% identity in 21 aa overlap Entrez lookup Re-search database >S41426 338- 358: ---: 300 310 320 330 340 350 360 370 QUERY IEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNI :.:.:: :. :::: ::::: S41426 VALNKLAHLTPAEYNSLLGFRMNKAERKAVKSNAIANADCDWRKKGAVNPIKDQGQCGSCW 40 50 60 70 80 90 100 380 390 400 410 420 430 440 450 QUERY LSFSEQEVVDCSKDNFGCDGGHPFYSFLYVLQNELCLGDEYKYKAKDDMFCLNYRCKRKVSLSSIGAVKENQLILALNEV --------------------------------------------------------------------------- >>A41404 cathepsin L (EC 3.4.22.15) - cat (fragment) (139 aa) initn: 195 init1: 87 opt: 116 Z-score: 133.0 expect() 1.4 Smith-Waterman score: 237; 33.333% identity in 162 aa overlap Entrez lookup Re-search database >A41404 397- 551: -------------------: 360 370 380 390 400 410 420 430 QUERY CWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKDNFGCDGGHPFYSFLYVLQNELCLGDE-YKYKAKDDMFCLNYRCKR :: .: :: .: ..: : :.:. : : .:: . A41404 GGLIDDAFQYVKDNGGLDSEESYPYHAQGDS-C-KYRPEN 10 20 30 440 450 460 470 480 490 500 QUERY KVS-LSSIGAV--KENQLILALNEVGPLSVNVGVNND-FVAYSEGVY-NGTCS-EELNHSVLLVGYGQVEKTKLNYNNKI .:. ... . :::.:...: :::.:. . .. : : :.::.: . .:: :...:.::.:::: . A41404 SVANVTDYWDIPSKENELMITLAAVGPISAAIDASLDTFRFYKEGIYYDPSCSSEDVDHGVLVVGYGA---------DGT 40 50 60 70 80 90 100 510 520 530 540 550 560 QUERY QTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPIL .: : : ::::::::. :: .:........ A41404 ETENKK------------YWIIKNSWGTDWGMDGYIKMAKDR 110 120 130 --------------------------------------------------------------------------- >>A28121 major merozoite surface antigen - Plasmodium yoelii (fragment) (680 aa) initn: 61 init1: 61 opt: 126 Z-score: 132.9 expect() 1.4 Smith-Waterman score: 133; 25.751% identity in 233 aa overlap Entrez lookup Re-search database >A28121 56- 267: -------------------------- : 20 30 40 50 60 70 80 QUERY SLVETLNKKKKFLKKKEKRTFVLSIYAFITFIIFCIGILYFTNKS-SAHNNNNNKNEHSLKKEEIELLRVLLEKY----- :::.: .... ... .::::. .. . .. : . A28121 GSGTDTRVAGSSVDDNEDDDIYQIASGQSEDAPEKDILSEFTNESLYVYTKRLGSTYKSLKKHMLREFSTIKEDMTNGLN 280 290 300 310 320 330 340 350 90 100 110 120 130 140 150 QUERY -KKQK-DGILNESSNEEDEEKYTLNSETYNNKNNVSNIKNDSIKSKKEEYINLE---RILLEKYK------KFINENNEE :.:: . .:. :.: : : :... : .: . . ::. : .. .::. . . : . ::.:. : A28121 NKSQKRNDFLEVLSHELDLFK-DLSTNKYVIRNPYQLLDNDK---KDKQIVNLKYATKGINEDIETTTDGIKFFNKMVEV 360 370 380 390 400 410 420 160 170 180 190 200 210 220 230 QUERY NRKELSNILHKLLEINKLILREEKDDKKVYLINDNYDEKGALE--IGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVY .:. . ... :. .:..:: : : : :: : ::. ::.. . .. ..: : . :... .. . : A28121 YNTQLAAVKEQIATIEAETNDTNKEEKKKY-IPILEDLKGLYETVIGQAEEYSEELQNRLDNYKNEKAEFEILTKNLEKY 430 440 450 460 470 480 490 500 240 250 260 270 280 290 300 310 QUERY KNIDEQMRKF-EIFKIN-YISIKNHNKLNKNAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNIL .:::.. .: : . : .:. :.:::... A28121 IQIDEKLDEFVEHAENNKHIASIALNNLNKSGLVGEGESKKILAKMLNMDGMDLLGVDPKHVCVDTRDIPKNAGCFRDDN 510 520 530 540 550 560 570 580 --------------------------------------------------------------------------- >>A38747 phosphatidlyinositol 3-kinase (EC 2.7.1.-) 85K chain - mouse (724 aa) initn: 66 init1: 66 opt: 126 Z-score: 132.5 expect() 1.5 Smith-Waterman score: 126; 23.729% identity in 177 aa overlap Entrez lookup Re-search database >A38747 69- 243: ---------------------- : 30 40 50 60 70 80 90 100 QUERY KKKEKRTFVLSIYAFITFIIFCIGILYFTNKSSAHNNNNNKNEHSLKKEEIELLRVLLEKYKKQKDGILNESSNEEDE-- .... .:...:: . :..:. : ..:.: : :. A38747 FSDPLTFNSVVELINHYRNESLAQYNPKLDVKLLYPVSKYQQDQVVKEDNIEAVGKKLHEYNTQ----FQEKSREYDRLY 400 410 420 430 440 450 460 110 120 130 140 150 160 170 180 QUERY EKYTLNSETYNNKNNVSNIKNDSIKSKKEEYINLERILLEKYKKFINENNEENRKELSNILHKLLEINKLILREEKDDKK :.:: .:. . : .. . :..:: .:. . :: : :: :.:: ::.. :.:. .... : : :... A38747 EEYTRTSQEIQMKRTAIEAFNETIKIFEEQCQTQERYSKEYIGKFKREGNE---KEIQRIMHNHDKLKSRI-SEIIDSRR 470 480 490 500 510 520 530 540 190 200 210 220 230 240 250 260 QUERY VYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHNKLNKNA .... ...: ....:. : : :. : .... .. ... :...: . A38747 R--LEEDLKKQAAEYREIDKRMNSIKPDLIQLRKTRDQYLMWLTQKGVRQKKLNEWLGNENTEDQYSLVEDDEDLPHHDE 550 560 570 580 590 600 610 620 270 280 290 300 310 320 330 340 QUERY MYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYREKGIVH A38747 KTWNVGSSNRNKAENLLRGKRDGTFLVRESSKQGCYACSVVVDGEVKHCVINKTATGYGFAEPYNLYSSLKELVIHYQHT 630 640 650 660 670 680 690 700 --------------------------------------------------------------------------- >>SAZQK1 major merozoite surface antigen precursor - Plasmodium falciparum (s (1631 aa) initn: 107 init1: 55 opt: 131 Z-score: 132.3 expect() 1.5 Smith-Waterman score: 131; 22.062% identity in 485 aa overlap Entrez lookup Re-search database >SAZQK1 65- 519: -------------------------------------------------------- : 30 40 50 60 70 80 90 100 QUERY KKFLKKKEKRTFVLSIYAFITFIIFCIGILYFTNKSSAHNNNNNKNEHSLKKEE--IELLRVLLEKYKKQKDGILNESSN .: .: : .::.. :: . :.:. :: : : ... SAZQK1 NNVCANDYCQIPFNLKIRANELDVLKKLVFGYRKPLDNIKDNVGKMEDYIKKNKKTIENINELIEESKKTIDKNKNATKE 190 200 210 220 230 240 250 260 110 120 130 140 150 160 170 QUERY EEDEEKYTLNSET--YNNK----NNVSNIKNDSIKS-KKEEYINLERILLEKYKKFINENNEENRKELSNILHKLLEINK :: .. : . . ::.. .:. .. . : . ::.: : . ::.: ::: .. . .: . ::. :: SAZQK1 EEKKKLYQAQYDLFIYNKQLEEAHNLISVLEKRIDTLKKNENI---KELLDK----INEIKNPPPANSGNTPNTLLDKNK 270 280 290 300 310 320 330 180 190 200 210 220 230 240 QUERY LILREEKDDKKVY-LINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEH-NKV-----YKNIDEQMRKFEI : ..::. :.. :. : : . . .. .. :... . : .: .:. : : :..:.. . ... SAZQK1 KIEEHEKEIKEIAKTIKFNIDSLFTDPLELEYYLREKNKNIDISAKVETKESTEPNEYPNGVTYPLSYNDINNALNELNS 340 350 360 370 380 390 400 410 250 260 270 280 290 300 310 320 QUERY FKINYISIKNHNKL-NKNAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKD-NILISEFYTNGKRNE : . :. ...: .:: . .. ..: . .:..: : .. .. : :: ... :. . :..:.: . :. SAZQK1 FG-DLINPFDYTKEPSKNIYTDNERKKFINEIKEKIK-IEKKKIESDKKSYEDRSKSLNDITKEYEKLLNEIYDSKFNNN 420 430 440 450 460 470 480 490 330 340 350 360 370 380 390 400 QUERY KDI--FSKV-PEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKDNFGCDGGHPFY- :. : :. . .:. . ..:. .::: : .:.: . :. . . . : : :. . .: SAZQK1 IDLTNFEKMMGKRYSYKVEKLTHHN----------TFASYEN-----SKHNLEKLTKALKYMEDYSLRNIVVEKELKYYK 500 510 520 530 540 550 410 420 430 440 450 460 470 QUERY SFLYVLQNELCLGDEYKYKAKDDMFCL------NYRCKRKVSLSSIGAVKENQLILALNEVGPLS-VNVGVNNDFVAYSE ... ..::. : : ....: : .. . .:.: :. : .: .:.. :. ... ..: . .. SAZQK1 NLISKIENEIETLVENIKKDEEQLFEKKITKDENKPDEKILEVSDIVKVQV-QKVLLMNKIDELKKTQLILKNVELKHNI 560 570 580 590 600 610 620 630 480 490 500 510 520 530 540 550 QUERY GVYNGTCSEELNHSV-LLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGD : :. .:. .. :.: ...: :. . :... ..:..: SAZQK1 HVPNSYKQENKQEPYYLIVLKKEIDKLKV-FMPKVESLINEEKKNIKTEGQSDNSEPSTEGEITGQATTKPGQQAGSALE 640 650 660 670 680 690 700 710 560 QUERY NVFCGIGEEVFYPIL SAZQK1 GDSVQAQAQEQKQAQPPVPVPVPEAKAQVPTPPAPVNNKTENVSKLDYLEKLYEFLNTSYICHKYILVSHSTMNEKILKQ 720 730 740 750 760 770 780 790 --------------------------------------------------------------------------- >>S05603 major merozoite surface antigen precursor - Plasmodium falciparum (s (1639 aa) initn: 128 init1: 51 opt: 131 Z-score: 132.3 expect() 1.5 Smith-Waterman score: 131; 22.062% identity in 485 aa overlap Entrez lookup Re-search database >S05603 65- 519: -------------------------------------------------------- : 30 40 50 60 70 80 90 100 QUERY KKFLKKKEKRTFVLSIYAFITFIIFCIGILYFTNKSSAHNNNNNKNEHSLKKEE--IELLRVLLEKYKKQKDGILNESSN .: .: : .::.. :: . :.:. :: : : ... S05603 NDVCANDYCQIPFNLKIRANELDVLKKLVFGYRKPLDNIKDNVGKMEDYIKKNKKTIENINELIEESKKTIDKNKNATKE 200 210 220 230 240 250 260 270 110 120 130 140 150 160 170 QUERY EEDEEKYTLNSET--YNNK----NNVSNIKNDSIKS-KKEEYINLERILLEKYKKFINENNEENRKELSNILHKLLEINK :: .. : . . ::.. .:. .. . : . ::.: : . ::.: ::: .. . .: . ::. :: S05603 EEKKKLYQAQYDLSIYNKQLEEAHNLISVLEKRIDTLKKNENI---KELLDK----INEIKNPPPANSGNTPNTLLDKNK 280 290 300 310 320 330 340 180 190 200 210 220 230 240 QUERY LILREEKDDKKVY-LINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEH-NKV-----YKNIDEQMRKFEI : ..::. :.. :. : : . . .. .. :... . : .: .:. : : :..:.. . ... S05603 KIEEHEKEIKEIAKTIKFNIDSLFTDPLELEYYLREKNKNIDISAKVETKESTEPNEYPNGVTYPLSYNDINNALNELNS 350 360 370 380 390 400 410 420 250 260 270 280 290 300 310 320 QUERY FKINYISIKNHNKL-NKNAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKD-NILISEFYTNGKRNE : . :. ...: .:: . .. ..: . .:..: : .. .. : :: ... :. . :..:.: . :. S05603 FG-DLINPFDYTKEPSKNIYTDNERKKFINEIKEKIK-IEKKKIESDKKSYEDRSKSLNDITKEYEKLLNEIYDSKFNNN 430 440 450 460 470 480 490 500 330 340 350 360 370 380 390 400 QUERY KDI--FSKV-PEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKDNFGCDGGHPFY- :. : :. . .:. . ..:. .::: : .:.: . :. . . . : : :. . .: S05603 IDLTNFEKMMGKRYSYKVEKLTHHN----------TFASYEN-----SKHNLEKLTKALKYMEDYSLRNIVVEKELKYYK 510 520 530 540 550 560 410 420 430 440 450 460 470 QUERY SFLYVLQNELCLGDEYKYKAKDDMFCL------NYRCKRKVSLSSIGAVKENQLILALNEVGPLS-VNVGVNNDFVAYSE ... ..::. : : ....: : .. . .:.: :. : .: .:.. :. ... ..: . .. S05603 NLISKIENEIETLVENIKKDEEQLFEKKITKDENKPDEKILEVSDIVKVQV-QKVLLMNKIDELKKTQLILKNVELKHNI 570 580 590 600 610 620 630 640 480 490 500 510 520 530 540 550 QUERY GVYNGTCSEELNHSV-LLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGD : :. .:. .. :.: ...: :. . :... ..:..: S05603 HVPNSYKQENKQEPYYLIVLKKEIDKLKV-FMPKVESLINEEKKNIKTEGQSDNSEPSTEGEITGQATTKPGQQAGSALE 650 660 670 680 690 700 710 720 560 QUERY NVFCGIGEEVFYPIL S05603 GDSVQAQAQEQKQAQPPVPVPVPEAKAQVPTPPAPVNNKTENVSKLDYLEKLYEFLNTSYICHKYILVSHSTMNEKILKQ 730 740 750 760 770 780 790 800 --------------------------------------------------------------------------- >>C48435 cysteine proteinase AC-4 - nematode (Haemonchus contortus) (342 aa) initn: 276 init1: 98 opt: 121 Z-score: 132.2 expect() 1.6 Smith-Waterman score: 237; 25.478% identity in 314 aa overlap Entrez lookup Re-search database >C48435 296- 560: ---------------------------------: 260 270 280 290 300 310 320 330 QUERY IKNHNKLNKNAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKP---FENHLKDNILISEFYTNGKRNEKDIFSK ...: :.: ::... : . .. . .:. . C48435 TLCAYLCAASGASINAAQEIPLEAQTLTGEPLVAYLRKNQNLFEVNSEPTPNFEQKIMDIKFKNQKLNFVVKNDPEPNED 10 20 30 40 50 60 70 80 340 350 360 370 380 390 400 QUERY VPEILDYRE--KGIVHEPKDQGLCGSCWAFASVGNIES--VFAKKNKNILSFSEQEVVDCSKD--NFGCDGGHPFYSFLY .:: : :: : . .::. :::::: .... : . .: .... ...: ... : . .::: :: . .. : C48435 IPEEYDPREKFKCSTFYIRDQANCGSCWAVSTAAAISDRICIATNGEKQVNISSTDILTCCNPQCGFGCGGGWSIRAWEY 90 100 110 120 130 140 150 160 410 420 430 440 450 QUERY VLQNELCLGDEYKYKA--------------KDDMFCLNYR------CKRKVS--LSSI-------GAV------KENQLI . . . : :: :. .: .. : ::.: . ..: : : ::. . C48435 FVYEGVVSGGEYLTKGVCRPYPIHPCGHHGNDTYYGECPREAATPPCKKKCQPGYKKIFRMDKRQGKVAYGVEPKEEAIQ 170 180 190 200 210 220 230 240 460 470 480 490 500 510 520 530 QUERY LALNEVGPLSVNVGVNNDFVAYSEGVYNGTCSEELN-HSVLLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWI . . ::. .. .: .:: :. :::. : . . :.: ..:.: ::: .: :. C48435 REILRHGPVVASFAVYEDFSLYKTGVYKHTAGALRGYHAVKMMGWGVDSKTKAKY-----------------------WL 250 260 270 280 290 300 540 550 560 QUERY IKNSWSKKWGENGFMRLSRNKNG----DNVFCGIGEEVFYPIL : ::: . :::::..:. :. : :.: :: C48435 IANSWHNDWGENGYFRFIRGINDCEIEDTVAAGIVDVDSL 310 320 330 340 --------------------------------------------------------------------------- >>A57480 tubulointerstitial nephritis antigen precursor - rabbit (474 aa) initn: 118 init1: 95 opt: 123 Z-score: 132.2 expect() 1.6 Smith-Waterman score: 189; 23.770% identity in 244 aa overlap Entrez lookup Re-search database >A57480 354- 556: ------------------------- : 320 330 340 350 360 370 380 390 QUERY LISEFYTNGKRNEKDIFSKVPEILDYREKGIVHEPKDQGLCGSCWAF--ASVGNIESVFAKKNKNILSFSEQEVVDC-SK :.. ::: :::. . .. .... ..: :....: .: A57480 VLLSMNEMRATLPETTDLPEFFIAFLQMAWMDSWAIGSKNCAASWAFSTASVAADRIAIQSNGRYTANLSPQNLISCCAK 200 210 220 230 240 250 260 270 400 410 420 430 440 QUERY DNFGCDGG----------------HPFYSFL--YVLQNELC-LGDEYKYKAKDDMF--CLN--------YRCKRKVSLSS . ::..: : : .. ..:. : . .. ..: : : :.:. .:: A57480 NRHGCNSGSIDRAWWYLRKRGLVSHACYPLFKDQNISNNTCAMTSKADGRGKRHATRPCPNNIEKSNRIYQCSPPYRVSS 280 290 300 310 320 330 340 350 450 460 470 480 490 500 510 QUERY IGAVKENQLILALNEVGPLSVNVGVNNDFVAYSEGVYNGTCS--EE-------LNHSVLLVGYGQVEKTKLNYNNKIQTY .:.... . . ::... . :..:: :. :.: . : :: .:.: :.:.: .. .. A57480 ----NETEIMKEIMQNGPVQAIMQVHEDFFHYKTGIYRHVISTNEESEKYRKLQTHAVKLTGWGTLKGAR---------- 360 370 380 390 400 410 420 520 530 540 550 560 QUERY NTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPIL . ::. .:: :::.:.:::::..:. :. : ... A57480 GQKEK----------FWIAANSWGKSWGENGYFRILRGVNESDIEKLIIAAWGQLTSSDEP 430 440 450 460 470 --------------------------------------------------------------------------- >>H64387 hypothetical protein MJ0704 - Methanococcus jannaschii (377 aa) initn: 78 init1: 78 opt: 120 Z-score: 130.6 expect() 1.9 Smith-Waterman score: 122; 28.743% identity in 167 aa overlap Entrez lookup Re-search database >H64387 19- 180: --------------------: 10 20 30 40 50 QUERY MVAIKEMKELAFARPSLVETLNKKKKFLKKKEKRTFVLSIYAFITFIIFCIGILYFTN ...:: :.: .... . . :. :. . . : : H64387 GGYGDLTDAESRLFAVYEIIEKRIKVGDVYAVIKCLNMINKNFNKYWMFIKDEKREKY---LRDFLR-ILSKLRVEYRKN 190 200 210 220 230 240 250 60 70 80 90 100 110 120 130 QUERY KSSAHNNN-NNKNEHSLKKEE--IELLRVLLEKYKKQKDGILNESSNEEDEEKYTLNSETYNNKNNVSNIKNDSIKSKKE . .:. ..:. ...:: : . . ... :: : . . .: :. :::::: .: .. :: :. : H64387 CLNRKNKPYSKKGLEAFKKTEFIVSFYKTIEEKIKTRDINSIN---------KFL--SETYNNISNY-EMFND--KTYLE 260 270 280 290 300 310 320 140 150 160 170 180 190 200 210 QUERY EYINLERILLEKYKKFINEN--NEENRKELSNILHKLLEINKLILREEKDDKKVYLINDNYDEKGALEIGMNEEMKYKKE ... . :::::.: ::: .:: .::.. :.::.. . ::: H64387 IFVHHLKELLEKYEKEKNENKLDEEIFNELKDELEKLINKCNNKLRELESQNNN 330 340 350 360 370 220 230 240 250 260 270 280 290 QUERY DPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHNKLNKNAMYKKKVNQFSDYSEEELKEYFKTLLHV --------------------------------------------------------------------------- >>A37488 Ras guanine nucleotide exchange factor son-of-sevenless (sos) 1 - hu (1333 aa) initn: 33 init1: 33 opt: 128 Z-score: 130.6 expect() 1.9 Smith-Waterman score: 128; 19.856% identity in 418 aa overlap Entrez lookup Re-search database >A37488 64- 454: ------------------------------------------------ : 30 40 50 60 70 80 90 QUERY KKKFLKKKEKRTFVLSIYAFITFIIFCIGILYFTNKSSAHNNNNNKNEHSLKKEEIELL----RVLLEKYKKQKDGILNE : : .. . :..:.. .::.. .... . : A37488 VEKIHPLLKEVLGYKIDHQVSVYIVAVLEYISADILKLVGNYVRNIRHYEITKQDIKVAMCADKVLMDMFHQDVEDINIL 110 120 130 140 150 160 170 180 100 110 120 130 140 150 160 170 QUERY SSNEEDEEKYTLNSETYNNKNNVSNIKNDSIKSKKEEYINLERILLEKYKK-FINENNEENRKELSNILHKLLEINKLIL : . ::: : . .:: . .: .. .. ..:: .... ... :..... . ... ::. ....:..: . A37488 SLT--DEEPSTSGEQTYYDL-----VK--AFMAEIRQYIRELNLIIKVFREPFVSNSKLFSANDVENIFSRIVDIHELSV 190 200 210 220 230 240 250 180 190 200 210 220 230 240 QUERY R---------EEKDDKKVY-LINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFF-KFMKEHNKVY-KNIDEQMRKF . : :. . . :... ... : :.... .: .. : . ..:. .. : .: ..: : ... A37488 KLLGHIEDTVEMTDEGSPHPLVGSCFEDL-AEELAFDPYESYARD--ILRPGFHDRFLSQLSKPGAALYLQSIGEGFKEA 260 270 280 290 300 310 320 330 250 260 270 280 290 300 310 320 QUERY EIFKINYISIKN-HNKLNKNAMYKKKVNQFSDYSEEE-LKEYFKTLLHVPNHMIEKYSKPF-ENHLKDNILISEFYTNGK . . . . .. :. . :. .. : ..: ::. . .::.: . : . :: . . .:... .::.. A37488 VQYVLPRLLLAPVYHCLHYFELLKQLEEKSEDQEDKECLKQAITALLNVQSGMEKICSKSLAKRRLSESA--CRFYSQQM 340 350 360 370 380 390 400 410 330 340 350 360 370 380 390 400 QUERY RNEKDIFSKVPEILDYREKGIVH-EPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKDNFGCDGGHPFY .... ..:. :: .:.: : :: : : : : :.. : ::....:. :. . : :.: : A37488 KGKQLAIKKMNEI----QKNIDGWEGKDIGQC--CNEFIMEGTLTRVGAKHERHIFLFDGLMI--CCKSNHGQPRLPGAS 420 430 440 450 460 470 480 410 420 430 440 450 460 470 QUERY SFLYVLQNELCLGDEYKYKAKDDM------FCLNYRCKRKVSLSSIGAVKENQLILALNEVGPLSVNVGVNNDFVAYSEG . : :.... . . . . ::: : . . . .: .:. .: ..:. . :: A37488 NAEYRLKEKFFM-RKVQINDKDDTNEYKHAFEIILKDENSVIFSAKSAEEKNNWMAALISLQYRSTLERMLDVTMLQEEK 490 500 510 520 530 540 550 560 480 490 500 510 520 530 540 550 QUERY VYNGTCSEELNHSVLLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNV A37488 EEQMRLPSADVYRFAEPDSEENIIFEENMQPKAGIPIIKAGTVIKLIERLTYHMYADPNFVRTFLTTYRSFCKPQELLSL 570 580 590 600 610 620 630 640 --------------------------------------------------------------------------- >>S35577 cysteine proteinase I - mountain papaya (fragment) (43 aa) initn: 72 init1: 72 opt: 106 Z-score: 130.4 expect() 2 Smith-Waterman score: 106; 42.424% identity in 33 aa overlap Entrez lookup Re-search database >S35577 337- 369: ----: 300 310 320 330 340 350 360 370 QUERY MIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKN .:.:.:: : ..:: :: :.:.::. .:... S35577 IVASIDWRQKGAVTPVRNQGSXGSXWTFSSVAAVEGIIKIRGT 10 20 30 40 380 390 400 410 420 430 440 450 QUERY ILSFSEQEVVDCSKDNFGCDGGHPFYSFLYVLQNELCLGDEYKYKAKDDMFCLNYRCKRKVSLSSIGAVKENQLILALNE --------------------------------------------------------------------------- >>A24594 probable major surface antigen (83K, 19K, 42K) precursor - Plasmodiu (1640 aa) initn: 127 init1: 51 opt: 129 Z-score: 130.2 expect() 2 Smith-Waterman score: 133; 22.268% identity in 485 aa overlap Entrez lookup Re-search database >A24594 65- 519: -------------------------------------------------------- : 30 40 50 60 70 80 90 100 QUERY KKFLKKKEKRTFVLSIYAFITFIIFCIGILYFTNKSSAHNNNNNKNEHSLKKEE--IELLRVLLEKYKKQKDGILNESSN .: .: : .::.. :: . :.:. :: : : ... A24594 NDVCANDYCQIPFNLKIRANELDVLKKLVFGYRKPLDNIKDNVGKMEDYIKKNKKTIENINELIEESKKTIDKNKNATKE 200 210 220 230 240 250 260 270 110 120 130 140 150 160 170 QUERY EEDEEKYTLNSET--YNNK----NNVSNIKNDSIKS-KKEEYINLERILLEKYKKFINENNEENRKELSNILHKLLEINK :: .. : . . ::.. .:. .. . : . ::.: : . ::.: ::: .. . .: . ::. :: A24594 EEKKKLYQAQYDLSIYNKQLEEAHNLISVLEKRIDTLKKNENI---KELLDK----INEIKNPPPANSGNTPNTLLDKNK 280 290 300 310 320 330 340 180 190 200 210 220 230 240 QUERY LILREEKDDKKVY-LINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEH-NKV-----YKNIDEQMRKFEI : ..::. :.. :. : : . . .. .. :... . : .: .:. : : :..:.. . ... A24594 KIEEHEKEIKEIAKTIKFNIDSLFTDPLELEYYLREKNKNIDISAKVETKESTEPNEYPNGVTYPLSYNDINNALNELNS 350 360 370 380 390 400 410 420 250 260 270 280 290 300 310 320 QUERY FKINYISIKNHNKL-NKNAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKD-NILISEFYTNGKRNE : . :. ...: .:: . .. ..: . .:..: : .. .. : :: ... :. . :..:.: . :. A24594 FG-DLINPFDYTKEPSKNIYTDNERKKFINEIKEKIK-IEKKKIESDKKSYEDRSKSLNDITKEYEKLLNEIYDSKFNNN 430 440 450 460 470 480 490 500 330 340 350 360 370 380 390 400 QUERY KDI--FSKV-PEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKDNFGCDGGHPFY- :. : :. . .:. . ..: :. .::: : .:.: . :. . . . : : :. . .: A24594 IDLTNFEKMMGKRYSYKVEKLTH-PN---------TFASYEN-----SKHNLEKLTKALKYMEDYSLRNIVVEKELKYYK 510 520 530 540 550 560 410 420 430 440 450 460 470 QUERY SFLYVLQNELCLGDEYKYKAKDDMFCL------NYRCKRKVSLSSIGAVKENQLILALNEVGPLS-VNVGVNNDFVAYSE ... ..::. : : ....: : .. . .:.: :. : .: .:.. :. ... ..: . .. A24594 NLISKIENEIETLVENIKKDEEQLFEKKITKDENKPDEKILEVSDIVKVQV-QKVLLMNKIDELKKTQLILKNVELKHNI 570 580 590 600 610 620 630 640 480 490 500 510 520 530 540 550 QUERY GVYNGTCSEELNHSV-LLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGD : :. .:. .. :.: ...: :. . :... ..:..: A24594 HVPNSYKQENKQEPYYLIVLKKEIDKLKV-FMPKVESLINEEKKNIKTQGQSDNSEPSTEGEITGQATTKPGQQAGSALE 650 660 670 680 690 700 710 720 560 QUERY NVFCGIGEEVFYPIL A24594 GDSVQAQAQEQKQAQPPVPVPVPEAKAQVPTPPAPVNNKTENVSKLDYLEKLYQFLNTSYICHKYILVSHSTMNEKILKQ 730 740 750 760 770 780 790 800 --------------------------------------------------------------------------- >>A38749 3-phosphatidylinositol kinase (EC 2.7.1.-) 85K chain alpha - bovine (724 aa) initn: 70 init1: 70 opt: 123 Z-score: 129.4 expect() 2.2 Smith-Waterman score: 123; 23.164% identity in 177 aa overlap Entrez lookup Re-search database >A38749 69- 243: ---------------------- : 30 40 50 60 70 80 90 100 QUERY KKKEKRTFVLSIYAFITFIIFCIGILYFTNKSSAHNNNNNKNEHSLKKEEIELLRVLLEKYKKQKDGILNESSNEEDE-- .... .:...:: . :..:. : ..:.: : :. A38749 FSDPLTFNSVVELINHYRNESLAQYNPKLDVKLLYPVSKYQQDQVVKEDNIEAVGKKLHEYNTQ----FQEKSREYDRLY 400 410 420 430 440 450 460 110 120 130 140 150 160 170 180 QUERY EKYTLNSETYNNKNNVSNIKNDSIKSKKEEYINLERILLEKYKKFINENNEENRKELSNILHKLLEINKLILREEKDDKK : :: .:. . : .. . :..:: .:. . :: : .:: :.:: :.. :.:. .... : : :... A38749 EDYTRTSQEIQMKRTAIEAFNETIKIFEEQCQTQERYSKEYIEKFKREGNET---EIQRIMHNYEKLKSRI-SEIVDSRR 470 480 490 500 510 520 530 540 190 200 210 220 230 240 250 260 QUERY VYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHNKLNKNA .... ...: ....:. : : :. : .... .. ... :...: . A38749 R--LEEDLKKQAAEYREIDKRMNSIKPDLIQLRKTRDQYLMWLTQKGVRQKKLNEWLGNENTEDQYSLVEDDEDLPHHDE 550 560 570 580 590 600 610 620 270 280 290 300 310 320 330 340 QUERY MYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYREKGIVH A38749 KTWNVGSSNRNKAENLLRGKRDGTFLVRESSKQGCYACSVVVDGEVKHCVINKTATGYGFAEPYNLYSSLKELVLHYQHT 630 640 650 660 670 680 690 700 --------------------------------------------------------------------------- >>G64245 hypothetical protein homolog MG413 - Mycoplasma genitalium (SGC3) (728 aa) initn: 135 init1: 105 opt: 123 Z-score: 129.4 expect() 2.2 Smith-Waterman score: 130; 21.739% identity in 299 aa overlap Entrez lookup Re-search database >G64245 56- 342: ----------------------------------- : 20 30 40 50 60 70 80 90 QUERY SLVETLNKKKKFLKKKEKRTFVLSIYAFITFIIFCIGILYFTNKSS---AHNNNNNKNEHSLKKEEIELLRVLLEKYKKQ : ..:: . :.:.. . : .. .:.. ::.::. G64245 MKVPPVFEQNNAQWNEPFPGRSSWFFSINSNKQVDAHWMSINEFK------EKMKKD 10 20 30 40 50 100 110 120 130 140 150 160 QUERY KDGILNESSNEEDEEKY-------TLNSETYNNKNNVSNIKNDSIKSKKEEYINLERILLEKYKKFINENNEENRKELSN . : ..:.: . .. :: ... .. : : : .: . . .... .. . .. .:: .. .: G64245 RKIIEAFTNNREITQTILRLGNGLVVRPETEDSRREAFNSKWDIFK--HTPILPEQEVVHHQISFYLRQNNPDEVLPIS- 60 70 80 90 100 110 120 170 180 190 200 210 220 230 240 QUERY ILHKLLEINKLILREEKDDKKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRK . ...:: . . . : ::. .. : .:.: : : .::. .::.. .:: . . . :::.. : G64245 ----FSNLSKLWISQLEFD-----INSLVSQT---EKTLNKETIGTKIKP--TIKFKDKFINAIKEVSLINQRIDESIDK 130 140 150 160 170 180 190 250 260 270 280 290 300 310 320 QUERY FEIFKINYISIKNHNKLNKNAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRN : .: :: .:.... : : .. :.: ..::: ....: .. . .... . . .. ... .:. :: G64245 NEALKSFNISTNNQSNFYPNLDYLYNLLQMSPNNKEEL-FFIRNLPRMVKTIFDRSTLTVKVKIGNSV--NEITL--LRN 200 210 220 230 240 250 260 330 340 350 360 370 380 390 400 QUERY EKDIF--SKVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKDNFGCDGGHPFYS .:. : :.. :.: ..:. G64245 NKNYFDLSSIEEFLTEKQKADNEMNIEFLALNFFVDGYESENISNYSVEPLFDSIKKLSTIKRTKDGFEYKFKYRKDFNE 270 280 290 300 310 320 330 340 --------------------------------------------------------------------------- >>A64505 P115 homolog - Methanococcus jannaschii (1169 aa) initn: 129 init1: 68 opt: 126 Z-score: 129.4 expect() 2.2 Smith-Waterman score: 150; 23.625% identity in 309 aa overlap Entrez lookup Re-search database >A64505 5- 290:------------------------------------ : 10 20 30 40 QUERY MVAIKEMKELAFARPSLVETLNKKKKFLKKKEKRTFVLSI---- .:. :. : : :: :. ::. :.:: ... A64505 DLLKIINISPIERRKIIDEISGIAEFDEKKKKAEEELKKARELIEMIDIRISEVE--NNLKKLKKEKEDAEKYIKLNEEL 150 160 170 180 190 200 210 220 50 60 70 80 90 100 110 QUERY ----YAFITFIIFCIGILYFTNKSSAHNNNNNKNEHSLKKEEIELLRVLLEKYKKQKDGI---LNESSNEEDEEKYT--- ::.: . ...: . ... .: .. ::: : .::. : .:. : . ..: :::..::: : . A64505 KAAKYALILKKVSYLNVLLENIQNDIKNLEELKNEFLSKVREID---VEIENLKLRLNNIINELNEKGNEEVLELHKSIK 230 240 250 260 270 280 290 300 120 130 140 150 160 170 180 QUERY -LNSETYNNKNNVSNIKNDSIKSKKEEYINLERILLEKYKKFINENNE--ENRKELSNILHKLLEIN--KLILRE---EK :. : :.:. ... :. .:. . : : .. . : ::.:.. . :.......: .:. ..: : :.: :. A64505 ELEVEIENDKKVLDSSINE-LKKVEVEIENKKKEIKETQKKIIENRDSIIEKEQQIKEIEEKIKNLNYEKERLKEAIAES 310 320 330 340 350 360 370 380 190 200 210 220 230 240 250 260 QUERY DDKKVYLINDNYDEKGALEIGMNEEMKYKKE-DPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHNK .. .: ..... . ..:: .. ::: . ..:. ..: ....:.. :.. :... : . . .. .: A64505 ESIIKHLKESEMEIADEIAKNQNELYRLKKELNDLDNLINRKNFE--IEKNNEMIKKLKEELETVEDVDTKPLYLELEN- 390 400 410 420 430 440 450 270 280 290 300 310 320 330 340 QUERY LNKNAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYRE :: . ..:. . . ...::. . : A64505 LNVEIEFSKRGIKELEEKKKELQAKLDELHAEYVKENARIKALKEMEELSMDRAIREILNANLPGIIDIVGNLGKTKIEY 460 470 480 490 500 510 520 530 --------------------------------------------------------------------------- >>S54052 DOS1 protein - yeast (Saccharomyces cerevisiae) (310 aa) initn: 53 init1: 53 opt: 117 Z-score: 128.8 expect() 2.4 Smith-Waterman score: 136; 25.592% identity in 211 aa overlap Entrez lookup Re-search database >S54052 15- 207: ----------------------- : 10 20 30 40 QUERY MVAIKEMKELAFARPSLVETLNKKKKFLKKKEK-----RTFVLSIYAFITFIIF : :: . .:.::: .. .... : .. . : S54052 DKTNEAFQKLEEEVNKRYEKTTSAFKKLVIEKDDGIEINLPISNETTETAQKYLKKLDENIHSVESLAQSYWSKMKTKNF 50 60 70 80 90 100 110 120 50 60 70 80 90 100 110 QUERY CIGILYFTNKSSAHNNNNNKNEHSLKKEEIEL----LRVLLEKYKKQKDGILNESSN------EEDEEKYTLNSETYNNK :. : : .:.:..:.:.:.: :..:: . .. :. .:.:. :... . . ::. . : ..: S54052 WSGFSSFDN--AAENDSNDKDENS-KENEIAVGGNRTEAELRTLSKDKSVYLDNKMDLQLDPFDVDEKTEEICSILQGDK 130 140 150 160 170 180 190 120 130 140 150 160 170 180 190 QUERY NNVSNIKNDSIKSKKEEYINLERILLEKYKKFINENNEENRKELSNILHKLLEINKLIL---REEKDDKKVYLINDNYDE . .:.. :: : .: : .. .: . . .:.... : .:::. . .: : ... .::.:: :: . :: . S54052 D-ISKLMND-IVPHKISYKDFWHIYFLQRNKILDK--ESKRKEILSKKEKETEEKEVEWDDEEEEEDDDKVEAVADNKS- 200 210 220 230 240 250 260 270 200 210 220 230 240 250 260 270 QUERY KGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHNKLNKNAMYKKKVNQFS :: ......: S54052 KGETKVAVSQEGLKDVSDHVGLANKDESKDDDDDDDWE 280 290 300 310 --------------------------------------------------------------------------- >>A23705 cysteine proteinase (EC 3.4.22.-) - Entamoeba histolytica (strain HM (312 aa) initn: 202 init1: 80 opt: 117 Z-score: 128.7 expect() 2.4 Smith-Waterman score: 333; 25.762% identity in 361 aa overlap Entrez lookup Re-search database >A23705 219- 567: ------------------------------------------: 180 190 200 210 220 230 240 250 QUERY REEKDDKKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKN : :. : .. ..:: . . :..:. ::..: : A23705 VILMFYIGYGIDFNTWVANNNKHFTAV-ESLRRRAIFNMN-ARIVA 10 20 30 40 260 270 280 290 300 310 320 330 QUERY HNKLNKNAMYKKKVN-QFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEIL .: :.. .: .:. :. ...:: ...::.. : : ..... : : ..:. . A23705 EN--NRKETFKLSVDGPFAAMTNEE----YNSLLKL------KRSGEEKGEVR--------YLN---------IQAPKAV 50 60 70 80 90 340 350 360 370 380 390 400 410 QUERY DYREKGIVHEPKDQGLCGSCWAFASVGNIES-VFAKK--NKNILSFSEQEVVDCSKD--NFGCDGGHPFYSFLYVLQNEL :.:.:: : .::: ::::..:.:.. .:. .. .: ... :..::...:.:... : ::.:: . :...: . A23705 DWRKKGKVTPIRDQGNCGSCYTFGSIAALEGRLLIEKGGDSETLDLSEEHMVQCTREDGNNGCNGGLGSNVYNYIMENGI 100 110 120 130 140 150 160 170 420 430 440 450 460 470 480 QUERY CLGDEYKYKAKDDMFCLNYRCKRKV-SLSSIGAVKENQLILALNEVGPLSVNVGVNN-DFVAYSEGVYNGT-CSEE---L ..: : ..:. . . :. : . .. .: .: :... : ..:.. ... .: :. :.:. : :... : A23705 AKESDYPYTGSDSTCRSDVKAFAKIKSYNRVARNNEVELKAAISQ-GLVDVSIDASSVQFQLYKSGAYTDTQCKNNYFAL 180 190 200 210 220 230 240 250 490 500 510 520 530 540 550 560 QUERY NHSVLLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFY :: : :::: . . :: ::..:::. :::.:.. . . : ::.. . .: A23705 NHEVCAVGYG--------------VADGKEC-----------WIVRNSWGTGWGEKGYINMVIEGNT----CGVATDPLY 260 270 280 290 300 QUERY PIL : A23705 PTGVEYL 310 --------------------------------------------------------------------------- >>A39340 neurofilament protein 60K splice form NF60 - longfin squid (511 aa) initn: 64 init1: 64 opt: 120 Z-score: 128.6 expect() 2.5 Smith-Waterman score: 120; 20.147% identity in 273 aa overlap Entrez lookup Re-search database >A39340 78- 333: -------------------------------- : 40 50 60 70 80 90 100 110 QUERY LSIYAFITFIIFCIGILYFTNKSSAHNNNNNKNEHSLKKEEIELLRVLLEKYKKQK---DGILNESSNEEDEEKYTLNSE :: .:: .:. .:.: ..:: . ..: .. .. ::.: A39340 TELIDQLERQQKDLEESRTYHQIDQEQIARQNQQLADLEGEISMLRRSIESLEKEKMRQSNILAKMNDEMEKMRMDLNNE 180 190 200 210 220 230 240 250 120 130 140 150 160 170 180 190 QUERY TYN--NKNNVSNIKNDSIKSKKEEYIN-LERILLEKYKKFINENNEENRKELSNILHKLLEINKLILREEKDDKKVYLIN : : . .: . .. .. .:. . . :... :. :: : :.::.. .. . . . . : ..: : A39340 TINHLDAENRRQTLEEELEFQKDVHAQELKELAALAYRDTTAENREFWRNELAQAIRDIQQEYDAKCDQMRGDIEAYY-N 260 270 280 290 300 310 320 330 200 210 220 230 240 250 260 QUERY DNYDE------KGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHNKLNKN . .: : .:. :.: . : .. ...:. ... . .. .. .. .. .: .: .:.: .. A39340 LKVQEFRTGATKQNMEVTRNKEENTKLKSNMTEIR--NRLADLEARNAQLERTNQDLLRDLE-------EKDRQNEL-ES 340 350 360 370 380 390 400 270 280 290 300 310 320 330 340 QUERY AMYKKKVNQFSDYSEEELKEYFKTLLHVPNHM---IEKYSKPFENHLKDNIL--ISEFYTNGKRNEKDIFSKVPEILDYR .::...... : ::: .. :. . . : : : .:.. . . : : .... :: ...: . A39340 CQYKEEITKLRGEMESILKE-LQDLMDIKLSLELEIAAYRKLLEGEESRVGMKQIVEQVVGARPNEAEVLSTILTRSEGG 410 420 430 440 450 460 470 350 360 370 380 390 400 410 420 QUERY EKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKDNFGCDGGHPFYSFLYVLQNELCLGDEYKY A39340 YEATGGITTTTTTSSQERRSMSEEKKSMGSSD 480 490 500 510 --------------------------------------------------------------------------- >>S23941 dipeptidyl-peptidase I (EC 3.4.14.1) - human (fragments) (119 aa) initn: 76 init1: 76 opt: 110 Z-score: 127.8 expect() 2.7 Smith-Waterman score: 110; 34.247% identity in 73 aa overlap Entrez lookup Re-search database >S23941 338- 404: --------- : 300 310 320 330 340 350 360 370 QUERY IEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYRE-KGI--VHEPKDQGLCGSCWAFASVGNIES-VFAKK : :. .:: : ..:. ::::..:::.: .:. . S23941 LPTSDVRNVHGINFVSPVRNQASCGSCYSFASMGMLEARIRILT 10 20 30 40 380 390 400 410 420 430 440 450 QUERY NKNILSFSEQEVVDCSKDNFGC--DGGHPFYSFLYVLQNELCLGDEYKYKAKDDMFCLNYRCKRKVSLSSIGAVKENQLI :.. .: ::::. ..: :: ... :. .. S23941 NSQTPILSPQEVVSYAQD-FGLVEEASFPYTDYYSSEYHYVGGFYGGMNEALMKLELVRHGPMAVAFEYVYDFLHY 50 60 70 80 90 100 110 --------------------------------------------------------------------------- >>H64474 hypothetical protein MJ1401 - Methanococcus jannaschii (808 aa) initn: 76 init1: 76 opt: 122 Z-score: 127.7 expect() 2.8 Smith-Waterman score: 122; 25.229% identity in 218 aa overlap Entrez lookup Re-search database >H64474 84- 289: -------------------------- : 50 60 70 80 90 100 110 120 QUERY ITFIIFCIGILYFTNKSSAHNNNNNKNEHSLKKEEIELLRVLLEKYKKQKDGI--LNESSNEEDEEKYTLNSETYNNKNN ....: ::.:: : .. ... :: . :.. . : .. H64474 MLIVRKPKKKKDEIEIVKVGGKIEDGIEVKNNQKIFANYKK 10 20 30 40 130 140 150 160 170 180 190 QUERY VSNIKNDSIKSK-KEEYINLERIL--LEKYKKFINENNEENRKEL---SNILHKLLEINKLILREEKDDKKVY--LINDN :.. : . . .. :. ..: :.. : :: ..:::. .:. :. .:. . : :..: : .: H64474 VGD-KYKLYRCRVGDKLIQPSKVLELLKSDKIFILKENEEEIEEVLKSYNLKFDYIELCPFCLL-----KNIYKRLTRNN 50 60 70 80 90 100 110 200 210 220 230 240 250 260 270 QUERY YDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYK--NIDEQMRKFEIFKINYISIKNHNKLNKNAMYKKK . : ::: .: .. ::. . .. ::.: .:. .:: . : . . : :. . . :. ....:... : H64474 RCRYGNLEICINCGINEIKEEVKISEEFIEKFLKRFKDVDKVLSLLRIRNPLDKPELTRYDIITGSEEDKIENY-----K 120 130 140 150 160 170 180 190 280 290 300 310 320 330 340 350 QUERY VNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYREKGIVHEPKDQ .... : :: ::: .:. H64474 IDEL-DIPEE-LKEIIKSRGIEELLPVQTLSVKAGLLNGDDLLIISATSSGKTLIGELAGIKNLIKTGKKFLFLVPLVAL 200 210 220 230 240 250 260 --------------------------------------------------------------------------- >>B39340 neurofilament protein 70K splice form NF70 - longfin squid (615 aa) initn: 64 init1: 64 opt: 120 Z-score: 127.4 expect() 2.9 Smith-Waterman score: 120; 20.147% identity in 273 aa overlap Entrez lookup Re-search database >B39340 78- 333: -------------------------------- : 40 50 60 70 80 90 100 110 QUERY LSIYAFITFIIFCIGILYFTNKSSAHNNNNNKNEHSLKKEEIELLRVLLEKYKKQK---DGILNESSNEEDEEKYTLNSE :: .:: .:. .:.: ..:: . ..: .. .. ::.: B39340 TELIDQLERQQKDLEESRTYHQIDQEQIARQNQQLADLEGEISMLRRSIESLEKEKMRQSNILAKMNDEMEKMRMDLNNE 180 190 200 210 220 230 240 250 120 130 140 150 160 170 180 190 QUERY TYN--NKNNVSNIKNDSIKSKKEEYIN-LERILLEKYKKFINENNEENRKELSNILHKLLEINKLILREEKDDKKVYLIN : : . .: . .. .. .:. . . :... :. :: : :.::.. .. . . . . : ..: : B39340 TINHLDAENRRQTLEEELEFQKDVHAQELKELAALAYRDTTAENREFWRNELAQAIRDIQQEYDAKCDQMRGDIEAYY-N 260 270 280 290 300 310 320 330 200 210 220 230 240 250 260 QUERY DNYDE------KGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHNKLNKN . .: : .:. :.: . : .. ...:. ... . .. .. .. .. .: .: .:.: .. B39340 LKVQEFRTGATKQNMEVTRNKEENTKLKSNMTEIR--NRLADLEARNAQLERTNQDLLRDLE-------EKDRQNEL-ES 340 350 360 370 380 390 400 270 280 290 300 310 320 330 340 QUERY AMYKKKVNQFSDYSEEELKEYFKTLLHVPNHM---IEKYSKPFENHLKDNIL--ISEFYTNGKRNEKDIFSKVPEILDYR .::...... : ::: .. :. . . : : : .:.. . . : : .... :: ...: . B39340 CQYKEEITKLRGEMESILKE-LQDLMDIKLSLELEIAAYRKLLEGEESRVGMKQIVEQVVGARPNEAEVLSTILTRSEGG 410 420 430 440 450 460 470 350 360 370 380 390 400 410 420 QUERY EKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKDNFGCDGGHPFYSFLYVLQNELCLGDEYKY B39340 YEATGDSQISMKMMRGELAAKTTYQRTSKGSVSIKEADSQGCFIALETKKEENLTGWKIVRKVDDNKVYTYEIPNLVLKT 480 490 500 510 520 530 540 550 --------------------------------------------------------------------------- >>S67069 hypothetical protein YOR177c - yeast (Saccharomyces cerevisiae) (464 aa) initn: 48 init1: 48 opt: 118 Z-score: 127.2 expect() 3 Smith-Waterman score: 133; 22.581% identity in 217 aa overlap Entrez lookup Re-search database >S67069 154- 353: ------------------------ : 120 130 140 150 160 170 180 190 QUERY ETYNNKNNVSNIKNDSIKSKKEEYINLERILLEKYKKFINENNEENRKELSNILHKLLEINKLILREEKDDKKVYLIN-- :..: :.: :. . .::...: ::. : .:. . S67069 SDLAQTFETLAVGITHETNRKAECERSKNAIDSLYYHEQLEKKELNEKSLQMAIDHLLKVTKQNLRQADDGNKLKETEAL 140 150 160 170 180 190 200 210 200 210 220 230 240 250 260 QUERY ----DNYDEKGALEIGMNE-EMKYKKEDPINNI------KYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHN .. .: .:..: ... .: ::: : . .. .: ... . ..::.. : .... ..::. S67069 KSFIEEIEEVDDNKISINSLQQQLLEEKTANNILRRDYYKLQERGRRLCHEFQELQDDYSKQMKQKE-YEVQ--KLKNEI 220 230 240 250 260 270 280 290 270 280 290 300 310 320 330 QUERY KLNKNAMYKKKVNQFSDYSEEELKEYFKTLLHVP---NHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEIL :. : . :... . ::..: :.::. .. ::. :.:.. .. : :..:.. . . .. :... .. S67069 KVLLNMNDNLKAEK-AHYSQKE-KQYFQKYTYIEKYMNHVKEEYNRKEDECKKLNFIIDKSMKKIEHLERSLQTQFTAQN 300 310 320 330 340 350 360 370 340 350 360 370 380 390 400 410 QUERY DYREKGIVHE-PKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKDNFGCDGGHPFYSFLYVLQNELCLGD .. : .: ::: : S67069 SFSTAMIQEEGPKDAHLKDRYHKVKEFMEQKLQTSKINDPSCSEAEALDNVLCLIESSMKTLDKNSKCYPIATKKCIKYV 380 390 400 410 420 430 440 450 --------------------------------------------------------------------------- >>H64245 hypothetical protein MG414 - Mycoplasma genitalium (SGC3) (1036 aa) initn: 135 init1: 105 opt: 123 Z-score: 127.1 expect() 3 Smith-Waterman score: 130; 21.739% identity in 299 aa overlap Entrez lookup Re-search database >H64245 56- 342: ----------------------------------- : 20 30 40 50 60 70 80 90 QUERY SLVETLNKKKKFLKKKEKRTFVLSIYAFITFIIFCIGILYFTNKSS---AHNNNNNKNEHSLKKEEIELLRVLLEKYKKQ : ..:: . :.:.. . : .. .:.. ::.::. H64245 PDFWWGLAMTPIRAYRFWWDLELVKVPPVFEQNNAQWNEPFPGRSSWFFSINSNKQVDAHWMSINEFK------EKMKKD 290 300 310 320 330 340 350 100 110 120 130 140 150 160 QUERY KDGILNESSNEEDEEKY-------TLNSETYNNKNNVSNIKNDSIKSKKEEYINLERILLEKYKKFINENNEENRKELSN . : ..:.: . .. :: ... .. : : : .: . . .... .. . .. .:: .. .: H64245 RKIIEAFTNNREITQTILRLGNGLVVRPETEDSRREAFNSKWDIFK--HTPILPEQEVVHHQISFYLRQNNPDEVLPIS- 360 370 380 390 400 410 420 430 170 180 190 200 210 220 230 240 QUERY ILHKLLEINKLILREEKDDKKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRK . ...:: . . . : ::. .. : .:.: : : .::. .::.. .:: . . . :::.. : H64245 ----FSNLSKLWISQLEFD-----INSLVSQT---EKTLNKETIGTKIKP--TIKFKDKFINAIKEVSLINQRIDESIDK 440 450 460 470 480 490 500 250 260 270 280 290 300 310 320 QUERY FEIFKINYISIKNHNKLNKNAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRN : .: :: .:.... : : .. :.: ..::: ....: .. . .... . . .. ... .:. :: H64245 NEALKSFNISTNNQSNFYPNLDYLYNLLQMSPNNKEEL-FFIRNLPRMVKTIFDRSTLTVKVKIGNSV--NEITL--LRN 510 520 530 540 550 560 570 330 340 350 360 370 380 390 400 QUERY EKDIF--SKVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKDNFGCDGGHPFYS .:. : :.. :.: ..:. H64245 NKNYFDLSSIEEFLTEKQKADNEMNIEFLALNFFVDGYESENISNYSVEPLFDSIKKLSTIKRTKDGFEYKFKYRKDFNE 580 590 600 610 620 630 640 650 --------------------------------------------------------------------------- >>S31909 cathepsin B-like cysteine proteinase (EC 3.4.22.-) - fluke (Schistos (316 aa) initn: 183 init1: 77 opt: 115 Z-score: 126.6 expect() 3.2 Smith-Waterman score: 213; 22.140% identity in 271 aa overlap Entrez lookup Re-search database >S31909 325- 552: ---------------------------- : 290 300 310 320 330 340 350 360 QUERY EYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYREK----GIVHEPKDQGLCGSCWAF ...:. ..: .: :.: . . .::. :.: :: S31909 PNAGWKADKSDRFHSVDDARILLGGRREDPNLRQKRRPTVDHHDLKVEIPSHFDSRKKWPRCKSISQIRDQSRCASSWAV 20 30 40 50 60 70 80 90 370 380 390 400 410 QUERY ASVGNIES--VFAKKNKNILSFSEQEVVDCSKD-NFGCDGGHPFYSFLYVLQNELCLG--------------------DE ..:: . . . . .:. . .: ....: .. . ::::: : .. : ... . : .. S31909 SAVGAMSDRICIQSGGKQSVELSAIDLISCCENCGSGCDGGFPGPAWDYWVSHGIVTGGSKENHTGCQPYPFPKCEHHSK 100 110 120 130 140 150 160 170 420 430 440 450 460 470 480 QUERY YKYKAKDDMFCLNYRCKRKVSLS--------------SIGAVK-ENQLILALNEVGPLSVNVGVNNDFVAYSEGVYNGTC :: . : . . .:::: . . ::...: :. . . ::. . . . .::. :. :.: : S31909 GKYPSCGDKMYKTPQCKRKCQKGYKTPYEHDKHYGGISINVIKNESAIQKEIMMYGPVEAYLLIFEDFLNYKSGIYRYTT 180 190 200 210 220 230 240 250 490 500 510 520 530 540 550 560 QUERY SEELN-HSVLLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIG . .. : : ..:.: .: : ::. :.:.. :::.:..:. :..: S31909 GSFVGEHYVRIIGWG-IE------------------------NGTAYWLAANTWNEDWGEKGYFRIVRGRNECSVESVVV 260 270 280 290 300 310 QUERY EEVFYPIL S31909 AGRLKS --------------------------------------------------------------------------- >>S57751 protein phosphatase 2A chain B - yeast (Candida tropicalis) (508 aa) initn: 64 init1: 39 opt: 118 Z-score: 126.6 expect() 3.2 Smith-Waterman score: 123; 23.143% identity in 350 aa overlap Entrez lookup Re-search database >S57751 84- 403: ---------------------------------------- : 50 60 70 80 90 100 110 QUERY ITFIIFCIGILYFTNKSSAHNNNNNKNEHSLKKEEIELLRVLLEKYKKQKDG-----ILNESSNEEDEEKYTLNSETYNN ::.:. ...: ...: .... : : . : .. S57751 SQCFGDKGDIENITEADIISTVEFDHTGDFLATGDKGGRVVLFERNQSKKKQSCEYKFFTEFQSHDAEFDYLKSLEIEEK 10 20 30 40 50 60 70 80 120 130 140 150 160 170 180 QUERY KNNVSNIK--NDSI--KSKKEEYINLERILLEKYKKFINENNEENRKEL--SNILHKLLEINKLILREE---KDDKKVYL :... .: :::. : ... :.: .: :. :...::: .. ..: ::: . :.. .: :... . ::.: S57751 INKIKWLKSANDSLCLLSTNDKTIKLWKIQ-ERQIKLVSENNLNGLNHLPSSNIGIESLKLPQLQLHDKLISAQPKKIYA 90 100 110 120 130 140 150 160 190 200 210 220 230 240 250 260 QUERY INDNYDEKGALEIGMNEEMKYKKED-PIN--NIKYASKFFKFMKEHNKVYKNIDEQMR--KFEIFKIN---YISIKNHNK : . ... .. ..: . .: :: :. :.. :... . .... : . .:. .. : : : :. : S57751 NAHAY-HINSISVNSDQETYLSADDLRINLWNLGIADQSFNIVDIKPANMEELTEVITSAEFHPLQCNLFMYSSSKGTIK 170 180 190 200 210 220 230 240 270 280 290 300 310 320 330 QUERY LNKNAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKR-NEKDIFS-KVPEILDY :. . :.. : . ..::. :. :. : ... ..: .: .:. .: .. :. . : . S57751 LSDM-----RSNSLCDSHAKIFEEYLD-----PS------SHNFFTEITSSISDVKFSHDGRYIASRDYMTVKIWD-LAM 250 260 270 280 290 300 340 350 360 370 380 390 400 410 QUERY REKGI----VHEPKDQGLCGSCWAFASVGNIESVFAKKNKNIL--SFSEQEVVDCSKDNFGCDGGHPFYSFLYVLQNELC ..: : ::: . :: . : ..: :. ::... :...: :. . : : : : : S57751 ENKPIKTIDVHEHLRERLCDTYENDAIFDKFEVQFGGDNKSVMTGSYNNQFVIYPNAVNTGNDDKPKFKSAFKNSSKRSK 310 320 330 340 350 360 370 380 420 430 440 450 460 470 480 490 QUERY LGDEYKYKAKDDMFCLNYRCKRKVSLSSIGAVKENQLILALNEVGPLSVNVGVNNDFVAYSEGVYNGTCSEELNHSVLLV S57751 KNGFSTRTTDDDDDDDDDDDDEEADDEFDEEVPATKNSPGSQLEDDDEQEEIILQADKSAFKSKKSGQHPMRRRMTSGVG 390 400 410 420 430 440 450 460 --------------------------------------------------------------------------- >>S31907 cathepsin B (EC 3.4.22.1) - fluke (Schistosoma japonicum) (342 aa) initn: 227 init1: 86 opt: 115 Z-score: 126.1 expect() 3.4 Smith-Waterman score: 214; 23.985% identity in 271 aa overlap Entrez lookup Re-search database >S31907 325- 552: ---------------------------- : 290 300 310 320 330 340 350 360 QUERY EYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYREK----GIVHEPKDQGLCGSCWAF ...:. ..: .: :.: . . .::. ::::::: S31907 PDAGWKADKSDRFHSLDDARILMGARKEDAEMKRNRRPTVDHHDLNVEIPSQFDSRKKWPHCKSISQIRDQSRCGSCWAF 50 60 70 80 90 100 110 120 370 380 390 400 410 QUERY ASVGNIESVFAKKNKNILS--FSEQEVVDCSKD-NFGCDGGHPFYSFLYVLQNELCLG---------DEY---------- ..: . . . .. . : .: ....: :: . ::.:: : .. : .. . : . : S31907 GAVEAMTDRICIQSGGGQSAELSALDLISCCKDCGDGCQGGFPGVAWDYWVKRGIVTGGSKENHTGCQPYPFPKCEHHTK 130 140 150 160 170 180 190 200 420 430 440 450 460 470 480 QUERY -KYKAKDDMFCLNYRCKRKVSLS---------SIG----AVKENQLILA--LNEVGPLSVNVGVNNDFVAYSEGVY-NGT :: : . . .::. . . : :..:. .. . ::. . : .::. :. :.: . : S31907 GKYPACGTKIYKTPQCKQTCQKGYKTPYEQDKHYGDESYNVQNNEKVIQRDIMMYGPVEAAFDVYEDFLNYKSGIYRHVT 210 220 230 240 250 260 270 280 490 500 510 520 530 540 550 560 QUERY CSEELNHSVLLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIG : .:.. ..:.: ::: : ::.: :::.. :::.:..:. :... S31907 GSIVGGHAIRIIGWG-VEKRT------------------P------YWLIANSWNEDWGEKGLFRMVRGRDECSIESDVV 290 300 310 320 330 QUERY EEVFYPIL S31907 AGLIKT 340 --------------------------------------------------------------------------- >>S55940 telomerase component p95 - Tetrahymena thermophila (SGC5) (872 aa) initn: 44 init1: 44 opt: 120 Z-score: 125.1 expect() 3.9 Smith-Waterman score: 133; 23.372% identity in 261 aa overlap Entrez lookup Re-search database >S55940 53- 293: ------------------------------ : 20 30 40 50 60 70 80 90 QUERY ARPSLVETLNKKKKFLKKKEKRTFVLSIYAFITFIIFCIGILYFTNKSSAHNN-NNNKNEHSLKKEEIELLRVLLEKYKK .: : :... :. :.. .:.. .... :::: . .. :. S55940 QAPIGNETNLDFVLQNLEVYKSQIEHYKTQQQQIKEEDLKLLKFKNQDQDGNSGNDDDDEENNSNKQQELLRRV-NQIKQ 10 20 30 40 50 60 70 80 100 110 120 130 140 150 160 QUERY QKDGILNESSNEEDEEKYTLNSETYNNKNNVS--NIKNDSIKSKKEEYINLERILLE-KYKKFINEN--NEENRKELSNI : . : . .:. : . .:: : :.::..: ..:...... :: .. . .... :. .::. ....:.: . S55940 QVQLIKKVGSKVEKD--LNLN-EDENKKNGLSEQQVKEEQLRTITEEQVKYQNLVFNMDYQLDLNESGGHRRHRRETDYD 90 100 110 120 130 140 150 160 170 180 190 200 210 220 230 QUERY LHKLLEINK------LILREEKDDK----KVYLINDNYDEKGALEIGMN----EEMKYKKEDPINNIKYASKFFKFMKEH .: .::.. : ..: . : :. ..:::. :....: : : .: ..:: ... .. .. S55940 TEKWFEISHDQKNYVSIYANQKTSYCWWLKDYFNKNNYDH---LNVSINRLETEAEFYAFDDFSQTIKLTNNSYQTVN-- 170 180 190 200 210 220 230 240 240 250 260 270 280 290 300 310 QUERY NKVYKNIDEQMRKFEIFKINYISIKNHNKLNKNAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDN . :.:... . .... .:.. : :: . : . :: :. :.. : ..:.. : S55940 --IDVNFDNNLCILALLRF-LLSLERFNILNIRSSYTR--NQ---YNFEKIGELLETIFAVVFSHRHLQGIHLQVPCEAF 250 260 270 280 290 300 310 320 330 340 350 360 370 380 390 QUERY ILISEFYTNGKRNEKDIFSKVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKDN S55940 QYLVNSSSQISVKDSQLQVYSFSTDLKLVDTNKVQDYFKFLQEFPRLTHVSQQAIPVSATNAVENLNVLLKKVKHANLNL 320 330 340 350 360 370 380 390 --------------------------------------------------------------------------- >>JQ1515 heat-shock protein HSP70 - Chlamydomonas reinhardtii (649 aa) initn: 36 init1: 36 opt: 118 Z-score: 125.0 expect() 3.9 Smith-Waterman score: 127; 25.532% identity in 235 aa overlap Entrez lookup Re-search database >JQ1515 105- 322: --------------------------- : 70 80 90 100 110 120 130 QUERY NNNNKNEHSLKKEEIELLRVLLEKYKKQKDGILNESSNEEDEEKYTLNSETYNNKNNVSNIKNDSIKSKKEEYI-----N : .:. :: .. .: .: .: .:::. . : JQ1515 NGKELNKSINPDEAVAYGAAVQAAILTGEGGEKVQDLLLLDVTPLSLGLETAGGVMTVLIPRNTTIPTKKEQVFSTYSDN 360 370 380 390 400 410 420 430 140 150 160 170 180 190 200 210 QUERY LERILLEKYKKFINENNEENRKELSNILHKLLEINKLILREEKDDKKVYLINDNYDEKGALEIGMNEEMKYKKED-PINN .:.. :. .:. : . .:.: :. :.. : . .. .: : : .: :... ... .:. :.: JQ1515 QPGVLIQVYE------GERARTKDNNLLGKF-ELTG-IPPAPRGVPQINVIFD-IDANGILNVSAEDKTTGNKNKITITN 440 450 460 470 480 490 500 510 220 230 240 250 260 270 280 290 QUERY IK-YASK--FFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHNKLNKNAMYKKKV-NQFSDYSEEELKEYFKTLLHV- : :: . ....: .: :: :::..: : . :..:. .:.. . :: .:.: ..: ... . . . JQ1515 DKGRLSKDEIERMVQEAEK-YKADDEQLKKVEAKN----SLENYAYNMRNTIREDKVASQLSASDKESMEKALTAAMDWL 520 530 540 550 560 570 580 300 310 320 330 340 350 360 QUERY -PNHMIEKYSKPFENHLKD-----NILISEFYTNGKRNEKDIFSKVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIE :.: : . ::.:::. : .:...: .: JQ1515 EANQMAE--VEEFEHHLKELEGLCNPIITRLYQGGAGAGGMPGGGAGAGAAPSGGSGAGPKIEEVD 590 600 610 620 630 640 --------------------------------------------------------------------------- >>S67600 hypothetical protein YDL065c - yeast (Saccharomyces cerevisiae) (350 aa) initn: 77 init1: 46 opt: 114 Z-score: 124.9 expect() 4 Smith-Waterman score: 114; 23.348% identity in 227 aa overlap Entrez lookup Re-search database >S67600 67- 281: --------------------------- : 30 40 50 60 70 80 90 100 QUERY FLKKKEKRTFVLSIYAFITFIIFCIGILYFTNKSSAHNNNNNKNEHSLKKEEIELLRVLLEK-----YKKQKDGILNESS :: ::...: :... : ::. ......: . .: S67600 EKNAESKDSDGVQVANESEEDPELKEMMVDLQNEFANLMKNNGNENNVKTEDFNKLISALEEAAKVPHQQMEQGCSSLKS 60 70 80 90 100 110 120 130 110 120 130 140 150 160 170 180 QUERY NEEDEEKYTLNSETYNNKNNVSNIKNDSIKSKKEEYINLERILLEKYKKFINENNEENRKELSNILHKLLEINKLILREE : : : :.:. . . :: ::: .. ::. ... : :. :. ....: ...:: .::. .: S67600 NSTD--KGTVNGSNPGFKNIVSN----TLDRLKENGNKVDTSLAEETKESQRSGQNNN---IDDILSQLLDQMVASGGKE 140 150 160 170 180 190 200 190 200 210 220 230 240 250 QUERY KDDKKVYLINDNYDE--KGALEIGMNEEMKYKKEDPINNIKYASKF---FKFMKEHNKVYKNIDEQMRKFEIFK--INYI . ... : . ..:. :. ..:. : .:..... :.: :. :... ..: :.:.: .: S67600 SAENQFDLKDGEMDDAITKILDQMTSKEVLY---EPMKEMR--SEFGVWFQENGENEEHKEKIGTYKRQFNIVDEIVNIY 210 220 230 240 250 260 270 260 270 280 290 300 310 320 330 QUERY SIKNHNKLNKNAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVP .:....:. .: .:... : :. S67600 ELKDYDELK----HKDRVTELLDELEQLGDSPIRSANSPLKHGNEEEELMKMLEIDGNDPNLGNLDKELTDGCKQQ 280 290 300 310 320 330 340 350 --------------------------------------------------------------------------- >>B33501 myosin heavy chain 2, smooth muscle - rabbit (fragment) (484 aa) initn: 47 init1: 47 opt: 116 Z-score: 124.9 expect() 4 Smith-Waterman score: 117; 20.588% identity in 238 aa overlap Entrez lookup Re-search database >B33501 71- 291: --------------------------- : 40 50 60 70 80 90 100 QUERY KEKRTFVLSIYAFITFIIFCIGILYFTNKSSAHNNNNNKNEHSLK-KEEIE----LLRVLLEKYKKQKDGILNESSNEED :..:. :::.: .:.. .: ..:: . ... .: . B33501 QLLAEEKNISSKYADERDRAEAEAREKETKALSLARALEEALEAKEELERTNKMLKAEMEDLVSSKDDV-GKNVHELE 10 20 30 40 50 60 70 110 120 130 140 150 160 170 QUERY EEKYTLNSETYNNKNNVSNIKNDSIKSKKEEYINLE------RILLEKYKKFINENNEENRKELSNILHKLLEINKLILR . : .:... . :... ... : ... .. . :: .. .:. . .:.:::.:..:. ::. . :. B33501 KSKRALETQMEEMKTQLEELE-DELQATEDAKLRLEVNMQALKVQFERDLQARDEQNEEKRRQLQRQLHEY----ETELE 80 90 100 110 120 130 140 150 180 190 200 210 220 230 240 250 QUERY EEKDDKKVYLINDNYDEKG--ALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIK .:. .. . . : ::. . .: ..: . .: ... :..: . . . :: : : : . : B33501 DERKQRALAAAAKKKLEGDLKDLELQADSAIKGREEAIKQLLKLQAQMKDFQRELEDARASRDEI---FATAKENEKKAK 160 170 180 190 200 210 220 260 270 280 290 300 310 320 330 QUERY NHN----KLNKNAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKV . . .:... ... . .: .::: : . . : B33501 SLEADLMQLQEDLAAAERARKQADLEKEELAEELASSLSGRNALQDEKRRLEARIAQLEEELEEEQGNMEAMSDRVRKAT 230 240 250 260 270 280 290 300 --------------------------------------------------------------------------- >>F64055 prrD protein homolog - Haemophilus influenzae (strain Rd KW20) (163 aa) initn: 35 init1: 35 opt: 109 Z-score: 124.8 expect() 4.1 Smith-Waterman score: 109; 23.438% identity in 128 aa overlap Entrez lookup Re-search database >F64055 209- 325: -------------- : 170 180 190 200 210 220 230 240 QUERY KLLEINKLILREEKDDKKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVY---KNIDEQ-MR .:: ::. ... ::..: : : ...... .: F64055 MTQYKTIAESNNFIVLDQYNKFVEESNAGYQTERSLEREFIR 10 20 30 40 250 260 270 280 290 300 310 QUERY KFEIFKINYIS-IKNHNKLNKNAMYK-KKVNQFSDYSEEELKEYFKTLLHVP-NHMIEKYSKPFENHLKDNIL----ISE .. .:.. ..::..: :: . ...:. .:. : ..... : : ...::: : .... : .. :.. F64055 DLQAQGYEYLQWLNNHDELIKNLRAQLQRLNNVV-FSDAEWQRFLEEYLDKPSDNLIEKTRKIHDDYIYDFVFDNGRIQN 50 60 70 80 90 100 110 120 320 330 340 350 360 370 380 390 QUERY FYTNGKRNEKDIFSKVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKDNFGCDG .: :.: F64055 IYLLDKKNLANNSLQVINQFKQTGSYDNRYDVTILVNGLPLY 130 140 150 160 --------------------------------------------------------------------------- >>A64238 lipase-esterase lip1 homolog - Mycoplasma genitalium (SGC3) (273 aa) initn: 35 init1: 35 opt: 112 Z-score: 124.5 expect() 4.2 Smith-Waterman score: 112; 26.316% identity in 171 aa overlap Entrez lookup Re-search database >A64238 97- 264: --------------------: 60 70 80 90 100 110 120 130 QUERY TNKSSAHNNNNNKNEHSLKKEEIELLRVLLEKYKKQKDGILNESSNEEDEEKYTLNSETYNNKNNVSNIKNDSIKSKKEE .: . .. :. : .: : .::.. :.: :. A64238 YKELSPIHYGELLAAFIENKDLENIVLIGHSMGAAVCSYAMNLLNAKRVEKLILLAPLSYCNLLRYFKIKSSFKKDKAER 70 80 90 100 110 120 130 140 140 150 160 170 180 190 200 210 QUERY YINLERILLEKYKKFINENNEENRKELSNILHKLLEINKLILREEKDDKKVYLINDNYDEKGALEIGMNEEMKYKKEDPI . :.. .. :.... .::. ::. . . : : : ::.: .:.: : : ...:. . : A64238 MANFKAMFQTKFSNLTDENSWENELSKHSKMAKKLSNN--ILKELPVLNKTY---KNLKLPVFLVLAQNDLFMPTKL--- 150 160 170 180 190 200 210 220 220 230 240 250 260 270 280 290 QUERY NNIKYASKFF-KFMKEHNKVYKNIDEQM--RKFEIFKINYISIKNHNKLNKNAMYKKKVNQFSDYSEEELKEYFKTLLHV ...: .:.. : . ...: : ..:: :.: : . .: :::::.: A64238 -TLSYFNKYLIKNNNLQSSVILNSEHQMFNSKYESFCKAMDDILNHNKLSKIY 230 240 250 260 270 300 310 320 330 340 350 360 370 QUERY PNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKK --------------------------------------------------------------------------- >>S04511 keratin 3, type I, cytoskeletal (clone pUF451) - African clawed frog (327 aa) initn: 50 init1: 50 opt: 113 Z-score: 124.3 expect() 4.3 Smith-Waterman score: 113; 21.212% identity in 231 aa overlap Entrez lookup Re-search database >S04511 72- 290: --------------------------- : 40 50 60 70 80 90 100 110 QUERY EKRTFVLSIYAFITFIIFCIGILYFTNKSSAHNNNNNKNEHSLKKEEIELLRVLLEKYKKQKDGILNESSNEEDEEKYTL :.:. . :: :.. : :.::. : . .. .. . ::. S04511 TMQNLNDRLASYLDKVHALETANTELERTIKEWYEKQRAGSSSGDGAKDYSKYYTM 10 20 30 40 50 120 130 140 150 160 170 180 QUERY NSETYNN--KNNVSNIK----NDSIKSKKEEY-INLERILLEKY-KKFINENNEENRKELSNILHKLLEINKLILREEKD .. :. .. : : ::. : ... ...: :.: .. .. . . :. .... . .... . : . S04511 INDLKNQIIAASIENAKFLLQNDNAKLAADDFKMKFEN---EQYMRQTVEADINGARRVMDDLTLSKSDLESQL--ESLS 60 70 80 90 100 110 120 130 190 200 210 220 230 240 250 QUERY DKKVYLINDNYDE-KG--ALEIG-MNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNH .. .:: ... :: :: . ..: .: :: : .: . :.... . :... .. : ..: .: . S04511 EELAYLKKNHEDELKGMQVTQVGQVNVEM---------NAAPSSDLTKILNDMRSQYEDLAKRNRAAAEEQFNRMSTDLK 140 150 160 170 180 190 200 260 270 280 290 300 310 320 330 QUERY NKLNKNAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDY : :... .:. :. :: : .:: S04511 NTLSQGIEQQKE-------SKSELTELKRTLQSLEIELQSQLAMKKSLEMTLAEVEGSFCMKLSRLQEMIVNVEEQIARL 210 220 230 240 250 260 270 --------------------------------------------------------------------------- >>S07533 puff II/9A-2 protein precursor - fungus gnat (Sciara coprophila) (286 aa) initn: 52 init1: 52 opt: 112 Z-score: 124.2 expect() 4.4 Smith-Waterman score: 112; 27.389% identity in 157 aa overlap Entrez lookup Re-search database >S07533 64- 213: ------------------ : 30 40 50 60 70 80 90 QUERY KKKFLKKKEKRTFVLSIYAFITFIIFCIGILYFTNKSSAHNNNNNKNEHSLKKEEIELLRVLLEKYKKQKDGI------L :.: .: .::::. ..:: :..::. : S07533 IQELQGGSVVTVDDKCTCKDTLNTLTKGQLIDRLVLCNQRNDNLEKIIDGLKKEN-NILR-------KENDGLRAENCQL 20 30 40 50 60 70 80 100 110 120 130 140 150 160 170 QUERY NESSNEEDEEKYTLNSETYNNKNNVSNIKNDSIKSKKEEYINLERILLEKYKKFINENNEENRKELSNILHKLLEINKLI .:. ..: : . .. . ..:. :.: ..:.. :.: . .. : :: :: . . ..:: : :..: . :..: S07533 SEALKREKEARQKAEKALKECQKNTENLK-ETIEQLKKELAEAQKAL-EKCKKELADCKKENAKLLNKIEELNCTITQLQ 90 100 110 120 130 140 150 160 180 190 200 210 220 230 240 250 QUERY LREEKDDKKVYLINDNYDE-KGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISI . :. . .. . :: : :.: :: . .:.. S07533 EKLERCRGRERDLQCQLDECKKKLNICNNELIACRKQQEELRCKIERLNTEIEKLRKQNAACEKDLNTLRCETSEFLAIA 170 180 190 200 210 220 230 240 --------------------------------------------------------------------------- >>S46426 probable botulinum neurotoxin regulator protein 22 - Clostridium bot (179 aa) initn: 39 init1: 39 opt: 109 Z-score: 124.2 expect() 4.4 Smith-Waterman score: 109; 24.242% identity in 165 aa overlap Entrez lookup Re-search database >S46426 122- 275: ------------------- : 90 100 110 120 130 140 150 160 QUERY LRVLLEKYKKQKDGILNESSNEEDEEKYTLNSETYNNKNNVSNIKNDSIKSKKEEYINLE-RILLEKYKKFINENNEENR . :.:.:. .. ..:.. . .:: .: : . S46426 MNDLFYAIENLKHDN---QHFNFIEMSLKKYIEKTSKKYNLYYDYYN 10 20 30 40 170 180 190 200 210 220 230 QUERY KELSNILHKLLEINKLILREEKDDKKV-------YLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHN : .. ..:.::: . : : .: : :: .. .: .: :. ::: : .: . :.:. . S46426 DILYHLWKELIEINLKNFNSELDLRKYISTSIKRYCINICKKKNRDKKIIYNSEVTYKKLDAVNVYSLYCDNFEFL-DLI 50 60 70 80 90 100 110 120 240 250 260 270 280 290 300 310 QUERY KVYKNIDEQ---MRKFEIFKINYISIKNHNKLNKNAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLK .. . ..: :. :: : : :.:. .:.....:: ...... S46426 SILNYKEKQIIYMKFFEGRKDNEIAIRL--RLSRQSIYKIRITSLKKLYPIVMQLVNI 130 140 150 160 170 320 330 340 350 360 370 380 390 QUERY DNILISEFYTNGKRNEKDIFSKVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSK --------------------------------------------------------------------------- >>S58691 kinesin-related polypeptides SpKRP95 - sea urchin (Strongylocentrotu (742 aa) initn: 42 init1: 42 opt: 118 Z-score: 124.1 expect() 4.4 Smith-Waterman score: 122; 22.566% identity in 226 aa overlap Entrez lookup Re-search database >S58691 67- 275: -------------------------- : 30 40 50 60 70 80 90 100 QUERY FLKKKEKRTFVLSIYAFITFIIFCIGILYFTNKSSAHNNNNNKNEHSLKKEEIELLRVLLEKYKKQKDGILNESSNEEDE .. .:.. . .: :. . .: ...:. :. ..: : S58691 EFQEEISRLKQALDKKGPSDGRKKGKKRKPGEQGGDDDIEDETEEEGDEMDEEEMYKESQQKLEEEKEKIMANQSMIA-E 360 370 380 390 400 410 420 430 110 120 130 140 150 160 170 180 QUERY EKYTLNSETYNNKNNVSNIKNDSIKSKKEEYIN-LERILLEKYKKFINENNEENRKELSNILHKLLEINKLILREEKD-- :: : ::. . ..... :. . : : :. .: :: :......::..:: .: ..:.: :::. S58691 EKQKLLSEVQKRQGEIK--KEHQQKEMLEGKIKAMESKLLVGGKSIVDHTNEQQRK---------IEEQRLLLAEEKNRE 440 450 460 470 480 490 500 190 200 210 220 230 240 250 QUERY ---DKKVYLINDNYDE-KGALEIGMNE-EMKYKKEDPI-NNIK-YASKFFKFMKEHNKVYKNID----EQMRKFEIFKI- ..:. .:. : .:.. ..: :.: :: . ... : : . .. :: . .... : .:.... :. S58691 RDMERKLKEQDDKTVEIEGTFSSLQQEVEVKTKKLKKLFAKLQSYKSDIQDLQDEHARERQELEQTQNELIRELKLKKVI 510 520 530 540 550 560 570 580 260 270 280 290 300 310 320 QUERY --NYISIKNHNKLNKNAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDI :.: .....:.. :.. ...... S58691 ADNFIPVEERTKITTRAVFDEETEEWLLTPLAKAEGPSQMAKRPVSAVGNRRPIADYARMAAQMGGNPRYKAENILSVDL 590 600 610 620 630 640 650 660 --------------------------------------------------------------------------- >>C64439 asparagine synthetase (EC 6.3.-.-) - Methanococcus jannaschii (544 aa) initn: 58 init1: 58 opt: 116 Z-score: 124.1 expect() 4.4 Smith-Waterman score: 123; 20.775% identity in 284 aa overlap Entrez lookup Re-search database >C64439 85- 342: -------------------------------- : 50 60 70 80 90 100 110 QUERY TFIIFCIGILYFTNKSSAHNNNNNKNEHSLKKEEIELLRVLLEKYKKQKDGILNESSNEEDEE---KYTLNS--ETYNNK ..: .:: . . ..: : :: .: . :: . . C64439 WHLLINIDGCERDLDELNSKIKTLKPNSQLIYYLDDNRFEIIEGFKKLELNYMKERSYEEAKEYLDRALKNSVLKRVRGL 190 200 210 220 230 240 250 260 120 130 140 150 160 170 180 190 QUERY NNVSNIKNDSIKSKKEEYINLERILLEKYKKFINENNEENRKELSNILHKL-LEINKLILREEKDDKKVYLINDNYDEKG ..:. : . .. :. .: . : .. .: :. . . : :.. : :. ::. .. :. . :: C64439 DKVGIICSGGVDSSL--IAKLASLYCEVILYAVGTENSEDLIYAERLAKDLNLKLRKKIISEEEYEEYVFKVAKAIDEVD 270 280 290 300 310 320 330 200 210 220 230 240 250 260 270 QUERY ALEIGMNEEM----KYKKEDPINNI---KYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHNKLNKNAMYKKK ..::.. . .. .:: .. . . :...: . .:...:.. :. : :..: : : . . . . . . C64439 LMKIGVGIPIYVASEMANEDGLKVVLSGQGADELFGGYARHERIYRERGEEELKKELLKDVYNLYKVNLERDDHCTMANG 340 350 360 370 380 390 400 410 280 290 300 310 320 330 QUERY VNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFE-----------NHLKD-NILISEFYTNGKRNEKDIFSK-VPEILD :. . .::. : .. . .: : ..:. : ::. :. :. . :. .:. :. .:. . C64439 VELRVPFLDEEVVEIALSI-PIEYKMSELSNRPYAESNISLKSEPINGLKNTNLNIKCVRSVRKKILRDVASQYLPDYIA 420 430 440 450 460 470 480 490 340 350 360 370 380 390 400 410 QUERY YREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKDNFGCDGGHPFYSFLYVLQNELCLGDEY :: : C64439 YRPKKAAQYGSGGEKMIYKVAKKYGFSKKRINEFLDMLKRKIVSEF 500 510 520 530 540 --------------------------------------------------------------------------- >>S21175 heat shock protein 71 - rainbow trout (651 aa) initn: 92 init1: 92 opt: 117 Z-score: 124.0 expect() 4.5 Smith-Waterman score: 117; 26.866% identity in 134 aa overlap Entrez lookup Re-search database >S21175 44- 171: ---------------- : 10 20 30 40 50 60 70 QUERY IKEMKELAFARPSLVETLNKKKKFLKKKEKRTFVLSIYAFITFIIFCIGILYFT--NKSSAHNN----NNNKNEHSLKKE .:: : ::. . .::....: .:.:.. :.:: S21175 PGVLIQVYEGERAMTKDNNLLGKFELTGIPPAPRGVPQIEVTFDIDANGIMNVSAADKSTGKENKITITNDKGR--LSKE 440 450 460 470 480 490 500 510 80 90 100 110 120 130 140 150 QUERY EIELLRVLLEKYKKQKDGILNESSNEEDEEKYTLNSETYNNKNNVSNIKNDSIKSKKEEYINLERILLEKYKKFINENNE .:: . :::: . : .. :.... :.:..: .. . ..... .: :.: : : :.: . .:. : S21175 DIERMVQEAEKYKCEDDVQRDKVSSKNSLESYAFNMKSTVEDEKLQGKISDEDKTKILEKCNEVIGWLDKNQTAEKEEYE 520 530 540 550 560 570 580 590 160 170 180 190 200 210 220 230 QUERY ENRKELSNILHKLLEINKLILREEKDDKKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYK ...::: .. . .. S21175 HHQKELEKVCNPIITKLYQGAGGMPGGMPEGMAGGFPGAGGAAPGGGGSSGPTIEEVD 600 610 620 630 640 650 --------------------------------------------------------------------------- >>F69723 trigger factor (prolyl isomerase) tig - Bacillus subtilis (424 aa) initn: 83 init1: 50 opt: 113 Z-score: 122.7 expect() 5.3 Smith-Waterman score: 114; 26.190% identity in 168 aa overlap Entrez lookup Re-search database >F69723 77- 240: ---------------------: 40 50 60 70 80 90 100 110 QUERY VLSIYAFITFIIFCIGILYFTNKSSAHNNNNNKNEHSLKKEEIELLRVLLEKYKKQ-KDGILNESSNEEDEE---KYTLN ::.: : : :: ::. ... ::.. . :: : . : F69723 PEEYHAEDLAGKPAVFKVKIHEIKAKELPELDDEFAKDIDEEVETLAELTEKTKKRLEEAKENEADAKLREELVLKASEN 220 230 240 250 260 270 280 290 120 130 140 150 160 170 180 190 QUERY SETYNNKNNVSNIKNDSIKSKKEEYINLERILLEKYKKFINENNEENRKELSNILHKLLEINKLILREEKDDKKVYLIND .: . : . . : . .. :. .... . :: : .: .... ...... .: .. : : :. ... . .. F69723 AEIDVPQAMV-DTELDRMLKEFEQRLQMQGMNLELYTQFSGQDEAALKEQMKEDAEKRVKSN-LTLEAIAKAENLEVSDE 300 310 320 330 340 350 360 370 200 210 220 230 240 250 260 270 QUERY NYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHNKLNKNAMYKKKV . : : : : ... :..::: : ::: :: : :: F69723 EVD---AELTKMAEAYNM----PVENIKQAIGSTDAMKEDLKVRKAIDFLVENR 380 390 400 410 420 280 290 300 310 320 330 340 350 QUERY NQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYREKGIVHEPKDQG --------------------------------------------------------------------------- >>S40460 ribosomal protein S3 - Chlamydomonas frankii chloroplast (fragment) (809 aa) initn: 68 init1: 41 opt: 117 Z-score: 122.6 expect() 5.4 Smith-Waterman score: 119; 20.847% identity in 307 aa overlap Entrez lookup Re-search database >S40460 5- 292:------------------------------------ : 10 20 30 40 QUERY MVAIKEMKELAFARPSLVETLNKKKKFLKKKEKRTFVLSIYAFI :. :. ..::.:.. ::::..... . .. . . S40460 ENEISTNAAKLSPVLKNKKKKKGQKLKRVKKVNIASNFINKKNKKRFISRPKLTKQEYKKKKLVQQSLRNRQIIRRW-YR 160 170 180 190 200 210 220 230 50 60 70 80 90 100 110 QUERY TFIIFCIGILYFTNKSSAHNNNNNKNEHSL-------KKEEIELLRVLLEKYKKQKDGILNESSNEEDEEKYTLNSETYN :: . .. :.:. .. ..:.. . : . . . : .: ::.:. :. .... .:.: :... S40460 QFIAKGL-LINKTGKKIKRKIATKKGKTCFGRFKKFNKVNSFSKTKNLTNKLKKKKQTPLSVATTNFSENK--TNKKSSF 240 250 260 270 280 290 300 310 120 130 140 150 160 170 180 190 QUERY NKNNVSNIKNDSIKSKKEEYI-NL---ERILLEKYKKFINEN-NEENRKELSNILHKLLEINKLILREEKDDKKVYLIND :: . : .. : ..: .. :: . :. .::.. :. :.: :.: :..:.. ....:...:.:. :: S40460 IKNPILNKQTKSKQKKGGLFVKNLGVKSAVYLKIKRKFVTLYLNKVNKKFLKN-LKELMKYWFFVFEEKNSDSKIN-INT 320 330 340 350 360 370 380 390 200 210 220 230 240 250 260 QUERY NYDEKGALEI-----GMNEEMKYKKEDPINNIKYAS--KFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHNKLNKN . . . .: :.:.. :: ..: :. :.:. ..... . :.. .: ::.. . .: . S40460 SISSLTSAKINFAPFGYNKNWDIKKLRVLKNQPLAKLKKLFEVLEKKSFI---------KLQALKQYYIGFGSLSKTQAY 400 410 420 430 440 450 460 270 280 290 300 310 320 330 340 QUERY AMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYREKGIV .... : : .. .:. . ..:. S40460 SFFQMIV--FLKQLKKLIKRRYYAVLQKYKNKNFVFKQSLNTERGSLSSKFLQQTKNASGLETNLQKSESGINKSIKQKT 470 480 490 500 510 520 530 540 --------------------------------------------------------------------------- >>S67593 transport protein USO1 - yeast (Saccharomyces cerevisiae) (1790 aa) initn: 51 init1: 51 opt: 122 Z-score: 122.5 expect() 5.4 Smith-Waterman score: 171; 25.000% identity in 248 aa overlap Entrez lookup Re-search database >S67593 65- 303: ----------------------------- : 30 40 50 60 70 80 90 100 QUERY KKFLKKKEKRTFVLSIYAFITFIIFCIGILYFTNKSSAHNNNNNKNEHSLKKEEIELLRVLLEKYKKQ---KDGILNESS :. : .::.::. : : . :.::..: :. :: S67593 LETKLETSEKALKEVKENEEHLKEEKIQLEKEATETKQQLNSLRANLESLEKEH-EDLAAQLKKYEEQIANKERQYNEEI 1110 1120 1130 1140 1150 1160 1170 110 120 130 140 150 160 170 QUERY NEEDEEKYTLN--SETYNNKNNVSNIKNDSIKSKKEEYINLERILLEKYKKFINENNEENRKELSNILH--KLLEINKLI .. ..: . . .:. ..::. . . ..:: .:: ::.. .. . :.: ...:. . ...:. : .: . . S67593 SQLNDEITSTQQENESIKKKNDELEGEVKAMKSTSEEQSNLKKSEIDALNLQIKELKKKNETNEASLLESIKSVESETVK 1180 1190 1200 1210 1220 1230 1240 1250 180 190 200 210 220 230 240 250 QUERY LREEKDDKKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMR--KFEIFKINYIS ..: .:. :. :: . :. : : : . : ::.....:: .:. ...: . :... ::. .: S67593 IKELQDEC-------NFKEKEVSEL----EDKLKASEDKN-----SKYLELQKESEKIKEELDAKTTELKIQLEKITNLS 1260 1270 1280 1290 1300 1310 1320 260 270 280 290 300 310 320 330 QUERY IKNHNKLNKNAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPE .... .. . :: .. .::.: : .:. ... :. .:: : S67593 KAKEKSESELSRLKKTSSEERKNAEEQL-EKLKNEIQIKNQAFEKERKLLNEGSSTITQEYSEKINTLEDELIRLQNENE 1330 1340 1350 1360 1370 1380 1390 1400 340 350 360 370 380 390 400 410 QUERY ILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKDNFGCDGGHPFYSFLYVLQNELCLG S67593 LKAKEIDNTRSELEKVSLSNDELLEEKQNTIKSLQDEILSYKDKITRNDEKLLSIERDNKRDLESLKEQLRAAQESKAKV 1410 1420 1430 1440 1450 1460 1470 1480 --------------------------------------------------------------------------- >>S49369 mob protein - Campylobacter coli (321 aa) initn: 64 init1: 41 opt: 111 Z-score: 122.4 expect() 5.5 Smith-Waterman score: 144; 24.561% identity in 285 aa overlap Entrez lookup Re-search database >S49369 53- 332: -----------------------------------: 20 30 40 50 60 70 80 90 QUERY ARPSLVETLNKKKKFLKKKEKRTFVLSIYAFITFIIFCIGILYFTNKSSAHNNNNNKNEHSLKKEEIELLRVLLEKYKKQ : .: .: . . :. :.. :. . ..:.: S49369 NAKGEKQTHYHAHAVFFTLDNNGLQLARREASLNKANLSKIQTLTAQSLKMERGANRYENNEKQPQYIQDYKTYAQFKEQ 40 50 60 70 80 90 100 110 100 110 120 130 140 150 160 QUERY KDGILNESSNEEDEEKYTLNSETYNNKNNVSNIKNDSIKSKKEEYINLERILLEKYKKFINE---NNEENRKELSNILHK . ..:.. .:.:.: : . ..:.. . : .:::..: . .:...: :.. ..:. :::.. ..: S49369 EKALLQRI--QEQEHKLTQMALELKKKEKEIQDKAKELKSKENEL----QAKIEQHQKHIQNLELGHERALKELTQEFEK 120 130 140 150 160 170 180 170 180 190 200 210 220 230 240 QUERY LLEINKLILREEKDDKKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMK-EHNKVYKNIDEQMRKFEI : . : :: : . :: .: :.:. :. .: ..: :.: ....: :..:: . :. :: S49369 RLSLWKNILTFGKYNAKVR--EDYQLTKNAFLISTDES---RRE--------ANKELEYLKFEYHKVKDERDNLKTLFEA 190 200 210 220 230 240 250 250 260 270 280 290 300 310 320 QUERY FKINYISIKNHNKLNKNAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKD : . ..: ...:.. . . .: . : :.::: : . .. . ::.. ::: ... :. :::.. S49369 HKTK--NVKLETRLKEIGKWCEK-----NLSVEQLKEIFPLKAERIEKEL-KYQRAFENSFEQ--------TKTKRNDRG 260 270 280 290 300 310 330 340 350 360 370 380 390 400 QUERY I-FSKVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKDNFGCDGGHPFYSFLYV . ::. S49369 FGFSR 320 --------------------------------------------------------------------------- >>S05362 probable DNA-directed DNA polymerase (EC 2.7.7.7) - fungus (Ascobolu (1202 aa) initn: 83 init1: 52 opt: 119 Z-score: 122.0 expect() 5.8 Smith-Waterman score: 119; 24.180% identity in 244 aa overlap Entrez lookup Re-search database >S05362 191- 424: ----------------------------- : 160 170 180 190 200 210 220 QUERY FINENNEENRKELSNILHKLLEINKLILREEKDDKKVYLIND-NYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFM :: : .. . : : . . :.... . .. .: : :. S05362 SYDIIGHMIINDGENVITFNRAVDNSIIKIFTVTDSMGNTNDPNLFKRIVEEKGNQTVYVYENNETVCVVQ--DKKFGFI 460 470 480 490 500 510 520 230 240 250 260 270 280 290 300 QUERY KEHNKVYKNIDEQMRKFEIFKINYISIKNHNKLNKNAM--YK-KKVNQF--SDYSEEELKEYFKTLLHVPNHMIEKYSKP .:. :: ..... . .. . ..:.: : :. ::.: : .: .::. :::::. :: . : . S05362 ---SKIAKNKTVNLKSISTLDLE-TRMDTNNRLIPICMSYYNNKKLNTFLFKDDWQEEMLAAFKTLLKSTNHGKKFYVHN 530 540 550 560 570 580 590 600 310 320 330 340 350 360 370 380 QUERY FENHLKDNILISEFYTNGKRNEKDIFSKVPEILD----YREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSF . :. :...: . : .: .. ::. . .:. .. : : . . : . :. :. ..: .::. : S05362 LA-HF-DSVFILD--TLSKLGKIDIIMRDDKIMKLKITFKIPGKNTEYSISFLDSLLMLPNSLDNLSKAFNIENKK--SV 610 620 630 640 650 660 670 390 400 410 420 430 440 450 460 QUERY SEQEVVDCSKDNFGCDGGHPFYSFLYVLQNELCLGDEYKYKAKDDMFCLNYRCKRKVSLSSIGAVKENQLILALNEVGPL . .. . :. :. : : ..: :. :.:: :: S05362 FPLKFTNGAVTPFNYIGAVPGYEYFYNTPNKKFTKDDYKKYCKDFNNNWDFNKELKNYCEIDCLALHDILTLFAKMIHNE 680 690 700 710 720 730 740 750 470 480 490 500 510 520 530 540 QUERY SVNVGVNNDFVAYSEGVYNGTCSEELNHSVLLVGYGQVEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWG S05362 FSVDITRYVSLSSITFAIFRTNFLPENKIPNITCTKLHYILKQAYTGGYCDVFKPEGKNIHSYDINSLYPSAMAKFDMPT 760 770 780 790 800 810 820 830 --------------------------------------------------------------------------- >>A49464 chromosome segregation protein SMC1 - yeast (Saccharomyces cerevisia (1225 aa) initn: 92 init1: 62 opt: 119 Z-score: 121.9 expect() 5.9 Smith-Waterman score: 132; 26.064% identity in 188 aa overlap Entrez lookup Re-search database >A49464 69- 240: ---------------------- : 30 40 50 60 70 80 90 100 QUERY KKKEKRTFVLSIYAFITFIIFCIGILYFTNKSSAHNNNNNKNEHSLKKEEIELLRVLLEKYKKQKDGILNESSNEEDEEK ..: . ... .: . :. :...::: .:: :: A49464 EMKSLEEQEYAIEMKIGSIESKLEEHKNHLDELQKKFVTKQSELNSSEDILEDMNSNLQVLKRERDGI------KEDIEK 860 870 880 890 900 910 920 930 110 120 130 140 150 160 170 QUERY YTLNSETYNNKNNVSNIK---------ND-SIKSKKEEYINLERILLEKYK----KFINENNEENRKELSNILHKLLEIN . :. : .. ..:::. .: :.: .: :.. . .:: :. ..:.. :::: . .:.. :: A49464 FDLERVTALKNCKISNINIPISSETTIDDLPISSTDNEAITISNSIDINYKGLPKKYKENNTDSARKELEQKIHEVEEI- 940 950 960 970 980 990 1000 1010 180 190 200 210 220 230 240 250 QUERY KLILREEKDDKKVYLINDNYDE-KGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVY-KNIDEQMRKFEIFKIN : : . . .. . ::: .: .:. :: . : :. : ..:.:. :...... :..: A49464 ---LNELQPNARAL---ERYDEAEGRFEVINNETEQLKAEEK----KILNQFLKIKKKRKELFEKTFDYVSDHLDAIYRE 1020 1030 1040 1050 1060 1070 1080 260 270 280 290 300 310 320 330 QUERY YISIKNHNKLNKNAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSK A49464 LTKNPNSNVELAGGNASLTIEDEDEPFNAGIKYHATPPLKRFKDMEYLSGGEKTVAALALLFAINSYQPSPFFVLDEVDA 1090 1100 1110 1120 1130 1140 1150 1160 --------------------------------------------------------------------------- >>S20614 hypothetical protein 1738 - beechdrops plastid (1738 aa) initn: 62 init1: 54 opt: 121 Z-score: 121.7 expect() 6 Smith-Waterman score: 140; 23.116% identity in 398 aa overlap Entrez lookup Re-search database >S20614 37- 395: --------------------------------------------- : 10 20 30 40 50 60 70 QUERY MVAIKEMKELAFARPSLVETLNKKKKFLKKKEKRTFVLSIYAFITFIIFCIGILYFTNKSSAHNNNNNKNEHSLKK ..:: ::: . . .: . :. ... .:...... S20614 LIGHILLIKWLGLVLVWIRQNNYIRSNKYLVSEFINYMARIFSILLFITCVYY-LGRIPSPFFSTKLTETLKKEQQKVED 180 190 200 210 220 230 240 250 80 90 100 110 120 130 140 150 QUERY EEIELLRVL----LEKYKKQKDGILNESSNEEDEEKYTLNSETYNNKNNVSNIKNDSIKSKKEEYINLERILLEKYKKF- :.:: . . ::: .....: .. .:: .:: ::: .... .. ...::.: .: :::. S20614 EDIETVNQMPPLGLEKREQEQEGSNIKDFYYFSEEMVNLN----NNKIDIDETEKKLVNGKKDELYPRLKITETGYKKIP 260 270 280 290 300 310 320 330 160 170 180 190 200 QUERY ---------INEN--NEE----NRK-ELSNILHKLLEINKLILREEKDDKKVYLINDNYDEKGALEIGMNE------EMK :::: : . :.: : .:.: : . .::. ... .. :.... : :: . :.. : S20614 IFEESSIININENPYNSRFKILNKKFEKKNLLIKEKLFVNLIFDKNRWNRPFRYIKNTHFE-GATRNEMSQFFFNICEND 340 350 360 370 380 390 400 410 210 220 230 240 250 260 270 280 QUERY YKKEDPINNIKYASKFFKFMKEH---NKVYKNIDEQMRKFEIFKINYISIKNHNK--LNKNAMYKKKVNQFSDYSEEELK :.. .. .. : :....:.. .:. : . ... .: .. : :::: . .:. :: : .:. : . S20614 GKERISFTYLSSLSFFLEMIKKKINSHKLEKALTNKLSNFWLYT-NKKSIKNFTDEFINRIEALDKK---FISYNILETR 420 430 440 450 460 470 480 290 300 310 320 330 340 350 360 QUERY EYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYREKGIVHEPKDQGLCGS---CWAFA : : :. : :: .. :. . . .: : . :..:.: : . :: . .:. : : : S20614 ----TRLCNDNYTKEYLSKKYDPLLNRSYQKT-IYINLST---PILKKTPTINLIDNFGI---NRIHGILLSYTDCQEFE 490 500 510 520 530 540 550 370 380 390 400 410 420 430 QUERY SVGNIESVFAKKNK----NILSFSEQEVVDCSKDNFGCDGGHPFYSFLYVLQNELCLGDEYKYKAKDDMFCLNYRCKRKV . .:. . :. . ..:.: . :.. . :...: S20614 Q--KIKRFYKKQLSTEIVDLLTFISKVVIELGPDSLNCKIFSDVKIIKKFLLYLLTQIVTNVNDPKIIKKSTRIKKICKK 560 570 580 590 600 610 620 630 --------------------------------------------------------------------------- >>A54639 parasitophorous vacuole antigen p126 - Plasmodium falciparum (isolat (427 aa) initn: 129 init1: 64 opt: 112 Z-score: 121.6 expect() 6.1 Smith-Waterman score: 215; 23.206% identity in 418 aa overlap Entrez lookup Re-search database >A54639 34- 404: ----------------------------------------------: 10 20 30 40 50 60 70 QUERY MVAIKEMKELAFARPSLVETLNKKKKFLKKKEKRTFVLSIYAFITFIIFCIGILYFTNKSSAHNNNNNKNEHS ..: : .: . .:. . : .:....::. . .. A54639 FRSSSSSSSSSSSSESLPANGPDSPTVKPPRNLQNICETGKNFKLVVYIKENTLILKWKV-YGETKDTTENNKVDVRKYL 10 20 30 40 50 60 70 80 80 90 100 110 120 130 140 QUERY LKKEEIELLRVLLEKYKKQKDGILNESSN-----EEDEEKYTLNSETYNNKN-NVSNIKNDSIKSKKEE-----YINLER ....: . .:.. ::... : ::.: . :. :: :. . . : :. . . .. .::. : : . A54639 INEKETPFTNILIHAYKEHNGTNLIESKNYAIGSDIPEKCDTLASNCFLSGNFNIEKCFQCALLVEKENKNDVCYKYLSE 90 100 110 120 130 140 150 160 150 160 170 180 190 200 QUERY ILLEKYKKFINE---NNEENRKE------LSNILHKLLEINK------LILREEKDDK-KVYLIN-----DNYDEKGALE .. :.:.. : ..:.. : ..::: :... :. :: :: ::. :. :.: . : :.:. A54639 DIVSKFKEIKAETEDDDEDDYTEYKLTESIDNILVKMFKTNENNDKSELIKLEEVDDSLKLELMNYCSLLKDVDTTGTLD 170 180 190 200 210 220 230 240 210 220 230 240 250 260 270 QUERY -IGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHNK--LNKNAMYKKKVNQ---- ::..:: : .::.: ... :.. .::. :: . . .:: . .:: .. ..: A54639 NYGMGNEM-----DIFNNLK------RLLIYHSE--ENINTLKNKF---RNAAVCLKNVDDWIVNKRGLVLPELNYDLEY 250 260 270 280 290 300 280 290 300 310 320 330 340 QUERY FSDY------SEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYREKGIVHEP :... : :. . : ..:: . . .. . ..: .::.. .. : : ..:.. .:.. . A54639 FNEHLYNDKNSPEDKDNKGKGVVHVDTTLEKEDTLSYDN--SDNMFCNKEYCNRLKDENNCISNL-------------QV 310 320 330 340 350 360 350 360 370 380 390 400 410 420 QUERY KDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKDNFG--CDGGHPFYSFLYVLQNELCLGDEYKYKAKDDM .::: : . : ::: ..:.. :. . ..: :..: : . :: : . : A54639 EDQGNCDTSWIFASKYHLETIRCMKGYEPTKISALYVANCYKGEHKDRCDEGSSPMEF 370 380 390 400 410 420 430 440 450 460 470 480 490 500 QUERY FCLNYRCKRKVSLSSIGAVKENQLILALNEVGPLSVNVGVNNDFVAYSEGVYNGTCSEELNHSVLLVGYGQVEKTKLNYN --------------------------------------------------------------------------- >>C56657 PfEMP2/MESA (clone 9025/60) - Plasmodium falciparum (fragment) (230 aa) initn: 88 init1: 61 opt: 108 Z-score: 121.5 expect() 6.2 Smith-Waterman score: 109; 23.153% identity in 203 aa overlap Entrez lookup Re-search database >C56657 87- 285: ------------------------ : 50 60 70 80 90 100 110 120 QUERY IIFCIGILYFTNKSSAHNNNNNKNEHSLKKEEIELLRVLLEKYKKQKDGILNESSNEEDEEKYTLNSETYNNKNNVSNIK :: .::: .:.:...:. .:: .. ..: :. C56657 KDKVLGEGDKEDVKEKNDEQKDKVLGEGDKEDVKEK----NDGKKDKVIGSEKT 10 20 30 40 50 130 140 150 160 170 180 190 200 QUERY NDSIKSKKEEYINLERILLEKYKKFINENNEENRKELSNILHKLLEINKLILREEKDDKKVYLINDNYDEKGALEIG--M . :: : :. . .. .: :: :.::. :. .... . :... : .. .: : . : :: : . C56657 QKEIKEKVEKRV--KKKCKKKVKKGIKENDTEGNDKVKGPEIIIEEVKEEIKKQVEDGIKENDTEGNDKVKGPEIITEEV 60 70 80 90 100 110 120 210 220 230 240 250 260 270 280 QUERY NEEMKYKKEDPI--NNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHNKLNKNAMYKKKVNQFSDYSEEE .::.: . :. : :. . .: .: . . ... :...: .. . .::... .:. . ... ... .: C56657 KEEIKKQVEEGIKENDTEGNDK----VKGPEIITEEVKEEIKK-QVEE----GIKENDTESKDKVIGQEI--ITEEVKEG 130 140 150 160 170 180 190 290 300 310 320 330 340 350 360 QUERY LKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYREKGIVHEPKDQGLCGSCWAFAS .:: C56657 IKENDTENKDKVIGQEIITEEVKEGIKENDTEN 200 210 220 230 --------------------------------------------------------------------------- >>D64332 hypothetical protein MJ0259 - Methanococcus jannaschii (202 aa) initn: 59 init1: 59 opt: 107 Z-score: 121.3 expect() 6.3 Smith-Waterman score: 108; 25.234% identity in 214 aa overlap Entrez lookup Re-search database >D64332 47- 250: -------------------------: 10 20 30 40 50 60 70 80 QUERY MKELAFARPSLVETLNKKKKFLKKKEKRTFVLSIYAFITFIIFCIGILYFTNKSSAHNNNNNKNEHSLKKEEIELLRVLL ... :::. . .. : :::: :. .::: .:. D64332 MKKLLLIIGIISLMTSMSMCLNNNNLNNLDLKKS------ILV 10 20 30 90 100 110 120 130 140 150 QUERY EKYKKQKDGILNESSNEEDEEKY--TLNSETYN--NKNNVSNIKNDSIKSKKEEYINLERIL--LEKYKKFINENN--EE : . : . .: : : : . : :: ... . ::.: : :: ... .. :: ...: .: : D64332 EVNGTPIEIPLRATVGEAKEVKLINTTDREIYNYYHSKILIYIKGDMNISVKEGGVSIVDLVTKLEWFNQFYPHNIVVEL 40 50 60 70 80 90 100 110 160 170 180 190 200 210 220 230 QUERY NRKELSNILHKLLEINKLILREEKDDKKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKN :: . . ..... .: . : : ... ::...: . ..:: .: .. :..:. . :. : :.. :. D64332 NRTNSTVTVKSIFANGKTSITELKVNESEYLMHNN--KTMVIEI-----LKTHNTATITKINNT-----FIIEGNSL-KE 120 130 140 150 160 170 180 240 250 260 270 280 290 300 310 QUERY IDEQMRKF--EIFKINYISIKNHNKLNKNAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILIS .:. .: ..:: D64332 LDNAETRFVIDMFKGSIT 190 200 --------------------------------------------------------------------------- >>D64245 peripheral membrane protein B homolog - Mycoplasma genitalium (SGC3) (329 aa) initn: 88 init1: 61 opt: 110 Z-score: 121.2 expect() 6.4 Smith-Waterman score: 110; 31.683% identity in 101 aa overlap Entrez lookup Re-search database >D64245 119- 216: ------------ : 80 90 100 110 120 130 140 150 QUERY IELLRVLLEKYKKQKDGILNESSNEEDEEKYTLNSETYNNKNNVSNIKNDSIKSKK-EEYINLERILLEKYKKFINE-NN :: . :: ..: .: ..: .: . ...::: :. :: D64245 MEKNIKALWKNFQLKLEKIKHYRKLYEQQIKEYKKKITGLNN 10 20 30 40 160 170 180 190 200 210 220 230 QUERY EENRKELSNILHKLLEINKLI-LREEKDDKKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKV : . .:.: : ... .:.:: ... ::. .:. ..:::...:: : .. :.:. . D64245 ETDANEISRIKNEIEILNRLIKIKNTKDN----VIKKDFDEKNVFEI-RNFNFWYNKNKQVLFDINLDIKRNKITALIGK 50 60 70 80 90 100 110 240 250 260 270 280 290 300 310 QUERY YKNIDEQMRKFEIFKINYISIKNHNKLNKNAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILI D64245 SGCGKSTFIRCLNKLNDLNENTRWTGDIYFLGKNINSGIINDLTLRTSVGMVFQKLTPFNFSIFENIAYGIRAHGIHNKN 120 130 140 150 160 170 180 190 --------------------------------------------------------------------------- >>F64639 hypothetical protein HP0958 - Helicobacter pylori (strain 26695) (254 aa) initn: 68 init1: 68 opt: 108 Z-score: 120.9 expect() 6.7 Smith-Waterman score: 108; 26.087% identity in 138 aa overlap Entrez lookup Re-search database >F64639 53- 188: ----------------- : 20 30 40 50 60 70 80 90 QUERY ARPSLVETLNKKKKFLKKKEKRTFVLSIYAFITFIIFCIGILYFTNKSSAHNNNNNKNEHSLKKEEIELLRVLLEKYKKQ :: . ... : . . .:::..:. . .. . . . . F64639 KQLIEISHLDKEIDSLEPLIREKRKDLDKALNDKEAKNKAILNLEEEKLALKLQVSKNEQTLQDTNTKIASIQKKMSEIK 10 20 30 40 50 60 70 80 100 110 120 130 140 150 160 170 QUERY KDGILNESSNEEDEEKYTLNSETYNNKNNVSNIKNDSIKSK--KEEYINLERILLEKYKKFINENNEENRKELSNILHKL .. : . ::: : :. . . .: ..::. : :.. :.:...::. :... .... ::. .: :: ..:. : F64639 SERELRSLNIEEDIAKERSNQANREIENLQNEIKHKSEKQEDLKKEMLELEK-LVQQLESLV-ENEVKNIKETQQIIFKK 90 100 110 120 130 140 150 160 180 190 200 210 220 230 240 250 QUERY LEINKLILREEKDDKKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFK : .:. :: . :.: F64639 KE--ELV---EKTEPKIYSFYERIRRWAKNTSIVTIKKQACGGCFIRLNDKIYTEVLTSGDMITCPYCGRILYAEGAYEN 170 180 190 200 210 220 230 --------------------------------------------------------------------------- >>A57681 hypothetical protein - Mycoplasma capricolum (SGC3) (655 aa) initn: 64 init1: 64 opt: 114 Z-score: 120.9 expect() 6.7 Smith-Waterman score: 140; 22.874% identity in 341 aa overlap Entrez lookup Re-search database >A57681 4- 338:------------------------------------------ : 10 20 30 40 QUERY MVAIKEMKELAFARPSLVETLNKK--KKFLKKKEKRTF--VLSIYA :...:.: : :. . ::: .:..: :: : . .: A57681 MKRTIKYLSFLGLIPFLSITTISCVKQAKENNNKNQLISQFKQLIFILNSF-DLDNKKLESKIIKAIEKSDFNKISNINL 10 20 30 40 50 60 70 50 60 70 80 90 100 110 120 QUERY FITFIIFCIGILYFTNKSSAHNNNNNKNEHSLKKEEIELLRVLLEKYKKQKDGILNESSNEEDEEKYTLNSETYNNKNNV .: : : : . .. . :.: . : : ...: . : . . : ..:. :...: : : ... .::.:. A57681 ELT-IKFLTRIKNELETKTISQLNKNDKLDILTKIKVHLGSLNLIELVNIVDELVNKL-NQKEEIKNTHKDKIEKNKDNI 80 90 100 110 120 130 140 150 130 140 150 160 170 180 190 200 QUERY SNIKNDSIKSKKEEYINLERILLEKYKKFINENNEENRKELSNILHKLLEINKLILREEKDDKKVYLINDNYDEKGALEI .: ..... . .:: .. . :.: . . :: ::: . .. . :: :: :.... A57681 EDIDDSKLEILESKYIPNQHNYPDYVKNFKTVSAEEIYKELYDRTFSIKFLVKL-----KDGG---LLSNG--------T 160 170 180 190 200 210 220 210 220 230 240 250 260 270 280 QUERY GMNEEMKYKKEDPINNIKY--ASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHNKLNKNAMYKKKVNQFSDYSE : . . :.: . :. :. :... . :.. ::: ..: :: . . ::. .. :... .:.:. A57681 GTGWLLDYHKYSNTNKYKMFIATNLHVLADFSNSL---TDEQNKEF-----NYYD-PSGNKVIGLGL--GKADNVTDFSR 230 240 250 260 270 280 290 290 300 310 320 330 340 350 360 QUERY EELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYREKGIVHEPKDQGLCGSCWAF .. . :. .. :.... .. :::.::... . ...: . : .:. : . : A57681 KNNNS--KSENNIANYYLN--NQDFENYLKNDFWSVNKFSKGISEPKIVFGAVDFMKDRAIKNHYEALQKEAINYYNYKK 300 310 320 330 340 350 360 370 380 390 400 410 420 430 440 QUERY ASVGNIESVFAKKNKNILSFSEQEVVDCSKDNFGCDGGHPFYSFLYVLQNELCLGDEYKYKAKDDMFCLNYRCKRKVSLS A57681 NNNEINDDNKIAWNNFLNNKDIPIMIDFAVFEFDVDLDLVDYNLKSWISNAISGLDNYLDRLNKAPILPNQDKKISKYLQ 370 380 390 400 410 420 430 440 --------------------------------------------------------------------------- >>S12319 pre-mRNA splicing factor PRP6 - yeast (Saccharomyces cerevisiae) (899 aa) initn: 55 init1: 55 opt: 116 Z-score: 120.9 expect() 6.7 Smith-Waterman score: 116; 22.381% identity in 210 aa overlap Entrez lookup Re-search database >S12319 91- 291: ------------------------- : 60 70 80 90 100 110 120 130 QUERY IGILYFTNKSSAHNNNNNKNEHSLKKEEIELLRVLLEKYKKQKDGILNESSNEEDEEKYTLNSETYNNKNNVSNIKNDSI :.:. . :...... ..: :. ::. . :. :.: S12319 MERPSFLDQEPPAGYVPGIGRGATGFSTKEKQVVSNDDKGRRIPKRYR---ENLNNHLQ-SQPKDDED 10 20 30 40 50 60 140 150 160 170 180 190 200 QUERY KSKKEEYINLERILLEKYKKFINENNEENRKELSNILHKLLEINK-LILREEKDDKKVYLIND--NYDEKGALEIGMNEE . . .:: : .: :: ::....: . ::. ... .... : :.. . .: .... .. .:.. S12319 DEAANVFKTLELKLAQKKKKRANEKDDDNSVDSSNVKRQFADLKESLAAVTESEWMDIPDATDFTRRNKRNRIQEQLNRK 70 80 90 100 110 120 130 140 210 220 230 240 250 260 270 280 QUERY MKYKKEDPINNIKYASKFFKFMKEHNKVYKN-IDEQMRKFEIFKINYISIKNHNKLNKNAMYKK-----KVNQFSDYSEE : : . : . :. .:..:. .. :::.. .. : :.... : . : : .::..:: . : S12319 T-YAAPDSL--IPGNVDLNKLTEEREKLLQSQIDENLAQLTKNASNPIQVNKPNAATDALSYLKDLENDRVNSLSDATLE 150 160 170 180 190 200 210 220 290 300 310 320 330 340 350 360 QUERY ELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYREKGIVHEPKDQGLCGSCWAFA .:... .:.:. S12319 DLQKM-RTILKSYRKADPTNPQGWIASARLEEKARKFSVAKKIIENGCQECPRSSDIWLENIRLHESDVHYCKTLVATAI 230 240 250 260 270 280 290 300 --------------------------------------------------------------------------- >>S70790 lipA protein - Mycoplasma pulmonis (SGC3) (fragment) (261 aa) initn: 80 init1: 53 opt: 108 Z-score: 120.7 expect() 6.8 Smith-Waterman score: 108; 22.628% identity in 137 aa overlap Entrez lookup Re-search database >S70790 58- 186: ---------------- : 20 30 40 50 60 70 80 90 QUERY VETLNKKKKFLKKKEKRTFVLSIYAFITFIIFCIGILYFTNKSSAHNNNNNKNEHSLKKEEIELLRVLLEKYKKQKDGIL ::.:... .:.: .. ... : . . .: .. : S70790 HNVAKKEDKTQSDSSNLSNKTNKSDPNDHLKDKDKNVSQDNKDSTNKAVSNENSQTQSQKTNESSQNTKDDSSKTSNLIT 30 40 50 60 70 80 90 100 100 110 120 130 140 150 160 QUERY NESSNEEDEEKYTLNSETYNNKN-NVSNIKNDSIKSKKEEYINL-----ERILLEKYKK--FINENNEENRKELSNILHK ...:. . . : :... ...: .. :.. ..:..: :.: ..:.. .: . .....:. :. ::. : S70790 DQNSSSNTKSKIQENKQAQKDQNTSAVNVSALEKQTKNDENISLVNSKDTNVILKNDEKVALAKDDSKEKSKNSSNLNLK 110 120 130 140 150 160 170 180 170 180 190 200 210 220 230 240 QUERY LLEINKLILREEKDDKKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIF :. : :::::. S70790 TPVENRQNKNEVKDDKKALQWWQKLNESASILESFSYDQTSLSLTFKEGMPLGLEVVLKLENLDSHEEKEIS 190 200 210 220 230 240 250 260 --------------------------------------------------------------------------- >>JQ0647 Div protein - Bacillus subtilis (841 aa) initn: 100 init1: 47 opt: 115 Z-score: 120.3 expect() 7.2 Smith-Waterman score: 131; 23.832% identity in 214 aa overlap Entrez lookup Re-search database >JQ0647 125- 326: ------------------------- : 90 100 110 120 130 140 150 160 QUERY LLEKYKKQKDGILNESSNEEDEEKYTLNSETYNNKNNVSNIKNDSIKSKKE--EYINLERILLEK-YKKFINENNEENRK ...... :.:. .: .. : : ::. .. . :: . JQ0647 LMRRFGAERTMAMLDRFGMDDSTPIQSKMVSRAVESSQKRVEGNNFDSRKQLLQYDDVLRQQREVIYKQRFEVIDSENLR 550 560 570 580 590 600 610 620 170 180 190 200 210 220 230 QUERY EL-SNILHKLLE--INKLILREE-----KDDKKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHN :. :.... :: : ::: : : : ::: .: ..:::: . .. :. : . .. . . :. .... JQ0647 EIVENMIKSSLERAIAAYTPREELPEEWKLDGLVDLINTTYLDEGALE---KSDIFGKEPDEMLELIMDRIITKYNEKEE 630 640 650 660 670 680 690 700 240 250 260 270 280 290 300 310 QUERY KVYKNIDEQMRKFE-IFKINYISIKNHNKLNKNAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDN . : ::::.:: .. . .. : .... . .. .. . . . :.:: . . .::::. .:... JQ0647 QFGK---EQMREFEKVIVLRAVDSKWMDHIDAMDQLRQGIHLRAYAQTNPLREYQMEGFAMFEHMIES----IEDEVAKF 710 720 730 740 750 760 770 320 330 340 350 360 370 380 390 QUERY ILISEFYTNGKRNEKDIFSKVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKDN .. .:. .: .:.: JQ0647 VMKAEIENNLEREEVVQGQTTAHQPQEGDDNKKAKKAPVRKVVDIGRNAPCHCGSGKKYKNCCGRTE 780 790 800 810 820 830 840 --------------------------------------------------------------------------- >>F69704 preprotein translocase subunit secA - Bacillus subtilis (841 aa) initn: 100 init1: 47 opt: 115 Z-score: 120.3 expect() 7.2 Smith-Waterman score: 131; 23.832% identity in 214 aa overlap Entrez lookup Re-search database >F69704 125- 326: ------------------------- : 90 100 110 120 130 140 150 160 QUERY LLEKYKKQKDGILNESSNEEDEEKYTLNSETYNNKNNVSNIKNDSIKSKKE--EYINLERILLEK-YKKFINENNEENRK ...... :.:. .: .. : : ::. .. . :: . F69704 LMRRFGAERTMAMLDRFGMDDSTPIQSKMVSRAVESSQKRVEGNNFDSRKQLLQYDDVLRQQREVIYKQRFEVIDSENLR 550 560 570 580 590 600 610 620 170 180 190 200 210 220 230 QUERY EL-SNILHKLLE--INKLILREE-----KDDKKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHN :. :.... :: : ::: : : : ::: .: ..:::: . .. :. : . .. . . :. .... F69704 EIVENMIKSSLERAIAAYTPREELPEEWKLDGLVDLINTTYLDEGALE---KSDIFGKEPDEMLELIMDRIITKYNEKEE 630 640 650 660 670 680 690 700 240 250 260 270 280 290 300 310 QUERY KVYKNIDEQMRKFE-IFKINYISIKNHNKLNKNAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDN . : ::::.:: .. . .. : .... . .. .. . . . :.:: . . .::::. .:... F69704 QFGK---EQMREFEKVIVLRAVDSKWMDHIDAMDQLRQGIHLRAYAQTNPLREYQMEGFAMFEHMIES----IEDEVAKF 710 720 730 740 750 760 770 320 330 340 350 360 370 380 390 QUERY ILISEFYTNGKRNEKDIFSKVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKDN .. .:. .: .:.: F69704 VMKAEIENNLEREEVVQGQTTAHQPQEGDDNKKAKKAPVRKVVDIGRNAPCHCGSGKKYKNCCGRTE 780 790 800 810 820 830 840 --------------------------------------------------------------------------- >>A56157 chromosome segregation protein SMC2 - yeast (Saccharomyces cerevisia (1170 aa) initn: 69 init1: 46 opt: 117 Z-score: 120.2 expect() 7.3 Smith-Waterman score: 142; 20.890% identity in 292 aa overlap Entrez lookup Re-search database >A56157 65- 339: ---------------------------------- : 30 40 50 60 70 80 90 100 QUERY KKFLKKKEKRTFVLSIYAFITFIIFCIGILYFTNKSSAHNNNNNKNEHSLKK--EEIELLRVLLEKYKKQKDGILNESSN :. . .. :::: ::. .. ...: ..: . ::: .. A56157 SDLNLSLHKLDLAKRNLDANPSSQIIARNEEILRDIGECENEIKTKQMSLKKCQEEVSTIEKDMKEYDSDKGSKLNELKK 720 730 740 750 760 770 780 790 110 120 130 140 150 160 170 QUERY E------EDEEKYTLNSETYNNKNNVSNIKNDSIKSKKEEYINLERILLEKYKKFINENNEENRKELSNILHKLLEINKL : : ::. . . . :. .:. ........:. .. .. ::... : :. . :: ..: . .: . . A56157 ELKLLAKELEEQESESERKYDLFQNL-ELETEQLSSE----LDSNKTLLHNHLKSIESLKLENSDLEGKI--RGVEDDLV 800 810 820 830 840 850 860 870 180 190 200 210 220 230 240 250 QUERY ILREEKDDKKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISI .. : ...: :. : :: . :: ..... :: . .. : . . :. .. :.. : :.. .: :... .. . A56157 TVQTELNEEKKRLM-DIDDELNELETLIKKKQDEKKSSELELQKLVHDLNKYKSNTNNMEKIIEDLRQKHEFLE-DFDLV 880 890 900 910 920 930 940 260 270 280 290 300 310 320 QUERY KNHNKLNKNA---MYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHL----KDNILISEFYTNGKRNEKDI .: : :.. :... .:... .: :. ..... .. .:: ... . ::.. :.: .. .. ... A56157 RNIVKQNEGIDLDTYRERSKQLNEKFQELRKKVNPNIMNMIEN-VEKKEAALKTMIKTIEKDKMKIQETISKLNEYKRET 950 960 970 980 990 1000 1010 1020 330 340 350 360 370 380 390 400 QUERY FSKVPE--ILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKDNFGCDGGHPFYSFLYV . :. : ::. A56157 LVKTWEKVTLDFGNIFADLLPNSFAKLVPCEGKDVTQGLEVKVKLGNIWKESLIELSGGQRSLIALSLIMALLQFRPAPM 1030 1040 1050 1060 1070 1080 1090 1100 --------------------------------------------------------------------------- >>B64136 molybdenum cofactor biosynthesis protein A - Haemophilus influenzae (337 aa) initn: 67 init1: 44 opt: 109 Z-score: 120.1 expect() 7.4 Smith-Waterman score: 109; 24.800% identity in 125 aa overlap Entrez lookup Re-search database >B64136 240- 357: --------------- : 200 210 220 230 240 250 260 270 QUERY LEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFE-IFKINYISIKNHNKLNKNAMYKKKVNQFSDY :. :: .. :...: ..: .. : :: . :. .:: . B64136 GYRMAKDVADWKKAGITSINVSVDSLDPKMFHQITGINKFDDVMRGIDRAFEVGYNKVKVNSVLMKN-LNDKEFEQFLAW 110 120 130 140 150 160 170 180 280 290 300 310 320 330 340 350 QUERY -SEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNG----KRNEKDIFSKVPEILDYR-EKGIVHEPKDQG ... .. : :... . .... : ::. ..: ... :: .... : .:: :: : :.. : ... B64136 VKDRPIQMRFIELMQTGE--MDSFFDKF--HLSGQVLADKLLKNGWTLQHKSHTDGPAKVFTHPDYAGEIGLIM-PYEKN 190 200 210 220 230 240 250 260 360 370 380 390 400 410 420 430 QUERY LCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKDNFGCDGGHPFYSFLYVLQNELCLGDEYKYKAKDDMFCLNYR .:.:: B64136 FCASCNRLRVSAKGKLHLCLFGEEGIELRDLLQSHEQQGILQARIFAALQGKREHHYLHIGDTGVRNHLASIGG 270 280 290 300 310 320 330 --------------------------------------------------------------------------- >>A46194 high-molecular-weight neurofilament protein NF-220 - Squid (1200 aa) initn: 94 init1: 64 opt: 117 Z-score: 120.0 expect() 7.5 Smith-Waterman score: 118; 21.091% identity in 275 aa overlap Entrez lookup Re-search database >A46194 78- 332: -------------------------------- : 40 50 60 70 80 90 100 110 QUERY LSIYAFITFIIFCIGILYFTNKSSAHNNNNNKNEHSLKKEEIELLRVL---LEKYKKQKDGILNESSNEEDEEKYTLNSE :: .:: ::: : ....:: . ..: .. .. ::.: A46194 TELIDQLERQQKDLEESRTYHQIDQEQIARQNQQLADLEGEISMLRRSIESLEKEKMRQSNILAKMNDEMEKMRMDLNNE 180 190 200 210 220 230 240 250 120 130 140 150 160 170 180 190 QUERY TYN--NKNNVSNIKNDSIKSKKEEYIN-LERILLEKYKKFINENNEENRKELSNILHKLLEINKLILREEKDDKKVYLIN : : . .: . .. .. .:. . . :... :. :: : :.::.. .. . . . . : ..: : A46194 TINHLDAENRRQTLEEELEFQKDVHAQELKELAALAYRDTTAENREFWRNELAQAIRDIQQEYDAKCDQMRGDIEAYY-N 260 270 280 290 300 310 320 330 200 210 220 230 240 250 260 QUERY DNYDE------KGALEIGMNEEMKYKKEDPINNIK-----YASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHN . .: : .:. :.: . : .. ...:. .. .. . .. . ....:. :. :. . .: .. A46194 LKVQEFRTGATKQNMEVTRNKEENTKLKSNMTEIRNRLADLEARNAQLERTNQDLLRDLEEKDRQNELESCQYKEEITKL 340 350 360 370 380 390 400 410 270 280 290 300 310 320 330 QUERY KLNKNAMYKKKVNQFSDYS---EEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEIL . . ... :. .... : . : :. : : ::. . :: :. .. . :::: : :: A46194 RGEMESILKE-LQDLMDIKLSLELEIAAYRK-LLEGEESRIELVHFPMTIGTREAYRPELIKTNGKSASDDDSSKDGTVR 420 430 440 450 460 470 480 340 350 360 370 380 390 400 410 QUERY DYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKDNFGCDGGHPFYSFLYVLQNELCLGDE A46194 AKSVSPDVVAETKLTTSTSYCGGDEADDEGKDSDSDTHTEAEAEETRADSDADTGTGLDEVKEESVLKSEEKDKSVKRDD 490 500 510 520 530 540 550 560 --------------------------------------------------------------------------- >>F64501 hypothetical protein MJ1615 - Methanococcus jannaschii (255 aa) initn: 67 init1: 50 opt: 107 Z-score: 119.8 expect() 7.7 Smith-Waterman score: 107; 20.270% identity in 222 aa overlap Entrez lookup Re-search database >F64501 133- 345: --------------------------: 100 110 120 130 140 150 160 170 QUERY KDGILNESSNEEDEEKYTLNSETYNNKNNVSNIKNDSIKSKKEEYINLERILLEKYKKFINENNEENRKELSNILHKLLE : : . ..:..: : :. . :. . ... .:... F64501 MIDKSSEIARFSGKGILITPKTLEKPLLKWEKLEIILYKDKIVFEFVDKTIEVGVEDIEDVGAELPKKVID 10 20 30 40 50 60 70 180 190 200 210 220 230 240 QUERY INKLILREEKDDKKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFM--KEHNKVYKNIDE--QMRKFEI : : :.. ... . . .. : . .:. : . . ::.: . :.: .. :.. :. : : . :.: F64501 IAKSTLEDITYHSSIIIKSKEF---GNVMVGFAPETSIYGKAPIDN--FLRKLFYILLNKKEVKILYNAGENSENTKWEN 80 90 100 110 120 130 140 250 260 270 280 290 300 310 320 QUERY FKINYISIKNHNKLNKNAMYKKKVNQFSDYSEEELKEYFKTL--LHVPNHMIEKYSKPFEN--HLKDNI-LISEFYTNGK ...:. . .. : . :. : .. : . .. . :... ... .. .. .: . ..::. .:: .::. : F64501 GFLTFIKKRIKDGLVTKIEYRLVV-EILDNEDSKIYDIFSNIKDVEIEEKDVDGEIEPVLKILQVKDGKDIISYLYTKDK 150 160 170 180 190 200 210 220 330 340 350 360 370 380 390 400 QUERY RNEKDIFSKVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKDNFGCDGGHPFYS . . :. . .:::. ::.. F64501 KVRLFILRYMVILLDYKYIGILRYLQETVE 230 240 250 --------------------------------------------------------------------------- >>F32946 cysteine proteinase (EC 3.4.22.-) - Caenorhabditis elegans (fragment (53 aa) initn: 105 init1: 80 opt: 97 Z-score: 119.8 expect() 7.7 Smith-Waterman score: 97; 30.189% identity in 53 aa overlap Entrez lookup Re-search database >F32946 351- 399: ------: 320 330 340 350 360 370 380 QUERY DNILISEFYTNGKRNEKDIFSKVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIE--SVFAKKNKNILSFSEQEVVDC :: :::::::... : . .:... . . ... : F32946 QGQCGSCWAFSTAEVISDGTCMASNGTQQPIICPTDLLTC 10 20 30 40 390 400 410 420 430 440 450 460 QUERY SKD--NFGCDGGHPFYSFLYVLQNELCLGDEYKYKAKDDMFCLNYRCKRKVSLSSIGAVKENQLILALNEVGPLSVNVGV . . ::.::. F32946 CWNVCGEGCNGGY 50 --------------------------------------------------------------------------- >>S77691 probable finger protein YBR267w - yeast (Saccharomyces cerevisiae) (393 aa) initn: 76 init1: 52 opt: 109 Z-score: 119.1 expect() 8.4 Smith-Waterman score: 130; 21.091% identity in 275 aa overlap Entrez lookup Re-search database >S77691 50- 319: --------------------------------- : 10 20 30 40 50 60 70 80 QUERY LAFARPSLVETLNKKKKFLKKKEKRTFVLSIYAFITFIIFCIGILYFTNKSSAHNNNNNKNEHSLKKEEIELLRVLLEKY :. . .... :: ... ....::.. .: . .: . S77691 MSSSGVYTCNSCVLTFDSSDEQRAHMKSD-WHRYNLKRRVAQLPPISFETF 10 20 30 40 50 90 100 110 120 130 140 150 160 QUERY KKQKDGILNESSNEEDEEKYTLNSETYNNKNNVSNIKNDSIKSKKEEYINLERI-LLEKYKKFINENNEENRKELSNILH .. .. .:. ..:: . .:.... ..... ::.. ... : .::...: ..: : .::.. S77691 DSKVSAAAASTSKSAEKEKPV-------TKKELKRREKQALLEKKKKLLEIARANMLENMQK----SQEGNTPDLSKLSL 60 70 80 90 100 110 170 180 190 200 210 220 230 240 QUERY KLLEINKLILREEKDDKKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEI . : :: ::.. : .. .:. : .. :.:... . . :... . :::: .:...:.... . S77691 QENEENK-----EKEEPKKEEPEQLTEEEMAERV-MQENVRNRVDIPLEQCLFC--------EHNKHFKDVEENLEH--M 120 130 140 150 160 170 180 250 260 270 280 290 300 310 320 QUERY FKINYISIKNHNKL-NKNAMYKKKVNQFSDYSEEELKEYF-KTLLHVPNHMIEK-YSK-PFENHLKDNILISEFYTNGKR :. . . : ... : .: .. : .... . . .: .:: : .::. : . : :.:.. . . ::::: S77691 FRTHGFYIPEQKYLVDKIGLVKYMSEKIGLGNICIVCNYQGRTLTAVRQHMLAKRHCKIPYESE-DERLEISEFYDFTSS 190 200 210 220 230 240 250 260 330 340 350 360 370 380 390 400 QUERY NEKDIFSKVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKDNFGCDGGHPFYSF S77691 YANFNSNTTPDNEDDWEDVGSDEAGSDDEDLPQEYLYNDGIELHLPTGIKVGHRSLQRYYKQDLKPEVILTEGQGTLVAA 270 280 290 300 310 320 330 340 --------------------------------------------------------------------------- >>S41415 heat shock protein 70 - rat (641 aa) initn: 76 init1: 76 opt: 112 Z-score: 119.0 expect() 8.6 Smith-Waterman score: 112; 29.496% identity in 139 aa overlap Entrez lookup Re-search database >S41415 44- 171: ---------------- : 10 20 30 40 50 60 70 QUERY IKEMKELAFARPSLVETLNKKKKFLKKKEKRTFVLSIYAFITFIIFCIGILYFT--NKSSAHNN----NNNKNEHSLKKE .:: : ::: : .::... : .:.:.. :.:: S41415 PGVLIQVYEGERAMTRDNNLLGRFDLTGIPPAPRGVPQIEVTFDIDANGILNVTAMDKSTGKANKITITNDKGR--LSKE 440 450 460 470 480 490 500 510 80 90 100 110 120 130 140 150 QUERY EIELLRVLLEKYKKQKDGILNESSNEEDEEKYTLNSETYNNKNNVSN--IKNDSIKSKKEEYIN-LERIL--LEKYKKFI ::: . :.:: . .: . :. : .:.: ..: :. :.. .:. .: :.. .. ..: :: . S41415 EIERMVQEAERYKAEDEG-----QREKIAAKNALESYAFNMKSAVGDEGLKDKISESDKKKILDKCSEVLSWLEANQLAE 520 530 540 550 560 570 580 590 160 170 180 190 200 210 220 230 QUERY NENNEENRKELSNILHKLLEINKLILREEKDDKKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEH .:. ...:::: :. . .. S41415 KEEFDHKRKELENMCNPIITKLYQSGCTGPTCAPGYTPGRARTGPTIEEVD 600 610 620 630 640 --------------------------------------------------------------------------- >>I49761 heat shock protein 70 - mouse (641 aa) initn: 78 init1: 78 opt: 112 Z-score: 119.0 expect() 8.6 Smith-Waterman score: 112; 26.866% identity in 134 aa overlap Entrez lookup Re-search database >I49761 44- 171: ---------------- : 10 20 30 40 50 60 70 QUERY IKEMKELAFARPSLVETLNKKKKFLKKKEKRTFVLSIYAFITFIIFCIGILYFT--NKSSAHNN----NNNKNEHSLKKE .:: : ::: : .::... : .:.:.. :.:: I49761 PGVLIQVYEGERAMTRDNNLLGRFDLTGIPPAPRGVPQIEVTFDIDANGILNVTAMDKSTGKANKITITNDKGR--LSKE 440 450 460 470 480 490 500 510 80 90 100 110 120 130 140 150 QUERY EIELLRVLLEKYKKQKDGILNESSNEEDEEKYTLNSETYNNKNNVSNIKNDSIKSKKEEYINLERILLEKYKKFINENNE ::: . :.:: . .: .. . .. :.:..: .. . ..... ..: :.: . : :: . ... . I49761 EIERMVQEAERYKAEDEGQREKIAAKNALESYAFNMKSAVGDEGLKDKISESDKKKILDKCNEVLSWLEANQLAEKDEFD 520 530 540 550 560 570 580 590 160 170 180 190 200 210 220 230 QUERY ENRKELSNILHKLLEINKLILREEKDDKKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYK ..:::: :. . .. I49761 HKRKELENMCNPIITKLYQSGCTGPTCTPGYTPGRAATGPTIEEVD 600 610 620 630 640 --------------------------------------------------------------------------- >>A27077 heat shock cognate protein 70 - human (646 aa) initn: 85 init1: 85 opt: 112 Z-score: 118.9 expect() 8.6 Smith-Waterman score: 112; 26.866% identity in 134 aa overlap Entrez lookup Re-search database >A27077 44- 171: ---------------- : 10 20 30 40 50 60 70 QUERY IKEMKELAFARPSLVETLNKKKKFLKKKEKRTFVLSIYAFITFIIFCIGILYFT--NKSSAHNN----NNNKNEHSLKKE .:: : ::: . .::....: .:.:.. :.:: A27077 PGVLIQVYEGERAMTKDNNLLGKFELTGIPPAPRGVPQIEVTFDIDANGILNVSAVDKSTGKENKITITNDKGR--LSKE 440 450 460 470 480 490 500 510 80 90 100 110 120 130 140 150 QUERY EIELLRVLLEKYKKQKDGILNESSNEEDEEKYTLNSETYNNKNNVSNIKNDSIKSKKEEYINLERILLEKYKKFINENNE .:: . :::: . . .. :.... :.:..: .. . ..... :: :.: . : :.: . .:. : A27077 DIERMVQEAEKYKAEDEKQRDKVSSKNSLESYAFNMKATVEDEKLQGKINDEDKQKILDKCNEIINWLDKNQTAEKEEFE 520 530 540 550 560 570 580 590 160 170 180 190 200 210 220 230 QUERY ENRKELSNILHKLLEINKLILREEKDDKKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYK ...::: .. . .. A27077 HQQKELEKVCNPIITKLYQSAGGMPGGMPGGFPGGGAPPSGGASSGPTIEEVD 600 610 620 630 640 --------------------------------------------------------------------------- >>JC4853 heat-shock protein 73 - mouse (646 aa) initn: 85 init1: 85 opt: 112 Z-score: 118.9 expect() 8.6 Smith-Waterman score: 112; 26.866% identity in 134 aa overlap Entrez lookup Re-search database >JC4853 44- 171: ---------------- : 10 20 30 40 50 60 70 QUERY IKEMKELAFARPSLVETLNKKKKFLKKKEKRTFVLSIYAFITFIIFCIGILYFT--NKSSAHNN----NNNKNEHSLKKE .:: : ::: . .::....: .:.:.. :.:: JC4853 PGVLIQVYEGERAMTKDNNLLGKFELTGIPPAPRGVPQIEVTFDIDANGILNVSAVDKSTGKENKITITNDKGR--LSKE 440 450 460 470 480 490 500 510 80 90 100 110 120 130 140 150 QUERY EIELLRVLLEKYKKQKDGILNESSNEEDEEKYTLNSETYNNKNNVSNIKNDSIKSKKEEYINLERILLEKYKKFINENNE .:: . :::: . . .. :.... :.:..: .. . ..... :: :.: . : :.: . .:. : JC4853 DIERMVQEAEKYKAEDEKQRDKVSSKNSLESYAFNMKATVEDEKLQGKINDEDKQKILDKCNEIISWLDKNQTAEKEEFE 520 530 540 550 560 570 580 590 160 170 180 190 200 210 220 230 QUERY ENRKELSNILHKLLEINKLILREEKDDKKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYK ...::: .. . .. JC4853 HQQKELEKVCNPIITKLYQSAGGMPGGMPGGFPGGGAPPSGGASSGPTIEEVD 600 610 620 630 640 --------------------------------------------------------------------------- >>A35922 heat shock cognate protein 70 - Chinese hamster (646 aa) initn: 85 init1: 85 opt: 112 Z-score: 118.9 expect() 8.6 Smith-Waterman score: 112; 26.866% identity in 134 aa overlap Entrez lookup Re-search database >A35922 44- 171: ---------------- : 10 20 30 40 50 60 70 QUERY IKEMKELAFARPSLVETLNKKKKFLKKKEKRTFVLSIYAFITFIIFCIGILYFT--NKSSAHNN----NNNKNEHSLKKE .:: : ::: . .::....: .:.:.. :.:: A35922 PGVLIQVYEGERAMTKDNNLLGKFELTGIPPAPRGVPQIEVTFDIDANGILNVSAVDKSTGKENKITITNDKGR--LSKE 440 450 460 470 480 490 500 510 80 90 100 110 120 130 140 150 QUERY EIELLRVLLEKYKKQKDGILNESSNEEDEEKYTLNSETYNNKNNVSNIKNDSIKSKKEEYINLERILLEKYKKFINENNE .:: . :::: . . .. :.... :.:..: .. . ..... :: :.: . : :.: . .:. : A35922 DIERMVQEAEKYKAEDEKQRDKVSSKNSLESYAFNMKATVEDEKLQGKINDEDKQKILDKCNEIISWLDKNQTAEKEEFE 520 530 540 550 560 570 580 590 160 170 180 190 200 210 220 230 QUERY ENRKELSNILHKLLEINKLILREEKDDKKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYK ...::: .. . .. A35922 HQQKELEKVCNPIITKLYQSAGGMPGGMPGGFPGGGAPPSGGASSGPTIEEVD 600 610 620 630 640 --------------------------------------------------------------------------- >>A45935 heat shock cognate protein 70 - mouse (646 aa) initn: 85 init1: 85 opt: 112 Z-score: 118.9 expect() 8.6 Smith-Waterman score: 112; 26.866% identity in 134 aa overlap Entrez lookup Re-search database >A45935 44- 171: ---------------- : 10 20 30 40 50 60 70 QUERY IKEMKELAFARPSLVETLNKKKKFLKKKEKRTFVLSIYAFITFIIFCIGILYFT--NKSSAHNN----NNNKNEHSLKKE .:: : ::: . .::....: .:.:.. :.:: A45935 PGVLIQVYEGERAMTKDNNLLGKFELTGIPPAPRGVPQIEVTFDIDANGILNVSAVDKSTGKENKITITNDKGR--LSKE 440 450 460 470 480 490 500 510 80 90 100 110 120 130 140 150 QUERY EIELLRVLLEKYKKQKDGILNESSNEEDEEKYTLNSETYNNKNNVSNIKNDSIKSKKEEYINLERILLEKYKKFINENNE .:: . :::: . . .. :.... :.:..: .. . ..... :: :.: . : :.: . .:. : A45935 DIERMVQEAEKYKAEDEKQRDKVSSKNSLESYAFNMKATVEDEKLQGKINDEDKQKILDKCNEIISWLDKNQTAEKEEFE 520 530 540 550 560 570 580 590 160 170 180 190 200 210 220 230 QUERY ENRKELSNILHKLLEINKLILREEKDDKKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYK ...::: .. . .. A45935 HQQKELEKVCNPIITKLYQSAGGMPGGMPGGFPGGGAPPSGGASSGPTIEEVD 600 610 620 630 640 --------------------------------------------------------------------------- >>S07197 heat shock cognate protein hsc73 - rat (646 aa) initn: 85 init1: 85 opt: 112 Z-score: 118.9 expect() 8.6 Smith-Waterman score: 112; 26.866% identity in 134 aa overlap Entrez lookup Re-search database >S07197 44- 171: ---------------- : 10 20 30 40 50 60 70 QUERY IKEMKELAFARPSLVETLNKKKKFLKKKEKRTFVLSIYAFITFIIFCIGILYFT--NKSSAHNN----NNNKNEHSLKKE .:: : ::: . .::....: .:.:.. :.:: S07197 PGVLIQVYEGERAMTKDNNLLGKFELTGIPPAPRGVPQIEVTFDIDANGILNVSAVDKSTGKENKITITNDKGR--LSKE 440 450 460 470 480 490 500 510 80 90 100 110 120 130 140 150 QUERY EIELLRVLLEKYKKQKDGILNESSNEEDEEKYTLNSETYNNKNNVSNIKNDSIKSKKEEYINLERILLEKYKKFINENNE .:: . :::: . . .. :.... :.:..: .. . ..... :: :.: . : :.: . .:. : S07197 DIERMVQEAEKYKAEDEKQRDKVSSKNSLESYAFNMKATVEDEKLQGKINDEDKQKILDKCNEIISWLDKNQTAEKEEFE 520 530 540 550 560 570 580 590 160 170 180 190 200 210 220 230 QUERY ENRKELSNILHKLLEINKLILREEKDDKKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYK ...::: .. . .. S07197 HQQKELEKVCNPIITKLYQSAGGMPGGMPGGFPGGGAPPSGGASSGPTIEEVD 600 610 620 630 640 --------------------------------------------------------------------------- >>S31716 hsp72-ps1 protein - rat (646 aa) initn: 85 init1: 85 opt: 112 Z-score: 118.9 expect() 8.6 Smith-Waterman score: 112; 26.866% identity in 134 aa overlap Entrez lookup Re-search database >S31716 44- 171: ---------------- : 10 20 30 40 50 60 70 QUERY IKEMKELAFARPSLVETLNKKKKFLKKKEKRTFVLSIYAFITFIIFCIGILYFT--NKSSAHNN----NNNKNEHSLKKE .:: : ::: . .::....: .:.:.. :.:: S31716 PGVLIQVYEGERAMTKDNNLLGKFELTGIPPAPRGVPQIEVTFDIDANGILNVSAVDKSTGKENKITITNDKGR--LSKE 440 450 460 470 480 490 500 510 80 90 100 110 120 130 140 150 QUERY EIELLRVLLEKYKKQKDGILNESSNEEDEEKYTLNSETYNNKNNVSNIKNDSIKSKKEEYINLERILLEKYKKFINENNE .:: . :::: . . .. :.... :.:..: .. . ..... :: :.: . : :.: . .:. : S31716 DIERMVQEAEKYKAEDEKQRDKVSSKNSLESYAFNMKATVEDEKLQGKINDEDKQKILDKCNEIISWLDKNQTAEKEEFE 520 530 540 550 560 570 580 590 160 170 180 190 200 210 220 230 QUERY ENRKELSNILHKLLEINKLILREEKDDKKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYK ...::: .. . .. S31716 HQQKELEKVCNPIITKLYQSAGGMPGGMPGGFPGGGAPPSGGASSGPTIEEVD 600 610 620 630 640 --------------------------------------------------------------------------- >>D64467 hypothetical protein MJ1341 - Methanococcus jannaschii (312 aa) initn: 55 init1: 55 opt: 107 Z-score: 118.5 expect() 9.1 Smith-Waterman score: 107; 25.541% identity in 231 aa overlap Entrez lookup Re-search database >D64467 124- 349: ---------------------------- : 90 100 110 120 130 140 150 160 QUERY VLLEKYKKQKDGILNESSNEEDEEKYTLNSETYNNKNNVSNIKNDSIKSKKEEYINLERILLEKYKKFINENNEENRKEL :.::. :.. :: : ::: . ..::. : D64467 LPSGSSAVFLSMWIAKIYSNEISIPDMGGWQGFLKFPKLLNLKNNMIET------NLGIIDLEKLDESLKENSSLILTSL 40 50 60 70 80 90 100 110 170 180 190 200 210 220 230 240 QUERY SNIL--HKLLEINKLILREEKDDKKVYLINDNYDEKGALEIGMNEEMKYKKEDP-INNIKYASKFFKFMKEHNKVYKNID .. : . : ::.:: ....: .:.: . :. . :... . . : : : .:.. :. . :: .. : D64467 AGYLAPQPLKEIKKLC-----EEREVLFIEDISGKIGG-DCGYGDIVVCSTGTPKILNCEYGG-FLGISKEIEEKLGNAL 120 130 140 150 160 170 180 250 260 270 280 290 300 310 QUERY EQMRKF-EIFK-INYISIKNHNKLNKNAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEF .... . . .: :::... ... :: . ::: :. . ..: . ::. . . . . ..: . : : ::. D64467 NDIKILSKTYKTINYFGLLKEELLNAKKTYKKYVEASKIIKDEIENAYFREFEGISVFI--ECDNPKNISKKINSLIKL- 190 200 210 220 230 240 250 260 320 330 340 350 360 370 380 390 QUERY YTNGKRNEKDIFSKVPEILDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKDNFGCDGG :.:.: . :. ..::: : : D64467 -----DNRKSITTICPNYDRILKNGIVFETKKIDISELNREVINEIIIALSSIL 270 280 290 300 310 --------------------------------------------------------------------------- >>A61061 actinidain (EC 3.4.22.14) - kiwi fruit (cv. Hayward) (fragments) (110 aa) initn: 163 init1: 96 opt: 100 Z-score: 118.1 expect() 9.5 Smith-Waterman score: 133; 25.926% identity in 135 aa overlap Entrez lookup Re-search database >A61061 377- 507: ----------------: 340 350 360 370 380 390 400 410 QUERY LDYREKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKDNFGCDGGHPFYSFLYVLQNE-LCLG ..:.::::..::.. :::::. .: ..... . A61061 SAGAVVDIKIVTGVLISLSEQELIDCGR---GCDGGYITDGFQFIINDGGINTE 10 20 30 40 50 420 430 440 450 460 470 480 490 QUERY DEYKYKAKDDMFCLNYRCKRKVSLSSIGAVKENQLILALNEVGPLSVNVGVNNDFVAYSEGVYNGTCSEELNHSVLLVGY ..: : :.: : :.:. : :: :...: :. ..:.: .::: A61061 ENYPYTAQDG------DCD--VALQ----------------------------DQKHYSSGIFTGPCGTAIDHAVTIVGY 60 70 80 90 500 510 520 530 540 550 560 QUERY GQ---VEKTKLNYNNKIQTYNTKENSNQPDDNIIYYWIIKNSWSKKWGENGFMRLSRNKNGDNVFCGIGEEVFYPIL : .. ..::: A61061 GTEGGIDYWIVKYNN 100 110 --------------------------------------------------------------------------- >>PC4035 cell-cycle-dependent 350K nuclear protein - human (fragment) (1017 aa) initn: 49 init1: 49 opt: 114 Z-score: 118.0 expect() 9.7 Smith-Waterman score: 114; 22.695% identity in 282 aa overlap Entrez lookup Re-search database >PC4035 76- 342: --------------------------------- : 40 50 60 70 80 90 100 QUERY FVLSIYAFITFIIFCIGILYFTNKSSAHNNNNNKNEHSLKKEEIELLRV----LLEKYKKQ--KDGILNESSNEEDEEKY ..:.:: .. ..:: .:. :. :.. .: .. PC4035 QNLELRNLTVELEQKIQVLQSKNASLQDTLEVLQSSYKNLENELELTKMDKMSFVEKVNKMTAKETELQREMHEMAQKTA 460 470 480 490 500 510 520 530 110 120 130 140 150 160 170 180 QUERY TLNSETYNNKNNVSN---IKNDSIKSKKEEYINLERILLE--KYKKFINENNEENRKELSNILHKLLEINKLILREEKDD :. : ..:: ... . . :::.:.. :... :: . :: .. .... .. ... ... : .: :.: . PC4035 ELQEELSGEKNRLAGELQLLLEEIKSSKDQ---LKELTLENSELKKSLDCMHKDQVEKEGKVREEIAEY-QLRLHEAEKK 540 550 560 570 580 590 600 610 190 200 210 220 230 240 250 260 QUERY KKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKE--HN--KVYKNIDEQMRKFEIFKINYISIKNHN ... :.. : .. .:: .: .::. ... : ..: :: .: :. .: :...: .. ...:.. . . PC4035 HQALLLDTN--KQYEVEIQTYREKLTSKEECLSSQKLEIDLLKSSKEELNNSLKATTQILEELKKTKMDNLKYVN--QLK 620 630 640 650 660 670 680 270 280 290 300 310 320 330 340 QUERY KLNKNAMYKKKVNQFSDYSEEELKEYFKTLLHVPNHMIEKYSKPFENHLKDNILISEFYTNGKRNEKDIFSKVPEILDYR : :. :. : :. : . :: :: .. : . :: . : ..:. :. :. .. . :. : .: PC4035 KENERAQGKMKLLIKSCKQLEEEKEILQKELSQLQAAQEKQKTGTVMDTK----VDELTTEIKELKETLEEKTKEADEYL 690 700 710 720 730 740 750 760 350 360 370 380 390 400 410 420 QUERY EKGIVHEPKDQGLCGSCWAFASVGNIESVFAKKNKNILSFSEQEVVDCSKDNFGCDGGHPFYSFLYVLQNELCLGDEYKY .: PC4035 DKYCSLLISHEKLEKAKEMLETQVAHLCSQQSKQDSRGSPLLGPVVPGPSPIPSVTEKRLSSGQNKASGKRQRSSGIWEN 770 780 790 800 810 820 830 840 --------------------------------------------------------------------------- >>PWSP1 H+-transporting ATP synthase (EC 3.6.1.34) chain I - spinach chloropl (184 aa) initn: 65 init1: 65 opt: 103 Z-score: 117.9 expect() 9.9 Smith-Waterman score: 103; 26.733% identity in 101 aa overlap Entrez lookup Re-search database >PWSP1 46- 142: ----------- : 10 20 30 40 50 60 70 80 QUERY EMKELAFARPSLVETLNKKKKFLKKKEKRTFVLSIYAFITFIIFCIGILY-FTNKSSAHNNNNNKNEHSLKKEEIELL-- .:.: :.: . .. . . :. .: . :. . :: : PWSP1 MKNVTDSFVFLGHWPSAGSFGFNTDILATNLINLSVVLGVLIFFGKGVLSDLLDNRKQRILNTIRNSEELRGKAIEQLEK 10 20 30 40 50 60 70 80 90 100 110 120 130 140 150 160 QUERY -RVLLEKYKKQKDGILNESSNEEDEEKYTLNSETYNNKNNVSNIKNDSIKSKKEEYINLERILLEKYKKFINENNEENRK :. :.: . . : . .. .: ..::..: . ::.. .. : ::..:. .... :: : PWSP1 ARARLKKVEMDADQFRVNGYSEIEREKMNLINSTYKTLEQFENYKNETIQFEQQKAINQVRQRVFQQALQGALGTLNSCL 90 100 110 120 130 140 150 160 170 180 190 200 210 220 230 240 QUERY ELSNILHKLLEINKLILREEKDDKKVYLINDNYDEKGALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDE PWSP1 NNELHLRTINANIGMFGAMNEITD 170 180 --------------------------------------------------------------------------- >>S60818 M protein precursor - Streptococcus pyogenes (serotype M47) (fragmen (116 aa) initn: 41 init1: 41 opt: 100 Z-score: 117.8 expect() 10 Smith-Waterman score: 100; 27.500% identity in 80 aa overlap Entrez lookup Re-search database >S60818 159- 237: ----------: 120 130 140 150 160 170 180 190 QUERY KNNVSNIKNDSIKSKKEEYINLERILLEKYKKFINENNEENRKELSNILHKLLEINKLILREEKDDK-KVYLINDNYDEK :..: ..:. :..:: : ... :::: :. :.: . . S60818 LRKLKKGTASVAVALTVLGAGLVVNTNEVGAATLTRNQRESLDFLNGLVDINDLEIHQLKDDKEKLQSQNENLQSQ 10 20 30 40 50 60 70 200 210 220 230 240 250 260 270 QUERY GALEIGMNEEMKYKKEDPINNIKYASKFFKFMKEHNKVYKNIDEQMRKFEIFKINYISIKNHNKLNKNAMYKKKVNQFSD . ..::... .:. :. : . . ...::. : S60818 NENLQSQNENLQSQKDKLTNEKKVLEEKVEETEQNNKALK 80 90 100 110 --------------------------------------------------------------------------- 569 residues in 1 query sequences 33852246 residues in 105998 library sequences Tcomplib (4 proc)[version 3.1t02 March, 1998] start: Sat Apr 25 19:42:20 1998 done: Sat Apr 25 19:43:04 1998 Scan time: 104.117 Display time: 15.667 Function used was FASTA