BLASTP 2.2.22 [Sep-27-2009] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Reference for compositional score matrix adjustment: Altschul, Stephen F., John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis, Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109. Reference for composition-based statistics starting in round 2: Schaffer, Alejandro A., L. Aravind, Thomas L. Madden, Sergei Shavirin, John L. Spouge, Yuri I. Wolf, Eugene V. Koonin, and Stephen F. Altschul (2001), "Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005. Query= gi|255764500|ref|YP_003064987.2| hypothetical protein CLIBASIA_02305 [Candidatus Liberibacter asiaticus str. psy62] (200 letters) Database: nr 14,124,377 sequences; 4,842,793,630 total letters Searching..................................................done Results from round 1 >gi|255764500|ref|YP_003064987.2| hypothetical protein CLIBASIA_02305 [Candidatus Liberibacter asiaticus str. psy62] gi|254547851|gb|ACT57047.2| hypothetical protein CLIBASIA_02305 [Candidatus Liberibacter asiaticus str. psy62] Length = 200 Score = 414 bits (1065), Expect = e-114, Method: Compositional matrix adjust. Identities = 200/200 (100%), Positives = 200/200 (100%) Query: 1 MPLSGYSVSIKKLGIVSVLIPVVLGVSNCDHSDSQHPPVIPILKDDKNDGEPRKPTPLDH 60 MPLSGYSVSIKKLGIVSVLIPVVLGVSNCDHSDSQHPPVIPILKDDKNDGEPRKPTPLDH Sbjct: 1 MPLSGYSVSIKKLGIVSVLIPVVLGVSNCDHSDSQHPPVIPILKDDKNDGEPRKPTPLDH 60 Query: 61 SDSQHPPVLPNSENNAHGDEPIKKSEKPTFRRNQPVIENLRKPTEAEKEKILAARDRYQK 120 SDSQHPPVLPNSENNAHGDEPIKKSEKPTFRRNQPVIENLRKPTEAEKEKILAARDRYQK Sbjct: 61 SDSQHPPVLPNSENNAHGDEPIKKSEKPTFRRNQPVIENLRKPTEAEKEKILAARDRYQK 120 Query: 121 TNNEEAIASIIWHRYQHMDYKGKTEQEKRALAKQSRDDFQRYATAQAANQKAADMLMLAT 180 TNNEEAIASIIWHRYQHMDYKGKTEQEKRALAKQSRDDFQRYATAQAANQKAADMLMLAT Sbjct: 121 TNNEEAIASIIWHRYQHMDYKGKTEQEKRALAKQSRDDFQRYATAQAANQKAADMLMLAT 180 Query: 181 YGLQYDDSLTKIQDPPKMEE 200 YGLQYDDSLTKIQDPPKMEE Sbjct: 181 YGLQYDDSLTKIQDPPKMEE 200 >gi|190406267|gb|EDV09534.1| protein SYG1 [Saccharomyces cerevisiae RM11-1a] gi|259147210|emb|CAY80463.1| Syg1p [Saccharomyces cerevisiae EC1118] Length = 902 Score = 37.0 bits (84), Expect = 1.5, Method: Composition-based stats. Identities = 19/71 (26%), Positives = 35/71 (49%), Gaps = 6/71 (8%) Query: 67 PVLPNSENNAHGDEPIKKSEKPTFRRNQPVIENLRK------PTEAEKEKILAARDRYQK 120 P+ P+ +NN H + + TFRR V EN+ + T+ ++ + DR + Sbjct: 832 PITPSHDNNPHSFAEPMPAYRGTFRRRSSVFENISRSIPWAHATDFQRPTVNTVDDRSPE 891 Query: 121 TNNEEAIASII 131 T++E + SI+ Sbjct: 892 TDSESEVESIM 902 >gi|151943112|gb|EDN61447.1| conserved protein [Saccharomyces cerevisiae YJM789] Length = 902 Score = 37.0 bits (84), Expect = 1.5, Method: Composition-based stats. Identities = 19/71 (26%), Positives = 35/71 (49%), Gaps = 6/71 (8%) Query: 67 PVLPNSENNAHGDEPIKKSEKPTFRRNQPVIENLRK------PTEAEKEKILAARDRYQK 120 P+ P+ +NN H + + TFRR V EN+ + T+ ++ + DR + Sbjct: 832 PITPSHDNNPHSFAEPMPAYRGTFRRRSSVFENISRSIPWAHATDFQRPTVNTVDDRSPE 891 Query: 121 TNNEEAIASII 131 T++E + SI+ Sbjct: 892 TDSESEVESIM 902 >gi|256269788|gb|EEU05054.1| Syg1p [Saccharomyces cerevisiae JAY291] Length = 902 Score = 37.0 bits (84), Expect = 1.5, Method: Composition-based stats. Identities = 19/71 (26%), Positives = 35/71 (49%), Gaps = 6/71 (8%) Query: 67 PVLPNSENNAHGDEPIKKSEKPTFRRNQPVIENLRK------PTEAEKEKILAARDRYQK 120 P+ P+ +NN H + + TFRR V EN+ + T+ ++ + DR + Sbjct: 832 PITPSHDNNPHSFAEPMPAYRGTFRRRSSVFENISRSIPWAHATDFQRPTVNTVDDRSPE 891 Query: 121 TNNEEAIASII 131 T++E + SI+ Sbjct: 892 TDSESEVESIM 902 >gi|6322142|ref|NP_012217.1| Syg1p [Saccharomyces cerevisiae S288c] gi|731805|sp|P40528|SYG1_YEAST RecName: Full=Protein SYG1 gi|600001|emb|CAA86904.1| Syg1p [Saccharomyces cerevisiae] gi|643449|gb|AAA91621.1| Syg1p [Saccharomyces cerevisiae] gi|285812601|tpg|DAA08500.1| TPA: Syg1p [Saccharomyces cerevisiae S288c] Length = 902 Score = 37.0 bits (84), Expect = 1.5, Method: Composition-based stats. Identities = 19/71 (26%), Positives = 35/71 (49%), Gaps = 6/71 (8%) Query: 67 PVLPNSENNAHGDEPIKKSEKPTFRRNQPVIENLRK------PTEAEKEKILAARDRYQK 120 P+ P+ +NN H + + TFRR V EN+ + T+ ++ + DR + Sbjct: 832 PITPSHDNNPHSFAEPMPAYRGTFRRRSSVFENISRSIPWAHATDFQRPTVNTVDDRSPE 891 Query: 121 TNNEEAIASII 131 T++E + SI+ Sbjct: 892 TDSESEVESIM 902 >gi|323348138|gb|EGA82392.1| Syg1p [Saccharomyces cerevisiae Lalvin QA23] Length = 902 Score = 37.0 bits (84), Expect = 1.5, Method: Composition-based stats. Identities = 19/71 (26%), Positives = 35/71 (49%), Gaps = 6/71 (8%) Query: 67 PVLPNSENNAHGDEPIKKSEKPTFRRNQPVIENLRK------PTEAEKEKILAARDRYQK 120 P+ P+ +NN H + + TFRR V EN+ + T+ ++ + DR + Sbjct: 832 PITPSHDNNPHSFAEPMPAYRGTFRRRSSVFENISRSIPWAHATDFQRPTVNTVDDRSPE 891 Query: 121 TNNEEAIASII 131 T++E + SI+ Sbjct: 892 TDSESEVESIM 902 >gi|21357621|ref|NP_649644.1| CG17919 [Drosophila melanogaster] gi|7298931|gb|AAF54136.1| CG17919 [Drosophila melanogaster] gi|18447224|gb|AAL68202.1| GH14494p [Drosophila melanogaster] gi|220944538|gb|ACL84812.1| CG17919-PA [synthetic construct] gi|220954412|gb|ACL89749.1| CG17919-PA [synthetic construct] Length = 202 Score = 35.0 bits (79), Expect = 5.7, Method: Compositional matrix adjust. Identities = 21/63 (33%), Positives = 36/63 (57%), Gaps = 3/63 (4%) Query: 133 HRYQHMDYK--GKTEQEKRALAKQSRDDFQRYATAQ-AANQKAADMLMLATYGLQYDDSL 189 HRY + YK GK E ++ ++K+SR D +++ A+ A N + + + Y QYDD + Sbjct: 133 HRYVFLLYKQSGKLEFDEERVSKRSRKDRPKFSAAKFAINHELGNPIAGTFYQAQYDDYV 192 Query: 190 TKI 192 K+ Sbjct: 193 PKL 195 >gi|310774950|gb|ADP21893.1| Hypothetical protein Y39H10B.2b [Caenorhabditis elegans] Length = 161 Score = 35.0 bits (79), Expect = 6.4, Method: Compositional matrix adjust. Identities = 26/89 (29%), Positives = 43/89 (48%), Gaps = 15/89 (16%) Query: 39 VIPILKDDKNDGEPRKPTPLDHSD-----SQHPPVLPNSENNAHGDEPIKKSEKPTFRRN 93 ++ LKDD ++ R T LD+S+ Q ++ NSE KS++ T + Sbjct: 25 IVQELKDDADERRNRAQTSLDNSEYETLVQQRQKLIENSE----------KSQQETKEQL 74 Query: 94 QPVIENLRKPTEAEKEKILAARDRYQKTN 122 + ++E RK + KEK A + Q+TN Sbjct: 75 EKIVELQRKSCDQNKEKFTALELQLQETN 103 >gi|269123576|ref|YP_003306153.1| hypothetical protein Smon_0808 [Streptobacillus moniliformis DSM 12112] gi|268314902|gb|ACZ01276.1| hypothetical protein Smon_0808 [Streptobacillus moniliformis DSM 12112] Length = 685 Score = 34.7 bits (78), Expect = 7.7, Method: Composition-based stats. Identities = 28/79 (35%), Positives = 40/79 (50%), Gaps = 4/79 (5%) Query: 76 AHGDEPIKKSEKPTFRRNQPVIENLRKPTEAEKEKILAARDRYQKTNNEE-AIASIIWHR 134 A G++ K S K F + NL+K E E IL A +Y+ NN E AI +I R Sbjct: 88 AGGEKQFKSSGKREFNTKGYIGINLKKKLTEELELILNASYQYKDNNNHEGAIKELIKSR 147 Query: 135 YQHMDYKGKTEQEKRALAK 153 ++K + E++K AL K Sbjct: 148 ---GEWKKEFEEDKEALKK 163 >gi|164426529|ref|XP_961292.2| hypothetical protein NCU04136 [Neurospora crassa OR74A] gi|157071372|gb|EAA32056.2| hypothetical protein NCU04136 [Neurospora crassa OR74A] Length = 751 Score = 34.3 bits (77), Expect = 9.5, Method: Composition-based stats. Identities = 31/107 (28%), Positives = 48/107 (44%), Gaps = 17/107 (15%) Query: 22 VVLGVSNCDHSDSQHP------PVIPILKDDKNDGEPRKPTPLDHSDSQHPPVLPNSENN 75 V GV+N D DS P P+ P L++D L+ D +HPPV S Sbjct: 221 VFTGVTNRDVLDSSAPLKLGPLPMEPELREDVR-------AELEDEDQRHPPVEGRSTLV 273 Query: 76 AHGDEPIKKSEK---PTFRRNQPVIENLRKPTEAEKEKILAARDRYQ 119 D+ IK+ E P+ R + P+ + + E +K+ RDR++ Sbjct: 274 EEFDQRIKREESADAPS-RTDLPLPPSRARDVVMEMQKVRENRDRFK 319 Searching..................................................done Results from round 2 CONVERGED! >gi|255764500|ref|YP_003064987.2| hypothetical protein CLIBASIA_02305 [Candidatus Liberibacter asiaticus str. psy62] gi|254547851|gb|ACT57047.2| hypothetical protein CLIBASIA_02305 [Candidatus Liberibacter asiaticus str. psy62] Length = 200 Score = 400 bits (1028), Expect = e-110, Method: Composition-based stats. Identities = 200/200 (100%), Positives = 200/200 (100%) Query: 1 MPLSGYSVSIKKLGIVSVLIPVVLGVSNCDHSDSQHPPVIPILKDDKNDGEPRKPTPLDH 60 MPLSGYSVSIKKLGIVSVLIPVVLGVSNCDHSDSQHPPVIPILKDDKNDGEPRKPTPLDH Sbjct: 1 MPLSGYSVSIKKLGIVSVLIPVVLGVSNCDHSDSQHPPVIPILKDDKNDGEPRKPTPLDH 60 Query: 61 SDSQHPPVLPNSENNAHGDEPIKKSEKPTFRRNQPVIENLRKPTEAEKEKILAARDRYQK 120 SDSQHPPVLPNSENNAHGDEPIKKSEKPTFRRNQPVIENLRKPTEAEKEKILAARDRYQK Sbjct: 61 SDSQHPPVLPNSENNAHGDEPIKKSEKPTFRRNQPVIENLRKPTEAEKEKILAARDRYQK 120 Query: 121 TNNEEAIASIIWHRYQHMDYKGKTEQEKRALAKQSRDDFQRYATAQAANQKAADMLMLAT 180 TNNEEAIASIIWHRYQHMDYKGKTEQEKRALAKQSRDDFQRYATAQAANQKAADMLMLAT Sbjct: 121 TNNEEAIASIIWHRYQHMDYKGKTEQEKRALAKQSRDDFQRYATAQAANQKAADMLMLAT 180 Query: 181 YGLQYDDSLTKIQDPPKMEE 200 YGLQYDDSLTKIQDPPKMEE Sbjct: 181 YGLQYDDSLTKIQDPPKMEE 200 >gi|238501784|ref|XP_002382126.1| conserved hypothetical protein [Aspergillus flavus NRRL3357] gi|220692363|gb|EED48710.1| conserved hypothetical protein [Aspergillus flavus NRRL3357] Length = 709 Score = 44.0 bits (102), Expect = 0.011, Method: Composition-based stats. Identities = 33/118 (27%), Positives = 50/118 (42%), Gaps = 13/118 (11%) Query: 38 PVIPILKDDKNDGEPRKPTPLDHSDSQHPPVLPNSENNAHG-DEPIKKSEKPTFRRNQPV 96 PVIP +P P P + SQH P P A+G +P K SE P R + Sbjct: 490 PVIP---------KPATPQPSGPAASQHAPTSPEKAAEANGTSQPEKPSESPMPRPEKKQ 540 Query: 97 IENLRKPTEAEKEKILAARDRYQKTNNEEAIASIIWHRYQHMDYKGKTEQEKRALAKQ 154 L + + E+ A RD + + + + + W +Y H+ E+ K AL +Q Sbjct: 541 SNGLFV-SNVDNEQ--AVRDLFPEEDKAKIVKLDKWGKYNHVVMFNSVEEAKAALDRQ 595 >gi|169769314|ref|XP_001819127.1| Ser/Thr protein phosphatase family protein [Aspergillus oryzae RIB40] gi|83766985|dbj|BAE57125.1| unnamed protein product [Aspergillus oryzae] Length = 705 Score = 44.0 bits (102), Expect = 0.013, Method: Composition-based stats. Identities = 33/118 (27%), Positives = 50/118 (42%), Gaps = 13/118 (11%) Query: 38 PVIPILKDDKNDGEPRKPTPLDHSDSQHPPVLPNSENNAHG-DEPIKKSEKPTFRRNQPV 96 PVIP +P P P + SQH P P A+G +P K SE P R + Sbjct: 486 PVIP---------KPATPQPSGPAASQHAPTSPEKAAEANGTSQPEKPSESPMPRPEKKQ 536 Query: 97 IENLRKPTEAEKEKILAARDRYQKTNNEEAIASIIWHRYQHMDYKGKTEQEKRALAKQ 154 L + + E+ A RD + + + + + W +Y H+ E+ K AL +Q Sbjct: 537 SNGLFV-SNVDNEQ--AVRDLFPEEDKAKIVKLDKWGKYNHVVMFNSVEEAKAALDRQ 591 >gi|195573873|ref|XP_002104916.1| GD18202 [Drosophila simulans] gi|194200843|gb|EDX14419.1| GD18202 [Drosophila simulans] Length = 933 Score = 41.3 bits (95), Expect = 0.077, Method: Composition-based stats. Identities = 40/154 (25%), Positives = 61/154 (39%), Gaps = 23/154 (14%) Query: 29 CDHSDSQHPPVIPILKDD-KNDGEPRKPTPLDHSDSQHPPVLPNSENNAHGDEPIKKSEK 87 CD D + V+ ++K K G PT PP LP N P SE Sbjct: 216 CDLCDKTYKYVMALVKHKHKEHGSSLLPTKSGRKMPGRPPTLPEDANPY---SPASTSET 272 Query: 88 PTFR---RNQPVIENLRKPTEAEKEKILAARDRY----QKTNNEEAIASIIWHRY-QHMD 139 T R N ++ ++ K E E + D Y Q + + + I S+I H++ +H+D Sbjct: 273 STHRSRTNNDALVHSIIKAVELSDEDGHSGEDSYYNCDQCSKSYKYIVSLIKHKHKEHLD 332 Query: 140 YKGKTEQEKRALAKQSRDDFQRYATAQAANQKAA 173 + E DD +T+ +AN K A Sbjct: 333 KQSDDE-----------DDRPLPSTSASANAKPA 355 >gi|219130507|ref|XP_002185405.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1] gi|217403119|gb|EEC43074.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1] Length = 389 Score = 40.9 bits (94), Expect = 0.089, Method: Composition-based stats. Identities = 24/91 (26%), Positives = 41/91 (45%), Gaps = 4/91 (4%) Query: 85 SEKPTFRRNQPVIENLRKPTEAEKEKILAARDRYQKTNNEEAIASIIWHRYQHMDYKGKT 144 S++ T + P + R+ E EKE+ + K + A I ++ Q +D+ G+ Sbjct: 165 SQRNTAQAQSPKVPTRRRSVEQEKER----QQSPHKNEAHQITADFILNKMQVLDFAGRR 220 Query: 145 EQEKRALAKQSRDDFQRYATAQAANQKAADM 175 E ++ D+ QRY T QA Q +M Sbjct: 221 ESPPCKPTRRCSDEHQRYMTQQAIAQVLDEM 251 >gi|238576584|ref|XP_002388087.1| hypothetical protein MPER_12939 [Moniliophthora perniciosa FA553] gi|215449082|gb|EEB89017.1| hypothetical protein MPER_12939 [Moniliophthora perniciosa FA553] Length = 252 Score = 40.5 bits (93), Expect = 0.12, Method: Composition-based stats. Identities = 22/70 (31%), Positives = 38/70 (54%), Gaps = 7/70 (10%) Query: 21 PVVLGV-SNCDHSDSQHPPVIPILKDDKNDGEPRKPTPLDHSDSQHPPVLPNSENNA-HG 78 PV L S+ D+ ++ P+ + DD++D PR+P D ++PP P+S N++ + Sbjct: 65 PVYLDADSSHDNGHAEREPLFGVPSDDEDDDTPRRP-----HDIENPPPAPHSRNSSDNS 119 Query: 79 DEPIKKSEKP 88 P+ KS P Sbjct: 120 PRPVHKSSSP 129 >gi|24646066|ref|NP_524319.2| mutagen-sensitive 309 [Drosophila melanogaster] gi|17366114|sp|Q9VGI8|BLM_DROME RecName: Full=Bloom syndrome protein homolog; Short=Dmblm; AltName: Full=Mutagen-sensitive protein 309; AltName: Full=RecQ helicase homolog gi|7299503|gb|AAF54691.1| mutagen-sensitive 309 [Drosophila melanogaster] Length = 1487 Score = 40.5 bits (93), Expect = 0.14, Method: Composition-based stats. Identities = 30/110 (27%), Positives = 46/110 (41%), Gaps = 10/110 (9%) Query: 52 PRKPTPLDHSDSQHPPVLPNSENNAHGDEPIKKSEKPTFR-------RNQPVIENLRKPT 104 P+KPT LD S S +L + +P++ T+R N+ + + LRKP Sbjct: 109 PKKPTSLDLSVSPLAELLAKKSDRDSPKKPVQNENSYTYRGLSESPVENKSIGDTLRKPP 168 Query: 105 EAE-KEKILAARDRYQK--TNNEEAIASIIWHRYQHMDYKGKTEQEKRAL 151 + E K I+ D +K T NE I R+ D+ K + L Sbjct: 169 QKERKTSIVWLSDSPEKKVTQNERKILDSPLQRFSFEDFPNKENGNRHHL 218 >gi|159117406|ref|XP_001708923.1| Hypothetical protein GL50803_31978 [Giardia lamblia ATCC 50803] gi|157437037|gb|EDO81249.1| hypothetical protein GL50803_31978 [Giardia lamblia ATCC 50803] Length = 1824 Score = 40.1 bits (92), Expect = 0.16, Method: Composition-based stats. Identities = 26/109 (23%), Positives = 51/109 (46%), Gaps = 7/109 (6%) Query: 1 MPLSGYSVSIKKLGIVSVLIPVVLGVSNCDHSDSQHPPVIPILKDDKNDGEPRKPTPLDH 60 +P ++ L + ++ +P S+ ++ + P+ D N P+PL Sbjct: 1639 LPTQALTIPQVPLNVSAITVPKFTASSHTCNTPAVSEVTTPVNPTDLNATMIPLPSPL-- 1696 Query: 61 SDSQHPPVLPNSENNAHGDEPIKKSEKPTFRRNQPVIENLRKPTEAEKE 109 + ++P+S N EPI E+P +R QP+I N KP+++++E Sbjct: 1697 ---RKNVLVPDSSNEKGCQEPI--PEQPKDKRPQPLIFNFFKPSKSKQE 1740 >gi|256424973|ref|YP_003125626.1| pseudouridine synthase [Chitinophaga pinensis DSM 2588] gi|256039881|gb|ACU63425.1| pseudouridine synthase [Chitinophaga pinensis DSM 2588] Length = 389 Score = 40.1 bits (92), Expect = 0.18, Method: Composition-based stats. Identities = 17/58 (29%), Positives = 27/58 (46%), Gaps = 2/58 (3%) Query: 31 HSDSQHPPVIPILKDDKNDGEPRKPTPLDHSDSQHPPVLPNSENNAHGDEPIKKSEKP 88 H + Q P +P K+ + G R P H + Q P P ++++ H +P K KP Sbjct: 286 HREEQEPAFMP--KNKPSAGHNRAAAPAKHREQQEPAFTPKNKSSTHKHKPADKDSKP 341 >gi|326673852|ref|XP_699567.5| PREDICTED: hypothetical protein LOC570939 [Danio rerio] Length = 1167 Score = 39.7 bits (91), Expect = 0.21, Method: Composition-based stats. Identities = 37/140 (26%), Positives = 60/140 (42%), Gaps = 14/140 (10%) Query: 23 VLGVSNCDHSDSQHPPVIPILKDDKND--GEPRKPTPLDHSDSQHPPVLPNSENNAHGDE 80 +L ++C HSD P +P ++ G R T H +S PV P + N G++ Sbjct: 994 LLDTNSCLHSD----PSLPANPQNERPSMGSTRSTTIQPHEESSPHPVKPLTNNTLQGEK 1049 Query: 81 PIKKSEKPTFRRNQPVIENLRKP--TEAEKEK----ILAARDRYQKTNNEEAIASIIWHR 134 K + + + I R T AE+EK ++ AR YQ N E IA ++ Sbjct: 1050 --KAESRSVLEKLKSTINPGRSALATTAEEEKQQLSLMEARAHYQNMTNMELIALLLQQE 1107 Query: 135 YQHMDYKGKTEQEKRALAKQ 154 + + +TE + L K+ Sbjct: 1108 LEIKKQRAETEVQVVMLEKR 1127 >gi|126303449|ref|XP_001373298.1| PREDICTED: hypothetical protein [Monodelphis domestica] Length = 462 Score = 39.7 bits (91), Expect = 0.22, Method: Composition-based stats. Identities = 40/149 (26%), Positives = 63/149 (42%), Gaps = 23/149 (15%) Query: 31 HSDSQH------PPVIPILKDDKNDG-EPRKPTPLDHSDSQHPPVLPNSENNAHGDEPIK 83 HSDS+ P + +DK G E RK + + Q P + D + Sbjct: 133 HSDSKALKNWEVPNASKRIDNDKESGPETRKASTPEEEKIQSPSSSSRNIQQMLADSINR 192 Query: 84 ------KSEKPTFRRNQPVIENLRKPTEAEKEKILAARDRYQKTNNEEAIASIIWHRYQH 137 + +K T R+ Q + LR+ E + +I QK+N EE +Q Sbjct: 193 MKTYGFQQKKETIRKKQD--DELRQVEEEKTRQICKNWKETQKSNKEE-------REWQE 243 Query: 138 MDYKGKTEQEKR-ALAKQSRDDFQRYATA 165 + K EKR ++AKQ+RDD++R + A Sbjct: 244 SLRRSKAADEKRRSIAKQARDDYKRLSLA 272 >gi|189533777|ref|XP_001919339.1| PREDICTED: hypothetical protein LOC553309 [Danio rerio] Length = 826 Score = 39.3 bits (90), Expect = 0.27, Method: Composition-based stats. Identities = 31/113 (27%), Positives = 51/113 (45%), Gaps = 32/113 (28%) Query: 33 DSQHPPVIPI-----------LKDDKNDGEPRKP---TPLDHS------DSQHPPVLPNS 72 DS HPP +P+ L+D++ EP P +D S ++ P V N Sbjct: 603 DSDHPPEVPVLLTNLVECVSSLQDNEASDEPVAPEQEASMDESTESPIEETNEPAV--NK 660 Query: 73 ENNAHGDEPIKKSEKPTFRRNQPVIENLRKPTEAEKEKILAARDRYQKTNNEE 125 E+ DEP++ E +QP E+ +E E+EK+ D++++ EE Sbjct: 661 EDETISDEPLETQE------SQPGTED----SENEEEKLPEGEDKHEQKEGEE 703 >gi|225389915|ref|ZP_03759639.1| hypothetical protein CLOSTASPAR_03665 [Clostridium asparagiforme DSM 15981] gi|225044026|gb|EEG54272.1| hypothetical protein CLOSTASPAR_03665 [Clostridium asparagiforme DSM 15981] Length = 899 Score = 39.3 bits (90), Expect = 0.31, Method: Composition-based stats. Identities = 42/175 (24%), Positives = 73/175 (41%), Gaps = 22/175 (12%) Query: 30 DHSDSQHPPVIPILKDDKNDGEPRKPTP-LDHSDSQHPPVLPNSENNAHGDEPIKKS--- 85 D S+ +PPV +D+ + EP + P +D S+ +PPV E+N DEP + + Sbjct: 344 DESEQDNPPVDEPEQDNPSVDEPEQGNPPVDESEESNPPVDEPEESNPPVDEPEENNPPV 403 Query: 86 EKPTFRRNQPVIENLR-KPTEAEKEKILAARDRYQKTNNEEAIASIIWHRYQHMDYKGKT 144 ++P N PV E + P E E+ D ++ N E + DYK Sbjct: 404 DEPE-ENNPPVDEPEQGNPPVDEPEEGNPPVDEPEQDNEPEEKSG---------DYKKSD 453 Query: 145 EQEKRA-----LAKQSRDDFQRYATAQAANQKAADMLMLATYGLQYDDSLTKIQD 194 + A + D F + ++ ++AD ++ G + ++ L K D Sbjct: 454 SPKTNATMFSLFNVEIEDSFTTFDIGESGGSQSADEVV--NEGGKVNEVLNKAYD 506 >gi|320544014|ref|NP_001188950.1| Stretchin-Mlck, isoform K [Drosophila melanogaster] gi|318068620|gb|ADV37196.1| Stretchin-Mlck, isoform K [Drosophila melanogaster] Length = 9270 Score = 39.3 bits (90), Expect = 0.31, Method: Composition-based stats. Identities = 32/124 (25%), Positives = 53/124 (42%), Gaps = 9/124 (7%) Query: 39 VIPILKDDKNDGEPRKPTPLDHSDSQHPPVLPNSENNAHGDEPIKKSEKPTFRRNQPVIE 98 V+P ++ E + P P+D P LP E +HG K SE + + Sbjct: 5150 VVPEKISEEKVAEIKTPEPMDSKAKSKPDGLPADE-KSHG---AKVSESVPVKNEAEKTD 5205 Query: 99 NL--RKPTEAEKEKILAARDRYQKTNNEEAIASIIWHRYQHMDYKGKTEQEKRALAKQSR 156 L +KPT +++ ++ R Y E+ SI Y+ MD + K +E R+ ++ Sbjct: 5206 QLSAKKPTVLDEDLVVPKRKPYLA---EQTADSISLQTYKSMDSEYKDRKESRSAKRKPT 5262 Query: 157 DDFQ 160 D Q Sbjct: 5263 VDIQ 5266 >gi|320544016|ref|NP_001188951.1| Stretchin-Mlck, isoform L [Drosophila melanogaster] gi|318068621|gb|ADV37197.1| Stretchin-Mlck, isoform L [Drosophila melanogaster] Length = 9839 Score = 39.3 bits (90), Expect = 0.32, Method: Composition-based stats. Identities = 32/124 (25%), Positives = 53/124 (42%), Gaps = 9/124 (7%) Query: 39 VIPILKDDKNDGEPRKPTPLDHSDSQHPPVLPNSENNAHGDEPIKKSEKPTFRRNQPVIE 98 V+P ++ E + P P+D P LP E +HG K SE + + Sbjct: 5150 VVPEKISEEKVAEIKTPEPMDSKAKSKPDGLPADE-KSHG---AKVSESVPVKNEAEKTD 5205 Query: 99 NL--RKPTEAEKEKILAARDRYQKTNNEEAIASIIWHRYQHMDYKGKTEQEKRALAKQSR 156 L +KPT +++ ++ R Y E+ SI Y+ MD + K +E R+ ++ Sbjct: 5206 QLSAKKPTVLDEDLVVPKRKPYLA---EQTADSISLQTYKSMDSEYKDRKESRSAKRKPT 5262 Query: 157 DDFQ 160 D Q Sbjct: 5263 VDIQ 5266 >gi|320544026|ref|NP_001188956.1| Stretchin-Mlck, isoform Q [Drosophila melanogaster] gi|318068626|gb|ADV37202.1| Stretchin-Mlck, isoform Q [Drosophila melanogaster] Length = 9528 Score = 39.3 bits (90), Expect = 0.32, Method: Composition-based stats. Identities = 32/124 (25%), Positives = 53/124 (42%), Gaps = 9/124 (7%) Query: 39 VIPILKDDKNDGEPRKPTPLDHSDSQHPPVLPNSENNAHGDEPIKKSEKPTFRRNQPVIE 98 V+P ++ E + P P+D P LP E +HG K SE + + Sbjct: 5150 VVPEKISEEKVAEIKTPEPMDSKAKSKPDGLPADE-KSHG---AKVSESVPVKNEAEKTD 5205 Query: 99 NL--RKPTEAEKEKILAARDRYQKTNNEEAIASIIWHRYQHMDYKGKTEQEKRALAKQSR 156 L +KPT +++ ++ R Y E+ SI Y+ MD + K +E R+ ++ Sbjct: 5206 QLSAKKPTVLDEDLVVPKRKPYLA---EQTADSISLQTYKSMDSEYKDRKESRSAKRKPT 5262 Query: 157 DDFQ 160 D Q Sbjct: 5263 VDIQ 5266 >gi|170104810|ref|XP_001883618.1| predicted protein [Laccaria bicolor S238N-H82] gi|164641253|gb|EDR05514.1| predicted protein [Laccaria bicolor S238N-H82] Length = 1137 Score = 38.9 bits (89), Expect = 0.34, Method: Composition-based stats. Identities = 22/76 (28%), Positives = 32/76 (42%), Gaps = 5/76 (6%) Query: 17 SVLIPVVLGVSNCDHSDSQHPPVIPILKDDKNDGEPRKPTPLDHSDSQHPPVLPNSENNA 76 S ++P L S C S P + D++ G P PTPL S P +P + Sbjct: 872 SPMLPTPLPTSKCPELPSMPAPTM-----DQDGGSPMLPTPLPTSKRPELPSMPAPTMDQ 926 Query: 77 HGDEPIKKSEKPTFRR 92 G P+ + PT +R Sbjct: 927 DGGSPMLPTPLPTSKR 942 >gi|320544022|ref|NP_001188954.1| Stretchin-Mlck, isoform O [Drosophila melanogaster] gi|318068624|gb|ADV37200.1| Stretchin-Mlck, isoform O [Drosophila melanogaster] Length = 9354 Score = 38.9 bits (89), Expect = 0.36, Method: Composition-based stats. Identities = 32/124 (25%), Positives = 53/124 (42%), Gaps = 9/124 (7%) Query: 39 VIPILKDDKNDGEPRKPTPLDHSDSQHPPVLPNSENNAHGDEPIKKSEKPTFRRNQPVIE 98 V+P ++ E + P P+D P LP E +HG K SE + + Sbjct: 5150 VVPEKISEEKVAEIKTPEPMDSKAKSKPDGLPADE-KSHG---AKVSESVPVKNEAEKTD 5205 Query: 99 NL--RKPTEAEKEKILAARDRYQKTNNEEAIASIIWHRYQHMDYKGKTEQEKRALAKQSR 156 L +KPT +++ ++ R Y E+ SI Y+ MD + K +E R+ ++ Sbjct: 5206 QLSAKKPTVLDEDLVVPKRKPYLA---EQTADSISLQTYKSMDSEYKDRKESRSAKRKPT 5262 Query: 157 DDFQ 160 D Q Sbjct: 5263 VDIQ 5266 >gi|300681006|sp|Q0CT11|CREB_ASPTN RecName: Full=Probable ubiquitin carboxyl-terminal hydrolase creB; AltName: Full=Carbon catabolite repression protein B; AltName: Full=Deubiquitinating enzyme creB; AltName: Full=Ubiquitin thiolesterase creB; AltName: Full=Ubiquitin-hydrolyzing enzyme creB; AltName: Full=Ubiquitin-specific-processing protease creB Length = 768 Score = 38.9 bits (89), Expect = 0.39, Method: Composition-based stats. Identities = 28/123 (22%), Positives = 52/123 (42%), Gaps = 15/123 (12%) Query: 52 PRKPTPLDHSDSQHPPVLPNSENNAHGDEPIKKSEKPTFRRNQPVIENLRKPTEAEKEKI 111 P PT +H+D+ P +P A PI ++ + +++ ++ + EKE+ Sbjct: 534 PSLPTHYEHTDTNSAPSIPKPAM-APPVPPIPETHSMPLSPKKSDLQSRKERAKEEKERK 592 Query: 112 LAARDRYQKTNNEEAIASIIWHRYQHMDYKGKTEQEKRALAKQSRDDFQRYATAQAANQK 171 A ++R ++ E+ + EQE RA Q R+D + A +A+ Sbjct: 593 AAEKERERQRRKEQEAT--------------RKEQEARARENQRREDAELKAALEASRAS 638 Query: 172 AAD 174 AD Sbjct: 639 KAD 641 >gi|311978029|ref|YP_003987149.1| hypothetical protein MIMI_gp0678 [Acanthamoeba polyphaga mimivirus] gi|82000406|sp|Q5UR77|YL631_MIMIV RecName: Full=Uncharacterized protein L631 gi|55417242|gb|AAV50892.1| unknown [Acanthamoeba polyphaga mimivirus] gi|308204510|gb|ADO18311.1| hypothetical protein [Acanthamoeba polyphaga mimivirus] Length = 219 Score = 38.9 bits (89), Expect = 0.43, Method: Composition-based stats. Identities = 28/86 (32%), Positives = 42/86 (48%), Gaps = 16/86 (18%) Query: 55 PTPLDHS-----DSQHPPVLPNSENNAHGDEPIKKSEKPTFRRNQPVIENLRKPTEAEKE 109 P P+ S DS HP V+ +N+ P K+E T QP IE+ ++PT ++E Sbjct: 51 PIPVTTSKSSQLDSAHPTVVHIGDNH-----PEPKNESKT----QPKIESKKEPTLKQEE 101 Query: 110 KILAARDRYQKTNNEEAIASIIWHRY 135 + + A + QK EE S + RY Sbjct: 102 QTIQAEEEAQKIAKEETRESFL--RY 125 >gi|171695444|ref|XP_001912646.1| hypothetical protein [Podospora anserina S mat+] gi|170947964|emb|CAP60128.1| unnamed protein product [Podospora anserina S mat+] Length = 578 Score = 38.6 bits (88), Expect = 0.45, Method: Composition-based stats. Identities = 38/146 (26%), Positives = 47/146 (32%), Gaps = 28/146 (19%) Query: 51 EPRKPTPL-----DHSDSQHPPVLPNSENNAHGDEPIKKSEKPTFRRNQPVIENLRKPTE 105 EPR PTPL DHS Q P +P S H S + RRNQ Sbjct: 37 EPRSPTPLGGDLYDHSPVQDRPPIPRSHTPGH------LSSRSQDRRNQ----------- 79 Query: 106 AEKEKILAARDRYQKT---NNEEAIASIIWHRYQHMDYKGKTEQEKRALAKQSRDDFQRY 162 LA R R Q + A A+ I H Y + K + Q Sbjct: 80 ---SGDLALRQRSQSAAGHTSSRAAAAGIPHSYSTTSFSSNPSASKAPTPNSAPPSRQPT 136 Query: 163 ATAQAANQKAADMLMLATYGLQYDDS 188 T N + + + T G DS Sbjct: 137 ITENGDNNSTSSLTVNLTLGRDRGDS 162 >gi|195128987|ref|XP_002008940.1| GI11530 [Drosophila mojavensis] gi|193920549|gb|EDW19416.1| GI11530 [Drosophila mojavensis] Length = 897 Score = 38.6 bits (88), Expect = 0.45, Method: Composition-based stats. Identities = 26/92 (28%), Positives = 37/92 (40%), Gaps = 6/92 (6%) Query: 21 PVVLGVSNCDHSDSQHPPVIPILKDDKNDGEP---RKPTPLDHSDSQHPPVLPNSENNAH 77 P G S + P L D +DGEP R P+ + H + ++N Sbjct: 3 PTQSGGSGTSAAKMGEPIEYVTLISDDSDGEPSPKRNPSGSGSQAASHTKAKFDDDSN-- 60 Query: 78 GDEPIKKSEKPTFRRNQPVIENLRKPTEAEKE 109 D P E+ T RRN+P ++ KP A E Sbjct: 61 -DTPATSDERRTSRRNKPKVDYTNKPATASAE 91 >gi|115389692|ref|XP_001212351.1| ubiquitin carboxyl-terminal hydrolase 12 [Aspergillus terreus NIH2624] gi|114194747|gb|EAU36447.1| ubiquitin carboxyl-terminal hydrolase 12 [Aspergillus terreus NIH2624] Length = 757 Score = 38.6 bits (88), Expect = 0.46, Method: Composition-based stats. Identities = 28/123 (22%), Positives = 52/123 (42%), Gaps = 15/123 (12%) Query: 52 PRKPTPLDHSDSQHPPVLPNSENNAHGDEPIKKSEKPTFRRNQPVIENLRKPTEAEKEKI 111 P PT +H+D+ P +P A PI ++ + +++ ++ + EKE+ Sbjct: 523 PSLPTHYEHTDTNSAPSIPKPAM-APPVPPIPETHSMPLSPKKSDLQSRKERAKEEKERK 581 Query: 112 LAARDRYQKTNNEEAIASIIWHRYQHMDYKGKTEQEKRALAKQSRDDFQRYATAQAANQK 171 A ++R ++ E+ + EQE RA Q R+D + A +A+ Sbjct: 582 AAEKERERQRRKEQEAT--------------RKEQEARARENQRREDAELKAALEASRAS 627 Query: 172 AAD 174 AD Sbjct: 628 KAD 630 >gi|291451374|ref|ZP_06590764.1| predicted protein [Streptomyces albus J1074] gi|291354323|gb|EFE81225.1| predicted protein [Streptomyces albus J1074] Length = 458 Score = 38.6 bits (88), Expect = 0.48, Method: Composition-based stats. Identities = 19/59 (32%), Positives = 27/59 (45%), Gaps = 1/59 (1%) Query: 46 DKNDGEPRKPTPLDHSDSQHPPVLPNSENNAHGD-EPIKKSEKPTFRRNQPVIENLRKP 103 D+ DG R P + + H PV P ++ GD P E+P +R QP +E P Sbjct: 97 DEGDGGHRGDGPAEGGEGVHRPVEPGADRGPGGDLAPQPGQERPALQRGQPRLERRHVP 155 >gi|85095476|ref|XP_960098.1| hypothetical protein NCU05838 [Neurospora crassa OR74A] gi|28921557|gb|EAA30862.1| predicted protein [Neurospora crassa OR74A] Length = 417 Score = 38.6 bits (88), Expect = 0.53, Method: Composition-based stats. Identities = 24/100 (24%), Positives = 45/100 (45%), Gaps = 14/100 (14%) Query: 31 HSDSQHPPVIPILKDD----KNDGEP---------RKPTPLDHSDSQHPPVLPNSENNAH 77 H+ S + P+ P++ + K+ G P + P +H D+QH L + A Sbjct: 24 HTKSFNSPIAPLVTRNAQLQKSVGSPLVFRSTFSSKPPIQPNHIDTQHEKELAQQKLKAD 83 Query: 78 GDE-PIKKSEKPTFRRNQPVIENLRKPTEAEKEKILAARD 116 + + S +P F ++QP + PTE+ K+ + +D Sbjct: 84 PEHISVDSSVRPFFEQDQPTAAKAKDPTESLKDDLGLVKD 123 >gi|296809175|ref|XP_002844926.1| conserved hypothetical protein [Arthroderma otae CBS 113480] gi|238844409|gb|EEQ34071.1| conserved hypothetical protein [Arthroderma otae CBS 113480] Length = 1156 Score = 38.6 bits (88), Expect = 0.53, Method: Composition-based stats. Identities = 19/69 (27%), Positives = 31/69 (44%), Gaps = 5/69 (7%) Query: 57 PLDHSDSQHPPVLPNSENNAHGDEPIKKSEKPTFRRNQPVIENLRKPTEAEKEKILAARD 116 P + +DS + P N H DEP P NQP I+++R PT + + A Sbjct: 844 PNEQNDSARQTLPPVPHNMKHTDEP-----APAGHPNQPPIQDIRMPTNCQSSSLNKALT 898 Query: 117 RYQKTNNEE 125 R+ + ++ Sbjct: 899 RFPRVQEQK 907 >gi|150865331|ref|XP_001384499.2| hypothetical protein PICST_31520 [Scheffersomyces stipitis CBS 6054] gi|149386587|gb|ABN66470.2| predicted protein [Scheffersomyces stipitis CBS 6054] Length = 240 Score = 38.2 bits (87), Expect = 0.59, Method: Composition-based stats. Identities = 36/135 (26%), Positives = 55/135 (40%), Gaps = 16/135 (11%) Query: 54 KPTPLDHSDSQHPPVLPNSENNAHG--DEPIKKSEKPTFRRNQPVIENLRKPTEAEKEKI 111 +P PL + S+ P+L +S AH DE E TFR + V +N+RKP E Sbjct: 7 EPLPLPYKTSK--PIL-SSVQTAHKLEDEDETAVETKTFRHQRKVSDNIRKPIRLET--- 60 Query: 112 LAARDRYQKTNNEEAIASIIWHRYQHMDYKGKTEQEKRALAKQSRD--DFQRYATAQAAN 169 R N+E ++ +H D K R + RD D +R + Sbjct: 61 ----GRTTSKANDELRDRLLMRSQEHFDSNEKITPVSRTVTSIRRDLLDIERLLKTE--R 114 Query: 170 QKAADMLMLATYGLQ 184 +++ L LA L+ Sbjct: 115 EQSLSSLQLAKLELE 129 >gi|323348138|gb|EGA82392.1| Syg1p [Saccharomyces cerevisiae Lalvin QA23] Length = 902 Score = 38.2 bits (87), Expect = 0.65, Method: Composition-based stats. Identities = 19/71 (26%), Positives = 35/71 (49%), Gaps = 6/71 (8%) Query: 67 PVLPNSENNAHGDEPIKKSEKPTFRRNQPVIENLRK------PTEAEKEKILAARDRYQK 120 P+ P+ +NN H + + TFRR V EN+ + T+ ++ + DR + Sbjct: 832 PITPSHDNNPHSFAEPMPAYRGTFRRRSSVFENISRSIPWAHATDFQRPTVNTVDDRSPE 891 Query: 121 TNNEEAIASII 131 T++E + SI+ Sbjct: 892 TDSESEVESIM 902 >gi|256269788|gb|EEU05054.1| Syg1p [Saccharomyces cerevisiae JAY291] Length = 902 Score = 38.2 bits (87), Expect = 0.65, Method: Composition-based stats. Identities = 19/71 (26%), Positives = 35/71 (49%), Gaps = 6/71 (8%) Query: 67 PVLPNSENNAHGDEPIKKSEKPTFRRNQPVIENLRK------PTEAEKEKILAARDRYQK 120 P+ P+ +NN H + + TFRR V EN+ + T+ ++ + DR + Sbjct: 832 PITPSHDNNPHSFAEPMPAYRGTFRRRSSVFENISRSIPWAHATDFQRPTVNTVDDRSPE 891 Query: 121 TNNEEAIASII 131 T++E + SI+ Sbjct: 892 TDSESEVESIM 902 >gi|207344312|gb|EDZ71499.1| YIL047Cp-like protein [Saccharomyces cerevisiae AWRI1631] Length = 457 Score = 38.2 bits (87), Expect = 0.65, Method: Composition-based stats. Identities = 19/71 (26%), Positives = 35/71 (49%), Gaps = 6/71 (8%) Query: 67 PVLPNSENNAHGDEPIKKSEKPTFRRNQPVIENLRK------PTEAEKEKILAARDRYQK 120 P+ P+ +NN H + + TFRR V EN+ + T+ ++ + DR + Sbjct: 387 PITPSHDNNPHSFAEPMPAYRGTFRRRSSVFENISRSIPWAHATDFQRPTVNTVDDRSPE 446 Query: 121 TNNEEAIASII 131 T++E + SI+ Sbjct: 447 TDSESEVESIM 457 >gi|190406267|gb|EDV09534.1| protein SYG1 [Saccharomyces cerevisiae RM11-1a] gi|259147210|emb|CAY80463.1| Syg1p [Saccharomyces cerevisiae EC1118] Length = 902 Score = 38.2 bits (87), Expect = 0.65, Method: Composition-based stats. Identities = 19/71 (26%), Positives = 35/71 (49%), Gaps = 6/71 (8%) Query: 67 PVLPNSENNAHGDEPIKKSEKPTFRRNQPVIENLRK------PTEAEKEKILAARDRYQK 120 P+ P+ +NN H + + TFRR V EN+ + T+ ++ + DR + Sbjct: 832 PITPSHDNNPHSFAEPMPAYRGTFRRRSSVFENISRSIPWAHATDFQRPTVNTVDDRSPE 891 Query: 121 TNNEEAIASII 131 T++E + SI+ Sbjct: 892 TDSESEVESIM 902 >gi|151943112|gb|EDN61447.1| conserved protein [Saccharomyces cerevisiae YJM789] Length = 902 Score = 38.2 bits (87), Expect = 0.65, Method: Composition-based stats. Identities = 19/71 (26%), Positives = 35/71 (49%), Gaps = 6/71 (8%) Query: 67 PVLPNSENNAHGDEPIKKSEKPTFRRNQPVIENLRK------PTEAEKEKILAARDRYQK 120 P+ P+ +NN H + + TFRR V EN+ + T+ ++ + DR + Sbjct: 832 PITPSHDNNPHSFAEPMPAYRGTFRRRSSVFENISRSIPWAHATDFQRPTVNTVDDRSPE 891 Query: 121 TNNEEAIASII 131 T++E + SI+ Sbjct: 892 TDSESEVESIM 902 >gi|6322142|ref|NP_012217.1| Syg1p [Saccharomyces cerevisiae S288c] gi|731805|sp|P40528|SYG1_YEAST RecName: Full=Protein SYG1 gi|600001|emb|CAA86904.1| Syg1p [Saccharomyces cerevisiae] gi|643449|gb|AAA91621.1| Syg1p [Saccharomyces cerevisiae] gi|285812601|tpg|DAA08500.1| TPA: Syg1p [Saccharomyces cerevisiae S288c] Length = 902 Score = 38.2 bits (87), Expect = 0.65, Method: Composition-based stats. Identities = 19/71 (26%), Positives = 35/71 (49%), Gaps = 6/71 (8%) Query: 67 PVLPNSENNAHGDEPIKKSEKPTFRRNQPVIENLRK------PTEAEKEKILAARDRYQK 120 P+ P+ +NN H + + TFRR V EN+ + T+ ++ + DR + Sbjct: 832 PITPSHDNNPHSFAEPMPAYRGTFRRRSSVFENISRSIPWAHATDFQRPTVNTVDDRSPE 891 Query: 121 TNNEEAIASII 131 T++E + SI+ Sbjct: 892 TDSESEVESIM 902 >gi|123477278|ref|XP_001321807.1| hypothetical protein [Trichomonas vaginalis G3] gi|121904641|gb|EAY09584.1| hypothetical protein TVAG_056280 [Trichomonas vaginalis G3] Length = 1677 Score = 38.2 bits (87), Expect = 0.75, Method: Composition-based stats. Identities = 21/74 (28%), Positives = 35/74 (47%), Gaps = 3/74 (4%) Query: 38 PVIPILKDDKNDGEPRKPTPLDHSDSQHP--PVLPNSENNAHGDEPIKKSEKPTFRRNQP 95 PV P + +P PTP + +Q+P P E + +EP + E+PT +P Sbjct: 1482 PVDPTPDPENPTPDPENPTPDPENPTQNPEEPTQNPEEPTQNPEEPTQNPEEPTQNPEEP 1541 Query: 96 VIENLRKPTEAEKE 109 +N +PT+ +E Sbjct: 1542 T-QNPEEPTQNPEE 1554 >gi|320544024|ref|NP_001188955.1| Stretchin-Mlck, isoform P [Drosophila melanogaster] gi|318068625|gb|ADV37201.1| Stretchin-Mlck, isoform P [Drosophila melanogaster] Length = 7779 Score = 37.8 bits (86), Expect = 0.79, Method: Composition-based stats. Identities = 32/124 (25%), Positives = 53/124 (42%), Gaps = 9/124 (7%) Query: 39 VIPILKDDKNDGEPRKPTPLDHSDSQHPPVLPNSENNAHGDEPIKKSEKPTFRRNQPVIE 98 V+P ++ E + P P+D P LP E +HG K SE + + Sbjct: 5150 VVPEKISEEKVAEIKTPEPMDSKAKSKPDGLPADE-KSHG---AKVSESVPVKNEAEKTD 5205 Query: 99 NL--RKPTEAEKEKILAARDRYQKTNNEEAIASIIWHRYQHMDYKGKTEQEKRALAKQSR 156 L +KPT +++ ++ R Y E+ SI Y+ MD + K +E R+ ++ Sbjct: 5206 QLSAKKPTVLDEDLVVPKRKPYLA---EQTADSISLQTYKSMDSEYKDRKESRSAKRKPT 5262 Query: 157 DDFQ 160 D Q Sbjct: 5263 VDIQ 5266 >gi|24653978|ref|NP_725510.1| Stretchin-Mlck, isoform D [Drosophila melanogaster] gi|21645322|gb|AAF58087.2| Stretchin-Mlck, isoform D [Drosophila melanogaster] Length = 7210 Score = 37.8 bits (86), Expect = 0.79, Method: Composition-based stats. Identities = 32/124 (25%), Positives = 53/124 (42%), Gaps = 9/124 (7%) Query: 39 VIPILKDDKNDGEPRKPTPLDHSDSQHPPVLPNSENNAHGDEPIKKSEKPTFRRNQPVIE 98 V+P ++ E + P P+D P LP E +HG K SE + + Sbjct: 5150 VVPEKISEEKVAEIKTPEPMDSKAKSKPDGLPADE-KSHG---AKVSESVPVKNEAEKTD 5205 Query: 99 NL--RKPTEAEKEKILAARDRYQKTNNEEAIASIIWHRYQHMDYKGKTEQEKRALAKQSR 156 L +KPT +++ ++ R Y E+ SI Y+ MD + K +E R+ ++ Sbjct: 5206 QLSAKKPTVLDEDLVVPKRKPYLA---EQTADSISLQTYKSMDSEYKDRKESRSAKRKPT 5262 Query: 157 DDFQ 160 D Q Sbjct: 5263 VDIQ 5266 >gi|260817987|ref|XP_002603866.1| hypothetical protein BRAFLDRAFT_205898 [Branchiostoma floridae] gi|229289190|gb|EEN59877.1| hypothetical protein BRAFLDRAFT_205898 [Branchiostoma floridae] Length = 502 Score = 37.8 bits (86), Expect = 0.87, Method: Composition-based stats. Identities = 25/89 (28%), Positives = 40/89 (44%), Gaps = 1/89 (1%) Query: 89 TFRRNQPV-IENLRKPTEAEKEKILAARDRYQKTNNEEAIASIIWHRYQHMDYKGKTEQE 147 T R PV +E + E+ KIL R Q +A W R++ ++ K E+E Sbjct: 119 TMRPAAPVKLEGIESEIYKERLKILTPRQTDQNLGKLQARFDDKWARHEQTMFREKMEEE 178 Query: 148 KRALAKQSRDDFQRYATAQAANQKAADML 176 +R QS + + A+ A QK ++L Sbjct: 179 QRYRRLQSEESQKAVEKARMARQKQTELL 207 >gi|161076243|ref|NP_001104465.1| Glutamine:fructose-6-phosphate aminotransferase 1, isoform H [Drosophila melanogaster] gi|158529700|gb|EDP28058.1| Glutamine:fructose-6-phosphate aminotransferase 1, isoform H [Drosophila melanogaster] Length = 685 Score = 37.8 bits (86), Expect = 0.95, Method: Composition-based stats. Identities = 26/98 (26%), Positives = 47/98 (47%), Gaps = 7/98 (7%) Query: 21 PVVLGVSNCDHSDSQHPPVIPILKDDKNDGEPRKPTPLDHSDSQHPPVLPNSENNAHGDE 80 P+++G+ + H P++ D + G+P+ P H S+ PVLP SE+ + Sbjct: 198 PLLVGIKTKTRLATDHIPIL--YGKDADSGKPQDIRP--HGQSRELPVLPRSESTSEF-M 252 Query: 81 PIKKSEKPTFRRN--QPVIENLRKPTEAEKEKILAARD 116 P+++ E F + VIE+ + E + + A RD Sbjct: 253 PLEEKEVEYFFASDASAVIEHTNRVIYLEDDDVAAVRD 290 >gi|134082410|emb|CAK42424.1| unnamed protein product [Aspergillus niger] Length = 2165 Score = 37.8 bits (86), Expect = 0.95, Method: Composition-based stats. Identities = 25/82 (30%), Positives = 38/82 (46%), Gaps = 14/82 (17%) Query: 30 DHSDSQHPPVI-PILKDDKNDGEPRKPTPLDHSDSQHPPV-LPNSENNAHGDEPIKKSEK 87 +HS S+HPP P + ++ P + P +H S+HPP P SE+ G P+ + Sbjct: 1431 EHSPSEHPPSEHPPSEHPPSEHPPSEHPPSEHPPSEHPPSEHPPSEHPPAGQPPVDNNP- 1489 Query: 88 PTFRRNQPVIENLRKPTEAEKE 109 NQP+ P E +E Sbjct: 1490 -----NQPI------PDEGSRE 1500 >gi|21357621|ref|NP_649644.1| CG17919 [Drosophila melanogaster] gi|7298931|gb|AAF54136.1| CG17919 [Drosophila melanogaster] gi|18447224|gb|AAL68202.1| GH14494p [Drosophila melanogaster] gi|220944538|gb|ACL84812.1| CG17919-PA [synthetic construct] gi|220954412|gb|ACL89749.1| CG17919-PA [synthetic construct] Length = 202 Score = 37.8 bits (86), Expect = 0.95, Method: Composition-based stats. Identities = 21/64 (32%), Positives = 36/64 (56%), Gaps = 3/64 (4%) Query: 133 HRYQHMDYK--GKTEQEKRALAKQSRDDFQRYATAQ-AANQKAADMLMLATYGLQYDDSL 189 HRY + YK GK E ++ ++K+SR D +++ A+ A N + + + Y QYDD + Sbjct: 133 HRYVFLLYKQSGKLEFDEERVSKRSRKDRPKFSAAKFAINHELGNPIAGTFYQAQYDDYV 192 Query: 190 TKIQ 193 K+ Sbjct: 193 PKLH 196 >gi|195378586|ref|XP_002048064.1| GJ11550 [Drosophila virilis] gi|194155222|gb|EDW70406.1| GJ11550 [Drosophila virilis] Length = 900 Score = 37.8 bits (86), Expect = 0.96, Method: Composition-based stats. Identities = 28/100 (28%), Positives = 40/100 (40%), Gaps = 6/100 (6%) Query: 21 PVVLGVSNCDHSDSQHPPVIPILKDDKNDGEP---RKPTPLDHSDSQHPPVLPNSENNAH 77 P G S+ P L D +DGEP R PT SQ + ++ Sbjct: 3 PTQSGGSSTSTGKMGEPIEYVTLISDDSDGEPSPKRNPT---GGGSQAGNQTKSKFDDDS 59 Query: 78 GDEPIKKSEKPTFRRNQPVIENLRKPTEAEKEKILAARDR 117 D P E+ T RRN+P ++ KP A E A+ ++ Sbjct: 60 NDTPATSDERRTSRRNKPKVDYTNKPATASAENASASYEK 99 >gi|320544010|ref|NP_001188948.1| Stretchin-Mlck, isoform H [Drosophila melanogaster] gi|318068618|gb|ADV37194.1| Stretchin-Mlck, isoform H [Drosophila melanogaster] Length = 7294 Score = 37.4 bits (85), Expect = 1.1, Method: Composition-based stats. Identities = 32/124 (25%), Positives = 53/124 (42%), Gaps = 9/124 (7%) Query: 39 VIPILKDDKNDGEPRKPTPLDHSDSQHPPVLPNSENNAHGDEPIKKSEKPTFRRNQPVIE 98 V+P ++ E + P P+D P LP E +HG K SE + + Sbjct: 5150 VVPEKISEEKVAEIKTPEPMDSKAKSKPDGLPADE-KSHG---AKVSESVPVKNEAEKTD 5205 Query: 99 NL--RKPTEAEKEKILAARDRYQKTNNEEAIASIIWHRYQHMDYKGKTEQEKRALAKQSR 156 L +KPT +++ ++ R Y E+ SI Y+ MD + K +E R+ ++ Sbjct: 5206 QLSAKKPTVLDEDLVVPKRKPYLA---EQTADSISLQTYKSMDSEYKDRKESRSAKRKPT 5262 Query: 157 DDFQ 160 D Q Sbjct: 5263 VDIQ 5266 >gi|323307478|gb|EGA60749.1| Fol1p [Saccharomyces cerevisiae FostersO] Length = 743 Score = 37.0 bits (84), Expect = 1.6, Method: Composition-based stats. Identities = 16/46 (34%), Positives = 30/46 (65%) Query: 82 IKKSEKPTFRRNQPVIENLRKPTEAEKEKILAARDRYQKTNNEEAI 127 I+ SE+ RR+ P+I+ +R+ TE ++K++ + D Y+ +EAI Sbjct: 602 IQASEEEEIRRSIPLIKAIRESTELPQDKVILSIDTYRSNVAKEAI 647 >gi|323303322|gb|EGA57118.1| Fol1p [Saccharomyces cerevisiae FostersB] Length = 864 Score = 37.0 bits (84), Expect = 1.6, Method: Composition-based stats. Identities = 16/46 (34%), Positives = 30/46 (65%) Query: 82 IKKSEKPTFRRNQPVIENLRKPTEAEKEKILAARDRYQKTNNEEAI 127 I+ SE+ RR+ P+I+ +R+ TE ++K++ + D Y+ +EAI Sbjct: 602 IQASEEEEIRRSIPLIKAIRESTELPQDKVILSIDTYRSNVAKEAI 647 >gi|259149112|emb|CAY82354.1| Fol1p [Saccharomyces cerevisiae EC1118] Length = 864 Score = 37.0 bits (84), Expect = 1.6, Method: Composition-based stats. Identities = 16/46 (34%), Positives = 30/46 (65%) Query: 82 IKKSEKPTFRRNQPVIENLRKPTEAEKEKILAARDRYQKTNNEEAI 127 I+ SE+ RR+ P+I+ +R+ TE ++K++ + D Y+ +EAI Sbjct: 602 IQASEEEEIRRSIPLIKAIRESTELPQDKVILSIDTYRSNVAKEAI 647 >gi|256272358|gb|EEU07341.1| Fol1p [Saccharomyces cerevisiae JAY291] Length = 824 Score = 37.0 bits (84), Expect = 1.6, Method: Composition-based stats. Identities = 16/46 (34%), Positives = 30/46 (65%) Query: 82 IKKSEKPTFRRNQPVIENLRKPTEAEKEKILAARDRYQKTNNEEAI 127 I+ SE+ RR+ P+I+ +R+ TE ++K++ + D Y+ +EAI Sbjct: 562 IQASEEEEIRRSIPLIKAIRESTELPQDKVILSIDTYRSNVAKEAI 607 >gi|151944291|gb|EDN62569.1| dihydro-6-hydroxymethylpterin pyrophosphokinase [Saccharomyces cerevisiae YJM789] Length = 824 Score = 37.0 bits (84), Expect = 1.6, Method: Composition-based stats. Identities = 16/46 (34%), Positives = 30/46 (65%) Query: 82 IKKSEKPTFRRNQPVIENLRKPTEAEKEKILAARDRYQKTNNEEAI 127 I+ SE+ RR+ P+I+ +R+ TE ++K++ + D Y+ +EAI Sbjct: 562 IQASEEEEIRRSIPLIKAIRESTELPQDKVILSIDTYRSNVAKEAI 607 >gi|1255962|emb|CAA65488.1| unnamed protein product [Saccharomyces cerevisiae] gi|1302305|emb|CAA96163.1| unnamed protein product [Saccharomyces cerevisiae] Length = 864 Score = 37.0 bits (84), Expect = 1.6, Method: Composition-based stats. Identities = 16/46 (34%), Positives = 30/46 (65%) Query: 82 IKKSEKPTFRRNQPVIENLRKPTEAEKEKILAARDRYQKTNNEEAI 127 I+ SE+ RR+ P+I+ +R+ TE ++K++ + D Y+ +EAI Sbjct: 602 IQASEEEEIRRSIPLIKAIRESTELPQDKVILSIDTYRSNVAKEAI 647 >gi|88192442|pdb|2BMB|A Chain A, X-Ray Structure Of The Bifunctional 6-Hydroxymethyl-7,8- Dihydroxypterin Pyrophosphokinase Dihydropteroate Synthase From Saccharomyces Cerevisiae Length = 545 Score = 37.0 bits (84), Expect = 1.6, Method: Composition-based stats. Identities = 16/46 (34%), Positives = 30/46 (65%) Query: 82 IKKSEKPTFRRNQPVIENLRKPTEAEKEKILAARDRYQKTNNEEAI 127 I+ SE+ RR+ P+I+ +R+ TE ++K++ + D Y+ +EAI Sbjct: 283 IQASEEEEIRRSIPLIKAIRESTELPQDKVILSIDTYRSNVAKEAI 328 >gi|37362687|ref|NP_014143.2| Fol1p [Saccharomyces cerevisiae S288c] gi|73920209|sp|P53848|FOL1_YEAST RecName: Full=Folic acid synthesis protein FOL1; Includes: RecName: Full=Dihydroneopterin aldolase; Short=DHNA; AltName: Full=FASA; AltName: Full=FASB; Includes: RecName: Full=2-amino-4-hydroxy-6-hydroxymethyldihydropteridine pyrophosphokinase; AltName: Full=6-hydroxymethyl-7,8-dihydropterin pyrophosphokinase; Short=PPPK; AltName: Full=7, 8-dihydro-6-hydroxymethylpterin-pyrophosphokinase; Short=HPPK; AltName: Full=FASC; Includes: RecName: Full=Dihydropteroate synthase; Short=DHPS; AltName: Full=Dihydropteroate pyrophosphorylase; AltName: Full=FASD gi|190409229|gb|EDV12494.1| dihydro-6-hydroxymethylpterin pyrophosphokinase [Saccharomyces cerevisiae RM11-1a] gi|285814409|tpg|DAA10303.1| TPA: Fol1p [Saccharomyces cerevisiae S288c] Length = 824 Score = 37.0 bits (84), Expect = 1.6, Method: Composition-based stats. Identities = 16/46 (34%), Positives = 30/46 (65%) Query: 82 IKKSEKPTFRRNQPVIENLRKPTEAEKEKILAARDRYQKTNNEEAI 127 I+ SE+ RR+ P+I+ +R+ TE ++K++ + D Y+ +EAI Sbjct: 562 IQASEEEEIRRSIPLIKAIRESTELPQDKVILSIDTYRSNVAKEAI 607 >gi|328862188|gb|EGG11289.1| hypothetical protein MELLADRAFT_74019 [Melampsora larici-populina 98AG31] Length = 583 Score = 37.0 bits (84), Expect = 1.6, Method: Composition-based stats. Identities = 38/137 (27%), Positives = 59/137 (43%), Gaps = 6/137 (4%) Query: 52 PRKPTPLDHSDSQHPPVLPNSENNAHGDEPIKKSEKPTFRRNQPVIENLRKPTEAEKEKI 111 PRK T L+ + PP + N +NN P+ K R Q N R A+ + Sbjct: 238 PRKVTSLEQKEWMIPPCISNWKNNKGYTIPLDKRLAADGRGLQDTHINDRF---AQLSES 294 Query: 112 LAARDRYQKTNNEEAIASIIWHRYQHMDYKGKTEQEKRALAKQSRDDFQRYATAQAANQK 171 L DR+ + E S++ R + K + E+ R LA+++R++ AA+ Sbjct: 295 LYIADRHAR--EEVRQRSLLQQRLAQKE-KEQKEENLRLLAQRAREERSGIPRVVAASGG 351 Query: 172 AADMLMLATYGLQYDDS 188 A LA YG D+S Sbjct: 352 GAMPTTLAGYGSDSDES 368 >gi|317035721|ref|XP_001396887.2| hypothetical protein ANI_1_1368134 [Aspergillus niger CBS 513.88] Length = 1339 Score = 36.6 bits (83), Expect = 1.8, Method: Composition-based stats. Identities = 25/82 (30%), Positives = 38/82 (46%), Gaps = 14/82 (17%) Query: 30 DHSDSQHPPVI-PILKDDKNDGEPRKPTPLDHSDSQHPPV-LPNSENNAHGDEPIKKSEK 87 +HS S+HPP P + ++ P + P +H S+HPP P SE+ G P+ + Sbjct: 608 EHSPSEHPPSEHPPSEHPPSEHPPSEHPPSEHPPSEHPPSEHPPSEHPPAGQPPVDNNP- 666 Query: 88 PTFRRNQPVIENLRKPTEAEKE 109 NQP+ P E +E Sbjct: 667 -----NQPI------PDEGSRE 677 >gi|118098001|ref|XP_414870.2| PREDICTED: similar to PACT [Gallus gallus] Length = 1802 Score = 36.6 bits (83), Expect = 1.9, Method: Composition-based stats. Identities = 36/111 (32%), Positives = 54/111 (48%), Gaps = 12/111 (10%) Query: 44 KDDKNDGEPRKPTPLDHSDSQHPPVLPNSENNAHGDEPIK-KSEKPTFR--RNQPVIENL 100 KD+K G PRK P D HP P E A D P + KSEKP+ + +++ +E Sbjct: 1092 KDEKTLGTPRKVHPKVAKD--HPEARPAKEEKAKKDHPKELKSEKPSSKEDKSKKAVEK- 1148 Query: 101 RKPTEAEKEKILAARDRYQKTNNEEAIASIIWHRYQHMDY----KGKTEQE 147 K ++A+ EK R +K + E AS+ + + + KGKTE + Sbjct: 1149 SKSSDAKAEK--RKRKADEKADKEHEAASVKASKPETAESKTSPKGKTEPD 1197 >gi|326918492|ref|XP_003205522.1| PREDICTED: SH2 domain-containing protein 4A-like [Meleagris gallopavo] Length = 450 Score = 36.6 bits (83), Expect = 2.1, Method: Composition-based stats. Identities = 29/85 (34%), Positives = 42/85 (49%), Gaps = 18/85 (21%) Query: 106 AEKEKILAARDRYQKTNNEEAIASIIWHRYQHMDYKGKTEQEKR-ALAKQSRDDFQRYAT 164 +E +K+L QK++ E +Q K K EKR +LA+Q+RDD++R + Sbjct: 211 SESQKVL------QKSDENEP-------EWQESLRKSKAADEKRRSLARQARDDYRRLSL 257 Query: 165 AQAANQKAADMLMLATYG----LQY 185 A K AD+ AT G LQY Sbjct: 258 QGAHRGKQADISKNATAGDRRPLQY 282 >gi|195026318|ref|XP_001986230.1| GH21245 [Drosophila grimshawi] gi|193902230|gb|EDW01097.1| GH21245 [Drosophila grimshawi] Length = 846 Score = 36.6 bits (83), Expect = 2.1, Method: Composition-based stats. Identities = 21/72 (29%), Positives = 37/72 (51%), Gaps = 6/72 (8%) Query: 28 NCDHSDSQHPPVIPILKDDKNDGEPRKPTPLDHSDSQHPPVLPNSENNAHGDEPIKKSEK 87 N + + +PP+ P+ D+ + E + +PLD+S+S LP E N +E + E Sbjct: 473 NVLSASANNPPMPPLPVDESHGVELKNDSPLDNSES-----LPKPEANV-AEEVKEVKET 526 Query: 88 PTFRRNQPVIEN 99 P ++N +EN Sbjct: 527 PAAKKNLTKVEN 538 >gi|52486979|ref|NP_001004424.1| transforming, acidic coiled-coil containing protein 3 [Rattus norvegicus] Length = 585 Score = 36.2 bits (82), Expect = 2.3, Method: Composition-based stats. Identities = 26/74 (35%), Positives = 35/74 (47%), Gaps = 6/74 (8%) Query: 32 SDSQHPPVIPILKDDKNDGEPRKPTPLDHSDSQHPPVLPNSENNAHGDEPIKKSEKPTFR 91 S+ PPV P+L + P+ P PL S+ PPV P E G EP+ SE P Sbjct: 171 SEPVTPPVKPVL-----EPTPQGPEPLLDSEPVTPPVKPVLEPTPQGSEPLLDSE-PVTL 224 Query: 92 RNQPVIENLRKPTE 105 +PV E+ + E Sbjct: 225 PVEPVPESSHQGPE 238 Score = 34.7 bits (78), Expect = 7.4, Method: Composition-based stats. Identities = 24/76 (31%), Positives = 35/76 (46%), Gaps = 10/76 (13%) Query: 37 PPVIPILKDDKNDGEPRKPTPLDHSDSQHPPVLPNSENNAHGDEPIKKSEKPTFRRNQPV 96 PP+ P+ + P+ P PL S+ PPV P E G EP+ SE T P Sbjct: 152 PPIEPM-----PEPSPQGPEPLLDSEPVTPPVKPVLEPTPQGPEPLLDSEPVT-----PP 201 Query: 97 IENLRKPTEAEKEKIL 112 ++ + +PT E +L Sbjct: 202 VKPVLEPTPQGSEPLL 217 >gi|195480136|ref|XP_002101150.1| GE15780 [Drosophila yakuba] gi|194188674|gb|EDX02258.1| GE15780 [Drosophila yakuba] Length = 1872 Score = 36.2 bits (82), Expect = 2.3, Method: Composition-based stats. Identities = 18/75 (24%), Positives = 35/75 (46%), Gaps = 3/75 (4%) Query: 27 SNCDHSDSQHPPVIPILKDDKNDGEPRKPTPLDHSDSQHPPVLPNSENNAHGDEPIKKSE 86 S + S+ Q P + K ++ D E + P+D+S+++ P N +N +P K+ Sbjct: 764 STSEDSEDQQPETLEDSKQEQEDSEDEQLQPVDNSENEQQPTSSNKRDNLENKQPKKRKS 823 Query: 87 KPTFRRN---QPVIE 98 N +P++E Sbjct: 824 SACSDSNNSKKPLLE 838 >gi|268573178|ref|XP_002641566.1| Hypothetical protein CBG09865 [Caenorhabditis briggsae] Length = 490 Score = 36.2 bits (82), Expect = 2.6, Method: Composition-based stats. Identities = 32/120 (26%), Positives = 49/120 (40%), Gaps = 3/120 (2%) Query: 33 DSQHPPVIPILKDDKNDGEPRKPTPLDHSDSQHPPVLPNSENNAHGDEPIKKSEKPTFRR 92 D P I L+ E T ++ SD QH VL ++ HG K SE +RR Sbjct: 264 DDDESPRIKELESRIQKTEEEWTTRINESDQQHAIVLATTKAEMHGALEKKDSEIEQWRR 323 Query: 93 NQPVIENLRKPTEA---EKEKILAARDRYQKTNNEEAIASIIWHRYQHMDYKGKTEQEKR 149 V+E EK + L A ++ + N E I + + Q + + E++KR Sbjct: 324 KCAVLEQQDADANQRWNEKVEKLQAMNKALEAENNEMIDKLSEAKTQGVKAVLEEEEKKR 383 >gi|145546029|ref|XP_001458698.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124426519|emb|CAK91301.1| unnamed protein product [Paramecium tetraurelia] Length = 300 Score = 35.9 bits (81), Expect = 3.7, Method: Composition-based stats. Identities = 28/105 (26%), Positives = 49/105 (46%), Gaps = 2/105 (1%) Query: 97 IENLRKPTEAEKE--KILAARDRYQKTNNEEAIASIIWHRYQHMDYKGKTEQEKRALAKQ 154 IE +K E ++E ++ A Q N ++ I + ++DYK KTE++ + L KQ Sbjct: 69 IEEQKKQLEQKREMHQLEGATFHPQILNKDQRIRTPEQFYKDNIDYKNKTEKQIQQLIKQ 128 Query: 155 SRDDFQRYATAQAANQKAADMLMLATYGLQYDDSLTKIQDPPKME 199 D + + N+K+ M+ Y D L K Q+ K++ Sbjct: 129 KEDSINNRSCSPKLNKKSVQMVAQPFYDRLKDKQLEKEQNLLKIK 173 >gi|239609290|gb|EEQ86277.1| conserved hypothetical protein [Ajellomyces dermatitidis ER-3] Length = 972 Score = 35.5 bits (80), Expect = 3.8, Method: Composition-based stats. Identities = 30/131 (22%), Positives = 54/131 (41%), Gaps = 5/131 (3%) Query: 41 PILKDDKNDGEPRKPTPLDHSDSQHPPVLPNSE-NNAHGDEPIKKSEKPTFRRNQPVIEN 99 P K +NDG P +P + + P + + + N G + + +P F +P I Sbjct: 741 PSFKMVRNDGRPVRPNHVPSFNHPQPQIFDHEDFVNPRGPSSFRDASRPVFVTQEPPIRR 800 Query: 100 LRKPTEAEKEKILAARDRYQKTNNEEAIASIIWHRYQHMDYKGKTEQEKRALAKQSRDDF 159 P E + L + D + N ++A SI H Q + + ++ R + R F Sbjct: 801 KLLPDENTRISSLHSHDFVRPVNLQDADGSIS-HEPQGGPARPASPRKTRYV---PRGAF 856 Query: 160 QRYATAQAANQ 170 + Y +A +Q Sbjct: 857 KAYDRVRAESQ 867 >gi|261188282|ref|XP_002620557.1| conserved hypothetical protein [Ajellomyces dermatitidis SLH14081] gi|239593304|gb|EEQ75885.1| conserved hypothetical protein [Ajellomyces dermatitidis SLH14081] Length = 956 Score = 35.5 bits (80), Expect = 4.3, Method: Composition-based stats. Identities = 30/131 (22%), Positives = 54/131 (41%), Gaps = 5/131 (3%) Query: 41 PILKDDKNDGEPRKPTPLDHSDSQHPPVLPNSE-NNAHGDEPIKKSEKPTFRRNQPVIEN 99 P K +NDG P +P + + P + + + N G + + +P F +P I Sbjct: 738 PSFKMVRNDGRPVRPNHVPSFNHPQPQIFDHEDFVNPRGPSSFRDASRPVFVTQEPPIRR 797 Query: 100 LRKPTEAEKEKILAARDRYQKTNNEEAIASIIWHRYQHMDYKGKTEQEKRALAKQSRDDF 159 P E + L + D + N ++A SI H Q + + ++ R + R F Sbjct: 798 KLLPDENTRISSLHSHDFVRPVNLQDADGSIS-HEPQGGPARPASPRKTRYV---PRGAF 853 Query: 160 QRYATAQAANQ 170 + Y +A +Q Sbjct: 854 KAYDRVRAESQ 864 >gi|327357310|gb|EGE86167.1| hypothetical protein BDDG_09112 [Ajellomyces dermatitidis ATCC 18188] Length = 956 Score = 35.5 bits (80), Expect = 4.4, Method: Composition-based stats. Identities = 30/131 (22%), Positives = 54/131 (41%), Gaps = 5/131 (3%) Query: 41 PILKDDKNDGEPRKPTPLDHSDSQHPPVLPNSE-NNAHGDEPIKKSEKPTFRRNQPVIEN 99 P K +NDG P +P + + P + + + N G + + +P F +P I Sbjct: 738 PSFKMVRNDGRPVRPNHVPSFNHPQPQIFDHEDFVNPRGPSSFRDASRPVFVTQEPPIRR 797 Query: 100 LRKPTEAEKEKILAARDRYQKTNNEEAIASIIWHRYQHMDYKGKTEQEKRALAKQSRDDF 159 P E + L + D + N ++A SI H Q + + ++ R + R F Sbjct: 798 KLLPDENTRISSLHSHDFVRPVNLQDADGSIS-HEPQGGPARPASPRKTRYV---PRGAF 853 Query: 160 QRYATAQAANQ 170 + Y +A +Q Sbjct: 854 KAYDRVRAESQ 864 >gi|123489198|ref|XP_001325336.1| M protein [Trichomonas vaginalis G3] gi|121908234|gb|EAY13113.1| M protein, putative [Trichomonas vaginalis G3] Length = 260 Score = 35.5 bits (80), Expect = 4.4, Method: Composition-based stats. Identities = 21/65 (32%), Positives = 36/65 (55%), Gaps = 4/65 (6%) Query: 139 DYKGKTE---QEKRALAKQSRDDFQRYATAQAANQKAADMLMLATYGLQYD-DSLTKIQD 194 D KGK + +E + LA+Q D+F ++ TA+ K+ + Y L D D+L ++ Sbjct: 185 DLKGKQQKELEEAKKLAQQQLDEFHKFITAELTPYKSKLQSLKQQYQLAQDKDALWNVKH 244 Query: 195 PPKME 199 PP+M+ Sbjct: 245 PPQMK 249 >gi|260823692|ref|XP_002606214.1| hypothetical protein BRAFLDRAFT_105004 [Branchiostoma floridae] gi|229291554|gb|EEN62224.1| hypothetical protein BRAFLDRAFT_105004 [Branchiostoma floridae] Length = 1819 Score = 35.5 bits (80), Expect = 4.6, Method: Composition-based stats. Identities = 22/92 (23%), Positives = 41/92 (44%), Gaps = 9/92 (9%) Query: 59 DHSDSQHPPVLPNSENNAHGDEPIKKSEKPTFRRNQPVIEN-----LRKPTEAEKEKILA 113 D + S P + N + ++P +E PT R+ P + + + PT EK++ A Sbjct: 1317 DPTVSHEDPTVSNEDPTVSHEDPTVSNEDPTVSRDCPAVSHGDPVGSQDPTTREKQRRKA 1376 Query: 114 ARDRYQKTNNEEAIASIIWHRYQHMDYKGKTE 145 RY + +++ IW + Q +G +E Sbjct: 1377 PARRYSSSRSDDG----IWEKLQRWVSEGSSE 1404 >gi|312219787|emb|CBX99729.1| hypothetical protein [Leptosphaeria maculans] Length = 1242 Score = 35.5 bits (80), Expect = 4.8, Method: Composition-based stats. Identities = 41/166 (24%), Positives = 66/166 (39%), Gaps = 28/166 (16%) Query: 55 PTPLDHSDSQHPPVLPNSENNAHGDEPIK-KSEKPTFRRN-QPVIENLRKPTEAEKEKIL 112 P P HS S +P L + +NN + DE S P R+N QP+I + +P + Sbjct: 697 PAPSTHSHS-NPWDLDDIDNNPYRDEQASMASTSPLERQNRQPLILSATEPANGAPWDLT 755 Query: 113 AARDRYQKTNNE-----EAIASIIWHRYQHMDYKGKTEQEKRAL---------------- 151 R+ + +NE EA + HR + + + E R+ Sbjct: 756 PRRNHERIASNETEAEHEAFQKDLAHRQRLIQENLRARAEGRSTSPAPAPGPSGGLKTAL 815 Query: 152 ----AKQSRDDFQRYATAQAANQKAADMLMLATYGLQYDDSLTKIQ 193 AK SR+ F A Q++N+ + + A+Y SL +Q Sbjct: 816 NMLRAKSSRESFATVAEQQSSNKSIRKLTVGASYANGSSTSLAGMQ 861 >gi|1141708|gb|AAA84448.1| g-RICH [Carassius auratus] Length = 411 Score = 35.5 bits (80), Expect = 4.8, Method: Composition-based stats. Identities = 24/87 (27%), Positives = 39/87 (44%), Gaps = 15/87 (17%) Query: 33 DSQHPPVIPILKDDKNDGEPR---------KPTPLDHSDSQHPPVLPNSENNAHGDEPIK 83 DS+ PP P + + P KP + S+ PPV NSE +E +K Sbjct: 58 DSEAPPAKPSEPEVAPEKSPEETPAAESSAKPPEPEQKKSEEPPVQVNSEPEKQEEEAVK 117 Query: 84 KSEKPTFRRNQPVIENLRKPTEAEKEK 110 ++E ++P N KP E++K++ Sbjct: 118 EAE------SKPTAVNEAKPEESDKDE 138 >gi|306844607|ref|ZP_07477194.1| PAS/PAC sensor signal transduction histidine kinase [Brucella sp. BO1] gi|306275051|gb|EFM56814.1| PAS/PAC sensor signal transduction histidine kinase [Brucella sp. BO1] Length = 1036 Score = 35.1 bits (79), Expect = 4.9, Method: Composition-based stats. Identities = 26/93 (27%), Positives = 40/93 (43%), Gaps = 5/93 (5%) Query: 42 ILKDDKNDGEPRKPTPLDHSDSQHPPVLPNSENNA---HGDEPIKKSEKPTFRRNQPVIE 98 + + NDGE R+ L H+D PV N+ NA G + S P N PV Sbjct: 586 LFNSESNDGETRQSMVLRHADGSEEPV--NAHLNAIAWRGGRALMLSLMPVAAANLPVPA 643 Query: 99 NLRKPTEAEKEKILAARDRYQKTNNEEAIASII 131 L ++ E+++ L A KT + A ++ Sbjct: 644 ELPAASDDEEKQALEAHVEELKTILDTATDGVV 676 >gi|145551833|ref|XP_001461593.1| hypothetical protein [Paramecium tetraurelia strain d4-2] gi|124429428|emb|CAK94220.1| unnamed protein product [Paramecium tetraurelia] Length = 300 Score = 35.1 bits (79), Expect = 5.4, Method: Composition-based stats. Identities = 27/106 (25%), Positives = 51/106 (48%), Gaps = 2/106 (1%) Query: 97 IENLRKPTEAEKE--KILAARDRYQKTNNEEAIASIIWHRYQHMDYKGKTEQEKRALAKQ 154 IE +K + ++E ++ A Q N ++ I + ++DYK KTE++ + L KQ Sbjct: 69 IEEQKKRLDQKREMDQLEGATFHPQILNKDQKIRTPDQFYKDNLDYKNKTEKQVQQLIKQ 128 Query: 155 SRDDFQRYATAQAANQKAADMLMLATYGLQYDDSLTKIQDPPKMEE 200 D + + N+K+ M++ Y D L K Q+ K+++ Sbjct: 129 KEDQINNRSCSPKINKKSIQMVVQPFYDRLKDKQLEKEQNLLKIKK 174 >gi|301610113|ref|XP_002934596.1| PREDICTED: myosin-IXa [Xenopus (Silurana) tropicalis] Length = 2551 Score = 35.1 bits (79), Expect = 6.0, Method: Composition-based stats. Identities = 33/136 (24%), Positives = 50/136 (36%) Query: 21 PVVLGVSNCDHSDSQHPPVIPILKDDKNDGEPRKPTPLDHSDSQHPPVLPNSENNAHGDE 80 P++ V+ + S P I D +KP + S+S N H E Sbjct: 1411 PIISNVTEENASFHHTPSAASIAVDISQSSWRQKPQGFEISESAPAKSSLPVFNRTHTRE 1470 Query: 81 PIKKSEKPTFRRNQPVIENLRKPTEAEKEKILAARDRYQKTNNEEAIASIIWHRYQHMDY 140 P+ S PTF Q +K TE + + R QK N E+ + + Sbjct: 1471 PLLSSSLPTFYLPQQETVRTQKLTETGEIPTESVLQRLQKLNEEKEQQQKQLQQEKERQM 1530 Query: 141 KGKTEQEKRALAKQSR 156 + QEK+ L +Q R Sbjct: 1531 MEQIRQEKQELERQRR 1546 >gi|195540163|gb|AAI68016.1| Unknown (protein for IMAGE:7683330) [Xenopus (Silurana) tropicalis] Length = 2010 Score = 35.1 bits (79), Expect = 6.1, Method: Composition-based stats. Identities = 33/136 (24%), Positives = 50/136 (36%) Query: 21 PVVLGVSNCDHSDSQHPPVIPILKDDKNDGEPRKPTPLDHSDSQHPPVLPNSENNAHGDE 80 P++ V+ + S P I D +KP + S+S N H E Sbjct: 1411 PIISNVTEENASFHHTPSAASIAVDISQSSWRQKPQGFEISESAPAKSSLPVFNRTHTRE 1470 Query: 81 PIKKSEKPTFRRNQPVIENLRKPTEAEKEKILAARDRYQKTNNEEAIASIIWHRYQHMDY 140 P+ S PTF Q +K TE + + R QK N E+ + + Sbjct: 1471 PLLSSSLPTFYLPQQETVRTQKLTETGEIPTESVLQRLQKLNEEKEQQQKQLQQEKERQM 1530 Query: 141 KGKTEQEKRALAKQSR 156 + QEK+ L +Q R Sbjct: 1531 MEQIRQEKQELERQRR 1546 >gi|309361560|emb|CAP29412.2| hypothetical protein CBG_09865 [Caenorhabditis briggsae AF16] Length = 1676 Score = 35.1 bits (79), Expect = 6.3, Method: Composition-based stats. Identities = 32/120 (26%), Positives = 49/120 (40%), Gaps = 3/120 (2%) Query: 33 DSQHPPVIPILKDDKNDGEPRKPTPLDHSDSQHPPVLPNSENNAHGDEPIKKSEKPTFRR 92 D P I L+ E T ++ SD QH VL ++ HG K SE +RR Sbjct: 273 DDDESPRIKELESRIQKTEEEWTTRINESDQQHAIVLATTKAEMHGALEKKDSEIEQWRR 332 Query: 93 NQPVIENLRKPTEA---EKEKILAARDRYQKTNNEEAIASIIWHRYQHMDYKGKTEQEKR 149 V+E EK + L A ++ + N E I + + Q + + E++KR Sbjct: 333 KCAVLEQQDADANQRWNEKVEKLQAMNKALEAENNEMIDKLSEAKTQGVKAVLEEEEKKR 392 >gi|289620588|emb|CBI52949.1| unnamed protein product [Sordaria macrospora] Length = 848 Score = 34.7 bits (78), Expect = 6.4, Method: Composition-based stats. Identities = 22/88 (25%), Positives = 37/88 (42%), Gaps = 9/88 (10%) Query: 61 SDSQHPPVLPNSENNAHGDE--PIKKSEKPTFRRNQPVIENLRKPTEAEKEKILAARDRY 118 S +H ++P A GD P + S +P + LR+P A E++ + Sbjct: 220 SSIEHGAIMPGPTAVASGDGAGPSEPSARPNYTTKTIAQRRLRRPHSAHSEQLARLPSPF 279 Query: 119 QKTNNEEAIASIIWHRYQHMDYKGKTEQ 146 ++ H +QHMD+ GK +Q Sbjct: 280 FSLWSKR-------HHHQHMDHNGKPQQ 300 >gi|326427947|gb|EGD73517.1| hypothetical protein PTSG_05221 [Salpingoeca sp. ATCC 50818] Length = 423 Score = 34.7 bits (78), Expect = 6.5, Method: Composition-based stats. Identities = 22/76 (28%), Positives = 33/76 (43%), Gaps = 1/76 (1%) Query: 48 NDGEPRKPTPLDHSDSQHPPVLPNSENNAHGDEPIKKSEKPTFRRNQPVIENLRKPTEAE 107 N P TP ++ + PPVL + + KK ++P R++ P E TE Sbjct: 18 NSFTPAPCTPPPVAEFEAPPVLSTQQQQQQQQQQQKK-DRPKDRKDLPSDETSSSATEGN 76 Query: 108 KEKILAARDRYQKTNN 123 A RD+ QKT + Sbjct: 77 SGSTKAGRDKQQKTKD 92 >gi|148705482|gb|EDL37429.1| transforming, acidic coiled-coil containing protein 3, isoform CRA_b [Mus musculus] Length = 697 Score = 34.7 bits (78), Expect = 7.3, Method: Composition-based stats. Identities = 30/100 (30%), Positives = 45/100 (45%), Gaps = 10/100 (10%) Query: 15 IVSVLIPVVLGVSNCDHSDSQHPPVIPILKDDKNDGEPRKPTPLDHSDSQHPPVLPNSEN 74 I VL P G+ S+ PPV P+L+ + EP + S+ PP+ P E Sbjct: 215 IEPVLEPSHQGLEPVLESELVTPPVEPVLEPSHQELEP-----VLESELVTPPIEPVLEP 269 Query: 75 NAHGDEPIKKSEKPTFRRNQPVIENLRKPTEAEKEKILAA 114 + G EP+ SE T P IE + +P+ E +L + Sbjct: 270 SHQGLEPVLDSELVT-----PPIEPVLEPSHQGLEPVLES 304 >gi|156085192|ref|XP_001610079.1| ski-interacting protein [Babesia bovis T2Bo] gi|154797331|gb|EDO06511.1| ski-interacting protein, putative [Babesia bovis] Length = 458 Score = 34.7 bits (78), Expect = 7.5, Method: Composition-based stats. Identities = 29/109 (26%), Positives = 47/109 (43%), Gaps = 6/109 (5%) Query: 53 RKPTPLDHSDSQHPPVLPNSENNAHGDEPIKKSEKPTFRRNQPVIENLRKPTEAEKEKIL 112 RK T D + PP + N +N+ PI K + RR Q V N + E + Sbjct: 201 RKLTKEDQLAWKIPPCISNWKNSKGYTIPIDKRVQADGRRLQEVFVNEKFAVFGESLSLA 260 Query: 113 AARDRYQKTNNEEAIASIIWHRYQHMDYKGKTEQEKRALAKQSRDDFQR 161 ++T EE HR + + + E++ RALA ++R++ R Sbjct: 261 ------ERTAREEVRLRNEAHRMEKLKEAQEKEEQLRALAARAREERSR 303 >gi|196006143|ref|XP_002112938.1| hypothetical protein TRIADDRAFT_56576 [Trichoplax adhaerens] gi|190584979|gb|EDV25048.1| hypothetical protein TRIADDRAFT_56576 [Trichoplax adhaerens] Length = 902 Score = 34.7 bits (78), Expect = 7.7, Method: Composition-based stats. Identities = 28/104 (26%), Positives = 45/104 (43%), Gaps = 5/104 (4%) Query: 52 PRKPTPLDHSDSQHPPVLPNSENNAHGDEPIKKSEKPTFRRNQPVIENLRKPTEAEKEKI 111 P + T + +DS P P+SE+ DE K+ T R+N +I T + Sbjct: 691 PARLTSYEANDSLLFPTTPDSEHQRLVDELEKEILNKTLRQNPEIITENHNDTTINMSES 750 Query: 112 LAARDRYQKTN-NEEAIASIIWHRYQHMDYKGKTEQEKRALAKQ 154 L DR + + IA ++H H D+ G + + LA+Q Sbjct: 751 LPPNDRMEDSKLGHRTIADGVFH--SHSDFGG--DLSPQLLARQ 790 >gi|126335924|ref|XP_001375978.1| PREDICTED: similar to SET domain containing 2, [Monodelphis domestica] Length = 2576 Score = 34.7 bits (78), Expect = 7.8, Method: Composition-based stats. Identities = 33/117 (28%), Positives = 53/117 (45%), Gaps = 14/117 (11%) Query: 13 LGIVSVLIPVVLGVSNCDHSDSQHPPVIPILK----DDKNDGEPRKPTP-LDHSDSQHPP 67 LG+ S L LG S HS +PP P+ + N G+ PTP +D S P Sbjct: 2155 LGMTSPLPYESLGYSGPHHSFVGYPPGYPMQAYVDPSNPNAGKVLLPTPSMDPMCS--PA 2212 Query: 68 VLPNSENNAHGDEPIKKSEKPTFRRNQPVIENLRKPTEAEKEKILAARDR--YQKTN 122 +S+N H EP + +P PV++++ P E + +A + +Q++N Sbjct: 2213 TYEHSQNLGHSVEPSLTTPQPV-----PVVQHVATPMEVSTSQYVAQNEAVVHQESN 2264 >gi|296828264|ref|XP_002851302.1| shugoshin C terminal domain-containing protein [Arthroderma otae CBS 113480] gi|238838856|gb|EEQ28518.1| shugoshin C terminal domain-containing protein [Arthroderma otae CBS 113480] Length = 602 Score = 34.7 bits (78), Expect = 8.3, Method: Composition-based stats. Identities = 29/97 (29%), Positives = 40/97 (41%), Gaps = 12/97 (12%) Query: 38 PVIPILKDDKNDGEPRKPTP-----LDHS-DSQHPPVLPNSENNAHGDEPIKKSEKPTFR 91 PV L D DG + TP LD S++ P NSE A G P ++S P Sbjct: 399 PVEISLNDQSQDGLSSENTPKLENGLDKQWHSEYQPTTTNSERQARGARPSRRSRGPINY 458 Query: 92 RNQPVIENLRKPTE------AEKEKILAARDRYQKTN 122 + +R+PTE AE +R +Y T+ Sbjct: 459 AEPSLRGKMRRPTEDLVDAVAEHMTKRLSRSQYDHTD 495 >gi|313235468|emb|CBY19746.1| unnamed protein product [Oikopleura dioica] Length = 706 Score = 34.3 bits (77), Expect = 8.7, Method: Composition-based stats. Identities = 13/57 (22%), Positives = 31/57 (54%) Query: 71 NSENNAHGDEPIKKSEKPTFRRNQPVIENLRKPTEAEKEKILAARDRYQKTNNEEAI 127 +S+ ++ G+ K KP+ R+N+P R+ E +K+ L + +Y + +++ + Sbjct: 545 HSQPDSGGNRRQKSRRKPSVRKNKPTRSESRRSQEPKKDDFLEKKSQYDRLKHDKIV 601 >gi|83768016|dbj|BAE58155.1| unnamed protein product [Aspergillus oryzae] Length = 200 Score = 34.3 bits (77), Expect = 9.2, Method: Composition-based stats. Identities = 28/104 (26%), Positives = 40/104 (38%), Gaps = 18/104 (17%) Query: 52 PRKP-TPLDHSDSQHP--------PVLPNSENNAHGDEPIKKSEKPTF---------RRN 93 PR P TP+ H + P P NNAH + P KK + PT R+ Sbjct: 46 PRSPITPIQHHHNSPPTTQEGIQNPQCNKHPNNAHHEPPTKKLQNPTRTFAKPSPTKRKP 105 Query: 94 QPVIENLRKPTEAEKEKILAARDRYQKTNNEEAIASIIWHRYQH 137 QP ++ T AE E +++ N + + HR H Sbjct: 106 QPTYPQGKQETNAESENAYLRPNKHAPNENPPNLHPRLIHRPLH 149 Database: nr Posted date: May 22, 2011 12:22 AM Number of letters in database: 999,999,966 Number of sequences in database: 2,987,313 Database: /data/usr2/db/fasta/nr.01 Posted date: May 22, 2011 12:30 AM Number of letters in database: 999,999,796 Number of sequences in database: 2,903,041 Database: /data/usr2/db/fasta/nr.02 Posted date: May 22, 2011 12:36 AM Number of letters in database: 999,999,281 Number of sequences in database: 2,904,016 Database: /data/usr2/db/fasta/nr.03 Posted date: May 22, 2011 12:41 AM Number of letters in database: 999,999,960 Number of sequences in database: 2,935,328 Database: /data/usr2/db/fasta/nr.04 Posted date: May 22, 2011 12:46 AM Number of letters in database: 842,794,627 Number of sequences in database: 2,394,679 Lambda K H 0.311 0.129 0.370 Lambda K H 0.267 0.0401 0.140 Matrix: BLOSUM62 Gap Penalties: Existence: 11, Extension: 1 Number of Hits to DB: 3,917,790,186 Number of Sequences: 14124377 Number of extensions: 164074270 Number of successful extensions: 383613 Number of sequences better than 10.0: 246 Number of HSP's better than 10.0 without gapping: 33 Number of HSP's successfully gapped in prelim test: 626 Number of HSP's that attempted gapping in prelim test: 382926 Number of HSP's gapped (non-prelim): 1182 length of query: 200 length of database: 4,842,793,630 effective HSP length: 132 effective length of query: 68 effective length of database: 2,978,375,866 effective search space: 202529558888 effective search space used: 202529558888 T: 11 A: 40 X1: 16 ( 7.2 bits) X2: 38 (14.6 bits) X3: 64 (24.7 bits) S1: 42 (21.8 bits) S2: 77 (34.3 bits)