BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy9966
(1101 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|357624104|gb|EHJ75000.1| hypothetical protein KGM_18742 [Danaus plexippus]
Length = 595
Score = 694 bits (1791), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 311/435 (71%), Positives = 377/435 (86%), Gaps = 4/435 (0%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF ++VG+QS+VYTGDYNMTPDRHLGAAWIDKCRPDLLI+ESTYATTIRDSKRCRERDFL
Sbjct: 163 MFWIRVGSQSVVYTGDYNMTPDRHLGAAWIDKCRPDLLISESTYATTIRDSKRCRERDFL 222
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHECV++GGKVLIPVFALGRAQELCILLETYWERMNL+ P+YFA+GLTEKA NYYKMF
Sbjct: 223 KKVHECVEKGGKVLIPVFALGRAQELCILLETYWERMNLKYPVYFALGLTEKANNYYKMF 282
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
ITWTNQKIRKTFVQRNMFDFKHI+PFDKS+IDNPG MVVFATPGMLH+GLSL IFKKWAP
Sbjct: 283 ITWTNQKIRKTFVQRNMFDFKHIKPFDKSYIDNPGAMVVFATPGMLHAGLSLNIFKKWAP 342
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NMLIMPGFCVQGT+GHK+L+G KK+EFEN+Q+++VKMA+EYMSFSAHADAKGIMQLI
Sbjct: 343 YEQNMLIMPGFCVQGTVGHKILNGAKKIEFENRQVVEVKMAVEYMSFSAHADAKGIMQLI 402
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
QYCEPKNVLLVHGEA KM FLK+KI++EF + C+MPANGE+ + T KI IDVS+ LLK
Sbjct: 403 QYCEPKNVLLVHGEAQKMEFLKDKIEKEFKISCYMPANGETAIINTPTKIPIDVSLRLLK 462
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
EAV+YN++PP+P + R +HGVL +KD+ +S +D+DE C E+GI+RHI+RFTS V+ D+
Sbjct: 463 AEAVRYNAQPPDPKRRRVVHGVLCVKDNRLSFLDIDEMCDEIGINRHIIRFTSTVRFDDA 522
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLLSVERIDDKNKRVFVTWPNQ 420
KT E +L + L +K Q + I +++ ++S++SVL+ VE DD K ++V+W NQ
Sbjct: 523 GSAIKTAE----KLKTLLAEKLQGWSITISDGNISVESVLIKVEGEDDNTKSIYVSWTNQ 578
Query: 421 DEEVGKIVLHVLKSM 435
DE++G +L +L+SM
Sbjct: 579 DEDLGSYILGLLQSM 593
>gi|91086147|ref|XP_969343.1| PREDICTED: similar to CG1972 CG1972-PA [Tribolium castaneum]
gi|270009886|gb|EFA06334.1| hypothetical protein TcasGA2_TC009205 [Tribolium castaneum]
Length = 595
Score = 687 bits (1773), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 307/436 (70%), Positives = 379/436 (86%), Gaps = 4/436 (0%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF ++VG QS+VYTGDYNMTPDRHLGAAWIDKCRPDLLI+ESTYATTIRDSKRCRERDFL
Sbjct: 163 MFWIRVGAQSVVYTGDYNMTPDRHLGAAWIDKCRPDLLISESTYATTIRDSKRCRERDFL 222
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHEC+DRGGKVLIPVFALGRAQELCILLETYWERMNL+AP+YFA+GLTEKA NYYKMF
Sbjct: 223 KKVHECMDRGGKVLIPVFALGRAQELCILLETYWERMNLKAPVYFALGLTEKANNYYKMF 282
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
ITWTNQKIRKTFVQRNMFDFKHI+PFD+S+IDNPGPMVVFATPGMLH+GLSL IFKKWAP
Sbjct: 283 ITWTNQKIRKTFVQRNMFDFKHIKPFDRSYIDNPGPMVVFATPGMLHAGLSLQIFKKWAP 342
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
ENNM+IMPGFCVQGT+GHK+L+G K++EFENKQI++VKM++EYMSFSAHADAKGIMQLI
Sbjct: 343 NENNMVIMPGFCVQGTVGHKILNGAKRVEFENKQIVEVKMSVEYMSFSAHADAKGIMQLI 402
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
Q+CEP+NV+LVHGEA KM FLK+KI QEF+++C+ PANGE+C + T +KI IDVS+ LLK
Sbjct: 403 QHCEPRNVMLVHGEAEKMEFLKQKILQEFSINCYNPANGETCVISTPVKIPIDVSLPLLK 462
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
EA K+++ PP+P + R +HGVLV+K++++ LMDVD+ACKE GI+RHI+RF+S V ++D+
Sbjct: 463 SEARKFSALPPDPKRRRTLHGVLVMKENNVCLMDVDDACKEAGINRHIIRFSSTVHLNDT 522
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLLSVERIDDKNKRVFVTWPNQ 420
P T +L ++++ ++ + E +SI+SVL+ VE +D+ K V+V+W NQ
Sbjct: 523 GPALATAH----KLHGFIKENLPQWPVVFAEGEISIESVLVKVEGDEDEQKNVYVSWTNQ 578
Query: 421 DEEVGKIVLHVLKSMS 436
DE++G +L++L M
Sbjct: 579 DEDLGSYILNLLNRMG 594
>gi|242013971|ref|XP_002427672.1| Endoribonuclease YSH1, putative [Pediculus humanus corporis]
gi|212512102|gb|EEB14934.1| Endoribonuclease YSH1, putative [Pediculus humanus corporis]
Length = 572
Score = 685 bits (1767), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 312/434 (71%), Positives = 378/434 (87%), Gaps = 4/434 (0%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF ++VGN S+VYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL
Sbjct: 140 MFWIRVGNLSVVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 199
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHEC+++GGKVLIPVFALGRAQELCILLETYWERMNL+ PIYFAVGLTEKA NYYKMF
Sbjct: 200 KKVHECIEKGGKVLIPVFALGRAQELCILLETYWERMNLKVPIYFAVGLTEKANNYYKMF 259
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
ITWTNQKIRKTFVQRNMFDFKHI+PFD+S+ID PMVVFATPGMLH+GLSL IFKKWAP
Sbjct: 260 ITWTNQKIRKTFVQRNMFDFKHIKPFDRSYIDQAWPMVVFATPGMLHAGLSLQIFKKWAP 319
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
ENNM+IMPGFCV GT+GHK+LSG KK+EFEN+QI+DVKM+++YMSFSAHADAKGIMQLI
Sbjct: 320 NENNMVIMPGFCVAGTVGHKILSGQKKIEFENRQIVDVKMSVQYMSFSAHADAKGIMQLI 379
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
QYCEPKNVLLVHGEA+KM FL++KIKQEF+++C+MPANGE+C + KI ID+S+NLLK
Sbjct: 380 QYCEPKNVLLVHGEAAKMEFLRDKIKQEFHINCYMPANGETCTIDIPHKIPIDISLNLLK 439
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
E+ YN+ PN K R +HGVLV K+ S+ LM+V++ACKE G++RHI+RFTS V++ D
Sbjct: 440 SESQAYNALLPNLSKSRILHGVLVNKEHSLCLMNVEDACKEAGVTRHIIRFTSTVRMEDP 499
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLLSVERIDDKNKRVFVTWPNQ 420
P NK I ++LL +++K + Y IQ T+SS+S+++VL+ VE +D K V+V+W NQ
Sbjct: 500 RPPNK----IAEKLLKIVKEKLKNYSIQFTDSSISVETVLIKVEGDEDDRKCVYVSWTNQ 555
Query: 421 DEEVGKIVLHVLKS 434
DE++G +L++L++
Sbjct: 556 DEDLGSYILNLLQN 569
>gi|307215032|gb|EFN89859.1| Integrator complex subunit 11 [Harpegnathos saltator]
Length = 594
Score = 676 bits (1743), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 302/437 (69%), Positives = 372/437 (85%), Gaps = 4/437 (0%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V+VG+QSIVYTGDYNMTPDRHLGAAWIDKCRPDLLI+ESTYATTIRDSKRCRERDFL
Sbjct: 162 MFWVRVGSQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLISESTYATTIRDSKRCRERDFL 221
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHEC+DRGGKVLIPVFALGRAQELCILLETYWERMNL+ P+YFA+GLTEKA NYYKMF
Sbjct: 222 KKVHECIDRGGKVLIPVFALGRAQELCILLETYWERMNLKVPVYFALGLTEKANNYYKMF 281
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
ITWTNQKI+KTFVQRNMFDFKHI+PFDK++IDNPG MVVFATPGMLH+GLSL IFKKWAP
Sbjct: 282 ITWTNQKIKKTFVQRNMFDFKHIKPFDKAYIDNPGAMVVFATPGMLHAGLSLQIFKKWAP 341
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E+NM+IMPGFCVQGT+GHKVL+G +++EFEN+QI++VKMA+EYMSFSAHADAKGIMQLI
Sbjct: 342 NESNMVIMPGFCVQGTVGHKVLNGTRRIEFENRQIVEVKMAVEYMSFSAHADAKGIMQLI 401
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
QYCEPKN++LVHGE +KM +LKEKIKQEF +C+ PANGE+C + T K+ +D S+ LLK
Sbjct: 402 QYCEPKNIILVHGEFAKMEYLKEKIKQEFGTNCYNPANGETCIITTTSKVPVDASLALLK 461
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
EA +Y++ PP+P + R +H VL++++ + L+D DE K GI++H+VRFTS VQ+ D
Sbjct: 462 AEAKRYSALPPDPKRRRLLHSVLMLREDGVCLVDADEVAKAAGITKHVVRFTSTVQVRDP 521
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLLSVERIDDKNKRVFVTWPNQ 420
P + T +LL L+++ + +QLT+ S+S++SVL+ VE +D K V+V+W NQ
Sbjct: 522 GPAHATTL----KLLQPLKERLSGWTVQLTDGSISVESVLVKVEGDEDDQKSVYVSWTNQ 577
Query: 421 DEEVGKIVLHVLKSMSN 437
DE++G +L L++M N
Sbjct: 578 DEDLGSYILGFLQTMLN 594
>gi|380011463|ref|XP_003689822.1| PREDICTED: integrator complex subunit 11-like [Apis florea]
Length = 595
Score = 675 bits (1741), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 304/437 (69%), Positives = 371/437 (84%), Gaps = 4/437 (0%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF ++VG+QSIVYTGDYNMTPDRHLGAAWIDKCRPDLLI+ESTYATTIRDSKRCRERDFL
Sbjct: 163 MFWIRVGSQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLISESTYATTIRDSKRCRERDFL 222
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHEC+DRGGKVLIPVFALGRAQELCILLETYWERMNL+ P+YFA+GLTEKA NYYKMF
Sbjct: 223 KKVHECIDRGGKVLIPVFALGRAQELCILLETYWERMNLKVPVYFALGLTEKANNYYKMF 282
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
ITWTNQKI+KTFVQRNMFDFKHI+PFDK++IDNPG MVVFATPGMLH+GLSL IFKKWAP
Sbjct: 283 ITWTNQKIKKTFVQRNMFDFKHIKPFDKAYIDNPGAMVVFATPGMLHAGLSLQIFKKWAP 342
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPGFCVQGT+GHKVL+G +++EFEN+QI++VKMA+EYMSFSAHADAKGIMQLI
Sbjct: 343 NEANMVIMPGFCVQGTVGHKVLNGSRRIEFENRQIVEVKMAVEYMSFSAHADAKGIMQLI 402
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
QYCEPKNV+LVHGE +KM FLKEKIKQEF +C+ PANGE+C + T K+ +D S+ LLK
Sbjct: 403 QYCEPKNVMLVHGEFAKMEFLKEKIKQEFGTNCYNPANGETCVITTISKVPVDASLALLK 462
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
EA KY++ PP+P + R +H VL++++ + L+D DE K GI++H+VRFTS VQ+ D
Sbjct: 463 AEAKKYSALPPDPKRRRLLHSVLMLREDGVCLVDADEVTKAAGITKHVVRFTSTVQVRDP 522
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLLSVERIDDKNKRVFVTWPNQ 420
P + T +LL L+++ + +QLT+ S+S++SVL+ VE +D K V+V+W NQ
Sbjct: 523 GPAHSTTL----KLLQPLKERLSGWTVQLTDGSISVESVLVKVEGDEDDQKSVYVSWTNQ 578
Query: 421 DEEVGKIVLHVLKSMSN 437
DE++G +L L++M N
Sbjct: 579 DEDLGSYILGFLQTMLN 595
>gi|340728535|ref|XP_003402577.1| PREDICTED: integrator complex subunit 11-like [Bombus terrestris]
gi|350421011|ref|XP_003492700.1| PREDICTED: integrator complex subunit 11-like [Bombus impatiens]
Length = 595
Score = 675 bits (1741), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 304/437 (69%), Positives = 371/437 (84%), Gaps = 4/437 (0%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF ++VG+QSIVYTGDYNMTPDRHLGAAWIDKCRPDLLI+ESTYATTIRDSKRCRERDFL
Sbjct: 163 MFWIRVGSQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLISESTYATTIRDSKRCRERDFL 222
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHEC+DRGGKVLIPVFALGRAQELCILLETYWERMNL+ P+YFA+GLTEKA NYYKMF
Sbjct: 223 KKVHECIDRGGKVLIPVFALGRAQELCILLETYWERMNLKVPVYFALGLTEKANNYYKMF 282
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
ITWTNQKI+KTFVQRNMFDFKHI+PFDK++IDNPG MVVFATPGMLH+GLSL IFKKWAP
Sbjct: 283 ITWTNQKIKKTFVQRNMFDFKHIKPFDKAYIDNPGAMVVFATPGMLHAGLSLQIFKKWAP 342
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPGFCVQGT+GHKVL+G +++EFEN+QI++VKMA+EYMSFSAHADAKGIMQLI
Sbjct: 343 NEANMVIMPGFCVQGTVGHKVLNGSRRIEFENRQIVEVKMAVEYMSFSAHADAKGIMQLI 402
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
QYCEPKNV+LVHGE +KM FLKEKIKQEF +C+ PANGE+C + T K+ +D S+ LLK
Sbjct: 403 QYCEPKNVMLVHGEFAKMEFLKEKIKQEFGTNCYNPANGETCVITTIAKVPVDASLALLK 462
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
EA +Y++ PP+P + R +H VL++++ + L+D DE K GI++HIVRFTS VQ+ D
Sbjct: 463 SEAKRYSALPPDPKRRRLLHSVLMLREDGVCLVDADEVTKAAGITKHIVRFTSTVQVRDP 522
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLLSVERIDDKNKRVFVTWPNQ 420
P + T +LL L+++ + +QLT+ S+S++SVL+ VE +D K V+V+W NQ
Sbjct: 523 GPAHSTTL----KLLQPLKERLPGWTVQLTDGSISVESVLVKVEGDEDDQKSVYVSWTNQ 578
Query: 421 DEEVGKIVLHVLKSMSN 437
DE++G +L L++M N
Sbjct: 579 DEDLGSYILGFLQTMLN 595
>gi|307170840|gb|EFN62951.1| Integrator complex subunit 11 [Camponotus floridanus]
Length = 595
Score = 674 bits (1739), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 301/437 (68%), Positives = 371/437 (84%), Gaps = 4/437 (0%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF ++VG+QSIVYTGDYNMTPDRHLGAAWIDKCRPDLLI+ESTYATTIRDSKRCRERDFL
Sbjct: 163 MFWIRVGSQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLISESTYATTIRDSKRCRERDFL 222
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHEC+D+GGKVLIPVFALGRAQELCILLETYWERMNL+ P+YFA+GLTEKA NYYKMF
Sbjct: 223 KKVHECIDKGGKVLIPVFALGRAQELCILLETYWERMNLKVPVYFALGLTEKANNYYKMF 282
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
ITWTNQKI+KTFVQRNMF+FKHI+PFDK++IDNPG MVVFATPGMLH+GLSL IFKKWAP
Sbjct: 283 ITWTNQKIKKTFVQRNMFEFKHIKPFDKAYIDNPGAMVVFATPGMLHAGLSLQIFKKWAP 342
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPGFCVQGT+GHKVL+G +++EFEN+QI++VKMA+EYMSFSAHADAKGIMQLI
Sbjct: 343 NEANMVIMPGFCVQGTVGHKVLNGTRRIEFENRQIVEVKMAVEYMSFSAHADAKGIMQLI 402
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
QYCEPKNV+LVHGE +KM +LKEKIKQEF + C+ PANGE+C + T K+ +D S+ LLK
Sbjct: 403 QYCEPKNVMLVHGEFAKMEYLKEKIKQEFGVSCYNPANGETCVITTTSKVPVDASLALLK 462
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
EA +Y++ PP+P + R +H VL++++ + L+D DE K GI++H+VRFTS VQ+ D
Sbjct: 463 AEAKRYSALPPDPKRRRLLHSVLMLREDGVCLVDADEVAKAAGITKHVVRFTSTVQVRDP 522
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLLSVERIDDKNKRVFVTWPNQ 420
P + T +LL L+++ + +QLT+ S+S++SVL+ VE DD K V+V+W NQ
Sbjct: 523 GPAHATTL----KLLQPLKERLSGWTVQLTDGSISVESVLVKVEGDDDDQKSVYVSWTNQ 578
Query: 421 DEEVGKIVLHVLKSMSN 437
DE++G +L L++M N
Sbjct: 579 DEDLGSYILGFLQTMLN 595
>gi|383859338|ref|XP_003705152.1| PREDICTED: integrator complex subunit 11-like isoform 2 [Megachile
rotundata]
Length = 494
Score = 673 bits (1736), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 302/437 (69%), Positives = 370/437 (84%), Gaps = 4/437 (0%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF ++VG+QSIVYTGDYNMTPDRHLGAAWIDKCRPDLLI+ESTYATTIRDSKRCRERDFL
Sbjct: 62 MFWIRVGSQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLISESTYATTIRDSKRCRERDFL 121
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHEC+DRGGKVLIPVFALGRAQELCILLETYWERMNL+ P+YFA+GLTEKA NYYKMF
Sbjct: 122 KKVHECIDRGGKVLIPVFALGRAQELCILLETYWERMNLKVPVYFALGLTEKANNYYKMF 181
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
ITWTNQKI+KTFVQRNMFDFKHI+PFDK++IDNPG MVVFATPGMLH+GLSL IFKKWAP
Sbjct: 182 ITWTNQKIKKTFVQRNMFDFKHIKPFDKAYIDNPGAMVVFATPGMLHAGLSLQIFKKWAP 241
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPGFCVQGT+GHKVL+G +++EFEN+QI++VKMA+EYMSFSAHADAKGIMQLI
Sbjct: 242 NEANMVIMPGFCVQGTVGHKVLNGSRRIEFENRQIVEVKMAVEYMSFSAHADAKGIMQLI 301
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
QYCEPKNV+LVHGE +KM FLKEKIKQEF +C+ PANGE+C + T K+ +D S+ LLK
Sbjct: 302 QYCEPKNVMLVHGEFAKMEFLKEKIKQEFGTNCYNPANGETCVITTISKVPVDASLALLK 361
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
EA +Y++ PP+P + R +H +L++++ L+D DE K GI++H+VRFTS VQ+ D
Sbjct: 362 AEAKRYSALPPDPKRRRLLHSILMLREDGACLVDADEVTKAAGITKHVVRFTSTVQVRDP 421
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLLSVERIDDKNKRVFVTWPNQ 420
P + T +LL L+++ + +QLT+ S+S++SVL+ VE +D K V+V+W NQ
Sbjct: 422 GPAHSTTL----KLLQPLKERLSGWTVQLTDGSISVESVLVKVEGDEDDQKSVYVSWTNQ 477
Query: 421 DEEVGKIVLHVLKSMSN 437
DE++G +L L++M N
Sbjct: 478 DEDLGSYILGFLQTMLN 494
>gi|383859336|ref|XP_003705151.1| PREDICTED: integrator complex subunit 11-like isoform 1 [Megachile
rotundata]
Length = 595
Score = 672 bits (1735), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 302/437 (69%), Positives = 370/437 (84%), Gaps = 4/437 (0%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF ++VG+QSIVYTGDYNMTPDRHLGAAWIDKCRPDLLI+ESTYATTIRDSKRCRERDFL
Sbjct: 163 MFWIRVGSQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLISESTYATTIRDSKRCRERDFL 222
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHEC+DRGGKVLIPVFALGRAQELCILLETYWERMNL+ P+YFA+GLTEKA NYYKMF
Sbjct: 223 KKVHECIDRGGKVLIPVFALGRAQELCILLETYWERMNLKVPVYFALGLTEKANNYYKMF 282
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
ITWTNQKI+KTFVQRNMFDFKHI+PFDK++IDNPG MVVFATPGMLH+GLSL IFKKWAP
Sbjct: 283 ITWTNQKIKKTFVQRNMFDFKHIKPFDKAYIDNPGAMVVFATPGMLHAGLSLQIFKKWAP 342
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPGFCVQGT+GHKVL+G +++EFEN+QI++VKMA+EYMSFSAHADAKGIMQLI
Sbjct: 343 NEANMVIMPGFCVQGTVGHKVLNGSRRIEFENRQIVEVKMAVEYMSFSAHADAKGIMQLI 402
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
QYCEPKNV+LVHGE +KM FLKEKIKQEF +C+ PANGE+C + T K+ +D S+ LLK
Sbjct: 403 QYCEPKNVMLVHGEFAKMEFLKEKIKQEFGTNCYNPANGETCVITTISKVPVDASLALLK 462
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
EA +Y++ PP+P + R +H +L++++ L+D DE K GI++H+VRFTS VQ+ D
Sbjct: 463 AEAKRYSALPPDPKRRRLLHSILMLREDGACLVDADEVTKAAGITKHVVRFTSTVQVRDP 522
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLLSVERIDDKNKRVFVTWPNQ 420
P + T +LL L+++ + +QLT+ S+S++SVL+ VE +D K V+V+W NQ
Sbjct: 523 GPAHSTTL----KLLQPLKERLSGWTVQLTDGSISVESVLVKVEGDEDDQKSVYVSWTNQ 578
Query: 421 DEEVGKIVLHVLKSMSN 437
DE++G +L L++M N
Sbjct: 579 DEDLGSYILGFLQTMLN 595
>gi|156546030|ref|XP_001608037.1| PREDICTED: integrator complex subunit 11-like isoform 1 [Nasonia
vitripennis]
gi|345498393|ref|XP_003428220.1| PREDICTED: integrator complex subunit 11-like isoform 2 [Nasonia
vitripennis]
gi|345498395|ref|XP_003428221.1| PREDICTED: integrator complex subunit 11-like isoform 3 [Nasonia
vitripennis]
Length = 595
Score = 669 bits (1725), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 300/437 (68%), Positives = 370/437 (84%), Gaps = 4/437 (0%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF ++VG+QSIVYTGDYNMTPDRHLGAAWIDKCRPDLLI+ESTYATTIRDSKRCRERDFL
Sbjct: 163 MFWIRVGSQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLISESTYATTIRDSKRCRERDFL 222
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHEC+D+GGKVLIPVFALGRAQELCILLETYWERMNL+AP+YFA+GLTEKA NYYKMF
Sbjct: 223 KKVHECIDKGGKVLIPVFALGRAQELCILLETYWERMNLKAPVYFALGLTEKANNYYKMF 282
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
ITWTNQKI+KTFVQRNMFDFKHI+PFDKS+IDNPG MVVFATPGMLH+GLSL IFKKWAP
Sbjct: 283 ITWTNQKIKKTFVQRNMFDFKHIKPFDKSYIDNPGAMVVFATPGMLHAGLSLQIFKKWAP 342
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPGFCVQGT+GHKVL+G K++EFEN+QI++VKM +EYMSFSAHADAKGIMQLI
Sbjct: 343 NEANMVIMPGFCVQGTVGHKVLNGAKRIEFENRQIVEVKMTVEYMSFSAHADAKGIMQLI 402
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
QYCEPKNV+LVHGE +KM +LK+KIKQEF ++C+ PANGE+C + T K+ ID S+ LLK
Sbjct: 403 QYCEPKNVMLVHGEFAKMEYLKDKIKQEFGINCYNPANGETCIITTKSKVPIDASLALLK 462
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
EA KY++ PP+P + R +H VL+ ++ I L+D +E K G++RHIVRFTS +Q+ D
Sbjct: 463 SEAKKYSALPPDPKRRRLLHSVLMCREDGICLLDSEEVSKAAGVTRHIVRFTSAIQVRDP 522
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLLSVERIDDKNKRVFVTWPNQ 420
P T +LL LQ++ + I++ +SS+ +++V++ +E +D+ K V+V+W NQ
Sbjct: 523 GPAGGTAR----KLLLPLQERLGGWGIEMIDSSIMVETVVVKIEGDEDEQKTVYVSWTNQ 578
Query: 421 DEEVGKIVLHVLKSMSN 437
DE++G +L L++M N
Sbjct: 579 DEDLGSYILGFLQTMLN 595
>gi|328776642|ref|XP_003249190.1| PREDICTED: integrator complex subunit 11-like [Apis mellifera]
Length = 603
Score = 668 bits (1723), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 300/425 (70%), Positives = 364/425 (85%), Gaps = 4/425 (0%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF ++VG+QSIVYTGDYNMTPDRHLGAAWIDKCRPDLLI+ESTYATTIRDSKRCRERDFL
Sbjct: 163 MFWIRVGSQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLISESTYATTIRDSKRCRERDFL 222
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHEC+DRGGKVLIPVFALGRAQELCILLETYWERMNL+ P+YFA+GLTEKA NYYKMF
Sbjct: 223 KKVHECIDRGGKVLIPVFALGRAQELCILLETYWERMNLKVPVYFALGLTEKANNYYKMF 282
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
ITWTNQKI+KTFVQRNMFDFKHI+PFDK++IDNPG MVVFATPGMLH+GLSL IFKKWAP
Sbjct: 283 ITWTNQKIKKTFVQRNMFDFKHIKPFDKAYIDNPGAMVVFATPGMLHAGLSLQIFKKWAP 342
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPGFCVQGT+GHKVL+G +++EFEN+QI++VKMA+EYMSFSAHADAKGIMQLI
Sbjct: 343 NEANMVIMPGFCVQGTVGHKVLNGSRRIEFENRQIVEVKMAVEYMSFSAHADAKGIMQLI 402
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
QYCEPKNV+LVHGE +KM FLKEKIKQEF +C+ PANGE+C + T K+ +D S+ LLK
Sbjct: 403 QYCEPKNVMLVHGEFAKMEFLKEKIKQEFGTNCYNPANGETCVITTISKVPVDASLALLK 462
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
EA KY++ PP+P + R +H VL++++ + L+D DE K GI++H+VRFTS VQ+ D
Sbjct: 463 AEAKKYSALPPDPKRRRLLHSVLMLREDGVCLVDADEVTKAAGITKHVVRFTSTVQVRDP 522
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLLSVERIDDKNKRVFVTWPNQ 420
P + T +LL L+++ + +QLT+ S+S++SVL+ VE +D K V+V+W NQ
Sbjct: 523 GPAHSTTL----KLLQPLKERLSGWTVQLTDGSISVESVLVKVEGDEDDQKSVYVSWTNQ 578
Query: 421 DEEVG 425
DE++G
Sbjct: 579 DEDLG 583
>gi|195503187|ref|XP_002098546.1| GE23879 [Drosophila yakuba]
gi|194184647|gb|EDW98258.1| GE23879 [Drosophila yakuba]
Length = 597
Score = 667 bits (1722), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 305/438 (69%), Positives = 375/438 (85%), Gaps = 7/438 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +KVG+QS+VYTGDYNMTPDRHLGAAWIDKCRPDLLI+ESTYATTIRDSKRCRERDFL
Sbjct: 163 MFWIKVGSQSVVYTGDYNMTPDRHLGAAWIDKCRPDLLISESTYATTIRDSKRCRERDFL 222
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHECV +GGKVLIPVFALGRAQELCILLETYWERMNL+ PIYFA+GLTEKA YYKMF
Sbjct: 223 KKVHECVAKGGKVLIPVFALGRAQELCILLETYWERMNLKYPIYFALGLTEKANTYYKMF 282
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
ITWTNQKIRKTFV RNMFDFKHI+PFDK++IDNPG MVVFATPGMLH+GLSL IFKKWAP
Sbjct: 283 ITWTNQKIRKTFVHRNMFDFKHIKPFDKAYIDNPGAMVVFATPGMLHAGLSLQIFKKWAP 342
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
ENNM+IMPG+CVQGT+G+K+L G KK+EFEN+Q+++VKMA+EYMSFSAHADAKGIMQLI
Sbjct: 343 NENNMVIMPGYCVQGTVGNKILGGAKKVEFENRQVVEVKMAVEYMSFSAHADAKGIMQLI 402
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
Q CEPKNV+LVHGEA KM FL+ KIK EFNL+ +MPANGE+C + T +KI +D SV+LLK
Sbjct: 403 QNCEPKNVMLVHGEAEKMKFLRSKIKDEFNLETYMPANGETCVISTPVKIPVDASVSLLK 462
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
EA YN++PP+P + R IHGVLV+KD+ I L ++ +A KE+GI+RH++RFTS V++ DS
Sbjct: 463 AEARSYNAQPPDPKRRRLIHGVLVMKDNRIMLQNLTDALKEIGINRHVMRFTSKVKMDDS 522
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTES-SLSIDSVLLSVE--RIDDKNKRVFVTW 417
P+ +T E +L++ L++K + + + E+ S++I+SV + VE D K K + ++W
Sbjct: 523 GPVIRTSE----RLMALLEEKLAGWTVTMQENGSIAIESVEVKVEEDEKDPKQKNILISW 578
Query: 418 PNQDEEVGKIVLHVLKSM 435
NQDE++G +L+VL++M
Sbjct: 579 TNQDEDIGAYILNVLQNM 596
>gi|194906134|ref|XP_001981318.1| GG11690 [Drosophila erecta]
gi|190655956|gb|EDV53188.1| GG11690 [Drosophila erecta]
Length = 597
Score = 666 bits (1718), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 305/438 (69%), Positives = 374/438 (85%), Gaps = 7/438 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +KVG+QS+VYTGDYNMTPDRHLGAAWIDKCRPDLLI+ESTYATTIRDSKRCRERDFL
Sbjct: 163 MFWIKVGSQSVVYTGDYNMTPDRHLGAAWIDKCRPDLLISESTYATTIRDSKRCRERDFL 222
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHECV +GGKVLIPVFALGRAQELCILLETYWERMNL+ PIYFA+GLTEKA YYKMF
Sbjct: 223 KKVHECVAKGGKVLIPVFALGRAQELCILLETYWERMNLKYPIYFALGLTEKANTYYKMF 282
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
ITWTNQKIRKTFV RNMFDFKHI+PFDK++IDNPG MVVFATPGMLH+GLSL IFKKWAP
Sbjct: 283 ITWTNQKIRKTFVHRNMFDFKHIKPFDKAYIDNPGAMVVFATPGMLHAGLSLQIFKKWAP 342
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
ENNM+IMPG+CVQGT+G+K+L G KK+EFEN+Q+++VKMA+EYMSFSAHADAKGIMQLI
Sbjct: 343 NENNMVIMPGYCVQGTVGNKILGGAKKVEFENRQVVEVKMAVEYMSFSAHADAKGIMQLI 402
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
Q CEPKNV+LVHGEA KM FL+ KIK EFNL+ +MPANGE+C + T +KI +D SV+LLK
Sbjct: 403 QNCEPKNVMLVHGEAEKMKFLRSKIKDEFNLETYMPANGETCVISTPVKIPVDASVSLLK 462
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
EA YN++PP+P + R IHGVLV+KD+ I L ++ +A KE+GI+RH++RFTS V++ DS
Sbjct: 463 AEARSYNAQPPDPKRRRLIHGVLVMKDNRIMLQNLTDALKEIGINRHVMRFTSKVKMDDS 522
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTES-SLSIDSVLLSVE--RIDDKNKRVFVTW 417
P+ +T E +L + L++K + + + E+ S++I+SV + VE D K K + ++W
Sbjct: 523 GPVIRTSE----RLAALLEEKLAGWTVTMQENGSIAIESVEVKVEEDEKDPKQKNILISW 578
Query: 418 PNQDEEVGKIVLHVLKSM 435
NQDE++G +L+VL++M
Sbjct: 579 TNQDEDIGAYILNVLQNM 596
>gi|21358523|ref|NP_651721.1| integrator 11 [Drosophila melanogaster]
gi|7301822|gb|AAF56931.1| integrator 11 [Drosophila melanogaster]
gi|16768852|gb|AAL28645.1| LD08814p [Drosophila melanogaster]
gi|220943570|gb|ACL84328.1| CG1972-PA [synthetic construct]
gi|220953494|gb|ACL89290.1| CG1972-PA [synthetic construct]
Length = 597
Score = 665 bits (1716), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 305/438 (69%), Positives = 374/438 (85%), Gaps = 7/438 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +KVG+QS+VYTGDYNMTPDRHLGAAWIDKCRPDLLI+ESTYATTIRDSKRCRERDFL
Sbjct: 163 MFWIKVGSQSVVYTGDYNMTPDRHLGAAWIDKCRPDLLISESTYATTIRDSKRCRERDFL 222
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHECV +GGKVLIPVFALGRAQELCILLETYWERMNL+ PIYFA+GLTEKA YYKMF
Sbjct: 223 KKVHECVAKGGKVLIPVFALGRAQELCILLETYWERMNLKYPIYFALGLTEKANTYYKMF 282
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
ITWTNQKIRKTFV RNMFDFKHI+PFDK++IDNPG MVVFATPGMLH+GLSL IFKKWAP
Sbjct: 283 ITWTNQKIRKTFVHRNMFDFKHIKPFDKAYIDNPGAMVVFATPGMLHAGLSLQIFKKWAP 342
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
ENNM+IMPG+CVQGT+G+K+L G KK+EFEN+Q+++VKMA+EYMSFSAHADAKGIMQLI
Sbjct: 343 NENNMVIMPGYCVQGTVGNKILGGAKKVEFENRQVVEVKMAVEYMSFSAHADAKGIMQLI 402
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
Q CEPKNV+LVHGEA KM FL+ KIK EFNL+ +MPANGE+C + T +KI +D SV+LLK
Sbjct: 403 QNCEPKNVMLVHGEAGKMKFLRSKIKDEFNLETYMPANGETCVISTPVKIPVDASVSLLK 462
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
EA YN++PP+P + R IHGVLV+KD+ I L ++ +A KE+GI+RH++RFTS V++ DS
Sbjct: 463 AEARSYNAQPPDPKRRRLIHGVLVMKDNRIMLQNLTDALKEIGINRHVMRFTSKVKMDDS 522
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTES-SLSIDSVLLSVE--RIDDKNKRVFVTW 417
P+ +T E +L + L++K + + + E+ S++I+SV + VE D K K + ++W
Sbjct: 523 GPVIRTSE----RLKTLLEEKLAGWTVTMQENGSIAIESVEVKVEEDEKDPKQKNILISW 578
Query: 418 PNQDEEVGKIVLHVLKSM 435
NQDE++G +L+VL++M
Sbjct: 579 TNQDEDIGAYILNVLQNM 596
>gi|195341281|ref|XP_002037239.1| GM12816 [Drosophila sechellia]
gi|195574829|ref|XP_002105386.1| GD21460 [Drosophila simulans]
gi|194131355|gb|EDW53398.1| GM12816 [Drosophila sechellia]
gi|194201313|gb|EDX14889.1| GD21460 [Drosophila simulans]
Length = 597
Score = 665 bits (1715), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 305/438 (69%), Positives = 374/438 (85%), Gaps = 7/438 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +KVG+QS+VYTGDYNMTPDRHLGAAWIDKCRPDLLI+ESTYATTIRDSKRCRERDFL
Sbjct: 163 MFWIKVGSQSVVYTGDYNMTPDRHLGAAWIDKCRPDLLISESTYATTIRDSKRCRERDFL 222
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHECV +GGKVLIPVFALGRAQELCILLETYWERMNL+ PIYFA+GLTEKA YYKMF
Sbjct: 223 KKVHECVAKGGKVLIPVFALGRAQELCILLETYWERMNLKYPIYFALGLTEKANTYYKMF 282
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
ITWTNQKIRKTFV RNMFDFKHI+PFDK++IDNPG MVVFATPGMLH+GLSL IFKKWAP
Sbjct: 283 ITWTNQKIRKTFVHRNMFDFKHIKPFDKNYIDNPGAMVVFATPGMLHAGLSLQIFKKWAP 342
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
ENNM+IMPG+CVQGT+G+K+L G KK+EFEN+Q+++VKMA+EYMSFSAHADAKGIMQLI
Sbjct: 343 NENNMVIMPGYCVQGTVGNKILGGAKKVEFENRQVVEVKMAVEYMSFSAHADAKGIMQLI 402
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
Q CEPKNV+LVHGEA KM FL+ KIK EFNL+ +MPANGE+C + T +KI +D SV+LLK
Sbjct: 403 QNCEPKNVMLVHGEAGKMKFLRSKIKDEFNLETYMPANGETCVISTPVKIPVDASVSLLK 462
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
EA YN++PP+P + R IHGVLV+KD+ I L ++ +A KE+GI+RH++RFTS V++ DS
Sbjct: 463 AEARSYNAQPPDPKRRRLIHGVLVMKDNRIMLQNLTDALKEIGINRHVMRFTSKVKMDDS 522
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTES-SLSIDSVLLSVE--RIDDKNKRVFVTW 417
P+ +T E +L + L++K + + + E+ S++I+SV + VE D K K + ++W
Sbjct: 523 GPVIRTSE----RLKTLLEEKLTGWTVTMQENGSIAIESVEVKVEEDEKDPKQKNILISW 578
Query: 418 PNQDEEVGKIVLHVLKSM 435
NQDE++G +L+VL++M
Sbjct: 579 TNQDEDIGAYILNVLQNM 596
>gi|125773833|ref|XP_001358175.1| GA15164 [Drosophila pseudoobscura pseudoobscura]
gi|54637910|gb|EAL27312.1| GA15164 [Drosophila pseudoobscura pseudoobscura]
Length = 597
Score = 661 bits (1705), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 305/438 (69%), Positives = 371/438 (84%), Gaps = 7/438 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +KVG+QS+VYTGDYNMTPDRHLGAAWIDKCRPDLLI+ESTYATTIRDSKRCRERDFL
Sbjct: 163 MFWIKVGSQSVVYTGDYNMTPDRHLGAAWIDKCRPDLLISESTYATTIRDSKRCRERDFL 222
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHECV RGGKVLIPVFALGRAQELCILLETYWERMNL+ PIYFA+GLTEKA YYKMF
Sbjct: 223 KKVHECVARGGKVLIPVFALGRAQELCILLETYWERMNLKYPIYFALGLTEKANTYYKMF 282
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
ITWTNQKIRKTFV RNMFDFKHI+PFDK++IDNPG MVVFATPGMLH+GLSL IFKKWAP
Sbjct: 283 ITWTNQKIRKTFVHRNMFDFKHIKPFDKAYIDNPGAMVVFATPGMLHAGLSLQIFKKWAP 342
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
ENNM+IMPG+CVQGT+G+K+L G KK+EFEN+Q+++VKMA+EYMSFSAHADAKGIMQLI
Sbjct: 343 NENNMVIMPGYCVQGTVGNKILGGAKKVEFENRQVVEVKMAVEYMSFSAHADAKGIMQLI 402
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
Q CEPKNV+LVHGEA KM FL+ KIK EFNL+ +MPANGE+C + T ++I +D +V+LLK
Sbjct: 403 QNCEPKNVMLVHGEAEKMKFLRSKIKDEFNLETYMPANGETCVISTPVRIPVDAAVSLLK 462
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
EA YN++PP+P + R IHGVLV+KD+ I L ++ +A KE+GI+RH++RFTS V+I D
Sbjct: 463 AEARSYNAQPPDPKRRRLIHGVLVMKDNRIMLQNLTDALKEIGINRHVMRFTSKVKIDDP 522
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTES-SLSIDSVLLSVER--IDDKNKRVFVTW 417
P T E +L + L++K + + L E+ S++I+SV + VE D K K + ++W
Sbjct: 523 GPTGLTSE----RLKTLLEEKLVGWTVALQENGSITIESVEVKVEEDAKDPKQKTLLISW 578
Query: 418 PNQDEEVGKIVLHVLKSM 435
NQDE++G +L+VL++M
Sbjct: 579 TNQDEDIGAYILNVLQNM 596
>gi|194765324|ref|XP_001964777.1| GF23370 [Drosophila ananassae]
gi|190615049|gb|EDV30573.1| GF23370 [Drosophila ananassae]
Length = 597
Score = 661 bits (1705), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 304/438 (69%), Positives = 372/438 (84%), Gaps = 7/438 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +KVG+QS+VYTGDYNMTPDRHLGAAWIDKCRPDLLI+ESTYATTIRDSKRCRERDFL
Sbjct: 163 MFWIKVGSQSVVYTGDYNMTPDRHLGAAWIDKCRPDLLISESTYATTIRDSKRCRERDFL 222
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHECV +GGKVLIPVFALGRAQELCILLETYW+RMNL+ PIYFA+GLTEKA YYKMF
Sbjct: 223 KKVHECVAKGGKVLIPVFALGRAQELCILLETYWDRMNLKYPIYFALGLTEKANTYYKMF 282
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
ITWTNQKIRKTFV RNMFDFKHI+PFDK++IDNPG MVVFATPGMLH+GLSL IFKKWAP
Sbjct: 283 ITWTNQKIRKTFVHRNMFDFKHIKPFDKAYIDNPGAMVVFATPGMLHAGLSLQIFKKWAP 342
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
ENNM+IMPG+CVQGT+G+K+L G KK+EFEN+Q+++VKMA+EYMSFSAHADAKGIMQLI
Sbjct: 343 NENNMVIMPGYCVQGTVGNKILGGAKKVEFENRQVVEVKMAVEYMSFSAHADAKGIMQLI 402
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
Q CEPKNV+LVHGEA KM FL+ KIK EFNL+ +MPANGE+C + T +KI +D SV+LLK
Sbjct: 403 QNCEPKNVMLVHGEAEKMKFLRSKIKDEFNLETYMPANGETCVISTPVKIPVDASVSLLK 462
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
EA YN++PP+P + R IHGVLV+KD+ I L ++ +A KE+GI+RH++RFTS V++ DS
Sbjct: 463 AEARTYNAQPPDPKRRRLIHGVLVMKDNRIMLQNLTDALKEIGINRHVMRFTSKVKMDDS 522
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESS-LSIDSVLLSVE--RIDDKNKRVFVTW 417
P +T E +L + L++K + + + E+ +SI+SV + VE D K K + ++W
Sbjct: 523 GPTIRTSE----RLKTILEEKLTGWTVTMQENGCISIESVEVKVEEDEKDSKQKNILISW 578
Query: 418 PNQDEEVGKIVLHVLKSM 435
NQDE++G +L+VL++M
Sbjct: 579 TNQDEDIGAYILNVLQNM 596
>gi|157107341|ref|XP_001649735.1| cleavage and polyadenylation specificity factor [Aedes aegypti]
gi|108879612|gb|EAT43837.1| AAEL004757-PA [Aedes aegypti]
Length = 613
Score = 660 bits (1703), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 303/455 (66%), Positives = 372/455 (81%), Gaps = 24/455 (5%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF ++VG+QS+VYTGDYNMTPDRHLGAAWIDKC+PDLLITESTYATTIRDSKRCRERDFL
Sbjct: 163 MFWIRVGSQSVVYTGDYNMTPDRHLGAAWIDKCKPDLLITESTYATTIRDSKRCRERDFL 222
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHECV +GGKVLIPVFALGRAQELCILLETYWERMNL+ PIYFAVGLTEKA NYYKMF
Sbjct: 223 KKVHECVAKGGKVLIPVFALGRAQELCILLETYWERMNLKYPIYFAVGLTEKANNYYKMF 282
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
ITWTNQKIRKTFVQRNMFDFKHI+PFDK +IDNPG MVVFATPGMLH+GLSL IFKKWAP
Sbjct: 283 ITWTNQKIRKTFVQRNMFDFKHIKPFDKGYIDNPGAMVVFATPGMLHAGLSLQIFKKWAP 342
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
ENNM+IMPG+CVQGT+GHK+L G KK+EFEN+Q+++VKM++EYMSFSAHADAKGIMQLI
Sbjct: 343 NENNMVIMPGYCVQGTVGHKILGGAKKVEFENRQVVEVKMSVEYMSFSAHADAKGIMQLI 402
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
QYCEPKNV+LVHGEA KM FLKEKIK+EF+++C+ PANGE+C + T +KI ++ S +LK
Sbjct: 403 QYCEPKNVMLVHGEAVKMEFLKEKIKEEFHIECYTPANGETCVINTPIKIPVEASTTILK 462
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
EEA KYN+EPP+P + R IHG+L++KD+ ISLM +DE E GI+RH++RF S V+I D
Sbjct: 463 EEAKKYNAEPPDPKRRRVIHGLLIMKDNKISLMSIDEIFSECGINRHVIRFISKVKIDDP 522
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTES-SLSIDSVLLSVE--------------- 404
P +T+E +Y+ L ++ + + + E+ S++++SV +++E
Sbjct: 523 GPTLRTIEKLYN----LLNERLSGWTVSMNENGSINVESVNITIEGDESIAGPAGSASVP 578
Query: 405 ----RIDDKNKRVFVTWPNQDEEVGKIVLHVLKSM 435
NK V+W NQDE++G +L +L+S+
Sbjct: 579 PPPPNAPTLNKICSVSWVNQDEDLGSYILGLLQSL 613
>gi|195394529|ref|XP_002055895.1| GJ10637 [Drosophila virilis]
gi|194142604|gb|EDW59007.1| GJ10637 [Drosophila virilis]
Length = 597
Score = 660 bits (1702), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 302/438 (68%), Positives = 372/438 (84%), Gaps = 7/438 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +KVG+QS+VYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL
Sbjct: 163 MFWIKVGSQSVVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 222
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHECV +GGKVLIPVFALGRAQELCILLETYWERMNL+ PIYFA+GLTEKA YYKMF
Sbjct: 223 KKVHECVSKGGKVLIPVFALGRAQELCILLETYWERMNLKYPIYFALGLTEKANTYYKMF 282
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
ITWTNQKIRKTFV RNMFDFKHI+PFDK++IDNPG MVVFATPGMLH+GLSL IFKKWAP
Sbjct: 283 ITWTNQKIRKTFVHRNMFDFKHIKPFDKAYIDNPGAMVVFATPGMLHAGLSLQIFKKWAP 342
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
ENNM+IMPG+CVQGT+G+K+L G KK+EFEN+Q+++VKMA+EYMSFSAHADAKGIMQLI
Sbjct: 343 NENNMVIMPGYCVQGTVGNKILGGAKKVEFENRQVVEVKMAVEYMSFSAHADAKGIMQLI 402
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
Q CEPK+V+LVHGEA KM FL+ KI+ EFNL+ +MPANGE+C + T +KI +D S++LLK
Sbjct: 403 QNCEPKHVMLVHGEAEKMKFLRAKIRDEFNLETYMPANGETCVISTPVKIPVDASISLLK 462
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
EA YN++PP+P + R IHGVLV+KD+ I L ++ +A KE+GI+RH++RFTS V++ D
Sbjct: 463 AEARSYNAQPPDPKRRRLIHGVLVMKDNRIMLQNLTDALKEIGINRHVMRFTSKVKMDDP 522
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESS-LSIDSVLLSVE--RIDDKNKRVFVTW 417
P +T E +L + L++K + + + ES ++++SV ++VE D K K + +TW
Sbjct: 523 GPKLRTSE----KLKALLEEKLTGWTVTMQESGIITVESVEINVEEDEKDPKQKTILITW 578
Query: 418 PNQDEEVGKIVLHVLKSM 435
NQDE++G +L+VL++M
Sbjct: 579 TNQDEDIGAYILNVLQNM 596
>gi|195445135|ref|XP_002070189.1| GK11920 [Drosophila willistoni]
gi|194166274|gb|EDW81175.1| GK11920 [Drosophila willistoni]
Length = 597
Score = 655 bits (1691), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 299/438 (68%), Positives = 370/438 (84%), Gaps = 7/438 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +KVG+QS+VYTGDYNMTPDRHLGAAWIDKCRPDLLI+ESTYATTIRDSKRCRERDFL
Sbjct: 163 MFWIKVGSQSVVYTGDYNMTPDRHLGAAWIDKCRPDLLISESTYATTIRDSKRCRERDFL 222
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHECV +GGKVLIPVFALGRAQELCILLETYWERMNL+ PIYFA+GLTEKA YYKMF
Sbjct: 223 KKVHECVAKGGKVLIPVFALGRAQELCILLETYWERMNLKYPIYFALGLTEKANTYYKMF 282
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
ITWTNQKIRKTFV RNMFDFKHI+PFDK++IDNPG MVVFATPGMLH+GLSL IFKKWAP
Sbjct: 283 ITWTNQKIRKTFVHRNMFDFKHIKPFDKAYIDNPGAMVVFATPGMLHAGLSLQIFKKWAP 342
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
ENNM+IMPG+CVQGT+G+K+L G KK+EFEN+Q+++VKMA+EYMSFSAHADAKGIMQLI
Sbjct: 343 NENNMVIMPGYCVQGTVGNKILGGAKKVEFENRQVVEVKMAVEYMSFSAHADAKGIMQLI 402
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
Q CEPKNV+LVHGEA KM FL+ KI+ EFNL+ +MPANGE+C + T +KI +D +V+LLK
Sbjct: 403 QNCEPKNVMLVHGEAEKMKFLRSKIRDEFNLETYMPANGETCVISTPVKIPVDAAVSLLK 462
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
EA YN++PP+P + R IHGVLV+KD+ I L ++ +A KE+GI+RH++RFTS V++ D
Sbjct: 463 AEARSYNAQPPDPKRRRLIHGVLVMKDNRIMLQNLTDALKEIGINRHVMRFTSKVKMDDP 522
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESS-LSIDSVLLSVE--RIDDKNKRVFVTW 417
P +T+E +L S L +K + + + E+ +++++V + VE D K K + +TW
Sbjct: 523 GPKERTIE----KLKSLLDEKMVGWTVLMQETGYITVETVEIRVEEDEKDHKQKTILITW 578
Query: 418 PNQDEEVGKIVLHVLKSM 435
NQDE++G + ++L++M
Sbjct: 579 TNQDEDIGAYIHNLLQNM 596
>gi|195112455|ref|XP_002000788.1| GI10422 [Drosophila mojavensis]
gi|193917382|gb|EDW16249.1| GI10422 [Drosophila mojavensis]
Length = 597
Score = 655 bits (1690), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 304/438 (69%), Positives = 371/438 (84%), Gaps = 7/438 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +KVG+QS+VYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL
Sbjct: 163 MFWIKVGSQSVVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 222
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHECV +GGKVLIPVFALGRAQELCILLETYWERMNL+ PIYFA+GLTEKA YYKMF
Sbjct: 223 KKVHECVLKGGKVLIPVFALGRAQELCILLETYWERMNLKYPIYFALGLTEKANTYYKMF 282
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
ITWTNQKIRKTFV RNMFDFKHI+PFDK++IDNPG MVVFATPGMLH+GLSL IFKKWAP
Sbjct: 283 ITWTNQKIRKTFVHRNMFDFKHIKPFDKAYIDNPGAMVVFATPGMLHAGLSLQIFKKWAP 342
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
ENNM+IMPG+CVQGT+G+K+L G KK+E EN+QII+VKMA+EYMSFSAHADAKGIMQLI
Sbjct: 343 NENNMVIMPGYCVQGTVGNKILGGAKKVECENRQIIEVKMAVEYMSFSAHADAKGIMQLI 402
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
Q CEPK+V+LVHGEA KM FL+ KI+ EFNL+ +MPANGE+C + T +KI +D S++LLK
Sbjct: 403 QNCEPKHVMLVHGEAEKMKFLRAKIRDEFNLETYMPANGETCVISTPVKIPVDASISLLK 462
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
EA YN++PP+P + R IHGVLV+KD+ I L ++ +A KE+GI+RH++RFTS V++ D
Sbjct: 463 AEARTYNAQPPDPKRRRLIHGVLVMKDNRIMLQNLTDALKEIGINRHVMRFTSKVKMDDP 522
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESS-LSIDSVLLSVE--RIDDKNKRVFVTW 417
P +T E +L + L++K + + L ES ++I++V ++VE D K K + +TW
Sbjct: 523 GPKLRTSE----KLKTLLEEKLIGWNVMLQESGIITIETVEINVEEDEKDPKQKTILITW 578
Query: 418 PNQDEEVGKIVLHVLKSM 435
NQDE++G +L+VL++M
Sbjct: 579 TNQDEDIGAYILNVLQNM 596
>gi|195143691|ref|XP_002012831.1| GL23717 [Drosophila persimilis]
gi|194101774|gb|EDW23817.1| GL23717 [Drosophila persimilis]
Length = 597
Score = 654 bits (1687), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 302/438 (68%), Positives = 368/438 (84%), Gaps = 7/438 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF + VG+QS+VYTGDYNMTPDRHLGAAWID RPDLLI+ESTYATTIRDSKRCRERDFL
Sbjct: 163 MFWINVGSQSVVYTGDYNMTPDRHLGAAWIDNARPDLLISESTYATTIRDSKRCRERDFL 222
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHECV RGGKVLIPVFALGRAQELCILLETYWERMNL+ PIYFA+GLTEKA YYKMF
Sbjct: 223 KKVHECVARGGKVLIPVFALGRAQELCILLETYWERMNLKYPIYFALGLTEKANTYYKMF 282
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
ITWTNQKIRKTFV RNMFDFKHI+PFDK++IDNPG MVVFATPGMLH+GLSL IFKKWAP
Sbjct: 283 ITWTNQKIRKTFVHRNMFDFKHIKPFDKAYIDNPGAMVVFATPGMLHAGLSLQIFKKWAP 342
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
ENNM+IMPG+CVQGT+G+K+L G KK+EFEN+Q+++VKMA+EYMSFSAHADAKGIMQLI
Sbjct: 343 NENNMVIMPGYCVQGTVGNKILGGAKKVEFENRQVVEVKMAVEYMSFSAHADAKGIMQLI 402
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
Q CEPKNV+LVHGEA KM FL+ KIK EFNL+ +MPANGE+C + T ++I +D +V+LLK
Sbjct: 403 QNCEPKNVMLVHGEAEKMKFLRSKIKDEFNLETYMPANGETCVISTPVRIPVDAAVSLLK 462
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
EA YN++PP+P + R IHGVLV+KD+ I L ++ +A KE+GI+RH++RFTS V+I D
Sbjct: 463 AEARSYNAQPPDPKRRRLIHGVLVMKDNRIMLQNLTDALKEIGINRHVMRFTSKVKIDDP 522
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTES-SLSIDSVLLSVER--IDDKNKRVFVTW 417
P T E +L + L++K + + L E+ S++I+SV + VE D K K + ++W
Sbjct: 523 GPTGLTSE----RLKTLLEEKLVGWTVALQENGSITIESVEVKVEEDAKDPKQKTLLISW 578
Query: 418 PNQDEEVGKIVLHVLKSM 435
NQDE++G +L+VL++M
Sbjct: 579 TNQDEDIGAYILNVLQNM 596
>gi|170052069|ref|XP_001862054.1| cleavage and polyadenylation specificity factor subunit 3 [Culex
quinquefasciatus]
gi|167873079|gb|EDS36462.1| cleavage and polyadenylation specificity factor subunit 3 [Culex
quinquefasciatus]
Length = 615
Score = 654 bits (1686), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 298/457 (65%), Positives = 374/457 (81%), Gaps = 26/457 (5%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF ++VG+QS+VYTGDYNMTPDRHLGAAWIDKC+PDLLITESTYATTIRDSKRCRERDFL
Sbjct: 163 MFWIRVGSQSVVYTGDYNMTPDRHLGAAWIDKCKPDLLITESTYATTIRDSKRCRERDFL 222
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHECV +GGKVLIPVFALGRAQELCILLETYWERMNL+ P+YFAVGLTEKA NYYKMF
Sbjct: 223 KKVHECVAKGGKVLIPVFALGRAQELCILLETYWERMNLKYPVYFAVGLTEKANNYYKMF 282
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
ITWTNQKIRKTFVQRNMFDFKHI+PFDK +IDNPG MVVFATPGMLH+GLSL IFKKWAP
Sbjct: 283 ITWTNQKIRKTFVQRNMFDFKHIKPFDKGYIDNPGAMVVFATPGMLHAGLSLQIFKKWAP 342
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
ENNM+IMPG+CVQGT+GHK+L G KK+EFEN+Q+++VKM++EYMSFSAHADAKGIMQLI
Sbjct: 343 NENNMVIMPGYCVQGTVGHKILGGAKKVEFENRQVVEVKMSVEYMSFSAHADAKGIMQLI 402
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
QYCEPKNV+LVHGEA KM FLK+KI++EF++DCF PANGE+C + T +KI ++ S+++LK
Sbjct: 403 QYCEPKNVMLVHGEAVKMEFLKDKIREEFHIDCFTPANGETCVITTPIKIPVEASLSILK 462
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
+EA KYN+EPP+P + R +HG+LV+KD+ ISLM +DE E GI+RH++RF S V+I D
Sbjct: 463 DEAKKYNAEPPDPKRRRIVHGLLVMKDNKISLMSIDEIFNECGINRHVIRFISKVKIDDP 522
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTES-SLSIDSVLLSVERID------------ 407
P +T+E +Y+ L ++ + + + ++ S++++SV + +E +
Sbjct: 523 GPTPRTIEKLYN----LLNERLSGWTVTMADNGSINVESVNIKIESEEAAAAAGAVTAGP 578
Query: 408 ---------DKNKRVFVTWPNQDEEVGKIVLHVLKSM 435
+K V+W NQDE++G +L +++S+
Sbjct: 579 QAATTSGGPPPHKICSVSWVNQDEDLGSYILGLMQSL 615
>gi|158298905|ref|XP_319042.4| AGAP009923-PA [Anopheles gambiae str. PEST]
gi|157014111|gb|EAA13845.4| AGAP009923-PA [Anopheles gambiae str. PEST]
Length = 608
Score = 652 bits (1682), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 298/448 (66%), Positives = 365/448 (81%), Gaps = 15/448 (3%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF ++VG+QS+VYTGDYNMTPDRHLGAAWIDKC+PDLLITESTYATTIRDSKRCRERDFL
Sbjct: 163 MFWIRVGSQSVVYTGDYNMTPDRHLGAAWIDKCKPDLLITESTYATTIRDSKRCRERDFL 222
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHECV +GGKVLIPVFALGRAQELCILLETYWERMNL+ P+YFAVGLTEKA NYYKMF
Sbjct: 223 KKVHECVAKGGKVLIPVFALGRAQELCILLETYWERMNLKYPVYFAVGLTEKANNYYKMF 282
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
ITWTNQKIRKTFVQRNMFDFKHI+PFDK +IDNPG MVVFATPGMLH+GLSL IFKKWAP
Sbjct: 283 ITWTNQKIRKTFVQRNMFDFKHIKPFDKGYIDNPGAMVVFATPGMLHAGLSLQIFKKWAP 342
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
ENNM+IMPG+CVQGT+GHK+L G KK+EFEN+Q++DVKM++EYMSFSAHADAKGIMQLI
Sbjct: 343 NENNMVIMPGYCVQGTVGHKILGGAKKVEFENRQVVDVKMSVEYMSFSAHADAKGIMQLI 402
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
Q+CEP+NV+LVHGEA KM FLKEKI++EF ++C+ PANGE+C + T +KI ++ S+++LK
Sbjct: 403 QFCEPRNVMLVHGEAVKMEFLKEKIREEFKIECYTPANGETCTISTPIKIPVETSLSILK 462
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
+E KYN++PP+P + R IHG+LV+KD+ ISLM +DE E GI RH++RF S +++ D
Sbjct: 463 DEVKKYNAQPPDPKRRRIIHGILVMKDNKISLMSIDEIFNEAGIDRHVIRFISKIKLDDP 522
Query: 361 SPINKTLELIYDQLLSYLQD-------------KSQEYKIQLTESSLSIDSVLLSVERID 407
P +T+E +Y+ L L +S KI+ E +I SV +
Sbjct: 523 GPPMRTIEKLYNMLNERLSGWTVTMNDNNSINVESVNIKIETEEQPAAIGSV--GQPSVP 580
Query: 408 DKNKRVFVTWPNQDEEVGKIVLHVLKSM 435
+K V+W NQDE++G +L +L+S+
Sbjct: 581 GMHKICNVSWVNQDEDLGSYILGLLQSL 608
>gi|195062087|ref|XP_001996130.1| GH14325 [Drosophila grimshawi]
gi|193891922|gb|EDV90788.1| GH14325 [Drosophila grimshawi]
Length = 597
Score = 649 bits (1675), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 302/438 (68%), Positives = 371/438 (84%), Gaps = 7/438 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +KVG+QS+VYTGDYNMTPDRHLGAAWIDKCRPDLLI+ESTYATTIRDSKRCRERDFL
Sbjct: 163 MFWIKVGSQSVVYTGDYNMTPDRHLGAAWIDKCRPDLLISESTYATTIRDSKRCRERDFL 222
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHECV +GGKVLIPVFALGRAQELCILLETYWERMNL+ PIYFA+GLTEKA YYKMF
Sbjct: 223 KKVHECVAKGGKVLIPVFALGRAQELCILLETYWERMNLKYPIYFALGLTEKANTYYKMF 282
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
ITWTNQKIRKTFV RNMFDFKHI+PFDK++IDNPG MVVFATPGMLH+GLSL IFKKWAP
Sbjct: 283 ITWTNQKIRKTFVHRNMFDFKHIKPFDKAYIDNPGAMVVFATPGMLHAGLSLQIFKKWAP 342
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
ENNM+IMPG+CVQGT+G+K+L G KK+EFEN+Q+++VKMA+EYMSFSAHADAKGIMQLI
Sbjct: 343 NENNMVIMPGYCVQGTVGNKILGGAKKVEFENRQVVEVKMAVEYMSFSAHADAKGIMQLI 402
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
Q CEPKNV+LVHGEA KM FL+ KI+ EFNL+ +MPANGE+C + T +KI +D S++LLK
Sbjct: 403 QNCEPKNVMLVHGEAEKMKFLRAKIRDEFNLETYMPANGETCVISTPVKIPVDASISLLK 462
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
EA YN++PP+P + R IHGVLV+KD+ I L ++ +A KE+GI+RH++RFTS V++ D
Sbjct: 463 AEARAYNAQPPDPKRRRLIHGVLVMKDNRIMLQNLTDALKEIGINRHVMRFTSKVKMDDP 522
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTES-SLSIDSVLLSVE--RIDDKNKRVFVTW 417
P +T ++L L +K + + L E+ ++S++SV ++VE D K K + +TW
Sbjct: 523 GPKLRT----SEKLKLLLDEKLTGWTVVLQETGTISVESVEINVEEDEKDPKQKTLLITW 578
Query: 418 PNQDEEVGKIVLHVLKSM 435
NQDE++G +L+VL++M
Sbjct: 579 TNQDEDIGAYILNVLQNM 596
>gi|312381513|gb|EFR27247.1| hypothetical protein AND_06171 [Anopheles darlingi]
Length = 624
Score = 640 bits (1652), Expect = e-180, Method: Compositional matrix adjust.
Identities = 294/466 (63%), Positives = 372/466 (79%), Gaps = 35/466 (7%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF ++VG+QS+VYTGDYNMTPDRHLGAAWIDKC+PDLLITESTYATTIRDSKRCRERDFL
Sbjct: 163 MFWIRVGSQSVVYTGDYNMTPDRHLGAAWIDKCKPDLLITESTYATTIRDSKRCRERDFL 222
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHECV +GGKVLIPVFALGRAQELCILLETYWERMNL+ P+YFAVGLTEKA NYYKMF
Sbjct: 223 KKVHECVAKGGKVLIPVFALGRAQELCILLETYWERMNLKYPVYFAVGLTEKANNYYKMF 282
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
ITWTNQKIRKTFVQRNMFDFKHI+PFDK +IDNPG MVVFATPGMLH+GLSL IFKKWAP
Sbjct: 283 ITWTNQKIRKTFVQRNMFDFKHIKPFDKGYIDNPGAMVVFATPGMLHAGLSLQIFKKWAP 342
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
ENNM+IMPG+CVQGT+GHK+L G KK+EFEN+Q+++VKM++EYMSFSAHADAKGIMQLI
Sbjct: 343 NENNMVIMPGYCVQGTVGHKILGGAKKVEFENRQVVEVKMSVEYMSFSAHADAKGIMQLI 402
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
Q+CEP+NV+LVHGEA KM FLKEKI++EF ++C+ PANGE+C + T +KI ++ S+++LK
Sbjct: 403 QFCEPRNVMLVHGEAVKMEFLKEKIREEFRIECYTPANGETCTISTPIKIPVETSLSILK 462
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQI--- 357
+E KYN++PP+P + R IHG+LV+KD+ ISLM +DE E GI RH++R+ S+V++
Sbjct: 463 DEVKKYNAQPPDPKRRRIIHGILVMKDNKISLMSIDEIFNEAGIDRHVIRYASDVEVLKE 522
Query: 358 ----------SDSSPINKTLELIYDQLLSYLQDKSQEYKIQLTE-SSLSIDSVLLSVERI 406
D P +T+E +Y+ L ++ + + + E +S++++SV + +E
Sbjct: 523 RKIFISKIKLDDPGPPMRTIEKLYNM----LNERLSGWTVTMNENNSINVESVNIKIETE 578
Query: 407 DDK-----------------NKRVFVTWPNQDEEVGKIVLHVLKSM 435
+ +K V+W NQDE++G +L +L+S+
Sbjct: 579 EPSAPGVGNASGSSAGSIVGHKICNVSWVNQDEDLGSYILGLLQSL 624
>gi|241245173|ref|XP_002402434.1| cleavage and polyadenylation specificity factor, putative [Ixodes
scapularis]
gi|215496345|gb|EEC05985.1| cleavage and polyadenylation specificity factor, putative [Ixodes
scapularis]
Length = 596
Score = 627 bits (1618), Expect = e-177, Method: Compositional matrix adjust.
Identities = 288/439 (65%), Positives = 360/439 (82%), Gaps = 7/439 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF ++VG+QS+VYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL
Sbjct: 163 MFHIRVGSQSVVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 222
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KVH+C+D+GGKVLIPVFALGRAQELCILLETYW+RMNL+ PIYFAVGLTEKATNYYKMF
Sbjct: 223 TKVHDCIDKGGKVLIPVFALGRAQELCILLETYWDRMNLKVPIYFAVGLTEKATNYYKMF 282
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
ITWTNQKIRKTFVQRNMFDFKHI+PFD++FIDNPGPMVVFATPGMLH+GLSL IFKKWAP
Sbjct: 283 ITWTNQKIRKTFVQRNMFDFKHIKPFDRAFIDNPGPMVVFATPGMLHAGLSLQIFKKWAP 342
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
VE NM+IMPG+CV GT+GHK+LSG +K+E +N+Q+++VKM+++YMSFSAHADAKGIMQLI
Sbjct: 343 VEGNMVIMPGYCVAGTVGHKILSGARKVELDNRQVVEVKMSVQYMSFSAHADAKGIMQLI 402
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
CEP NVLLVHGEASKM FL++K+ QEFN+DC+MPANGE+ + T I IDVS+ LLK
Sbjct: 403 HQCEPSNVLLVHGEASKMEFLRKKVLQEFNIDCYMPANGETVQIDTPDIIPIDVSLPLLK 462
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
+ S+ P + IHG L+++D++I LMD +A +E+G+ H +RFT+ V +S
Sbjct: 463 RALSQGGSDSKKP---KLIHGALIMQDNTIRLMDSGQAMQELGVVGHQIRFTTTVSVSSR 519
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLLSV-ERIDDKNKRVFVTWPN 419
+ KT + IY + + ++D S + L + S+S+ S L+ V + D+ NK + V+W
Sbjct: 520 GSLTKTADAIYGNIKNKIKDPSIQL---LPDGSISVASALIKVGGQEDESNKAICVSWTL 576
Query: 420 QDEEVGKIVLHVLKSMSNT 438
QDEE+G +L +++ M ++
Sbjct: 577 QDEELGSYLLGLVQEMRHS 595
>gi|442751667|gb|JAA67993.1| Putative cleavage and polyadenylation specificity factor cpsf
subunit [Ixodes ricinus]
Length = 596
Score = 623 bits (1607), Expect = e-175, Method: Compositional matrix adjust.
Identities = 287/439 (65%), Positives = 357/439 (81%), Gaps = 7/439 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF ++VG+QS+VYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL
Sbjct: 163 MFHIRVGSQSVVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 222
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KVH+C+D GGKVLIPVFALGRAQELCILLETYW+RMNL+ PIYFAVGLTEKATNYYKMF
Sbjct: 223 TKVHDCIDEGGKVLIPVFALGRAQELCILLETYWDRMNLKVPIYFAVGLTEKATNYYKMF 282
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
ITWTNQKIRKTFVQRNMFDFKHI+PFD++FIDNPGPMVVFATPGMLH+GLSL IFKKWAP
Sbjct: 283 ITWTNQKIRKTFVQRNMFDFKHIKPFDRAFIDNPGPMVVFATPGMLHAGLSLQIFKKWAP 342
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
VE NM+IMPG+CV GT+GHK+LSG +K+E +N+Q+++VKM+++YMSFSAHADAKGIMQLI
Sbjct: 343 VEGNMVIMPGYCVAGTVGHKILSGARKVELDNRQVVEVKMSVQYMSFSAHADAKGIMQLI 402
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
CEP NVLLVHGEASKM L++K+ QEFN+DC+MPANGE+ + T I IDVS+ LLK
Sbjct: 403 HQCEPSNVLLVHGEASKMELLRKKVLQEFNIDCYMPANGETVQIDTPDIIPIDVSLPLLK 462
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
+ S+ P + IHG L+++D+ I LMD +A +E+G+ H +RFT+ V +S
Sbjct: 463 RALSQGGSDSKKP---KLIHGALIMQDNMIRLMDSGQAMQELGVVGHQIRFTTTVSVSSR 519
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLLSV-ERIDDKNKRVFVTWPN 419
+ KT + IY + + ++D S + L + S+S+ S L+ V + D+ NK + V+W
Sbjct: 520 GSLTKTADAIYGNIKNKIKDPSIQL---LPDGSISVASALIKVGGQEDESNKAICVSWTL 576
Query: 420 QDEEVGKIVLHVLKSMSNT 438
QDEE+G +L +++ M ++
Sbjct: 577 QDEELGSYLLGLVQEMRHS 595
>gi|321457255|gb|EFX68345.1| hypothetical protein DAPPUDRAFT_218302 [Daphnia pulex]
Length = 597
Score = 622 bits (1603), Expect = e-175, Method: Compositional matrix adjust.
Identities = 286/436 (65%), Positives = 362/436 (83%), Gaps = 7/436 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF VKVGNQS+VYTGDYNMTPDRHLGAAWIDKCRP++LI+ESTYATTIRDSKRCRERDFL
Sbjct: 162 MFHVKVGNQSVVYTGDYNMTPDRHLGAAWIDKCRPNILISESTYATTIRDSKRCRERDFL 221
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVH+CVDRGGKVLIPVFALGRAQELCILLETYWERMNL+APIYFAVGLTEKA NYYKMF
Sbjct: 222 KKVHDCVDRGGKVLIPVFALGRAQELCILLETYWERMNLKAPIYFAVGLTEKANNYYKMF 281
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
ITWTNQKIRKTFVQRNMF+FKHIRPFDKS+ D PGPMVVFATPGMLH+GLSL +FKKWAP
Sbjct: 282 ITWTNQKIRKTFVQRNMFEFKHIRPFDKSYADTPGPMVVFATPGMLHAGLSLQLFKKWAP 341
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
ENNMLIMPG+CV GT+G KVLSG KK+E EN+Q+I+VKM+++YMSFSAHADAKGIMQLI
Sbjct: 342 NENNMLIMPGYCVSGTVGAKVLSGAKKVEIENRQVIEVKMSVQYMSFSAHADAKGIMQLI 401
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
QYCEP+NVLLVHGE KM FLK+KI+QE+ +DC+MPANGES ++ + +DVS++LLK
Sbjct: 402 QYCEPQNVLLVHGEGVKMEFLKQKIQQEYGIDCYMPANGESAYIPCKPVVPVDVSLSLLK 461
Query: 301 EEAVKYNS-EPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISD 359
+E Y++ P+P + ++HGVLV+KD+ + LM+ ++A E+G++RH +RFTS +Q+
Sbjct: 462 KEMALYDTVGAPDPKRPCRMHGVLVMKDNVLKLMEPEQAFGELGLNRHSIRFTSCIQLRL 521
Query: 360 SSPINKTLELIYDQLLSYLQDKSQEYKIQLT-ESSLSIDSVLLSVERIDDKNKR-VFVTW 417
++ + DQ+ L ++ Y ++ + E L+++SV + +E D ++++ + ++W
Sbjct: 522 PGTASRAM----DQIQLILNSQNTNYPVKPSDEGLLNLESVSIKMETTDKEDRKSMMISW 577
Query: 418 PNQDEEVGKIVLHVLK 433
QDE++ ++ L+
Sbjct: 578 ALQDEDLATWIMSSLR 593
>gi|321468347|gb|EFX79332.1| hypothetical protein DAPPUDRAFT_304859 [Daphnia pulex]
Length = 597
Score = 620 bits (1600), Expect = e-174, Method: Compositional matrix adjust.
Identities = 287/436 (65%), Positives = 361/436 (82%), Gaps = 7/436 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF VKVGNQS+VYTGDYNMTPDRHLGAAWIDKCRP++LI+ESTYATTIRDSKRCRERDFL
Sbjct: 162 MFHVKVGNQSVVYTGDYNMTPDRHLGAAWIDKCRPNILISESTYATTIRDSKRCRERDFL 221
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVH+CVDRGGKVLIPVFALGRAQELCILLETYWERMNL+APIYFAVGLTEKA NYYKMF
Sbjct: 222 KKVHDCVDRGGKVLIPVFALGRAQELCILLETYWERMNLKAPIYFAVGLTEKANNYYKMF 281
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
ITWTNQKIRKTFVQRNMF+FKHIRPFDKS+ D PGPMVVFATPGMLH+GLSL +FKKWAP
Sbjct: 282 ITWTNQKIRKTFVQRNMFEFKHIRPFDKSYADTPGPMVVFATPGMLHAGLSLQLFKKWAP 341
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
ENNMLIMPG+CV GT+G KVLSG KK+E EN+Q+I+VKM+++YMSFSAHADAKGIMQLI
Sbjct: 342 NENNMLIMPGYCVSGTVGAKVLSGAKKVEIENRQVIEVKMSVQYMSFSAHADAKGIMQLI 401
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
QYCEP+NVLLVHGE KM FLK+KI+QE+ +DC+MPANGES ++ + +DVS++LLK
Sbjct: 402 QYCEPQNVLLVHGEGVKMEFLKQKIQQEYGIDCYMPANGESAYIPCKPVVPVDVSLSLLK 461
Query: 301 EEAVKYNS-EPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISD 359
+E Y++ P+P + ++HGVLV+KD+ + LM+ ++A E+G++RH +RFTS +Q+
Sbjct: 462 KEMALYDTVGAPDPKRPCRMHGVLVMKDNVLKLMEPEQAFGELGLNRHSIRFTSCIQLR- 520
Query: 360 SSPINKTLELIYDQLLSYLQDKSQEYKIQLT-ESSLSIDSVLLSVERIDDKNKR-VFVTW 417
+ T DQ+ L ++ Y ++ + E L+++SV + +E D ++++ + ++W
Sbjct: 521 ---LPGTAIRAMDQIQLILNSQNTNYPVKPSDEGLLNLESVSIKMETTDKEDRKSMMISW 577
Query: 418 PNQDEEVGKIVLHVLK 433
QDE++ ++ L+
Sbjct: 578 ALQDEDLATWIMSSLR 593
>gi|405958713|gb|EKC24813.1| Integrator complex subunit 11 [Crassostrea gigas]
Length = 575
Score = 614 bits (1583), Expect = e-173, Method: Compositional matrix adjust.
Identities = 280/433 (64%), Positives = 352/433 (81%), Gaps = 2/433 (0%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +KVG QS+VYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL
Sbjct: 136 MFHIKVGQQSVVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 195
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVH+CV++GGKVLIPVFALGRAQELCILLE+YW+RMN++ PIYF++GLTEKA +YYK+F
Sbjct: 196 KKVHDCVEKGGKVLIPVFALGRAQELCILLESYWDRMNIKVPIYFSLGLTEKANHYYKLF 255
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
ITWT+QKI+KTFVQRNMF+FKHI+PFD++FIDNPGPMVVFATPGMLH+GLSL IFKKWAP
Sbjct: 256 ITWTSQKIKKTFVQRNMFEFKHIKPFDRAFIDNPGPMVVFATPGMLHAGLSLQIFKKWAP 315
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPG+CV GT+GHK+L+G +KLE ENKQI+DVK++++YMSFSAHADAKGIMQLI
Sbjct: 316 NELNMVIMPGYCVAGTVGHKILNGARKLELENKQILDVKLSVQYMSFSAHADAKGIMQLI 375
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
CEP+NV+LVHGEA+KM FLK KI+QEF +DC+ PANGES ++T I +D+ ++LLK
Sbjct: 376 SQCEPRNVMLVHGEAAKMDFLKNKIQQEFGIDCYKPANGESVTIETTHNIHVDIPLSLLK 435
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
EA K PP+ + R +HG LV++ + + ++D +A KE+G+ H +RFTS + I +
Sbjct: 436 REASKTGLLPPDAKRPRVLHGALVMRGNRLQIIDPSQALKEMGVEEHHLRFTSTISIEEE 495
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLLSVERIDDKNKRVFVTWPNQ 420
P NK E IY + L+D S Y + S S+L+ V +D+ K V V+W +Q
Sbjct: 496 GPSNKVAEKIYQLVKRKLKDFSIHYSSDGSISISD--SLLIQVSGDEDETKSVLVSWSHQ 553
Query: 421 DEEVGKIVLHVLK 433
DE++G +L +L+
Sbjct: 554 DEDIGSFLLQLLR 566
>gi|260790823|ref|XP_002590440.1| hypothetical protein BRAFLDRAFT_289082 [Branchiostoma floridae]
gi|229275634|gb|EEN46451.1| hypothetical protein BRAFLDRAFT_289082 [Branchiostoma floridae]
Length = 597
Score = 608 bits (1568), Expect = e-171, Method: Compositional matrix adjust.
Identities = 276/436 (63%), Positives = 355/436 (81%), Gaps = 6/436 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +KVG++S+VYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL
Sbjct: 156 MFLIKVGSESVVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 215
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHE +++GGKVLIPVFALGRAQELCILLET+WERMN++APIYF+ GLTEKA NYY++F
Sbjct: 216 KKVHETIEKGGKVLIPVFALGRAQELCILLETFWERMNIKAPIYFSTGLTEKANNYYRLF 275
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
ITWTNQKIRKTFV+RNMF+FKHI+ FD+S+IDNPGPMVVFATPGMLH+GLSL IFKKWAP
Sbjct: 276 ITWTNQKIRKTFVKRNMFEFKHIKAFDRSYIDNPGPMVVFATPGMLHAGLSLQIFKKWAP 335
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
NM+IMPG+CV GT+GHK+L+G++K+EFENK++++V+M +EYMSFSAHADA+GIMQLI
Sbjct: 336 DSKNMVIMPGYCVAGTVGHKILNGIRKIEFENKKVLEVRMTVEYMSFSAHADAQGIMQLI 395
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
+YCEP+NV+LVHGE KM FL +KI QEF + CF PANGE+ ++T+ I +D S+NLLK
Sbjct: 396 RYCEPRNVMLVHGEEQKMDFLSKKITQEFGVQCFFPANGETVTIETNPNIPVDASLNLLK 455
Query: 301 EEA-VKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISD 359
EA + P+P + R +HG L++K++S+ L+ +A +E+G+ H +RFTS V I D
Sbjct: 456 REASLTGLCRLPDPKRPRVLHGALIMKENSLRLVSPSQAMRELGLKEHEIRFTSTVTIHD 515
Query: 360 SSPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLLSVERI--DDKNKRVFVTW 417
P T+E +Y L L+D S ++ + + S+ IDSVL+ V +++ K V+W
Sbjct: 516 PQPTQATVERLYTHLKGTLKDHSIQH---MPDGSIMIDSVLIKVSGSSEEEETKDFLVSW 572
Query: 418 PNQDEEVGKIVLHVLK 433
P +DE++G ++ VL+
Sbjct: 573 PYEDEDLGSHLMTVLR 588
>gi|427785581|gb|JAA58242.1| Putative mrna cleavage and polyadenylation factor ii complex brr5
cpsf subunit [Rhipicephalus pulchellus]
Length = 587
Score = 602 bits (1552), Expect = e-169, Method: Compositional matrix adjust.
Identities = 280/438 (63%), Positives = 356/438 (81%), Gaps = 17/438 (3%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF+++VG+QS+VYTGDYNMTPDRHLGAAW+DKCRPDLLITESTYATTIRDSKRCRERDFL
Sbjct: 163 MFRIRVGSQSVVYTGDYNMTPDRHLGAAWLDKCRPDLLITESTYATTIRDSKRCRERDFL 222
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KVH+C+D+GGKVLIPVFALGRAQELCILLETYW+RMNL+ PIYFAVGLTEKATNYYKMF
Sbjct: 223 TKVHDCIDKGGKVLIPVFALGRAQELCILLETYWDRMNLRVPIYFAVGLTEKATNYYKMF 282
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
ITWTNQKIRKTFVQRNMFDFKHI+PFD++FIDNPGPMVVFATPGMLH+GLSL IFKKWAP
Sbjct: 283 ITWTNQKIRKTFVQRNMFDFKHIKPFDRAFIDNPGPMVVFATPGMLHAGLSLQIFKKWAP 342
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPG+CV GT+GHK+LSG +K+E +N+Q+++VKM+++YMSFSAHADAKGIMQLI
Sbjct: 343 FEANMVIMPGYCVAGTVGHKILSGARKVELDNRQVVEVKMSVQYMSFSAHADAKGIMQLI 402
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
Q CEP NVLLVHGEA KM FL++KI+QEF +DC MPANGE+ ++T + IDV+V LLK
Sbjct: 403 QQCEPANVLLVHGEAGKMDFLRKKIRQEFAIDCSMPANGETVVIETPDTVPIDVAVPLLK 462
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
A+ +S+ P + IHG LV+ D + L+D +A +E+G++ H +RFT+ V +
Sbjct: 463 -HALGSDSKRP-----KLIHGALVMHDRGVRLLDSRQAMQELGLTSHQIRFTTTV----A 512
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLLSV--ERIDDKNKRVFVTWP 418
SP + D L + L+D+ ++ L + S+++ S L+ V + D ++ + V+W
Sbjct: 513 SP----RAALADALYTRLRDRLPGVQL-LPDGSIAVASALVKVGGQEDDAASRAICVSWT 567
Query: 419 NQDEEVGKIVLHVLKSMS 436
QDE++G +L +++ M+
Sbjct: 568 LQDEDLGSHLLSLVQEMA 585
>gi|443694305|gb|ELT95478.1| hypothetical protein CAPTEDRAFT_151615 [Capitella teleta]
Length = 600
Score = 600 bits (1546), Expect = e-168, Method: Compositional matrix adjust.
Identities = 277/436 (63%), Positives = 351/436 (80%), Gaps = 6/436 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
M ++VG QS+VYTGDYNMTPDRHLGAAWID+CRPDLLITESTYATTIRDSKRCRERDFL
Sbjct: 163 MIHIRVGEQSVVYTGDYNMTPDRHLGAAWIDRCRPDLLITESTYATTIRDSKRCRERDFL 222
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVH+ VD+GGKVLIPVFALGRAQELCILLETYW+RMNL+ PIYF++GLTEKA +YYKMF
Sbjct: 223 KKVHDAVDKGGKVLIPVFALGRAQELCILLETYWDRMNLKVPIYFSMGLTEKANHYYKMF 282
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
ITWTNQKI+ TFVQRNMFDFKHI+PFDK + DNPGPMVVFATPGMLH GLSL IFKKW
Sbjct: 283 ITWTNQKIKNTFVQRNMFDFKHIKPFDKVYADNPGPMVVFATPGMLHGGLSLQIFKKWCG 342
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPG+CV GTIG K+L+G +K+E ENKQII+VKM+++YMSFSAHADAKGIMQLI
Sbjct: 343 GEKNMVIMPGYCVSGTIGWKILNGQRKIEMENKQIIEVKMSVQYMSFSAHADAKGIMQLI 402
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
+ C+P NV+LVHGEA KM FLK KI +EF + CF PANGE+ ++ I +D+S+NLLK
Sbjct: 403 RQCQPSNVMLVHGEAEKMEFLKTKINEEFGISCFNPANGETVSIEARHGIPVDISLNLLK 462
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
E K S P P + R +HG LV+++ ++ L++ EA KE+G++ H VRFTS + I+D
Sbjct: 463 REPAKSASGLPEPKRSRLLHGALVMREQAMKLVEPREAMKELGLNEHDVRFTSTLTITD- 521
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQL-TESSLSI-DSVLLSVERIDDKNKRVFVTWP 418
N ++ + ++L+ ++ +Q++ +Q ++ S+SI DSVL+ V D+ +K V ++W
Sbjct: 522 ---NDSVAEMAEKLMKIFRESAQQHSLQQSSDGSISIADSVLVKVSESDENDKSVLLSWS 578
Query: 419 NQDEEVGKIVLHVLKS 434
QDEE+G +L ++K+
Sbjct: 579 LQDEELGSHLLKIIKT 594
>gi|326932364|ref|XP_003212289.1| PREDICTED: integrator complex subunit 11-like [Meleagris gallopavo]
Length = 600
Score = 594 bits (1532), Expect = e-167, Method: Compositional matrix adjust.
Identities = 278/435 (63%), Positives = 353/435 (81%), Gaps = 7/435 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+KVG +S+VYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL
Sbjct: 163 MFQIKVGCESVVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 222
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHE V+RGGKVLIPVFALGRAQELCILLET+WERMNL+APIYF+ GLTEKA +YYK+F
Sbjct: 223 KKVHETVERGGKVLIPVFALGRAQELCILLETFWERMNLKAPIYFSTGLTEKANHYYKLF 282
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
ITWTNQKIRKTFVQRNMF+FKHI+ FD++F DNPGPMVVFATPGMLH+G SL IF+KWA
Sbjct: 283 ITWTNQKIRKTFVQRNMFEFKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAG 342
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPG+CVQGT+GHK+LSG +KLE E +QI++VKM +EYMSFSAHADAKGIMQLI
Sbjct: 343 NEKNMVIMPGYCVQGTVGHKILSGQRKLEMEGRQILEVKMQVEYMSFSAHADAKGIMQLI 402
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
+ EP+NVLLVHGEA KM FLK+KI+QEF+++C+MPANGE+ + T+ I +D+S+ LLK
Sbjct: 403 RQAEPRNVLLVHGEAKKMEFLKQKIEQEFHVNCYMPANGETTTIFTNPSIPVDISLGLLK 462
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
E P+ K + +HG L++KD+S L+ ++A KE+G++ H +RFT V I D
Sbjct: 463 RETAM--GLLPDAKKPKLMHGTLIMKDNSFRLVSPEQALKELGLAEHQLRFTCRVHIQDP 520
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLL-SVERIDDKNKRV-FVTWP 418
++T+ +Y+ L L+D S ++ L + S++++S+L+ + ++D+ +V V+W
Sbjct: 521 RKEHETVLRVYNHLKGVLKDYSVQH---LPDGSITVESILIQATAHLEDQGTKVLLVSWT 577
Query: 419 NQDEEVGKIVLHVLK 433
QDEE+G + +LK
Sbjct: 578 YQDEELGSYLTSLLK 592
>gi|449268484|gb|EMC79348.1| Integrator complex subunit 11 [Columba livia]
Length = 600
Score = 594 bits (1532), Expect = e-167, Method: Compositional matrix adjust.
Identities = 278/435 (63%), Positives = 350/435 (80%), Gaps = 7/435 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+KVG +S+VYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL
Sbjct: 163 MFQIKVGCESVVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 222
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHE V+RGGKVLIPVFALGRAQELCILLET+WERMNL+APIYF+ GLTEKA +YYK+F
Sbjct: 223 KKVHETVERGGKVLIPVFALGRAQELCILLETFWERMNLKAPIYFSTGLTEKANHYYKLF 282
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
ITWTNQKIRKTFVQRNMF+FKHI+ FD++F DNPGPMVVFATPGMLH+G SL IF+KWA
Sbjct: 283 ITWTNQKIRKTFVQRNMFEFKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAG 342
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPG+CVQGT+GHK+LSG +KLE E +QI++VKM +EYMSFSAHADAKGIMQLI
Sbjct: 343 NEKNMVIMPGYCVQGTVGHKILSGQRKLEMEGRQILEVKMQVEYMSFSAHADAKGIMQLI 402
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
+ EP+NVLLVHGEA KM FLK+KI+QEF+++C+MPANGE+ + T+ I +D+S+ LLK
Sbjct: 403 RQAEPRNVLLVHGEAKKMEFLKQKIEQEFHVNCYMPANGETTTIFTNPSIPVDISLGLLK 462
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
E P+ K + +HG L++KD+S L+ ++A KE+G++ H +RFT V I D
Sbjct: 463 RETA--TGLLPDSKKPKLMHGTLMMKDNSFRLVSPEQALKELGLTEHQLRFTCRVHIQDP 520
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLL--SVERIDDKNKRVFVTWP 418
++T+ +Y+ L L+D S ++ L + S++++S+L+ + D K + V+W
Sbjct: 521 RKEHETVLRVYNHLKGVLKDYSVQH---LPDGSITVESILIQATAHSEDQGTKVLLVSWT 577
Query: 419 NQDEEVGKIVLHVLK 433
QDEE+G + +LK
Sbjct: 578 YQDEELGSFLTSLLK 592
>gi|224079882|ref|XP_002197797.1| PREDICTED: integrator complex subunit 11 [Taeniopygia guttata]
Length = 600
Score = 594 bits (1531), Expect = e-166, Method: Compositional matrix adjust.
Identities = 278/435 (63%), Positives = 349/435 (80%), Gaps = 7/435 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+KVG +S+VYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL
Sbjct: 163 MFQIKVGCESVVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 222
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHE V+RGGKVLIPVFALGRAQELCILLET+WERMNL+APIYF+ GLTEKA +YYK+F
Sbjct: 223 KKVHETVERGGKVLIPVFALGRAQELCILLETFWERMNLKAPIYFSTGLTEKANHYYKLF 282
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
ITWTNQKIRKTFVQRNMF+FKHI+ FD++F DNPGPMVVFATPGMLH+G SL IF+KWA
Sbjct: 283 ITWTNQKIRKTFVQRNMFEFKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAG 342
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPG+CVQGT+GHK+LSG +KLE E +QI++VKM +EYMSFSAHADAKGIMQLI
Sbjct: 343 NEKNMVIMPGYCVQGTVGHKILSGQRKLEMEGRQILEVKMQVEYMSFSAHADAKGIMQLI 402
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
+ EP+NVLLVHGEA KM FL++KI+QEF++ CFMPANGE+ + T+ I +D+S+ LLK
Sbjct: 403 RQAEPRNVLLVHGEAKKMEFLRQKIEQEFHVSCFMPANGETTTILTNPCIPVDISLGLLK 462
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
E P+ K + +HG L++KD+S L+ ++A KE+G++ H +RFT V I D
Sbjct: 463 RETAI--GAAPDAKKPKLMHGTLLMKDNSFRLVSPEQALKELGLAEHQLRFTCRVHIQDP 520
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLL--SVERIDDKNKRVFVTWP 418
++T+ +Y+ L L+D S ++ L + S++++S+L+ + D K + V+W
Sbjct: 521 RKEHETVLRVYNHLKGILKDYSVQH---LPDGSITVESILIQATAHSEDQGTKVLLVSWT 577
Query: 419 NQDEEVGKIVLHVLK 433
QDEE+G + +LK
Sbjct: 578 YQDEELGSYLTSLLK 592
>gi|61098197|ref|NP_001012854.1| integrator complex subunit 11 [Gallus gallus]
gi|75571225|sp|Q5ZIH0.1|INT11_CHICK RecName: Full=Integrator complex subunit 11; Short=Int11; AltName:
Full=Cleavage and polyadenylation-specific factor 3-like
protein; Short=CPSF3-like protein
gi|53135966|emb|CAG32473.1| hypothetical protein RCJMB04_26e19 [Gallus gallus]
Length = 600
Score = 593 bits (1530), Expect = e-166, Method: Compositional matrix adjust.
Identities = 278/435 (63%), Positives = 350/435 (80%), Gaps = 7/435 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+KVG +S+VYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL
Sbjct: 163 MFQIKVGCESVVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 222
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHE V+RGGKVLIPVFALGRAQELCILLET+WERMNL+APIYF+ GLTEKA +YYK+F
Sbjct: 223 KKVHETVERGGKVLIPVFALGRAQELCILLETFWERMNLKAPIYFSTGLTEKANHYYKLF 282
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
ITWTNQKIRKTFVQRNMF+FKHI+ FD++F DNPGPMVVFATPGMLH+G SL IF+KWA
Sbjct: 283 ITWTNQKIRKTFVQRNMFEFKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAG 342
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPG+CVQGT+GHK+LSG +KLE E +QI++VKM +EYMSFSAHADAKGIMQLI
Sbjct: 343 NEKNMVIMPGYCVQGTVGHKILSGQRKLEMEGRQILEVKMQVEYMSFSAHADAKGIMQLI 402
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
+ EP+NVLLVHGEA KM FLK+KI+QEF+++C+MPANGE+ + T+ I +D+S+ LLK
Sbjct: 403 RQAEPRNVLLVHGEAKKMEFLKQKIEQEFHVNCYMPANGETTTIFTNPSIPVDISLGLLK 462
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
E P+ K + +HG L++KD+S L+ ++A KE+G++ H +RFT V I D
Sbjct: 463 RETAI--GLLPDAKKPKLMHGTLIMKDNSFRLVSPEQALKELGLAEHQLRFTCRVHIQDP 520
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLL--SVERIDDKNKRVFVTWP 418
++T+ +Y+ L L+D S ++ L + S++++S+L+ + D K + V+W
Sbjct: 521 RKEHETVLRVYNHLKGVLKDYSVQH---LPDGSITVESILIQATAHSEDQGTKVLLVSWT 577
Query: 419 NQDEEVGKIVLHVLK 433
QDEE+G + +LK
Sbjct: 578 YQDEELGSYLTSLLK 592
>gi|410928941|ref|XP_003977858.1| PREDICTED: integrator complex subunit 11-like [Takifugu rubripes]
Length = 601
Score = 592 bits (1525), Expect = e-166, Method: Compositional matrix adjust.
Identities = 276/436 (63%), Positives = 349/436 (80%), Gaps = 8/436 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
M Q+KVG++S+VYTGDYNMTPDRHLGAAWIDKCRPD+LI+ESTYATTIRDSKRCRERDFL
Sbjct: 163 MVQIKVGSESVVYTGDYNMTPDRHLGAAWIDKCRPDILISESTYATTIRDSKRCRERDFL 222
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHE ++RGGKVLIPVFALGRAQELCILLET+WERMNL+APIYF+ GLTEKA +YYK+F
Sbjct: 223 KKVHESIERGGKVLIPVFALGRAQELCILLETFWERMNLKAPIYFSTGLTEKANHYYKLF 282
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
ITWTNQKIRKTFVQRNMF+FKHI+ FD+S+ DNPGPMVVFATPGMLH+G SL IFKKWA
Sbjct: 283 ITWTNQKIRKTFVQRNMFEFKHIKAFDRSYADNPGPMVVFATPGMLHAGQSLQIFKKWAG 342
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPG+CVQGTIGHK+L+G +KLE E + +DVK+ +EYMSFSAHADAKGIMQLI
Sbjct: 343 NEKNMVIMPGYCVQGTIGHKILNGQRKLEMEGRATLDVKLQVEYMSFSAHADAKGIMQLI 402
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
+ EP+N+LLVHGEA+KM FLK KI+QEFN+DC+MPANGE+ V T+ + +D+S+NLLK
Sbjct: 403 RMAEPRNMLLVHGEAAKMEFLKGKIEQEFNIDCYMPANGETVTVTTNPSVPVDISLNLLK 462
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
E V P+P K R +HG L++K++S+ L+ ++A KE+G++ H +RFT V + D
Sbjct: 463 REMV-LGGPLPDPKKPRAMHGTLIMKENSLRLVSSEQALKELGLNEHQLRFTCRVHLQDP 521
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQ-LTESSLSIDSVLLSVERI--DDKNKRVFVTW 417
N TL +Y L S L+ Y +Q L + ++ ++S+++ V D K + ++W
Sbjct: 522 HSDNDTLHRVYTHLKSVLKG----YTVQHLPDGTVMVESIVVKVSSSAEDASMKVLLLSW 577
Query: 418 PNQDEEVGKIVLHVLK 433
QDE++G + +LK
Sbjct: 578 SYQDEDLGSFLSTLLK 593
>gi|432866809|ref|XP_004070946.1| PREDICTED: integrator complex subunit 11-like [Oryzias latipes]
Length = 599
Score = 590 bits (1521), Expect = e-165, Method: Compositional matrix adjust.
Identities = 276/436 (63%), Positives = 352/436 (80%), Gaps = 8/436 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
M +KVG++S+VYTGDYNMTPDRHLGAAWIDKCRPD+LI+ESTYATTIRDSKRCRERDFL
Sbjct: 163 MVYIKVGSESVVYTGDYNMTPDRHLGAAWIDKCRPDILISESTYATTIRDSKRCRERDFL 222
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHE ++RGGKVLIPVFALGRAQELCILLET+WERMNL+APIYF+ G+TEKA +YYK+F
Sbjct: 223 KKVHESIERGGKVLIPVFALGRAQELCILLETFWERMNLKAPIYFSTGMTEKANHYYKLF 282
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
ITWTNQKIRKTFVQRNMF+FKHI+ FD+S+ DNPGPMVVFATPGMLH+G SL IFKKWA
Sbjct: 283 ITWTNQKIRKTFVQRNMFEFKHIKAFDRSYADNPGPMVVFATPGMLHAGQSLQIFKKWAG 342
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPG+CVQGTIGHK+L+G +KLE E + +DVK+ +EYMSFSAHADAKGIMQLI
Sbjct: 343 NEKNMVIMPGYCVQGTIGHKILNGQRKLEMEGRATLDVKLQVEYMSFSAHADAKGIMQLI 402
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
+ EP+N+LLVHGEA+KM FLK KI+QEF++DC+MPANGE+ V+T+ + +D+SVNLLK
Sbjct: 403 RMAEPRNMLLVHGEAAKMEFLKGKIEQEFSIDCYMPANGETVTVKTNPSVPVDMSVNLLK 462
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
E + P+P K R +HG L++K+SS+ L+ ++A KE+G++ H +RFT V + D
Sbjct: 463 RE-MALGGPLPDPKKPRSMHGTLIMKESSLKLVSSEQALKELGLNEHQLRFTCRVPLQDP 521
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQ-LTESSLSIDSVLLSV-ERIDDKNKRV-FVTW 417
++TL IY L S L+ Y IQ L + ++ ++S+++ V +D N ++ ++W
Sbjct: 522 HSDSETLHRIYTHLKSLLKG----YTIQHLPDGTVMVESIVIKVSSSAEDANAKILLLSW 577
Query: 418 PNQDEEVGKIVLHVLK 433
QDE++G + VLK
Sbjct: 578 SYQDEDLGSFLSSVLK 593
>gi|327288530|ref|XP_003228979.1| PREDICTED: integrator complex subunit 11-like [Anolis carolinensis]
Length = 600
Score = 590 bits (1521), Expect = e-165, Method: Compositional matrix adjust.
Identities = 280/436 (64%), Positives = 351/436 (80%), Gaps = 9/436 (2%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+KVG +S+VYTGDYNMTPDRHLGAAWIDKCRPDLLI+ESTYATTIRDSKRCRERDFL
Sbjct: 163 MFQIKVGCESVVYTGDYNMTPDRHLGAAWIDKCRPDLLISESTYATTIRDSKRCRERDFL 222
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHE ++RGGKVLIPVFALGRAQELCILLET+WERMNL+APIYF++GLTEKA +YYK+F
Sbjct: 223 KKVHETIERGGKVLIPVFALGRAQELCILLETFWERMNLKAPIYFSMGLTEKANHYYKLF 282
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
ITWTNQKIRKTFVQRNMF+FKHI+ FD++F DNPGPMVVFATPGMLH+G SL IFKKWA
Sbjct: 283 ITWTNQKIRKTFVQRNMFEFKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFKKWAG 342
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPG+CVQGT+GHK+L G +KLE E +QI++VKM +EYMSFSAHADAKGIMQLI
Sbjct: 343 NEKNMVIMPGYCVQGTVGHKILGGQRKLEMEGRQILEVKMQVEYMSFSAHADAKGIMQLI 402
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
+ EP+NVLLVHGEA KM FLK+KI+QEF+++C MPANGE+ V T+ I +DVS+ LLK
Sbjct: 403 RQAEPRNVLLVHGEAKKMEFLKQKIEQEFHVNCHMPANGETVTVFTNPNIPVDVSLGLLK 462
Query: 301 EEAVKYNSEP-PNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISD 359
E + P P+ K + +HG L+IKD+S L+ ++A KE+G+ H +RFT + D
Sbjct: 463 RE---LSIGPVPDSKKPKLMHGTLIIKDNSFRLVSPEQALKELGLGEHQLRFTCRIHFQD 519
Query: 360 SSPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLL--SVERIDDKNKRVFVTW 417
++ +Y+ L S L+D S ++ L+++S+ ++S+L+ +V+ D K + V+W
Sbjct: 520 PRKEHEAGLRVYNHLKSVLKDYSVQH---LSDTSIMVESILIQVTVQSEDPATKLLLVSW 576
Query: 418 PNQDEEVGKIVLHVLK 433
QDEE+G V +LK
Sbjct: 577 TYQDEEMGSYVTSLLK 592
>gi|338722203|ref|XP_001496423.3| PREDICTED: integrator complex subunit 11 [Equus caballus]
Length = 571
Score = 588 bits (1515), Expect = e-165, Method: Compositional matrix adjust.
Identities = 274/435 (62%), Positives = 346/435 (79%), Gaps = 7/435 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+KVG++S+VYTGDYNMTPDRHLGAAWIDKCRP+LLITESTYATTIRDSKRCRERDFL
Sbjct: 134 MFQIKVGSESVVYTGDYNMTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFL 193
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHE V+RGGKVLIPVFALGRAQELCILLET+WER+NL+APIYF+ GLTEKA +YYK+F
Sbjct: 194 KKVHETVERGGKVLIPVFALGRAQELCILLETFWERVNLKAPIYFSTGLTEKANHYYKLF 253
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
ITWTNQKIRKTFVQRNMF+FKHI+ FD++F DNPGPMVVFATPGMLH+G SL IF+KWA
Sbjct: 254 ITWTNQKIRKTFVQRNMFEFKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAG 313
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPG+CVQGT+GHK+LSG +KLE E +Q+++VKM +EYMSFSAHADAKGIMQL+
Sbjct: 314 NEKNMVIMPGYCVQGTVGHKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLV 373
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
EP++VLLVHGEA KM FLK+KI+QEF ++C+MPANGE+ + T I + +S+ LLK
Sbjct: 374 GQAEPESVLLVHGEAKKMEFLKQKIEQEFRVNCYMPANGETVTLPTSPSIPVGISLGLLK 433
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
E + P+ K R +HG L++KDSS L+ ++A KE+G++ H +RFT V + DS
Sbjct: 434 REMAQ--GLLPSAKKPRLLHGTLIMKDSSFQLVSSEQALKELGLAEHQLRFTCRVHLHDS 491
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLL--SVERIDDKNKRVFVTWP 418
+T +Y L S L+D ++ L + S++++S+L+ + D K + V+W
Sbjct: 492 RKEQETAARVYSHLKSVLKDHCVQH---LPDGSVTVESILIQAAAHSEDPGTKVLLVSWT 548
Query: 419 NQDEEVGKIVLHVLK 433
QDEE+G + +LK
Sbjct: 549 YQDEELGSYLTSLLK 563
>gi|348503157|ref|XP_003439132.1| PREDICTED: integrator complex subunit 11-like [Oreochromis
niloticus]
Length = 601
Score = 587 bits (1514), Expect = e-165, Method: Compositional matrix adjust.
Identities = 276/436 (63%), Positives = 350/436 (80%), Gaps = 8/436 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
M +KVG++S+VYTGDYNMTPDRHLGAAWIDKCRPD+LI+ESTYATTIRDSKRCRERDFL
Sbjct: 163 MVHIKVGSESVVYTGDYNMTPDRHLGAAWIDKCRPDILISESTYATTIRDSKRCRERDFL 222
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHE ++RGGKVLIPVFALGRAQELCILLET+WERMNL+APIYF+ GLTEKA +YYK+F
Sbjct: 223 KKVHESIERGGKVLIPVFALGRAQELCILLETFWERMNLKAPIYFSTGLTEKANHYYKLF 282
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
ITWTNQKIRKTFVQRNMF+FKHI+ FD+S+ DNPGPMVVFATPGMLH+G SL IFKKWA
Sbjct: 283 ITWTNQKIRKTFVQRNMFEFKHIKAFDRSYADNPGPMVVFATPGMLHAGQSLQIFKKWAG 342
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPG+CVQGTIGHK+L+G +KLE E + +DVK+ +EYMSFSAHADAKGIMQLI
Sbjct: 343 NEKNMVIMPGYCVQGTIGHKILNGQRKLEMEGRATLDVKLQVEYMSFSAHADAKGIMQLI 402
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
+ EP+N+LLVHGEA KM FLK KI+QEF++DC+MPANGE+ V T+ + +D+S+NLLK
Sbjct: 403 RMAEPRNMLLVHGEAVKMEFLKGKIEQEFSIDCYMPANGETTTVTTNPSVPVDISLNLLK 462
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
E + P+P K R +HG L++K++S+ L+ ++A KE+G++ H +RFT +VQ+ D
Sbjct: 463 RE-MALGGPLPDPKKPRTMHGTLIMKENSLKLVSSEQALKELGLNEHQLRFTCHVQLQDP 521
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQ-LTESSLSIDSVLLSV-ERIDDKNKRV-FVTW 417
TL IY L S L+ Y IQ L + ++ ++S+++ V +D N +V ++W
Sbjct: 522 HSDADTLHRIYTHLKSVLKG----YTIQHLPDGTVIVESIVIKVSSSTEDSNTKVLLLSW 577
Query: 418 PNQDEEVGKIVLHVLK 433
QDE++G + +LK
Sbjct: 578 SYQDEDLGSFLSSLLK 593
>gi|354495797|ref|XP_003510015.1| PREDICTED: integrator complex subunit 11-like [Cricetulus griseus]
gi|344251677|gb|EGW07781.1| Integrator complex subunit 11 [Cricetulus griseus]
Length = 600
Score = 586 bits (1511), Expect = e-164, Method: Compositional matrix adjust.
Identities = 274/436 (62%), Positives = 345/436 (79%), Gaps = 7/436 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+KVG++S+VYTGDYNMTPDRHLGAAWIDKCRP+LLITESTYATTIRDSKRCRERDFL
Sbjct: 163 MFQIKVGSESVVYTGDYNMTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFL 222
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHE V+RGGKVLIPVFALGRAQELCILLET+WERMNL+ PIYF+ GLTEKA +YYK+F
Sbjct: 223 KKVHETVERGGKVLIPVFALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLF 282
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
ITWTNQKIRKTFVQRNMF+FKHI+ FD++F DNPGPMVVFATPGMLH+G SL IF+KWA
Sbjct: 283 ITWTNQKIRKTFVQRNMFEFKHIKAFDRTFADNPGPMVVFATPGMLHAGQSLQIFRKWAG 342
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPG+CVQGT+GHK+LSG +KLE E +Q+++VKM +EYMSFSAHADAKGIMQL+
Sbjct: 343 NEKNMVIMPGYCVQGTVGHKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLV 402
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
EP++VLLVHGEA KM FL++KI+QEF + C+MPANGE+ + T I + +S+ LLK
Sbjct: 403 GQAEPESVLLVHGEAKKMEFLRQKIEQEFRVSCYMPANGETVTLPTSPSIPVGISLGLLK 462
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
E V+ P K R +HG LV+KDSS L+ ++A KE+G++ H +RFT V + D+
Sbjct: 463 REMVQ--GLLPEAKKPRLLHGTLVMKDSSFRLVSSEQALKELGLAEHQLRFTCRVHLQDT 520
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLL--SVERIDDKNKRVFVTWP 418
+T +Y L S L+D ++ L + S++++S+L+ + D K + V+W
Sbjct: 521 RKEQETALRVYSHLKSTLKDHCVQH---LPDGSVTVESILIQAAAHSEDPGTKVLLVSWT 577
Query: 419 NQDEEVGKIVLHVLKS 434
QDEE+G + +LK+
Sbjct: 578 YQDEELGSFLTALLKN 593
>gi|444519369|gb|ELV12789.1| Integrator complex subunit 11 [Tupaia chinensis]
Length = 601
Score = 586 bits (1510), Expect = e-164, Method: Compositional matrix adjust.
Identities = 274/435 (62%), Positives = 343/435 (78%), Gaps = 7/435 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+KVG++S+VYTGDYNMTPDRHLGAAWIDKCRP+LLITESTYATTIRDSKRCRERDFL
Sbjct: 164 MFQIKVGSESVVYTGDYNMTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFL 223
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHE V+RGGKVLIPVFALGRAQELCILLET+WERMNL+ PIYF+ GLTEKA +YYK+F
Sbjct: 224 KKVHETVERGGKVLIPVFALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLF 283
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
ITWTNQKIRKTFVQRNMF+FKHI+ FD++F DNPGPMVVFATPGMLH+G SL IF+KWA
Sbjct: 284 ITWTNQKIRKTFVQRNMFEFKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAG 343
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPG+CVQGT+GHK+LSG +KLE E +Q+++VKM +EYMSFSAHADAKGIMQL+
Sbjct: 344 NEKNMVIMPGYCVQGTVGHKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLV 403
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
EP+NVLLVHGEA KM FLK+KI+QEF + C+MPANGE+ + T I + +S+ LLK
Sbjct: 404 GQAEPENVLLVHGEAKKMEFLKQKIEQEFRVSCYMPANGETVTLPTSPSIPVGISLGLLK 463
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
E + P K R +HG L++KDS+ L+ ++A KE+G++ H +RFT V + D+
Sbjct: 464 REMAQ--GLLPEAKKPRLLHGTLIMKDSNFRLVSSEQALKELGLAEHQLRFTCRVHLHDT 521
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLL--SVERIDDKNKRVFVTWP 418
+T IY L S L+D ++ L + S++++S+L+ + D K + V+W
Sbjct: 522 RKEQETATRIYSHLKSILKDHCVQH---LPDGSVTVESILVQAAAHSEDPGTKVLLVSWT 578
Query: 419 NQDEEVGKIVLHVLK 433
QDEE+G + +LK
Sbjct: 579 YQDEELGSYLTSLLK 593
>gi|301618510|ref|XP_002938656.1| PREDICTED: integrator complex subunit 11 isoform 1 [Xenopus
(Silurana) tropicalis]
gi|301618512|ref|XP_002938657.1| PREDICTED: integrator complex subunit 11 isoform 2 [Xenopus
(Silurana) tropicalis]
Length = 600
Score = 585 bits (1509), Expect = e-164, Method: Compositional matrix adjust.
Identities = 275/435 (63%), Positives = 347/435 (79%), Gaps = 7/435 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+KVG++S+VYTGDYNMTPDRHLGAAWIDKCRPDLLI+ESTYATTIRDSKRCRERDFL
Sbjct: 163 MFQIKVGSESVVYTGDYNMTPDRHLGAAWIDKCRPDLLISESTYATTIRDSKRCRERDFL 222
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHE V++GGKVLIPVFALGRAQELCILLET+WERMNL+APIYF+ GLTEKA +YYK+F
Sbjct: 223 KKVHETVEKGGKVLIPVFALGRAQELCILLETFWERMNLKAPIYFSTGLTEKANHYYKLF 282
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
ITWTNQKIRKTFVQRNMF+FKHI+ FD++F DNPGPMVVFATPGMLH+G SL IF+KWA
Sbjct: 283 ITWTNQKIRKTFVQRNMFEFKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAG 342
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
+ NM+IMPG+CVQGT+GHK+LSG +KLE E +Q ++VKM +EYMSFSAHADAKGIMQLI
Sbjct: 343 NDKNMVIMPGYCVQGTVGHKILSGQRKLEMEGRQTLEVKMQVEYMSFSAHADAKGIMQLI 402
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
+ EP+NVLLVHGEA KM FLK KI+QEF ++C MP NGE+ + T+ I +D+S+ LLK
Sbjct: 403 RQAEPRNVLLVHGEAKKMEFLKLKIEQEFRINCAMPLNGETATITTNPNIPVDISLGLLK 462
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
+E+ P+ K R +HG L++KD+S L+ ++A KE+G+ H +RFT + + D+
Sbjct: 463 KESSL--GLVPDCKKPRLMHGTLIMKDNSFRLVSPEQALKEIGLGEHQLRFTCRLHVQDT 520
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLL--SVERIDDKNKRVFVTWP 418
+T+ IY+ L S L+D S ++ L + S+ ++S+LL S D K + ++W
Sbjct: 521 HKEQETVLRIYNHLKSVLKDYSVQH---LPDGSIMVESILLQVSTHSEDPGTKVLLISWT 577
Query: 419 NQDEEVGKIVLHVLK 433
QDEE+G + +LK
Sbjct: 578 YQDEELGSFLTSLLK 592
>gi|281348165|gb|EFB23749.1| hypothetical protein PANDA_020173 [Ailuropoda melanoleuca]
Length = 591
Score = 585 bits (1508), Expect = e-164, Method: Compositional matrix adjust.
Identities = 272/435 (62%), Positives = 345/435 (79%), Gaps = 7/435 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+KVG++S+VYTGDYNMTPDRHLGAAWID+CRP+LLITESTYATTIRDSKRCRERDFL
Sbjct: 154 MFQIKVGSESVVYTGDYNMTPDRHLGAAWIDRCRPNLLITESTYATTIRDSKRCRERDFL 213
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHE V+RGGKVLIPVFALGRAQELCILLET+WERMNL+APIYF+ GLTEKA +YYK+F
Sbjct: 214 KKVHEAVERGGKVLIPVFALGRAQELCILLETFWERMNLKAPIYFSTGLTEKANHYYKLF 273
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
ITWTNQKIRKTFVQRNMF+FKHI+ FD++F DNPGPMVVFATPGMLH+G SL IF+KWA
Sbjct: 274 ITWTNQKIRKTFVQRNMFEFKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAG 333
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPG+CVQGT+GHK+LSG +KLE E +Q+++VKM +EYMSFSAHADAKGIMQL+
Sbjct: 334 NEKNMVIMPGYCVQGTVGHKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLV 393
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
EP++VLLVHGEA KM FLK+KI+QEF ++C+MPANGE+ + T I + +S+ LLK
Sbjct: 394 GQAEPESVLLVHGEAKKMEFLKQKIEQEFRVNCYMPANGETVTLPTSPSIPVGISLGLLK 453
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
E + P+ K R +HG L++KDS+ L+ ++A KE+G++ H +RFT V + D
Sbjct: 454 REMAQ--GLLPDAKKPRLLHGTLIMKDSNFRLVSSEQALKELGLAEHQLRFTCRVHLHDP 511
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLL--SVERIDDKNKRVFVTWP 418
+T +Y L S L+D ++ L + S++++S+L+ + D K + V+W
Sbjct: 512 RKEQETAMRVYSHLKSMLKDHCVQH---LPDGSVTVESILIQAAAHSEDPGTKVLLVSWT 568
Query: 419 NQDEEVGKIVLHVLK 433
QDEE+G + +LK
Sbjct: 569 YQDEELGSYLTSLLK 583
>gi|301788922|ref|XP_002929872.1| PREDICTED: integrator complex subunit 11-like [Ailuropoda
melanoleuca]
Length = 600
Score = 585 bits (1507), Expect = e-164, Method: Compositional matrix adjust.
Identities = 272/435 (62%), Positives = 345/435 (79%), Gaps = 7/435 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+KVG++S+VYTGDYNMTPDRHLGAAWID+CRP+LLITESTYATTIRDSKRCRERDFL
Sbjct: 163 MFQIKVGSESVVYTGDYNMTPDRHLGAAWIDRCRPNLLITESTYATTIRDSKRCRERDFL 222
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHE V+RGGKVLIPVFALGRAQELCILLET+WERMNL+APIYF+ GLTEKA +YYK+F
Sbjct: 223 KKVHEAVERGGKVLIPVFALGRAQELCILLETFWERMNLKAPIYFSTGLTEKANHYYKLF 282
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
ITWTNQKIRKTFVQRNMF+FKHI+ FD++F DNPGPMVVFATPGMLH+G SL IF+KWA
Sbjct: 283 ITWTNQKIRKTFVQRNMFEFKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAG 342
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPG+CVQGT+GHK+LSG +KLE E +Q+++VKM +EYMSFSAHADAKGIMQL+
Sbjct: 343 NEKNMVIMPGYCVQGTVGHKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLV 402
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
EP++VLLVHGEA KM FLK+KI+QEF ++C+MPANGE+ + T I + +S+ LLK
Sbjct: 403 GQAEPESVLLVHGEAKKMEFLKQKIEQEFRVNCYMPANGETVTLPTSPSIPVGISLGLLK 462
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
E + P+ K R +HG L++KDS+ L+ ++A KE+G++ H +RFT V + D
Sbjct: 463 REMAQ--GLLPDAKKPRLLHGTLIMKDSNFRLVSSEQALKELGLAEHQLRFTCRVHLHDP 520
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLL--SVERIDDKNKRVFVTWP 418
+T +Y L S L+D ++ L + S++++S+L+ + D K + V+W
Sbjct: 521 RKEQETAMRVYSHLKSMLKDHCVQH---LPDGSVTVESILIQAAAHSEDPGTKVLLVSWT 577
Query: 419 NQDEEVGKIVLHVLK 433
QDEE+G + +LK
Sbjct: 578 YQDEELGSYLTSLLK 592
>gi|66472504|ref|NP_001018457.1| integrator complex subunit 11 [Danio rerio]
gi|82192739|sp|Q503E1.1|INT11_DANRE RecName: Full=Integrator complex subunit 11; Short=Int11; AltName:
Full=Cleavage and polyadenylation-specific factor 3-like
protein; Short=CPSF3-like protein
gi|63102425|gb|AAH95364.1| Zgc:110671 [Danio rerio]
Length = 598
Score = 585 bits (1507), Expect = e-164, Method: Compositional matrix adjust.
Identities = 275/436 (63%), Positives = 349/436 (80%), Gaps = 8/436 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
M Q+KVG++S+VYTGDYNMTPDRHLGAAWIDKCRPD+LI+ESTYATTIRDSKRCRERDFL
Sbjct: 163 MVQIKVGSESVVYTGDYNMTPDRHLGAAWIDKCRPDILISESTYATTIRDSKRCRERDFL 222
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHE V+RGGKVLIPVFALGRAQELCILLET+WERMNL+APIYF+ GLTEKA +YYK+F
Sbjct: 223 KKVHETVERGGKVLIPVFALGRAQELCILLETFWERMNLKAPIYFSTGLTEKANHYYKLF 282
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
ITWTNQKIRKTFVQRNMF+FKHI+ FD+S+ DNPGPMVVFATPGMLH+G SL IFKKWA
Sbjct: 283 ITWTNQKIRKTFVQRNMFEFKHIKAFDRSYADNPGPMVVFATPGMLHAGQSLQIFKKWAG 342
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPG+CVQGT+GHK+L+G KKLE E + +DVK+ +EYMSFSAHADAKGIMQLI
Sbjct: 343 NEKNMVIMPGYCVQGTVGHKILNGQKKLEMEGRATLDVKLQVEYMSFSAHADAKGIMQLI 402
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
+ EP+N+LLVHGEA KM FLK+KI+QEF++ CFMPANGE+ + T+ + +D+S+NLLK
Sbjct: 403 RMAEPRNMLLVHGEAKKMEFLKDKIEQEFSISCFMPANGETTTIVTNPSVPVDISLNLLK 462
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
E + P+ + R +HG L++KD+S+ L+ ++A KE+G++ H +RFT VQ+ D
Sbjct: 463 RE-MALGGPLPDAKRPRTMHGTLIMKDNSLRLVSPEQALKELGLNEHQLRFTCRVQLHDP 521
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQ-LTESSLSIDSVLLSV-ERIDDKN-KRVFVTW 417
TL IY L S L + Y IQ + + ++ ++S+++ V ++ N K + ++W
Sbjct: 522 HSDTDTLSRIYTHLKSVL----KSYSIQHVPDGTVIVESIVIKVTSSAEEPNLKVILLSW 577
Query: 418 PNQDEEVGKIVLHVLK 433
QDEE+G + +LK
Sbjct: 578 SYQDEELGSFLSTLLK 593
>gi|395840793|ref|XP_003793236.1| PREDICTED: integrator complex subunit 11 isoform 2 [Otolemur
garnettii]
Length = 499
Score = 584 bits (1506), Expect = e-164, Method: Compositional matrix adjust.
Identities = 273/435 (62%), Positives = 342/435 (78%), Gaps = 7/435 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+KVG++S+VYTGDYNMTPDRHLGAAWIDKCRP+LLITESTYATTIRDSKRCRERDFL
Sbjct: 62 MFQIKVGSESVVYTGDYNMTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFL 121
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHE V+RGGKVLIPVFALGRAQELCILLET+WERMNL+ PIYF+ GLTEKA +YYK+F
Sbjct: 122 KKVHETVERGGKVLIPVFALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLF 181
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
ITWTNQKIRKTFVQRNMF+FKHI+ FD++F DNPGPMVVFATPGMLH+G SL IF+KWA
Sbjct: 182 ITWTNQKIRKTFVQRNMFEFKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAG 241
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPG+CVQGT+GHK+LSG +KLE E +Q+++VKM +EYMSFSAHADAKGIMQL+
Sbjct: 242 NEKNMVIMPGYCVQGTVGHKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLV 301
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
EP++VLLVHGEA KM FLK+KI+QEF + C+MPANGE+ + T I +S+ LLK
Sbjct: 302 GQAEPQSVLLVHGEAKKMEFLKQKIEQEFRVSCYMPANGETVALPTSPSIPGGISLGLLK 361
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
E + P K R +HG L++KDS+ L+ ++A KE+G++ H +RFT V + DS
Sbjct: 362 REMAQ--GLLPEAKKPRLLHGTLIMKDSNFRLISSEQALKELGLAEHQLRFTCRVHLHDS 419
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLL--SVERIDDKNKRVFVTWP 418
+T +Y L S L+D ++ L + S++++S+L+ + D K + V+W
Sbjct: 420 RKEQETAMRVYSHLKSILKDHCVQH---LPDGSVTVESILIQAAAPSEDPATKVLLVSWT 476
Query: 419 NQDEEVGKIVLHVLK 433
QDEE+G + +LK
Sbjct: 477 YQDEELGSYLTSLLK 491
>gi|427779771|gb|JAA55337.1| Putative mrna cleavage and polyadenylation factor ii complex brr5
cpsf subunit [Rhipicephalus pulchellus]
Length = 621
Score = 584 bits (1505), Expect = e-163, Method: Compositional matrix adjust.
Identities = 280/472 (59%), Positives = 356/472 (75%), Gaps = 51/472 (10%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF+++VG+QS+VYTGDYNMTPDRHLGAAW+DKCRPDLLITESTYATTIRDSKRCRERDFL
Sbjct: 163 MFRIRVGSQSVVYTGDYNMTPDRHLGAAWLDKCRPDLLITESTYATTIRDSKRCRERDFL 222
Query: 61 KKVHECVDRGGKVLIPVF----------------------------------ALGRAQEL 86
KVH+C+D+GGKVLIPVF ALGRAQEL
Sbjct: 223 TKVHDCIDKGGKVLIPVFXTTIRDSKRCRERDFLTKVHDCIDKGGKVLIPVFALGRAQEL 282
Query: 87 CILLETYWERMNLQAPIYFAVGLTEKATNYYKMFITWTNQKIRKTFVQRNMFDFKHIRPF 146
CILLETYW+RMNL+ PIYFAVGLTEKATNYYKMFITWTNQKIRKTFVQRNMFDFKHI+PF
Sbjct: 283 CILLETYWDRMNLRVPIYFAVGLTEKATNYYKMFITWTNQKIRKTFVQRNMFDFKHIKPF 342
Query: 147 DKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVK 206
D++FIDNPGPMVVFATPGMLH+GLSL IFKKWAP E NM+IMPG+CV GT+GHK+LSG +
Sbjct: 343 DRAFIDNPGPMVVFATPGMLHAGLSLQIFKKWAPFEANMVIMPGYCVAGTVGHKILSGAR 402
Query: 207 KLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIK 266
K+E +N+Q+++VKM+++YMSFSAHADAKGIMQLIQ CEP NVLLVHGEA KM FL++KI+
Sbjct: 403 KVELDNRQVVEVKMSVQYMSFSAHADAKGIMQLIQQCEPANVLLVHGEAGKMDFLRKKIR 462
Query: 267 QEFNLDCFMPANGESCFVQTDMKISIDVSVNLLKEEAVKYNSEPPNPLKERQIHGVLVIK 326
QEF +DC MPANGE+ ++T + IDV+V LLK A+ +S+ P + IHG LV+
Sbjct: 463 QEFAIDCSMPANGETVVIETPDTVPIDVAVPLLK-HALGSDSKRP-----KLIHGALVMH 516
Query: 327 DSSISLMDVDEACKEVGISRHIVRFTSNVQISDSSPINKTLELIYDQLLSYLQDKSQEYK 386
D + L+D +A +E+G++ H +RFT+ V +SP + D L + L+D+ +
Sbjct: 517 DRGVRLLDSRQAMQELGLTSHQIRFTTTV----ASP----RAALADALYTRLRDRLPGVQ 568
Query: 387 IQLTESSLSIDSVLLSV--ERIDDKNKRVFVTWPNQDEEVGKIVLHVLKSMS 436
+ L + S+++ S L+ V + D ++ + V+W QDE++G +L +++ M+
Sbjct: 569 L-LPDGSIAVASALVKVGGQEDDAASRAICVSWTLQDEDLGSHLLSLVQEMA 619
>gi|34783058|gb|AAH00675.2| CPSF3L protein, partial [Homo sapiens]
Length = 473
Score = 583 bits (1504), Expect = e-163, Method: Compositional matrix adjust.
Identities = 272/435 (62%), Positives = 342/435 (78%), Gaps = 7/435 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+KVG++S+VYTGDYNMTPDRHLGAAWIDKCRP+LLITESTYATTIRDSKRCRERDFL
Sbjct: 36 MFQIKVGSESVVYTGDYNMTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFL 95
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHE V+RGGKVLIPVFALGRAQELCILLET+WERMNL+ PIYF+ GLTEKA +YYK+F
Sbjct: 96 KKVHETVERGGKVLIPVFALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLF 155
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
I WTNQKIRKTFVQRNMF+FKHI+ FD++F DNPGPMVVFATPGMLH+G SL IF+KWA
Sbjct: 156 IPWTNQKIRKTFVQRNMFEFKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAG 215
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPG+CVQGT+GHK+LSG +KLE E +Q+++VKM +EYMSFSAHADAKGIMQL+
Sbjct: 216 NEKNMVIMPGYCVQGTVGHKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLV 275
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
EP++VLLVHGEA KM FLK+KI+QE ++C+MPANGE+ + T I + +S+ LLK
Sbjct: 276 GQAEPESVLLVHGEAKKMEFLKQKIEQELRVNCYMPANGETVTLPTSPSIPVGISLGLLK 335
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
E + P K R +HG L++KDS+ L+ ++A KE+G++ H +RFT V + D+
Sbjct: 336 REMAQ--GLLPEAKKPRLLHGTLIMKDSNFRLVSSEQALKELGLAEHQLRFTCRVHLHDT 393
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLL--SVERIDDKNKRVFVTWP 418
+T +Y L S L+D ++ L + S++++SVLL + D K + V+W
Sbjct: 394 RKEQETALRVYSHLKSVLKDHCVQH---LPDGSVTVESVLLQAAAPSEDPGTKVLLVSWT 450
Query: 419 NQDEEVGKIVLHVLK 433
QDEE+G + +LK
Sbjct: 451 YQDEELGSFLTSLLK 465
>gi|119576648|gb|EAW56244.1| cleavage and polyadenylation specific factor 3-like, isoform CRA_j
[Homo sapiens]
Length = 476
Score = 583 bits (1504), Expect = e-163, Method: Compositional matrix adjust.
Identities = 272/435 (62%), Positives = 342/435 (78%), Gaps = 7/435 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+KVG++S+VYTGDYNMTPDRHLGAAWIDKCRP+LLITESTYATTIRDSKRCRERDFL
Sbjct: 39 MFQIKVGSESVVYTGDYNMTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFL 98
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHE V+RGGKVLIPVFALGRAQELCILLET+WERMNL+ PIYF+ GLTEKA +YYK+F
Sbjct: 99 KKVHETVERGGKVLIPVFALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLF 158
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
I WTNQKIRKTFVQRNMF+FKHI+ FD++F DNPGPMVVFATPGMLH+G SL IF+KWA
Sbjct: 159 IPWTNQKIRKTFVQRNMFEFKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAG 218
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPG+CVQGT+GHK+LSG +KLE E +Q+++VKM +EYMSFSAHADAKGIMQL+
Sbjct: 219 NEKNMVIMPGYCVQGTVGHKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLV 278
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
EP++VLLVHGEA KM FLK+KI+QE ++C+MPANGE+ + T I + +S+ LLK
Sbjct: 279 GQAEPESVLLVHGEAKKMEFLKQKIEQELRVNCYMPANGETVTLPTSPSIPVGISLGLLK 338
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
E + P K R +HG L++KDS+ L+ ++A KE+G++ H +RFT V + D+
Sbjct: 339 REMAQ--GLLPEAKKPRLLHGTLIMKDSNFRLVSSEQALKELGLAEHQLRFTCRVHLHDT 396
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLL--SVERIDDKNKRVFVTWP 418
+T +Y L S L+D ++ L + S++++SVLL + D K + V+W
Sbjct: 397 RKEQETALRVYSHLKSVLKDHCVQH---LPDGSVTVESVLLQAAAPSEDPGTKVLLVSWT 453
Query: 419 NQDEEVGKIVLHVLK 433
QDEE+G + +LK
Sbjct: 454 YQDEELGSFLTSLLK 468
>gi|374253826|ref|NP_001243391.1| integrator complex subunit 11 isoform 4 [Homo sapiens]
Length = 502
Score = 583 bits (1503), Expect = e-163, Method: Compositional matrix adjust.
Identities = 272/435 (62%), Positives = 342/435 (78%), Gaps = 7/435 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+KVG++S+VYTGDYNMTPDRHLGAAWIDKCRP+LLITESTYATTIRDSKRCRERDFL
Sbjct: 65 MFQIKVGSESVVYTGDYNMTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFL 124
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHE V+RGGKVLIPVFALGRAQELCILLET+WERMNL+ PIYF+ GLTEKA +YYK+F
Sbjct: 125 KKVHETVERGGKVLIPVFALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLF 184
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
I WTNQKIRKTFVQRNMF+FKHI+ FD++F DNPGPMVVFATPGMLH+G SL IF+KWA
Sbjct: 185 IPWTNQKIRKTFVQRNMFEFKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAG 244
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPG+CVQGT+GHK+LSG +KLE E +Q+++VKM +EYMSFSAHADAKGIMQL+
Sbjct: 245 NEKNMVIMPGYCVQGTVGHKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLV 304
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
EP++VLLVHGEA KM FLK+KI+QE ++C+MPANGE+ + T I + +S+ LLK
Sbjct: 305 GQAEPESVLLVHGEAKKMEFLKQKIEQELRVNCYMPANGETVTLPTSPSIPVGISLGLLK 364
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
E + P K R +HG L++KDS+ L+ ++A KE+G++ H +RFT V + D+
Sbjct: 365 REMAQ--GLLPEAKKPRLLHGTLIMKDSNFRLVSSEQALKELGLAEHQLRFTCRVHLHDT 422
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLL--SVERIDDKNKRVFVTWP 418
+T +Y L S L+D ++ L + S++++SVLL + D K + V+W
Sbjct: 423 RKEQETALRVYSHLKSVLKDHCVQH---LPDGSVTVESVLLQAAAPSEDPGTKVLLVSWT 479
Query: 419 NQDEEVGKIVLHVLK 433
QDEE+G + +LK
Sbjct: 480 YQDEELGSFLTSLLK 494
>gi|119576647|gb|EAW56243.1| cleavage and polyadenylation specific factor 3-like, isoform CRA_i
[Homo sapiens]
Length = 502
Score = 583 bits (1503), Expect = e-163, Method: Compositional matrix adjust.
Identities = 272/435 (62%), Positives = 342/435 (78%), Gaps = 7/435 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+KVG++S+VYTGDYNMTPDRHLGAAWIDKCRP+LLITESTYATTIRDSKRCRERDFL
Sbjct: 65 MFQIKVGSESVVYTGDYNMTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFL 124
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHE V+RGGKVLIPVFALGRAQELCILLET+WERMNL+ PIYF+ GLTEKA +YYK+F
Sbjct: 125 KKVHETVERGGKVLIPVFALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLF 184
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
I WTNQKIRKTFVQRNMF+FKHI+ FD++F DNPGPMVVFATPGMLH+G SL IF+KWA
Sbjct: 185 IPWTNQKIRKTFVQRNMFEFKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAG 244
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPG+CVQGT+GHK+LSG +KLE E +Q+++VKM +EYMSFSAHADAKGIMQL+
Sbjct: 245 NEKNMVIMPGYCVQGTVGHKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLV 304
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
EP++VLLVHGEA KM FLK+KI+QE ++C+MPANGE+ + T I + +S+ LLK
Sbjct: 305 GQAEPESVLLVHGEAKKMEFLKQKIEQELRVNCYMPANGETVTLPTSPSIPVGISLGLLK 364
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
E + P K R +HG L++KDS+ L+ ++A KE+G++ H +RFT V + D+
Sbjct: 365 REMAQ--GLLPEAKKPRLLHGTLIMKDSNFRLVSSEQALKELGLAEHQLRFTCRVHLHDT 422
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLL--SVERIDDKNKRVFVTWP 418
+T +Y L S L+D ++ L + S++++SVLL + D K + V+W
Sbjct: 423 RKEQETALRVYSHLKSVLKDHCVQH---LPDGSVTVESVLLQAAAPSEDPGTKVLLVSWT 479
Query: 419 NQDEEVGKIVLHVLK 433
QDEE+G + +LK
Sbjct: 480 YQDEELGSFLTSLLK 494
>gi|346472285|gb|AEO35987.1| hypothetical protein [Amblyomma maculatum]
Length = 510
Score = 583 bits (1503), Expect = e-163, Method: Compositional matrix adjust.
Identities = 264/374 (70%), Positives = 316/374 (84%), Gaps = 10/374 (2%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF ++VG+QS+VYTGDYNMTPDRHLGAAW+DKCRPDLLITESTYATTIRDSKRCRERDFL
Sbjct: 134 MFWIRVGSQSVVYTGDYNMTPDRHLGAAWVDKCRPDLLITESTYATTIRDSKRCRERDFL 193
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KVH+C+D+GGKVLIPVFALGRAQELCILLETYW+RMNL+ PIYFAVGLTEKATNYYKMF
Sbjct: 194 TKVHDCIDKGGKVLIPVFALGRAQELCILLETYWDRMNLRVPIYFAVGLTEKATNYYKMF 253
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
ITWTNQKIRKTFVQRNMFDFKHI+PFD++FIDNPGPMVVFATPGMLH+GLSL IFKKWAP
Sbjct: 254 ITWTNQKIRKTFVQRNMFDFKHIKPFDRAFIDNPGPMVVFATPGMLHAGLSLQIFKKWAP 313
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPG+CV GT+GHKVLSG +K+E EN+Q+++VKM+++YMSFSAHADAKGIMQLI
Sbjct: 314 FEANMVIMPGYCVAGTVGHKVLSGARKVELENRQVVEVKMSVQYMSFSAHADAKGIMQLI 373
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
Q CEP NVLLVHGEA KM FL++KI QEF++DC+MPANGE+ ++T + +DV+V LLK
Sbjct: 374 QQCEPANVLLVHGEAGKMDFLRKKILQEFSVDCYMPANGETVVIETPDTVPVDVAVPLLK 433
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
N P + IHG LV+ D + L+D +A +E+G++ H +RFT+ V +
Sbjct: 434 HALGSDNKRP------KLIHGALVMHDRGVKLLDSRQAMQELGLTGHQIRFTTTV----A 483
Query: 361 SPINKTLELIYDQL 374
SP + +Y +L
Sbjct: 484 SPRAALADALYTRL 497
>gi|74198351|dbj|BAE39661.1| unnamed protein product [Mus musculus]
Length = 600
Score = 583 bits (1503), Expect = e-163, Method: Compositional matrix adjust.
Identities = 272/436 (62%), Positives = 345/436 (79%), Gaps = 7/436 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+KVG++S+VYTGDYNMTPDRHLGAAWIDKCRP+LLITESTYATTIRDSKRCRERDFL
Sbjct: 163 MFQIKVGSESVVYTGDYNMTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFL 222
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHE V+RGGKVLIPVFALGRAQELCILLET+WERMNL+ PIYF+ GLTEKA +YYK+F
Sbjct: 223 KKVHETVERGGKVLIPVFALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLF 282
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
ITWTNQKIRKTFVQRNMF+FKHI+ FD++F DNPGPMVVFATPGMLH+G SL IF+KWA
Sbjct: 283 ITWTNQKIRKTFVQRNMFEFKHIKAFDRTFADNPGPMVVFATPGMLHAGQSLQIFRKWAG 342
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPG+CVQGT+GHK+LSG +KLE E +Q+++VKM +EYMSFSAHADAKGIMQL+
Sbjct: 343 NEKNMVIMPGYCVQGTVGHKILSGQRKLEMEGRQMLEVKMQVEYMSFSAHADAKGIMQLV 402
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
EP++VLLVHGEA KM FL++KI+QEF + C+MPANGE+ + T I + +S+ LLK
Sbjct: 403 GQAEPESVLLVHGEAKKMEFLRQKIEQEFRVSCYMPANGETVTLPTSPSIPVGISLGLLK 462
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
E V+ P K R +HG L++KDS+ L+ ++A KE+G++ H +RFT V + D+
Sbjct: 463 REMVQ--GLLPEAKKPRLLHGTLIMKDSNFRLVSSEQALKELGLAEHQLRFTCRVHLQDT 520
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLL--SVERIDDKNKRVFVTWP 418
+T +Y L S L+D ++ L + S++++S+L+ + D K + V+W
Sbjct: 521 RKEQETALRVYSHLKSTLKDHRVQH---LPDGSVTVESILIQAAAHSEDPGTKVLLVSWT 577
Query: 419 NQDEEVGKIVLHVLKS 434
QDEE+G + +LK+
Sbjct: 578 YQDEELGSFLTTLLKN 593
>gi|21312614|ref|NP_082296.1| integrator complex subunit 11 [Mus musculus]
gi|81904239|sp|Q9CWS4.1|INT11_MOUSE RecName: Full=Integrator complex subunit 11; Short=Int11; AltName:
Full=Cleavage and polyadenylation-specific factor 3-like
protein; Short=CPSF3-like protein
gi|12845859|dbj|BAB26928.1| unnamed protein product [Mus musculus]
gi|26355309|dbj|BAC41135.1| unnamed protein product [Mus musculus]
gi|74192536|dbj|BAE43054.1| unnamed protein product [Mus musculus]
gi|74219576|dbj|BAE29558.1| unnamed protein product [Mus musculus]
gi|148683102|gb|EDL15049.1| cleavage and polyadenylation specific factor 3-like, isoform CRA_b
[Mus musculus]
Length = 600
Score = 583 bits (1502), Expect = e-163, Method: Compositional matrix adjust.
Identities = 272/436 (62%), Positives = 345/436 (79%), Gaps = 7/436 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+KVG++S+VYTGDYNMTPDRHLGAAWIDKCRP+LLITESTYATTIRDSKRCRERDFL
Sbjct: 163 MFQIKVGSESVVYTGDYNMTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFL 222
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHE V+RGGKVLIPVFALGRAQELCILLET+WERMNL+ PIYF+ GLTEKA +YYK+F
Sbjct: 223 KKVHETVERGGKVLIPVFALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLF 282
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
ITWTNQKIRKTFVQRNMF+FKHI+ FD++F DNPGPMVVFATPGMLH+G SL IF+KWA
Sbjct: 283 ITWTNQKIRKTFVQRNMFEFKHIKAFDRTFADNPGPMVVFATPGMLHAGQSLQIFRKWAG 342
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPG+CVQGT+GHK+LSG +KLE E +Q+++VKM +EYMSFSAHADAKGIMQL+
Sbjct: 343 NEKNMVIMPGYCVQGTVGHKILSGQRKLEMEGRQMLEVKMQVEYMSFSAHADAKGIMQLV 402
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
EP++VLLVHGEA KM FL++KI+QEF + C+MPANGE+ + T I + +S+ LLK
Sbjct: 403 GQAEPESVLLVHGEAKKMEFLRQKIEQEFRVSCYMPANGETVTLPTSPSIPVGISLGLLK 462
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
E V+ P K R +HG L++KDS+ L+ ++A KE+G++ H +RFT V + D+
Sbjct: 463 REMVQ--GLLPEAKKPRLLHGTLIMKDSNFRLVSSEQALKELGLAEHQLRFTCRVHLQDT 520
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLL--SVERIDDKNKRVFVTWP 418
+T +Y L S L+D ++ L + S++++S+L+ + D K + V+W
Sbjct: 521 RKEQETALRVYSHLKSTLKDHCVQH---LPDGSVTVESILIQAAAHSEDPGTKVLLVSWT 577
Query: 419 NQDEEVGKIVLHVLKS 434
QDEE+G + +LK+
Sbjct: 578 YQDEELGSFLTTLLKN 593
>gi|296206479|ref|XP_002750226.1| PREDICTED: integrator complex subunit 11 isoform 2 [Callithrix
jacchus]
Length = 499
Score = 583 bits (1502), Expect = e-163, Method: Compositional matrix adjust.
Identities = 271/435 (62%), Positives = 341/435 (78%), Gaps = 7/435 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+KVG++S+VYTGDYNMTPDRHLGAAWIDKCRP+LLITESTYATTIRDSKRCRERDFL
Sbjct: 62 MFQIKVGSESVVYTGDYNMTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFL 121
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHE V+RGGKVLIPVFALGRAQELCILLET+WERMNL+ PIYF+ GLTEKA +YYK+F
Sbjct: 122 KKVHETVERGGKVLIPVFALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLF 181
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
I WTNQKIRKTFVQRNMF+FKHI+ FD++F DNPGPMVVFATPGMLH+G SL IF+KWA
Sbjct: 182 IPWTNQKIRKTFVQRNMFEFKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAG 241
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPG+CVQGT+GHK+LSG +KLE E +Q+++VKM +EYMSFSAHADAKGIMQL+
Sbjct: 242 NEKNMVIMPGYCVQGTVGHKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLV 301
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
EP++VLLVHGEA KM FLK+KI+QE + C+MPANGE+ + T + I + +S+ LLK
Sbjct: 302 GQAEPESVLLVHGEAKKMEFLKQKIEQELRVSCYMPANGETVTLPTSLSIPVGISLGLLK 361
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
E + P K R +HG L++KDS+ L+ ++A KE+G++ H +RFT V + D+
Sbjct: 362 REMAQ--GLLPEAKKPRLLHGTLIMKDSNFRLVSSEQALKELGLAEHQLRFTCRVHLHDT 419
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLL--SVERIDDKNKRVFVTWP 418
+T +Y L S L+D Y L + S++++S+L+ + D K + V+W
Sbjct: 420 RKEQETALRVYSHLKSILKD---HYVQHLPDGSVTVESILIQATAPSEDPGTKVLLVSWT 476
Query: 419 NQDEEVGKIVLHVLK 433
QDEE+G + +LK
Sbjct: 477 YQDEELGSFLTSLLK 491
>gi|15029864|gb|AAH11155.1| Cleavage and polyadenylation specific factor 3-like [Mus musculus]
Length = 600
Score = 583 bits (1502), Expect = e-163, Method: Compositional matrix adjust.
Identities = 272/436 (62%), Positives = 345/436 (79%), Gaps = 7/436 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+KVG++S+VYTGDYNMTPDRHLGAAWIDKCRP+LLITESTYATTIRDSKRCRERDFL
Sbjct: 163 MFQIKVGSESVVYTGDYNMTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFL 222
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHE V+RGGKVLIPVFALGRAQELCILLET+WERMNL+ PIYF+ GLTEKA +YYK+F
Sbjct: 223 KKVHETVERGGKVLIPVFALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLF 282
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
ITWTNQKIRKTFVQRNMF+FKHI+ FD++F DNPGPMVVFATPGMLH+G SL IF+KWA
Sbjct: 283 ITWTNQKIRKTFVQRNMFEFKHIKAFDRTFADNPGPMVVFATPGMLHAGQSLQIFRKWAG 342
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPG+CVQGT+GHK+LSG +KLE E +Q+++VKM +EYMSFSAHADAKGIMQL+
Sbjct: 343 NEKNMVIMPGYCVQGTVGHKILSGQRKLEMEGRQMLEVKMQVEYMSFSAHADAKGIMQLV 402
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
EP++VLLVHGEA KM FL++KI+QEF + C+MPANGE+ + T I + +S+ LLK
Sbjct: 403 GQAEPESVLLVHGEAKKMEFLRQKIEQEFRVSCYMPANGETVTLPTSPSIPVGISLGLLK 462
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
E V+ P K R +HG L++KDS+ L+ ++A KE+G++ H +RFT V + D+
Sbjct: 463 REMVQ--GLLPEAKKPRLLHGTLIMKDSNFRLVSSEQALKELGLAEHQLRFTCRVHLQDT 520
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLL--SVERIDDKNKRVFVTWP 418
+T +Y L S L+D ++ L + S++++S+L+ + D K + V+W
Sbjct: 521 RKEQETALRVYSHLKSTLKDHCVQH---LPDGSVTVESILIQAAAHSEDPGTKVLLVSWT 577
Query: 419 NQDEEVGKIVLHVLKS 434
QDEE+G + +LK+
Sbjct: 578 YQDEELGSFLTTLLKN 593
>gi|359319514|ref|XP_003639102.1| PREDICTED: integrator complex subunit 11-like [Canis lupus
familiaris]
Length = 600
Score = 583 bits (1502), Expect = e-163, Method: Compositional matrix adjust.
Identities = 272/435 (62%), Positives = 344/435 (79%), Gaps = 7/435 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+KVG++S+VYTGDYNMTPDRHLGAAWIDKCRP+LLITESTYATTIRDSKRCRERDFL
Sbjct: 163 MFQIKVGSESVVYTGDYNMTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFL 222
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHE V+RGGKVLIPVFALGRAQELCILLET+WERMNL+APIYF+ GLTEKA +YYK+F
Sbjct: 223 KKVHEAVERGGKVLIPVFALGRAQELCILLETFWERMNLKAPIYFSTGLTEKANHYYKLF 282
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
ITWTNQKIRKTFVQRNMF+FKHI+ FD++F DNPGPMVVFATPGMLH+G SL IF+KWA
Sbjct: 283 ITWTNQKIRKTFVQRNMFEFKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAG 342
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPG+CVQGT+GHK+LSG +KLE E +Q+++VKM +EYMSFSAHADAKGIMQL+
Sbjct: 343 NEKNMVIMPGYCVQGTVGHKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLV 402
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
EP++VLLVHGEA KM FLK+KI+QEF ++C+MPANGE+ + T I + +S+ LLK
Sbjct: 403 GQAEPESVLLVHGEAKKMEFLKQKIEQEFRVNCYMPANGETVTLPTSPSIPVGISLGLLK 462
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
E + P+ K R +HG L++KDS+ L+ ++A KE+G++ H +RFT V + D
Sbjct: 463 REMAQ--GLLPDVKKPRLLHGTLIMKDSNFRLVSSEQALKELGLAEHQLRFTCRVHLHDP 520
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLL--SVERIDDKNKRVFVTWP 418
+ +Y L S L+D ++ L + S++++S+L+ + D K + V+W
Sbjct: 521 RKEQEMAVRVYSHLKSVLKDHCVQH---LPDGSVTVESILIQAAAHSEDPGTKVLLVSWT 577
Query: 419 NQDEEVGKIVLHVLK 433
QDEE+G + +LK
Sbjct: 578 YQDEELGSYLTSLLK 592
>gi|395840791|ref|XP_003793235.1| PREDICTED: integrator complex subunit 11 isoform 1 [Otolemur
garnettii]
Length = 600
Score = 582 bits (1501), Expect = e-163, Method: Compositional matrix adjust.
Identities = 273/435 (62%), Positives = 342/435 (78%), Gaps = 7/435 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+KVG++S+VYTGDYNMTPDRHLGAAWIDKCRP+LLITESTYATTIRDSKRCRERDFL
Sbjct: 163 MFQIKVGSESVVYTGDYNMTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFL 222
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHE V+RGGKVLIPVFALGRAQELCILLET+WERMNL+ PIYF+ GLTEKA +YYK+F
Sbjct: 223 KKVHETVERGGKVLIPVFALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLF 282
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
ITWTNQKIRKTFVQRNMF+FKHI+ FD++F DNPGPMVVFATPGMLH+G SL IF+KWA
Sbjct: 283 ITWTNQKIRKTFVQRNMFEFKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAG 342
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPG+CVQGT+GHK+LSG +KLE E +Q+++VKM +EYMSFSAHADAKGIMQL+
Sbjct: 343 NEKNMVIMPGYCVQGTVGHKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLV 402
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
EP++VLLVHGEA KM FLK+KI+QEF + C+MPANGE+ + T I +S+ LLK
Sbjct: 403 GQAEPQSVLLVHGEAKKMEFLKQKIEQEFRVSCYMPANGETVALPTSPSIPGGISLGLLK 462
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
E + P K R +HG L++KDS+ L+ ++A KE+G++ H +RFT V + DS
Sbjct: 463 REMAQ--GLLPEAKKPRLLHGTLIMKDSNFRLISSEQALKELGLAEHQLRFTCRVHLHDS 520
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLL--SVERIDDKNKRVFVTWP 418
+T +Y L S L+D ++ L + S++++S+L+ + D K + V+W
Sbjct: 521 RKEQETAMRVYSHLKSILKDHCVQH---LPDGSVTVESILIQAAAPSEDPATKVLLVSWT 577
Query: 419 NQDEEVGKIVLHVLK 433
QDEE+G + +LK
Sbjct: 578 YQDEELGSYLTSLLK 592
>gi|344283025|ref|XP_003413273.1| PREDICTED: integrator complex subunit 11-like [Loxodonta africana]
Length = 719
Score = 582 bits (1501), Expect = e-163, Method: Compositional matrix adjust.
Identities = 271/435 (62%), Positives = 346/435 (79%), Gaps = 7/435 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+KVG++S+VYTGDYNMTPDRHLGAAWID+CRP+LLITESTYATTIRDSKRCRERDFL
Sbjct: 282 MFQIKVGSESVVYTGDYNMTPDRHLGAAWIDRCRPNLLITESTYATTIRDSKRCRERDFL 341
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHE V+RGGKVLIPVFALGRAQELCILLET+WERMNL+APIYF+ GLTEKA +YYK+F
Sbjct: 342 KKVHETVERGGKVLIPVFALGRAQELCILLETFWERMNLKAPIYFSTGLTEKANHYYKLF 401
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
ITWTNQKIRKTFVQRNMF+FKHI+ FD++F DNPGPMVVFATPGMLH+G SL IF+KWA
Sbjct: 402 ITWTNQKIRKTFVQRNMFEFKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAG 461
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPG+CVQGT+GHK+LSG +KLE E +Q+++VKM +EYMSFSAHADAKGIMQLI
Sbjct: 462 NEKNMVIMPGYCVQGTVGHKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLI 521
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
EP++VLLVHGEA KM FLK+KI+QEF ++C+MPANGE+ + T I + +S+ LLK
Sbjct: 522 GQAEPEHVLLVHGEAKKMEFLKQKIEQEFRVNCYMPANGETVTLPTSPSIPVGISLGLLK 581
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
E + P+ + R + G L++KDS+ L+ ++A KE+G++ H +RFT V + D+
Sbjct: 582 REMAQ--GMLPDAKRPRLLQGTLIMKDSNFRLVSSEQALKELGLAEHQLRFTCRVHLHDA 639
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLL--SVERIDDKNKRVFVTWP 418
+T+ +Y L S L+D ++ L + S++++S+L+ + D K + V+W
Sbjct: 640 RKEQETVMRVYSHLKSILKDHCVQH---LPDGSVTVESILIQAAAHSEDPGTKVLLVSWT 696
Query: 419 NQDEEVGKIVLHVLK 433
QDEE+G + +LK
Sbjct: 697 YQDEELGSYLTSLLK 711
>gi|374253828|ref|NP_001243392.1| integrator complex subunit 11 isoform 5 [Homo sapiens]
gi|119576639|gb|EAW56235.1| cleavage and polyadenylation specific factor 3-like, isoform CRA_c
[Homo sapiens]
gi|119576644|gb|EAW56240.1| cleavage and polyadenylation specific factor 3-like, isoform CRA_c
[Homo sapiens]
Length = 499
Score = 582 bits (1501), Expect = e-163, Method: Compositional matrix adjust.
Identities = 272/435 (62%), Positives = 342/435 (78%), Gaps = 7/435 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+KVG++S+VYTGDYNMTPDRHLGAAWIDKCRP+LLITESTYATTIRDSKRCRERDFL
Sbjct: 62 MFQIKVGSESVVYTGDYNMTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFL 121
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHE V+RGGKVLIPVFALGRAQELCILLET+WERMNL+ PIYF+ GLTEKA +YYK+F
Sbjct: 122 KKVHETVERGGKVLIPVFALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLF 181
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
I WTNQKIRKTFVQRNMF+FKHI+ FD++F DNPGPMVVFATPGMLH+G SL IF+KWA
Sbjct: 182 IPWTNQKIRKTFVQRNMFEFKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAG 241
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPG+CVQGT+GHK+LSG +KLE E +Q+++VKM +EYMSFSAHADAKGIMQL+
Sbjct: 242 NEKNMVIMPGYCVQGTVGHKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLV 301
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
EP++VLLVHGEA KM FLK+KI+QE ++C+MPANGE+ + T I + +S+ LLK
Sbjct: 302 GQAEPESVLLVHGEAKKMEFLKQKIEQELRVNCYMPANGETVTLPTSPSIPVGISLGLLK 361
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
E + P K R +HG L++KDS+ L+ ++A KE+G++ H +RFT V + D+
Sbjct: 362 REMAQ--GLLPEAKKPRLLHGTLIMKDSNFRLVSSEQALKELGLAEHQLRFTCRVHLHDT 419
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLL--SVERIDDKNKRVFVTWP 418
+T +Y L S L+D ++ L + S++++SVLL + D K + V+W
Sbjct: 420 RKEQETALRVYSHLKSVLKDHCVQH---LPDGSVTVESVLLQAAAPSEDPGTKVLLVSWT 476
Query: 419 NQDEEVGKIVLHVLK 433
QDEE+G + +LK
Sbjct: 477 YQDEELGSFLTSLLK 491
>gi|417403203|gb|JAA48419.1| Putative mrna cleavage and polyadenylation factor ii complex brr5
cpsf subunit [Desmodus rotundus]
Length = 603
Score = 582 bits (1501), Expect = e-163, Method: Compositional matrix adjust.
Identities = 271/435 (62%), Positives = 343/435 (78%), Gaps = 7/435 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+KVG++S+VYTGDYNMTPDRHLGAAWIDKCRP LLITESTYATTIRDSKRCRERDFL
Sbjct: 163 MFQIKVGSESVVYTGDYNMTPDRHLGAAWIDKCRPTLLITESTYATTIRDSKRCRERDFL 222
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHE V+RGGKVLIPVFALGRAQELCILLET+WERMNL+APIYF+ GLTEKA +YYK+F
Sbjct: 223 KKVHETVERGGKVLIPVFALGRAQELCILLETFWERMNLKAPIYFSTGLTEKANHYYKLF 282
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
ITWTNQKIRKTFVQRNMF+FKHI+ FD++F DNPGPMVVFATPGMLH+G SL IF+KWA
Sbjct: 283 ITWTNQKIRKTFVQRNMFEFKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAG 342
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPG+CVQGT+GHK+LSG +KLE E +Q+++V+M +EYMSFSAHADAKGIMQL+
Sbjct: 343 NEKNMVIMPGYCVQGTVGHKILSGQRKLEMEGRQVLEVRMQVEYMSFSAHADAKGIMQLV 402
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
EP++VLLVHGEA KM FLK+KI+QEF + C+MPANGE+ + T I + +S+ LLK
Sbjct: 403 GQAEPESVLLVHGEAKKMEFLKQKIEQEFRVSCYMPANGETVTLPTSPSIPVGISLGLLK 462
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
E + P+ K R +HG L++KDS+ L+ ++A KE+G++ H +RFT V + D+
Sbjct: 463 REMAQ--GLLPDAKKPRLLHGTLIMKDSNFRLVSSEQALKELGLAEHQLRFTCRVHLHDT 520
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLL--SVERIDDKNKRVFVTWP 418
+T +Y L L+D ++ L + S++++S+L+ + D K + V+W
Sbjct: 521 RKEQETAMRVYSHLKGLLKDHCVQH---LPDGSVTVESILIQAAAHSEDPGTKVLLVSWT 577
Query: 419 NQDEEVGKIVLHVLK 433
QDEE+G + +LK
Sbjct: 578 YQDEELGSYLTSLLK 592
>gi|334321967|ref|XP_001364674.2| PREDICTED: integrator complex subunit 11-like [Monodelphis
domestica]
Length = 600
Score = 582 bits (1500), Expect = e-163, Method: Compositional matrix adjust.
Identities = 274/435 (62%), Positives = 345/435 (79%), Gaps = 7/435 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+KVG++S VYTGDYNMTPDRHLGAAWIDKCRP+LLITESTYATTIRDSKRCRERDFL
Sbjct: 163 MFQIKVGSESAVYTGDYNMTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFL 222
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHE V+RGGKVLIPVFALGRAQELCILLET+WERMNL+APIYF+ GLTEKA +YYK+F
Sbjct: 223 KKVHETVERGGKVLIPVFALGRAQELCILLETFWERMNLKAPIYFSTGLTEKANHYYKLF 282
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
ITWTNQKIRKTFVQRNMF+FKHI+ FD++F DNPGPMVVFATPGMLH+G SL IF+KWA
Sbjct: 283 ITWTNQKIRKTFVQRNMFEFKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAG 342
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPG+CVQGT+GHK+LSG +KLE E +QI++VKM +EYMSFSAHADAKGIMQL+
Sbjct: 343 NEKNMVIMPGYCVQGTVGHKILSGQRKLEMEGRQILEVKMQVEYMSFSAHADAKGIMQLV 402
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
+ EP NVLLVHGEA KM FLK+KI+QEF+++C+MPANGE+ + T I + +S+ LLK
Sbjct: 403 RQAEPDNVLLVHGEAKKMEFLKQKIEQEFHVNCYMPANGETVTLLTHPNIPVGISLGLLK 462
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
E P+ K + +HG L++KD++ L+ ++A KE+G++ H +RFT V I D+
Sbjct: 463 REMAL--GLLPDAKKPKLMHGTLIMKDNNFRLVSSEQALKELGLAEHQLRFTCRVHIHDT 520
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLL--SVERIDDKNKRVFVTWP 418
+T+ +Y L S L+ S ++ L + S+ ++S+L+ + D K + V+W
Sbjct: 521 RKEQETVLRVYSHLKSVLKAHSVQH---LPDGSVMVESILIQATAHSEDPGTKVLLVSWT 577
Query: 419 NQDEEVGKIVLHVLK 433
QDEE+G + +LK
Sbjct: 578 YQDEELGSYLTSLLK 592
>gi|76559911|ref|NP_001029064.1| integrator complex subunit 11 [Rattus norvegicus]
gi|119371245|sp|Q3MHC2.1|INT11_RAT RecName: Full=Integrator complex subunit 11; Short=Int11; AltName:
Full=Cleavage and polyadenylation-specific factor 3-like
protein; Short=CPSF3-like protein
gi|75867808|gb|AAI05304.1| Cleavage and polyadenylation specific factor 3-like [Rattus
norvegicus]
Length = 600
Score = 582 bits (1500), Expect = e-163, Method: Compositional matrix adjust.
Identities = 271/436 (62%), Positives = 345/436 (79%), Gaps = 7/436 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+KVG++S+VYTGDYNMTPDRHLGAAWIDKCRP+LLITESTYATTIRDSKRCRERDFL
Sbjct: 163 MFQIKVGSESVVYTGDYNMTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFL 222
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHE V+RGGKVLIPVFALGRAQELCILLET+WERMNL+ PIYF+ GLTEKA +YYK+F
Sbjct: 223 KKVHETVERGGKVLIPVFALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLF 282
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
ITWTNQKIRKTFVQRNMF+FKHI+ FD++F DNPGPMVVFATPGMLH+G SL IF+KWA
Sbjct: 283 ITWTNQKIRKTFVQRNMFEFKHIKAFDRTFADNPGPMVVFATPGMLHAGQSLQIFRKWAG 342
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPG+CVQGT+GHK+LSG +KLE E +Q+++VKM +EYMSFSAHADAKGIMQL+
Sbjct: 343 NEKNMVIMPGYCVQGTVGHKILSGQRKLEMEGRQMLEVKMQVEYMSFSAHADAKGIMQLV 402
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
EP++VLLVHGEA KM FL++KI+QEF + C+MPANGE+ + T I + +S+ LLK
Sbjct: 403 GQAEPESVLLVHGEAKKMEFLRQKIEQEFRVSCYMPANGETVTLPTSPSIPVGISLGLLK 462
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
E V+ P K R +HG L++KD++ L+ ++A KE+G++ H +RFT V + D+
Sbjct: 463 REMVQ--GLLPEAKKPRLLHGTLIMKDNNFRLVSSEQALKELGLAEHQLRFTCRVHLQDT 520
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLL--SVERIDDKNKRVFVTWP 418
+T +Y L S L+D ++ L + S++++S+L+ + D K + V+W
Sbjct: 521 RKEQETALRVYSHLKSTLKDHCVQH---LPDGSVTVESILIQAAAHSEDPGTKVLLVSWT 577
Query: 419 NQDEEVGKIVLHVLKS 434
QDEE+G + +LK+
Sbjct: 578 YQDEELGSFLTALLKN 593
>gi|397476280|ref|XP_003809535.1| PREDICTED: integrator complex subunit 11 isoform 3 [Pan paniscus]
Length = 499
Score = 582 bits (1500), Expect = e-163, Method: Compositional matrix adjust.
Identities = 272/435 (62%), Positives = 341/435 (78%), Gaps = 7/435 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+KVG++S+VYTGDYNMTPDRHLGAAWIDKCRP+LLITESTYATTIRDSKRCRERDFL
Sbjct: 62 MFQIKVGSESVVYTGDYNMTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFL 121
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHE V+RGGKVLIPVFALGRAQELCILLET+WERMNL+ PIYF+ GLTEKA +YYK+F
Sbjct: 122 KKVHETVERGGKVLIPVFALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLF 181
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
I WTNQKIRKTFVQRNMF+FKHI+ FD++F DNPGPMVVFATPGMLH+G SL IF+KWA
Sbjct: 182 IPWTNQKIRKTFVQRNMFEFKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAG 241
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPG+CVQGT+GHK+LSG +KLE E +Q+++VKM +EYMSFSAHADAKGIMQL+
Sbjct: 242 NEKNMVIMPGYCVQGTVGHKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLV 301
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
EP++VLLVHGEA KM FLK+KI+QE + C+MPANGE+ + T I + +S+ LLK
Sbjct: 302 GQAEPESVLLVHGEAKKMEFLKQKIEQELRVSCYMPANGETVTLPTSPSIPVGISLGLLK 361
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
E + P K R +HG L++KDS+ L+ ++A KE+G++ H +RFT V + D+
Sbjct: 362 REMAQ--GLLPEAKKPRLLHGTLIMKDSNFRLVSSEQALKELGLAEHQLRFTCRVHLHDT 419
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLL--SVERIDDKNKRVFVTWP 418
+T +Y L S L+D ++ L + S++++SVLL + D K + V+W
Sbjct: 420 RKEQETALRVYSHLKSVLKDHCVQH---LPDGSVTVESVLLQAAAPSEDPGTKVLLVSWT 476
Query: 419 NQDEEVGKIVLHVLK 433
QDEE+G + +LK
Sbjct: 477 YQDEELGSFLTSLLK 491
>gi|149024842|gb|EDL81339.1| similar to RIKEN cDNA 2410006F12 [Rattus norvegicus]
Length = 601
Score = 582 bits (1499), Expect = e-163, Method: Compositional matrix adjust.
Identities = 271/436 (62%), Positives = 345/436 (79%), Gaps = 7/436 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+KVG++S+VYTGDYNMTPDRHLGAAWIDKCRP+LLITESTYATTIRDSKRCRERDFL
Sbjct: 164 MFQIKVGSESVVYTGDYNMTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFL 223
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHE V+RGGKVLIPVFALGRAQELCILLET+WERMNL+ PIYF+ GLTEKA +YYK+F
Sbjct: 224 KKVHETVERGGKVLIPVFALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLF 283
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
ITWTNQKIRKTFVQRNMF+FKHI+ FD++F DNPGPMVVFATPGMLH+G SL IF+KWA
Sbjct: 284 ITWTNQKIRKTFVQRNMFEFKHIKAFDRTFADNPGPMVVFATPGMLHAGQSLQIFRKWAG 343
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPG+CVQGT+GHK+LSG +KLE E +Q+++VKM +EYMSFSAHADAKGIMQL+
Sbjct: 344 NEKNMVIMPGYCVQGTVGHKILSGQRKLEMEGRQMLEVKMQVEYMSFSAHADAKGIMQLV 403
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
EP++VLLVHGEA KM FL++KI+QEF + C+MPANGE+ + T I + +S+ LLK
Sbjct: 404 GQAEPESVLLVHGEAKKMEFLRQKIEQEFRVSCYMPANGETVTLPTSPSIPVGISLGLLK 463
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
E V+ P K R +HG L++KD++ L+ ++A KE+G++ H +RFT V + D+
Sbjct: 464 REMVQ--GLLPEAKKPRLLHGTLIMKDNNFRLVSSEQALKELGLAEHQLRFTCRVHLQDT 521
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLL--SVERIDDKNKRVFVTWP 418
+T +Y L S L+D ++ L + S++++S+L+ + D K + V+W
Sbjct: 522 RKEQETALRVYSHLKSTLKDHCVQH---LPDGSVTVESILIQAAAHSEDPGTKVLLVSWT 578
Query: 419 NQDEEVGKIVLHVLKS 434
QDEE+G + +LK+
Sbjct: 579 YQDEELGSFLTALLKN 594
>gi|158256210|dbj|BAF84076.1| unnamed protein product [Homo sapiens]
Length = 606
Score = 582 bits (1499), Expect = e-163, Method: Compositional matrix adjust.
Identities = 273/435 (62%), Positives = 342/435 (78%), Gaps = 7/435 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+KVG++S+VYTGDYNMTPDRHLGAAWIDKCRP+LLITESTYATTIRDSKRCRERDFL
Sbjct: 169 MFQIKVGSESVVYTGDYNMTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFL 228
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHE V+RGGKVLIPVFALGRAQELCILLET+WERMNL+ PIYF+ GLTEKA +YYK+F
Sbjct: 229 KKVHETVERGGKVLIPVFALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLF 288
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
I WTNQKIRKTFVQRNMF+FKHI+ FD++F DNPGPMVVFATPGMLH+G SL IF+KWA
Sbjct: 289 IPWTNQKIRKTFVQRNMFEFKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAG 348
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPG+CVQGT+GHK+LSG +KLE E +Q+++VKM +EYMSFSAHADAKGIMQL+
Sbjct: 349 NEKNMVIMPGYCVQGTVGHKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLV 408
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
EP++VLLVHGEA KM FLK+KI+QE +DC+MPANGE+ + T I + +S+ LLK
Sbjct: 409 GQAEPESVLLVHGEAKKMEFLKQKIEQELRVDCYMPANGETVTLPTSPSIPVGISLGLLK 468
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
E + P K R +HG L++KDS+ L+ ++A KE+G++ H +RFT V + D+
Sbjct: 469 REMAQ--GLLPEAKKPRLLHGTLIMKDSNFRLVSSEQALKELGLAEHQLRFTCRVHLHDT 526
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLL--SVERIDDKNKRVFVTWP 418
+T +Y L S L+D ++ L + S++++SVLL + D K + V+W
Sbjct: 527 RKEQETALRVYSHLKSVLKDHCVQH---LPDGSVTVESVLLQAAAPSEDPGTKVLLVSWT 583
Query: 419 NQDEEVGKIVLHVLK 433
QDEE+G + +LK
Sbjct: 584 YQDEELGSFLTSLLK 598
>gi|426327398|ref|XP_004024505.1| PREDICTED: integrator complex subunit 11 isoform 5 [Gorilla gorilla
gorilla]
Length = 502
Score = 582 bits (1499), Expect = e-163, Method: Compositional matrix adjust.
Identities = 270/435 (62%), Positives = 342/435 (78%), Gaps = 7/435 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+KVG++S+VYTGDYNMTPDRHLGAAWIDKCRP+LLITESTYATTIRDSKRCRERDFL
Sbjct: 65 MFQIKVGSESVVYTGDYNMTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFL 124
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHE V+RGGKVLIPVFALGRAQELCILLET+WERMNL+ PIYF+ GLTEKA +YYK+F
Sbjct: 125 KKVHETVERGGKVLIPVFALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLF 184
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
I WTNQKIRKTFVQRNMF+FKHI+ FD++F DNPGPMVVFATPGMLH+G SL IF+KWA
Sbjct: 185 IPWTNQKIRKTFVQRNMFEFKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAG 244
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPG+CVQGT+GHK+LSG +KLE E +Q+++VKM +EYMSFSAHADAKGIMQL+
Sbjct: 245 NEKNMVIMPGYCVQGTVGHKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLV 304
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
EP++VLLVHGEA KM FLK+KI+QE + C+MPANGE+ + T I + +S+ LLK
Sbjct: 305 GQAEPESVLLVHGEAKKMEFLKQKIEQELRVSCYMPANGETVTLPTSPSIPVGISLGLLK 364
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
E + P+ K R +HG L++KDS+ L+ ++A KE+G++ H +RFT V + D+
Sbjct: 365 REMAQ--GLLPDAKKPRLLHGTLIMKDSNFRLVSSEQALKELGLAEHQLRFTCRVHLHDT 422
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLL--SVERIDDKNKRVFVTWP 418
+T +Y L S L+D ++ L + S++++S+L+ + D K + V+W
Sbjct: 423 RKEQETALRVYSHLKSILKDHCVQH---LPDGSVTVESILIQAAAPSEDPGTKVLLVSWT 479
Query: 419 NQDEEVGKIVLHVLK 433
QDEE+G + +LK
Sbjct: 480 YQDEELGSFLTSLLK 494
>gi|296206477|ref|XP_002750225.1| PREDICTED: integrator complex subunit 11 isoform 1 [Callithrix
jacchus]
Length = 600
Score = 581 bits (1497), Expect = e-163, Method: Compositional matrix adjust.
Identities = 271/435 (62%), Positives = 341/435 (78%), Gaps = 7/435 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+KVG++S+VYTGDYNMTPDRHLGAAWIDKCRP+LLITESTYATTIRDSKRCRERDFL
Sbjct: 163 MFQIKVGSESVVYTGDYNMTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFL 222
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHE V+RGGKVLIPVFALGRAQELCILLET+WERMNL+ PIYF+ GLTEKA +YYK+F
Sbjct: 223 KKVHETVERGGKVLIPVFALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLF 282
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
I WTNQKIRKTFVQRNMF+FKHI+ FD++F DNPGPMVVFATPGMLH+G SL IF+KWA
Sbjct: 283 IPWTNQKIRKTFVQRNMFEFKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAG 342
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPG+CVQGT+GHK+LSG +KLE E +Q+++VKM +EYMSFSAHADAKGIMQL+
Sbjct: 343 NEKNMVIMPGYCVQGTVGHKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLV 402
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
EP++VLLVHGEA KM FLK+KI+QE + C+MPANGE+ + T + I + +S+ LLK
Sbjct: 403 GQAEPESVLLVHGEAKKMEFLKQKIEQELRVSCYMPANGETVTLPTSLSIPVGISLGLLK 462
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
E + P K R +HG L++KDS+ L+ ++A KE+G++ H +RFT V + D+
Sbjct: 463 REMAQ--GLLPEAKKPRLLHGTLIMKDSNFRLVSSEQALKELGLAEHQLRFTCRVHLHDT 520
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLL--SVERIDDKNKRVFVTWP 418
+T +Y L S L+D Y L + S++++S+L+ + D K + V+W
Sbjct: 521 RKEQETALRVYSHLKSILKD---HYVQHLPDGSVTVESILIQATAPSEDPGTKVLLVSWT 577
Query: 419 NQDEEVGKIVLHVLK 433
QDEE+G + +LK
Sbjct: 578 YQDEELGSFLTSLLK 592
>gi|350585498|ref|XP_003127541.3| PREDICTED: integrator complex subunit 11-like [Sus scrofa]
Length = 599
Score = 581 bits (1497), Expect = e-163, Method: Compositional matrix adjust.
Identities = 271/435 (62%), Positives = 342/435 (78%), Gaps = 7/435 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+KVG++S+VYTGDYNMTPDRHLGAAWIDKCRP+LLITESTYATTIRDSKRCRERDFL
Sbjct: 163 MFQIKVGSESVVYTGDYNMTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFL 222
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHE V+RGGKVLIPVFALGRAQELCILLET+WERM+L+APIYF+ GLTEKA +YYK+F
Sbjct: 223 KKVHETVERGGKVLIPVFALGRAQELCILLETFWERMDLKAPIYFSTGLTEKANHYYKLF 282
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
I WTNQKIRKTFVQRNMF+FKHI+ FD++F D+PGPMVVFATPGMLH+G SL IF+KWA
Sbjct: 283 IPWTNQKIRKTFVQRNMFEFKHIKAFDRAFADSPGPMVVFATPGMLHAGQSLQIFRKWAG 342
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPG+CVQGT+GHK+LSG +KLE E +Q+++VKM +EYMSFSAHADAKGIMQL+
Sbjct: 343 NEKNMVIMPGYCVQGTVGHKILSGQRKLELEGRQVLEVKMQVEYMSFSAHADAKGIMQLV 402
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
EP+NVLLVHGEA KM FLK+KI+QEF L C+MPANGE+ + T I + +S+ LLK
Sbjct: 403 GQAEPENVLLVHGEAKKMEFLKQKIEQEFRLSCYMPANGETVTLPTSPSIPVGISLGLLK 462
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
E + P+ K R +HG L++KDS+ L+ ++A KE+G++ H +RFT V + D
Sbjct: 463 REMAQ--GLLPDAKKARLLHGTLIMKDSTFRLVSSEQALKELGLAEHQLRFTCRVHLHDP 520
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLL--SVERIDDKNKRVFVTWP 418
+ +Y L S L+D ++ L + S++++S+L+ + D K + V+W
Sbjct: 521 RKEQEMAMRVYSHLKSLLKDHCVQH---LPDGSVTVESILIQAAAHSEDPGTKVLLVSWT 577
Query: 419 NQDEEVGKIVLHVLK 433
QDEE+G + +LK
Sbjct: 578 YQDEELGSYLTSLLK 592
>gi|10433243|dbj|BAB13943.1| unnamed protein product [Homo sapiens]
Length = 499
Score = 581 bits (1497), Expect = e-163, Method: Compositional matrix adjust.
Identities = 271/435 (62%), Positives = 342/435 (78%), Gaps = 7/435 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+KVG++S+VYTGDYNMTPDRHLGAAWIDKCRP+LLITESTYATTIRDSKRCRERDFL
Sbjct: 62 MFQIKVGSESVVYTGDYNMTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFL 121
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHE V+RGGKVLIPVFALGRAQELCILL+T+WERMNL+ PIYF+ GLTEKA +YYK+F
Sbjct: 122 KKVHETVERGGKVLIPVFALGRAQELCILLKTFWERMNLKVPIYFSTGLTEKANHYYKLF 181
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
I WTNQKIRKTFVQRNMF+FKHI+ FD++F DNPGPMVVFATPGMLH+G SL IF+KWA
Sbjct: 182 IPWTNQKIRKTFVQRNMFEFKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAG 241
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPG+CVQGT+GHK+LSG +KLE E +Q+++VKM +EYMSFSAHADAKGIMQL+
Sbjct: 242 NEKNMVIMPGYCVQGTVGHKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLV 301
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
EP++VLLVHGEA KM FLK+KI+QE ++C+MPANGE+ + T I + +S+ LLK
Sbjct: 302 GQAEPESVLLVHGEAKKMEFLKQKIEQELRVNCYMPANGETVTLPTSPSIPVGISLGLLK 361
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
E + P K R +HG L++KDS+ L+ ++A KE+G++ H +RFT V + D+
Sbjct: 362 REMAQ--GLLPEAKKPRLLHGTLIMKDSNFRLVSSEQALKELGLAEHQLRFTCRVHLHDT 419
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLL--SVERIDDKNKRVFVTWP 418
+T +Y L S L+D ++ L + S++++SVLL + D K + V+W
Sbjct: 420 RKEQETALRVYSHLKSVLKDHCVQH---LPDGSVTVESVLLQAAAPSEDPGTKVLLVSWT 476
Query: 419 NQDEEVGKIVLHVLK 433
QDEE+G + +LK
Sbjct: 477 YQDEELGSFLTSLLK 491
>gi|12053137|emb|CAB66747.1| hypothetical protein [Homo sapiens]
gi|49065540|emb|CAG38588.1| FLJ20542 [Homo sapiens]
gi|117645260|emb|CAL38096.1| hypothetical protein [synthetic construct]
gi|208966056|dbj|BAG73042.1| cleavage and polyadenylation specific factor 3-like [synthetic
construct]
Length = 600
Score = 580 bits (1496), Expect = e-162, Method: Compositional matrix adjust.
Identities = 272/435 (62%), Positives = 342/435 (78%), Gaps = 7/435 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+KVG++S+VYTGDYNMTPDRHLGAAWIDKCRP+LLITESTYATTIRDSKRCRERDFL
Sbjct: 163 MFQIKVGSESVVYTGDYNMTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFL 222
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHE V+RGGKVLIPVFALGRAQELCILLET+WERMNL+ PIYF+ GLTEKA +YYK+F
Sbjct: 223 KKVHETVERGGKVLIPVFALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLF 282
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
I WTNQKIRKTFVQRNMF+FKHI+ FD++F DNPGPMVVFATPGMLH+G SL IF+KWA
Sbjct: 283 IPWTNQKIRKTFVQRNMFEFKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAG 342
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPG+CVQGT+GHK+LSG +KLE E +Q+++VKM +EYMSFSAHADAKGIMQL+
Sbjct: 343 NEKNMVIMPGYCVQGTVGHKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLV 402
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
EP++VLLVHGEA KM FLK+KI+QE ++C+MPANGE+ + T I + +S+ LLK
Sbjct: 403 GQAEPESVLLVHGEAKKMEFLKQKIEQELRVNCYMPANGETVTLPTSPSIPVGISLGLLK 462
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
E + P K R +HG L++KDS+ L+ ++A KE+G++ H +RFT V + D+
Sbjct: 463 REMAQ--GLLPEAKKPRLLHGTLIMKDSNFRLVSSEQALKELGLAEHQLRFTCRVHLHDT 520
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLL--SVERIDDKNKRVFVTWP 418
+T +Y L S L+D ++ L + S++++SVLL + D K + V+W
Sbjct: 521 RKEQETALRVYSHLKSVLKDHCVQH---LPDGSVTVESVLLQAAAPSEDPGTKVLLVSWT 577
Query: 419 NQDEEVGKIVLHVLK 433
QDEE+G + +LK
Sbjct: 578 YQDEELGSFLTSLLK 592
>gi|374253821|ref|NP_001243389.1| integrator complex subunit 11 isoform 3 [Homo sapiens]
gi|194386866|dbj|BAG59799.1| unnamed protein product [Homo sapiens]
Length = 571
Score = 580 bits (1496), Expect = e-162, Method: Compositional matrix adjust.
Identities = 272/435 (62%), Positives = 342/435 (78%), Gaps = 7/435 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+KVG++S+VYTGDYNMTPDRHLGAAWIDKCRP+LLITESTYATTIRDSKRCRERDFL
Sbjct: 134 MFQIKVGSESVVYTGDYNMTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFL 193
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHE V+RGGKVLIPVFALGRAQELCILLET+WERMNL+ PIYF+ GLTEKA +YYK+F
Sbjct: 194 KKVHETVERGGKVLIPVFALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLF 253
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
I WTNQKIRKTFVQRNMF+FKHI+ FD++F DNPGPMVVFATPGMLH+G SL IF+KWA
Sbjct: 254 IPWTNQKIRKTFVQRNMFEFKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAG 313
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPG+CVQGT+GHK+LSG +KLE E +Q+++VKM +EYMSFSAHADAKGIMQL+
Sbjct: 314 NEKNMVIMPGYCVQGTVGHKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLV 373
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
EP++VLLVHGEA KM FLK+KI+QE ++C+MPANGE+ + T I + +S+ LLK
Sbjct: 374 GQAEPESVLLVHGEAKKMEFLKQKIEQELRVNCYMPANGETVTLPTSPSIPVGISLGLLK 433
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
E + P K R +HG L++KDS+ L+ ++A KE+G++ H +RFT V + D+
Sbjct: 434 REMAQ--GLLPEAKKPRLLHGTLIMKDSNFRLVSSEQALKELGLAEHQLRFTCRVHLHDT 491
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLL--SVERIDDKNKRVFVTWP 418
+T +Y L S L+D ++ L + S++++SVLL + D K + V+W
Sbjct: 492 RKEQETALRVYSHLKSVLKDHCVQH---LPDGSVTVESVLLQAAAPSEDPGTKVLLVSWT 548
Query: 419 NQDEEVGKIVLHVLK 433
QDEE+G + +LK
Sbjct: 549 YQDEELGSFLTSLLK 563
>gi|33300633|ref|NP_060341.2| integrator complex subunit 11 isoform 2 [Homo sapiens]
gi|118572557|sp|Q5TA45.2|INT11_HUMAN RecName: Full=Integrator complex subunit 11; Short=Int11; AltName:
Full=Cleavage and polyadenylation-specific factor 3-like
protein; Short=CPSF3-like protein; AltName: Full=Protein
related to CPSF subunits of 68 kDa; Short=RC-68
gi|14124912|gb|AAH07978.1| Cleavage and polyadenylation specific factor 3-like [Homo sapiens]
gi|60650138|tpg|DAA05669.1| TPA_exp: beta-lactamase fold protein family member RC-68 [Homo
sapiens]
gi|78100161|tpg|DAA05728.1| TPA_exp: integrator complex subunit 11 [Homo sapiens]
gi|119576636|gb|EAW56232.1| cleavage and polyadenylation specific factor 3-like, isoform CRA_a
[Homo sapiens]
gi|119576638|gb|EAW56234.1| cleavage and polyadenylation specific factor 3-like, isoform CRA_a
[Homo sapiens]
Length = 600
Score = 580 bits (1496), Expect = e-162, Method: Compositional matrix adjust.
Identities = 272/435 (62%), Positives = 342/435 (78%), Gaps = 7/435 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+KVG++S+VYTGDYNMTPDRHLGAAWIDKCRP+LLITESTYATTIRDSKRCRERDFL
Sbjct: 163 MFQIKVGSESVVYTGDYNMTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFL 222
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHE V+RGGKVLIPVFALGRAQELCILLET+WERMNL+ PIYF+ GLTEKA +YYK+F
Sbjct: 223 KKVHETVERGGKVLIPVFALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLF 282
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
I WTNQKIRKTFVQRNMF+FKHI+ FD++F DNPGPMVVFATPGMLH+G SL IF+KWA
Sbjct: 283 IPWTNQKIRKTFVQRNMFEFKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAG 342
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPG+CVQGT+GHK+LSG +KLE E +Q+++VKM +EYMSFSAHADAKGIMQL+
Sbjct: 343 NEKNMVIMPGYCVQGTVGHKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLV 402
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
EP++VLLVHGEA KM FLK+KI+QE ++C+MPANGE+ + T I + +S+ LLK
Sbjct: 403 GQAEPESVLLVHGEAKKMEFLKQKIEQELRVNCYMPANGETVTLPTSPSIPVGISLGLLK 462
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
E + P K R +HG L++KDS+ L+ ++A KE+G++ H +RFT V + D+
Sbjct: 463 REMAQ--GLLPEAKKPRLLHGTLIMKDSNFRLVSSEQALKELGLAEHQLRFTCRVHLHDT 520
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLL--SVERIDDKNKRVFVTWP 418
+T +Y L S L+D ++ L + S++++SVLL + D K + V+W
Sbjct: 521 RKEQETALRVYSHLKSVLKDHCVQH---LPDGSVTVESVLLQAAAPSEDPGTKVLLVSWT 577
Query: 419 NQDEEVGKIVLHVLK 433
QDEE+G + +LK
Sbjct: 578 YQDEELGSFLTSLLK 592
>gi|348551496|ref|XP_003461566.1| PREDICTED: integrator complex subunit 11 [Cavia porcellus]
Length = 600
Score = 580 bits (1496), Expect = e-162, Method: Compositional matrix adjust.
Identities = 271/435 (62%), Positives = 343/435 (78%), Gaps = 7/435 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+KVG++S+VYTGDYNMTPDRHLGAAWIDKCRP+LLITESTYATTIRDSKRCRERDFL
Sbjct: 163 MFQIKVGSESVVYTGDYNMTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFL 222
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHE V+RGGKVLIPVFALGRAQELCILLET+WERMNL+ PIYF+ GLTEKA +YYK+F
Sbjct: 223 KKVHETVERGGKVLIPVFALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLF 282
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
ITWTNQKIRKTFVQRNMF+FKHI+ FD++F DNPGPMVVFATPGMLH+G SL IF+KWA
Sbjct: 283 ITWTNQKIRKTFVQRNMFEFKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAG 342
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPG+CVQGT+GHK+LSG +KLE E +Q+++VKM +EYMSFSAHADAKGIMQL+
Sbjct: 343 NEKNMVIMPGYCVQGTVGHKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLV 402
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
EP++VLLVHGEA KM FL++KI+QEF + C+MPANGE+ + T I + +S+ LLK
Sbjct: 403 GQAEPESVLLVHGEAKKMEFLRQKIEQEFRVSCYMPANGETVTLPTSPSIPVGISLGLLK 462
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
+E + P K R +HG L++KDSS L+ ++A KE+G++ H +RFT V + D+
Sbjct: 463 QEMAQ--GLLPEAKKPRLLHGTLIMKDSSFRLVSSEQALKELGLAEHQLRFTCRVHLQDT 520
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLL--SVERIDDKNKRVFVTWP 418
+ +Y L S L+D ++ L + S++++S+L+ + D K + V+W
Sbjct: 521 RKEQEMALRVYSHLKSTLKDHCVQH---LPDGSVTVESILIQAAAHSEDPGIKVLLVSWT 577
Query: 419 NQDEEVGKIVLHVLK 433
QDEE+G + +LK
Sbjct: 578 YQDEELGSFLTTLLK 592
>gi|74220481|dbj|BAE31460.1| unnamed protein product [Mus musculus]
Length = 600
Score = 580 bits (1496), Expect = e-162, Method: Compositional matrix adjust.
Identities = 271/436 (62%), Positives = 344/436 (78%), Gaps = 7/436 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+KVG++S+VYTGDYNMTPDRHLGAAWIDKCRP+LLITESTYATTIRDSKRCRERDFL
Sbjct: 163 MFQIKVGSESVVYTGDYNMTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFL 222
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHE V+RGGKVLIPVFALGRAQELCILLET+WERMNL+ PIYF+ GLTEKA +YYK+F
Sbjct: 223 KKVHETVERGGKVLIPVFALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLF 282
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
ITWTNQKIRKTFVQ NMF+FKHI+ FD++F DNPGPMVVFATPGMLH+G SL IF+KWA
Sbjct: 283 ITWTNQKIRKTFVQGNMFEFKHIKAFDRTFADNPGPMVVFATPGMLHAGQSLQIFRKWAG 342
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPG+CVQGT+GHK+LSG +KLE E +Q+++VKM +EYMSFSAHADAKGIMQL+
Sbjct: 343 NEKNMVIMPGYCVQGTVGHKILSGQRKLEMEGRQMLEVKMQVEYMSFSAHADAKGIMQLV 402
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
EP++VLLVHGEA KM FL++KI+QEF + C+MPANGE+ + T I + +S+ LLK
Sbjct: 403 GQAEPESVLLVHGEAKKMEFLRQKIEQEFRVSCYMPANGETVTLPTSPSIPVGISLGLLK 462
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
E V+ P K R +HG L++KDS+ L+ ++A KE+G++ H +RFT V + D+
Sbjct: 463 REMVQ--GLLPEAKKPRLLHGTLIMKDSNFRLVSSEQALKELGLAEHQLRFTCRVHLQDT 520
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLL--SVERIDDKNKRVFVTWP 418
+T +Y L S L+D ++ L + S++++S+L+ + D K + V+W
Sbjct: 521 RKEQETALRVYSHLKSTLKDHCVQH---LPDGSVTVESILIQAAAHSEDPGTKVLLVSWT 577
Query: 419 NQDEEVGKIVLHVLKS 434
QDEE+G + +LK+
Sbjct: 578 YQDEELGSFLTTLLKN 593
>gi|426327396|ref|XP_004024504.1| PREDICTED: integrator complex subunit 11 isoform 4 [Gorilla gorilla
gorilla]
Length = 499
Score = 580 bits (1496), Expect = e-162, Method: Compositional matrix adjust.
Identities = 270/435 (62%), Positives = 342/435 (78%), Gaps = 7/435 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+KVG++S+VYTGDYNMTPDRHLGAAWIDKCRP+LLITESTYATTIRDSKRCRERDFL
Sbjct: 62 MFQIKVGSESVVYTGDYNMTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFL 121
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHE V+RGGKVLIPVFALGRAQELCILLET+WERMNL+ PIYF+ GLTEKA +YYK+F
Sbjct: 122 KKVHETVERGGKVLIPVFALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLF 181
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
I WTNQKIRKTFVQRNMF+FKHI+ FD++F DNPGPMVVFATPGMLH+G SL IF+KWA
Sbjct: 182 IPWTNQKIRKTFVQRNMFEFKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAG 241
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPG+CVQGT+GHK+LSG +KLE E +Q+++VKM +EYMSFSAHADAKGIMQL+
Sbjct: 242 NEKNMVIMPGYCVQGTVGHKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLV 301
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
EP++VLLVHGEA KM FLK+KI+QE + C+MPANGE+ + T I + +S+ LLK
Sbjct: 302 GQAEPESVLLVHGEAKKMEFLKQKIEQELRVSCYMPANGETVTLPTSPSIPVGISLGLLK 361
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
E + P+ K R +HG L++KDS+ L+ ++A KE+G++ H +RFT V + D+
Sbjct: 362 REMAQ--GLLPDAKKPRLLHGTLIMKDSNFRLVSSEQALKELGLAEHQLRFTCRVHLHDT 419
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLL--SVERIDDKNKRVFVTWP 418
+T +Y L S L+D ++ L + S++++S+L+ + D K + V+W
Sbjct: 420 RKEQETALRVYSHLKSILKDHCVQH---LPDGSVTVESILIQAAAPSEDPGTKVLLVSWT 476
Query: 419 NQDEEVGKIVLHVLK 433
QDEE+G + +LK
Sbjct: 477 YQDEELGSFLTSLLK 491
>gi|343958314|dbj|BAK63012.1| protein related to CPSF subunits 68 kDa [Pan troglodytes]
Length = 600
Score = 580 bits (1495), Expect = e-162, Method: Compositional matrix adjust.
Identities = 272/435 (62%), Positives = 341/435 (78%), Gaps = 7/435 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+KVG++S+VYTGDYNMTPDRHLGAAWIDKCRP+LLITESTYATTIRDSKRCRERDFL
Sbjct: 163 MFQIKVGSESVVYTGDYNMTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFL 222
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHE V+RGGKVLIPVFALGRAQELCILLET+WERMNL+ PIYF+ GLTEKA +YYK+F
Sbjct: 223 KKVHETVERGGKVLIPVFALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLF 282
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
I WTNQKIRKTFVQRNMF+FKHI+ FD++F DNPGPMVVFATPGMLH+G SL IF+KWA
Sbjct: 283 IPWTNQKIRKTFVQRNMFEFKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAG 342
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPG+CVQGT+GHK+LSG +KLE E +Q+++VKM +EYMSFSAHADAKGIMQL+
Sbjct: 343 NEKNMVIMPGYCVQGTVGHKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLV 402
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
EP++VLLVHGEA KM FLK+KI+QE + C+MPANGE+ + T I + +S+ LLK
Sbjct: 403 GQAEPESVLLVHGEAKKMEFLKQKIEQELRVSCYMPANGETVTLPTSPSIPVGISLGLLK 462
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
E + P K R +HG L++KDS+ L+ ++A KE+G++ H +RFT V + D+
Sbjct: 463 REMAQ--GLLPEAKKPRLLHGTLIMKDSNFRLVSSEQALKELGLAEHQLRFTCRVHLHDT 520
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLL--SVERIDDKNKRVFVTWP 418
+T +Y L S L+D ++ L + S++++SVLL + D K + V+W
Sbjct: 521 RKEQETALRVYSHLKSVLKDHCVQH---LPDGSVTVESVLLQAAAPSEDPGTKVLLVSWT 577
Query: 419 NQDEEVGKIVLHVLK 433
QDEE+G + +LK
Sbjct: 578 YQDEELGSFLTSLLK 592
>gi|440911726|gb|ELR61363.1| Integrator complex subunit 11 [Bos grunniens mutus]
Length = 599
Score = 580 bits (1494), Expect = e-162, Method: Compositional matrix adjust.
Identities = 270/435 (62%), Positives = 344/435 (79%), Gaps = 7/435 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+KVG++S+VYTGDYNMTPDRHLGAAWIDKCRP LLITESTYATTIRDSKRCRERDFL
Sbjct: 163 MFQIKVGSESVVYTGDYNMTPDRHLGAAWIDKCRPSLLITESTYATTIRDSKRCRERDFL 222
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHE V+RGGKVLIPVFALGRAQELCILLET+WERM+L+APIYF+ GLTEKA +YYK+F
Sbjct: 223 KKVHETVERGGKVLIPVFALGRAQELCILLETFWERMDLKAPIYFSTGLTEKANHYYKLF 282
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
I WTNQKIRKTFVQRNMF+FKHI+ FD++F D+PGPMVVFATPGMLH+G SL IF+KWA
Sbjct: 283 IPWTNQKIRKTFVQRNMFEFKHIKAFDRAFADSPGPMVVFATPGMLHAGQSLQIFRKWAG 342
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPG+CVQGT+GHK+LSG +KLE E +Q+++VKM +EYMSFSAHADAKGIMQL+
Sbjct: 343 NEKNMVIMPGYCVQGTVGHKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLV 402
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
EP+NVLLVHGEA KM FLK+KI+QEF ++C+MPANGE+ + T I + +S+ LLK
Sbjct: 403 GQAEPENVLLVHGEAKKMEFLKQKIEQEFRVNCYMPANGETVTLPTSPSIPVGISLGLLK 462
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
E + P+ K R +HG L++KDS+ L+ ++A KE+G++ H +RFT V + D+
Sbjct: 463 REMAQ--GLLPDAKKPRLLHGTLIMKDSNFRLVSSEQALKELGLAEHQLRFTCRVHLHDT 520
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLL--SVERIDDKNKRVFVTWP 418
+ +Y+ L S L+D ++ L + S++++S+L+ + D K + V+W
Sbjct: 521 RKEQEMAMRVYNHLKSVLKDHCVQH---LPDGSVTVESILVQAAAHSEDPGTKVLLVSWT 577
Query: 419 NQDEEVGKIVLHVLK 433
QDEE+G + +LK
Sbjct: 578 YQDEELGSYLTSLLK 592
>gi|374253819|ref|NP_001243385.1| integrator complex subunit 11 isoform 1 [Homo sapiens]
gi|119576642|gb|EAW56238.1| cleavage and polyadenylation specific factor 3-like, isoform CRA_f
[Homo sapiens]
Length = 606
Score = 580 bits (1494), Expect = e-162, Method: Compositional matrix adjust.
Identities = 272/435 (62%), Positives = 342/435 (78%), Gaps = 7/435 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+KVG++S+VYTGDYNMTPDRHLGAAWIDKCRP+LLITESTYATTIRDSKRCRERDFL
Sbjct: 169 MFQIKVGSESVVYTGDYNMTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFL 228
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHE V+RGGKVLIPVFALGRAQELCILLET+WERMNL+ PIYF+ GLTEKA +YYK+F
Sbjct: 229 KKVHETVERGGKVLIPVFALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLF 288
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
I WTNQKIRKTFVQRNMF+FKHI+ FD++F DNPGPMVVFATPGMLH+G SL IF+KWA
Sbjct: 289 IPWTNQKIRKTFVQRNMFEFKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAG 348
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPG+CVQGT+GHK+LSG +KLE E +Q+++VKM +EYMSFSAHADAKGIMQL+
Sbjct: 349 NEKNMVIMPGYCVQGTVGHKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLV 408
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
EP++VLLVHGEA KM FLK+KI+QE ++C+MPANGE+ + T I + +S+ LLK
Sbjct: 409 GQAEPESVLLVHGEAKKMEFLKQKIEQELRVNCYMPANGETVTLPTSPSIPVGISLGLLK 468
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
E + P K R +HG L++KDS+ L+ ++A KE+G++ H +RFT V + D+
Sbjct: 469 REMAQ--GLLPEAKKPRLLHGTLIMKDSNFRLVSSEQALKELGLAEHQLRFTCRVHLHDT 526
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLL--SVERIDDKNKRVFVTWP 418
+T +Y L S L+D ++ L + S++++SVLL + D K + V+W
Sbjct: 527 RKEQETALRVYSHLKSVLKDHCVQH---LPDGSVTVESVLLQAAAPSEDPGTKVLLVSWT 583
Query: 419 NQDEEVGKIVLHVLK 433
QDEE+G + +LK
Sbjct: 584 YQDEELGSFLTSLLK 598
>gi|397476276|ref|XP_003809533.1| PREDICTED: integrator complex subunit 11 isoform 1 [Pan paniscus]
gi|410206788|gb|JAA00613.1| cleavage and polyadenylation specific factor 3-like [Pan
troglodytes]
gi|410251172|gb|JAA13553.1| cleavage and polyadenylation specific factor 3-like [Pan
troglodytes]
gi|410297680|gb|JAA27440.1| cleavage and polyadenylation specific factor 3-like [Pan
troglodytes]
gi|410349815|gb|JAA41511.1| cleavage and polyadenylation specific factor 3-like [Pan
troglodytes]
Length = 600
Score = 580 bits (1494), Expect = e-162, Method: Compositional matrix adjust.
Identities = 272/435 (62%), Positives = 341/435 (78%), Gaps = 7/435 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+KVG++S+VYTGDYNMTPDRHLGAAWIDKCRP+LLITESTYATTIRDSKRCRERDFL
Sbjct: 163 MFQIKVGSESVVYTGDYNMTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFL 222
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHE V+RGGKVLIPVFALGRAQELCILLET+WERMNL+ PIYF+ GLTEKA +YYK+F
Sbjct: 223 KKVHETVERGGKVLIPVFALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLF 282
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
I WTNQKIRKTFVQRNMF+FKHI+ FD++F DNPGPMVVFATPGMLH+G SL IF+KWA
Sbjct: 283 IPWTNQKIRKTFVQRNMFEFKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAG 342
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPG+CVQGT+GHK+LSG +KLE E +Q+++VKM +EYMSFSAHADAKGIMQL+
Sbjct: 343 NEKNMVIMPGYCVQGTVGHKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLV 402
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
EP++VLLVHGEA KM FLK+KI+QE + C+MPANGE+ + T I + +S+ LLK
Sbjct: 403 GQAEPESVLLVHGEAKKMEFLKQKIEQELRVSCYMPANGETVTLPTSPSIPVGISLGLLK 462
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
E + P K R +HG L++KDS+ L+ ++A KE+G++ H +RFT V + D+
Sbjct: 463 REMAQ--GLLPEAKKPRLLHGTLIMKDSNFRLVSSEQALKELGLAEHQLRFTCRVHLHDT 520
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLL--SVERIDDKNKRVFVTWP 418
+T +Y L S L+D ++ L + S++++SVLL + D K + V+W
Sbjct: 521 RKEQETALRVYSHLKSVLKDHCVQH---LPDGSVTVESVLLQAAAPSEDPGTKVLLVSWT 577
Query: 419 NQDEEVGKIVLHVLK 433
QDEE+G + +LK
Sbjct: 578 YQDEELGSFLTSLLK 592
>gi|274326663|ref|NP_001094578.1| integrator complex subunit 11 [Bos taurus]
gi|152941100|gb|ABS44987.1| related to CPSF subunits 68 kDa [Bos taurus]
Length = 599
Score = 580 bits (1494), Expect = e-162, Method: Compositional matrix adjust.
Identities = 270/435 (62%), Positives = 343/435 (78%), Gaps = 7/435 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+KVG++S+VYTGDYNMTPDRHLGAAWIDKCRP LLITESTYATTIRDSKRCRERDFL
Sbjct: 163 MFQIKVGSESVVYTGDYNMTPDRHLGAAWIDKCRPSLLITESTYATTIRDSKRCRERDFL 222
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHE V+RGGKVLIPVFALGRAQELCILLET+WERM+L+APIYF+ GLTEKA +YYK+F
Sbjct: 223 KKVHETVERGGKVLIPVFALGRAQELCILLETFWERMDLKAPIYFSTGLTEKANHYYKLF 282
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
I WTNQKIRKTFVQRNMF+FKHI+ FD++F D+PGPMVVFATPGMLH+G SL IF+KWA
Sbjct: 283 IPWTNQKIRKTFVQRNMFEFKHIKAFDRAFADSPGPMVVFATPGMLHAGQSLQIFRKWAG 342
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPG+CVQGT+GHK+LSG +KLE E +Q+++VKM +EYMSFSAHADAKGIMQL+
Sbjct: 343 NEKNMVIMPGYCVQGTVGHKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLV 402
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
EP+NVLLVHGEA KM FLK+KI+QEF ++C+MPANGE+ + T I + +S+ LLK
Sbjct: 403 GQAEPENVLLVHGEAKKMEFLKQKIEQEFRVNCYMPANGETVTLPTSPSIPVGISLGLLK 462
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
E + P+ K R +HG L++KDS+ L+ ++A KE+G++ H +RFT V + D+
Sbjct: 463 REMAQ--GLLPDAKKPRLLHGTLIMKDSNFRLVSSEQALKELGLAEHQLRFTCRVHLHDT 520
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLL--SVERIDDKNKRVFVTWP 418
+ +Y L S L+D ++ L + S++++S+L+ + D K + V+W
Sbjct: 521 RKEQEMAMRVYSHLKSVLKDHCVQH---LPDGSVTVESILVQAAAHSEDPGTKVLLVSWT 577
Query: 419 NQDEEVGKIVLHVLK 433
QDEE+G + +LK
Sbjct: 578 YQDEELGSYLTSLLK 592
>gi|403297740|ref|XP_003939710.1| PREDICTED: integrator complex subunit 11 isoform 2 [Saimiri
boliviensis boliviensis]
Length = 499
Score = 580 bits (1494), Expect = e-162, Method: Compositional matrix adjust.
Identities = 269/435 (61%), Positives = 341/435 (78%), Gaps = 7/435 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+KVG++S+VYTGDYNMTPDRHLGAAWIDKCRP+LLITESTYATTIRDSKRCRERDFL
Sbjct: 62 MFQIKVGSESVVYTGDYNMTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFL 121
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHE V+ GGKVLIPVFALGRAQELCILLET+WERMNL+ PIYF+ GLTEKA +YYK+F
Sbjct: 122 KKVHETVEHGGKVLIPVFALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLF 181
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
I WTNQKIRKTFVQRNMF+FKHI+ FD++F DNPGPMVVFATPGMLH+G SL IF+KWA
Sbjct: 182 IPWTNQKIRKTFVQRNMFEFKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAG 241
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPG+CVQGT+GHK+LSG +KLE E +Q+++VKM +EYMSFSAHADAKGIMQL+
Sbjct: 242 NEKNMVIMPGYCVQGTVGHKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLV 301
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
EP+NVLLVHGEA KM FLK+KI+QE + C+MPANGE+ + T + I + +S+ LLK
Sbjct: 302 GQAEPENVLLVHGEAKKMEFLKQKIEQELRVSCYMPANGETVTLPTSLSIPVGISLGLLK 361
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
E + P K R +HG L++KDS+ L+ ++A KE+G++ H +RFT V + D+
Sbjct: 362 REMAQ--GLLPEAKKPRLLHGTLIMKDSNFRLVSSEQALKELGLAEHQLRFTCRVHLHDT 419
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLL--SVERIDDKNKRVFVTWP 418
+T +Y L S L++ ++ L + S++++S+L+ + D K + V+W
Sbjct: 420 RKEQETALRVYSHLKSILKEHCVQH---LPDGSVTVESILIQAAAPSEDPGTKVLLVSWT 476
Query: 419 NQDEEVGKIVLHVLK 433
QDEE+G + +LK
Sbjct: 477 YQDEELGSFLTSLLK 491
>gi|431922648|gb|ELK19568.1| Integrator complex subunit 11 [Pteropus alecto]
Length = 603
Score = 579 bits (1493), Expect = e-162, Method: Compositional matrix adjust.
Identities = 270/435 (62%), Positives = 343/435 (78%), Gaps = 7/435 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ++VG++S+VYTGDYNMTPDRHLGAAWID+CRP+LLITESTYATTIRDSKRCRERDFL
Sbjct: 163 MFQIRVGSESVVYTGDYNMTPDRHLGAAWIDRCRPNLLITESTYATTIRDSKRCRERDFL 222
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHE V+RGGKVLIPVFALGRAQELCILLET+WERMNL+APIYF+ GLTEKA +YYK+F
Sbjct: 223 KKVHETVERGGKVLIPVFALGRAQELCILLETFWERMNLKAPIYFSTGLTEKANHYYKLF 282
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
ITWTNQKIRKTFVQRNMF+FKHI+ FD++F DNPGPMVVFATPGMLH+G SL IF+KWA
Sbjct: 283 ITWTNQKIRKTFVQRNMFEFKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAG 342
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPG+CVQGT+GHK+LSG +KLE E +Q+++VKM +EYMSFSAHADAKGIMQL+
Sbjct: 343 NEKNMVIMPGYCVQGTVGHKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLV 402
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
EP++VLLVHGEA KM FLK+KI+QEF + C+MPANGE+ + T I + +S+ LLK
Sbjct: 403 GQAEPESVLLVHGEAKKMEFLKQKIEQEFRVSCYMPANGETVTLPTSPSIPVGISLGLLK 462
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
E + P+ K R +HG LV+KD++ L+ ++A KE+G++ H +RFT V + D
Sbjct: 463 REMAQ--GLLPDAKKPRLLHGTLVMKDNNFRLVSSEQALKELGLAEHQLRFTCRVHLHDP 520
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLL--SVERIDDKNKRVFVTWP 418
+ +Y L S L+D ++ L + S++++S+L+ + D K + V+W
Sbjct: 521 RKEQEMALRVYSHLKSVLKDHCVQH---LPDGSVTVESILIQAAAHSEDPGTKVLLVSWT 577
Query: 419 NQDEEVGKIVLHVLK 433
QDEE+G + +LK
Sbjct: 578 YQDEELGSYLTSLLK 592
>gi|193608339|ref|XP_001949326.1| PREDICTED: integrator complex subunit 11-like isoform 1
[Acyrthosiphon pisum]
gi|328710634|ref|XP_003244318.1| PREDICTED: integrator complex subunit 11-like isoform 2
[Acyrthosiphon pisum]
Length = 603
Score = 579 bits (1493), Expect = e-162, Method: Compositional matrix adjust.
Identities = 271/441 (61%), Positives = 350/441 (79%), Gaps = 7/441 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +KVGN S+VYTGD++MTPDRHLGAAWIDKCRP LLITESTYATTIRDSKRCRERDFL
Sbjct: 166 MFYIKVGNDSVVYTGDFSMTPDRHLGAAWIDKCRPTLLITESTYATTIRDSKRCRERDFL 225
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
K VHEC+DRGGKVLIP+FALGRAQELCIL++TYW+RM L+ P+YFA GLTEKA +YYKMF
Sbjct: 226 KNVHECIDRGGKVLIPIFALGRAQELCILIDTYWDRMGLKVPVYFAAGLTEKANSYYKMF 285
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
ITWTNQK+R+TFVQRNMFDFKHI+PFDK+++ NPGPMVVFATPGMLH+GLSL IFKKWAP
Sbjct: 286 ITWTNQKVRQTFVQRNMFDFKHIKPFDKTYMHNPGPMVVFATPGMLHAGLSLNIFKKWAP 345
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NMLI+PG+CV GT+G+KVLSG KK+E E + IDVKM++EY+SFSAHAD KGI+QLI
Sbjct: 346 DEKNMLIVPGYCVSGTVGNKVLSGSKKIEAEPNKFIDVKMSVEYLSFSAHADGKGIIQLI 405
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
+ CEP+NVLLVHGE KM FL+ KI QEFN++C+MPANGE+ ++T + ++S L+K
Sbjct: 406 KNCEPQNVLLVHGEEEKMKFLRAKIMQEFNINCYMPANGETVEIETANIHTFNMSTKLVK 465
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISL--MDVDEACKEVGISRHIVRFTSNVQIS 358
E ++ N +R +HG ++K+ S +L M V + KE+ I RH +RFT V+I
Sbjct: 466 EILDDQSNNLGN--GKRLVHGAFLLKNGSSNLNAMSVKDTFKELDIQRHTLRFTCTVEIK 523
Query: 359 DSSPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLLSVE-RIDDKNKRVFVTW 417
+SS I KT ++D L + L K+ +++ S+S+ S+L++VE +D+ K++ ++W
Sbjct: 524 ESSTIQKTKTKLFDMLENNLPIFKN--KLTMSDGSISLASILVTVEDGMDNNKKKIVISW 581
Query: 418 PNQDEEVGKIVLHVLKSMSNT 438
N+DE +G VL +L++MS +
Sbjct: 582 ANEDEPLGNHVLEMLQNMSKS 602
>gi|296479091|tpg|DAA21206.1| TPA: cleavage and polyadenylation specific factor 3-like [Bos
taurus]
Length = 599
Score = 579 bits (1493), Expect = e-162, Method: Compositional matrix adjust.
Identities = 270/435 (62%), Positives = 343/435 (78%), Gaps = 7/435 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+KVG++S+VYTGDYNMTPDRHLGAAWIDKCRP LLITESTYATTIRDSKRCRERDFL
Sbjct: 163 MFQIKVGSESVVYTGDYNMTPDRHLGAAWIDKCRPSLLITESTYATTIRDSKRCRERDFL 222
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHE V+RGGKVLIPVFALGRAQELCILLET+WERM+L+APIYF+ GLTEKA +YYK+F
Sbjct: 223 KKVHETVERGGKVLIPVFALGRAQELCILLETFWERMDLKAPIYFSTGLTEKANHYYKLF 282
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
I WTNQKIRKTFVQRNMF+FKHI+ FD++F D+PGPMVVFATPGMLH+G SL IF+KWA
Sbjct: 283 IPWTNQKIRKTFVQRNMFEFKHIKAFDRAFADSPGPMVVFATPGMLHAGQSLQIFRKWAG 342
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPG+CVQGT+GHK+LSG +KLE E +Q+++VKM +EYMSFSAHADAKGIMQL+
Sbjct: 343 NEKNMVIMPGYCVQGTVGHKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLV 402
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
EP+NVLLVHGEA KM FLK+KI+QEF ++C+MPANGE+ + T I + +S+ LLK
Sbjct: 403 GQAEPENVLLVHGEAKKMEFLKQKIEQEFRVNCYMPANGETVTLPTSPSIPVGISLGLLK 462
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
E + P+ K R +HG L++KDS+ L+ ++A KE+G++ H +RFT V + D+
Sbjct: 463 REMAQ--GLLPDAKKPRLLHGTLIMKDSNFRLVSSEQALKELGLAEHQLRFTCRVHLHDT 520
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLL--SVERIDDKNKRVFVTWP 418
+ +Y L S L+D ++ L + S++++S+L+ + D K + V+W
Sbjct: 521 RKEQEMAMRVYSHLKSVLKDHCVQH---LPDGSVTVESILVQAAAHSEDPGTKVLLVSWT 577
Query: 419 NQDEEVGKIVLHVLK 433
QDEE+G + +LK
Sbjct: 578 YQDEELGSYLTSLLK 592
>gi|118572556|sp|Q2YDM2.2|INT11_BOVIN RecName: Full=Integrator complex subunit 11; Short=Int11; AltName:
Full=Cleavage and polyadenylation-specific factor 3-like
protein; Short=CPSF3-like protein
gi|158455110|gb|AAI10156.2| CPSF3L protein [Bos taurus]
Length = 599
Score = 579 bits (1492), Expect = e-162, Method: Compositional matrix adjust.
Identities = 270/435 (62%), Positives = 343/435 (78%), Gaps = 7/435 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+KVG++S+VYTGDYNMTPDRHLGAAWIDKCRP LLITESTYATTIRDSKRCRERDFL
Sbjct: 163 MFQIKVGSESVVYTGDYNMTPDRHLGAAWIDKCRPSLLITESTYATTIRDSKRCRERDFL 222
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHE V+RGGKVLIPVFALGRAQELCILLET+WERM+L+APIYF+ GLTEKA +YYK+F
Sbjct: 223 KKVHETVERGGKVLIPVFALGRAQELCILLETFWERMDLKAPIYFSTGLTEKANHYYKLF 282
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
I WTNQKIRKTFVQRNMF+FKHI+ FD++F D+PGPMVVFATPGMLH+G SL IF+KWA
Sbjct: 283 IPWTNQKIRKTFVQRNMFEFKHIKAFDRAFADSPGPMVVFATPGMLHAGQSLQIFRKWAG 342
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPG+CVQGT+GHK+LSG +KLE E +Q+++VKM +EYMSFSAHADAKGIMQL+
Sbjct: 343 NEKNMVIMPGYCVQGTVGHKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLV 402
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
EP+NVLLVHGEA KM FLK+KI+QEF ++C+MPANGE+ + T I + +S+ LLK
Sbjct: 403 GQAEPENVLLVHGEAKKMEFLKQKIEQEFRVNCYMPANGETVTLPTSPSIPVGISLGLLK 462
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
E + P+ K R +HG L++KDS+ L+ ++A KE+G++ H +RFT V + D+
Sbjct: 463 REMAQ--GLLPDAKKPRLLHGTLIMKDSNFRLVSSEQALKELGLAEHQLRFTCRVHLHDT 520
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLL--SVERIDDKNKRVFVTWP 418
+ +Y L S L+D ++ L + S++++S+L+ + D K + V+W
Sbjct: 521 RKEQEMAMRVYSHLKSVLKDHCVQH---LPDGSVTVESILVQAAAHSEDPGTKVLLVSWT 577
Query: 419 NQDEEVGKIVLHVLK 433
QDEE+G + +LK
Sbjct: 578 YQDEELGSYLTSLLK 592
>gi|397476278|ref|XP_003809534.1| PREDICTED: integrator complex subunit 11 isoform 2 [Pan paniscus]
Length = 606
Score = 579 bits (1492), Expect = e-162, Method: Compositional matrix adjust.
Identities = 272/435 (62%), Positives = 341/435 (78%), Gaps = 7/435 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+KVG++S+VYTGDYNMTPDRHLGAAWIDKCRP+LLITESTYATTIRDSKRCRERDFL
Sbjct: 169 MFQIKVGSESVVYTGDYNMTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFL 228
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHE V+RGGKVLIPVFALGRAQELCILLET+WERMNL+ PIYF+ GLTEKA +YYK+F
Sbjct: 229 KKVHETVERGGKVLIPVFALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLF 288
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
I WTNQKIRKTFVQRNMF+FKHI+ FD++F DNPGPMVVFATPGMLH+G SL IF+KWA
Sbjct: 289 IPWTNQKIRKTFVQRNMFEFKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAG 348
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPG+CVQGT+GHK+LSG +KLE E +Q+++VKM +EYMSFSAHADAKGIMQL+
Sbjct: 349 NEKNMVIMPGYCVQGTVGHKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLV 408
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
EP++VLLVHGEA KM FLK+KI+QE + C+MPANGE+ + T I + +S+ LLK
Sbjct: 409 GQAEPESVLLVHGEAKKMEFLKQKIEQELRVSCYMPANGETVTLPTSPSIPVGISLGLLK 468
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
E + P K R +HG L++KDS+ L+ ++A KE+G++ H +RFT V + D+
Sbjct: 469 REMAQ--GLLPEAKKPRLLHGTLIMKDSNFRLVSSEQALKELGLAEHQLRFTCRVHLHDT 526
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLL--SVERIDDKNKRVFVTWP 418
+T +Y L S L+D ++ L + S++++SVLL + D K + V+W
Sbjct: 527 RKEQETALRVYSHLKSVLKDHCVQH---LPDGSVTVESVLLQAAAPSEDPGTKVLLVSWT 583
Query: 419 NQDEEVGKIVLHVLK 433
QDEE+G + +LK
Sbjct: 584 YQDEELGSFLTSLLK 598
>gi|426327390|ref|XP_004024501.1| PREDICTED: integrator complex subunit 11 isoform 1 [Gorilla gorilla
gorilla]
Length = 600
Score = 579 bits (1492), Expect = e-162, Method: Compositional matrix adjust.
Identities = 270/435 (62%), Positives = 342/435 (78%), Gaps = 7/435 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+KVG++S+VYTGDYNMTPDRHLGAAWIDKCRP+LLITESTYATTIRDSKRCRERDFL
Sbjct: 163 MFQIKVGSESVVYTGDYNMTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFL 222
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHE V+RGGKVLIPVFALGRAQELCILLET+WERMNL+ PIYF+ GLTEKA +YYK+F
Sbjct: 223 KKVHETVERGGKVLIPVFALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLF 282
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
I WTNQKIRKTFVQRNMF+FKHI+ FD++F DNPGPMVVFATPGMLH+G SL IF+KWA
Sbjct: 283 IPWTNQKIRKTFVQRNMFEFKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAG 342
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPG+CVQGT+GHK+LSG +KLE E +Q+++VKM +EYMSFSAHADAKGIMQL+
Sbjct: 343 NEKNMVIMPGYCVQGTVGHKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLV 402
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
EP++VLLVHGEA KM FLK+KI+QE + C+MPANGE+ + T I + +S+ LLK
Sbjct: 403 GQAEPESVLLVHGEAKKMEFLKQKIEQELRVSCYMPANGETVTLPTSPSIPVGISLGLLK 462
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
E + P+ K R +HG L++KDS+ L+ ++A KE+G++ H +RFT V + D+
Sbjct: 463 REMAQ--GLLPDAKKPRLLHGTLIMKDSNFRLVSSEQALKELGLAEHQLRFTCRVHLHDT 520
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLL--SVERIDDKNKRVFVTWP 418
+T +Y L S L+D ++ L + S++++S+L+ + D K + V+W
Sbjct: 521 RKEQETALRVYSHLKSILKDHCVQH---LPDGSVTVESILIQAAAPSEDPGTKVLLVSWT 577
Query: 419 NQDEEVGKIVLHVLK 433
QDEE+G + +LK
Sbjct: 578 YQDEELGSFLTSLLK 592
>gi|426327394|ref|XP_004024503.1| PREDICTED: integrator complex subunit 11 isoform 3 [Gorilla gorilla
gorilla]
Length = 571
Score = 579 bits (1492), Expect = e-162, Method: Compositional matrix adjust.
Identities = 270/435 (62%), Positives = 342/435 (78%), Gaps = 7/435 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+KVG++S+VYTGDYNMTPDRHLGAAWIDKCRP+LLITESTYATTIRDSKRCRERDFL
Sbjct: 134 MFQIKVGSESVVYTGDYNMTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFL 193
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHE V+RGGKVLIPVFALGRAQELCILLET+WERMNL+ PIYF+ GLTEKA +YYK+F
Sbjct: 194 KKVHETVERGGKVLIPVFALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLF 253
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
I WTNQKIRKTFVQRNMF+FKHI+ FD++F DNPGPMVVFATPGMLH+G SL IF+KWA
Sbjct: 254 IPWTNQKIRKTFVQRNMFEFKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAG 313
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPG+CVQGT+GHK+LSG +KLE E +Q+++VKM +EYMSFSAHADAKGIMQL+
Sbjct: 314 NEKNMVIMPGYCVQGTVGHKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLV 373
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
EP++VLLVHGEA KM FLK+KI+QE + C+MPANGE+ + T I + +S+ LLK
Sbjct: 374 GQAEPESVLLVHGEAKKMEFLKQKIEQELRVSCYMPANGETVTLPTSPSIPVGISLGLLK 433
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
E + P+ K R +HG L++KDS+ L+ ++A KE+G++ H +RFT V + D+
Sbjct: 434 REMAQ--GLLPDAKKPRLLHGTLIMKDSNFRLVSSEQALKELGLAEHQLRFTCRVHLHDT 491
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLL--SVERIDDKNKRVFVTWP 418
+T +Y L S L+D ++ L + S++++S+L+ + D K + V+W
Sbjct: 492 RKEQETALRVYSHLKSILKDHCVQH---LPDGSVTVESILIQAAAPSEDPGTKVLLVSWT 548
Query: 419 NQDEEVGKIVLHVLK 433
QDEE+G + +LK
Sbjct: 549 YQDEELGSFLTSLLK 563
>gi|343958192|dbj|BAK62951.1| protein related to CPSF subunits 68 kDa [Pan troglodytes]
Length = 600
Score = 578 bits (1490), Expect = e-162, Method: Compositional matrix adjust.
Identities = 271/435 (62%), Positives = 341/435 (78%), Gaps = 7/435 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+KVG++S+VYTGDYNMTPDRHLGAAWIDKCRP+LLITESTYATTIRDSKRCRERDFL
Sbjct: 163 MFQIKVGSESVVYTGDYNMTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFL 222
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHE V+RGGKVLIPVFALGRAQELCILLET+WERMNL+ PIYF+ GLTEKA +YYK+F
Sbjct: 223 KKVHETVERGGKVLIPVFALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLF 282
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
I WTNQKIRKTFVQRNMF+FKHI+ FD++F DNPGPMVVFATPGMLH+G SL IF+KWA
Sbjct: 283 IPWTNQKIRKTFVQRNMFEFKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAG 342
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPG+CVQGT+GH++LSG +KLE E +Q+++VKM +EYMSFSAHADAKGIMQL+
Sbjct: 343 NEKNMVIMPGYCVQGTVGHEILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLV 402
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
EP++VLLVHGEA KM FLK+KI+QE + C+MPANGE+ + T I + +S+ LLK
Sbjct: 403 GQAEPESVLLVHGEAKKMEFLKQKIEQELRVSCYMPANGETVTLPTSPSIPVGISLGLLK 462
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
E + P K R +HG L++KDS+ L+ ++A KE+G++ H +RFT V + D+
Sbjct: 463 REMAQ--GLLPEAKKPRLLHGTLIMKDSNFRLVSSEQALKELGLAEHQLRFTCRVHLHDT 520
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLL--SVERIDDKNKRVFVTWP 418
+T +Y L S L+D ++ L + S++++SVLL + D K + V+W
Sbjct: 521 RKEQETALRVYSHLKSVLKDHCVQH---LPDGSVTVESVLLQAAAPSEDPGTKVLLVSWT 577
Query: 419 NQDEEVGKIVLHVLK 433
QDEE+G + +LK
Sbjct: 578 YQDEELGSFLTSLLK 592
>gi|118572558|sp|Q5NVE6.2|INT11_PONAB RecName: Full=Integrator complex subunit 11; Short=Int11; AltName:
Full=Cleavage and polyadenylation-specific factor 3-like
protein; Short=CPSF3-like protein
Length = 600
Score = 578 bits (1490), Expect = e-162, Method: Compositional matrix adjust.
Identities = 270/435 (62%), Positives = 341/435 (78%), Gaps = 7/435 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+KVG++S+VYTGDYNMTPDRHLGAAWIDKCRP+LLITESTYATTIRDSKRCRERDFL
Sbjct: 163 MFQIKVGSESVVYTGDYNMTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFL 222
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHE V+RGGKVLIPVFALGRAQELCILLET+WERMNL+ PIYF+ GLTEKA +YYK+F
Sbjct: 223 KKVHETVERGGKVLIPVFALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLF 282
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
I WTNQKIRKTFVQRNMF+FKHI+ FD++F DNPGPMVVFATPGMLH+G SL IF+KWA
Sbjct: 283 IPWTNQKIRKTFVQRNMFEFKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAG 342
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPG+CVQGT+GHK+LSG +KLE E +Q+++VKM +EYMSFSAHADAKGIMQL+
Sbjct: 343 NEKNMVIMPGYCVQGTVGHKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLV 402
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
EP++VLLVHGEA KM FLK+KI+QE + C+MPANGE+ + T I + +S+ LLK
Sbjct: 403 GQAEPESVLLVHGEAKKMEFLKQKIEQELRVSCYMPANGETVTLPTSPSIPVGISLGLLK 462
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
E + P K R +HG L++KDS+ L+ ++A KE+G++ H +RFT V + D+
Sbjct: 463 REMAQ--GLLPEAKKPRLLHGTLIMKDSNFRLVSSEQALKELGLAEHQLRFTCRVHLHDT 520
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLL--SVERIDDKNKRVFVTWP 418
+T +Y L S L+D ++ L + S++++S+L+ + D K + V+W
Sbjct: 521 RKEQETALRVYSHLKSILKDHCVQH---LPDGSVTVESILIQAAAPSEDPGTKVLLVSWT 577
Query: 419 NQDEEVGKIVLHVLK 433
QDEE+G + +LK
Sbjct: 578 YQDEELGSFLTSLLK 592
>gi|426327392|ref|XP_004024502.1| PREDICTED: integrator complex subunit 11 isoform 2 [Gorilla gorilla
gorilla]
Length = 606
Score = 578 bits (1489), Expect = e-162, Method: Compositional matrix adjust.
Identities = 270/435 (62%), Positives = 342/435 (78%), Gaps = 7/435 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+KVG++S+VYTGDYNMTPDRHLGAAWIDKCRP+LLITESTYATTIRDSKRCRERDFL
Sbjct: 169 MFQIKVGSESVVYTGDYNMTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFL 228
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHE V+RGGKVLIPVFALGRAQELCILLET+WERMNL+ PIYF+ GLTEKA +YYK+F
Sbjct: 229 KKVHETVERGGKVLIPVFALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLF 288
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
I WTNQKIRKTFVQRNMF+FKHI+ FD++F DNPGPMVVFATPGMLH+G SL IF+KWA
Sbjct: 289 IPWTNQKIRKTFVQRNMFEFKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAG 348
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPG+CVQGT+GHK+LSG +KLE E +Q+++VKM +EYMSFSAHADAKGIMQL+
Sbjct: 349 NEKNMVIMPGYCVQGTVGHKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLV 408
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
EP++VLLVHGEA KM FLK+KI+QE + C+MPANGE+ + T I + +S+ LLK
Sbjct: 409 GQAEPESVLLVHGEAKKMEFLKQKIEQELRVSCYMPANGETVTLPTSPSIPVGISLGLLK 468
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
E + P+ K R +HG L++KDS+ L+ ++A KE+G++ H +RFT V + D+
Sbjct: 469 REMAQ--GLLPDAKKPRLLHGTLIMKDSNFRLVSSEQALKELGLAEHQLRFTCRVHLHDT 526
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLL--SVERIDDKNKRVFVTWP 418
+T +Y L S L+D ++ L + S++++S+L+ + D K + V+W
Sbjct: 527 RKEQETALRVYSHLKSILKDHCVQH---LPDGSVTVESILIQAAAPSEDPGTKVLLVSWT 583
Query: 419 NQDEEVGKIVLHVLK 433
QDEE+G + +LK
Sbjct: 584 YQDEELGSFLTSLLK 598
>gi|403297738|ref|XP_003939709.1| PREDICTED: integrator complex subunit 11 isoform 1 [Saimiri
boliviensis boliviensis]
Length = 600
Score = 578 bits (1489), Expect = e-162, Method: Compositional matrix adjust.
Identities = 269/435 (61%), Positives = 341/435 (78%), Gaps = 7/435 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+KVG++S+VYTGDYNMTPDRHLGAAWIDKCRP+LLITESTYATTIRDSKRCRERDFL
Sbjct: 163 MFQIKVGSESVVYTGDYNMTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFL 222
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHE V+ GGKVLIPVFALGRAQELCILLET+WERMNL+ PIYF+ GLTEKA +YYK+F
Sbjct: 223 KKVHETVEHGGKVLIPVFALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLF 282
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
I WTNQKIRKTFVQRNMF+FKHI+ FD++F DNPGPMVVFATPGMLH+G SL IF+KWA
Sbjct: 283 IPWTNQKIRKTFVQRNMFEFKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAG 342
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPG+CVQGT+GHK+LSG +KLE E +Q+++VKM +EYMSFSAHADAKGIMQL+
Sbjct: 343 NEKNMVIMPGYCVQGTVGHKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLV 402
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
EP+NVLLVHGEA KM FLK+KI+QE + C+MPANGE+ + T + I + +S+ LLK
Sbjct: 403 GQAEPENVLLVHGEAKKMEFLKQKIEQELRVSCYMPANGETVTLPTSLSIPVGISLGLLK 462
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
E + P K R +HG L++KDS+ L+ ++A KE+G++ H +RFT V + D+
Sbjct: 463 REMAQ--GLLPEAKKPRLLHGTLIMKDSNFRLVSSEQALKELGLAEHQLRFTCRVHLHDT 520
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLL--SVERIDDKNKRVFVTWP 418
+T +Y L S L++ ++ L + S++++S+L+ + D K + V+W
Sbjct: 521 RKEQETALRVYSHLKSILKEHCVQH---LPDGSVTVESILIQAAAPSEDPGTKVLLVSWT 577
Query: 419 NQDEEVGKIVLHVLK 433
QDEE+G + +LK
Sbjct: 578 YQDEELGSFLTSLLK 592
>gi|402852595|ref|XP_003891003.1| PREDICTED: integrator complex subunit 11 isoform 2 [Papio anubis]
Length = 499
Score = 578 bits (1489), Expect = e-162, Method: Compositional matrix adjust.
Identities = 269/435 (61%), Positives = 340/435 (78%), Gaps = 7/435 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+KVG++S+VYTGDYNMTPDRHLGAAWIDKCRP+LLITESTYATTIRDSKRCRERDFL
Sbjct: 62 MFQIKVGSESVVYTGDYNMTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFL 121
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHE V+RGGKVLIPVFALGRAQELCILLET+WERMNL+ PIYF+ GLTEKA +YYK+F
Sbjct: 122 KKVHETVERGGKVLIPVFALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLF 181
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
I WTNQKIRKTFVQRNMF+FKHI+ FD++F DNPGPMVVFATPGMLH+G SL IF+KWA
Sbjct: 182 IPWTNQKIRKTFVQRNMFEFKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAG 241
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPG+CVQGT+GHK+LSG +KLE E +Q+++VKM +EYMSFSAHADAKGIMQL+
Sbjct: 242 NEKNMVIMPGYCVQGTVGHKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLV 301
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
EP++VLLVHGEA KM FLK+KI+QE + C+MPANGE+ + T I + +S+ LLK
Sbjct: 302 GQAEPESVLLVHGEAKKMEFLKQKIEQELRVSCYMPANGETVTLPTSPSIPVGISLGLLK 361
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
E + P K R +HG L++KDS+ L+ ++A KE+G++ H +RFT V + D+
Sbjct: 362 REMAQ--GLLPEAKKPRLLHGTLIMKDSNFRLVSSEQALKELGLAEHQLRFTCRVHLHDT 419
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLL--SVERIDDKNKRVFVTWP 418
+ +Y L S L+D ++ L + S++++S+L+ + D K + V+W
Sbjct: 420 RKEQEMALRVYSHLKSVLKDHCVQH---LPDGSVTVESILIQAAAPSEDPGTKVLLVSWT 476
Query: 419 NQDEEVGKIVLHVLK 433
QDEE+G + +LK
Sbjct: 477 YQDEELGSFLTSLLK 491
>gi|193786492|dbj|BAG51775.1| unnamed protein product [Homo sapiens]
Length = 606
Score = 577 bits (1488), Expect = e-162, Method: Compositional matrix adjust.
Identities = 271/435 (62%), Positives = 341/435 (78%), Gaps = 7/435 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+KVG++S+VYTGDYNMTPDRHLGAAWIDKCRP+LLITESTYATTIRDSKRCRERDFL
Sbjct: 169 MFQIKVGSESVVYTGDYNMTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFL 228
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHE V+RGGKVLIPVFALGRAQELCILLET+WERMNL+ PIYF+ GLTEKA +YYK+F
Sbjct: 229 KKVHETVERGGKVLIPVFALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLF 288
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
I WTNQKIRKTFVQRNMF+FKHI+ FD++F DNPGPMVVFATPGMLH+G SL IF+KWA
Sbjct: 289 IPWTNQKIRKTFVQRNMFEFKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAG 348
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPG+CVQGT+GHK+LSG +KLE E +Q+++VKM +EYMSFSAHADAKGIMQL+
Sbjct: 349 NEKNMVIMPGYCVQGTVGHKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLV 408
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
EP++VLLVHGEA KM FLK+KI+QE ++C+MP NGE+ + T I + +S+ LLK
Sbjct: 409 GQAEPESVLLVHGEAKKMEFLKQKIEQELRVNCYMPVNGETVTLPTSPSIPVGISLGLLK 468
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
E + P K R +HG L++KDS+ L+ ++A KE+G++ H +RFT V + D+
Sbjct: 469 REMAQ--GLLPEAKKPRLLHGTLIMKDSNFRLVSSEQALKELGLAEHQLRFTCRVHLHDT 526
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLL--SVERIDDKNKRVFVTWP 418
+T +Y L S L+D ++ L + S++++SVLL + D K + V+W
Sbjct: 527 RKEQETALRVYSHLKSVLKDHCVQH---LPDGSVTVESVLLQAAAPSEDPGTKVLLVSWT 583
Query: 419 NQDEEVGKIVLHVLK 433
QDEE+G + +LK
Sbjct: 584 YQDEELGSFLTSLLK 598
>gi|297279172|ref|XP_001092173.2| PREDICTED: integrator complex subunit 11 isoform 3 [Macaca mulatta]
Length = 579
Score = 577 bits (1487), Expect = e-161, Method: Compositional matrix adjust.
Identities = 269/435 (61%), Positives = 340/435 (78%), Gaps = 7/435 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+KVG++S+VYTGDYNMTPDRHLGAAWIDKCRP+LLITESTYATTIRDSKRCRERDFL
Sbjct: 142 MFQIKVGSESVVYTGDYNMTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFL 201
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHE V+RGGKVLIPVFALGRAQELCILLET+WERMNL+ PIYF+ GLTEKA +YYK+F
Sbjct: 202 KKVHETVERGGKVLIPVFALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLF 261
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
I WTNQKIRKTFVQRNMF+FKHI+ FD++F DNPGPMVVFATPGMLH+G SL IF+KWA
Sbjct: 262 IPWTNQKIRKTFVQRNMFEFKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAG 321
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPG+CVQGT+GHK+LSG +KLE E +Q+++VKM +EYMSFSAHADAKGIMQL+
Sbjct: 322 NEKNMVIMPGYCVQGTVGHKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLV 381
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
EP++VLLVHGEA KM FLK+KI+QE + C+MPANGE+ + T I + +S+ LLK
Sbjct: 382 GQAEPESVLLVHGEAKKMEFLKQKIEQELRVSCYMPANGETVTLPTSPSIPVGISLGLLK 441
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
E + P K R +HG L++KDS+ L+ ++A KE+G++ H +RFT V + D+
Sbjct: 442 REMAQ--GLLPEAKKPRLLHGTLIMKDSNFRLVSSEQALKELGLAEHQLRFTCRVHLHDT 499
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLL--SVERIDDKNKRVFVTWP 418
+ +Y L S L+D ++ L + S++++S+L+ + D K + V+W
Sbjct: 500 RKEQEMALRVYSHLKSVLKDHCVQH---LPDGSVTVESILIQAAAPSEDPGTKVLLVSWT 556
Query: 419 NQDEEVGKIVLHVLK 433
QDEE+G + +LK
Sbjct: 557 YQDEELGSFLTSLLK 571
>gi|351697497|gb|EHB00416.1| Integrator complex subunit 11 [Heterocephalus glaber]
Length = 672
Score = 577 bits (1487), Expect = e-161, Method: Compositional matrix adjust.
Identities = 269/435 (61%), Positives = 341/435 (78%), Gaps = 7/435 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+KVG++S+VYTGDYNMTPDRHLGAAWIDKCRP+LLITESTYATTIRDSKRCRERDFL
Sbjct: 236 MFQIKVGSESVVYTGDYNMTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFL 295
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHE V+RGGKVLIPVFALGRAQELCILLET+WERMNL+ PIYF+ GLTEKA +YYK+F
Sbjct: 296 KKVHETVERGGKVLIPVFALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLF 355
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
ITWTNQKIRKTFVQRNMF+FKHI+ FD++F DNPGPMVVFATPGMLH+G SL IF+KWA
Sbjct: 356 ITWTNQKIRKTFVQRNMFEFKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAG 415
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPG+CVQGT+GHK+LSG +KLE E +Q+++VKM +EYMSFSAHADAKGIMQL+
Sbjct: 416 NEKNMVIMPGYCVQGTVGHKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLV 475
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
EP++VLLVHGEA KM FL++KI+QE + C+MPANGE+ + T I + +S+ LLK
Sbjct: 476 GQAEPESVLLVHGEAKKMGFLRQKIEQELRVSCYMPANGETVTLPTSPSIPVGISLGLLK 535
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
E + P K R +HG L++KDS+ L+ ++A KE+G++ H +RFT V + D+
Sbjct: 536 REMAQ--GLLPEAKKPRLLHGTLIMKDSNFRLVSSEQALKELGLAEHQLRFTCRVHLQDT 593
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLL--SVERIDDKNKRVFVTWP 418
+ +Y L S L+D ++ L + S++++S+L+ + D K + V+W
Sbjct: 594 RKEQEMALRVYSHLKSTLKDHCVQH---LPDGSVTVESILIQAAAHSEDPGAKVLLVSWT 650
Query: 419 NQDEEVGKIVLHVLK 433
QDEE+G + +LK
Sbjct: 651 YQDEELGSFLTALLK 665
>gi|197099184|ref|NP_001124760.1| integrator complex subunit 11 [Pongo abelii]
gi|55725797|emb|CAH89679.1| hypothetical protein [Pongo abelii]
Length = 655
Score = 577 bits (1486), Expect = e-161, Method: Compositional matrix adjust.
Identities = 269/435 (61%), Positives = 340/435 (78%), Gaps = 7/435 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+KVG++S+VYTGDYNMTPDRHLGAAWIDKCRP+LLITESTYATTIRDSKRCRERDFL
Sbjct: 163 MFQIKVGSESVVYTGDYNMTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFL 222
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHE V+RGGKVLIPVFALGRAQELCILLET+WERMNL+ PIYF+ GLTEKA +YYK+F
Sbjct: 223 KKVHETVERGGKVLIPVFALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLF 282
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
I WTNQKIRKTFVQRNMF+FKHI+ FD++F DNPGPMVVFATPGMLH+G SL IF+KWA
Sbjct: 283 IPWTNQKIRKTFVQRNMFEFKHIKAFDRAFTDNPGPMVVFATPGMLHAGQSLQIFRKWAG 342
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPG+CVQGT+GHK+LSG +KLE E +Q+++VKM +EYMSFSAHADAKGIMQL+
Sbjct: 343 NEKNMVIMPGYCVQGTVGHKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLV 402
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
EP++VLLVHGEA KM F K+KI+QE + C+MPANGE+ + T I + +S+ LLK
Sbjct: 403 GQAEPESVLLVHGEAKKMEFQKQKIEQELRVSCYMPANGETVTLPTSPSIPVGISLGLLK 462
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
E + P K R +HG L++KDS+ L+ ++A KE+G++ H +RFT V + D+
Sbjct: 463 REMAQ--GLLPEAKKPRLLHGTLIMKDSNFRLVSSEQALKELGLAEHQLRFTCRVHLHDT 520
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLL--SVERIDDKNKRVFVTWP 418
+T +Y L S L+D ++ L + S++++S+L+ + D K + V+W
Sbjct: 521 RKEQETALRVYSHLKSILKDHCVQH---LPDGSVTVESILIQAAAPSEDPGTKVLLVSWT 577
Query: 419 NQDEEVGKIVLHVLK 433
QDEE+G + +LK
Sbjct: 578 YQDEELGSFLTSLLK 592
>gi|391330858|ref|XP_003739869.1| PREDICTED: integrator complex subunit 11-like [Metaseiulus
occidentalis]
Length = 601
Score = 576 bits (1484), Expect = e-161, Method: Compositional matrix adjust.
Identities = 267/427 (62%), Positives = 331/427 (77%), Gaps = 4/427 (0%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +KVG+ SIVYTGDYNMTPDRHLGAAWID+CRPDLLITESTYATTIRDSKRCRERDFL
Sbjct: 163 MFHIKVGHISIVYTGDYNMTPDRHLGAAWIDRCRPDLLITESTYATTIRDSKRCRERDFL 222
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KVH+C++RGGKVLIP FALGRAQELCILLETYWERMNL+ PIYFA GLTEKATNYYKMF
Sbjct: 223 NKVHDCIERGGKVLIPAFALGRAQELCILLETYWERMNLKCPIYFAAGLTEKATNYYKMF 282
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
ITWTNQKIR TFV NMFDFKHI+PFD+++IDNPGPMVVFATPGMLH+GLSL IFKKWAP
Sbjct: 283 ITWTNQKIRNTFVDHNMFDFKHIKPFDRAYIDNPGPMVVFATPGMLHAGLSLQIFKKWAP 342
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPG+CV GT+G K+L+G +++E E IDVK+++EY+SFSAHADAKGIMQLI
Sbjct: 343 FEENMVIMPGYCVSGTVGAKILNGARRVEIEKGNFIDVKLSVEYLSFSAHADAKGIMQLI 402
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
+ CEPKNVLLVHGE KM FLK KI++EF + C+MPANGE+ + I +DV + LLK
Sbjct: 403 RQCEPKNVLLVHGEGDKMKFLKGKIEEEFKIPCYMPANGETTLISVPEHIVLDVDLPLLK 462
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
+ +E + K +QIH +++ D+ + L++ ++A E+G+ H +RFT+ + I ++
Sbjct: 463 RALPQ--AEIVSEKKSKQIHAAILMYDNVMKLVEPEDALNELGVREHQIRFTTTISIPET 520
Query: 361 --SPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLLSVERIDDKNKRVFVTWP 418
K EL+ D + L QE L + SLS+ S ++ V +D K V V+W
Sbjct: 521 FQGSTAKLAELVQDLVKDKLPSDDQETMQMLQDGSLSLGSAIVRVNGFEDDMKSVCVSWA 580
Query: 419 NQDEEVG 425
N+DE++G
Sbjct: 581 NEDEDLG 587
>gi|380798915|gb|AFE71333.1| integrator complex subunit 11 isoform 2, partial [Macaca mulatta]
Length = 588
Score = 576 bits (1484), Expect = e-161, Method: Compositional matrix adjust.
Identities = 269/435 (61%), Positives = 340/435 (78%), Gaps = 7/435 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+KVG++S+VYTGDYNMTPDRHLGAAWIDKCRP+LLITESTYATTIRDSKRCRERDFL
Sbjct: 151 MFQIKVGSESVVYTGDYNMTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFL 210
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHE V+RGGKVLIPVFALGRAQELCILLET+WERMNL+ PIYF+ GLTEKA +YYK+F
Sbjct: 211 KKVHETVERGGKVLIPVFALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLF 270
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
I WTNQKIRKTFVQRNMF+FKHI+ FD++F DNPGPMVVFATPGMLH+G SL IF+KWA
Sbjct: 271 IPWTNQKIRKTFVQRNMFEFKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAG 330
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPG+CVQGT+GHK+LSG +KLE E +Q+++VKM +EYMSFSAHADAKGIMQL+
Sbjct: 331 NEKNMVIMPGYCVQGTVGHKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLV 390
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
EP++VLLVHGEA KM FLK+KI+QE + C+MPANGE+ + T I + +S+ LLK
Sbjct: 391 GQAEPESVLLVHGEAKKMEFLKQKIEQELRVSCYMPANGETVTLPTSPSIPVGISLGLLK 450
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
E + P K R +HG L++KDS+ L+ ++A KE+G++ H +RFT V + D+
Sbjct: 451 REMAQ--GLLPEAKKPRLLHGTLIMKDSNFRLVSSEQALKELGLAEHQLRFTCRVHLHDT 508
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLL--SVERIDDKNKRVFVTWP 418
+ +Y L S L+D ++ L + S++++S+L+ + D K + V+W
Sbjct: 509 RKEQEMALRVYSHLKSVLKDHCVQH---LPDGSVTVESILIQAAAPSEDPGTKVLLVSWT 565
Query: 419 NQDEEVGKIVLHVLK 433
QDEE+G + +LK
Sbjct: 566 YQDEELGSFLTSLLK 580
>gi|402852593|ref|XP_003891002.1| PREDICTED: integrator complex subunit 11 isoform 1 [Papio anubis]
gi|355557446|gb|EHH14226.1| hypothetical protein EGK_00111 [Macaca mulatta]
gi|387540112|gb|AFJ70683.1| integrator complex subunit 11 [Macaca mulatta]
Length = 600
Score = 576 bits (1484), Expect = e-161, Method: Compositional matrix adjust.
Identities = 269/435 (61%), Positives = 340/435 (78%), Gaps = 7/435 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+KVG++S+VYTGDYNMTPDRHLGAAWIDKCRP+LLITESTYATTIRDSKRCRERDFL
Sbjct: 163 MFQIKVGSESVVYTGDYNMTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFL 222
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHE V+RGGKVLIPVFALGRAQELCILLET+WERMNL+ PIYF+ GLTEKA +YYK+F
Sbjct: 223 KKVHETVERGGKVLIPVFALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLF 282
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
I WTNQKIRKTFVQRNMF+FKHI+ FD++F DNPGPMVVFATPGMLH+G SL IF+KWA
Sbjct: 283 IPWTNQKIRKTFVQRNMFEFKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAG 342
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPG+CVQGT+GHK+LSG +KLE E +Q+++VKM +EYMSFSAHADAKGIMQL+
Sbjct: 343 NEKNMVIMPGYCVQGTVGHKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLV 402
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
EP++VLLVHGEA KM FLK+KI+QE + C+MPANGE+ + T I + +S+ LLK
Sbjct: 403 GQAEPESVLLVHGEAKKMEFLKQKIEQELRVSCYMPANGETVTLPTSPSIPVGISLGLLK 462
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
E + P K R +HG L++KDS+ L+ ++A KE+G++ H +RFT V + D+
Sbjct: 463 REMAQ--GLLPEAKKPRLLHGTLIMKDSNFRLVSSEQALKELGLAEHQLRFTCRVHLHDT 520
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLL--SVERIDDKNKRVFVTWP 418
+ +Y L S L+D ++ L + S++++S+L+ + D K + V+W
Sbjct: 521 RKEQEMALRVYSHLKSVLKDHCVQH---LPDGSVTVESILIQAAAPSEDPGTKVLLVSWT 577
Query: 419 NQDEEVGKIVLHVLK 433
QDEE+G + +LK
Sbjct: 578 YQDEELGSFLTSLLK 592
>gi|207079923|ref|NP_001128922.1| DKFZP459J1110 protein [Pongo abelii]
gi|56403907|emb|CAI29738.1| hypothetical protein [Pongo abelii]
Length = 600
Score = 575 bits (1483), Expect = e-161, Method: Compositional matrix adjust.
Identities = 269/435 (61%), Positives = 340/435 (78%), Gaps = 7/435 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+KVG++S+VYTGDYNMTPDRHLGAAWIDKCRP+LLIT STYATTIRDSKRCRERDFL
Sbjct: 163 MFQIKVGSESVVYTGDYNMTPDRHLGAAWIDKCRPNLLITGSTYATTIRDSKRCRERDFL 222
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHE V+RGGKVLIPVFALGRAQELCILLET+WERMNL+ PIYF+ GLTEKA +YYK+F
Sbjct: 223 KKVHETVERGGKVLIPVFALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLF 282
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
I WTNQKIRKTFVQRNMF+FKHI+ FD++F DNPGPMVVFATPGMLH+G SL IF+KWA
Sbjct: 283 IPWTNQKIRKTFVQRNMFEFKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAG 342
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPG+CVQGT+GHK+LSG +KLE E +Q+++VKM +EYMSFSAHADAKGIMQL+
Sbjct: 343 NEKNMVIMPGYCVQGTVGHKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLV 402
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
EP++VLLVHGEA KM FLK+KI+QE + C+MPANGE+ + T I + +S+ LLK
Sbjct: 403 GQAEPESVLLVHGEAKKMEFLKQKIEQELRVSCYMPANGETVTLPTSPSIPVGISLGLLK 462
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
E + P K R +HG L++KDS+ L+ ++A KE+G++ H +RFT V + D+
Sbjct: 463 REMAQ--GLLPEAKKPRLLHGTLIMKDSNFRLVSSEQALKELGLAEHQLRFTCRVHLHDT 520
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLL--SVERIDDKNKRVFVTWP 418
+T +Y L S L+D ++ L + S++++S+L+ + D K + V+W
Sbjct: 521 RKEQETALRVYSHLKSILKDHCVQH---LPDGSVTVESILIQAAAPSEDPGTKVLLVSWT 577
Query: 419 NQDEEVGKIVLHVLK 433
QDEE+G + +LK
Sbjct: 578 YQDEELGSFLTSLLK 592
>gi|119576641|gb|EAW56237.1| cleavage and polyadenylation specific factor 3-like, isoform CRA_e
[Homo sapiens]
Length = 578
Score = 575 bits (1483), Expect = e-161, Method: Compositional matrix adjust.
Identities = 270/433 (62%), Positives = 340/433 (78%), Gaps = 7/433 (1%)
Query: 3 QVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFLKK 62
Q+KVG++S+VYTGDYNMTPDRHLGAAWIDKCRP+LLITESTYATTIRDSKRCRERDFLKK
Sbjct: 143 QIKVGSESVVYTGDYNMTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFLKK 202
Query: 63 VHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMFIT 122
VHE V+RGGKVLIPVFALGRAQELCILLET+WERMNL+ PIYF+ GLTEKA +YYK+FI
Sbjct: 203 VHETVERGGKVLIPVFALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLFIP 262
Query: 123 WTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAPVE 182
WTNQKIRKTFVQRNMF+FKHI+ FD++F DNPGPMVVFATPGMLH+G SL IF+KWA E
Sbjct: 263 WTNQKIRKTFVQRNMFEFKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAGNE 322
Query: 183 NNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLIQY 242
NM+IMPG+CVQGT+GHK+LSG +KLE E +Q+++VKM +EYMSFSAHADAKGIMQL+
Sbjct: 323 KNMVIMPGYCVQGTVGHKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLVGQ 382
Query: 243 CEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLKEE 302
EP++VLLVHGEA KM FLK+KI+QE ++C+MPANGE+ + T I + +S+ LLK E
Sbjct: 383 AEPESVLLVHGEAKKMEFLKQKIEQELRVNCYMPANGETVTLPTSPSIPVGISLGLLKRE 442
Query: 303 AVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDSSP 362
+ P K R +HG L++KDS+ L+ ++A KE+G++ H +RFT V + D+
Sbjct: 443 MAQ--GLLPEAKKPRLLHGTLIMKDSNFRLVSSEQALKELGLAEHQLRFTCRVHLHDTRK 500
Query: 363 INKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLL--SVERIDDKNKRVFVTWPNQ 420
+T +Y L S L+D ++ L + S++++SVLL + D K + V+W Q
Sbjct: 501 EQETALRVYSHLKSVLKDHCVQH---LPDGSVTVESVLLQAAAPSEDPGTKVLLVSWTYQ 557
Query: 421 DEEVGKIVLHVLK 433
DEE+G + +LK
Sbjct: 558 DEELGSFLTSLLK 570
>gi|56403864|emb|CAI29717.1| hypothetical protein [Pongo abelii]
Length = 600
Score = 575 bits (1481), Expect = e-161, Method: Compositional matrix adjust.
Identities = 269/435 (61%), Positives = 340/435 (78%), Gaps = 7/435 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+KVG++S+VYTGDYNMTPDRHLGAAWIDKCRP+LLITESTYATTIRDSKRCRERDFL
Sbjct: 163 MFQIKVGSESVVYTGDYNMTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFL 222
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHE V+RGGKVLIPVFALGRAQELCILLET+WERMNL+ PIYF+ GLTEKA +YYK+F
Sbjct: 223 KKVHETVERGGKVLIPVFALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLF 282
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
I WTNQKIRKT VQRNMF+FKHI+ FD++F DNPGPMVVFATPGMLH+G SL IF+KWA
Sbjct: 283 IPWTNQKIRKTSVQRNMFEFKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAG 342
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPG+CVQGT+GHK+LSG +KLE E +Q+++VKM +EYMSFSAHADAKGIMQL+
Sbjct: 343 NEKNMVIMPGYCVQGTVGHKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLV 402
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
EP++VLLVHGEA KM FLK+KI+QE + C+MPANGE+ + T I + +S+ LLK
Sbjct: 403 GQAEPESVLLVHGEAKKMEFLKQKIEQELRVSCYMPANGETVTLPTSPSIPVGISLGLLK 462
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
E + P K R +HG L++KDS+ L+ ++A KE+G++ H +RFT V + D+
Sbjct: 463 REMAQ--GLLPEAKKPRLLHGTLIMKDSNFRLVSSEQALKELGLAEHQLRFTCRVHLHDT 520
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLL--SVERIDDKNKRVFVTWP 418
+T +Y L S L+D ++ L + S++++S+L+ + D K + V+W
Sbjct: 521 RKEQETALRVYSHLKSILKDHCVQH---LPDGSVTVESILIQAAAPSEDPGTKVLLVSWT 577
Query: 419 NQDEEVGKIVLHVLK 433
QDEE+G + +LK
Sbjct: 578 YQDEELGSFLTSLLK 592
>gi|39645207|gb|AAH13904.2| CPSF3L protein, partial [Homo sapiens]
Length = 429
Score = 569 bits (1466), Expect = e-159, Method: Compositional matrix adjust.
Identities = 266/426 (62%), Positives = 333/426 (78%), Gaps = 7/426 (1%)
Query: 10 SIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFLKKVHECVDR 69
S+VYTGDYNMTPDRHLGAAWIDKCRP+LLITESTYATTIRDSKRCRERDFLKKVHE V+R
Sbjct: 1 SVVYTGDYNMTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFLKKVHETVER 60
Query: 70 GGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMFITWTNQKIR 129
GGKVLIPVFALGRAQELCILLET+WERMNL+ PIYF+ GLTEKA +YYK+FI WTNQKIR
Sbjct: 61 GGKVLIPVFALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLFIPWTNQKIR 120
Query: 130 KTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAPVENNMLIMP 189
KTFVQRNMF+FKHI+ FD++F DNPGPMVVFATPGMLH+G SL IF+KWA E NM+IMP
Sbjct: 121 KTFVQRNMFEFKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIMP 180
Query: 190 GFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVL 249
G+CVQGT+GHK+LSG +KLE E +Q+++VKM +EYMSFSAHADAKGIMQL+ EP++VL
Sbjct: 181 GYCVQGTVGHKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLVGQAEPESVL 240
Query: 250 LVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLKEEAVKYNSE 309
LVHGEA KM FLK+KI+QE ++C+MPANGE+ + T I + +S+ LLK E +
Sbjct: 241 LVHGEAKKMEFLKQKIEQELRVNCYMPANGETVTLPTSPSIPVGISLGLLKREMAQ--GL 298
Query: 310 PPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDSSPINKTLEL 369
P K R +HG L++KDS+ L+ ++A KE+G++ H +RFT V + D+ +T
Sbjct: 299 LPEAKKPRLLHGTLIMKDSNFRLVSSEQALKELGLAEHQLRFTCRVHLHDTRKEQETALR 358
Query: 370 IYDQLLSYLQDKSQEYKIQLTESSLSIDSVLL--SVERIDDKNKRVFVTWPNQDEEVGKI 427
+Y L S L+D ++ L + S++++SVLL + D K + V+W QDEE+G
Sbjct: 359 VYSHLKSVLKDHCVQH---LPDGSVTVESVLLQAAAPSEDPGTKVLLVSWTYQDEELGSF 415
Query: 428 VLHVLK 433
+ +LK
Sbjct: 416 LTSLLK 421
>gi|417403209|gb|JAA48422.1| Putative mrna cleavage and polyadenylation factor ii complex brr5
cpsf subunit [Desmodus rotundus]
Length = 604
Score = 563 bits (1451), Expect = e-157, Method: Compositional matrix adjust.
Identities = 266/436 (61%), Positives = 339/436 (77%), Gaps = 8/436 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+KVG++S+VYTGDYNMTPDRHLGAAWIDKCRP LLITESTYATTIRDSKRCRERDFL
Sbjct: 163 MFQIKVGSESVVYTGDYNMTPDRHLGAAWIDKCRPTLLITESTYATTIRDSKRCRERDFL 222
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHE V+RGGKVLIPVFALGRAQELCILLET+WERMNL+APIYF+ GLTEKA +YYK+F
Sbjct: 223 KKVHETVERGGKVLIPVFALGRAQELCILLETFWERMNLKAPIYFSTGLTEKANHYYKLF 282
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFID-NPGPMVVFATPGMLHSGLSLIIFKKWA 179
ITWTNQKIRKTFVQRNMF+FKHI+ FD++ +P MVVFATPGMLH+G SL IF+KWA
Sbjct: 283 ITWTNQKIRKTFVQRNMFEFKHIKAFDRAXXXAHPCAMVVFATPGMLHAGQSLQIFRKWA 342
Query: 180 PVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQL 239
E NM+IMPG+CVQGT+GHK+LSG +KLE E +Q+++V+M +EYMSFSAHADAKGIMQL
Sbjct: 343 GNEKNMVIMPGYCVQGTVGHKILSGQRKLEMEGRQVLEVRMQVEYMSFSAHADAKGIMQL 402
Query: 240 IQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLL 299
+ EP++VLLVHGEA KM FLK+KI+QEF + C+MPANGE+ + T I + +S+ LL
Sbjct: 403 VGQAEPESVLLVHGEAKKMEFLKQKIEQEFRVSCYMPANGETVTLPTSPSIPVGISLGLL 462
Query: 300 KEEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISD 359
K E + P+ K R +HG L++KDS+ L+ ++A KE+G++ H +RFT V + D
Sbjct: 463 KREMAQ--GLLPDAKKPRLLHGTLIMKDSNFRLVSSEQALKELGLAEHQLRFTCRVHLHD 520
Query: 360 SSPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLL--SVERIDDKNKRVFVTW 417
+ +T +Y L L+D ++ L + S++++S+L+ + D K + V+W
Sbjct: 521 TRKEQETAMRVYSHLKGLLKDHCVQH---LPDGSVTVESILIQAAAHSEDPGTKVLLVSW 577
Query: 418 PNQDEEVGKIVLHVLK 433
QDEE+G + +LK
Sbjct: 578 TYQDEELGSYLTSLLK 593
>gi|156403103|ref|XP_001639929.1| predicted protein [Nematostella vectensis]
gi|156227060|gb|EDO47866.1| predicted protein [Nematostella vectensis]
Length = 527
Score = 552 bits (1423), Expect = e-154, Method: Compositional matrix adjust.
Identities = 251/374 (67%), Positives = 316/374 (84%), Gaps = 3/374 (0%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF ++VG +S+VYTGDYNMTPDRHLG+AWIDKCRPD+LITESTYATTIRDSKRCRERDFL
Sbjct: 134 MFHMRVGTESVVYTGDYNMTPDRHLGSAWIDKCRPDILITESTYATTIRDSKRCRERDFL 193
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHE +++GGKVLIPVFALGRAQELCILLETYWERMNL+APIYF+ GLTEKA +YYK+F
Sbjct: 194 KKVHETMEKGGKVLIPVFALGRAQELCILLETYWERMNLKAPIYFSTGLTEKANHYYKLF 253
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
ITWTNQKI+ TFVQRNMF+F+HI+PFD+S+IDNPGPMVVFATPGMLH+GLSL IFKKWA
Sbjct: 254 ITWTNQKIKNTFVQRNMFEFEHIKPFDRSYIDNPGPMVVFATPGMLHAGLSLQIFKKWAS 313
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
ENNM+++PG+CV GT+GHKVL+G KK+E + K +IDVK++++YMSFSAHADA GIMQLI
Sbjct: 314 NENNMVVIPGYCVAGTVGHKVLAGQKKIELDKKNVIDVKLSVQYMSFSAHADAAGIMQLI 373
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
+ EP+NV+LVHGE KM FLK+KI+QEF + C+ PANGE+ V T I++D+S++LLK
Sbjct: 374 RTAEPRNVILVHGEGGKMEFLKQKIEQEFRIPCYKPANGETVTVPTSPVIAVDMSLSLLK 433
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
+ ++ P N + GVLV+KD S+ L++ +EA KE+G+++H +RFTS V + S
Sbjct: 434 RK-IEARLTPDN--NHLPLQGVLVMKDESLRLLESEEAMKELGLTQHQLRFTSTVTMETS 490
Query: 361 SPINKTLELIYDQL 374
P ++ LE ++ L
Sbjct: 491 EPNHQILEKLHSHL 504
>gi|198413502|ref|XP_002128796.1| PREDICTED: similar to cleavage and polyadenylation specific factor
3-like [Ciona intestinalis]
Length = 605
Score = 552 bits (1422), Expect = e-154, Method: Compositional matrix adjust.
Identities = 260/446 (58%), Positives = 340/446 (76%), Gaps = 9/446 (2%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +KVG S++YTGDYNMTPDRHLGAAW+DKCRPD+LITESTYATTIRDSKRCRERDFL
Sbjct: 164 MFLLKVGTDSVLYTGDYNMTPDRHLGAAWVDKCRPDVLITESTYATTIRDSKRCRERDFL 223
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHE V+ GGKVLIPVFALGRAQELCILLE+YW+RMNL+ PIYF+ GLT KAT YYK+F
Sbjct: 224 KKVHERVEDGGKVLIPVFALGRAQELCILLESYWDRMNLKVPIYFSAGLTNKATEYYKLF 283
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
ITWTNQKI+ TFV+RNMFDFKHI+ F++S+IDNPGPMVVFATPGMLH GLSL IFK+W
Sbjct: 284 ITWTNQKIKDTFVERNMFDFKHIKEFNRSYIDNPGPMVVFATPGMLHGGLSLEIFKRWCT 343
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPG+CV GT+G+K+LSG++K+E + Q++++KM +EYMSFSAHADAKGIMQLI
Sbjct: 344 NEKNMIIMPGYCVAGTVGNKILSGMRKIELKPGQVVEIKMQVEYMSFSAHADAKGIMQLI 403
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
+ CEP NV+LVHGEA KM FLK KI QE +L CF PANGE+ + T+ I++D S NLL+
Sbjct: 404 RMCEPSNVMLVHGEAGKMDFLKSKINQELHLPCFKPANGETVVITTNPPITVDASTNLLR 463
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQI--S 358
+ + NP + + +HG +++K + I ++D +EA +E+ + +H VRFTS + + S
Sbjct: 464 KNREILGTS-YNPKRVKTLHGAVLMKGNEIKVVDGEEAVEELQLKKHEVRFTSTIHMPTS 522
Query: 359 DSSPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLLSVE--RIDDKNK-RVFV 415
P + ++E ++D L L+ + ++ +L IDS+LL + D+ + V V
Sbjct: 523 QPQPTSSSMEQLHDHLKKTLRTHNVS---ATSDGNLLIDSILLKAQPSSTDESSHLDVLV 579
Query: 416 TWPNQDEEVGKIVLHVLKSMSNTQGQ 441
+W QDEE+G + +L++ + Q
Sbjct: 580 SWSIQDEELGSYITDLLRTGLPARAQ 605
>gi|332030612|gb|EGI70300.1| Integrator complex subunit 11 [Acromyrmex echinatior]
Length = 536
Score = 550 bits (1416), Expect = e-153, Method: Compositional matrix adjust.
Identities = 246/375 (65%), Positives = 310/375 (82%), Gaps = 4/375 (1%)
Query: 63 VHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMFIT 122
VHEC+DRGGKVLIPVFALGRAQELCILLETYWERMNL+ P+YFA+GLTEKA NYYKMFIT
Sbjct: 166 VHECIDRGGKVLIPVFALGRAQELCILLETYWERMNLKVPVYFALGLTEKANNYYKMFIT 225
Query: 123 WTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAPVE 182
WTNQKI+KTFVQRNMFDFKHI+PFDK++IDNPG MVVFATPGMLH+GLSL IFKKWAP E
Sbjct: 226 WTNQKIKKTFVQRNMFDFKHIKPFDKAYIDNPGAMVVFATPGMLHAGLSLQIFKKWAPNE 285
Query: 183 NNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLIQY 242
+NM+IMPGFCVQGT+GHKVL+G +++EFEN+QI++VKMA+EYMSFSAHADAKGIMQLIQY
Sbjct: 286 SNMVIMPGFCVQGTVGHKVLNGTRRIEFENRQIVEVKMAVEYMSFSAHADAKGIMQLIQY 345
Query: 243 CEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLKEE 302
CEPKNV+LVHGE +KM +LKEKIKQEF + C+ PANGE+C + T K+ +D S+ LLK E
Sbjct: 346 CEPKNVMLVHGEFAKMEYLKEKIKQEFGVSCYNPANGETCIITTTSKVPVDASLALLKAE 405
Query: 303 AVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDSSP 362
A +Y++ PP+P + R +H VL++++ + L+D DE K GI++H+VRFTS VQ+ D P
Sbjct: 406 AKRYSALPPDPKRRRLLHSVLMLREDGVCLVDADEVAKAAGITKHVVRFTSTVQVRDPGP 465
Query: 363 INKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLLSVERIDDKNKRVFVTWPNQDE 422
T +LL L+++ + +QLTE S+S++SVL+ VE +D K V+V+W NQDE
Sbjct: 466 AQATTL----KLLQPLKERLSGWTVQLTEGSISVESVLVKVEGDEDDQKSVYVSWTNQDE 521
Query: 423 EVGKIVLHVLKSMSN 437
++G +L L++M N
Sbjct: 522 DLGSYILGFLQTMLN 536
>gi|196007172|ref|XP_002113452.1| hypothetical protein TRIADDRAFT_57642 [Trichoplax adhaerens]
gi|190583856|gb|EDV23926.1| hypothetical protein TRIADDRAFT_57642 [Trichoplax adhaerens]
Length = 596
Score = 548 bits (1411), Expect = e-153, Method: Compositional matrix adjust.
Identities = 258/425 (60%), Positives = 325/425 (76%), Gaps = 11/425 (2%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF VKVG +S++YTGDYNMTPDRHLGAAWIDKCRP+LLITESTYATTIRDSKRCRERDFL
Sbjct: 163 MFLVKVGCESVLYTGDYNMTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFL 222
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KVHECV+RGGKVLIPVFALGRAQELCILLETYW+RMNL+ PIYF+ GLTEKA +YYK+F
Sbjct: 223 TKVHECVERGGKVLIPVFALGRAQELCILLETYWDRMNLKVPIYFSTGLTEKANHYYKLF 282
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
ITWTNQKIR+TFVQ NMF+FKHI+PFD++ IDNP PMVVFATPGMLH GLSL IFKKWAP
Sbjct: 283 ITWTNQKIRRTFVQHNMFEFKHIKPFDRALIDNPNPMVVFATPGMLHGGLSLQIFKKWAP 342
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
+ NM+I+PG+CV GT+G+K+LSG + +E ENKQIIDVK+A+EYMSFSAHADAKGIMQLI
Sbjct: 343 DDKNMVILPGYCVAGTVGNKILSGQRTVELENKQIIDVKLAVEYMSFSAHADAKGIMQLI 402
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
++ EP+NV+LVHGEASKM FL +KI+QE C+MP NGE+ ++ KI++D+S L K
Sbjct: 403 KHAEPENVMLVHGEASKMNFLMQKIEQEIGTPCYMPKNGETVKIRASPKIAVDMSFELYK 462
Query: 301 EEA--------VKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFT 352
+ + N +P + R I GV+++ D+S L+D +A E+G+ H + F
Sbjct: 463 QNQPQISLKSHIHENKKPRLDVDPRIIEGVIIMDDNSTRLVDSKQALHELGMKSHNLNFC 522
Query: 353 SNVQISDSSPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLLSVERIDDKNKR 412
S + + D N+ L +Y L D +E Q ++ S+ + SV++ + DD
Sbjct: 523 STISVPDKVVRNELLRELY---LHVKSDFPKEEVSQSSDGSIKLHSVVIEYQVNDDSEAV 579
Query: 413 VFVTW 417
+ ++W
Sbjct: 580 IAISW 584
>gi|355744837|gb|EHH49462.1| hypothetical protein EGM_00117, partial [Macaca fascicularis]
Length = 592
Score = 544 bits (1401), Expect = e-151, Method: Compositional matrix adjust.
Identities = 260/435 (59%), Positives = 330/435 (75%), Gaps = 7/435 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+KVG++S+VYTGDYNMTP+RHLGAAWIDKCRP+LLITESTYATTIRDSKRCRERDFL
Sbjct: 155 MFQIKVGSESVVYTGDYNMTPERHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFL 214
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHE V+RGGKVLIPVFALGRAQELCILLET+WERMNL+ PIYF+ GLTEKA +YYK+F
Sbjct: 215 KKVHETVERGGKVLIPVFALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLF 274
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
I WTNQKIRKTFVQRNMF+FKHI+ FD++F DNPGPMVVFATPGMLH+G SL IF+KWA
Sbjct: 275 IPWTNQKIRKTFVQRNMFEFKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAG 334
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPG+CVQGT+GHK+LSG +KLE E +Q+++VKM +EYMSFSAHADAKGIMQL+
Sbjct: 335 NEKNMVIMPGYCVQGTVGHKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLV 394
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
EP++VLLVHGEA KM FLK+KI+QE E+ + T I + +S+ LLK
Sbjct: 395 GQAEPESVLLVHGEAKKMEFLKQKIEQELRRGSRGQLLHEAVTLPTSPSIPVGISLGLLK 454
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
E + P K R +HG L++KDS+ L+ ++A KE+G++ H +RFT V + D+
Sbjct: 455 REMAQ--GLLPEAKKPRLLHGTLIMKDSNFRLVSSEQALKELGLAEHQLRFTCRVHLHDT 512
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLL--SVERIDDKNKRVFVTWP 418
+ + S L+D ++ L + S++++S+L+ + D K + V+W
Sbjct: 513 RKEXXXXXGLTEPPHSVLKDHCVQH---LPDGSVTVESILIQAAAPSEDPGTKVLLVSWT 569
Query: 419 NQDEEVGKIVLHVLK 433
QDEE+G + +LK
Sbjct: 570 YQDEELGSFLTSLLK 584
>gi|441671688|ref|XP_004093259.1| PREDICTED: LOW QUALITY PROTEIN: integrator complex subunit 11
[Nomascus leucogenys]
Length = 585
Score = 544 bits (1401), Expect = e-151, Method: Compositional matrix adjust.
Identities = 248/360 (68%), Positives = 301/360 (83%), Gaps = 2/360 (0%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+KVG++S+VYTGDYNMTPDRHLGAAWIDKCRP+LLITESTYATTIRDSKRCRERDFL
Sbjct: 163 MFQIKVGSESVVYTGDYNMTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFL 222
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHE V+RGGKVLIPV ALGRAQELCILLET+WERMNL+ PIYF+ GLTEKA +YYK+F
Sbjct: 223 KKVHETVERGGKVLIPVXALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLF 282
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
I WTNQKIRKTFVQRNMF+FKHI+ FD++F DNPGPMVVFATPGMLH+G SL IF+KWA
Sbjct: 283 IPWTNQKIRKTFVQRNMFEFKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAG 342
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPG+CVQGT+GHK+LSG +KLE E +Q+++VKM +EYMSFSAHADAKGIMQL+
Sbjct: 343 NEKNMVIMPGYCVQGTVGHKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLV 402
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
EP++VLLVHGEA KM FLK+KI+QE + C+MPANGE+ + T I + +S+ LLK
Sbjct: 403 GQAEPESVLLVHGEAKKMEFLKQKIEQELRVSCYMPANGETVTLPTSPSIPVGISLGLLK 462
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
E + P K R +HG L++KDS+ L+ ++A KE+G++ H +RFT V + D+
Sbjct: 463 REMAQ--GLLPEAKKPRLLHGTLIMKDSNFRLVSSEQALKELGLAEHQLRFTCKVHLHDT 520
>gi|410989914|ref|XP_004001198.1| PREDICTED: LOW QUALITY PROTEIN: integrator complex subunit 11
[Felis catus]
Length = 598
Score = 541 bits (1394), Expect = e-151, Method: Compositional matrix adjust.
Identities = 256/435 (58%), Positives = 326/435 (74%), Gaps = 9/435 (2%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+KVG++S+VYTGDYNMTPDRHLGAAWIDKCRP+LLITESTYATTIRDSKRCRERDFL
Sbjct: 163 MFQIKVGSESVVYTGDYNMTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFL 222
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHE V+RGGKVLIPVFALGRAQELCILLET+WERMNL+APIYF+ GLTEKA +YYK+F
Sbjct: 223 KKVHEAVERGGKVLIPVFALGRAQELCILLETFWERMNLKAPIYFSTGLTEKANHYYKLF 282
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
ITWTNQKIRKTFVQRNMF+FKHI+ FD++F DNPGPMVVFATPGMLH+G SL IF+KWA
Sbjct: 283 ITWTNQKIRKTFVQRNMFEFKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAG 342
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPG+CVQGT+GHK+LSG +KLE E +Q+++VKM +EYMSFSAHADAKGIMQL+
Sbjct: 343 NEKNMVIMPGYCVQGTVGHKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLV 402
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
E + V + S F + C+MPANGE+ + T I + +S+ LLK
Sbjct: 403 GQAEARGVSMGRASXSXQAFCPSPSGP--GVSCYMPANGETVTLPTSPSIPVGISLGLLK 460
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
E + P+ K R +HG L++KDS+ L+ ++A KE+G++ H +RFT V + D
Sbjct: 461 REMAQ--GLLPDAKKPRLLHGTLIMKDSNFRLVSSEQALKELGLAEHQLRFTCRVHLHDP 518
Query: 361 SPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLL--SVERIDDKNKRVFVTWP 418
+T +Y L S L++ ++ L + S++++S+L+ + D K + V+W
Sbjct: 519 RKEQETAMRVYSHLKSVLKEHCVQH---LPDGSVTVESILIQAAAHSEDPGTKVLLVSWT 575
Query: 419 NQDEEVGKIVLHVLK 433
QDEE+G + +LK
Sbjct: 576 YQDEELGSHLTSLLK 590
>gi|340381556|ref|XP_003389287.1| PREDICTED: integrator complex subunit 11-like [Amphimedon
queenslandica]
Length = 610
Score = 534 bits (1376), Expect = e-149, Method: Compositional matrix adjust.
Identities = 255/435 (58%), Positives = 329/435 (75%), Gaps = 8/435 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V+VG+QS+VYTGDYNMTPDRHLG+AWID+CRPDLLITESTYATTIRDSKRCRERDFL
Sbjct: 164 MFHVRVGHQSVVYTGDYNMTPDRHLGSAWIDRCRPDLLITESTYATTIRDSKRCRERDFL 223
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KK+HEC++R GKVLIPVFALGRAQELCILLE+YWERMNL+ PIYF+ GLTEKA +YYK+F
Sbjct: 224 KKLHECLERDGKVLIPVFALGRAQELCILLESYWERMNLKYPIYFSTGLTEKANHYYKLF 283
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
I+WTNQKI+ TF+ RNMFDFKHI+ FD+S+ID PGPM+VFATPGMLH+GLSL IFKKWA
Sbjct: 284 ISWTNQKIKNTFIHRNMFDFKHIKAFDRSYIDQPGPMIVFATPGMLHAGLSLQIFKKWAE 343
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NMLIMPG+CV GTIGHKVLSG KK+E + K++++++++++YMSFSAHADAKGIMQLI
Sbjct: 344 DEKNMLIMPGYCVPGTIGHKVLSGTKKIEID-KKLVNIRLSVQYMSFSAHADAKGIMQLI 402
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
Q EPKNVLLVHGEA+KM FL++KI +EF + C+MPANGE+ + T IS+++S L K
Sbjct: 403 QLAEPKNVLLVHGEAAKMEFLRQKINEEFGIHCYMPANGETVAIATTPSISVNMSSLLFK 462
Query: 301 EEAVKYNSEPPNPL-----KERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNV 355
+ PP+ + G+LV+K+ +SL+ E E+G++RH ++F+ +
Sbjct: 463 RALEESIGAPPSKRLCSLPNSAVVDGLLVMKNEEVSLLTHKEVSDEIGLTRHKLKFSQTL 522
Query: 356 QISDSS-PINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLLSVERIDDKNKRVF 414
+ P + LEL+ + + D + LS SV LS+ + + V
Sbjct: 523 TLDGHFLPDSNILELLTTSINKLVPDSKVLQTKEGYGLELSGTSVTLSIAQFLPQ-PLVN 581
Query: 415 VTWPNQDEEVGKIVL 429
+TW QD+ +G I++
Sbjct: 582 LTWELQDDHIGSILV 596
>gi|47229058|emb|CAG03810.1| unnamed protein product [Tetraodon nigroviridis]
Length = 698
Score = 529 bits (1362), Expect = e-147, Method: Compositional matrix adjust.
Identities = 250/409 (61%), Positives = 305/409 (74%), Gaps = 45/409 (11%)
Query: 15 GDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFLKKVHECVDRGGKVL 74
GDYNMTPDRHLGAAWIDKCRPD+LI+ESTYATTIRDSKRCRERDFLKKVHE ++RGGKVL
Sbjct: 245 GDYNMTPDRHLGAAWIDKCRPDILISESTYATTIRDSKRCRERDFLKKVHETIERGGKVL 304
Query: 75 IPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMFITWTNQKIRKTFVQ 134
IPVFALGRAQELCILLET+WERMNL+APIYF+ GLTEKA +YYK+FITWTNQKIRKTFVQ
Sbjct: 305 IPVFALGRAQELCILLETFWERMNLKAPIYFSTGLTEKANHYYKLFITWTNQKIRKTFVQ 364
Query: 135 RNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAPVENNML-------- 186
RNMF+FKHI+ FD+S+ DNPGPMVVFATPGMLH+G SL IFKKWA E NM+
Sbjct: 365 RNMFEFKHIKAFDRSYADNPGPMVVFATPGMLHAGQSLQIFKKWAGNEKNMVQFLRRECG 424
Query: 187 ------------IMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAK 234
MPG+CVQGTIGHK+L+G +KLE E + +DVK+ +EYMSFSAHADAK
Sbjct: 425 RQQKHAALLSWSSMPGYCVQGTIGHKILNGQRKLEMEGRATLDVKLQVEYMSFSAHADAK 484
Query: 235 GIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF------------------------N 270
GIMQLI+ EP+N+LLVHGEA+KM FLK KI+QEF
Sbjct: 485 GIMQLIRMAEPRNMLLVHGEAAKMEFLKGKIEQEFSESRAAPRRAATGPDSPANPAFPPG 544
Query: 271 LDCFMPANGESCFVQTDMKISIDVSVNLLKEEAVKYNSEPPNPLKERQIHGVLVIKDSSI 330
+DC+MPANGE+ V T+ + +D+S+NLLK E + P+P K R +HG L++K++S+
Sbjct: 545 IDCYMPANGETATVSTNPSVPVDISLNLLKRE-MALGGPLPDPKKPRAMHGTLIMKENSL 603
Query: 331 SLMDVDEACKEVGISRHIVRFTSNVQISDSSPINKTLELIYDQLLSYLQ 379
L+ ++A KE+G++ H +RFT V + D N TL+ +Y L S L+
Sbjct: 604 RLVPPEQALKELGLNEHQLRFTCRVHLQDPHSDNDTLQRVYTHLRSVLK 652
>gi|291238246|ref|XP_002739041.1| PREDICTED: cleavage and polyadenylation specific factor 3-like
[Saccoglossus kowalevskii]
Length = 573
Score = 517 bits (1331), Expect = e-143, Method: Compositional matrix adjust.
Identities = 247/437 (56%), Positives = 319/437 (72%), Gaps = 37/437 (8%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF VKVG+QS+VYTGDYNMT DRHLG CRERDFL
Sbjct: 163 MFHVKVGSQSVVYTGDYNMTADRHLG---------------------------CRERDFL 195
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
+KVH+CV++GGKVLIPVFALGRAQELCILLET+W+RMNL+APIYF+ GLTEKA +YYK+F
Sbjct: 196 QKVHDCVEKGGKVLIPVFALGRAQELCILLETFWDRMNLKAPIYFSTGLTEKANHYYKLF 255
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
ITWTNQKIRKTFVQRNMF+F+HI+PFD+S+ DNPGPMVVFATPGMLH GLSL +FKKWA
Sbjct: 256 ITWTNQKIRKTFVQRNMFEFRHIKPFDRSYTDNPGPMVVFATPGMLHGGLSLHVFKKWAS 315
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPG+CV GT+GHK+L+G +K+E EN+Q IDVK++++YMSFSAHADAKGIMQLI
Sbjct: 316 NEKNMVIMPGYCVAGTVGHKILNGQRKIELENRQTIDVKLSVQYMSFSAHADAKGIMQLI 375
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
+ CEPKNV+LVHGEA KM FLK+KI Q+F + CFMP NGES ++T I D+SVNLLK
Sbjct: 376 KQCEPKNVMLVHGEAKKMDFLKQKIVQQFGVQCFMPPNGESVVIETHSNIPADISVNLLK 435
Query: 301 EEA--VKYNSE-PPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQI 357
A ++S +P K +HG LV+ D+ I++++ +A KE S H +RFTS V +
Sbjct: 436 RNAALAGFSSTLSTDPKKLCTMHGALVMHDNRITIVEASQALKECSASEHEIRFTSTVSL 495
Query: 358 SDSSPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLLSVERIDDKNKRVFVTW 417
P +++++Y L + ++ ++ SL +DSVL+ + D+ + V+W
Sbjct: 496 ----PPACSMDVLYKTLKQRFSGHNVQHA---SDGSLQVDSVLVKISGESDEGTDMLVSW 548
Query: 418 PNQDEEVGKIVLHVLKS 434
P QDEE+G +L +L++
Sbjct: 549 PYQDEELGSSLLTMLQN 565
>gi|256084683|ref|XP_002578556.1| cleavage and polyadenylation specificity factor [Schistosoma
mansoni]
gi|350644758|emb|CCD60512.1| cleavage and polyadenylation specificity factor,putative
[Schistosoma mansoni]
Length = 619
Score = 489 bits (1260), Expect = e-135, Method: Compositional matrix adjust.
Identities = 226/371 (60%), Positives = 291/371 (78%), Gaps = 6/371 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V+VG S++YTGDYNMTPDRHLGAAW+ +CRPDLLITESTYATTIRDSKR RER+FL
Sbjct: 163 MFLVRVGTNSVLYTGDYNMTPDRHLGAAWVSRCRPDLLITESTYATTIRDSKRTREREFL 222
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
+K+H V+ GGKVLIPVFALGRAQELCILLETYWERMN+ PIYF++G+ EKA YYK+F
Sbjct: 223 EKIHARVEAGGKVLIPVFALGRAQELCILLETYWERMNISVPIYFSMGMAEKANEYYKLF 282
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
I+WTNQKI++TFV+RNMFDFKHI+P + +DNPGPMVVFATPGMLH+G SL IF+KWA
Sbjct: 283 ISWTNQKIKETFVKRNMFDFKHIKPLGQGTVDNPGPMVVFATPGMLHAGQSLHIFRKWAS 342
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+++PG+CV GT+G+K+L+GV++LEF+ KQ+++VKMA+EY+SFSAHADA+GIMQLI
Sbjct: 343 DERNMVVIPGYCVAGTVGYKILNGVRRLEFD-KQVLEVKMAVEYLSFSAHADARGIMQLI 401
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
+C+PK+V+LVHGEA KM FLK KI+QEF L C PANGE ++T+ + ++ S L
Sbjct: 402 SHCQPKHVMLVHGEAIKMDFLKSKIEQEFGLPCSKPANGEVVHIKTEQQFIVEASREFLN 461
Query: 301 EEAVKYNSEPPNPL--KERQIHGVLVIK-DSSISLMDVDEACKEVGISRHIVRFTSNVQI 357
+ Y+ +P PL + R I G + I+ D + L+D + + H++RFTS
Sbjct: 462 QSY--YSDDPNEPLIKRSRSIQGGIQIEPDETPKLLDRASVLASLSLKEHVMRFTSTHVF 519
Query: 358 SDSSPINKTLE 368
S SS + L+
Sbjct: 520 SSSSNLATILD 530
>gi|355680857|gb|AER96662.1| cleavage and polyadenylation specific factor 3-like protein
[Mustela putorius furo]
Length = 440
Score = 485 bits (1248), Expect = e-134, Method: Compositional matrix adjust.
Identities = 217/269 (80%), Positives = 249/269 (92%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+KVG++S+VYTGDYNMTPDRHLGAAWIDKCRP+LLITESTYATTIRDSKRCRERDFL
Sbjct: 172 MFQIKVGSESVVYTGDYNMTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFL 231
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHE V+RGGKVLIPVFALGRAQELCILLET+WERMNL+APIYF+ GLTEKA +YYK+F
Sbjct: 232 KKVHEAVERGGKVLIPVFALGRAQELCILLETFWERMNLKAPIYFSTGLTEKANHYYKLF 291
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
ITWTNQKIRKTFVQRNMF+FKHI+ FD++F DNPGPMVVFATPGMLH+G SL IF+KWA
Sbjct: 292 ITWTNQKIRKTFVQRNMFEFKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAG 351
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPG+CVQGT+GHK+LSG +KLE E +Q+++VKM +EYMSFSAHADAKGIMQL+
Sbjct: 352 NEKNMVIMPGYCVQGTVGHKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLV 411
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEF 269
EP++VLLVHGEA KM FLK+KI+QE
Sbjct: 412 GQAEPESVLLVHGEAKKMEFLKQKIEQEL 440
>gi|320170221|gb|EFW47120.1| integrator complex subunit 11 [Capsaspora owczarzaki ATCC 30864]
Length = 661
Score = 472 bits (1215), Expect = e-130, Method: Compositional matrix adjust.
Identities = 202/299 (67%), Positives = 259/299 (86%), Gaps = 2/299 (0%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V+VG++S+VYTGD+NMTPDRHLGAAWID+CRPDLLITESTYATTIRDSKR RE +FL
Sbjct: 162 MFHVRVGSESVVYTGDFNMTPDRHLGAAWIDRCRPDLLITESTYATTIRDSKRNREGEFL 221
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
+K+HECV++GGKVLIPVFALGRAQELCIL+ETYWER+ L P+YF+ GLT KA NYYK+F
Sbjct: 222 RKIHECVEQGGKVLIPVFALGRAQELCILVETYWERLGLTVPVYFSAGLTAKANNYYKLF 281
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
ITWTNQKI++TFV+RNMF+FKHI+PFD++F+DNPGPMV+FATPGMLH+G+SL F+KWAP
Sbjct: 282 ITWTNQKIKRTFVERNMFEFKHIKPFDRAFLDNPGPMVLFATPGMLHAGMSLDAFRKWAP 341
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQ--IIDVKMAIEYMSFSAHADAKGIMQ 238
+ NM+I+PG+CV GT+G+KVL+G K++E ++ +IDV+++++ +SFSAHADAKGI+Q
Sbjct: 342 NDKNMVILPGYCVAGTVGNKVLAGHKQIEMPDRARTVIDVRLSVQNLSFSAHADAKGIVQ 401
Query: 239 LIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVN 297
LI++ EP+NV+LVHGE +KM FLK KI E + CF PANG + ++T IS+ V N
Sbjct: 402 LIRHAEPRNVMLVHGEKAKMAFLKAKIISEIGIPCFDPANGATVTIETAHPISVSVPPN 460
>gi|358333178|dbj|GAA51732.1| integrator complex subunit 11 [Clonorchis sinensis]
Length = 649
Score = 468 bits (1205), Expect = e-129, Method: Compositional matrix adjust.
Identities = 206/299 (68%), Positives = 256/299 (85%), Gaps = 1/299 (0%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF ++VG+QS++YTGDYNMTPDRHLGAAW+ +C PD+LITESTYATTIRDSKR RER+FL
Sbjct: 232 MFLIRVGSQSVLYTGDYNMTPDRHLGAAWVSRCCPDILITESTYATTIRDSKRAREREFL 291
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
+K+H V+ GGKVLIPVFALGRAQELCILLETYWERMN+ PIYF++G+ EKA YYK+F
Sbjct: 292 EKIHARVEAGGKVLIPVFALGRAQELCILLETYWERMNISVPIYFSMGMAEKANEYYKLF 351
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
I+WTNQKI++TFV+RNMF+FKHI+P + +DNPGPMVVFATPGMLH+G SL IF+KWAP
Sbjct: 352 ISWTNQKIKETFVKRNMFEFKHIKPLGQGIVDNPGPMVVFATPGMLHAGQSLHIFRKWAP 411
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+++PG+CV GT+G+K+L+GVK+LEF+ +Q +DV M +EY+SFSAHADA+GIMQLI
Sbjct: 412 DERNMVVIPGYCVAGTVGYKILNGVKRLEFD-RQALDVNMRVEYLSFSAHADARGIMQLI 470
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLL 299
+C+P+NVLLVHGEASKM FLK KI+QEF L C PANGE V+T+ + + LL
Sbjct: 471 SHCQPRNVLLVHGEASKMDFLKAKIEQEFGLPCLKPANGEIVHVKTESQFVANAPTELL 529
>gi|322790916|gb|EFZ15582.1| hypothetical protein SINV_14424 [Solenopsis invicta]
Length = 277
Score = 467 bits (1202), Expect = e-128, Method: Compositional matrix adjust.
Identities = 204/276 (73%), Positives = 248/276 (89%)
Query: 62 KVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMFI 121
+VHEC+DRGGKVLIPVFALGRAQELCILLETYWERMNL+ P+YFA+GLTEKA NYYKMFI
Sbjct: 1 QVHECIDRGGKVLIPVFALGRAQELCILLETYWERMNLKVPVYFALGLTEKANNYYKMFI 60
Query: 122 TWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAPV 181
TWTNQKI+KTFVQRNMFDFKHI+PFDK++IDNPG MVVFATPGMLH+GLSL IFKKWAP
Sbjct: 61 TWTNQKIKKTFVQRNMFDFKHIKPFDKAYIDNPGAMVVFATPGMLHAGLSLQIFKKWAPN 120
Query: 182 ENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLIQ 241
E+NM+IMPGFCVQGT+GHKVL+G +++EFEN+QI++VKMA+EYMSFSAHADAKGIMQLIQ
Sbjct: 121 ESNMVIMPGFCVQGTVGHKVLNGSRRIEFENRQIVEVKMAVEYMSFSAHADAKGIMQLIQ 180
Query: 242 YCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLKE 301
YCEPKNV+LVHGE +KM +LKEKIKQEF ++C+ PANGE+C + T K+ +D S+ LLK
Sbjct: 181 YCEPKNVMLVHGEFAKMEYLKEKIKQEFGVNCYNPANGETCIITTTSKVPVDASLALLKT 240
Query: 302 EAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDE 337
EA +Y++ PP+P + R +H VL++++ + L+D DE
Sbjct: 241 EAKRYSALPPDPKRRRLLHSVLMLREDGVCLVDADE 276
>gi|440801023|gb|ELR22048.1| cleavage and polyadenylation specific factor 3like, putative
[Acanthamoeba castellanii str. Neff]
Length = 657
Score = 466 bits (1199), Expect = e-128, Method: Compositional matrix adjust.
Identities = 233/472 (49%), Positives = 318/472 (67%), Gaps = 44/472 (9%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V+VG+QS+VYTGDYNMTPDRHLGAAWI+K RPD+LITESTYATTIRDSKR RERDFL
Sbjct: 162 MFYVRVGDQSVVYTGDYNMTPDRHLGAAWIEKLRPDVLITESTYATTIRDSKRWRERDFL 221
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
K+VH CV++GGKVLIPVFALGRAQELCILLETYWERMNL PIYF+ GLTEKATNYYK+F
Sbjct: 222 KRVHSCVEKGGKVLIPVFALGRAQELCILLETYWERMNLTVPIYFSAGLTEKATNYYKLF 281
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
I WTN+KI++TFV RNMFDFKHI F++ D PGPMV+FATPGMLH+G SL +FKKWAP
Sbjct: 282 IHWTNEKIKRTFVHRNMFDFKHISTFERGLADQPGPMVLFATPGMLHAGTSLEVFKKWAP 341
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVK---KLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
E N++I+PG+CV GT+G+K+ +G K K++ +++ ++V+ ++ +SFSAHADAKGIM
Sbjct: 342 NEKNLVIIPGYCVVGTVGNKLAAGRKGSFKVDLDSRTSLEVRCKVKNLSFSAHADAKGIM 401
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVN 297
QLI+ +P NV+LVHGE KM LK+++ +EF + CF P NG + ++T + + +S
Sbjct: 402 QLIKMSQPANVILVHGEKGKMQSLKQRVIREFEIPCFDPPNGSTVTIKTARSVPVQISPA 461
Query: 298 LLKEEA--------------------VKYNSEP-------------PNPLKERQIHGVLV 324
L++ A + +SE P+P + GV++
Sbjct: 462 LIRAHAKRPLPSSLTPASSSSSSSSSIGIDSEGTTQPPLKRFKWTEPSPADRALLTGVVL 521
Query: 325 IKDS-SISLMDVDEACKEVGISRHIVRFTSNVQISDSSPINKTLELIYDQLLSYLQDKSQ 383
+KDS +I L+D +EA + + + RH ++F++ S P ++E + D+L L +
Sbjct: 522 VKDSQAIELVDEEEAVERLDMPRHALKFSA----VRSFPAFISIEALLDELYVALSKRVA 577
Query: 384 EYKIQLTESSLSIDSVLLSVERIDDKNKRVFVTWPNQDEEVGKIVLHVLKSM 435
+ S SV +SV+ +N + +W +DE + VL V+ +
Sbjct: 578 AAASRDGRSIRVAGSVRISVK---SENTALHTSWHYRDESLALQVLAVVDRL 626
>gi|281201684|gb|EFA75892.1| integrator complex subunit 11 [Polysphondylium pallidum PN500]
Length = 648
Score = 460 bits (1183), Expect = e-126, Method: Compositional matrix adjust.
Identities = 234/467 (50%), Positives = 318/467 (68%), Gaps = 42/467 (8%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF KVG +S+VYTGDYNMTPDRHLG+AWID P LLITE+TYATTIRDSKR RERDFL
Sbjct: 189 MFYCKVGEESVVYTGDYNMTPDRHLGSAWIDAVNPTLLITETTYATTIRDSKRGRERDFL 248
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
K+VHECV++GGKVLIPVFALGR QELCIL++TYWE+M L PIYF+ GL EKA YYK+F
Sbjct: 249 KRVHECVEKGGKVLIPVFALGRVQELCILIDTYWEQMGLSVPIYFSEGLAEKANFYYKLF 308
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
I WTNQKI++TFV+RNMFDFKHI+PFD+ +D PGPMV+FATPGMLH+G SL +FKKWAP
Sbjct: 309 IGWTNQKIKQTFVKRNMFDFKHIKPFDRMLVDAPGPMVLFATPGMLHAGASLEVFKKWAP 368
Query: 181 VENNMLIMPGFCVQGTIGHKVL---SGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
E NM I+PG+CV GT+G+K+L SG + +E + K ++VK I +SFSAHADAKGIM
Sbjct: 369 SELNMTIIPGYCVVGTVGNKLLSNASGPQMVEIDKKTTLEVKCKIHNLSFSAHADAKGIM 428
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVN 297
QLI+ +PKNVLLVHGE KM FL +KI ++FN+ C+ PANG + +++ I ID+S+
Sbjct: 429 QLIKMSQPKNVLLVHGEKEKMRFLSDKISKDFNIPCYFPANGVTVNIESSKSIPIDISLK 488
Query: 298 LLKEE----------AVKYNS---------------EPPNPLKERQIHGVLVIKD-SSIS 331
LLK + A +YNS + +P+ I G++V K +I
Sbjct: 489 LLKRQISEYSRESATAFEYNSNLKDEDTDSLEQETKKSRSPIVSVPIQGIMVSKGYQNIK 548
Query: 332 LMDVDEACKEVGISRHIVRFTSNVQISDSSPINKTLELIYDQLLSYLQDKSQEYKIQ--- 388
L+D EA KE+ I+ + F ++ + ++++Y +L +L + E Q
Sbjct: 549 LLDPSEASKELSINNVQLNFILYWKLVQE--VKDCIDVLYLELCRWLVGATIERIDQNKI 606
Query: 389 LTESSLSIDSVLLSVERIDDKNKRVFVTWPNQDEEVGKIVLHVLKSM 435
L +SS+SI+ + ++ + W N++E + + +L +++++
Sbjct: 607 LVKSSISIE--------YEKPQQQFTIKWTNKEEYLSEHILSIIQNL 645
>gi|324504608|gb|ADY41989.1| Integrator complex subunit 11 [Ascaris suum]
Length = 588
Score = 456 bits (1174), Expect = e-125, Method: Compositional matrix adjust.
Identities = 208/360 (57%), Positives = 278/360 (77%), Gaps = 3/360 (0%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDF 59
MF +KVG++S++YTGD+N T DRHLGAA ++ +PDLLI+E+TYATTIRDSKR RERDF
Sbjct: 163 MFLIKVGSESVIYTGDFNTTADRHLGAAHVEPGLKPDLLISETTYATTIRDSKRARERDF 222
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKM 119
LKKVH+CV GGKVLIPVFALGRAQELCILLE+YWERM+L PI+F+ GL EKAT YY++
Sbjct: 223 LKKVHDCVANGGKVLIPVFALGRAQELCILLESYWERMDLTVPIFFSHGLAEKATQYYRL 282
Query: 120 FITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWA 179
FI+WTN+KI++TFV RNMFDFKHIRPFD+SF D+PGPMV+F+TPGMLH G SL +FKKW
Sbjct: 283 FISWTNEKIKRTFVHRNMFDFKHIRPFDQSFSDSPGPMVLFSTPGMLHGGQSLRVFKKWC 342
Query: 180 PVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQL 239
E NM+IMPGFCV GTIG KV+ G KK+E + K ++D+ + +EYMSFSAHADAKGIMQL
Sbjct: 343 SDEKNMVIMPGFCVAGTIGAKVIGGAKKVEIDGK-MLDINLGVEYMSFSAHADAKGIMQL 401
Query: 240 IQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLL 299
I+ CEPKNV+ VHGE +KM FLKEK+++EF L + PANGES V T++ ++ V L+
Sbjct: 402 IRQCEPKNVMFVHGENAKMEFLKEKVEKEFGLRVYKPANGESICVDTELNATLTVPSQLI 461
Query: 300 KEEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISD 359
++++ + P + + ++ + S + ++ A ++ ++ H + F+ V++ +
Sbjct: 462 -DKSISLDPTPSKKFCPFRAYVIMDKQSSQLEVISAKTAARQFNVNLHTITFSDTVEVDE 520
>gi|170595519|ref|XP_001902415.1| RNA-metabolising metallo-beta-lactamase family protein [Brugia
malayi]
gi|158589929|gb|EDP28737.1| RNA-metabolising metallo-beta-lactamase family protein [Brugia
malayi]
Length = 589
Score = 456 bits (1173), Expect = e-125, Method: Compositional matrix adjust.
Identities = 220/439 (50%), Positives = 309/439 (70%), Gaps = 24/439 (5%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDF 59
MFQ+ VG++S++YTGD+N TPDRHLGAA ++ +PDLLI+ESTYATTIRDSKR RERDF
Sbjct: 163 MFQIMVGSESVLYTGDFNTTPDRHLGAARVEPGLKPDLLISESTYATTIRDSKRARERDF 222
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKM 119
LKKVH+ V GGKVLIPVFALGRAQELCILLE+YWERMNL+ PI+F+ GL EKA YY++
Sbjct: 223 LKKVHDTVSNGGKVLIPVFALGRAQELCILLESYWERMNLKYPIFFSQGLAEKANQYYRL 282
Query: 120 FITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWA 179
FI+WTN+KI++TFV+RNMFDFKHIRPF++S+I++PGPMV+F+TPGMLH G SL +F KW
Sbjct: 283 FISWTNEKIKRTFVERNMFDFKHIRPFEQSYIESPGPMVLFSTPGMLHGGQSLRVFTKWC 342
Query: 180 PVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQL 239
E N++IMPGFCV T+G KV+SG+KK+E + + ++ + +EYMSFSAHADAKGIMQL
Sbjct: 343 SDEKNLIIMPGFCVANTVGAKVISGMKKIEIAGR-VREINLGVEYMSFSAHADAKGIMQL 401
Query: 240 IQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLL 299
I+ C+P+NV+ VHGE +KM FLKEK+++EFNL F PANGE+ ++T + +S+++ V +L
Sbjct: 402 IRDCQPRNVMFVHGEDAKMEFLKEKVEKEFNLKVFKPANGETVTIETPLNVSLNIPVQIL 461
Query: 300 KEEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDV---DEACKEVGISRHIVRFTSNVQ 356
+ S P P K + D S ++V + A +++G S H + F+ ++
Sbjct: 462 D----RCISLDPTPSKRFCPFRAFLTMDKQTSQLEVITPEAAARQLGTSLHTIAFSETIE 517
Query: 357 ISDSSPINKTLELIYDQLLSYLQDKSQEY--KIQLTESSLSIDSVLLSVERIDDKNKRVF 414
+ + +L K + Y +Q+ E + + +++ + D K V
Sbjct: 518 VGHVN-------------WKFLASKLKLYDSNLQVKEDGIEMFDGEITLTSVTDYPKHVE 564
Query: 415 VTWPNQDEEVGKIVLHVLK 433
+TW EE + +++ L+
Sbjct: 565 ITWDEIREEWSEEIINALR 583
>gi|432090010|gb|ELK23618.1| Integrator complex subunit 11 [Myotis davidii]
Length = 561
Score = 456 bits (1173), Expect = e-125, Method: Compositional matrix adjust.
Identities = 201/250 (80%), Positives = 233/250 (93%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+KVG++S+VYTGDYNMTPDRHLGAAWIDKCRP+LLITESTYATTIRDSKRCRERDFL
Sbjct: 197 MFQIKVGSESVVYTGDYNMTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFL 256
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHE V+RGGKVLIPVFALGRAQELCILLET+WERMNL+APIYF+ GLTEKA +YYK+F
Sbjct: 257 KKVHETVERGGKVLIPVFALGRAQELCILLETFWERMNLKAPIYFSTGLTEKANHYYKLF 316
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
ITWTNQKIRKTFVQRNMF+FKHI+ FD++F DNPGPMVVFATPGMLH+G SL IF+KWA
Sbjct: 317 ITWTNQKIRKTFVQRNMFEFKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAG 376
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPG+CVQGT+GHK+LSG +KLE E +Q+++++M +EYMSFSAHADAKGIMQL+
Sbjct: 377 NEKNMVIMPGYCVQGTVGHKILSGQRKLEMEGRQVLEIRMQVEYMSFSAHADAKGIMQLV 436
Query: 241 QYCEPKNVLL 250
EP++VLL
Sbjct: 437 GQAEPESVLL 446
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 33/104 (31%), Positives = 59/104 (56%), Gaps = 5/104 (4%)
Query: 332 LMDVDEACKEVGISRHIVRFTSNVQISDSSPINKTLELIYDQLLSYLQDKSQEYKIQLTE 391
L+ ++A KE+G++ H +RFT V + DS +T +Y L S L+D ++ L +
Sbjct: 450 LVSSEQALKELGLAEHQLRFTCRVHLHDSRKEQETAMRVYSHLKSLLKDHCVQH---LPD 506
Query: 392 SSLSIDSVLL--SVERIDDKNKRVFVTWPNQDEEVGKIVLHVLK 433
S++++S+L+ + D K + V+W QDEE+G + +LK
Sbjct: 507 GSVTVESILIQAAAHSEDPGTKVLLVSWTYQDEELGSYLTSLLK 550
>gi|326435554|gb|EGD81124.1| integrator complex subunit 11 [Salpingoeca sp. ATCC 50818]
Length = 620
Score = 454 bits (1168), Expect = e-124, Method: Compositional matrix adjust.
Identities = 210/387 (54%), Positives = 275/387 (71%), Gaps = 7/387 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V+VG++S+VYTGDYNMTPDRHLG AWID C+PD +ITESTYATTIRDSKRCRERDFL
Sbjct: 196 MFHVRVGDKSVVYTGDYNMTPDRHLGTAWIDFCQPDAIITESTYATTIRDSKRCRERDFL 255
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KVH CV GGKVLIPVFALGRAQELCILLETYWER L PIYF+ GLTEKA YY++F
Sbjct: 256 TKVHRCVKNGGKVLIPVFALGRAQELCILLETYWERYKLDTPIYFSTGLTEKANEYYRLF 315
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
+ +TNQKI+ TFV RN+FDFKHIR FD+S+ D PGP V+FATPGMLH+G++L +F KWA
Sbjct: 316 VMYTNQKIKDTFVDRNLFDFKHIRAFDRSYADQPGPQVLFATPGMLHAGVALEVFAKWAG 375
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
NM+I+PG+CV GT+G +V++G K++ +Q+++V++ +EY+SFSAHADAKGIM LI
Sbjct: 376 DPRNMVILPGYCVAGTVGAQVIAGKKEIRV-GQQMVNVRLQVEYLSFSAHADAKGIMTLI 434
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
+ C KNV+LVHGE KM FL +++++E + CF P NGE + + ID+S LL
Sbjct: 435 KQCGAKNVVLVHGEKKKMEFLADRVRRELDTPCFFPKNGERLQISVSPPVPIDISNQLLT 494
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDS 360
V + P +E ++HGVL+ +S+ ++ EA G+ H + F ++
Sbjct: 495 RAPVLHTKHGPIRPQEVRVHGVLLRHSTSLKVVSHREASSR-GVDVHTITFKQTLKTPAL 553
Query: 361 SPINKTLELI-----YDQLLSYLQDKS 382
+P + L +D L L D +
Sbjct: 554 TPTSLEAALTAWCKEHDMFLRRLGDTA 580
>gi|312083284|ref|XP_003143797.1| RNA-metabolising metallo-beta-lactamase [Loa loa]
gi|307761039|gb|EFO20273.1| RNA-metabolising metallo-beta-lactamase [Loa loa]
Length = 644
Score = 451 bits (1161), Expect = e-124, Method: Compositional matrix adjust.
Identities = 222/438 (50%), Positives = 309/438 (70%), Gaps = 22/438 (5%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDF 59
MFQ+ VG++S++YTGD+N TPDRHLGAA ++ +PDLLI+ESTYATTIRDSKR RERDF
Sbjct: 218 MFQIMVGSESVLYTGDFNTTPDRHLGAARVEPGLKPDLLISESTYATTIRDSKRARERDF 277
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKM 119
LKKVH+ V GGKVLIPVFALGRAQELCILLE+YWERMNL+ PI+F+ GL EKA YY++
Sbjct: 278 LKKVHDTVSNGGKVLIPVFALGRAQELCILLESYWERMNLKYPIFFSQGLAEKANQYYRL 337
Query: 120 FITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWA 179
FI+WTN+KI++TFV+RNMFDFKHIRPF++S+ D+PGPMV+F+TPGMLH G SL +F KW
Sbjct: 338 FISWTNEKIKRTFVERNMFDFKHIRPFEQSYTDSPGPMVLFSTPGMLHGGQSLRVFTKWC 397
Query: 180 PVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQL 239
E N++IMPGFCV T+G KV+SG+KK+E + + ++ + +EYMSFSAHADAKGIMQL
Sbjct: 398 SDEKNLIIMPGFCVANTVGAKVISGMKKIEIAGR-MREINLGVEYMSFSAHADAKGIMQL 456
Query: 240 IQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLL 299
I+ C+PKNV+ VHGE +KM FLKEK+++EFNL F PANGE+ +++ + +S++V V +L
Sbjct: 457 IRDCQPKNVMFVHGEDAKMEFLKEKVEKEFNLKVFKPANGETITIESPLNVSLNVPVQIL 516
Query: 300 KEEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDV---DEACKEVGISRHIVRFTSNVQ 356
+ S P P K + D + ++V + A +++G S H + F+ ++
Sbjct: 517 D----RCISLDPTPSKRFCPFRAFLTMDKQTNQLEVITPEAAARQLGTSLHTIAFSETIE 572
Query: 357 ISDSSPINKTLEL-IYDQLLSYLQDKSQEYKIQLTESSLSIDSVLLSVERIDDKNKRVFV 415
+ + +L +YD L +D I++ +++ SV+ D K V +
Sbjct: 573 VDHINWNCFAAKLKVYDPNLQVKED-----GIEMFNGEVTLASVM-------DNPKHVEI 620
Query: 416 TWPNQDEEVGKIVLHVLK 433
TW E+ + ++ L+
Sbjct: 621 TWDEIREDWSEEIISALR 638
>gi|328873132|gb|EGG21499.1| integrator complex subunit 11 [Dictyostelium fasciculatum]
Length = 645
Score = 450 bits (1158), Expect = e-123, Method: Compositional matrix adjust.
Identities = 233/473 (49%), Positives = 314/473 (66%), Gaps = 54/473 (11%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF KVG +S+VYTGDYNMTPDRHLG+AWID+ RP+LLITE+TYATTIRDSKR RERDFL
Sbjct: 162 MFYAKVGEESVVYTGDYNMTPDRHLGSAWIDQVRPNLLITETTYATTIRDSKRGRERDFL 221
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
K+VHECV++GGKVLIPVFALGR QELCIL+++YWE+MNL PIYF+ GL EKA YYK+F
Sbjct: 222 KRVHECVEKGGKVLIPVFALGRVQELCILIDSYWEQMNLNVPIYFSEGLAEKANFYYKLF 281
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
ITWTNQKI++TFV+RNMFDFKHI+PFD+ D PGPMV+FATPGMLH+G SL +FKKWAP
Sbjct: 282 ITWTNQKIKQTFVKRNMFDFKHIKPFDRHLADAPGPMVLFATPGMLHAGASLEVFKKWAP 341
Query: 181 VENNMLIMPGFCVQGTIGHKVLS---GVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
E NM I+PG+CV GT+G+K+LS G + +E + K ++VK I +SFSAHADAKGIM
Sbjct: 342 NELNMTIIPGYCVVGTVGNKLLSNAGGPQMVEIDKKTTLEVKCKIHNLSFSAHADAKGIM 401
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVN 297
QLI+ +PKNVLLVHGE KM +L +KI ++F + C+ PANG + +++ I ID+S
Sbjct: 402 QLIKMSQPKNVLLVHGEKEKMRYLSDKISKDFGVPCYFPANGVTVNIESSKLIPIDISSK 461
Query: 298 LLKEEAVKY--------------------------------NSEPP--------NPLKER 317
LLK + ++Y N +P +P+ +
Sbjct: 462 LLKRQILEYSQANSKQFEDQQQQQQTHQQQQKQDNSDENINNLDPDEQPFKKIRSPITQL 521
Query: 318 QIHGVLVIKD-SSISLMDVDEACKEVGISRHIVRFTSNVQISDSSPINKT-----LELIY 371
I G+LV KD ++ L+D EA E+ I+ + + N +I ++ T LE++
Sbjct: 522 PIQGILVAKDYQNLKLLDPSEASSELSINNINLNYILNYKIQTNNKSTTTSAMDGLEILQ 581
Query: 372 DQLLSYLQDKSQEYKIQLTESSLSIDSVLLSVERIDDKNKRVFVTWPNQDEEV 424
D++ L + ++ +S L I + + V + D ++ + W N E++
Sbjct: 582 DRITELL---PHDQINRINQSQLVIKNQV--VIQFDKVQQQFTIRWSNHHEQL 629
>gi|402590428|gb|EJW84358.1| RNA-metabolising metallo-beta-lactamase [Wuchereria bancrofti]
Length = 579
Score = 447 bits (1151), Expect = e-122, Method: Compositional matrix adjust.
Identities = 222/454 (48%), Positives = 315/454 (69%), Gaps = 35/454 (7%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDF 59
MFQ+ VG++S++YTGD+N TPDRHLGAA ++ +PDLLI+ESTYATTIRDSKR RERDF
Sbjct: 134 MFQIMVGSESVLYTGDFNTTPDRHLGAARVEPGLKPDLLISESTYATTIRDSKRARERDF 193
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKM 119
LKKVH+ V GGKVLIPVFALGRAQELCILLE+YWERMNL+ PI+F+ GL EKA YY++
Sbjct: 194 LKKVHDTVSNGGKVLIPVFALGRAQELCILLESYWERMNLKYPIFFSQGLAEKANQYYRL 253
Query: 120 FITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWA 179
FI+WTN+KI++TFV+RNMFDFKHIRPF++S+ID+PGPMV+F+TPGMLH G SL +F KW
Sbjct: 254 FISWTNEKIKRTFVERNMFDFKHIRPFEQSYIDSPGPMVLFSTPGMLHGGQSLRVFTKWC 313
Query: 180 PVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQL 239
E N++IMPGFCV T+G KV+SG+KK+E + + ++ + +EYMSFSAHADAKGIMQL
Sbjct: 314 SDEKNLIIMPGFCVANTVGAKVISGMKKIEIAGR-MREINLGVEYMSFSAHADAKGIMQL 372
Query: 240 IQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLL 299
I+ C+P+NV+ VHGE +KM FLKEK+++EFNL F PANGE+ ++T + +S+++ V +L
Sbjct: 373 IRDCQPRNVMFVHGEDAKMEFLKEKVEKEFNLKVFKPANGETITIETPLNVSLNIPVQIL 432
Query: 300 KEEAVKYNSEPPN---PLK------ERQIHGVLVI----------KDSSISLMDVDEACK 340
+ + + P P + +I LV+ K + + ++ + A +
Sbjct: 433 -DRCISLDPTPSKRFCPFRAFLTMDNSEISFALVLYMYVLTCFFFKTNELKVITPEAAAR 491
Query: 341 EVGISRHIVRFTSNVQISDSSPINKTLEL-IYDQLLSYLQDKSQEYKIQLTESSLSIDSV 399
++ S H + F+ +++ + +L +YD L +D I++ + +++ SV
Sbjct: 492 QLDTSLHTIAFSETIEVGHVNWKFLASKLRVYDSNLQVKED-----GIEMFDGEITLTSV 546
Query: 400 LLSVERIDDKNKRVFVTWPNQDEEVGKIVLHVLK 433
+ D K V +TW EE + ++ L+
Sbjct: 547 V-------DYPKHVEITWDEIREEWSEEIISALR 573
>gi|339244969|ref|XP_003378410.1| putative metallo-beta-lactamase domain protein [Trichinella
spiralis]
gi|316972680|gb|EFV56345.1| putative metallo-beta-lactamase domain protein [Trichinella
spiralis]
Length = 562
Score = 444 bits (1143), Expect = e-122, Method: Compositional matrix adjust.
Identities = 220/392 (56%), Positives = 287/392 (73%), Gaps = 11/392 (2%)
Query: 15 GDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFLKKVHECVDRGGKVL 74
GD+NMTPDRHLG A ID+CRPD+LI+ESTYATTIRDSKR RERDFLKKVH+C++ GGKVL
Sbjct: 144 GDFNMTPDRHLGPAEIDRCRPDVLISESTYATTIRDSKRARERDFLKKVHDCINNGGKVL 203
Query: 75 IPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMFITWTNQKIRKTFVQ 134
IPVFALGRAQELCILLE+YWERMNL PIY + G+ EKA +YYK+F+TWT++KI+KTFV+
Sbjct: 204 IPVFALGRAQELCILLESYWERMNLSIPIYVSKGMAEKAVDYYKLFVTWTSEKIKKTFVK 263
Query: 135 RNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQ 194
RNMFDFKH+ PF+ SF D PGPMVVFATPGMLHSG SL IFKKWA E NM+IMPG+CVQ
Sbjct: 264 RNMFDFKHVLPFEDSFADTPGPMVVFATPGMLHSGQSLKIFKKWATNEKNMVIMPGYCVQ 323
Query: 195 GTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGE 254
GT+G K+++GV+KL+ EN ++VK+++EYMSFSAHAD+KGI+QLI+ C+P++VL VHGE
Sbjct: 324 GTVGSKLIAGVRKLDLENGATLEVKLSVEYMSFSAHADSKGIVQLIRNCDPRHVLFVHGE 383
Query: 255 ASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLKEEAVKYNSEPPNPL 314
SKM FLK+K+++ ++ P NG++ V T I I+ S LLK+ +N +P+
Sbjct: 384 DSKMEFLKQKVERMLSIPVLKPRNGQNVAVNTIPSIVIETSPQLLKDSLSNFN----HPV 439
Query: 315 KERQIHGVLVIKDS--SISLMDVDEACKEVGISRHIVRFTSNVQISDSSPINKTLELIYD 372
L + + SI L+ DE K + H V F +V + + +N LE I
Sbjct: 440 DPDIFQASLFVDKNLKSIELLRSDEVLKRMEWEPHQVEFILDVGVCEGG-LNCLLEQIQQ 498
Query: 373 QLLSYLQDKSQEYKIQLTESSLSIDS-VLLSV 403
++ + +SQ + + L ID+ V+LSV
Sbjct: 499 EMYETGRFRSQ---FKSGTNDLIIDNEVILSV 527
>gi|384486005|gb|EIE78185.1| hypothetical protein RO3G_02889 [Rhizopus delemar RA 99-880]
Length = 613
Score = 444 bits (1143), Expect = e-121, Method: Compositional matrix adjust.
Identities = 224/423 (52%), Positives = 288/423 (68%), Gaps = 24/423 (5%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V+VG +S+VYTGDYNMTPDRHLG+AWIDK RPD+L+TESTYATTIRDSKR RERDFL
Sbjct: 140 MFYVRVGQESVVYTGDYNMTPDRHLGSAWIDKVRPDVLVTESTYATTIRDSKRSRERDFL 199
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KVHECV GG V+IPVFALGRAQELCIL+E+YW+RM L PIYF+ GLTE+AT +YK+F
Sbjct: 200 TKVHECVLNGGNVIIPVFALGRAQELCILIESYWDRMGLDVPIYFSTGLTERATEFYKLF 259
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
I WTNQKI+ TF QRNMFDFKHI+ +++++ID PGP V+FATPGML++G SL +FKKWAP
Sbjct: 260 INWTNQKIKSTFSQRNMFDFKHIKTWNRNYIDQPGPKVLFATPGMLNAGTSLEVFKKWAP 319
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
NM+IMPGFCV GT+G KVL G K +E + +V + + +SFSAHADAKGIMQLI
Sbjct: 320 DPKNMVIMPGFCVAGTVGSKVLLGQKVIEIDKFTRFEVNLQVRNLSFSAHADAKGIMQLI 379
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
+ CEP+NV+LVHGE KM FL KI +EF + C+MPANG S ++T + +S NLLK
Sbjct: 380 RQCEPRNVVLVHGERGKMNFLSSKIMKEFAIPCYMPANGCSIKMETLQVLEAKISTNLLK 439
Query: 301 EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRF-----TSNV 355
+ L+++Q L ++++I+ GI ++ F T NV
Sbjct: 440 SFLSNHQQSSLESLEQQQ--QSLTEREAAINTFRPQSNIPIQGIL--LMNFEGKQATPNV 495
Query: 356 QISDSSPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLLSVER----IDDKNK 411
+I SS +L Q+++Q E L + ++L + I+DK K
Sbjct: 496 EIVQSS-----------ELFEKYQNETQRQLYFQVEKPLHLSTILSDIAHTESSIEDKQK 544
Query: 412 RVF 414
RV
Sbjct: 545 RVL 547
>gi|426240429|ref|XP_004014105.1| PREDICTED: integrator complex subunit 11 [Ovis aries]
Length = 515
Score = 444 bits (1141), Expect = e-121, Method: Compositional matrix adjust.
Identities = 195/240 (81%), Positives = 225/240 (93%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+KVG++S+VYTGDYNMTPDRHLGAAWIDKCRP+LLITESTYATTIRDSKRCRERDFL
Sbjct: 163 MFQIKVGSESVVYTGDYNMTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFL 222
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHE V+RGGKVLIPVFALGRAQELCILLET+WERM+L+APIYF+ GLTEKA +YYK+F
Sbjct: 223 KKVHETVERGGKVLIPVFALGRAQELCILLETFWERMDLKAPIYFSTGLTEKANHYYKLF 282
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
I WTNQKIRKTFVQRNMF+FKHI+ FD++F D+PGPMVVFATPGMLH+G SL IF+KWA
Sbjct: 283 IPWTNQKIRKTFVQRNMFEFKHIKAFDRAFADSPGPMVVFATPGMLHAGQSLQIFRKWAG 342
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E NM+IMPG+CVQGT+GHK+LSG +KLE E +Q+++VKM +EYMSFSAHADAKGIMQL+
Sbjct: 343 NEKNMVIMPGYCVQGTVGHKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLV 402
Score = 57.0 bits (136), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 32/115 (27%), Positives = 63/115 (54%), Gaps = 5/115 (4%)
Query: 321 GVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDSSPINKTLELIYDQLLSYLQD 380
G++ + + L+ ++A KE+G++ H +RFT V + D+ + +Y L S L+D
Sbjct: 397 GIMQLVGQNFRLVSSEQALKELGLAEHQLRFTCRVHLHDTRKEQEMAMRVYSHLKSVLKD 456
Query: 381 KSQEYKIQLTESSLSIDSVLL--SVERIDDKNKRVFVTWPNQDEEVGKIVLHVLK 433
++ L + S++++S+L+ + D K + V+W QDEE+G + +LK
Sbjct: 457 HCVQH---LPDGSVTVESILVQAAAHSEDPGTKVLLVSWTYQDEELGSYLTSLLK 508
>gi|330796066|ref|XP_003286090.1| hypothetical protein DICPUDRAFT_30371 [Dictyostelium purpureum]
gi|325083909|gb|EGC37349.1| hypothetical protein DICPUDRAFT_30371 [Dictyostelium purpureum]
Length = 468
Score = 432 bits (1112), Expect = e-118, Method: Compositional matrix adjust.
Identities = 197/306 (64%), Positives = 248/306 (81%), Gaps = 4/306 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF KVG++S+VYTGDYNMTPDRHLG+AWID+ +PD+LITE+TYATTIRDSKR RERDFL
Sbjct: 162 MFYAKVGDESVVYTGDYNMTPDRHLGSAWIDQVKPDVLITETTYATTIRDSKRGRERDFL 221
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNL-QAPIYFAVGLTEKATNYYKM 119
K+VHECV++GGKVLIPVFALGR QELCIL+++YWE+MNL PIYF+ GL EKA YYK+
Sbjct: 222 KRVHECVEKGGKVLIPVFALGRVQELCILIDSYWEQMNLSHVPIYFSAGLAEKANLYYKL 281
Query: 120 FITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWA 179
FI WTNQKI++TFV+RNMFDFKHI+PF +D PG MV+FATPGMLH+G SL +FKKWA
Sbjct: 282 FINWTNQKIKQTFVKRNMFDFKHIKPFQSHLVDAPGAMVLFATPGMLHAGASLEVFKKWA 341
Query: 180 PVENNMLIMPGFCVQGTIGHKVL---SGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGI 236
P E NM I+PG+CV GT+G+K+L +G + +E + K ++VK I +SFSAHADAKGI
Sbjct: 342 PNELNMTIIPGYCVVGTVGNKLLANNAGSQMVEIDKKTTLEVKCKIHNLSFSAHADAKGI 401
Query: 237 MQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSV 296
+QLI+ P+NV+LVHGE KM FL +KI +E ++C+ PANG + + T I ID+S+
Sbjct: 402 LQLIKMSNPRNVILVHGEKEKMGFLSQKIIKEMGVNCYYPANGVTITIDTMKSIPIDISL 461
Query: 297 NLLKEE 302
LLK+E
Sbjct: 462 GLLKKE 467
>gi|66816359|ref|XP_642189.1| integrator complex subunit 11 [Dictyostelium discoideum AX4]
gi|74856745|sp|Q54YL3.1|INT11_DICDI RecName: Full=Integrator complex subunit 11 homolog
gi|60470287|gb|EAL68267.1| integrator complex subunit 11 [Dictyostelium discoideum AX4]
Length = 744
Score = 432 bits (1111), Expect = e-118, Method: Compositional matrix adjust.
Identities = 197/319 (61%), Positives = 250/319 (78%), Gaps = 12/319 (3%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF KVG++S+VYTGDYNMTPDRHLG+AWID+ +PD+LITE+TYATTIRDSKR RERDFL
Sbjct: 162 MFYAKVGDESVVYTGDYNMTPDRHLGSAWIDQVKPDVLITETTYATTIRDSKRGRERDFL 221
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNL-QAPIYFAVGLTEKATNYYKM 119
K++HECV++GGKVLIPVFALGR QELCIL+++YWE+MNL PIYF+ GL EKA YYK+
Sbjct: 222 KRIHECVEKGGKVLIPVFALGRVQELCILIDSYWEQMNLGHIPIYFSAGLAEKANLYYKL 281
Query: 120 FITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWA 179
FI WTNQKI++TFV+RNMFDFKHI+PF +D PG MV+FATPGMLH+G SL +FKKWA
Sbjct: 282 FINWTNQKIKQTFVKRNMFDFKHIKPFQSHLVDAPGAMVLFATPGMLHAGASLEVFKKWA 341
Query: 180 PVENNMLIMPGFCVQGTIGHKVLSG-----------VKKLEFENKQIIDVKMAIEYMSFS 228
P E NM I+PG+CV GT+G+K+L+ + +E + K I+VK I +SFS
Sbjct: 342 PNELNMTIIPGYCVVGTVGNKLLTTGSDQQQQSKPQSQMVEIDKKTTIEVKCKIHNLSFS 401
Query: 229 AHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDM 288
AHADAKGI+QLI+ P+NV+LVHGE KM FL +KI +E ++C+ PANG + + T
Sbjct: 402 AHADAKGILQLIKMSNPRNVILVHGEKEKMGFLSQKIIKEMGVNCYYPANGVTIIIDTMK 461
Query: 289 KISIDVSVNLLKEEAVKYN 307
I ID+S+NLLK + + Y+
Sbjct: 462 SIPIDISLNLLKRQILDYS 480
>gi|313238583|emb|CBY13629.1| unnamed protein product [Oikopleura dioica]
Length = 618
Score = 431 bits (1109), Expect = e-118, Method: Compositional matrix adjust.
Identities = 214/449 (47%), Positives = 298/449 (66%), Gaps = 37/449 (8%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF++ VG++S++YTGD+NMTPDRHLGAAW D+C+P +LI+ESTYATTIRDSKR RERDFL
Sbjct: 167 MFKITVGDESVLYTGDFNMTPDRHLGAAWADRCKPTVLISESTYATTIRDSKRSRERDFL 226
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KK+H CV+ GGKVLIPVFALGRAQELCILLE YW+RM L P+YF GL EKATNYYK+F
Sbjct: 227 KKIHRCVENGGKVLIPVFALGRAQELCILLEQYWDRMKLNVPVYFTAGLAEKATNYYKLF 286
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSF-IDNPGPMVVFATPGMLHSGLSLIIFKKWA 179
+ WTN+KI+ +FV+RN+FDFK+I+ F K ++ GP V FATPGMLH+G+SL IF+ W
Sbjct: 287 VNWTNEKIKSSFVERNLFDFKYIKAFQKEIHMNQSGPQVCFATPGMLHAGMSLEIFQNWC 346
Query: 180 PVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQL 239
E N +IMPG+CV GT+GH++L G + +F + ++ +EYMSFSAHADAKGIMQ+
Sbjct: 347 TDEKNCIIMPGYCVAGTVGHRLLHGERHFKFNGVNVTS-RIKVEYMSFSAHADAKGIMQI 405
Query: 240 IQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGE----SCFVQTDMKISIDVS 295
I+ EPKNV+ VHGEA+KM FL +K++++ L CF PANGE SC Q +KI D++
Sbjct: 406 IKMTEPKNVMFVHGEAAKMEFLAKKVQKDMGLPCFTPANGETVNISCPPQIKVKIESDLA 465
Query: 296 VNLLKEEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNV 355
++K L H LV + + L+ D A K++ + +H + ++V
Sbjct: 466 RKIMKRS-----------LNRAAFHAALVNDGNELELLHPDVAMKKLKVKQHHFQMNTSV 514
Query: 356 QISDSSPINKTLELIYDQLLSYLQDKSQEYK----------IQLTESSLSIDSVLLSVER 405
+ + + + + + L+ D S K I+L +S ++ + E+
Sbjct: 515 NLPEGNSTPEQIIVSLRLLIKTYIDGSLMEKWVPVGNMMNSIELASPDMS-QTISIVAEK 573
Query: 406 I-------DDKNKRVF-VTWP-NQDEEVG 425
+ + K++RV VTWP + D+E+G
Sbjct: 574 LKTFDTFGNHKSERVLKVTWPLSVDDEIG 602
>gi|268530366|ref|XP_002630309.1| Hypothetical protein CBG00745 [Caenorhabditis briggsae]
Length = 637
Score = 423 bits (1087), Expect = e-115, Method: Compositional matrix adjust.
Identities = 194/316 (61%), Positives = 251/316 (79%), Gaps = 6/316 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWI-DKCRPDLLITESTYATTIRDSKRCRERDF 59
MF+++VG+ S++YTGDYNMTPDRHLGAA + RP +LI+ESTYATTIRDSKR RERDF
Sbjct: 193 MFEIRVGDHSVLYTGDYNMTPDRHLGAARVLPGVRPTILISESTYATTIRDSKRARERDF 252
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKM 119
L+KVHE V +GGKV+IPVFALGRAQELCILLE+YWERM L PIYF+ GL E+A YY++
Sbjct: 253 LRKVHETVMKGGKVIIPVFALGRAQELCILLESYWERMALNVPIYFSQGLAERANQYYRL 312
Query: 120 FITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWA 179
FI+WTN+ I+KTFV+RNMF+FKHIRP +K D PGP V+F+TPGMLH G SL +FKKW
Sbjct: 313 FISWTNENIKKTFVERNMFEFKHIRPMEKGCEDQPGPQVLFSTPGMLHGGQSLKVFKKWC 372
Query: 180 PVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQL 239
NM+IMPG+CV GT+G +V++G K++E + K + ++K+ +EYMSFSAHADAKGIMQL
Sbjct: 373 SDPLNMIIMPGYCVAGTVGARVINGEKRIEIDGK-VHEIKLGVEYMSFSAHADAKGIMQL 431
Query: 240 IQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLL 299
I+ CEPK+V+ VHGEA KM FLK K+++E+++ MPANGE+C ++ + K+ I V +
Sbjct: 432 IRQCEPKHVMFVHGEAEKMEFLKGKVEKEYDIPVHMPANGETCVIEVEPKMDIRVPL--- 488
Query: 300 KEEAVKYNSEPPNPLK 315
E+ K + PNP K
Sbjct: 489 -EKINKSIALDPNPSK 503
>gi|341890123|gb|EGT46058.1| hypothetical protein CAEBREN_05882 [Caenorhabditis brenneri]
Length = 618
Score = 421 bits (1082), Expect = e-114, Method: Compositional matrix adjust.
Identities = 194/318 (61%), Positives = 253/318 (79%), Gaps = 6/318 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWI-DKCRPDLLITESTYATTIRDSKRCRERDF 59
MF+++VG+ S++YTGDYNMTPDRHLGAA + RP +LI+ESTYATTIRDSKR RERDF
Sbjct: 167 MFEIRVGDHSVLYTGDYNMTPDRHLGAARVLPGVRPTVLISESTYATTIRDSKRARERDF 226
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKM 119
L+KVHE V +GGKV+IPVFALGRAQELCILLE+YWERM L PIYF+ GL E+A YY++
Sbjct: 227 LRKVHESVMKGGKVIIPVFALGRAQELCILLESYWERMALTVPIYFSQGLAERANQYYRL 286
Query: 120 FITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWA 179
FI+WTN+ I+KTFV+RNMF+FKHIRP +K D PGP V+F+TPGMLH G SL +FKKW
Sbjct: 287 FISWTNENIKKTFVERNMFEFKHIRPMEKGCEDMPGPQVLFSTPGMLHGGQSLKVFKKWC 346
Query: 180 PVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQL 239
NM+IMPG+CV GT+G KV++G K++E + K + D+++ +EYMSFSAHADAKGIMQL
Sbjct: 347 SDPINMIIMPGYCVAGTVGAKVINGEKRIEIDGK-MHDIRLGVEYMSFSAHADAKGIMQL 405
Query: 240 IQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLL 299
I+ CEP++V+ VHGEA KM FLK K+++E+N+ MPANGE+ + K+ I V ++ +
Sbjct: 406 IRQCEPQHVMFVHGEAEKMEFLKGKVEKEYNVPVHMPANGETVSITAHPKLDIQVPMDQI 465
Query: 300 KEEAVKYNSEPPNPLKER 317
E+++ + PNP K+R
Sbjct: 466 -EQSLNLD---PNPAKQR 479
>gi|449686516|ref|XP_002164154.2| PREDICTED: integrator complex subunit 11-like [Hydra
magnipapillata]
Length = 526
Score = 419 bits (1076), Expect = e-114, Method: Compositional matrix adjust.
Identities = 217/413 (52%), Positives = 289/413 (69%), Gaps = 23/413 (5%)
Query: 47 TIRDSKRCRERDFLKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFA 106
T ++ K C ++ +H+CV +VLIPVFALGRAQELCILLET+W+RMN++APIYF+
Sbjct: 122 TSQNIKDCMKKVIPVNLHQCVQL--QVLIPVFALGRAQELCILLETFWDRMNIKAPIYFS 179
Query: 107 VGLTEKATNYYKMFITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGML 166
+GLTEKA +YYK+FITWTNQKIR TFVQRN+FDFKHI+ FD+S+IDNPGPMVVFATPGML
Sbjct: 180 MGLTEKANHYYKLFITWTNQKIRNTFVQRNLFDFKHIKAFDRSYIDNPGPMVVFATPGML 239
Query: 167 HSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMS 226
H+GLSL IFKKWAP ENN LIMPG+CV GT+GHKVLSG KK+E + K +DVK++++Y+S
Sbjct: 240 HAGLSLQIFKKWAPDENNTLIMPGYCVIGTVGHKVLSGQKKIELDKKTTVDVKLSVQYLS 299
Query: 227 FSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQT 286
FSAHADA GIMQLI+ CEPKNV+LVHGE SKM FLK +I +EFNLDC+MPANG+S +
Sbjct: 300 FSAHADASGIMQLIRQCEPKNVMLVHGEGSKMEFLKSQINKEFNLDCYMPANGQSITIPM 359
Query: 287 DMKISIDVSVNLLKEEAVKYNSEPPNPL-----KERQIHGVLVIKDSSISLMDVDEACKE 341
K+ ID+S +LK + S+ E + G+LV+K+ L++ A E
Sbjct: 360 QPKVVIDMSKGMLKRKQANAESQSKRHRLIKVNNEVLVDGLLVMKNMEAKLVESHMAAHE 419
Query: 342 VGISRHIVRFTSNVQISDSSPINKTLELIYDQLLSYLQDKSQEYKIQ-LTESSLSIDSVL 400
+ + H +RFTS+V+I S +TL+ + L L++ +Q L+ SL++DSVL
Sbjct: 420 LNLDDHQLRFTSSVKIESS----ETLDEFFSNLKLDLEECFTTNSVQKLSAHSLAVDSVL 475
Query: 401 L---SVERIDDKN------KRVFVTWPNQDEEVGKIVLHVLKSMSNTQGQPFE 444
L S I+ N K + ++W +D+ +G VL L M +T+ F+
Sbjct: 476 LKYHSSAAIESNNKDSSSVKHLLISWKFEDDVLGSRVLDFL--MPSTETTEFD 526
>gi|341903207|gb|EGT59142.1| hypothetical protein CAEBREN_31222 [Caenorhabditis brenneri]
Length = 571
Score = 418 bits (1075), Expect = e-114, Method: Compositional matrix adjust.
Identities = 193/318 (60%), Positives = 253/318 (79%), Gaps = 6/318 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWI-DKCRPDLLITESTYATTIRDSKRCRERDF 59
MF+++VG+ S++YTGDYNMTPDRHLGAA + RP +LI+ESTYATTIRDSKR RERDF
Sbjct: 170 MFEIRVGDHSVLYTGDYNMTPDRHLGAARVLPGVRPTVLISESTYATTIRDSKRARERDF 229
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKM 119
L+KVHE V +GGKV+IPVFALGRAQELCILLE+YWERM L PIYF+ GL E+A YY++
Sbjct: 230 LRKVHESVMKGGKVIIPVFALGRAQELCILLESYWERMALTVPIYFSQGLAERANQYYRL 289
Query: 120 FITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWA 179
FI+WTN+ I+KTFV+RNMF+FKHIRP +K D PGP V+F+TPGMLH G SL +FKKW
Sbjct: 290 FISWTNENIKKTFVERNMFEFKHIRPMEKGCEDMPGPQVLFSTPGMLHGGQSLKVFKKWC 349
Query: 180 PVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQL 239
NM+IMPG+CV GT+G KV++G K++E + K + D+++ +EYMSFSAHADAKGIMQL
Sbjct: 350 SDPINMIIMPGYCVAGTVGAKVINGEKRIEIDGK-MHDIRLGVEYMSFSAHADAKGIMQL 408
Query: 240 IQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLL 299
I+ CEP++V+ VHGEA KM FLK K+++E+++ MPANGE+ + K+ I V ++ +
Sbjct: 409 IRQCEPQHVMFVHGEAEKMEFLKGKVEKEYSVPVHMPANGETVSITAHPKLDIQVPMDQI 468
Query: 300 KEEAVKYNSEPPNPLKER 317
E+++ + PNP K+R
Sbjct: 469 -EQSLNLD---PNPAKQR 482
>gi|167525469|ref|XP_001747069.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163774364|gb|EDQ87993.1| predicted protein [Monosiga brevicollis MX1]
Length = 730
Score = 417 bits (1072), Expect = e-113, Method: Compositional matrix adjust.
Identities = 185/322 (57%), Positives = 242/322 (75%), Gaps = 7/322 (2%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V+VGNQS+VYTGDYNM+PDRHLGAAWID+C PD++I+ESTYATTIRDS+R RE D L
Sbjct: 168 MFYVRVGNQSVVYTGDYNMSPDRHLGAAWIDRCEPDVIISESTYATTIRDSRRAREHDLL 227
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
K+ +CV RGGKVL+PVFALGRAQELCILLET+W+R ++ PIYF+ GLT +A YYK+F
Sbjct: 228 TKITQCVQRGGKVLLPVFALGRAQELCILLETHWQRTGMRVPIYFSTGLTARANEYYKLF 287
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
ITWTNQK+++TFV+RN+FDF+H++PFD+S++++ GP V+FATPGMLH+G SL+ F W
Sbjct: 288 ITWTNQKLKETFVERNLFDFQHVQPFDRSYLEHAGPQVLFATPGMLHAGTSLLAFTHWCE 347
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
NM+I+PG+C GT+G K+++G+++L+ E + V+M +EY+SFSAHADAKGIMQL+
Sbjct: 348 DPRNMVILPGYCTAGTVGAKIIAGIRELDIEGRHYT-VRMDVEYLSFSAHADAKGIMQLV 406
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLL- 299
K V+LVHGE KM FL +I+QE L C P NG + T+ + I VS L
Sbjct: 407 SQSGAKQVVLVHGEQQKMAFLAGRIQQELGLPCHFPPNGHCLTLSTNNPVPILVSDAFLP 466
Query: 300 --KEEAVKYN---SEPPNPLKE 316
K+E K S P P E
Sbjct: 467 VAKDEVAKAEPSFSPAPGPKAE 488
>gi|32564696|ref|NP_495706.2| Protein F10B5.8 [Caenorhabditis elegans]
gi|26985793|emb|CAB54223.2| Protein F10B5.8 [Caenorhabditis elegans]
Length = 608
Score = 417 bits (1072), Expect = e-113, Method: Compositional matrix adjust.
Identities = 192/316 (60%), Positives = 249/316 (78%), Gaps = 6/316 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWI-DKCRPDLLITESTYATTIRDSKRCRERDF 59
MF++++G+ S++YTGDYNMTPDRHLGAA + RP +LI+ESTYATTIRDSKR RERDF
Sbjct: 167 MFEIRLGDHSVLYTGDYNMTPDRHLGAARVLPGVRPTVLISESTYATTIRDSKRARERDF 226
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKM 119
L+KVHECV +GGKV+IPVFALGRAQELCILLE+YWERM L PIYF+ GL E+A YY++
Sbjct: 227 LRKVHECVMKGGKVIIPVFALGRAQELCILLESYWERMALNVPIYFSQGLAERANQYYRL 286
Query: 120 FITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWA 179
FI+WTN+ I+KTFV+RNMF+FKHI+P +K D PGP V+F+TPGMLH G SL +FKKW
Sbjct: 287 FISWTNENIKKTFVERNMFEFKHIKPMEKGCEDQPGPQVLFSTPGMLHGGQSLKVFKKWC 346
Query: 180 PVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQL 239
NM+IMPG+CV GT+G +V++G KK+E + K + ++++ +EYMSFSAHADAKGIMQL
Sbjct: 347 SDPLNMIIMPGYCVAGTVGARVINGEKKIEIDQK-MHEIRLGVEYMSFSAHADAKGIMQL 405
Query: 240 IQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLL 299
I+ CEP++V+ VHGEASKM FLK K+++E+ + MPANGE+ + K+ I V +
Sbjct: 406 IRQCEPQHVMFVHGEASKMEFLKGKVEKEYKVPVHMPANGETVVISAQPKLDIRVPL--- 462
Query: 300 KEEAVKYNSEPPNPLK 315
E+ + S PNP K
Sbjct: 463 -EKIDRSLSLDPNPAK 477
>gi|308509314|ref|XP_003116840.1| hypothetical protein CRE_01624 [Caenorhabditis remanei]
gi|308241754|gb|EFO85706.1| hypothetical protein CRE_01624 [Caenorhabditis remanei]
Length = 612
Score = 414 bits (1065), Expect = e-112, Method: Compositional matrix adjust.
Identities = 192/316 (60%), Positives = 247/316 (78%), Gaps = 3/316 (0%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWI-DKCRPDLLITESTYATTIRDSKRCRERDF 59
MF++++G+ S++YTGDYNMTPDRHLGAA + RP +LI+ESTYATTIRDSKR RERDF
Sbjct: 167 MFEIRLGDHSVLYTGDYNMTPDRHLGAARVLPGVRPTVLISESTYATTIRDSKRARERDF 226
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKM 119
L+KVHE V +GGKV+IPVFALGRAQELCILLE+YWERM L PIYF+ GL E+A YY++
Sbjct: 227 LRKVHETVMKGGKVIIPVFALGRAQELCILLESYWERMALSVPIYFSQGLAERANQYYRL 286
Query: 120 FITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWA 179
FI+WTN+ I+KTFV+RNMF+FKHIRP +K D PGP V+F+TPGMLH G SL +FKKW
Sbjct: 287 FISWTNENIKKTFVERNMFEFKHIRPMEKGCEDQPGPQVLFSTPGMLHGGQSLKVFKKWC 346
Query: 180 PVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQL 239
NM+IMPG+CV GT+G +V++G KK+E + K + D+K+ +EYMSFSAHADAKGIMQL
Sbjct: 347 GDPLNMIIMPGYCVAGTVGARVINGEKKIEIDGK-MHDIKLGVEYMSFSAHADAKGIMQL 405
Query: 240 IQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLL 299
I+ CEP++V+ VHGEA KM FLK K+++E+ + MPANGE+ + K+ I V + +
Sbjct: 406 IRQCEPQHVMFVHGEAEKMEFLKGKVEKEYKVPVHMPANGETVVITAQPKLDIKVPLEVC 465
Query: 300 KEEAVKYNSEPPNPLK 315
+ + S PNP K
Sbjct: 466 N-KIDRSLSLDPNPSK 480
>gi|428177137|gb|EKX46018.1| hypothetical protein GUITHDRAFT_70813 [Guillardia theta CCMP2712]
Length = 485
Score = 414 bits (1064), Expect = e-112, Method: Compositional matrix adjust.
Identities = 193/304 (63%), Positives = 238/304 (78%), Gaps = 9/304 (2%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF ++VG QSI+YTGDYNMTPDRHLG+A DK RPDLLITESTYATTIR+SKR RERD L
Sbjct: 164 MFYIRVGQQSILYTGDYNMTPDRHLGSARCDKLRPDLLITESTYATTIRESKRWRERDML 223
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
+V ECV GGKVLIPVFALGRAQELC+LL+ +WER L+ PIYF+ GLTEKA YYKM+
Sbjct: 224 NQVSECVRNGGKVLIPVFALGRAQELCLLLDAFWERTGLKVPIYFSAGLTEKANLYYKMY 283
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
I+WTNQKI+ TFV+RN+FDF+HI+PFD++FID PGPMV+FATPGMLH GLS+ +FKKWAP
Sbjct: 284 ISWTNQKIKDTFVKRNVFDFQHIQPFDRAFIDRPGPMVLFATPGMLHGGLSMEVFKKWAP 343
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEF------ENKQIIDVKMAIEYMSFSAHADAK 234
+ N++IMPG+CV GT+GHKVLS K + E + V+ I+++SFSAHADAK
Sbjct: 344 SDKNLVIMPGYCVAGTLGHKVLSNGGKPQIIDMPRDEGGGKLHVRCKIKHLSFSAHADAK 403
Query: 235 GIMQLIQYC---EPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKIS 291
GIM LI+ +P NV+LVHGE KM LKE+IK EF L C+ PANG + ++ T +
Sbjct: 404 GIMTLIKQAISVDPVNVMLVHGERHKMEALKERIKTEFGLPCYDPANGATTYITTFHGVP 463
Query: 292 IDVS 295
+ +S
Sbjct: 464 VQLS 467
>gi|168034228|ref|XP_001769615.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162679157|gb|EDQ65608.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 563
Score = 412 bits (1060), Expect = e-112, Method: Compositional matrix adjust.
Identities = 194/356 (54%), Positives = 260/356 (73%), Gaps = 4/356 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF VK G+ ++VYTGDYNMTPDRHLGAA ID+ PDLLITESTYATT+RDSKR RER+FL
Sbjct: 156 MFWVKAGDDTVVYTGDYNMTPDRHLGAAQIDRLEPDLLITESTYATTVRDSKRAREREFL 215
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
K VH+CV GGKVLIPVFALGRAQELCILL+ YWER NL PIY + GLT +A YYK+
Sbjct: 216 KAVHKCVAAGGKVLIPVFALGRAQELCILLDEYWERTNLDMPIYISAGLTMQANVYYKLL 275
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
I+WTNQK++ T+V RN FDFKH+ PF++S ID PGP V+FATPGML GLSL +FK WAP
Sbjct: 276 ISWTNQKVKDTYVTRNTFDFKHVIPFERSKIDAPGPCVLFATPGMLSGGLSLEVFKHWAP 335
Query: 181 VENNMLIMPGFCVQGTIGHKVLSG-VKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQL 239
E+NM+I+PGFCV GT+G K++ G K++ + + +DV+ I+++SFSAH DAKGI+ L
Sbjct: 336 SESNMIILPGFCVAGTVGSKLMPGKPAKIDLDKRTTLDVRCQIQHLSFSAHTDAKGILDL 395
Query: 240 IQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLL 299
+++ P+NV+LVHGE KM LK+KI + + C+ PAN E+ + I + VS L
Sbjct: 396 VRHVAPRNVVLVHGEKPKMAILKKKISSDLCIPCYDPANLETVEITPRCPIKVGVSKQFL 455
Query: 300 KEEAVKYNS--EPPNPLKERQIHGVLVIKDS-SISLMDVDEACKEVGISRHIVRFT 352
+ K + + P ++ ++G ++++DS I ++ DEA +G++RH + F+
Sbjct: 456 ESNLKKKDRQFQLPRAREDILVNGFVIMEDSLRIKVIHPDEASSVLGLTRHHMSFS 511
>gi|255570075|ref|XP_002526000.1| cleavage and polyadenylation specificity factor, putative [Ricinus
communis]
gi|223534732|gb|EEF36424.1| cleavage and polyadenylation specificity factor, putative [Ricinus
communis]
Length = 963
Score = 393 bits (1009), Expect = e-106, Method: Compositional matrix adjust.
Identities = 198/413 (47%), Positives = 274/413 (66%), Gaps = 25/413 (6%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF KVG+ ++VYTGDYNMTPDRHLGAA ID+ + DLLITESTYATTIRDSK RER+FL
Sbjct: 162 MFYAKVGDSAMVYTGDYNMTPDRHLGAAQIDRLQLDLLITESTYATTIRDSKYAREREFL 221
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
K VH+CV GGKVLIP FALGRAQELC+LL+ YWERMNL+ PIYF+ GLT +A YYKM
Sbjct: 222 KVVHKCVAGGGKVLIPTFALGRAQELCLLLDDYWERMNLKVPIYFSAGLTIQANMYYKML 281
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
I WT+QKI++T+ RN FDFK++ FD+S +D PGP V+FATPGM+ G SL +FK+WAP
Sbjct: 282 IGWTSQKIKETYTSRNAFDFKNVYTFDRSLLDAPGPCVLFATPGMISGGFSLEVFKRWAP 341
Query: 181 VENNMLIMPGFCVQGTIGHKVLSG-VKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQL 239
E N++ +PG+CV GTIGHK++SG K+ + IDV+ I +SFS H DAKGIM L
Sbjct: 342 CEMNLVTLPGYCVAGTIGHKLMSGKPSKINLDKDTQIDVRCQIHQLSFSPHTDAKGIMDL 401
Query: 240 IQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLL 299
+++ PK+V+LVHGE KM LKE+I+ E + C++PAN E+ + + + + D S +
Sbjct: 402 VKFLSPKHVILVHGEKPKMASLKERIQSELEIQCYVPANCETLCIPSTLFVKADASEAFI 461
Query: 300 K-----------------EEAVKYNSEPPNPLK---ERQIHGVLVI-KDSSISLMDVDEA 338
+ + V +++ + L+ ER G+LV+ K+ ++ DE
Sbjct: 462 RSCLSPNFRFLNKSLKDTSDLVLHSTNATSRLEVSDERVAEGILVVEKNKKARVVHQDEL 521
Query: 339 CKEVGISRHIVRFT--SNVQISDSSPINKTLELIYDQLLSYLQDKSQEYKIQL 389
+G +H V+F VQ+ + + +D+LL+ L++++ + I L
Sbjct: 522 LLMLGAKQHEVQFAYCCPVQVDNMDQTRRDPSSTHDELLT-LENETGKSAIDL 573
>gi|403337788|gb|EJY68117.1| Integrator complex subunit 11 [Oxytricha trifallax]
Length = 771
Score = 393 bits (1009), Expect = e-106, Method: Compositional matrix adjust.
Identities = 182/302 (60%), Positives = 234/302 (77%), Gaps = 2/302 (0%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V+ +S+VYTGDYNMT DRHLGAAWIDK RPD+ ITE+TYATTIRDSKR RER+FL
Sbjct: 180 MFYVECNGESVVYTGDYNMTADRHLGAAWIDKLRPDVCITETTYATTIRDSKRSREREFL 239
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
K VHE +D GGKVLIPVFALGRAQELC+LLETYW R NLQ PIYF+ GLTEKA YYK+F
Sbjct: 240 KVVHETLDNGGKVLIPVFALGRAQELCVLLETYWNRTNLQYPIYFSGGLTEKANFYYKLF 299
Query: 121 ITWTNQKIRKTFVQ-RNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWA 179
I WTN+KI+KTF + +NMF F+H++ D + I + PMV FA+PGMLH G SL IFK WA
Sbjct: 300 INWTNEKIKKTFTKNQNMFQFQHVKTLDTASIKSDQPMVCFASPGMLHGGYSLQIFKDWA 359
Query: 180 PVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQL 239
E N LI+PG+C+ GT+G+K+LSG KK+ + + IDV++ I MSFSAHAD+KGIM+L
Sbjct: 360 GQEKNTLIIPGYCMPGTVGNKLLSGEKKINIDGRD-IDVRIKIYNMSFSAHADSKGIMEL 418
Query: 240 IQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLL 299
+ + E KNV+LVHGE KM LK +I+++ + C+ PAN + ++TD K+ + +S +++
Sbjct: 419 LSHLESKNVVLVHGEKQKMAVLKLQIEEQLDQKCYYPANYATLQIKTDPKVEMLISSHII 478
Query: 300 KE 301
+
Sbjct: 479 NK 480
>gi|224108267|ref|XP_002314781.1| predicted protein [Populus trichocarpa]
gi|222863821|gb|EEF00952.1| predicted protein [Populus trichocarpa]
Length = 639
Score = 391 bits (1005), Expect = e-106, Method: Compositional matrix adjust.
Identities = 183/347 (52%), Positives = 242/347 (69%), Gaps = 21/347 (6%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF KVG+ ++VYTGDYNMTPDRHLGAA ID+ DLLITESTYATTIRDSK RER+FL
Sbjct: 162 MFYAKVGDSAMVYTGDYNMTPDRHLGAAQIDRLELDLLITESTYATTIRDSKYAREREFL 221
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
K VHECV GGKVLIP FALGRAQELCILL+ YWERMNL+ PIYF+ GLT +A YYK+
Sbjct: 222 KAVHECVAGGGKVLIPTFALGRAQELCILLDDYWERMNLKVPIYFSAGLTIQANLYYKIL 281
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
I+WT+QK+++T+ RN FDFKH+ FD+S I+ PGP V+FATPGM+ G SL +FK+WAP
Sbjct: 282 ISWTSQKVKETYATRNAFDFKHVHNFDRSLINAPGPCVLFATPGMISGGFSLEVFKQWAP 341
Query: 181 VENNMLIMPGFCVQGTIGHKVLSG-VKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQL 239
E N++ +PG+CV GT+GHK++SG K+ + IDV+ I +SFS H D+KGIM L
Sbjct: 342 CEMNLITLPGYCVAGTVGHKLMSGKPTKINLDKDTQIDVRCQIHQLSFSPHTDSKGIMDL 401
Query: 240 IQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLL 299
++ P+NV+LVHGE KMV LKE+I+ E + C++PAN ++ + + + + S +
Sbjct: 402 TKFLSPRNVILVHGEKPKMVSLKERIQTELRIPCYLPANCDAVHIPSTIYVKAHASNTFI 461
Query: 300 K-----------------EEAVKYNSEPPNPLK---ERQIHGVLVIK 326
+ + V N+ P PL+ ER G+L+++
Sbjct: 462 RSCLNPNFRFLKRSKEDNSDQVLRNTNPTAPLQVNDERVAEGILIME 508
>gi|115479027|ref|NP_001063107.1| Os09g0397900 [Oryza sativa Japonica Group]
gi|50252615|dbj|BAD28786.1| putative FEG protein [Oryza sativa Japonica Group]
gi|113631340|dbj|BAF25021.1| Os09g0397900 [Oryza sativa Japonica Group]
gi|218202115|gb|EEC84542.1| hypothetical protein OsI_31281 [Oryza sativa Indica Group]
gi|222641522|gb|EEE69654.1| hypothetical protein OsJ_29268 [Oryza sativa Japonica Group]
Length = 559
Score = 388 bits (997), Expect = e-105, Method: Compositional matrix adjust.
Identities = 184/366 (50%), Positives = 248/366 (67%), Gaps = 15/366 (4%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
M KVG+ +IVYTGDYNMTPDRHLGAA ID+ + DLLITESTYA T+RDSK RER+FL
Sbjct: 165 MIYAKVGDAAIVYTGDYNMTPDRHLGAAQIDRLKLDLLITESTYAKTVRDSKHAREREFL 224
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
K VH+CV GGKVLIP FALGRAQELCILL+ YWERMNL+ PIYF+ GLT +A YYKM
Sbjct: 225 KAVHKCVSGGGKVLIPAFALGRAQELCILLDDYWERMNLKIPIYFSAGLTIQANMYYKML 284
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
I WT+QKI+ ++ N FDFKH+ F++SFI+NPGP V+FATPGM+ G SL +FKKWAP
Sbjct: 285 IGWTSQKIKNSYTVHNPFDFKHVCHFERSFINNPGPCVLFATPGMISGGFSLEVFKKWAP 344
Query: 181 VENNMLIMPGFCVQGTIGHKVLSG-VKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQL 239
E N++ +PG+CV GTIGHK++SG +++ + IDV+ I +SFS H D+KGIM L
Sbjct: 345 SEKNLVTLPGYCVAGTIGHKLMSGKPTRIDIDKDTHIDVRCQIHQLSFSPHTDSKGIMDL 404
Query: 240 IQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLL 299
++ P +V+LVHGE +M FLKE+I+ E + C PAN E+ + T + I+ + +
Sbjct: 405 TEFLSPSHVILVHGEKPQMAFLKERIESELGMQCCYPANNETVSIPTSQNLKINATEKFI 464
Query: 300 ------------KEEAVKYNSEPPNPLKERQI-HGVLVI-KDSSISLMDVDEACKEVGIS 345
+++ + + + P + + GVL++ K + ++ DE +G+
Sbjct: 465 VSFCMDETENDPQKQNLNFGGDMPQGCRTEGVAEGVLLMEKSKTPKILREDELLHSLGME 524
Query: 346 RHIVRF 351
H V F
Sbjct: 525 THFVHF 530
>gi|328766828|gb|EGF76880.1| hypothetical protein BATDEDRAFT_14507, partial [Batrachochytrium
dendrobatidis JAM81]
Length = 475
Score = 387 bits (995), Expect = e-104, Method: Compositional matrix adjust.
Identities = 182/303 (60%), Positives = 228/303 (75%), Gaps = 8/303 (2%)
Query: 1 MFQVKVGN-----QSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCR 55
MF V+V + QS+VYTGDYNMTPDRHLGAA ID C PDL+ITE+TYATTIRDSKR R
Sbjct: 167 MFYVRVTDGYGVTQSVVYTGDYNMTPDRHLGAAQIDGCEPDLIITETTYATTIRDSKRAR 226
Query: 56 ERDFLKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNL---QAPIYFAVGLTEK 112
ERDFLKKVH+CV GGKVL+PVFALGRAQEL IL+E+YW RM+ + P+YF+ GLTE+
Sbjct: 227 ERDFLKKVHDCVSGGGKVLVPVFALGRAQELLILIESYWRRMDDLCDKVPVYFSTGLTER 286
Query: 113 ATNYYKMFITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSL 172
A YYK+FI+WTN+ ++ V+RNMFDF HIR + SF D PG MV+FATPGMLH+G SL
Sbjct: 287 ANEYYKLFISWTNENVKSALVERNMFDFAHIRSWSHSFADEPGAMVLFATPGMLHAGTSL 346
Query: 173 IIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHAD 232
+FKKW NM+IMPG+CV GT+G KVL+G K + + ++V + I +SFSAHAD
Sbjct: 347 EVFKKWCHDPKNMIIMPGYCVAGTVGAKVLAGEKVINVDRFTKLNVNLQISNLSFSAHAD 406
Query: 233 AKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISI 292
AKGI+QLI+ + KNV+LVHGE KM LK +I++E + CF PANG + + T ++I
Sbjct: 407 AKGILQLIRASKAKNVMLVHGEKKKMAQLKIRIERELGIPCFDPANGAAICIPTKLEIPA 466
Query: 293 DVS 295
VS
Sbjct: 467 LVS 469
>gi|119576643|gb|EAW56239.1| cleavage and polyadenylation specific factor 3-like, isoform CRA_g
[Homo sapiens]
gi|119576646|gb|EAW56242.1| cleavage and polyadenylation specific factor 3-like, isoform CRA_g
[Homo sapiens]
Length = 342
Score = 387 bits (995), Expect = e-104, Method: Compositional matrix adjust.
Identities = 184/339 (54%), Positives = 247/339 (72%), Gaps = 7/339 (2%)
Query: 97 MNLQAPIYFAVGLTEKATNYYKMFITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGP 156
MNL+ PIYF+ GLTEKA +YYK+FI WTNQKIRKTFVQRNMF+FKHI+ FD++F DNPGP
Sbjct: 1 MNLKVPIYFSTGLTEKANHYYKLFIPWTNQKIRKTFVQRNMFEFKHIKAFDRAFADNPGP 60
Query: 157 MVVFATPGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQII 216
MVVFATPGMLH+G SL IF+KWA E NM+IMPG+CVQGT+GHK+LSG +KLE E +Q++
Sbjct: 61 MVVFATPGMLHAGQSLQIFRKWAGNEKNMVIMPGYCVQGTVGHKILSGQRKLEMEGRQVL 120
Query: 217 DVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMP 276
+VKM +EYMSFSAHADAKGIMQL+ EP++VLLVHGEA KM FLK+KI+QE ++C+MP
Sbjct: 121 EVKMQVEYMSFSAHADAKGIMQLVGQAEPESVLLVHGEAKKMEFLKQKIEQELRVNCYMP 180
Query: 277 ANGESCFVQTDMKISIDVSVNLLKEEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVD 336
ANGE+ + T I + +S+ LLK E + P K R +HG L++KDS+ L+ +
Sbjct: 181 ANGETVTLPTSPSIPVGISLGLLKREMAQ--GLLPEAKKPRLLHGTLIMKDSNFRLVSSE 238
Query: 337 EACKEVGISRHIVRFTSNVQISDSSPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSI 396
+A KE+G++ H +RFT V + D+ +T +Y L S L+D ++ L + S+++
Sbjct: 239 QALKELGLAEHQLRFTCRVHLHDTRKEQETALRVYSHLKSVLKDHCVQH---LPDGSVTV 295
Query: 397 DSVLL--SVERIDDKNKRVFVTWPNQDEEVGKIVLHVLK 433
+SVLL + D K + V+W QDEE+G + +LK
Sbjct: 296 ESVLLQAAAPSEDPGTKVLLVSWTYQDEELGSFLTSLLK 334
>gi|449518964|ref|XP_004166505.1| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation
specificity factor subunit 3-II-like, partial [Cucumis
sativus]
Length = 472
Score = 384 bits (986), Expect = e-103, Method: Compositional matrix adjust.
Identities = 181/358 (50%), Positives = 247/358 (68%), Gaps = 6/358 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF KVG+ ++VYTGDYNMTPDRHLGAA ID+ + DLLITESTYATTIRDSK RER+FL
Sbjct: 6 MFYAKVGDAAMVYTGDYNMTPDRHLGAAQIDRMQLDLLITESTYATTIRDSKYAREREFL 65
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
K VH C+ GGKVLIP FALGRAQELC+LL+ YWERMNL+ PIY + GLT +A YYKM
Sbjct: 66 KAVHNCLASGGKVLIPTFALGRAQELCVLLDDYWERMNLKFPIYVSAGLTVQANMYYKML 125
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
I+WT+QK+++T+ RN FDFK+++ FD+S ID PGP V+FATPGM+ SG SL +FK+WAP
Sbjct: 126 ISWTSQKVKETYTTRNAFDFKNVQKFDRSMIDAPGPCVLFATPGMISSGFSLEVFKRWAP 185
Query: 181 VENNMLIMPGFCVQGTIGHKVLSG-VKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQL 239
+ N++ +PG+CV GT+GHK++SG K++ + IDV+ + ++FS H D+KGIM L
Sbjct: 186 SKLNLITLPGYCVAGTVGHKLMSGKPTKIDLDKVTQIDVQCQVHQLAFSPHTDSKGIMDL 245
Query: 240 IQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLL 299
+++ PK+V+LVHGE KM LKE+I E + C PAN E+ + + + + + S +
Sbjct: 246 VKFLSPKHVILVHGEKPKMAVLKERIHSELGIPCHDPANNETVSISSTLSVKAEASSMFI 305
Query: 300 KEEAVKYNSEPPNPL----KERQIHGVLVIKDS-SISLMDVDEACKEVGISRHIVRFT 352
+ + + P +R G+LV+++ + DE +G H VRF
Sbjct: 306 QSCSTPNFNLSCGPALQVSDDRVNEGILVMENGKKXKALHQDELLLLLGQQEHEVRFA 363
>gi|302787435|ref|XP_002975487.1| hypothetical protein SELMODRAFT_52099 [Selaginella moellendorffii]
gi|300156488|gb|EFJ23116.1| hypothetical protein SELMODRAFT_52099 [Selaginella moellendorffii]
Length = 517
Score = 381 bits (979), Expect = e-102, Method: Compositional matrix adjust.
Identities = 173/300 (57%), Positives = 223/300 (74%), Gaps = 1/300 (0%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF VK GN ++VYTGDYNMTPDRHLGAA ID+ +PDLLITESTYATTIR+S+ +E +FL
Sbjct: 154 MFYVKAGNSTVVYTGDYNMTPDRHLGAAQIDRLKPDLLITESTYATTIRESRLAKEAEFL 213
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
VH CV +GGKVLIP+ ALGRAQELCILL+ YWERMNL+ PIYF+ GLT ++ YYK+
Sbjct: 214 NVVHTCVSKGGKVLIPISALGRAQELCILLDEYWERMNLKVPIYFSAGLTMQSNAYYKLL 273
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
I+WTNQ+I+ T+V RN FDFKH+ PFD++ +D PGP ++FATPGML GLSL + K WAP
Sbjct: 274 ISWTNQRIKDTYVTRNAFDFKHVFPFDRTQLDGPGPCILFATPGMLTGGLSLEVLKHWAP 333
Query: 181 VENNMLIMPGFCVQGTIGHKVLSG-VKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQL 239
VE N+LI+PGFC+ GT+ K+ SG ++E + + IDV+ I ++FSAH DAKGIM L
Sbjct: 334 VEQNLLIIPGFCLAGTVAQKLCSGKPTRVEVDKRTTIDVRCQIHLLAFSAHTDAKGIMDL 393
Query: 240 IQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLL 299
++ EP NV+LVHGE KM LK +I E + C PAN + V + +++ S L+
Sbjct: 394 VRQVEPHNVILVHGEKLKMDVLKARINNELGIPCHNPANHDVVEVPSHCLFNVEASKELV 453
>gi|297737628|emb|CBI26829.3| unnamed protein product [Vitis vinifera]
Length = 686
Score = 381 bits (979), Expect = e-102, Method: Compositional matrix adjust.
Identities = 203/465 (43%), Positives = 285/465 (61%), Gaps = 30/465 (6%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF KVG+ ++VYTGDYNMTPDRHLGAA ID+ + DLLITESTYATT+RDSK RER+FL
Sbjct: 217 MFYAKVGDAAMVYTGDYNMTPDRHLGAAQIDRLQLDLLITESTYATTVRDSKYAREREFL 276
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
K VH+CV GGKVLIP FALGRAQELCILL+ YWERMNL+ PIYF+ GLT +A YYKM
Sbjct: 277 KAVHKCVADGGKVLIPTFALGRAQELCILLDNYWERMNLKVPIYFSAGLTIQANMYYKML 336
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
I+WTNQ++++T+ N FDFK++R FD+S I+ PGP V+FATPGM+ G SL +FK WAP
Sbjct: 337 ISWTNQRVKETYATHNAFDFKNVRSFDRSLINAPGPCVLFATPGMISGGFSLEVFKLWAP 396
Query: 181 VENNMLIMPGFCVQGTIGHKVLSG-VKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQL 239
E N++ +PG+C+ GTIGHK+ +G K++ + I V+ I +SFS H DAKGIM L
Sbjct: 397 SEMNLVTLPGYCLAGTIGHKLTTGKPTKIDLDKDIQISVRCQIHQLSFSPHTDAKGIMDL 456
Query: 240 IQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLL 299
+++ PK+V+LVHGE KM LK KI+ + + C+ PAN ++ + + + D S +
Sbjct: 457 VKFLSPKHVILVHGEKPKMASLKGKIESDLGIQCYYPANNDTVCIPSTCWLKADTSKTFI 516
Query: 300 KE------EAVKYNSEPPNPL--------------KERQIHGVLVI-KDSSISLMDVDEA 338
+ + VK SE + L ER G+L++ K ++ +E
Sbjct: 517 RSSLNPNFKFVKTISEDKSNLVSKETEATSVLQVHDERVAEGILIVEKSKKAKVVHQNEL 576
Query: 339 CKEVGISRHIVRFTSNV-QISDSSPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSID 397
+G +H V+F + +S L L++ +L + L Q++ L S+ I
Sbjct: 577 LLMIGKDKHDVQFAYCCPNVVSTSDECSWLHLLFAKLATKLGGNIQDFGQHLQVDSIHIS 636
Query: 398 SVLLSV--ERIDD---KNKRVF--VTWPNQDEEVGKIVLHVLKSM 435
L + R D K VF TW D + ++ +++++
Sbjct: 637 VCLKDICPYRTTDGPQKEPAVFFCCTWSVADVNLAWEIISIMENL 681
>gi|449460766|ref|XP_004148116.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
3-II-like [Cucumis sativus]
Length = 649
Score = 380 bits (975), Expect = e-102, Method: Compositional matrix adjust.
Identities = 169/296 (57%), Positives = 224/296 (75%), Gaps = 1/296 (0%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF KVG+ ++VYTGDYNMTPDRHLGAA ID+ + DLLITESTYATTIRDSK RER+FL
Sbjct: 162 MFYAKVGDAAMVYTGDYNMTPDRHLGAAQIDRMQLDLLITESTYATTIRDSKYAREREFL 221
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
K VH C+ GGKVLIP FALGRAQELC+LL+ YWERMNL+ PIY + GLT +A YYKM
Sbjct: 222 KAVHNCLASGGKVLIPTFALGRAQELCVLLDDYWERMNLKFPIYVSAGLTVQANMYYKML 281
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
I+WT+QK+++T+ RN FDFK+++ FD+S ID PGP V+FATPGM+ SG SL +FK+WAP
Sbjct: 282 ISWTSQKVKETYTTRNAFDFKNVQKFDRSMIDAPGPCVLFATPGMISSGFSLEVFKRWAP 341
Query: 181 VENNMLIMPGFCVQGTIGHKVLSG-VKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQL 239
+ N++ +PG+CV GT+GHK++SG K++ + IDV+ + ++FS H D+KGIM L
Sbjct: 342 SKLNLITLPGYCVAGTVGHKLMSGKPTKIDLDKVTQIDVQCQVHQLAFSPHTDSKGIMDL 401
Query: 240 IQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVS 295
+++ PK+V+LVHGE KM LKE+I E + C PAN E+ + + + + + S
Sbjct: 402 VKFLSPKHVILVHGEKPKMAVLKERIHSELGIPCHDPANNETVSISSTLSVKAEAS 457
>gi|291000374|ref|XP_002682754.1| predicted protein [Naegleria gruberi]
gi|284096382|gb|EFC50010.1| predicted protein [Naegleria gruberi]
Length = 458
Score = 379 bits (973), Expect = e-102, Method: Compositional matrix adjust.
Identities = 178/301 (59%), Positives = 234/301 (77%), Gaps = 11/301 (3%)
Query: 1 MFQVK--VGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERD 58
MF VK S+VYTGDYNMT DRHLG+A ID+CRPDLLITE+TYATTIRDSK RERD
Sbjct: 158 MFYVKDLATGASVVYTGDYNMTADRHLGSATIDRCRPDLLITETTYATTIRDSKSSRERD 217
Query: 59 FLKKVHE-CVDRGGKVLIPVFALGRAQELCILLETYWERMNL--QAPIYFAVGLTEKATN 115
F K+V++ V++ GKVLIPVFALGR QELCILLETYWER NL PIYF+ G+ EKA
Sbjct: 218 FCKQVYDTVVNKKGKVLIPVFALGRVQELCILLETYWERKNLGKSVPIYFSAGMVEKANY 277
Query: 116 YYKMFITWTNQKIRKTFV--QRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLI 173
YY+++I WTN+KI+ T +RN+F+F +I+ F++ +DNPGPMV+FATPGMLH+G+SL
Sbjct: 278 YYQLYINWTNEKIKTTLFDQKRNLFNFSNIQSFERFLMDNPGPMVLFATPGMLHAGMSLE 337
Query: 174 IFKKWAPVENNMLIMPGFCVQGTIGHKVLSG----VKKLEFENKQIIDVKMAIEYMSFSA 229
+FKKWAP ENN +I+PG+CV+GT+G+KVL K+E +++ +ID++ ++ +SFSA
Sbjct: 338 VFKKWAPGENNKVILPGYCVEGTVGNKVLRNKDLKSSKIEIDSRTVIDMRCEVKAISFSA 397
Query: 230 HADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMK 289
HADAKGI+QLI+ +PKNV+LVHGE KM F K+KI+ EF + CF P NG + ++T +
Sbjct: 398 HADAKGILQLIRTAQPKNVMLVHGERDKMAFFKKKIESEFKIPCFDPPNGGTIKIKTTSQ 457
Query: 290 I 290
+
Sbjct: 458 V 458
>gi|414881435|tpg|DAA58566.1| TPA: putative RNA-metabolising metallo-beta-lactamase [Zea mays]
Length = 558
Score = 379 bits (973), Expect = e-102, Method: Compositional matrix adjust.
Identities = 186/374 (49%), Positives = 249/374 (66%), Gaps = 23/374 (6%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
M KVG+ ++VYTGDYNMTPDRHLGAA ID+ + D+LITESTYA +IRDSK RER+FL
Sbjct: 165 MIYAKVGDAAMVYTGDYNMTPDRHLGAAQIDRLKLDVLITESTYAKSIRDSKPAREREFL 224
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
K VH+CV GGKVLIP FALGRAQELC+LL+ YWERM L+ PIYF+ GLT +A YYKM
Sbjct: 225 KAVHKCVSGGGKVLIPTFALGRAQELCMLLDDYWERMGLKVPIYFSAGLTIQANVYYKML 284
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
I WT+QKI+ + N FDFKH+ F++SFI+NPGP V+FATPGM+ G SL FKKWAP
Sbjct: 285 IGWTSQKIKDSHTVHNPFDFKHVCHFERSFINNPGPCVLFATPGMITGGFSLEAFKKWAP 344
Query: 181 VENNMLIMPGFCVQGTIGHKVLSG-VKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQL 239
E N++ +PG+CV GTIGHK++ G ++++E+ IDV+ I ++FS H D+KGIM L
Sbjct: 345 SEKNLVTLPGYCVSGTIGHKLMCGKPTRVDYEDTH-IDVRCQIHQLAFSPHTDSKGIMDL 403
Query: 240 IQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLL 299
++ PKNV+LVHGE +M FLKE+I+ E + C+ P N ES + T + + + +
Sbjct: 404 TEFLSPKNVILVHGEKPQMAFLKERIESELGMPCYYPGNNESVSIPTTQNLKMSATERFI 463
Query: 300 KEEAVKYNSEPPNPLKERQI-----------------HGVLVI-KDSSISLMDVDEACKE 341
AV+ L +R + GVL++ K S ++ DE +
Sbjct: 464 TGCAVEQGK---RSLHKRNLIYGTGLSEEIGSDEAAGEGVLLMQKHKSPKILCEDELLEV 520
Query: 342 VGISRHIVRFTSNV 355
+G+ RH+V+F V
Sbjct: 521 LGMGRHLVQFEPMV 534
>gi|359472593|ref|XP_002281277.2| PREDICTED: uncharacterized protein LOC100261060 [Vitis vinifera]
Length = 2299
Score = 378 bits (971), Expect = e-102, Method: Compositional matrix adjust.
Identities = 198/441 (44%), Positives = 272/441 (61%), Gaps = 41/441 (9%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF KVG+ ++VYTGDYNMTPDRHLGAA ID+ + DLLITESTYATT+RDSK RER+FL
Sbjct: 1411 MFYAKVGDAAMVYTGDYNMTPDRHLGAAQIDRLQLDLLITESTYATTVRDSKYAREREFL 1470
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
K VH+CV GGKVLIP FALGRAQELCILL+ YWERMNL+ PIYF+ GLT +A YYKM
Sbjct: 1471 KAVHKCVADGGKVLIPTFALGRAQELCILLDNYWERMNLKVPIYFSAGLTIQANMYYKML 1530
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
I+WTNQ++++T+ N FDFK++R FD+S I+ PGP V+FATPGM+ G SL +FK WAP
Sbjct: 1531 ISWTNQRVKETYATHNAFDFKNVRSFDRSLINAPGPCVLFATPGMISGGFSLEVFKLWAP 1590
Query: 181 VENNMLIMPGFCVQGTIGHKVLSG-VKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQL 239
E N++ +PG+C+ GTIGHK+ +G K++ + I V+ I +SFS H DAKGIM L
Sbjct: 1591 SEMNLVTLPGYCLAGTIGHKLTTGKPTKIDLDKDIQISVRCQIHQLSFSPHTDAKGIMDL 1650
Query: 240 IQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLL 299
+++ PK+V+LVHGE KM LK KI+ + + C+ PAN ++ + + + D S +
Sbjct: 1651 VKFLSPKHVILVHGEKPKMASLKGKIESDLGIQCYYPANNDTVCIPSTCWLKADTSKTFI 1710
Query: 300 KE------EAVKYNSEPPNPL--------------KERQIHGVLVI-KDSSISLMDVDEA 338
+ + VK SE + L ER G+L++ K ++ +E
Sbjct: 1711 RSSLNPNFKFVKTISEDKSNLVSKETEATSVLQVHDERVAEGILIVEKSKKAKVVHQNEL 1770
Query: 339 CKEVGISRHIVRFT--SNVQISD-----------------SSPINKTLELIYDQLLSYLQ 379
+G +H V+F VQI + +S L L++ +L + L
Sbjct: 1771 LLMIGKDKHDVQFAYCCPVQIGNLERTKGMDFALSKNVVSTSDECSWLHLLFAKLATKLG 1830
Query: 380 DKSQEYKIQLTESSLSIDSVL 400
Q++ L S+ I S+
Sbjct: 1831 GNIQDFGQHLQVDSIHISSIF 1851
>gi|302793925|ref|XP_002978727.1| hypothetical protein SELMODRAFT_109555 [Selaginella moellendorffii]
gi|300153536|gb|EFJ20174.1| hypothetical protein SELMODRAFT_109555 [Selaginella moellendorffii]
Length = 522
Score = 378 bits (971), Expect = e-101, Method: Compositional matrix adjust.
Identities = 172/300 (57%), Positives = 223/300 (74%), Gaps = 1/300 (0%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF VK GN ++VYTGDYNMTPDRHLGAA ID+ +PDLLITESTYATTIR+S+ +E +FL
Sbjct: 159 MFYVKAGNSTVVYTGDYNMTPDRHLGAAQIDRLKPDLLITESTYATTIRESRLAKEAEFL 218
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
VH CV +GGKVLIP+ ALGRAQELCILL+ YWERMNL+ PIYF+ GLT ++ YYK+
Sbjct: 219 NVVHTCVSKGGKVLIPISALGRAQELCILLDEYWERMNLKVPIYFSAGLTMQSNAYYKLL 278
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
I+WTNQ+I+ T+V RN FDFKH+ PFD++ +D GP ++FATPGML GLSL + K WAP
Sbjct: 279 ISWTNQRIKDTYVTRNAFDFKHVFPFDRTQLDGNGPCILFATPGMLTGGLSLEVLKHWAP 338
Query: 181 VENNMLIMPGFCVQGTIGHKVLSG-VKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQL 239
VE N+LI+PGFC+ GT+ K+ SG ++E + + IDV+ I ++FSAH DAKGIM L
Sbjct: 339 VEQNLLIIPGFCLAGTVAQKLCSGKPTRVEVDKRTTIDVRCQIHLLAFSAHTDAKGIMDL 398
Query: 240 IQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLL 299
++ EP+NV+LVHGE KM LK +I E + C PAN + V + +++ S L+
Sbjct: 399 VRQVEPRNVILVHGEKLKMDVLKARINNELGMPCHNPANHDVVEVPSHCLFNVEASKELV 458
>gi|357158307|ref|XP_003578085.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
3-II-like [Brachypodium distachyon]
Length = 553
Score = 378 bits (970), Expect = e-101, Method: Compositional matrix adjust.
Identities = 184/366 (50%), Positives = 250/366 (68%), Gaps = 15/366 (4%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
M KVG+ ++VYTGDYNMTPDRHLGAA I++ + DLLITESTYA TIRDSK RER+FL
Sbjct: 166 MVYAKVGDAAMVYTGDYNMTPDRHLGAAQIERLKLDLLITESTYAKTIRDSKHAREREFL 225
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
K VH+CV GGKVLIP FALGRAQELCILL+ YWERMNL+ PIYF+ GLT +A YYKM
Sbjct: 226 KAVHKCVSEGGKVLIPTFALGRAQELCILLDDYWERMNLKIPIYFSAGLTIQANMYYKML 285
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
I WT+QKI+ ++ +N FDFKH+ F++SFI++PGP V+FATPGM+ G SL +FK+WA
Sbjct: 286 IGWTSQKIKDSYTVQNPFDFKHVCHFERSFINDPGPCVLFATPGMISGGFSLEVFKRWAT 345
Query: 181 VENNMLIMPGFCVQGTIGHKVLSG-VKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQL 239
+ N++ +PG+CV GTIGHK++SG +++ + +DV+ I +SFS H D+KGIM L
Sbjct: 346 SDKNLVTLPGYCVAGTIGHKLMSGKPTRIDIDKDTHVDVRCQIHQLSFSPHTDSKGIMDL 405
Query: 240 IQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQT--DMKIS------ 291
++ P +V+LVHGE +M FLK++I+ E + C+ PAN E+ + T +MKIS
Sbjct: 406 TEFLSPNHVILVHGEKPQMAFLKDRIESELGMSCYYPANNETVSIPTTQNMKISATEKFI 465
Query: 292 -----IDVSVNLLKEEAVKYNSEPPNPLKERQIHGVLVI-KDSSISLMDVDEACKEVGIS 345
+ +L K + + ER G+L++ K + ++ DE K +G
Sbjct: 466 TNCAALQARDSLQKSDLISSIHLSGVNEDERLAEGILLMEKSKAPKILCEDEFLKLLGTD 525
Query: 346 RHIVRF 351
RH V+F
Sbjct: 526 RHSVQF 531
>gi|242053629|ref|XP_002455960.1| hypothetical protein SORBIDRAFT_03g028040 [Sorghum bicolor]
gi|241927935|gb|EES01080.1| hypothetical protein SORBIDRAFT_03g028040 [Sorghum bicolor]
Length = 558
Score = 377 bits (969), Expect = e-101, Method: Compositional matrix adjust.
Identities = 186/371 (50%), Positives = 245/371 (66%), Gaps = 17/371 (4%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
M KVG+ ++VYTGDYNMTPDRHLGAA ID + DLLITESTYA TIRDSK RER+FL
Sbjct: 165 MIYAKVGDAAMVYTGDYNMTPDRHLGAAQIDHLKLDLLITESTYAKTIRDSKHAREREFL 224
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
K VH+CV GGKVLIP FALGRAQELC+LL+ YWERM+L+ PIYF+ GLT +A YYKM
Sbjct: 225 KAVHKCVSGGGKVLIPTFALGRAQELCMLLDDYWERMDLKVPIYFSAGLTIQANVYYKML 284
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
I WT+QKI+ + N FDFKH+ F++SFI+NPGP V+FATPGM+ G SL FKKWAP
Sbjct: 285 IGWTSQKIKDSHAVHNPFDFKHVCHFERSFINNPGPCVLFATPGMISGGFSLEAFKKWAP 344
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQI-IDVKMAIEYMSFSAHADAKGIMQL 239
E N++ +PG+CV GTIGHK++ G K + K I IDV+ I ++FS H D+KGIM L
Sbjct: 345 SEKNLITLPGYCVSGTIGHKLMCG-KPTRIDYKDIHIDVRCQIHQLAFSPHTDSKGIMDL 403
Query: 240 IQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLL 299
++ PK+V+LVHGE +M FLKE+I+ E + C+ P N ES + T + + + +
Sbjct: 404 TEFLSPKHVILVHGEKPQMAFLKERIESELGMPCYYPGNNESVSIPTTQNLKMSATERFI 463
Query: 300 KEEAVKYNSEPPNPLK--------------ERQIHGVLVI-KDSSISLMDVDEACKEVGI 344
AV+ + E G+L++ K S ++ DE + +G+
Sbjct: 464 TSCAVEQGKRSLHKRNLICGTGLSEVIGSDEEAAEGILLMEKHKSPKILCEDELLEVLGM 523
Query: 345 SRHIVRFTSNV 355
+H+V+F V
Sbjct: 524 EQHLVQFEPMV 534
>gi|356502382|ref|XP_003519998.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
3-II-like [Glycine max]
Length = 516
Score = 377 bits (967), Expect = e-101, Method: Compositional matrix adjust.
Identities = 173/282 (61%), Positives = 213/282 (75%), Gaps = 1/282 (0%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF KVG+ +VYTGDYNMTPDRHLGAA ID+ R DLLITESTYATTIRDS+ RER+FL
Sbjct: 162 MFYAKVGDAEMVYTGDYNMTPDRHLGAAQIDRLRLDLLITESTYATTIRDSRYAREREFL 221
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
K VH+CV GGKVLIP FALGRAQELCILLE YWERMNL+ PIYF+ GLT +A YYKM
Sbjct: 222 KAVHKCVSCGGKVLIPTFALGRAQELCILLEDYWERMNLKVPIYFSAGLTIQANAYYKML 281
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
I WT QKI+ T+ + N FDFK+++ F++S ID PGP V+FATPGML G S+ +FK WA
Sbjct: 282 IRWTRQKIKDTYSKHNAFDFKNVQKFERSMIDAPGPCVLFATPGMLSGGFSVEVFKHWAV 341
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGV-KKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQL 239
ENN++ +PG+CV GTIGHK++S K++ + IDV+ I ++FS H D+KGIM L
Sbjct: 342 SENNLVSLPGYCVPGTIGHKLMSDKHDKVDLDPNTKIDVRCQIHQLAFSPHTDSKGIMDL 401
Query: 240 IQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGES 281
+ + PK+V+LVHGE KM LKEKI E + C+ PAN E+
Sbjct: 402 VNFLSPKHVILVHGEKHKMASLKEKIHSELGIQCYDPANNET 443
>gi|46360445|gb|AAS80153.1| ACT11D09.9 [Cucumis melo]
Length = 708
Score = 370 bits (950), Expect = 3e-99, Method: Compositional matrix adjust.
Identities = 170/297 (57%), Positives = 222/297 (74%), Gaps = 6/297 (2%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF KVG+ ++VYTGDYNMTPDRHLGAA ID+ + DLLITESTYATTIRDSK RER+FL
Sbjct: 194 MFYAKVGDAAMVYTGDYNMTPDRHLGAAQIDRMQLDLLITESTYATTIRDSKYAREREFL 253
Query: 61 KKVHECVDRGGKVLIPVFALGRAQ-ELCILLETYWERMNLQAPIYFAVGLTEKATNYYKM 119
K VH C+ GGKVLIP FALGRAQ ELC+LL+ YWERMNL+ PIY + GLT +A YYKM
Sbjct: 254 KAVHNCLASGGKVLIPTFALGRAQQELCVLLDDYWERMNLKFPIYVSAGLTVQANMYYKM 313
Query: 120 FITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWA 179
I+WT+QK+++T+ RN FDFK+++ FD+S ID PGP V+FATPGM+ SG SL +FK+WA
Sbjct: 314 LISWTSQKVKETYTTRNAFDFKNVQKFDRSMIDAPGPCVLFATPGMISSGFSLEVFKRWA 373
Query: 180 PVENNMLIMPGFCVQGTIGHKVLSG-VKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQ 238
P + N++ +PG+CV GT+GHK++SG K++ + IDV ++FS H D+KGIM
Sbjct: 374 PSKLNLITLPGYCVAGTVGHKLMSGKPTKIDLDKDTQIDVHQ----LAFSPHTDSKGIMD 429
Query: 239 LIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVS 295
L+++ PK+V+LVHGE KM LKE+I E + C PAN E+ + + + I + S
Sbjct: 430 LVKFLSPKHVILVHGEKPKMAVLKERIHSELGIPCHDPANNETVSISSTLSIKAEAS 486
>gi|388507878|gb|AFK42005.1| unknown [Medicago truncatula]
Length = 534
Score = 370 bits (949), Expect = 3e-99, Method: Compositional matrix adjust.
Identities = 172/301 (57%), Positives = 220/301 (73%), Gaps = 1/301 (0%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF VKVG+ +VYTGDYNMTPDRHLGAA ID+ R DLLITESTYATTIRDSK RER+FL
Sbjct: 162 MFYVKVGDAEMVYTGDYNMTPDRHLGAAQIDRLRLDLLITESTYATTIRDSKYAREREFL 221
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
K VH+CV GGKVLIP FALGRAQEL ILL+ YWERMNL+ PIYF+ GLT +A Y+KM
Sbjct: 222 KAVHKCVSGGGKVLIPTFALGRAQELRILLDDYWERMNLKVPIYFSSGLTIQANTYHKML 281
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
I WT+QKI+ T+ N FDFK++ F++S +D PGP V+FATPGML G SL +FK WAP
Sbjct: 282 IGWTSQKIKDTYSTHNAFDFKNVHKFERSMLDAPGPCVLFATPGMLIGGFSLEVFKHWAP 341
Query: 181 VENNMLIMPGFCVQGTIGHKVLSG-VKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQL 239
E N++ +PG+C+ GT+GH++ SG K++ + IDV+ I ++FSAH D+KGIM L
Sbjct: 342 SEKNLVALPGYCMAGTVGHRLTSGKPTKVDTDPDTQIDVRCQIHQLAFSAHTDSKGIMDL 401
Query: 240 IQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLL 299
+++ PK+V+LVHG+ KMV LKE+I E + C PAN E + + ++ + S
Sbjct: 402 VKFLSPKHVMLVHGDKPKMVSLKERIDSELGIPCSHPANNEIVTISSTQYVNAEASDTFT 461
Query: 300 K 300
K
Sbjct: 462 K 462
>gi|401827835|ref|XP_003888210.1| putative RNA-processing beta-lactamase-fold exonuclease
[Encephalitozoon hellem ATCC 50504]
gi|392999410|gb|AFM99229.1| putative RNA-processing beta-lactamase-fold exonuclease
[Encephalitozoon hellem ATCC 50504]
Length = 496
Score = 369 bits (946), Expect = 8e-99, Method: Compositional matrix adjust.
Identities = 165/306 (53%), Positives = 231/306 (75%), Gaps = 5/306 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V VG+QS+VYTGDY+ TPD+HLG A I RPDLLITESTY + RD +R +ER+FL
Sbjct: 158 MFHVVVGDQSVVYTGDYSTTPDKHLGPASIKCVRPDLLITESTYGSITRDCRRVKEREFL 217
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
K + +C+ RGG+VLIP+FALGRAQELC+LL+ YWER L+ P+YF+ GLTEKA YK F
Sbjct: 218 KAISDCIARGGRVLIPIFALGRAQELCLLLDGYWERTGLKVPVYFSSGLTEKANEIYKKF 277
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
I++TN+ ++K +RN+F++KHI+PF K ++DN GPMV+FA+PGMLHSG+SL +FK+W
Sbjct: 278 ISYTNETVKKKIFERNVFEYKHIKPFQKYYMDNEGPMVLFASPGMLHSGMSLRMFKEWCS 337
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E N++I+PG+CV+GTIG KVL+G K+LE ++ D+++ I+ +SFSAHADA+GI+ +I
Sbjct: 338 DEKNLVIIPGYCVRGTIGEKVLNGAKRLEILGEE-KDIRIEIKNLSFSAHADAQGILNVI 396
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
+ C P+NV+LVHGE S+M LK I++ F + F+P NG ++ + VS+ + K
Sbjct: 397 EQCSPRNVMLVHGERSRMKQLKRSIEERFGIPTFLPPNG----TLINIPLKSTVSLRIRK 452
Query: 301 EEAVKY 306
E +Y
Sbjct: 453 ESISRY 458
>gi|303391170|ref|XP_003073815.1| putative beta-lactamase fold-containing exonuclease
[Encephalitozoon intestinalis ATCC 50506]
gi|303302963|gb|ADM12455.1| putative beta-lactamase fold-containing exonuclease
[Encephalitozoon intestinalis ATCC 50506]
Length = 496
Score = 364 bits (935), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 158/279 (56%), Positives = 219/279 (78%), Gaps = 1/279 (0%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V VG+QS+VYTGDY+ TPD+HLG A I RPDLLITESTY + RD +R +ER+FL
Sbjct: 158 MFHVSVGDQSVVYTGDYSTTPDKHLGPASIKCIRPDLLITESTYGSITRDCRRVKEREFL 217
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
K V +C+ RGG+VLIP+FALGRAQELC+LL+ YWER L+ P+YF+ GLTEKA YK F
Sbjct: 218 KAVSDCIARGGRVLIPIFALGRAQELCLLLDGYWERTGLEIPVYFSSGLTEKANEIYKKF 277
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
I +TN+ +++ +RN+F++KHI+PF + ++DN GPMV+FA+PGMLHSG+SL IFK+W
Sbjct: 278 IGYTNETVKRKIFERNVFEYKHIKPFQRYYMDNKGPMVLFASPGMLHSGMSLRIFKEWCE 337
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E N++I+PG+CV+GTIG K+L+G K+LE ++ D+K+ I+ + FSAHADA+GI+ +I
Sbjct: 338 DEKNLVIIPGYCVRGTIGEKILNGAKRLEILGEE-KDIKLEIKNLGFSAHADAQGILSVI 396
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANG 279
+ C P+NV+LVHGE S+M LK+ +++ F + F+P NG
Sbjct: 397 EQCAPRNVMLVHGERSRMKQLKKSVEERFGIPTFLPPNG 435
>gi|146170679|ref|XP_001017643.2| metallo beta lactamase domain containing protein [Tetrahymena
thermophila]
gi|146145062|gb|EAR97398.2| metallo beta lactamase domain containing protein [Tetrahymena
thermophila SB210]
Length = 675
Score = 364 bits (934), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 168/310 (54%), Positives = 222/310 (71%), Gaps = 31/310 (10%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF ++ N +VYTGD++ DRHLGAAWI+K +PDLLI+E TY T IRDSKR RE++FL
Sbjct: 195 MFMIEYRNVKVVYTGDFHSNADRHLGAAWIEKVKPDLLISECTYGTIIRDSKRAREKNFL 254
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
K++ E +D+GGKVLIPVFALGRAQELCILLETYW+R Q P+YFA G+ EKA YYK+F
Sbjct: 255 KQIQETIDQGGKVLIPVFALGRAQELCILLETYWQRTQSQVPVYFAAGMIEKANFYYKLF 314
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
+ WTN+KI+ +++ NMFDFK+I+PF +S I GPMV+FATPGMLH+GLS+ +FK+W
Sbjct: 315 VNWTNEKIKSSYLTDNMFDFKYIKPFSRSLIKTNGPMVLFATPGMLHAGLSMQVFKEWCY 374
Query: 181 VENNMLIMPGFCVQGT------------------------------IGHKVLSGVKKLEF 210
E N LI+PG+CV GT IG+K+LSG K++
Sbjct: 375 DEKNTLIIPGYCVAGTLGCVLISSFFFHIMKINFEFYLFFTLFDKIIGNKLLSGCKQVIL 434
Query: 211 ENKQIIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFN 270
+ K++ DV M ++ MSFSAHADAKGI+ LI++CEPKNV+LVHG+ ++M+ L E I+ +F
Sbjct: 435 D-KKVYDVNMQVKNMSFSAHADAKGILGLIKHCEPKNVMLVHGDVTRMMPLAETIRDQFE 493
Query: 271 LDCFMPANGE 280
+ F PAN E
Sbjct: 494 IQVFTPANFE 503
>gi|396082329|gb|AFN83939.1| putative beta-lactamase fold-containingexonuclease [Encephalitozoon
romaleae SJ-2008]
Length = 496
Score = 362 bits (930), Expect = 5e-97, Method: Compositional matrix adjust.
Identities = 163/306 (53%), Positives = 230/306 (75%), Gaps = 5/306 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V VG+QS+VYTGDY+ TPD+HLG A I RPDLLITESTY + RD +R +ER+FL
Sbjct: 158 MFHVVVGDQSVVYTGDYSTTPDKHLGPASIKCVRPDLLITESTYGSITRDCRRVKEREFL 217
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
K V +C+ RGG+VLIP+FALGRAQELC+LL+ YWER L+ P+YF+ GLTEKA YK F
Sbjct: 218 KAVSDCIARGGRVLIPIFALGRAQELCLLLDGYWERTGLKIPVYFSSGLTEKANEIYKKF 277
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
I++TN+ +++ +RN+F++KHI+PF K +++N GPMV+FA+PGMLHSG+SL +FK+W
Sbjct: 278 ISYTNETVKRKIFERNVFEYKHIKPFQKYYMENKGPMVLFASPGMLHSGMSLRMFKEWCE 337
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
E N++I+PG+CV+GTIG KVL+G K+LE ++ D+++ + ++FSAHADA+GI+ +I
Sbjct: 338 DEKNLVIIPGYCVRGTIGEKVLNGAKRLEVLGEE-KDIRLEVRNLAFSAHADAQGILNVI 396
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
+ C P+NV+LVHGE S+M LK I++ F + F+P NG + K +I++ + K
Sbjct: 397 EQCSPRNVMLVHGERSRMKQLKRNIEERFGIPTFLPPNGT--LINIPSKNTINLRIR--K 452
Query: 301 EEAVKY 306
E KY
Sbjct: 453 ESISKY 458
>gi|19074744|ref|NP_586250.1| similarity to HYPOTHETICAL PROTEIN YO47_METJA [Encephalitozoon
cuniculi GB-M1]
gi|19069386|emb|CAD25854.1| similarity to HYPOTHETICAL PROTEIN YO47_METJA [Encephalitozoon
cuniculi GB-M1]
gi|449329879|gb|AGE96147.1| hypothetical protein ECU10_1350 [Encephalitozoon cuniculi]
Length = 496
Score = 361 bits (926), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 159/281 (56%), Positives = 219/281 (77%), Gaps = 5/281 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V VG+QS+VYTGDY+ TPD+HLG A I RPDLLITESTY + RD ++ +ER+FL
Sbjct: 158 MFHVVVGDQSVVYTGDYSTTPDKHLGPASIKCIRPDLLITESTYGSITRDCRKVKEREFL 217
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
K V +CV RGG+VLIP+FALGRAQELC+LL+ YWER L+ P+YF+ GLTEKA YK F
Sbjct: 218 KAVSDCVARGGRVLIPIFALGRAQELCLLLDGYWERTGLKTPVYFSSGLTEKANEIYKKF 277
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
I++TN+ +RK +RNMF++KHI+PF + ++++ GPMV+FA+PGMLHSG+SL IFK+W
Sbjct: 278 ISYTNETVRKKIFERNMFEYKHIKPFQRHYMESKGPMVLFASPGMLHSGMSLKIFKEWCE 337
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEF--ENKQIIDVKMAIEYMSFSAHADAKGIMQ 238
E N++I+PG+CV+GTIG KVL+G KLE E+++I ++ ++ ++FSAHADA GI+
Sbjct: 338 DEKNLVIIPGYCVRGTIGEKVLNGATKLEILGESREI---RLEVKNLAFSAHADAPGILN 394
Query: 239 LIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANG 279
+++ C P+NV+LVHGE S+M LK I++ F + F+P NG
Sbjct: 395 VVEQCSPRNVMLVHGEKSRMKVLKRSIEERFGIPTFLPPNG 435
>gi|402471873|gb|EJW05382.1| hypothetical protein EDEG_00046 [Edhazardia aedis USNM 41457]
Length = 507
Score = 360 bits (925), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 164/309 (53%), Positives = 226/309 (73%), Gaps = 5/309 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF VKVG +S+VYTGDYN TPDRHLG AWID RPDL ITESTY +TIRD ++ +ER+FL
Sbjct: 159 MFYVKVGCESLVYTGDYNTTPDRHLGGAWIDCLRPDLFITESTYGSTIRDCRKAKEREFL 218
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
++ECV GGKVLIP FALGRAQE+C+L+++YWE+MNL P+YF G+ E+A Y+++
Sbjct: 219 SSIYECVKNGGKVLIPTFALGRAQEMCLLIDSYWEKMNLSVPVYFTAGMAERANQIYRLY 278
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
I +TN+ IRK ++RN+F++K+I+ DK ID PGPMV+ A+PGMLHSG SL +F K
Sbjct: 279 INYTNETIRKKILERNLFEYKYIKSLDKGVIDLPGPMVILASPGMLHSGNSLNLFLKICH 338
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
+NNM+++PG+CV+GT+G KVL G K +E K+I DV + ++ ++FSAHADAKGI++LI
Sbjct: 339 DKNNMIVIPGYCVRGTVGDKVLKGEKTIEV-GKKIYDVNIKVKNLAFSAHADAKGIIELI 397
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFV----QTDMKISIDVSV 296
+ C+PKNV+LVHG+ + M LK +I+ + F P NG + +T+ S D+
Sbjct: 398 KQCQPKNVMLVHGDKNCMKNLKARIENNLRITTFAPENGTMVDIPKNTETEFVFSRDLLN 457
Query: 297 NLLKEEAVK 305
+LKE K
Sbjct: 458 GILKENENK 466
>gi|300706889|ref|XP_002995677.1| hypothetical protein NCER_101357 [Nosema ceranae BRL01]
gi|239604869|gb|EEQ82006.1| hypothetical protein NCER_101357 [Nosema ceranae BRL01]
Length = 500
Score = 359 bits (922), Expect = 4e-96, Method: Compositional matrix adjust.
Identities = 161/279 (57%), Positives = 220/279 (78%), Gaps = 1/279 (0%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V+VG++S+VYTGDYNMTPDRHLGAA I RPDLLITESTY + RD ++ +ER+F
Sbjct: 160 MFHVQVGSRSVVYTGDYNMTPDRHLGAASIPCLRPDLLITESTYGSITRDCRKSKEREFF 219
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
K V +CV GGKVLIP+FALGRAQELC+LL+++WERM L+ PIYF+ GLTEKA N YK F
Sbjct: 220 KAVLDCVSNGGKVLIPIFALGRAQELCLLLDSHWERMQLKVPIYFSSGLTEKANNIYKQF 279
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
+++TN+ I+K N+FDFKH F K F+D PMV+FA+PGMLHSG+SL +FK+W
Sbjct: 280 LSYTNETIKKNAFNHNVFDFKHTTTFQKHFLDLNIPMVLFASPGMLHSGMSLKVFKEWCT 339
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
N++I+PG+CV+GT+G KVL+G K++E ++ ++K+ ++ ++FSAHADA+GI+ LI
Sbjct: 340 DPKNLVIIPGYCVKGTVGDKVLNGNKEIEILG-ELKEIKIQVKNLAFSAHADAQGILNLI 398
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANG 279
+ C+PKNV+LVHGE S++ LK+ I+++FN+ F PANG
Sbjct: 399 KMCQPKNVMLVHGEKSRISLLKKSIEKKFNIPVFKPANG 437
>gi|297814408|ref|XP_002875087.1| hypothetical protein ARALYDRAFT_322516 [Arabidopsis lyrata subsp.
lyrata]
gi|297320925|gb|EFH51346.1| hypothetical protein ARALYDRAFT_322516 [Arabidopsis lyrata subsp.
lyrata]
Length = 819
Score = 359 bits (922), Expect = 4e-96, Method: Compositional matrix adjust.
Identities = 177/333 (53%), Positives = 231/333 (69%), Gaps = 5/333 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
M KVG+ +IVYTGDYNMT DRHLGAA ID+ + DLLI+ESTYATTIR SK RER+FL
Sbjct: 162 MVYAKVGDAAIVYTGDYNMTTDRHLGAAKIDRLQLDLLISESTYATTIRGSKYPREREFL 221
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
+ VH+CV GGK LIP FALGRAQELC+LL+ YWERMN++ PIYF+ GLT +A YYKM
Sbjct: 222 QAVHKCVAGGGKALIPSFALGRAQELCMLLDDYWERMNIKVPIYFSSGLTIQANMYYKML 281
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
I+WT+Q +++ N FDFK+++ FD+S I PGP V+FATPGML +G SL +FK WAP
Sbjct: 282 ISWTSQNVKEKHNTHNPFDFKNVKDFDRSLIHAPGPCVLFATPGMLCAGFSLEVFKHWAP 341
Query: 181 VENNMLIMPGFCVQGTIGHKVLSG-VKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQL 239
N++ +PG+ V GT+GHK++SG ++ N +DV+ I ++FS H DAKGIM L
Sbjct: 342 SPLNLVALPGYSVAGTVGHKLMSGKPTTVDLYNGTKVDVRCKIHQVAFSPHTDAKGIMDL 401
Query: 240 IQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLL 299
++ PKNV+LVHGE M+ LK+KI E ++ CF+PANGE+ V + I + S L
Sbjct: 402 TKFLSPKNVVLVHGEKPSMMILKDKITSELDIPCFVPANGETVSVASTTYIKANASDMFL 461
Query: 300 KEEA---VKY-NSEPPNPLKERQIHGVLVIKDS 328
K + K+ NS +R GVLVI+ S
Sbjct: 462 KSCSSPNFKFSNSTQLRVTDQRTADGVLVIEKS 494
>gi|4220489|gb|AAD12712.1| putative cleavage and polyadenylation specifity factor [Arabidopsis
thaliana]
Length = 837
Score = 355 bits (910), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 174/333 (52%), Positives = 229/333 (68%), Gaps = 5/333 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
M K+G+ +IVYTGDYNMT DRHLGAA ID+ + DLLI+ESTYATTIR SK RER+FL
Sbjct: 162 MVYAKMGDAAIVYTGDYNMTTDRHLGAAKIDRLQLDLLISESTYATTIRGSKYPREREFL 221
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
+ VH+CV GGK LIP FALGRAQELC+LL+ YWERMN++ PIYF+ GLT +A YYKM
Sbjct: 222 QAVHKCVAGGGKALIPSFALGRAQELCMLLDDYWERMNIKVPIYFSSGLTIQANMYYKML 281
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
I+WT+Q +++ N FDFK+++ FD+S I PGP V+FATPGML +G SL +FK WAP
Sbjct: 282 ISWTSQNVKEKHNTHNPFDFKNVKDFDRSLIHAPGPCVLFATPGMLCAGFSLEVFKHWAP 341
Query: 181 VENNMLIMPGFCVQGTIGHKVLSG-VKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQL 239
N++ +PG+ V GT+GHK+++G ++ N +DV+ + ++FS H DAKGIM L
Sbjct: 342 SPLNLVALPGYSVAGTVGHKLMAGKPTTVDLYNGTKVDVRCKVHQVAFSPHTDAKGIMDL 401
Query: 240 IQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLL 299
++ PKNV+LVHGE M+ LKEKI E ++ CF+PANGE+ + I + S L
Sbjct: 402 TKFLSPKNVVLVHGEKPSMMILKEKITSELDIPCFVPANGETVSFASTTYIKANASDMFL 461
Query: 300 KEEA---VKY-NSEPPNPLKERQIHGVLVIKDS 328
K + K+ NS R GVLVI+ S
Sbjct: 462 KSCSNPNFKFSNSTQLRVTDHRTADGVLVIEKS 494
>gi|30677952|ref|NP_178282.2| cleavage and polyadenylation specificity factor subunit 3-II
[Arabidopsis thaliana]
gi|332278175|sp|Q8GUU3.2|CPS3B_ARATH RecName: Full=Cleavage and polyadenylation specificity factor
subunit 3-II; AltName: Full=Cleavage and polyadenylation
specificity factor 73 kDa subunit II; Short=AtCPSF73-II;
Short=CPSF 73 kDa subunit II; AltName: Full=Protein
EMBRYO SAC DEVELOPMENT ARREST 26
gi|62320470|dbj|BAD94982.1| putative cleavage and polyadenylation specifity factor [Arabidopsis
thaliana]
gi|330250395|gb|AEC05489.1| cleavage and polyadenylation specificity factor subunit 3-II
[Arabidopsis thaliana]
Length = 613
Score = 355 bits (910), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 174/333 (52%), Positives = 229/333 (68%), Gaps = 5/333 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
M K+G+ +IVYTGDYNMT DRHLGAA ID+ + DLLI+ESTYATTIR SK RER+FL
Sbjct: 162 MVYAKMGDAAIVYTGDYNMTTDRHLGAAKIDRLQLDLLISESTYATTIRGSKYPREREFL 221
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
+ VH+CV GGK LIP FALGRAQELC+LL+ YWERMN++ PIYF+ GLT +A YYKM
Sbjct: 222 QAVHKCVAGGGKALIPSFALGRAQELCMLLDDYWERMNIKVPIYFSSGLTIQANMYYKML 281
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
I+WT+Q +++ N FDFK+++ FD+S I PGP V+FATPGML +G SL +FK WAP
Sbjct: 282 ISWTSQNVKEKHNTHNPFDFKNVKDFDRSLIHAPGPCVLFATPGMLCAGFSLEVFKHWAP 341
Query: 181 VENNMLIMPGFCVQGTIGHKVLSG-VKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQL 239
N++ +PG+ V GT+GHK+++G ++ N +DV+ + ++FS H DAKGIM L
Sbjct: 342 SPLNLVALPGYSVAGTVGHKLMAGKPTTVDLYNGTKVDVRCKVHQVAFSPHTDAKGIMDL 401
Query: 240 IQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLL 299
++ PKNV+LVHGE M+ LKEKI E ++ CF+PANGE+ + I + S L
Sbjct: 402 TKFLSPKNVVLVHGEKPSMMILKEKITSELDIPCFVPANGETVSFASTTYIKANASDMFL 461
Query: 300 KEEA---VKY-NSEPPNPLKERQIHGVLVIKDS 328
K + K+ NS R GVLVI+ S
Sbjct: 462 KSCSNPNFKFSNSTQLRVTDHRTADGVLVIEKS 494
>gi|395518397|ref|XP_003763348.1| PREDICTED: integrator complex subunit 11 [Sarcophilus harrisii]
Length = 393
Score = 354 bits (909), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 166/227 (73%), Positives = 189/227 (83%), Gaps = 13/227 (5%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+KVG++S VYTGDYNMTPDRHLGAAWIDKCRP+LLITESTYATTIRDSKRCRERDFL
Sbjct: 163 MFQIKVGSESAVYTGDYNMTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFL 222
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHE V+RGGKVLIPVFALGRAQELCILLET+WERMNL+APIYF+ GLTEKA +YYK+F
Sbjct: 223 KKVHETVERGGKVLIPVFALGRAQELCILLETFWERMNLKAPIYFSTGLTEKANHYYKLF 282
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
ITWTNQKIRKTFVQRNMF+FKHI+ FD++F DNPGPMV P L + + WA
Sbjct: 283 ITWTNQKIRKTFVQRNMFEFKHIKAFDRAFADNPGPMVGEGGP-------WLDLVQAWAG 335
Query: 181 VENNM------LIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMA 221
E +IMPG+CVQGT+GHK+LSG +KLE E +QI K+A
Sbjct: 336 EEEGAATFCPQVIMPGYCVQGTVGHKILSGQRKLEMEGRQIGAGKVA 382
>gi|345311496|ref|XP_001519388.2| PREDICTED: integrator complex subunit 11-like, partial
[Ornithorhynchus anatinus]
Length = 359
Score = 352 bits (903), Expect = 8e-94, Method: Compositional matrix adjust.
Identities = 180/323 (55%), Positives = 217/323 (67%), Gaps = 39/323 (12%)
Query: 6 VGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFLKKVHE 65
VG++S+VYTGDYNMTPDRHLGAAWIDKCRP+LLITESTYATTIRDSKRCRERDFLKKVHE
Sbjct: 76 VGSESVVYTGDYNMTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFLKKVHE 135
Query: 66 CVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMFITWTN 125
V+RGGKVLIPVFALGRAQELCILLET+W VG+T + T
Sbjct: 136 TVERGGKVLIPVFALGRAQELCILLETFWL-------TGVVVGVTAGTPTSPNLPFT--- 185
Query: 126 QKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAPVENNM 185
P F PG + TP +G + P
Sbjct: 186 ------------------PPGGSGF---PGVARMGGTP----AGTERAVPSPLPP--GAP 218
Query: 186 LIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLIQYCEP 245
IMPG+CVQGT+GHK+LSG +KLE E +QI++VKM +EYMSFSAHADAKGIMQL++ EP
Sbjct: 219 AIMPGYCVQGTVGHKILSGQRKLEMEGRQILEVKMQVEYMSFSAHADAKGIMQLVRQAEP 278
Query: 246 KNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLKEEAVK 305
+NVLLVHGE KM FLK+KI+QEF ++C+MPANGE+ + T+ I + +S+ LLK E
Sbjct: 279 QNVLLVHGEGKKMEFLKQKIEQEFRVNCYMPANGETVTLLTNPNIPVGISLGLLKRETTL 338
Query: 306 YNSEPPNPLKERQIHGVLVIKDS 328
PP+P K R +HG LV+KD+
Sbjct: 339 --GVPPDPQKPRLMHGTLVMKDN 359
>gi|27372065|gb|AAN87883.1| FEG protein [Arabidopsis thaliana]
Length = 613
Score = 350 bits (897), Expect = 3e-93, Method: Compositional matrix adjust.
Identities = 173/333 (51%), Positives = 228/333 (68%), Gaps = 5/333 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
M K+G+ +IVYTGDYNMT DRHLGAA ID+ + DLLI+ESTYATTIR SK RER+FL
Sbjct: 162 MVYAKMGDAAIVYTGDYNMTTDRHLGAAKIDRLQLDLLISESTYATTIRGSKYPREREFL 221
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
+ VH+CV GGK LIP FALGRAQELC+LL+ YWERMN++ PIYF+ GLT +A YYKM
Sbjct: 222 QAVHKCVAGGGKALIPSFALGRAQELCMLLDDYWERMNIKVPIYFSSGLTIQANMYYKML 281
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
I+WT+Q +++ N FDFK+++ FD+S I PGP V+FA PGML +GLSL +FK WAP
Sbjct: 282 ISWTSQNVKEKHNTHNPFDFKNVKDFDRSLIHAPGPCVLFAIPGMLCAGLSLEVFKHWAP 341
Query: 181 VENNMLIMPGFCVQGTIGHKVLSG-VKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQL 239
N++ + G+ V GT+GHK+++G ++ N +DV+ + ++FS H DAKGIM L
Sbjct: 342 SPLNLVALLGYSVAGTVGHKLMAGKPTTVDLHNGTKVDVRCKVHQVAFSPHTDAKGIMDL 401
Query: 240 IQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLL 299
++ PKNV+LVHGE M+ LKEKI E ++ CF+PANGE+ + I + S L
Sbjct: 402 TKFLSPKNVVLVHGEKPSMMILKEKITSELDIPCFVPANGETVSFASTTYIKANASDMFL 461
Query: 300 KEEA---VKY-NSEPPNPLKERQIHGVLVIKDS 328
K + K+ NS R GVLVI+ S
Sbjct: 462 KSCSNPNFKFSNSTQLRVTDHRTADGVLVIEKS 494
>gi|340509014|gb|EGR34593.1| hypothetical protein IMG5_006210 [Ichthyophthirius multifiliis]
Length = 456
Score = 350 bits (897), Expect = 4e-93, Method: Compositional matrix adjust.
Identities = 161/269 (59%), Positives = 208/269 (77%), Gaps = 3/269 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF ++ +VYTGD++ DRHLGAAWIDK PDLLI+E TY T +R+SKR RER FL
Sbjct: 187 MFLIEYKGIKVVYTGDFHSNADRHLGAAWIDKINPDLLISECTYGTIVRESKRARERTFL 246
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
++V E +D+GGKVLIPVFALGRAQELC+LLETYW+R QAP+YFA G+ EKA YYK+F
Sbjct: 247 QQVQETIDQGGKVLIPVFALGRAQELCVLLETYWQRTQNQAPVYFAAGMIEKANFYYKLF 306
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
+ WTN+KI+ ++ NMF+FKHI+PF KS I PMV+FATPGMLH+GLS+ +FK+W
Sbjct: 307 VNWTNEKIKSCYLIDNMFNFKHIKPFQKSLIKANMPMVLFATPGMLHAGLSMQVFKEWCY 366
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
N LI+PG+CV GT+G+K+LSG K++ + K+ DV M ++ MSFSAHADAKGI+ LI
Sbjct: 367 DSKNTLIIPGYCVAGTLGNKLLSGCKQVILD-KKTYDVNMKVKNMSFSAHADAKGILGLI 425
Query: 241 QYCEPKNVLLVHGEASKMVFLKEK--IKQ 267
++CE KNVLLVHG+ ++M+ L E IKQ
Sbjct: 426 KHCEAKNVLLVHGDVARMMSLAESEYIKQ 454
>gi|119576637|gb|EAW56233.1| cleavage and polyadenylation specific factor 3-like, isoform CRA_b
[Homo sapiens]
Length = 329
Score = 344 bits (883), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 154/184 (83%), Positives = 172/184 (93%)
Query: 3 QVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFLKK 62
Q+KVG++S+VYTGDYNMTPDRHLGAAWIDKCRP+LLITESTYATTIRDSKRCRERDFLKK
Sbjct: 143 QIKVGSESVVYTGDYNMTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFLKK 202
Query: 63 VHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMFIT 122
VHE V+RGGKVLIPVFALGRAQELCILLET+WERMNL+ PIYF+ GLTEKA +YYK+FI
Sbjct: 203 VHETVERGGKVLIPVFALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLFIP 262
Query: 123 WTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAPVE 182
WTNQKIRKTFVQRNMF+FKHI+ FD++F DNPGPMVVFATPGMLH+G SL IF+KWA E
Sbjct: 263 WTNQKIRKTFVQRNMFEFKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAGNE 322
Query: 183 NNML 186
NM+
Sbjct: 323 KNMV 326
>gi|429963288|gb|ELA42832.1| hypothetical protein VICG_00147 [Vittaforma corneae ATCC 50505]
Length = 513
Score = 343 bits (881), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 149/301 (49%), Positives = 225/301 (74%), Gaps = 2/301 (0%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF VKVG+QS+VYTGDY+ T D+HLG AWID RPDL+ITESTY + IRD ++ +ER+FL
Sbjct: 160 MFYVKVGDQSVVYTGDYSTTADQHLGTAWIDTLRPDLMITESTYGSVIRDCRKAKEREFL 219
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
+ +H C++RGGK LIP+FALGRAQE+C+++E+YWERM L+ P+YFA G+TEKA YK F
Sbjct: 220 QSIHNCIERGGKTLIPIFALGRAQEICLIVESYWERMGLEIPVYFAGGMTEKANEIYKRF 279
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
I +TN+ +R+ +++N+F+F HI+P+ K + GP V+F++PGMLHSG SL IFK
Sbjct: 280 INYTNESVREKILEKNVFEFSHIKPYRKGS-ELQGPCVIFSSPGMLHSGTSLRIFKNICS 338
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
N++I+PG+CV+GT+G +VL+G +K EN++ + + ++ ++FSAHAD GIM++I
Sbjct: 339 DPRNLVILPGYCVRGTLGDRVLNGSRKERIENEERA-INLEVKNIAFSAHADTCGIMKII 397
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
C PK+++LVHGE ++M L++ I+ EF++ +MP NG + T+ + I + ++L+
Sbjct: 398 DQCRPKSLMLVHGEKARMKVLQKNIRNEFHIPVYMPPNGVVLSISTNDTVKIYLDKHVLR 457
Query: 301 E 301
+
Sbjct: 458 K 458
>gi|387594235|gb|EIJ89259.1| integrator complex subunit 11 [Nematocida parisii ERTm3]
gi|387594982|gb|EIJ92609.1| integrator complex subunit 11 [Nematocida parisii ERTm1]
Length = 502
Score = 328 bits (840), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 149/281 (53%), Positives = 201/281 (71%), Gaps = 1/281 (0%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
M+ VKVG++S+VYTGDYNMTPDRHL AW+ K P +LITESTYA +RD +R +ERDF+
Sbjct: 163 MYHVKVGDESVVYTGDYNMTPDRHLDGAWMPKVYPSVLITESTYALLVRDCRREKERDFI 222
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
+ V +CV GGKVLIPVFALGRA ELC+LL+T+WE+ L PIY + LT KA + YK F
Sbjct: 223 ESVVQCVKNGGKVLIPVFALGRAHELCLLLDTHWEKTKLDIPIYTSATLTHKANDIYKQF 282
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
I +T++ IR T +RN+FDF+H++ FD + GPM++F++PGMLHSG SL IFKKW
Sbjct: 283 IDYTHEHIRSTLHKRNLFDFRHVKQFDSNLASLEGPMILFSSPGMLHSGPSLSIFKKWCG 342
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
NNM+I PG+CV+GTIG +VL+G ++E + V+M ++ M FSAHAD KGI+ L+
Sbjct: 343 DPNNMVIFPGYCVRGTIGERVLNGASQIEV-GGIVYPVRMKVKNMPFSAHADQKGILALV 401
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGES 281
Q CEP NV+LVHG+ +M +K ++ + P G +
Sbjct: 402 QQCEPDNVILVHGDIMRMKRMKNILETTLGVPTLHPPIGAT 442
>gi|209875817|ref|XP_002139351.1| RNA-metabolising metallo-beta-lactamase family protein
[Cryptosporidium muris RN66]
gi|209554957|gb|EEA05002.1| RNA-metabolising metallo-beta-lactamase family protein
[Cryptosporidium muris RN66]
Length = 797
Score = 327 bits (839), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 148/290 (51%), Positives = 207/290 (71%), Gaps = 8/290 (2%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V+VGN+SIVYTGDYNM DRHLG A I K PD+L++ESTYAT IR S+R ER F
Sbjct: 183 MFHVEVGNESIVYTGDYNMVRDRHLGPASIKKLFPDVLLSESTYATYIRPSRRSTERIFC 242
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
+ V +C+++GGKVLIPVFA+GRAQELCILLE +W RM L+ PIYF +TEK++ YY+++
Sbjct: 243 EMVLQCLEKGGKVLIPVFAVGRAQELCILLEFFWRRMQLRYPIYFGGAMTEKSSLYYQLY 302
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
WTN T + ++F F H+ P+D+S + N GP V+FATPGMLH+GLSL FK WAP
Sbjct: 303 TNWTN-----TALSDDLFSFPHVLPYDRSVLTNTGPAVLFATPGMLHAGLSLQAFKCWAP 357
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENK---QIIDVKMAIEYMSFSAHADAKGIM 237
NN+ I+PGFCV GT+G ++++G K++ + K ID++ ++Y+SFS+HAD+ GI
Sbjct: 358 DPNNLTIIPGFCVAGTLGARIIAGAKRIILDPKDPSSSIDIRCDVKYLSFSSHADSIGIQ 417
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTD 287
LIQ+ EP N++ +HGE M+ + ++++F + F P NG + + D
Sbjct: 418 SLIQHIEPDNIVFLHGERQGMLSMASFVQEQFQIPTFCPHNGSTVSIPID 467
>gi|67624341|ref|XP_668453.1| ENSANGP00000013258 [Cryptosporidium hominis TU502]
gi|54659666|gb|EAL38233.1| ENSANGP00000013258 [Cryptosporidium hominis]
Length = 750
Score = 327 bits (837), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 180/414 (43%), Positives = 250/414 (60%), Gaps = 40/414 (9%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V+V ++SIVYTGD+NM DRHLG A I K P LLI+ESTYAT IR S+R ER F
Sbjct: 182 MFHVQVSDESIVYTGDFNMVRDRHLGPALIPKLLPSLLISESTYATYIRPSRRSTERTFC 241
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
+ V+ C+ RGGKVLIPVFA+GRAQELCILLE YW RM ++ PI+F +TEKA +YY++F
Sbjct: 242 EMVYSCLKRGGKVLIPVFAIGRAQELCILLEIYWRRMQIRFPIFFGGSMTEKANSYYQLF 301
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
WTN T + N+F F H+ P+DKS + GP V+FATPGMLH+GLSL FK WAP
Sbjct: 302 TNWTN-----TPLADNIFTFPHVLPYDKSILTLSGPAVLFATPGMLHTGLSLQAFKMWAP 356
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQ---IIDVKMAIEYMSFSAHADAKGIM 237
NN+ I+PGFCV GTIG K+++G K++ + K I+V+ ++Y+SFS+HAD+ GI
Sbjct: 357 DSNNLTIIPGFCVSGTIGSKIINGAKRVYIDQKDPTTCIEVRCNVKYLSFSSHADSIGIQ 416
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANG--------ESC--FVQTD 287
LI + EP+++ VHGE M+ L I EF + F P NG E+C
Sbjct: 417 SLISHSEPQSIAFVHGERHGMLNLASFINSEFKIPTFCPHNGSITTLPTKENCKYVCYLS 476
Query: 288 MKISIDVSVNLLKEEAVKYNSEPPNPLKERQIHGVLVIK---DSSISLMD--VDEAC--- 339
+K S+ ++N ++ V N LK R I I+ + SISL++ C
Sbjct: 477 IKYSLINAINSIQNSFV-------NKLKARNIIEKFRIRQLNNPSISLLENSAHSICHLR 529
Query: 340 ----KEVGISRHIVRFTSNVQISDSSPINKTLELIYDQLLSYLQDKSQEYKIQL 389
+ GI + + + +D + LI ++L +YL+ KS+ ++ +L
Sbjct: 530 SLLIEYYGIPKLFPIYIFTSKYTDHK---HSFILIKNELYTYLRGKSKIFQKKL 580
>gi|66357778|ref|XP_626067.1| CPSF metallobeta-lactamase [Cryptosporidium parvum Iowa II]
gi|46227299|gb|EAK88249.1| CPSF metallobeta-lactamase [Cryptosporidium parvum Iowa II]
Length = 751
Score = 325 bits (832), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 152/283 (53%), Positives = 197/283 (69%), Gaps = 8/283 (2%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V+V ++SIVYTGD+NM DRHLG A I K P LLI+ESTYAT IR S+R ER F
Sbjct: 184 MFHVQVSDESIVYTGDFNMVRDRHLGPALIPKLLPSLLISESTYATYIRPSRRSTERTFC 243
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
+ V+ C+ RGGKVLIPVFA+GRAQELCILLE YW RM ++ PI+F +TEKA +YY++F
Sbjct: 244 EMVYSCLKRGGKVLIPVFAIGRAQELCILLEIYWRRMQIRFPIFFGGSMTEKANSYYQLF 303
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
WTN T + N+F F H+ P+DKS + GP V+FATPGMLH+GLSL FK WAP
Sbjct: 304 TNWTN-----TPLADNIFTFPHVLPYDKSILTLSGPAVLFATPGMLHTGLSLQAFKMWAP 358
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQ---IIDVKMAIEYMSFSAHADAKGIM 237
NN+ I+PGFCV GTIG K+++G K++ + K I+V+ ++Y+SFS+HAD+ GI
Sbjct: 359 DSNNLTIIPGFCVSGTIGSKIINGAKRVYIDQKDPTTCIEVRCNVKYLSFSSHADSIGIQ 418
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGE 280
LI + EP+++ VHGE M+ L I EF + F P NG
Sbjct: 419 SLISHSEPQSIAFVHGERHGMLNLASFINSEFKIPTFCPHNGS 461
>gi|378756880|gb|EHY66904.1| cleavage and polyadenylation specificity factor subunit 3
[Nematocida sp. 1 ERTm2]
Length = 501
Score = 324 bits (831), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 146/281 (51%), Positives = 202/281 (71%), Gaps = 1/281 (0%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V+VG++S+VYTGDYNMTPDRHL AW+ K P++LITESTYA +RD +R +ER+F+
Sbjct: 163 MFHVRVGDESVVYTGDYNMTPDRHLDGAWMPKVYPNVLITESTYALLVRDCRREKEREFI 222
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
+ V +CV GGKVLIPVFALGRA ELC+LL+T+WE+ L PIY + LT KA + YK F
Sbjct: 223 ESVVQCVKNGGKVLIPVFALGRAHELCLLLDTHWEKSKLSIPIYTSATLTHKANDIYKQF 282
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
I +T++ IR T +RN+FDF+H++ FD + GPM++F++PGMLHSG SL IFKKW
Sbjct: 283 IDYTHEHIRNTMHKRNLFDFQHVKQFDSNLASLEGPMILFSSPGMLHSGPSLSIFKKWCG 342
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
NM+I PG+CV+GTIG +VL+G ++E + V+M ++ M FSAHAD KGI+ L+
Sbjct: 343 DPKNMVIFPGYCVRGTIGERVLNGASQIEV-GGIVYPVRMKVKNMPFSAHADQKGILSLV 401
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGES 281
Q CEP+N++LVHG+ +M +K ++ + P G +
Sbjct: 402 QQCEPENIILVHGDIMRMKRMKNILETTLGIPTLHPPIGAT 442
>gi|145507230|ref|XP_001439570.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124406765|emb|CAK72173.1| unnamed protein product [Paramecium tetraurelia]
Length = 484
Score = 322 bits (825), Expect = 7e-85, Method: Compositional matrix adjust.
Identities = 157/303 (51%), Positives = 217/303 (71%), Gaps = 13/303 (4%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF VK N+ YN DRHLG A+ID+ +PDL+I+ESTYAT IRDSKR RER+FL
Sbjct: 156 MFHVKYKNEI------YNSIADRHLGGAYIDQLQPDLVISESTYATIIRDSKRTRERNFL 209
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
V E +DRGGKVLIPVFALGRAQEL +LLE+YW+R +A +Y+A GL EKA YYK+F
Sbjct: 210 MHVQEVLDRGGKVLIPVFALGRAQELSVLLESYWQRTKCKAGLYYAAGLIEKANQYYKIF 269
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
W ++KI+++F+ N+F+FK+I+PFD++ I +P MV+ ATPGMLH GLS+ +FK+W
Sbjct: 270 TGWESEKIQQSFLDDNIFNFKYIQPFDRNLIKSPLSMVLLATPGMLHGGLSMQVFKEWCG 329
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
NN L++PG+CV GT+G+K+L+GVK + +NK D+KM I MSFSAHAD++GI+QL+
Sbjct: 330 CANNSLVIPGYCVPGTLGNKLLAGVKHVRIDNKD-YDIKMQINNMSFSAHADSRGIIQLV 388
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKI--KQEFNLDCFMPANGESCFVQTDMKISIDVSVNL 298
+ +PK++ VHGE +M L +I K + +DC PAN E ++ + +SID + +
Sbjct: 389 NHLQPKSIAFVHGEYHRMKTLGAEIIDKIQIRVDC--PANFE--LLEYEKILSIDRPLTI 444
Query: 299 LKE 301
E
Sbjct: 445 KGE 447
>gi|221482308|gb|EEE20663.1| RNA-metabolising metallo-beta-lactamase domain-containing protein,
putative [Toxoplasma gondii GT1]
Length = 1090
Score = 320 bits (821), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 144/307 (46%), Positives = 205/307 (66%), Gaps = 7/307 (2%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF++K+G S+VYTGD+N PDRHLG+A + RPD+LI+E TYA+ +R SKR ERDF
Sbjct: 316 MFELKLGKASVVYTGDFNTIPDRHLGSAALPCLRPDVLISECTYASFVRPSKRTVERDFC 375
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
VH+ + +GGKVLIPVFA+GRAQELC+LLE YWERM+L+ PIYFA G+TE+A YY+++
Sbjct: 376 AVVHDTLTKGGKVLIPVFAVGRAQELCMLLENYWERMHLRFPIYFAGGMTERANAYYRLY 435
Query: 121 ITWT----NQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFK 176
+ W+ N R F F HI PF S + P P+V+ ATPGMLH GL+L K
Sbjct: 436 VHWSKADANVDADPEDALRTAFSFPHILPFQSSLLSAPTPLVLLATPGMLHGGLALKALK 495
Query: 177 KWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGI 236
W ++++PG+CV+GT+G +++G +++ + ++VK I YMSFSAHAD GI
Sbjct: 496 AWGGDPATLVLLPGYCVRGTVGAMLIAGQRQIPLDGHATLNVKCRIRYMSFSAHADTTGI 555
Query: 237 MQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMK---ISID 293
QLI +P+NV+LVHGE M L I+++FN+ + PA G++ + D + + +
Sbjct: 556 QQLIVNTQPRNVILVHGEKEGMAKLAAVIRRDFNIPVYTPATGQTISIPVDRRERVVPVY 615
Query: 294 VSVNLLK 300
V +N L+
Sbjct: 616 VHLNFLR 622
>gi|221055463|ref|XP_002258870.1| RNA-metabolising metallo-beta-lactamase [Plasmodium knowlesi strain
H]
gi|193808940|emb|CAQ39643.1| RNA-metabolising metallo-beta-lactamase,putative [Plasmodium
knowlesi strain H]
Length = 914
Score = 318 bits (816), Expect = 9e-84, Method: Compositional matrix adjust.
Identities = 141/292 (48%), Positives = 203/292 (69%), Gaps = 4/292 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
+++++V N S++YTGDYN PD+HLG+ I P++ I+ESTYAT +R +++ E D
Sbjct: 239 IYKIEVNNFSVIYTGDYNTVPDKHLGSTKIPSLNPEIFISESTYATYVRPTRKASELDLC 298
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
VHECV +GGKVLIPVFA+GRAQEL ILL++YW +M + PIYF GLTE A YY+++
Sbjct: 299 NLVHECVHKGGKVLIPVFAIGRAQELSILLDSYWRKMKINYPIYFGCGLTENANKYYRIY 358
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
+W N T ++N+FDF +I PF +++D PMV+FATPGMLH+GLSL FK WA
Sbjct: 359 SSWVNSNCVST-DKKNLFDFANISPFVNNYLDENRPMVLFATPGMLHTGLSLKAFKAWAG 417
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
NN++++PG+CVQGT+GHK++ G +K+ F+ ++V I Y+SFSAHAD+ GI QLI
Sbjct: 418 SSNNLIVLPGYCVQGTVGHKLIMGERKISFDGSSYLNVACRIIYLSFSAHADSNGIQQLI 477
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESC---FVQTDMK 289
++ P+NVL VHGE + M L + I + ++ P+ G+ C F + +MK
Sbjct: 478 RHVLPQNVLFVHGEKNGMEKLSKHISSNYLINSLCPSLGQHCEFNFSKGNMK 529
>gi|221502797|gb|EEE28511.1| RNA-metabolising metallo-beta-lactamase domain-containing protein,
putative [Toxoplasma gondii VEG]
Length = 1072
Score = 317 bits (811), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 140/285 (49%), Positives = 195/285 (68%), Gaps = 4/285 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF++K+G S+VYTGD+N PDRHLG+A + RPD+LI+E TYA+ +R SKR ERDF
Sbjct: 308 MFELKLGKASVVYTGDFNTIPDRHLGSAALPCLRPDVLISECTYASFVRPSKRTVERDFC 367
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
VH+ + +GGKVLIPVFA+GRAQELC+LLE YWERM+L+ PIYFA G+TE+A YY+++
Sbjct: 368 AVVHDTLTKGGKVLIPVFAVGRAQELCMLLENYWERMHLRFPIYFAGGMTERANAYYRLY 427
Query: 121 ITWT----NQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFK 176
+ W+ N R F F HI PF S + P P+V+ ATPGMLH GL+L K
Sbjct: 428 VHWSKADANVDADPEDALRTAFSFPHILPFQSSLLSAPTPLVLLATPGMLHGGLALKALK 487
Query: 177 KWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGI 236
W ++++PG+CV+GT+G +++G +++ + ++VK I YMSFSAHAD GI
Sbjct: 488 AWGGDPATLVLLPGYCVRGTVGAMLIAGQRQIPLDGHATLNVKCRIRYMSFSAHADTTGI 547
Query: 237 MQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGES 281
QLI +P+NV+LVHGE M L I+++FN+ + PA G++
Sbjct: 548 QQLIVNTQPRNVILVHGEKEGMAKLAAVIRRDFNIPVYTPATGQT 592
>gi|14198352|gb|AAH08240.1| Cpsf3l protein [Mus musculus]
Length = 302
Score = 316 bits (810), Expect = 4e-83, Method: Compositional matrix adjust.
Identities = 151/300 (50%), Positives = 213/300 (71%), Gaps = 7/300 (2%)
Query: 137 MFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGT 196
MF+FKHI+ FD++F DNPGPMVVFATPGMLH+G SL IF+KWA E NM+IMPG+CVQGT
Sbjct: 1 MFEFKHIKAFDRTFADNPGPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIMPGYCVQGT 60
Query: 197 IGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEAS 256
+GHK+LSG +KLE E +Q+++VKM +EYMSFSAHADAKGIMQL+ EP++VLLVHGEA
Sbjct: 61 VGHKILSGQRKLEMEGRQMLEVKMQVEYMSFSAHADAKGIMQLVGQAEPESVLLVHGEAK 120
Query: 257 KMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLKEEAVKYNSEPPNPLKE 316
KM FL++KI+QEF + C+MPANGE+ + T I + +S+ LLK E V+ P K
Sbjct: 121 KMEFLRQKIEQEFRVSCYMPANGETVTLPTSPSIPVGISLGLLKREMVQ--GLLPEAKKP 178
Query: 317 RQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDSSPINKTLELIYDQLLS 376
R +HG L++KDS+ L+ ++A KE+G++ H +RFT V + D+ +T +Y L S
Sbjct: 179 RLLHGTLIMKDSNFRLVSSEQALKELGLAEHQLRFTCRVHLQDTRKEQETALRVYSHLKS 238
Query: 377 YLQDKSQEYKIQLTESSLSIDSVLL--SVERIDDKNKRVFVTWPNQDEEVGKIVLHVLKS 434
L+D ++ L + S++++S+L+ + D K + V+W QDEE+G + +LK+
Sbjct: 239 TLKDHCVQH---LPDGSVTVESILIQAAAHSEDPGTKVLLVSWTYQDEELGSFLTTLLKN 295
>gi|237842097|ref|XP_002370346.1| RNA-metabolising metallo-beta-lactamase domain-containing protein
[Toxoplasma gondii ME49]
gi|211968010|gb|EEB03206.1| RNA-metabolising metallo-beta-lactamase domain-containing protein
[Toxoplasma gondii ME49]
Length = 1089
Score = 316 bits (810), Expect = 4e-83, Method: Compositional matrix adjust.
Identities = 140/285 (49%), Positives = 195/285 (68%), Gaps = 4/285 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF++K+G S+VYTGD+N PDRHLG+A + RPD+LI+E TYA+ +R SKR ERDF
Sbjct: 308 MFELKLGKASVVYTGDFNTIPDRHLGSAALPCLRPDVLISECTYASFVRPSKRTVERDFC 367
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
VH+ + +GGKVLIPVFA+GRAQELC+LLE YWERM+L+ PIYFA G+TE+A YY+++
Sbjct: 368 AVVHDTLTKGGKVLIPVFAVGRAQELCMLLENYWERMHLRFPIYFAGGMTERANAYYRLY 427
Query: 121 ITWT----NQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFK 176
+ W+ N R F F HI PF S + P P+V+ ATPGMLH GL+L K
Sbjct: 428 VHWSKADANVDADPEDALRTAFSFPHILPFQSSLLSAPTPLVLLATPGMLHGGLALKALK 487
Query: 177 KWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGI 236
W ++++PG+CV+GT+G +++G +++ + ++VK I YMSFSAHAD GI
Sbjct: 488 AWGGDPATLVLLPGYCVRGTVGAMLIAGQRQIPLDGHATLNVKCRIRYMSFSAHADTTGI 547
Query: 237 MQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGES 281
QLI +P+NV+LVHGE M L I+++FN+ + PA G++
Sbjct: 548 QQLIVNTQPRNVILVHGEKEGMAKLAAVIRRDFNIPVYTPATGQT 592
>gi|389583415|dbj|GAB66150.1| RNA-metabolising metallo-beta-lactamase domain containing protein
[Plasmodium cynomolgi strain B]
Length = 713
Score = 315 bits (807), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 139/292 (47%), Positives = 202/292 (69%), Gaps = 4/292 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
+F+++V N S++YTGDYN PD+HLG+ I P++ I+ESTYAT +R +++ E D
Sbjct: 239 IFKIEVNNFSVIYTGDYNTVPDKHLGSTKIPSLTPEIFISESTYATYVRPTRKASELDLC 298
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
VHECV +GGKVLIPVFA+GRAQEL ILL++YW +M + PIYF GLTE A YY+++
Sbjct: 299 NLVHECVHKGGKVLIPVFAIGRAQELSILLDSYWRKMKINYPIYFGCGLTENANKYYRIY 358
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
+W N T ++N+FDF +I PF +++ PMV+FATPGMLH+GLSL FK WA
Sbjct: 359 SSWVNSSCVST-DKKNLFDFANISPFVNNYLGENRPMVLFATPGMLHTGLSLKAFKAWAG 417
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
N++++PG+CVQGT+GHK++ G +K+ F+ +++ I Y+SFSAHAD+ GI QLI
Sbjct: 418 SSKNLIVLPGYCVQGTVGHKLIMGERKISFDGNSYMNIACKIIYLSFSAHADSNGIQQLI 477
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESC---FVQTDMK 289
++ P+NVL VHGE + M L + I + ++ P+ G+ C F +++MK
Sbjct: 478 RHVLPQNVLFVHGEKNGMEKLSKHISSHYLINSLCPSLGQHCEFNFSKSNMK 529
>gi|327408312|emb|CCA30123.1| unnamed protein product [Neospora caninum Liverpool]
Length = 1183
Score = 314 bits (805), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 144/308 (46%), Positives = 209/308 (67%), Gaps = 8/308 (2%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF++K+GN S+VYTGD+N PDRHLG+A + RPD+LI+E TYA+ +R SKR ERDF
Sbjct: 437 MFELKIGNTSVVYTGDFNTIPDRHLGSASLPCLRPDVLISECTYASFVRPSKRTVERDFC 496
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
VH+ + +GGKVLIPVFA+GRAQELC+LLE YWERM+L PIYFA G+TE+A YY+++
Sbjct: 497 AVVHDTLTKGGKVLIPVFAVGRAQELCMLLENYWERMHLHFPIYFAGGMTERANVYYRLY 556
Query: 121 ITWT--NQKIRKTF---VQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIF 175
+ W+ N + + + F F HI PF S + P P+V+ ATPGMLH GL+L
Sbjct: 557 VHWSKANGSVDAGAGDELPTSAFSFPHILPFQSSLLSAPTPLVLLATPGMLHGGLALKAL 616
Query: 176 KKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKG 235
K WA + N++++PG+CV+GT+G +++G +++ + ++VK I YMSFSAHAD+ G
Sbjct: 617 KAWAGDQANLVLLPGYCVRGTVGAMLIAGQRQIPLDGHATLNVKCRIRYMSFSAHADSLG 676
Query: 236 IMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMK---ISI 292
I QLI +P++V+LVHGE M L I+++FN + PA G++ + + + +
Sbjct: 677 IQQLILNTQPRSVVLVHGEKDGMEKLANVIRRDFNTPVYTPATGQTISIPVLRRQRVVPV 736
Query: 293 DVSVNLLK 300
V +N L+
Sbjct: 737 YVHLNFLR 744
>gi|156096985|ref|XP_001614526.1| RNA-metabolising metallo-beta-lactamase domain containing protein
[Plasmodium vivax Sal-1]
gi|148803400|gb|EDL44799.1| RNA-metabolising metallo-beta-lactamase domain containing protein
[Plasmodium vivax]
Length = 911
Score = 313 bits (801), Expect = 4e-82, Method: Compositional matrix adjust.
Identities = 140/292 (47%), Positives = 201/292 (68%), Gaps = 4/292 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
+F+++V N S++YTGDYN PD+HLG+ I P++ I+ESTYAT +R +++ E D
Sbjct: 239 IFKIEVNNFSVIYTGDYNTVPDKHLGSTKIPSLTPEIFISESTYATYVRPTRKASELDLC 298
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
VHECV +GGKVLIPVFA+GRAQEL ILL++YW++M + PIYF GLTE A YY+++
Sbjct: 299 NLVHECVHKGGKVLIPVFAIGRAQELSILLDSYWKKMKINYPIYFGCGLTENANKYYRIY 358
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
+W N T ++N+FDF +I PF S++ PMV+FATPGMLH+GLSL FK W+
Sbjct: 359 SSWVNSSCVST-DKKNLFDFANISPFVNSYLGENRPMVLFATPGMLHTGLSLKAFKAWSG 417
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
N++++PG+CVQGT+GHK++ G +K+ F+ ++V I Y+SFSAHAD+ GI QLI
Sbjct: 418 CSKNLIVLPGYCVQGTVGHKLIMGERKISFDGNTYLNVACRIIYLSFSAHADSNGIQQLI 477
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESC---FVQTDMK 289
++ P+NVL VHGE M L + I + ++ P+ G+ C F + +MK
Sbjct: 478 RHVLPQNVLFVHGEKHGMEKLSKHIASHYLINSLCPSLGQHCEFSFSKGNMK 529
>gi|7020719|dbj|BAA91246.1| unnamed protein product [Homo sapiens]
Length = 292
Score = 312 bits (800), Expect = 5e-82, Method: Compositional matrix adjust.
Identities = 150/291 (51%), Positives = 207/291 (71%), Gaps = 7/291 (2%)
Query: 137 MFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGT 196
MF+FKHI+ FD++F DNPGPMVVFATPGMLH+G SL IF+KWA E NM+IMPG+CVQGT
Sbjct: 1 MFEFKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIMPGYCVQGT 60
Query: 197 IGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEAS 256
+GHK+LSG +KLE E +Q+++VKM +EYMSFSAHADAKGIMQL+ EP++VLLVHGEA
Sbjct: 61 VGHKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLVGQAEPESVLLVHGEAK 120
Query: 257 KMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLKEEAVKYNSEPPNPLKE 316
KM FLK+KI+QE ++C+MPANGE+ + T I + +S+ LLK E + P K
Sbjct: 121 KMEFLKQKIEQELRVNCYMPANGETVTLPTSPSIPVGISLGLLKREMAQ--GLLPEAKKP 178
Query: 317 RQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDSSPINKTLELIYDQLLS 376
R +HG L++KDS+ L+ ++A KE+G++ H +RFT V + D+ +T +Y L S
Sbjct: 179 RLLHGTLIMKDSNFRLVSSEQALKELGLAEHQLRFTCRVHLHDTRKEQETALRVYSHLKS 238
Query: 377 YLQDKSQEYKIQLTESSLSIDSVLL--SVERIDDKNKRVFVTWPNQDEEVG 425
L+D ++ L + S++++SVLL + D K + V+W QDEE+G
Sbjct: 239 VLKDHCVQH---LPDGSVTVESVLLQAAAPSEDPGTKVLLVSWTYQDEELG 286
>gi|68077031|ref|XP_680435.1| cleavage and polyadenylation specificity factor protein [Plasmodium
berghei strain ANKA]
gi|56501360|emb|CAH96636.1| cleavage and polyadenylation specificity factor protein, putative
[Plasmodium berghei]
Length = 967
Score = 308 bits (788), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 138/282 (48%), Positives = 197/282 (69%), Gaps = 1/282 (0%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
M++++V N S++YTGDYN PD+HLG+ I P++ I+ESTYA+ +R +++ E +
Sbjct: 234 MYRLEVNNISVIYTGDYNTIPDKHLGSTKIPVLTPEIFISESTYASYVRPTRKSSELELC 293
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
V+ECV +GGKVLIPVFA+GRAQEL ILLE YWE+M + PIYF GLTE A YYK++
Sbjct: 294 NLVNECVHKGGKVLIPVFAIGRAQELSILLEEYWEKMKINCPIYFGCGLTENANKYYKIY 353
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
+W + T V +N+FDF +I F ++++ PMV+FATPGMLH+GL+L FK WA
Sbjct: 354 SSWISNNCVSTEV-KNLFDFSNISQFSNNYLNENRPMVLFATPGMLHTGLALKAFKAWAS 412
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
NN++I+PG+CVQGTIGHK++ G KK+ + I V I Y+SFSAHAD+ GI QLI
Sbjct: 413 NPNNLIILPGYCVQGTIGHKLIMGEKKILLDGSTYIYVNCKIIYLSFSAHADSNGIQQLI 472
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESC 282
++ PKNV+ VHG+ + M L + I +++++ P G++C
Sbjct: 473 KHVMPKNVIFVHGDKNGMEKLSKHISNQYHINSICPYMGQNC 514
>gi|223647718|gb|ACN10617.1| Integrator complex subunit 11 [Salmo salar]
Length = 343
Score = 307 bits (787), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 135/158 (85%), Positives = 152/158 (96%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
M Q+KVG++S+VYTGDYNMTPDRHLGAAWIDKCRPD+LI+ESTYATTIRDSKRCRERDFL
Sbjct: 163 MVQIKVGSESVVYTGDYNMTPDRHLGAAWIDKCRPDILISESTYATTIRDSKRCRERDFL 222
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHE V+RGGKVLIPVFALGRAQELCILLET+WERMN++APIYF+ GLTEKA +YYK+F
Sbjct: 223 KKVHETVERGGKVLIPVFALGRAQELCILLETFWERMNMKAPIYFSTGLTEKANHYYKLF 282
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMV 158
ITWTNQKIRKTFVQRNMF+FKHI+ FD+S+ DNPGPMV
Sbjct: 283 ITWTNQKIRKTFVQRNMFEFKHIKAFDRSYADNPGPMV 320
>gi|82704800|ref|XP_726704.1| hypothetical protein [Plasmodium yoelii yoelii 17XNL]
gi|23482224|gb|EAA18269.1| hypothetical protein [Plasmodium yoelii yoelii]
Length = 954
Score = 307 bits (786), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 137/282 (48%), Positives = 197/282 (69%), Gaps = 1/282 (0%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
M++++V N S++YTGDYN PD+HLG+ I P++ I+ESTYA+ +R +++ E +
Sbjct: 234 MYRLEVNNISVIYTGDYNTIPDKHLGSTKIPVLTPEIFISESTYASYVRPTRKSSELELC 293
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
V+ECV +GGKVLIPVFA+GRAQEL ILLE YWE+M + PIYF GLTE A YYK++
Sbjct: 294 NLVNECVHKGGKVLIPVFAIGRAQELSILLEEYWEKMKINCPIYFGCGLTENANKYYKIY 353
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
+W + T V +N+FDF +I F ++++ PMV+FATPGMLH+GL+L FK WA
Sbjct: 354 SSWISNNCVSTEV-KNLFDFSNISQFSNNYLNENRPMVLFATPGMLHTGLALKAFKAWAS 412
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
NN++I+PG+CVQGTIGHK++ G KK+ + + V I Y+SFSAHAD+ GI QLI
Sbjct: 413 NPNNLIILPGYCVQGTIGHKLIMGEKKILLDGNTYVYVNCKIIYLSFSAHADSNGIQQLI 472
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESC 282
++ PKNV+ VHG+ + M L + I +++++ P G++C
Sbjct: 473 KHVMPKNVIFVHGDKNGMEKLSKHISNQYHINSICPYMGQNC 514
>gi|124505029|ref|XP_001351256.1| cleavage and polyadenylation specificity factor protein, putative
[Plasmodium falciparum 3D7]
gi|3758842|emb|CAB11127.1| cleavage and polyadenylation specificity factor protein, putative
[Plasmodium falciparum 3D7]
Length = 1017
Score = 306 bits (783), Expect = 6e-80, Method: Compositional matrix adjust.
Identities = 137/282 (48%), Positives = 194/282 (68%), Gaps = 1/282 (0%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
+++++V N S++YTGDYN PD+HLG+A I P++ I+ESTYAT +R +K+ E +
Sbjct: 299 IYKIEVRNFSVIYTGDYNTIPDKHLGSANIPSLNPEIFISESTYATYVRPTKKASELELC 358
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
VHECV +GGKVLIPVFA+GRAQEL ILL+ YW++M + PIYF GLTE A YYK++
Sbjct: 359 NLVHECVHKGGKVLIPVFAIGRAQELSILLDDYWKKMKIHYPIYFGCGLTENANKYYKIY 418
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
+W N + N+FDF +I PF ++++ PMV+FATPGMLH+GLSL FK WA
Sbjct: 419 SSWINSSCMSN-EKENLFDFANISPFLNNYLNEKRPMVLFATPGMLHTGLSLKAFKAWAG 477
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
N++++PG+CVQGT+GHK++ G K++ + I V I Y+SFSAHAD+ GI QLI
Sbjct: 478 NPQNLIVLPGYCVQGTVGHKLIMGEKQISLDGTTYIKVLCKIIYLSFSAHADSNGIQQLI 537
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESC 282
++ PKNV+ VHGE + M L + I + ++ P+ G+ C
Sbjct: 538 KHVSPKNVIFVHGEKNGMQKLAKYISNKHMINSMCPSLGQQC 579
>gi|9280039|dbj|BAB01576.1| unnamed protein product [Macaca fascicularis]
Length = 328
Score = 305 bits (780), Expect = 1e-79, Method: Composition-based stats.
Identities = 136/167 (81%), Positives = 150/167 (89%)
Query: 20 TPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFLKKVHECVDRGGKVLIPVFA 79
TP AAWIDKCRP+LLITESTYATTIRDSKRCRERDFLKKVHE V+RGGKVLIPVFA
Sbjct: 118 TPPPCTRAAWIDKCRPNLLITESTYATTIRDSKRCRERDFLKKVHETVERGGKVLIPVFA 177
Query: 80 LGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMFITWTNQKIRKTFVQRNMFD 139
LGRAQELCILLET+WERMNL+ PIYF+ GLTEKA +YYK+FI WTNQKIRKTFVQRNMF+
Sbjct: 178 LGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLFIPWTNQKIRKTFVQRNMFE 237
Query: 140 FKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAPVENNML 186
FKHI+ FD++F DNPGPMVVFATPGMLH+G SL IF+KWA E NM+
Sbjct: 238 FKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAGNEKNMV 284
>gi|414881433|tpg|DAA58564.1| TPA: putative RNA-metabolising metallo-beta-lactamase [Zea mays]
Length = 400
Score = 301 bits (772), Expect = 9e-79, Method: Compositional matrix adjust.
Identities = 138/229 (60%), Positives = 176/229 (76%), Gaps = 2/229 (0%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
M KVG+ ++VYTGDYNMTPDRHLGAA ID+ + D+LITESTYA +IRDSK RER+FL
Sbjct: 165 MIYAKVGDAAMVYTGDYNMTPDRHLGAAQIDRLKLDVLITESTYAKSIRDSKPAREREFL 224
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
K VH+CV GGKVLIP FALGRAQELC+LL+ YWERM L+ PIYF+ GLT +A YYKM
Sbjct: 225 KAVHKCVSGGGKVLIPTFALGRAQELCMLLDDYWERMGLKVPIYFSAGLTIQANVYYKML 284
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
I WT+QKI+ + N FDFKH+ F++SFI+NPGP V+FATPGM+ G SL FKKWAP
Sbjct: 285 IGWTSQKIKDSHTVHNPFDFKHVCHFERSFINNPGPCVLFATPGMITGGFSLEAFKKWAP 344
Query: 181 VENNMLIMPGFCVQGTIGHKVLSG-VKKLEFENKQIIDVKMAIEYMSFS 228
E N++ +PG+CV GTIGHK++ G ++++E+ IDV+ +++ S
Sbjct: 345 SEKNLVTLPGYCVSGTIGHKLMCGKPTRVDYEDTH-IDVRCQVDFHVIS 392
>gi|269860949|ref|XP_002650191.1| cleavage and polyadenylation specificity factor subunit
[Enterocytozoon bieneusi H348]
gi|220066365|gb|EED43849.1| cleavage and polyadenylation specificity factor subunit
[Enterocytozoon bieneusi H348]
Length = 501
Score = 300 bits (769), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 137/301 (45%), Positives = 207/301 (68%), Gaps = 1/301 (0%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V+ ++++VYTGD++ PDR+L AA ID PDLLITESTY +RD ++ +ER+ +
Sbjct: 161 MFFVRYLDETVVYTGDFSTIPDRYLRAATIDCLYPDLLITESTYGNIVRDLRKSKEREMI 220
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
VH+ +D GGKVLIP+FALGRAQE+C+LL+ Y ER+ L PIYF GL +K + Y F
Sbjct: 221 MAVHKTIDIGGKVLIPIFALGRAQEICLLLKNYCERIQLSVPIYFTTGLIDKINDIYLKF 280
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
++TN+ + + R++ + K ++PF+K ++++PGPM++FATP ML +G SL IFK
Sbjct: 281 ASYTNESLEQPLKIRSILNSKFVKPFEKEYLNSPGPMIIFATPAMLINGPSLNIFKSICH 340
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
N +I+PG+C +GTIG K+++G+K++E K I D+ M + +S S HAD GI+++I
Sbjct: 341 DSKNTIILPGYCSKGTIGEKIINGLKRIEI-GKNIYDINMKVYNISLSGHADMTGILKII 399
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
+ C+P NV+LVHG+ KM LKEKIK EF++ + P N + T+ KI I + N ++
Sbjct: 400 EQCKPSNVMLVHGDKGKMNILKEKIKVEFDIPVYYPPNYTLIEIPTNKKIPISIEKNFIQ 459
Query: 301 E 301
+
Sbjct: 460 K 460
>gi|414881434|tpg|DAA58565.1| TPA: putative RNA-metabolising metallo-beta-lactamase [Zea mays]
Length = 400
Score = 299 bits (766), Expect = 6e-78, Method: Compositional matrix adjust.
Identities = 137/220 (62%), Positives = 172/220 (78%), Gaps = 2/220 (0%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
M KVG+ ++VYTGDYNMTPDRHLGAA ID+ + D+LITESTYA +IRDSK RER+FL
Sbjct: 165 MIYAKVGDAAMVYTGDYNMTPDRHLGAAQIDRLKLDVLITESTYAKSIRDSKPAREREFL 224
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
K VH+CV GGKVLIP FALGRAQELC+LL+ YWERM L+ PIYF+ GLT +A YYKM
Sbjct: 225 KAVHKCVSGGGKVLIPTFALGRAQELCMLLDDYWERMGLKVPIYFSAGLTIQANVYYKML 284
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
I WT+QKI+ + N FDFKH+ F++SFI+NPGP V+FATPGM+ G SL FKKWAP
Sbjct: 285 IGWTSQKIKDSHTVHNPFDFKHVCHFERSFINNPGPCVLFATPGMITGGFSLEAFKKWAP 344
Query: 181 VENNMLIMPGFCVQGTIGHKVLSG-VKKLEFENKQIIDVK 219
E N++ +PG+CV GTIGHK++ G ++++E+ IDV+
Sbjct: 345 SEKNLVTLPGYCVSGTIGHKLMCGKPTRVDYEDTH-IDVR 383
>gi|428671580|gb|EKX72498.1| cleavage and polyadenylation specificity factor, putative [Babesia
equi]
Length = 656
Score = 293 bits (749), Expect = 5e-76, Method: Compositional matrix adjust.
Identities = 134/279 (48%), Positives = 191/279 (68%), Gaps = 2/279 (0%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V+ S++YTGD+N PD+HLG A + + PD+LI ESTYAT +R ++ E +
Sbjct: 234 MFHVECDGFSVLYTGDFNTVPDKHLGPAKVPRLCPDVLICESTYATVVRQPRKATEMELC 293
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
VH+ + +GGKVLIPVFA+GRAQEL I+L++YW ++ L+ PIYF GL+EKATNYYK+
Sbjct: 294 TVVHDTLLKGGKVLIPVFAVGRAQELAIILDSYWSKLELKYPIYFGGGLSEKATNYYKLH 353
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
WTN+ + N F +I+PFD +++ PMV+FATPGM+H+GLSL K WAP
Sbjct: 354 SCWTNEH-NIPGLNENTFSMSYIQPFDNGYLNENRPMVLFATPGMVHAGLSLRACKLWAP 412
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
NN++++PG+CVQGT+G+K++SG K ++ + I+VK + Y+SFSAHAD+ GI+QL
Sbjct: 413 NPNNLIVIPGYCVQGTVGNKLISGEKVIQ-TSAGPINVKCKVRYLSFSAHADSAGIIQLA 471
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANG 279
+ PKN+LLVHGE+ M + + + PANG
Sbjct: 472 RQVSPKNILLVHGESESMKKFSKHLNHILGVPVHCPANG 510
>gi|156082980|ref|XP_001608974.1| RNA-metabolising metallo-beta-lactamase and metallo-beta-lactamase
superfamily domain containing protein [Babesia bovis
T2Bo]
gi|154796224|gb|EDO05406.1| RNA-metabolising metallo-beta-lactamase and metallo-beta-lactamase
superfamily domain containing protein [Babesia bovis]
Length = 760
Score = 291 bits (745), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 137/286 (47%), Positives = 196/286 (68%), Gaps = 4/286 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V+ S++YTGD+NMTPD+HLG A + PD++I ESTYA+ IR ++R E +
Sbjct: 255 MFLVECDGISVLYTGDFNMTPDKHLGPARVPSLNPDIMICESTYASIIRQARRSTEMELC 314
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
VH+C+ GGKVLIPVFA+GRAQEL I+L+TYW ++ L+ PIYF GL+E+AT+YYK+
Sbjct: 315 TVVHDCLLAGGKVLIPVFAVGRAQELAIILDTYWSKLQLRFPIYFGGGLSERATSYYKLH 374
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
WT+ + + N F +H+ PF+ SF+ PMV+FATPGM+HSGLSL K WAP
Sbjct: 375 SLWTDSR-NIPNMGDNCFSLEHMLPFENSFLTEDRPMVLFATPGMVHSGLSLKACKLWAP 433
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
N++++PG+ VQGT G+K++SG K ++ + +VK + Y+SFSAHAD+ GIM+LI
Sbjct: 434 NPKNLIVVPGYAVQGTPGNKLISGEKVIKTSSGS-FEVKCKVRYLSFSAHADSAGIMRLI 492
Query: 241 QYC--EPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFV 284
++ +PKN++LVHGE M + + + N+ F PANG+ V
Sbjct: 493 KHTKVQPKNLVLVHGEYDGMKKFAKHVNMDVNIPVFHPANGQEIAV 538
>gi|71027091|ref|XP_763189.1| hypothetical protein [Theileria parva strain Muguga]
gi|68350142|gb|EAN30906.1| hypothetical protein TP03_0171 [Theileria parva]
Length = 678
Score = 291 bits (744), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 137/307 (44%), Positives = 202/307 (65%), Gaps = 8/307 (2%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF + ++YTGD+N PD+HLG A + PD+LI ESTYAT +R SKR E +
Sbjct: 256 MFLSECDGFKVLYTGDFNTIPDKHLGPAKVPTLEPDVLICESTYATFVRQSKRATEMELC 315
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
VH+ + GGKVLIPVFA+GRAQEL I+L YW +++ PIYF GL+EKATNYYK+
Sbjct: 316 TTVHDTLINGGKVLIPVFAVGRAQELAIILNNYWNNLSISFPIYFGGGLSEKATNYYKLH 375
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
+WTN T ++ N F +++ FD+SF+++ PMV+FATPGM+H+GLSL K W+
Sbjct: 376 SSWTNNN-SITNLRENPFSLRNLLQFDQSFLNDNRPMVLFATPGMVHTGLSLKACKLWSQ 434
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
NN++++PG+CVQGT+G+K+++G K ++ N ++++K + Y+SFSAHAD+ GI+QLI
Sbjct: 435 NPNNLILIPGYCVQGTVGNKLIAGEKTIK-TNIGVMNIKCKVRYLSFSAHADSPGILQLI 493
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK 300
++ PKN++ VHGE M + I + + P NG++ +K + D S N ++
Sbjct: 494 KHIRPKNIVFVHGELESMKRFSKHINNTLKIPVYYPCNGQT------IKFTKDTSKNDIR 547
Query: 301 EEAVKYN 307
E V N
Sbjct: 548 EVYVHPN 554
>gi|403223285|dbj|BAM41416.1| uncharacterized protein TOT_030000678 [Theileria orientalis strain
Shintoku]
Length = 706
Score = 288 bits (736), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 134/293 (45%), Positives = 195/293 (66%), Gaps = 14/293 (4%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF + ++YTGD+N PD+HLG A + PD+LI E+TYAT +R SK+ E +
Sbjct: 238 MFLAESDGFKVLYTGDFNTVPDKHLGPAKVPSLEPDVLICETTYATFVRQSKKATEVELC 297
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
VH+ + GGKVLIPVFA+GRAQEL I+L YW ++L PIYF GL+EKATNYYK+
Sbjct: 298 NLVHDTLINGGKVLIPVFAVGRAQELAIILNNYWNNLSLLFPIYFGGGLSEKATNYYKLH 357
Query: 121 ITWT-NQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWA 179
+WT N I K ++ N F +++ FD+SF+++ PMV+FATPGM+H+GLSL K W+
Sbjct: 358 SSWTDNNNISK--LKENPFAMENLLQFDQSFLNDNRPMVLFATPGMVHTGLSLKACKIWS 415
Query: 180 PVENNMLIMPGFCVQGTIGHKVLSGVKKLEFE-----------NKQIIDVKMAIEYMSFS 228
N++++PG+CVQGT+G+K++SG K E++ N +I++K ++Y+SFS
Sbjct: 416 SNPKNLILIPGYCVQGTVGNKLISGTKGREYKIYTATTICIKTNTGVINIKCKVKYLSFS 475
Query: 229 AHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGES 281
AHAD+ GI++LI++ PKN++ VHGE M + I N+ + PANGE+
Sbjct: 476 AHADSPGILKLIKHVRPKNIVFVHGELDSMKKFSKHITSTLNIPVYYPANGET 528
>gi|85000301|ref|XP_954869.1| hypothetical protein [Theileria annulata strain Ankara]
gi|65303015|emb|CAI75393.1| hypothetical protein, conserved [Theileria annulata]
Length = 663
Score = 283 bits (723), Expect = 5e-73, Method: Compositional matrix adjust.
Identities = 131/288 (45%), Positives = 193/288 (67%), Gaps = 8/288 (2%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF + ++YTGD+N PD+HLG A + PD+LI ESTYAT +R SKR E +
Sbjct: 256 MFLSECDGFKVLYTGDFNTIPDKHLGPAKVPTLEPDVLICESTYATFVRQSKRATEMELC 315
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
VHE + GGKVLIPVFA+GRAQEL I+L YW ++L PIYF GL+EKATNYYK+
Sbjct: 316 TTVHETLINGGKVLIPVFAVGRAQELAIILNNYWNNLSLSFPIYFGGGLSEKATNYYKLH 375
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
+WTN T ++ N F +++ FD+SF+++ PMV+FATPGM+H+GLSL K W+
Sbjct: 376 SSWTNNN-NITNLRENPFSLRNLLQFDQSFLNDNRPMVLFATPGMVHTGLSLKACKLWSQ 434
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVK-KLEFENKQI------IDVKMAIEYMSFSAHADA 233
+N++++PG+CVQGT+G+K+++G + + ++ K I +++K + Y+SFSAHAD+
Sbjct: 435 NPSNLILIPGYCVQGTVGNKLIAGNESQYQYREKSIKTNIGVMNIKCKVRYLSFSAHADS 494
Query: 234 KGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGES 281
GI+QLI++ PKN++ VHGE M + I + + P+NG++
Sbjct: 495 PGILQLIKHIRPKNIVFVHGELESMKRFSKHINSTLKIPVYYPSNGQT 542
>gi|332862132|ref|XP_001152447.2| PREDICTED: LOW QUALITY PROTEIN: integrator complex subunit 11,
partial [Pan troglodytes]
Length = 332
Score = 282 bits (721), Expect = 9e-73, Method: Compositional matrix adjust.
Identities = 125/143 (87%), Positives = 136/143 (95%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+KVG++S+VYTGDYNMTPDRHLGAAWIDKCRP+LLITESTYATTIRDSKRCRERDFL
Sbjct: 25 MFQIKVGSESVVYTGDYNMTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFL 84
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHE V+RGGKVLIPVFALGRAQELCILLET+WERMNL+ PIYF+ GLTEKA +YYK+F
Sbjct: 85 KKVHETVERGGKVLIPVFALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLF 144
Query: 121 ITWTNQKIRKTFVQRNMFDFKHI 143
I WTNQKI KTFVQRNMF FKHI
Sbjct: 145 IPWTNQKIHKTFVQRNMFXFKHI 167
Score = 77.8 bits (190), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 43/125 (34%), Positives = 72/125 (57%), Gaps = 5/125 (4%)
Query: 311 PNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDSSPINKTLELI 370
P K R +HG L++KDS+ L+ ++A KE+G++ H +RFT V + D+ +T +
Sbjct: 203 PEAKKPRLLHGTLIMKDSNFRLVSSEQALKELGLAEHQLRFTCRVHLHDTRKEQETALRV 262
Query: 371 YDQLLSYLQDKSQEYKIQLTESSLSIDSVLL--SVERIDDKNKRVFVTWPNQDEEVGKIV 428
Y L S L+D ++ L + S++++SVLL + D K + V+W QDEE+G +
Sbjct: 263 YSHLKSVLKDHCVQH---LPDGSVTVESVLLQAAAPSEDPGTKVLLVSWTYQDEELGSFL 319
Query: 429 LHVLK 433
+LK
Sbjct: 320 TSLLK 324
>gi|399216826|emb|CCF73513.1| unnamed protein product [Babesia microti strain RI]
Length = 646
Score = 271 bits (694), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 137/309 (44%), Positives = 192/309 (62%), Gaps = 29/309 (9%)
Query: 1 MFQVKVGNQSIVYTG---------------DYNMTPDRHLGAAWIDKCRPDLLITESTYA 45
+F V+ +Q +VYTG D+N D+HLG A I K PD+LI ESTYA
Sbjct: 201 IFAVRYKSQLVVYTGTNSFNAIRQKTIQLGDFNTMSDKHLGPAKIPKLEPDVLICESTYA 260
Query: 46 TTIRDSKRCRERDFLKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYF 105
T +R S+R E + K V + +D GGKVLIPVFA+GRAQEL I+LE +W+R+NL PIYF
Sbjct: 261 TIVRPSRRSAEVELCKAVKDTLDHGGKVLIPVFAVGRAQELAIILECFWKRVNLNYPIYF 320
Query: 106 AVGLTEKATNYYKMFITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGM 165
A G++E+A+ YYK+ + +F+ I FD FI+ PMV+FATPGM
Sbjct: 321 AGGMSERASTYYKL------HSYALMDLDGQLFESTLISAFDHDFINEKRPMVLFATPGM 374
Query: 166 LHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYM 225
L+ GLSL + K WAP +N++I+PG+C+QGT+G++++ G K ++ N +ID+K I Y+
Sbjct: 375 LNGGLSLSVCKAWAPDPHNLIIIPGYCIQGTVGNRLIMGEKLIKTVNG-VIDIKCKIRYL 433
Query: 226 SFSAHADAKGIMQ-------LIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPAN 278
SFSAHAD+ GI Q +I Y PKN++LVHGE + IK EF + F P
Sbjct: 434 SFSAHADSAGIQQFINHVSLIITYIRPKNIILVHGERDGIQKFARHIKSEFGIPVFCPQT 493
Query: 279 GESCFVQTD 287
G+S ++T+
Sbjct: 494 GQSITIKTE 502
>gi|14165488|gb|AAH08041.1| CPSF3L protein [Homo sapiens]
gi|18044933|gb|AAH20199.1| CPSF3L protein [Homo sapiens]
Length = 389
Score = 271 bits (694), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 117/154 (75%), Positives = 139/154 (90%)
Query: 87 CILLETYWERMNLQAPIYFAVGLTEKATNYYKMFITWTNQKIRKTFVQRNMFDFKHIRPF 146
CILLET+WERMNL+ PIYF+ GLTEKA +YYK+FI WTNQKIRKTFVQRNMF+FKHI+ F
Sbjct: 129 CILLETFWERMNLKVPIYFSTGLTEKANHYYKLFIPWTNQKIRKTFVQRNMFEFKHIKAF 188
Query: 147 DKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVK 206
D++F DNPGPMVVFATPGMLH+G SL IF+KWA E NM+IMPG+CVQGT+GHK+LSG +
Sbjct: 189 DRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAGNEKNMVIMPGYCVQGTVGHKILSGQR 248
Query: 207 KLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLI 240
KLE E +Q+++VKM +EYMSFSAHADAKGIMQL+
Sbjct: 249 KLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLV 282
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 34/106 (32%), Positives = 60/106 (56%), Gaps = 5/106 (4%)
Query: 330 ISLMDVDEACKEVGISRHIVRFTSNVQISDSSPINKTLELIYDQLLSYLQDKSQEYKIQL 389
+ L+ ++A KE+G++ H +RFT V + D+ +T +Y L S L+D ++ L
Sbjct: 279 MQLVSSEQALKELGLAEHQLRFTCRVHLHDTRKEQETALRVYSHLKSVLKDHCVQH---L 335
Query: 390 TESSLSIDSVLL--SVERIDDKNKRVFVTWPNQDEEVGKIVLHVLK 433
+ S++++SVLL + D K + V+W QDEE+G + +LK
Sbjct: 336 PDGSVTVESVLLQAAAPSEDPGTKVLLVSWTYQDEELGSFLTSLLK 381
>gi|193661985|ref|XP_001944814.1| PREDICTED: uncharacterized protein C12orf26-like isoform 1
[Acyrthosiphon pisum]
gi|328722221|ref|XP_003247513.1| PREDICTED: uncharacterized protein C12orf26-like isoform 2
[Acyrthosiphon pisum]
Length = 496
Score = 271 bits (692), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 159/445 (35%), Positives = 255/445 (57%), Gaps = 40/445 (8%)
Query: 617 SQVVAAVTNSCDSSHIIDLGST-----ITVDELFSKLQARKCGLVVESAKVSQLMSEKKS 671
+Q+ ++T+ S+ +L S + +D+ ++ L++ + S + + M KK
Sbjct: 63 NQIPDSLTHFVTSTKYNNLNSYPSSCYMKLDDAYNVLKSWGYKPMNNSLNLKEFMGAKKM 122
Query: 672 YEVQVMSQVVAAVTNSCDSSHIIDLGGGQGYLSTILALQHGKKTLSLDYNQVNTHGAAVR 731
+EV+VM+++VA + + IID+GGG+GYLS++LAL + L +D +N+ GA R
Sbjct: 123 HEVEVMAKLVADFSKYAGTDVIIDIGGGKGYLSSLLALAYNFNVLGIDSQSINSEGARNR 182
Query: 732 SKKLEDSEKESKGPFQSYA------GVINKKLWMRTQNRQTSHASVEHHGKNWKRKSKAP 785
+ K E + K ++ +I +KL + E + + + RKS
Sbjct: 183 TIKFEKYWQSIKKKSNNFPEEKLTPPIIKEKL-----------DTYEEYYEKYSRKS--- 228
Query: 786 VVSIKDEELVVCKDKCKQITHFVTPDSDISSILSQAYPQDSLHNVCIMGLHTCGDLSGTA 845
V KQIT +VT +++I+ + + ++ ++GLHTCG L T
Sbjct: 229 ---------AVLGSTYKQITEYVTENTNIAHLAQTEFEVNNKEEFTLIGLHTCGSLGTTC 279
Query: 846 LRLFT--KSSLQCLVQVGCCYHLLEEEFIRSPFWKDVDQSLYEH-GYGFPLSEHLRSRKF 902
L+LFT K++L+ LV VGCCYHL+EEEF PFW D++ L ++ YGFP+S LR +KF
Sbjct: 280 LKLFTEPKNNLKLLVNVGCCYHLIEEEFTNDPFWDDIENELNKNESYGFPVSNCLREKKF 339
Query: 903 FLGRNVRMSGTQSPERVIDLKQ--TQTLPLFYRALLEKYLRSKITINDEEPKVVGRLATK 960
LGR+ RM +QSP++ + K T PLFYRALL+ YL ++ ++ VGRLA K
Sbjct: 340 QLGRDARMCASQSPDKFFNSKDIINSTKPLFYRALLQVYLVQELGHDNISRCHVGRLAHK 399
Query: 961 CSNFVEYVHRAVDKLKLDLEVDDEEVTRLFNSHQREYEYLQIYYFLKTALAPVIEALIVL 1020
C F EYVH+AV +L+L++E+D+E++ ++ + EYE L++++ LK ALA VIE LI++
Sbjct: 400 CLTFNEYVHKAVKRLQLNIEIDNEKINAFYDKYLIEYERLKVFFILKQALAQVIEGLIIM 459
Query: 1021 DRVLYLREQQ-KPQIISNLFLSLLT 1044
DR+LYL EQ ++ LF LL+
Sbjct: 460 DRLLYLHEQGINEAFVAKLFDPLLS 484
Score = 63.2 bits (152), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 36/156 (23%), Positives = 78/156 (50%), Gaps = 7/156 (4%)
Query: 482 MELIKNTIDEMIEFITPNIACVNTHTVHFMVEKLWETTIPKAILSQVNNSPSFDSLLEEF 541
ME +++ ++E+I ++ P + N+H V+F+ WET IP+ I +++ + + + F
Sbjct: 1 MEQLRSKLEEVIIYLKPYLPIANSHVVNFITHNFWETIIPRDIQEEIDLKGT-EFVFSNF 59
Query: 542 WKSRQDNHVETNSELVKFFQAADKFRLTSLMNGESIITVDELFSKLQARKCGQVVESAKV 601
W Q + + F + K+ + + +D+ ++ L++ + S +
Sbjct: 60 WSENQI------PDSLTHFVTSTKYNNLNSYPSSCYMKLDDAYNVLKSWGYKPMNNSLNL 113
Query: 602 SQLMSEKKSYEVQVMSQVVAAVTNSCDSSHIIDLGS 637
+ M KK +EV+VM+++VA + + IID+G
Sbjct: 114 KEFMGAKKMHEVEVMAKLVADFSKYAGTDVIIDIGG 149
>gi|148683101|gb|EDL15048.1| cleavage and polyadenylation specific factor 3-like, isoform CRA_a
[Mus musculus]
Length = 207
Score = 267 bits (683), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 119/134 (88%), Positives = 130/134 (97%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+KVG++S+VYTGDYNMTPDRHLGAAWIDKCRP+LLITESTYATTIRDSKRCRERDFL
Sbjct: 74 MFQIKVGSESVVYTGDYNMTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFL 133
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHE V+RGGKVLIPVFALGRAQELCILLET+WERMNL+ PIYF+ GLTEKA +YYK+F
Sbjct: 134 KKVHETVERGGKVLIPVFALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLF 193
Query: 121 ITWTNQKIRKTFVQ 134
ITWTNQKIRKTFVQ
Sbjct: 194 ITWTNQKIRKTFVQ 207
>gi|390367057|ref|XP_784887.2| PREDICTED: integrator complex subunit 11-like, partial
[Strongylocentrotus purpuratus]
Length = 355
Score = 267 bits (682), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 127/209 (60%), Positives = 154/209 (73%), Gaps = 29/209 (13%)
Query: 95 ERMNLQAPIYFAVGLTEKATNYYKMFITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNP 154
ERMNL+APIYF+ GLTE+A +YYK+FITWTNQKI+KTFV+RN+F+F+HI+PFD+ +I NP
Sbjct: 173 ERMNLKAPIYFSTGLTERANHYYKLFITWTNQKIKKTFVKRNLFEFQHIKPFDRGYIHNP 232
Query: 155 GPMVVFATPGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQ 214
GPMVVFATPGMLHSGLSL IFK+WA E N+ I
Sbjct: 233 GPMVVFATPGMLHSGLSLQIFKQWASGEKNIHI--------------------------- 265
Query: 215 IIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCF 274
DVK++++YMSFSAHADAKGIMQLI+ CEPKNV+LVHGEA KM FLK+KI +EF +DCF
Sbjct: 266 --DVKLSVQYMSFSAHADAKGIMQLIRMCEPKNVVLVHGEAGKMAFLKDKICKEFKIDCF 323
Query: 275 MPANGESCFVQTDMKISIDVSVNLLKEEA 303
MPANGES + + DVS+ LLK A
Sbjct: 324 MPANGESINIPCKTNLMADVSLGLLKRSA 352
>gi|159487337|ref|XP_001701679.1| predicted protein [Chlamydomonas reinhardtii]
gi|158280898|gb|EDP06654.1| predicted protein [Chlamydomonas reinhardtii]
Length = 460
Score = 243 bits (621), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 117/291 (40%), Positives = 179/291 (61%), Gaps = 12/291 (4%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-------C---RPDLLITESTYATTIRD 50
M + G+ + +YTGD+N PDRHLG+A + C PD+LI+E+TYA ++RD
Sbjct: 159 MVTMTAGHLTALYTGDFNSAPDRHLGSAELAAGGAGPAGCLMREPDVLISEATYAASLRD 218
Query: 51 SKRCRERDFLKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLT 110
SKR RERD L+ V + V GGKVLIP FA+GRAQEL +LL W R L PIYF+ +
Sbjct: 219 SKRGRERDLLQAVEDTVAAGGKVLIPTFAMGRAQELLMLLADCWRRKGLTVPIYFSSAMA 278
Query: 111 EKATNYYKMFITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGL 170
+A YY++ + WTN +RK V+ +++ RP+D+S + PGP V+FA+PG + SG+
Sbjct: 279 SRALTYYQLLLNWTNANVRKA-VEADVYGMFRTRPWDRSLLQAPGPAVLFASPGNITSGV 337
Query: 171 SLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAH 230
SL F+ WA N++++ G+ V+G G + L+ + V+ ++ ++FSAH
Sbjct: 338 SLEAFRAWAGSSRNLVVLAGYQVRGEWGGR-LTNRSWSQAHAHAHAQVRCRVKMLAFSAH 396
Query: 231 ADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGES 281
AD +G+M L++ C P+ V+LVHG+ M FL+ +I+ ++C P G +
Sbjct: 397 ADLRGLMGLVRRCRPRAVVLVHGQREPMEFLRGRIEATLGIECHAPPTGST 447
>gi|119576645|gb|EAW56241.1| cleavage and polyadenylation specific factor 3-like, isoform CRA_h
[Homo sapiens]
Length = 123
Score = 242 bits (617), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 107/123 (86%), Positives = 118/123 (95%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+KVG++S+VYTGDYNMTPDRHLGAAWIDKCRP+LLITESTYATTIRDSKRCRERDFL
Sbjct: 1 MFQIKVGSESVVYTGDYNMTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFL 60
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
KKVHE V+RGGKVLIPVFALGRAQELCILLET+WERMNL+ PIYF+ GLTEKA +YYK+F
Sbjct: 61 KKVHETVERGGKVLIPVFALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLF 120
Query: 121 ITW 123
I W
Sbjct: 121 IPW 123
>gi|302846726|ref|XP_002954899.1| hypothetical protein VOLCADRAFT_65253 [Volvox carteri f.
nagariensis]
gi|300259874|gb|EFJ44098.1| hypothetical protein VOLCADRAFT_65253 [Volvox carteri f.
nagariensis]
Length = 477
Score = 237 bits (605), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 120/308 (38%), Positives = 180/308 (58%), Gaps = 16/308 (5%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKC------------RPDLLITESTYATTI 48
M + G+ + +YTGD+N +PDRHLG A PD+LI+E+TYA T+
Sbjct: 172 MVHMTAGHLTALYTGDFNSSPDRHLGPAEAPLALLQGGPSGASVRHPDVLISEATYAATL 231
Query: 49 RDSKRCRERDFLKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVG 108
RDSKR RERD L V E V GGKVLIP FA+GRAQEL +L+ WER LQ PIYF+
Sbjct: 232 RDSKRARERDLLGAVVETVAAGGKVLIPTFAMGRAQELLMLITDCWERNGLQVPIYFSSA 291
Query: 109 LTEKATNYYKMFITWTNQK-IRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLH 167
+ +A YY++ + WTN I FV ++ HI + PGP ++FA+PG +
Sbjct: 292 MAARALVYYQLLLNWTNANHIHCVFVNVHICVCTHIH--TTWMMLAPGPALLFASPGNIA 349
Query: 168 SGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLS-GVKKLEFENKQIIDVKMAIEYMS 226
SG++L F+ WA N+L++ G+ V+G GH L G ++ ++V+ I+ ++
Sbjct: 350 SGVALEAFRSWAGSSKNLLVLAGYQVRGGAGHDNLCGGRSGGRSGGERRVEVRCRIKMLA 409
Query: 227 FSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQT 286
FSAHAD +G++ L++ C+P+ V+LVHG+ M FL+ +I+ ++C PA G + +
Sbjct: 410 FSAHADLRGLLGLVRRCKPRAVVLVHGQREPMEFLRGRIETHVGIECHAPATGTTVTIAP 469
Query: 287 DMKISIDV 294
+ + V
Sbjct: 470 RRAVPLGV 477
>gi|339237605|ref|XP_003380357.1| cleavage and polyadenylation specificity factor subunit 3
[Trichinella spiralis]
gi|316976818|gb|EFV60027.1| cleavage and polyadenylation specificity factor subunit 3
[Trichinella spiralis]
Length = 687
Score = 237 bits (604), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 153/450 (34%), Positives = 233/450 (51%), Gaps = 38/450 (8%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ I+YTGDY+ DRHL AA + RPD+LI ESTY T I +++ RE F
Sbjct: 172 MFMIEIAGVRILYTGDYSRLEDRHLCAAEVPSIRPDVLIAESTYGTQIHENREDREHRFT 231
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWER-MNLQ-APIYFAVGLTEKATNYYK 118
V+ V RGG+ LIPVFALGRAQEL ++L+ +W + LQ PI+FA L +K Y+
Sbjct: 232 SMVYTIVSRGGRCLIPVFALGRAQELLLILDEFWTKHAELQNIPIFFASSLAKKCMAVYQ 291
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
FI+ NQ I+K +N F FKH+R F ++ GP VV A+PGML SGLS +F+
Sbjct: 292 TFISGMNQNIQKQIAVQNPFLFKHVRSLRSIDFFEDIGPCVVLASPGMLQSGLSRELFEM 351
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W N I+ G+CV+GT+ +LS K++ N + ++M + Y+SFSAHAD K
Sbjct: 352 WCTDTKNGCIIAGYCVEGTLAKHILSEPKEIVGLNGAKLALRMQVAYVSFSAHADYKQTS 411
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLD-----CFMPANGESCFVQTDMKISI 292
+ I+ +P N++ VHGEA++M+ LK I +E+ D F P N E
Sbjct: 412 EFIRRLKPPNLVFVHGEATEMIRLKAAIMREYEDDPTCMQSFSPRNTEP----------- 460
Query: 293 DVSVNLLKEEAVKYNSEPPNPLKERQ--IHGVLVIKDSSISLMDVDEACKEVGISRHIVR 350
VS++ E+ K + E + GV++ ++ LM ++ +S+ +
Sbjct: 461 -VSLHFRGEKTTKVVGQMATKKAEHGDVVSGVMIRRNFDYHLMHSNDVSSYTDLSKGNIL 519
Query: 351 FTSNVQISDSSPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLLSVERIDDKN 410
T NV + + LL+Y ++ K+++ E + LL ++
Sbjct: 520 QTENVFFNG-----------HLTLLNYFLNEVTAGKLEILEEYKDGEGKLLRAFKV--IT 566
Query: 411 KRVFVTW---PNQDEEVGKIVLHVLKSMSN 437
RV + W P D IV VL++ +
Sbjct: 567 IRVKIPWFSSPVNDMYADTIVTAVLEAQTG 596
>gi|156552097|ref|XP_001605081.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
3-like [Nasonia vitripennis]
Length = 688
Score = 231 bits (589), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 153/444 (34%), Positives = 238/444 (53%), Gaps = 32/444 (7%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ I+YTGD++ DRHL AA I PD+LITESTY T I + + RE F
Sbjct: 174 MFMIEIAGVKILYTGDFSRQEDRHLMAAEIPNVHPDVLITESTYGTHIHEKREDREGRFT 233
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VHE V+RGG+ LIPVFALGRAQEL ++L+ YW + + PIY+A L +K Y+
Sbjct: 234 NLVHEIVNRGGRCLIPVFALGRAQELLLILDEYWSQHPELHEIPIYYASSLAKKCMAVYQ 293
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ N KIR+ N F FKHI D+ GP VV A+PGM+ SGLS +F+
Sbjct: 294 TYVNAMNDKIRRQIAINNPFVFKHISNLKGIDHFDDIGPCVVMASPGMMQSGLSRELFES 353
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W N +I+ G+CV+GT+ +LS +++ + Q + +KM+++Y+SFSAH D +
Sbjct: 354 WCTDAKNGVIIAGYCVEGTLAKTILSEPEEITTMSGQKLSLKMSVDYISFSAHTDYQQTS 413
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLD--CFMPANGESCFVQTDMKISIDVS 295
+ I+ +P +V+LVHGE ++M LK +++E+ D M + V ++ + +
Sbjct: 414 EFIRILKPPHVILVHGEQNEMSRLKAALQREYEDDPSTAMEIHNPRNTVAVELYFRGEKT 473
Query: 296 VNLLKEEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNV 355
++ A+ E P P +++ GVLV ++ + ++D + K +S V +V
Sbjct: 474 AKVMGALAM----ETPKP--GQKLSGVLVKRNFNYHMLDPCDLSKYTDMSISQVIQRQSV 527
Query: 356 QISDSSP--------INKTLELIYDQLLSYLQ--DKSQEYKIQLTE------SSLSIDSV 399
S S P I TL++I D+ L + D + E KI E + + D+V
Sbjct: 528 YYSASLPTLKHLLTQIAGTLDIIDDKKLRVFKNVDLTIEGKIVTMEWLATPVNDMYADAV 587
Query: 400 ---LLSVERIDDKNKRVFVTWPNQ 420
LL E +D K +T P +
Sbjct: 588 LTALLQAETMDSNPK--MLTAPTK 609
>gi|391348443|ref|XP_003748457.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
3-like [Metaseiulus occidentalis]
Length = 673
Score = 231 bits (588), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 118/291 (40%), Positives = 179/291 (61%), Gaps = 12/291 (4%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ I+YTGD++ DRHL +A I +PD+LI ESTY T I + ++ RE F
Sbjct: 162 MFFIEIAGVKILYTGDFSRQEDRHLMSAEIPSVKPDVLIIESTYGTHIHEKRQDREHRFT 221
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW----ERMNLQAPIYFAVGLTEKATNY 116
V E V RGG+ LIPVFALGRAQEL ++L+ YW E ++ PIY+A L +K
Sbjct: 222 HLVQEIVTRGGRCLIPVFALGRAQELLLILDEYWGLHPELHDI--PIYYASSLAKKCMAV 279
Query: 117 YKMFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIF 175
Y+ ++ N++IR+ N F FKHI D+ GP V+ ATPGM+ SGLS +F
Sbjct: 280 YQTYVNAMNERIRRQIAISNPFVFKHISNLKSIDHFDDVGPCVIMATPGMMQSGLSRELF 339
Query: 176 KKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKG 235
+ W N +I+ G+CV+GT+ ++LS +++ N Q + +KM+++Y+SFSAH D +
Sbjct: 340 EAWCGDTKNGVIIAGYCVEGTLAKQILSEPQEVTSMNGQKMPLKMSVDYISFSAHTDYQQ 399
Query: 236 IMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF-----NLDCFMPANGES 281
+ I+ +P N++LVHGE ++M LK I++E+ +D + PANG +
Sbjct: 400 TSEFIRALKPPNIILVHGEQNEMSRLKAAIEREYEGEDLKMDVYNPANGHA 450
>gi|313244184|emb|CBY15021.1| unnamed protein product [Oikopleura dioica]
Length = 690
Score = 231 bits (588), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 148/472 (31%), Positives = 246/472 (52%), Gaps = 43/472 (9%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFLK 61
F V++ +++TGD++ DRHL AA + +PD+LI ESTY T + + + RE F
Sbjct: 163 FMVEIAGVKVLFTGDFSREEDRHLMAAEVPPQKPDILIMESTYGTHLHEKREEREHRFTS 222
Query: 62 KVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYKM 119
+H+ ++RGG+ LIPVFALGRAQEL ++L+ YW + PIY+A L +K + Y+
Sbjct: 223 VIHDIINRGGRCLIPVFALGRAQELLLILDDYWAQHPELHDIPIYYASTLAKKCMSVYQT 282
Query: 120 FITWTNQKIRKTFVQRNMFDFKHIRPFD--KSFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
+ N KI+K RN F F+HI ++F D+ GP VV A+PGM+ SGLS +F+K
Sbjct: 283 YTNAMNSKIQKAITTRNPFQFRHISNLKGMEAFDDDIGPSVVLASPGMMQSGLSRELFEK 342
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W + N +I+ G+ V+GT+ H++ + ++ + Q + +KM +EYMSFSAHAD + I
Sbjct: 343 WCTNKRNGVILAGYAVEGTLAHQIKTEPDEIVTMSGQKLPLKMQVEYMSFSAHADYRQIS 402
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDC------FMPANGESCFVQTDMKIS 291
+ ++ +P +++LVHGEA++M LK ++ E+ D MP N E
Sbjct: 403 EFVREIKPPHIVLVHGEANEMGRLKRQLDIEYEHDAETDITIHMPRNAEK---------- 452
Query: 292 IDVSVNLLKEEAVKYNSEPPN--PLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIV 349
V + E+ K + P +E +I G+LV K+ + ++ ++ + H
Sbjct: 453 --VKFHFRGEKNAKVVGSLAHYLPKQEDEIEGILVRKNFNYKIVAAED------LKTHAE 504
Query: 350 RFTSNVQISDSSPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLLSVERI--D 407
T+ V+ S N +L L YL+ S Y ++ + ++ RI D
Sbjct: 505 LETTQVKQRLSVDFNAPADL----LKFYLEQLS--YDVKEFAKNTKCGFIVFRCIRIVYD 558
Query: 408 DKNKRVFVTW---PNQDEEVGKIVLHVLK--SMSNTQGQPFEFYPYTDTVFV 454
++N V + W P D ++ V++ + + +P +P VF+
Sbjct: 559 EQNNMVILEWKSTPTADAYADSVLAMVIRVDTEAKVTAKPQIIHPDGGEVFL 610
>gi|313216448|emb|CBY37756.1| unnamed protein product [Oikopleura dioica]
Length = 690
Score = 231 bits (588), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 148/472 (31%), Positives = 246/472 (52%), Gaps = 43/472 (9%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFLK 61
F V++ +++TGD++ DRHL AA + +PD+LI ESTY T + + + RE F
Sbjct: 163 FMVEIAGVKVLFTGDFSREEDRHLMAAEVPPQKPDILIMESTYGTHLHEKREEREHRFTS 222
Query: 62 KVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYKM 119
+H+ ++RGG+ LIPVFALGRAQEL ++L+ YW + PIY+A L +K + Y+
Sbjct: 223 VIHDIINRGGRCLIPVFALGRAQELLLILDDYWAQHPELHDIPIYYASTLAKKCMSVYQT 282
Query: 120 FITWTNQKIRKTFVQRNMFDFKHIRPFD--KSFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
+ N KI+K RN F F+HI ++F D+ GP VV A+PGM+ SGLS +F+K
Sbjct: 283 YTNAMNSKIQKAITTRNPFQFRHISNLKGMEAFDDDIGPSVVLASPGMMQSGLSRELFEK 342
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W + N +I+ G+ V+GT+ H++ + ++ + Q + +KM +EYMSFSAHAD + I
Sbjct: 343 WCTNKRNGVILAGYAVEGTLAHQIKTEPDEIVTMSGQKLPLKMQVEYMSFSAHADYRQIS 402
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDC------FMPANGESCFVQTDMKIS 291
+ ++ +P +++LVHGEA++M LK ++ E+ D MP N E
Sbjct: 403 EFVREIKPPHIVLVHGEANEMGRLKRQLDIEYEHDAETDITIHMPRNAEK---------- 452
Query: 292 IDVSVNLLKEEAVKYNSEPPN--PLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIV 349
V + E+ K + P +E +I G+LV K+ + ++ ++ + H
Sbjct: 453 --VKFHFRGEKNAKVVGSLAHYLPKQEDEIEGILVRKNFNYKIVAAED------LKTHAE 504
Query: 350 RFTSNVQISDSSPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLLSVERI--D 407
T+ V+ S N +L L YL+ S Y ++ + ++ RI D
Sbjct: 505 LETTQVKQRLSVDFNAPADL----LKFYLEQLS--YDVKEFAKNTKCGFIVFRCIRIVYD 558
Query: 408 DKNKRVFVTW---PNQDEEVGKIVLHVLK--SMSNTQGQPFEFYPYTDTVFV 454
++N V + W P D ++ V++ + + +P +P VF+
Sbjct: 559 EQNNMVILEWKSTPTADAYADSVLAMVIRVDTEAKVTAKPQIIHPDGGEVFL 610
>gi|307177772|gb|EFN66769.1| Cleavage and polyadenylation specificity factor subunit 3 [Camponotus
floridanus]
Length = 1750
Score = 230 bits (587), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 114/275 (41%), Positives = 171/275 (62%), Gaps = 3/275 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ I+YTGD++ DRHL AA I PD+LITESTY T I + + RE F
Sbjct: 1235 MFMIEIAGVKILYTGDFSRQEDRHLMAAEIPNIHPDVLITESTYGTHIHEKREDREGRFT 1294
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VHE V+RGG+ LIPVFALGRAQEL ++L+ YW + + + PIY+A L +K Y+
Sbjct: 1295 NLVHEIVNRGGRCLIPVFALGRAQELLLILDEYWSQHSELHEIPIYYASSLAKKCMAVYQ 1354
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ N KIR+ N F FKHI ++ GP VV A+PGM+ SGLS +F+
Sbjct: 1355 TYVNAMNDKIRRQIAINNPFVFKHISNLKGIDHFEDIGPCVVMASPGMMQSGLSRELFES 1414
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W N +I+ G+CV+GT+ +LS +++ + Q + +KM+++Y+SFSAH D +
Sbjct: 1415 WCTDAKNGVIIAGYCVEGTLAKTILSEPEEITTMSGQKLPLKMSVDYISFSAHTDYQQTS 1474
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLD 272
+ I+ +P +V+LVHGE ++M LK +++E+ D
Sbjct: 1475 EFIRTLKPPHVVLVHGEQNEMGRLKAALQREYEDD 1509
>gi|348686031|gb|EGZ25846.1| hypothetical protein PHYSODRAFT_478942 [Phytophthora sojae]
Length = 733
Score = 228 bits (580), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 130/339 (38%), Positives = 193/339 (56%), Gaps = 11/339 (3%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
M+ +++G ++YTGDY++ DRHL AA + C PD+LI ESTY + S RE F
Sbjct: 169 MYLIEIGGVKVLYTGDYSLENDRHLMAAELPACSPDVLIVESTYGVQVHQSVVEREGRFT 228
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWE-RMNLQ-APIYFAVGLTEKATNYYK 118
+V V RGG+ LIPVFALGR QEL ++L+ +W +LQ PIYFA L KA Y+
Sbjct: 229 GQVEAVVRRGGRCLIPVFALGRTQELLLILDEHWRSHPDLQDIPIYFASKLAAKALRVYQ 288
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDKSF--IDNPGPMVVFATPGMLHSGLSLIIFK 176
+I N +IRK N F F+HI KS D+ GP VV A+PGML SG+S +F+
Sbjct: 289 TYINMMNDRIRKQIAISNPFQFEHISNL-KSMDDFDDSGPSVVMASPGMLQSGVSRQLFE 347
Query: 177 KWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGI 236
+W + N ++PG+ V+GT+ K+LS ++ + +II + +EY+SFSAHAD G
Sbjct: 348 RWCSDKRNACLIPGYVVEGTLAKKILSEPTEIAALDGRIIPMNCTVEYISFSAHADFVGT 407
Query: 237 MQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFN-LDCFMPANGESCFVQTDMKISIDVS 295
++ P N++LVHGE ++M+ LK + ++FN + P+ +Q +I ++
Sbjct: 408 SGFVEKLTPPNIVLVHGEKNEMMRLKSALNKKFNDPKVYHPSISTPANMQ---EIVLEFK 464
Query: 296 VNLLKEEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMD 334
+ + S+ P + I G+LV DS LMD
Sbjct: 465 GEKIAKAIGGLASDQPK--NGKVISGLLVEVDSQTHLMD 501
>gi|320163324|gb|EFW40223.1| CPSF3 protein [Capsaspora owczarzaki ATCC 30864]
Length = 802
Score = 228 bits (580), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 118/290 (40%), Positives = 171/290 (58%), Gaps = 9/290 (3%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V++ ++YTGDY+ DRHL A I PD+L ESTY + + + RE F
Sbjct: 190 MFMVEIAGVKLLYTGDYSRHEDRHLMPAEIPTIAPDVLCVESTYGVRVHEPRVEREGRFT 249
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNL--QAPIYFAVGLTEKATNYYK 118
K VH+ V RGGK L+PVFALGRAQEL ++L+ +WE PIY+A L K Y+
Sbjct: 250 KDVHDIVMRGGKCLLPVFALGRAQELLLILDEFWESKPALHNIPIYYASSLARKCMAIYQ 309
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPF-DKSFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
+I N++IR+ F N F FKHI S ID GPMV+ A+PGML +GLS +F++
Sbjct: 310 TYINQMNERIRRQFAISNPFMFKHIASIKSASEIDQSGPMVMMASPGMLQNGLSRDLFEQ 369
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W P N +I+ G+ V+GT+ +LS K++ Q + ++M++ Y+SFSAHAD
Sbjct: 370 WCPDSRNGVIVTGYSVEGTLAKSILSAPKEVPSLTGQKLPLRMSVTYVSFSAHADFAQTS 429
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQ------EFNLDCFMPANGES 281
+ I +P +++LVHGEA++M LK + + E ++ PAN +S
Sbjct: 430 EFIDALKPPHIVLVHGEATEMGRLKAALTRKYEDHAEIKIEVHTPANAQS 479
>gi|301111988|ref|XP_002905073.1| cleavage and polyadenylation specificity factor subunit 3
[Phytophthora infestans T30-4]
gi|262095403|gb|EEY53455.1| cleavage and polyadenylation specificity factor subunit 3
[Phytophthora infestans T30-4]
Length = 724
Score = 227 bits (579), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 116/274 (42%), Positives = 168/274 (61%), Gaps = 5/274 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
M+ +++G ++YTGDY++ DRHL AA + C PD+LI ESTY + S RE F
Sbjct: 169 MYLIEIGGVKVLYTGDYSLENDRHLMAAELPACSPDVLIVESTYGVQVHQSVVEREGRFT 228
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWE-RMNLQ-APIYFAVGLTEKATNYYK 118
+V V RGG+ LIPVFALGR QEL ++L+ +W +LQ PIYFA L KA Y+
Sbjct: 229 GQVEAVVRRGGRCLIPVFALGRTQELLLILDEHWRSHPDLQDIPIYFASKLAAKALRVYQ 288
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDKSF--IDNPGPMVVFATPGMLHSGLSLIIFK 176
+I N +IRK N F F+HI KS D+ GP VV A+PGML SG+S +F+
Sbjct: 289 TYINMMNDRIRKQIAISNPFQFEHISNL-KSMDDFDDSGPSVVMASPGMLQSGVSRQLFE 347
Query: 177 KWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGI 236
+W + N ++PG+ V+GT+ K+LS ++ + +II + +EY+SFSAHAD G
Sbjct: 348 RWCSDKRNACLIPGYVVEGTLAKKILSEPTEIAALDGRIIPMNCTVEYISFSAHADFVGT 407
Query: 237 MQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFN 270
++ P N++LVHGE ++M+ LK + ++FN
Sbjct: 408 SGFVEKLTPPNIVLVHGEKNEMMRLKSALNKKFN 441
>gi|322786053|gb|EFZ12664.1| hypothetical protein SINV_01905 [Solenopsis invicta]
Length = 686
Score = 227 bits (578), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 147/434 (33%), Positives = 229/434 (52%), Gaps = 43/434 (9%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ I+YTGD++ DRHL AA I PD+LITESTY T I + + RE F
Sbjct: 171 MFMIEIAGVKILYTGDFSRQEDRHLMAAEIPNIHPDVLITESTYGTHIHEKREDREGRFT 230
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VHE V+RGG+ LIPVFALGRAQEL ++L+ YW + + + PIY+A L +K Y+
Sbjct: 231 NLVHEIVNRGGRCLIPVFALGRAQELLLILDEYWSQHSELHEIPIYYASSLAKKCMAVYQ 290
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ N KIR+ N F FKHI ++ GP VV A+PGM+ SGLS +F+
Sbjct: 291 TYVNAMNDKIRRQIAINNPFVFKHISNLKGIDHFEDIGPCVVMASPGMMQSGLSRELFES 350
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W N +I+ G+CV+GT+ +LS +++ + Q + +KM+++Y+SFSAH D +
Sbjct: 351 WCTDAKNGVIIAGYCVEGTLAKTILSEPEEITTMSGQKLPLKMSVDYISFSAHTDYQQTS 410
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLD--CFMPANGESCFVQTDMKISIDVS 295
+ I+ +P +V+LVHGE ++M LK +++E+ D M + V ++ + +
Sbjct: 411 EFIRTLKPPHVVLVHGEQNEMGRLKAALQREYEDDPSTTMEIHNPRNTVAVELYFRGEKT 470
Query: 296 VNLLKEEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNV 355
++ A+ E P P +++ GVLV ++ + ++ + K +S V +V
Sbjct: 471 AKVMGTLAM----ETPRP--GQKLSGVLVKRNFNYHMLAPCDLSKYTDMSMSQVIQRQSV 524
Query: 356 QISDSSPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLLSVERIDDKNKRVFV 415
S S P+ K L + + S+E +DDK RVF
Sbjct: 525 YFSASLPVLKHL----------------------------LTQIAGSLEVVDDKKLRVF- 555
Query: 416 TWPNQDEEV-GKIV 428
N D + GKIV
Sbjct: 556 --KNVDVTIDGKIV 567
>gi|444731702|gb|ELW72051.1| Cleavage and polyadenylation specificity factor subunit 3 [Tupaia
chinensis]
Length = 587
Score = 227 bits (578), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 135/399 (33%), Positives = 218/399 (54%), Gaps = 30/399 (7%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGD++ DRHL AA I +PD+LI ESTY T I + + RE F
Sbjct: 67 MFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPNIKPDILIIESTYGTHIHEKREEREARFC 126
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH+ V+RGG+ LIPVFALGRAQEL ++L+ YW+ PIY+A L +K Y+
Sbjct: 127 NTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQ 186
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ N KIRK N F FKHI D+ GP VV A+PGM+ SGLS +F+
Sbjct: 187 TYVNAMNDKIRKQININNPFVFKHISNLKSMDHFDDIGPSVVMASPGMMQSGLSRELFES 246
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W + N +I+ G+CV+GT+ ++S +++ + Q + +KM+++Y+SFSAH D +
Sbjct: 247 WCTDKRNGVIIAGYCVEGTLAKHIMSEPEEITTMSGQKLPLKMSVDYISFSAHTDYQQTS 306
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF------NLDCFMPANGESCFVQTDMKIS 291
+ I+ +P +V+LVHGE ++M LK + +E+ +++ P N E+
Sbjct: 307 EFIRALKPPHVILVHGEQNEMARLKAALIREYEDNDEVHIEVHNPRNTEA---------- 356
Query: 292 IDVSVNLLKEEAVKYNS--EPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIV 349
V++N E+ K P + +++ G+LV ++ + ++ C +S +
Sbjct: 357 --VTLNFRGEKLAKVMGFLADKKPEQGQRVSGILVKRNFNYHILS---PCD---LSNYTD 408
Query: 350 RFTSNVQISDSSPINKTLELIYDQLLSYLQDKSQEYKIQ 388
S V+ + + P + L+Y QL D +E +IQ
Sbjct: 409 LAMSTVKQTQAIPYTGSFNLLYYQLQKLTGD-VEELEIQ 446
>gi|443725897|gb|ELU13297.1| hypothetical protein CAPTEDRAFT_184406 [Capitella teleta]
Length = 668
Score = 227 bits (578), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 125/342 (36%), Positives = 195/342 (57%), Gaps = 19/342 (5%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGD++ DRHL AA I +PD+LITESTY T I + + RE F
Sbjct: 152 MFMIEIAGVKVLYTGDFSREEDRHLMAAEIPNVKPDVLITESTYGTHIHEPREEREGRFT 211
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW-ERMNLQ-APIYFAVGLTEKATNYYK 118
+ + V+RGG+ LIPVFALGRAQEL ++L+ YW + LQ PIY+A L +K + Y+
Sbjct: 212 SLISDIVNRGGRCLIPVFALGRAQELLLILDEYWSQHPELQDIPIYYASSLAKKCMSVYQ 271
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
+I N KI++ N F FKHI D+ GP VV A+PGM+ SGLS +F+
Sbjct: 272 TYINAMNDKIKRQINTNNPFVFKHISNLKSMEHFDDIGPSVVMASPGMMQSGLSRELFEN 331
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W + N I+ G+CV+GT+ +LS +++ Q + +K +++Y+SFSAH D K
Sbjct: 332 WCTDKRNGCIIAGYCVEGTLAKHILSEPEEIVTMGGQKLPLKCSVDYISFSAHTDYKQTS 391
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKI------KQEFNLDCFMPANGESCFVQTDMKIS 291
+ ++ +P +++LVHGEA++M LK + E+N+D F P N E
Sbjct: 392 EFVRILKPTHIVLVHGEANEMGRLKAALIREYEDDPEYNIDIFNPRNTE----------P 441
Query: 292 IDVSVNLLKEEAVKYNSEPPNPLKERQIHGVLVIKDSSISLM 333
+++ K + N P + +Q+ G+LV ++ + +M
Sbjct: 442 VELYFRGEKMAKIVGNLATDRPKQGQQLSGILVKRNFNYHIM 483
>gi|332019331|gb|EGI59837.1| Cleavage and polyadenylation specificity factor subunit 3
[Acromyrmex echinatior]
Length = 685
Score = 227 bits (578), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 114/275 (41%), Positives = 171/275 (62%), Gaps = 3/275 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ I+YTGD++ DRHL AA I PD+LITESTY T I + + RE F
Sbjct: 171 MFMIEIAGVKILYTGDFSRQEDRHLMAAEIPNIHPDVLITESTYGTHIHEKREDREGRFT 230
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VHE V+RGG+ LIPVFALGRAQEL ++L+ YW + + + PIY+A L +K Y+
Sbjct: 231 NLVHEIVNRGGRCLIPVFALGRAQELLLILDEYWSQHSELHEIPIYYASSLAKKCMAVYQ 290
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ N KIR+ N F FKHI ++ GP VV A+PGM+ SGLS +F+
Sbjct: 291 TYVNAMNDKIRRQIAINNPFVFKHISNLKGIDHFEDIGPCVVMASPGMMQSGLSRELFES 350
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W N +I+ G+CV+GT+ +LS +++ + Q + +KM+++Y+SFSAH D +
Sbjct: 351 WCTDAKNGVIIAGYCVEGTLAKTILSEPEEITTMSGQKLPLKMSVDYISFSAHTDYQQTS 410
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLD 272
+ I+ +P +V+LVHGE ++M LK +++E+ D
Sbjct: 411 EFIRTLKPPHVVLVHGEQNEMGRLKAALQREYEDD 445
>gi|432954006|ref|XP_004085503.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
3-like [Oryzias latipes]
Length = 686
Score = 226 bits (575), Expect = 7e-56, Method: Compositional matrix adjust.
Identities = 132/394 (33%), Positives = 211/394 (53%), Gaps = 29/394 (7%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGD++ DRHL AA I +PD+LITESTY T I + + RE F
Sbjct: 171 MFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPSVKPDILITESTYGTHIHEKREEREARFC 230
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH+ V+R G+ LIPVFALGRAQEL ++L+ YW+ PIY+A L K Y+
Sbjct: 231 NTVHDIVNREGRCLIPVFALGRAQELLLILDEYWQNHPELHDIPIYYASSLARKCMAVYQ 290
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
+I N KIRK N F FKHI D+ GP VV A+PGM+ SGLS +F+
Sbjct: 291 TYINAMNDKIRKAINVNNPFVFKHISNLKSMDHFDDIGPSVVMASPGMMQSGLSRELFES 350
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W + N +I+ G+CV+GT+ +++ ++ + Q + +KM+++Y+SFSAH D +
Sbjct: 351 WCTDKRNGVIIAGYCVEGTLAKHIMTEPDEITTMSGQKLPLKMSVDYISFSAHTDYQQTS 410
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF------NLDCFMPANGESCFVQTDMKIS 291
+ I+ +P +V+LVHGE ++M LK + +E+ +++ P N E+
Sbjct: 411 EFIRALKPPHVILVHGEQNEMARLKAALIREYEDNDEVHIEVHNPRNTEA---------- 460
Query: 292 IDVSVNLLKEEAVKYNSEPPNP--LKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIV 349
V++N E+ K + + +++ G+LV ++ + ++ + CK +S
Sbjct: 461 --VTLNFRGEKLAKVMGSLADQKCAQGQRVSGILVKRNFNYHILMPSDLCKYTDLS---- 514
Query: 350 RFTSNVQISDSSPINKTLELIYDQLLSYLQDKSQ 383
V S + P + L+ QL + D Q
Sbjct: 515 --VGTVTQSQAIPFTGPISLLVSQLRNLAGDVQQ 546
>gi|325186851|emb|CCA21396.1| cleavage and polyadenylation specific factor 3 puta [Albugo
laibachii Nc14]
Length = 759
Score = 226 bits (575), Expect = 7e-56, Method: Compositional matrix adjust.
Identities = 129/344 (37%), Positives = 196/344 (56%), Gaps = 21/344 (6%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
M+ +++G ++YTGDY++ DRHL AA + C PD+LI ESTY + S RE F
Sbjct: 170 MYLIEIGGVKVLYTGDYSLENDRHLMAAELPACSPDVLIVESTYGVQVHQSVVEREGRFT 229
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWE-RMNLQ-APIYFAVGLTEKATNYYK 118
+V + RGG+ LIPVFALGR QEL ++L+ +W+ +L PIYFA L KA Y+
Sbjct: 230 GQVESVIRRGGRCLIPVFALGRTQELLLILDEHWQAHPDLHDIPIYFASKLAAKALRVYQ 289
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDKSF--IDNPGPMVVFATPGMLHSGLSLIIFK 176
+I N +IRK N F F HI KS D+ GP VV A+PGML SG+S +F+
Sbjct: 290 TYINMMNDRIRKQIAVSNPFLFDHISNL-KSMDDFDDSGPCVVMASPGMLQSGVSRQLFE 348
Query: 177 KWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGI 236
+W + N ++PG+ V+GT+ K+LS ++ + +++ + +EY+SFSAHAD G
Sbjct: 349 RWCSDKRNACLIPGYVVEGTLAKKILSEPTEIVAMDGRVLPMNCTVEYISFSAHADFVGT 408
Query: 237 MQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFN------LDCFMPANGESCFVQTDMKI 290
++ P N++LVHGE ++M+ LK + ++F+ + F PAN + +I
Sbjct: 409 SGFVEKLVPPNIVLVHGEKNEMMRLKSALNKKFHDPKTYRPNIFTPANMQ--------EI 460
Query: 291 SIDVSVNLLKEEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMD 334
++ + + K SE P + + I G+LV DS I LM+
Sbjct: 461 VLEFKGEKIAKAIGKIASEQPQ--QGKVISGLLVEHDSFIHLME 502
>gi|55250298|gb|AAH85402.1| Cleavage and polyadenylation specific factor 3 [Danio rerio]
gi|182889046|gb|AAI64567.1| Cpsf3 protein [Danio rerio]
Length = 690
Score = 226 bits (575), Expect = 7e-56, Method: Compositional matrix adjust.
Identities = 147/455 (32%), Positives = 237/455 (52%), Gaps = 38/455 (8%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGD++ DRHL AA I +PD+LITESTY T I + + RE F
Sbjct: 171 MFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPSVKPDILITESTYGTHIHEKREEREARFC 230
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH+ V+R G+ LIPVFALGRAQEL ++L+ YW+ PIY+A L +K Y+
Sbjct: 231 NTVHDIVNREGRCLIPVFALGRAQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQ 290
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ N KIRK N F FKHI D+ GP VV A+PGM+ SGLS +F+
Sbjct: 291 TYVNAMNDKIRKAININNPFVFKHISNLKSMDHFDDIGPSVVMASPGMMQSGLSRELFES 350
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W + N +I+ G+CV+GT+ ++S +++ + Q + +KM+++Y+SFSAH D +
Sbjct: 351 WCTDKRNGVIIAGYCVEGTLAKHIMSEPEEITTMSGQKLPLKMSVDYISFSAHTDYQQTS 410
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF------NLDCFMPANGESCFVQTDMKIS 291
+ I+ +P +V+LVHGE ++M LK + +E+ +++ P N E+
Sbjct: 411 EFIRALKPPHVILVHGEQNEMARLKAALIREYEDNDEVHIEVHNPRNTEA---------- 460
Query: 292 IDVSVNLLKEEAVKYNSEPPNP--LKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIV 349
V++N E+ K + + +++ G+LV K+ S ++ + +S +
Sbjct: 461 --VTLNFRGEKLAKVMGSLADKKCSQGQRVSGILVKKNFSYHILSPSD------LSNYTD 512
Query: 350 RFTSNVQISDSSPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLLSVERIDDK 409
S V+ + + P L+ QL D + I+++E S V S+ I +
Sbjct: 513 LAMSTVKQTQAIPFTGPFPLLLSQLRHLTGDVEE---IEMSEKSTV--KVFNSITVIHE- 566
Query: 410 NKRVFVTW---PNQDEEVGKIVLHVLKSMSNTQGQ 441
N V + W P D + VL+ SN + Q
Sbjct: 567 NNLVVLEWFANPLNDMYADAVTTVVLEVQSNPKAQ 601
>gi|449283675|gb|EMC90280.1| Cleavage and polyadenylation specificity factor subunit 3, partial
[Columba livia]
Length = 667
Score = 226 bits (575), Expect = 7e-56, Method: Compositional matrix adjust.
Identities = 136/399 (34%), Positives = 218/399 (54%), Gaps = 30/399 (7%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGD++ DRHL AA I +PD+LI ESTY T I + + RE F
Sbjct: 147 MFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPNIKPDILIIESTYGTHIHEKREEREARFC 206
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH+ V+RGG+ LIPVFALGRAQEL ++L+ YW+ PIY+A L +K + Y+
Sbjct: 207 NTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQNHPELHDIPIYYASSLAKKCMSVYQ 266
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ N KIRK N F FKHI D+ GP +V A+PGM+ SGLS +F+
Sbjct: 267 TYVNAMNDKIRKQININNPFVFKHISNLKSMDHFDDIGPSIVMASPGMMQSGLSRELFES 326
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W + N +I+ G+CV+GT+ V+S +++ + Q + +KM+++Y+SFSAH D +
Sbjct: 327 WCTDKRNGVIIAGYCVEGTLAKHVMSEPEEITTMSGQKLPMKMSVDYISFSAHTDYQQTS 386
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF------NLDCFMPANGESCFVQTDMKIS 291
+ I+ +P +V+LVHGE ++M LK + +E+ +++ P N E+
Sbjct: 387 EFIRALKPPHVILVHGEQNEMARLKAALIREYEDNDEVHIEVHNPRNTEA---------- 436
Query: 292 IDVSVNLLKEEAVKYNSEPPN--PLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIV 349
V++N E+ K + P + ++I G+LV ++ + ++ C +S +
Sbjct: 437 --VTLNFRGEKLAKVMGSLADKKPEQGQRISGILVKRNFNYHIL---SPCD---LSNYTD 488
Query: 350 RFTSNVQISDSSPINKTLELIYDQLLSYLQDKSQEYKIQ 388
S V + + P L+Y QL D +E +IQ
Sbjct: 489 LAMSTVTQTQAIPYTGPFNLLYYQLQKLTGD-VEEIEIQ 526
>gi|307199387|gb|EFN80012.1| Cleavage and polyadenylation specificity factor subunit 3
[Harpegnathos saltator]
Length = 685
Score = 226 bits (575), Expect = 7e-56, Method: Compositional matrix adjust.
Identities = 133/372 (35%), Positives = 210/372 (56%), Gaps = 11/372 (2%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ I+YTGD++ DRHL AA I PD+LITESTY T I + + RE F
Sbjct: 170 MFMIEIAGVKILYTGDFSRQEDRHLMAAEIPNIHPDVLITESTYGTHIHEKREDREGRFT 229
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VHE V+RGG+ LIPVFALGRAQEL ++L+ YW + + + PIY+A L +K Y+
Sbjct: 230 NLVHEIVNRGGRCLIPVFALGRAQELLLILDEYWGQHSELHEIPIYYASSLAKKCMAVYQ 289
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ N KIR+ N F FKHI ++ GP VV A+PGM+ SGLS +F+
Sbjct: 290 TYVNAMNDKIRRQIAINNPFVFKHISNLKGIDHFEDIGPCVVMASPGMMQSGLSRELFES 349
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W N +I+ G+CV+GT+ +LS +++ + Q + +KM+++Y+SFSAH D +
Sbjct: 350 WCTDAKNGVIIAGYCVEGTLAKGILSEPEEITTMSGQKLPLKMSVDYISFSAHTDYQQTS 409
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLD--CFMPANGESCFVQTDMKISIDVS 295
+ I+ +P +V+LVHGE ++M LK +++E+ D M + V ++ + +
Sbjct: 410 EFIRILKPPHVVLVHGEQNEMGRLKAALQREYEDDPNTTMEIHNPRNTVAVELYFRGEKT 469
Query: 296 VNLLKEEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNV 355
++ A+ E P P +++ GVLV ++ + ++ + K +S V +V
Sbjct: 470 AKVMGTLAM----ETPKP--GQKLSGVLVKRNFNYHMLAPCDLSKYTDMSMSQVIQRQSV 523
Query: 356 QISDSSPINKTL 367
S S P+ K L
Sbjct: 524 YFSASLPVLKHL 535
>gi|51467896|ref|NP_001003836.1| cleavage and polyadenylation specificity factor subunit 3 [Danio
rerio]
gi|49619053|gb|AAT68111.1| cleavage and polyadenylation specificity factor 3 [Danio rerio]
Length = 690
Score = 226 bits (575), Expect = 7e-56, Method: Compositional matrix adjust.
Identities = 147/455 (32%), Positives = 237/455 (52%), Gaps = 38/455 (8%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGD++ DRHL AA I +PD+LITESTY T I + + RE F
Sbjct: 171 MFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPSVKPDILITESTYGTHIHEKREEREARFC 230
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH+ V+R G+ LIPVFALGRAQEL ++L+ YW+ PIY+A L +K Y+
Sbjct: 231 NTVHDIVNREGRCLIPVFALGRAQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQ 290
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ N KIRK N F FKHI D+ GP VV A+PGM+ SGLS +F+
Sbjct: 291 TYVNAMNDKIRKAININNPFVFKHISNLKSMDHFDDIGPSVVMASPGMMQSGLSRELFES 350
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W + N +I+ G+CV+GT+ ++S +++ + Q + +KM+++Y+SFSAH D +
Sbjct: 351 WCTDKRNGVIIAGYCVEGTLAKHIMSEPEEITTMSGQKLPLKMSVDYISFSAHTDYQQTS 410
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF------NLDCFMPANGESCFVQTDMKIS 291
+ I+ +P +V+LVHGE ++M LK + +E+ +++ P N E+
Sbjct: 411 EFIRALKPPHVILVHGEQNEMARLKAALIREYEDNDEVHIEVHNPRNTEA---------- 460
Query: 292 IDVSVNLLKEEAVKYNSEPPNP--LKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIV 349
V++N E+ K + + +++ G+LV K+ S ++ + +S +
Sbjct: 461 --VTLNFRGEKLAKVMGSLADKKCSQGQRVSGILVKKNFSYHILSPSD------LSNYTD 512
Query: 350 RFTSNVQISDSSPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLLSVERIDDK 409
S V+ + + P L+ QL D + I+++E S V S+ I +
Sbjct: 513 LAMSTVKQTQAIPFTGPFPLLLSQLRHLTGDVEE---IEMSEKSTV--KVFNSITVIHE- 566
Query: 410 NKRVFVTW---PNQDEEVGKIVLHVLKSMSNTQGQ 441
N V + W P D + VL+ SN + Q
Sbjct: 567 NNLVVLEWFANPLNDMYADAVTTVVLEVQSNPKAQ 601
>gi|198421242|ref|XP_002128016.1| PREDICTED: similar to Cleavage and polyadenylation specificity
factor subunit 3 (Cleavage and polyadenylation
specificity factor 73 kDa subunit) (CPSF 73 kDa subunit)
(mRNA 3-end-processing endonuclease CPSF-73) [Ciona
intestinalis]
Length = 690
Score = 226 bits (575), Expect = 8e-56, Method: Compositional matrix adjust.
Identities = 116/275 (42%), Positives = 169/275 (61%), Gaps = 3/275 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGDY+ DRHL AA I RPD+LITE+TY T I + + RE F
Sbjct: 169 MFMIQIAGVRVLYTGDYSREEDRHLMAAEIPAVRPDVLITEATYGTHIHEPREEREARFT 228
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
V + V+RGG+ LIPVFALGRAQEL ++L+ YW PIY+A L +K Y+
Sbjct: 229 NTVQDIVNRGGRCLIPVFALGRAQELLLILDDYWANHPELHDIPIYYASSLAKKCMAVYQ 288
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
+ NQKI+K N F FKHI D+ GP VV A+PGM+ SGLS +F+
Sbjct: 289 TYSNAMNQKIQKQLNISNPFQFKHISNLKGMEHFDDVGPSVVMASPGMMQSGLSRELFES 348
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W N +I+ G+CV+GT+ +LS +++ + Q I +K++++Y+SFSAHAD K
Sbjct: 349 WCNDRRNGVIVAGYCVEGTLAKHILSEPEEVVSMSGQKIPLKLSVDYISFSAHADYKQCS 408
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLD 272
+ ++ +P +V+LVHGEA++M LK + +E+ D
Sbjct: 409 EFVRAMKPPHVVLVHGEANEMNRLKLALNREYEDD 443
>gi|383861262|ref|XP_003706105.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
3-like [Megachile rotundata]
Length = 686
Score = 226 bits (575), Expect = 8e-56, Method: Compositional matrix adjust.
Identities = 114/275 (41%), Positives = 170/275 (61%), Gaps = 3/275 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ I+YTGD++ DRHL AA I PD+LITESTY T I + + RE F
Sbjct: 171 MFMIEIAGVKILYTGDFSRQEDRHLMAAEIPNIHPDVLITESTYGTHIHEKREDREGRFT 230
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VHE V+RGG+ LIPVFALGRAQEL ++L+ YW + + PIY+A L +K Y+
Sbjct: 231 NLVHEIVNRGGRCLIPVFALGRAQELLLILDEYWSQHPELHEIPIYYASSLAKKCMAVYQ 290
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ N KIR+ N F FKHI ++ GP VV A+PGM+ SGLS +F+
Sbjct: 291 TYVNAMNDKIRRQIAINNPFVFKHISNLKGIDHFEDIGPCVVMASPGMMQSGLSRELFES 350
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W N +I+ G+CV+GT+ +LS +++ + Q + +KM+++Y+SFSAH D +
Sbjct: 351 WCTDAKNGVIIAGYCVEGTLAKTILSEPEEITTMSGQKLPLKMSVDYISFSAHTDYQQTS 410
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLD 272
+ I+ +P +V+LVHGE ++M LK +++E+ D
Sbjct: 411 EFIRILKPPHVVLVHGEQNEMGRLKAALQREYEDD 445
>gi|380012076|ref|XP_003690115.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
3-like [Apis florea]
Length = 686
Score = 225 bits (574), Expect = 8e-56, Method: Compositional matrix adjust.
Identities = 114/275 (41%), Positives = 170/275 (61%), Gaps = 3/275 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ I+YTGD++ DRHL AA I PD+LITESTY T I + + RE F
Sbjct: 171 MFMIEIAGVKILYTGDFSRQEDRHLMAAEIPNIHPDVLITESTYGTHIHEKREDREGRFT 230
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VHE V+RGG+ LIPVFALGRAQEL ++L+ YW + + PIY+A L +K Y+
Sbjct: 231 NLVHEIVNRGGRCLIPVFALGRAQELLLILDEYWSQHPELHEIPIYYASSLAKKCMAVYQ 290
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ N KIR+ N F FKHI ++ GP VV A+PGM+ SGLS +F+
Sbjct: 291 TYVNAMNDKIRRQIAINNPFVFKHISNLKGIDHFEDIGPCVVMASPGMMQSGLSRELFES 350
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W N +I+ G+CV+GT+ +LS +++ + Q + +KM+++Y+SFSAH D +
Sbjct: 351 WCTDAKNGVIIAGYCVEGTLAKTILSEPEEITTMSGQKLPLKMSVDYISFSAHTDYQQTS 410
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLD 272
+ I+ +P +V+LVHGE ++M LK +++E+ D
Sbjct: 411 EFIRSLKPPHVVLVHGEQNEMGRLKAALQREYEDD 445
>gi|269860830|ref|XP_002650133.1| cleavage and polyadenylation specificity factor, 73 kDa subunit
[Enterocytozoon bieneusi H348]
gi|220066453|gb|EED43934.1| cleavage and polyadenylation specificity factor, 73 kDa subunit
[Enterocytozoon bieneusi H348]
Length = 657
Score = 225 bits (574), Expect = 9e-56, Method: Compositional matrix adjust.
Identities = 113/293 (38%), Positives = 174/293 (59%), Gaps = 8/293 (2%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++G ++YTGD++ DRHL +A C+ D+LITESTY T + RE F
Sbjct: 162 MFLLEIGQTKLLYTGDFSREDDRHLKSAETPNCKLDILITESTYGTQCHLPRIERENRFT 221
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
K V + V+RGGK L+PVFALGRAQEL ++L+ YWE + PI++A L +K Y+
Sbjct: 222 KVVSDVVERGGKCLLPVFALGRAQELLLILDEYWEENPHLKKIPIFYASALAKKCMGIYQ 281
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPF-DKSFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ N++++K + RN FDFK++ D + + GP V+ A+PGML SG+S IF++
Sbjct: 282 TYVNMMNERMQKLNLTRNPFDFKNVENIKDAKTVRDGGPCVIMASPGMLQSGVSRDIFER 341
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W N +++ G+CV+GT+ +VL K++E Q++ + M +EY+SFSAH D
Sbjct: 342 WCSDSKNGVVIAGYCVEGTLAKEVLKEPKEIESHRGQLLKLNMTVEYISFSAHVDYTQNA 401
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKI-----KQEFNLDCFMPANGESCFVQ 285
+ I C P + VHGEA++M LK+ I K + + NGE ++
Sbjct: 402 EFIYTCMPNYLFFVHGEATEMGRLKQSIQNKNEKNNIQMAIYTLKNGEEALIK 454
>gi|429963183|gb|ELA42727.1| hypothetical protein VICG_00042 [Vittaforma corneae ATCC 50505]
Length = 642
Score = 225 bits (574), Expect = 9e-56, Method: Compositional matrix adjust.
Identities = 119/289 (41%), Positives = 176/289 (60%), Gaps = 10/289 (3%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ I+YTGD++ DRHL AA C D LITESTY + RE F
Sbjct: 159 MFLLEIEKSKILYTGDFSREEDRHLKAAESPGCCLDALITESTYGVQCHLPRYEREARFT 218
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW-ERMNLQA-PIYFAVGLTEKATNYYK 118
V V RGG+ L+PVFALGRAQEL ++LE +W E +L+ PIY+A L +K + Y+
Sbjct: 219 SIVSHVVLRGGRCLLPVFALGRAQELLLILEEHWDENPHLKGIPIYYASALAQKCMSVYQ 278
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFD--KSFIDNPGPMVVFATPGMLHSGLSLIIFK 176
+I N++I+K + +N FDF+++ +SF D GP V+ A+PGML SG S +F+
Sbjct: 279 TYINMMNERIQKASLVKNPFDFRNVESIKDIQSFKDT-GPCVMMASPGMLQSGFSRELFE 337
Query: 177 KWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGI 236
KW E N +++PG+CV+GT+ ++LS K++E ++ + M++EY+SFSAH D
Sbjct: 338 KWCSNEKNGVVIPGYCVEGTLAKEILSEPKEVESMKGGLLKLNMSVEYISFSAHVDYTQN 397
Query: 237 MQLIQYCEPKNVLLVHGEASKMVFLKEKI-----KQEFNLDCFMPANGE 280
+Q I C+PKN+ VHGEA++M+ LK + K ++ F NGE
Sbjct: 398 IQFIDECQPKNLFFVHGEANEMMRLKNSVQMRNEKNSVKMNLFALRNGE 446
>gi|410955844|ref|XP_003984560.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
3 [Felis catus]
Length = 686
Score = 225 bits (574), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 135/401 (33%), Positives = 217/401 (54%), Gaps = 32/401 (7%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGD++ DRHL AA I +PD+LI ESTY T I + + RE F
Sbjct: 164 MFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPNIKPDILIIESTYGTHIHEKREEREARFC 223
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH+ V+RGG+ LIPVFALGRAQEL ++L+ YW+ PIY+A L +K Y+
Sbjct: 224 NTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQ 283
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ N KIRK N F FKHI D+ GP VV A+PGM+ SGLS +F+
Sbjct: 284 TYVNAMNDKIRKQININNPFVFKHISNLKSMDHFDDIGPSVVMASPGMMQSGLSRELFES 343
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W + N +I+ G+CV+GT+ ++S +++ + Q + +KM+++Y+SFSAH D +
Sbjct: 344 WCTDKRNGVIIAGYCVEGTLAKHIMSEPEEITTMSGQKLPLKMSVDYISFSAHTDYQQTS 403
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF------NLDCFMPANGESCFVQTDMKIS 291
+ I+ +P +V+LVHGE ++M LK + +E+ +++ P N E+
Sbjct: 404 EFIRALKPPHVILVHGEQNEMARLKAALIREYEDNDEVHIEVHNPRNTEA---------- 453
Query: 292 IDVSVNLLKEEAVKYNSEP----PNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRH 347
V++N E+ K P + +++ G+LV ++ + ++ C +S +
Sbjct: 454 --VTLNFRGEKLAKVKGHGLLADKKPEQGQRVSGILVKRNFNYHILS---PCD---LSNY 505
Query: 348 IVRFTSNVQISDSSPINKTLELIYDQLLSYLQDKSQEYKIQ 388
S V+ + + P L+Y QL D +E +IQ
Sbjct: 506 TDLAMSTVKQTQAIPYTGPFNLLYYQLQKLTGD-VEELEIQ 545
>gi|197102904|ref|NP_001127045.1| cleavage and polyadenylation specificity factor subunit 3 [Pongo
abelii]
gi|55733623|emb|CAH93488.1| hypothetical protein [Pongo abelii]
Length = 647
Score = 225 bits (574), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 135/399 (33%), Positives = 217/399 (54%), Gaps = 30/399 (7%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGD++ DRHL AA I +PD+LI ESTY T I + + RE F
Sbjct: 127 MFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPNIKPDILIIESTYGTHIHEKREEREARFC 186
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH+ V+RGG+ LIPVFALGRAQEL ++L+ YW+ PIY+A L +K Y+
Sbjct: 187 NTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQ 246
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ N KIRK N F FKHI D+ GP VV A+PGM+ SGLS +F+
Sbjct: 247 TYVNAMNDKIRKQININNPFVFKHISNLKSMDHFDDIGPSVVMASPGMMQSGLSRELFES 306
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W + N +I+ G+CV+GT+ ++S +++ + Q + +KM+++Y+SFSAH D +
Sbjct: 307 WCTDKRNGVIIAGYCVEGTLAKHIMSEPEEITTMSGQKLPLKMSVDYISFSAHTDYQQTS 366
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF------NLDCFMPANGESCFVQTDMKIS 291
+ I+ +P +V+LVHGE ++M LK + +E+ +++ P N E+
Sbjct: 367 EFIRALKPPHVILVHGEQNEMARLKAALIREYEDNDEVHIEVHNPRNTEA---------- 416
Query: 292 IDVSVNLLKEEAVKYNS--EPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIV 349
V++N E+ K P + +++ G+LV ++ + ++ C +S +
Sbjct: 417 --VTLNFRGEKLAKVMGFLADKKPEQGQRVSGILVKRNFNYHIL---SPCD---LSNYTD 468
Query: 350 RFTSNVQISDSSPINKTLELIYDQLLSYLQDKSQEYKIQ 388
S V+ + + P L+Y QL D +E +IQ
Sbjct: 469 LAMSTVKQTQAIPYTGPFNLLYYQLQKLTGD-VEELEIQ 506
>gi|357114659|ref|XP_003559115.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
3-I-like [Brachypodium distachyon]
Length = 768
Score = 225 bits (574), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 118/289 (40%), Positives = 165/289 (57%), Gaps = 10/289 (3%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V + I+YTGDY+ DRHL AA I + PD+ I ESTY + RE+ F
Sbjct: 251 MFMVDIAGVRILYTGDYSREEDRHLKAAEIPQFSPDVCIVESTYGVQQHQPRHVREKRFT 310
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
+H V +GG+VLIP FALGRAQEL ++L+ YW + PIY+A L +K Y+
Sbjct: 311 DAIHNTVSQGGRVLIPAFALGRAQELLLILDEYWSNHPELHKIPIYYASPLAKKCMAVYQ 370
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDN---PGPMVVFATPGMLHSGLSLIIF 175
+I N++IR F Q N F FKHI P + IDN GP VV A+PG L SGLS +F
Sbjct: 371 TYINSMNERIRNQFAQSNPFHFKHIEPLNS--IDNFHDVGPSVVMASPGTLQSGLSRQLF 428
Query: 176 KKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKG 235
KW + N ++PGF ++GT+ +++ +++ N + M I Y+SFSAHAD
Sbjct: 429 DKWCTDKKNTCVIPGFVIEGTLSKTIINEPREVTLANGLTAPLHMQIFYISFSAHADFPQ 488
Query: 236 IMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF---NLDCFMPANGES 281
+ P N++LVHGEA++M LK+K+ +F N+ P N +S
Sbjct: 489 TSAFLDELRPPNIILVHGEANEMGRLKQKLITQFDGTNIKIVSPKNCQS 537
>gi|431911821|gb|ELK13965.1| Cleavage and polyadenylation specificity factor subunit 3, partial
[Pteropus alecto]
Length = 667
Score = 225 bits (573), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 135/399 (33%), Positives = 217/399 (54%), Gaps = 30/399 (7%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGD++ DRHL AA I +PD+LI ESTY T I + + RE F
Sbjct: 147 MFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPNIKPDILIIESTYGTHIHEKREEREARFC 206
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH+ V+RGG+ LIPVFALGRAQEL ++L+ YW+ PIY+A L +K Y+
Sbjct: 207 NTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQ 266
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ N KIRK N F FKHI D+ GP VV A+PGM+ SGLS +F+
Sbjct: 267 TYVNAMNDKIRKQININNPFVFKHISNLKSMDHFDDIGPSVVMASPGMMQSGLSRELFES 326
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W + N +I+ G+CV+GT+ ++S +++ + Q + +KM+++Y+SFSAH D +
Sbjct: 327 WCTDKRNGVIIAGYCVEGTLAKHIMSEPEEITTMSGQKLPLKMSVDYISFSAHTDYQQTS 386
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF------NLDCFMPANGESCFVQTDMKIS 291
+ I+ +P +V+LVHGE ++M LK + +E+ +++ P N E+
Sbjct: 387 EFIRALKPPHVILVHGEQNEMARLKAALIREYEDNDEVHIEVHNPRNTEA---------- 436
Query: 292 IDVSVNLLKEEAVKYNS--EPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIV 349
V++N E+ K P + +++ G+LV ++ + ++ C +S +
Sbjct: 437 --VTLNFRGEKLAKVMGFLADKKPEQGQRVSGILVKRNFNYHILS---PCD---LSNYTD 488
Query: 350 RFTSNVQISDSSPINKTLELIYDQLLSYLQDKSQEYKIQ 388
S V+ + + P L+Y QL D +E +IQ
Sbjct: 489 LAMSTVKQTQAIPYTGPFNLLYYQLQKLTGD-VEELEIQ 526
>gi|402890043|ref|XP_003908303.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
3 [Papio anubis]
Length = 684
Score = 225 bits (573), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 135/399 (33%), Positives = 217/399 (54%), Gaps = 30/399 (7%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGD++ DRHL AA I +PD+LI ESTY T I + + RE F
Sbjct: 164 MFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPNIKPDILIIESTYGTHIHEKREEREARFC 223
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH+ V+RGG+ LIPVFALGRAQEL ++L+ YW+ PIY+A L +K Y+
Sbjct: 224 NTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQ 283
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ N KIRK N F FKHI D+ GP VV A+PGM+ SGLS +F+
Sbjct: 284 TYVNAMNDKIRKQININNPFVFKHISNLKSMDHFDDIGPSVVMASPGMMQSGLSRELFES 343
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W + N +I+ G+CV+GT+ ++S +++ + Q + +KM+++Y+SFSAH D +
Sbjct: 344 WCTDKRNGVIIAGYCVEGTLAKHIMSEPEEITTMSGQKLPLKMSVDYISFSAHTDYQQTS 403
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF------NLDCFMPANGESCFVQTDMKIS 291
+ I+ +P +V+LVHGE ++M LK + +E+ +++ P N E+
Sbjct: 404 EFIRALKPPHVILVHGEQNEMARLKAALIREYEDNDEVHIEVHNPRNTEA---------- 453
Query: 292 IDVSVNLLKEEAVKYNS--EPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIV 349
V++N E+ K P + +++ G+LV ++ + ++ C +S +
Sbjct: 454 --VTLNFRGEKLAKVMGFLADKKPEQGQRVSGILVKRNFNYHILS---PCD---LSNYTD 505
Query: 350 RFTSNVQISDSSPINKTLELIYDQLLSYLQDKSQEYKIQ 388
S V+ + + P L+Y QL D +E +IQ
Sbjct: 506 LAMSTVKQTQAIPYTGPFNLLYYQLQKLTGD-VEELEIQ 543
>gi|281351872|gb|EFB27456.1| hypothetical protein PANDA_012399 [Ailuropoda melanoleuca]
Length = 648
Score = 225 bits (573), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 135/399 (33%), Positives = 217/399 (54%), Gaps = 30/399 (7%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGD++ DRHL AA I +PD+LI ESTY T I + + RE F
Sbjct: 128 MFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPNIKPDILIIESTYGTHIHEKREEREARFC 187
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH+ V+RGG+ LIPVFALGRAQEL ++L+ YW+ PIY+A L +K Y+
Sbjct: 188 NTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQ 247
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ N KIRK N F FKHI D+ GP VV A+PGM+ SGLS +F+
Sbjct: 248 TYVNAMNDKIRKQININNPFVFKHISNLKSMDHFDDIGPSVVMASPGMMQSGLSRELFES 307
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W + N +I+ G+CV+GT+ ++S +++ + Q + +KM+++Y+SFSAH D +
Sbjct: 308 WCTDKRNGVIIAGYCVEGTLAKHIMSEPEEITTMSGQKLPLKMSVDYISFSAHTDYQQTS 367
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF------NLDCFMPANGESCFVQTDMKIS 291
+ I+ +P +V+LVHGE ++M LK + +E+ +++ P N E+
Sbjct: 368 EFIRALKPPHVILVHGEQNEMARLKAALIREYEDNDEVHIEVHNPRNTEA---------- 417
Query: 292 IDVSVNLLKEEAVKYNS--EPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIV 349
V++N E+ K P + +++ G+LV ++ + ++ C +S +
Sbjct: 418 --VTLNFRGEKLAKVMGFLADKKPEQGQRVSGILVKRNFNYHIL---SPCD---LSNYTD 469
Query: 350 RFTSNVQISDSSPINKTLELIYDQLLSYLQDKSQEYKIQ 388
S V+ + + P L+Y QL D +E +IQ
Sbjct: 470 LAMSTVKQTQAIPYTGPFNLLYYQLQKLTGD-VEELEIQ 507
>gi|296224527|ref|XP_002758090.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
3 [Callithrix jacchus]
Length = 684
Score = 225 bits (573), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 135/399 (33%), Positives = 217/399 (54%), Gaps = 30/399 (7%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGD++ DRHL AA I +PD+LI ESTY T I + + RE F
Sbjct: 164 MFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPNIKPDILIIESTYGTHIHEKREEREARFC 223
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH+ V+RGG+ LIPVFALGRAQEL ++L+ YW+ PIY+A L +K Y+
Sbjct: 224 NTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQ 283
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ N KIRK N F FKHI D+ GP VV A+PGM+ SGLS +F+
Sbjct: 284 TYVNAMNDKIRKQININNPFVFKHISNLKSMDHFDDIGPSVVMASPGMMQSGLSRELFES 343
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W + N +I+ G+CV+GT+ ++S +++ + Q + +KM+++Y+SFSAH D +
Sbjct: 344 WCTDKRNGVIIAGYCVEGTLAKHIMSEPEEITTMSGQKLPLKMSVDYISFSAHTDYQQTS 403
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF------NLDCFMPANGESCFVQTDMKIS 291
+ I+ +P +V+LVHGE ++M LK + +E+ +++ P N E+
Sbjct: 404 EFIRALKPPHVILVHGEQNEMARLKAALIREYEDNDEVHIEVHNPRNTEA---------- 453
Query: 292 IDVSVNLLKEEAVKYNS--EPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIV 349
V++N E+ K P + +++ G+LV ++ + ++ C +S +
Sbjct: 454 --VTLNFRGEKLAKVMGFLADKKPEQGQRVSGILVKRNFNYHILS---PCD---LSNYTD 505
Query: 350 RFTSNVQISDSSPINKTLELIYDQLLSYLQDKSQEYKIQ 388
S V+ + + P L+Y QL D +E +IQ
Sbjct: 506 LAMSTVKQTQAIPYTGPFNLLYYQLQKLTGD-VEELEIQ 543
>gi|332247248|ref|XP_003272765.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
3 [Nomascus leucogenys]
gi|67969340|dbj|BAE01022.1| unnamed protein product [Macaca fascicularis]
gi|355751093|gb|EHH55348.1| hypothetical protein EGM_04543 [Macaca fascicularis]
gi|380813676|gb|AFE78712.1| cleavage and polyadenylation specificity factor subunit 3 [Macaca
mulatta]
gi|383419123|gb|AFH32775.1| cleavage and polyadenylation specificity factor subunit 3 [Macaca
mulatta]
gi|384940728|gb|AFI33969.1| cleavage and polyadenylation specificity factor subunit 3 [Macaca
mulatta]
Length = 684
Score = 225 bits (573), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 135/399 (33%), Positives = 217/399 (54%), Gaps = 30/399 (7%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGD++ DRHL AA I +PD+LI ESTY T I + + RE F
Sbjct: 164 MFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPNIKPDILIIESTYGTHIHEKREEREARFC 223
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH+ V+RGG+ LIPVFALGRAQEL ++L+ YW+ PIY+A L +K Y+
Sbjct: 224 NTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQ 283
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ N KIRK N F FKHI D+ GP VV A+PGM+ SGLS +F+
Sbjct: 284 TYVNAMNDKIRKQININNPFVFKHISNLKSMDHFDDIGPSVVMASPGMMQSGLSRELFES 343
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W + N +I+ G+CV+GT+ ++S +++ + Q + +KM+++Y+SFSAH D +
Sbjct: 344 WCTDKRNGVIIAGYCVEGTLAKHIMSEPEEITTMSGQKLPLKMSVDYISFSAHTDYQQTS 403
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF------NLDCFMPANGESCFVQTDMKIS 291
+ I+ +P +V+LVHGE ++M LK + +E+ +++ P N E+
Sbjct: 404 EFIRALKPPHVILVHGEQNEMARLKAALIREYEDNDEVHIEVHNPRNTEA---------- 453
Query: 292 IDVSVNLLKEEAVKYNS--EPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIV 349
V++N E+ K P + +++ G+LV ++ + ++ C +S +
Sbjct: 454 --VTLNFRGEKLAKVMGFLADKKPEQGQRVSGILVKRNFNYHILS---PCD---LSNYTD 505
Query: 350 RFTSNVQISDSSPINKTLELIYDQLLSYLQDKSQEYKIQ 388
S V+ + + P L+Y QL D +E +IQ
Sbjct: 506 LAMSTVKQTQAIPYTGPFNLLYYQLQKLTGD-VEELEIQ 543
>gi|355565449|gb|EHH21878.1| hypothetical protein EGK_05038 [Macaca mulatta]
Length = 650
Score = 225 bits (573), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 135/399 (33%), Positives = 217/399 (54%), Gaps = 30/399 (7%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGD++ DRHL AA I +PD+LI ESTY T I + + RE F
Sbjct: 161 MFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPNIKPDILIIESTYGTHIHEKREEREARFC 220
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH+ V+RGG+ LIPVFALGRAQEL ++L+ YW+ PIY+A L +K Y+
Sbjct: 221 NTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQ 280
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ N KIRK N F FKHI D+ GP VV A+PGM+ SGLS +F+
Sbjct: 281 TYVNAMNDKIRKQININNPFVFKHISNLKSMDHFDDIGPSVVMASPGMMQSGLSRELFES 340
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W + N +I+ G+CV+GT+ ++S +++ + Q + +KM+++Y+SFSAH D +
Sbjct: 341 WCTDKRNGVIIAGYCVEGTLAKHIMSEPEEITTMSGQKLPLKMSVDYISFSAHTDYQQTS 400
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF------NLDCFMPANGESCFVQTDMKIS 291
+ I+ +P +V+LVHGE ++M LK + +E+ +++ P N E+
Sbjct: 401 EFIRALKPPHVILVHGEQNEMARLKAALIREYEDNDEVHIEVHNPRNTEA---------- 450
Query: 292 IDVSVNLLKEEAVKYNS--EPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIV 349
V++N E+ K P + +++ G+LV ++ + ++ C +S +
Sbjct: 451 --VTLNFRGEKLAKVMGFLADKKPEQGQRVSGILVKRNFNYHILS---PCD---LSNYTD 502
Query: 350 RFTSNVQISDSSPINKTLELIYDQLLSYLQDKSQEYKIQ 388
S V+ + + P L+Y QL D +E +IQ
Sbjct: 503 LAMSTVKQTQAIPYTGPFNLLYYQLQKLTGD-VEELEIQ 540
>gi|350539083|ref|NP_001233296.1| cleavage and polyadenylation specificity factor subunit 3 [Pan
troglodytes]
gi|397513374|ref|XP_003826991.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
3 [Pan paniscus]
gi|426334660|ref|XP_004028859.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
3 [Gorilla gorilla gorilla]
gi|343961085|dbj|BAK62132.1| cleavage and polyadenylation specificity factor 73 kDa subunit [Pan
troglodytes]
gi|343961781|dbj|BAK62478.1| cleavage and polyadenylation specificity factor 73 kDa subunit [Pan
troglodytes]
gi|410254182|gb|JAA15058.1| cleavage and polyadenylation specific factor 3, 73kDa [Pan
troglodytes]
gi|410291448|gb|JAA24324.1| cleavage and polyadenylation specific factor 3, 73kDa [Pan
troglodytes]
gi|410339611|gb|JAA38752.1| cleavage and polyadenylation specific factor 3, 73kDa [Pan
troglodytes]
Length = 684
Score = 225 bits (573), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 135/399 (33%), Positives = 217/399 (54%), Gaps = 30/399 (7%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGD++ DRHL AA I +PD+LI ESTY T I + + RE F
Sbjct: 164 MFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPNIKPDILIIESTYGTHIHEKREEREARFC 223
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH+ V+RGG+ LIPVFALGRAQEL ++L+ YW+ PIY+A L +K Y+
Sbjct: 224 NTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQ 283
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ N KIRK N F FKHI D+ GP VV A+PGM+ SGLS +F+
Sbjct: 284 TYVNAMNDKIRKQININNPFVFKHISNLKSMDHFDDIGPSVVMASPGMMQSGLSRELFES 343
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W + N +I+ G+CV+GT+ ++S +++ + Q + +KM+++Y+SFSAH D +
Sbjct: 344 WCTDKRNGVIIAGYCVEGTLAKHIMSEPEEITTMSGQKLPLKMSVDYISFSAHTDYQQTS 403
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF------NLDCFMPANGESCFVQTDMKIS 291
+ I+ +P +V+LVHGE ++M LK + +E+ +++ P N E+
Sbjct: 404 EFIRALKPPHVILVHGEQNEMARLKAALIREYEDNDEVHIEVHNPRNTEA---------- 453
Query: 292 IDVSVNLLKEEAVKYNS--EPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIV 349
V++N E+ K P + +++ G+LV ++ + ++ C +S +
Sbjct: 454 --VTLNFRGEKLAKVMGFLADKKPEQGQRVSGILVKRNFNYHILS---PCD---LSNYTD 505
Query: 350 RFTSNVQISDSSPINKTLELIYDQLLSYLQDKSQEYKIQ 388
S V+ + + P L+Y QL D +E +IQ
Sbjct: 506 LAMSTVKQTQAIPYTGPFNLLYYQLQKLTGD-VEELEIQ 543
>gi|194220982|ref|XP_001502516.2| PREDICTED: cleavage and polyadenylation specificity factor subunit
3-like [Equus caballus]
gi|301775721|ref|XP_002923277.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
3-like [Ailuropoda melanoleuca]
Length = 684
Score = 225 bits (573), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 135/399 (33%), Positives = 217/399 (54%), Gaps = 30/399 (7%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGD++ DRHL AA I +PD+LI ESTY T I + + RE F
Sbjct: 164 MFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPNIKPDILIIESTYGTHIHEKREEREARFC 223
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH+ V+RGG+ LIPVFALGRAQEL ++L+ YW+ PIY+A L +K Y+
Sbjct: 224 NTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQ 283
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ N KIRK N F FKHI D+ GP VV A+PGM+ SGLS +F+
Sbjct: 284 TYVNAMNDKIRKQININNPFVFKHISNLKSMDHFDDIGPSVVMASPGMMQSGLSRELFES 343
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W + N +I+ G+CV+GT+ ++S +++ + Q + +KM+++Y+SFSAH D +
Sbjct: 344 WCTDKRNGVIIAGYCVEGTLAKHIMSEPEEITTMSGQKLPLKMSVDYISFSAHTDYQQTS 403
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF------NLDCFMPANGESCFVQTDMKIS 291
+ I+ +P +V+LVHGE ++M LK + +E+ +++ P N E+
Sbjct: 404 EFIRALKPPHVILVHGEQNEMARLKAALIREYEDNDEVHIEVHNPRNTEA---------- 453
Query: 292 IDVSVNLLKEEAVKYNS--EPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIV 349
V++N E+ K P + +++ G+LV ++ + ++ C +S +
Sbjct: 454 --VTLNFRGEKLAKVMGFLADKKPEQGQRVSGILVKRNFNYHILS---PCD---LSNYTD 505
Query: 350 RFTSNVQISDSSPINKTLELIYDQLLSYLQDKSQEYKIQ 388
S V+ + + P L+Y QL D +E +IQ
Sbjct: 506 LAMSTVKQTQAIPYTGPFNLLYYQLQKLTGD-VEELEIQ 543
>gi|67969643|dbj|BAE01170.1| unnamed protein product [Macaca fascicularis]
Length = 684
Score = 225 bits (573), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 135/399 (33%), Positives = 217/399 (54%), Gaps = 30/399 (7%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGD++ DRHL AA I +PD+LI ESTY T I + + RE F
Sbjct: 164 MFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPNIKPDILIIESTYGTHIHEKREEREARFC 223
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH+ V+RGG+ LIPVFALGRAQEL ++L+ YW+ PIY+A L +K Y+
Sbjct: 224 NTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQ 283
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ N KIRK N F FKHI D+ GP VV A+PGM+ SGLS +F+
Sbjct: 284 TYVNAMNDKIRKQININNPFVFKHISNLKSMDHFDDIGPSVVMASPGMMQSGLSRELFES 343
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W + N +I+ G+CV+GT+ ++S +++ + Q + +KM+++Y+SFSAH D +
Sbjct: 344 WCTDKRNGVIIAGYCVEGTLAKHIMSEPEEITTMSGQKLPLKMSVDYISFSAHTDYQQTS 403
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF------NLDCFMPANGESCFVQTDMKIS 291
+ I+ +P +V+LVHGE ++M LK + +E+ +++ P N E+
Sbjct: 404 EFIRALKPPHVILVHGEQNEMARLKAALIREYEDNDEVHIEVHNPRNTEA---------- 453
Query: 292 IDVSVNLLKEEAVKYNS--EPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIV 349
V++N E+ K P + +++ G+LV ++ + ++ C +S +
Sbjct: 454 --VTLNFRGEKLAKVMGFLADKKPEQGQRVSGILVKRNFNYHILS---PCD---LSNYTD 505
Query: 350 RFTSNVQISDSSPINKTLELIYDQLLSYLQDKSQEYKIQ 388
S V+ + + P L+Y QL D +E +IQ
Sbjct: 506 LAMSTVKQTQAIPYTGPFNLLYYQLQKLTGD-VEELEIQ 543
>gi|27805863|ref|NP_776709.1| cleavage and polyadenylation specificity factor subunit 3 [Bos
taurus]
gi|426223116|ref|XP_004005724.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
3 [Ovis aries]
gi|18202362|sp|P79101.1|CPSF3_BOVIN RecName: Full=Cleavage and polyadenylation specificity factor
subunit 3; AltName: Full=Cleavage and polyadenylation
specificity factor 73 kDa subunit; Short=CPSF 73 kDa
subunit; AltName: Full=mRNA 3'-end-processing
endonuclease CPSF-73
gi|1707412|emb|CAA65151.1| Cleavage and Polyadenylation Specifity Factor protein [Bos taurus]
gi|75773721|gb|AAI04554.1| Cleavage and polyadenylation specific factor 3, 73kDa [Bos taurus]
gi|296482248|tpg|DAA24363.1| TPA: cleavage and polyadenylation specificity factor subunit 3 [Bos
taurus]
gi|440897562|gb|ELR49218.1| Cleavage and polyadenylation specificity factor subunit 3 [Bos
grunniens mutus]
Length = 684
Score = 224 bits (572), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 135/399 (33%), Positives = 217/399 (54%), Gaps = 30/399 (7%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGD++ DRHL AA I +PD+LI ESTY T I + + RE F
Sbjct: 164 MFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPNIKPDILIIESTYGTHIHEKREEREARFC 223
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH+ V+RGG+ LIPVFALGRAQEL ++L+ YW+ PIY+A L +K Y+
Sbjct: 224 NTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQ 283
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ N KIRK N F FKHI D+ GP VV A+PGM+ SGLS +F+
Sbjct: 284 TYVNAMNDKIRKQININNPFVFKHISNLKSMDHFDDIGPSVVMASPGMMQSGLSRELFES 343
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W + N +I+ G+CV+GT+ ++S +++ + Q + +KM+++Y+SFSAH D +
Sbjct: 344 WCTDKRNGVIIAGYCVEGTLAKHIMSEPEEITTMSGQKLPLKMSVDYISFSAHTDYQQTS 403
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF------NLDCFMPANGESCFVQTDMKIS 291
+ I+ +P +V+LVHGE ++M LK + +E+ +++ P N E+
Sbjct: 404 EFIRALKPPHVILVHGEQNEMARLKAALIREYEDNDEVHIEVHNPRNTEA---------- 453
Query: 292 IDVSVNLLKEEAVKYNS--EPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIV 349
V++N E+ K P + +++ G+LV ++ + ++ C +S +
Sbjct: 454 --VTLNFRGEKLAKVMGFLADKKPEQGQRVSGILVKRNFNYHILS---PCD---LSNYTD 505
Query: 350 RFTSNVQISDSSPINKTLELIYDQLLSYLQDKSQEYKIQ 388
S V+ + + P L+Y QL D +E +IQ
Sbjct: 506 LAMSTVKQTQAIPYTGPFNLLYYQLQKLTGD-VEELEIQ 543
>gi|291412514|ref|XP_002722528.1| PREDICTED: cleavage and polyadenylation specific factor 3, 73kDa
[Oryctolagus cuniculus]
Length = 684
Score = 224 bits (572), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 135/399 (33%), Positives = 217/399 (54%), Gaps = 30/399 (7%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGD++ DRHL AA I +PD+LI ESTY T I + + RE F
Sbjct: 164 MFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPNIKPDILIIESTYGTHIHEKREEREARFC 223
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH+ V+RGG+ LIPVFALGRAQEL ++L+ YW+ PIY+A L +K Y+
Sbjct: 224 NTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQ 283
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ N KIRK N F FKHI D+ GP VV A+PGM+ SGLS +F+
Sbjct: 284 TYVNAMNDKIRKQININNPFVFKHISNLKSMDHFDDIGPSVVMASPGMMQSGLSRELFES 343
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W + N +I+ G+CV+GT+ ++S +++ + Q + +KM+++Y+SFSAH D +
Sbjct: 344 WCTDKRNGVIIAGYCVEGTLAKHIMSEPEEITTMSGQKLPLKMSVDYISFSAHTDYQQTS 403
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF------NLDCFMPANGESCFVQTDMKIS 291
+ I+ +P +V+LVHGE ++M LK + +E+ +++ P N E+
Sbjct: 404 EFIRALKPPHVILVHGEQNEMARLKAALIREYEDNDEVHIEVHNPRNTEA---------- 453
Query: 292 IDVSVNLLKEEAVKYNS--EPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIV 349
V++N E+ K P + +++ G+LV ++ + ++ C +S +
Sbjct: 454 --VTLNFRGEKLAKVMGFLADKKPEQGQRVSGILVKRNFNYHILS---PCD---LSNYTD 505
Query: 350 RFTSNVQISDSSPINKTLELIYDQLLSYLQDKSQEYKIQ 388
S V+ + + P L+Y QL D +E +IQ
Sbjct: 506 LAMSTVKQTQAIPYTGPFNLLYYQLQKLTGD-VEEIEIQ 543
>gi|359321645|ref|XP_003639652.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
3-like [Canis lupus familiaris]
Length = 717
Score = 224 bits (572), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 135/399 (33%), Positives = 217/399 (54%), Gaps = 30/399 (7%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGD++ DRHL AA I +PD+LI ESTY T I + + RE F
Sbjct: 197 MFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPNIKPDILIIESTYGTHIHEKREEREARFC 256
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH+ V+RGG+ LIPVFALGRAQEL ++L+ YW+ PIY+A L +K Y+
Sbjct: 257 NTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQ 316
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ N KIRK N F FKHI D+ GP VV A+PGM+ SGLS +F+
Sbjct: 317 TYVNAMNDKIRKQININNPFVFKHISNLKSMDHFDDIGPSVVMASPGMMQSGLSRELFES 376
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W + N +I+ G+CV+GT+ ++S +++ + Q + +KM+++Y+SFSAH D +
Sbjct: 377 WCTDKRNGVIIAGYCVEGTLAKHIMSEPEEITTMSGQKLPLKMSVDYISFSAHTDYQQTS 436
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF------NLDCFMPANGESCFVQTDMKIS 291
+ I+ +P +V+LVHGE ++M LK + +E+ +++ P N E+
Sbjct: 437 EFIRALKPPHVILVHGEQNEMARLKAALIREYEDNDEVHIEVHNPRNTEA---------- 486
Query: 292 IDVSVNLLKEEAVKYNS--EPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIV 349
V++N E+ K P + +++ G+LV ++ + ++ C +S +
Sbjct: 487 --VTLNFRGEKLAKVMGFLADKKPEQGQRVSGILVKRNFNYHILS---PCD---LSNYTD 538
Query: 350 RFTSNVQISDSSPINKTLELIYDQLLSYLQDKSQEYKIQ 388
S V+ + + P L+Y QL D +E +IQ
Sbjct: 539 LAMSTVKQTQAIPYTGPFNLLYYQLQKLTGD-VEELEIQ 576
>gi|357117889|ref|XP_003560694.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
3-I-like [Brachypodium distachyon]
Length = 690
Score = 224 bits (572), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 119/289 (41%), Positives = 166/289 (57%), Gaps = 10/289 (3%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V + I+YTGDY+ DRHL AA I + PD+ I ESTY + RE+ F
Sbjct: 173 MFMVDIAGVRILYTGDYSREEDRHLKAAEIPQFSPDVCIVESTYGVQQHQPRHVREKRFT 232
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERM-NLQA-PIYFAVGLTEKATNYYK 118
+H V +GG+VLIP FALGRAQEL ++L+ YW LQ PIY+A L +K Y+
Sbjct: 233 DAIHNTVSQGGRVLIPAFALGRAQELLLILDEYWSNHPELQKIPIYYASPLAKKCMAVYQ 292
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDN---PGPMVVFATPGMLHSGLSLIIF 175
+I N++IR F Q N F FKHI P + IDN GP VV A+PG L SGLS +F
Sbjct: 293 TYINSMNERIRNQFAQSNPFHFKHIEPLNS--IDNFHDVGPSVVMASPGSLQSGLSRQLF 350
Query: 176 KKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKG 235
KW + N ++PG+ ++GT+ +++ +++ N + M I Y+SFSAHAD
Sbjct: 351 DKWCTDKKNTCVIPGYVIEGTLAKTIINEPREVTLANGLTAPLHMQIFYISFSAHADFPQ 410
Query: 236 IMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF---NLDCFMPANGES 281
+ P N++LVHGEA++M LK+K+ +F N+ P N +S
Sbjct: 411 TSGFLDELRPPNIILVHGEANEMGRLKQKLITQFDGTNIKIVSPKNCQS 459
>gi|346466613|gb|AEO33151.1| hypothetical protein [Amblyomma maculatum]
Length = 618
Score = 224 bits (572), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 125/342 (36%), Positives = 195/342 (57%), Gaps = 19/342 (5%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGD++ DRHL AA I PD+LI ESTY T I + + RE F
Sbjct: 101 MFMIEIAGVKVLYTGDFSRQEDRHLMAAEIPNIHPDVLIIESTYGTHIHEKREEREARFT 160
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH+ V+RGG+ LIPVFALGRAQEL ++L+ YW PIY+A L +K Y+
Sbjct: 161 GLVHDIVNRGGRCLIPVFALGRAQELLLILDEYWSNHPELHDIPIYYASSLAKKCMAVYQ 220
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ N++IR+ N F FKHI ++ GP VV A+PGM+ SGLS +F+
Sbjct: 221 TYVNAMNERIRRQITINNPFVFKHISNLKSIEHFEDIGPCVVMASPGMMQSGLSRELFES 280
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W N +I+ G+CV+GT+ +LS +++ Q + +KM+++Y+SFSAH D +
Sbjct: 281 WCTDPKNGVIIAGYCVEGTLAKTILSEPEEISTMVGQKLPLKMSVDYISFSAHTDYQQTS 340
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF------NLDCFMPANGESCFVQTDMKIS 291
+ I+ +P +++LVHGE ++M LK I +E+ ++ P N ++ ++
Sbjct: 341 EFIRTLKPPHIVLVHGEQNEMGRLKAAIVREYEDDLETRIEVHNPRNTQA----VELHFR 396
Query: 292 IDVSVNLLKEEAVKYNSEPPNPLKERQIHGVLVIKDSSISLM 333
+ + ++ AV EPP P RQ+ GVLV ++ S L+
Sbjct: 397 GEKTTKVMGSLAV----EPPEP--GRQLSGVLVKRNFSYHLL 432
>gi|327261273|ref|XP_003215455.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
3-like [Anolis carolinensis]
Length = 651
Score = 224 bits (572), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 114/290 (39%), Positives = 176/290 (60%), Gaps = 9/290 (3%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGD++ DRHL AA I +PD+LI ESTY T I + + RE F
Sbjct: 130 MFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPNIKPDILIIESTYGTHIHEKREEREARFC 189
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH+ V+RGG+ LIPVFALGRAQEL ++L+ YW+ + PIY+A L +K Y+
Sbjct: 190 NTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQNHPELHEIPIYYASSLAKKCMAVYQ 249
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ N KIRK N F FKHI D+ GP VV A+PGM+ SGLS +F+
Sbjct: 250 TYVNAMNDKIRKQININNPFVFKHISNLKSMDHFDDIGPSVVMASPGMMQSGLSRELFES 309
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W + N +I+ G+CV+GT+ ++S +++ + Q + +KM+++Y+SFSAH D +
Sbjct: 310 WCTDKRNGVIIAGYCVEGTLAKHIMSEPEEITTMSGQKLPLKMSVDYISFSAHTDYQQTS 369
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF------NLDCFMPANGES 281
+ I+ +P +V+LVHGE ++M LK + +E+ +++ P N E+
Sbjct: 370 EFIRALKPPHVILVHGEQNEMARLKAALIREYEDNDEVHIEVHNPRNTEA 419
>gi|55741994|ref|NP_001006770.1| cleavage and polyadenylation specificity factor 3 [Xenopus
(Silurana) tropicalis]
gi|49522504|gb|AAH75564.1| cleavage and polyadenylation specific factor 3, 73kDa [Xenopus
(Silurana) tropicalis]
Length = 692
Score = 224 bits (571), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 110/272 (40%), Positives = 168/272 (61%), Gaps = 3/272 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGD++ DRHL AA I +PD+LI ESTY T I + + RE F
Sbjct: 171 MFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPNVKPDILIIESTYGTHIHEKREEREARFC 230
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH+ V+RGG+ LIPVFALGRAQEL ++L+ YW+ PIY+A L +K Y+
Sbjct: 231 NTVHDIVNRGGRALIPVFALGRAQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQ 290
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ N KIRK N F FKHI D+ GP VV A+PGM+ SGLS +F+
Sbjct: 291 TYVNAMNDKIRKQININNPFVFKHISNLKSMDHFDDIGPSVVMASPGMMQSGLSRELFES 350
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W + N +I+ G+CV+GT+ ++S +++ + Q + +KM+++Y+SFSAH D +
Sbjct: 351 WCTDKRNGVIIAGYCVEGTLAKHIMSEPEEIATMSGQKLPLKMSVDYISFSAHTDYQQTS 410
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF 269
+ ++ +P +V+LVHGE ++M LK + +E+
Sbjct: 411 EFVRALKPPHVILVHGEQNEMARLKAALIREY 442
>gi|89267474|emb|CAJ83498.1| cleavage and polyadenylation specific factor 3 [Xenopus (Silurana)
tropicalis]
Length = 692
Score = 224 bits (571), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 110/272 (40%), Positives = 168/272 (61%), Gaps = 3/272 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGD++ DRHL AA I +PD+LI ESTY T I + + RE F
Sbjct: 171 MFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPNVKPDILIIESTYGTHIHEKREEREARFC 230
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH+ V+RGG+ LIPVFALGRAQEL ++L+ YW+ PIY+A L +K Y+
Sbjct: 231 NTVHDIVNRGGRALIPVFALGRAQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQ 290
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ N KIRK N F FKHI D+ GP VV A+PGM+ SGLS +F+
Sbjct: 291 TYVNAMNDKIRKQININNPFVFKHISNLKSMDHFDDIGPSVVMASPGMMQSGLSRELFES 350
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W + N +I+ G+CV+GT+ ++S +++ + Q + +KM+++Y+SFSAH D +
Sbjct: 351 WCTDKRNGVIIAGYCVEGTLAKHIMSEPEEIATMSGQKLPLKMSVDYISFSAHTDYQQTS 410
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF 269
+ ++ +P +V+LVHGE ++M LK + +E+
Sbjct: 411 EFVRALKPPHVILVHGEQNEMARLKAALIREY 442
>gi|149641381|ref|XP_001505542.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
3-like, partial [Ornithorhynchus anatinus]
Length = 595
Score = 224 bits (570), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 111/272 (40%), Positives = 168/272 (61%), Gaps = 3/272 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGD++ DRHL AA I +PD+LI ESTY T I + + RE F
Sbjct: 75 MFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPNIKPDILIIESTYGTHIHEKREEREARFC 134
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH+ V+RGG+ LIPVFALGRAQEL ++L+ YW+ PIY+A L +K Y+
Sbjct: 135 NTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQ 194
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ N KIRK N F FKHI D+ GP VV A+PGM+ SGLS +F+
Sbjct: 195 TYVNAMNDKIRKQININNPFVFKHISNLKSMDHFDDIGPSVVMASPGMMQSGLSRELFES 254
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W + N +I+ G+CV+GT+ ++S +++ + Q + +KM+++Y+SFSAH D +
Sbjct: 255 WCTDKRNGVIIAGYCVEGTLAKHIMSEPEEITTMSGQKLPLKMSVDYISFSAHTDYQQTS 314
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF 269
+ I+ +P +V+LVHGE ++M LK + +E+
Sbjct: 315 EFIRALKPPHVILVHGEQNEMARLKAALIREY 346
>gi|344257704|gb|EGW13808.1| Cleavage and polyadenylation specificity factor subunit 3
[Cricetulus griseus]
Length = 647
Score = 224 bits (570), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 134/399 (33%), Positives = 217/399 (54%), Gaps = 30/399 (7%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGD++ DRHL AA I +PD+LI ESTY T I + + RE F
Sbjct: 127 MFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPNIKPDILIIESTYGTHIHEKREEREARFC 186
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH+ V+RGG+ LIPVFALGRAQEL ++L+ YW+ PIY+A L +K Y+
Sbjct: 187 NTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQ 246
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ N KIRK N F FKHI D+ GP VV A+PGM+ +GLS +F+
Sbjct: 247 TYVNAMNDKIRKQININNPFVFKHISNLKSMDHFDDIGPSVVMASPGMIQNGLSRELFES 306
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W + N +I+ G+CV+GT+ ++S +++ + Q + +KM+++Y+SFSAH D +
Sbjct: 307 WCTDKRNGVIIAGYCVEGTLAKHIMSEPEEITTMSGQKLPLKMSVDYISFSAHTDYQQTS 366
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF------NLDCFMPANGESCFVQTDMKIS 291
+ I+ +P +V+LVHGE ++M LK + +E+ +++ P N E+
Sbjct: 367 EFIRALKPPHVILVHGEQNEMARLKAALIREYEDNDEVHIEVHNPRNTEA---------- 416
Query: 292 IDVSVNLLKEEAVKYNS--EPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIV 349
V++N E+ K P + +++ G+LV ++ + ++ C +S +
Sbjct: 417 --VTLNFRGEKLAKVMGFLADKKPEQGQRVSGILVKRNFNYHIL---SPCD---LSNYTD 468
Query: 350 RFTSNVQISDSSPINKTLELIYDQLLSYLQDKSQEYKIQ 388
S V+ + + P L+Y QL D +E +IQ
Sbjct: 469 LAMSTVKQTQAIPYTGPFNLLYYQLQKLTGD-VEELEIQ 506
>gi|321461562|gb|EFX72593.1| hypothetical protein DAPPUDRAFT_308207 [Daphnia pulex]
Length = 689
Score = 224 bits (570), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 115/277 (41%), Positives = 170/277 (61%), Gaps = 7/277 (2%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGD++ DRHL AA I RPD+LITESTY T I + + RE F
Sbjct: 170 MFMIEIAGVKVLYTGDFSRQEDRHLMAAEIPTVRPDILITESTYGTHIHEKREDRESRFT 229
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
+HE V+RGG+ LIPVFALGRAQEL ++L+ YW + PIY+A L +K Y+
Sbjct: 230 GLIHEIVNRGGRCLIPVFALGRAQELLLILDEYWSLHPELHEIPIYYASSLAQKCMAVYQ 289
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHI---RPFDKSFIDNPGPMVVFATPGMLHSGLSLIIF 175
+I N KIR+ N F FKHI + D+ ++ GP V+ A+PGM+ SGLS +F
Sbjct: 290 TYINAMNDKIRRQIAINNPFIFKHISSLKGIDQ--FEDVGPCVIMASPGMMQSGLSRELF 347
Query: 176 KKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKG 235
+ W N I+ G+CV+GT+ VLS + + Q + +K++++Y+SFSAH D +
Sbjct: 348 EAWCTDPKNGCIIAGYCVEGTLAKHVLSEPEDITSMAGQKLPMKLSVDYISFSAHTDYQQ 407
Query: 236 IMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLD 272
+ I+ +P +V+LVHGEA++M LK + +E+ D
Sbjct: 408 TSEFIRLLKPPHVVLVHGEANEMNRLKAALIREYEHD 444
>gi|344280152|ref|XP_003411849.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
3-like [Loxodonta africana]
Length = 903
Score = 224 bits (570), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 135/399 (33%), Positives = 217/399 (54%), Gaps = 30/399 (7%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGD++ DRHL AA I +PD+LI ESTY T I + + RE F
Sbjct: 383 MFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPNIKPDILIIESTYGTHIHEKREEREARFC 442
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH+ V+RGG+ LIPVFALGRAQEL ++L+ YW+ PIY+A L +K Y+
Sbjct: 443 NTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQ 502
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ N KIRK N F FKHI D+ GP VV A+PGM+ SGLS +F+
Sbjct: 503 TYVNAMNDKIRKQININNPFVFKHISNLKSMDHFDDIGPSVVMASPGMMQSGLSRELFES 562
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W + N +I+ G+CV+GT+ ++S +++ + Q + +KM+++Y+SFSAH D +
Sbjct: 563 WCTDKRNGVIIAGYCVEGTLAKHIMSEPEEITTMSGQKLPLKMSVDYISFSAHTDYQQTS 622
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF------NLDCFMPANGESCFVQTDMKIS 291
+ I+ +P +V+LVHGE ++M LK + +E+ +++ P N E+
Sbjct: 623 EFIRALKPPHVILVHGEQNEMARLKAALIREYEDNDEVHIEVHNPRNTEA---------- 672
Query: 292 IDVSVNLLKEEAVKYNS--EPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIV 349
V++N E+ K P + +++ G+LV ++ + ++ C +S +
Sbjct: 673 --VTLNFRGEKLAKVMGFLADKKPEQGQRVSGILVKRNFNYHIL---SPCD---LSNYTD 724
Query: 350 RFTSNVQISDSSPINKTLELIYDQLLSYLQDKSQEYKIQ 388
S V+ + + P L+Y QL D +E +IQ
Sbjct: 725 LAMSTVKQTQAIPYTGPFNLLYYQLQKLTGD-VEELEIQ 762
>gi|363732494|ref|XP_419942.3| PREDICTED: cleavage and polyadenylation specificity factor subunit
3 [Gallus gallus]
Length = 672
Score = 224 bits (570), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 123/338 (36%), Positives = 194/338 (57%), Gaps = 23/338 (6%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGD++ DRHL AA I +PD+LI ESTY T I + + RE F
Sbjct: 151 MFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPNIKPDILIIESTYGTHIHEKREEREARFC 210
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH+ V+RGG+ LIPVFALGRAQEL ++L+ YW+ PIY+A L +K Y+
Sbjct: 211 NTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQ 270
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ N KIRK N F FKHI D+ GP VV A+PGM+ SGLS +F+
Sbjct: 271 TYVNAMNDKIRKQININNPFVFKHISNLKSMDHFDDIGPSVVMASPGMMQSGLSRELFES 330
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W + N +I+ G+CV+GT+ ++S +++ + Q + +KM+++Y+SFSAH D +
Sbjct: 331 WCTDKRNGVIIAGYCVEGTLAKHIMSEPEEITTMSGQKLPLKMSVDYISFSAHTDYQQTS 390
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF------NLDCFMPANGESCFVQTDMKIS 291
+ I+ +P +V+LVHGE ++M LK + +E+ +++ P N E+
Sbjct: 391 EFIRALKPPHVILVHGEQNEMARLKAALIREYEDNDEVHIEVHNPRNTEA---------- 440
Query: 292 IDVSVNLLKEEAVKYNSEPPN--PLKERQIHGVLVIKD 327
V++N E+ K + P + ++I G+LV ++
Sbjct: 441 --VTLNFRGEKLAKVMGSLADKKPEQGQRISGILVKRN 476
>gi|348558392|ref|XP_003465002.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
3-like [Cavia porcellus]
Length = 684
Score = 224 bits (570), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 111/272 (40%), Positives = 168/272 (61%), Gaps = 3/272 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGD++ DRHL AA I +PD+LI ESTY T I + + RE F
Sbjct: 164 MFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPNIKPDILIIESTYGTHIHEKREEREARFC 223
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH+ V+RGG+ LIPVFALGRAQEL ++L+ YW+ PIY+A L +K Y+
Sbjct: 224 NTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQ 283
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ N KIRK N F FKHI D+ GP VV A+PGM+ SGLS +F+
Sbjct: 284 TYVNAMNDKIRKQININNPFVFKHISNLKSMDHFDDIGPSVVMASPGMMQSGLSRELFES 343
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W + N +I+ G+CV+GT+ ++S +++ + Q + +KM+++Y+SFSAH D +
Sbjct: 344 WCTDKRNGVIIAGYCVEGTLAKHIMSEPEEIATMSGQKLPLKMSVDYISFSAHTDYQQTS 403
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF 269
+ I+ +P +V+LVHGE ++M LK + +E+
Sbjct: 404 EFIRALKPPHVILVHGEQNEMARLKAALIREY 435
>gi|157117185|ref|XP_001652976.1| cleavage and polyadenylation specificity factor [Aedes aegypti]
gi|108876120|gb|EAT40345.1| AAEL007904-PA [Aedes aegypti]
Length = 687
Score = 223 bits (569), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 130/358 (36%), Positives = 203/358 (56%), Gaps = 19/358 (5%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ I+YTGD++ DRHL AA I +PD+LITESTY T I + + RE F
Sbjct: 171 MFMIEIAGVKILYTGDFSRQEDRHLMAAEIPAMKPDVLITESTYGTHIHEKREDRESRFT 230
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW-ERMNLQA-PIYFAVGLTEKATNYYK 118
V + V +GG+ LIPVFALGRAQEL ++L+ YW + LQ PIY+A L +K Y+
Sbjct: 231 SLVQKIVQQGGRCLIPVFALGRAQELLLILDEYWSQNPELQEFPIYYASSLAKKCMAVYQ 290
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
+I N KIR+ N F F+HI ++ GP VV A+PGM+ SGLS +F+
Sbjct: 291 TYINAMNDKIRRQIAVNNPFVFRHISNLKGIDHFEDIGPCVVMASPGMMQSGLSRELFET 350
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W N +I+ G+CV+GT+ +LS +++ + Q + + M+++Y+SFSAH D +
Sbjct: 351 WCTDPKNGVIIAGYCVEGTLAKTILSEPEEITSMSGQKLPLNMSVDYISFSAHTDYQQTS 410
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF------NLDCFMPANGESCFVQTDMKIS 291
+ I+ +P +V+LVHGE ++M LK +++E+ N+ + P N + + + +
Sbjct: 411 EFIRILKPAHVILVHGEQNEMNRLKSALQREYESDPNANITLYNPKNTHAVELYFRGEKT 470
Query: 292 IDVSVNLLKEEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIV 349
V NL AVK +P + +++ GVLV +D L+ + K +S +V
Sbjct: 471 AKVMGNL----AVK------SPEEGQKLSGVLVKRDFKYHLLAASDLSKYTDMSMSVV 518
>gi|71795627|ref|NP_001025201.1| cleavage and polyadenylation specificity factor subunit 3 [Rattus
norvegicus]
gi|71121802|gb|AAH99817.1| Cleavage and polyadenylation specificity factor 3 [Rattus
norvegicus]
Length = 685
Score = 223 bits (569), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 134/399 (33%), Positives = 217/399 (54%), Gaps = 30/399 (7%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGD++ DRHL AA I +PD+LI ESTY T I + + RE F
Sbjct: 164 MFMIEIAGMKLLYTGDFSRQEDRHLMAAEIPNIKPDILIIESTYGTHIHEKREEREARFC 223
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH+ V+RGG+ LIPVFALGRAQEL ++L+ YW+ PIY+A L +K Y+
Sbjct: 224 NTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQ 283
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ N KIRK N F FKHI D+ GP VV A+PGM+ +GLS +F+
Sbjct: 284 TYVNAMNDKIRKQININNPFVFKHISNLKSMDHFDDIGPSVVMASPGMIQNGLSRELFES 343
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W + N +I+ G+CV+GT+ ++S +++ + Q + +KM+++Y+SFSAH D +
Sbjct: 344 WCTDKRNGVIIAGYCVEGTLAKHIMSEPEEITTMSGQKLPLKMSVDYISFSAHTDYQQTS 403
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF------NLDCFMPANGESCFVQTDMKIS 291
+ I+ +P +V+LVHGE ++M LK + +E+ +++ P N E+
Sbjct: 404 EFIRALKPPHVILVHGEQNEMARLKAALIREYEDNDEVHIEVHNPRNTEA---------- 453
Query: 292 IDVSVNLLKEEAVKYNS--EPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIV 349
V++N E+ K P + +++ G+LV ++ + ++ C +S +
Sbjct: 454 --VTLNFRGEKLAKVMGFLADKKPEQGQRVSGILVKRNFNYHILS---PCD---LSNYTD 505
Query: 350 RFTSNVQISDSSPINKTLELIYDQLLSYLQDKSQEYKIQ 388
S V+ + + P L+Y QL D +E +IQ
Sbjct: 506 LAMSTVKQTQAIPYTGPFYLLYYQLQKLTGD-VEELEIQ 543
>gi|354504216|ref|XP_003514173.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
3 [Cricetulus griseus]
Length = 684
Score = 223 bits (569), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 134/399 (33%), Positives = 217/399 (54%), Gaps = 30/399 (7%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGD++ DRHL AA I +PD+LI ESTY T I + + RE F
Sbjct: 164 MFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPNIKPDILIIESTYGTHIHEKREEREARFC 223
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH+ V+RGG+ LIPVFALGRAQEL ++L+ YW+ PIY+A L +K Y+
Sbjct: 224 NTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQ 283
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ N KIRK N F FKHI D+ GP VV A+PGM+ +GLS +F+
Sbjct: 284 TYVNAMNDKIRKQININNPFVFKHISNLKSMDHFDDIGPSVVMASPGMIQNGLSRELFES 343
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W + N +I+ G+CV+GT+ ++S +++ + Q + +KM+++Y+SFSAH D +
Sbjct: 344 WCTDKRNGVIIAGYCVEGTLAKHIMSEPEEITTMSGQKLPLKMSVDYISFSAHTDYQQTS 403
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF------NLDCFMPANGESCFVQTDMKIS 291
+ I+ +P +V+LVHGE ++M LK + +E+ +++ P N E+
Sbjct: 404 EFIRALKPPHVILVHGEQNEMARLKAALIREYEDNDEVHIEVHNPRNTEA---------- 453
Query: 292 IDVSVNLLKEEAVKYNS--EPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIV 349
V++N E+ K P + +++ G+LV ++ + ++ C +S +
Sbjct: 454 --VTLNFRGEKLAKVMGFLADKKPEQGQRVSGILVKRNFNYHILS---PCD---LSNYTD 505
Query: 350 RFTSNVQISDSSPINKTLELIYDQLLSYLQDKSQEYKIQ 388
S V+ + + P L+Y QL D +E +IQ
Sbjct: 506 LAMSTVKQTQAIPYTGPFNLLYYQLQKLTGD-VEELEIQ 543
>gi|147905468|ref|NP_001088278.1| cleavage and polyadenylation specific factor 3, 73kDa [Xenopus
laevis]
gi|54038587|gb|AAH84286.1| LOC495111 protein [Xenopus laevis]
Length = 692
Score = 223 bits (569), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 122/338 (36%), Positives = 193/338 (57%), Gaps = 23/338 (6%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGD++ DRHL AA I +PD+LI ESTY T I + + RE F
Sbjct: 171 MFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPNIKPDILIIESTYGTHIHEKREEREARFC 230
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH+ V+RGG+ LIPVFALGRAQEL ++L+ YW+ PIY+A L +K Y+
Sbjct: 231 NTVHDIVNRGGRSLIPVFALGRAQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQ 290
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ N KIRK N F FKHI D+ GP VV A+PGM+ SGLS +F+
Sbjct: 291 TYVNAMNDKIRKQININNPFVFKHISNLKSMDHFDDIGPSVVMASPGMMQSGLSRELFES 350
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W + N +I+ G+CV+GT+ ++S +++ + Q + +KM+++Y+SFSAH D +
Sbjct: 351 WCTDKRNGVIIAGYCVEGTLAKHIMSEPEEIVTMSGQKLPLKMSVDYISFSAHTDYQQTS 410
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF------NLDCFMPANGESCFVQTDMKIS 291
+ ++ +P +V+LVHGE ++M LK + +E+ +++ P N E+
Sbjct: 411 EFVRALKPPHVILVHGEQNEMARLKAALIREYEDNEEVDIEVHNPRNTEA---------- 460
Query: 292 IDVSVNLLKEEAVKYNS--EPPNPLKERQIHGVLVIKD 327
V++N E+ K P + ++I G+LV ++
Sbjct: 461 --VTLNFRGEKLAKVMGFLADKKPEQAQRISGILVKRN 496
>gi|242007002|ref|XP_002424331.1| Cleavage and polyadenylation specificity factor 73 kDa subunit,
putative [Pediculus humanus corporis]
gi|212507731|gb|EEB11593.1| Cleavage and polyadenylation specificity factor 73 kDa subunit,
putative [Pediculus humanus corporis]
Length = 692
Score = 223 bits (569), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 111/275 (40%), Positives = 170/275 (61%), Gaps = 3/275 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGD++ DRHL AA I +PD+LITESTY T I + + RE F
Sbjct: 172 MFMIEIAGVRVLYTGDFSRQEDRHLMAAEIPSIKPDVLITESTYGTHIHEKREERETRFT 231
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
+H ++RGG+ LIPVFALGRAQEL ++L+ YW + PIY+A L +K Y+
Sbjct: 232 NLIHTIINRGGRCLIPVFALGRAQELLLILDDYWSQHPELHDIPIYYASSLAKKCMAVYQ 291
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ N KIR+ N F F+HI D+ GP VV A+PGM+ SGLS +F+
Sbjct: 292 TYVNAMNDKIRRQIAINNPFIFRHIHNLKGIDHFDDIGPCVVMASPGMMQSGLSRELFEL 351
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W N +I+ G+CV+GT+ ++LS +++ + Q + +KM+++Y+SFSAH D K
Sbjct: 352 WCTDSKNGVIIAGYCVEGTLAKQILSEPEEIVTLSGQKLPLKMSVDYISFSAHTDYKQTS 411
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLD 272
+ ++ +P +V+LVHGE ++M LK +K+E+ D
Sbjct: 412 EFVRALKPPHVVLVHGEQNEMNRLKAALKREYEDD 446
>gi|432100623|gb|ELK29151.1| Cleavage and polyadenylation specificity factor subunit 3 [Myotis
davidii]
Length = 684
Score = 223 bits (569), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 111/272 (40%), Positives = 168/272 (61%), Gaps = 3/272 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGD++ DRHL AA I +PD+LI ESTY T I + + RE F
Sbjct: 164 MFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPNIKPDILIIESTYGTHIHEKREEREARFC 223
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH+ V+RGG+ LIPVFALGRAQEL ++L+ YW+ PIY+A L +K Y+
Sbjct: 224 NTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQ 283
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ N KIRK N F FKHI D+ GP VV A+PGM+ SGLS +F+
Sbjct: 284 TYVNAMNDKIRKQININNPFVFKHISNLKSMDHFDDIGPSVVMASPGMMQSGLSRELFES 343
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W + N +I+ G+CV+GT+ ++S +++ + Q + +KM+++Y+SFSAH D +
Sbjct: 344 WCTDKRNGVIIAGYCVEGTLAKHIMSEPEEITTMSGQKLPLKMSVDYISFSAHTDYQQTS 403
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF 269
+ I+ +P +V+LVHGE ++M LK + +E+
Sbjct: 404 EFIRALKPPHVILVHGEQNEMARLKAALIREY 435
>gi|119621395|gb|EAX00990.1| cleavage and polyadenylation specific factor 3, 73kDa, isoform
CRA_b [Homo sapiens]
Length = 647
Score = 223 bits (569), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 111/272 (40%), Positives = 168/272 (61%), Gaps = 3/272 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGD++ DRHL AA I +PD+LI ESTY T I + + RE F
Sbjct: 127 MFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPNIKPDILIIESTYGTHIHEKREEREARFC 186
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH+ V+RGG+ LIPVFALGRAQEL ++L+ YW+ PIY+A L +K Y+
Sbjct: 187 NTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQ 246
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ N KIRK N F FKHI D+ GP VV A+PGM+ SGLS +F+
Sbjct: 247 TYVNAMNDKIRKQININNPFVFKHISNLKSMDHFDDIGPSVVMASPGMMQSGLSRELFES 306
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W + N +I+ G+CV+GT+ ++S +++ + Q + +KM+++Y+SFSAH D +
Sbjct: 307 WCTDKRNGVIIAGYCVEGTLAKHIMSEPEEITTMSGQKLPLKMSVDYISFSAHTDYQQTS 366
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF 269
+ I+ +P +V+LVHGE ++M LK + +E+
Sbjct: 367 EFIRALKPPHVILVHGEQNEMARLKAALIREY 398
>gi|427779921|gb|JAA55412.1| Putative cleavage and polyadenylation specificity factor cpsf
subunit [Rhipicephalus pulchellus]
Length = 737
Score = 223 bits (569), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 124/342 (36%), Positives = 195/342 (57%), Gaps = 19/342 (5%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGD++ DRHL AA I PD+LI ESTY T I + + RE F
Sbjct: 220 MFMIEIAGVKVLYTGDFSRQEDRHLMAAEIPNIHPDVLIIESTYGTHIHEKREEREARFT 279
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH+ V+RGG+ LIPVFALGRAQEL ++L+ YW PIY+A L +K Y+
Sbjct: 280 GLVHDIVNRGGRCLIPVFALGRAQELLLILDEYWSNHPELHDIPIYYASSLAKKCMAVYQ 339
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ N++IR+ N F FKHI ++ GP VV A+PGM+ SGLS +F+
Sbjct: 340 TYVNAMNERIRRQITINNPFVFKHISNLKSIEHFEDIGPCVVMASPGMMQSGLSRELFES 399
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W N +I+ G+CV+GT+ +LS +++ Q + +KM+++Y+SFSAH D +
Sbjct: 400 WCTDPKNGVIIAGYCVEGTLAKTILSEPEEISTMVGQKLPLKMSVDYISFSAHTDYQQTS 459
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF------NLDCFMPANGESCFVQTDMKIS 291
+ I+ +P +++LVHGE ++M LK I +E+ ++ P N ++ ++
Sbjct: 460 EFIRTLKPPHIVLVHGEQNEMGRLKAAIVREYEDDLETRIEVHNPRNTQA----VELHFR 515
Query: 292 IDVSVNLLKEEAVKYNSEPPNPLKERQIHGVLVIKDSSISLM 333
+ + ++ AV +PP P RQ+ GVLV ++ S L+
Sbjct: 516 GEKTAKVMGSLAV----QPPEP--GRQLSGVLVKRNFSYHLL 551
>gi|62898706|dbj|BAD97207.1| cleavage and polyadenylation specific factor 3, 73kDa variant [Homo
sapiens]
Length = 684
Score = 223 bits (569), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 111/272 (40%), Positives = 168/272 (61%), Gaps = 3/272 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGD++ DRHL AA I +PD+LI ESTY T I + + RE F
Sbjct: 164 MFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPNIKPDILIIESTYGTHIHEKREEREARFC 223
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH+ V+RGG+ LIPVFALGRAQEL ++L+ YW+ PIY+A L +K Y+
Sbjct: 224 NTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQ 283
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ N KIRK N F FKHI D+ GP VV A+PGM+ SGLS +F+
Sbjct: 284 TYVNAMNDKIRKQININNPFVFKHISNLKSMDHFDDIGPSVVMASPGMMQSGLSRELFES 343
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W + N +I+ G+CV+GT+ ++S +++ + Q + +KM+++Y+SFSAH D +
Sbjct: 344 WCTDKRNGVIIAGYCVEGTLAKHIMSEPEEITTMSGQKLPLKMSVDYISFSAHTDYQQTS 403
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF 269
+ I+ +P +V+LVHGE ++M LK + +E+
Sbjct: 404 EFIRALKPPHVILVHGEQNEMARLKAALIREY 435
>gi|2394306|gb|AAB70268.1| 73 kDA subunit of cleavage and polyadenylation specificity factor
[Homo sapiens]
Length = 379
Score = 223 bits (568), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 111/272 (40%), Positives = 168/272 (61%), Gaps = 3/272 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGD++ DRHL AA I +PD+LI ESTY T I + + RE F
Sbjct: 45 MFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPNIKPDILIIESTYGTHIHEKREEREARFC 104
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH+ V+RGG+ LIPVFALGRAQEL ++L+ YW+ PIY+A L +K Y+
Sbjct: 105 NTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQNHPELXDXPIYYASSLAKKCMAVYQ 164
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ N KIRK N F FKHI D+ GP VV A+PGM+ SGLS +F+
Sbjct: 165 TYVNAMNDKIRKQININNPFVFKHISNLKSMDHFDDIGPSVVMASPGMMQSGLSRELFES 224
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W + N +I+ G+CV+GT+ ++S +++ + Q + +KM+++Y+SFSAH D +
Sbjct: 225 WCTDKRNGVIIAGYCVEGTLAKHIMSEPEEITTMSGQKLPLKMSVDYISFSAHTDYQQTS 284
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF 269
+ I+ +P +V+LVHGE ++M LK + +E+
Sbjct: 285 EFIRALKPPHVILVHGEQNEMARLKAALIREY 316
>gi|326426580|gb|EGD72150.1| cleavage and polyadenylation specificity factor subunit 3
[Salpingoeca sp. ATCC 50818]
Length = 790
Score = 223 bits (568), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 122/340 (35%), Positives = 186/340 (54%), Gaps = 25/340 (7%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ +VYTGD++ DRHL AA + PD+LITEST+ +S++ RE F
Sbjct: 203 MFDIEIAGVRLVYTGDFSREEDRHLMAAEVPPNSPDILITESTFGVRQHESRQTREHRFT 262
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
K +H+ VDRGG+ LIPVFALGRAQEL ++L+ YW+ + + PIY+A L + YK
Sbjct: 263 KTIHDVVDRGGRCLIPVFALGRAQELLLILDDYWQNHDELHRVPIYYASALARRCMAVYK 322
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHI------RPFDKSFIDNPGPMVVFATPGMLHSGLSL 172
++ + I+KT N F+F+H+ FD + GP V+ A+PGML SGLS
Sbjct: 323 TYVNVMKESIQKTISINNPFNFRHVSYIRNLHQFDGEY--GGGPCVMLASPGMLQSGLSR 380
Query: 173 IIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHAD 232
IF++WA + N +++ G+ V GT+ +L KK+ E ID + I Y+SFSAH D
Sbjct: 381 EIFERWASNKANCVLLAGYVVNGTLAKDLLKAPKKVTSEAGHKIDRECDIAYISFSAHVD 440
Query: 233 AKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLD-----CFMPANGESCFVQTD 287
I+ +P +++LVHGE +M K +I + D F P N E+ +
Sbjct: 441 YAQNRDFIRALDPTHIVLVHGEKHEMGRFKMQITTDLEADQKTASVFDPRNAETVLLHYR 500
Query: 288 MKISIDVSVNLLKEEAVKYNSEPPNPLKERQIHGVLVIKD 327
+ V +L + P++ R++ G+LV K+
Sbjct: 501 GEKMAKVLGSLAQS----------GPVQGRRVSGILVAKE 530
>gi|15079675|gb|AAH11654.1| Cleavage and polyadenylation specific factor 3, 73kDa [Homo
sapiens]
gi|157929136|gb|ABW03853.1| cleavage and polyadenylation specific factor 3, 73kDa [synthetic
construct]
Length = 684
Score = 223 bits (568), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 111/272 (40%), Positives = 168/272 (61%), Gaps = 3/272 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGD++ DRHL AA I +PD+LI ESTY T I + + RE F
Sbjct: 164 MFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPNIKPDILIIESTYGTHIHEKREEREARFC 223
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH+ V+RGG+ LIPVFALGRAQEL ++L+ YW+ PIY+A L +K Y+
Sbjct: 224 NTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQ 283
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ N KIRK N F FKHI D+ GP VV A+PGM+ SGLS +F+
Sbjct: 284 TYVNAMNDKIRKQININNPFVFKHISNLKSMDHFDDIGPSVVMASPGMMQSGLSRELFES 343
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W + N +I+ G+CV+GT+ ++S +++ + Q + +KM+++Y+SFSAH D +
Sbjct: 344 WCTDKRNGVIIAGYCVEGTLAKHIMSEPEEITTMSGQKLPLKMSVDYISFSAHTDYQQTS 403
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF 269
+ I+ +P +V+LVHGE ++M LK + +E+
Sbjct: 404 EFIRALKPPHVILVHGEQNEMARLKAALIREY 435
>gi|335285899|ref|XP_003354974.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
3-like [Sus scrofa]
Length = 684
Score = 223 bits (568), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 111/272 (40%), Positives = 168/272 (61%), Gaps = 3/272 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGD++ DRHL AA I +PD+LI ESTY T I + + RE F
Sbjct: 164 MFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPNIKPDILIIESTYGTHIHEKREEREARFC 223
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH+ V+RGG+ LIPVFALGRAQEL ++L+ YW+ PIY+A L +K Y+
Sbjct: 224 NTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQ 283
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ N KIRK N F FKHI D+ GP VV A+PGM+ SGLS +F+
Sbjct: 284 TYVNAMNDKIRKQININNPFVFKHISNLKSMDHFDDIGPSVVMASPGMMQSGLSRELFES 343
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W + N +I+ G+CV+GT+ ++S +++ + Q + +KM+++Y+SFSAH D +
Sbjct: 344 WCTDKRNGVIIAGYCVEGTLAKHIMSEPEEITTMSGQKLPLKMSVDYISFSAHTDYQQTS 403
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF 269
+ I+ +P +V+LVHGE ++M LK + +E+
Sbjct: 404 EFIRALKPPHVILVHGEQNEMARLKAALIREY 435
>gi|417412420|gb|JAA52597.1| Putative cleavage and polyadenylation specificity factor cpsf
subunit, partial [Desmodus rotundus]
Length = 714
Score = 223 bits (568), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 111/272 (40%), Positives = 168/272 (61%), Gaps = 3/272 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGD++ DRHL AA I +PD+LI ESTY T I + + RE F
Sbjct: 194 MFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPNIKPDILIIESTYGTHIHEKREEREARFC 253
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH+ V+RGG+ LIPVFALGRAQEL ++L+ YW+ PIY+A L +K Y+
Sbjct: 254 NTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQ 313
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ N KIRK N F FKHI D+ GP VV A+PGM+ SGLS +F+
Sbjct: 314 TYVNAMNDKIRKQININNPFVFKHISNLKSMDHFDDIGPSVVMASPGMMQSGLSRELFES 373
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W + N +I+ G+CV+GT+ ++S +++ + Q + +KM+++Y+SFSAH D +
Sbjct: 374 WCTDKRNGVIIAGYCVEGTLAKHIMSEPEEITTMSGQKLPLKMSVDYISFSAHTDYQQTS 433
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF 269
+ I+ +P +V+LVHGE ++M LK + +E+
Sbjct: 434 EFIRALKPPHVILVHGEQNEMARLKAALIREY 465
>gi|7706427|ref|NP_057291.1| cleavage and polyadenylation specificity factor subunit 3 [Homo
sapiens]
gi|18203503|sp|Q9UKF6.1|CPSF3_HUMAN RecName: Full=Cleavage and polyadenylation specificity factor
subunit 3; AltName: Full=Cleavage and polyadenylation
specificity factor 73 kDa subunit; Short=CPSF 73 kDa
subunit; AltName: Full=mRNA 3'-end-processing
endonuclease CPSF-73
gi|6002955|gb|AAF00224.1|AF171877_1 cleavage and polyadenylation specificity factor 73 kDa subunit
[Homo sapiens]
gi|18044212|gb|AAH20211.1| Cleavage and polyadenylation specific factor 3, 73kDa [Homo
sapiens]
gi|62822309|gb|AAY14858.1| unknown [Homo sapiens]
gi|119621394|gb|EAX00989.1| cleavage and polyadenylation specific factor 3, 73kDa, isoform
CRA_a [Homo sapiens]
Length = 684
Score = 223 bits (568), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 111/272 (40%), Positives = 168/272 (61%), Gaps = 3/272 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGD++ DRHL AA I +PD+LI ESTY T I + + RE F
Sbjct: 164 MFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPNIKPDILIIESTYGTHIHEKREEREARFC 223
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH+ V+RGG+ LIPVFALGRAQEL ++L+ YW+ PIY+A L +K Y+
Sbjct: 224 NTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQ 283
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ N KIRK N F FKHI D+ GP VV A+PGM+ SGLS +F+
Sbjct: 284 TYVNAMNDKIRKQININNPFVFKHISNLKSMDHFDDIGPSVVMASPGMMQSGLSRELFES 343
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W + N +I+ G+CV+GT+ ++S +++ + Q + +KM+++Y+SFSAH D +
Sbjct: 344 WCTDKRNGVIIAGYCVEGTLAKHIMSEPEEITTMSGQKLPLKMSVDYISFSAHTDYQQTS 403
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF 269
+ I+ +P +V+LVHGE ++M LK + +E+
Sbjct: 404 EFIRALKPPHVILVHGEQNEMARLKAALIREY 435
>gi|351704796|gb|EHB07715.1| Cleavage and polyadenylation specificity factor subunit 3
[Heterocephalus glaber]
Length = 692
Score = 223 bits (568), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 111/272 (40%), Positives = 168/272 (61%), Gaps = 3/272 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGD++ DRHL AA I +PD+LI ESTY T I + + RE F
Sbjct: 164 MFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPNIKPDILIIESTYGTHIHEKREEREARFC 223
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH+ V+RGG+ LIPVFALGRAQEL ++L+ YW+ PIY+A L +K Y+
Sbjct: 224 NTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQ 283
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ N KIRK N F FKHI D+ GP VV A+PGM+ SGLS +F+
Sbjct: 284 TYVHAMNDKIRKQININNPFVFKHISNLKSMDHFDDIGPSVVMASPGMMQSGLSRELFES 343
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W + N +I+ G+CV+GT+ ++S +++ + Q + +KM+++Y+SFSAH D +
Sbjct: 344 WCTDKRNGVIIAGYCVEGTLAKHIMSEPEEIATMSGQKLPLKMSVDYISFSAHTDYQQTS 403
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF 269
+ I+ +P +V+LVHGE ++M LK + +E+
Sbjct: 404 EFIRALKPPHVILVHGEQNEMARLKAALIREY 435
>gi|300676780|gb|ADK26656.1| cleavage and polyadenylation specific factor 3, 73kDa [Zonotrichia
albicollis]
Length = 721
Score = 223 bits (568), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 123/338 (36%), Positives = 194/338 (57%), Gaps = 23/338 (6%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGD++ DRHL AA I +PD+LI ESTY T I + + RE F
Sbjct: 201 MFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPNIKPDILIIESTYGTHIHEKREEREARFC 260
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH+ V+RGG+ LIPVFALGRAQEL ++L+ YW+ PIY+A L +K Y+
Sbjct: 261 NTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQ 320
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ N KIRK N F FKHI D+ GP VV A+PGM+ SGLS +F+
Sbjct: 321 TYVNAMNDKIRKQININNPFVFKHISNLKSMDHFDDIGPSVVMASPGMMQSGLSRELFES 380
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W + N +I+ G+CV+GT+ ++S +++ + Q + +KM+++Y+SFSAH D +
Sbjct: 381 WCTDKRNGVIIAGYCVEGTLAKHIMSEPEEITTMSGQKLPLKMSVDYISFSAHTDYQQTS 440
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF------NLDCFMPANGESCFVQTDMKIS 291
+ I+ +P +V+LVHGE ++M LK + +E+ +++ P N E+
Sbjct: 441 EFIRALKPPHVILVHGEQNEMARLKAALIREYEDNDEVHIEVHNPRNTEA---------- 490
Query: 292 IDVSVNLLKEEAVKYNSEPPN--PLKERQIHGVLVIKD 327
V++N E+ K + P + ++I G+LV ++
Sbjct: 491 --VTLNFRGEKLAKVMGSLADKKPEQGQRISGILVKRN 526
>gi|291233360|ref|XP_002736621.1| PREDICTED: cleavage and polyadenylation specific factor 3,
73kDa-like [Saccoglossus kowalevskii]
Length = 715
Score = 223 bits (568), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 114/274 (41%), Positives = 169/274 (61%), Gaps = 7/274 (2%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGD++ DRHL AA + RPD+LI ESTY T I + + RE F
Sbjct: 171 MFMIEIAGVKLLYTGDFSRQEDRHLMAAELPSVRPDVLIIESTYGTHIHEKREEREARFT 230
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH+ V+RGG+ LIPVFALGRAQEL ++L+ YW PIY+A L +K + Y+
Sbjct: 231 GTVHDIVNRGGRCLIPVFALGRAQELLLILDEYWANHPELHDIPIYYASSLAKKCMSVYQ 290
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHI---RPFDKSFIDNPGPMVVFATPGMLHSGLSLIIF 175
+I N KI++ N F FKHI R D D+ GP VV A+PGM+ SGLS +F
Sbjct: 291 TYINAMNDKIKRQITISNPFVFKHISNLRGMDH--FDDIGPSVVMASPGMMQSGLSRELF 348
Query: 176 KKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKG 235
+ W N +I+ G+CV+GT+ +LS +++ + Q + +KM+++Y+SFSAHAD +
Sbjct: 349 ESWCTDRRNGVIIAGYCVEGTLAKHILSQPEEVTTMSGQKLPLKMSVDYISFSAHADYQQ 408
Query: 236 IMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF 269
I I+ P +V+LVHGE ++M LK + +++
Sbjct: 409 ISDFIRKLIPPHVILVHGEQNEMGRLKAAVVRDY 442
>gi|74221128|dbj|BAE42066.1| unnamed protein product [Mus musculus]
Length = 684
Score = 223 bits (568), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 134/399 (33%), Positives = 217/399 (54%), Gaps = 30/399 (7%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGD++ DRHL AA I +PD+LI ESTY T I + + RE F
Sbjct: 164 MFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPNIKPDILIIESTYGTHIHEKREEREARFC 223
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH+ V+RGG+ LIPVFALGRAQEL ++L+ YW+ PIY+A L +K Y+
Sbjct: 224 NTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQ 283
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ N KIRK N F FKHI D+ GP VV A+PGM+ +GLS +F+
Sbjct: 284 TYVNAMNDKIRKQININNPFVFKHISNLKSMDHFDDIGPSVVMASPGMIQNGLSRELFES 343
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W + N +I+ G+CV+GT+ ++S +++ + Q + +KM+++Y+SFSAH D +
Sbjct: 344 WCTDKRNGVIIAGYCVEGTLAKHIMSEPEEITTMSGQKLPLKMSVDYISFSAHTDYQQTS 403
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF------NLDCFMPANGESCFVQTDMKIS 291
+ I+ +P +V+LVHGE ++M LK + +E+ +++ P N E+
Sbjct: 404 EFIRALKPPHVILVHGEKNEMARLKAALIREYEDNDEVHIEVHNPRNTEA---------- 453
Query: 292 IDVSVNLLKEEAVKYNS--EPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIV 349
V++N E+ K P + +++ G+LV ++ + ++ C +S +
Sbjct: 454 --VTLNFRGEKLAKVMGFLADKKPEQGQRVSGILVKRNFNYHILS---PCD---LSNYTD 505
Query: 350 RFTSNVQISDSSPINKTLELIYDQLLSYLQDKSQEYKIQ 388
S V+ + + P L+Y QL D +E +IQ
Sbjct: 506 LAMSTVKQTQAIPYTGPFYLLYYQLQKLTGD-VEELEIQ 543
>gi|403270697|ref|XP_003927303.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
3 [Saimiri boliviensis boliviensis]
Length = 658
Score = 223 bits (568), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 114/290 (39%), Positives = 175/290 (60%), Gaps = 9/290 (3%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGD++ DRHL AA I +PD+LI ESTY T I + + RE F
Sbjct: 168 MFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPNIKPDILIIESTYGTHIHEKREEREARFC 227
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH+ V+RGG+ LIPVFALGRAQEL ++L+ YW+ PIY+A L +K Y+
Sbjct: 228 NTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQ 287
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ N KIRK N F FKHI D+ GP VV A+PGM+ SGLS +F+
Sbjct: 288 TYVNAMNDKIRKQININNPFVFKHISNLKSMDHFDDIGPSVVMASPGMMQSGLSRELFES 347
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W + N +I+ G+CV+GT+ ++S +++ + Q + +KM+++Y+SFSAH D +
Sbjct: 348 WCTDKRNGVIIAGYCVEGTLAKHIMSEPEEITTMSGQKLPLKMSVDYISFSAHTDYQQTS 407
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF------NLDCFMPANGES 281
+ I+ +P +V+LVHGE ++M LK + +E+ +++ P N E+
Sbjct: 408 EFIRALKPPHVILVHGEQNEMARLKAALIREYEDNDEVHIEVHNPRNTEA 457
>gi|410928245|ref|XP_003977511.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
3-like [Takifugu rubripes]
Length = 696
Score = 223 bits (568), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 112/272 (41%), Positives = 166/272 (61%), Gaps = 3/272 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGD++ DRHL AA I +PD+LITESTY T I + + RE F
Sbjct: 171 MFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPSVKPDILITESTYGTHIHEKREEREARFC 230
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH+ V+R G+ LIPVFALGRAQEL ++L+ YW+ PIY+A L K Y+
Sbjct: 231 NTVHDIVNREGRCLIPVFALGRAQELLLILDEYWQNHPELHDIPIYYASSLARKCMAVYQ 290
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
+I N KIRK N F FKHI D+ GP VV A+PGM+ SGLS +F+
Sbjct: 291 TYINAMNDKIRKAININNPFVFKHISNLKSMDHFDDIGPSVVMASPGMMQSGLSRELFES 350
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W + N +I+ G+CV+GT+ ++S ++ + Q + +KM+++Y+SFSAH D +
Sbjct: 351 WCTDKRNGVIIAGYCVEGTLAKHIMSEPDEITTMSGQKLALKMSVDYISFSAHTDYQQTS 410
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF 269
+ I+ +P +V+LVHGE ++M LK + +E+
Sbjct: 411 EFIRALKPPHVILVHGEQNEMARLKAALIREY 442
>gi|31980904|ref|NP_061283.2| cleavage and polyadenylation specificity factor subunit 3 [Mus
musculus]
gi|341940395|sp|Q9QXK7.2|CPSF3_MOUSE RecName: Full=Cleavage and polyadenylation specificity factor
subunit 3; AltName: Full=Cleavage and polyadenylation
specificity factor 73 kDa subunit; Short=CPSF 73 kDa
subunit; Short=mRNA 3'-end-processing endonuclease
CPSF-73
gi|23271024|gb|AAH23297.1| Cleavage and polyadenylation specificity factor 3 [Mus musculus]
Length = 684
Score = 223 bits (568), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 134/399 (33%), Positives = 217/399 (54%), Gaps = 30/399 (7%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGD++ DRHL AA I +PD+LI ESTY T I + + RE F
Sbjct: 164 MFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPNIKPDILIIESTYGTHIHEKREEREARFC 223
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH+ V+RGG+ LIPVFALGRAQEL ++L+ YW+ PIY+A L +K Y+
Sbjct: 224 NTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQ 283
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ N KIRK N F FKHI D+ GP VV A+PGM+ +GLS +F+
Sbjct: 284 TYVNAMNDKIRKQININNPFVFKHISNLKSMDHFDDIGPSVVMASPGMIQNGLSRELFES 343
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W + N +I+ G+CV+GT+ ++S +++ + Q + +KM+++Y+SFSAH D +
Sbjct: 344 WCTDKRNGVIIAGYCVEGTLAKHIMSEPEEITTMSGQKLPLKMSVDYISFSAHTDYQQTS 403
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF------NLDCFMPANGESCFVQTDMKIS 291
+ I+ +P +V+LVHGE ++M LK + +E+ +++ P N E+
Sbjct: 404 EFIRALKPPHVILVHGEQNEMARLKAALIREYEDNDEVHIEVHNPRNTEA---------- 453
Query: 292 IDVSVNLLKEEAVKYNS--EPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIV 349
V++N E+ K P + +++ G+LV ++ + ++ C +S +
Sbjct: 454 --VTLNFRGEKLAKVMGFLADKKPEQGQRVSGILVKRNFNYHILS---PCD---LSNYTD 505
Query: 350 RFTSNVQISDSSPINKTLELIYDQLLSYLQDKSQEYKIQ 388
S V+ + + P L+Y QL D +E +IQ
Sbjct: 506 LAMSTVKQTQAIPYTGPFYLLYYQLQKLTGD-VEELEIQ 543
>gi|148702078|gb|EDL34025.1| cleavage and polyadenylation specificity factor 3, isoform CRA_b
[Mus musculus]
Length = 701
Score = 223 bits (568), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 134/399 (33%), Positives = 217/399 (54%), Gaps = 30/399 (7%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGD++ DRHL AA I +PD+LI ESTY T I + + RE F
Sbjct: 181 MFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPNIKPDILIIESTYGTHIHEKREEREARFC 240
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH+ V+RGG+ LIPVFALGRAQEL ++L+ YW+ PIY+A L +K Y+
Sbjct: 241 NTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQ 300
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ N KIRK N F FKHI D+ GP VV A+PGM+ +GLS +F+
Sbjct: 301 TYVNAMNDKIRKQININNPFVFKHISNLKSMDHFDDIGPSVVMASPGMIQNGLSRELFES 360
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W + N +I+ G+CV+GT+ ++S +++ + Q + +KM+++Y+SFSAH D +
Sbjct: 361 WCTDKRNGVIIAGYCVEGTLAKHIMSEPEEITTMSGQKLPLKMSVDYISFSAHTDYQQTS 420
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF------NLDCFMPANGESCFVQTDMKIS 291
+ I+ +P +V+LVHGE ++M LK + +E+ +++ P N E+
Sbjct: 421 EFIRALKPPHVILVHGEQNEMARLKAALIREYEDNDEVHIEVHNPRNTEA---------- 470
Query: 292 IDVSVNLLKEEAVKYNS--EPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIV 349
V++N E+ K P + +++ G+LV ++ + ++ C +S +
Sbjct: 471 --VTLNFRGEKLAKVMGFLADKKPEQGQRVSGILVKRNFNYHILS---PCD---LSNYTD 522
Query: 350 RFTSNVQISDSSPINKTLELIYDQLLSYLQDKSQEYKIQ 388
S V+ + + P L+Y QL D +E +IQ
Sbjct: 523 LAMSTVKQTQAIPYTGPFYLLYYQLQKLTGD-VEELEIQ 560
>gi|395507218|ref|XP_003757924.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
3 [Sarcophilus harrisii]
Length = 684
Score = 223 bits (568), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 111/272 (40%), Positives = 168/272 (61%), Gaps = 3/272 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGD++ DRHL AA I +PD+LI ESTY T I + + RE F
Sbjct: 164 MFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPNIKPDILIIESTYGTHIHEKREEREARFC 223
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH+ V+RGG+ LIPVFALGRAQEL ++L+ YW+ PIY+A L +K Y+
Sbjct: 224 NTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQ 283
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ N KIRK N F FKHI D+ GP VV A+PGM+ SGLS +F+
Sbjct: 284 TYVNAMNDKIRKQININNPFVFKHISNLKSMDHFDDIGPSVVMASPGMMQSGLSRELFES 343
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W + N +I+ G+CV+GT+ ++S +++ + Q + +KM+++Y+SFSAH D +
Sbjct: 344 WCTDKRNGVIIAGYCVEGTLAKHIMSEPEEITTMSGQKLPLKMSVDYISFSAHTDYQQTS 403
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF 269
+ I+ +P +V+LVHGE ++M LK + +E+
Sbjct: 404 EFIRALKPPHVILVHGEQNEMARLKAALIREY 435
>gi|149050991|gb|EDM03164.1| cleavage and polyadenylation specificity factor 3, isoform CRA_a
[Rattus norvegicus]
Length = 685
Score = 223 bits (568), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 134/399 (33%), Positives = 217/399 (54%), Gaps = 30/399 (7%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGD++ DRHL AA I +PD+LI ESTY T I + + RE F
Sbjct: 164 MFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPNIKPDILIIESTYGTHIHEKREEREARFC 223
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH+ V+RGG+ LIPVFALGRAQEL ++L+ YW+ PIY+A L +K Y+
Sbjct: 224 NTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQ 283
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ N KIRK N F FKHI D+ GP VV A+PGM+ +GLS +F+
Sbjct: 284 TYVNAMNDKIRKQININNPFVFKHISNLKSMDHFDDIGPSVVMASPGMIQNGLSRELFES 343
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W + N +I+ G+CV+GT+ ++S +++ + Q + +KM+++Y+SFSAH D +
Sbjct: 344 WCTDKRNGVIIAGYCVEGTLAKHIMSEPEEITTMSGQKLPLKMSVDYISFSAHTDYQQTS 403
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF------NLDCFMPANGESCFVQTDMKIS 291
+ I+ +P +V+LVHGE ++M LK + +E+ +++ P N E+
Sbjct: 404 EFIRALKPPHVILVHGEQNEMARLKAALIREYEDNDEVHIEVHNPRNTEA---------- 453
Query: 292 IDVSVNLLKEEAVKYNS--EPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIV 349
V++N E+ K P + +++ G+LV ++ + ++ C +S +
Sbjct: 454 --VTLNFRGEKLAKVMGFLADKKPEQGQRVSGILVKRNFNYHILS---PCD---LSNYTD 505
Query: 350 RFTSNVQISDSSPINKTLELIYDQLLSYLQDKSQEYKIQ 388
S V+ + + P L+Y QL D +E +IQ
Sbjct: 506 LAMSTVKQTQAIPYTGPFYLLYYQLQKLTGD-VEELEIQ 543
>gi|195145744|ref|XP_002013850.1| GL23169 [Drosophila persimilis]
gi|194102793|gb|EDW24836.1| GL23169 [Drosophila persimilis]
Length = 684
Score = 223 bits (567), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 131/377 (34%), Positives = 209/377 (55%), Gaps = 10/377 (2%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ I+YTGD++ DRHL AA + +PD+LITESTY T I + + RE F
Sbjct: 170 MFMIEIAGIKILYTGDFSRQEDRHLMAAEVPPMKPDVLITESTYGTHIHEKREDRENRFT 229
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
V + V +GG+ LIPVFALGRAQEL ++L+ +W + + PIY+A L +K Y+
Sbjct: 230 SLVQKTVLQGGRCLIPVFALGRAQELLLILDEFWSQNPELHEIPIYYASSLAKKCMAVYQ 289
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
+I N +IR+ N F F+HI ++ GP V+ A+PGM+ SGLS +F+
Sbjct: 290 TYINAMNDRIRRQIAVNNPFVFRHISNLKGIDHFEDIGPCVIMASPGMMQSGLSRELFES 349
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W N +I+ G+CV+GT+ +LS +++ + Q + + M+++Y+SFSAH D +
Sbjct: 350 WCTDPKNGVIIAGYCVEGTLAKTILSEPEEITTLSGQKLPLNMSVDYISFSAHTDYQQTS 409
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVN 297
+ I+ P +V+LVHGE ++M LK +++E+ D A+ + F ++D+
Sbjct: 410 EFIRLLRPTHVVLVHGEQNEMSRLKLALQREYEAD----ASTDIKFYNPRNTHAVDLYFR 465
Query: 298 LLKEEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQI 357
K V N NP ++ GVLV +D L+ + K +S +V T I
Sbjct: 466 GEKTAKVMGNLAAKNPEVGSKLSGVLVKRDFKYHLLAPSDLGKYTDMSMSVV--TQRQSI 523
Query: 358 SDSSPINKTLELIYDQL 374
+S + TLEL+ D++
Sbjct: 524 PWTSSLG-TLELLLDRI 539
>gi|198451826|ref|XP_001358526.2| GA20526 [Drosophila pseudoobscura pseudoobscura]
gi|198131664|gb|EAL27667.2| GA20526 [Drosophila pseudoobscura pseudoobscura]
Length = 684
Score = 223 bits (567), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 131/377 (34%), Positives = 209/377 (55%), Gaps = 10/377 (2%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ I+YTGD++ DRHL AA + +PD+LITESTY T I + + RE F
Sbjct: 170 MFMIEIAGIKILYTGDFSRQEDRHLMAAEVPPMKPDVLITESTYGTHIHEKREDRENRFT 229
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
V + V +GG+ LIPVFALGRAQEL ++L+ +W + + PIY+A L +K Y+
Sbjct: 230 SLVQKTVLQGGRCLIPVFALGRAQELLLILDEFWSQNPELHEIPIYYASSLAKKCMAVYQ 289
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
+I N +IR+ N F F+HI ++ GP V+ A+PGM+ SGLS +F+
Sbjct: 290 TYINAMNDRIRRQIAVNNPFVFRHISNLKGIDHFEDIGPCVIMASPGMMQSGLSRELFES 349
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W N +I+ G+CV+GT+ +LS +++ + Q + + M+++Y+SFSAH D +
Sbjct: 350 WCTDPKNGVIIAGYCVEGTLAKTILSEPEEITTLSGQKLPLNMSVDYISFSAHTDYQQTS 409
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVN 297
+ I+ P +V+LVHGE ++M LK +++E+ D A+ + F ++D+
Sbjct: 410 EFIRLLRPTHVVLVHGEQNEMSRLKLALQREYEAD----ASTDIKFYNPRNTHAVDLYFR 465
Query: 298 LLKEEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQI 357
K V N NP ++ GVLV +D L+ + K +S +V T I
Sbjct: 466 GEKTAKVMGNLAAKNPEVGSKLSGVLVKRDFKYHLLAPSDLGKYTDMSMSVV--TQRQSI 523
Query: 358 SDSSPINKTLELIYDQL 374
+S + TLEL+ D++
Sbjct: 524 PWTSSLG-TLELLLDRI 539
>gi|126303222|ref|XP_001371997.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
3-like [Monodelphis domestica]
Length = 684
Score = 223 bits (567), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 111/272 (40%), Positives = 168/272 (61%), Gaps = 3/272 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGD++ DRHL AA I +PD+LI ESTY T I + + RE F
Sbjct: 164 MFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPNIKPDILIIESTYGTHIHEKREEREARFC 223
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH+ V+RGG+ LIPVFALGRAQEL ++L+ YW+ PIY+A L +K Y+
Sbjct: 224 NTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQ 283
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ N KIRK N F FKHI D+ GP VV A+PGM+ SGLS +F+
Sbjct: 284 TYVNAMNDKIRKQININNPFVFKHISNLKSMDHFDDIGPSVVMASPGMMQSGLSRELFES 343
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W + N +I+ G+CV+GT+ ++S +++ + Q + +KM+++Y+SFSAH D +
Sbjct: 344 WCTDKRNGVIIAGYCVEGTLAKHIMSEPEEITTMSGQKLPLKMSVDYISFSAHTDYQQTS 403
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF 269
+ I+ +P +V+LVHGE ++M LK + +E+
Sbjct: 404 EFIRALKPPHVILVHGEQNEMARLKAALIREY 435
>gi|340383473|ref|XP_003390242.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
3-like [Amphimedon queenslandica]
Length = 726
Score = 223 bits (567), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 109/274 (39%), Positives = 170/274 (62%), Gaps = 7/274 (2%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGD++ DRHL AA + PD+LI+ESTY T I + + RE F
Sbjct: 215 MFMIEIAGVKVLYTGDFSRVEDRHLMAAEVPNSSPDILISESTYGTHIHEKREQREARFT 274
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW----ERMNLQAPIYFAVGLTEKATNY 116
K+H+ V RGG LIPVFALGRAQEL ++L+ YW E ++ PIY+A L +K
Sbjct: 275 TKIHDIVTRGGHCLIPVFALGRAQELLLILDEYWSCHPELHDI--PIYYASSLAKKCMAV 332
Query: 117 YKMFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIF 175
Y+ +I N++IR+ N F FKHI D+ GP V+ A+PGM+ SGLS +F
Sbjct: 333 YQTYIGAMNERIRRQIGISNPFVFKHISSLKNIDNFDDIGPCVILASPGMMQSGLSRQLF 392
Query: 176 KKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKG 235
+ W + N +++ G+CV+GT+ +LS ++ N Q + ++M+++Y+SFSAH D +
Sbjct: 393 ESWCTDKRNGVVVAGYCVEGTLAKHILSEPSEVVTMNGQKLPLRMSVDYISFSAHTDYEQ 452
Query: 236 IMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF 269
+ I+ P +++LVHGE ++M+ LK+ + +EF
Sbjct: 453 TSEFIRILNPPHIVLVHGEQNEMMRLKQGLLREF 486
>gi|168026077|ref|XP_001765559.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162683197|gb|EDQ69609.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 682
Score = 223 bits (567), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 133/346 (38%), Positives = 190/346 (54%), Gaps = 22/346 (6%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V + ++YTGDY+ DRHL AA + + PD+ I ESTY I + RER F
Sbjct: 170 MFMVDIAGMRVLYTGDYSCEEDRHLRAAEMPRFSPDVCIIESTYGVQIHQPRIMRERRFT 229
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWE-RMNLQ-APIYFAVGLTEKATNYYK 118
V + V +GGKVLIP FALGRAQEL ++L+ YWE LQ PIY+A L +K Y+
Sbjct: 230 DTVAQTVSQGGKVLIPAFALGRAQELLLILDEYWEAHPELQHIPIYYASPLAKKCMAVYQ 289
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
+I N++I+K F N FDFKHI+P D+ GP VV A+PG L SGLS +F
Sbjct: 290 TYINAMNERIQKQFEVSNPFDFKHIQPLKNIDEFDDIGPAVVMASPGGLQSGLSRQLFDI 349
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W + N ++PG+ V+GT +++ K++ + ++ + M + Y+SFSAHAD
Sbjct: 350 WCQDKKNSCVIPGYVVEGTPAKAIMNEPKEVTLLSGLVVPLNMRVHYISFSAHADFTQTS 409
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF---NLDCFMPANGESC--FVQTDMKISI 292
+ P N++LVHGEA++M LK K+ +F N+ P N ++ F + + I
Sbjct: 410 AFLHELRPPNIILVHGEANEMGRLKAKLTTQFAEQNVKILSPKNCQTVEMFFKGE---KI 466
Query: 293 DVSVNLLKEEAVKYNSEPPNPLKERQI-HGVLVIKDSSISLMDVDE 337
+V L E+ P KE I G+LV K + LM D+
Sbjct: 467 AKAVGRLAEK----------PAKEGDIVSGLLVRKGFTYQLMAPDD 502
>gi|449498153|ref|XP_002196255.2| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation
specificity factor subunit 3 [Taeniopygia guttata]
Length = 746
Score = 223 bits (567), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 123/338 (36%), Positives = 194/338 (57%), Gaps = 23/338 (6%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGD++ DRHL AA I +PD+LI ESTY T I + + RE F
Sbjct: 226 MFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPNIKPDILIIESTYGTHIHEKREEREARFC 285
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH+ V+RGG+ LIPVFALGRAQEL ++L+ YW+ PIY+A L +K Y+
Sbjct: 286 NTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQ 345
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ N KIRK N F FKHI D+ GP VV A+PGM+ SGLS +F+
Sbjct: 346 TYVNAMNDKIRKQININNPFVFKHISNLKSMDHFDDIGPSVVMASPGMMQSGLSRELFES 405
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W + N +I+ G+CV+GT+ ++S +++ + Q + +KM+++Y+SFSAH D +
Sbjct: 406 WCTDKRNGVIIAGYCVEGTLAKHIMSEPEEITTMSGQKLPLKMSVDYISFSAHTDYQQTS 465
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF------NLDCFMPANGESCFVQTDMKIS 291
+ I+ +P +V+LVHGE ++M LK + +E+ +++ P N E+
Sbjct: 466 EFIRALKPPHVILVHGEQNEMARLKAALIREYEDNDEVHIEVHNPRNTEA---------- 515
Query: 292 IDVSVNLLKEEAVKYNSEPPN--PLKERQIHGVLVIKD 327
V++N E+ K + P + ++I G+LV ++
Sbjct: 516 --VTLNFRGEKLAKVMGSLADKKPEQGQRISGILVKRN 551
>gi|355680849|gb|AER96661.1| cleavage and polyadenylation specific factor 3, 73kDa [Mustela
putorius furo]
Length = 600
Score = 223 bits (567), Expect = 7e-55, Method: Compositional matrix adjust.
Identities = 114/290 (39%), Positives = 175/290 (60%), Gaps = 9/290 (3%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGD++ DRHL AA I +PD+LI ESTY T I + + RE F
Sbjct: 113 MFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPNIKPDILIIESTYGTHIHEKREEREARFC 172
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH+ V+RGG+ LIPVFALGRAQEL ++L+ YW+ PIY+A L +K Y+
Sbjct: 173 NTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQ 232
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ N KIRK N F FKHI D+ GP VV A+PGM+ SGLS +F+
Sbjct: 233 TYVNAMNDKIRKQININNPFVFKHISNLKSMDHFDDIGPSVVMASPGMMQSGLSRELFES 292
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W + N +I+ G+CV+GT+ ++S +++ + Q + +KM+++Y+SFSAH D +
Sbjct: 293 WCTDKRNGVIIAGYCVEGTLAKHIMSEPEEITTMSGQKLPLKMSVDYISFSAHTDYQQTS 352
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF------NLDCFMPANGES 281
+ I+ +P +V+LVHGE ++M LK + +E+ +++ P N E+
Sbjct: 353 EFIRALKPPHVVLVHGEQNEMARLKAALIREYEDNDEVHIEVHNPRNTEA 402
>gi|326916480|ref|XP_003204535.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
3-like [Meleagris gallopavo]
Length = 759
Score = 222 bits (566), Expect = 8e-55, Method: Compositional matrix adjust.
Identities = 123/338 (36%), Positives = 194/338 (57%), Gaps = 23/338 (6%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGD++ DRHL AA I +PD+LI ESTY T I + + RE F
Sbjct: 238 MFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPNIKPDILIIESTYGTHIHEKREEREARFC 297
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH+ V+RGG+ LIPVFALGRAQEL ++L+ YW+ PIY+A L +K Y+
Sbjct: 298 NTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQ 357
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ N KIRK N F FKHI D+ GP VV A+PGM+ SGLS +F+
Sbjct: 358 TYVNAMNDKIRKQININNPFVFKHISNLKSMDHFDDIGPSVVMASPGMMQSGLSRELFES 417
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W + N +I+ G+CV+GT+ ++S +++ + Q + +KM+++Y+SFSAH D +
Sbjct: 418 WCTDKRNGVIIAGYCVEGTLAKHIMSEPEEITTMSGQKLPLKMSVDYISFSAHTDYQQTS 477
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF------NLDCFMPANGESCFVQTDMKIS 291
+ I+ +P +V+LVHGE ++M LK + +E+ +++ P N E+
Sbjct: 478 EFIRALKPPHVILVHGEQNEMARLKAALIREYEDNDEVHIEVHNPRNTEA---------- 527
Query: 292 IDVSVNLLKEEAVKYNSEPPN--PLKERQIHGVLVIKD 327
V++N E+ K + P + ++I G+LV ++
Sbjct: 528 --VTLNFRGEKLAKVMGSLADKKPEQGQRISGILVKRN 563
>gi|6625904|gb|AAF19420.1|AF203969_1 cleavage and polyadenylation specificity factor 73 kDa subunit [Mus
musculus]
Length = 684
Score = 222 bits (566), Expect = 8e-55, Method: Compositional matrix adjust.
Identities = 130/385 (33%), Positives = 211/385 (54%), Gaps = 29/385 (7%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGD++ DRHL AA I +PD+LI ESTY T I + + RE F
Sbjct: 164 MFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPNIKPDILIIESTYGTHIHEKREEREARFW 223
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH+ V+RGG+ LIPVFALGRAQEL ++L+ YW+ PIY+A L +K Y+
Sbjct: 224 HTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQ 283
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ N KIRK N F FKHI D+ GP VV A+PGM+ +GLS +F+
Sbjct: 284 TYVNAMNDKIRKQININNPFVFKHISNLKSMDHFDDIGPSVVMASPGMIQNGLSRELFES 343
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W + N +I+ G+CV+GT+ ++S +++ + Q + +KM+++Y+SFSAH D +
Sbjct: 344 WCTDKRNGVIIAGYCVEGTLAKHIMSEPEEITTMSGQKLPLKMSVDYISFSAHTDYQQTS 403
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF------NLDCFMPANGESCFVQTDMKIS 291
+ I+ +P +V+LVHGE ++M LK + +E+ +++ P N E+
Sbjct: 404 EFIRALKPPHVILVHGEQNEMARLKAALIREYEDNDEVHIEVHNPRNTEA---------- 453
Query: 292 IDVSVNLLKEEAVKYNS--EPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIV 349
V++N E+ K P + +++ G+LV ++ + ++ C +S +
Sbjct: 454 --VTLNFRGEKLAKVMGFLADKKPEQGQRVSGILVKRNFNYHIL---SPCD---LSNYTD 505
Query: 350 RFTSNVQISDSSPINKTLELIYDQL 374
S V+ + + P L+Y QL
Sbjct: 506 LAMSTVKQTQAIPYTGPFYLLYYQL 530
>gi|74211665|dbj|BAE29190.1| unnamed protein product [Mus musculus]
Length = 684
Score = 222 bits (565), Expect = 9e-55, Method: Compositional matrix adjust.
Identities = 134/399 (33%), Positives = 217/399 (54%), Gaps = 30/399 (7%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGD++ DRHL AA I +PD+LI ESTY T I + + RE F
Sbjct: 164 MFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPNIKPDILIIESTYRTHIHEKREEREARFC 223
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH+ V+RGG+ LIPVFALGRAQEL ++L+ YW+ PIY+A L +K Y+
Sbjct: 224 NTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQ 283
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ N KIRK N F FKHI D+ GP VV A+PGM+ +GLS +F+
Sbjct: 284 TYVNAMNDKIRKQININNPFVFKHISNLKSMDHFDDIGPSVVMASPGMIQNGLSRELFES 343
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W + N +I+ G+CV+GT+ ++S +++ + Q + +KM+++Y+SFSAH D +
Sbjct: 344 WCTDKRNGVIIAGYCVEGTLAKHIMSEPEEITTMSGQKLPLKMSVDYISFSAHTDYQQTS 403
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF------NLDCFMPANGESCFVQTDMKIS 291
+ I+ +P +V+LVHGE ++M LK + +E+ +++ P N E+
Sbjct: 404 EFIRALKPPHVILVHGEQNEMARLKAALIREYEDNDEVHIEVHNPRNTEA---------- 453
Query: 292 IDVSVNLLKEEAVKYNS--EPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIV 349
V++N E+ K P + +++ G+LV ++ + ++ C +S +
Sbjct: 454 --VTLNFRGEKLAKVMGFLADKKPEQGQRVSGILVKRNFNYHILS---PCD---LSNYTD 505
Query: 350 RFTSNVQISDSSPINKTLELIYDQLLSYLQDKSQEYKIQ 388
S V+ + + P L+Y QL D +E +IQ
Sbjct: 506 LAMSTVKQTQAIPYTGPFYLLYYQLQKLTGD-VEELEIQ 543
>gi|260815130|ref|XP_002602327.1| hypothetical protein BRAFLDRAFT_282200 [Branchiostoma floridae]
gi|229287635|gb|EEN58339.1| hypothetical protein BRAFLDRAFT_282200 [Branchiostoma floridae]
Length = 687
Score = 222 bits (565), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 115/290 (39%), Positives = 171/290 (58%), Gaps = 9/290 (3%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ I+YTGD++ DRHL AA + PD+LI E+TY T I + + RE F
Sbjct: 171 MFMIEIAGVKILYTGDFSRQEDRHLMAAEVPAIHPDVLIIEATYGTHIHEKREEREARFT 230
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH+ V+RGG+ LIPVFALGRAQEL ++L+ YW PIY+A L +K Y+
Sbjct: 231 STVHDIVNRGGRCLIPVFALGRAQELLLILDEYWSNHPELHDIPIYYASSLAKKCMAVYQ 290
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
+I N+KIRK N F FKHI D+ GP VV A+PGM+ SGLS +F+
Sbjct: 291 TYINAMNEKIRKQISVSNPFVFKHISNLKGMDHFDDIGPSVVMASPGMMQSGLSRELFES 350
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W N I+ G+CV+GT+ ++S +++ + Q I +K +++Y+SFSAH D +
Sbjct: 351 WCTDRRNGCIIAGYCVEGTLAKHIMSEPEEITTMSGQKIPLKCSVDYISFSAHTDYQQTS 410
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF------NLDCFMPANGES 281
I+ +P +V+LVHGEA++M LK + +E+ N++ P N ++
Sbjct: 411 DFIRQLKPPHVVLVHGEANEMSRLKAAVIREYEDDPDVNIEVHNPKNTQA 460
>gi|74178650|dbj|BAE33998.1| unnamed protein product [Mus musculus]
Length = 684
Score = 222 bits (565), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 133/399 (33%), Positives = 217/399 (54%), Gaps = 30/399 (7%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGD++ DRHL AA I +PD+LI ESTY T I + + RE F
Sbjct: 164 MFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPNIKPDILIIESTYGTHIHEKREEREARFC 223
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH+ V+RGG+ LIPVFALGRAQEL ++L+ YW+ PIY+A L +K Y+
Sbjct: 224 NTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQ 283
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ N KIRK N F FKHI D+ GP VV A+PGM+ +GLS +F+
Sbjct: 284 TYVNAMNDKIRKQININNPFVFKHISNLKSMDHFDDIGPSVVMASPGMIQNGLSRELFES 343
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W + N +I+ G+CV+GT+ ++S +++ + Q + +KM+++Y+SFSAH D +
Sbjct: 344 WCTDKRNGVIIAGYCVEGTLAKHIMSEPEEITTMSGQKLPLKMSVDYISFSAHTDYQQTS 403
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF------NLDCFMPANGESCFVQTDMKIS 291
+ I+ +P +V+LVHGE ++M L+ + +E+ +++ P N E+
Sbjct: 404 EFIRALKPPHVILVHGEQNEMARLRAALIREYEDNDEVHIEVHNPRNTEA---------- 453
Query: 292 IDVSVNLLKEEAVKYNS--EPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIV 349
V++N E+ K P + +++ G+LV ++ + ++ C +S +
Sbjct: 454 --VTLNFRGEKLAKVMGFLADKKPEQGQRVSGILVKRNFNYHILS---PCD---LSNYTD 505
Query: 350 RFTSNVQISDSSPINKTLELIYDQLLSYLQDKSQEYKIQ 388
S V+ + + P L+Y QL D +E +IQ
Sbjct: 506 LAMSTVKQTQAIPYTGPFYLLYYQLQKLTGD-VEELEIQ 543
>gi|149050992|gb|EDM03165.1| cleavage and polyadenylation specificity factor 3, isoform CRA_b
[Rattus norvegicus]
Length = 605
Score = 222 bits (565), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 134/399 (33%), Positives = 217/399 (54%), Gaps = 30/399 (7%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGD++ DRHL AA I +PD+LI ESTY T I + + RE F
Sbjct: 164 MFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPNIKPDILIIESTYGTHIHEKREEREARFC 223
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH+ V+RGG+ LIPVFALGRAQEL ++L+ YW+ PIY+A L +K Y+
Sbjct: 224 NTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQ 283
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ N KIRK N F FKHI D+ GP VV A+PGM+ +GLS +F+
Sbjct: 284 TYVNAMNDKIRKQININNPFVFKHISNLKSMDHFDDIGPSVVMASPGMIQNGLSRELFES 343
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W + N +I+ G+CV+GT+ ++S +++ + Q + +KM+++Y+SFSAH D +
Sbjct: 344 WCTDKRNGVIIAGYCVEGTLAKHIMSEPEEITTMSGQKLPLKMSVDYISFSAHTDYQQTS 403
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF------NLDCFMPANGESCFVQTDMKIS 291
+ I+ +P +V+LVHGE ++M LK + +E+ +++ P N E+
Sbjct: 404 EFIRALKPPHVILVHGEQNEMARLKAALIREYEDNDEVHIEVHNPRNTEA---------- 453
Query: 292 IDVSVNLLKEEAVKYNS--EPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIV 349
V++N E+ K P + +++ G+LV ++ + ++ C +S +
Sbjct: 454 --VTLNFRGEKLAKVMGFLADKKPEQGQRVSGILVKRNFNYHILS---PCD---LSNYTD 505
Query: 350 RFTSNVQISDSSPINKTLELIYDQLLSYLQDKSQEYKIQ 388
S V+ + + P L+Y QL D +E +IQ
Sbjct: 506 LAMSTVKQTQAIPYTGPFYLLYYQLQKLTGD-VEELEIQ 543
>gi|242032211|ref|XP_002463500.1| hypothetical protein SORBIDRAFT_01g000850 [Sorghum bicolor]
gi|241917354|gb|EER90498.1| hypothetical protein SORBIDRAFT_01g000850 [Sorghum bicolor]
Length = 695
Score = 222 bits (565), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 114/287 (39%), Positives = 168/287 (58%), Gaps = 8/287 (2%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V + I+YTGDY+ DRHL AA + + PD+ I ESTY + RE+ F
Sbjct: 180 MFMVDIAGVRILYTGDYSREEDRHLRAAELPQFSPDICIIESTYGVQQHQPRIVREKRFT 239
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
+ +H V +GG+VLIP FALGRAQEL ++L+ YW + + PIY+A L ++ Y+
Sbjct: 240 EVIHNTVSQGGRVLIPAFALGRAQELLLILDEYWSKHPELHKIPIYYASPLAKRCMAVYQ 299
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDN---PGPMVVFATPGMLHSGLSLIIF 175
+I N++IR F Q N F FKHI + IDN GP VV A+PG L SGLS +F
Sbjct: 300 TYINSMNERIRNQFAQSNPFHFKHIESLNS--IDNFHDVGPSVVMASPGGLQSGLSRQLF 357
Query: 176 KKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKG 235
KW + N ++PG+ V+GT+ +++ +++ N + M++ Y+SFSAHAD
Sbjct: 358 DKWCTDKKNACVIPGYVVEGTLAKTIINEPREVTLANGLTAPLHMSVHYISFSAHADFPQ 417
Query: 236 IMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFN-LDCFMPANGES 281
+ P N++LVHGEA++M LK+K+K +F+ + P N +S
Sbjct: 418 TSNFLDELRPPNIILVHGEANEMSRLKQKLKTQFDGTNIVSPKNCQS 464
>gi|402594378|gb|EJW88304.1| cleavage and polyadenylation specificity factor subunit 3
[Wuchereria bancrofti]
Length = 694
Score = 222 bits (565), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 128/362 (35%), Positives = 197/362 (54%), Gaps = 27/362 (7%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ I+YTGD++ DRHL AA + PD+LI ESTY T + +S+ RE+ F
Sbjct: 162 MFMIEIAGVRILYTGDFSRLEDRHLCAAELPTVSPDVLICESTYGTQVHESRDEREKRFT 221
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERM-NLQ-APIYFAVGLTEKATNYYK 118
VHE V RGG+ LIP FALGRAQEL ++L+ YWE LQ P+Y+A L +K Y+
Sbjct: 222 SIVHEIVGRGGRCLIPAFALGRAQELLLILDEYWESHPELQDIPVYYASSLAKKCMAVYQ 281
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
F++ N +I+K N F FKH+ ++ GP VV A+PGML +GLS +F+
Sbjct: 282 TFVSGMNSRIQKQIALNNPFVFKHVSNLKSIDHFEDVGPCVVLASPGMLQNGLSRELFEN 341
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W N I+ G+CV+GT+ +LS +++ N Q + +++ + Y+SFSAH D
Sbjct: 342 WCTDSKNGCIIAGYCVEGTLAKHILSEPEEIVAMNGQKLAMRLQVAYISFSAHTDYTQTS 401
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKI------KQEFNLDCFMPANGESCFVQTDMKIS 291
I+ P +++LVHGE ++M LK I + +F+++ + P N ES
Sbjct: 402 DFIRALRPPHLVLVHGEMNEMNRLKAAIIRQYEDESDFHIEVYNPRNTES---------- 451
Query: 292 IDVSVNLLKEEAVKYNSE----PPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRH 347
V ++ E+ K + PP R + GVL+ ++ + LM D+ +S
Sbjct: 452 --VELHFRGEKTAKVVGKMAMTPPG--DGRVLSGVLIRRNFNYHLMHADDLSAYTDLSNS 507
Query: 348 IV 349
I+
Sbjct: 508 IL 509
>gi|224140919|ref|XP_002323824.1| predicted protein [Populus trichocarpa]
gi|222866826|gb|EEF03957.1| predicted protein [Populus trichocarpa]
Length = 699
Score = 221 bits (564), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 116/290 (40%), Positives = 166/290 (57%), Gaps = 12/290 (4%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V + ++YTGDY+ DRHL AA + + PD+ I ESTY + + RE+ F
Sbjct: 177 MFMVDIAGVRVLYTGDYSREEDRHLRAAEMPQFSPDICIIESTYGVQLHQPRHIREKRFT 236
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW----ERMNLQAPIYFAVGLTEKATNY 116
+H + GG+VLIP FALGRAQEL ++L+ YW E N+ P+Y+A L +K
Sbjct: 237 DVIHSTISLGGRVLIPAFALGRAQELLLILDEYWSNHPELHNI--PVYYASPLAKKCMTV 294
Query: 117 YKMFITWTNQKIRKTFVQRNMFDFKHIRPFD--KSFIDNPGPMVVFATPGMLHSGLSLII 174
Y+ +I N++IR F N F FKHI P + + F D GP VV ATPG L SGLS +
Sbjct: 295 YQTYILSMNERIRNQFADSNPFKFKHISPLNSIEDFTD-VGPSVVMATPGGLQSGLSRQL 353
Query: 175 FKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAK 234
F W + N ++PGF V+GT+ +++ K+++ N + M + Y+SFSAHAD
Sbjct: 354 FDMWCSDKKNACVIPGFLVEGTLAKTIINEPKEVQLMNGLTAPLNMQVHYISFSAHADYA 413
Query: 235 GIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF---NLDCFMPANGES 281
++ P N++LVHGEA++M LK+K+ EF N P N +S
Sbjct: 414 QTSTFLKELMPPNIILVHGEANEMGRLKQKLITEFTDGNTKIITPKNCQS 463
>gi|170587204|ref|XP_001898368.1| cpsf3-prov protein [Brugia malayi]
gi|158594194|gb|EDP32780.1| cpsf3-prov protein, putative [Brugia malayi]
Length = 700
Score = 221 bits (564), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 126/358 (35%), Positives = 199/358 (55%), Gaps = 19/358 (5%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ I+YTGD++ DRHL AA + PD+LI ESTY T + +S+ RE+ F
Sbjct: 168 MFMIEIAGVRILYTGDFSRLEDRHLCAAELPTVSPDVLICESTYGTQVHESRDEREKRFT 227
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERM-NLQ-APIYFAVGLTEKATNYYK 118
VHE V RGG+ LIP FALGRAQEL ++L+ YWE LQ P+Y+A L +K Y+
Sbjct: 228 SIVHEIVGRGGRCLIPAFALGRAQELLLILDEYWESHPELQDIPVYYASSLAKKCMAVYQ 287
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
F++ N +I+K N F FKH+ ++ GP VV A+PGML +GLS +F+
Sbjct: 288 TFVSGMNSRIQKQIALNNPFVFKHVSNLKSIDHFEDVGPCVVLASPGMLQNGLSRELFEN 347
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W N I+ G+CV+GT+ +LS +++ N Q + +++ + Y+SFSAH D
Sbjct: 348 WCTDSKNGCIIAGYCVEGTLAKHILSEPEEIVAMNGQKLAMRLQVAYISFSAHTDYTQTS 407
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKI------KQEFNLDCFMPANGESCFVQTDMKIS 291
I+ P +++LVHGE ++M LK I + +F+++ + P N ES ++
Sbjct: 408 DFIRALRPPHLVLVHGEMNEMNRLKAAIIRQYEDESDFHIEVYNPRNTES----VELHFR 463
Query: 292 IDVSVNLLKEEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIV 349
+ + ++ + A+ PP R + GVL+ ++ + LM D+ +S I+
Sbjct: 464 GEKTAKVVGKMAMT----PPG--DGRVLSGVLIRRNFNYHLMHADDLSAYTELSNSIL 515
>gi|412990885|emb|CCO18257.1| predicted protein [Bathycoccus prasinos]
Length = 825
Score = 221 bits (564), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 121/335 (36%), Positives = 189/335 (56%), Gaps = 18/335 (5%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFLK 61
F+V VG ++YTGDY+ DRHL AA I K P ++I ESTY + K RE F
Sbjct: 217 FEVNVGGCRVLYTGDYSRVADRHLPAADIPKKTPHVVIVESTYGVSPHTPKEEREARFTD 276
Query: 62 KVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYKM 119
K+H + RGGK L+PV ALGRAQEL ++LE YWE+ P+Y A L KA ++
Sbjct: 277 KIHGILGRGGKCLLPVVALGRAQELLLILEDYWEKHPEMSHVPVYQASALARKAMTVFET 336
Query: 120 FITWTNQKIRKTFVQRNMFDFKHIRPFDKS--FIDNPGPMVVFATPGMLHSGLSLIIFKK 177
+I N I++ F ++N F+FKH++ +++ N GP VV ATP ML SG S +F+
Sbjct: 337 YINVLNADIKRQFEEKNPFNFKHVQSLNRASDLDGNTGPCVVLATPSMLQSGTSRELFEN 396
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W +N +++ F VQGT+ ++LS VK ++ + + + ++ +++ +SFSAHAD
Sbjct: 397 WCESSDNGVVICDFAVQGTLAREILSDVKTVKARDGRELQLRCSVDAISFSAHADYPQTQ 456
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF-----NLDCFMPANGESCFVQTDMKISI 292
Q + P +V+LVHGE S+M LK ++ + ++ F P N +S ++ +
Sbjct: 457 QFLDILNPPHVVLVHGETSEMARLKRALEAKAVADGKEMNVFAPKNCQSVEIRYRGEKVA 516
Query: 293 DVSVNLLKEEAVKYNSEPPNPLKERQIHGVLVIKD 327
V+ +L + P++ +++ GVLV KD
Sbjct: 517 KVAGSLASDCL---------PVEGQEVRGVLVAKD 542
>gi|126030713|pdb|2I7T|A Chain A, Structure Of Human Cpsf-73
gi|126030714|pdb|2I7V|A Chain A, Structure Of Human Cpsf-73
Length = 459
Score = 221 bits (564), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 111/272 (40%), Positives = 168/272 (61%), Gaps = 3/272 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGD++ DRHL AA I +PD+LI ESTY T I + + RE F
Sbjct: 164 MFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPNIKPDILIIESTYGTHIHEKREEREARFC 223
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH+ V+RGG+ LIPVFALGRAQEL ++L+ YW+ PIY+A L +K Y+
Sbjct: 224 NTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQ 283
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ N KIRK N F FKHI D+ GP VV A+PGM+ SGLS +F+
Sbjct: 284 TYVNAMNDKIRKQININNPFVFKHISNLKSMDHFDDIGPSVVMASPGMMQSGLSRELFES 343
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W + N +I+ G+CV+GT+ ++S +++ + Q + +KM+++Y+SFSAH D +
Sbjct: 344 WCTDKRNGVIIAGYCVEGTLAKHIMSEPEEITTMSGQKLPLKMSVDYISFSAHTDYQQTS 403
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF 269
+ I+ +P +V+LVHGE ++M LK + +E+
Sbjct: 404 EFIRALKPPHVILVHGEQNEMARLKAALIREY 435
>gi|303391080|ref|XP_003073770.1| putative beta-lactamase fold-containing exonuclease
[Encephalitozoon intestinalis ATCC 50506]
gi|303302918|gb|ADM12410.1| putative beta-lactamase fold-containing exonuclease
[Encephalitozoon intestinalis ATCC 50506]
Length = 696
Score = 221 bits (564), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 117/289 (40%), Positives = 174/289 (60%), Gaps = 10/289 (3%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V++ I+YTGD++ DRHL AA C+ D LITESTY + RE F
Sbjct: 159 MFLVEIEKSKILYTGDFSREEDRHLKAAESPGCKIDALITESTYGVQCHLPRSEREGRFT 218
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWE-RMNLQA-PIYFAVGLTEKATNYYK 118
V V RGG+ L+PVFALGRAQEL ++LE +W +LQ PIY+A L ++ Y+
Sbjct: 219 SIVQNVVQRGGRCLLPVFALGRAQELLLILEEHWSSNASLQKIPIYYASALAKRCMGVYQ 278
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFD--KSFIDNPGPMVVFATPGMLHSGLSLIIFK 176
+I N++I+K + RN F FK+++ SF D+ GP V+ A+PGML SGLS +F+
Sbjct: 279 TYIGMMNERIQKLSLVRNPFAFKYVKNLKGIDSF-DDEGPCVIMASPGMLQSGLSRDLFE 337
Query: 177 KWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGI 236
+W N +I+PG+CV GT+ ++LS K++E N + + + M++EY+SFSAH D
Sbjct: 338 RWCSDSKNAVIIPGYCVDGTLAKEILSEPKEIEALNGKRLRLNMSVEYISFSAHVDFTQN 397
Query: 237 MQLIQYCEPKNVLLVHGEASKMVFLKEKIKQE-----FNLDCFMPANGE 280
Q I+ C+P+++ VHGE ++M L+ I+Q ++ + NGE
Sbjct: 398 SQFIEECQPRHLFFVHGEMNEMQRLRNVIQQRNEKKGIEMELYTLRNGE 446
>gi|170060909|ref|XP_001866010.1| cleavage and polyadenylation specificity factor [Culex
quinquefasciatus]
gi|167879247|gb|EDS42630.1| cleavage and polyadenylation specificity factor [Culex
quinquefasciatus]
Length = 688
Score = 221 bits (564), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 127/352 (36%), Positives = 197/352 (55%), Gaps = 7/352 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ I+YTGD++ DRHL AA I +PD+LITESTY T I + + RE F
Sbjct: 171 MFMIEIAGVKILYTGDFSRQEDRHLMAAEIPTMKPDVLITESTYGTHIHEKREDRESRFT 230
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW-ERMNLQA-PIYFAVGLTEKATNYYK 118
V + V +GG+ LIPVFALGRAQEL ++L+ YW + LQ PIY+A L +K Y+
Sbjct: 231 SLVQKIVQQGGRCLIPVFALGRAQELLLILDEYWSQNPELQEIPIYYASSLAKKCMAVYQ 290
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
+I N KIR+ N F F+HI ++ GP VV A+PGM+ SGLS +F+
Sbjct: 291 TYINAMNDKIRRQIAVNNPFVFRHISNLKGIDHFEDIGPCVVMASPGMMQSGLSRELFET 350
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W N +I+ G+CV+GT+ VLS +++ + Q + + M+++Y+SFSAH D +
Sbjct: 351 WCSDPKNGVIIAGYCVEGTLAKTVLSEPEEITSMSGQKLRLNMSVDYISFSAHTDYQQTS 410
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVN 297
+ I+ +P +V+LVHGE ++M LK +++E+ D N ++++
Sbjct: 411 EFIRLLKPTHVVLVHGEQNEMNRLKSALQREYEND----PNANITLHNPRNTHAVELYFR 466
Query: 298 LLKEEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIV 349
K V N +P + +++ GVLV +D L+ + K +S +V
Sbjct: 467 GEKTAKVMGNLAVKSPEEGQKLSGVLVKRDFKYHLLAASDLSKYTDMSMSVV 518
>gi|348518441|ref|XP_003446740.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
3-like [Oreochromis niloticus]
Length = 686
Score = 221 bits (564), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 111/272 (40%), Positives = 165/272 (60%), Gaps = 3/272 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGD++ DRHL AA I +PD+LITESTY T I + + RE F
Sbjct: 171 MFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPSVKPDILITESTYGTHIHEKREEREARFC 230
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH+ V+R G+ LIPVFALGRAQEL ++L+ YW+ PIY+A L K Y+
Sbjct: 231 NTVHDIVNREGRCLIPVFALGRAQELLLILDEYWQNHPELHDIPIYYASSLARKCMAVYQ 290
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
+I N KIRK N F FKHI D+ GP VV A+PGM+ SGLS +F+
Sbjct: 291 TYINAMNDKIRKAININNPFVFKHISNLKSMDHFDDIGPSVVMASPGMMQSGLSRELFES 350
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W N +I+ G+CV+GT+ +++ ++ + Q + +KM+++Y+SFSAH D +
Sbjct: 351 WCTDRRNGVIIAGYCVEGTLAKHIMTEPDEITTMSGQKLPLKMSVDYISFSAHTDYQQTS 410
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF 269
+ I+ +P +V+LVHGE ++M LK + +E+
Sbjct: 411 EFIRALKPPHVILVHGEQNEMARLKAALIREY 442
>gi|410898094|ref|XP_003962533.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
3-like [Takifugu rubripes]
Length = 691
Score = 221 bits (564), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 110/272 (40%), Positives = 167/272 (61%), Gaps = 3/272 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGD++ DRHL AA I +PD+LI ESTY T I + + RE F
Sbjct: 171 MFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPSVKPDILIIESTYGTHIHEKREEREARFC 230
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH+ V+R G+ LIPVFALGRAQEL ++L+ YW+ PIY+A L +K Y+
Sbjct: 231 NTVHDIVNREGRCLIPVFALGRAQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQ 290
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ N KIRK N F FKHI D+ GP VV A+PGM+ SGLS +F+
Sbjct: 291 TYVNAMNDKIRKAININNPFVFKHISNLKSMDHFDDIGPSVVMASPGMMQSGLSRELFES 350
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W + N +I+ G+CV+GT+ ++S +++ + Q + +KM+++Y+SFSAH D +
Sbjct: 351 WCTDKRNGVIIAGYCVEGTLAKHIMSEPEEITTMSGQKLQLKMSVDYISFSAHTDYQQTS 410
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF 269
+ I+ +P +V+LVHGE ++M LK + +E+
Sbjct: 411 EFIRALKPPHVILVHGEQNEMARLKAALIREY 442
>gi|348531581|ref|XP_003453287.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
3-like [Oreochromis niloticus]
Length = 690
Score = 221 bits (563), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 110/272 (40%), Positives = 167/272 (61%), Gaps = 3/272 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGD++ DRHL AA I +PD+LI ESTY T I + + RE F
Sbjct: 171 MFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPSVKPDILIIESTYGTHIHEKREEREARFC 230
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH+ V+R G+ LIPVFALGRAQEL ++L+ YW+ PIY+A L +K Y+
Sbjct: 231 NTVHDIVNREGRCLIPVFALGRAQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQ 290
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ N KIRK N F FKHI D+ GP VV A+PGM+ SGLS +F+
Sbjct: 291 TYVNAMNDKIRKAININNPFVFKHISNLKSMDHFDDIGPSVVMASPGMMQSGLSRELFES 350
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W + N +I+ G+CV+GT+ ++S +++ + Q + +KM+++Y+SFSAH D +
Sbjct: 351 WCTDKRNGVIIAGYCVEGTLAKHIMSEPEEITTMSGQKLQLKMSVDYISFSAHTDYQQTS 410
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF 269
+ I+ +P +V+LVHGE ++M LK + +E+
Sbjct: 411 EFIRALKPPHVILVHGEQNEMARLKAALIREY 442
>gi|168007963|ref|XP_001756677.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162692273|gb|EDQ78631.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 682
Score = 221 bits (563), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 118/287 (41%), Positives = 166/287 (57%), Gaps = 6/287 (2%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V + ++YTGDY+ DRHL AA + PD+ I ESTY I + RER F
Sbjct: 170 MFMVDIAGMRVLYTGDYSCEEDRHLRAAEMPHFSPDVCIIESTYGVQIHQPRIMRERRFT 229
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWE-RMNLQ-APIYFAVGLTEKATNYYK 118
V + V +GGKVLIP FALGRAQEL ++L+ YWE LQ PIY+A L +K Y+
Sbjct: 230 DTVAQTVSQGGKVLIPAFALGRAQELLLILDEYWEAHPELQHIPIYYASPLAKKCMAVYQ 289
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
+I N +I+K F N FDFKHI+P D+ GP VV A+PG L SGLS +F
Sbjct: 290 TYINAMNDRIQKQFEVSNPFDFKHIQPLKNIDGFDDIGPAVVMASPGGLQSGLSRQLFDI 349
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W + N I+PG+ V+GT+ +++ K++ + ++ + M + Y+SFSAHAD
Sbjct: 350 WCQDKKNSCIIPGYVVEGTLAKAIMNEPKEVTLLSGLVVPLNMRVHYISFSAHADFTQTN 409
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF---NLDCFMPANGES 281
+ P N++LVHGEA++M LK K+ +F N+ P N ++
Sbjct: 410 AFLHELRPPNIILVHGEANEMGRLKAKLITQFAEQNVKILSPKNCQT 456
>gi|392512873|emb|CAD25809.2| similarity to HYPOTHETICAL PROTEIN Y162_METJA [Encephalitozoon
cuniculi GB-M1]
Length = 643
Score = 221 bits (563), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 116/271 (42%), Positives = 167/271 (61%), Gaps = 5/271 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V++ I+YTGD++ DRHL AA C+ D LITESTY + RE F
Sbjct: 159 MFLVEIEKSKILYTGDFSREEDRHLKAAESPGCKIDALITESTYGVQCHLPRAEREGRFT 218
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW-ERMNLQA-PIYFAVGLTEKATNYYK 118
V V RGG+ L+PVFALGRAQEL ++LE +W +LQ PIY+A L ++ Y+
Sbjct: 219 SIVQNVVQRGGRCLLPVFALGRAQELLLILEEHWGSNTSLQKIPIYYASALAKRCMGVYQ 278
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK--SFIDNPGPMVVFATPGMLHSGLSLIIFK 176
+I N++I+K + RN F FK+++ SF D+ GP V+ A+PGML SGLS +F+
Sbjct: 279 TYIGMMNERIQKLSLVRNPFAFKYVKNLKGIDSF-DDEGPCVIMASPGMLQSGLSRDLFE 337
Query: 177 KWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGI 236
+W N +I+PG+CV GT+ ++LS K++E N + + + M++EY+SFSAH D
Sbjct: 338 RWCSDSKNAVIIPGYCVDGTLAKEILSEPKEIEAMNGKKLRLNMSVEYISFSAHVDFTQN 397
Query: 237 MQLIQYCEPKNVLLVHGEASKMVFLKEKIKQ 267
Q I+ C+PK + VHGE ++M LK I+Q
Sbjct: 398 SQFIEECQPKYLFFVHGEMNEMQRLKNVIQQ 428
>gi|195995883|ref|XP_002107810.1| hypothetical protein TRIADDRAFT_19764 [Trichoplax adhaerens]
gi|190588586|gb|EDV28608.1| hypothetical protein TRIADDRAFT_19764 [Trichoplax adhaerens]
Length = 636
Score = 221 bits (563), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 109/272 (40%), Positives = 167/272 (61%), Gaps = 3/272 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ I+YTGD++ DRHL AA I +PD+LI ESTY I + + RE+ F
Sbjct: 175 MFMIEIAGVKILYTGDFSRQEDRHLMAAEIPSVKPDVLIIESTYGVHIHEKREIREKRFT 234
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH+ V+RGG+ LIPVFALGRAQEL ++L+ YW PIY+A L +K Y+
Sbjct: 235 STVHDIVNRGGRCLIPVFALGRAQELLLILDEYWSNHTELHDIPIYYASSLAKKCMAVYQ 294
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
+++ N KIR N F FKHI D+ GP VV A+PGM+ SGLS +F+K
Sbjct: 295 TYVSAMNDKIRNQIAISNPFIFKHISNLKGIDHFDDIGPCVVMASPGMMQSGLSRELFEK 354
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W N +++ G+CV+GT+ +V+S +++ + Q + K++++Y+SFSAH D +
Sbjct: 355 WCTDSKNGVVIAGYCVEGTLAKEVMSEPEEVLSMSGQKLARKLSVDYISFSAHTDYEQTS 414
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF 269
+ I+ P +++LVHGE ++M+ LK + +E+
Sbjct: 415 EFIRMLMPPHIILVHGEQNEMMRLKTALVREY 446
>gi|18377654|gb|AAL66977.1| putative cleavage and polyadenylation specificity factor
[Arabidopsis thaliana]
Length = 693
Score = 221 bits (563), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 116/287 (40%), Positives = 162/287 (56%), Gaps = 6/287 (2%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V + I+YTGDY+ DRHL AA + + PD+ I EST + S+ RE+ F
Sbjct: 174 MFMVDIAGVRILYTGDYSREEDRHLRAAELPQFSPDICIIESTSGVQLHQSRHIREKRFT 233
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
+H V +GG+VLIP FALGRAQEL ++L+ YW PIY+A L +K Y+
Sbjct: 234 DVIHSTVAQGGRVLIPAFALGRAQELLLILDEYWANHPDLHNIPIYYASPLAKKCMAVYQ 293
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
+I N +IR F N F FKHI P + ++ GP VV ATPG L SGLS +F
Sbjct: 294 TYILSMNDRIRNQFANSNPFVFKHISPLNSIDDFNDVGPSVVMATPGGLQSGLSRQLFDS 353
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W + N I+PG+ V+GT+ +++ K++ N + M + Y+SFSAHAD
Sbjct: 354 WCSDKKNACIIPGYMVEGTLAKTIINEPKEVTLMNGLTAPLNMQVHYISFSAHADYAQTS 413
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF---NLDCFMPANGES 281
++ P N++LVHGEA++M+ LK+K+ EF N P N ES
Sbjct: 414 TFLKELMPPNIILVHGEANEMMRLKQKLLTEFPDGNTKIMTPKNCES 460
>gi|19074699|ref|NP_586205.1| similarity to HYPOTHETICAL PROTEIN Y162_METJA [Encephalitozoon
cuniculi GB-M1]
Length = 730
Score = 221 bits (562), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 116/271 (42%), Positives = 167/271 (61%), Gaps = 5/271 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V++ I+YTGD++ DRHL AA C+ D LITESTY + RE F
Sbjct: 246 MFLVEIEKSKILYTGDFSREEDRHLKAAESPGCKIDALITESTYGVQCHLPRAEREGRFT 305
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW-ERMNLQA-PIYFAVGLTEKATNYYK 118
V V RGG+ L+PVFALGRAQEL ++LE +W +LQ PIY+A L ++ Y+
Sbjct: 306 SIVQNVVQRGGRCLLPVFALGRAQELLLILEEHWGSNTSLQKIPIYYASALAKRCMGVYQ 365
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK--SFIDNPGPMVVFATPGMLHSGLSLIIFK 176
+I N++I+K + RN F FK+++ SF D+ GP V+ A+PGML SGLS +F+
Sbjct: 366 TYIGMMNERIQKLSLVRNPFAFKYVKNLKGIDSF-DDEGPCVIMASPGMLQSGLSRDLFE 424
Query: 177 KWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGI 236
+W N +I+PG+CV GT+ ++LS K++E N + + + M++EY+SFSAH D
Sbjct: 425 RWCSDSKNAVIIPGYCVDGTLAKEILSEPKEIEAMNGKKLRLNMSVEYISFSAHVDFTQN 484
Query: 237 MQLIQYCEPKNVLLVHGEASKMVFLKEKIKQ 267
Q I+ C+PK + VHGE ++M LK I+Q
Sbjct: 485 SQFIEECQPKYLFFVHGEMNEMQRLKNVIQQ 515
>gi|15219848|ref|NP_176297.1| cleavage and polyadenylation specificity factor subunit 3-I
[Arabidopsis thaliana]
gi|30696512|ref|NP_849835.1| cleavage and polyadenylation specificity factor subunit 3-I
[Arabidopsis thaliana]
gi|79320389|ref|NP_001031215.1| cleavage and polyadenylation specificity factor subunit 3-I
[Arabidopsis thaliana]
gi|75262219|sp|Q9C952.1|CPSF3_ARATH RecName: Full=Cleavage and polyadenylation specificity factor
subunit 3-I; AltName: Full=Cleavage and polyadenylation
specificity factor 73 kDa subunit I; Short=AtCPSF73-I;
Short=CPSF 73 kDa subunit I
gi|12323330|gb|AAG51638.1|AC018908_4 putative cleavage and polyadenylation specificity factor;
72745-70039 [Arabidopsis thaliana]
gi|23297661|gb|AAN13003.1| putative cleavage and polyadenylation specificity factor
[Arabidopsis thaliana]
gi|24415578|gb|AAN41458.1| putative cleavage and polyadenylation specificity factor 73 kDa
subunit [Arabidopsis thaliana]
gi|222422865|dbj|BAH19419.1| AT1G61010 [Arabidopsis thaliana]
gi|222423059|dbj|BAH19511.1| AT1G61010 [Arabidopsis thaliana]
gi|332195645|gb|AEE33766.1| cleavage and polyadenylation specificity factor subunit 3-I
[Arabidopsis thaliana]
gi|332195646|gb|AEE33767.1| cleavage and polyadenylation specificity factor subunit 3-I
[Arabidopsis thaliana]
gi|332195647|gb|AEE33768.1| cleavage and polyadenylation specificity factor subunit 3-I
[Arabidopsis thaliana]
Length = 693
Score = 221 bits (562), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 116/287 (40%), Positives = 162/287 (56%), Gaps = 6/287 (2%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V + I+YTGDY+ DRHL AA + + PD+ I EST + S+ RE+ F
Sbjct: 174 MFMVDIAGVRILYTGDYSREEDRHLRAAELPQFSPDICIIESTSGVQLHQSRHIREKRFT 233
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
+H V +GG+VLIP FALGRAQEL ++L+ YW PIY+A L +K Y+
Sbjct: 234 DVIHSTVAQGGRVLIPAFALGRAQELLLILDEYWANHPDLHNIPIYYASPLAKKCMAVYQ 293
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
+I N +IR F N F FKHI P + ++ GP VV ATPG L SGLS +F
Sbjct: 294 TYILSMNDRIRNQFANSNPFVFKHISPLNSIDDFNDVGPSVVMATPGGLQSGLSRQLFDS 353
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W + N I+PG+ V+GT+ +++ K++ N + M + Y+SFSAHAD
Sbjct: 354 WCSDKKNACIIPGYMVEGTLAKTIINEPKEVTLMNGLTAPLNMQVHYISFSAHADYAQTS 413
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF---NLDCFMPANGES 281
++ P N++LVHGEA++M+ LK+K+ EF N P N ES
Sbjct: 414 TFLKELMPPNIILVHGEANEMMRLKQKLLTEFPDGNTKIMTPKNCES 460
>gi|213512037|ref|NP_001133354.1| cleavage and polyadenylation specificity factor subunit 3 [Salmo
salar]
gi|209151738|gb|ACI33081.1| Cleavage and polyadenylation specificity factor subunit 3 [Salmo
salar]
Length = 690
Score = 221 bits (562), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 110/272 (40%), Positives = 167/272 (61%), Gaps = 3/272 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ I+YTGD++ DRHL AA I +PD+LITESTY T I + + RE F
Sbjct: 171 MFMIEIAGVKILYTGDFSRQEDRHLMAAEIPSVKPDILITESTYGTHIHEKREEREARFC 230
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
+H+ V+R G+ LIPVFALGRAQEL ++L+ YW+ PIY+A L +K Y+
Sbjct: 231 NTIHDIVNREGRCLIPVFALGRAQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQ 290
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ N KIRK N F FKHI D+ GP VV A+PGM+ SGLS +F+
Sbjct: 291 TYVNAMNDKIRKAINVNNPFVFKHISNLKSMDHFDDIGPSVVMASPGMMQSGLSRELFES 350
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W + N +I+ G+ V+GT+ ++S +++ + Q + +KM+++Y+SFSAH D +
Sbjct: 351 WCTDKRNGVIIAGYSVEGTLAKHIMSEPEEITTMSGQKLQLKMSVDYISFSAHTDYQQTS 410
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF 269
+ I+ +P +V+LVHGE ++M LK + +E+
Sbjct: 411 EFIRALKPPHVILVHGEQNEMARLKAALIREY 442
>gi|297837375|ref|XP_002886569.1| hypothetical protein ARALYDRAFT_475225 [Arabidopsis lyrata subsp.
lyrata]
gi|297332410|gb|EFH62828.1| hypothetical protein ARALYDRAFT_475225 [Arabidopsis lyrata subsp.
lyrata]
Length = 693
Score = 220 bits (561), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 116/287 (40%), Positives = 162/287 (56%), Gaps = 6/287 (2%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V + I+YTGDY+ DRHL AA + + PD+ I EST + S+ RE+ F
Sbjct: 174 MFMVDIAGVRILYTGDYSREEDRHLRAAELPQFSPDICIIESTSGVQLHQSRHIREKRFT 233
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
+H V +GG+VLIP FALGRAQEL ++L+ YW PIY+A L +K Y+
Sbjct: 234 DVIHSTVAQGGRVLIPAFALGRAQELLLILDEYWANHPDLHNIPIYYASPLAKKCMAVYQ 293
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
+I N +IR F N F FKHI P + ++ GP VV ATPG L SGLS +F
Sbjct: 294 TYILSMNDRIRNQFANSNPFVFKHISPLNSIDDFNDVGPSVVMATPGGLQSGLSRQLFDS 353
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W + N I+PG+ V+GT+ +++ K++ N + M + Y+SFSAHAD
Sbjct: 354 WCSDKKNACIIPGYMVEGTLAKTIINEPKEVTLMNGLTAPLNMQVHYISFSAHADYAQTS 413
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF---NLDCFMPANGES 281
++ P N++LVHGEA++M+ LK+K+ EF N P N ES
Sbjct: 414 TFLKELMPPNIILVHGEANEMMRLKQKLFTEFPDGNTKIMTPKNCES 460
>gi|194743214|ref|XP_001954095.1| GF18101 [Drosophila ananassae]
gi|190627132|gb|EDV42656.1| GF18101 [Drosophila ananassae]
Length = 684
Score = 220 bits (561), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 124/352 (35%), Positives = 199/352 (56%), Gaps = 7/352 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ I+YTGD++ DRHL AA + +PD+LITESTY T I + + RE F
Sbjct: 170 MFMIEIAGIKILYTGDFSRQEDRHLMAAEVPPMKPDVLITESTYGTHIHEKREDRENRFT 229
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW-ERMNL-QAPIYFAVGLTEKATNYYK 118
V + V +GG+ LIPVFALGRAQEL ++L+ +W + +L + PIY+A L +K Y+
Sbjct: 230 SLVQKTVQQGGRCLIPVFALGRAQELLLILDEFWSQNPDLHEIPIYYASSLAKKCMAVYQ 289
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
+I N +IR+ N F F+HI ++ GP V+ A+PGM+ SGLS +F+
Sbjct: 290 TYINAMNDRIRRQIAVNNPFVFRHISNLKGIDHFEDIGPCVIMASPGMMQSGLSRELFES 349
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W N +I+ G+CV+GT+ +LS +++ + Q + + M+++Y+SFSAH D +
Sbjct: 350 WCTDPKNGVIIAGYCVEGTLAKTILSEPEEITTLSGQKLPLNMSVDYISFSAHTDYQQTS 409
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVN 297
+ I+ +P +V+LVHGE ++M LK +++E+ D A+ + F ++D+
Sbjct: 410 EFIRMLKPTHVVLVHGEQNEMSRLKLALQREYEAD----ASTDIKFYNPRNTHAVDLYFR 465
Query: 298 LLKEEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIV 349
K V N NP ++ GVLV +D L+ + K +S +V
Sbjct: 466 GEKTAKVMGNLAAKNPEVGSKLSGVLVKRDFKYHLLAPSDLGKYTDMSMSVV 517
>gi|47230093|emb|CAG10507.1| unnamed protein product [Tetraodon nigroviridis]
Length = 730
Score = 220 bits (560), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 110/272 (40%), Positives = 167/272 (61%), Gaps = 3/272 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGD++ DRHL AA I +PD+LI ESTY T I + + RE F
Sbjct: 161 MFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPSVKPDILIIESTYGTHIHEKREEREARFC 220
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH+ V+R G+ LIPVFALGRAQEL ++L+ YW+ PIY+A L +K Y+
Sbjct: 221 NTVHDIVNREGRCLIPVFALGRAQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQ 280
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ N KIRK N F FKHI D+ GP VV A+PGM+ SGLS +F+
Sbjct: 281 TYVNAMNDKIRKAININNPFVFKHISNLKSMDHFDDIGPSVVMASPGMMQSGLSRELFES 340
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W + N +I+ G+CV+GT+ ++S +++ + Q + +KM+++Y+SFSAH D +
Sbjct: 341 WCTDKRNGVIIAGYCVEGTLAKHIMSEPEEITTMSGQKLQLKMSVDYISFSAHTDYQQTS 400
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF 269
+ I+ +P +V+LVHGE ++M LK + +E+
Sbjct: 401 EFIRALKPPHVILVHGEQNEMARLKAALIREY 432
>gi|326503296|dbj|BAJ99273.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 693
Score = 220 bits (560), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 116/289 (40%), Positives = 164/289 (56%), Gaps = 10/289 (3%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V + I+YTGDY+ DRHL AA + + PD+ I ESTY + RE+ F
Sbjct: 176 MFMVDIAGVRILYTGDYSREEDRHLKAAEVPQFSPDICIIESTYGVQQHQPRHVREKRFT 235
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
+H V +GG+VLIP +ALGRAQEL ++L+ YW + PIY+A L +K Y+
Sbjct: 236 DAIHNTVSQGGRVLIPAYALGRAQELLLILDEYWSNHPELHKIPIYYASPLAKKCMAVYQ 295
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDN---PGPMVVFATPGMLHSGLSLIIF 175
+I N++IR F Q N F FKHI P + IDN GP VV A+PG L SGLS +F
Sbjct: 296 TYINSMNERIRNQFAQSNPFHFKHIEPLNS--IDNFHDVGPSVVMASPGSLQSGLSRQLF 353
Query: 176 KKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKG 235
KW + N ++PGF V+G++ +++ +++ N + M I Y+SFSAHAD
Sbjct: 354 DKWCTDKKNTCVIPGFAVEGSLVKTIINEPREVTLANGLTAPLHMQIFYISFSAHADFLQ 413
Query: 236 IMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF---NLDCFMPANGES 281
+ P N++LVHGEA++M LK+K+ +F N P N +S
Sbjct: 414 TSGFLDELRPPNIILVHGEANEMGRLKQKLTTQFDGTNTKIVSPKNCQS 462
>gi|401827745|ref|XP_003888165.1| putative RNA-processing beta-lactamase-fold exonuclease
[Encephalitozoon hellem ATCC 50504]
gi|392999365|gb|AFM99184.1| putative RNA-processing beta-lactamase-fold exonuclease
[Encephalitozoon hellem ATCC 50504]
Length = 643
Score = 220 bits (560), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 113/271 (41%), Positives = 168/271 (61%), Gaps = 5/271 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGD++ DRHL AA C+ D LITESTY + RE F
Sbjct: 159 MFLIEIEKSKVLYTGDFSREEDRHLKAAESPGCKIDALITESTYGVQCHLPRAEREGRFT 218
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW-ERMNLQA-PIYFAVGLTEKATNYYK 118
V V RGG+ L+PVFALGRAQEL ++LE +W +LQ PIY+A L ++ Y+
Sbjct: 219 SIVQNVVQRGGRCLLPVFALGRAQELLLILEEHWGSNASLQKIPIYYASALAKRCMGVYQ 278
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK--SFIDNPGPMVVFATPGMLHSGLSLIIFK 176
+I N++I+K + RN F FK+++ SF D+ GP V+ A+PGML SGLS +F+
Sbjct: 279 TYIGMMNERIQKLSLVRNPFAFKYVKNLKGIDSF-DDEGPCVIMASPGMLQSGLSRDLFE 337
Query: 177 KWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGI 236
+W N +I+PG+CV GT+ ++LS K++E N + + + M++EY+SFSAH D
Sbjct: 338 RWCSDSRNAVIIPGYCVDGTLAKEILSEPKEIEALNGKKLRLNMSVEYISFSAHVDFTQN 397
Query: 237 MQLIQYCEPKNVLLVHGEASKMVFLKEKIKQ 267
Q I+ C+PK++ VHGE ++M L+ I+Q
Sbjct: 398 SQFIEECQPKHLFFVHGEMNEMQRLRNVIQQ 428
>gi|326495416|dbj|BAJ85804.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 704
Score = 220 bits (560), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 117/289 (40%), Positives = 164/289 (56%), Gaps = 10/289 (3%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V + I+YTGDY+ DRHL AA I + PD+ I ESTY + RE+ F
Sbjct: 188 MFMVDIAGVRILYTGDYSREEDRHLKAAEIPQFSPDICIIESTYGVQQHQPRHVREKRFT 247
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
+H V +GG+VLIP FALGRAQEL ++L+ YW + PIY+A L +K Y+
Sbjct: 248 DAIHNTVSQGGRVLIPAFALGRAQELLLILDEYWSNHPELHKIPIYYASPLAKKCMAVYQ 307
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDN---PGPMVVFATPGMLHSGLSLIIF 175
+I N++IR F Q N F FKHI P + IDN GP VV A+PG L SGLS +F
Sbjct: 308 TYINSMNERIRNQFAQSNPFHFKHIDPLNS--IDNFHDVGPSVVMASPGSLQSGLSRQLF 365
Query: 176 KKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKG 235
KW + N ++PG+ V+G++ +++ +++ N + M I Y+SFSAHAD
Sbjct: 366 DKWCTDKKNTCVIPGYAVEGSLAKTIINEPREVTLANGLTAPLNMQIFYISFSAHADFPQ 425
Query: 236 IMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF---NLDCFMPANGES 281
+ P N++LVHGEA++M LK+K+ +F N P N +S
Sbjct: 426 TSGFLGELCPPNIILVHGEANEMGRLKQKLITQFDGTNTKIVSPKNCQS 474
>gi|326487902|dbj|BAJ89790.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 704
Score = 220 bits (560), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 117/289 (40%), Positives = 164/289 (56%), Gaps = 10/289 (3%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V + I+YTGDY+ DRHL AA I + PD+ I ESTY + RE+ F
Sbjct: 188 MFMVDIAGVRILYTGDYSREEDRHLKAAEIPQFSPDICIIESTYGVQQHQPRHVREKRFT 247
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
+H V +GG+VLIP FALGRAQEL ++L+ YW + PIY+A L +K Y+
Sbjct: 248 DAIHNTVSQGGRVLIPAFALGRAQELLLILDEYWSNHPELHKIPIYYASPLAKKCMAVYQ 307
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDN---PGPMVVFATPGMLHSGLSLIIF 175
+I N++IR F Q N F FKHI P + IDN GP VV A+PG L SGLS +F
Sbjct: 308 TYINSMNERIRNQFAQSNPFHFKHIDPLNS--IDNFHDVGPSVVMASPGSLQSGLSRQLF 365
Query: 176 KKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKG 235
KW + N ++PG+ V+G++ +++ +++ N + M I Y+SFSAHAD
Sbjct: 366 DKWCTDKKNTCVIPGYAVEGSLAKTIINEPREVTLANGLTAPLNMQIFYISFSAHADFPQ 425
Query: 236 IMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF---NLDCFMPANGES 281
+ P N++LVHGEA++M LK+K+ +F N P N +S
Sbjct: 426 TSGFLGELCPPNIILVHGEANEMGRLKQKLITQFDGTNTKIVSPKNCQS 474
>gi|221106537|ref|XP_002161150.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
3-like [Hydra magnipapillata]
Length = 677
Score = 220 bits (560), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 115/291 (39%), Positives = 177/291 (60%), Gaps = 11/291 (3%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ +I+YTGD++ DRHL +A I PD+LI ESTY T + + + RE+ F
Sbjct: 162 MFMIEIAGVNILYTGDFSRQEDRHLMSAEIPNISPDVLIMESTYGTHVHEKREQREKRFT 221
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW-ERMNLQ-APIYFAVGLTEKATNYYK 118
+H + RGG+ LIPVFALGRAQEL ++L+ YW + LQ P+Y+A L +K Y+
Sbjct: 222 STIHNIISRGGRCLIPVFALGRAQELLLILDEYWNQHPELQDVPVYYASSLAKKCMAVYQ 281
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFD--KSFIDNPGPMVVFATPGMLHSGLSLIIFK 176
+I+ N+KIR+ N F FKHI SF D+ GP VV A+PGM+ SGLS +F+
Sbjct: 282 TYISAMNEKIRRQISISNPFVFKHISNLKGIDSF-DDIGPSVVLASPGMMQSGLSRELFE 340
Query: 177 KWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGI 236
W N +I+ G+CV+GT+ +++S +++ N Q + ++I+Y+SFSAHAD + I
Sbjct: 341 TWCTDPRNGVIIAGYCVEGTLAKELMSEPEEVTAMNGQRMKRNISIDYISFSAHADFEQI 400
Query: 237 MQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF------NLDCFMPANGES 281
+ + P +++L HGE ++M LK I ++F N+ + P N +S
Sbjct: 401 SEFVDILRPPHIVLNHGEQNEMSRLKAAIIRDFEDRGISNIQVYNPKNCQS 451
>gi|396082284|gb|AFN83894.1| putative beta-lactamase fold-containing exonuclease
[Encephalitozoon romaleae SJ-2008]
Length = 643
Score = 219 bits (559), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 113/271 (41%), Positives = 168/271 (61%), Gaps = 5/271 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGD++ DRHL AA C+ D LITESTY + RE F
Sbjct: 159 MFLIEIEKSKVLYTGDFSREEDRHLKAAESPGCKIDGLITESTYGVQCHLPRAEREGRFT 218
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWE-RMNLQA-PIYFAVGLTEKATNYYK 118
V V RGG+ L+PVFALGRAQEL ++LE +W +LQ PIY+A L ++ Y+
Sbjct: 219 SIVQNVVQRGGRCLLPVFALGRAQELLLILEEHWNSNTSLQKIPIYYASALAKRCMGVYQ 278
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFD--KSFIDNPGPMVVFATPGMLHSGLSLIIFK 176
+I N++I+K + RN F FK+++ SF D+ GP V+ A+PGML SGLS +F+
Sbjct: 279 TYIGMMNERIQKLSLVRNPFAFKYVKNLKGIDSF-DDEGPCVIMASPGMLQSGLSRDLFE 337
Query: 177 KWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGI 236
+W N +I+PG+CV GT+ ++LS K++E N + + + M++EY+SFSAH D
Sbjct: 338 RWCSDSKNAVIIPGYCVDGTLAKEILSEPKEIEALNGKKLRLNMSVEYISFSAHVDFTQN 397
Query: 237 MQLIQYCEPKNVLLVHGEASKMVFLKEKIKQ 267
Q I+ C+PK++ VHGE ++M L+ I+Q
Sbjct: 398 SQFIEECQPKHLFFVHGEMNEMQRLRNVIQQ 428
>gi|71682600|gb|AAI00570.1| Cpsf3 protein [Mus musculus]
Length = 512
Score = 219 bits (559), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 110/272 (40%), Positives = 168/272 (61%), Gaps = 3/272 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGD++ DRHL AA I +PD+LI ESTY T I + + RE F
Sbjct: 164 MFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPNIKPDILIIESTYGTHIHEKREEREARFC 223
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH+ V+RGG+ LIPVFALGRAQEL ++L+ YW+ PIY+A L +K Y+
Sbjct: 224 NTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQ 283
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ N KIRK N F FKHI D+ GP VV A+PGM+ +GLS +F+
Sbjct: 284 TYVNAMNDKIRKQININNPFVFKHISNLKSMDHFDDIGPSVVMASPGMIQNGLSRELFES 343
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W + N +I+ G+CV+GT+ ++S +++ + Q + +KM+++Y+SFSAH D +
Sbjct: 344 WCTDKRNGVIIAGYCVEGTLAKHIMSEPEEITTMSGQKLPLKMSVDYVSFSAHTDYQQTS 403
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF 269
+ I+ +P +V+LVHGE ++M LK + +E+
Sbjct: 404 EFIRALKPPHVILVHGEQNEMARLKAALIREY 435
>gi|226497180|ref|NP_001146407.1| uncharacterized protein LOC100279987 [Zea mays]
gi|219887045|gb|ACL53897.1| unknown [Zea mays]
gi|414873991|tpg|DAA52548.1| TPA: hypothetical protein ZEAMMB73_264007 [Zea mays]
Length = 697
Score = 219 bits (559), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 114/289 (39%), Positives = 166/289 (57%), Gaps = 10/289 (3%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V + I+YTGDY+ DRHL AA + + PD+ I ESTY + RE+ F
Sbjct: 180 MFMVDIAGVRILYTGDYSREEDRHLRAAELPQFSPDICIIESTYGVQQHQPRIIREKRFT 239
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
+ +H V +GG+VLIP FALGRAQEL ++L+ YW + + PIY+A L ++ Y+
Sbjct: 240 EVIHNTVSQGGRVLIPAFALGRAQELLLILDEYWSKHPELHKIPIYYASPLAKRCMAVYQ 299
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDN---PGPMVVFATPGMLHSGLSLIIF 175
+I N++IR F Q N F FKHI + IDN GP VV A+PG L SGLS +F
Sbjct: 300 TYINSMNERIRNQFAQSNPFHFKHIESLNS--IDNFHDVGPSVVMASPGGLQSGLSRQLF 357
Query: 176 KKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKG 235
KW + N ++PG+ V+GT+ +++ +++ N + M++ Y+SFSAHAD
Sbjct: 358 DKWCTDKKNACVIPGYVVEGTLAKTIINEPREVTLANGLTAPLHMSVHYISFSAHADFPQ 417
Query: 236 IMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF---NLDCFMPANGES 281
+ P N++LVHGEA++M LK+K+ +F N P N +S
Sbjct: 418 TSNFLDELRPPNIILVHGEANEMSRLKQKLITQFDGTNTKIVSPKNCQS 466
>gi|125546484|gb|EAY92623.1| hypothetical protein OsI_14368 [Oryza sativa Indica Group]
Length = 700
Score = 219 bits (559), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 114/289 (39%), Positives = 165/289 (57%), Gaps = 10/289 (3%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V + ++YTGDY+ DRHL AA + + PD+ I ESTY + RE+ F
Sbjct: 183 MFMVDIAGVRVLYTGDYSREEDRHLKAAELPQFSPDICIIESTYGVQQHQPRHVREKRFT 242
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
+H V +GG+VLIP FALGRAQEL ++L+ YW + PIY+A L +K Y+
Sbjct: 243 DVIHTTVSQGGRVLIPAFALGRAQELLLILDEYWANHPELHKIPIYYASPLAKKCMAVYQ 302
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDN---PGPMVVFATPGMLHSGLSLIIF 175
+I N++IR F Q N F FKHI + IDN GP VV A+PG L SGLS +F
Sbjct: 303 TYINSMNERIRNQFAQSNPFHFKHIESLNS--IDNFHDVGPSVVMASPGGLQSGLSRQLF 360
Query: 176 KKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKG 235
KW + N ++PG+ V+GT+ +++ +++ N + M + Y+SFSAHAD
Sbjct: 361 DKWCTDKKNSCVIPGYVVEGTLAKTIINEPREVTLANGLTAPLHMQVHYISFSAHADFPQ 420
Query: 236 IMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF---NLDCFMPANGES 281
+ +P N++LVHGEA++M LK+K+ +F N+ P N +S
Sbjct: 421 TSTFLDELQPPNIVLVHGEANEMSRLKQKLISQFDGTNIKVVNPKNCQS 469
>gi|240975718|ref|XP_002402161.1| cleavage and polyadenylation specificity factor, putative [Ixodes
scapularis]
gi|215491113|gb|EEC00754.1| cleavage and polyadenylation specificity factor, putative [Ixodes
scapularis]
Length = 694
Score = 219 bits (558), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 123/342 (35%), Positives = 194/342 (56%), Gaps = 19/342 (5%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGD++ DRHL AA I PD+LI ESTY T I + + RE F
Sbjct: 177 MFMIEIAGVKVLYTGDFSRQEDRHLMAAEIPNIHPDVLIIESTYGTHIHEKREEREARFT 236
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH+ V+RGG+ LIPVFALGRAQEL ++L+ YW PIY+A L +K Y+
Sbjct: 237 GLVHDIVNRGGRCLIPVFALGRAQELLLILDEYWSNHPELHDIPIYYASSLAKKCMAVYQ 296
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ N++IR+ N F FKHI ++ GP VV A+PGM+ SGLS +F+
Sbjct: 297 TYVNAMNERIRRQITINNPFVFKHISNLKSIEHFEDVGPCVVMASPGMMQSGLSRELFES 356
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W N +I+ G+CV+GT+ +LS +++ Q + +KM+++Y+SFSAH D +
Sbjct: 357 WCTDPKNGVIIAGYCVEGTLAKTILSEPEEISTMVGQKLPLKMSVDYISFSAHTDYQQTS 416
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF------NLDCFMPANGESCFVQTDMKIS 291
+ I+ +P +++LVHGE ++M LK I +E+ ++ P N ++ ++
Sbjct: 417 EFIRTLKPPHIVLVHGEQNEMGRLKAAIVREYEDDVETRIEVHNPRNTQA----VELYFR 472
Query: 292 IDVSVNLLKEEAVKYNSEPPNPLKERQIHGVLVIKDSSISLM 333
+ + ++ AV + P P RQ+ GVLV ++ S L+
Sbjct: 473 GEKTAKVMGSLAV----QAPEP--GRQLSGVLVKRNFSYHLL 508
>gi|115456655|ref|NP_001051928.1| Os03g0852900 [Oryza sativa Japonica Group]
gi|27573349|gb|AAO20067.1| putative cleavage and polyadenylation specifity factor protein
[Oryza sativa Japonica Group]
gi|29126360|gb|AAO66552.1| putative cleavage and polyadenylation specifity factor [Oryza
sativa Japonica Group]
gi|108712151|gb|ABF99946.1| Cleavage and polyadenylation specificity factor, 73 kDa subunit,
putative, expressed [Oryza sativa Japonica Group]
gi|113550399|dbj|BAF13842.1| Os03g0852900 [Oryza sativa Japonica Group]
gi|125588676|gb|EAZ29340.1| hypothetical protein OsJ_13407 [Oryza sativa Japonica Group]
Length = 700
Score = 219 bits (558), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 114/289 (39%), Positives = 165/289 (57%), Gaps = 10/289 (3%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V + ++YTGDY+ DRHL AA + + PD+ I ESTY + RE+ F
Sbjct: 183 MFMVDIAGVRVLYTGDYSREEDRHLKAAELPQFSPDICIIESTYGVQQHQPRHVREKRFT 242
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
+H V +GG+VLIP FALGRAQEL ++L+ YW + PIY+A L +K Y+
Sbjct: 243 DVIHTTVSQGGRVLIPAFALGRAQELLLILDEYWANHPELHKIPIYYASPLAKKCMAVYQ 302
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDN---PGPMVVFATPGMLHSGLSLIIF 175
+I N++IR F Q N F FKHI + IDN GP VV A+PG L SGLS +F
Sbjct: 303 TYINSMNERIRNQFAQSNPFHFKHIESLNS--IDNFHDVGPSVVMASPGGLQSGLSRQLF 360
Query: 176 KKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKG 235
KW + N ++PG+ V+GT+ +++ +++ N + M + Y+SFSAHAD
Sbjct: 361 DKWCTDKKNSCVIPGYVVEGTLAKTIINEPREVTLANGLTAPLHMQVHYISFSAHADFPQ 420
Query: 236 IMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF---NLDCFMPANGES 281
+ +P N++LVHGEA++M LK+K+ +F N+ P N +S
Sbjct: 421 TSTFLDELQPPNIVLVHGEANEMSRLKQKLISQFDGTNIKVVNPKNCQS 469
>gi|66820693|ref|XP_643926.1| beta-lactamase domain-containing protein [Dictyostelium discoideum
AX4]
gi|74860395|sp|Q86A79.1|CPSF3_DICDI RecName: Full=Cleavage and polyadenylation specificity factor
subunit 3; Short=Cleavage and polyadenylation
specificity factor 3
gi|60472339|gb|EAL70292.1| beta-lactamase domain-containing protein [Dictyostelium discoideum
AX4]
Length = 774
Score = 219 bits (557), Expect = 8e-54, Method: Compositional matrix adjust.
Identities = 130/384 (33%), Positives = 211/384 (54%), Gaps = 23/384 (5%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ I+YTGD++ DRHL A + D+LI ESTY + + + RE+ F
Sbjct: 191 MFMIEIAGVKILYTGDFSRQEDRHLMGAETPPVKVDVLIIESTYGVQVHEPRLEREKRFT 250
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQ---APIYFAVGLTEKATNYY 117
VH+ V+R GK LIPVFALGRAQEL ++L+ YW N Q PIY+A L +K Y
Sbjct: 251 SSVHQVVERNGKCLIPVFALGRAQELLLILDEYW-IANPQLHHVPIYYASALAKKCMGVY 309
Query: 118 KMFITWTNQKIRKTFVQRNMFDFKHIRPFD--KSFIDNPGPMVVFATPGMLHSGLSLIIF 175
+ +I N ++R F N F+FKHI+ +SF D+ GP V A+PGML SGLS +F
Sbjct: 310 RTYINMMNDRVRAQFDVSNPFEFKHIKNIKGIESF-DDRGPCVFMASPGMLQSGLSRQLF 368
Query: 176 KKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKG 235
++W + N +++PG+ V+GT+ ++S ++ + + + + + Y+SFSAH+D
Sbjct: 369 ERWCSDKRNGIVIPGYSVEGTLAKHIMSEPAEITRLDNVNVPLNLTVSYVSFSAHSDFLQ 428
Query: 236 IMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF-NLDCFMPANGESCFVQTDMKISIDV 294
+ IQ +P +V+LVHG+A++M L++ + +F ++ P N S V
Sbjct: 429 TSEFIQEIQPPHVVLVHGDANEMSRLRQSLVAKFKTINVLTPKNAMS------------V 476
Query: 295 SVNLLKEEAVK-YNSEPPNPLKERQ-IHGVLVIKDSSISLMDVDEACKEVGISRHIVRFT 352
++ E+ K S NP K+ I G+LV KD + ++ + + +I++
Sbjct: 477 ALEFRPEKVAKTLGSIITNPPKQNDIIQGILVTKDFTHHILSASDIHNYTNLKTNIIKQK 536
Query: 353 SNVQISDSSPI-NKTLELIYDQLL 375
+ + + I TLE IY+Q++
Sbjct: 537 LTLPFAQTYHILISTLEQIYEQII 560
>gi|326508058|dbj|BAJ86772.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 704
Score = 219 bits (557), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 117/289 (40%), Positives = 163/289 (56%), Gaps = 10/289 (3%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V + I YTGDY+ DRHL AA I + PD+ I ESTY + RE+ F
Sbjct: 188 MFMVDIAGVRIRYTGDYSREEDRHLKAAEIPQFSPDICIIESTYGVQQHQPRHVREKRFT 247
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
+H V +GG+VLIP FALGRAQEL ++L+ YW + PIY+A L +K Y+
Sbjct: 248 DAIHNTVSQGGRVLIPAFALGRAQELLLILDEYWSNHPELHKIPIYYASPLAKKCMAVYQ 307
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDN---PGPMVVFATPGMLHSGLSLIIF 175
+I N++IR F Q N F FKHI P + IDN GP VV A+PG L SGLS +F
Sbjct: 308 TYINSMNERIRNQFAQSNPFHFKHIDPLNS--IDNFHDVGPSVVMASPGSLQSGLSRQLF 365
Query: 176 KKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKG 235
KW + N ++PG+ V+G++ +++ +++ N + M I Y+SFSAHAD
Sbjct: 366 DKWCTDKKNTCVIPGYAVEGSLAKTIINEPREVTLANGLTAPLNMQIFYISFSAHADFPQ 425
Query: 236 IMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF---NLDCFMPANGES 281
+ P N++LVHGEA++M LK+K+ +F N P N +S
Sbjct: 426 TSGFLGELCPPNIILVHGEANEMGRLKQKLITQFDGTNTKIVSPKNCQS 474
>gi|224140921|ref|XP_002323825.1| predicted protein [Populus trichocarpa]
gi|222866827|gb|EEF03958.1| predicted protein [Populus trichocarpa]
Length = 696
Score = 218 bits (556), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 115/290 (39%), Positives = 166/290 (57%), Gaps = 12/290 (4%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V + ++YTGDY+ DRHL AA + + PD+ I ESTY + + RE+ F
Sbjct: 177 MFMVDIAGVRVLYTGDYSREEDRHLCAAEMPQFSPDICIIESTYGVQLHQPRHLREKRFT 236
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW----ERMNLQAPIYFAVGLTEKATNY 116
+H + GG+VLIP FALGRAQEL ++L+ YW E N+ PIY+A L +K
Sbjct: 237 DVIHSTISLGGRVLIPAFALGRAQELLLILDEYWSNHPELHNI--PIYYASPLAKKCMTV 294
Query: 117 YKMFITWTNQKIRKTFVQRNMFDFKHIRPFD--KSFIDNPGPMVVFATPGMLHSGLSLII 174
Y+ +I N++IR F N F FKHI P + + F D GP VV A+PG L SGLS +
Sbjct: 295 YQTYILSMNERIRNQFANSNPFKFKHISPLNSIEDFSDV-GPSVVMASPGGLQSGLSRQL 353
Query: 175 FKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAK 234
F W + N ++PG+ V+GT+ +++ K+++ N + M + Y+SFSAHAD
Sbjct: 354 FDMWCSDKKNACVIPGYVVEGTLAKTIINEPKEVQLMNGLTAPLNMQVHYISFSAHADYA 413
Query: 235 GIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF---NLDCFMPANGES 281
++ P N++LVHGEA++M LK+K+ EF N P N +S
Sbjct: 414 QTSTFLKELMPPNIILVHGEANEMGRLKQKLITEFTDGNTKIITPKNCQS 463
>gi|359486185|ref|XP_003633408.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
3-I-like [Vitis vinifera]
Length = 694
Score = 218 bits (556), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 114/290 (39%), Positives = 166/290 (57%), Gaps = 12/290 (4%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V + ++YTGDY+ DRHL AA I + PD+ I ESTY + + RE+ F
Sbjct: 176 MFMVDIAGVRVLYTGDYSREEDRHLRAAEIPQFSPDICIIESTYGVQLHQPRHVREKRFT 235
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW----ERMNLQAPIYFAVGLTEKATNY 116
+H + +GG+VLIP FALGRAQEL ++L+ YW E N+ PIY+A L ++
Sbjct: 236 DVIHSTISQGGRVLIPAFALGRAQELLLILDEYWSNHPELHNI--PIYYASPLAKRCMAV 293
Query: 117 YKMFITWTNQKIRKTFVQRNMFDFKHIRPFDKSF--IDNPGPMVVFATPGMLHSGLSLII 174
Y+ +I N++IR F N FDFKHI P KS ++ GP VV A+P L SGLS +
Sbjct: 294 YQTYINSMNERIRNQFANSNPFDFKHISPL-KSIENFNDVGPSVVMASPSGLQSGLSRQL 352
Query: 175 FKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAK 234
F W + N ++PG+ V+GT+ +++ K++ N + M + Y+SFSAHAD
Sbjct: 353 FDMWCSDKKNACVIPGYVVEGTLAKTIINEPKEVTLMNGLTAPLNMQVHYISFSAHADFA 412
Query: 235 GIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF---NLDCFMPANGES 281
++ P N++LVHGEA++M LK+K+ +F N P N +S
Sbjct: 413 QTSTFLKELMPPNIILVHGEANEMGRLKQKLITQFADRNTKIISPKNCQS 462
>gi|147787280|emb|CAN71414.1| hypothetical protein VITISV_029216 [Vitis vinifera]
Length = 687
Score = 218 bits (556), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 114/290 (39%), Positives = 166/290 (57%), Gaps = 12/290 (4%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V + ++YTGDY+ DRHL AA I + PD+ I ESTY + + RE+ F
Sbjct: 169 MFMVDIAGVRVLYTGDYSREEDRHLRAAEIPQFSPDICIIESTYGVQLHQPRHVREKRFT 228
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW----ERMNLQAPIYFAVGLTEKATNY 116
+H + +GG+VLIP FALGRAQEL ++L+ YW E N+ PIY+A L ++
Sbjct: 229 DVIHSTISQGGRVLIPAFALGRAQELLLILDEYWSNHPELHNI--PIYYASPLAKRCMAV 286
Query: 117 YKMFITWTNQKIRKTFVQRNMFDFKHIRPFDKSF--IDNPGPMVVFATPGMLHSGLSLII 174
Y+ +I N++IR F N FDFKHI P KS ++ GP VV A+P L SGLS +
Sbjct: 287 YQTYINSMNERIRNQFANSNPFDFKHISPL-KSIENFNDVGPSVVMASPSGLQSGLSRQL 345
Query: 175 FKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAK 234
F W + N ++PG+ V+GT+ +++ K++ N + M + Y+SFSAHAD
Sbjct: 346 FDMWCSDKKNACVIPGYVVEGTLAKTIINEPKEVTLMNGLTAPLNMQVHYISFSAHADFA 405
Query: 235 GIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF---NLDCFMPANGES 281
++ P N++LVHGEA++M LK+K+ +F N P N +S
Sbjct: 406 QTSTFLKELMPPNIILVHGEANEMGRLKQKLITQFADRNTKIISPKNCQS 455
>gi|195497711|ref|XP_002096215.1| GE25184 [Drosophila yakuba]
gi|194182316|gb|EDW95927.1| GE25184 [Drosophila yakuba]
Length = 684
Score = 218 bits (555), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 129/377 (34%), Positives = 209/377 (55%), Gaps = 10/377 (2%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ I+YTGD++ DRHL AA + +PD+LITESTY T I + + RE F
Sbjct: 170 MFMIEIAGIKILYTGDFSRQEDRHLMAAEVPPMKPDVLITESTYGTHIHEKREDRENRFT 229
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW-ERMNL-QAPIYFAVGLTEKATNYYK 118
V + V +GG+ LIPVFALGRAQEL ++L+ +W + +L + PIY+A L +K Y+
Sbjct: 230 SLVQKIVQQGGRCLIPVFALGRAQELLLILDEFWSQNPDLHEIPIYYASSLAKKCMAVYQ 289
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
+I N +IR+ N F F+HI ++ GP V+ A+PGM+ SGLS +F+
Sbjct: 290 TYINAMNDRIRRQIAVNNPFVFRHISNLKGIDHFEDIGPCVIMASPGMMQSGLSRELFES 349
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W N +I+ G+CV+GT+ VLS +++ + Q + + M+++Y+SFSAH D +
Sbjct: 350 WCTDPKNGVIIAGYCVEGTLAKAVLSEPEEITTLSGQKLPLNMSVDYISFSAHTDYQQTS 409
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVN 297
+ I+ +P +V+LVHGE ++M LK +++E+ D A+ + F ++D+
Sbjct: 410 EFIRLLKPTHVVLVHGEQNEMSRLKLALQREYEAD----ASTDIKFYNPRNTHAVDLYFR 465
Query: 298 LLKEEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQI 357
K V + N ++ GVLV +D L+ + K +S +V ++
Sbjct: 466 GEKTAKVMGSLAAKNSEVGSKLSGVLVKRDFKYHLLAPSDLGKYTDMSMSVVTQRQSIPW 525
Query: 358 SDSSPINKTLELIYDQL 374
S TLEL+ D++
Sbjct: 526 GSSL---ATLELLLDRI 539
>gi|195108751|ref|XP_001998956.1| GI24246 [Drosophila mojavensis]
gi|193915550|gb|EDW14417.1| GI24246 [Drosophila mojavensis]
Length = 686
Score = 218 bits (555), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 128/377 (33%), Positives = 209/377 (55%), Gaps = 10/377 (2%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ I+YTGD++ DRHL AA + +PD+LITESTY T I + + RE F
Sbjct: 172 MFMIEIAGIKILYTGDFSRQEDRHLMAAEVPPKKPDVLITESTYGTHIHEKREDRESRFT 231
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW-ERMNL-QAPIYFAVGLTEKATNYYK 118
V + V +GG+ LIPVFALGRAQEL ++L+ +W + +L + PIY+A L +K Y+
Sbjct: 232 SLVQKIVMQGGRCLIPVFALGRAQELLLILDEFWSQNPDLHEIPIYYASSLAKKCMAVYQ 291
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
+I N +IR+ N F F+HI D+ GP V+ A+PGM+ SGLS +F+
Sbjct: 292 TYINAMNDRIRRQIAVNNPFVFRHISNLKGIDHFDDIGPCVIMASPGMMQSGLSRELFES 351
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W N +I+ G+CV+GT+ +LS +++ + Q + + M+++Y+SFSAH D +
Sbjct: 352 WCTDPKNGVIIAGYCVEGTLAKTILSEPEEITTLSGQKLPLNMSVDYISFSAHTDYQQTS 411
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVN 297
+ I+ P +V+LVHGE ++M LK +++E+ D A+ + F ++D+
Sbjct: 412 EFIRMLRPNHVVLVHGEQNEMSRLKLALQREYEAD----ASTDIKFYNPRNTHAVDLYFR 467
Query: 298 LLKEEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQI 357
K V + P ++ GVLV +D L+ + K +S +V ++
Sbjct: 468 GEKTAKVMGHLAAKKPEVGSKLSGVLVKRDFKYHLLAPSDLGKYTDMSMSVVTQRQSIPW 527
Query: 358 SDSSPINKTLELIYDQL 374
+ S TL+L+ D++
Sbjct: 528 TSSL---ATLQLLLDRI 541
>gi|195395198|ref|XP_002056223.1| GJ10819 [Drosophila virilis]
gi|194142932|gb|EDW59335.1| GJ10819 [Drosophila virilis]
Length = 686
Score = 218 bits (555), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 128/377 (33%), Positives = 209/377 (55%), Gaps = 10/377 (2%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ I+YTGD++ DRHL AA + +PD+LITESTY T I + + RE F
Sbjct: 172 MFMIEIAGIKILYTGDFSRQEDRHLMAAEVPPKKPDVLITESTYGTHIHEKREDRESRFT 231
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW-ERMNL-QAPIYFAVGLTEKATNYYK 118
V + V +GG+ LIPVFALGRAQEL ++L+ +W + +L + PIY+A L +K Y+
Sbjct: 232 SLVQKIVMQGGRCLIPVFALGRAQELLLILDEFWSQNPDLHEIPIYYASSLAKKCMAVYQ 291
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
+I N +IR+ N F F+HI D+ GP V+ A+PGM+ SGLS +F+
Sbjct: 292 TYINAMNDRIRRQIAVNNPFVFRHISNLKGIDHFDDIGPCVIMASPGMMQSGLSRELFES 351
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W N +I+ G+CV+GT+ +LS +++ + Q + + M+++Y+SFSAH D +
Sbjct: 352 WCTDPKNGVIIAGYCVEGTLAKTILSEPEEITTLSGQKLPLNMSVDYISFSAHTDYQQTS 411
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVN 297
+ I+ P +V+LVHGE ++M LK +++E+ D A+ + F ++D+
Sbjct: 412 EFIRMLRPNHVVLVHGEQNEMSRLKLALQREYEAD----ASTDIKFYNPRNTHAVDLYFR 467
Query: 298 LLKEEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQI 357
K V + P ++ GVLV +D L+ + K +S +V ++
Sbjct: 468 GEKTAKVMGHLAAKKPEVGSKLSGVLVKRDFKYHLLAPSDLGKYTDMSMSVVTQRQSIPW 527
Query: 358 SDSSPINKTLELIYDQL 374
+ S TL+L+ D++
Sbjct: 528 TSSL---ATLQLLLDRI 541
>gi|195037533|ref|XP_001990215.1| GH19212 [Drosophila grimshawi]
gi|193894411|gb|EDV93277.1| GH19212 [Drosophila grimshawi]
Length = 686
Score = 217 bits (553), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 124/352 (35%), Positives = 197/352 (55%), Gaps = 7/352 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ I+YTGD++ DRHL AA + +PD+LITESTY T I + + RE F
Sbjct: 172 MFMIEIAGIKILYTGDFSRQEDRHLMAAEVPPKKPDVLITESTYGTHIHEKREDRESRFT 231
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW-ERMNL-QAPIYFAVGLTEKATNYYK 118
V + V +GG+ LIPVFALGRAQEL ++L+ YW + +L + PIY+A L +K Y+
Sbjct: 232 TLVQKIVQQGGRCLIPVFALGRAQELLLILDEYWSQNPDLHEIPIYYASSLAKKCMAVYQ 291
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
+I N +IR+ N F F+HI D+ GP V+ A+PGM+ SGLS +F+
Sbjct: 292 TYINAMNDRIRRQIAVNNPFVFRHISNLKGIDHFDDIGPCVIMASPGMMQSGLSRELFES 351
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W N +I+ G+CV+GT+ +LS +++ + Q + + M+++Y+SFSAH D +
Sbjct: 352 WCTDPKNGVIIAGYCVEGTLAKTILSEPEEIITLSGQKLPLNMSVDYISFSAHTDYQQTS 411
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVN 297
+ I+ P +V+LVHGE ++M LK +++E+ D A+ + F ++D+
Sbjct: 412 EFIRLLRPNHVVLVHGEQNEMSRLKLALQREYEAD----ASTDIKFYNPRNTHAVDLYFR 467
Query: 298 LLKEEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIV 349
K V + P ++ GVLV +D L+ + K +S +V
Sbjct: 468 GEKTAKVMGHLAAKKPEVGSKLSGVLVKRDFKYHLLAPSDLGKYTDMSMSVV 519
>gi|440795785|gb|ELR16901.1| putative cleavage and polyadenylation specificity factor, putative
[Acanthamoeba castellanii str. Neff]
Length = 589
Score = 217 bits (552), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 125/347 (36%), Positives = 190/347 (54%), Gaps = 22/347 (6%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ I+YTGD++ DRHL AA D++I ESTY I + + RE F
Sbjct: 44 MFMIEIAGVRILYTGDFSRQEDRHLMAAETPAYTADIVIVESTYGVQIHEPRIERETRFT 103
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
K VH V RGG+ L+PVFALGRAQEL ++L+ YWE + PIY+A L +K Y+
Sbjct: 104 KLVHTIVRRGGRCLLPVFALGRAQELLLILDEYWEAHPELHKVPIYYASSLAKKCMTVYQ 163
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
+I N+ IRK F N F FKHI D+ GP VV A+PGML SGLS +F+K
Sbjct: 164 TYINMMNENIRKQFAVSNPFVFKHISNLKGMQHFDDSGPCVVMASPGMLQSGLSRELFEK 223
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHAD---AK 234
W N +I+PG+CV+GT+ ++S ++ + +++ +K ++ Y+SFSAH+D
Sbjct: 224 WCSNAKNGVIIPGYCVEGTLAKHIMSEPSEVTAMDGRMLPLKASVHYISFSAHSDFLQTA 283
Query: 235 GIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF---NLDCFMPANGESCFVQTDMKIS 291
G + +IQ P V+LVHG+A++M LK + + N+ P N ++ ++ + +
Sbjct: 284 GFLDIIQ---PPYVVLVHGDANEMSRLKASLVNRYEGRNIQILTPKNCQTVQLKFRGEKT 340
Query: 292 IDVSVNLLKEEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEA 338
V L E A + + G++V KD + +M +A
Sbjct: 341 AKVLGTLAGEGAA----------PGKVVSGIVVRKDFNYHIMAPSDA 377
>gi|360043111|emb|CCD78523.1| cleavage and polyadenylation specificity factor-related
[Schistosoma mansoni]
Length = 670
Score = 217 bits (552), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 126/343 (36%), Positives = 193/343 (56%), Gaps = 13/343 (3%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ I+YTGD++ DRHL A I RPD+LITE+TY I D + RE F
Sbjct: 168 MFLIEIAGVKILYTGDFSRQEDRHLMCAEIPPIRPDVLITEATYGIHIHDKREEREARFT 227
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH+ V RGG+ LIP FALGRAQEL ++L+ YW+ PIY+A L K Y+
Sbjct: 228 SLVHDIVTRGGRCLIPAFALGRAQELMLILDEYWDNHPELHDIPIYYASQLARKCMAVYQ 287
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
+I N++IR N F F+HI D+ GP VV A+PGM+ SGLS +F+
Sbjct: 288 TYIYAMNERIRNQLANNNPFCFRHISNLKSIEHFDDSGPCVVMASPGMMQSGLSRELFEN 347
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W + N +I+ G+CV+GT+ ++LS ++ + Q++ +K +++Y+SFSAH D +
Sbjct: 348 WCTDKRNGVIIAGYCVEGTLAKQILSLPTEVPTMSGQMLPLKCSVDYISFSAHTDYQQTS 407
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQT-DMKISIDVSV 296
I+ +P V+LVHGE ++M+ L +++E+ D E+C ++ K + V++
Sbjct: 408 AFIRELKPSYVVLVHGEQNEMLRLCGALQREYEDD-------ETCRLEIFTPKNCVPVNL 460
Query: 297 NLLKEEAVK-YNSEPPNPLKERQ-IHGVLVIKDSSISLMDVDE 337
E+ K S + KE Q I GVLV K+ + ++ E
Sbjct: 461 RFRGEKVAKILGSLAHSAPKEGQTISGVLVKKNFAYHILTPGE 503
>gi|256086716|ref|XP_002579538.1| cleavage and polyadenylation specificity factor-related
[Schistosoma mansoni]
Length = 670
Score = 217 bits (552), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 126/343 (36%), Positives = 193/343 (56%), Gaps = 13/343 (3%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ I+YTGD++ DRHL A I RPD+LITE+TY I D + RE F
Sbjct: 168 MFLIEIAGVKILYTGDFSRQEDRHLMCAEIPPIRPDVLITEATYGIHIHDKREEREARFT 227
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH+ V RGG+ LIP FALGRAQEL ++L+ YW+ PIY+A L K Y+
Sbjct: 228 SLVHDIVTRGGRCLIPAFALGRAQELMLILDEYWDNHPELHDIPIYYASQLARKCMAVYQ 287
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
+I N++IR N F F+HI D+ GP VV A+PGM+ SGLS +F+
Sbjct: 288 TYIYAMNERIRNQLASNNPFCFRHISNLKSIEHFDDSGPCVVMASPGMMQSGLSRELFEN 347
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W + N +I+ G+CV+GT+ ++LS ++ + Q++ +K +++Y+SFSAH D +
Sbjct: 348 WCTDKRNGVIIAGYCVEGTLAKQILSLPTEVPTMSGQMLPLKCSVDYISFSAHTDYQQTS 407
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQT-DMKISIDVSV 296
I+ +P V+LVHGE ++M+ L +++E+ D E+C ++ K + V++
Sbjct: 408 AFIRELKPSYVVLVHGEQNEMLRLCGALQREYEDD-------ETCRLEIFTPKNCVPVNL 460
Query: 297 NLLKEEAVK-YNSEPPNPLKERQ-IHGVLVIKDSSISLMDVDE 337
E+ K S + KE Q I GVLV K+ + ++ E
Sbjct: 461 RFRGEKVAKILGSLAHSAPKEGQTISGVLVKKNFAYHILTPGE 503
>gi|359486187|ref|XP_002271646.2| PREDICTED: cleavage and polyadenylation specificity factor subunit
3-I-like [Vitis vinifera]
Length = 693
Score = 216 bits (551), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 112/279 (40%), Positives = 163/279 (58%), Gaps = 10/279 (3%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V + ++YTGDY+ DRHL AA I + PD+ I ESTY + + RE+ F
Sbjct: 175 MFMVDIAGVRVLYTGDYSREEDRHLRAAEIPQFCPDICIIESTYGVQLHQPRHVREKRFT 234
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW----ERMNLQAPIYFAVGLTEKATNY 116
+H + +GG+VLIP +ALGRAQEL ++L+ YW E N+ PIY+A L ++
Sbjct: 235 DVIHSTISQGGRVLIPAYALGRAQELLLILDEYWSNHPELHNV--PIYYASPLAKRCMAV 292
Query: 117 YKMFITWTNQKIRKTFVQRNMFDFKHIRPFDKSF--IDNPGPMVVFATPGMLHSGLSLII 174
Y+ +I N++IR F N FDFKHI P KS ++ GP VV A+PG L SGLS +
Sbjct: 293 YQTYINSMNERIRNQFANSNPFDFKHISPL-KSIENFNDVGPSVVMASPGGLQSGLSRQL 351
Query: 175 FKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAK 234
F W + N ++PG+ V GT+ +++ K++ N + M + Y+SFSAHAD
Sbjct: 352 FDMWCSDKKNACVIPGYVVGGTLAKTIINEPKEVTLMNGLTAPLNMQVHYISFSAHADFA 411
Query: 235 GIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDC 273
++ P N++LVHGEA++M LK+K+ +F DC
Sbjct: 412 QTSTFLKELMPPNIILVHGEANEMGRLKQKLITQF-ADC 449
>gi|405963469|gb|EKC29039.1| Cleavage and polyadenylation specificity factor subunit 3
[Crassostrea gigas]
Length = 686
Score = 216 bits (550), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 110/275 (40%), Positives = 165/275 (60%), Gaps = 3/275 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGD++ DRHL AA I + PD++I ESTY T I + + RE F
Sbjct: 169 MFMIEIAGVRVLYTGDFSRQEDRHLMAAEIPRIHPDVVIIESTYGTHIHEKREDREARFT 228
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH+ V RGG+ LIPVFALGRAQEL ++L+ YW PIY+A L +K + Y+
Sbjct: 229 GLVHDIVSRGGRCLIPVFALGRAQELLLILDEYWSNHPELHDIPIYYASSLAKKCMSVYQ 288
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
+I N+KIR+ N F FKHI ++ GP VV A+PGM+ SGLS +F+
Sbjct: 289 TYINAMNEKIRRQINISNPFVFKHISNLKSMEHFEDIGPSVVLASPGMMQSGLSRELFES 348
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W + N I+ G+CV+GT+ +LS ++ Q + +K +++Y+SFSAH D K
Sbjct: 349 WCTDKRNGCIIAGYCVEGTLAKHILSEPDEIMTMTGQKLPLKCSVDYISFSAHTDYKQTS 408
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLD 272
+ ++ +P +V+LVHGE ++M LK + +E+ D
Sbjct: 409 EFLRALKPAHVVLVHGEQNEMSRLKAALIREYEDD 443
>gi|226505292|ref|NP_001151522.1| cleavage and polyadenylation specificity factor, 73 kDa subunit
[Zea mays]
gi|195647398|gb|ACG43167.1| cleavage and polyadenylation specificity factor, 73 kDa subunit
[Zea mays]
gi|224034229|gb|ACN36190.1| unknown [Zea mays]
gi|413932397|gb|AFW66948.1| cleavage and polyadenylation specificity factor, subunit [Zea mays]
Length = 694
Score = 216 bits (550), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 113/289 (39%), Positives = 165/289 (57%), Gaps = 10/289 (3%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V + I+YTGDY+ DRHL AA + + PD+ I ESTY + RE+ F
Sbjct: 180 MFMVDIAGVRILYTGDYSREEDRHLRAAELPQFSPDICIIESTYGVQQHQPRIVREKRFT 239
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
+ +H V +GG+VLIP FALGRAQEL ++L+ YW + + PIY+A L ++ Y+
Sbjct: 240 EVIHNTVSQGGRVLIPAFALGRAQELLLILDEYWSKHPELHKIPIYYASPLAKRCMAVYQ 299
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDN---PGPMVVFATPGMLHSGLSLIIF 175
+I N++IR F Q N F FKHI + IDN GP VV A+P L SGLS +F
Sbjct: 300 TYINSMNERIRNQFAQSNPFHFKHIESLNS--IDNFHDVGPSVVMASPSGLQSGLSRQLF 357
Query: 176 KKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKG 235
KW + N ++PG+ V+GT+ +++ +++ N + M++ Y+SFSAHAD
Sbjct: 358 DKWCTDKRNACVIPGYVVEGTLAKTIINEPREVALANGLTAPLHMSVHYISFSAHADFPQ 417
Query: 236 IMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF---NLDCFMPANGES 281
+ P N++LVHGEA++M LK+K+ +F N P N +S
Sbjct: 418 TSNFLDELRPPNIILVHGEANEMSRLKQKLITQFDGTNTKIVSPKNCQS 466
>gi|347965534|ref|XP_321933.5| AGAP001224-PA [Anopheles gambiae str. PEST]
gi|333470467|gb|EAA01794.5| AGAP001224-PA [Anopheles gambiae str. PEST]
Length = 690
Score = 216 bits (549), Expect = 8e-53, Method: Compositional matrix adjust.
Identities = 111/272 (40%), Positives = 166/272 (61%), Gaps = 3/272 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGD++ DRHL AA I RPD+LITESTY T I + + RE F
Sbjct: 172 MFMIEIAGIRVLYTGDFSRQEDRHLMAAEIPAMRPDVLITESTYGTHIHEKREDRENRFT 231
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW-ERMNLQA-PIYFAVGLTEKATNYYK 118
V + V +GG+ LIPVFALGRAQEL ++L+ YW + +LQ PIY+A L +K Y+
Sbjct: 232 SLVQKIVQQGGRCLIPVFALGRAQELLLILDEYWSQNPDLQEIPIYYASSLAKKCMAVYQ 291
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
+I N KIR+ N F F+ I D+ GP VV A+PGM+ SGLS +F+
Sbjct: 292 TYINAMNDKIRRQIAINNPFVFRFISNLKGIDHFDDVGPCVVMASPGMMQSGLSRELFES 351
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W N +I+ G+CV+GT+ +L +++ N Q + + M+++Y+SFSAH D +
Sbjct: 352 WCTDPKNGVIIAGYCVEGTLAKTILFEPEEITSMNGQKLPLNMSVDYISFSAHTDYQQTS 411
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF 269
+ I+ +P +V+LVHGE ++M LK + +E+
Sbjct: 412 EFIRLLQPTHVVLVHGEQNEMGRLKSALMREY 443
>gi|156379813|ref|XP_001631650.1| predicted protein [Nematostella vectensis]
gi|156218694|gb|EDO39587.1| predicted protein [Nematostella vectensis]
Length = 688
Score = 215 bits (548), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 110/274 (40%), Positives = 170/274 (62%), Gaps = 7/274 (2%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ I+YTGD++ DRHL AA I PD+LI ESTY T I + + RE F
Sbjct: 173 MFMLEIAGVKILYTGDFSRQEDRHLMAAEIPSISPDVLIIESTYGTHIHEKREEREARFT 232
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH+ V+RGG+ LIPVFALGRAQEL ++L+ YW+ PIY+A L +K + ++
Sbjct: 233 GTVHDIVNRGGRCLIPVFALGRAQELLLILDEYWQNHPELHDIPIYYASQLAKKCMSVFQ 292
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHI---RPFDKSFIDNPGPMVVFATPGMLHSGLSLIIF 175
++ N KI+K N F FKHI + D+ D+ GP VV A+PGM+ SGLS +F
Sbjct: 293 TYVNAMNDKIKKQIAISNPFVFKHISNLKSIDQ--FDDIGPSVVMASPGMMQSGLSRELF 350
Query: 176 KKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKG 235
++W N +I+ G+CV+GT+ ++S ++++ + Q I K +++Y+SFSAH D +
Sbjct: 351 EQWCTDRRNGVIIAGYCVEGTLAKNLMSEPEEVQTMSGQKIPRKCSVDYISFSAHTDFEQ 410
Query: 236 IMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF 269
+ I+ +P +++LVHGE ++M LK + +E+
Sbjct: 411 TSEFIRILKPPHIVLVHGEQNEMGRLKAALIREY 444
>gi|328350068|emb|CCA36468.1| hypothetical protein PP7435_Chr1-0308 [Komagataella pastoris CBS
7435]
Length = 741
Score = 215 bits (548), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 138/434 (31%), Positives = 236/434 (54%), Gaps = 31/434 (7%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ +++TGD++ DRHL A + RPD+LITEST+ T + + +E+
Sbjct: 116 MFFIEIAGIKVLFTGDFSREEDRHLSVAEVPPVRPDVLITESTFGTATHEPREEKEKKLT 175
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW-ERMNLQA-PIYFAVGLTEKATNYYK 118
+H + GG+VL+PVFALGRAQEL ++L+ YW + +L+ +Y+A L K Y+
Sbjct: 176 TMIHSTLANGGRVLMPVFALGRAQELLLILDEYWSQHQDLENIKVYYASDLARKCLAVYQ 235
Query: 119 MFITWTNQKIRKTF--VQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIF 175
+I N+ IRK F +N F F++I+ S D+ P VV A+PGML +G+S +
Sbjct: 236 TYINMMNENIRKKFRDTNKNPFQFQYIKNIKNLSKFDDFQPSVVVASPGMLQNGVSRALL 295
Query: 176 KKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDV----KMAIEYMSFSAHA 231
+KWAP N LIM G+ V+GT+ ++L ++ E ++Q DV +M +E +SF+AH
Sbjct: 296 EKWAPDPRNTLIMTGYSVEGTMAKEIL--LEPTEIPSQQNPDVLIPRRMTVEEISFAAHV 353
Query: 232 DAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFN--------LDCFMPANGESCF 283
D + + I+ PK ++LVHGE++ M LK + +++ + + P N E
Sbjct: 354 DYEQNSKFIELVNPKTIVLVHGESNPMGRLKSALLSKYSKYKNTPDEVKVYNPRNCEDLL 413
Query: 284 VQTDMKISIDVSVNLLKEEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVG 343
V+ I I ++ + E+ K + K +I+GVLV K+ +SL+ + + + G
Sbjct: 414 VEFK-GIKIAKAMGTITEDVNKVLKQDVKSAKS-EINGVLVQKNFDLSLLKIQDLREYTG 471
Query: 344 ISRHIVRFTSNVQISDSSPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLLSV 403
++ +++ ++ +S + L L L D ++EY++++ +DSVLL++
Sbjct: 472 LTTTMIKQRQTLRSHATSSLVHYHLLQMFGYLDILIDDAEEYEVKI------MDSVLLNM 525
Query: 404 ERIDDKNKRVFVTW 417
DK V V W
Sbjct: 526 ----DKRGIVTVEW 535
>gi|195569857|ref|XP_002102925.1| GD20157 [Drosophila simulans]
gi|194198852|gb|EDX12428.1| GD20157 [Drosophila simulans]
Length = 684
Score = 215 bits (547), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 126/358 (35%), Positives = 201/358 (56%), Gaps = 19/358 (5%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ I+YTGD++ DRHL AA + +PD+LITESTY T I + + RE F
Sbjct: 170 MFMIEIAGIKILYTGDFSRQEDRHLMAAEVPPMKPDVLITESTYGTHIHEKREDRENRFT 229
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW-ERMNL-QAPIYFAVGLTEKATNYYK 118
V + V +GG+ LIPVFALGRAQEL ++L+ +W + +L + PIY+A L +K Y+
Sbjct: 230 SLVQKIVQQGGRCLIPVFALGRAQELLLILDEFWSQNPDLHEIPIYYASSLAKKCMAVYQ 289
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
+I N +IR+ N F F+HI ++ GP V+ A+PGM+ SGLS +F+
Sbjct: 290 TYINAMNDRIRRQIAVNNPFVFRHISNLKGIDHFEDIGPCVIMASPGMMQSGLSRELFES 349
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W N +I+ G+CV+GT+ VLS +++ + Q + + M+++Y+SFSAH D +
Sbjct: 350 WCTDPKNGVIIAGYCVEGTLAKTVLSEPEEITTLSGQKLPLNMSVDYISFSAHTDYQQTS 409
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDC------FMPANGESCFVQTDMKIS 291
+ I+ +P +V+LVHGE ++M LK +++E+ D + P N + D+
Sbjct: 410 EFIRLLKPTHVVLVHGEQNEMSRLKLALQREYEADASTDIKFYNPRNTHA----VDLYFR 465
Query: 292 IDVSVNLLKEEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIV 349
+ + ++ A K NSE + L GVLV +D L+ + K +S +V
Sbjct: 466 GEKTAKVMGSLAAK-NSEVGSKLS-----GVLVKRDFKYHLLAPSDLGKYTDMSMSVV 517
>gi|195343244|ref|XP_002038208.1| GM18692 [Drosophila sechellia]
gi|194133058|gb|EDW54626.1| GM18692 [Drosophila sechellia]
Length = 684
Score = 215 bits (547), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 126/358 (35%), Positives = 201/358 (56%), Gaps = 19/358 (5%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ I+YTGD++ DRHL AA + +PD+LITESTY T I + + RE F
Sbjct: 170 MFMIEIAGIKILYTGDFSRQEDRHLMAAEVPPMKPDVLITESTYGTHIHEKREDRENRFT 229
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW-ERMNL-QAPIYFAVGLTEKATNYYK 118
V + V +GG+ LIPVFALGRAQEL ++L+ +W + +L + PIY+A L +K Y+
Sbjct: 230 SLVQKIVQQGGRCLIPVFALGRAQELLLILDEFWSQNPDLHEIPIYYASSLAKKCMAVYQ 289
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
+I N +IR+ N F F+HI ++ GP V+ A+PGM+ SGLS +F+
Sbjct: 290 TYINAMNDRIRRQIAVNNPFVFRHISNLKGIDHFEDIGPCVIMASPGMMQSGLSRELFES 349
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W N +I+ G+CV+GT+ VLS +++ + Q + + M+++Y+SFSAH D +
Sbjct: 350 WCTDPKNGVIIAGYCVEGTLAKTVLSEPEEITTLSGQKLPLNMSVDYISFSAHTDYQQTS 409
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDC------FMPANGESCFVQTDMKIS 291
+ I+ +P +V+LVHGE ++M LK +++E+ D + P N + D+
Sbjct: 410 EFIRLLKPTHVVLVHGEQNEMSRLKLALQREYEADASTDIKFYNPRNTHA----VDLYFR 465
Query: 292 IDVSVNLLKEEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIV 349
+ + ++ A K NSE + L GVLV +D L+ + K +S +V
Sbjct: 466 GEKTAKVMGSLAAK-NSEVGSKLS-----GVLVKRDFKYHLLAPSDLGKYTDMSMSVV 517
>gi|24648013|ref|NP_650738.1| cleavage and polyadenylation specificity factor 73 [Drosophila
melanogaster]
gi|21430620|gb|AAM50988.1| RE31408p [Drosophila melanogaster]
gi|23171662|gb|AAF55578.2| cleavage and polyadenylation specificity factor 73 [Drosophila
melanogaster]
gi|220948314|gb|ACL86700.1| CG7698-PA [synthetic construct]
Length = 684
Score = 215 bits (547), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 123/352 (34%), Positives = 198/352 (56%), Gaps = 7/352 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ I+YTGD++ DRHL AA + +PD+LITESTY T I + + RE F
Sbjct: 170 MFMIEIAGIKILYTGDFSRQEDRHLMAAEVPPMKPDVLITESTYGTHIHEKREDRENRFT 229
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW-ERMNL-QAPIYFAVGLTEKATNYYK 118
V + V +GG+ LIPVFALGRAQEL ++L+ +W + +L + PIY+A L +K Y+
Sbjct: 230 SLVQKIVQQGGRCLIPVFALGRAQELLLILDEFWSQNPDLHEIPIYYASSLAKKCMAVYQ 289
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
+I N +IR+ N F F+HI ++ GP V+ A+PGM+ SGLS +F+
Sbjct: 290 TYINAMNDRIRRQIAVNNPFVFRHISNLKGIDHFEDIGPCVIMASPGMMQSGLSRELFES 349
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W N +I+ G+CV+GT+ VLS +++ + Q + + M+++Y+SFSAH D +
Sbjct: 350 WCTDPKNGVIIAGYCVEGTLAKAVLSEPEEITTLSGQKLPLNMSVDYISFSAHTDYQQTS 409
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVN 297
+ I+ +P +V+LVHGE ++M LK +++E+ D A+ + F ++D+
Sbjct: 410 EFIRLLKPTHVVLVHGEQNEMSRLKLALQREYEAD----ASTDIKFYNPRNTHAVDLYFR 465
Query: 298 LLKEEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIV 349
K V + N ++ GVLV +D L+ + K +S +V
Sbjct: 466 GEKTAKVMGSLAAKNSEVGSKLSGVLVKRDFKYHLLAPSDLGKYTDMSMSVV 517
>gi|328867689|gb|EGG16071.1| beta-lactamase domain-containing protein [Dictyostelium
fasciculatum]
Length = 786
Score = 215 bits (547), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 126/339 (37%), Positives = 191/339 (56%), Gaps = 18/339 (5%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V++ I+YTGD++ DRHL A D+LI ESTY + + + RER F
Sbjct: 184 MFMVEIAGVRILYTGDFSRQEDRHLMGAESPPVDVDVLIIESTYGVQVHEPRLERERRFT 243
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW-ERMNLQA-PIYFAVGLTEKATNYYK 118
+HE V RGG+ LIPVFALGRAQEL ++L+ YW L PIY+A L +K Y+
Sbjct: 244 TSIHEIVRRGGRCLIPVFALGRAQELLLILDEYWIAHPELHGIPIYYASALAKKCMKVYQ 303
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFD--KSFIDNPGPMVVFATPGMLHSGLSLIIFK 176
+I N++IR F N F FKHI+ + +F DN GP V A+PGML SGLS +F+
Sbjct: 304 TYIQMMNERIRAQFAVSNPFIFKHIKDINGIDNFNDN-GPCVFMASPGMLQSGLSRQLFE 362
Query: 177 KWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGI 236
+W N +++PG+ V+GT+ ++S ++ + + + +++ Y+SFSAH+D
Sbjct: 363 RWCSDRRNGVVIPGYSVEGTLAKHIMSEPSEITRLDGINVPLNLSVTYVSFSAHSDFLQT 422
Query: 237 MQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF-NLDCFMPANGESC-FVQTDMKISIDV 294
+ IQ P +++LVHG+A++M LK + +F ++ P N S V K++
Sbjct: 423 SEFIQDIHPPHIVLVHGDANEMSRLKHSLVSKFKTVNVMTPKNSMSVRMVFKPQKVA--- 479
Query: 295 SVNLLKEEAVKYNSEPPNPLKERQIHGVLVIKDSSISLM 333
++ E A NS P N + G+LVIKD + ++M
Sbjct: 480 --KIVGEMA---NSAPAN---GTSVQGLLVIKDFTHTIM 510
>gi|312372474|gb|EFR20427.1| hypothetical protein AND_20124 [Anopheles darlingi]
Length = 692
Score = 214 bits (546), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 111/272 (40%), Positives = 166/272 (61%), Gaps = 3/272 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGD++ DRHL AA I RPD+LITESTY T I + + RE F
Sbjct: 174 MFMIEIAGIRVLYTGDFSRQEDRHLMAAEIPAMRPDVLITESTYGTHIHEKREDRENRFT 233
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW-ERMNLQA-PIYFAVGLTEKATNYYK 118
V + V +GG+ LIPVFALGRAQEL ++L+ YW + +LQ PIY+A L +K Y+
Sbjct: 234 SLVQKIVTQGGRCLIPVFALGRAQELLLILDEYWSQNPDLQEIPIYYASSLAKKCMAVYQ 293
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
+I N KIR+ N F F+ I D+ GP VV A+PGM+ SGLS +F+
Sbjct: 294 TYINAMNDKIRRQIAINNPFVFRFISNLKGIDHFDDVGPCVVMASPGMMQSGLSRELFET 353
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W N +I+ G+CV+GT+ +L +++ N Q + + M+++Y+SFSAH D +
Sbjct: 354 WCTDPKNGVIIAGYCVEGTLAKTILFEPEEITSMNGQKLPLNMSVDYISFSAHTDYQQTS 413
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF 269
+ I+ +P +V+LVHGE ++M LK + +E+
Sbjct: 414 EFIRLLQPTHVVLVHGEQNEMGRLKSALMREY 445
>gi|194900154|ref|XP_001979622.1| GG16362 [Drosophila erecta]
gi|190651325|gb|EDV48580.1| GG16362 [Drosophila erecta]
Length = 684
Score = 214 bits (545), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 122/352 (34%), Positives = 197/352 (55%), Gaps = 7/352 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ I+YTGD++ DRHL AA + +PD+LITESTY T + + + RE F
Sbjct: 170 MFMIEIAGIKILYTGDFSRQEDRHLMAAEVPPMKPDVLITESTYGTHVHEKREDRENRFT 229
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW-ERMNL-QAPIYFAVGLTEKATNYYK 118
V + V +GG+ LIPVFALGRAQEL ++L+ +W + +L + PIY+A L +K Y+
Sbjct: 230 SLVQKIVQQGGRCLIPVFALGRAQELLLILDEFWSQNPDLHEIPIYYASSLAKKCMAVYQ 289
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
+I N +IR+ N F F+HI ++ GP V+ A+PGM+ SGLS +F+
Sbjct: 290 TYINAMNDRIRRQIAVNNPFVFRHISNLKGIDHFEDIGPCVIMASPGMMQSGLSRELFES 349
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W N +I+ G+CV+GT+ VLS +++ + Q + + M+++Y+SFSAH D +
Sbjct: 350 WCTDPKNGVIIAGYCVEGTLAKAVLSEPEEITTLSGQKLPLNMSVDYISFSAHTDYQQTS 409
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVN 297
+ I+ P +V+LVHGE ++M LK +++E+ D A+ + F ++D+
Sbjct: 410 EFIRLLRPTHVVLVHGEQNEMSRLKLALQREYEAD----ASTDIKFYNPRNTHAVDLYFR 465
Query: 298 LLKEEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIV 349
K V + N ++ GVLV +D L+ + K +S +V
Sbjct: 466 GEKTAKVMGSLAAKNSEVGSKLSGVLVKRDFKYHLLAPSDLGKYTDMSMSVV 517
>gi|195452860|ref|XP_002073532.1| GK13096 [Drosophila willistoni]
gi|194169617|gb|EDW84518.1| GK13096 [Drosophila willistoni]
Length = 684
Score = 214 bits (545), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 122/352 (34%), Positives = 196/352 (55%), Gaps = 7/352 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ I+YTGD++ DRHL AA + +PD+LITESTY T I + + RE F
Sbjct: 170 MFMIEIAGIKILYTGDFSRQEDRHLMAAEVPPTKPDVLITESTYGTHIHEKREDRESRFT 229
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW-ERMNL-QAPIYFAVGLTEKATNYYK 118
V + V +GG+ LIPVFALGRAQEL ++L+ +W + +L + PIY+A L +K Y+
Sbjct: 230 SLVQKTVMQGGRCLIPVFALGRAQELLLILDEFWSQNPDLHEIPIYYASSLAKKCMAVYQ 289
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
+I N +IR+ N F F+HI ++ GP V+ A+PGM+ SGLS +F+
Sbjct: 290 TYINAMNDRIRRQIAVNNPFVFRHISNLKGIDHFEDIGPCVIMASPGMMQSGLSRELFES 349
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W N +I+ G+CV+GT+ +LS +++ + Q + + M+++Y+SFSAH D +
Sbjct: 350 WCTDPKNGVIVAGYCVEGTLAKTILSEPEEITTLSGQKLPLNMSVDYISFSAHTDYQQTS 409
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVN 297
+ I+ P +V+LVHGE ++M LK +++E+ D A + F ++D+
Sbjct: 410 EFIRMLRPNHVVLVHGEQNEMSRLKLALQREYEAD----ATTDIKFYNPRNTHAVDLYFR 465
Query: 298 LLKEEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIV 349
K V + P ++ GVLV +D L+ + K +S +V
Sbjct: 466 GEKTAKVMGHLAANKPELASKLSGVLVKRDFKYHLLAPSDLGKYTDMSMSVV 517
>gi|384499309|gb|EIE89800.1| hypothetical protein RO3G_14511 [Rhizopus delemar RA 99-880]
Length = 654
Score = 214 bits (544), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 121/371 (32%), Positives = 204/371 (54%), Gaps = 24/371 (6%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGDY+ DRHL AA + D+LITESTY + + +E F
Sbjct: 155 MFLIEIAGVKVLYTGDYSREEDRHLMAAEKPEGSVDVLITESTYGVQSHEPRIAKETRFT 214
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWE-RMNLQA-PIYFAVGLTEKATNYYK 118
VH V RGG+ L+PVFALGRAQEL ++L+ +WE L + PIY+A L ++ Y+
Sbjct: 215 SLVHNIVTRGGRCLMPVFALGRAQELLLILDEFWEAHPELDSIPIYYASSLAKRCMAVYQ 274
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
+I N +IRK F N F FKHI ++ GP V+ A+PGML +GLS +F++
Sbjct: 275 TYINMMNARIRKQFAISNPFVFKHISNLKNVEQFEDSGPCVMMASPGMLQNGLSRELFER 334
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
WAP + N L++ G+CV+ T+ + ++ + + + + +KM+++Y+SFSAH D
Sbjct: 335 WAPDKKNGLVITGYCVENTLARQAMNEPSDFQAMDGRKVPLKMSVDYISFSAHVDFTQNS 394
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF-----NLDCFMPANGESCFVQTDMKISI 292
+ I + +V+LVHGEA+ M LK ++ ++ N+ + P N ++ +
Sbjct: 395 KFIDEVKAPHVILVHGEANAMYRLKSALQSKYSEKEENVTIYTPKNCDTV----KLHFRG 450
Query: 293 DVSVNLLKEEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGI------SR 346
+ + A KY P + + ++GVL++KD +++M ++ + G+ R
Sbjct: 451 EKMAKTIGRLAAKY------PTENQALNGVLLVKDFQLNIMSPEDLNELGGLITTVITQR 504
Query: 347 HIVRFTSNVQI 357
+V F + + +
Sbjct: 505 QVVPFNAGIGL 515
>gi|303275006|ref|XP_003056813.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226461165|gb|EEH58458.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 803
Score = 213 bits (543), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 125/341 (36%), Positives = 191/341 (56%), Gaps = 18/341 (5%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V + ++YTGDY+ TPDRHL A + P ++I+ESTY + + RE+ F
Sbjct: 170 MFFVDIDGLRVLYTGDYSRTPDRHLPGADLPSVPPHVVISESTYGVSPHTPREEREKRFT 229
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQA--PIYFAVGLTEKATNYYK 118
+V++ ++RGGKVL+PV ALGRAQEL ++LE +W++ A PIY A L +A Y+
Sbjct: 230 DRVYQILNRGGKVLLPVVALGRAQELLLILEDHWKKHPELANVPIYQASALARRAMTVYQ 289
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDKS-FIDNPGPMVVFATPGMLHSGLSLIIFKK 177
+I N ++ F + N F F H++ + +D+ GP VV ATP ML SGLS +F+
Sbjct: 290 TYINVLNSDMKAAFEEANPFVFNHVQHLSHAGGLDDVGPCVVLATPSMLQSGLSRELFEM 349
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W NN +I+ F VQGT+ ++LS K + N I +KM+++ +SFSAHAD
Sbjct: 350 WCGDANNGVIIADFAVQGTLAREILSDCKSITSRNGGEIPLKMSVDAISFSAHADYPQTQ 409
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLD-----CFMPANGESCFVQTDMKISI 292
Q + P +V+LVHGE +M LK ++ + D + P N +S ++K S
Sbjct: 410 QFLDALAPPHVVLVHGETGEMGKLKRALEGKAAADGKTMSVYSPKNCQS----VEIKYSG 465
Query: 293 DVSVNLLKEEAVKYNSEPPNPLKERQIHGVLVIKDSSISLM 333
V ++ + A E P + +R + G+LV KD + L+
Sbjct: 466 SKIVKVMGKLA-----ENPPKMGDR-VRGLLVKKDFGLMLL 500
>gi|330842661|ref|XP_003293292.1| hypothetical protein DICPUDRAFT_158104 [Dictyostelium purpureum]
gi|325076396|gb|EGC30185.1| hypothetical protein DICPUDRAFT_158104 [Dictyostelium purpureum]
Length = 789
Score = 213 bits (542), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 109/290 (37%), Positives = 173/290 (59%), Gaps = 6/290 (2%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V++ I+YTGD++ DRHL A + D+LI ESTY + + + RE+ F
Sbjct: 186 MFMVEIAGVKILYTGDFSRQEDRHLMGAETPPVKVDVLIIESTYGVQVHEPRLEREKRFT 245
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW-ERMNLQA-PIYFAVGLTEKATNYYK 118
VH+ V RGG+ LIPVFALGRAQEL ++L+ YW +L PIY+A L +K Y+
Sbjct: 246 TSVHDVVSRGGRCLIPVFALGRAQELLLILDEYWIANPSLHGIPIYYASALAKKCMGVYR 305
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFD--KSFIDNPGPMVVFATPGMLHSGLSLIIFK 176
+I N ++R F N F+FK+I +SF DN GP V A+PGML SGLS +F+
Sbjct: 306 TYINMMNDRVRAQFDVSNPFEFKYISNIKGIESFDDN-GPCVFMASPGMLQSGLSRQLFE 364
Query: 177 KWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGI 236
+W + N +++PG+ V+GT+ ++S ++ + + + +++ Y+SFSAH+D
Sbjct: 365 RWCTSKRNGVVIPGYSVEGTLAKHIMSEPTEITRLDNVSVPLNLSVSYVSFSAHSDFLQT 424
Query: 237 MQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF-NLDCFMPANGESCFVQ 285
+ IQ +P +V+LVHG+A++M L+ + F +++ P N +S ++
Sbjct: 425 SEFIQEIQPPHVVLVHGDANEMSRLRNALIGRFKSINVLTPKNAQSVSLE 474
>gi|281206064|gb|EFA80253.1| beta-lactamase domain-containing protein [Polysphondylium pallidum
PN500]
Length = 656
Score = 213 bits (541), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 123/333 (36%), Positives = 187/333 (56%), Gaps = 18/333 (5%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ I+YTGD++ DRHL A D+LI ESTY + + + RE+ F
Sbjct: 69 MFMIEIAGVKILYTGDFSRQEDRHLMGAETPPVNVDILIIESTYGVQVHEPRLEREKRFT 128
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW-ERMNLQA-PIYFAVGLTEKATNYYK 118
+HE V RGG+ LIPVFALGRAQEL ++L+ YW LQ PIY+A L K + Y+
Sbjct: 129 SSIHEVVKRGGRCLIPVFALGRAQELLLILDEYWIAHPELQKIPIYYASALARKCMSVYQ 188
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFD--KSFIDNPGPMVVFATPGMLHSGLSLIIFK 176
+I N++IR F N F FKHI + F D+ GP V A+PGML SGLS +F+
Sbjct: 189 TYINMMNERIRAQFDLSNPFSFKHIENISGIERFTDD-GPCVFMASPGMLQSGLSRQLFE 247
Query: 177 KWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGI 236
+W + N +++PG+ V+GT+ ++S ++ + + + + + Y+SFSAH+D
Sbjct: 248 RWCSDKMNGVVIPGYNVEGTLAKHIMSEPDEITRTDGVNVPLHLTVTYVSFSAHSDFLQT 307
Query: 237 MQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF-NLDCFMPANGESCFVQTDMKISIDVS 295
+ IQ P +V+LVHG+A++M LK + +F ++ P N + VQ M+ +
Sbjct: 308 SEFIQEIHPPHVVLVHGDANEMSRLKTSLIAKFKTINIMTPKN--THIVQ--MEFRAEKV 363
Query: 296 VNLLKEEAVKYNSEPPNPLKE-RQIHGVLVIKD 327
+L A NP +E ++I G+LV KD
Sbjct: 364 AKMLGAIAA-------NPPRESKKIAGLLVTKD 389
>gi|217075845|gb|ACJ86282.1| unknown [Medicago truncatula]
gi|388517311|gb|AFK46717.1| unknown [Medicago truncatula]
Length = 277
Score = 213 bits (541), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 96/205 (46%), Positives = 138/205 (67%), Gaps = 1/205 (0%)
Query: 97 MNLQAPIYFAVGLTEKATNYYKMFITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGP 156
MNL+ PIYF+ GLT +A Y+KM I WT+QKI+ T+ N FDFK++ F++S +D PGP
Sbjct: 1 MNLKVPIYFSSGLTIQANTYHKMLIGWTSQKIKDTYSTHNAFDFKNVHKFERSMLDAPGP 60
Query: 157 MVVFATPGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSG-VKKLEFENKQI 215
V+FATPGML G SL +FK WAP E N++ +PG+C+ GT+GH++ SG K++ +
Sbjct: 61 CVLFATPGMLIGGFSLEVFKHWAPSEKNLVALPGYCMAGTVGHRLTSGKPTKVDTDPDTQ 120
Query: 216 IDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFM 275
IDV+ I ++FSAH D+KGIM L+++ PK+V+LVHG+ KMV LKE+I E + C
Sbjct: 121 IDVRCQIHQLAFSAHTDSKGIMDLVKFLSPKHVMLVHGDKPKMVSLKERIDSELGIPCSH 180
Query: 276 PANGESCFVQTDMKISIDVSVNLLK 300
PAN E + + ++ + S K
Sbjct: 181 PANNEIVTISSTQYVNAEASDTFTK 205
>gi|167526212|ref|XP_001747440.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163774275|gb|EDQ87907.1| predicted protein [Monosiga brevicollis MX1]
Length = 668
Score = 212 bits (540), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 123/345 (35%), Positives = 187/345 (54%), Gaps = 18/345 (5%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF++ + ++YTGD++ DRHL AA I +PD+LI EST ++++ RER F
Sbjct: 176 MFEIDIAGVKLLYTGDFSRDEDRHLMAASIPPIKPDILIAESTLGDLEHENRQDRERRFT 235
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
K+VH V RGG+ LIPVFALGRAQEL ++L+ YW++ PIY+A L ++ +K
Sbjct: 236 KEVHTIVQRGGRCLIPVFALGRAQELLLILDEYWQQHPELHNVPIYYASALAKRCMGVFK 295
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
F+ N KI++ N F F+ I K D+ G VV ATPGML +GLS +F++
Sbjct: 296 AFVNMMNPKIQQQMKISNPFQFQFIHNLRKLDEFDDHGSSVVLATPGMLQNGLSRELFER 355
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
WAP +N +I+ G+ V+GT+ H++L +++ + I +SF+AH D+
Sbjct: 356 WAPNRHNGVILAGYHVEGTLAHELLKQPRQIRSMAGGTVPRNCTIANISFNAHVDSIQNR 415
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFN-----LDCFMPANGESCFVQTDMKISI 292
I EP++++LVHG+ S+M LKE + ++F + + P N E + +
Sbjct: 416 DFIGELEPQHLVLVHGQESQMRKLKESVLKDFEQRDRLISVYNPKNTEKQLFHYRGEKNA 475
Query: 293 DVSVNLLKEEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDE 337
V L +E A R+I GVLV K +M DE
Sbjct: 476 KVLGKLAREFAA----------GSRRISGVLVSKAFDYKIMHPDE 510
>gi|32566029|ref|NP_502553.2| Protein CPSF-3 [Caenorhabditis elegans]
gi|26985920|emb|CAC44310.2| Protein CPSF-3 [Caenorhabditis elegans]
Length = 707
Score = 212 bits (539), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 106/276 (38%), Positives = 164/276 (59%), Gaps = 7/276 (2%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFLK 61
F +++ ++YTGD++ DRHL AA I P +LITESTY T + + RE+ F +
Sbjct: 166 FMIEIAGVRVLYTGDFSCLEDRHLCAAEIPPITPQVLITESTYGTQTHEDRAVREKRFTQ 225
Query: 62 KVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYKM 119
VH+ V RGG+ LIP FA+G AQEL ++L+ YWE P+Y+A L +K + Y+
Sbjct: 226 MVHDIVTRGGRCLIPAFAIGPAQELMLILDEYWESHQELHDIPVYYASSLAKKCMSVYQT 285
Query: 120 FITWTNQKIRKTFVQRNMFDFKHI---RPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFK 176
F+ N +I+K +N F FKH+ R D+ ++ GP VV ATPGML SG S +F+
Sbjct: 286 FVNGMNSRIQKQIAVKNPFIFKHVSTLRGMDQ--FEDAGPCVVLATPGMLQSGFSRELFE 343
Query: 177 KWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGI 236
W P N I+ G+CV+GT+ +LS +++ + + + ++M + Y+SFSAH D
Sbjct: 344 SWCPDTKNGCIIAGYCVEGTLAKHILSEPEEIVSLSGEKLPMRMQVGYVSFSAHTDYHQT 403
Query: 237 MQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLD 272
++ +P +++LVHGE +M LK I+++F D
Sbjct: 404 SNFVKALKPPHLVLVHGELHEMSRLKSGIERQFQDD 439
>gi|164658265|ref|XP_001730258.1| hypothetical protein MGL_2640 [Malassezia globosa CBS 7966]
gi|159104153|gb|EDP43044.1| hypothetical protein MGL_2640 [Malassezia globosa CBS 7966]
Length = 741
Score = 211 bits (538), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 125/356 (35%), Positives = 192/356 (53%), Gaps = 23/356 (6%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF + + ++YTGDY+ DRHL A + RPD+LI ESTY T + + +E F
Sbjct: 85 MFMIDMAGLRVLYTGDYSREEDRHLVQAEVPPMRPDVLICESTYGTQSLEPRLDKEMRFT 144
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
+H + RGG+VL+PVF LGRAQEL +LL+ YWE PIY+A L K + Y+
Sbjct: 145 SLIHSIIRRGGRVLLPVFVLGRAQELLLLLDEYWEAHPELHSVPIYYASSLARKCMSIYQ 204
Query: 119 MFITWTNQKIRKTFVQR-NMFDFKHI---RPFDKSFIDNPGPMVVFATPGMLHSGLSLII 174
+I NQ IR F +R N F FKH+ R DK D+ GP V+ A+PG + SG+S +
Sbjct: 205 TYIHTMNQHIRARFHRRDNPFVFKHVSNLRSLDK--FDDKGPCVMMASPGFMQSGISREL 262
Query: 175 FKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAK 234
++WAP + N +I+ G+ V+GT+ +LS + N Q I +M+++Y+SFSAH D
Sbjct: 263 LERWAPDKRNGVIVSGYSVEGTMARDILSDPDDIVALNGQRIPRRMSVDYISFSAHVDYT 322
Query: 235 GIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFN-----LDCFMPANGESCFVQTDMK 289
+ I + K+V+LVHGE M L+ ++ ++ + +MP N E +
Sbjct: 323 QNSRFIDQVKAKHVVLVHGELKNMSGLRAALQSRYSDREEEIHIYMPRNCEPL----TLS 378
Query: 290 ISIDVSVNLLKEEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGIS 345
D + ++ A + P P I G+LV KD S +++ ++ + G+
Sbjct: 379 FRADRTAKVIGSLA----AHAPKP--SDTIDGLLVAKDFSYTVLAPEDLTEFTGLG 428
>gi|290978816|ref|XP_002672131.1| predicted protein [Naegleria gruberi]
gi|284085705|gb|EFC39387.1| predicted protein [Naegleria gruberi]
Length = 749
Score = 211 bits (537), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 113/286 (39%), Positives = 168/286 (58%), Gaps = 6/286 (2%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V++ ++YTGD++ PDRHL A PD+LI ESTY + +S+ RE+ F
Sbjct: 191 MFMVEIAGVRVLYTGDFSRQPDRHLLGAETPTMSPDVLIVESTYGIQVHESQSEREKRFT 250
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWE-RMNLQ-APIYFAVGLTEKATNYYK 118
+ V E V RGG+ LIPVFALGRAQEL ++L+ +WE +LQ PIY+A L +K ++
Sbjct: 251 QMVTEIVKRGGRCLIPVFALGRAQELLLILDEFWETHQDLQHIPIYYASSLAKKCMTIFQ 310
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFD--KSFIDNPGPMVVFATPGMLHSGLSLIIFK 176
+I N KIRK F N F FKHI + F DN GP V+ A+PGML SGLS +F+
Sbjct: 311 TYINMMNDKIRKQFDIHNPFVFKHISNLRSIEDFQDN-GPCVIMASPGMLQSGLSKELFE 369
Query: 177 KWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGI 236
W N +I+ G+ V GT+ K++S + + N + ++M++ +SFSAH+D
Sbjct: 370 LWCQDAKNGVIIAGYSVDGTLAKKIMSEPETVTLSNGNTVPLRMSVRTISFSAHSDKAQT 429
Query: 237 MQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF-NLDCFMPANGES 281
+ I +P +++LVHG+ + LK ++ +F + + P N S
Sbjct: 430 EEFIGTIKPPHIILVHGDLANCNRLKHSLQSKFTDSKVYAPKNCTS 475
>gi|308492421|ref|XP_003108401.1| CRE-CPSF-3 protein [Caenorhabditis remanei]
gi|308249249|gb|EFO93201.1| CRE-CPSF-3 protein [Caenorhabditis remanei]
Length = 712
Score = 211 bits (536), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 120/346 (34%), Positives = 191/346 (55%), Gaps = 22/346 (6%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFLK 61
F +++ ++YTGD++ DRHL AA I P +LITESTY T + + RE+ F +
Sbjct: 166 FMIEIAGVRVLYTGDFSCLEDRHLCAAEIPPITPQVLITESTYGTQTHEERSVREKRFTQ 225
Query: 62 KVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYKM 119
VH+ V RGG+ LIP FA+G AQEL ++L+ YWE P+Y+A L +K + Y+
Sbjct: 226 MVHDIVTRGGRCLIPAFAIGPAQELMLILDEYWESHQELHDIPVYYASSLAKKCMSVYQT 285
Query: 120 FITWTNQKIRKTFVQRNMFDFKHI---RPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFK 176
F+ N +I+K +N F FKH+ R D+ ++ GP VV ATPGML SG S +F+
Sbjct: 286 FVNGMNSRIQKQIAVKNPFIFKHVSTLRGMDQ--FEDAGPCVVLATPGMLQSGFSRELFE 343
Query: 177 KWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGI 236
W N I+ G+CV+GT+ +L+ +++ + + + ++M + Y+SFSAH D
Sbjct: 344 NWCSDSKNGCIIAGYCVEGTLARHILTEPEEIVSLSGEKLPMRMQVGYVSFSAHTDFNQT 403
Query: 237 MQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFN-----LDCFMPANGESCFVQTDMKIS 291
++ +P +++LVHGE +M LK I+++F ++ P N E + D++
Sbjct: 404 SNFVKALKPPHLVLVHGELHEMSRLKAGIERQFQDANIPIEVHNPRNTE----RLDLQFR 459
Query: 292 IDVSVNLLKEEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDE 337
+ + ++ + A K P I GVLV + S SLM +E
Sbjct: 460 GEKTAKVIGKLAQKM------PENGEIISGVLVKNNFSYSLMVYEE 499
>gi|255084461|ref|XP_002508805.1| predicted protein [Micromonas sp. RCC299]
gi|226524082|gb|ACO70063.1| predicted protein [Micromonas sp. RCC299]
Length = 728
Score = 210 bits (534), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 113/287 (39%), Positives = 164/287 (57%), Gaps = 5/287 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V +G ++YTGDY+ TPDRHL A + P ++I E+TY + + RER F
Sbjct: 180 MFYVDIGGLRVLYTGDYSRTPDRHLPGADLPPIPPHVVIVEATYGVSPHSPREERERRFT 239
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW-ERMNLQA-PIYFAVGLTEKATNYYK 118
VH + RGGKVL+PV ALGRAQE+ ++LE YW + L+ PIY A L ++A Y+
Sbjct: 240 DMVHRVLTRGGKVLLPVVALGRAQEVLLILEDYWVKHPELKGVPIYQASALAKRAMTVYQ 299
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPF-DKSFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
+I N ++ F + N F F H+ + S +D+ GP VV ATP ML SGLS +F+
Sbjct: 300 TYINVLNSDMKAAFEESNPFVFNHVNHLANSSGLDDVGPCVVLATPSMLQSGLSRDLFES 359
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W N +I+ F VQGT+ ++LS K + Q + ++M ++ +SFSAHAD
Sbjct: 360 WCGDSKNGVIICDFAVQGTLAREILSDCKTVTSRTGQELPLRMTVDAISFSAHADYPQTQ 419
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDC--FMPANGESC 282
Q + P +V+LVHGEA +M LK ++ + D M N ++C
Sbjct: 420 QFLDALAPPHVVLVHGEAGEMGKLKRALEGKAAADGKKMMVYNPKNC 466
>gi|268552491|ref|XP_002634228.1| Hypothetical protein CBG01798 [Caenorhabditis briggsae]
Length = 722
Score = 209 bits (533), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 122/346 (35%), Positives = 188/346 (54%), Gaps = 22/346 (6%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFLK 61
F +++ ++YTGD++ DRHL AA I P +LITESTY T + + RE+ F +
Sbjct: 166 FMIEIAGVRVLYTGDFSCLEDRHLCAAEIPPVSPQVLITESTYGTQTHEDRSVREKRFTQ 225
Query: 62 KVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYKM 119
VH+ V RGG+ LIP FA+G AQEL ++L+ YWE P+Y+A L +K + Y+
Sbjct: 226 MVHDIVTRGGRCLIPAFAIGPAQELMLILDEYWEAHQELHDIPVYYASSLAKKCMSVYQT 285
Query: 120 FITWTNQKIRKTFVQRNMFDFKHI---RPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFK 176
F+ N +I+K +N F FKH+ R D+ ++ GP VV ATPGML SG S +F+
Sbjct: 286 FVNGMNSRIQKQIAIKNPFIFKHVSTLRGMDQ--FEDAGPCVVLATPGMLQSGFSRELFE 343
Query: 177 KWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGI 236
W N I+ G+CV+GT+ +L+ +++ + + + ++M + Y+SFSAH D
Sbjct: 344 NWCSDSKNGCIIAGYCVEGTLAKHILTEPEEIVSLSGEKLPMRMQVGYVSFSAHTDFNQT 403
Query: 237 MQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFN-----LDCFMPANGESCFVQTDMKIS 291
++ +P +++LVHGE +M LK I++ F ++ P N E +Q + +
Sbjct: 404 SNFVKTLKPPHLVLVHGELHEMSRLKAGIERLFQDINIPIEVHNPRNTERLELQFRGEKT 463
Query: 292 IDVSVNLLKEEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDE 337
V + L E P + I GVLV + S SLM DE
Sbjct: 464 AKV-IGKLAERM---------PENNKIISGVLVKNNFSYSLMAKDE 499
>gi|410032124|ref|XP_003307804.2| PREDICTED: integrator complex subunit 11-like [Pan troglodytes]
Length = 313
Score = 209 bits (532), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 94/145 (64%), Positives = 118/145 (81%)
Query: 158 VVFATPGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIID 217
VVFATPGMLH+G SL IF+KWA E NM+IMPG+CVQGT+GHK+LSG +KLE E +Q+++
Sbjct: 132 VVFATPGMLHAGQSLQIFRKWAGNEKNMVIMPGYCVQGTVGHKILSGQRKLEMEGRQVLE 191
Query: 218 VKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPA 277
VKM +EYMSFSAHADAKGIMQL+ EP++VLLVHGEA KM FLK+KI+QE + C+MPA
Sbjct: 192 VKMQVEYMSFSAHADAKGIMQLVGQAEPESVLLVHGEAKKMEFLKQKIEQELRVSCYMPA 251
Query: 278 NGESCFVQTDMKISIDVSVNLLKEE 302
NGE+ + T I + +S+ LLK E
Sbjct: 252 NGETVTLPTSPSIPVGISLGLLKRE 276
>gi|299116292|emb|CBN76100.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 752
Score = 209 bits (532), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 107/276 (38%), Positives = 168/276 (60%), Gaps = 13/276 (4%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGDY+M DRHL AA + PD+LI ESTY + + ++ RE F+
Sbjct: 62 MFMIEIAGVHVLYTGDYSMEADRHLMAAEMPSTSPDVLIVESTYGVQVHEPRKERESRFV 121
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWE--RMNLQAPIYFAVGLTEKATNYYK 118
V + V +GG+ LIPVFALGRAQEL ++L+ YW+ R PIY+A L K
Sbjct: 122 GTVSKAVKKGGRCLIPVFALGRAQELLLILDEYWQQHRELHHIPIYYASRLASKT----- 176
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHI---RPFDKSFIDNPGPMVVFATPGMLHSGLSLIIF 175
+I N+ IR+ N F F+HI + D+ D+ GP VV A+PGML SG+S ++F
Sbjct: 177 -YINMMNEHIRQQMDVANPFKFQHITNLKSIDQ--FDDSGPSVVMASPGMLQSGVSRMLF 233
Query: 176 KKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKG 235
+W + N +++PG+ V+GT+ K+LS +++ + ++ + +EY+SFSAH D
Sbjct: 234 DRWCTDDKNSVLIPGYSVEGTLAKKLLSMPDEVQGMDGRVRQRRCEVEYISFSAHVDFVQ 293
Query: 236 IMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNL 271
I+ +P NV+LVHGE + M+ LK +++++F +
Sbjct: 294 NKGFIEGVQPANVILVHGEETGMLRLKTELEKQFAM 329
>gi|324506922|gb|ADY42942.1| Cleavage and polyadenylation specificity factor subunit 3 [Ascaris
suum]
Length = 706
Score = 208 bits (530), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 117/346 (33%), Positives = 189/346 (54%), Gaps = 19/346 (5%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGD++ DRHL AA + PD+LI ESTY T + + + RE+ F
Sbjct: 177 MFMIEIAGVRVLYTGDFSRLEDRHLCAAELPTVSPDVLICESTYGTQVHEGREEREKRFT 236
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWE-RMNLQ-APIYFAVGLTEKATNYYK 118
VHE V RGG+ LIP FALGRAQEL ++L+ YWE LQ P+Y+A L +K Y+
Sbjct: 237 STVHEIVGRGGRCLIPAFALGRAQELLLILDEYWEAHPELQDIPVYYASSLAKKCMAVYQ 296
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
F++ N +I+K N F F+H+ ++ GP VV A+PGML +GLS +F+
Sbjct: 297 TFVSGMNSRIQKQIALNNPFVFRHVSNLKSIEHFEDVGPCVVLASPGMLQNGLSRELFEN 356
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W N I+ G+CV+GT+ +LS +++ + Q + +++ + Y+SFSAH D
Sbjct: 357 WCTDSKNGCIIAGYCVEGTLAKHILSEPEEIVTMSGQKLAMRLQVAYISFSAHTDYLQTS 416
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKI------KQEFNLDCFMPANGESCFVQTDMKIS 291
++ +++LVHGE ++M LK I +F+++ P N ES ++
Sbjct: 417 NFVRSMRSPHLVLVHGEINEMNRLKAAIIRQYEDDADFHIEVHNPRNTES----VELHFR 472
Query: 292 IDVSVNLLKEEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDE 337
+ + ++ + A+ P + GVLV ++ + LM D+
Sbjct: 473 GEKTAKVVGKMAMS------APKDGEMLSGVLVRRNFNYHLMRADD 512
>gi|358333242|dbj|GAA51791.1| cleavage and polyadenylation specificity factor subunit 3
[Clonorchis sinensis]
Length = 697
Score = 208 bits (530), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 126/384 (32%), Positives = 189/384 (49%), Gaps = 53/384 (13%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGD++ DRHL A I RPD+LITE+TY I D + RE F
Sbjct: 108 MFLIEIAGVKVLYTGDFSRQEDRHLMCAEIPHVRPDVLITEATYGIHIHDKREDREARFT 167
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
+ VH+ V RGG+ LIP FALGRAQEL ++L+ YW PIY+A L K Y+
Sbjct: 168 RLVHDIVGRGGRCLIPAFALGRAQELMLILDEYWANHPELHDIPIYYASQLARKCMAVYQ 227
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIR---------------------------------- 144
+I N+KIR N F F+HI
Sbjct: 228 TYIHAMNEKIRNQLANNNPFCFRHISNLKAMRSYSISEQTEHALASKAWLYVAYSRFPVI 287
Query: 145 ---------PFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQG 195
P D+ GP VV A+PGM+ SG+S +F+ W N +I+ G+CV+G
Sbjct: 288 GTVAAGTNVPTSIEHFDDSGPCVVMASPGMMQSGMSRELFENWCTDRRNGVIIAGYCVEG 347
Query: 196 TIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEA 255
T+ ++LS ++ + Q + +K +++Y+SFSAH D + I+ +P V+LVHGE
Sbjct: 348 TLAKQILSLPAEIPTMSGQTLPLKCSVDYISFSAHTDYQQTSAFIRELKPNYVILVHGEQ 407
Query: 256 SKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLKEEAVK-YNSEPPNPL 314
++M+ L +++E+ D C K + V++ E+ K S NP
Sbjct: 408 NEMLRLAGALQREYEDD------DTCCLELFTPKNCVPVNLRFRGEKVAKVLGSLARNPP 461
Query: 315 KERQ-IHGVLVIKDSSISLMDVDE 337
K Q I GVLV K+ + ++ +E
Sbjct: 462 KNNQSISGVLVKKNFAYHILTPEE 485
>gi|302806483|ref|XP_002984991.1| hypothetical protein SELMODRAFT_234671 [Selaginella moellendorffii]
gi|302825687|ref|XP_002994439.1| hypothetical protein SELMODRAFT_236963 [Selaginella moellendorffii]
gi|300137630|gb|EFJ04498.1| hypothetical protein SELMODRAFT_236963 [Selaginella moellendorffii]
gi|300147201|gb|EFJ13866.1| hypothetical protein SELMODRAFT_234671 [Selaginella moellendorffii]
Length = 677
Score = 208 bits (530), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 138/426 (32%), Positives = 218/426 (51%), Gaps = 33/426 (7%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V + ++YTGDY+ DRHL AA + + PD+ I ESTY I + RE+ F
Sbjct: 168 MFMVDIAGIRVLYTGDYSREEDRHLKAAEMPEFSPDVCIIESTYGVQIHQPRHVREKRFT 227
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWE-RMNLQ-APIYFAVGLTEKATNYYK 118
+ + + V GG+VLIP FALGRAQEL ++L+ YWE LQ PIY+A L +K Y+
Sbjct: 228 ETIAQTVSHGGRVLIPAFALGRAQELLLILDEYWEAHPELQHIPIYYASPLAKKCMAVYQ 287
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDKSF--IDNPGPMVVFATPGMLHSGLSLIIFK 176
+I N KI+ + N F+FKHI P KS ++ GP +V A+P L SGLS +F
Sbjct: 288 TYINSMNDKIKSQYENSNPFNFKHISPL-KSIEQFEDVGPSIVMASPSGLQSGLSRQLFD 346
Query: 177 KWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGI 236
+W N ++PG+ V+GT+ +L+ K++ + ++ + M + Y+SFSAHAD
Sbjct: 347 RWCQDRKNACVIPGYVVEGTLAKTILNEPKEVALVSGLVVPLNMRVVYISFSAHADFAQT 406
Query: 237 MQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF---NLDCFMPANGESC-FVQTDMKISI 292
+ P +++LVHGE ++M LK K++ +F N+ P N + F K++
Sbjct: 407 SAFLGELRPPHIVLVHGEQNEMGRLKVKLQAQFAEQNVKINSPKNCQPVEFFFKGEKVA- 465
Query: 293 DVSVNLLKEEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFT 352
+V L E+ P + + G+L+ K + LM D+ +SR ++
Sbjct: 466 -KAVGRLAEK---------TPDEGGAVSGLLIKKSFTYQLMAPDDLHVYTQLSRGSIQQR 515
Query: 353 SNVQISDSSPI-NKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLLSVERIDDKNK 411
+V + + +E +YD + + +SQ +I D V +S E N+
Sbjct: 516 LSVPYDGAFVVLRHRIEQMYDGVEHVTKAESQTLRIH--------DKVTISQE----SNE 563
Query: 412 RVFVTW 417
V + W
Sbjct: 564 HVVLQW 569
>gi|302808975|ref|XP_002986181.1| hypothetical protein SELMODRAFT_234972 [Selaginella moellendorffii]
gi|300146040|gb|EFJ12712.1| hypothetical protein SELMODRAFT_234972 [Selaginella moellendorffii]
Length = 684
Score = 208 bits (529), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 138/426 (32%), Positives = 218/426 (51%), Gaps = 33/426 (7%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V + ++YTGDY+ DRHL AA + + PD+ I ESTY I + RE+ F
Sbjct: 175 MFMVDIAGIRVLYTGDYSREEDRHLKAAEMPEFSPDVCIIESTYGVQIHQPRHVREKRFT 234
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWE-RMNLQ-APIYFAVGLTEKATNYYK 118
+ + + V GG+VLIP FALGRAQEL ++L+ YWE LQ PIY+A L +K Y+
Sbjct: 235 ETIAQTVSHGGRVLIPAFALGRAQELLLILDEYWEAHPELQHIPIYYASPLAKKCMAVYQ 294
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDKSF--IDNPGPMVVFATPGMLHSGLSLIIFK 176
+I N KI+ + N F+FKHI P KS ++ GP +V A+P L SGLS +F
Sbjct: 295 TYINSMNDKIKSQYENSNPFNFKHISPL-KSIEQFEDVGPSIVMASPSGLQSGLSRQLFD 353
Query: 177 KWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGI 236
+W N ++PG+ V+GT+ +L+ K++ + ++ + M + Y+SFSAHAD
Sbjct: 354 RWCQDRKNACVIPGYVVEGTLAKTILNEPKEVALVSGLVVPLNMRVVYISFSAHADFAQT 413
Query: 237 MQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF---NLDCFMPANGESC-FVQTDMKISI 292
+ P +++LVHGE ++M LK K++ +F N+ P N + F K++
Sbjct: 414 SAFLGELRPPHIVLVHGEQNEMGRLKVKLQAQFAEQNVKINSPKNCQPVEFFFKGEKVA- 472
Query: 293 DVSVNLLKEEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFT 352
+V L E+ P + + G+L+ K + LM D+ +SR ++
Sbjct: 473 -KAVGRLAEK---------TPDEGGAVSGLLIKKSFTYQLMAPDDLHVYTQLSRGSIQQR 522
Query: 353 SNVQISDSSPI-NKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLLSVERIDDKNK 411
+V + + +E +YD + + +SQ +I D V +S E N+
Sbjct: 523 LSVPYDGAFVVLRHRIEQMYDGVEHVTKAESQTLRIH--------DKVTISQE----SNE 570
Query: 412 RVFVTW 417
V + W
Sbjct: 571 HVVLQW 576
>gi|356525973|ref|XP_003531594.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
3-I-like [Glycine max]
Length = 688
Score = 207 bits (528), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 109/289 (37%), Positives = 161/289 (55%), Gaps = 12/289 (4%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V + ++YTGDY+ DRHL AA I + PD+ I ESTY + RE+ F
Sbjct: 172 MFMVDIAGVRVLYTGDYSREEDRHLRAAEIPQFSPDVCIIESTYGVQHHQPRHTREKRFT 231
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW----ERMNLQAPIYFAVGLTEKATNY 116
+H + +GG+VLIP +ALGRAQEL ++L+ YW E N+ PIY+A L +K
Sbjct: 232 DVIHSTISQGGRVLIPAYALGRAQELLLILDEYWANHPELHNI--PIYYASPLAKKCLTV 289
Query: 117 YKMFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIF 175
Y+ + N +++ + N F FKHI + GP VV A+PG L SGLS +F
Sbjct: 290 YETYTLSMNDRVQNA--KSNPFSFKHISALSSIEVFKDVGPSVVMASPGGLQSGLSRQLF 347
Query: 176 KKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKG 235
KW + N ++PGF V+GT+ +++ K++ N + M + Y+SFSAHAD+
Sbjct: 348 DKWCSDKKNTCVLPGFVVEGTLAKTIMTEPKEVTLMNGLSAPLNMQVHYISFSAHADSAQ 407
Query: 236 IMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF---NLDCFMPANGES 281
++ P N++LVHGEA++M LK+K+ +F N P N +S
Sbjct: 408 TSAFLEELNPPNIILVHGEANQMGRLKQKLTSQFADRNTKILTPKNCQS 456
>gi|323453344|gb|EGB09216.1| hypothetical protein AURANDRAFT_71470 [Aureococcus anophagefferens]
Length = 1101
Score = 207 bits (527), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 128/385 (33%), Positives = 205/385 (53%), Gaps = 22/385 (5%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++G ++YTGDY++ DRHL A + P +LI ESTY T +S+ RE F
Sbjct: 62 MFMIEIGGVRLLYTGDYSLEEDRHLVPAEVPTLEPHVLIMESTYGTQKHESRDVREALFT 121
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW-ERMNLQ-APIYFAVGLTEKATNYYK 118
+ V RGG+ LIPVFALGRAQEL ++L+ YW ER +LQ P+++A + +A Y+
Sbjct: 122 STIERIVQRGGRCLIPVFALGRAQELLLILDEYWKEREDLQRVPVFYASKMASRALRVYQ 181
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
+I N +R N F F H++ +D+ GP+VV A PGML SG+S +F +
Sbjct: 182 TYINMMNMHVRDQMDISNPFKFDHVQNLASIDDLDDSGPVVVLAAPGMLQSGVSRQLFDR 241
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
WA E N +++ G+ V+GT+ ++LS +++ ++ + + + +SFSAH D
Sbjct: 242 WASSERNGVVIAGYSVEGTLAKQILSEPDEVKTQDGRTQPRRCTVVSISFSAHVDYFQNF 301
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFN-------LDCFMPANGESCFVQTDMKI 290
Q ++ P N++LVHGE ++M LK K+ QE P NG++ +M+
Sbjct: 302 QFVESTMPNNIVLVHGEKNEMSRLKGKLVQETGKWPSESRPTVSSPENGQA----VNMRF 357
Query: 291 SID---VSVNLLKEEAVKYNSEPPN----PLKERQI-HGVLVIKDSSISLMDVDEACKEV 342
+ D V +E+ K+ + P K R + G+LV KD +L DE +
Sbjct: 358 ARDRRVACVGAREEDPFKHKRRADDADAAPEKRRVVKSGILVSKDLRCTLYADDELRESS 417
Query: 343 GISRHIVRFTSNVQISDSSPINKTL 367
+S +R T +Q++ + + L
Sbjct: 418 PLSVTSLRQTLRLQLASDVGVFRAL 442
>gi|145350779|ref|XP_001419775.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144580007|gb|ABO98068.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 767
Score = 207 bits (527), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 110/275 (40%), Positives = 159/275 (57%), Gaps = 3/275 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V +G ++YTGDY+ DRHL AA + P ++I ESTY + + RE F
Sbjct: 165 MFNVDIGGLRVLYTGDYSRIADRHLPAADVPAIPPHVVIVESTYGVSPHSPREEREIRFT 224
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW-ERMNLQ-APIYFAVGLTEKATNYYK 118
+KV + RGG+VL+PV ALGRAQEL ++LE +W + +LQ PIY A L KA Y+
Sbjct: 225 EKVQTILRRGGRVLLPVVALGRAQELLLILEDFWAQNPDLQRVPIYQASALARKAMTIYQ 284
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDKSF-IDNPGPMVVFATPGMLHSGLSLIIFKK 177
+I N ++ F + N F F H++ KS +D+ GP VV ATP ML SGLS +F+
Sbjct: 285 TYINVLNSDMKAAFEEANPFVFNHVKHVSKSSELDDVGPCVVLATPSMLQSGLSRELFES 344
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W N +I+ F VQGT+ ++LS V K+ + + + + M+++ +SFSAHAD
Sbjct: 345 WCEDPKNGVIIADFAVQGTLAREILSDVNKIIARDGRELQLNMSVDAISFSAHADYPQTQ 404
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLD 272
+ P +V+LVHGEA +M LK + + D
Sbjct: 405 AFLDALAPPHVILVHGEAGEMGRLKRALDNKAAAD 439
>gi|213409816|ref|XP_002175678.1| endoribonuclease ysh1 [Schizosaccharomyces japonicus yFS275]
gi|212003725|gb|EEB09385.1| endoribonuclease ysh1 [Schizosaccharomyces japonicus yFS275]
Length = 771
Score = 207 bits (527), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 125/374 (33%), Positives = 200/374 (53%), Gaps = 28/374 (7%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
M+ +++ ++YTGD++ DRHL A + +P++LI+ESTY T + +E L
Sbjct: 142 MYFIEIAGVKLLYTGDFSREEDRHLNIAEVPPQKPNILISESTYGTASHQPRLDKEARLL 201
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWE-RMNLQA-PIYFAVGLTEKATNYYK 118
VH V GG+VL+PVFALGRAQEL ++L+ YW L++ PIY+A L K Y+
Sbjct: 202 NLVHTTVRNGGRVLMPVFALGRAQELLLILDEYWHSHAELRSVPIYYASSLARKCMAVYQ 261
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHI---RPFDKSFIDNPGPMVVFATPGMLHSGLSLIIF 175
+I N KIRK F +RN F F++I R DK D+ GP V+ A+PGML +G+S +
Sbjct: 262 TYINMMNDKIRKAFAERNPFIFRYIKSLRSIDK--FDDIGPSVILASPGMLQNGVSRTLL 319
Query: 176 KKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKG 235
++WAP N L++ G+ V+GT+ + + ++ + Q I +M +E +SF+AH D
Sbjct: 320 ERWAPDARNTLLLTGYSVEGTMAKLIANEPIEITTLSGQKIPRRMTVEELSFAAHVDYIQ 379
Query: 236 IMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF-----NLDCFMPANGESCFVQTDMKI 290
+ I P +++LVHGE + LK + ++ ++ + P N C V ++
Sbjct: 380 NSEFIDAVNPDHIILVHGEQTNTGRLKSALMSKYHNKKMDVKVYNPKN---C-VPLELHF 435
Query: 291 SIDVSVNLLKEEAVKYNSEPPNPLKERQIHGVLVIKDSSISLM------DVDEACKEVGI 344
D V L A+K E N + + G+LV KDS LM D + V +
Sbjct: 436 KGDRIVKALGNIAIKKAKE--NDI----VSGILVQKDSIFKLMVAENLRDFSDLTTTVVM 489
Query: 345 SRHIVRFTSNVQIS 358
+ ++ F +N ++
Sbjct: 490 QKQVIPFYANFSLA 503
>gi|449016323|dbj|BAM79725.1| cleavage and polyadenylation specifity factor protein
[Cyanidioschyzon merolae strain 10D]
Length = 749
Score = 206 bits (525), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 114/293 (38%), Positives = 174/293 (59%), Gaps = 10/293 (3%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDF 59
MF V+V S++YTGD++ DRHL A I + D+LI ESTY + + +R RE F
Sbjct: 164 MFLVEVAGVSVLYTGDFSRQEDRHLMEAEIPRGIHIDVLICESTYGVQVHEPRRVREARF 223
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWE-RMNLQA-PIYFAVGLTEKATNYY 117
++V E V RGG+ L+PVFALGRAQEL ++LE YW+ LQ PIY++ + ++ Y
Sbjct: 224 TQRVAEVVKRGGRCLLPVFALGRAQELLLILEEYWDAHPELQEIPIYYSSSIAKRCMAIY 283
Query: 118 KMFITWTNQKIRKTFVQ-RNMFDFKH---IRPFDKSFIDNPGPMVVFATPGMLHSGLSLI 173
+I NQ I++ + + N F FK+ IR D+ ++ GP V A+PGML SG+S
Sbjct: 284 STYIHQMNQNIQQRYRRFGNPFAFKYVMNIRSLDE--FEDSGPCVFMASPGMLQSGMSRR 341
Query: 174 IFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADA 233
+F+KW N +I+PG+ VQGT+ +L+ + + Q + ++ +++Y++FSAH+D
Sbjct: 342 LFEKWCSDRRNGVILPGYSVQGTLAKYILTDPATVPRLDGQHVPLRCSVDYITFSAHSDF 401
Query: 234 KGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDC-FMPANGESCFVQ 285
+ I+ C P N++LVHGE S+M L + + FN F A E+ VQ
Sbjct: 402 MQTSEFIEQCRPSNLVLVHGEKSEMQRLAQALDSRFNRKATFEAAIAEASAVQ 454
>gi|157876175|ref|XP_001686447.1| putative cleavage and polyadenylation specificity factor
[Leishmania major strain Friedlin]
gi|68129521|emb|CAJ08064.1| putative cleavage and polyadenylation specificity factor
[Leishmania major strain Friedlin]
Length = 756
Score = 206 bits (524), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 122/298 (40%), Positives = 175/298 (58%), Gaps = 14/298 (4%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V + +YTGD++ PDRHL A + PD+LI EST +S+ RE F
Sbjct: 176 MFMVDIAGMRALYTGDFSRVPDRHLLGAEVPPYSPDILIAESTNGIRELESREEREHLFT 235
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWE-RMNLQ-APIYFAVGLTEKATNYYK 118
VH+ V RGG+ L+PVFALGRAQEL ++LE +W+ LQ PIY+A L ++ Y+
Sbjct: 236 SSVHDVVRRGGRCLVPVFALGRAQELLLILEEFWDAHKELQNIPIYYASSLAQRCMKLYQ 295
Query: 119 MFITWTNQKIRKTFV-QRNMFDFKHIRPF--DKSFIDNPGPMVVFATPGMLHSGLSLIIF 175
F++ N ++++ N F FK+IR KSF DN GP VV A+PGML SG+SL +F
Sbjct: 296 TFVSAMNDRVKQQHANHHNPFVFKYIRSLMDTKSFEDN-GPCVVLASPGMLQSGISLELF 354
Query: 176 KKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMA-IEYMSFSAHADAK 234
++W N +IM G+CV GTI VL+ K++ + +++ ++M+ IE +SFSAH+D +
Sbjct: 355 ERWCGDRRNGIIMAGYCVDGTIAKDVLAKPKEMTKPDGKVLPLRMSTIEAVSFSAHSDGR 414
Query: 235 GIMQLIQ-YCEPKNVLLVHGEASKMVFLKEKIKQEF---NLDCFMPANGESC---FVQ 285
IQ + K+ +LVHG M LK K+ Q+F N+ + N ES FVQ
Sbjct: 415 QTRDFIQNLTKVKHTILVHGNPGAMGQLKNKLLQDFRDRNMSVYTTMNQESIRIPFVQ 472
>gi|335775092|gb|AEH58456.1| cleavage and polyadenylation specificity facto subunit 3-like
protein, partial [Equus caballus]
Length = 499
Score = 206 bits (523), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 127/376 (33%), Positives = 203/376 (53%), Gaps = 30/376 (7%)
Query: 24 HLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFLKKVHECVDRGGKVLIPVFALGRA 83
HL AA I +PD+LI+ESTY T I + + RE F VH+ V+RGG+ LIPVFALGRA
Sbjct: 2 HLMAAEIPNIKPDILISESTYGTHIHEKREEREARFCNTVHDIVNRGGRGLIPVFALGRA 61
Query: 84 QELCILLETYWERMN--LQAPIYFAVGLTEKATNYYKMFITWTNQKIRKTFVQRNMFDFK 141
QEL ++L+ YW+ PIY+A L +K Y+ ++ N KIRK N F FK
Sbjct: 62 QELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQININNPFVFK 121
Query: 142 HIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHK 200
HI D+ GP VV A+PGM+ SGLS +F+ W + N +I+ G+CV+GT+
Sbjct: 122 HISNLKSMDHFDDIGPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKH 181
Query: 201 VLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVF 260
++S +++ + Q + +KM+++Y+SFSAH D + + I+ +P +V+LVHGE ++M
Sbjct: 182 IMSEPEEITTMSGQKLPLKMSVDYISFSAHTDYQQTSEFIRALKPPHVILVHGEQNEMAR 241
Query: 261 LKEKIKQEF------NLDCFMPANGESCFVQTDMKISIDVSVNLLKEEAVKYNS--EPPN 312
LK + +E+ +++ P N E+ V++N E+ K
Sbjct: 242 LKAALIREYEDNDEVHIEVHNPRNTEA------------VTLNFRGEKLAKVMGFLADKK 289
Query: 313 PLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDSSPINKTLELIYD 372
P + +++ G+LV ++ + ++ C +S + S V+ + + P L+Y
Sbjct: 290 PEQGQRVSGILVKRNFNYHILS---PCD---LSNYTDLAMSTVKQTQAIPYTGPFNLLYY 343
Query: 373 QLLSYLQDKSQEYKIQ 388
QL D +E +IQ
Sbjct: 344 QLQKLTGD-VEELEIQ 358
>gi|300706475|ref|XP_002995499.1| hypothetical protein NCER_101581 [Nosema ceranae BRL01]
gi|239604633|gb|EEQ81828.1| hypothetical protein NCER_101581 [Nosema ceranae BRL01]
Length = 671
Score = 205 bits (522), Expect = 9e-50, Method: Compositional matrix adjust.
Identities = 106/271 (39%), Positives = 162/271 (59%), Gaps = 5/271 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +K+ + ++YTGDY+ DRHL AA C+ LITESTY S+ RE F
Sbjct: 183 MFLIKIEDSVMLYTGDYSREEDRHLKAAESPNCKIHALITESTYGVQCHLSRDERESRFT 242
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQ---APIYFAVGLTEKATNYY 117
+ + V RGG+ L+PVFALGRAQEL ++L+ +W N Q PIY+A L +K Y
Sbjct: 243 STITKIVTRGGRCLLPVFALGRAQELLLILDEHWSN-NPQLHSIPIYYASALAKKCIGIY 301
Query: 118 KMFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFK 176
+ +I N I+K+ + +N F F++++ F ++ P V+ A+PGML SGLS +F+
Sbjct: 302 QTYINMMNDHIKKSSLIKNPFAFQYVKNLKSIDFFEDNSPCVIMASPGMLQSGLSRELFE 361
Query: 177 KWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGI 236
KW N +I+PG+ V GT+ ++L+ K++ ++ + M+++Y+SFSAH D
Sbjct: 362 KWCGDRRNGVIIPGYSVDGTLAKEILNEPKEVMSMKGNVLKLNMSVDYISFSAHVDFAQN 421
Query: 237 MQLIQYCEPKNVLLVHGEASKMVFLKEKIKQ 267
+Q I C+P ++ VHGEA++M LK I Q
Sbjct: 422 VQFIDECQPDHLFFVHGEANEMNRLKNVIAQ 452
>gi|393217572|gb|EJD03061.1| Metallo-hydrolase/oxidoreductase [Fomitiporia mediterranea MF3/22]
Length = 826
Score = 205 bits (522), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 122/363 (33%), Positives = 189/363 (52%), Gaps = 25/363 (6%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF + + I+YTGDY+ DRHL A I RPD+LI ESTY + + +E F
Sbjct: 162 MFLIDIAGLRILYTGDYSREEDRHLVKAEIPPVRPDVLIVESTYGVQGHEERDTKEHRFT 221
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH + RGG L+PVFALGRAQEL ++LE YW++ PIY+A L K Y+
Sbjct: 222 NLVHSIIRRGGHALLPVFALGRAQELLLILEDYWKKHPDLHNVPIYYASNLARKCMAVYQ 281
Query: 119 MFITWTNQKIRKTFVQR-NMFDFKH------IRPFDKSFIDNPGPMVVFATPGMLHSGLS 171
+I N IR F +R N F FKH +R ++K + P P V+ TPGML G S
Sbjct: 282 TYIHTMNSNIRSRFAKRDNPFVFKHVSNIPQVRGWEKRIAEGP-PCVILCTPGMLQPGPS 340
Query: 172 LIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHA 231
+ + WAP N LI+ G+ V+GT+ +++ +++ I +++++Y+SFSAH
Sbjct: 341 RELLELWAPDPRNGLIITGYSVEGTLARDIVNEPQEIPSVKGDSIPRRLSVDYISFSAHV 400
Query: 232 DAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF-----NLDCFMPANGESCFVQT 286
D ++ I+ + ++++LVHG+A+ M L+ ++Q + + P N E+
Sbjct: 401 DGPQNIEFIEQVKAQHIVLVHGDAAAMNRLRVSLQQRYKDREEEIKIHTPRNLET----- 455
Query: 287 DMKISIDVSVNLLKEEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISR 346
+ + + + S PP P + G+LV KD S +L+D + G+S
Sbjct: 456 ---LKLTFRGERVAKAIGTLASNPPKP--NSVLSGLLVSKDYSYTLLDPRDLRDFAGLST 510
Query: 347 HIV 349
IV
Sbjct: 511 CIV 513
>gi|328704356|ref|XP_001945120.2| PREDICTED: cleavage and polyadenylation specificity factor subunit
3-like [Acyrthosiphon pisum]
Length = 694
Score = 205 bits (521), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 121/339 (35%), Positives = 194/339 (57%), Gaps = 22/339 (6%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGD++ DRHL AA I RP++LITESTY T I + + RER F
Sbjct: 174 MFMIEIAGVKVLYTGDFSRQEDRHLMAAEIPPSRPEILITESTYGTHIHEKREERERRFT 233
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW----ERMNLQAPIYFAVGLTEKATNY 116
V++ V+RGG+ LIPVFALGRAQEL ++L+ YW E ++ PIY+A L +K
Sbjct: 234 MLVNDIVNRGGRCLIPVFALGRAQELLLILDEYWGLHPELHDI--PIYYASSLAKKCMAV 291
Query: 117 YKMFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIF 175
Y+ +I N +I++ N F FKHI ++ GP V+ A+PG++ SGLS +F
Sbjct: 292 YQTYINAMNDRIKRQIAVNNPFVFKHITNLKSIDHFEDIGPCVIMASPGVMESGLSRELF 351
Query: 176 KKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKG 235
+ W N +I+ G+ VQGT+ +LS + + Q + +KM+++Y+SFSAH D +
Sbjct: 352 EMWCTDSKNGVIIAGYVVQGTLAKAILSEPEDITTMTGQKLPLKMSVDYISFSAHTDYQQ 411
Query: 236 IMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF-----NLDCFMPANGESCFVQTDMKI 290
+ I +P +++LVHGE ++M LK + +E+ ++ + P N V D
Sbjct: 412 TREFINILKPPHIVLVHGEQNEMQRLKSALVREYEENSEDIKVYNPRNT----VGVDFYF 467
Query: 291 SIDVSVNLLKEEAVKYNSEPPNPLKERQIHGVLVIKDSS 329
+ + + ++ E AV+ P ++ + GVL+ K+ S
Sbjct: 468 TGEKTAKVMGEIAVE------KPAEDNVLSGVLLKKNFS 500
>gi|388852694|emb|CCF53612.1| related to YSH1-component of pre-mRNA polyadenylation factor PF I
[Ustilago hordei]
Length = 888
Score = 204 bits (520), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 107/276 (38%), Positives = 167/276 (60%), Gaps = 8/276 (2%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ I+YTGD++ DRHL A I +PD+LI ESTY T + + +E F
Sbjct: 180 MFLIEIAGLRILYTGDFSREEDRHLVQAEIPPVKPDVLICESTYGTQTHEPRHDKEHRFT 239
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW-ERMNLQA-PIYFAVGLTEKATNYYK 118
++H + RGG+VL+PVF LGRAQEL +LL+ YW LQ+ PIY+A L +K + Y+
Sbjct: 240 SQIHHIIKRGGRVLLPVFVLGRAQELLLLLDEYWAAHPELQSVPIYYASALAKKCISVYQ 299
Query: 119 MFITWTNQKIRKTFVQR-NMFDFKHI---RPFDKSFIDNPGPMVVFATPGMLHSGLSLII 174
+I N IR F +R N F FKHI R +K ++ GP V+ A+PG + SG+S +
Sbjct: 300 TYIHTMNDHIRTRFNRRDNPFVFKHISNLRSLEK--FEDRGPCVMMASPGFMQSGVSREL 357
Query: 175 FKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAK 234
++WAP + N LI+ G+ V+GT+ +L+ ++ N Q I +M+++Y+SFSAH D
Sbjct: 358 LERWAPDKRNGLIVSGYSVEGTMARNILNEPDEIIGMNGQKIPRRMSVDYISFSAHVDFA 417
Query: 235 GIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFN 270
+ I + ++++LVHGE + M L+ ++ +F
Sbjct: 418 QNSRFIDEIKAQHIVLVHGEQNNMSKLRAALQAKFT 453
>gi|401428833|ref|XP_003878899.1| cleavage and polyadenylation specificity factor,putative
[Leishmania mexicana MHOM/GT/2001/U1103]
gi|322495148|emb|CBZ30452.1| cleavage and polyadenylation specificity factor,putative
[Leishmania mexicana MHOM/GT/2001/U1103]
Length = 756
Score = 204 bits (519), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 122/298 (40%), Positives = 175/298 (58%), Gaps = 14/298 (4%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V + +YTGD++ PDRHL A + PD+LI EST +S+ RE+ F
Sbjct: 176 MFMVDIAGMRALYTGDFSRVPDRHLLGAEVPPYSPDILIAESTNGIRELESREEREQLFT 235
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWE-RMNLQ-APIYFAVGLTEKATNYYK 118
VHE V RGG+ L+PVFALGRAQEL ++LE +W+ LQ PIY+A L ++ Y+
Sbjct: 236 GSVHEVVRRGGRCLVPVFALGRAQELLLILEEFWDAHKELQNIPIYYASSLAQRCMKLYQ 295
Query: 119 MFITWTNQKIRKTFV-QRNMFDFKHIRPF--DKSFIDNPGPMVVFATPGMLHSGLSLIIF 175
F++ N ++++ N F FK+I KSF DN GP VV A+PGML SG+SL +F
Sbjct: 296 TFVSAMNDRVKQQHANHHNPFVFKYIHSLMDTKSFEDN-GPCVVLASPGMLQSGISLELF 354
Query: 176 KKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMA-IEYMSFSAHADAK 234
++W N +IM G+CV GTI VL+ K++ + +++ ++M+ IE +SFSAH+D +
Sbjct: 355 ERWCGDRRNGIIMAGYCVDGTIAKDVLAKPKEVTKPDGKVLPLRMSTIEAVSFSAHSDGR 414
Query: 235 GIMQLIQ-YCEPKNVLLVHGEASKMVFLKEKIKQEF---NLDCFMPANGESC---FVQ 285
IQ + K+ +LVHG M LK K+ Q+F N+ + N ES FVQ
Sbjct: 415 QTRDFIQNLTKVKHTILVHGNPGAMGQLKNKLLQDFRDRNMSVYTTMNQESIRIPFVQ 472
>gi|343428147|emb|CBQ71677.1| related to YSH1-component of pre-mRNA polyadenylation factor PF I
[Sporisorium reilianum SRZ2]
Length = 878
Score = 204 bits (519), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 119/356 (33%), Positives = 195/356 (54%), Gaps = 23/356 (6%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ I+YTGD++ DRHL A I +PD+LI ESTY T + + +E F
Sbjct: 180 MFLIEIAGLRILYTGDFSREEDRHLVQAEIPPVKPDVLICESTYGTQTHEPRLDKEHRFT 239
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
++H + RGG+VL+PVF LGRAQEL +LL+ YW PIY+A L +K + Y+
Sbjct: 240 SQIHHIIKRGGRVLLPVFVLGRAQELLLLLDEYWAAHPELHSVPIYYASALAKKCISVYQ 299
Query: 119 MFITWTNQKIRKTFVQR-NMFDFKHI---RPFDKSFIDNPGPMVVFATPGMLHSGLSLII 174
+I N IR F +R N F FKHI R +K ++ GP V+ A+PG + SG+S +
Sbjct: 300 TYIHTMNDHIRTRFNRRDNPFVFKHISNLRSLEK--FEDRGPCVMMASPGFMQSGVSREL 357
Query: 175 FKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAK 234
++WAP + N LI+ G+ V+GT+ +L+ ++ N Q I +M+++Y+SFSAH D
Sbjct: 358 LERWAPDKRNGLIVSGYSVEGTMARNILNEPDEIIGMNGQKIPRRMSVDYISFSAHVDFA 417
Query: 235 GIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF-----NLDCFMPANGESCFVQTDMK 289
+ I + ++++LVHGE + M L+ ++ F ++ P N E +Q +
Sbjct: 418 QNSRFIDEIKAQHIVLVHGEQNNMSKLRAALQARFTARGSDVKIHTPRNCEPLVLQFRAQ 477
Query: 290 ISIDVSVNLLKEEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGIS 345
+ +A+ + P P + + G+L+ KD + +++D + G+S
Sbjct: 478 RT---------AKAIGTIAAKP-PAQGDLVDGLLISKDFAYTILDPKDLTDFTGLS 523
>gi|242021401|ref|XP_002431133.1| conserved hypothetical protein [Pediculus humanus corporis]
gi|212516382|gb|EEB18395.1| conserved hypothetical protein [Pediculus humanus corporis]
Length = 547
Score = 204 bits (519), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 124/373 (33%), Positives = 201/373 (53%), Gaps = 42/373 (11%)
Query: 661 KVSQLMSEKKSYEVQVMSQVVAAVTNSCDSSHIIDLGGGQGYLSTILALQHGKKTLSLDY 720
K Q MS+KK +EV++ S V +++ N S HI+D+GGG+GYLSTILALQH K LSLD
Sbjct: 186 KFKQCMSKKKLHEVKLTSNVASSLCNLTKSKHIVDIGGGKGYLSTILALQHSLKVLSLDS 245
Query: 721 NQVNTHGAAVRSKKLEDSEKESKGPFQSYAGVINKKLWMRTQNRQTSHASVEHHGKNWKR 780
+ T GAA SK++ ++ G F G + N++ VE H K
Sbjct: 246 SDFKTCGAAKMSKRMATFWRD--GTFNKLKGKCPE------SNKEIRAKFVETHRK---- 293
Query: 781 KSKAPVVSIKDEELVVCKDKCKQITHFVTPDSDISSILSQAYPQDSLHNVCIMGLHTCGD 840
EL ++TP + + +IL + + Q+ + + + GLHTCGD
Sbjct: 294 -----------LELA-----------YITPKTKLKTILEKNFDQEIIEKITMTGLHTCGD 331
Query: 841 LSGTALRLFTK-SSLQCLVQVGCCYHLLEEEFIRSPFWKDVDQSLYEHGYGFPLSEHLRS 899
LS T L+LF S ++ +V GCCY+L+ E+F+ + KD + + +GFP+S+ L+
Sbjct: 332 LSPTCLKLFVNDSDIKSVVNFGCCYNLITEKFLYNS--KDDIRGENDKDFGFPMSDFLKE 389
Query: 900 RKFFLGRNVRMSGTQSPERVIDLKQTQTL---PLFYRALLEKYLRSKITINDEEPKVVGR 956
+KF +G RM + S E++ + +F+RA+L+ L + + +E +G+
Sbjct: 390 KKFHIGEQSRMVASYSLEKLQSTMSENAMLHTSMFFRAILQTILEKNLCL--KERVKIGK 447
Query: 957 LATKCSNFVEYVHRAVDKLKLDLEVDDEEVTRLFNSHQREYEYLQIYYFLKTALAPVIEA 1016
LA K NF EY+ +A + L++ +D+++ L+ + E +FL+ LAP+IE+
Sbjct: 448 LAYKIKNFHEYITKANEMCNLNINMDEKDAEELYEKYMSEKPKFIALFFLRVTLAPIIES 507
Query: 1017 LIVLDRVLYLREQ 1029
+I+LDR L+L E
Sbjct: 508 IILLDRYLFLLEN 520
Score = 48.1 bits (113), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 33/157 (21%), Positives = 72/157 (45%), Gaps = 7/157 (4%)
Query: 485 IKNTIDEMIEFITPNIACVNTHTVHFMVEKLWETTIPKAILSQVNNSPSFDSLLEEFWKS 544
+K + ++I+ + N++ ++LW +PKAI ++N S + ++ ++
Sbjct: 70 VKLHLTKLIQTFESGRSLANSNMTDAFTKQLWSKHVPKAIQVELNTQESLKTFIKHYFDI 129
Query: 545 RQDN----HVETNSELVKFFQAADKFRLTSLMNGESIITVDELFSKLQARKCGQVVESAK 600
+ + L+ +F ++ +T N ++I+ E + + K
Sbjct: 130 LNNETPVEFAQNYPNLISYFNNNLEYVIT---NDKNIVLSLEKLENIFTERNSLSKSDFK 186
Query: 601 VSQLMSEKKSYEVQVMSQVVAAVTNSCDSSHIIDLGS 637
Q MS+KK +EV++ S V +++ N S HI+D+G
Sbjct: 187 FKQCMSKKKLHEVKLTSNVASSLCNLTKSKHIVDIGG 223
Score = 42.4 bits (98), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 19/59 (32%), Positives = 35/59 (59%)
Query: 1036 SNLFLSLLTQVDDEEVTRLFDSHQREYEYLQIYYFLKTALAPVIEALIVLDRVLYLREQ 1094
+N +L +D+++ L++ + E +FL+ LAP+IE++I+LDR L+L E
Sbjct: 462 ANEMCNLNINMDEKDAEELYEKYMSEKPKFIALFFLRVTLAPIIESIILLDRYLFLLEN 520
>gi|71005902|ref|XP_757617.1| hypothetical protein UM01470.1 [Ustilago maydis 521]
gi|74703664|sp|Q4PEJ3.1|YSH1_USTMA RecName: Full=Endoribonuclease YSH1; AltName: Full=mRNA
3'-end-processing protein YSH1
gi|46097110|gb|EAK82343.1| hypothetical protein UM01470.1 [Ustilago maydis 521]
Length = 880
Score = 204 bits (519), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 119/356 (33%), Positives = 195/356 (54%), Gaps = 23/356 (6%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ I+YTGD++ DRHL A I +PD+LI ESTY T + + +E F
Sbjct: 180 MFLIEIAGLRILYTGDFSREEDRHLVQAEIPPVKPDVLICESTYGTQTHEPRLDKEHRFT 239
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
++H + RGG+VL+PVF LGRAQEL +LL+ YW PIY+A L +K + Y+
Sbjct: 240 SQIHHIIKRGGRVLLPVFVLGRAQELLLLLDEYWAAHPELHSVPIYYASALAKKCISVYQ 299
Query: 119 MFITWTNQKIRKTFVQR-NMFDFKHI---RPFDKSFIDNPGPMVVFATPGMLHSGLSLII 174
+I N IR F +R N F FKHI R +K ++ GP V+ A+PG + SG+S +
Sbjct: 300 TYIHTMNDHIRTRFNRRDNPFVFKHISNLRSLEK--FEDRGPCVMMASPGFMQSGVSREL 357
Query: 175 FKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAK 234
++WAP + N LI+ G+ V+GT+ +L+ ++ N Q I +M+++Y+SFSAH D
Sbjct: 358 LERWAPDKRNGLIVSGYSVEGTMARNILNEPDEIIGINGQKIPRRMSVDYISFSAHVDFA 417
Query: 235 GIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF-----NLDCFMPANGESCFVQTDMK 289
+ I + ++++LVHGE + M L+ ++ F ++ P N E +Q +
Sbjct: 418 QNSRFIDEIKAQHIVLVHGEQNNMSKLRAALQARFTARGSDVKIHTPRNCEPLVLQFRAQ 477
Query: 290 ISIDVSVNLLKEEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGIS 345
+ +A+ + P P + + G+L+ KD + +++D + G+S
Sbjct: 478 RTA---------KAIGTIAAKP-PAQGDIVDGLLISKDFAYTILDPKDLTDFTGLS 523
>gi|210075949|ref|XP_504965.2| YALI0F03817p [Yarrowia lipolytica]
gi|223634672|sp|Q6C2Z7.2|YSH1_YARLI RecName: Full=Endoribonuclease YSH1; AltName: Full=mRNA
3'-end-processing protein YSH1
gi|199424917|emb|CAG77772.2| YALI0F03817p [Yarrowia lipolytica CLIB122]
Length = 827
Score = 204 bits (518), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 134/405 (33%), Positives = 217/405 (53%), Gaps = 40/405 (9%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
M+ ++VG +++TGDY+ DRHL A + +PD+LI ESTY T + RE+
Sbjct: 190 MYTIEVGGVKVLFTGDYSREEDRHLNQAEVPPMKPDILICESTYGTGTHLPRLEREQRLT 249
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWE-RMNLQA-PIYFAVGLTEKATNYYK 118
+H +D+GGK L+PVFALGRAQE+ ++L+ YWE +LQ IY+A L +K Y+
Sbjct: 250 GLIHSTLDKGGKCLLPVFALGRAQEILLILDEYWEAHPDLQEFSIYYASALAKKCIAVYQ 309
Query: 119 MFITWTNQKIRKTFVQR--NMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIF 175
+I N IR+ F + N F FK+I+ D+ GP V+ A+PGML SG+S +
Sbjct: 310 TYINMMNDNIRRRFRDQKTNPFRFKYIKNIKNLDRFDDMGPCVMVASPGMLQSGVSRSLL 369
Query: 176 KKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVK----MAIEYMSFSAHA 231
++WAP N LI+ G+ V+GT+ ++++ + E + Q D+K +A+E +SF+AH
Sbjct: 370 ERWAPDPKNTLILTGYSVEGTMAKQIIN--EPNEIPSAQNPDLKVPRRLAVEELSFAAHV 427
Query: 232 DAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFN--------LDCFMPANGESCF 283
D + + I + KN++LVHGE + M LK + ++ + P N E
Sbjct: 428 DFQQNSEFIDLVDSKNIILVHGELNNMQRLKAALLAKYRGLKNSPREKTIYNPRNCEE-- 485
Query: 284 VQTDMK-ISIDVSVNLLKEEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEV 342
V+ K + + +V + EE P + I GV+V KD + LM V + + V
Sbjct: 486 VELAFKGVKVAKTVGKMAEE---------KPHVGQIISGVVVQKDFNYGLMGVADLREHV 536
Query: 343 GISRHIVRFTSNVQISDSSPINKTLELI---YDQLLSYLQDKSQE 384
G+S TS+V + +N ++L+ +Q+ Y++ + E
Sbjct: 537 GLS------TSSVLERQTVTVNAGVDLVKYHLEQMFGYVEMRETE 575
>gi|429966185|gb|ELA48182.1| hypothetical protein VCUG_00420 [Vavraia culicis 'floridensis']
Length = 669
Score = 203 bits (517), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 134/360 (37%), Positives = 192/360 (53%), Gaps = 49/360 (13%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHL-GA--AWIDKC---------------RPDLLITES 42
MF ++ I+YTGDY+ DRHL GA AWI K D+LI ES
Sbjct: 159 MFMIENDRVKILYTGDYSTEEDRHLKGADTAWISKYGNMDEKEHSNDETVHHLDVLICES 218
Query: 43 TYATTIRDSKRCRERDFLKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNL--Q 100
TY + RER F + V++ V RGGK L+PVFALGRAQEL ++LE YW+R
Sbjct: 219 TYGVQCHLPREERERRFTQVVNDIVTRGGKCLLPVFALGRAQELLLILEDYWDRNPHLHN 278
Query: 101 APIYFAVGLTEKATNYYKMFITWTNQKIRKTFVQRNMFDFKHIRPF---DKSFIDNPGPM 157
PIY+A L + + Y+ + N KI+K + F+FKHIR D I N
Sbjct: 279 IPIYYASALANRCLSIYQAYTHMMNLKIKK-----DAFNFKHIRNLKSVDNHLIKNA--C 331
Query: 158 VVFATPGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIID 217
VV A+PGML SGLS +F+ W NN ++PG+CVQGT+ ++++ K++ N +
Sbjct: 332 VVMASPGMLQSGLSRELFESWCEDANNGTVIPGYCVQGTLAKEIMTEPKEIVAMNGHRLR 391
Query: 218 VKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPA 277
+ M +EY+SFSAH D I+ C P+ V+LVHGE ++M+ LK ++++ ++
Sbjct: 392 LNMRVEYISFSAHVDYVQNTSFIEKCTPRLVMLVHGEVNEMMRLKAALEKKHSV--LALK 449
Query: 278 NGESCFVQTDMKISIDVSVNLLKEEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDE 337
NGE Q ++KI ++E+V L E I GV+V + S I + DE
Sbjct: 450 NGE----QHEIKI---------RKESVAVGKN----LVEGDIEGVVVNELSGIRVYRRDE 492
>gi|388579831|gb|EIM20151.1| Metallo-hydrolase/oxidoreductase [Wallemia sebi CBS 633.66]
Length = 626
Score = 203 bits (517), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 122/369 (33%), Positives = 199/369 (53%), Gaps = 19/369 (5%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGDY+ DRHL AA I K + D+LI ESTY +++ +E+ F
Sbjct: 187 MFLIEIAGLKVLYTGDYSREEDRHLHAAEIPKEQTDVLIVESTYGVQTLENRPEKEKRFT 246
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWER-MNLQA-PIYFAVGLTEKATNYYK 118
+ VH + RGG+VL+P FALGRAQEL ++L+ YW+R +L + PIY+A L K Y+
Sbjct: 247 ELVHNIIRRGGRVLMPSFALGRAQELLLILDEYWQRNPDLHSIPIYYASNLARKCMAVYQ 306
Query: 119 MFITWTNQKIRKTF-VQRNMFDFKHIRPF-DKSFIDNPGPMVVFATPGMLHSGLSLIIFK 176
+I N+ I + F N F FK I D + GP V+ A+PGML SG S + +
Sbjct: 307 AYIRTMNKNINRRFDSGENPFQFKFISELGDLRKWQDKGPCVMLASPGMLQSGTSRELLE 366
Query: 177 KWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGI 236
+WAP N LI+ G+ V+GT+ H +++ ++ N I K++++Y+SFSAH D
Sbjct: 367 RWAPDPKNGLIICGYSVEGTMAHSIVNEPDEITGINGNKIPRKLSVDYISFSAHVDFTQN 426
Query: 237 MQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF-----NLDCFMPANGESCFVQTDMKIS 291
Q I +P++V+LVHG + M L ++ + ++ MP N E ++ +++
Sbjct: 427 TQFIDEIKPQHVVLVHGALTNMSRLAAALRSRYADRGLDIKVHMPKNAEPLKLEFKPEMT 486
Query: 292 IDVSVNLLKEEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRF 351
L +++ P + + G+LV KD + +L+D + G++ + +
Sbjct: 487 AKAIGKLAEKQ----------PAEGEIVQGLLVNKDFTYTLLDRSDLKDFAGLATNTIIQ 536
Query: 352 TSNVQISDS 360
V I S
Sbjct: 537 QQKVNIGVS 545
>gi|356543411|ref|XP_003540154.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
3-I-like [Glycine max]
Length = 689
Score = 203 bits (517), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 107/289 (37%), Positives = 159/289 (55%), Gaps = 12/289 (4%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V + ++YTGDY+ DRHL AA + PD+ I ESTY + RE+ F
Sbjct: 173 MFMVDIAGVRVLYTGDYSREEDRHLRAAETPQFSPDVCIIESTYGVQHHQPRHTREKRFT 232
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW----ERMNLQAPIYFAVGLTEKATNY 116
+H + +GG+VLIP FALGRAQEL ++L+ YW E N+ PIY+A L +K
Sbjct: 233 DVIHSTISQGGRVLIPAFALGRAQELLLILDEYWANHPELQNI--PIYYASPLAKKCLTV 290
Query: 117 YKMFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIF 175
Y+ + N +I+ + N F FKH+ + GP VV A+PG L SGLS +F
Sbjct: 291 YETYTLSMNDRIQNA--KSNPFSFKHVSALSSIEVFKDVGPSVVMASPGGLQSGLSRQLF 348
Query: 176 KKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKG 235
W + N ++PG+ V+GT+ +++ K++ N + M + Y+SFSAHAD+
Sbjct: 349 DMWCSDKKNSCVLPGYVVEGTLAKTIINEPKEVTLMNGLTAPLNMQVHYISFSAHADSAQ 408
Query: 236 IMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF---NLDCFMPANGES 281
++ P N++LVHGEA++M LK+K+ +F N P N +S
Sbjct: 409 TSAFLEELNPPNIILVHGEANEMGRLKQKLISQFADRNTKILTPKNCQS 457
>gi|154336691|ref|XP_001564581.1| putative cleavage and polyadenylation specificity factor
[Leishmania braziliensis MHOM/BR/75/M2904]
gi|134061616|emb|CAM38647.1| putative cleavage and polyadenylation specificity factor
[Leishmania braziliensis MHOM/BR/75/M2904]
Length = 756
Score = 203 bits (516), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 120/298 (40%), Positives = 174/298 (58%), Gaps = 14/298 (4%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V + +YTGD++ PDRHL A + PD+LI EST +S+ RE F
Sbjct: 176 MFMVDIAGMRALYTGDFSRVPDRHLLGAEVPPYSPDILIAESTNGIRELESREEREHLFT 235
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWE-RMNLQ-APIYFAVGLTEKATNYYK 118
VH+ V RGG+ L+PVFALGRAQEL ++LE +W+ LQ PIY+A L ++ Y+
Sbjct: 236 SSVHDVVRRGGRCLVPVFALGRAQELLLILEEFWDAHKELQNIPIYYASSLAQRCMKLYQ 295
Query: 119 MFITWTNQKIRKTFV-QRNMFDFKHIRPF--DKSFIDNPGPMVVFATPGMLHSGLSLIIF 175
F++ N ++++ N F FK+I KSF DN GP VV A+PGML SG+SL +F
Sbjct: 296 TFVSAMNDRVKQQHANHHNPFVFKYIHSLIDTKSFEDN-GPCVVLASPGMLQSGISLELF 354
Query: 176 KKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMA-IEYMSFSAHADAK 234
++W N +IM G+CV GTI VL+ K++ + +++ ++M+ IE +SFSAH+D +
Sbjct: 355 ERWCGDRRNGIIMAGYCVDGTIAKDVLAKPKEVAKPDGKVLPLRMSTIEAVSFSAHSDGR 414
Query: 235 GIMQLIQY-CEPKNVLLVHGEASKMVFLKEKIKQEF---NLDCFMPANGESC---FVQ 285
I + + K+ +LVHG M LK K+ Q+F N+ + N ES FVQ
Sbjct: 415 QTRDFIHHLTKVKHTILVHGNPGAMGQLKNKLLQDFRDRNMSVYTTMNQESIRIPFVQ 472
>gi|146099573|ref|XP_001468678.1| putative cleavage and polyadenylation specificity factor
[Leishmania infantum JPCM5]
gi|134073046|emb|CAM71766.1| putative cleavage and polyadenylation specificity factor
[Leishmania infantum JPCM5]
Length = 756
Score = 203 bits (516), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 121/298 (40%), Positives = 174/298 (58%), Gaps = 14/298 (4%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V + +YTGD++ PDRHL A + PD+LI EST +S+ RE F
Sbjct: 176 MFMVDIAGMRALYTGDFSRVPDRHLLGAEVPPYSPDILIAESTNGIRELESREEREHLFT 235
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWE-RMNLQ-APIYFAVGLTEKATNYYK 118
VH+ V RGG+ L+PVFALGRAQEL ++LE +W+ LQ PIY+A L ++ Y+
Sbjct: 236 SSVHDVVRRGGRCLVPVFALGRAQELLLILEEFWDAHKELQNIPIYYASSLAQRCMKLYQ 295
Query: 119 MFITWTNQKIRKTFV-QRNMFDFKHIRPF--DKSFIDNPGPMVVFATPGMLHSGLSLIIF 175
F++ N ++++ N F FK+I KSF DN GP VV A+PGML SG+SL +F
Sbjct: 296 TFVSAMNDRVKQQHANHHNPFVFKYIHSLMDTKSFEDN-GPCVVLASPGMLQSGISLELF 354
Query: 176 KKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMA-IEYMSFSAHADAK 234
++W N +IM G+CV GTI VL+ K++ + +++ ++M+ IE +SFSAH+D +
Sbjct: 355 ERWCGDRRNGIIMAGYCVDGTIAKDVLAKPKEVAKPDGKVLPLRMSTIEAVSFSAHSDGR 414
Query: 235 GIMQLIQ-YCEPKNVLLVHGEASKMVFLKEKIKQEF---NLDCFMPANGESC---FVQ 285
IQ + K+ +LVHG M LK K+ Q+F N+ + N ES FVQ
Sbjct: 415 QTRDFIQSLTKVKHTILVHGNPGAMGQLKSKLLQDFRDRNMSVYTTMNQESIRIPFVQ 472
>gi|398022636|ref|XP_003864480.1| cleavage and polyadenylation specificity factor, putative
[Leishmania donovani]
gi|322502715|emb|CBZ37798.1| cleavage and polyadenylation specificity factor, putative
[Leishmania donovani]
Length = 756
Score = 203 bits (516), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 121/298 (40%), Positives = 174/298 (58%), Gaps = 14/298 (4%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V + +YTGD++ PDRHL A + PD+LI EST +S+ RE F
Sbjct: 176 MFMVDIAGMRALYTGDFSRVPDRHLLGAEVPPYSPDILIAESTNGIRELESREEREHLFT 235
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWE-RMNLQ-APIYFAVGLTEKATNYYK 118
VH+ V RGG+ L+PVFALGRAQEL ++LE +W+ LQ PIY+A L ++ Y+
Sbjct: 236 SSVHDVVRRGGRCLVPVFALGRAQELLLILEEFWDAHKELQNIPIYYASSLAQRCMKLYQ 295
Query: 119 MFITWTNQKIRKTFV-QRNMFDFKHIRPF--DKSFIDNPGPMVVFATPGMLHSGLSLIIF 175
F++ N ++++ N F FK+I KSF DN GP VV A+PGML SG+SL +F
Sbjct: 296 TFVSAMNDRVKQQHANHHNPFVFKYIHSLMDTKSFEDN-GPCVVLASPGMLQSGISLELF 354
Query: 176 KKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMA-IEYMSFSAHADAK 234
++W N +IM G+CV GTI VL+ K++ + +++ ++M+ IE +SFSAH+D +
Sbjct: 355 ERWCGDRRNGIIMAGYCVDGTIAKDVLAKPKEVAKPDGKVLPLRMSTIEAVSFSAHSDGR 414
Query: 235 GIMQLIQ-YCEPKNVLLVHGEASKMVFLKEKIKQEF---NLDCFMPANGESC---FVQ 285
IQ + K+ +LVHG M LK K+ Q+F N+ + N ES FVQ
Sbjct: 415 QTRDFIQSLTKVKHTILVHGNPGAMGQLKSKLLQDFRDRNMSVYTTMNQESIRIPFVQ 472
>gi|223997482|ref|XP_002288414.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220975522|gb|EED93850.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 557
Score = 203 bits (516), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 118/336 (35%), Positives = 179/336 (53%), Gaps = 10/336 (2%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK--CRPDLLITESTYATTIRDSKRCRERD 58
MF ++VG +S++YTGDY+M DRHL AA + K PDLLI ESTY + S+ RE
Sbjct: 170 MFFIEVGGRSVLYTGDYSMEEDRHLMAAELPKYHASPDLLIVESTYGVQVHASRAEREAR 229
Query: 59 FLKKVHECVDRGGKVLIPVFALGRAQELCILLETYW-ERMNLQA-PIYFAVGLTEKATNY 116
F + V GG+ LIPVFALGRAQEL ++L+ YW E +LQ+ PIY+A + +A
Sbjct: 230 FTGTIERIVTGGGRCLIPVFALGRAQELLLILDEYWQEHPHLQSIPIYYASKMASRALRV 289
Query: 117 YKMFITWTNQKIRKTFVQRNMFDFKHIR---PFDKSFIDNPGPMVVFATPGMLHSGLSLI 173
Y+ + N +IR N F F HIR D + D+ GP VVFA+PGML SG+S
Sbjct: 290 YQTYANMMNARIRAQMDLGNPFHFSHIRNLKSIDVNNFDDRGPSVVFASPGMLQSGVSRQ 349
Query: 174 IFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADA 233
+F +WA N +++ G+ V+ T+ +++S K++ + + ++Y+SFSAH D
Sbjct: 350 LFDRWAGDPKNGVMLAGYAVEHTLAKEIMSQPKEVVTLEGRRQPLNCLVDYVSFSAHVDF 409
Query: 234 KGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISID 293
I P+N++LVHG +M LK + +N +P N ++ +
Sbjct: 410 VQNRDFITSVNPRNIILVHGAKEEMGRLKGALMLHYNK---LPENKRPTIAMPPNEVDVK 466
Query: 294 VSVNLLKEEAVKYNSEPPNPLKERQIHGVLVIKDSS 329
+ + V P + + G+LV + +S
Sbjct: 467 LIFTRRRSAKVMGKLAEAPPREGDPVRGILVTQQNS 502
>gi|429243009|ref|NP_594263.2| mRNA cleavage and polyadenylation specificity factor complex
endoribonuclease subunit Ysh1 [Schizosaccharomyces pombe
972h-]
gi|384872669|sp|O13794.2|YSH1_SCHPO RecName: Full=Endoribonuclease ysh1; AltName: Full=mRNA
3'-end-processing protein ysh1
gi|347834169|emb|CAB16227.2| mRNA cleavage and polyadenylation specificity factor complex
endoribonuclease subunit Ysh1 [Schizosaccharomyces
pombe]
Length = 757
Score = 202 bits (515), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 122/375 (32%), Positives = 204/375 (54%), Gaps = 28/375 (7%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
M+ V++ +I++TGDY+ DRHL A + RPD+LITESTY T + +E L
Sbjct: 171 MYFVEMAGVNILFTGDYSREEDRHLHVAEVPPKRPDVLITESTYGTASHQPRLEKEARLL 230
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW-ERMNLQA-PIYFAVGLTEKATNYYK 118
+H + GG+VL+PVFALGRAQEL ++L+ YW ++L++ PIY+A L K ++
Sbjct: 231 NIIHSTIRNGGRVLMPVFALGRAQELLLILDEYWNNHLDLRSVPIYYASSLARKCMAIFQ 290
Query: 119 MFITWTNQKIRKTFVQRNMFDF---KHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIF 175
++ N IRK F +RN F F K +R +K D+ GP V+ A+PGML +G+S +
Sbjct: 291 TYVNMMNDNIRKIFAERNPFIFRFVKSLRNLEK--FDDIGPSVILASPGMLQNGVSRTLL 348
Query: 176 KKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKG 235
++WAP N L++ G+ V+GT+ ++ + ++ + Q I +MA+E +SF+AH D
Sbjct: 349 ERWAPDPRNTLLLTGYSVEGTMAKQITNEPIEIVSLSGQKIPRRMAVEELSFAAHVDYLQ 408
Query: 236 IMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF-----NLDCFMPANGESCFVQTDMKI 290
+ I +++LVHGE + M LK + +F ++ + P N C V +
Sbjct: 409 NSEFIDLVNADHIILVHGEQTNMGRLKSALASKFHNRKVDVKVYTPRN---C-VPLYLPF 464
Query: 291 SIDVSVNLLKEEAVKYNSEPPNPLKERQIHGVLVIKDSSISLM------DVDEACKEVGI 344
+ V L + AV P + + G+L+ KD++ LM D + V
Sbjct: 465 KGERLVRALGKVAVH------KPKEGDIMSGILIQKDANYKLMSAEDLRDFSDLTTTVLT 518
Query: 345 SRHIVRFTSNVQISD 359
+ ++ F S++++++
Sbjct: 519 QKQVIPFFSSMELAN 533
>gi|393245131|gb|EJD52642.1| Metallo-hydrolase/oxidoreductase [Auricularia delicata TFB-10046
SS5]
Length = 751
Score = 202 bits (514), Expect = 8e-49, Method: Compositional matrix adjust.
Identities = 105/278 (37%), Positives = 157/278 (56%), Gaps = 10/278 (3%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF + + ++YTGDY+ DRHL A I RPD+LI ESTY ++ +E FL
Sbjct: 162 MFHIDIAGVKVLYTGDYSREEDRHLVKAEIPPVRPDVLIVESTYGVQSVGNREEKEGRFL 221
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
+HE + RGG L+PVFALGRAQEL ++L+ YW + PIY+A L K Y+
Sbjct: 222 SLIHEIIKRGGHALLPVFALGRAQELLLVLDDYWAKHPELHSVPIYYASNLARKCMAVYQ 281
Query: 119 MFITWTNQKIRKTFVQR-NMFDFKHI------RPFDKSFIDNPGPMVVFATPGMLHSGLS 171
+I N IR+ F +R N F FKHI R ++ D P P VV A+PGML SG S
Sbjct: 282 TYIHTMNSNIRQRFARRDNPFIFKHISHLPQTRGLERKIADGP-PCVVLASPGMLQSGTS 340
Query: 172 LIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHA 231
+ + WAP N L++ G+ V+GT+ +L+ +++ I ++++ Y+SFSAH
Sbjct: 341 RELLELWAPDPRNALVVTGYSVEGTLARDILNDPDEIQGLRGNTIPRRLSVHYISFSAHV 400
Query: 232 DAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF 269
D + I+ + ++V+LVHGE + M L+ ++ +
Sbjct: 401 DYAQNSEFIELIKAQHVVLVHGEQNNMGRLRAALQSRY 438
>gi|378756364|gb|EHY66388.1| cleavage and polyadenylation specificity factor [Nematocida sp. 1
ERTm2]
Length = 692
Score = 201 bits (512), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 114/288 (39%), Positives = 166/288 (57%), Gaps = 8/288 (2%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF VK N S++YTGDY+ DRHL AA I D+LI+ESTY SK RE F+
Sbjct: 162 MFLVKNENISLLYTGDYSREEDRHLKAAVIPPMPIDILISESTYGVQCHQSKEEREHRFI 221
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWE-RMNLQA-PIYFAVGLTEKATNYYK 118
V + V RGGK L+PVFALGRAQEL ++L+ +WE R +LQ PI +A L ++ Y+
Sbjct: 222 TGVSDVVKRGGKCLLPVFALGRAQELLLILDEFWEARKDLQGIPILYASALAKRFMAVYQ 281
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ N +I+ N F FKH++ ++ GP V+ A+PGML +GLS +F+
Sbjct: 282 TYLNMMNDRIQGMAEISNPFHFKHVQNIKNIEAYEDRGPCVMMASPGMLQNGLSRDLFEM 341
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W + N I+PG+CV+GT+ +L ++ + V+ +I+Y+SFSAH D
Sbjct: 342 WCGDKRNGCIIPGYCVEGTLAKDLLCEPDEITSLKGNKLVVRSSIDYISFSAHVDFLQNA 401
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKI-----KQEFNLDCFMPANGE 280
+ I+ C+ V+LVHGE+S+M LK + + N+ P NGE
Sbjct: 402 EFIEGCKVSEVVLVHGESSEMNRLKSALVHRSEAKSENITIHTPRNGE 449
>gi|443899092|dbj|GAC76423.1| mRNA cleavage and polyadenylation factor II complex, BRR5
[Pseudozyma antarctica T-34]
Length = 884
Score = 201 bits (512), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 119/356 (33%), Positives = 191/356 (53%), Gaps = 23/356 (6%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ I+YTGD++ DRHL A I RPD+LI ESTY T + + +E F
Sbjct: 180 MFLIEIAGLRILYTGDFSREEDRHLVQAEIPPVRPDVLICESTYGTQTHEPRLDKEHRFT 239
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
++H + RGG+VL+PVF LGRAQEL +LL+ YW PIY+A L +K + Y+
Sbjct: 240 SQIHHIIKRGGRVLLPVFVLGRAQELLLLLDEYWAAHPELHSVPIYYASALAKKCISVYQ 299
Query: 119 MFITWTNQKIRKTFVQR-NMFDFKHI---RPFDKSFIDNPGPMVVFATPGMLHSGLSLII 174
+I N IR F +R N F FKHI R +K ++ GP V+ A+PG + SG+S +
Sbjct: 300 TYIHTMNDHIRTRFNRRDNPFVFKHISNLRSLEK--FEDRGPCVMMASPGFMQSGVSREL 357
Query: 175 FKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAK 234
++WAP + N LI+ G+ V+GT+ +L+ ++ N I +M+++Y+SFSAH D
Sbjct: 358 LERWAPDKRNGLIVSGYSVEGTMARNILNEPDEIIGMNGIKIPRRMSVDYISFSAHVDFA 417
Query: 235 GIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF-----NLDCFMPANGESCFVQTDMK 289
+ I + ++++LVHGE + M L+ ++ F ++ P N E +Q
Sbjct: 418 QNSRFIDEIKAQHIVLVHGEQNNMSKLRAALQARFTARGSDVKIHTPRNCEPLTLQ---- 473
Query: 290 ISIDVSVNLLKEEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGIS 345
+ + A K P + + G+L+ KD + +++D + G+S
Sbjct: 474 FRAQRTAKAIGTIAAKL------PAQGDTVDGLLISKDFAYTILDPKDLTDFTGLS 523
>gi|384252038|gb|EIE25515.1| Metallo-hydrolase/oxidoreductase [Coccomyxa subellipsoidea C-169]
Length = 696
Score = 201 bits (511), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 104/272 (38%), Positives = 162/272 (59%), Gaps = 4/272 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V++ +YTGDY+ DRH+ AA + P ++I E+TY + + RE+ F+
Sbjct: 163 MFMVEIAGMRALYTGDYSRLADRHMSAADLPSPPPHIVIVEATYGVSRHLPREGREQRFV 222
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWER-MNLQA-PIYFAVGLTEKATNYYK 118
+ V RGG+ L+PV ALGRAQEL ++LE YW+R +L+ PIY A GL +A ++
Sbjct: 223 NMIRAVVQRGGRCLLPVVALGRAQELMLILEDYWDRNADLRGVPIYQASGLARRALGIFQ 282
Query: 119 MFITWTNQKIRKTFVQR-NMFDFKHIRPFD-KSFIDNPGPMVVFATPGMLHSGLSLIIFK 176
+I N I+ F Q N F+FK+I + +D+ GP VV ATP ML SGLS +F
Sbjct: 283 TYIAMMNDDIKAAFGQSANPFNFKYITELKTQGGLDDVGPCVVLATPSMLQSGLSRELFD 342
Query: 177 KWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGI 236
W + N +I+ F VQGT+ +L+ + + + ++M++E++SFSAHAD
Sbjct: 343 AWCEDKRNGVIIADFAVQGTLARDILASPSHVLTKAGAKVPLRMSVEHISFSAHADFDQT 402
Query: 237 MQLIQYCEPKNVLLVHGEASKMVFLKEKIKQE 268
Q ++ +P +V+LVHGEA +M L++ ++Q+
Sbjct: 403 SQFVELLDPPHVILVHGEAVEMGRLRKALEQQ 434
>gi|157821479|ref|NP_001101559.1| methyltransferase like 25 [Rattus norvegicus]
gi|149067049|gb|EDM16782.1| similar to CG33154-PB (predicted) [Rattus norvegicus]
gi|171847417|gb|AAI61980.1| Similar to CG33154-PB [Rattus norvegicus]
Length = 600
Score = 200 bits (509), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 163/608 (26%), Positives = 273/608 (44%), Gaps = 123/608 (20%)
Query: 488 TIDEMIEFITPNIACVNTHTVHFMVEKLWETTI---PKAILSQVNNSPSFDSLLEEFWKS 544
T+ +++ F+ +A + HTV F + +W+ + P+++L+ + S + L E
Sbjct: 19 TLQDLLSFLRGALAISSAHTVDFYTKSIWQELVDLPPESVLATLRKSAAEAELHE----- 73
Query: 545 RQDNHVETNSELVKFFQAADKFRLTSLMNGESIITVDELFSKLQARKCGQVVESAKVSQL 604
L G I + ++F C + V
Sbjct: 74 --------------------TLPLGEAEAGSGFIDLPKIF-------CETSQKLLSVEAF 106
Query: 605 MSEKKSYEVQVMSQVVAAVTNSCDSSHIIDLGSTITVDELFSKLQARKCGLVVESAKVSQ 664
+ K Y VQ +LG ++L + L+ E+ K +
Sbjct: 107 VLTAKHYSVQ-------------------NLGLCTPSEQLLTALRVNTEQRTDENVKAIE 147
Query: 665 LMSEKKSYEVQVMSQVVAAVTNSCDSSHIIDLGGGQGYLSTILALQHGKKTLSLDYNQVN 724
M+ KKS+EVQ MS+++ ++ + C IID+G G+GYLS+ L+L++G +D + N
Sbjct: 148 FMNTKKSHEVQAMSELICSIADYCGLKQIIDVGSGKGYLSSFLSLKYGLNVYGIDSSSTN 207
Query: 725 THGAAVRSKKLEDSEK--ESKGPFQSYAGVINKKLWMRTQNRQTSHASVEHHGKNWKRK- 781
THGA R++KL+ S+ S V+ + Q+R E K+ +
Sbjct: 208 THGAKERNRKLKKHWNLCHSQSRADSNGLVLKMPKESKVQSRSKCEGESERVQKSRRGDE 267
Query: 782 ----------SKAPVVSIK-----------DEELVVCKDK--------------CKQITH 806
S +PV I+ +EE + C+D Q+ H
Sbjct: 268 DASAGVLADFSGSPVSVIRKQQRNLHAHPVEEEKLCCEDAFSLVDFLPVDAIEPTSQVCH 327
Query: 807 FVTPD--------------SDISSILSQAYPQDS-LHNV------CIM-GLHTCGDLSGT 844
P S I S L+ D+ LH++ C+M GLHTCGDL+
Sbjct: 328 TEAPGLRKERGNATSKTRGSSIYSPLTSFITADTQLHDIIEDLEDCLMVGLHTCGDLAPN 387
Query: 845 ALRLF-TKSSLQCLVQVGCCYHLLEEEFIRSPFWKDVDQSLYEHGYGFPLSEHLRSRKFF 903
LR+F +K+ ++ + VGCCYHLL EEF ++ + + +GFP+ +L+ ++
Sbjct: 388 TLRIFASKAEVKGVCSVGCCYHLLSEEF------ENQHKGYAKENWGFPMCHYLKEERWC 441
Query: 904 LGRNVRMSGTQSPERVIDLKQTQTLPLFYRALLEKYLRSKITINDEEPKVVGRLATKCSN 963
GRN RMS + ERV+ + T LFYRA+L+ ++ I+ E + VG++ +KCS+
Sbjct: 442 CGRNARMSACLALERVVVGQGLPTESLFYRAVLQNIIKDCYGISKCE-QYVGKIYSKCSS 500
Query: 964 FVEYVHRAVDKLKLD-LEVDDEEVTRLFNSHQREYEYLQIYYFLKTALAPVIEALIVLDR 1022
F+EYV ++ KL LD ++ E + + ++ L+ + LK LAP IE LI+LDR
Sbjct: 501 FLEYVRVSLKKLGLDESKISAEIIMDYYEKYKPRMNELEAFNMLKVVLAPCIETLILLDR 560
Query: 1023 VLYLREQQ 1030
+ YL+EQ+
Sbjct: 561 LCYLKEQE 568
Score = 41.6 bits (96), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 18/52 (34%), Positives = 32/52 (61%)
Query: 1044 TQVDDEEVTRLFDSHQREYEYLQIYYFLKTALAPVIEALIVLDRVLYLREQQ 1095
+++ E + ++ ++ L+ + LK LAP IE LI+LDR+ YL+EQ+
Sbjct: 517 SKISAEIIMDYYEKYKPRMNELEAFNMLKVVLAPCIETLILLDRLCYLKEQE 568
>gi|261327437|emb|CBH10412.1| cleavage and polyadenylation specificity factor subunit, putative
[Trypanosoma brucei gambiense DAL972]
Length = 770
Score = 200 bits (508), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 110/275 (40%), Positives = 167/275 (60%), Gaps = 6/275 (2%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
+F V + ++YTGD++ PDRHL A + PD+LI EST +S+ RE F
Sbjct: 189 LFMVDIAGMKLLYTGDFSRVPDRHLLGAEVPPYSPDILIAESTNGIRELESREERESLFT 248
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWE-RMNLQ-APIYFAVGLTEKATNYYK 118
VH+ V GG+ L+PVFALGRAQEL ++LE YWE LQ PIY+A L ++ Y+
Sbjct: 249 TWVHDVVKGGGRCLVPVFALGRAQELLLILEEYWEAHKELQHIPIYYASSLAQRCMKLYQ 308
Query: 119 MFITWTNQKIRKTFV-QRNMFDFKHIRP-FDKSFIDNPGPMVVFATPGMLHSGLSLIIFK 176
F++ N +++K RN F FK+I+ D ++ GP VV A+PGML SG+SL +F+
Sbjct: 309 TFVSAMNDRVKKQHENHRNPFVFKYIQSLLDTRSFEDTGPCVVLASPGMLQSGISLELFE 368
Query: 177 KWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKM-AIEYMSFSAHADAKG 235
+W + N +I+ G+CV GTI +LS +++ + +++ ++M I+ +SFSAH+D +
Sbjct: 369 RWCGDKRNGIIVAGYCVDGTIAKDILSKPREITKPDGKVLPLRMRTIQSVSFSAHSDGRQ 428
Query: 236 IMQLIQYC-EPKNVLLVHGEASKMVFLKEKIKQEF 269
IQ + K+V+LVHG M LK K++Q+F
Sbjct: 429 TRDFIQALPKTKHVILVHGNVGAMGQLKNKLQQDF 463
>gi|154422115|ref|XP_001584070.1| RNA-metabolising metallo-beta-lactamase family protein [Trichomonas
vaginalis G3]
gi|121918315|gb|EAY23084.1| RNA-metabolising metallo-beta-lactamase family protein [Trichomonas
vaginalis G3]
Length = 588
Score = 200 bits (508), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 117/314 (37%), Positives = 162/314 (51%), Gaps = 25/314 (7%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFLK 61
F V+V S +YTGD++ D HL + + PDLLITESTY +RDS RER F++
Sbjct: 163 FFVQVRGLSFIYTGDFSAIADHHLSGHAVPRLFPDLLITESTYGNQVRDSIAKRERSFVQ 222
Query: 62 KVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMFI 121
VH+ V GGKVLIPVFA+GR QE+C++LE YW RM PIY+ L E YK +
Sbjct: 223 MVHQVVGEGGKVLIPVFAVGRLQEICLMLEDYWNRMGYTEPIYYTTNLGENCMKVYKQCV 282
Query: 122 TWTNQKIRKTFVQRNMFD-------FKHIRPFD--KSFIDNPGPMVVFATPGMLHSG--- 169
W N VQ N+FD F + R F+ KS ID +V+ AT GML+ G
Sbjct: 283 NWMNPT-----VQTNLFDNGSTAFKFTYSRNFNPKKSKIDESRGLVMLATSGMLNPGTPA 337
Query: 170 LSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLS---GVKKLEFENKQI-----IDVKMA 221
+ + +KW NM+I PG+C T G VL+ +++F +++ I +K
Sbjct: 338 FNFFVNEKWYDDPRNMVIFPGYCGPNTFGRAVLTRDLTTNRVQFTSRRPAMTVDIIIKCK 397
Query: 222 IEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGES 281
+E +SFSAHAD I+ L P V+ +HG+ + L +I Q + P N
Sbjct: 398 VERISFSAHADQFEIISLCDRVRPSKVVTIHGDKPSVDALATRITQTTGIPAEAPRNNAK 457
Query: 282 CFVQTDMKISIDVS 295
T I +S
Sbjct: 458 VTTPTKNPNVISIS 471
>gi|72387720|ref|XP_844284.1| cleavage and polyadenylation specificity factor subunit
[Trypanosoma brucei brucei strain 927/4 GUTat10.1]
gi|62359436|gb|AAX79873.1| cleavage and polyadenylation specificity factor subunit, putative
[Trypanosoma brucei]
gi|70800817|gb|AAZ10725.1| cleavage and polyadenylation specificity factor subunit, putative
[Trypanosoma brucei brucei strain 927/4 GUTat10.1]
Length = 770
Score = 200 bits (508), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 110/275 (40%), Positives = 167/275 (60%), Gaps = 6/275 (2%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
+F V + ++YTGD++ PDRHL A + PD+LI EST +S+ RE F
Sbjct: 189 LFMVDIAGMKLLYTGDFSRVPDRHLLGAEVPPYSPDILIAESTNGIRELESREERESLFT 248
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWE-RMNLQ-APIYFAVGLTEKATNYYK 118
VH+ V GG+ L+PVFALGRAQEL ++LE YWE LQ PIY+A L ++ Y+
Sbjct: 249 TWVHDVVKGGGRCLVPVFALGRAQELLLILEEYWEAHKELQHIPIYYASSLAQRCMKLYQ 308
Query: 119 MFITWTNQKIRKTFV-QRNMFDFKHIRP-FDKSFIDNPGPMVVFATPGMLHSGLSLIIFK 176
F++ N +++K RN F FK+I+ D ++ GP VV A+PGML SG+SL +F+
Sbjct: 309 TFVSAMNDRVKKQHENHRNPFVFKYIQSLLDTRSFEDTGPCVVLASPGMLQSGISLELFE 368
Query: 177 KWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKM-AIEYMSFSAHADAKG 235
+W + N +I+ G+CV GTI +LS +++ + +++ ++M I+ +SFSAH+D +
Sbjct: 369 RWCGDKRNGIIVAGYCVDGTIAKDILSKPREITKPDGKVLPLRMRTIQSVSFSAHSDGRQ 428
Query: 236 IMQLIQYC-EPKNVLLVHGEASKMVFLKEKIKQEF 269
IQ + K+V+LVHG M LK K++Q+F
Sbjct: 429 TRDFIQALPKTKHVILVHGNVGAMGQLKNKLQQDF 463
>gi|357445375|ref|XP_003592965.1| Cleavage and polyadenylation specificity factor subunit 3-I
[Medicago truncatula]
gi|357445453|ref|XP_003593004.1| Cleavage and polyadenylation specificity factor subunit 3-I
[Medicago truncatula]
gi|355482013|gb|AES63216.1| Cleavage and polyadenylation specificity factor subunit 3-I
[Medicago truncatula]
gi|355482052|gb|AES63255.1| Cleavage and polyadenylation specificity factor subunit 3-I
[Medicago truncatula]
Length = 690
Score = 200 bits (508), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 107/289 (37%), Positives = 158/289 (54%), Gaps = 12/289 (4%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V + ++YTGDY+ DRHL AA + PD+ I ESTY + RE+ F
Sbjct: 174 MFMVDIAGVRVLYTGDYSREEDRHLRAAETPQFSPDVCIIESTYGVQHHQPRHTREKRFT 233
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW----ERMNLQAPIYFAVGLTEKATNY 116
+H + +GG+VLIP +ALGRAQEL ++L+ YW E N+ PIY+A L +K
Sbjct: 234 DVIHSTISQGGRVLIPAYALGRAQELLLILDEYWANHPELQNI--PIYYASPLAKKCLTV 291
Query: 117 YKMFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIF 175
Y+ + N +I+ + N F FKHI + GP VV A+PG L SGLS +F
Sbjct: 292 YETYTLSMNDRIQNA--KSNPFAFKHISALSSIDIFKDVGPSVVMASPGGLQSGLSRQLF 349
Query: 176 KKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKG 235
W + N ++PG+ V+GT+ +L+ K++ N + M + Y+SFSAHAD+
Sbjct: 350 DMWCSDKKNSCVIPGYVVEGTLAKTILNEPKEVTLMNGLSAPLHMQVHYISFSAHADSAQ 409
Query: 236 IMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF---NLDCFMPANGES 281
++ P N++LVHG A++M LK+K+ +F N P N +S
Sbjct: 410 TSAFLEELNPPNIILVHGAANEMGRLKQKLMTQFADRNTKILTPKNCQS 458
>gi|403373777|gb|EJY86813.1| Cleavage and polyadenylation specificity factor subunit 3
[Oxytricha trifallax]
Length = 755
Score = 200 bits (508), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 102/272 (37%), Positives = 166/272 (61%), Gaps = 7/272 (2%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ I+YTGDY+ DRHL A + C D+LI ESTY I + + RE F
Sbjct: 179 MFLIEIDGVKILYTGDYSREEDRHLKPAELPNCEVDVLIVESTYGVQIHEQRDKREERFT 238
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERM-NLQ-APIYFAVGLTEKATNYYK 118
K VH+ V RGGK L+PVFALGRAQE+ ++L YW++ ++Q PIY++ L +K+ ++
Sbjct: 239 KLVHDIVKRGGKCLLPVFALGRAQEILLILNEYWQKNPDIQNVPIYYSGSLAQKSLTVFQ 298
Query: 119 MFITWTNQKIRKTFVQ-RNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
+ ++R N F F+ I F+ ++ P+V+ A+PGML +G S +F K
Sbjct: 299 TYRNMMGDQLRMELESGNNPFHFEPITTFND---ESEFPLVIMASPGMLQNGQSRDLFVK 355
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
WAP N ++ G+ V+GT+ V++ K L +Q +++K+++EY+SFSAHAD +
Sbjct: 356 WAPDPKNGIVFTGYSVEGTLAKSVMNRPKILTVAEQQ-VELKLSVEYISFSAHADYQHTS 414
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF 269
+ I+ +P N++LVHG+A++M L+ ++ +
Sbjct: 415 EYIKQLQPPNIVLVHGDANEMKKLQNDLQARY 446
>gi|397639513|gb|EJK73612.1| hypothetical protein THAOC_04754 [Thalassiosira oceanica]
Length = 454
Score = 200 bits (508), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 105/277 (37%), Positives = 162/277 (58%), Gaps = 7/277 (2%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK--CRPDLLITESTYATTIRDSKRCRERD 58
MF +++G +S++YTGDY+M DRHL AA + + PD+LI ESTY + ++ RE
Sbjct: 169 MFYIEIGGRSVLYTGDYSMEEDRHLMAAELPRYHASPDVLIVESTYGVQVHPTRAEREAR 228
Query: 59 FLKKVHECVDRGGKVLIPVFALGRAQELCILLETYW-ERMNLQA-PIYFAVGLTEKATNY 116
F + V GG+ LIPVFALGRAQEL ++L+ YW E +LQ+ P+Y+A + +A
Sbjct: 229 FTGTIERIVTGGGRCLIPVFALGRAQELLLILDEYWQEHPHLQSVPVYYASKMASRALRV 288
Query: 117 YKMFITWTNQKIRKTFVQRNMFDFKHIR---PFDKSFIDNPGPMVVFATPGMLHSGLSLI 173
Y+ + N +IR N F F+HIR D + D+ GP VVFA+PGML SG+S
Sbjct: 289 YQTYANMMNARIRTQMDLGNPFSFRHIRNLKSIDVNNFDDRGPSVVFASPGMLQSGVSRQ 348
Query: 174 IFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADA 233
+F +WA N +++ G+ V+ T+ +++S K++ + + ++Y+SFSAH D
Sbjct: 349 LFDRWATDPKNGVLIAGYAVEHTLAKEIMSQPKEVVTMEGRRQPLNCLVDYVSFSAHVDF 408
Query: 234 KGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFN 270
I P+N++LVHG +M LK + ++N
Sbjct: 409 VQNRDFITNVNPRNIILVHGAKEEMGRLKGALMMQYN 445
>gi|449546825|gb|EMD37794.1| hypothetical protein CERSUDRAFT_154677 [Ceriporiopsis subvermispora
B]
Length = 820
Score = 200 bits (508), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 118/363 (32%), Positives = 190/363 (52%), Gaps = 25/363 (6%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF + + I+YTGDY+ DRHL A + RPD+LI ESTY + + +E+ F
Sbjct: 162 MFLIDIAGLKILYTGDYSREEDRHLVKAEVPPIRPDVLIVESTYGVQTLEGREEKEQRFT 221
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH + RGG VL+P FALGRAQEL ++L+ YW++ PIY+A L K Y+
Sbjct: 222 TLVHNIIRRGGHVLLPTFALGRAQELLLILDEYWKKHPDLHNVPIYYASSLARKCMAVYQ 281
Query: 119 MFITWTNQKIRKTFVQR-NMFDFKHI------RPFDKSFIDNPGPMVVFATPGMLHSGLS 171
+I N +R F +R N F FKHI R +++ + P P VV A+PG + SG S
Sbjct: 282 TYIHTMNANVRTRFAKRDNPFVFKHISNVPQARGWERKIAEGP-PCVVLASPGFVTSGPS 340
Query: 172 LIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHA 231
+ + WAP N +I+ G+ V+GT+ +L+ +++ Q I K++++Y+SFSAH
Sbjct: 341 RELLELWAPDSRNGIIVTGYSVEGTMARDILNEPEEITSVKGQTIPRKLSVDYISFSAHV 400
Query: 232 DAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF-----NLDCFMPANGESCFVQT 286
D + I+ ++++LVHGE + M L+ ++ + ++ P N E+
Sbjct: 401 DYSQNSEFIELVNAQHIVLVHGEQTAMGRLRAAMQDRYKNRDQDVKIHTPRNLET----- 455
Query: 287 DMKISIDVSVNLLKEEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISR 346
+ + + + ++PP P + G+LV KD S +L+D + G+S
Sbjct: 456 ---LELTFRGERVAKAIGTLAAKPPQP--NDVLSGLLVAKDYSYTLLDPRDLRDFAGLST 510
Query: 347 HIV 349
+V
Sbjct: 511 CVV 513
>gi|392569726|gb|EIW62899.1| mRNA 3'-end-processing protein YSH1 [Trametes versicolor FP-101664
SS1]
Length = 805
Score = 199 bits (507), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 116/347 (33%), Positives = 182/347 (52%), Gaps = 23/347 (6%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF + + I+YTGDY+ DRHL A I PD+LI ESTY + + +E F
Sbjct: 162 MFLIDIAGLKILYTGDYSREEDRHLVKAEIPPVHPDVLIVESTYGVQSHEPREDKETRFT 221
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH + RGG VL+P FALGRAQEL ++L+ YW + P+Y+A L K Y+
Sbjct: 222 NLVHSIIRRGGHVLLPTFALGRAQELLLILDEYWAKHPDLHNVPVYYASSLARKCMAVYQ 281
Query: 119 MFITWTNQKIRKTFVQR-NMFDFKHI------RPFDKSFIDNPGPMVVFATPGMLHSGLS 171
+I N +R F + N F FKHI R +++ + P P VV A+PG + +G S
Sbjct: 282 TYIHTMNANVRTRFAKHDNPFVFKHITNVPGTRGWERKIAEGP-PCVVLASPGFMQTGPS 340
Query: 172 LIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHA 231
+ + WAP N LI+ G+ ++GT+ ++L+ +++ I K++++YMSFSAH
Sbjct: 341 RELLELWAPDGRNGLIVTGYSIEGTMAREILTEPEEITTMKGNTIPRKISVDYMSFSAHV 400
Query: 232 DAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKIS 291
D + ++ +P++++LVHGEA+ M LK ++ + + E + T +
Sbjct: 401 DYSQNSEFMELVKPQHIVLVHGEATGMGRLKAAMQDRYK------SRDEEIKIHTPRNLD 454
Query: 292 IDVSVNLLKEEAVK----YNSEPPNPLKERQIHGVLVIKDSSISLMD 334
+ + E K ++PP P I G+LV KD S +L+D
Sbjct: 455 T-LELTFRSERVAKAIGTLANKPPQP--NDIISGLLVSKDYSYTLLD 498
>gi|342180524|emb|CCC90000.1| putative cleavage and polyadenylation specificity factor subunit
[Trypanosoma congolense IL3000]
Length = 766
Score = 199 bits (505), Expect = 9e-48, Method: Compositional matrix adjust.
Identities = 110/275 (40%), Positives = 167/275 (60%), Gaps = 6/275 (2%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
+F V + ++YTGD++ PDRHL A + PD+LI EST +S+ RE F
Sbjct: 188 LFMVDIAGMKVLYTGDFSRVPDRHLLGAEVPPYSPDILIAESTNGIRELESREERETLFT 247
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWE-RMNLQ-APIYFAVGLTEKATNYYK 118
VH+ V GG+ LIPVFALGRAQEL ++LE YWE LQ PIY+A L ++ Y+
Sbjct: 248 TWVHDVVKGGGRCLIPVFALGRAQELLLILEEYWEAHKELQHIPIYYASSLAQRCMKLYQ 307
Query: 119 MFITWTNQKIRKTFV-QRNMFDFKHIRP-FDKSFIDNPGPMVVFATPGMLHSGLSLIIFK 176
F++ N ++++ RN F FK+I+ D ++ GP VV A+PGML SG+SL +F+
Sbjct: 308 TFVSAMNDRVKEQHENHRNPFVFKYIQSLLDTRSFEDTGPCVVLASPGMLQSGISLELFE 367
Query: 177 KWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKM-AIEYMSFSAHADAKG 235
+W + N +I+ G+CV GTI ++LS K++ + +++ ++M ++ +SFSAH+D +
Sbjct: 368 RWCGDKRNGIIVAGYCVDGTIAKEILSKPKEITKPDGKVLPLRMRTVQSVSFSAHSDGRQ 427
Query: 236 IMQLIQYC-EPKNVLLVHGEASKMVFLKEKIKQEF 269
IQ K+V+LVHG M LK K++Q+F
Sbjct: 428 TRDFIQALPNTKHVILVHGNIGAMGQLKNKLQQDF 462
>gi|387594760|gb|EIJ89784.1| cleavage and polyadenylation specificity factor 3 [Nematocida
parisii ERTm3]
gi|387596392|gb|EIJ94013.1| cleavage and polyadenylation specificity factor 3 [Nematocida
parisii ERTm1]
Length = 696
Score = 199 bits (505), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 113/300 (37%), Positives = 172/300 (57%), Gaps = 8/300 (2%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF VK + S++YTGDY+ DRHL AA I D+LI+ESTY SK RE F+
Sbjct: 162 MFLVKNEDISLLYTGDYSREEDRHLKAAVIPPMPIDILISESTYGVQCHQSKEERETRFI 221
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWE-RMNLQA-PIYFAVGLTEKATNYYK 118
V + V RGGK L+PVFALGRAQEL ++L+ +W+ R +LQ PI +A L ++ Y+
Sbjct: 222 TGVSDVVKRGGKCLLPVFALGRAQELLLILDEFWDSRKDLQGIPILYASALAKRFMAVYQ 281
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ N +I+ N F FKH++ ++ GP V+ A+PGML +GLS +F+
Sbjct: 282 TYLNMMNDRIQGMAEISNPFHFKHVQSIKNIEAYEDRGPCVMMASPGMLQNGLSRDLFEM 341
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W + N I+PG+CV+GT+ +L ++ + V+ +I+Y+SFSAH D
Sbjct: 342 WCGDKRNGCIIPGYCVEGTLAKDLLCEPDEITSLKGNKLVVRSSIDYISFSAHVDFLQNA 401
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKI-----KQEFNLDCFMPANGESCFVQTDMKISI 292
+ I+ C+ V+LVHGE+S+M LK + + N+ P NGE ++ +I++
Sbjct: 402 EFIEGCKVSEVVLVHGESSEMNRLKSALVHRSESKSENMVVHTPRNGEWVQIKGTGEINV 461
>gi|254565077|ref|XP_002489649.1| Putative endoribonuclease [Komagataella pastoris GS115]
gi|238029445|emb|CAY67368.1| Putative endoribonuclease [Komagataella pastoris GS115]
Length = 784
Score = 198 bits (504), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 134/434 (30%), Positives = 227/434 (52%), Gaps = 41/434 (9%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ +++TGD++ DRHL A + RPD+LITEST+ T + + +E+
Sbjct: 169 MFFIEIAGIKVLFTGDFSREEDRHLSVAEVPPVRPDVLITESTFGTATHEPREEKEKKLT 228
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW-ERMNLQA-PIYFAVGLTEKATNYYK 118
+H + GG+VL+PVFALGRAQEL ++L+ YW + +L+ +Y+A L K Y+
Sbjct: 229 TMIHSTLANGGRVLMPVFALGRAQELLLILDEYWSQHQDLENIKVYYASDLARKCLAVYQ 288
Query: 119 MFITWTNQKIRKTF--VQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIF 175
+I N+ IRK F +N F F++I+ S D+ P VV A+PGML +G+S +
Sbjct: 289 TYINMMNENIRKKFRDTNKNPFQFQYIKNIKNLSKFDDFQPSVVVASPGMLQNGVSRALL 348
Query: 176 KKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDV----KMAIEYMSFSAHA 231
+KWAP N LIM ++ T E ++Q DV +M +E +SF+AH
Sbjct: 349 EKWAPDPRNTLIMTEILLEPT------------EIPSQQNPDVLIPRRMTVEEISFAAHV 396
Query: 232 DAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFN--------LDCFMPANGESCF 283
D + + I+ PK ++LVHGE++ M LK + +++ + + P N E
Sbjct: 397 DYEQNSKFIELVNPKTIVLVHGESNPMGRLKSALLSKYSKYKNTPDEVKVYNPRNCEDLL 456
Query: 284 VQTDMKISIDVSVNLLKEEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVG 343
V+ I I ++ + E+ K + K +I+GVLV K+ +SL+ + + + G
Sbjct: 457 VEFK-GIKIAKAMGTITEDVNKVLKQDVKSAKS-EINGVLVQKNFDLSLLKIQDLREYTG 514
Query: 344 ISRHIVRFTSNVQISDSSPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLLSV 403
++ +++ ++ +S + L L L D ++EY++++ +DSVLL++
Sbjct: 515 LTTTMIKQRQTLRSHATSSLVHYHLLQMFGYLDILIDDAEEYEVKI------MDSVLLNM 568
Query: 404 ERIDDKNKRVFVTW 417
DK V V W
Sbjct: 569 ----DKRGIVTVEW 578
>gi|156840674|ref|XP_001643716.1| hypothetical protein Kpol_1009p4 [Vanderwaltozyma polyspora DSM
70294]
gi|156114339|gb|EDO15858.1| hypothetical protein Kpol_1009p4 [Vanderwaltozyma polyspora DSM
70294]
Length = 778
Score = 198 bits (503), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 105/285 (36%), Positives = 165/285 (57%), Gaps = 20/285 (7%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+++ +++TGDY+ DRHL +A + D+LI EST+ T + + RE+
Sbjct: 166 MFQIEIAGLRVLFTGDYSREMDRHLNSAEVPPLPSDVLIVESTFGTATHEPRLNREKKLT 225
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW-----ERMNLQAPIYFAVGLTEKATN 115
+ +H V RGG+VL+PVFALGRAQEL ++L+ YW E + Q PIY+A L +K +
Sbjct: 226 QLIHSTVGRGGRVLMPVFALGRAQELMLILDEYWSQHADELGSGQVPIYYASNLAKKCMS 285
Query: 116 YYKMFITWTNQKIRKTF--VQRNMFDFKHIRPFDKSFIDNP------GPMVVFATPGMLH 167
Y+ ++ N IRK F Q N F FKHI S++ N GP V+ A+PGML
Sbjct: 286 VYQTYVNMMNDDIRKKFRDSQTNPFIFKHI-----SYLKNLDEFQDFGPSVMLASPGMLQ 340
Query: 168 SGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDV--KMAIEYM 225
+GLS + +KW P + NM+++ G+ V+GT+ ++ + + N + + + IE +
Sbjct: 341 NGLSRDLLEKWCPEDKNMVLITGYSVEGTMAKYIMLEPENIPSINNPDVSIPRRCQIEEI 400
Query: 226 SFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFN 270
SF+AH D + ++ I+ NV+LVHGE++ M LK + F+
Sbjct: 401 SFAAHVDFQENIEFIEKISANNVILVHGESNPMGRLKSALLSNFS 445
>gi|320583131|gb|EFW97347.1| Putative endoribonuclease [Ogataea parapolymorpha DL-1]
Length = 702
Score = 197 bits (500), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 135/446 (30%), Positives = 231/446 (51%), Gaps = 26/446 (5%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V++G ++TGDY+ DRHL +A + RPDLLITEST+ T + RE
Sbjct: 163 MFFVELGGLKFLFTGDYSREEDRHLSSAELPPSRPDLLITESTFGTATHVPRVEREAKLT 222
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERM-NLQ-APIYFAVGLTEKATNYYK 118
+H + +GG+ L+PVFALGRAQE+ ++L+ YW+ LQ PIY+A L +K Y+
Sbjct: 223 HVIHSTIQQGGRCLLPVFALGRAQEILLILDEYWQNNPELQNVPIYYASDLAKKCMAVYQ 282
Query: 119 MFITWTNQKIRKTFVQ--RNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIF 175
++ N IRK F + +N F FK+I+ I++ V+ A+PGML +G+S I
Sbjct: 283 RYVNMMNDSIRKKFTETNQNPFHFKYIKNITNIEKINDLDSSVLIASPGMLQNGISRKIL 342
Query: 176 KKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLE--FENKQIIDVKMAIEYMSFSAHADA 233
+KW+P N I+ G+ V+GT+ +L+ ++ +I K+ IE +SF+AH D
Sbjct: 343 EKWSPDPRNSCILTGYSVEGTMAKILLTEPSEIPSIINPDVMIPRKINIEEISFAAHVDY 402
Query: 234 KGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISID 293
+ + I+ PK ++LVHGE ++M LK + ++ F E +
Sbjct: 403 EQNSKFIELVNPKAIILVHGETNQMGRLKSALLSKYQ--KFKGTENEIKVYNPRNSSKLS 460
Query: 294 VSVNLLKEEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTS 353
+S +K V P + I+GVLV K+ +SL+ +++ + G++ +VR
Sbjct: 461 LSFEGIKVAKVMGQLATSLPKEHDTINGVLVQKNFDLSLLKIEDLREFAGLTTTVVRQRQ 520
Query: 354 NVQISDSSPINKTLELIYDQLLSYLQ---DKSQEYKIQLTESSLSIDSVLLSVERIDDKN 410
+++ + + + ++ Q+ Y+Q D+ EY +++ + +V +++E K
Sbjct: 521 SLRCTATKSL---IKWQLTQMFGYIQELVDEDDEYVVKV------MSAVKITLE---SKE 568
Query: 411 KRVFVTWPNQ--DEEVGKIVLHVLKS 434
V W + D+ + VL +L S
Sbjct: 569 STATVEWNSGMVDDTIADSVLAILMS 594
>gi|307110126|gb|EFN58363.1| hypothetical protein CHLNCDRAFT_142438 [Chlorella variabilis]
Length = 709
Score = 197 bits (500), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 103/271 (38%), Positives = 157/271 (57%), Gaps = 19/271 (7%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V+VG ++YTGDY+ PDRH+ AA + RP +++ ESTY + + RE+ F+
Sbjct: 177 MFMVEVGGMRLLYTGDYSRIPDRHMPAADLPAQRPHIVVVESTYGVSRHLPREEREQRFV 236
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERM-NLQA-PIYFAVGLTEKATNYYK 118
+++H V RGG+VL+PV ALGRAQEL ++LE YWER L PIY A GL +A + YK
Sbjct: 237 QRIHTAVARGGRVLLPVVALGRAQELLLILEEYWERHPELHGVPIYQASGLARRAISVYK 296
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDKS--FIDNPGPMVVFATPGMLHSGLSLIIFK 176
+I N+ I++ F N F+FKHI + F D SG+S +F+
Sbjct: 297 AYIEMMNEDIKRAFTVANPFEFKHISHLKSAAHFDD---------------SGMSRELFE 341
Query: 177 KWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGI 236
W N +++ F VQGT+ +L ++ N + ++M ++ +SFSAHAD
Sbjct: 342 AWCEDARNCVVIADFAVQGTLARDILGNPSEVMTRNGVKVPLRMQVDAISFSAHADFPQT 401
Query: 237 MQLIQYCEPKNVLLVHGEASKMVFLKEKIKQ 267
+ + +P +V+LVHGEA++M LK+ ++Q
Sbjct: 402 SEFLDALQPPHVVLVHGEATEMGRLKKALEQ 432
>gi|328773999|gb|EGF84036.1| hypothetical protein BATDEDRAFT_9083 [Batrachochytrium
dendrobatidis JAM81]
Length = 669
Score = 197 bits (500), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 111/311 (35%), Positives = 179/311 (57%), Gaps = 13/311 (4%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCR--PDLLITESTYATTIRDSKRCRERD 58
MF +++ ++YTGDY+ DRHL AA P++LI EST+ + + RE+
Sbjct: 209 MFLLEIAGVRLLYTGDYSREEDRHLMAAERPPSSIIPEVLICESTFGVQTLEPRLDREQR 268
Query: 59 FLKKVHECVDRGGKVLIPVFALGRAQELCILLETYWE-RMNLQA-PIYFAVGLTEKATNY 116
F + VH V RGG+ L+PVFALGRAQEL ++L+ YW +L + PIY+A + +K
Sbjct: 269 FTRMVHTIVKRGGRCLLPVFALGRAQELLLILDEYWHAHADLHSVPIYYASAIAKKCMAV 328
Query: 117 YKMFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIF 175
Y+ + N +IR+ N F FKHI + D+ GP V+ A+PGML SGLS +
Sbjct: 329 YQTYTNMMNGRIREMAKISNPFQFKHISNLKSIAQFDDVGPCVMMASPGMLQSGLSRELL 388
Query: 176 KKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKG 235
+ W + N +I+PG+ V+GT+G ++LS ++ N + +++++EY+SFSAH D +
Sbjct: 389 ELWCVDKRNGVIIPGYVVEGTLGKQILSQPDEIPAMNGSKLPLRLSVEYISFSAHVDYRE 448
Query: 236 IMQLIQYCEPKNVLLVHGEASKMVFLKEKIK-----QEFNLDCFMPANGES---CFVQTD 287
+ I+ +N++LVHG++++M L+ ++ +E L + P N E+ F
Sbjct: 449 NSEFIEMVGSQNLILVHGDSNEMGRLRSALQSRYAEREVPLHIYTPRNCETVELVFRGEK 508
Query: 288 MKISIDVSVNL 298
M + DVSV +
Sbjct: 509 MAKTTDVSVQI 519
>gi|426197081|gb|EKV47008.1| hypothetical protein AGABI2DRAFT_203789 [Agaricus bisporus var.
bisporus H97]
Length = 794
Score = 196 bits (498), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 116/345 (33%), Positives = 183/345 (53%), Gaps = 19/345 (5%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF + + I+YTGDY+ DRHL A + RPD+L+ ESTY +S+ +E F
Sbjct: 172 MFLIDIAGLKILYTGDYSREEDRHLIKAELPPIRPDVLVVESTYGVHTGESREEKEHRFT 231
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH + RGG VL+P FALGRAQEL ++L+ YW++ P+Y+A GL K Y+
Sbjct: 232 SLVHSIIRRGGHVLLPTFALGRAQELLLILDDYWKKHPDLHNVPVYYASGLARKCMAVYQ 291
Query: 119 MFITWTNQKIRKTFVQR-NMFDFKHI------RPFDKSFIDNPGPMVVFATPGMLHSGLS 171
+I N IR F +R N F FKHI R ++K D P P VV A+PG + G S
Sbjct: 292 TYIHTMNANIRSRFARRDNPFVFKHISNVPQTRGWEKKIADGP-PCVVLASPGFMQVGPS 350
Query: 172 LIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHA 231
+F+ W P N LI+ G+ ++GT +++ ++ I +++++ +SFSAH
Sbjct: 351 RELFEHWCPDARNGLIITGYSIEGTPARDIMTEPEEFTTLKGATIPRRISVDEISFSAHV 410
Query: 232 DAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKI- 290
D + I+ ++V+LVHGEA+ M L+ ++ F E + T +
Sbjct: 411 DYAQNSEFIEQVRAQHVVLVHGEATGMGRLRAAMQSRFK------EKDEDVKIHTPRNLE 464
Query: 291 SIDVSVNLLK-EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMD 334
++++S + +A+ ++ P P + G+LV KD S +L+D
Sbjct: 465 TLNLSFRGERVAKAIGTLADSP-PQTGDIVSGLLVAKDYSYTLLD 508
>gi|190346159|gb|EDK38177.2| hypothetical protein PGUG_02275 [Meyerozyma guilliermondii ATCC
6260]
Length = 770
Score = 196 bits (497), Expect = 9e-47, Method: Compositional matrix adjust.
Identities = 144/473 (30%), Positives = 237/473 (50%), Gaps = 66/473 (13%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
M+ V++G +++TGDY+ DRHL A + RPD+LITEST+ T + + +E
Sbjct: 174 MYFVEIGGLKVLFTGDYSREEDRHLQVAEVPPMRPDILITESTFGTATHEPRLEKEARMT 233
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWER----MNLQAPIYFAVGLTEKATNY 116
K +H + +GG++L+PVFALGRAQEL ++LE YW + N+ ++FA L K
Sbjct: 234 KIIHSTLLKGGRILMPVFALGRAQELLLILEEYWSQNEDLHNIN--VFFASSLARKCMAV 291
Query: 117 YKMFITWTNQKIR-----KTFVQRNMFDFKH---IRPFDKSFIDNPGPMVVFATPGMLHS 168
Y+ + N IR + + N F FKH IR DK + GP VV A PGML +
Sbjct: 292 YQTYTNIMNDNIRHGVSSASGGKSNPFQFKHIKLIRSLDK--FQDIGPCVVVAAPGMLQN 349
Query: 169 GLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDV----KMAIEY 224
G+S + ++WAP N +IM G+ V+GT+ ++L+ + ++ Q DV +MAIE
Sbjct: 350 GVSRELLERWAPDAKNAVIMTGYSVEGTMAKELLT--EPHTIQSSQNADVTIPRRMAIEE 407
Query: 225 MSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFN--------LDCFMP 276
+SF+AH D + I P ++LVHGE++ M LK + +++ + F P
Sbjct: 408 ISFAAHVDFQENAGFIDMINPSKIILVHGESNPMGRLKSALLSKYSSRKGTEQEVKVFNP 467
Query: 277 ANGESCFVQT-DMKISIDVSVNLLKEEAVKYNSEPPNPLKERQ----------------- 318
N + + +KI+ V NL +EE K + LKE +
Sbjct: 468 RNCDELSIGIRGLKIA-KVLGNLAEEELAKLQKDISKKLKEDESKMDETTDGEVNGDTSE 526
Query: 319 -------IHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQI-SDSSPINKTLELI 370
I GVLV KD +S + + + + +S I++ +++I +D S + LE +
Sbjct: 527 PIKTGHTIAGVLVSKDFDLSFLQMQDLHEYTQLSTSILKSKISLKINADISLMKWHLEQM 586
Query: 371 YDQLLSYLQDKSQEYKIQLTESSLSIDSVLLSVERIDDKNKRVFVT--WPNQD 421
+ ++ + D +E++ L +D + + V+R +F+T W N +
Sbjct: 587 FG-YINIVNDDEEEWE------CLIMDKIEVLVDRTKTTGGGLFITVEWINDN 632
>gi|409080187|gb|EKM80547.1| hypothetical protein AGABI1DRAFT_70926 [Agaricus bisporus var.
burnettii JB137-S8]
Length = 841
Score = 196 bits (497), Expect = 9e-47, Method: Compositional matrix adjust.
Identities = 117/349 (33%), Positives = 184/349 (52%), Gaps = 27/349 (7%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF + + I+YTGDY+ DRHL A + RPD+L+ ESTY +S+ +E F
Sbjct: 172 MFLIDIAGLKILYTGDYSREEDRHLIKAELPPIRPDVLVVESTYGVHTGESREEKEHRFT 231
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH + RGG VL+P FALGRAQEL ++L+ YW++ P+Y+A GL K Y+
Sbjct: 232 SLVHSIIRRGGHVLLPTFALGRAQELLLILDDYWKKHPDLHNVPVYYASGLARKCMAVYQ 291
Query: 119 MFITWTNQKIRKTFVQR-NMFDFKHI------RPFDKSFIDNPGPMVVFATPGMLHSGLS 171
+I N IR F +R N F FKHI R ++K D P P VV A+PG + G S
Sbjct: 292 TYIHTMNANIRSRFARRDNPFVFKHISNVPQTRGWEKKIADGP-PCVVLASPGFMQVGPS 350
Query: 172 LIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHA 231
+F+ W P N LI+ G+ ++GT +++ ++ I +++++ +SFSAH
Sbjct: 351 RELFEHWCPDARNGLIITGYSIEGTPARDIMTEPEEFTTLKGATIPRRISVDEISFSAHV 410
Query: 232 DAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF-----NLDCFMPANGESCFVQT 286
D + I+ ++V+LVHGEA+ M L+ ++ F ++ P N E
Sbjct: 411 DYAQNSEFIEQVRAQHVVLVHGEATGMGRLRAAMQSRFKEKDEDVKIHTPRNLE------ 464
Query: 287 DMKISIDVSVNLLK-EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMD 334
++++S + +A+ ++ P P + G+LV KD S +L+D
Sbjct: 465 ----TLNLSFRGERVAKAIGTLADSP-PQTGDIVSGLLVAKDYSYTLLD 508
>gi|167395302|ref|XP_001733549.1| Cleavage and polyadenylation specificity factor subunit [Entamoeba
dispar SAW760]
gi|165894214|gb|EDR22276.1| Cleavage and polyadenylation specificity factor subunit, putative
[Entamoeba dispar SAW760]
Length = 736
Score = 195 bits (496), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 110/314 (35%), Positives = 182/314 (57%), Gaps = 15/314 (4%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ I+YTGD++ DRHL AA + + D+++ ESTY ++S+ RE F+
Sbjct: 171 MFCIEINGVKILYTGDFSGETDRHLQAAEVPPFQIDVMMCESTYGIIEQESRIDRENAFI 230
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
+++ E + RGGK LIPVF+LGRAQE ++LE YW+ I+F + +K T Y++
Sbjct: 231 RQIMEILKRGGKCLIPVFSLGRAQEFELILEEYWQNHKDLWSVSIFFFSSIAKKCTTYFE 290
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNP---GPMVVFATPGMLHSGLSLIIF 175
F ++ NQ +RK + FDFK IR S D P VV A+PGML G+S IF
Sbjct: 291 KFTSFMNQDLRKK--TKQAFDFKFIREGSSSVDDGAIDYKPCVVMASPGMLQDGISRKIF 348
Query: 176 KKWAPVENNMLIMPGFCVQGTIGHKVLSGVKK--LEFENKQIIDVKMAIEYMSFSAHADA 233
++W + N +I+PG+CV+GT+ ++ K + E Q+I K ++ +SF AH+D
Sbjct: 349 ERWCTDKKNGVIIPGYCVEGTLAKDLILDSTKPFINSEGDQVIP-KCSVTEISFCAHSDF 407
Query: 234 KGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF-NLDCFMPANGESCFV----QTDM 288
+ I +PK+++L+HGE M L +K+E+ L+ +MP N + + + ++
Sbjct: 408 AHTRKFIGNVKPKHLVLIHGEGKSMEQLHNALKKEYPELNIYMPCNTQPIKIPIQPKHEV 467
Query: 289 KISIDVSVNLLKEE 302
++ +++ N+LK E
Sbjct: 468 RLLGEIASNILKME 481
>gi|71661559|ref|XP_817799.1| cleavage and polyadenylation specificity factor [Trypanosoma cruzi
strain CL Brener]
gi|70883012|gb|EAN95948.1| cleavage and polyadenylation specificity factor, putative
[Trypanosoma cruzi]
Length = 625
Score = 195 bits (496), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 109/276 (39%), Positives = 165/276 (59%), Gaps = 8/276 (2%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
+F V + +YTGD++ PDRHL A + PD+LI EST +S+ RE F
Sbjct: 51 LFMVDIAGMKTLYTGDFSRVPDRHLLGAEVPSYSPDILIAESTNGIRELESREERETLFT 110
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWE-RMNLQ-APIYFAVGLTEKATNYYK 118
VH+ V GG+ L+PVFALGRAQEL ++LE YWE LQ PIY+A L ++ Y+
Sbjct: 111 TWVHDVVKGGGRCLVPVFALGRAQELLLILEEYWEAHKELQHIPIYYASSLAQRCMKLYQ 170
Query: 119 MFITWTNQKIRKTFV-QRNMFDFKHIRPF--DKSFIDNPGPMVVFATPGMLHSGLSLIIF 175
F++ N ++++ RN F FK+I +SF D GP VV A+PGML SG+SL +F
Sbjct: 171 TFVSAMNDRVKQQHANHRNPFVFKYIHSLMETRSFEDT-GPCVVLASPGMLQSGISLELF 229
Query: 176 KKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKM-AIEYMSFSAHADAK 234
++W N +I+ G+CV GTI +L+ K++ + +++ ++M I+ +SFSAH+D +
Sbjct: 230 ERWCGDRRNGIIIAGYCVDGTIAKDILTKPKEVTKPDGKVLPLRMRTIQSVSFSAHSDGR 289
Query: 235 GIMQLIQYC-EPKNVLLVHGEASKMVFLKEKIKQEF 269
IQ + ++V+LVHG M L+ K++Q+F
Sbjct: 290 QTRDFIQALPKTQHVILVHGNVGAMGQLRNKLQQDF 325
>gi|402465801|gb|EJW01455.1| hypothetical protein EDEG_00447 [Edhazardia aedis USNM 41457]
Length = 774
Score = 195 bits (496), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 111/291 (38%), Positives = 174/291 (59%), Gaps = 20/291 (6%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKC--RPDLLITESTYATTIRDSKRCRERD 58
MF +K+G++ ++YTGDY+ PDRHL A KC + + LITESTY + RE+
Sbjct: 159 MFLLKIGSKKLLYTGDYSTEPDRHLKEA---KCPGKINFLITESTYGVQCHLPREEREKR 215
Query: 59 FLKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERM-NLQ-APIYFAVGLTEKATNY 116
FL V + + R GKVL+PVFALGRAQE+ ++LE YW+ +LQ PIY+A L +
Sbjct: 216 FLNAVRDIIKRRGKVLLPVFALGRAQEILLILEEYWDNNEDLQNVPIYYASALARRCIGI 275
Query: 117 YKMFITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFK 176
Y+ + + +++ + F FK+IR + +F D P VV A+PGML SGLS +F+
Sbjct: 276 YQQY-SQSDKNVD--------FKFKYIRNIN-TFDDRNLPCVVMASPGMLQSGLSRDLFE 325
Query: 177 KWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGI 236
KW + N +I+ G+CVQGT+ ++L+ +++E + + +++ +SFSAH D
Sbjct: 326 KWCEDKRNGVIIAGYCVQGTLAKEILNEPEEVEGIAGNKLKLNCSVDCISFSAHVDYLQN 385
Query: 237 MQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTD 287
+ I C+P+ + LVHGE ++M LK +K++ + F NGE ++ D
Sbjct: 386 KEFITSCDPEVLFLVHGEFNEMNRLKNALKRD---NTFGLRNGEFLHIKID 433
>gi|395332776|gb|EJF65154.1| Metallo-hydrolase/oxidoreductase [Dichomitus squalens LYAD-421 SS1]
Length = 809
Score = 195 bits (496), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 117/347 (33%), Positives = 179/347 (51%), Gaps = 23/347 (6%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF + + I+YTGDY+ DRHL A I PD+LI ESTY + + +E F
Sbjct: 162 MFLIDIAGLKILYTGDYSREEDRHLVKAEIPPIHPDVLIVESTYGVQSHEPRDDKEARFT 221
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH + RGG VL+P FALGRAQEL ++L+ YW + PIY+A L K Y+
Sbjct: 222 NLVHSIIRRGGHVLLPTFALGRAQELLLILDEYWAKHPDLHNVPIYYASSLARKCMAVYQ 281
Query: 119 MFITWTNQKIRKTFVQR-NMFDFKHI------RPFDKSFIDNPGPMVVFATPGMLHSGLS 171
+I N +R F +R N F FKHI R +++ + P P VV A+PG ++SG S
Sbjct: 282 TYIHTMNSNVRTRFAKRDNPFVFKHITNVPGTRGWERKIAEGP-PCVVLASPGFMNSGPS 340
Query: 172 LIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHA 231
+ + WA N I+ G+ V+GT+ +L+ ++ I K++++Y+SFSAH
Sbjct: 341 RELLELWASDSKNGCIVTGYSVEGTMARDILNEPSEIVGMKGNTIQRKLSVDYISFSAHV 400
Query: 232 DAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKIS 291
D + ++ +P++++LVHGEA+ M LK ++ + + E + T +
Sbjct: 401 DYSQNSEFMELVKPQHIVLVHGEATGMGRLKAAMQDRYK------SRDEEIKIHTPRNLE 454
Query: 292 IDVSVNLLKEEAVK----YNSEPPNPLKERQIHGVLVIKDSSISLMD 334
+ + E K + PP P I G+LV KD S +L+D
Sbjct: 455 T-LELTFRSERVAKAIGTLAAHPPQP--NDVISGLLVSKDYSYTLLD 498
>gi|407851025|gb|EKG05159.1| cleavage and polyadenylation specificity factor, putative
[Trypanosoma cruzi]
Length = 762
Score = 195 bits (495), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 109/276 (39%), Positives = 165/276 (59%), Gaps = 8/276 (2%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
+F V + +YTGD++ PDRHL A + PD+LI EST +S+ RE F
Sbjct: 188 LFMVDIAGMKTLYTGDFSRVPDRHLLGAEVPSYSPDILIAESTNGIRELESREERETLFT 247
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWE-RMNLQ-APIYFAVGLTEKATNYYK 118
VH+ V GG+ L+PVFALGRAQEL ++LE YWE LQ PIY+A L ++ Y+
Sbjct: 248 TWVHDVVKGGGRCLVPVFALGRAQELLLILEEYWEAHKELQHIPIYYASSLAQRCMKLYQ 307
Query: 119 MFITWTNQKIRKTFV-QRNMFDFKHIRPF--DKSFIDNPGPMVVFATPGMLHSGLSLIIF 175
F++ N ++++ RN F FK+I +SF D GP VV A+PGML SG+SL +F
Sbjct: 308 TFVSAMNDRVKQQHANHRNPFVFKYIHSLMETRSFEDT-GPCVVLASPGMLQSGISLELF 366
Query: 176 KKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKM-AIEYMSFSAHADAK 234
++W N +I+ G+CV GTI +L+ K++ + +++ ++M I+ +SFSAH+D +
Sbjct: 367 ERWCGDRRNGIIIAGYCVDGTIAKDILTKPKEVTKPDGKVLPLRMRTIQSVSFSAHSDGR 426
Query: 235 GIMQLIQYC-EPKNVLLVHGEASKMVFLKEKIKQEF 269
IQ + ++V+LVHG M L+ K++Q+F
Sbjct: 427 QTRDFIQALPKTQHVILVHGNVGAMGQLRNKLQQDF 462
>gi|146421308|ref|XP_001486604.1| hypothetical protein PGUG_02275 [Meyerozyma guilliermondii ATCC
6260]
Length = 770
Score = 195 bits (495), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 143/471 (30%), Positives = 237/471 (50%), Gaps = 62/471 (13%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
M+ V++G +++TGDY+ DRHL A + RPD+LITEST+ T + + +E
Sbjct: 174 MYFVEIGGLKVLFTGDYSREEDRHLQVAEVPPMRPDILITESTFGTATHEPRLEKEARMT 233
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW----ERMNLQAPIYFAVGLTEKATNY 116
K +H + +GG++L+PVFALGRAQEL ++LE YW + N+ ++FA L K
Sbjct: 234 KIIHLTLLKGGRILMPVFALGRAQELLLILEEYWLQNEDLHNIN--VFFASSLARKCMAV 291
Query: 117 YKMFITWTNQKIR-----KTFVQRNMFDFKH---IRPFDKSFIDNPGPMVVFATPGMLHS 168
Y+ + N IR + + N F FKH IR DK + GP VV A PGML +
Sbjct: 292 YQTYTNIMNDNIRHGVSSASGGKLNPFQFKHIKLIRSLDK--FQDIGPCVVVAAPGMLQN 349
Query: 169 GLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLE-FENKQI-IDVKMAIEYMS 226
G+S + ++WAP N +IM G+ V+GT+ ++L+ ++ +N + I +MAIE +S
Sbjct: 350 GVSRELLERWAPDAKNAVIMTGYSVEGTMAKELLTEPHTIQSLQNADVTIPRRMAIEEIS 409
Query: 227 FSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFN--------LDCFMPAN 278
F+AH D + I P ++LVHGE++ M LK + +++ + F P N
Sbjct: 410 FAAHVDFQENAGFIDMINPLKIILVHGESNPMGRLKSALLSKYSSRKGTEQEVKVFNPRN 469
Query: 279 GESCFVQT-DMKISIDVSVNLLKEEAVKYNSEPPNPLKERQ------------------- 318
+ + +KI+ V NL +EE K + LKE +
Sbjct: 470 CDELLIGIRGLKIA-KVLGNLAEEELAKLQKDISKKLKEDESKMDETTDGEVNGDTSEPI 528
Query: 319 -----IHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQI-SDSSPINKTLELIYD 372
I GVLV KD +S + + + + +S I++ +++I +D S + LE ++
Sbjct: 529 KTGHTIAGVLVSKDFDLSFLQMQDLHEYTQLSTSILKSKISLKINADISLMKWHLEQMFG 588
Query: 373 QLLSYLQDKSQEYKIQLTESSLSIDSVLLSVERIDDKNKRVFVT--WPNQD 421
++ + D +E++ L +D + + V+R +F+T W N +
Sbjct: 589 -YINIVNDDEEEWE------CLIMDKIEVLVDRTKTTGGGLFITVEWINDN 632
>gi|50363261|gb|AAT75333.1| cleavage polyadenylation specificity factor CPSF73 [Trypanosoma
cruzi]
Length = 762
Score = 195 bits (495), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 109/276 (39%), Positives = 165/276 (59%), Gaps = 8/276 (2%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
+F V + +YTGD++ PDRHL A + PD+LI EST +S+ RE F
Sbjct: 188 LFMVDIAGMKTLYTGDFSRVPDRHLLGAEVPSYSPDILIAESTNGIRELESREERETLFT 247
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWE-RMNLQ-APIYFAVGLTEKATNYYK 118
VH+ V GG+ L+PVFALGRAQEL ++LE YWE LQ PIY+A L ++ Y+
Sbjct: 248 TWVHDVVKGGGRCLVPVFALGRAQELLLILEEYWEAHKELQHIPIYYASSLAQRCMKLYQ 307
Query: 119 MFITWTNQKIRKTFV-QRNMFDFKHIRPF--DKSFIDNPGPMVVFATPGMLHSGLSLIIF 175
F++ N ++++ RN F FK+I +SF D GP VV A+PGML SG+SL +F
Sbjct: 308 TFVSAMNDRVKQQHANHRNPFVFKYIHSLMETRSFEDT-GPCVVLASPGMLQSGISLELF 366
Query: 176 KKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKM-AIEYMSFSAHADAK 234
++W N +I+ G+CV GTI +L+ K++ + +++ ++M I+ +SFSAH+D +
Sbjct: 367 ERWCGDRRNGIIIAGYCVDGTIAKDILTKPKEVTKPDGKVLPLRMRTIQSVSFSAHSDGR 426
Query: 235 GIMQLIQYC-EPKNVLLVHGEASKMVFLKEKIKQEF 269
IQ + ++V+LVHG M L+ K++Q+F
Sbjct: 427 QTRDFIQALPKTQHVILVHGNVGAMGQLRNKLQQDF 462
>gi|407411604|gb|EKF33594.1| cleavage and polyadenylation specificity factor, putative
[Trypanosoma cruzi marinkellei]
Length = 763
Score = 195 bits (495), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 109/276 (39%), Positives = 165/276 (59%), Gaps = 8/276 (2%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
+F V + +YTGD++ PDRHL A + PD+LI EST +S+ RE F
Sbjct: 189 LFMVDIAGMKTLYTGDFSRVPDRHLLGAEVPSYSPDILIAESTNGIRELESREERETLFT 248
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWE-RMNLQ-APIYFAVGLTEKATNYYK 118
VH+ V GG+ L+PVFALGRAQEL ++LE YWE LQ PIY+A L ++ Y+
Sbjct: 249 TWVHDVVKGGGRCLVPVFALGRAQELLLILEEYWEAHKELQHIPIYYASSLAQRCMKLYQ 308
Query: 119 MFITWTNQKIRKTFV-QRNMFDFKHIRPF--DKSFIDNPGPMVVFATPGMLHSGLSLIIF 175
F++ N ++++ RN F FK+I +SF D GP VV A+PGML SG+SL +F
Sbjct: 309 TFVSAMNDRVKQQHANHRNPFVFKYIHSLMETRSFEDT-GPCVVLASPGMLQSGISLELF 367
Query: 176 KKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKM-AIEYMSFSAHADAK 234
++W N +I+ G+CV GTI +L+ K++ + +++ ++M I+ +SFSAH+D +
Sbjct: 368 ERWCGDRRNGIIIAGYCVDGTIAKDILTKPKEVTKPDGKVLPLRMRTIQSVSFSAHSDGR 427
Query: 235 GIMQLIQYC-EPKNVLLVHGEASKMVFLKEKIKQEF 269
IQ + ++V+LVHG M L+ K++Q+F
Sbjct: 428 QTRDFIQALPKTQHVILVHGNVGAMGQLRNKLQQDF 463
>gi|126339387|ref|XP_001364140.1| PREDICTED: uncharacterized protein C12orf26-like [Monodelphis
domestica]
Length = 608
Score = 194 bits (494), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 136/472 (28%), Positives = 233/472 (49%), Gaps = 90/472 (19%)
Query: 635 LGSTITVDELFSKLQARKCGLVVESAKVSQLMSEKKSYEVQVMSQVVAAVTNSCDSSHII 694
G +++ L L+ + + K M+ KKS+EVQVMS+++ TN C +I
Sbjct: 122 FGVCTSLEHLLVTLKGNQHQRLGNKVKSDDFMNAKKSHEVQVMSELIGCFTNFCGIKQVI 181
Query: 695 DLGGGQGYLSTILALQHGKKTLSLDYNQVNTHGAAVRSKKL------------------- 735
DLG G+GYLS+ L++Q+G K +D + VNTHGA R++KL
Sbjct: 182 DLGSGKGYLSSFLSMQYGLKVYGIDSSNVNTHGAKERNRKLKKHWRVYQTRSRAHISSLV 241
Query: 736 -EDSEKESKGPFQSYAGVINKKLWMRTQNRQTSHAS---------------VEHHGK--- 776
E +E+ + Q I +KL + T H S E+H
Sbjct: 242 LEKAEENTTLDEQKCKANIKEKLLSINNSLTTQHTSSISDTVLPGFTDTRQTENHSDPLP 301
Query: 777 ----NWKRKSKAPVVSI------------------------KDEELVVCKDKCKQ----- 803
N+ ++K P++++ + + V KD+ K+
Sbjct: 302 QSDGNFLLENKCPLLNVLPIDAIELTSPATKDYKKFSEEHKRRKNTVPSKDQAKESNESS 361
Query: 804 ----ITHFVTPDSDISSILSQAYPQDSLHNVCIMGLHTCGDLSGTALRLFT-KSSLQCLV 858
+T F+T +S++ I++ L + ++GLHTCGDL+ LR+FT KS ++ +
Sbjct: 362 IYSPLTSFITAESELHDIITD------LEDCIMVGLHTCGDLAPNTLRIFTSKSEIKGVC 415
Query: 859 QVGCCYHLLEEEFIRSPFWKDVDQSLYEHGYGFPLSEHLRSRKFFLGRNVRMSGTQSPER 918
VGCCYHLL E++ + ++D ++ GFP+ + L+ + GRN RMS + ER
Sbjct: 416 SVGCCYHLLSEKYEKQ--FQDYTLEMW----GFPMCQFLKEEGWCCGRNARMSACLALER 469
Query: 919 VIDLKQTQTLPLFYRALLEKYLRSKITINDEEPKVVGRLATKCSNFVEYVHRAVDKLKLD 978
V + LFYRA+L++ ++ I + K VG++ +K ++F++YV +++ KL+LD
Sbjct: 470 VAVGQGVAIESLFYRAVLQEIIKECYGITQCD-KHVGKVYSKSASFLDYVRKSLKKLELD 528
Query: 979 -LEVDDEEVTRLFNSHQREYEYLQIYYFLKTALAPVIEALIVLDRVLYLREQ 1029
++ D+++ + ++ L+ + LK LAP IE LI++DR+ YL+EQ
Sbjct: 529 ESKLTDQKIMDYYEKYKPRMNELEAFNMLKVVLAPCIETLILMDRLCYLKEQ 580
Score = 42.4 bits (98), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 17/51 (33%), Positives = 33/51 (64%)
Query: 1044 TQVDDEEVTRLFDSHQREYEYLQIYYFLKTALAPVIEALIVLDRVLYLREQ 1094
+++ D+++ ++ ++ L+ + LK LAP IE LI++DR+ YL+EQ
Sbjct: 530 SKLTDQKIMDYYEKYKPRMNELEAFNMLKVVLAPCIETLILMDRLCYLKEQ 580
>gi|365990355|ref|XP_003672007.1| hypothetical protein NDAI_0I01950 [Naumovozyma dairenensis CBS 421]
gi|343770781|emb|CCD26764.1| hypothetical protein NDAI_0I01950 [Naumovozyma dairenensis CBS 421]
Length = 757
Score = 194 bits (494), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 103/285 (36%), Positives = 162/285 (56%), Gaps = 20/285 (7%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+++ +++TGDY+ DRHL +A + ++LI EST+ T + + RE+
Sbjct: 133 MFQIEIAGMRVLFTGDYSREKDRHLNSAEVPPLSSNILIVESTFGTATHEPRLNREKKLT 192
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW-----ERMNLQAPIYFAVGLTEKATN 115
+ +H V GG+VL+PVFALGRAQEL ++L+ YW E + Q PIY+A L K +
Sbjct: 193 QMIHHTVSHGGRVLMPVFALGRAQELMLILDEYWAQHAEELGDGQVPIYYASNLARKCMS 252
Query: 116 YYKMFITWTNQKIRKTF--VQRNMFDFKHIRPFDKSFIDN------PGPMVVFATPGMLH 167
++ ++ N IRK F Q N F FK+I S++ N GP V+ A+PGML
Sbjct: 253 VFQTYVNMMNDDIRKKFRDSQTNPFIFKNI-----SYLKNLEEFQDLGPSVMLASPGMLQ 307
Query: 168 SGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDV--KMAIEYM 225
SGLS + ++W P E N++++ G+ ++GT+ ++ + N + V + IE +
Sbjct: 308 SGLSRDLLERWCPDEKNLVLITGYSIEGTMAKYLMLEPDTIPSVNNPEVTVARRCNIEEI 367
Query: 226 SFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFN 270
SF+AH D + ++ IQ N++LVHGE++ M LK I FN
Sbjct: 368 SFAAHVDFQENLEFIQKINATNIILVHGESNPMGRLKSAILSNFN 412
>gi|260942735|ref|XP_002615666.1| hypothetical protein CLUG_04548 [Clavispora lusitaniae ATCC 42720]
gi|238850956|gb|EEQ40420.1| hypothetical protein CLUG_04548 [Clavispora lusitaniae ATCC 42720]
Length = 797
Score = 194 bits (492), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 107/302 (35%), Positives = 174/302 (57%), Gaps = 20/302 (6%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
M+ V++G +++TGDY+ DRHL A + RPD+LITEST+ T + + +E +
Sbjct: 183 MYFVEIGGLKVLFTGDYSREEDRHLKVAEVPPTRPDILITESTFGTATHEPRLEKETRMM 242
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWE-RMNLQ-APIYFAVGLTEKATNYYK 118
K +H + +GG++L+PVFALGRAQEL ++LE YW ++Q IY+A L K Y+
Sbjct: 243 KNIHSTILKGGRILMPVFALGRAQELLLILEEYWSLNEDIQNVNIYYASNLARKCMAVYQ 302
Query: 119 MFITWTNQKIR---KTFVQRNMFDFKH---IRPFDKSFIDNPGPMVVFATPGMLHSGLSL 172
+ + N+KIR + + N F FKH I+ DK I + GP VV A+PGML SG+S
Sbjct: 303 TYTSIMNEKIRLSASSSEKTNPFQFKHIKSIKSIDK--IQDMGPCVVVASPGMLQSGVSR 360
Query: 173 IIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKL-EFENKQI-IDVKMAIEYMSFSAH 230
+ ++WAP N +I+ G+ V+GT+ ++L+ + N ++ I +++I+ +SF+AH
Sbjct: 361 QLLERWAPDPKNAVILTGYSVEGTMAKELLAEPHTIPSITNPEVSIPRRLSIDEISFAAH 420
Query: 231 ADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFN--------LDCFMPANGESC 282
D + I+ PK ++LVHG+++ M LK + ++ + F P N E
Sbjct: 421 VDFQQNAGFIEEVGPKKIILVHGDSNPMGRLKSALLSRYSSRKGTEEEVKVFNPRNCEEL 480
Query: 283 FV 284
+
Sbjct: 481 LI 482
>gi|444315239|ref|XP_004178277.1| hypothetical protein TBLA_0A09750 [Tetrapisispora blattae CBS 6284]
gi|387511316|emb|CCH58758.1| hypothetical protein TBLA_0A09750 [Tetrapisispora blattae CBS 6284]
Length = 781
Score = 194 bits (492), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 108/299 (36%), Positives = 166/299 (55%), Gaps = 22/299 (7%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+++ I++TGDY+ DRHL +A I D+LI EST+ T + + RE
Sbjct: 176 MFQIEIAGLRILFTGDYSRELDRHLNSAEIPPLASDILIVESTFGTATHEPRLNREMKLT 235
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW-----ERMNLQAPIYFAVGLTEKATN 115
+ VH V RGG+VL+PVFALGRAQE+ ++L+ YW E Q PIY+A L +K +
Sbjct: 236 QLVHSIVSRGGRVLMPVFALGRAQEIMLILDEYWNNHHEELGGGQVPIYYASSLAKKCMS 295
Query: 116 YYKMFITWTNQKIRKTF--VQRNMFDFKHI---RPFDKSFIDNPGPMVVFATPGMLHSGL 170
++ ++ N IRK F Q N F FK+I R D ++ GP V+ A+PGML SG+
Sbjct: 296 VFQTYVNMMNDDIRKKFRDSQTNPFIFKNISYLRNLDN--FEDFGPSVLLASPGMLQSGI 353
Query: 171 SLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDV--KMAIEYMSFS 228
S + ++W P + NM+++ G+ V+GT+ ++ + N I + + IE +SF+
Sbjct: 354 SRDLLERWCPEDKNMVLITGYSVEGTMAKYLMVEPDTIPSINNPEISIPRRCKIEEISFA 413
Query: 229 AHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFN--------LDCFMPANG 279
AH D + ++ I+ N++LVHGE++ M LK I F + + P NG
Sbjct: 414 AHVDFQENLEFIEKINASNIILVHGESNPMGRLKSAILSNFTSLKGTENEVHVYNPRNG 472
>gi|429965431|gb|ELA47428.1| hypothetical protein VCUG_01079 [Vavraia culicis 'floridensis']
Length = 642
Score = 194 bits (492), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 101/285 (35%), Positives = 168/285 (58%), Gaps = 21/285 (7%)
Query: 34 RPDLLITESTYATTIRDSKRCRERDFLKKVHECVDRGGKVLIPVFALGRAQELCILLETY 93
RPD++ITE TY + R+ ++ + R+ + V +C++R GKVLIPVFA+GRAQE+ +L +
Sbjct: 352 RPDVMITECTYGSVDRECRKTKIRELITTVVQCIERNGKVLIPVFAIGRAQEMYTMLSAF 411
Query: 94 WERMNLQAPIYFAVGLTEKATNYYKMFITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDN 153
++ NL P ++ + E+ YK F +T +K+F FDF IR F ++ +
Sbjct: 412 FKNANLNVPFFYCSTILERGLKIYKRFDDYT----KKSF----HFDFDGIRMFKNEYLVS 463
Query: 154 PGPMVVFATPGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENK 213
P +VF++PGMLH+G SL IFK N++I+PG+C++ T+ K+L+G++ + + +
Sbjct: 464 DKPFIVFSSPGMLHTGNSLKIFKALCEDARNLVIIPGYCMKNTLAEKLLNGIRTITLDRE 523
Query: 214 QIIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQ------ 267
I ++ + + FSAHAD+ GI++ I P+NV+LVHG+ ++MV K I++
Sbjct: 524 YTI--RIQVRNIGFSAHADSSGILRFIAKVRPRNVVLVHGDKNRMVKFKRIIEKMCVDED 581
Query: 268 --EFNLDCFMPANGESCFVQTDMKISIDV-SVNLLKEEAVKYNSE 309
+F FMP N C V K I + S N +K++ + + E
Sbjct: 582 GFDFRCHVFMPRN--KCLVDLPGKNEIILKSRNEIKDDEIAVDVE 624
>gi|392593709|gb|EIW83034.1| Metallo-hydrolase oxidoreductase [Coniophora puteana RWD-64-598
SS2]
Length = 770
Score = 193 bits (491), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 119/361 (32%), Positives = 184/361 (50%), Gaps = 29/361 (8%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF + + I+YTGDY+ DRHL A + RPD+LI ESTY + + +E F
Sbjct: 162 MFLIDIAGLKILYTGDYSREEDRHLVKAEVPPVRPDVLIVESTYGVQSLECREDKEARFT 221
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH + RGG VL+P FALGRAQEL ++L+ YW+R PIY+A L K Y+
Sbjct: 222 GLVHSIIRRGGHVLLPAFALGRAQELLLILDEYWKRHPDLHNVPIYYASNLARKCMAVYQ 281
Query: 119 MFITWTNQKIRKTFVQR-NMFDFKHI------RPFDKSFIDNPGPMVVFATPGMLHSGLS 171
+I N IR F +R N F FKHI + +++ + P P VV A+PG SG S
Sbjct: 282 TYIHTMNSNIRTRFAKRDNPFVFKHISNLPQPKGWERKIAEGP-PCVVLASPGFCQSGPS 340
Query: 172 LIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHA 231
+ + WAP N I+ G+ V+GT+ +L+ ++ II K++++Y+SFSAH
Sbjct: 341 RELLELWAPDARNGFILTGYSVEGTMARDILNEPDEIMSTKGNIIPRKISVDYISFSAHV 400
Query: 232 DAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF-----NLDCFMPANGESCFVQT 286
D + I + ++++LVHGE + M L+ + ++ ++ P N E+
Sbjct: 401 DYSQNAEFIDLVKAQHIVLVHGEQTAMGRLRGALTSKYKDREEDVKIHTPRNLET----- 455
Query: 287 DMKISIDVSVNLLKEEAVK-YNSEPPNPLKERQI-HGVLVIKDSSISLMDVDEACKEVGI 344
+ + E K + NP ++ + G+LV KD S +L+D + G+
Sbjct: 456 -------LELTFRGERVAKAIGTLADNPPQQNDVLSGLLVAKDYSYTLLDPRDLRDFAGL 508
Query: 345 S 345
S
Sbjct: 509 S 509
>gi|255718827|ref|XP_002555694.1| KLTH0G15202p [Lachancea thermotolerans]
gi|238937078|emb|CAR25257.1| KLTH0G15202p [Lachancea thermotolerans CBS 6340]
Length = 755
Score = 193 bits (491), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 103/285 (36%), Positives = 166/285 (58%), Gaps = 20/285 (7%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQV++ I++TGDY+ DRHL +A + D+LI EST+ T + + RE+
Sbjct: 171 MFQVEIAGLKILFTGDYSRETDRHLNSAEVPPSSSDVLIVESTFGTATHEPRINREKKLT 230
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW-----ERMNLQAPIYFAVGLTEKATN 115
+ +H V RGG+VL+PVFALGRAQE+ ++L+ YW E N Q P+++A L +K +
Sbjct: 231 QLIHSTVMRGGRVLLPVFALGRAQEIMLILDEYWSQHADELGNGQVPVFYASNLAKKCMS 290
Query: 116 YYKMFITWTNQKIRKTF--VQRNMFDFKHIRPFDKSFIDNP------GPMVVFATPGMLH 167
++ ++ N IRK F Q N F FK+I S++ N GP V+ A+PGML
Sbjct: 291 VFQTYVNMMNDDIRKKFRDSQSNPFIFKNI-----SYLKNLDEFQDFGPSVMLASPGMLQ 345
Query: 168 SGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDV--KMAIEYM 225
+GLS + +KW P E N++++ G+ V+GT+ ++ + N I++ + IE +
Sbjct: 346 NGLSRDLLEKWCPGEKNLVLITGYSVEGTMAKFIMLEPDTIPSINNPEINIPRRCQIEEI 405
Query: 226 SFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFN 270
SF+AH D + ++ I+ +N++LVHGE++ M LK + F+
Sbjct: 406 SFAAHVDFRENLEFIEKVGARNIILVHGESNPMGRLKSALLSNFS 450
>gi|67479721|ref|XP_655242.1| cleavage and polyadenylation specificity factor 73 kDa subunit
[Entamoeba histolytica HM-1:IMSS]
gi|56472366|gb|EAL49856.1| cleavage and polyadenylation specificity factor 73 kDa subunit,
putative [Entamoeba histolytica HM-1:IMSS]
gi|449703858|gb|EMD44220.1| cleavage and polyadenylation specificity factor 73 kDa subunit,
putative [Entamoeba histolytica KU27]
Length = 755
Score = 193 bits (491), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 108/312 (34%), Positives = 182/312 (58%), Gaps = 15/312 (4%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ I+YTGD++ DRHL AA + + D+++ ESTY ++S+ RE F+
Sbjct: 171 MFCIEINGVKILYTGDFSGETDRHLQAAEVPPFQIDVMMCESTYGIIEQESRIDRENAFI 230
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
+++ E + RGGK LIPVF+LGRAQE ++LE YW+ I+F + +K T Y++
Sbjct: 231 RQIIEILKRGGKCLIPVFSLGRAQEFELILEEYWQNHKDLWSVSIFFFSSIAKKCTTYFE 290
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNP---GPMVVFATPGMLHSGLSLIIF 175
F ++ NQ++RK + FDFK IR S D P VV A+PGML G+S IF
Sbjct: 291 KFTSFMNQELRKK--TKQAFDFKFIREGSSSVDDGAIDYKPCVVMASPGMLQDGISRKIF 348
Query: 176 KKWAPVENNMLIMPGFCVQGTIGHKVLSGVKK--LEFENKQIIDVKMAIEYMSFSAHADA 233
++W + N +I+PG+CV+GT+ ++ K + E Q++ K ++ +SF AH+D
Sbjct: 349 ERWCTDKKNGVIIPGYCVEGTLAKDLILDSTKPFINSEGDQVVP-KCSVTEISFCAHSDF 407
Query: 234 KGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF-NLDCFMPANGESCFV----QTDM 288
+ I +PK+++L+HGE M L +K+E+ L+ +MP N + + + ++
Sbjct: 408 AHTRKFIGNVKPKHLVLIHGEGKSMEQLHNALKKEYPELNIYMPCNTQPIKIPIQPKHEV 467
Query: 289 KISIDVSVNLLK 300
++ +++ N+LK
Sbjct: 468 RLLGEIASNILK 479
>gi|407041778|gb|EKE40943.1| cleavage and polyadenylation specificity factor 73 kDa subunit,
putative [Entamoeba nuttalli P19]
Length = 751
Score = 193 bits (491), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 108/312 (34%), Positives = 182/312 (58%), Gaps = 15/312 (4%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ I+YTGD++ DRHL AA + + D+++ ESTY ++S+ RE F+
Sbjct: 171 MFCIEINGVKILYTGDFSGETDRHLQAAEVPPFQIDVMMCESTYGIIEQESRIDRENAFI 230
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
+++ E + RGGK LIPVF+LGRAQE ++LE YW+ I+F + +K T Y++
Sbjct: 231 RQIIEILKRGGKCLIPVFSLGRAQEFELILEEYWQNHKDLWSVSIFFFSSIAKKCTTYFE 290
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNP---GPMVVFATPGMLHSGLSLIIF 175
F ++ NQ++RK + FDFK IR S D P VV A+PGML G+S IF
Sbjct: 291 KFTSFMNQELRKK--TKQAFDFKFIREGSSSVDDGAIDYKPCVVMASPGMLQDGISRKIF 348
Query: 176 KKWAPVENNMLIMPGFCVQGTIGHKVLSGVKK--LEFENKQIIDVKMAIEYMSFSAHADA 233
++W + N +I+PG+CV+GT+ ++ K + E Q++ K ++ +SF AH+D
Sbjct: 349 ERWCTDKKNGVIIPGYCVEGTLAKDLILDSTKPFINSEGDQVVP-KCSVTEISFCAHSDF 407
Query: 234 KGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF-NLDCFMPANGESCFV----QTDM 288
+ I +PK+++L+HGE M L +K+E+ L+ +MP N + + + ++
Sbjct: 408 AHTRKFIGNVKPKHLVLIHGEGKSMEQLHNALKKEYPELNIYMPCNTQPIKIPIQPKHEV 467
Query: 289 KISIDVSVNLLK 300
++ +++ N+LK
Sbjct: 468 RLLGEIASNILK 479
>gi|390602470|gb|EIN11863.1| Metallo-hydrolase/oxidoreductase, partial [Punctularia
strigosozonata HHB-11173 SS5]
Length = 721
Score = 192 bits (489), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 120/364 (32%), Positives = 187/364 (51%), Gaps = 26/364 (7%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF + + I+YTGDY+ DRHL A + RPD+LI EST+ DS+ +E F
Sbjct: 162 MFLIDIAGLKILYTGDYSREEDRHLVKAELPPIRPDVLIAESTWGVQSGDSREEKEARFT 221
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH + RGG VL+P FA+GRAQEL ++L+ YW + PIY+A L K Y+
Sbjct: 222 NIVHSIIKRGGHVLMPTFAIGRAQELLLILDEYWSKHPELHNVPIYYASSLARKCMAVYQ 281
Query: 119 MFITWTNQKIRKTFVQR-NMFDFKHI------RPFDKSFIDNPGPMVVFATPGMLHSGLS 171
+I N IR F +R N F FKHI R +++ + P P V+ A+PGML SG S
Sbjct: 282 TYIHTMNSNIRSRFAKRDNPFVFKHISHAPQNRGWERKLAEGP-PCVILASPGMLQSGPS 340
Query: 172 LIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLE-FENKQIIDVKMAIEYMSFSAH 230
+ + AP N L++ G+ V+GT +++ +++ + I K++++Y+SFSAH
Sbjct: 341 RELLELLAPDSRNGLVLTGYSVEGTPARDIINEPDEIQSYRTGAPIPRKLSVDYISFSAH 400
Query: 231 ADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFN-----LDCFMPANGESCFVQ 285
D + I+ ++V+LVHGE + M L ++ ++ + MP N E+
Sbjct: 401 VDGAQNQEFIEKVGAQHVILVHGEQTAMGRLSAALQAKYKARDEEVKIHMPRNSETL--- 457
Query: 286 TDMKISIDVSVNLLKEEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGIS 345
D+ + + A S+PP + G+LV KD S +L+ + G+S
Sbjct: 458 -DLTFRGERVAKAIGTLA----SKPPQ--ANETLSGLLVSKDYSYTLLAPRDLRDFAGLS 510
Query: 346 RHIV 349
+V
Sbjct: 511 TCVV 514
>gi|299752177|ref|XP_001830756.2| mRNA 3'-end-processing protein YSH1 [Coprinopsis cinerea
okayama7#130]
gi|298409712|gb|EAU91125.2| mRNA 3'-end-processing protein YSH1 [Coprinopsis cinerea
okayama7#130]
Length = 846
Score = 192 bits (489), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 122/368 (33%), Positives = 185/368 (50%), Gaps = 35/368 (9%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF + + I+YTGDY+ DRHL A + RPD+LI ESTY + + +E F
Sbjct: 162 MFLIDIAGLKILYTGDYSREEDRHLVKAELPPIRPDVLIVESTYGVHTLEGREEKEARFT 221
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH + RGG VL+P FALGRAQEL ++L+ YW++ PIY+A L K Y+
Sbjct: 222 TLVHSIIRRGGHVLLPAFALGRAQELLLILDEYWKKHPDLHNVPIYYASSLARKCMAVYQ 281
Query: 119 MFITWTNQKIRKTFVQR-NMFDFKHI------RPFDKSFIDNPGPMVVFATPGMLHSGLS 171
+I N IR F +R N F FK+I R ++K + P P VV A+PG + G S
Sbjct: 282 TYIHTMNANIRTRFAKRDNPFVFKYISNLPQTRGWEKKIAEGP-PCVVLASPGFMQVGPS 340
Query: 172 LIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHA 231
+F+ WAP N LI+ G+ ++GT+ +++ + I K++++Y+SFSAH
Sbjct: 341 RELFELWAPDARNGLIITGYSIEGTLARDIMTEPEDFIGLKGNTIPRKISVDYISFSAHV 400
Query: 232 DAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKIS 291
D + IQ + ++V+LVHGE + M L+ + +F + D +
Sbjct: 401 DYAQNSEFIQAVKAQHVVLVHGEQNTMGRLRAAMAAKFK--------------ERDEDVK 446
Query: 292 IDVSVNLLKEE----------AVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKE 341
I NL E A+ ++ P P + G+LV KD S +L+D +
Sbjct: 447 IHTPRNLETLELSFRGERVAKAIGTLADKP-PQANDVLSGLLVAKDYSYTLLDPRDLKDF 505
Query: 342 VGISRHIV 349
G+S +V
Sbjct: 506 TGLSTCVV 513
>gi|219121689|ref|XP_002181194.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217407180|gb|EEC47117.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 602
Score = 192 bits (488), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 102/270 (37%), Positives = 156/270 (57%), Gaps = 5/270 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF + +G + I+YTGDY+M DRHL AA I PD+LI E+TY + S+ RE F
Sbjct: 160 MFFLSLGGRKILYTGDYSMEDDRHLMAAEIPAESPDVLIVEATYGVQVHASRAEREARFT 219
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWE-RMNLQ-APIYFAVGLTEKATNYYK 118
+ + RGG+ LIPVFALGRAQEL ++L+ YW+ +LQ PI++A L +A Y+
Sbjct: 220 GTIERVISRGGRCLIPVFALGRAQELLLILDEYWQANPHLQNIPIWYASKLASRALRVYQ 279
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHI---RPFDKSFIDNPGPMVVFATPGMLHSGLSLIIF 175
+ N +IR N F F+ I + D + D+ GP VVFA+PGML SG+S +F
Sbjct: 280 TYANMMNARIRSQMDVSNPFRFRFIQNLKSIDVNSFDDSGPSVVFASPGMLQSGVSRQLF 339
Query: 176 KKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKG 235
+WA N +++ G+ V+ T+ ++++ K++ + + ++Y+SFSAH D
Sbjct: 340 DRWASDHKNGVLIAGYAVEHTLAKEIMAQPKEVVTLEGRRQPLNALVDYVSFSAHVDFVQ 399
Query: 236 IMQLIQYCEPKNVLLVHGEASKMVFLKEKI 265
I PK+++LVHG+ +M LK +
Sbjct: 400 NRSFINQVAPKHIILVHGQKDEMGRLKSAL 429
>gi|428172766|gb|EKX41673.1| hypothetical protein GUITHDRAFT_74597 [Guillardia theta CCMP2712]
Length = 615
Score = 192 bits (488), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 116/353 (32%), Positives = 190/353 (53%), Gaps = 21/353 (5%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V++ ++YTGDY+ DRHL AA + + +L+ ESTY + +R RE+ FL
Sbjct: 178 MFIVQIAGVRLLYTGDYSRQEDRHLMAAEMPSVQVHVLVVESTYGVQTHEPRRSREKRFL 237
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQA---PIYFAVGLTEKATNYY 117
+ V + GG+VL+PVFA+GRAQEL +LL+ YW R N + PI G+ ++ Y
Sbjct: 238 EAVVSTLQLGGRVLLPVFAIGRAQELLLLLDEYW-RKNPELHRYPIICLSGMAKRCIASY 296
Query: 118 KMFITWTNQKIRKTFVQRNMFDFKHIRPFD--KSFIDNPGPMVVFATPGMLHSGLSLIIF 175
+ +I N +IR N F+F+HIR F DN P VV A+PGML +G S +F
Sbjct: 297 QTYINQMNNRIRHLNDIENPFEFRHIRYMTTMAEFQDN-CPCVVMASPGMLQNGPSRDLF 355
Query: 176 KKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLE-FENKQIIDVKMAIEYMSFSAHADAK 234
+W +N +++ G+CVQ T+ ++L ++ + + +K+ + Y+SFSAHAD
Sbjct: 356 DRWCEYRHNSVVITGYCVQNTLAKELLDAQPATHTLQDGKEVPLKIRVHYISFSAHADFI 415
Query: 235 GIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF---NLDCFMPANGESCFVQTDMKIS 291
Q I P +++LVHGEA+ M LK+++ ++ + + P N E +
Sbjct: 416 QTSQFIDELRPSHIVLVHGEANMMASLKKQLVSKYEHMGVGVWSPQNTE----------N 465
Query: 292 IDVSVNLLKEEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGI 344
+ ++ + K V + P +HGV++ K+ ++++ VDE G+
Sbjct: 466 VTLTFHESKVAKVVGSLAVERPKTGGSLHGVVLHKNFQLTIVGVDEMGAHAGL 518
>gi|242220452|ref|XP_002475992.1| predicted protein [Postia placenta Mad-698-R]
gi|220724781|gb|EED78801.1| predicted protein [Postia placenta Mad-698-R]
Length = 825
Score = 192 bits (488), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 118/362 (32%), Positives = 190/362 (52%), Gaps = 23/362 (6%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF + + I+YTGDY+ DRHL A + RPD+LI ESTY + + +E F
Sbjct: 153 MFLIDIAGLKILYTGDYSREEDRHLVKAEVPPIRPDVLIIESTYGVQTLEGREEKELRFT 212
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERM-NLQ-APIYFAVGLTEKATNYYK 118
VH + RGG VL+P FALGRAQEL ++L+ YW++ +LQ PIY+A L K+ Y+
Sbjct: 213 NLVHSIIRRGGHVLLPTFALGRAQELLLILDEYWKKHPDLQNVPIYYASSLARKSMAVYQ 272
Query: 119 MFITWTNQKIRKTFVQR-NMFDFKHI------RPFDKSFIDNPGPMVVFATPGMLHSGLS 171
+I N +R F +R N F FKHI + +++ + P P VV A+PG + SG S
Sbjct: 273 TYIHTMNSNVRSRFAKRDNPFVFKHISNLPQSKGWERKIAEGP-PCVVLASPGFMTSGAS 331
Query: 172 LIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHA 231
+ + WAP N +I+ G+ ++GT+ ++ S +++ I K++++ +SFSAH
Sbjct: 332 RELLELWAPDSRNGVIITGYSIEGTMAREIQSEPEEIVSLKGVPIPRKISVDEISFSAHV 391
Query: 232 DAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKIS 291
D + I+ + ++++LVHGE + M L+ + + + E + T +
Sbjct: 392 DYSQNSEFIEMIKAQHIVLVHGEQTAMGRLRAAMTDRYK------SRDEDVKIHTPRNLE 445
Query: 292 IDVSVNLLKEEAVK----YNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRH 347
+ + E K S+PP P + G+LV KD S +L+D + G+S
Sbjct: 446 T-LELTFRGERVAKAIGTLASQPPQPGD--TLSGLLVSKDYSYTLLDPRDLRDFAGLSTC 502
Query: 348 IV 349
+V
Sbjct: 503 VV 504
>gi|336371935|gb|EGO00275.1| hypothetical protein SERLA73DRAFT_73000 [Serpula lacrymans var.
lacrymans S7.3]
gi|336384684|gb|EGO25832.1| hypothetical protein SERLADRAFT_437559 [Serpula lacrymans var.
lacrymans S7.9]
Length = 748
Score = 192 bits (487), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 122/362 (33%), Positives = 186/362 (51%), Gaps = 31/362 (8%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF + + I+YTGDY+ DRHL +A + RPD+LI ESTY +++ +E F
Sbjct: 162 MFLIDIAGLKILYTGDYSREEDRHLVSAEVPPVRPDVLIVESTYGVQSLEARDEKEVRFT 221
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH + RGG VL+P FALGRAQEL ++L+ YW++ IY+A L K Y+
Sbjct: 222 SLVHSIIRRGGHVLLPTFALGRAQELLLILDEYWKKHPDLHNVTIYYASSLARKCMAVYQ 281
Query: 119 MFITWTNQKIRKTFVQR-NMFDFKHI------RPFDKSFIDNPGPMVVFATPGMLHSGLS 171
+I N IR F +R N F FKHI R +++ D P P VV A+PG L SG S
Sbjct: 282 TYIHTMNANIRSRFAKRDNPFVFKHISNLAQPRGWERKIADGP-PCVVLASPGFLQSGPS 340
Query: 172 LIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHA 231
+ + WAP N LI+ G+ V+GT+ +++ ++ I K++++Y+SFSAH
Sbjct: 341 RELLELWAPDPRNGLIVTGYSVEGTLARDIMNEPDEIMSLKGNPIPRKISVDYISFSAHV 400
Query: 232 DAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF-----NLDCFMPANGES---CF 283
D I+ + ++V+LVHGE + M L+ + + ++ P N E+ F
Sbjct: 401 DYTQNSDFIEQVKAQHVVLVHGEQTAMGRLRAAMTSRYKDRDEDVKIHTPRNLETLELSF 460
Query: 284 VQTDMKISIDVSVNLLKEEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVG 343
+ +I N + E PN + + G+LV KD S +L+D + G
Sbjct: 461 RGERVAKAIGTLANKVPE---------PNSI----LSGLLVSKDYSYTLLDPRDLKDFTG 507
Query: 344 IS 345
+S
Sbjct: 508 LS 509
>gi|366992944|ref|XP_003676237.1| hypothetical protein NCAS_0D02950 [Naumovozyma castellii CBS 4309]
gi|342302103|emb|CCC69876.1| hypothetical protein NCAS_0D02950 [Naumovozyma castellii CBS 4309]
Length = 771
Score = 192 bits (487), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 101/284 (35%), Positives = 164/284 (57%), Gaps = 20/284 (7%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+++ +++TGDY+ DRHL +A + D+LI EST+ T + + RE++
Sbjct: 167 MFQIEIAGLRVLFTGDYSREMDRHLNSAEVPSLPSDVLIVESTFGTATHEPRLNREKNLT 226
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW-----ERMNLQAPIYFAVGLTEKATN 115
+ +H V RGG+VL+PVFALGRAQE+ ++L+ YW E + Q PIY+A L +K +
Sbjct: 227 QLIHSTVSRGGRVLLPVFALGRAQEIMLILDEYWSQHAEELGSGQVPIYYASNLAKKCMS 286
Query: 116 YYKMFITWTNQKIRKTF--VQRNMFDFKHIRPFDKSFIDNP------GPMVVFATPGMLH 167
++ ++ N IR+ F Q N F FK+I S++ N GP V+ A+PGML
Sbjct: 287 VFQTYVNMMNDDIRRKFRDSQTNPFIFKNI-----SYLRNLEEFQDFGPSVMLASPGMLQ 341
Query: 168 SGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDV--KMAIEYM 225
SGLS + +KW P E N++++ G+ ++GT+ ++ + N + V + +E +
Sbjct: 342 SGLSRDVLEKWCPDEKNLVLITGYSIEGTMAKFIMLEPDTIPSINNPEVTVPRRCNVEEI 401
Query: 226 SFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF 269
SF+AH D + ++ I+ N++LVHGE++ M LK I F
Sbjct: 402 SFAAHVDFQENLEFIEKISANNIILVHGESNPMGRLKSAILSNF 445
>gi|312080023|ref|XP_003142424.1| cpsf3-prov protein [Loa loa]
Length = 715
Score = 191 bits (486), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 123/389 (31%), Positives = 192/389 (49%), Gaps = 60/389 (15%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ I+YTGD++ DRHL AA + PD+LI ESTY T + +S+ RE+
Sbjct: 168 MFMIEIAGVRILYTGDFSRLEDRHLCAAELPTVSPDVLICESTYGTQVHESRDEREK--- 224
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWE------------------RMNLQAP 102
V RGG+ LIP FALGRAQEL ++L+ YWE M + P
Sbjct: 225 -----VVGRGGRCLIPAFALGRAQELLLILDEYWEAHPELQDIPNNPVCCNADEMTVVEP 279
Query: 103 -----------IYF--AVGLTEKATNYYKMFITWTNQKIRKTFVQRNMFDFKHIRPFDK- 148
I+F A L +K Y+ F++ N +I+K N F FKH+
Sbjct: 280 NRSVIVGIDLLIFFDHASSLAKKCMAVYQTFVSGMNSRIQKQIALNNPFVFKHVSNLKSI 339
Query: 149 SFIDNPGPMVVFATPGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKL 208
++ GP VV A+PGML +GLS +F+ W N I+ G+CV+GT+ +LS +++
Sbjct: 340 DHFEDVGPCVVLASPGMLQNGLSRELFENWCTDSKNGCIIAGYCVEGTLAKHILSEPEEI 399
Query: 209 EFENKQIIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKI--- 265
N Q + +++ + Y+SFSAH D I+ P +++LVHGE ++M LK I
Sbjct: 400 VAMNGQKLAMRLQVAYISFSAHTDYTQTSDFIRALRPPHLVLVHGEMNEMNRLKAAIIRQ 459
Query: 266 ---KQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLKEEAVKYNSEPP--NPLKERQIH 320
+ +F+++ + P N ES V ++ E+ K + P R +
Sbjct: 460 YEDESDFHIEVYNPRNTES------------VELHFRGEKTAKVVGKMAMTTPGDGRILS 507
Query: 321 GVLVIKDSSISLMDVDEACKEVGISRHIV 349
GVL+ ++ + LM D+ +S I+
Sbjct: 508 GVLIRRNFNYHLMHADDLSAYTDLSNSIL 536
>gi|452819966|gb|EME27015.1| cleavage and polyadenylation specifity factor protein [Galdieria
sulphuraria]
Length = 717
Score = 191 bits (486), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 121/361 (33%), Positives = 193/361 (53%), Gaps = 34/361 (9%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V++ ++YTGD++ DRHL A I PD++I ESTY + + ++ RE F
Sbjct: 176 MFLVEIAGVRVLYTGDFSRQEDRHLKEAEIPPFPPDIIIVESTYGVQVHEPRKIREARFT 235
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWE-RMNLQ-APIYFAVGLTEKATNYYK 118
+KV E V RGG+VL+PVFALGRAQEL ++LE YWE +LQ PIY+A L ++ + Y+
Sbjct: 236 QKVAEIVRRGGRVLLPVFALGRAQELLLILEEYWEAHPDLQDIPIYYASSLAKRCMSVYQ 295
Query: 119 MFITWTNQKIRKTFVQRNMFDFKH------IRPFDKSFIDNPGPMVVFATPGMLHSGLSL 172
+I N IRK + N F FK+ I+ FD S GP V A+PGML SGLS
Sbjct: 296 TYINMMNDNIRKRYEVSNPFAFKYVLNVKNIQDFDDS-----GPCVFMASPGMLQSGLSR 350
Query: 173 IIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHAD 232
+ ++W N +I+PG+ V+GT+ +LS + + + + ++ +++Y++FSAH+D
Sbjct: 351 ELCERWCTDRRNGIILPGYSVEGTLAKHILSEPSTITRLDGKEVPLRCSVDYITFSAHSD 410
Query: 233 AKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF-------NLDCFMPANGESCFVQ 285
+ I +++LVHGE ++M L+ + ++ N+ + P N ++
Sbjct: 411 FLQTSEFIDRSRSGHIVLVHGEMTEMGRLRNALDLKYNKGQSVPNIRIYTPKNCQT---- 466
Query: 286 TDMKISIDVSVNLLKEEAVKYNSEPP--NPLKERQIHGVLVIKDSSISLMDVDEACKEVG 343
V + E+ K E P + G+LV +D + +LM +E G
Sbjct: 467 --------VRLTFRAEKVAKAIGEVAKRKPEDGASMSGILVKRDFNYTLMSPNELNTYTG 518
Query: 344 I 344
+
Sbjct: 519 L 519
>gi|443721867|gb|ELU10992.1| hypothetical protein CAPTEDRAFT_157654 [Capitella teleta]
Length = 475
Score = 191 bits (486), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 126/411 (30%), Positives = 213/411 (51%), Gaps = 46/411 (11%)
Query: 626 SCDSSHIIDLGSTITVDELFSKLQARKCGLVVESAKVSQLMSEKKSYEVQVMSQVVAAVT 685
+C+ + +L +DE F + + ++VES M++KK +EV +M+++ +
Sbjct: 73 ACEHHTLPNLRVLTPLDE-FQSTEDYEGKVIVES-----FMTDKKCHEVHLMAELCFHLA 126
Query: 686 NSCDSSHIIDLGGGQGYLSTILALQHGKKTLSLDYNQVNTHGAAVRSKKLEDSEKESKGP 745
S ++ ++D+G G+GYL L+L H L +D ++VNT A R ++LE
Sbjct: 127 KSSNTQRVVDIGSGRGYLGCHLSLMHNLCVLGIDSSEVNTSSADQRKQRLE--------- 177
Query: 746 FQSYAGVINKKLWMRTQNRQTSHASVEHHGKNWKRKSKAPVVSIKDEELVVCKDKCKQIT 805
+ + G++ ++N + + +S K IT
Sbjct: 178 -KHWQGLLRN-----SENPERKKGRRRRKSPQGQPRSTP------------AKSNYVPIT 219
Query: 806 HFVTPDSDISSILSQAYPQDS---LHNVCIMGLHTCGDLSGTALRLF-TKSSLQCLVQVG 861
HFVT ++D+ +++ +P + N + GLHTCG+L + +RLF + S L L +G
Sbjct: 220 HFVTNETDLMQLVTDHFPFEGDAECGNFMLAGLHTCGNLGASIMRLFASNSDLGILCNIG 279
Query: 862 CCYHLLEEEFIRSPFWK--DVDQSLYEHGYGFPLSEHLRSRKFFLGRNVRMSGTQSPERV 919
CCYHLLEE+++R+P K + D L GFPL +L +K +GRN RM QS ER+
Sbjct: 280 CCYHLLEEDYLRNPHLKHDEPDSPL----TGFPLCSYLTGKKAQIGRNARMLAAQSLERM 335
Query: 920 IDLKQTQTLPLFYRALLEKYLRSKITINDEEPKVVGRLATKCSNFVEYVHRAVDKLKLD- 978
D Q + LF+RA+L+ LR + + + VGR+A+KCS F +Y+ R DKL++D
Sbjct: 336 ADGCQLPNISLFWRAVLQVMLRDAVGYVRADWQ-VGRVASKCSGFNDYLRRCWDKLQMDS 394
Query: 979 LEVDDEEVTRLFNSHQREYEYLQIYYFLKTALAPVIEALIVLDRVLYLREQ 1029
++ D+ + + + + ++ LK AP IE+LI+LDR+ +L EQ
Sbjct: 395 AKLKDDSADEYLTKYPLQKQ-MSAFFLLKALFAPCIESLILLDRLAFLLEQ 444
>gi|367005895|ref|XP_003687679.1| hypothetical protein TPHA_0K01110 [Tetrapisispora phaffii CBS 4417]
gi|357525984|emb|CCE65245.1| hypothetical protein TPHA_0K01110 [Tetrapisispora phaffii CBS 4417]
Length = 790
Score = 191 bits (485), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 101/280 (36%), Positives = 164/280 (58%), Gaps = 12/280 (4%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+++G+ +++TGDY+ DRHL +A I D+LI EST+ T + + RE+
Sbjct: 166 MFQIELGSLRVLFTGDYSRELDRHLNSAEIPPLASDVLIVESTFGTATHEPRLSREKKLT 225
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW-----ERMNLQAPIYFAVGLTEKATN 115
+ +H V +GG+VL+PVFALGRAQEL ++L+ YW E N Q PIY+A L +++ +
Sbjct: 226 QLIHSTVTKGGRVLMPVFALGRAQELMLILDEYWSHNEEELGNGQVPIYYASNLAKRSMS 285
Query: 116 YYKMFITWTNQKIRKTF--VQRNMFDFKHIRPFDK--SFIDNPGPMVVFATPGMLHSGLS 171
++ ++ N IRK F + N F FK+I SF D GP V+ A PGML +GLS
Sbjct: 286 VFQTYVNMMNDSIRKKFRDSKTNPFIFKNISYLKNIDSFQD-FGPSVMLAAPGMLQNGLS 344
Query: 172 LIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDV--KMAIEYMSFSA 229
+ +KW P NM+++ G+ V+G++ ++ + + N +++ + +E +SF+A
Sbjct: 345 RDLLEKWCPEPKNMVLITGYSVEGSMAKYLMLEPENIPSVNNPEVNIPRRCQVEEISFAA 404
Query: 230 HADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF 269
H D + + I+ N++LVHGE++ M LK + F
Sbjct: 405 HVDFQENIDFIEQIRASNIILVHGESNPMGRLKSALLSNF 444
>gi|50304897|ref|XP_452404.1| hypothetical protein [Kluyveromyces lactis NRRL Y-1140]
gi|74636942|sp|Q6CUI5.1|YSH1_KLULA RecName: Full=Endoribonuclease YSH1; AltName: Full=mRNA
3'-end-processing protein YSH1
gi|49641537|emb|CAH01255.1| KLLA0C04598p [Kluyveromyces lactis]
Length = 764
Score = 191 bits (485), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 100/282 (35%), Positives = 168/282 (59%), Gaps = 14/282 (4%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+++ +++TGDY+ DRHL +A + D++I EST+ T + ++ RER
Sbjct: 173 MFQIEIAGVRVLFTGDYSREVDRHLNSAEVPPQSSDVIIVESTFGTATHEPRQNRERKLT 232
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW-----ERMNLQAPIYFAVGLTEKATN 115
+ +H V +GG+VL+PVFALGRAQE+ ++L+ YW E N Q PI++A L +K +
Sbjct: 233 QLIHTVVSKGGRVLLPVFALGRAQEIMLILDEYWQNHKEELGNGQVPIFYASNLAKKCMS 292
Query: 116 YYKMFITWTNQKIRKTF--VQRNMFDFKHI---RPFDKSFIDNPGPMVVFATPGMLHSGL 170
++ ++ N IRK F Q N F FK+I + D+ ++ GP V+ A+PGML +GL
Sbjct: 293 VFQTYVNMMNDDIRKKFKDSQTNPFIFKNISYLKNLDE--FEDFGPSVMLASPGMLQNGL 350
Query: 171 SLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKL-EFENKQI-IDVKMAIEYMSFS 228
S I +KW P E N++++ G+ V+GT+ +L + + N +I I + ++ ++F+
Sbjct: 351 SRDILEKWCPEEKNLVLVTGYSVEGTMAKYLLLEPEAIPSVHNPEITIPRRCQVDEITFA 410
Query: 229 AHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFN 270
AH D + ++ I+ N++LVHGE++ M LK + F+
Sbjct: 411 AHVDFRENLEFIELIGASNIILVHGESNPMGRLKSALLSNFS 452
>gi|159488791|ref|XP_001702386.1| subunit of mRNA cleavage and polyadenylation specificity factor
[Chlamydomonas reinhardtii]
gi|158271180|gb|EDO97006.1| subunit of mRNA cleavage and polyadenylation specificity factor
[Chlamydomonas reinhardtii]
Length = 690
Score = 191 bits (485), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 103/272 (37%), Positives = 151/272 (55%), Gaps = 6/272 (2%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V+V +YTGDY+ PDRHL AA I +P ++I ESTY T+ + RE+ L
Sbjct: 151 MFLVEVAGCRCLYTGDYSRLPDRHLPAADIPPVKPHIVIVESTYGTSRHLPRLQREQLLL 210
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQ---APIYFAVGLTEKATNYY 117
+ ++RGG+V++PV ALGRAQEL +LL+ YWE + PIY A + KA Y
Sbjct: 211 DTIRNTINRGGRVIMPVVALGRAQELLLLLDEYWEAHKSELSGIPIYQASSMMSKALGVY 270
Query: 118 KMFITWTNQKIRKTFVQRNMFDFKHIRPFDKS--FI-DNPGPMVVFATPGMLHSGLSLII 174
+ ++ N I++ F +RN F F+H++ FI D GP V+ ATP L SG S
Sbjct: 271 QTYVESLNDDIKRVFHERNPFKFRHVQTLKNPAHFISDYSGPCVIMATPSGLQSGASRDF 330
Query: 175 FKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAK 234
F+ W N I+ F VQGT+ ++L G + + + +++A+ +SFSAHAD
Sbjct: 331 FEAWCEDSRNTCIICDFAVQGTLAKEILGGPSSITTREGRRVPLRIAVHNISFSAHADYD 390
Query: 235 GIMQLIQYCEPKNVLLVHGEASKMVFLKEKIK 266
+ P +V+LVHGE +M L + +K
Sbjct: 391 QTSGFLDAVRPPHVVLVHGEYGEMRKLAKALK 422
>gi|302679538|ref|XP_003029451.1| hypothetical protein SCHCODRAFT_59058 [Schizophyllum commune H4-8]
gi|300103141|gb|EFI94548.1| hypothetical protein SCHCODRAFT_59058 [Schizophyllum commune H4-8]
Length = 786
Score = 191 bits (484), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 120/362 (33%), Positives = 183/362 (50%), Gaps = 35/362 (9%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF + + I+YTGDY+ DRHL A + RPD+LI ESTY + + +E F
Sbjct: 162 MFLIDMAGLRILYTGDYSREEDRHLVKAELPPIRPDVLIVESTYGVQSHEPRDEKELRFT 221
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH + RGG VL+P FALGRAQEL ++L+ YW++ PIY+A GL K+ Y+
Sbjct: 222 NLVHSIIRRGGHVLLPQFALGRAQELLLILDEYWKKHPDLHNVPIYYASGLARKSMAVYQ 281
Query: 119 MFITWTNQKIRKTFVQR-NMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
+I N IR F +R N F FK + P P VV ATPG + +G S +F+
Sbjct: 282 TYIHTMNSNIRSRFAKRDNPFVFK------CKIAEGP-PCVVLATPGFMQTGSSRELFEL 334
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
WAP N LI+ G+ V+GT+ +++ ++ + +I K++++Y+SFSAH D
Sbjct: 335 WAPDSRNGLIVTGYSVEGTLARDIMTEPEEFQSVKGHMIQRKISVDYISFSAHVDYTQNA 394
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVN 297
+ I+ ++V+LVHGE + M L+ + + + D + I N
Sbjct: 395 EFIETVRAQHVVLVHGEQTAMGRLRAAMSSRYK--------------ERDEDVKIHTPRN 440
Query: 298 LLKEE----------AVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRH 347
L E A+ ++ P P + G+LV KD S +L+D + G+S
Sbjct: 441 LETLELSFRGERVAKAIGTLADKP-PQSSDVLSGLLVAKDYSYTLLDPRDLKDFAGLSTC 499
Query: 348 IV 349
+V
Sbjct: 500 VV 501
>gi|189234823|ref|XP_001809803.1| PREDICTED: similar to AGAP010960-PA [Tribolium castaneum]
gi|270002251|gb|EEZ98698.1| hypothetical protein TcasGA2_TC001236 [Tribolium castaneum]
Length = 451
Score = 191 bits (484), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 128/393 (32%), Positives = 209/393 (53%), Gaps = 63/393 (16%)
Query: 639 ITVDELFSKLQARKCGLVVESAKVSQLMSEKKSYEVQVMSQVVAAVTNSCDSSHIIDLGG 698
+ +D KLQ+ C +++ K+ M+ KKS+EV+++S V AA+ +SH++DLG
Sbjct: 91 LNLDSFTQKLQSWGCD-TLDTFKLEIFMNAKKSHEVEILSAVAAALFRVSQASHVVDLGD 149
Query: 699 GQGYLSTILALQHGKKTLSLDYNQVNTHGAAVRSKKLEDS-EKESKGPFQSYAGVINKKL 757
G+GYLS++LALQH + +D + NT GA R+ KL K P +K L
Sbjct: 150 GKGYLSSMLALQHQIPVVGIDASNTNTCGAIKRATKLSKVWNGIPKAP--------HKSL 201
Query: 758 WMRTQNRQTSHASVEHHGKNWKRKSKAPVVSIKDEELVVCKDKCKQITHFVTPDSDISSI 817
+T+N + H + KQ+T FV D+ +
Sbjct: 202 PKKTENFASPHVELY-----------------------------KQVTRFVDERFDLLGL 232
Query: 818 LSQAYPQDSLHNVCIMGLHTCGDLSGTALRLFTKS-SLQCLVQVGCCYHLLEEEFIRSPF 876
+ +P S ++ ++GLHTCGDL+ ++L++F+++ +++ + VGCCYHLL+E
Sbjct: 233 VRDVFPNVS--HLGLVGLHTCGDLAASSLKIFSRNEAVKSVCNVGCCYHLLDES------ 284
Query: 877 WKDVDQSLYEHGYGFPLSEHLRSRKFFLGRNVRMSGTQSPERVIDLKQTQTLPLFYRALL 936
GFPLS L R F LGR+ RM QS ERV+ + + +FYRA+L
Sbjct: 285 -------------GFPLSRFLTDRGFVLGRSARMIANQSVERVLQEGELPNITIFYRAIL 331
Query: 937 EKYLRSKITINDEEPKVVGRLATKCSNFVEYVHRAVDKLKLDLEVDDEEVTRLFNSHQRE 996
+ L T D K VG+ NF++YV A+ ++ + L++ D EV +F+ +++
Sbjct: 332 QVLLEEFCT--DLPTKHVGKFRKVPVNFLDYVRLALKRIDVTLDLTDNEVGAIFSRYEKR 389
Query: 997 YEYLQIYYFLKTALAPVIEALIVLDRVLYLREQ 1029
L ++Y L+ L+PV+E+LI+LDR+L+L+EQ
Sbjct: 390 LNELNVFYLLRCKLSPVVESLILLDRLLFLQEQ 422
Score = 63.5 bits (153), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 40/152 (26%), Positives = 76/152 (50%), Gaps = 9/152 (5%)
Query: 485 IKNTIDEMIEFITPNIACVNTHTVHFMVEKLWETTIPKAILSQVNNSPSFDSLLEEFWKS 544
I+ TID ++ F+ P + N H V F + ++ + I +++ +++ F
Sbjct: 6 IRQTIDSLLAFLEPLLGFANCHMVDFYTQNSYKKFVSPEIQNEIEQIGYENTIKRIFL-- 63
Query: 545 RQDNHVETNSELVKFFQAADKFRLTSLMNGESIITVDELFSKLQARKCGQVVESAKVSQL 604
N + L +F + + KF +L N + +D KLQ+ C +++ K+
Sbjct: 64 ---NEFDATPHLKQFVEDSSKF---TLKNCHVCLNLDSFTQKLQSWGC-DTLDTFKLEIF 116
Query: 605 MSEKKSYEVQVMSQVVAAVTNSCDSSHIIDLG 636
M+ KKS+EV+++S V AA+ +SH++DLG
Sbjct: 117 MNAKKSHEVEILSAVAAALFRVSQASHVVDLG 148
>gi|406601461|emb|CCH46911.1| hypothetical protein BN7_6516 [Wickerhamomyces ciferrii]
Length = 679
Score = 191 bits (484), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 120/402 (29%), Positives = 219/402 (54%), Gaps = 22/402 (5%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++G +++TGDY+ +RHL A + +PD+++TEST+ T + + +E
Sbjct: 129 MFFIEIGGLKLLFTGDYSREENRHLNPAEVPPTKPDVMVTESTFGTATHEPRLEKEVRLT 188
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW-ERMNLQ-APIYFAVGLTEKATNYYK 118
+H + +GG+VL+PVFALG AQEL ++L+ YW + +L+ +Y+A L +K ++
Sbjct: 189 NLIHSTLIKGGRVLLPVFALGTAQELLLILDEYWSQHQDLENVNVYYASSLAKKCLAVFQ 248
Query: 119 MFITWTNQKIRKTFVQRNM--FDFKH---IRPFDKSFIDNPGPMVVFATPGMLHSGLSLI 173
+I N IRK F +N F FK+ I+ DK D+ GP VV A+PGML +G+S
Sbjct: 249 TYINMMNDNIRKQFRDQNSNPFQFKYIKNIKNLDK--FDDFGPCVVVASPGMLQNGVSRE 306
Query: 174 IFKKWAPVENNMLIMPGFCVQGTIGHKVLS---GVKKLEFENKQIIDVKMAIEYMSFSAH 230
+ ++WAP N +I+ G+ V+GT+ +L+ + L+ + + ++ IE +SF+AH
Sbjct: 307 LLERWAPDSRNSVILTGYSVEGTLAKTLLTEPDHIPSLQNPDNN-VPRRLNIEEISFAAH 365
Query: 231 ADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKI 290
D + + I+ +PK+++LVHGE++ M LK + +++ F E
Sbjct: 366 VDYEQNSKFIELVDPKSIILVHGESNPMGRLKSALLSKYS--KFKGTENEVRVYNPRNCD 423
Query: 291 SIDVSVNLLKEEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVR 350
+D+ + LK A + S K + GVLV KD ++ M VD+ + G++ IV+
Sbjct: 424 ELDLKLKGLK-IAKAFGSIVAEKDKADTLSGVLVSKDFDLNFMKVDDLREYAGLTSTIVK 482
Query: 351 FTSNVQISDSSPINKTLELIYDQLLSYLQ---DKSQEYKIQL 389
+++ + ++ Q+ Y++ D +E+++++
Sbjct: 483 ERQTIRVDAGRDL---VQWHLSQMFGYVEILIDDKEEFELRV 521
>gi|195056661|ref|XP_001995139.1| GH22793 [Drosophila grimshawi]
gi|193899345|gb|EDV98211.1| GH22793 [Drosophila grimshawi]
Length = 497
Score = 190 bits (483), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 148/547 (27%), Positives = 255/547 (46%), Gaps = 114/547 (20%)
Query: 485 IKNTIDEMIEFITPNIACVNTHTVHFMVEKLWETTIPKAILSQVNNSPSFDSLLEE-FWK 543
++ +ID ++E++ P+ VN H V+++ + W +P A+ +++ + + +E+ FW
Sbjct: 8 LQRSIDRLLEYLDPHWKFVNCHMVNYLTDNHWLQFVPDAVRAELRSDIDINLAIEKLFW- 66
Query: 544 SRQDNHVETNSE----LVKFFQAADKFRLTSLMNGESIITVDELFSKLQARKCGQVVESA 599
H E +S+ L KF AA++ RL KC
Sbjct: 67 -----HTECDSQQFPALSKFLAAAERQRL----------------------KC------- 92
Query: 600 KVSQLMSEKKSYEVQVMSQVVAAVTNSCDSSHIIDLGSTITVDELFSKLQARKCGLVVES 659
S+L++ V+ + +++AA T++ +
Sbjct: 93 -CSELLTT-----VEQVEELLAASTDT-------------------------------QQ 115
Query: 660 AKVSQLMSEKKSYEVQVMSQVVAAVTNSCDSS---HIIDLGGGQGYLSTILALQHGKKTL 716
+ + MS KK +EV++ + +V + S ++ +I+D G G+GYLS+ LALQ G + L
Sbjct: 116 LSIREFMSAKKCHEVELTAALVNKLVQSSSAAANCYIVDAGDGKGYLSSRLALQFGHRVL 175
Query: 717 SLDYNQVNTHGAAVRSKKLEDSEKESKGPFQSYAGVINKKLWMRTQNRQTSHASVEHHGK 776
+D N NT A R++KL+ R N T A +E G
Sbjct: 176 GIDANISNTENALTRNRKLQ-----------------------RAWNGLTERADLESQGI 212
Query: 777 NWKRKSKAPVVSIKDEELVVCKDKCKQITHFVTPDSDISSILSQAYPQDSLHN----VCI 832
R+ K + + D K + F+T + +++++ + + Q +C+
Sbjct: 213 KPARRGKKQKSEVDKSIITPPPDNYKTTSRFITTELKLTALIEEHFEQAQAARNSPQICL 272
Query: 833 MGLHTCGDLSGTALRLF-TKSSLQCLVQVGCCYHLLEEEFIRSPFWKDVDQSLYEHGYGF 891
GLHTCG+L+ T L++F + + L VGCCYHLL E + + F+ + + YGF
Sbjct: 273 TGLHTCGNLAATCLQVFHAQQECRILCNVGCCYHLLRERYSQHEFFGNKALMEQQTDYGF 332
Query: 892 PLSEHLRSRKFFLGRNVRMSGTQSPERVIDLKQTQTLPLFYRALLEKYL-RSKITINDEE 950
PLS +L+SR+ LGRN RM QS ER I K+ + L+YR+LLE + R + +E
Sbjct: 333 PLSNYLQSRQLRLGRNARMLAAQSMERTIAAKELPNITLYYRSLLELLVCRHAPHLKNEL 392
Query: 951 PKVVGRLATKCSNFVEYVHRAVDKLKLD--LEVDDEEVTRLFNSHQREYEYLQIYYFLKT 1008
VG++ K + FVEYV + DK+ VD+EE+ L ++ + YL ++Y L+
Sbjct: 393 Q--VGKV-RKFNGFVEYVQKCADKISAPWLAAVDEEELQLLEQQYELDRHYLSLFYLLRM 449
Query: 1009 ALAPVIE 1015
+ PV+E
Sbjct: 450 SFGPVLE 456
>gi|374110195|gb|AEY99100.1| FAGR279Cp [Ashbya gossypii FDAG1]
Length = 771
Score = 190 bits (483), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 100/285 (35%), Positives = 165/285 (57%), Gaps = 20/285 (7%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQV++ I++TGDY+ DRHL +A I D+LI EST+ T + + +E+
Sbjct: 170 MFQVEIAGLRILFTGDYSRELDRHLNSAEIPTLPSDILIVESTFGTATHEPRTSKEKKLT 229
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERM-----NLQAPIYFAVGLTEKATN 115
+ +H V +GG+VL+PVFALGRAQE+ ++L+ YW + N Q PI++A L K +
Sbjct: 230 QLIHTTVSKGGRVLLPVFALGRAQEIMLILDEYWSQHAEQLGNGQVPIFYASNLARKCMS 289
Query: 116 YYKMFITWTNQKIRKTF--VQRNMFDFKHIRPFDKSFIDNP------GPMVVFATPGMLH 167
++ ++ N KIRK F Q N F FK+I S++ N GP V+ A+PGML
Sbjct: 290 VFQTYVNMMNDKIRKKFRDSQTNPFIFKNI-----SYLKNLDEFQDFGPSVMLASPGMLQ 344
Query: 168 SGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDV--KMAIEYM 225
+GLS + +KW P E N++++ G+ V+GT+ ++ + + N + + + +E +
Sbjct: 345 NGLSRDLLEKWCPDEKNLVLITGYSVEGTMAKFLMLEPETIPSINNSDVSIPRRCQVEEI 404
Query: 226 SFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFN 270
SF+AH D + ++ ++ N++LVHGE++ M LK + F+
Sbjct: 405 SFAAHVDFRENLEFVEKIGAPNIILVHGESNPMGRLKSALLSNFS 449
>gi|302309512|ref|NP_986945.2| AGR279Cp [Ashbya gossypii ATCC 10895]
gi|442570103|sp|Q74ZC0.2|YSH1_ASHGO RecName: Full=Endoribonuclease YSH1; AltName: Full=mRNA
3'-end-processing protein YSH1
gi|299788393|gb|AAS54769.2| AGR279Cp [Ashbya gossypii ATCC 10895]
Length = 771
Score = 190 bits (483), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 100/285 (35%), Positives = 165/285 (57%), Gaps = 20/285 (7%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQV++ I++TGDY+ DRHL +A I D+LI EST+ T + + +E+
Sbjct: 170 MFQVEIAGLRILFTGDYSRELDRHLNSAEIPTLPSDILIVESTFGTATHEPRTSKEKKLT 229
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERM-----NLQAPIYFAVGLTEKATN 115
+ +H V +GG+VL+PVFALGRAQE+ ++L+ YW + N Q PI++A L K +
Sbjct: 230 QLIHTTVSKGGRVLLPVFALGRAQEIMLILDEYWSQHAEQLGNGQVPIFYASNLARKCMS 289
Query: 116 YYKMFITWTNQKIRKTF--VQRNMFDFKHIRPFDKSFIDNP------GPMVVFATPGMLH 167
++ ++ N KIRK F Q N F FK+I S++ N GP V+ A+PGML
Sbjct: 290 VFQTYVNMMNDKIRKKFRDSQTNPFIFKNI-----SYLKNLDEFQDFGPSVMLASPGMLQ 344
Query: 168 SGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDV--KMAIEYM 225
+GLS + +KW P E N++++ G+ V+GT+ ++ + + N + + + +E +
Sbjct: 345 NGLSRDLLEKWCPDEKNLVLITGYSVEGTMAKFLMLEPETIPSINNSDVSIPRRCQVEEI 404
Query: 226 SFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFN 270
SF+AH D + ++ ++ N++LVHGE++ M LK + F+
Sbjct: 405 SFAAHVDFRENLEFVEKIGAPNIILVHGESNPMGRLKSALLSNFS 449
>gi|237839761|ref|XP_002369178.1| cleavage and polyadenylation specificity factor, putative
[Toxoplasma gondii ME49]
gi|211966842|gb|EEB02038.1| cleavage and polyadenylation specificity factor, putative
[Toxoplasma gondii ME49]
Length = 1100
Score = 189 bits (481), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 105/285 (36%), Positives = 161/285 (56%), Gaps = 10/285 (3%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++G ++YTGD++ DRH+ A + LLI ESTY + D ++ RER FL
Sbjct: 285 MFLIEIGGVRMLYTGDFSRESDRHVPIAEVPPVDVQLLICESTYGIHVHDDRQLRERRFL 344
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
K V + V+RGGK L+PVFALGRAQEL ++LE YW PI F L+ K +
Sbjct: 345 KAVVDIVNRGGKCLLPVFALGRAQELLLILEEYWTAHPEIRHVPILFLSPLSSKCAVVFD 404
Query: 119 MFITWTNQKIR-KTFVQRNMFDFKHIRPFD-----KSFIDNPGPMVVFATPGMLHSGLSL 172
F+ + +R + N F F+ ++ + +I + GP VV A PGML SG S
Sbjct: 405 AFVDMCGEAVRSRALRGENPFAFRFVKNVKSVEAARVYIHHDGPAVVMAAPGMLQSGASR 464
Query: 173 IIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHAD 232
IF+ WAP N +I+ G+ V+GT+ ++ + ++ ++ ++ + + E +SFSAH+D
Sbjct: 465 EIFEAWAPDAKNGVILTGYSVKGTLADELKREPETIQLPDR-VLRRRCSFEMISFSAHSD 523
Query: 233 AKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF-NLDCFMP 276
+ + I + NV+LVHGE +M LKEK+++E L F P
Sbjct: 524 YQQTQEFIGKLKVPNVVLVHGERGEMRRLKEKLEEERPALSVFTP 568
>gi|221484558|gb|EEE22852.1| cleavage and polyadenylation specificity factor, putative
[Toxoplasma gondii GT1]
Length = 1100
Score = 189 bits (480), Expect = 8e-45, Method: Compositional matrix adjust.
Identities = 124/395 (31%), Positives = 198/395 (50%), Gaps = 35/395 (8%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++G ++YTGD++ DRH+ A + LLI ESTY + D ++ RER FL
Sbjct: 285 MFLIEIGGVRMLYTGDFSRERDRHVPIAEVPPVDVQLLICESTYGIHVHDDRQLRERRFL 344
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
K V + V+RGGK L+PVFALGRAQEL ++LE YW PI F L+ K +
Sbjct: 345 KAVVDIVNRGGKCLLPVFALGRAQELLLILEEYWTAHPEIRHVPILFLSPLSSKCAVVFD 404
Query: 119 MFITWTNQKIR-KTFVQRNMFDFKHIRPFD-----KSFIDNPGPMVVFATPGMLHSGLSL 172
F+ + +R + N F F+ ++ + +I + GP VV A PGML SG S
Sbjct: 405 AFVDMCGEAVRSRALRGENPFAFRFVKNVKSVEAARVYIHHDGPAVVMAAPGMLQSGASR 464
Query: 173 IIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHAD 232
IF+ WAP N +I+ G+ V+GT+ ++ + ++ ++ ++ + + E +SFSAH+D
Sbjct: 465 EIFEAWAPDAKNGVILTGYSVKGTLADELKREPETIQLPDR-VLRRRCSFEMISFSAHSD 523
Query: 233 AKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF-NLDCFMPA---------NGESC 282
+ + I + NV+LVHGE +M LKEK+++E L F P C
Sbjct: 524 YQQTQEFIGKLKVPNVVLVHGERGEMRRLKEKLEEERPALSVFTPEILQKVSLQFTPSRC 583
Query: 283 FVQTDMKISIDVSVNLLKEEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEV 342
V K++ D+ K +AV +R+ G V S+ + D+ ++
Sbjct: 584 VVAAG-KLADDIQRLAEKSDAVS--------TADRESAGGYVKAQSAQRCGEEDDGEEKK 634
Query: 343 GISRHIVRFTSNVQ-------ISDSSPINKTLELI 370
G R F+S+ ++ S + LE++
Sbjct: 635 GNERETTSFSSDASRSQGQAAVAGESGEGRALEIV 669
>gi|440298403|gb|ELP91039.1| Cleavage and polyadenylation specificity factor subunit, putative
[Entamoeba invadens IP1]
Length = 788
Score = 189 bits (480), Expect = 8e-45, Method: Compositional matrix adjust.
Identities = 104/284 (36%), Positives = 169/284 (59%), Gaps = 11/284 (3%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ I+YTGD++ DRH+ +A + D++I ESTY ++ + RE F+
Sbjct: 171 MFCLEINGVKILYTGDFSGESDRHMHSAEVPPFEIDVMICESTYGIMDQEPRVDRENRFV 230
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWE-RMNLQA-PIYFAVGLTEKATNYYK 118
K++ E + RGGK LIPVF+LGRAQE ++LE YW+ L A I+F + +K Y++
Sbjct: 231 KQIVEILKRGGKCLIPVFSLGRAQEFELILEEYWQSHKELWAYSIFFFSSIAKKCMTYFE 290
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIR----PFDKSFIDNPGPMVVFATPGMLHSGLSLII 174
+ ++ NQ++RK +R F+FK IR D S IDN P VV A+PGML G S +
Sbjct: 291 KYTSFMNQELRKR--KRQAFNFKFIRDGSSSVDDSTIDN-HPCVVLASPGMLQDGFSRTL 347
Query: 175 FKKWAPVENNMLIMPGFCVQGTIGHKVLS-GVKKLEFENKQIIDVKMAIEYMSFSAHADA 233
F++W +NN +I+PG+CV+GT+ ++++ K ++I K ++ +SF AH+D
Sbjct: 348 FERWCTDKNNGVIIPGYCVEGTLAKQIINDSSKPFTSTTGEVITPKCSVVEISFCAHSDH 407
Query: 234 KGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF-NLDCFMP 276
I +PK+++L+HGE M L +K+ + +L+ +MP
Sbjct: 408 VHTTNFIGAVKPKHLVLIHGERKSMEQLYNSLKKTYPDLNIYMP 451
>gi|221504752|gb|EEE30417.1| cleavage and polyadenylation specificity factor, putative
[Toxoplasma gondii VEG]
Length = 1100
Score = 189 bits (480), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 105/285 (36%), Positives = 161/285 (56%), Gaps = 10/285 (3%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++G ++YTGD++ DRH+ A + LLI ESTY + D ++ RER FL
Sbjct: 285 MFLIEIGGVRMLYTGDFSRERDRHVPIAEVPPVDVQLLICESTYGIHVHDDRQLRERRFL 344
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
K V + V+RGGK L+PVFALGRAQEL ++LE YW PI F L+ K +
Sbjct: 345 KAVVDIVNRGGKCLLPVFALGRAQELLLILEEYWTAHPEIRHVPILFLSPLSSKCAVVFD 404
Query: 119 MFITWTNQKIR-KTFVQRNMFDFKHIRPFD-----KSFIDNPGPMVVFATPGMLHSGLSL 172
F+ + +R + N F F+ ++ + +I + GP VV A PGML SG S
Sbjct: 405 AFVDMCGEAVRSRALRGENPFAFRFVKNVKSVEAARVYIHHDGPAVVMAAPGMLQSGASR 464
Query: 173 IIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHAD 232
IF+ WAP N +I+ G+ V+GT+ ++ + ++ ++ ++ + + E +SFSAH+D
Sbjct: 465 EIFEAWAPDAKNGVILTGYSVKGTLADELKREPETIQLPDR-VLRRRCSFEMISFSAHSD 523
Query: 233 AKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF-NLDCFMP 276
+ + I + NV+LVHGE +M LKEK+++E L F P
Sbjct: 524 YQQTQEFIGKLKVPNVVLVHGERGEMRRLKEKLEEERPALSVFTP 568
>gi|430813249|emb|CCJ29377.1| unnamed protein product [Pneumocystis jirovecii]
Length = 574
Score = 189 bits (479), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 131/452 (28%), Positives = 226/452 (50%), Gaps = 44/452 (9%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF ++V I++TGDY+ DRHL A + +PD+LITESTY T +E
Sbjct: 56 MFFIEVAGIKILFTGDYSREEDRHLIPAEVPPIQPDILITESTYGTASHQPISEKESRLT 115
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERM-NLQA-PIYFAVGLTEKATNYYK 118
+H + RGG+VLIPVFALGR QEL ++++ YW L + P+Y+A L +K Y+
Sbjct: 116 SIIHSIIRRGGRVLIPVFALGRTQELMLIIDEYWHNHPELHSIPVYYACSLAKKCMTVYQ 175
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHI---RPFDKSFIDNPGPMVVFATPGMLHSGLSLIIF 175
K F +RN F F++I + D+ ++ GP V+ A+PGML SG+S +
Sbjct: 176 T----------KIFEERNPFIFRYISSLKSLDR--FEDIGPCVMLASPGMLQSGVSRALL 223
Query: 176 KKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKG 235
+KW P N LI+ G+CV+GT+ +L+ ++ + Q I +M +E +SF AH D
Sbjct: 224 EKWCPDPKNGLIVAGYCVEGTMAKHILNEPSEIISLSGQKIPRRMTVEEISFEAHVDYIQ 283
Query: 236 IMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFN-----LDCFMPANGESC-FVQTDMK 289
Q I +++LVHGE + M LK + +++ + + P N E+ K
Sbjct: 284 NSQFIDLINANHIILVHGEQNNMGRLKSALLSKYSHRKNEVHIYNPKNCETLKLTFKGDK 343
Query: 290 ISIDVSVNLLKEEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIV 349
I+ + + + +E P P I G++V KD IS+M ++ + G++ ++
Sbjct: 344 IA----------KTIGHIAETP-PENNHVISGIMVQKDFQISIMSSEDLKEFSGLTTSVI 392
Query: 350 RFTSNVQISDSSPINKTLELIYDQLLSYLQDKSQEYKIQLTESSLSIDSVLLSVERIDDK 409
+ + S I L +Q+ ++D+ E +L ++S+++++ +
Sbjct: 393 MQRQTLSFNASINI---LHYHLEQMFGAIEDQIDENNNRL----FKVNSLVMNIVSVRCL 445
Query: 410 NKRVF-VTWPNQ--DEEVGKIVLHVLKSMSNT 438
K + + WP ++ + VL ++ + N
Sbjct: 446 QKNIISLEWPGNALNDTIADSVLAIILAAENN 477
>gi|158287844|ref|XP_309746.4| AGAP010960-PA [Anopheles gambiae str. PEST]
gi|157019384|gb|EAA45304.4| AGAP010960-PA [Anopheles gambiae str. PEST]
Length = 510
Score = 189 bits (479), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 148/543 (27%), Positives = 253/543 (46%), Gaps = 91/543 (16%)
Query: 482 MELIKNTIDEMIEFITPNIACVNTHTVHFMVEKLWETTIPKAILSQVNNSPSFDSLLEEF 541
ME I+ +D ++ F+ PN+ VN H V ++ E+ W+ +P A+ S++ + E F
Sbjct: 10 MERIRQQVDTIVRFLQPNLPFVNCHMVDYLTEQHWKRFVPVAMQSELRTVSDYLQAKEIF 69
Query: 542 WK------SRQDNHVETNSELVKFFQAADKFRLTSLMNGESIITVDELFSKLQARKCGQV 595
W +++ + +F + +FRL + +T+DE L C +
Sbjct: 70 WGQFEPSFDGEEHDGGCFPAVREFIKNTRQFRLGGAEARGTALTLDEFMDALS--DCRRE 127
Query: 596 VESAKVSQLMSEKKSYEVQVMSQVVAAVTNSCDSSHIIDLGSTITVDELFSKLQARKCGL 655
K+++LM+ KK +EV+V + VVA++ N + A + G
Sbjct: 128 TR-LKMTELMNVKKCHEVEVAAAVVASLCNG---------------------MAATQPGT 165
Query: 656 VVESAKVSQLMSEKKSYEVQVMSQVVAAVTNSCDSSHIIDLGGGQGYLSTILALQHGKKT 715
+E V ID G G+GYLS+ +AL+HG K
Sbjct: 166 SLEDILV-------------------------------IDAGDGKGYLSSRIALEHGIKV 194
Query: 716 LSLDYNQVNTHGAAVRSKKLEDSEKESKGPFQSYAGVINKKLWMRTQNRQTSHASVEHHG 775
L +D N+ NT A R ++L+ K+ S G IN L+ + Q + + EH
Sbjct: 195 LGVDCNEENTSNAEKRRERLKVRMKD-----WSRKGKIN--LFSSVSSEQNTKSQDEHFT 247
Query: 776 KNWKRKSKAPVVSIKDEELVVCKDKCKQITHFVTPDSDISSILSQAYPQDSLHNVCIMGL 835
+R S + + T + ++++ + + +P C+ GL
Sbjct: 248 NLLQRGSLDTLY--------------RTATQLIDFNTNLIELAQEYFPGGHHSTFCLCGL 293
Query: 836 HTCGDLSGTALRLFTKS-SLQCLVQVGCCYHLLEEEFIRSPFWKDVDQSLYEHGYGFPLS 894
HTCG+L LRLF ++ +++ + VGCCYHL++E+F+ F+ S GYGFP+S
Sbjct: 294 HTCGNLGPNCLRLFHENPTIKGICNVGCCYHLMQEQFVVDEFYNPTKVS-DNPGYGFPMS 352
Query: 895 EHLRSRKFFLGRNVRMSGTQSPERVIDLKQTQTLPLFYRALLEKYLRSKITINDEEPKVV 954
++L+ + F+LGRN R +S ER ++ + L YRALL+ L + +++ V
Sbjct: 353 KYLQGKSFYLGRNARNLAAESIERACTNRENPSDKLGYRALLQVVL---LECGEKKSHQV 409
Query: 955 GRLATKCSNFVEYVHRAVDKLKLD--LEVDDEEVTRLFNSHQREYEYLQIYYFLKTALAP 1012
GRL KC FV+YV R+V +L L+ + + D+ + L E E L+++Y ++ AP
Sbjct: 410 GRL--KCDGFVDYVRRSVRRLALEQRVSITDDSLQELEERFSAELEQLKVFYLIRQQFAP 467
Query: 1013 VIE 1015
V+E
Sbjct: 468 VVE 470
>gi|403216468|emb|CCK70965.1| hypothetical protein KNAG_0F03030 [Kazachstania naganishii CBS
8797]
Length = 820
Score = 189 bits (479), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 101/285 (35%), Positives = 165/285 (57%), Gaps = 20/285 (7%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+++ I++TGDY+ DRHL +A + D+L+ EST+ T + + RE
Sbjct: 167 MFQIEIAGLKILFTGDYSREMDRHLNSAEVPPQSSDILVVESTFGTATHEPRLHRENKLT 226
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERM-----NLQAPIYFAVGLTEKATN 115
+ +H V RGG+VL+PVFALGRAQEL ++L+ YW++ + Q PI++A L K +
Sbjct: 227 QLIHTTVGRGGRVLMPVFALGRAQELMLILDEYWQKHSDELGSGQVPIFYASDLARKCMS 286
Query: 116 YYKMFITWTNQKIRKTF--VQRNMFDFKHIRPFDKSFIDNP------GPMVVFATPGMLH 167
++ ++ N IRK F Q N F FK+I S++ N GP V+ A+PGML
Sbjct: 287 VFQTYVNMMNDDIRKKFRDSQTNPFIFKNI-----SYLKNLEEFQDFGPSVMLASPGMLQ 341
Query: 168 SGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKL-EFENKQI-IDVKMAIEYM 225
SGLS + +KW P + N++++ G+ V+GT+ ++ + N +I I + +E +
Sbjct: 342 SGLSRDLLEKWCPEQKNLVLITGYSVEGTMAKYIMLEPDTIPAIGNPEINIPRRCQVEEI 401
Query: 226 SFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFN 270
SF+AH D + ++ I+ +N++LVHGE++ M LK + F+
Sbjct: 402 SFAAHVDFQENLEFIEKINAQNIILVHGESNPMGRLKSALLSNFS 446
>gi|321478997|gb|EFX89953.1| hypothetical protein DAPPUDRAFT_309622 [Daphnia pulex]
Length = 475
Score = 189 bits (479), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 131/384 (34%), Positives = 199/384 (51%), Gaps = 62/384 (16%)
Query: 664 QLMSEKKSYEVQVMSQVVAAVTNSCDSSHIIDLGGGQGYLSTILALQHGKKTLSLDYNQV 723
+ M+ KKS+EV + +A+ N DS I+DLG G+G L+++L+L HG +D
Sbjct: 121 KFMTVKKSHEVGALCDTIASFANYTDSKLIVDLGCGKGALASMLSLNHGLYVSGIDAAGF 180
Query: 724 NTHGAAVRSKKLEDSEKESKGPFQSYAGVINKKLWMRTQNRQTSHASVEHHGKNWKRKSK 783
+ H R L+ K +K K
Sbjct: 181 SAHAEEGRQSMLQ---------------------------------------KTFKSHVK 201
Query: 784 APVVSIKDEELVVCKDKCKQITHFVTPDSDISSILSQA--YPQDSLHNVCIMGLHTCGDL 841
++ +DEE V + ++ T +++ + DI ++ + + Q S + V ++GLHTCG+L
Sbjct: 202 KARLT-EDEEAVDLSVRFRRSTLYLSNNFDIRPLIEECSQHFQQSFNQVGLVGLHTCGNL 260
Query: 842 SGTALRLFTKS-SLQCLVQVGCCYHLLEEEFIRSPFWKDVDQSLYEHGYGFPLSEHLRSR 900
+ T+++LF S + L VGCCYHLL+E F +S ++Q GFPLS HL+S+
Sbjct: 261 ASTSVQLFVNSPESRFLCNVGCCYHLLDEFFDKSN--SGLEQ-------GFPLSSHLKSK 311
Query: 901 KFFLGRNVRMSGTQSPERVIDLKQTQTLPLFYRALLEKYLRSKITINDEEPKVVGRLATK 960
FFLGRN RM +Q ER KQ Q LFYR+LL+ L K D VGRL
Sbjct: 312 SFFLGRNARMVSSQPLERYATRKQMQPDVLFYRSLLQVILEEKFPSTDSVEFQVGRLRKP 371
Query: 961 CSNFVEYVHRAVDKLKLDLEVDDEEV----TRLFNSHQREYEYLQIYYFLKTALAPVIEA 1016
+NF +Y+ A KLKL LE+ DEE+ TR +N +R + Y ++ L+ LA ++E
Sbjct: 372 VNNFRDYLQWATKKLKLSLELTDEEIEAYLTR-YNGDKRRHLY--AFFQLRLLLASLVEY 428
Query: 1017 LIVLDRVLYLREQQKPQIISNLFL 1040
+I+LDR++YL EQ +++SN FL
Sbjct: 429 VILLDRLVYLTEQ---EVVSNSFL 449
Score = 42.7 bits (99), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 40/149 (26%), Positives = 69/149 (46%), Gaps = 19/149 (12%)
Query: 495 FITPNIACVNTHTVHFMVEKLWE-------TTIPKAILSQVNNSPSFDSLLEEFWKSRQD 547
F+ P + VNTH VHF ++++E + + +Q+ P+ D L KS+
Sbjct: 18 FLEPFLPIVNTHMVHFYTDQVFEKLNQDLQSDLMALTDTQIAELPN-DYL--NISKSKP- 73
Query: 548 NHVETNSELVKFFQAADKFRLTSLMNGESIITVDELFSKLQARKCGQVVESAKVSQLMSE 607
+H+ +N L++ + D L SL +D+L S + + + M+
Sbjct: 74 SHIASN--LLQLMEDIDGHNLDSLN------VLDDLPSVWKEMDIKPLTSLIHFDKFMTV 125
Query: 608 KKSYEVQVMSQVVAAVTNSCDSSHIIDLG 636
KKS+EV + +A+ N DS I+DLG
Sbjct: 126 KKSHEVGALCDTIASFANYTDSKLIVDLG 154
>gi|195381307|ref|XP_002049394.1| GJ20770 [Drosophila virilis]
gi|194144191|gb|EDW60587.1| GJ20770 [Drosophila virilis]
Length = 493
Score = 189 bits (479), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 125/384 (32%), Positives = 199/384 (51%), Gaps = 49/384 (12%)
Query: 662 VSQLMSEKKSYEVQ----VMSQVVAAVTNSCDSS-HIIDLGGGQGYLSTILALQHGKKTL 716
+ + MS KK +EV +++Q+V + T ++ +I+D G G+GYLS+ LALQ+G L
Sbjct: 114 IKEFMSAKKCHEVDQTAALVNQLVRSTTPEASATCYIVDAGDGKGYLSSRLALQYGHYVL 173
Query: 717 SLDYNQVNTHGAAVRSKKLEDSEKESKGPFQSYAGVINKKLWMRTQNRQTSHASVEHHGK 776
+D + NT A R++KLE R N T A ++ G
Sbjct: 174 GVDAKKSNTESALARNRKLE-----------------------RAWNGLTERAELQSQGI 210
Query: 777 NWKRKSKAPVVSIKDEELVVCKDKCKQITHFVTPDSDISSILSQAYPQD---SLHNVCIM 833
R+ K V ++K K + F+T + ++ ++ + + Q+ + +C+
Sbjct: 211 RPSRRGKKSATKETPVNRPV-QNKYKTTSRFITTELNLCDLVKEHFEQEPNVTKPRICLT 269
Query: 834 GLHTCGDLSGTALRLF-TKSSLQCLVQVGCCYHLLEEEFIRSPFWKDVDQSLYEHG--YG 890
GLHTCG+L+ T L++F + + + L VGCCYHLL E + + ++ +++L E YG
Sbjct: 270 GLHTCGNLAATCLQVFHAQDNCRVLCNVGCCYHLLRERYSQQESYE--NKALMEESTDYG 327
Query: 891 FPLSEHLRSRKFFLGRNVRMSGTQSPERVIDLKQTQTLPLFYRALLEKYLRSKITINDEE 950
FPLS +L+ R LGRN RM QS ER + K+ + L+YRALLE I I
Sbjct: 328 FPLSRYLQQRSVCLGRNARMLAAQSIERTLSSKELPNITLYYRALLE------ILICRRA 381
Query: 951 PKVVGRLAT----KCSNFVEYVHRAVDKLKLD--LEVDDEEVTRLFNSHQREYEYLQIYY 1004
P + +L K +NF EYV KL V + E+ L H ++ YL ++Y
Sbjct: 382 PHLKNKLKVGKVHKFNNFFEYVKICAKKLNEPWLAAVGEAELEALLQQHDVDWHYLSLFY 441
Query: 1005 FLKTALAPVIEALIVLDRVLYLRE 1028
L+ + APV+E++I+LDR+LYL+E
Sbjct: 442 LLRMSFAPVLESVILLDRLLYLKE 465
Score = 51.6 bits (122), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/51 (41%), Positives = 35/51 (68%)
Query: 1043 LTQVDDEEVTRLFDSHQREYEYLQIYYFLKTALAPVIEALIVLDRVLYLRE 1093
L V + E+ L H ++ YL ++Y L+ + APV+E++I+LDR+LYL+E
Sbjct: 415 LAAVGEAELEALLQQHDVDWHYLSLFYLLRMSFAPVLESVILLDRLLYLKE 465
>gi|367016955|ref|XP_003682976.1| hypothetical protein TDEL_0G03980 [Torulaspora delbrueckii]
gi|359750639|emb|CCE93765.1| hypothetical protein TDEL_0G03980 [Torulaspora delbrueckii]
Length = 775
Score = 188 bits (477), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 103/281 (36%), Positives = 160/281 (56%), Gaps = 14/281 (4%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+++ I++TGDY+ DRHL +A + D+ I EST+ T + + RER
Sbjct: 166 MFQIEIAGVRILFTGDYSRELDRHLNSAEVPTLPSDVHIVESTFGTATHEPRVNRERKLT 225
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW-----ERMNLQAPIYFAVGLTEKATN 115
+ +H V RGG+VL+PVFALGRAQE+ ++L+ YW E Q PIY+A L +K +
Sbjct: 226 QLIHSTVSRGGRVLLPVFALGRAQEIMLILDEYWTQHSDELGGGQVPIYYASNLAKKCMS 285
Query: 116 YYKMFITWTNQKIRKTF--VQRNMFDFKHI---RPFDKSFIDNPGPMVVFATPGMLHSGL 170
++ ++ N IRK F Q N F FK+I R D + GP V+ A+PGML SGL
Sbjct: 286 VFQTYVNMMNDDIRKKFRDSQTNPFVFKNISYLRNIDD--FQDFGPSVMLASPGMLQSGL 343
Query: 171 SLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFEN--KQIIDVKMAIEYMSFS 228
S + +KW P + N++++ G+ V+GT+ ++ + N + I + IE +SF+
Sbjct: 344 SRDVLEKWCPEDKNLVLITGYSVEGTMAKFLMLEPDTIPSINNPEATIPRRCQIEEISFA 403
Query: 229 AHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF 269
AH D + ++ I+ N++LVHGE++ M LK + F
Sbjct: 404 AHVDFRENLEFIEKISAPNIILVHGESNPMGRLKSALLSNF 444
>gi|389740019|gb|EIM81211.1| mRNA 3'-end-processing protein YSH1 [Stereum hirsutum FP-91666 SS1]
Length = 841
Score = 188 bits (477), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 118/363 (32%), Positives = 187/363 (51%), Gaps = 25/363 (6%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
M+ + + I+YTGDY+ DRHL A + RPD+LI ESTY +++ +E F
Sbjct: 162 MYLIDMAGIKILYTGDYSREEDRHLVKAEVPPVRPDVLIVESTYGVQSLEARDEKELRFT 221
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH + RGG VL+P FALGRAQEL ++L+ YW++ PIY+A L K Y+
Sbjct: 222 SLVHSIIRRGGHVLLPAFALGRAQELLLILDEYWKKHPDLHNVPIYYASSLARKCMAVYQ 281
Query: 119 MFITWTNQKIRKTFVQR-NMFDFKHI------RPFDKSFIDNPGPMVVFATPGMLHSGLS 171
+I N IR F +R N F FKHI +++ + P P VV A+PG + SG S
Sbjct: 282 TYIHTMNSNIRTRFAKRDNPFVFKHISNMPQSSGWERKIAEGP-PCVVLASPGFMQSGPS 340
Query: 172 LIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHA 231
+ + WAP N LI+ G+ V+GT+ ++++ ++ I K++++Y+SFSAH
Sbjct: 341 RQLLELWAPDSRNGLIVTGYSVEGTLAREIMTEPDEIISMKGATIQRKLSVDYISFSAHV 400
Query: 232 DAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIK-----QEFNLDCFMPANGESCFVQT 286
D + I+ + +++LVHGE + M L+ ++ +E ++ P N E+ + T
Sbjct: 401 DYSQNSEFIEMIKAPHIVLVHGEQTAMGRLRAALQARYKNREEDVKIHTPRNLETLNL-T 459
Query: 287 DMKISIDVSVNLLKEEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISR 346
+ ++ L K N I +LV KD S +L+D + G+S
Sbjct: 460 FRGERVAKAIGTLAARPPKAND---------TISALLVSKDFSYTLLDPRDLRDFAGLST 510
Query: 347 HIV 349
+V
Sbjct: 511 CVV 513
>gi|392297785|gb|EIW08884.1| Ysh1p [Saccharomyces cerevisiae CEN.PK113-7D]
Length = 772
Score = 188 bits (477), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 100/284 (35%), Positives = 163/284 (57%), Gaps = 20/284 (7%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+++ +++TGDY+ DRHL +A + ++LI EST+ T + + RER
Sbjct: 162 MFQIEIAGLRVLFTGDYSREVDRHLNSAEVPPLSSNVLIVESTFGTATHEPRLNRERKLT 221
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERM-----NLQAPIYFAVGLTEKATN 115
+ +H V RGG+VL+PVFALGRAQE+ ++L+ YW R Q PI++A L +K +
Sbjct: 222 QLIHSTVMRGGRVLLPVFALGRAQEIMLILDEYWSRHADELGGGQVPIFYASNLAKKCMS 281
Query: 116 YYKMFITWTNQKIRKTF--VQRNMFDFKHIRPFDKSFIDNP------GPMVVFATPGMLH 167
++ ++ N IRK F Q N F FK+I S++ N GP V+ A+PGML
Sbjct: 282 VFQTYVNMMNDDIRKKFRDSQTNPFIFKNI-----SYLRNLEDFQDFGPSVMLASPGMLQ 336
Query: 168 SGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKL-EFENKQI-IDVKMAIEYM 225
SGLS + ++W P + N++++ G+ ++GT+ ++ + N +I I + +E +
Sbjct: 337 SGLSRDLLERWCPEDKNLVLITGYSIEGTMAKFIMLEPDTIPSINNPEITIPRRCQVEEI 396
Query: 226 SFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF 269
SF+AH D + ++ I+ N++LVHGEA+ M LK + F
Sbjct: 397 SFAAHVDFQENLEFIEKISAPNIILVHGEANPMGRLKSALLSNF 440
>gi|123439147|ref|XP_001310348.1| RNA-metabolising metallo-beta-lactamase family protein [Trichomonas
vaginalis G3]
gi|121892114|gb|EAX97418.1| RNA-metabolising metallo-beta-lactamase family protein [Trichomonas
vaginalis G3]
Length = 679
Score = 188 bits (477), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 126/385 (32%), Positives = 192/385 (49%), Gaps = 48/385 (12%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKC------RPDLLITESTYATTIRDSKRC 54
M+ V++ ++YTGD+++ +RHL A I K RPD+LI EST+ +S+
Sbjct: 164 MWLVEIDGVKVLYTGDFSLENERHLQGAEIPKSLSGEIIRPDVLIMESTHGLARIESRVD 223
Query: 55 RERDFLKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNL--QAPIYFAVGLTEK 112
RE F+ V + + RGG+ LIP+FALGRAQEL I+L+ YWE PIY+ L ++
Sbjct: 224 REYRFIDNVTKIIKRGGRCLIPIFALGRAQELLIILDEYWESHPEYNGVPIYYGSNLAKQ 283
Query: 113 ATNYYKMFITWTNQKIRKTFVQRNMFDF---KHIRPFDKSFIDNPGPMVVFATPGMLHSG 169
A Y F N ++ + F+F K+IR +D D+ P VV +P ML +G
Sbjct: 284 AIAAYNAFYQDHNSRV---VTAKGKFEFSYVKYIRDYD---FDDSLPCVVLCSPAMLQNG 337
Query: 170 LSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSA 229
+S IF+ W N LI+PG+ V GT+ ++ ++ + +II K++I+Y+SFS
Sbjct: 338 MSRKIFEAWCSNSVNGLIIPGYIVDGTLPQVLMKNPAEITTLSGKIIPRKISIDYVSFSG 397
Query: 230 HADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMK 289
HAD + I +PK ++L+HG M+ LKEK+ Q F D
Sbjct: 398 HADFNQTSRFITELKPKRIVLIHGVCGLMMQLKEKLLQMFVED----------------- 440
Query: 290 ISIDVSVNLLKEEAVKYNSEPPNPL-------KERQIHGVLVIKDSSISLMDVDEACKEV 342
++V L E+A + P+ + + I G++V KD +M E
Sbjct: 441 -GLEVYTPGLCEKATMWFQSNPSAIITGGLSDNKDNISGIIVRKDGQNMIMSASE----- 494
Query: 343 GISRHIVRFTSNVQISDSSPINKTL 367
+S H T N +S PINK L
Sbjct: 495 -LSTHTTLKTLNATMSQEVPINKPL 518
>gi|302832928|ref|XP_002948028.1| hypothetical protein VOLCADRAFT_79885 [Volvox carteri f.
nagariensis]
gi|300266830|gb|EFJ51016.1| hypothetical protein VOLCADRAFT_79885 [Volvox carteri f.
nagariensis]
Length = 728
Score = 187 bits (476), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 101/272 (37%), Positives = 150/272 (55%), Gaps = 6/272 (2%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V+V +YTGDY+ PDRHL A P ++I ESTY T+ ++ RE+ +
Sbjct: 176 MFMVEVAGLRCLYTGDYSRLPDRHLPGADTPPVTPHIVIVESTYGTSRHLPRQQREQLLI 235
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQ---APIYFAVGLTEKATNYY 117
+ ++RGG+VL+P+ ALGRAQEL +LL+ YWE + PIY A + KA Y
Sbjct: 236 DNIRTTLNRGGRVLMPIVALGRAQELLLLLDEYWEAHKSELGGIPIYQASSMMSKALGVY 295
Query: 118 KMFITWTNQKIRKTFVQRNMFDFKHIRPFDKS--FI-DNPGPMVVFATPGMLHSGLSLII 174
+ ++ N+ I+K F RN F F+H++ FI D GP V+ ATP L SG S
Sbjct: 296 QTYVESLNEDIKKVFHDRNPFKFRHVQTLKNPAHFIADYSGPCVIMATPSGLQSGASRDF 355
Query: 175 FKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAK 234
F+ W N I+ F VQGT+ ++L G + + + +++A+ +SFSAHAD
Sbjct: 356 FEAWCEDARNTCIICDFAVQGTLAKEILGGPSSITTREGRRVPLRIAVHNISFSAHADFD 415
Query: 235 GIMQLIQYCEPKNVLLVHGEASKMVFLKEKIK 266
+ +P +V+LVHGE +M L + +K
Sbjct: 416 QTSGFLDTVKPPHVVLVHGEYGEMRKLAKALK 447
>gi|401837471|gb|EJT41396.1| YSH1-like protein [Saccharomyces kudriavzevii IFO 1802]
Length = 779
Score = 187 bits (476), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 99/284 (34%), Positives = 162/284 (57%), Gaps = 20/284 (7%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+++ +++TGDY+ DRHL +A + ++LI EST+ T + + RER
Sbjct: 169 MFQIEIAGLRVLFTGDYSREVDRHLNSAEVPPLSSNVLIVESTFGTATHEPRLNRERKLT 228
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW-----ERMNLQAPIYFAVGLTEKATN 115
+ +H V RGG+VL+PVFALGRAQE+ ++L+ YW E Q PI++A L +K +
Sbjct: 229 QLIHSTVMRGGRVLLPVFALGRAQEIMLILDEYWSQHADELGGGQVPIFYASNLAKKCMS 288
Query: 116 YYKMFITWTNQKIRKTF--VQRNMFDFKHIRPFDKSFIDNP------GPMVVFATPGMLH 167
++ ++ N IRK F Q N F FK+I S++ N GP V+ A+PGML
Sbjct: 289 VFQTYVNMMNDDIRKKFRDSQTNPFIFKNI-----SYLRNLEDFQDFGPSVMLASPGMLQ 343
Query: 168 SGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDV--KMAIEYM 225
SGLS + ++W P + N++++ G+ ++GT+ ++ + N I + + +E +
Sbjct: 344 SGLSRDLLERWCPEDKNLVLITGYSIEGTMAKFIMLEPDTIPSINNSEITIPRRCQVEEI 403
Query: 226 SFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF 269
SF+AH D + ++ I+ N++LVHGEA+ M LK + F
Sbjct: 404 SFAAHVDFQENLEFIEKISAPNIILVHGEANPMGRLKSALLSNF 447
>gi|296418744|ref|XP_002838985.1| hypothetical protein [Tuber melanosporum Mel28]
gi|295634979|emb|CAZ83176.1| unnamed protein product [Tuber melanosporum]
Length = 783
Score = 187 bits (476), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 120/389 (30%), Positives = 198/389 (50%), Gaps = 40/389 (10%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ I++TGDY+ DRHL +A + +PDLLI ESTY T + +E +
Sbjct: 175 MFFIEIAGLKILFTGDYSREDDRHLVSAEVPHQKPDLLICESTYGTATHMPRLEKEARLM 234
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNL--QAPIYFAVGLTEKATNYYK 118
K E ++RGG+VL+PVFALGRAQEL ++L+ YWE+ PIY+A L K + Y+
Sbjct: 235 KMTTEILNRGGRVLMPVFALGRAQELLLILDEYWEKHPAYQSYPIYYASNLARKCMDVYR 294
Query: 119 MFITWTNQKIRKTFVQ---RNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLII 174
+I N KI++ + RN +DF+ +R ++ G V+ A+PGML +G+S +
Sbjct: 295 TYINTMNDKIKRAMFEGEGRNPWDFRWVRSLKTIDRFEDVGGCVMLASPGMLQNGVSREL 354
Query: 175 FKKWAPVENNMLIMPGFCVQGTIGHKVL--------------SGVKKLEFENKQI-IDVK 219
++W P N L++ G+ V+GT+ +++ SG + Q+ + +
Sbjct: 355 LERWCPDPRNGLVITGYSVEGTMAKQIMNEPTEIPAVVTANRSGSTASRGVDGQVMVPRR 414
Query: 220 MAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKI-------KQEFNLD 272
++E +SF+AH D I+ K V+LVHGE + M LK + + + +
Sbjct: 415 CSVEELSFAAHVDYAQNSGFIEEVGAKVVILVHGEQNNMGRLKSALLSKNSDRRDKDKVK 474
Query: 273 CFMPANGESCFVQTDMKISIDVSVNLLKEEAVKYNSEPPN----PLKERQIHGVLVIKDS 328
+ P N E ++ I + + + K +PP LKE+ + GVLV D
Sbjct: 475 IYNPKNCE--------ELRIPFKADKIAKVVGKLAQKPPGTQTLALKEQLLTGVLVQNDF 526
Query: 329 SISLMDVDEACKEVGISRHIVRFTSNVQI 357
+SLM D+ + G++ +V + I
Sbjct: 527 KLSLMTPDDLREYAGLTTTVVSCRQKIPI 555
>gi|170093225|ref|XP_001877834.1| predicted protein [Laccaria bicolor S238N-H82]
gi|164647693|gb|EDR11937.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 772
Score = 187 bits (475), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 117/356 (32%), Positives = 181/356 (50%), Gaps = 30/356 (8%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF + + I+YTGDY+ DRHL A + RPD+LI ESTY + + +E+ F
Sbjct: 162 MFLIDIAGLKILYTGDYSREEDRHLVKAELPPVRPDVLIVESTYGVQSLEGREEKEQRFT 221
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH + RGG VL+P FALGRAQEL ++L+ YW++ PIY+A L K Y+
Sbjct: 222 NLVHSVIRRGGHVLLPAFALGRAQELLLILDEYWKKHPDLHNVPIYYASSLARKCMAVYQ 281
Query: 119 MFITWTNQKIRKTFVQR-NMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
+I N IR F +R N F FK K + P P VV A+PG + G S +F+
Sbjct: 282 TYIHTMNNNIRSRFAKRDNPFVFKC-----KKIAEGP-PCVVLASPGFMQVGPSRELFEL 335
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
WAP N LI+ G+ ++GT+ +++ ++ I K++++Y+SFSAH D
Sbjct: 336 WAPDARNGLIITGYSIEGTLARDIMTEPEEFTSLKGATIPRKISVDYVSFSAHVDYSQNS 395
Query: 238 QLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVN 297
+ I+ + ++V+LVHGE + M L+ + F D+KI ++
Sbjct: 396 EFIELVKAQHVVLVHGEQTAMGRLRGAMTSRFKER------------DEDVKIHTPRNLE 443
Query: 298 LLK--------EEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGIS 345
L+ +A+ ++ P P + G+LV KD S +L+D + G+S
Sbjct: 444 TLQLSFRGERVAKAIGTLADKP-PQTNDLLSGLLVAKDYSYTLLDPRDLRDFAGLS 498
>gi|195333644|ref|XP_002033497.1| GM21342 [Drosophila sechellia]
gi|194125467|gb|EDW47510.1| GM21342 [Drosophila sechellia]
Length = 494
Score = 187 bits (474), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 118/367 (32%), Positives = 187/367 (50%), Gaps = 43/367 (11%)
Query: 662 VSQLMSEKKSYEVQVMSQVV-AAVTNSCDSSHIIDLGGGQGYLSTILALQHGKKTLSLDY 720
+ + MS KK +EV++ + +V V NS I+D G G+GYLS+ LALQ+G + L +D
Sbjct: 115 IREFMSAKKCHEVELAAALVDQLVKNSGQECFIVDAGDGKGYLSSRLALQYGHRVLGIDA 174
Query: 721 NQVNTHGAAVRSKKLEDSEKESKGPFQSYAGVINKKLWMRTQNRQTSHASVEHHGKNWKR 780
N NT A R++KL+ R N T A ++ G KR
Sbjct: 175 NAANTENALNRNRKLQ-----------------------RAWNGLTERAQLQVQGITPKR 211
Query: 781 KSKAPVVSIKDEELVVCKDKCKQITHFVTPDSDISSILSQAYPQ---DSLHNVCIMGLHT 837
+ K + ++ K F+T + + ++L++ + Q D N+C+ GLHT
Sbjct: 212 RGKKSSARESSKSAPALENY-KTTAKFITTELNFGALLAEHFTQLGPDDSPNICLTGLHT 270
Query: 838 CGDLSGTALRLF-TKSSLQCLVQVGCCYHLLEEEFIRSPFWKDVDQSLYEHGYGFPLSEH 896
CG+L+ T L++F +S L VGCCYHLL E + + F+ + + YGFPLS++
Sbjct: 271 CGNLAATCLQVFHAQSDCLLLCNVGCCYHLLRERYSQQEFFGNKALMELQTDYGFPLSQY 330
Query: 897 LRSRKFFLGRNVRMSGTQSPERVIDLKQTQTLPLFYRALLEKYLRSKITINDEEPKV--- 953
LR R+ +GRN RM QS ER +D K+ + L+YRALLE + +N P++
Sbjct: 331 LRERQVRMGRNARMLAAQSIERTVDAKELPNVTLYYRALLE------VLVNRHAPQLKNE 384
Query: 954 --VGRLATKCSNFVEYVHRAVDKLKLDL--EVDDEEVTRLFNSHQREYEYLQIYYFLKTA 1009
VG++ K +F EY+ + KL V EE+ L + + +L ++Y L+ +
Sbjct: 385 LQVGKV-RKFESFEEYIQKCATKLDAPWLTSVKKEELQSLLQEYALDKHFLDLFYLLRMS 443
Query: 1010 LAPVIEA 1016
APV+E+
Sbjct: 444 FAPVLES 450
Score = 44.3 bits (103), Expect = 0.43, Method: Compositional matrix adjust.
Identities = 36/155 (23%), Positives = 73/155 (47%), Gaps = 15/155 (9%)
Query: 485 IKNTIDEMIEFITPNIACVNTHTVHFMVEKLWETTIPKAILSQVNNSPSFDSLLEE-FWK 543
++ +D ++ F+ P+ VN H V+++ + WE + + + S++ +E+ FW+
Sbjct: 9 LQRRLDGLLAFLNPHWDFVNCHMVNYLTDHHWEGFLSETLKSEIAGKEDVALAIEDLFWE 68
Query: 544 SRQDNHVETNSELVKFFQAADKFRLTSLMNGESIITVDELFSKLQARKCGQVVESA-KVS 602
+ + E F + + RL ++ E + +V+EL GQ + +
Sbjct: 69 TDESVRFPAWRE---FLGKSKQERLA--LHPELLTSVEELIE-------GQENSTQLSIR 116
Query: 603 QLMSEKKSYEVQVMSQVV-AAVTNSCDSSHIIDLG 636
+ MS KK +EV++ + +V V NS I+D G
Sbjct: 117 EFMSAKKCHEVELAAALVDQLVKNSGQECFIVDAG 151
>gi|323336337|gb|EGA77605.1| Ysh1p [Saccharomyces cerevisiae Vin13]
Length = 745
Score = 187 bits (474), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 100/284 (35%), Positives = 163/284 (57%), Gaps = 20/284 (7%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+++ +++TGDY+ DRHL +A + ++LI EST+ T + + RER
Sbjct: 135 MFQIEIAGLRVLFTGDYSREVDRHLNSAEVPPLSSNVLIVESTFGTATHEPRLNRERKLT 194
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW-----ERMNLQAPIYFAVGLTEKATN 115
+ +H V RGG+VL+PVFALGRAQE+ ++L+ YW E Q PI++A L +K +
Sbjct: 195 QLIHSTVMRGGRVLLPVFALGRAQEIMLILDEYWSQHADELGGGQVPIFYASNLAKKCMS 254
Query: 116 YYKMFITWTNQKIRKTF--VQRNMFDFKHIRPFDKSFIDNP------GPMVVFATPGMLH 167
++ ++ N IRK F Q N F FK+I S++ N GP V+ A+PGML
Sbjct: 255 VFQTYVNMMNDDIRKKFRDSQTNPFIFKNI-----SYLRNLEDFQDFGPSVMLASPGMLQ 309
Query: 168 SGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKL-EFENKQI-IDVKMAIEYM 225
SGLS + ++W P + N++++ G+ ++GT+ ++ + N +I I + +E +
Sbjct: 310 SGLSRDLLERWCPEDKNLVLITGYSIEGTMAKFIMLEPDTIPSINNPEITIPRRCQVEEI 369
Query: 226 SFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF 269
SF+AH D + ++ I+ N++LVHGEA+ M LK + F
Sbjct: 370 SFAAHVDFQENLEFIEKISAPNIILVHGEANPMGRLKSALLSNF 413
>gi|365764103|gb|EHN05628.1| Ysh1p [Saccharomyces cerevisiae x Saccharomyces kudriavzevii VIN7]
Length = 699
Score = 187 bits (474), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 100/284 (35%), Positives = 163/284 (57%), Gaps = 20/284 (7%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+++ +++TGDY+ DRHL +A + ++LI EST+ T + + RER
Sbjct: 89 MFQIEIAGLRVLFTGDYSREVDRHLNSAEVPPLSSNVLIVESTFGTATHEPRLNRERKLT 148
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW-----ERMNLQAPIYFAVGLTEKATN 115
+ +H V RGG+VL+PVFALGRAQE+ ++L+ YW E Q PI++A L +K +
Sbjct: 149 QLIHSTVMRGGRVLLPVFALGRAQEIMLILDEYWSQHADELGGGQVPIFYASNLAKKCMS 208
Query: 116 YYKMFITWTNQKIRKTF--VQRNMFDFKHIRPFDKSFIDNP------GPMVVFATPGMLH 167
++ ++ N IRK F Q N F FK+I S++ N GP V+ A+PGML
Sbjct: 209 VFQTYVNMMNDDIRKKFRDSQTNPFIFKNI-----SYLRNLEDFQDFGPSVMLASPGMLQ 263
Query: 168 SGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKL-EFENKQI-IDVKMAIEYM 225
SGLS + ++W P + N++++ G+ ++GT+ ++ + N +I I + +E +
Sbjct: 264 SGLSRDLLERWCPEDKNLVLITGYSIEGTMAKFIMLEPDTIPSINNPEITIPRRCQVEEI 323
Query: 226 SFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF 269
SF+AH D + ++ I+ N++LVHGEA+ M LK + F
Sbjct: 324 SFAAHVDFQENLEFIEKISAPNIILVHGEANPMGRLKSALLSNF 367
>gi|6323307|ref|NP_013379.1| Ysh1p [Saccharomyces cerevisiae S288c]
gi|74644951|sp|Q06224.1|YSH1_YEAST RecName: Full=Endoribonuclease YSH1; AltName: Full=Yeast 73 kDa
homolog 1; AltName: Full=mRNA 3'-end-processing protein
YSH1
gi|577190|gb|AAB67367.1| Ysh1p: subunit of polyadenylation factor I (PF I) [Saccharomyces
cerevisiae]
gi|151940984|gb|EDN59365.1| cleavage factor II (CF II) component [Saccharomyces cerevisiae
YJM789]
gi|190405336|gb|EDV08603.1| hypothetical protein SCRG_04228 [Saccharomyces cerevisiae RM11-1a]
gi|256269831|gb|EEU05091.1| Ysh1p [Saccharomyces cerevisiae JAY291]
gi|285813694|tpg|DAA09590.1| TPA: Ysh1p [Saccharomyces cerevisiae S288c]
gi|323332373|gb|EGA73782.1| Ysh1p [Saccharomyces cerevisiae AWRI796]
Length = 779
Score = 187 bits (474), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 100/284 (35%), Positives = 163/284 (57%), Gaps = 20/284 (7%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+++ +++TGDY+ DRHL +A + ++LI EST+ T + + RER
Sbjct: 169 MFQIEIAGLRVLFTGDYSREVDRHLNSAEVPPLSSNVLIVESTFGTATHEPRLNRERKLT 228
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW-----ERMNLQAPIYFAVGLTEKATN 115
+ +H V RGG+VL+PVFALGRAQE+ ++L+ YW E Q PI++A L +K +
Sbjct: 229 QLIHSTVMRGGRVLLPVFALGRAQEIMLILDEYWSQHADELGGGQVPIFYASNLAKKCMS 288
Query: 116 YYKMFITWTNQKIRKTF--VQRNMFDFKHIRPFDKSFIDNP------GPMVVFATPGMLH 167
++ ++ N IRK F Q N F FK+I S++ N GP V+ A+PGML
Sbjct: 289 VFQTYVNMMNDDIRKKFRDSQTNPFIFKNI-----SYLRNLEDFQDFGPSVMLASPGMLQ 343
Query: 168 SGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKL-EFENKQI-IDVKMAIEYM 225
SGLS + ++W P + N++++ G+ ++GT+ ++ + N +I I + +E +
Sbjct: 344 SGLSRDLLERWCPEDKNLVLITGYSIEGTMAKFIMLEPDTIPSINNPEITIPRRCQVEEI 403
Query: 226 SFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF 269
SF+AH D + ++ I+ N++LVHGEA+ M LK + F
Sbjct: 404 SFAAHVDFQENLEFIEKISAPNIILVHGEANPMGRLKSALLSNF 447
>gi|349579985|dbj|GAA25146.1| K7_Ysh1p [Saccharomyces cerevisiae Kyokai no. 7]
Length = 779
Score = 187 bits (474), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 100/284 (35%), Positives = 163/284 (57%), Gaps = 20/284 (7%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+++ +++TGDY+ DRHL +A + ++LI EST+ T + + RER
Sbjct: 169 MFQIEIAGLRVLFTGDYSREVDRHLNSAEVPPLSSNVLIVESTFGTATHEPRLNRERKLT 228
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW-----ERMNLQAPIYFAVGLTEKATN 115
+ +H V RGG+VL+PVFALGRAQE+ ++L+ YW E Q PI++A L +K +
Sbjct: 229 QLIHSTVMRGGRVLLPVFALGRAQEIMLILDEYWSQHADELGGGQVPIFYASNLAKKCMS 288
Query: 116 YYKMFITWTNQKIRKTF--VQRNMFDFKHIRPFDKSFIDNP------GPMVVFATPGMLH 167
++ ++ N IRK F Q N F FK+I S++ N GP V+ A+PGML
Sbjct: 289 VFQTYVNMMNDDIRKKFRDSQTNPFIFKNI-----SYLRNLEDFQDFGPSVMLASPGMLQ 343
Query: 168 SGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKL-EFENKQI-IDVKMAIEYM 225
SGLS + ++W P + N++++ G+ ++GT+ ++ + N +I I + +E +
Sbjct: 344 SGLSRDLLERWCPEDKNLVLITGYSIEGTMAKFIMLEPDTIPSINNPEITIPRRCQVEEI 403
Query: 226 SFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF 269
SF+AH D + ++ I+ N++LVHGEA+ M LK + F
Sbjct: 404 SFAAHVDFQENLEFIEKISAPNIILVHGEANPMGRLKSALLSNF 447
>gi|323303815|gb|EGA57598.1| Ysh1p [Saccharomyces cerevisiae FostersB]
Length = 727
Score = 187 bits (474), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 100/284 (35%), Positives = 163/284 (57%), Gaps = 20/284 (7%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+++ +++TGDY+ DRHL +A + ++LI EST+ T + + RER
Sbjct: 135 MFQIEIAGLRVLFTGDYSREVDRHLNSAEVPPLSSNVLIVESTFGTATHEPRLNRERKLT 194
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW-----ERMNLQAPIYFAVGLTEKATN 115
+ +H V RGG+VL+PVFALGRAQE+ ++L+ YW E Q PI++A L +K +
Sbjct: 195 QLIHSTVMRGGRVLLPVFALGRAQEIMLILDEYWSQHADELGGGQVPIFYASNLAKKCMS 254
Query: 116 YYKMFITWTNQKIRKTF--VQRNMFDFKHIRPFDKSFIDNP------GPMVVFATPGMLH 167
++ ++ N IRK F Q N F FK+I S++ N GP V+ A+PGML
Sbjct: 255 VFQTYVNMMNDDIRKKFRDSQTNPFIFKNI-----SYLRNLEDFQDFGPSVMLASPGMLQ 309
Query: 168 SGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKL-EFENKQI-IDVKMAIEYM 225
SGLS + ++W P + N++++ G+ ++GT+ ++ + N +I I + +E +
Sbjct: 310 SGLSRDLLERWCPEDKNLVLITGYSIEGTMAKFIMLEPDTIPSINNPEITIPRRCQVEEI 369
Query: 226 SFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF 269
SF+AH D + ++ I+ N++LVHGEA+ M LK + F
Sbjct: 370 SFAAHVDFQENLEFIEKISAPNIILVHGEANPMGRLKSALLSNF 413
>gi|259148260|emb|CAY81507.1| Ysh1p [Saccharomyces cerevisiae EC1118]
Length = 779
Score = 186 bits (473), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 100/284 (35%), Positives = 163/284 (57%), Gaps = 20/284 (7%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+++ +++TGDY+ DRHL +A + ++LI EST+ T + + RER
Sbjct: 169 MFQIEIAGLRVLFTGDYSREVDRHLNSAEVPPLSSNVLIVESTFGTATHEPRLNRERKLT 228
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW-----ERMNLQAPIYFAVGLTEKATN 115
+ +H V RGG+VL+PVFALGRAQE+ ++L+ YW E Q PI++A L +K +
Sbjct: 229 QLIHSTVMRGGRVLLPVFALGRAQEIMLILDEYWSQHADELGGGQVPIFYASNLAKKCMS 288
Query: 116 YYKMFITWTNQKIRKTF--VQRNMFDFKHIRPFDKSFIDNP------GPMVVFATPGMLH 167
++ ++ N IRK F Q N F FK+I S++ N GP V+ A+PGML
Sbjct: 289 VFQTYVNMMNDDIRKKFRDSQTNPFIFKNI-----SYLRNLEDFQDFGPSVMLASPGMLQ 343
Query: 168 SGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKL-EFENKQI-IDVKMAIEYM 225
SGLS + ++W P + N++++ G+ ++GT+ ++ + N +I I + +E +
Sbjct: 344 SGLSRDLLERWCPEDKNLVLITGYSIEGTMAKFIMLEPDTIPSINNPEITIPRRCQVEEI 403
Query: 226 SFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF 269
SF+AH D + ++ I+ N++LVHGEA+ M LK + F
Sbjct: 404 SFAAHVDFQENLEFIEKISAPNIILVHGEANPMGRLKSALLSNF 447
>gi|70926510|ref|XP_735783.1| hypothetical protein [Plasmodium chabaudi chabaudi]
gi|56509750|emb|CAH82972.1| hypothetical protein PC300249.00.0 [Plasmodium chabaudi chabaudi]
Length = 258
Score = 186 bits (472), Expect = 6e-44, Method: Composition-based stats.
Identities = 85/181 (46%), Positives = 122/181 (67%), Gaps = 1/181 (0%)
Query: 102 PIYFAVGLTEKATNYYKMFITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFA 161
PIYF GLTE A YYK++ +W N T + +N+FDF +I F ++++ PMV+FA
Sbjct: 1 PIYFGCGLTENANKYYKIYSSWINNDCVSTEL-KNLFDFSNISQFSNNYLNENRPMVLFA 59
Query: 162 TPGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMA 221
TPGMLH+GL+L FK WA NN++I+PG+CVQGTIGHK++ G KK+ + I V
Sbjct: 60 TPGMLHTGLALKAFKAWASNPNNLIILPGYCVQGTIGHKLIMGEKKILLDGSTYIYVNCK 119
Query: 222 IEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGES 281
I Y+SFSAHAD+ GI QLI++ PKNV+ VHG+ + M L + I +++++ P G++
Sbjct: 120 IIYLSFSAHADSNGIQQLIKHVMPKNVIFVHGDKNGMEKLSKHISNQYHINSICPYMGQN 179
Query: 282 C 282
C
Sbjct: 180 C 180
>gi|342319748|gb|EGU11695.1| Endoribonuclease YSH1 [Rhodotorula glutinis ATCC 204091]
Length = 857
Score = 186 bits (472), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 121/366 (33%), Positives = 195/366 (53%), Gaps = 27/366 (7%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCR--PDLLITESTYATTIRDSKRCRERD 58
MF ++V ++YTGDY+ DRHL A + PD++I ESTY + + +E
Sbjct: 192 MFLIEVAGARVLYTGDYSTEEDRHLVPAKVPNWERPPDVMICESTYGVQSHEPRLEKEAQ 251
Query: 59 FLKKVHECVDRGGKVLIPVFALGRAQELCILLETYW-ERMNLQ-APIYFAVGLTEKATNY 116
F V + RGG+VL+PVFALGRAQEL ++L+ YW E LQ PIY+ L K +
Sbjct: 252 FTNLVRSILKRGGRVLLPVFALGRAQELLLILDEYWAEHPELQHIPIYYVSSLAIKCMDV 311
Query: 117 YKMFITWTNQKIRKTFVQR-NMFDFKH----IRPFDK--SFIDNPGPMVVFATPGMLHSG 169
Y+ +I + +R F + N FDFK IRP D+ S +++ P VV A+PG L SG
Sbjct: 312 YRQYIHTMSPNVRSKFARGINPFDFKRKDSFIRPLDRGISKLNDRNPCVVMASPGFLTSG 371
Query: 170 LSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQI-IDVKMAIEYMSFS 228
+S + +KWAP N LI+ G+ V+G + +++ ++ N I +++++Y+SFS
Sbjct: 372 VSRELLEKWAPDPRNGLIITGYSVEGVMARTIMNEPTEIAALNGGAKIARRLSVDYISFS 431
Query: 229 AHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF-----NLDCFMPANGESCF 283
AH D + I P +++LVHGE + M L+ +K F ++ + P N E+
Sbjct: 432 AHVDYTQNSKFIDEVMPSHLILVHGEVNNMSRLRAALKTRFAERKNDVQIYTPRNVETVK 491
Query: 284 VQTDMKISIDVSVNLLKEEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVG 343
++ + + ++ L + A+ P PL G+LV KD S +L+ + + G
Sbjct: 492 LKFRGE-RMAKALGSLAQTALT----PATPLS-----GLLVSKDFSYTLLSPADLREFTG 541
Query: 344 ISRHIV 349
+S ++
Sbjct: 542 LSTSLI 547
>gi|448517227|ref|XP_003867743.1| endoribonuclease [Candida orthopsilosis Co 90-125]
gi|380352082|emb|CCG22306.1| endoribonuclease [Candida orthopsilosis]
Length = 769
Score = 186 bits (472), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 132/473 (27%), Positives = 237/473 (50%), Gaps = 63/473 (13%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
M+ +++G +++TGDY+ +RHL +A + +PD+LITEST+ T + K E+
Sbjct: 177 MYLIEIGGLKVLFTGDYSREENRHLPSAEVPPVKPDVLITESTFGTGTLEPKAELEKKLT 236
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERM-NLQ-APIYFAVGLTEKATNYYK 118
+H + +GG+VL+PVFALG AQEL ++L+ YWE+ +LQ +Y+ L K Y+
Sbjct: 237 NHIHATITKGGRVLLPVFALGNAQELLLILDEYWEKNEDLQNVSVYYCSDLARKCMAVYE 296
Query: 119 MFITWTNQKIR----KTFVQRNMFDFKHIRPF-DKSFIDNPGPMVVFATPGMLHSGLSLI 173
+ N KIR + N FDFK+I+ + S + GP VV ATPGML +G+S
Sbjct: 297 TYTGIMNDKIRLSSSSDDSKSNPFDFKYIKSIRNLSKFSDLGPSVVVATPGMLQAGVSRQ 356
Query: 174 IFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDV--KMAIEYMSFSAHA 231
+ +KWAP + N++I+ G+ V+GT+ +L + ++ N + + ++ IE +SF+AH
Sbjct: 357 LLEKWAPEQKNLVILTGYSVEGTMAKDLLKEPQVIQSLNNPDLSIPRRIGIEEISFAAHV 416
Query: 232 DAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFN--------LDCFMPANGESCF 283
D + + I P ++LVHG++ M LK + +++ + F P N E
Sbjct: 417 DFQQNSEFIDKVSPSRIILVHGDSVPMGRLKSALLSKYSSRKGTDKEVKVFNPRNCEELN 476
Query: 284 VQ-TDMKIS------IDVSVNLLKEEAVKYNSEPPNPLKERQ------------------ 318
+ +KI+ + + LK+E + +E + ++ +
Sbjct: 477 IAFKGLKIAKVLGSLAEAQLQALKQEIEEKITEADDATEDVKMEGGDDSKEQVNGTSNTF 536
Query: 319 -----IHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDSSPINKTLELIY-- 371
+ GVLV KD + L+ + + + +S IV+ N++I+ + P+ +I+
Sbjct: 537 QPGQIVSGVLVSKDFELDLVQLQDLNEFTQLSTSIVKSKMNLKINANLPL-----VIWHL 591
Query: 372 DQLLSY---LQDKSQEYKIQLTESSLSIDSVLLSVERIDDKNKRVFVTWPNQD 421
+Q+ Y + D E++ + +D V + V+R + V W N +
Sbjct: 592 EQMFGYINIINDDDNEWECVI------MDVVDILVDRSKGPGIYISVEWINDN 638
>gi|254582142|ref|XP_002497056.1| ZYRO0D14410p [Zygosaccharomyces rouxii]
gi|238939948|emb|CAR28123.1| ZYRO0D14410p [Zygosaccharomyces rouxii]
Length = 772
Score = 185 bits (470), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 102/284 (35%), Positives = 161/284 (56%), Gaps = 20/284 (7%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+++ +++TGDY+ DRHL +A I D+LI EST+ T + + RER
Sbjct: 167 MFQIEIAGLRVLFTGDYSRELDRHLNSAEIPPFPSDVLIVESTFGTATHEPRINRERKLT 226
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW-----ERMNLQAPIYFAVGLTEKATN 115
+ +H V +GG+VL+PVFALGRAQEL ++L+ YW E Q PIY+A L K +
Sbjct: 227 QLIHSTVTKGGRVLLPVFALGRAQELMLILDEYWSQHAEELGGGQVPIYYASNLARKCMS 286
Query: 116 YYKMFITWTNQKIRKTF--VQRNMFDFKHIRPFDKSFIDNP------GPMVVFATPGMLH 167
++ ++ N IR+ F Q N F FK+I S++ N GP V+ A+PGML
Sbjct: 287 VFQTYVNMMNDDIRRKFRDSQTNPFVFKNI-----SYLKNIDEFQDFGPSVMLASPGMLQ 341
Query: 168 SGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKL-EFENKQI-IDVKMAIEYM 225
+GLS + ++W P N++++ G+ V+GT+ ++ + N +I I + IE +
Sbjct: 342 NGLSREVLERWCPEGKNLVLITGYSVEGTMAKFLMLEPDTIPSINNPEITIPRRCQIEEI 401
Query: 226 SFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF 269
SF+AH D + ++ I+ N++LVHGE++ M LK + F
Sbjct: 402 SFAAHVDFQENLEFIEKISAPNIVLVHGESNPMGRLKSALLSNF 445
>gi|195551860|ref|XP_002076314.1| GD15405 [Drosophila simulans]
gi|194201963|gb|EDX15539.1| GD15405 [Drosophila simulans]
Length = 494
Score = 185 bits (470), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 120/368 (32%), Positives = 188/368 (51%), Gaps = 45/368 (12%)
Query: 662 VSQLMSEKKSYEVQVMSQVV-AAVTNSCDSSHIIDLGGGQGYLSTILALQHGKKTLSLDY 720
+ + MS KK +EV++ + +V V NS I+D G G+GYLS+ LALQ+G + L +D
Sbjct: 115 IREFMSAKKCHEVELAAALVDQLVKNSGQECFIVDAGDGKGYLSSRLALQYGHRVLGIDA 174
Query: 721 NQVNTHGAAVRSKKLEDSEKESKGPFQSYAGVINKKLWMRTQNRQTSHASVEHHGKNWKR 780
N NT A R++KL+ R N T A ++ G KR
Sbjct: 175 NAANTENALNRNRKLQ-----------------------RAWNGLTERAELQVQGITPKR 211
Query: 781 KSKAPVVSIKDEELVVCKDKCKQITHFVTPDSDISSILS----QAYPQDSLHNVCIMGLH 836
+ K + + + K F+T + + ++L+ Q P+DS N+C+ GLH
Sbjct: 212 RGKKSPAR-ESTKTAPALENYKTTAKFITTELNFGALLAGHFTQLGPEDS-PNICLTGLH 269
Query: 837 TCGDLSGTALRLF-TKSSLQCLVQVGCCYHLLEEEFIRSPFWKDVDQSLYEHGYGFPLSE 895
TCG+L+ T L++F +S L VGCCYHLL E + + F+ + + YGFPLS+
Sbjct: 270 TCGNLAATCLQVFHAQSDCHLLCNVGCCYHLLRERYSQQEFFGNKALMELQTDYGFPLSQ 329
Query: 896 HLRSRKFFLGRNVRMSGTQSPERVIDLKQTQTLPLFYRALLEKYLRSKITINDEEPKV-- 953
+LR R+ +GRN RM QS ER +D K+ + L+YRALLE I ++ P++
Sbjct: 330 YLRERQVRMGRNARMLAAQSIERTVDAKELPNVTLYYRALLE------ILVHRHAPQLKN 383
Query: 954 ---VGRLATKCSNFVEYVHRAVDKLKLDL--EVDDEEVTRLFNSHQREYEYLQIYYFLKT 1008
VG++ K +F EY+ + KL V EE+ L + + +L ++Y L+
Sbjct: 384 ELQVGKV-RKFESFEEYIQKCATKLDAPWLTAVKKEELQSLLQEYALDKHFLDLFYLLRM 442
Query: 1009 ALAPVIEA 1016
+ APV+E+
Sbjct: 443 SFAPVLES 450
Score = 46.2 bits (108), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 37/155 (23%), Positives = 73/155 (47%), Gaps = 15/155 (9%)
Query: 485 IKNTIDEMIEFITPNIACVNTHTVHFMVEKLWETTIPKAILSQVNNSPSFDSLLEE-FWK 543
++ +D ++ F+ P+ VN H V+++ + WE + + + S++ +E+ FWK
Sbjct: 9 LQRRLDGLLAFLNPHWDFVNCHMVNYLTDHHWEGFLSETLKSEIAGKEDVALAIEDLFWK 68
Query: 544 SRQDNHVETNSELVKFFQAADKFRLTSLMNGESIITVDELFSKLQARKCGQVVESA-KVS 602
+ + E F + + RL ++ E + +V+EL GQ + +
Sbjct: 69 TDESVRFPAWRE---FLSKSKQERLA--LHPELLTSVEELIE-------GQENSTQLSIR 116
Query: 603 QLMSEKKSYEVQVMSQVV-AAVTNSCDSSHIIDLG 636
+ MS KK +EV++ + +V V NS I+D G
Sbjct: 117 EFMSAKKCHEVELAAALVDQLVKNSGQECFIVDAG 151
>gi|363750442|ref|XP_003645438.1| hypothetical protein Ecym_3113 [Eremothecium cymbalariae
DBVPG#7215]
gi|356889072|gb|AET38621.1| Hypothetical protein Ecym_3113 [Eremothecium cymbalariae
DBVPG#7215]
Length = 773
Score = 185 bits (470), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 99/285 (34%), Positives = 164/285 (57%), Gaps = 20/285 (7%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQV++ I++TGDY+ DRHL +A + D+LI EST+ T + + +ER
Sbjct: 170 MFQVEIAGLRILFTGDYSRELDRHLNSAEVPSLPSDILIVESTFGTATHEPRVSKERKLT 229
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW-----ERMNLQAPIYFAVGLTEKATN 115
+ +H V +GG+VL+PVFALGRAQE+ ++L+ YW E Q PI++A L K +
Sbjct: 230 QLIHTTVAKGGRVLLPVFALGRAQEIMLILDEYWSQHAEELGTGQVPIFYASNLARKCMS 289
Query: 116 YYKMFITWTNQKIRKTF--VQRNMFDFKHIRPFDKSFIDNP------GPMVVFATPGMLH 167
++ ++ N KIRK F Q N F FK+I S++ N GP V+ A+PGML
Sbjct: 290 VFQTYVNMMNDKIRKKFRDSQTNPFIFKNI-----SYLKNLDEFQDFGPSVMLASPGMLQ 344
Query: 168 SGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDV--KMAIEYM 225
+GLS + +KW P E N++++ G+ V+GT+ ++ + + N + + + +E +
Sbjct: 345 NGLSRDLLEKWCPDEKNLVLITGYSVEGTMAKFLILEPESIPSINNPDVTIPRRCQVEEI 404
Query: 226 SFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFN 270
SF+AH D + ++ ++ +++LVHGE++ M LK + F+
Sbjct: 405 SFAAHVDFRENLEFVEKIGAPSIILVHGESNPMGRLKSALLSNFS 449
>gi|440493393|gb|ELQ75870.1| mRNA cleavage and polyadenylation factor II complex, BRR5 (CPSF
subunit) [Trachipleistophora hominis]
Length = 801
Score = 185 bits (470), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 117/308 (37%), Positives = 167/308 (54%), Gaps = 33/308 (10%)
Query: 36 DLLITESTYATTIRDSKRCRERDFLKKVHECVDRGGKVLIPVFALGRAQELCILLETYWE 95
D+LI ESTY + RER F + V++ V RGGK L+PVFALGRAQEL ++LE YWE
Sbjct: 266 DVLICESTYGVQCHLPREERERRFTQIVNDIVTRGGKCLLPVFALGRAQELLLILEDYWE 325
Query: 96 RMNLQ---APIYFAVGLTEKATNYYKMFITWTNQKIRKTFVQRNMFDFKHIRPF---DKS 149
R N Q PIY+A L + + Y+ + N KI+K + F FKHIR D
Sbjct: 326 R-NPQLHNVPIYYASALANRCLSIYQAYTHMMNLKIKK-----DAFTFKHIRNLKSVDNH 379
Query: 150 FIDNPGPMVVFATPGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLE 209
I N VV A+PGML SGLS +F+ W NN ++PG+CVQGT+ ++++ K++
Sbjct: 380 LIKNA--CVVMASPGMLQSGLSRELFESWCEDANNGTVIPGYCVQGTLAKEIMTEPKEIV 437
Query: 210 FENKQIIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF 269
N + + M +EY+SFSAH D I+ C P V+LVHGE ++M+ LK +++
Sbjct: 438 AMNGHRLRLNMRVEYISFSAHVDYVQNTSFIEKCTPMIVMLVHGEVNEMMRLKAALEKRH 497
Query: 270 NLDCFMPANGESCFVQTDMKISIDVSVNLLKEEAVKYNSEPPNPLKERQIHGVLVIKDSS 329
++ NGE ++ + S+ V NL+ E I GV+V + S
Sbjct: 498 SV--LALKNGEMHEIEV-RRESVAVGKNLV----------------EGDIEGVVVNEQSG 538
Query: 330 ISLMDVDE 337
I + DE
Sbjct: 539 IRVYRRDE 546
>gi|401624491|gb|EJS42547.1| ysh1p [Saccharomyces arboricola H-6]
Length = 779
Score = 185 bits (470), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 99/284 (34%), Positives = 163/284 (57%), Gaps = 20/284 (7%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+++ +++TGDY+ DRHL +A + ++LI EST+ T + + RE+
Sbjct: 169 MFQIEIAGLRVLFTGDYSREVDRHLNSAEVPPLSSNVLIVESTFGTATHEPRLNREKKLT 228
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW-----ERMNLQAPIYFAVGLTEKATN 115
+ +H V RGG+VL+PVFALGRAQE+ ++L+ YW E Q PI++A L +K +
Sbjct: 229 QLIHSTVMRGGRVLLPVFALGRAQEIMLILDEYWSQHTDELGGGQVPIFYASNLAKKCMS 288
Query: 116 YYKMFITWTNQKIRKTF--VQRNMFDFKHIRPFDKSFIDNP------GPMVVFATPGMLH 167
++ ++ N IRK F Q N F FK+I S++ N GP V+ A+PGML
Sbjct: 289 VFQTYVNMMNDDIRKKFRDSQTNPFIFKNI-----SYLRNLEDFQDFGPSVMLASPGMLQ 343
Query: 168 SGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKL-EFENKQI-IDVKMAIEYM 225
SGLS + ++W P + N++++ G+ ++GT+ ++ + N +I I + +E +
Sbjct: 344 SGLSRDLLERWCPEDKNLVLITGYSIEGTMAKFIMLEPDTIPSINNPEITIPRRCQVEEI 403
Query: 226 SFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF 269
SF+AH D + ++ I+ N++LVHGEA+ M LK + F
Sbjct: 404 SFAAHVDFQENLEFIEKISAPNIILVHGEANPMGRLKSALLSNF 447
>gi|409044817|gb|EKM54298.1| hypothetical protein PHACADRAFT_146128 [Phanerochaete carnosa
HHB-10118-sp]
Length = 869
Score = 185 bits (469), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 112/348 (32%), Positives = 177/348 (50%), Gaps = 25/348 (7%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF + + I+YTGDY+ DRHL A + RPD+LI EST+ + + +E F
Sbjct: 163 MFLIDIAGLKILYTGDYSREEDRHLVKAEVPPIRPDVLIVESTFGVQTLEGREEKELRFT 222
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH + RGG VL+P FALGRAQEL ++L+ YW++ P+Y+A L K Y+
Sbjct: 223 NLVHNIIRRGGHVLLPTFALGRAQELLLILDEYWKKHPDLHNVPVYYASSLARKCMAVYQ 282
Query: 119 MFITWTNQKIRKTFVQR-NMFDFKHI------RPFDKSFIDNPGPMVVFATPGMLHSGLS 171
+I N +R F +R N F FKHI R +++ + P VV A+PG + SG S
Sbjct: 283 TYIHTMNSNVRSRFAKRDNPFVFKHISNVPHSRGWERKIAEGPS-CVVLASPGFMESGPS 341
Query: 172 LIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHA 231
+ + WAP N +I+ G+ ++GT+ + + ++ I K++I+ +SF AH
Sbjct: 342 RELLELWAPDSRNGVILTGYSIEGTMARDIQTEPDEIPSVKGHTIPRKLSIDEISFGAHV 401
Query: 232 DAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF-NLDCFMPANGESCFVQTDMKI 290
D + I+ ++++LVHGE + M L+ ++ + N D E + T +
Sbjct: 402 DYSQNSEFIELIHAQHIVLVHGEQNAMGRLRAALQDRYKNRD-------EDVKIHTPRNL 454
Query: 291 SIDVSVNLLKEEAVK----YNSEPPNPLKERQIHGVLVIKDSSISLMD 334
+ + E K ++PP P I G+LV KD S +L+D
Sbjct: 455 E-PLQLTFRGERVAKAIGTLAAKPPQP--NEVISGLLVSKDYSYTLLD 499
>gi|410074967|ref|XP_003955066.1| hypothetical protein KAFR_0A04950 [Kazachstania africana CBS 2517]
gi|372461648|emb|CCF55931.1| hypothetical protein KAFR_0A04950 [Kazachstania africana CBS 2517]
Length = 769
Score = 185 bits (469), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 97/284 (34%), Positives = 161/284 (56%), Gaps = 20/284 (7%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+++ +++TGDY+ DRHL +A + D+LI EST+ T + + RE+
Sbjct: 166 MFQIEIAGLRVLFTGDYSRETDRHLNSAEVPPLSSDILIVESTFGTATHEPRLSREKKLT 225
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW-----ERMNLQAPIYFAVGLTEKATN 115
+ +H V +GG+VL+PVFALGRAQEL ++L+ +W E Q PI++A L K +
Sbjct: 226 QLIHTTVSQGGRVLMPVFALGRAQELMLILDEFWSQHADELGGGQVPIFYASDLARKCMS 285
Query: 116 YYKMFITWTNQKIRKTF--VQRNMFDFKHIRPFDKSFIDNP------GPMVVFATPGMLH 167
++ ++ N IRK F Q N F FK+I S++ N GP V+ A+PGML
Sbjct: 286 VFQTYVNMMNDDIRKKFRDSQTNPFIFKNI-----SYLKNLEEFQDFGPSVMLASPGMLQ 340
Query: 168 SGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDV--KMAIEYM 225
SG+S + ++W P + N++++ G+ V+GT+ ++ + N I + + +E +
Sbjct: 341 SGISRDLLERWCPDDKNLVLITGYSVEGTMAKFIMLEPDTIPSVNNPEITIPRRCQVEEI 400
Query: 226 SFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF 269
SF+AH D + ++ I+ N++LVHGE++ M LK + F
Sbjct: 401 SFAAHVDFQENLEFIEKINANNIILVHGESNPMGRLKSALLSNF 444
>gi|344302811|gb|EGW33085.1| hypothetical protein SPAPADRAFT_66091 [Spathaspora passalidarum
NRRL Y-27907]
Length = 762
Score = 184 bits (468), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 102/297 (34%), Positives = 168/297 (56%), Gaps = 17/297 (5%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
M+ +++G +++TGDY+ +RHL AA I +PD+LI+EST+ T +SK E+
Sbjct: 173 MYFIEIGGLKVLFTGDYSREENRHLHAAEIPPVKPDILISESTFGTGTLESKADLEKKLT 232
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERM-NLQ-APIYFAVGLTEKATNYYK 118
+H + +GG+VL+PVFALG QEL ++L+ YW +LQ +Y+A L +K Y+
Sbjct: 233 NHIHATLTKGGRVLLPVFALGNTQELLLILDEYWNNNEDLQNINVYYASSLAKKCMAVYE 292
Query: 119 MFITWTNQKIR----KTFVQRNMFDFKHIRPF-DKSFIDNPGPMVVFATPGMLHSGLSLI 173
+ + N KIR + + N FDFK+I+ D + GP VV A PGML +G+S
Sbjct: 293 TYTSIMNDKIRLSASSSGHKSNPFDFKYIKSIRDLGKFQDMGPSVVIAAPGMLQAGISRQ 352
Query: 174 IFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFEN--KQIIDVKMAIEYMSFSAHA 231
+ +KWAP N++I+ G+ V+GT+ ++L ++ N + I ++ IE +SF+AH
Sbjct: 353 LLEKWAPDPKNLVILTGYSVEGTMAKELLKEPHTIQSVNNPEMTIPRRIGIEEISFAAHV 412
Query: 232 DAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFN--------LDCFMPANGE 280
D + + I+ P ++LVHG++ M LK + +++ + F P N E
Sbjct: 413 DFQQNSEFIEKVSPSKIILVHGDSVPMGRLKSALLSKYSSRKGTEQEVKVFNPKNCE 469
>gi|50287519|ref|XP_446189.1| hypothetical protein [Candida glabrata CBS 138]
gi|74637743|sp|Q6FUA5.1|YSH1_CANGA RecName: Full=Endoribonuclease YSH1; AltName: Full=mRNA
3'-end-processing protein YSH1
gi|49525496|emb|CAG59113.1| unnamed protein product [Candida glabrata]
Length = 771
Score = 184 bits (468), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 99/280 (35%), Positives = 162/280 (57%), Gaps = 20/280 (7%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+++ +++TGDY+ DRHL +A + D+LI EST+ T + + RE+
Sbjct: 168 MFQIEIAGLRVLFTGDYSREIDRHLNSAEVPPLPSDILIVESTFGTATHEPRLHREKKLT 227
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW-----ERMNLQAPIYFAVGLTEKATN 115
+ +H V++GG+VL+PVFALGRAQEL ++L+ YW E + Q PI++A L K +
Sbjct: 228 QLIHSTVNKGGRVLMPVFALGRAQELMLILDEYWSQHKEELGSNQIPIFYASNLARKCLS 287
Query: 116 YYKMFITWTNQKIRKTF--VQRNMFDFKHIRPFDKSFIDNP------GPMVVFATPGMLH 167
++ ++ N IRK F Q N F FK+I ++I N GP V+ A+PGML
Sbjct: 288 VFQTYVNMMNDNIRKKFRDSQTNPFIFKNI-----AYIKNLDEFQDFGPSVMLASPGMLQ 342
Query: 168 SGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKL-EFENKQI-IDVKMAIEYM 225
+GLS + ++W P E N++++ G+ V+GT+ +L + N ++ I + +E +
Sbjct: 343 NGLSRDLLERWCPDEKNLVLITGYSVEGTMAKYLLLEPDTIPSVSNPEVTIPRRCRVEEL 402
Query: 226 SFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKI 265
SF+AH D + ++ I+ N++LVHGE + M LK +
Sbjct: 403 SFAAHVDFQENLEFIEQINASNIILVHGEPNPMGRLKSAL 442
>gi|323307973|gb|EGA61229.1| Ysh1p [Saccharomyces cerevisiae FostersO]
Length = 727
Score = 184 bits (467), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 99/284 (34%), Positives = 162/284 (57%), Gaps = 20/284 (7%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MFQ+++ +++TGDY+ DRHL +A + ++LI EST+ T + + RER
Sbjct: 135 MFQIEIAGLRVLFTGDYSREVDRHLNSAEVPPLSSNVLIVESTFGTATHEPRLNRERKLT 194
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW-----ERMNLQAPIYFAVGLTEKATN 115
+ +H V RGG+VL+PVFALGRAQE+ ++L+ YW E Q PI++A L +K +
Sbjct: 195 QLIHSTVMRGGRVLLPVFALGRAQEIMLILDEYWSQHADELGGGQVPIFYASNLAKKCMS 254
Query: 116 YYKMFITWTNQKIRKTF--VQRNMFDFKHIRPFDKSFIDNP------GPMVVFATPGMLH 167
++ ++ N I K F Q N F FK+I S++ N GP V+ A+PGML
Sbjct: 255 VFQTYVNMMNDDIXKKFRDSQTNPFIFKNI-----SYLRNLEDFQDFGPSVMLASPGMLQ 309
Query: 168 SGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKL-EFENKQI-IDVKMAIEYM 225
SGLS + ++W P + N++++ G+ ++GT+ ++ + N +I I + +E +
Sbjct: 310 SGLSRDLLERWCPEDKNLVLITGYSIEGTMAKFIMLEPDTIPSINNPEITIPRRCQVEEI 369
Query: 226 SFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF 269
SF+AH D + ++ I+ N++LVHGEA+ M LK + F
Sbjct: 370 SFAAHVDFQENLEFIEKISAPNIILVHGEANPMGRLKSALLSNF 413
>gi|392575747|gb|EIW68879.1| hypothetical protein TREMEDRAFT_44189 [Tremella mesenterica DSM
1558]
Length = 738
Score = 184 bits (466), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 119/382 (31%), Positives = 198/382 (51%), Gaps = 15/382 (3%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ I+YTGDY+ DRHL A + +PD++I EST+ ++ +E F
Sbjct: 183 MFMIEIAGLKILYTGDYSREEDRHLVIAEVPPVKPDVMICESTFGVHTLPDRKEKEEQFT 242
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW-ERMNLQ-APIYFAVGLTEKATNYYK 118
V V RGG+ L+P+ + G QEL +LL+ YW + LQ PI+FA GL ++ YK
Sbjct: 243 TLVSNIVKRGGRCLMPIPSFGNGQELALLLDEYWHDHPELQNIPIFFASGLFQRGMRVYK 302
Query: 119 MFITWTNQKIRKTFVQR-NMFDFKHIRPF-DKSFIDN-PGPMVVFATPGMLHSGLSLIIF 175
++ N IR F +R N FDFK+++P D DN P VV A+ + GLS +
Sbjct: 303 TYVHTMNANIRSRFARRDNPFDFKYVKPLKDGRRGDNFKSPCVVMASAQFMSFGLSRELL 362
Query: 176 KKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKG 235
+ WAP E N +I+ G+ ++GT+ +L +E I ++ ++ +SFSAH D
Sbjct: 363 EDWAPGEKNGVIVTGYSIEGTMARTLLGEPDHIESLRGTKIPRRLTVKEISFSAHVDYGQ 422
Query: 236 IMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVS 295
+ IQ ++++LVHGE+S+M+ L+ ++ ++ A G+ + T ++
Sbjct: 423 NSRFIQEVGAQHIVLVHGESSQMLRLRAALRDSYS------ARGQEVNIHTPRNCE-PLT 475
Query: 296 VNLLKEEAVKY--NSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTS 353
+ +E VK + P + G+LV KD S +L+D + G+S +
Sbjct: 476 LTFRQERQVKAIGSLAAHRPEHGAIVKGLLVSKDFSYTLLDPKDLQDFTGLSTSTIVQKQ 535
Query: 354 NVQIS-DSSPINKTLELIYDQL 374
+V +S D S + LE +Y ++
Sbjct: 536 SVPLSVDWSVVRWHLEGMYGEV 557
>gi|353239750|emb|CCA71648.1| related to YSH1-component of pre-mRNA polyadenylation factor PF I
[Piriformospora indica DSM 11827]
Length = 756
Score = 184 bits (466), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 120/385 (31%), Positives = 199/385 (51%), Gaps = 36/385 (9%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF + + ++YTGDY+ DRHL A + RPD+LI ESTY + RE+ F+
Sbjct: 161 MFLIDIAGLKVLYTGDYSREEDRHLVQAQVPSIRPDVLICESTYGVQKHEELSGREKRFV 220
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWER-MNLQA-PIYFAVGLTEKATNYYK 118
V V RGG VL+P FALGRAQE+ ++LE +W R +L PIY+ L +K Y+
Sbjct: 221 DLVTAVVKRGGHVLLPAFALGRAQEILLILEEHWSRNPDLHGVPIYYVSSLAKKCMAVYQ 280
Query: 119 MFITWTNQKIRKTF-VQRNMFDFKHI------RPFDKSFIDNPGPMVVFATPGMLHSGLS 171
I+ N KI++ + Q N F FK+I R +K + P P VV A+PG + +G S
Sbjct: 281 TNISSMNSKIQERWKKQENPFVFKYITNLPQTRGAEKKVAEGP-PCVVLASPGFMDNGSS 339
Query: 172 LIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHA 231
+ + WAP N +I+ G+ V+GT+ + + ++ I ++A+E +SFSAH
Sbjct: 340 RELLELWAPDPRNAVIVTGYSVEGTMARDIQNSPDEIVSLKGTTIPRRLAVEVISFSAHV 399
Query: 232 DAKGIMQLIQYCEPKNV-----------LLVHGEASKMVFLKEKIKQEF-----NLDCFM 275
D + + I+ + +++ +L+HGE+ M LK +++Q + +
Sbjct: 400 DGEQNLDFIEKVKAQHIVSLLIASSDVKVLMHGESKAMFSLKNRLEQRYAGNEEGIQIHA 459
Query: 276 PANGESCFVQTDMKISIDVSVNLLKEEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDV 335
P N E+ + T I ++ + E+ VK ++ QI G+L+ KD + +L+D
Sbjct: 460 PWNTETLKL-TFRGDRIAKAIGRIAEKGVKEDA---------QIQGLLLSKDFTFTLLDP 509
Query: 336 DEACKEVGISRHIVRFTSNVQISDS 360
+ + G+S ++ T + +S S
Sbjct: 510 RDLEEVTGLSTSVIVQTQRIALSVS 534
>gi|68489322|ref|XP_711502.1| hypothetical protein CaO19.12941 [Candida albicans SC5314]
gi|68489371|ref|XP_711478.1| hypothetical protein CaO19.5486 [Candida albicans SC5314]
gi|74584420|sp|Q59P50.1|YSH1_CANAL RecName: Full=Endoribonuclease YSH1; AltName: Full=mRNA
3'-end-processing protein YSH1
gi|46432783|gb|EAK92250.1| hypothetical protein CaO19.5486 [Candida albicans SC5314]
gi|46432809|gb|EAK92275.1| hypothetical protein CaO19.12941 [Candida albicans SC5314]
Length = 870
Score = 184 bits (466), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 98/273 (35%), Positives = 160/273 (58%), Gaps = 8/273 (2%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
M+ +++G +++TGDY+ +RHL AA + +PD+LI+EST+ T + + ER
Sbjct: 262 MYFIEIGGLKVLFTGDYSREENRHLHAAEVPPLKPDILISESTFGTGTLEPRIELERKLT 321
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW-ERMNLQ-APIYFAVGLTEKATNYYK 118
+H + +GG+VL+PVFALG AQEL ++L+ YW + +LQ +++A L +K Y+
Sbjct: 322 THIHATIAKGGRVLLPVFALGNAQELLLILDEYWSQNEDLQNVNVFYASNLAKKCMAVYE 381
Query: 119 MFITWTNQKIRKTFV---QRNMFDFKHIRPF-DKSFIDNPGPMVVFATPGMLHSGLSLII 174
+ N KIR + + N FDFK+I+ D S + GP VV ATPGML +G+S +
Sbjct: 382 TYTGIMNDKIRLSSASSEKSNPFDFKYIKSIKDLSKFQDMGPSVVVATPGMLQAGVSRQL 441
Query: 175 FKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENK--QIIDVKMAIEYMSFSAHAD 232
+KWAP N++I+ G+ V+GT+ ++L ++ I ++ IE +SF+AH D
Sbjct: 442 LEKWAPDGKNLVILTGYSVEGTMAKELLKEPTMIQSATNPDMTIPRRIGIEEISFAAHVD 501
Query: 233 AKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKI 265
+ + I+ P V+LVHG++ M LK +
Sbjct: 502 FQQNSEFIEKVSPSKVILVHGDSVPMGRLKSAL 534
>gi|238882385|gb|EEQ46023.1| hypothetical protein CAWG_04366 [Candida albicans WO-1]
Length = 783
Score = 183 bits (465), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 98/277 (35%), Positives = 162/277 (58%), Gaps = 8/277 (2%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
M+ +++G +++TGDY+ +RHL AA + +PD+LI+EST+ T + + ER
Sbjct: 175 MYFIEIGGLKVLFTGDYSREENRHLHAAEVPPLKPDILISESTFGTGTLEPRIELERKLT 234
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW-ERMNLQ-APIYFAVGLTEKATNYYK 118
+H + +GG+VL+PVFALG AQEL ++L+ YW + +LQ +++A L +K Y+
Sbjct: 235 THIHATIAKGGRVLLPVFALGNAQELLLILDEYWSQNEDLQNVNVFYASNLAKKCMAVYE 294
Query: 119 MFITWTNQKIRKTFV---QRNMFDFKHIRPF-DKSFIDNPGPMVVFATPGMLHSGLSLII 174
+ N KIR + + N FDFK+I+ D S + GP VV ATPGML +G+S +
Sbjct: 295 TYTGIMNDKIRLSSASSEKSNPFDFKYIKSIKDLSKFQDMGPSVVVATPGMLQAGVSRQL 354
Query: 175 FKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENK--QIIDVKMAIEYMSFSAHAD 232
+KWAP N++I+ G+ V+GT+ ++L ++ I ++ IE +SF+AH D
Sbjct: 355 LEKWAPDGKNLVILTGYSVEGTMAKELLKEPTMIQSATNPDMTIPRRIGIEEISFAAHVD 414
Query: 233 AKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF 269
+ + I+ P V+LVHG++ M LK + ++
Sbjct: 415 FQQNSEFIEKVSPSKVILVHGDSVPMGRLKSALLSKY 451
>gi|85725010|ref|NP_001033941.1| CG33964, partial [Drosophila melanogaster]
gi|28317177|gb|AAO39598.1| GM20958p [Drosophila melanogaster]
gi|71999126|gb|AAF58558.4| CG33964, partial [Drosophila melanogaster]
Length = 494
Score = 183 bits (464), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 119/368 (32%), Positives = 189/368 (51%), Gaps = 45/368 (12%)
Query: 662 VSQLMSEKKSYEVQVMSQVV-AAVTNSCDSSHIIDLGGGQGYLSTILALQHGKKTLSLDY 720
+ + MS KK +EV + + +V V NS I+D G G+GYLS+ LALQ+G + L +D
Sbjct: 115 IREFMSAKKCHEVDLTAVLVDQLVKNSGQDCFIVDAGDGKGYLSSRLALQYGHRVLGIDA 174
Query: 721 NQVNTHGAAVRSKKLEDSEKESKGPFQSYAGVINKKLWMRTQNRQTSHASVEHHGKNWKR 780
N NT A R++KL+ R N T A ++ G KR
Sbjct: 175 NAANTENALNRNRKLQ-----------------------RAWNGLTERAELQVQGITPKR 211
Query: 781 KSKAPVVSIKDEELVVCKDKCKQITHFVTPDSDISSILSQAY----PQDSLHNVCIMGLH 836
+SK + ++ K F+T + + ++L++ + P+DS N+C+ GLH
Sbjct: 212 RSKKSPARESTKSAPALENY-KTTAKFITTELNFGALLAENFTQFRPEDS-PNICLTGLH 269
Query: 837 TCGDLSGTALRLF-TKSSLQCLVQVGCCYHLLEEEFIRSPFWKDVDQSLYEHGYGFPLSE 895
TCG+L+ T L++F ++ + L VGCCYHLL E + + F+ + + YGFPLS+
Sbjct: 270 TCGNLAATCLQVFHAQTDCRLLCNVGCCYHLLRERYSQQEFFGNKSLMELQTDYGFPLSQ 329
Query: 896 HLRSRKFFLGRNVRMSGTQSPERVIDLKQTQTLPLFYRALLEKYLRSKITINDEEPKV-- 953
+LR R+ +GRN RM QS ER +D K+ + L+YRALLE I + P++
Sbjct: 330 YLRERQVRMGRNARMLAAQSIERTLDTKELPNVTLYYRALLE------ILVCRHAPELKN 383
Query: 954 ---VGRLATKCSNFVEYVHRAVDKLKLDL--EVDDEEVTRLFNSHQREYEYLQIYYFLKT 1008
VG++ K +F EY+ + KL V+ EE+ L + +L ++Y L+
Sbjct: 384 ELQVGKV-RKFESFEEYIQKCATKLDAPWLAAVEKEELQSLLQEYAVHKYFLDLFYLLRM 442
Query: 1009 ALAPVIEA 1016
+ APV+E+
Sbjct: 443 SFAPVLES 450
Score = 43.5 bits (101), Expect = 0.74, Method: Compositional matrix adjust.
Identities = 35/155 (22%), Positives = 74/155 (47%), Gaps = 15/155 (9%)
Query: 485 IKNTIDEMIEFITPNIACVNTHTVHFMVEKLWETTIPKAILSQVNNSPSFDSLLEE-FWK 543
++ +D ++ F+ P+ VN H V+++ ++ W+ + + + S++ +E+ FWK
Sbjct: 9 LQRRLDGLLAFLNPHWDFVNCHMVNYLTDQHWDGFLSETLKSEIAGKEDVALAIEDLFWK 68
Query: 544 SRQDNHVETNSELVKFFQAADKFRLTSLMNGESIITVDELFSKLQARKCGQVVESA-KVS 602
+ + E F + + RL+ ++ + + +V+EL GQ + +
Sbjct: 69 TDESERFPAWRE---FLGKSKQERLS--LHPKLLTSVEELIE-------GQGNSTQLSIR 116
Query: 603 QLMSEKKSYEVQVMSQVV-AAVTNSCDSSHIIDLG 636
+ MS KK +EV + + +V V NS I+D G
Sbjct: 117 EFMSAKKCHEVDLTAVLVDQLVKNSGQDCFIVDAG 151
>gi|312382988|gb|EFR28237.1| hypothetical protein AND_04072 [Anopheles darlingi]
Length = 485
Score = 183 bits (464), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 145/546 (26%), Positives = 246/546 (45%), Gaps = 113/546 (20%)
Query: 482 MELIKNTIDEMIEFITPNIACVNTHTVHFMVEKLWETTIPKAILSQVNNSPSFDSLLEEF 541
ME ++ ID + F+ PN A +N H V ++ E+ W+ +PK I ++ + + F
Sbjct: 1 MERLRQQIDTIARFLEPNAAFINCHMVDYLTEQHWDRYVPKPIQRELLSVEDYLRAKHVF 60
Query: 542 WKSRQDNHVETNSELVKFFQAADKFRLTSLMNGESIITVDELFSKLQARKCGQVVESAKV 601
W +F Q +D DEL V
Sbjct: 61 WG--------------QFDQPSDS-------------VSDEL---------------PAV 78
Query: 602 SQLMSEKKSYEVQVMSQVVAAVTNSCDSSHIIDLGSTITVDELFSKLQARKCGLVVESAK 661
+ E + Y ++ S+V G+ +T+ E + L+ KC K
Sbjct: 79 KAFIEETRKYRLE-GSEVQ---------------GTALTLAEFQNALE--KCRKETR-LK 119
Query: 662 VSQLMSEKKSYEVQVMSQVVAAVTNSC---------DSSHIIDLGGGQGYLSTILALQHG 712
+++LM+ KK +EV++ + VVA++ + D +ID G G+GYLS+ +A++HG
Sbjct: 120 MTELMNVKKCHEVEIAAAVVASLCTAVASGLPDGTQDDILVIDAGDGKGYLSSRIAVEHG 179
Query: 713 KKTLSLDYNQVNTHGAAVRSKKLEDSEKESKGPFQSYAGVINKKLWMRTQNRQTSHASVE 772
K L +D N+ NT A R +L+ +K P K + + H +
Sbjct: 180 IKVLGVDCNEDNTSNAEKRRDRLK-----TKIP----------KAVKKANLEEDEHFTNL 224
Query: 773 HHGKNWKRKSKAPVVSIKDEELVVCKDKCKQITHFVTPDSDISSILSQAYPQDSLHNVCI 832
G K + T + ++D+ ++ +P D+ C+
Sbjct: 225 LQGDTLKSLYRT-------------------TTQLIDFETDLIALAKHHFPADNHRTFCL 265
Query: 833 MGLHTCGDLSGTALRLFTKS-SLQCLVQVGCCYHLLEEEFIRSPFWKDVDQSLYEHGYGF 891
GLHTCG+L LRLF ++ ++ ++ VGCCYHL+ EEF+ F+ S GYGF
Sbjct: 266 CGLHTCGNLGPNCLRLFHQNRTIAGIMNVGCCYHLMREEFVMDDFYNPAKIS-DNPGYGF 324
Query: 892 PLSEHLRSRKFFLGRNVRMSGTQSPERVIDLKQTQTLPLFYRALLEKYLRSKITINDEEP 951
P+S +LR+R+F +GRN R ++S ER ++ + L YRALL+ L + ++
Sbjct: 325 PMSSYLRNRQFAIGRNARNLASESIERACINRENPSDKLGYRALLQVVL---LQYGQKKS 381
Query: 952 KVVGRLATKCSNFVEYVHRAVDKLKLD--LEVDDEEVTRLFNSHQREYEYLQIYYFLKTA 1009
VGRL K +F++YV ++V +L L+ + + D + L E E L+++Y ++
Sbjct: 382 LQVGRL--KSGDFIDYVRKSVRRLGLEDRVTISDNSLLELETRFSTELEQLKVFYLIRQQ 439
Query: 1010 LAPVIE 1015
APV+E
Sbjct: 440 FAPVVE 445
>gi|47213099|emb|CAF89519.1| unnamed protein product [Tetraodon nigroviridis]
Length = 810
Score = 183 bits (464), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 95/235 (40%), Positives = 140/235 (59%), Gaps = 3/235 (1%)
Query: 38 LITESTYATTIRDSKRCRERDFLKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERM 97
L ESTY T I + + RE F VH+ V+R G+ LIPVFALGRAQEL ++L+ YW+
Sbjct: 257 LSQESTYGTHIHEKREEREARFCNTVHDIVNREGRCLIPVFALGRAQELLLILDEYWQNH 316
Query: 98 N--LQAPIYFAVGLTEKATNYYKMFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNP 154
PIY+A L K Y+ +I N KIRK N F FKHI D+
Sbjct: 317 PELHDIPIYYASSLARKCMAVYQTYINAMNDKIRKAININNPFVFKHISNLKSMDHFDDI 376
Query: 155 GPMVVFATPGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQ 214
GP VV A+PGM+ SGLS +F+ W + N +I+ G+CV+GT+ ++S ++ + Q
Sbjct: 377 GPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSEPDEITTMSGQ 436
Query: 215 IIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF 269
+ +KM+++Y+SFSAH D + + I+ +P +V+LVHGE ++M LK + +E+
Sbjct: 437 KLALKMSVDYISFSAHTDYQQTSEFIRALKPPHVILVHGEQNEMARLKAALIREY 491
>gi|403158620|ref|XP_003319317.2| hypothetical protein PGTG_01491 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375166386|gb|EFP74898.2| hypothetical protein PGTG_01491 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 778
Score = 182 bits (463), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 117/374 (31%), Positives = 190/374 (50%), Gaps = 46/374 (12%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAA----WIDKCRPDLLITESTYATTIRDSKRCRE 56
MF +++ ++YTGDY+ DRHL A W +K PD++I ESTY + + +E
Sbjct: 187 MFLIEISGIRVLYTGDYSTEEDRHLIPARVPNWNEK--PDVMICESTYGVQSLEPRFEKE 244
Query: 57 RDFLKKVHECVDRGGKVLIPVFALGRAQELCILLETYWER---MNLQAPIYFAVGLTEKA 113
F V + RGG+VL+PVFALGRAQEL ++L+ YW +N Q PIY+ L K
Sbjct: 245 ERFTTLVQSILKRGGRVLMPVFALGRAQELLLILDEYWANHPELN-QIPIYYISNLAAKC 303
Query: 114 TNYYKMFITWTNQKIRKTFVQR-NMFDF-------------KHIRPFDKSFIDNPGPMVV 159
Y+ FI N +I++ F Q N + F +++ DK D+ GP VV
Sbjct: 304 MKVYQTFIHGMNDQIKRKFNQGINPWTFYREGKGVFKKGYVTNLKAIDK--FDDRGPCVV 361
Query: 160 FATPGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVK 219
A+PG + SG+S + ++WAP N L++ G+ ++GT+ ++L ++ I +
Sbjct: 362 MASPGFMQSGVSRELLERWAPDRRNALLVTGYSIEGTMAREMLKEPNEIIGMKGNKIPRR 421
Query: 220 MAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANG 279
+ + Y+SFSAH D I P +++LVHGE + M LK +K ++ L
Sbjct: 422 LDVHYISFSAHVDYTQNAAFIDQIMPTHLVLVHGELNNMTRLKNALKDKYTLK------- 474
Query: 280 ESCFVQTDMKISIDVSVNLLK--------EEAVKYNSEPPNPLKERQIHGVLVIKDSSIS 331
++D++I +V L+ A+ +E PL + G+LV KD++ +
Sbjct: 475 -----KSDIQIHTPRNVETLRLTFQASRIARAIGSLAEDTAPLAGSSLSGILVSKDAAYT 529
Query: 332 LMDVDEACKEVGIS 345
L+ + + G+S
Sbjct: 530 LLSPADLREYTGLS 543
>gi|156083689|ref|XP_001609328.1| cleavage and polyadenylation specifity factor [Babesia bovis T2Bo]
gi|154796579|gb|EDO05760.1| cleavage and polyadenylation specifity factor [Babesia bovis]
Length = 709
Score = 182 bits (463), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 107/286 (37%), Positives = 166/286 (58%), Gaps = 12/286 (4%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V++ I+YTGDY+ DRH+ A I LLI ESTY I + + RER FL
Sbjct: 199 MFLVEMDGVRILYTGDYSTEVDRHVPCAEIPPINAHLLICESTYGIRIHEERVQRERRFL 258
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWE-RMNLQA-PIYFAVGLTEKATNYYK 118
+ V E V RGGK L+PVFALGRAQE+ ++L+ YW+ NLQ PI++ L +K+ Y+
Sbjct: 259 RSVIEIVTRGGKCLLPVFALGRAQEILLILDEYWQANRNLQPIPIFYISPLAQKSLRVYE 318
Query: 119 MFITWTNQKIRK-TFVQRNMFDFKHIRPFDKS------FIDNPGPMVVFATPGMLHSGLS 171
F+ I++ + N F+F ++ + +S ++ GP +V +PGML G S
Sbjct: 319 TFVGLCGDYIKECVYNGFNPFNFTFVK-YARSVAEISQYLQADGPCIVMTSPGMLQGGPS 377
Query: 172 LIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHA 231
L IF+K AP N +++ G+ V+GT+ ++ + + +K I + +++ +SFSAHA
Sbjct: 378 LQIFEKIAPDSRNGVVLTGYTVKGTLADELRRDPEVVNVGHKTIKRL-CSVDQISFSAHA 436
Query: 232 DAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFN-LDCFMP 276
D + I+ NV+LVHGE ++M +++K+ +E N L FMP
Sbjct: 437 DYHQTKEFIRRLSVPNVILVHGERNEMTRMRDKLSEEINELSVFMP 482
>gi|354543719|emb|CCE40441.1| hypothetical protein CPAR2_104770 [Candida parapsilosis]
Length = 776
Score = 182 bits (462), Expect = 9e-43, Method: Compositional matrix adjust.
Identities = 134/481 (27%), Positives = 238/481 (49%), Gaps = 75/481 (15%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
M+ +++G I++TGDY+ +RHL +A + +PD+LITEST+ T + + E
Sbjct: 180 MYLIEIGGLKILFTGDYSREENRHLPSAEVPPVKPDVLITESTFGTGTLEPRAELETKLT 239
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERM-NLQ-APIYFAVGLTEKATNYYK 118
+H + +GG+VL+PVFALG AQEL ++L+ YWE+ +LQ +Y+ L K Y+
Sbjct: 240 NHIHATLTKGGRVLLPVFALGNAQELLLILDEYWEKNEDLQNVSVYYCSDLARKCMAVYE 299
Query: 119 MFITWTNQKIR----KTFVQRNMFDFKHIRPF-DKSFIDNPGPMVVFATPGMLHSGLSLI 173
+ N KIR + + FDFK+I+ + S + GP VV ATPGML +G+S
Sbjct: 300 TYTGIMNDKIRLSSSSDDSKSSPFDFKYIKSIRNLSKFSDLGPSVVVATPGMLQAGVSRQ 359
Query: 174 IFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDV--KMAIEYMSFSAHA 231
+ +KWAP + N++I+ G+ V+GT+ +L + ++ N + + ++ IE +SF+AH
Sbjct: 360 LLEKWAPEQKNLVILTGYSVEGTMAKDLLKEPQVIQSLNNPDLSIPRRIGIEEISFAAHV 419
Query: 232 DAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFN--------LDCFMPANGESCF 283
D + + I P ++LVHG++ M LK + +++ + F P N E
Sbjct: 420 DFQQNSEFIDKVSPSKIILVHGDSVPMGRLKSALLSKYSSRKGTDKEVKVFNPRNCEELT 479
Query: 284 VQ-TDMKIS------------------------IDVSVNLLKEEAVKYNSEPPNPLKERQ 318
+ +KI+ +D N +K E V + + K+RQ
Sbjct: 480 IPFKGLKIAKVLGSLAEEQLQTLKHEIEEKLTEVDNDGNDVKMEEV----DDEDSEKKRQ 535
Query: 319 IH-------------GVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDSSPINK 365
++ GVLV KD + L+ + + + +S IV+ N++I+ + P+
Sbjct: 536 VNGTLSTFKPGQVVSGVLVAKDFELDLVQLQDLNEFTQLSTSIVKSKMNLKINANLPL-- 593
Query: 366 TLELIY--DQLLSY---LQDKSQEYKIQLTESSLSIDSVLLSVERIDDKNKRVFVTWPNQ 420
+I+ +Q+ Y + D +Q ++ + +D V + ++R + V W N
Sbjct: 594 ---MIWHLEQMFGYINVINDDAQSWE------CVVMDVVDILIDRSKGPGLYISVEWIND 644
Query: 421 D 421
+
Sbjct: 645 N 645
>gi|241951638|ref|XP_002418541.1| cleavage and polyadenylation factor specificity complex subunit,
putative; endonuclease, putative [Candida dubliniensis
CD36]
gi|223641880|emb|CAX43843.1| cleavage and polyadenylation factor specificity complex subunit,
putative [Candida dubliniensis CD36]
Length = 787
Score = 182 bits (462), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 98/277 (35%), Positives = 161/277 (58%), Gaps = 8/277 (2%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
M+ V++G +++TGDY+ +RHL AA + +PD+LI EST+ T + + ER
Sbjct: 176 MYFVEIGGLKVLFTGDYSREENRHLHAAEVPPLKPDILICESTFGTGTLEPRLELERKLT 235
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW-ERMNLQ-APIYFAVGLTEKATNYYK 118
+H + +GG+VL+PVFALG AQEL ++L+ YW + +LQ +++A L +K Y+
Sbjct: 236 THIHATIAKGGRVLLPVFALGNAQELLLILDEYWSQNEDLQNVNVFYASNLAKKCMAVYE 295
Query: 119 MFITWTNQKIRKTFV---QRNMFDFKHIRPF-DKSFIDNPGPMVVFATPGMLHSGLSLII 174
+ N KIR + + N FDFK I+ D S + GP VV ATPGML +G+S +
Sbjct: 296 TYTGIMNDKIRLSSASSKKSNPFDFKFIKSIKDLSKFQDMGPSVVVATPGMLQAGVSRQL 355
Query: 175 FKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDV--KMAIEYMSFSAHAD 232
+KWAP N++I+ G+ V+GT+ ++L ++ + + ++ IE +SF+AH D
Sbjct: 356 LEKWAPDGKNLVILTGYSVEGTMAKELLKEPTMIQSATNPDLTIPRRIGIEEISFAAHVD 415
Query: 233 AKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF 269
+ + I+ P V+LVHG++ M LK + ++
Sbjct: 416 FQQNSEFIEKVSPSKVILVHGDSVPMGRLKSALLSKY 452
>gi|195122318|ref|XP_002005659.1| GI20589 [Drosophila mojavensis]
gi|193910727|gb|EDW09594.1| GI20589 [Drosophila mojavensis]
Length = 495
Score = 182 bits (462), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 115/370 (31%), Positives = 190/370 (51%), Gaps = 48/370 (12%)
Query: 662 VSQLMSEKKSYEVQVMSQVVAAVTNSC---DSSHIIDLGGGQGYLSTILALQHGKKTLSL 718
+ + MS KK +EV++ + +V + S + +I+D G G+GYLS+ L+L++G L +
Sbjct: 118 IKEFMSAKKCHEVELTAALVNKLVKSIIDESNCYIVDAGDGKGYLSSRLSLEYGLSVLGI 177
Query: 719 DYNQVNTHGAAVRSKKLEDSEKESKGPFQSYAGVINKKLWMRTQNRQTSHASVEHHGKNW 778
D N+ NT A R KL+ R N T A + + GK
Sbjct: 178 DANESNTKNALSRDSKLQ-----------------------RAWNGLTERAEILNQGKTP 214
Query: 779 KRKSKAPVVSIKDEELVVCKDKCKQITHFVTPDSDISSILSQAYPQDSLH----NVCIMG 834
R+ K K+ + +K K + ++T + ++S ++ + + +D L +C+ G
Sbjct: 215 PRRGKKS--PTKETHINTPTNKYKTTSIYITTELNLSDLIKEHF-KDELQVVDPRICLTG 271
Query: 835 LHTCGDLSGTALRLF-TKSSLQCLVQVGCCYHLLEEEFIRSPFWKDVDQSLYEHGYGFPL 893
LHTCG+L+ T L++F +++ + L VGCCYHLL E + F+ + YGFP+
Sbjct: 272 LHTCGNLAATCLQVFHAQNNCRVLCNVGCCYHLLNERYSEQEFFGNKALMKQSTDYGFPM 331
Query: 894 SEHLRSRKFFLGRNVRMSGTQSPERVIDLKQTQTLPLFYRALLEKYLRSKITINDEEPKV 953
S +L+ RK LGRN RM QS ER + K+ + L+YRALLE + + P +
Sbjct: 332 SRYLQERKVKLGRNARMLAAQSIERTLSAKELPNISLYYRALLE------VLVCSHAPDL 385
Query: 954 -----VGRLATKCSNFVEYVHRAVDKLKLD--LEVDDEEVTRLFNSHQREYEYLQIYYFL 1006
VG++ K +NF+EYV + KL EV ++E+ L H+ + YL ++Y L
Sbjct: 386 KNSLQVGKV-QKFNNFIEYVQKCSKKLNAPWLAEVKEDELEALLQQHESDRHYLSLFYLL 444
Query: 1007 KTALAPVIEA 1016
+ + APV+E+
Sbjct: 445 RMSFAPVVES 454
>gi|403419016|emb|CCM05716.1| predicted protein [Fibroporia radiculosa]
Length = 828
Score = 181 bits (460), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 114/362 (31%), Positives = 182/362 (50%), Gaps = 23/362 (6%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF + + I+YTGDY+ D HL A + RPD+LI ESTY + + +E+ F
Sbjct: 162 MFLIDIAGLKILYTGDYSREEDCHLVKAEVPPFRPDVLIIESTYGVQTLECREDKEQRFT 221
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH + RGG VL+P FALGRAQEL ++L+ YW++ PIY+A L K Y+
Sbjct: 222 NLVHSIIRRGGHVLLPTFALGRAQELLLILDEYWKKHPDLHNVPIYYASSLARKCMAVYQ 281
Query: 119 MFITWTNQKIRKTFVQR-NMFDFKHI------RPFDKSFIDNPGPMVVFATPGMLHSGLS 171
+I N +R F +R N F FKHI R +++ D P P VV A+PG + G S
Sbjct: 282 TYIHTMNANVRSRFAKRDNPFVFKHISNLPHTRGWERKVADGP-PCVVLASPGFVTVGAS 340
Query: 172 LIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHA 231
+ + WAP N +I+ G+ ++GT+ + S ++ I K++++ +SFSAH
Sbjct: 341 RELLEMWAPDSRNGIIITGYSIEGTMARDIQSEPDEIMSLKGVPIPRKLSVDEISFSAHV 400
Query: 232 DAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKIS 291
D + I+ + ++++LVHGE + M L+ + + + E + T +
Sbjct: 401 DYSQNSEFIEMVKAQHIVLVHGEQTAMGRLRAAMTDRYK------SRDEDVKIHTPRNLE 454
Query: 292 IDVSVNLLKEEAVK----YNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRH 347
+ + E K ++PP + G+LV KD S +L+D + G+S
Sbjct: 455 T-LELTFRGERVAKAIGTLAAQPPQ--SGDLLSGLLVAKDYSYTLLDPRDLRDFAGLSTC 511
Query: 348 IV 349
+V
Sbjct: 512 VV 513
>gi|149245028|ref|XP_001527048.1| hypothetical protein LELG_01877 [Lodderomyces elongisporus NRRL
YB-4239]
gi|146449442|gb|EDK43698.1| hypothetical protein LELG_01877 [Lodderomyces elongisporus NRRL
YB-4239]
Length = 812
Score = 181 bits (459), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 103/310 (33%), Positives = 166/310 (53%), Gaps = 25/310 (8%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
M+ +++G +++TGDY+ +RHL AA + RPD+LITEST+ T +SK E+
Sbjct: 185 MYFIEIGGLKVLFTGDYSREENRHLQAAEVPPTRPDILITESTFGTGTLESKAELEKKLT 244
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
+H + RGG+VL+PVFALG AQE+ ++LE YWE+ +Y+ L K Y+
Sbjct: 245 SHIHATITRGGRVLLPVFALGNAQEILLILEEYWEKNEDLHNVNVYYCSDLARKCMAVYE 304
Query: 119 MFITWTNQKIR------------KTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGM 165
+ N KIR + N FDFK+I+ S + GP VV ATPGM
Sbjct: 305 TYTGIMNDKIRLSSSSSSSTSSSNNSTKSNPFDFKYIKSIKNLSKFSDLGPSVVVATPGM 364
Query: 166 LHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKL-EFENKQI-IDVKMAIE 223
L +G+S + +KWAP + N++I+ G+ V+GT+ ++ + + F N + I ++ +E
Sbjct: 365 LQAGVSRQLLEKWAPEQKNLVILTGYSVEGTMAKDIMKEPQVIPSFNNPDLSIPRRIGVE 424
Query: 224 YMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFN--------LDCFM 275
+SF+AH D + I P ++LVHG++ M LK + +++ + F
Sbjct: 425 EISFAAHVDFQQNSDFIDKVSPSKIILVHGDSIPMGRLKSALLSKYSSRKGTDKEVKVFN 484
Query: 276 PANGESCFVQ 285
P N E ++
Sbjct: 485 PRNCEELCIE 494
>gi|150865856|ref|XP_001385241.2| hypothetical protein PICST_89936 [Scheffersomyces stipitis CBS
6054]
gi|149387112|gb|ABN67212.2| predicted protein [Scheffersomyces stipitis CBS 6054]
Length = 793
Score = 181 bits (459), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 101/296 (34%), Positives = 170/296 (57%), Gaps = 16/296 (5%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
M+ V++G +++TGDY+ +RHL AA + RPD+LITEST+ T + K E+ +
Sbjct: 176 MYFVEIGGLKVLFTGDYSREENRHLHAAEVPPTRPDILITESTFGTGTLEPKADLEKRLV 235
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERM-NLQA-PIYFAVGLTEKATNYYK 118
+ +H + +GG+VL+PVF+LG AQEL ++L+ YWE+ +LQ ++FA L K Y+
Sbjct: 236 QNIHATLTKGGRVLMPVFSLGNAQELLLILDEYWEKNEDLQNISVFFASKLARKCMAVYQ 295
Query: 119 MFITWTNQKIR---KTFVQRNMFDFKHIRPF-DKSFIDNPGPMVVFATPGMLHSGLSLII 174
+ + N IR + + + FDFK+I+ D + GP VV A+PGML +G+S +
Sbjct: 296 TYTSIMNDNIRLSSRIGQKSSPFDFKYIKSIKDLGKFSDMGPSVVVASPGMLQAGVSRQL 355
Query: 175 FKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFE-NKQI-IDVKMAIEYMSFSAHAD 232
+KWAP N+++M G+ V+GT+ +L+ ++ N I I ++ + +SF+AH D
Sbjct: 356 LEKWAPDPKNLVVMTGYSVEGTMAKDLLNEPHTIKSAVNPDITIPRRINVSEISFAAHVD 415
Query: 233 AKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFN--------LDCFMPANGE 280
+ + I+ P ++LVHG++ M LK + +++ + F P N E
Sbjct: 416 FQQNSEFIEKVAPSKIILVHGDSVPMGRLKSALLSKYSARKGTEQEVKVFNPKNCE 471
>gi|226295077|gb|EEH50497.1| endoribonuclease ysh1 [Paracoccidioides brasiliensis Pb18]
Length = 888
Score = 181 bits (459), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 121/406 (29%), Positives = 208/406 (51%), Gaps = 54/406 (13%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDF 59
MF + + +I++TGDY+ DRHL +A + K + D+LITEST+ + + RE
Sbjct: 181 MFLISIAGLNILFTGDYSREEDRHLISAEVPKGIKIDVLITESTFGISSNPPRLEREAAL 240
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERM-NLQA-PIYFAVGLTEKATNYY 117
+K + ++RGG+VL+PVFALGRAQEL ++L+ YW R LQ PIY+ + K Y
Sbjct: 241 MKSITTILNRGGRVLMPVFALGRAQELLLILDEYWSRHPELQKIPIYYIGNMARKCIIVY 300
Query: 118 KMFITWTNQKIRKTFVQR--------------NMFDFKHIRPFDK-SFIDNPGPMVVFAT 162
+ +I N+ I++ F +R ++F+++R D+ G V+ A+
Sbjct: 301 QTYIGAMNENIKRVFRERMAEADAAGANSATAGPWNFRYVRSVKNIERFDDVGGCVMLAS 360
Query: 163 PGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKL-------------- 208
PGML +G S + ++WAP E N +IM G+ V+GT+G ++L+ +++
Sbjct: 361 PGMLQTGTSRELLERWAPNERNGIIMTGYSVEGTMGKQILNEPEQIPAVMSGRNAVGPAR 420
Query: 209 -----EFENKQIIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKE 263
+ E K +I + ++ +SF+AH D + I+ V+LVHGE +M+ LK
Sbjct: 421 RMTGGDEEQKVMIPRRCTVDEISFAAHVDGVENREFIEEVGAPVVILVHGEKHQMMRLKS 480
Query: 264 KI-----KQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLKEEAVKYNSEPPNPLKE-- 316
K+ ++ + F PAN E ++ I V+ + + + PP P E
Sbjct: 481 KLLSLNADKKVKVKVFTPANCE--------EVRIPFQVDKMAKVVGRLAERPP-PADESE 531
Query: 317 -RQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDSS 361
R ++GVLV +SLM ++ + G++ + ++ +S +S
Sbjct: 532 SRLMNGVLVQNGFKLSLMAPEDLREYAGLATTTITCKQHITLSTAS 577
>gi|225677757|gb|EEH16041.1| endoribonuclease ysh1 [Paracoccidioides brasiliensis Pb03]
Length = 888
Score = 181 bits (458), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 121/406 (29%), Positives = 208/406 (51%), Gaps = 54/406 (13%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDF 59
MF + + +I++TGDY+ DRHL +A + K + D+LITEST+ + + RE
Sbjct: 181 MFLISIAGLNILFTGDYSREEDRHLISAEVPKGIKIDVLITESTFGISSNPPRLEREAAL 240
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERM-NLQA-PIYFAVGLTEKATNYY 117
+K + ++RGG+VL+PVFALGRAQEL ++L+ YW R LQ PIY+ + K Y
Sbjct: 241 MKSITTILNRGGRVLMPVFALGRAQELLLILDEYWSRHPELQKIPIYYIGNMARKCIIVY 300
Query: 118 KMFITWTNQKIRKTFVQR--------------NMFDFKHIRPFDK-SFIDNPGPMVVFAT 162
+ +I N+ I++ F +R ++F+++R D+ G V+ A+
Sbjct: 301 QTYIGAMNENIKRVFRERMAEADAAGANSATAGPWNFRYVRSVKNIERFDDVGGCVMLAS 360
Query: 163 PGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKL-------------- 208
PGML +G S + ++WAP E N +IM G+ V+GT+G ++L+ +++
Sbjct: 361 PGMLQTGTSRELLERWAPNERNGIIMTGYSVEGTMGKQILNEPEQIPAVMSGRNAVGPAR 420
Query: 209 -----EFENKQIIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKE 263
+ E K +I + ++ +SF+AH D + I+ V+LVHGE +M+ LK
Sbjct: 421 RMTGGDEEQKVMIPRRCTVDEISFAAHVDGVENREFIEEVGAPVVILVHGEKHQMMRLKS 480
Query: 264 KI-----KQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLKEEAVKYNSEPPNPLKE-- 316
K+ ++ + F PAN E ++ I V+ + + + PP P E
Sbjct: 481 KLLSLNADKKVKVKVFTPANCE--------EVRIPFQVDKMAKVVGRLAERPP-PADESE 531
Query: 317 -RQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDSS 361
R ++GVLV +SLM ++ + G++ + ++ +S +S
Sbjct: 532 SRLMNGVLVQNGFKLSLMAPEDLREYAGLATTTITCKQHITLSTAS 577
>gi|195485485|ref|XP_002091112.1| GE12415 [Drosophila yakuba]
gi|194177213|gb|EDW90824.1| GE12415 [Drosophila yakuba]
Length = 492
Score = 181 bits (458), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 117/368 (31%), Positives = 189/368 (51%), Gaps = 45/368 (12%)
Query: 662 VSQLMSEKKSYEVQVMSQVVAAVTNS-CDSSHIIDLGGGQGYLSTILALQHGKKTLSLDY 720
+ + MS KK +EV++ + +V + S I+D G G+GYLS+ LALQ+G + L +D
Sbjct: 115 IREFMSAKKCHEVELAAALVDHLVKSFGQGGFIVDAGDGKGYLSSRLALQYGHRVLGIDA 174
Query: 721 NQVNTHGAAVRSKKLEDSEKESKGPFQSYAGVINKKLWMRTQNRQTSHASVEHHGKNWKR 780
N NT A R++KL+ R N T A ++ G KR
Sbjct: 175 NAENTQNALSRNRKLQ-----------------------RAWNGLTERAELQIQGITPKR 211
Query: 781 KSKAPVVSIKDEELVVCKDKCKQITHFVTPDSDISSILSQAY----PQDSLHNVCIMGLH 836
+ K + ++ K F+T + + ++L++ + P+DS N C+ GLH
Sbjct: 212 RGKKSPARDSTKSAPALENY-KTTAKFITTELNFGALLAEHFTQLGPEDS-PNFCLTGLH 269
Query: 837 TCGDLSGTALRLF-TKSSLQCLVQVGCCYHLLEEEFIRSPFWKDVDQSLYEHGYGFPLSE 895
TCG+L+ T L++F ++ + L VGCCYHLL E + + F+ + + YGFPLS+
Sbjct: 270 TCGNLAATCLQVFHAQADCRLLCNVGCCYHLLRERYSQQEFFGNKALMDLQTDYGFPLSQ 329
Query: 896 HLRSRKFFLGRNVRMSGTQSPERVIDLKQTQTLPLFYRALLEKYLRSKITINDEEPKV-- 953
+L+ RK +GRN RM QS ER +D K+ + L+YRALLE I + P++
Sbjct: 330 YLQERKVRMGRNARMLAAQSIERTVDAKELPNVTLYYRALLE------ILVCRHAPQLKN 383
Query: 954 ---VGRLATKCSNFVEYVHRAVDKLKLDL--EVDDEEVTRLFNSHQREYEYLQIYYFLKT 1008
VG++ K +F EY+ + KL V++EE+ L + + YL ++Y L+
Sbjct: 384 ELQVGKV-RKFDSFQEYIQKCSTKLDAPWLAAVEEEELQSLLQEYAVDKHYLDLFYLLRM 442
Query: 1009 ALAPVIEA 1016
+ APV+E+
Sbjct: 443 SFAPVLES 450
Score = 45.4 bits (106), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 38/157 (24%), Positives = 80/157 (50%), Gaps = 19/157 (12%)
Query: 485 IKNTIDEMIEFITPNIACVNTHTVHFMVEKLWETTIPKAILSQVNNSPSFDSLLEE-FWK 543
++ +D ++ F+ P+ VN H V+++ ++ W+ + + + S+++ S ++E+ FWK
Sbjct: 9 LQRRLDTLLAFLNPHWDFVNCHMVNYLTDQHWDCFLSETLKSEISGSEDVALVIEDLFWK 68
Query: 544 SRQDNHVETNSELVKFFQAADKFRLTSLMNGESIITVDELFSKLQARKCGQVVESA---K 600
N E V+ F A +F S N E + EL ++++ + +V E++
Sbjct: 69 ---------NDESVR-FTAWREFLGKS--NQERLALHPELLTRVE--ELIEVQENSTQLS 114
Query: 601 VSQLMSEKKSYEVQVMSQVVAAVTNS-CDSSHIIDLG 636
+ + MS KK +EV++ + +V + S I+D G
Sbjct: 115 IREFMSAKKCHEVELAAALVDHLVKSFGQGGFIVDAG 151
>gi|295657429|ref|XP_002789283.1| endoribonuclease ysh1 [Paracoccidioides sp. 'lutzii' Pb01]
gi|226283953|gb|EEH39519.1| endoribonuclease ysh1 [Paracoccidioides sp. 'lutzii' Pb01]
Length = 892
Score = 181 bits (458), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 121/406 (29%), Positives = 208/406 (51%), Gaps = 54/406 (13%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDF 59
MF + + +I++TGDY+ DRHL +A + K + D+LITEST+ + + RE
Sbjct: 185 MFVISIAGLNILFTGDYSREEDRHLISAEVPKGIKIDVLITESTFGISSNPPRLEREAAL 244
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERM-NLQA-PIYFAVGLTEKATNYY 117
+K + ++RGG+VL+PVFALGRAQEL ++L+ YW R LQ PIY+ + K Y
Sbjct: 245 MKSITTILNRGGRVLMPVFALGRAQELLLILDEYWSRHPELQKIPIYYIGNMARKCIIVY 304
Query: 118 KMFITWTNQKIRKTFVQR--------------NMFDFKHIRPFDK-SFIDNPGPMVVFAT 162
+ +I N+ I++ F +R ++F+++R D+ G V+ A+
Sbjct: 305 QTYIGAMNENIKRVFRERMAEADAAGANSATAGPWNFRYVRSVKNIERFDDVGGCVMLAS 364
Query: 163 PGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKL-------------- 208
PGML +G S + ++WAP E N +IM G+ V+GT+G ++L+ +++
Sbjct: 365 PGMLQTGTSRELLERWAPNERNGVIMTGYSVEGTMGKQILNEPEQIPAVMSGKNAVGPAR 424
Query: 209 -----EFENKQIIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKE 263
+ E K +I + ++ +SF+AH D + I+ V+LVHGE +M+ LK
Sbjct: 425 RMTGGDEEQKVMIPRRCTVDEISFAAHVDGVENREFIEEVGAPVVILVHGEKHQMMRLKS 484
Query: 264 KI-----KQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLKEEAVKYNSEPPNPLKE-- 316
K+ ++ + F PAN E ++ I V+ + + + PP P E
Sbjct: 485 KLLSLNADKKIKVKVFTPANCE--------EVRIPFQVDKVAKVVGRLAERPP-PADESE 535
Query: 317 -RQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDSS 361
R ++GVLV +SLM ++ + G++ + ++ +S +S
Sbjct: 536 SRLMNGVLVQNGFKLSLMAPEDLREYAGLATTTITCKQHITLSTAS 581
>gi|255724858|ref|XP_002547358.1| hypothetical protein CTRG_01665 [Candida tropicalis MYA-3404]
gi|240135249|gb|EER34803.1| hypothetical protein CTRG_01665 [Candida tropicalis MYA-3404]
Length = 783
Score = 180 bits (457), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 96/278 (34%), Positives = 161/278 (57%), Gaps = 8/278 (2%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
M+ +++G +++TGDY+ +RHL AA + +PD+LI+EST+ T + + ER
Sbjct: 175 MYFIEIGGLKVLFTGDYSREENRHLHAAEVPPLKPDILISESTFGTGTLEPRVELERKLT 234
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW-ERMNLQ-APIYFAVGLTEKATNYYK 118
+H V +GG+VL+PVFALG AQEL ++L+ YW + +LQ +++A L +K Y+
Sbjct: 235 THIHATVTKGGRVLLPVFALGNAQELLLILDEYWSQNEDLQNVNVFYASNLAKKCMAVYE 294
Query: 119 MFITWTNQKIR---KTFVQRNMFDFKHIRPF-DKSFIDNPGPMVVFATPGMLHSGLSLII 174
+ N KIR + + N FD K I+ D S + GP VV ATPGML +G+S +
Sbjct: 295 TYTGIMNDKIRLSSSSGEKSNPFDLKFIKSIKDLSKFQDMGPSVVVATPGMLQAGVSRQL 354
Query: 175 FKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDV--KMAIEYMSFSAHAD 232
+KWAP N++I+ G+ V+GT+ ++L ++ + + ++ IE +SF+AH D
Sbjct: 355 LEKWAPDNKNLVILTGYSVEGTMAKELLKEPTMIQSATNPDLTIPRRIGIEEISFAAHVD 414
Query: 233 AKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFN 270
+ + I P ++LVHG++ M LK + +++
Sbjct: 415 FQQNSEFIDKVSPSKIILVHGDSVPMGRLKSALLSKYS 452
>gi|395828536|ref|XP_003787428.1| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation
specificity factor subunit 3 [Otolemur garnettii]
Length = 634
Score = 180 bits (456), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 89/207 (42%), Positives = 125/207 (60%), Gaps = 3/207 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGD++ DRHL AA I +PD+LI ESTY T I + + RE F
Sbjct: 163 MFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPNIKPDILIIESTYGTHIHEKREEREARFC 222
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH+ V+RGG+ LIPVFALGRAQEL ++L+ YW+ PIY+A L +K Y+
Sbjct: 223 NTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQ 282
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ N KIRK N F FKHI D+ GP VV A+PGM+ SGLS +F+
Sbjct: 283 TYVNAMNDKIRKQININNPFVFKHISNLKSMDHFDDIGPSVVMASPGMMQSGLSRELFES 342
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSG 204
W + N +I+ G+CV+GT+ ++ G
Sbjct: 343 WCTDKRNGVIIAGYCVEGTLAKILVHG 369
>gi|294658126|ref|XP_460457.2| DEHA2F02134p [Debaryomyces hansenii CBS767]
gi|218511903|sp|Q6BMW3.2|YSH1_DEBHA RecName: Full=Endoribonuclease YSH1; AltName: Full=mRNA
3'-end-processing protein YSH1
gi|202952895|emb|CAG88764.2| DEHA2F02134p [Debaryomyces hansenii CBS767]
Length = 815
Score = 179 bits (455), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 99/282 (35%), Positives = 159/282 (56%), Gaps = 15/282 (5%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
M+ +++G +++TGDY+ DRHL A + +PD+LITEST+ T + + +E
Sbjct: 182 MYFIEIGGLKVLFTGDYSSEEDRHLQVAEVPPIKPDILITESTFGTATHEPRLEKETRMT 241
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWE-RMNLQA-PIYFAVGLTEKATNYYK 118
+H + +GG++L+PVFALGRAQEL ++LE YW +LQ IY+A L K Y+
Sbjct: 242 NIIHSTLLKGGRILMPVFALGRAQELLLILEEYWSLNDDLQNINIYYASSLARKCMAVYQ 301
Query: 119 MFITWTNQKIRKTFV------QRNMFDFKHIRP---FDKSFIDNPGPMVVFATPGMLHSG 169
+ N IR T ++N F FK I+ DK + GP VV A+PGML +G
Sbjct: 302 TYTNIMNDSIRLTTSATNSSKKQNPFQFKFIKSIKNLDK--FQDFGPCVVVASPGMLQNG 359
Query: 170 LSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLE--FENKQIIDVKMAIEYMSF 227
+S + ++WAP N +IM G+ V+GT+ +L+ ++ + I +++IE +SF
Sbjct: 360 VSRELLERWAPDPKNAVIMTGYSVEGTMAKDLLTEPHTIQSAMNSDMTIPRRLSIEEISF 419
Query: 228 SAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF 269
+AH D + I+ P ++LVHGE++ M LK + ++
Sbjct: 420 AAHVDFQQNASFIEKVNPSKIILVHGESNPMGRLKSALLSKY 461
>gi|448118544|ref|XP_004203525.1| Piso0_001136 [Millerozyma farinosa CBS 7064]
gi|448120951|ref|XP_004204108.1| Piso0_001136 [Millerozyma farinosa CBS 7064]
gi|359384393|emb|CCE79097.1| Piso0_001136 [Millerozyma farinosa CBS 7064]
gi|359384976|emb|CCE78511.1| Piso0_001136 [Millerozyma farinosa CBS 7064]
Length = 809
Score = 179 bits (455), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 97/284 (34%), Positives = 160/284 (56%), Gaps = 19/284 (6%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
M+ +++G +++TGD++ DRHL A I +PD+LI+EST+ T + + +E
Sbjct: 182 MYLIEIGGLKVLFTGDFSCEEDRHLQVAEIPPVKPDILISESTFGTATHEPRLEKEARMT 241
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW----ERMNLQAPIYFAVGLTEKATNY 116
+H + +GG++L+PVFALGRAQEL ++LE YW + N+ IYFA L K
Sbjct: 242 SIIHSTLLKGGRILMPVFALGRAQELLLILEEYWGLNDDLHNIN--IYFASSLARKCMAV 299
Query: 117 YKMFITWTNQKIR------KTFVQRNMFDFKH---IRPFDKSFIDNPGPMVVFATPGMLH 167
Y+ + N IR + +RN F FK+ IR DK + GP VV A+PGML
Sbjct: 300 YQTYTNIMNDSIRLSTSSTNSGEKRNPFQFKYIKNIRSLDK--FQDFGPCVVVASPGMLQ 357
Query: 168 SGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDV--KMAIEYM 225
+G+S + ++WAP N +IM G+ V+GT+ ++L+ ++ + + ++ +E +
Sbjct: 358 NGVSRELLERWAPDPRNAVIMTGYSVEGTMAKELLTEPPTIQSATNADVTIPRRINVEEI 417
Query: 226 SFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF 269
SF+AH D + I+ P ++LVHGE++ M LK + ++
Sbjct: 418 SFAAHVDFQQNSNFIEQINPSKIILVHGESNPMGRLKSALLSKY 461
>gi|194883762|ref|XP_001975969.1| GG20255 [Drosophila erecta]
gi|190659156|gb|EDV56369.1| GG20255 [Drosophila erecta]
Length = 492
Score = 179 bits (455), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 116/364 (31%), Positives = 190/364 (52%), Gaps = 37/364 (10%)
Query: 662 VSQLMSEKKSYEVQVMSQVV-AAVTNSCDSSHIIDLGGGQGYLSTILALQHGKKTLSLDY 720
+ + MS KK +EV++ + +V V N I+D G G+GYLS+ LALQ+G + L +D
Sbjct: 115 IREFMSAKKCHEVELAAALVDHLVKNYGQGGFIVDAGDGKGYLSSRLALQYGHRVLGIDA 174
Query: 721 NQVNTHGAAVRSKKLEDSEKESKGPFQSYAGVINKKLWMRTQNRQTSHASVEHHGKNWKR 780
N NT A R+ KL+ R N T A ++ G KR
Sbjct: 175 NAENTQNALSRNLKLQ-----------------------RAWNGLTERAELQIQGITPKR 211
Query: 781 KSKAPVVSIKDEELVVCKDKCKQITHFVTPDSDISSILSQAY----PQDSLHNVCIMGLH 836
+ K + ++ K F+T + + ++L++ + P++S N+C+ GLH
Sbjct: 212 RGKKSPARDSTKSAPALENY-KTTAKFITTELNFGALLAEHFTQLGPKES-PNICLTGLH 269
Query: 837 TCGDLSGTALRLF-TKSSLQCLVQVGCCYHLLEEEFIRSPFWKDVDQSLYEHGYGFPLSE 895
TCG+L+ T L++F ++ + L VGCCYHLL E + + F+ + + YGFPLS+
Sbjct: 270 TCGNLAATCLQVFHAQTDCRLLCNVGCCYHLLRERYSQQEFFGNKALMDLQTDYGFPLSQ 329
Query: 896 HLRSRKFFLGRNVRMSGTQSPERVIDLKQTQTLPLFYRALLEKYL-RSKITINDEEPKVV 954
+L+ R+ +GRN RM QS ER +D K+ + L+YRALLE + R + +E V
Sbjct: 330 YLQERQVRMGRNARMLAAQSIERTVDAKELPNVTLYYRALLETLVYRHAPQLKNE--LQV 387
Query: 955 GRLATKCSNFVEYVHRAVDKLKLDL--EVDDEEVTRLFNSHQREYEYLQIYYFLKTALAP 1012
G++ K +F EY+ + +KL V++EE+ L + + YL ++Y L+ + AP
Sbjct: 388 GKV-RKFESFQEYIQKCANKLDAPWLAAVEEEELQSLLQEYAVDKHYLDLFYLLRMSFAP 446
Query: 1013 VIEA 1016
V+E+
Sbjct: 447 VLES 450
Score = 42.0 bits (97), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 38/158 (24%), Positives = 73/158 (46%), Gaps = 21/158 (13%)
Query: 485 IKNTIDEMIEFITPNIACVNTHTVHFMVEKLWETTIPKAILSQVNNSP----SFDSLLEE 540
++ +D ++ F+ P+ VN H V+++ E+ W + + + S+++ S + D+L
Sbjct: 9 LQRRLDGLLAFLNPHWDFVNCHMVNYLTEEHWNAFLSETLKSEISGSEDVTLAIDNL--- 65
Query: 541 FWKSRQDNHVETNSELVKFFQAADKFRLTSLMNGESIITVDELFSKLQARKCGQVVESA- 599
FWK+ + E F K +L + E + +V+EL GQ +
Sbjct: 66 FWKTDESVRFPAWRE----FLGKSKQECLAL-HPELLTSVEELIE-------GQENSTQL 113
Query: 600 KVSQLMSEKKSYEVQVMSQVV-AAVTNSCDSSHIIDLG 636
+ + MS KK +EV++ + +V V N I+D G
Sbjct: 114 SIREFMSAKKCHEVELAAALVDHLVKNYGQGGFIVDAG 151
>gi|401404496|ref|XP_003881737.1| hypothetical protein NCLIV_014990 [Neospora caninum Liverpool]
gi|325116150|emb|CBZ51704.1| hypothetical protein NCLIV_014990 [Neospora caninum Liverpool]
Length = 1033
Score = 179 bits (454), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 103/286 (36%), Positives = 158/286 (55%), Gaps = 11/286 (3%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++G ++YTGD++ DRH+ A + LLI ESTY + D ++ RER FL
Sbjct: 281 MFLIEIGGVRMLYTGDFSREKDRHVPIAEVPPVDVQLLICESTYGIHVHDDRQLRERRFL 340
Query: 61 KKVHECV-DRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYY 117
K V + V +RGGK L+PVFALGRAQEL ++LE YW PI F L+ K +
Sbjct: 341 KAVVDIVLNRGGKCLLPVFALGRAQELLLILEEYWTAHPEVCHVPILFLSPLSSKCMVVF 400
Query: 118 KMFITWTNQKIR-KTFVQRNMFDFKHIRPFD-----KSFIDNPGPMVVFATPGMLHSGLS 171
F+ +R + N F F+ ++ + +I + GP V+ A PGML SG S
Sbjct: 401 DAFVDMCGDAVRNRALRGENPFAFRFVKNLKSVESARVYIHHDGPAVIMAAPGMLQSGAS 460
Query: 172 LIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHA 231
IF+ AP N +I+ G+ V+GT+ ++ + ++ ++ ++ + + E +SFSAH+
Sbjct: 461 REIFEALAPESKNGVILTGYSVKGTLADELKREPETIQLPDR-VLRRRCSFEMISFSAHS 519
Query: 232 DAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDC-FMP 276
D + I + NV+LVHGE +M LKEK+++E C F P
Sbjct: 520 DYQQTQDFIGKLKVPNVVLVHGERGEMRRLKEKLEEERPALCVFTP 565
>gi|399216074|emb|CCF72762.1| unnamed protein product [Babesia microti strain RI]
Length = 725
Score = 179 bits (454), Expect = 8e-42, Method: Compositional matrix adjust.
Identities = 108/285 (37%), Positives = 157/285 (55%), Gaps = 10/285 (3%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V++G ++YTGDY+ DRH+ A I LLI ESTY T I D + RE +
Sbjct: 170 MFMVEIGGIRVLYTGDYSSEQDRHVPKAEIPPIDVHLLICESTYGTRIHDERTQRETRLI 229
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWE--RMNLQAPIYFAVGLTEKATNYYK 118
+ + VD GGK L+PVFALGRAQE+ ++LE YW+ R + PI++ L+ KA Y+
Sbjct: 230 RSILNAVDNGGKCLLPVFALGRAQEILLILEEYWKANRRLHRVPIFYISPLSSKALKVYE 289
Query: 119 MFITWTNQKIRKTFVQ-RNMFDFKHIRPFD-----KSFIDNPGPMVVFATPGMLHSGLSL 172
FI + I++ Q N + F HI+ +S + P V+ +PGML G S
Sbjct: 290 TFIGVCGEHIKRRVQQGENPYHFTHIKYAPTVDSVRSHLLRDAPCVIMTSPGMLQGGPSR 349
Query: 173 IIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHAD 232
+F+ AP N +I+ G+ V+GT+ ++ ++ E+ II + IE +SFSAHAD
Sbjct: 350 DVFEIIAPDNRNGVILTGYTVKGTLADELKKEPDVIKLED-NIIKRRCFIEQISFSAHAD 408
Query: 233 AKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF-NLDCFMP 276
I + NVLLVHGE +M L +K+ ++ NL FMP
Sbjct: 409 YNQTRDFIDNLQVPNVLLVHGERKEMKRLHDKLLEDRPNLSVFMP 453
>gi|194754148|ref|XP_001959359.1| GF12827 [Drosophila ananassae]
gi|190620657|gb|EDV36181.1| GF12827 [Drosophila ananassae]
Length = 492
Score = 179 bits (454), Expect = 8e-42, Method: Compositional matrix adjust.
Identities = 112/364 (30%), Positives = 182/364 (50%), Gaps = 40/364 (10%)
Query: 662 VSQLMSEKKSYEVQVMSQVV-AAVTNSCDSSHIIDLGGGQGYLSTILALQHGKKTLSLDY 720
+ + MS KK +EV++ + +V V NS +I+D G G+GYLS+ LALQ+G L +D
Sbjct: 116 IREFMSAKKCHEVELTAALVDHLVKNSNQGCYIVDAGDGKGYLSSRLALQYGHHVLGIDA 175
Query: 721 NQVNTHGAAVRSKKLEDSEKESKGPFQSYAGVINKKLWMRTQNRQTSHASVEHHGKNWKR 780
N NT A R++KL+ R N T A ++ G KR
Sbjct: 176 NAANTENALSRNRKLQ-----------------------RAWNGLTERAELQSQGITPKR 212
Query: 781 KSKAPVVSIKDEELVVCKDKCKQITHFVTPDSDISSILSQAYPQDSLHNVCIMGLHTCGD 840
+ K + V + K F+T + + +L + + + +C+ GLHTCG+
Sbjct: 213 RGKKSPTR-DSSKTVPALENYKTTARFITTEINFGELLEEHFSKKETSEICLTGLHTCGN 271
Query: 841 LSGTALRLF-TKSSLQCLVQVGCCYHLLEEEFIRSPFWKDVDQSLYEHGYGFPLSEHLRS 899
L+ T L++F +S + L +GCCYHLL E + + F+ + + +GFPLS++L+
Sbjct: 272 LASTCLQVFHAQSDCRILCNIGCCYHLLRERYSTNEFFGNKALMDMQSDFGFPLSQYLQL 331
Query: 900 RKFFLGRNVRMSGTQSPERVIDLKQTQTLPLFYRALLEKYLRSKITINDEEPKV-----V 954
+K LGRN RM QS ER + K+ + L+YRALLE + + P + V
Sbjct: 332 KKVRLGRNARMLAAQSIERTLAAKELPNMTLYYRALLE------VLVCRHAPHLKNELQV 385
Query: 955 GRLATKCSNFVEYVHRAVDKLKLDL--EVDDEEVTRLFNSHQREYEYLQIYYFLKTALAP 1012
G++ K +F EYV + KL ++D E+ + + E YL ++Y ++ + AP
Sbjct: 386 GKV-RKFEDFKEYVKKCASKLDAPWLAAIEDTELESILQEYAMEKHYLDLFYLVRMSFAP 444
Query: 1013 VIEA 1016
V+E+
Sbjct: 445 VLES 448
Score = 58.2 bits (139), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 40/154 (25%), Positives = 75/154 (48%), Gaps = 12/154 (7%)
Query: 485 IKNTIDEMIEFITPNIACVNTHTVHFMVEKLWETTIPKAILSQVNNSPSFDSLLEE-FWK 543
++ +D +IEF+ P+ VN H V+F+ + W T IP A+ +++ + +E+ FW+
Sbjct: 9 LQKRLDGLIEFLNPHWDFVNCHMVNFLTDFHWGTFIPAALRNEIGCKEDLELAMEQLFWQ 68
Query: 544 SRQDNHVETNSELVKFFQAADKFRLTSLMNGESIITVDELFSKLQARKCGQVVESAKVSQ 603
+ + E F ++ RL+ + E + TVD+L + + +
Sbjct: 69 TGKS--TSKFPEWESFLSKGEQERLS--FHSELLTTVDQLIDGPEN------ATQLSIRE 118
Query: 604 LMSEKKSYEVQVMSQVV-AAVTNSCDSSHIIDLG 636
MS KK +EV++ + +V V NS +I+D G
Sbjct: 119 FMSAKKCHEVELTAALVDHLVKNSNQGCYIVDAG 152
>gi|302499334|ref|XP_003011663.1| hypothetical protein ARB_02217 [Arthroderma benhamiae CBS 112371]
gi|291175215|gb|EFE31023.1| hypothetical protein ARB_02217 [Arthroderma benhamiae CBS 112371]
Length = 749
Score = 179 bits (453), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 119/396 (30%), Positives = 202/396 (51%), Gaps = 41/396 (10%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDF 59
MF + + +I++TGDY+ DRHL +A + K + D++ITEST+ + + RE
Sbjct: 74 MFLISIAGLNILFTGDYSREEDRHLISAEVPKGVKIDVMITESTFGISSNPPRLEREAAL 133
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERM-NLQ-APIYFAVGLTEKATNYY 117
+K V ++RGG+VL+PVFALGRAQEL ++L+ YW R LQ PIY+ + + Y
Sbjct: 134 MKSVTSIINRGGRVLMPVFALGRAQELLLILDEYWSRHPELQKVPIYYIGNMARRCMVVY 193
Query: 118 KMFITWTNQKIRKTFVQR--------------NMFDFKHIRPF-DKSFIDNPGPMVVFAT 162
+ +I N+ I++ F QR +DF+ +R + ++ G V+ A+
Sbjct: 194 QTYIGAMNENIKRLFRQRMAEAEARGDKSVTAGPWDFRFVRSLRNLDRFEDVGGCVMLAS 253
Query: 163 PGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEF------------ 210
PGML +G S + ++WAP E N +IM G+ V+GT+G ++++ +++
Sbjct: 254 PGMLQTGTSRELLERWAPNERNGVIMTGYSVEGTMGKQIINEPEQIPAVMSAKNSAGPSD 313
Query: 211 ENKQIIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEK-----I 265
+ K +I + ++ +SF+AH D + I+ V+LVHGE +M+ LK K +
Sbjct: 314 DQKIMIQRRCTVDEISFAAHVDGVENREFIESVAAPVVILVHGEKHQMMRLKSKLLSLNV 373
Query: 266 KQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLKEEAVKYNSEPPNPLKERQIHGVLVI 325
+E + + PAN + + M + V L E S P R + GVLV
Sbjct: 374 DKEVKVKVYTPANCDEVRIPF-MVDKVARVVGRLAE-----TSPPIGQDDSRLMDGVLVQ 427
Query: 326 KDSSISLMDVDEACKEVGISRHIVRFTSNVQISDSS 361
+S+M D+ + G++ +V + +S +S
Sbjct: 428 NGFKLSMMASDDLREYAGLTTTMVTCKQYITLSTAS 463
>gi|239612611|gb|EEQ89598.1| endoribonuclease ysh1 [Ajellomyces dermatitidis ER-3]
Length = 904
Score = 178 bits (452), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 121/404 (29%), Positives = 205/404 (50%), Gaps = 50/404 (12%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDF 59
MF + + +I++TGDY+ DRHL +A K + D+LITEST+ + + RE
Sbjct: 181 MFLISIAGLNILFTGDYSREEDRHLISAEAPKGIKIDVLITESTFGVSSNPPRLEREAAL 240
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERM-NLQA-PIYFAVGLTEKATNYY 117
+K + ++RGG+VL+PVFALGRAQEL ++L+ YW R LQ PIY+ + + Y
Sbjct: 241 MKSITGVLNRGGRVLMPVFALGRAQELLLILDEYWSRHPELQKIPIYYIGNIARRCMVVY 300
Query: 118 KMFITWTNQKIRKTFVQR--------------NMFDFKHIRPFDK-SFIDNPGPMVVFAT 162
+ +I N+ I++ F QR +DF+ +R D+ G V+ A+
Sbjct: 301 QTYIGAMNENIKRLFRQRMAEAEASGDKSVSAGPWDFRFVRSVRSIERFDDVGGCVMLAS 360
Query: 163 PGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKL-------------- 208
PGML +G S + ++WAP E N +IM G+ V+GT+G ++L+ +++
Sbjct: 361 PGMLQTGTSRELLERWAPSERNGVIMTGYSVEGTMGKQILNEPEQIPAVMSGRNAVGPAR 420
Query: 209 -----EFENKQIIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKE 263
+ E K +I + ++ +SF+AH D + I+ V+LVHGE ++M+ LK
Sbjct: 421 RMAAGDDEQKVMIPRRCTVDEISFAAHVDGMENREFIEEVAAPVVILVHGEKNQMMRLKS 480
Query: 264 KI-----KQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLKEEAVKYNSEPPNPLKERQ 318
K+ + + + PAN E + + +D ++ A S PP+ E +
Sbjct: 481 KLLSLNADKTVKVKVYTPANCE----EVRIPFQVDKVAKVVGRLA---ESSPPSDHNESR 533
Query: 319 I-HGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDSS 361
I +GVLV +SLM ++ + G++ + + +S +S
Sbjct: 534 IMNGVLVQNGFKLSLMAPEDLREYAGLATTTITCKQLITLSTAS 577
>gi|302667649|ref|XP_003025406.1| hypothetical protein TRV_00467 [Trichophyton verrucosum HKI 0517]
gi|291189514|gb|EFE44795.1| hypothetical protein TRV_00467 [Trichophyton verrucosum HKI 0517]
Length = 865
Score = 178 bits (452), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 119/396 (30%), Positives = 202/396 (51%), Gaps = 41/396 (10%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDF 59
MF + + +I++TGDY+ DRHL +A + K + D++ITEST+ + + RE
Sbjct: 178 MFLISIAGLNILFTGDYSREEDRHLISAEVPKGVKIDVMITESTFGISSNPPRLEREAAL 237
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERM-NLQ-APIYFAVGLTEKATNYY 117
+K V ++RGG+VL+PVFALGRAQEL ++L+ YW R LQ PIY+ + + Y
Sbjct: 238 MKSVTSIINRGGRVLMPVFALGRAQELLLILDEYWSRHPELQKVPIYYIGNMARRCMVVY 297
Query: 118 KMFITWTNQKIRKTFVQR--------------NMFDFKHIRPF-DKSFIDNPGPMVVFAT 162
+ +I N+ I++ F QR +DF+ +R + ++ G V+ A+
Sbjct: 298 QTYIGAMNENIKRLFRQRMAEAEARGDKSVTAGPWDFRFVRSLRNLDRFEDVGGCVMLAS 357
Query: 163 PGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEF------------ 210
PGML +G S + ++WAP E N +IM G+ V+GT+G ++++ +++
Sbjct: 358 PGMLQTGTSRELLERWAPNERNGVIMTGYSVEGTMGKQIINEPEQIPAVMSAKNAAGPSD 417
Query: 211 ENKQIIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEK-----I 265
+ K +I + ++ +SF+AH D + I+ V+LVHGE +M+ LK K +
Sbjct: 418 DQKIMIQRRCTVDEISFAAHVDGVENREFIESVAAPVVILVHGEKHQMMRLKSKLLSLNV 477
Query: 266 KQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLKEEAVKYNSEPPNPLKERQIHGVLVI 325
+E + + PAN + + M + V L E S P R + GVLV
Sbjct: 478 DKEVKVKVYTPANCDEVRIPF-MVDKVARVVGRLAE-----TSPPIGQDDSRLMDGVLVQ 531
Query: 326 KDSSISLMDVDEACKEVGISRHIVRFTSNVQISDSS 361
+S+M D+ + G++ +V + +S +S
Sbjct: 532 NGFKLSMMASDDLREYAGLTATMVTCKQYITLSTAS 567
>gi|344232758|gb|EGV64631.1| Metallo-hydrolase/oxidoreductase [Candida tenuis ATCC 10573]
Length = 782
Score = 178 bits (452), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 104/311 (33%), Positives = 170/311 (54%), Gaps = 26/311 (8%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
++ +++G ++TGDY+ +RHL A + +PD+LITEST+ T + + +E
Sbjct: 173 LYLIEIGGLKALFTGDYSREENRHLPVAEVPSVKPDILITESTFGTATHEPRMEKENRMT 232
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW-ERMNLQA-PIYFAVGLTEKATNYYK 118
+ +H + +GG+VL+PVFALG AQEL ++LE YW + +LQ +YFA L K Y+
Sbjct: 233 RIIHSTLSKGGRVLMPVFALGTAQELLLILEEYWSQNKDLQNIDVYFASSLARKCLAVYQ 292
Query: 119 MFITWTNQKIR-----KTFVQRNMFDFKHIRPFDKSF--IDNPGPMVVFATPGMLHSGLS 171
+ N KIR ++ ++N F FK+I+ KS + GP VV A+PGML SG S
Sbjct: 293 TYTNIMNDKIRSMASSSSYDRKNPFTFKYIKTL-KSLDRFQDFGPSVVIASPGMLQSGFS 351
Query: 172 LIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFEN--KQIIDVKMAIEYMSFSA 229
+ +KWAP N ++M G+ V+GT+ +L + N + I +++IE +SF+A
Sbjct: 352 RQLLEKWAPDPKNTVLMTGYSVEGTMAKDLLIEPPTIPSVNNPEMTITRRLSIEEISFAA 411
Query: 230 HADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF--------NLDCFMPANGES 281
H D + + I ++LVHG++ M LK + ++ + F P NG+
Sbjct: 412 HVDFQQNAEFIGEVNAPRIILVHGDSIPMGRLKSALLSKYANRKGTDQEVKVFNPRNGD- 470
Query: 282 CFVQTDMKISI 292
++KI+I
Sbjct: 471 -----ELKIAI 476
>gi|327293421|ref|XP_003231407.1| endoribonuclease ysh1 [Trichophyton rubrum CBS 118892]
gi|326466523|gb|EGD91976.1| endoribonuclease ysh1 [Trichophyton rubrum CBS 118892]
Length = 855
Score = 178 bits (452), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 119/396 (30%), Positives = 202/396 (51%), Gaps = 41/396 (10%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDF 59
MF + + +I++TGDY+ DRHL +A + K + D++ITEST+ + + RE
Sbjct: 180 MFLISIAGLNILFTGDYSREEDRHLISAEVPKGVKIDVMITESTFGISSNPPRLEREAAL 239
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERM-NLQ-APIYFAVGLTEKATNYY 117
+K V ++RGG+VL+PVFALGRAQEL ++L+ YW R LQ PIY+ + + Y
Sbjct: 240 MKSVTSIINRGGRVLMPVFALGRAQELLLILDEYWSRHPELQKVPIYYIGNMARRCMVVY 299
Query: 118 KMFITWTNQKIRKTFVQR--------------NMFDFKHIRPF-DKSFIDNPGPMVVFAT 162
+ +I N+ I++ F QR +DF+ +R + ++ G V+ A+
Sbjct: 300 QTYIGAMNENIKRLFRQRMAEAEARGDKSVTAGPWDFRFVRSLRNLDRFEDVGGCVMLAS 359
Query: 163 PGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEF------------ 210
PGML +G S + ++WAP E N +IM G+ V+GT+G ++++ +++
Sbjct: 360 PGMLQTGTSRELLERWAPNERNGVIMTGYSVEGTMGKQIINEPEQIPAVMSAKNAAGPSD 419
Query: 211 ENKQIIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEK-----I 265
+ K +I + ++ +SF+AH D + I+ V+LVHGE +M+ LK K +
Sbjct: 420 DQKIMIQRRCTVDEISFAAHVDGVENREFIESVAAPVVILVHGEKHQMMRLKSKLLSLNV 479
Query: 266 KQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLKEEAVKYNSEPPNPLKERQIHGVLVI 325
+E + + PAN + + M + V L E S P R + GVLV
Sbjct: 480 DKEVKVKVYTPANCDEVRIPF-MVDKVARVVGRLAE-----TSPPIGQDDSRLMDGVLVQ 533
Query: 326 KDSSISLMDVDEACKEVGISRHIVRFTSNVQISDSS 361
+S+M D+ + G++ +V + +S +S
Sbjct: 534 NGFKLSMMASDDLREYAGLTTTMVTCKQYITLSTAS 569
>gi|261191614|ref|XP_002622215.1| endoribonuclease ysh1 [Ajellomyces dermatitidis SLH14081]
gi|239589981|gb|EEQ72624.1| endoribonuclease ysh1 [Ajellomyces dermatitidis SLH14081]
Length = 894
Score = 178 bits (452), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 121/404 (29%), Positives = 205/404 (50%), Gaps = 50/404 (12%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDF 59
MF + + +I++TGDY+ DRHL +A K + D+LITEST+ + + RE
Sbjct: 188 MFLISIAGLNILFTGDYSREEDRHLISAEAPKGIKIDVLITESTFGVSSNPPRLEREAAL 247
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERM-NLQA-PIYFAVGLTEKATNYY 117
+K + ++RGG+VL+PVFALGRAQEL ++L+ YW R LQ PIY+ + + Y
Sbjct: 248 MKSITGVLNRGGRVLMPVFALGRAQELLLILDEYWSRHPELQKIPIYYIGNIARRCMVVY 307
Query: 118 KMFITWTNQKIRKTFVQR--------------NMFDFKHIRPFDK-SFIDNPGPMVVFAT 162
+ +I N+ I++ F QR +DF+ +R D+ G V+ A+
Sbjct: 308 QTYIGAMNENIKRLFRQRMAEAEASGDKSVSAGPWDFRFVRSVRSIERFDDVGGCVMLAS 367
Query: 163 PGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKL-------------- 208
PGML +G S + ++WAP E N +IM G+ V+GT+G ++L+ +++
Sbjct: 368 PGMLQTGTSRELLERWAPSERNGVIMTGYSVEGTMGKQILNEPEQIPAVMSGRNAVGPAR 427
Query: 209 -----EFENKQIIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKE 263
+ E K +I + ++ +SF+AH D + I+ V+LVHGE ++M+ LK
Sbjct: 428 RMAAGDDEQKVMIPRRCTVDEISFAAHVDGMENREFIEEVAAPVVILVHGEKNQMMRLKS 487
Query: 264 KI-----KQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLKEEAVKYNSEPPNPLKERQ 318
K+ + + + PAN E + + +D ++ A S PP+ E +
Sbjct: 488 KLLSLNADKTVKVKVYTPANCE----EVRIPFQVDKVAKVVGRLA---ESSPPSDHNESR 540
Query: 319 I-HGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDSS 361
I +GVLV +SLM ++ + G++ + + +S +S
Sbjct: 541 IMNGVLVQNGFKLSLMAPEDLREYAGLATTTITCKQLITLSTAS 584
>gi|225561321|gb|EEH09601.1| endoribonuclease ysh1 [Ajellomyces capsulatus G186AR]
Length = 903
Score = 178 bits (452), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 121/405 (29%), Positives = 207/405 (51%), Gaps = 51/405 (12%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDF 59
MF + + +I++TGDY+ DRHL +A K + D+LITEST+ + + RE
Sbjct: 181 MFLISIAGLNILFTGDYSREEDRHLISAEAPKGVKVDVLITESTFGVSSNPPRLEREAAL 240
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERM-NLQA-PIYFAVGLTEKATNYY 117
+K + ++RGG+VL+PVFALGRAQEL ++L+ YW R LQ PIY+ + + Y
Sbjct: 241 IKSITSILNRGGRVLMPVFALGRAQELLLILDEYWSRHPELQKIPIYYIGNIARRCMVVY 300
Query: 118 KMFITWTNQKIRKTFVQR--------------NMFDFKHIRPFDK-SFIDNPGPMVVFAT 162
+ +I N+ I++ F QR +DF+ +R D+ G V+ A+
Sbjct: 301 QTYIGAMNENIKRLFRQRMAEAEASGDKSISAGPWDFRFVRSVRSIERFDDVGGCVMLAS 360
Query: 163 PGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKL-------------- 208
PGML +G S + ++WAP E N +IM G+ V+GT+G ++L+ +++
Sbjct: 361 PGMLQTGTSRELLERWAPNERNGVIMTGYSVEGTMGKQILNEPEQIPAVMSGRNAVGPAR 420
Query: 209 ------EFENKQIIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLK 262
+ E K +I + +++ +SF+AH D I+ V+LVHGE ++M+ LK
Sbjct: 421 RMATGDDDEQKVMIPRRCSVDEISFAAHVDGMENRAFIEEVGAPVVILVHGEKNQMMRLK 480
Query: 263 EKI-----KQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLKEEAVKYNSEPPNPLKE- 316
K+ + + + PAN E + + +D ++ A + PP+ KE
Sbjct: 481 SKLLSLNTDKTVKVKVYTPANCE----EVRIPFQVDKVAKVVGRLA---ETVPPSDHKES 533
Query: 317 RQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDSS 361
R ++GVLV +SLM ++ + G++ V ++ ++ +S
Sbjct: 534 RLLNGVLVQHGFKLSLMAPEDLREYAGLATTTVTCKQHITLNTAS 578
>gi|327356883|gb|EGE85740.1| endoribonuclease ysh1 [Ajellomyces dermatitidis ATCC 18188]
Length = 887
Score = 178 bits (451), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 121/404 (29%), Positives = 205/404 (50%), Gaps = 50/404 (12%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDF 59
MF + + +I++TGDY+ DRHL +A K + D+LITEST+ + + RE
Sbjct: 181 MFLISIAGLNILFTGDYSREEDRHLISAEAPKGIKIDVLITESTFGVSSNPPRLEREAAL 240
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERM-NLQA-PIYFAVGLTEKATNYY 117
+K + ++RGG+VL+PVFALGRAQEL ++L+ YW R LQ PIY+ + + Y
Sbjct: 241 MKSITGVLNRGGRVLMPVFALGRAQELLLILDEYWSRHPELQKIPIYYIGNIARRCMVVY 300
Query: 118 KMFITWTNQKIRKTFVQR--------------NMFDFKHIRPFDK-SFIDNPGPMVVFAT 162
+ +I N+ I++ F QR +DF+ +R D+ G V+ A+
Sbjct: 301 QTYIGAMNENIKRLFRQRMAEAEASGDKSVSAGPWDFRFVRSVRSIERFDDVGGCVMLAS 360
Query: 163 PGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKL-------------- 208
PGML +G S + ++WAP E N +IM G+ V+GT+G ++L+ +++
Sbjct: 361 PGMLQTGTSRELLERWAPSERNGVIMTGYSVEGTMGKQILNEPEQIPAVMSGRNAVGPAR 420
Query: 209 -----EFENKQIIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKE 263
+ E K +I + ++ +SF+AH D + I+ V+LVHGE ++M+ LK
Sbjct: 421 RMAAGDDEQKVMIPRRCTVDEISFAAHVDGMENREFIEEVAAPVVILVHGEKNQMMRLKS 480
Query: 264 KI-----KQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLKEEAVKYNSEPPNPLKERQ 318
K+ + + + PAN E + + +D ++ A S PP+ E +
Sbjct: 481 KLLSLNADKTVKVKVYTPANCE----EVRIPFQVDKVAKVVGRLA---ESSPPSDHNESR 533
Query: 319 I-HGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDSS 361
I +GVLV +SLM ++ + G++ + + +S +S
Sbjct: 534 IMNGVLVQNGFKLSLMAPEDLREYAGLATTTITCKQLITLSTAS 577
>gi|154282371|ref|XP_001541981.1| hypothetical protein HCAG_02152 [Ajellomyces capsulatus NAm1]
gi|150410161|gb|EDN05549.1| hypothetical protein HCAG_02152 [Ajellomyces capsulatus NAm1]
Length = 925
Score = 178 bits (451), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 124/407 (30%), Positives = 207/407 (50%), Gaps = 55/407 (13%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDF 59
MF + + +I++TGDY+ DRHL +A K + D+LITEST+ + + RE
Sbjct: 181 MFLISIAGLNILFTGDYSREEDRHLISAEAPKGVKVDVLITESTFGVSSNPPRLEREAAL 240
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERM-NLQA-PIYFAVGLTEKATNYY 117
+K + ++RGG+VL+PVFALGRAQEL ++L+ YW R LQ PIY+ + + Y
Sbjct: 241 IKSITSILNRGGRVLMPVFALGRAQELLLILDEYWSRHPELQKIPIYYIGNIARRCMVVY 300
Query: 118 KMFITWTNQKIRKTFVQR--------------NMFDFKHIRPFDK-SFIDNPGPMVVFAT 162
+ +I N+ I++ F QR +DF+ +R D+ G V+ A+
Sbjct: 301 QTYIGAMNENIKRLFRQRMAEAEASGDKSISAGPWDFRFVRSVRSIERFDDVGGCVMLAS 360
Query: 163 PGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKL-------------- 208
PGML +G S + ++WAP E N +IM G+ V+GT+G ++L+ +++
Sbjct: 361 PGMLQTGTSRELLERWAPNERNGVIMTGYSVEGTMGKQILNEPEQIPAVMSGRNAVGPAR 420
Query: 209 ------EFENKQIIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLK 262
+ E K +I + +++ +SF+AH D I+ V+LVHGE ++M+ LK
Sbjct: 421 RMATGDDDEQKVMIPRRCSVDEISFAAHVDGMENRAFIEEVGAPVVILVHGEKNQMMRLK 480
Query: 263 EKI-----KQEFNLDCFMPANGESCFV--QTDMKISIDVSVNLLKEEAVKYNSEPPNPLK 315
K+ + + + PAN E + Q D + V L E + PP+ K
Sbjct: 481 SKLLSLNTDKTVKIKVYTPANCEEVRIPFQVDKVAKV---VGRLAE------TVPPSGHK 531
Query: 316 E-RQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDSS 361
E R ++GVLV +SLM ++ + G++ V ++ ++ +S
Sbjct: 532 ESRLLNGVLVQHGFKLSLMAPEDLREYAGLATTTVTCKQHITLNTAS 578
>gi|116283804|gb|AAH30988.1| CPSF3 protein [Homo sapiens]
Length = 554
Score = 178 bits (451), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 89/207 (42%), Positives = 125/207 (60%), Gaps = 3/207 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGD++ DRHL AA I +PD+LI ESTY T I + + RE F
Sbjct: 164 MFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPNIKPDILIIESTYGTHIHEKREEREARFC 223
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH+ V+RGG+ LIPVFALGRAQEL ++L+ YW+ PIY+A L +K Y+
Sbjct: 224 NTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQ 283
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ N KIRK N F FKHI D+ GP VV A+PGM+ SGLS +F+
Sbjct: 284 TYVNAMNDKIRKQININNPFVFKHISNLKSMDHFDDIGPSVVMASPGMMQSGLSRELFES 343
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSG 204
W + N +I+ G+CV+GT+ ++ G
Sbjct: 344 WCTDKRNGVIIAGYCVEGTLAKILVHG 370
>gi|345563625|gb|EGX46611.1| hypothetical protein AOL_s00097g515 [Arthrobotrys oligospora ATCC
24927]
Length = 791
Score = 177 bits (450), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 126/437 (28%), Positives = 213/437 (48%), Gaps = 58/437 (13%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDF 59
MF +++ I++TGDY+ DRHL +A + K +PD+LITESTY T + +E F
Sbjct: 175 MFLIEIAGLKILFTGDYSREDDRHLVSASLPKHIKPDILITESTYGTASHMPRPEKEARF 234
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWE--RMNLQAPIYFAVGLTEKATNYY 117
+ V +DRGG+VL+PVFALGRAQEL ++LE YWE Q PIY+A L + + Y
Sbjct: 235 ISLVTSILDRGGRVLMPVFALGRAQELLLILEEYWEVHERYRQYPIYYASSLARRCMSVY 294
Query: 118 KMFITWTNQKIRKTFVQR------------------NMFDFKHIRPFDKSF--IDNPGPM 157
+ +I N I+ F + N F+ + +R KS D+ G
Sbjct: 295 QTYIHAMNDNIKALFRSKMAAIGEAAGKDGQVIGGTNPFEMRWVRSL-KSLDRFDDVGGC 353
Query: 158 VVFATPGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKL--------- 208
V+ A PGM+ +G+S + ++W P N +I+ G+ V+GT+ +L+ ++
Sbjct: 354 VMLAAPGMMQNGVSRELLERWCPDPKNGVILTGYSVEGTLAKSILNEPTEIQAFKKEHTS 413
Query: 209 -----EFENKQIIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKE 263
E ++ +I + +I+ +SF+AH D I+ K ++LVHGE + M LK
Sbjct: 414 RRSGREEADRVMIPRRCSIDELSFAAHVDYGQNSSFIEEVGAKVIILVHGETNAMGRLKS 473
Query: 264 KIKQEFNLDCFMPA--NGESCFVQTDMKISIDVSVNLLKEEAVKYNSEPPNPLK------ 315
+ +++ P N ++C +++I + K PP+P K
Sbjct: 474 ALLSKYHDRKDKPRIYNPKNC-----EELNIPFKGEKIANVVGKLAKHPPHPSKDVDKAL 528
Query: 316 -----ERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDSSP--INKTLE 368
E+ GVLV D +SLM D+ + G++ V V +S + + +LE
Sbjct: 529 VKTGQEQIFAGVLVQNDFKLSLMSPDDLREYAGLTTTTVMCKQRVPLSTAGAGLVRWSLE 588
Query: 369 LIYDQLLSYLQDKSQEY 385
++ +++ ++++ Y
Sbjct: 589 SMFGGIVTVKKERTNGY 605
>gi|326475916|gb|EGD99925.1| endoribonuclease ysh1 [Trichophyton tonsurans CBS 112818]
Length = 855
Score = 177 bits (450), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 119/396 (30%), Positives = 202/396 (51%), Gaps = 41/396 (10%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDF 59
MF + + +I++TGDY+ DRHL +A + K + D++ITEST+ + + RE
Sbjct: 180 MFLISIAGLNILFTGDYSREEDRHLISAEVPKGVKIDVMITESTFGISSNPPRLEREAAL 239
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERM-NLQ-APIYFAVGLTEKATNYY 117
+K V ++RGG+VL+PVFALGRAQEL ++L+ YW R LQ PIY+ + + Y
Sbjct: 240 MKSVTSIINRGGRVLMPVFALGRAQELLLILDEYWSRHPELQKVPIYYIGNMARRCMVVY 299
Query: 118 KMFITWTNQKIRKTFVQR--------------NMFDFKHIRPF-DKSFIDNPGPMVVFAT 162
+ +I N+ I++ F QR +DF+ +R + ++ G V+ A+
Sbjct: 300 QTYIGAMNENIKRLFRQRMAEAEARGDKSVTAGPWDFRFVRSLRNLDRFEDVGGCVMLAS 359
Query: 163 PGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEF------------ 210
PGML +G S + ++WAP E N +IM G+ V+GT+G ++++ +++
Sbjct: 360 PGMLQTGTSRELLERWAPNERNGVIMTGYSVEGTMGKQIINEPEQIPAVMSAKNAAGPSD 419
Query: 211 ENKQIIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEK-----I 265
+ K +I + ++ +SF+AH D + I+ V+LVHGE +M+ LK K +
Sbjct: 420 DQKIMIQRRCTVDEVSFAAHVDGVENREFIESVAAPVVILVHGEKHQMMRLKSKLLSLNV 479
Query: 266 KQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLKEEAVKYNSEPPNPLKERQIHGVLVI 325
+E + + PAN + + M + V L E S P R + GVLV
Sbjct: 480 DKEVKVKVYTPANCDEVRIPF-MVDKVARVVGRLAE-----TSPPIGQDDSRLMDGVLVQ 533
Query: 326 KDSSISLMDVDEACKEVGISRHIVRFTSNVQISDSS 361
+S+M D+ + G++ +V + +S +S
Sbjct: 534 NGFKLSMMASDDLREYAGLTTTMVTCKQYITLSTAS 569
>gi|315043764|ref|XP_003171258.1| endoribonuclease ysh1 [Arthroderma gypseum CBS 118893]
gi|311345047|gb|EFR04250.1| endoribonuclease ysh1 [Arthroderma gypseum CBS 118893]
Length = 853
Score = 177 bits (450), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 119/396 (30%), Positives = 201/396 (50%), Gaps = 41/396 (10%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDF 59
MF + + +I++TGDY+ DRHL +A + K + D++ITEST+ + + RE
Sbjct: 180 MFLISIAGLNILFTGDYSREEDRHLISAEVPKSVKIDVMITESTFGISSNPPRLEREAAL 239
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERM-NLQ-APIYFAVGLTEKATNYY 117
+K V ++RGG+VL+PVFALGRAQEL ++L+ YW R LQ PIY+ + + Y
Sbjct: 240 MKSVTSVINRGGRVLMPVFALGRAQELLLILDEYWSRHPELQKVPIYYIGNMARRCMVVY 299
Query: 118 KMFITWTNQKIRKTFVQR--------------NMFDFKHIRPF-DKSFIDNPGPMVVFAT 162
+ +I N+ I++ F QR +DF+ +R + ++ G V+ A+
Sbjct: 300 QTYIGAMNENIKRLFRQRMAEAEARGDKSVTAGPWDFRFVRSLRNLDRFEDVGGCVMLAS 359
Query: 163 PGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEF------------ 210
PGML +G S + ++WAP E N +IM G+ V+GT+G ++++ +++
Sbjct: 360 PGMLQTGTSRELLERWAPNERNGVIMTGYSVEGTMGKQIINEPEQIPAVMSAKNAAGPSD 419
Query: 211 ENKQIIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEK-----I 265
+ K +I + ++ +SF+AH D + I+ V+LVHGE +M+ LK K +
Sbjct: 420 DQKIMIQRRCTVDEISFAAHVDGVENREFIESVAAPVVILVHGEKHQMMRLKSKLLSLNV 479
Query: 266 KQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLKEEAVKYNSEPPNPLKERQIHGVLVI 325
+E + + PAN + + M + V L E S P R + GVLV
Sbjct: 480 DKEVKVKVYTPANCDEVRIPF-MVDKVARVVGRLAE-----TSPPVGQDDSRLMDGVLVQ 533
Query: 326 KDSSISLMDVDEACKEVGISRHIVRFTSNVQISDSS 361
+S+M D+ + G++ V + +S +S
Sbjct: 534 NGFKLSMMASDDLREYAGLTTTTVTCKQYITLSTAS 569
>gi|71027889|ref|XP_763588.1| cleavage and polyadenylation specificity factor protein [Theileria
parva strain Muguga]
gi|68350541|gb|EAN31305.1| cleavage and polyadenylation specificity factor protein, putative
[Theileria parva]
Length = 708
Score = 177 bits (450), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 133/461 (28%), Positives = 229/461 (49%), Gaps = 53/461 (11%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++G I+YTGDY+M DRHL +A I LLI+ESTY + + + RE FL
Sbjct: 174 MFLIEIGGVRILYTGDYSMEKDRHLPSAEIPLTNVHLLISESTYGIRVHEERSQREMRFL 233
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQ---APIYFAVGLTEKATNYY 117
V + + R GK L+PVFALGR+QE+ ++L+ YWE N Q PI++ L K+ Y
Sbjct: 234 HVVMDIIMRNGKCLLPVFALGRSQEILLILDDYWEN-NKQLHNVPIFYISPLASKSLKVY 292
Query: 118 KMFITWTNQKIRKTFVQR-NMFDFKHIRPFD-----KSFIDNPGPMVVFATPGMLHSGLS 171
+ F+ + I+++ N F+FK +R ++++ GP ++ +PGML G S
Sbjct: 293 ETFVGQCGEYIKQSVYNGFNPFNFKFVRYARSIKQIRNYLLRDGPCIIMTSPGMLQGGPS 352
Query: 172 LIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHA 231
L +F+ P N +++ G+ V+GT+ ++ + + K +I + ++E +SFSAHA
Sbjct: 353 LEVFELLCPDNRNGVVLTGYAVKGTLADELKKDPELINLGTK-VIKPRCSVEQISFSAHA 411
Query: 232 DAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF-NLDCFMP--------ANGESC 282
D I+ NV+LVHGE ++M +K+K++++ L FMP C
Sbjct: 412 DYNQTKGFIKKLSVPNVILVHGERNEMKRMKDKLEEDIPQLTVFMPEVLQEVTLTFTPQC 471
Query: 283 FVQTDMKISIDVSVNLLKEEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEV 342
F+ T +S D+ + L + GV + D + ++DE V
Sbjct: 472 FIDTIGSLSSDLQL-----------------LDSTTLGGVFSVLDGN---KEIDEVVSAV 511
Query: 343 GISRHIVR--FTSNVQISDS---SPINKTLELIYDQLLSYLQD--KSQEYKIQLTESSLS 395
+ R ++ +V+ D+ + +N+ + + L YL KS K++L +
Sbjct: 512 LLKDKDERIIYSDDVKEFDNLSINALNQKMFFTFPHKLEYLVKALKSVYNKVELISEN-- 569
Query: 396 IDSVLLSVERIDDKNKRVFVTW---PNQDEEVGKIVLHVLK 433
++ +L ++ N ++ +TW P D I+L V+K
Sbjct: 570 -EANVLDQVKLYKPNNQLLLTWESGPVSDFIADSIILMVIK 609
>gi|296803464|ref|XP_002842585.1| endoribonuclease ysh1 [Arthroderma otae CBS 113480]
gi|238838904|gb|EEQ28566.1| endoribonuclease ysh1 [Arthroderma otae CBS 113480]
Length = 854
Score = 177 bits (450), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 119/396 (30%), Positives = 202/396 (51%), Gaps = 41/396 (10%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDF 59
MF + + +I++TGDY+ DRHL +A + K + D++ITEST+ + + RE
Sbjct: 180 MFLISIAGLNILFTGDYSREEDRHLISAEVPKGVKIDVMITESTFGISSNPPRLEREAAL 239
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERM-NLQ-APIYFAVGLTEKATNYY 117
+K V ++RGG+VL+PVFALGRAQEL ++L+ YW R LQ PIY+ + + Y
Sbjct: 240 IKSVTSIINRGGRVLMPVFALGRAQELLLILDEYWSRHPELQKVPIYYIGNMARRCMVVY 299
Query: 118 KMFITWTNQKIRKTFVQR--------------NMFDFKHIRPF-DKSFIDNPGPMVVFAT 162
+ +I N+ I++ F QR +DF+ +R + ++ G V+ A+
Sbjct: 300 QTYIGAMNENIKRLFRQRMAEAEARGDKSVTAGPWDFRFVRSLRNLDRFEDVGGCVMLAS 359
Query: 163 PGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEF------------ 210
PGML +G S + ++WAP E N +IM G+ V+GT+G ++++ +++
Sbjct: 360 PGMLQTGTSRELLERWAPNERNGVIMTGYSVEGTMGKQIINEPEQIPAVMSAKNAAGPSD 419
Query: 211 ENKQIIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEK-----I 265
+ K +I + ++ +SF+AH D + I+ V+LVHGE +M+ LK K +
Sbjct: 420 DQKIMIQRRCTVDEISFAAHVDGVENREFIESVAAPVVILVHGEKHQMMRLKSKLLSLNV 479
Query: 266 KQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLKEEAVKYNSEPPNPLKERQIHGVLVI 325
+E + + PAN + + M + V L E S P R + GVLV
Sbjct: 480 DKEVKVKVYTPANCDEVRIPF-MVDKVARVVGRLAE-----TSPPMGQDDSRLMDGVLVQ 533
Query: 326 KDSSISLMDVDEACKEVGISRHIVRFTSNVQISDSS 361
+S+M D+ + G++ +V + +S +S
Sbjct: 534 NGFKLSMMASDDLREYAGLTTTMVTCKQYITLSTAS 569
>gi|325090760|gb|EGC44070.1| endoribonuclease ysh1 [Ajellomyces capsulatus H88]
Length = 893
Score = 177 bits (450), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 124/407 (30%), Positives = 206/407 (50%), Gaps = 55/407 (13%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDF 59
MF + + +I++TGDY+ DRHL +A K + D+LITEST+ + + RE
Sbjct: 181 MFLISIAGLNILFTGDYSREEDRHLISAEAPKGVKVDVLITESTFGVSSNPPRLEREAAL 240
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERM-NLQA-PIYFAVGLTEKATNYY 117
+K + ++RGG+VL+PVFALGRAQEL ++L+ YW R LQ PIY+ + + Y
Sbjct: 241 IKSITSILNRGGRVLMPVFALGRAQELLLILDEYWSRHPELQKIPIYYIGNIARRCMVVY 300
Query: 118 KMFITWTNQKIRKTFVQR--------------NMFDFKHIRPFDK-SFIDNPGPMVVFAT 162
+ +I N+ I++ F QR +DF+ +R D+ G V+ A+
Sbjct: 301 QTYIGAMNENIKRLFRQRMAEAEASGDKSISAGPWDFRFVRSVRSIERFDDVGGCVMLAS 360
Query: 163 PGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKL-------------- 208
PGML +G S + ++WAP E N +IM G+ V+GT+G ++L+ +++
Sbjct: 361 PGMLQTGTSRELLERWAPNERNGVIMTGYSVEGTMGKQILNEPEQIPAVMSGRNAVGPAR 420
Query: 209 ------EFENKQIIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLK 262
+ E K +I + +++ +SF+AH D I+ V+LVHGE ++M+ LK
Sbjct: 421 RMATGDDDEQKVMIPRRCSVDEISFAAHVDGMENRAFIEEVGAPVVILVHGEKNQMMRLK 480
Query: 263 EKI-----KQEFNLDCFMPANGESCFV--QTDMKISIDVSVNLLKEEAVKYNSEPPNPLK 315
K+ + + + PAN E + Q D + V L E PP+ K
Sbjct: 481 SKLLSLNTDKTVKVKVYTPANCEEVRIPFQVDKVAKV---VGRLAETV------PPSDHK 531
Query: 316 E-RQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDSS 361
E R ++GVLV +SLM ++ + G++ V ++ ++ +S
Sbjct: 532 ESRLLNGVLVQHGFKLSLMAPEDLREYAGLATTTVTCKQHITLNTAS 578
>gi|67517547|ref|XP_658594.1| hypothetical protein AN0990.2 [Aspergillus nidulans FGSC A4]
gi|74598547|sp|Q5BEP0.1|YSH1_EMENI RecName: Full=Endoribonuclease ysh1; AltName: Full=mRNA
3'-end-processing protein ysh1
gi|40746402|gb|EAA65558.1| hypothetical protein AN0990.2 [Aspergillus nidulans FGSC A4]
gi|259488717|tpe|CBF88384.1| TPA: Endoribonuclease ysh1 (EC 3.1.27.-)(mRNA 3'-end-processing
protein ysh1) [Source:UniProtKB/Swiss-Prot;Acc:Q5BEP0]
[Aspergillus nidulans FGSC A4]
Length = 884
Score = 177 bits (448), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 130/436 (29%), Positives = 209/436 (47%), Gaps = 48/436 (11%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDF 59
MF + + +I++TGDY+ DRHL A + + + D+LITEST+ + + RE
Sbjct: 180 MFLISIAGLNILFTGDYSREEDRHLIPATVPRGVKIDVLITESTFGISSNPPRLEREAAL 239
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWE-RMNLQA-PIYFAVGLTEKATNYY 117
+K + ++RGG+VL+PVFALGRAQEL ++LE YWE LQ PIY+ + Y
Sbjct: 240 MKSITGVLNRGGRVLMPVFALGRAQELLLILEEYWETHPELQKIPIYYIGNTARRCMVVY 299
Query: 118 KMFITWTNQKIRKTFVQR--------------NMFDFKHIRPFDK-SFIDNPGPMVVFAT 162
+ +I N I++ F QR +DFK++R D+ G V+ A+
Sbjct: 300 QTYIGAMNDNIKRLFRQRMAEAEASGDKSVSAGPWDFKYVRSLRSLERFDDVGGCVMLAS 359
Query: 163 PGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIG----------HKVLS----GVKKL 208
PGML +G S + ++WAP E N ++M G+ V+GT+ H V+S G+ +
Sbjct: 360 PGMLQTGTSRELLERWAPNERNGVVMTGYSVEGTMAKQLLNEPDQIHAVMSRAATGMGRT 419
Query: 209 EF-----ENKQIIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKE 263
E K +I + ++ +SF+AH D I+ V+LVHGE +M+ LK
Sbjct: 420 RMNGNDEEQKIMIPRRCTVDEISFAAHVDGVENRNFIEEVSAPVVILVHGEKHQMMRLKS 479
Query: 264 KI-----KQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLKEEAVKYNSEPPNPLKERQ 318
K+ ++ + + PAN E + V L + N + PL
Sbjct: 480 KLLSLNAEKTVKVKVYTPANCEEVRIPFRKDKIAKVVGKLAQTTLPTDNEDGDGPL---- 535
Query: 319 IHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDSSP--INKTLELIYDQLLS 376
+ GVLV +SLM D+ + G++ + ++ +S +S I LE + +
Sbjct: 536 MAGVLVQNGFDLSLMAPDDLREYAGLATTTITCKQHITLSSASMDLIKWALEGTFGAIEE 595
Query: 377 YLQDKSQEYKIQLTES 392
D+ E + Q +ES
Sbjct: 596 IGTDEDAEKEDQQSES 611
>gi|321264788|ref|XP_003197111.1| cleavage and polyadenylation specificity factor [Cryptococcus
gattii WM276]
gi|317463589|gb|ADV25324.1| Cleavage and polyadenylation specificity factor, putative
[Cryptococcus gattii WM276]
Length = 778
Score = 176 bits (446), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 116/382 (30%), Positives = 194/382 (50%), Gaps = 15/382 (3%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ I+YTGDY+ DRHL A I +PD++I EST+ ++ +E F
Sbjct: 187 MFLIEIAGLKILYTGDYSREEDRHLVMAEIPPVKPDVMICESTFGVHTLPDRKEKEEQFT 246
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW-ERMNLQA-PIYFAVGLTEKATNYYK 118
V V RGG+ L+P+ + G QEL +LL+ YW + LQ P+YFA L ++ YK
Sbjct: 247 TLVANIVRRGGRCLMPIPSFGNGQELALLLDEYWNDHPELQNIPVYFASSLFQRGMRVYK 306
Query: 119 MFITWTNQKIRKTFVQR-NMFDFKHIRPF--DKSFIDNPGPMVVFATPGMLHSGLSLIIF 175
++ N IR F +R N FDF+ ++ + +N GP V+ ++P + GLS +
Sbjct: 307 TYVHTMNANIRSRFARRDNPFDFRFVKWLKDPQKLRENKGPCVIMSSPQFMSFGLSRDLL 366
Query: 176 KKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKG 235
++WAP N +I+ G+ ++GT+ +LS +E I ++ ++ +SF AH D
Sbjct: 367 EEWAPDSKNGVIVTGYSIEGTMARTLLSEPDHIESLKGGNIPRRLTVKEISFGAHVDYAQ 426
Query: 236 IMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVS 295
+ IQ ++V+LVHGEAS+M L+ ++ + A G+ + T K ++
Sbjct: 427 NSKFIQEIGAQHVVLVHGEASQMGRLRAALRDTYA------AKGQEINIHTP-KNCEPLT 479
Query: 296 VNLLKEEAVKY--NSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTS 353
+ +E VK + P + G+LV KD S +L+ + G+S +
Sbjct: 480 LTFRQERMVKAIGSLASTRPEHGTSVKGLLVSKDFSYTLLSPADLHDFTGLSTSTIIQKQ 539
Query: 354 NVQIS-DSSPINKTLELIYDQL 374
V IS D + + LE +Y ++
Sbjct: 540 GVAISVDWAVVRWYLEGMYGEV 561
>gi|402696939|gb|AFQ90658.1| 73kDa cleavage and polyadenylation specific factor 3, partial
[Draco beccarii]
Length = 220
Score = 176 bits (446), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 88/199 (44%), Positives = 122/199 (61%), Gaps = 3/199 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGD++ DRHL AA I +PD+LI ESTY T I + + RE F
Sbjct: 22 MFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPNIKPDILIIESTYGTHIHEKREEREARFC 81
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWE--RMNLQAPIYFAVGLTEKATNYYK 118
VH+ V+RGG+ LIPVFALGRAQEL ++L+ YW+ + PIY+A L +K Y+
Sbjct: 82 NTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQXXXXXXEIPIYYASSLAKKCMAVYQ 141
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ N KIRK N F FKHI D+ GP VV A+PGM+ SGLS +F+
Sbjct: 142 TYVNAMNDKIRKQININNPFVFKHISNLKSMDHFDDIGPSVVMASPGMMQSGLSRELFES 201
Query: 178 WAPVENNMLIMPGFCVQGT 196
W + N +I+ G+CV+GT
Sbjct: 202 WCTDKRNGVIIAGYCVEGT 220
>gi|119185911|ref|XP_001243562.1| hypothetical protein CIMG_03003 [Coccidioides immitis RS]
gi|392870265|gb|EJB11994.1| endoribonuclease ysh1 [Coccidioides immitis RS]
Length = 881
Score = 176 bits (446), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 119/405 (29%), Positives = 205/405 (50%), Gaps = 51/405 (12%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDF 59
MF + + +I++TGDY+ DRHL +A + K + D+LI EST+ + + RE
Sbjct: 181 MFLISIAGLNILFTGDYSREEDRHLVSAEVPKGIKIDVLIAESTFGISSNPPRLERETAL 240
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERM-NLQA-PIYFAVGLTEKATNYY 117
+K V ++RGG+VL+PVFALGRAQEL ++L+ YW R LQ PIY+ + + Y
Sbjct: 241 MKSVTSVLNRGGRVLMPVFALGRAQELLLILDEYWGRHPELQKIPIYYIGNMARRCMVVY 300
Query: 118 KMFITWTNQKIRKTFVQR--------------NMFDFKHIRPF-DKSFIDNPGPMVVFAT 162
+ +I N I++ F QR +DFK +R + D+ G V+ A+
Sbjct: 301 QTYIGAMNDNIKRLFRQRMAEAEARGDKSTTAGPWDFKFVRSVRNLERFDDVGGCVMLAS 360
Query: 163 PGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKL-------------- 208
PGML +G S + ++WAP E N +IM G+ V+GT+G ++L+ +++
Sbjct: 361 PGMLQTGTSRELLERWAPSERNGVIMTGYSVEGTMGKQILNEPEQIPAVMSARTAAGPGR 420
Query: 209 ------EFENKQIIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLK 262
+ E K +I + +E +SF+AH D + I+ V+LVHGE +M+ LK
Sbjct: 421 GRIPDGDEEQKVMIPRRCTVEEISFAAHVDGVENREFIEAVAAPVVILVHGEKHQMMRLK 480
Query: 263 EKI-----KQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLKEEAVKYNSEPPNPLKE- 316
K+ ++ + + P N E + + ++D ++ A + PP +E
Sbjct: 481 SKLLSLNAEKTVKVKIYSPTNCE----EIRIPFTVDKVAKVVGRLA---ETSPPAADQES 533
Query: 317 RQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDSS 361
R ++GVLV +S+M ++ + G++ + ++ + +S
Sbjct: 534 RLMNGVLVQNGFQLSMMAPEDLREYAGLTTTTITCKQHITLGTAS 578
>gi|195149772|ref|XP_002015829.1| GL10807 [Drosophila persimilis]
gi|194109676|gb|EDW31719.1| GL10807 [Drosophila persimilis]
Length = 494
Score = 176 bits (445), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 111/380 (29%), Positives = 190/380 (50%), Gaps = 48/380 (12%)
Query: 655 LVVESAKVSQL-----MSEKKSYEVQVMSQVVAAVTNSCDSSH---IIDLGGGQGYLSTI 706
L+ E K SQ+ MS KK +EV++ + +V + S H I+D G G+GYLS+
Sbjct: 103 LIAEQDKASQITIREFMSAKKCHEVELTAALVDNLIQSTGRQHDFFIVDAGDGKGYLSSR 162
Query: 707 LALQHGKKTLSLDYNQVNTHGAAVRSKKLEDSEKESKGPFQSYAGVINKKLWMRTQNRQT 766
+ALQ+G + L +D N NT A R++KL+ R N T
Sbjct: 163 MALQYGHRVLGIDANAANTESALNRNRKLQ-----------------------RAWNGLT 199
Query: 767 SHASVEHHGKNWKRKSKAPVVSIKDEELVVCKDKCKQITHFVTPDSDISSILSQAYPQDS 826
A ++ G KR+ K + + V ++ K F+ +++ ++L++ +P+ S
Sbjct: 200 ERAELQSQGITPKRRGKKSPTRNTAKSIPVLENY-KTTAKFINTETNFGNLLAEHFPKPS 258
Query: 827 LH---NVCIMGLHTCGDLSGTALRLF-TKSSLQCLVQVGCCYHLLEEEFIRSPFWKDVDQ 882
++C+ GLHTCG+L+ T L++F + Q L +GCCYHLL+E + + F+ +
Sbjct: 259 SAGNLSICLTGLHTCGNLAATCLQVFHAQPDCQILCNIGCCYHLLKERYSQQEFFGNKTL 318
Query: 883 SLYEHGYGFPLSEHLRSRKFFLGRNVRMSGTQSPERVIDLKQTQTLPLFYRALLEKYLRS 942
+ +GFPLS++L+ R+ LGRN RM QS ER ++ K+ + L+YRALLE
Sbjct: 319 MEMQTDFGFPLSQYLQMRQIRLGRNARMLAAQSIERTLNGKELPNITLYYRALLE----- 373
Query: 943 KITINDEEPKVVGRLAT----KCSNFVEYVHRAVDKLKLDL--EVDDEEVTRLFNSHQRE 996
+ + P + L K NF +Y+ + KL ++ E+ + +
Sbjct: 374 -VLVCRHAPHLKNELQVGKVRKFDNFQDYIQKCARKLDAPWLASIETAELQSILQEFALD 432
Query: 997 YEYLQIYYFLKTALAPVIEA 1016
+L ++Y ++ + APV+E+
Sbjct: 433 KHFLDMFYLIRMSFAPVLES 452
Score = 53.5 bits (127), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 37/156 (23%), Positives = 79/156 (50%), Gaps = 14/156 (8%)
Query: 485 IKNTIDEMIEFITPNIACVNTHTVHFMVEKLWETTIPKAILSQVNNSPSFDSLLEE-FWK 543
++ +D ++ ++ P+ +N H V+++ ++ W +P+A+ +++N+ D +E FW+
Sbjct: 8 LEKRLDALLLYLQPHWDFLNCHMVNYLTDQHWTRFLPEAMRNEINSKKDLDLAIENLFWE 67
Query: 544 SRQDNHVETNSELVKFFQAADKFRLTSLMNGESIITVDELFSKLQARKCGQVVESAKVSQ 603
++ SE F A+ L+ + E + TVD+L + + K Q+ + +
Sbjct: 68 KKRS--TTEFSEWEDFLTNAEHEHLSE--HPELLTTVDQLIA--EQDKASQIT----IRE 117
Query: 604 LMSEKKSYEVQVMSQVVAAVTNSCDSSH---IIDLG 636
MS KK +EV++ + +V + S H I+D G
Sbjct: 118 FMSAKKCHEVELTAALVDNLIQSTGRQHDFFIVDAG 153
>gi|303323846|ref|XP_003071912.1| metallo-beta-lactamase superfamily protein [Coccidioides posadasii
C735 delta SOWgp]
gi|240111619|gb|EER29767.1| metallo-beta-lactamase superfamily protein [Coccidioides posadasii
C735 delta SOWgp]
Length = 881
Score = 176 bits (445), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 119/405 (29%), Positives = 205/405 (50%), Gaps = 51/405 (12%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDF 59
MF + + +I++TGDY+ DRHL +A + K + D+LI EST+ + + RE
Sbjct: 181 MFLISIAGLNILFTGDYSREEDRHLVSAEVPKGIKIDVLIAESTFGISSNPPRLERETAL 240
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERM-NLQA-PIYFAVGLTEKATNYY 117
+K V ++RGG+VL+PVFALGRAQEL ++L+ YW R LQ PIY+ + + Y
Sbjct: 241 MKSVTSVLNRGGRVLMPVFALGRAQELLLILDEYWGRHPELQKIPIYYIGNMARRCMVVY 300
Query: 118 KMFITWTNQKIRKTFVQR--------------NMFDFKHIRPF-DKSFIDNPGPMVVFAT 162
+ +I N I++ F QR +DFK +R + D+ G V+ A+
Sbjct: 301 QTYIGAMNDNIKRLFRQRMAEAEARGDKSTTAGPWDFKFVRSVRNLERFDDVGGCVMLAS 360
Query: 163 PGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKL-------------- 208
PGML +G S + ++WAP E N +IM G+ V+GT+G ++L+ +++
Sbjct: 361 PGMLQTGTSRELLERWAPSERNGVIMTGYSVEGTMGKQILNEPEQIPAVMSARTAAGPGR 420
Query: 209 ------EFENKQIIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLK 262
+ E K +I + +E +SF+AH D + I+ V+LVHGE +M+ LK
Sbjct: 421 GRIPDGDEEQKVMIPRRCTVEEISFAAHVDGVENREFIEAVAAPVVILVHGEKHQMMRLK 480
Query: 263 EKI-----KQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLKEEAVKYNSEPPNPLKE- 316
K+ ++ + + P N E + + ++D ++ A + PP +E
Sbjct: 481 SKLLSLNAEKTVKVKIYSPTNCE----EIRIPFTVDKVAKVVGRLA---ETSPPAADQES 533
Query: 317 RQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDSS 361
R ++GVLV +S+M ++ + G++ + ++ + +S
Sbjct: 534 RLMNGVLVQNGFQLSMMAPEDLREYAGLTTTTITCKQHITLGTAS 578
>gi|320032162|gb|EFW14117.1| cleavage and polyadenylation specificity factor [Coccidioides
posadasii str. Silveira]
Length = 881
Score = 176 bits (445), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 119/405 (29%), Positives = 205/405 (50%), Gaps = 51/405 (12%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDF 59
MF + + +I++TGDY+ DRHL +A + K + D+LI EST+ + + RE
Sbjct: 181 MFLISIAGLNILFTGDYSREEDRHLVSAEVPKGIKIDVLIAESTFGISSNPPRLERETAL 240
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERM-NLQA-PIYFAVGLTEKATNYY 117
+K V ++RGG+VL+PVFALGRAQEL ++L+ YW R LQ PIY+ + + Y
Sbjct: 241 MKSVTSVLNRGGRVLMPVFALGRAQELLLILDEYWGRHPELQKIPIYYIGNMARRCMVVY 300
Query: 118 KMFITWTNQKIRKTFVQR--------------NMFDFKHIRPF-DKSFIDNPGPMVVFAT 162
+ +I N I++ F QR +DFK +R + D+ G V+ A+
Sbjct: 301 QTYIGAMNDNIKRLFRQRMAEAEARGDKSTTAGPWDFKFVRSVRNLERFDDVGGCVMLAS 360
Query: 163 PGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKL-------------- 208
PGML +G S + ++WAP E N +IM G+ V+GT+G ++L+ +++
Sbjct: 361 PGMLQTGTSRELLERWAPSERNGVIMTGYSVEGTMGKQILNEPEQIPAVMSARTAAGPGR 420
Query: 209 ------EFENKQIIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLK 262
+ E K +I + +E +SF+AH D + I+ V+LVHGE +M+ LK
Sbjct: 421 GRIPDGDEEQKVMIPRRCTVEEISFAAHVDGVENREFIEAVAAPVVILVHGEKHQMMRLK 480
Query: 263 EKI-----KQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLKEEAVKYNSEPPNPLKE- 316
K+ ++ + + P N E + + ++D ++ A + PP +E
Sbjct: 481 SKLLSLNAEKTVKVKIYSPTNCE----EIRIPFTVDKVAKVVGRLA---ETSPPAADQES 533
Query: 317 RQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDSS 361
R ++GVLV +S+M ++ + G++ + ++ + +S
Sbjct: 534 RLMNGVLVQNGFQLSMMAPEDLREYAGLTTTTITCKQHITLGTAS 578
>gi|405124298|gb|AFR99060.1| endoribonuclease YSH1 [Cryptococcus neoformans var. grubii H99]
Length = 770
Score = 176 bits (445), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 116/379 (30%), Positives = 192/379 (50%), Gaps = 15/379 (3%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ I+YTGDY+ DRHL A I +PD++I EST+ ++ +E F
Sbjct: 187 MFLIEIAGLMILYTGDYSREEDRHLVMAEIPPVKPDVMICESTFGVHTLPDRKEKEEQFT 246
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW-ERMNLQ-APIYFAVGLTEKATNYYK 118
V V RGG+ L+P+ + G QEL +LL+ YW + LQ P+YFA L ++ YK
Sbjct: 247 TLVANIVRRGGRCLMPIPSFGNGQELALLLDEYWHDHPELQNIPVYFASSLFQRGMRVYK 306
Query: 119 MFITWTNQKIRKTFVQR-NMFDFKHIRPF--DKSFIDNPGPMVVFATPGMLHSGLSLIIF 175
++ N IR F +R N FDF+ ++ + +N GP V+ ++P + GLS +
Sbjct: 307 TYVHTMNANIRSRFARRDNPFDFRFVKWLKDPQKLRENKGPCVIMSSPQFMSFGLSRDLL 366
Query: 176 KKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKG 235
++WAP N +I+ G+ ++GT+ +LS +E I ++ ++ +SF AH D
Sbjct: 367 EEWAPDSKNGVIVTGYSIEGTMARTLLSEPDHIESLKGGNIPRRLTVKEISFGAHVDYAQ 426
Query: 236 IMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVS 295
+ IQ ++V+LVHGEAS+M L+ ++ + A G+ + T K ++
Sbjct: 427 NSKFIQEIGAQHVVLVHGEASQMGRLRAALRDTYA------AKGQEINIHTP-KNCEPLT 479
Query: 296 VNLLKEEAVKY--NSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTS 353
+ +E VK + P + G+LV KD S +L+ + G+S +
Sbjct: 480 LTFRQERMVKAIGSLAATRPEHGTSVKGLLVSKDFSYTLLSPADLHDFTGLSTSTIIQKQ 539
Query: 354 NVQIS-DSSPINKTLELIY 371
V IS D + + LE +Y
Sbjct: 540 GVAISVDWAVVRWYLEGMY 558
>gi|401882746|gb|EJT46990.1| cleavage and polyadenylation specificity factor [Trichosporon
asahii var. asahii CBS 2479]
gi|406700483|gb|EKD03650.1| cleavage and polyadenylation specificity factor [Trichosporon
asahii var. asahii CBS 8904]
Length = 738
Score = 175 bits (444), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 117/386 (30%), Positives = 192/386 (49%), Gaps = 38/386 (9%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGDY+ DRHL A + +PD++I EST+ ++ +E F
Sbjct: 195 MFLIEIAGLKVLYTGDYSREEDRHLVIAEVPPVKPDVMICESTFGVHTLPDRKDKEEQFT 254
Query: 61 KK------------VHECVDRGGKVLIPVFALGRAQELCILLETYW-ERMNLQA-PIYFA 106
+ V V RGGKVL+P+ + G QEL +LL+ YW + LQ PIYFA
Sbjct: 255 SELISRATQLTSALVSNIVRRGGKVLMPIPSFGNGQELALLLDDYWNDHPELQGVPIYFA 314
Query: 107 VGLTEKATNYYKMFITWTNQKIRKTFVQR-NMFDFKHIRPF-DKSFIDNPGPMVVFATPG 164
GL ++ YK ++ N IR F +R N FDFK+++ D +D+ P VV A+
Sbjct: 315 SGLFQRGMRVYKKYVHTMNANIRSRFARRDNPFDFKYVKWLKDPKRLDHKQPCVVMASAQ 374
Query: 165 MLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLE-FENKQIIDVKMAIE 223
+ GLS + ++WAP N +I+ G+ ++GT+ +L +E + I + ++
Sbjct: 375 FMSFGLSRELLEEWAPDPKNGVIVTGYSIEGTMARTLLGEPDHIESLRGGRSIPRRCTVK 434
Query: 224 YMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF-----NLDCFMPAN 278
+SF AH D + IQ ++V+LVHGEAS+M L+ ++ ++ ++ P N
Sbjct: 435 EISFGAHVDYAQNSKFIQEIGAQHVVLVHGEASQMGRLRAALRDQYATRGQEINIHTPKN 494
Query: 279 GESCFVQTDMKISIDVSVNLLKEEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEA 338
E + + + ++ L EE + +E + G+LV KD S +L+D +
Sbjct: 495 LEPLVLNFRTE-RVAKAIGTLAEERPTHGAE---------VRGLLVSKDFSYTLLDPKDL 544
Query: 339 CKEVGISRHIVRFTSNVQISDSSPIN 364
G+S TS + S PI
Sbjct: 545 KDFTGLS------TSTIVQRQSLPIG 564
>gi|121700651|ref|XP_001268590.1| cleavage and polyadenylation specifity factor, 73 kDa subunit,
putative [Aspergillus clavatus NRRL 1]
gi|119396733|gb|EAW07164.1| cleavage and polyadenylation specifity factor, 73 kDa subunit,
putative [Aspergillus clavatus NRRL 1]
Length = 878
Score = 175 bits (444), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 123/403 (30%), Positives = 197/403 (48%), Gaps = 48/403 (11%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDF 59
MF V + +I++TGDY+ DRHL A + K + D+LITEST+ + + RE
Sbjct: 180 MFLVSIAGLNILFTGDYSREEDRHLIPAEVPKGIKIDVLITESTFGISTNPPRLEREAAL 239
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWE-RMNLQA-PIYFAVGLTEKATNYY 117
+K + ++RGG+VL+PVFALGRAQEL ++LE YWE +LQ PIY+ + Y
Sbjct: 240 MKAITGVLNRGGRVLMPVFALGRAQELLLILEEYWETHPDLQKIPIYYIGNTARRCMVVY 299
Query: 118 KMFITWTNQKIRKTFVQR--------------NMFDFKHIRPFDK-SFIDNPGPMVVFAT 162
+ +I N I++ F QR +DF+ +R D+ G V+ A+
Sbjct: 300 QTYIGAMNDNIKRLFRQRMAEAEASGDKSASAGPWDFRFVRSLRSLERFDDVGGCVMLAS 359
Query: 163 PGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLS------------------- 203
PGML +G S + ++WAP E N ++M G+ V+GT+ ++L+
Sbjct: 360 PGMLQTGTSRELLERWAPNERNGVVMTGYSVEGTMAKQLLNEPDQIPAVMSRAPGGLSRR 419
Query: 204 GVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKE 263
G+ + E K +I + ++ +SF+AH D I+ V+LVHGE +M+ LK
Sbjct: 420 GLASGDEEQKIMIPRRCTVDEISFAAHVDGVENRNFIEQVAAPVVILVHGEKHQMMRLKS 479
Query: 264 KI-----KQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLKEEAVKYNSEPPNPLKERQ 318
K+ + + + PAN E + K I V L + A P R
Sbjct: 480 KLLSLNADKTVKVKVYTPANCEEVCIPF-RKDKIAKVVGKLAQLAA-----PSEQDDGRL 533
Query: 319 IHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDSS 361
+ GVLV ++SLM D+ + G++ + ++ +S +S
Sbjct: 534 MSGVLVQNGFNLSLMAPDDLREYAGLTTTTITCKQHITLSSAS 576
>gi|402084516|gb|EJT79534.1| endoribonuclease YSH1 [Gaeumannomyces graminis var. tritici
R3-111a-1]
Length = 868
Score = 175 bits (444), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 117/413 (28%), Positives = 204/413 (49%), Gaps = 60/413 (14%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDF 59
MF +++ ++ +TGDY+ DRHL +A + + + D+LITESTY + RE+
Sbjct: 180 MFLIEIAGLNVFFTGDYSREQDRHLVSAEVPRGVQIDVLITESTYGIASHVPRMEREQAL 239
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYY 117
+K + ++RGG+VL+PVFALGRAQEL ++L+ YW+R + + PIY+A L K Y
Sbjct: 240 MKSITGILNRGGRVLMPVFALGRAQELLLILDEYWDRHSEYQKVPIYYASNLARKCMVVY 299
Query: 118 KMFITWTNQKIRKTFVQR--------------NMFDFKHIRPFDK-SFIDNPGPMVVFAT 162
+ ++ N I++ F +R +DFK IR D+ GP V+ A+
Sbjct: 300 QTYVGAMNDNIKRLFRERLAEAEAAGNVGTGGGPWDFKFIRSLKNLDRFDDLGPCVMLAS 359
Query: 163 PGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEF------------ 210
PGML +G+S + ++WAP + N +++ G+ V+GT+ +++ ++
Sbjct: 360 PGMLQTGVSRELLERWAPSDKNGVVITGYSVEGTMAKQIMQEPDQIHAVMTRNAAAASRR 419
Query: 211 --ENKQIIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKI--- 265
K +I + ++ SF+AH D + I+ V+LVHGE + M+ LK K+
Sbjct: 420 ADGEKLMIPRRCSVAEFSFAAHVDGTENREFIEEVNAPVVILVHGEVNNMMRLKSKLLSL 479
Query: 266 --KQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLKEEAVKYNSEPP------------ 311
+ + F P N E ++ I + + + K + PP
Sbjct: 480 NANKTEKVKVFSPRNCE--------ELRIPFKADKMAKVVGKLAAIPPPTKLPPADGGLD 531
Query: 312 NPLKERQ---IHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDSS 361
PL+++Q I GVLV D +SLM ++ + G++ + +++S +
Sbjct: 532 GPLQQQQQRLITGVLVQNDFKLSLMAPEDLREYAGLTTTTIVCRQRIRLSAAG 584
>gi|156064885|ref|XP_001598364.1| conserved hypothetical protein [Sclerotinia sclerotiorum 1980]
gi|154691312|gb|EDN91050.1| conserved hypothetical protein [Sclerotinia sclerotiorum 1980
UF-70]
Length = 820
Score = 175 bits (444), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 117/384 (30%), Positives = 192/384 (50%), Gaps = 47/384 (12%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDF 59
MF +++ I +TGDY+ DRHL +A + K + D+LITESTY + RE+
Sbjct: 179 MFLIEIAGLKIFFTGDYSREDDRHLVSAEVPKGVKIDVLITESTYGIASHIPRLEREQAL 238
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYY 117
+K V ++RGG+VL+PVFALGRAQEL ++L+ YW++ + PIY+A L K Y
Sbjct: 239 MKSVTSILNRGGRVLMPVFALGRAQELLLILDEYWDKHPEFQKIPIYYASNLARKCMLVY 298
Query: 118 KMFITWTNQKIRKTFVQR--------------NMFDFKHIRPFDK-SFIDNPGPMVVFAT 162
+ ++ N+ I++ F +R +DFK+IR D+ G V+ A+
Sbjct: 299 QTYVGSMNENIKRLFRERMAEAEANSTSGGRGGPWDFKYIRSLKNLDRFDDVGGCVILAS 358
Query: 163 PGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHK----------VLSGVKKLEFEN 212
PGML +G+S + ++WAP + N +I+ G+ V+GT+ + ++SG + +
Sbjct: 359 PGMLQNGISRQLLERWAPSDKNGVIITGYSVEGTMAKQIMQEPDHIQAIMSGPQGFTRRS 418
Query: 213 ------KQIIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKI- 265
K +I + +++ SF+AH D + I+ V+LVHGE M+ LK K+
Sbjct: 419 VPGETEKAMIPRRCSVQEFSFAAHVDGVENREFIEEIAAPVVILVHGEQHNMMRLKSKLL 478
Query: 266 ----KQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLKEEAVKYNSEPPNPLKERQIHG 321
+ + F P N E + I + + K + PP + I G
Sbjct: 479 SLNADKTTKVKVFSPKNCE--------PLRIPFVTSKTAKVVGKLANNPPPADSSQLITG 530
Query: 322 VLVIKDSSISLMDVDEACKEVGIS 345
VLV D +SLM ++ + G++
Sbjct: 531 VLVQNDFKMSLMAPEDLREYAGLT 554
>gi|443713763|gb|ELU06463.1| hypothetical protein CAPTEDRAFT_141423, partial [Capitella teleta]
Length = 369
Score = 175 bits (444), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 100/284 (35%), Positives = 159/284 (55%), Gaps = 19/284 (6%)
Query: 59 FLKKVHECVDRGGKVLIPVFALGRAQELCILLETYW-ERMNLQ-APIYFAVGLTEKATNY 116
F + + V+RGG+ LIPVFALGRAQEL ++L+ YW + LQ PIY+A L +K +
Sbjct: 3 FTSLISDIVNRGGRCLIPVFALGRAQELLLILDEYWSQHPELQDIPIYYASSLAKKCMSV 62
Query: 117 YKMFITWTNQKIRKTFVQRNMFDFKHIRPFD-KSFIDNPGPMVVFATPGMLHSGLSLIIF 175
Y+ +I N KI++ N F FKHI S D+ GP VV A+PGM+ SGLS +F
Sbjct: 63 YQTYINAMNDKIKRQINTNNPFVFKHISNLKASSHFDDIGPSVVMASPGMMQSGLSRELF 122
Query: 176 KKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKG 235
+ W + N I+ G+CV+GT+ +LS +++ Q + +K +++Y+SFSAH D K
Sbjct: 123 ENWCTDKRNGCIIAGYCVEGTLAKHILSEPEEIVTMGGQKLPLKCSVDYISFSAHTDYKQ 182
Query: 236 IMQLIQYCEPKNVLLVHGEASKMVFLKEKI------KQEFNLDCFMPANGESCFVQTDMK 289
+ ++ +P +++LVHGEA++M LK + E+N+D F P N E
Sbjct: 183 TSEFVRILKPTHIVLVHGEANEMGRLKAALIREYEDDPEYNIDIFNPRNTE--------- 233
Query: 290 ISIDVSVNLLKEEAVKYNSEPPNPLKERQIHGVLVIKDSSISLM 333
+++ K + N P + +Q+ G+LV ++ + +M
Sbjct: 234 -PVELYFRGEKMAKIVGNLATDRPKQGQQLSGILVKRNFNYHIM 276
>gi|58270576|ref|XP_572444.1| hypothetical protein CNH02710 [Cryptococcus neoformans var.
neoformans JEC21]
gi|134118056|ref|XP_772409.1| hypothetical protein CNBL2750 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|338819805|sp|P0CM89.1|YSH1_CRYNB RecName: Full=Endoribonuclease YSH1; AltName: Full=mRNA
3'-end-processing protein YSH1
gi|338819806|sp|P0CM88.1|YSH1_CRYNJ RecName: Full=Endoribonuclease YSH1; AltName: Full=mRNA
3'-end-processing protein YSH1
gi|50255022|gb|EAL17762.1| hypothetical protein CNBL2750 [Cryptococcus neoformans var.
neoformans B-3501A]
gi|57228702|gb|AAW45137.1| hypothetical protein CNH02710 [Cryptococcus neoformans var.
neoformans JEC21]
Length = 773
Score = 175 bits (443), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 115/379 (30%), Positives = 192/379 (50%), Gaps = 15/379 (3%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ I+YTGDY+ DRHL A I +PD++I EST+ ++ +E F
Sbjct: 187 MFLIEIAGLKILYTGDYSREEDRHLVMAEIPPVKPDVMICESTFGVHTLPDRKEKEEQFT 246
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW-ERMNLQ-APIYFAVGLTEKATNYYK 118
V V RGG+ L+P+ + G QEL +LL+ YW + LQ P+YFA L ++ YK
Sbjct: 247 TLVANIVRRGGRCLMPIPSFGNGQELALLLDEYWNDHPELQNIPVYFASSLFQRGMRVYK 306
Query: 119 MFITWTNQKIRKTFVQR-NMFDFKHIRPF--DKSFIDNPGPMVVFATPGMLHSGLSLIIF 175
++ N IR F +R N FDF+ ++ + +N GP V+ ++P + GLS +
Sbjct: 307 TYVHTMNANIRSRFARRDNPFDFRFVKWLKDPQKLRENKGPCVIMSSPQFMSFGLSRDLL 366
Query: 176 KKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKG 235
++WAP N +I+ G+ ++GT+ +LS +E + ++ ++ +SF AH D
Sbjct: 367 EEWAPDSKNGVIVTGYSIEGTMARTLLSEPDHIESLKGGNVPRRLTVKEISFGAHVDYAQ 426
Query: 236 IMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVS 295
+ IQ ++V+LVHGEAS+M L+ ++ + A G+ + T K ++
Sbjct: 427 NSKFIQEIGAQHVVLVHGEASQMGRLRAALRDTYA------AKGQEINIHTP-KNCEPLT 479
Query: 296 VNLLKEEAVKY--NSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTS 353
+ +E VK + P + G+LV KD S +L+ + G+S +
Sbjct: 480 LTFRQERMVKAIGSLAATRPEHGTSVKGLLVSKDFSYTLLSPADLHDFTGLSTSTIIQKQ 539
Query: 354 NVQIS-DSSPINKTLELIY 371
V IS D + + LE +Y
Sbjct: 540 GVAISVDWAVVRWYLEGMY 558
>gi|66356658|ref|XP_625507.1| cleavage and polyadenylation specifity factor protein, CPSF
metallobeta-lactamase [Cryptosporidium parvum Iowa II]
gi|46226496|gb|EAK87490.1| cleavage and polyadenylation specifity factor protein, CPSF
metallobeta-lactamase [Cryptosporidium parvum Iowa II]
Length = 780
Score = 175 bits (443), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 99/262 (37%), Positives = 150/262 (57%), Gaps = 9/262 (3%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V++G I+YTGDY+ DRH+ A I +LI ESTY T I + + RE+ FL
Sbjct: 187 MFLVEIGGVRILYTGDYSREDDRHVPRAEIPPIDVHVLICESTYGTRIHEPRIDREKRFL 246
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERM-NLQ-APIYFAVGLTEKATNYYK 118
V + R GK L+PVFA+GRAQEL ++LE +W R ++Q PI +A ++ K ++
Sbjct: 247 GGVQSIITRKGKCLLPVFAIGRAQELLLILEEHWSRTPSIQNVPIIYASPMSIKCMRVFE 306
Query: 119 MFITWTNQKIRKTF-VQRNMFDFKHIRPFD-----KSFIDNPGPMVVFATPGMLHSGLSL 172
+I +R+ + N F F +I+ + K I NPGP VV A PGML +G S
Sbjct: 307 TYINQCGDSVRRQADLGINPFQFNYIKTVNSLNEIKDIIYNPGPCVVMAAPGMLQNGTSR 366
Query: 173 IIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHAD 232
IF+ WAP + N +I+ G+ V+GT +++ + ++ K +I ++ + +SFSAH+D
Sbjct: 367 DIFEIWAPDKRNGIILTGYAVRGTPAYELRKEPEMIQLGEK-VIPMRAKFDQISFSAHSD 425
Query: 233 AKGIMQLIQYCEPKNVLLVHGE 254
+ I + NV+LVHGE
Sbjct: 426 FTQTQEFINSLKVPNVILVHGE 447
>gi|344266431|ref|XP_003405284.1| PREDICTED: uncharacterized protein C12orf26-like [Loxodonta africana]
Length = 601
Score = 174 bits (442), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 158/598 (26%), Positives = 275/598 (45%), Gaps = 87/598 (14%)
Query: 482 MELIKNTIDEMIEFITPNIACVNTHTVHFMVEKLWETTI---PKAILSQVNNS------P 532
+ ++ + ++ F+ + N HTV F E LWE + P+ +L+ + S P
Sbjct: 13 LSTLRAKLQGLLRFLKDALPISNAHTVDFYTESLWEQLVDLPPETVLAVLRRSAAEAWPP 72
Query: 533 SFDSLLEEFWKSRQDNH----VETNSELVKFFQAADKFRLTSLMNGESIITVDELFSKLQ 588
L E +S + ET+ +LV A + SL N ++L LQ
Sbjct: 73 EARPLGETEGESGISDFSKIFCETSQKLVSVETFALAAKYYSLQNLGICTPFEQLLIALQ 132
Query: 589 ARKCGQVVESAKVSQLMSEKKSYEVQVMSQVVAAVTNSCDSSHIIDLGSTITVDELFSKL 648
+ + E+ K + M+ KKS+EV +MS++++++ + C +IDLGS S
Sbjct: 133 GNQKQRTGENMKPDEFMNMKKSHEVHIMSELISSIADFCGIKQVIDLGSG---KGYLSSF 189
Query: 649 QARKCGLVV---ESAKVSQLMSEKKSYEVQVMSQVVAAVTNSCDSSHIIDLGGGQGYLST 705
+ K GL V +S+ VS +E+++ +++ +V + +D+ G
Sbjct: 190 LSLKYGLKVYGIDSSNVSTHGAEERNRKLKKHWRVYRTRSR-------LDVNG------- 235
Query: 706 ILALQHGKKTLSLDYNQVNTHGAA-VRSKKLEDSEKESKGPFQSYAGV----INKK---L 757
LAL+ K+ D + + + E G F ++ I KK L
Sbjct: 236 -LALKMAKERKVQDEIKCKADTEEECNNSAINQEEMSLSGVFPDFSDPVICNIRKKVENL 294
Query: 758 WMRTQNRQ---------------------TSHASVEHHGKNWKRKSKAPVVSIKDEELVV 796
+ Q + TS + V G + K + + S E +
Sbjct: 295 YAEPQREENLCFKSVFPLIDFVPVNAVEPTSSSQVSSRGISEANKERKKITSKSSESNIY 354
Query: 797 CKDKCKQITHFVTPDSDISSILSQAYPQDSLHNVCIMGLHTCGDLSGTALRLFT-KSSLQ 855
+T F+T DS++ ++ L + ++GLHTCGDL+ LR+FT K ++
Sbjct: 355 -----SPLTSFITADSELCDVIKD------LEDCLMVGLHTCGDLAPNTLRIFTSKYEIK 403
Query: 856 CLVQVGCCYHLLEEEFIRSPFWKDVDQSLYEHGYGFPLSEHLRSRKFFLGRNVRMSGTQS 915
+ VGCCYHLL EEF ++ + L + +GFP+ +L+ ++ GRN RMS +
Sbjct: 404 GVCSVGCCYHLLSEEF------ENQHKELTQEKWGFPMCHYLKEERWCCGRNARMSACLA 457
Query: 916 PERVIDLKQTQTLPLFYRALLEKYLRS--KITINDEEPKVVGRLATKCSNFVEYVHRAVD 973
ERV + T LFYRA+L+ ++ IT D+ VG++ +K S+F++YV +++
Sbjct: 458 LERVAVGQGLPTESLFYRAVLQDIIKDCYGITKCDQH---VGKIYSKSSSFLDYVRKSLK 514
Query: 974 KLKLD-LEVDDEEVTRLFNSHQREYEYLQIYYFLKTALAPVIEALIVLDRVLYLREQQ 1030
KL LD ++ ++ + + ++ L+ + LK LAP IE LI+LDR+ YL+EQ+
Sbjct: 515 KLGLDESKLPEKIIMDYYKKYKPRMNELEAFNMLKVVLAPCIETLILLDRLCYLKEQE 572
>gi|156343760|ref|XP_001621104.1| hypothetical protein NEMVEDRAFT_v1g222359 [Nematostella vectensis]
gi|156206741|gb|EDO29004.1| predicted protein [Nematostella vectensis]
Length = 388
Score = 174 bits (442), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 89/206 (43%), Positives = 127/206 (61%), Gaps = 7/206 (3%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ I+YTGD++ DRHL AA I PD+LI ESTY T I + + RE F
Sbjct: 99 MFMLEIAGVKILYTGDFSRQEDRHLMAAEIPSISPDVLIIESTYGTHIHEKREEREARFT 158
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH+ V+RGG+ LIPVFALGRAQEL ++L+ YW+ PIY+A L +K + ++
Sbjct: 159 GTVHDIVNRGGRCLIPVFALGRAQELLLILDEYWQNHPELHDIPIYYASQLAKKCMSVFQ 218
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHI---RPFDKSFIDNPGPMVVFATPGMLHSGLSLIIF 175
++ N KI+K N F FKHI + D+ D+ GP VV A+PGM+ SGLS +F
Sbjct: 219 TYVNAMNDKIKKQIAISNPFVFKHISNLKSIDQ--FDDIGPSVVMASPGMMQSGLSRELF 276
Query: 176 KKWAPVENNMLIMPGFCVQGTIGHKV 201
++W N +I+ G+CV+GT+ +V
Sbjct: 277 EQWCTDRRNGVIIAGYCVEGTLAKEV 302
>gi|209876680|ref|XP_002139782.1| cleavage and polyadenylation specificity factor subunit 3
[Cryptosporidium muris RN66]
gi|209555388|gb|EEA05433.1| cleavage and polyadenylation specificity factor subunit 3, putative
[Cryptosporidium muris RN66]
Length = 767
Score = 174 bits (442), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 121/392 (30%), Positives = 206/392 (52%), Gaps = 19/392 (4%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V++G I+YTGDY+ DRH+ A I +LI ESTY T + + ++ RE+ FL
Sbjct: 190 MFLVEIGGVRILYTGDYSREDDRHVPRAEIPPVDVHVLICESTYGTRLHEPRKDREKRFL 249
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNL--QAPIYFAVGLTEKATNYYK 118
V + R GK L+PVFA+GRAQEL ++L+ +W + + PIY+A ++ K ++
Sbjct: 250 GCVQSILSRQGKCLLPVFAIGRAQELLLILDEHWAQTSCLHNIPIYYASPMSVKCMRVFE 309
Query: 119 MFITWTNQKIRKTF-VQRNMFDFKHIRPFD-----KSFIDNPGPMVVFATPGMLHSGLSL 172
+I +RK + N F+F+ ++ + K I + GP V+ A PGML +G S
Sbjct: 310 TYINQCGDAVRKQADMGINPFNFQFVKTVNSISEIKDAIYSEGPCVIMAAPGMLQNGTSR 369
Query: 173 IIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHAD 232
IF+ WAP + N +I+ G+ ++GT +++ ++ +K ++ +K + +SFSAH+D
Sbjct: 370 DIFEVWAPDKRNGVILTGYAIRGTPAYELRREPDIIQLGDK-LVPLKAKFDQISFSAHSD 428
Query: 233 AKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQ-EFNLDCFMPANGESCFVQTDMKIS 291
+ I + NV+LVHGE + L++K+ + +L F P + + ++++
Sbjct: 429 FAQTQEFISNLKVPNVILVHGERGECKRLRDKLTELRPSLAVFAPEILQKVSLTFPIQLN 488
Query: 292 IDV--SVNLLKEEAVKYNS--EPPNPLKERQIHGVLVIKDS--SISLMDVDEACKEVGIS 345
I S+ + +K NS + K I LVI S ++ +E + +G
Sbjct: 489 ITAVGSIADTIKRELKQNSIEDSIKTHKTADIEDALVIARPFESTLIIHPNELFQVLGTK 548
Query: 346 RHIVRFTSNVQISDS---SPINKTLELIYDQL 374
R +R T + I + S I KTL+ I+D +
Sbjct: 549 RSCIRHTLKINIPGNLQLSGIAKTLQGIFDDV 580
>gi|428671767|gb|EKX72682.1| cleavage and polyadenylation specificity factor protein, putative
[Babesia equi]
Length = 732
Score = 174 bits (441), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 107/286 (37%), Positives = 162/286 (56%), Gaps = 12/286 (4%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V++ I+YTGDY+ DRHL +A I LLI+ESTY I + + RE FL
Sbjct: 196 MFLVEMDGVRILYTGDYSTEYDRHLPSAEIPNINVHLLISESTYGIRIHEERTQREARFL 255
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW-ERMNLQA-PIYFAVGLTEKATNYYK 118
V + + R GK L+PVFALGRAQE+ ++LE YW LQ+ PI++ L K+ Y+
Sbjct: 256 HVVLDILMRDGKCLLPVFALGRAQEILLILEEYWAANKQLQSIPIFYISPLASKSLRVYE 315
Query: 119 MFITWTNQKIRKT-FVQRNMFDFKHIRPFD-----KSFIDNPGPMVVFATPGMLHSGLSL 172
FI + ++++ + N F+FK ++ ++++ GP VV +PGML G SL
Sbjct: 316 TFIGLCGEYVKESVYNGHNPFNFKFVKYAKSVESIRTYLLRDGPCVVMTSPGMLQGGPSL 375
Query: 173 IIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHAD 232
+F+ +AP N +I+ G+ V+GT+ + + L ++I + IE +SFSAH D
Sbjct: 376 EVFEIFAPDNRNGVILTGYTVKGTLADALKKDPEVLNL-GSRVIKRRCTIEQISFSAHTD 434
Query: 233 AKGIMQLI-QYCEPKNVLLVHGEASKMVFLKEKIKQEF-NLDCFMP 276
I + C P NV+LVHGE S+M +++K+ +E L FMP
Sbjct: 435 YNQTKDFIRRLCVP-NVILVHGERSEMKRMRDKLSEEMPELSIFMP 479
>gi|297739590|emb|CBI29772.3| unnamed protein product [Vitis vinifera]
Length = 680
Score = 174 bits (441), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 87/214 (40%), Positives = 126/214 (58%), Gaps = 9/214 (4%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V + ++YTGDY+ DRHL AA I + PD+ I ESTY + + RE+ F
Sbjct: 176 MFMVDIAGVRVLYTGDYSREEDRHLRAAEIPQFSPDICIIESTYGVQLHQPRHVREKRFT 235
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW----ERMNLQAPIYFAVGLTEKATNY 116
+H + +GG+VLIP FALGRAQEL ++L+ YW E N+ PIY+A L ++
Sbjct: 236 DVIHSTISQGGRVLIPAFALGRAQELLLILDEYWSNHPELHNI--PIYYASPLAKRCMAV 293
Query: 117 YKMFITWTNQKIRKTFVQRNMFDFKHIRPFDKSF--IDNPGPMVVFATPGMLHSGLSLII 174
Y+ +I N++IR F N FDFKHI P KS ++ GP VV A+P L SGLS +
Sbjct: 294 YQTYINSMNERIRNQFANSNPFDFKHISPL-KSIENFNDVGPSVVMASPSGLQSGLSRQL 352
Query: 175 FKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKL 208
F W + N ++PG+ V+GT+ +++ K++
Sbjct: 353 FDMWCSDKKNACVIPGYVVEGTLAKTIINEPKEM 386
>gi|340545979|gb|AEK51788.1| cleavage and polyadenylation specific factor 3 [Heteronotia binoei]
gi|402696941|gb|AFQ90659.1| 73kDa cleavage and polyadenylation specific factor 3, partial
[Malaclemys terrapin]
gi|402696943|gb|AFQ90660.1| 73kDa cleavage and polyadenylation specific factor 3, partial
[Testudo hermanni]
Length = 220
Score = 174 bits (441), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 88/199 (44%), Positives = 121/199 (60%), Gaps = 3/199 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGD++ DRHL AA I +PD+LI ESTY T I + + RE F
Sbjct: 22 MFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPNIKPDILIIESTYGTHIHEKREEREARFC 81
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH+ V+RGG+ LIPVFALGRAQEL ++L+ YW+ PIY+A L +K Y+
Sbjct: 82 NTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQ 141
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ N KIRK N F FKHI D+ GP VV A+PGM+ SGLS +F+
Sbjct: 142 TYVNAMNDKIRKQININNPFVFKHISNLKSMDHFDDIGPSVVMASPGMMQSGLSRELFES 201
Query: 178 WAPVENNMLIMPGFCVQGT 196
W + N +I+ G+CV+GT
Sbjct: 202 WCTDKRNGVIIAGYCVEGT 220
>gi|242778797|ref|XP_002479311.1| cleavage and polyadenylation specifity factor, 73 kDa subunit,
putative [Talaromyces stipitatus ATCC 10500]
gi|218722930|gb|EED22348.1| cleavage and polyadenylation specifity factor, 73 kDa subunit,
putative [Talaromyces stipitatus ATCC 10500]
Length = 861
Score = 174 bits (440), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 120/403 (29%), Positives = 200/403 (49%), Gaps = 48/403 (11%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDF 59
MF V + +I++TGDY+ DRHL A + + + D+LITEST+ + + RE
Sbjct: 181 MFLVSIAGLNILFTGDYSREEDRHLIPAEVPRGIKIDVLITESTFGISSNPPRLEREAAL 240
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYY 117
+K + ++RGG+VL+PVFALGRAQEL ++LE YWER + PIY+ + + Y
Sbjct: 241 MKSITGILNRGGRVLMPVFALGRAQELLLILEEYWERHPEYQKVPIYYIGNMARRCMVVY 300
Query: 118 KMFITWTNQKIRKTFVQR--------------NMFDFKHIRPFDK-SFIDNPGPMVVFAT 162
+ +I N I++ F QR +DF+++R D+ G V+ A+
Sbjct: 301 QTYIGAMNDNIKRLFRQRMAEAEASGNKNVAAGPWDFRYVRSLRSLERFDDIGSCVMLAS 360
Query: 163 PGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKL-------------- 208
PGML +G S + ++WAP E N ++M G+ V+GT+ ++L+ +++
Sbjct: 361 PGMLQTGTSRELLERWAPSERNGVVMTGYSVEGTMAKQLLNEPEQIPATMSKSAAGPGSR 420
Query: 209 -----EFENKQIIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKE 263
+ + K +I + I+ +SF+AH D I+ V+LVHGE +M+ LK
Sbjct: 421 ANPMADEDQKVMIPRRCTIDEISFAAHVDGVENRNFIEQVAAPVVILVHGEKHQMMRLKS 480
Query: 264 KI-----KQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLKEEAVKYNSEPPNPLKERQ 318
K+ ++ + + PAN E + K I V L EEA +P +
Sbjct: 481 KLLSLNAEKTNKVKVYTPANCEEIRIPF-RKDKIAKVVGKLAEEA-----QPTEIEGGKL 534
Query: 319 IHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDSS 361
+ GVLV +SLM ++ + G++ + + +S +S
Sbjct: 535 MAGVLVQNGFDLSLMAPEDLREYAGLTTTTITCKQLLTLSSAS 577
>gi|443926404|gb|ELU45071.1| mRNA 3'-end-processing protein YSH1 [Rhizoctonia solani AG-1 IA]
Length = 409
Score = 174 bits (440), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 96/263 (36%), Positives = 145/263 (55%), Gaps = 14/263 (5%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF + + I+YTGDY+ DRHL A + RPDLLI ESTY +++ RE F
Sbjct: 130 MFLIDIAGLQILYTGDYSREEDRHLVRAELPPIRPDLLIVESTYGVQGHEARESREARFT 189
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW-ERMNLQA-PIYFAVGLTEKATNYYK 118
VH V RGG VL+PVFALGRAQEL ++L+ YW L P+Y+A L K Y+
Sbjct: 190 SSVHTIVKRGGHVLLPVFALGRAQELLLILDEYWAAHPELHGVPVYYASNLARKCMAVYQ 249
Query: 119 MFITWTNQKIRKTFVQR-NMFDFKHIR--PFDKSF---IDNPGPMVVFATPGMLHSGLSL 172
+I N IR F ++ N F FKHI P + + I GP V+ A+PG + SG S
Sbjct: 250 TYIHTMNSHIRSRFARKDNPFVFKHISHLPATRGWERKIAEAGPCVILASPGFMSSGPSR 309
Query: 173 IIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLE------FENKQIIDVKMAIEYMS 226
+ + WAP N +I+ G+ ++GT+ ++ +++ + ++I ++++EY+S
Sbjct: 310 ELLELWAPDAKNGVIITGYSIEGTMARDIILEPDEIKPYRSDGPDGNKMIPRRLSVEYIS 369
Query: 227 FSAHADAKGIMQLIQYCEPKNVL 249
FSAH D + I+ ++V+
Sbjct: 370 FSAHVDGPQNTEFIEAVGARHVV 392
>gi|70996586|ref|XP_753048.1| cleavage and polyadenylation specifity factor, 73 kDa subunit,
putative [Aspergillus fumigatus Af293]
gi|74672067|sp|Q4WRC2.1|YSH1_ASPFU RecName: Full=Endoribonuclease ysh1; AltName: Full=mRNA
3'-end-processing protein ysh1
gi|66850683|gb|EAL91010.1| cleavage and polyadenylation specifity factor, 73 kDa subunit,
putative [Aspergillus fumigatus Af293]
gi|159131784|gb|EDP56897.1| cleavage and polyadenylation specifity factor, 73 kDa subunit,
putative [Aspergillus fumigatus A1163]
Length = 872
Score = 173 bits (439), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 122/404 (30%), Positives = 198/404 (49%), Gaps = 50/404 (12%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDF 59
MF + + +I++TGDY+ DRHL A + K + D+LITEST+ + + RE
Sbjct: 181 MFLISIAGLNILFTGDYSREEDRHLIPAEVPKGIKIDVLITESTFGISTNPPRLEREAAL 240
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWE-RMNLQA-PIYFAVGLTEKATNYY 117
+K + ++RGG+VL+PVFALGRAQEL ++L+ YWE LQ PIY+ + Y
Sbjct: 241 MKSITGILNRGGRVLMPVFALGRAQELLLILDEYWETHPELQKIPIYYIGNTARRCMVVY 300
Query: 118 KMFITWTNQKIRKTFVQR--------------NMFDFKHIRPFDK-SFIDNPGPMVVFAT 162
+ +I N I++ F QR +DFK +R D+ G V+ A+
Sbjct: 301 QTYIGAMNDNIKRLFRQRMAEAEASGDKSASAGPWDFKFVRSLRSLERFDDVGGCVMLAS 360
Query: 163 PGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLS------------------- 203
PGML +G S + ++WAP E N ++M G+ V+GT+ ++L+
Sbjct: 361 PGMLQTGTSRELLERWAPNERNGVVMTGYSVEGTMAKQLLNEPEQIPAVMSRSAGGVSRR 420
Query: 204 GVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKE 263
G+ + E K +I + ++ +SF+AH D I+ V+LVHGE +M+ LK
Sbjct: 421 GLAGTDEEQKIMIPRRCTVDEISFAAHVDGVENRNFIEEVAAPVVILVHGEKHQMMRLKS 480
Query: 264 KI-----KQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLKEEAVKYNSEPPNPLKE-R 317
K+ + + + PAN + + K I V L + A PP+ + R
Sbjct: 481 KLLSLNADKAVKVKVYTPANCDEVRIPF-RKDKIAKVVGKLAQVA------PPSDQDDGR 533
Query: 318 QIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDSS 361
+ GVLV +SLM D+ + G++ + ++ +S +S
Sbjct: 534 LMSGVLVQNGFDLSLMAPDDLREYAGLTTTTITCKQHITLSSAS 577
>gi|402696937|gb|AFQ90657.1| 73kDa cleavage and polyadenylation specific factor 3, partial
[Dibamus sp. JJF-2012]
Length = 220
Score = 173 bits (439), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 87/199 (43%), Positives = 120/199 (60%), Gaps = 3/199 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGD++ DRHL AA I +PD+LI ESTY T I + + RE F
Sbjct: 22 MFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPNIKPDILIIESTYGTHIHEKREEREARFC 81
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNL--QAPIYFAVGLTEKATNYYK 118
VH+ V+RGG+ LIPVFALGRAQEL ++L+ YW+ PIY+A L +K Y+
Sbjct: 82 NXVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQNHXXLHDIPIYYASSLAKKCMAVYQ 141
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ N KIRK N F KHI D+ GP VV A+PGM+ SGLS +F+
Sbjct: 142 TYVNAMNDKIRKXXXINNPFVJKHISNLKSMDHFDDIGPSVVMASPGMMQSGLSRELFES 201
Query: 178 WAPVENNMLIMPGFCVQGT 196
W + N +I+ G+CV+GT
Sbjct: 202 WCTDKRNGVIIAGYCVEGT 220
>gi|154322621|ref|XP_001560625.1| hypothetical protein BC1G_00653 [Botryotinia fuckeliana B05.10]
gi|347837188|emb|CCD51760.1| similar to cleavage and polyadenylation specifity factor
[Botryotinia fuckeliana]
Length = 828
Score = 173 bits (439), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 117/384 (30%), Positives = 191/384 (49%), Gaps = 47/384 (12%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDF 59
MF +++ I +TGDY+ DRHL +A + K + D+LITESTY + RE+
Sbjct: 179 MFLIEIAGLKIFFTGDYSREDDRHLVSAEVPKGVKIDVLITESTYGIASHIPRLEREQAL 238
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYY 117
+K V ++RGG+VL+PVFALGRAQEL ++L+ YW + + PIY+A L K Y
Sbjct: 239 MKSVTSILNRGGRVLMPVFALGRAQELLLILDEYWGKHPEFQKIPIYYASNLARKCMLVY 298
Query: 118 KMFITWTNQKIRKTFVQR--------------NMFDFKHIRPFDK-SFIDNPGPMVVFAT 162
+ ++ N+ I++ F +R +DFK+IR D+ G V+ A+
Sbjct: 299 QTYVGSMNENIKRLFRERMAEAEANSTSGGRGGPWDFKYIRSLKNLDRFDDVGGCVILAS 358
Query: 163 PGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHK----------VLSGVKKLEFEN 212
PGML +G+S + ++WAP + N +I+ G+ V+GT+ + ++SG + +
Sbjct: 359 PGMLQNGISRQLLERWAPSDKNGVIITGYSVEGTMAKQIMQEPDHIQAIMSGPQGFTRRS 418
Query: 213 ------KQIIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKI- 265
K +I + +++ SF+AH D + I+ V+LVHGE M+ LK K+
Sbjct: 419 APGETEKAMIPRRCSVQEYSFAAHVDGVENREFIEEIAAPVVILVHGEQHNMMRLKSKLL 478
Query: 266 ----KQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLKEEAVKYNSEPPNPLKERQIHG 321
+ + F P N E + I + + K + PP + I G
Sbjct: 479 SLNADKTTKVKVFSPKNCEP--------LRIPFMTSKTAKVVGKLANNPPPTDSAQLITG 530
Query: 322 VLVIKDSSISLMDVDEACKEVGIS 345
VLV D +SLM ++ + G++
Sbjct: 531 VLVQNDFKMSLMAPEDLREYAGLT 554
>gi|302927041|ref|XP_003054415.1| predicted protein [Nectria haematococca mpVI 77-13-4]
gi|256735356|gb|EEU48702.1| predicted protein [Nectria haematococca mpVI 77-13-4]
Length = 827
Score = 173 bits (438), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 115/387 (29%), Positives = 196/387 (50%), Gaps = 46/387 (11%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDF 59
MF +++G +I +TGDY+ DRHL +A + K + D+LITESTY + RE+
Sbjct: 180 MFLIEIGGLNIFFTGDYSREQDRHLVSAEVPKGVKIDVLITESTYGIASHVPRLEREQAL 239
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYY 117
+K + ++RGG+VL+PVFALGRAQEL ++L+ YW + + + PIY+A L K Y
Sbjct: 240 MKSITSILNRGGRVLMPVFALGRAQELLLILDEYWGKHSDFQKYPIYYASNLARKCMLVY 299
Query: 118 KMFITWTNQKIRKTFVQR--------------NMFDFKHIRPFDK-SFIDNPGPMVVFAT 162
+ ++ N I++ F +R +DFK+IR D+ G V+ A+
Sbjct: 300 QTYVGAMNDNIKRLFRERMAEAEASGDGAGKGGPWDFKYIRSLKNLDRFDDVGGCVMLAS 359
Query: 163 PGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKV--------------LSGVKKL 208
PGML +G+S + ++WAP E N +I+ G+ V+GT+ ++ ++G +++
Sbjct: 360 PGMLQNGVSRELLERWAPSEKNGVIITGYSVEGTMAKQIMQEPDQIQAVMSRSMAGARRM 419
Query: 209 EFEN--KQIIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKI- 265
+ K +I + +++ SF+AH D + I+ V+LVHGE M+ LK K+
Sbjct: 420 PGGDGEKVLIPRRCSVQEYSFAAHVDGVENREFIEEVAAPVVILVHGEQHNMMRLKSKLL 479
Query: 266 ----KQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLK---EEAVKYNSEPPNPLKERQ 318
+ + + P N E + + V L +++ + PL
Sbjct: 480 SLNANKTAKVKVYSPRNCEELRIPFKADKTAKVVGKLASIQPPQSIHLDPATAPPL---- 535
Query: 319 IHGVLVIKDSSISLMDVDEACKEVGIS 345
+ GVLV D +SLM ++ + G++
Sbjct: 536 VTGVLVQNDFKLSLMAPEDLREYAGLN 562
>gi|85079519|ref|XP_956368.1| hypothetical protein NCU03479 [Neurospora crassa OR74A]
gi|74630409|sp|Q8WZS6.1|YSH1_NEUCR RecName: Full=Endoribonuclease ysh-1; AltName: Full=mRNA
3'-end-processing protein ysh-1
gi|18376069|emb|CAD21097.1| related to BRR5 (component of pre-mRNA polyadenylation factor PF I)
[Neurospora crassa]
gi|28917429|gb|EAA27132.1| hypothetical protein NCU03479 [Neurospora crassa OR74A]
Length = 850
Score = 173 bits (438), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 121/407 (29%), Positives = 201/407 (49%), Gaps = 47/407 (11%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDF 59
MF +++ I +TGDY+ DRHL +A + K + D+LITESTY + RE+
Sbjct: 179 MFLIEIAGLKIFFTGDYSREEDRHLISAKVPKGVKIDVLITESTYGIASHIPRPEREQAL 238
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYY 117
+K + ++RGG+VL+PVFALGRAQEL ++L+ YW + + PIY+A L K Y
Sbjct: 239 MKSITGILNRGGRVLMPVFALGRAQELLLILDEYWGKHAEYQKYPIYYASNLARKCMLVY 298
Query: 118 KMFITWTNQKIRKTFVQR--------------NMFDFKHIRPFDKSF--IDNPGPMVVFA 161
+ ++ N I++ F +R +DF+ IR KS ++ G V+ A
Sbjct: 299 QTYVGSMNDNIKRLFRERLAESESSGDGAGKGGPWDFRFIRSL-KSLDRFEDVGGCVMLA 357
Query: 162 TPGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVL--------------SGVKK 207
+PGML +G+S + ++WAP E N +I+ G+ V+GT+ ++L +G ++
Sbjct: 358 SPGMLQNGVSRELLERWAPSEKNGVIITGYSVEGTMAKQLLQEPEQIQAVMSRNIAGARR 417
Query: 208 LEFEN--KQIIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKI 265
+ K +I + ++ SF+AH D + I+ V+LVHGE M+ LK K+
Sbjct: 418 GPGGDAEKVMIPRRCTVQEFSFAAHVDGVENREFIEEVAAPVVILVHGEVHNMMRLKSKL 477
Query: 266 -----KQEFNLDCFMPANGESCFVQTDMKISIDVSVNL------LKEEAVKYNSEPPNPL 314
+E + F P N E + V L LKE ++ P+
Sbjct: 478 LSLNATKEHKVKVFSPRNCEELRIPFKTDKVAKVVGKLASIPPSLKEAKTGHDGPLPSST 537
Query: 315 KERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDSS 361
+ + I GVLV D +SLM ++ + G++ + +++S +
Sbjct: 538 EPQLITGVLVQNDFKMSLMAPEDLREYAGLTTTTIACKQRLKLSAAG 584
>gi|156405894|ref|XP_001640966.1| predicted protein [Nematostella vectensis]
gi|156228103|gb|EDO48903.1| predicted protein [Nematostella vectensis]
Length = 515
Score = 172 bits (437), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 121/388 (31%), Positives = 197/388 (50%), Gaps = 37/388 (9%)
Query: 665 LMSEKKSYEVQVMSQVVAAVTNSCDSSHIIDLGGGQGYLSTILALQHGKKTLSLDYNQVN 724
+M+ KKS+EV VM+ VV + + + ++DLG G+GYLS LAL +G +D +N
Sbjct: 101 MMTPKKSHEVSVMAHVVNQIASQVKAEKVLDLGSGKGYLSQALALDYGLTVTGVDSKDMN 160
Query: 725 THGAAVRSKKLEDS-----EKESKGPFQS----------YAGVINKKLWMRTQNRQTSHA 769
T A RS KL + KE+K + A + +KK + Q
Sbjct: 161 TQNAVKRSMKLLKALQGRMRKENKSHNMADDSSGRKDTKSATICSKKSDITEQTIDIMCV 220
Query: 770 SVEHHGKNWKRKSKAPV-VSIKDEELVVCKDKC-----KQITHFVTPDSDISSILSQAYP 823
G +K+P+ + +K + K+KC IT +V + S L
Sbjct: 221 KDNPCGHFVHSDNKSPINMGLKSD----LKEKCISSSYTPITQYVDVSAFQSGDLFGLST 276
Query: 824 QDSLHNVCIMGLHTCGDLSGTALRLF-TKSSLQCLVQVGCCYHLLEEEFIRSPFWKDVDQ 882
+D+ + ++GLHTCGDL+ TA++LF S++ L VGCCYHLL + D
Sbjct: 277 EDTDPRLVLVGLHTCGDLAPTAIKLFIGDESVKALCLVGCCYHLLSQAGSGDIQGGDP-- 334
Query: 883 SLYEHGYGFPLSEHLRSRKFFLGRNVRMSGTQSPERVIDLKQTQTLPLFYRALLEKYLRS 942
GFP+SE+L+ KF L RN M Q+ +++ L + +FYRA+L+ L+
Sbjct: 335 -------GFPMSEYLQREKFTLSRNATMVAQQAADKIATLSKLPPKSVFYRAVLQVILKE 387
Query: 943 KITINDEEPKVVGRLATKCSNFVEYVHRAVDKLKLDL-EVDDEEVTRLFNSHQREYEYLQ 1001
K + D + VGR+ +KC F EYV + + KL L ++ D E+ + Q E +++
Sbjct: 388 KFQL-DTRGQHVGRIGSKCKTFHEYVKKGLSKLGLPFHKITDLEIEMYYERFQPEEKHIL 446
Query: 1002 IYYFLKTALAPVIEALIVLDRVLYLREQ 1029
++ L+ ++AP IE+L ++DR+ +++EQ
Sbjct: 447 AFHQLRASMAPSIESLFIIDRLCFMKEQ 474
>gi|198456542|ref|XP_001360363.2| GA17328 [Drosophila pseudoobscura pseudoobscura]
gi|198135650|gb|EAL24938.2| GA17328 [Drosophila pseudoobscura pseudoobscura]
Length = 494
Score = 172 bits (437), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 110/380 (28%), Positives = 189/380 (49%), Gaps = 48/380 (12%)
Query: 655 LVVESAKVSQL-----MSEKKSYEVQVMSQVVAAVTNSCDSSH---IIDLGGGQGYLSTI 706
L+ E K SQ+ MS KK +EV++ + +V + S H I+D G G+GYLS+
Sbjct: 103 LIAEQDKASQITIREFMSAKKCHEVELTAALVDNLIQSTGRQHDFFIVDAGDGKGYLSSR 162
Query: 707 LALQHGKKTLSLDYNQVNTHGAAVRSKKLEDSEKESKGPFQSYAGVINKKLWMRTQNRQT 766
+ALQ+ + L +D N NT A R++KL+ R N T
Sbjct: 163 MALQYEHRVLGIDANAANTESALNRNRKLQ-----------------------RAWNGLT 199
Query: 767 SHASVEHHGKNWKRKSKAPVVSIKDEELVVCKDKCKQITHFVTPDSDISSILSQAYPQDS 826
A ++ G KR+ K + + V ++ K F+ +++ ++L++ +P+ S
Sbjct: 200 ERAELQSQGITPKRRGKKSPTRNTAKSIPVLENY-KTTARFINTETNFGNLLAEHFPKPS 258
Query: 827 LH---NVCIMGLHTCGDLSGTALRLF-TKSSLQCLVQVGCCYHLLEEEFIRSPFWKDVDQ 882
++C+ GLHTCG+L+ T L++F + Q L +GCCYHLL+E + + F+ +
Sbjct: 259 SAGNLSICLTGLHTCGNLAATCLQVFHAQPDCQILCNIGCCYHLLKERYSQQEFFGNKTL 318
Query: 883 SLYEHGYGFPLSEHLRSRKFFLGRNVRMSGTQSPERVIDLKQTQTLPLFYRALLEKYLRS 942
+ +GFPLS++L+ R+ LGRN RM QS ER ++ K+ + L+YRALLE
Sbjct: 319 MEMQTDFGFPLSQYLQMRQIRLGRNARMLAAQSIERTLNGKELPNITLYYRALLE----- 373
Query: 943 KITINDEEPKVVGRLAT----KCSNFVEYVHRAVDKLKLDL--EVDDEEVTRLFNSHQRE 996
+ + P + L K NF +Y+ + KL ++ E+ + +
Sbjct: 374 -VLVCRHAPHLKNELQVGKVRKFDNFQDYIQKCARKLDAPWLASIETAELQSILQEFALD 432
Query: 997 YEYLQIYYFLKTALAPVIEA 1016
+L ++Y ++ + APV+E+
Sbjct: 433 KHFLDMFYLIRMSFAPVLES 452
>gi|119494361|ref|XP_001264076.1| cleavage and polyadenylation specifity factor, 73 kDa subunit,
putative [Neosartorya fischeri NRRL 181]
gi|119412238|gb|EAW22179.1| cleavage and polyadenylation specifity factor, 73 kDa subunit,
putative [Neosartorya fischeri NRRL 181]
Length = 878
Score = 172 bits (437), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 121/409 (29%), Positives = 199/409 (48%), Gaps = 60/409 (14%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDF 59
MF V + +I++TGDY+ DRHL A + K + D+LITEST+ + + RE
Sbjct: 181 MFLVSIAGLNILFTGDYSREEDRHLIPAEVPKGIKIDVLITESTFGISTNPPRLEREAAL 240
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWE-RMNLQA-PIYFAVGLTEKATNYY 117
+K + ++RGG+VL+PVFALGRAQEL ++L+ YWE LQ PIY+ + Y
Sbjct: 241 MKSITGILNRGGRVLMPVFALGRAQELLLILDEYWETHPELQKIPIYYIGNTARRCMVVY 300
Query: 118 KMFITWTNQKIRKTFVQR--------------NMFDFKHIRPFDK-SFIDNPGPMVVFAT 162
+ +I N I++ F QR +DFK +R D+ G V+ A+
Sbjct: 301 QTYIGAMNDNIKRLFRQRMAEAEASGDKSASAGPWDFKFVRSLRSLERFDDLGGCVMLAS 360
Query: 163 PGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLS------------------- 203
PGML +G S + ++WAP E N ++M G+ V+GT+ ++L+
Sbjct: 361 PGMLQTGTSRELLERWAPNERNGVVMTGYSVEGTMAKQLLNEPEQIPAVMSRSAGGVSRR 420
Query: 204 GVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKE 263
G+ + E K +I + ++ +SF+AH D I+ V+LVHGE +M+ LK
Sbjct: 421 GLAGTDEEQKIMIPRRCTVDEISFAAHVDGVENRNFIEEVAAPVVILVHGEKHQMMRLKS 480
Query: 264 KI-----KQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLKEEAVKYNSE-----PPNP 313
K+ + + + PAN + +V + K++ K + PP+
Sbjct: 481 KLLSLNADKTVKVKVYTPANCD------------EVRIPFRKDKVAKVVGKLAQVAPPSD 528
Query: 314 LKE-RQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDSS 361
+ R + GVLV +SLM D+ + G++ + ++ +S +S
Sbjct: 529 QDDGRLMSGVLVQNGFDLSLMAPDDLREYAGLTTTTITCKQHITLSSAS 577
>gi|408390480|gb|EKJ69876.1| hypothetical protein FPSE_09963 [Fusarium pseudograminearum CS3096]
Length = 833
Score = 172 bits (437), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 116/390 (29%), Positives = 199/390 (51%), Gaps = 52/390 (13%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDF 59
MF +++ +I +TGDY+ DRHL +A + K + D+LITESTY + RE+
Sbjct: 180 MFLIEIAGLNIFFTGDYSREQDRHLVSAEVPKGVKIDVLITESTYGIASHVPRLEREQAL 239
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWER-MNLQA-PIYFAVGLTEKATNYY 117
+K + ++RGG+VL+PVFALGRAQEL ++L+ YW + + Q PIY+A L K Y
Sbjct: 240 MKSITSILNRGGRVLMPVFALGRAQELLLILDEYWGKHADFQKYPIYYASNLARKCMLIY 299
Query: 118 KMFITWTNQKIRKTFVQR--------------NMFDFKHIRPFDK-SFIDNPGPMVVFAT 162
+ ++ N I++ F +R +DFK+IR D+ G V+ A+
Sbjct: 300 QTYVGAMNDNIKRLFRERMAEAEASGDGAGKGGPWDFKYIRSLKNLDRFDDVGGCVMLAS 359
Query: 163 PGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKV--------------LSGVKKL 208
PGML +G+S + ++WAP E N +I+ G+ V+GT+ ++ ++G +++
Sbjct: 360 PGMLQNGVSRELLERWAPSEKNGVIITGYSVEGTMAKQIMQEPDQIQAVMSRSMAGARRM 419
Query: 209 EFEN--KQIIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKI- 265
+ K +I + +++ SF+AH D + I+ + V+LVHGE M+ LK K+
Sbjct: 420 PGGDGEKVLIPRRCSVQEYSFAAHVDGVENREFIEEVQAPVVILVHGEQHNMMRLKSKLL 479
Query: 266 ----KQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLKEEAVKYNSEPPNPLKERQ--- 318
+ + + P N E + D + ++ + A + +PP + Q
Sbjct: 480 SLNANKTAKVKVYSPRNCEELRI----PFKADKTAKVVGKLA---SIQPPQSIHPDQTAT 532
Query: 319 ---IHGVLVIKDSSISLMDVDEACKEVGIS 345
+ GVLV D +SLM ++ + G++
Sbjct: 533 PPLVTGVLVQNDFKLSLMAPEDLREYAGLN 562
>gi|297739612|emb|CBI29794.3| unnamed protein product [Vitis vinifera]
Length = 581
Score = 172 bits (437), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 96/247 (38%), Positives = 139/247 (56%), Gaps = 20/247 (8%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V + ++YTGDY+ DRHL AA I + PD+ I ESTY + + RE+ F
Sbjct: 175 MFMVDIAGVRVLYTGDYSREEDRHLRAAEIPQFCPDICIIESTYGVQLHQPRHVREKRFT 234
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW----ERMNLQAPIYFAVGLTEKATNY 116
+H + +GG+VLIP +ALGRAQEL ++L+ YW E N+ PIY+A L ++
Sbjct: 235 DVIHSTISQGGRVLIPAYALGRAQELLLILDEYWSNHPELHNV--PIYYASPLAKRCMAV 292
Query: 117 YKMFITWTNQKIRKTFVQRNMFDFKHIRPFDKSF--IDNPGPMVVFATPGMLHSGLSLII 174
Y+ +I N++IR F N FDFKHI P KS ++ GP VV A+PG L SGLS +
Sbjct: 293 YQTYINSMNERIRNQFANSNPFDFKHISPL-KSIENFNDVGPSVVMASPGGLQSGLSRQL 351
Query: 175 FKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAK 234
F W + N ++PG+ V GT+ +++ K EN Q ++ M F++ AK
Sbjct: 352 FDMWCSDKKNACVIPGYVVGGTLAKTIINEPK----ENCQSVE-------MYFNSEKMAK 400
Query: 235 GIMQLIQ 241
I +L +
Sbjct: 401 TIGRLAE 407
>gi|238483863|ref|XP_002373170.1| cleavage and polyadenylation specifity factor, 73 kDa subunit
[Aspergillus flavus NRRL3357]
gi|220701220|gb|EED57558.1| cleavage and polyadenylation specifity factor, 73 kDa subunit
[Aspergillus flavus NRRL3357]
Length = 870
Score = 172 bits (436), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 122/402 (30%), Positives = 201/402 (50%), Gaps = 48/402 (11%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDF 59
MF + + +I++TGDY+ DRHL A + K + D+LITEST+ + + RE
Sbjct: 181 MFLISIAGLNILFTGDYSREEDRHLIPAEVPKGIKIDVLITESTFGISSNPPRLEREAAL 240
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERM-NLQ-APIYFAVGLTEKATNYY 117
+K + ++RGG+VL+PVFALGRAQEL ++L+ YWE+ LQ PIY+ + Y
Sbjct: 241 MKSITGVLNRGGRVLMPVFALGRAQELLLILDEYWEKHPELQKVPIYYIGNTARRCMVVY 300
Query: 118 KMFITWTNQKIRKTFVQR--------------NMFDFKHIRPFDK-SFIDNPGPMVVFAT 162
+ +I N I++ F QR +DF+ +R D+ G V+ A+
Sbjct: 301 QTYIGAMNDNIKRLFRQRMAEAEASGDKSISAGPWDFRFVRSLRSLERFDDVGGCVMLAS 360
Query: 163 PGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVL--------------SGVKKL 208
PGML +G S + ++WAP E N ++M G+ V+GT+ ++L SG+ +
Sbjct: 361 PGMLQTGTSRELLERWAPNERNGVVMTGYSVEGTMAKQLLNEPEQIPAVMSRAASGLARR 420
Query: 209 ---EFENKQIIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKI 265
+ E K +I + ++ +SF+AH D I+ V+LVHGE +M+ LK K+
Sbjct: 421 GGNDEEQKVMIPRRCTVDEISFAAHVDGVENRNFIEEVSAPVVILVHGEKHQMMRLKSKL 480
Query: 266 -----KQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLKEEAVKYNSEPPNPLKERQI- 319
++ + + PAN E + K I V L + A PP+ + +
Sbjct: 481 LSLNAEKTVKVKVYTPANCEEVRIPFK-KDKIAKVVGKLAQIA------PPSEQDDGHLM 533
Query: 320 HGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDSS 361
GVLV ++SLM D+ + G++ + ++ +S +S
Sbjct: 534 AGVLVQNGFNLSLMAPDDLREYAGLTTTTITCKQHITLSSAS 575
>gi|169767044|ref|XP_001817993.1| endoribonuclease ysh1 [Aspergillus oryzae RIB40]
gi|83765848|dbj|BAE55991.1| unnamed protein product [Aspergillus oryzae RIB40]
gi|391872741|gb|EIT81836.1| mRNA cleavage and polyadenylation factor II complex, BRR5
[Aspergillus oryzae 3.042]
Length = 870
Score = 172 bits (436), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 122/402 (30%), Positives = 201/402 (50%), Gaps = 48/402 (11%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDF 59
MF + + +I++TGDY+ DRHL A + K + D+LITEST+ + + RE
Sbjct: 181 MFLISIAGLNILFTGDYSREEDRHLIPAEVPKGIKIDVLITESTFGISSNPPRLEREAAL 240
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERM-NLQ-APIYFAVGLTEKATNYY 117
+K + ++RGG+VL+PVFALGRAQEL ++L+ YWE+ LQ PIY+ + Y
Sbjct: 241 MKSITGVLNRGGRVLMPVFALGRAQELLLILDEYWEKHPELQKVPIYYIGNTARRCMVVY 300
Query: 118 KMFITWTNQKIRKTFVQR--------------NMFDFKHIRPFDK-SFIDNPGPMVVFAT 162
+ +I N I++ F QR +DF+ +R D+ G V+ A+
Sbjct: 301 QTYIGAMNDNIKRLFRQRMAEAEASGDKSISAGPWDFRFVRSLRSLERFDDVGGCVMLAS 360
Query: 163 PGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVL--------------SGVKKL 208
PGML +G S + ++WAP E N ++M G+ V+GT+ ++L SG+ +
Sbjct: 361 PGMLQTGTSRELLERWAPNERNGVVMTGYSVEGTMAKQLLNEPEQIPAVMSRAASGLARR 420
Query: 209 ---EFENKQIIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKI 265
+ E K +I + ++ +SF+AH D I+ V+LVHGE +M+ LK K+
Sbjct: 421 GGNDEEQKVMIPRRCTVDEISFAAHVDGVENRNFIEEVSAPVVILVHGEKHQMMRLKSKL 480
Query: 266 -----KQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLKEEAVKYNSEPPNPLKERQI- 319
++ + + PAN E + K I V L + A PP+ + +
Sbjct: 481 LSLNAEKTVKVKVYTPANCEEVRIPFK-KDKIAKVVGKLAQIA------PPSEQDDGHLM 533
Query: 320 HGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDSS 361
GVLV ++SLM D+ + G++ + ++ +S +S
Sbjct: 534 AGVLVQNGFNLSLMAPDDLREYAGLTTTTITCKQHITLSSAS 575
>gi|346323812|gb|EGX93410.1| cleavage and polyadenylation specifity factor, 73 kDa subunit,
putative [Cordyceps militaris CM01]
Length = 879
Score = 172 bits (436), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 115/395 (29%), Positives = 198/395 (50%), Gaps = 58/395 (14%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDF 59
MF +++ +I +TGDY+ DRHL +A + K + D+LITESTY + RE+
Sbjct: 189 MFLIEIAGLNIFFTGDYSREQDRHLVSAEVPKGVKIDVLITESTYGIASHVPRLEREQAL 248
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYY 117
+K + ++RGG+ L+PVFALGRAQEL ++L+ YW + + PIY+A L +K Y
Sbjct: 249 MKSITNILNRGGRALLPVFALGRAQELLLILDEYWGKHAEFQKYPIYYASNLAKKCMLIY 308
Query: 118 KMFITWTNQKIRKTFVQR-------------NMFDFKHIRPFDK-SFIDNPGPMVVFATP 163
+ ++ N I++ F +R +DFK+IR D+ G V+ A+P
Sbjct: 309 QTYVGAMNDNIKRLFRERMAEAETSGGAGAGGPWDFKYIRSLKNLDRFDDVGGCVMLASP 368
Query: 164 GMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVL--------------SGVKKLE 209
GML +G+S +F++WAP + N +I+ G+ V+GT+ +++ +G ++
Sbjct: 369 GMLQNGVSRELFERWAPNDKNGVIITGYSVEGTMARQIMKEPEQIQAVMSRSIAGARRAP 428
Query: 210 FEN--KQIIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKI-- 265
+ K +I ++ +++ SF+AH D + I+ V+LVHGE + M+ LK K+
Sbjct: 429 GGDGEKVMIPLRCSVQEYSFAAHVDGVENREFIEEVAAPVVILVHGEQTNMMRLKSKLLS 488
Query: 266 ---KQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLKEEAVKYNS-EPPNPLKERQ--- 318
+ + + P N E ++ I + + + K S PP P+
Sbjct: 489 LNANKTVKVKVYSPRNTE--------ELRIPFKADKIAKVVGKLASIPPPRPIAASDDAD 540
Query: 319 --------IHGVLVIKDSSISLMDVDEACKEVGIS 345
+ GVLV D +SLM ++ + G++
Sbjct: 541 TGVAGGPLVSGVLVQNDFKLSLMAPEDLREYAGLN 575
>gi|440638117|gb|ELR08036.1| hypothetical protein GMDG_02874 [Geomyces destructans 20631-21]
Length = 831
Score = 172 bits (436), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 120/413 (29%), Positives = 205/413 (49%), Gaps = 60/413 (14%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWID-KCRPDLLITESTYATTIRDSKRCRERDF 59
MF + + +I++TGDY++ PDRHL +A + + D+LITESTY + RE+
Sbjct: 180 MFLISISGLTILFTGDYSIEPDRHLISASVPANVKVDVLITESTYGVASHVPRLEREQAL 239
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWER-MNLQ-APIYFAVGLTEKATNYY 117
+K + ++RGG+VL+PVFALGRAQEL ++L+ YW R +LQ PIY+A L K Y
Sbjct: 240 MKSITGILNRGGRVLMPVFALGRAQELLLILDEYWSRHKDLQNIPIYYASNLARKCMLVY 299
Query: 118 KMFITWTNQKIRKTFVQR-----------NMFDFKHIRPFDKSF--IDNPGPMVVFATPG 164
+ ++ N+ I++ F +R +DFK+IR KS D+ G V+ A+PG
Sbjct: 300 QTYVGAMNENIKRLFRERMAESEAGGTNGGPWDFKYIRSL-KSLERFDDVGSCVMLASPG 358
Query: 165 MLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVL------------SGVKKL---- 208
M+ +G+S + ++WAP + N +++ G+ V+GT+ ++ +G +
Sbjct: 359 MMQNGVSRELLERWAPSDKNGVVITGYSVEGTMAKSIMQEPDQIQAIMSRNGANRRAGRG 418
Query: 209 -EFENKQIIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEK--- 264
E K ++ + +++ SF+AH D + I+ V+LVHGE M+ LK K
Sbjct: 419 GEDGEKVMVPRRCSVQEFSFAAHVDGVENREFIEKVAAPVVILVHGEQHNMMRLKSKLLS 478
Query: 265 ----IKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLKEEAVKYNSEPPNPLKERQ-- 318
+ + + + P N E + + D ++ + A N PP L +
Sbjct: 479 LNANLPESKKVKVYSPRNCE----EVRIPFKADKIAKVVGKLA---NLSPPTLLTSGEGG 531
Query: 319 ----------IHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDSS 361
I GVLV D +SLM ++ + G++ +V +++ +
Sbjct: 532 ELQFDDGGKIISGVLVENDFKLSLMAPEDLREYAGLTTTVVMCKQRIRLGAAG 584
>gi|336468884|gb|EGO57047.1| hypothetical protein NEUTE1DRAFT_84705 [Neurospora tetrasperma FGSC
2508]
gi|350288819|gb|EGZ70044.1| Endoribonuclease ysh-1 [Neurospora tetrasperma FGSC 2509]
Length = 853
Score = 172 bits (436), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 120/407 (29%), Positives = 199/407 (48%), Gaps = 47/407 (11%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDF 59
MF +++ I +TGDY+ DRHL +A + K + D+LITESTY + RE+
Sbjct: 179 MFLIEIAGLKIFFTGDYSREEDRHLISAKVPKGVKIDVLITESTYGIASHIPRPEREQAL 238
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYY 117
+K + ++RGG+VL+PVFALGRAQEL ++L+ YW + + PIY+A L K Y
Sbjct: 239 MKSITGILNRGGRVLMPVFALGRAQELLLILDEYWGKHAEYQKYPIYYASNLARKCMLVY 298
Query: 118 KMFITWTNQKIRKTFVQR--------------NMFDFKHIRPFDKSF--IDNPGPMVVFA 161
+ ++ N I++ F +R +DF+ IR KS ++ G V+ A
Sbjct: 299 QTYVGSMNDNIKRLFRERLAESESSGDGAGKGGPWDFRFIRSL-KSLDRFEDVGGCVMLA 357
Query: 162 TPGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEF----------- 210
+PGML +G+S + ++WAP E N +I+ G+ V+GT+ ++L ++++
Sbjct: 358 SPGMLQNGVSRELLERWAPSEKNGVIITGYSVEGTMAKQLLQEPEQIQAVMSRNIAGARR 417
Query: 211 -----ENKQIIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKI 265
K +I + ++ SF+AH D + I+ V+LVHGE M+ LK K+
Sbjct: 418 GPGGDAEKVMIPRRCTVQEFSFAAHVDGVENREFIEEVAAPVVILVHGEVHNMMRLKSKL 477
Query: 266 -----KQEFNLDCFMPANGESCFVQTDMKISIDVSVNL------LKEEAVKYNSEPPNPL 314
+E + F P N E + V L LKE ++ P
Sbjct: 478 LSLNATKEHKVKVFSPRNCEELRIPFKTDKVAKVVGKLASIPPSLKEAKTAHDGPLPPST 537
Query: 315 KERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDSS 361
+ + I GVLV D +SLM ++ + G++ + +++S +
Sbjct: 538 EPQLITGVLVQNDFKMSLMAPEDLREYAGLTTTTIACKQRLKLSAAG 584
>gi|406866779|gb|EKD19818.1| metallo-beta-lactamase superfamily protein [Marssonina brunnea f.
sp. 'multigermtubi' MB_m1]
Length = 823
Score = 172 bits (435), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 118/390 (30%), Positives = 192/390 (49%), Gaps = 47/390 (12%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDF 59
MF +++ I +TGDY+ DRHL +A + K + D+LITESTY + RE+
Sbjct: 181 MFLIEIAGLKIFFTGDYSREDDRHLVSAEVPKGVKIDVLITESTYGIAAHVPRVEREQQL 240
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYY 117
+K + ++RGG+VL+PVFALGRAQEL ++L+ YW + PIY+A L K Y
Sbjct: 241 MKSITSILNRGGRVLMPVFALGRAQELLLILDEYWALHPEFQKIPIYYASNLARKCMLVY 300
Query: 118 KMFITWTNQKIRKTFVQR--------------NMFDFKHIRPFDK-SFIDNPGPMVVFAT 162
+ ++ N+ I++ F +R +DFK+IR D+ G V+ A+
Sbjct: 301 QTYVGAMNENIKRLFRERMAEAEASSDTAAKGGPWDFKYIRSLKNLDRFDDVGRCVMLAS 360
Query: 163 PGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVL--------------SGVKK- 207
PGML +G+S + ++WAP E N +++ G+ V+GT+ +++ G ++
Sbjct: 361 PGMLQNGVSRELLERWAPSEKNGVVITGYSVEGTMAKQIMQEPDQIQAIMSRNTGGARRG 420
Query: 208 --LEFENKQIIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKI 265
L K +I + +++ SF+AH D + I+ V+LVHGE M+ LK K+
Sbjct: 421 PDLSESEKVMIPRRCSVQEFSFAAHVDGIENREFIEEVAAPVVILVHGEVHNMMRLKSKL 480
Query: 266 KQEFNLD------CFMPANGESCFVQTDMKISIDVSVNLLKEEAVKYNSEPPNPLKERQI 319
N D F P N E + + V L A NS + + +
Sbjct: 481 -LSLNADKTNKVKVFSPRNCEELRIPFKAVKTARVVGKLASIPAPAANSN-----QSQLM 534
Query: 320 HGVLVIKDSSISLMDVDEACKEVGISRHIV 349
GV+V D +SLM ++ + G++ I+
Sbjct: 535 TGVIVQNDFKMSLMAPEDLREYAGLTTTII 564
>gi|258578481|ref|XP_002543422.1| predicted protein [Uncinocarpus reesii 1704]
gi|237903688|gb|EEP78089.1| predicted protein [Uncinocarpus reesii 1704]
Length = 875
Score = 172 bits (435), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 118/409 (28%), Positives = 209/409 (51%), Gaps = 59/409 (14%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDF 59
MF + + +I++TGDY+ DRHL +A + K + D+LI EST+ + + RE
Sbjct: 181 MFLISIAGLNILFTGDYSREEDRHLISAEVPKGIKIDVLIAESTFGISSSPPRLERETAL 240
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERM-NLQ-APIYFAVGLTEKATNYY 117
+K V ++RGG+VL+PVFALGRAQEL ++L+ YW R +LQ PI++ + + Y
Sbjct: 241 MKSVTSILNRGGRVLMPVFALGRAQELLLILDEYWSRHPDLQKVPIFYIGNMARRCMVVY 300
Query: 118 KMFITWTNQKIRKTFVQR--------------NMFDFKHIRPF-DKSFIDNPGPMVVFAT 162
+ +I N I++ F +R +DFK +R + D+ G V+ A+
Sbjct: 301 QTYIGAMNDNIKRLFRERMAEAEAKGDKSTTAGPWDFKFVRSVRNLERFDDVGGCVMLAS 360
Query: 163 PGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKL-------------- 208
PGML +G S + ++WAP E N +IM G+ V+GT+G ++L+ +++
Sbjct: 361 PGMLQTGTSRELLERWAPSERNGVIMTGYSVEGTMGKQILNEPEQIPAVMSARTGIGLGR 420
Query: 209 ------EFENKQIIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLK 262
+ E K +I + +E +SF+AH D + I+ V+LVHGE +M+ LK
Sbjct: 421 GRIPDGDDEQKVMIPRRCTVEEISFAAHVDGVENREFIEAVAAPVVILVHGEKHQMMRLK 480
Query: 263 EKI-----KQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLKEEAVKYNSEPPNPLKE- 316
K+ ++ + + P+N E + + ++D ++ A + PP +
Sbjct: 481 SKLLSLNAEKTVKVKIYSPSNCE----EIRIPFTVDKIAKVVGRLA---ETSPPTADYDD 533
Query: 317 -RQIHGVLVIKDSSISLMDVDEACKEVGIS-------RHIVRFTSNVQI 357
R ++GVLV +S+M ++ + G++ +HI T+++++
Sbjct: 534 SRLMNGVLVQNGFQLSMMAPEDLREYAGLTTTTITCKQHITLGTASIEL 582
>gi|145230249|ref|XP_001389433.1| endoribonuclease ysh1 [Aspergillus niger CBS 513.88]
gi|134055550|emb|CAK37196.1| unnamed protein product [Aspergillus niger]
Length = 874
Score = 172 bits (435), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 121/405 (29%), Positives = 200/405 (49%), Gaps = 51/405 (12%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDF 59
MF + + +I++TGDY+ DRHL A + K + D+LITEST+ + + RE
Sbjct: 181 MFLISIAGLNILFTGDYSREEDRHLIPAEVPKGVKIDVLITESTFGISSNPPRLEREAAL 240
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWE-RMNLQA-PIYFAVGLTEKATNYY 117
+K + ++RGG+VL+PVFALGRAQEL ++L+ YWE LQ PIY+ + Y
Sbjct: 241 MKAITGVLNRGGRVLMPVFALGRAQELLLILDEYWETHPELQKIPIYYIGNTARRCMVVY 300
Query: 118 KMFITWTNQKIRKTFVQR--------------NMFDFKHIRPFDK-SFIDNPGPMVVFAT 162
+ +I N I++ F QR +DF+ +R D+ G V+ A+
Sbjct: 301 QTYIGAMNDNIKRLFRQRMAEAEASGDKSVSAGPWDFRFVRSLRSLERFDDVGGCVMLAS 360
Query: 163 PGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKL-------------- 208
PGML +G S + ++WAP E N ++M G+ V+GT+ ++L+ +++
Sbjct: 361 PGMLQTGTSRELLERWAPNERNGVVMTGYSVEGTMAKQILNEPEQIPAVMSRATTGLVRR 420
Query: 209 ------EFENKQIIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLK 262
E E K +I + ++ +SF+AH D I+ V+LVHGE +M+ LK
Sbjct: 421 GMAAGNEEEQKVMIPRRCTVDEVSFAAHVDGVENRNFIEEVSAPVVILVHGEKHQMMRLK 480
Query: 263 EKI-----KQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLKEEAVKYNSEPPNPLKER 317
K+ ++ + + P N E + K I V L + A PP+ +
Sbjct: 481 SKLLSLNAEKTVKVKVYTPGNCEEVRIPFK-KDKIAKVVGKLAQIA------PPSEQDDG 533
Query: 318 QI-HGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDSS 361
Q+ GVLV ++SLM D+ + G++ + ++ +S +S
Sbjct: 534 QLMSGVLVQNGFNLSLMAPDDLREYAGLTTTAITCKQHITLSSAS 578
>gi|451852830|gb|EMD66124.1| hypothetical protein COCSADRAFT_34708 [Cochliobolus sativus ND90Pr]
Length = 872
Score = 171 bits (434), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 127/452 (28%), Positives = 217/452 (48%), Gaps = 63/452 (13%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDF 59
MF +++ I++TGDY+ DRHL +A + + D+LITEST+ ++ + RE
Sbjct: 180 MFLMEIAGLKILFTGDYSREDDRHLVSASVPPGVKIDVLITESTFGISMHTPRVEREAQL 239
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYY 117
+K + + ++RGG+ L+PVFALGRAQEL ++L+ YW + + PIY+ L K Y
Sbjct: 240 MKAITDVLNRGGRALLPVFALGRAQELLLILDEYWSKHPEVQKIPIYYNSSLARKCMQVY 299
Query: 118 KMFITWTNQKIRKTFV-------------QRNMFDFKHIRPFDKSF--IDNPGPMVVFAT 162
+ +++ N I++ F +R +DFK +R KS D+ G V+ A+
Sbjct: 300 QTYVSAMNDNIKRLFAERMAEAEAAGDTGRRGAWDFKFVRSL-KSLERFDDLGGCVMLAS 358
Query: 163 PGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVL---------------SGVKK 207
PGM+ SG S + ++WAP N +I+ G+ V+GT+ +++ + +
Sbjct: 359 PGMMQSGTSRELLERWAPDPRNGVIITGYSVEGTMAKQIVHEPDQIPAIMTRASNTARRP 418
Query: 208 LEFENKQ-IIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKI- 265
+ EN+Q +I + ++ SF+AH D K M+ +Q V+LVHGE M LK K+
Sbjct: 419 GQRENEQTMIPRRCTVQEYSFAAHVDGKENMEFVQEVGAPVVILVHGEKGNMTRLKSKLL 478
Query: 266 ----KQEFNLDCFMPAN-----------------GESCFVQTDMKISIDVSVNLLKEEAV 304
++ + PAN G+ + M S+ + + + V
Sbjct: 479 SFNAQKTIPTKIYSPANCEELRIPFKTDKIAKVVGKLASITPPMPRSLTLDGSEANQVGV 538
Query: 305 KYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDSSP-- 362
K E N I GVL+ D ISLM ++ + G++ + ++ +S +
Sbjct: 539 KQEPEGEN---VDIISGVLIQNDFKISLMAPEDLKEYAGLTTTTILCRQHITLSAAGIDL 595
Query: 363 INKTLELIYDQLL-SYLQDKSQEYKIQLTESS 393
I LE + + + L D + E K+ TE+
Sbjct: 596 IRWALEGTFGAIKETRLTDAAVEGKMNGTENG 627
>gi|115397403|ref|XP_001214293.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|114192484|gb|EAU34184.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 870
Score = 171 bits (434), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 123/403 (30%), Positives = 201/403 (49%), Gaps = 49/403 (12%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDF 59
MF V + +I++TGDY+ DRHL A + K + D+LITEST+ + + RE
Sbjct: 181 MFLVSIAGLNILFTGDYSREEDRHLIPAEVPKGVKIDVLITESTFGISSNPPRLEREAAL 240
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWE-RMNLQA-PIYFAVGLTEKATNYY 117
+K + ++RGG+VL+PVFALGRAQEL ++L+ YWE LQ PIY+ + Y
Sbjct: 241 MKSITGVLNRGGRVLMPVFALGRAQELLLILDEYWETHPELQKIPIYYIGNTARRCMVVY 300
Query: 118 KMFITWTNQKIRKTFVQR--------------NMFDFKHIRPFDK-SFIDNPGPMVVFAT 162
+ +I N I++ F QR +DF+ +R D+ G V+ A+
Sbjct: 301 QTYIGAMNDNIKRLFRQRMAEAEASGDKNASAGPWDFRFVRSLRSLERFDDVGGCVMLAS 360
Query: 163 PGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKL-------------- 208
PGML SG S + ++WAP E N +IM G+ V+GT+ ++L+ +++
Sbjct: 361 PGMLQSGTSRELLERWAPNERNGVIMTGYSVEGTMAKQLLNEPEQIPAVMSRAATGLIRR 420
Query: 209 ----EFENKQIIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEK 264
+ E K +I + ++ +SF+AH D I+ V+LVHGE +M+ LK K
Sbjct: 421 GANSDEEQKVMIPRRCTVDEISFAAHVDGVENRNFIEEVSAPVVILVHGEKHQMMRLKSK 480
Query: 265 I-----KQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLKEEAVKYNSEPPNPLKERQI 319
+ ++ + + PAN + + K I V L + A PP+ + Q+
Sbjct: 481 LLSLNAEKTVKVKVYTPANCDEVRIPFK-KDKIAKVVGKLAQIA------PPSEQDDGQL 533
Query: 320 H-GVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDSS 361
GVLV ++SLM D+ + G++ + ++ +S +S
Sbjct: 534 MTGVLVQNGFNLSLMAPDDLREYAGLTTTSITCKQHITLSSAS 576
>gi|85001073|ref|XP_955255.1| cleavage and polyadenylation specificty factor, subunit [Theileria
annulata strain Ankara]
gi|65303401|emb|CAI75779.1| cleavage and polyadenylation specificty factor, subunit, putative
[Theileria annulata]
Length = 1282
Score = 171 bits (434), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 97/285 (34%), Positives = 162/285 (56%), Gaps = 10/285 (3%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V++ I+YTGDY++ D+HL +A I LLI+ESTY + + + RE FL
Sbjct: 173 MFLVEIDGVRILYTGDYSVEKDKHLPSAEIPSTNVHLLISESTYGIRVHEERSQREMRFL 232
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWE--RMNLQAPIYFAVGLTEKATNYYK 118
V + + R GK L+PVFALGR+QE+ ++L+ YWE R PI++ L K+ Y+
Sbjct: 233 HVVMDIIMREGKCLLPVFALGRSQEILLILDNYWENNRQLHNVPIFYISPLASKSLRVYE 292
Query: 119 MFITWTNQKIRKT-FVQRNMFDFKHIRPFD-----KSFIDNPGPMVVFATPGMLHSGLSL 172
F+ I+++ + N FDFK ++ ++++ GP ++ +PGML G SL
Sbjct: 293 TFVGQCGDYIKQSVYNGFNPFDFKFVKYARSIKQIRNYLLRDGPCIIMTSPGMLQGGPSL 352
Query: 173 IIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHAD 232
+F+ P N +++ G+ V+GT+ ++ + + NK ++ + ++E +SFSAHAD
Sbjct: 353 EVFELICPDNRNGVVLTGYTVKGTLADELKKDPEFINLGNK-VVKPRCSVEQISFSAHAD 411
Query: 233 AKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF-NLDCFMP 276
I+ NV+LVHGE ++M +K+K++++ L FMP
Sbjct: 412 YNQTKDFIKKLSVPNVILVHGERNEMKRMKDKLEEDIRQLTVFMP 456
>gi|342879865|gb|EGU81098.1| hypothetical protein FOXB_08372 [Fusarium oxysporum Fo5176]
Length = 858
Score = 171 bits (434), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 116/390 (29%), Positives = 199/390 (51%), Gaps = 52/390 (13%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDF 59
MF +++ +I +TGDY+ DRHL +A + K + D+LITESTY + RE+
Sbjct: 206 MFLIEIAGLNIFFTGDYSREQDRHLVSAEVPKGVKIDVLITESTYGIASHVPRLEREQAL 265
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWER-MNLQA-PIYFAVGLTEKATNYY 117
+K + ++RGG+VL+PVFALGRAQEL ++L+ YW + + Q PIY+A L K Y
Sbjct: 266 MKSITSILNRGGRVLMPVFALGRAQELLLILDEYWGKHADFQKYPIYYASNLARKCMLIY 325
Query: 118 KMFITWTNQKIRKTFVQR--------------NMFDFKHIRPFDK-SFIDNPGPMVVFAT 162
+ ++ N I++ F +R +DFK+IR D+ G V+ A+
Sbjct: 326 QTYVGAMNDNIKRLFRERMAEAEASGDGAGKGGPWDFKYIRSLKNLDRFDDVGGCVMLAS 385
Query: 163 PGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKV--------------LSGVKKL 208
PGML +G+S + ++WAP E N +I+ G+ V+GT+ ++ ++G +++
Sbjct: 386 PGMLQNGVSRELLERWAPSEKNGVIITGYSVEGTMAKQIMQEPDQIQAVMSRSMAGARRM 445
Query: 209 EFEN--KQIIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKI- 265
+ K +I + +++ SF+AH D + I+ V+LVHGE M+ LK K+
Sbjct: 446 PGGDGEKVLIPRRCSVQEYSFAAHVDGVENREFIEEVAAPVVILVHGEQHNMMRLKSKLL 505
Query: 266 ----KQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLKEEAVKYNSEPPNPLKERQ--- 318
+ + + P N E + + D + ++ + A + +PP + Q
Sbjct: 506 SLNANKTAKVKVYSPRNCE----ELRIPFKADKTAKVVGKLA---SIQPPQSIHPDQSAA 558
Query: 319 ---IHGVLVIKDSSISLMDVDEACKEVGIS 345
+ GVLV D +SLM ++ + G++
Sbjct: 559 PPLVTGVLVQNDFKLSLMAPEDLREYAGLN 588
>gi|393912282|gb|EJD76666.1| cleavage and polyadenylation specific factor 3 [Loa loa]
Length = 475
Score = 171 bits (433), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 102/302 (33%), Positives = 159/302 (52%), Gaps = 23/302 (7%)
Query: 59 FLKKVHECVDRGGKVLIPVFALGRAQELCILLETYWE-RMNLQ-APIYFAVGLTEKATNY 116
F VHE V RGG+ LIP FALGRAQEL ++L+ YWE LQ P+Y+A L +K
Sbjct: 7 FTSIVHEVVGRGGRCLIPAFALGRAQELLLILDEYWEAHPELQDIPVYYASSLAKKCMAV 66
Query: 117 YKMFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIF 175
Y+ F++ N +I+K N F FKH+ ++ GP VV A+PGML +GLS +F
Sbjct: 67 YQTFVSGMNSRIQKQIALNNPFVFKHVSNLKSIDHFEDVGPCVVLASPGMLQNGLSRELF 126
Query: 176 KKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKG 235
+ W N I+ G+CV+GT+ +LS +++ N Q + +++ + Y+SFSAH D
Sbjct: 127 ENWCTDSKNGCIIAGYCVEGTLAKHILSEPEEIVAMNGQKLAMRLQVAYISFSAHTDYTQ 186
Query: 236 IMQLIQYCEPKNVLLVHGEASKMVFLKEKI------KQEFNLDCFMPANGESCFVQTDMK 289
I+ P +++LVHGE ++M LK I + +F+++ + P N ES
Sbjct: 187 TSDFIRALRPPHLVLVHGEMNEMNRLKAAIIRQYEDESDFHIEVYNPRNTES-------- 238
Query: 290 ISIDVSVNLLKEEAVKYNSEPP--NPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRH 347
V ++ E+ K + P R + GVL+ ++ + LM D+ +S
Sbjct: 239 ----VELHFRGEKTAKVVGKMAMTTPGDGRILSGVLIRRNFNYHLMHADDLSAYTDLSNS 294
Query: 348 IV 349
I+
Sbjct: 295 IL 296
>gi|212533753|ref|XP_002147033.1| cleavage and polyadenylation specifity factor, 73 kDa subunit,
putative [Talaromyces marneffei ATCC 18224]
gi|210072397|gb|EEA26486.1| cleavage and polyadenylation specifity factor, 73 kDa subunit,
putative [Talaromyces marneffei ATCC 18224]
Length = 866
Score = 171 bits (433), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 119/388 (30%), Positives = 194/388 (50%), Gaps = 50/388 (12%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDF 59
MF V + +I++TGDY+ DRHL A + + + D+LITEST+ + + RE
Sbjct: 181 MFLVSIAGLNILFTGDYSREEDRHLIPAEVPRGIKIDVLITESTFGISSNPPRLEREAAL 240
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYY 117
+K + ++RGG+VL+PVFALGRAQEL ++LE YWER + PIY+ + + Y
Sbjct: 241 MKSITGILNRGGRVLMPVFALGRAQELLLILEEYWERHPEFQKIPIYYIGNMARRCMVVY 300
Query: 118 KMFITWTNQKIRKTFVQR--------------NMFDFKHIRPFDK-SFIDNPGPMVVFAT 162
+ +I N I++ F QR +DF+++R D+ G V+ A+
Sbjct: 301 QTYIGAMNDNIKRLFRQRMAEAEASGNKNVAAGPWDFRYVRSLRSLERFDDIGSCVMLAS 360
Query: 163 PGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVL------------------SG 204
PGML +G S + ++WAP E N ++M G+ V+GT+ ++L S
Sbjct: 361 PGMLQTGTSRELLERWAPSERNGVVMTGYSVEGTMAKQLLNEPEQIPATMSKSAAGPGSR 420
Query: 205 VKKLEFENKQI-IDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKE 263
V + E++++ I + I+ +SF+AH D I+ V+LVHGE +M+ LK
Sbjct: 421 VNPMADEDQKVMIPRRCTIDEISFAAHVDGVENRDFIEQVAAPVVILVHGEKHQMMRLKS 480
Query: 264 K-----IKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLKEEA-VKYNSEPPNPLKER 317
K + + + + PAN + + K I V L E+A + N E +
Sbjct: 481 KLLSLNVDKTVKVKVYTPANCDEIRIPF-RKDKIAKVVGKLAEDAQLTENDE------SK 533
Query: 318 QIHGVLVIKDSSISLMDVDEACKEVGIS 345
+ GVLV +SLM ++ + G++
Sbjct: 534 LMAGVLVQNGFDLSLMAPEDLREYAGLT 561
>gi|452002411|gb|EMD94869.1| hypothetical protein COCHEDRAFT_1222148 [Cochliobolus
heterostrophus C5]
Length = 872
Score = 170 bits (431), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 127/452 (28%), Positives = 215/452 (47%), Gaps = 63/452 (13%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDF 59
MF +++ I++TGDY+ DRHL +A + + D+LITEST+ ++ + RE
Sbjct: 180 MFLMEIAGLKILFTGDYSREDDRHLVSASVPPGVKIDVLITESTFGISMHTPRVEREAQL 239
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYY 117
+K + + ++RGG+ L+PVFALGRAQEL ++L+ YW + + PIY+ L K Y
Sbjct: 240 MKAITDVLNRGGRALLPVFALGRAQELLLILDEYWSKHPEVQKIPIYYNSSLARKCMQVY 299
Query: 118 KMFITWTNQKIRKTFV-------------QRNMFDFKHIRPFDKSF--IDNPGPMVVFAT 162
+ +++ N I++ F +R +DFK +R KS D+ G V+ A+
Sbjct: 300 QTYVSAMNDNIKRLFAERMAEAEAAGDTGRRGAWDFKFVRSL-KSLERFDDLGGCVMLAS 358
Query: 163 PGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVL---------------SGVKK 207
PGM+ SG S + ++WAP N +I+ G+ V+GT+ ++ + +
Sbjct: 359 PGMMQSGTSRELLERWAPDPRNGVIITGYSVEGTMAKHIVHEPDQIPAIMTRASNTARRP 418
Query: 208 LEFENKQ-IIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKI- 265
+ EN+Q +I + ++ SF+AH D K M+ +Q V+LVHGE M LK K+
Sbjct: 419 GQRENEQTMIPRRCTVQEYSFAAHVDGKENMEFVQEVGAPVVILVHGEKGNMTRLKSKLL 478
Query: 266 ----KQEFNLDCFMPAN-----------------GESCFVQTDMKISIDVSVNLLKEEAV 304
++ + PAN G+ + M S+ + +
Sbjct: 479 SFNAQKTIPTKIYSPANCEELRIPFKTDKIAKVVGKLASITPPMPRSLTPGGSESDQAGA 538
Query: 305 KYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDSSP-- 362
K SE N I GVL+ D ISLM ++ + G++ + ++ +S +
Sbjct: 539 KQESEGEN---IDIISGVLIQNDFKISLMAPEDLKEYAGLTTTTILCRQHITLSAAGIDL 595
Query: 363 INKTLELIYDQLL-SYLQDKSQEYKIQLTESS 393
I LE + + + L D + E K+ TE+
Sbjct: 596 IRWALEGTFGAIKETRLTDAAVEGKMNGTENG 627
>gi|442570104|sp|Q4IPN9.2|YSH1_GIBZE RecName: Full=Endoribonuclease YSH1; AltName: Full=mRNA
3'-end-processing protein YSH1
Length = 833
Score = 170 bits (431), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 116/391 (29%), Positives = 198/391 (50%), Gaps = 54/391 (13%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDF 59
MF +++ +I +TGDY+ DRHL +A + K + D+LITESTY + RE+
Sbjct: 180 MFLIEIAGLNIFFTGDYSREQDRHLVSAEVPKGVKIDVLITESTYGIASHVPRLEREQAL 239
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWER-MNLQA-PIYFAVGLTEKATNYY 117
+K + ++RGG+VL+PVFALGRAQEL ++L+ YW + + Q PIY+A L K Y
Sbjct: 240 MKSITSILNRGGRVLMPVFALGRAQELLLILDEYWGKHADFQKYPIYYASNLARKCMLIY 299
Query: 118 KMFITWTNQKIRKTFVQR--------------NMFDFKHIRPFDK-SFIDNPGPMVVFAT 162
+ ++ N I++ F +R +DFK+IR D+ G V+ A+
Sbjct: 300 QTYVGAMNDNIKRLFRERMAEAEASGDGAGKGGPWDFKYIRSLKNLDRFDDVGGCVMLAS 359
Query: 163 PGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKV--------------LSGVKKL 208
PGML +G+S + ++WAP E N +I+ G+ V+GT+ ++ ++G +++
Sbjct: 360 PGMLQNGVSRELLERWAPSEKNGVIITGYSVEGTMAKQIMQEPDQIQAVMSRSMAGARRM 419
Query: 209 EFEN--KQIIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKI- 265
+ K +I + +++ SF+AH D + I+ + V+LVHGE M+ LK K+
Sbjct: 420 PGGDGEKVLIPRRCSVQEYSFAAHVDGVENREFIEEVQAPVVILVHGEQHNMMRLKSKLL 479
Query: 266 ----KQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLKEEAVKYNS-EPPNPLKERQ-- 318
+ + + P N E ++ I + + + K +PP + Q
Sbjct: 480 SLNANKTAKVKVYSPRNCE--------ELRIPFKADKIAKVVGKLACIQPPQSIHPDQTA 531
Query: 319 ----IHGVLVIKDSSISLMDVDEACKEVGIS 345
+ GVLV D +SLM ++ + G++
Sbjct: 532 TPPLVTGVLVQNDFKLSLMAPEDLREYAGLN 562
>gi|46107872|ref|XP_380995.1| hypothetical protein FG00819.1 [Gibberella zeae PH-1]
Length = 864
Score = 170 bits (431), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 116/391 (29%), Positives = 198/391 (50%), Gaps = 54/391 (13%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDF 59
MF +++ +I +TGDY+ DRHL +A + K + D+LITESTY + RE+
Sbjct: 211 MFLIEIAGLNIFFTGDYSREQDRHLVSAEVPKGVKIDVLITESTYGIASHVPRLEREQAL 270
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWER-MNLQA-PIYFAVGLTEKATNYY 117
+K + ++RGG+VL+PVFALGRAQEL ++L+ YW + + Q PIY+A L K Y
Sbjct: 271 MKSITSILNRGGRVLMPVFALGRAQELLLILDEYWGKHADFQKYPIYYASNLARKCMLIY 330
Query: 118 KMFITWTNQKIRKTFVQR--------------NMFDFKHIRPFDK-SFIDNPGPMVVFAT 162
+ ++ N I++ F +R +DFK+IR D+ G V+ A+
Sbjct: 331 QTYVGAMNDNIKRLFRERMAEAEASGDGAGKGGPWDFKYIRSLKNLDRFDDVGGCVMLAS 390
Query: 163 PGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKV--------------LSGVKKL 208
PGML +G+S + ++WAP E N +I+ G+ V+GT+ ++ ++G +++
Sbjct: 391 PGMLQNGVSRELLERWAPSEKNGVIITGYSVEGTMAKQIMQEPDQIQAVMSRSMAGARRM 450
Query: 209 EFEN--KQIIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKI- 265
+ K +I + +++ SF+AH D + I+ + V+LVHGE M+ LK K+
Sbjct: 451 PGGDGEKVLIPRRCSVQEYSFAAHVDGVENREFIEEVQAPVVILVHGEQHNMMRLKSKLL 510
Query: 266 ----KQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLKEEAVKYNS-EPPNPLKERQ-- 318
+ + + P N E ++ I + + + K +PP + Q
Sbjct: 511 SLNANKTAKVKVYSPRNCE--------ELRIPFKADKIAKVVGKLACIQPPQSIHPDQTA 562
Query: 319 ----IHGVLVIKDSSISLMDVDEACKEVGIS 345
+ GVLV D +SLM ++ + G++
Sbjct: 563 TPPLVTGVLVQNDFKLSLMAPEDLREYAGLN 593
>gi|400600571|gb|EJP68245.1| metallo-beta-lactamase superfamily protein [Beauveria bassiana
ARSEF 2860]
Length = 866
Score = 170 bits (430), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 97/298 (32%), Positives = 165/298 (55%), Gaps = 33/298 (11%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDF 59
MF +++ +I +TGDY+ DRHL +A + K + D+LITESTY + RE+
Sbjct: 180 MFLIEIAGLNIFFTGDYSREQDRHLVSAEVPKGVKIDVLITESTYGIASHVPRLEREQAL 239
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYY 117
+K + ++RGG+ L+PVFALGRAQEL ++L+ YW + + + PIY+A L +K Y
Sbjct: 240 MKSITNILNRGGRALLPVFALGRAQELLLILDEYWGKHSEFQKYPIYYASNLAKKCMLIY 299
Query: 118 KMFITWTNQKIRKTFVQR-------------NMFDFKHIRPFDK-SFIDNPGPMVVFATP 163
+ ++ N I++ F +R +DFK+IR D+ G V+ A+P
Sbjct: 300 QTYVGAMNDNIKRLFRERMAEAETSGEAGAGGPWDFKYIRSLKNLDRFDDVGGCVMLASP 359
Query: 164 GMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVL--------------SGVKKLE 209
GML +G+S +F++WAP + N +I+ G+ V+GT+ +++ +G ++
Sbjct: 360 GMLQNGVSRELFERWAPSDKNGVIITGYSVEGTMARQIMKEPEQIQAVMSRSIAGARRAP 419
Query: 210 FEN--KQIIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKI 265
+ K +I ++ +++ SF+AH D + I+ V+LVHGE + M+ LK K+
Sbjct: 420 GGDGEKVMIPLRCSVQEYSFAAHVDGVENREFIEEVAAPVVILVHGEQTNMMRLKSKL 477
>gi|396488788|ref|XP_003842943.1| similar to cleavage and polyadenylation specifity factor
[Leptosphaeria maculans JN3]
gi|312219521|emb|CBX99464.1| similar to cleavage and polyadenylation specifity factor
[Leptosphaeria maculans JN3]
Length = 861
Score = 170 bits (430), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 115/408 (28%), Positives = 202/408 (49%), Gaps = 51/408 (12%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDF 59
MF +++ I++TGDY+ DRHL +A + + D+LITEST+ ++ + RE
Sbjct: 180 MFLMEIAGLKILFTGDYSREDDRHLVSASVPAGVKVDVLITESTFGISMHTPRVEREAQL 239
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYY 117
+K + + ++RGG+ L+PVFALGRAQEL ++L+ YW + + PIY+ L K Y
Sbjct: 240 MKAITDILNRGGRALLPVFALGRAQELLLILDEYWSKHPEVQKIPIYYNSSLARKCMQVY 299
Query: 118 KMFITWTNQKIRKTFV-------------QRNMFDFKHIRPFDKSF--IDNPGPMVVFAT 162
+ +++ N I++ F +R +DFK +R KS D+ G V+ A+
Sbjct: 300 QTYVSAMNDNIKRLFAERMAEAEAAGDTGRRGAWDFKFVRSL-KSLERFDDLGGCVMLAS 358
Query: 163 PGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVL---------------SGVKK 207
PGM+ SG S + ++WAP N +I+ G+ V+GT+ +++ + +
Sbjct: 359 PGMMQSGTSRELLERWAPDPRNGVIITGYSVEGTMAKQIVHEPDQIPAIMTRTNNNSRRP 418
Query: 208 LEFENKQ-IIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKI- 265
+ E +Q +I + ++ SF+AH D K M+ +Q V+LVHGE M L+ K+
Sbjct: 419 GQREGEQTMIPRRCTVQEYSFAAHVDGKENMEFVQEVAAPVVILVHGEKGNMTRLRSKLL 478
Query: 266 ----KQEFNLDCFMPANGESCFV--QTD---------MKISIDVSVNLLKEEAVKYNSEP 310
++ + PAN E + +TD I+ + +L + + N +
Sbjct: 479 SFNAQKSVPTKIYSPANCEELRIPFKTDKIAKVVGKLASIAPPIPRSLDADGDERLNDQD 538
Query: 311 PNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQIS 358
K I GVL+ D +SLM ++ + G++ + ++ +S
Sbjct: 539 SEGEKLDMIAGVLIQNDFKLSLMAPEDLKEYAGLTTTTIMCRQHITLS 586
>gi|390364725|ref|XP_003730668.1| PREDICTED: uncharacterized protein C12orf26 homolog
[Strongylocentrotus purpuratus]
Length = 311
Score = 169 bits (429), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 98/269 (36%), Positives = 150/269 (55%), Gaps = 32/269 (11%)
Query: 795 VVCKDKCKQITHFVTPDSDISSILSQAYPQDSLHNV----CIMGLHTCGDLSGTALRLF- 849
V+C+D HF P S+ HN ++GLHTCGDL+ + LR+F
Sbjct: 59 VICRD------HFTIPTSN--------------HNFGTPFMMVGLHTCGDLAPSMLRIFQ 98
Query: 850 TKSSLQCLVQVGCCYHLLEEEFIRSPFWKDVDQSLYEHG--YGFPLSEHLRSRKFFLGRN 907
+ S+ CL VGCCYH + EEF + W D + S + G +GFP+S++LR +K LGR
Sbjct: 99 SNPSMHCLCNVGCCYHHVTEEFAKKDEW-DREHSGIKMGSSFGFPMSDYLREKKVALGRG 157
Query: 908 VRMSGTQSPERVIDLKQTQTLPLFYRALLEKYLRSKITINDEEPKVVGRLATKCSNFVEY 967
RM+ + +R+ K LFYRA+L+ R TI + +++G+LA KC +F+EY
Sbjct: 158 ARMTACMAADRIASEKTVDVKGLFYRAVLQVICRDHFTIPTRK-QIIGKLAPKCKSFMEY 216
Query: 968 VHRAVDKLKL-DLEVDDEEVTRLFNSHQREYEYLQIYYFLKTALAPVIEALIVLDRVLYL 1026
+A+ KLK+ D ++ D+E++ ++ Y +L Y L+ AL+P IEA+I+LDR+LYL
Sbjct: 217 ARKALKKLKMDDTKISDDEISAYEAEYRPFYRHLAAYNQLRAALSPSIEAIILLDRLLYL 276
Query: 1027 REQQKPQIISNLFLSLLTQVDDEEVTRLF 1055
+EQ P I + + L V +F
Sbjct: 277 KEQ--PNIKTACIVRLFDPVTSPRCYGIF 303
Score = 48.9 bits (115), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 22/51 (43%), Positives = 36/51 (70%)
Query: 1044 TQVDDEEVTRLFDSHQREYEYLQIYYFLKTALAPVIEALIVLDRVLYLREQ 1094
T++ D+E++ ++ Y +L Y L+ AL+P IEA+I+LDR+LYL+EQ
Sbjct: 229 TKISDDEISAYEAEYRPFYRHLAAYNQLRAALSPSIEAIILLDRLLYLKEQ 279
>gi|336259697|ref|XP_003344648.1| hypothetical protein SMAC_07216 [Sordaria macrospora k-hell]
gi|380088385|emb|CCC13649.1| unnamed protein product [Sordaria macrospora k-hell]
Length = 857
Score = 169 bits (429), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 122/405 (30%), Positives = 200/405 (49%), Gaps = 48/405 (11%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDF 59
MF +++ I +TGDY+ DRHL +A + K + D+LITESTY + RE+
Sbjct: 179 MFLIEIAGLKIFFTGDYSREEDRHLISAEVPKGVKIDVLITESTYGIASHIPRVEREQAL 238
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYY 117
+K + ++RGG+VL+PVFALGRAQEL ++L+ YW + + PIY+A L K Y
Sbjct: 239 MKSITGILNRGGRVLMPVFALGRAQELLLILDEYWGKHAEFQKYPIYYASNLARKCMLVY 298
Query: 118 KMFITWTNQKIRKTFVQR--------------NMFDFKHIRPFDKSF--IDNPGPMVVFA 161
+ ++ N I++ F +R +DFK IR KS ++ G V+ A
Sbjct: 299 QTYVGSMNDNIKRLFRERLAESESSGDGAGKGGPWDFKFIRSL-KSIDRFEDVGGCVMLA 357
Query: 162 TPGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEF----------- 210
+PGML +G+S + ++WAP E N +I+ G+ V+GT+ ++ ++
Sbjct: 358 SPGMLQNGVSRELLERWAPSEKNGVIITGYSVEGTMAKHIMQEPDTIQAVMSRNIAGARR 417
Query: 211 -------ENKQIIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKE 263
K +I + ++ SF+AH D + I+ V+LVHGE M+ LK
Sbjct: 418 GPGGGGDAEKVMIPRRCTVQEFSFAAHVDGVENREFIEEVAAPVVILVHGEVHNMMRLKS 477
Query: 264 KI-----KQEFNLDCFMPANGESCFV--QTD--MKISIDVSVNLLKEEAVKYNSEPPNPL 314
K+ +E + F P N E + +TD K+ ++ + K + + P P
Sbjct: 478 KLLSLNANKEHKVKVFSPKNCEELRIPFKTDKVAKVVGKLAAIPPALKEAKNSHDGPLPS 537
Query: 315 KERQ-IHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQIS 358
E Q I GVLV D +SLM ++ + G++ + +++S
Sbjct: 538 SEPQLITGVLVQNDFKMSLMAPEDLKEYAGLTTTTIACKQRLRLS 582
>gi|358365452|dbj|GAA82074.1| cleavage and polyadenylation specifity factor, 73 kDa subunit
[Aspergillus kawachii IFO 4308]
Length = 882
Score = 169 bits (429), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 119/410 (29%), Positives = 200/410 (48%), Gaps = 61/410 (14%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDF 59
MF + + +I++TGDY+ DRHL A + K + D+LITEST+ + + RE
Sbjct: 181 MFLISIAGLNILFTGDYSREEDRHLIPAEVPKGVKIDVLITESTFGISSNPPRLEREAAL 240
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWE-RMNLQA-PIYFAVGLTEKATNYY 117
+K + ++RGG+VL+PVFALGRAQEL ++L+ YWE LQ PIY+ + Y
Sbjct: 241 MKAITGVLNRGGRVLMPVFALGRAQELLLILDEYWETHPELQKIPIYYIGNTARRCMVVY 300
Query: 118 KMFITWTNQKIRKTFVQR--------------NMFDFKHIRPFDK-SFIDNPGPMVVFAT 162
+ +I N I++ F QR +DF+ +R D+ G V+ A+
Sbjct: 301 QTYIGAMNDNIKRLFRQRMAEAEASGDKSVSAGPWDFRFVRSLRSLERFDDVGGCVMLAS 360
Query: 163 PGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKL-------------- 208
PGML +G S + ++WAP E N ++M G+ V+GT+ ++L+ +++
Sbjct: 361 PGMLQTGTSRELLERWAPNERNGVVMTGYSVEGTMAKQILNEPEQIPAVMSRATTGLVRR 420
Query: 209 ------EFENKQIIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLK 262
E E K +I + ++ +SF+AH D I+ V+LVHGE +M+ LK
Sbjct: 421 GMAAGNEEEQKVMIPRRCTVDEVSFAAHVDGVENRNFIEEVSAPVVILVHGEKHQMMRLK 480
Query: 263 EKI-----KQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLKEEAVKYNSE-----PPN 312
K+ ++ + + P N E +V + K++ K PP+
Sbjct: 481 SKLLSLNAEKTVKVKVYTPGNCE------------EVRIPFKKDKIAKVVGRLAQITPPS 528
Query: 313 PLKERQI-HGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDSS 361
+ Q+ GVLV ++SLM D+ + G++ + ++ +S +S
Sbjct: 529 EQDDGQLMSGVLVQNGFNLSLMAPDDLREYAGLTTTAIICKQHITLSSAS 578
>gi|189208340|ref|XP_001940503.1| endoribonuclease YSH1 [Pyrenophora tritici-repentis Pt-1C-BFP]
gi|187976596|gb|EDU43222.1| endoribonuclease YSH1 [Pyrenophora tritici-repentis Pt-1C-BFP]
Length = 871
Score = 169 bits (429), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 101/319 (31%), Positives = 168/319 (52%), Gaps = 40/319 (12%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDF 59
MF +++ I++TGDY+ DRHL +A + + D+LITEST+ ++ + RE
Sbjct: 180 MFLMEIAGLKILFTGDYSREDDRHLVSASVPAGVKVDVLITESTFGISMHTPRVEREAQL 239
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYY 117
+K + + ++RGG+ L+PVFALGRAQEL ++L+ YW + + PIY+ L K Y
Sbjct: 240 MKAITDVLNRGGRALLPVFALGRAQELLLILDEYWSKHPEVQKIPIYYNSSLARKCMQVY 299
Query: 118 KMFITWTNQKIRKTFV-------------QRNMFDFKHIRPFDKSF--IDNPGPMVVFAT 162
+ +++ N I++ F +R +DFK +R KS D+ G V+ A+
Sbjct: 300 QTYVSAMNDNIKRLFAERMAEAEAAGDTGRRGAWDFKFVRSL-KSLERFDDLGGCVMLAS 358
Query: 163 PGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVL---------------SGVKK 207
PGM+ SG S + ++WAP N +I+ G+ V+GT+ +++ + +
Sbjct: 359 PGMMQSGTSRELLERWAPDPRNGVIITGYSVEGTMAKQIVHEPDQIPAIMTRASNTARRP 418
Query: 208 LEFENKQ-IIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKI- 265
+ EN+Q +I + ++ SF+AH D K M+ +Q V+LVHGE M LK K+
Sbjct: 419 GQRENEQTMIPRRCTVQEYSFAAHVDGKENMEFVQEVAAPVVILVHGEKGNMTRLKSKLL 478
Query: 266 ----KQEFNLDCFMPANGE 280
++ + PAN E
Sbjct: 479 SFNAQKAIPTKIYSPANCE 497
>gi|330923041|ref|XP_003300074.1| hypothetical protein PTT_11224 [Pyrenophora teres f. teres 0-1]
gi|311325959|gb|EFQ91831.1| hypothetical protein PTT_11224 [Pyrenophora teres f. teres 0-1]
Length = 705
Score = 169 bits (429), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 101/319 (31%), Positives = 168/319 (52%), Gaps = 40/319 (12%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDF 59
MF +++ I++TGDY+ DRHL +A + + D+LITEST+ ++ + RE
Sbjct: 180 MFLMEIAGLKILFTGDYSREDDRHLVSASVPAGVKVDVLITESTFGISMHTPRVEREAQL 239
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYY 117
+K + + ++RGG+ L+PVFALGRAQEL ++L+ YW + + PIY+ L K Y
Sbjct: 240 MKAITDVLNRGGRALLPVFALGRAQELLLILDEYWSKHPEVQKIPIYYNSSLARKCMQVY 299
Query: 118 KMFITWTNQKIRKTFV-------------QRNMFDFKHIRPFDKSF--IDNPGPMVVFAT 162
+ +++ N I++ F +R +DFK +R KS D+ G V+ A+
Sbjct: 300 QTYVSAMNDNIKRLFAERMAEAEAAGDTGRRGAWDFKFVRSL-KSLERFDDLGGCVMLAS 358
Query: 163 PGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVL---------------SGVKK 207
PGM+ SG S + ++WAP N +I+ G+ V+GT+ +++ + +
Sbjct: 359 PGMMQSGTSRELLERWAPDPRNGVIITGYSVEGTMAKQIVHEPDQIPAIMTRASNTARRP 418
Query: 208 LEFENKQ-IIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKI- 265
+ EN+Q +I + ++ SF+AH D K M+ +Q V+LVHGE M LK K+
Sbjct: 419 GQRENEQTMIPRRCTVQEYSFAAHVDGKENMEFVQEVAAPVVILVHGEKGNMTRLKSKLL 478
Query: 266 ----KQEFNLDCFMPANGE 280
++ + PAN E
Sbjct: 479 SFNAQKAIPTKIYSPANCE 497
>gi|322710530|gb|EFZ02104.1| cleavage and polyadenylation specifity factor [Metarhizium
anisopliae ARSEF 23]
Length = 831
Score = 169 bits (429), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 116/390 (29%), Positives = 194/390 (49%), Gaps = 53/390 (13%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDF 59
MF +++ +I +TGDY+ DRHL +A + K + D+LITESTY + RE+
Sbjct: 182 MFLIEIAGLNIFFTGDYSREQDRHLVSAEVPKDVKIDVLITESTYGIASHVPRLEREQAL 241
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYY 117
+K + ++RGG+ L+PVFALGRAQEL ++L+ YW + + PIY+A L K Y
Sbjct: 242 MKSITGILNRGGRALLPVFALGRAQELLLILDEYWGKHPEFQKYPIYYASNLARKCMVVY 301
Query: 118 KMFITWTNQKIRKTFVQR--------------NMFDFKHIRPFDK-SFIDNPGPMVVFAT 162
+ ++ N I++ F +R +DFK+IR D+ G V+ A+
Sbjct: 302 QTYVGAMNDNIKRLFRERMAEAEASGDGAGQGGPWDFKYIRSLKNLDRFDDVGGCVMLAS 361
Query: 163 PGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVL---------------SGVKK 207
PGML SG+S +F++WAP E N +I+ G+ V+GT+ +++ +G +
Sbjct: 362 PGMLQSGVSRELFERWAPSEKNGVIITGYSVEGTMARQIMQEPDQIPAVMSRNLAAGRRA 421
Query: 208 LEFENKQI-IDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKI- 265
E++++ I + ++ SF+AH D + I+ V+LVHGE M+ LK K+
Sbjct: 422 PGGESERVLIPRRCSVAEYSFAAHVDGVENREFIEEVAAPVVILVHGEQHNMMRLKSKLL 481
Query: 266 ----KQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLKEEAVKYNSEPP------NPLK 315
+ + + P N E ++ I + + + K S PP +
Sbjct: 482 SLNASKTTKVKVYSPRNCE--------ELRIPFKGDKMAKVVGKLASIPPPQDVNVDSAS 533
Query: 316 ERQIHGVLVIKDSSISLMDVDEACKEVGIS 345
+ GVLV D +SLM ++ + G++
Sbjct: 534 APLVTGVLVQNDFKLSLMAPEDLREYAGLN 563
>gi|116200035|ref|XP_001225829.1| hypothetical protein CHGG_08173 [Chaetomium globosum CBS 148.51]
gi|88179452|gb|EAQ86920.1| hypothetical protein CHGG_08173 [Chaetomium globosum CBS 148.51]
Length = 854
Score = 169 bits (428), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 123/411 (29%), Positives = 202/411 (49%), Gaps = 62/411 (15%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDF 59
MF +++ +I++TGDY+ DRHL +A + K R D+LITESTY + RE+
Sbjct: 180 MFLIEIAGLNILFTGDYSREQDRHLVSAEVPKGVRVDVLITESTYGIASHVPRLEREQAL 239
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYW--ERMNLQAPIYFAVGLTEKATNYY 117
+K + ++RGG+VL+PVFALGRAQEL ++L+ YW R + PIY+A L K Y
Sbjct: 240 MKSITGVLNRGGRVLMPVFALGRAQELLLILDEYWGKHRDYQRYPIYYASNLARKCMLVY 299
Query: 118 KMFITWTNQKIRKTFVQR--------------NMFDFKHIRPFDKSF--IDNPGPMVVFA 161
+ ++ N I++ F +R +DFK IR KS ++ G V+ A
Sbjct: 300 QTYVGAMNDNIKRLFRERMAEAEASGDGAGKGGPWDFKFIRSL-KSIDRFEDVGGCVMLA 358
Query: 162 TPGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEF----------- 210
+PGML +G+S + ++WAP E N +I+ G+ V+GT+ +++ ++++
Sbjct: 359 SPGMLQNGVSRELLERWAPSEKNGVIITGYSVEGTMAKQIMQEPEQIQAVMTRSSAGGRR 418
Query: 211 -----ENKQIIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKI 265
K +I + ++ SF+AH D + I+ V+LVHGE M+ LK K+
Sbjct: 419 APGGDSEKVMIPRRCTVQEYSFAAHVDGTENREFIEEVSAPVVILVHGEVHNMMRLKSKL 478
Query: 266 -----KQEFNLDCFMPANGESCFV--QTDMKISIDVSVNLLKEEAVKYNSEPPNPLK--- 315
+ + F P N E + +TD + V + P P+K
Sbjct: 479 LSLNANKTDKVKVFSPRNCEELRIPFKTDKTAKV-----------VGKLASIPQPMKIAA 527
Query: 316 ----ERQ-IHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDSS 361
E Q I GVLV D +SLM ++ + G++ + +++S +
Sbjct: 528 DDSEEPQLITGVLVQNDFKMSLMAPEDLREYAGLTTTTIACKQRLKLSAAG 578
>gi|429862463|gb|ELA37111.1| cleavage and polyadenylation specifity 73 kda [Colletotrichum
gloeosporioides Nara gc5]
Length = 831
Score = 169 bits (428), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 116/380 (30%), Positives = 189/380 (49%), Gaps = 43/380 (11%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDF 59
MF +++ I +TGDY+ DRHL +A + K + D+LITESTY + RE+
Sbjct: 176 MFLIEIAGLKIFFTGDYSREQDRHLVSAEVPKGVKIDVLITESTYGIASHVPRLEREQAL 235
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYY 117
+K + ++RGG+VL+PVFALGRAQEL ++L+ YW + + PIY+A L K Y
Sbjct: 236 MKSITGILNRGGRVLMPVFALGRAQELLLILDEYWGKHGEYQKFPIYYASNLARKCMVVY 295
Query: 118 KMFITWTNQKIRKTFVQR--------------NMFDFKHIRPFDK-SFIDNPGPMVVFAT 162
+ ++ N I++ F +R +DFK+IR D+ G V+ A+
Sbjct: 296 QTYVGAMNDNIKRLFRERMAEAEASGDGSGKGGPWDFKYIRSLKNLDRFDDVGGCVMLAS 355
Query: 163 PGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVL------SGVKKLEFENKQII 216
PGML +G+S + ++WAP E N +I+ G+ V+GT+ +++ V +
Sbjct: 356 PGMLQNGVSRELLERWAPNEKNGVIITGYSVEGTMAKQIMQEPDQIQAVMPPPARDADPE 415
Query: 217 DVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKI-----KQEFNL 271
+ ++ SF+AH D + I+ V+LVHGE M+ LK K+ + +
Sbjct: 416 ERARSVAEYSFAAHVDGTENREFIEEVAAPVVILVHGEQHNMMRLKSKLLSLNAGKTTKV 475
Query: 272 DCFMPANGESCFVQTDMKISIDVSVNLLKEEAVKYNSEPP----NPLKERQ--IHGVLVI 325
F P N E ++ I + + K + PP P +E+Q + GVLV
Sbjct: 476 KVFSPKNCE--------ELRIPFKQDKTAKVVGKLATIPPPTSLTPAQEQQQLVTGVLVQ 527
Query: 326 KDSSISLMDVDEACKEVGIS 345
D +SLM ++ + G++
Sbjct: 528 NDFKLSLMAPEDLREYAGLN 547
>gi|358396914|gb|EHK46289.1| hypothetical protein TRIATDRAFT_132454 [Trichoderma atroviride IMI
206040]
Length = 881
Score = 169 bits (428), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 114/386 (29%), Positives = 194/386 (50%), Gaps = 48/386 (12%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDF 59
MF +++ +I +TGDY+ DRHL +A + K + D+LITESTY + RE+
Sbjct: 190 MFLIEIAGLNIFFTGDYSREQDRHLVSAEVPKGLKIDVLITESTYGIASHVPRVEREQAL 249
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYY 117
+K + ++RGG+ L+PVFALGRAQEL ++L+ YW + + PIY+A L K Y
Sbjct: 250 MKSITGILNRGGRALLPVFALGRAQELLLILDEYWGKHTEFQKFPIYYASNLARKCMVIY 309
Query: 118 KMFITWTNQKIRKTFVQR--------------NMFDFKHIRPFDK-SFIDNPGPMVVFAT 162
+ ++ N I++ F +R +DFK+IR D+ G V+ A+
Sbjct: 310 QTYVGAMNDNIKRLFRERMAEAEASGDGAGKNGPWDFKYIRSLKNLDRFDDVGGCVMLAS 369
Query: 163 PGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVL--------------SGVKKL 208
PGML +G+S +F++WAP E N +I+ G+ V+GT+ +++ +G ++
Sbjct: 370 PGMLQNGVSRELFERWAPSEKNGVIITGYSVEGTMARQIMQEPDQIQAVMSRSIAGARRG 429
Query: 209 EFEN--KQIIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKI- 265
+ K +I + +++ SF+AH D + I+ V+LVHGE M+ LK K+
Sbjct: 430 PGADSEKVLIPRRCSVQEYSFAAHVDGVENREFIEEVAAPVVILVHGEQHNMMRLKSKLL 489
Query: 266 ----KQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLKEEAVKYNSEPPNPLKERQ--I 319
+ + + P N E + D + ++ + A + PP + +
Sbjct: 490 SLNASKTTKVKVYSPRNCEELRI----PFKADKTAKVVGKLA---SISPPQDIGASAPLV 542
Query: 320 HGVLVIKDSSISLMDVDEACKEVGIS 345
GVLV D +SLM ++ + G++
Sbjct: 543 TGVLVQNDFKLSLMAPEDLREYAGLN 568
>gi|10435258|dbj|BAB14541.1| unnamed protein product [Homo sapiens]
Length = 278
Score = 169 bits (427), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 76/83 (91%), Positives = 82/83 (98%)
Query: 3 QVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFLKK 62
Q+KVG++S+VYTGDYNMTPDRHLGAAWIDKCRP+LLITESTYATTIRDSKRCRERDFLKK
Sbjct: 143 QIKVGSESVVYTGDYNMTPDRHLGAAWIDKCRPNLLITESTYATTIRDSKRCRERDFLKK 202
Query: 63 VHECVDRGGKVLIPVFALGRAQE 85
VHE V+RGGKVLIPVFALGRAQE
Sbjct: 203 VHETVERGGKVLIPVFALGRAQE 225
>gi|367034742|ref|XP_003666653.1| hypothetical protein MYCTH_2311535 [Myceliophthora thermophila ATCC
42464]
gi|347013926|gb|AEO61408.1| hypothetical protein MYCTH_2311535 [Myceliophthora thermophila ATCC
42464]
Length = 879
Score = 169 bits (427), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 117/403 (29%), Positives = 200/403 (49%), Gaps = 46/403 (11%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDF 59
MF +++ +I++TGDY+ DRHL +A + K + D+LITESTY + RE+
Sbjct: 180 MFLIEIAGLNILFTGDYSREQDRHLVSAEVPKGVKIDVLITESTYGIASHVPRLEREQAL 239
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYY 117
+K + ++RGG+VL+PVFALGRAQEL ++L+ YW + + PIY+A L K Y
Sbjct: 240 MKSITSVLNRGGRVLMPVFALGRAQELLLILDEYWAKHKEYQKYPIYYASNLARKCMLVY 299
Query: 118 KMFITWTNQKIRKTFVQR--------------NMFDFKHIRPFDKSF--IDNPGPMVVFA 161
+ ++ N I++ F +R +DFK IR KS ++ G V+ A
Sbjct: 300 QTYVGAMNDNIKRLFRERMAEAEASGDGAGKGGPWDFKFIRSL-KSIDRFEDVGGCVMLA 358
Query: 162 TPGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEF----------- 210
+PGML +G+S + ++WAP E N +I+ G+ V+GT+ ++ ++++
Sbjct: 359 SPGMLQNGVSRELLERWAPSEKNGVIITGYSVEGTMAKHIMQEPEQIQAVMTRSSAGGRR 418
Query: 211 -----ENKQIIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKI 265
K +I + ++ SF+AH D + I+ V+LVHGE M+ LK K+
Sbjct: 419 GPGGDAEKVMIPRRCTVQEFSFAAHVDGTENREFIEEVGAPVVILVHGEVHNMMRLKSKL 478
Query: 266 -----KQEFNLDCFMPANGESCFV--QTDMKISIDVSVNLLKEEAVKYNSEPPNPLKERQ 318
+ + + P N E + +TD + + + + +E P R
Sbjct: 479 LSLNANKTNKVKVYSPRNCEELRIPFKTDKTAKVVGKLASIPQPMKMPKAESEEP---RL 535
Query: 319 IHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDSS 361
I GVLV D +SLM ++ + G++ + +++S +
Sbjct: 536 ITGVLVQNDFKMSLMAPEDLREYAGLTTTTIACKQRLRLSAAG 578
>gi|322699261|gb|EFY91024.1| cleavage and polyadenylation specifity factor [Metarhizium acridum
CQMa 102]
Length = 829
Score = 169 bits (427), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 116/390 (29%), Positives = 194/390 (49%), Gaps = 53/390 (13%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDF 59
MF +++ +I +TGDY+ DRHL +A + K + D+LITESTY + RE+
Sbjct: 182 MFLIEIAGLNIFFTGDYSREQDRHLVSAEVPKDVKIDVLITESTYGIASHVPRLEREQAL 241
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYY 117
+K + ++RGG+ L+PVFALGRAQEL ++L+ YW + + PIY+A L K Y
Sbjct: 242 MKSITGILNRGGRALLPVFALGRAQELLLILDEYWGKHPEFQKYPIYYASNLARKCMVVY 301
Query: 118 KMFITWTNQKIRKTFVQR--------------NMFDFKHIRPFDK-SFIDNPGPMVVFAT 162
+ ++ N I++ F +R +DFK+IR D+ G V+ A+
Sbjct: 302 QTYVGAMNDNIKRLFRERMAEAEASGDGAGQGGPWDFKYIRSLKNLDRFDDVGGCVMLAS 361
Query: 163 PGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVL---------------SGVKK 207
PGML SG+S +F++WAP E N +I+ G+ V+GT+ +++ +G +
Sbjct: 362 PGMLQSGVSRELFERWAPNEKNGVIITGYSVEGTMARQIMQEPDQIPAVMSRNLAAGRRA 421
Query: 208 LEFENKQI-IDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKI- 265
E++++ I + ++ SF+AH D + I+ V+LVHGE M+ LK K+
Sbjct: 422 PGGESERVLIPRRCSVAEYSFAAHVDGVENREFIEEVAAPVVILVHGEQHNMMRLKSKLL 481
Query: 266 ----KQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLKEEAVKYNSEPP------NPLK 315
+ + + P N E ++ I + + + K S PP +
Sbjct: 482 SLNASKTTKVKVYSPRNCE--------ELRIPFKGDKMAKVVGKLASIPPPQDVNVDSAS 533
Query: 316 ERQIHGVLVIKDSSISLMDVDEACKEVGIS 345
+ GVLV D +SLM ++ + G++
Sbjct: 534 APLVTGVLVQNDFKLSLMAPEDLREYAGLN 563
>gi|224140917|ref|XP_002323823.1| predicted protein [Populus trichocarpa]
gi|222866825|gb|EEF03956.1| predicted protein [Populus trichocarpa]
Length = 250
Score = 168 bits (426), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 86/209 (41%), Positives = 122/209 (58%), Gaps = 9/209 (4%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V + ++YTGDY+ DRHL AA + + PD+ I ESTY + + RE+ F
Sbjct: 45 MFMVDIAGVRVLYTGDYSREEDRHLRAAEMPQFSPDICIIESTYGVQLHQPRHLREKRFT 104
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW----ERMNLQAPIYFAVGLTEKATNY 116
+H + GG+VLIP FALGRAQEL ++L+ YW E N+ PIY+A L +K
Sbjct: 105 DVIHSTISLGGRVLIPAFALGRAQELLLILDEYWANHPELHNI--PIYYASPLAKKCMTV 162
Query: 117 YKMFITWTNQKIRKTFVQRNMFDFKHIRPFD--KSFIDNPGPMVVFATPGMLHSGLSLII 174
Y+ +I N++IR F N F FKHI P + + F D GP VV A+PG L SGLS +
Sbjct: 163 YQTYILSMNERIRNQFANSNPFKFKHISPLNSIEDFSD-VGPSVVMASPGGLQSGLSRQL 221
Query: 175 FKKWAPVENNMLIMPGFCVQGTIGHKVLS 203
F W + N ++PG+ V+GT+ +++
Sbjct: 222 FDMWCSDKKNACVLPGYVVEGTLAKTIIN 250
>gi|389634325|ref|XP_003714815.1| endoribonuclease YSH1 [Magnaporthe oryzae 70-15]
gi|351647148|gb|EHA55008.1| endoribonuclease YSH1 [Magnaporthe oryzae 70-15]
gi|440467574|gb|ELQ36790.1| endoribonuclease YSH1 [Magnaporthe oryzae Y34]
gi|440483131|gb|ELQ63565.1| endoribonuclease YSH1 [Magnaporthe oryzae P131]
Length = 829
Score = 168 bits (426), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 95/298 (31%), Positives = 160/298 (53%), Gaps = 33/298 (11%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDF 59
MF +++ +I +TGDY+ DRHL +A + + + D+LITESTY + RE+
Sbjct: 180 MFLIEIAGMNIFFTGDYSREQDRHLVSAEVPRGVKIDVLITESTYGIASHVPRVEREQAL 239
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYY 117
+K + ++RGG+VL+PVFALGRAQEL ++L+ YW + + PIY+A L K Y
Sbjct: 240 MKSITGILNRGGRVLMPVFALGRAQELLLILDEYWGKHQEYQKVPIYYASNLARKCMVVY 299
Query: 118 KMFITWTNQKIRKTFVQR---------------NMFDFKHIRPFDK-SFIDNPGPMVVFA 161
+ ++ N I++ F +R +DFK+IR D+ GP V+ A
Sbjct: 300 QTYVGAMNDNIKRLFRERLAEAEASGKSGAGGGGPWDFKYIRSLKNLDRFDDLGPCVMLA 359
Query: 162 TPGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEF----------- 210
+PGML +G+S + ++WAP + N +++ G+ V+GT+ +++ +++
Sbjct: 360 SPGMLQNGVSRELLERWAPSDKNGVVITGYSVEGTMAKQIMQEPEQIPAIMTRNAAAASR 419
Query: 211 ---ENKQIIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKI 265
K +I + ++ SF+AH D + I+ V+LVHGE + M+ LK K+
Sbjct: 420 RADGEKPMIPRRCSVAEFSFAAHVDGTENREFIEEVAAPVVILVHGEQNNMMRLKSKL 477
>gi|358378169|gb|EHK15851.1| hypothetical protein TRIVIDRAFT_65314 [Trichoderma virens Gv29-8]
Length = 873
Score = 168 bits (426), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 117/391 (29%), Positives = 194/391 (49%), Gaps = 58/391 (14%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDF 59
MF +++ +I +TGDY+ DRHL +A + K + D+LITESTY + RE+
Sbjct: 194 MFLIEIAGLNIFFTGDYSREQDRHLVSAEVPKGLKIDVLITESTYGIASHVPRLEREQAL 253
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYY 117
+K + ++RGG+ L+PVFALGRAQEL ++L+ YW + + PIY+A L K Y
Sbjct: 254 MKSITGILNRGGRALLPVFALGRAQELLLILDEYWGKHPEFQRFPIYYASNLARKCMVIY 313
Query: 118 KMFITWTNQKIRKTFVQR--------------NMFDFKHIRPFDK-SFIDNPGPMVVFAT 162
+ ++ N I++ F +R +DFK+IR D+ G V+ A+
Sbjct: 314 QTYVGAMNDNIKRLFRERMAEAEASGDAAGKNGPWDFKYIRSLKNLDRFDDVGGCVMLAS 373
Query: 163 PGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVL--------------SGVKKL 208
PGML +G+S +F++WAP E N +I+ G+ V+GT+ +++ +G ++
Sbjct: 374 PGMLQNGVSRELFERWAPSEKNGVIITGYSVEGTMARQIMQEPDQIQAVMSRSIAGARRG 433
Query: 209 EFEN--KQIIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKI- 265
+ K +I + +++ SF+AH D + I+ V+LVHGE M+ LK K+
Sbjct: 434 PGADSEKVLIPRRCSVQEYSFAAHVDGVENREFIEEVAAPVVILVHGEQHNMMRLKSKLL 493
Query: 266 ----KQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLKEEAVKYNSEPP-------NPL 314
+ + + P N E ++ I + + K S PP PL
Sbjct: 494 SLNASKTTKVKVYSPRNCE--------ELRIPFKADKTAKVVGKLASIPPPKDIDASAPL 545
Query: 315 KERQIHGVLVIKDSSISLMDVDEACKEVGIS 345
+ GVLV D +SLM ++ + G++
Sbjct: 546 ----VTGVLVQNDFKLSLMAPEDLREYAGLN 572
>gi|340521586|gb|EGR51820.1| predicted protein [Trichoderma reesei QM6a]
Length = 887
Score = 168 bits (425), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 98/299 (32%), Positives = 161/299 (53%), Gaps = 34/299 (11%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDF 59
MF +++ +I +TGDY+ DRHL +A + K + D+LITESTY + RE+
Sbjct: 216 MFLIEIAGLNIFFTGDYSREQDRHLVSAEVPKGIKIDVLITESTYGIASHVPRLEREQAL 275
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYY 117
+K + ++RGG+ L+PVFALGRAQEL ++L+ YW + + PIY+A L K Y
Sbjct: 276 MKSITGILNRGGRALLPVFALGRAQELLLILDEYWAKHPEYQKFPIYYASNLARKCMVIY 335
Query: 118 KMFITWTNQKIRKTFVQR--------------NMFDFKHIRPFDK-SFIDNPGPMVVFAT 162
+ ++ N I++ F +R +DFK+IR D+ G V+ A+
Sbjct: 336 QTYVGAMNDNIKRLFRERMAEAEASGDSAGKNGPWDFKYIRSLKNLDRFDDVGGCVMLAS 395
Query: 163 PGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVL--------------SGVKKL 208
PGML +G+S +F++WAP E N +I+ G+ V+GT+ +++ +G ++
Sbjct: 396 PGMLQNGVSRELFERWAPSEKNGVIITGYSVEGTMARQIMQEPDQIQAVMSRSIAGARRG 455
Query: 209 EFEN--KQIIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKI 265
+ K +I + +++ SF+AH D + I+ V+LVHGE M+ LK K+
Sbjct: 456 PGSDSEKVLIPRRCSVQEYSFAAHVDGVENREFIEEVAAPVVILVHGEQHNMMRLKSKL 514
>gi|380494427|emb|CCF33158.1| endoribonuclease YSH1 [Colletotrichum higginsianum]
Length = 846
Score = 168 bits (425), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 116/391 (29%), Positives = 196/391 (50%), Gaps = 54/391 (13%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDF 59
MF +++ I +TGDY+ DRHL +A + K + D+LITESTY + RE+
Sbjct: 180 MFLIEIAGLKIFFTGDYSREQDRHLVSAEVPKGVKIDVLITESTYGIASHVPRLEREQAL 239
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYY 117
+K + ++RGG+VL+PVFALGRAQEL ++L+ YW + + + PIY+A L K Y
Sbjct: 240 MKSITSILNRGGRVLMPVFALGRAQELLLILDEYWGKHSEFQKFPIYYASNLARKCMVVY 299
Query: 118 KMFITWTNQKIRKTFVQR--------------NMFDFKHIRPFDK-SFIDNPGPMVVFAT 162
+ ++ N I++ F +R +DFK+IR D+ G V+ A+
Sbjct: 300 QTYVGAMNDNIKRLFRERMAEAEASGDGSGKGGPWDFKYIRSLKNLDRFDDVGGCVMLAS 359
Query: 163 PGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVL--------------SGVKKL 208
PGML +G+S + ++WAP + N +I+ G+ V+GT+ +++ +G ++
Sbjct: 360 PGMLQNGVSRELLERWAPNDKNGVIITGYSVEGTMAKQIMQEPDQIQAVMSRTTAGARRG 419
Query: 209 EF--ENKQIIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKI- 265
K +I + ++ SF+AH D + I+ V+LVHGE M+ LK K+
Sbjct: 420 PGGESEKVLIPRRCSVAEYSFAAHVDGTENREFIEEVAAPVVILVHGEQHNMMRLKSKLL 479
Query: 266 ----KQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLKEEAVKYNSEPP-----NPLKE 316
+ + F P N E ++ I + + + K + PP +P +
Sbjct: 480 SLNAGKTQKVKVFSPKNCE--------ELRIPFKQDKMAKVVGKLATIPPPTSLSSPTQH 531
Query: 317 RQ--IHGVLVIKDSSISLMDVDEACKEVGIS 345
+Q + GVLV D +SLM ++ + G++
Sbjct: 532 QQQLVTGVLVQNDFKLSLMAPEDLREYAGLN 562
>gi|395538078|ref|XP_003771013.1| PREDICTED: uncharacterized protein C12orf26 homolog [Sarcophilus
harrisii]
Length = 624
Score = 168 bits (425), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 155/607 (25%), Positives = 278/607 (45%), Gaps = 84/607 (13%)
Query: 478 QKETMELIKNTIDEMIEFITPNIACVNTHTVHFMVEKLWETTI---PKAILSQVNN---- 530
+ E +++++ + E++ F+ + N HTV F LWE + P ++ + +
Sbjct: 9 ESEELDVLRGKLRELLRFLRKALPIANAHTVDFYTGSLWERFVEGSPDKVMEALRSRERP 68
Query: 531 ---SPSF--DSLLEEFWKSR-----QDNHVETNSELVKFFQAADKFRLTSLMNGESIITV 580
+P+ + LLE +S + E + +LV A + SL ++
Sbjct: 69 AEAAPAHCAERLLEGAGESSGISEFSNIFCENSQKLVNVEAFAFAAKYYSLPYFGVCTSL 128
Query: 581 DELFSKLQARKCGQVVESAKVSQLMSEKKSYEVQVMSQVVAAVTNSCDSSHIIDLGSTIT 640
++L + + + K M+ KKS+EVQVMS+++ + + C +IDLGS
Sbjct: 129 EQLLITFNGDQRLRPGNNVKPDDFMNAKKSHEVQVMSELIGCIADFCGIKQVIDLGSGKG 188
Query: 641 VDELFSKLQARKCGLVVESAKVS---------QLMSEKKSYEVQVMSQVVAAVTNSCDSS 691
F +Q ++S+ V+ +L K Y+ Q + + A V S
Sbjct: 189 YLSSFLSMQYGLKVYGIDSSNVNTHGAKERNRKLKKHWKVYQTQSRAHIDAQVLEKVKES 248
Query: 692 HIIDLGGGQGYLSTILALQHGKKTLSLDYNQVNTHGAAVRSKKLEDSEK----------- 740
D + + +K LS++ +N +V L D
Sbjct: 249 TAQDEQKCKTKIK--------EKLLSINSCLINQDKRSVSDSVLSDFTDTVISNVTRQME 300
Query: 741 ---------------ESKGPFQSYAGVINKKLWMRTQNRQTSHASVEHHGKNWKRKSKAP 785
E++ P + V KKL S A+ ++ + + K +
Sbjct: 301 NHTDPLPQTAGSFLLENRSPLLNVLPVDAKKL--------ASPATKDYRKLSEEHKERKK 352
Query: 786 VVSIKDEELVVCKDKC-KQITHFVTPDSDISSILSQAYPQDSLHNVCIMGLHTCGDLSGT 844
V+IKD+ + +T F+T +S++ I++ L + ++GLHTCGDL+
Sbjct: 353 TVTIKDQTKESIESSIYSPLTSFITAESELCDIITD------LEDCIMVGLHTCGDLAAN 406
Query: 845 ALRLFT-KSSLQCLVQVGCCYHLLEEEFIRSPFWKDVDQSLYEHGYGFPLSEHLRSRKFF 903
LR+FT KS ++ + VGCCYHLL E++ K S E +GFPL ++L+ +
Sbjct: 407 TLRIFTSKSEIKGVCSVGCCYHLLSEKY-----EKQCKDSTLEM-WGFPLCQYLKEEGWC 460
Query: 904 LGRNVRMSGTQSPERVIDLKQTQTLPLFYRALLEKYLRSKITINDEEPKVVGRLATKCSN 963
GRN RMS + ERV+ + T LFYRA+L++ ++ I E K VG++ +K ++
Sbjct: 461 CGRNARMSACLALERVVVGQGLATESLFYRAVLQEIIKECYGIA-ECDKHVGKVYSKSTS 519
Query: 964 FVEYVHRAVDKLKL-DLEVDDEEVTRLFNSHQREYEYLQIYYFLKTALAPVIEALIVLDR 1022
F++YV +++ KL+L D ++ ++++ + ++ L+ + LK LAP IE LI++DR
Sbjct: 520 FLDYVRKSLKKLELDDSKLTEKKIMEYYEKYKLRINELEAFNMLKVILAPCIETLILMDR 579
Query: 1023 VLYLREQ 1029
+ YL+EQ
Sbjct: 580 LCYLKEQ 586
Score = 40.0 bits (92), Expect = 7.8, Method: Compositional matrix adjust.
Identities = 16/51 (31%), Positives = 33/51 (64%)
Query: 1044 TQVDDEEVTRLFDSHQREYEYLQIYYFLKTALAPVIEALIVLDRVLYLREQ 1094
+++ ++++ ++ ++ L+ + LK LAP IE LI++DR+ YL+EQ
Sbjct: 536 SKLTEKKIMEYYEKYKLRINELEAFNMLKVILAPCIETLILMDRLCYLKEQ 586
>gi|348580367|ref|XP_003475950.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein C12orf26-like
[Cavia porcellus]
Length = 596
Score = 167 bits (424), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 126/473 (26%), Positives = 221/473 (46%), Gaps = 96/473 (20%)
Query: 634 DLGSTITVDELFSKLQARKCGLVVESAKVSQLMSEKKSYEVQVMSQVVAAVTNSCDSSHI 693
+LG +++L L+ K + K + M+ KKS+EVQ MS++++++ + +
Sbjct: 115 NLGICTPLEQLLIALRGNKKQRTDNNLKPHEFMNVKKSHEVQAMSELISSIADYYGVKQV 174
Query: 694 IDLGGGQGYLSTILALQ-------------HGKKT---------LSLDYNQVNTHGAAVR 731
ID G G+GYLS L+L HG ++ L ++V+ G A++
Sbjct: 175 IDXGSGKGYLSXFLSLSMATCLWMILNSNTHGARSGQELKKRWKLHRTQSRVDVSGLALK 234
Query: 732 SKKLEDSEKESK----------------------GPFQSYAGVINKKLWMRTQNRQT--- 766
K + + E K G Q ++G + + ++ +N T
Sbjct: 235 MAKEKKVQNEMKYKADTETLYSNSSAKQEKISTSGFLQDFSGPVISDIRIQIENLHTQPY 294
Query: 767 -------------------------SHASVEHHGKNWKRKSKAPVVSIKDEELVVCKDKC 801
S + + + K + +VS E +
Sbjct: 295 EEENVCFMNAFSLIDFLPIDAIEPISSLQIPNREMSEANKERRKMVSQSSESNIY----- 349
Query: 802 KQITHFVTPDSDISSILSQAYPQDSLHNVCIMGLHTCGDLSGTALRLFTKSS-LQCLVQV 860
+T F+T DS++ I+ L + ++GLHTCGDL+ LR+FT +S ++ + V
Sbjct: 350 SPLTSFITADSELHDIIKD------LEDCVMVGLHTCGDLAPNTLRIFTSNSEIKGVCSV 403
Query: 861 GCCYHLLEEEFIRSPFWKDVDQSLYEHGYGFPLSEHLRSRKFFLGRNVRMSGTQSPERVI 920
GCCYHLL EEF ++ + E +GFP+ ++L+ ++ G N RMS + ERV
Sbjct: 404 GCCYHLLSEEF------ENEHKECTEEKWGFPMCQYLKEERWCCGHNARMSACLALERVA 457
Query: 921 DLKQTQTLPLFYRALLEKYLRS--KITINDEEPKVVGRLATKCSNFVEYVHRAVDKLKLD 978
+ T LFYRA+L+ ++ IT E VG++ +KCS+F++YV +++ KL LD
Sbjct: 458 VGQGLSTESLFYRAVLQDIIKDCYGITKCSEH---VGKIYSKCSSFLDYVRKSLKKLGLD 514
Query: 979 -LEVDDEEVTRLFNSHQREYEYLQIYYFLKTALAPVIEALIVLDRVLYLREQQ 1030
++ ++ + + ++ L+ + LK LAP +E LI+LDR+ YL+EQ+
Sbjct: 515 ETQLPEKTIMDYYEKYKPRMNELEAFNMLKVVLAPCVETLILLDRLCYLKEQE 567
Score = 43.1 bits (100), Expect = 0.83, Method: Compositional matrix adjust.
Identities = 18/52 (34%), Positives = 33/52 (63%)
Query: 1044 TQVDDEEVTRLFDSHQREYEYLQIYYFLKTALAPVIEALIVLDRVLYLREQQ 1095
TQ+ ++ + ++ ++ L+ + LK LAP +E LI+LDR+ YL+EQ+
Sbjct: 516 TQLPEKTIMDYYEKYKPRMNELEAFNMLKVVLAPCVETLILLDRLCYLKEQE 567
>gi|310796189|gb|EFQ31650.1| metallo-beta-lactamase superfamily protein [Glomerella graminicola
M1.001]
Length = 855
Score = 167 bits (424), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 116/391 (29%), Positives = 195/391 (49%), Gaps = 54/391 (13%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDF 59
MF +++ I +TGDY+ DRHL +A + K + D+LITESTY + RE+
Sbjct: 180 MFLIEIAGLKIFFTGDYSREQDRHLVSAEVPKGVKIDVLITESTYGIASHVPRLEREQAL 239
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYY 117
+K + ++RGG+VL+PVFALGRAQEL ++L+ YW + + PIY+A L K Y
Sbjct: 240 MKSITSILNRGGRVLMPVFALGRAQELLLILDEYWGKHAEFQKFPIYYASNLARKCMVVY 299
Query: 118 KMFITWTNQKIRKTFVQR--------------NMFDFKHIRPFDK-SFIDNPGPMVVFAT 162
+ ++ N I++ F +R +DFK+IR D+ G V+ A+
Sbjct: 300 QTYVGAMNDNIKRLFRERMAEAEASGDGSGKGGPWDFKYIRSLKNLDRFDDVGGCVMLAS 359
Query: 163 PGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVL--------------SGVKKL 208
PGML +G+S + ++WAP + N +I+ G+ V+GT+ +++ +G ++
Sbjct: 360 PGMLQNGVSRELLERWAPNDKNGVIITGYSVEGTMAKQIMQEPDQIQAVMSRTTAGARRG 419
Query: 209 EF--ENKQIIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKI- 265
K I + ++ SF+AH D + I+ V+LVHGE M+ LK K+
Sbjct: 420 PGGESEKLFIPRRCSVAEYSFAAHVDGTENREFIEEVAAPVVILVHGEQHNMMRLKSKLL 479
Query: 266 ----KQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLKEEAVKYNSEPP-----NPLKE 316
+ + F P N E ++ I + + + K + PP +P ++
Sbjct: 480 SLNAGKTQKVKVFSPKNCE--------ELRIPFKQDKMAKVVGKLATIPPPTSLSSPTQQ 531
Query: 317 RQ--IHGVLVIKDSSISLMDVDEACKEVGIS 345
+Q + GVLV D +SLM ++ + G++
Sbjct: 532 QQQLVTGVLVQNDFKLSLMAPEDLREYAGLN 562
>gi|367054168|ref|XP_003657462.1| hypothetical protein THITE_2123200 [Thielavia terrestris NRRL 8126]
gi|347004728|gb|AEO71126.1| hypothetical protein THITE_2123200 [Thielavia terrestris NRRL 8126]
Length = 859
Score = 167 bits (423), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 116/385 (30%), Positives = 192/385 (49%), Gaps = 41/385 (10%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDF 59
MF +++ +I++TGDY+ DRHL +A + K + D+LITESTY + RE+
Sbjct: 180 MFLIEIAGLNILFTGDYSREQDRHLVSAEVPKGVKIDVLITESTYGVASHIPRLEREQAL 239
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYY 117
+K + ++RGG+VL+PVFALGRAQEL ++L+ YW + + PIY+A L K Y
Sbjct: 240 MKSITGILNRGGRVLMPVFALGRAQELLLILDEYWGKHKEYQKYPIYYASNLARKCMLVY 299
Query: 118 KMFITWTNQKIRKTFVQR--------------NMFDFKHIRPFDKSF--IDNPGPMVVFA 161
+ ++ N I++ F +R +DFK IR KS D+ G V+ A
Sbjct: 300 QTYVGAMNDNIKRLFRERMAEAEASGDAAGKGGPWDFKFIRSL-KSIDRFDDVGGCVMLA 358
Query: 162 TPGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVL--------------SGVKK 207
+PGML +G+S + ++WAP E N +I+ G+ V+GT+ +++ +G ++
Sbjct: 359 SPGMLQNGVSRELLERWAPSEKNGVIITGYSVEGTMAKQLMQEPDQIQAVMTRSSAGGRR 418
Query: 208 LEFEN--KQIIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKI 265
+ K +I + ++ SF+AH D + I+ V+LVHGE M+ LK K+
Sbjct: 419 APGGDAEKVMIPRRCTVQEFSFAAHVDGTENREFIEEVGAPVVILVHGEVHNMMRLKSKL 478
Query: 266 -----KQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLKEEAVKYNSEPPNPLKERQIH 320
+ + + P N E + + V L + N + + I
Sbjct: 479 LSLNANKTEKVKVYSPRNCEELRIPFKTDKTAKVVGKLASIPQPMKMPKDDNAQEPQLIT 538
Query: 321 GVLVIKDSSISLMDVDEACKEVGIS 345
GVLV D +SLM ++ + G++
Sbjct: 539 GVLVQNDFKMSLMAPEDLREYAGLT 563
>gi|407919362|gb|EKG12612.1| Beta-lactamase-like protein [Macrophomina phaseolina MS6]
Length = 842
Score = 167 bits (422), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 120/397 (30%), Positives = 196/397 (49%), Gaps = 54/397 (13%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDF 59
MF +++ I++TGDY+ DRHL +A + K + D+LITEST+ + RE
Sbjct: 180 MFLIEIAGLKILFTGDYSREEDRHLISAEVPKNVKVDVLITESTFGIASHVPRLEREAAL 239
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYY 117
+K + ++RGG+ L+PVFALGRAQEL ++L+ YW + + PIY+A + K Y
Sbjct: 240 MKSITGIINRGGRALLPVFALGRAQELLLILDEYWAKHPEFQKIPIYYASNIARKCMVVY 299
Query: 118 KMFITWTNQKIRKTFVQR-------------NMFDFKHIRPFDKSF--IDNPGPMVVFAT 162
+ ++ N I++ F +R +DFK++R KS D+ G V+ A+
Sbjct: 300 QTYVYAMNDNIKRLFRERMEEAERNGDASKAGPWDFKYVRSL-KSLERFDDVGSCVMLAS 358
Query: 163 PGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEF------------ 210
PGM+ +G+S + ++WAP + N +IM G+ V+GT+G +L +++
Sbjct: 359 PGMMQNGVSRELLERWAPDQRNGVIMTGYSVEGTMGKMILHEPEQIPAVMTRANVARRGP 418
Query: 211 --ENKQI-IDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQ 267
E +Q+ + + +++ SF+AH D + I+ V+LVHGE M+ LK K+
Sbjct: 419 NREGEQVLLPRRCSVQEFSFAAHVDGTENREFIEEVSAPVVILVHGEKHNMMRLKSKLLS 478
Query: 268 EFNLD------CFMPANGESCFVQTDMKISIDVSVNLL----------KEEAVKYNSEPP 311
N D F PAN E + V L K+E ++
Sbjct: 479 -LNADKKQPVKVFSPANCEELRIPFKADKIAKVVGKLAQIPPPMPHRPKQEDGTDDAREG 537
Query: 312 NPLKERQ---IHGVLVIKDSSISLMDVDEACKEVGIS 345
+ERQ I GVLV D +SLM ++ + G++
Sbjct: 538 GVEEERQTQLISGVLVQNDFKMSLMAPEDLREYAGLT 574
>gi|328853485|gb|EGG02623.1| hypothetical protein MELLADRAFT_38438 [Melampsora larici-populina
98AG31]
Length = 672
Score = 166 bits (421), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 112/371 (30%), Positives = 186/371 (50%), Gaps = 41/371 (11%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAA----WIDKCRPDLLITESTYATTIRDSKRCRE 56
MF +++ ++YTGDY+ DRHL A W +K PD++I ESTY + + +E
Sbjct: 152 MFLIEIAGIRVLYTGDYSTEEDRHLIPAKVPNWNEK--PDVMICESTYGVQSLEPRPEKE 209
Query: 57 RDFLKKVHECVDRGGKVLIPVFALGRAQELCILLETYWER---MNLQAPIYFAVGLTEKA 113
F V + RGG+VL+PVFALGRAQEL ++L+ YW +N PIY+ L K
Sbjct: 210 ERFTALVQMILKRGGRVLMPVFALGRAQELLLILDEYWSNHPELN-SIPIYYISNLAAKC 268
Query: 114 TNYYKMFITWTNQKIRK---------TFVQRNMFDFK-----HIRPFDKSFIDNPGPMVV 159
Y+ FI N++I+ TF + FK +++ DK D+ GP VV
Sbjct: 269 MKVYQTFIHGMNEEIKSKFNKGINPWTFFREGKGLFKKGYVTNLKTLDK--FDDRGPCVV 326
Query: 160 FATPGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVK 219
A+PG + +G S + ++WAP N L++ G+ ++GT+ ++L +++ I +
Sbjct: 327 MASPGFMTNGASRELLERWAPDRRNGLLVTGYSIEGTMAREMLKEPQEITGIKGNKIPRR 386
Query: 220 MAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF-----NLDCF 274
+ + Y+SFSAH D I P +++LVHGE + M L+ +K ++ + +
Sbjct: 387 IEVIYISFSAHVDYTQNAAFIDQVMPTHLVLVHGEFNNMTRLRNALKDKYLLAKNEMQIY 446
Query: 275 MPANGESCFVQTDMKISIDVSVNLLKEEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMD 334
P N E+ ++ I ++ L + + + I G+LV KDS+ +L+
Sbjct: 447 TPRNVETLRIKFKAS-RIARAIGSLADTNLSVG---------QTISGILVSKDSAYTLLS 496
Query: 335 VDEACKEVGIS 345
+ G+S
Sbjct: 497 STDLKDYTGLS 507
>gi|171689890|ref|XP_001909884.1| hypothetical protein [Podospora anserina S mat+]
gi|170944907|emb|CAP71018.1| unnamed protein product [Podospora anserina S mat+]
Length = 835
Score = 166 bits (421), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 117/401 (29%), Positives = 197/401 (49%), Gaps = 46/401 (11%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDF 59
MF +++ +I +TGDY+ DRHL +A + + + D+LITESTY + RE+
Sbjct: 181 MFLIEIAGLNIFFTGDYSREQDRHLVSAQVPRGVKIDVLITESTYGIASHVPRLEREQAL 240
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYY 117
+K + ++RGG+VL+PVFALGRAQEL ++L+ YW + + PIY+A L K Y
Sbjct: 241 MKSITGILNRGGRVLMPVFALGRAQELLLILDEYWGKHAEYQKYPIYYASNLARKCMLVY 300
Query: 118 KMFITWTNQKIRKTFVQR--------------NMFDFKHIRPFDKSF--IDNPGPMVVFA 161
+ ++ N I++ F +R +DFK IR KS ++ G V+ A
Sbjct: 301 QTYVGAMNDNIKRLFRERMAEAEASGDGAGKGGPWDFKFIRSL-KSIDRFEDVGGCVMLA 359
Query: 162 TPGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEF----------- 210
+PGML +G+S + ++WAP E N +I+ G+ V+GT+ +++ + ++
Sbjct: 360 SPGMLQNGVSRELLERWAPSEKNGVIITGYSVEGTMAKQIMQEPEHIQAVMSRNTGGGRR 419
Query: 211 -----ENKQIIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKI 265
K +I + ++ SF+AH D + I+ V+LVHGE M+ LK K+
Sbjct: 420 GPGRDAEKVLIPRRCTVQEYSFAAHVDGTENREFIEEVAAPVVILVHGEVHNMMRLKSKL 479
Query: 266 -----KQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLKEEAVKYNSEPPNPLKERQIH 320
+ + F P N E + + V L ++ S P P + I
Sbjct: 480 LSLNANKTSKVKVFSPKNCEELRIPFKTDKTAKVVGKLAS--ILQPLSSPEEP---QLIT 534
Query: 321 GVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDSS 361
GVLV D +SLM ++ + G++ + +++S +
Sbjct: 535 GVLVQNDFKMSLMAPEDLREFAGLTTTTISCKQRMRLSAAG 575
>gi|308807807|ref|XP_003081214.1| mRNA cleavage and polyadenylation factor II complex, BRR5 (CPSF
subunit) (ISS) [Ostreococcus tauri]
gi|116059676|emb|CAL55383.1| mRNA cleavage and polyadenylation factor II complex, BRR5 (CPSF
subunit) (ISS) [Ostreococcus tauri]
Length = 572
Score = 166 bits (420), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 96/261 (36%), Positives = 138/261 (52%), Gaps = 19/261 (7%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V +G ++YTGDY+ DRHL AA I P ++I ESTY + + RE
Sbjct: 169 MFNVDIGGLRVLYTGDYSRIADRHLPAADIPAIPPHVVIVESTYGVSPHSPREERE---- 224
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW-ERMNLQ-APIYFAVGLTEKATNYYK 118
+ RAQEL ++LE +W + +LQ PIY A L KA Y+
Sbjct: 225 ------------IRXXXXXXXRAQELLLILEDFWAQNPDLQRVPIYQASTLARKAMTIYQ 272
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
+I N ++ F + N F F H++ K S +D+ GP VV ATP ML SGLS +F+
Sbjct: 273 TYINVLNADMKAAFEEANPFVFNHVKHISKASELDDVGPCVVLATPSMLQSGLSRELFES 332
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
W N +I+ F VQGT+ ++LS V K+ + Q + + M+++ +SFSAHAD
Sbjct: 333 WCEEPKNGVIIADFAVQGTLAREILSDVNKVTARDGQELQLNMSVDAISFSAHADYPQTQ 392
Query: 238 QLIQYCEPKNVLLVHGEASKM 258
+ P +V+LVHGEA +M
Sbjct: 393 AFLDTLAPPHVVLVHGEAGEM 413
>gi|403222958|dbj|BAM41089.1| cleavage and polyadenylation specificty factor subunit [Theileria
orientalis strain Shintoku]
Length = 700
Score = 166 bits (420), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 93/285 (32%), Positives = 160/285 (56%), Gaps = 10/285 (3%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V++ I+YTGDY++ D+HL +A I LLI+ESTY + + + RE F+
Sbjct: 192 MFLVEIDGVRILYTGDYSVEKDKHLPSAEIPLINVHLLISESTYGIRVHEERGQRESRFM 251
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW--ERMNLQAPIYFAVGLTEKATNYYK 118
V + + R GK L+PVFALGR+QE+ ++L+ YW R PI++ L K+ Y+
Sbjct: 252 HVVLDIIMREGKCLLPVFALGRSQEILLILDEYWANNRQLQNVPIFYISPLASKSLKVYE 311
Query: 119 MFITWTNQKIRKT-FVQRNMFDFKHIRPFD-----KSFIDNPGPMVVFATPGMLHSGLSL 172
F+ I+++ + N F+FK ++ ++++ GP ++ +PGML G SL
Sbjct: 312 TFVGLCGDYIKESIYNGHNPFNFKFVKYARSVRQIRNYLLREGPCIIMTSPGMLQGGPSL 371
Query: 173 IIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHAD 232
+F+ +P N +++ G+ V+GT+ ++ + + K +I + ++E +SFSAHAD
Sbjct: 372 EVFELISPDNRNGVVLTGYTVKGTLADELKKDPELINLGTK-VIKPRCSVEQISFSAHAD 430
Query: 233 AKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF-NLDCFMP 276
+ I+ NV+LVHGE +M +++K+ ++ L FMP
Sbjct: 431 YNQTKEFIKKLSVPNVILVHGERGEMKRMRDKLIEDIPQLTVFMP 475
>gi|195436260|ref|XP_002066087.1| GK22174 [Drosophila willistoni]
gi|194162172|gb|EDW77073.1| GK22174 [Drosophila willistoni]
Length = 490
Score = 165 bits (418), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 111/365 (30%), Positives = 190/365 (52%), Gaps = 41/365 (11%)
Query: 662 VSQLMSEKKSYEVQVMSQVVAAV---TNSCDSSHIIDLGGGQGYLSTILALQHGKKTLSL 718
+ + MS KK +EV++ + +V + T ++ +++D G G+GYLS+ LALQ+ + L +
Sbjct: 117 IKEFMSAKKCHEVELTASLVDRLIQSTGEAENFYVVDAGDGKGYLSSRLALQYKYRVLGI 176
Query: 719 DYNQVNTHGAAVRSKKLEDSEKESKGPFQSYAGVINKKLWMRTQNRQTSHASVEHHGKNW 778
D N NT A R++KL+ R N T A +E G
Sbjct: 177 DANAANTENALERNRKLQ-----------------------RAWNGLTERAELESRGIPP 213
Query: 779 KRKSKAPVVSIKDEELVVCKDKCKQITHFVTPDSDISSILSQAYPQDSLHN---VCIMGL 835
K + K S KD + K + F+T + +++ +L++ + + S+ +C+ GL
Sbjct: 214 KCRDKQS--SPKD---LPSLQNYKTTSRFITTELNLNQLLAEQFTEFSISPSSPICLTGL 268
Query: 836 HTCGDLSGTALRLF-TKSSLQCLVQVGCCYHLLEEEFIRSPFWKDVDQSLYEHGYGFPLS 894
HTCG+L+ T L++F ++S Q L +GCCYHLL+E F + F+ + + GFPLS
Sbjct: 269 HTCGNLAATCLQVFHSQSQCQILCNIGCCYHLLKETFSQQEFFGNKSVMDLQTDCGFPLS 328
Query: 895 EHLRSRKFFLGRNVRMSGTQSPERVIDLKQTQTLPLFYRALLEKYLRSKITINDEEPKVV 954
+L++R+ +GRN RM QS ER ++ K + L+YRA LE L + N + V
Sbjct: 329 RYLQNRQVRMGRNARMLAAQSIERTLNAKDLPNISLYYRACLE-VLVCRHAQNLKNELQV 387
Query: 955 GRLATKCSNFVEYVHRAVDKLK---LDLEVDDEEVTRLFNSHQREYEYLQIYYFLKTALA 1011
G++ +F +Y+ + KL LD +V+ +E+ L + + YL+++Y L+ A
Sbjct: 388 GKVRN-FKDFSDYIQKCAKKLDAKWLD-DVNTDELKALLRDYSLDKHYLELFYLLRMTFA 445
Query: 1012 PVIEA 1016
P +E+
Sbjct: 446 PALES 450
Score = 50.8 bits (120), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 32/153 (20%), Positives = 75/153 (49%), Gaps = 15/153 (9%)
Query: 489 IDEMIEFITPNIACVNTHTVHFMVEKLWETTIPKAILSQVNNSPSFDSLLEEFWKSRQDN 548
+D ++ ++ P +N H V+++ +K W + +P+ + ++ + +EE + +R+
Sbjct: 13 LDGLLAYLQPYWDFLNCHMVNYLTDKHWISFVPEELRQELECEEDINLAIEELFWNRKQG 72
Query: 549 HVETNSELVKFFQAADKFRLTSLMNGESIITVDELFSKLQARKCG--QVVESAKVSQLMS 606
E +KF + D+ L + I T++EL CG + ++ + + MS
Sbjct: 73 ESERFPNWLKFLRRGDQEHLKHF--KDVITTLEEL--------CGGNESMQQLSIKEFMS 122
Query: 607 EKKSYEVQVMSQVVAAV---TNSCDSSHIIDLG 636
KK +EV++ + +V + T ++ +++D G
Sbjct: 123 AKKCHEVELTASLVDRLIQSTGEAENFYVVDAG 155
>gi|255957115|ref|XP_002569310.1| Pc21g23430 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211591021|emb|CAP97240.1| Pc21g23430 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 862
Score = 165 bits (418), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 118/405 (29%), Positives = 197/405 (48%), Gaps = 51/405 (12%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHL-GAAWIDKCRPDLLITESTYATTIRDSKRCRERDF 59
MF++ + ++TGDY+ DRHL AA + D+LITEST+ + + RE
Sbjct: 181 MFKIDIAGLVTLFTGDYSREEDRHLIPAAVPSGTKIDVLITESTFGISSNPPRLEREAAL 240
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWE-RMNLQA-PIYFAVGLTEKATNYY 117
+K + ++RGG+VL+PVFALGRAQEL ++L+ YWE LQ PIY+ + + Y
Sbjct: 241 MKSITGILNRGGRVLMPVFALGRAQELLLILDEYWETHPELQKFPIYYIGNMARRCMVVY 300
Query: 118 KMFITWTNQKIRKTFVQR--------------NMFDFKHIRPFDK-SFIDNPGPMVVFAT 162
+ +I N I++ F QR +DF+ +R D+ G V+ A+
Sbjct: 301 QTYIGAMNDNIKRLFRQRMAEAEASGNKSVSVGPWDFRFVRSLRSLERFDDVGGCVMLAS 360
Query: 163 PGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKL-------------- 208
PGML +G S + ++WAP + N ++M G+ V+GT+ +L+ ++
Sbjct: 361 PGMLQTGTSRELLERWAPSDRNGVVMTGYSVEGTMAKGLLNEPDQIPAVMSKVSTGHGRG 420
Query: 209 ------EFENKQIIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLK 262
+ + K +I + ++ +SF+AH D I+ V+LVHGE +M+ LK
Sbjct: 421 RVPGVNDEDQKVMIPRRCTVDEVSFAAHVDGVENRTFIEEVAAPVVILVHGEKHQMMRLK 480
Query: 263 EKI-----KQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLKEEAVKYNSEPPNPLKER 317
K+ + + + PAN + + K I V L E A PP+ +
Sbjct: 481 SKLLSLNADKAVKVKVYTPANCDEVRIPFK-KDKIAKVVGKLAEVA------PPSENDDS 533
Query: 318 QI-HGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDSS 361
Q+ GVLV ++SLM D+ + G++ + ++ +S +S
Sbjct: 534 QLMAGVLVQNGFNLSLMAPDDLQEYAGLTTTTITCKQHITLSSAS 578
>gi|378730429|gb|EHY56888.1| endoribonuclease ysh1 [Exophiala dermatitidis NIH/UT8656]
Length = 868
Score = 165 bits (418), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 119/402 (29%), Positives = 184/402 (45%), Gaps = 65/402 (16%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK----CRPDLLITESTYATTIRDSKRCRE 56
MF + + +I +T DY+ DRHL AA + + DLLITEST+ + + RE
Sbjct: 181 MFLINIAGLNIWFTADYSREQDRHLVAAEVPNKSTVGKIDLLITESTFGISNAPPRAERE 240
Query: 57 RDFLKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERM-NLQA-PIYFAVGLTEKAT 114
LK V ++RGGKVL+PVFALGRAQEL ++LE YW + LQ PIY+ K
Sbjct: 241 AGLLKAVTNILNRGGKVLMPVFALGRAQELLLILEDYWSKHPELQKYPIYYTGNTARKCM 300
Query: 115 NYYKMFITWTNQKIRKTFVQR---------------NMFDFKHIRPF-DKSFIDNPGPMV 158
Y+ +I N I++ F +R +DF+ +R + D+ G V
Sbjct: 301 VVYQTYINAMNDNIKRIFRERMAEAEAAGNAKGVSAGPWDFRFVRSLRNLDRFDDVGGCV 360
Query: 159 VFATPGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLS--------------- 203
+ A+PGML SG+S ++ ++WAP N +IM G+ V+GT+ +LS
Sbjct: 361 MLASPGMLQSGMSRVLLERWAPDPRNGVIMTGYNVEGTMARTILSEPEMIPAIMSGGNTG 420
Query: 204 -----GVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKM 258
G K + E +I + +E F+AH + I+ +V+LVHGE S+
Sbjct: 421 VGQQMGRKGRDDEGAVMIPRRCTVEEFQFAAHVTGVENVDFIEQVASPHVILVHGERSQA 480
Query: 259 VFLKEKI-----------KQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLKEEAVKYN 307
L+ ++ + P NG +I I + + + K
Sbjct: 481 ARLRSRLLDLNSRRLASNPHAHQTKVYSPENG--------AEIKIPYQKDKMAKVVGKLA 532
Query: 308 SEPP----NPLKERQIHGVLVIKDSSISLMDVDEACKEVGIS 345
PP +P R + GVLV +SLM ++ + G++
Sbjct: 533 QIPPPASTDPEDTRVVSGVLVQNGFKLSLMAPEDLREYAGLT 574
>gi|425780830|gb|EKV18826.1| Endoribonuclease ysh1 [Penicillium digitatum PHI26]
gi|425783067|gb|EKV20936.1| Endoribonuclease ysh1 [Penicillium digitatum Pd1]
Length = 862
Score = 165 bits (417), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 118/405 (29%), Positives = 197/405 (48%), Gaps = 51/405 (12%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHL-GAAWIDKCRPDLLITESTYATTIRDSKRCRERDF 59
MF++ + ++TGDY+ DRHL AA + D+LITEST+ + + RE
Sbjct: 181 MFKIDIAGLVTLFTGDYSREEDRHLIPAAVPSGTKIDVLITESTFGISSNPPRLEREAAL 240
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWE-RMNLQA-PIYFAVGLTEKATNYY 117
+K + ++RGG+VL+PVFALGRAQEL ++L+ YWE LQ PIY+ + + Y
Sbjct: 241 MKSITSILNRGGRVLMPVFALGRAQELLLILDEYWETHPELQKFPIYYIGNMARRCMVVY 300
Query: 118 KMFITWTNQKIRKTFVQR--------------NMFDFKHIRPFDK-SFIDNPGPMVVFAT 162
+ +I N I++ F QR +DF+ +R D+ G V+ A+
Sbjct: 301 QTYIGAMNDNIKRLFRQRMAEAEASGNKSVSVGPWDFRFVRSLRSLERFDDVGGCVMLAS 360
Query: 163 PGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKL-------------- 208
PGML +G S + ++WAP + N ++M G+ V+GT+ +L+ ++
Sbjct: 361 PGMLQTGTSRELLERWAPSDRNGVVMTGYSVEGTMAKGLLNEPDQIPAVMSKVSTGHGRG 420
Query: 209 ------EFENKQIIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLK 262
+ + K +I + ++ +SF+AH D I+ V+LVHGE +M+ LK
Sbjct: 421 RVPGVNDEDQKVMIPRRCTVDEVSFAAHVDGVENRTFIEEVAAPVVILVHGEKHQMMRLK 480
Query: 263 EKI-----KQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLKEEAVKYNSEPPNPLKER 317
K+ + + + PAN + + K I V L E A PP+ +
Sbjct: 481 SKLLSLNADKAVKVKVYTPANCDEVRIPFK-KDKIAKVVGKLAELA------PPSENDDA 533
Query: 318 QI-HGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDSS 361
Q+ GVLV ++SLM D+ + G++ + ++ +S +S
Sbjct: 534 QLMAGVLVQNGFNLSLMAPDDLHEYAGLTTTTITCKQHITLSSAS 578
>gi|402217247|gb|EJT97328.1| Metallo-hydrolase/oxidoreductase [Dacryopinax sp. DJM-731 SS1]
Length = 780
Score = 165 bits (417), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 106/360 (29%), Positives = 184/360 (51%), Gaps = 26/360 (7%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRP-DLLITESTYATTIRDSKRCRERDF 59
MF +++ + I+YTGDY+ RHL A + D+LI EST+ + +R +E F
Sbjct: 168 MFLIQLADLRILYTGDYSREESRHLVRAEVPPGAGIDVLIIESTFGVQSTEGRREKEERF 227
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYY 117
+H + RGG VL+PVFA+G AQEL ++L+ ++E+ + PIY+A L K Y
Sbjct: 228 TSLIHRILMRGGHVLMPVFAVGGAQELLLILDDFFEKHPELHKFPIYYASALARKCMAVY 287
Query: 118 KMFITWTNQKIRKTFVQ-RNMFDFKHIRPFDKSF-----IDNPGPMVVFATPGMLHSGLS 171
+ ++ N IR+ F +N F F+H+ +S I P V+ A+PGM+ SG S
Sbjct: 288 QGYVHVMNNNIRQRFANNQNPFVFRHVSHIPRSSGWEKKIGEGPPCVILASPGMMQSGAS 347
Query: 172 LIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHA 231
+ + WAP N +++ G+ V+G++ +++ ++ I ++ ++ +SFSAH
Sbjct: 348 RELLEMWAPDRRNGIVLTGYSVEGSMARNIMNEPDEINAMKGTPIPLRCTVDNISFSAHV 407
Query: 232 DAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF-----NLDCFMPANGESC-FVQ 285
D + I+ +V+LVHGE S+M LK ++ + ++ P N E+ +
Sbjct: 408 DYAQNREFIEAIGAPHVVLVHGEQSQMFRLKAALQAGYKERNEHITIHTPKNCETLELIF 467
Query: 286 TDMKISIDVSVNLLKEEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGIS 345
+++ ++ L E A P Q+ G+LV KD + +L+D + G+S
Sbjct: 468 RGERVA--KAIGTLAEHA---------PQAGAQLSGLLVSKDFTYTLLDPRDLRDFTGLS 516
>gi|341038970|gb|EGS23962.1| hypothetical protein CTHT_0006720 [Chaetomium thermophilum var.
thermophilum DSM 1495]
Length = 894
Score = 164 bits (415), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 98/300 (32%), Positives = 161/300 (53%), Gaps = 36/300 (12%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDF 59
MF +++ +I++TGDY+ DRHL +A + K + D+LITESTY + RE+
Sbjct: 179 MFLIEIAGLNILFTGDYSREQDRHLVSAQVPKGVKIDVLITESTYGIATHVPRLEREQAL 238
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYY 117
+K + ++RGG+VL+PVFALGRAQEL ++L+ YW + + PIY+A L K Y
Sbjct: 239 MKSITGILNRGGRVLMPVFALGRAQELLLILDEYWAKHKEYQKYPIYYASNLARKCMLVY 298
Query: 118 KMFITWTNQKIRKTFVQR--------------NMFDFKHIRPFDKSF--IDNPGPMVVFA 161
+ ++ N I++ F +R +DFK IR KS ++ G V+ A
Sbjct: 299 QTYVGAMNDNIKRLFRERMAEAEASGDSAGKGGPWDFKFIRSL-KSIDRFEDVGGCVMLA 357
Query: 162 TPGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEF----------- 210
+PGML +G+S + ++WAP E N +I+ G+ V+GT+ +++ ++++
Sbjct: 358 SPGMLQNGVSRELLERWAPNEKNGVIITGYSVEGTMAKQLMQEPEQIQAVMTRSSAGGRR 417
Query: 211 -----ENKQIIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKI 265
K +I + ++ SF+AH D + I+ V+LVHGE M+ LK K+
Sbjct: 418 GPGGESEKVMIPRRCTVQEFSFAAHVDGNENREFIEEVGAPVVILVHGEVHNMMRLKSKL 477
>gi|385305954|gb|EIF49896.1| mrna cleavage and polyadenylation specificity factor complex
subunit ysh1 [Dekkera bruxellensis AWRI1499]
Length = 295
Score = 164 bits (414), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 85/208 (40%), Positives = 125/208 (60%), Gaps = 5/208 (2%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V++G ++TGDY+ DRHL +A + PDLLI EST+ T + RE
Sbjct: 79 MFLVEMGGLKFLFTGDYSREEDRHLSSAEVPDVTPDLLIVESTFGTATHVPRLERENKLT 138
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWER-MNLQ-APIYFAVGLTEKATNYYK 118
+H + +GG+ L+PVFALGRAQE+ ++L+ YW+R +LQ PIY+A L +K Y+
Sbjct: 139 TVIHSTLQQGGRCLLPVFALGRAQEILLILDEYWQRHKDLQNVPIYYASSLAKKCMAVYE 198
Query: 119 MFITWTNQKIRKTFVQ--RNMFDFKHIRPFDKS-FIDNPGPMVVFATPGMLHSGLSLIIF 175
+I N IRK F + N F FK+I+ + ID+ P V+ A+PGML +G+S +
Sbjct: 199 RYINMMNDSIRKKFTETNENPFHFKYIKNVAHADRIDDLNPCVMIASPGMLQNGVSRQLL 258
Query: 176 KKWAPVENNMLIMPGFCVQGTIGHKVLS 203
+KW P N +IM G+ V GT+ K+L+
Sbjct: 259 EKWCPDPRNTVIMTGYSVDGTMAKKLLT 286
>gi|145478255|ref|XP_001425150.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124392218|emb|CAK57752.1| unnamed protein product [Paramecium tetraurelia]
Length = 690
Score = 163 bits (413), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 97/273 (35%), Positives = 146/273 (53%), Gaps = 16/273 (5%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V++ ++YTGDY+ +RHL A + + +LI E+TY T +++ RE +FL
Sbjct: 163 MFMVEIDGVRVLYTGDYSTEKERHLRPAQLPLEKIHVLIVEATYGDTQHETRTKREENFL 222
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWER--MNLQAPIYFAVGLTEKATNYYK 118
K++ ++ GG VL+PVFA GR EL I+L+ YW + Q PIY L K T+ ++
Sbjct: 223 KEIVSTLNGGGNVLLPVFATGRCHELLIILDEYWSKNPQVQQFPIYSTCTLAIKCTHIFQ 282
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHI--RPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFK 176
K K N+F F HI + + ++N P VV A+PG+L SG S I++
Sbjct: 283 KHFNKLGNKYHKG---ENLFKFNHINTKKHLQDILNNQKPKVVMASPGLLQSGHSKQIYE 339
Query: 177 KWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGI 236
W E N +I+ G VQGTI H+++ + D+K+ +SFSAHAD
Sbjct: 340 YWCKDEKNQVIITGPAVQGTIAHQLIHNPEP---------DIKIRPAQISFSAHADYLQT 390
Query: 237 MQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF 269
I P++V+LVHG K L++KI+ F
Sbjct: 391 SSFIDSLRPQHVILVHGTQHKCRDLQKKIEINF 423
>gi|449296201|gb|EMC92221.1| hypothetical protein BAUCODRAFT_569527 [Baudoinia compniacensis
UAMH 10762]
Length = 834
Score = 162 bits (411), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 117/392 (29%), Positives = 188/392 (47%), Gaps = 48/392 (12%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDF 59
M+ +++ +I++TGDY+ +RHL A I + D LITEST+ + + RE
Sbjct: 187 MYLIEIAGLNILFTGDYSREDNRHLMPASIPRHVNVDCLITESTFGISTHVPRAERETAL 246
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYY 117
++ + ++RGG+ L+P FALG AQEL ++LE YW R + PIYFA L K Y
Sbjct: 247 MRSITGILNRGGRALLPTFALGGAQELLLILEDYWARHPEYQRFPIYFASSLARKCMVVY 306
Query: 118 KMFITWTNQKIRKTF--VQRN------MFDFKHIRPFDKSF--IDNPGPMVVFATPGMLH 167
+ +I N+ IR F Q N +DF+HIR KS D+ G V+ A+PGML
Sbjct: 307 QTYIDAMNENIRTKFQAAQANPDGVGGPWDFQHIRSL-KSLERFDDVGGCVMLASPGMLQ 365
Query: 168 SGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLE--FENKQ----------- 214
+G+S + ++WAP N +I+ G+ V+GT+ ++ + N+Q
Sbjct: 366 NGVSRSLLERWAPDAKNGVIITGYSVEGTMAKSIMLEPDSIPAVMTNRQPQTVILGKRTG 425
Query: 215 -----IIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF 269
+I + ++ SF+AH D + I+ V+LVHGE M LK ++
Sbjct: 426 ADENVMIPRRCTVQEFSFAAHVDGQENRTFIEEVNAPVVILVHGEKHNMNRLKSRLLSLQ 485
Query: 270 NLDCFMPANGESCFVQTDMKISIDVSVNLLKEEAVKYNSEPPNP-----------LKERQ 318
+ + PAN E + V L E + ++ P+P K+R+
Sbjct: 486 KMKVYSPANCEELHIPFRQDKYARVVGRLANEISPPTAAQQPSPPASDEEGNESEPKKRK 545
Query: 319 -----IHGVLVIKDSSISLMDVDEACKEVGIS 345
+ GVLV D +SLM ++ + G++
Sbjct: 546 DEGQLVSGVLVQNDFKLSLMAPEDLKEYAGLT 577
>gi|320593246|gb|EFX05655.1| cleavage and polyadenylation specificity factor subunit [Grosmannia
clavigera kw1407]
Length = 857
Score = 162 bits (410), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 114/397 (28%), Positives = 197/397 (49%), Gaps = 60/397 (15%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDF 59
MF +++ I++TGDY+ DRHL +A + K + D+LITESTY + RE+
Sbjct: 179 MFLIEIAGLKIMFTGDYSRELDRHLVSATVPKGVKVDVLITESTYGIASHVPRLEREQAL 238
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYY 117
+K + ++RGG+VL+PVFALGRAQEL ++L+ YW + + PIY+A L +K Y
Sbjct: 239 MKSITGILNRGGRVLMPVFALGRAQELLLILDEYWSKHSDFQNYPIYYASNLAKKCMVVY 298
Query: 118 KMFITWTNQKIRKTFVQR--------------NMFDFKHIRPFDK-SFIDNPGPMVVFAT 162
+ + N I++ + +R +DF+ IR ++ G V+ A+
Sbjct: 299 QTYTGAMNDNIKRLYAERAKEAEATGNSAGGGGPWDFRFIRSLKNLDRFEDIGGCVMLAS 358
Query: 163 PGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVL---------------SGVKK 207
PGML +G+S + ++WAP + N +I+ G+ V+GT+ +++ G ++
Sbjct: 359 PGMLQNGVSRELLERWAPSDKNGVIITGYSVEGTMAKQIMQEPDHIQAVMSRSLAGGGRR 418
Query: 208 LEFEN--KQIIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKI 265
+ K +I + +++ SF+AH D + I+ V+LVHGE M+ LK K+
Sbjct: 419 APGADSEKVMIPRRCSVQEFSFAAHVDGTENREFIEEVNAPAVILVHGEQHNMMRLKSKL 478
Query: 266 -------KQEFNLDCFMPANGESCFV--QTDMKI--------SIDVSVNLLKEEAVKYNS 308
+ + + P N E + +TD KI S+ + LL+++ +
Sbjct: 479 LSLNAGKVDKDKMKVYSPRNCEELRIPFKTD-KIAKVVGKLASVTLPTQLLQDQQEHQTA 537
Query: 309 EPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGIS 345
P + GVLV D +SLM ++ + G++
Sbjct: 538 GP-------ILTGVLVQNDFKLSLMAPEDLHEYAGLT 567
>gi|221059687|ref|XP_002260489.1| cleavage and polyadenylation specifity protein [Plasmodium knowlesi
strain H]
gi|193810562|emb|CAQ41756.1| cleavage and polyadenylation specifity protein,putative [Plasmodium
knowlesi strain H]
Length = 938
Score = 162 bits (410), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 104/292 (35%), Positives = 162/292 (55%), Gaps = 23/292 (7%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V++ N +YTGDY+ DRH+ A I +LI E TY + D ++ RE FL
Sbjct: 217 MFLVEINNIRFLYTGDYSREVDRHIPIAEIPTIDVHVLICEGTYGIRVHDDRKKREVRFL 276
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWER-MNLQA-PIYFAVGLTEKATNYYK 118
+ ++ GKVL+PVFALGRAQEL +++E +WE+ LQ PI++ + K+ Y+
Sbjct: 277 NMITSILNNKGKVLLPVFALGRAQELLLIMEEHWEKNTQLQKIPIFYISSMATKSLCIYE 336
Query: 119 MFITWTNQKIRKTFVQ-RNMFDFKHIRPFDKS--------FIDNPGPMVVFATPGMLHSG 169
FI +R + +N F+FK+++ + KS + DN P VV A+PGML +G
Sbjct: 337 TFINLCGDFVRHVLNEGKNPFNFKYVK-YAKSLDSILNYLYQDN-NPCVVMASPGMLQNG 394
Query: 170 LSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFE----NKQIIDVKMAIEYM 225
+S IF AP + + +I+ G+ V+GT+ H++ K E E N + + + E +
Sbjct: 395 ISKNIFNIIAPDKKSGVILTGYTVKGTLAHEL-----KTEPEYVLINDKPVKRRCRFEEI 449
Query: 226 SFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFN-LDCFMP 276
SFSAH+D I+ + NV+LVHG+ +++ LK K+ +E L F P
Sbjct: 450 SFSAHSDFNQTKTFIEKLKCPNVVLVHGDRNELNRLKNKLTEEKKYLSVFTP 501
>gi|452985743|gb|EME85499.1| hypothetical protein MYCFIDRAFT_130659 [Pseudocercospora fijiensis
CIRAD86]
Length = 844
Score = 162 bits (409), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 115/425 (27%), Positives = 202/425 (47%), Gaps = 49/425 (11%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDF 59
M+ +++ +I++TGDY+ DRHL A + + + D LITEST+ + R ++ RE
Sbjct: 189 MYLIEIAGLNILFTGDYSRETDRHLIPATVPRNVKVDCLITESTFGISTRTPRQERENAL 248
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYY 117
+K + ++RGG+VL+P A+G QEL ++LE YW+R + PIY+A GL K Y
Sbjct: 249 IKSITTILNRGGRVLMPTTAVGNTQELLLILEDYWQRHEEYRKFPIYYASGLARKVMVVY 308
Query: 118 KMFITWTNQKIRKTF----VQRNMFDFKHIRPFDKSFI---------DNPGPMVVFATPG 164
+ ++ N I+ F V +++ + P+D F+ ++ G VV A+PG
Sbjct: 309 QTYVDDMNDTIKAKFQASAVGQSVGEGGTAGPWDFQFVRALKGIDRFEDVGGSVVLASPG 368
Query: 165 MLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVL------------------SGVK 206
ML +G S + ++WAP N +++ G+ V+GT+ +L S K
Sbjct: 369 MLQNGPSRALLERWAPEAKNGVVITGYSVEGTMAKTILMEPDEIPAVTQNRSANIPSMGK 428
Query: 207 KLEFEN-KQIIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKI 265
+ E KQ+I + ++ +F+ H D + + I+ V+LVHGE M LK ++
Sbjct: 429 RGGVEGEKQMIKRRCTVQEFNFAVHVDGQENREFIEEVGAPVVILVHGEKHNMNRLKSRL 488
Query: 266 KQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLKEEAVKYNSEPPNPLKERQ------I 319
+ + PAN E + + V L + +PP L + +
Sbjct: 489 LGIGRMKVYSPANCEEVRIPFRQEKVARVVGRLASQ------IQPPTLLPKHDSSLGQLV 542
Query: 320 HGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDSSP--INKTLELIYDQLLSY 377
GVLV D +SLM ++ + G++ + ++ + +SP I LE + + +
Sbjct: 543 SGVLVQNDFKLSLMAPEDLKEYAGLATTTLVCRQHITLGSASPDLIKWALEGTFGSITTI 602
Query: 378 LQDKS 382
D +
Sbjct: 603 TPDSN 607
>gi|452845681|gb|EME47614.1| hypothetical protein DOTSEDRAFT_146416 [Dothistroma septosporum
NZE10]
Length = 839
Score = 161 bits (408), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 94/310 (30%), Positives = 158/310 (50%), Gaps = 30/310 (9%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWID-KCRPDLLITESTYATTIRDSKRCRERDF 59
M+ +++ +I++TGDY+ DRHL A I + D LITEST+ + R+ ++ RE
Sbjct: 190 MYHIEIAGLNILFTGDYSREIDRHLIPATIPPNVKIDCLITESTFGISTREPRQERENQL 249
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYY 117
+K V ++RGG+VL+P A+G QEL ++LE YW+R + PIY+A GL K Y
Sbjct: 250 MKSVTNILNRGGRVLMPTTAVGNTQELLLILEDYWQRHEEYRRFPIYYASGLARKVMVVY 309
Query: 118 KMFITWTNQKIRKTF----------VQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGML 166
+ ++ N +I+ F +DF+ +R ++ G VV A+PGML
Sbjct: 310 QTYVDNMNDRIKAKFQASAAAAGDGGAAGPWDFQFVRALKGVDRFEDVGGSVVLASPGML 369
Query: 167 HSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKV----------------LSGVKKLEF 210
+G S + ++WAP N +++ G+ V+GT+ ++ +S K+
Sbjct: 370 QNGPSRALLERWAPDPKNGVVITGYSVEGTMAKQIMLEPDSIAAVTNRTANMSLGKRAGA 429
Query: 211 ENKQIIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFN 270
E +Q+I + ++ +F+ H D + I+ V+LVHGE M LK ++
Sbjct: 430 EEQQMIPRRCTVQEFNFAVHVDGQENRDFIEETGAPVVILVHGEKHNMGRLKSRLLSLKK 489
Query: 271 LDCFMPANGE 280
+ + PAN E
Sbjct: 490 MKVYSPANCE 499
>gi|300176709|emb|CBK24374.2| unnamed protein product [Blastocystis hominis]
Length = 711
Score = 161 bits (408), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 104/306 (33%), Positives = 162/306 (52%), Gaps = 34/306 (11%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V++ ++YTGD++ DRHL AA I R +++I ESTY +S+R RE
Sbjct: 137 MFCVEIDRIRVLYTGDFSREEDRHLPAAEIPPLRLNVVICESTYGRISHESRREREDRLT 196
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWER---MNLQAPIYFAVGLTEKATNYY 117
K+V + + + GKVLIPVFALGR QEL +LL YWE+ ++ I + L +K+
Sbjct: 197 KEVVDTLRKRGKVLIPVFALGRTQELLLLLNEYWEKHPEISRHGNIVYLNPLAKKS---- 252
Query: 118 KMFITWTNQKIRKTFVQRNMFDFKHI----------------RPFDKSFIDNPGPMVVFA 161
M + ++ + ++ N FD ++ R F + F P P V+
Sbjct: 253 -MMLFQAHKHTILSRLEENPFDLPNVDIPDPIDSVSPFFSPHRQFLRDFNKLP-PCVILC 310
Query: 162 TPGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQI--IDVK 219
+P M+ +G S + ++ A ENN++I+ G+C+ TI ++ + + KQ+ IDVK
Sbjct: 311 SPAMMENGTSRDVLERMASGENNLVILTGYCMARTIARRLQDNDRIIPLSGKQMETIDVK 370
Query: 220 MAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKM----VFLKEKIKQEFNLDC-- 273
+ +SFSAH+D G IQ +P+ V+LVHGE S+M LKEK++ N D
Sbjct: 371 CKVVTISFSAHSDFPGTSSFIQATKPERVVLVHGEVSRMNDLVSGLKEKLEDLVNRDLIS 430
Query: 274 -FMPAN 278
+ P N
Sbjct: 431 FYTPEN 436
>gi|350638481|gb|EHA26837.1| hypothetical protein ASPNIDRAFT_35736 [Aspergillus niger ATCC 1015]
Length = 915
Score = 161 bits (408), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 121/405 (29%), Positives = 199/405 (49%), Gaps = 60/405 (14%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDF 59
MF + + +I++TGDY+ DRHL A + K + D+LITEST+ + + RE
Sbjct: 227 MFLISIAGLNILFTGDYSREEDRHLIPAEVPKGVKIDVLITESTFGISSNPPRLEREAAL 286
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWE-RMNLQA-PIYFAVGLTEKATNYY 117
+K + ++RGG+VL+PVFALGRAQEL ++L+ YWE LQ PIY+ +G T +
Sbjct: 287 MKAITGVLNRGGRVLMPVFALGRAQELLLILDEYWETHPELQKIPIYY-IGNTARRCAM- 344
Query: 118 KMFITWTNQKIRKTFVQR--------------NMFDFKHIRPFDK-SFIDNPGPMVVFAT 162
N I++ F QR +DF+ +R D+ G V+ A+
Sbjct: 345 -------NDNIKRLFRQRMAEAEASGDKSVSAGPWDFRFVRSLRSLERFDDVGGCVMLAS 397
Query: 163 PGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKL-------------- 208
PGML +G S + ++WAP E N ++M G+ V+GT+ ++L+ +++
Sbjct: 398 PGMLQTGTSRELLERWAPNERNGVVMTGYSVEGTMAKQILNEPEQIPAVMSRATTGLVRR 457
Query: 209 ------EFENKQIIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLK 262
E E K +I + ++ +SF+AH D I+ V+LVHGE +M+ LK
Sbjct: 458 GMAAGNEEEQKVMIPRRCTVDEVSFAAHVDGVENRNFIEEVSAPVVILVHGEKHQMMRLK 517
Query: 263 EKI-----KQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLKEEAVKYNSEPPNPLKER 317
K+ ++ + + P N E + K I V L + A PP+ +
Sbjct: 518 SKLLSLNAEKTVKVKVYTPGNCEEVRIPFK-KDKIAKVVGKLAQIA------PPSEQDDG 570
Query: 318 QI-HGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDSS 361
Q+ GVLV ++SLM D+ + G++ + ++ +S +S
Sbjct: 571 QLMSGVLVQNGFNLSLMAPDDLREYAGLTTTAITCKQHITLSSAS 615
>gi|358060736|dbj|GAA93507.1| hypothetical protein E5Q_00148 [Mixia osmundae IAM 14324]
Length = 1378
Score = 161 bits (408), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 104/375 (27%), Positives = 185/375 (49%), Gaps = 30/375 (8%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAA----WIDKCRPDLLITESTYATTIRDSKRCRE 56
MF +++G ++YTGDY+ D HL A W K RPD++I EST+ + K RE
Sbjct: 203 MFLIEIGGARVLYTGDYSTEADMHLIPASVPTWGGK-RPDVMICESTFGVQSFEPKAIRE 261
Query: 57 RDFLKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKAT 114
F K+ + RGGKVL+P F+ G +QEL ++L+ +WE+ + PIY+ L +
Sbjct: 262 AQFTNKIKTILKRGGKVLLPAFSSGVSQELLLVLDDFWEKNPDLHEFPIYYVTSLASRVL 321
Query: 115 NYYKMFITWTNQKIRKTFVQR-NMFDF------KHIRPFDKSFIDNPGPMVVFATPGMLH 167
Y+ I+ +QKI++ N +DF K +R + D P VV ATPGML
Sbjct: 322 KVYRQHISSQSQKIQQRAASGDNPYDFGKGRFVKELRSIRRGVADK-SPCVVVATPGMLQ 380
Query: 168 SGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSF 227
G S + ++WA N LI+ G+ V+G++ + + + + I +++++ +SF
Sbjct: 381 PGTSRELLERWAGDRRNGLILCGYSVEGSLARDLQAEPDEFMSVEGRRIPRRISVDVISF 440
Query: 228 SAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF-----NLDCFMPANGESC 282
+ H D I P N++LVHGEA + L+ +++ + + + P N +
Sbjct: 441 AQHVDFSQNASFIDAVRPANLILVHGEARNVTTLQAALQKIYADKKDEMKVYAPRNLQPI 500
Query: 283 FVQTDMKISIDVSVNLLKEEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEV 342
++ + + +V L +E + + + G+L +KD+ +L+D + +
Sbjct: 501 KIRI-ARNQVARAVGGLAKEHLPDGAP---------VGGLLFVKDAQYTLLDPKDLKEFT 550
Query: 343 GISRHIVRFTSNVQI 357
G+S + +Q+
Sbjct: 551 GLSTSTIAQKQRIQL 565
>gi|302412663|ref|XP_003004164.1| endoribonuclease YSH1 [Verticillium albo-atrum VaMs.102]
gi|261356740|gb|EEY19168.1| endoribonuclease YSH1 [Verticillium albo-atrum VaMs.102]
Length = 730
Score = 161 bits (407), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 116/403 (28%), Positives = 195/403 (48%), Gaps = 52/403 (12%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDF 59
MF +++ I +TGDY+ DRHL +A + K + D+LITESTY + RE+
Sbjct: 73 MFLIEIAGLKIFFTGDYSREQDRHLVSAEVPKGVKIDVLITESTYGIASHVPRVEREQAL 132
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERM-NLQA-PIYFAVGLTEKATNYY 117
+K + ++RGG+VL+PVFALGRAQEL ++L+ YW + + Q PIY+A L K Y
Sbjct: 133 MKSITSILNRGGRVLMPVFALGRAQELLLILDEYWGKHPDFQKYPIYYASNLARKCMVVY 192
Query: 118 KMFITWTNQKIRKTFVQ--------------RNMFDFKHIRPFDK-SFIDNPGPMVVFAT 162
+ ++ N I++ F + +DF +IR D+ G V+ A+
Sbjct: 193 QTYVGAMNDNIKRLFREGMAQAEASGDGSGKGGPWDFNYIRSLKNLDRFDDLGGCVMLAS 252
Query: 163 PGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEF------------ 210
PGML +G+S + ++WAP + N +I+ G+ V+GT+ +++ +++
Sbjct: 253 PGMLQNGVSRELLERWAPNDKNGVIITGYSVEGTMAKQIMQEPDQIQAVMSRTMANARRG 312
Query: 211 ----ENKQIIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKI- 265
K +I + +++ SF+AH D + I+ V+LVHGE M+ LK K+
Sbjct: 313 PGGDGEKFLIPRRCSVQEYSFAAHVDGTENREFIEEVAAPVVILVHGEQHNMMRLKSKLL 372
Query: 266 ----KQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLKEEAVKYNSEPP--NPLKERQ- 318
+ F P N E + + V L +PP + L++ Q
Sbjct: 373 SLNANKSIKAKIFSPKNCEELRIPFKQDKTAKVVGKLA-------TIQPPTKSTLEDGQN 425
Query: 319 ---IHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQIS 358
I GVLV D +SLM ++ + G++ + +++S
Sbjct: 426 APLITGVLVQNDFKLSLMAPEDLREYAGLNTTTITCKQRLKLS 468
>gi|389585470|dbj|GAB68201.1| cleavage and polyadenylation specifity factor protein, partial
[Plasmodium cynomolgi strain B]
Length = 936
Score = 160 bits (406), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 104/292 (35%), Positives = 161/292 (55%), Gaps = 23/292 (7%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V++ N +YTGDY+ DRH+ A I +LI E TY + D ++ RE FL
Sbjct: 217 MFLVEINNIRFLYTGDYSREVDRHIPIAEIPTIDVHVLICEGTYGIKVHDDRKKREVRFL 276
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWER-MNLQA-PIYFAVGLTEKATNYYK 118
+ ++ GKVL+PVFALGRAQEL +++E +WER LQ PI++ + K+ Y+
Sbjct: 277 NMITSILNNKGKVLLPVFALGRAQELLLIMEEHWERNTQLQKIPIFYISSMATKSLCIYE 336
Query: 119 MFITWTNQKIRKTFVQ-RNMFDFKHIRPFDKS--------FIDNPGPMVVFATPGMLHSG 169
FI +R + +N F+FK ++ + KS + DN P VV A+PGML +G
Sbjct: 337 TFINLCGDFVRHVLNEGKNPFNFKFVK-YAKSLDSILNYLYQDN-NPCVVMASPGMLQNG 394
Query: 170 LSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFE----NKQIIDVKMAIEYM 225
+S IF AP + + +I+ G+ V+GT+ ++ K E E N +++ + E +
Sbjct: 395 ISKNIFNIIAPDKKSGVILTGYTVKGTLADEL-----KTEPEYVLINDKLVKRRCRFEEI 449
Query: 226 SFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFN-LDCFMP 276
SFSAH+D I+ + NV+LVHG+ +++ LK K+ +E L F P
Sbjct: 450 SFSAHSDFNQTKTFIEKLKCPNVVLVHGDRNELNRLKNKLTEEKKYLSVFTP 501
>gi|71654879|ref|XP_816051.1| cleavage and polyadenylation specificity factor [Trypanosoma cruzi
strain CL Brener]
gi|70881152|gb|EAN94200.1| cleavage and polyadenylation specificity factor, putative
[Trypanosoma cruzi]
Length = 430
Score = 160 bits (404), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 89/212 (41%), Positives = 126/212 (59%), Gaps = 6/212 (2%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
+F V + +YTGD++ PDRHL A + PD+LI EST +S+ RE F
Sbjct: 188 LFMVDIAGMKTLYTGDFSRVPDRHLLGAEVPSYSPDILIAESTNGIRELESREERETLFT 247
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWE-RMNLQ-APIYFAVGLTEKATNYYK 118
VH+ V GG+ L+PVFALGRAQEL ++LE YWE LQ PIY+A L ++ Y+
Sbjct: 248 TWVHDVVKGGGRCLVPVFALGRAQELLLILEEYWEAHKELQHIPIYYASSLAQRCMKLYQ 307
Query: 119 MFITWTNQKIRKTFV-QRNMFDFKHIRPF--DKSFIDNPGPMVVFATPGMLHSGLSLIIF 175
F++ N ++++ RN F FK+I +SF D GP VV A+PGML SG+SL +F
Sbjct: 308 TFVSAMNDRVKQQHANHRNPFVFKYIHSLMETRSFEDT-GPCVVLASPGMLQSGISLELF 366
Query: 176 KKWAPVENNMLIMPGFCVQGTIGHKVLSGVKK 207
++W N +I+ G+CV GTI +L+ K+
Sbjct: 367 ERWCGDRRNGIIIAGYCVDGTIAKDILTNRKE 398
>gi|82596170|ref|XP_726151.1| cleavage and polyadenylation specificity factor, 73 kDa subunit
[Plasmodium yoelii yoelii 17XNL]
gi|23481438|gb|EAA17716.1| cleavage and polyadenylation specificity factor, 73 kDa subunit
[Plasmodium yoelii yoelii]
Length = 942
Score = 160 bits (404), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 98/279 (35%), Positives = 162/279 (58%), Gaps = 14/279 (5%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V++ N ++YTGDY+ DRH+ A I +LI E TY + D++R RE FL
Sbjct: 210 MFLVEINNIRLLYTGDYSREIDRHIPIAEIPNIDVHVLICEGTYGIKVHDNRRKREAIFL 269
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERM-NLQA-PIYFAVGLTEKATNYYK 118
+ ++ GKVL+PVFALGRAQE+ ++LE +W+R NLQ PI++ + K+ Y+
Sbjct: 270 NMLTNILNSKGKVLLPVFALGRAQEILLILEEHWDRNPNLQKIPIFYISSMATKSLCIYE 329
Query: 119 MFITWTNQKIRKTFVQ-RNMFDFKHIRPFDKS--------FIDNPGPMVVFATPGMLHSG 169
+I + I+K + +N F+FK+++ + KS + DN P V+ A+PGML SG
Sbjct: 330 TYINLCSDFIKKIVNEGKNPFNFKYVK-YAKSLDSILNYLYQDNY-PCVIMASPGMLQSG 387
Query: 170 LSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSA 229
+S IF A + + +I+ G+ ++GT+ ++ + + + N +++ K E +SFSA
Sbjct: 388 ISKSIFNIIASDKKSGVIITGYTIKGTLADELKTEPEFVTI-NDKLVKRKCQFEQISFSA 446
Query: 230 HADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQE 268
H+D I+ + NV+LVHG+ +++ LK K+ E
Sbjct: 447 HSDFNQTKTFIEKLKCPNVVLVHGDRNELNRLKNKLIDE 485
>gi|392338884|ref|XP_003753662.1| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation
specificity factor subunit 3-like [Rattus norvegicus]
gi|392345811|ref|XP_003749372.1| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation
specificity factor subunit 3-like [Rattus norvegicus]
Length = 1015
Score = 160 bits (404), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 120/402 (29%), Positives = 197/402 (49%), Gaps = 33/402 (8%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK---CRPDLLITESTYATTIRDSKRCRER 57
MF +++ I+YTGD++ RHL AA I + D+LI ES Y T I + E
Sbjct: 165 MFVIEIAGVKILYTGDFSRQEVRHLMAAEIPXYLYIKSDILIVESMYGTHIYEKHEEXEA 224
Query: 58 DFLKKVHECVDRGGKVLIPVFALGRAQELCILLETYWE-RMNLQ-APIYFAVGLTEKATN 115
F VH V++ G+ LIPV LGRAQEL +L+ YW+ LQ PIY+ + L +K
Sbjct: 225 RFCNAVHNIVNKXGRGLIPVLVLGRAQELLPILDEYWQDHPELQDIPIYYTLPLXKKCMA 284
Query: 116 YYKMFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLII 174
Y+ + N KI K N F F+HI D+ G VV +PGM+ +G+S
Sbjct: 285 VYQTYANDMNGKIWKQININNPFVFEHISNLKSMDHFDDIGLSVVMXSPGMIQNGVSRER 344
Query: 175 FKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAK 234
F++ + N +I+ G+C +G + ++S +++ Q + +KM+++Y+SFSA D
Sbjct: 345 FERXWCHKRNGVIVTGYCAEGILAKHIMSEPEEITTMPGQKLPLKMSVDYISFSARTDYW 404
Query: 235 GIMQLIQYCEPKNVLLVHGEASKMVFLKEKI------KQEFNLDCFMPANGESCFVQTDM 288
+ I +P +V+ VHGE +++ LK + E ++ P N E+
Sbjct: 405 QTSEFIHALKPSHVISVHGEQNEIDRLKAALIKKYEDNDEVHIKVHSPWNTET------- 457
Query: 289 KISIDVSVNLLKEEAVKYN--SEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGISR 346
V++N + E+ K S P + +Q+ G+L ++ + D +S
Sbjct: 458 -----VALNFMGEKLAKVMGVSADKKPEQGQQVSGILAKRNFN------DHILFPCDLSN 506
Query: 347 HIVRFTSNVQISDSSPINKTLELIYDQLLSYLQDKSQEYKIQ 388
+I S V+ + + P K+ L+Y QL + D +E +IQ
Sbjct: 507 YIDLAMSTVRQTQAIPYTKSFYLLYYQLQKLMGD-VEELEIQ 547
>gi|260810354|ref|XP_002599929.1| hypothetical protein BRAFLDRAFT_74053 [Branchiostoma floridae]
gi|229285213|gb|EEN55941.1| hypothetical protein BRAFLDRAFT_74053 [Branchiostoma floridae]
Length = 627
Score = 159 bits (403), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 161/616 (26%), Positives = 266/616 (43%), Gaps = 98/616 (15%)
Query: 479 KETMELIKNTIDEMIEFITPNIACVNTHTVHFMVEKLWETTIPKAILSQVNN--SPSFDS 536
+++ +++ ++ F+ +N H V F E W+ +PK + ++ + P S
Sbjct: 6 EDSHHILQTRLENQWSFLQSKSDLINAHLVDFFTEDTWDRLVPKDLQDELLSLSDPDLSS 65
Query: 537 LLEEFWKSRQDNHVETNSELVKFFQAADKFRLTSLMNGESIITVDELFSKLQARKCGQVV 596
L R+ S+ + ++ + T L+N + + +K CG+
Sbjct: 66 LPSSTTLQRK------RSQATETEDDKEEHKHTLLLNESTDLDSSIGNAKATENFCGKCE 119
Query: 597 ESAKVSQLMSEKKSYEVQVMSQVVAAVTNSCDSSHIIDLGSTITVDELFSKLQARKCGLV 656
S L +K + V+S NS + + ++ T+ VD +K + ++
Sbjct: 120 CEKSQSSLEEFRKEAQQFVLS-------NSGVCASLDEVWRTLGVD---TKDEDKR---- 165
Query: 657 VESAKVSQLMSEKKSYEVQVMSQVVAAVTNSCDSSHIIDLGGGQGYLS---------TIL 707
V + M KKS+EV+V+S VVA + C ++D+G G+GYLS T+L
Sbjct: 166 ---RFVKEHMCVKKSHEVEVLSGVVAKLCQVCHVDQVVDIGSGKGYLSECLYLQHGLTVL 222
Query: 708 ALQ------HGKKTLSLDYNQ-----VNTHGAAVRSKKLEDSE-------KESKGPFQSY 749
+ HG K S + VN + + + DSE K ++
Sbjct: 223 GMDASDSNTHGAKKRSKLLERQWTALVNRPITSANKECINDSEPIREAVTKNTESEEAGN 282
Query: 750 AGVINKKLWMRTQN---------RQTSHASVEHHGKNWKRKSKAP-------------VV 787
+++ TQ +QT + E + K+ S P +V
Sbjct: 283 DSCAKEQICDTTQQQVCKETTTQKQTRCSKSEDREVSSKQGSFTPTTAYINPDTDLADLV 342
Query: 788 SIKDEE------------LVVCKDKCKQITHFVTPDSDISSILSQAYPQDSLHNVCIMGL 835
+++ + L C D + D++SIL D ++ + GL
Sbjct: 343 GVRNSDETAADLRLLLTGLHTCGDLSPTMLRLFMTCPDLTSILFITAVTDRSPHLLLTGL 402
Query: 836 HTCGDLSGTALRLF-TKSSLQCLVQVGCCYHLLEEEFIRSPFWKDVDQSLYEHGYGFPLS 894
HTCGDLS T L LF T L VGCCY+LL E + P E GFPLS
Sbjct: 403 HTCGDLSPTMLHLFVTCPQATVLCSVGCCYNLLTE--VEKP-------GAMEEKCGFPLS 453
Query: 895 EHLRSRKFFLGRNVRMSGTQSPERVIDLKQTQTLPLFYRALLEKYLRSKITINDEEPKVV 954
+ L+++ LGR RM Q+ ER+ + + LFYRA+L+ +R K D K V
Sbjct: 454 QLLQTKSVHLGRTSRMLSCQAVERISTEGRLPAMSLFYRAVLQVIIRDKFGHIDNN-KTV 512
Query: 955 GRLATKCSNFVEYVHRAVDKLKLD-LEVDDEEVTRLFNSHQREYEYLQIYYFLKTALAPV 1013
G++A K +FV+YV RA+ +L+L+ + DEE+ F +Q + +L + L+ LAP
Sbjct: 513 GKVAAKSKDFVDYVRRALRRLQLEHAHLSDEEINSYFEQYQGKLRHLHAFIQLRACLAPC 572
Query: 1014 IEALIVLDRVLYLREQ 1029
IEA I+LDR+ +L EQ
Sbjct: 573 IEAFILLDRLCFLHEQ 588
Score = 47.0 bits (110), Expect = 0.056, Method: Compositional matrix adjust.
Identities = 20/50 (40%), Positives = 31/50 (62%)
Query: 1045 QVDDEEVTRLFDSHQREYEYLQIYYFLKTALAPVIEALIVLDRVLYLREQ 1094
+ DEE+ F+ +Q + +L + L+ LAP IEA I+LDR+ +L EQ
Sbjct: 539 HLSDEEINSYFEQYQGKLRHLHAFIQLRACLAPCIEAFILLDRLCFLHEQ 588
>gi|170062356|ref|XP_001866633.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167880275|gb|EDS43658.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 459
Score = 159 bits (403), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 133/518 (25%), Positives = 225/518 (43%), Gaps = 105/518 (20%)
Query: 503 VNTHTVHFMVEKLWETTIPKAILSQVNNSPSFDSLLEEFWKS-RQDNHVETNSELVKFFQ 561
+N H V ++ E W IP+ + ++ P E FW +D L +
Sbjct: 4 LNCHMVDYLTEGHWNRYIPEEVREELKGMPDLVGSKEVFWNQFNKDFDSSKFPRLTNHIE 63
Query: 562 AADKFRLTSLMNGESIITVDELFSKLQARKCGQVVESAKVSQLMSEKKSYEVQVMSQVVA 621
+ ++RL +L ++ +T ++L S + C + K+ +LMS KK +EV+V S VVA
Sbjct: 64 SMKRYRLDAL--PDACMTTEQLNSAFDS--CLKETR-LKMPELMSIKKCHEVEVASAVVA 118
Query: 622 AVTNSCDSSHIIDLGSTITVDELFSKLQARKCGLVVESAKVSQLMSEKKSYEVQVMSQVV 681
++ C +V + L
Sbjct: 119 SL-----------------------------CTVVATGTEGKSL---------------- 133
Query: 682 AAVTNSCDSSHIIDLGGGQGYLSTILALQHGKKTLSLDYNQVNTHGAAVRSKKLEDSEKE 741
D +ID G G+GYLS+ +AL+HG K L +D N+ NT A R +L
Sbjct: 134 -------DHVAVIDAGDGKGYLSSRIALEHGIKVLGIDCNEGNTASAERRRDQLR----- 181
Query: 742 SKGPFQSYAGVINKKLWMRTQNRQTSHASVEHHGKNWKRKSKAPVVSIKDEELVVCKDKC 801
KK+ + + + + S+ K + P+
Sbjct: 182 -------------KKVPVALKRSKLTEESI------LKDAPEDPLY-------------- 208
Query: 802 KQITHFVTPDSDISSILSQAYPQDSLHNVCIMGLHTCGDLSGTALRLFTKSS-LQCLVQV 860
K T + +++ ++ + +PQ + GLHTCG+L+ +LRLF ++ ++ L V
Sbjct: 209 KTTTKMIDFQTNLIELVREQFPQTQATRFTLAGLHTCGNLAPNSLRLFHQNRHIRALCNV 268
Query: 861 GCCYHLLEEEFIRSPFWKDVDQSLYEHGYGFPLSEHLRSRKFFLGRNVRMSGTQSPERVI 920
GCCYHLL E F F+ + D++ G+GFP+S + + F LGRN R +S ER
Sbjct: 269 GCCYHLLRERFCEDEFF-NRDKARDNAGFGFPMSAFMAGKGFALGRNARNLAAESIERSA 327
Query: 921 DLKQTQTLPLFYRALLEKYLRSKITINDEEPKVVGRLATKCSNFVEYVHRAVDKLKLDLE 980
++ + L YRA+L+ L+ D + VG+L KC FV+YV +++ +L +D
Sbjct: 328 FHRENPSDKLGYRAMLQIVLKELDVRVDRQ---VGKL--KCDGFVDYVRKSLKRLGVDGG 382
Query: 981 --VDDEEVTRLFNSHQREYEYLQIYYFLKTALAPVIEA 1016
V + + L ++ E E L+++Y + APV+E
Sbjct: 383 ELVTEAYLEGLERRYEVELEQLKVFYLYRMTYAPVVEG 420
Score = 156 bits (395), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 112/369 (30%), Positives = 182/369 (49%), Gaps = 56/369 (15%)
Query: 661 KVSQLMSEKKSYEVQVMSQVVAAVTN---------SCDSSHIIDLGGGQGYLSTILALQH 711
K+ +LMS KK +EV+V S VVA++ S D +ID G G+GYLS+ +AL+H
Sbjct: 97 KMPELMSIKKCHEVEVASAVVASLCTVVATGTEGKSLDHVAVIDAGDGKGYLSSRIALEH 156
Query: 712 GKKTLSLDYNQVNTHGAAVRSKKLEDSEKESKGPFQSYAGVINKKLWMRTQNRQTSHASV 771
G K L +D N+ NT A R +L KK+ + + + + S+
Sbjct: 157 GIKVLGIDCNEGNTASAERRRDQLR------------------KKVPVALKRSKLTEESI 198
Query: 772 EHHGKNWKRKSKAPVVSIKDEELVVCKDKCKQITHFVTPDSDISSILSQAYPQDSLHNVC 831
K + P+ K T + +++ ++ + +PQ
Sbjct: 199 ------LKDAPEDPLY--------------KTTTKMIDFQTNLIELVREQFPQTQATRFT 238
Query: 832 IMGLHTCGDLSGTALRLFTKSS-LQCLVQVGCCYHLLEEEFIRSPFWKDVDQSLYEHGYG 890
+ GLHTCG+L+ +LRLF ++ ++ L VGCCYHLL E F F+ + D++ G+G
Sbjct: 239 LAGLHTCGNLAPNSLRLFHQNRHIRALCNVGCCYHLLRERFCEDEFF-NRDKARDNAGFG 297
Query: 891 FPLSEHLRSRKFFLGRNVRMSGTQSPERVIDLKQTQTLPLFYRALLEKYLRSKITINDEE 950
FP+S + + F LGRN R +S ER ++ + L YRA+L+ L+ D +
Sbjct: 298 FPMSAFMAGKGFALGRNARNLAAESIERSAFHRENPSDKLGYRAMLQIVLKELDVRVDRQ 357
Query: 951 PKVVGRLATKCSNFVEYVHRAVDKLKLDLE--VDDEEVTRLFNSHQREYEYLQIYYFLKT 1008
VG+L KC FV+YV +++ +L +D V + + L ++ E E L+++Y +
Sbjct: 358 ---VGKL--KCDGFVDYVRKSLKRLGVDGGELVTEAYLEGLERRYEVELEQLKVFYLYRM 412
Query: 1009 ALAPVIEAL 1017
APV+E L
Sbjct: 413 TYAPVVEGL 421
>gi|346972312|gb|EGY15764.1| endoribonuclease YSH1 [Verticillium dahliae VdLs.17]
Length = 837
Score = 159 bits (403), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 128/480 (26%), Positives = 223/480 (46%), Gaps = 67/480 (13%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDF 59
MF +++ I +TGDY+ DRHL +A + K + D+LITESTY + RE+
Sbjct: 180 MFLIEIAGLKIFFTGDYSREQDRHLVSAEVPKGVKIDVLITESTYGIASHVPRVEREQAL 239
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYY 117
+K + ++RGG+VL+PVFALGRAQEL ++L+ YW + + PIY+A L K Y
Sbjct: 240 VKSITGILNRGGRVLMPVFALGRAQELLLILDEYWGKHPDYQKYPIYYASNLARKCMVVY 299
Query: 118 KMFITWTNQKIRKTFVQ--------------RNMFDFKHIRPFDK-SFIDNPGPMVVFAT 162
+ ++ N I++ F + +DF +IR D+ G V+ A+
Sbjct: 300 QTYVGAMNDNIKRLFREGMAQAEASGDGSGKGGPWDFNYIRSLKNLDRFDDLGGCVMLAS 359
Query: 163 PGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEF------------ 210
PGML +G+S + ++WAP + N +I+ G+ V+GT+ +++ +++
Sbjct: 360 PGMLQNGVSRELLERWAPNDKNGVIITGYSVEGTMAKQIMQEPDQIQAVMSRTMANARRG 419
Query: 211 ----ENKQIIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKI- 265
K +I + +++ SF+AH D + I+ V+LVHGE M+ LK K+
Sbjct: 420 PGGDGEKFLIPRRCSVQEYSFAAHVDGTENREFIEEVAAPVVILVHGEQHNMMRLKSKLL 479
Query: 266 ----KQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLKEEAVKYNSEPP--NPLKERQ- 318
+ F P N E + + V L +PP + L++ Q
Sbjct: 480 SLNANKSVKAKIFSPKNCEELRIPFKQDKTAKVVGKLA-------TIQPPTKSTLEDGQN 532
Query: 319 ---IHGVLVIKDSSISLMDVDEACKEVGISRHIVRFTSNVQISDSSPINKTLELI---YD 372
I GVLV D +SLM ++ + G++ + +++S + ++LI +
Sbjct: 533 ATLITGVLVQNDFKLSLMAPEDLREYAGLNTTTITCKQRLKLSAAG-----IDLIRWGLE 587
Query: 373 QLLSYLQDKSQEYKIQLTESSLSIDSVLLSVERIDDKNKRVFVTWPNQDEEVGKIVLHVL 432
+ +++ + +K +L+E+ D D+N DEEV ++ L
Sbjct: 588 GVFGTVEELPEMHKKKLSETHNGEDEEA-------DENTNEVTRQEEADEEVASLIAAYL 640
>gi|70945220|ref|XP_742453.1| cleavage and polyadenylation specifity factor protein, [Plasmodium
chabaudi chabaudi]
gi|56521446|emb|CAH74696.1| cleavage and polyadenylation specifity factor protein, putative
[Plasmodium chabaudi chabaudi]
Length = 564
Score = 159 bits (403), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 101/288 (35%), Positives = 167/288 (57%), Gaps = 15/288 (5%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V++ N ++YTGDY+ DRH+ A I +LI E TY + D++R RE FL
Sbjct: 211 MFLVEINNIRLLYTGDYSREIDRHIPIAEIPNIDVHVLICEGTYGIKVHDNRRKRETIFL 270
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERM-NLQA-PIYFAVGLTEKATNYYK 118
+ ++ GKVL+PVFALGRAQE+ ++LE +W+R NLQ PI++ + K+ Y+
Sbjct: 271 NMLTNILNSKGKVLLPVFALGRAQEILLILEEHWDRNPNLQKIPIFYISSMATKSLCIYE 330
Query: 119 MFITWTNQKIRKTFVQ-RNMFDFKHIRPFDKS--------FIDNPGPMVVFATPGMLHSG 169
+I ++ I+K + +N F+FK+++ + KS + DN P V+ A+PGML SG
Sbjct: 331 TYINLCSEFIKKIVNEGKNPFNFKYVK-YAKSLDSILNYLYQDNY-PCVIMASPGMLQSG 388
Query: 170 LSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSA 229
+S IF A + + +I+ G+ ++GT+ ++ + + + N +++ K E +SFSA
Sbjct: 389 ISKSIFNIIASDKKSGVIITGYTIKGTLADELKTEPEFVTI-NDKLVKRKCQFEQISFSA 447
Query: 230 HADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEK-IKQEFNLDCFMP 276
H+D I+ + NV+LVHG+ +++ LK K I ++ L F P
Sbjct: 448 HSDFNQTKTFIEKLKCPNVVLVHGDRNELNRLKNKLIDEKKYLSVFTP 495
>gi|68076685|ref|XP_680262.1| cleavage and polyadenylation specifity factor protein, [Plasmodium
berghei strain ANKA]
gi|56501171|emb|CAH99756.1| cleavage and polyadenylation specifity factor protein, putative
[Plasmodium berghei]
Length = 888
Score = 159 bits (402), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 102/288 (35%), Positives = 165/288 (57%), Gaps = 15/288 (5%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V++ N ++YTGDY+ DRH+ A I +LI E TY + D++R RE FL
Sbjct: 211 MFLVEINNIRLLYTGDYSREIDRHIPIAEIPNIDVHVLICEGTYGIKVHDNRRKREAIFL 270
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERM-NLQA-PIYFAVGLTEKATNYYK 118
+ ++ GKVL+PVFALGRAQE+ ++LE +W R NLQ PI++ + K+ Y+
Sbjct: 271 NMLTNILNSKGKVLLPVFALGRAQEILLILEEHWNRNPNLQKIPIFYISSMATKSLCIYE 330
Query: 119 MFITWTNQKIRKTFVQ-RNMFDFKHIRPFDKS--------FIDNPGPMVVFATPGMLHSG 169
+I + I+K + +N F+FK+++ + KS + DN P VV A+PGML SG
Sbjct: 331 TYINLCSDFIKKIVNEGKNPFNFKYVK-YAKSLDSILNYLYQDN-YPCVVMASPGMLQSG 388
Query: 170 LSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSA 229
+S IF A + + +I+ G+ ++GT+ ++ + + + N +++ K E +SFSA
Sbjct: 389 ISKSIFNIIASDKKSGVIITGYTIKGTLADELKTEPEFVTI-NDKLVKRKCQFEQISFSA 447
Query: 230 HADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEK-IKQEFNLDCFMP 276
H+D I+ + NV+LVHG+ +++ LK K I ++ L F P
Sbjct: 448 HSDFNQTKTFIEKLKCPNVVLVHGDRNELNRLKNKLIDEKKYLSVFTP 495
>gi|156100557|ref|XP_001616006.1| cleavage and polyadenylation specifity factor protein [Plasmodium
vivax Sal-1]
gi|148804880|gb|EDL46279.1| cleavage and polyadenylation specifity factor protein, putative
[Plasmodium vivax]
Length = 934
Score = 159 bits (401), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 104/292 (35%), Positives = 160/292 (54%), Gaps = 23/292 (7%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V++ N +YTGDY+ DRH+ A I +LI E TY + D ++ RE FL
Sbjct: 217 MFLVEINNIRFLYTGDYSREVDRHIPIAEIPAIDVHVLICEGTYGIKVHDDRKKREVRFL 276
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERM-NLQA-PIYFAVGLTEKATNYYK 118
+ ++ GKVL+PVFALGRAQEL +++E +WER LQ PI++ + K+ Y+
Sbjct: 277 NMITSILNNKGKVLLPVFALGRAQELLLIMEEHWERNPQLQKIPIFYISSMATKSLCIYE 336
Query: 119 MFITWTNQKIRKTFVQ-RNMFDFKHIRPFDKS--------FIDNPGPMVVFATPGMLHSG 169
FI +R + +N F+FK ++ + KS + DN P VV A+PGML +G
Sbjct: 337 TFINLCGDFVRHVLNEGKNPFNFKFVK-YAKSLDSILNYLYQDN-NPCVVMASPGMLQNG 394
Query: 170 LSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFE----NKQIIDVKMAIEYM 225
+S IF AP + + +I+ G+ V+GT+ ++ K E E N + + + E +
Sbjct: 395 ISKNIFNIIAPDKKSGVILTGYTVKGTLADEL-----KTEPEYVLINDKPVKRRCRFEEI 449
Query: 226 SFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFN-LDCFMP 276
SFSAH+D I+ + NV+LVHG+ +++ LK K+ +E L F P
Sbjct: 450 SFSAHSDFNQTKTFIEMLKCPNVVLVHGDRNELNRLKNKLTEEKKYLSVFTP 501
>gi|156092661|ref|XP_001612477.1| cleavage and polyadenylation specifity factor protein [Plasmodium
vivax Sal-1]
gi|148801279|gb|EDL42684.1| cleavage and polyadenylation specifity factor protein, putative
[Plasmodium vivax]
Length = 858
Score = 159 bits (401), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 104/292 (35%), Positives = 160/292 (54%), Gaps = 23/292 (7%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V++ N +YTGDY+ DRH+ A I +LI E TY + D ++ RE FL
Sbjct: 217 MFLVEINNIRFLYTGDYSREVDRHIPIAEIPAIDVHVLICEGTYGIKVHDDRKKREVRFL 276
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERM-NLQA-PIYFAVGLTEKATNYYK 118
+ ++ GKVL+PVFALGRAQEL +++E +WER LQ PI++ + K+ Y+
Sbjct: 277 NMITSILNNKGKVLLPVFALGRAQELLLIMEEHWERNPQLQKIPIFYISSMATKSLCIYE 336
Query: 119 MFITWTNQKIRKTFVQ-RNMFDFKHIRPFDKS--------FIDNPGPMVVFATPGMLHSG 169
FI +R + +N F+FK ++ + KS + DN P VV A+PGML +G
Sbjct: 337 TFINLCGDFVRHVLNEGKNPFNFKFVK-YAKSLDSILNYLYQDN-NPCVVMASPGMLQNG 394
Query: 170 LSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFE----NKQIIDVKMAIEYM 225
+S IF AP + + +I+ G+ V+GT+ ++ K E E N + + + E +
Sbjct: 395 ISKNIFNIIAPDKKSGVILTGYTVKGTLADEL-----KTEPEYVLINDKPVKRRCRFEEI 449
Query: 226 SFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFN-LDCFMP 276
SFSAH+D I+ + NV+LVHG+ +++ LK K+ +E L F P
Sbjct: 450 SFSAHSDFNQTKTFIEMLKCPNVVLVHGDRNELNRLKNKLTEEKKYLSVFTP 501
>gi|124809291|ref|XP_001348538.1| cleavage and polyadenylation specificity factor protein, putative
[Plasmodium falciparum 3D7]
gi|23497434|gb|AAN36977.1| cleavage and polyadenylation specificity factor protein, putative
[Plasmodium falciparum 3D7]
Length = 876
Score = 158 bits (399), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 96/279 (34%), Positives = 161/279 (57%), Gaps = 14/279 (5%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V++ N +YTGDY+ DRH+ A I +LI E TY + D ++ RE FL
Sbjct: 209 MFLVEINNIRFLYTGDYSREIDRHIPIAEIPNIDVHVLICEGTYGIKVHDDRKKREIRFL 268
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWER-MNLQA-PIYFAVGLTEKATNYYK 118
+ ++ GKVL+PVFALGRAQEL ++LE +W++ +LQ PI++ + K+ Y+
Sbjct: 269 NILTSMINNKGKVLLPVFALGRAQELLLILEEHWDKNKHLQNIPIFYISSMATKSLCIYE 328
Query: 119 MFITWTNQKIRKTFVQ-RNMFDFKHIRPFDKS--------FIDNPGPMVVFATPGMLHSG 169
FI + ++K + +N F+FK+++ + KS + DN P V+ A+PGML +G
Sbjct: 329 TFINLCGEFVKKVVNEGKNPFNFKYVK-YAKSLESISSYLYQDN-NPCVIMASPGMLQNG 386
Query: 170 LSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSA 229
+S IF A + + +I+ G+ V+GT+ ++ + + + N +++ K E +SFSA
Sbjct: 387 ISKNIFNIIASDKKSGVILTGYTVKGTLADELKTEPEFVTI-NDKVVKRKCRFEQISFSA 445
Query: 230 HADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQE 268
H+D I+ + NV+LVHG+ +++ LK K+ +E
Sbjct: 446 HSDFNQTKTFIEKLKCPNVVLVHGDKNELNRLKNKLIEE 484
>gi|253742053|gb|EES98907.1| Cleavage and polyadenylation specificity factor, 73 kDa subunit
[Giardia intestinalis ATCC 50581]
Length = 757
Score = 157 bits (396), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 94/278 (33%), Positives = 147/278 (52%), Gaps = 21/278 (7%)
Query: 1 MFQVKVGNQSIVYTGDYNMTP-DRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDF 59
MF + + N +YTGD++ P DRHL A + R DLLI ESTY T + + RERDF
Sbjct: 199 MFHISIDNFHALYTGDFSCEPEDRHLQPATFPQVRLDLLIIESTYGTIRQKERMTRERDF 258
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYW---ERMNLQAPIYFAVGLTEKATNY 116
+ + V + G VL+PVF++GR QEL +L+ YW E+ + IY+ + + A
Sbjct: 259 IDLIVSTVKKDGCVLLPVFSIGRVQELLCILQEYWREHEQEMARVTIYYVSAIADNARQL 318
Query: 117 Y---KMFITWTNQKIR--KTFVQRNMFDFKHIRPFDKSFIDNP-GPMVVFATPGMLHSGL 170
Y K F+ + + +T +++ + RP NP P V+F TPGML SG+
Sbjct: 319 YSKDKGFLRHGDTGLSDIQTGKRKDRIIYTKTRP------KNPKKPYVMFCTPGMLQSGV 372
Query: 171 SLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAH 230
S ++ + +N+L++ G+ Q T+ +K+L G + D KM IE +SFSAH
Sbjct: 373 SKEMYNELCGSPDNLLLVTGYATQDTLLYKLLEGKPPGAYA-----DAKMRIEELSFSAH 427
Query: 231 ADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQE 268
+D + +++ P+NV+ +HG + LK I E
Sbjct: 428 SDYNQTLDVLRKTRPRNVVFIHGSEKSISSLKRTIANE 465
>gi|398406895|ref|XP_003854913.1| hypothetical protein MYCGRDRAFT_55193, partial [Zymoseptoria
tritici IPO323]
gi|339474797|gb|EGP89889.1| hypothetical protein MYCGRDRAFT_55193 [Zymoseptoria tritici IPO323]
Length = 855
Score = 155 bits (392), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 94/315 (29%), Positives = 159/315 (50%), Gaps = 35/315 (11%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDF 59
M+ +++ +I++TGDY+ DRHL A + + + D LITEST+ + R ++ RE
Sbjct: 189 MYLIEIAGLNILFTGDYSRETDRHLIPAAVPRNVKIDCLITESTFGISTRTPRQERENAL 248
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYY 117
+K + ++RGG+VL+P A+G QEL ++LE YW+R + P+Y+A GL +K Y
Sbjct: 249 IKSITGILNRGGRVLMPTTAVGNTQELMLILEDYWQRHEEYRRFPMYYASGLAKKVMIVY 308
Query: 118 KMFITWTNQKIRKTF---VQRNMFDFKHIRPFDKSFI---------DNPGPMVVFATPGM 165
+ ++ N I+ F P+D +FI ++ GP VV A+PGM
Sbjct: 309 QTYVETMNDTIKAKFQASAAAASDSSGAGGPWDFNFIRQLKSMDRYEDVGPSVVLASPGM 368
Query: 166 LHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVL---------SGVKK--------- 207
L +G S + ++WAP N +I+ G+ V+GT+ ++ +G K
Sbjct: 369 LQNGPSRTLLERWAPDAKNGVIITGYSVEGTMAKTIMTEPDSIPAVTGKKNTGMNIGKRP 428
Query: 208 --LEFENKQIIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKI 265
L +Q+I + ++ +F+ H D + + I+ V+LVHGE M LK ++
Sbjct: 429 GALADGERQMIPRRCTVQEFNFAVHVDGQENREFIEEVGAPVVILVHGEKHNMNRLKSRL 488
Query: 266 KQEFNLDCFMPANGE 280
+ + PAN E
Sbjct: 489 LGLGKMKVYSPANCE 503
>gi|453087099|gb|EMF15140.1| Metallo-hydrolase/oxidoreductase [Mycosphaerella populorum SO2202]
Length = 845
Score = 155 bits (391), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 93/320 (29%), Positives = 159/320 (49%), Gaps = 45/320 (14%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDF 59
M+ +++ ++++TGDY+ DRHL A + + + D LITEST+ + R ++ RE
Sbjct: 189 MYLIEIAGLNVLFTGDYSRENDRHLIPAAVPRNVKVDCLITESTFGISTRTPRQERENAL 248
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYY 117
+K + ++RGG+VL+P A+G QEL ++LE +W R + PIY+A GL K Y
Sbjct: 249 IKSITTILNRGGRVLMPTTAVGNTQELLLILEDHWHRHEEYRRFPIYYASGLARKVMVVY 308
Query: 118 KMFITWTNQKIRKTF----VQRNMFDFKHIRPFDKSFI---------DNPGPMVVFATPG 164
+ ++ N +I+ F ++ D P+D F+ D+ G VV A+PG
Sbjct: 309 QTYVDDMNDRIKAKFQASATGPSVGDGGTAGPWDFQFVRALKGVDRFDDVGGSVVLASPG 368
Query: 165 MLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFEN------------ 212
ML +G S + ++WAP N +I+ G+ V+GT+ +L LE +N
Sbjct: 369 MLQNGPSRALLERWAPDSKNGVIITGYSVEGTMAKNIL-----LEPDNIPAVTVDRSAGN 423
Query: 213 ------------KQIIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVF 260
+Q++ + ++ +F+ H D + + I+ V+LVHGE M
Sbjct: 424 VLGKKAGGPDGERQMVPRRCTVQEFNFAVHVDGQENREFIEEVGAPVVILVHGEKHNMNR 483
Query: 261 LKEKIKQEFNLDCFMPANGE 280
LK ++ + + PAN E
Sbjct: 484 LKSRLLGTGKMKVYSPANCE 503
>gi|348515035|ref|XP_003445045.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein C12orf26
homolog [Oreochromis niloticus]
Length = 590
Score = 154 bits (390), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 89/229 (38%), Positives = 137/229 (59%), Gaps = 15/229 (6%)
Query: 804 ITHFVTPDSDISSILSQAYPQDSLHNVCIMGLHTCGDLSGTALRLF-TKSSLQCLVQVGC 862
+T +VT ++++ ++++ L + ++GLHTCGDL+ + LR+F K L + VGC
Sbjct: 341 LTSYVTAETELRDLINE------LEDAVMVGLHTCGDLAPSTLRMFVAKPELAAVCSVGC 394
Query: 863 CYHLLEEEFIRSPFWKDVDQSLYEHGYGFPLSEHLRSRKFFLGRNVRMSGTQSPERVIDL 922
CYHLL EEF P ++ QS+ GFPLS++LR + F GRN RMS + ERV
Sbjct: 395 CYHLLSEEF--DPAGQECLQSVC----GFPLSQYLRDQSCFCGRNARMSACLALERVALG 448
Query: 923 KQTQTLPLFYRALLEKYLRSKITINDEEPKVVGRLATKCSNFVEYVHRAVDKLKLD-LEV 981
+ Q LFYRA+L LR + E K VG + +K +FV+YV RA+ +L+LD ++
Sbjct: 449 QGIQMESLFYRAVLHVILRDHYSSYKSE-KRVGNVYSKAKSFVDYVRRALRRLELDESKL 507
Query: 982 DDEEVTRLFNSHQREYEYLQIYYFLKTALAPVIEALIVLDRVLYLREQQ 1030
D ++ N+++ + + + LK LAP IE LI+LDR+ YL+EQ+
Sbjct: 508 SDSDIQDYHNTYRVRMDEMHAFNMLKVTLAPCIEGLILLDRLCYLKEQE 556
Score = 75.1 bits (183), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 33/73 (45%), Positives = 51/73 (69%)
Query: 664 QLMSEKKSYEVQVMSQVVAAVTNSCDSSHIIDLGGGQGYLSTILALQHGKKTLSLDYNQV 723
+ M+ KKS+EVQ MS+VVA + C + +ID+G G+GYLS+ L+LQ+ + +D +
Sbjct: 136 EFMTSKKSHEVQSMSEVVACLAQHCGVNQVIDVGSGKGYLSSFLSLQYALRVYGIDSSST 195
Query: 724 NTHGAAVRSKKLE 736
NTHGA R++KL+
Sbjct: 196 NTHGAQERNRKLK 208
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 47/164 (28%), Positives = 80/164 (48%), Gaps = 15/164 (9%)
Query: 485 IKNTIDEMIEFITPNIACVNTHTVHFMVEKLWETTI---PKAILSQVNN-----SPSFDS 536
I+ IDE+I F++ + N HTV F +W+ + P+ +LS V + SP +
Sbjct: 11 IQQQIDEVIRFLSITLNIANAHTVEFYTHDVWKRFVAVSPEDVLSAVGSDIPYLSPXTEQ 70
Query: 537 LLEEFWKSRQDNHVETNSELVKFFQAADKFRLTSL---MNGESIITVDELFSKLQARKCG 593
F + ELV QAA + L L M+ E+++ ++ ++
Sbjct: 71 PSTTFGFCNDTKRLVDTQELV---QAAKEHSLPGLGLCMSRENLLQ-GLRDTRSESGTPE 126
Query: 594 QVVESAKVSQLMSEKKSYEVQVMSQVVAAVTNSCDSSHIIDLGS 637
+ V + + + M+ KKS+EVQ MS+VVA + C + +ID+GS
Sbjct: 127 ETVAAFEPDEFMTSKKSHEVQSMSEVVACLAQHCGVNQVIDVGS 170
Score = 41.6 bits (96), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 17/52 (32%), Positives = 33/52 (63%)
Query: 1044 TQVDDEEVTRLFDSHQREYEYLQIYYFLKTALAPVIEALIVLDRVLYLREQQ 1095
+++ D ++ ++++ + + + LK LAP IE LI+LDR+ YL+EQ+
Sbjct: 505 SKLSDSDIQDYHNTYRVRMDEMHAFNMLKVTLAPCIEGLILLDRLCYLKEQE 556
>gi|426373580|ref|XP_004053676.1| PREDICTED: methyltransferase-like protein 25 [Gorilla gorilla
gorilla]
Length = 603
Score = 154 bits (390), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 92/270 (34%), Positives = 151/270 (55%), Gaps = 30/270 (11%)
Query: 763 NRQTSHASVEHHGKNWKRKSKAPVVSIKDEELVVCKDKCKQITHFVTPDSDISSILSQAY 822
NR+TS A+ E WK S++ + S +T F+T DS++ I+
Sbjct: 333 NRETSEANKERRKMTWK-SSESNIYS--------------PLTSFITADSELHDIIKD-- 375
Query: 823 PQDSLHNVCIMGLHTCGDLSGTALRLFTKSS-LQCLVQVGCCYHLLEEEFIRSPFWKDVD 881
L + ++GLHTCGDL+ LR+FT +S ++ + VGCCYHLL EEF ++
Sbjct: 376 ----LEDCLMVGLHTCGDLAPNTLRIFTSNSEIKGVCSVGCCYHLLSEEF------ENQH 425
Query: 882 QSLYEHGYGFPLSEHLRSRKFFLGRNVRMSGTQSPERVIDLKQTQTLPLFYRALLEKYLR 941
+ + +GFP+ +L+ ++ GRN RMS + ERV + T LFYRA+L+ ++
Sbjct: 426 KERTQEKWGFPMCHYLKEERWCCGRNARMSACLALERVAAGQGLPTESLFYRAVLQDIIK 485
Query: 942 SKITINDEEPKVVGRLATKCSNFVEYVHRAVDKLKLD-LEVDDEEVTRLFNSHQREYEYL 1000
I + + VG++ +KCS+F++YV R++ KL LD ++ ++ + + ++ L
Sbjct: 486 DCYGITKCD-RHVGKIYSKCSSFLDYVRRSLKKLGLDESKLPEKIIMNYYEKYKPRMNEL 544
Query: 1001 QIYYFLKTALAPVIEALIVLDRVLYLREQQ 1030
+ + LK LAP IE LI+LDR+ YL+EQ+
Sbjct: 545 EAFNMLKVVLAPCIETLILLDRLCYLKEQE 574
Score = 78.6 bits (192), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 63/255 (24%), Positives = 110/255 (43%), Gaps = 52/255 (20%)
Query: 485 IKNTIDEMIEFITPNIACVNTHTVHFMVEKLWETTI---PKAILSQVNNSPSFDSLLEEF 541
++ + +++F+ ++ N HTV F E +WE + P +L+ + S S L
Sbjct: 16 LRAKLQGLLQFLRDALSISNAHTVDFYTESVWEELVDLPPATVLAALRKSASETEALP-- 73
Query: 542 WKSRQDNHVETNSELVKFFQAADKFRLTSLMNGESIITVDELFSKLQARKCGQVVESAKV 601
SE +A + +T F K+ ++V V
Sbjct: 74 ------------SETRPLVEAEWEAGMTD-------------FPKIFCETSQKLV---SV 105
Query: 602 SQLMSEKKSYEVQVMSQVVAAVTNSCDSSHIIDLGSTITVDELFSKLQARKCGLVVESAK 661
K Y VQ LG ++L L+ + + E+ K
Sbjct: 106 EAFALAAKYYSVQ-------------------KLGICTPFEQLLVALRGNQNQRIGENQK 146
Query: 662 VSQLMSEKKSYEVQVMSQVVAAVTNSCDSSHIIDLGGGQGYLSTILALQHGKKTLSLDYN 721
+ M+ KKS+EVQ MS++++++ + +IDLG G+GYLS+ L+L++G K +D +
Sbjct: 147 AVEFMNMKKSHEVQAMSELISSIADYYGIKQVIDLGSGKGYLSSFLSLKYGLKVYGIDSS 206
Query: 722 QVNTHGAAVRSKKLE 736
NTHGA R++KL+
Sbjct: 207 NTNTHGAEERNRKLK 221
>gi|308162204|gb|EFO64613.1| Cleavage and polyadenylation specificity factor, 73 kDa subunit
[Giardia lamblia P15]
Length = 737
Score = 154 bits (389), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 93/282 (32%), Positives = 149/282 (52%), Gaps = 21/282 (7%)
Query: 1 MFQVKVGNQSIVYTGDYNMTP-DRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDF 59
MF + + N +YTGD++ P DRHL A + + DLLI ESTY T + + RERDF
Sbjct: 177 MFHISIDNFHALYTGDFSCEPEDRHLQPATFPQVKLDLLIIESTYGTIRQKERMTRERDF 236
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYW---ERMNLQAPIYFAVGLTEKATNY 116
+ + V + G VL+PVF++GR QEL +L+ YW E+ + IY+ + + A
Sbjct: 237 IDLIVSTVKKDGCVLLPVFSIGRVQELLCILQEYWREHEQEMARITIYYVSAIADNARQL 296
Query: 117 Y---KMFITWTNQKIR--KTFVQRNMFDFKHIRPFDKSFIDNP-GPMVVFATPGMLHSGL 170
Y K F+ + + +T +++ + RP NP P V+F TPGML SG+
Sbjct: 297 YSKDKGFLRHGDTGLSDIQTGKRKDKIIYTKTRP------KNPKKPYVMFCTPGMLQSGV 350
Query: 171 SLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAH 230
S ++ + +N+L++ G+ Q T+ +K+L G + D KM IE +SFSAH
Sbjct: 351 SKEMYNELCGSPDNLLLVTGYATQDTLLYKLLEGKPPGGYA-----DAKMRIEELSFSAH 405
Query: 231 ADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLD 272
+D + +++ P+NV+ +HG + LK I E ++
Sbjct: 406 SDYNQTLDVLRKTRPRNVVFIHGSEKSISSLKRTIANETRIN 447
>gi|159111399|ref|XP_001705931.1| Cleavage and polyadenylation specificity factor, 73 kDa subunit
[Giardia lamblia ATCC 50803]
gi|157434022|gb|EDO78257.1| Cleavage and polyadenylation specificity factor, 73 kDa subunit
[Giardia lamblia ATCC 50803]
Length = 757
Score = 154 bits (388), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 93/278 (33%), Positives = 146/278 (52%), Gaps = 21/278 (7%)
Query: 1 MFQVKVGNQSIVYTGDYNMTP-DRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDF 59
MF + + N +YTGD++ P DRHL A + + DLLI ESTY T + + RERDF
Sbjct: 199 MFHISIDNFHALYTGDFSCEPEDRHLQPATFPQVKLDLLIIESTYGTIRQKERMTRERDF 258
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYW---ERMNLQAPIYFAVGLTEKATNY 116
+ + V + G VL+PVF++GR QEL +L+ YW E+ + IY+ + + A
Sbjct: 259 IDLIVSTVKKDGCVLLPVFSIGRVQELLCILQEYWREHEQEMARVTIYYVSAIADNARQL 318
Query: 117 Y---KMFITWTNQKIR--KTFVQRNMFDFKHIRPFDKSFIDNP-GPMVVFATPGMLHSGL 170
Y K F+ + + +T +++ + RP NP P V+F TPGML SG+
Sbjct: 319 YSKDKGFLRHGDTGLSDIQTGKRKDRIIYTKTRP------KNPKKPYVMFCTPGMLQSGV 372
Query: 171 SLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAH 230
S ++ + +N+L++ G+ Q T+ +K+L G + D KM IE +SFSAH
Sbjct: 373 SKEMYNELCGSPDNLLLVTGYATQDTLLYKLLEGKPPGGYA-----DAKMRIEELSFSAH 427
Query: 231 ADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQE 268
+D + +++ P+NV +HG + LK I E
Sbjct: 428 SDYNQTLDVLRKTRPRNVAFIHGSEKSISSLKRTIANE 465
>gi|291389663|ref|XP_002711413.1| PREDICTED: hypothetical protein [Oryctolagus cuniculus]
Length = 605
Score = 153 bits (386), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 88/231 (38%), Positives = 137/231 (59%), Gaps = 19/231 (8%)
Query: 804 ITHFVTPDSDISSILSQAYPQDSLHNVCIMGLHTCGDLSGTALRLFTKSS-LQCLVQVGC 862
+T F+T DS++ I+ L + ++GLHTCGDL+ LR+FT +S ++ + VGC
Sbjct: 361 LTSFITADSELHDIIKD------LEDCLMVGLHTCGDLAPNTLRIFTSNSEIKGVCSVGC 414
Query: 863 CYHLLEEEFIRSPFWKDVDQSLYEHGYGFPLSEHLRSRKFFLGRNVRMSGTQSPERVIDL 922
CYHLL EEF P K+ Q +GFP+ +L+ K+ GRN RMS + ERV
Sbjct: 415 CYHLLSEEFENQP--KECPQ----ETWGFPMCRYLKEEKWCCGRNARMSACLALERVAVG 468
Query: 923 KQTQTLPLFYRALLEKYLRS--KITINDEEPKVVGRLATKCSNFVEYVHRAVDKLKLD-L 979
+ T LFYRA+L+ ++ IT D+ VG++ +KCS+F+EYV +++ KL LD
Sbjct: 469 QGLPTESLFYRAVLQNIIKDCYGITKCDQH---VGKIYSKCSSFLEYVRKSLKKLGLDES 525
Query: 980 EVDDEEVTRLFNSHQREYEYLQIYYFLKTALAPVIEALIVLDRVLYLREQQ 1030
++ ++ + + ++ L+ + LK LAP IE LI+LDR+ YL+EQ+
Sbjct: 526 KLPEKLIMEYYEKYKPRMNELEAFNMLKVVLAPCIETLILLDRLCYLKEQE 576
Score = 85.5 bits (210), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 67/248 (27%), Positives = 113/248 (45%), Gaps = 52/248 (20%)
Query: 492 MIEFITPNIACVNTHTVHFMVEKLWETTI---PKAILSQVNNSPSFDSLLEEFWKSRQDN 548
++ F+ ++ N HTV F E LWE + P+ +L+ + S
Sbjct: 23 LLWFLRDALSISNAHTVDFYTESLWEELVDLPPETVLAALRKSA---------------- 66
Query: 549 HVETNSELVKFFQAADKFRLTSLMNGESIITVDELFSKLQARKCGQVVESAKVSQLMSEK 608
+E + L A+ L G I ++FS+ Q + SA+ L +
Sbjct: 67 -LEPQARL------AESRPLLEAEMGSGITDFPKIFSET-----SQKLVSAEAFALAA-- 112
Query: 609 KSYEVQVMSQVVAAVTNSCDSSHIIDLGSTITVDELFSKLQARKCGLVVESAKVSQLMSE 668
K + +Q +LG ++L LQ + E+ K + M+
Sbjct: 113 KCFSMQ-------------------NLGICTPFEQLLVALQGNQKQKTGENVKPVEFMNV 153
Query: 669 KKSYEVQVMSQVVAAVTNSCDSSHIIDLGGGQGYLSTILALQHGKKTLSLDYNQVNTHGA 728
KKS+EVQ MS++++++ + C IIDLG G+GYLS+ L+L++G K +D + NTHGA
Sbjct: 154 KKSHEVQAMSELISSIADYCGIKQIIDLGSGKGYLSSFLSLKYGLKVYGIDSSNTNTHGA 213
Query: 729 AVRSKKLE 736
R++KL+
Sbjct: 214 EERNRKLK 221
>gi|349604454|gb|AEQ00001.1| Uncharacterized protein C12orf26-like protein, partial [Equus
caballus]
Length = 371
Score = 153 bits (386), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 87/231 (37%), Positives = 137/231 (59%), Gaps = 19/231 (8%)
Query: 804 ITHFVTPDSDISSILSQAYPQDSLHNVCIMGLHTCGDLSGTALRLFT-KSSLQCLVQVGC 862
+T F+T DSD++ I+ L N ++GLHTCGDL+ LR+FT KS ++ + VGC
Sbjct: 127 LTSFITADSDLNDIIKD------LENCLMVGLHTCGDLAPNTLRIFTSKSEIKGVCSVGC 180
Query: 863 CYHLLEEEFIRSPFWKDVDQSLYEHGYGFPLSEHLRSRKFFLGRNVRMSGTQSPERVIDL 922
CYHLL EEF P + + +GFP+S +L+ ++ GRN RMS + ERV
Sbjct: 181 CYHLLSEEFENPP------KECTQQKWGFPMSCYLKEERWCCGRNARMSACLALERVAVG 234
Query: 923 KQTQTLPLFYRALLEKYLRS--KITINDEEPKVVGRLATKCSNFVEYVHRAVDKLKLD-L 979
+ T LFYRA+L+ ++ IT D+ VG++ +K S+F++YV +++ KL LD
Sbjct: 235 RGLPTESLFYRAVLQDIIKDCYGITKCDQH---VGKIYSKSSSFLDYVRKSLKKLGLDES 291
Query: 980 EVDDEEVTRLFNSHQREYEYLQIYYFLKTALAPVIEALIVLDRVLYLREQQ 1030
++ ++ + + ++ L+ + LK LAP IE LI+LDR+ YL+EQ+
Sbjct: 292 KLPEKIIMDYYEKYKPRMNELEAFNMLKVVLAPCIETLILLDRLCYLKEQE 342
>gi|149742881|ref|XP_001490811.1| PREDICTED: uncharacterized protein C12orf26-like [Equus caballus]
Length = 603
Score = 152 bits (385), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 87/231 (37%), Positives = 137/231 (59%), Gaps = 19/231 (8%)
Query: 804 ITHFVTPDSDISSILSQAYPQDSLHNVCIMGLHTCGDLSGTALRLFT-KSSLQCLVQVGC 862
+T F+T DSD++ I+ L N ++GLHTCGDL+ LR+FT KS ++ + VGC
Sbjct: 359 LTSFITADSDLNDIIKD------LENCLMVGLHTCGDLAPNTLRIFTSKSEIKGVCSVGC 412
Query: 863 CYHLLEEEFIRSPFWKDVDQSLYEHGYGFPLSEHLRSRKFFLGRNVRMSGTQSPERVIDL 922
CYHLL EEF P + + +GFP+S +L+ ++ GRN RMS + ERV
Sbjct: 413 CYHLLSEEFENPP------KECTQQKWGFPMSCYLKEERWCCGRNARMSACLALERVAVG 466
Query: 923 KQTQTLPLFYRALLEKYLRS--KITINDEEPKVVGRLATKCSNFVEYVHRAVDKLKLD-L 979
+ T LFYRA+L+ ++ IT D+ VG++ +K S+F++YV +++ KL LD
Sbjct: 467 RGLPTESLFYRAVLQDIIKDCYGITKCDQH---VGKIYSKSSSFLDYVRKSLKKLGLDES 523
Query: 980 EVDDEEVTRLFNSHQREYEYLQIYYFLKTALAPVIEALIVLDRVLYLREQQ 1030
++ ++ + + ++ L+ + LK LAP IE LI+LDR+ YL+EQ+
Sbjct: 524 KLPEKIIMDYYEKYKPRMNELEAFNMLKVVLAPCIETLILLDRLCYLKEQE 574
Score = 84.7 bits (208), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 67/248 (27%), Positives = 109/248 (43%), Gaps = 52/248 (20%)
Query: 492 MIEFITPNIACVNTHTVHFMVEKLWETTI---PKAILSQVNNSPSFDSLLEEFWKSRQDN 548
++ F+ +A N HTV F E LWE + P+ +L+ + +S +
Sbjct: 23 LLRFLREALAISNAHTVDFYTESLWEQLVDLPPETVLAVLRSSAA--------------- 67
Query: 549 HVETNSELVKFFQAADKFRLTSLMNGESIITVDELFSKLQARKCGQVVESAKVSQLMSEK 608
ET A L+ GE + + F K+ ++V V
Sbjct: 68 --ETE---------ARPSEARPLVEGEREAGITD-FPKIFCETSQKLVS---VEGFALAA 112
Query: 609 KSYEVQVMSQVVAAVTNSCDSSHIIDLGSTITVDELFSKLQARKCGLVVESAKVSQLMSE 668
K Y +Q +LG ++L L+ + E+ K + M+
Sbjct: 113 KYYSIQ-------------------NLGICTPFEQLLVALRGNQKQRTGENVKPDEFMNL 153
Query: 669 KKSYEVQVMSQVVAAVTNSCDSSHIIDLGGGQGYLSTILALQHGKKTLSLDYNQVNTHGA 728
KKS+EVQ MS+V+++V C +IDLG G+GYLS+ L+L++G K +D + NTHGA
Sbjct: 154 KKSHEVQAMSEVISSVAEYCGIKQVIDLGSGKGYLSSFLSLKYGLKVYGIDSSNTNTHGA 213
Query: 729 AVRSKKLE 736
R++KL+
Sbjct: 214 EERNRKLK 221
>gi|355564503|gb|EHH21003.1| hypothetical protein EGK_03973 [Macaca mulatta]
Length = 603
Score = 152 bits (383), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 93/270 (34%), Positives = 151/270 (55%), Gaps = 30/270 (11%)
Query: 763 NRQTSHASVEHHGKNWKRKSKAPVVSIKDEELVVCKDKCKQITHFVTPDSDISSILSQAY 822
NR+TS A+ E K + S++ V S +T F+T DS++ I+
Sbjct: 333 NRETSEANKERR-KMTSKSSESNVYS--------------PLTSFITADSELHDIIKD-- 375
Query: 823 PQDSLHNVCIMGLHTCGDLSGTALRLFT-KSSLQCLVQVGCCYHLLEEEFIRSPFWKDVD 881
L + ++GLHTCGDL+ LR+FT KS ++ + VGCCYHLL EEF ++
Sbjct: 376 ----LEDCLMVGLHTCGDLAPNTLRIFTSKSEIKGVCSVGCCYHLLSEEF------ENQH 425
Query: 882 QSLYEHGYGFPLSEHLRSRKFFLGRNVRMSGTQSPERVIDLKQTQTLPLFYRALLEKYLR 941
+ + +GFP+ +L+ ++ GRN RMS + ERV + T LFYRA+L+ ++
Sbjct: 426 KERTQEKWGFPMCHYLKEERWCCGRNARMSACLALERVAAGQGLPTESLFYRAVLQDIIK 485
Query: 942 SKITINDEEPKVVGRLATKCSNFVEYVHRAVDKLKLD-LEVDDEEVTRLFNSHQREYEYL 1000
I + + VG++ +KCS+F++YV R++ KL LD ++ ++ + + ++ L
Sbjct: 486 DCYGITKCD-RHVGKIYSKCSSFLDYVRRSLKKLGLDESKLPEKIIMNYYEKYKPRMNEL 544
Query: 1001 QIYYFLKTALAPVIEALIVLDRVLYLREQQ 1030
+ + LK LAP IE LI+LDR+ YL+EQ+
Sbjct: 545 EAFNMLKVVLAPCIETLILLDRLCYLKEQE 574
Score = 89.0 bits (219), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 66/255 (25%), Positives = 114/255 (44%), Gaps = 52/255 (20%)
Query: 485 IKNTIDEMIEFITPNIACVNTHTVHFMVEKLWETTI---PKAILSQVNNSPSFDSLLEEF 541
++ + +++F+ ++ N HTV F E +WE + P+ +L+ + S S E
Sbjct: 16 LRAKLQGLLQFLRDALSISNAHTVDFYTESVWEELVDLTPETVLTALRKSAS----EAEA 71
Query: 542 WKSRQDNHVETNSELVKFFQAADKFRLTSLMNGESIITVDELFSKLQARKCGQVVESAKV 601
W S VE GE+ +T F K+ ++V V
Sbjct: 72 WPSEMRRLVEAE--------------------GEAGVTD---FPKIFCETSQKLV---SV 105
Query: 602 SQLMSEKKSYEVQVMSQVVAAVTNSCDSSHIIDLGSTITVDELFSKLQARKCGLVVESAK 661
K Y +Q +LG ++L L+ + + E+ K
Sbjct: 106 EAFALAAKYYSIQ-------------------NLGICTPFEQLLVALRGNQNQRIGETQK 146
Query: 662 VSQLMSEKKSYEVQVMSQVVAAVTNSCDSSHIIDLGGGQGYLSTILALQHGKKTLSLDYN 721
+ M+ KKS+EVQ MS++++++ + C +IDLG G+GYLS+ L+L++G K +D +
Sbjct: 147 AVEFMNMKKSHEVQAMSELISSIADYCGIKQVIDLGSGKGYLSSFLSLKYGFKVYGIDSS 206
Query: 722 QVNTHGAAVRSKKLE 736
NTHGA R+ KL+
Sbjct: 207 NTNTHGAEERNSKLK 221
>gi|355786347|gb|EHH66530.1| hypothetical protein EGM_03541 [Macaca fascicularis]
Length = 603
Score = 152 bits (383), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 93/270 (34%), Positives = 151/270 (55%), Gaps = 30/270 (11%)
Query: 763 NRQTSHASVEHHGKNWKRKSKAPVVSIKDEELVVCKDKCKQITHFVTPDSDISSILSQAY 822
NR+TS A+ E K + S++ V S +T F+T DS++ I+
Sbjct: 333 NRETSEANKERR-KMTSKSSESNVYS--------------PLTSFITADSELHDIIKD-- 375
Query: 823 PQDSLHNVCIMGLHTCGDLSGTALRLFT-KSSLQCLVQVGCCYHLLEEEFIRSPFWKDVD 881
L + ++GLHTCGDL+ LR+FT KS ++ + VGCCYHLL EEF ++
Sbjct: 376 ----LEDCLMVGLHTCGDLAPNTLRIFTSKSEIKGVCSVGCCYHLLSEEF------ENQH 425
Query: 882 QSLYEHGYGFPLSEHLRSRKFFLGRNVRMSGTQSPERVIDLKQTQTLPLFYRALLEKYLR 941
+ + +GFP+ +L+ ++ GRN RMS + ERV + T LFYRA+L+ ++
Sbjct: 426 KERTQEKWGFPMCHYLKEERWCCGRNARMSACLALERVAAGQGLPTESLFYRAVLQDIIK 485
Query: 942 SKITINDEEPKVVGRLATKCSNFVEYVHRAVDKLKLD-LEVDDEEVTRLFNSHQREYEYL 1000
I + + VG++ +KCS+F++YV R++ KL LD ++ ++ + + ++ L
Sbjct: 486 DCYGITKCD-RHVGKIYSKCSSFLDYVRRSLKKLGLDESKLPEKIIMNYYEKYKPRMNEL 544
Query: 1001 QIYYFLKTALAPVIEALIVLDRVLYLREQQ 1030
+ + LK LAP IE LI+LDR+ YL+EQ+
Sbjct: 545 EAFNMLKVVLAPCIETLILLDRLCYLKEQE 574
Score = 85.5 bits (210), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 65/256 (25%), Positives = 116/256 (45%), Gaps = 54/256 (21%)
Query: 485 IKNTIDEMIEFITPNIACVNTHTVHFMVEKLWETTI---PKAILSQVNNSPSFDSLLEEF 541
++ + +++F+ ++ N HTV F E +WE + P+ +L+ + S S
Sbjct: 16 LRAKLQGLLQFLRDALSISNAHTVDFYTESVWEELVDLTPETVLTALRKSAS-------- 67
Query: 542 WKSRQDNHVETN-SELVKFFQAADKFRLTSLMNGESIITVDELFSKLQARKCGQVVESAK 600
E SE + +A GE+ +T F K+ ++V
Sbjct: 68 -------EAEARPSETRRLVEA----------EGEAGVTD---FPKIFCETSQKLV---S 104
Query: 601 VSQLMSEKKSYEVQVMSQVVAAVTNSCDSSHIIDLGSTITVDELFSKLQARKCGLVVESA 660
V K Y +Q +LG ++L L+ + + E+
Sbjct: 105 VEAFALAAKYYSIQ-------------------NLGICTPFEQLLVALRGNQNQRIGETQ 145
Query: 661 KVSQLMSEKKSYEVQVMSQVVAAVTNSCDSSHIIDLGGGQGYLSTILALQHGKKTLSLDY 720
K + M+ KKS+EVQ MS++++++ + C +IDLG G+GYLS+ L+L++G K +D
Sbjct: 146 KAVEFMNMKKSHEVQAMSELISSIADYCGIKQVIDLGSGKGYLSSFLSLKYGFKVYGIDS 205
Query: 721 NQVNTHGAAVRSKKLE 736
+ NTHGA R++KL+
Sbjct: 206 SNTNTHGAEERNRKLK 221
>gi|326482980|gb|EGE06990.1| endoribonuclease ysh1 [Trichophyton equinum CBS 127.97]
Length = 818
Score = 151 bits (382), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 112/379 (29%), Positives = 188/379 (49%), Gaps = 44/379 (11%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDF 59
MF + + +I++TGDY+ DRHL +A + K + D++ITEST+ + + RE
Sbjct: 180 MFLISIAGLNILFTGDYSREEDRHLISAEVPKGVKIDVMITESTFGISSNPPRLEREAAL 239
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERM-NLQ-APIYFAVGLTEKATNYY 117
+K V ++RGG+VL+PVFALGRAQEL ++L+ YW R LQ PIY+ + + Y
Sbjct: 240 MKSVTSIINRGGRVLMPVFALGRAQELLLILDEYWSRHPELQKVPIYYIGNMARRCMVVY 299
Query: 118 KMFITWTNQKIRKTFVQR--------------NMFDFKHIRPF-DKSFIDNPGPMVVFAT 162
+ +I N+ I++ F QR +DF+ +R + ++ G V+ A+
Sbjct: 300 QTYIGAMNENIKRLFRQRMAEAEARGDKSVTAGPWDFRFVRSLRNLDRFEDVGGCVMLAS 359
Query: 163 PGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAI 222
PGML +G S + ++WAP E N +IM G+ V+GT+G ++++ E +QI V A
Sbjct: 360 PGMLQTGTSRELLERWAPNERNGVIMTGYSVEGTMGKQIIN-------EPEQIPAVMSA- 411
Query: 223 EYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESC 282
+ + +D + IM + + C + L L + +E + + PAN +
Sbjct: 412 --KNAAGPSDDQKIM-IQRRCTVDEISLQ---------LTLNVDKEVKVKVYTPANCDEV 459
Query: 283 FVQTDMKISIDVSVNLLKEEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEV 342
+ M + V L E S P R + GVLV +S+M D+ +
Sbjct: 460 RIPF-MVDKVARVVGRLAE-----TSPPIGQDDSRLMDGVLVQNGFKLSMMASDDLREYA 513
Query: 343 GISRHIVRFTSNVQISDSS 361
G++ +V + +S +S
Sbjct: 514 GLTTTMVTCKQYITLSTAS 532
>gi|296212475|ref|XP_002752848.1| PREDICTED: uncharacterized protein C12orf26 [Callithrix jacchus]
Length = 603
Score = 151 bits (381), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 85/231 (36%), Positives = 138/231 (59%), Gaps = 19/231 (8%)
Query: 804 ITHFVTPDSDISSILSQAYPQDSLHNVCIMGLHTCGDLSGTALRLFTKSS-LQCLVQVGC 862
+T F+T DS++ I+ + L + I+GLHTCGDL+ LR+FT +S ++ + VGC
Sbjct: 359 LTSFITADSELHDII------EDLEDCLIVGLHTCGDLAPNTLRIFTSNSEIKGVCSVGC 412
Query: 863 CYHLLEEEFIRSPFWKDVDQSLYEHGYGFPLSEHLRSRKFFLGRNVRMSGTQSPERVIDL 922
CYHLL EEF ++ + + +GFP+ +L+ ++ GRN RMS + ERV
Sbjct: 413 CYHLLSEEF------ENQHKECTQEKWGFPMCRYLKEERWCCGRNARMSACLALERVAVG 466
Query: 923 KQTQTLPLFYRALLEKYLRS--KITINDEEPKVVGRLATKCSNFVEYVHRAVDKLKLD-L 979
+ T LFYRA+L+ ++ IT D+ VG++ +KCS+F++YV R++ KL LD
Sbjct: 467 QGLPTESLFYRAVLQDIIKDCYGITKCDQH---VGKIYSKCSSFLDYVRRSLKKLGLDES 523
Query: 980 EVDDEEVTRLFNSHQREYEYLQIYYFLKTALAPVIEALIVLDRVLYLREQQ 1030
++ ++ + + ++ L+ + LK LAP IE LI+LDR+ YL+EQ+
Sbjct: 524 KLPEKIILNYYEKYKPRMNELEAFNMLKVVLAPCIETLILLDRLCYLKEQE 574
Score = 79.7 bits (195), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 64/255 (25%), Positives = 115/255 (45%), Gaps = 52/255 (20%)
Query: 485 IKNTIDEMIEFITPNIACVNTHTVHFMVEKLWETTI---PKAILSQVNNSPSFDSLLEEF 541
++ + ++ F+ ++ N HTV F E +WE + P+ +L+ +
Sbjct: 16 LRAKLQGLLRFLRDTLSISNAHTVDFYTESVWEELVDLPPETVLAALR------------ 63
Query: 542 WKSRQDNHVETNSELVKFFQAADKFRLTSLMNGESIITVDELFSKLQARKCGQVVESAKV 601
KS + V SE +A GE+ +T F K+ ++V V
Sbjct: 64 -KSASEAEVRP-SEARPLMEA----------EGETDMTD---FPKIFCETSQKLV---SV 105
Query: 602 SQLMSEKKSYEVQVMSQVVAAVTNSCDSSHIIDLGSTITVDELFSKLQARKCGLVVESAK 661
K Y +Q +LG + ++L LQ + + ++ K
Sbjct: 106 EAFALAAKYYSIQ-------------------NLGICTSFEQLLVVLQGNQNQRIGKTLK 146
Query: 662 VSQLMSEKKSYEVQVMSQVVAAVTNSCDSSHIIDLGGGQGYLSTILALQHGKKTLSLDYN 721
+ M+ KKS+EVQ MS++++++ + +IDLG G+GYLS+ L+L++G K +D +
Sbjct: 147 AVEFMNMKKSHEVQAMSELISSIADYYGLKQVIDLGSGKGYLSSFLSLKYGLKVYGIDSS 206
Query: 722 QVNTHGAAVRSKKLE 736
NTHGA R++KL+
Sbjct: 207 NTNTHGAEERNRKLK 221
>gi|383409083|gb|AFH27755.1| hypothetical protein LOC84190 [Macaca mulatta]
Length = 603
Score = 150 bits (380), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 83/229 (36%), Positives = 136/229 (59%), Gaps = 15/229 (6%)
Query: 804 ITHFVTPDSDISSILSQAYPQDSLHNVCIMGLHTCGDLSGTALRLFT-KSSLQCLVQVGC 862
+T F+T DS++ I+ L + ++GLHTCGDL+ LR+FT KS ++ + VGC
Sbjct: 359 LTSFITADSELHDIIKD------LEDCLMVGLHTCGDLAPNTLRIFTSKSEIKGVCSVGC 412
Query: 863 CYHLLEEEFIRSPFWKDVDQSLYEHGYGFPLSEHLRSRKFFLGRNVRMSGTQSPERVIDL 922
CYHLL EEF ++ + + +GFP+ +L+ ++ GRN RMS + ERV
Sbjct: 413 CYHLLSEEF------ENQHKERTQEKWGFPMCHYLKEERWCCGRNARMSACLALERVAAG 466
Query: 923 KQTQTLPLFYRALLEKYLRSKITINDEEPKVVGRLATKCSNFVEYVHRAVDKLKLD-LEV 981
+ T LFYRA+L+ ++ I + + VG++ +KCS+F++YV R++ KL LD ++
Sbjct: 467 QGLPTESLFYRAVLQDIIKDCYGITKCD-RHVGKIYSKCSSFLDYVRRSLKKLGLDESKL 525
Query: 982 DDEEVTRLFNSHQREYEYLQIYYFLKTALAPVIEALIVLDRVLYLREQQ 1030
++ + + ++ L+ + LK LAP IE LI+LDR+ YL+EQ+
Sbjct: 526 PEKIIMNYYEKYKPRMNELEAFNMLKVVLAPCIETLILLDRLCYLKEQE 574
Score = 84.7 bits (208), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 65/256 (25%), Positives = 115/256 (44%), Gaps = 54/256 (21%)
Query: 485 IKNTIDEMIEFITPNIACVNTHTVHFMVEKLWETTI---PKAILSQVNNSPSFDSLLEEF 541
++ + +++F+ + N HTV F E +WE + P+ +L+ + S S
Sbjct: 16 LRAKLQGLLQFLRDALPISNAHTVDFYTESVWEELVDLTPETVLTALRKSAS-------- 67
Query: 542 WKSRQDNHVETN-SELVKFFQAADKFRLTSLMNGESIITVDELFSKLQARKCGQVVESAK 600
E SE + +A GE+ +T F K+ ++V
Sbjct: 68 -------EAEARPSETRRLVEA----------EGEAGVTD---FPKIFCETSQKLV---S 104
Query: 601 VSQLMSEKKSYEVQVMSQVVAAVTNSCDSSHIIDLGSTITVDELFSKLQARKCGLVVESA 660
V K Y +Q +LG ++L L+ + + E+
Sbjct: 105 VEAFALAAKYYSIQ-------------------NLGICTPFEQLLVALRGNQNQRIGETQ 145
Query: 661 KVSQLMSEKKSYEVQVMSQVVAAVTNSCDSSHIIDLGGGQGYLSTILALQHGKKTLSLDY 720
K + M+ KKS+EVQ MS++++++ + C +IDLG G+GYLS+ L+L++G K +D
Sbjct: 146 KAVEFMNMKKSHEVQAMSELISSIADYCGIKQVIDLGSGKGYLSSFLSLKYGFKVYGIDS 205
Query: 721 NQVNTHGAAVRSKKLE 736
+ NTHGA R++KL+
Sbjct: 206 SNTNTHGAEERNRKLK 221
>gi|114646075|ref|XP_509245.2| PREDICTED: uncharacterized protein C12orf26 homolog [Pan troglodytes]
gi|410330613|gb|JAA34253.1| chromosome 12 open reading frame 26 [Pan troglodytes]
Length = 603
Score = 150 bits (380), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 92/270 (34%), Positives = 151/270 (55%), Gaps = 30/270 (11%)
Query: 763 NRQTSHASVEHHGKNWKRKSKAPVVSIKDEELVVCKDKCKQITHFVTPDSDISSILSQAY 822
NR+TS A+ E K + S++ + S +T F+T DS++ I+
Sbjct: 333 NRETSEANKERR-KMTSKSSESNIYS--------------PLTSFITADSELHDIIKD-- 375
Query: 823 PQDSLHNVCIMGLHTCGDLSGTALRLFTKSS-LQCLVQVGCCYHLLEEEFIRSPFWKDVD 881
L + ++GLHTCGDL+ LR+FT +S ++ + VGCCYHLL EEF ++
Sbjct: 376 ----LEDCLMVGLHTCGDLAPNTLRIFTSNSEIKGVCSVGCCYHLLSEEF------ENQH 425
Query: 882 QSLYEHGYGFPLSEHLRSRKFFLGRNVRMSGTQSPERVIDLKQTQTLPLFYRALLEKYLR 941
+ + +GFP+ +L+ K+ GRN RMS + ERV + T LFYRA+L+ ++
Sbjct: 426 KERTQEKWGFPMCHYLKEEKWCCGRNARMSACLALERVAAGQGLPTESLFYRAVLQDIIK 485
Query: 942 SKITINDEEPKVVGRLATKCSNFVEYVHRAVDKLKLD-LEVDDEEVTRLFNSHQREYEYL 1000
I + + VG++ +KCS+F++YV R++ KL LD ++ ++ + + ++ L
Sbjct: 486 DCYGITKCD-RHVGKIYSKCSSFLDYVRRSLKKLGLDESKLPEKIIMNYYEKYKPRMNEL 544
Query: 1001 QIYYFLKTALAPVIEALIVLDRVLYLREQQ 1030
+ + LK LAP IE LI+LDR+ YL+EQ+
Sbjct: 545 EAFNMLKVVLAPCIETLILLDRLCYLKEQE 574
Score = 80.9 bits (198), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 63/255 (24%), Positives = 112/255 (43%), Gaps = 52/255 (20%)
Query: 485 IKNTIDEMIEFITPNIACVNTHTVHFMVEKLWETTI---PKAILSQVNNSPSFDSLLEEF 541
++ + +++F+ ++ N HTV F E +WE + P+ +L+ + S S L
Sbjct: 16 LRAKLQGLLQFLRDALSISNAHTVDFYTESVWEELVNLPPETVLAALRKSASETEALP-- 73
Query: 542 WKSRQDNHVETNSELVKFFQAADKFRLTSLMNGESIITVDELFSKLQARKCGQVVESAKV 601
SE +A + +T F K+ ++V V
Sbjct: 74 ------------SETRPLVEAEWEAGMTD-------------FPKIFCETSQKLV---SV 105
Query: 602 SQLMSEKKSYEVQVMSQVVAAVTNSCDSSHIIDLGSTITVDELFSKLQARKCGLVVESAK 661
K Y VQ +LG ++L L+ + + E+ K
Sbjct: 106 EAFALAAKYYSVQ-------------------NLGICTPFEQLLVALRGNQTQRIGENQK 146
Query: 662 VSQLMSEKKSYEVQVMSQVVAAVTNSCDSSHIIDLGGGQGYLSTILALQHGKKTLSLDYN 721
+ M+ KKS+EVQ MS++++++ + +IDLG G+GYLS+ L+L++G K +D +
Sbjct: 147 AVEFMNMKKSHEVQAMSELISSIADYYGIKQVIDLGSGKGYLSSFLSLKYGLKVYGIDSS 206
Query: 722 QVNTHGAAVRSKKLE 736
NTHGA R++KL+
Sbjct: 207 NTNTHGAEERNRKLK 221
>gi|452825586|gb|EME32582.1| RNA-metabolising metallo-beta-lactamase family protein [Galdieria
sulphuraria]
Length = 370
Score = 150 bits (379), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 81/210 (38%), Positives = 121/210 (57%), Gaps = 8/210 (3%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCR--PDLLITESTYATTIRDSKRCRERD 58
MF + V N I+YTGD+ P HL A +D PD++++E+TYAT+ +D + + +
Sbjct: 136 MFSIVVDNHRILYTGDFTSCPTFHLPPARVDDIPYPPDVILSEATYATSFKDGRLNNQVE 195
Query: 59 FLKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYK 118
F++ V +C+ GGKVL+PVFA+GRAQEL +LLE YW+R +L PI F+ + Y
Sbjct: 196 FIQNVLDCLLDGGKVLVPVFAIGRAQELLLLLEMYWQRFHLSFPILFSTKNAHQVLQIYT 255
Query: 119 MFITWT-NQKIRKTFVQRNMFDFKHIRPFD-KSFIDN----PGPMVVFATPGMLHSGLSL 172
F WT R + + ++ D + +D P+V TPG L GLSL
Sbjct: 256 EFAHWTRTPSTRDEQMMSYQTWWSRVQVVDPEQLLDAVEEWDRPLVALTTPGTLARGLSL 315
Query: 173 IIFKKWAPVENNMLIMPGFCVQGTIGHKVL 202
+F++ AP E N+LI+P FC+ GTI ++L
Sbjct: 316 QVFRRIAPDEKNLLIIPHFCISGTIEKRLL 345
>gi|332221017|ref|XP_003259654.1| PREDICTED: methyltransferase-like protein 25 [Nomascus leucogenys]
Length = 603
Score = 150 bits (379), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 82/229 (35%), Positives = 137/229 (59%), Gaps = 15/229 (6%)
Query: 804 ITHFVTPDSDISSILSQAYPQDSLHNVCIMGLHTCGDLSGTALRLFTKSS-LQCLVQVGC 862
+T F+T DS++ I+ + L + ++GLHTCGDL+ LR+FT +S ++ + VGC
Sbjct: 359 LTSFITADSELHDII------EDLEDCLMVGLHTCGDLAPNTLRIFTSNSEIKGVCSVGC 412
Query: 863 CYHLLEEEFIRSPFWKDVDQSLYEHGYGFPLSEHLRSRKFFLGRNVRMSGTQSPERVIDL 922
CYHLL EEF ++ + + +GFP+ +L+ ++ GRN RMS + ERV
Sbjct: 413 CYHLLSEEF------ENQHKERTQEKWGFPMCHYLKEERWCCGRNARMSACLALERVAAG 466
Query: 923 KQTQTLPLFYRALLEKYLRSKITINDEEPKVVGRLATKCSNFVEYVHRAVDKLKLD-LEV 981
+ T LFYRA+L+ ++ I + + VG++ +KCS+F++YV R++ KL LD ++
Sbjct: 467 QGLPTESLFYRAVLQDIIKDCYGITKCD-RHVGKIYSKCSSFLDYVRRSLKKLGLDESKL 525
Query: 982 DDEEVTRLFNSHQREYEYLQIYYFLKTALAPVIEALIVLDRVLYLREQQ 1030
++ + + ++ L+ + LK LAP IE LI+LDR+ YL+EQ+
Sbjct: 526 PEKIIMNYYEKYKPRMNELEAFNMLKVVLAPCIETLILLDRLCYLKEQE 574
Score = 80.1 bits (196), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 63/255 (24%), Positives = 113/255 (44%), Gaps = 52/255 (20%)
Query: 485 IKNTIDEMIEFITPNIACVNTHTVHFMVEKLWETTI---PKAILSQVNNSPSFDSLLEEF 541
++ + +++F+ ++ N HTV F E +WE + P+ +L+ + S S
Sbjct: 16 LRANLQGLLQFLRDALSISNAHTVDFYTESVWEELVDLPPETVLAALRKSAS-------- 67
Query: 542 WKSRQDNHVETNSELVKFFQAADKFRLTSLMNGESIITVDELFSKLQARKCGQVVESAKV 601
E + A+ GE+ +T F K+ ++V V
Sbjct: 68 -------EAEARPSETRPLVEAE---------GEAGMTD---FPKIFCETSQKLV---SV 105
Query: 602 SQLMSEKKSYEVQVMSQVVAAVTNSCDSSHIIDLGSTITVDELFSKLQARKCGLVVESAK 661
K Y VQ +LG ++L L+ + + E+ K
Sbjct: 106 EAFALAAKYYSVQ-------------------NLGICTPFEQLLVALRGNQNQRIGENQK 146
Query: 662 VSQLMSEKKSYEVQVMSQVVAAVTNSCDSSHIIDLGGGQGYLSTILALQHGKKTLSLDYN 721
+ M+ KKS+EVQ MS++++++ + +IDLG G+GYLS+ L+L++G K +D +
Sbjct: 147 AVEFMNMKKSHEVQAMSELISSIADYYGIKQVIDLGSGKGYLSSFLSLKYGLKVYGIDSS 206
Query: 722 QVNTHGAAVRSKKLE 736
NTHGA R++KL+
Sbjct: 207 NTNTHGAEERNRKLK 221
>gi|81884989|sp|Q6NXH8.1|MET25_MOUSE RecName: Full=Methyltransferase-like protein 25
gi|45219809|gb|AAH67068.1| CDNA sequence BC067068 [Mus musculus]
Length = 597
Score = 150 bits (378), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 86/228 (37%), Positives = 138/228 (60%), Gaps = 14/228 (6%)
Query: 804 ITHFVTPDSDISSILSQAYPQDSLHNVCIMGLHTCGDLSGTALRLFT-KSSLQCLVQVGC 862
+T F+T DS + I+ + L + ++GLHTCGDL+ + LR+FT K+ ++ + VGC
Sbjct: 349 LTSFITADSQLHDII------EDLEDCLMVGLHTCGDLAPSTLRIFTSKAEVKAVCSVGC 402
Query: 863 CYHLLEEEFIRSPFWKDVDQSLYEHGYGFPLSEHLRSRKFFLGRNVRMSGTQSPERVIDL 922
CYHLL EEF + D+ E+ +GFP+ +L+ ++ GRN RMS + +RV
Sbjct: 403 CYHLLSEEFEN----QHKDRCANEN-WGFPMCHYLKEERWCCGRNARMSACLALQRVAVG 457
Query: 923 KQTQTLPLFYRALLEKYLRSKITINDEEPKVVGRLATKCSNFVEYVHRAVDKLKLD-LEV 981
+ T LFYRA+L+ ++ I+ E + VG++ +KCS+F+EYV ++ KL LD +V
Sbjct: 458 QGLPTESLFYRAVLQNIIKDYYGISKCE-QHVGKIYSKCSSFLEYVRMSLKKLGLDESKV 516
Query: 982 DDEEVTRLFNSHQREYEYLQIYYFLKTALAPVIEALIVLDRVLYLREQ 1029
+E + + +++ L+ + LK LAP IE LI+LDR+ YL+EQ
Sbjct: 517 SEEIIMDYYENYKPRMNELEAFNMLKVVLAPCIETLILLDRLCYLKEQ 564
Score = 84.3 bits (207), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 42/117 (35%), Positives = 69/117 (58%)
Query: 620 VAAVTNSCDSSHIIDLGSTITVDELFSKLQARKCGLVVESAKVSQLMSEKKSYEVQVMSQ 679
V A + + +LG ++L + LQ K V E+ K + M+ KKS+EVQ MS+
Sbjct: 97 VEAFARTAKHYSVQNLGLCTPSEQLLTALQGNKRQRVDENVKAIEFMNTKKSHEVQAMSE 156
Query: 680 VVAAVTNSCDSSHIIDLGGGQGYLSTILALQHGKKTLSLDYNQVNTHGAAVRSKKLE 736
++ ++ + C IID+G G+GYLS+ L+L++G +D + NTHGA R++KL+
Sbjct: 157 LICSIADYCGLKQIIDVGSGKGYLSSFLSLKYGLNVYGIDSSNTNTHGAKERNRKLK 213
Score = 57.0 bits (136), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 41/168 (24%), Positives = 79/168 (47%), Gaps = 23/168 (13%)
Query: 485 IKNTIDEMIEFITPNIACVNTHTVHFMVEKLWETTI---PKAILSQVNNSP--------- 532
+ + + ++ F+ +A + HTV F + +W+ + P+++L+ + S
Sbjct: 16 LHDKLQGLLSFLRGALAISSAHTVDFYTKSVWQELVDLPPESVLAALRESAVEAEPREAE 75
Query: 533 ---SFDSLLEEFWKSRQDNHVETNSELVKFFQAADKFRLTSLMNGESIITVDELFSKLQA 589
F L + F ET+ +L+ A + S+ N ++L + LQ
Sbjct: 76 TGSGFTELPKIF--------CETSQKLLSVEAFARTAKHYSVQNLGLCTPSEQLLTALQG 127
Query: 590 RKCGQVVESAKVSQLMSEKKSYEVQVMSQVVAAVTNSCDSSHIIDLGS 637
K +V E+ K + M+ KKS+EVQ MS+++ ++ + C IID+GS
Sbjct: 128 NKRQRVDENVKAIEFMNTKKSHEVQAMSELICSIADYCGLKQIIDVGS 175
Score = 42.7 bits (99), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 19/51 (37%), Positives = 33/51 (64%)
Query: 1044 TQVDDEEVTRLFDSHQREYEYLQIYYFLKTALAPVIEALIVLDRVLYLREQ 1094
++V +E + +++++ L+ + LK LAP IE LI+LDR+ YL+EQ
Sbjct: 514 SKVSEEIIMDYYENYKPRMNELEAFNMLKVVLAPCIETLILLDRLCYLKEQ 564
>gi|283135209|ref|NP_997405.2| methyltransferase-like protein 25 [Mus musculus]
gi|74194739|dbj|BAE25973.1| unnamed protein product [Mus musculus]
Length = 600
Score = 150 bits (378), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 86/228 (37%), Positives = 138/228 (60%), Gaps = 14/228 (6%)
Query: 804 ITHFVTPDSDISSILSQAYPQDSLHNVCIMGLHTCGDLSGTALRLFT-KSSLQCLVQVGC 862
+T F+T DS + I+ + L + ++GLHTCGDL+ + LR+FT K+ ++ + VGC
Sbjct: 352 LTSFITADSQLHDII------EDLEDCLMVGLHTCGDLAPSTLRIFTSKAEVKAVCSVGC 405
Query: 863 CYHLLEEEFIRSPFWKDVDQSLYEHGYGFPLSEHLRSRKFFLGRNVRMSGTQSPERVIDL 922
CYHLL EEF + D+ E+ +GFP+ +L+ ++ GRN RMS + +RV
Sbjct: 406 CYHLLSEEFEN----QHKDRCANEN-WGFPMCHYLKEERWCCGRNARMSACLALQRVAVG 460
Query: 923 KQTQTLPLFYRALLEKYLRSKITINDEEPKVVGRLATKCSNFVEYVHRAVDKLKLD-LEV 981
+ T LFYRA+L+ ++ I+ E + VG++ +KCS+F+EYV ++ KL LD +V
Sbjct: 461 QGLPTESLFYRAVLQNIIKDYYGISKCE-QHVGKIYSKCSSFLEYVRMSLKKLGLDESKV 519
Query: 982 DDEEVTRLFNSHQREYEYLQIYYFLKTALAPVIEALIVLDRVLYLREQ 1029
+E + + +++ L+ + LK LAP IE LI+LDR+ YL+EQ
Sbjct: 520 SEEIIMDYYENYKPRMNELEAFNMLKVVLAPCIETLILLDRLCYLKEQ 567
Score = 84.3 bits (207), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 42/117 (35%), Positives = 69/117 (58%)
Query: 620 VAAVTNSCDSSHIIDLGSTITVDELFSKLQARKCGLVVESAKVSQLMSEKKSYEVQVMSQ 679
V A + + +LG ++L + LQ K V E+ K + M+ KKS+EVQ MS+
Sbjct: 100 VEAFARTAKHYSVQNLGLCTPSEQLLTALQGNKRQRVDENVKAIEFMNTKKSHEVQAMSE 159
Query: 680 VVAAVTNSCDSSHIIDLGGGQGYLSTILALQHGKKTLSLDYNQVNTHGAAVRSKKLE 736
++ ++ + C IID+G G+GYLS+ L+L++G +D + NTHGA R++KL+
Sbjct: 160 LICSIADYCGLKQIIDVGSGKGYLSSFLSLKYGLNVYGIDSSNTNTHGAKERNRKLK 216
Score = 57.0 bits (136), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 41/168 (24%), Positives = 79/168 (47%), Gaps = 23/168 (13%)
Query: 485 IKNTIDEMIEFITPNIACVNTHTVHFMVEKLWETTI---PKAILSQVNNSP--------- 532
+ + + ++ F+ +A + HTV F + +W+ + P+++L+ + S
Sbjct: 19 LHDKLQGLLSFLRGALAISSAHTVDFYTKSVWQELVDLPPESVLAALRESAVEAEPREAE 78
Query: 533 ---SFDSLLEEFWKSRQDNHVETNSELVKFFQAADKFRLTSLMNGESIITVDELFSKLQA 589
F L + F ET+ +L+ A + S+ N ++L + LQ
Sbjct: 79 TGSGFTELPKIF--------CETSQKLLSVEAFARTAKHYSVQNLGLCTPSEQLLTALQG 130
Query: 590 RKCGQVVESAKVSQLMSEKKSYEVQVMSQVVAAVTNSCDSSHIIDLGS 637
K +V E+ K + M+ KKS+EVQ MS+++ ++ + C IID+GS
Sbjct: 131 NKRQRVDENVKAIEFMNTKKSHEVQAMSELICSIADYCGLKQIIDVGS 178
Score = 42.7 bits (99), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 19/51 (37%), Positives = 33/51 (64%)
Query: 1044 TQVDDEEVTRLFDSHQREYEYLQIYYFLKTALAPVIEALIVLDRVLYLREQ 1094
++V +E + +++++ L+ + LK LAP IE LI+LDR+ YL+EQ
Sbjct: 517 SKVSEEIIMDYYENYKPRMNELEAFNMLKVVLAPCIETLILLDRLCYLKEQ 567
>gi|148689734|gb|EDL21681.1| cDNA sequence BC067068 [Mus musculus]
Length = 597
Score = 150 bits (378), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 86/228 (37%), Positives = 138/228 (60%), Gaps = 14/228 (6%)
Query: 804 ITHFVTPDSDISSILSQAYPQDSLHNVCIMGLHTCGDLSGTALRLFT-KSSLQCLVQVGC 862
+T F+T DS + I+ + L + ++GLHTCGDL+ + LR+FT K+ ++ + VGC
Sbjct: 349 LTSFITADSQLHDII------EDLEDCLMVGLHTCGDLAPSTLRIFTSKAEVKAVCSVGC 402
Query: 863 CYHLLEEEFIRSPFWKDVDQSLYEHGYGFPLSEHLRSRKFFLGRNVRMSGTQSPERVIDL 922
CYHLL EEF + D+ E+ +GFP+ +L+ ++ GRN RMS + +RV
Sbjct: 403 CYHLLSEEFEN----QHKDRCANEN-WGFPMCHYLKEERWCCGRNARMSACLALQRVAVG 457
Query: 923 KQTQTLPLFYRALLEKYLRSKITINDEEPKVVGRLATKCSNFVEYVHRAVDKLKLD-LEV 981
+ T LFYRA+L+ ++ I+ E + VG++ +KCS+F+EYV ++ KL LD +V
Sbjct: 458 QGLPTESLFYRAVLQNIIKDYYGISKCE-QHVGKIYSKCSSFLEYVRMSLKKLGLDESKV 516
Query: 982 DDEEVTRLFNSHQREYEYLQIYYFLKTALAPVIEALIVLDRVLYLREQ 1029
+E + + +++ L+ + LK LAP IE LI+LDR+ YL+EQ
Sbjct: 517 SEEIIMDYYENYKPRMNELEAFNMLKVVLAPCIETLILLDRLCYLKEQ 564
Score = 84.3 bits (207), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 42/117 (35%), Positives = 69/117 (58%)
Query: 620 VAAVTNSCDSSHIIDLGSTITVDELFSKLQARKCGLVVESAKVSQLMSEKKSYEVQVMSQ 679
V A + + +LG ++L + LQ K V E+ K + M+ KKS+EVQ MS+
Sbjct: 97 VEAFARTAKHYSVQNLGLCTPSEQLLTALQGNKRQRVDENVKAIEFMNTKKSHEVQAMSE 156
Query: 680 VVAAVTNSCDSSHIIDLGGGQGYLSTILALQHGKKTLSLDYNQVNTHGAAVRSKKLE 736
++ ++ + C IID+G G+GYLS+ L+L++G +D + NTHGA R++KL+
Sbjct: 157 LICSIADYCGLKQIIDVGSGKGYLSSFLSLKYGLNVYGIDSSNTNTHGAKERNRKLK 213
Score = 57.0 bits (136), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 41/168 (24%), Positives = 79/168 (47%), Gaps = 23/168 (13%)
Query: 485 IKNTIDEMIEFITPNIACVNTHTVHFMVEKLWETTI---PKAILSQVNNSP--------- 532
+ + + ++ F+ +A + HTV F + +W+ + P+++L+ + S
Sbjct: 16 LHDKLQGLLSFLRGALAISSAHTVDFYTKSVWQELVDLPPESVLAALRESAVEAEPREAE 75
Query: 533 ---SFDSLLEEFWKSRQDNHVETNSELVKFFQAADKFRLTSLMNGESIITVDELFSKLQA 589
F L + F ET+ +L+ A + S+ N ++L + LQ
Sbjct: 76 TGSGFTELPKIF--------CETSQKLLSVEAFARTAKHYSVQNLGLCTPSEQLLTALQG 127
Query: 590 RKCGQVVESAKVSQLMSEKKSYEVQVMSQVVAAVTNSCDSSHIIDLGS 637
K +V E+ K + M+ KKS+EVQ MS+++ ++ + C IID+GS
Sbjct: 128 NKRQRVDENVKAIEFMNTKKSHEVQAMSELICSIADYCGLKQIIDVGS 175
Score = 42.7 bits (99), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 19/51 (37%), Positives = 33/51 (64%)
Query: 1044 TQVDDEEVTRLFDSHQREYEYLQIYYFLKTALAPVIEALIVLDRVLYLREQ 1094
++V +E + +++++ L+ + LK LAP IE LI+LDR+ YL+EQ
Sbjct: 514 SKVSEEIIMDYYENYKPRMNELEAFNMLKVVLAPCIETLILLDRLCYLKEQ 564
>gi|255542245|ref|XP_002512186.1| Cleavage and polyadenylation specificity factor 73 kDa subunit,
putative [Ricinus communis]
gi|223548730|gb|EEF50220.1| Cleavage and polyadenylation specificity factor 73 kDa subunit,
putative [Ricinus communis]
Length = 361
Score = 150 bits (378), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 78/177 (44%), Positives = 105/177 (59%), Gaps = 9/177 (5%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V + ++YTGDY+ DRHL AA + + PD+ I ESTY + + RE+ F
Sbjct: 176 MFMVDIAGVRLLYTGDYSREEDRHLRAAEMPQFSPDICIIESTYGVQLHQPRHIREKRFT 235
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW----ERMNLQAPIYFAVGLTEKATNY 116
+H + +GG+VLIP FALGRAQEL ++L+ YW E N+ PIY+A L +K
Sbjct: 236 DVIHSTISQGGRVLIPAFALGRAQELLLILDEYWSNHPELHNV--PIYYASPLAKKCMTV 293
Query: 117 YKMFITWTNQKIRKTFVQRNMFDFKHIRPFD--KSFIDNPGPMVVFATPGMLHSGLS 171
Y+ +I N++IR F N F FKHI P + + F D GP VV A+PG L SGLS
Sbjct: 294 YQTYILSMNERIRNQFANSNPFKFKHISPLNSIEDFTDV-GPSVVMASPGGLQSGLS 349
>gi|343958716|dbj|BAK63213.1| hypothetical protein [Pan troglodytes]
gi|410212734|gb|JAA03586.1| chromosome 12 open reading frame 26 [Pan troglodytes]
gi|410294342|gb|JAA25771.1| chromosome 12 open reading frame 26 [Pan troglodytes]
Length = 603
Score = 150 bits (378), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 91/270 (33%), Positives = 151/270 (55%), Gaps = 30/270 (11%)
Query: 763 NRQTSHASVEHHGKNWKRKSKAPVVSIKDEELVVCKDKCKQITHFVTPDSDISSILSQAY 822
NR+TS A+ E K + S++ + S +T F+T DS++ I+
Sbjct: 333 NRETSEANKERR-KMTSKSSESNIYS--------------PLTSFITADSELHDIIKD-- 375
Query: 823 PQDSLHNVCIMGLHTCGDLSGTALRLFTKSS-LQCLVQVGCCYHLLEEEFIRSPFWKDVD 881
L + ++GLHTCGDL+ LR+FT +S ++ + VGCCYHLL EEF ++
Sbjct: 376 ----LEDCLMVGLHTCGDLAPNTLRIFTSNSEIKGVCSVGCCYHLLSEEF------ENQH 425
Query: 882 QSLYEHGYGFPLSEHLRSRKFFLGRNVRMSGTQSPERVIDLKQTQTLPLFYRALLEKYLR 941
+ + +GFP+ +L+ ++ GRN RMS + ERV + T LFYRA+L+ ++
Sbjct: 426 KERTQEKWGFPMCHYLKEERWCCGRNARMSACLALERVAAGQGLPTESLFYRAVLQDIIK 485
Query: 942 SKITINDEEPKVVGRLATKCSNFVEYVHRAVDKLKLD-LEVDDEEVTRLFNSHQREYEYL 1000
I + + VG++ +KCS+F++YV R++ KL LD ++ ++ + + ++ L
Sbjct: 486 DCYGITKCD-RHVGKIYSKCSSFLDYVRRSLKKLGLDESKLPEKIIMNYYEKYKPRMNEL 544
Query: 1001 QIYYFLKTALAPVIEALIVLDRVLYLREQQ 1030
+ + LK LAP IE LI+LDR+ YL+EQ+
Sbjct: 545 EAFNMLKVVLAPCIETLILLDRLCYLKEQE 574
Score = 80.9 bits (198), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 63/255 (24%), Positives = 112/255 (43%), Gaps = 52/255 (20%)
Query: 485 IKNTIDEMIEFITPNIACVNTHTVHFMVEKLWETTI---PKAILSQVNNSPSFDSLLEEF 541
++ + +++F+ ++ N HTV F E +WE + P+ +L+ + S S L
Sbjct: 16 LRAKLQGLLQFLRDALSISNAHTVDFYTESVWEELVNLPPETVLAALRKSASETEALP-- 73
Query: 542 WKSRQDNHVETNSELVKFFQAADKFRLTSLMNGESIITVDELFSKLQARKCGQVVESAKV 601
SE +A + +T F K+ ++V V
Sbjct: 74 ------------SETRPLVEAEWEAGMTD-------------FPKIFCETSQKLV---SV 105
Query: 602 SQLMSEKKSYEVQVMSQVVAAVTNSCDSSHIIDLGSTITVDELFSKLQARKCGLVVESAK 661
K Y VQ +LG ++L L+ + + E+ K
Sbjct: 106 EAFALAAKYYSVQ-------------------NLGICTPFEQLLVALRGNQTQRIGENQK 146
Query: 662 VSQLMSEKKSYEVQVMSQVVAAVTNSCDSSHIIDLGGGQGYLSTILALQHGKKTLSLDYN 721
+ M+ KKS+EVQ MS++++++ + +IDLG G+GYLS+ L+L++G K +D +
Sbjct: 147 AVEFMNMKKSHEVQAMSELISSIADYYGIKQVIDLGSGKGYLSSFLSLKYGLKVYGIDSS 206
Query: 722 QVNTHGAAVRSKKLE 736
NTHGA R++KL+
Sbjct: 207 NTNTHGAEERNRKLK 221
>gi|283046842|ref|NP_115606.2| methyltransferase-like protein 25 [Homo sapiens]
gi|317373346|sp|Q8N6Q8.2|MET25_HUMAN RecName: Full=Methyltransferase-like protein 25
Length = 603
Score = 150 bits (378), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 91/270 (33%), Positives = 151/270 (55%), Gaps = 30/270 (11%)
Query: 763 NRQTSHASVEHHGKNWKRKSKAPVVSIKDEELVVCKDKCKQITHFVTPDSDISSILSQAY 822
NR+TS A+ E K + S++ + S +T F+T DS++ I+
Sbjct: 333 NRETSEANKERR-KMTSKSSESNIYS--------------PLTSFITADSELHDIIKD-- 375
Query: 823 PQDSLHNVCIMGLHTCGDLSGTALRLFTKSS-LQCLVQVGCCYHLLEEEFIRSPFWKDVD 881
L + ++GLHTCGDL+ LR+FT +S ++ + VGCCYHLL EEF ++
Sbjct: 376 ----LEDCLMVGLHTCGDLAPNTLRIFTSNSEIKGVCSVGCCYHLLSEEF------ENQH 425
Query: 882 QSLYEHGYGFPLSEHLRSRKFFLGRNVRMSGTQSPERVIDLKQTQTLPLFYRALLEKYLR 941
+ + +GFP+ +L+ ++ GRN RMS + ERV + T LFYRA+L+ ++
Sbjct: 426 KERTQEKWGFPMCHYLKEERWCCGRNARMSACLALERVAAGQGLPTESLFYRAVLQDIIK 485
Query: 942 SKITINDEEPKVVGRLATKCSNFVEYVHRAVDKLKLD-LEVDDEEVTRLFNSHQREYEYL 1000
I + + VG++ +KCS+F++YV R++ KL LD ++ ++ + + ++ L
Sbjct: 486 DCYGITKCD-RHVGKIYSKCSSFLDYVRRSLKKLGLDESKLPEKIIMNYYEKYKPRMNEL 544
Query: 1001 QIYYFLKTALAPVIEALIVLDRVLYLREQQ 1030
+ + LK LAP IE LI+LDR+ YL+EQ+
Sbjct: 545 EAFNMLKVVLAPCIETLILLDRLCYLKEQE 574
Score = 80.1 bits (196), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 63/255 (24%), Positives = 112/255 (43%), Gaps = 52/255 (20%)
Query: 485 IKNTIDEMIEFITPNIACVNTHTVHFMVEKLWETTI---PKAILSQVNNSPSFDSLLEEF 541
++ + +++F+ ++ N HTV F E +WE + P+ +L+ + S S L
Sbjct: 16 LRAKLQGLLQFLRDALSISNAHTVDFYTESVWEELVDLPPETVLAALRKSASETEALP-- 73
Query: 542 WKSRQDNHVETNSELVKFFQAADKFRLTSLMNGESIITVDELFSKLQARKCGQVVESAKV 601
SE +A + +T F K+ ++V V
Sbjct: 74 ------------SETRPLVEAEWEAGMTD-------------FPKIFCETSQKLV---SV 105
Query: 602 SQLMSEKKSYEVQVMSQVVAAVTNSCDSSHIIDLGSTITVDELFSKLQARKCGLVVESAK 661
K Y VQ +LG ++L L+ + + E+ K
Sbjct: 106 EAFALAAKYYSVQ-------------------NLGICTPFEQLLVALRGNQNQRIGENQK 146
Query: 662 VSQLMSEKKSYEVQVMSQVVAAVTNSCDSSHIIDLGGGQGYLSTILALQHGKKTLSLDYN 721
+ M+ KKS+EVQ MS++++++ + +IDLG G+GYLS+ L+L++G K +D +
Sbjct: 147 AVEFMNMKKSHEVQAMSELISSIADYYGIKQVIDLGSGKGYLSSFLSLKYGLKVYGIDSS 206
Query: 722 QVNTHGAAVRSKKLE 736
NTHGA R++KL+
Sbjct: 207 NTNTHGAEERNRKLK 221
>gi|20810107|gb|AAH29120.1| Chromosome 12 open reading frame 26 [Homo sapiens]
gi|119617787|gb|EAW97381.1| chromosome 12 open reading frame 26, isoform CRA_a [Homo sapiens]
Length = 603
Score = 149 bits (377), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 91/270 (33%), Positives = 151/270 (55%), Gaps = 30/270 (11%)
Query: 763 NRQTSHASVEHHGKNWKRKSKAPVVSIKDEELVVCKDKCKQITHFVTPDSDISSILSQAY 822
NR+TS A+ E K + S++ + S +T F+T DS++ I+
Sbjct: 333 NRETSEANKERR-KMTSKSSESNIYS--------------PLTSFITADSELHDIIKD-- 375
Query: 823 PQDSLHNVCIMGLHTCGDLSGTALRLFTKSS-LQCLVQVGCCYHLLEEEFIRSPFWKDVD 881
L + ++GLHTCGDL+ LR+FT +S ++ + VGCCYHLL EEF ++
Sbjct: 376 ----LEDCLMVGLHTCGDLAPNTLRIFTSNSEIKGVCSVGCCYHLLSEEF------ENQH 425
Query: 882 QSLYEHGYGFPLSEHLRSRKFFLGRNVRMSGTQSPERVIDLKQTQTLPLFYRALLEKYLR 941
+ + +GFP+ +L+ ++ GRN RMS + ERV + T LFYRA+L+ ++
Sbjct: 426 KERTQEKWGFPMCHYLKEERWCCGRNARMSACLALERVAAGQGLPTESLFYRAVLQDIIK 485
Query: 942 SKITINDEEPKVVGRLATKCSNFVEYVHRAVDKLKLD-LEVDDEEVTRLFNSHQREYEYL 1000
I + + VG++ +KCS+F++YV R++ KL LD ++ ++ + + ++ L
Sbjct: 486 DCYGITKCD-RHVGKIYSKCSSFLDYVRRSLKKLGLDESKLPEKIIMNYYEKYKPRMNEL 544
Query: 1001 QIYYFLKTALAPVIEALIVLDRVLYLREQQ 1030
+ + LK LAP IE LI+LDR+ YL+EQ+
Sbjct: 545 EAFNMLKVVLAPCIETLILLDRLCYLKEQE 574
Score = 80.1 bits (196), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 63/255 (24%), Positives = 112/255 (43%), Gaps = 52/255 (20%)
Query: 485 IKNTIDEMIEFITPNIACVNTHTVHFMVEKLWETTI---PKAILSQVNNSPSFDSLLEEF 541
++ + +++F+ ++ N HTV F E +WE + P+ +L+ + S S L
Sbjct: 16 LRAKLQGLLQFLRDALSISNAHTVDFYTESVWEELVDLPPETVLAALRKSASETEALP-- 73
Query: 542 WKSRQDNHVETNSELVKFFQAADKFRLTSLMNGESIITVDELFSKLQARKCGQVVESAKV 601
SE +A + +T F K+ ++V V
Sbjct: 74 ------------SETRPLVEAEWEAGMTD-------------FPKIFCETSQKLV---SV 105
Query: 602 SQLMSEKKSYEVQVMSQVVAAVTNSCDSSHIIDLGSTITVDELFSKLQARKCGLVVESAK 661
K Y VQ +LG ++L L+ + + E+ K
Sbjct: 106 EAFALAAKYYSVQ-------------------NLGICTPFEQLLVALRGNQNQRIGENQK 146
Query: 662 VSQLMSEKKSYEVQVMSQVVAAVTNSCDSSHIIDLGGGQGYLSTILALQHGKKTLSLDYN 721
+ M+ KKS+EVQ MS++++++ + +IDLG G+GYLS+ L+L++G K +D +
Sbjct: 147 AVEFMNMKKSHEVQAMSELISSIADYYGIKQVIDLGSGKGYLSSFLSLKYGLKVYGIDSS 206
Query: 722 QVNTHGAAVRSKKLE 736
NTHGA R++KL+
Sbjct: 207 NTNTHGAEERNRKLK 221
>gi|397480888|ref|XP_003811696.1| PREDICTED: uncharacterized protein C12orf26 homolog [Pan paniscus]
Length = 603
Score = 149 bits (377), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 82/229 (35%), Positives = 136/229 (59%), Gaps = 15/229 (6%)
Query: 804 ITHFVTPDSDISSILSQAYPQDSLHNVCIMGLHTCGDLSGTALRLFTKSS-LQCLVQVGC 862
+T F+T DS++ I+ L + ++GLHTCGDL+ LR+FT +S ++ + VGC
Sbjct: 359 LTSFITADSELHDIIKD------LEDCLMVGLHTCGDLAPNTLRIFTSNSEIKGVCSVGC 412
Query: 863 CYHLLEEEFIRSPFWKDVDQSLYEHGYGFPLSEHLRSRKFFLGRNVRMSGTQSPERVIDL 922
CYHLL EEF ++ + + +GFP+ +L+ ++ GRN RMS + ERV
Sbjct: 413 CYHLLSEEF------ENQHKERTQEKWGFPMCHYLKEARWCCGRNARMSACLALERVAAG 466
Query: 923 KQTQTLPLFYRALLEKYLRSKITINDEEPKVVGRLATKCSNFVEYVHRAVDKLKLD-LEV 981
+ T LFYRA+L+ ++ I + + VG++ +KCS+F++YV R++ KL LD ++
Sbjct: 467 QGLPTESLFYRAVLQDIIKDCYGITKCD-RHVGKIYSKCSSFLDYVRRSLKKLGLDESKL 525
Query: 982 DDEEVTRLFNSHQREYEYLQIYYFLKTALAPVIEALIVLDRVLYLREQQ 1030
++ + + ++ L+ + LK LAP IE LI+LDR+ YL+EQ+
Sbjct: 526 PEKIIMNYYEKYKPRMNELEAFNMLKVVLAPCIETLILLDRLCYLKEQE 574
Score = 80.1 bits (196), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 63/255 (24%), Positives = 112/255 (43%), Gaps = 52/255 (20%)
Query: 485 IKNTIDEMIEFITPNIACVNTHTVHFMVEKLWETTI---PKAILSQVNNSPSFDSLLEEF 541
++ + +++F+ ++ N HTV F E +WE + P+ +L+ + S S L
Sbjct: 16 LRAKLQGLLQFLRDALSISNAHTVDFYTESVWEELVNLPPETVLAALRKSASETEALP-- 73
Query: 542 WKSRQDNHVETNSELVKFFQAADKFRLTSLMNGESIITVDELFSKLQARKCGQVVESAKV 601
SE +A + +T F K+ ++V V
Sbjct: 74 ------------SETRPLVEAEWEAGMTD-------------FPKIFCETSQKLV---SV 105
Query: 602 SQLMSEKKSYEVQVMSQVVAAVTNSCDSSHIIDLGSTITVDELFSKLQARKCGLVVESAK 661
K Y VQ +LG ++L L+ + + E+ K
Sbjct: 106 EAFALAAKYYSVQ-------------------NLGICTPFEQLLVALRGNQNQRIGENQK 146
Query: 662 VSQLMSEKKSYEVQVMSQVVAAVTNSCDSSHIIDLGGGQGYLSTILALQHGKKTLSLDYN 721
+ M+ KKS+EVQ MS++++++ + +IDLG G+GYLS+ L+L++G K +D +
Sbjct: 147 AVEFMNMKKSHEVQAMSELISSIADYYGIKQVIDLGSGKGYLSSFLSLKYGLKVYGIDSS 206
Query: 722 QVNTHGAAVRSKKLE 736
NTHGA R++KL+
Sbjct: 207 NTNTHGAEERNRKLK 221
>gi|395820136|ref|XP_003783431.1| PREDICTED: uncharacterized protein C12orf26 homolog [Otolemur
garnettii]
Length = 609
Score = 148 bits (374), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 85/231 (36%), Positives = 136/231 (58%), Gaps = 19/231 (8%)
Query: 804 ITHFVTPDSDISSILSQAYPQDSLHNVCIMGLHTCGDLSGTALRLFT-KSSLQCLVQVGC 862
+T F+T DS++ I+ L + ++GLHTCGDL+ LR+FT KS ++ + VGC
Sbjct: 365 LTSFITADSELHDIIK------DLEDCLMVGLHTCGDLAPNTLRIFTSKSEIKGVCSVGC 418
Query: 863 CYHLLEEEFIRSPFWKDVDQSLYEHGYGFPLSEHLRSRKFFLGRNVRMSGTQSPERVIDL 922
CYHLL EEF + + +GFP+ +L+ K+ GRN RMS + ERV
Sbjct: 419 CYHLLSEEF------ETQTTECTQEKWGFPMCHYLKEEKWCCGRNARMSACLALERVTVG 472
Query: 923 KQTQTLPLFYRALLEKYLRS--KITINDEEPKVVGRLATKCSNFVEYVHRAVDKLKLD-L 979
+ T LFYRA+L+ ++ IT D+ VG++ +KCS+F++YV +++ KL+LD
Sbjct: 473 QGLPTESLFYRAVLQDIIKDCYGITKCDQH---VGKIYSKCSSFLDYVRKSLVKLQLDES 529
Query: 980 EVDDEEVTRLFNSHQREYEYLQIYYFLKTALAPVIEALIVLDRVLYLREQQ 1030
++ ++ + + ++ L+ + LK LAP IE LI+LDR+ YL+EQ+
Sbjct: 530 KLPEKLIMNYYEMYKPRMNELEAFNMLKVVLAPCIETLILLDRLCYLKEQE 580
Score = 76.6 bits (187), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 59/255 (23%), Positives = 110/255 (43%), Gaps = 52/255 (20%)
Query: 485 IKNTIDEMIEFITPNIACVNTHTVHFMVEKLWETTI---PKAILSQVNNSPSFDSLLEEF 541
++ + ++ F+ ++ N HTV F E +WE + P+ +L+ + S +
Sbjct: 16 LRAKLQGLLGFLRDALSISNAHTVDFYTESVWEELVDLPPETVLTVLRRSAA-------- 67
Query: 542 WKSRQDNHVETNSELVKFFQAADKFRLTSLMNGESIITVDELFSKLQARKCGQVVESAKV 601
ET + + ++ L G + + ++F C + V
Sbjct: 68 ---------ETEA------RPSESRPLVEAEKGSGMANLPKIF-------CETSQKLVNV 105
Query: 602 SQLMSEKKSYEVQVMSQVVAAVTNSCDSSHIIDLGSTITVDELFSKLQARKCGLVVESAK 661
K Y VQ ++G ++L LQ + E+ +
Sbjct: 106 EAFALAAKYYSVQ-------------------NMGICTPFEQLLVALQGNEKQRSGETVR 146
Query: 662 VSQLMSEKKSYEVQVMSQVVAAVTNSCDSSHIIDLGGGQGYLSTILALQHGKKTLSLDYN 721
+ M+ KKS+EVQ MS++++++ +IDLG G+GYLS+ L+L++G K +D +
Sbjct: 147 PVEFMNMKKSHEVQAMSELISSIAAYYGVKQVIDLGSGKGYLSSFLSLKYGLKVYGIDSS 206
Query: 722 QVNTHGAAVRSKKLE 736
NTHGA R++KL+
Sbjct: 207 NTNTHGAEKRNRKLK 221
>gi|403272162|ref|XP_003927949.1| PREDICTED: uncharacterized protein C12orf26 homolog [Saimiri
boliviensis boliviensis]
Length = 640
Score = 148 bits (373), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 83/231 (35%), Positives = 137/231 (59%), Gaps = 19/231 (8%)
Query: 804 ITHFVTPDSDISSILSQAYPQDSLHNVCIMGLHTCGDLSGTALRLF-TKSSLQCLVQVGC 862
+T F+T DS++ I+ + L + ++GLHTCGDL+ LR+F + S ++ + VGC
Sbjct: 396 LTSFITADSELHDII------EDLEDCLMVGLHTCGDLAPNTLRIFASNSEIKGVCSVGC 449
Query: 863 CYHLLEEEFIRSPFWKDVDQSLYEHGYGFPLSEHLRSRKFFLGRNVRMSGTQSPERVIDL 922
CYHLL EEF ++ + + +GFP+ +L+ ++ GRN RMS + ERV
Sbjct: 450 CYHLLSEEF------ENQHKECTQKKWGFPMCRYLKEERWCCGRNARMSACLALERVAVG 503
Query: 923 KQTQTLPLFYRALLEKYLRS--KITINDEEPKVVGRLATKCSNFVEYVHRAVDKLKLD-L 979
+ T LFYRA+L+ ++ IT D+ VG++ +KCS+F++YV R++ KL LD
Sbjct: 504 QGLPTESLFYRAVLQDIIKDCYGITKCDQH---VGKIYSKCSSFLDYVRRSLKKLGLDES 560
Query: 980 EVDDEEVTRLFNSHQREYEYLQIYYFLKTALAPVIEALIVLDRVLYLREQQ 1030
++ ++ + + ++ L+ + LK LAP IE LI+LDR+ YL+EQ+
Sbjct: 561 KLPEKIIMNYYEKYKPRMNELEAFNMLKVVLAPCIETLILLDRLCYLKEQE 611
Score = 75.5 bits (184), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 33/76 (43%), Positives = 53/76 (69%)
Query: 661 KVSQLMSEKKSYEVQVMSQVVAAVTNSCDSSHIIDLGGGQGYLSTILALQHGKKTLSLDY 720
K ++ M+ KKS+EVQ MS+++ ++ N +IDLG G+GYLS+ L+L++G K +D
Sbjct: 183 KAAEFMNMKKSHEVQAMSELIGSIANYYGLKQVIDLGSGKGYLSSFLSLKYGLKVYGIDS 242
Query: 721 NQVNTHGAAVRSKKLE 736
+ NTHGA R++KL+
Sbjct: 243 SNTNTHGAEERNRKLK 258
>gi|410918877|ref|XP_003972911.1| PREDICTED: uncharacterized protein C12orf26 homolog [Takifugu
rubripes]
Length = 595
Score = 147 bits (371), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 87/229 (37%), Positives = 129/229 (56%), Gaps = 15/229 (6%)
Query: 804 ITHFVTPDSDISSILSQAYPQDSLHNVCIMGLHTCGDLSGTALRLF-TKSSLQCLVQVGC 862
+T VT ++++ ++ + L ++GLHTCGDL+ + LR+F K L + VGC
Sbjct: 343 LTSHVTAETELGELIQE------LEEAVMVGLHTCGDLAPSTLRMFVAKPELAAVCSVGC 396
Query: 863 CYHLLEEEFIRSPFWKDVDQSLYEHGYGFPLSEHLRSRKFFLGRNVRMSGTQSPERVIDL 922
CYHLL E+F S Q + GFPLS++LR F GRN RMS + ERV
Sbjct: 397 CYHLLSEQFGPS------GQECLQGACGFPLSQYLRQNSCFCGRNARMSACLALERVSLG 450
Query: 923 KQTQTLPLFYRALLEKYLRSKITINDEEPKVVGRLATKCSNFVEYVHRAVDKLKLD-LEV 981
+ Q LF+RA+L LR + E K VG + +K +FVEYV A+ +L+LD ++
Sbjct: 451 RGIQMESLFFRAVLHVILRERYDFVKSE-KRVGNVYSKTKSFVEYVRGALRRLELDESKL 509
Query: 982 DDEEVTRLFNSHQREYEYLQIYYFLKTALAPVIEALIVLDRVLYLREQQ 1030
D ++ +++ + + + LK ALAP IE LI+LDR+ YLREQ+
Sbjct: 510 SDGDIQGYHDAYAPRMDEMHAFNMLKVALAPCIEGLILLDRLCYLREQE 558
Score = 79.7 bits (195), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 70/264 (26%), Positives = 114/264 (43%), Gaps = 64/264 (24%)
Query: 485 IKNTIDEMIEFITPNIACVNTHTVHFMVEKLWETTI---PKAILSQVNN------SPSF- 534
I IDE+ F++ ++ N HTV F +W + P+ +L V++ +P
Sbjct: 11 IHRRIDEVKRFLSITLSIANAHTVEFYTHDVWSRFMAVEPQEVLRAVSSHNDRQRAPEHK 70
Query: 535 --DSLLEEFWKSRQDNHVETNSELVKFFQAADKFRLTSLMNGESIITVDELFSKLQARKC 592
++L F N + EL+ QAA L L + ++ DEL L
Sbjct: 71 TKENLATTFGFCNDTNRLVDAQELL---QAATTHSLPGL---QVCMSRDELLLALG---- 120
Query: 593 GQVVESAKVSQLMSEKKSYEVQVMSQVVAAVTNSCDSSHIIDLGSTITVDELFSKLQARK 652
G+ ++ K ++ ++NS G+ + DE
Sbjct: 121 GKTGDAGKGQGSFGKRH------------VLSNS---------GAELVPDEF-------- 151
Query: 653 CGLVVESAKVSQLMSEKKSYEVQVMSQVVAAVTNSCDSSHIIDLGGGQGYLSTILALQHG 712
M+ KKS+EVQ MS+VVA + C +ID+G G+GYLS+ L+LQ+G
Sbjct: 152 -------------MNSKKSHEVQSMSEVVAGLAQRCGVKQVIDVGSGKGYLSSFLSLQYG 198
Query: 713 KKTLSLDYNQVNTHGAAVRSKKLE 736
+ +D + NTHGA R++KL+
Sbjct: 199 LRVYGIDSSTTNTHGAQERNRKLK 222
Score = 46.6 bits (109), Expect = 0.079, Method: Compositional matrix adjust.
Identities = 20/52 (38%), Positives = 33/52 (63%)
Query: 1044 TQVDDEEVTRLFDSHQREYEYLQIYYFLKTALAPVIEALIVLDRVLYLREQQ 1095
+++ D ++ D++ + + + LK ALAP IE LI+LDR+ YLREQ+
Sbjct: 507 SKLSDGDIQGYHDAYAPRMDEMHAFNMLKVALAPCIEGLILLDRLCYLREQE 558
>gi|431892090|gb|ELK02537.1| hypothetical protein PAL_GLEAN10022432 [Pteropus alecto]
Length = 724
Score = 146 bits (368), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 82/229 (35%), Positives = 134/229 (58%), Gaps = 15/229 (6%)
Query: 804 ITHFVTPDSDISSILSQAYPQDSLHNVCIMGLHTCGDLSGTALRLFT-KSSLQCLVQVGC 862
+T F+T DS++ I+ L + ++GLHTCGDL+ LR+FT KS ++ + VGC
Sbjct: 480 LTSFITADSELHDIIKD------LEDCLMVGLHTCGDLASNTLRIFTSKSEIKGVCSVGC 533
Query: 863 CYHLLEEEFIRSPFWKDVDQSLYEHGYGFPLSEHLRSRKFFLGRNVRMSGTQSPERVIDL 922
CYHLL EEF + Q + +GFP+ +L+ ++ GRN RMS + ERV
Sbjct: 534 CYHLLSEEF------EKPHQECIQEKWGFPMCHYLKEERWCCGRNARMSACLALERVAVG 587
Query: 923 KQTQTLPLFYRALLEKYLRSKITINDEEPKVVGRLATKCSNFVEYVHRAVDKLKLD-LEV 981
+ T LFYRA+L+ ++ I + + VG++ +K S+F++YV +++ KL LD ++
Sbjct: 588 QGLPTESLFYRAVLQDIIKDCYGITKCD-RHVGKIYSKSSSFLDYVRKSLKKLGLDESKL 646
Query: 982 DDEEVTRLFNSHQREYEYLQIYYFLKTALAPVIEALIVLDRVLYLREQQ 1030
++ + + ++ L+ + LK LAP IE LI+LDR+ YL+EQ+
Sbjct: 647 PEKIIMDYYEKYKPRMNELEAFNMLKVVLAPCIETLILLDRLCYLKEQE 695
Score = 75.9 bits (185), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 63/262 (24%), Positives = 116/262 (44%), Gaps = 31/262 (11%)
Query: 480 ETMELIKNTIDEMIEFITPNIACVNTHTVHFMVEKLWETTI---PKAILSQVNNSPSFDS 536
+ + ++ + ++ F+ ++ N HTV F E +WE I P+ +L+ +
Sbjct: 108 QDLSTLRAKLQGLLRFLREALSISNAHTVDFYTESVWEQLIDLPPETVLAVLR------- 160
Query: 537 LLEEFWKSRQDNHVETNSELVKFFQAADKFRLTSLMNGESIITVDELFSKLQARKCGQVV 596
R E + A++ +GE + E + G
Sbjct: 161 --------RSTEEAEARPGAARPLVEAER------GSGEWRVGGAEREAGPSVPSSGNRA 206
Query: 597 ESA--KVSQLMSEKKSYEVQVMSQVVAAVTNSCDSSHIIDLGSTITVDELFSKLQARKCG 654
ES Q+ V V + +AA + C I +LG ++L L+ +
Sbjct: 207 ESGITDFPQIFCGTSQKLVSVEAFALAA-KHYC----IQNLGICTPFEQLLISLRGNQKQ 261
Query: 655 LVVESAKVSQLMSEKKSYEVQVMSQVVAAVTNSCDSSHIIDLGGGQGYLSTILALQHGKK 714
++ K + M+ KKS+EVQ MS++++++ + +IDLG G+GYLS+ L+L++ K
Sbjct: 262 KTGKNVKPDEFMNLKKSHEVQAMSELISSIADYYGIKQVIDLGSGKGYLSSFLSLKYELK 321
Query: 715 TLSLDYNQVNTHGAAVRSKKLE 736
+D + NTHGA R++KL+
Sbjct: 322 VYGIDSSNTNTHGAQERNRKLK 343
>gi|170290968|ref|YP_001737784.1| beta-lactamase domain-containing protein [Candidatus Korarchaeum
cryptofilum OPF8]
gi|170175048|gb|ACB08101.1| beta-lactamase domain protein [Candidatus Korarchaeum cryptofilum
OPF8]
Length = 432
Score = 145 bits (365), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 87/284 (30%), Positives = 145/284 (51%), Gaps = 11/284 (3%)
Query: 6 VGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATT--IRDSKRCRERDFLKKV 63
V SI+YTGD N R L A + + D LI E TY I S++ ER F++ +
Sbjct: 146 VDGPSILYTGDINTAGTRTLRGAETELPKVDYLIIEGTYGGDDDIHPSRKKVERQFIEDI 205
Query: 64 HECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMFITW 123
+ GG +IP FALGRAQE+ + L ++ E LQ F G+ + + YY + +W
Sbjct: 206 RNVIASGGVTIIPTFALGRAQEVLLTLISHMESGVLQEVPIFVDGMIREISKYYNAYWSW 265
Query: 124 TNQKIRKTF--VQRNMFDFKHIRPF--DKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWA 179
++++ +R +FD + I + ++ P ++ T GML G L K +
Sbjct: 266 LRPELQRMIRESKRGLFDHRAIEEVRNREELLEISEPFIIVTTSGMLQGGPVLTYLKHFG 325
Query: 180 PVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQL 239
N++ + G+ V+GT G +L G++++ + II+VK +++ FSAHAD ++
Sbjct: 326 TKRGNLIYLTGYQVRGTRGRMLLDGIRQIPMPDG-IIEVKSDVKFADFSAHADQPNLINF 384
Query: 240 IQYCEP---KNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGE 280
I K V+LVHGE K++ L+ K++ + ++P GE
Sbjct: 385 ITKVAGRGLKEVILVHGEPDKLIQLRRKLEAR-GIRAYIPYEGE 427
>gi|92096541|gb|AAI15314.1| Zgc:136889 [Danio rerio]
Length = 589
Score = 144 bits (364), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 84/229 (36%), Positives = 131/229 (57%), Gaps = 14/229 (6%)
Query: 804 ITHFVTPDSDISSILSQAYPQDSLHNVCIMGLHTCGDLSGTALRLF-TKSSLQCLVQVGC 862
+T +VT ++++ +++++ L N ++GLHTCGDL+ ++LR+F K L+ + VGC
Sbjct: 343 LTSYVTAETELKTLITE------LENAVLVGLHTCGDLASSSLRMFRAKQELRAVCSVGC 396
Query: 863 CYHLLEEEFIRSPFWKDVDQSLYEHGYGFPLSEHLRSRKFFLGRNVRMSGTQSPERVIDL 922
CYHLL EEF +D + GFP+S++L+ + F GRN RMS + ERV
Sbjct: 397 CYHLLSEEFD-----QDRPGCTSDGVCGFPMSQYLKDQACFCGRNARMSACLALERVSAG 451
Query: 923 KQTQTLPLFYRALLEKYLRSKITINDEEPKVVGRLATKCSNFVEYVHRAVDKLKLD-LEV 981
LFYRA+L L+ E K VG + +K ++FV+YV RA+ KL LD ++
Sbjct: 452 GGLPMESLFYRAVLHVILQDHYDCFKSE-KRVGNVYSKATSFVDYVRRALRKLDLDESKL 510
Query: 982 DDEEVTRLFNSHQREYEYLQIYYFLKTALAPVIEALIVLDRVLYLREQQ 1030
D ++ + + + + LK LAP IE LI+LDR+ YL+EQ+
Sbjct: 511 SDGDIQSYHDRYTARMNEMVAFNMLKVTLAPCIEGLILLDRLCYLKEQE 559
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 40/123 (32%), Positives = 71/123 (57%), Gaps = 7/123 (5%)
Query: 620 VAAVTNSCDSSHIIDLGSTITVDELFSKLQARKCGLVV---ESAKV---SQLMSEKKSYE 673
++A+ + + + LG + ++EL L+ CG S KV + M+ KK++E
Sbjct: 87 ISALLRAAKAHCLPGLGVCMQLEELMHNLRL-MCGDTAAGQSSNKVLAPDEFMNWKKAHE 145
Query: 674 VQVMSQVVAAVTNSCDSSHIIDLGGGQGYLSTILALQHGKKTLSLDYNQVNTHGAAVRSK 733
VQ MS+VVA++ C +ID+G G+GYL + L++++ + +D + NTHGA R++
Sbjct: 146 VQSMSEVVASLAKCCRVKQVIDVGSGKGYLCSYLSMRYNLQVFGIDSSTTNTHGAQERNR 205
Query: 734 KLE 736
KL+
Sbjct: 206 KLK 208
Score = 56.6 bits (135), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 48/164 (29%), Positives = 79/164 (48%), Gaps = 15/164 (9%)
Query: 485 IKNTIDEMIEFITPNIACVNTHTVHFMVEKLWETTI---PKAILSQVNNSPSFDSLLEEF 541
+K IDE+ F++ +++ N HTV F +W T I P+ +L ++++ LEE
Sbjct: 11 VKKKIDEVKHFLSISLSISNAHTVDFYTRDVWSTFISVSPEDVLCAISSTQDSTGALEEK 70
Query: 542 WKSRQD--NHVETNSELVKFFQAADKFRLTSLMNGESIITVDELFSKLQARKCGQVV--- 596
+ N E ++ +AA L L + ++EL L+ CG
Sbjct: 71 ENTTFGFCNASEKLLDISALLRAAKAHCLPGL---GVCMQLEELMHNLRL-MCGDTAAGQ 126
Query: 597 ESAKV---SQLMSEKKSYEVQVMSQVVAAVTNSCDSSHIIDLGS 637
S KV + M+ KK++EVQ MS+VVA++ C +ID+GS
Sbjct: 127 SSNKVLAPDEFMNWKKAHEVQSMSEVVASLAKCCRVKQVIDVGS 170
Score = 41.6 bits (96), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 18/52 (34%), Positives = 30/52 (57%)
Query: 1044 TQVDDEEVTRLFDSHQREYEYLQIYYFLKTALAPVIEALIVLDRVLYLREQQ 1095
+++ D ++ D + + + LK LAP IE LI+LDR+ YL+EQ+
Sbjct: 508 SKLSDGDIQSYHDRYTARMNEMVAFNMLKVTLAPCIEGLILLDRLCYLKEQE 559
>gi|73978159|ref|XP_539700.2| PREDICTED: uncharacterized protein C12orf26 [Canis lupus familiaris]
Length = 606
Score = 144 bits (363), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 80/229 (34%), Positives = 136/229 (59%), Gaps = 15/229 (6%)
Query: 804 ITHFVTPDSDISSILSQAYPQDSLHNVCIMGLHTCGDLSGTALRLF-TKSSLQCLVQVGC 862
+T F+T DS++ I+ L + ++GLHTCGDL+ LR+F +KS ++ + VGC
Sbjct: 362 LTSFITADSELHDIIKD------LEDCLMVGLHTCGDLAPNTLRIFASKSEIKGVCSVGC 415
Query: 863 CYHLLEEEFIRSPFWKDVDQSLYEHGYGFPLSEHLRSRKFFLGRNVRMSGTQSPERVIDL 922
CYHLL EEF ++ + + +GFP+ ++L+ ++ GRN RMS + ERV
Sbjct: 416 CYHLLSEEF------ENPHKESTQEKWGFPMCQYLKEERWCCGRNARMSACLALERVAVG 469
Query: 923 KQTQTLPLFYRALLEKYLRSKITINDEEPKVVGRLATKCSNFVEYVHRAVDKLKLD-LEV 981
+ T LFYRA+L+ ++ I + + VG++ +K S+F++YV +++ KL LD ++
Sbjct: 470 QGLSTQSLFYRAVLQDIIKECYGITKCD-RHVGKIYSKSSSFLDYVRKSLKKLGLDESKL 528
Query: 982 DDEEVTRLFNSHQREYEYLQIYYFLKTALAPVIEALIVLDRVLYLREQQ 1030
++ + + ++ L+ + LK LAP IE LI+LDR+ YL+EQ+
Sbjct: 529 PEKVIMDYYEKYKPRMNELEAFNMLKVVLAPCIETLILLDRLCYLKEQE 577
Score = 82.0 bits (201), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 37/103 (35%), Positives = 66/103 (64%)
Query: 634 DLGSTITVDELFSKLQARKCGLVVESAKVSQLMSEKKSYEVQVMSQVVAAVTNSCDSSHI 693
+LG ++L + L+ + E+ K+ + M+ KKS+EV+ MS++++ + + C +
Sbjct: 123 NLGICTPFEQLLTALRGSQKQRTGENVKLDEFMNLKKSHEVKAMSELISGIADYCGIKQV 182
Query: 694 IDLGGGQGYLSTILALQHGKKTLSLDYNQVNTHGAAVRSKKLE 736
IDLG G+GYLS+ L+L++G K +D + NTHGA R++KL+
Sbjct: 183 IDLGSGKGYLSSFLSLKYGLKVYGIDSSNTNTHGAEERNRKLK 225
Score = 57.0 bits (136), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 43/172 (25%), Positives = 80/172 (46%), Gaps = 19/172 (11%)
Query: 485 IKNTIDEMIEFITPNIACVNTHTVHFMVEKLWETTI---PKAILSQVNNSPSFDS----- 536
++ + E+++F+ ++ N HTV F E +WE I P+A+L + S
Sbjct: 16 LRAKLQELLQFLREALSISNAHTVDFYTESVWEQLIDLPPEAVLGVLRKSAVEAEAEAEA 75
Query: 537 ------LLEEFWKSRQDNH-----VETNSELVKFFQAADKFRLTSLMNGESIITVDELFS 585
LL E + ET+ +L+ A R S+ N ++L +
Sbjct: 76 RVSEARLLSEAESPSGITNFPKIFCETSQKLMSVEAFALAVRYYSVQNLGICTPFEQLLT 135
Query: 586 KLQARKCGQVVESAKVSQLMSEKKSYEVQVMSQVVAAVTNSCDSSHIIDLGS 637
L+ + + E+ K+ + M+ KKS+EV+ MS++++ + + C +IDLGS
Sbjct: 136 ALRGSQKQRTGENVKLDEFMNLKKSHEVKAMSELISGIADYCGIKQVIDLGS 187
>gi|433656047|ref|YP_007299755.1| putative exonuclease of the beta-lactamase fold involved in RNA
processing [Thermoanaerobacterium thermosaccharolyticum
M0795]
gi|433294236|gb|AGB20058.1| putative exonuclease of the beta-lactamase fold involved in RNA
processing [Thermoanaerobacterium thermosaccharolyticum
M0795]
Length = 820
Score = 144 bits (362), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 92/293 (31%), Positives = 155/293 (52%), Gaps = 13/293 (4%)
Query: 6 VGNQ-SIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFLKKVH 64
VGN+ SI Y+GD++ + A I K RPD+ ESTY + ++ E ++K++
Sbjct: 159 VGNEGSIFYSGDFSRFRQNTIEGASIPKLRPDVAFFESTYGDKLHANRELEESRLVEKIN 218
Query: 65 ECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMFITWT 124
+ GGKVLIP FALGRAQE+ ++L+ + + + +Y G+ + YK+ +
Sbjct: 219 AVLKNGGKVLIPAFALGRAQEIILILKKAINKGMINSKVYVD-GMVKDICRIYKLNPNYL 277
Query: 125 NQKI-RKTFVQRNMFDFKHIRPFDKSFIDN---PGPMVVFATPGMLHSGLSLIIFKKWAP 180
Q + +K F +F ++ P D+ + P V+ ++ GML G S +K A
Sbjct: 278 RQNLAKKIFKGGEIFFDDNVVPVDRPEMREDIIKEPCVIVSSSGMLTGGPSQWYAEKLAD 337
Query: 181 VENNMLIMPGFCVQGTIGHKVL------SGVKKLEFENKQIIDVKMAIEYMSFSAHADAK 234
E N++ + G+ + + G K+L S KKL+ +K+ I VK A++ SAHAD
Sbjct: 338 DEKNLIAITGYQDEESPGRKLLELTDEKSEDKKLKLGDKE-IPVKCAVDKFGLSAHADMS 396
Query: 235 GIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTD 287
I+ L+ PK V LVHG+ + FL ++I+++ D ++P NG++ + D
Sbjct: 397 EILSLVNNLHPKKVFLVHGDPDTINFLGKEIQKDIKSDVYVPLNGDAYDIDID 449
>gi|301769225|ref|XP_002920030.1| PREDICTED: uncharacterized protein C12orf26-like [Ailuropoda
melanoleuca]
gi|281340146|gb|EFB15730.1| hypothetical protein PANDA_008705 [Ailuropoda melanoleuca]
Length = 602
Score = 144 bits (362), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 80/229 (34%), Positives = 136/229 (59%), Gaps = 15/229 (6%)
Query: 804 ITHFVTPDSDISSILSQAYPQDSLHNVCIMGLHTCGDLSGTALRLFT-KSSLQCLVQVGC 862
+T F+T DS++ I+ L + ++GLHTCGDL+ LR+FT KS ++ + VGC
Sbjct: 358 LTSFITADSELHDIIKD------LEDCLMVGLHTCGDLAPNTLRIFTSKSEIKGVCSVGC 411
Query: 863 CYHLLEEEFIRSPFWKDVDQSLYEHGYGFPLSEHLRSRKFFLGRNVRMSGTQSPERVIDL 922
CYHLL EEF ++ + + +GFP+ ++L+ ++ GRN RMS + ERV
Sbjct: 412 CYHLLSEEF------ENPHKESTQEKWGFPMCQYLKEERWCCGRNARMSACLALERVAVG 465
Query: 923 KQTQTLPLFYRALLEKYLRSKITINDEEPKVVGRLATKCSNFVEYVHRAVDKLKL-DLEV 981
+ T LFYRA+L+ ++ I + + VG++ +K S+F++YV +++ KL L + ++
Sbjct: 466 QGLPTESLFYRAVLQDIIKECYGITKCD-RHVGKIYSKSSSFLDYVRKSLKKLGLEEFKL 524
Query: 982 DDEEVTRLFNSHQREYEYLQIYYFLKTALAPVIEALIVLDRVLYLREQQ 1030
++ + + ++ L+ + LK LAP IE LI+LDR+ YL+EQ+
Sbjct: 525 PEKVIMDYYEKYKPRMNELEAFNMLKVVLAPCIETLILLDRLCYLKEQE 573
Score = 75.9 bits (185), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 36/95 (37%), Positives = 62/95 (65%)
Query: 642 DELFSKLQARKCGLVVESAKVSQLMSEKKSYEVQVMSQVVAAVTNSCDSSHIIDLGGGQG 701
++L + L+ K E+ K + M+ KKS+EVQ MS++++++ + +IDLG G+G
Sbjct: 127 EQLLTVLRGNKKQRTDENVKPDEFMNLKKSHEVQAMSELISSIADYYGIKQVIDLGSGKG 186
Query: 702 YLSTILALQHGKKTLSLDYNQVNTHGAAVRSKKLE 736
YLS+ L+L++G K +D + NTHGA R++KL+
Sbjct: 187 YLSSFLSLKYGLKVYGIDSSNTNTHGAEERNRKLK 221
Score = 60.1 bits (144), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 45/168 (26%), Positives = 81/168 (48%), Gaps = 15/168 (8%)
Query: 485 IKNTIDEMIEFITPNIACVNTHTVHFMVEKLWETTI---PKAILSQVNNS-------PSF 534
++ + E+++F+ ++ N HTV F E +W+ + P+ +L + S PS
Sbjct: 16 LRAKLQELLQFLRKALSISNAHTVDFYTESVWDQLVDLPPETVLGVLRKSVAEAEARPSE 75
Query: 535 DSLLEEFWKSRQDNH-----VETNSELVKFFQAADKFRLTSLMNGESIITVDELFSKLQA 589
LL E K + ET+ +LV R S+ N E ++L + L+
Sbjct: 76 ARLLVEAEKRSGITNFPKIFCETSQKLVSVEAFVLAVRYYSVQNLEICTPFEQLLTVLRG 135
Query: 590 RKCGQVVESAKVSQLMSEKKSYEVQVMSQVVAAVTNSCDSSHIIDLGS 637
K + E+ K + M+ KKS+EVQ MS++++++ + +IDLGS
Sbjct: 136 NKKQRTDENVKPDEFMNLKKSHEVQAMSELISSIADYYGIKQVIDLGS 183
Score = 39.7 bits (91), Expect = 9.1, Method: Compositional matrix adjust.
Identities = 17/41 (41%), Positives = 27/41 (65%)
Query: 1055 FDSHQREYEYLQIYYFLKTALAPVIEALIVLDRVLYLREQQ 1095
++ ++ L+ + LK LAP IE LI+LDR+ YL+EQ+
Sbjct: 533 YEKYKPRMNELEAFNMLKVVLAPCIETLILLDRLCYLKEQE 573
>gi|113677179|ref|NP_001038522.1| methyltransferase like 25 [Danio rerio]
Length = 590
Score = 143 bits (360), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 83/229 (36%), Positives = 130/229 (56%), Gaps = 14/229 (6%)
Query: 804 ITHFVTPDSDISSILSQAYPQDSLHNVCIMGLHTCGDLSGTALRLF-TKSSLQCLVQVGC 862
+T +VT ++++ +++++ L N ++GLHTCGDL+ ++LR+F K L+ + VGC
Sbjct: 344 LTSYVTAETELKTLITE------LENAVLVGLHTCGDLASSSLRMFRAKQELRAVCSVGC 397
Query: 863 CYHLLEEEFIRSPFWKDVDQSLYEHGYGFPLSEHLRSRKFFLGRNVRMSGTQSPERVIDL 922
CYHLL EEF +D + GFP+S++L+ + F GRN RMS + ERV
Sbjct: 398 CYHLLSEEFD-----QDRPGCTSDGVCGFPMSQYLKDQACFCGRNARMSACLALERVSAG 452
Query: 923 KQTQTLPLFYRALLEKYLRSKITINDEEPKVVGRLATKCSNFVEYVHRAVDKLKLD-LEV 981
LFYRA+L L+ E K VG + +K ++F +YV RA+ KL LD ++
Sbjct: 453 GGLPMESLFYRAVLHVILQDHYDCFKSE-KRVGNVYSKATSFADYVRRALRKLDLDESKL 511
Query: 982 DDEEVTRLFNSHQREYEYLQIYYFLKTALAPVIEALIVLDRVLYLREQQ 1030
D ++ + + + + LK LAP IE LI+LDR+ YL+EQ+
Sbjct: 512 SDGDIQSYHDRYTPRMNEMVAFNMLKVTLAPCIEGLILLDRLCYLKEQE 560
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 40/123 (32%), Positives = 71/123 (57%), Gaps = 7/123 (5%)
Query: 620 VAAVTNSCDSSHIIDLGSTITVDELFSKLQARKCGLVV---ESAKV---SQLMSEKKSYE 673
++A+ + + + LG + ++EL L+ CG S KV + M+ KK++E
Sbjct: 88 ISALLRAAKAHCLPGLGVCMQLEELMHNLRL-MCGDTAAGQSSNKVLAPDEFMNWKKAHE 146
Query: 674 VQVMSQVVAAVTNSCDSSHIIDLGGGQGYLSTILALQHGKKTLSLDYNQVNTHGAAVRSK 733
VQ MS+VVA++ C +ID+G G+GYL + L++++ + +D + NTHGA R++
Sbjct: 147 VQSMSEVVASLAKCCRVKQVIDVGSGKGYLCSYLSMRYNLQVFGIDSSTTNTHGAQERNR 206
Query: 734 KLE 736
KL+
Sbjct: 207 KLK 209
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 47/164 (28%), Positives = 79/164 (48%), Gaps = 15/164 (9%)
Query: 485 IKNTIDEMIEFITPNIACVNTHTVHFMVEKLWETTI---PKAILSQVNNSPSFDSLLEEF 541
+K IDE+ F++ +++ N HTV F +W T I P+ +L ++++ LEE
Sbjct: 12 VKKKIDEVKHFLSISLSISNAHTVDFYTRGVWSTFISVSPEDVLCAISSTQDSTGALEEK 71
Query: 542 WKSRQD--NHVETNSELVKFFQAADKFRLTSLMNGESIITVDELFSKLQARKCGQVV--- 596
+ N + ++ +AA L L + ++EL L+ CG
Sbjct: 72 ENTTFGFCNASKKLLDISALLRAAKAHCLPGL---GVCMQLEELMHNLRL-MCGDTAAGQ 127
Query: 597 ESAKV---SQLMSEKKSYEVQVMSQVVAAVTNSCDSSHIIDLGS 637
S KV + M+ KK++EVQ MS+VVA++ C +ID+GS
Sbjct: 128 SSNKVLAPDEFMNWKKAHEVQSMSEVVASLAKCCRVKQVIDVGS 171
>gi|146163951|ref|XP_001012725.2| metallo beta lactamase domain containing protein [Tetrahymena
thermophila]
gi|146145850|gb|EAR92480.2| metallo beta lactamase domain containing protein [Tetrahymena
thermophila SB210]
Length = 750
Score = 142 bits (359), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 94/296 (31%), Positives = 158/296 (53%), Gaps = 18/296 (6%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V++ ++YTGDY+ D + A I + D+LI E TY +S+ RE+
Sbjct: 154 MFLVEIDGVRVLYTGDYSTEKDILIPPAQIPNEKVDVLIVEGTYGKCNHESRSEREQQLT 213
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAP---IYFAVGLTEKATNYY 117
++V V G+ L+PVFALGRAQE+ ++LE +W++ + + I+F L++KA +
Sbjct: 214 QEVMRIVKNKGQCLLPVFALGRAQEIVLILEEFWKQNSSELADIKIHFTQNLSKKANQIF 273
Query: 118 KMFITWTNQKIRKTFVQRNMFDFKHIRPFD---KSFIDNPGPMVVFATPGMLHSGLSLII 174
+ + + IRK Q N F+ H P + ID P V+ ++P L SG+S I
Sbjct: 274 QYYKSMMADPIRKN--QLNPFNL-HYAPNNITKHDEIDENKPCVILSSPFNLQSGISRTI 330
Query: 175 FKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVK-----MAIEYMSFSA 229
++ N +I+ GFC + T+ ++ + +K + IID + M+++ +SFSA
Sbjct: 331 IERICSKPQNGVILTGFCPENTLSKEIQN--EKQDKTINSIIDGRKIPFNMSVKNISFSA 388
Query: 230 HADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF-NLDCFMPANGES-CF 283
HAD I+ EP++V+LVHG + K+ L++++ + F L P N + CF
Sbjct: 389 HADLNQTTNFIRAIEPQHVILVHGSSIKIKELQKELMKTFVGLQVHAPNNQQKICF 444
>gi|15668334|ref|NP_247130.1| putative mRNA 3'-end processing factor 3 [Methanocaldococcus
jannaschii DSM 2661]
gi|2495836|sp|Q57626.1|Y162_METJA RecName: Full=Uncharacterized protein MJ0162
gi|1590919|gb|AAB98146.1| putative mRNA 3'-end processing factor 3 [Methanocaldococcus
jannaschii DSM 2661]
Length = 421
Score = 142 bits (359), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 90/281 (32%), Positives = 154/281 (54%), Gaps = 10/281 (3%)
Query: 4 VKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATT--IRDSKRCRERDFLK 61
++V + I+YTGD N R L A D D+LI ESTY + I+ +++ ER ++
Sbjct: 141 LEVDGKKILYTGDINEGVSRTLLPADTDIDEIDVLIIESTYGSPLDIKPARKTLERQLIE 200
Query: 62 KVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQ-APIYFAVGLTEKATNYYKMF 120
++ E ++ GGKV+IPVFA+GRAQE+ +++ Y L+ PIY L AT Y +
Sbjct: 201 EISETIENGGKVIIPVFAIGRAQEILLIINNYIRSGKLRDVPIYTDGSLIH-ATAVYMSY 259
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
I W N KI K V+ + F I+ D+S + N P ++ +T GM+ G ++ + K
Sbjct: 260 INWLNPKI-KNMVENRINPFGEIKKADESLVFNKEPCIIVSTSGMVQGG-PVLKYLKLLK 317
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLE-FENKQIIDVKMAIEYMSFSAHADAKGIMQL 239
N LI+ G+ +GT+G ++ G K+++ F+NK I ++ + + FSAH D +++
Sbjct: 318 DPKNKLILTGYQAEGTLGRELEEGAKEIQPFKNK--IPIRGKVVKIEFSAHGDYNSLVRY 375
Query: 240 IQYC-EPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANG 279
I+ +P+ +++HGE + + I + + F+P G
Sbjct: 376 IKKIPKPEKAIVMHGERYQSLSFAMTIWKTLKIPTFVPVRG 416
>gi|336477059|ref|YP_004616200.1| KH-domain/beta-lactamase-domain-containing protein [Methanosalsum
zhilinae DSM 4017]
gi|335930440|gb|AEH60981.1| KH-domain/beta-lactamase-domain protein [Methanosalsum zhilinae DSM
4017]
Length = 636
Score = 142 bits (359), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 96/297 (32%), Positives = 153/297 (51%), Gaps = 17/297 (5%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATT--IRDSKRCRERDF 59
F + G +IV TGDY P R A R + +ITESTY + I+ S + E++
Sbjct: 338 FHIGDGRHNIVMTGDYKYGPTRLFDPAVNKFPRVETVITESTYGNSNAIQPSLKEAEKNL 397
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNL-QAPIYFAVGLTEKATNYYK 118
V +DR G VLIP FA+GR+QE+ I+LE + + P+Y G+ +AT +
Sbjct: 398 QNIVKRTIDRNGIVLIPAFAVGRSQEVMIVLEEAIRKGIISDIPVYLD-GMIWEATAIHA 456
Query: 119 MFITWTNQKIRKTFVQR--NMFDFKHIRPFD-----KSFIDNPGPMVVFATPGMLHSGLS 171
+ + N +RK Q+ N F ++ R D ++ ID+P P V+ AT GM+++G
Sbjct: 457 TYPEYLNNDLRKLIFQKGQNPFLSENFRSVDSNELRRNIIDDPHPCVILATAGMMNAGPI 516
Query: 172 LIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIID-VKMAIEYM---SF 227
L FK +AP E N L+ G+ GT+G ++ G K++ K D VKM +E F
Sbjct: 517 LDYFKAFAPDERNTLVFVGYQADGTMGRRIQKGWKEVPLPAKNGSDMVKMNMEVEIVDGF 576
Query: 228 SAHADAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESC 282
S H+D K +M ++ +P+ + +HGE S + L + +++ L+ N E+
Sbjct: 577 SGHSDRKQLMDYVKKMKPRPERIFTMHGEESSCIDLASSMYKKYRLETRALTNLETV 633
>gi|224094152|ref|XP_002192209.1| PREDICTED: methyltransferase-like protein 25 [Taeniopygia guttata]
Length = 543
Score = 142 bits (359), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 78/229 (34%), Positives = 135/229 (58%), Gaps = 16/229 (6%)
Query: 804 ITHFVTPDSDISSILSQAYPQDSLHNVCIMGLHTCGDLSGTALRLFT-KSSLQCLVQVGC 862
+T +T +++++ I++ L + ++GLHTCGDL+ LR+FT K ++ + VGC
Sbjct: 298 LTSCITAETELNDIITD------LEDCMMVGLHTCGDLAANTLRIFTAKPEIKAVCSVGC 351
Query: 863 CYHLLEEEFIRSPFWKDVDQSLYEHGYGFPLSEHLRSRKFFLGRNVRMSGTQSPERVIDL 922
CYHLL E+F + + ++ +GFP+ ++L+++ + GRN RMS + ERV
Sbjct: 352 CYHLLSEQF-------ENQEECHDQVWGFPMCQYLKNKGWCCGRNARMSACLALERVAVG 404
Query: 923 KQTQTLPLFYRALLEKYLRSKITINDEEPKVVGRLATKCSNFVEYVHRAVDKLKL-DLEV 981
+ T LFYRA+L+ + + + + VG+ +K S+FV+YV R++ KL+L D ++
Sbjct: 405 QMLPTESLFYRAVLQVIVEEIYGVTKSD-RHVGKTFSKSSSFVDYVRRSLKKLELDDSKL 463
Query: 982 DDEEVTRLFNSHQREYEYLQIYYFLKTALAPVIEALIVLDRVLYLREQQ 1030
D + + ++ L+ + LK LAP IE LI+LDR+ YL+EQ+
Sbjct: 464 PDSCIMDYYEKYKHRMNELEAFNMLKVVLAPCIEVLILLDRLCYLKEQE 512
Score = 74.7 bits (182), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 37/103 (35%), Positives = 63/103 (61%), Gaps = 2/103 (1%)
Query: 634 DLGSTITVDELFSKLQARKCGLVVESAKVSQLMSEKKSYEVQVMSQVVAAVTNSCDSSHI 693
+LG ++++ L+ G + K + M+ KKS+EVQVMS++V + N C +
Sbjct: 49 NLGVCTPLEDVLEALKGDSEGAI--GIKPDEFMNNKKSHEVQVMSELVDGIANYCGIKQV 106
Query: 694 IDLGGGQGYLSTILALQHGKKTLSLDYNQVNTHGAAVRSKKLE 736
ID+G G+GYLS+ L++Q+ K +D + NT+GA R++KL+
Sbjct: 107 IDIGSGKGYLSSFLSMQYNLKVYGIDCSSSNTNGAHERNRKLK 149
Score = 42.4 bits (98), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 42/81 (51%), Gaps = 4/81 (4%)
Query: 557 VKFFQAADKFRLTSLMNGESIITVDELFSKLQARKCGQVVESAKVSQLMSEKKSYEVQVM 616
V F A K+ SL N ++++ L+ G + K + M+ KKS+EVQVM
Sbjct: 35 VHLFALAAKYY--SLSNLGVCTPLEDVLEALKGDSEGAI--GIKPDEFMNNKKSHEVQVM 90
Query: 617 SQVVAAVTNSCDSSHIIDLGS 637
S++V + N C +ID+GS
Sbjct: 91 SELVDGIANYCGIKQVIDIGS 111
Score = 42.4 bits (98), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 18/52 (34%), Positives = 32/52 (61%)
Query: 1044 TQVDDEEVTRLFDSHQREYEYLQIYYFLKTALAPVIEALIVLDRVLYLREQQ 1095
+++ D + ++ ++ L+ + LK LAP IE LI+LDR+ YL+EQ+
Sbjct: 461 SKLPDSCIMDYYEKYKHRMNELEAFNMLKVVLAPCIEVLILLDRLCYLKEQE 512
>gi|326912591|ref|XP_003202632.1| PREDICTED: uncharacterized protein C12orf26-like [Meleagris
gallopavo]
Length = 536
Score = 142 bits (358), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 76/228 (33%), Positives = 134/228 (58%), Gaps = 16/228 (7%)
Query: 804 ITHFVTPDSDISSILSQAYPQDSLHNVCIMGLHTCGDLSGTALRLFT-KSSLQCLVQVGC 862
+T +++ ++++S I++ L + ++GLHTCGDL+ LR+FT K ++ + VGC
Sbjct: 291 LTSYISAETELSDIITD------LEDCIMVGLHTCGDLAANTLRIFTAKPEIKAVCSVGC 344
Query: 863 CYHLLEEEFIRSPFWKDVDQSLYEHGYGFPLSEHLRSRKFFLGRNVRMSGTQSPERVIDL 922
CYHLL E++ + + E +GFP+ ++L+ + + GRN RMS + ERV
Sbjct: 345 CYHLLSEQY-------ENQEDCTEEVWGFPMCQYLKDKGWCCGRNARMSACLALERVAVG 397
Query: 923 KQTQTLPLFYRALLEKYLRSKITINDEEPKVVGRLATKCSNFVEYVHRAVDKLKL-DLEV 981
+ T LFYRA+L+ + + + + VG++ +K S+F++YV +++ KL+L D ++
Sbjct: 398 RTLPTESLFYRAVLQVIIEEMYGVTKSD-RHVGKIFSKSSSFLDYVRKSLKKLELDDSKL 456
Query: 982 DDEEVTRLFNSHQREYEYLQIYYFLKTALAPVIEALIVLDRVLYLREQ 1029
D + + ++ L+ + LK LAP IE LI+LDR+ YL+EQ
Sbjct: 457 PDSCIMDYYEKYKHRMNELEAFNMLKVVLAPCIEVLILLDRLCYLKEQ 504
Score = 75.5 bits (184), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 46/157 (29%), Positives = 78/157 (49%), Gaps = 21/157 (13%)
Query: 634 DLGSTITVDELFSKLQARKCGLVVESAKVSQLMSEKKSYEVQVMSQVVAAVTNSCDSSHI 693
+LG ++E+ L+ G K + M+ KKS+EVQVMS++V + C +
Sbjct: 47 NLGVCTPLEEILEALRGDSQG--ASGIKTDEFMNNKKSHEVQVMSELVDKIAKYCGLKQV 104
Query: 694 IDLGGGQGYLSTILALQHGKKTLSLDYNQVNTHGAAVRSKKLEDSEKESKGPFQSYAGVI 753
ID+G G+GYLS+ L++Q+ K +D ++ NT+GA R++KL
Sbjct: 105 IDIGSGKGYLSSFLSMQYDLKVYGIDSSKTNTNGAHERNRKL------------------ 146
Query: 754 NKKLWMRTQNRQTSHASVEHHGKNWKRKSKAPVVSIK 790
KK W Q+R +++ + K K + K ++ K
Sbjct: 147 -KKHWKTYQSRARANSETQRLEKPIKNEVKCKAINEK 182
Score = 41.6 bits (96), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 18/51 (35%), Positives = 31/51 (60%)
Query: 1044 TQVDDEEVTRLFDSHQREYEYLQIYYFLKTALAPVIEALIVLDRVLYLREQ 1094
+++ D + ++ ++ L+ + LK LAP IE LI+LDR+ YL+EQ
Sbjct: 454 SKLPDSCIMDYYEKYKHRMNELEAFNMLKVVLAPCIEVLILLDRLCYLKEQ 504
Score = 40.4 bits (93), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 28/98 (28%), Positives = 46/98 (46%), Gaps = 2/98 (2%)
Query: 540 EFWKSRQDNHVETNSELVKFFQAADKFRLTSLMNGESIITVDELFSKLQARKCGQVVESA 599
E+ + D E + +L+ A + SL N ++E+ L+ Q
Sbjct: 14 EYDNTFSDVFCENSKKLINVHLFALAAKHYSLPNLGVCTPLEEILEALRGD--SQGASGI 71
Query: 600 KVSQLMSEKKSYEVQVMSQVVAAVTNSCDSSHIIDLGS 637
K + M+ KKS+EVQVMS++V + C +ID+GS
Sbjct: 72 KTDEFMNNKKSHEVQVMSELVDKIAKYCGLKQVIDIGS 109
>gi|261403765|ref|YP_003247989.1| beta-lactamase [Methanocaldococcus vulcanius M7]
gi|261370758|gb|ACX73507.1| beta-lactamase domain protein [Methanocaldococcus vulcanius M7]
Length = 421
Score = 142 bits (357), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 89/281 (31%), Positives = 154/281 (54%), Gaps = 10/281 (3%)
Query: 4 VKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATT--IRDSKRCRERDFLK 61
++V + I+YTGD N R L +A D D+LI ESTY + ++ +++ ER ++
Sbjct: 141 LEVDGKKILYTGDINENVSRTLLSADTDFDEIDVLIVESTYGSPLDVKPARKTLERQLIE 200
Query: 62 KVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNL-QAPIYFAVGLTEKATNYYKMF 120
++ E ++ GGKV+IPVFA+GRAQE+ ++L Y NL + PIY L AT Y +
Sbjct: 201 EIAETIENGGKVIIPVFAIGRAQEILLILNNYMRSGNLTEVPIYTDGSLIH-ATAVYMSY 259
Query: 121 ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
W N KI K ++ + F I+ D + + N P ++ +T GM+ G ++ + K
Sbjct: 260 TNWLNPKI-KNMIENGINPFGEIKKADDNLVFNKEPCIIVSTSGMVQGG-PILKYLKLLK 317
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLE-FENKQIIDVKMAIEYMSFSAHADAKGIMQL 239
N LI+ G+ +GTIG ++ G ++++ F+NK I+ K+ + FSAH D +++
Sbjct: 318 DPKNKLILTGYQAEGTIGRELEEGAEEIQPFKNKIPINGKVV--KIEFSAHGDYNSLVRY 375
Query: 240 IQYC-EPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANG 279
I+ +P+ +++HGE + + I + + F+P G
Sbjct: 376 IKKIPKPEKAIVMHGERYQALSFAMTIWKTLKIPTFVPVKG 416
>gi|402886986|ref|XP_003906890.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein C12orf26
homolog [Papio anubis]
Length = 470
Score = 141 bits (356), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 78/202 (38%), Positives = 123/202 (60%), Gaps = 10/202 (4%)
Query: 831 CIM-GLHTCGDLSGTALRLFT-KSSLQCLVQVGCCYHLLEEEFIRSPFWKDVDQSLYEHG 888
C+M GLHTCGDL+ LR+FT KS ++ + VGCCYHLL EEF ++ + +
Sbjct: 250 CLMVGLHTCGDLAPNTLRIFTAKSEIKGVCSVGCCYHLLSEEF------ENQHKERTQEK 303
Query: 889 YGFPLSEHLRSRKFFLGRNVRMSGTQSPERVIDLKQTQTLPLFYRALLEKYLRSKITIND 948
+GFP+ +L+ ++ GRN RMS + ERV + T LFYRA+L+ ++ I
Sbjct: 304 WGFPMCHYLKEERWCCGRNARMSACLALERVAAGQGLPTESLFYRAVLQDIIKDCYGITK 363
Query: 949 EEPKVVGRLATKCSNFVEYVHRAVDKLKLD-LEVDDEEVTRLFNSHQREYEYLQIYYFLK 1007
+ + VG++ +KCS+F++YV R++ KL LD ++ ++ + + ++ L+ + LK
Sbjct: 364 CD-RHVGKIYSKCSSFLDYVRRSLKKLGLDESKLPEKIIMNYYEKYKPRMNELEAFNMLK 422
Query: 1008 TALAPVIEALIVLDRVLYLREQ 1029
LAP IE LI+LDR+ YL+EQ
Sbjct: 423 VVLAPCIETLILLDRLCYLKEQ 444
Score = 81.3 bits (199), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 64/256 (25%), Positives = 115/256 (44%), Gaps = 54/256 (21%)
Query: 485 IKNTIDEMIEFITPNIACVNTHTVHFMVEKLWETTI---PKAILSQVNNSPSFDSLLEEF 541
++ + +++F+ ++ N HTV F E +WE + P+ +L+ + S S
Sbjct: 16 LRAKLQGLLQFLRDALSISNAHTVDFYTESVWEELVDLTPETVLTALRKSAS-------- 67
Query: 542 WKSRQDNHVETN-SELVKFFQAADKFRLTSLMNGESIITVDELFSKLQARKCGQVVESAK 600
E SE + +A GE+ +T
Sbjct: 68 -------EAEARPSETRRLVEA----------EGEAGVT--------------------D 90
Query: 601 VSQLMSEKKSYEVQVMSQVVAAVTNSCDSSHIIDLGSTITVDELFSKLQARKCGLVVESA 660
++ E V V + +AA S I +LG ++L L+ + + E+
Sbjct: 91 FPKIFCETSQKLVSVEAFALAAKYYS-----IQNLGICTPFEQLLVALRGNQNQRIGETQ 145
Query: 661 KVSQLMSEKKSYEVQVMSQVVAAVTNSCDSSHIIDLGGGQGYLSTILALQHGKKTLSLDY 720
K + M+ KKS+EVQ MS++++++ + +IDLG G+GYLS+ L+L++G K +D
Sbjct: 146 KAVEFMNMKKSHEVQAMSELISSIADYYGIKQVIDLGSGKGYLSSFLSLKYGFKVYGIDS 205
Query: 721 NQVNTHGAAVRSKKLE 736
+ NTHGA R++KL+
Sbjct: 206 SNTNTHGAEERNRKLK 221
>gi|357622694|gb|EHJ74116.1| hypothetical protein KGM_16062 [Danaus plexippus]
Length = 769
Score = 141 bits (356), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 108/371 (29%), Positives = 186/371 (50%), Gaps = 49/371 (13%)
Query: 654 GLVVESAKVSQLMSEKKSYEVQVMSQVVAAVTNSCDSSHIIDLGGGQGYLSTILALQHGK 713
G + ++++ M+ KKSYEV+ MS ++A++ CD +H ++ GGG+G L L+L +
Sbjct: 110 GEIKPQVQITEFMTSKKSYEVKTMSHLIASLCTVCDVTHCVEAGGGKGNLPVALSLSYHL 169
Query: 714 KTLSLDYNQVNTHGAAVRSKKLEDSEKESKGPFQSYAGVINKKLWMRTQNRQTSHASVEH 773
+L++D N + + R K + Q I+KK+ + ++ + S+E
Sbjct: 170 PSLTIDCNPIAVNNGEKRVKII-----------QKQWHAISKKV--KDGSKHLASDSIE- 215
Query: 774 HGKNWKRKSKAPVVSIKDEELVVCKDKCKQITHFVTPDSDISSILSQAYPQDSLH-NVCI 832
N R + A ++T D+D + ++ + +P+ S + +
Sbjct: 216 --TNLHRFAAA----------------------YITTDTDFTRLVREKFPEYSGDVKLLL 251
Query: 833 MGLHTCGDLSGTALRLFTKS-SLQCLVQVGCCYHLLEEEFIRSPFWKDVDQSLYEH---G 888
GLHTCG+L +L +FT + S+ L V CCYHLL E+ F DV Q E
Sbjct: 252 TGLHTCGNLGPDSLVIFTTNPSISSLFNVPCCYHLLTEDVDVELF--DVFQRYGEGCGGS 309
Query: 889 YGFPLSEHLRSRKFFLGRNVRMSGTQSPERVIDLKQTQTLPLFYRALLEKYLRSKIT-IN 947
GFP+SE L+ + LGRN RM QS RV+ KQ L YRAL++ ++ ++ ++
Sbjct: 310 KGFPMSEGLKG--YNLGRNARMLAAQSIHRVVYNKQIPDKGLLYRALIQIIIKQRLPDLH 367
Query: 948 DEEPKVVGRLATKCSNFVEYVHRAVDKLKLDLEVDDEEVTRLFNSHQREYEYLQIYYFLK 1007
E K+ G +++KC NF +Y A LK+ ++ + E + +++ + ++Y L+
Sbjct: 368 VSEGKLKG-ISSKCQNFDDYAKMADAILKIGVDQNSEIYLEVQKDIDVKWKKIVMFYLLR 426
Query: 1008 TALAPVIEALI 1018
LA VIE +I
Sbjct: 427 LCLAQVIEHVI 437
Score = 65.1 bits (157), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 39/153 (25%), Positives = 75/153 (49%), Gaps = 6/153 (3%)
Query: 485 IKNTIDEMIEFITPNIACVNTHTVHFMVEKLWETTIPKAILSQVNNSPSFDSLLEEFWKS 544
+K +D F+ + N H V F E W+ +PK + + ++ ++ FWK
Sbjct: 8 VKGDLDAATTFLDQYLHLANCHMVEFFTESHWDRLVPKKLRNYLDVC-ELSQAIDNFWKY 66
Query: 545 RQDNHVETNSELVKFFQAADKFRLTSLMNGESIITVDELFSKLQARKCGQVVESAKVSQL 604
N + N EL K+ + + K+ + +N I T E ++ G++ ++++
Sbjct: 67 ADGNCCDDN-ELNKWIKESRKYY--TALNTYCIST--EKLQEIIKSWGGEIKPQVQITEF 121
Query: 605 MSEKKSYEVQVMSQVVAAVTNSCDSSHIIDLGS 637
M+ KKSYEV+ MS ++A++ CD +H ++ G
Sbjct: 122 MTSKKSYEVKTMSHLIASLCTVCDVTHCVEAGG 154
>gi|333910570|ref|YP_004484303.1| beta-lactamase domain-containing protein [Methanotorris igneus Kol
5]
gi|333751159|gb|AEF96238.1| beta-lactamase domain protein [Methanotorris igneus Kol 5]
Length = 419
Score = 141 bits (356), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 93/284 (32%), Positives = 149/284 (52%), Gaps = 21/284 (7%)
Query: 6 VGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATT--IRDSKRCRERDFLKKV 63
+ + I+YTGD N TP R L A D D+LI ESTY + I+ S++ E+ + ++
Sbjct: 142 IDGKKILYTGDINETPTRTLLPADTDVDEIDVLIIESTYGSPLDIKPSRKALEKQLIDEI 201
Query: 64 HECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQ-APIYFAVGLTEKATNYYKMFIT 122
E ++ GGKV+IPVFA+GRAQE+ +++ Y L+ P+Y L AT Y ++
Sbjct: 202 AETIENGGKVIIPVFAIGRAQEILLIINNYIRSGKLKDVPVYVDGSLIH-ATGIYMSYLD 260
Query: 123 WTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSG-----LSLIIFKK 177
W N K+ K V+ + F ++ D S N P ++ +T GM+ G LSL+ K
Sbjct: 261 WLNPKL-KNMVENRVNPFGELKKADNSVF-NKEPCIIISTSGMVQGGPVLQYLSLLKSPK 318
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKL-EFENKQIIDVKMAIEYMSFSAHADAKGI 236
N LI+ G+ +GT+G ++ G +++ F+NK I V+ + + FSAH D +
Sbjct: 319 ------NKLILTGYQAEGTLGRELEEGAEEITPFKNK--IPVRGKVVKIEFSAHGDYNSL 370
Query: 237 MQLIQYCE-PKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANG 279
++ I+ PK +VHGE + + I + + F P NG
Sbjct: 371 VRYIKKIPTPKKAFVVHGERYQTLSFAMTIWKSLKIPTFAPVNG 414
>gi|432943433|ref|XP_004083212.1| PREDICTED: methyltransferase-like protein 25-like [Oryzias latipes]
Length = 591
Score = 141 bits (355), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 84/230 (36%), Positives = 133/230 (57%), Gaps = 17/230 (7%)
Query: 804 ITHFVTPDSDISSILSQAYPQDSLHNVCIMGLHTCGDLSGTALRLF-TKSSLQCLVQVGC 862
+T +VT ++++ ++++ L + ++GLHTCG+L+ + LR+F K L + VGC
Sbjct: 341 LTSYVTAETELRELITE------LEDAVMVGLHTCGNLAPSTLRMFVAKPELAAVCSVGC 394
Query: 863 CYHLLEEEFIRSPFWKD-VDQSLYEHGYGFPLSEHLRSRKFFLGRNVRMSGTQSPERVID 921
CYHLL EEF P + +D+S GFPLS++L R F GRN RMS + ERV
Sbjct: 395 CYHLLSEEF--DPAGPESLDRSC-----GFPLSQYLHDRSCFCGRNARMSACLALERVSL 447
Query: 922 LKQTQTLPLFYRALLEKYLRSKITINDEEPKVVGRLATKCSNFVEYVHRAVDKLKLD-LE 980
+ Q LFYRA+L LR + + VG + +K +F++YV RA+ KL+LD +
Sbjct: 448 GQGIQMESLFYRAVLHVILRDHYGSYKSD-RRVGNVYSKAKSFLDYVRRALRKLELDESK 506
Query: 981 VDDEEVTRLFNSHQREYEYLQIYYFLKTALAPVIEALIVLDRVLYLREQQ 1030
+ D ++ + ++ + + LK +AP IE LI+LDR+ YL+EQ+
Sbjct: 507 LSDNDIQDYHDKYRPRMAEMHAFNMLKVTIAPCIEGLILLDRLCYLKEQE 556
Score = 74.7 bits (182), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 36/83 (43%), Positives = 55/83 (66%), Gaps = 1/83 (1%)
Query: 654 GLVVESAKVSQLMSEKKSYEVQVMSQVVAAVTNSCDSSHIIDLGGGQGYLSTILALQHGK 713
G+V E + + M+ KKS+EVQ MS+VVA + +ID+G G+GYLS+ L+LQ+G
Sbjct: 131 GIVTE-LEADEFMNSKKSHEVQSMSEVVAGLARHSGVKQVIDVGSGKGYLSSFLSLQYGL 189
Query: 714 KTLSLDYNQVNTHGAAVRSKKLE 736
+ +D + NTHGA R++KL+
Sbjct: 190 QVYGIDSSSTNTHGAQERNRKLK 212
Score = 57.0 bits (136), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 46/169 (27%), Positives = 79/169 (46%), Gaps = 21/169 (12%)
Query: 485 IKNTIDEMIEFITPNIACVNTHTVHFMVEKLWETTI---PKAILSQVNNSPSFDSLLEEF 541
I+ IDE+++F++ + N HTV F + +W+ + P+ +LS V +P+ D +E
Sbjct: 11 IQQRIDEVMQFLSVTLPIANAHTVEFYTQDVWKRFMAVSPEEVLSAV--TPNSDRKMEPE 68
Query: 542 WKSRQDN--------HVETNSELVKFFQAADKFRLTSLMNGESIITVDELFSKLQARKCG 593
+ + + H + +F Q A L L E ++ + +LQ
Sbjct: 69 RRRTEPSGTMFGFCSHSNRLLDTHEFLQGAKAHSLPGL---EVCMSRSDFLLELQGNWSK 125
Query: 594 QVVESAKVSQL-----MSEKKSYEVQVMSQVVAAVTNSCDSSHIIDLGS 637
V V++L M+ KKS+EVQ MS+VVA + +ID+GS
Sbjct: 126 STVLPGIVTELEADEFMNSKKSHEVQSMSEVVAGLARHSGVKQVIDVGS 174
Score = 41.2 bits (95), Expect = 3.4, Method: Compositional matrix adjust.
Identities = 17/52 (32%), Positives = 31/52 (59%)
Query: 1044 TQVDDEEVTRLFDSHQREYEYLQIYYFLKTALAPVIEALIVLDRVLYLREQQ 1095
+++ D ++ D ++ + + LK +AP IE LI+LDR+ YL+EQ+
Sbjct: 505 SKLSDNDIQDYHDKYRPRMAEMHAFNMLKVTIAPCIEGLILLDRLCYLKEQE 556
>gi|363727602|ref|XP_416122.3| PREDICTED: uncharacterized protein C12orf26 [Gallus gallus]
Length = 460
Score = 140 bits (354), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 75/228 (32%), Positives = 133/228 (58%), Gaps = 16/228 (7%)
Query: 804 ITHFVTPDSDISSILSQAYPQDSLHNVCIMGLHTCGDLSGTALRLFT-KSSLQCLVQVGC 862
+T +++ ++++S I++ L + ++GLHTCGDL+ LR+FT K ++ + VGC
Sbjct: 215 LTSYISAETELSDIITD------LEDCIMVGLHTCGDLAANTLRIFTAKPEIKAVCSVGC 268
Query: 863 CYHLLEEEFIRSPFWKDVDQSLYEHGYGFPLSEHLRSRKFFLGRNVRMSGTQSPERVIDL 922
CYHLL E++ + + E +GFP+ ++L+ + GRN RMS + ERV
Sbjct: 269 CYHLLSEQY-------ENQEDCTEEMWGFPMCQYLKDEGWCCGRNARMSACLALERVAVG 321
Query: 923 KQTQTLPLFYRALLEKYLRSKITINDEEPKVVGRLATKCSNFVEYVHRAVDKLKL-DLEV 981
+ T LFYRA+L+ + + + + VG++ +K S+F++YV +++ KL+L D ++
Sbjct: 322 QMLPTESLFYRAVLQVIVEETYGVTKSD-RHVGKIFSKSSSFLDYVRKSLKKLELDDSKL 380
Query: 982 DDEEVTRLFNSHQREYEYLQIYYFLKTALAPVIEALIVLDRVLYLREQ 1029
D + + ++ L+ + LK LAP +E LI+LDR+ YL+EQ
Sbjct: 381 PDSCIMDYYEKYKHRMNELEAFNMLKVVLAPCVEVLILLDRLCYLKEQ 428
Score = 68.2 bits (165), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 30/71 (42%), Positives = 49/71 (69%)
Query: 666 MSEKKSYEVQVMSQVVAAVTNSCDSSHIIDLGGGQGYLSTILALQHGKKTLSLDYNQVNT 725
M+ KKS+EVQ MS++V + C +ID+G G+GYLS+ L++Q+ K +D ++ NT
Sbjct: 1 MNNKKSHEVQAMSELVDKIAKYCGVEQVIDIGSGKGYLSSFLSMQYNLKVYGIDSSKTNT 60
Query: 726 HGAAVRSKKLE 736
+GA R++KL+
Sbjct: 61 NGAHERNRKLK 71
Score = 41.2 bits (95), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 17/51 (33%), Positives = 31/51 (60%)
Query: 1044 TQVDDEEVTRLFDSHQREYEYLQIYYFLKTALAPVIEALIVLDRVLYLREQ 1094
+++ D + ++ ++ L+ + LK LAP +E LI+LDR+ YL+EQ
Sbjct: 378 SKLPDSCIMDYYEKYKHRMNELEAFNMLKVVLAPCVEVLILLDRLCYLKEQ 428
>gi|47218388|emb|CAG01909.1| unnamed protein product [Tetraodon nigroviridis]
Length = 664
Score = 140 bits (354), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 94/276 (34%), Positives = 142/276 (51%), Gaps = 32/276 (11%)
Query: 832 IMGLHTCGDLSGTALRLF-TKSSLQCLVQVGCCYHLLEEEFIRSPFWKDVDQSLYEHGYG 890
++GLHTCGDL+ + LR+F K+ L + VGCCYHLL E+F S Q + G
Sbjct: 380 MVGLHTCGDLAPSTLRMFVAKAELAAVCSVGCCYHLLSEQFDPSA------QECLQGACG 433
Query: 891 FPLSEHLRSRKFFLGRNVRMSGTQSPERVIDLKQTQTLPLFYRALLEKYLRSKITINDEE 950
FPLS++L R F GRN RMS + ERV + Q LF+RA+L L+ ++ E
Sbjct: 434 FPLSQYLLQRSCFCGRNARMSACLALERVSLGQGIQMESLFFRAVLHVILKERLGFVRSE 493
Query: 951 PKVVGRLATKCSNFVEYVHRAVDKLKLD----------LEVDDEEVTRLFNSHQREYEYL 1000
K VG + +K +FV+YV RA+ +L+LD + + + +L S +
Sbjct: 494 -KRVGNVYSKTKSFVDYVRRALSRLELDGSKVEPLGFLCKSEGNVLGKLLKSCGVAGQQC 552
Query: 1001 Q-IYYFLKTALAPVIEALIVLDRVLYLREQQKPQIISNLFLSLLTQVDDEEVTRLFDSHQ 1059
Q + LKT R + + QI F S+ +Q+ D+++ D++
Sbjct: 553 QKLRASLKT-------------RAMCVSVSPHSQIPYKSFYSVSSQLSDDDIQGYHDAYA 599
Query: 1060 REYEYLQIYYFLKTALAPVIEALIVLDRVLYLREQQ 1095
+ + + LK ALAP IE LI+LDR+ YL+EQ+
Sbjct: 600 PRMDEMHAFNMLKVALAPCIEGLILLDRLCYLKEQE 635
Score = 73.6 bits (179), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 40/102 (39%), Positives = 61/102 (59%), Gaps = 1/102 (0%)
Query: 635 LGSTITVDELFSKLQARKCGLVVESAKVSQLMSEKKSYEVQVMSQVVAAVTNSCDSSHII 694
LG ++ +EL L K G + M+ KKS+EVQ MS+VVA + C +I
Sbjct: 102 LGICMSREELLLAL-GEKTGDTGAEWVPDEFMNSKKSHEVQSMSEVVARLAQHCGVKQVI 160
Query: 695 DLGGGQGYLSTILALQHGKKTLSLDYNQVNTHGAAVRSKKLE 736
D+G G+GYLS+ L+L++G + +D + NTHGA R++KL+
Sbjct: 161 DVGSGKGYLSSSLSLRYGLQVYGIDSSSTNTHGAQERNRKLK 202
Score = 53.5 bits (127), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 46/163 (28%), Positives = 69/163 (42%), Gaps = 10/163 (6%)
Query: 481 TMELIKNTIDEMIEFITPNIACVNTHTVHFMVEKLWETTI---PKAILSQVNNSPSFDSL 537
T+ I IDE+ F++ ++ N HTV F +W + P+ +L + S
Sbjct: 6 TLPEIHRRIDEVKRFLSITLSIANAHTVEFYTHDVWNRFMAVDPQEVLHAFSASNDQQMA 65
Query: 538 LEEFWKSRQDNHVETNSELV---KFFQAADKFRLTSLMNGESIITVDELFSKLQARKCGQ 594
E K + LV + QAA L L ++ +EL L K G
Sbjct: 66 PEHHIKGTTFGFCNDTNLLVDPRELLQAATAHSLPGL---GICMSREELLLAL-GEKTGD 121
Query: 595 VVESAKVSQLMSEKKSYEVQVMSQVVAAVTNSCDSSHIIDLGS 637
+ M+ KKS+EVQ MS+VVA + C +ID+GS
Sbjct: 122 TGAEWVPDEFMNSKKSHEVQSMSEVVARLAQHCGVKQVIDVGS 164
>gi|327272830|ref|XP_003221187.1| PREDICTED: uncharacterized protein C12orf26-like [Anolis
carolinensis]
Length = 626
Score = 140 bits (353), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 78/228 (34%), Positives = 133/228 (58%), Gaps = 16/228 (7%)
Query: 804 ITHFVTPDSDISSILSQAYPQDSLHNVCIMGLHTCGDLSGTALRLF-TKSSLQCLVQVGC 862
+T ++T ++++ I+ QD L + ++GLHTCGDL+ + LRLF K ++ + VGC
Sbjct: 376 LTSYITANTELRDIV-----QD-LEDCMLVGLHTCGDLAPSTLRLFNAKPEIKAVCSVGC 429
Query: 863 CYHLLEEEFIRSPFWKDVDQSLYEHGYGFPLSEHLRSRKFFLGRNVRMSGTQSPERVIDL 922
CYHLL EE ++++ +GFPL +L+ +F GRN RM+ + ERV
Sbjct: 430 CYHLLSEEH-------EIEKDGTHRVWGFPLCRYLKEEGWFCGRNARMAACLALERVAVG 482
Query: 923 KQTQTLPLFYRALLEKYLRSKITINDEEPKVVGRLATKCSNFVEYVHRAVDKLKLD-LEV 981
+ T LFYRA+L+ + + + + + +G++ +K S+F++YV +++ KL+LD ++
Sbjct: 483 QMLPTESLFYRAVLQVIVVEEYGVKRSD-RNIGKVYSKTSSFLDYVRKSLKKLELDESKI 541
Query: 982 DDEEVTRLFNSHQREYEYLQIYYFLKTALAPVIEALIVLDRVLYLREQ 1029
D V F ++ ++ + LK L P IE LI+LDR+ YL+EQ
Sbjct: 542 SDSRVMEYFEKYKHRMNEMEAFNMLKVVLGPCIEVLILLDRLCYLQEQ 589
Score = 87.0 bits (214), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 62/203 (30%), Positives = 94/203 (46%), Gaps = 29/203 (14%)
Query: 634 DLGSTITVDELFSKLQARKCGLVVESAKVSQLMSEKKSYEVQVMSQVVAAVTNSCDSSHI 693
+LG I ++++ L ++ K + M+ KKS+EVQ+MS++V + C S +
Sbjct: 123 NLGVCIPLEQMLEILNRKQQPESGTEMKTDEFMNNKKSHEVQLMSELVYGIAKYCGLSQV 182
Query: 694 IDLGGGQGYLSTILALQHGKKTLSLDYNQVNTHGAAVRSKKLEDSEKESKGPFQSYAGVI 753
IDLG G+GYLS+ L+LQ+G K +D + NTHGA R++KL
Sbjct: 183 IDLGSGKGYLSSFLSLQYGLKVYGIDSSNTNTHGANERNRKL------------------ 224
Query: 754 NKKLWMRTQNRQTSHASVEHH----GKNWKRKSKAPVVSIKDEELVVCKDKCKQITHFVT 809
KK W Q R + A+V H K + K + E L+ + +
Sbjct: 225 -KKHWTAYQRR--ARANVRGHVLEEKKERPEQDKEKCKANSSENLLSTNNSLQSQAQVAA 281
Query: 810 PDSDISSILSQAYPQDSLHNVCI 832
PD SILSQ + N+CI
Sbjct: 282 PD----SILSQTADVFTDSNICI 300
Score = 51.2 bits (121), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 43/168 (25%), Positives = 72/168 (42%), Gaps = 15/168 (8%)
Query: 481 TMELIKNTIDEMIEFITPNIACVNTHTVHFMVEKLWETTI---PKAILSQVNNSPSFDSL 537
+ E + I + F+ + HTV F LWE + P+ +L + S D L
Sbjct: 24 SAERVAARIRAVARFLRRALPLCLAHTVDFYTRGLWEKMVALPPETVLEAL----SGDRL 79
Query: 538 --------LEEFWKSRQDNHVETNSELVKFFQAADKFRLTSLMNGESIITVDELFSKLQA 589
LEE + E + L+ A + S+ N I ++++ L
Sbjct: 80 RRLLEQRPLEEPAGEFYNIFCEKSQRLINVQAFALAAKCYSMPNLGVCIPLEQMLEILNR 139
Query: 590 RKCGQVVESAKVSQLMSEKKSYEVQVMSQVVAAVTNSCDSSHIIDLGS 637
++ + K + M+ KKS+EVQ+MS++V + C S +IDLGS
Sbjct: 140 KQQPESGTEMKTDEFMNNKKSHEVQLMSELVYGIAKYCGLSQVIDLGS 187
Score = 43.1 bits (100), Expect = 0.92, Method: Compositional matrix adjust.
Identities = 18/51 (35%), Positives = 30/51 (58%)
Query: 1044 TQVDDEEVTRLFDSHQREYEYLQIYYFLKTALAPVIEALIVLDRVLYLREQ 1094
+++ D V F+ ++ ++ + LK L P IE LI+LDR+ YL+EQ
Sbjct: 539 SKISDSRVMEYFEKYKHRMNEMEAFNMLKVVLGPCIEVLILLDRLCYLQEQ 589
>gi|147826468|emb|CAN66105.1| hypothetical protein VITISV_012700 [Vitis vinifera]
Length = 344
Score = 140 bits (353), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 68/104 (65%), Positives = 77/104 (74%), Gaps = 4/104 (3%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF KVG+ ++VYTGDYNMTPDRHLGAA ID+ + DLLITESTYATT+RDSK RER+FL
Sbjct: 225 MFYAKVGDAAMVYTGDYNMTPDRHLGAAQIDRLQLDLLITESTYATTVRDSKYAREREFL 284
Query: 61 KKVHECVDRGGKVLIPVFALGRAQE----LCILLETYWERMNLQ 100
K VH+CV GGKVLIP FALGRAQ L+ W R Q
Sbjct: 285 KAVHKCVADGGKVLIPTFALGRAQNQRYLTAFLISLVWXRKGTQ 328
>gi|14520957|ref|NP_126432.1| mRNA 3'-end processing factor, [Pyrococcus abyssi GE5]
gi|5458174|emb|CAB49663.1| Cleavage and polyadenylation specficity factor [Pyrococcus abyssi
GE5]
Length = 651
Score = 140 bits (353), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 94/296 (31%), Positives = 152/296 (51%), Gaps = 19/296 (6%)
Query: 1 MFQVKVGN--QSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTY--ATTIRDSKRCRE 56
+ + +GN +I TGD+ P R L A R + L+ ESTY A I+ + E
Sbjct: 350 IVHLHIGNGLHNIAITGDFKFIPTRLLEPANAKFPRLETLVMESTYGGANDIQMPREEAE 409
Query: 57 RDFLKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNY 116
+ ++ +H+ + RGGKVLIP A+GRAQE+ ++LE Y + APIY G+ +AT
Sbjct: 410 KRLIEVIHQTLKRGGKVLIPAMAVGRAQEVMMVLEDYARIGAIDAPIYLD-GMIWEATAI 468
Query: 117 YKMFITWTNQKIRKTFVQR--NMFDFKHIRPFDKS-----FIDNPGPMVVFATPGMLHSG 169
+ + + ++++R+ + N F + P S ID+ P ++ A+ GML G
Sbjct: 469 HTAYPEYLSRRLREQIFKEGYNPFLSEIFHPVANSKERQDIIDSNEPAIIIASSGMLVGG 528
Query: 170 LSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKL----EFENKQIIDVKMAIEYM 225
S+ FK+ AP N +I + +GT+G +V SGV+++ E ++I V M + +
Sbjct: 529 PSVEYFKQLAPDPRNSIIFVSYQAEGTLGRQVQSGVREIPMVGEEGRTEVIKVNMEVHTI 588
Query: 226 -SFSAHADAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPAN 278
FS HAD + +M + P+ V+ VHGE K + L I ++F L P N
Sbjct: 589 DGFSGHADRRELMNYVAKVRPRPERVITVHGEPQKCLDLATSIHRKFGLSTRAPNN 644
>gi|410965192|ref|XP_003989134.1| PREDICTED: uncharacterized protein C12orf26 homolog [Felis catus]
Length = 602
Score = 140 bits (352), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 78/229 (34%), Positives = 134/229 (58%), Gaps = 15/229 (6%)
Query: 804 ITHFVTPDSDISSILSQAYPQDSLHNVCIMGLHTCGDLSGTALRLFT-KSSLQCLVQVGC 862
+T F+T DS++ I+ L + ++GLHTCGDL+ LR+FT + ++ + VGC
Sbjct: 358 LTSFITADSELHDIIKD------LEDCLMVGLHTCGDLAPNTLRIFTSQPEIKGVCSVGC 411
Query: 863 CYHLLEEEFIRSPFWKDVDQSLYEHGYGFPLSEHLRSRKFFLGRNVRMSGTQSPERVIDL 922
CYHLL EEF +++ + + +GFP+ +L+ ++ GRN RMS + ERV
Sbjct: 412 CYHLLSEEF------ENLHKESTQDKWGFPMCHYLKEERWCCGRNARMSACLALERVAVG 465
Query: 923 KQTQTLPLFYRALLEKYLRSKITINDEEPKVVGRLATKCSNFVEYVHRAVDKLKLD-LEV 981
+ T LFYRA+L+ ++ I + + VG++ +K S+F++Y +++ KL LD ++
Sbjct: 466 QGLPTESLFYRAVLQDIIKECYGITKCD-RHVGKIYSKSSSFLDYARKSLKKLGLDESKL 524
Query: 982 DDEEVTRLFNSHQREYEYLQIYYFLKTALAPVIEALIVLDRVLYLREQQ 1030
++ + + ++ L+ + LK LAP IE LI+LDR+ YL+EQ+
Sbjct: 525 PEKVIMDYYEKYKPRMNELEAFNMLKVVLAPCIETLILLDRLCYLKEQE 573
Score = 84.3 bits (207), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 39/103 (37%), Positives = 67/103 (65%)
Query: 634 DLGSTITVDELFSKLQARKCGLVVESAKVSQLMSEKKSYEVQVMSQVVAAVTNSCDSSHI 693
+LG ++L + L+ + E+ K + M+ KKS+EVQVMS++++++ + C +
Sbjct: 119 NLGICTPFEQLLTALRGNQKQRTGENVKPDEFMNLKKSHEVQVMSELISSIADYCGIKQV 178
Query: 694 IDLGGGQGYLSTILALQHGKKTLSLDYNQVNTHGAAVRSKKLE 736
IDLG G+GYLS+ L+L++G K +D + NTHGA R++KL+
Sbjct: 179 IDLGSGKGYLSSFLSLKYGLKVYGIDSSNTNTHGAKERNRKLK 221
Score = 55.1 bits (131), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 42/173 (24%), Positives = 82/173 (47%), Gaps = 15/173 (8%)
Query: 480 ETMELIKNTIDEMIEFITPNIACVNTHTVHFMVEKLWETTI---PKAILSQVNNS----- 531
+ + ++ + +++F+ ++ + HTV F E +W+ + P+ L + S
Sbjct: 11 QDLSTLRAKLQGLLQFLRQALSISSAHTVDFYTESVWKQLVDLPPETALRVLRRSAAEAE 70
Query: 532 ---PSFDSLLEEFWKSRQDNH----VETNSELVKFFQAADKFRLTSLMNGESIITVDELF 584
P L+E S + ET+ +LV A R S+ N ++L
Sbjct: 71 ARSPEARLLVEAEMGSGIASFPQIFCETSQKLVSVEAFALAVRYYSVQNLGICTPFEQLL 130
Query: 585 SKLQARKCGQVVESAKVSQLMSEKKSYEVQVMSQVVAAVTNSCDSSHIIDLGS 637
+ L+ + + E+ K + M+ KKS+EVQVMS++++++ + C +IDLGS
Sbjct: 131 TALRGNQKQRTGENVKPDEFMNLKKSHEVQVMSELISSIADYCGIKQVIDLGS 183
Score = 40.0 bits (92), Expect = 7.1, Method: Compositional matrix adjust.
Identities = 17/41 (41%), Positives = 27/41 (65%)
Query: 1055 FDSHQREYEYLQIYYFLKTALAPVIEALIVLDRVLYLREQQ 1095
++ ++ L+ + LK LAP IE LI+LDR+ YL+EQ+
Sbjct: 533 YEKYKPRMNELEAFNMLKVVLAPCIETLILLDRLCYLKEQE 573
>gi|380741511|tpe|CCE70145.1| TPA: mRNA 3'-end processing factor, putative [Pyrococcus abyssi
GE5]
Length = 648
Score = 140 bits (352), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 94/296 (31%), Positives = 152/296 (51%), Gaps = 19/296 (6%)
Query: 1 MFQVKVGN--QSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTY--ATTIRDSKRCRE 56
+ + +GN +I TGD+ P R L A R + L+ ESTY A I+ + E
Sbjct: 347 IVHLHIGNGLHNIAITGDFKFIPTRLLEPANAKFPRLETLVMESTYGGANDIQMPREEAE 406
Query: 57 RDFLKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNY 116
+ ++ +H+ + RGGKVLIP A+GRAQE+ ++LE Y + APIY G+ +AT
Sbjct: 407 KRLIEVIHQTLKRGGKVLIPAMAVGRAQEVMMVLEDYARIGAIDAPIYLD-GMIWEATAI 465
Query: 117 YKMFITWTNQKIRKTFVQR--NMFDFKHIRPFDKS-----FIDNPGPMVVFATPGMLHSG 169
+ + + ++++R+ + N F + P S ID+ P ++ A+ GML G
Sbjct: 466 HTAYPEYLSRRLREQIFKEGYNPFLSEIFHPVANSKERQDIIDSNEPAIIIASSGMLVGG 525
Query: 170 LSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKL----EFENKQIIDVKMAIEYM 225
S+ FK+ AP N +I + +GT+G +V SGV+++ E ++I V M + +
Sbjct: 526 PSVEYFKQLAPDPRNSIIFVSYQAEGTLGRQVQSGVREIPMVGEEGRTEVIKVNMEVHTI 585
Query: 226 -SFSAHADAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPAN 278
FS HAD + +M + P+ V+ VHGE K + L I ++F L P N
Sbjct: 586 DGFSGHADRRELMNYVAKVRPRPERVITVHGEPQKCLDLATSIHRKFGLSTRAPNN 641
>gi|409095356|ref|ZP_11215380.1| RNA-metabolising metallo-beta-lactamase, beta-CASP family protein
[Thermococcus zilligii AN1]
Length = 648
Score = 139 bits (351), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 94/299 (31%), Positives = 152/299 (50%), Gaps = 25/299 (8%)
Query: 1 MFQVKVGN--QSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRE-- 56
+ + +GN +I TGD+ P R A R + LI ESTY D + RE
Sbjct: 347 IVHLHIGNGLHNIAVTGDFKFIPTRLFEPATSKFPRLETLIMESTYGAG-NDYQMPREEA 405
Query: 57 -RDFLKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATN 115
+ ++ +H+ + RGGKVLIP A+GRAQE+ ++LE Y +L+ PIY G+ +AT
Sbjct: 406 EKRLMEVIHQTIKRGGKVLIPAMAVGRAQEIMMVLEEYARVGSLEVPIYLD-GMIWEATA 464
Query: 116 YYKMFITWTNQKIRKTFVQRNMFDFKHIRPFDKS---------FIDNPGPMVVFATPGML 166
+ + + ++ +R+ F + P KS ID+ P ++ AT GML
Sbjct: 465 IHTAYPEYLSKNLREQIFHEGYNPF--LNPIFKSVANGRERQDIIDSGEPAIIIATSGML 522
Query: 167 HSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKL----EFENKQIIDVKMAI 222
G S+ FK+ AP N +I+ + +GT+G +V G+K++ E ++I+V M +
Sbjct: 523 VGGPSVEYFKQLAPDPKNSIILVSYQSEGTLGRQVQRGLKEIPLVGEDGRTEVINVNMEV 582
Query: 223 EYM-SFSAHADAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPAN 278
+ FS HAD +M + +P+ ++ VHGEA K + L I ++F L P+N
Sbjct: 583 HTIEGFSGHADRGELMSYVAKLKPRPERIITVHGEAQKCLDLASSIHKKFGLPTRAPSN 641
>gi|289191606|ref|YP_003457547.1| RNA-metabolising metallo-beta-lactamase [Methanocaldococcus sp.
FS406-22]
gi|288938056|gb|ADC68811.1| RNA-metabolising metallo-beta-lactamase [Methanocaldococcus sp.
FS406-22]
Length = 421
Score = 139 bits (350), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 87/277 (31%), Positives = 151/277 (54%), Gaps = 10/277 (3%)
Query: 8 NQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATT--IRDSKRCRERDFLKKVHE 65
+ I+YTGD N R L A D D+LI ESTY + I+ +++ E+ ++++ E
Sbjct: 145 GKKILYTGDINEGVSRTLLPADTDIDEIDVLIIESTYGSPLDIKPARKTLEKQLIEEIAE 204
Query: 66 CVDRGGKVLIPVFALGRAQELCILLETYWERMNL-QAPIYFAVGLTEKATNYYKMFITWT 124
++ GGKV+IPVFA+GRAQE+ +++ Y L + PIY L AT Y +I W
Sbjct: 205 TIENGGKVIIPVFAIGRAQEILLIINNYMRSGGLKEVPIYTDGSLIH-ATAVYMSYINWL 263
Query: 125 NQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAPVENN 184
N KI K V+ + F I+ D++ + N P ++ +T GM+ G ++ + K N
Sbjct: 264 NPKI-KNMVENRINPFGEIKKADENLVFNKEPCIIVSTSGMVQGG-PVLKYLKLLKDPKN 321
Query: 185 MLIMPGFCVQGTIGHKVLSGVKKLE-FENKQIIDVKMAIEYMSFSAHADAKGIMQLIQYC 243
LI+ G+ +GT+G ++ G K+++ F+NK I ++ + + FSAH D +++ I+
Sbjct: 322 KLILTGYQAEGTLGRELEEGAKEIQPFKNK--IPIRGKVVKIEFSAHGDYNSLVRYIKKI 379
Query: 244 -EPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANG 279
+P+ +++HGE + + I + + F+P G
Sbjct: 380 PKPEKAIVMHGERYQSLSFAMTIWKTLKIPTFVPVRG 416
>gi|374636870|ref|ZP_09708413.1| RNA-metabolising metallo-beta-lactamase [Methanotorris formicicus
Mc-S-70]
gi|373557613|gb|EHP84014.1| RNA-metabolising metallo-beta-lactamase [Methanotorris formicicus
Mc-S-70]
Length = 419
Score = 139 bits (350), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 93/284 (32%), Positives = 148/284 (52%), Gaps = 21/284 (7%)
Query: 6 VGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATT--IRDSKRCRERDFLKKV 63
+ + I+YTGD N TP R L A D D+LI ESTY + I+ S++ E+ + ++
Sbjct: 142 IDGKKILYTGDINETPTRTLLPADKDIDEIDVLIIESTYGSPLDIKPSRKTLEKQLIDEI 201
Query: 64 HECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQ-APIYFAVGLTEKATNYYKMFIT 122
E ++ GGKV+IPVFA+GRAQE+ +++ Y L+ P+Y L AT Y ++
Sbjct: 202 AETIENGGKVIIPVFAIGRAQEILLIINNYIRSGKLKDVPVYVDGSLIH-ATGIYMSYLD 260
Query: 123 WTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSG-----LSLIIFKK 177
W N K+ K V+ + F ++ D N P ++ +T GM+ G LSL+ K
Sbjct: 261 WLNPKL-KNMVENRINPFGELKKADNGVF-NKEPCIIVSTSGMVQGGPVLQYLSLLKSPK 318
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKL-EFENKQIIDVKMAIEYMSFSAHADAKGI 236
N LI+ G+ +GT+G + GV+++ F+NK I V+ + + FSAH D +
Sbjct: 319 ------NKLILTGYQAEGTLGRALEEGVEEITPFKNK--IPVRGKVVKIEFSAHGDYNSL 370
Query: 237 MQLIQYCE-PKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANG 279
++ I+ PK +VHGE + + I + + F P NG
Sbjct: 371 VRYIKKIPTPKKAFVVHGERYQALSFAMTIWKSLKIPTFAPVNG 414
>gi|359065221|ref|XP_002687238.2| PREDICTED: uncharacterized protein C12orf26 [Bos taurus]
Length = 615
Score = 139 bits (349), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 80/228 (35%), Positives = 132/228 (57%), Gaps = 15/228 (6%)
Query: 804 ITHFVTPDSDISSILSQAYPQDSLHNVCIMGLHTCGDLSGTALRLFT-KSSLQCLVQVGC 862
+T +T DS++ I+ L + ++GLHTCGDL+ LR+FT KS ++ + VGC
Sbjct: 371 LTSIITADSELHDIIKD------LEDCLMVGLHTCGDLAPNTLRIFTSKSEVKGVCSVGC 424
Query: 863 CYHLLEEEFIRSPFWKDVDQSLYEHGYGFPLSEHLRSRKFFLGRNVRMSGTQSPERVIDL 922
CYHLL EEF +++ + + GFP+ +L+ ++ GRN RMS + ERV
Sbjct: 425 CYHLLSEEF------ENLHKEYTQERRGFPMCHYLKEERWCCGRNARMSACLALERVAVG 478
Query: 923 KQTQTLPLFYRALLEKYLRSKITINDEEPKVVGRLATKCSNFVEYVHRAVDKLKLD-LEV 981
+ T LFYRA+L+ ++ I + + VG++ +K S+F +YV +++ KL LD ++
Sbjct: 479 QGLPTESLFYRAILQDIIKECYGITKCD-RHVGKIYSKSSSFPDYVRKSLKKLGLDESKL 537
Query: 982 DDEEVTRLFNSHQREYEYLQIYYFLKTALAPVIEALIVLDRVLYLREQ 1029
++ + + ++ L+ + LK LAP IE LI+LDR+ YL+EQ
Sbjct: 538 PEKIIMDYYEKYKPRMNELEAFNMLKVVLAPCIETLILLDRLCYLKEQ 585
Score = 45.4 bits (106), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 693 IIDLGGGQGYLSTILALQHGKKTLSLDYNQVNTHGAAVRSKKLE 736
IIDLG G+GYLS+ L+L++G K +D + NT GA R++KL+
Sbjct: 194 IIDLGSGKGYLSSFLSLKYGLKVYGIDSS--NTRGAEERNRKLK 235
>gi|256810164|ref|YP_003127533.1| beta-lactamase [Methanocaldococcus fervens AG86]
gi|256793364|gb|ACV24033.1| beta-lactamase domain protein [Methanocaldococcus fervens AG86]
Length = 421
Score = 139 bits (349), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 87/277 (31%), Positives = 152/277 (54%), Gaps = 10/277 (3%)
Query: 8 NQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATT--IRDSKRCRERDFLKKVHE 65
+ I+YTGD N R L A D D+LI ESTY ++ I+ +++ E+ ++++ E
Sbjct: 145 GKKILYTGDINEGISRTLLPADTDIDEIDVLIIESTYGSSLDIKPARKTLEKQLIEEIAE 204
Query: 66 CVDRGGKVLIPVFALGRAQELCILLETYWERMNL-QAPIYFAVGLTEKATNYYKMFITWT 124
++ GGKV+IPVFA+GRAQE+ +++ Y L + PIY L AT Y +I W
Sbjct: 205 TIENGGKVIIPVFAIGRAQEILLIINNYMRSGKLKEVPIYTDGALIH-ATAVYLSYINWL 263
Query: 125 NQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAPVENN 184
N KI K V+ + F ++ D+S + N P ++ +T GM+ G ++ + K N
Sbjct: 264 NPKI-KNMVENRINPFGVVKKADESLVFNKEPCIIVSTSGMVQGG-PVLKYLKLLKDPKN 321
Query: 185 MLIMPGFCVQGTIGHKVLSGVKKLE-FENKQIIDVKMAIEYMSFSAHADAKGIMQLIQYC 243
LI+ G+ +GT+G ++ G K+++ F+NK I V+ + + FSAH D +++ I+
Sbjct: 322 KLILTGYQAEGTLGRELEEGAKEIQPFKNK--IPVRGKVVKIEFSAHGDYNSLVRYIKKI 379
Query: 244 -EPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANG 279
+P+ +++HGE + + I + + ++P G
Sbjct: 380 PKPEKAIVMHGERYQSLSFAMTIWKTLKIPTYVPVKG 416
>gi|194666993|ref|XP_001790025.1| PREDICTED: uncharacterized protein C12orf26 [Bos taurus]
Length = 599
Score = 139 bits (349), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 80/228 (35%), Positives = 132/228 (57%), Gaps = 15/228 (6%)
Query: 804 ITHFVTPDSDISSILSQAYPQDSLHNVCIMGLHTCGDLSGTALRLFT-KSSLQCLVQVGC 862
+T +T DS++ I+ L + ++GLHTCGDL+ LR+FT KS ++ + VGC
Sbjct: 355 LTSIITADSELHDIIKD------LEDCLMVGLHTCGDLAPNTLRIFTSKSEVKGVCSVGC 408
Query: 863 CYHLLEEEFIRSPFWKDVDQSLYEHGYGFPLSEHLRSRKFFLGRNVRMSGTQSPERVIDL 922
CYHLL EEF +++ + + GFP+ +L+ ++ GRN RMS + ERV
Sbjct: 409 CYHLLSEEF------ENLHKEYTQERRGFPMCHYLKEERWCCGRNARMSACLALERVAVG 462
Query: 923 KQTQTLPLFYRALLEKYLRSKITINDEEPKVVGRLATKCSNFVEYVHRAVDKLKLD-LEV 981
+ T LFYRA+L+ ++ I + + VG++ +K S+F +YV +++ KL LD ++
Sbjct: 463 QGLPTESLFYRAILQDIIKECYGITKCD-RHVGKIYSKSSSFPDYVRKSLKKLGLDESKL 521
Query: 982 DDEEVTRLFNSHQREYEYLQIYYFLKTALAPVIEALIVLDRVLYLREQ 1029
++ + + ++ L+ + LK LAP IE LI+LDR+ YL+EQ
Sbjct: 522 PEKIIMDYYEKYKPRMNELEAFNMLKVVLAPCIETLILLDRLCYLKEQ 569
Score = 81.6 bits (200), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 67/251 (26%), Positives = 111/251 (44%), Gaps = 54/251 (21%)
Query: 489 IDEMIEFITPNIACVNTHTVHFMVEKLWETTI---PKAILSQVNNSPSFDSLLEEFWKSR 545
+ +++ F+ + N HTV F E +WE + P+ +LS + S +
Sbjct: 20 LQKLLRFLREALPISNAHTVDFYTESVWEKLVDLPPETVLSALRTSAA------------ 67
Query: 546 QDNHVETNSELVKFFQAADKFRLTSLMNGESIITVDELFSKLQARKCGQVVESAKVSQLM 605
SE +A M+GE + V + F K+ ++V V
Sbjct: 68 -------ASEACPLEEAGR-------MSGEWRVDVTD-FPKIFCETSQKLV---NVEAFA 109
Query: 606 SEKKSYEVQVMSQVVAAVTNSCDSSHIIDLGSTITVDELFSKLQARKCGLVVESAKVSQL 665
K Y VQ +LG + D+L L + E+ K+ +
Sbjct: 110 LAAKYYSVQ-------------------NLGICTSFDQLLVALHGNQKQKTDENVKLDEF 150
Query: 666 MSEKKSYEVQVMSQVVAAVTNSCDSSHIIDLGGGQGYLSTILALQHGKKTLSLDYNQVNT 725
M+ KKS+EVQ MS++++++ C IIDLG G+GYLS+ L+L++G K +D + NT
Sbjct: 151 MNLKKSHEVQAMSELISSIAEYCGIKQIIDLGSGKGYLSSFLSLKYGLKVYGIDSS--NT 208
Query: 726 HGAAVRSKKLE 736
GA R++KL+
Sbjct: 209 RGAEERNRKLK 219
>gi|449435478|ref|XP_004135522.1| PREDICTED: LOW QUALITY PROTEIN: cleavage and polyadenylation
specificity factor subunit 3-I-like [Cucumis sativus]
Length = 481
Score = 138 bits (348), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 73/183 (39%), Positives = 103/183 (56%), Gaps = 9/183 (4%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V + ++YTGDY+ DRHL AA + + PD+ I ESTY + + RE+ F
Sbjct: 177 MFMVDIAGVRVLYTGDYSREEDRHLRAAEMPQFSPDVCIIESTYGVQLHQPRHIREKRFT 236
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW----ERMNLQAPIYFAVGLTEKATNY 116
VH + +GG+VLIP FALGRAQEL ++L+ YW E N+ PIY+A L ++
Sbjct: 237 DVVHSTISQGGRVLIPAFALGRAQELLLILDEYWANHPELHNI--PIYYASPLAKRCLTV 294
Query: 117 YKMFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIF 175
Y+ + N +I+ + N F FK+I P + GP VV A+P L SGLS +F
Sbjct: 295 YETYTLSMNDRIQNA--KSNPFRFKYISPLKSIEVFKDVGPSVVMASPSGLQSGLSRQLF 352
Query: 176 KKW 178
+ W
Sbjct: 353 EMW 355
>gi|296487995|tpg|DAA30108.1| TPA: hypothetical protein BOS_4910 [Bos taurus]
Length = 507
Score = 138 bits (348), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 80/228 (35%), Positives = 132/228 (57%), Gaps = 15/228 (6%)
Query: 804 ITHFVTPDSDISSILSQAYPQDSLHNVCIMGLHTCGDLSGTALRLFT-KSSLQCLVQVGC 862
+T +T DS++ I+ L + ++GLHTCGDL+ LR+FT KS ++ + VGC
Sbjct: 263 LTSIITADSELHDIIKD------LEDCLMVGLHTCGDLAPNTLRIFTSKSEVKGVCSVGC 316
Query: 863 CYHLLEEEFIRSPFWKDVDQSLYEHGYGFPLSEHLRSRKFFLGRNVRMSGTQSPERVIDL 922
CYHLL EEF +++ + + GFP+ +L+ ++ GRN RMS + ERV
Sbjct: 317 CYHLLSEEF------ENLHKEYTQERRGFPMCHYLKEERWCCGRNARMSACLALERVAVG 370
Query: 923 KQTQTLPLFYRALLEKYLRSKITINDEEPKVVGRLATKCSNFVEYVHRAVDKLKLD-LEV 981
+ T LFYRA+L+ ++ I + + VG++ +K S+F +YV +++ KL LD ++
Sbjct: 371 QGLPTESLFYRAILQDIIKECYGITKCD-RHVGKIYSKSSSFPDYVRKSLKKLGLDESKL 429
Query: 982 DDEEVTRLFNSHQREYEYLQIYYFLKTALAPVIEALIVLDRVLYLREQ 1029
++ + + ++ L+ + LK LAP IE LI+LDR+ YL+EQ
Sbjct: 430 PEKIIMDYYEKYKPRMNELEAFNMLKVVLAPCIETLILLDRLCYLKEQ 477
Score = 45.8 bits (107), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 22/44 (50%), Positives = 32/44 (72%), Gaps = 2/44 (4%)
Query: 693 IIDLGGGQGYLSTILALQHGKKTLSLDYNQVNTHGAAVRSKKLE 736
IIDLG G+GYLS+ L+L++G K +D + NT GA R++KL+
Sbjct: 86 IIDLGSGKGYLSSFLSLKYGLKVYGIDSS--NTRGAEERNRKLK 127
>gi|426224231|ref|XP_004006277.1| PREDICTED: LOW QUALITY PROTEIN: methyltransferase-like protein 25
[Ovis aries]
Length = 601
Score = 138 bits (347), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 80/228 (35%), Positives = 131/228 (57%), Gaps = 15/228 (6%)
Query: 804 ITHFVTPDSDISSILSQAYPQDSLHNVCIMGLHTCGDLSGTALRLFT-KSSLQCLVQVGC 862
+T +T DS++ I+ L + ++GLHTCGDL+ LR+FT KS ++ + VGC
Sbjct: 357 LTSIITADSELHDIIKD------LEDCLMVGLHTCGDLAPNTLRIFTSKSEVKGICSVGC 410
Query: 863 CYHLLEEEFIRSPFWKDVDQSLYEHGYGFPLSEHLRSRKFFLGRNVRMSGTQSPERVIDL 922
CYHLL EEF ++ + + GFP+ +L+ ++ GRN RMS + ERV
Sbjct: 411 CYHLLSEEF------ENPHKEYTQERRGFPMCHYLKEERWCCGRNARMSACLALERVAVG 464
Query: 923 KQTQTLPLFYRALLEKYLRSKITINDEEPKVVGRLATKCSNFVEYVHRAVDKLKLD-LEV 981
+ T LFYRA+L+ ++ I + + VG++ +K S+F +YV +++ KL LD ++
Sbjct: 465 QGLPTESLFYRAVLQDIIKECYGITKCD-RHVGKIYSKSSSFPDYVRKSLKKLGLDESKL 523
Query: 982 DDEEVTRLFNSHQREYEYLQIYYFLKTALAPVIEALIVLDRVLYLREQ 1029
++ + + ++ L+ + LK LAP IE LI+LDR+ YL+EQ
Sbjct: 524 PEKIIMDYYEKYKPRMNELEAFNMLKVVLAPCIETLILLDRLCYLKEQ 571
Score = 81.3 bits (199), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 64/260 (24%), Positives = 112/260 (43%), Gaps = 54/260 (20%)
Query: 480 ETMELIKNTIDEMIEFITPNIACVNTHTVHFMVEKLWETTI---PKAILSQVNNSPSFDS 536
+ + ++ + +++ F+ + N HTV F E +WE + P+ +LS + S +
Sbjct: 11 QDLSTLRAKLQKLLRFLREALPISNAHTVDFYTESVWEKLVDSPPETVLSALRASAGVAT 70
Query: 537 LLEEFWKSRQDNHVETNSELVKFFQAADKFRLTSLMNGESIITVDELFSKLQARKCGQVV 596
L + + H+ + + +F F KL ++V
Sbjct: 71 AL------KTETHLIVDXDWEAWFTD---------------------FPKLFCETSQKLV 103
Query: 597 ESAKVSQLMSEKKSYEVQVMSQVVAAVTNSCDSSHIIDLGSTITVDELFSKLQARKCGLV 656
V K Y VQ +LG D+L L +
Sbjct: 104 ---NVEAFALAAKCYSVQ-------------------NLGICTPFDQLLVALHGNQKQKT 141
Query: 657 VESAKVSQLMSEKKSYEVQVMSQVVAAVTNSCDSSHIIDLGGGQGYLSTILALQHGKKTL 716
E+ K+ + M+ KKS+EVQ MS++++++ C IIDLG G+GYLS+ L+L++G K
Sbjct: 142 DENVKLDEFMNLKKSHEVQAMSELISSIAEYCGIKQIIDLGSGKGYLSSFLSLKYGLKVY 201
Query: 717 SLDYNQVNTHGAAVRSKKLE 736
+D + NT GA R++KL+
Sbjct: 202 GIDSS--NTRGAEERNRKLK 219
>gi|14591202|ref|NP_143278.1| mRNA 3'-end processing factor [Pyrococcus horikoshii OT3]
gi|294979445|pdb|3AF5|A Chain A, The Crystal Structure Of An Archaeal Cpsf Subunit, Ph1404
From Pyrococcus Horikoshii
gi|294979446|pdb|3AF6|A Chain A, The Crystal Structure Of An Archaeal Cpsf Subunit, Ph1404
From Pyrococcus Horikoshii Complexed With Rna-Analog
gi|3257827|dbj|BAA30510.1| 651aa long hypothetical protein [Pyrococcus horikoshii OT3]
Length = 651
Score = 137 bits (345), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 90/296 (30%), Positives = 151/296 (51%), Gaps = 19/296 (6%)
Query: 1 MFQVKVGN--QSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTY--ATTIRDSKRCRE 56
+ + +GN +I TGD+ P R L A R + L+ ESTY A I+ + E
Sbjct: 350 IVHLHIGNGLHNIAITGDFKFIPTRLLEPANAKFPRLETLVMESTYGGANDIQMPREEAE 409
Query: 57 RDFLKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNY 116
+ ++ +H + RGGKVLIP A+GRAQE+ ++LE Y ++ PIY G+ +AT
Sbjct: 410 KRLIEVIHNTIKRGGKVLIPAMAVGRAQEVMMVLEEYARIGGIEVPIYLD-GMIWEATAI 468
Query: 117 YKMFITWTNQKIRKTFVQR--NMFDFKHIRPFDKS-----FIDNPGPMVVFATPGMLHSG 169
+ + + ++++R+ + N F + P S ID+ P ++ A+ GML G
Sbjct: 469 HTAYPEYLSRRLREQIFKEGYNPFLSEIFHPVANSRERQDIIDSNEPAIIIASSGMLVGG 528
Query: 170 LSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKL----EFENKQIIDVKMAIEYM 225
S+ FK+ AP N +I + +GT+G +V SG++++ E ++I V M + +
Sbjct: 529 PSVEYFKQLAPDPKNSIIFVSYQAEGTLGRQVQSGIREIPMVGEEGRTEVIKVNMEVHTI 588
Query: 226 -SFSAHADAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPAN 278
FS HAD + +M + P+ ++ VHGE K + L I ++F + P N
Sbjct: 589 DGFSGHADRRELMNYVAKVRPRPERIITVHGEPQKCLDLATSIHRKFGISTRAPNN 644
>gi|223478352|ref|YP_002582770.1| RNA-metabolising metallo-beta-lactamase family protein
[Thermococcus sp. AM4]
gi|214033578|gb|EEB74405.1| RNA-metabolising metallo-beta-lactamase family [Thermococcus sp.
AM4]
Length = 648
Score = 137 bits (344), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 94/302 (31%), Positives = 151/302 (50%), Gaps = 31/302 (10%)
Query: 1 MFQVKVGN--QSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRE-- 56
+ + +GN +I TGD+ P R L A R + L+ ESTY + D + RE
Sbjct: 347 IVHLHIGNGLHNIAVTGDFKFIPTRLLEPAVSRFPRVETLVMESTYGGS-NDYQMPREEA 405
Query: 57 -RDFLKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATN 115
+ ++ +H + RGGKVLIP A+GRAQE+ ++LE Y + PIY G+ +AT
Sbjct: 406 EKRLIEVIHHTIKRGGKVLIPAMAVGRAQEIMMVLEEYARVGGIDVPIYLD-GMIWEATA 464
Query: 116 YYKMFITWTNQKIRKTFVQRNMFDFKHIRPFDKS---------FIDNPGPMVVFATPGML 166
+ + + ++++R+ F + P KS ID+ P ++ AT GML
Sbjct: 465 IHTAYPEYLSRRLREQIFHEGYNPF--LNPIFKSVANSRERQDIIDSGEPAIIIATSGML 522
Query: 167 HSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKL----EFENKQIIDVKMAI 222
G S+ FK+ AP E N +I + +GT+G +V G++++ E +++ V M +
Sbjct: 523 VGGPSVEYFKQLAPDEKNSIIFVSYQAEGTLGRQVQRGLREIPLVGEDGRTEVVQVNMEV 582
Query: 223 EYM-SFSAHADAKGIMQLIQYC-----EPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMP 276
+ FS HAD + +LI Y P+ V+ VHGE K + L + ++FNL P
Sbjct: 583 HTIDGFSGHADRR---ELISYVARLRPRPERVITVHGEPHKCLDLASSLHKKFNLSTRAP 639
Query: 277 AN 278
N
Sbjct: 640 NN 641
>gi|18977777|ref|NP_579134.1| cleavage and polyadenylation specifity factor protein [Pyrococcus
furiosus DSM 3638]
gi|18893520|gb|AAL81529.1| cleavage and polyadenylation specifity factor protein [Pyrococcus
furiosus DSM 3638]
Length = 651
Score = 137 bits (344), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 93/296 (31%), Positives = 150/296 (50%), Gaps = 19/296 (6%)
Query: 1 MFQVKVGN--QSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTY--ATTIRDSKRCRE 56
+ + +GN +I TGD+ P R L A R + L+ ESTY A I+ + E
Sbjct: 350 IVHLHIGNGLHNIAVTGDFKFIPTRLLEPASYRFPRLETLVMESTYGGANDIQMPREEAE 409
Query: 57 RDFLKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNY 116
+ ++ +H+ + RGGKVLIP A+GRAQE+ ++LE Y + PIY G+ +AT
Sbjct: 410 KRLIEVIHQTIRRGGKVLIPAMAVGRAQEIMMVLEEYARVGGIDVPIYLD-GMIWEATAI 468
Query: 117 YKMFITWTNQKIRKTFVQR--NMFDFKHIRPFDKS-----FIDNPGPMVVFATPGMLHSG 169
+ + + ++ +R+ + N F + P S ID+ P ++ A+ GML G
Sbjct: 469 HTAYPEYLSKTLREQIFKEDYNPFLSEIFHPVANSKERQDIIDSNEPAIIIASSGMLVGG 528
Query: 170 LSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKL----EFENKQIIDVKMAIEYM 225
S+ FK+ A + N +I + +GT+G +V SGV+++ E ++I V M + +
Sbjct: 529 PSVEYFKQLASDKRNSIIFVSYQAEGTLGRQVQSGVREIPMIGESGRTEVIKVNMEVHTI 588
Query: 226 -SFSAHADAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPAN 278
FS HAD K +M + P+ V+ VHGE K + L I ++F L P N
Sbjct: 589 DGFSGHADRKELMNYVAKVRPRPERVITVHGEPQKCLDLATSIHKKFGLSTRAPNN 644
>gi|397651897|ref|YP_006492478.1| cleavage and polyadenylation specifity factor protein [Pyrococcus
furiosus COM1]
gi|393189488|gb|AFN04186.1| cleavage and polyadenylation specifity factor protein [Pyrococcus
furiosus COM1]
Length = 648
Score = 137 bits (344), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 92/288 (31%), Positives = 146/288 (50%), Gaps = 17/288 (5%)
Query: 7 GNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTY--ATTIRDSKRCRERDFLKKVH 64
G +I TGD+ P R L A R + L+ ESTY A I+ + E+ ++ +H
Sbjct: 355 GLHNIAVTGDFKFIPTRLLEPASYRFPRLETLVMESTYGGANDIQMPREEAEKRLIEVIH 414
Query: 65 ECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMFITWT 124
+ + RGGKVLIP A+GRAQE+ ++LE Y + PIY G+ +AT + + +
Sbjct: 415 QTIRRGGKVLIPAMAVGRAQEIMMVLEEYARVGGIDVPIYLD-GMIWEATAIHTAYPEYL 473
Query: 125 NQKIRKTFVQR--NMFDFKHIRPFDKS-----FIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ +R+ + N F + P S ID+ P ++ A+ GML G S+ FK+
Sbjct: 474 SKTLREQIFKEDYNPFLSEIFHPVANSKERQDIIDSNEPAIIIASSGMLVGGPSVEYFKQ 533
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKL----EFENKQIIDVKMAIEYM-SFSAHAD 232
A + N +I + +GT+G +V SGV+++ E ++I V M + + FS HAD
Sbjct: 534 LASDKRNSIIFVSYQAEGTLGRQVQSGVREIPMIGESGRTEVIKVNMEVHTIDGFSGHAD 593
Query: 233 AKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPAN 278
K +M + P+ V+ VHGE K + L I ++F L P N
Sbjct: 594 RKELMNYVAKVRPRPERVITVHGEPQKCLDLATSIHKKFGLSTRAPNN 641
>gi|57641363|ref|YP_183841.1| cleavage and polyadenylation specficity factor subunit-like protein
[Thermococcus kodakarensis KOD1]
gi|57159687|dbj|BAD85617.1| cleavage and polyadenylation specficity factor subunit homolog
[Thermococcus kodakarensis KOD1]
Length = 648
Score = 136 bits (342), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 91/299 (30%), Positives = 150/299 (50%), Gaps = 25/299 (8%)
Query: 1 MFQVKVGN--QSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRE-- 56
+ + +GN +I TGD+ P R A R + L+ ESTY + D + RE
Sbjct: 347 IVHLHIGNGLHNIAITGDFKFIPTRLFEPAVSRFPRLETLVMESTYGGS-NDYQMPREEA 405
Query: 57 -RDFLKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATN 115
+ ++ +H+ + RGGKVLIP A+GRAQE+ ++LE Y ++ PIY G+ +AT
Sbjct: 406 EKRLIEVIHQTLKRGGKVLIPAMAVGRAQEIMMVLEEYARVGGIEVPIYLD-GMIWEATA 464
Query: 116 YYKMFITWTNQKIRKTFVQRNMFDFKHIRPFDKS---------FIDNPGPMVVFATPGML 166
+ + + ++ IR+ F + P KS ID+ P ++ AT GML
Sbjct: 465 IHTAYPEYLSKHIREQIFHEGYNPF--LNPIFKSVANSRERQDIIDSGEPAIIIATSGML 522
Query: 167 HSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKL----EFENKQIIDVKMAI 222
G S+ FK+ AP N +I + +GT+G +V G++++ E ++I+V M +
Sbjct: 523 VGGPSVEYFKQLAPDPKNSIIFVSYQAEGTLGRQVQRGLREIPIVGEDGRTEVINVNMEV 582
Query: 223 EYM-SFSAHADAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPAN 278
+ FS HAD + +M + P+ ++ VHGEA K + L I ++F + P N
Sbjct: 583 HTIDGFSGHADRRELMSYVARVRPRPERIITVHGEAHKCLDLSSSIHKKFGISTRAPNN 641
>gi|296109857|ref|YP_003616806.1| KH-domain/beta-lactamase-domain protein [methanocaldococcus
infernus ME]
gi|295434671|gb|ADG13842.1| KH-domain/beta-lactamase-domain protein [Methanocaldococcus
infernus ME]
Length = 631
Score = 136 bits (342), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 95/289 (32%), Positives = 149/289 (51%), Gaps = 23/289 (7%)
Query: 7 GNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYAT--TIRDSKRCRERDFLKKVH 64
G ++ YTGD R L A R + LI ESTY + + E++ L+ V
Sbjct: 338 GLYNLAYTGDLKFDTSRLLEPAVCQFPRLEALIIESTYGGYDDVLPDRAEAEKELLRIVI 397
Query: 65 ECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMFITWT 124
E +++GGK+LIPVF +GRAQEL ++LE + + AP+Y G+ +AT + + +
Sbjct: 398 EHIEKGGKILIPVFGVGRAQELMLVLEEGYNQGIFNAPVYLD-GMIWEATAIHTAYPEYL 456
Query: 125 NQKIRKTFVQR--NMF---DFKHIRPFD--KSFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ +R+ Q N F F+ +R + + ID+ P V+ AT GML G SL FK
Sbjct: 457 SKDMRRKIFQEGDNPFLSDVFQRVRNTNDRRRIIDDSEPCVILATSGMLTGGPSLEYFKN 516
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENK----QIIDVKMAIEYM-SFSAHAD 232
AP E N +I G+ +GT+G KV G K++ K + I + M + + FS H+D
Sbjct: 517 LAPDEKNAIIFVGYQAEGTLGRKVQKGWKEIPIVTKNGKTKSIPINMGVYTVEGFSGHSD 576
Query: 233 AKGIMQLIQYCE-----PKNVLLVHGEASKMVFLKEKIKQEFNLDCFMP 276
K QLI+Y P+ +++VHGE SK + + + F + + P
Sbjct: 577 RK---QLIRYVRKVKPSPEKIIMVHGEESKCLDFSSTVNRLFKKETYAP 622
>gi|146305000|ref|YP_001192316.1| beta-lactamase domain-containing protein [Metallosphaera sedula DSM
5348]
gi|145703250|gb|ABP96392.1| beta-lactamase domain protein [Metallosphaera sedula DSM 5348]
Length = 642
Score = 136 bits (342), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 82/274 (29%), Positives = 150/274 (54%), Gaps = 15/274 (5%)
Query: 7 GNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFLKKVHEC 66
G ++VYTGD+ R L A + R D LI E+TY ++ ++ E+ + +++
Sbjct: 342 GVHNLVYTGDFKYARTRLLDRAVSEFPRVDTLIMETTYGAQLQTNRDESEKQLIDVINKT 401
Query: 67 VDRGGKVLIPVFALGRAQELCILLETYWERMNL-QAPIYFAVGLTEKATNYYKMFITWTN 125
++RGGKVLIPV A+GR QE+ +++ +R + + P+Y GL ++ T + + W
Sbjct: 402 LNRGGKVLIPVLAVGRGQEIMLVINDAMKRKLIPEVPVY-VTGLFDEVTAIHTAYPEWLG 460
Query: 126 QKIRKT--FVQRNMFD---FKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
+++R + F N F FK I + + + P ++ AT GML+ G ++ FK+ AP
Sbjct: 461 KEVRDSILFRDENPFTSEFFKRIEGYREDIAEGE-PSIILATSGMLNGGPAVEFFKQLAP 519
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEF--ENKQIIDVKMAIEYM---SFSAHADAKG 235
N LI + +GT+G KV G ++++ + ++ ++K+ +E FS H+D +
Sbjct: 520 DPKNSLIFVSYQAEGTLGRKVRDGAREIQIIGRDGRVDNIKVNLETTPIDGFSGHSDRRQ 579
Query: 236 IMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQ 267
+++ ++ PK N++L HGEAS + K+ I+
Sbjct: 580 LLKFLEDLTPKPRNIILNHGEASAIREFKKNIEN 613
>gi|332159620|ref|YP_004424899.1| mRNA 3'-end processing factor [Pyrococcus sp. NA2]
gi|331035083|gb|AEC52895.1| mRNA 3'-end processing factor, putative [Pyrococcus sp. NA2]
Length = 651
Score = 135 bits (339), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 90/296 (30%), Positives = 151/296 (51%), Gaps = 19/296 (6%)
Query: 1 MFQVKVGN--QSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATT--IRDSKRCRE 56
+ + +GN +I TGD+ P R L A R + L+ ESTY + I+ + E
Sbjct: 350 IVHLHIGNGLHNIAITGDFKFIPTRLLEPANARFPRLETLVMESTYGGSNDIQMPREEAE 409
Query: 57 RDFLKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNY 116
+ ++ +H+ + RGGKVLIP A+GRAQE+ ++LE Y + PIY G+ +AT
Sbjct: 410 KRLIEVIHKTIKRGGKVLIPAMAVGRAQEVMMVLEEYARIGGIDVPIYLD-GMIWEATAI 468
Query: 117 YKMFITWTNQKIRKTFVQR--NMFDFKHIRPFDKS-----FIDNPGPMVVFATPGMLHSG 169
+ + + ++++R+ + N F + P S ID+ P ++ A+ GML G
Sbjct: 469 HTAYPEYLSRRLREQIFKEGYNPFLSEIFHPVANSRERQDIIDSNEPAIIIASSGMLVGG 528
Query: 170 LSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKL----EFENKQIIDVKMAIEYM 225
S+ FK+ AP N +I + +GT+G +V SG +++ E ++I V M + +
Sbjct: 529 PSVEYFKQLAPDPKNSIIFVSYQAEGTLGRQVQSGAREIPMIGEEGRTEVIKVNMEVYTI 588
Query: 226 -SFSAHADAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPAN 278
FS HAD + +M + +P+ V+ VHGE K + L I ++F + P N
Sbjct: 589 DGFSGHADRRELMNYVAKVKPRPERVITVHGEPQKCLDLATSIHKKFGISTRAPNN 644
>gi|256810348|ref|YP_003127717.1| KH-domain/beta-lactamase-domain-containing protein
[Methanocaldococcus fervens AG86]
gi|256793548|gb|ACV24217.1| KH-domain/beta-lactamase-domain protein [Methanocaldococcus fervens
AG86]
Length = 634
Score = 135 bits (339), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 92/295 (31%), Positives = 154/295 (52%), Gaps = 17/295 (5%)
Query: 7 GNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYAT--TIRDSKRCRERDFLKKVH 64
G ++ YTGD R L A R + LI ESTY + + ER+ L+ V
Sbjct: 341 GLYNLAYTGDIKFETSRLLEPAVCQFPRLETLIIESTYGAYDDVLPERDEAERELLRVVS 400
Query: 65 ECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMFITWT 124
E ++GGKVLIPVF +GRAQEL ++LE + + AP+Y G+ +AT + + +
Sbjct: 401 ETTEKGGKVLIPVFGVGRAQELMLVLEEGYNQGIFNAPVYLD-GMIWEATAIHTAYPEYL 459
Query: 125 NQKIRKTFVQR--NMF---DFKHIRPFD--KSFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++++R+ N F FK + + + ID+ P ++ AT GML G S+ K
Sbjct: 460 SKEMRQKIFHEGDNPFLSEVFKRVGSTNERRKVIDSDEPCIILATSGMLTGGPSVEYLKH 519
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENK----QIIDVKMAIEYM-SFSAHAD 232
AP E N +I G+ +GT+G KV G K++ + + I + M + + FS H+D
Sbjct: 520 LAPDEKNAIIFVGYQAEGTLGRKVQRGWKEIPITTRNGKTKSIPINMKVYTIEGFSGHSD 579
Query: 233 AKGIMQLIQYCE--PKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQ 285
K +++ I+ + P+ V++VHGE SK + + +++ F ++P N ++ V+
Sbjct: 580 RKQLIKYIRRVKPSPEKVIMVHGEESKCLDFADTVRRLFKKQTYVPMNLDAIRVK 634
>gi|15897661|ref|NP_342266.1| mRNA 3'-end processing factor [Sulfolobus solfataricus P2]
gi|6015715|emb|CAB57542.1| mRNA 3'-end polyadenylation factor [Sulfolobus solfataricus P2]
gi|13813930|gb|AAK41056.1| mRNA 3'-end processing factor, putative [Sulfolobus solfataricus
P2]
Length = 639
Score = 135 bits (339), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 84/286 (29%), Positives = 156/286 (54%), Gaps = 16/286 (5%)
Query: 7 GNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFLKKVHEC 66
G +IVYTGD+ + L A + R D LI E+TY + ++ E + L+ +++
Sbjct: 347 GKHNIVYTGDFKYAKTKLLDKANTEFPRVDTLIMETTYGAQDQPNREESELELLEIINKT 406
Query: 67 VDRGGKVLIPVFALGRAQELCILLETYWERMNL-QAPIYFAVGLTEKATNYYKMFITWTN 125
+++GGKVLIPV A+GR QE+ +++ + ++ + + P+Y GL ++ T + + W
Sbjct: 407 LNKGGKVLIPVLAVGRGQEIMLIINDFMKKKLIPEVPVY-VTGLVDEVTAIHNAYPEWLG 465
Query: 126 QKIRKTFVQR--NMFDFKHIRPFD--KSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAPV 181
+++R+ + + N F +H + + K I P ++ AT GML+ G ++ FK AP
Sbjct: 466 REVREEILYKDENPFTSEHFKRIEGYKEDIAKGEPSIILATSGMLNGGPAVEFFKTMAPD 525
Query: 182 ENNMLIMPGFCVQGTIGHKVLSGVKKLEFENK----QIIDVKMAIEYM-SFSAHADAKGI 236
N +I + +GT+G KV G K+++ ++ + I + M +E + FS H+D + +
Sbjct: 526 PKNAIIFVSYQAEGTLGRKVRDGAKEVQILDRDGRVESIQINMEVEAVEGFSGHSDKRQL 585
Query: 237 MQLIQYCE--PKNVLLVHGEASKMVFLKEKIKQE---FNLDCFMPA 277
+ ++ E PKNV+L HGEAS + I+++ + + + PA
Sbjct: 586 LNFLRNIEPKPKNVILNHGEASSIRAFANYIREDRLGYKPNIYTPA 631
>gi|389852761|ref|YP_006354995.1| mRNA 3'-end processing factor [Pyrococcus sp. ST04]
gi|388250067|gb|AFK22920.1| putative mRNA 3'-end processing factor [Pyrococcus sp. ST04]
Length = 651
Score = 135 bits (339), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 89/296 (30%), Positives = 150/296 (50%), Gaps = 19/296 (6%)
Query: 1 MFQVKVGN--QSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATT--IRDSKRCRE 56
+ + +GN +I TGD+ P + L A R + L+ ESTY + I+ + E
Sbjct: 350 IVHLHIGNGLHNIAITGDFKFIPTKLLEPANAKFPRLETLVMESTYGGSNDIQMPREEAE 409
Query: 57 RDFLKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNY 116
+ ++ +H + RGGKVLIP A+GRAQE+ ++LE Y + APIY G+ +AT
Sbjct: 410 KRLIEVIHHTIKRGGKVLIPAMAVGRAQEVMMVLEEYARIGGIDAPIYLD-GMIWEATAI 468
Query: 117 YKMFITWTNQKIRKTFVQR--NMFDFKHIRPFDKS-----FIDNPGPMVVFATPGMLHSG 169
+ + + ++++R+ + N F + P S ID+ P ++ A+ GML G
Sbjct: 469 HTAYPEYLSRRLREQIFKEGYNPFLSEIFHPVANSRERQDIIDSKEPAIIIASSGMLVGG 528
Query: 170 LSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKL----EFENKQIIDVKMAIEYM 225
S+ FK+ AP N +I + +GT+G +V +G +++ E ++I V M + +
Sbjct: 529 PSVEYFKQLAPDPKNAIIFVSYQAEGTLGRQVQNGAREIPMIGEGGRTEVIKVNMDVHTI 588
Query: 226 -SFSAHADAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPAN 278
FS HAD + +M + P+ V+ VHGE K + L I ++F + P N
Sbjct: 589 DGFSGHADRRELMNYVAKVRPRPERVITVHGEPQKCLDLATSIHKKFGISTRAPNN 644
>gi|337284211|ref|YP_004623685.1| mRNA 3'-end processing factor [Pyrococcus yayanosii CH1]
gi|334900145|gb|AEH24413.1| mRNA 3'-end processing factor, putative [Pyrococcus yayanosii CH1]
Length = 648
Score = 135 bits (339), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 89/296 (30%), Positives = 150/296 (50%), Gaps = 19/296 (6%)
Query: 1 MFQVKVGN--QSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATT--IRDSKRCRE 56
+ + +GN +I TGD+ P R L A R + L+ E+TY + I+ + E
Sbjct: 347 IVHLHIGNGLHNIAVTGDFKFIPTRLLEPANARFPRLETLVMEATYGGSNDIQMPREEAE 406
Query: 57 RDFLKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNY 116
+ ++ +H + RGGKVLIP A+GRAQE+ ++LE Y + PIY G+ +AT
Sbjct: 407 KRLIEVIHRTIKRGGKVLIPAMAVGRAQEVMMVLEEYARIGGIDVPIYLD-GMIWEATAI 465
Query: 117 YKMFITWTNQKIRKTFVQR--NMFDFKHIRPFDKS-----FIDNPGPMVVFATPGMLHSG 169
+ + + ++++R+ N F + +P S ID+ P ++ A+ GML G
Sbjct: 466 HTAYPEYLSKRLREQIFHEGYNPFLNEVFKPVANSRERQDIIDSNEPAIIIASSGMLVGG 525
Query: 170 LSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKL----EFENKQIIDVKMAIEYM 225
S+ FK+ AP N +I + +GT+G +V +G +++ E ++I V M + +
Sbjct: 526 PSVEYFKQLAPDPKNSMIFVSYQAEGTLGRQVQNGAREIPMVGEGGKTEVIKVNMEVYTI 585
Query: 226 -SFSAHADAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPAN 278
FS HAD + +M + +P+ V+ VHGE K + L I ++F L P N
Sbjct: 586 DGFSGHADRRELMNYVARVKPRPERVITVHGEPQKCLDLATSIHKKFGLSTRAPNN 641
>gi|284174985|ref|ZP_06388954.1| mRNA 3'-end processing factor, putative [Sulfolobus solfataricus
98/2]
Length = 638
Score = 135 bits (339), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 84/286 (29%), Positives = 156/286 (54%), Gaps = 16/286 (5%)
Query: 7 GNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFLKKVHEC 66
G +IVYTGD+ + L A + R D LI E+TY + ++ E + L+ +++
Sbjct: 346 GKHNIVYTGDFKYAKTKLLDKANTEFPRVDTLIMETTYGAQDQPNREESELELLEIINKT 405
Query: 67 VDRGGKVLIPVFALGRAQELCILLETYWERMNL-QAPIYFAVGLTEKATNYYKMFITWTN 125
+++GGKVLIPV A+GR QE+ +++ + ++ + + P+Y GL ++ T + + W
Sbjct: 406 LNKGGKVLIPVLAVGRGQEIMLIINDFMKKKLIPEVPVY-VTGLVDEVTAIHNAYPEWLG 464
Query: 126 QKIRKTFVQR--NMFDFKHIRPFD--KSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAPV 181
+++R+ + + N F +H + + K I P ++ AT GML+ G ++ FK AP
Sbjct: 465 REVREEILYKDENPFTSEHFKRIEGYKEDIAKGEPSIILATSGMLNGGPAVEFFKTMAPD 524
Query: 182 ENNMLIMPGFCVQGTIGHKVLSGVKKLEFENK----QIIDVKMAIEYM-SFSAHADAKGI 236
N +I + +GT+G KV G K+++ ++ + I + M +E + FS H+D + +
Sbjct: 525 PKNAIIFVSYQAEGTLGRKVRDGAKEVQILDRDGRVESIQINMEVEAVEGFSGHSDKRQL 584
Query: 237 MQLIQYCE--PKNVLLVHGEASKMVFLKEKIKQE---FNLDCFMPA 277
+ ++ E PKNV+L HGEAS + I+++ + + + PA
Sbjct: 585 LNFLRNIEPKPKNVILNHGEASSIRAFANYIREDRLGYKPNIYTPA 630
>gi|384434275|ref|YP_005643633.1| KH-domain/beta-lactamase-domain-containing protein [Sulfolobus
solfataricus 98/2]
gi|261602429|gb|ACX92032.1| KH-domain/beta-lactamase-domain protein [Sulfolobus solfataricus
98/2]
Length = 635
Score = 135 bits (339), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 84/286 (29%), Positives = 156/286 (54%), Gaps = 16/286 (5%)
Query: 7 GNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFLKKVHEC 66
G +IVYTGD+ + L A + R D LI E+TY + ++ E + L+ +++
Sbjct: 343 GKHNIVYTGDFKYAKTKLLDKANTEFPRVDTLIMETTYGAQDQPNREESELELLEIINKT 402
Query: 67 VDRGGKVLIPVFALGRAQELCILLETYWERMNL-QAPIYFAVGLTEKATNYYKMFITWTN 125
+++GGKVLIPV A+GR QE+ +++ + ++ + + P+Y GL ++ T + + W
Sbjct: 403 LNKGGKVLIPVLAVGRGQEIMLIINDFMKKKLIPEVPVY-VTGLVDEVTAIHNAYPEWLG 461
Query: 126 QKIRKTFVQR--NMFDFKHIRPFD--KSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAPV 181
+++R+ + + N F +H + + K I P ++ AT GML+ G ++ FK AP
Sbjct: 462 REVREEILYKDENPFTSEHFKRIEGYKEDIAKGEPSIILATSGMLNGGPAVEFFKTMAPD 521
Query: 182 ENNMLIMPGFCVQGTIGHKVLSGVKKLEFENK----QIIDVKMAIEYM-SFSAHADAKGI 236
N +I + +GT+G KV G K+++ ++ + I + M +E + FS H+D + +
Sbjct: 522 PKNAIIFVSYQAEGTLGRKVRDGAKEVQILDRDGRVESIQINMEVEAVEGFSGHSDKRQL 581
Query: 237 MQLIQYCE--PKNVLLVHGEASKMVFLKEKIKQE---FNLDCFMPA 277
+ ++ E PKNV+L HGEAS + I+++ + + + PA
Sbjct: 582 LNFLRNIEPKPKNVILNHGEASSIRAFANYIREDRLGYKPNIYTPA 627
>gi|374633103|ref|ZP_09705470.1| universal archaeal KH-domain/beta-lactamase-domain protein
[Metallosphaera yellowstonensis MK1]
gi|373524587|gb|EHP69464.1| universal archaeal KH-domain/beta-lactamase-domain protein
[Metallosphaera yellowstonensis MK1]
Length = 639
Score = 135 bits (339), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 80/265 (30%), Positives = 145/265 (54%), Gaps = 15/265 (5%)
Query: 7 GNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFLKKVHEC 66
G ++VYTGD+ R L A + R D LI E+TY + ++ E+ + +++
Sbjct: 342 GTHNLVYTGDFKYARTRLLDRAVAEFPRVDTLIMETTYGAQGQTNREESEKQLIDVINKT 401
Query: 67 VDRGGKVLIPVFALGRAQELCILLETYWERMNL-QAPIYFAVGLTEKATNYYKMFITWTN 125
++RGGKVLIPV A+GR QE+ +++ +R + + P+Y GL ++ T + + W
Sbjct: 402 INRGGKVLIPVLAVGRGQEIMLVINDAMKRKLIPEIPVY-VTGLFDEVTAIHTAYPEWLG 460
Query: 126 QKIRKT--FVQRNMFD---FKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
+++R + F N F FK I F + I P ++ AT GML+ G ++ FK AP
Sbjct: 461 KEVRDSILFRDENPFTSDYFKRIEGF-REDIAQGEPSIILATSGMLNGGPAVEFFKYLAP 519
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEF--ENKQIIDVKMAIEYM---SFSAHADAKG 235
N L+ + +GT+G KV G K+++ + ++ +K+ +E+ FS H+D +
Sbjct: 520 DPKNSLVFVSYQAEGTLGRKVRDGAKEIQVVGRDGRVESIKVNLEHTPIDGFSGHSDRRQ 579
Query: 236 IMQLIQ--YCEPKNVLLVHGEASKM 258
+++ ++ +P+N++L HGE+S +
Sbjct: 580 LLKFLEDMVPKPRNIILNHGESSAI 604
>gi|449283457|gb|EMC90096.1| hypothetical protein A306_00818, partial [Columba livia]
Length = 508
Score = 134 bits (338), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 76/228 (33%), Positives = 127/228 (55%), Gaps = 26/228 (11%)
Query: 804 ITHFVTPDSDISSILSQAYPQDSLHNVCIMGLHTCGDLSGTALRLFT-KSSLQCLVQVGC 862
+T +T ++++ I++ L + I+GLHTCGDL+ LR+FT K ++ + VGC
Sbjct: 273 LTSCITAETELGDIITD------LEDCMIVGLHTCGDLAANTLRIFTAKPEIKAVCSVGC 326
Query: 863 CYHLLEEEFIRSPFWKDVDQSLYEHGYGFPLSEHLRSRKFFLGRNVRMSGTQSPERVIDL 922
CYHLL E+F + + E +GFP+ ++L+ + GRN RMS P
Sbjct: 327 CYHLLSEQF-------ENQEDCPEEVWGFPMCQYLKDTGWCCGRNARMSACLLP------ 373
Query: 923 KQTQTLPLFYRALLEKYLRSKITINDEEPKVVGRLATKCSNFVEYVHRAVDKLKLD-LEV 981
T LFYRA+L+ + ++ + + VG+ +K S+F++YV +++ KL+LD ++
Sbjct: 374 ----TESLFYRAVLQVIIEEMYGVSKSD-RHVGKTFSKSSSFIDYVRKSLKKLELDDSKI 428
Query: 982 DDEEVTRLFNSHQREYEYLQIYYFLKTALAPVIEALIVLDRVLYLREQ 1029
D + + ++ L+ + LK LAP IE LI+LDR+ YL+EQ
Sbjct: 429 PDSCIMDYYEKYKHRMNELEAFNMLKVVLAPCIEVLILLDRLCYLKEQ 476
Score = 73.9 bits (180), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 62/103 (60%), Gaps = 2/103 (1%)
Query: 634 DLGSTITVDELFSKLQARKCGLVVESAKVSQLMSEKKSYEVQVMSQVVAAVTNSCDSSHI 693
+LG ++++ L+ G K + M+ KKS+EVQ+MS++V + N C +
Sbjct: 25 NLGVCTPLEDVLEALRGDNQGTT--GIKTDEFMNNKKSHEVQMMSELVDKIANYCGIKQV 82
Query: 694 IDLGGGQGYLSTILALQHGKKTLSLDYNQVNTHGAAVRSKKLE 736
ID+G G+GYLS+ L++Q+ K +D + NT+GA R++KL+
Sbjct: 83 IDIGSGKGYLSSFLSMQYNLKVYGIDSSNTNTNGAHERNRKLK 125
Score = 42.7 bits (99), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 18/51 (35%), Positives = 31/51 (60%)
Query: 1044 TQVDDEEVTRLFDSHQREYEYLQIYYFLKTALAPVIEALIVLDRVLYLREQ 1094
+++ D + ++ ++ L+ + LK LAP IE LI+LDR+ YL+EQ
Sbjct: 426 SKIPDSCIMDYYEKYKHRMNELEAFNMLKVVLAPCIEVLILLDRLCYLKEQ 476
Score = 40.0 bits (92), Expect = 7.1, Method: Compositional matrix adjust.
Identities = 17/38 (44%), Positives = 26/38 (68%)
Query: 600 KVSQLMSEKKSYEVQVMSQVVAAVTNSCDSSHIIDLGS 637
K + M+ KKS+EVQ+MS++V + N C +ID+GS
Sbjct: 50 KTDEFMNNKKSHEVQMMSELVDKIANYCGIKQVIDIGS 87
>gi|333897915|ref|YP_004471789.1| beta-lactamase [Thermoanaerobacterium xylanolyticum LX-11]
gi|333113180|gb|AEF18117.1| beta-lactamase domain protein [Thermoanaerobacterium xylanolyticum
LX-11]
Length = 820
Score = 134 bits (338), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 87/285 (30%), Positives = 147/285 (51%), Gaps = 11/285 (3%)
Query: 6 VGNQ-SIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFLKKVH 64
VGN+ S Y+GD++ + A I K RPD+ ESTY + ++ E ++K+
Sbjct: 159 VGNEGSFFYSGDFSRFRQNTIEGASIPKLRPDVAFFESTYGDKLHANRELEESRLVEKIG 218
Query: 65 ECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMFITWT 124
+ GGKVLIP FALGRAQE+ ++L+ + ++ +Y G+ + YK+ +
Sbjct: 219 YVIKNGGKVLIPAFALGRAQEIILILKKAINKGMIETKVYVD-GMVKDICRIYKLNPNYL 277
Query: 125 NQKI-RKTFVQRNMFDFKHIRPFDKSFIDN---PGPMVVFATPGMLHSGLSLIIFKKWAP 180
+ + +K F +F +I P DKS + P V+ ++ GML G S +K A
Sbjct: 278 RESLAKKIFKGGEIFFDDNIMPVDKSEMREEIIKEPCVIVSSSGMLTGGPSQWYAEKLAG 337
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQI-----IDVKMAIEYMSFSAHADAKG 235
E N++ + G+ + + G K+L + + + K I +K A++ SAHAD
Sbjct: 338 DEKNLIAITGYQDEESPGRKLLELTDEKDDDKKLKLADKEIPIKCAVDKFGLSAHADMSE 397
Query: 236 IMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGE 280
I+ L PK V L+HG+ + FL ++I+++ N D ++P NG+
Sbjct: 398 ILSLANILHPKKVFLIHGDPDTINFLGKEIQKDINTDVYVPQNGD 442
>gi|240103777|ref|YP_002960086.1| RNA-metabolising metallo-beta-lactamase, beta-CASP family protein
[Thermococcus gammatolerans EJ3]
gi|239911331|gb|ACS34222.1| RNA-metabolising metallo-beta-lactamase, beta-CASP family protein
[Thermococcus gammatolerans EJ3]
Length = 651
Score = 134 bits (338), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 93/302 (30%), Positives = 150/302 (49%), Gaps = 31/302 (10%)
Query: 1 MFQVKVGN--QSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCR--- 55
+ + +GN +I TGD+ P R L A R + L+ ESTY + D + R
Sbjct: 350 IVHLHIGNGLHNIAITGDFKFIPTRLLEPAVSRFPRVETLVMESTYGGS-NDYQMPRDEA 408
Query: 56 ERDFLKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATN 115
E+ ++ +H + RGGKVLIP A+GRAQE+ ++LE Y + PIY G+ +AT
Sbjct: 409 EKRLIEVIHHTIKRGGKVLIPAMAVGRAQEIMMVLEEYARVGGIDVPIYLD-GMIWEATA 467
Query: 116 YYKMFITWTNQKIRKTFVQRNMFDFKHIRPFDKS---------FIDNPGPMVVFATPGML 166
+ + + ++++R+ F + P KS ID+ P ++ AT GML
Sbjct: 468 IHTAYPEYLSRRLREQIFHEGYNPF--LNPIFKSVANSRERQDIIDSGEPAIIIATSGML 525
Query: 167 HSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKL----EFENKQIIDVKMAI 222
G S+ FK+ AP N +I + +GT+G +V G++++ E +++ V M +
Sbjct: 526 VGGPSVEYFKQLAPDPKNSIIFVSYQAEGTLGRQVQRGLREIPLVGEDGRTEVVQVNMEV 585
Query: 223 EYM-SFSAHADAKGIMQLIQYC-----EPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMP 276
+ FS HAD + +LI Y P+ V+ VHGE K + L + ++FNL P
Sbjct: 586 HTIDGFSGHADRR---ELISYVARLRPRPERVITVHGEPHKCLDLASSLHKKFNLSTRAP 642
Query: 277 AN 278
N
Sbjct: 643 NN 644
>gi|340053150|emb|CCC47437.1| putative cleavage and polyadenylation specificity factor, fragment
[Trypanosoma vivax Y486]
Length = 584
Score = 134 bits (337), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 82/227 (36%), Positives = 127/227 (55%), Gaps = 23/227 (10%)
Query: 50 DSKRCRERDFLKKVHECVDRGGKVLIPVFALGRAQELCILLETYWE-RMNLQ-APIYFAV 107
DS C E D + H +D G +P F YWE LQ PIY+A
Sbjct: 67 DSINCGEIDLVLITHFHLDHCGA--LPYF-------------WYWEAHKELQHIPIYYAS 111
Query: 108 GLTEKATNYYKMFITWTNQKIRKTFV-QRNMFDFKHIRPF--DKSFIDNPGPMVVFATPG 164
L ++ Y+ F++ N ++++ RN F FK+I+P +SF DN GP VV A+PG
Sbjct: 112 SLAQRCMKLYQTFVSAMNDRVKQQHANHRNPFVFKYIQPLVDTRSFEDN-GPCVVLASPG 170
Query: 165 MLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKM-AIE 223
ML SG+SL +F+KW N +I+ G+C+ GTI VLS +++ + +++ ++M I+
Sbjct: 171 MLQSGISLELFEKWCGDRRNGIIIAGYCIDGTIAKDVLSKPREITKPDGKVLPLRMRTIQ 230
Query: 224 YMSFSAHADAKGIMQLIQYC-EPKNVLLVHGEASKMVFLKEKIKQEF 269
+SFSAH+D + IQ + ++V+LVHG A M+ LK K++Q+F
Sbjct: 231 SVSFSAHSDGRQTRDFIQALPKTQHVILVHGNAGAMIQLKNKLQQDF 277
>gi|385775961|ref|YP_005648529.1| beta-lactamase [Sulfolobus islandicus REY15A]
gi|323474709|gb|ADX85315.1| beta-lactamase domain protein [Sulfolobus islandicus REY15A]
Length = 638
Score = 134 bits (336), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 81/264 (30%), Positives = 146/264 (55%), Gaps = 13/264 (4%)
Query: 7 GNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFLKKVHEC 66
G +IVYTGD+ + L A + R D LI E+TY + ++ E + L+ +++
Sbjct: 346 GKHNIVYTGDFKYAKTKLLDKANTEFPRVDTLIMETTYGAQDQPNREESELELLEIINKT 405
Query: 67 VDRGGKVLIPVFALGRAQELCILLETYWERMNL-QAPIYFAVGLTEKATNYYKMFITWTN 125
++RGGKVLIPV A+GR QE+ +++ + ++ + + P+Y GL ++ T + + W
Sbjct: 406 LNRGGKVLIPVLAVGRGQEIMLIINDFMKKKLIPEVPVYLT-GLVDEVTAIHNAYPEWLG 464
Query: 126 QKIRKTFVQR--NMFDFKHIRPFD--KSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAPV 181
+++R+ + + N F +H + + K I P ++ AT GML+ G ++ FK AP
Sbjct: 465 REVREEILYKDENPFTSEHFKRIEGYKEDIAKGEPSIILATSGMLNGGPAVEFFKTMAPD 524
Query: 182 ENNMLIMPGFCVQGTIGHKVLSGVKKLEFENK----QIIDVKMAIEYM-SFSAHADAKGI 236
N +I + +GT+G KV G K+++ ++ + I + M +E + FS H+D + +
Sbjct: 525 PKNAIIFVSYQAEGTLGRKVRDGAKEVQILDRDGRVESIQINMEVEAVEGFSGHSDRRQL 584
Query: 237 MQLIQYCE--PKNVLLVHGEASKM 258
++ E PKN++L HGEAS +
Sbjct: 585 FNFLRTIEPKPKNIILNHGEASAI 608
>gi|333910182|ref|YP_004483915.1| KH-domain/beta-lactamase-domain-containing protein [Methanotorris
igneus Kol 5]
gi|333750771|gb|AEF95850.1| KH-domain/beta-lactamase-domain protein [Methanotorris igneus Kol
5]
Length = 635
Score = 133 bits (335), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 104/298 (34%), Positives = 152/298 (51%), Gaps = 23/298 (7%)
Query: 7 GNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYAT--TIRDSKRCRERDFLKKVH 64
G +I YTGD R L A R + LI ESTY + + K E++ LK V
Sbjct: 342 GQYNIAYTGDIKFEASRLLEPAVCQFPRLETLIIESTYGSFDDVLPEKEETEKELLKIVS 401
Query: 65 ECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMFITWT 124
E + RGGKVLIPVF +GRAQEL ++LE + + AP+Y G+ +AT + + +
Sbjct: 402 ETLQRGGKVLIPVFGVGRAQELMLVLEEGYNQGVFNAPVYLD-GMIWEATAIHTAYPEYL 460
Query: 125 NQKIRKTFVQR--NMF---DFKHIRPFD--KSFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ IR N F FK + + + ID+ P ++ AT GML G S+ FK
Sbjct: 461 SKVIRNRIFHEGDNPFLSEVFKRVGSTNERRRVIDSDEPCIILATSGMLTGGPSVEYFKN 520
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENK----QIIDVKMAIEYM-SFSAHAD 232
AP E N L+ G+ +GT+G KV G K++ K + I +KM I + FS H+D
Sbjct: 521 LAPDEKNTLVFVGYQAEGTLGRKVQKGWKEIPITTKNGKTKSIPIKMEIHTLEGFSGHSD 580
Query: 233 AKGIMQLIQYCE-----PKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQ 285
K QLI+Y P+ +L++HGEA+K + L + F + P N +S V+
Sbjct: 581 RK---QLIKYLRKLKPMPERILMIHGEANKCIDLASTAYKLFKRETRAPMNLDSIRVK 635
>gi|229579166|ref|YP_002837564.1| beta-lactamase domain-containing protein [Sulfolobus islandicus
Y.G.57.14]
gi|229582079|ref|YP_002840478.1| beta-lactamase domain-containing protein [Sulfolobus islandicus
Y.N.15.51]
gi|284997854|ref|YP_003419621.1| beta-lactamase domain-containing protein [Sulfolobus islandicus
L.D.8.5]
gi|228009880|gb|ACP45642.1| beta-lactamase domain protein [Sulfolobus islandicus Y.G.57.14]
gi|228012795|gb|ACP48556.1| beta-lactamase domain protein [Sulfolobus islandicus Y.N.15.51]
gi|284445749|gb|ADB87251.1| beta-lactamase domain protein [Sulfolobus islandicus L.D.8.5]
Length = 638
Score = 133 bits (335), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 81/264 (30%), Positives = 146/264 (55%), Gaps = 13/264 (4%)
Query: 7 GNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFLKKVHEC 66
G +IVYTGD+ + L A + R D LI E+TY + ++ E + L+ +++
Sbjct: 346 GKHNIVYTGDFKYAKTKLLDKANTEFPRVDTLIMETTYGAQDQPNREESELELLEIINKT 405
Query: 67 VDRGGKVLIPVFALGRAQELCILLETYWERMNL-QAPIYFAVGLTEKATNYYKMFITWTN 125
++RGGKVLIPV A+GR QE+ +++ + ++ + + P+Y GL ++ T + + W
Sbjct: 406 LNRGGKVLIPVLAVGRGQEIMLIINDFMKKKLIPEVPVY-VTGLVDEVTAIHNAYPEWLG 464
Query: 126 QKIRKTFVQR--NMFDFKHIRPFD--KSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAPV 181
+++R+ + + N F +H + + K I P ++ AT GML+ G ++ FK AP
Sbjct: 465 REVREEILYKDENPFTSEHFKRIEGYKEDIAKGEPSIILATSGMLNGGPAVEFFKTMAPD 524
Query: 182 ENNMLIMPGFCVQGTIGHKVLSGVKKLEFENK----QIIDVKMAIEYM-SFSAHADAKGI 236
N +I + +GT+G KV G K+++ ++ + I + M +E + FS H+D + +
Sbjct: 525 PKNAIIFVSYQAEGTLGRKVRDGAKEVQILDRDGRVESIQINMEVEAVEGFSGHSDRRQL 584
Query: 237 MQLIQYCE--PKNVLLVHGEASKM 258
++ E PKN++L HGEAS +
Sbjct: 585 FNFLRTIEPKPKNIILNHGEASAI 608
>gi|227830349|ref|YP_002832129.1| beta-lactamase domain-containing protein [Sulfolobus islandicus
L.S.2.15]
gi|227456797|gb|ACP35484.1| beta-lactamase domain protein [Sulfolobus islandicus L.S.2.15]
Length = 638
Score = 133 bits (335), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 81/264 (30%), Positives = 146/264 (55%), Gaps = 13/264 (4%)
Query: 7 GNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFLKKVHEC 66
G +IVYTGD+ + L A + R D LI E+TY + ++ E + L+ +++
Sbjct: 346 GKHNIVYTGDFKYAKTKLLDKANTEFPRVDTLIMETTYGAQDQPNREESELELLEIINKT 405
Query: 67 VDRGGKVLIPVFALGRAQELCILLETYWERMNL-QAPIYFAVGLTEKATNYYKMFITWTN 125
++RGGKVLIPV A+GR QE+ +++ + ++ + + P+Y GL ++ T + + W
Sbjct: 406 LNRGGKVLIPVLAVGRGQEIMLIINDFMKKKLIPEVPVY-VTGLVDEVTAIHNAYPEWLG 464
Query: 126 QKIRKTFVQR--NMFDFKHIRPFD--KSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAPV 181
+++R+ + + N F +H + + K I P ++ AT GML+ G ++ FK AP
Sbjct: 465 REVREEILYKDENPFTSEHFKRIEGYKEDIAKGEPSIILATSGMLNGGPAVEFFKTMAPD 524
Query: 182 ENNMLIMPGFCVQGTIGHKVLSGVKKLEFENK----QIIDVKMAIEYM-SFSAHADAKGI 236
N +I + +GT+G KV G K+++ ++ + I + M +E + FS H+D + +
Sbjct: 525 PKNAIIFVSYQAEGTLGRKVRDGAKEVQILDRDGRVESIQINMEVEAVEGFSGHSDRRQL 584
Query: 237 MQLIQYCE--PKNVLLVHGEASKM 258
++ E PKN++L HGEAS +
Sbjct: 585 FNFLRTIEPKPKNIILNHGEASAI 608
>gi|385773323|ref|YP_005645889.1| beta-lactamase [Sulfolobus islandicus HVE10/4]
gi|323477437|gb|ADX82675.1| beta-lactamase domain protein [Sulfolobus islandicus HVE10/4]
Length = 638
Score = 133 bits (335), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 81/264 (30%), Positives = 146/264 (55%), Gaps = 13/264 (4%)
Query: 7 GNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFLKKVHEC 66
G +IVYTGD+ + L A + R D LI E+TY + ++ E + L+ +++
Sbjct: 346 GKHNIVYTGDFKYAKTKLLDKANTEFPRVDTLIMETTYGAQDQPNREESELELLEIINKT 405
Query: 67 VDRGGKVLIPVFALGRAQELCILLETYWERMNL-QAPIYFAVGLTEKATNYYKMFITWTN 125
++RGGKVLIPV A+GR QE+ +++ + ++ + + P+Y GL ++ T + + W
Sbjct: 406 LNRGGKVLIPVLAVGRGQEIMLIINDFMKKKLIPEVPVY-VTGLVDEVTAIHNAYPEWLG 464
Query: 126 QKIRKTFVQR--NMFDFKHIRPFD--KSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAPV 181
+++R+ + + N F +H + + K I P ++ AT GML+ G ++ FK AP
Sbjct: 465 REVREEILYKDENPFTSEHFKRIEGYKEDIAKGEPSIILATSGMLNGGPAVEFFKTMAPD 524
Query: 182 ENNMLIMPGFCVQGTIGHKVLSGVKKLEFENK----QIIDVKMAIEYM-SFSAHADAKGI 236
N +I + +GT+G KV G K+++ ++ + I + M +E + FS H+D + +
Sbjct: 525 PKNAIIFVSYQAEGTLGRKVRDGAKEVQILDRDGRVESIQINMEVEAVEGFSGHSDRRQL 584
Query: 237 MQLIQYCE--PKNVLLVHGEASKM 258
++ E PKN++L HGEAS +
Sbjct: 585 FNFLRTIEPKPKNIILNHGEASAI 608
>gi|227827653|ref|YP_002829433.1| beta-lactamase [Sulfolobus islandicus M.14.25]
gi|229584857|ref|YP_002843359.1| beta-lactamase domain-containing protein [Sulfolobus islandicus
M.16.27]
gi|238619821|ref|YP_002914647.1| beta-lactamase domain-containing protein [Sulfolobus islandicus
M.16.4]
gi|227459449|gb|ACP38135.1| beta-lactamase domain protein [Sulfolobus islandicus M.14.25]
gi|228019907|gb|ACP55314.1| beta-lactamase domain protein [Sulfolobus islandicus M.16.27]
gi|238380891|gb|ACR41979.1| beta-lactamase domain protein [Sulfolobus islandicus M.16.4]
Length = 638
Score = 133 bits (335), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 81/264 (30%), Positives = 146/264 (55%), Gaps = 13/264 (4%)
Query: 7 GNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFLKKVHEC 66
G +IVYTGD+ + L A + R D LI E+TY + ++ E + L+ +++
Sbjct: 346 GKHNIVYTGDFKYAKTKLLDKANTEFPRVDTLIMETTYGAQDQPNREESELELLEIINKT 405
Query: 67 VDRGGKVLIPVFALGRAQELCILLETYWERMNL-QAPIYFAVGLTEKATNYYKMFITWTN 125
++RGGKVLIPV A+GR QE+ +++ + ++ + + P+Y GL ++ T + + W
Sbjct: 406 LNRGGKVLIPVLAVGRGQEIMLIINDFMKKKLIPEVPVY-VTGLVDEVTAIHNAYPEWLG 464
Query: 126 QKIRKTFVQR--NMFDFKHIRPFD--KSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAPV 181
+++R+ + + N F +H + + K I P ++ AT GML+ G ++ FK AP
Sbjct: 465 REVREEILYKDENPFTSEHFKRIEGYKEDIAKGEPSIILATSGMLNGGPAVEFFKTMAPD 524
Query: 182 ENNMLIMPGFCVQGTIGHKVLSGVKKLEFENK----QIIDVKMAIEYM-SFSAHADAKGI 236
N +I + +GT+G KV G K+++ ++ + I + M +E + FS H+D + +
Sbjct: 525 PKNAIIFVSYQAEGTLGRKVRDGAKEVQILDRDGRVESIQINMEVEAVEGFSGHSDRRQL 584
Query: 237 MQLIQYCE--PKNVLLVHGEASKM 258
++ E PKN++L HGEAS +
Sbjct: 585 FNFLRTIEPKPKNIILNHGEASAI 608
>gi|91772428|ref|YP_565120.1| beta-lactamase-like protein [Methanococcoides burtonii DSM 6242]
gi|91711443|gb|ABE51370.1| RNA-metabolizing metallo-beta-lactamase protein [Methanococcoides
burtonii DSM 6242]
Length = 636
Score = 133 bits (334), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 87/296 (29%), Positives = 154/296 (52%), Gaps = 17/296 (5%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTY--ATTIRDSKRCRERDF 59
F + G ++V TGDY P R A R + +ITESTY A+ ++ S + E++
Sbjct: 338 FHIGDGLHNVVVTGDYKYGPTRLFDPAVNKFPRVETVITESTYGAASAVQPSLKEAEKNL 397
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQ-APIYFAVGLTEKATNYYK 118
+ + ++ G VLIP FA+GR+QE+ I+LE R + P+Y G+ +AT +
Sbjct: 398 QQIIKNTLNNKGIVLIPAFAVGRSQEVMIVLEDAIRRGIIDDVPVYLD-GMIWEATAIHA 456
Query: 119 MFITWTNQKIRKTFVQR--NMFDFKHIRPFD-----KSFIDNPGPMVVFATPGMLHSGLS 171
+ + N +RK Q+ N F + +P D K I++P P V+ +T GM+++G
Sbjct: 457 TYPEYLNNNLRKLIFQKGQNPFLAECFKPVDSNDLRKKIIEDPHPCVILSTSGMMNAGPV 516
Query: 172 LIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENK---QIIDVKMAIEYM-SF 227
+ FK +AP NN L+ G+ GT+G ++ G K++ +K +++ + M +E + F
Sbjct: 517 MEYFKAFAPDPNNTLVFVGYQADGTLGRRIQKGWKEIPVSSKGGTEVVKMNMGVEVVDGF 576
Query: 228 SAHADAKGIMQLIQYC--EPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGES 281
S H+D K +M+ + +P+ V HG+ + L + +++ L+ N E+
Sbjct: 577 SGHSDRKQLMEYFKKMKPQPERVFTEHGDERSCIDLASSLHKKYRLETRALTNLET 632
>gi|297545551|ref|YP_003677853.1| RNA-metabolising metallo-beta-lactamase [Thermoanaerobacter
mathranii subsp. mathranii str. A3]
gi|296843326|gb|ADH61842.1| RNA-metabolising metallo-beta-lactamase [Thermoanaerobacter
mathranii subsp. mathranii str. A3]
Length = 829
Score = 132 bits (333), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 81/288 (28%), Positives = 151/288 (52%), Gaps = 26/288 (9%)
Query: 10 SIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFLKKVHECVDR 69
SI Y+GD++ + A+I K RPD+ I ESTY + ++ E ++KV ++
Sbjct: 164 SIFYSGDFSGFRQNTIEGAFIPKLRPDVAIFESTYGDKLHANRELEETRLIEKVSSIINE 223
Query: 70 GGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMFITWTNQKI- 128
GGKV+IP FALGRAQE+ ++L+ + L+ +Y G+ + YK+ + Q +
Sbjct: 224 GGKVIIPAFALGRAQEIILILKKAINKGLLKTKVYVD-GMVREVCRVYKLNPNYLRQNLA 282
Query: 129 RKTFVQRNMFDFKHIRPFDKSFIDNPGP----------MVVFATPGMLHSGLSLIIFKKW 178
+K F ++F FD + I+ P V+ ++ GM+ G S +K
Sbjct: 283 KKIFKGNDIF-------FDDNVIEVEKPEMREEIIKESCVILSSSGMITGGPSQWYVEKL 335
Query: 179 APVENNMLIMPGFCVQGTIGHKVLSGV------KKLEFENKQIIDVKMAIEYMSFSAHAD 232
A E N++ + G+ + + G ++L + +K++ ++K+I+ + +I+ SAHAD
Sbjct: 336 AQDEKNLIAITGYQDEESPGRRLLDIMEESPENRKIKIDDKEIL-INASIDKFGLSAHAD 394
Query: 233 AKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGE 280
I+ L PK + LVHG+ + FL ++++++ + ++P+NG+
Sbjct: 395 MGEILALANSLYPKKIFLVHGDPEIINFLGKEVQKDLKSEVYIPSNGD 442
>gi|289579410|ref|YP_003478037.1| RNA-metabolising metallo-beta-lactamase [Thermoanaerobacter
italicus Ab9]
gi|289529123|gb|ADD03475.1| RNA-metabolising metallo-beta-lactamase [Thermoanaerobacter
italicus Ab9]
Length = 829
Score = 132 bits (333), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 81/288 (28%), Positives = 151/288 (52%), Gaps = 26/288 (9%)
Query: 10 SIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFLKKVHECVDR 69
SI Y+GD++ + A+I K RPD+ I ESTY + ++ E ++KV ++
Sbjct: 164 SIFYSGDFSGFRQNTIEGAFIPKLRPDVAIFESTYGDKLHANRELEETRLIEKVSSIINE 223
Query: 70 GGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMFITWTNQKI- 128
GGKV+IP FALGRAQE+ ++L+ + L+ +Y G+ + YK+ + Q +
Sbjct: 224 GGKVIIPAFALGRAQEIILILKKAINKGLLKTKVYVD-GMVREVCRVYKLNPNYLRQNLA 282
Query: 129 RKTFVQRNMFDFKHIRPFDKSFIDNPGP----------MVVFATPGMLHSGLSLIIFKKW 178
+K F ++F FD + I+ P V+ ++ GM+ G S +K
Sbjct: 283 KKIFKGNDIF-------FDDNVIEVEKPEMREEIIKESCVILSSSGMITGGPSQWYVEKL 335
Query: 179 APVENNMLIMPGFCVQGTIGHKVLSGV------KKLEFENKQIIDVKMAIEYMSFSAHAD 232
A E N++ + G+ + + G ++L + +K++ ++K+I+ + +I+ SAHAD
Sbjct: 336 AQDEKNLIAITGYQDEESPGRRLLDIMEESPENRKIKIDDKEIL-INASIDKFGLSAHAD 394
Query: 233 AKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGE 280
I+ L PK + LVHG+ + FL ++++++ + ++P+NG+
Sbjct: 395 MGEILALANSLYPKKIFLVHGDPEIINFLGKEVQKDLKSEVYIPSNGD 442
>gi|390960706|ref|YP_006424540.1| hypothetical protein CL1_0538 [Thermococcus sp. CL1]
gi|390519014|gb|AFL94746.1| hypothetical protein containing KH domain /beta-lactamase-domain
protein [Thermococcus sp. CL1]
Length = 648
Score = 132 bits (332), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 89/299 (29%), Positives = 149/299 (49%), Gaps = 25/299 (8%)
Query: 1 MFQVKVGN--QSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRE-- 56
+ + +GN +I TGD+ P R A R + L+ ESTY + D + RE
Sbjct: 347 IVHLHIGNGLHNIAITGDFKFIPTRLFEPAVSRFPRLETLVMESTYGGS-NDYQMPREEA 405
Query: 57 -RDFLKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATN 115
+ ++ +H+ + RGGKVLIP A+GRAQE+ ++LE Y ++ PIY G+ +AT
Sbjct: 406 EKRLIEVIHQTIRRGGKVLIPAMAVGRAQEIMMVLEEYARVGGIEVPIYLD-GMIWEATA 464
Query: 116 YYKMFITWTNQKIRKTFVQRNMFDFKHIRPFDKS---------FIDNPGPMVVFATPGML 166
+ + + ++ +R+ F + P KS ID+ P ++ AT GML
Sbjct: 465 IHTAYPEYLSRHLREQIFHEGYNPF--LNPIFKSVANSRERQDIIDSGEPAIIIATSGML 522
Query: 167 HSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKL----EFENKQIIDVKMAI 222
G S+ FK+ AP N ++ + +GT+G +V G++++ E +++ V M +
Sbjct: 523 VGGPSVEYFKQLAPDPKNSIVFVSYQAEGTLGRQVQRGLREIPLVGEDGKTEVVPVNMEV 582
Query: 223 EYM-SFSAHADAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPAN 278
+ FS HAD + ++ I P+ V+ VHGEA K + L I ++F + P N
Sbjct: 583 HTIDGFSGHADRRELISYIARLRPRPERVITVHGEAHKCLDLSTSIHKKFGISTRAPNN 641
>gi|345018855|ref|YP_004821208.1| RNA-metabolising metallo-beta-lactamase [Thermoanaerobacter
wiegelii Rt8.B1]
gi|344034198|gb|AEM79924.1| RNA-metabolising metallo-beta-lactamase [Thermoanaerobacter
wiegelii Rt8.B1]
Length = 829
Score = 132 bits (332), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 81/288 (28%), Positives = 151/288 (52%), Gaps = 26/288 (9%)
Query: 10 SIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFLKKVHECVDR 69
S+ Y+GD++ + A+I K RPD+ I ESTY + ++ E ++KV ++
Sbjct: 164 SVFYSGDFSGFRQNTIEGAFIPKLRPDVAIFESTYGDKLHANRELEETRLIEKVSSIINE 223
Query: 70 GGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMFITWTNQKI- 128
GGKV+IP FALGRAQE+ ++L+ + L+ +Y G+ + YK+ + Q +
Sbjct: 224 GGKVIIPAFALGRAQEIILILKKAINKGILKTKVYVD-GMVREVCRVYKLNPNYLRQNLA 282
Query: 129 RKTFVQRNMFDFKHIRPFDKSFIDNPGP----------MVVFATPGMLHSGLSLIIFKKW 178
+K F ++F FD + I P V+ ++ GM+ G S +K
Sbjct: 283 KKIFKGNDIF-------FDDNVIAVEKPEMREEIIKESCVILSSSGMITGGPSQWYVEKL 335
Query: 179 APVENNMLIMPGFCVQGTIGHKVLSGV------KKLEFENKQIIDVKMAIEYMSFSAHAD 232
A E N++ + G+ + + G ++L + +K++ ++K+I+ +K +I+ SAHAD
Sbjct: 336 AQDEKNLIAITGYQDEESPGRRLLDIMEESPENRKIKIDDKEIL-IKCSIDKFGLSAHAD 394
Query: 233 AKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGE 280
I+ L PK + LVHG+ + FL ++++++ + ++P+NG+
Sbjct: 395 MGEILALANSLYPKKIFLVHGDPEIINFLGKEVQKDLKSEVYIPSNGD 442
>gi|222445335|ref|ZP_03607850.1| hypothetical protein METSMIALI_00963 [Methanobrevibacter smithii
DSM 2375]
gi|261350104|ref|ZP_05975521.1| putative mRNA 3-end processing factor [Methanobrevibacter smithii
DSM 2374]
gi|222434900|gb|EEE42065.1| arCOG00543 universal archaeal KH-domain/beta-lactamase-domain
protein [Methanobrevibacter smithii DSM 2375]
gi|288860890|gb|EFC93188.1| putative mRNA 3-end processing factor [Methanobrevibacter smithii
DSM 2374]
Length = 636
Score = 132 bits (332), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 91/295 (30%), Positives = 157/295 (53%), Gaps = 17/295 (5%)
Query: 7 GNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYAT--TIRDSKRCRERDFLKKVH 64
G ++VYTGD+ R L A R + +I ESTY I+ S+ E++ +K ++
Sbjct: 343 GAHNLVYTGDFKYERSRLLEPATFRFPRAETVIMESTYGGREDIQPSRNSAEKEMMKTIY 402
Query: 65 ECVDRGGKVLIPVFALGRAQELCILLETYWER-MNLQAPIYFAVGLTEKATNYYKMFITW 123
+ + RGGKVL+PVFA+GRAQEL ++LE Y M + PIY G+ +AT + +
Sbjct: 403 KTLKRGGKVLVPVFAVGRAQELMVVLEEYMRHGMIDEVPIYID-GMIWEATAIHTARPEY 461
Query: 124 TNQKIRKTF--VQRNMF---DFKHIRPFD--KSFIDNPGPMVVFATPGMLHSGLSLIIFK 176
++ +R + RN F F ++ D K +++ P ++ +T GML G S+ FK
Sbjct: 462 LSKDLRDQIFHMGRNPFISDMFNKVQNLDQRKDIVESNSPAIILSTSGMLTGGNSVEYFK 521
Query: 177 KWAPVENNMLIMPGFCVQGTIGHKVLSGVKK--LEFENK-QIIDVKMAIEYMS-FSAHAD 232
+ N LI G+ +G++G ++ G K+ LE +NK ++ +VKM I+ ++ FS H++
Sbjct: 522 WLCEDDRNTLIFVGYQSEGSLGRRIQKGWKEVPLEEDNKTKVFNVKMEIKTINGFSGHSN 581
Query: 233 AKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQ 285
+ +M ++ P+ V+ HG+ K L I + + ++ P N +S +
Sbjct: 582 RRQLMDYVKRLNPRPDKVITCHGDPYKTTDLASSIHRSYKIETKSPVNLDSVRIH 636
>gi|385806123|ref|YP_005842521.1| beta-lactamase [Fervidicoccus fontis Kam940]
gi|383795986|gb|AFH43069.1| beta-lactamase domain protein [Fervidicoccus fontis Kam940]
Length = 619
Score = 132 bits (331), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 92/282 (32%), Positives = 146/282 (51%), Gaps = 16/282 (5%)
Query: 1 MFQVKVGN--QSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERD 58
M + +GN +IVYTGD+ + L A R + LI ESTY T + S+ E +
Sbjct: 314 MAHLHIGNGLHNIVYTGDFKYAHTKLLNKATDKFPRLETLIMESTYGETKQPSRAEAESN 373
Query: 59 FLKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQ-APIYFAVGLTEKATNYY 117
+ + + V RGGK+L+PV ++GR QE+ ++L + + LQ P+Y G+ + T +
Sbjct: 374 LINIIRKTVQRGGKILVPVMSVGRGQEIMLILSEAFSKGQLQDIPVYIE-GMVTEVTALH 432
Query: 118 KMFITWTNQKIRKTF-VQRNMF---DFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSL 172
+ +Q + K + N F +F ++ DK S PGP ++ AT GML+ G S+
Sbjct: 433 THYPELMSQSVEKAIHLGENPFMNKNFVVVQSKDKRSEALEPGPCIILATSGMLNGGPSV 492
Query: 173 IIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENK----QIIDVKMAIEYM-SF 227
K A N LI + V+GT+G K+ G K+L F N + I + M I + F
Sbjct: 493 EYLKSLAEDPKNSLIFVSYQVEGTLGRKIKDGQKELTFLNPDGKIETIKINMEIHSIEGF 552
Query: 228 SAHADAKGIMQLIQYCE--PKNVLLVHGEASKMVFLKEKIKQ 267
S H+D ++ I+ E PKN++L HGE S ++ L I++
Sbjct: 553 SGHSDKNELISFIENIEPKPKNIILNHGELSAIMSLYNSIEK 594
>gi|451818434|ref|YP_007454635.1| RNA-metabolising metallo-beta-lactamase [Clostridium
saccharoperbutylacetonicum N1-4(HMT)]
gi|451784413|gb|AGF55381.1| RNA-metabolising metallo-beta-lactamase [Clostridium
saccharoperbutylacetonicum N1-4(HMT)]
Length = 851
Score = 132 bits (331), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 81/289 (28%), Positives = 147/289 (50%), Gaps = 14/289 (4%)
Query: 10 SIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFLKKVHECVDR 69
++ Y+GD+++ R + A + K RPD+ I ESTY + ++ ER+ ++ V EC++
Sbjct: 166 AVFYSGDFSVFSQRTVEGAKLPKLRPDVAIVESTYGDRLHSNREVEERNLIEAVKECIEN 225
Query: 70 GGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYK---MFITWT-N 125
GK+LIP FALGRAQE+ ++L++ + ++ + G+ Y +F+ +
Sbjct: 226 NGKMLIPAFALGRAQEVILILKSAMNKGLIKNVKVYVDGMVRDINRVYNKNPLFLKNSLG 285
Query: 126 QKIRKT---FVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAPVE 182
+KI + F N+ + I + ++ P+++ ++ GML G S +K AP+E
Sbjct: 286 KKILRGASPFYDDNIMEI-MITDNREEILNQKEPVIIISSSGMLTGGQSAFYAEKIAPME 344
Query: 183 NNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQI------IDVKMAIEYMSFSAHADAKGI 236
N +I+ G+ + + G K+L + E + + I VK + + SAHAD I
Sbjct: 345 NGYIIITGYQDEESPGRKILELTEADENSERYLNINGLNIPVKCNVRKVGLSAHADKNEI 404
Query: 237 MQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQ 285
+++ +N++LVHG + L ++I EF F PA GE +
Sbjct: 405 KGVLERISARNIILVHGNEEVIRSLGKEISDEFIGRTFTPACGEEIVIN 453
>gi|148643098|ref|YP_001273611.1| metal-dependent RNase [Methanobrevibacter smithii ATCC 35061]
gi|148552115|gb|ABQ87243.1| predicted metal-dependent RNase [Methanobrevibacter smithii ATCC
35061]
Length = 636
Score = 132 bits (331), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 91/295 (30%), Positives = 157/295 (53%), Gaps = 17/295 (5%)
Query: 7 GNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYAT--TIRDSKRCRERDFLKKVH 64
G ++VYTGD+ R L A R + +I ESTY I+ S+ E++ +K ++
Sbjct: 343 GAHNLVYTGDFKYERSRLLEPATFRFPRAETVIMESTYGGREDIQPSRNSAEKEMMKTIY 402
Query: 65 ECVDRGGKVLIPVFALGRAQELCILLETYWER-MNLQAPIYFAVGLTEKATNYYKMFITW 123
+ + RGGKVL+PVFA+GRAQEL ++LE Y M + PIY G+ +AT + +
Sbjct: 403 KTLKRGGKVLVPVFAVGRAQELMVVLEEYMRHGMIDEVPIYID-GMIWEATAIHTARPEY 461
Query: 124 TNQKIRKTF--VQRNMF---DFKHIRPFD--KSFIDNPGPMVVFATPGMLHSGLSLIIFK 176
++ +R + RN F F ++ D K +++ P ++ +T GML G S+ FK
Sbjct: 462 LSKDLRDQIFHMGRNPFISDMFNKVQNLDQRKDIVESNSPAIILSTSGMLTRGNSVEYFK 521
Query: 177 KWAPVENNMLIMPGFCVQGTIGHKVLSGVKK--LEFENK-QIIDVKMAIEYMS-FSAHAD 232
+ N LI G+ +G++G ++ G K+ LE +NK ++ +VKM I+ ++ FS H++
Sbjct: 522 WLCEDDRNTLIFVGYQSEGSLGRRIQKGWKEVPLEEDNKTKVFNVKMEIKTINGFSGHSN 581
Query: 233 AKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQ 285
+ +M ++ P+ V+ HG+ K L I + + ++ P N +S +
Sbjct: 582 RRQLMDYVKRLNPRPDKVITCHGDPYKTTDLASSIHRSYKIETKSPVNLDSVRIH 636
>gi|327401049|ref|YP_004341888.1| KH-domain/beta-lactamase-domain-containing protein [Archaeoglobus
veneficus SNP6]
gi|327316557|gb|AEA47173.1| KH-domain/beta-lactamase-domain protein [Archaeoglobus veneficus
SNP6]
Length = 634
Score = 132 bits (331), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 89/303 (29%), Positives = 151/303 (49%), Gaps = 29/303 (9%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATT--IRDSKRCRERDF 59
F + G +I +TGD+ R A + R + LI E+TY + + ++R E
Sbjct: 336 FHIGEGLYNIAFTGDFKFERTRLFDRAETNFPRLEALIMEATYGGSQDFQPTRREAEEKL 395
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKM 119
+ ++E + GGKVLIP FA+GR+QE+ I LE L+ + G+ +AT +
Sbjct: 396 ISIINETLKNGGKVLIPTFAVGRSQEVMIALEEAIRNKKLEEVTVYLDGMIHEATAIHTA 455
Query: 120 FITWTNQKIRKTFVQRNMFDFKH-IRPF-DKSF------------IDNPGPMVVFATPGM 165
+ + N +R F H I PF +SF I+ P ++ AT GM
Sbjct: 456 YPEYLNASLRDLI-------FHHGINPFISESFVKVDSASKRQEVIEEESPSIILATSGM 508
Query: 166 LHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFEN---KQIIDVKMAI 222
L+ G + FK+ A E N LI G+ +GT+G ++ G +++ F + +++++V M I
Sbjct: 509 LNGGPVMEYFKRLAGDEKNTLIFVGYQAEGTLGRRIQKGWREVPFPSNGKREVVNVNMRI 568
Query: 223 EYM-SFSAHADAKGIMQLIQY--CEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANG 279
E + FS H+D K +M ++ +P+ V+ VHG+ +K V L I + + ++ P N
Sbjct: 569 ETVDGFSGHSDRKQLMSYVRALSSKPEKVITVHGDENKCVDLASSIYKTYRVETRAPYNL 628
Query: 280 ESC 282
E+
Sbjct: 629 ETV 631
>gi|288931295|ref|YP_003435355.1| KH-domain/beta-lactamase-domain protein [Ferroglobus placidus DSM
10642]
gi|288893543|gb|ADC65080.1| KH-domain/beta-lactamase-domain protein [Ferroglobus placidus DSM
10642]
Length = 634
Score = 132 bits (331), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 89/300 (29%), Positives = 153/300 (51%), Gaps = 23/300 (7%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATT--IRDSKRCRERDF 59
F + G ++ +TGD+ R A + R + L+ E+TY + + S++ E
Sbjct: 336 FHIGEGLYNVAFTGDFKFERTRLFDKAETNFPRLEALVMEATYGGSEDFQPSRKEAEEKL 395
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLE-TYWERMNLQAPIYFAVGLTEKATNYYK 118
++ +++ V RGGKVLIP FA+GR+QE+ I+LE E+ + P+Y G+ +AT +
Sbjct: 396 IEVINKTVSRGGKVLIPAFAVGRSQEVMIVLEEAIREKKIPEVPVYLD-GMIYEATAIHT 454
Query: 119 MFITWTNQKIRKTFVQRNMFDF---KHIRPFDKS----FIDNPGPMVVFATPGMLHSGLS 171
+ + N +R + + F +R S I++ P V+ AT GML+ G
Sbjct: 455 AYPEYLNAHLRDLIFHQGVNPFISESFVRVDSASKRDEVINDKSPAVIIATSGMLNGGPV 514
Query: 172 LIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFEN---KQIIDVKMAIEYM-SF 227
+ FK AP E N L+ G+ +GT+G ++ G K++ F + ++++ V M + + F
Sbjct: 515 MEYFKALAPDEKNTLVFVGYQAEGTLGRRIQKGWKEVPFPSNGKREVVYVNMEVVTVDGF 574
Query: 228 SAHADAKGIMQLIQYC-----EPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESC 282
S H+D + QLI Y +P+ VL VHG+ K + L I + F ++ P N E+
Sbjct: 575 SGHSDRR---QLINYVKSLSSKPEKVLTVHGDEGKCIELASAIYKTFRIETRAPMNLETV 631
>gi|307595786|ref|YP_003902103.1| KH-domain/beta-lactamase-domain-containing protein [Vulcanisaeta
distributa DSM 14429]
gi|307550987|gb|ADN51052.1| KH-domain/beta-lactamase-domain protein [Vulcanisaeta distributa
DSM 14429]
Length = 643
Score = 131 bits (330), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 92/306 (30%), Positives = 155/306 (50%), Gaps = 26/306 (8%)
Query: 1 MFQVKVGNQ--SIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATT--IRDSKRCRE 56
+ + +GN +I+YTGD P R L A R +LLI ESTY ++ ++ E
Sbjct: 337 VVHLHIGNGLYNIIYTGDMKYGPTRLLNPAHNKFKRAELLIMESTYGGKEDVQPPRQESE 396
Query: 57 RDFLKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNL-QAPIYFAVGLTEKATN 115
+ + V V+ GKVLIPVF+ GRAQE+ ++L L + PIY G+ + N
Sbjct: 397 QKLIDLVKHTVEHEGKVLIPVFSTGRAQEILLVLNEAINNKQLPKIPIYVD-GMVLETLN 455
Query: 116 YYKMFITWTNQKIRKTFVQR-NMFDFKHIRPFDKS------------FIDNPGPMVVFAT 162
+ MF + N+ +R+ N F ++++P +++ + P P V+ A
Sbjct: 456 VHLMFPDYLNRALREMIYDGVNPFLSEYVKPIERARDPEKRKEQVMEILQGP-PAVILAP 514
Query: 163 PGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLE---FENKQIIDVK 219
GML+ G + F A E NMLI + + TIG ++ G +KL + ++ +D+K
Sbjct: 515 HGMLNGGPIMDYFVHAAEDERNMLIFVSYQAENTIGRRIQQGERKLTVRYYSDRVTLDIK 574
Query: 220 MAIEYM-SFSAHADAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMP 276
M +E + FS H+D + ++ ++ EPK ++LVHGE SK++ L ++ + +
Sbjct: 575 MKVESIPGFSGHSDRRQLINYVRNMEPKPHKIMLVHGEPSKIMNLALTLELQLKIPTVFM 634
Query: 277 ANGESC 282
NGE+
Sbjct: 635 NNGETI 640
>gi|341581963|ref|YP_004762455.1| cleavage and polyadenylation specificity factor subunit-like
protein [Thermococcus sp. 4557]
gi|340809621|gb|AEK72778.1| cleavage and polyadenylation specificity factor subunit-like
protein [Thermococcus sp. 4557]
Length = 651
Score = 131 bits (329), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 90/297 (30%), Positives = 149/297 (50%), Gaps = 21/297 (7%)
Query: 1 MFQVKVGN--QSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRE-- 56
+ + +GN +I TGD+ P R A R + L+ ESTY + D + RE
Sbjct: 350 IVHLHIGNGLHNIAITGDFKFIPTRLFEPAVSRFPRLETLVMESTYGGS-NDYQMPREEA 408
Query: 57 -RDFLKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATN 115
+ ++ +H+ + RGGKVLIP A+GRAQE+ ++LE Y L+ PIY G+ +AT
Sbjct: 409 EKRLIEVIHQTIRRGGKVLIPAMAVGRAQEIMMVLEEYARIGGLEVPIYLD-GMIWEATA 467
Query: 116 YYKMFITWTNQKIRKTFVQR--NMFDFKHIRPFDKS-----FIDNPGPMVVFATPGMLHS 168
+ + + ++ +R+ N F +P S ID+ P ++ AT GML
Sbjct: 468 IHTAYPEYLSRHLREQIFHEGYNPFLNPIFKPVANSRERQDIIDSGEPAIIIATSGMLVG 527
Query: 169 GLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKL----EFENKQIIDVKMAIEY 224
G S+ FK+ A N ++ + +GT+G +V G++++ E ++++V M +
Sbjct: 528 GPSVEYFKQLASDPKNSMVFVSYQAEGTLGRQVQRGLREIPLVGEGGKTEVVNVNMEVHT 587
Query: 225 M-SFSAHADAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPAN 278
+ FS HAD + ++ I P+ V+ VHGEA K + L I ++F L P N
Sbjct: 588 IDGFSGHADRRELISYIARLRPRPERVITVHGEAHKCLDLSTSIHKKFGLSTRAPNN 644
>gi|374636367|ref|ZP_09707940.1| KH-domain/beta-lactamase-domain protein [Methanotorris formicicus
Mc-S-70]
gi|373559249|gb|EHP85554.1| KH-domain/beta-lactamase-domain protein [Methanotorris formicicus
Mc-S-70]
Length = 635
Score = 131 bits (329), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 104/298 (34%), Positives = 150/298 (50%), Gaps = 23/298 (7%)
Query: 7 GNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYAT--TIRDSKRCRERDFLKKVH 64
G +I YTGD R L A R + LI ESTY + I K E++ LK V
Sbjct: 342 GQYNIAYTGDIKFEASRLLEPAVCQFPRLETLIIESTYGSYDDILPEKEETEKELLKIVS 401
Query: 65 ECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMFITWT 124
E + RGGKVLIPVF +GRAQEL ++LE + + AP+Y G+ +AT + + +
Sbjct: 402 ETLQRGGKVLIPVFGVGRAQELMLVLEEGYNQGIFNAPVYLD-GMIWEATAIHTAYPEYL 460
Query: 125 NQKIRKTFVQR--NMF---DFKHIRPFD--KSFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ IR N F FK + + + ID P ++ AT GML G S+ FK
Sbjct: 461 SKVIRNRIFHEGDNPFLSEVFKRVGSTNERRRVIDGDEPCIILATSGMLTGGPSVEYFKN 520
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENK----QIIDVKMAIEYM-SFSAHAD 232
AP E N L+ G+ +GT+G KV G K++ K + I +KM I + FS H+D
Sbjct: 521 LAPDEKNTLVFVGYQAEGTLGRKVQKGWKEIPITTKNGKTKSIPIKMDIHTLEGFSGHSD 580
Query: 233 AKGIMQLIQYCE-----PKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQ 285
K QLI+Y P+ +L++HGE +K + L + F + P N +S V+
Sbjct: 581 RK---QLIKYLRKLKPMPERILMIHGEINKCIDLASTAYKLFKRETRAPMNLDSIRVK 635
>gi|41614872|ref|NP_963370.1| hypothetical protein NEQ076 [Nanoarchaeum equitans Kin4-M]
gi|40068596|gb|AAR38931.1| NEQ076 [Nanoarchaeum equitans Kin4-M]
Length = 635
Score = 130 bits (328), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 85/282 (30%), Positives = 144/282 (51%), Gaps = 8/282 (2%)
Query: 7 GNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFLKKVHEC 66
G +++YTGDY + P A I + + ++ ITESTY T + E++FL V E
Sbjct: 352 GYSNVLYTGDYKVKPTFLFDGAQIPQAKVNIAITESTYGDTYHKPREEVEKEFLSFVKEV 411
Query: 67 VDRGGKVLIPVFALGRAQELCILLETYWERMNL-QAPIYFAVGLTEKATNYYKMFITWTN 125
+ R GK+LIPV +GRAQE+ LL L + PIY G+ + T + + +
Sbjct: 412 IKRKGKLLIPVLGVGRAQEILYLLVKSIRANKLDEVPIYLD-GVVWEITAIHTAYPEYLK 470
Query: 126 QKIR-KTFVQRNMF--DFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKKWAPV 181
++IR K +N F F H P ++ + + P ++ AT GML G S+ K A
Sbjct: 471 EEIRNKILAGKNPFIDKFVHRAPRNREEIVYSTEPSIILATSGMLVGGPSVQYLKLMAED 530
Query: 182 ENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIE-YMSFSAHADAKGIMQLI 240
N + + T+G ++ SGV+++E + Q+I + + + + FS HAD I +
Sbjct: 531 PKNAIAFVSYQGPNTLGRQIQSGVREVEIDG-QLIKINLEVRSFEGFSGHADKGEIESFL 589
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESC 282
+ P V++ HGE SK++ L ++++ N++ P N ++
Sbjct: 590 RRVRPGRVVINHGERSKVLHLASYLRKKLNVETNAPRNMDAL 631
>gi|11498093|ref|NP_069318.1| mRNA 3'-end processing factor [Archaeoglobus fulgidus DSM 4304]
gi|2650146|gb|AAB90756.1| mRNA 3'-end processing factor, putative [Archaeoglobus fulgidus DSM
4304]
Length = 632
Score = 130 bits (328), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 85/302 (28%), Positives = 146/302 (48%), Gaps = 27/302 (8%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYA--TTIRDSKRCRERDF 59
F + G+ +I +TGD+ R A + R + L+ E+TY + S++ E
Sbjct: 334 FHIGEGHYNIAFTGDFKFEKTRLFDRAATNFPRLEALVMEATYGGPNDFQPSRKEAEERL 393
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKM 119
++ ++ +DRGGKVLIP FA+GR+QE+ I+LE L+ + G+ +AT +
Sbjct: 394 IEVINRTLDRGGKVLIPTFAVGRSQEVMIVLEEAMREKRLRETYVYLDGMIYEATAIHTA 453
Query: 120 FITWTNQKIRKTFVQRNMFDFKHIRPF--------DKS-----FIDNPGPMVVFATPGML 166
+ + N ++ R++ + I PF D S I +P P ++ AT GML
Sbjct: 454 YPEYLNAQL------RDLIFYHGINPFISENFVRVDSSSKREEVISDPSPSIIIATSGML 507
Query: 167 HSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEF----ENKQIIDVKMAI 222
+ G + F+ A E N ++ G+ +GT+G K+ G K++ F + +
Sbjct: 508 NGGPVMEYFRHLAEDERNTIVFVGYQAEGTLGRKIQKGWKEVPFPVDGRREVVEVKMEVE 567
Query: 223 EYMSFSAHADAKGIMQLIQY--CEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGE 280
FS H+D K +M I+Y +P+ V VHG+ SK + L I + + ++ P N E
Sbjct: 568 TVDGFSGHSDRKQLMNYIRYLNSKPEKVATVHGDESKCIDLASSIYKTYRIETRAPMNLE 627
Query: 281 SC 282
+
Sbjct: 628 TI 629
>gi|332796323|ref|YP_004457823.1| beta-lactamase domain-containing protein [Acidianus hospitalis W1]
gi|332694058|gb|AEE93525.1| beta-lactamase domain protein [Acidianus hospitalis W1]
Length = 638
Score = 130 bits (327), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 80/272 (29%), Positives = 145/272 (53%), Gaps = 15/272 (5%)
Query: 7 GNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFLKKVHEC 66
G +IVYTGD+ R L A + + D LI E+TY + ++ E+ + ++
Sbjct: 341 GTHNIVYTGDFKYAKTRLLDRATSEFPKVDTLIMETTYGAQEQTNREESEKQLIDIINNT 400
Query: 67 VDRGGKVLIPVFALGRAQELCILLETYWE-RMNLQAPIYFAVGLTEKATNYYKMFITWTN 125
++RGGKVLIPV A+GR QE+ +++ + ++ + P+Y GL ++ T + + W +
Sbjct: 401 INRGGKVLIPVLAVGRGQEMMLVINNALKNKLIPEVPVY-VTGLFDEVTAIHTAYPEWLS 459
Query: 126 QKIRKTFVQR--NMF---DFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
+++R + R N F FK I + + I P ++ AT GML+ G ++ FK AP
Sbjct: 460 REVRDAILYRDENPFVSDQFKRIEGY-REDIAQGEPSIILATSGMLNGGPAVEFFKAMAP 518
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENK----QIIDVKMAIEYM-SFSAHADAKG 235
N ++ + +GT+G KV G K+++ + + I + M ++ + FS H+D +
Sbjct: 519 DSKNSIVFVSYQAEGTLGRKVRDGAKEVQIIGRDGRVENIQINMEVKVVDGFSGHSDRRQ 578
Query: 236 IMQLIQ--YCEPKNVLLVHGEASKMVFLKEKI 265
+++ + +PKN++L HGE + M+ K I
Sbjct: 579 LLKFLHDLTPKPKNIILDHGEYNSMMTFKRLI 610
>gi|428307781|ref|YP_007144606.1| beta-lactamase [Crinalium epipsammum PCC 9333]
gi|428249316|gb|AFZ15096.1| beta-lactamase domain protein [Crinalium epipsammum PCC 9333]
Length = 894
Score = 130 bits (327), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 85/263 (32%), Positives = 137/263 (52%), Gaps = 18/263 (6%)
Query: 4 VKVGNQSIVYTGDYNMTPDRHL-GAAWIDKCRPDLLITESTYATTIRDSKRCRERDFLKK 62
++ G +S++YTGDYN T R G D + D+LITESTY S++ +E + L+
Sbjct: 462 IRYGERSLLYTGDYNTTNSRTTDGLRLADLPQADILITESTYGADTHPSRKAQETELLQA 521
Query: 63 VHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMFIT 122
V E V GG VLIP FALGRAQE+ + + T L+ P+Y GL T+ ++ +
Sbjct: 522 VAEVVSSGGNVLIPAFALGRAQEIILAIRTSALFHKLKIPVYID-GLVRAVTDVFRDHLD 580
Query: 123 WTNQKIRKTFVQRN-MFDFKHIRPFDKSFIDNP--------GPMVVFATPGMLHSGLSLI 173
+ ++ +Q+ FD P I +P P V+ A+ GML G S+
Sbjct: 581 FLPGSVQNFALQQEPFFDPNGKPPIIP--IGHPRERPLAIAKPSVIIASSGMLSGGASVY 638
Query: 174 IFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKK---LEFENKQIIDVKMAIEYMSFSAH 230
+ +N + + G+ + + G + L G++K +E E K++I V+ + + SAH
Sbjct: 639 YGQALLERDNAAIFISGYTDEESPG-RFLQGLQKEDEVEIEGKKLI-VRATVRRFNLSAH 696
Query: 231 ADAKGIMQLIQYCEPKNVLLVHG 253
AD G+ Q+I PK+++L+HG
Sbjct: 697 ADKVGLTQVIHRVNPKHLILIHG 719
>gi|159041038|ref|YP_001540290.1| beta-lactamase domain-containing protein [Caldivirga maquilingensis
IC-167]
gi|157919873|gb|ABW01300.1| beta-lactamase domain protein [Caldivirga maquilingensis IC-167]
Length = 636
Score = 130 bits (327), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 95/304 (31%), Positives = 158/304 (51%), Gaps = 26/304 (8%)
Query: 3 QVKVGNQ--SIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATT--IRDSKRCRERD 58
+ +GN +IVYTGD+ P R L A R +LLI ESTY ++ + E +
Sbjct: 332 HLHIGNGLYNIVYTGDFKYGPSRLLNPAHSIFKRVELLIMESTYGGKDDVQKPREEAEAE 391
Query: 59 FLKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNL-QAPIYFAVGLTEKATNYY 117
++ V + ++ GGKVLIPVF+ GRAQE+ LL + L +APIY G+ + N +
Sbjct: 392 LIQLVGKTLENGGKVLIPVFSTGRAQEILFLLNDSMQAGKLTKAPIYVD-GMVLQTLNVH 450
Query: 118 KMFITWTNQKIRKTFVQR-NMFDFKHIRPFDKS------------FIDNPGPMVVFATPG 164
MF + N +++ N F ++++P +++ + P P ++ A G
Sbjct: 451 LMFPDYLNNNVKELIYDGVNPFISEYVKPVERARNPDKRQEQVMDILQGP-PSIILAPHG 509
Query: 165 MLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKK--LEFENKQI-IDVKMA 221
ML G ++ F A N LI + +GT+G +++ G +K L++ ++ I ++V M
Sbjct: 510 MLSGGPAMDYFVHMADDAKNSLIFVSYQGEGTLGRRLIQGERKVNLKYYDQDITVNVNMN 569
Query: 222 IEY-MSFSAHADAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPAN 278
+ + FS H+D K +M + EPK V+LVHGE SK++ L I+ ++ +D M +
Sbjct: 570 VFHEPGFSGHSDRKQLMNYVGRIEPKPHKVMLVHGEPSKLMNLALSIELKYKIDTVMLNH 629
Query: 279 GESC 282
ES
Sbjct: 630 LESI 633
>gi|375082947|ref|ZP_09729988.1| Cleavage and polyadenylation specificity factor subunit like
protein [Thermococcus litoralis DSM 5473]
gi|374742377|gb|EHR78774.1| Cleavage and polyadenylation specificity factor subunit like
protein [Thermococcus litoralis DSM 5473]
Length = 651
Score = 130 bits (327), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 94/297 (31%), Positives = 145/297 (48%), Gaps = 21/297 (7%)
Query: 1 MFQVKVGN--QSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRE-- 56
+ + VGN +I TGD+ P R A R + LI ESTY + RD + RE
Sbjct: 350 IVHLHVGNGLHNIAVTGDFKFIPTRLFEPANAKFPRLETLIMESTYGGS-RDYQMPREEA 408
Query: 57 -RDFLKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATN 115
+ ++ + + + R GKVLIP A+GRAQE+ I LE Y L PIY G+ +AT
Sbjct: 409 EKRLIEVILQTIKRKGKVLIPAMAVGRAQEIMIALEDYARVGGLDVPIYLD-GMIWEATA 467
Query: 116 YYKMFITWTNQKIRKTFVQR--NMFDFKHIRPF-----DKSFIDNPGPMVVFATPGMLHS 168
+ + + ++ +R N F + +P K I++ P ++ A+ GML
Sbjct: 468 IHTAYPEYLSKNLRNQIFHEGYNPFLNEIFKPVANANERKDIIESEEPAIIIASSGMLVG 527
Query: 169 GLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKL----EFENKQIIDVKMAIEY 224
G S+ FK AP N LI + +GT+G +V G++++ E ++I V M I
Sbjct: 528 GPSVEYFKHLAPDPRNSLIFVSYQAEGTLGRQVQRGLREIPTVGEGGKTEVIQVNMEIHT 587
Query: 225 M-SFSAHADAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPAN 278
+ FS HAD + +M + +P+ V+ VHGE K + L I ++F L P N
Sbjct: 588 IDGFSGHADRRELMSYVARVKPRPERVITVHGEPQKCLDLASSIHKKFGLSTRAPNN 644
>gi|284161558|ref|YP_003400181.1| RNA-metabolising metallo-beta-lactamase [Archaeoglobus profundus
DSM 5631]
gi|284011555|gb|ADB57508.1| RNA-metabolising metallo-beta-lactamase [Archaeoglobus profundus
DSM 5631]
Length = 631
Score = 130 bits (327), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 86/302 (28%), Positives = 156/302 (51%), Gaps = 29/302 (9%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATT--IRDSKRCRERDF 59
F + G +I +TGD+ R A + R + LI E+TY + + S++ E +
Sbjct: 333 FHIGEGLYNIAFTGDFKFERTRLFDKAETNFPRLEALIMEATYGGSQDFQPSRKEAEENL 392
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNL-QAPIYFAVGLTEKATNYYK 118
++ V+E + +GGKVLIP FA+GR+QE+ I+LE + + P+Y G+ +AT +
Sbjct: 393 IRIVNETISKGGKVLIPAFAVGRSQEVMIVLEEAIRTKKIDEVPVYID-GMIYEATAIHT 451
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPF-DKSF------------IDNPGPMVVFATPGM 165
+ + N + R++ + I PF +SF ID+ P V+ +T GM
Sbjct: 452 AYPEYLNAHL------RDLIFHQGINPFISESFVRVDSPSKRQEVIDSSEPCVIISTSGM 505
Query: 166 LHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFE---NKQIIDVKMAI 222
L+ G + F+ A E N ++ G+ +GT+G ++ G K++ F ++++ V M +
Sbjct: 506 LNGGPVMEYFRALAGDEKNTIVFVGYQAEGTLGRRIQKGWKEVPFTVDGRREVVKVNMRV 565
Query: 223 EYM-SFSAHADAKGIMQLIQYC--EPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANG 279
E + FS H+D + +M ++ +P+ ++ VHGE SK + L I + + ++ P N
Sbjct: 566 ETVDGFSGHSDRRQLMNYVKALSDKPEKIITVHGEESKCIDLASSIYKTYKIETRAPMNL 625
Query: 280 ES 281
E+
Sbjct: 626 ET 627
>gi|255513556|gb|EET89822.1| beta-lactamase domain protein [Candidatus Micrarchaeum acidiphilum
ARMAN-2]
Length = 419
Score = 130 bits (326), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 85/280 (30%), Positives = 147/280 (52%), Gaps = 13/280 (4%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYA--TTIRDSKRCRERD 58
+ +V G +++YTGD N+ + L A + D+LITESTY T I +S+ +
Sbjct: 129 LIRVTDGKNTLIYTGDINLAKTKLLNGADLKNLSGDVLITESTYGGKTDIFESEVSILKS 188
Query: 59 FLKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNL-QAPIYFAVGLTEKATNYY 117
+ + + + GGKV+IP FA+GRAQE+ + ++ Y + + PIY G+ KA +
Sbjct: 189 LVYGIKDTIKNGGKVVIPSFAVGRAQEVLLSIDDYINSGAIPKVPIYVD-GMINKALRIH 247
Query: 118 KMFITWTNQKIRKTFVQRNMFDFK--HIRPFDK-----SFIDNPGPMVVFATPGMLHSGL 170
+ + + ++++ + + FK + P +K + + + ++ T GM+ G
Sbjct: 248 RHNVIYCRKELQSRILMSDFDPFKSPNFVPVEKKGTRNAIVTSDESSIIVTTSGMITGGP 307
Query: 171 SLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAH 230
+K A N LI+ G+ +GTIG + G K +E + K+ I ++M+IE SAH
Sbjct: 308 ITFYLQKMAGNSLNKLILVGYQAEGTIGRMLQDGEKSIEIDGKR-IKIEMSIEMHHLSAH 366
Query: 231 ADAKGIMQLIQYCEP-KNVLLVHGEASKMVFLKEKIKQEF 269
AD K + Q+ Q K + +VHGE SK + LKE++ + F
Sbjct: 367 ADRKQLEQIPQRIAGIKKIFIVHGEYSKALSLKEELSKRF 406
>gi|325969789|ref|YP_004245981.1| beta-lactamase [Vulcanisaeta moutnovskia 768-28]
gi|323708992|gb|ADY02479.1| beta-lactamase domain protein [Vulcanisaeta moutnovskia 768-28]
Length = 650
Score = 129 bits (325), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 91/301 (30%), Positives = 153/301 (50%), Gaps = 26/301 (8%)
Query: 6 VGNQ--SIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATT--IRDSKRCRERDFLK 61
+GN +I+YTGD P R L A R +LLI ESTY I+ ++ E+ ++
Sbjct: 349 IGNGLYNIIYTGDMKYGPTRLLNPAHNKFKRAELLIMESTYGGKEDIQPPRQESEQKLVE 408
Query: 62 KVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNL-QAPIYFAVGLTEKATNYYKMF 120
V V+ GKVLIPVF+ GRAQE+ ++L L + PIY G+ + N + MF
Sbjct: 409 LVKHTVEHDGKVLIPVFSTGRAQEILLVLNEAINNKQLSKVPIYVD-GMVLETLNVHLMF 467
Query: 121 ITWTNQKIRKTFVQR-NMFDFKHIRPFDKS------------FIDNPGPMVVFATPGMLH 167
+ N+ +R+ N F ++++P +++ + P P V+ A GML+
Sbjct: 468 PDYLNRTLRELIYDGVNPFLSEYVKPIERARDPEKRKEQVMEILQGP-PAVILAPHGMLN 526
Query: 168 SGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLE---FENKQIIDVKMAIEY 224
G + F A E NML+ + + TIG ++ G +KL + ++ +D+KM ++
Sbjct: 527 GGPIMDYFVHAAEDERNMLLFVSYQAENTIGRRIQQGERKLTVRYYSDRVTLDIKMKVDS 586
Query: 225 M-SFSAHADAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGES 281
+ FS H+D + + ++ EPK V+LVHGE +K++ L ++ + + NGE+
Sbjct: 587 IPGFSGHSDRRQLFNYVRNMEPKPHKVMLVHGEPAKIMNLALALELQLKMPTVFMNNGET 646
Query: 282 C 282
Sbjct: 647 I 647
>gi|330835859|ref|YP_004410587.1| beta-lactamase domain-containing protein [Metallosphaera cuprina
Ar-4]
gi|329567998|gb|AEB96103.1| beta-lactamase domain-containing protein [Metallosphaera cuprina
Ar-4]
Length = 638
Score = 129 bits (325), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 81/278 (29%), Positives = 152/278 (54%), Gaps = 21/278 (7%)
Query: 7 GNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFLKKVHEC 66
G ++VYTGD+ R L A + R D LI E+TY + ++ E+ ++ ++
Sbjct: 342 GVHNLVYTGDFKYARTRLLDRAVSEFPRVDTLIMETTYGVQEQTNRDQSEKKLIETINNT 401
Query: 67 VDRGGKVLIPVFALGRAQELCILL-ETYWERMNLQAPIYFAVGLTEKATNYYKMFITWTN 125
++RGGKVLIPV A+GR QE+ +++ + ++M + P+Y GL ++ T + + W
Sbjct: 402 INRGGKVLIPVLAVGRGQEIMLVINDAMKKKMIPEVPVY-VTGLFDEVTAIHTAYPEWLG 460
Query: 126 QKIRKT--FVQRNMFD---FKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
+++R + F N F FK I + + + + P ++ AT GML+ G ++ FK+ AP
Sbjct: 461 KEVRDSILFKDENPFTSDLFKRIEGY-REDVAHGEPSIILATSGMLNGGPAVEFFKELAP 519
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEF--ENKQIIDVKMAIEYM---SFSAHADAKG 235
N L+ + +GT+G KV G ++++ + ++ ++K+ +E FS H+D
Sbjct: 520 DSRNSLVFVSYQAEGTLGRKVRDGAREIQIIGRDGRVDNIKVNLETTPIDGFSGHSDK-- 577
Query: 236 IMQLIQYC-----EPKNVLLVHGEASKMVFLKEKIKQE 268
QL+ + +P+NV+L HGEA+ + K+ ++++
Sbjct: 578 -WQLLNFLGDITPKPRNVILNHGEANAIREFKKVVERQ 614
>gi|434385192|ref|YP_007095803.1| putative exonuclease of the beta-lactamase fold involved in RNA
processing [Chamaesiphon minutus PCC 6605]
gi|428016182|gb|AFY92276.1| putative exonuclease of the beta-lactamase fold involved in RNA
processing [Chamaesiphon minutus PCC 6605]
Length = 885
Score = 129 bits (324), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 87/264 (32%), Positives = 135/264 (51%), Gaps = 10/264 (3%)
Query: 4 VKVGNQSIVYTGDYNMTPDRHL-GAAWIDKCRPDLLITESTYATTIRDSKRCRERDFLKK 62
+K GN+SI+YTGDYN T R G D + D+LITESTY + S+R +E D +K
Sbjct: 445 MKYGNRSILYTGDYNTTSSRTAEGLKLADLPQADILITESTYGSDTHPSRRTQESDLIKA 504
Query: 63 VHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKM--- 119
+ E V GG VLIP FALGRAQE+ + + T ++ P+Y GL + T+ ++
Sbjct: 505 IVEVVQAGGNVLIPAFALGRAQEIILAIRTSALFHSINVPVYVD-GLVREVTDLFQTQLE 563
Query: 120 FITWTNQKIRKT----FVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIF 175
+ + Q KT F +++ I + + P V+ A+ GML G S+
Sbjct: 564 LLPTSVQNFAKTQSPFFSEKSSPRIISIASPKERPLAIAHPSVIIASSGMLTGGASIGYA 623
Query: 176 KKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFE-NKQIIDVKMAIEYMSFSAHADAK 234
K EN + + G+ + + G + S E E + + V+ I+ + SAHAD
Sbjct: 624 KILLERENAAVFISGYTDEESPGRFLQSLEPGSEIELDGTALTVRAKIQRFNLSAHADRV 683
Query: 235 GIMQLIQYCEPKNVLLVHGEASKM 258
GI Q+I P++++L+HG S +
Sbjct: 684 GITQVIHRVNPQHLILIHGSQSAL 707
>gi|312136618|ref|YP_004003955.1| kh-domain/beta-lactamase-domain-containing protein [Methanothermus
fervidus DSM 2088]
gi|311224337|gb|ADP77193.1| KH-domain/beta-lactamase-domain protein [Methanothermus fervidus
DSM 2088]
Length = 650
Score = 129 bits (323), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 91/298 (30%), Positives = 149/298 (50%), Gaps = 26/298 (8%)
Query: 7 GNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATT--IRDSKRCRERDFLKKVH 64
G ++VYTGD+ R L A D R + L+ ESTY I+ S+ E+ +K ++
Sbjct: 358 GQHNLVYTGDFKYEKTRLLEPAVTDFPRLETLVMESTYGGKEDIQPSREMAEKRLIKVIY 417
Query: 65 ECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMFITWT 124
+ + RGGKVLIPVFA+GRAQEL ++L+ Y + L+ F G+ +AT + T
Sbjct: 418 DTIKRGGKVLIPVFAVGRAQELMVVLDEYIKSGILEEIPIFVDGMVWEATAIH----TTR 473
Query: 125 NQKIRKTFVQRNMFDFKHIRPFDKSFIDNPG------------PMVVFATPGMLHSGLSL 172
+ + KT + +F + + PF + P V+ +T GML G S+
Sbjct: 474 PEYLSKTLSDQ-IFHYGY-NPFTSEIFNKVSSSDERKDVVEGEPCVILSTSGMLAGGNSV 531
Query: 173 IIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENK---QIIDVKMAIEYM-SFS 228
FK E N L+ G+ +G++G ++ + + FE ++ VKM IE + FS
Sbjct: 532 EYFKWLCEDEKNTLVFVGYQAEGSLGRRLQKNRRDITFEENGKMKVYKVKMQIETIEGFS 591
Query: 229 AHADAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFV 284
H+D K +M+ + PK +L+ HG A K + L I +++ ++ P N E+ +
Sbjct: 592 GHSDRKQLMKYVHNLSPKPEKILVCHGNAYKSLDLASSIYKKYRIETRAPLNLEAVRI 649
>gi|15669421|ref|NP_248231.1| putative mRNA 3'-end processing factor 2 [Methanocaldococcus
jannaschii DSM 2661]
gi|2496166|sp|Q58633.1|Y1236_METJA RecName: Full=Uncharacterized protein MJ1236
gi|1591868|gb|AAB99240.1| putative mRNA 3'-end processing factor 2 [Methanocaldococcus
jannaschii DSM 2661]
Length = 634
Score = 129 bits (323), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 97/298 (32%), Positives = 154/298 (51%), Gaps = 23/298 (7%)
Query: 7 GNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYAT--TIRDSKRCRERDFLKKVH 64
G ++ YTGD R L A R + LI ESTY + + ER+ L+ V
Sbjct: 341 GLYNLAYTGDIKFETSRLLEPAVCQFPRLETLIIESTYGAYDDVLPEREEAERELLRVVS 400
Query: 65 ECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMFITWT 124
E DRGGKVLIPVF +GRAQEL ++LE + + AP+Y G+ +AT + + +
Sbjct: 401 ETTDRGGKVLIPVFGVGRAQELMLVLEEGYNQGIFNAPVYLD-GMIWEATAIHTAYPEYL 459
Query: 125 NQKIRKTFVQR--NMF---DFKHIRPFD--KSFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++++R+ N F FK + + + ID+ P V+ AT GML G S+ K
Sbjct: 460 SKEMRQKIFHEGDNPFLSEVFKRVGSTNERRKVIDSDEPCVILATSGMLTGGPSVEYLKH 519
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEF--ENKQIIDVKMAIEYMS---FSAHAD 232
AP E N +I G+ +GT+G KV SG K++ N + + + ++ + FS H+D
Sbjct: 520 LAPDEKNAIIFVGYQAEGTLGRKVQSGWKEIPIITRNGKTKSIPINLQVYTIEGFSGHSD 579
Query: 233 AKGIMQLIQYCE-----PKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQ 285
K QLI+Y P+ +++VHGE SK + + +++ F ++P N ++ V+
Sbjct: 580 RK---QLIKYIRRLKPSPEKIIMVHGEESKCLDFADTVRRLFKKQTYVPMNLDAIRVK 634
>gi|315231930|ref|YP_004072366.1| polyadenylation specificity factor-like protein [Thermococcus
barophilus MP]
gi|315184958|gb|ADT85143.1| polyadenylation specificity factor-like protein [Thermococcus
barophilus MP]
Length = 648
Score = 129 bits (323), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 88/297 (29%), Positives = 147/297 (49%), Gaps = 21/297 (7%)
Query: 1 MFQVKVGN--QSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRE-- 56
+ + +GN ++ TGD+ P R A R + LI ESTY + D + RE
Sbjct: 347 IVHLHIGNGLHNVAVTGDFKFIPTRLFEPANARFPRLETLIMESTYGGS-NDYQMPREDA 405
Query: 57 -RDFLKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATN 115
+ ++ +H+ + R GKVLIP A+GRAQE+ ++LE Y ++ PIY G+ +AT
Sbjct: 406 EKKLIEVIHQTIKRKGKVLIPAMAVGRAQEIMMVLEEYARVGGIEVPIYLD-GMIWEATA 464
Query: 116 YYKMFITWTNQKIRKTFVQR--NMFDFKHIRPFDKS-----FIDNPGPMVVFATPGMLHS 168
+ + + ++ +R N F + +P S ID+ P ++ A+ GML
Sbjct: 465 IHTAYPEYLSKHLRDQIFHEGYNPFLNEIFKPVANSKERQDIIDSEEPAIIIASSGMLVG 524
Query: 169 GLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKL----EFENKQIIDVKMAIEY 224
G S+ FK+ AP N +I + +GT+G +V G+K++ E ++I V M +
Sbjct: 525 GPSVEYFKQLAPDPRNSIIFVSYQAEGTLGRQVQRGLKEIPTIGEGGRTEVIRVNMEVHT 584
Query: 225 M-SFSAHADAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPAN 278
+ FS HAD + +M + P+ V+ VHGE K + L + + F++ P N
Sbjct: 585 IDGFSGHADRRELMSYVAKVRPRPERVITVHGEPQKCLDLASSLHKRFSISTRAPNN 641
>gi|409039929|gb|EKM49418.1| hypothetical protein PHACADRAFT_167242 [Phanerochaete carnosa
HHB-10118-sp]
Length = 702
Score = 128 bits (322), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 89/336 (26%), Positives = 143/336 (42%), Gaps = 66/336 (19%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF + + I+YTGDY+ D+HL A + PD L +E F
Sbjct: 137 MFLIDIAGLKILYTGDYSREEDQHLVKAEVPPVHPDGL--------------EEKELQFT 182
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFAVGLTEKATNYYK 118
VH + RGG VL+P F LGRAQEL ++L+ YW++ P+Y+A L K Y+
Sbjct: 183 NSVHNFIQRGGHVLLPTFTLGRAQELLLILDEYWKKHPDLHNVPVYYASSLAGKCMAMYQ 242
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKW 178
+I N + F +RN PF +F
Sbjct: 243 TYIRTMNSNVHFCFAKRN-------NPF---------------------------VFNFE 268
Query: 179 APVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQ 238
P N I+ G+ ++GT+ + + ++ + +I K++I+ +SF AH D +
Sbjct: 269 PPDSRNGTILTGYSIEGTMVRDIQTEPDEIPSVKEHMIPHKLSIDEISFGAHVDYSQNSE 328
Query: 239 LIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNL 298
I+ ++++LVHGE + M L+ + + C+ D + I NL
Sbjct: 329 FIELVHTQHIVLVHGEQNAMGRLRAAL--------------QDCYKNRDEDVKIHTPRNL 374
Query: 299 LKEEAVKYNSEPPNPLKERQIHGVLVIKDSSISLMD 334
+ ++ PP P I G+LV KD S +L+D
Sbjct: 375 EPLQLTFHSDLPPQP--NEVISGLLVSKDYSYTLLD 408
>gi|336121952|ref|YP_004576727.1| KH-domain/beta-lactamase-domain-containing protein
[Methanothermococcus okinawensis IH1]
gi|334856473|gb|AEH06949.1| KH-domain/beta-lactamase-domain protein [Methanothermococcus
okinawensis IH1]
Length = 635
Score = 128 bits (322), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 92/291 (31%), Positives = 146/291 (50%), Gaps = 23/291 (7%)
Query: 7 GNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYAT--TIRDSKRCRERDFLKKVH 64
G ++ YTGD R L A R + +I ESTY + + E++FL V
Sbjct: 342 GLYNVAYTGDIKFEASRLLEPAVCQFPRLETIIIESTYGGYDDVLPDRAETEKEFLNVVA 401
Query: 65 ECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMFITWT 124
E + R GKV+IPVF +GRAQEL ++LE + + AP+Y G+ +AT + + +
Sbjct: 402 ETIKRKGKVIIPVFGIGRAQELMLVLEEGYNQGIFNAPVYLD-GMIWEATAIHTAYPEYL 460
Query: 125 NQKIRKTFVQR--NMF---DFKHIRPFD--KSFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ IR N F F+ ++ + ++ ID+ P ++ T GML G S+ FK
Sbjct: 461 SKNIRNRIFHEGDNPFLSEVFRKVKNTNNRRNIIDSNEPCIILTTSGMLSGGPSVEYFKS 520
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENK----QIIDVKMAIEYM-SFSAHAD 232
A E N ++ G+ +GT+G K+ G K++ N+ + + V +++ + FS H+D
Sbjct: 521 LAEDEKNAIVFVGYQAEGTLGRKIQRGWKEIPIMNRNGKSKAVKVNLSVHTLEGFSGHSD 580
Query: 233 AKGIMQLIQYCE-----PKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPAN 278
K QLI+Y P+ +L VHGEASK V L + F + P N
Sbjct: 581 RK---QLIKYLRKLKPMPERILTVHGEASKCVDLASTAYKLFKKETRAPMN 628
>gi|332799023|ref|YP_004460522.1| beta-lactamase domain-containing protein [Tepidanaerobacter
acetatoxydans Re1]
gi|332696758|gb|AEE91215.1| beta-lactamase domain protein [Tepidanaerobacter acetatoxydans Re1]
Length = 831
Score = 128 bits (321), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 84/293 (28%), Positives = 148/293 (50%), Gaps = 19/293 (6%)
Query: 6 VGNQ-SIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFLKKVH 64
VG++ S+ Y+GD + + A + K RPD+ I ESTY + +++ E+ ++ ++
Sbjct: 159 VGSEGSLFYSGDISGFRQNTIEGATVPKLRPDVAIFESTYGDKLHANRQVEEQRLIESIN 218
Query: 65 ECVDRGGKVLIPVFALGRAQELCILLETYWERMN-LQAPIYFAVGLTEKATNYYKMFITW 123
E + GGKVL+P FALGRAQE+ ++L+ R L PIY G+ + YK+ +
Sbjct: 219 EVIINGGKVLVPAFALGRAQEVILILKKAMNRGQLLSCPIYVD-GMVKDICRIYKLNPNY 277
Query: 124 TNQKI-RKTFVQRNMFDFKHIRP-----FDKSFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
+ +K F ++F ++ P F K +++ P V+ A+ GML G S + +
Sbjct: 278 LRSNLAKKVFRGVDIFYDDNVSPVANPEFRKEILESKNPCVIIASSGMLTGGPSQMYAQS 337
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQI----------IDVKMAIEYMSF 227
A ENN++ + G+ + + G +L ++ + +K+ I VK +
Sbjct: 338 LATNENNLIAVTGYQDEESPGKDLLKIIETDDDADKKQDRTIKLGEREISVKCRVGKFGL 397
Query: 228 SAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGE 280
SAHAD I+ + P+ + LVHG + L ++I+++ N + P NGE
Sbjct: 398 SAHADKMEILNIANNLYPRRIFLVHGNPEAINSLGKEIQKDINGFIYAPRNGE 450
>gi|212223311|ref|YP_002306547.1| cleavage and polyadenylation specficity factor subunit-like protein
[Thermococcus onnurineus NA1]
gi|212008268|gb|ACJ15650.1| cleavage and polyadenylation specficity factor subunit-like protein
[Thermococcus onnurineus NA1]
Length = 648
Score = 127 bits (320), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 87/297 (29%), Positives = 147/297 (49%), Gaps = 21/297 (7%)
Query: 1 MFQVKVGN--QSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRE-- 56
+ + +GN +I TGD+ P R A R + L+ ESTY + D + RE
Sbjct: 347 IVHLHIGNGLHNIAITGDFKFIPTRLFEPAVSRFPRLETLVMESTYGGS-NDHQMPREEA 405
Query: 57 -RDFLKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATN 115
+ ++ +H+ + R GKVLIP A+GRAQE+ ++LE Y ++ PIY G+ +AT
Sbjct: 406 EKRLIEVIHQTIRRKGKVLIPAMAVGRAQEIMMVLEEYARVGGIEVPIYLD-GMIWEATA 464
Query: 116 YYKMFITWTNQKIRKTFVQR--NMFDFKHIRPFDKS-----FIDNPGPMVVFATPGMLHS 168
+ + + ++ +R+ N F +P S ID+ P ++ AT GML
Sbjct: 465 IHTAYPEYLSRHLREQIFHEGYNPFLNPIFKPVANSRERQDIIDSGEPAIIIATSGMLVG 524
Query: 169 GLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKL----EFENKQIIDVKMAIEY 224
G S+ FK+ AP N +I + +GT+G +V G++++ E +++ V M +
Sbjct: 525 GPSVEYFKQLAPDPKNSMIFVSYQAEGTLGRQVQRGLREIPLVGEGGKTEVVKVNMEVHT 584
Query: 225 M-SFSAHADAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPAN 278
+ FS HAD + ++ I P+ ++ VHGE K + L I ++F + P N
Sbjct: 585 IDGFSGHADRRELISYIARLRPRPERIITVHGEPHKCLDLSTSIHKKFGISTRAPNN 641
>gi|150401471|ref|YP_001325237.1| beta-lactamase domain-containing protein [Methanococcus aeolicus
Nankai-3]
gi|150014174|gb|ABR56625.1| beta-lactamase domain protein [Methanococcus aeolicus Nankai-3]
Length = 426
Score = 127 bits (320), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 88/281 (31%), Positives = 143/281 (50%), Gaps = 11/281 (3%)
Query: 6 VGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYA--TTIRDSKRCRERDFLKKV 63
+G + ++YTGD N R L +A D D LI ESTY T I+ S++ E+ + ++
Sbjct: 144 IGGKKLLYTGDINEIETRTLKSADTDIDEIDTLIIESTYGAPTDIKPSRKALEKQLIDEI 203
Query: 64 HECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQ-APIYFAVGLTEKATNYYKMFIT 122
+++GGKV+IPVFA+GR+QE+ +L Y L+ PIY L T Y
Sbjct: 204 SATIEKGGKVIIPVFAVGRSQEIIAVLHNYMRSKLLKRVPIYVEGSLVH-TTGIYMSHSE 262
Query: 123 WTNQKIRKTFVQR-NMF-DFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
W N KIR R N F D + + S I N P ++ +T GM+ G ++ + K
Sbjct: 263 WLNPKIRNNLENRINPFGDVTIVNKENSSEIYNKEPCIIISTSGMVQGG-PILQYLKLLK 321
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKL-EFENKQIIDVKMAIEYMSFSAHADAKGIMQL 239
N +I+ GF + T+G ++ G +++ F NK I V+ + + FSAH+D +++
Sbjct: 322 SPKNKIILTGFQAEETLGRQLEDGAEEITPFRNK--IPVRGEVVKVEFSAHSDYNSLIRY 379
Query: 240 IQYC-EPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANG 279
++ +PK V ++HGE + + I + F + P G
Sbjct: 380 MKKIPKPKKVFVMHGERYQSLSFAMTIWKNFKIPAIAPTAG 420
>gi|304317834|ref|YP_003852979.1| RNA-metabolising metallo-beta-lactamase [Thermoanaerobacterium
thermosaccharolyticum DSM 571]
gi|302779336|gb|ADL69895.1| RNA-metabolising metallo-beta-lactamase [Thermoanaerobacterium
thermosaccharolyticum DSM 571]
Length = 820
Score = 127 bits (319), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 83/287 (28%), Positives = 144/287 (50%), Gaps = 15/287 (5%)
Query: 6 VGNQ-SIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFLKKVH 64
VGN+ S Y+GD++ + A I K RPD+ ESTY + ++ E ++K+
Sbjct: 159 VGNEGSFFYSGDFSRFRQNTIEGASIPKLRPDVAFFESTYGDKLHANRELEESRLVEKIG 218
Query: 65 ECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMFITWT 124
V GGKVLIP FALGRAQE+ ++L+ + + +Y G+ + YK+ +
Sbjct: 219 SVVKNGGKVLIPAFALGRAQEIILILKKAINKGMINTKVYVD-GMVKDICRIYKLNPNYL 277
Query: 125 NQKI-RKTFVQRNMFDFKHIRPFD-----KSFIDNPGPMVVFATPGMLHSGLSLIIFKKW 178
+ + +K F +F ++ P D + I P V+ ++ GML G S +K
Sbjct: 278 RENLAKKIFKGGEIFFDDNVMPVDIPEMREEIIKE--PCVIVSSSGMLTGGPSQWYAEKL 335
Query: 179 APVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQI-----IDVKMAIEYMSFSAHADA 233
A E N++ + G+ + + G ++L + + + K I VK A++ SAHAD
Sbjct: 336 AGDEKNLIAITGYQDEESPGRRLLELTDEKDDDKKLKLGDKEIQVKCAVDKFGLSAHADM 395
Query: 234 KGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGE 280
I+ + + PK V L+HG+ + FL +++++ D ++P NG+
Sbjct: 396 SEILSFVNFLYPKKVFLIHGDPDVIDFLGREVQRDIRTDVYVPQNGD 442
>gi|374628280|ref|ZP_09700665.1| KH-domain/beta-lactamase-domain protein [Methanoplanus limicola DSM
2279]
gi|373906393|gb|EHQ34497.1| KH-domain/beta-lactamase-domain protein [Methanoplanus limicola DSM
2279]
Length = 630
Score = 127 bits (319), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 81/292 (27%), Positives = 140/292 (47%), Gaps = 12/292 (4%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATT--IRDSKRCRERDF 59
F V G +I +TGD+N R G A R + + ESTY + ++ S++ E
Sbjct: 335 FHVGDGLYNIAFTGDFNYAKSRLFGPATSQFPRLEAVFMESTYGGSNDMQPSRKDAEEKL 394
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKM 119
+ V++ V+RGGK++IP FA+GR+QE+ + LE + + G+ ++AT +
Sbjct: 395 YEAVNDTVNRGGKIIIPAFAVGRSQEVMLALEEGMRLEKIPKVKVYLDGMIKEATAIHTT 454
Query: 120 FITWTNQKIRKTFVQRNMFDF--KHIRPFDKS-----FIDNPGPMVVFATPGMLHSGLSL 172
+ + N +RK Q M F D S I + P V+ T GML+ G +
Sbjct: 455 YPEYLNPDLRKLIFQDGMNPFLADCFEQVDSSSKRGDIIGSGDPCVIITTSGMLNGGPVM 514
Query: 173 IIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYM-SFSAHA 231
A E NMLI G+ GT G ++ G K++ ++ I++ + I+ + FS H+
Sbjct: 515 EYLYGLAQDEKNMLIFVGYQADGTTGRRIQKGWKEIPLGRRETININLEIKTVDGFSGHS 574
Query: 232 DAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGES 281
D K +M + + P+ + +HG+ + L I + + ++ P N E+
Sbjct: 575 DRKQLMAYVNHLNPRPEKIFTIHGDEKSAIDLASSIYKRYKIETHAPRNLET 626
>gi|390933363|ref|YP_006390868.1| RNA-metabolising metallo-beta-lactamase [Thermoanaerobacterium
saccharolyticum JW/SL-YS485]
gi|389568864|gb|AFK85269.1| RNA-metabolising metallo-beta-lactamase [Thermoanaerobacterium
saccharolyticum JW/SL-YS485]
Length = 820
Score = 127 bits (319), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 83/285 (29%), Positives = 143/285 (50%), Gaps = 11/285 (3%)
Query: 6 VGNQ-SIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFLKKVH 64
VGN S Y+GD++ + A I K RPD+ ESTY + ++ E ++K+
Sbjct: 159 VGNGGSFFYSGDFSRFRQNTIEGASIPKLRPDVAFFESTYGDKLHANRELEESRLVEKIG 218
Query: 65 ECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMFITWT 124
V GGKVLIP FALGRAQE+ ++L+ + + +Y G+ + YK+ +
Sbjct: 219 SVVKNGGKVLIPAFALGRAQEIILILKKAINKGMINTKVYVD-GMVKDICRIYKLNPNYL 277
Query: 125 NQKI-RKTFVQRNMFDFKHIRPFDKSFIDN---PGPMVVFATPGMLHSGLSLIIFKKWAP 180
+ + +K F +F ++ P D + P V+ ++ GML G S +K A
Sbjct: 278 RENLAKKIFKGGEIFFDGNVMPVDMPEMREEIIKEPCVIVSSSGMLTGGPSQWYAEKLAG 337
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQI-----IDVKMAIEYMSFSAHADAKG 235
E N++ + G+ + + G ++L + + + K I VK A++ SAHAD
Sbjct: 338 DEKNLIAITGYQDEESPGRRLLELTDEKDDDKKLKLGDKEIPVKCAVDKFGLSAHADMSE 397
Query: 236 IMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGE 280
I+ L+ + PK V L+HG+ + FL +++++ D ++P NG+
Sbjct: 398 ILSLVNFLYPKKVFLIHGDPDVIDFLGREVQRDIRTDVYVPQNGD 442
>gi|150400487|ref|YP_001324253.1| beta-lactamase domain-containing protein [Methanococcus aeolicus
Nankai-3]
gi|150013190|gb|ABR55641.1| beta-lactamase domain protein [Methanococcus aeolicus Nankai-3]
Length = 635
Score = 127 bits (318), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 91/291 (31%), Positives = 146/291 (50%), Gaps = 23/291 (7%)
Query: 7 GNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYAT--TIRDSKRCRERDFLKKVH 64
G +I YTGD R L A + R + LI ESTY + K E++FLK +
Sbjct: 342 GLYNIAYTGDIKFEASRLLEPAVCNFPRLETLIIESTYGAYDDVLPDKGETEQNFLKVIA 401
Query: 65 ECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMFITWT 124
E + R GK++IPVF +GRAQEL ++LE + + AP+Y G+ +AT + + +
Sbjct: 402 ETIKRKGKIIIPVFGIGRAQELMLVLEEGYNQGIFNAPVYLD-GMIWEATAIHTAYPEYL 460
Query: 125 NQKIRKTFVQR--NMF---DFKHIRPFD--KSFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ +R N F FK +R + ++ ID+ P ++ T GML G S+ FK
Sbjct: 461 SKNMRNRIFHEGDNPFLSEVFKKVRNTNNRRNIIDSDEPCIILTTSGMLSGGPSVEYFKS 520
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENK----QIIDVKMAIE-YMSFSAHAD 232
A E N ++ G+ +GT+G K+ G ++ N+ + + V +++ + FS H+D
Sbjct: 521 LAHDERNAIVFVGYQSEGTMGRKIQRGWNEIPVMNRNGKSRAVKVNLSVHTFEGFSGHSD 580
Query: 233 AKGIMQLIQYCE-----PKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPAN 278
+ QLI+Y P +L VHGE+SK + L + F + P N
Sbjct: 581 RR---QLIKYLRKLKPMPDRILTVHGESSKCIDLASAAYKLFKKETRAPMN 628
>gi|296109715|ref|YP_003616664.1| RNA-metabolising metallo-beta-lactamase [methanocaldococcus
infernus ME]
gi|295434529|gb|ADG13700.1| RNA-metabolising metallo-beta-lactamase [Methanocaldococcus
infernus ME]
Length = 419
Score = 127 bits (318), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 86/281 (30%), Positives = 156/281 (55%), Gaps = 10/281 (3%)
Query: 4 VKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATT--IRDSKRCRERDFLK 61
+++ + I+YTGD N + R + AA D D+LI E+TY + I+ +++ ER+ +
Sbjct: 139 LEIDGKRILYTGDINESMQRTVRAADTDIDDIDILIIEATYGSPLDIKPARKVLERELVN 198
Query: 62 KVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMFI 121
+V E ++RGGKV+IPVFA+GRAQE+ ++L Y L+A I F G AT+ Y +
Sbjct: 199 EVKETIERGGKVIIPVFAVGRAQEILLVLNNYIRSGELEAKI-FTDGSLIHATSIYLSYT 257
Query: 122 TWTNQKIRKTFVQRNMFDFKHIRPFDKSFI-DNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
W N K+ K ++ + F ++ D+S I P V+ +T GM+ G ++ + +
Sbjct: 258 DWLNPKL-KNLIETGINPFGNVEKADESRIFKKDEPCVIVSTSGMVQGG-PVLKYLRLLK 315
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLE-FENKQIIDVKMAIEYMSFSAHADAKGIMQL 239
N LI+ G+ +GT+G + G+K+++ F+NK I ++ + + FSAH D +++
Sbjct: 316 DPKNKLILTGYQAEGTLGRALEEGIKEIKPFKNK--IPIRGEVVKIEFSAHGDYNSLVRY 373
Query: 240 IQYC-EPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANG 279
I+ +P+ +++HGE + + I + + ++P G
Sbjct: 374 IKKIPKPEKAIVMHGERYQSLAFAMTIWKNLKIPAYVPIKG 414
>gi|438002123|ref|YP_007271866.1| Metallo-beta-lactamase family protein, RNA-specific
[Tepidanaerobacter acetatoxydans Re1]
gi|432178917|emb|CCP25890.1| Metallo-beta-lactamase family protein, RNA-specific
[Tepidanaerobacter acetatoxydans Re1]
Length = 469
Score = 127 bits (318), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 84/293 (28%), Positives = 148/293 (50%), Gaps = 19/293 (6%)
Query: 6 VGNQ-SIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFLKKVH 64
VG++ S+ Y+GD + + A + K RPD+ I ESTY + +++ E+ ++ ++
Sbjct: 159 VGSEGSLFYSGDISGFRQNTIEGATVPKLRPDVAIFESTYGDKLHANRQVEEQRLIESIN 218
Query: 65 ECVDRGGKVLIPVFALGRAQELCILLETYWERMN-LQAPIYFAVGLTEKATNYYKMFITW 123
E + GGKVL+P FALGRAQE+ ++L+ R L PIY G+ + YK+ +
Sbjct: 219 EVIINGGKVLVPAFALGRAQEVILILKKAMNRGQLLSCPIYVD-GMVKDICRIYKLNPNY 277
Query: 124 TNQKI-RKTFVQRNMFDFKHIRP-----FDKSFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
+ +K F ++F ++ P F K +++ P V+ A+ GML G S + +
Sbjct: 278 LRSNLAKKVFRGVDIFYDDNVSPVANPEFRKEILESKNPCVIIASSGMLTGGPSQMYAQS 337
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQI----------IDVKMAIEYMSF 227
A ENN++ + G+ + + G +L ++ + +K+ I VK +
Sbjct: 338 LATNENNLIAVTGYQDEESPGKDLLKIIETDDDADKKQDRTIKLGEREISVKCRVGKFGL 397
Query: 228 SAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGE 280
SAHAD I+ + P+ + LVHG + L ++I+++ N + P NGE
Sbjct: 398 SAHADKMEILNIANNLYPRRIFLVHGNPEAINSLGKEIQKDINGFIYAPRNGE 450
>gi|428304260|ref|YP_007141085.1| beta-lactamase [Crinalium epipsammum PCC 9333]
gi|428245795|gb|AFZ11575.1| beta-lactamase domain protein [Crinalium epipsammum PCC 9333]
Length = 782
Score = 127 bits (318), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 88/291 (30%), Positives = 139/291 (47%), Gaps = 20/291 (6%)
Query: 4 VKVGNQSIVYTGDYNMTPDRHL-GAAWIDKCRPDLLITESTYATTIRDSKRCRERDFLKK 62
++ G++S++YTGDYN T R G D + D+LITESTY S++ +E + L+
Sbjct: 348 IRYGDRSLLYTGDYNTTNSRTTDGLRLADLPQADILITESTYGADTHPSRKAQETELLQA 407
Query: 63 VHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMFIT 122
V E V GG VLIP FALGRAQE+ + + T L+ P+Y GL T+ ++ I
Sbjct: 408 VAEVVSHGGNVLIPAFALGRAQEIILAIRTSALFHKLKIPVYID-GLVRAVTDVFRDHID 466
Query: 123 WTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPG-----------PMVVFATPGMLHSGLS 171
+ ++ +Q+ F P K I G P V+ A+ GML G S
Sbjct: 467 FLPGSVQNFALQQEPF----FDPDGKPPIIPIGHPRERPLAIAKPSVIIASSGMLSGGAS 522
Query: 172 LIIFKKWAPVENNMLIMPGFCVQGTIGH--KVLSGVKKLEFENKQIIDVKMAIEYMSFSA 229
+ + +N + + G+ + + G + L ++E E K++ V + + SA
Sbjct: 523 VYYGQALLERDNAAIFISGYTDEESPGRFLQALQKGDEVEIEGKKLT-VHATVRRFNLSA 581
Query: 230 HADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGE 280
HAD G+ Q+I PK+++L+HG + L +PA GE
Sbjct: 582 HADKVGLTQVIHRVNPKHLILIHGSMDALHELSHTGDLRDKYFIHIPAVGE 632
>gi|134045919|ref|YP_001097405.1| beta-lactamase domain-containing protein [Methanococcus maripaludis
C5]
gi|132663544|gb|ABO35190.1| beta-lactamase domain protein [Methanococcus maripaludis C5]
Length = 635
Score = 127 bits (318), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 92/298 (30%), Positives = 147/298 (49%), Gaps = 23/298 (7%)
Query: 3 QVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYAT--TIRDSKRCRERDFL 60
V G ++ YTGD R L A R + LI ESTY + + E++FL
Sbjct: 338 HVGEGLYNVAYTGDIKFEASRLLEPAVCQFPRLETLIIESTYGGYDDVLPERDETEKEFL 397
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
+ + E + R GKV+IPVF +GRAQEL ++LE + + AP+Y G+ +AT + +
Sbjct: 398 RVIAETIARKGKVIIPVFGIGRAQELMLVLEEGYNQGMFNAPVYLD-GMIWEATAIHTAY 456
Query: 121 ITWTNQKIRKTFVQR--NMF---DFKHIRPFD--KSFIDNPGPMVVFATPGMLHSGLSLI 173
+ ++ +R N F FK ++ + ++ ID+ P ++ T GML G S+
Sbjct: 457 PEYLSKNMRNRIFHEGDNPFLSEVFKKVKNTNDRRNIIDSDEPSIILTTSGMLSGGPSVE 516
Query: 174 IFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENK----QIIDVKMAIEYM-SFS 228
FK A E N ++ G+ +GT+G K+ G K++ K + + V +++ + FS
Sbjct: 517 YFKNLADNEKNAIVFVGYQSEGTLGRKIQKGFKEIPLMGKNGRSKAVKVNLSVHTLEGFS 576
Query: 229 AHADAKGIMQLIQYCE-----PKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGES 281
H+D K QLI+Y P +L VHGE SK + L + F + P N +S
Sbjct: 577 GHSDRK---QLIKYLRKLKPIPDRILTVHGEVSKCIDLASTAYKLFKKETKAPMNLDS 631
>gi|298674377|ref|YP_003726127.1| KH-domain/beta-lactamase-domain-containing protein [Methanohalobium
evestigatum Z-7303]
gi|298287365|gb|ADI73331.1| KH-domain/beta-lactamase-domain protein [Methanohalobium
evestigatum Z-7303]
Length = 638
Score = 127 bits (318), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 84/271 (30%), Positives = 141/271 (52%), Gaps = 19/271 (7%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATT--IRDSKRCRERDF 59
F + G ++V+TGDY R A R + ++TESTY ++ ++ + + E+
Sbjct: 338 FHIGDGLHNVVFTGDYKYGSTRLFDPAVNKFPRVETVVTESTYGSSNAVQPTLKDAEKHL 397
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQ-APIYFAVGLTEKATNYYK 118
+ + + + G VLIP FA+GR+QE+ I+LE + + PIY G+ +AT +
Sbjct: 398 QQVIINTLQKDGIVLIPAFAVGRSQEVMIVLEDAIRKGLIDDVPIYLD-GMIWEATAIHA 456
Query: 119 MFITWTNQKIRKTFVQR--NMFDFKHIRPFD-----KSFIDNPGPMVVFATPGMLHSGLS 171
+ + N ++RK+ Q+ N F +P D + ID P P V+ AT GM+++G
Sbjct: 457 TYPEYLNNELRKSIFQKGQNPFLSDSFKPVDSNELRQKIIDEPHPCVILATAGMMNAGPI 516
Query: 172 LIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENK-----QIIDVKMAIEYM- 225
L FK +A ENN L+ G+ GT+G ++ G K++ + Q I+V M +E +
Sbjct: 517 LEYFKAFASNENNTLVFVGYQADGTLGRRIQKGWKEIPMSSSVNGGTQTINVNMNVEVVD 576
Query: 226 SFSAHADAKGIMQLIQYCEPK--NVLLVHGE 254
FS H+D K +M ++ +P+ V HGE
Sbjct: 577 GFSGHSDRKQLMDFVKKMKPRPERVYTEHGE 607
>gi|294495430|ref|YP_003541923.1| KH-domain/beta-lactamase-domain protein [Methanohalophilus mahii
DSM 5219]
gi|292666429|gb|ADE36278.1| KH-domain/beta-lactamase-domain protein [Methanohalophilus mahii
DSM 5219]
Length = 637
Score = 126 bits (317), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 81/283 (28%), Positives = 146/283 (51%), Gaps = 18/283 (6%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATT--IRDSKRCRERDF 59
F + G ++V TGD+ P R A R + +ITESTY + ++ + + E++
Sbjct: 338 FHIGDGLHNVVITGDFKYGPTRLFNPAVNKFPRVETVITESTYGASNSMQPALKDAEKNL 397
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNL-QAPIYFAVGLTEKATNYYK 118
+ + E +DR G VLIP FA+GR+QE+ I++E + + + P+Y G+ +AT +
Sbjct: 398 QRIIKETIDRQGVVLIPAFAVGRSQEVMIVIEEAIRKGIIDEVPVYLD-GMIWEATAIHA 456
Query: 119 MFITWTNQKIRKTFVQR--NMFDFKHIRPFD-----KSFIDNPGPMVVFATPGMLHSGLS 171
+ + N +R+ Q+ N F + +P D ++ I+NP P V+ +T GM+ G
Sbjct: 457 TYPEYLNNDLRRLIFQKGQNPFLSECFKPVDSNELRRNIIENPHPCVILSTSGMMSGGPV 516
Query: 172 LIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEF----ENKQIIDVKMAIEYM-S 226
+ FK +AP E N L+ G+ GT+G ++ G K++ + + + + M +E +
Sbjct: 517 MEYFKAFAPNERNTLVFVGYQADGTLGRRIQKGWKEIPLSSGGKGSETVLMNMDVEIVDG 576
Query: 227 FSAHADAKGIMQLIQYC--EPKNVLLVHGEASKMVFLKEKIKQ 267
FS H+D K +M+ + +P+ V HG+ + L I +
Sbjct: 577 FSGHSDRKQLMEFFKRMKPQPERVFTEHGDERSCIDLASSIHK 619
>gi|300867486|ref|ZP_07112137.1| hypothetical protein OSCI_3400008 [Oscillatoria sp. PCC 6506]
gi|300334480|emb|CBN57305.1| hypothetical protein OSCI_3400008 [Oscillatoria sp. PCC 6506]
Length = 871
Score = 126 bits (316), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 85/261 (32%), Positives = 128/261 (49%), Gaps = 18/261 (6%)
Query: 7 GNQSIVYTGDYNMTPDRHL-GAAWIDKCRPDLLITESTYATTIRDSKRCRERDFLKKVHE 65
G +S++YTGDYN T R G D + D+LITESTY ++ +E + + E
Sbjct: 456 GERSLIYTGDYNTTSSRTTTGLRLADLPQADILITESTYGGDTHPGRKTQETALIDAIAE 515
Query: 66 CVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMFITWTN 125
V GG VLIP FALGRAQE+ + + T L+ P+Y GL T ++ +
Sbjct: 516 VVKSGGNVLIPAFALGRAQEILLAIRTSTTFHTLKVPVYVD-GLVRSVTETFRDNLDLLP 574
Query: 126 QKIRKTFVQRN---MFDFKHIRPFDKSFIDNP--------GPMVVFATPGMLHSGLSLII 174
++ Q N FD + P I NP P V+ A+ GML G S+
Sbjct: 575 DSVKNFVKQCNREPFFDLNGMPPIIP--IGNPKERPLAMAKPSVIVASSGMLSGGASVYY 632
Query: 175 FKKWAPVENNMLIMPGFCVQGTIGHKV--LSGVKKLEFENKQIIDVKMAIEYMSFSAHAD 232
K EN + + G+ + + G + L+ ++E + K+I VK I+ + SAHAD
Sbjct: 633 AKTLLERENAAIFISGYTDEESPGRLLQNLAVGDEIELDGKKIT-VKAQIKRFNLSAHAD 691
Query: 233 AKGIMQLIQYCEPKNVLLVHG 253
G+ Q+I PK+++L+HG
Sbjct: 692 KIGLTQVINKVNPKHLILIHG 712
>gi|242398880|ref|YP_002994304.1| Cleavage and polyadenylation specficity factor subunit like protein
[Thermococcus sibiricus MM 739]
gi|242265273|gb|ACS89955.1| Cleavage and polyadenylation specficity factor subunit like protein
[Thermococcus sibiricus MM 739]
Length = 648
Score = 126 bits (316), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 90/292 (30%), Positives = 140/292 (47%), Gaps = 21/292 (7%)
Query: 6 VGN--QSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRE---RDFL 60
VGN +I TGD+ P R A R + LI ESTY + RD + RE + +
Sbjct: 352 VGNGLHNIAVTGDFKFAPTRLFEPANSKFPRLETLIMESTYGGS-RDYQMPREEAEKKLI 410
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
+ + + R GKVLIP A+GR+QE+ I LE Y + PIY G+ +AT + +
Sbjct: 411 ETIMHTIKRKGKVLIPAMAVGRSQEIMIALEDYARVGGFEVPIYLD-GMIWEATAIHTAY 469
Query: 121 ITWTNQKIRKTFVQR--NMFDFKHIRPF-----DKSFIDNPGPMVVFATPGMLHSGLSLI 173
+ ++ +R N F + +P K I++ P ++ A+ GML G S+
Sbjct: 470 PEYLSKNLRNQIFHEGYNPFLNEIFKPVANANERKDIIESEEPAIIIASSGMLVGGPSVE 529
Query: 174 IFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKL----EFENKQIIDVKMAIEYM-SFS 228
FK AP N L+ + +GT+G +V G++++ E + I + M I + FS
Sbjct: 530 YFKHLAPDPRNSLVFVSYQAEGTLGRQVQRGLREIPMIGEGGKTEAIQINMEIHTIDGFS 589
Query: 229 AHADAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPAN 278
HAD + +M I P+ V+ VHGE K + L I ++F + P N
Sbjct: 590 GHADRRELMNYISRVRPRPERVITVHGEPQKCLDLASSIHKKFGISTRAPNN 641
>gi|126458658|ref|YP_001054936.1| beta-lactamase domain-containing protein [Pyrobaculum calidifontis
JCM 11548]
gi|126248379|gb|ABO07470.1| beta-lactamase domain protein [Pyrobaculum calidifontis JCM 11548]
Length = 634
Score = 126 bits (316), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 95/315 (30%), Positives = 152/315 (48%), Gaps = 45/315 (14%)
Query: 1 MFQVKVGNQ--SIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATT--IRDSKRCRE 56
M + +GN +I+YTGD+ R L A R ++LI ESTY ++ + E
Sbjct: 325 MAHLHIGNGRYNILYTGDFKYGKTRLLNRAVNKFKRVEMLIMESTYGGRDDVQPPRVEAE 384
Query: 57 RDFLKKVHECVDRGGKVLIPVFALGRAQELCILLETYWE-RMNLQAPIYFAVGLTEKATN 115
K V E V +GGKVLIP F+ GR QE+ +L E + + PIY G+ + N
Sbjct: 385 NALAKHVAETVAKGGKVLIPAFSTGRGQEILYILNKMMEGGLIPRVPIYVD-GMIVETLN 443
Query: 116 YYKMFITWTNQKIRKTFVQRNMFDFKHIRPFDKS----FIDNPG---------------- 155
Y M+ + N ++ + + + PF S +D
Sbjct: 444 VYLMYPHYLNPEVAEEI-------YSGVNPFTTSGAVVIVDRAKRVEDRINQVAKIAQSE 496
Query: 156 -PMVVFATPGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQ 214
P V+ A GML+ G L F + AP E N LI + +GT+G ++L+G + EF +
Sbjct: 497 EPAVIIAPHGMLNGGPILDYFAQLAPDERNKLIFVSYQAEGTLGRRILNGER--EFTVRS 554
Query: 215 I----IDVKMAIEYMS---FSAHADAKGIMQLIQYCE--PKNVLLVHGEASKMVFLKEKI 265
+ I V+M +E S FS H+D + +M+ +++ E PK ++L+HGE SK++ L I
Sbjct: 555 LTAGEIKVQMRMEAASIPGFSGHSDRRELMKYVEHIEPKPKKIVLIHGEPSKIISLATSI 614
Query: 266 KQEFNLDCFMPANGE 280
+ ++ + +P GE
Sbjct: 615 ELKYKITTVVPKVGE 629
>gi|315426611|dbj|BAJ48239.1| beta-lactamase domain protein [Candidatus Caldiarchaeum
subterraneum]
Length = 635
Score = 125 bits (315), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 91/299 (30%), Positives = 158/299 (52%), Gaps = 32/299 (10%)
Query: 7 GNQSIVYTGDYNMTPDRHLGAAWIDKC-----RPDLLITESTY-ATTIRDSKRCRERDFL 60
G +IVYTGD+ ++ + +D C R + LI ESTY AT + + E
Sbjct: 342 GLHNIVYTGDF-----KYERSTALDPCVSKFPRVETLIMESTYGATPVPYTLEQSEALLA 396
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILL-ETYWERMNLQAPIYFAVGLTEKATNYYKM 119
+K+ ++RGGKV+IPV A+GRAQE+ ++L + + E+ ++ P++ GL +AT +
Sbjct: 397 EKITATINRGGKVIIPVPAIGRAQEIMLVLNKLFTEKKLVETPVFLD-GLVIEATAIHTG 455
Query: 120 FITWTNQKIRKTFVQ-RNMFDFKHIRPFD-----KSFIDNPGPMVVFATPGMLHSGLSLI 173
F + ++++ + N+F ++ P + ++ PMVV +T GML G L
Sbjct: 456 FPDYFTAELQQRLREGENIFLSEYFTPVKGESQRQEILEMREPMVVISTSGMLEGGPVLK 515
Query: 174 IFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFEN----KQIIDVKMAIEYM-SFS 228
K + ENN+L+ + V+GT+G +L GV+++ N ++++VKM +E + FS
Sbjct: 516 YLKAFGGDENNLLLFVSYQVEGTLGRTLLKGVREVLLRNGEGKPEVLNVKMQVEKVDGFS 575
Query: 229 AHADAKGIMQLIQYC-----EPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESC 282
H+ + QLI Y +P+N++LVHGE + + L + + F P N E+
Sbjct: 576 GHSSRQ---QLINYVKRVMPKPRNIILVHGEENAVESLATTLAKITPASIFTPHNLETI 631
>gi|315426553|dbj|BAJ48183.1| beta-lactamase domain protein [Candidatus Caldiarchaeum
subterraneum]
gi|343485301|dbj|BAJ50955.1| beta-lactamase domain protein [Candidatus Caldiarchaeum
subterraneum]
Length = 635
Score = 125 bits (315), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 91/299 (30%), Positives = 158/299 (52%), Gaps = 32/299 (10%)
Query: 7 GNQSIVYTGDYNMTPDRHLGAAWIDKC-----RPDLLITESTY-ATTIRDSKRCRERDFL 60
G +IVYTGD+ ++ + +D C R + LI ESTY AT + + E
Sbjct: 342 GLHNIVYTGDF-----KYERSTALDPCVSKFPRVETLIMESTYGATPVPYTLEQSEALLA 396
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILL-ETYWERMNLQAPIYFAVGLTEKATNYYKM 119
+K+ ++RGGKV+IPV A+GRAQE+ ++L + + E+ ++ P++ GL +AT +
Sbjct: 397 EKITATINRGGKVIIPVPAIGRAQEIMLVLNKLFTEKKLVETPVFLD-GLVIEATAIHTG 455
Query: 120 FITWTNQKIRKTFVQ-RNMFDFKHIRPFD-----KSFIDNPGPMVVFATPGMLHSGLSLI 173
F + ++++ + N+F ++ P + ++ PMVV +T GML G L
Sbjct: 456 FPDYFTAELQQRLREGENIFLSEYFTPVKGESQRQEILEMREPMVVISTSGMLEGGPVLK 515
Query: 174 IFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFEN----KQIIDVKMAIEYM-SFS 228
K + ENN+L+ + V+GT+G +L GV+++ N ++++VKM +E + FS
Sbjct: 516 YLKAFGGDENNLLLFVSYQVEGTLGRTLLKGVREVLLRNGEGKPEVLNVKMQVEKVDGFS 575
Query: 229 AHADAKGIMQLIQYC-----EPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESC 282
H+ + QLI Y +P+N++LVHGE + + L + + F P N E+
Sbjct: 576 GHSSRQ---QLINYVKRVMPKPRNIILVHGEENAVESLATALTRITPASIFTPHNLETI 631
>gi|315425373|dbj|BAJ47039.1| mRNA 3'-end processing factor [Candidatus Caldiarchaeum
subterraneum]
Length = 638
Score = 125 bits (315), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 91/299 (30%), Positives = 158/299 (52%), Gaps = 32/299 (10%)
Query: 7 GNQSIVYTGDYNMTPDRHLGAAWIDKC-----RPDLLITESTY-ATTIRDSKRCRERDFL 60
G +IVYTGD+ ++ + +D C R + LI ESTY AT + + E
Sbjct: 345 GLHNIVYTGDF-----KYERSTALDPCVSKFPRVETLIMESTYGATPVPYTLEQSEALLA 399
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILL-ETYWERMNLQAPIYFAVGLTEKATNYYKM 119
+K+ ++RGGKV+IPV A+GRAQE+ ++L + + E+ ++ P++ GL +AT +
Sbjct: 400 EKITATINRGGKVIIPVPAIGRAQEIMLVLNKLFTEKKLVETPVFLD-GLVIEATAIHTG 458
Query: 120 FITWTNQKIRKTFVQ-RNMFDFKHIRPFD-----KSFIDNPGPMVVFATPGMLHSGLSLI 173
F + ++++ + N+F ++ P + ++ PMVV +T GML G L
Sbjct: 459 FPDYFTAELQQRLREGENIFLSEYFTPVKGESQRQEILEMREPMVVISTSGMLEGGPVLK 518
Query: 174 IFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFEN----KQIIDVKMAIEYM-SFS 228
K + ENN+L+ + V+GT+G +L GV+++ N ++++VKM +E + FS
Sbjct: 519 YLKAFGGDENNLLLFVSYQVEGTLGRTLLKGVREVLLRNGEGKPEVLNVKMQVEKVDGFS 578
Query: 229 AHADAKGIMQLIQYC-----EPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESC 282
H+ + QLI Y +P+N++LVHGE + + L + + F P N E+
Sbjct: 579 GHSSRQ---QLINYVKRVMPKPRNIILVHGEENAVESLATALTRITPASIFTPHNLETI 634
>gi|305662491|ref|YP_003858779.1| KH-domain/beta-lactamase-domain-containing protein [Ignisphaera
aggregans DSM 17230]
gi|304377060|gb|ADM26899.1| KH-domain/beta-lactamase-domain protein [Ignisphaera aggregans DSM
17230]
Length = 641
Score = 125 bits (315), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 81/275 (29%), Positives = 147/275 (53%), Gaps = 15/275 (5%)
Query: 7 GNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFLKKVHEC 66
G +IVYTGD+ R L A + R D +I ESTY T + ++ E + ++ V E
Sbjct: 343 GFHNIVYTGDFKFGRTRLLDRANYEFPRVDTIIMESTYGATEQPARDRAEDELIRIVKET 402
Query: 67 VDRGGKVLIPVFALGRAQELCILLETYWERMNL-QAPIYFAVGLTEKATNYYKMFITWTN 125
DRGGKVLIP ++GRAQE+ ++L ++ L + P+Y G+ + T + + +
Sbjct: 403 YDRGGKVLIPSLSVGRAQEVMLILADAMKQNKLPKIPVYIE-GMIHEVTAIHTAYPDLLS 461
Query: 126 QKIRKTFVQ-RNMFDFKHI-----RPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWA 179
+++ + N FD++ R +++ P ++ AT GML G ++ FK A
Sbjct: 462 KELGQRLKSGENPFDYETFIRLEGREPRTEIVESSEPAIIIATSGMLTGGPAVEYFKLMA 521
Query: 180 PVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENK----QIIDVKMAIEYM-SFSAHADAK 234
N+ L+ + V+GT+G ++ G +++ F N+ ++I +KM + + FS H+D
Sbjct: 522 SNPNHSLVFVSYQVEGTLGRRIRDGAREVAFVNERGKVEVIRIKMQVYSIEGFSGHSDRP 581
Query: 235 GIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQ 267
+++ +++ EPK +LVHGE + ++ L +I +
Sbjct: 582 QLLEYLRHIEPKPSTAILVHGEKNAVMNLANEISR 616
>gi|330508618|ref|YP_004385046.1| RNA-metabolising metallo-beta-lactamase [Methanosaeta concilii GP6]
gi|328929426|gb|AEB69228.1| RNA-metabolising metallo-beta-lactamase [Methanosaeta concilii GP6]
Length = 635
Score = 125 bits (315), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 83/295 (28%), Positives = 150/295 (50%), Gaps = 16/295 (5%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATT--IRDSKRCRERDF 59
F + G ++ +TGDY R A + R + L+TE+TY T I+ S++ E +
Sbjct: 338 FHIGDGLYNVAFTGDYKFERTRLFDPAVCNFPRLETLVTEATYGGTNSIQPSRKEAEVNL 397
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNL-QAPIYFAVGLTEKATNYYK 118
+K V E ++RGGKV+IP F++GR+QE+ ++LE + + + P+ G+ +AT +
Sbjct: 398 IKVVKETINRGGKVVIPAFSVGRSQEVMVVLEEAIRKKVIDEVPVCLD-GMIYEATAIHT 456
Query: 119 MFITWTNQKIRKTFVQRNMFDF------KHIRPFDKSFIDNPGPMVVFATPGMLHSGLSL 172
+ + N +R + + F + P ++ I P V+ AT GML+ G +
Sbjct: 457 TYPEYLNNDLRDLIFHKGINPFLAECFVQVESPKQRTEIVEGPPCVILATSGMLNGGPVI 516
Query: 173 IIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFE-NKQIIDVKMAIEYMS---FS 228
K+ P E N L++ G+ +GT+G ++ G K++ + + VK+ +E + FS
Sbjct: 517 EYLKRLGPDEKNTLVIVGYQAEGTLGRRIQKGWKEVPLSVDGKTQTVKINLEVTTVDGFS 576
Query: 229 AHADAKGIMQLIQ--YCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGES 281
H+D + +M+ ++ Y +P VL HGE + L I + + + P N E+
Sbjct: 577 GHSDRQQLMEYLRRIYPKPGRVLTNHGEEGNCLDLASSIYKRYRIPTMAPMNLET 631
>gi|359414339|ref|ZP_09206804.1| RNA-metabolising metallo-beta-lactamase [Clostridium sp. DL-VIII]
gi|357173223|gb|EHJ01398.1| RNA-metabolising metallo-beta-lactamase [Clostridium sp. DL-VIII]
Length = 863
Score = 125 bits (315), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 77/284 (27%), Positives = 141/284 (49%), Gaps = 14/284 (4%)
Query: 10 SIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFLKKVHECVDR 69
++ Y+GD+++ R + + + RPD+ I ESTY + ++ E+ + V+EC +
Sbjct: 178 ALFYSGDFSVFSQRTVEGTKLPRLRPDVAILESTYGDKLHSNREIEEKALIDIVNECSEN 237
Query: 70 GGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMFITWTNQKIR 129
GK++IP FALGRAQE+ ++L+ + +L+ + G+ YK + +
Sbjct: 238 NGKMIIPAFALGRAQEVILILKGAINKGSLKKIKIYVDGMVRDVNRVYKNNPLYLRNSLG 297
Query: 130 KT-------FVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAPVE 182
K F N+ + K + ++ P+V+ ++ GML G S +K AP+E
Sbjct: 298 KKVLRGVEPFYDENIIEVKQTDN-RQDILEQKEPVVIISSSGMLTGGPSTFYAEKIAPME 356
Query: 183 NNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQI------IDVKMAIEYMSFSAHADAKGI 236
N +++ G+ + + G KVL ++ E ++ + I VK ++ + SAH+D I
Sbjct: 357 NGYIVITGYQDEESPGRKVLELMETDEESDRYLNINGLNIPVKCKVKKIGLSAHSDKNEI 416
Query: 237 MQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGE 280
+I+ +N++LVHG + L ++I EF F P GE
Sbjct: 417 KGVIERISARNIILVHGNEEVVRSLGKEISNEFIGRTFTPGCGE 460
>gi|307353378|ref|YP_003894429.1| KH-domain/beta-lactamase-domain-containing protein [Methanoplanus
petrolearius DSM 11571]
gi|307156611|gb|ADN35991.1| KH-domain/beta-lactamase-domain protein [Methanoplanus petrolearius
DSM 11571]
Length = 629
Score = 125 bits (314), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 84/296 (28%), Positives = 138/296 (46%), Gaps = 21/296 (7%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTY--ATTIRDSKRCRERDF 59
F V G +I ++GD+N R G A R + + ESTY A I+ SK+ E
Sbjct: 335 FHVGDGLYNIAFSGDFNYGKTRLFGPATSQFPRLETIFVESTYGGANDIQPSKKDAEEKL 394
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKM 119
+ V+ + RGGKV+IP FA+GR+QE+ + LE + + G+ ++AT +
Sbjct: 395 YEAVNTTIKRGGKVIIPAFAVGRSQEVMLTLEEGMRLDKIPKVKVYLDGMIKEATAIHTT 454
Query: 120 FITWTNQKIRKTFVQRNMFDFKHIRPFDKSF--IDNPG---------PMVVFATPGMLHS 168
+ + N ++RK Q M F F +D+P P ++ T GML
Sbjct: 455 YPEYLNPELRKQIFQDGMNPF-----LSDCFVQVDSPAIRENVIGGEPSIIITTSGMLSG 509
Query: 169 GLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYM-SF 227
G + A E NMLI G+ GT+G ++ G K++ + I + + I+ + F
Sbjct: 510 GPVMEYLNGLAADEKNMLIFVGYQADGTMGRRIQKGWKEIPMGRRGSIVINLEIQTIDGF 569
Query: 228 SAHADAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGES 281
S H+D +M+ I + P+ V +HG+ + L I + F ++ P N E+
Sbjct: 570 SGHSDRNQLMRYINHLNPRPEKVFTIHGDEKSAIDLASSIYKRFRIETHAPRNLET 625
>gi|348688343|gb|EGZ28157.1| hypothetical protein PHYSODRAFT_309075 [Phytophthora sojae]
Length = 669
Score = 125 bits (314), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 70/290 (24%), Positives = 144/290 (49%), Gaps = 10/290 (3%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFLK 61
F +++G ++++ D+N+T R L A I + P++LIT S++A T+ +++ ER+ +K
Sbjct: 195 FYLEIGVSTVMFANDFNLTGGRVLLPAQIPRLEPNVLITSSSFAVTVSETQNAMERELVK 254
Query: 62 KVHECVDRGGKVLIPVFALGRAQELCILLETYWERM------NLQAPIYFAVGLTEKATN 115
+ EC+ GKV+IPV+ LG EL +L +W+++ + PI+ + E +
Sbjct: 255 VILECISSNGKVVIPVYRLGYFHELITILLEHWQQIKDVTGKSAGCPIFLSDADMEYPSR 314
Query: 116 YYKMFITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIF 175
+ + ++ T ++ ++ ++ FD + PGP V+F P + G SL
Sbjct: 315 FLPV-LSRTCTPAVQSLLRERPPSPTDLQVFDWKRLQQPGPFVLFTGPANISQGGSLRAI 373
Query: 176 KKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKG 235
K A N++++ +C GT+ + + + ++ + ++V+ + Y D K
Sbjct: 374 KAVASDPKNLIVLSEYCTPGTVNYLLYADPERKRVSKRLGVNVECGVHYQPCGDEVDTKS 433
Query: 236 IMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQ 285
I+QL+ P+ V+L + + F+K ++ + +D P G V+
Sbjct: 434 IVQLVSRVAPRQVILDYLVPDDLAFVKSHMQNQLKID---PTVGSDVLVK 480
>gi|289192183|ref|YP_003458124.1| KH-domain/beta-lactamase-domain protein [Methanocaldococcus sp.
FS406-22]
gi|288938633|gb|ADC69388.1| KH-domain/beta-lactamase-domain protein [Methanocaldococcus sp.
FS406-22]
Length = 634
Score = 125 bits (313), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 95/298 (31%), Positives = 153/298 (51%), Gaps = 23/298 (7%)
Query: 7 GNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYAT--TIRDSKRCRERDFLKKVH 64
G ++ YTGD R L A R + LI ESTY + + ER+ L+ V
Sbjct: 341 GLYNLAYTGDIKFETSRLLEPAVCQFPRLETLIIESTYGAYDDVLPEREEAERELLRVVS 400
Query: 65 ECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMFITWT 124
E D+GGKVLIPVF +GRAQEL ++LE + + AP+Y G+ +AT + + +
Sbjct: 401 ETTDKGGKVLIPVFGVGRAQELMLVLEEGYNQGIFNAPVYLD-GMIWEATAIHTAYPEYL 459
Query: 125 NQKIRKTFVQR--NMF---DFKHIRPFD--KSFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++++R+ N F FK + + + ID+ P V+ AT GML G S+ K
Sbjct: 460 SKEMRQKIFHEGDNPFLSEVFKRVGSTNERRRVIDSDEPCVILATSGMLTGGPSVEYLKH 519
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENK----QIIDVKMAIEYM-SFSAHAD 232
AP E N +I G+ +GT+G KV G K++ + + I + + + + FS H+D
Sbjct: 520 LAPDEKNAIIFVGYQAEGTLGRKVQRGWKEIPITTRNGKTKSIPINLQVYTIEGFSGHSD 579
Query: 233 AKGIMQLIQYCE-----PKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQ 285
K QLI+Y P+ +++VHGE SK + + +++ F ++P N ++ V+
Sbjct: 580 RK---QLIKYIRRLKPSPEKIIMVHGEESKCLDFADTVRRLFKKQTYVPMNLDAIRVK 634
>gi|146296048|ref|YP_001179819.1| beta-lactamase domain-containing protein [Caldicellulosiruptor
saccharolyticus DSM 8903]
gi|145409624|gb|ABP66628.1| beta-lactamase domain protein [Caldicellulosiruptor saccharolyticus
DSM 8903]
Length = 821
Score = 124 bits (312), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 80/293 (27%), Positives = 142/293 (48%), Gaps = 16/293 (5%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
M ++ S++YTGD++ + A I K RPD++I ESTY + ++ E
Sbjct: 152 MIFIQTQEGSVLYTGDFSTDKQLTVDKASIPKIRPDVVICESTYGDRLHTNRNYEEERLF 211
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
++E + +GGKVLIP FA+GRAQE+ ++L Y ++ ++ + F G+ + YK
Sbjct: 212 NSIYEFISKGGKVLIPAFAIGRAQEIILILRNYMKKKKVEFNV-FIDGMVREVIRVYKNN 270
Query: 121 ITWTNQKIRK-------TFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLI 173
T+ + + K F+ N+ D + + + + P V+ ++ GML G S+
Sbjct: 271 PTFLSSRYYKKVLKGEEIFLSDNI-DIITDKKQREEIMSSSDPCVIISSSGMLTGGPSVF 329
Query: 174 IFKKWAPVENNMLIMPGFCVQGTIGHKVL------SGVKKLEFENKQIIDVKMAIEYMSF 227
+K +N ++ + G+ + + G K+L +K+E K+ +VK +E
Sbjct: 330 YAEKLVENQNALIAITGYQDEESPGRKLLELTELPESERKIELNGKE-YEVKCKVEKYGL 388
Query: 228 SAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGE 280
SAHAD I+ + +PK V+ HG + L + +E +P NGE
Sbjct: 389 SAHADRDSILGFLAMLKPKTVVFAHGSEEAISQLSDMAIKEIESAVLIPQNGE 441
>gi|325958080|ref|YP_004289546.1| KH-domain/beta-lactamase-domain-containing protein
[Methanobacterium sp. AL-21]
gi|325329512|gb|ADZ08574.1| KH-domain/beta-lactamase-domain protein [Methanobacterium sp.
AL-21]
Length = 634
Score = 124 bits (312), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 86/293 (29%), Positives = 150/293 (51%), Gaps = 14/293 (4%)
Query: 7 GNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYAT--TIRDSKRCRERDFLKKVH 64
G + VYTGD+ R L A R + L+ ESTY ++ S+ E++ +K ++
Sbjct: 342 GQHNFVYTGDFKYERSRLLEPAVSKFPRIESLVMESTYGGHDDVQPSRNDAEKELVKTIY 401
Query: 65 ECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMFITWT 124
++RGGKVL+PVFA+GRAQEL I+L+ Y ++ F G+ +AT + +
Sbjct: 402 RTLERGGKVLMPVFAVGRAQELMIVLDEYIRHGMIEEVPVFIDGMIWEATAIHTARPEYL 461
Query: 125 NQKIRKTF--VQRNMF--DFKHI--RPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKW 178
++ +R + RN F D H ++ I P ++ +T GML G S+ FK+
Sbjct: 462 SKDLRDQIFHMGRNPFISDVFHKVNGGNERQEIVEGEPAIILSTSGMLTGGNSVEYFKQL 521
Query: 179 APVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENK---QIIDVKMAIEYM-SFSAHADAK 234
E N L+ G+ +G++G ++ G K++ + + + +VKM I+ + FS H+D +
Sbjct: 522 CGDEKNSLVFVGYQAEGSLGRRLQKGWKEIPLKEEGKTNVYNVKMHIKTIEGFSGHSDRR 581
Query: 235 GIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQ 285
+M ++ PK +L HG+ K + L I + + ++ P N E+ +Q
Sbjct: 582 QLMDYVRRLSPKPEKILFCHGDNYKTLDLASSIYRSYKIETKTPMNLETVRIQ 634
>gi|355571913|ref|ZP_09043121.1| KH-domain/beta-lactamase-domain protein [Methanolinea tarda NOBI-1]
gi|354825009|gb|EHF09244.1| KH-domain/beta-lactamase-domain protein [Methanolinea tarda NOBI-1]
Length = 629
Score = 124 bits (312), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 80/291 (27%), Positives = 140/291 (48%), Gaps = 11/291 (3%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATT--IRDSKRCRERDF 59
F + G +I +TGD+N + R A + R + LI ESTY + I+ ++ E
Sbjct: 335 FHIGDGLYNIAFTGDFNFSKSRLFNPATCNFPRLEALIMESTYGGSGDIQPTRAEAEEKL 394
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKM 119
+ V+ + RGGKV+IP FA+GR+QE+ + LE R + + G+ +AT +
Sbjct: 395 YETVNNTIRRGGKVIIPAFAVGRSQEVMLALEEGMRRQKIPTVKIYLDGMIREATAIHTT 454
Query: 120 FITWTNQKIRKTFVQR--NMFDFKHIRPFDKSFID----NPGPMVVFATPGMLHSGLSLI 173
+ + N ++R + N F P D S + P V+ T GML+ G +
Sbjct: 455 YPEYLNSELRTQIFKEDHNPFLADCFIPVDSSELREKVIRGDPCVIITTSGMLNGGPVME 514
Query: 174 IFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYM-SFSAHAD 232
A E N L+ G+ +GT+G ++ G +++ K I + + I + FS H+D
Sbjct: 515 YLSNLASDEKNALVFVGYQAEGTLGRRIQKGWREVPIGRKGTIVINLEIVTIDGFSGHSD 574
Query: 233 AKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGES 281
K +M I + +P+ + VHG+ + + L I + ++++ P N E+
Sbjct: 575 RKQLMSFIAHVQPRPEKIFTVHGDENNTIDLASSIYKRYHIETHSPMNLET 625
>gi|15920543|ref|NP_376212.1| cleavage and polyadenylation specificity factor large subunit
[Sulfolobus tokodaii str. 7]
gi|342306155|dbj|BAK54244.1| putative ribonuclease J [Sulfolobus tokodaii str. 7]
Length = 637
Score = 124 bits (312), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 77/274 (28%), Positives = 147/274 (53%), Gaps = 13/274 (4%)
Query: 7 GNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFLKKVHEC 66
G +IVYTGD+ + L A + R D +I E+TY ++++ E ++ ++
Sbjct: 342 GTHNIVYTGDFKYARTKLLDKANDEFPRVDTIIMETTYGDHEQENREEAEAKLIEIINRT 401
Query: 67 VDRGGKVLIPVFALGRAQELCILL-ETYWERMNLQAPIYFAVGLTEKATNYYKMFITWTN 125
+ RGG+VLIPV A+GR QE+ ++L + +++ + PIY GL E+ T + + +
Sbjct: 402 ISRGGRVLIPVLAVGRGQEIMLVLNDAMKKKLIPEVPIY-VTGLVEEITAIHNAYPEMLS 460
Query: 126 QKIRKTFVQRN----MFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAPV 181
+++R+ + ++ M +F H + I P ++ AT GML+ G ++ FK AP
Sbjct: 461 REVREAILYKDENPFMSEFFHRIEGYREDIAQGEPSIILATSGMLNGGPAVEFFKTMAPD 520
Query: 182 ENNMLIMPGFCVQGTIGHKVLSGVKKLEFENK----QIIDVKMAIEYM-SFSAHADAKGI 236
N +I + +GT+G KV G ++++ ++ + I +KM + + FS H+D + +
Sbjct: 521 SRNSIIFVSYQAEGTLGRKVSDGAQEIQIIDRDGRIENIQIKMEVTRVEGFSGHSDRRQL 580
Query: 237 MQLIQ--YCEPKNVLLVHGEASKMVFLKEKIKQE 268
+ ++ +PKN+LL HGE + + +++E
Sbjct: 581 LNFLKNLNVKPKNLLLNHGEPGSIESFRRLVERE 614
>gi|255513971|gb|EET90235.1| beta-lactamase domain protein [Candidatus Micrarchaeum acidiphilum
ARMAN-2]
Length = 453
Score = 124 bits (312), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 86/288 (29%), Positives = 140/288 (48%), Gaps = 16/288 (5%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYA--TTIRDSKRCRERD 58
M ++ G +S++YTGD N+ L A ++ D LI ESTY + S+ R
Sbjct: 172 MIRLSYGGKSLLYTGDMNLQSSLVLNGADMNGISADTLIMESTYGGDADVFGSRETRAAG 231
Query: 59 FLKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQ-APIYFAVGLTEKATNYY 117
+K ++E + +GGKV+IP F +GR+QE+ +LL ER LQ PIY G K +
Sbjct: 232 MVKSINETILQGGKVIIPSFGIGRSQEVLMLLHGAMERGQLQRTPIYVD-GAIGKIMKIH 290
Query: 118 KMFITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
+ + + + +R F+F R D++ I G +V T GM+ G + +K
Sbjct: 291 RQNLRYCRKDVRSAMQSDKNFEFVGKR--DRNRIAKGGSCIVVTTSGMMKGGPVIFYLRK 348
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
A NN LI+ G+ +GT G + G K++E + K ++ +++ IE AH+D M
Sbjct: 349 LAGDPNNKLIIVGYQAEGTPGRLIRDGAKEVELDGK-MLQIRLKIEQHRIMAHSDR---M 404
Query: 238 QLIQYCE----PKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGES 281
L + E + + LVHG+ K L + + +P GE+
Sbjct: 405 MLDRTPEMVRGVRKIFLVHGDIEKQRSLGTHMSPRYA--TIVPKIGET 450
>gi|15679214|ref|NP_276331.1| cleavage and polyadenylation specificity factor
[Methanothermobacter thermautotrophicus str. Delta H]
gi|333944120|pdb|2YCB|A Chain A, Structure Of The Archaeal Beta-Casp Protein With
N-Terminal Kh Domains From Methanothermobacter
Thermautotrophicus
gi|333944121|pdb|2YCB|B Chain B, Structure Of The Archaeal Beta-Casp Protein With
N-Terminal Kh Domains From Methanothermobacter
Thermautotrophicus
gi|2622312|gb|AAB85692.1| cleavage and polyadenylation specificity factor
[Methanothermobacter thermautotrophicus str. Delta H]
Length = 636
Score = 124 bits (312), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 87/306 (28%), Positives = 155/306 (50%), Gaps = 29/306 (9%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYAT--TIRDSKRCRERDF 59
+ G ++VYTGD+ R L AA R + L+ ESTY ++ S+ E++
Sbjct: 338 LHIGDGQHNMVYTGDFKYEQSRLLEAAANRFPRIETLVMESTYGGHEDVQPSRNRAEKEL 397
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNL-QAPIYFAVGLTEKATNYYK 118
+K ++ + RGGK+LIPVFA+GRAQEL I+LE Y + + P+Y G+ +A +
Sbjct: 398 VKTIYSTLRRGGKILIPVFAVGRAQELMIVLEEYIRTGIIDEVPVYID-GMIWEANAIHT 456
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPF------------DKSFIDNPGPMVVFATPGML 166
+ ++ +R +F H PF ++ I P ++ +T GML
Sbjct: 457 ARPEYLSKDLRD-----QIFHMGH-NPFISDIFHKVNGMDERREIVEGEPSIILSTSGML 510
Query: 167 HSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENK----QIIDVKMAI 222
G SL FK +N L+ G+ +G++G ++ G K++ +++ ++ +V+M I
Sbjct: 511 TGGNSLEYFKWLCEDPDNSLVFVGYQAEGSLGRRIQKGWKEIPLKDEDDKMRVYNVRMNI 570
Query: 223 EYM-SFSAHADAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPANG 279
+ + FS H+D + +M+ ++ PK +LL HG+ K + L I + + ++ P N
Sbjct: 571 KTIEGFSGHSDRRQLMEYVKRISPKPEKILLCHGDNYKTLDLASSIYRTYRIETKTPLNL 630
Query: 280 ESCFVQ 285
E+ +Q
Sbjct: 631 ETVRIQ 636
>gi|301117336|ref|XP_002906396.1| integrator complex subunit, putative [Phytophthora infestans T30-4]
gi|262107745|gb|EEY65797.1| integrator complex subunit, putative [Phytophthora infestans T30-4]
Length = 669
Score = 124 bits (312), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 70/290 (24%), Positives = 138/290 (47%), Gaps = 10/290 (3%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFLK 61
F +++G ++++ D+N+T R L A I + P +IT S++A T+ +++ ER+ +K
Sbjct: 195 FYLEIGATTVLFANDFNLTGGRVLLPAQIPRLEPTAMITRSSFAVTVSETQSAMERELVK 254
Query: 62 KVHECVDRGGKVLIPVFALGRAQELCILLETYWERM------NLQAPIYFAVGLTEKATN 115
+ EC+ GKV+IPV+ LG EL ++ +W+++ + PIY + E +
Sbjct: 255 AILECIASNGKVVIPVYRLGYFHELITIILEHWQQIKDATGKTGKCPIYLSHASMEYPSR 314
Query: 116 YYKMFITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIF 175
+ + + +N ++ FD + PGP V+F P + G SL
Sbjct: 315 FLPVLSRTCTSATQNLLRDKNPI-AADLQVFDWKNLQQPGPFVLFTGPANISQGDSLRAI 373
Query: 176 KKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKG 235
K A N++++ +C GT+ + + + ++ + ++V+ + Y D K
Sbjct: 374 KAVASDPKNLIVLSEYCTPGTVNYLLYADPERKRVSKRLGVNVECGVHYQPCGDEVDTKS 433
Query: 236 IMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQ 285
I+QL+ P+ V+L + + F+K ++ +D P G S V+
Sbjct: 434 IVQLVSRVAPRQVILDYTVPDDLEFVKTHVQNHLKMD---PDVGTSVLVK 480
>gi|159904583|ref|YP_001548245.1| beta-lactamase domain-containing protein [Methanococcus maripaludis
C6]
gi|159886076|gb|ABX01013.1| beta-lactamase domain protein [Methanococcus maripaludis C6]
Length = 635
Score = 124 bits (312), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 91/298 (30%), Positives = 146/298 (48%), Gaps = 23/298 (7%)
Query: 3 QVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYAT--TIRDSKRCRERDFL 60
V G ++ YTGD R L A R + LI ESTY + + E++FL
Sbjct: 338 HVGDGLYNVAYTGDIKFEASRLLEPAVCQFPRLETLIIESTYGGYDDVLPERDETEKEFL 397
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
+ + E + R GK +IPVF +GRAQEL ++LE + + AP+Y G+ +AT + +
Sbjct: 398 RVIAETIARKGKAIIPVFGIGRAQELMLVLEEGYNQGIFNAPVYLD-GMIWEATAIHTAY 456
Query: 121 ITWTNQKIRKTFVQR--NMF---DFKHIRPFD--KSFIDNPGPMVVFATPGMLHSGLSLI 173
+ ++ +R N F FK ++ + ++ ID+ P ++ T GML G S+
Sbjct: 457 PEYLSKNMRNRIFHEGDNPFLSEVFKKVKNTNDRRNIIDSDEPGIILTTSGMLSGGPSVE 516
Query: 174 IFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENK----QIIDVKMAIEYM-SFS 228
FK A E N ++ G+ +GT+G K+ G K++ K + + V +++ + FS
Sbjct: 517 YFKNLADDEKNAIVFVGYQSEGTLGRKIQKGFKEIPLMGKNGRSKAVKVNLSVHTLEGFS 576
Query: 229 AHADAKGIMQLIQYCE-----PKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGES 281
H+D K QLI+Y P +L VHGE SK + L + F + P N +S
Sbjct: 577 GHSDRK---QLIKYLRKLKPIPDRILTVHGEVSKCIDLASTAYKLFKKETKAPMNLDS 631
>gi|116753406|ref|YP_842524.1| beta-lactamase domain-containing protein [Methanosaeta thermophila
PT]
gi|116664857|gb|ABK13884.1| beta-lactamase domain protein [Methanosaeta thermophila PT]
Length = 635
Score = 124 bits (311), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 86/301 (28%), Positives = 150/301 (49%), Gaps = 28/301 (9%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTY--ATTIRDSKRCRERDF 59
F + G ++ +TGD+ R A R + L+ E+TY A +I+ S++ E
Sbjct: 338 FHIGDGLYNVAFTGDFKYEKTRLFDPAVNSFPRLETLVIEATYGGANSIQPSRKEAENHL 397
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNL-QAPIYFAVGLTEKATNYYK 118
L+ V E + RGGKV+IP FA+GR+QE+ ++LE + + + P+Y G+ +AT +
Sbjct: 398 LRVVRETIKRGGKVVIPAFAVGRSQEVMVVLEEAIRKGLIGEFPVYLD-GMIYEATAIHT 456
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPF------------DKSFIDNPGPMVVFATPGML 166
+ + N ++ R+M K I PF + I N P V+ AT GML
Sbjct: 457 SYPEYLNNEL------RDMIFHKGINPFLAECFVQVENSKQRDEIINGEPAVILATSGML 510
Query: 167 HSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFE-NKQIIDVKMAIEYM 225
+ G + K P E N L++ G+ +GT+G ++ G K++ + VKM +E +
Sbjct: 511 NGGPIMEYLKGLGPDEKNTLVIVGYQAEGTLGRRIQKGWKEIPLTVEGKTQTVKMNMEVV 570
Query: 226 S---FSAHADAKGIMQLIQ--YCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGE 280
+ FS H+D +M+ ++ Y +P ++ HG+ S + L I +++ + P N E
Sbjct: 571 TVDGFSGHSDRNQLMEYVRRVYPKPSRIITNHGDESNCLDLASSIYKKYRIPTSAPMNLE 630
Query: 281 S 281
+
Sbjct: 631 T 631
>gi|167043166|gb|ABZ07875.1| putative Metallo-beta-lactamase superfamily protein [uncultured
marine crenarchaeote HF4000_ANIW141J13]
Length = 643
Score = 124 bits (311), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 87/293 (29%), Positives = 145/293 (49%), Gaps = 20/293 (6%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATT--IRDSKRCRERDF 59
F + G + VYTGD L +A + R + L+ ESTY I+ ++ E F
Sbjct: 344 FHIGNGEHNFVYTGDIKYGKSMLLESANTNYPRVETLLIESTYGAKEDIQPDRQEVESAF 403
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNL-QAPIYFAVGLTEKATNYYK 118
+ V+ + GGKVLIP+ A+GRAQEL ++++ Y + L +AP+ F G+ ++AT ++
Sbjct: 404 IASVNSVIKEGGKVLIPIPAVGRAQELMLVIDQYMKSGGLVEAPV-FMEGMIQEATAIHE 462
Query: 119 MFITWTNQKIRKTFVQR--NMFD------FKHIRPFDKSFIDNPGPMVVFATPGMLHSGL 170
F + + ++K ++ N FD H D+ +N P ++ AT GML G
Sbjct: 463 AFPEYLVRDLKKKILETDDNPFDSEYFTNITHQDGRDEPLREN-SPCIIIATSGMLEGGP 521
Query: 171 SLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENK----QIIDVKMAIEYM- 225
L F+ AP N ++ + V GT+G +V+ G K++ K ++I + + E +
Sbjct: 522 VLEYFRNVAPHAKNKILFVSYQVNGTLGRRVMDGAKQVSILGKEGKIEVISINCSTERLD 581
Query: 226 SFSAHADAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMP 276
FS H+D +M + PK VL+ HGE K L I++ + + P
Sbjct: 582 GFSGHSDYNQLMSFVHRLRPKLRRVLVNHGERRKSENLSMSIRRMYRVSSHYP 634
>gi|150403634|ref|YP_001330928.1| beta-lactamase domain-containing protein [Methanococcus maripaludis
C7]
gi|150034664|gb|ABR66777.1| beta-lactamase domain protein [Methanococcus maripaludis C7]
Length = 635
Score = 124 bits (311), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 91/299 (30%), Positives = 145/299 (48%), Gaps = 31/299 (10%)
Query: 3 QVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYAT--TIRDSKRCRERDFL 60
V G ++ YTGD R L A R + LI ESTY + + E++FL
Sbjct: 338 HVGDGLYNVAYTGDIKFEASRLLEPAVCQFPRLETLIIESTYGGYDDVLPERDETEKEFL 397
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
+ + E + R GK +IPVF +GRAQEL ++LE + + AP+Y G+ +AT + +
Sbjct: 398 RVIAETIARKGKAIIPVFGIGRAQELMLVLEEGYNQGIFNAPVYLD-GMIWEATAIHTAY 456
Query: 121 ITWTNQKIRKTFVQR--NMF---DFKHIRPFD--KSFIDNPGPMVVFATPGMLHSGLSLI 173
+ ++ +R N F FK ++ + ++ ID+ P ++ T GML G S+
Sbjct: 457 PEYLSKNMRNRIFHEGDNPFLSEVFKKVKNTNDRRNIIDSDEPGIILTTSGMLSGGPSVE 516
Query: 174 IFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENK----QIIDVKMAIEYM-SFS 228
FK A E N ++ G+ +GT+G K+ G K++ K + + V +++ + FS
Sbjct: 517 YFKNLADDEKNAIVFVGYQSEGTLGRKIQKGFKEIPLMGKNGRSKAVKVNLSVHTLEGFS 576
Query: 229 AHADAKGIMQLIQYCE-----PKNVLLVHGEASKMV--------FLKEKIKQEFNLDCF 274
H+D K QLI+Y P +L VHGE SK + K++ K NLD
Sbjct: 577 GHSDRK---QLIKYLRKLKPIPDRILTVHGEVSKCIDLASTAYKLFKKETKAPMNLDAI 632
>gi|114107740|gb|AAI23069.1| LOC779563 protein [Xenopus (Silurana) tropicalis]
Length = 457
Score = 124 bits (311), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 88/228 (38%), Positives = 131/228 (57%), Gaps = 15/228 (6%)
Query: 804 ITHFVTPDSDISSILSQAYPQDSLHNVCIMGLHTCGDLSGTALRLFT-KSSLQCLVQVGC 862
+T +T ++++S I+ + L ++GLHTCGDL+ LR+FT K ++ + VGC
Sbjct: 214 LTCKITVETELSEII------EDLEESIMVGLHTCGDLAPNTLRIFTAKPEIKAVCSVGC 267
Query: 863 CYHLLEEEFIRSPFWKDVDQSLYEHGYGFPLSEHLRSRKFFLGRNVRMSGTQSPERVIDL 922
CYHLL EEF V +GFP+S HL+ +F GRN RMS + ERV
Sbjct: 268 CYHLLSEEFESPEEEITVTT------WGFPMSIHLKESSWFFGRNARMSACLALERVAVG 321
Query: 923 KQTQTLPLFYRALLEKYLRSKITINDEEPKVVGRLATKCSNFVEYVHRAVDKLKL-DLEV 981
+ T LFYRA+L+ L+ ++ + K VG++ +K NF EYV +++ KL D E+
Sbjct: 322 QGLATESLFYRAVLQVILKDCFSVTQSD-KRVGKIFSKSGNFTEYVRKSLKKLGHDDSEL 380
Query: 982 DDEEVTRLFNSHQREYEYLQIYYFLKTALAPVIEALIVLDRVLYLREQ 1029
D E+ F ++ ++ + LK LAP IEALI+LDR+LYL+EQ
Sbjct: 381 SDMEIMEYFEKYKPRKNEMEAFNMLKVLLAPCIEALILLDRLLYLKEQ 428
Score = 69.7 bits (169), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 31/73 (42%), Positives = 50/73 (68%)
Query: 664 QLMSEKKSYEVQVMSQVVAAVTNSCDSSHIIDLGGGQGYLSTILALQHGKKTLSLDYNQV 723
Q M++KK +EV +MSQ+V ++ N C +IDLG G+GYLS+ L++++ K +D +
Sbjct: 3 QFMNDKKLHEVLIMSQLVQSIANICGVKQVIDLGAGKGYLSSYLSMKYNLKVYGIDSSDT 62
Query: 724 NTHGAAVRSKKLE 736
NT GA R++KL+
Sbjct: 63 NTDGANRRNQKLK 75
>gi|261402298|ref|YP_003246522.1| KH-domain/beta-lactamase-domain-containing protein
[Methanocaldococcus vulcanius M7]
gi|261369291|gb|ACX72040.1| KH-domain/beta-lactamase-domain protein [Methanocaldococcus
vulcanius M7]
Length = 634
Score = 124 bits (311), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 95/291 (32%), Positives = 148/291 (50%), Gaps = 23/291 (7%)
Query: 7 GNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYAT--TIRDSKRCRERDFLKKVH 64
G ++ YTGD R L A R + LI ESTY + + ER+ LK V
Sbjct: 341 GLYNLAYTGDIKFETSRLLEPAVCQFPRLETLIIESTYGAYDDVLPEREEAERELLKVVS 400
Query: 65 ECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMFITWT 124
E D+GGKVLIPVF +GRAQEL ++LE + + AP+Y G+ +AT + + +
Sbjct: 401 ETTDKGGKVLIPVFGVGRAQELMLVLEEGYNQGIFNAPVYLD-GMIWEATAIHTAYPEYL 459
Query: 125 NQKIRKTFVQR--NMF---DFKHIRPFD--KSFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++++R+ N F FK + + + ID+ P V+ AT GML G S+ K
Sbjct: 460 SKEMRQKIFHEGDNPFLSEVFKRVGSTNERRKVIDSDEPCVILATSGMLTGGPSVEYLKH 519
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEF--ENKQIIDVKMAIEYMS---FSAHAD 232
AP E N +I G+ +GT+G KV G K++ N + + + ++ + FS H+D
Sbjct: 520 LAPDEKNAIIFVGYQAEGTLGRKVQRGWKEIPIVTRNGKTKSIPINLQVYTIEGFSGHSD 579
Query: 233 AKGIMQLIQYCE-----PKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPAN 278
K QLI+Y P+ +++VHGE SK + + +++ F +P N
Sbjct: 580 RK---QLIKYVRRLKPSPEKIIMVHGEESKCLDFADTVRRLFKKQTCVPMN 627
>gi|124485834|ref|YP_001030450.1| ADP-ribosylation/crystallin J1 [Methanocorpusculum labreanum Z]
gi|124363375|gb|ABN07183.1| beta-lactamase domain protein [Methanocorpusculum labreanum Z]
Length = 636
Score = 124 bits (311), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 83/292 (28%), Positives = 141/292 (48%), Gaps = 12/292 (4%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATT--IRDSKRCRERDF 59
F V G ++ +TGD R A R + L+ ESTY + S+ E
Sbjct: 341 FNVGNGVYNVAFTGDLFYGKSRLFNPAVSQFPRLESLVIESTYGGSDDFSPSRVEAEERL 400
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKM 119
+ ++E + RGGKVLIP FA+GR+QEL + +E + + G+ ++AT +
Sbjct: 401 YEVINETLGRGGKVLIPAFAVGRSQELMLAIEEGMRNNRIPKCKVYLDGMIKEATAMHTA 460
Query: 120 FITWTNQKIRKTFVQRN----MFD-FKHIRPFDK--SFIDNPGPMVVFATPGMLHSGLSL 172
+ + N +R + N + D F+ + +DK I + P V+ +T GM++ G +
Sbjct: 461 YPEYLNTDLRNLIFRDNYNPFLADCFEQVDSYDKRQQVIFSEDPCVIISTSGMMNGGPVI 520
Query: 173 IIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYM-SFSAHA 231
A NM++ G+ +GT G ++ G +++ + I + M I+ + FS HA
Sbjct: 521 EYLSNLAESPKNMIVFVGYQAEGTYGRRIQKGWREVPIGKRGTIVINMEIQTVEGFSGHA 580
Query: 232 DAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGES 281
D K +M +QY +PK V +HGE SK + I ++F++ P N E+
Sbjct: 581 DRKQLMNYVQYVQPKPERVFTIHGEESKTIDFASSIYKKFHIQTVAPQNLET 632
>gi|150400309|ref|YP_001324076.1| beta-lactamase domain-containing protein [Methanococcus vannielii
SB]
gi|150013012|gb|ABR55464.1| beta-lactamase domain protein [Methanococcus vannielii SB]
Length = 635
Score = 124 bits (310), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 92/298 (30%), Positives = 145/298 (48%), Gaps = 23/298 (7%)
Query: 3 QVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYAT--TIRDSKRCRERDFL 60
V G ++ YTGD R L A R + LI ESTY + + E++FL
Sbjct: 338 HVGDGLYNVAYTGDIKFEASRLLEPATCQFPRLETLIIESTYGGYDDVLPERDETEKEFL 397
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
+ E RGGKV+IPVF +GRAQEL ++LE + + AP+Y G+ +AT + +
Sbjct: 398 NVMLETTSRGGKVIIPVFGIGRAQELMLVLEEGYNQGIFNAPVYLD-GMIWEATAIHTAY 456
Query: 121 ITWTNQKIRKTFVQR--NMF---DFKHIRPFD--KSFIDNPGPMVVFATPGMLHSGLSLI 173
+ ++ +R N F FK ++ + ++ +D ++ T GML G S+
Sbjct: 457 PEYLSKAMRNRIFHEGDNPFLSEVFKKVKNTNDRRNIMDGDEACIILTTSGMLSGGPSVE 516
Query: 174 IFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENK----QIIDVKMAIEYM-SFS 228
FK A E N ++ G+ +GT+G K+ G K++ K + + V +++ + FS
Sbjct: 517 YFKNLAEDEKNAIVFVGYQSEGTLGRKIQRGFKEIPLMGKNGRSKAVKVNLSVHTLEGFS 576
Query: 229 AHADAKGIMQLIQYCE-----PKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGES 281
H+D K QLI+Y P +L VHGEASK + L + F + P N +S
Sbjct: 577 GHSDRK---QLIKYLRKLKPIPDRILTVHGEASKCIDLASTAYKLFKKETRSPMNLDS 631
>gi|301604039|ref|XP_002931678.1| PREDICTED: uncharacterized protein C12orf26 [Xenopus (Silurana)
tropicalis]
Length = 432
Score = 124 bits (310), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 88/228 (38%), Positives = 131/228 (57%), Gaps = 15/228 (6%)
Query: 804 ITHFVTPDSDISSILSQAYPQDSLHNVCIMGLHTCGDLSGTALRLFT-KSSLQCLVQVGC 862
+T +T ++++S I+ + L ++GLHTCGDL+ LR+FT K ++ + VGC
Sbjct: 189 LTCKITVETELSEII------EDLEESIMVGLHTCGDLAPNTLRIFTAKPEIKAVCSVGC 242
Query: 863 CYHLLEEEFIRSPFWKDVDQSLYEHGYGFPLSEHLRSRKFFLGRNVRMSGTQSPERVIDL 922
CYHLL EEF V +GFP+S HL+ +F GRN RMS + ERV
Sbjct: 243 CYHLLSEEFESPEEEITVTT------WGFPMSIHLKESSWFFGRNARMSACLALERVAVG 296
Query: 923 KQTQTLPLFYRALLEKYLRSKITINDEEPKVVGRLATKCSNFVEYVHRAVDKLKL-DLEV 981
+ T LFYRA+L+ L+ ++ + K VG++ +K NF EYV +++ KL D E+
Sbjct: 297 QGLATESLFYRAVLQVILKDCFSVTQSD-KRVGKIFSKSGNFTEYVRKSLKKLGHDDSEL 355
Query: 982 DDEEVTRLFNSHQREYEYLQIYYFLKTALAPVIEALIVLDRVLYLREQ 1029
D E+ F ++ ++ + LK LAP IEALI+LDR+LYL+EQ
Sbjct: 356 SDMEIMEYFEKYKPRKNEMEAFNMLKVLLAPCIEALILLDRLLYLKEQ 403
Score = 45.8 bits (107), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 19/44 (43%), Positives = 31/44 (70%)
Query: 693 IIDLGGGQGYLSTILALQHGKKTLSLDYNQVNTHGAAVRSKKLE 736
+IDLG G+GYLS+ L++++ K +D + NT GA R++KL+
Sbjct: 7 VIDLGAGKGYLSSYLSMKYNLKVYGIDSSDTNTDGANRRNQKLK 50
>gi|150399442|ref|YP_001323209.1| beta-lactamase domain-containing protein [Methanococcus vannielii
SB]
gi|150012145|gb|ABR54597.1| beta-lactamase domain protein [Methanococcus vannielii SB]
Length = 421
Score = 124 bits (310), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 94/284 (33%), Positives = 148/284 (52%), Gaps = 21/284 (7%)
Query: 6 VGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATT--IRDSKRCRERDFLKKV 63
V + ++YTGD N R L +A D D +I ESTY + I+ +++ E+ + ++
Sbjct: 143 VDGKKLLYTGDINEIETRTLCSADTDIDEIDTIIIESTYGSPLDIKPARKVLEKQLIDEI 202
Query: 64 HECVDRGGKVLIPVFALGRAQELCILLETYWERMNL-QAPIYFAVGLTEKATNYYKMFIT 122
E ++ GGKV+IPVFA+GR+QE+ +++ Y + L + PIY LT Y +
Sbjct: 203 SETIENGGKVIIPVFAVGRSQEIIVVINNYMKSGALKEVPIYVNGSLTH-TIGMYMGYSE 261
Query: 123 WTNQKIRKTFVQR-NMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSG-----LSLIIFK 176
W N KI+ T R N F I+ D+ F N P ++ +T GM+ G LSL+
Sbjct: 262 WLNPKIKNTIENRINPFG-NLIKNSDEVF--NKEPCIIISTSGMVQGGPILQYLSLLKNP 318
Query: 177 KWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGI 236
K N LI+ G+ +GTIG + GVK++ K+ I V + + FSAHAD +
Sbjct: 319 K------NKLILTGYQAEGTIGRSLEEGVKEIT-PFKRAIPVNGTVIKVEFSAHADYNSL 371
Query: 237 MQ-LIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANG 279
++ L + EPK +++HGE + + L I + F + P NG
Sbjct: 372 IRYLKKIPEPKKAIVMHGERYQALSLAMTIWKTFKIPAIAPTNG 415
>gi|428308354|ref|YP_007119299.1| beta-lactamase [Crinalium epipsammum PCC 9333]
gi|428249902|gb|AFZ15677.1| beta-lactamase domain protein [Crinalium epipsammum PCC 9333]
Length = 822
Score = 123 bits (309), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 84/266 (31%), Positives = 134/266 (50%), Gaps = 14/266 (5%)
Query: 4 VKVGNQSIVYTGDYNMTPDRHL-GAAWIDKCRPDLLITESTYATTIRDSKRCRERDFLKK 62
++ G +S++YTGDYN + R G D D+LITESTY +++ +E++ LK
Sbjct: 399 IRYGERSLLYTGDYNTSNSRTTDGLRLADLPEADMLITESTYGADNHPNRKTQEQELLKA 458
Query: 63 VHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMFIT 122
V E V +GG VLIP FALGRAQE+ + + T L P+Y GL T+ + I
Sbjct: 459 VAEVVSKGGNVLIPAFALGRAQEIILAIRTSALFHKLNIPVYVD-GLVRAVTDVFIENIN 517
Query: 123 WTNQKIRKTFVQRNMF----DFKHIRPF---DKSFIDNPGPMVVFATPGMLHSGLSLIIF 175
I+ +Q F I P + + P V+ A+ GML G S+
Sbjct: 518 LLPASIQNFALQSEPFFNPDSHPPIIPIAHPKERPLAIAKPSVIIASSGMLSGGASVYYA 577
Query: 176 KKWAPVENNMLIMPGFCVQGTIGHKVLSGVKK---LEFENKQIIDVKMAIEYMSFSAHAD 232
+N + + G+ + + G + L G+++ +EF NK+++ VK I + S HAD
Sbjct: 578 TTLLERDNAAVFISGYTDEESPG-RFLQGLQQGDEIEFNNKKLV-VKADIRRFNLSGHAD 635
Query: 233 AKGIMQLIQYCEPKNVLLVHGEASKM 258
G+ Q+I PK+++L+HG + +
Sbjct: 636 KVGLTQVINRVNPKHLILIHGSMNAL 661
>gi|424811901|ref|ZP_18237141.1| putative metal-dependent Rnase [Candidatus Nanosalinarum sp.
J07AB56]
gi|339756123|gb|EGQ39706.1| putative metal-dependent Rnase [Candidatus Nanosalinarum sp.
J07AB56]
Length = 619
Score = 123 bits (309), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 86/286 (30%), Positives = 144/286 (50%), Gaps = 16/286 (5%)
Query: 7 GNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCR---ERDFLKKV 63
G +++YTGDYN L A D R + +ITESTY D ++ R ++ FL K+
Sbjct: 333 GLHNLLYTGDYNYDDSHMLRPASTDFHRVETVITESTYGGR-DDEQQPRNQAQKQFLSKM 391
Query: 64 HECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMFITW 123
+ + +GGKV++P FA+GR+QE+ ++ + A +Y G+ A + + +
Sbjct: 392 KQTLAKGGKVIVPAFAVGRSQEILGVMADELSHEHFDATVYLD-GMIRDANALHTAYPEY 450
Query: 124 TNQKI-RKTFVQRNMFDFKHIRPFD-----KSFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++KI +K F + F IR K ++P VV T GM+ G + KK
Sbjct: 451 LSKKIQKKIFAGNSPFTDDQIRSISSHEERKQAFEDPS-CVVLTTSGMVTGGPIMSYLKK 509
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMS-FSAHADAKGI 236
A +N LI G+ GT+G ++L+G ++E + ++ ++V M +E +S FSAH+D + I
Sbjct: 510 EASRSDNTLIFVGYQADGTLGRRILNGANQVEIDGER-VEVDMNVENVSGFSAHSDREQI 568
Query: 237 MQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGE 280
++ + + +V HGE K L I + N+ P N E
Sbjct: 569 KDFVRSLDGRADHVFCNHGETKKCFQLASSIYKSLNVSTSAPDNLE 614
>gi|428304113|ref|YP_007140938.1| beta-lactamase [Crinalium epipsammum PCC 9333]
gi|428245648|gb|AFZ11428.1| beta-lactamase domain protein [Crinalium epipsammum PCC 9333]
Length = 804
Score = 123 bits (309), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 84/266 (31%), Positives = 134/266 (50%), Gaps = 14/266 (5%)
Query: 4 VKVGNQSIVYTGDYNMTPDRHL-GAAWIDKCRPDLLITESTYATTIRDSKRCRERDFLKK 62
++ G +S++YTGDYN + R G D D+LITESTY +++ +E++ LK
Sbjct: 381 IRYGERSLLYTGDYNTSNSRTTDGLRLADLPEADMLITESTYGADNHPNRKTQEQELLKA 440
Query: 63 VHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMFIT 122
V E V +GG VLIP FALGRAQE+ + + T L P+Y GL T+ + I
Sbjct: 441 VAEVVSKGGNVLIPAFALGRAQEIILAIRTSALFHKLNIPVYVD-GLVRAVTDVFIENIN 499
Query: 123 WTNQKIRKTFVQRNMF----DFKHIRPF---DKSFIDNPGPMVVFATPGMLHSGLSLIIF 175
I+ +Q F I P + + P V+ A+ GML G S+
Sbjct: 500 LLPASIQNFALQSEPFFNPDSHPPIIPIAHPKERPLAIAKPSVIIASSGMLSGGASVYYA 559
Query: 176 KKWAPVENNMLIMPGFCVQGTIGHKVLSGVKK---LEFENKQIIDVKMAIEYMSFSAHAD 232
+N + + G+ + + G + L G+++ +EF NK+++ VK I + S HAD
Sbjct: 560 TTLLERDNAAVFISGYTDEESPG-RFLQGLQQGDEIEFNNKKLV-VKADIRRFNLSGHAD 617
Query: 233 AKGIMQLIQYCEPKNVLLVHGEASKM 258
G+ Q+I PK+++L+HG + +
Sbjct: 618 KVGLTQVINRVNPKHLILIHGSMNAL 643
>gi|145590331|ref|YP_001152333.1| beta-lactamase domain-containing protein [Pyrobaculum arsenaticum
DSM 13514]
gi|145282099|gb|ABP49681.1| beta-lactamase domain protein [Pyrobaculum arsenaticum DSM 13514]
Length = 635
Score = 123 bits (308), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 90/319 (28%), Positives = 152/319 (47%), Gaps = 45/319 (14%)
Query: 1 MFQVKVGNQ--SIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATT--IRDSKRCRE 56
M + +GN +I+YTGD+ R L A R ++LI ESTY ++ + E
Sbjct: 326 MAHLHIGNGRYNILYTGDFKYGKTRLLNRAVSKFKRVEMLIMESTYGGKDDVQPPRHEAE 385
Query: 57 RDFLKKVHECVDRGGKVLIPVFALGRAQELCILLETYWER-MNLQAPIYFAVGLTEKATN 115
K + E V +GGKVLIP F+ GR QE+ +L + + + PIY G+ + N
Sbjct: 386 NALAKHISETVSKGGKVLIPAFSTGRGQEILYILNKMMDGGLVPRVPIYVD-GMIVETLN 444
Query: 116 YYKMFITWTNQKIRKTFVQRNMFDFKHIRPFDKS----FIDNPG---------------- 155
Y M+ + N ++ + + + PF S +D
Sbjct: 445 VYLMYPHYLNPEVAEEI-------YGGVNPFTTSGNVVIVDRAKRVEDRINQVAKIAQSE 497
Query: 156 -PMVVFATPGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQ 214
P ++ A GML+ G L F + AP + N L+ + +GTIG ++L+G + EF +
Sbjct: 498 EPAIIIAPHGMLNGGPVLDYFAQLAPDDKNKLVFVSYQAEGTIGRRILNGER--EFVVRS 555
Query: 215 IID----VKMAIEYMS---FSAHADAKGIMQLIQYCE--PKNVLLVHGEASKMVFLKEKI 265
++ + M +E +S FS H+D + +M+ +++ E PK + L+HGE SK++ L I
Sbjct: 556 LVGGESKINMRMEVVSIPGFSGHSDRRELMKYVEHIEPKPKKIALIHGEPSKIISLATSI 615
Query: 266 KQEFNLDCFMPANGESCFV 284
+ ++ + +P GE V
Sbjct: 616 ELKYKITTVIPKVGERIRV 634
>gi|397780985|ref|YP_006545458.1| hypothetical protein BN140_1819 [Methanoculleus bourgensis MS2]
gi|396939487|emb|CCJ36742.1| putative protein MJ1236 [Methanoculleus bourgensis MS2]
Length = 630
Score = 123 bits (308), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 79/296 (26%), Positives = 137/296 (46%), Gaps = 21/296 (7%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATT--IRDSKRCRERDF 59
F + G +I +TGD+N R A R + L ESTY + I+ +R E
Sbjct: 336 FHIGEGLYNIAFTGDFNYQKTRLFSPAVSSFPRLEALFMESTYGGSNDIQPQRRDAEERL 395
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKM 119
+ V + RGGKV+IP FA+GR+QE+ + LE + + + G+ +AT +
Sbjct: 396 YEAVSTTLKRGGKVIIPAFAVGRSQEVMLALEEGIRKKKIPPVKIYLDGMIREATAIHTT 455
Query: 120 FITWTNQKIRKTFVQRNMFDFKHIRPFDKSF--IDNPG---------PMVVFATPGMLHS 168
+ + N +R + + F +F +D+P P V+ T GML
Sbjct: 456 YPEYLNNDLRNQIFREGLNPF-----LADAFVQVDSPDLREQVISGEPCVIITTSGMLSG 510
Query: 169 GLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYM-SF 227
G + K P E NMLI G+ GT+G ++ G +++ ++ I + + I F
Sbjct: 511 GPVMEYLKALGPDERNMLIFVGYQADGTLGRRIQKGWREIPLGWRETIVINLEIATTDGF 570
Query: 228 SAHADAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGES 281
S H+D K +M + + +P+ + +HG+ +K + L I + + ++ P N E+
Sbjct: 571 SGHSDRKQLMNYVAHLQPRPEKIFTIHGDENKTIDLASSIYKRYRIETHSPMNLET 626
>gi|20094663|ref|NP_614510.1| metal-dependent RNase [Methanopyrus kandleri AV19]
gi|19887824|gb|AAM02440.1| Predicted metal-dependent RNase, consists of a
metallo-beta-lactamase domain and an RNA-binding KH
domain [Methanopyrus kandleri AV19]
Length = 652
Score = 123 bits (308), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 84/294 (28%), Positives = 144/294 (48%), Gaps = 31/294 (10%)
Query: 8 NQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFLKKVHECV 67
+ VYTGD N TP R L A R D ++ E+TY + S+R E F K V + +
Sbjct: 360 GHNFVYTGDINPTPSRLLEGADNRFKRVDSMVVEATYGDSRHGSRRKEENRFRKIVRDTL 419
Query: 68 DRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMFITWTNQK 127
+GGKVLIP FA+GRAQE+ ++LE + L+ P+Y G+ +AT + + + N++
Sbjct: 420 KKGGKVLIPSFAVGRAQEVMLVLEDMHRKDELEGPVYLD-GMIYEATAIHTAYPEYLNRR 478
Query: 128 IRKTFVQR--NMFDFKHIRPFDKS-----FIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
++ + + F + P + S +++ P V+ +T GML G L ++ +
Sbjct: 479 LQHRILHEDDDPFTSEVFEPVEGSDHRQAIMEDDEPAVILSTSGMLEGGPILEYLRELSD 538
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENK----QIIDVKMAIEYMS-FSAHADAKG 235
N LI G+ +GT+G ++ G K++ + + +++ +E +S FS H D
Sbjct: 539 DPKNTLIFVGYQAEGTLGRQIQEGAKEVPLPTPTGKTETLRIELRVETVSGFSGHGDK-- 596
Query: 236 IMQLIQYCE------PKNVLLVHGEASKMVF----LKEKIKQEF-----NLDCF 274
++L +Y P V HGE + L+ +++ F NL+CF
Sbjct: 597 -IELTKYVRSIRPSPPSKVFTNHGEPRACKYFSNHLRRTVRKVFSMAPENLECF 649
>gi|297619164|ref|YP_003707269.1| KH-domain/beta-lactamase-domain-containing protein [Methanococcus
voltae A3]
gi|297378141|gb|ADI36296.1| KH-domain/beta-lactamase-domain protein [Methanococcus voltae A3]
Length = 635
Score = 123 bits (308), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 91/294 (30%), Positives = 145/294 (49%), Gaps = 23/294 (7%)
Query: 7 GNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYAT--TIRDSKRCRERDFLKKVH 64
G ++ YTGD R L A R + LI ESTY + + E++ LK +
Sbjct: 342 GLYNVAYTGDIKFEASRLLEPAVCQFPRLETLIMESTYGGYDDVLPERDETEKELLKVIS 401
Query: 65 ECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMFITWT 124
E + +GGKV++PVF +GRAQEL ++LE + + AP++ G+ +AT + + +
Sbjct: 402 ETIAKGGKVILPVFGIGRAQELMLVLEEGYNQGIFNAPVFLD-GMIWEATAIHTAYPEYL 460
Query: 125 NQKIRKTFVQR--NMF---DFKHIRPFD--KSFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ +R N F FK ++ ++ I P ++ AT GML G S+ FK
Sbjct: 461 SKNMRNRIFHEGDNPFLSEVFKKVKDTKDRRNVIGRDEPCIILATSGMLTGGPSVEYFKT 520
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENK----QIIDVKMAIEYM-SFSAHAD 232
A E N ++ G+ +GT+G K+ G K++ K + I VK+++ + FS H D
Sbjct: 521 LADDEKNAIVFVGYQSEGTLGRKIQKGWKEIPLMGKNGKTRAIKVKLSVHTLEGFSGHCD 580
Query: 233 AKGIMQLIQYCE-----PKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGES 281
K QLI+Y P +L +HGEASK + L + F + P N +S
Sbjct: 581 RK---QLIKYLRKLKPIPDKILTIHGEASKCIDLASTAYKLFKKETKAPMNLDS 631
>gi|167045029|gb|ABZ09693.1| putative Metallo-beta-lactamase superfamily protein [uncultured
marine crenarchaeote HF4000_APKG8G15]
Length = 643
Score = 123 bits (308), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 87/294 (29%), Positives = 147/294 (50%), Gaps = 22/294 (7%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATT--IRDSKRCRERDF 59
F + G + VYTGD L +A + R + L+ ESTY I+ ++ E F
Sbjct: 344 FHIGNGEHNFVYTGDIKYGKSMLLESANTNYPRVETLLIESTYGAKEDIQPDRQEVESAF 403
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNL-QAPIYFAVGLTEKATNYYK 118
+ V+ + GGKVLIP+ A+GRAQEL ++++ Y + L +AP+ F G+ ++AT ++
Sbjct: 404 IASVNSVLKEGGKVLIPIPAVGRAQELMLVIDQYMKSGELVEAPV-FMEGMIQEATAIHE 462
Query: 119 MFITWTNQKIRKTFVQR--NMFD---FKHIRPFDKSFIDNP----GPMVVFATPGMLHSG 169
F + + ++K ++ N FD F +I D+ D P P ++ AT GML G
Sbjct: 463 AFPEYLVRDLKKKILETDDNPFDSEYFTNITHQDER--DEPLRENSPCIIIATSGMLEGG 520
Query: 170 LSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENK----QIIDVKMAIEYM 225
L F+ AP N ++ + V GT+G +V+ G +++ K +++ + + E +
Sbjct: 521 PVLDYFRNIAPHTKNKILFVSYQVNGTLGRRVMDGARQVSILGKEGRIEVVSINCSTERL 580
Query: 226 -SFSAHADAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMP 276
FS H+D +M + PK VL+ HGE K L I++ + + P
Sbjct: 581 DGFSGHSDYNQLMSFVHRLRPKLRRVLVNHGEKRKSENLSTSIRRMYRVSSHYP 634
>gi|126465835|ref|YP_001040944.1| beta-lactamase domain-containing protein [Staphylothermus marinus
F1]
gi|126014658|gb|ABN70036.1| beta-lactamase domain protein [Staphylothermus marinus F1]
Length = 652
Score = 122 bits (307), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 83/271 (30%), Positives = 142/271 (52%), Gaps = 11/271 (4%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFLK 61
+ +G +IVYTGD+ P R L A R + LI ESTY T + S+R E + ++
Sbjct: 346 LHIGMGLHNIVYTGDFKYAPTRLLNRAEDKFPRVETLIMESTYGATRQQSRREAEAELIR 405
Query: 62 KVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMFI 121
V + ++R G VLIPVFA+GR QE+ ++L + + F GL + T + +
Sbjct: 406 IVKKTIERKGIVLIPVFAVGRGQEIMLVLNDAISKGLIPKVDVFIEGLVNEVTAIHTQYP 465
Query: 122 TWTNQKIRKTFVQ-RNMF--DFKHIRPFDKSFID--NPGPMVVFATPGMLHSGLSLIIFK 176
+ N+ I++ + N F D+ HI + D P ++ AT GML G ++ K
Sbjct: 466 EYLNRNIKEMIYRGENPFTADYFHIIESGTARPDIVETKPSIILATSGMLTGGPAVEYLK 525
Query: 177 KWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLE--FENK-QIIDVKMAIEYM-SFSAHAD 232
A N L+ G+ +GT+G ++ G+++L+ +NK +II + + + + FS H+D
Sbjct: 526 LIASDPRNSLVFVGYQAEGTLGRRIKDGMRELQTVVDNKVEIIRINLEVYSIDGFSGHSD 585
Query: 233 AKGIMQLIQYCEPK--NVLLVHGEASKMVFL 261
+++ ++ PK N++L HGE S ++ L
Sbjct: 586 QSELLRYVRNISPKPRNIILNHGEPSAIMTL 616
>gi|340623726|ref|YP_004742179.1| beta-lactamase-like protein [Methanococcus maripaludis X1]
gi|339903994|gb|AEK19436.1| beta-lactamase-like protein [Methanococcus maripaludis X1]
Length = 635
Score = 122 bits (307), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 90/298 (30%), Positives = 146/298 (48%), Gaps = 23/298 (7%)
Query: 3 QVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYAT--TIRDSKRCRERDFL 60
V G ++ YTGD R L A R + LI ESTY + + E++FL
Sbjct: 338 HVGDGLYNVAYTGDIKFEASRLLEPAVCQFPRLETLIIESTYGGYDDVLPERDETEKEFL 397
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
+ + E + R GK +IPVF +GRAQEL ++LE + + AP+Y G+ +AT + +
Sbjct: 398 RVIAETIARKGKAIIPVFGIGRAQELMLVLEEGYNQGIFNAPVYLD-GMIWEATAIHTAY 456
Query: 121 ITWTNQKIRKTFVQR--NMF---DFKHIRPFD--KSFIDNPGPMVVFATPGMLHSGLSLI 173
+ ++ +R N F FK ++ + ++ +D+ P ++ T GML G S+
Sbjct: 457 PEYLSKNMRNRIFHEGDNPFLSEVFKKVKNTNDRRNIMDSDEPGIILTTSGMLSGGPSVE 516
Query: 174 IFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENK----QIIDVKMAIEYM-SFS 228
FK A E N ++ G+ +GT+G K+ G K++ K + + V +++ + FS
Sbjct: 517 YFKNLADDEKNSIVFVGYQSEGTLGRKIQKGFKEIPLMGKNGRSKAVKVNLSVHTLEGFS 576
Query: 229 AHADAKGIMQLIQYCE-----PKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGES 281
H+D K QLI+Y P +L VHGE SK + L + F + P N +S
Sbjct: 577 GHSDRK---QLIKYLRKLKPIPDRILTVHGEVSKCIDLASTAYKLFKKETKAPMNLDS 631
>gi|379005388|ref|YP_005261060.1| universal KH-domain/beta-lactamase-domain protein [Pyrobaculum
oguniense TE7]
gi|375160841|gb|AFA40453.1| universal archaeal KH-domain/beta-lactamase-domain protein
[Pyrobaculum oguniense TE7]
Length = 635
Score = 122 bits (307), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 90/319 (28%), Positives = 152/319 (47%), Gaps = 45/319 (14%)
Query: 1 MFQVKVGNQ--SIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATT--IRDSKRCRE 56
M + +GN +I+YTGD+ R L A R ++LI ESTY ++ + E
Sbjct: 326 MAHLHIGNGRYNILYTGDFKYGKTRLLNRAVSKFKRVEMLIMESTYGGKDDVQPPRHEAE 385
Query: 57 RDFLKKVHECVDRGGKVLIPVFALGRAQELCILLETYWER-MNLQAPIYFAVGLTEKATN 115
K + E V +GGKVLIP F+ GR QE+ +L + + + PIY G+ + N
Sbjct: 386 NALAKHISETVSKGGKVLIPAFSTGRGQEILYVLNKMMDGGLVPRVPIYVD-GMIVETLN 444
Query: 116 YYKMFITWTNQKIRKTFVQRNMFDFKHIRPFDKS----FIDNPG---------------- 155
Y M+ + N ++ + + + PF S +D
Sbjct: 445 VYLMYPHYLNPEVAEEI-------YGGVNPFTTSGNVVIVDRAKRVEDRINQVAKIAQSE 497
Query: 156 -PMVVFATPGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQ 214
P ++ A GML+ G L F + AP + N L+ + +GTIG ++L+G + EF +
Sbjct: 498 EPAIIIAPHGMLNGGPVLDYFAQLAPDDKNKLVFVSYQAEGTIGRRILNGER--EFVVRS 555
Query: 215 IID----VKMAIEYMS---FSAHADAKGIMQLIQYCE--PKNVLLVHGEASKMVFLKEKI 265
++ + M +E +S FS H+D + +M+ +++ E PK + L+HGE SK++ L I
Sbjct: 556 LVGGESKINMRMEVVSIPGFSGHSDRRELMKYVEHIEPKPKKIALIHGEPSKIISLATSI 615
Query: 266 KQEFNLDCFMPANGESCFV 284
+ ++ + +P GE V
Sbjct: 616 ELKYKITTVIPKVGERIRV 634
>gi|357161637|ref|XP_003579155.1| PREDICTED: protein RRNAD1-like [Brachypodium distachyon]
Length = 564
Score = 122 bits (307), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 120/450 (26%), Positives = 207/450 (46%), Gaps = 59/450 (13%)
Query: 662 VSQLMSEKKSYEVQVMSQVVAAVTNSCDSSHIIDLGGGQGYLSTILALQHGKKTLSLDYN 721
++Q M+ KK +E++ + +V A+ SC + ++D+G GQGYL+ L+ ++ +++D
Sbjct: 122 LAQGMNTKKKHEIETLGAIVDAIARSCGAKTVVDVGSGQGYLAQALSFEYQLPVVAID-- 179
Query: 722 QVNTHGAAVRSKKLEDSEKESKGPFQSYAGVINKKLWMRTQNRQTSHASVEHHGKNWKRK 781
++H A+V + + E +K YA +K R T H
Sbjct: 180 -ASSHHASVTNTRAERIKKH-------YAAKCVEKQQFRVPRTVTCHV-----------L 220
Query: 782 SKAPVVSIKDEELVVCKDKCKQITHFVTPDSDISSILSQ-AYPQDSLHNVCIMGLHTCGD 840
S A + ++ D C+ + ++ + I Q P S+ + + GLH CGD
Sbjct: 221 SSATLAAVT-------LDACQDVHGEHVTETYCTGISPQIENPNHSIPQLVLAGLHACGD 273
Query: 841 LSGTALRLFTKSS-LQCLVQVGCCYHLLEEEFIRSPFWKDVDQSLYEHGYGFPLSEHLRS 899
LS LR+F +Q LV VGCCY+LL E+ +D D GFP+S +S
Sbjct: 274 LSVNMLRVFVSCEQVQALVSVGCCYNLLSED-----CHEDRDTC-----PGFPISNAAKS 323
Query: 900 RKFFLGRNVRMSGTQSPERVIDLKQTQTLPLF----YRA----LLEKYLRSKITINDEEP 951
K LG+++R QS ER +L + L F +RA +LEKY ++
Sbjct: 324 SKLVLGKSIRDLACQSAERWRNLTKEMALENFDVHAFRAAFQMVLEKYFPEVSRLSPSIG 383
Query: 952 KVVGRLATKCSNFVEYVHRAVDKL-KLDLEVDDEEVTRLFNSHQREYEYLQ-IYYFLKTA 1009
+ L + V H A+ K L E+ + +S E L+ F+ ++
Sbjct: 384 RQGKALRRQRLRKVMESHMAMGKNDDLPYSNSKEQNMTMDSSLPTEPTDLEGAVGFVDSS 443
Query: 1010 LAPVIEALIVLDRVLYLREQQKPQIISNLFLSLLTQVD-----DEEVTRLFDSHQREYEY 1064
++ V V +YL E K + + +S L ++D D + ++ + QR E+
Sbjct: 444 ISRV----NVGPTGIYLDECDKFTLFKDFTVSGLARLDCGFVEDVHLLEIWKAVQRFAEF 499
Query: 1065 LQIYYFLKTALAPVIEALIVLDRVLYLREQ 1094
+ ++ L+ AL P++E I+LDR+L+L+EQ
Sbjct: 500 IGPFWCLRAALGPLVETYILLDRLLFLQEQ 529
Score = 40.0 bits (92), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 36/158 (22%), Positives = 68/158 (43%), Gaps = 15/158 (9%)
Query: 480 ETMELIKNTIDEMIEFITPNIACVNTHTVHFMVEKLWETTIPKAILSQVNNSPSFDSLLE 539
ET + +D + F+ + + H V+F ++LWE + + + P +SLL+
Sbjct: 16 ETAAQTREWMDAIASFLGRHRPLLEAHVVNFFKDRLWE-MVDAEWMECLRGEP-VESLLK 73
Query: 540 EFWKSRQDNHVETNSELVKFFQAADKFRLTSLMNGESIITVDELFSKLQARKCGQVVESA 599
QD+ S L +F A L+ + L+ G V
Sbjct: 74 LPSGCVQDHW---PSSLQEFILTARSLALSRDQK-----SPQSFLPNLRVASIGTV---- 121
Query: 600 KVSQLMSEKKSYEVQVMSQVVAAVTNSCDSSHIIDLGS 637
++Q M+ KK +E++ + +V A+ SC + ++D+GS
Sbjct: 122 -LAQGMNTKKKHEIETLGAIVDAIARSCGAKTVVDVGS 158
>gi|45358257|ref|NP_987814.1| beta-lactamase-like protein [Methanococcus maripaludis S2]
gi|44921014|emb|CAF30250.1| Beta-lactamase-like:KH domain [Methanococcus maripaludis S2]
Length = 635
Score = 122 bits (307), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 90/298 (30%), Positives = 146/298 (48%), Gaps = 23/298 (7%)
Query: 3 QVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYAT--TIRDSKRCRERDFL 60
V G ++ YTGD R L A R + LI ESTY + + E++FL
Sbjct: 338 HVGDGLYNVAYTGDIKFEASRLLEPAVCQFPRLETLIIESTYGGYDDVLPERDETEKEFL 397
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
+ + E + R GK +IPVF +GRAQEL ++LE + + AP+Y G+ +AT + +
Sbjct: 398 RVIAETIARKGKAIIPVFGIGRAQELMLVLEEGYNQGIFNAPVYLD-GMIWEATAIHTAY 456
Query: 121 ITWTNQKIRKTFVQR--NMF---DFKHIRPFD--KSFIDNPGPMVVFATPGMLHSGLSLI 173
+ ++ +R N F FK ++ + ++ +D+ P ++ T GML G S+
Sbjct: 457 PEYLSKNMRNRIFHEGDNPFLSEVFKKVKNTNDRRNIMDSDEPGIILTTSGMLSGGPSVE 516
Query: 174 IFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENK----QIIDVKMAIEYM-SFS 228
FK A E N ++ G+ +GT+G K+ G K++ K + + V +++ + FS
Sbjct: 517 YFKNLADDEKNSIVFVGYQSEGTLGRKIQKGFKEIPLMGKNGRSKAVKVNLSVHTLEGFS 576
Query: 229 AHADAKGIMQLIQYCE-----PKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGES 281
H+D K QLI+Y P +L VHGE SK + L + F + P N +S
Sbjct: 577 GHSDRK---QLIKYLRKLKPIPDRILTVHGEVSKCIDLASTAYKLFKKETKAPMNLDS 631
>gi|288561234|ref|YP_003424720.1| RNA-metabolising metallo-beta-lactamase [Methanobrevibacter
ruminantium M1]
gi|288543944|gb|ADC47828.1| RNA-metabolising metallo-beta-lactamase [Methanobrevibacter
ruminantium M1]
Length = 636
Score = 122 bits (307), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 84/291 (28%), Positives = 152/291 (52%), Gaps = 17/291 (5%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYAT--TIRDSKRCRERDF 59
F + G +++YTGD+ R L A R + I ESTY + S+ E++
Sbjct: 338 FHIGDGAHNLLYTGDFKYEKSRLLEPATTRFPRVESCIMESTYGGHEDVTPSRNNAEKEL 397
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQA-PIYFAVGLTEKATNYYK 118
+K +++ + RGGKVL+PVFA+GRAQEL I+LE Y ++ PI+ G+ +AT +
Sbjct: 398 MKTIYKTIKRGGKVLLPVFAVGRAQELMIVLEEYMHHDIIETVPIHLD-GMIWEATAVHT 456
Query: 119 MFITWTNQKIRKTFVQ--RNMF---DFKHIR-PFDKSFIDNPGPMVVFATPGMLHSGLSL 172
+ ++++R + RN F F ++ ++ I P ++ +T GM+ G S+
Sbjct: 457 ARPEYLSKELRDQILHMSRNPFMTESFNQVQNNAERKEIVEGEPSIILSTSGMMTGGNSV 516
Query: 173 IIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENK----QIIDVKMAIEYM-SF 227
FK + N LI G+ +G++G K+ G K+L FE++ ++ +V+M ++ + F
Sbjct: 517 EYFKWLCEDKKNTLIFVGYQSEGSLGRKIQKGHKQLPFEDEDGKTRVFNVEMEVKTIEGF 576
Query: 228 SAHADAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMP 276
S H++ + +M+ + P+ ++ HG+ K V L I + F ++ P
Sbjct: 577 SGHSNRRQLMEFAKRLHPRPDKIITCHGDPYKTVDLASSIHRSFKVETKTP 627
>gi|297527491|ref|YP_003669515.1| KH-domain/beta-lactamase-domain-containing protein [Staphylothermus
hellenicus DSM 12710]
gi|297256407|gb|ADI32616.1| KH-domain/beta-lactamase-domain protein [Staphylothermus hellenicus
DSM 12710]
Length = 647
Score = 122 bits (306), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 84/271 (30%), Positives = 141/271 (52%), Gaps = 11/271 (4%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFLK 61
+ +G +IVYTGD+ P R L A R + LI ESTY T + S+R E + +K
Sbjct: 341 LHIGMGLHNIVYTGDFKYAPTRLLNRAEDKFPRVETLIMESTYGATRQQSRREAEAELIK 400
Query: 62 KVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMFI 121
V + ++R G VLIPVFA+GR QE+ ++L + F GL + T + +
Sbjct: 401 IVKKTIERKGIVLIPVFAVGRGQEIMLVLNDAINNGLIPKVDVFIEGLVNEVTAIHTQYP 460
Query: 122 TWTNQKIRKTFVQ-RNMF--DFKHIRPFDKSFID--NPGPMVVFATPGMLHSGLSLIIFK 176
+ N+ I++ + N F D+ HI + D P ++ AT GML G ++ K
Sbjct: 461 EYLNRNIKEMIYRGENPFTADYFHIIEGGTARPDIVETRPSIIIATSGMLTGGPAVEYLK 520
Query: 177 KWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLE--FENK-QIIDVKMAIEYM-SFSAHAD 232
A N L+ G+ +GT+G ++ G+++L+ +NK +II + + + + FS H+D
Sbjct: 521 LIASDPRNSLVFVGYQAEGTLGRRIKDGMRELQTVVDNKVEIIRINLEVYSIDGFSGHSD 580
Query: 233 AKGIMQLIQYCEPK--NVLLVHGEASKMVFL 261
+++ ++ PK N++L HGE S ++ L
Sbjct: 581 QSELLRYVRNISPKPRNIILNHGEPSAIMTL 611
>gi|336121871|ref|YP_004576646.1| RNA-metabolising metallo-beta-lactamase [Methanothermococcus
okinawensis IH1]
gi|334856392|gb|AEH06868.1| RNA-metabolising metallo-beta-lactamase [Methanothermococcus
okinawensis IH1]
Length = 422
Score = 122 bits (305), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 82/279 (29%), Positives = 143/279 (51%), Gaps = 10/279 (3%)
Query: 6 VGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATT--IRDSKRCRERDFLKKV 63
V + ++YTGD N + L A D D LI ESTY + I+ +++ E+ + ++
Sbjct: 143 VDGKKVLYTGDINEIETKTLRPADTDIDEIDTLIIESTYGSPMDIKPARKVLEKQLVDEI 202
Query: 64 HECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQ-APIYFAVGLTEKATNYYKMFIT 122
+ +++GGKV+IPVFA+GRAQE+ ++ Y L+ P+Y L AT Y +
Sbjct: 203 SKTIEKGGKVIIPVFAVGRAQEIMAIISNYMRSGLLKNVPVYVDGSLIH-ATGVYLSYSE 261
Query: 123 WTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAPVE 182
W N KIR R + F ++ ++ ID +V +T GM+ G ++ + K
Sbjct: 262 WLNPKIRNGLENR-INPFGDVKKANRQIIDKEDSCIVISTSGMVQGG-PVLQYLKLLKDP 319
Query: 183 NNMLIMPGFCVQGTIGHKVLSGVKKL-EFENKQIIDVKMAIEYMSFSAHADAKGIMQ-LI 240
N +I+ G+ + T+G ++ G K++ F+NK + + + + FSAH D +++ L
Sbjct: 320 KNKIILTGYQAEDTLGRQLEEGAKEITPFKNK--LPINGEVVKIEFSAHGDYNSLIRYLK 377
Query: 241 QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANG 279
+ +PK V ++HGE + + L I + F + P G
Sbjct: 378 KIPKPKKVFVMHGERYQALSLAMTIWKNFKIPSTAPTVG 416
>gi|408404164|ref|YP_006862147.1| beta-lactamase [Candidatus Nitrososphaera gargensis Ga9.2]
gi|408364760|gb|AFU58490.1| beta-lactamase domain protein [Candidatus Nitrososphaera gargensis
Ga9.2]
Length = 700
Score = 122 bits (305), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 86/296 (29%), Positives = 145/296 (48%), Gaps = 30/296 (10%)
Query: 7 GNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATT--IRDSKRCRERDFLKKVH 64
G +I+Y+GDY + L +A R + LITESTY T + ++ R F + ++
Sbjct: 408 GAHNILYSGDYKYARTQLLDSAVSMYPRVETLITESTYGNTTDVMPDQQVVYRSFTESIN 467
Query: 65 ECVDRGGKVLIPVFALGRAQELCILLETYWERMNL-QAPIYFAVGLTEKATNYYKMFITW 123
+ + GGKVLIPV A+GRAQE+ +++ L ++PIY G+ +A+ + + +
Sbjct: 468 KTLIEGGKVLIPVPAVGRAQEIMLVMAKEMREGRLVESPIYIE-GMISEASAIHMSYAHY 526
Query: 124 TNQKIRKTFVQRNMFDFKHIRPFDKSF-------------IDNPGPMVVFATPGMLHSGL 170
++RK+ Q I PF + +++ P +V AT GML G
Sbjct: 527 LGSEVRKSVSQ-------GINPFQSEYFTVISGHGKRDDVLNDENPAIVMATSGMLEGGP 579
Query: 171 SLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSG----VKKLEFENK-QIIDVKMAIEYM 225
S+ FK+ AP N ++ + + GT+G +VL G V ++ K +++ V+ + +
Sbjct: 580 SVEYFKELAPNPKNKIMFVSYQINGTLGRRVLDGAMSEVSMMDKTGKVKVVPVRCQTQKI 639
Query: 226 -SFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGE 280
FS H+D I+ + +PK VL+ HGE SK + I + +P N E
Sbjct: 640 DGFSGHSDFNQILNFVSRVKPKRVLVNHGERSKSENVASAIYSRLKIRSGVPDNRE 695
>gi|304315326|ref|YP_003850473.1| metal-dependent RNase [Methanothermobacter marburgensis str.
Marburg]
gi|302588785|gb|ADL59160.1| predicted metal-dependent RNase [Methanothermobacter marburgensis
str. Marburg]
Length = 636
Score = 121 bits (304), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 86/301 (28%), Positives = 151/301 (50%), Gaps = 29/301 (9%)
Query: 7 GNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYAT--TIRDSKRCRERDFLKKVH 64
G ++VYTGD+ R L A R + L+ ESTY ++ + E++ +K ++
Sbjct: 343 GQHNMVYTGDFKYEQSRLLEPAASRFPRIETLVMESTYGGHEDVQPPRTRAEKELVKTIY 402
Query: 65 ECVDRGGKVLIPVFALGRAQELCILLETYWERMNL-QAPIYFAVGLTEKATNYYKMFITW 123
+ RGGK+LIPVFA+GRAQEL I+LE Y + + P+Y G+ +A + +
Sbjct: 403 STLKRGGKILIPVFAVGRAQELMIVLEEYIRTGIIDEVPVYID-GMIWEANAIHTARPEY 461
Query: 124 TNQKIRKTFVQRNMFDFKHIRPF------------DKSFIDNPGPMVVFATPGMLHSGLS 171
++ +R +F H PF ++ I P ++ +T GML G S
Sbjct: 462 LSKDLRD-----QIFHMGH-NPFISEIFHKVNGMDERREIVEGEPSIILSTSGMLTGGNS 515
Query: 172 LIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQ----IIDVKMAIEYM-S 226
L FK N L+ G+ +G++G ++ G K++ ++++ + +VKM I+ +
Sbjct: 516 LEYFKWLCEDPKNSLVFVGYQAEGSLGRRIQKGWKEIPLKDEEDKMRVYNVKMNIKTIEG 575
Query: 227 FSAHADAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFV 284
FS H+D + +M+ ++ PK +LL HG+ K + L I + + ++ P N E+ +
Sbjct: 576 FSGHSDRRQLMEYVKRISPKPEKILLCHGDNYKTLDLASSIYRTYRIETKTPLNLETVRI 635
Query: 285 Q 285
Q
Sbjct: 636 Q 636
>gi|300856380|ref|YP_003781364.1| metal-dependent RNase [Clostridium ljungdahlii DSM 13528]
gi|300436495|gb|ADK16262.1| putative metal-dependent RNase [Clostridium ljungdahlii DSM 13528]
Length = 825
Score = 121 bits (304), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 78/285 (27%), Positives = 143/285 (50%), Gaps = 26/285 (9%)
Query: 10 SIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFLKKVHECVDR 69
S+ Y+GD+++ + + + K RPD I ESTY + ++ E + +++C+++
Sbjct: 163 SLFYSGDFSIFSQKTVEGLKVPKLRPDAAIFESTYGDRLHSNREVEEEKLIDTINDCIEK 222
Query: 70 GGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMFITWTNQKIR 129
GK+LIP FALGRAQE+ ++++ + L+ + G+ + YK+ + +
Sbjct: 223 KGKMLIPAFALGRAQEVLLIIKKAVNKNILKDTKVYVDGMIKDINRTYKLNPLYLKNSLG 282
Query: 130 KTFVQRNMFDFKHIRPF-DKSFI--DNPG----------PMVVFATPGMLHSGLSLIIFK 176
K + + I PF D + I DN P V+ ++ GML G S +
Sbjct: 283 KKIL-------RGIEPFYDDNIIAVDNNKLREEVLKDDEPCVIVSSSGMLTGGYSQYYAE 335
Query: 177 KWAPVENNMLIMPGFCVQGTIGHKVLSGV-----KKLEFENKQIIDVKMAIEYMSFSAHA 231
K AP+EN +++ G+ + + G K+L+ + KKL + N++ + V +E + SAH+
Sbjct: 336 KIAPMENGYIVITGYQDEESPGRKLLNLLDEEKDKKL-YINEKTVPVNCKVEKVGLSAHS 394
Query: 232 DAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMP 276
D I LI + PKN+ LVHG + L +++ ++ + P
Sbjct: 395 DKYEIKSLIDFLTPKNIFLVHGNEEVVENLSKELSKDVRGRVYAP 439
>gi|352683091|ref|YP_004893615.1| mRNA 3'-end processing factor [Thermoproteus tenax Kra 1]
gi|350275890|emb|CCC82537.1| mRNA 3'-end processing factor [Thermoproteus tenax Kra 1]
Length = 635
Score = 121 bits (304), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 87/305 (28%), Positives = 145/305 (47%), Gaps = 39/305 (12%)
Query: 7 GNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATT--IRDSKRCRERDFLKKVH 64
G +I+YTGD+ L A R ++LI ESTY ++ + E +K +
Sbjct: 334 GRYNILYTGDFKFGRTNLLNRAVNKFKRVEMLIMESTYGGRDDVQPPRIEAENTLVKNIT 393
Query: 65 ECVDRGGKVLIPVFALGRAQELCILLETYWERMNL-QAPIYFAVGLTEKATNYYKMFITW 123
E ++RGGKVLIP F+ GRAQE+ +L + +L +APIY G+ + N Y M+ +
Sbjct: 394 ETIERGGKVLIPAFSTGRAQEILYILNREMNKGSLKKAPIYVD-GMIVETLNAYLMYPHF 452
Query: 124 TNQKIRKTFVQRNMFDFKHIRPFDKS---------------------FIDNPGPMVVFAT 162
N+++ + + I PF S + + P V+ A
Sbjct: 453 LNREVAEEI-------YNGINPFTSSGNIQIIERAKRLEDRINQVAKIVQDGQPGVIIAP 505
Query: 163 PGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAI 222
GML+ G L F AP N LI + + T+G ++L+G + + + +VK+++
Sbjct: 506 HGMLNGGPILEYFVHLAPDPANKLIFVSYQAENTLGRRILNGERDFIVRSLSMGEVKVSM 565
Query: 223 EYM-----SFSAHADAKGIMQLIQYCE--PKNVLLVHGEASKMVFLKEKIKQEFNLDCFM 275
+ FS H+D + +M+ I+ E PK ++L+HGE SK + L I+ + + +
Sbjct: 566 KMGVLSIPGFSGHSDRRELMKYIETLEPRPKKIVLIHGEPSKTISLATSIEIRYKVTTVI 625
Query: 276 PANGE 280
P GE
Sbjct: 626 PKVGE 630
>gi|312135668|ref|YP_004003006.1| RNA-metabolising metallo-beta-lactamase [Caldicellulosiruptor
owensensis OL]
gi|311775719|gb|ADQ05206.1| RNA-metabolising metallo-beta-lactamase [Caldicellulosiruptor
owensensis OL]
Length = 821
Score = 121 bits (304), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 78/292 (26%), Positives = 140/292 (47%), Gaps = 14/292 (4%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
M ++ SI+YTGD++ + A + K RPD++I ESTY + ++ E
Sbjct: 152 MIFIQTQEGSILYTGDFSADKQLTVDRASVPKIRPDIVICESTYGDRLHTNRSFEEERLF 211
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
+ E + +GGKVLIP FA+GRAQE+ ++L Y ++ + + F G+ + Y+
Sbjct: 212 NTIFEVISQGGKVLIPAFAIGRAQEVILILRNYMKKRKVSFNV-FIDGMVREVIRVYRNN 270
Query: 121 ITWTNQKIRKTFVQ-RNMFDFKHIRPFD-----KSFIDNPGPMVVFATPGMLHSGLSLII 174
T+ + + K ++ +F +I + I + P V+ ++ GML G SL
Sbjct: 271 PTYLSSRYYKRVLKGEEIFLADNINVVSDKKQREEIISSSDPCVIISSSGMLTGGPSLFY 330
Query: 175 FKKWAPVENNMLIMPGFCVQGTIGHKVL------SGVKKLEFENKQIIDVKMAIEYMSFS 228
+K +N ++ + G+ + G K+L KK+ K+ +VK +E S
Sbjct: 331 AEKIVQSQNALVAITGYQDEEAPGRKLLELAELPESEKKINLNGKE-YEVKCRVEKYGLS 389
Query: 229 AHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGE 280
AH+D I+ + P+ V+ VHG + + + +E + + +P NGE
Sbjct: 390 AHSDRDKILGFLATLRPRTVIFVHGSEDAISQISDMAVKELDANIVVPQNGE 441
>gi|393794892|ref|ZP_10378256.1| beta-lactamase domain-containing protein [Candidatus
Nitrosoarchaeum limnia BG20]
Length = 658
Score = 121 bits (304), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 84/293 (28%), Positives = 150/293 (51%), Gaps = 20/293 (6%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATT--IRDSKRCRERDF 59
F + G+ + VY+GD AA + R + L+ ESTY I+ +++ E F
Sbjct: 359 FHIGNGDHNFVYSGDIKFGKSILFEAASWNFPRVETLLIESTYGAKEDIQPTRQEVESAF 418
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNL-QAPIYFAVGLTEKATNYYK 118
+ V+ + GGKVLIP+ A+GRAQE+ ++++ Y + + +AP+ F G+ +A+ ++
Sbjct: 419 INAVNNTLADGGKVLIPIPAVGRAQEIMMVIDHYMKSGEMVEAPV-FTEGMISEASAIHE 477
Query: 119 MFITWTNQKIRKTFVQR--NMFD------FKHIRPFDKSFIDNPGPMVVFATPGMLHSGL 170
+ + +++++ ++ N FD +H ++ DN P ++ AT GML G
Sbjct: 478 AYPEYLARELKQKILETDDNPFDSEYFTNIEHADAREEPMRDN-SPCIILATSGMLEGGP 536
Query: 171 SLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENK----QIIDVKMAIEYM- 225
L FK AP + N ++ + V GT+G +VL G K++ K +++ + ++E +
Sbjct: 537 VLEYFKNIAPDKKNKVLFVSYQVNGTMGRRVLDGSKQVTMLGKEGKVEVVTINCSVEKLD 596
Query: 226 SFSAHADAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMP 276
FS H+D +M +Q PK VL+ HGE K L I++ F + P
Sbjct: 597 GFSGHSDYNQLMSFVQRLRPKLRRVLVNHGERRKSENLAMNIRRMFRVPAHYP 649
>gi|329766104|ref|ZP_08257663.1| beta-lactamase domain-containing protein [Candidatus
Nitrosoarchaeum limnia SFB1]
gi|329137375|gb|EGG41652.1| beta-lactamase domain-containing protein [Candidatus
Nitrosoarchaeum limnia SFB1]
Length = 645
Score = 121 bits (304), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 84/293 (28%), Positives = 150/293 (51%), Gaps = 20/293 (6%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATT--IRDSKRCRERDF 59
F + G+ + VY+GD AA + R + L+ ESTY I+ +++ E F
Sbjct: 346 FHIGNGDHNFVYSGDIKFGKSILFEAASWNFPRVETLLIESTYGAKEDIQPTRQEVESAF 405
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNL-QAPIYFAVGLTEKATNYYK 118
+ V+ + GGKVLIP+ A+GRAQE+ ++++ Y + + +AP+ F G+ +A+ ++
Sbjct: 406 INAVNNTLADGGKVLIPIPAVGRAQEIMMVIDHYMKSGEMVEAPV-FTEGMISEASAIHE 464
Query: 119 MFITWTNQKIRKTFVQR--NMFD------FKHIRPFDKSFIDNPGPMVVFATPGMLHSGL 170
+ + +++++ ++ N FD +H ++ DN P ++ AT GML G
Sbjct: 465 AYPEYLARELKQKILETDDNPFDSEYFTNIEHADAREEPMRDN-SPCIILATSGMLEGGP 523
Query: 171 SLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENK----QIIDVKMAIEYM- 225
L FK AP + N ++ + V GT+G +VL G K++ K +++ + ++E +
Sbjct: 524 VLEYFKNIAPDKKNKVLFVSYQVNGTMGRRVLDGSKQVTMLGKEGKVEVVTINCSVEKLD 583
Query: 226 SFSAHADAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMP 276
FS H+D +M +Q PK VL+ HGE K L I++ F + P
Sbjct: 584 GFSGHSDYNQLMSFVQRLRPKLRRVLVNHGERRKSENLAMNIRRMFRVPAHYP 636
>gi|344996811|ref|YP_004799154.1| RNA-metabolising metallo-beta-lactamase [Caldicellulosiruptor
lactoaceticus 6A]
gi|343965030|gb|AEM74177.1| RNA-metabolising metallo-beta-lactamase [Caldicellulosiruptor
lactoaceticus 6A]
Length = 822
Score = 121 bits (304), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 79/299 (26%), Positives = 142/299 (47%), Gaps = 14/299 (4%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
M +K SI+YTGD++ + A + K RPD++I ESTY + ++ E
Sbjct: 152 MIFIKTQEGSILYTGDFSADRQLTVDKASVPKIRPDVVICESTYGDRLHTNRSFEEERLF 211
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
V E V +GGKVLIP FA+GRAQE+ ++L Y ++ + + F G+ + Y+
Sbjct: 212 NTVAEVVSQGGKVLIPAFAIGRAQEIILILRNYMKKRKVSFNV-FIDGMVREVIRVYRNN 270
Query: 121 ITWTNQKIRKTFVQ-RNMFDFKHIRPFD-----KSFIDNPGPMVVFATPGMLHSGLSLII 174
T+ + + K ++ +F +I + + + P V+ ++ GML G S+
Sbjct: 271 PTYLSSRYYKRVLKGEEIFLADNINVVSDKKQREEIVSSSDPCVIISSSGMLTGGPSVFY 330
Query: 175 FKKWAPVENNMLIMPGFCVQGTIGHKVL------SGVKKLEFENKQIIDVKMAIEYMSFS 228
+K +N ++ + G+ + G K+L +K+E K+ +VK +E S
Sbjct: 331 AEKIVQSQNALIAITGYQDEEAPGRKLLELAELPENERKIELNGKE-YEVKCRVEKYGLS 389
Query: 229 AHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTD 287
AH+D I+ + +P+ V+ HG + + + +E + +P NGE + D
Sbjct: 390 AHSDRDRILGFLATLKPRTVIFAHGSEDAISQISDMAVKELEANIIVPQNGEINSISID 448
>gi|449435476|ref|XP_004135521.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
3-I-like [Cucumis sativus]
Length = 392
Score = 121 bits (303), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 65/166 (39%), Positives = 92/166 (55%), Gaps = 9/166 (5%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V + ++YTGDY+ DRHL AA + + PD+ I ESTY + + RE+ F
Sbjct: 176 MFMVDIAGVRVLYTGDYSREEDRHLRAAEMPQFSPDVCIIESTYGVQLHQPRHIREKRFT 235
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW----ERMNLQAPIYFAVGLTEKATNY 116
VH + +GG+VLIP FALGRAQEL ++L+ YW E N+ PIY+A L ++
Sbjct: 236 DVVHSTISQGGRVLIPAFALGRAQELLLILDEYWANHPELHNI--PIYYASPLAKRCLTV 293
Query: 117 YKMFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFA 161
Y+ + N +I+ + N F FK+I P + GP VV A
Sbjct: 294 YETYTLSMNDRIQNA--KSNPFRFKYISPLKSIEVFKDVGPSVVMA 337
>gi|340345702|ref|ZP_08668834.1| Beta-lactamase domain protein [Candidatus Nitrosoarchaeum koreensis
MY1]
gi|339520843|gb|EGP94566.1| Beta-lactamase domain protein [Candidatus Nitrosoarchaeum koreensis
MY1]
Length = 645
Score = 121 bits (303), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 84/293 (28%), Positives = 150/293 (51%), Gaps = 20/293 (6%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATT--IRDSKRCRERDF 59
F + G+ + VY+GD AA + R + L+ ESTY I+ +++ E F
Sbjct: 346 FHIGNGDHNFVYSGDIKFGKSILFEAASWNFPRVETLLIESTYGAKEDIQPTRQEVESAF 405
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNL-QAPIYFAVGLTEKATNYYK 118
+ V+ + GGKVLIP+ A+GRAQE+ ++++ Y + + +AP+ F G+ +A+ ++
Sbjct: 406 INAVNNTLADGGKVLIPIPAVGRAQEIMMVIDHYMKSGEMVEAPV-FTEGMISEASAIHE 464
Query: 119 MFITWTNQKIRKTFVQR--NMFD------FKHIRPFDKSFIDNPGPMVVFATPGMLHSGL 170
+ + +++++ ++ N FD +H ++ DN P ++ AT GML G
Sbjct: 465 AYPEYLARELKQKILETDDNPFDSEYFTNIEHADTREEPMRDN-SPCIILATSGMLEGGP 523
Query: 171 SLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENK----QIIDVKMAIEYM- 225
L FK AP + N ++ + V GT+G +VL G K++ K +++ + ++E +
Sbjct: 524 VLEYFKNIAPDKKNKVLFVSYQVNGTMGRRVLDGSKQVTMLGKEGKVEVVTINCSVEKLD 583
Query: 226 SFSAHADAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMP 276
FS H+D +M +Q PK VL+ HGE K L I++ F + P
Sbjct: 584 GFSGHSDYNQLMSFVQRLRPKLRRVLVNHGERRKSENLAMNIRRMFRVPAHYP 636
>gi|18312205|ref|NP_558872.1| mRNA 3'-end processing factor [Pyrobaculum aerophilum str. IM2]
gi|18159643|gb|AAL63054.1| mRNA 3'-end processing factor, conjectural [Pyrobaculum aerophilum
str. IM2]
Length = 634
Score = 121 bits (303), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 91/315 (28%), Positives = 151/315 (47%), Gaps = 45/315 (14%)
Query: 1 MFQVKVGNQ--SIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATT--IRDSKRCRE 56
M + +GN +I+YTGD+ R L A R ++LI ESTY ++ + E
Sbjct: 325 MAHLHIGNGRYNILYTGDFKYGKTRLLNRAANKFKRVEMLIMESTYGGRDDVQPPRVEAE 384
Query: 57 RDFLKKVHECVDRGGKVLIPVFALGRAQELCILLETYWE-RMNLQAPIYFAVGLTEKATN 115
K V + V RGGKVLIP F+ GR QE+ +L E + + P+Y G+ + N
Sbjct: 385 NALAKHVSDAVTRGGKVLIPAFSTGRGQEILYILNKMMEGGLIPRVPVYVD-GMIVETLN 443
Query: 116 YYKMFITWTNQKIRKTFVQRNMFDFKHIRPFDKS----FIDNPG---------------- 155
Y M+ + N ++ + + + PF S +D
Sbjct: 444 VYLMYPHYLNPEVAEEI-------YGGVNPFTTSGSVVIVDRAKRVEDRINQVAKIAQSE 496
Query: 156 -PMVVFATPGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQ 214
P V+ A GML+ G + F + A E N LI + +GT+G ++L+G + EF K
Sbjct: 497 EPAVIIAPHGMLNGGPVVDYFAQLAHDERNKLIFVSYQAEGTLGRRILNGER--EFVIKS 554
Query: 215 IID----VKMAIEYMS---FSAHADAKGIMQLIQYCE--PKNVLLVHGEASKMVFLKEKI 265
++ + M +E +S FS H+D + +M+ +++ E PK ++L+HGE SK++ L I
Sbjct: 555 LVGGESKINMRMEVVSIPGFSGHSDRRELMKYVEHMEPKPKKIVLIHGEPSKIISLATSI 614
Query: 266 KQEFNLDCFMPANGE 280
+ ++ + +P GE
Sbjct: 615 ELKYKITTIIPKVGE 629
>gi|222529916|ref|YP_002573798.1| beta-lactamase domain-containing protein [Caldicellulosiruptor
bescii DSM 6725]
gi|222456763|gb|ACM61025.1| beta-lactamase domain protein [Caldicellulosiruptor bescii DSM
6725]
Length = 821
Score = 120 bits (302), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 77/292 (26%), Positives = 140/292 (47%), Gaps = 14/292 (4%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
M ++ SI+YTGD++ + A + K RPD++I ESTY + ++ E
Sbjct: 152 MIFIQTQEGSILYTGDFSADRQLTVDKASVPKIRPDVVICESTYGDRLHTNRNFEEERLF 211
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
V E + +GGKVLIP FA+GRAQE+ ++L Y ++ + + F G+ + Y+
Sbjct: 212 NTVAEVISQGGKVLIPAFAIGRAQEIILILRNYMKKRKVSFNV-FIDGMVREVIRVYRNN 270
Query: 121 ITWTNQKIRKTFVQ-RNMFDFKHIRPFD-----KSFIDNPGPMVVFATPGMLHSGLSLII 174
T+ + + K ++ +F +I + I + P V+ ++ GML G S+
Sbjct: 271 PTYLSSRYYKRVLKGEEIFFADNINVVSDKKQREEIISSSDPCVIISSSGMLTGGPSVFY 330
Query: 175 FKKWAPVENNMLIMPGFCVQGTIGHKVL------SGVKKLEFENKQIIDVKMAIEYMSFS 228
+K +N ++ + G+ + G K+L +K+E K+ +VK +E S
Sbjct: 331 AEKIVQSQNALIAITGYQDEEAPGRKLLELAELPENERKIELNGKE-YEVKCRVEKYGLS 389
Query: 229 AHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGE 280
AH+D I+ + P+ V+ HG ++ + + +E + +P NGE
Sbjct: 390 AHSDRDRILGFLATLRPRTVIFAHGSEDAILQISDMAVKELEANIIVPQNGE 441
>gi|134045242|ref|YP_001096728.1| beta-lactamase domain-containing protein [Methanococcus maripaludis
C5]
gi|132662867|gb|ABO34513.1| beta-lactamase domain protein [Methanococcus maripaludis C5]
Length = 422
Score = 120 bits (302), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 82/279 (29%), Positives = 143/279 (51%), Gaps = 11/279 (3%)
Query: 6 VGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATT--IRDSKRCRERDFLKKV 63
+ + ++YTGD N R L AA D D +I ESTY + I+ +++ E+ + ++
Sbjct: 143 IDGKKLLYTGDINEIETRTLRAADTDFDNIDTIIIESTYGSPLDIKPARKVLEKQLIDEI 202
Query: 64 HECVDRGGKVLIPVFALGRAQELCILLETYWERMNL-QAPIYFAVGLTEKATNYYKMFIT 122
E +D GGKV+IPVFA+GR+QE+ +++ Y + L + PIY LT T Y +
Sbjct: 203 SETIDNGGKVIIPVFAVGRSQEIIVVINNYMKSGALKEVPIYINGSLTH-TTGMYMGYSD 261
Query: 123 WTNQKIRKTFVQR-NMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAPV 181
W N KI+ R N F I+ D+ F + P ++ +T GM+ G ++ +
Sbjct: 262 WLNPKIKNAIENRINPFG-NLIKGGDEVF--SREPCIIISTSGMVQGG-PVLQYLSLLKN 317
Query: 182 ENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLIQ 241
N +I+ G+ +GTIG + G +++ K+ I V + + FSAHAD +++ ++
Sbjct: 318 PRNKIILTGYQAEGTIGRSLEEGATEVK-PFKRAIPVNGKVVKIEFSAHADYNSLLRFMK 376
Query: 242 YC-EPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANG 279
+P +++HGE + + L I + + P+ G
Sbjct: 377 KIPQPSKAIVMHGERYQALSLAMTIWKTLKIPALAPSIG 415
>gi|84489102|ref|YP_447334.1| exonuclease [Methanosphaera stadtmanae DSM 3091]
gi|84372421|gb|ABC56691.1| predicted exonuclease [Methanosphaera stadtmanae DSM 3091]
Length = 635
Score = 120 bits (302), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 79/295 (26%), Positives = 151/295 (51%), Gaps = 17/295 (5%)
Query: 7 GNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDS--KRCRERDFLKKVH 64
G + +YTGD+ R LG R + ++ ESTY ++ + E++ +K ++
Sbjct: 342 GKHNFLYTGDFKNEQSRLLGTPKQHFARIESMVMESTYGGREDNTPTRNTAEKELVKAIY 401
Query: 65 ECVDRGGKVLIPVFALGRAQELCILLETYWERMNL-QAPIYFAVGLTEKATNYYKMFITW 123
+ + +GGK+LIPVFA+GRAQE+ I+LE + R L + P+Y G+ +AT + +
Sbjct: 402 DTIQKGGKILIPVFAVGRAQEIMIVLEEFINRGILGKVPVYLD-GMIWEATAIHTAHPEF 460
Query: 124 TNQKIRKTFVQ-------RNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFK 176
+ +++K ++F+ + +++ P V+ +T GML G S+ FK
Sbjct: 461 LSNELQKQIFHAGENPFTSDVFNKVTNNEQRRKLLESNEPCVILSTSGMLTGGNSVEYFK 520
Query: 177 KWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENK---QIIDVKMAIEYM-SFSAHAD 232
+ E N +I G+ +G++G ++ G ++ E Q+ + + + + FS H+D
Sbjct: 521 ELCEDEKNRIIFVGYQSEGSLGRRIQKGFDEVPLEKDGVTQLFHINIEVTTIDGFSGHSD 580
Query: 233 AKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQ 285
K +M+ ++ PK +L+ HG+A K + L I + + ++ P N E+ +Q
Sbjct: 581 RKQLMEYVRKLSPKPDKILVCHGDAYKALDLASSIYRTYKIETKTPLNLETTRLQ 635
>gi|395645476|ref|ZP_10433336.1| KH-domain/beta-lactamase-domain protein [Methanofollis liminatans
DSM 4140]
gi|395442216|gb|EJG06973.1| KH-domain/beta-lactamase-domain protein [Methanofollis liminatans
DSM 4140]
Length = 629
Score = 120 bits (302), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 79/291 (27%), Positives = 140/291 (48%), Gaps = 11/291 (3%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTY--ATTIRDSKRCRERDF 59
F + G +I +TGD+N R A R + L ESTY A ++ ++ E
Sbjct: 335 FHLGDGLYNIAFTGDFNYQKTRLFSPAVSTFPRLEALFMESTYGGANALQPAREDAEAKL 394
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKM 119
V ++RGGKV+IP FA+GR+QE+ + LE + + + G+ ++AT +
Sbjct: 395 YDIVTRTIERGGKVIIPAFAVGRSQEVMLALEEGMRKEKIPRVKVYLDGMIKEATAIHTT 454
Query: 120 FITWTNQKIRKTFVQRNMFD-----FKHIRPFD-KSFIDNPGPMVVFATPGMLHSGLSLI 173
+ + N +R + M F + D + + P V+ T GML G +
Sbjct: 455 YPEYLNADLRNQIFRDGMNPFLAECFVQVDSSDLREKVIGGDPCVIVTTSGMLSGGPVME 514
Query: 174 IFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYM-SFSAHAD 232
AP E N L+ G+ +GT G ++ G +++ N++ + VK+ I+ + FS H+D
Sbjct: 515 YLYALAPDERNTLVFVGYQAEGTFGRRLQKGWREIPVGNRETMVVKLDIDTVDGFSGHSD 574
Query: 233 AKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGES 281
K +M IQ+ +P+ + +HG+ + + L I + F+++ P N E+
Sbjct: 575 RKQLMGFIQHIQPRPEKIFTIHGDENSTIDLASSIYKRFHIETRSPLNLET 625
>gi|302872375|ref|YP_003841011.1| RNA-metabolising metallo-beta-lactamase [Caldicellulosiruptor
obsidiansis OB47]
gi|302575234|gb|ADL43025.1| RNA-metabolising metallo-beta-lactamase [Caldicellulosiruptor
obsidiansis OB47]
Length = 821
Score = 120 bits (302), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 77/292 (26%), Positives = 140/292 (47%), Gaps = 14/292 (4%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
M ++ SI+YTGD++ + A + K RPD++I ESTY + ++ E
Sbjct: 152 MIFIQTQEGSILYTGDFSADKQLTVDKASVPKIRPDIVICESTYGDRLHTNRSFEEERLF 211
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
++E + +GGKVLIP FA+GRAQE+ ++L Y ++ + I F G+ + Y+
Sbjct: 212 NTIYEVISQGGKVLIPAFAIGRAQEVILILRNYMKKRKVSFNI-FIDGMVREVIKVYRNN 270
Query: 121 ITWTNQKIRKTFVQ-RNMFDFKHIRPFD-----KSFIDNPGPMVVFATPGMLHSGLSLII 174
T+ + + K ++ +F +I + I + P V+ ++ GML G S+
Sbjct: 271 PTYLSSRYYKRVLKGEEIFLADNINVVSDKKQREEIISSSDPCVIISSSGMLTGGPSVFY 330
Query: 175 FKKWAPVENNMLIMPGFCVQGTIGHKVL------SGVKKLEFENKQIIDVKMAIEYMSFS 228
+K N ++ + G+ + G K+L KK++ K+ +VK +E S
Sbjct: 331 AEKIVQSPNALIAITGYQDEEAPGRKLLELAELSESEKKIDLNGKE-YEVKCRVEKYGLS 389
Query: 229 AHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGE 280
AH+D I+ + P+ V+ HG + + + +E + + +P NGE
Sbjct: 390 AHSDRDKILGFLSTLRPRTVIFAHGSEEAISQISDMAVKELDANIVVPQNGE 441
>gi|45358944|ref|NP_988501.1| beta-lactamase-like protein [Methanococcus maripaludis S2]
gi|340624691|ref|YP_004743144.1| beta-lactamase-like protein [Methanococcus maripaludis X1]
gi|45047810|emb|CAF30937.1| Beta-lactamase-like:ATP/GTP-binding site motif A (P-loop)
[Methanococcus maripaludis S2]
gi|339904959|gb|AEK20401.1| beta-lactamase-like protein [Methanococcus maripaludis X1]
Length = 422
Score = 120 bits (301), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 82/279 (29%), Positives = 143/279 (51%), Gaps = 11/279 (3%)
Query: 6 VGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATT--IRDSKRCRERDFLKKV 63
+ + ++YTGD N R L AA D D +I ESTY + I+ +++ E+ + ++
Sbjct: 143 IDGKKLLYTGDINEIETRTLRAADTDFDEIDTIIIESTYGSPLDIKPARKVLEKQLIDEI 202
Query: 64 HECVDRGGKVLIPVFALGRAQELCILLETYWERMNL-QAPIYFAVGLTEKATNYYKMFIT 122
E +D GGKV+IPVFA+GR+QE+ +++ Y + L + PIY LT T Y +
Sbjct: 203 SETIDNGGKVIIPVFAVGRSQEIIVVINNYMKSGALKEVPIYINGSLTH-TTGMYMGYSD 261
Query: 123 WTNQKIRKTFVQR-NMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAPV 181
W N KI+ R N F I+ D+ F + P ++ +T GM+ G ++ +
Sbjct: 262 WLNPKIKNAIENRINPFG-NLIKGGDEVF--SREPCIIISTSGMVQGG-PVLQYLSLLKN 317
Query: 182 ENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLIQ 241
N +I+ G+ +GTIG + G +++ K+ I V + + FSAHAD +++ ++
Sbjct: 318 PRNKIILTGYQAEGTIGRSLEEGATEVK-PFKRAIPVNGKVVKIEFSAHADYNSLLRFMK 376
Query: 242 YC-EPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANG 279
+P +++HGE + + L I + + P+ G
Sbjct: 377 KIPQPDKAIVMHGERYQALSLAMTIWKTLKIPALAPSIG 415
>gi|150402550|ref|YP_001329844.1| beta-lactamase domain-containing protein [Methanococcus maripaludis
C7]
gi|150033580|gb|ABR65693.1| beta-lactamase domain protein [Methanococcus maripaludis C7]
Length = 422
Score = 120 bits (301), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 81/279 (29%), Positives = 144/279 (51%), Gaps = 11/279 (3%)
Query: 6 VGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATT--IRDSKRCRERDFLKKV 63
+ + ++YTGD N R L AA D D +I ESTY + I+ +++ E+ + ++
Sbjct: 143 IDGKKLLYTGDINEIETRTLRAADTDFDEIDTIIIESTYGSPLDIKPARKVLEKQLIDEI 202
Query: 64 HECVDRGGKVLIPVFALGRAQELCILLETYWERMNL-QAPIYFAVGLTEKATNYYKMFIT 122
E ++ GGKV+IPVFA+GR+QE+ +++ Y + L + PIY LT T Y +
Sbjct: 203 SETIENGGKVIIPVFAVGRSQEIIVVINNYMKSGALKEVPIYINGSLTH-TTGMYMGYSD 261
Query: 123 WTNQKIRKTFVQR-NMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAPV 181
W N KI+ R N F I+ D++F + P ++ +T GM+ G ++ +
Sbjct: 262 WLNPKIKNAIENRINPFG-NLIKGGDEAF--SKEPCIIISTSGMVQGG-PVLQYLSLLKN 317
Query: 182 ENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLIQ 241
N +I+ G+ +GTIG + G +++ K+ I V + + FSAHAD +++ ++
Sbjct: 318 PRNKIILTGYQAEGTIGRSLEEGATEVK-PFKRAIPVNGKVVKIEFSAHADYNSLLRFMK 376
Query: 242 YC-EPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANG 279
+P +++HGE + + L I + + P+ G
Sbjct: 377 KIPQPSKAIVMHGERYQALSLAMTIWKTLKIPALAPSIG 415
>gi|312792978|ref|YP_004025901.1| RNA-metabolising metallo-beta-lactamase [Caldicellulosiruptor
kristjanssonii 177R1B]
gi|312180118|gb|ADQ40288.1| RNA-metabolising metallo-beta-lactamase [Caldicellulosiruptor
kristjanssonii 177R1B]
Length = 822
Score = 120 bits (301), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 78/299 (26%), Positives = 142/299 (47%), Gaps = 14/299 (4%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
M ++ SI+YTGD++ + A + K RPD++I ESTY + ++ E
Sbjct: 152 MIFIQTQEGSILYTGDFSADRQLTVDKASVPKIRPDVVICESTYGDRLHTNRSFEEERLF 211
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
V E V +GGKVLIP FA+GRAQE+ ++L Y ++ + + F G+ + Y+
Sbjct: 212 NTVAEVVSQGGKVLIPAFAIGRAQEIILILRNYMKKRKVSFNV-FIDGMVREVIRVYRNN 270
Query: 121 ITWTNQKIRKTFVQ-RNMFDFKHIRPFD-----KSFIDNPGPMVVFATPGMLHSGLSLII 174
T+ + + K ++ +F +I + + + P V+ ++ GML G S+
Sbjct: 271 PTYLSSRYYKRVLKGEEIFLADNINVVSDKKQREEIVSSSDPCVIISSSGMLTGGPSVFY 330
Query: 175 FKKWAPVENNMLIMPGFCVQGTIGHKVL------SGVKKLEFENKQIIDVKMAIEYMSFS 228
+K +N ++ + G+ + G K+L +K+E K+ +VK +E S
Sbjct: 331 AEKIVQSQNALIAITGYQDEEAPGRKLLELAELPENERKIELNGKE-YEVKCRVEKYGLS 389
Query: 229 AHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTD 287
AH+D I+ + +P+ V+ HG + + + +E + +P NGE + D
Sbjct: 390 AHSDRDRILGFLATLKPRTVIFAHGSEDAISQISDMAVKELGANIIVPQNGEINSISID 448
>gi|119719226|ref|YP_919721.1| beta-lactamase domain-containing protein [Thermofilum pendens Hrk
5]
gi|119524346|gb|ABL77718.1| beta-lactamase domain protein [Thermofilum pendens Hrk 5]
Length = 639
Score = 120 bits (301), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 96/297 (32%), Positives = 150/297 (50%), Gaps = 22/297 (7%)
Query: 7 GNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATT--IRDSKRCRERDFLKKVH 64
G +IVYTGD L A+ R ++LI ESTY + + S+ + + K V
Sbjct: 341 GLINIVYTGDMKYARTMLLDPAYNKFPRAEVLIIESTYGSKSDVLPSEDDAKLELAKIVL 400
Query: 65 ECVDRGGKVLIPVFALGRAQE-LCILLETYWERMNLQAPIYFAVGLTEKATNYYKMFITW 123
E ++R G VLIPV A+GRAQE L LL+ + + PI F G+ ++ + + F +
Sbjct: 401 ETIERKGVVLIPVLAVGRAQEVLLALLDLMKKGAVPRVPI-FIEGMIDEVSAVHMTFPEY 459
Query: 124 TNQKIRK-TFVQRNMFDFKHI---RPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWA 179
+ IR+ + N F ++I R + I P ++ AT GML G L + A
Sbjct: 460 LSATIREMIYRDENPFTSENIHVIRGEAREDITEKKPSIILATSGMLTGGPVLDYLRILA 519
Query: 180 PVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENK--QII--DVKMAIEYM-SFSAHADAK 234
EN+ L+ + V+GT+G K+L G++KL F + ++I D+KM + + FS H+D +
Sbjct: 520 DDENSSLVFVSYQVEGTLGRKILQGLRKLSFTDPSGKVINKDLKMRVYRVEGFSGHSDRR 579
Query: 235 GIMQLIQYCE--PKNVLLVHGEASKMVFLKEKIKQEF-------NLDCFMPANGESC 282
+ I+ E P+ V+L HGEA K+ + + E+ NLD P N ES
Sbjct: 580 QLEAFIRRMEPSPRTVILNHGEAGKIEEFRRYLSSEWFKKKTGLNLDVLTPKNAESI 636
>gi|333988300|ref|YP_004520907.1| KH-domain/beta-lactamase-domain-containing protein
[Methanobacterium sp. SWAN-1]
gi|333826444|gb|AEG19106.1| KH-domain/beta-lactamase-domain protein [Methanobacterium sp.
SWAN-1]
Length = 634
Score = 120 bits (301), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 85/294 (28%), Positives = 152/294 (51%), Gaps = 16/294 (5%)
Query: 7 GNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYAT--TIRDSKRCRERDFLKKVH 64
G + VYTGD+ R L A R + ++ ESTY ++ ++ E++ +K ++
Sbjct: 342 GQHNFVYTGDFKFERSRLLEPAVAKFPRIESMVMESTYGGHEDVQPTRNDAEKELIKTIY 401
Query: 65 ECVDRGGKVLIPVFALGRAQELCILLETYWERMNL-QAPIYFAVGLTEKATNYYKMFITW 123
+ + RGGKVLIPVFA+GRAQE+ I+LE Y + + P+Y G+ +AT + +
Sbjct: 402 QTLQRGGKVLIPVFAVGRAQEIMIVLEEYIRHGIIEEVPVYID-GMIWEATAIHTARPEY 460
Query: 124 TNQKIRKTF--VQRNMF--DFKHI--RPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ +R + +N F D H ++ I P ++ +T GML G S+ FK
Sbjct: 461 LSKDLRDQIFHMGKNPFISDVFHKVNGGNNRQEIIEGEPSIILSTSGMLTGGNSVDYFKW 520
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENK---QIIDVKMAIEYM-SFSAHADA 233
E N +I G+ +G++G ++ G K++ + + +VKM I+ + FS H+D
Sbjct: 521 LCEDEKNSIIFVGYQSEGSLGRRLQKGWKEIPLKEDGKTNVYNVKMQIKTVQGFSGHSDR 580
Query: 234 KGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQ 285
+ +M I+ PK +++ HG+ K + L I +++ ++ P N E+ +Q
Sbjct: 581 RQLMDYIRRLSPKPEKIMICHGDNYKTLDLASSIYRQYKIETKTPMNLETIRIQ 634
>gi|159905675|ref|YP_001549337.1| beta-lactamase domain-containing protein [Methanococcus maripaludis
C6]
gi|159887168|gb|ABX02105.1| beta-lactamase domain protein [Methanococcus maripaludis C6]
Length = 422
Score = 120 bits (301), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 81/279 (29%), Positives = 144/279 (51%), Gaps = 11/279 (3%)
Query: 6 VGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATT--IRDSKRCRERDFLKKV 63
+ + ++YTGD N R L AA D D +I ESTY + I+ +++ E+ + ++
Sbjct: 143 IDGKKLLYTGDINEIETRTLRAADTDFDEIDTIIIESTYGSPLDIKPARKVLEKQLIDEI 202
Query: 64 HECVDRGGKVLIPVFALGRAQELCILLETYWERMNL-QAPIYFAVGLTEKATNYYKMFIT 122
E ++ GGKV+IPVFA+GR+QE+ +++ Y + L + PIY LT T Y +
Sbjct: 203 SETIENGGKVIIPVFAVGRSQEIIVVINNYMKSGALKEVPIYINGSLTH-TTGMYMGYSD 261
Query: 123 WTNQKIRKTFVQR-NMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAPV 181
W N KI+ R N F I+ D++F + P ++ +T GM+ G ++ +
Sbjct: 262 WLNPKIKNAIENRINPFG-NLIKGGDEAF--SKEPCIIISTSGMVQGG-PVLQYLSLLKN 317
Query: 182 ENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLIQ 241
N +I+ G+ +GTIG + G +++ K+ I V + + FSAHAD +++ ++
Sbjct: 318 PRNKIILTGYQAEGTIGRSLEEGATEVK-PFKRAIPVNGKVVKIEFSAHADYNSLLRFMK 376
Query: 242 YC-EPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANG 279
+P +++HGE + + L I + + P+ G
Sbjct: 377 KIPQPSKAIVMHGERYQALSLAMTIWKTLKIPALAPSIG 415
>gi|407465689|ref|YP_006776571.1| beta-lactamase domain-containing protein [Candidatus Nitrosopumilus
sp. AR2]
gi|407048877|gb|AFS83629.1| beta-lactamase domain-containing protein [Candidatus Nitrosopumilus
sp. AR2]
Length = 646
Score = 120 bits (300), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 83/293 (28%), Positives = 148/293 (50%), Gaps = 20/293 (6%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYA--TTIRDSKRCRERDF 59
F + G+ + VY+GD AA + R + L+ ESTY I+ +++ E F
Sbjct: 347 FHIGNGDHNFVYSGDIKFGKSILFEAASWNFPRVETLLIESTYGLKEDIQPTRQEVESAF 406
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNL-QAPIYFAVGLTEKATNYYK 118
+ V+ + GGKVLIP+ A+GRAQE+ ++++ Y + + +AP+ F G+ +A+ ++
Sbjct: 407 INAVNNTLADGGKVLIPIPAVGRAQEIMMVIDHYMKSGEMVEAPV-FTEGMISEASAIHE 465
Query: 119 MFITWTNQKIRKTFVQR--NMFD------FKHIRPFDKSFIDNPGPMVVFATPGMLHSGL 170
+ + +++++ ++ N FD +H ++ DN P ++ AT GML G
Sbjct: 466 SYPEYLARELKQKILETDDNPFDSEYFTNIEHADAREEPMRDN-SPCIILATSGMLEGGP 524
Query: 171 SLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENK----QIIDVKMAIEYM- 225
L FK AP + N ++ + V GT+G +VL G K+ K +++ + +E +
Sbjct: 525 VLEYFKNIAPDKKNKVLFVSYQVNGTLGRRVLDGSKQATMLGKEGKVEVVSINCGVEKLD 584
Query: 226 SFSAHADAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMP 276
FS H+D +M +Q PK VL+ HGE K L I++ + + P
Sbjct: 585 GFSGHSDYNQLMSFVQRLRPKLRRVLVNHGERKKSENLAMNIRRMYKVPAHYP 637
>gi|126179147|ref|YP_001047112.1| beta-lactamase domain-containing protein [Methanoculleus marisnigri
JR1]
gi|125861941|gb|ABN57130.1| beta-lactamase domain protein [Methanoculleus marisnigri JR1]
Length = 630
Score = 119 bits (299), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 77/296 (26%), Positives = 137/296 (46%), Gaps = 21/296 (7%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATT--IRDSKRCRERDF 59
F + G +I +TGD+N R A R + L ESTY + + ++ E
Sbjct: 336 FHIGEGLYNIAFTGDFNYQKSRLFSPAVSSFPRLEALFMESTYGGSNDFQPQRKEAEDKL 395
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKM 119
+ V + RGGKV+IP FA+GR+QE+ + LE R + + G+ ++AT +
Sbjct: 396 YETVSTTIKRGGKVIIPAFAVGRSQEVMLALEEGIRRQKIPPVKIYLDGMIKEATAIHTT 455
Query: 120 FITWTNQKIRKTFVQRNMFDFKHIRPFDKSF--IDNPG---------PMVVFATPGMLHS 168
+ + N +R + + F ++F +D+P P V+ T GML+
Sbjct: 456 YPEYLNSDLRNQIFREGLNPF-----LSEAFVQVDSPAIREKVVTGDPCVIITTSGMLNG 510
Query: 169 GLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYM-SF 227
G + K P E N LI G+ GT+G ++ G +++ ++ I + + I F
Sbjct: 511 GPVMEYLKALGPDERNSLIFVGYQADGTLGRRIQKGWREIPLGWRETIVINLEIATSDGF 570
Query: 228 SAHADAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGES 281
S H+D K +M + + +P+ + +HG+ +K + L I + ++ P N E+
Sbjct: 571 SGHSDRKQLMNYVAHLQPRPEKIFTIHGDENKTIDLASSIYKRHRIETHSPMNLET 626
>gi|374327735|ref|YP_005085935.1| mRNA 3'-end processing factor [Pyrobaculum sp. 1860]
gi|356643004|gb|AET33683.1| mRNA 3'-end processing factor, conjectural [Pyrobaculum sp. 1860]
Length = 644
Score = 119 bits (299), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 89/313 (28%), Positives = 149/313 (47%), Gaps = 41/313 (13%)
Query: 1 MFQVKVGNQ--SIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATT--IRDSKRCRE 56
M + +GN +I+YTGD+ R L A R ++LI ESTY ++ + E
Sbjct: 335 MAHLHIGNGRYNILYTGDFKYGKTRLLNRAVNKFKRVEMLIMESTYGGKDDVQPPRLEAE 394
Query: 57 RDFLKKVHECVDRGGKVLIPVFALGRAQELCILLETYWE-RMNLQAPIYFAVGLTEKATN 115
K V + V +GGKVLIP F+ GR QE+ +L E + + PIY G+ + N
Sbjct: 395 NALAKHVSDTVSKGGKVLIPAFSTGRGQEILYILNKMIEGGLVPRVPIYVD-GMIVETLN 453
Query: 116 YYKMFITWTNQKIRKTFVQRNMFDFKHIRPFDKS----FIDNPG---------------- 155
Y M+ + N ++ + + + PF S +D
Sbjct: 454 AYLMYPHYLNPEVAEEI-------YGGVNPFTTSGSVVIVDRAKRVEDRINQVAKIAQSE 506
Query: 156 -PMVVFATPGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFE--- 211
P V+ A GML+ G L F + A E N LI + +GT+G ++L+G ++
Sbjct: 507 EPAVIIAPHGMLNGGPVLDYFAQLAYDERNKLIFVSYQAEGTLGRRILNGEREFVIRSLV 566
Query: 212 -NKQIIDVKMAIEYM-SFSAHADAKGIMQLIQYCE--PKNVLLVHGEASKMVFLKEKIKQ 267
+ ID++M + + FS H+D + +M+ +++ E PK ++L+HGE SK++ L I+
Sbjct: 567 GGESKIDMRMEVASIPGFSGHSDRRELMKYVEHMEPKPKKIVLIHGEPSKIISLATSIEL 626
Query: 268 EFNLDCFMPANGE 280
++ + +P GE
Sbjct: 627 KYKITTVIPKVGE 639
>gi|268323698|emb|CBH37286.1| conserved hypothetical protein, RNA-metabolising
metallo-beta-lactamase family [uncultured archaeon]
Length = 628
Score = 119 bits (298), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 85/297 (28%), Positives = 151/297 (50%), Gaps = 25/297 (8%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATT--IRDSKRCRERDF 59
F V G ++ +TGD A+ R + L+ ESTY + S+R E++
Sbjct: 336 FHVGNGLYNVAFTGDMKYERTFLFDPAFNSFTRLEALVMESTYGGMHDFQPSRREAEKNL 395
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKM 119
+++ + +GGKV+IP FA+GR+QE+ I LE M L+ P+Y G+ +AT +
Sbjct: 396 KEEIERTLKQGGKVIIPAFAVGRSQEVMIALEG----MQLEVPVYLD-GMIWEATAIHSA 450
Query: 120 FITWTNQKIRKTFVQ-RNMF---DFKHIRPFDK--SFIDNPGPMVVFATPGMLHSGLSLI 173
+ + N+ ++ + Q N F F + +K I+ ++ +T GML+ G L
Sbjct: 451 YPEYLNKNLKNSIYQGANPFLSDIFVQVDDAEKRREIIEGDESSIILSTSGMLNGGPVLE 510
Query: 174 IFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFE-NKQIIDVKMAIEYMS---FSA 229
K A E N LI G+ +GT+G ++ G ++++ + +++++KM I + FS
Sbjct: 511 YLKGLAGDERNTLIFVGYQAEGTLGRRIQKGWEEIQLSADGKMVNIKMKIGTATIDGFSG 570
Query: 230 HADAKGIMQLIQYCE-----PKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGES 281
H+D QL++Y + P V+ +HGE++K + L I ++ N+D P N E+
Sbjct: 571 HSDRN---QLLEYVKRIRPLPSKVMCMHGESNKCIALASGIHKKLNIDTSAPMNLET 624
>gi|171184545|ref|YP_001793464.1| beta-lactamase domain-containing protein [Pyrobaculum neutrophilum
V24Sta]
gi|170933757|gb|ACB39018.1| beta-lactamase domain protein [Pyrobaculum neutrophilum V24Sta]
Length = 634
Score = 119 bits (298), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 91/315 (28%), Positives = 151/315 (47%), Gaps = 45/315 (14%)
Query: 1 MFQVKVGNQ--SIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATT--IRDSKRCRE 56
M + +GN +I+YTGD+ R L A R ++LI ESTY ++ + E
Sbjct: 325 MAHLHIGNGRYNILYTGDFKYGKTRLLNRAVSKFKRVEMLIMESTYGGRDDVQPPRVEAE 384
Query: 57 RDFLKKVHECVDRGGKVLIPVFALGRAQELCILLETYWE-RMNLQAPIYFAVGLTEKATN 115
K V E V RGGKVLIP F+ GR QE+ +L E + + P+Y G+ + N
Sbjct: 385 NALAKHVAEAVSRGGKVLIPAFSTGRGQEILYILNKMIEGGLIPRVPVYVD-GMIVETLN 443
Query: 116 YYKMFITWTNQKIRKTFVQRNMFDFKHIRPFDKS----FIDNPG---------------- 155
Y M+ + N ++ + + + PF S +D
Sbjct: 444 AYLMYPHYLNPEVAEEV-------YGGVNPFTTSGSVVIVDRAKRVEDRINQVAKIAQSE 496
Query: 156 -PMVVFATPGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQ 214
P V+ A GML+ G + F + A N L+ + +GT+G ++L+G + EF +
Sbjct: 497 EPAVIIAPHGMLNGGPVVDYFSQLAHDPRNKLVFVSYQAEGTLGRRILNGER--EFVVRS 554
Query: 215 IID----VKMAIEYMS---FSAHADAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKI 265
++ V+M +E +S FS H+D + +M+ +++ EPK ++L+HGE SK+V L I
Sbjct: 555 LVGGEAKVEMRMEAVSIPGFSGHSDRRELMKYVEHLEPKPRKIVLIHGEPSKIVSLATSI 614
Query: 266 KQEFNLDCFMPANGE 280
+ ++ + +P GE
Sbjct: 615 ELKYKITTVVPKVGE 629
>gi|118576728|ref|YP_876471.1| cleavage and polyadenylation specificity factor/metal-dependent
RNase [Cenarchaeum symbiosum A]
gi|118195249|gb|ABK78167.1| cleavage and polyadenylation specificity factor/metal-dependent
RNase [Cenarchaeum symbiosum A]
Length = 645
Score = 119 bits (298), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 81/297 (27%), Positives = 148/297 (49%), Gaps = 18/297 (6%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATT--IRDSKRCRERDF 59
F + G+ + VY+GD AA + R + L+ ESTY I+ +++ E F
Sbjct: 346 FHIGSGDHNFVYSGDIKFGKSILFEAASWNFPRVETLLIESTYGAKEDIQPTRQEVESAF 405
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKM 119
+ V+ + GGKVLIP+ A+GRAQE+ ++++ Y + + F G+ +A+ ++
Sbjct: 406 INAVNGALADGGKVLIPIPAVGRAQEIMMVIDHYMKSGEMAEAPVFTEGMISEASAIHEA 465
Query: 120 FITWTNQKIRKTFVQR--NMFD------FKHIRPFDKSFIDNPGPMVVFATPGMLHSGLS 171
+ +++++ ++ N FD +H D++ D P ++ AT GML G
Sbjct: 466 HPEYLARELKQKILETDDNPFDSEYFTNVEHADGRDEALRDG-SPCIILATSGMLEGGPV 524
Query: 172 LIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENK----QIIDVKMAIEYM-S 226
L FK AP + N ++ + V GT+G +VL G +++ N+ ++++++ +E +
Sbjct: 525 LEYFKSIAPHKQNKILFVSYQVNGTLGRRVLDGARQVPLMNRGGKIEVVNIECRMEKLDG 584
Query: 227 FSAHADAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGES 281
FS H+D + +Q PK VL+ HGE K L +++ F + P ES
Sbjct: 585 FSGHSDYNQLTGFVQKLRPKLRRVLVNHGERRKSENLALAVRRMFRIPAHYPQIQES 641
>gi|440783043|ref|ZP_20960854.1| metal-dependent RNase [Clostridium pasteurianum DSM 525]
gi|440219618|gb|ELP58829.1| metal-dependent RNase [Clostridium pasteurianum DSM 525]
Length = 828
Score = 119 bits (298), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 78/282 (27%), Positives = 137/282 (48%), Gaps = 10/282 (3%)
Query: 10 SIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFLKKVHECVDR 69
+I Y+GD+++ + + + K RPD+ I E+TY + ++ E+ L + EC
Sbjct: 163 TIFYSGDFSIFSQKTIDGLKVPKLRPDVAIFETTYGDRLHSNREVEEQRLLDFIRECEVN 222
Query: 70 GGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMFITWTNQKIR 129
GK+LIP FALGRAQE+ ++++ + ++ + G+ + YK+ + +
Sbjct: 223 KGKILIPAFALGRAQEILLIIKKALNKNIIKNIKVYVDGMIKDINRVYKINPLYLKNSLG 282
Query: 130 KTFVQR-NMFDFKHIRPFDKSFIDN-----PGPMVVFATPGMLHSGLSLIIFKKWAPVEN 183
K ++ F +I P D + N P V+ ++ GML G S +K A +EN
Sbjct: 283 KKILKGVEPFYDDNIIPVDNKELRNKILEEKEPCVIVSSSGMLTGGYSQYYAEKIASMEN 342
Query: 184 NMLIMPGFCVQGTIGHKVLSGVKKLEFE----NKQIIDVKMAIEYMSFSAHADAKGIMQL 239
+IM G+ + + G K+L+ + K E N ++I V +E + SAH+D I L
Sbjct: 343 GYIIMTGYQDEESPGRKLLNLLDKDEDRVIEINGKLIPVNCNVEKVGLSAHSDKSEIKSL 402
Query: 240 IQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGES 281
I P N+ +VHG + +++ +E + P GES
Sbjct: 403 INLLTPNNIFMVHGNEEIVKSFTKELFKEVRGRVYAPECGES 444
>gi|386875398|ref|ZP_10117572.1| KH/beta-lactamase domain protein [Candidatus Nitrosopumilus salaria
BD31]
gi|386806797|gb|EIJ66242.1| KH/beta-lactamase domain protein [Candidatus Nitrosopumilus salaria
BD31]
Length = 646
Score = 119 bits (297), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 83/293 (28%), Positives = 148/293 (50%), Gaps = 20/293 (6%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYA--TTIRDSKRCRERDF 59
F + G+ + VY+GD AA + R + L+ ESTY I+ +++ E F
Sbjct: 347 FHIGNGDHNFVYSGDIKFGKSILFEAASWNFPRVETLLIESTYGLKEDIQPTRQEVESAF 406
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNL-QAPIYFAVGLTEKATNYYK 118
+ V+ + GGKVLIP+ A+GRAQE+ ++++ Y + + +AP+ F G+ +A+ ++
Sbjct: 407 INAVNNTLADGGKVLIPIPAVGRAQEIMMVIDHYMKSGEMIEAPV-FTEGMISEASAIHE 465
Query: 119 MFITWTNQKIRKTFVQR--NMFD------FKHIRPFDKSFIDNPGPMVVFATPGMLHSGL 170
+ + +++++ ++ N FD +H ++ DN P ++ AT GML G
Sbjct: 466 SYPEYLARELKQKILETDDNPFDSEYFTNIEHGDAREEPMRDN-SPCIILATSGMLEGGP 524
Query: 171 SLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENK----QIIDVKMAIEYM- 225
L FK AP + N ++ + V GT+G +VL G K+ K +++ + +E +
Sbjct: 525 VLEYFKNIAPDKKNKVLFVSYQVNGTLGRRVLDGSKQATMLGKEGKVEVVSINCGVEKLD 584
Query: 226 SFSAHADAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMP 276
FS H+D +M +Q PK VL+ HGE K L I++ + + P
Sbjct: 585 GFSGHSDYNQLMSFVQRLRPKLRRVLVNHGERKKSENLAMNIRRMYKVPAHYP 637
>gi|320101459|ref|YP_004177051.1| KH-domain/beta-lactamase-domain-containing protein [Desulfurococcus
mucosus DSM 2162]
gi|319753811|gb|ADV65569.1| KH-domain/beta-lactamase-domain protein [Desulfurococcus mucosus
DSM 2162]
Length = 647
Score = 119 bits (297), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 85/292 (29%), Positives = 146/292 (50%), Gaps = 29/292 (9%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFLK 61
V +G +IVYTGD+ + R L A R + LI E TY T+++++ E+ +
Sbjct: 347 LHVGMGLHNIVYTGDFKYSDSRLLNKANTVFPRVETLIMEGTYGATLQENRAQAEQGLVD 406
Query: 62 KVHECVDRGGKVLIPVFALGRAQELCILLETYWE-----RMNLQAPIYFAVGLTEKATNY 116
V +R G VLIPVFA+GR QE+ ++L + RMN+ + GL + T
Sbjct: 407 IVKRTAERKGIVLIPVFAVGRGQEIILILNEAMKTGRIPRMNI-----YVEGLVNEVTAI 461
Query: 117 YKMFITWTNQKIRKT-FVQRNMFDFKHIRPFDKSFIDNPG-----PMVVFATPGMLHSGL 170
+ + + N+ +R+ + N F + ++P + S + P P V+ AT GML G
Sbjct: 462 HTQYPEYLNRSLREAIYNGDNPFTSEWLKPIE-SGVARPDIVEDRPSVIIATSGMLTGGP 520
Query: 171 SLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEF--ENK-QIIDVKMAIEYM-S 226
++ + A E N L+ G+ +GT+G K+ G +++ EN+ + I +K+ + +
Sbjct: 521 AVDYLRLLAGDERNSLVFVGYQAEGTLGRKIKDGAREVTMVSENRVEAIHIKLEVYSIEG 580
Query: 227 FSAHADAKGIMQLIQYC-----EPKNVLLVHGEASKMVFLKEKIKQEFNLDC 273
FS H+D + +L++Y P+ +LL HGE S + L +K + L
Sbjct: 581 FSGHSDQR---ELLEYARDISPRPRRILLNHGEPSALSALSTLLKGDRELRA 629
>gi|386002906|ref|YP_005921205.1| RNA-metabolising metallo-beta-lactamase [Methanosaeta harundinacea
6Ac]
gi|357210962|gb|AET65582.1| RNA-metabolising metallo-beta-lactamase [Methanosaeta harundinacea
6Ac]
Length = 635
Score = 118 bits (296), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 84/300 (28%), Positives = 145/300 (48%), Gaps = 26/300 (8%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYA--TTIRDSKRCRERDF 59
F V G ++ +TGD+ R A R + L+ E+TY + + S++ E
Sbjct: 338 FHVGDGLYNVAFTGDFKFEKTRLFDPAVHTFPRLETLVMEATYGGSNSTQPSRKEAEGRL 397
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKM 119
LK V + + RGGKV+IP FA+GR+QE+ I LE + +L+ + G+ +AT +
Sbjct: 398 LKVVRDTIKRGGKVIIPAFAVGRSQEVMIALEEAIRKKSLEEISVYLDGMIYEATAIHTT 457
Query: 120 FITWTNQKIRKTFVQRNMFDFKHIRPF------------DKSFIDNPGPMVVFATPGMLH 167
+ + N ++ R++ K I PF +S I + P VV AT GM++
Sbjct: 458 YPEYLNNEL------RDLIFHKGINPFLSECFVQVETSSQRSKIVDGDPCVVLATSGMMN 511
Query: 168 SGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENK-QIIDVKMAIEYMS 226
G L K P E N L++ G+ +GT+G ++ G ++ + + VK+ +E ++
Sbjct: 512 GGPILEYLKALGPDERNTLVIVGYQAEGTLGRRIQKGWNEIPLSAEGKTQTVKINLEVVT 571
Query: 227 ---FSAHADAKGIMQLIQ--YCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGES 281
FS H+D +M ++ Y +P V+ HG+ S + L I ++ + P N E+
Sbjct: 572 VDGFSGHSDRNQLMDYVRRVYPKPSRVITNHGDESNCLDLASSIYKKHRIPTMAPMNLET 631
>gi|312127068|ref|YP_003991942.1| RNA-metabolising metallo-beta-lactamase [Caldicellulosiruptor
hydrothermalis 108]
gi|311777087|gb|ADQ06573.1| RNA-metabolising metallo-beta-lactamase [Caldicellulosiruptor
hydrothermalis 108]
Length = 822
Score = 118 bits (296), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 76/292 (26%), Positives = 140/292 (47%), Gaps = 14/292 (4%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
M ++ SI+YTGD++ + A + K RPD++I ESTY + ++ E
Sbjct: 152 MIFIQTQEGSILYTGDFSADKQLTVDKASVPKIRPDIVICESTYGDRLHTNRSFEEERLF 211
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
V E + +GGKVLIP FA+GRAQE+ ++L Y ++ + + F G+ + Y+
Sbjct: 212 NTVAEVISQGGKVLIPAFAIGRAQEIILILRNYMKKRKVSFNV-FIDGMVREVIRVYRNN 270
Query: 121 ITWTNQKIRKTFVQ-RNMFDFKHI-----RPFDKSFIDNPGPMVVFATPGMLHSGLSLII 174
T+ + + K ++ +F +I + + I + P V+ ++ GML G S+
Sbjct: 271 PTYLSSRYYKRVLKGEEIFLADNINVVSDKKQREEIISSSDPCVIISSSGMLTGGPSVFY 330
Query: 175 FKKWAPVENNMLIMPGFCVQGTIGHKVL------SGVKKLEFENKQIIDVKMAIEYMSFS 228
+K +N ++ + G+ + G K+L +K++ K+ +VK +E S
Sbjct: 331 AEKIVQSQNALIAITGYQDEEAPGRKLLELAELPENERKIDLNGKE-YEVKCRVEKYGLS 389
Query: 229 AHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGE 280
AH+D I+ + P+ V+ HG + + + +E + +P NGE
Sbjct: 390 AHSDRDRILGFLATLRPRTVIFAHGSEDAISQISDMAVKELESNIIVPQNGE 441
>gi|70952759|ref|XP_745526.1| cleavage and polyadenylation specificity factor protein [Plasmodium
chabaudi chabaudi]
gi|56525876|emb|CAH78255.1| cleavage and polyadenylation specificity factor protein, putative
[Plasmodium chabaudi chabaudi]
Length = 327
Score = 118 bits (296), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 51/94 (54%), Positives = 71/94 (75%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
M++++V N S++YTGDYN PD+HLG+ I P++ I+ESTYA+ +R +++ E +
Sbjct: 234 MYRLEVNNVSVIYTGDYNTIPDKHLGSTKIPVLTPEIFISESTYASYVRPTRKSSELELC 293
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW 94
V+ECV +GGKVLIPVFA+GRAQEL ILLE YW
Sbjct: 294 NLVNECVHKGGKVLIPVFAIGRAQELSILLEEYW 327
>gi|119872614|ref|YP_930621.1| beta-lactamase domain-containing protein [Pyrobaculum islandicum
DSM 4184]
gi|119674022|gb|ABL88278.1| beta-lactamase domain protein [Pyrobaculum islandicum DSM 4184]
Length = 640
Score = 118 bits (296), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 87/313 (27%), Positives = 147/313 (46%), Gaps = 41/313 (13%)
Query: 1 MFQVKVGNQ--SIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATT--IRDSKRCRE 56
M + +GN +I+YTGD+ R L A R ++LI ESTY ++ + E
Sbjct: 331 MAHLHIGNGRYNILYTGDFKYGKTRLLNRAVSKFKRVEMLIMESTYGGRDDVQPPRVEAE 390
Query: 57 RDFLKKVHECVDRGGKVLIPVFALGRAQELCILLETYWE-RMNLQAPIYFAVGLTEKATN 115
K V E V RGGKVLIP F+ GR QE+ +L E + + P+Y G+ + N
Sbjct: 391 NALAKHVAEAVSRGGKVLIPAFSTGRGQEILYILNKMIEGGLIPRVPVYVD-GMIVETLN 449
Query: 116 YYKMFITWTNQKIRKTFVQRNMFDFKHIRPFDKS----FIDNPG---------------- 155
Y M+ + N ++ + + + PF S +D
Sbjct: 450 AYLMYPHYLNPEVAEEI-------YGGVNPFTTSGSVVIVDRAKRVEDRINQVAKIAQSE 502
Query: 156 -PMVVFATPGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFE--- 211
P V+ A GML+ G + F + A N L+ + +GT+G ++L+G ++
Sbjct: 503 EPAVIIAPHGMLNGGPVVDYFSQLAHDPRNKLVFVSYQAEGTLGRRILNGEREFTIRSLV 562
Query: 212 -NKQIIDVKM-AIEYMSFSAHADAKGIMQLIQYCE--PKNVLLVHGEASKMVFLKEKIKQ 267
+ ++V+M + FS H+D + +M+ +++ E PK ++LVHGE SK++ L I+
Sbjct: 563 GGESKVEVRMEVVSIPGFSGHSDRRELMKYVEHIEPKPKKIVLVHGEPSKIISLATSIEL 622
Query: 268 EFNLDCFMPANGE 280
++ + +P GE
Sbjct: 623 KYKITTIIPKVGE 635
>gi|384483536|gb|EIE75716.1| hypothetical protein RO3G_00420 [Rhizopus delemar RA 99-880]
Length = 321
Score = 118 bits (296), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 79/235 (33%), Positives = 122/235 (51%), Gaps = 26/235 (11%)
Query: 803 QITHFVTPDSDISSILSQAYPQDSLHNVCIMGLHTCGDLSGTALRLFTKSS-LQCLVQVG 861
+ F+TP+ + S+ILS+ + + + GLHTCGDL+ LRLF S + CLV VG
Sbjct: 73 HVNEFITPE-NASTILSR-WSKTQNEKWLLCGLHTCGDLASMILRLFASSDDMTCLVNVG 130
Query: 862 CCYHLLEEEFIRSPFWKDVDQSLYEHGYGFPLSEHLRSRKFFLGRNVRMSGTQSPERVID 921
CCYH L E+ + GFP+S+ ++ +G RM +Q PER ++
Sbjct: 131 CCYHFLSEQ---------------KPHCGFPMSDVVKKTNLVIGHTSRMLSSQVPERWVE 175
Query: 922 LKQTQTLPL---FYRALLEKYLRSKITINDEEPKVVGRLATK--CSNFVEYVHRAVDKLK 976
+ + L F+RALL+ + K V+GRL K ++F YV A+ +L
Sbjct: 176 KSEDTLVALEHHFFRALLQFIMVKKGLAEPGAAPVIGRLNKKKDFTSFSVYVQAALSRLG 235
Query: 977 LDL-EVDDEEVTRLFNSHQREYEYLQIYYF--LKTALAPVIEALIVLDRVLYLRE 1028
L + DEE +N ++ + QI Y L+ LAPV+E++I++DR LYL +
Sbjct: 236 LPAGTITDEEANDYYNMYKAKQIDKQIAYVWSLRAILAPVLESIILIDRWLYLND 290
Score = 41.6 bits (96), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 21/52 (40%), Positives = 31/52 (59%)
Query: 677 MSQVVAAVTNSCDSSHIIDLGGGQGYLSTILALQHGKKTLSLDYNQVNTHGA 728
M+ V+ + + I+DLG GQGYLS LA + K L++D ++V T GA
Sbjct: 1 MTPVIKQLASDHQVESIMDLGAGQGYLSRSLAYRSNLKVLAVDSSEVQTCGA 52
>gi|254168266|ref|ZP_04875112.1| RNA-metabolising metallo-beta-lactamase family [Aciduliprofundum
boonei T469]
gi|254168398|ref|ZP_04875243.1| RNA-metabolising metallo-beta-lactamase family [Aciduliprofundum
boonei T469]
gi|289595800|ref|YP_003482496.1| KH-domain/beta-lactamase-domain protein [Aciduliprofundum boonei
T469]
gi|197622679|gb|EDY35249.1| RNA-metabolising metallo-beta-lactamase family [Aciduliprofundum
boonei T469]
gi|197622775|gb|EDY35344.1| RNA-metabolising metallo-beta-lactamase family [Aciduliprofundum
boonei T469]
gi|289533587|gb|ADD07934.1| KH-domain/beta-lactamase-domain protein [Aciduliprofundum boonei
T469]
Length = 647
Score = 118 bits (295), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 88/297 (29%), Positives = 152/297 (51%), Gaps = 21/297 (7%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFLK 61
F + G +IV TGD AA R + +I ESTY R+ + R+ +
Sbjct: 349 FHIGEGLYNIVITGDIKYERSWLFNAAHNRFPRVETVIMESTYGG--REDFQPSRREAAE 406
Query: 62 KVHECV----DRGGKVLIPVFALGRAQELCILLETYWERMNL-QAPIYFAVGLTEKATNY 116
++ + V D+GGK+LIPVFA+GR+QE+ ++LE++ L + PIY G+ +AT
Sbjct: 407 RLKDVVMRTYDKGGKILIPVFAVGRSQEVMLVLESFMRNGELPEMPIYLD-GMIWEATTI 465
Query: 117 YKMFITWTNQKIRKTFVQR--NMF---DFKHIRPFDK--SFIDNPGPMVVFATPGMLHSG 169
+ + + N+ +R+ Q+ N F F + ++ I + P++V +T GM++ G
Sbjct: 466 HAAYPEYLNKDLRELIFQKKENPFLSPIFHRVESVERREEVISSSDPLIVLSTSGMMNGG 525
Query: 170 LSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKK--LEFENKQI-IDVKMAIEYM- 225
L FK WA N L+ G+ GT+G ++ +G+K+ + K + V+M IE +
Sbjct: 526 PVLEYFKHWADDPRNTLVFVGYQAVGTLGRRIQNGLKEVLMSLGGKPFTVKVEMNIETID 585
Query: 226 SFSAHADAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGE 280
FS H+D + ++Q I PK V++ HGE +K + L + + + ++ N E
Sbjct: 586 GFSGHSDRRQLVQYISSMNPKPERVIICHGEENKCIDLAIGLHKRYGMETMALKNLE 642
>gi|359415409|ref|ZP_09207872.1| mRNA cleavage/polyadenylation specificity factor, 100 kD subunit
[Candidatus Haloredivivus sp. G17]
gi|358034266|gb|EHK02708.1| mRNA cleavage/polyadenylation specificity factor, 100 kD subunit
[Candidatus Haloredivivus sp. G17]
Length = 623
Score = 117 bits (294), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 86/290 (29%), Positives = 140/290 (48%), Gaps = 14/290 (4%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRE---RD 58
V G +I+YTGDYN L A + R + +ITESTY D + RE +
Sbjct: 332 IHVGEGLHNILYTGDYNYDTTETLREADTNFQRVETMITESTYGGR-DDEQTPREEANKK 390
Query: 59 FLKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYK 118
FL KV + + +GGKV++PVFA+GR+QE+ LL +R +Y G+ A +
Sbjct: 391 FLSKVKQTLSKGGKVIVPVFAVGRSQEVLALLADEMDRSYFDYNVYID-GMINDANALHT 449
Query: 119 MFITWTNQKIR-KTFVQRNMFDFKHIRPF----DKSFIDNPGPMVVFATPGMLHSGLSLI 173
+ + + KI+ K + + F +I+P ++ + ++ T G + G +
Sbjct: 450 AYPEFLSNKIQDKVYQDESPFLKDNIKPIGSHSERKEVFEDDACIILTTSGSITGGPVMS 509
Query: 174 IFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMS-FSAHAD 232
+K A +N LI G+ +G++G K+ G +++E K ++V + +S FSAH+D
Sbjct: 510 YLEKEASDPDNSLIFVGYQFEGSLGRKIQEGAEEVEINGKN-VEVNLDTSSVSGFSAHSD 568
Query: 233 AKGIMQLIQYCE--PKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGE 280
I+ + P VL HGE SK L + + N+D P N E
Sbjct: 569 RDQIINFAKNLRSTPNRVLTNHGERSKCFSLASSLHKALNIDTSAPQNLE 618
>gi|390939088|ref|YP_006402826.1| KH-domain/beta-lactamase-domain-containing protein [Desulfurococcus
fermentans DSM 16532]
gi|390192195|gb|AFL67251.1| KH-domain/beta-lactamase-domain protein [Desulfurococcus fermentans
DSM 16532]
Length = 647
Score = 117 bits (294), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 82/289 (28%), Positives = 144/289 (49%), Gaps = 27/289 (9%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFLK 61
+ +G +IVYTGD+ + R L A R + LI E TY T+++S+ E+ +
Sbjct: 347 LHIGMGLHNIVYTGDFKYSETRLLNKANTVFPRVETLIMEGTYGATLQESRIQAEQQLID 406
Query: 62 KVHECVDRGGKVLIPVFALGRAQELCILLETYWE-----RMNLQAPIYFAVGLTEKATNY 116
V +R G VLIPVFA+GR QE+ ++L + RMN+ + GL + T
Sbjct: 407 IVKRTSERNGVVLIPVFAVGRGQEIILILNEAMKNGRIPRMNI-----YVEGLVNEVTAI 461
Query: 117 YKMFITWTNQKIRKT-FVQRNMFDFKHIRPFDKSF----IDNPGPMVVFATPGMLHSGLS 171
+ + + N+ +R+ + N F + ++P + I P ++ AT GML G +
Sbjct: 462 HTQYPEYLNKSLREAIYNGENPFTSEWLKPIETGVARPDIIEDKPSIIIATSGMLTGGPA 521
Query: 172 LIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEF--ENK-QIIDVKMAIEYM-SF 227
+ + A E N L+ G+ +GT+G K+ G +++ ENK + I +K+ + + F
Sbjct: 522 VDYLRLLASDERNSLVFVGYQAEGTLGRKIKDGAREITMVVENKVEAIPIKLEVYSIEGF 581
Query: 228 SAHADAKGIMQLIQYC-----EPKNVLLVHGEASKMVFLKEKIKQEFNL 271
S H+D + +L++Y P+ +LL HGE + + L ++ + L
Sbjct: 582 SGHSDQR---ELLEYARDVSPRPRKILLNHGEPTALSTLATLLRSDKEL 627
>gi|218884598|ref|YP_002428980.1| mRNA 3'-end polyadenylation factor [Desulfurococcus kamchatkensis
1221n]
gi|218766214|gb|ACL11613.1| mRNA 3'-end polyadenylation factor [Desulfurococcus kamchatkensis
1221n]
Length = 647
Score = 117 bits (294), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 82/289 (28%), Positives = 144/289 (49%), Gaps = 27/289 (9%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFLK 61
+ +G +IVYTGD+ + R L A R + LI E TY T+++S+ E+ +
Sbjct: 347 LHIGMGLHNIVYTGDFKYSETRLLNKANTVFPRVETLIMEGTYGATLQESRIQAEQQLID 406
Query: 62 KVHECVDRGGKVLIPVFALGRAQELCILLETYWE-----RMNLQAPIYFAVGLTEKATNY 116
V +R G VLIPVFA+GR QE+ ++L + RMN+ + GL + T
Sbjct: 407 IVKRTSERNGVVLIPVFAVGRGQEIILILNEAMKNGRIPRMNI-----YVEGLVNEVTAI 461
Query: 117 YKMFITWTNQKIRKT-FVQRNMFDFKHIRPFDKSF----IDNPGPMVVFATPGMLHSGLS 171
+ + + N+ +R+ + N F + ++P + I P ++ AT GML G +
Sbjct: 462 HTQYPEYLNKSLREAIYNGENPFTSEWLKPIETGVARPDIIEDKPSIIIATSGMLTGGPA 521
Query: 172 LIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEF--ENK-QIIDVKMAIEYM-SF 227
+ + A E N L+ G+ +GT+G K+ G +++ ENK + I +K+ + + F
Sbjct: 522 VDYLRLLASDERNSLVFVGYQAEGTLGRKIKDGAREITMVVENKVEAIPIKLEVYSIEGF 581
Query: 228 SAHADAKGIMQLIQYC-----EPKNVLLVHGEASKMVFLKEKIKQEFNL 271
S H+D + +L++Y P+ +LL HGE + + L ++ + L
Sbjct: 582 SGHSDQR---ELLEYARDVSPRPRKILLNHGEPTALSTLATLLRSDKEL 627
>gi|73667742|ref|YP_303757.1| cleavage and polyadenylation specificity factor [Methanosarcina
barkeri str. Fusaro]
gi|72394904|gb|AAZ69177.1| cleavage and polyadenylation specificity factor [Methanosarcina
barkeri str. Fusaro]
Length = 637
Score = 117 bits (294), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 84/297 (28%), Positives = 145/297 (48%), Gaps = 18/297 (6%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATT--IRDSKRCRERDF 59
F + G ++V+TGDY R A R + +++E+TY + + S + ER
Sbjct: 338 FHIGDGLHNVVFTGDYKYEKTRLFDPAVNKFPRVETVVSEATYGNSNAFQPSLKDAERHL 397
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWER-MNLQAPIYFAVGLTEKATNYYK 118
V V+RGG +IP FA+GR+QE+ I+LE + + + P+Y G+ +AT +
Sbjct: 398 QMVVKNTVERGGICIIPAFAVGRSQEVMIVLEESIRKGLIPEVPVYLD-GMIWEATAIHA 456
Query: 119 MFITWTNQKIRKTFVQR--NMFDFKHIRPFD-----KSFIDNPGPMVVFATPGMLHSGLS 171
+ N +RK Q+ N F + +P D + I NP P V+ +T GM++ G
Sbjct: 457 THPEYLNNDLRKLIFQKGQNPFLSECFKPVDSHDMRQKIIQNPHPCVIISTSGMMNGGPV 516
Query: 172 LIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENK----QIIDVKMAIEYM-S 226
+ FK +A N L+ G+ GTIG ++ G K++ K +I+ + M ++ +
Sbjct: 517 MEYFKAFADEPRNSLVFVGYQADGTIGRRIQKGWKEIPMTGKGGSTEILKLNMEVQVVDG 576
Query: 227 FSAHADAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGES 281
FS H+D + +M I+ +P+ V HG+ V L I ++ ++ N E+
Sbjct: 577 FSGHSDRRQLMDYIKRMQPRPERVFTEHGDEKACVDLASSIYKKLKIETRALTNLET 633
>gi|410721908|ref|ZP_11361230.1| KH-domain/beta-lactamase-domain protein [Methanobacterium sp.
Maddingley MBC34]
gi|410598026|gb|EKQ52618.1| KH-domain/beta-lactamase-domain protein [Methanobacterium sp.
Maddingley MBC34]
Length = 634
Score = 117 bits (294), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 84/294 (28%), Positives = 151/294 (51%), Gaps = 16/294 (5%)
Query: 7 GNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYAT--TIRDSKRCRERDFLKKVH 64
G + VYTGD+ R L A R + L+ ESTY ++ ++ E++ +K ++
Sbjct: 342 GQHNFVYTGDFKYERSRLLEPAVSKFPRIESLVMESTYGGHEDVQPTRNDAEKELIKTIY 401
Query: 65 ECVDRGGKVLIPVFALGRAQELCILLETYWERMNL-QAPIYFAVGLTEKATNYYKMFITW 123
++R GK+LIPVFA+GRAQEL I+L+ Y + + PIY G+ +AT + +
Sbjct: 402 HTLERKGKILIPVFAVGRAQELMIVLDEYIRHGIIDEVPIYID-GMIWEATAIHTARPEY 460
Query: 124 TNQKIRKTF--VQRNMFD---FKHIRPF-DKSFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ +R + RN F F + ++ I P ++ +T GML G S+ FK
Sbjct: 461 LSKDLRDQIFHMGRNPFISEVFHKVNGVEERKDIVEGEPAIILSTSGMLTGGNSVEYFKW 520
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENK---QIIDVKMAIEYM-SFSAHADA 233
E + L+ G+ +G++G ++ G K++ + + + +VKM I+ + FS H+D
Sbjct: 521 LCEDERSSLVFVGYQAEGSLGRRLQKGWKEIPLKEEGKTNVYNVKMNIKTIEGFSGHSDR 580
Query: 234 KGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQ 285
+ +M ++ PK +L+ HG+ K + L I + + ++ P N E+ +Q
Sbjct: 581 RQLMDYVRRISPKPEKILICHGDNYKTLDLASSIYRSYKIETKTPMNLETVRIQ 634
>gi|432328821|ref|YP_007246965.1| universal archaeal KH-domain/beta-lactamase-domain protein
[Aciduliprofundum sp. MAR08-339]
gi|432135530|gb|AGB04799.1| universal archaeal KH-domain/beta-lactamase-domain protein
[Aciduliprofundum sp. MAR08-339]
Length = 647
Score = 117 bits (293), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 89/297 (29%), Positives = 150/297 (50%), Gaps = 21/297 (7%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFLK 61
F + G ++V TGD AA R + +I ESTY R+ + R+ +
Sbjct: 349 FHIGNGLYNVVITGDIKFERSWLFNAAHNRFPRVETVIMESTYGG--REDYQPSRREAAE 406
Query: 62 KVHECV----DRGGKVLIPVFALGRAQELCILLETYWERMNL-QAPIYFAVGLTEKATNY 116
++ + V +RGGK+LIPVFA+GR+QE+ ++LE++ L + PIY G+ +AT
Sbjct: 407 RLKDIVSRTFERGGKILIPVFAVGRSQEVMLVLESFMRNGELPEIPIYLD-GMIWEATTI 465
Query: 117 YKMFITWTNQKIRKTFVQR--NMF---DFKHIRPFDK--SFIDNPGPMVVFATPGMLHSG 169
+ + + N+ +R+ Q+ N F F + ++ I + P +V +T GM++ G
Sbjct: 466 HAAYPEYLNKDLRELIFQKKENPFLSPIFHRVESSERREEVISSSDPAIVLSTSGMMNGG 525
Query: 170 LSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKK--LEFENKQI-IDVKMAIEYM- 225
L FK WA N LI G+ GT+G ++ +G+ + + K I I V+M IE +
Sbjct: 526 PVLEYFKHWADDPRNTLIFVGYQAVGTLGRRIQNGLSEVLMSLGGKPITIKVEMNIETID 585
Query: 226 SFSAHADAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGE 280
FS H+D + ++Q I PK ++ HGE SK + L + + + ++ N E
Sbjct: 586 GFSGHSDRRQLVQFIGSMNPKPERIITCHGEESKCIDLAIGLHKRYGMETMALKNLE 642
>gi|255513745|gb|EET90010.1| beta-lactamase domain protein [Candidatus Micrarchaeum acidiphilum
ARMAN-2]
Length = 655
Score = 117 bits (293), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 88/300 (29%), Positives = 152/300 (50%), Gaps = 26/300 (8%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCR---ERD 58
V G +IV+TGD R L A + R D L ESTY RD + R ER+
Sbjct: 358 LHVGEGMYNIVHTGDMKYGFTRLLDPASVKYPRVDALFIESTYGGP-RDISQNRHDTERE 416
Query: 59 FLKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN---LQAPIYFAVGLTEKATN 115
++ + ++R GKVLIP+FA+GR+QEL ++LE Y L P+Y G+ +A+
Sbjct: 417 LMELIKRTINRNGKVLIPLFAVGRSQELQLVLENYMTNNGPYKLDVPVYLD-GMILEASA 475
Query: 116 YYKMFITWTNQKIRKTFV-QRNMFD---FKHIRPFDKSFIDNPGPMVVFATPGMLHSGLS 171
+ + + + +R + R+ F+ F+ I+ ++S + + GP ++ A+ GML+ G S
Sbjct: 476 IHTAYPEYLKENVRNRILNNRSPFESEIFEVIK-GERSDVFDKGPAIILASGGMLNGGAS 534
Query: 172 LIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQ----IIDVKMAIEYM-S 226
+ FK +N ++ G+ ++G ++ +G+K++ N+ ++VKM + +
Sbjct: 535 VEYFKGLLANPDNAMVFVGYNSANSLGRRIQNGLKEVALPNEDGKITPMEVKMNVATVEG 594
Query: 227 FSAHADAKGIMQLIQYCE-----PKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGES 281
FS H+D QLI + E PK++ +HGE K L + + + D P N +S
Sbjct: 595 FSGHSDRH---QLIDFVEKLNVKPKSIFTMHGEEQKCEDLARTLGKIVHADSRAPMNLDS 651
>gi|70606450|ref|YP_255320.1| metallo-beta-lactamase superfamily protein [Sulfolobus
acidocaldarius DSM 639]
gi|449066664|ref|YP_007433746.1| metallo-beta-lactamase superfamily protein [Sulfolobus
acidocaldarius N8]
gi|449068938|ref|YP_007436019.1| metallo-beta-lactamase superfamily protein [Sulfolobus
acidocaldarius Ron12/I]
gi|68567098|gb|AAY80027.1| hypothetical metallo-beta-lactamase superfamily protein [Sulfolobus
acidocaldarius DSM 639]
gi|449035172|gb|AGE70598.1| metallo-beta-lactamase superfamily protein [Sulfolobus
acidocaldarius N8]
gi|449037446|gb|AGE72871.1| metallo-beta-lactamase superfamily protein [Sulfolobus
acidocaldarius Ron12/I]
Length = 631
Score = 117 bits (292), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 76/269 (28%), Positives = 141/269 (52%), Gaps = 15/269 (5%)
Query: 7 GNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFLKKVHEC 66
G ++VYTGD+ + L A + R D +I E+TY ++++ E + +++
Sbjct: 339 GLHNVVYTGDFKYARTKLLDKANDEFIRVDTMIMETTYGAHDQENREESEAKLIDIINKT 398
Query: 67 VDRGGKVLIPVFALGRAQELCILLETYWERMNL-QAPIYFAVGLTEKATNYYKMFITWTN 125
+ +GGKVLIPV ++GR QE+ +++ + + + P+Y G+ ++ T + + W +
Sbjct: 399 ISKGGKVLIPVLSVGRGQEIMLVINDAMKNKKIPEVPVYIT-GMVDEITAIHTAYPEWLS 457
Query: 126 QKIRKTFVQR--NMFD---FKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
+++R+ + R N F FK I + K I P ++ AT GML+ G ++ FK A
Sbjct: 458 RELREAILYRDENPFMSEFFKRIEGY-KEDIAQGEPSIIIATSGMLNGGPAVEFFKNMAH 516
Query: 181 VENNMLIMPGFCVQGTIGHKVLSGVKKLEFENK----QIIDVKMAIEYM-SFSAHADAKG 235
N ++ + +GT+G KV G K ++ ++ + I V M I + FS H+D +
Sbjct: 517 DTRNSIVFVSYQAEGTLGRKVRDGAKDIQILDRDGRVENIKVNMEIGVVEGFSGHSDRRQ 576
Query: 236 IMQLIQ--YCEPKNVLLVHGEASKMVFLK 262
++ ++ +PKN++L HGE S + K
Sbjct: 577 LLAFLRNLNVKPKNLVLNHGEPSAISSFK 605
>gi|257388099|ref|YP_003177872.1| beta-lactamase [Halomicrobium mukohataei DSM 12286]
gi|257170406|gb|ACV48165.1| beta-lactamase domain protein [Halomicrobium mukohataei DSM 12286]
Length = 640
Score = 117 bits (292), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 79/298 (26%), Positives = 145/298 (48%), Gaps = 19/298 (6%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYA--TTIRDSKRCRERDF 59
F + G+ ++ ++GD + T R L A D R + L+ ESTY + + ER
Sbjct: 340 FHIGDGHYNVAFSGDIHYTDTRLLDGAVNDFPRVETLVLESTYGGKNDYQTDQADSERVL 399
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQA-PIYFAVGLTEKATNYYK 118
K ++E DRGGK+LIP FA+GR+QEL ++LE ++ P+Y G+ +AT +
Sbjct: 400 KKAINEAHDRGGKILIPAFAVGRSQELMLVLEEAMRSGDIPTVPVYLD-GMIREATAIHT 458
Query: 119 MFITWTNQKIRKTFV--QRNMFDFKHIRPFD-----KSFIDNPGPMVVFATPGMLHSGLS 171
+ + +R+ + N F + + D + I + P ++ T GM+ G
Sbjct: 459 AYPEYLRDDLRQRILYEDENPFLAEQFQQVDGGQEMRQDIADDEPCIILTTSGMVTGGPV 518
Query: 172 LIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFEN-----KQIIDVKMAIEYM- 225
+ + +N ++ G+ +GT+G ++ G ++ + + + +KM IE +
Sbjct: 519 MSWLRLLGSDPDNTMMFVGYQAEGTLGRQIQRGQDEITLSDGSGPRAERVSLKMEIETVD 578
Query: 226 SFSAHADAKGIMQLIQ--YCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGES 281
FS HAD +G+ ++ + P+ V+ VHG+ L + Q+FN+ + P N E+
Sbjct: 579 GFSGHADRQGLETFVETMHPRPEKVICVHGDEQSTNQLSSALYQKFNMRTYNPKNLET 636
>gi|218187232|gb|EEC69659.1| hypothetical protein OsI_39073 [Oryza sativa Indica Group]
Length = 665
Score = 116 bits (291), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 123/461 (26%), Positives = 206/461 (44%), Gaps = 71/461 (15%)
Query: 662 VSQLMSEKKSYEVQVMSQVVAAVTNSCDSSHIIDLGGGQGYLSTILALQHGKKTLSLDYN 721
++Q M+ KK +E++ ++ +V A+T SC + +ID+G GQGYL+ L+ ++ +++D
Sbjct: 120 LAQGMNSKKKHEIETLAGLVHAITKSCGAKTVIDVGSGQGYLAQALSFEYQLPVVAID-- 177
Query: 722 QVNTHGAAVRSKKLEDSEKESKGPFQSYAGVINKKLWMRTQNRQTSHASVEHHGKNWKRK 781
++H A+V + + E +K YA K +R T H
Sbjct: 178 -ASSHHASVTNTRAERIKK-------YYAAKCVGKQQLRVPRTVTCH------------- 216
Query: 782 SKAPVVSIKDEELVVCKDKCKQITHFVTPDSDISSILSQAY--PQDSLHNVCIMGLHTCG 839
V D V + C+ P+S + S P DS+ + + GLH CG
Sbjct: 217 -----VLSSDTLAAVTLEACQDDHAEHVPESKNFNESSPQIEKPNDSIPPLVLAGLHACG 271
Query: 840 DLSGTALRLFTKSS-LQCLVQVGCCYHLLEEEFIRSPFWKDVDQSLYEHGYGFPLSEHLR 898
DLS LRLF ++ L+ +GCCY+LL EE +D + GFP+S+ +
Sbjct: 272 DLSVNMLRLFVSCEQVKALISIGCCYNLLSEE-----CHEDTNTC-----PGFPMSKAAK 321
Query: 899 SRKFFLGRNVRMSGTQSPERVIDLKQTQTLPLF----YRA----LLEKYL----RSKITI 946
LG++ R G QS ER L + L F +RA +LEK+ RS +I
Sbjct: 322 LSNLVLGKSTRDLGCQSAERWRSLTKDIALQNFDIHAFRAAFQMVLEKHFPEVSRSNPSI 381
Query: 947 NDEEPKVVGRLATKCSNFVEYVHRAVDKLKLD------LEVDDEEVTRLFNSHQREYEYL 1000
+ K + R + D L + DD T N + ++L
Sbjct: 382 G-RQGKALRRQRLRKVMESPMAMAETDALSYSTQKEQIMTKDDPLPTGPNNFKEVHVDFL 440
Query: 1001 Q--IYYFLKTALAPVIEALIVLDRVLYLREQQKPQIISNLFLSLLTQ-----VDDEEVTR 1053
F+ +A V A +V D + YL + QK + + +S L + V++ +
Sbjct: 441 PELSTGFVDSA---VSGAAVVPDDI-YLDKSQKFTLFKDFTVSGLGRLGCDFVENVSLLE 496
Query: 1054 LFDSHQREYEYLQIYYFLKTALAPVIEALIVLDRVLYLREQ 1094
++ Q E++ ++ L+ AL P++E I+LDR+L+L+EQ
Sbjct: 497 IWKDVQPFTEFIGPFWCLRVALGPLVETYILLDRLLFLQEQ 537
Score = 43.1 bits (100), Expect = 0.81, Method: Compositional matrix adjust.
Identities = 40/163 (24%), Positives = 71/163 (43%), Gaps = 27/163 (16%)
Query: 480 ETMELIKNTIDEMIEFITPNIACVNTHTVHFMVEKLWETTIPKAILSQVNNSPSFDSLLE 539
ET E + ++ + F+ + + H V+F ++LWE + + + P +SLL
Sbjct: 16 ETAEQTREWMEAIAAFLRLHRPLLEAHVVNFFKDRLWEM-VDAEWMECLRREP-VESLLM 73
Query: 540 -----EFWKSRQDNHVETNSELVKFFQAADKFRLTSLMNGESIITVDELFSKLQARKCGQ 594
E W S + V T LV + L ++ SI TV
Sbjct: 74 LPSGCEHWPSSLRDFVLTAKSLVLPREQKSPRSLLPDLHVASINTV-------------- 119
Query: 595 VVESAKVSQLMSEKKSYEVQVMSQVVAAVTNSCDSSHIIDLGS 637
++Q M+ KK +E++ ++ +V A+T SC + +ID+GS
Sbjct: 120 ------LAQGMNSKKKHEIETLAGLVHAITKSCGAKTVIDVGS 156
>gi|410672076|ref|YP_006924447.1| beta-lactamase-like protein [Methanolobus psychrophilus R15]
gi|409171204|gb|AFV25079.1| beta-lactamase-like protein [Methanolobus psychrophilus R15]
Length = 636
Score = 116 bits (291), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 84/297 (28%), Positives = 145/297 (48%), Gaps = 17/297 (5%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTY--ATTIRDSKRCRERDF 59
F + G ++V+TGD+ R AA R + ++ ESTY A ++ S E +
Sbjct: 338 FHIGDGLHNVVFTGDFKYEKTRLFDAAVNKFPRVESVVMESTYGSANALQPSLLEAENNL 397
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNL-QAPIYFAVGLTEKATNYYK 118
++ + R G VLIP FA+GR+QE+ I+LE + + P+Y G+ +AT +
Sbjct: 398 KNVINTTLIRNGVVLIPAFAVGRSQEVMIVLEDAIRKGEIPNVPVYLD-GMIWEATAIHA 456
Query: 119 MFITWTNQKIRKTFVQR--NMFDFKHIRPFD-----KSFIDNPGPMVVFATPGMLHSGLS 171
+ + N +RK Q+ N F + +P D + + P P V+ AT GM+++G
Sbjct: 457 TYPEYLNNDLRKLIFQKGENPFLAECFKPVDSNELRQKILQEPEPCVILATSGMMNAGPV 516
Query: 172 LIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENK---QIIDVKMAIEYM-SF 227
+ FKK+A E N L+ G+ GT+G ++ G K++ +I + M +E + F
Sbjct: 517 IEYFKKFADDEKNTLVFVGYQADGTLGRRIQKGWKEIPLSTNNGTHVISMNMQVEVVDGF 576
Query: 228 SAHADAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESC 282
S H+D + +M I+ +P+ V HG+ + L I ++ L+ N E+
Sbjct: 577 SGHSDRRQLMDYIKKMKPRPERVFTEHGDERSCLDLASSIHKKNKLETRALTNLETV 633
>gi|20092670|ref|NP_618745.1| cleavage and polyadenylation specificity factor [Methanosarcina
acetivorans C2A]
gi|19917954|gb|AAM07225.1| cleavage and polyadenylation specificity factor [Methanosarcina
acetivorans C2A]
Length = 637
Score = 116 bits (290), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 81/297 (27%), Positives = 146/297 (49%), Gaps = 18/297 (6%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATT--IRDSKRCRERDF 59
F + G ++V+TGDY R A R + +I+E+TY + + + + E+
Sbjct: 338 FHIGDGLHNVVFTGDYKYEKTRLFDPAVNKFPRVETVISEATYGNSNAFQPALKDAEKHL 397
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWER-MNLQAPIYFAVGLTEKATNYYK 118
V V+RGG LIP FA+GR+QE+ I+LE + + + P+Y G+ +AT +
Sbjct: 398 QMVVKNTVERGGIALIPAFAVGRSQEVMIVLEESIRKGLIPEVPVYLD-GMIWEATAIHA 456
Query: 119 MFITWTNQKIRKTFVQR--NMFDFKHIRPFD-----KSFIDNPGPMVVFATPGMLHSGLS 171
+ N +RK Q+ N F + +P D + I NP P V+ +T GM++ G
Sbjct: 457 THPEYLNNDLRKLIFQKGQNPFLSECFKPVDSHETRQKIIQNPHPCVILSTSGMMNGGPV 516
Query: 172 LIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENK----QIIDVKMAIEYM-S 226
+ F+ +A N L+ G+ GT+G ++ G K++ K +I+ + M ++ +
Sbjct: 517 MDYFRTFAEDPRNTLVFVGYQADGTVGRRIQKGWKEIPMTGKNGSTEILKMNMEVQVVDG 576
Query: 227 FSAHADAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGES 281
FS H+D + +M+ I+ +P+ V HG+ V L + ++ ++ N E+
Sbjct: 577 FSGHSDRRQLMEYIKRMQPRPERVFTEHGDEKACVDLASSVYKKLKIETRALTNLET 633
>gi|194386428|dbj|BAG61024.1| unnamed protein product [Homo sapiens]
Length = 406
Score = 116 bits (290), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 57/154 (37%), Positives = 93/154 (60%), Gaps = 1/154 (0%)
Query: 117 YKMFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIF 175
Y+ ++ N KIRK N F FKHI D+ GP VV A+PGM+ SGLS +F
Sbjct: 4 YQTYVNAMNDKIRKQININNPFVFKHISNLKSMDHFDDIGPSVVMASPGMMQSGLSRELF 63
Query: 176 KKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKG 235
+ W + N +I+ G+CV+GT ++S +++ + Q + +KM+++Y+SFSAH D +
Sbjct: 64 ESWCTDKRNGVIIAGYCVEGTFAKHIMSEPEEITTMSGQKLPLKMSVDYISFSAHTDYQQ 123
Query: 236 IMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF 269
+ I+ +P +V+LVHGE ++M LK + +E+
Sbjct: 124 TSEFIRALKPPHVILVHGEQNEMARLKAALIREY 157
>gi|308198432|pdb|2XR1|A Chain A, Dimeric Archaeal Cleavage And Polyadenylation Specificity
Factor With N-Terminal Kh Domains (Kh-Cpsf) From
Methanosarcina Mazei
gi|308198433|pdb|2XR1|B Chain B, Dimeric Archaeal Cleavage And Polyadenylation Specificity
Factor With N-Terminal Kh Domains (Kh-Cpsf) From
Methanosarcina Mazei
Length = 640
Score = 116 bits (290), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 81/297 (27%), Positives = 147/297 (49%), Gaps = 18/297 (6%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTY--ATTIRDSKRCRERDF 59
F + G ++V+TGDY R A R + +I+E+TY A + + + E+
Sbjct: 341 FHIGDGLHNVVFTGDYKYEKTRLFDPAVNKFPRVETVISEATYGNANAFQPALKDAEKHL 400
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLE-TYWERMNLQAPIYFAVGLTEKATNYYK 118
V ++RGG +IP FA+GR+QE+ I+LE + + + + P+Y G+ +AT +
Sbjct: 401 QMVVKNTIERGGIAVIPAFAVGRSQEVMIVLEESIRKGLIPEVPVYLD-GMIWEATAIHA 459
Query: 119 MFITWTNQKIRKTFVQR--NMFDFKHIRPFD-----KSFIDNPGPMVVFATPGMLHSGLS 171
+ N +RK Q+ N F + +P D + I NP P V+ AT GM++ G
Sbjct: 460 THPEYLNNDLRKLIFQKGQNPFLSECFKPVDSHEARQKIIQNPQPCVILATSGMMNGGPV 519
Query: 172 LIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENK----QIIDVKMAIEYM-S 226
+ FK +A N L+ G+ GTIG ++ G K++ K +++ + M ++ +
Sbjct: 520 MEYFKAFAEDPRNTLVFVGYQADGTIGRRIQKGWKEIPMTGKNGSTEMLKMNMEVQVVDG 579
Query: 227 FSAHADAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGES 281
FS H+D + +M+ ++ +P+ V HG+ V L + ++ ++ N E+
Sbjct: 580 FSGHSDRRQLMEYVKRMQPRPERVFTEHGDEKACVDLASSVYKKLKIETRALTNLET 636
>gi|327311459|ref|YP_004338356.1| beta-lactamase domain-containing protein [Thermoproteus uzoniensis
768-20]
gi|326947938|gb|AEA13044.1| beta-lactamase domain protein [Thermoproteus uzoniensis 768-20]
Length = 635
Score = 116 bits (290), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 85/305 (27%), Positives = 140/305 (45%), Gaps = 39/305 (12%)
Query: 7 GNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATT--IRDSKRCRERDFLKKVH 64
G +I+YTGD+ L A R ++LI ESTY ++ + E +K +
Sbjct: 334 GRYNILYTGDFKFGRTNLLNKAVNKFKRVEMLIMESTYGGRDDVQPPRVEAENTLVKAIT 393
Query: 65 ECVDRGGKVLIPVFALGRAQELCILLETYWERMNL-QAPIYFAVGLTEKATNYYKMFITW 123
E ++ GKVLIP F+ GRAQE+ +L R NL + PIY G+ + N Y M+ +
Sbjct: 394 ETIENRGKVLIPAFSTGRAQEILYILNREINRGNLKRVPIYVD-GMIVETLNAYLMYPHF 452
Query: 124 TNQKIRKTFVQRNMFDFKHIRPFDKS---------------------FIDNPGPMVVFAT 162
N+++ + + + PF S + P V+ A
Sbjct: 453 LNREVAEEI-------YGGVNPFTSSGSIEVVERAKRIEDRINQIARITQDERPGVIIAP 505
Query: 163 PGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMA- 221
GML+ G L F AP N LI + + T+G ++L+G K + + + K++
Sbjct: 506 HGMLNGGPILDYFVHLAPDPANKLIFVSYQAENTLGRRILNGEKDFIVRSISMGETKVSM 565
Query: 222 ----IEYMSFSAHADAKGIMQLIQYCE--PKNVLLVHGEASKMVFLKEKIKQEFNLDCFM 275
+ FS H+D + +M+ ++ E P+ V+L+HGE SK+V L I+ + + +
Sbjct: 566 RMGVVSIPGFSGHSDRRELMKYVETVEPRPRKVVLIHGEPSKIVSLATSIELRYKITTVI 625
Query: 276 PANGE 280
P GE
Sbjct: 626 PRVGE 630
>gi|21226797|ref|NP_632719.1| cleavage and polyadenylation specificity factor 100 kD subunit
[Methanosarcina mazei Go1]
gi|452209278|ref|YP_007489392.1| cleavage and polyadenylation specificity factor [Methanosarcina
mazei Tuc01]
gi|20905093|gb|AAM30391.1| cleavage and polyadenylation specificity factor 100 kD subunit
[Methanosarcina mazei Go1]
gi|452099180|gb|AGF96120.1| cleavage and polyadenylation specificity factor [Methanosarcina
mazei Tuc01]
Length = 637
Score = 116 bits (290), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 81/297 (27%), Positives = 146/297 (49%), Gaps = 18/297 (6%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTY--ATTIRDSKRCRERDF 59
F + G ++V+TGDY R A R + +I+E+TY A + + + E+
Sbjct: 338 FHIGDGLHNVVFTGDYKYEKTRLFDPAVNKFPRVETVISEATYGNANAFQPALKDAEKHL 397
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWER-MNLQAPIYFAVGLTEKATNYYK 118
V ++RGG +IP FA+GR+QE+ I+LE + + + P+Y G+ +AT +
Sbjct: 398 QMVVKNTIERGGIAVIPAFAVGRSQEVMIVLEESIRKGLIPEVPVYLD-GMIWEATAIHA 456
Query: 119 MFITWTNQKIRKTFVQR--NMFDFKHIRPFD-----KSFIDNPGPMVVFATPGMLHSGLS 171
+ N +RK Q+ N F + +P D + I NP P V+ AT GM++ G
Sbjct: 457 THPEYLNNDLRKLIFQKGQNPFLSECFKPVDSHEARQKIIQNPQPCVILATSGMMNGGPV 516
Query: 172 LIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENK----QIIDVKMAIEYM-S 226
+ FK +A N L+ G+ GTIG ++ G K++ K +++ + M ++ +
Sbjct: 517 MEYFKAFAEDPRNTLVFVGYQADGTIGRRIQKGWKEIPMTGKNGSTEMLKMNMEVQVVDG 576
Query: 227 FSAHADAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGES 281
FS H+D + +M+ ++ +P+ V HG+ V L + ++ ++ N E+
Sbjct: 577 FSGHSDRRQLMEYVKRMQPRPERVFTEHGDEKACVDLASSVYKKLKIETRALTNLET 633
>gi|312621851|ref|YP_004023464.1| RNA-metabolising metallo-beta-lactamase [Caldicellulosiruptor
kronotskyensis 2002]
gi|312202318|gb|ADQ45645.1| RNA-metabolising metallo-beta-lactamase [Caldicellulosiruptor
kronotskyensis 2002]
Length = 821
Score = 116 bits (290), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 75/292 (25%), Positives = 139/292 (47%), Gaps = 14/292 (4%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
M ++ SI+YTGD++ + A + K RPD++I ESTY + ++ E
Sbjct: 152 MIFIQTQEGSILYTGDFSADRQLTVDKASVPKIRPDVVICESTYGDRLHTNRNFEEERLF 211
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
V E + +GGKVLIP FA+GRAQE+ ++L + ++ + + F G+ + Y+
Sbjct: 212 NTVAEVISQGGKVLIPAFAIGRAQEIILILRNFMKKRKVSFNV-FIDGMVREVIRVYRNN 270
Query: 121 ITWTNQKIRKTFVQ-RNMFDFKHIRPFD-----KSFIDNPGPMVVFATPGMLHSGLSLII 174
T+ + + K ++ +F +I + I + P V+ ++ GML G S+
Sbjct: 271 PTYLSSRYYKRVLKGEEIFLADNINVISDKKQREEIISSSDPCVIISSSGMLTGGPSVFY 330
Query: 175 FKKWAPVENNMLIMPGFCVQGTIGHKVL------SGVKKLEFENKQIIDVKMAIEYMSFS 228
+K +N ++ + G+ + G K+L +K++ K+ +VK +E S
Sbjct: 331 AEKIVQNQNALIAITGYQDEEAPGRKLLELAELPENERKIDLNGKE-YEVKCRVEKYGLS 389
Query: 229 AHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGE 280
AH+D I+ + P+ V+ HG + + + +E + +P NGE
Sbjct: 390 AHSDRDRILGFLVTLRPRTVIFAHGSEDAISQISDMAVKELEANIIVPQNGE 441
>gi|408382616|ref|ZP_11180159.1| KH-domain/beta-lactamase-domain-containing protein
[Methanobacterium formicicum DSM 3637]
gi|407814692|gb|EKF85316.1| KH-domain/beta-lactamase-domain-containing protein
[Methanobacterium formicicum DSM 3637]
Length = 634
Score = 115 bits (289), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 83/294 (28%), Positives = 150/294 (51%), Gaps = 16/294 (5%)
Query: 7 GNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYAT--TIRDSKRCRERDFLKKVH 64
G + VYTGD+ R L A R + ++ ESTY ++ ++ E++ +K ++
Sbjct: 342 GQHNFVYTGDFKYERSRLLEPAVSKFPRIESMVMESTYGGHEDVQPTRNDAEKELIKTIY 401
Query: 65 ECVDRGGKVLIPVFALGRAQELCILLETYWERMNL-QAPIYFAVGLTEKATNYYKMFITW 123
++R GK+LIPVFA+GRAQEL I+L+ Y + + PIY G+ +AT + +
Sbjct: 402 HTLERKGKILIPVFAVGRAQELMIVLDEYIRHGIIDEVPIYID-GMIWEATAIHTARPEY 460
Query: 124 TNQKIRKTF--VQRNMFD---FKHIRPF-DKSFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
++ +R + RN F F + ++ I P ++ +T GML G S+ FK
Sbjct: 461 LSKDLRDQIFHMGRNPFISEVFHKVNNVEERKDIVEGEPSIILSTSGMLTGGNSVEYFKW 520
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENK---QIIDVKMAIEYM-SFSAHADA 233
E + L+ G+ +G++G ++ G K++ + + + VKM I+ + FS H+D
Sbjct: 521 LCEDERSSLVFVGYQAEGSLGRRLQKGWKEIPLKEEGKTNVYHVKMGIKTIEGFSGHSDR 580
Query: 234 KGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQ 285
+ +M ++ PK +L+ HG+ K + L I + + ++ P N E+ +Q
Sbjct: 581 RQLMDYVRRISPKPEKILICHGDNYKTLDLASSIYRSYKIETKTPMNLETVRIQ 634
>gi|325284382|ref|YP_004256922.1| beta-lactamase domain protein [Deinococcus proteolyticus MRP]
gi|324316446|gb|ADY27559.1| beta-lactamase domain protein [Deinococcus proteolyticus MRP]
Length = 526
Score = 115 bits (289), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 90/307 (29%), Positives = 143/307 (46%), Gaps = 34/307 (11%)
Query: 10 SIVYTGDYNMTPDRHLGAAWIDKCRP---DLLITESTYATTIRDSKRCRERDFLKKVHEC 66
++ +TGD + + AAW+ + D +++E+TY T+ +++ + + F+ V +
Sbjct: 162 TVFHTGDVSNVSTPVVDAAWMPPRKTHAVDAVVSEATYGDTLLPARKIQVQAFMHGVAQV 221
Query: 67 VDRGGKVLIPVFALGRAQELCILLETYW-ERMNLQAPIYFAVGLTEKATNYYKMFITWTN 125
V+ GG+VLIP FALGRAQE+ LL + + P+Y G+ T Y+ +
Sbjct: 222 VEGGGRVLIPSFALGRAQEIVQLLHGGMVSGIIPEVPLYLD-GMVRTVTEAYQDMLPLLP 280
Query: 126 QKIR---KTFVQRNMF-----------DFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLS 171
Q +R KT Q F D + I+ DK P VV A+ GMLH+G S
Sbjct: 281 QALRNRVKTSQQPAFFTGTVEAVQSAKDRERIQRSDK-------PAVVIASSGMLHAGAS 333
Query: 172 LIIFKKWAPVENNMLIMPGFCVQGTIGHKVL----SGVKKLEFENK-QIIDVKMAIEYMS 226
+ W P N L + G+ + G ++L G L +N + IE
Sbjct: 334 PSYARAWLPDSGNALFVVGYQDGESPGRRLLELQQGGEVLLPDQNGFSAVSAYARIERFY 393
Query: 227 FSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQT 286
SAH+D G++ +I +PK VLL HGEA+ L +K D +PA GE ++
Sbjct: 394 LSAHSDQGGLLSMIARYDPKKVLLTHGEAAPRRALAGFLKTR---DVALPAAGEMVSLRA 450
Query: 287 DMKISID 293
K ++
Sbjct: 451 QPKRKVE 457
>gi|12846103|dbj|BAB27031.1| unnamed protein product [Mus musculus]
Length = 406
Score = 115 bits (289), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 59/172 (34%), Positives = 101/172 (58%), Gaps = 7/172 (4%)
Query: 117 YKMFITWTNQKIRKTFVQRNMFDFKHIRPFDK-SFIDNPGPMVVFATPGMLHSGLSLIIF 175
Y+ ++ N KIRK N F FKHI D+ GP VV A+PGM+ +GLS +F
Sbjct: 4 YQTYVNAMNDKIRKQININNPFVFKHISNLKSMDHFDDIGPSVVMASPGMIQNGLSRELF 63
Query: 176 KKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKG 235
+ W + N +I+ G+CV+GT+ ++S +++ + Q + +KM+++Y+SFSAH D +
Sbjct: 64 ESWCTDKRNGVIIAGYCVEGTLAKHIMSEPEEITTMSGQKLPLKMSVDYISFSAHTDYQQ 123
Query: 236 IMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF------NLDCFMPANGES 281
+ I+ +P +V+LVHGE ++M LK + +E+ +++ P N E+
Sbjct: 124 TSEFIRALKPPHVILVHGEQNEMARLKAALIREYEDNDEVHIEVHNPRNTEA 175
>gi|255524630|ref|ZP_05391583.1| beta-lactamase domain protein [Clostridium carboxidivorans P7]
gi|255511654|gb|EET87941.1| beta-lactamase domain protein [Clostridium carboxidivorans P7]
Length = 827
Score = 115 bits (289), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 76/287 (26%), Positives = 136/287 (47%), Gaps = 17/287 (5%)
Query: 10 SIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFLKKVHECVDR 69
S+ Y+GD+++ + + + K RPD I E+TY + ++ E L+ ++EC
Sbjct: 163 SLFYSGDFSVFSQKTVEGLKVPKLRPDAAIFETTYGDRLHANREVEEERLLQLINECEAN 222
Query: 70 GGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMFITWTNQKIR 129
GK+LIP FALGRAQE+ ++++ + +++ + G+ + YK+ + +
Sbjct: 223 RGKMLIPAFALGRAQEIILIIKKALNKKSIKNVKIYVDGMIKDINRVYKLNPLYLKNSLG 282
Query: 130 KT-------FVQRNMF--DFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAP 180
K F N+ D K IR + +++ P ++ ++ GML G S +K AP
Sbjct: 283 KKILRGIEPFYDDNIIAVDKKEIR---EKILEDKEPCIIISSSGMLTGGYSQYYAEKIAP 339
Query: 181 VENNMLIMPGFCVQGTIGHK----VLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGI 236
+EN +++ G+ + + G K + K N + I VK +E + SAH D I
Sbjct: 340 MENGYIVITGYQDEESPGRKLLNLLEEEEDKKLELNGRSIPVKCKVEKVGLSAHGDKGEI 399
Query: 237 MQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCF 283
LI P N+ +VHG + +++ E + P GE C+
Sbjct: 400 KSLINLLTPNNIFMVHGNEEVVESFSKELLSEVRARVYAPKCGE-CY 445
>gi|256003924|ref|ZP_05428910.1| RNA-metabolizing metallo-beta-lactamase [Clostridium thermocellum
DSM 2360]
gi|281417321|ref|ZP_06248341.1| RNA-metabolising metallo-beta-lactamase [Clostridium thermocellum
JW20]
gi|385778965|ref|YP_005688130.1| RNA-metabolising metallo-beta-lactamase [Clostridium thermocellum
DSM 1313]
gi|419723019|ref|ZP_14250154.1| RNA-metabolising metallo-beta-lactamase [Clostridium thermocellum
AD2]
gi|419725077|ref|ZP_14252132.1| RNA-metabolising metallo-beta-lactamase [Clostridium thermocellum
YS]
gi|255992052|gb|EEU02148.1| RNA-metabolizing metallo-beta-lactamase [Clostridium thermocellum
DSM 2360]
gi|281408723|gb|EFB38981.1| RNA-metabolising metallo-beta-lactamase [Clostridium thermocellum
JW20]
gi|316940645|gb|ADU74679.1| RNA-metabolising metallo-beta-lactamase [Clostridium thermocellum
DSM 1313]
gi|380771697|gb|EIC05562.1| RNA-metabolising metallo-beta-lactamase [Clostridium thermocellum
YS]
gi|380780786|gb|EIC10449.1| RNA-metabolising metallo-beta-lactamase [Clostridium thermocellum
AD2]
Length = 541
Score = 115 bits (289), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 77/277 (27%), Positives = 142/277 (51%), Gaps = 18/277 (6%)
Query: 36 DLLITESTYATTIRDSKRCRERDFLKKVHECVDRGGKVLIPVFALGRAQELCILL----E 91
D L+ ESTY + + + + FL ++ +++GG V+IP FA+GR QEL L E
Sbjct: 206 DYLVIESTYGNRLHNERVDKAEYFLNLINSTIEKGGNVIIPSFAVGRTQELIYELNKKKE 265
Query: 92 TYWERMNL--QAPIYFAVGLTEKATNYYKMFITWTNQKIRKTFVQR--NMFDFKHIR--- 144
Y E++ + P+Y + AT ++ + +++ R+ +++ N DF ++
Sbjct: 266 VYDEKLKMLFATPVYIDSPMAISATEVFRNNLDCYDEEARE-YIENGDNPLDFPGLQFTR 324
Query: 145 --PFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVL 202
K+ + ++ A GM +G K +++ +I G+ +GT+G ++L
Sbjct: 325 TAEESKALNERKESSIIIAASGMCEAGRIRHHLKHNLWRKDSTIIFVGYQAEGTLGRRLL 384
Query: 203 SGVKKLEFENKQIIDVKMAIEYM-SFSAHADAKGIMQLIQYC--EPKNVLLVHGEASKMV 259
G KK+ ++ I V+ IE + FS HAD +G+++ I+ +PK + +VHGE M+
Sbjct: 385 DGAKKVRLFGEE-ISVEARIEMIEGFSGHADKEGLLKWIERFNRKPKKIFIVHGEEDAMI 443
Query: 260 FLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSV 296
E+I + FN++ +P+ GES + I+ID V
Sbjct: 444 EFSEEINRRFNIETVIPSRGESFILNAHNVIAIDEKV 480
>gi|346469487|gb|AEO34588.1| hypothetical protein [Amblyomma maculatum]
Length = 510
Score = 115 bits (288), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 102/380 (26%), Positives = 175/380 (46%), Gaps = 62/380 (16%)
Query: 669 KKSYEVQVMSQVVAAVTNSCDSSHIIDLGGGQGYLSTILALQHGKKTLSLDYNQVNTHGA 728
KK +E+ ++ VV + C H++D+G GQG+L+ +LAL + ++D V +H A
Sbjct: 145 KKQHEITRLAMVVELLAQKCGCQHVLDVGSGQGHLARLLALDKQLRVATVDL--VGSHLA 202
Query: 729 AVRSKKLEDSEKESKGPFQSYAGVINKKLWMRTQNRQTSHASVEHHGKNWKRKSKAPVVS 788
+ + R ++ H + G+ + S+ P V
Sbjct: 203 SAQ----------------------------RFDHQAVLHVQKKAAGEE-SKTSEPPGVL 233
Query: 789 IKDEELVVCKDKCKQITHFVTPDSDISSILSQAYPQ--DSLHNVCIMGLHTCGDLSGTAL 846
+ EL +++ TP +++ + +A+ +HN +GLHTCGDL + +
Sbjct: 234 PQHVEL--------EVSMTTTP-TELEQVAKKAWGTGVSDVHNFIFVGLHTCGDLGPSMI 284
Query: 847 RLFTKS-SLQCLVQVGCCYHLLEEEFIRSPFWKDVDQSLYEHGYGFPLSEHLRS--RKFF 903
RLF S + + LV VGCCY L E D+ Q+ +P+S H+R+ +
Sbjct: 285 RLFADSPAARGLVSVGCCYMKLNESKHEG---ADMKQA-------YPMSSHVRAMGNRSM 334
Query: 904 LGRNVRMSGTQSPERVIDLKQTQTLPL---FYRALLEKYLRSKITINDEEPKVVGRLATK 960
L R + E + + L YRALLE+ + ++ + D+ +G +
Sbjct: 335 LSYQAREVACHAIEMYAKKLKDSPMALKIHCYRALLERLIVNRYS--DQRHCGLGCVKHA 392
Query: 961 CS-NFVEYVHRAVDKLKLDLEVDDEEVTRLFNSHQREYEYLQIYYFLKTALAPVIEALIV 1019
S F EY +A+ K L +D + L E++ + I+Y L+ LAPVIE+LI+
Sbjct: 393 SSMTFAEYAKKALHKFPFPLSGEDMDAPEL-ELAMNEWQRVVIFYSLRLLLAPVIESLIL 451
Query: 1020 LDRVLYLREQQKPQIISNLF 1039
+DR+LYL +Q P + +LF
Sbjct: 452 VDRMLYLWDQCIPSCLVSLF 471
Score = 42.7 bits (99), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 19/37 (51%), Positives = 29/37 (78%)
Query: 1061 EYEYLQIYYFLKTALAPVIEALIVLDRVLYLREQQKP 1097
E++ + I+Y L+ LAPVIE+LI++DR+LYL +Q P
Sbjct: 428 EWQRVVIFYSLRLLLAPVIESLILVDRMLYLWDQCIP 464
>gi|300121617|emb|CBK22135.2| unnamed protein product [Blastocystis hominis]
Length = 545
Score = 115 bits (287), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 87/288 (30%), Positives = 132/288 (45%), Gaps = 19/288 (6%)
Query: 1 MFQVKVGNQSIVYTGDYN-----MTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCR 55
M++++ +SI+YTGD++ P L A ++ D+LI E TY T S R
Sbjct: 161 MWEIEAYGRSILYTGDFSDEPGGFIPSYQLPARFLRPGNLDMLIMECTYGNTEFKSYEDR 220
Query: 56 ERDFLKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATN 115
+ L V + GKVLIP + +G QEL + W NL PIYF+ T+
Sbjct: 221 KNTLLDSVLSTIQNRGKVLIPCYGIGHTQELLAMFLDLWNSKNLTTPIYFSSQDTKSTLP 280
Query: 116 YYKMFITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIF 175
+ + +WT+ + I+PF F + P ++FAT + SG+S +F
Sbjct: 281 LFSYYSSWTSHPLS--------LQSDRIQPFRMEFAASEEPFLLFATHPTMTSGVSKAVF 332
Query: 176 KKWAPVENNMLIMPGFCVQGTIGHKVL-SGVKKLEFENKQIIDVKMAIEYMSFSAHADAK 234
A NM+I G TI +++ S L+ +N I V + + FSAH D K
Sbjct: 333 PSIAGNPKNMVIFLGQNSPDTIAGQIMNSPAVTLDGKN---IPVLCQRKIVPFSAHPDRK 389
Query: 235 GIMQLIQYCEPKNVLLVHGEASKMVFL-KEKIKQEFNLDCFM-PANGE 280
+LI+ V+L+HG+ + V L K K N FM P NG+
Sbjct: 390 ANCRLIERTGASCVVLIHGDKANCVGLSKYYSKMHKNGPIFMIPENGK 437
>gi|168053943|ref|XP_001779393.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162669191|gb|EDQ55783.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 502
Score = 115 bits (287), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 110/401 (27%), Positives = 179/401 (44%), Gaps = 81/401 (20%)
Query: 662 VSQLMSEKKSYEVQV----MSQVVAAVTNSCDSSHIIDLGGGQGYLSTILALQHGKKTLS 717
++Q MS KK +EV V ++ V+A S +ID+G GQGYL+ +LA ++ LS
Sbjct: 114 LAQGMSVKKRHEVGVQIGILAAVIAGTAELSGSKDVIDVGAGQGYLALVLAFEY---RLS 170
Query: 718 LDYNQVNTHGAAVRSKKLEDSEKESKGPFQSYAGVINKKLWMRTQNRQTSHASVEHHGKN 777
+ H A V +K+ + EK + T+ R++ +
Sbjct: 171 VTAVDACAHHADVTNKRAQRIEK-----------------YYNTRLRKSQQVGLG----- 208
Query: 778 WKRKSKAPVVSIKDEELVVCKDKCKQITHFVTP------DSDISSILSQAYPQDSLHN-- 829
+++ + V C+ + + ++ +S +S + + P D N
Sbjct: 209 --------CSAVRAPQTVTCRVGTGEFSAALSSLLPAFEESKLSEVFANERPADHDENTF 260
Query: 830 -------VCIMGLHTCGDLSGTALRLFTKS-SLQCLVQVGCCYHLLEEEFIRSPFWKDVD 881
V + GLH CGDLS T LR F + + ++ V CCYHLL EE D
Sbjct: 261 ADSKNCSVVLAGLHACGDLSATMLRTFIECKEVTAVINVACCYHLLTEE-------SSND 313
Query: 882 QSLYEHGYGFPLSEHLRSRKFFLGRNVRMSGTQSPERVIDLKQTQTLPLF----YRA--- 934
+ + YGFP+SE + + LGR+ R +SP+R + T+ + F +RA
Sbjct: 314 KKEF---YGFPMSEGVSNLGLELGRSARELACESPDRWKEHHPTRAIQNFELHAFRAAFQ 370
Query: 935 -LLEKY--LRSKITINDEEPKVVGRLATKCSNFVEYVHRAVDKLKLDLEVDDEEVTRLFN 991
L+E L TIN E + + + F +Y ++ L DL V R
Sbjct: 371 LLVENVGDLNPSSTINAEFA-----VEQQAARFEDYAKPVLELL--DLPPLSSSVFRCIW 423
Query: 992 SHQREYEYL-QIYYFLKTALAPVIEALIVLDRVLYLREQQK 1031
S ++ L +Y L+T LAPVIE+ I+LDR+LY++E+ +
Sbjct: 424 SEVAAHQDLVGPFYSLRTVLAPVIESYILLDRLLYVKERAE 464
>gi|407463310|ref|YP_006774627.1| beta-lactamase domain-containing protein [Candidatus Nitrosopumilus
koreensis AR1]
gi|407046932|gb|AFS81685.1| beta-lactamase domain-containing protein [Candidatus Nitrosopumilus
koreensis AR1]
Length = 646
Score = 114 bits (286), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 82/293 (27%), Positives = 147/293 (50%), Gaps = 20/293 (6%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYA--TTIRDSKRCRERDF 59
F + G+ + VY+GD AA + R + L+ ESTY I+ S++ E F
Sbjct: 347 FHIGNGDHNFVYSGDIKFGKSILFEAASWNFPRVETLLIESTYGLKEDIQPSRQEVESAF 406
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNL-QAPIYFAVGLTEKATNYYK 118
+ V+ + GGKVLIP+ A+GRAQE+ ++++ Y + + +AP+ F G+ +A+ ++
Sbjct: 407 INAVNNTLADGGKVLIPIPAVGRAQEIMMVIDHYMKSGEMIEAPV-FTEGMISEASAIHE 465
Query: 119 MFITWTNQKIRKTFVQR--NMFD------FKHIRPFDKSFIDNPGPMVVFATPGMLHSGL 170
+ + +++++ ++ N FD +H ++ +N P ++ AT GML G
Sbjct: 466 SYPEYLARELKQKILETDDNPFDSEYFTNIEHADAREEPMREN-SPCIILATSGMLEGGP 524
Query: 171 SLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENK----QIIDVKMAIEYM- 225
L FK AP + + ++ + V GT+G +VL G K+ K +++ + +E +
Sbjct: 525 VLEYFKNIAPDKKSKVLFVSYQVNGTLGRRVLDGSKQATMLGKEGKVEVVTINCGVEKLD 584
Query: 226 SFSAHADAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMP 276
FS H+D +M +Q PK VL+ HGE K L I++ + P
Sbjct: 585 GFSGHSDYNQLMSFVQRLRPKLRRVLVNHGERKKSENLAMNIRRMHRVPAHYP 637
>gi|297619493|ref|YP_003707598.1| beta-lactamase domain-containing protein [Methanococcus voltae A3]
gi|297378470|gb|ADI36625.1| beta-lactamase domain protein [Methanococcus voltae A3]
Length = 422
Score = 114 bits (286), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 85/285 (29%), Positives = 141/285 (49%), Gaps = 18/285 (6%)
Query: 6 VGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATT--IRDSKRCRERDFLKKV 63
+ + ++YTGD N R L A D D +I ESTY + ++ S++ E+ + ++
Sbjct: 143 IDGKKLLYTGDINEIETRTLNPADTDIDEIDTIIIESTYGSPLDVKPSRKVLEKQLIDEI 202
Query: 64 HECVDRGGKVLIPVFALGRAQELCILLETYWERMNL-QAPIYFAVGLTEKATNYYKMFIT 122
E ++ GKV+IPVFA+GRAQE+ +++ Y L + P+Y LT T Y +
Sbjct: 203 SETIEENGKVIIPVFAVGRAQEIIVIINNYIRSGLLKKVPVYVCGSLTH-TTGMYMSYSE 261
Query: 123 WTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSG-----LSLIIFKK 177
W N KI + F ++ D + +N P ++ +T GM+ G LSL+ K
Sbjct: 262 WLNPKIN-NLMNNGTNPFGNLLKADDNIFNNNEPCIIISTSGMVQGGPVLQYLSLLKNPK 320
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIM 237
N LI+ G+ +GTIG + G ++ K+ I +K I + FSAH D ++
Sbjct: 321 ------NKLILTGYQGEGTIGRSLEEGATEIT-PFKKPIQIKGKITKIEFSAHGDYNSLV 373
Query: 238 Q-LIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGES 281
+ L + EPK +++HGE + + L I + F + P G +
Sbjct: 374 RYLKKIPEPKKAIVMHGERYQALSLAMTIWKMFKIPAIAPTIGST 418
>gi|125973129|ref|YP_001037039.1| RNA-metabolising metallo-beta-lactamase [Clostridium thermocellum
ATCC 27405]
gi|125713354|gb|ABN51846.1| RNA-metabolising metallo-beta-lactamase [Clostridium thermocellum
ATCC 27405]
Length = 541
Score = 114 bits (286), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 77/277 (27%), Positives = 141/277 (50%), Gaps = 18/277 (6%)
Query: 36 DLLITESTYATTIRDSKRCRERDFLKKVHECVDRGGKVLIPVFALGRAQELCILL----E 91
D L+ ESTY + + + + FL ++ +++GG V+IP FA GR QEL L E
Sbjct: 206 DYLVIESTYGNRLHNERVDKAEYFLNLINSTIEKGGNVIIPSFAAGRTQELIYELNKKKE 265
Query: 92 TYWERMNL--QAPIYFAVGLTEKATNYYKMFITWTNQKIRKTFVQR--NMFDFKHIR--- 144
Y E++ + P+Y + AT ++ + +++ R+ +++ N DF ++
Sbjct: 266 VYDEKLKMLFATPVYIDSPMAISATEVFRNNLDCYDEEARE-YIENGDNPLDFPGLQFTR 324
Query: 145 --PFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVL 202
K+ + ++ A GM +G K +++ +I G+ +GT+G ++L
Sbjct: 325 TAEESKALNERKESSIIIAASGMCEAGRIRHHLKHNLWRKDSTIIFVGYQAEGTLGRRLL 384
Query: 203 SGVKKLEFENKQIIDVKMAIEYM-SFSAHADAKGIMQLIQYC--EPKNVLLVHGEASKMV 259
G KK+ ++ I V+ IE + FS HAD +G+++ I+ +PK + +VHGE M+
Sbjct: 385 DGAKKVRLFGEE-ISVEARIEMIEGFSGHADKEGLLKWIERFNRKPKKIFIVHGEEDAMI 443
Query: 260 FLKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSV 296
E+I + FN++ +P+ GES + I+ID V
Sbjct: 444 EFSEEINRRFNIETVIPSRGESFILNAHNVIAIDEKV 480
>gi|161529235|ref|YP_001583061.1| beta-lactamase domain-containing protein [Nitrosopumilus maritimus
SCM1]
gi|160340536|gb|ABX13623.1| beta-lactamase domain protein [Nitrosopumilus maritimus SCM1]
Length = 646
Score = 114 bits (286), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 83/292 (28%), Positives = 147/292 (50%), Gaps = 18/292 (6%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYA--TTIRDSKRCRERDF 59
F + G+ + VY+GD AA + R + L+ ESTY I+ S++ E F
Sbjct: 347 FHIGNGDHNFVYSGDIKFGKSILFEAASWNFPRVETLLIESTYGLKEDIQPSRQEVESAF 406
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNL-QAPIYFAVGLTEKATNYYK 118
+ V+ + GGKVLIP+ A+GRAQE+ ++++ Y + + +AP+ F G+ +A+ ++
Sbjct: 407 INAVNNTLADGGKVLIPIPAVGRAQEIMMVIDHYMKSGEMIEAPV-FTEGMISEASAIHE 465
Query: 119 MFITWTNQKIRKTFVQR--NMFD---FKHIRPFD--KSFIDNPGPMVVFATPGMLHSGLS 171
+ + +++++ ++ N FD F +I D + + P ++ AT GML G
Sbjct: 466 SYPEYLARELKQKILETDDNPFDSEYFTNIEHADAREEPMREDSPCIILATSGMLEGGPV 525
Query: 172 LIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENK----QIIDVKMAIEYM-S 226
L FK AP + + ++ + V GT+G +VL G K+ K +++ + +E +
Sbjct: 526 LEYFKNIAPDKKSKVLFVSYQVNGTLGRRVLDGSKQATMLGKDGKVEVVTINCGVEKLDG 585
Query: 227 FSAHADAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMP 276
FS H+D +M +Q PK VL+ HGE K L I++ + P
Sbjct: 586 FSGHSDYNQLMSFVQRLRPKLRRVLVNHGERKKSENLAMNIRRMHRVPAHYP 637
>gi|410460059|ref|ZP_11313745.1| RNA-metabolising metallo-beta-lactamase [Bacillus azotoformans LMG
9581]
gi|409927679|gb|EKN64809.1| RNA-metabolising metallo-beta-lactamase [Bacillus azotoformans LMG
9581]
Length = 907
Score = 114 bits (285), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 83/280 (29%), Positives = 134/280 (47%), Gaps = 13/280 (4%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHL-GAAWIDKCRPDLLITESTYATTIRDSKRCRERDF 59
MF ++ Q ++ TGD + R + GA + PD+++ ESTY + ER
Sbjct: 155 MFLIEGDQQQLLITGDLSFRAGRTIPGAEVPHQVNPDVIVIESTYGNRAHTDRNTEERLL 214
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWER-MNLQAPIYFAVGLT--EKATNY 116
+ V E + GG VLIP FALGRAQE+ ++L+ Y ER + + PIY +T K N
Sbjct: 215 AEHVSEVISSGGFVLIPAFALGRAQEVLLVLQDYMERGLIPEFPIYVDGLVTPISKIYNS 274
Query: 117 YKMFITW-TNQKIRKT---FVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSL 172
Y ++ +IRK F+ + + P +++ + P + A+ GML G S
Sbjct: 275 YPQYLKGPVAHRIRKNGDVFITEGR--VRAVSPKERNEVLQGKPGCIVASSGMLTGGASS 332
Query: 173 IIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFE---NKQIIDVKMAIEYMSFSA 229
++ E N + + G+ + + G K+L+ + +E + N +VK + S
Sbjct: 333 WYAERLISNEKNAIFITGYQDEESPGRKLLNLAENIENQLELNGTTYEVKCLVRKYGLSG 392
Query: 230 HADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEF 269
HADA I + IQ +P LLVHG+ + +I Q F
Sbjct: 393 HADANEINRFIQSLKPTYTLLVHGDDDARSQIASQIDQRF 432
>gi|385803114|ref|YP_005839514.1| beta-lactamase [Haloquadratum walsbyi C23]
gi|339728606|emb|CCC39763.1| beta-lactamase domain protein [Haloquadratum walsbyi C23]
Length = 639
Score = 114 bits (285), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 80/300 (26%), Positives = 145/300 (48%), Gaps = 23/300 (7%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRC----RER 57
F + G ++ ++GD + R A D R + LI ESTY RD + ER
Sbjct: 339 FHIGDGLYNVAFSGDIHYEDTRLFNGAVNDFPRVETLILESTYGG--RDDYQTDQEDSER 396
Query: 58 DFLKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNL-QAPIYFAVGLTEKATNY 116
+ ++ ++E DRGGKV+IP FA+GR+QE+ ++LE + Q P++ G+ +AT
Sbjct: 397 ELIEVINETYDRGGKVVIPAFAVGRSQEIMLVLEEAMRTGEIPQMPVHLD-GMIWEATAI 455
Query: 117 YKMFITWTNQKIRKTFVQ--RNMF---DFKHIRPF--DKSFIDNPGPMVVFATPGMLHSG 169
+ + + +R N F +F HI ++ + + P +V +T GM+ G
Sbjct: 456 HTTYPEYLRDNLRDRIFHDDENPFLAEEFNHIDAGEEERQAVSDGEPCIVLSTSGMITGG 515
Query: 170 LSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQI-----IDVKMAIEY 224
+ + ++ L+ G+ QGT+G ++ G ++ Q+ + ++M +
Sbjct: 516 PIMSWLQHIGTDPDSRLVFVGYQAQGTLGRRIQDGWDEIPINGDQLGRSETLTLEMDVTT 575
Query: 225 MS-FSAHADAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGES 281
+S FS HAD +G+ ++ P+ +L VHG+ + L + ++NL F P N E+
Sbjct: 576 VSGFSGHADRQGLENFVKTMNPRPEKILCVHGDEQSVQDLSSGLYHDYNLRTFAPKNLET 635
>gi|374710344|ref|ZP_09714778.1| beta-lactamase domain-containing protein [Sporolactobacillus
inulinus CASD]
Length = 916
Score = 114 bits (285), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 83/293 (28%), Positives = 135/293 (46%), Gaps = 17/293 (5%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHL-GAAWIDKCRPDLLITESTYATTIRDSKRCRERDF 59
MF ++ G + ++ +GD + R + GA RPD LI ESTY + E+
Sbjct: 161 MFTIETGEEKLLVSGDLSFKAGRTIPGAEVPTGSRPDALIIESTYGNREHSDRNTEEKRL 220
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNL-QAPIYFAVGLTEKATNYYK 118
V E + GG LIP FALGRAQE+ ++L+ Y +R + Q PIY GL + Y+
Sbjct: 221 ADNVAEVIAAGGFALIPAFALGRAQEVLLILQDYMDRGLIPQFPIYVD-GLVTPISKIYR 279
Query: 119 MFITWTNQKIRKTFVQRNMFDF------KHIRPFDKSFIDNPGPMVVFATPGMLHSGLSL 172
+ + + +Q+N F + P D+ + P + A+ GML G S+
Sbjct: 280 RYPHYLKGPVAHR-IQQNGDAFLTEGRCTAVEPKDREQVLKGKPACIVASSGMLIGGASV 338
Query: 173 IIFKKWAPVENNMLIMPGFCVQGTIGHKVL----SGVKKLEFENKQIIDVKMAIEYMSFS 228
++ E N + + G+ + + G K+L ++LE N + VK ++ S
Sbjct: 339 WYAERLVGSEKNAVFITGYQDEESPGRKLLRLAEGESRELEL-NGVVHQVKCRVDKYGLS 397
Query: 229 AHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGES 281
AH DA + + I P L+VHG+ L E+I + + + NGE+
Sbjct: 398 AHGDAGELTRFISMIRPAKTLIVHGDDEARTALLERIDRRSH--PLLTENGEA 448
>gi|405960407|gb|EKC26333.1| Uncharacterized protein C12orf26 [Crassostrea gigas]
Length = 225
Score = 114 bits (285), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 79/205 (38%), Positives = 110/205 (53%), Gaps = 14/205 (6%)
Query: 832 IMGLHTCGDLSGTALRLFTKS-SLQCLVQVGCCYHLLEEEFIRSPFWKDVDQSLYEHGYG 890
+ GLHTCG L LRLF S +++ L V CCYHL+ E+F+ SPF D S G
Sbjct: 2 LTGLHTCGALGSNILRLFVNSPNVKVLCNVACCYHLMSEKFVASPFEDKEDVS---EEAG 58
Query: 891 FPLSEHLRSRKFFLGRNVRMSGTQSPERVIDLKQTQTLP-LFYRALLEKYLRSKITINDE 949
FPLSE L+ +GR R +QS R + T+ L F R LL+ LR ++
Sbjct: 59 FPLSEFLQRNTVGIGRQARNLASQSVNR--RAQDTEVLAKQFPRMLLQVILRDQLGYVRS 116
Query: 950 EPKVVGRLATKCSNFVEYVHRAVDKLKLDLEVDDEEVTRLFNSHQREYE----YLQIYYF 1005
+ K + +L K +YV RA KL L + EE L +S+ ++YE ++
Sbjct: 117 DWKGLRKLDQKSQGMFDYVTRAFRKLGLPSDGITEE---LIDSYCQKYEDQKKKFAAFFQ 173
Query: 1006 LKTALAPVIEALIVLDRVLYLREQQ 1030
LKT LAP IEA++VLDR+ +L EQ+
Sbjct: 174 LKTVLAPCIEAVVVLDRLCFLLEQE 198
Score = 41.2 bits (95), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 21/46 (45%), Positives = 31/46 (67%), Gaps = 4/46 (8%)
Query: 1054 LFDSHQREYE----YLQIYYFLKTALAPVIEALIVLDRVLYLREQQ 1095
L DS+ ++YE ++ LKT LAP IEA++VLDR+ +L EQ+
Sbjct: 153 LIDSYCQKYEDQKKKFAAFFQLKTVLAPCIEAVVVLDRLCFLLEQE 198
>gi|217075847|gb|ACJ86283.1| unknown [Medicago truncatula]
Length = 230
Score = 114 bits (285), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 53/67 (79%), Positives = 58/67 (86%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF VKVG+ +VYTGDYNMTPDRHLGAA ID+ R DLLITESTYATTIRDSK RER+FL
Sbjct: 162 MFYVKVGDAEMVYTGDYNMTPDRHLGAAQIDRLRLDLLITESTYATTIRDSKYAREREFL 221
Query: 61 KKVHECV 67
K VH+CV
Sbjct: 222 KAVHKCV 228
>gi|386854313|ref|YP_006258693.1| Beta-lactamase domain protein [Deinococcus gobiensis I-0]
gi|380002642|gb|AFD27829.1| Beta-lactamase domain protein [Deinococcus gobiensis I-0]
Length = 541
Score = 114 bits (285), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 83/293 (28%), Positives = 146/293 (49%), Gaps = 19/293 (6%)
Query: 4 VKVGNQSIVYTGDYNMTPDRHLGAAWID-KCRP-DLLITESTYATTIRDSKRCRERDFLK 61
++ G +S+ +TGD + + AAW+ + P D +++ESTY T+ S++ + R F+
Sbjct: 154 IESGGRSVFHTGDVSNVDTPVVNAAWLPAQVTPVDAVVSESTYGDTLLPSRKEQVRTFVA 213
Query: 62 KVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQA-PIYFAVGLTEKATNYYK-- 118
+ E + GGKVLIP FALGRAQE+ +L+T L + P+Y GLT + T Y+
Sbjct: 214 AIGETLRAGGKVLIPSFALGRAQEITQILQTGMASGLLPSVPLYLD-GLTRQMTQTYEDL 272
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHI-----RPFDKSFIDNPGPMVVFATPGMLHSGLSLI 173
+ + + R+ + +F + R + + + GP VV A+ GMLH+G S +
Sbjct: 273 LPLLPQALQNRRQSSGQPVFLGGTVTLVGDRRDRERILASDGPAVVVASSGMLHAGASPV 332
Query: 174 IFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEF------ENKQIIDVKMAIEYMSF 227
+ W P N L + G+ + G ++L + + ++ + +E
Sbjct: 333 YARAWLPEAGNALFVVGYQDAESPGRRLLELQQGGDVLLPDGRGGREPVPAYARVERFYL 392
Query: 228 SAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGE 280
SAHAD G++ +I P VLL HGE + L + +++++ +P G+
Sbjct: 393 SAHADRGGLLGMIARYSPGKVLLTHGEVAPRHNLAGYLDTKYDVE--LPQAGQ 443
>gi|429216269|ref|YP_007174259.1| exonuclease [Caldisphaera lagunensis DSM 15908]
gi|429132798|gb|AFZ69810.1| putative exonuclease of the beta-lactamase fold involved in RNA
processing [Caldisphaera lagunensis DSM 15908]
Length = 425
Score = 113 bits (283), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 85/298 (28%), Positives = 133/298 (44%), Gaps = 42/298 (14%)
Query: 4 VKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFLKKV 63
V+ N I+YT D N + ++ D LI ESTY+T+ + E+ F+ V
Sbjct: 150 VETSNNKILYTADMNTIETKLKNPHKLNGVEADTLIIESTYSTSNHPPRDETEKRFIDDV 209
Query: 64 HECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQ--------APIYFA-------VG 108
++ V++GG VL+P F + R QE+ LLE N+ A +Y A G
Sbjct: 210 NDVVEKGGIVLVPAFGVSRGQEIMALLEEKGFGYNIWVDGMIRDIAELYIAHSNYLRSPG 269
Query: 109 LTEKATNYYKMFITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHS 168
L KA ++ W ++ RK Q P V+ A+ GM+
Sbjct: 270 LLLKAMQEQRLVKGWNDR--RKALKQ---------------------PGVIIASAGMMKG 306
Query: 169 GLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFS 228
G SL KK A N + M + + T G +L + K ++VK +E+ FS
Sbjct: 307 GPSLYYLKKLAHNPRNAVFMVSYQAEKTPGRDILETGSYYDGNTK--VEVKARVEWFDFS 364
Query: 229 AHADAKGIMQLIQYCEP-KNVLLVHGEASKMVFLKEKIK-QEFNLDCFMPANGESCFV 284
+H D +GI++ I+ + + V+LVHGE L E+I+ NLD +P NG+ +
Sbjct: 365 SHTDRRGILETIKSIQGIQRVILVHGEPEGQKSLAEEIRNHNTNLDIIIPDNGDEINI 422
>gi|448419881|ref|ZP_21580725.1| universal archaeal kh-domain/beta-lactamase-domain-containing
protein [Halosarcina pallida JCM 14848]
gi|445674795|gb|ELZ27332.1| universal archaeal kh-domain/beta-lactamase-domain-containing
protein [Halosarcina pallida JCM 14848]
Length = 640
Score = 113 bits (283), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 78/299 (26%), Positives = 143/299 (47%), Gaps = 20/299 (6%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYA--TTIRDSKRCRERDF 59
F + G ++ ++GD + R A D R + L+ ESTY + + E++
Sbjct: 339 FHIGDGLYNVAFSGDIHYEDTRLFNGAVNDFPRVETLVLESTYGGRNDYQTDQEDSEQNL 398
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQA-PIYFAVGLTEKATNYYK 118
+ ++E DRGGKVLIP FA+GR+QE+ ++LE + + P++ G+ +AT +
Sbjct: 399 IDVINETHDRGGKVLIPAFAVGRSQEIMLVLEEAMRSGKIPSMPVHLD-GMIWEATAIHT 457
Query: 119 MFITWTNQKIRKTFVQ--RNMF---DFKHIRPFDKSFID--NPGPMVVFATPGMLHSGLS 171
+ + +R N F +F HI ++ D + P +V +T GM+ G
Sbjct: 458 TYPEYLRDDLRDRIFHEDENPFLAEEFNHIDGGEEERQDVADGDPAIVLSTSGMVTGGPI 517
Query: 172 LIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENK------QIIDVKMAIEYM 225
+ + P ++ L+ G+ QGT+G ++ +G ++ ++ + +KM +E +
Sbjct: 518 MSWLRHVGPDPDSRLVFVGYQAQGTLGRRIQNGWDEIPVNDRDNVGRSNTLSLKMDVETV 577
Query: 226 -SFSAHADAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGES 281
FS HAD G+ ++ P+ VL VHG+ + L + +N+ F P N E+
Sbjct: 578 DGFSGHADRAGLENFVKTMNPRPEKVLCVHGDERSVQDLSSALYHNYNMRTFAPKNLET 636
>gi|313127082|ref|YP_004037352.1| universal archaeal kh-domain/beta-lactamase-domain-containing
protein [Halogeometricum borinquense DSM 11551]
gi|448288450|ref|ZP_21479649.1| universal archaeal kh-domain/beta-lactamase-domain-containing
protein [Halogeometricum borinquense DSM 11551]
gi|312293447|gb|ADQ67907.1| universal archaeal KH-domain/beta-lactamase-domain protein
[Halogeometricum borinquense DSM 11551]
gi|445569601|gb|ELY24173.1| universal archaeal kh-domain/beta-lactamase-domain-containing
protein [Halogeometricum borinquense DSM 11551]
Length = 640
Score = 113 bits (283), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 78/299 (26%), Positives = 140/299 (46%), Gaps = 20/299 (6%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYA--TTIRDSKRCRERDF 59
F + G ++ ++GD + R A D R + L+ ESTY + + E
Sbjct: 339 FHIGDGLYNVAFSGDIHYEDTRLFNGAVNDFPRVETLVLESTYGGRNDYQTDQADSEEKL 398
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQA-PIYFAVGLTEKATNYYK 118
+ ++E DRGGKVLIP FA+GR+QE+ ++LE + + P++ G+ +AT +
Sbjct: 399 IDVINETYDRGGKVLIPAFAVGRSQEIMLVLEEAMRSGKIPSMPVHLD-GMIWEATAIHT 457
Query: 119 MFITWTNQKIRKTFVQ--RNMF---DFKHIRPFDKSFID--NPGPMVVFATPGMLHSGLS 171
+ + +R N F +F HI + D + GP ++ +T GM+ G
Sbjct: 458 TYPEYLRDDLRDRIFHEDENPFLADEFNHIDGGEDERQDVADDGPAIILSTSGMVTGGPI 517
Query: 172 LIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENK------QIIDVKMAIEYM 225
+ + P + L+ G+ QGT+G ++ +G ++ ++ + +KM +E +
Sbjct: 518 MSWLRHVGPDPKSRLVFVGYQAQGTLGRRIQNGWDEIPVNDRDNVGRSNTLQLKMDVETV 577
Query: 226 -SFSAHADAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGES 281
FS HAD G+ ++ P+ VL VHG+ + L + +N+ F P N E+
Sbjct: 578 DGFSGHADRAGLENFVKTMNPRPEKVLCVHGDERSVQDLSSALYHNYNMRTFAPKNLET 636
>gi|332712233|ref|ZP_08432161.1| putative exonuclease of the beta-lactamase fold involved in RNA
processing [Moorea producens 3L]
gi|332349039|gb|EGJ28651.1| putative exonuclease of the beta-lactamase fold involved in RNA
processing [Moorea producens 3L]
Length = 867
Score = 113 bits (282), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 83/260 (31%), Positives = 129/260 (49%), Gaps = 16/260 (6%)
Query: 8 NQSIVYTGDYNMTPDRHL-GAAWIDKCRPDLLITESTYATTIRDSKRCRERDFLKKVHEC 66
+ S++YTGD++ T R G + D+LITESTY +++ ++R +E ++ + E
Sbjct: 435 DMSLLYTGDFHTTNSRTTNGLNLAELPEADILITESTYGSSVHPARRNQETALMESIAEV 494
Query: 67 VDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKM---FITW 123
V GG VLIP FALGRAQE+ + + T L PI F GL T ++ +
Sbjct: 495 VQAGGNVLIPSFALGRAQEILLGIRTSKLFNKLTVPI-FVDGLVRAVTEVFRENLDLLPV 553
Query: 124 TNQKIRKTFVQRNMFDFKHIRP------FDKSFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
Q + K Q FD P F + I P V+ A+ GML G S+
Sbjct: 554 PVQNLVKINGQEPFFDKTSTPPIISIEHFRERPIAIAKPSVIVASSGMLTGGPSVYYASV 613
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVK---KLEFENKQIIDVKMAIEYMSFSAHADAK 234
EN + + G+ + + G ++L +K +E + K I VK I+ + SAH D
Sbjct: 614 LLERENAAIFISGYTDEESPG-RLLQSLKTGDTIELDGKPIT-VKAQIKRFNLSAHTDKV 671
Query: 235 GIMQLIQYCEPKNVLLVHGE 254
G+ Q+I +PK+++L+HGE
Sbjct: 672 GLGQVINKVKPKHLVLIHGE 691
>gi|452208263|ref|YP_007488385.1| beta-lactamase domain protein [Natronomonas moolapensis 8.8.11]
gi|452084363|emb|CCQ37702.1| beta-lactamase domain protein [Natronomonas moolapensis 8.8.11]
Length = 639
Score = 113 bits (282), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 80/299 (26%), Positives = 142/299 (47%), Gaps = 22/299 (7%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFLK 61
F V G ++ ++GD + R A D R + L+ ESTY R+ + + D +
Sbjct: 340 FHVGDGLYNVAFSGDIHYDDTRLFNGAVNDFPRVETLVMESTYGG--RNDYQTDQEDSEE 397
Query: 62 K----VHECVDRGGKVLIPVFALGRAQELCILLETYWERMNL-QAPIYFAVGLTEKATNY 116
K + E GGKVLIP FA+GR+QE+ ++LE + + + P++ G+ +AT
Sbjct: 398 KLKEVIRETTTEGGKVLIPAFAVGRSQEIMLVLEEAMRKGEIPEVPVHLD-GMIWEATAI 456
Query: 117 YKMFITWTNQKIRKTFVQ--RNMF---DFKHIRPF--DKSFIDNPGPMVVFATPGMLHSG 169
+ + + +R N F F HI ++ + + GP +V +T GM+ G
Sbjct: 457 HTTYPEYLRDDLRDRIFHDDENPFLADQFNHIDGGEDERQEVADGGPCIVLSTSGMVEGG 516
Query: 170 LSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQ----IIDVKMAIEYM 225
+ + L+ G+ QGT+G ++ +G ++ ++ + + M +E +
Sbjct: 517 PIMSWLTHVGGESESSLVFVGYQAQGTLGRRIQNGWDEIPMNDRSSSRGTLTLNMGVETV 576
Query: 226 -SFSAHADAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGES 281
FS HAD +G+M ++ P+ VL VHG+ S + L + +FN+ F P N E+
Sbjct: 577 DGFSGHADRQGLMNFVRTMNPRPEKVLCVHGDESSVQDLSSALYHDFNMRTFAPKNLET 635
>gi|374296036|ref|YP_005046227.1| putative exonuclease of the beta-lactamase fold involved in RNA
processing [Clostridium clariflavum DSM 19732]
gi|359825530|gb|AEV68303.1| putative exonuclease of the beta-lactamase fold involved in RNA
processing [Clostridium clariflavum DSM 19732]
Length = 541
Score = 112 bits (281), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 80/282 (28%), Positives = 135/282 (47%), Gaps = 19/282 (6%)
Query: 36 DLLITESTYATTIRDSKRCRERDFLKKVHECVDRGGKVLIPVFALGRAQELCILLETYWE 95
D L+ ESTY I ++K + F+ ++E +D+GG V+IP FA+GR QE+ L E
Sbjct: 206 DYLVIESTYGNRIHENKTNKFEYFVDIINETIDKGGNVVIPSFAVGRTQEVIYELNKEKE 265
Query: 96 RMN------LQAPIYFAVGLTEKATNYYKMFITWTNQKIRKTFVQ-RNMFDFKHIR---- 144
+ P++ L AT ++ + +++ R+ N DF ++
Sbjct: 266 SFDGKLKKLFSIPVFVDSPLAISATEVFRNNLDCYDEEAREYINNGDNPLDFPGLQFTRT 325
Query: 145 PFD-KSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLS 203
P + K+ + G ++ + GM +G K E + ++ G+ QGT+G K++
Sbjct: 326 PEESKTLNERSGSAIIISASGMCEAGRIKHHLKHNLWREESTILFVGYQAQGTLGRKLID 385
Query: 204 GVKKLEFENKQIIDVKMAIEYM-SFSAHADAKGIMQLIQ--YCEPKNVLLVHGEASKMVF 260
G KK++ ++ I V IE + FS HAD G++ I+ Y +P + +VHGE M
Sbjct: 386 GAKKVKIFGEE-ISVNARIEMIEGFSGHADKDGLLNWIEAFYKKPSKIFIVHGEEESMTS 444
Query: 261 LKEKIKQEFNLDCFMPANGESCFVQTDMKISIDVSVNLLKEE 302
+I FN+D +P GES + D I S ++K E
Sbjct: 445 FAREINDRFNIDTIIPEKGESFVITGDKVIE---SAEMIKSE 483
>gi|389860541|ref|YP_006362780.1| beta-lactamase [Thermogladius cellulolyticus 1633]
gi|388525444|gb|AFK50642.1| beta-lactamase domain protein [Thermogladius cellulolyticus 1633]
Length = 653
Score = 112 bits (281), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 77/272 (28%), Positives = 141/272 (51%), Gaps = 19/272 (6%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFLK 61
+ +G +IVYTGD+ + R L A + R + LI ESTY T + S++ E+ +
Sbjct: 347 LHIGMGLHNIVYTGDFKYSSTRLLDKAVSEFPRVETLIIESTYGATKQQSRQAAEQQLVD 406
Query: 62 KVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAV-GLTEKATNYYKMF 120
V ++R G VLIPVFA+GR QE+ ++L E+ L P+ + GL + T + +
Sbjct: 407 IVKRTIERRGVVLIPVFAVGRGQEIMVVLNEAIEK-KLIPPVNIYIEGLINEVTAIHTEY 465
Query: 121 ITWTNQKIRKTFVQ-RNMFDFKHIRPFD----KSFIDNPGPMVVFATPGMLHSGLSLIIF 175
+ ++ +R + N F +H + + + I P ++ AT GML G ++
Sbjct: 466 PEYMSKSLRDAIYRGENPFTSEHFKIIEGDVGRPDIVEDRPSIIMATSGMLTGGPAVDYL 525
Query: 176 KKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLE--FENK-QIIDVKMAIEYM-SFSAHA 231
+ A E + LI G+ +GT+G K+ G++++ E K +++ + + + + FS H+
Sbjct: 526 RLIASDERSSLIFVGYQAEGTLGRKIKDGMREITNVVEGKVEVLKINLEVHSIDGFSGHS 585
Query: 232 DAKGIMQLIQYC-----EPKNVLLVHGEASKM 258
D +L++Y +P+N++L HGE + +
Sbjct: 586 DQ---TELVRYVLNIKPKPRNIILNHGEPNAI 614
>gi|448307837|ref|ZP_21497725.1| KH-domain/beta-lactamase-domain-containing protein [Natronorubrum
bangense JCM 10635]
gi|445594972|gb|ELY49098.1| KH-domain/beta-lactamase-domain-containing protein [Natronorubrum
bangense JCM 10635]
Length = 636
Score = 112 bits (280), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 80/297 (26%), Positives = 139/297 (46%), Gaps = 18/297 (6%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYA--TTIRDSKRCRERDF 59
F V G ++V++GD + P R A D R + +I ESTY + E +
Sbjct: 337 FHVGDGFHNVVFSGDVHYEPTRLFNGAVNDFPRAETMIMESTYGRRGDFQTDTAKSEANV 396
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQA-PIYFAVGLTEKATNYYK 118
+ +HE + G +IP FA+GR+QEL ++LE +L P+Y G+ +AT +
Sbjct: 397 HQIIHETYKKDGITVIPAFAVGRSQELMLVLEEAMREGDLPTMPVYLD-GMIREATAIHT 455
Query: 119 MFITWTNQKIRKTFVQ--RNMFDFKHIRPFD-----KSFIDNPGPMVVFATPGMLHSGLS 171
+ + +R+ + N F + R D + I + P V+ +T GM+ G
Sbjct: 456 AYPEYLRDGLRQRILHEDENPFIAEQFRQVDGGQEMREEIASGEPCVILSTSGMVTGGPI 515
Query: 172 LIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQI----IDVKMAIEYMS- 226
+ + P NN L+ G+ GT+G ++ SG ++ E + + + IE +S
Sbjct: 516 MSWLELLGPNPNNTLLFVGYQAAGTLGRRIQSGHTEIPMEGRSGRANHLTLAFRIESVSG 575
Query: 227 FSAHADAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGES 281
FS HAD G+ + ++ P+ +L VHG+ L + Q++N+ + P N E+
Sbjct: 576 FSGHADRAGLKKYVKEMNPRPETILCVHGDEQATDQLSSALYQQYNVRTYSPKNLET 632
>gi|295695465|ref|YP_003588703.1| RNA-metabolising metallo-beta-lactamase [Kyrpidia tusciae DSM 2912]
gi|295411067|gb|ADG05559.1| RNA-metabolising metallo-beta-lactamase [Kyrpidia tusciae DSM 2912]
Length = 1001
Score = 112 bits (280), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 77/270 (28%), Positives = 136/270 (50%), Gaps = 14/270 (5%)
Query: 10 SIVYTGDYNMTPDRHLGAAWIDKCRP-DLLITESTYATTIRDSKRCRERDFLKKVHECVD 68
S++ TGD+++TP R L + + D +ITESTY + ++ +E ++V E ++
Sbjct: 161 SVLVTGDFSVTPGRLLPGLRMPRGSAYDAVITESTYGNRLHGNRAEQEDRMARQVAEVIE 220
Query: 69 RGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAV--GLTEKATNYYKMFITWTNQ 126
RGG VLIP FA+GRAQE+ +L+ + R N P + V GL Y+
Sbjct: 221 RGGFVLIPAFAVGRAQEVLTILQDF-MRYNKDYPKFPLVVDGLVRSVCPVYEAHGELLRG 279
Query: 127 KIRKTF-VQRNMFDFKHI----RPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAPV 181
++ ++FD + I RP ++ + P + A+ GML+ G S+ K
Sbjct: 280 PAKRMLRTTGSLFDREDIRFVRRPEERREVVEGRPACIVASSGMLNGGPSVFYADKLIQH 339
Query: 182 ENNMLIMPGFCVQGTIGHKVLSGVKKLEFENK-----QIIDVKMAIEYMSFSAHADAKGI 236
ENN +++ G+ + + G ++L+ ++ E E + +++ V + S SAHAD + +
Sbjct: 340 ENNAILLCGYQDEESPGRRLLNMAEQPESERRWFLPDRVVPVNAKVALYSLSAHADRREL 399
Query: 237 MQLIQYCEPKNVLLVHGEASKMVFLKEKIK 266
+Q P+ VLLVHG+ L +++
Sbjct: 400 VQTAAQLRPRQVLLVHGDPEAKEMLAAELR 429
>gi|302348344|ref|YP_003815982.1| exonuclease [Acidilobus saccharovorans 345-15]
gi|302328756|gb|ADL18951.1| Putative exonuclease [Acidilobus saccharovorans 345-15]
Length = 429
Score = 112 bits (279), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 76/285 (26%), Positives = 138/285 (48%), Gaps = 16/285 (5%)
Query: 4 VKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFLKKV 63
V V + ++YT D N + + + + D +I ESTY+T + E++F + V
Sbjct: 156 VDVNGRKVLYTSDMNTIETKLMNPHKLTGVKADAVIIESTYSTVTHPERENTEKEFYESV 215
Query: 64 HECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMFITW 123
+ V RGG VL+P F++ R QE+ LL E P++ G+ T+ Y ++
Sbjct: 216 EDVVKRGGTVLVPAFSVARGQEIMCLL----EEKGFGWPVWID-GMIRSVTDLYLAHSSF 270
Query: 124 TNQKIRKTFVQRNMFDFKHIRPFD--KSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAPV 181
+ + + D K ++ + K + PG V+ ++ GML G SL + +
Sbjct: 271 IKD---PRLLAKALEDQKVVKGWGDRKKALKKPG--VIISSAGMLKGGPSLYYYTRMVQN 325
Query: 182 ENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLIQ 241
E + + + + GT G + L + + F+ ++ + VK ++ FS+H D KG++Q ++
Sbjct: 326 EKDAVFLVSYQAPGTPGRQALE--EGVFFDGERKLPVKARLQIFDFSSHTDWKGVLQTLK 383
Query: 242 -YCEPKNVLLVHGEASKMVFLKEKIKQEF-NLDCFMPANGESCFV 284
+ V+LVHGE + L KIK E +++ +P NG+ V
Sbjct: 384 SLAGVRKVILVHGEPQGQMRLASKIKDELGDVEVLVPQNGDEIPV 428
>gi|290559801|gb|EFD93125.1| beta-lactamase domain protein [Candidatus Parvarchaeum acidophilus
ARMAN-5]
Length = 629
Score = 111 bits (278), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 83/294 (28%), Positives = 147/294 (50%), Gaps = 20/294 (6%)
Query: 1 MFQVKVGNQ--SIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCR--- 55
M + +GN +++YTGD+ + L A R + LI ESTY D++ R
Sbjct: 329 MIHLNIGNGFYNVLYTGDFKYENSKTLNRAETQFERVETLIMESTYGGDA-DTQPVRSET 387
Query: 56 ERDFLKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAV-GLTEKAT 114
E+ F + + V GKVL+PV +GRAQE+ +L+E W+R L + V G+ T
Sbjct: 388 EKFFFDIISKTVANNGKVLVPVLGVGRAQEIMLLIEE-WQRYGLLPEVPVIVDGMLWDVT 446
Query: 115 NYYKMFITWTNQKIRKTFV--QRNMF---DFKH-IRPFDKSFIDNPGPMVVFATPGMLHS 168
Y ++ + N + R + N F FKH + ++ + + GP ++ AT GML+
Sbjct: 447 AIYTVYPEFMNSETRNRIITHHNNPFLSPIFKHTVSEEERQQVLSGGPAILLATSGMLNG 506
Query: 169 GLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFE-NKQIIDVKM-AIEYM- 225
G S+ F E N +++ + GT+G + +G ++++ E N ++ + A+ Y
Sbjct: 507 GPSVSYFANLCEDEKNAIVLVSYQGVGTLGRTIQNGGREVDIELNGKLKHFTVNALVYSI 566
Query: 226 -SFSAHADAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMP 276
FS H+D K + + + PK V++ HGE K++ L +++ F ++ + P
Sbjct: 567 EGFSGHSDRKQLERFVSDIRPKPRKVIIGHGEEGKILELSHYVRRRFGIETYAP 620
>gi|15789650|ref|NP_279474.1| mRNA 3'-end processing factor-like protein [Halobacterium sp.
NRC-1]
gi|169235363|ref|YP_001688563.1| mRNA 3'-end processing factor-like protein [Halobacterium salinarum
R1]
gi|10580012|gb|AAG18954.1| mRNA 3'-end processing factor homolog [Halobacterium sp. NRC-1]
gi|167726429|emb|CAP13214.1| beta-lactamase domain protein [Halobacterium salinarum R1]
Length = 641
Score = 111 bits (278), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 80/299 (26%), Positives = 140/299 (46%), Gaps = 20/299 (6%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYA--TTIRDSKRCRERDF 59
F + G ++ ++GD + R A D R + L+ ESTY + + ER
Sbjct: 340 FHIGDGLYNVAFSGDIHYDDTRLFNGAVNDFPRVETLVMESTYGGRNDYQTDQEDSERKL 399
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNL-QAPIYFAVGLTEKATNYYK 118
+ ++E + GGKVLIP FA+GR+QE+ ++LE + + PI+ G+ +AT +
Sbjct: 400 KRVINETYEDGGKVLIPAFAVGRSQEMMLVLEEAMREGEIPEMPIHLD-GMIWEATAIHT 458
Query: 119 MFITWTNQKIRKTFVQ--RNMF---DFKHIRPF--DKSFIDNPGPMVVFATPGMLHSGLS 171
+ + +R N F F HI ++ + + ++ +T GM+ G
Sbjct: 459 TYPEYLRDDLRDRIFHSDSNPFLAPQFNHIDGGEEERQAVADDDQCIILSTSGMVSGGPI 518
Query: 172 LIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQI------IDVKMAIEYM 225
+ + AP ++ L G+ QGT+G ++ SG K+ + + + + M +E +
Sbjct: 519 MSWLEHIAPDPDSTLTFVGYQAQGTLGRRIQSGRDKIPMPDSRSGGRTEHLQLNMGVETV 578
Query: 226 -SFSAHADAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGES 281
FS HAD +G+ ++ P+ VL VHG+ S L + EFN+ F P N E+
Sbjct: 579 DGFSGHADRQGLEDFVRTMNPRPEKVLCVHGDESSTQDLSSALYHEFNMRTFAPKNLET 637
>gi|346722866|ref|YP_001180618.2| RNA-metabolising metallo-beta-lactamase [Caldicellulosiruptor
saccharolyticus DSM 8903]
gi|345106837|gb|ABP67427.2| RNA-metabolising metallo-beta-lactamase [Caldicellulosiruptor
saccharolyticus DSM 8903]
Length = 540
Score = 111 bits (278), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 77/267 (28%), Positives = 138/267 (51%), Gaps = 20/267 (7%)
Query: 30 IDKCRPDLLITESTYATTIRDSKRCRERDFLKKVHECVDRGGKVLIPVFALGRAQELCI- 88
ID C D L ESTY + + + + ++ + + + GGKV+IP FA+GR QE+
Sbjct: 202 IDGC--DYLFIESTYGDRLHEDVENKSKRLIRIICDTISNGGKVIIPSFAVGRTQEILYE 259
Query: 89 ---LLETYWERMNL--QAPIYFAVGLTEKATNYYKMFITWTNQKIRKTFVQRNMF----- 138
+L T + L I+ L A+ YK I + +++ + F+++ ++
Sbjct: 260 IAKVLTTGSDEAKLIQSVEIFVDSPLATSASGVYKKHIDYFDEEAAE-FIKKGIYPLEPK 318
Query: 139 DFKHIRPFDKSFI--DNPGPMVVFATPGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGT 196
+ K +R D+S + D G ++ ++ GM +G K +NN ++ G+ T
Sbjct: 319 NLKFVRTADESKMLNDYEGSCIIISSSGMCEAGRIKHHLKHNLWKKNNTILFVGYQAPNT 378
Query: 197 IGHKVLSGVKKLEFENKQIIDVKMAIEYM-SFSAHADAKGIMQLIQYC--EPKNVLLVHG 253
+G K+L G KK++ ++ I+VK IEY+ ++S HAD G+++ I +P+ V +VHG
Sbjct: 379 LGRKLLDGQKKVKIFGEE-IEVKAKIEYIEAYSGHADKNGLIKWIDSMKNKPQQVFIVHG 437
Query: 254 EASKMVFLKEKIKQEFNLDCFMPANGE 280
E V +++K+ F+ D +PA GE
Sbjct: 438 EKESQVEFAQELKRNFDFDVHIPARGE 464
>gi|76800760|ref|YP_325768.1| mRNA 3'-end processing factor [Natronomonas pharaonis DSM 2160]
gi|76556625|emb|CAI48196.1| beta-lactamase domain protein [Natronomonas pharaonis DSM 2160]
Length = 639
Score = 111 bits (278), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 80/297 (26%), Positives = 137/297 (46%), Gaps = 18/297 (6%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYA--TTIRDSKRCRERDF 59
F V G ++ ++GD + R A D R + L+ ESTY + + E
Sbjct: 340 FHVGDGLYNVAFSGDIHYDDTRLFNGAVNDFPRVETLVLESTYGGRNDYQTDQEDSEETL 399
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQA-PIYFAVGLTEKATNYYK 118
+ + E GGKVLIP FA+GR+QE+ ++LE + P++ G+ +AT +
Sbjct: 400 KEIIRETTTEGGKVLIPAFAVGRSQEIMLVLEEAMREGEIPTVPVHLD-GMIWEATAIHT 458
Query: 119 MFITWTNQKIRKTFVQ--RNMF---DFKHIRPF--DKSFIDNPGPMVVFATPGMLHSGLS 171
+ + +R N F F HI ++ + + GP +V +T GM+ G
Sbjct: 459 TYPEYLRDDLRDRIFHDDENPFLADQFNHIDGGEEERQEVADGGPCIVLSTSGMVEGGPI 518
Query: 172 LIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFEN----KQIIDVKMAIEYM-S 226
+ E + + G+ QGT+G ++ SG ++ + + + + M +E +
Sbjct: 519 MSWLNHIGSEEKSTMTFVGYQAQGTLGSRIQSGWDEIPMNDRGNGRGTLKLNMNVETVDG 578
Query: 227 FSAHADAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGES 281
FS HAD +G+M ++ P+ VL VHG+ S + L + EFN+ F P N E+
Sbjct: 579 FSGHADRQGLMNFVRTMNPRPEKVLCVHGDESSVQDLSSALYHEFNMRTFAPKNLET 635
>gi|322369848|ref|ZP_08044410.1| hypothetical protein ZOD2009_10175 [Haladaptatus paucihalophilus
DX253]
gi|320550184|gb|EFW91836.1| hypothetical protein ZOD2009_10175 [Haladaptatus paucihalophilus
DX253]
Length = 638
Score = 111 bits (278), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 80/299 (26%), Positives = 149/299 (49%), Gaps = 22/299 (7%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDF-- 59
F + G ++ ++GD + T R A D R + LI ESTY R+ + ++D
Sbjct: 339 FHIGDGLYNVAFSGDIHYTDTRLFNGAVNDFPRVETLILESTYGG--RNDYQTDQKDSEE 396
Query: 60 -LKKV-HECVDRGGKVLIPVFALGRAQELCILLETYWERMNL-QAPIYFAVGLTEKATNY 116
LKKV +E ++GGKV+IP FA+GR+QE+ +++E ++ + P++ G+ +AT
Sbjct: 397 KLKKVINETYEKGGKVVIPAFAVGRSQEMMLVIEEAMRNGDIPEMPVHLD-GMIWEATAI 455
Query: 117 YKMFITWTNQKIRKTFVQ--RNMF---DFKHIRPFDKSFID--NPGPMVVFATPGMLHSG 169
+ + + +R N F F HI ++ D + ++ +T GM+ G
Sbjct: 456 HTTYPEYLRDDLRDRIFHEDENPFLAPQFNHIDGGEEERQDVADGDQCIILSTSGMVTGG 515
Query: 170 LSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENK----QIIDVKMAIEYM 225
+ + P ++ ++ G+ QGT+G ++ SG ++ + + +++++ +E +
Sbjct: 516 PIMSWLEHVGPDPDSTMVFVGYQAQGTLGRRIQSGWDEIPINGRGGRNKTLNLELDVETV 575
Query: 226 -SFSAHADAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGES 281
FS HAD G+M ++ P+ VL VHG+ S L + +FN+ F P N E+
Sbjct: 576 DGFSGHADRAGLMNFVKTMNPRPEKVLCVHGDESSTQDLSSSLYHDFNIRTFAPKNLET 634
>gi|170291024|ref|YP_001737840.1| beta-lactamase domain-containing protein [Candidatus Korarchaeum
cryptofilum OPF8]
gi|170175104|gb|ACB08157.1| beta-lactamase domain protein [Candidatus Korarchaeum cryptofilum
OPF8]
Length = 624
Score = 111 bits (278), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 80/294 (27%), Positives = 138/294 (46%), Gaps = 15/294 (5%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYA--TTIRDSKRCRERDF 59
++ +++YTGD+ + L A R + LI E TY + + S E
Sbjct: 328 LNIESARHNVLYTGDFRFRDTKLLDKAVRKFPRVETLIMECTYGGESDVLPSLPEAEEAL 387
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNL-QAPIYFAVGLTEKATNYYK 118
+ +RGGKVLIP A+GRAQE+ + L +ER L P+Y G+ +T +
Sbjct: 388 FSVIKGTAERGGKVLIPALAVGRAQEIMLSLVDGFERGLLPDIPVYLD-GMIYDSTAIHS 446
Query: 119 MFITWTNQKIRKTFVQRNMFDFKH-----IRPFDKSFIDNPGPMVVFATPGMLHSGLSLI 173
+ + + +R++ +R+ F + ++ + GP V+ A GML G S+
Sbjct: 447 AYPDYLSNYVRESVFKRDRDPFTEPHFNFVSSDERPDVTKGGPAVIIAPSGMLTGGPSVD 506
Query: 174 IFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQ---IIDVKMAIEYM-SFSA 229
K AP E N +++ + +GT+G ++ G ++L ++++ I VK + + FSA
Sbjct: 507 YLKLLAPGEENSIVLVSYQAEGTLGRRLRDGARELRLQDEEGYITIKVKADVRVIEGFSA 566
Query: 230 HADAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGES 281
HAD ++ + EP+ V LVHGE KM + ++ P GE+
Sbjct: 567 HADKVQLLSYLSTMEPRPHRVFLVHGEEEKMREFGPLASKSASIRTVSPQIGET 620
>gi|124028414|ref|YP_001013734.1| exonuclease [Hyperthermus butylicus DSM 5456]
gi|123979108|gb|ABM81389.1| predicted exonuclease [Hyperthermus butylicus DSM 5456]
Length = 643
Score = 111 bits (278), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 81/277 (29%), Positives = 140/277 (50%), Gaps = 18/277 (6%)
Query: 7 GNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFLKKVHEC 66
G +IVYTGD+ R L A + R + LI ESTY + + E + + + +
Sbjct: 341 GLHNIVYTGDFKFGRTRLLDKAHTEFPRVETLIMESTYGDRDQPRRDEAELELISVISKT 400
Query: 67 VDRGGKVLIPVFALGRAQELCILLETYWERMNLQAP---IYFAVGLTEKATNYYKMFITW 123
+ R GKVLIPV A+GRAQE+ ++L R L P IY G ++ T + +
Sbjct: 401 IARRGKVLIPVMAVGRAQEILLVLVDAL-RKKLLPPETKIYID-GSIKEVTAIHLTYPEL 458
Query: 124 TNQKIRKTFVQ-RNMFDFKHI-----RPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKK 177
+ ++R ++ N FD ++I R + + P V+ AT GML G S+ +
Sbjct: 459 LSAQVRARILRGENPFDHENIVRVEGRQMREDIAKSDEPGVILATAGMLTGGPSVEYLRL 518
Query: 178 WAPVENNMLIMPGFCVQGTIGHKVLSGVKKL----EFENKQIIDVKMAIEYM-SFSAHAD 232
AP N+L+ + +GT+G ++L G +++ E Q++ ++M ++ + FS H+D
Sbjct: 519 LAPDPRNVLVFVSYQAKGTLGRRILDGEREITMVDEEGKPQLVRIQMEVKSIDGFSGHSD 578
Query: 233 AKGIMQLIQYC--EPKNVLLVHGEASKMVFLKEKIKQ 267
+ ++ + +PKN++L HGE + L E I++
Sbjct: 579 RRQLLAFLANMKPKPKNIILNHGEPQSIHALAETIRR 615
>gi|354611945|ref|ZP_09029897.1| KH-domain/beta-lactamase-domain protein [Halobacterium sp. DL1]
gi|353191523|gb|EHB57029.1| KH-domain/beta-lactamase-domain protein [Halobacterium sp. DL1]
Length = 641
Score = 111 bits (278), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 78/299 (26%), Positives = 142/299 (47%), Gaps = 20/299 (6%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYA--TTIRDSKRCRERDF 59
F + G ++ ++GD + R A D R + L+ ESTY + + ER
Sbjct: 340 FHIGDGLYNVAFSGDIHYDDTRLFNGAVNDFPRVETLVLESTYGGRNDYQTDQEDSERKL 399
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNL-QAPIYFAVGLTEKATNYYK 118
+ ++E +RGGKVLIP FA+GR+QE+ ++LE + + + PI+ G+ +AT +
Sbjct: 400 KRVINETYERGGKVLIPAFAVGRSQEMMLVLEEAMRKGEIPEMPIHLD-GMIWEATAIHT 458
Query: 119 MFITWTNQKIRKTFVQ--RNMF---DFKHIRPF--DKSFIDNPGPMVVFATPGMLHSGLS 171
+ + +R +N F F HI ++ + + ++ +T GM+ G
Sbjct: 459 TYPEYLRDDLRDRIFHEDKNPFLAPQFNHIDGGEEERQAVADDDQCIILSTSGMVEGGPI 518
Query: 172 LIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQI------IDVKMAIEYM 225
+ + P ++ + G+ QGT+G ++ SG ++ + + + + M +E +
Sbjct: 519 MSWIQHIGPDPDSTMTFVGYQAQGTLGRRIQSGWDEIPMPDSRDNGRTDRLQLNMEVETV 578
Query: 226 -SFSAHADAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGES 281
FS HAD +G+ ++ P+ VL VHG+ S L + EFN+ F P N E+
Sbjct: 579 DGFSGHADRQGLEDFVRTMNPRPEKVLCVHGDESSTQDLSSALYHEFNMRTFAPKNLET 637
>gi|255513337|gb|EET89603.1| beta-lactamase domain protein [Candidatus Micrarchaeum acidiphilum
ARMAN-2]
Length = 418
Score = 111 bits (277), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 77/278 (27%), Positives = 133/278 (47%), Gaps = 17/278 (6%)
Query: 9 QSIVYTGDYNMTPDR-HLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFLKKVHECV 67
+ IVYTGD+ +TP H GA + + DLLI ESTYA + + F+ ++ + +
Sbjct: 156 KRIVYTGDFKITPQMLHAGAEIV---KSDLLIIESTYAGKSHPDRGDLIKKFVAEIRDVL 212
Query: 68 DRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMFITWTNQK 127
D GG L+P FA+GR+QEL +L N A + + G+++ AT+ + + + N
Sbjct: 213 DSGGNALVPAFAVGRSQELLAVLYK-----NGLADVTYVDGMSKSATS---IVLEYQNYI 264
Query: 128 IRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAPVENNMLI 187
K + + ++ P V+ T GML+ G L + + +
Sbjct: 265 NNKNLLNGAAKSVNWVEDYEMRNDALSQPSVILTTAGMLNGGPVLNYITRLN--SRSKIF 322
Query: 188 MPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKN 247
+ G+ V GT G +L KKL +N ++ +K + + FSAHAD + + I+ P
Sbjct: 323 LTGYQVDGTNGRLLLEN-KKLNIDNSKVA-IKTPVSFYDFSAHADNNDLYEFIEKSSPNT 380
Query: 248 VLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQ 285
+ VHG+++++ E +K D + P G+S +
Sbjct: 381 IACVHGDSTRISEFVENLKG-MGFDAYGPKVGDSIDIN 417
>gi|110667673|ref|YP_657484.1| mRNA 3'-end processing factor-like protein [Haloquadratum walsbyi
DSM 16790]
gi|109625420|emb|CAJ51844.1| beta-lactamase domain protein [Haloquadratum walsbyi DSM 16790]
Length = 639
Score = 110 bits (276), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 78/300 (26%), Positives = 144/300 (48%), Gaps = 23/300 (7%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRC----RER 57
F + G ++ ++GD + R A D R + L+ ESTY RD + ER
Sbjct: 339 FHIGDGLYNVAFSGDIHYEDTRLFNGAVNDFPRVETLVLESTYGG--RDDYQTDQEDSER 396
Query: 58 DFLKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNL-QAPIYFAVGLTEKATNY 116
+ ++ ++E DR GKV+IP FA+GR+QE+ ++LE + Q P++ G+ +AT
Sbjct: 397 ELIEVINETYDRDGKVVIPAFAVGRSQEIMLVLEEAMRTGEIPQMPVHLD-GMIWEATAI 455
Query: 117 YKMFITWTNQKIRKTFVQ--RNMF---DFKHIRPF--DKSFIDNPGPMVVFATPGMLHSG 169
+ + + +R N F +F HI ++ + + P +V +T GM+ G
Sbjct: 456 HTTYPEYLRDNLRDRIFHDDENPFLAEEFNHIDAGEEERQAVSDGEPCIVLSTSGMITGG 515
Query: 170 LSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQI-----IDVKMAIEY 224
+ + ++ L+ G+ QGT+G ++ G ++ Q+ + ++M +
Sbjct: 516 PIMSWLQHIGTDPDSRLVFVGYQAQGTLGRRIQDGWDEIPINGDQLGRSETLTLEMDVTT 575
Query: 225 MS-FSAHADAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGES 281
+S FS HAD +G+ ++ P+ +L VHG+ + L + ++NL F P N E+
Sbjct: 576 VSGFSGHADRQGLENFVKTMNPRPEKILCVHGDEQSVQDLSSGLYHDYNLRTFAPKNLET 635
>gi|383320534|ref|YP_005381375.1| metal-dependent RNAase-like protein [Methanocella conradii HZ254]
gi|379321904|gb|AFD00857.1| metal-dependent RNAase-like protein [Methanocella conradii HZ254]
Length = 637
Score = 110 bits (276), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 75/300 (25%), Positives = 150/300 (50%), Gaps = 22/300 (7%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATT--IRDSKRCRERDF 59
F + G ++ ++GD+ R A R + ++ E+TY I+ S+R E +
Sbjct: 338 FHIGEGLYNVAFSGDFKYEKTRLFDPAVNQFPRIETMVMEATYGGMHDIQPSRREAEMEI 397
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNL-QAPIYFAVGLTEKATNYYK 118
+ + + RGGKVLIP FA+GR+QE+ I+LE + + + P+Y G+ +AT +
Sbjct: 398 QNIIKKTLQRGGKVLIPTFAVGRSQEVMIVLEEAIRKGFIDKVPVYLD-GMIWEATAIHT 456
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPF---------DKSFIDNPGPMVVFATPGMLHSG 169
+ + N +++ + F + P ++ + +P P +V AT GM++ G
Sbjct: 457 TYPEYLNVELQDMIFHKGQNPF--LSPAFVQVDSPQKREAILADPSPCIVLATSGMMNGG 514
Query: 170 LSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKL----EFENKQIIDVKMAIEYM 225
+ FK++ + N LI G+ +GT+G ++ G ++ E +++ V+M + +
Sbjct: 515 PVMEYFKQYGADKKNTLIFVGYQAEGTMGRRIQKGWTEIPISTEGGKTEVMKVEMEVATV 574
Query: 226 -SFSAHADAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESC 282
FS H+D + +M+ ++ +P+ ++ HG+ +K + L I ++F + P N E+
Sbjct: 575 DGFSGHSDRRQLMEWVKRMDPRPERIITNHGDENKCLDLASSIYKKFKYETRSPMNLETI 634
>gi|388498176|gb|AFK37154.1| unknown [Lotus japonicus]
Length = 315
Score = 110 bits (275), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 58/145 (40%), Positives = 81/145 (55%), Gaps = 8/145 (5%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF V + ++YTGDY+ DRHL AA + PD+ I ESTY + RE+ F
Sbjct: 173 MFMVDIAGVRVLYTGDYSREEDRHLRAAETPQFSPDVCIIESTYGVQHHQPRHTREKRFT 232
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYW----ERMNLQAPIYFAVGLTEKATNY 116
+H + +GG+VLIP FALGRAQEL ++L+ YW E N+ PIY+A L +K
Sbjct: 233 DVIHSTISQGGRVLIPAFALGRAQELLLILDEYWTNHPELQNI--PIYYASPLAKKCLTV 290
Query: 117 YKMFITWTNQKIRKTFVQRNMFDFK 141
Y+ + N +I+ + N F FK
Sbjct: 291 YETYTLSMNDRIQNA--KSNPFSFK 313
>gi|153955532|ref|YP_001396297.1| hypothetical protein CKL_2915 [Clostridium kluyveri DSM 555]
gi|219855928|ref|YP_002473050.1| hypothetical protein CKR_2585 [Clostridium kluyveri NBRC 12016]
gi|146348390|gb|EDK34926.1| Conserved hypothetical protein [Clostridium kluyveri DSM 555]
gi|219569652|dbj|BAH07636.1| hypothetical protein [Clostridium kluyveri NBRC 12016]
Length = 832
Score = 110 bits (275), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 65/254 (25%), Positives = 127/254 (50%), Gaps = 10/254 (3%)
Query: 10 SIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFLKKVHECVDR 69
S+ Y+GD+++ + + + + RPD+ I E+TY + ++ E + ++EC +
Sbjct: 167 SLFYSGDFSVFSQKTVEGLKVPRLRPDVAIFEATYGDRLHSNREVEEERLIDIINECKRK 226
Query: 70 GGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMFITWTNQKIR 129
GK+LIP FALGR+QE+ + ++ + L+ + G+ YK+ + +
Sbjct: 227 KGKMLIPAFALGRSQEVLLTIKKALNKNILKDIKVYVDGMIRDINRTYKLNPLYLKNSLG 286
Query: 130 KTFVQR-NMFDFKHIRPFD-----KSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAPVEN 183
K ++ F +I P D K +++ +++ ++ GML G S +K AP+EN
Sbjct: 287 KKILKGIEPFYDDNIIPIDDNKLRKKILEDDESLIIISSSGMLTGGYSQYYAEKIAPMEN 346
Query: 184 NMLIMPGFCVQGTIGH----KVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQL 239
++M G+ + + G + K + N+++I V ++ + SAH+D I L
Sbjct: 347 GYIVMTGYQDEESPGRKLLELLKEEKDKKLYINEKVIPVNCKVKNVGLSAHSDKSEIKSL 406
Query: 240 IQYCEPKNVLLVHG 253
I + P+N+ +VHG
Sbjct: 407 INFLSPRNIFIVHG 420
>gi|327399985|ref|YP_004340824.1| beta-lactamase domain-containing protein [Archaeoglobus veneficus
SNP6]
gi|327315493|gb|AEA46109.1| beta-lactamase domain protein [Archaeoglobus veneficus SNP6]
Length = 407
Score = 110 bits (275), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 85/282 (30%), Positives = 138/282 (48%), Gaps = 34/282 (12%)
Query: 10 SIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFLKKVHECVDR 69
+I+YTGD + R L A ++ D+LI ESTY +R E+ F++ V E +DR
Sbjct: 145 NILYTGDVKLQETRLLEPADLNFPETDVLIVESTYFGIEHPDRRELEKAFVESVIETLDR 204
Query: 70 GGKVLIPVFALGRAQELCILLE-----TYWERMNLQAPIYFAVG-LTEKATNYYKMFITW 123
GG +IP FA+GR QE+ ++LE Y + M + VG + E+ +Y K
Sbjct: 205 GGHAIIPAFAVGRTQEVLMILEKHGITPYVDGMGKE------VGKILERYPDYIK----- 253
Query: 124 TNQKIRKTFVQRNMFDFKHIRPFDKSFIDN--PGPMVVFATPGMLHSGLSLIIFKKWAPV 181
+ +K+R+ K P ++ ++ P + T GML+ G +L K
Sbjct: 254 SARKLRRA--------LKKAIPVERGKREDVLTEPAAIVTTAGMLNGGPALFYISKLYND 305
Query: 182 ENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLIQ 241
E + +++ G+ V+GT G L K + + + +KM +E FSAHAD + + ++++
Sbjct: 306 EKSKILLTGYQVEGTNGDMALK--KGMLNLGMRTVKLKMGVEQYDFSAHADDEQLKEVVE 363
Query: 242 YCEPKN---VLLVHGEASKMVFLKEKIKQEFNLDCFMPANGE 280
K V VHGE ++ E IK E +D + P NGE
Sbjct: 364 KAANKGAEIVFTVHGEETEA--FAEWIKDEVGIDAYAPKNGE 403
>gi|16082538|ref|NP_394086.1| metal-dependent RNase [Thermoplasma acidophilum DSM 1728]
Length = 639
Score = 110 bits (275), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 80/268 (29%), Positives = 131/268 (48%), Gaps = 23/268 (8%)
Query: 34 RPDLLITESTYATTIRDSKRCRER------DFLKKVHECVDRGGKVLIPVFALGRAQELC 87
R + +TESTY D RE D + + H DRGG VLIPVFA+GR+QE+
Sbjct: 373 RAETFMTESTYGGR-DDYSFTREEATQTLIDVINRTH---DRGGSVLIPVFAVGRSQEVM 428
Query: 88 ILLETYWERMNLQAPIYFAVGLTEKATNYYKMFITWTNQKIRKTFV--QRNMF---DFKH 142
I+LE + + G+ +A + + + N+++R+ + + N F FK
Sbjct: 429 IVLEDAMRNGRIPQMDVYLDGMIMEAPAIHAAYPEYLNKELREAIMVKKENPFLSPIFKK 488
Query: 143 IRPFDK--SFIDNPGPMVVFATPGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHK 200
+ D+ +NP +V AT GM++ G + FK W+ + L+ G+ GT+G K
Sbjct: 489 VETRDQREEIAENPETTIVLATSGMMNGGPVMEYFKAWSADPKHTLVFVGYQADGTLGKK 548
Query: 201 VLSGVKK--LEFENKQII-DVKMAIEYM-SFSAHADAKGIMQLIQYCE--PKNVLLVHGE 254
+ SG ++ L E K + VKM +E FS H+ K ++ I + PK +L+ HG+
Sbjct: 549 IQSGAQEITLSGEGKNVTYPVKMDVETAEGFSGHSTKKQLLAYIATMQPKPKKILVNHGD 608
Query: 255 ASKMVFLKEKIKQEFNLDCFMPANGESC 282
K ++Q FN++ + N E+
Sbjct: 609 NGKTAEFARLVRQRFNIEAYALKNLETI 636
>gi|296242984|ref|YP_003650471.1| KH-domain/beta-lactamase-domain-containing protein [Thermosphaera
aggregans DSM 11486]
gi|296095568|gb|ADG91519.1| KH-domain/beta-lactamase-domain protein [Thermosphaera aggregans
DSM 11486]
Length = 651
Score = 110 bits (274), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 82/270 (30%), Positives = 135/270 (50%), Gaps = 15/270 (5%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFLK 61
V +G +IVYTGD+ R L A + R + LI ESTY + + ++ E + +
Sbjct: 347 LHVGMGLHNIVYTGDFKYASTRLLDRANTEFPRVETLIMESTYGSAKQQNRLEAEAELVN 406
Query: 62 KVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMFI 121
V V+R G VLIPVFA+GR QE+ ++L + + + GL + T + +
Sbjct: 407 IVKRTVERQGIVLIPVFAVGRGQEIILILNEAMKNGLIPKLNVYVEGLVNEVTAIHTQYP 466
Query: 122 TWTNQKIRKTFVQ-RNMFDFKHIRPFDKSFIDNPG-----PMVVFATPGMLHSGLSLIIF 175
+ ++ IR+ + N F ++ + S + P P V+ AT GML G ++
Sbjct: 467 EYLSRSIREAIYRGENPFTADWLKILE-SGVARPDIVEDRPSVIIATSGMLTGGPAVDYL 525
Query: 176 KKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEF--ENKQIIDVKMAIEYMS---FSAH 230
K A N L+ G+ +GT+G K+ G+++L ENK + VK+ +E S FS H
Sbjct: 526 KLLASDPRNSLVFVGYQAEGTLGRKIKDGMRELTMVVENK-VEAVKIDLEVYSIDGFSGH 584
Query: 231 ADAKGIMQLIQYCEPK--NVLLVHGEASKM 258
+D +++ Q +PK ++L HGE S +
Sbjct: 585 SDQAELVKYAQSIKPKPRKIILNHGEPSAI 614
>gi|154150928|ref|YP_001404546.1| beta-lactamase domain-containing protein [Methanoregula boonei 6A8]
gi|153999480|gb|ABS55903.1| beta-lactamase domain protein [Methanoregula boonei 6A8]
Length = 639
Score = 110 bits (274), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 74/291 (25%), Positives = 136/291 (46%), Gaps = 11/291 (3%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATT--IRDSKRCRERDF 59
F + G +I +TGD+N + R A R + L ESTY + + ++ E
Sbjct: 345 FHIGDGLYNIAFTGDFNYSKSRLFNPAINQFPRLEALFMESTYGGSNDFQPARSDAELKL 404
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKM 119
+ +++ + RGGKV+IP FA+GR+QE+ + LE + + G+ +AT +
Sbjct: 405 YETINKVLSRGGKVIIPAFAVGRSQEVMLALEEGMRLGKIPHVKIYLDGMIREATAIHTT 464
Query: 120 FITWTNQKIRKTFVQRNMFDF--KHIRPFDKSFID----NPGPMVVFATPGMLHSGLSLI 173
+ + N +R + M F + + D S + N P V+ +T GML+ G +
Sbjct: 465 YPEYLNNDLRNLIFREGMNPFLAECFQQVDSSELREKVINGDPCVIISTSGMLNGGPVME 524
Query: 174 IFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYM-SFSAHAD 232
A E N L+ G+ GT G ++ G +++ K I + + I+ + FS H+D
Sbjct: 525 YLLNLAEDEKNALVFVGYQADGTQGRRIQKGWREVPLGRKGTITINLEIDTIDGFSGHSD 584
Query: 233 AKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGES 281
+ +M I +P+ + +HG+ + + L I + +++ P N E+
Sbjct: 585 RRQLMNYIGQIQPRPEKIFCIHGDENNTIDLASSIYKRYHIQTHSPMNLET 635
>gi|9989055|gb|AAG10818.1|AC011808_6 Hypothetical protein [Arabidopsis thaliana]
Length = 489
Score = 110 bits (274), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 101/354 (28%), Positives = 163/354 (46%), Gaps = 63/354 (17%)
Query: 693 IIDLGGGQGYLSTILALQHGKKTLSLDYNQVNTHGAAVRSKKLEDSEKESKGPFQSYAGV 752
++D+G GQGYL+ +L+ Q+ +++D + + HG K+ D+ A
Sbjct: 154 VVDVGAGQGYLAQVLSFQYKHSVVAIDSS--SHHG------KVTDAR----------AAR 195
Query: 753 INKKLWMRTQNRQTSHASVEHHGKNWKRKSKAPVVSIKDEELVVCKDKCKQIT--HFVTP 810
I K Q R++ + K P V + V+ + K +T H
Sbjct: 196 IKKHF--AAQMRKSGSGN------------KCPDVPMTITCRVLSTEMLKALTDVHLEKD 241
Query: 811 DSDISSIL----SQAYPQDSLH-----NVCIMGLHTCGDLSGTALRLFTKSS-LQCLVQV 860
++D S Q+ Q S ++ + GLH CGDLS T LR F + ++ LV +
Sbjct: 242 ETDSSGSALNEEGQSRSQSSSDANRSCSLVLAGLHACGDLSVTMLRTFMECEEVKALVSI 301
Query: 861 GCCYHLLEEEFIRSPFWKDVDQSLYEHGYGFPLSEHLRSRKFFLGRNVRMSGTQSPERVI 920
GCCY+LL E K + S + GY P+S LRS F LG+N R QS ER
Sbjct: 302 GCCYNLLSE--------KSSEDSCSKCGY--PMSAGLRSLGFSLGKNARDLACQSAERWS 351
Query: 921 DLKQTQTLPLF----YRALLEKYLRSKITINDEEPKVVGRLATK-CSNFVEYVHRAVDKL 975
L + L F +RA + + + D++P L + CS+F ++ A +L
Sbjct: 352 SLGEDAGLQNFELHSFRAAFQMFTVDTV---DKKPMRQTSLNSDMCSSFEKFCLSAFSRL 408
Query: 976 KLDLEVDDEEVTRLFNSHQREYEYLQIYYFLKTALAPVIEALIVLDRVLYLREQ 1029
L+ D ++ +N E + Y+ ++ AL PV+E LI+LDR+++L+EQ
Sbjct: 409 NLE-HPRDLDLNATWNEADAFTELIGPYWSIRAALGPVLETLILLDRLMFLQEQ 461
>gi|385806125|ref|YP_005842523.1| putative exonuclease [Fervidicoccus fontis Kam940]
gi|383795988|gb|AFH43071.1| putative exonuclease [Fervidicoccus fontis Kam940]
Length = 428
Score = 110 bits (274), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 81/286 (28%), Positives = 144/286 (50%), Gaps = 28/286 (9%)
Query: 3 QVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFLKK 62
+V + S++YT D N + + +A ID D+LI ESTY ++ E+ +
Sbjct: 155 KVSTPDGSLIYTADINSIDTKLVESAKIDGAEADVLIIESTYGNANHPERKDVEKRLTES 214
Query: 63 VHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYY---KM 119
V+E +++GG VLIPVF++GR QE+ +L N +P+Y G+ ++AT K+
Sbjct: 215 VNEVLEKGGTVLIPVFSVGRGQEIMTILAE-----NDVSPVYID-GMVKQATEIMLENKV 268
Query: 120 FITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDN----PGPMVVFATPGMLHSGLSLIIF 175
F+ ++ + K + + + F K + D P V+ ++ GML G S
Sbjct: 269 FLK-NSELLEKAYSE---------QIFLKGWQDRRKVWKKPCVIVSSSGMLKGGPSRYYL 318
Query: 176 KKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKG 235
+K A E N + + F + T G + +++ +E+ VK +E+M FS+HA G
Sbjct: 319 RKIADDEKNAVFLVSFQGKNTPGRSI---IERGIYEDGG-KPVKARVEWMDFSSHAGKNG 374
Query: 236 IMQLIQYCEP-KNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGE 280
++++I+ + ++LVHGE L ++IKQ+ + +P GE
Sbjct: 375 LIEIIKSIRNLEKIILVHGEPESQNVLADEIKQQTGIVPVIPNTGE 420
>gi|300712088|ref|YP_003737902.1| hypothetical protein HacjB3_13645 [Halalkalicoccus jeotgali B3]
gi|448295777|ref|ZP_21485841.1| hypothetical protein C497_08854 [Halalkalicoccus jeotgali B3]
gi|299125771|gb|ADJ16110.1| hypothetical protein HacjB3_13645 [Halalkalicoccus jeotgali B3]
gi|445583876|gb|ELY38205.1| hypothetical protein C497_08854 [Halalkalicoccus jeotgali B3]
Length = 638
Score = 110 bits (274), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 76/297 (25%), Positives = 143/297 (48%), Gaps = 18/297 (6%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYA--TTIRDSKRCRERDF 59
F + G ++ ++GD + R L A D R + L+ ESTY + + E+
Sbjct: 339 FHIGNGMYNVAFSGDIHYDETRLLNGAVNDFPRVETLVLESTYGGRNDYQTDQSDSEQKL 398
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNL-QAPIYFAVGLTEKATNYYK 118
++ + E DRGGK++IP FA+GR+QE+ ++LE + + PI+ G+ +AT +
Sbjct: 399 IETISEAYDRGGKIVIPAFAVGRSQEIMLVLEEAMRTGKIPEMPIHLD-GMIWEATAIHT 457
Query: 119 MFITWTNQKIRKTFVQ--RNMF---DFKHIRPF--DKSFIDNPGPMVVFATPGMLHSGLS 171
+ + N +R +N F F HI ++ + + ++ +T GM+ G
Sbjct: 458 TYPEYLNDDVRDRIFHDDQNPFLADYFNHIDGGEDERQRVADDEQCIILSTSGMVTGGPI 517
Query: 172 LIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQI----IDVKMAIEYM-S 226
+ + P +N +I G+ QGT+G ++ SG ++ + + + + + +E +
Sbjct: 518 MSWLEHLGPDPDNTMIFVGYQAQGTMGRRIQSGRDEIPMNDSRGRSNRLKLNLDVETVDG 577
Query: 227 FSAHADAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGES 281
FS HAD +G+ ++ P+ VL VHG+ + L + +N+ F P N E+
Sbjct: 578 FSGHADRQGLENFVKTMNPRPEKVLCVHGDERSVQDLSSALYHNYNMRTFAPKNLET 634
>gi|223942603|gb|ACN25385.1| unknown [Zea mays]
gi|414877973|tpg|DAA55104.1| TPA: hypothetical protein ZEAMMB73_786454 [Zea mays]
Length = 537
Score = 110 bits (274), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 115/454 (25%), Positives = 200/454 (44%), Gaps = 81/454 (17%)
Query: 662 VSQLMSEKKSYEVQVMSQVVAAVTNSCDSSHIIDLGGGQGYLSTILALQHGKKTLSLDYN 721
++Q M+ KK +E++ ++ VV A+ C + ++D+G GQGYL+ L+ ++ + +++D
Sbjct: 109 LAQGMNSKKKHEIENLAAVVHAIAKRCGAKTVVDVGSGQGYLAQALSFEYQLRVVAID-- 166
Query: 722 QVNTHGAAVRSKKLEDSEKESKGPFQSYAGVINKKLWMRTQNRQTSHASVEHHGKNWKRK 781
++H A+V + E +K YA +K + T H
Sbjct: 167 -ASSHHASVTIARAERIKKH-------YAAKCVEKQLLMVPRAVTCH------------- 205
Query: 782 SKAPVVSIKDEELVVCKDKCK-----QITHFVTPDSDISSIL--SQAYPQDSLHNVCIMG 834
V D V D CK + T I+ I +Q P + + G
Sbjct: 206 -----VLSSDTLAAVTLDACKDDNGEHVRDTKTSTKKITQIQESTQGTPP-----LILAG 255
Query: 835 LHTCGDLSGTALRLFTKSS-LQCLVQVGCCYHLLEEEFIRSPFWKDVDQSLYEHGYGFPL 893
LH CGDLS LR+F ++ LV +GCCY+LL E+ ++D D GFP+
Sbjct: 256 LHACGDLSVNMLRVFVSCEQVKALVSIGCCYNLLSEDT-----YEDTDTC-----PGFPM 305
Query: 894 SEHLRSRKFFLGRNVRMSGTQSPERVIDLKQTQTLPLF----YRA----LLEKYLRSKIT 945
S+ + + LG+++R QS ER +L L F +RA +LEKY
Sbjct: 306 SKAAKHSELVLGKSIRDLACQSAERWRNLTMDIALQNFDVHAFRAAFQMVLEKYF----- 360
Query: 946 INDEEPKVVGRLATKCSNFVEYVHRAVDKLKLDLEVDDEEVTRLFNSHQREYEYLQIYYF 1005
P+V ++ S + +A+ + +L V+ + T+ + L+
Sbjct: 361 -----PEV-----SRLSPSIGRQGKALRRQRLRKVVESQMTTK--KNDDFSCSTLKEQNK 408
Query: 1006 LKTALAPVIEALIVLDRVLYLREQQKPQIISNLFLSLLTQ-----VDDEEVTRLFDSHQR 1060
+ PVI + ++ QK + + LS L + V+D + ++ Q
Sbjct: 409 NTNDVHPVIHEVDTGPDDIHQNAHQKFILFKDFTLSGLGRLGCGSVEDTSLLEIWKDVQP 468
Query: 1061 EYEYLQIYYFLKTALAPVIEALIVLDRVLYLREQ 1094
EY+ ++ L+ AL P++E I+LDR+LYL+E+
Sbjct: 469 FSEYIGPFWCLRAALGPLVETYILLDRLLYLQER 502
>gi|297808393|ref|XP_002872080.1| CPSF100 [Arabidopsis lyrata subsp. lyrata]
gi|297317917|gb|EFH48339.1| CPSF100 [Arabidopsis lyrata subsp. lyrata]
Length = 739
Score = 110 bits (274), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 63/205 (30%), Positives = 111/205 (54%), Gaps = 10/205 (4%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYAT-TIRDSKRCRERD 58
++++ + ++Y DYN +RHL + RP +LIT++ +A T + +++ R+++
Sbjct: 159 IWRITKDGEDVIYAVDYNHRKERHLNGTVLQSFVRPAVLITDAYHALYTNQTARQQRDKE 218
Query: 59 FLKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYK 118
FL + + ++ GG VL+PV GR EL ++LE +W + PIYF ++ +Y K
Sbjct: 219 FLDTISKHLEVGGNVLLPVDTAGRVLELLLILEQHWSQRGFSFPIYFLTYVSSSTIDYVK 278
Query: 119 MFITWTNQKIRKTF--VQRNMFDFKHIRPF-DKSFIDN--PGPMVVFATPGMLHSGLSLI 173
F+ W + I K+F + N F +H+ +K+ +DN PGP VV A+ L +G +
Sbjct: 279 SFLEWMSDSISKSFETSRDNAFLLRHVTLLINKTDLDNAPPGPKVVLASMASLEAGFARE 338
Query: 174 IFKKWAPVENNMLIMPGFCVQGTIG 198
IF +WA N+++ F G G
Sbjct: 339 IFVEWANDPRNLVL---FTETGQFG 360
>gi|297267035|ref|XP_001114469.2| PREDICTED: cleavage and polyadenylation specificity factor subunit
3-like [Macaca mulatta]
Length = 234
Score = 110 bits (274), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 58/165 (35%), Positives = 94/165 (56%), Gaps = 3/165 (1%)
Query: 88 ILLETYWERMN--LQAPIYFAVGLTEKATNYYKMFITWTNQKIRKTFVQRNMFDFKHIRP 145
I L+ YW+ PIY+A L +K Y+ ++ N KIRK N F FKHI
Sbjct: 7 ISLDEYWQNHPELHDIPIYYASSLAKKCMAVYQTYVNAMNDKIRKQININNPFVFKHISN 66
Query: 146 FDK-SFIDNPGPMVVFATPGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSG 204
D+ GP VV A+PGM+ SGLS +F+ W + N +I+ G+CV+GT+ ++S
Sbjct: 67 LKSMDHFDDIGPSVVMASPGMMQSGLSRELFESWCTDKRNGVIIAGYCVEGTLAKHIMSE 126
Query: 205 VKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVL 249
+++ + Q + +KM+++Y+SFSAH D + + I+ +P +V+
Sbjct: 127 PEEITTMSGQKLPLKMSVDYISFSAHTDYQQTSEFIRALKPPHVV 171
>gi|147919476|ref|YP_686784.1| mRNA cleavage/polyadenylation specificity factor, 100 kD subunit
[Methanocella arvoryzae MRE50]
gi|110622180|emb|CAJ37458.1| mRNA cleavage/polyadenylation specificity factor,100 kD subunit
[Methanocella arvoryzae MRE50]
Length = 636
Score = 110 bits (274), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 76/301 (25%), Positives = 146/301 (48%), Gaps = 27/301 (8%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATT--IRDSKRCRERDF 59
F + G ++ +TGD+ + AA + R + +I ESTY ++ ++R E +
Sbjct: 338 FHIGEGLYNVAFTGDFKYEKSKLFDAAVNNFPRVETVIMESTYGGMHDMQPARREAEIEI 397
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNL-QAPIYFAVGLTEKATNYYK 118
+ + + + RGGKVLIP FA+GR+QE+ I+LE R + P+Y G+ +AT +
Sbjct: 398 QQVIKQTLARGGKVLIPTFAVGRSQEVMIVLEEAIRRGIIDNVPVYLD-GMIWEATAIHT 456
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPFDKSFID------------NPGPMVVFATPGML 166
+ + N +++ + F SF+ +P P +V AT GM+
Sbjct: 457 TYPEYLNVELQDMIFHKGQNPF-----LSPSFVQVDSPQKREKILADPSPCIVLATSGMM 511
Query: 167 HSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFE---NKQIIDVKMAIE 223
+ G + K + N LI G+ +GT+G ++ G +L +I+ + + +
Sbjct: 512 NGGPVMEYLKTYGHDRRNTLIFVGYQAEGTLGRRIQKGWNELPMSVGGKTEIMKINLQVT 571
Query: 224 YM-SFSAHADAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGE 280
+ FS H+D + +M+ ++ +P+ ++ HG+ +K + L I +++ L+ P N E
Sbjct: 572 TVDGFSGHSDRRQLMEYVKRMDPRPERIITNHGDENKCLDLASSIYKKYKLETKSPMNLE 631
Query: 281 S 281
+
Sbjct: 632 T 632
>gi|435848113|ref|YP_007310363.1| KH-domain/beta-lactamase-domain protein [Natronococcus occultus
SP4]
gi|433674381|gb|AGB38573.1| KH-domain/beta-lactamase-domain protein [Natronococcus occultus
SP4]
Length = 644
Score = 110 bits (274), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 78/302 (25%), Positives = 140/302 (46%), Gaps = 23/302 (7%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYA--TTIRDSKRCRERDF 59
F + G ++ ++GD + R A D R + L+ ESTY + + ER
Sbjct: 340 FHIGDGLYNVAFSGDIHYDDTRLFNGAVNDFPRVETLVLESTYGGRNDYQTDQEDSERKL 399
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNL-QAPIYFAVGLTEKATNYYK 118
K ++E DR GKVLIP FA+GR+QE+ ++LE + ++ + P++ G+ +AT +
Sbjct: 400 KKVINEAYDRDGKVLIPAFAVGRSQEIMLVLEEAMRKGDIPEMPVHLD-GMIWEATAIHT 458
Query: 119 MFITWTNQKIRKTFVQ--RNMF---DFKHIRPFDKSFID--NPGPMVVFATPGMLHSGLS 171
+ + +R N F F HI ++ D + GP ++ +T GM+ G
Sbjct: 459 TYPEYLRDDLRDRIFHEDENPFLAEQFNHIDGGEEERQDVADGGPCIILSTSGMVTGGPI 518
Query: 172 LIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFEN---------KQIIDVKMAI 222
+ P ++ L+ G+ QGT+G ++ +G ++ + + + M +
Sbjct: 519 MSWLSHLGPDPDSSLVFVGYQAQGTLGRRIQNGWDEIPTSEVGAMGNGGGRGTLSLNMNV 578
Query: 223 EYM-SFSAHADAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPANG 279
E + FS HAD G+ ++ P+ VL VHG+ L + ++N+ F P N
Sbjct: 579 ETVDGFSGHADRAGLENFVKTMNPRPEKVLCVHGDERSTQDLSSALYHDYNMRTFAPKNL 638
Query: 280 ES 281
E+
Sbjct: 639 ET 640
>gi|448301186|ref|ZP_21491181.1| beta-lactamase [Natronorubrum tibetense GA33]
gi|445584700|gb|ELY39015.1| beta-lactamase [Natronorubrum tibetense GA33]
Length = 648
Score = 109 bits (273), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 82/297 (27%), Positives = 136/297 (45%), Gaps = 21/297 (7%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYA-----TTIRDSKRCRE 56
F V G +IV++GD + P R A D R + + ESTY D R
Sbjct: 352 FHVGNGFHNIVFSGDVHYEPTRLFNGAINDFPRAETMFMESTYGRRNDFQADTDESEARV 411
Query: 57 RDFLKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQA-PIYFAVGLTEKATN 115
R+ ++ E D+ G V+IP FA+GR+QEL ++LE + PIY G+ +AT
Sbjct: 412 REIIR---ETYDQNGIVVIPAFAVGRSQELMLVLEEAMREDEIPTMPIYLD-GMIREATA 467
Query: 116 YYKMFITWTNQKIRKTFVQ--RNMFDFKHIRPFD-----KSFIDNPGPMVVFATPGMLHS 168
+ + + +R+ + N F + D + I P ++ +T GM+
Sbjct: 468 IHTAYPEYLRDGLRQRILHEDENPFMADQFQQVDGGQPMREEIAGDEPSIILSTSGMVTG 527
Query: 169 GLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENK-QIIDVKMAIEYMS- 226
G + + P NN L+ G+ GT+G ++ SG ++ E + + + IE +S
Sbjct: 528 GPIMSWLELLGPESNNTLLFVGYQADGTLGRRIKSGRTEVTLEGRANHLTLHCRIESVSG 587
Query: 227 FSAHADAKGIMQLIQYC--EPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGES 281
FS HAD G+ Q ++ +P+ +L VHG+ L + Q+ N+ P N E+
Sbjct: 588 FSGHADRNGLEQYVKEMNPQPETILCVHGDDQATDQLSSALYQKHNIRTHQPKNLET 644
>gi|269986876|gb|EEZ93153.1| beta-lactamase domain protein [Candidatus Parvarchaeum acidiphilum
ARMAN-4]
Length = 629
Score = 109 bits (273), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 81/294 (27%), Positives = 147/294 (50%), Gaps = 20/294 (6%)
Query: 1 MFQVKVGNQ--SIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYA--TTIRDSKRCRE 56
M + +GN +++YTGD+ + L A R + LI ESTY + + S+ E
Sbjct: 329 MIHLNIGNGFYNVLYTGDFKYDNSKTLNRAETQFERVETLIMESTYGGESDTQPSRTETE 388
Query: 57 RDFLKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNL--QAPIYFAVGLTEKAT 114
+ FL + + + GKVLIPV +GRAQEL +L+E W+R L +AP+ G+ T
Sbjct: 389 QFFLDIISKTIANNGKVLIPVLGVGRAQELMLLIEE-WQRYGLLPEAPVVVD-GMLWDVT 446
Query: 115 NYYKMFITWTNQKIRKTFVQRNMFD-----FKH-IRPFDKSFIDNPGPMVVFATPGMLHS 168
Y ++ + N + R + N FKH + ++ + + P+++ AT GML+
Sbjct: 447 AIYTVYPEFMNSETRNRIIAHNNNPFLSPIFKHTVSEEERQQVLSGSPVIILATSGMLNG 506
Query: 169 GLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFE-NKQIIDVKM-AIEYM- 225
G ++ F N +++ + GT+G + +G ++++ E N ++ + A+ Y
Sbjct: 507 GPAVSYFANLCEDPKNAVVLVSYQGIGTLGRTIQNGGREVDIELNGKLRHFSVSALVYSI 566
Query: 226 -SFSAHADAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMP 276
FS H+D K + + + PK +++ HGE K++ L +++ F L+ P
Sbjct: 567 EGFSGHSDRKQLERFVNDIRPKPRKIIIGHGEEGKILELASYMRKRFRLETHAP 620
>gi|404330601|ref|ZP_10971049.1| beta-lactamase [Sporolactobacillus vineae DSM 21990 = SL153]
Length = 902
Score = 109 bits (273), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 75/278 (26%), Positives = 132/278 (47%), Gaps = 13/278 (4%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKC-RPDLLITESTYATTIRDSKRCRERDF 59
MF ++ G + ++ +GD + R + A + RPD ++ ESTY + E+
Sbjct: 155 MFVLEGGGEKLLVSGDLSFNAGRTIPGAEVPSGERPDAMLIESTYGNREHSDRNTEEKRL 214
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWER-MNLQAPIYFAVGLTEKATNYYK 118
V E + GG LIP FALGRAQE+ ++L+ Y +R + + PIY GL Y+
Sbjct: 215 ADNVAEVIAGGGFALIPAFALGRAQEVLLILQDYMDRGLIPEFPIYVD-GLVTPICKIYR 273
Query: 119 MFITWTNQKI-RKTFVQRNMF----DFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLI 173
+ + + R+ + ++F + P D+ + P + A+ GML G S+
Sbjct: 274 RYPQYLKGPVARRIRMHGDVFLTEGRCTAVEPKDRENVLRGKPACIVASSGMLIGGASVW 333
Query: 174 IFKKWAPVENNMLIMPGFCVQGTIGHKVLSGV----KKLEFENKQIIDVKMAIEYMSFSA 229
++ + N + + G+ + + G K+LS ++LE N + V+ + SA
Sbjct: 334 YAERLVSGDKNAIFITGYQDEESPGRKLLSLAEGQSRELEL-NGTLHPVRCRVGKYGLSA 392
Query: 230 HADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQ 267
HADA + + + P L+VHG+ V L+E+I +
Sbjct: 393 HADAGELTRFVAMVRPSRTLIVHGDDDARVALQERIDR 430
>gi|357618299|gb|EHJ71335.1| hypothetical protein KGM_14386 [Danaus plexippus]
Length = 324
Score = 109 bits (273), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 59/145 (40%), Positives = 82/145 (56%), Gaps = 12/145 (8%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGD++ DRHL AA I PD+LIT+ + RE F
Sbjct: 172 MFMIEIAGVKVLYTGDFSRQEDRHLMAAEIPTVHPDVLITK----------REERESRFT 221
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWE-RMNLQ-APIYFAVGLTEKATNYYK 118
V + V RGG+ LIPVFALGRAQEL ++L+ YW LQ PIY+A L +K Y+
Sbjct: 222 TLVSDVVGRGGRCLIPVFALGRAQELLLILDEYWSLHPELQDIPIYYASSLAKKCMAVYQ 281
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHI 143
++ N +IR+ N F F+HI
Sbjct: 282 TYVNAMNDRIRRQIAVNNPFVFRHI 306
>gi|15237845|ref|NP_197776.1| cleavage and polyadenylation specificity factor subunit 2
[Arabidopsis thaliana]
gi|18203240|sp|Q9LKF9.2|CPSF2_ARATH RecName: Full=Cleavage and polyadenylation specificity factor
subunit 2; AltName: Full=Cleavage and polyadenylation
specificity factor 100 kDa subunit; Short=AtCPSF100;
Short=CPSF 100 kDa subunit; AltName: Full=Protein EMBRYO
DEFECTIVE 1265; AltName: Full=Protein ENHANCED SILENCING
PHENOTYPE 5
gi|10176855|dbj|BAB10061.1| cleavage and polyadenylation specificity factor [Arabidopsis
thaliana]
gi|14334618|gb|AAK59487.1| putative cleavage and polyadenylation specificity factor
[Arabidopsis thaliana]
gi|28393921|gb|AAO42368.1| putative cleavage and polyadenylation specificity factor
[Arabidopsis thaliana]
gi|332005845|gb|AED93228.1| cleavage and polyadenylation specificity factor subunit 2
[Arabidopsis thaliana]
Length = 739
Score = 109 bits (272), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 63/205 (30%), Positives = 111/205 (54%), Gaps = 10/205 (4%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYAT-TIRDSKRCRERD 58
++++ + ++Y DYN +RHL + RP +LIT++ +A T + +++ R+++
Sbjct: 159 IWRITKDGEDVIYAVDYNHRKERHLNGTVLQSFVRPAVLITDAYHALYTNQTARQQRDKE 218
Query: 59 FLKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYK 118
FL + + ++ GG VL+PV GR EL ++LE +W + PIYF ++ +Y K
Sbjct: 219 FLDTISKHLEVGGNVLLPVDTAGRVLELLLILEQHWSQRGFSFPIYFLTYVSSSTIDYVK 278
Query: 119 MFITWTNQKIRKTF--VQRNMFDFKHIRPF-DKSFIDN--PGPMVVFATPGMLHSGLSLI 173
F+ W + I K+F + N F +H+ +K+ +DN PGP VV A+ L +G +
Sbjct: 279 SFLEWMSDSISKSFETSRDNAFLLRHVTLLINKTDLDNAPPGPKVVLASMASLEAGFARE 338
Query: 174 IFKKWAPVENNMLIMPGFCVQGTIG 198
IF +WA N+++ F G G
Sbjct: 339 IFVEWANDPRNLVL---FTETGQFG 360
>gi|9082326|gb|AAF82809.1|AF283277_1 polyadenylation cleavage/specificity factor 100 kDa subunit
[Arabidopsis thaliana]
Length = 739
Score = 109 bits (272), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 63/205 (30%), Positives = 111/205 (54%), Gaps = 10/205 (4%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYAT-TIRDSKRCRERD 58
++++ + ++Y DYN +RHL + RP +LIT++ +A T + +++ R+++
Sbjct: 159 IWRITKDGEDVIYAVDYNHRKERHLNGTVLQSFVRPAVLITDAYHALYTNQTARQQRDKE 218
Query: 59 FLKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYK 118
FL + + ++ GG VL+PV GR EL ++LE +W + PIYF ++ +Y K
Sbjct: 219 FLDTISKHLEVGGNVLLPVDTAGRVLELLLILEQHWSQRGFSFPIYFLTYVSSSTIDYVK 278
Query: 119 MFITWTNQKIRKTF--VQRNMFDFKHIRPF-DKSFIDN--PGPMVVFATPGMLHSGLSLI 173
F+ W + I K+F + N F +H+ +K+ +DN PGP VV A+ L +G +
Sbjct: 279 SFLEWMSDSISKSFETSRDNAFLLRHVTLLINKTDLDNAPPGPKVVLASMASLEAGFARE 338
Query: 174 IFKKWAPVENNMLIMPGFCVQGTIG 198
IF +WA N+++ F G G
Sbjct: 339 IFVEWANDPRNLVL---FTETGQFG 360
>gi|399575311|ref|ZP_10769069.1| universal archaeal kh-domain/beta-lactamase-domain-containing
protein [Halogranum salarium B-1]
gi|399239579|gb|EJN60505.1| universal archaeal kh-domain/beta-lactamase-domain-containing
protein [Halogranum salarium B-1]
Length = 641
Score = 109 bits (272), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 75/299 (25%), Positives = 142/299 (47%), Gaps = 20/299 (6%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYA--TTIRDSKRCRERDF 59
F + G ++ ++GD + R A D R + L+ ESTY + + E+
Sbjct: 340 FHIGDGLYNVAFSGDIHYKDTRLFNGAVNDFPRVETLVLESTYGGRNDYQTDQEDSEQSL 399
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQA-PIYFAVGLTEKATNYYK 118
+ ++E ++GGKV+IP FA+GR+QE+ ++LE + + P++ G+ +AT +
Sbjct: 400 VDIINETHEKGGKVVIPAFAVGRSQEIMLVLEEAMRSGKIPSMPVHLD-GMIWEATAIHT 458
Query: 119 MFITWTNQKIRKTFVQ--RNMF---DFKHIRPF--DKSFIDNPGPMVVFATPGMLHSGLS 171
+ + +R N F +F HI ++ + + GP ++ +T GM+ G
Sbjct: 459 TYPEYLRDDLRDRIFHEDENPFLAEEFNHIDGGEDERQQVTDDGPAIILSTSGMVTGGPI 518
Query: 172 LIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQ------IIDVKMAIEYM 225
+ + + L+ G+ QGT+G ++ +G ++ + + +KM +E +
Sbjct: 519 MSWLRHLGSDPESTLVFVGYQAQGTLGRRIQTGWDEIPVSDPNDPGRSSTLALKMDVETV 578
Query: 226 -SFSAHADAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGES 281
FS HAD G+M ++ P+ VL VHG+ + L + +FN+ F P N E+
Sbjct: 579 DGFSGHADRSGLMNFVKTMNPRPEKVLCVHGDERSVQDLSSSLYHDFNMRTFAPKNLET 637
>gi|387927741|ref|ZP_10130420.1| RNA-metabolising metallo-beta-lactamase [Bacillus methanolicus PB1]
gi|387589885|gb|EIJ82205.1| RNA-metabolising metallo-beta-lactamase [Bacillus methanolicus PB1]
Length = 907
Score = 109 bits (272), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 88/292 (30%), Positives = 135/292 (46%), Gaps = 15/292 (5%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHL-GAAWIDKCRPDLLITESTYATTIRDSKRCRERDF 59
MF + + ++ TGD + R + GA +PD++I ESTY + ER
Sbjct: 155 MFLFEGEGERLLITGDLSFKAGRTIPGAQVPHDIQPDVVIMESTYGNRAHTDRNTEERRL 214
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWER-MNLQAPIYFAVGLT--EKATNY 116
+ V E + GG LIP FALGRAQE+ ++L+ Y E+ + + PIY +T + N
Sbjct: 215 AEHVSEVIAGGGFALIPAFALGRAQEVLLILQDYMEKGLITEFPIYVDGLVTPISRIYNQ 274
Query: 117 YKMFITW-TNQKIRKT---FVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSL 172
Y F+ +IRK F+ K + P +++ + N P + A+ GML G S
Sbjct: 275 YPHFLKGPVAHRIRKNGDVFLTEGR--CKAVEPKERNEVLNGKPGCIVASSGMLTGGASS 332
Query: 173 IIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFE---NKQIIDVKMAIEYMSFSA 229
++ E N + + G+ + + G K+L+ LE E N VK + SA
Sbjct: 333 WYAERLISDERNAIFITGYQDEESPGRKLLNVADGLENELELNGTSYQVKCRVGKYGLSA 392
Query: 230 HADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGES 281
HADA + + IQ P LLVHG+ L I F+ + NG+S
Sbjct: 393 HADANEMNRFIQTLNPTYTLLVHGDDEARSQLGSLIDPRFH--PILVENGQS 442
>gi|448437483|ref|ZP_21587486.1| beta-lactamase [Halorubrum tebenquichense DSM 14210]
gi|445680702|gb|ELZ33144.1| beta-lactamase [Halorubrum tebenquichense DSM 14210]
Length = 497
Score = 109 bits (272), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 84/299 (28%), Positives = 142/299 (47%), Gaps = 22/299 (7%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFLK 61
F V G ++V++GD + T R A D R + LI ESTY R+ + + D +
Sbjct: 198 FHVGDGFHNVVFSGDIHHTDTRLFNGASNDFPRVESLIMESTYGR--RNDYQTDQEDSER 255
Query: 62 KVHECVD----RGGKVLIPVFALGRAQELCILLETYWERMNLQA-PIYFAVGLTEKATNY 116
KV E ++ + G V+IP FA+GR+QEL ++LE L+ PIY G+ +AT
Sbjct: 256 KVTELINDTYEQDGIVVIPAFAVGRSQELMLVLEEAMREDRLRTMPIYLD-GMIREATAI 314
Query: 117 YKMFITWTNQKIRKTFVQ--RNMFDFKHIRPFD-----KSFIDNPGPMVVFATPGMLHSG 169
+ + + +R+ + N F + D + I P ++ +T GM+ G
Sbjct: 315 HTAYPEFLRDGLRQRILHDDENPFLADQFQQVDGGQEMREDIAGGEPCIILSTSGMVTGG 374
Query: 170 LSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENK----QIIDVKMAIEYM 225
+ + N LI G+ QGT+G ++ SG +++ F ++ + + ++ +E +
Sbjct: 375 PIMSWLELLGNDPKNTLIFVGYQAQGTLGRRIQSGRREIPFSDRGSRSEQLTLRFDVESV 434
Query: 226 S-FSAHADAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGES 281
S FS HAD G+ + P+ VL VHG+ S + Q+FN+ P N E+
Sbjct: 435 SGFSGHADRNGLEDFVGTMNPRPERVLCVHGDESSTDQFSSGLYQKFNIRTQSPKNMET 493
>gi|224121102|ref|XP_002330904.1| predicted protein [Populus trichocarpa]
gi|222872726|gb|EEF09857.1| predicted protein [Populus trichocarpa]
Length = 740
Score = 108 bits (271), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 64/204 (31%), Positives = 110/204 (53%), Gaps = 9/204 (4%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKC-RPDLLITESTYATTIRDSKRCRERDF 59
++++ + +VY D+N +RHL ++ RP +LIT++ A + S++ R++ F
Sbjct: 159 VWKITKDGEDVVYAVDFNHRKERHLNGTVLESFYRPAVLITDAYNALNSQPSRQQRDKQF 218
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKM 119
L+ + + ++ GG VL+PV + GR EL ++LE +W + L PI+F ++ +Y K
Sbjct: 219 LETILKTLEGGGNVLLPVDSAGRVLELLLILEQFWGQRFLNYPIFFLSYVSSSTIDYIKS 278
Query: 120 FITWTNQKIRKTF--VQRNMFDFKHIRPF-DKSFIDNP--GPMVVFATPGMLHSGLSLII 174
F+ W + I K+F + N F KH+ K +DN GP VV A+ L +G S I
Sbjct: 279 FLEWMSDSIAKSFETSRDNAFLMKHVTLLISKDELDNASTGPKVVLASVASLEAGFSHDI 338
Query: 175 FKKWAPVENNMLIMPGFCVQGTIG 198
F +WA N+++ F +G G
Sbjct: 339 FAEWAADVKNLVL---FTERGQFG 359
>gi|335437956|ref|ZP_08560713.1| beta-lactamase domain protein [Halorhabdus tiamatea SARL4B]
gi|334893560|gb|EGM31771.1| beta-lactamase domain protein [Halorhabdus tiamatea SARL4B]
Length = 640
Score = 108 bits (271), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 77/300 (25%), Positives = 145/300 (48%), Gaps = 23/300 (7%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFLK 61
F + G ++ ++GD + R A + R + L+ ESTY R+ + + D +
Sbjct: 340 FHIGDGLYNVAFSGDIHYDDTRLFNGAVNEFPRVETLVLESTYGG--RNDYQTDQEDSEQ 397
Query: 62 KVHECV----DRGGKVLIPVFALGRAQELCILLETYWERMNL-QAPIYFAVGLTEKATNY 116
++ E + ++GG VLIP FA+GR+QEL ++LE ++ + P++ G+ +AT
Sbjct: 398 RLKEIIRDTYEQGGSVLIPAFAVGRSQELMLVLEEAMREGDIPEMPVHLD-GMMWEATAI 456
Query: 117 YKMFITWTNQKIRKTFVQ--RNMF---DFKHIRPF--DKSFIDNPGPMVVFATPGMLHSG 169
+ + + +R N F F HI ++ + + P +V +T GM+ G
Sbjct: 457 HTTYPEYLRDDLRDRIFHDDENPFLADQFNHIDGGEEERQEVTDGEPAIVLSTSGMVTGG 516
Query: 170 LSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENK-----QIIDVKMAIEY 224
+ + + P ++ L G+ QGT+G ++ +G ++ N + ++M +E
Sbjct: 517 PIMSWLEHFGPDPDSTLTFVGYQAQGTLGRRIQNGWDEIPIGNNGRGRGNTLSLEMNVET 576
Query: 225 M-SFSAHADAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGES 281
+ FS HAD +G+ ++ P+ VL VHG+ S + L + E+N+ F P N E+
Sbjct: 577 VDGFSGHADRQGLENFVKTMNPRPEKVLCVHGDESSVQDLSSALYHEYNMRTFAPKNLET 636
>gi|448629175|ref|ZP_21672574.1| mRNA 3'-end processing factor-like protein [Haloarcula vallismortis
ATCC 29715]
gi|445757741|gb|EMA09082.1| mRNA 3'-end processing factor-like protein [Haloarcula vallismortis
ATCC 29715]
Length = 640
Score = 108 bits (271), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 81/302 (26%), Positives = 144/302 (47%), Gaps = 27/302 (8%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYA------TTIRDSKRCR 55
F + G ++ ++GD + R L A D R + L+ ESTY T DS+R
Sbjct: 340 FHIGEGRYNVAFSGDIHYKDTRLLDGAVNDFPRVETLVLESTYGGKNDYQTDQSDSERVL 399
Query: 56 ERDFLKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQA-PIYFAVGLTEKAT 114
RD + + HE GK+LIP FA+GR+QEL ++LE ++ P+Y G+ +AT
Sbjct: 400 -RDVINEAHE---NDGKILIPAFAVGRSQELMLVLEEAMREGDIPTMPVYLD-GMIREAT 454
Query: 115 NYYKMFITWTNQKIRKTFV--QRNMF---DFKHIRPFD--KSFIDNPGPMVVFATPGMLH 167
+ + + +R+ + N F F+ + D + I + P ++ T GM+
Sbjct: 455 AIHTAYPEYLRDDLRQRILYEDENPFLAEQFEQVDGGDEMRQDIADDEPAIILTTSGMVT 514
Query: 168 SGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFEN-----KQIIDVKMAI 222
G + + ++ + G+ +GT+G ++ G ++ + + + +KM I
Sbjct: 515 GGPVMSWLRLLGSDPDSTMAFVGYQAEGTLGRQIQRGQDEITLGDTSGPRAERVSLKMNI 574
Query: 223 EYM-SFSAHADAKGIMQLIQ--YCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANG 279
E + FS HAD +G+ ++ + P+ +L VHG+AS L + Q+FN+ P N
Sbjct: 575 ETVDGFSGHADRQGLESFVETMHPRPEKILCVHGDASTTNQLSSALYQKFNMRTHNPKNL 634
Query: 280 ES 281
E+
Sbjct: 635 ET 636
>gi|448411939|ref|ZP_21576295.1| mRNA 3'-end processing factor [Halosimplex carlsbadense 2-9-1]
gi|445669873|gb|ELZ22481.1| mRNA 3'-end processing factor [Halosimplex carlsbadense 2-9-1]
Length = 639
Score = 108 bits (271), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 75/297 (25%), Positives = 141/297 (47%), Gaps = 18/297 (6%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYA--TTIRDSKRCRERDF 59
F + G ++ ++GD + R A + R + L+ ESTY + + ER
Sbjct: 340 FHIGDGLYNVAFSGDIHYDDTRLFNGAVNEFPRVETLVLESTYGGRNDYQTDQADSERKL 399
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNL-QAPIYFAVGLTEKATNYYK 118
+ + + ++R GKVLIP FA+GR+QE+ ++LE ++ + P++ G+ +AT +
Sbjct: 400 KEVISDTIERDGKVLIPAFAVGRSQEIMLVLEEAMREGDIPEVPVHLD-GMIWEATAIHT 458
Query: 119 MFITWTNQKIRKTFVQ--RNMF---DFKHIRPF--DKSFIDNPGPMVVFATPGMLHSGLS 171
+ + +R +N F F HI ++ + + GP +V +T GM+ G
Sbjct: 459 TYPEYLRDDLRDRIFHDDQNPFLADQFNHIDGGEEERQEVADGGPCIVLSTSGMVEGGPI 518
Query: 172 LIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFEN----KQIIDVKMAIEYM-S 226
+ + + ++ G+ QGT+G ++ +G ++ + + + M +E +
Sbjct: 519 MSWLRHVGGQDEGTMVFVGYQAQGTMGRRIQNGWDEIPVNDGGGRADTLSLNMDVETVDG 578
Query: 227 FSAHADAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGES 281
FS HAD +G+ ++ P+ VL VHG+ S L + EFN+ F P N E+
Sbjct: 579 FSGHADRQGLENFVRTMNPRPEKVLCVHGDESSTQDLSSALYHEFNMRTFAPKNLET 635
>gi|282165070|ref|YP_003357455.1| hypothetical protein MCP_2400 [Methanocella paludicola SANAE]
gi|282157384|dbj|BAI62472.1| conserved hypothetical protein [Methanocella paludicola SANAE]
Length = 636
Score = 108 bits (270), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 77/302 (25%), Positives = 147/302 (48%), Gaps = 29/302 (9%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATT--IRDSKRCRERDF 59
F V G ++ ++GD+ R A R + ++ ESTY ++ S+R E D
Sbjct: 338 FHVGEGLYNVAFSGDFKYEKTRLFDPAVNQFPRIETMVMESTYGGMHDMQPSRREAEIDL 397
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNL-QAPIYFAVGLTEKATNYYK 118
+ + RGGK L+P FA+GR+QE+ I+LE + + P+Y G+ +AT +
Sbjct: 398 QNIIRRTLQRGGKALVPTFAVGRSQEVMIVLEEAIRKGIIDNVPVYLD-GMIWEATAIHT 456
Query: 119 MFITWTNQKIRKTFVQRNMFDFKHIRPF-------------DKSFIDNPGPMVVFATPGM 165
+ + N +++ +M K PF +S + +P P +V AT GM
Sbjct: 457 TYPEYLNVELQ------DMIFHKGQNPFLSSSFVQVDSQQKRQSILADPSPCIVLATSGM 510
Query: 166 LHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDV-KMAIEY 224
++ G + FK++ + N LI G+ +GT+G ++ G ++ + DV ++ +E
Sbjct: 511 MNGGPVMEYFKQYGADKRNTLIFVGYQAEGTMGRRIQKGWTEIPMSTEGRTDVMRVEMEV 570
Query: 225 MS---FSAHADAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPANG 279
+ FS H+D + +M+ ++ +P+ ++ HG+ +K + L I +++ + P N
Sbjct: 571 ATVDGFSGHSDRRQLMEWVKRMDPRPERIITNHGDENKCLDLASSIYKKYKHETKSPMNL 630
Query: 280 ES 281
E+
Sbjct: 631 ET 632
>gi|148702077|gb|EDL34024.1| cleavage and polyadenylation specificity factor 3, isoform CRA_a
[Mus musculus]
Length = 261
Score = 108 bits (270), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 52/108 (48%), Positives = 71/108 (65%), Gaps = 2/108 (1%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ ++YTGD++ DRHL AA I +PD+LI ESTY T I + + RE F
Sbjct: 114 MFMIEIAGVKLLYTGDFSRQEDRHLMAAEIPNIKPDILIIESTYGTHIHEKREEREARFC 173
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMN--LQAPIYFA 106
VH+ V+RGG+ LIPVFALGRAQEL ++L+ YW+ PIY+A
Sbjct: 174 NTVHDIVNRGGRGLIPVFALGRAQELLLILDEYWQNHPELHDIPIYYA 221
>gi|150388612|ref|YP_001318661.1| RNA-metabolising metallo-beta-lactamase [Alkaliphilus
metalliredigens QYMF]
gi|149948474|gb|ABR47002.1| RNA-metabolising metallo-beta-lactamase [Alkaliphilus
metalliredigens QYMF]
Length = 543
Score = 108 bits (270), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 85/292 (29%), Positives = 138/292 (47%), Gaps = 16/292 (5%)
Query: 11 IVYTGDYNMTPDRHL-GAAWIDKCRPDLLITESTYATTIRDSKRCRERDFLKKVHECVDR 69
IV++GD M + ID D LI ESTY + ++ R + + +++ V R
Sbjct: 182 IVFSGDLGMKNKPLIRDPEIIDDA--DYLILESTYGNRVHENVEQRMDELMSTINKTVLR 239
Query: 70 GGKVLIPVFALGRAQELCILLETYWERMN-----LQAPIYFAVGLTEKATNYYKMFITWT 124
GG V+IP FA+GR QEL L Y++ N ++ PIY + AT +K
Sbjct: 240 GGTVIIPSFAVGRTQELIYELSQYYKANNELDAFMRIPIYIDSPMAISATEVFKKNSYCF 299
Query: 125 NQKIRKTFV-QRNMFDFKH---IRPFDKSFIDNPG--PMVVFATPGMLHSGLSLIIFKKW 178
+ + +K + N DF++ +R +S N P V+ + GM G K
Sbjct: 300 DDETKKLILGGSNPLDFENLHFVRDHKESMRLNESNYPKVIISASGMCTGGRIRHHLKHN 359
Query: 179 APVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQ 238
NN LI G+ QGT+G + G KK++ ++I + FS HAD +++
Sbjct: 360 LWKANNSLIFVGYQAQGTLGRILKDGAKKVKLLGEEIAVLSEIYSIEGFSGHADQTDLIE 419
Query: 239 LIQYC--EPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQTDM 288
+ + +PK V LVHGE + + L IK++F++ +P G S ++ D+
Sbjct: 420 WLGHFKKKPKRVFLVHGEDTSLDNLSSLIKEKFDIPTIIPNMGYSFQIEDDV 471
>gi|13541495|ref|NP_111183.1| metal-dependent RNase [Thermoplasma volcanium GSS1]
gi|14324879|dbj|BAB59805.1| cleavage and polyadenylation specificity factor [Thermoplasma
volcanium GSS1]
Length = 639
Score = 108 bits (270), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 76/268 (28%), Positives = 128/268 (47%), Gaps = 23/268 (8%)
Query: 34 RPDLLITESTYATTIRDSKRCRER------DFLKKVHECVDRGGKVLIPVFALGRAQELC 87
R + +TESTY D RE D + + H DRGG VLIPVFA+GR+QE+
Sbjct: 373 RAETFMTESTYGGR-DDYSFTREEATETLVDVINRTH---DRGGSVLIPVFAVGRSQEVM 428
Query: 88 ILLETYWERMNLQAPIYFAVGLTEKATNYYKMFITWTNQKIRKTFVQR-------NMFDF 140
I+LE + + G+ +AT + + + N+++R+ + R +F
Sbjct: 429 IVLEDAMRNGRIPQMNVYLDGMIMEATAIHAAYPEYLNKELREAIMVRKENPFLSQIFKK 488
Query: 141 KHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHK 200
R + ++P VV AT GM++ G + FK WA + L+ G+ GT+G +
Sbjct: 489 VETREQREDIAEDPESKVVLATSGMMNGGPVMEYFKAWAGNPKHTLVFVGYQADGTMGKR 548
Query: 201 VLSGVKKLEF-ENKQII--DVKMAIEYM-SFSAHADAKGIMQLIQYCE--PKNVLLVHGE 254
+ SG K++ E + + + M +E FS H+ K ++ I + PK +L+ HG+
Sbjct: 549 IQSGAKEVTISEGGKTVTYQINMDVETAEGFSGHSTKKQLLAYIATMQPKPKKILVNHGD 608
Query: 255 ASKMVFLKEKIKQEFNLDCFMPANGESC 282
K ++Q FN++ + N E+
Sbjct: 609 NGKTAEFARLVRQRFNIESYALKNLETI 636
>gi|88602138|ref|YP_502316.1| beta-lactamase-like protein [Methanospirillum hungatei JF-1]
gi|88187600|gb|ABD40597.1| beta-lactamase-like protein [Methanospirillum hungatei JF-1]
Length = 629
Score = 108 bits (269), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 76/292 (26%), Positives = 140/292 (47%), Gaps = 13/292 (4%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTY--ATTIRDSKRCRERDF 59
F V G ++ +TGD + R AA R + L ESTY A ++ ++ E
Sbjct: 335 FHVGDGLYNVAFTGDLHYGKSRLFNAAVNHYPRLEALFMESTYGGAQDMQPNRADAEERL 394
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPI-YFAVGLTEKATNYYK 118
E ++RGGK++IP FA+GR+QE+ + +E R+ P+ + G+ ++AT +
Sbjct: 395 YGVFREVLERGGKIIIPAFAVGRSQEVMLAIEEGM-RLGKFPPVKVYLDGMIKEATAIHT 453
Query: 119 MFITWTNQKIRKTFVQ--RNMFDFKHIRPFD----KSFIDNPGPMVVFATPGMLHSGLSL 172
+ + N ++R + N F + D + + P V+ T GML+ G +
Sbjct: 454 TYPEYLNSELRNLIFKEGHNPFLAECFEQVDSAEKRERVITGEPCVIITTSGMLNGGPVM 513
Query: 173 IIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKM-AIEYMSFSAHA 231
+ A E N L+ G+ GT+G ++ G +++ N++ I V + I FS H+
Sbjct: 514 EYLRNLAADERNCLVFVGYQADGTLGRRIQKGWREIPIGNRETIVVNLDCITVDGFSGHS 573
Query: 232 DAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGES 281
D + +M + + +PK + +HG+ +K + L I ++ ++ P N E+
Sbjct: 574 DRRQLMNFVSHMQPKPEKIFAIHGDENKTIDLASSIYKKLHIQTTSPMNLET 625
>gi|448302487|ref|ZP_21492466.1| mRNA 3'-end processing factor-like protein [Natronorubrum tibetense
GA33]
gi|445581153|gb|ELY35515.1| mRNA 3'-end processing factor-like protein [Natronorubrum tibetense
GA33]
Length = 637
Score = 108 bits (269), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 79/297 (26%), Positives = 137/297 (46%), Gaps = 18/297 (6%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYA--TTIRDSKRCRERDF 59
F + G +IV++GD + + R A D R + L+ ESTY + + ER
Sbjct: 338 FHIGDGLHNIVFSGDVHYSDTRLFNGAVNDFPRVETLVMESTYGRRGDYQTDQEDSERKL 397
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQA-PIYFAVGLTEKATNYYK 118
+ + + GKV+IP FA+GR+QEL ++LE L PIY G+ +AT +
Sbjct: 398 IDLITRTYENDGKVVIPAFAVGRSQELMLVLEEAMRTEKLPTMPIYLD-GMIREATAIHT 456
Query: 119 MFITWTNQKIRKTFVQ--RNMFDFKHIRPFD-----KSFIDNPGPMVVFATPGMLHSGLS 171
+ + +R+ + N F D + I P ++ +T GM+ G
Sbjct: 457 AYPEFLRDGLRQRILHEDENPFLADQFEQVDGGEEMREAIAGGEPCIILSTSGMVTGGPI 516
Query: 172 LIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQI----IDVKMAIEYMS- 226
+ + N LI G+ +GT+G ++ G ++++ +++ + ++ IE +S
Sbjct: 517 MSWLELLGGDPENALIFVGYQAEGTLGRRIQGGNREIQLTDRRGETNRLTLRFTIESVSG 576
Query: 227 FSAHADAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGES 281
FS HAD G+ +Q P+ +L VHG+ S L + Q+FN + P N E+
Sbjct: 577 FSGHADRNGLENFVQSMNPRPQEILCVHGDKSATDQLSSALYQKFNCRTYAPRNLET 633
>gi|289192409|ref|YP_003458350.1| RNA-metabolising metallo-beta-lactamase [Methanocaldococcus sp.
FS406-22]
gi|288938859|gb|ADC69614.1| RNA-metabolising metallo-beta-lactamase [Methanocaldococcus sp.
FS406-22]
Length = 427
Score = 108 bits (269), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 88/285 (30%), Positives = 136/285 (47%), Gaps = 21/285 (7%)
Query: 4 VKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDL--LITESTYATTIRDSKRCRERDFLK 61
+K N++++YTGD + R A + R D+ LI ESTY +I ++ E F++
Sbjct: 152 LKYPNKTVLYTGDIKLRDTRLTKGADLSYTRDDIDVLIIESTYGNSIHPDRKAVELSFIE 211
Query: 62 KVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYF---AVGLTEKATNYYK 118
K+ E + RGG LIPVFA+ RAQE+ ++L Y N+ APIY AV +T NY
Sbjct: 212 KIREILFRGGVALIPVFAVDRAQEILLILNDY----NIDAPIYLDGMAVEVTRLMLNYKN 267
Query: 119 MFITWTN-QKIRKT--FVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIF 175
M T+ +K K ++++ K I K N G VV T GML G L
Sbjct: 268 MLNEATHLEKALKNVKIIEKSEDRVKAIEDLSK----NGG--VVVTTAGMLDGGPILYYL 321
Query: 176 KKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKG 235
K + N L++ G+ V+ + G ++ K F K I + + +FS HA
Sbjct: 322 KLFMHNPKNALLLTGYQVRDSNGRHLIETGKI--FIGKDEIKPNLEVCMYNFSCHAGMDE 379
Query: 236 IMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGE 280
+ ++I+ P+ +++ HGE + L+ E D P GE
Sbjct: 380 LHEIIKKVNPELLIIQHGEEVQATILR-NWALEHGFDAITPKLGE 423
>gi|219851430|ref|YP_002465862.1| beta-lactamase [Methanosphaerula palustris E1-9c]
gi|219545689|gb|ACL16139.1| beta-lactamase domain protein [Methanosphaerula palustris E1-9c]
Length = 630
Score = 108 bits (269), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 71/295 (24%), Positives = 141/295 (47%), Gaps = 19/295 (6%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTY--ATTIRDSKRCRERDF 59
F + G +I +TGD+N R A R + L ESTY A ++ +++ E
Sbjct: 336 FHLGDGLYNIAFTGDFNYGKSRLFNPAVSTFPRLEALFMESTYGGANDMQPARKDAEEKL 395
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKM 119
+ V++ ++R GKV+IP FA+GR+QE+ + LE + + G+ ++AT +
Sbjct: 396 YETVNKVLERSGKVIIPAFAVGRSQEVMLALEEGMRLGKIPNVKIYLDGMIKEATAIHTT 455
Query: 120 FITWTNQKIRKTFVQRNM----------FDFKHIRPFDKSFIDNPGPMVVFATPGMLHSG 169
+ + N +R + + D +R +K + + P V+ T GM++ G
Sbjct: 456 YPEYLNSDLRNLIFREGLNPFLAECFVQVDSNELR--EKVVLGD--PCVIITTSGMMNGG 511
Query: 170 LSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYM-SFS 228
+ A E N L+ G+ GT+G ++ G +++ ++ I + + I + FS
Sbjct: 512 PVMEYLHNLAGDEKNALVFVGYQADGTLGRRIQKGWREVPLGRRETIVINLEIVTVDGFS 571
Query: 229 AHADAKGIMQLIQYCE--PKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGES 281
H+D + +M + + + P+ + +HG+ +K + L I + ++++ P N E+
Sbjct: 572 GHSDRRQLMNFVGHVQPHPEKIFTIHGDENKTIDLASSIYKRYHIETHSPMNLET 626
>gi|169598186|ref|XP_001792516.1| hypothetical protein SNOG_01891 [Phaeosphaeria nodorum SN15]
gi|111068983|gb|EAT90103.1| hypothetical protein SNOG_01891 [Phaeosphaeria nodorum SN15]
Length = 833
Score = 107 bits (268), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 100/403 (24%), Positives = 171/403 (42%), Gaps = 70/403 (17%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF +++ I++TGDY+ DRHL +A + R Y + CR R
Sbjct: 180 MFLMEIAGLKILFTGDYSREDDRHLVSASVPPGRQGRRSHYRVYFRYLHPYTACRARG-- 237
Query: 61 KKVHECVDRGGKVLIPVFALGRAQE--------LCILLETYWERMNLQAPIYFAVGLTEK 112
HE +R + P A G + + + T E+ + PIY+ L +
Sbjct: 238 -PAHEVDNRRPEPRWPCSASGFSPSAARKNFFSFSMSIGTKHEQYQ-KIPIYYNSSLARR 295
Query: 113 ATNYYKMFITWTNQKIRKTFVQR-------------NMFDFKHIRPF---DKSFIDNPGP 156
Y+ ++ N I++ F +R +DF+ +R DK + G
Sbjct: 296 CMQVYQTYLGAMNDNIKRLFAERMNEAQAEGDTGRRGAWDFRFVRSLKSMDK--FQDLGA 353
Query: 157 MVVFATPGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVL-------------- 202
V+ A+PGM+ SG S + ++WAP N +I+ G+ V+GT+ +++
Sbjct: 354 CVMLASPGMMQSGTSRELLERWAPDPRNGVIITGYSVEGTMAKQIVHEPDQIPAIMTRSA 413
Query: 203 -SGVKKLEFENKQ-IIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVF 260
+ + + EN+Q +I + ++ SF+AH D K M+ +Q V+LVHGE M
Sbjct: 414 NAARRPGQKENEQSMIPRRCTVQEYSFAAHVDGKENMEFVQEVAAPVVILVHGEKGNMTR 473
Query: 261 LKEKI-----KQEFNLDCFMPANGESCFV--QTD-----------MKISIDVSVNLLKEE 302
LK K+ ++ + PAN E + +TD + + S++ E
Sbjct: 474 LKSKLLSFNAQKTIPTKIYSPANCEELRIPFKTDKVAKVVGKLASIAPPLPRSLSNGDEA 533
Query: 303 AVKYNSEPPNPLKERQIHGVLVIKDSSISLMDVDEACKEVGIS 345
KY E + + GVLV D ISLM ++ + G++
Sbjct: 534 EAKYEEE------VQMVSGVLVQNDFKISLMAPEDLKEYAGLT 570
>gi|448414471|ref|ZP_21577540.1| mRNA 3-end processing factor-like protein [Halosarcina pallida JCM
14848]
gi|445682037|gb|ELZ34461.1| mRNA 3-end processing factor-like protein [Halosarcina pallida JCM
14848]
Length = 634
Score = 107 bits (268), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 83/303 (27%), Positives = 150/303 (49%), Gaps = 29/303 (9%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFLK 61
F + G ++V++GD + R A D R + L+ ESTY RD + +RD +
Sbjct: 334 FHIGDGFHNVVFSGDIHYEDTRLFNGASNDFPRAETLVMESTYGG--RDDYQTDQRDSEE 391
Query: 62 KVHECV----DRGGKVLIPVFALGRAQELCILLETYWERMNLQA----PIYFAVGLTEKA 113
K+ E + + GGKV+IP FA+GR+QEL ++LE E M+ A P+Y G+ +A
Sbjct: 392 KLFELISETHEAGGKVVIPAFAVGRSQELMLVLE---EAMSEGAIPTMPVYLD-GMIREA 447
Query: 114 TNYYKMFITWTNQKIRKTFVQR--NMFDFKHIRPFD-----KSFIDNPGPM-VVFATPGM 165
T + + + ++R+ + N F + + D + I G V+ +T GM
Sbjct: 448 TAIHTAYPHFLRDELRQRILHEDDNPFLAEQFQQVDGGHDMRERIARDGEQCVILSTSGM 507
Query: 166 LHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENK----QIIDVKMA 221
+ G + + N L+ G+ QGT+G ++ SG +++ FE++ + ++++
Sbjct: 508 VTGGPIMSWLELLGGEPENALMFVGYQAQGTLGRRIQSGRREIPFESRGRRNNELTLRLS 567
Query: 222 IEYMS-FSAHADAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPAN 278
+ +S FS HAD G+ + ++ P+ +L VHG+ S L + Q+ ++ P N
Sbjct: 568 VASVSGFSGHADRAGLEEFVRTMSPRPERILCVHGDESSTDQLSSGLYQDLHVRTHAPRN 627
Query: 279 GES 281
E+
Sbjct: 628 LET 630
>gi|448671829|ref|ZP_21687634.1| mRNA 3'-end processing factor-like protein [Haloarcula amylolytica
JCM 13557]
gi|445764965|gb|EMA16108.1| mRNA 3'-end processing factor-like protein [Haloarcula amylolytica
JCM 13557]
Length = 640
Score = 107 bits (268), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 80/303 (26%), Positives = 145/303 (47%), Gaps = 29/303 (9%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYA------TTIRDSKRCR 55
F + G ++ ++GD + R L A D R + L+ ESTY T DS+R
Sbjct: 340 FHIGEGRYNVAFSGDIHYKDTRLLDGAVNDFPRVETLVLESTYGGKNDYQTDQSDSERV- 398
Query: 56 ERDFLKKV-HECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQA-PIYFAVGLTEKA 113
LK V +E ++ GK+LIP FA+GR+QEL ++LE + ++ P+Y G+ +A
Sbjct: 399 ----LKDVINEAYEKEGKILIPAFAVGRSQELMLVLEEAMRKGDIPTMPVYLD-GMIREA 453
Query: 114 TNYYKMFITWTNQKIRKTFV--QRNMF---DFKHIRPFD--KSFIDNPGPMVVFATPGML 166
T + + + + +R+ + N F F + D + I + P ++ T GM+
Sbjct: 454 TAIHTAYPEYLREDLRQRILYEDENPFLAEQFAQVDGGDEMRRDITDDEPAIILTTSGMV 513
Query: 167 HSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFEN-----KQIIDVKMA 221
G + + +N + G+ +GT+G ++ G ++ + + + +KM
Sbjct: 514 TGGPVMSWLRLLGSDPDNTMAFVGYQAEGTLGRQIQRGQDEITLGDTSGPRAERVSLKMG 573
Query: 222 IEYM-SFSAHADAKGIMQLIQ--YCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPAN 278
+E + FS HAD +G+ ++ + P+ +L VHG+ S L + Q+FN+ P N
Sbjct: 574 VETVDGFSGHADRQGLESFVETMHPRPEKILCVHGDESSTNQLSSALYQKFNMRTHNPKN 633
Query: 279 GES 281
E+
Sbjct: 634 LET 636
>gi|334127672|ref|ZP_08501577.1| metallo-beta-lactamase [Centipeda periodontii DSM 2778]
gi|333388985|gb|EGK60152.1| metallo-beta-lactamase [Centipeda periodontii DSM 2778]
Length = 555
Score = 107 bits (268), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 86/297 (28%), Positives = 143/297 (48%), Gaps = 35/297 (11%)
Query: 11 IVYTGDYNMT-------PDRHLGAAWIDKCRPDLLITESTYATTI-RDSKRCRERDFLKK 62
+V++GD P+R G D L+ ESTY + RD + E L+
Sbjct: 200 LVFSGDLGQPNQPILRDPERIAGT--------DFLLVESTYGDRVHRDYDK--ETALLEV 249
Query: 63 VHECVDRGGKVLIPVFALGRAQELCILLETYWE--RMNLQAPIYFAVGLTEKATN-YYKM 119
+ + +DRGG V+IP FA+GR Q L W+ R++ PI L +AT + K
Sbjct: 250 IRDTMDRGGNVIIPSFAVGRTQTLLYYFYNLWKAGRLDGDIPIIIDSPLAIQATRVFLKN 309
Query: 120 FITWTNQKIRKTFVQRNMFDFKHIRPFD-----KSFIDNPGPMVVFATPGMLHSGLSLII 174
+ + + I Q + F +R + ++ G ++ + GM +G L
Sbjct: 310 YEDFDEETIAFFGEQGTIPSFPQVRIAETAAESRALNSAEGSAIILSASGMADAGRVLHH 369
Query: 175 FKK--WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYM-SFSAHA 231
K W P E+ +L + G+ +G++G +++ G+K++ ++I VK I+ + FSAHA
Sbjct: 370 LKHNLWRP-ESTVLFV-GYQAEGSLGRRLVDGIKRVRVLGEEIA-VKAQIQVLEGFSAHA 426
Query: 232 DAKGI---MQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQ 285
DA+ I M+ I P V LVHGE LKE+I++E L+ + P G++ +Q
Sbjct: 427 DAEQIVDWMRSITAPRPAKVFLVHGEGQAQEALKERIQEELGLEVYAPFLGDTVTIQ 483
>gi|448406221|ref|ZP_21572743.1| mRNA 3'-end processing factor-like protein [Halosimplex
carlsbadense 2-9-1]
gi|445678060|gb|ELZ30555.1| mRNA 3'-end processing factor-like protein [Halosimplex
carlsbadense 2-9-1]
Length = 640
Score = 107 bits (268), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 78/301 (25%), Positives = 141/301 (46%), Gaps = 25/301 (8%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFLK 61
F V G ++ ++GD + R L A D R + L+ ESTY R+ + + D +
Sbjct: 340 FHVGEGFYNVAFSGDIHYRDTRLLDGAVNDFPRVETLVLESTYGG--RNDYQTDQEDSER 397
Query: 62 K----VHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQA-PIYFAVGLTEKATNY 116
K ++E ++GGK+LIP FA+GR+QEL ++LE + P+Y G+ +AT
Sbjct: 398 KLETVINEAHEKGGKILIPAFAVGRSQELMLVLEEAMREDRIPTMPVYLD-GMIREATAI 456
Query: 117 YKMFITWTNQKIRKTFV--QRNMFDFKHIRPFD------KSFIDNPGPMVVFATPGMLHS 168
+ + +++R+ + N F RP D + +D+ P +V T GM+
Sbjct: 457 HAAYPDHLRERLRQRILYEDDNPFLADQFRPVDGGDEMRRDVVDDE-PSIVLTTSGMITG 515
Query: 169 GLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFEN-----KQIIDVKMAIE 223
G + + + L+ G+ +GT+G ++ G ++ + + + M ++
Sbjct: 516 GPIMSWLRLLGGDPESTLVFVGYQAEGTLGRQIQQGRDEISLNDTAGRGANRVALNMDVQ 575
Query: 224 YM-SFSAHADAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGE 280
+ FS HAD +G+ ++ P+ VL VHG+ S L + +EF + P N E
Sbjct: 576 TVDGFSGHADRQGLENFVRTMNPRPETVLCVHGDESSTDHLSSALYEEFGMRTVAPKNLE 635
Query: 281 S 281
+
Sbjct: 636 T 636
>gi|448605870|ref|ZP_21658463.1| mRNA 3-end processing factor-like protein [Haloferax sulfurifontis
ATCC BAA-897]
gi|448625251|ref|ZP_21671018.1| mRNA 3-end processing factor-like protein [Haloferax denitrificans
ATCC 35960]
gi|445741193|gb|ELZ92697.1| mRNA 3-end processing factor-like protein [Haloferax sulfurifontis
ATCC BAA-897]
gi|445749013|gb|EMA00459.1| mRNA 3-end processing factor-like protein [Haloferax denitrificans
ATCC 35960]
Length = 640
Score = 107 bits (268), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 75/299 (25%), Positives = 143/299 (47%), Gaps = 20/299 (6%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYA--TTIRDSKRCRERDF 59
F + G ++ ++GD + R A D R + L+ ESTY + ++ E+
Sbjct: 339 FHIGDGLYNVAFSGDIHYEDTRLFNGAVNDFPRVETLVLESTYGGRNDYQTDQQDSEKRL 398
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNL-QAPIYFAVGLTEKATNYYK 118
++ ++E DRGGKV+IP FA+GR+QE+ ++LE + + P++ G+ +AT +
Sbjct: 399 IEVINETYDRGGKVVIPAFAVGRSQEIMLVLEEAMRSGKIPEMPVHLD-GMIWEATAIHT 457
Query: 119 MFITWTNQKIRKTFVQ--RNMF---DFKHIRPFDKSFID--NPGPMVVFATPGMLHSGLS 171
+ + +R N F +F HI ++ D + ++ +T GM+ G
Sbjct: 458 TYPEYLRDDLRDRIFHEDENPFLAEEFNHIDGGEEERQDVADGEQAIILSTSGMVTGGPI 517
Query: 172 LIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENK------QIIDVKMAIEYM 225
+ + P + L+ G+ QGT+G ++ +G ++ + + +KM +E +
Sbjct: 518 MSWLRHVGPDPKSRLVFVGYQAQGTLGRRIQNGWDEIPVNGRDGMGRSDTLKLKMDVETV 577
Query: 226 -SFSAHADAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGES 281
FS HAD +G+ ++ P+ VL VHG+ + L + ++N+ F P N E+
Sbjct: 578 DGFSGHADRQGLENFVKTMNPRPEKVLCVHGDERSVQDLSSALYHDYNMRTFAPKNLET 636
>gi|333897952|ref|YP_004471826.1| beta-lactamase [Thermoanaerobacterium xylanolyticum LX-11]
gi|333113217|gb|AEF18154.1| beta-lactamase domain protein [Thermoanaerobacterium xylanolyticum
LX-11]
Length = 537
Score = 107 bits (267), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 74/268 (27%), Positives = 131/268 (48%), Gaps = 18/268 (6%)
Query: 36 DLLITESTYATTIRDSKRCRERDFLKKVHECVDRGGKVLIPVFALGRAQELCILL----E 91
D L+ ESTY + + + + ++ + + + RGG V+IP FA+GR QEL + E
Sbjct: 206 DYLVCESTYGDRLHEDVGDKAKKLMEIIKKTISRGGNVIIPSFAVGRTQELLYEIHKDEE 265
Query: 92 TYWERMNL--QAPIYFAVGLTEKATNYYKMFITWTNQKIRKTFVQRNMF-------DFKH 142
Y + + P+Y L T+ ++ + + + + R ++V+ + F H
Sbjct: 266 LYKNEIEYISKVPVYVDSPLATSVTDVFRKHLDYFDDEAR-SYVENGDYPLDFPNLHFTH 324
Query: 143 IRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVL 202
K+ D P+++ + GM +G K ++ ++ G+ +GT+G ++L
Sbjct: 325 STEESKALNDLKEPVIIISASGMCEAGRIKHHLKHNLWRSDSTIVFVGYQAKGTLGRRIL 384
Query: 203 SGVKKLEFENKQIIDVKMAIEYM-SFSAHADAKGIMQLIQYC--EPKNVLLVHGEASKMV 259
G K + ++I V I+ + SFS HAD KGIM I +PK V +VHGE S
Sbjct: 385 EGEKTVNIFGEEIT-VNAEIQNIESFSGHADQKGIMDWISSFKKKPKKVFIVHGEDSAQK 443
Query: 260 FLKEKIKQEFNLDCFMPANGESCFVQTD 287
L EKIK FN++ +P+ ++ ++D
Sbjct: 444 VLSEKIKDVFNIETVIPSKYDTYDFESD 471
>gi|292655037|ref|YP_003534934.1| mRNA 3-end processing factor-like protein [Haloferax volcanii DS2]
gi|448292661|ref|ZP_21483067.1| mRNA 3-end processing factor-like protein [Haloferax volcanii DS2]
gi|291372149|gb|ADE04376.1| mRNA 3-end processing factor homolog [Haloferax volcanii DS2]
gi|445572417|gb|ELY26957.1| mRNA 3-end processing factor-like protein [Haloferax volcanii DS2]
Length = 640
Score = 107 bits (267), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 75/299 (25%), Positives = 142/299 (47%), Gaps = 20/299 (6%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYA--TTIRDSKRCRERDF 59
F + G ++ ++GD + R A D R + L+ ESTY + ++ E
Sbjct: 339 FHIGDGLYNVAFSGDIHYEDTRLFNGAVNDFPRVETLVLESTYGGRNDYQTDQQDSEERL 398
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNL-QAPIYFAVGLTEKATNYYK 118
++ ++E DRGGKV+IP FA+GR+QE+ ++LE + + P++ G+ +AT +
Sbjct: 399 IEVINETYDRGGKVVIPAFAVGRSQEIMLVLEEAMRSGKIPEMPVHLD-GMIWEATAIHT 457
Query: 119 MFITWTNQKIRKTFVQ--RNMF---DFKHIRPFDKSFID--NPGPMVVFATPGMLHSGLS 171
+ + +R N F +F HI ++ D + ++ +T GM+ G
Sbjct: 458 TYPEYLRDDLRDRIFHEDENPFLAEEFNHIDGGEEERQDVADGDQAIILSTSGMVTGGPI 517
Query: 172 LIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENK------QIIDVKMAIEYM 225
+ + P + L+ G+ QGT+G ++ +G ++ + + +KM +E +
Sbjct: 518 MSWLRHVGPDPKSRLVFVGYQAQGTLGRRIQNGWDEIPVNGRDSMGRSDTLKLKMDVETV 577
Query: 226 -SFSAHADAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGES 281
FS HAD +G+ ++ P+ VL VHG+ + L + ++N+ F P N E+
Sbjct: 578 DGFSGHADRQGLENFVKTMNPRPEKVLCVHGDERSVQDLSSALYHDYNMRTFAPKNLET 636
>gi|357057533|ref|ZP_09118392.1| hypothetical protein HMPREF9334_00108 [Selenomonas infelix ATCC
43532]
gi|355375192|gb|EHG22481.1| hypothetical protein HMPREF9334_00108 [Selenomonas infelix ATCC
43532]
Length = 536
Score = 107 bits (267), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 88/297 (29%), Positives = 143/297 (48%), Gaps = 35/297 (11%)
Query: 11 IVYTGDYNMT-------PDRHLGAAWIDKCRPDLLITESTYATTI-RDSKRCRERDFLKK 62
IV++GD P+ GA D L+ ESTY I RD + E L+
Sbjct: 181 IVFSGDLGQPNQPILRDPEAIAGA--------DFLLVESTYGDRIHRDYDK--ETALLEV 230
Query: 63 VHECVDRGGKVLIPVFALGRAQELCILLETYWE--RMNLQAPIYFAVGLTEKATN-YYKM 119
+ + +DRGG V+IP FA+GR Q L W+ R++ PI L +AT + K
Sbjct: 231 IRDTMDRGGNVIIPSFAVGRTQTLLYYFYNLWKAGRLDGDIPIIIDSPLAIQATRVFLKN 290
Query: 120 FITWTNQKIRKTFVQRNMFDFKHIRPFD-----KSFIDNPGPMVVFATPGMLHSGLSLII 174
+ + + I Q + F +R + ++ G ++ + GM +G L
Sbjct: 291 YEDFDEETIAFFGEQGTIPAFPQVRIAETAAESRALNSAEGSAIILSASGMADAGRVLHH 350
Query: 175 FKK--WAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYM-SFSAHA 231
K W P E+ +L + G+ +G++G +++ G+K++ ++I VK I+ + FSAHA
Sbjct: 351 LKHNLWRP-ESTILFV-GYQAEGSLGRRLVDGIKRVRVLGEEIA-VKAQIQVLEGFSAHA 407
Query: 232 DAKGI---MQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQ 285
DA+ I M+ I+ P V LVHGE LKE+I++E L+ + P G+ +Q
Sbjct: 408 DAEQIVDWMRSIKDPRPAKVFLVHGEGQAQEALKERIQEELGLEVYAPFLGDIVTIQ 464
>gi|429735291|ref|ZP_19269260.1| metallo-beta-lactamase domain protein [Selenomonas sp. oral taxon
138 str. F0429]
gi|429159403|gb|EKY01918.1| metallo-beta-lactamase domain protein [Selenomonas sp. oral taxon
138 str. F0429]
Length = 550
Score = 107 bits (267), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 81/265 (30%), Positives = 132/265 (49%), Gaps = 20/265 (7%)
Query: 36 DLLITESTYATTI-RDSKRCRERDFLKKVHECVDRGGKVLIPVFALGRAQELCILLETYW 94
D L+ ESTY I RD + E L+ + + +DRGG V+IP FA+GR Q L W
Sbjct: 219 DFLLVESTYGDRIHRDYDK--ESALLEVIRDTMDRGGNVIIPSFAVGRTQTLLYYFYNLW 276
Query: 95 E--RMNLQAPIYFAVGLTEKATN-YYKMFITWTNQKIRKTFVQRNMFDFKHIRPFD---- 147
+ R++ PI L +AT + K + + + I Q + F +R +
Sbjct: 277 KAGRLDGDIPIIIDSPLAIQATRVFLKNYEDFDEEAIAFFGEQGTIPSFPQVRIAETAAE 336
Query: 148 -KSFIDNPGPMVVFATPGMLHSGLSLIIFKK--WAPVENNMLIMPGFCVQGTIGHKVLSG 204
++ G ++ + GM +G L K W P E+ +L + G+ +G++G +++ G
Sbjct: 337 SRALNSAEGSAIILSASGMADAGRVLHHLKHNLWRP-ESTVLFV-GYQAEGSLGRRLVDG 394
Query: 205 VKKLEFENKQIIDVKMAIEYM-SFSAHADAKGI---MQLIQYCEPKNVLLVHGEASKMVF 260
+K++ ++I VK I+ + FSAHADA+ I M+ I P V LVHGE
Sbjct: 395 IKRVRVLGEEIA-VKAQIQVLEGFSAHADAEQIVEWMRSITEPRPAKVFLVHGEGQAQEA 453
Query: 261 LKEKIKQEFNLDCFMPANGESCFVQ 285
LKE+I++E L+ + P G+ +Q
Sbjct: 454 LKERIQEELGLEVYAPFLGDIVTIQ 478
>gi|374724464|gb|EHR76544.1| putative metal-dependent RNase [uncultured marine group II
euryarchaeote]
Length = 634
Score = 107 bits (267), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 75/296 (25%), Positives = 142/296 (47%), Gaps = 16/296 (5%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATT--IRDSKRCRERDF 59
+ G +IV++GD AA R + L+ ESTY + + S++ ++
Sbjct: 335 LHIGEGKHNIVFSGDQKYEKSWLFDAANTRFPRVETLVIESTYGASGDYQPSRQEANQEL 394
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKM 119
V + RGGK++ PVFA+GR+QE+ I ++ + ++ + G+ ++AT +
Sbjct: 395 QDIVSRTISRGGKLICPVFAVGRSQEVMIAIDELFRSGQVKPVPVWLDGMIQEATAIHAS 454
Query: 120 FITWTNQKIRKTFVQ---RNMFDFKHIRP-----FDKSFIDNPGPMVVFATPGMLHSGLS 171
+ +RK+ ++ N F + RP +S I + P +V AT GML+ G
Sbjct: 455 HPDYLTSSLRKSLLKDDGENPFTSEWFRPVKGRELRESIIMDNSPCIVLATSGMLNGGPV 514
Query: 172 LIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFE-NKQIIDVKMAIEYMS---F 227
+ F+ WA + N L G+ +GT+G ++ G ++ N + VK+ E ++ F
Sbjct: 515 MEYFRHWAHEDRNSLCFVGYQAEGTLGRRLQKGFGEVPMVINGKTEIVKIGCEMVTIDGF 574
Query: 228 SAHADAKGIMQLIQYC--EPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGES 281
S H+D + ++ + +PKN++ HG+ K L +++ + + P N E+
Sbjct: 575 SGHSDRRQLLDFVDQLSPKPKNIICHHGDYHKCNELGHTLRERYKCRTYAPKNLET 630
>gi|150400464|ref|YP_001324230.1| beta-lactamase domain-containing protein [Methanococcus aeolicus
Nankai-3]
gi|150013167|gb|ABR55618.1| beta-lactamase domain protein [Methanococcus aeolicus Nankai-3]
Length = 432
Score = 107 bits (267), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 82/270 (30%), Positives = 131/270 (48%), Gaps = 20/270 (7%)
Query: 9 QSIVYTGDYNMTPDRHLGAAWIDKCRPDL--LITESTYATTIRDSKRCRERDFLKKVHEC 66
+ IVYTGD ++ R + A + C D+ LI ESTY + E++FL KV E
Sbjct: 163 KRIVYTGDTRVSKTRLVNGADLSYCDNDINALIIESTYGGENHPDRELLEKEFLNKVKET 222
Query: 67 VDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMFITWTNQ 126
V+RGG +IPVFA+ R+QE+ ++L +N PIYF GL K T K+ +
Sbjct: 223 VERGGVAIIPVFAVDRSQEILLML----NEVNWSVPIYFD-GLGIKIT---KIMLNQNPS 274
Query: 127 KIRKTFVQRNMFDFKHIRPFDKSFI-----DNPGPMVVFATPGMLHSGLSLIIFKKWAPV 181
+ ++ F H D+ ++ +N G ++ +T GML G L +
Sbjct: 275 YLENHAKLKSAFLSTHKVKEDRKYVIEDLRENGG--IIVSTAGMLSGGPVLNYIDNFWND 332
Query: 182 ENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLIQ 241
+ LI+ G+ V+ T G ++L K + I K + FSAHAD + +++
Sbjct: 333 PRSSLILTGYQVKNTAG-RILLDTGKFPI-GELFIKPKFEVAQYEFSAHADMDELRTIVK 390
Query: 242 YCEPKNVLLVHGEASKMVFLKE-KIKQEFN 270
P+N+++ HGE ++ KE I+ FN
Sbjct: 391 KANPENLIIQHGEEESIMLFKEWAIENGFN 420
>gi|401564801|ref|ZP_10805668.1| beta-Casp domain protein [Selenomonas sp. FOBRC6]
gi|400188495|gb|EJO22657.1| beta-Casp domain protein [Selenomonas sp. FOBRC6]
Length = 536
Score = 107 bits (267), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 81/265 (30%), Positives = 132/265 (49%), Gaps = 20/265 (7%)
Query: 36 DLLITESTYATTI-RDSKRCRERDFLKKVHECVDRGGKVLIPVFALGRAQELCILLETYW 94
D L+ ESTY I RD + E L+ + + +DRGG V+IP FA+GR Q L W
Sbjct: 205 DFLLVESTYGDRIHRDYDK--ESALLEVIRDTMDRGGNVIIPSFAVGRTQTLLYYFYNLW 262
Query: 95 E--RMNLQAPIYFAVGLTEKATN-YYKMFITWTNQKIRKTFVQRNMFDFKHIRPFD---- 147
+ R++ PI L +AT + K + + + I Q + F +R +
Sbjct: 263 KAGRLDGDIPIIIDSPLAIQATRVFLKNYEDFDEEAIAFFGEQGTIPSFPQVRIAETAAE 322
Query: 148 -KSFIDNPGPMVVFATPGMLHSGLSLIIFKK--WAPVENNMLIMPGFCVQGTIGHKVLSG 204
++ G ++ + GM +G L K W P E+ +L + G+ +G++G +++ G
Sbjct: 323 SRALNSAEGSAIILSASGMADAGRVLHHLKHNLWRP-ESTVLFV-GYQAEGSLGRRLVDG 380
Query: 205 VKKLEFENKQIIDVKMAIEYM-SFSAHADAKGI---MQLIQYCEPKNVLLVHGEASKMVF 260
+K++ ++I VK I+ + FSAHADA+ I M+ I P V LVHGE
Sbjct: 381 IKRVRVLGEEIA-VKAQIQVLEGFSAHADAEQIVEWMRSITEPRPAKVFLVHGEGQAQEA 439
Query: 261 LKEKIKQEFNLDCFMPANGESCFVQ 285
LKE+I++E L+ + P G+ +Q
Sbjct: 440 LKERIQEELGLEVYAPFLGDIVTIQ 464
>gi|448729880|ref|ZP_21712192.1| universal archaeal kh-domain/beta-lactamase-domain-containing
protein [Halococcus saccharolyticus DSM 5350]
gi|445794201|gb|EMA44754.1| universal archaeal kh-domain/beta-lactamase-domain-containing
protein [Halococcus saccharolyticus DSM 5350]
Length = 635
Score = 107 bits (266), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 74/296 (25%), Positives = 138/296 (46%), Gaps = 17/296 (5%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYA--TTIRDSKRCRERDF 59
F + G ++ ++GD + R A D R + L+ ESTY + + E
Sbjct: 337 FHIGDGFYNVAFSGDIHYDDTRLFNGAVNDFPRVETLVMESTYGGRNDYQTDQADSEEKL 396
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNL-QAPIYFAVGLTEKATNYYK 118
++ ++E DRGGKVLIP FA+GR+QE+ ++LE + + P++ G+ +AT +
Sbjct: 397 IEVINETADRGGKVLIPAFAVGRSQEIMLVLEEAMRSGKIPEMPVHLD-GMIWEATAIHT 455
Query: 119 MFITWTNQKIRKTFVQ--RNMF---DFKHIRPFD--KSFIDNPGPMVVFATPGMLHSGLS 171
+ + +R N F F HI D + + P ++ +T GM+ G
Sbjct: 456 TYPEYLRDDLRDRIFHDDENPFLAPQFNHIDEGDDEREEVAAGDPAIILSTSGMVTGGPI 515
Query: 172 LIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENK---QIIDVKMAIEYM-SF 227
+ + ++ G+ QGT+G ++ +G ++ + ++++ +E + F
Sbjct: 516 MSWLDHLGADTDTTMVFVGYQAQGTLGRRIQNGWDEIPRNGRGRSNTLNLEFDVETVDGF 575
Query: 228 SAHADAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGES 281
S HAD +G+ ++ P+ VL VHG+ S + L + +N+ F P N E+
Sbjct: 576 SGHADRQGLENFVKTMNPRPEKVLCVHGDESSVQDLSSALYHNYNMRTFAPKNLET 631
>gi|242084170|ref|XP_002442510.1| hypothetical protein SORBIDRAFT_08g021070 [Sorghum bicolor]
gi|241943203|gb|EES16348.1| hypothetical protein SORBIDRAFT_08g021070 [Sorghum bicolor]
Length = 551
Score = 107 bits (266), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 117/460 (25%), Positives = 199/460 (43%), Gaps = 92/460 (20%)
Query: 662 VSQLMSEKKSYEVQVMSQVVAAVTNSCDSSHIIDLGGGQGYLSTILALQHGKKTLSLDYN 721
++Q M+ KK +E++ ++ VV A+ C + +ID+G GQGYL+ L+ ++ +++D
Sbjct: 122 LAQGMNSKKKHEIENLAAVVHAIAKRCGAKTVIDVGSGQGYLAQSLSFEYQLPVVAID-- 179
Query: 722 QVNTHGAAVRSKKLEDSEKESKGPFQSYAGVINKKLWMRTQNRQTSH------------- 768
++H A+V + E +K YA +K + T H
Sbjct: 180 -ASSHHASVTMARAERIKKH-------YAAKCVEKQLLMVPRTVTCHVLSSDTLAAVTLD 231
Query: 769 ASVEHHGKNWKRKSKAPVVSIKDEELVVCKDKCKQITHFVTPDSDISSILSQAYPQDSLH 828
A + HG R++KA K+IT L+Q P
Sbjct: 232 ACKDDHGGEHVRETKAST---------------KKITQIQE--------LTQGSPP---- 264
Query: 829 NVCIMGLHTCGDLSGTALRLFTKSS-LQCLVQVGCCYHLLEEEFIRSPFWKDVDQSLYEH 887
+ + GLH CGDLS LR+F ++ LV VGCCY+LL E+ +D D
Sbjct: 265 -LILAGLHACGDLSVNMLRVFVSCEQVKALVSVGCCYNLLSEDSN-----EDTDTC---- 314
Query: 888 GYGFPLSEHLRSRKFFLGRNVRMSGTQSPERVIDLKQTQTLPLF----YRA----LLEKY 939
GFP+S+ + + LG+++R QS ER +L L F +RA +LEKY
Sbjct: 315 -PGFPMSKAAKLSELVLGKSIRDLACQSAERWRNLTMDIALQNFDVHAFRAAFQMVLEKY 373
Query: 940 LRSKITINDEEPKVVGRLATKCSNFVEYVHRAVDKLKLDLEVDDEEVTRLFNSHQREYEY 999
P+V RL+ + + R + ++ ++ E+ +S +E
Sbjct: 374 F----------PEV-SRLSPSIGRQGKALRRQRLRKVVESQMTTEKTDNFSSSTLKEQNK 422
Query: 1000 LQIYYFLKTALAPVIEALIVLDRVLYLREQQKPQIISNLFLSLLTQ-----VDDEEVTRL 1054
+ VI + ++ QK + + LS L + V+D + +
Sbjct: 423 N------TNDVDSVIHGVDTGPDDIHHNACQKFTLFKDFTLSGLGRLGCGSVEDSRLLEI 476
Query: 1055 FDSHQREYEYLQIYYFLKTALAPVIEALIVLDRVLYLREQ 1094
+ Q EY+ ++ L+ AL P++E I+LDR+LYL+E+
Sbjct: 477 WKDVQPFSEYIGPFWCLRAALGPLVETYILLDRLLYLQER 516
>gi|448322522|ref|ZP_21511992.1| KH-domain/beta-lactamase-domain-containing protein [Natronococcus
amylolyticus DSM 10524]
gi|445601280|gb|ELY55269.1| KH-domain/beta-lactamase-domain-containing protein [Natronococcus
amylolyticus DSM 10524]
Length = 644
Score = 107 bits (266), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 77/302 (25%), Positives = 138/302 (45%), Gaps = 23/302 (7%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYA--TTIRDSKRCRERDF 59
F + G ++ ++GD + R A D R + L+ ESTY + + ER
Sbjct: 340 FHIGDGLYNVAFSGDIHYDDTRLFNGAVNDFPRVETLVLESTYGGRNDYQTDQEDSERKL 399
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNL-QAPIYFAVGLTEKATNYYK 118
+ ++E DR GKVLIP FA+GR+QE+ ++LE + + P++ G+ +AT +
Sbjct: 400 VDVINEAHDRDGKVLIPAFAVGRSQEIMLVLEEAMRSGKIPEMPVHLD-GMIWEATAIHT 458
Query: 119 MFITWTNQKIRKTFVQ--RNMF---DFKHIRPFDKSFID--NPGPMVVFATPGMLHSGLS 171
+ + +R N F F HI ++ D + GP ++ +T GM+ G
Sbjct: 459 TYPEYLRDDLRDRIFHEDENPFLAEQFNHIDAGEEERQDVADGGPCIILSTSGMVTGGPI 518
Query: 172 LIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFEN---------KQIIDVKMAI 222
+ P ++ L+ G+ QGT+G ++ +G ++ + + + M +
Sbjct: 519 MSWLSHLGPDPDSSLVFVGYQAQGTLGRRIQNGWDEIPTSEVGAAGNGGGRGTLSLNMNV 578
Query: 223 EYM-SFSAHADAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPANG 279
E + FS HAD G+ ++ P+ VL VHG+ L + ++N+ F P N
Sbjct: 579 ETVDGFSGHADRAGLENFVKTMNPRPEKVLCVHGDERSTQDLSSALYHDYNMRTFAPKNL 638
Query: 280 ES 281
E+
Sbjct: 639 ET 640
>gi|433424491|ref|ZP_20406487.1| mRNA 3-end processing factor-like protein [Haloferax sp. BAB2207]
gi|448572052|ref|ZP_21640141.1| mRNA 3-end processing factor-like protein [Haloferax lucentense DSM
14919]
gi|432198078|gb|ELK54402.1| mRNA 3-end processing factor-like protein [Haloferax sp. BAB2207]
gi|445721085|gb|ELZ72754.1| mRNA 3-end processing factor-like protein [Haloferax lucentense DSM
14919]
Length = 640
Score = 106 bits (265), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 75/299 (25%), Positives = 142/299 (47%), Gaps = 20/299 (6%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYA--TTIRDSKRCRERDF 59
F + G ++ ++GD + R A D R + L+ ESTY + ++ E
Sbjct: 339 FHIGDGLYNVAFSGDIHYEDTRLFNGAVNDFPRVETLVLESTYGGRNDYQTDQQDSEERL 398
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNL-QAPIYFAVGLTEKATNYYK 118
++ ++E DRGGKV+IP FA+GR+QE+ ++LE + + P++ G+ +AT +
Sbjct: 399 IEVINETYDRGGKVVIPAFAVGRSQEIMLVLEEAMRSGKIPEMPVHLD-GMIWEATAIHT 457
Query: 119 MFITWTNQKIRKTFVQ--RNMF---DFKHIRPFDKSFID--NPGPMVVFATPGMLHSGLS 171
+ + +R N F +F HI ++ D + ++ +T GM+ G
Sbjct: 458 TYPEYLRDDLRDRIFHEDENPFLAEEFNHIDGGEEERQDVADGEQAIILSTSGMVTGGPI 517
Query: 172 LIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENK------QIIDVKMAIEYM 225
+ + P + L+ G+ QGT+G ++ +G ++ + + +KM +E +
Sbjct: 518 MSWLRHVGPDPKSRLVFVGYQAQGTLGRRIQNGWDEIPVNGRDGMGRSDTLKLKMDVETV 577
Query: 226 -SFSAHADAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGES 281
FS HAD +G+ ++ P+ VL VHG+ + L + ++N+ F P N E+
Sbjct: 578 DGFSGHADRQGLENFVKTMNPRPEKVLCVHGDERSVQDLSSALYHDYNMRTFAPKNLET 636
>gi|448542567|ref|ZP_21624729.1| mRNA 3-end processing factor-like protein [Haloferax sp. ATCC
BAA-646]
gi|448549964|ref|ZP_21628569.1| mRNA 3-end processing factor-like protein [Haloferax sp. ATCC
BAA-645]
gi|445707046|gb|ELZ58912.1| mRNA 3-end processing factor-like protein [Haloferax sp. ATCC
BAA-646]
gi|445713012|gb|ELZ64793.1| mRNA 3-end processing factor-like protein [Haloferax sp. ATCC
BAA-645]
Length = 640
Score = 106 bits (265), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 74/299 (24%), Positives = 142/299 (47%), Gaps = 20/299 (6%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYA--TTIRDSKRCRERDF 59
F + G ++ ++GD + R A D R + L+ ESTY + ++ E
Sbjct: 339 FHIGDGLYNVAFSGDIHYEDTRLFNGAVNDFPRVETLVLESTYGGRNDYQTDQQDSEERL 398
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNL-QAPIYFAVGLTEKATNYYK 118
++ ++E DRGGKV+IP FA+GR+QE+ ++LE + + P++ G+ +AT +
Sbjct: 399 IEVINETYDRGGKVVIPAFAVGRSQEIMLVLEEAMRSGKIPEMPVHLD-GMIWEATAIHT 457
Query: 119 MFITWTNQKIRKTFVQ--RNMF---DFKHIRPF--DKSFIDNPGPMVVFATPGMLHSGLS 171
+ + +R N F +F HI ++ + + ++ +T GM+ G
Sbjct: 458 TYPEYLRDDLRDRIFHEDENPFLAEEFNHIDGGEEERQEVADGDQAIILSTSGMVTGGPI 517
Query: 172 LIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENK------QIIDVKMAIEYM 225
+ + P + L+ G+ QGT+G ++ +G ++ + + +KM +E +
Sbjct: 518 MSWLRHVGPDPKSRLVFVGYQAQGTLGRRIQNGWDEIPVNGRDGMGRSDTLKLKMDVETV 577
Query: 226 -SFSAHADAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGES 281
FS HAD +G+ ++ P+ VL VHG+ + L + ++N+ F P N E+
Sbjct: 578 DGFSGHADRQGLENFVKTMNPRPEKVLCVHGDERSVQDLSSALYHDYNMRTFAPKNLET 636
>gi|448306255|ref|ZP_21496164.1| KH-domain/beta-lactamase-domain-containing protein [Natronorubrum
bangense JCM 10635]
gi|445598669|gb|ELY52724.1| KH-domain/beta-lactamase-domain-containing protein [Natronorubrum
bangense JCM 10635]
Length = 645
Score = 106 bits (265), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 77/305 (25%), Positives = 144/305 (47%), Gaps = 28/305 (9%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFLK 61
F + G ++ ++GD + R A D R + L+ ESTY R+ + ++D +
Sbjct: 340 FHIGDGLYNVAFSGDIHYDDTRLFNGAVNDFPRVETLVLESTYGG--RNDYQTDQQDSEE 397
Query: 62 KVHECVDR----GGKVLIPVFALGRAQELCILLETYWERMNLQA-PIYFAVGLTEKATNY 116
K+ E +++ GGKV+IP FA+GR+QE+ +++E ++ + P++ G+ +AT
Sbjct: 398 KLKEVINKTHEKGGKVVIPAFAVGRSQEIMLVIEEAMRSGDIPSMPVHLD-GMIWEATAI 456
Query: 117 YKMFITWTNQKIRKTFVQ--RNMF---DFKHIRPFDKSFID--NPGPMVVFATPGMLHSG 169
+ + + +R N F +F HI ++ D + GP +V +T GM+ G
Sbjct: 457 HTTYPEYLRDDLRDRIFHDDENPFLAEEFNHIDAGEEERQDVADGGPCIVLSTSGMVTGG 516
Query: 170 LSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFE----------NKQIIDVK 219
+ P ++ L+ G+ QGT+G ++ SG ++ + + +
Sbjct: 517 PIMSWLGHIGPDPDSTLVFVGYQAQGTLGRRIQSGWDEIPTSEVGAMGNNGGGRGTLSLN 576
Query: 220 MAIEYM-SFSAHADAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMP 276
M +E + FS HAD G+ ++ P+ VL VHG+ L + ++N+ F P
Sbjct: 577 MDVETVDGFSGHADRAGLENFVKTMNPRPEKVLCVHGDERSTQDLSSALYHDYNMRTFAP 636
Query: 277 ANGES 281
N E+
Sbjct: 637 KNLET 641
>gi|448582055|ref|ZP_21645559.1| mRNA 3-end processing factor-like protein [Haloferax gibbonsii ATCC
33959]
gi|445731703|gb|ELZ83286.1| mRNA 3-end processing factor-like protein [Haloferax gibbonsii ATCC
33959]
Length = 640
Score = 106 bits (265), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 74/299 (24%), Positives = 143/299 (47%), Gaps = 20/299 (6%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYA--TTIRDSKRCRERDF 59
F + G ++ ++GD + R A D R + L+ ESTY + ++ E+
Sbjct: 339 FHIGDGLYNVAFSGDIHYEDTRLFNGAVNDFPRVETLVLESTYGGRNDYQTDQQDSEKRL 398
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNL-QAPIYFAVGLTEKATNYYK 118
++ ++E DRGGKV+IP FA+GR+QE+ ++LE + + P++ G+ +AT +
Sbjct: 399 IEVINETYDRGGKVVIPAFAVGRSQEIMLVLEEAMRSGKIPEMPVHLD-GMIWEATAIHT 457
Query: 119 MFITWTNQKIRKTFVQ--RNMF---DFKHIRPFDKSFID--NPGPMVVFATPGMLHSGLS 171
+ + +R N F +F HI ++ D + ++ +T GM+ G
Sbjct: 458 TYPEYLRDDLRDRIFHEDENPFLAEEFNHIDGGEEERQDVADGDQAIILSTSGMVTGGPI 517
Query: 172 LIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENK------QIIDVKMAIEYM 225
+ + P + L+ G+ QGT+G ++ +G ++ + + ++M +E +
Sbjct: 518 MSWLRHVGPDPKSRLVFVGYQAQGTLGRRIQNGWDEIPVNGRDGMGRSDTLKLEMDVETV 577
Query: 226 -SFSAHADAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGES 281
FS HAD +G+ ++ P+ VL VHG+ + L + ++N+ F P N E+
Sbjct: 578 DGFSGHADRQGLENFVKTMNPRPEKVLCVHGDERSVQDLSSALYHDYNMRTFAPKNLET 636
>gi|448561310|ref|ZP_21634662.1| mRNA 3-end processing factor-like protein [Haloferax prahovense DSM
18310]
gi|445721542|gb|ELZ73210.1| mRNA 3-end processing factor-like protein [Haloferax prahovense DSM
18310]
Length = 640
Score = 106 bits (265), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 74/299 (24%), Positives = 143/299 (47%), Gaps = 20/299 (6%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYA--TTIRDSKRCRERDF 59
F + G ++ ++GD + R A D R + L+ ESTY + ++ E+
Sbjct: 339 FHIGDGLYNVAFSGDIHYEDTRLFNGAVNDFPRVETLVLESTYGGRNDYQTDQQDSEKRL 398
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNL-QAPIYFAVGLTEKATNYYK 118
++ ++E DRGGKV+IP FA+GR+QE+ ++LE + + P++ G+ +AT +
Sbjct: 399 IEVINETYDRGGKVVIPAFAVGRSQEIMLVLEEAMRSGKIPEMPVHLD-GMIWEATAIHT 457
Query: 119 MFITWTNQKIRKTFVQ--RNMF---DFKHIRPFDKSFID--NPGPMVVFATPGMLHSGLS 171
+ + +R N F +F HI ++ D + ++ +T GM+ G
Sbjct: 458 TYPEYLRDDLRDRIFHEDENPFLAEEFNHIDGGEEERQDVADGDQAIILSTSGMVTGGPI 517
Query: 172 LIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENK------QIIDVKMAIEYM 225
+ + P + L+ G+ QGT+G ++ +G ++ + + ++M +E +
Sbjct: 518 MSWLRHVGPDPKSRLVFVGYQAQGTLGRRIQNGWDEIPVNGRDGMGRSDTLKLEMDVETV 577
Query: 226 -SFSAHADAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGES 281
FS HAD +G+ ++ P+ VL VHG+ + L + ++N+ F P N E+
Sbjct: 578 DGFSGHADRQGLENFVKTMNPRPEKVLCVHGDERSVQDLSSALYHDYNMRTFAPKNLET 636
>gi|356559788|ref|XP_003548179.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
2-like isoform 1 [Glycine max]
Length = 738
Score = 106 bits (265), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 60/194 (30%), Positives = 104/194 (53%), Gaps = 6/194 (3%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDF 59
++++ + ++Y D+N +RHL + RP +LIT++ A + +R +++F
Sbjct: 159 IWKITKDGEDVIYAVDFNHRKERHLNGTVLGSFVRPAVLITDAYNALNNQPYRRQNDKEF 218
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKM 119
+ + + GG VL+PV +GR EL ++LE+YW NL PIYF + +Y K
Sbjct: 219 GDILKKTLREGGNVLLPVDTVGRVLELILMLESYWTDENLNYPIYFLTYVASSTIDYVKS 278
Query: 120 FITWTNQKIRKTF--VQRNMFDFKHIRPF-DKSFIDNP--GPMVVFATPGMLHSGLSLII 174
F+ W + I K+F + N+F K++ +K+ +DN GP VV A+ L +G S I
Sbjct: 279 FLEWMSDTIAKSFEKTRENIFLLKYVTLLINKTELDNAPDGPKVVLASMASLEAGFSHEI 338
Query: 175 FKKWAPVENNMLIM 188
F +WA N+++
Sbjct: 339 FVEWANDVKNLVLF 352
>gi|448596874|ref|ZP_21654012.1| mRNA 3-end processing factor-like protein [Haloferax alexandrinus
JCM 10717]
gi|445740755|gb|ELZ92260.1| mRNA 3-end processing factor-like protein [Haloferax alexandrinus
JCM 10717]
Length = 640
Score = 106 bits (265), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 75/299 (25%), Positives = 142/299 (47%), Gaps = 20/299 (6%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYA--TTIRDSKRCRERDF 59
F + G ++ ++GD + R A D R + L+ ESTY + ++ E
Sbjct: 339 FHIGDGLYNVAFSGDIHYEDTRLFNGAVNDFPRVETLVLESTYGGRNDYQTDQQDSEERL 398
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNL-QAPIYFAVGLTEKATNYYK 118
++ ++E DRGGKV+IP FA+GR+QE+ ++LE + + P++ G+ +AT +
Sbjct: 399 IEVINETYDRGGKVVIPAFAVGRSQEIMLVLEEAMRSGKIPKMPVHLD-GMIWEATAIHT 457
Query: 119 MFITWTNQKIRKTFVQ--RNMF---DFKHIRPFDKSFID--NPGPMVVFATPGMLHSGLS 171
+ + +R N F +F HI ++ D + ++ +T GM+ G
Sbjct: 458 TYPEYLRDDLRDRIFHEDENPFLAEEFNHIDGGEEERQDVADGEQAIILSTSGMVTGGPI 517
Query: 172 LIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENK------QIIDVKMAIEYM 225
+ + P + L+ G+ QGT+G ++ +G ++ + + +KM +E +
Sbjct: 518 MSWLRHVGPDPKSRLVFVGYQAQGTLGRRIQNGWDEIPVNGRDGMGRSDTLKLKMDVETV 577
Query: 226 -SFSAHADAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGES 281
FS HAD +G+ ++ P+ VL VHG+ + L + ++N+ F P N E+
Sbjct: 578 DGFSGHADRQGLENFVKTMNPRPEKVLCVHGDERSVQDLSSALYHDYNMRTFAPKNLET 636
>gi|448678621|ref|ZP_21689628.1| mRNA 3'-end processing factor-like protein [Haloarcula
argentinensis DSM 12282]
gi|445772608|gb|EMA23653.1| mRNA 3'-end processing factor-like protein [Haloarcula
argentinensis DSM 12282]
Length = 638
Score = 106 bits (265), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 75/296 (25%), Positives = 137/296 (46%), Gaps = 17/296 (5%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYA--TTIRDSKRCRERDF 59
F V G ++ ++GD + R A D R + L+ ESTY + + ER
Sbjct: 340 FHVGDGLYNVAFSGDIHYDDTRLFDGAVNDFPRVETLVLESTYGGRNDYQTDQEDSERKL 399
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNL-QAPIYFAVGLTEKATNYYK 118
+ + DR GKVLIP FA+GR+QEL ++LE ++ + P++ G+ +AT +
Sbjct: 400 KQIIQNAYDRDGKVLIPTFAVGRSQELMLVLEEAMRNGDIPEMPVHLD-GMIWEATAVHT 458
Query: 119 MFITWTNQKIRKTFV--QRNMFDFKHIRPFD-----KSFIDNPGPMVVFATPGMLHSGLS 171
+ + ++ N F P D + I + ++ +T GML G +
Sbjct: 459 TYPEYLRDGLQDRIFDEDENPFLADQFNPIDGGDEERQEIADGDSSIILSTSGMLTGGPA 518
Query: 172 LIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFEN---KQIIDVKMAIEYM-SF 227
+ ++ P ++ L G+ QGT+G ++ G+ ++ + + ++M +E + F
Sbjct: 519 MSWLEQLGPDPDSTLTFVGYQAQGTLGKRIQRGIDEVPVGGSGRSETVPLEMNVETVDGF 578
Query: 228 SAHADAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGES 281
S HAD +G+ ++ P+ +L VHG+ S L + + N+ F P N E+
Sbjct: 579 SGHADRQGLENFVKTMNPRPEKILCVHGDESATQDLSSSLYHDQNIRTFAPENLET 634
>gi|448689042|ref|ZP_21694779.1| mRNA 3'-end processing factor-like protein [Haloarcula japonica DSM
6131]
gi|445778912|gb|EMA29854.1| mRNA 3'-end processing factor-like protein [Haloarcula japonica DSM
6131]
Length = 638
Score = 106 bits (265), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 76/296 (25%), Positives = 137/296 (46%), Gaps = 17/296 (5%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYA--TTIRDSKRCRERDF 59
F V G ++ ++GD + R A D R + L+ ESTY + + ER
Sbjct: 340 FHVGDGLYNVAFSGDIHYDDTRLFDGAVNDFPRVETLVLESTYGGRNDYQTDQEDSERKL 399
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNL-QAPIYFAVGLTEKATNYYK 118
+ V DR GKVLIP FA+GR+QEL ++LE ++ + P++ G+ +AT +
Sbjct: 400 KQIVQNAYDRDGKVLIPTFAVGRSQELMLVLEEAMRNGDIPEMPVHLD-GMIWEATAVHT 458
Query: 119 MFITWTNQKIRKTFV--QRNMFDFKHIRPFD-----KSFIDNPGPMVVFATPGMLHSGLS 171
+ + ++ +N F P D + I + ++ +T GML G +
Sbjct: 459 TYPEYLRDGLQDRIFDEDKNPFLADQFNPIDGGDEERQEIADGESAIILSTSGMLTGGPA 518
Query: 172 LIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFEN---KQIIDVKMAIEYM-SF 227
+ + P ++ L G+ QGT+G ++ G+ ++ + + ++M +E + F
Sbjct: 519 MSWLEHLGPDPDSTLTFVGYQAQGTLGKRIQRGIDEVPVGGSGRSETVPLEMDVETVDGF 578
Query: 228 SAHADAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGES 281
S HAD +G+ ++ P+ +L VHG+ S L + + N+ F P N E+
Sbjct: 579 SGHADRQGLENFVKTMNPRPEKILCVHGDESATQDLSSSLYHDQNIRTFAPENLET 634
>gi|448636727|ref|ZP_21675175.1| mRNA 3'-end processing factor-like protein [Haloarcula sinaiiensis
ATCC 33800]
gi|445765033|gb|EMA16172.1| mRNA 3'-end processing factor-like protein [Haloarcula sinaiiensis
ATCC 33800]
Length = 640
Score = 106 bits (264), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 79/302 (26%), Positives = 145/302 (48%), Gaps = 27/302 (8%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYA------TTIRDSKRCR 55
F + G ++ ++GD + R L A D R + L+ ESTY T DS+R
Sbjct: 340 FHIGEGRYNVAFSGDIHYKDTRLLDGAVNDFPRVETLVLESTYGGKNDYQTDQSDSERVL 399
Query: 56 ERDFLKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQA-PIYFAVGLTEKAT 114
RD ++E + GKVLIP FA+GR+QEL ++LE + ++ P+Y G+ +AT
Sbjct: 400 -RDV---INESYENDGKVLIPAFAVGRSQELMLVLEEAMRKGDIPTMPVYLD-GMIREAT 454
Query: 115 NYYKMFITWTNQKIRKTFV--QRNMF---DFKHIRPFD--KSFIDNPGPMVVFATPGMLH 167
+ + + +R+ + N F F+ + D + I + P ++ T GM+
Sbjct: 455 AIHTAYPEYLRDDLRQRILYEDENPFLAEQFEQVDGGDEMRQDIADDEPAIILTTSGMVT 514
Query: 168 SGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFEN-----KQIIDVKMAI 222
G + + +N + G+ +GT+G ++ G ++ + + + +++ +
Sbjct: 515 GGPVMSWLRLLGSDPDNTMAFVGYQAEGTLGRQIQRGQDEITLGDTSGPRAERVSLRLNV 574
Query: 223 EYM-SFSAHADAKGIMQLIQ--YCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANG 279
E + FS HAD +G+ ++ + P+ +L VHG+AS L + Q+FN+ P N
Sbjct: 575 ETVDGFSGHADRQGLESFVETMHPRPEKILCVHGDASTTNQLSSALYQKFNMRTHNPKNL 634
Query: 280 ES 281
E+
Sbjct: 635 ET 636
>gi|435851175|ref|YP_007312761.1| KH-domain/beta-lactamase-domain protein [Methanomethylovorans
hollandica DSM 15978]
gi|433661805|gb|AGB49231.1| KH-domain/beta-lactamase-domain protein [Methanomethylovorans
hollandica DSM 15978]
Length = 636
Score = 106 bits (264), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 80/296 (27%), Positives = 143/296 (48%), Gaps = 17/296 (5%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYA--TTIRDSKRCRERDF 59
F + G ++V+TGD+ R A R + ++ ESTY ++ S + E
Sbjct: 338 FHIGDGLHNVVFTGDFKYEKTRLFDPAVNKFPRVESVVMESTYGGVNALQPSLQDAEGHM 397
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNL-QAPIYFAVGLTEKATNYYK 118
+ + + G VLIP FA+GR+QE+ I+LE + + P+Y G+ +AT +
Sbjct: 398 QQVIKNTLQNKGIVLIPAFAVGRSQEVMIVLEDAIRKGLIDNVPVYLD-GMIWEATAIHA 456
Query: 119 MFITWTNQKIRKTFVQR--NMFDFKHIRPFD-----KSFIDNPGPMVVFATPGMLHSGLS 171
+ + N +RK Q+ N F + RP D + I P V+ AT GM+++G
Sbjct: 457 TYPEYLNNDLRKLIFQKGENPFLSECFRPVDSNELRQEIISKREPCVILATSGMMNAGPV 516
Query: 172 LIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEF---ENKQIIDVKMAIEYM-SF 227
+ FK +A E N L+ G+ GT+G ++ G K++ + ++I + M +E + F
Sbjct: 517 MEYFKAFAEDERNTLVFVGYQADGTLGRRIQKGWKEIPLKTGKGTEMIHMNMNVEIIDGF 576
Query: 228 SAHADAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGES 281
S H+D + +M+ I+ +P+ V HG+ + L I ++ ++ N E+
Sbjct: 577 SGHSDRRQLMEYIRKMKPRPERVFTEHGDERSCIDLATSIHKKNKMETRALTNLET 632
>gi|448681732|ref|ZP_21691823.1| mRNA 3'-end processing factor-like protein [Haloarcula
argentinensis DSM 12282]
gi|445767602|gb|EMA18705.1| mRNA 3'-end processing factor-like protein [Haloarcula
argentinensis DSM 12282]
Length = 640
Score = 106 bits (264), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 80/303 (26%), Positives = 145/303 (47%), Gaps = 29/303 (9%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYA------TTIRDSKRCR 55
F + G ++ ++GD + R L A D R + L+ ESTY T DS+R
Sbjct: 340 FHIGEGRYNVAFSGDIHYKDTRLLDGAVNDFPRVETLVLESTYGGKNDYQTDQSDSERV- 398
Query: 56 ERDFLKKV-HECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQA-PIYFAVGLTEKA 113
LK V +E + GKVLIP FA+GR+QEL ++LE + ++ P+Y G+ +A
Sbjct: 399 ----LKDVINETYENDGKVLIPAFAVGRSQELMLVLEEAMRKGDIPTMPVYLD-GMIREA 453
Query: 114 TNYYKMFITWTNQKIRKTFV--QRNMF---DFKHIRPFD--KSFIDNPGPMVVFATPGML 166
T + + + +R+ + N F F+ + D + I + P ++ T GM+
Sbjct: 454 TAIHTAYPEYLRDDLRQRILYEDENPFLAEQFEQVDGGDEMRQDIADDEPAIILTTSGMV 513
Query: 167 HSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFEN-----KQIIDVKMA 221
G + + +N + G+ +GT+G ++ G ++ + + + +++
Sbjct: 514 TGGPVMSWLRLLGSEPDNTMAFVGYQAEGTLGRQIQRGQDEITMGDTSGPRAERVSLRLN 573
Query: 222 IEYM-SFSAHADAKGIMQLIQ--YCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPAN 278
+E + FS HAD +G+ ++ + P+ +L VHG+AS L + Q+FN+ P N
Sbjct: 574 VETVDGFSGHADRQGLESFVETMHPRPEKILCVHGDASTTNQLSSALYQKFNMRTHNPKN 633
Query: 279 GES 281
E+
Sbjct: 634 LET 636
>gi|383754023|ref|YP_005432926.1| hypothetical protein SELR_11950 [Selenomonas ruminantium subsp.
lactilytica TAM6421]
gi|381366075|dbj|BAL82903.1| hypothetical protein SELR_11950 [Selenomonas ruminantium subsp.
lactilytica TAM6421]
Length = 537
Score = 106 bits (264), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 74/264 (28%), Positives = 136/264 (51%), Gaps = 18/264 (6%)
Query: 36 DLLITESTYATTIRDSKRCRERDFLKKVHECVDRGGKVLIPVFALGRAQELCILLETYWE 95
D L+ ESTY + +E L+ V++ +DRGG V+IP FA+GR Q L W+
Sbjct: 206 DFLVVESTYGDR-KHKTYDKETKLLEIVNDTMDRGGNVIIPSFAVGRTQTLLYYFYNLWK 264
Query: 96 --RMNLQAPIYFAVGLTEKATN-YYKMFITWTNQKIRKTFVQRNMFDFKHIR----PFD- 147
R++ PI L AT + K + ++ + I+ + +F +R P +
Sbjct: 265 QGRLDGDIPIILDSPLAINATRIFMKNYQEFSEEAIKMFGNSEKLTEFPQLRICKSPEES 324
Query: 148 KSFIDNPGPMVVFATPGMLHSGLSLIIFKK--WAPVENNMLIMPGFCVQGTIGHKVLSGV 205
++ + G ++ + GM +G L K W P E+ +L + G+ +G++G +++ G+
Sbjct: 325 RALNSSEGSAIIISASGMADAGRILHHLKHNLWRP-ESTILFV-GYQAEGSLGRRLVEGI 382
Query: 206 KKLEFENKQIIDVKMAIEYMS-FSAHADAKGIMQLI---QYCEPKNVLLVHGEASKMVFL 261
K++ +++ VK I+ M+ FSAHAD +M I Q +P + +VHGEA F+
Sbjct: 383 KRVRVMGEEVA-VKAQIQVMNGFSAHADTDQLMDWITNFQNPKPAKIFIVHGEAQGQEFM 441
Query: 262 KEKIKQEFNLDCFMPANGESCFVQ 285
K +I++E+ + ++P G++ +
Sbjct: 442 KSRIQKEYQGEAYIPFRGDAVVIH 465
>gi|448685512|ref|ZP_21693504.1| mRNA 3'-end processing factor-like protein [Haloarcula japonica DSM
6131]
gi|445782123|gb|EMA32974.1| mRNA 3'-end processing factor-like protein [Haloarcula japonica DSM
6131]
Length = 640
Score = 105 bits (263), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 81/303 (26%), Positives = 144/303 (47%), Gaps = 29/303 (9%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYA------TTIRDSKRCR 55
F + G ++ ++GD + R L A D R + L+ ESTY T DS+R
Sbjct: 340 FHIGEGRYNVAFSGDIHYKDTRLLDGAVNDFPRVETLVLESTYGGKNDYQTDQSDSERV- 398
Query: 56 ERDFLKKV-HECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQA-PIYFAVGLTEKA 113
LK V +E + GKVLIP FA+GR+QEL ++LE + + P+Y G+ +A
Sbjct: 399 ----LKDVINETYENDGKVLIPAFAVGRSQELMLVLEEAMRKGEIPTMPVYLD-GMIREA 453
Query: 114 TNYYKMFITWTNQKIRKTFV--QRNMF---DFKHIRPFD--KSFIDNPGPMVVFATPGML 166
T + + + +R+ + N F F+ + D + I + P +V T GM+
Sbjct: 454 TAIHTAYPEYLRDDLRQRILYEDENPFLAEQFEQVDGGDEMRQDIADDEPAIVLTTSGMV 513
Query: 167 HSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFEN-----KQIIDVKMA 221
G + + +N + G+ +GT+G ++ G ++ + + + +++
Sbjct: 514 TGGPVMSWLRLLGSEPDNTMAFVGYQAEGTLGRQIQRGQDEITLGDTSGPRAERVSLRLN 573
Query: 222 IEYM-SFSAHADAKGIMQLIQ--YCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPAN 278
+E + FS HAD +G+ ++ + P+ +L VHG+AS L + Q+FN+ P N
Sbjct: 574 VETVDGFSGHADRQGLESFVETMHPRPEKILCVHGDASTTNQLSSALYQKFNMRTHNPKN 633
Query: 279 GES 281
E+
Sbjct: 634 LET 636
>gi|345004156|ref|YP_004807009.1| KH-domain/beta-lactamase-domain-containing protein [halophilic
archaeon DL31]
gi|344319782|gb|AEN04636.1| KH-domain/beta-lactamase-domain protein [halophilic archaeon DL31]
Length = 643
Score = 105 bits (263), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 77/301 (25%), Positives = 141/301 (46%), Gaps = 22/301 (7%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYA--TTIRDSKRCRERDF 59
F V G ++ ++GD + R A D R + L+ ESTY + + E+
Sbjct: 340 FHVGDGLYNVCFSGDIHYEDTRLFNGAVNDFPRVETLVMESTYGGRNDYQTDQEDSEKAL 399
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNL-QAPIYFAVGLTEKATNYYK 118
+ ++E D GGKVLIP FA+GR+QE+ +++E + + P++ G+ +AT +
Sbjct: 400 AEAINEVYDEGGKVLIPAFAVGRSQEIMLVIEEAMRSGKIPEMPVHLD-GMIWEATAIHT 458
Query: 119 MFITWTNQKIRKTFVQ--RNMF---DFKHIRPF--DKSFIDNPGPMVVFATPGMLHSGLS 171
+ + +R N F F HI ++ + + P ++ +T GM+ G
Sbjct: 459 TYPEYLRDDLRDRIFHDDENPFLADQFNHIDGGEDERQEVADGEPAIILSTSGMVTGGPI 518
Query: 172 LIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLE---FENKQ-----IIDVKMAIE 223
+ + ++ L G+ QGT+G ++ +G ++ F N + + ++M ++
Sbjct: 519 MSWLRHLGSDADSKLTFVGYQAQGTLGRRIQNGWDEIPVSGFGNDRSGRASTLTLEMGVQ 578
Query: 224 YM-SFSAHADAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGE 280
+ FS HAD G+M ++ P+ VL VHG+ S + + EFN+ F P N E
Sbjct: 579 TVDGFSGHADRGGLMDFVRNMSPRPEKVLCVHGDESSVQDFSSALYHEFNMRTFAPKNLE 638
Query: 281 S 281
+
Sbjct: 639 T 639
>gi|389860743|ref|YP_006362983.1| beta-lactamase [Thermogladius cellulolyticus 1633]
gi|388525647|gb|AFK50845.1| beta-lactamase domain protein [Thermogladius cellulolyticus 1633]
Length = 424
Score = 105 bits (263), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 79/281 (28%), Positives = 134/281 (47%), Gaps = 15/281 (5%)
Query: 8 NQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFLKKVHECV 67
I+YTGD N L A + + D LITE+TY + + E+ F+ V E +
Sbjct: 154 GHRILYTGDINTVQTWTLSGAELWPVKVDTLITEATYGDRVHPPRYRVEKRFVSVVEEVI 213
Query: 68 DRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMFITWTNQK 127
GG VL+P F++GR+QE+ LL +++ + G+ + Y + +
Sbjct: 214 SSGGVVLVPSFSVGRSQEVLTLLANELPYVDI-----YVDGMAREICGIYLNYQKYLRDP 268
Query: 128 IRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAPVENNMLI 187
T + K++R K+ + +PG V+ AT GML G SL KK A N ++
Sbjct: 269 SLYTRALERVTFVKNVRDRKKA-LKSPG--VIIATAGMLKGGPSLQYLKKLADDPKNAVL 325
Query: 188 MPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLIQ-YCEP- 245
+ + + GHK+L +K E + K +E+ FS+HA G++ + + Y
Sbjct: 326 LVSYQSPNSPGHKLL---EKGGLEEIGVYFTKARVEWFDFSSHAGRDGLVDIAKTYSHTL 382
Query: 246 KNVLLVHGEASKMVFLKEKIKQEF--NLDCFMPANGESCFV 284
+NV++VHG+ L I++ +L+ ++P NG+S V
Sbjct: 383 RNVVIVHGDEESSKGLANAIRESLGKDLNIYVPNNGDSITV 423
>gi|336122441|ref|YP_004577216.1| RNA-metabolising metallo-beta-lactamase [Methanothermococcus
okinawensis IH1]
gi|334856962|gb|AEH07438.1| RNA-metabolising metallo-beta-lactamase [Methanothermococcus
okinawensis IH1]
Length = 427
Score = 105 bits (263), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 74/261 (28%), Positives = 132/261 (50%), Gaps = 14/261 (5%)
Query: 8 NQSIVYTGDYNMTPDRHLGAAWIDKCRPDL--LITESTYATTIRDSKRCRERDFLKKVHE 65
+ IVYTGD ++ R A + C D+ LI ESTY + +++ E++FL KV E
Sbjct: 156 GKKIVYTGDIKLSKTRLTKGADLSYCDNDIDILIIESTYGNRLHPNRKQLEKEFLNKVKE 215
Query: 66 CVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYF---AVGLTEKATNYYKMFIT 122
V+RGG +IPVFA+ R+QE+ ++L ++L PIYF V +T+ + ++
Sbjct: 216 TVERGGTAIIPVFAVDRSQEVILMLND----LDLDVPIYFDGLGVKITKIMLGQHPKYLR 271
Query: 123 WTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAPVE 182
++K+++ F + + K +N G ++ +T GML+ G + +
Sbjct: 272 -NHKKLKRAFSNTYTVENEERHYVIKDLKENGG--IIVSTAGMLNGGPVIKYISNFWNDP 328
Query: 183 NNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLIQY 242
+ LI G+ V+ T G ++L KL + +++ K I + FSAHA + +++
Sbjct: 329 KSSLIFTGYQVKNTAG-RILLETGKLPI-GEMMVEPKFEISFYEFSAHAGMDELRTVVKK 386
Query: 243 CEPKNVLLVHGEASKMVFLKE 263
P+ +++ HGE + KE
Sbjct: 387 ANPEILIIQHGEEEAINVFKE 407
>gi|448574890|ref|ZP_21641413.1| mRNA 3-end processing factor-like protein [Haloferax larsenii JCM
13917]
gi|445732569|gb|ELZ84151.1| mRNA 3-end processing factor-like protein [Haloferax larsenii JCM
13917]
Length = 640
Score = 105 bits (263), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 74/299 (24%), Positives = 143/299 (47%), Gaps = 20/299 (6%)
Query: 2 FQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYA--TTIRDSKRCRERDF 59
F + G ++ ++GD + R A D R + L+ ESTY + ++ E+
Sbjct: 339 FHIGDGLYNVAFSGDIHYEDTRLFNGAVNDFPRVETLVLESTYGGRNDYQTDQQDSEQKL 398
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQA-PIYFAVGLTEKATNYYK 118
+ ++E +RGGKV+IP FA+GR+QE+ ++LE + + P++ G+ +AT +
Sbjct: 399 IDVINETYERGGKVVIPAFAVGRSQEIMLVLEEAMRSGKIPSMPVHLD-GMIWEATAIHT 457
Query: 119 MFITWTNQKIRKTFVQ--RNMF---DFKHIRPFDKSFID--NPGPMVVFATPGMLHSGLS 171
+ + +R N F +F HI ++ D + ++ +T GM+ G
Sbjct: 458 TYPEYLRDDLRDRIFHEDENPFLAEEFNHIDGGEEERQDVADGDQAIILSTSGMVTGGPI 517
Query: 172 LIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFEN------KQIIDVKMAIEYM 225
+ + P ++ L+ G+ QGT+G ++ +G ++ + + +KM +E +
Sbjct: 518 MSWLRHVGPDPDSRLVFVGYQAQGTLGRRIQNGWDEIPVNDGDGRGRSDTLKLKMDVETV 577
Query: 226 -SFSAHADAKGIMQLIQYCEPK--NVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGES 281
FS HAD +G+ ++ P+ VL VHG+ + L + ++N+ F P N E+
Sbjct: 578 DGFSGHADRQGLENFVKTMNPRPEKVLCVHGDERSVQDLSSALYHDYNMRTFAPKNLET 636
>gi|449446027|ref|XP_004140773.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
2-like [Cucumis sativus]
Length = 738
Score = 105 bits (262), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 63/204 (30%), Positives = 105/204 (51%), Gaps = 9/204 (4%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDF 59
++++ + ++Y D+N +RHL ++ RP +LIT++ A + +R ++++F
Sbjct: 159 LWKITKDGEDVIYAVDFNHRKERHLNGTILESFVRPAVLITDAYNALNNQPYRRQKDKEF 218
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKM 119
+ + + G VL+PV GR EL +LE YWE +L PI+F + +Y K
Sbjct: 219 GDTIQKTLRANGNVLLPVDTAGRVLELIQILEWYWEEESLNYPIFFLTYVASSTIDYIKS 278
Query: 120 FITWTNQKIRKTF--VQRNMFDFKHIRPF-DKSFIDNP--GPMVVFATPGMLHSGLSLII 174
F+ W + I K+F + N F KH+ +KS +DN GP VV A+ L +G S I
Sbjct: 279 FLEWMSDTIAKSFEHTRNNAFLLKHVTLLINKSELDNAPDGPKVVLASMASLEAGYSHDI 338
Query: 175 FKKWAPVENNMLIMPGFCVQGTIG 198
F WA N+++ F +G G
Sbjct: 339 FVDWAMDAKNLVL---FSERGQFG 359
Score = 40.0 bits (92), Expect = 7.0, Method: Compositional matrix adjust.
Identities = 20/77 (25%), Positives = 37/77 (48%)
Query: 212 NKQIIDVKMAIEYMSFSAHADAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNL 271
N+ + VK ++ YM F +D + I ++ + P ++LVHG A LK+ +
Sbjct: 529 NELTVQVKCSLHYMDFEGRSDGRSIKSILSHVAPLKLVLVHGTAEATEHLKQHCLKNVCP 588
Query: 272 DCFMPANGESCFVQTDM 288
+ P E+ V +D+
Sbjct: 589 HVYAPQIEETIDVTSDL 605
>gi|305663939|ref|YP_003860227.1| beta-lactamase [Ignisphaera aggregans DSM 17230]
gi|304378508|gb|ADM28347.1| beta-lactamase domain protein [Ignisphaera aggregans DSM 17230]
Length = 424
Score = 105 bits (262), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 78/291 (26%), Positives = 137/291 (47%), Gaps = 22/291 (7%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDKCRPDLLITESTYATTIRDSKRCRERDFL 60
MF++ +G +SI+YTGD N R + D+LI ESTY + E F+
Sbjct: 150 MFKITMGEKSILYTGDINTIDTRLIKGLKPQNIDADILIIESTYGLYDHPERSRVEELFI 209
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMF 120
V ++ GG VL+P F+LGRAQE+ LL ER+ A +Y+ G+ ++ N
Sbjct: 210 DTVKSVIEDGGNVLVPAFSLGRAQEILTLL---IERLP-NANVYYD-GMAKEILNILLEH 264
Query: 121 ITWTNQ-----KIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIF 175
+ N+ K+ + F+ + + I G ++ A GML G +L
Sbjct: 265 REYINRYDLLLKVYERFIAVDKTTL-------RKKICKEGGNIIVAPAGMLKGGPALYYI 317
Query: 176 KKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKG 235
++ N +I+ + T G K+L+ E ++ ++ + + FS+HA + G
Sbjct: 318 RRLGDNPRNAIILVSYQAPSTPGRKLLTNGVIEEGGSR----IRAKLYWFDFSSHAGSTG 373
Query: 236 IMQLIQYCEP-KNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFVQ 285
+ +++ + + VLLVHG + L +IK E ++ PANG++ ++
Sbjct: 374 LFNVVKSIKNVEKVLLVHGNDDSIYTLGYRIKDELGIEFEAPANGQTILIK 424
>gi|390933400|ref|YP_006390905.1| RNA-metabolising metallo-beta-lactamase [Thermoanaerobacterium
saccharolyticum JW/SL-YS485]
gi|389568901|gb|AFK85306.1| RNA-metabolising metallo-beta-lactamase [Thermoanaerobacterium
saccharolyticum JW/SL-YS485]
Length = 537
Score = 105 bits (262), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 71/258 (27%), Positives = 126/258 (48%), Gaps = 18/258 (6%)
Query: 36 DLLITESTYATTIRDSKRCRERDFLKKVHECVDRGGKVLIPVFALGRAQELCILL----E 91
D L+ ESTY + + + + ++ + + + RGG V+IP FA+GR QEL + E
Sbjct: 206 DYLVCESTYGDRLHEDVGDKAKKLMEIIKKTISRGGNVIIPSFAVGRTQELLYEIHKDEE 265
Query: 92 TYWERMNL--QAPIYFAVGLTEKATNYYKMFITWTNQKIRKTFVQRNMF-------DFKH 142
Y + + P+Y L T+ ++ + + + + R ++V+ + F H
Sbjct: 266 LYKNEIEYISKVPVYVDSPLATSVTSVFRKHLDYFDDEAR-SYVENGDYPLDFPNLHFTH 324
Query: 143 IRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVL 202
K+ D P+++ + GM +G K ++ ++ G+ +GT+G ++L
Sbjct: 325 SAEESKALNDIKEPVIIISASGMCEAGRIKHHLKHNLWRSDSTIVFVGYQAKGTLGRRIL 384
Query: 203 SGVKKLEFENKQIIDVKMAIEYM-SFSAHADAKGIMQLIQYC--EPKNVLLVHGEASKMV 259
G K + ++I V I+ + SFS HAD KGIM I +PK + +VHGE S
Sbjct: 385 DGEKTVNIFGEEIT-VNAEIQNIESFSGHADQKGIMDWISSFTKKPKKIFIVHGEDSAQK 443
Query: 260 FLKEKIKQEFNLDCFMPA 277
L +KIK E N++ +P+
Sbjct: 444 VLSKKIKDELNIETVIPS 461
>gi|356530856|ref|XP_003533995.1| PREDICTED: cleavage and polyadenylation specificity factor subunit
2-like isoform 1 [Glycine max]
Length = 736
Score = 105 bits (262), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 60/194 (30%), Positives = 103/194 (53%), Gaps = 6/194 (3%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDF 59
++++ + ++Y D+N +RHL + RP +LIT++ A + +R +++F
Sbjct: 159 IWKITKDGEDVIYAVDFNHRKERHLNGTVLGSFVRPAVLITDAYNALNNQPYRRQNDKEF 218
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKM 119
+ + + GG VL+PV +GR EL ++LE YW NL PIYF + +Y K
Sbjct: 219 GDILKKTLRAGGNVLLPVDTVGRVLELILMLELYWADENLNYPIYFLTYVASSTIDYVKS 278
Query: 120 FITWTNQKIRKTF--VQRNMFDFKHIRPF-DKSFIDNP--GPMVVFATPGMLHSGLSLII 174
F+ W + I K+F + N+F K++ +K+ +DN GP VV A+ L +G S I
Sbjct: 279 FLEWMSDTIAKSFEKTRENIFLLKYVTLLINKTELDNAPDGPKVVLASMASLEAGFSHDI 338
Query: 175 FKKWAPVENNMLIM 188
F +WA N+++
Sbjct: 339 FVEWANDVKNLVLF 352
>gi|284173343|ref|ZP_06387312.1| mRNA 3'-end processing factor, putative [Sulfolobus solfataricus
98/2]
gi|384433838|ref|YP_005643196.1| beta-lactamase [Sulfolobus solfataricus 98/2]
gi|261601992|gb|ACX91595.1| beta-lactamase domain protein [Sulfolobus solfataricus 98/2]
Length = 422
Score = 105 bits (262), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 77/272 (28%), Positives = 136/272 (50%), Gaps = 14/272 (5%)
Query: 11 IVYTGDYNMTPDRHLGAAWIDKCR-PDLLITESTYATTIRDSKRCRERDFLKKVHECVDR 69
I +TGD N+T + + A I+ ++L+ ESTY +++ E DF KV E V+
Sbjct: 159 IAFTGDINLTETKLMKPAEIENIGDANVLVMESTYGKFNHPNRKDVENDFYDKVMEVVES 218
Query: 70 GGKVLIPVFALGRAQELCILLETYWERMNLQAPIYFAVGLTEKATNYYKMFITWTNQKIR 129
GG VL+P F+L R+QE+ +L ER N P+Y+ G++ + T F + N R
Sbjct: 219 GGTVLVPAFSLARSQEVLSVLA---ER-NFPYPVYYD-GMSREITEIMLGFKEFLN---R 270
Query: 130 KTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFKKWAPVENNMLIMP 189
+++ +F +++ ++ V+ A+ GML G ++ FKK + N + +
Sbjct: 271 PDLLKKAYDNFNYVKGWEDRHRAWKEKGVIVASAGMLKGGPAVYYFKKLSENSKNAVFLV 330
Query: 190 GFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYMSFSAHADAKGIMQLIQYCEP-KNV 248
+ T G K+L K E+ +K +E FS+HA + ++++++ + + V
Sbjct: 331 SYQAINTPGRKLLEMGKFDEYSGL----LKARLEIFDFSSHAGRRQLLEIVKSVKDLEKV 386
Query: 249 LLVHGEASKMVFLKEKIKQEFNLDCFMPANGE 280
+LVHG L + IKQE ++ P NG+
Sbjct: 387 VLVHGSPDNESSLADLIKQEIGVEVITPENGQ 418
>gi|171185999|ref|YP_001794918.1| beta-lactamase domain-containing protein [Pyrobaculum neutrophilum
V24Sta]
gi|170935211|gb|ACB40472.1| beta-lactamase domain protein [Pyrobaculum neutrophilum V24Sta]
Length = 425
Score = 105 bits (261), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 89/303 (29%), Positives = 142/303 (46%), Gaps = 48/303 (15%)
Query: 4 VKVGNQSIVYTGDYNMTPDRHL--GAAWIDKCR-PDLLITESTYATTIRDSKRCRERDFL 60
+++ + +V+TGD+N T D +L GA + + PD++I E+TYA+ + E++F+
Sbjct: 150 LEIEGRVVVFTGDFN-TVDTNLLRGADLYNLPKAPDVVIMEATYASADHPPREKLEKEFV 208
Query: 61 KKVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYF---------AVG--- 108
+ V E +D+GG VLIP FALGRAQE+ + L + + APIY VG
Sbjct: 209 QAVKEVLDQGGTVLIPSFALGRAQEIMLTL----VKHGVNAPIYIDGLARQINQIVGRYP 264
Query: 109 -LTEKATNYYKMF---ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPG 164
L T Y K I N +RK ++ + S I P G
Sbjct: 265 YLLRDPTLYRKALEAAIEVPNTYVRKGAIE------------EASVIITPA--------G 304
Query: 165 MLHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEY 224
ML G +L FK+ A N + +P F T G ++LS K + ++ V+ +E+
Sbjct: 305 MLKGGAALYYFKRLAQNRKNGIFLPSFQAPNTPGFEILS--KGHAVVDGTLVKVEARLEW 362
Query: 225 MSFSAHADAKGIMQLI-QYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPAN-GESC 282
FSAHA + I+ I ++ ++LVH E L ++ E + ++P GE
Sbjct: 363 FDFSAHAGRREIVHFIKRFSADTRIILVHTEPKAAAPLVRRLAGEGFDNIYVPTEPGEEA 422
Query: 283 FVQ 285
+Q
Sbjct: 423 VLQ 425
>gi|145591929|ref|YP_001153931.1| beta-lactamase domain-containing protein [Pyrobaculum arsenaticum
DSM 13514]
gi|145283697|gb|ABP51279.1| beta-lactamase domain protein [Pyrobaculum arsenaticum DSM 13514]
Length = 425
Score = 105 bits (261), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 84/302 (27%), Positives = 138/302 (45%), Gaps = 46/302 (15%)
Query: 4 VKVGNQSIVYTGDYNMTPDRHLGAAWIDKC--RPDLLITESTYATTIRDSKRCRERDFLK 61
++V + +V+TGD+N L A + PD++I E+TYA+ + E++F++
Sbjct: 150 IEVEGRVVVFTGDFNTVDTNLLRGADVYNLPKNPDVVIMEATYASANHPPRDKLEKEFVQ 209
Query: 62 KVHECVDRGGKVLIPVFALGRAQELCILLETYWERMNLQAPIYF---------AVG---- 108
V E ++ GG VLIP FALGRAQE+ + L + ++ PIY VG
Sbjct: 210 SVKEVLEGGGSVLIPSFALGRAQEILLTL----VKHGVEYPIYVDGLARQINQIVGRYPH 265
Query: 109 LTEKATNYYKMF---ITWTNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGM 165
L + + Y K I N +RK + P V+ A GM
Sbjct: 266 LLKDPSLYKKALDVSIEVPNTYVRKGAAEE--------------------PSVIIAPAGM 305
Query: 166 LHSGLSLIIFKKWAPVENNMLIMPGFCVQGTIGHKVLSGVKKLEFENKQIIDVKMAIEYM 225
+ G +L FKK A N + +P F T G ++LS K + +I V+ +E+
Sbjct: 306 IKGGAALFYFKKMAGNRKNGVFLPSFQAPNTPGFQILS--KGYAIVDGAVIKVEARLEWF 363
Query: 226 SFSAHADAKGIMQLIQYCEPKN-VLLVHGEASKMVFLKEKIKQEFNLDCFMPAN-GESCF 283
FSAHA + I+ P+ ++LVH + + L +K+ ++ + ++P GE
Sbjct: 364 DFSAHAGRGELEAFIKRFSPETKIILVHTDPATAAPLVKKLAEDGYDNIYLPTTPGEELA 423
Query: 284 VQ 285
+Q
Sbjct: 424 LQ 425
>gi|392956492|ref|ZP_10322019.1| RNA-metabolising metallo-beta-lactamase [Bacillus macauensis
ZFHKF-1]
gi|391877474|gb|EIT86067.1| RNA-metabolising metallo-beta-lactamase [Bacillus macauensis
ZFHKF-1]
Length = 896
Score = 105 bits (261), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 77/290 (26%), Positives = 138/290 (47%), Gaps = 12/290 (4%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDF 59
MF ++ + + TGD + R + A + K PD++I ESTY + E+
Sbjct: 155 MFLIEGDGEKLFVTGDLSFKAGRTIPGAVVPKHIEPDVVIMESTYGNRAHSDRNSEEKRL 214
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQELCILLETYWER-MNLQAPIYFAVGLTEKATNYYK 118
+ V + GG LIP FALGRAQE+ ++L+ Y ER + Q PIY GL + Y+
Sbjct: 215 AEHVAATIAGGGFALIPAFALGRAQEVLLILQDYMERGLIPQFPIYVD-GLVTPISRIYR 273
Query: 119 MFITWTNQKIRKTFVQR-NMFDFKHIRPF---DKSFIDNPGPMVVFATPGMLHSGLSLII 174
+ + + ++ + F + R ++ + N P + A+ GML G S
Sbjct: 274 DYPHYLKGPVAHRIMKNGDAFLTERCRAVSRDEREDVLNGKPACIVASSGMLTGGASAWY 333
Query: 175 FKKWAPVENNMLIMPGFCVQGTIGHKVLS---GVKKLEFENKQIIDVKMAIEYMSFSAHA 231
++ E N + + G+ + + G K+L+ G+++ N + V+ ++ S HA
Sbjct: 334 AERLLSNEKNAIFITGYQDEESPGRKLLNLADGIEQHIELNGREYPVQCSVTKYGLSGHA 393
Query: 232 DAKGIMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGES 281
DA + I+ P + LLVHG+ + + L E++ + ++ + NGE+
Sbjct: 394 DAHEMNVFIEALAPTHTLLVHGDDTARMQLAERLHERYS--PILVENGET 441
>gi|432327997|ref|YP_007246141.1| putative exonuclease of the beta-lactamase fold involved in RNA
processing [Aciduliprofundum sp. MAR08-339]
gi|432134706|gb|AGB03975.1| putative exonuclease of the beta-lactamase fold involved in RNA
processing [Aciduliprofundum sp. MAR08-339]
Length = 413
Score = 104 bits (260), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 82/289 (28%), Positives = 149/289 (51%), Gaps = 29/289 (10%)
Query: 1 MFQVKVGNQSIVYTGDYNMTPDRHLGAAWIDK-CRPDLLITESTYATTIRDSKRCRERDF 59
M++++ +++I++TGD T + HL W K + D+LI ESTYA + ER+F
Sbjct: 149 MYEIR-DDKTILFTGDIQ-TVNTHL--VWGTKPVKTDVLIMESTYAGREHPPRDEVERNF 204
Query: 60 LKKVHECVDRGGKVLIPVFALGRAQE-LCILLETYWERMNLQAPIYFAVGLTEKATNYYK 118
L+K+ E V+RGGK ++PVFA+GR QE + IL +T ++ + G+ TN +
Sbjct: 205 LEKIEEIVERGGKAVVPVFAVGRTQEIMLILAKTDYD--------VWIDGMGRFITNLFL 256
Query: 119 MFITW--TNQKIRKTFVQRNMFDFKHIRPFDKSFIDNPGPMVVFATPGMLHSGLSLIIFK 176
+ + + +K+++ RN + R + ++ V+ AT GML G L
Sbjct: 257 QYPQYLRSAKKLKRA---RNKMNIVRRRSDRRKALNGD---VIVATGGMLEGGPVLHYIN 310
Query: 177 KWAPVENNMLIMPGFCVQGTIGHKVLS-GVKKLEFENKQIIDVKMAIEYMSFSAHADAKG 235
+ +++ G+ V+GT G ++ G+ +L + K+ ID M + + FSAHA
Sbjct: 311 HLKNDRRSGILLTGYQVEGTNGRLLMDEGILEL-YGIKEKID--MELNFFDFSAHAGHSE 367
Query: 236 IMQLIQYCEPKNVLLVHGEASKMVFLKEKIKQEFNLDCFMPANGESCFV 284
+++ + C P+ ++L+HG+ + + K + D F+P N E +
Sbjct: 368 LVEFAEKCSPEKIILMHGDNREAL---AKDLKNMGFDVFLPKNDEEFLI 413
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.320 0.134 0.387
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 16,113,597,086
Number of Sequences: 23463169
Number of extensions: 649598053
Number of successful extensions: 1674469
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 1996
Number of HSP's successfully gapped in prelim test: 1278
Number of HSP's that attempted gapping in prelim test: 1665423
Number of HSP's gapped (non-prelim): 5266
length of query: 1101
length of database: 8,064,228,071
effective HSP length: 154
effective length of query: 947
effective length of database: 8,745,867,341
effective search space: 8282336371927
effective search space used: 8282336371927
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 83 (36.6 bits)