BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 043917
(423 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q8GUU3|CPS3B_ARATH Cleavage and polyadenylation specificity factor subunit 3-II
OS=Arabidopsis thaliana GN=CPSF73-II PE=1 SV=2
Length = 613
Score = 483 bits (1242), Expect = e-135, Method: Compositional matrix adjust.
Identities = 243/421 (57%), Positives = 301/421 (71%), Gaps = 39/421 (9%)
Query: 1 VHKCVAGGGKVLIPAFALGRAQELCILLDDYWERMNLRVPIYFSAGLTIQANMYYKMLIS 60
VHKCVAGGGK LIP+FALGRAQELC+LLDDYWERMN++VPIYFS+GLTIQANMYYKMLIS
Sbjct: 224 VHKCVAGGGKALIPSFALGRAQELCMLLDDYWERMNIKVPIYFSSGLTIQANMYYKMLIS 283
Query: 61 WTSQKVKE---TYNAFDFKNVHNFDRSLIDAPGPCVLFATPGMLTGGFSLEVFKHWAPSE 117
WTSQ VKE T+N FDFKNV +FDRSLI APGPCVLFATPGML GFSLEVFKHWAPS
Sbjct: 284 WTSQNVKEKHNTHNPFDFKNVKDFDRSLIHAPGPCVLFATPGMLCAGFSLEVFKHWAPSP 343
Query: 118 MNLITLPGYCLAGTIGNKLMSGNPT-IEL-EGTKIDVRCQIHQLAFSPHTDGKGIMDLVK 175
+NL+ LPGY +AGT+G+KLM+G PT ++L GTK+DVRC++HQ+AFSPHTD KGIMDL K
Sbjct: 344 LNLVALPGYSVAGTVGHKLMAGKPTTVDLYNGTKVDVRCKVHQVAFSPHTDAKGIMDLTK 403
Query: 176 FLSPQHVILVHGEKPKMATLKERIQSELGIKCYDPANNESMCIPSTHYVKAGASDAFIRS 235
FLSP++V+LVHGEKP M LKE+I SEL I C+ PAN E++ ST Y+KA ASD F++S
Sbjct: 404 FLSPKNVVLVHGEKPSMMILKEKITSELDIPCFVPANGETVSFASTTYIKANASDMFLKS 463
Query: 236 CMNPNFQYLKSGSEEKSVSGSKCTEGTLPLWIKDERVAEGILVLEKSEKAKVVHQDELLL 295
C NPNF++ S L + D R A+G+LV+EKS+KAK+VHQDE+
Sbjct: 464 CSNPNFKFSNSTQ----------------LRVTDHRTADGVLVIEKSKKAKIVHQDEISE 507
Query: 296 MLGEKRHEVQFAYCCPVNVDELEKFTTTSLTPTARMLRDPNKSSLIRLLVAKLSRKLSEG 355
+L EK H V A+CCPV V + + + LI+ L AK+ + +S
Sbjct: 508 VLHEKNHVVSLAHCCPVKV---------------KGESEDDDVDLIKQLSAKILKTVSGA 552
Query: 356 NIQDFGEHLQVESFHLSVCLKDTCPYRITNGLEDKPRTAFFCCTWSAADDKLARKIISAM 415
I + LQV SF S+CLKD C +R ++ + F CC WS AD +L +II+A+
Sbjct: 553 QIHESENCLQVASFKGSLCLKDKCMHRSSSSSSE---AVFLCCNWSIADLELGWEIINAI 609
Query: 416 E 416
+
Sbjct: 610 K 610
>sp|Q5TA45|INT11_HUMAN Integrator complex subunit 11 OS=Homo sapiens GN=CPSF3L PE=1 SV=2
Length = 600
Score = 286 bits (731), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 161/421 (38%), Positives = 241/421 (57%), Gaps = 56/421 (13%)
Query: 1 VHKCVAGGGKVLIPAFALGRAQELCILLDDYWERMNLRVPIYFSAGLTIQANMYYKMLIS 60
VH+ V GGKVLIP FALGRAQELCILL+ +WERMNL+VPIYFS GLT +AN YYK+ I
Sbjct: 225 VHETVERGGKVLIPVFALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLFIP 284
Query: 61 WTSQKVKETY---NAFDFKNVHNFDRSLIDAPGPCVLFATPGMLTGGFSLEVFKHWAPSE 117
WT+QK+++T+ N F+FK++ FDR+ D PGP V+FATPGML G SL++F+ WA +E
Sbjct: 285 WTNQKIRKTFVQRNMFEFKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAGNE 344
Query: 118 MNLITLPGYCLAGTIGNKLMSGNPTIELEGTKI-DVRCQIHQLAFSPHTDGKGIMDLVKF 176
N++ +PGYC+ GT+G+K++SG +E+EG ++ +V+ Q+ ++FS H D KGIM LV
Sbjct: 345 KNMVIMPGYCVQGTVGHKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLVGQ 404
Query: 177 LSPQHVILVHGEKPKMATLKERIQSELGIKCYDPANNESMCIPSTHYVKAGASDAFIRSC 236
P+ V+LVHGE KM LK++I+ EL + CY PAN E++ +P++ + G S ++
Sbjct: 405 AEPESVLLVHGEAKKMEFLKQKIEQELRVNCYMPANGETVTLPTSPSIPVGISLGLLKRE 464
Query: 237 MNPNFQYLKSGSEEKSVSGSKCTEGTLPLWIKDERVAEGILVLEKSEKAKVVHQDELLLM 296
M +G LP K R+ G L++ K ++V ++ L
Sbjct: 465 M---------------------AQGLLP-EAKKPRLLHGTLIM-KDSNFRLVSSEQALKE 501
Query: 297 LGEKRHEVQFAYCCPVNVDELEKFTTTSLTPTARMLRDPNKSSLIRLLVAKLSRKLSEGN 356
LG H+++F C V++ + K T+L S L +L + L +G+
Sbjct: 502 LGLAEHQLRFT--CRVHLHDTRKEQETALRVY---------SHLKSVLKDHCVQHLPDGS 550
Query: 357 IQDFGEHLQVESFHLSVCLKDTCPYRITNGLEDKPRTAFFCCTWSAADDKLARKIISAME 416
+ VE SV L+ P + P T +W+ D++L + S ++
Sbjct: 551 V-------TVE----SVLLQAAAP-------SEDPGTKVLLVSWTYQDEELGSFLTSLLK 592
Query: 417 N 417
Sbjct: 593 K 593
>sp|Q5NVE6|INT11_PONAB Integrator complex subunit 11 OS=Pongo abelii GN=CPSF3L PE=2 SV=2
Length = 600
Score = 285 bits (729), Expect = 5e-76, Method: Compositional matrix adjust.
Identities = 144/329 (43%), Positives = 210/329 (63%), Gaps = 29/329 (8%)
Query: 1 VHKCVAGGGKVLIPAFALGRAQELCILLDDYWERMNLRVPIYFSAGLTIQANMYYKMLIS 60
VH+ V GGKVLIP FALGRAQELCILL+ +WERMNL+VPIYFS GLT +AN YYK+ I
Sbjct: 225 VHETVERGGKVLIPVFALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLFIP 284
Query: 61 WTSQKVKETY---NAFDFKNVHNFDRSLIDAPGPCVLFATPGMLTGGFSLEVFKHWAPSE 117
WT+QK+++T+ N F+FK++ FDR+ D PGP V+FATPGML G SL++F+ WA +E
Sbjct: 285 WTNQKIRKTFVQRNMFEFKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAGNE 344
Query: 118 MNLITLPGYCLAGTIGNKLMSGNPTIELEGTKI-DVRCQIHQLAFSPHTDGKGIMDLVKF 176
N++ +PGYC+ GT+G+K++SG +E+EG ++ +V+ Q+ ++FS H D KGIM LV
Sbjct: 345 KNMVIMPGYCVQGTVGHKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLVGQ 404
Query: 177 LSPQHVILVHGEKPKMATLKERIQSELGIKCYDPANNESMCIPSTHYVKAGASDAFIRSC 236
P+ V+LVHGE KM LK++I+ EL + CY PAN E++ +P++ + G S ++
Sbjct: 405 AEPESVLLVHGEAKKMEFLKQKIEQELRVSCYMPANGETVTLPTSPSIPVGISLGLLKRE 464
Query: 237 MNPNFQYLKSGSEEKSVSGSKCTEGTLPLWIKDERVAEGILVLEKSEKAKVVHQDELLLM 296
M +G LP K R+ G L++ K ++V ++ L
Sbjct: 465 M---------------------AQGLLP-EAKKPRLLHGTLIM-KDSNFRLVSSEQALKE 501
Query: 297 LGEKRHEVQFAYCCPVNVDELEKFTTTSL 325
LG H+++F C V++ + K T+L
Sbjct: 502 LGLAEHQLRFT--CRVHLHDTRKEQETAL 528
>sp|Q9CWS4|INT11_MOUSE Integrator complex subunit 11 OS=Mus musculus GN=Cpsf3l PE=2 SV=1
Length = 600
Score = 284 bits (726), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 142/329 (43%), Positives = 210/329 (63%), Gaps = 29/329 (8%)
Query: 1 VHKCVAGGGKVLIPAFALGRAQELCILLDDYWERMNLRVPIYFSAGLTIQANMYYKMLIS 60
VH+ V GGKVLIP FALGRAQELCILL+ +WERMNL+VPIYFS GLT +AN YYK+ I+
Sbjct: 225 VHETVERGGKVLIPVFALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLFIT 284
Query: 61 WTSQKVKETY---NAFDFKNVHNFDRSLIDAPGPCVLFATPGMLTGGFSLEVFKHWAPSE 117
WT+QK+++T+ N F+FK++ FDR+ D PGP V+FATPGML G SL++F+ WA +E
Sbjct: 285 WTNQKIRKTFVQRNMFEFKHIKAFDRTFADNPGPMVVFATPGMLHAGQSLQIFRKWAGNE 344
Query: 118 MNLITLPGYCLAGTIGNKLMSGNPTIELEGTK-IDVRCQIHQLAFSPHTDGKGIMDLVKF 176
N++ +PGYC+ GT+G+K++SG +E+EG + ++V+ Q+ ++FS H D KGIM LV
Sbjct: 345 KNMVIMPGYCVQGTVGHKILSGQRKLEMEGRQMLEVKMQVEYMSFSAHADAKGIMQLVGQ 404
Query: 177 LSPQHVILVHGEKPKMATLKERIQSELGIKCYDPANNESMCIPSTHYVKAGASDAFIRSC 236
P+ V+LVHGE KM L+++I+ E + CY PAN E++ +P++ + G S ++
Sbjct: 405 AEPESVLLVHGEAKKMEFLRQKIEQEFRVSCYMPANGETVTLPTSPSIPVGISLGLLKRE 464
Query: 237 MNPNFQYLKSGSEEKSVSGSKCTEGTLPLWIKDERVAEGILVLEKSEKAKVVHQDELLLM 296
M +G LP K R+ G L++ K ++V ++ L
Sbjct: 465 M---------------------VQGLLP-EAKKPRLLHGTLIM-KDSNFRLVSSEQALKE 501
Query: 297 LGEKRHEVQFAYCCPVNVDELEKFTTTSL 325
LG H+++F C V++ + K T+L
Sbjct: 502 LGLAEHQLRFT--CRVHLQDTRKEQETAL 528
>sp|Q3MHC2|INT11_RAT Integrator complex subunit 11 OS=Rattus norvegicus GN=Cpsf3l PE=2
SV=1
Length = 600
Score = 284 bits (726), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 142/329 (43%), Positives = 210/329 (63%), Gaps = 29/329 (8%)
Query: 1 VHKCVAGGGKVLIPAFALGRAQELCILLDDYWERMNLRVPIYFSAGLTIQANMYYKMLIS 60
VH+ V GGKVLIP FALGRAQELCILL+ +WERMNL+VPIYFS GLT +AN YYK+ I+
Sbjct: 225 VHETVERGGKVLIPVFALGRAQELCILLETFWERMNLKVPIYFSTGLTEKANHYYKLFIT 284
Query: 61 WTSQKVKETY---NAFDFKNVHNFDRSLIDAPGPCVLFATPGMLTGGFSLEVFKHWAPSE 117
WT+QK+++T+ N F+FK++ FDR+ D PGP V+FATPGML G SL++F+ WA +E
Sbjct: 285 WTNQKIRKTFVQRNMFEFKHIKAFDRTFADNPGPMVVFATPGMLHAGQSLQIFRKWAGNE 344
Query: 118 MNLITLPGYCLAGTIGNKLMSGNPTIELEGTK-IDVRCQIHQLAFSPHTDGKGIMDLVKF 176
N++ +PGYC+ GT+G+K++SG +E+EG + ++V+ Q+ ++FS H D KGIM LV
Sbjct: 345 KNMVIMPGYCVQGTVGHKILSGQRKLEMEGRQMLEVKMQVEYMSFSAHADAKGIMQLVGQ 404
Query: 177 LSPQHVILVHGEKPKMATLKERIQSELGIKCYDPANNESMCIPSTHYVKAGASDAFIRSC 236
P+ V+LVHGE KM L+++I+ E + CY PAN E++ +P++ + G S ++
Sbjct: 405 AEPESVLLVHGEAKKMEFLRQKIEQEFRVSCYMPANGETVTLPTSPSIPVGISLGLLKRE 464
Query: 237 MNPNFQYLKSGSEEKSVSGSKCTEGTLPLWIKDERVAEGILVLEKSEKAKVVHQDELLLM 296
M +G LP K R+ G L++ K ++V ++ L
Sbjct: 465 M---------------------VQGLLP-EAKKPRLLHGTLIM-KDNNFRLVSSEQALKE 501
Query: 297 LGEKRHEVQFAYCCPVNVDELEKFTTTSL 325
LG H+++F C V++ + K T+L
Sbjct: 502 LGLAEHQLRFT--CRVHLQDTRKEQETAL 528
>sp|Q2YDM2|INT11_BOVIN Integrator complex subunit 11 OS=Bos taurus GN=CPSF3L PE=2 SV=2
Length = 599
Score = 280 bits (716), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 139/323 (43%), Positives = 207/323 (64%), Gaps = 29/323 (8%)
Query: 1 VHKCVAGGGKVLIPAFALGRAQELCILLDDYWERMNLRVPIYFSAGLTIQANMYYKMLIS 60
VH+ V GGKVLIP FALGRAQELCILL+ +WERM+L+ PIYFS GLT +AN YYK+ I
Sbjct: 225 VHETVERGGKVLIPVFALGRAQELCILLETFWERMDLKAPIYFSTGLTEKANHYYKLFIP 284
Query: 61 WTSQKVKETY---NAFDFKNVHNFDRSLIDAPGPCVLFATPGMLTGGFSLEVFKHWAPSE 117
WT+QK+++T+ N F+FK++ FDR+ D+PGP V+FATPGML G SL++F+ WA +E
Sbjct: 285 WTNQKIRKTFVQRNMFEFKHIKAFDRAFADSPGPMVVFATPGMLHAGQSLQIFRKWAGNE 344
Query: 118 MNLITLPGYCLAGTIGNKLMSGNPTIELEGTKI-DVRCQIHQLAFSPHTDGKGIMDLVKF 176
N++ +PGYC+ GT+G+K++SG +E+EG ++ +V+ Q+ ++FS H D KGIM LV
Sbjct: 345 KNMVIMPGYCVQGTVGHKILSGQRKLEMEGRQVLEVKMQVEYMSFSAHADAKGIMQLVGQ 404
Query: 177 LSPQHVILVHGEKPKMATLKERIQSELGIKCYDPANNESMCIPSTHYVKAGASDAFIRSC 236
P++V+LVHGE KM LK++I+ E + CY PAN E++ +P++ + G S ++
Sbjct: 405 AEPENVLLVHGEAKKMEFLKQKIEQEFRVNCYMPANGETVTLPTSPSIPVGISLGLLKRE 464
Query: 237 MNPNFQYLKSGSEEKSVSGSKCTEGTLPLWIKDERVAEGILVLEKSEKAKVVHQDELLLM 296
M +G LP K R+ G L++ K ++V ++ L
Sbjct: 465 M---------------------AQGLLP-DAKKPRLLHGTLIM-KDSNFRLVSSEQALKE 501
Query: 297 LGEKRHEVQFAYCCPVNVDELEK 319
LG H+++F C V++ + K
Sbjct: 502 LGLAEHQLRFT--CRVHLHDTRK 522
>sp|Q54YL3|INT11_DICDI Integrator complex subunit 11 homolog OS=Dictyostelium discoideum
GN=ints11 PE=3 SV=1
Length = 744
Score = 278 bits (711), Expect = 6e-74, Method: Compositional matrix adjust.
Identities = 132/260 (50%), Positives = 181/260 (69%), Gaps = 17/260 (6%)
Query: 1 VHKCVAGGGKVLIPAFALGRAQELCILLDDYWERMNL-RVPIYFSAGLTIQANMYYKMLI 59
+H+CV GGKVLIP FALGR QELCIL+D YWE+MNL +PIYFSAGL +AN+YYK+ I
Sbjct: 224 IHECVEKGGKVLIPVFALGRVQELCILIDSYWEQMNLGHIPIYFSAGLAEKANLYYKLFI 283
Query: 60 SWTSQKVKETY---NAFDFKNVHNFDRSLIDAPGPCVLFATPGMLTGGFSLEVFKHWAPS 116
+WT+QK+K+T+ N FDFK++ F L+DAPG VLFATPGML G SLEVFK WAP+
Sbjct: 284 NWTNQKIKQTFVKRNMFDFKHIKPFQSHLVDAPGAMVLFATPGMLHAGASLEVFKKWAPN 343
Query: 117 EMNLITLPGYCLAGTIGNKLMSGNPT-----------IELE-GTKIDVRCQIHQLAFSPH 164
E+N+ +PGYC+ GT+GNKL++ +E++ T I+V+C+IH L+FS H
Sbjct: 344 ELNMTIIPGYCVVGTVGNKLLTTGSDQQQQSKPQSQMVEIDKKTTIEVKCKIHNLSFSAH 403
Query: 165 TDGKGIMDLVKFLSPQHVILVHGEKPKMATLKERIQSELGIKCYDPANNESMCIPSTHYV 224
D KGI+ L+K +P++VILVHGEK KM L ++I E+G+ CY PAN ++ I + +
Sbjct: 404 ADAKGILQLIKMSNPRNVILVHGEKEKMGFLSQKIIKEMGVNCYYPANGVTIIIDTMKSI 463
Query: 225 KAGAS-DAFIRSCMNPNFQY 243
S + R ++ ++QY
Sbjct: 464 PIDISLNLLKRQILDYSYQY 483
>sp|Q503E1|INT11_DANRE Integrator complex subunit 11 OS=Danio rerio GN=cpsf3l PE=2 SV=1
Length = 598
Score = 276 bits (705), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 148/371 (39%), Positives = 220/371 (59%), Gaps = 37/371 (9%)
Query: 1 VHKCVAGGGKVLIPAFALGRAQELCILLDDYWERMNLRVPIYFSAGLTIQANMYYKMLIS 60
VH+ V GGKVLIP FALGRAQELCILL+ +WERMNL+ PIYFS GLT +AN YYK+ I+
Sbjct: 225 VHETVERGGKVLIPVFALGRAQELCILLETFWERMNLKAPIYFSTGLTEKANHYYKLFIT 284
Query: 61 WTSQKVKETY---NAFDFKNVHNFDRSLIDAPGPCVLFATPGMLTGGFSLEVFKHWAPSE 117
WT+QK+++T+ N F+FK++ FDRS D PGP V+FATPGML G SL++FK WA +E
Sbjct: 285 WTNQKIRKTFVQRNMFEFKHIKAFDRSYADNPGPMVVFATPGMLHAGQSLQIFKKWAGNE 344
Query: 118 MNLITLPGYCLAGTIGNKLMSGNPTIELEG-TKIDVRCQIHQLAFSPHTDGKGIMDLVKF 176
N++ +PGYC+ GT+G+K+++G +E+EG +DV+ Q+ ++FS H D KGIM L++
Sbjct: 345 KNMVIMPGYCVQGTVGHKILNGQKKLEMEGRATLDVKLQVEYMSFSAHADAKGIMQLIRM 404
Query: 177 LSPQHVILVHGEKPKMATLKERIQSELGIKCYDPANNESMCIPSTHYVKAGASDAFIRSC 236
P++++LVHGE KM LK++I+ E I C+ PAN E+ I + V S ++
Sbjct: 405 AEPRNMLLVHGEAKKMEFLKDKIEQEFSISCFMPANGETTTIVTNPSVPVDISLNLLKRE 464
Query: 237 MNPNFQYLKSGSEEKSVSGSKCTEGTLPLWIKDERVAEGILVLEKSEKAKVVHQDELLLM 296
M G LP K R G L++ K ++V ++ L
Sbjct: 465 M--------------------ALGGPLP-DAKRPRTMHGTLIM-KDNSLRLVSPEQALKE 502
Query: 297 LGEKRHEVQFAYCCPV-------NVDELEKFTT--TSLTPTARMLRDPNKSSLIRLLVAK 347
LG H+++F C V + D L + T S+ + + P+ + ++ +V K
Sbjct: 503 LGLNEHQLRFT--CRVQLHDPHSDTDTLSRIYTHLKSVLKSYSIQHVPDGTVIVESIVIK 560
Query: 348 LSRKLSEGNIQ 358
++ E N++
Sbjct: 561 VTSSAEEPNLK 571
>sp|Q5ZIH0|INT11_CHICK Integrator complex subunit 11 OS=Gallus gallus GN=CPSF3L PE=2 SV=1
Length = 600
Score = 272 bits (695), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 140/329 (42%), Positives = 207/329 (62%), Gaps = 29/329 (8%)
Query: 1 VHKCVAGGGKVLIPAFALGRAQELCILLDDYWERMNLRVPIYFSAGLTIQANMYYKMLIS 60
VH+ V GGKVLIP FALGRAQELCILL+ +WERMNL+ PIYFS GLT +AN YYK+ I+
Sbjct: 225 VHETVERGGKVLIPVFALGRAQELCILLETFWERMNLKAPIYFSTGLTEKANHYYKLFIT 284
Query: 61 WTSQKVKETY---NAFDFKNVHNFDRSLIDAPGPCVLFATPGMLTGGFSLEVFKHWAPSE 117
WT+QK+++T+ N F+FK++ FDR+ D PGP V+FATPGML G SL++F+ WA +E
Sbjct: 285 WTNQKIRKTFVQRNMFEFKHIKAFDRAFADNPGPMVVFATPGMLHAGQSLQIFRKWAGNE 344
Query: 118 MNLITLPGYCLAGTIGNKLMSGNPTIELEGTKI-DVRCQIHQLAFSPHTDGKGIMDLVKF 176
N++ +PGYC+ GT+G+K++SG +E+EG +I +V+ Q+ ++FS H D KGIM L++
Sbjct: 345 KNMVIMPGYCVQGTVGHKILSGQRKLEMEGRQILEVKMQVEYMSFSAHADAKGIMQLIRQ 404
Query: 177 LSPQHVILVHGEKPKMATLKERIQSELGIKCYDPANNESMCIPSTHYVKAGASDAFIRSC 236
P++V+LVHGE KM LK++I+ E + CY PAN E+ I + + S ++
Sbjct: 405 AEPRNVLLVHGEAKKMEFLKQKIEQEFHVNCYMPANGETTTIFTNPSIPVDISLGLLK-- 462
Query: 237 MNPNFQYLKSGSEEKSVSGSKCTEGTLPLWIKDERVAEGILVLEKSEKAKVVHQDELLLM 296
E ++ G LP K ++ G L++ K ++V ++ L
Sbjct: 463 ------------RETAI-------GLLPD-AKKPKLMHGTLIM-KDNSFRLVSPEQALKE 501
Query: 297 LGEKRHEVQFAYCCPVNVDELEKFTTTSL 325
LG H+++F C V++ + K T L
Sbjct: 502 LGLAEHQLRFT--CRVHIQDPRKEHETVL 528
>sp|P79101|CPSF3_BOVIN Cleavage and polyadenylation specificity factor subunit 3 OS=Bos
taurus GN=CPSF3 PE=2 SV=1
Length = 684
Score = 150 bits (380), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 81/232 (34%), Positives = 134/232 (57%), Gaps = 15/232 (6%)
Query: 1 VHKCVAGGGKVLIPAFALGRAQELCILLDDYWERMN--LRVPIYFSAGLTIQANMYYKML 58
VH V GG+ LIP FALGRAQEL ++LD+YW+ +PIY+++ L + Y+
Sbjct: 226 VHDIVNRGGRGLIPVFALGRAQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQTY 285
Query: 59 ISWTSQKVKETYN---AFDFKNVHNFDRSL--IDAPGPCVLFATPGMLTGGFSLEVFKHW 113
++ + K+++ N F FK++ N +S+ D GP V+ A+PGM+ G S E+F+ W
Sbjct: 286 VNAMNDKIRKQININNPFVFKHISNL-KSMDHFDDIGPSVVMASPGMMQSGLSRELFESW 344
Query: 114 APSEMNLITLPGYCLAGTIGNKLMSGNPTI-ELEGTKIDVRCQIHQLAFSPHTDGKGIMD 172
+ N + + GYC+ GT+ +MS I + G K+ ++ + ++FS HTD + +
Sbjct: 345 CTDKRNGVIIAGYCVEGTLAKHIMSEPEEITTMSGQKLPLKMSVDYISFSAHTDYQQTSE 404
Query: 173 LVKFLSPQHVILVHGEKPKMATLKERI------QSELGIKCYDPANNESMCI 218
++ L P HVILVHGE+ +MA LK + E+ I+ ++P N E++ +
Sbjct: 405 FIRALKPPHVILVHGEQNEMARLKAALIREYEDNDEVHIEVHNPRNTEAVTL 456
>sp|Q9QXK7|CPSF3_MOUSE Cleavage and polyadenylation specificity factor subunit 3 OS=Mus
musculus GN=Cpsf3 PE=1 SV=2
Length = 684
Score = 150 bits (380), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 81/232 (34%), Positives = 134/232 (57%), Gaps = 15/232 (6%)
Query: 1 VHKCVAGGGKVLIPAFALGRAQELCILLDDYWERMN--LRVPIYFSAGLTIQANMYYKML 58
VH V GG+ LIP FALGRAQEL ++LD+YW+ +PIY+++ L + Y+
Sbjct: 226 VHDIVNRGGRGLIPVFALGRAQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQTY 285
Query: 59 ISWTSQKVKETYN---AFDFKNVHNFDRSL--IDAPGPCVLFATPGMLTGGFSLEVFKHW 113
++ + K+++ N F FK++ N +S+ D GP V+ A+PGM+ G S E+F+ W
Sbjct: 286 VNAMNDKIRKQININNPFVFKHISNL-KSMDHFDDIGPSVVMASPGMIQNGLSRELFESW 344
Query: 114 APSEMNLITLPGYCLAGTIGNKLMSGNPTI-ELEGTKIDVRCQIHQLAFSPHTDGKGIMD 172
+ N + + GYC+ GT+ +MS I + G K+ ++ + ++FS HTD + +
Sbjct: 345 CTDKRNGVIIAGYCVEGTLAKHIMSEPEEITTMSGQKLPLKMSVDYISFSAHTDYQQTSE 404
Query: 173 LVKFLSPQHVILVHGEKPKMATLKERI------QSELGIKCYDPANNESMCI 218
++ L P HVILVHGE+ +MA LK + E+ I+ ++P N E++ +
Sbjct: 405 FIRALKPPHVILVHGEQNEMARLKAALIREYEDNDEVHIEVHNPRNTEAVTL 456
>sp|Q9UKF6|CPSF3_HUMAN Cleavage and polyadenylation specificity factor subunit 3 OS=Homo
sapiens GN=CPSF3 PE=1 SV=1
Length = 684
Score = 150 bits (380), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 81/232 (34%), Positives = 134/232 (57%), Gaps = 15/232 (6%)
Query: 1 VHKCVAGGGKVLIPAFALGRAQELCILLDDYWERMN--LRVPIYFSAGLTIQANMYYKML 58
VH V GG+ LIP FALGRAQEL ++LD+YW+ +PIY+++ L + Y+
Sbjct: 226 VHDIVNRGGRGLIPVFALGRAQELLLILDEYWQNHPELHDIPIYYASSLAKKCMAVYQTY 285
Query: 59 ISWTSQKVKETYN---AFDFKNVHNFDRSL--IDAPGPCVLFATPGMLTGGFSLEVFKHW 113
++ + K+++ N F FK++ N +S+ D GP V+ A+PGM+ G S E+F+ W
Sbjct: 286 VNAMNDKIRKQININNPFVFKHISNL-KSMDHFDDIGPSVVMASPGMMQSGLSRELFESW 344
Query: 114 APSEMNLITLPGYCLAGTIGNKLMSGNPTI-ELEGTKIDVRCQIHQLAFSPHTDGKGIMD 172
+ N + + GYC+ GT+ +MS I + G K+ ++ + ++FS HTD + +
Sbjct: 345 CTDKRNGVIIAGYCVEGTLAKHIMSEPEEITTMSGQKLPLKMSVDYISFSAHTDYQQTSE 404
Query: 173 LVKFLSPQHVILVHGEKPKMATLKERI------QSELGIKCYDPANNESMCI 218
++ L P HVILVHGE+ +MA LK + E+ I+ ++P N E++ +
Sbjct: 405 FIRALKPPHVILVHGEQNEMARLKAALIREYEDNDEVHIEVHNPRNTEAVTL 456
>sp|O13794|YSH1_SCHPO Endoribonuclease ysh1 OS=Schizosaccharomyces pombe (strain 972 /
ATCC 24843) GN=ysh1 PE=3 SV=2
Length = 757
Score = 144 bits (364), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 80/233 (34%), Positives = 130/233 (55%), Gaps = 20/233 (8%)
Query: 1 VHKCVAGGGKVLIPAFALGRAQELCILLDDYW-ERMNLR-VPIYFSAGLTIQANMYYKML 58
+H + GG+VL+P FALGRAQEL ++LD+YW ++LR VPIY+++ L + ++
Sbjct: 233 IHSTIRNGGRVLMPVFALGRAQELLLILDEYWNNHLDLRSVPIYYASSLARKCMAIFQTY 292
Query: 59 ISWTSQKVKETY---NAFDFKNVHNFDRSL--IDAPGPCVLFATPGMLTGGFSLEVFKHW 113
++ + +++ + N F F+ V + R+L D GP V+ A+PGML G S + + W
Sbjct: 293 VNMMNDNIRKIFAERNPFIFRFVKSL-RNLEKFDDIGPSVILASPGMLQNGVSRTLLERW 351
Query: 114 APSEMNLITLPGYCLAGTIGNKLMSGNP--TIELEGTKIDVRCQIHQLAFSPHTDGKGIM 171
AP N + L GY + GT+ K ++ P + L G KI R + +L+F+ H D
Sbjct: 352 APDPRNTLLLTGYSVEGTMA-KQITNEPIEIVSLSGQKIPRRMAVEELSFAAHVDYLQNS 410
Query: 172 DLVKFLSPQHVILVHGEKPKMATLKERIQS-----ELGIKCYDPANNESMCIP 219
+ + ++ H+ILVHGE+ M LK + S ++ +K Y P N C+P
Sbjct: 411 EFIDLVNADHIILVHGEQTNMGRLKSALASKFHNRKVDVKVYTPRN----CVP 459
>sp|Q74ZC0|YSH1_ASHGO Endoribonuclease YSH1 OS=Ashbya gossypii (strain ATCC 10895 / CBS
109.51 / FGSC 9923 / NRRL Y-1056) GN=YSH1 PE=3 SV=2
Length = 771
Score = 142 bits (358), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 76/236 (32%), Positives = 127/236 (53%), Gaps = 26/236 (11%)
Query: 1 VHKCVAGGGKVLIPAFALGRAQELCILLDDYWERM-----NLRVPIYFSAGLTIQANMYY 55
+H V+ GG+VL+P FALGRAQE+ ++LD+YW + N +VPI++++ L + +
Sbjct: 232 IHTTVSKGGRVLLPVFALGRAQEIMLILDEYWSQHAEQLGNGQVPIFYASNLARKCMSVF 291
Query: 56 KMLISWTSQKVKETY-----NAFDFKNV---HNFDRSLIDAPGPCVLFATPGMLTGGFSL 107
+ ++ + K+++ + N F FKN+ N D GP V+ A+PGML G S
Sbjct: 292 QTYVNMMNDKIRKKFRDSQTNPFIFKNISYLKNLDE--FQDFGPSVMLASPGMLQNGLSR 349
Query: 108 EVFKHWAPSEMNLITLPGYCLAGTIGNKLM---SGNPTIELEGTKIDVRCQIHQLAFSPH 164
++ + W P E NL+ + GY + GT+ LM P+I I RCQ+ +++F+ H
Sbjct: 350 DLLEKWCPDEKNLVLITGYSVEGTMAKFLMLEPETIPSINNSDVSIPRRCQVEEISFAAH 409
Query: 165 TDGKGIMDLVKFLSPQHVILVHGEKPKMATLKERIQSELG--------IKCYDPAN 212
D + ++ V+ + ++ILVHGE M LK + S ++ Y+P N
Sbjct: 410 VDFRENLEFVEKIGAPNIILVHGESNPMGRLKSALLSNFSSLKGTEDEVRVYNPRN 465
>sp|Q86A79|CPSF3_DICDI Cleavage and polyadenylation specificity factor subunit 3
OS=Dictyostelium discoideum GN=cpsf3 PE=3 SV=1
Length = 774
Score = 142 bits (357), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 76/226 (33%), Positives = 124/226 (54%), Gaps = 8/226 (3%)
Query: 1 VHKCVAGGGKVLIPAFALGRAQELCILLDDYW--ERMNLRVPIYFSAGLTIQANMYYKML 58
VH+ V GK LIP FALGRAQEL ++LD+YW VPIY+++ L + Y+
Sbjct: 253 VHQVVERNGKCLIPVFALGRAQELLLILDEYWIANPQLHHVPIYYASALAKKCMGVYRTY 312
Query: 59 ISWTSQKVKETY---NAFDFKNVHNFDR-SLIDAPGPCVLFATPGMLTGGFSLEVFKHWA 114
I+ + +V+ + N F+FK++ N D GPCV A+PGML G S ++F+ W
Sbjct: 313 INMMNDRVRAQFDVSNPFEFKHIKNIKGIESFDDRGPCVFMASPGMLQSGLSRQLFERWC 372
Query: 115 PSEMNLITLPGYCLAGTIGNKLMSGNPTI-ELEGTKIDVRCQIHQLAFSPHTDGKGIMDL 173
+ N I +PGY + GT+ +MS I L+ + + + ++FS H+D +
Sbjct: 373 SDKRNGIVIPGYSVEGTLAKHIMSEPAEITRLDNVNVPLNLTVSYVSFSAHSDFLQTSEF 432
Query: 174 VKFLSPQHVILVHGEKPKMATLKERIQSEL-GIKCYDPANNESMCI 218
++ + P HV+LVHG+ +M+ L++ + ++ I P N S+ +
Sbjct: 433 IQEIQPPHVVLVHGDANEMSRLRQSLVAKFKTINVLTPKNAMSVAL 478
>sp|Q59P50|YSH1_CANAL Endoribonuclease YSH1 OS=Candida albicans (strain SC5314 / ATCC
MYA-2876) GN=YSH1 PE=3 SV=1
Length = 870
Score = 139 bits (350), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 92/284 (32%), Positives = 140/284 (49%), Gaps = 27/284 (9%)
Query: 1 VHKCVAGGGKVLIPAFALGRAQELCILLDDYWERMN--LRVPIYFSAGLTIQANMYYKML 58
+H +A GG+VL+P FALG AQEL ++LD+YW + V +++++ L + Y+
Sbjct: 324 IHATIAKGGRVLLPVFALGNAQELLLILDEYWSQNEDLQNVNVFYASNLAKKCMAVYETY 383
Query: 59 ISWTSQKVK------ETYNAFDFKNVHNF-DRSLIDAPGPCVLFATPGMLTGGFSLEVFK 111
+ K++ E N FDFK + + D S GP V+ ATPGML G S ++ +
Sbjct: 384 TGIMNDKIRLSSASSEKSNPFDFKYIKSIKDLSKFQDMGPSVVVATPGMLQAGVSRQLLE 443
Query: 112 HWAPSEMNLITLPGYCLAGTIGNKLMSGNPTIELEGTKIDV----RCQIHQLAFSPHTDG 167
WAP NL+ L GY + GT+ +L+ PT+ T D+ R I +++F+ H D
Sbjct: 444 KWAPDGKNLVILTGYSVEGTMAKELLK-EPTMIQSATNPDMTIPRRIGIEEISFAAHVDF 502
Query: 168 KGIMDLVKFLSPQHVILVHGEKPKMATLKERIQSELG--------IKCYDPANNESMCIP 219
+ + ++ +SP VILVHG+ M LK + S+ +K Y+P N E + I
Sbjct: 503 QQNSEFIEKVSPSKVILVHGDSVPMGRLKSALLSKYASRKGTDQEVKVYNPKNCEELIIG 562
Query: 220 STHYVKAGASDAFIRSCMNPNFQYLKS-GSEEKSVSGSKCTEGT 262
K + S Q LK +E S SK TE T
Sbjct: 563 ----FKGLKIAKVLGSLAEEQLQVLKKIIQDEVSAENSKITELT 602
>sp|Q4PEJ3|YSH1_USTMA Endoribonuclease YSH1 OS=Ustilago maydis (strain 521 / FGSC 9021)
GN=YSH1 PE=3 SV=1
Length = 880
Score = 139 bits (350), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 77/232 (33%), Positives = 123/232 (53%), Gaps = 15/232 (6%)
Query: 1 VHKCVAGGGKVLIPAFALGRAQELCILLDDYWERMN--LRVPIYFSAGLTIQANMYYKML 58
+H + GG+VL+P F LGRAQEL +LLD+YW VPIY+++ L + Y+
Sbjct: 242 IHHIIKRGGRVLLPVFVLGRAQELLLLLDEYWAAHPELHSVPIYYASALAKKCISVYQTY 301
Query: 59 ISWTSQKVKETYNAFD----FKNVHNFDRSL--IDAPGPCVLFATPGMLTGGFSLEVFKH 112
I + ++ +N D FK++ N RSL + GPCV+ A+PG + G S E+ +
Sbjct: 302 IHTMNDHIRTRFNRRDNPFVFKHISNL-RSLEKFEDRGPCVMMASPGFMQSGVSRELLER 360
Query: 113 WAPSEMNLITLPGYCLAGTIG-NKLMSGNPTIELEGTKIDVRCQIHQLAFSPHTDGKGIM 171
WAP + N + + GY + GT+ N L + I + G KI R + ++FS H D
Sbjct: 361 WAPDKRNGLIVSGYSVEGTMARNILNEPDEIIGINGQKIPRRMSVDYISFSAHVDFAQNS 420
Query: 172 DLVKFLSPQHVILVHGEKPKMATLKERIQSEL-----GIKCYDPANNESMCI 218
+ + QH++LVHGE+ M+ L+ +Q+ +K + P N E + +
Sbjct: 421 RFIDEIKAQHIVLVHGEQNNMSKLRAALQARFTARGSDVKIHTPRNCEPLVL 472
>sp|Q6CUI5|YSH1_KLULA Endoribonuclease YSH1 OS=Kluyveromyces lactis (strain ATCC 8585 /
CBS 2359 / DSM 70799 / NBRC 1267 / NRRL Y-1140 / WM37)
GN=YSH1 PE=3 SV=1
Length = 764
Score = 139 bits (349), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 71/236 (30%), Positives = 125/236 (52%), Gaps = 26/236 (11%)
Query: 1 VHKCVAGGGKVLIPAFALGRAQELCILLDDYWERM-----NLRVPIYFSAGLTIQANMYY 55
+H V+ GG+VL+P FALGRAQE+ ++LD+YW+ N +VPI++++ L + +
Sbjct: 235 IHTVVSKGGRVLLPVFALGRAQEIMLILDEYWQNHKEELGNGQVPIFYASNLAKKCMSVF 294
Query: 56 KMLISWTSQKVKETY-----NAFDFKNV---HNFDRSLIDAPGPCVLFATPGMLTGGFSL 107
+ ++ + +++ + N F FKN+ N D + GP V+ A+PGML G S
Sbjct: 295 QTYVNMMNDDIRKKFKDSQTNPFIFKNISYLKNLDE--FEDFGPSVMLASPGMLQNGLSR 352
Query: 108 EVFKHWAPSEMNLITLPGYCLAGTIGNKLM---SGNPTIELEGTKIDVRCQIHQLAFSPH 164
++ + W P E NL+ + GY + GT+ L+ P++ I RCQ+ ++ F+ H
Sbjct: 353 DILEKWCPEEKNLVLVTGYSVEGTMAKYLLLEPEAIPSVHNPEITIPRRCQVDEITFAAH 412
Query: 165 TDGKGIMDLVKFLSPQHVILVHGEKPKMATLKERIQSELG--------IKCYDPAN 212
D + ++ ++ + ++ILVHGE M LK + S + ++P N
Sbjct: 413 VDFRENLEFIELIGASNIILVHGESNPMGRLKSALLSNFSSLKDTENEVHVFNPRN 468
>sp|Q9C952|CPSF3_ARATH Cleavage and polyadenylation specificity factor subunit 3-I
OS=Arabidopsis thaliana GN=CPSF73-I PE=1 SV=1
Length = 693
Score = 135 bits (339), Expect = 7e-31, Method: Compositional matrix adjust.
Identities = 78/226 (34%), Positives = 123/226 (54%), Gaps = 10/226 (4%)
Query: 1 VHKCVAGGGKVLIPAFALGRAQELCILLDDYWERM-NLR-VPIYFSAGLTIQANMYYKML 58
+H VA GG+VLIPAFALGRAQEL ++LD+YW +L +PIY+++ L + Y+
Sbjct: 236 IHSTVAQGGRVLIPAFALGRAQELLLILDEYWANHPDLHNIPIYYASPLAKKCMAVYQTY 295
Query: 59 ISWTSQKVKETY---NAFDFKNVHNFDR-SLIDAPGPCVLFATPGMLTGGFSLEVFKHWA 114
I + +++ + N F FK++ + + GP V+ ATPG L G S ++F W
Sbjct: 296 ILSMNDRIRNQFANSNPFVFKHISPLNSIDDFNDVGPSVVMATPGGLQSGLSRQLFDSWC 355
Query: 115 PSEMNLITLPGYCLAGTIGNKLMSGNPTIEL-EGTKIDVRCQIHQLAFSPHTDGKGIMDL 173
+ N +PGY + GT+ +++ + L G + Q+H ++FS H D
Sbjct: 356 SDKKNACIIPGYMVEGTLAKTIINEPKEVTLMNGLTAPLNMQVHYISFSAHADYAQTSTF 415
Query: 174 VKFLSPQHVILVHGEKPKMATLKERIQSEL---GIKCYDPANNESM 216
+K L P ++ILVHGE +M LK+++ +E K P N ES+
Sbjct: 416 LKELMPPNIILVHGEANEMMRLKQKLLTEFPDGNTKIMTPKNCESV 461
>sp|Q06224|YSH1_YEAST Endoribonuclease YSH1 OS=Saccharomyces cerevisiae (strain ATCC
204508 / S288c) GN=YSH1 PE=1 SV=1
Length = 779
Score = 135 bits (339), Expect = 9e-31, Method: Compositional matrix adjust.
Identities = 73/235 (31%), Positives = 128/235 (54%), Gaps = 24/235 (10%)
Query: 1 VHKCVAGGGKVLIPAFALGRAQELCILLDDYW-----ERMNLRVPIYFSAGLTIQANMYY 55
+H V GG+VL+P FALGRAQE+ ++LD+YW E +VPI++++ L + +
Sbjct: 231 IHSTVMRGGRVLLPVFALGRAQEIMLILDEYWSQHADELGGGQVPIFYASNLAKKCMSVF 290
Query: 56 KMLISWTSQKVKETY-----NAFDFKNVHNFDRSLIDAP--GPCVLFATPGMLTGGFSLE 108
+ ++ + +++ + N F FKN+ ++ R+L D GP V+ A+PGML G S +
Sbjct: 291 QTYVNMMNDDIRKKFRDSQTNPFIFKNI-SYLRNLEDFQDFGPSVMLASPGMLQSGLSRD 349
Query: 109 VFKHWAPSEMNLITLPGYCLAGTIGNKLM---SGNPTIELEGTKIDVRCQIHQLAFSPHT 165
+ + W P + NL+ + GY + GT+ +M P+I I RCQ+ +++F+ H
Sbjct: 350 LLERWCPEDKNLVLITGYSIEGTMAKFIMLEPDTIPSINNPEITIPRRCQVEEISFAAHV 409
Query: 166 DGKGIMDLVKFLSPQHVILVHGEKPKMATLKERIQSELG--------IKCYDPAN 212
D + ++ ++ +S ++ILVHGE M LK + S + ++P N
Sbjct: 410 DFQENLEFIEKISAPNIILVHGEANPMGRLKSALLSNFASLKGTDNEVHVFNPRN 464
>sp|Q5BEP0|YSH1_EMENI Endoribonuclease ysh1 OS=Emericella nidulans (strain FGSC A4 / ATCC
38163 / CBS 112.46 / NRRL 194 / M139) GN=ysh1 PE=3 SV=1
Length = 884
Score = 132 bits (332), Expect = 6e-30, Method: Compositional matrix adjust.
Identities = 84/260 (32%), Positives = 129/260 (49%), Gaps = 51/260 (19%)
Query: 8 GGKVLIPAFALGRAQELCILLDDYWERMN--LRVPIYFSAGLTIQANMYYKMLISWTSQK 65
GG+VL+P FALGRAQEL ++L++YWE ++PIY+ + + Y+ I +
Sbjct: 250 GGRVLMPVFALGRAQELLLILEEYWETHPELQKIPIYYIGNTARRCMVVYQTYIGAMNDN 309
Query: 66 VKETYNA-----------------FDFKNVHNFDRSL--IDAPGPCVLFATPGMLTGGFS 106
+K + +DFK V + RSL D G CV+ A+PGML G S
Sbjct: 310 IKRLFRQRMAEAEASGDKSVSAGPWDFKYVRSL-RSLERFDDVGGCVMLASPGMLQTGTS 368
Query: 107 LEVFKHWAPSEMNLITLPGYCLAGTIGNKL----------------------MSGNPTIE 144
E+ + WAP+E N + + GY + GT+ +L M+GN E
Sbjct: 369 RELLERWAPNERNGVVMTGYSVEGTMAKQLLNEPDQIHAVMSRAATGMGRTRMNGND--E 426
Query: 145 LEGTKIDVRCQIHQLAFSPHTDGKGIMDLVKFLSPQHVILVHGEKPKMATLKERI----- 199
+ I RC + +++F+ H DG + ++ +S VILVHGEK +M LK ++
Sbjct: 427 EQKIMIPRRCTVDEISFAAHVDGVENRNFIEEVSAPVVILVHGEKHQMMRLKSKLLSLNA 486
Query: 200 QSELGIKCYDPANNESMCIP 219
+ + +K Y PAN E + IP
Sbjct: 487 EKTVKVKVYTPANCEEVRIP 506
>sp|Q4IPN9|YSH1_GIBZE Endoribonuclease YSH1 OS=Gibberella zeae (strain PH-1 / ATCC
MYA-4620 / FGSC 9075 / NRRL 31084) GN=YSH1 PE=3 SV=2
Length = 833
Score = 131 bits (330), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 80/256 (31%), Positives = 126/256 (49%), Gaps = 46/256 (17%)
Query: 8 GGKVLIPAFALGRAQELCILLDDYWERMN--LRVPIYFSAGLTIQANMYYKMLISWTSQK 65
GG+VL+P FALGRAQEL ++LD+YW + + PIY+++ L + + Y+ + +
Sbjct: 250 GGRVLMPVFALGRAQELLLILDEYWGKHADFQKYPIYYASNLARKCMLIYQTYVGAMNDN 309
Query: 66 VKETY-----------------NAFDFKNVH---NFDRSLIDAPGPCVLFATPGMLTGGF 105
+K + +DFK + N DR D G CV+ A+PGML G
Sbjct: 310 IKRLFRERMAEAEASGDGAGKGGPWDFKYIRSLKNLDR--FDDVGGCVMLASPGMLQNGV 367
Query: 106 SLEVFKHWAPSEMNLITLPGYCLAGTIGNKLMSGNPTIEL---------------EGTKI 150
S E+ + WAPSE N + + GY + GT+ ++M I+ +G K+
Sbjct: 368 SRELLERWAPSEKNGVIITGYSVEGTMAKQIMQEPDQIQAVMSRSMAGARRMPGGDGEKV 427
Query: 151 DV--RCQIHQLAFSPHTDGKGIMDLVKFLSPQHVILVHGEKPKMATLKERIQS-----EL 203
+ RC + + +F+ H DG + ++ + VILVHGE+ M LK ++ S
Sbjct: 428 LIPRRCSVQEYSFAAHVDGVENREFIEEVQAPVVILVHGEQHNMMRLKSKLLSLNANKTA 487
Query: 204 GIKCYDPANNESMCIP 219
+K Y P N E + IP
Sbjct: 488 KVKVYSPRNCEELRIP 503
>sp|Q6FUA5|YSH1_CANGA Endoribonuclease YSH1 OS=Candida glabrata (strain ATCC 2001 / CBS
138 / JCM 3761 / NBRC 0622 / NRRL Y-65) GN=YSH1 PE=3
SV=1
Length = 771
Score = 131 bits (329), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 69/217 (31%), Positives = 120/217 (55%), Gaps = 18/217 (8%)
Query: 1 VHKCVAGGGKVLIPAFALGRAQELCILLDDYW-----ERMNLRVPIYFSAGLTIQANMYY 55
+H V GG+VL+P FALGRAQEL ++LD+YW E + ++PI++++ L + +
Sbjct: 230 IHSTVNKGGRVLMPVFALGRAQELMLILDEYWSQHKEELGSNQIPIFYASNLARKCLSVF 289
Query: 56 KMLISWTSQKVKETY-----NAFDFKNV---HNFDRSLIDAPGPCVLFATPGMLTGGFSL 107
+ ++ + +++ + N F FKN+ N D GP V+ A+PGML G S
Sbjct: 290 QTYVNMMNDNIRKKFRDSQTNPFIFKNIAYIKNLDE--FQDFGPSVMLASPGMLQNGLSR 347
Query: 108 EVFKHWAPSEMNLITLPGYCLAGTIGNKLM---SGNPTIELEGTKIDVRCQIHQLAFSPH 164
++ + W P E NL+ + GY + GT+ L+ P++ I RC++ +L+F+ H
Sbjct: 348 DLLERWCPDEKNLVLITGYSVEGTMAKYLLLEPDTIPSVSNPEVTIPRRCRVEELSFAAH 407
Query: 165 TDGKGIMDLVKFLSPQHVILVHGEKPKMATLKERIQS 201
D + ++ ++ ++ ++ILVHGE M LK + S
Sbjct: 408 VDFQENLEFIEQINASNIILVHGEPNPMGRLKSALLS 444
>sp|Q4WRC2|YSH1_ASPFU Endoribonuclease ysh1 OS=Neosartorya fumigata (strain ATCC MYA-4609
/ Af293 / CBS 101355 / FGSC A1100) GN=ysh1 PE=3 SV=1
Length = 872
Score = 131 bits (329), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 84/258 (32%), Positives = 129/258 (50%), Gaps = 47/258 (18%)
Query: 8 GGKVLIPAFALGRAQELCILLDDYWERMN--LRVPIYFSAGLTIQANMYYKMLISWTSQK 65
GG+VL+P FALGRAQEL ++LD+YWE ++PIY+ + + Y+ I +
Sbjct: 251 GGRVLMPVFALGRAQELLLILDEYWETHPELQKIPIYYIGNTARRCMVVYQTYIGAMNDN 310
Query: 66 VKETYNA-----------------FDFKNVHNFDRSL--IDAPGPCVLFATPGMLTGGFS 106
+K + +DFK V + RSL D G CV+ A+PGML G S
Sbjct: 311 IKRLFRQRMAEAEASGDKSASAGPWDFKFVRSL-RSLERFDDVGGCVMLASPGMLQTGTS 369
Query: 107 LEVFKHWAPSEMNLITLPGYCLAGTIGNKLMSGNPTIE--------------LEGTK--- 149
E+ + WAP+E N + + GY + GT+ +L++ I L GT
Sbjct: 370 RELLERWAPNERNGVVMTGYSVEGTMAKQLLNEPEQIPAVMSRSAGGVSRRGLAGTDEEQ 429
Query: 150 ---IDVRCQIHQLAFSPHTDGKGIMDLVKFLSPQHVILVHGEKPKMATLKERIQS----- 201
I RC + +++F+ H DG + ++ ++ VILVHGEK +M LK ++ S
Sbjct: 430 KIMIPRRCTVDEISFAAHVDGVENRNFIEEVAAPVVILVHGEKHQMMRLKSKLLSLNADK 489
Query: 202 ELGIKCYDPANNESMCIP 219
+ +K Y PAN + + IP
Sbjct: 490 AVKVKVYTPANCDEVRIP 507
>sp|Q6C2Z7|YSH1_YARLI Endoribonuclease YSH1 OS=Yarrowia lipolytica (strain CLIB 122 / E
150) GN=YSH1 PE=3 SV=2
Length = 827
Score = 129 bits (325), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 76/237 (32%), Positives = 127/237 (53%), Gaps = 23/237 (9%)
Query: 1 VHKCVAGGGKVLIPAFALGRAQELCILLDDYWERM-NLR-VPIYFSAGLTIQANMYYKML 58
+H + GGK L+P FALGRAQE+ ++LD+YWE +L+ IY+++ L + Y+
Sbjct: 252 IHSTLDKGGKCLLPVFALGRAQEILLILDEYWEAHPDLQEFSIYYASALAKKCIAVYQTY 311
Query: 59 ISWTSQKVKETY-----NAFDFK---NVHNFDRSLIDAPGPCVLFATPGMLTGGFSLEVF 110
I+ + ++ + N F FK N+ N DR D GPCV+ A+PGML G S +
Sbjct: 312 INMMNDNIRRRFRDQKTNPFRFKYIKNIKNLDR--FDDMGPCVMVASPGMLQSGVSRSLL 369
Query: 111 KHWAPSEMNLITLPGYCLAGTIGNKLMSGN---PTIELEGTKIDVRCQIHQLAFSPHTDG 167
+ WAP N + L GY + GT+ ++++ P+ + K+ R + +L+F+ H D
Sbjct: 370 ERWAPDPKNTLILTGYSVEGTMAKQIINEPNEIPSAQNPDLKVPRRLAVEELSFAAHVDF 429
Query: 168 KGIMDLVKFLSPQHVILVHGEKPKMATLKERIQSEL-GIK-------CYDPANNESM 216
+ + + + +++ILVHGE M LK + ++ G+K Y+P N E +
Sbjct: 430 QQNSEFIDLVDSKNIILVHGELNNMQRLKAALLAKYRGLKNSPREKTIYNPRNCEEV 486
>sp|Q8WZS6|YSH1_NEUCR Endoribonuclease ysh-1 OS=Neurospora crassa (strain ATCC 24698 /
74-OR23-1A / CBS 708.71 / DSM 1257 / FGSC 987) GN=ysh-1
PE=3 SV=1
Length = 850
Score = 127 bits (319), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 80/256 (31%), Positives = 127/256 (49%), Gaps = 46/256 (17%)
Query: 8 GGKVLIPAFALGRAQELCILLDDYWERMN--LRVPIYFSAGLTIQANMYYKMLISWTSQK 65
GG+VL+P FALGRAQEL ++LD+YW + + PIY+++ L + + Y+ + +
Sbjct: 249 GGRVLMPVFALGRAQELLLILDEYWGKHAEYQKYPIYYASNLARKCMLVYQTYVGSMNDN 308
Query: 66 VKETY-----------------NAFDFK---NVHNFDRSLIDAPGPCVLFATPGMLTGGF 105
+K + +DF+ ++ + DR + G CV+ A+PGML G
Sbjct: 309 IKRLFRERLAESESSGDGAGKGGPWDFRFIRSLKSLDR--FEDVGGCVMLASPGMLQNGV 366
Query: 106 SLEVFKHWAPSEMNLITLPGYCLAGTIGNKL----------MSGN-------PTIELEGT 148
S E+ + WAPSE N + + GY + GT+ +L MS N P + E
Sbjct: 367 SRELLERWAPSEKNGVIITGYSVEGTMAKQLLQEPEQIQAVMSRNIAGARRGPGGDAEKV 426
Query: 149 KIDVRCQIHQLAFSPHTDGKGIMDLVKFLSPQHVILVHGEKPKMATLKERIQS-----EL 203
I RC + + +F+ H DG + ++ ++ VILVHGE M LK ++ S E
Sbjct: 427 MIPRRCTVQEFSFAAHVDGVENREFIEEVAAPVVILVHGEVHNMMRLKSKLLSLNATKEH 486
Query: 204 GIKCYDPANNESMCIP 219
+K + P N E + IP
Sbjct: 487 KVKVFSPRNCEELRIP 502
>sp|Q6BMW3|YSH1_DEBHA Endoribonuclease YSH1 OS=Debaryomyces hansenii (strain ATCC 36239 /
CBS 767 / JCM 1990 / NBRC 0083 / IGC 2968) GN=YSH1 PE=3
SV=2
Length = 815
Score = 126 bits (316), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 73/243 (30%), Positives = 123/243 (50%), Gaps = 27/243 (11%)
Query: 1 VHKCVAGGGKVLIPAFALGRAQELCILLDDYWERMN--LRVPIYFSAGLTIQANMYYKML 58
+H + GG++L+P FALGRAQEL ++L++YW + + IY+++ L + Y+
Sbjct: 244 IHSTLLKGGRILMPVFALGRAQELLLILEEYWSLNDDLQNINIYYASSLARKCMAVYQTY 303
Query: 59 ISWTSQKVKET---------YNAFDFK---NVHNFDRSLIDAPGPCVLFATPGMLTGGFS 106
+ + ++ T N F FK ++ N D+ GPCV+ A+PGML G S
Sbjct: 304 TNIMNDSIRLTTSATNSSKKQNPFQFKFIKSIKNLDK--FQDFGPCVVVASPGMLQNGVS 361
Query: 107 LEVFKHWAPSEMNLITLPGYCLAGTIGNKLMSGNPTIEL---EGTKIDVRCQIHQLAFSP 163
E+ + WAP N + + GY + GT+ L++ TI+ I R I +++F+
Sbjct: 362 RELLERWAPDPKNAVIMTGYSVEGTMAKDLLTEPHTIQSAMNSDMTIPRRLSIEEISFAA 421
Query: 164 HTDGKGIMDLVKFLSPQHVILVHGEKPKMATLK--------ERIQSELGIKCYDPANNES 215
H D + ++ ++P +ILVHGE M LK R +E +K ++P N +
Sbjct: 422 HVDFQQNASFIEKVNPSKIILVHGESNPMGRLKSALLSKYASRKGTEQEVKVFNPRNCDE 481
Query: 216 MCI 218
+ I
Sbjct: 482 VTI 484
>sp|Q58633|Y1236_METJA Uncharacterized protein MJ1236 OS=Methanocaldococcus jannaschii
(strain ATCC 43067 / DSM 2661 / JAL-1 / JCM 10045 / NBRC
100440) GN=MJ1236 PE=4 SV=1
Length = 634
Score = 122 bits (307), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 80/229 (34%), Positives = 123/229 (53%), Gaps = 19/229 (8%)
Query: 8 GGKVLIPAFALGRAQELCILLDDYWERMNLRVPIYFSAGLTIQANMYYKMLISWTSQKVK 67
GGKVLIP F +GRAQEL ++L++ + + P+Y G+ +A + + S++++
Sbjct: 406 GGKVLIPVFGVGRAQELMLVLEEGYNQGIFNAPVYLD-GMIWEATAIHTAYPEYLSKEMR 464
Query: 68 -----ETYNAF---DFKNV--HNFDRSLIDAPGPCVLFATPGMLTGGFSLEVFKHWAPSE 117
E N F FK V N R +ID+ PCV+ AT GMLTGG S+E KH AP E
Sbjct: 465 QKIFHEGDNPFLSEVFKRVGSTNERRKVIDSDEPCVILATSGMLTGGPSVEYLKHLAPDE 524
Query: 118 MNLITLPGYCLAGTIGNKLMSG---NPTIELEG--TKIDVRCQIHQL-AFSPHTDGKGIM 171
N I GY GT+G K+ SG P I G I + Q++ + FS H+D K ++
Sbjct: 525 KNAIIFVGYQAEGTLGRKVQSGWKEIPIITRNGKTKSIPINLQVYTIEGFSGHSDRKQLI 584
Query: 172 DLVKFL--SPQHVILVHGEKPKMATLKERIQSELGIKCYDPANNESMCI 218
++ L SP+ +I+VHGE+ K + ++ + Y P N +++ +
Sbjct: 585 KYIRRLKPSPEKIIMVHGEESKCLDFADTVRRLFKKQTYVPMNLDAIRV 633
>sp|P0CM88|YSH1_CRYNJ Endoribonuclease YSH1 OS=Cryptococcus neoformans var. neoformans
serotype D (strain JEC21 / ATCC MYA-565) GN=YSH1 PE=3
SV=1
Length = 773
Score = 115 bits (289), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 68/234 (29%), Positives = 116/234 (49%), Gaps = 18/234 (7%)
Query: 1 VHKCVAGGGKVLIPAFALGRAQELCILLDDYW----ERMNLRVPIYFSAGLTIQANMYYK 56
V V GG+ L+P + G QEL +LLD+YW E N +P+YF++ L + YK
Sbjct: 249 VANIVRRGGRCLMPIPSFGNGQELALLLDEYWNDHPELQN--IPVYFASSLFQRGMRVYK 306
Query: 57 MLISWTSQKVKETY----NAFDFKNVHNFD--RSLIDAPGPCVLFATPGMLTGGFSLEVF 110
+ + ++ + N FDF+ V + L + GPCV+ ++P ++ G S ++
Sbjct: 307 TYVHTMNANIRSRFARRDNPFDFRFVKWLKDPQKLRENKGPCVIMSSPQFMSFGLSRDLL 366
Query: 111 KHWAPSEMNLITLPGYCLAGTIGNKLMSGNPTIE-LEGTKIDVRCQIHQLAFSPHTDGKG 169
+ WAP N + + GY + GT+ L+S IE L+G + R + +++F H D
Sbjct: 367 EEWAPDSKNGVIVTGYSIEGTMARTLLSEPDHIESLKGGNVPRRLTVKEISFGAHVDYAQ 426
Query: 170 IMDLVKFLSPQHVILVHGEKPKMATLKERIQSELG-----IKCYDPANNESMCI 218
++ + QHV+LVHGE +M L+ ++ I + P N E + +
Sbjct: 427 NSKFIQEIGAQHVVLVHGEASQMGRLRAALRDTYAAKGQEINIHTPKNCEPLTL 480
>sp|P0CM89|YSH1_CRYNB Endoribonuclease YSH1 OS=Cryptococcus neoformans var. neoformans
serotype D (strain B-3501A) GN=YSH1 PE=3 SV=1
Length = 773
Score = 115 bits (289), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 68/234 (29%), Positives = 116/234 (49%), Gaps = 18/234 (7%)
Query: 1 VHKCVAGGGKVLIPAFALGRAQELCILLDDYW----ERMNLRVPIYFSAGLTIQANMYYK 56
V V GG+ L+P + G QEL +LLD+YW E N +P+YF++ L + YK
Sbjct: 249 VANIVRRGGRCLMPIPSFGNGQELALLLDEYWNDHPELQN--IPVYFASSLFQRGMRVYK 306
Query: 57 MLISWTSQKVKETY----NAFDFKNVHNFD--RSLIDAPGPCVLFATPGMLTGGFSLEVF 110
+ + ++ + N FDF+ V + L + GPCV+ ++P ++ G S ++
Sbjct: 307 TYVHTMNANIRSRFARRDNPFDFRFVKWLKDPQKLRENKGPCVIMSSPQFMSFGLSRDLL 366
Query: 111 KHWAPSEMNLITLPGYCLAGTIGNKLMSGNPTIE-LEGTKIDVRCQIHQLAFSPHTDGKG 169
+ WAP N + + GY + GT+ L+S IE L+G + R + +++F H D
Sbjct: 367 EEWAPDSKNGVIVTGYSIEGTMARTLLSEPDHIESLKGGNVPRRLTVKEISFGAHVDYAQ 426
Query: 170 IMDLVKFLSPQHVILVHGEKPKMATLKERIQSELG-----IKCYDPANNESMCI 218
++ + QHV+LVHGE +M L+ ++ I + P N E + +
Sbjct: 427 NSKFIQEIGAQHVVLVHGEASQMGRLRAALRDTYAAKGQEINIHTPKNCEPLTL 480
>sp|Q57626|Y162_METJA Uncharacterized protein MJ0162 OS=Methanocaldococcus jannaschii
(strain ATCC 43067 / DSM 2661 / JAL-1 / JCM 10045 / NBRC
100440) GN=MJ0162 PE=3 SV=1
Length = 421
Score = 109 bits (273), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 71/222 (31%), Positives = 114/222 (51%), Gaps = 6/222 (2%)
Query: 1 VHKCVAGGGKVLIPAFALGRAQELCILLDDYWERMNLR-VPIYFSAGLTIQANMYYKMLI 59
+ + + GGKV+IP FA+GRAQE+ +++++Y LR VPIY L I A Y I
Sbjct: 202 ISETIENGGKVIIPVFAIGRAQEILLIINNYIRSGKLRDVPIYTDGSL-IHATAVYMSYI 260
Query: 60 SWTSQKVKETY-NAFD-FKNVHNFDRSLIDAPGPCVLFATPGMLTGGFSLEVFKHWAPSE 117
+W + K+K N + F + D SL+ PC++ +T GM+ GG L+ K +
Sbjct: 261 NWLNPKIKNMVENRINPFGEIKKADESLVFNKEPCIIVSTSGMVQGGPVLKYLKLLKDPK 320
Query: 118 MNLITLPGYCLAGTIGNKLMSGNPTIELEGTKIDVRCQIHQLAFSPHTDGKGIMDLVKFL 177
LI L GY GT+G +L G I+ KI +R ++ ++ FS H D ++ +K +
Sbjct: 321 NKLI-LTGYQAEGTLGRELEEGAKEIQPFKNKIPIRGKVVKIEFSAHGDYNSLVRYIKKI 379
Query: 178 -SPQHVILVHGEKPKMATLKERIQSELGIKCYDPANNESMCI 218
P+ I++HGE+ + + I L I + P + I
Sbjct: 380 PKPEKAIVMHGERYQSLSFAMTIWKTLKIPTFVPVRGTILPI 421
>sp|Q5SLP1|RNSE_THET8 Ribonuclease TTHA0252 OS=Thermus thermophilus (strain HB8 / ATCC
27634 / DSM 579) GN=TTHA0252 PE=1 SV=1
Length = 431
Score = 94.7 bits (234), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 61/197 (30%), Positives = 100/197 (50%), Gaps = 13/197 (6%)
Query: 3 KCVAGGGKVLIPAFALGRAQELCILLDDYWERMNLRVPIYFSAGLTIQANMYYKMLISWT 62
K ++ GGKVLIP FA+ RAQE+ +L + R+ R PIY + + + Y L+ +
Sbjct: 210 KTLSQGGKVLIPTFAVERAQEILYVLYTHGHRLP-RAPIYLDSPMAGRVLSLYPRLVRYF 268
Query: 63 SQKVKETY----NAFDFKNVHNFD-----RSLIDAPGPCVLFATPGMLTGGFSLEVFKHW 113
S++V+ + N F + + ++L APGP V+ A GML GG L KH
Sbjct: 269 SEEVQAHFLQGKNPFRPAGLEVVEHTEASKALNRAPGPMVVLAGSGMLAGGRILHHLKHG 328
Query: 114 APSEMNLITLPGYCLAGTIGNKLMSGNPTIELEGTKIDVRCQIHQL-AFSPHTDGKGIMD 172
N + GY G +G ++++ P + + G ++ +R +H L FS H ++D
Sbjct: 329 LSDPRNALVFVGYQPQGGLGAEIIARPPAVRILGEEVPLRASVHTLGGFSGHAGQDELLD 388
Query: 173 LVKFLSPQHVILVHGEK 189
++ V+LVHGE+
Sbjct: 389 WLQ--GEPRVVLVHGEE 403
>sp|Q9LKF9|CPSF2_ARATH Cleavage and polyadenylation specificity factor subunit 2
OS=Arabidopsis thaliana GN=CPSF100 PE=1 SV=2
Length = 739
Score = 74.7 bits (182), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 44/149 (29%), Positives = 71/149 (47%), Gaps = 8/149 (5%)
Query: 1 VHKCVAGGGKVLIPAFALGRAQELCILLDDYWERMNLRVPIYFSAGLTIQANMYYKMLIS 60
+ K + GG VL+P GR EL ++L+ +W + PIYF ++ Y K +
Sbjct: 223 ISKHLEVGGNVLLPVDTAGRVLELLLILEQHWSQRGFSFPIYFLTYVSSSTIDYVKSFLE 282
Query: 61 WTSQKVKETY-----NAFDFKNVHNF-DRSLID--APGPCVLFATPGMLTGGFSLEVFKH 112
W S + +++ NAF ++V +++ +D PGP V+ A+ L GF+ E+F
Sbjct: 283 WMSDSISKSFETSRDNAFLLRHVTLLINKTDLDNAPPGPKVVLASMASLEAGFAREIFVE 342
Query: 113 WAPSEMNLITLPGYCLAGTIGNKLMSGNP 141
WA NL+ GT+ L S P
Sbjct: 343 WANDPRNLVLFTETGQFGTLARMLQSAPP 371
Score = 32.7 bits (73), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 16/71 (22%), Positives = 34/71 (47%)
Query: 150 IDVRCQIHQLAFSPHTDGKGIMDLVKFLSPQHVILVHGEKPKMATLKERIQSELGIKCYD 209
+ V C + ++ + +DG+ I ++ +SP ++LVH LK+ + + Y
Sbjct: 533 VTVSCSLVKMDYEGRSDGRSIKSMIAHVSPLKLVLVHAIAEATEHLKQHCLNNICPHVYA 592
Query: 210 PANNESMCIPS 220
P E++ + S
Sbjct: 593 PQIEETVDVTS 603
>sp|Q60355|Y047_METJA Uncharacterized protein MJ0047 OS=Methanocaldococcus jannaschii
(strain ATCC 43067 / DSM 2661 / JAL-1 / JCM 10045 / NBRC
100440) GN=MJ0047 PE=3 SV=2
Length = 428
Score = 66.6 bits (161), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 51/193 (26%), Positives = 96/193 (49%), Gaps = 10/193 (5%)
Query: 8 GGKVLIPAFALGRAQELCILLDDYWERMNLRVPIYFSAGLTIQAN---MYYKMLISWTSQ 64
GG LIP FA+ RAQE+ ++L+DY N+ PIY G+ ++ + YK +++ +SQ
Sbjct: 221 GGVALIPVFAVDRAQEILLILNDY----NIDAPIYLD-GMAVEVTKLMLNYKHMLNESSQ 275
Query: 65 KVKETYNAFDFKNVHNFDRSLID-APGPCVLFATPGMLTGGFSLEVFKHWAPSEMNLITL 123
K N + + +++ + + ++ T GML GG L K + + N + L
Sbjct: 276 LEKALKNVKIIEKSEDRIKAIENLSKNGGIVVTTAGMLDGGPILYYLKLFMHNPKNALLL 335
Query: 124 PGYCLAGTIGNKLMSGNPTIELEGTKIDVRCQIHQLAFSPHTDGKGIMDLVKFLSPQHVI 183
GY + + G L+ I + +I ++ FS H + +++K ++P+ +I
Sbjct: 336 TGYQVRDSNGRHLIETG-KIFIGKDEIKPNLEVCMYNFSCHAGMDELHEIIKKVNPELLI 394
Query: 184 LVHGEKPKMATLK 196
+ HGE+ + L+
Sbjct: 395 IQHGEEVQATILR 407
>sp|Q652P4|CPSF2_ORYSJ Cleavage and polyadenylation specificity factor subunit 2 OS=Oryza
sativa subsp. japonica GN=Os09g0569400 PE=2 SV=1
Length = 738
Score = 65.5 bits (158), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 65/147 (44%), Gaps = 8/147 (5%)
Query: 3 KCVAGGGKVLIPAFALGRAQELCILLDDYWERMNLRVPIYFSAGLTIQANMYYKMLISWT 62
K + GGG VL+P GR E+ ++L+ YW + +L PIYF ++ Y K + W
Sbjct: 224 KVLTGGGSVLLPIDTAGRVLEILLILEQYWAQRHLIYPIYFLTNVSTSTVDYVKSFLEWM 283
Query: 63 SQKVKETY-----NAFDFK---NVHNFDRSLIDAPGPCVLFATPGMLTGGFSLEVFKHWA 114
+ + +++ NAF K + N D P V+ A+ L GFS ++F A
Sbjct: 284 NDSISKSFEHTRDNAFLLKCVTQIINKDELEKLGDAPKVVLASMASLEVGFSHDIFVDMA 343
Query: 115 PSEMNLITLPGYCLAGTIGNKLMSGNP 141
NL+ GT+ L P
Sbjct: 344 NEAKNLVLFTEKGQFGTLARMLQVDPP 370
Score = 36.6 bits (83), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 16/71 (22%), Positives = 33/71 (46%)
Query: 150 IDVRCQIHQLAFSPHTDGKGIMDLVKFLSPQHVILVHGEKPKMATLKERIQSELGIKCYD 209
+ V+C + + F +DG+ + ++ ++P ++LVHG LK + Y
Sbjct: 535 VQVKCSLAYMDFEGRSDGRSVKSVIAHVAPLKLVLVHGSAEATEHLKMHCSKNSDLHVYA 594
Query: 210 PANNESMCIPS 220
P E++ + S
Sbjct: 595 PQIEETIDVTS 605
>sp|O74740|CFT2_SCHPO Cleavage factor two protein 2 OS=Schizosaccharomyces pombe (strain
972 / ATCC 24843) GN=cft2 PE=1 SV=1
Length = 797
Score = 58.2 bits (139), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 43/149 (28%), Positives = 64/149 (42%), Gaps = 12/149 (8%)
Query: 1 VHKCVAGGGKVLIPAFALGRAQELCILLDDYWERMN--LRVPIYFSAGLTIQANMYYKML 58
V + GG VL+P A R ELC +LD++W L PI F + + + Y K +
Sbjct: 215 VMSSLLKGGTVLLPVDAASRVLELCCILDNHWSASQPPLPFPILFLSPTSTKTIDYAKSM 274
Query: 59 ISWTSQKVKETY----NAFDFKNVHNF-DRSLID--APGPCVLFATPGMLTGGFSLEVFK 111
I W + + N +F+N++ D S I PGP V+ AT L GFS +
Sbjct: 275 IEWMGDNIVRDFGINENLLEFRNINTITDFSQISHIGPGPKVILATALTLECGFSQRILL 334
Query: 112 HWAPSEMN---LITLPGYCLAGTIGNKLM 137
N L T C ++ N+ +
Sbjct: 335 DLMSENSNDLILFTQRSRCPQNSLANQFI 363
>sp|O35218|CPSF2_MOUSE Cleavage and polyadenylation specificity factor subunit 2 OS=Mus
musculus GN=Cpsf2 PE=1 SV=1
Length = 782
Score = 57.0 bits (136), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 49/167 (29%), Positives = 76/167 (45%), Gaps = 15/167 (8%)
Query: 1 VHKCVAGGGKVLIPAFALGRAQELCILLDDYWERMNLRVPIYFSAGL-TIQANM--YYKM 57
V + + G G VLI GR EL LLD W + + +Y A L + N+ + K
Sbjct: 223 VLETLRGDGNVLIAVDTAGRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKS 282
Query: 58 LISWTSQKVKETY-----NAFDFKNV---HNFDRSLIDAPGPCVLFATPGMLTGGFSLEV 109
+ W S K+ + N F F+++ H L P P V+ A+ L GFS ++
Sbjct: 283 QVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGL-SDLARVPSPKVVLASQPDLECGFSRDL 341
Query: 110 FKHWAPSEMNLITLPGYCLAGTIGNKLMSGNPTIELEGTKIDVRCQI 156
F W N I L GT+ L+ NPT ++ T+I++R ++
Sbjct: 342 FIQWCQDPKNSIILTYRTTPGTLARFLID-NPTEKV--TEIELRKRV 385
Score = 36.2 bits (82), Expect = 0.49, Method: Compositional matrix adjust.
Identities = 27/124 (21%), Positives = 52/124 (41%), Gaps = 19/124 (15%)
Query: 115 PSEMNLITLPGYCLAGTIGNKLMSGNPTIELEGTKIDVRCQIHQLAFSPHTDGKGIMDLV 174
P + +L +P C++ T I+++ ++ + + +DG I ++
Sbjct: 510 PMDQDLSDVPTKCVSAT----------------ESIEIKARVTYIDYEGRSDGDSIKKII 553
Query: 175 KFLSPQHVILVHGEKPKMATLKE--RIQSELGIKCYDPANNESMCIPS-THYVKAGASDA 231
+ P+ +I+VHG L E R IK Y P +E++ S TH + D+
Sbjct: 554 NQMKPRQLIIVHGPPEASQDLAECCRAFGGKDIKVYMPKLHETVDATSETHIYQVRLKDS 613
Query: 232 FIRS 235
+ S
Sbjct: 614 LVSS 617
>sp|Q9P2I0|CPSF2_HUMAN Cleavage and polyadenylation specificity factor subunit 2 OS=Homo
sapiens GN=CPSF2 PE=1 SV=2
Length = 782
Score = 55.8 bits (133), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 48/167 (28%), Positives = 76/167 (45%), Gaps = 15/167 (8%)
Query: 1 VHKCVAGGGKVLIPAFALGRAQELCILLDDYWERMNLRVPIYFSAGL-TIQANM--YYKM 57
V + + G G VLI GR EL LLD W + + +Y A L + N+ + K
Sbjct: 223 VLETLRGDGNVLIAVDTAGRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKS 282
Query: 58 LISWTSQKVKETY-----NAFDFKNV---HNFDRSLIDAPGPCVLFATPGMLTGGFSLEV 109
+ W S K+ + N F F+++ H L P P V+ A+ L GFS ++
Sbjct: 283 QVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGL-SDLARVPSPKVVLASQPDLECGFSRDL 341
Query: 110 FKHWAPSEMNLITLPGYCLAGTIGNKLMSGNPTIELEGTKIDVRCQI 156
F W N I L GT+ L+ NP+ ++ T+I++R ++
Sbjct: 342 FIQWCQDPKNSIILTYRTTPGTLARFLID-NPSEKI--TEIELRKRV 385
Score = 35.4 bits (80), Expect = 0.79, Method: Compositional matrix adjust.
Identities = 27/124 (21%), Positives = 52/124 (41%), Gaps = 19/124 (15%)
Query: 115 PSEMNLITLPGYCLAGTIGNKLMSGNPTIELEGTKIDVRCQIHQLAFSPHTDGKGIMDLV 174
P + +L +P C++ T I+++ ++ + + +DG I ++
Sbjct: 510 PMDQDLSDVPTKCISTT----------------ESIEIKARVTYIDYEGRSDGDSIKKII 553
Query: 175 KFLSPQHVILVHGEKPKMATLKE--RIQSELGIKCYDPANNESMCIPS-THYVKAGASDA 231
+ P+ +I+VHG L E R IK Y P +E++ S TH + D+
Sbjct: 554 NQMKPRQLIIVHGPPEASQDLAECCRAFGGKDIKVYMPKLHETVDATSETHIYQVRLKDS 613
Query: 232 FIRS 235
+ S
Sbjct: 614 LVSS 617
>sp|Q10568|CPSF2_BOVIN Cleavage and polyadenylation specificity factor subunit 2 OS=Bos
taurus GN=CPSF2 PE=1 SV=1
Length = 782
Score = 55.5 bits (132), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 48/167 (28%), Positives = 76/167 (45%), Gaps = 15/167 (8%)
Query: 1 VHKCVAGGGKVLIPAFALGRAQELCILLDDYWERMNLRVPIYFSAGL-TIQANM--YYKM 57
V + + G G VLI GR EL LLD W + + +Y A L + N+ + K
Sbjct: 223 VLETLRGDGNVLIAVDTAGRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKS 282
Query: 58 LISWTSQKVKETY-----NAFDFKNV---HNFDRSLIDAPGPCVLFATPGMLTGGFSLEV 109
+ W S K+ + N F F+++ H L P P V+ A+ L GFS ++
Sbjct: 283 QVEWMSDKLMRCFEDKRNNPFQFRHLSLCHGL-SDLARVPSPKVVLASQPDLECGFSRDL 341
Query: 110 FKHWAPSEMNLITLPGYCLAGTIGNKLMSGNPTIELEGTKIDVRCQI 156
F W N I L GT+ L+ NP+ ++ T+I++R ++
Sbjct: 342 FIQWCQDPKNSIILTYRTTPGTLARFLID-NPSEKV--TEIELRKRV 385
Score = 35.8 bits (81), Expect = 0.72, Method: Compositional matrix adjust.
Identities = 27/124 (21%), Positives = 52/124 (41%), Gaps = 19/124 (15%)
Query: 115 PSEMNLITLPGYCLAGTIGNKLMSGNPTIELEGTKIDVRCQIHQLAFSPHTDGKGIMDLV 174
P + +L +P C++ T I+++ ++ + + +DG I ++
Sbjct: 510 PMDQDLSDVPTKCISTT----------------ESIEIKARVTYIDYEGRSDGDSIKKII 553
Query: 175 KFLSPQHVILVHGEKPKMATLKE--RIQSELGIKCYDPANNESMCIPS-THYVKAGASDA 231
+ P+ +I+VHG L E R IK Y P +E++ S TH + D+
Sbjct: 554 NQMKPRQLIIVHGPPEASQDLAECCRAFGGKDIKVYMPKLHETVDATSETHIYQVRLKDS 613
Query: 232 FIRS 235
+ S
Sbjct: 614 LVSS 617
>sp|Q9V3D6|CPSF2_DROME Probable cleavage and polyadenylation specificity factor subunit 2
OS=Drosophila melanogaster GN=Cpsf100 PE=1 SV=1
Length = 756
Score = 54.7 bits (130), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 48/170 (28%), Positives = 75/170 (44%), Gaps = 23/170 (13%)
Query: 3 KCVAGGGKVLIPAFALGRAQELCILLDDYWERMNLRVPIYFSAGLTIQANMYYKML---- 58
+ V G VLI GR EL +LD W+ + Y L + N+ Y ++
Sbjct: 225 QTVRNNGNVLIAVDTAGRVLELAHMLDQLWKNKESGLMAY---SLALLNNVSYNVIEFAK 281
Query: 59 --ISWTSQKVKETY-----NAFDFKNVHNFDRSLIDA----PGPCVLFATPGMLTGGFSL 107
I W S K+ + + N F FK++ SL D GP V+ A+ L GF+
Sbjct: 282 SQIEWMSDKLTKAFEGARNNPFQFKHIQ-LCHSLADVYKLPAGPKVVLASTPDLESGFTR 340
Query: 108 EVFKHWAPSEMNLITLPGYCLAGTIGNKLMSG-NPTIELEGTKIDVRCQI 156
++F WA + N I L GT+ +L+ P ++E +DVR ++
Sbjct: 341 DLFVQWASNANNSIILTTRTSPGTLAMELVENCAPGKQIE---LDVRRRV 387
Score = 41.6 bits (96), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 21/90 (23%), Positives = 43/90 (47%), Gaps = 7/90 (7%)
Query: 135 KLMSGNPTIELEGTKIDVRCQIHQLAFSPHTDGKGIMDLVKFLSPQHVILVHGEKPKMAT 194
KL+S TIE V Q+ ++ F +DG+ ++ ++ L P+ VI++HG
Sbjct: 532 KLISQRKTIE-------VNAQVQRIDFEGRSDGESMLKILSQLRPRRVIVIHGTAEGTQV 584
Query: 195 LKERIQSELGIKCYDPANNESMCIPSTHYV 224
+ + +G + + P E + + S ++
Sbjct: 585 VARHCEQNVGARVFTPQKGEIIDVTSEIHI 614
>sp|Q9W799|CPSF2_XENLA Cleavage and polyadenylation specificity factor subunit 2
OS=Xenopus laevis GN=cpsf2 PE=1 SV=1
Length = 783
Score = 54.3 bits (129), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 43/149 (28%), Positives = 64/149 (42%), Gaps = 12/149 (8%)
Query: 1 VHKCVAGGGKVLIPAFALGRAQELCILLDDYWERMNLRVPIYFSAGL-TIQANM--YYKM 57
V + + G G VLI GR EL LLD W + + +Y A L + N+ + K
Sbjct: 223 VLETLRGDGNVLIAVDTAGRVLELAQLLDQIWRTKDAGLGVYSLALLNNVSYNVVEFSKS 282
Query: 58 LISWTSQKVKETY-----NAFDFKNV---HNFDRSLIDAPGPCVLFATPGMLTGGFSLEV 109
+ W S K+ + N F F+++ H + L P P V+ A+ L GFS E+
Sbjct: 283 QVEWMSDKLMRCFEDKRNNPFQFRHLTLCHGY-SDLARVPSPKVVLASQPDLECGFSREL 341
Query: 110 FKHWAPSEMNLITLPGYCLAGTIGNKLMS 138
F W N + L GT+ L+
Sbjct: 342 FIQWCQDPKNSVILTYRTTPGTLARFLID 370
Score = 36.2 bits (82), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 26/124 (20%), Positives = 54/124 (43%), Gaps = 19/124 (15%)
Query: 115 PSEMNLITLPGYCLAGTIGNKLMSGNPTIELEGTKIDVRCQIHQLAFSPHTDGKGIMDLV 174
P + +L +P C++ T ++++ ++ + + +DG I ++
Sbjct: 510 PMDQDLSDVPTKCVSTT----------------ESMEIKARVTYIDYEGRSDGDSIKKII 553
Query: 175 KFLSPQHVILVHGEKPKMATLKERIQSELG--IKCYDPANNESMCIPS-THYVKAGASDA 231
+ P+ +I+VHG L E ++ G IK Y P +E++ S TH + D+
Sbjct: 554 NQMKPRQLIIVHGPPDATQDLAEACRAFGGKDIKVYTPKLHETVDATSETHIYQVRLKDS 613
Query: 232 FIRS 235
+ S
Sbjct: 614 LVSS 617
>sp|A8XUS3|CPSF2_CAEBR Probable cleavage and polyadenylation specificity factor subunit 2
OS=Caenorhabditis briggsae GN=cpsf-2 PE=3 SV=2
Length = 842
Score = 44.7 bits (104), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 36/146 (24%), Positives = 59/146 (40%), Gaps = 11/146 (7%)
Query: 3 KCVAGGGKVLIPAFALGRAQELCILLDDYWERMNLRVPIYFSAGLTIQAN---MYYKMLI 59
+ V G +I GR EL LLD W + + Y ++ A+ + K +
Sbjct: 224 RTVRQKGDCMIVIDTAGRVLELAYLLDQLWANQDAGLSTYNLVMMSHVASSVVQFAKSQL 283
Query: 60 SWTSQKV------KETYNAFDFKNVH--NFDRSLIDAPGPCVLFATPGMLTGGFSLEVFK 111
W +K+ YN F KNV+ + LI P V+ + + GFS E+F
Sbjct: 284 EWMDEKLFRYDSSSARYNPFTLKNVNLVHSHLELIKIRSPKVVLCSSQDMETGFSRELFL 343
Query: 112 HWAPSEMNLITLPGYCLAGTIGNKLM 137
W + N + L + T+ +L+
Sbjct: 344 DWCADQRNGVILTARPASFTLAARLV 369
>sp|Q55BS1|CPSF2_DICDI Cleavage and polyadenylation specificity factor subunit 2
OS=Dictyostelium discoideum GN=cpsf2 PE=3 SV=1
Length = 784
Score = 43.9 bits (102), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 41/163 (25%), Positives = 69/163 (42%), Gaps = 12/163 (7%)
Query: 1 VHKCVAGGGKVLIPAFALGRAQELCILLDDYWERMNLRVPIY-------FSAGLTIQANM 53
+++ + GG VLIP GR EL + +++YW + N + +Y FS + A
Sbjct: 227 INRNLRDGGNVLIPVDTAGRVLELLLCIENYWSK-NKSLALYSVVFLGRFSFSVCQFARS 285
Query: 54 YYKMLISWTSQKVKETY-NAFDFKNVHNFD--RSLIDAPGPC-VLFATPGMLTGGFSLEV 109
+ + S S K ++ N F FK++ L + P V+ + L GFS E+
Sbjct: 286 QLEFMSSTASVKFEQNIENPFSFKHIKILSSLEELQELPDTNKVILTSSQDLETGFSREL 345
Query: 110 FKHWAPSEMNLITLPGYCLAGTIGNKLMSGNPTIELEGTKIDV 152
F W LI ++ +KL+ T G I++
Sbjct: 346 FIQWCSDPKTLILFTQKIPKDSLADKLIKQYSTPNGRGKCIEI 388
Score = 32.3 bits (72), Expect = 6.5, Method: Compositional matrix adjust.
Identities = 13/73 (17%), Positives = 40/73 (54%), Gaps = 1/73 (1%)
Query: 149 KIDVRCQIHQLAFSPHTDGKGIMDLVKFLSPQHVILVHGEKPKMATLKERIQSELGIK-C 207
++ + C+I + + +DG+ I +++ ++P ++L+ G + + +++ ++ + K
Sbjct: 550 RLPINCKIQTIDYEGCSDGRSIKAIIQQIAPTKLVLIRGSEQQSQSIENYVKENIRTKGI 609
Query: 208 YDPANNESMCIPS 220
Y P+ E + + S
Sbjct: 610 YIPSIGEQLDLTS 622
>sp|Q55470|Y514_SYNY3 Uncharacterized protein sll0514 OS=Synechocystis sp. (strain PCC
6803 / Kazusa) GN=sll0514 PE=4 SV=1
Length = 554
Score = 42.7 bits (99), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 50/229 (21%), Positives = 87/229 (37%), Gaps = 25/229 (10%)
Query: 1 VHKCVAGGGKVLIPAFALGRAQELCILLDDYWERMNLRVPIYFSAGLTIQANMYYKML-- 58
+ +A G +L+P LG AQE+ LL + + +V ++ + + Y ++
Sbjct: 210 IETVLAKGRNILLPVPPLGLAQEILKLLRTHHQFTGRQVNLWAGESVARGCDAYQGIIDH 269
Query: 59 ISWTSQKVKETYNAFDFKNVHNFDRSLIDAPG------PCVLFAT--PGMLTGGFSLEVF 110
+ + + F V+ R L D G P ++ T P +L
Sbjct: 270 LPDNVRNFAQHQPLFWDDKVYPHLRPLTDDQGELSLSAPSIVITTTWPAFWPSPAALPGL 329
Query: 111 KHWAPSEMNLITLPGYCLAGTIGNKLMSGNPTIELEGTKIDVRCQIHQLAFSPHTDGKGI 170
W L+TLP CL L P ELE + H+DG+
Sbjct: 330 --WTVFMPQLLTLPS-CLVNFAWQDLEE-FPKYELE-----------DYLLADHSDGRNT 374
Query: 171 MDLVKFLSPQHVILVHGEKPKMATLKERIQSELGIKCYDPANNESMCIP 219
L+ L PQH++ VHG+ + L + + + + PA ++ +P
Sbjct: 375 TQLIHNLRPQHLVFVHGQPSDIEDLTSLEELQSRYQLHSPAAGNAVALP 423
>sp|O17403|CPSF2_CAEEL Probable cleavage and polyadenylation specificity factor subunit 2
OS=Caenorhabditis elegans GN=cpsf-2 PE=3 SV=1
Length = 843
Score = 40.8 bits (94), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 36/148 (24%), Positives = 61/148 (41%), Gaps = 13/148 (8%)
Query: 3 KCVAGGGKVLIPAFALGRAQELCILLDDYWERMNLRVPIYFSAGLTIQAN---MYYKMLI 59
+ V G +I GR EL LLD W + + Y ++ A+ + K +
Sbjct: 224 RTVRQKGDCMIVIDTAGRVLELAHLLDQLWSNADAGLSTYNLVMMSHVASSVVQFAKSQL 283
Query: 60 SWTSQKV------KETYNAFDFKNV---HNFDRSLIDAPGPCVLFATPGMLTGGFSLEVF 110
W ++K+ YN F K+V H+ + L+ P V+ + + GFS E+F
Sbjct: 284 EWMNEKLFKYDSSSARYNPFTLKHVTLCHSH-QELMRVRSPKVVLCSSQDMESGFSRELF 342
Query: 111 KHWAPSEMNLITLPGYCLAGTIGNKLMS 138
W N + L + T+ KL++
Sbjct: 343 LDWCSDPRNGVILTARPASFTLAAKLVN 370
>sp|Q54SH0|INT9_DICDI Integrator complex subunit 9 homolog OS=Dictyostelium discoideum
GN=ints9 PE=3 SV=1
Length = 712
Score = 38.9 bits (89), Expect = 0.084, Method: Compositional matrix adjust.
Identities = 47/197 (23%), Positives = 77/197 (39%), Gaps = 40/197 (20%)
Query: 8 GGKVLIPAFALGRAQELCILLDDYWERMNL-RVPIYFSAGLTIQANMYYKMLISWTSQKV 66
GG VLIP+++ G +L L DY ++ L VPIYF + ++ Y + W ++
Sbjct: 329 GGTVLIPSYSCGIILDLFEHLADYLNKVGLPYVPIYFVSSVSKAVLSYADIYSEWLNKSK 388
Query: 67 KETYNAF------------------DFKNVHNFDRSLIDAPGPCVLF-ATPGMLTGGFSL 107
+E AF +++VH S A PC++F P G +
Sbjct: 389 QE--RAFMPETPFLHQDLMRKGQFQAYQHVH----SNFQANDPCIIFTGHPSCRIGDITT 442
Query: 108 EVFKHWAPSEMNLITLPGYCLAGTIGNKLMSGNPTIELEGTKIDVRCQIHQLAFSPHTDG 167
+ + P L+ P + T+ L +K R Q L P +
Sbjct: 443 LIKLYDNPKNSILLIEPDFDFKSTV------------LPFSKQISRIQF--LPIDPRINF 488
Query: 168 KGIMDLVKFLSPQHVIL 184
L+ LSP+H+I+
Sbjct: 489 NEANLLISKLSPKHLII 505
>sp|Q83887|GP_NYV Envelope glycoprotein OS=New York virus GN=GP PE=1 SV=1
Length = 1140
Score = 34.3 bits (77), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 14/46 (30%), Positives = 25/46 (54%)
Query: 192 MATLKERIQSELGIKCYDPANNESMCIPSTHYVKAGASDAFIRSCM 237
+ T + +S + CYD A N++ C+P+ H + + +RSCM
Sbjct: 117 VTTFEAAYKSRKTVICYDLACNQTHCLPTVHLIAPVQTCMSVRSCM 162
>sp|A7SBF0|INT9_NEMVE Integrator complex subunit 9 homolog OS=Nematostella vectensis
GN=ints9 PE=3 SV=1
Length = 660
Score = 33.9 bits (76), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 36/133 (27%), Positives = 50/133 (37%), Gaps = 21/133 (15%)
Query: 7 GGGKVLIPAFALGRAQELCILLDDYWERMNL-RVPIYFSAGLTIQANMYYKMLISWTSQ- 64
GG VL+P + G +L L Y + L VPIYF + + + Y + W Q
Sbjct: 295 AGGNVLVPCYPSGVLYDLFECLYTYLDNAKLGMVPIYFISPVADSSLAYSNIYGEWLCQS 354
Query: 65 ---KVKETYNAFD------------FKNVHN-FDRSLIDAPGPCVLFATPGMLTGGFSLE 108
KV F F N+HN F S PCV+F L G ++
Sbjct: 355 KQTKVYLPEPPFPHAELLKEARLKVFSNLHNGFSSSF---KTPCVVFTGHPSLRYGDAVH 411
Query: 109 VFKHWAPSEMNLI 121
+ W S N +
Sbjct: 412 FMEIWGKSGNNTV 424
>sp|B1ZZL9|DNLI_OPITP Probable DNA ligase OS=Opitutus terrae (strain DSM 11246 / PB90-1)
GN=lig PE=3 SV=1
Length = 1017
Score = 33.5 bits (75), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 25/90 (27%), Positives = 40/90 (44%), Gaps = 13/90 (14%)
Query: 161 FSPHTDGKGIMDLVKFLSPQHVILVHGEKPKMA-TLKER----------IQSELGIKCYD 209
S H D + ++ V+ ++PQ V+ +HG + A TL+ER Q ELGI+
Sbjct: 272 LSDHADYQDLLRFVEAVNPQRVLTLHGFATEFAQTLRERGLEAWAIGEDNQLELGIRSSP 331
Query: 210 PANNESMCIPSTHYVKAGASDAFIRSCMNP 239
P PS+ +G + +C P
Sbjct: 332 PQVRAGDATPSSR--SSGDAGVAAPACATP 359
>sp|Q8K114|INT9_MOUSE Integrator complex subunit 9 OS=Mus musculus GN=Ints9 PE=2 SV=1
Length = 658
Score = 33.1 bits (74), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 30/135 (22%), Positives = 52/135 (38%), Gaps = 21/135 (15%)
Query: 5 VAGGGKVLIPAFALGRAQELCILLDDYWERMNL-RVPIYFSAGLTIQANMYYKMLISWTS 63
V GG VL+P + G +L L Y + L +P YF + + + + ++ W
Sbjct: 289 VRNGGNVLVPCYPSGVIYDLLECLYQYIDSAGLSNIPFYFISPVANSSLEFSQIFAEWLC 348
Query: 64 Q----------------KVKETYNAFDFKNVH-NFDRSLIDAPGPCVLFATPGMLTGGFS 106
++ +T ++++H +F D PCVLF L G
Sbjct: 349 HNKQSKVYLPEPPFPHAELIQTNKLKHYRSIHGDFSN---DFRQPCVLFTGHPSLRFGDV 405
Query: 107 LEVFKHWAPSEMNLI 121
+ + W S +N I
Sbjct: 406 VHFMELWGKSSLNTI 420
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.320 0.136 0.414
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 162,358,790
Number of Sequences: 539616
Number of extensions: 6870656
Number of successful extensions: 13581
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 51
Number of HSP's successfully gapped in prelim test: 14
Number of HSP's that attempted gapping in prelim test: 13426
Number of HSP's gapped (non-prelim): 81
length of query: 423
length of database: 191,569,459
effective HSP length: 120
effective length of query: 303
effective length of database: 126,815,539
effective search space: 38425108317
effective search space used: 38425108317
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 63 (28.9 bits)