RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy8075
(1074 letters)
>gnl|CDD|216919 pfam02181, FH2, Formin Homology 2 Domain.
Length = 372
Score = 342 bits (880), Expect = e-108
Identities = 151/381 (39%), Positives = 220/381 (57%), Gaps = 21/381 (5%)
Query: 601 PPRSKLK----SRIPRPYFPNVSVQVNGTIWTELDESKLYASMELEVVDKLFCAYQKNGV 656
P+ KLK ++ Q GT+W ELDE ++L +++LF A K
Sbjct: 6 KPKKKLKPLHWDKVNPA-------QDRGTVWDELDEESFEKDLDLSELEELFSAKAKKKK 58
Query: 657 PADGSVEDLTKLGKISKSKILSVIDGRRAQNCTILLSKLKMTDEEISKAILSMDSADQLP 716
S + K K K +SV+D +R+QN ILL KLK+ EEI +AIL MD L
Sbjct: 59 ----SKKSEKKSSSKKKKKEISVLDPKRSQNIAILLRKLKLPPEEIVQAILEMD-ESVLG 113
Query: 717 LDMLEQLLKFTPNTEEAALLEEHSTE--TLARADRFLYEISKIPHYEQRLRCLHYKKRFH 774
L++LE LLK P EE L+E+ + L RA++FL E+SKIP E+RL L +K F
Sbjct: 114 LELLENLLKMAPTKEELKKLKEYKGDPSKLGRAEQFLLELSKIPRLEERLEALLFKSTFE 173
Query: 775 FWISDIEPKIKSVMEASREVTRSRRLRKLLELVLALGNYMNKG-ARGNASGFRLTSLNKL 833
+ +++P ++++ AS E+ SR+ +KLLEL+LALGNYMN G RGNA GF+L+SL KL
Sbjct: 174 EEVEELKPSLETLEAASEELRESRKFKKLLELILALGNYMNSGTRRGNAKGFKLSSLLKL 233
Query: 834 VDTKSSSSRGVTLLHYLVELAEKRFKEVLKLEEDLCHVKEASKVSMKELENDMTQLRNNL 893
DTKS+ ++ TLLHYLV++ ++ ++L +L HV++A+KV +++LE D+ +L L
Sbjct: 234 SDTKSTDNK-TTLLHYLVKIIREKLPDLLDFSSELSHVEKAAKVDLEQLEKDVKELEKGL 292
Query: 894 KEVSREIEFQRVQPLVPGDRFLPVMKEFLTSATCRLSEIEDLFQDMKTRFDRAVRLFGED 953
K++ RE+E P D+F+ MKEFL A +L ++E L ++ F FGED
Sbjct: 293 KKLERELELSA-LDEHPDDKFVEKMKEFLEEAEEKLDKLESLLKEAMELFKELTEYFGED 351
Query: 954 NSSIQPEDFFVIFDTFLTTLN 974
PE+FF I FL
Sbjct: 352 PKETSPEEFFKILRDFLRMFK 372
>gnl|CDD|214697 smart00498, FH2, Formin Homology 2 Domain. FH proteins control
rearrangements of the actin cytoskeleton, especially in
the context of cytokinesis and cell polarisation. Members
of this family have been found to interact with
Rho-GTPases, profilin and other actin-assoziated
proteins. These interactions are mediated by the
proline-rich FH1 domain, usually located in front of FH2
(but not listed in SMART). Despite this cytosolic
function, vertebrate formins have been assigned functions
within the nucleus. A set of Formin-Binding Proteins
(FBPs) has been shown to bind FH1 with their WW domain.
Length = 392
Score = 289 bits (742), Expect = 2e-88
Identities = 146/451 (32%), Positives = 225/451 (49%), Gaps = 69/451 (15%)
Query: 592 PRPLVRCRQPPRSKLKSRIPRPYFPNVSVQVNGTIWTELDESKLYASMELEVVDKLFCAY 651
P+P + + KL GT+W ++DE +L+ +++LF +
Sbjct: 4 PKPKKKLKPLHWDKLNPSDLS-----------GTVWDKIDEES---EGDLDELEELF-SA 48
Query: 652 QKNGVPADGSVEDLTKLGKISKSKILSVIDGRRAQNCTILLSKLKMTDEEISKAILSMDS 711
++ A V + + K S+ ++D +R+QN ILL KL M+ EEI +AIL D
Sbjct: 49 KEKTKSASKDVSEKKSILKKKASQEFKILDPKRSQNLAILLRKLHMSYEEIKEAILEGDE 108
Query: 712 ADQLPLDMLEQLLKFTPNTEEAALLEEHSTET---LARADRFLYEISKIPHYEQRLRCLH 768
D L +D+LEQLLK+ P EE L E+ E LARA++FL IS IP+ E+RL L
Sbjct: 109 -DVLSVDLLEQLLKYAPTKEELKKLREYKEEDPEELARAEQFLLLISNIPYLEERLNALL 167
Query: 769 YKKRFHFWISDIEPKIKSVMEASREVTRSRRLRKLLELVLALGNYMNKG-ARGNASGFRL 827
+K F + D++P+I+ V A E+ S++ RKLLEL+LA+GNYMN G RG A GF+L
Sbjct: 168 FKANFEEEVEDLKPQIEKVEAACEELRESKKFRKLLELILAIGNYMNGGSRRGQAYGFKL 227
Query: 828 TSLNKLVDTKSSSSRGVTLLHYLVELAEKRFKEVLKLEEDLCHVKEASKVSMKELENDMT 887
+SL KL D KS+ ++ TLLH+LV++ K++ L E+L
Sbjct: 228 SSLLKLSDVKSADNK-TTLLHFLVKIIRKKYLGGLSDPENLD------------------ 268
Query: 888 QLRNNLKEVSREIEFQRVQPLVPGDRFLPVMKEFLTSATCRLSEIEDLFQDMKTRFDRAV 947
D+F+ VMK FL +A + +++ D+KTRF++ V
Sbjct: 269 ------------------------DKFIEVMKPFLKAAKEKYDKLQKDLSDLKTRFEKLV 304
Query: 948 RLFGEDNSSIQPEDFFVIFDTFLTTLNEAKLDNENMKRRREEEEKRARQDAEMKKRT--- 1004
+GED PE+FF F+ FL ++A EN+K + EEEE+R ++ +
Sbjct: 305 EYYGEDPKDTSPEEFFKDFNEFLKEFSKAA--EENIK-KEEEEEERRKKLVKETTEYEQS 361
Query: 1005 TDRKQGAKENGVVISNGSVKGEFDDLISALR 1035
+ R++ + G D L+ L
Sbjct: 362 SSRQKERNPSMDFEVERDFLGVLDSLLEELG 392
>gnl|CDD|219000 pfam06367, Drf_FH3, Diaphanous FH3 Domain. This region is found in
the Formin-like and and diaphanous proteins.
Length = 197
Score = 185 bits (472), Expect = 1e-53
Identities = 72/182 (39%), Positives = 103/182 (56%), Gaps = 6/182 (3%)
Query: 230 GGHKKVLDAMSHYQRFCNERSRFQGIVNDLDKSTGIYRTDVSIKTALMSFLNAVLSYGPG 289
GGH+KVL+A +++ C ER RF+ +V LD S +V K A M F+NA+++
Sbjct: 1 GGHEKVLEATLNFKEVCRERGRFRSLVGALDSS---ENDNVEYKVATMQFINALVNSPE- 56
Query: 290 QHSLEFRLHLRYEVIMLGIQPIIDKLRTHENETLDRHLDFFEMLRHEDESELARKYETDH 349
L FRLHLR E LG+ I+DKLR ENE LD L FE R ED EL +++ +
Sbjct: 57 --DLNFRLHLRSEFTRLGLDRILDKLRELENEKLDDQLQIFEENREEDVEELLERFDDVN 114
Query: 350 IETKSATSMFNILRRKLGHTAAYPHFLSLLHHCILLPLEFGCHPQHWLLLDRLAQQLTLQ 409
++ +F +L+ K+ T A PHFLS+L H +L+ + PQ+W LL+ L Q+ L
Sbjct: 115 VDLDDPVELFELLKNKVKDTEAEPHFLSILQHLLLIRDDEEELPQYWKLLEELVSQIVLH 174
Query: 410 TE 411
Sbjct: 175 RT 176
>gnl|CDD|219001 pfam06371, Drf_GBD, Diaphanous GTPase-binding Domain. This domain
is bound to by GTP-attached Rho proteins, leading to
activation of the Drf protein.
Length = 187
Score = 164 bits (417), Expect = 2e-46
Identities = 69/200 (34%), Positives = 100/200 (50%), Gaps = 26/200 (13%)
Query: 41 LPIPPTNELNTMFEELVEELDLTAPNKAAMLSLPDEKKWQIYCSRK---------MDQTS 91
L P +E+ +FEEL+EE++L + M + P EKKWQ+ S
Sbjct: 1 LQKPDDDEILKLFEELMEEMNLPEEKRRPMRAKPIEKKWQLVVQYLLTNFQGESLSKSLS 60
Query: 92 TTDAPDNPEQYIEQLRNFKSSPLTEHDDTEIKSRTRILDSLRTALRTQPHSFVLRFIECE 151
++ +PE Y+++L++ + L+SLR ALRTQP S+V RFI E
Sbjct: 61 KSNETGSPEYYVKKLKDGSID-------------QKCLESLRVALRTQPVSWVKRFIGAE 107
Query: 152 GLLTLLEYLKNLD----HYSGQTSLHSSIIGCIKALMNNSTGRSHVLGYPGAIDIIAQSL 207
GL LL L + I+ C+KALMNN G HVLG+P I ++A+SL
Sbjct: 108 GLSALLNILSKIVRKKSESDEDLDREYEILKCLKALMNNKFGIDHVLGHPEVILLLARSL 167
Query: 208 STENIKTKIAVLEILGAVCL 227
+ KT+ LE+L A+CL
Sbjct: 168 DSPRPKTRKLALELLTALCL 187
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 53.2 bits (128), Expect = 8e-07
Identities = 30/131 (22%), Positives = 64/131 (48%), Gaps = 4/131 (3%)
Query: 401 RLAQQLTLQTETGEDYDHILLKINVKKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLA 460
+L +L E E+ + L ++ + L EEE+ + ++ +L+ E +L +
Sbjct: 734 QLQSRLEELEEELEELEEELEELQER----LEELEEELESLEEALAKLKEEIEELEEKRQ 789
Query: 461 KKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQKVAELEDRISELKHRADFEQD 520
++EL++ +E E+ + L ++ LE +Q++ ELE+ I EL+ + D ++
Sbjct: 790 ALQEELEELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEE 849
Query: 521 ERKRLENLINS 531
E + LE +
Sbjct: 850 ELEELEKELEE 860
Score = 51.6 bits (124), Expect = 2e-06
Identities = 31/102 (30%), Positives = 55/102 (53%)
Query: 430 HLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEK 489
L EEE+ A+++ D LE E L R + EQE+++ +E E+L+E L +++ LE+
Sbjct: 794 ELEELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEE 853
Query: 490 ETAGHLEAKQKVAELEDRISELKHRADFEQDERKRLENLINS 531
E K+++ ELE EL+ ++E++ LE +
Sbjct: 854 LEKELEELKEELEELEAEKEELEDELKELEEEKEELEEELRE 895
Score = 46.6 bits (111), Expect = 7e-05
Identities = 27/99 (27%), Positives = 55/99 (55%)
Query: 433 AKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETA 492
A EEE+ + + +ELE E +L L + ++ L++ +E E L+E+L+++K+ +E+
Sbjct: 727 ALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEE 786
Query: 493 GHLEAKQKVAELEDRISELKHRADFEQDERKRLENLINS 531
++++ ELE+ + E + R D + E + LE
Sbjct: 787 KRQALQEELEELEEELEEAERRLDALERELESLEQRRER 825
Score = 46.2 bits (110), Expect = 9e-05
Identities = 31/140 (22%), Positives = 66/140 (47%), Gaps = 7/140 (5%)
Query: 397 LLLDRLAQQ---LTLQTETGEDYDHILLKINVK----KVVHLLAKEEEIVAAKKKADELE 449
LL+ L +Q L Q E E Y + ++ + L +E+ +++ LE
Sbjct: 193 DLLEELEKQLEKLERQAEKAERYQELKAELRELELALLLAKLKELRKELEELEEELSRLE 252
Query: 450 MENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQKVAELEDRIS 509
E +L L + E+E+++ E E+L E L +++ L + E + +++ L +R+
Sbjct: 253 EELEELQEELEEAEKEIEELKSELEELREELEELQEELLELKEEIEELEGEISLLRERLE 312
Query: 510 ELKHRADFEQDERKRLENLI 529
EL++ + ++ + L+ I
Sbjct: 313 ELENELEELEERLEELKEKI 332
Score = 46.2 bits (110), Expect = 9e-05
Identities = 30/98 (30%), Positives = 57/98 (58%)
Query: 433 AKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETA 492
EEEI ++K DELE E +L L + ++EL++ EKE+L++ L +++ E+
Sbjct: 832 ELEEEIEELEEKLDELEEELEELEKELEELKEELEELEAEKEELEDELKELEEEKEELEE 891
Query: 493 GHLEAKQKVAELEDRISELKHRADFEQDERKRLENLIN 530
E + ++AEL++ I +L+ R + + + +RLE +
Sbjct: 892 ELRELESELAELKEEIEKLRERLEELEAKLERLEVELP 929
Score = 45.1 bits (107), Expect = 2e-04
Identities = 28/100 (28%), Positives = 58/100 (58%)
Query: 430 HLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEK 489
L +EE+ A+K+ +EL+ E +L L + ++EL + +E E+L+ +S +++RLE+
Sbjct: 254 ELEELQEELEEAEKEIEELKSELEELREELEELQEELLELKEEIEELEGEISLLRERLEE 313
Query: 490 ETAGHLEAKQKVAELEDRISELKHRADFEQDERKRLENLI 529
E ++++ EL+++I LK + + + LE L+
Sbjct: 314 LENELEELEERLEELKEKIEALKEELEERETLLEELEQLL 353
Score = 45.1 bits (107), Expect = 2e-04
Identities = 27/111 (24%), Positives = 53/111 (47%), Gaps = 3/111 (2%)
Query: 431 LLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKE 490
L + EI + +++ + L DL L + E EL++ E E+L+E L ++++LE+
Sbjct: 399 LEELKREIESLEERLERLSERLEDLKEELKELEAELEELQTELEELNEELEELEEQLEEL 458
Query: 491 TAGHLEAKQKVAELEDRISELKHRADFEQDERKRLENLINSGSVPDDTKAV 541
E ++++AEL++ + L+ RL+ L +AV
Sbjct: 459 RDRLKELERELAELQEELQRLE---KELSSLEARLDRLEAEQRASQGVRAV 506
Score = 45.1 bits (107), Expect = 2e-04
Identities = 27/111 (24%), Positives = 59/111 (53%), Gaps = 1/111 (0%)
Query: 421 LKINVKKVVHLLAKEEEIV-AAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDES 479
LK ++ + LL + + +++ +EL+ E L L + + L++ +E E+L+E
Sbjct: 693 LKNELRSLEDLLEELRRQLEELERQLEELKRELAALEEELEQLQSRLEELEEELEELEEE 752
Query: 480 LSRIKDRLEKETAGHLEAKQKVAELEDRISELKHRADFEQDERKRLENLIN 530
L +++RLE+ ++ +A+L++ I EL+ + Q+E + LE +
Sbjct: 753 LEELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELEELEEELE 803
Score = 44.3 bits (105), Expect = 3e-04
Identities = 27/104 (25%), Positives = 51/104 (49%), Gaps = 4/104 (3%)
Query: 431 LLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKE 490
L EE + K+K + L+ E + L + EQ L + + KE+L+E LS + + LE+
Sbjct: 318 LEELEERLEELKEKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEEL 377
Query: 491 TAGHLE----AKQKVAELEDRISELKHRADFEQDERKRLENLIN 530
E + ++AE+ + + ELK + ++ +RL +
Sbjct: 378 FEALREELAELEAELAEIRNELEELKREIESLEERLERLSERLE 421
Score = 43.9 bits (104), Expect = 5e-04
Identities = 26/102 (25%), Positives = 49/102 (48%)
Query: 430 HLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEK 489
L EEE+ + + ++LE E L N L E L++ ++ E+L+ L +K L
Sbjct: 668 ELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLEELKRELAA 727
Query: 490 ETAGHLEAKQKVAELEDRISELKHRADFEQDERKRLENLINS 531
+ + ++ ELE+ + EL+ + Q+ + LE + S
Sbjct: 728 LEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELES 769
Score = 43.9 bits (104), Expect = 5e-04
Identities = 27/98 (27%), Positives = 51/98 (52%)
Query: 429 VHLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLE 488
L EE A +++ ELE E ++ N L + ++E++ + E L E L +K+ L+
Sbjct: 369 ALLEELEELFEALREELAELEAELAEIRNELEELKREIESLEERLERLSERLEDLKEELK 428
Query: 489 KETAGHLEAKQKVAELEDRISELKHRADFEQDERKRLE 526
+ A E + ++ EL + + EL+ + + +D K LE
Sbjct: 429 ELEAELEELQTELEELNEELEELEEQLEELRDRLKELE 466
Score = 42.0 bits (99), Expect = 0.002
Identities = 20/96 (20%), Positives = 48/96 (50%)
Query: 433 AKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETA 492
+ E+ +++ +EL+ E ++L + + E E+ + E+L+ L +++RLE+
Sbjct: 271 ELKSELEELREELEELQEELLELKEEIEELEGEISLLRERLEELENELEELEERLEELKE 330
Query: 493 GHLEAKQKVAELEDRISELKHRADFEQDERKRLENL 528
K+++ E E + EL+ ++ ++ LE
Sbjct: 331 KIEALKEELEERETLLEELEQLLAELEEAKEELEEK 366
Score = 39.7 bits (93), Expect = 0.010
Identities = 30/118 (25%), Positives = 55/118 (46%)
Query: 414 EDYDHILLKINVKKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEK 473
E+ + +L ++ AKEE +ELE L LA+ E EL + E
Sbjct: 340 EERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEALREELAELEAELAEIRNEL 399
Query: 474 EDLDESLSRIKDRLEKETAGHLEAKQKVAELEDRISELKHRADFEQDERKRLENLINS 531
E+L + +++RLE+ + + K+++ ELE + EL+ + +E + LE +
Sbjct: 400 EELKREIESLEERLERLSERLEDLKEELKELEAELEELQTELEELNEELEELEEQLEE 457
Score = 38.5 bits (90), Expect = 0.023
Identities = 28/100 (28%), Positives = 51/100 (51%), Gaps = 2/100 (2%)
Query: 432 LAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKET 491
L K E ++ EL+ E +L L + L + +E E+L+E LSR+++ LE+
Sbjct: 202 LEKLERQAEKAERYQELKAELRELELALLLAK--LKELRKELEELEEELSRLEEELEELQ 259
Query: 492 AGHLEAKQKVAELEDRISELKHRADFEQDERKRLENLINS 531
EA++++ EL+ + EL+ + Q+E L+ I
Sbjct: 260 EELEEAEKEIEELKSELEELREELEELQEELLELKEEIEE 299
Score = 37.0 bits (86), Expect = 0.062
Identities = 29/106 (27%), Positives = 57/106 (53%), Gaps = 6/106 (5%)
Query: 430 HLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEK 489
L +EEI +++ +ELE + L L + E+EL + + ++ L+ L R +RLE+
Sbjct: 899 ELAELKEEIEKLRERLEELEAKLERLEVELPELEEEL--EEEYEDTLETELEREIERLEE 956
Query: 490 ETAG----HLEAKQKVAELEDRISELKHRADFEQDERKRLENLINS 531
E +L A ++ E+E+R ELK + + ++ +++L +I
Sbjct: 957 EIEALGPVNLRAIEEYEEVEERYEELKSQREDLEEAKEKLLEVIEE 1002
Score = 37.0 bits (86), Expect = 0.068
Identities = 21/99 (21%), Positives = 48/99 (48%)
Query: 432 LAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKET 491
LA++ E+ +++ ELE + L L + EL E+L L ++ +LE+
Sbjct: 663 LAQKRELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEELRRQLEELERQLEELK 722
Query: 492 AGHLEAKQKVAELEDRISELKHRADFEQDERKRLENLIN 530
++++ +L+ R+ EL+ + ++E + L+ +
Sbjct: 723 RELAALEEELEQLQSRLEELEEELEELEEELEELQERLE 761
Score = 36.6 bits (85), Expect = 0.085
Identities = 25/97 (25%), Positives = 49/97 (50%)
Query: 434 KEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAG 493
E+ + KA+ E+E L +L + +EL++ +E L+E L +++ LE+
Sbjct: 209 AEKAERYQELKAELRELELALLLAKLKELRKELEELEEELSRLEEELEELQEELEEAEKE 268
Query: 494 HLEAKQKVAELEDRISELKHRADFEQDERKRLENLIN 530
E K ++ EL + + EL+ ++E + LE I+
Sbjct: 269 IEELKSELEELREELEELQEELLELKEEIEELEGEIS 305
Score = 33.1 bits (76), Expect = 1.1
Identities = 23/91 (25%), Positives = 50/91 (54%), Gaps = 4/91 (4%)
Query: 399 LDRLAQQLTLQTETGEDYDHILLKINVKKVVHLLAKEEEIVAAKKKADELEMENVDLSNR 458
++ L ++L +E ED L ++ + L + E+ ++ +ELE + +L +R
Sbjct: 406 IESLEERLERLSERLEDLKEELKELEAE----LEELQTELEELNEELEELEEQLEELRDR 461
Query: 459 LAKKEQELDQKTQEKEDLDESLSRIKDRLEK 489
L + E+EL + +E + L++ LS ++ RL++
Sbjct: 462 LKELERELAELQEELQRLEKELSSLEARLDR 492
>gnl|CDD|218549 pfam05308, Mito_fiss_reg, Mitochondrial fission regulator. In
eukaryotes, this family of proteins induces
mitochondrial fission.
Length = 248
Score = 49.8 bits (119), Expect = 2e-06
Identities = 41/146 (28%), Positives = 58/146 (39%), Gaps = 13/146 (8%)
Query: 456 SNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQKVAELEDRISELKHRA 515
NR K L ++ +L + + RL+ + EA QK++ LED ++ L RA
Sbjct: 79 PNRHEWKPSPLLPMKLSRQPSVPNLRQEEPRLQSLKPPNEEALQKISALEDELAFL--RA 136
Query: 516 DF----EQDERKRLENLINSGSVPDDTKAVGLSRSGGREEPSSPRKPEAPSSCIPPPPPP 571
E N S + D S + P SP E P +PPPPPP
Sbjct: 137 QIAKIVAAQE---QSNSTTSDLLSSDESVPSSSTTSF---PISP-PTEEPVLEVPPPPPP 189
Query: 572 ASLPPPPPPRRSSDLPYLTCPRPLVR 597
PPPP ++S+ L R R
Sbjct: 190 PPPPPPPSLQQSTSAIDLIKERKGQR 215
>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
Length = 520
Score = 46.3 bits (111), Expect = 8e-05
Identities = 34/105 (32%), Positives = 52/105 (49%), Gaps = 10/105 (9%)
Query: 424 NVKKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRI 483
+KK L AKEE K +E E E + N L K E+ L QK + + E L +
Sbjct: 53 AIKKEALLEAKEE----IHKLRNEFEKELRERRNELQKLEKRLLQKEENLDRKLELLEKR 108
Query: 484 KDRLEKETAGHLEAKQKVAELEDRISELKHRADFEQDERKRLENL 528
++ LEK+ LE KQ+ ELE + EL+ +++ + LE +
Sbjct: 109 EEELEKKEKE-LEQKQQ--ELEKKEEELEEL---IEEQLQELERI 147
Score = 33.2 bits (77), Expect = 0.90
Identities = 27/92 (29%), Positives = 45/92 (48%), Gaps = 17/92 (18%)
Query: 424 NVKKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRI 483
N+ + + LL K EE + K+K E + + ++ KKE+EL++ +E+ E +S
Sbjct: 97 NLDRKLELLEKREEELEKKEKELEQKQQELE------KKEEELEELIEEQLQELERIS-- 148
Query: 484 KDRLEKETAGHLEAKQKVAELEDRISELKHRA 515
L E EAK+ + LE E +H A
Sbjct: 149 --GLTAE-----EAKEIL--LEKVEEEARHEA 171
>gnl|CDD|184281 PRK13729, PRK13729, conjugal transfer pilus assembly protein TraB;
Provisional.
Length = 475
Score = 45.6 bits (108), Expect = 1e-04
Identities = 24/97 (24%), Positives = 40/97 (41%), Gaps = 12/97 (12%)
Query: 496 EAKQKVAELEDRISELKHRADF----EQDERKRLENLINSGS-VPDDTKAVG--LSRSGG 548
E + A+++ + E++ D D+++R+E L + + + KA+G + G
Sbjct: 73 EMQVTAAQMQKQYEEIRRELDVLNKQRGDDQRRIEKLGQDNAALAEQVKALGANPVTATG 132
Query: 549 REEPSSPRKPEAPSSCIPPPPPPASLPP-----PPPP 580
P P P P P P S PP PPP
Sbjct: 133 EPVPQMPASPPGPEGEPQPGNTPVSFPPQGSVAVPPP 169
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
primarily archaeal type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. It is found
in a single copy and is homodimeric in prokaryotes, but
six paralogs (excluded from this family) are found in
eukarotes, where SMC proteins are heterodimeric. This
family represents the SMC protein of archaea and a few
bacteria (Aquifex, Synechocystis, etc); the SMC of other
bacteria is described by TIGR02168. The N- and
C-terminal domains of this protein are well conserved,
but the central hinge region is skewed in composition
and highly divergent [Cellular processes, Cell division,
DNA metabolism, Chromosome-associated proteins].
Length = 1164
Score = 45.4 bits (108), Expect = 2e-04
Identities = 29/100 (29%), Positives = 51/100 (51%), Gaps = 4/100 (4%)
Query: 435 EEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGH 494
E EI + ++ E E E D RLAK E E+D+ E E+L+ + + R +K T +
Sbjct: 300 EAEIASLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEY 359
Query: 495 LEAKQKVAELEDRISEL--KHRADFEQ--DERKRLENLIN 530
E K+++ +L + E+ + ++ D R++LE L
Sbjct: 360 AELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKR 399
Score = 45.1 bits (107), Expect = 2e-04
Identities = 19/94 (20%), Positives = 49/94 (52%)
Query: 433 AKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETA 492
+EE+ + + +E++ E + + L ++L++ +E +L L R+++ L++ +
Sbjct: 361 ELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKREINELKRELDRLQEELQRLSE 420
Query: 493 GHLEAKQKVAELEDRISELKHRADFEQDERKRLE 526
+ +A +E +I+EL+ + + E K+ E
Sbjct: 421 ELADLNAAIAGIEAKINELEEEKEDKALEIKKQE 454
Score = 43.5 bits (103), Expect = 8e-04
Identities = 41/142 (28%), Positives = 67/142 (47%), Gaps = 21/142 (14%)
Query: 397 LLLDRLAQQL-TLQTE--TGEDYDHILLKINVKKVVHLLAKEEEIVAAKKKADELEMENV 453
L++D QQL L+ E E Y LLK + + L KE K+A E + E +
Sbjct: 191 LIIDEKRQQLERLRREREKAERYQ-ALLKEKREYEGYELLKE-------KEALERQKEAI 242
Query: 454 DLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETA--------GHLEAKQKVAELE 505
+ +LA E+EL++ T+E +L++ L I+ LE+ L K+K+ ELE
Sbjct: 243 E--RQLASLEEELEKLTEEISELEKRLEEIEQLLEELNKKIKDLGEEEQLRVKEKIGELE 300
Query: 506 DRISELKHRADFEQDERKRLEN 527
I+ L+ ++ E + E
Sbjct: 301 AEIASLERSIAEKERELEDAEE 322
Score = 42.7 bits (101), Expect = 0.001
Identities = 23/92 (25%), Positives = 45/92 (48%), Gaps = 2/92 (2%)
Query: 435 EEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGH 494
E + + + + L+ E L + L + E LD+ +QE D + I+ +E+
Sbjct: 675 ELQRLRERL--EGLKRELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQEE 732
Query: 495 LEAKQKVAELEDRISELKHRADFEQDERKRLE 526
+ K+++ ELE+ +S L+ + + E K LE
Sbjct: 733 EKLKERLEELEEDLSSLEQEIENVKSELKELE 764
Score = 40.8 bits (96), Expect = 0.004
Identities = 24/102 (23%), Positives = 50/102 (49%), Gaps = 5/102 (4%)
Query: 435 EEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGH 494
+I +K+ ++LE E L RL + E++L QE E++ L ++ R+E+
Sbjct: 715 SRKIGEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVKSELKELEARIEELEEDL 774
Query: 495 LEAKQKVAELEDRIS-----ELKHRADFEQDERKRLENLINS 531
+ ++ + +LE R+S E++ ++E R+E +
Sbjct: 775 HKLEEALNDLEARLSHSRIPEIQAELSKLEEEVSRIEARLRE 816
Score = 40.1 bits (94), Expect = 0.008
Identities = 23/107 (21%), Positives = 50/107 (46%), Gaps = 7/107 (6%)
Query: 434 KEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIK---DRLEKE 490
+E+ + K++ +E E +L+ + + E+EL++ DL+ L +K D LE +
Sbjct: 838 LQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELEAQ 897
Query: 491 TA----GHLEAKQKVAELEDRISELKHRADFEQDERKRLENLINSGS 533
E + ++ + R+SELK + + ++E +E+
Sbjct: 898 LRELERKIEELEAQIEKKRKRLSELKAKLEALEEELSEIEDPKGEDE 944
Score = 38.9 bits (91), Expect = 0.019
Identities = 23/101 (22%), Positives = 48/101 (47%), Gaps = 10/101 (9%)
Query: 435 EEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIK---DRLEKET 491
I + K +ELE E D + + K+E +L+Q + ++ L +K DR+EKE
Sbjct: 426 NAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQLAADLSKYEQELYDLKEEYDRVEKEL 485
Query: 492 AGHLEAKQKVAELEDRISELKHRADFEQDERKRLENLINSG 532
+ + ++++AE E + + R + +E ++ +
Sbjct: 486 S---KLQRELAEAEAQARASEER----VRGGRAVEEVLKAS 519
Score = 38.5 bits (90), Expect = 0.023
Identities = 27/111 (24%), Positives = 49/111 (44%), Gaps = 15/111 (13%)
Query: 431 LLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQEL-----DQKTQEKEDLDESLSRIKD 485
+ + E+ + + +ELE + L L E L + E L+E +SRI+
Sbjct: 753 IENVKSELKELEARIEELEEDLHKLEEALNDLEARLSHSRIPEIQAELSKLEEEVSRIEA 812
Query: 486 RLE----KETAGHLE---AKQKVAELEDRISELKHRADFEQDERKRLENLI 529
RL K LE ++++ EL+++ +LK + + K +ENL
Sbjct: 813 RLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQ---IKSIEKEIENLN 860
Score = 38.5 bits (90), Expect = 0.023
Identities = 16/95 (16%), Positives = 48/95 (50%)
Query: 436 EEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHL 495
+E+ ++K ++L+ E +L L + ++EL + ++E DL+ +++ I+ ++ +
Sbjct: 385 DELKDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEKE 444
Query: 496 EAKQKVAELEDRISELKHRADFEQDERKRLENLIN 530
+ ++ + E ++ +L + E L+ +
Sbjct: 445 DKALEIKKQEWKLEQLAADLSKYEQELYDLKEEYD 479
Score = 37.7 bits (88), Expect = 0.045
Identities = 17/73 (23%), Positives = 33/73 (45%), Gaps = 7/73 (9%)
Query: 447 ELEMENVDLSNRLAKKEQELDQKTQEKEDLDE-------SLSRIKDRLEKETAGHLEAKQ 499
L E DL+ +A E ++++ +EKED L ++ L K + K+
Sbjct: 417 RLSEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQLAADLSKYEQELYDLKE 476
Query: 500 KVAELEDRISELK 512
+ +E +S+L+
Sbjct: 477 EYDRVEKELSKLQ 489
Score = 35.0 bits (81), Expect = 0.25
Identities = 20/92 (21%), Positives = 42/92 (45%), Gaps = 7/92 (7%)
Query: 435 EEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGH 494
E + ++ + + ++ + + EQE ++ + E+L+E LS ++ +E
Sbjct: 701 ENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQEIE------ 754
Query: 495 LEAKQKVAELEDRISELKHRADFEQDERKRLE 526
K ++ ELE RI EL+ ++ LE
Sbjct: 755 -NVKSELKELEARIEELEEDLHKLEEALNDLE 785
Score = 35.0 bits (81), Expect = 0.28
Identities = 35/178 (19%), Positives = 65/178 (36%), Gaps = 10/178 (5%)
Query: 408 LQTETGEDYDHILLKINVKKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELD 467
L+ + + + L EEE+ + + E+E + L+ E+E+
Sbjct: 777 LEEALNDLEARLSHSRIPEIQAELSKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQ 836
Query: 468 QKTQEKEDLDESLSRIKDRLE------KETAGHLEAKQ-KVAELEDRISELKHRADFEQD 520
+ +++ DL E + I+ +E +E LE + + +LE R+ +LK D +
Sbjct: 837 ELQEQRIDLKEQIKSIEKEIENLNGKKEELEEELEELEAALRDLESRLGDLKKERDELEA 896
Query: 521 ERKRLENLINSGSVPDDTKAVGLSRSGGREEPSSPRKPE---APSSCIPPPPPPASLP 575
+ + LE I + K LS + E E P SL
Sbjct: 897 QLRELERKIEELEAQIEKKRKRLSELKAKLEALEEELSEIEDPKGEDEEIPEEELSLE 954
Score = 32.3 bits (74), Expect = 1.6
Identities = 19/91 (20%), Positives = 41/91 (45%)
Query: 433 AKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETA 492
EEE+ + +LE DL + E +L + ++ E+L+ + + + RL + A
Sbjct: 865 ELEEELEELEAALRDLESRLGDLKKERDELEAQLRELERKIEELEAQIEKKRKRLSELKA 924
Query: 493 GHLEAKQKVAELEDRISELKHRADFEQDERK 523
+++++E+ED E + + E
Sbjct: 925 KLEALEEELSEIEDPKGEDEEIPEEELSLED 955
Score = 31.2 bits (71), Expect = 4.2
Identities = 21/71 (29%), Positives = 39/71 (54%), Gaps = 6/71 (8%)
Query: 458 RLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQKVAELEDRISELKHRADF 517
RL ++ + L + E L L RI++RL++ + +A +K+ E+E I +L+
Sbjct: 678 RLRERLEGLKR---ELSSLQSELRRIENRLDELSQELSDASRKIGEIEKEIEQLEQE--- 731
Query: 518 EQDERKRLENL 528
E+ ++RLE L
Sbjct: 732 EEKLKERLEEL 742
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
bacterial type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. This family
represents the SMC protein of most bacteria. The smc
gene is often associated with scpB (TIGR00281) and scpA
genes, where scp stands for segregation and condensation
protein. SMC was shown (in Caulobacter crescentus) to be
induced early in S phase but present and bound to DNA
throughout the cell cycle [Cellular processes, Cell
division, DNA metabolism, Chromosome-associated
proteins].
Length = 1179
Score = 45.4 bits (108), Expect = 2e-04
Identities = 30/143 (20%), Positives = 68/143 (47%), Gaps = 8/143 (5%)
Query: 394 QHWLLLDR-------LAQQLTLQTETGEDYDHILLKINVKKVVHLLAKEEEIVAAKKKAD 446
+ LL+ R L + E E+ + + ++ + L E+ +++ +
Sbjct: 226 ELALLVLRLEELREELEELQEELKEAEEELEELTAELQELEE-KLEELRLEVSELEEEIE 284
Query: 447 ELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQKVAELED 506
EL+ E L+N +++ EQ+ + +L+ L ++ +LE+ + E +++AELE+
Sbjct: 285 ELQKELYALANEISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEE 344
Query: 507 RISELKHRADFEQDERKRLENLI 529
++ ELK + + E + LE +
Sbjct: 345 KLEELKEELESLEAELEELEAEL 367
Score = 43.5 bits (103), Expect = 6e-04
Identities = 24/92 (26%), Positives = 48/92 (52%)
Query: 435 EEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGH 494
EE+I +K EL E +L L + +EL++ +++ L + L+R++ +E+
Sbjct: 690 EEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERI 749
Query: 495 LEAKQKVAELEDRISELKHRADFEQDERKRLE 526
+ +++ ELE I EL+ R + ++E E
Sbjct: 750 AQLSKELTELEAEIEELEERLEEAEEELAEAE 781
Score = 41.2 bits (97), Expect = 0.004
Identities = 30/129 (23%), Positives = 64/129 (49%), Gaps = 6/129 (4%)
Query: 399 LDRLAQQLTLQTETGEDYDHILLKINVKKV-VHLLAKEEEIVAAKKKADELEMENVDLSN 457
++ L +++ E + L+ + ++ L EEE+ +K+ +EL + L
Sbjct: 679 IEELEEKIEELEEKIAE-----LEKALAELRKELEELEEELEQLRKELEELSRQISALRK 733
Query: 458 RLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQKVAELEDRISELKHRADF 517
LA+ E E++Q + L + L+ ++ +E+ EA++++AE E I EL+ + +
Sbjct: 734 DLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQ 793
Query: 518 EQDERKRLE 526
++E K L
Sbjct: 794 LKEELKALR 802
Score = 40.4 bits (95), Expect = 0.006
Identities = 25/101 (24%), Positives = 47/101 (46%), Gaps = 7/101 (6%)
Query: 431 LLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKE 490
+L + EI ++K +ELE + +L LA+ +EL++ +E E L + L + ++
Sbjct: 672 ILERRREIEELEEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQIS-- 729
Query: 491 TAGHLEAKQKVAELEDRISELKHRADFEQDERKRLENLINS 531
++ +A LE + +L+ R E LE I
Sbjct: 730 -----ALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEE 765
Score = 40.0 bits (94), Expect = 0.008
Identities = 24/97 (24%), Positives = 48/97 (49%)
Query: 435 EEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGH 494
EE+I + + L E +L + + E EL+ E+ L+E+L+ ++ LE+ +
Sbjct: 844 EEQIEELSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELSEEL 903
Query: 495 LEAKQKVAELEDRISELKHRADFEQDERKRLENLINS 531
E + K +EL + EL+ + + + LE I++
Sbjct: 904 RELESKRSELRRELEELREKLAQLELRLEGLEVRIDN 940
Score = 39.3 bits (92), Expect = 0.015
Identities = 22/97 (22%), Positives = 45/97 (46%)
Query: 435 EEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGH 494
+EE+ A ++ DEL E L+ A + L+ + + L +++++E+ +
Sbjct: 795 KEELKALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDI 854
Query: 495 LEAKQKVAELEDRISELKHRADFEQDERKRLENLINS 531
++ ELE+ I EL+ + +ER LE +
Sbjct: 855 ESLAAEIEELEELIEELESELEALLNERASLEEALAL 891
Score = 38.1 bits (89), Expect = 0.029
Identities = 24/96 (25%), Positives = 45/96 (46%)
Query: 435 EEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGH 494
EE I K+ ELE E +L RL + E+EL + E E+L+ + ++K+ L+
Sbjct: 746 EERIAQLSKELTELEAEIEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREAL 805
Query: 495 LEAKQKVAELEDRISELKHRADFEQDERKRLENLIN 530
E + ++ L + + L+ R + + E +
Sbjct: 806 DELRAELTLLNEEAANLRERLESLERRIAATERRLE 841
Score = 37.7 bits (88), Expect = 0.037
Identities = 24/112 (21%), Positives = 53/112 (47%), Gaps = 14/112 (12%)
Query: 434 KEEEIVAAKKKADELEMENVDLSNRLAKKEQ-------ELDQKTQEKEDLDESLSRIKDR 486
E++ +++ LE + +L +L + E EL + ++ E+L E L ++
Sbjct: 300 LEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAE 359
Query: 487 LEKETAGHLEAKQKVAELED-------RISELKHRADFEQDERKRLENLINS 531
LE+ A E + ++ ELE+ ++++L+ + +E +RLE +
Sbjct: 360 LEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLER 411
Score = 37.7 bits (88), Expect = 0.039
Identities = 24/97 (24%), Positives = 45/97 (46%)
Query: 430 HLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEK 489
L EE+ ++K +EL+ E L L + E EL++ E+L+E L ++ ++ +
Sbjct: 331 KLDELAEELAELEEKLEELKEELESLEAELEELEAELEELESRLEELEEQLETLRSKVAQ 390
Query: 490 ETAGHLEAKQKVAELEDRISELKHRADFEQDERKRLE 526
++ LE R+ L+ R + Q E + L
Sbjct: 391 LELQIASLNNEIERLEARLERLEDRRERLQQEIEELL 427
Score = 37.0 bits (86), Expect = 0.066
Identities = 29/130 (22%), Positives = 62/130 (47%), Gaps = 4/130 (3%)
Query: 401 RLAQQLTLQTETGEDYDHILLKINVKKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLA 460
+L +++ ++ + + + ++ + L EEE+ A+ + +ELE + L L
Sbjct: 744 QLEERIAQLSKELTELEAEIEELEER----LEEAEEELAEAEAEIEELEAQIEQLKEELK 799
Query: 461 KKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQKVAELEDRISELKHRADFEQD 520
+ LD+ E L+E + +++RLE ++++ +LE++I EL +
Sbjct: 800 ALREALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESLAA 859
Query: 521 ERKRLENLIN 530
E + LE LI
Sbjct: 860 EIEELEELIE 869
Score = 37.0 bits (86), Expect = 0.072
Identities = 19/92 (20%), Positives = 38/92 (41%)
Query: 435 EEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGH 494
E + + + L E +L RL E+ + + EDL+E + + + +E A
Sbjct: 802 REALDELRAELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELSEDIESLAAEI 861
Query: 495 LEAKQKVAELEDRISELKHRADFEQDERKRLE 526
E ++ + ELE + L + ++ L
Sbjct: 862 EELEELIEELESELEALLNERASLEEALALLR 893
Score = 36.6 bits (85), Expect = 0.087
Identities = 28/129 (21%), Positives = 56/129 (43%), Gaps = 12/129 (9%)
Query: 433 AKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEK--- 489
E E+ + + +ELE + L +++A+ E ++ E E L+ L R++DR E+
Sbjct: 362 ELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQ 421
Query: 490 ---------ETAGHLEAKQKVAELEDRISELKHRADFEQDERKRLENLINSGSVPDDTKA 540
E A E + ++ ELE+ + EL+ + ++ + L + D
Sbjct: 422 EIEELLKKLEEAELKELQAELEELEEELEELQEELERLEEALEELREELEEAEQALDAAE 481
Query: 541 VGLSRSGGR 549
L++ R
Sbjct: 482 RELAQLQAR 490
Score = 35.8 bits (83), Expect = 0.14
Identities = 24/99 (24%), Positives = 49/99 (49%), Gaps = 5/99 (5%)
Query: 435 EEEIVAAKKKADELEMENVDL-----SNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEK 489
E + + + + L+ E +L L + + EL++ +E E+L E L R+++ LE+
Sbjct: 406 EARLERLEDRRERLQQEIEELLKKLEEAELKELQAELEELEEELEELQEELERLEEALEE 465
Query: 490 ETAGHLEAKQKVAELEDRISELKHRADFEQDERKRLENL 528
EA+Q + E +++L+ R D + ++ LE
Sbjct: 466 LREELEEAEQALDAAERELAQLQARLDSLERLQENLEGF 504
Score = 35.8 bits (83), Expect = 0.16
Identities = 33/155 (21%), Positives = 63/155 (40%), Gaps = 33/155 (21%)
Query: 399 LDRLAQQLTLQTETGEDYDHILLKINV------KKVVHLLAKEEEIVAAKKKADELEMEN 452
L+ L +Q+ +E E + ++ ++ LL + + A E+E
Sbjct: 840 LEDLEEQIEELSEDIESLAAEIEELEELIEELESELEALLNERASLEEALALL-RSELEE 898
Query: 453 VDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQKVAELEDRISELK 512
LS L + E + + +E E+L E L++++ RLE + ++ L++R+SE +
Sbjct: 899 --LSEELRELESKRSELRRELEELREKLAQLELRLEG-------LEVRIDNLQERLSE-E 948
Query: 513 HRADFE----------------QDERKRLENLINS 531
+ E + KRLEN I
Sbjct: 949 YSLTLEEAEALENKIEDDEEEARRRLKRLENKIKE 983
Score = 31.6 bits (72), Expect = 2.9
Identities = 22/66 (33%), Positives = 36/66 (54%)
Query: 465 ELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQKVAELEDRISELKHRADFEQDERKR 524
L++ +E E+L E L ++ LE+ TA E ++K+ EL +SEL+ + Q E
Sbjct: 233 RLEELREELEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYA 292
Query: 525 LENLIN 530
L N I+
Sbjct: 293 LANEIS 298
>gnl|CDD|234501 TIGR04211, SH3_and_anchor, SH3 domain protein. Members of this
protein family have a signal peptide, a strongly
conserved SH3 domain, a variable region, and then a
C-terminal hydrophobic transmembrane alpha helix region.
Length = 198
Score = 43.5 bits (103), Expect = 2e-04
Identities = 24/93 (25%), Positives = 49/93 (52%), Gaps = 4/93 (4%)
Query: 435 EEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGH 494
++E+ +++ EL+ + +L + +QEL E E L + L+RIK + +A
Sbjct: 72 QQELAELQEELAELQEQLAELQQENQELKQELSTLEAELERLQKELARIK----QLSANA 127
Query: 495 LEAKQKVAELEDRISELKHRADFEQDERKRLEN 527
+E ++ EL + ++ELK + + E +RL+
Sbjct: 128 IELDEENRELREELAELKQENEALEAENERLQE 160
Score = 31.9 bits (73), Expect = 1.1
Identities = 19/52 (36%), Positives = 25/52 (48%), Gaps = 1/52 (1%)
Query: 432 LAKE-EEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSR 482
L KE I A EL+ EN +L LA+ +QE + E E L E+ R
Sbjct: 113 LQKELARIKQLSANAIELDEENRELREELAELKQENEALEAENERLQENEQR 164
>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
Length = 3151
Score = 45.3 bits (107), Expect = 2e-04
Identities = 22/92 (23%), Positives = 29/92 (31%), Gaps = 20/92 (21%)
Query: 545 RSGGREEPSSPRKPEAPSSCIPPPPPPASLPP------PPPPRRSS---------DLPYL 589
R + + PR P P PP+ LPP PPPP S
Sbjct: 2588 RPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVP 2647
Query: 590 TCPRPLV-----RCRQPPRSKLKSRIPRPYFP 616
RP R +P R++ R + P
Sbjct: 2648 PPERPRDDPAPGRVSRPRRARRLGRAAQASSP 2679
Score = 39.5 bits (92), Expect = 0.011
Identities = 19/84 (22%), Positives = 28/84 (33%), Gaps = 19/84 (22%)
Query: 549 REEPSSPRKPEAPSSCIP---PPPPPAS----------------LPPPPPPRRSSDLPYL 589
++ PR P PS P P PP +PPP PR +
Sbjct: 2602 VDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRV 2661
Query: 590 TCPRPLVRCRQPPRSKLKSRIPRP 613
+ PR R + ++ + PR
Sbjct: 2662 SRPRRARRLGRAAQASSPPQRPRR 2685
Score = 38.8 bits (90), Expect = 0.018
Identities = 19/49 (38%), Positives = 23/49 (46%), Gaps = 4/49 (8%)
Query: 550 EEPSSPRKPEAPSSCIPPPPPPA----SLPPPPPPRRSSDLPYLTCPRP 594
++P P +P+AP P P PP PPPPPPR L T P
Sbjct: 2903 DQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAG 2951
Score = 38.4 bits (89), Expect = 0.025
Identities = 20/68 (29%), Positives = 28/68 (41%), Gaps = 4/68 (5%)
Query: 550 EEPSSPRKPEAPSSCIPPPPPPASLPPPPPPR---RSSDLPYLTCPRPLVRCRQPPRSKL 606
P +P P PPPPPP PP P + P P+P + P R +
Sbjct: 2915 PPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAV 2974
Query: 607 -KSRIPRP 613
+ R+P+P
Sbjct: 2975 PRFRVPQP 2982
Score = 37.6 bits (87), Expect = 0.050
Identities = 17/36 (47%), Positives = 18/36 (50%)
Query: 548 GREEPSSPRKPEAPSSCIPPPPPPASLPPPPPPRRS 583
E P P+ P P PPPPP PPPPPP R
Sbjct: 2904 QPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRP 2939
Score = 37.2 bits (86), Expect = 0.054
Identities = 14/53 (26%), Positives = 17/53 (32%), Gaps = 1/53 (1%)
Query: 553 SSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYL-TCPRPLVRCRQPPRS 604
+ P + S PP P P P P P P+P PPR
Sbjct: 2887 ARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRP 2939
Score = 36.8 bits (85), Expect = 0.082
Identities = 26/74 (35%), Positives = 30/74 (40%), Gaps = 3/74 (4%)
Query: 531 SGSVPDDTKAVGLSRSGGREEPSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLT 590
S P D A L+ + +SP P P + P PP PPP PP S L
Sbjct: 2800 SPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPP---PPPGPPPPSLPLGGSV 2856
Query: 591 CPRPLVRCRQPPRS 604
P VR R P RS
Sbjct: 2857 APGGDVRRRPPSRS 2870
Score = 36.8 bits (85), Expect = 0.084
Identities = 17/70 (24%), Positives = 26/70 (37%), Gaps = 7/70 (10%)
Query: 552 PSSPRKPEA---PSSCIPPPPPPASLPPPPPPR-RSSDLPYLTCPRPLVRCRQPPRSKLK 607
P++P +P + +LPP P R P P+P + PP+ +
Sbjct: 2875 PAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAP--PPPQPQPQPPPPPQPQPP 2932
Query: 608 SRIP-RPYFP 616
P RP P
Sbjct: 2933 PPPPPRPQPP 2942
Score = 35.7 bits (82), Expect = 0.17
Identities = 25/77 (32%), Positives = 32/77 (41%), Gaps = 3/77 (3%)
Query: 539 KAVGLSRSGGREEPSSPRKPEAPSSCIPPPPP-PASLPPPPP-PRRSSDLPYLTCPRPLV 596
A +R G ++ + P P S P P P PAS PPPP P S++ P P
Sbjct: 391 AATPFARGPGGDDQTRPAAPVPASVPTPAPTPVPASAPPPPATPLPSAEPGSDDGPAP-P 449
Query: 597 RCRQPPRSKLKSRIPRP 613
RQPP + P
Sbjct: 450 PERQPPAPATEPAPDDP 466
Score = 34.9 bits (80), Expect = 0.34
Identities = 19/50 (38%), Positives = 21/50 (42%), Gaps = 8/50 (16%)
Query: 567 PPPPPASLPPPPPPRRSSDLPYLTCPRPLVRCRQPPRSKLKSRIPRPYFP 616
PPPP PP P RS + PRP R P + SR RP P
Sbjct: 2551 PPPPLPPAAPPAAPDRS-----VPPPRPAPR---PSEPAVTSRARRPDAP 2592
Score = 34.9 bits (80), Expect = 0.35
Identities = 21/92 (22%), Positives = 28/92 (30%), Gaps = 6/92 (6%)
Query: 507 RISELKHRADFEQDERKRLENLINSGSVPDDTKAVGLSRSGGREEPSS-PRKPEAPSSCI 565
+E +R + G V +A L R+ P PR+ A +
Sbjct: 2634 AANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVG 2693
Query: 566 P-----PPPPPASLPPPPPPRRSSDLPYLTCP 592
PPPP P P P S P P
Sbjct: 2694 SLTSLADPPPPPPTPEPAPHALVSATPLPPGP 2725
Score = 34.5 bits (79), Expect = 0.46
Identities = 17/72 (23%), Positives = 19/72 (26%), Gaps = 2/72 (2%)
Query: 531 SGSVPDDTKAVGLSRSGGREEPSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLT 590
G P G GG P + A P PP L P R +
Sbjct: 2843 PGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFA--L 2900
Query: 591 CPRPLVRCRQPP 602
P R QP
Sbjct: 2901 PPDQPERPPQPQ 2912
Score = 33.4 bits (76), Expect = 0.88
Identities = 18/55 (32%), Positives = 19/55 (34%), Gaps = 6/55 (10%)
Query: 549 REEPSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYL-TCPRPLVRCRQPP 602
R S P+ P P PP P P PP P PRP QPP
Sbjct: 2893 RSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRP-----QPP 2942
Score = 33.4 bits (76), Expect = 0.96
Identities = 9/35 (25%), Positives = 12/35 (34%)
Query: 552 PSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDL 586
+ R PE PPPPP + P +
Sbjct: 263 EGADRAPETARGATGPPPPPEAAAPNGAAAPPDGV 297
Score = 32.6 bits (74), Expect = 1.6
Identities = 18/53 (33%), Positives = 24/53 (45%), Gaps = 2/53 (3%)
Query: 552 PSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPL--VRCRQPP 602
P++P P S P PPP PP P + D P + L +R R+PP
Sbjct: 431 PATPLPSAEPGSDDGPAPPPERQPPAPATEPAPDDPDDATRKALDALRERRPP 483
Score = 31.8 bits (72), Expect = 2.4
Identities = 19/58 (32%), Positives = 23/58 (39%), Gaps = 9/58 (15%)
Query: 552 PSSPRKPEAP-SSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPLVRCRQPPRSKLKS 608
P P +P P ++ PP P P + P PPRR RP V R L S
Sbjct: 2751 PGGPARPARPPTTAGPPAPAPPAAPAAGPPRRL--------TRPAVASLSESRESLPS 2800
Score = 31.1 bits (70), Expect = 5.0
Identities = 14/50 (28%), Positives = 17/50 (34%), Gaps = 2/50 (4%)
Query: 547 GGREEPSSPRKPEAPSSCIPPPPPPASLPP--PPPPRRSSDLPYLTCPRP 594
GG P+ P P + PP P A P P S + P P
Sbjct: 2752 GGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSP 2801
Score = 31.1 bits (70), Expect = 5.2
Identities = 14/61 (22%), Positives = 19/61 (31%), Gaps = 5/61 (8%)
Query: 535 PDDTKAVGLSRSGGREEPS----SPRKPEAPSSCIPPPPPPASLP-PPPPPRRSSDLPYL 589
P A SG +P P + P +P P P P PP L +
Sbjct: 2945 PTTDPAGAGEPSGAVPQPWLGALVPGRVAVPRFRVPQPAPSREAPASSTPPLTGHSLSRV 3004
Query: 590 T 590
+
Sbjct: 3005 S 3005
Score = 30.7 bits (69), Expect = 6.5
Identities = 22/72 (30%), Positives = 23/72 (31%), Gaps = 4/72 (5%)
Query: 546 SGGREEPSSPRKPEAPSSCIPPPPPPASLPPPPP----PRRSSDLPYLTCPRPLVRCRQP 601
+G P P P A PPP PA P P RR P PR V R
Sbjct: 2548 AGDPPPPLPPAAPPAAPDRSVPPPRPAPRPSEPAVTSRARRPDAPPQSARPRAPVDDRGD 2607
Query: 602 PRSKLKSRIPRP 613
PR P
Sbjct: 2608 PRGPAPPSPLPP 2619
Score = 30.3 bits (68), Expect = 7.4
Identities = 11/38 (28%), Positives = 14/38 (36%), Gaps = 8/38 (21%)
Query: 551 EPSSPRKPEAPSSCI--------PPPPPPASLPPPPPP 580
E ++P AP + P P PPPP P
Sbjct: 283 EAAAPNGAAAPPDGVWGAALAGAPLALPAPPDPPPPAP 320
Score = 30.3 bits (68), Expect = 7.6
Identities = 12/53 (22%), Positives = 16/53 (30%), Gaps = 2/53 (3%)
Query: 541 VGLSRSGGREEPSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPR 593
G +R + P P P++ PP L P S L P
Sbjct: 2752 GGPARPARPPTTAGPPAPAPPAAPAAGPPRR--LTRPAVASLSESRESLPSPW 2802
Score = 30.3 bits (68), Expect = 8.7
Identities = 18/67 (26%), Positives = 22/67 (32%), Gaps = 6/67 (8%)
Query: 547 GGREEPSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPLVRCRQPPRSKL 606
GG P + PS P PA+ P PP RR L R PP
Sbjct: 2853 GGSVAPGGDVRRRPPSR--SPAAKPAA-PARPPVRR---LARPAVSRSTESFALPPDQPE 2906
Query: 607 KSRIPRP 613
+ P+
Sbjct: 2907 RPPQPQA 2913
Score = 29.9 bits (67), Expect = 9.7
Identities = 14/51 (27%), Positives = 30/51 (58%), Gaps = 1/51 (1%)
Query: 438 IVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLE 488
+ A +++ADELE DL+ ++A + + + D++ +L R+++R E
Sbjct: 1287 VGATRRRADELEAAAADLAEKMAARRAR-ASRERWAADVEAALDRVENRAE 1336
>gnl|CDD|223039 PHA03307, PHA03307, transcriptional regulator ICP4; Provisional.
Length = 1352
Score = 44.8 bits (106), Expect = 3e-04
Identities = 23/65 (35%), Positives = 25/65 (38%), Gaps = 4/65 (6%)
Query: 538 TKAVGLSRSGGREEPSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPLVR 597
T A G P P P+ P PP PPPAS PP P P S L + P P
Sbjct: 94 TLAPASPAREGSPTPPGPSSPDPP----PPTPPPASPPPSPAPDLSEMLRPVGSPGPPPA 149
Query: 598 CRQPP 602
P
Sbjct: 150 ASPPA 154
Score = 43.2 bits (102), Expect = 9e-04
Identities = 17/83 (20%), Positives = 23/83 (27%), Gaps = 10/83 (12%)
Query: 530 NSGSVPDDTKAVGLSRSGGREEPSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYL 589
+S S + + + S S + PS P P P P R
Sbjct: 317 SSSSSRESSSSSTSSSSESSRGAAVSP---GPSPSRSPSPSRPPPPADPSSPRKR----- 368
Query: 590 TCPRPLVRCRQPPRSKLKSRIPR 612
PRP P S + R
Sbjct: 369 --PRPSRAPSSPAASAGRPTRRR 389
Score = 40.5 bits (95), Expect = 0.006
Identities = 24/103 (23%), Positives = 37/103 (35%), Gaps = 6/103 (5%)
Query: 530 NSGSVPDDTKAVGLSRSGGREEPSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYL 589
+ +V D + + + R P +P + PP PPA+ P PP R S
Sbjct: 158 SPAAVASDAASSRQAALPLSSPEETARAPSSPPAEPPPSTPPAAASPRPPRRSSPISASA 217
Query: 590 TCPRPLVRCRQPPRSKLKSRIPRPYFPNVSVQVNGTIWTELDE 632
+ P P P RS + S + +G W +E
Sbjct: 218 SSPAPA-----PGRSA-ADDAGASSSDSSSSESSGCGWGPENE 254
Score = 38.6 bits (90), Expect = 0.024
Identities = 17/89 (19%), Positives = 25/89 (28%), Gaps = 7/89 (7%)
Query: 532 GSVPDDTKAVGLSRSGGREEPSSPRKP-EAPSSCIPPPPPPASLPPPPPPRRSSDLPYLT 590
G + A + + S P S P PP A+ PPP S
Sbjct: 800 GRLRRSGPAADAASRTASKRKSRSHTPDGGSESSGPARPPGAAARPPPARSSESSKSKPA 859
Query: 591 CPRPLVRCR------QPPRSKLKSRIPRP 613
R + +PP + + P
Sbjct: 860 AAGGRARGKNGRRRPRPPEPRARPGAAAP 888
Score = 36.7 bits (85), Expect = 0.079
Identities = 23/86 (26%), Positives = 26/86 (30%), Gaps = 14/86 (16%)
Query: 531 SGSVPDDTKAVGLSRSGGR----------EEPSSPRKPEAPSSCIPPPPPPASLPPPPPP 580
S S AV S R +PSSPRK PS P P AS P
Sbjct: 331 SSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSR--APSSPAASAGRPTRR 388
Query: 581 RRSSDLPYLTCPR--PLVRCRQPPRS 604
R + + R PR
Sbjct: 389 RARAAVAGRARRRDATGRFPAGRPRP 414
Score = 34.0 bits (78), Expect = 0.53
Identities = 24/112 (21%), Positives = 29/112 (25%), Gaps = 13/112 (11%)
Query: 512 KHRADFEQDERKRLENLINSGSVPDDTK---AVGLSRSGGREEPSSPRKPEAP------- 561
A +RK + + GS R SS KP A
Sbjct: 809 ADAASRTASKRKSRSHTPDGGSESSGPARPPGAAARPPPARSSESSKSKPAAAGGRARGK 868
Query: 562 SSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPLVRCRQPPRSKLKSRIPRP 613
+ P PP P + P R R PR KL P P
Sbjct: 869 NGRRRPRPPEPRARPGAAAPPKAAAAAPPAGAPAPRPRPAPRVKL---GPMP 917
Score = 33.2 bits (76), Expect = 1.1
Identities = 16/68 (23%), Positives = 19/68 (27%), Gaps = 4/68 (5%)
Query: 531 SGSVPDDTKAVGLSRSGGREEPSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSD---LP 587
S A G +R P +P A PP A+ PP P
Sbjct: 852 ESSKSKPAAAGGRARGKNGRRRPRPPEPRARPGAAAPPKAAAAAPPAGAPAPRPRPAPRV 911
Query: 588 YLTCPRPL 595
L P P
Sbjct: 912 KLG-PMPP 918
Score = 32.8 bits (75), Expect = 1.3
Identities = 22/105 (20%), Positives = 27/105 (25%), Gaps = 22/105 (20%)
Query: 531 SGSVPDDTKAVGLSRSGGREEPSSPRKPEAPSSCIPPPPPPASL---------------- 574
SG ++ S S E S P +P S P P AS
Sbjct: 273 SGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSS 332
Query: 575 ------PPPPPPRRSSDLPYLTCPRPLVRCRQPPRSKLKSRIPRP 613
P S P + P P P + SR P
Sbjct: 333 SESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSS 377
Score = 32.5 bits (74), Expect = 1.5
Identities = 17/83 (20%), Positives = 22/83 (26%)
Query: 520 DERKRLENLINSGSVPDDTKAVGLSRSGGREEPSSPRKPEAPSSCIPPPPPPASLPPPPP 579
+R G + S +P P S PP P PPP P
Sbjct: 62 CDRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTP 121
Query: 580 PRRSSDLPYLTCPRPLVRCRQPP 602
P S ++R P
Sbjct: 122 PPASPPPSPAPDLSEMLRPVGSP 144
Score = 31.7 bits (72), Expect = 2.9
Identities = 21/82 (25%), Positives = 24/82 (29%), Gaps = 8/82 (9%)
Query: 540 AVGLSRSGGREEPSSPRKPEAPSSCIPPPPPPASLPPPPPPR--------RSSDLPYLTC 591
A G + R P+S S P P P S P P PR R S +
Sbjct: 272 ASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSS 331
Query: 592 PRPLVRCRQPPRSKLKSRIPRP 613
R SR P P
Sbjct: 332 SSESSRGAAVSPGPSPSRSPSP 353
Score = 31.3 bits (71), Expect = 3.6
Identities = 11/85 (12%), Positives = 19/85 (22%)
Query: 520 DERKRLENLINSGSVPDDTKAVGLSRSGGREEPSSPRKPEAPSSCIPPPPPPASLPPPPP 579
D + + +G+ D G + + + + PAS
Sbjct: 46 DSAELAAVTVVAGAAACDRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGS 105
Query: 580 PRRSSDLPYLTCPRPLVRCRQPPRS 604
P P PP
Sbjct: 106 PTPPGPSSPDPPPPTPPPASPPPSP 130
>gnl|CDD|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding
[General function prediction only].
Length = 239
Score = 41.6 bits (98), Expect = 0.001
Identities = 30/93 (32%), Positives = 50/93 (53%), Gaps = 3/93 (3%)
Query: 424 NVKKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRI 483
VK L A EI AK++ + LE E +L + K E+E++ + E L+++L+
Sbjct: 84 AVKDERELRALNIEIQIAKERINSLEDELAELMEEIEKLEKEIEDLKERLERLEKNLAEA 143
Query: 484 KDRLEKETAGHLEAKQKVAELEDRISELKHRAD 516
+ RLE+E A E +++ EL + ELK + D
Sbjct: 144 EARLEEEVA---EIREEGQELSSKREELKEKLD 173
Score = 38.5 bits (90), Expect = 0.012
Identities = 18/95 (18%), Positives = 42/95 (44%)
Query: 424 NVKKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRI 483
+++ V L E + + + K E ++ V L E+ + L++ L+ +
Sbjct: 56 DLENQVSQLESEIQEIRERIKRAEEKLSAVKDERELRALNIEIQIAKERINSLEDELAEL 115
Query: 484 KDRLEKETAGHLEAKQKVAELEDRISELKHRADFE 518
+ +EK + K+++ LE ++E + R + E
Sbjct: 116 MEEIEKLEKEIEDLKERLERLEKNLAEAEARLEEE 150
Score = 33.5 bits (77), Expect = 0.38
Identities = 18/93 (19%), Positives = 35/93 (37%), Gaps = 5/93 (5%)
Query: 434 KEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAG 493
E I +K + + E L+ L E EL+ + L+ + I++R+++
Sbjct: 22 LEPRIKEIRKALKKAKAELEALNKALEALEIELEDLENQVSQLESEIQEIRERIKR---- 77
Query: 494 HLEAKQKVAELEDRISELKHRADFEQDERKRLE 526
E K + E + L ++ LE
Sbjct: 78 -AEEKLSAVKDERELRALNIEIQIAKERINSLE 109
>gnl|CDD|202427 pfam02841, GBP_C, Guanylate-binding protein, C-terminal domain.
Transcription of the anti-viral guanylate-binding
protein (GBP) is induced by interferon-gamma during
macrophage induction. This family contains GBP1 and
GPB2, both GTPases capable of binding GTP, GDP and GMP.
Length = 297
Score = 41.5 bits (98), Expect = 0.002
Identities = 25/101 (24%), Positives = 50/101 (49%), Gaps = 6/101 (5%)
Query: 431 LLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKE-DLDESLSRIKDRLEK 489
L AKE+ I A + KA+ E E L +K++E +Q + +E E + ++ +++E
Sbjct: 199 LTAKEKAIEAERAKAEAAEAEQ----ELLREKQKEEEQMMEAQERSYQEHVKQLIEKMEA 254
Query: 490 ETAGHLEAKQKVAELEDRISELKHRADFEQDERKRLENLIN 530
E L ++++ E + + E + F+ E + L+ I
Sbjct: 255 EREKLLAEQERMLEHKLQEQEELLKEGFKT-EAESLQKEIQ 294
>gnl|CDD|217933 pfam04156, IncA, IncA protein. Chlamydia trachomatis is an
obligate intracellular bacterium that develops within a
parasitophorous vacuole termed an inclusion. The
inclusion is non-fusogenic with lysosomes but intercepts
lipids from a host cell exocytic pathway. Initiation of
chlamydial development is concurrent with modification
of the inclusion membrane by a set of C.
trachomatis-encoded proteins collectively designated
Incs. One of these Incs, IncA, is functionally
associated with the homotypic fusion of inclusions. This
family probably includes members of the wider Inc family
rather than just IncA.
Length = 186
Score = 40.5 bits (95), Expect = 0.002
Identities = 28/106 (26%), Positives = 52/106 (49%), Gaps = 4/106 (3%)
Query: 430 HLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEK 489
L + E+ K++ EL+ E DL R+A+ E EL+ ++ + L E L +++RLE
Sbjct: 73 KLEELQGELSELKQQLSELQEELEDLEERIAELESELEDLKEDLQLLRELLKSLEERLES 132
Query: 490 ETAGHLEAKQKVAELE----DRISELKHRADFEQDERKRLENLINS 531
E +++ EL + + EL+ + Q+ +RL+ I
Sbjct: 133 LEESIKELAKELRELRQDLREEVEELREELERLQENLQRLQEAIQE 178
Score = 35.5 bits (82), Expect = 0.076
Identities = 26/100 (26%), Positives = 50/100 (50%), Gaps = 1/100 (1%)
Query: 431 LLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKE 490
LL K +K +EL+ E +L +L++ ++EL+ + +L+ L +K+ L+
Sbjct: 60 LLLKAPVQSVRPQKLEELQGELSELKQQLSELQEELEDLEERIAELESELEDLKEDLQLL 119
Query: 491 TAGHLEAKQKVAELEDRISEL-KHRADFEQDERKRLENLI 529
++++ LE+ I EL K + QD R+ +E L
Sbjct: 120 RELLKSLEERLESLEESIKELAKELRELRQDLREEVEELR 159
>gnl|CDD|237178 PRK12705, PRK12705, hypothetical protein; Provisional.
Length = 508
Score = 41.6 bits (98), Expect = 0.002
Identities = 33/105 (31%), Positives = 53/105 (50%), Gaps = 10/105 (9%)
Query: 432 LAKEEEIVAAK-KKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKE 490
L ++EE + A+ +K D LE + + L+ +E EL++ ++ LD L R+ L E
Sbjct: 86 LVQKEEQLDARAEKLDNLENQLEEREKALSARELELEELEKQ---LDNELYRVAG-LTPE 141
Query: 491 TAGHLEAKQKVAELED----RISELKHRADFEQDERKRLENLINS 531
A L K AELE+ R+ +++ AD E ERK L +
Sbjct: 142 QARKLLLKLLDAELEEEKAQRVKKIEEEADLEA-ERKAQNILAQA 185
>gnl|CDD|222828 PHA01732, PHA01732, proline-rich protein.
Length = 94
Score = 38.2 bits (88), Expect = 0.003
Identities = 13/29 (44%), Positives = 16/29 (55%), Gaps = 5/29 (17%)
Query: 553 SSPRKPEAPSSCIPPPPPPASLPPPPPPR 581
+P+ PE P P PPA +PPPPP
Sbjct: 5 RAPKPPEPP-----APLPPAPVPPPPPAP 28
Score = 34.7 bits (79), Expect = 0.039
Identities = 12/28 (42%), Positives = 15/28 (53%)
Query: 560 APSSCIPPPPPPASLPPPPPPRRSSDLP 587
AP PP P P + PPPPP + +P
Sbjct: 6 APKPPEPPAPLPPAPVPPPPPAPPAPVP 33
Score = 34.3 bits (78), Expect = 0.054
Identities = 12/29 (41%), Positives = 12/29 (41%)
Query: 552 PSSPRKPEAPSSCIPPPPPPASLPPPPPP 580
P P P PPPPPA P P P
Sbjct: 7 PKPPEPPAPLPPAPVPPPPPAPPAPVPEP 35
>gnl|CDD|219321 pfam07174, FAP, Fibronectin-attachment protein (FAP). This family
contains bacterial fibronectin-attachment proteins
(FAP). Family members are rich in alanine and proline,
are approximately 300 long, and seem to be restricted to
mycobacteria. These proteins contain a
fibronectin-binding motif that allows mycobacteria to
bind to fibronectin in the extracellular matrix.
Length = 297
Score = 40.6 bits (95), Expect = 0.003
Identities = 20/66 (30%), Positives = 22/66 (33%), Gaps = 3/66 (4%)
Query: 552 PSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPLVRCRQPPRSKLKSRIP 611
P P A + PPPPP PP P D P P P PP + P
Sbjct: 45 PPPPSTAAAAPAPAAPPPPPPPAAPPAPQ---PDDPNAAPPPPPADPNAPPPPPVDPNAP 101
Query: 612 RPYFPN 617
P P
Sbjct: 102 PPPAPE 107
Score = 34.9 bits (80), Expect = 0.21
Identities = 23/85 (27%), Positives = 28/85 (32%), Gaps = 4/85 (4%)
Query: 551 EPSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPLVRCRQPPRSKLKSRI 610
P+ P P PP P P PPP + P P P+ PP + RI
Sbjct: 54 APAPAAPPPPPPPAAPPAPQPDDPNAAPPPPPAD--PNAPPPPPVDPNAPPPPAPEPGRI 111
Query: 611 PRPYFPNVSVQVNGTIWTELDESKL 635
V G W E D + L
Sbjct: 112 DNAVGGFSYVVPAG--WVESDATHL 134
Score = 33.7 bits (77), Expect = 0.51
Identities = 17/66 (25%), Positives = 21/66 (31%)
Query: 552 PSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPLVRCRQPPRSKLKSRIP 611
P++ AP P A P PPP P P PP + + P
Sbjct: 34 PATANADPAPPPPPPSTAAAAPAPAAPPPPPPPAAPPAPQPDDPNAAPPPPPADPNAPPP 93
Query: 612 RPYFPN 617
P PN
Sbjct: 94 PPVDPN 99
>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y. Members of this family are
RNase Y, an endoribonuclease. The member from Bacillus
subtilis, YmdA, has been shown to be involved in
turnover of yitJ riboswitch [Transcription, Degradation
of RNA].
Length = 514
Score = 40.3 bits (95), Expect = 0.005
Identities = 33/104 (31%), Positives = 51/104 (49%), Gaps = 12/104 (11%)
Query: 424 NVKKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRI 483
+KK L AKEE K ELE E + N L + E+ L Q+ + + ESL +
Sbjct: 47 TLKKEALLEAKEE----VHKLRAELERELKERRNELQRLERRLLQREETLDRKMESLDKK 102
Query: 484 KDRLEKETAGHLEAKQK-VAELEDRISELKHRADFEQDERKRLE 526
++ LEK+ L K+K + E E+ + EL ++R+ LE
Sbjct: 103 EENLEKKEK-ELSNKEKNLDEKEEELEEL------IAEQREELE 139
Score = 34.9 bits (81), Expect = 0.22
Identities = 27/90 (30%), Positives = 48/90 (53%), Gaps = 14/90 (15%)
Query: 431 LLAKEE----EIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDR 486
LL +EE ++ + KK + LE + L+ KE+ LD+K +E L+E ++ ++
Sbjct: 85 LLQREETLDRKMESLDKKEENLEKKE----KELSNKEKNLDEKEEE---LEELIAEQREE 137
Query: 487 LEKETA-GHLEAKQKVAELEDRISELKHRA 515
LE+ + EAK+ + LE+ E +H A
Sbjct: 138 LERISGLTQEEAKEIL--LEEVEEEARHEA 165
>gnl|CDD|233045 TIGR00601, rad23, UV excision repair protein Rad23. All proteins
in this family for which functions are known are
components of a multiprotein complex used for targeting
nucleotide excision repair to specific parts of the
genome. In humans, Rad23 complexes with the XPC protein.
This family is based on the phylogenomic analysis of JA
Eisen (1999, Ph.D. Thesis, Stanford University) [DNA
metabolism, DNA replication, recombination, and repair].
Length = 378
Score = 39.9 bits (93), Expect = 0.005
Identities = 18/102 (17%), Positives = 33/102 (32%), Gaps = 22/102 (21%)
Query: 495 LEAKQKVAELEDRISELKHRADFEQDERKRLENLINSGSVPDDTKAVGLSR--------- 545
+E + V EL+++I + + + ++K LI SG + D K V +
Sbjct: 17 MEPDETVKELKEKIEAEQGKDAYPVAQQK----LIYSGKILSDDKTVKEYKIKEKDFVVV 72
Query: 546 ---------SGGREEPSSPRKPEAPSSCIPPPPPPASLPPPP 578
++P P+ P P P
Sbjct: 73 MVSKPKTGTGKVAPPAATPTSAPTPTPSPPASPASGMSAAPA 114
>gnl|CDD|148139 pfam06346, Drf_FH1, Formin Homology Region 1. This region is found
in some of the Diaphanous related formins (Drfs). It
consists of low complexity repeats of around 12
residues.
Length = 160
Score = 38.0 bits (88), Expect = 0.008
Identities = 17/33 (51%), Positives = 17/33 (51%), Gaps = 2/33 (6%)
Query: 547 GGREEPSSPRKPEAPSSCIPPPPPPASLPPPPP 579
G P P P AP IPPPPP PPPPP
Sbjct: 118 GPGVPPPPPPFPGAPG--IPPPPPGMGSPPPPP 148
Score = 38.0 bits (88), Expect = 0.009
Identities = 21/52 (40%), Positives = 24/52 (46%), Gaps = 8/52 (15%)
Query: 547 GGREEPSSPRKPEAPSSCIPPPPP---PASLPPPPPPRRSSDLPYLTCPRPL 595
GG P P P CIPPPPP +PPPPP + +P P PL
Sbjct: 8 GGVGIPPPP--PLPGGVCIPPPPPLPGGTGIPPPPPLPGGAAIP---PPPPL 54
Score = 37.6 bits (87), Expect = 0.011
Identities = 23/64 (35%), Positives = 29/64 (45%), Gaps = 4/64 (6%)
Query: 555 PRKPEAPSSCIPPPPPP----ASLPPPPPPRRSSDLPYLTCPRPLVRCRQPPRSKLKSRI 610
P P + IPPPPPP A++PPPPP +P P P PP + S
Sbjct: 86 PPPPLPGGAGIPPPPPPLPGGAAVPPPPPLPGGPGVPPPPPPFPGAPGIPPPPPGMGSPP 145
Query: 611 PRPY 614
P P+
Sbjct: 146 PPPF 149
Score = 36.5 bits (84), Expect = 0.030
Identities = 16/35 (45%), Positives = 17/35 (48%), Gaps = 2/35 (5%)
Query: 547 GGREEPSSPRKPEAPSSCIPPPPPP--ASLPPPPP 579
GG P P P P PPPP P +PPPPP
Sbjct: 105 GGAAVPPPPPLPGGPGVPPPPPPFPGAPGIPPPPP 139
Score = 35.7 bits (82), Expect = 0.043
Identities = 25/73 (34%), Positives = 30/73 (41%), Gaps = 15/73 (20%)
Query: 547 GGREEPSSPRKPEAPSSCIPPPPP---PASLPPPPPPRRSSDLPYLTCPRPLVRCRQPPR 603
G P P P A + IPPPPP A +PPPPP + +P P P PP
Sbjct: 56 GVAGIPPPPPLPGA--TAIPPPPPLPGAAGIPPPPPLPGGAGIP----PPP------PPL 103
Query: 604 SKLKSRIPRPYFP 616
+ P P P
Sbjct: 104 PGGAAVPPPPPLP 116
Score = 34.9 bits (80), Expect = 0.089
Identities = 25/71 (35%), Positives = 27/71 (38%), Gaps = 17/71 (23%)
Query: 555 PRKPEAPSSCIPPPPPPASLP-----PPPPPRRSSDLPYLTCPRPLVRCRQPPRSKLKSR 609
P P P PPPPP LP PPPPP LP T P PP +
Sbjct: 1 PPPPPLPGGVGIPPPPP--LPGGVCIPPPPP-----LPGGTGIPP-----PPPLPGGAAI 48
Query: 610 IPRPYFPNVSV 620
P P P V+
Sbjct: 49 PPPPPLPGVAG 59
Score = 33.8 bits (77), Expect = 0.22
Identities = 28/81 (34%), Positives = 30/81 (37%), Gaps = 27/81 (33%)
Query: 547 GGREEPSSPRKPEAPSSCIPPPP---------PPASLP-----PPPPPRRSSDLPYLTCP 592
GG P P P + IPPPP PP LP PPPPP LP T
Sbjct: 20 GGVCIPPPPPLPGG--TGIPPPPPLPGGAAIPPPPPLPGVAGIPPPPP-----LPGATAI 72
Query: 593 RPLVRCRQPPRSKLKSRIPRP 613
P PP + IP P
Sbjct: 73 PP------PPPLPGAAGIPPP 87
>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
Length = 880
Score = 39.6 bits (93), Expect = 0.011
Identities = 31/119 (26%), Positives = 57/119 (47%), Gaps = 15/119 (12%)
Query: 421 LKINVKKVVHLLAKEEEIVAAK--------KKADELEMENVDLSNRLAKKEQELDQKTQE 472
L+ + + L+A+ E + K ++A ELE E + A+ E+E ++ +E
Sbjct: 514 LEERREDLEELIAERRETIEEKRERAEELRERAAELEAEAEEKREAAAEAEEEAEEAREE 573
Query: 473 KEDLDESLSRIKDRLEK-ETAGHLEAKQKVAELEDRISEL----KHRADFEQDERKRLE 526
+L+ L+ +K+R+E E L A +A+ ED I L + A+ + R+RL
Sbjct: 574 VAELNSKLAELKERIESLERIRTLLA--AIADAEDEIERLREKREALAELNDERRERLA 630
Score = 35.8 bits (83), Expect = 0.15
Identities = 30/119 (25%), Positives = 55/119 (46%), Gaps = 22/119 (18%)
Query: 421 LKINVKKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESL 480
L+ V++V L + E++V A+ + + LE DL +A++ + +++K + E+L E
Sbjct: 487 LEEEVEEVEERLERAEDLVEAEDRIERLEERREDLEELIAERRETIEEKRERAEELRER- 545
Query: 481 SRIKDRLEKETAGHLEAKQK-----------VAELEDRISELKHRADFEQDERKRLENL 528
LE E EA + VAEL +++ELK + + LE +
Sbjct: 546 ---AAELEAEAEEKREAAAEAEEEAEEAREEVAELNSKLAELK-------ERIESLERI 594
Score = 35.8 bits (83), Expect = 0.17
Identities = 21/91 (23%), Positives = 44/91 (48%)
Query: 436 EEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHL 495
EE+ +++ +ELE E DL + + + +E+L++ ++DRLE+
Sbjct: 279 EEVRDLRERLEELEEERDDLLAEAGLDDADAEAVEARREELEDRDEELRDRLEECRVAAQ 338
Query: 496 EAKQKVAELEDRISELKHRADFEQDERKRLE 526
++ L + +L+ RA+ ++E LE
Sbjct: 339 AHNEEAESLREDADDLEERAEELREEAAELE 369
Score = 34.2 bits (79), Expect = 0.48
Identities = 22/79 (27%), Positives = 37/79 (46%), Gaps = 5/79 (6%)
Query: 449 EMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKET---AGHLEAKQKVAELE 505
E E DL RL E EL + +E E +E + ++ ++ H E ++++ LE
Sbjct: 198 EKEEKDLHERLNGLESELAELDEEIERYEEQREQARETRDEADEVLEEHEERREELETLE 257
Query: 506 DRISELKHRADFEQDERKR 524
I +L R + ER+R
Sbjct: 258 AEIEDL--RETIAETERER 274
Score = 32.7 bits (75), Expect = 1.2
Identities = 31/96 (32%), Positives = 48/96 (50%), Gaps = 10/96 (10%)
Query: 435 EEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGH 494
EE + ++K DEL E DL + E EL+ E E+L E +++R+E A +
Sbjct: 659 EEYLEQVEEKLDELREERDDLQAEIGAVENELE----ELEELRERREALENRVEALEALY 714
Query: 495 LEAKQKVAELEDRISELKHRADFEQDERKRLENLIN 530
EA+ ELE +L RA+ Q + LE ++N
Sbjct: 715 DEAE----ELESMYGDL--RAELRQRNVETLERMLN 744
Score = 31.9 bits (73), Expect = 2.3
Identities = 21/108 (19%), Positives = 54/108 (50%), Gaps = 11/108 (10%)
Query: 433 AKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQ------KTQEKEDLDESLSRIKDR 486
K E A+++A+E E +L+++LA+ ++ ++ D ++ + R++++
Sbjct: 555 EKREAAAEAEEEAEEAREEVAELNSKLAELKERIESLERIRTLLAAIADAEDEIERLREK 614
Query: 487 LEKETAGHLEAKQKVAELEDRISELKHRADFE-----QDERKRLENLI 529
E + E ++++AE +R EL+ D +++++R E +
Sbjct: 615 REALAELNDERRERLAEKRERKRELEAEFDEARIEEAREDKERAEEYL 662
Score = 30.8 bits (70), Expect = 5.1
Identities = 21/87 (24%), Positives = 39/87 (44%), Gaps = 7/87 (8%)
Query: 435 EEEIVAAKKKADELE--MENV-----DLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRL 487
EE +++A ELE +E D + + E+E+++ + D L +D L
Sbjct: 355 EERAEELREEAAELESELEEAREAVEDRREEIEELEEEIEELRERFGDAPVDLGNAEDFL 414
Query: 488 EKETAGHLEAKQKVAELEDRISELKHR 514
E+ E +++ AELE + + R
Sbjct: 415 EELREERDELREREAELEATLRTARER 441
Score = 30.0 bits (68), Expect = 8.5
Identities = 26/100 (26%), Positives = 48/100 (48%), Gaps = 12/100 (12%)
Query: 435 EEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLE--KETA 492
+ E V A++ +ELE + +L +RL + +E E L E +++R E +E A
Sbjct: 308 DAEAVEARR--EELEDRDEELRDRLEECRVAAQAHNEEAESLREDADDLEERAEELREEA 365
Query: 493 GHLE-----AKQKVAELEDRISELKHRADFEQDERKRLEN 527
LE A++ V + + I EL+ + + R+R +
Sbjct: 366 AELESELEEAREAVEDRREEIEELEEEIE---ELRERFGD 402
Score = 30.0 bits (68), Expect = 9.7
Identities = 22/112 (19%), Positives = 46/112 (41%), Gaps = 21/112 (18%)
Query: 432 LAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIK------- 484
L + ++AA A++ ++ K+E + + +E L E R +
Sbjct: 591 LERIRTLLAAIADAED------EIERLREKREALAELNDERRERLAEKRERKRELEAEFD 644
Query: 485 -DRLEKETAGHLEAKQKVAELEDRISELKHRADFEQD-------ERKRLENL 528
R+E+ A++ + ++E+++ EL+ D Q E + LE L
Sbjct: 645 EARIEEAREDKERAEEYLEQVEEKLDELREERDDLQAEIGAVENELEELEEL 696
>gnl|CDD|129694 TIGR00606, rad50, rad50. All proteins in this family for which
functions are known are involvedin recombination,
recombinational repair, and/or non-homologous end
joining.They are components of an exonuclease complex
with MRE11 homologs. This family is distantly related to
the SbcC family of bacterial proteins.This family is
based on the phylogenomic analysis of JA Eisen (1999,
Ph.D. Thesis, Stanford University).
Length = 1311
Score = 39.3 bits (91), Expect = 0.013
Identities = 35/141 (24%), Positives = 62/141 (43%), Gaps = 14/141 (9%)
Query: 405 QLTLQTETGEDYDHIL--LKINVKKVVHLLAK----EEEIVAAKKKADELEMENVDLSNR 458
QL E + Y++ L LK +K++ H L+K + EI A K + ++E +N +L +
Sbjct: 232 QLESSREIVKSYENELDPLKNRLKEIEHNLSKIMKLDNEIKALKSRKKQMEKDNSELELK 291
Query: 459 LAKKEQELDQKTQEKEDL--------DESLSRIKDRLEKETAGHLEAKQKVAELEDRISE 510
+ K Q D++ + + L + LEK Q+ EL
Sbjct: 292 MEKVFQGTDEQLNDLYHNHQRTVREKERELVDCQRELEKLNKERRLLNQEKTELLVEQGR 351
Query: 511 LKHRADFEQDERKRLENLINS 531
L+ +AD Q+ + ++LI S
Sbjct: 352 LQLQADRHQEHIRARDSLIQS 372
Score = 31.2 bits (70), Expect = 3.6
Identities = 34/151 (22%), Positives = 67/151 (44%), Gaps = 20/151 (13%)
Query: 399 LDRLAQQLTLQT-ETGEDYDHILLKINVKKVVHLLAKEEEIVAAKKKADELEMENVDLSN 457
LDR QQ+ + E + D ++ KI + + + + ++E+I K K +EL+ E + +
Sbjct: 820 LDRTVQQVNQEKQEKQHELDTVVSKIELNRKL-IQDQQEQIQHLKSKTNELKSEKLQIGT 878
Query: 458 RLAKKEQELDQKTQEKEDLDESLSRIKDRLEK----------------ETAGHLEAKQKV 501
L +++Q +Q + ++ + IKD E+ E E K
Sbjct: 879 NLQRRQQFEEQLVELSTEVQSLIREIKDAKEQDSPLETFLEKDQQEKEELISSKETSNKK 938
Query: 502 AELEDRISELKHRADFEQDERKRLENLINSG 532
A +D+++++K + K +EN I G
Sbjct: 939 A--QDKVNDIKEKVKNIHGYMKDIENKIQDG 967
Score = 30.4 bits (68), Expect = 7.2
Identities = 15/60 (25%), Positives = 26/60 (43%)
Query: 431 LLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKE 490
K KK+ L+ E DL +L K +QE++Q + KD+++K+
Sbjct: 489 KAEKNSLTETLKKEVKSLQNEKADLDRKLRKLDQEMEQLNHHTTTRTQMEMLTKDKMDKD 548
>gnl|CDD|217469 pfam03276, Gag_spuma, Spumavirus gag protein.
Length = 582
Score = 38.7 bits (90), Expect = 0.015
Identities = 12/30 (40%), Positives = 15/30 (50%)
Query: 551 EPSSPRKPEAPSSCIPPPPPPASLPPPPPP 580
SP +P AP + IP PP P + PP
Sbjct: 226 PGPSPAQPSAPPASIPAPPIPPVIQYVAPP 255
>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
Length = 880
Score = 38.9 bits (91), Expect = 0.016
Identities = 27/97 (27%), Positives = 54/97 (55%), Gaps = 3/97 (3%)
Query: 435 EEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRI--KDRLEKETA 492
EE I +K+ +E+ E ++S+ L + +EL++ +E ++L+E I ++ +
Sbjct: 192 EELIKEKEKELEEVLREINEISSELPELREELEKLEKEVKELEELKEEIEELEKELESLE 251
Query: 493 GHLEA-KQKVAELEDRISELKHRADFEQDERKRLENL 528
G ++K+ ELE+RI ELK + +++ K L+ L
Sbjct: 252 GSKRKLEEKIRELEERIEELKKEIEELEEKVKELKEL 288
Score = 37.0 bits (86), Expect = 0.064
Identities = 29/110 (26%), Positives = 61/110 (55%), Gaps = 9/110 (8%)
Query: 421 LKINVKKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAK-KEQELDQKTQEKEDLDES 479
L+ ++++ +L KE E++ K+ A++L+ +L +L K +EL++K +E E L E
Sbjct: 478 LRKELRELEKVLKKESELIKLKELAEQLK----ELEEKLKKYNLEELEKKAEEYEKLKEK 533
Query: 480 LSRIKDRLEKETAGHLEAKQKVAELEDRISELKHRADFEQDERKRLENLI 529
L +L+ E + +K+ EL+ +++EL+ + D ++E L +
Sbjct: 534 L----IKLKGEIKSLKKELEKLEELKKKLAELEKKLDELEEELAELLKEL 579
Score = 36.2 bits (84), Expect = 0.11
Identities = 28/108 (25%), Positives = 56/108 (51%), Gaps = 11/108 (10%)
Query: 421 LKINVKKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESL 480
L+ VK++ L K EE + + +E E ++ RL++ E+E++ + ++L+E
Sbjct: 278 LEEKVKELKELKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEKE 337
Query: 481 SRIKDRLEKETAGHLEAKQKVAELEDRISELKHRADFEQDERKRLENL 528
R++ E K+K+ ELE R+ EL+ R + ++ + + E L
Sbjct: 338 ERLE-----------ELKKKLKELEKRLEELEERHELYEEAKAKKEEL 374
Score = 33.1 bits (76), Expect = 0.89
Identities = 23/106 (21%), Positives = 50/106 (47%), Gaps = 5/106 (4%)
Query: 426 KKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDL-----DESL 480
+ + L E+E+ +K+ +LE E LA+ E+ L++ +E E+L +E
Sbjct: 602 NEYLELKDAEKELEREEKELKKLEEELDKAFEELAETEKRLEELRKELEELEKKYSEEEY 661
Query: 481 SRIKDRLEKETAGHLEAKQKVAELEDRISELKHRADFEQDERKRLE 526
+++ + + + ++ ELE R E+K + ++E + E
Sbjct: 662 EELREEYLELSRELAGLRAELEELEKRREEIKKTLEKLKEELEERE 707
Score = 33.1 bits (76), Expect = 1.0
Identities = 24/99 (24%), Positives = 52/99 (52%), Gaps = 3/99 (3%)
Query: 435 EEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKT--QEKEDLDESLSRI-KDRLEKET 491
E+E+ ++K +L E +L L K+ + + K ++ ++L+E L + + LEK+
Sbjct: 465 EKELKEIEEKERKLRKELRELEKVLKKESELIKLKELAEQLKELEEKLKKYNLEELEKKA 524
Query: 492 AGHLEAKQKVAELEDRISELKHRADFEQDERKRLENLIN 530
+ + K+K+ +L+ I LK + ++ +K+L L
Sbjct: 525 EEYEKLKEKLIKLKGEIKSLKKELEKLEELKKKLAELEK 563
Score = 32.7 bits (75), Expect = 1.4
Identities = 27/94 (28%), Positives = 47/94 (50%), Gaps = 1/94 (1%)
Query: 435 EEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGH 494
EE I KK+ +ELE E V L +K +E + ++ E+ + L I+ RL +
Sbjct: 265 EERIEELKKEIEELE-EKVKELKELKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEEEI 323
Query: 495 LEAKQKVAELEDRISELKHRADFEQDERKRLENL 528
++++ ELE++ L+ ++ KRLE L
Sbjct: 324 NGIEERIKELEEKEERLEELKKKLKELEKRLEEL 357
Score = 32.3 bits (74), Expect = 1.7
Identities = 29/102 (28%), Positives = 51/102 (50%), Gaps = 7/102 (6%)
Query: 434 KEEEIVAAKKKADELEMENV-DLSNRLAKKEQ------ELDQKTQEKEDLDESLSRIKDR 486
EEE+ K+ +EL E+V +L RL + E EL +E E ++ L ++++
Sbjct: 568 LEEELAELLKELEELGFESVEELEERLKELEPFYNEYLELKDAEKELEREEKELKKLEEE 627
Query: 487 LEKETAGHLEAKQKVAELEDRISELKHRADFEQDERKRLENL 528
L+K E ++++ EL + EL+ + E+ E R E L
Sbjct: 628 LDKAFEELAETEKRLEELRKELEELEKKYSEEEYEELREEYL 669
Score = 31.2 bits (71), Expect = 3.7
Identities = 20/82 (24%), Positives = 40/82 (48%), Gaps = 5/82 (6%)
Query: 434 KEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIK---DRLEKE 490
+E+ +KK E E E L + +EL E E+L++ IK ++L++E
Sbjct: 645 LRKELEELEKKYSEEEYEE--LREEYLELSRELAGLRAELEELEKRREEIKKTLEKLKEE 702
Query: 491 TAGHLEAKQKVAELEDRISELK 512
+AK+++ +LE + ++
Sbjct: 703 LEEREKAKKELEKLEKALERVE 724
Score = 30.0 bits (68), Expect = 7.8
Identities = 18/82 (21%), Positives = 44/82 (53%), Gaps = 1/82 (1%)
Query: 425 VKKVVHLLAKEEEIVAAKKKADELEMENVD-LSNRLAKKEQELDQKTQEKEDLDESLSRI 483
V + ++ ++ E + + + E E++ ++ L + + E+EL+ K L+E + +
Sbjct: 205 VLREINEISSELPELREELEKLEKEVKELEELKEEIEELEKELESLEGSKRKLEEKIREL 264
Query: 484 KDRLEKETAGHLEAKQKVAELE 505
++R+E+ E ++KV EL+
Sbjct: 265 EERIEELKKEIEELEEKVKELK 286
Score = 30.0 bits (68), Expect = 9.3
Identities = 26/92 (28%), Positives = 46/92 (50%), Gaps = 16/92 (17%)
Query: 421 LKINVKKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESL 480
L+ + AK+EE+ KK+ L E + E+EL++ + KE+++E +
Sbjct: 357 LEERHELYEEAKAKKEELERLKKRLTGLTPEKL---------EKELEELEKAKEEIEEEI 407
Query: 481 SRIKDRLEKETAGHLEAKQKVAELEDRISELK 512
S+I R+ E K+++ EL+ I ELK
Sbjct: 408 SKITARIG-------ELKKEIKELKKAIEELK 432
>gnl|CDD|218146 pfam04554, Extensin_2, Extensin-like region.
Length = 57
Score = 34.7 bits (80), Expect = 0.016
Identities = 17/49 (34%), Positives = 18/49 (36%), Gaps = 4/49 (8%)
Query: 566 PPPPPPASLPPPPPPRRSSDLPYLTCPRPLVRCRQPPRSKLKSRIPRPY 614
PPPP PPPP S P P + PP KS P Y
Sbjct: 5 SPPPPVKQYSPPPPYYYKSPPP----PVKSPVYKSPPPPVYKSPPPPKY 49
Score = 32.8 bits (75), Expect = 0.060
Identities = 13/35 (37%), Positives = 13/35 (37%)
Query: 553 SSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLP 587
SP P PPPP PPPP S P
Sbjct: 22 KSPPPPVKSPVYKSPPPPVYKSPPPPKYVYKSPPP 56
Score = 32.4 bits (74), Expect = 0.098
Identities = 19/59 (32%), Positives = 23/59 (38%), Gaps = 6/59 (10%)
Query: 553 SSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPLVRCRQPPRSKLKSRIP 611
SP P PPPP PPPP Y + P P+ + PP+ KS P
Sbjct: 4 KSPPPPVKQY---SPPPPYYYKSPPPPV---KSPVYKSPPPPVYKSPPPPKYVYKSPPP 56
Score = 31.3 bits (71), Expect = 0.24
Identities = 16/51 (31%), Positives = 18/51 (35%), Gaps = 7/51 (13%)
Query: 552 PSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPLVRCRQPP 602
SP P S PP P PPPP +S P P + PP
Sbjct: 12 QYSPPPPYYYKSPPPPVKSPVYKSPPPPVYKSP-------PPPKYVYKSPP 55
>gnl|CDD|191111 pfam04849, HAP1_N, HAP1 N-terminal conserved region. This family
represents an N-terminal conserved region found in
several huntingtin-associated protein 1 (HAP1)
homologues. HAP1 binds to huntingtin in a polyglutamine
repeat-length-dependent manner. However, its possible
role in the pathogenesis of Huntington's disease is
unclear. This family also includes a similar N-terminal
conserved region from hypothetical protein products of
ALS2CR3 genes found in the human juvenile amyotrophic
lateral sclerosis critical region 2q33-2q34.
Length = 307
Score = 38.3 bits (89), Expect = 0.017
Identities = 50/218 (22%), Positives = 85/218 (38%), Gaps = 28/218 (12%)
Query: 321 ETLDRHLDFFEMLRHE--DESELARKY--ETDHIETKSATSMFNILRRKLGHTAAYPHFL 376
E L + D LRHE + EL + Y + E +S+ S + +
Sbjct: 105 EQLGKARDEILQLRHELNLKDELLQFYSDADEESEDESSESTPLRPQESSSSSHGCFQLE 164
Query: 377 SLLHHCILLPLEFGCHPQHWLLLDRLAQQLTLQTETGEDYDHILLKINVKKV----VHLL 432
+L LL + L A L +T T E+ + L+ VK++ +
Sbjct: 165 ALQEKLKLLE-------EENEHLRSEASHLKTETVTYEEKEQQLVNDCVKQLREANDQIA 217
Query: 433 AKEEEIVAAKKKADELEM---ENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEK 489
+ EE+ KK ++LE E L +++ +++ E E+L + L+ KD +
Sbjct: 218 SLSEELA---KKTEDLERQQEEITHLLSQIVDLQKKCKSYALENEELQQHLAAAKDAQRQ 274
Query: 490 ETAGHLEAKQKVAELEDRISELKHRADFEQDERKRLEN 527
A E + K AE + + E Q+E K L N
Sbjct: 275 LQAELQELQDKYAECMEMLHE-------AQEELKNLRN 305
>gnl|CDD|197874 smart00787, Spc7, Spc7 kinetochore protein. This domain is found
in cell division proteins which are required for
kinetochore-spindle association.
Length = 312
Score = 38.1 bits (89), Expect = 0.019
Identities = 36/142 (25%), Positives = 66/142 (46%), Gaps = 14/142 (9%)
Query: 396 WLLLDRLAQQLTLQTET-GEDYDHILLKINVKKVVH------LLAKEEEIVAAKKKADEL 448
LL+ L + L E EDY ++ ++ + + A EEE+ K+ DEL
Sbjct: 139 MKLLEGLKEGLDENLEGLKEDYKLLMKELELLNSIKPKLRDRKDALEEELRQLKQLEDEL 198
Query: 449 EMENVDLSNR----LAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQKVAEL 504
E + +R L K QE+ K ++ E+L+E L ++ ++E T E ++AE
Sbjct: 199 EDCDPTELDRAKEKLKKLLQEIMIKVKKLEELEEELQELESKIEDLTNKKSELNTEIAEA 258
Query: 505 EDRISELKHRADFEQDERKRLE 526
E ++ + + F E ++L+
Sbjct: 259 EKKLEQCRG---FTFKEIEKLK 277
Score = 35.4 bits (82), Expect = 0.14
Identities = 16/85 (18%), Positives = 38/85 (44%), Gaps = 4/85 (4%)
Query: 434 KEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAG 493
E+ AK+K +L E + +L + E+EL + + EDL S + + +
Sbjct: 202 DPTELDRAKEKLKKLLQEIMIKVKKLEELEEELQELESKIEDLTNKKSELNTEIAEAEKK 261
Query: 494 HLEAKQ----KVAELEDRISELKHR 514
+ + ++ +L++++ L+
Sbjct: 262 LEQCRGFTFKEIEKLKEQLKLLQSL 286
>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
Length = 434
Score = 38.2 bits (89), Expect = 0.020
Identities = 18/75 (24%), Positives = 41/75 (54%), Gaps = 2/75 (2%)
Query: 452 NVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQKVAELEDRISEL 511
N DLS K+++ L + ++ +D + L +KD + +AK++ + E+ +
Sbjct: 359 NTDLSG--DKRQELLKEYNKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQ 416
Query: 512 KHRADFEQDERKRLE 526
K +AD ++++R++ E
Sbjct: 417 KEQADEDKEKRQKDE 431
>gnl|CDD|215533 PLN02983, PLN02983, biotin carboxyl carrier protein of acetyl-CoA
carboxylase.
Length = 274
Score = 37.5 bits (87), Expect = 0.029
Identities = 16/47 (34%), Positives = 18/47 (38%), Gaps = 6/47 (12%)
Query: 552 PSSPRKPEAPSSCIPPPPPPASLPPPPPP------RRSSDLPYLTCP 592
P P +P + P P PAS PPP P S P L P
Sbjct: 157 PPHAMPPASPPAAQPAPSAPASSPPPTPASPPPAKAPKSSHPPLKSP 203
Score = 35.2 bits (81), Expect = 0.16
Identities = 15/51 (29%), Positives = 17/51 (33%), Gaps = 3/51 (5%)
Query: 558 PEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPLVRCRQPPRSKLKS 608
P P + P PP A P P P P + PP LKS
Sbjct: 155 PPPPHAMPPASPPAAQPAPSAPASSPPPTPASPPPAKAPKSSHPP---LKS 202
Score = 33.7 bits (77), Expect = 0.44
Identities = 16/58 (27%), Positives = 24/58 (41%), Gaps = 3/58 (5%)
Query: 557 KPEAPSSCI--PPPPPPASLPPPPPPRRSSDLPYLTCPRPLVRCRQPPRSKLKSRIPR 612
+P P+ + PPPP A P PP + + + P P P ++ KS P
Sbjct: 143 QPPPPAPVVMMQPPPPHAMPPASPPAAQPAPSAPASSPPPTPASPPPAKAP-KSSHPP 199
>gnl|CDD|233119 TIGR00763, lon, ATP-dependent protease La. This protein is induced
by heat shock and other stresses in E. coli, B.
subtilis, and other species. The yeast member,
designated PIM1, is located in the mitochondrial matrix,
required for mitochondrial function, and also induced by
heat shock [Protein fate, Degradation of proteins,
peptides, and glycopeptides].
Length = 775
Score = 38.0 bits (89), Expect = 0.030
Identities = 32/97 (32%), Positives = 51/97 (52%), Gaps = 10/97 (10%)
Query: 443 KKADEL---EMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQ 499
KKA EL E+E + L N++ KK +E +KTQ + L E L IK L E + K
Sbjct: 192 KKALELLKKELELLKLQNKITKKVEEKMEKTQREYYLREQLKAIKKELGIEK----DDKD 247
Query: 500 KVAELEDRISELKHRADFE---QDERKRLENLINSGS 533
++ +L++++ ELK + + + E +L L S S
Sbjct: 248 ELEKLKEKLEELKLPEEVKKVIEKELTKLSLLEPSSS 284
>gnl|CDD|218191 pfam04652, DUF605, Vta1 like. Vta1 (VPS20-associated protein 1) is
a positive regulator of Vps4. Vps4 is an ATPase that is
required in the multivesicular body (MVB) sorting
pathway to dissociate the endosomal sorting complex
required for transport (ESCRT). Vta1 promotes correct
assembly of Vps4 and stimulates its ATPase activity
through its conserved Vta1/SBP1/LIP5 region.
Length = 315
Score = 37.4 bits (87), Expect = 0.033
Identities = 25/104 (24%), Positives = 30/104 (28%), Gaps = 10/104 (9%)
Query: 518 EQDERKRLENLINSGSVPDDTKAVGLSRSGGREEPSSPRKPEAPSSCIPPPPPPASLPPP 577
++++ NS + A S S SSP P PS P P S PP
Sbjct: 155 DEEDEDADVATTNSDNSFPGEDADPASASPSDPPSSSPGVPSFPSPPEDPSSPSDSSLPP 214
Query: 578 PPPRRSSDLPYLTCPRPLVRCRQPPRSKLKSRIPRPYFPNVSVQ 621
P SD P P S P P P
Sbjct: 215 APSSFQSDTPP----------PSPESPTNPSPPPGPAAPPPPPV 248
>gnl|CDD|177871 PLN02226, PLN02226, 2-oxoglutarate dehydrogenase E2 component.
Length = 463
Score = 37.8 bits (87), Expect = 0.033
Identities = 28/104 (26%), Positives = 44/104 (42%), Gaps = 23/104 (22%)
Query: 526 ENLINSG-SVPDDTKAVGLSRSGGREEPSSPRKPEAPSSCIPPPPPPASLP--------- 575
E L+ G +V TK +S+S +P + + P + P P PPA
Sbjct: 147 EFLVKEGDTVEPGTKVAIISKSEDAASQVTPSQ-KIPETTDPKPSPPAEDKQKPKVESAP 205
Query: 576 ---------PPPPPRRSSDLPYLTCPRPLVRCRQPPRSKLKSRI 610
PPPP++S+ P L P R R+ P ++L+ R+
Sbjct: 206 VAEKPKAPSSPPPPKQSAKEPQLP---PKERERRVPMTRLRKRV 246
>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
recombination, and repair].
Length = 908
Score = 37.8 bits (88), Expect = 0.034
Identities = 28/150 (18%), Positives = 63/150 (42%), Gaps = 6/150 (4%)
Query: 399 LDRLAQQLTLQTETGEDYDHILLKINVKKVVHLLAKEEEIVAAKKKADELEMENVDLSNR 458
L +L + Q E + + L+ + ++ +E + K + E+E ++
Sbjct: 215 LKKLEEIQEEQEEEELEQEIEALEERLAELEEE---KERLEELKARLLEIESLELEALKI 271
Query: 459 LAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQKVAELEDRISELKH-RADF 517
++ +EL++ +E E+ E L ++ +E+ + + ELE+ + +LK
Sbjct: 272 REEELRELERLLEELEEKIERLEELEREIEELEEELEGLRALLEELEELLEKLKSLEERL 331
Query: 518 EQDERKR--LENLINSGSVPDDTKAVGLSR 545
E+ E K LE+ + + + A L
Sbjct: 332 EKLEEKLEKLESELEELAEEKNELAKLLEE 361
Score = 36.7 bits (85), Expect = 0.073
Identities = 38/155 (24%), Positives = 68/155 (43%), Gaps = 18/155 (11%)
Query: 394 QHWLLLDRLAQQLTLQTETGEDYDHILLKINVKKVVHLLAKEEEIVAAKKKADELE---- 449
Q L+ +L + E E+ LL+I ++ L +EEE+ ++ +ELE
Sbjct: 232 QEIEALEERLAELEEEKERLEELKARLLEIESLELEALKIREEELRELERLLEELEEKIE 291
Query: 450 --MENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQKVAELED- 506
E L ++ + L +E E+L E L +++RLEK + + ++ EL +
Sbjct: 292 RLEELEREIEELEEELEGLRALLEELEELLEKLKSLEERLEKLEEKLEKLESELEELAEE 351
Query: 507 ----------RISELKHRAD-FEQDERKRLENLIN 530
R+ EL+ R + E++ K LE L
Sbjct: 352 KNELAKLLEERLKELEERLEELEKELEKALERLKQ 386
Score = 35.9 bits (83), Expect = 0.15
Identities = 30/141 (21%), Positives = 62/141 (43%), Gaps = 11/141 (7%)
Query: 394 QHWLLLDRLAQQLTLQTETGEDYDHILLKINVKKVVHLLAKEEEIVAAKKKADELEMENV 453
LL+ L + L E + + K+ + EE + A LE
Sbjct: 309 GLRALLEELEELLEKLKSLEERLEKLEEKLEKLES----ELEELAEEKNELAKLLEERLK 364
Query: 454 DLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQ-------KVAELED 506
+L RL + E+EL++ + + L+E++ +K+ L + +A E ++ ++ ELE
Sbjct: 365 ELEERLEELEKELEKALERLKQLEEAIQELKEELAELSAALEEIQEELEELEKELEELER 424
Query: 507 RISELKHRADFEQDERKRLEN 527
+ EL+ +++ +LE+
Sbjct: 425 ELEELEEEIKKLEEQINQLES 445
Score = 32.8 bits (75), Expect = 1.2
Identities = 31/111 (27%), Positives = 56/111 (50%), Gaps = 10/111 (9%)
Query: 421 LKINVKKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESL 480
L+ + + EEI +K+ ELE E ++L ++EL++K ++ E+L E L
Sbjct: 486 LEEELSREKEEAELREEIEELEKELRELEEELIELLELEEALKEELEEKLEKLENLLEEL 545
Query: 481 SRIKDRLEKETAGHLEAKQKVAELEDRISELKHRADFEQDERKRLENLINS 531
+K++L+ + + K+++ +LEDR+ ELK E E RL
Sbjct: 546 EELKEKLQLQ-----QLKEELRQLEDRLQELK-----ELLEELRLLRTRKE 586
Score = 32.4 bits (74), Expect = 1.5
Identities = 24/106 (22%), Positives = 48/106 (45%), Gaps = 5/106 (4%)
Query: 430 HLLAKEEEIVAAKKKADELEMENVDLSNRLAKKE-----QELDQKTQEKEDLDESLSRIK 484
L ++EE+ +++ EL+ + +L RL++ E EL + E E+ +E L
Sbjct: 580 LLRTRKEELEELRERLKELKKKLKELEERLSQLEELLQSLELSEAENELEEAEEELESEL 639
Query: 485 DRLEKETAGHLEAKQKVAELEDRISELKHRADFEQDERKRLENLIN 530
++L + + + ELE+++ EL+ E + E L
Sbjct: 640 EKLNLQAELEELLQAALEELEEKVEELEAEIRRELQRIENEEQLEE 685
Score = 32.0 bits (73), Expect = 2.1
Identities = 29/105 (27%), Positives = 51/105 (48%), Gaps = 14/105 (13%)
Query: 431 LLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKE 490
L A EEE+ KK + E + L ++ + L+++ E E+ E L +K RL +
Sbjct: 205 LEALEEELKELKKLEEIQEEQEE---EELEQEIEALEERLAELEEEKERLEELKARLLEI 261
Query: 491 TAGHLEA----KQKVAELEDRISELKHRADFEQDERKRLENLINS 531
+ LEA ++++ ELE + EL +++ +RLE L
Sbjct: 262 ESLELEALKIREEELRELERLLEEL-------EEKIERLEELERE 299
Score = 31.7 bits (72), Expect = 3.1
Identities = 29/126 (23%), Positives = 56/126 (44%), Gaps = 15/126 (11%)
Query: 421 LKINVKKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDE-- 478
L+ +++ L + E + A ++ +EL + L RL K E++L++ E E+L E
Sbjct: 293 LEELEREIEELEEELEGLRALLEELEELLEKLKSLEERLEKLEEKLEKLESELEELAEEK 352
Query: 479 ------------SLSRIKDRLEKETAGHLEAKQKVAE-LEDRISELKHRADFEQDERKRL 525
L + LEKE LE +++ E +++ EL + ++ ++ L
Sbjct: 353 NELAKLLEERLKELEERLEELEKELEKALERLKQLEEAIQELKEELAELSAALEEIQEEL 412
Query: 526 ENLINS 531
E L
Sbjct: 413 EELEKE 418
Score = 31.3 bits (71), Expect = 3.9
Identities = 29/115 (25%), Positives = 53/115 (46%), Gaps = 16/115 (13%)
Query: 398 LLDRLAQQLTLQTETGEDYDHILLKINVKKVVHLLAKEEEIVAAKKKADELEMENVDLSN 457
L++L Q L E+ L+ +KV L A+ + + ++LE +
Sbjct: 638 ELEKLNLQAEL-----EELLQAALEELEEKVEELEAEIRRELQRIENEEQLE----EKLE 688
Query: 458 RLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQKVAELEDRISELK 512
L + E+EL+Q +E E+L + L I+ +E+ + + AELE+ EL+
Sbjct: 689 ELEQLEEELEQLREELEELLKKLGEIEQLIEE-------LESRKAELEELKKELE 736
Score = 30.1 bits (68), Expect = 8.5
Identities = 28/165 (16%), Positives = 65/165 (39%), Gaps = 32/165 (19%)
Query: 398 LLDRLAQQLTLQTETGEDYDHIL----------LKINVKKVVHLLAKEEEIVAAKKKADE 447
L + + L E ++ + L L+ + + +EE+ +E
Sbjct: 348 LAEEKNELAKLLEERLKELEERLEELEKELEKALERLKQLEEAIQELKEELAELSAALEE 407
Query: 448 LEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEK--ETAGHLEA-------- 497
++ E +L L + E+EL++ +E + L+E +++++ + E AG E
Sbjct: 408 IQEELEELEKELEELERELEELEEEIKKLEEQINQLESKELMIAELAGAGEKCPVCGQEL 467
Query: 498 ------------KQKVAELEDRISELKHRADFEQDERKRLENLIN 530
+ ++ ELE+ +S K A+ ++ + + L
Sbjct: 468 PEEHEKELLELYELELEELEEELSREKEEAELREEIEELEKELRE 512
>gnl|CDD|181619 PRK09039, PRK09039, hypothetical protein; Validated.
Length = 343
Score = 37.3 bits (87), Expect = 0.034
Identities = 18/79 (22%), Positives = 36/79 (45%), Gaps = 14/79 (17%)
Query: 455 LSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQKVAELEDRISELKHR 514
LS ++ K+ LD+ + +L + LS LE++ +L+D ++ L+
Sbjct: 44 LSREISGKDSALDRLNSQIAELADLLS-----LERQGN---------QDLQDSVANLRAS 89
Query: 515 ADFEQDERKRLENLINSGS 533
+ ER RL+ L+ +
Sbjct: 90 LSAAEAERSRLQALLAELA 108
>gnl|CDD|152960 pfam12526, DUF3729, Protein of unknown function (DUF3729). This
family of proteins is found in viruses. Proteins in this
family are typically between 145 and 1707 amino acids in
length. The family is found in association with
pfam01443, pfam01661, pfam05417, pfam01660, pfam00978.
There is a single completely conserved residue L that
may be functionally important.
Length = 115
Score = 35.0 bits (81), Expect = 0.036
Identities = 10/31 (32%), Positives = 12/31 (38%)
Query: 552 PSSPRKPEAPSSCIPPPPPPASLPPPPPPRR 582
P P S + PP P P PPP +
Sbjct: 77 EPPVPGPAGPPSPLAPPAPARKPPLPPPRPQ 107
Score = 32.3 bits (74), Expect = 0.35
Identities = 18/68 (26%), Positives = 22/68 (32%), Gaps = 12/68 (17%)
Query: 535 PDDTKAVGLSRSGGREEPSSPRKPEAPSSCIPPPPPPA-----SLPPPPPPRRSSDLPYL 589
PDD VG R + P P S + PPP+ P PP + P
Sbjct: 37 PDDPPPVGDPRPPVVDTPP-------PVSAVWVLPPPSEPAAPPPDPEPPVPGPAGPPSP 89
Query: 590 TCPRPLVR 597
P R
Sbjct: 90 LAPPAPAR 97
Score = 31.2 bits (71), Expect = 0.94
Identities = 15/36 (41%), Positives = 16/36 (44%), Gaps = 4/36 (11%)
Query: 551 EPSSPRKPEAPSSCIPP----PPPPASLPPPPPPRR 582
P +P P PP PP PA PP PPPR
Sbjct: 71 APPPDPEPPVPGPAGPPSPLAPPAPARKPPLPPPRP 106
Score = 30.0 bits (68), Expect = 2.4
Identities = 15/33 (45%), Positives = 15/33 (45%), Gaps = 1/33 (3%)
Query: 551 EPSSPRKPEAPSSCIPPPPP-PASLPPPPPPRR 582
EP P PS PP P LPPP P RR
Sbjct: 77 EPPVPGPAGPPSPLAPPAPARKPPLPPPRPQRR 109
Score = 29.6 bits (67), Expect = 2.7
Identities = 16/52 (30%), Positives = 17/52 (32%), Gaps = 10/52 (19%)
Query: 562 SSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPLVRCRQPPRSKLKSRIPRP 613
SSC PP PPP PRP V PP S + P
Sbjct: 26 SSCFSPPESAHPDDPPPVG----------DPRPPVVDTPPPVSAVWVLPPPS 67
Score = 29.3 bits (66), Expect = 4.1
Identities = 13/38 (34%), Positives = 13/38 (34%), Gaps = 6/38 (15%)
Query: 543 LSRSGGREEPSSPRKPEAPSSCIPPPPPPASLPPPPPP 580
S SG S P S P PPP P PP
Sbjct: 20 WSTSGFSSCFS------PPESAHPDDPPPVGDPRPPVV 51
>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
unknown].
Length = 652
Score = 37.8 bits (88), Expect = 0.036
Identities = 19/99 (19%), Positives = 44/99 (44%), Gaps = 11/99 (11%)
Query: 433 AKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETA 492
E+ I ++ + LE EN +L ++EL++ +E E L+ L R + + +
Sbjct: 419 VYEKRIKKLEETVERLEEENSEL-------KRELEELKREIEKLESELERFRREVRDK-- 469
Query: 493 GHLEAKQKVAELEDRISELKHRADFEQDERKRLENLINS 531
+ + ++ + RI L+ + ++ + LE +
Sbjct: 470 -VRKDR-EIRARDRRIERLEKELEEKKKRVEELERKLAE 506
Score = 36.6 bits (85), Expect = 0.083
Identities = 26/78 (33%), Positives = 45/78 (57%), Gaps = 4/78 (5%)
Query: 435 EEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGH 494
EEE K++ +EL+ E L + L + +E+ K ++ ++ RI +RLEKE
Sbjct: 435 EEENSELKRELEELKREIEKLESELERFRREVRDKVRKDREIRARDRRI-ERLEKELE-- 491
Query: 495 LEAKQKVAELEDRISELK 512
E K++V ELE +++EL+
Sbjct: 492 -EKKKRVEELERKLAELR 508
Score = 30.4 bits (69), Expect = 6.3
Identities = 17/130 (13%), Positives = 46/130 (35%), Gaps = 14/130 (10%)
Query: 399 LDRLAQQLTLQTETGEDYDHILLKINVKKVVHLLAKEEEIVAAKKKADELEMENVDLSNR 458
++R +L + + ++ + + + + +EE + +E E R
Sbjct: 364 VERKLPELGIWKDVERIKALVIRGYPLAEALSKVKEEERPREKEGTEEEERREITVYEKR 423
Query: 459 LAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQKVAELEDRISELKHRADFE 518
+ K E+ +++ +E +L L +K +EK LE + + +
Sbjct: 424 IKKLEETVERLEEENSELKRELEELKREIEK--------------LESELERFRREVRDK 469
Query: 519 QDERKRLENL 528
+ + +
Sbjct: 470 VRKDREIRAR 479
>gnl|CDD|221526 pfam12316, Dsh_C, Segment polarity protein dishevelled (Dsh) C
terminal. This domain family is found in eukaryotes,
and is typically between 177 and 207 amino acids in
length. The family is found in association with
pfam00778, pfam02377, pfam00610, pfam00595. The segment
polarity gene dishevelled (dsh) is required for pattern
formation of the embryonic segments. It is involved in
the determination of body organisation through the
Wingless pathway (analogous to the Wnt-1 pathway).
Length = 202
Score = 36.5 bits (84), Expect = 0.038
Identities = 28/91 (30%), Positives = 32/91 (35%), Gaps = 22/91 (24%)
Query: 508 ISELKHRADFEQDERKRLENLINSGSVPDDTKAVGLSRSGGREEPSS-----PRKPEAPS 562
E AD + RK SGS + T G RSGGR PS P + S
Sbjct: 83 DGERSRAADGREGGRKSG----GSGSESEHTSRSGSRRSGGRRAPSERSGPPPSEGSVRS 138
Query: 563 SC-------------IPPPPPPASLPPPPPP 580
S +PPP P L PPP
Sbjct: 139 SLSHPSSHSSYGAPGVPPPYNPPMLMMKPPP 169
>gnl|CDD|221408 pfam12072, DUF3552, Domain of unknown function (DUF3552). This
presumed domain is functionally uncharacterized. This
domain is found in bacteria, archaea and eukaryotes.
This domain is about 200 amino acids in length. This
domain is found associated with pfam00013, pfam01966.
This domain has a single completely conserved residue A
that may be functionally important.
Length = 201
Score = 36.4 bits (85), Expect = 0.038
Identities = 32/107 (29%), Positives = 47/107 (43%), Gaps = 14/107 (13%)
Query: 426 KKVVHLLAKEEEIVAAKKKADELEMENVD-------LSNRLAKKEQELDQKTQEKEDLDE 478
KK L AKEE K E E E + RL +KE+ LD+K + E +E
Sbjct: 51 KKEALLEAKEE----IHKLRAEAERELKERRNELQRQEKRLLQKEETLDRKDESLEKKEE 106
Query: 479 SLSRIKDRLEKETAGHLEAKQKVAE--LEDRISELKHRADFEQDERK 523
SL + L LE K++ E +E++ EL+ + +E K
Sbjct: 107 SLEEKEKELAARQQ-QLEEKEEELEELIEEQQQELERISGLTAEEAK 152
>gnl|CDD|234818 PRK00708, PRK00708, sec-independent translocase; Provisional.
Length = 209
Score = 36.3 bits (84), Expect = 0.043
Identities = 15/77 (19%), Positives = 28/77 (36%), Gaps = 5/77 (6%)
Query: 542 GLSRSGGREEPSSPRKPEAPSSCIPP-----PPPPASLPPPPPPRRSSDLPYLTCPRPLV 596
L ++ EP++ KP ++ + P PP +P P P ++ P+
Sbjct: 95 DLQKATSMSEPATENKPAEVTTPVEPMGLPETPPAVPVPAPAPAVAAAAAQAAAAPKAPA 154
Query: 597 RCRQPPRSKLKSRIPRP 613
+ R P+P
Sbjct: 155 KPRAKSPRPAAKAAPKP 171
>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family. Atrophin-1 is the
protein product of the dentatorubral-pallidoluysian
atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
neurodegenerative disorder. It is caused by the
expansion of a CAG repeat in the DRPLA gene on
chromosome 12p. This results in an extended
polyglutamine region in atrophin-1, that is thought to
confer toxicity to the protein, possibly through
altering its interactions with other proteins. The
expansion of a CAG repeat is also the underlying defect
in six other neurodegenerative disorders, including
Huntington's disease. One interaction of expanded
polyglutamine repeats that is thought to be pathogenic
is that with the short glutamine repeat in the
transcriptional coactivator CREB binding protein, CBP.
This interaction draws CBP away from its usual nuclear
location to the expanded polyglutamine repeat protein
aggregates that are characteristic of the polyglutamine
neurodegenerative disorders. This interferes with
CBP-mediated transcription and causes cytotoxicity.
Length = 979
Score = 37.4 bits (86), Expect = 0.053
Identities = 13/51 (25%), Positives = 15/51 (29%)
Query: 552 PSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPLVRCRQPP 602
P + PPP P S P PP S P + C P
Sbjct: 453 AQHPFTSGGLPAIGPPPSLPTSTPAAPPRASSGSQPPGSALPSSGGCAGPG 503
Score = 35.4 bits (81), Expect = 0.20
Identities = 41/168 (24%), Positives = 62/168 (36%), Gaps = 19/168 (11%)
Query: 460 AKKEQELDQKTQEKEDLDESLSRIKD--RLEKETAGHLEAKQKVAELEDRISELKHRADF 517
A +E ++ T +K E LSR E E G E+ + E+ S+ K D
Sbjct: 88 ASDTEEPERVTAKKSKTQE-LSRPNSPSEGEGEGEGEGESSDSRSVNEEGSSDPK---DI 143
Query: 518 EQDERKRLENLINSGSVPDDTKAVGLSRSGGREE-------PSSPRKPEAPSSCIPPPPP 570
+QD R +S S+P S S +++ PS P A + PPP
Sbjct: 144 DQDNRS------SSPSIPSPQDNESDSDSSAQQQLLQPQGPPSIQVPPGAALAPSAPPPT 197
Query: 571 PASLPPPPPPRRSSDLPYLTCPRPLVRCRQPPRSKLKSRIPRPYFPNV 618
P++ PP + P +P S R+P P+ P
Sbjct: 198 PSAQAVPPQGSPIAAQPAPQPQQPSPLSLISAPSLHPQRLPSPHPPLQ 245
Score = 33.9 bits (77), Expect = 0.59
Identities = 20/67 (29%), Positives = 24/67 (35%), Gaps = 12/67 (17%)
Query: 533 SVPDDTKAVGLSRSGGREEPSSPRKPEAPSSCIPPPPPPASL------------PPPPPP 580
S+P T A S G + P S + PP PP + PPPPP
Sbjct: 470 SLPTSTPAAPPRASSGSQPPGSALPSSGGCAGPGPPLPPIQIKEEPLDEAEEPESPPPPP 529
Query: 581 RRSSDLP 587
R S P
Sbjct: 530 RSPSPEP 536
>gnl|CDD|206034 pfam13863, DUF4200, Domain of unknown function (DUF4200). This
family is found in eukaryotes. It is a coiled-coil
domain of unknwon function.
Length = 126
Score = 34.9 bits (81), Expect = 0.062
Identities = 20/101 (19%), Positives = 44/101 (43%), Gaps = 11/101 (10%)
Query: 435 EEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKE----DLDESLSRIKDRLEKE 490
+ + A +++ + E L KKE+EL + + + + + R + + E+E
Sbjct: 13 QLALDAKREEFERREELLKQREEELEKKEEELQESLIKFDKFLKENEAKRRRAEKKAEEE 72
Query: 491 TAGHLEAKQKVAELEDRISELKHRADFEQDERKRLENLINS 531
E ++++ EL+ + ELK E ++LE +
Sbjct: 73 KKLRKEKEEEIKELKAELEELKA-------EIEKLEEKLEE 106
>gnl|CDD|144972 pfam01576, Myosin_tail_1, Myosin tail. The myosin molecule is a
multi-subunit complex made up of two heavy chains and
four light chains it is a fundamental contractile
protein found in all eukaryote cell types. This family
consists of the coiled-coil myosin heavy chain tail
region. The coiled-coil is composed of the tail from two
molecules of myosin. These can then assemble into the
macromolecular thick filament. The coiled-coil region
provides the structural backbone the thick filament.
Length = 859
Score = 36.9 bits (86), Expect = 0.065
Identities = 26/80 (32%), Positives = 40/80 (50%), Gaps = 14/80 (17%)
Query: 447 ELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQKVAELED 506
ELE + +L N+L +KE EL Q + +LE E A + ++K+ ELE
Sbjct: 1 ELERQKRELENQLYRKESELSQ--------------LSSKLEDEQALVAQLQKKIKELEA 46
Query: 507 RISELKHRADFEQDERKRLE 526
RI EL+ + E+ R + E
Sbjct: 47 RIRELEEELEAERAARAKAE 66
Score = 32.7 bits (75), Expect = 1.3
Identities = 21/91 (23%), Positives = 46/91 (50%), Gaps = 3/91 (3%)
Query: 437 EIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLE 496
++ + K + L+ EN DL+ +L + E ++ ++ K L+ L K LE+E+
Sbjct: 205 DLTSQKSR---LQSENSDLTRQLEEAEAQVSNLSKLKSQLESQLEEAKRSLEEESRERAN 261
Query: 497 AKQKVAELEDRISELKHRADFEQDERKRLEN 527
+ ++ +LE + L+ + + E + + LE
Sbjct: 262 LQAQLRQLEHDLDSLREQLEEESEAKAELER 292
Score = 31.6 bits (72), Expect = 3.0
Identities = 38/137 (27%), Positives = 61/137 (44%), Gaps = 22/137 (16%)
Query: 399 LDRLAQQLTLQTETGEDYDHILLKINVKKVVHLLAKEEEIVAAKKKADELEMENVDLSNR 458
LD L +QL ++E + + L K N + + +K E A + A+ELE +L +
Sbjct: 273 LDSLREQLEEESEAKAELERQLSKANAE-IQQWRSKFESEGALR--AEELE----ELKKK 325
Query: 459 LAKKEQELDQKTQEKEDLDESLSRIKDRLEKE-------------TAGHLEAKQKVAELE 505
L +K EL++ + +SL + K RL+ E A LE KQK +
Sbjct: 326 LNQKISELEEAAEAANAKCDSLEKTKSRLQSELEDLQIELERANAAASELEKKQK--NFD 383
Query: 506 DRISELKHRADFEQDER 522
++E K + D Q E
Sbjct: 384 KILAEWKRKVDELQAEL 400
Score = 30.0 bits (68), Expect = 9.5
Identities = 26/94 (27%), Positives = 46/94 (48%), Gaps = 5/94 (5%)
Query: 436 EEIVAAKKKADELEMENVDLSNRLA---KKEQELDQKTQE-KEDLDESLSRIKDRLEK-E 490
+E V K A+E + + RLA ++EQE Q + ++ L+ + ++ RL++ E
Sbjct: 675 DEAVNELKAAEERAKKAQADAARLAEELRQEQEHSQHLERLRKQLESQVKELQVRLDEAE 734
Query: 491 TAGHLEAKQKVAELEDRISELKHRADFEQDERKR 524
A K+ + +LE R+ EL+ D EQ
Sbjct: 735 AAALKGGKKMIQKLEARVRELEAELDGEQRRHAE 768
>gnl|CDD|204194 pfam09304, Cortex-I_coil, Cortexillin I, coiled coil. Members of
this family are predominantly found in the
actin-bundling protein Cortexillin I from Dictyostelium
discoideum. They adopt a structure consisting of an
18-heptad-repeat alpha-helical coiled-coil, and are a
prerequisite for the assembly of Cortexillin I.
Length = 107
Score = 34.2 bits (78), Expect = 0.081
Identities = 23/85 (27%), Positives = 44/85 (51%)
Query: 442 KKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQKV 501
K++ LE +L+N+LA E L+ + +E+L + +++ L A E +Q++
Sbjct: 1 KEEKARLEASKNELANKLAGLENSLESEKASQEELAKQKDQLRSLLASLEAQKAEREQRL 60
Query: 502 AELEDRISELKHRADFEQDERKRLE 526
EL+ +I EL + E+ + LE
Sbjct: 61 RELQAKIDELLKNLELEKLAKMELE 85
>gnl|CDD|165528 PHA03270, PHA03270, envelope glycoprotein C; Provisional.
Length = 466
Score = 36.5 bits (84), Expect = 0.081
Identities = 24/76 (31%), Positives = 31/76 (40%), Gaps = 2/76 (2%)
Query: 538 TKAVGLSRSGGREEPSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPLVR 597
+ A SG PR P+ P PAS P PP+ P+ C RP V
Sbjct: 31 SNASEAPTSGSPGSAEGPRTTPTPTRGKGTPTGPASPPKSGPPKSPPA-PF-RCKRPDVL 88
Query: 598 CRQPPRSKLKSRIPRP 613
R R +++ R PRP
Sbjct: 89 ARYGSRVQIRCRFPRP 104
Score = 33.4 bits (76), Expect = 0.72
Identities = 26/108 (24%), Positives = 40/108 (37%), Gaps = 8/108 (7%)
Query: 542 GLSRSGGREEPSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPLVRCRQP 601
+S + SP E P + P + P P +S P + P P RC++P
Sbjct: 29 AVSNASEAPTSGSPGSAEGPRTTPTPTRGKGTPTGPASPPKSG--PPKSPPAP-FRCKRP 85
Query: 602 -PRSKLKSRIP-RPYFPNVSVQVNGTIW---TELDESKLYASMELEVV 644
++ SR+ R FP + + IW E A +E V
Sbjct: 86 DVLARYGSRVQIRCRFPRPTAEFRLQIWRVAAAPSEEIRPAPIEPGAV 133
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 36.7 bits (84), Expect = 0.087
Identities = 33/94 (35%), Positives = 49/94 (52%), Gaps = 10/94 (10%)
Query: 426 KKVVHLLAKEEEIVAAKKKADEL---EMENVDLSNRLAKKEQELDQKTQEKEDLDESLSR 482
KKV L KE E KKKA+EL E EN + AKK +E +K +E + +E +
Sbjct: 1633 KKVEQLKKKEAE---EKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKK 1689
Query: 483 IKDRLEKETAGHLEAKQKVAELEDRISELKHRAD 516
+ L+KE E +K EL+ + +E K +A+
Sbjct: 1690 AAEALKKEA----EEAKKAEELKKKEAEEKKKAE 1719
Score = 33.6 bits (76), Expect = 0.78
Identities = 25/93 (26%), Positives = 36/93 (38%), Gaps = 13/93 (13%)
Query: 425 VKKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIK 484
KK K EE KKADE + + + KK +K +E + E
Sbjct: 1301 KKKADEAKKKAEE----AKKADEAKKK----AEEAKKKADAAKKKAEEAKKAAE----AA 1348
Query: 485 DRLEKETAGHLEAKQKVAEL-EDRISELKHRAD 516
+ A EA ++ AE E + E K +AD
Sbjct: 1349 KAEAEAAADEAEAAEEKAEAAEKKKEEAKKKAD 1381
Score = 33.2 bits (75), Expect = 1.0
Identities = 31/103 (30%), Positives = 44/103 (42%), Gaps = 10/103 (9%)
Query: 426 KKVVHLLAKEEEIVAAKKKADELE--MENVDLSNRLAKKEQELDQKTQEKEDLDESLSRI 483
KK K EE AKKKAD + E + AK E E E + +
Sbjct: 1315 KKADEAKKKAEE---AKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEK 1371
Query: 484 KDRLEKETAGHLEAKQKVAELEDRISELKHRADFEQDERKRLE 526
K K+ A +A +K AE + + E K +A E+D++K E
Sbjct: 1372 KKEEAKKKA---DAAKKKAEEKKKADEAKKKA--EEDKKKADE 1409
Score = 32.4 bits (73), Expect = 1.9
Identities = 32/112 (28%), Positives = 50/112 (44%), Gaps = 10/112 (8%)
Query: 425 VKKVVHLLAKEEEIVAA---KKKADEL-EMENVDLSNRLAKKEQELDQKTQEKEDLDESL 480
KK K EE A KKKA+E + E AKK E +K +E + DE+
Sbjct: 1430 KKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAK 1489
Query: 481 SRIKDRLEK--ETAGHLEAKQKVAELEDRISELKHRAD--FEQDERKRLENL 528
+ ++ +K E EAK+K E + +E +AD + +E K+ +
Sbjct: 1490 KKAEEAKKKADEAKKAAEAKKKADEA--KKAEEAKKADEAKKAEEAKKADEA 1539
Score = 32.0 bits (72), Expect = 2.3
Identities = 27/98 (27%), Positives = 42/98 (42%), Gaps = 6/98 (6%)
Query: 433 AKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETA 492
A E E K +A E + E AKK+ E +K E + E + D L+K A
Sbjct: 1356 ADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAA 1415
Query: 493 GHLEAKQ--KVAELEDRISELKHRADFEQDERKRLENL 528
+A + K AE + + E K +A +E K+ +
Sbjct: 1416 AKKKADEAKKKAEEKKKADEAKKKA----EEAKKADEA 1449
Score = 31.6 bits (71), Expect = 2.8
Identities = 27/92 (29%), Positives = 41/92 (44%), Gaps = 8/92 (8%)
Query: 425 VKKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIK 484
KK K EE KKKADEL+ KK E +K +EK+ DE+ + +
Sbjct: 1390 KKKADEAKKKAEE---DKKKADELKKAAAA-----KKKADEAKKKAEEKKKADEAKKKAE 1441
Query: 485 DRLEKETAGHLEAKQKVAELEDRISELKHRAD 516
+ + + A + K AE + +E +AD
Sbjct: 1442 EAKKADEAKKKAEEAKKAEEAKKKAEEAKKAD 1473
Score = 30.9 bits (69), Expect = 5.4
Identities = 27/103 (26%), Positives = 44/103 (42%), Gaps = 13/103 (12%)
Query: 424 NVKKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRI 483
N K K EE KKKA+E + D K + L ++ +E + +E +
Sbjct: 1659 NKIKAAEEAKKAEE---DKKKAEEAKKAEEDEK----KAAEALKKEAEEAKKAEE----L 1707
Query: 484 KDRLEKETAGHLEAKQKVAELEDRISELKHRADFEQDERKRLE 526
K + +E E K+ E + + E K A E+D++K E
Sbjct: 1708 KKKEAEEKKKAEELKKAEEENKIKAEEAKKEA--EEDKKKAEE 1748
Score = 30.1 bits (67), Expect = 9.3
Identities = 31/120 (25%), Positives = 52/120 (43%), Gaps = 11/120 (9%)
Query: 426 KKVVHLLAKEEEIVAAKKKADEL-EMENVDLSNRLAKKEQ----ELDQKTQEKEDLDESL 480
KK K E AKKKADE + E ++ K E+ + +K +EK+ DE L
Sbjct: 1496 KKKADEAKKAAE---AKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADE-L 1551
Query: 481 SRIKDRLEKETAGHLEAKQKVAELEDRISELKHRADFEQDERKRLENLINSGSVPDDTKA 540
+ ++ + E E +K ED+ L+ + ++ E R+E ++ KA
Sbjct: 1552 KKAEELKKAEEKKKAEEAKK--AEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKA 1609
>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
protein; Reviewed.
Length = 782
Score = 36.3 bits (85), Expect = 0.087
Identities = 28/118 (23%), Positives = 49/118 (41%), Gaps = 26/118 (22%)
Query: 436 EEIVAAKKK---ADELEMENV-----DLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRL 487
E I+ KK D+ ++ + +L L +K +E + +E E L E L K++L
Sbjct: 501 ENIIEEAKKLIGEDKEKLNELIASLEELERELEQKAEEAEALLKEAEKLKEELEEKKEKL 560
Query: 488 EKETAGHL-EAKQKV--------AELEDRISELKHRADFE---------QDERKRLEN 527
++E L EA+++ E ++ I EL+ + RKRL
Sbjct: 561 QEEEDKLLEEAEKEAQQAIKEAKKEADEIIKELRQLQKGGYASVKAHELIEARKRLNK 618
>gnl|CDD|184918 PRK14954, PRK14954, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 620
Score = 36.5 bits (84), Expect = 0.088
Identities = 35/183 (19%), Positives = 56/183 (30%), Gaps = 22/183 (12%)
Query: 476 LDESLSRIKDRLEKETAGHLEAK--QKVAELEDRISELKHRADFEQDERKRLENLINSGS 533
L E +K+R ++ G A Q L ELK + FE R L L
Sbjct: 316 LIERPESVKERYQQSAGGLTPAAVMQMTDFLMKTQGELKFQ--FEYQFRFELALL----- 368
Query: 534 VPDDTKAVGLSRSGGREEPSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPR 593
+ + L R+ G PS P+ P P P P + R
Sbjct: 369 -----RLIELVRNDGGVAPSPAGSPDVKKKAPEPDLPQPDRHPGPAKPEAPGA------R 417
Query: 594 PLVRCRQPPRSKLKSRIPRPYFPNVSVQVNGTIWTELDESKLYASMELEVVDKLFCAYQK 653
P P + + P P + + + + + ++L F + +
Sbjct: 418 PAELP-SPASAPTPEQQP-PVARSAPLPPSPQASAPRNVASGKPGVDLGSWQGKFMNFTR 475
Query: 654 NGV 656
NG
Sbjct: 476 NGS 478
>gnl|CDD|235316 PRK04863, mukB, cell division protein MukB; Provisional.
Length = 1486
Score = 36.5 bits (85), Expect = 0.10
Identities = 22/82 (26%), Positives = 36/82 (43%), Gaps = 10/82 (12%)
Query: 451 ENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAK-------QKVAE 503
+N DL+ +L Q L+Q QE+ E L + + +L + K Q + E
Sbjct: 982 KNSDLNEKL---RQRLEQAEQERTRAREQLRQAQAQLAQYNQVLASLKSSYDAKRQMLQE 1038
Query: 504 LEDRISELKHRADFEQDERKRL 525
L+ + +L AD +ER R
Sbjct: 1039 LKQELQDLGVPADSGAEERARA 1060
>gnl|CDD|148844 pfam07469, DUF1518, Domain of unknown function (DUF1518). This
domain, which is usually found tandemly repeated, is
found various receptor co-activating proteins.
Length = 56
Score = 32.1 bits (73), Expect = 0.11
Identities = 15/50 (30%), Positives = 19/50 (38%), Gaps = 4/50 (8%)
Query: 553 SSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPLVRCRQPP 602
+PR P+AP P PP P P S P P++ R P
Sbjct: 7 GNPRMPQAPPQQFPYPPNYGMGQQPDPAFTSPFSP----QSPMMSPRMGP 52
>gnl|CDD|219339 pfam07223, DUF1421, Protein of unknown function (DUF1421). This
family represents a conserved region approximately 350
residues long within a number of plant proteins of
unknown function.
Length = 357
Score = 35.7 bits (82), Expect = 0.12
Identities = 27/122 (22%), Positives = 44/122 (36%), Gaps = 9/122 (7%)
Query: 495 LEAKQKVAELEDRISELKHRADFEQDERKRLENLINSGSVPDDTKAVGLSRSGGREEPSS 554
L KQ++AE + +S+L+ + Q +S V D TK + E +
Sbjct: 11 LRDKQEIAETQKELSKLQLSHEEAQSSEA------HSFHV-DSTKQPPAPEQVAKHELAD 63
Query: 555 PRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPLVRCRQPPRSKLKSRIPRPY 614
P + PP P P P ++S P P L + + + PY
Sbjct: 64 --APLQQVNAALPPAPAPQSPQPDQQQQSQAPPSHQYPSQLPPQQVQSVPQQPTPQQEPY 121
Query: 615 FP 616
+P
Sbjct: 122 YP 123
Score = 34.1 bits (78), Expect = 0.39
Identities = 17/68 (25%), Positives = 27/68 (39%), Gaps = 3/68 (4%)
Query: 552 PSSPRKPEAPSSCIPPPPP---PASLPPPPPPRRSSDLPYLTCPRPLVRCRQPPRSKLKS 608
P+ ++P P P PPP P + P PPP+ Y + P+ + PP +
Sbjct: 114 PTPQQEPYYPPPSQPQPPPAQQPQAQQPQPPPQVPQQQQYQSPPQQPQYQQNPPPQAQSA 173
Query: 609 RIPRPYFP 616
+P
Sbjct: 174 PQVSGLYP 181
>gnl|CDD|173502 PTZ00266, PTZ00266, NIMA-related protein kinase; Provisional.
Length = 1021
Score = 36.3 bits (83), Expect = 0.12
Identities = 38/125 (30%), Positives = 66/125 (52%), Gaps = 12/125 (9%)
Query: 426 KKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLD-ESLSRIK 484
K+ H A E +I+ KK+ + LE E + R ++ + ++++ E+E L+ E L R
Sbjct: 442 KENAHRKALEMKILE-KKRIERLEREERERLER--ERMERIERERLERERLERERLER-- 496
Query: 485 DRLEKETAGHLEAKQKVAELE-DRISELKHRADFEQDERKRLENLINSGSVPDDTKAVGL 543
DRLE++ LE +++V LE DR+ + + + F K +EN +++G P D VG
Sbjct: 497 DRLERDRLDRLE-RERVDRLERDRLEKARRNSYF----LKGMENGLSAGGGPGDGPGVGA 551
Query: 544 SRSGG 548
G
Sbjct: 552 GVGAG 556
>gnl|CDD|236942 PRK11637, PRK11637, AmiB activator; Provisional.
Length = 428
Score = 35.8 bits (83), Expect = 0.13
Identities = 31/138 (22%), Positives = 66/138 (47%), Gaps = 17/138 (12%)
Query: 433 AKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDE-------SLSRIKD 485
A++E I K+ +EL + +L + ++++ L ++ +++ L++ +L+ ++
Sbjct: 167 ARQETIAELKQTREELAAQKAELEEKQSQQKTLLYEQQAQQQKLEQARNERKKTLTGLES 226
Query: 486 RLEKETAGHLEAKQKVAELEDRIS----ELKHRADFEQDE----RKRLENLINSGSV--P 535
L+K+ E + + L D I+ E K RA+ E E R + + GS P
Sbjct: 227 SLQKDQQQLSELRANESRLRDSIARAEREAKARAEREAREAARVRDKQKQAKRKGSTYKP 286
Query: 536 DDTKAVGLSRSGGREEPS 553
+++ +SR+GG P
Sbjct: 287 TESERSLMSRTGGLGRPR 304
>gnl|CDD|220369 pfam09731, Mitofilin, Mitochondrial inner membrane protein.
Mitofilin controls mitochondrial cristae morphology.
Mitofilin is enriched in the narrow space between the
inner boundary and the outer membranes, where it forms a
homotypic interaction and assembles into a large
multimeric protein complex. The first 78 amino acids
contain a typical amino-terminal-cleavable mitochondrial
presequence rich in positive-charged and hydroxylated
residues and a membrane anchor domain. In addition, it
has three centrally located coiled coil domains.
Length = 493
Score = 35.8 bits (83), Expect = 0.14
Identities = 32/137 (23%), Positives = 60/137 (43%), Gaps = 28/137 (20%)
Query: 399 LDRLAQQLTLQTETGEDYDHILLKINVKKVVHLLAKEEEIVAAKKKADELEMENVDLSN- 457
LD+L+++L E+ + + K+ L EEE++A + + + + L
Sbjct: 172 LDQLSKKLAELKAEEEE--ELERALKEKREELLSKLEEELLARLESKEAALEKQLRLEFE 229
Query: 458 ----RLAKK-EQELDQKTQ-EKEDLDESLSR----------------IKDRLEKETAGHL 495
L KK E++L Q+ + + E ++ L IK+++E+E G L
Sbjct: 230 REKEELRKKYEEKLRQELERQAEAHEQKLKNELALQAIELQREFNKEIKEKVEEERNGRL 289
Query: 496 EAKQKVAELEDRISELK 512
K+AEL R+ L+
Sbjct: 290 A---KLAELNSRLKGLE 303
>gnl|CDD|165468 PHA03201, PHA03201, uracil DNA glycosylase; Provisional.
Length = 318
Score = 35.3 bits (81), Expect = 0.14
Identities = 16/48 (33%), Positives = 20/48 (41%), Gaps = 3/48 (6%)
Query: 556 RKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPLVRCRQPPR 603
++ + S P PP S P P PPR P T P P +PP
Sbjct: 2 KRARSRS---PSPPRRPSPPRPTPPRSPDASPEETPPSPPGPGAEPPP 46
Score = 29.9 bits (67), Expect = 8.3
Identities = 18/52 (34%), Positives = 20/52 (38%), Gaps = 13/52 (25%)
Query: 544 SRSGGREEPSSPRKPEAPSS-------CIPPPPPPASLPP------PPPPRR 582
SRS SP +P P S P PP P + PP P PRR
Sbjct: 6 SRSPSPPRRPSPPRPTPPRSPDASPEETPPSPPGPGAEPPPGRAAGPAAPRR 57
>gnl|CDD|226513 COG4026, COG4026, Uncharacterized protein containing TOPRIM domain,
potential nuclease [General function prediction only].
Length = 290
Score = 35.3 bits (81), Expect = 0.16
Identities = 18/72 (25%), Positives = 37/72 (51%)
Query: 442 KKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQKV 501
K+K +EL+ E +L L + E E ++ + + L+ SR+++ L+K + K++
Sbjct: 141 KEKLEELQKEKEELLKELEELEAEYEEVQERLKRLEVENSRLEEMLKKLPGEVYDLKKRW 200
Query: 502 AELEDRISELKH 513
ELE + +
Sbjct: 201 DELEPGVELPEE 212
Score = 32.2 bits (73), Expect = 1.2
Identities = 30/144 (20%), Positives = 62/144 (43%), Gaps = 17/144 (11%)
Query: 413 GEDYDHILLKINVKKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQE 472
G D +HI +++ K++ + +V A K + E +DL + +++L++ +E
Sbjct: 97 GHDVEHIDVELVRKEL------KNALVRAGLKTLQRVPEYMDLKEDYEELKEKLEELQKE 150
Query: 473 KEDLDESLSRIKDRLEKETAGHLEAKQKVAELEDRISELKHRADFEQDE----RKRLENL 528
KE+L + L ++ E E ++++ LE S L+ E +KR + L
Sbjct: 151 KEELLKELEELEAEYE-------EVQERLKRLEVENSRLEEMLKKLPGEVYDLKKRWDEL 203
Query: 529 INSGSVPDDTKAVGLSRSGGREEP 552
+P++ L + P
Sbjct: 204 EPGVELPEEELISDLVKETLNLAP 227
>gnl|CDD|177464 PHA02682, PHA02682, ORF080 virion core protein; Provisional.
Length = 280
Score = 35.2 bits (80), Expect = 0.17
Identities = 15/47 (31%), Positives = 20/47 (42%), Gaps = 2/47 (4%)
Query: 556 RKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPLVRCRQPP 602
++P S P P A P P ++ P +TCP P C PP
Sbjct: 74 QRPSGQSPLAPSPACAAPAPACPACAPAAPAPAVTCPAPAPAC--PP 118
Score = 32.1 bits (72), Expect = 1.4
Identities = 21/70 (30%), Positives = 24/70 (34%), Gaps = 2/70 (2%)
Query: 554 SPRKPEAPSSCIPPPPP--PASLPPPPPPRRSSDLPYLTCPRPLVRCRQPPRSKLKSRIP 611
S + P APS P P PA P P P + P CP PP P
Sbjct: 77 SGQSPLAPSPACAAPAPACPACAPAAPAPAVTCPAPAPACPPATAPTCPPPAVCPAPARP 136
Query: 612 RPYFPNVSVQ 621
P P + Q
Sbjct: 137 APACPPSTRQ 146
Score = 30.6 bits (68), Expect = 4.0
Identities = 20/72 (27%), Positives = 29/72 (40%), Gaps = 7/72 (9%)
Query: 552 PSSPRKPEAPSSCIPPPPPPASLPP----PPPPRRSSDLPYLTCPRPLVRCRQPPRSKLK 607
P+ P +C PP PA P PP R+ P L P+P + L
Sbjct: 112 PAPACPPATAPTCPPPAVCPAPARPAPACPPSTRQCPPAPPLPTPKPAPAAKP---IFLH 168
Query: 608 SRIPRPYFPNVS 619
+++P P +P S
Sbjct: 169 NQLPPPDYPAAS 180
Score = 30.6 bits (68), Expect = 4.2
Identities = 21/77 (27%), Positives = 27/77 (35%), Gaps = 17/77 (22%)
Query: 552 PSSPRKPEAPSSCIPPPP-------------PPASLP--PPPPPRRSSDLPYLTCPRPLV 596
P+ A +C P P PPA+ P PPP + P CP
Sbjct: 86 PACAAPAPACPACAPAAPAPAVTCPAPAPACPPATAPTCPPPAVCPAPARPAPACPPSTR 145
Query: 597 RCRQPPRSKLKSRIPRP 613
+C PP L + P P
Sbjct: 146 QC--PPAPPLPTPKPAP 160
>gnl|CDD|166942 PRK00404, tatB, sec-independent translocase; Provisional.
Length = 141
Score = 33.6 bits (77), Expect = 0.17
Identities = 17/54 (31%), Positives = 26/54 (48%), Gaps = 2/54 (3%)
Query: 551 EPSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPLVRCRQPPRS 604
EP +P ++P+ +P PPP ++ PP P + P P QPPR+
Sbjct: 89 EPVTPPTAQSPAPAVPTPPPTSTPAVPPAPAAAVPAPAAAPPPS--DPPQPPRA 140
Score = 33.3 bits (76), Expect = 0.29
Identities = 22/74 (29%), Positives = 28/74 (37%), Gaps = 4/74 (5%)
Query: 512 KHRADFEQDERKRLENLI---NSGSVPDDTKAVGLSRSGGREEPSSPRKPEAPSSCIPPP 568
+H EQ+ RK L L V T S+P P AP++ +P P
Sbjct: 66 EHILSMEQEARKILAPLTPPAPPEPVTPPTAQSPAPAVPTPPPTSTPAVPPAPAAAVPAP 125
Query: 569 -PPPASLPPPPPPR 581
P PP PPR
Sbjct: 126 AAAPPPSDPPQPPR 139
>gnl|CDD|236798 PRK10929, PRK10929, putative mechanosensitive channel protein;
Provisional.
Length = 1109
Score = 35.8 bits (83), Expect = 0.17
Identities = 25/79 (31%), Positives = 41/79 (51%), Gaps = 8/79 (10%)
Query: 431 LLAKEEEIVAAKKKADELEMENVDLSNR--LAKKEQELDQKTQEKEDLDESLSRIKDRL- 487
L A + E A K DELE+ + +NR LA+ EL +K + LD L ++++L
Sbjct: 175 LTALQAESAALKALVDELELAQLSANNRQELARLRSELAKK--RSQQLDAYLQALRNQLN 232
Query: 488 ---EKETAGHLEAKQKVAE 503
++E LE+ + +AE
Sbjct: 233 SQRQREAERALESTELLAE 251
>gnl|CDD|200219 TIGR02927, SucB_Actino, 2-oxoglutarate dehydrogenase, E2 component,
dihydrolipoamide succinyltransferase. This model
represents an Actinobacterial clade of E2 enzyme, a
component of the 2-oxoglutarate dehydrogenase complex
involved in the TCA cycle. These proteins have multiple
domains including the catalytic domain (pfam00198), one
or two biotin domains (pfam00364) and an E3-component
binding domain (pfam02817).
Length = 579
Score = 35.4 bits (81), Expect = 0.17
Identities = 15/49 (30%), Positives = 20/49 (40%), Gaps = 5/49 (10%)
Query: 551 EPSSPRKPEAPSSCIPPPPPPASLPP--PPPPRRSSDLPYLTCPRPLVR 597
+P++ AP P P P + P P PY+T PLVR
Sbjct: 228 DPAARAPHAAPDPPAPAPAPAKTAAPAAAAPVSSGDSGPYVT---PLVR 273
>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein. This entry is a highly
conserved protein present in eukaryotes.
Length = 680
Score = 35.3 bits (81), Expect = 0.18
Identities = 26/105 (24%), Positives = 48/105 (45%), Gaps = 14/105 (13%)
Query: 421 LKINVKKVVH-LLAKEEEIVAAKKKADELEM--ENVDLSNRLAKKEQELDQKTQEKE--- 474
L++ +KK+ H L KEEE +K+A EL E+ + L Q + K E
Sbjct: 538 LEMEIKKLEHDLKLKEEECRMLEKEAQELRKYQESEKETEVLMSALQAMQDKNLMLENSL 597
Query: 475 --------DLDESLSRIKDRLEKETAGHLEAKQKVAELEDRISEL 511
DL +L +K +LE + + ++ +L+ +I+++
Sbjct: 598 SAETRLKLDLFSALGDVKRQLEIAHGAIYKREAEIVDLKSKIADI 642
Score = 31.0 bits (70), Expect = 4.4
Identities = 22/92 (23%), Positives = 42/92 (45%), Gaps = 11/92 (11%)
Query: 437 EIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLE 496
E+ A++ EL + L++ + +L Q +E + L L+ + +K+
Sbjct: 421 ELQQARQNESELRNQISLLTSLERSLKSDLGQLKKENDMLQTKLNSMVSAKQKD------ 474
Query: 497 AKQKVAELEDRI-SELKHRADFE---QDERKR 524
KQ + +E R+ SE R + E +E+KR
Sbjct: 475 -KQSMQSMEKRLKSEADSRVNAEKQLAEEKKR 505
>gnl|CDD|223428 COG0351, ThiD, Hydroxymethylpyrimidine/phosphomethylpyrimidine
kinase [Coenzyme metabolism].
Length = 263
Score = 34.9 bits (81), Expect = 0.19
Identities = 16/59 (27%), Positives = 25/59 (42%), Gaps = 11/59 (18%)
Query: 703 SKAILSMDSADQLPLDMLEQLLKF----TPNTEEAALL----EEHSTETLARADRFLYE 753
S L + A + + E+LL TPN EA L + + E + A + L+E
Sbjct: 112 SGDPLLDEEAVEA---LREELLPLATVVTPNLPEAEALSGLPKIKTEEDMKEAAKLLHE 167
>gnl|CDD|220623 pfam10186, Atg14, UV radiation resistance protein and
autophagy-related subunit 14. The Atg14 or Apg14
proteins are hydrophilic proteins with a predicted
molecular mass of 40.5 kDa, and have a coiled-coil motif
at the N terminus region. Yeast cells with mutant Atg14
are defective not only in autophagy but also in sorting
of carboxypeptidase Y (CPY), a vacuolar-soluble
hydrolase, to the vacuole. Subcellular fractionation
indicate that Apg14p and Apg6p are peripherally
associated with a membrane structure(s). Apg14p was
co-immunoprecipitated with Apg6p, suggesting that they
form a stable protein complex. These results imply that
Apg6/Vps30p has two distinct functions: in the
autophagic process and in the vacuolar protein sorting
pathway. Apg14p may be a component specifically required
for the function of Apg6/Vps30p through the autophagic
pathway. There are 17 auto-phagosomal component proteins
which are categorized into six functional units, one of
which is the AS-PI3K complex (Vps30/Atg6 and Atg14). The
AS-PI3K complex and the Atg2-Atg18 complex are essential
for nucleation, and the specific function of the AS-PI3K
apparently is to produce phosphatidylinositol
3-phosphate (PtdIns(3)P) at the pre-autophagosomal
structure (PAS). The localisation of this complex at the
PAS is controlled by Atg14. Autophagy mediates the
cellular response to nutrient deprivation, protein
aggregation, and pathogen invasion in humans, and
malfunction of autophagy has been implicated in multiple
human diseases including cancer. This effect seems to be
mediated through direct interaction of the human Atg14
with Beclin 1 in the human phosphatidylinositol 3-kinase
class III complex.
Length = 307
Score = 34.7 bits (80), Expect = 0.23
Identities = 16/83 (19%), Positives = 36/83 (43%), Gaps = 8/83 (9%)
Query: 454 DLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHL-----EAKQKVAELEDRI 508
N L + +L + E E+L ++++ LE T K +VA ++R+
Sbjct: 17 CARNSLYELRLDLARLLLENEELK---QKVEEALEGATNEDGKLAADLLKLEVARKKERL 73
Query: 509 SELKHRADFEQDERKRLENLINS 531
++++ R ++E ++ I
Sbjct: 74 NQIRARISQLKEEIEQKRERIEE 96
Score = 29.3 bits (66), Expect = 10.0
Identities = 11/69 (15%), Positives = 34/69 (49%), Gaps = 1/69 (1%)
Query: 447 ELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQK-VAELE 505
+ + R+++ ++E++QK + E+L +L++ + L + + + + +L+
Sbjct: 68 RKKERLNQIRARISQLKEEIEQKRERIEELKRALAQRRSDLSSASYQLEKRRASQLEKLQ 127
Query: 506 DRISELKHR 514
D I + +
Sbjct: 128 DEIKRTRSK 136
>gnl|CDD|205039 pfam12718, Tropomyosin_1, Tropomyosin like. This family is a set
of eukaryotic tropomyosins. Within the yeast Tmp1 and
Tmp2, biochemical and sequence analyses indicate that
Tpm2p spans four actin monomers along a filament,
whereas Tpmlp spans five. Despite its shorter length,
Tpm2p can compete with Tpm1p for binding to F-actin.
Over-expression of Tpm2p in vivo alters the axial
budding of haploids to a bipolar pattern, and this can
be partially suppressed by co-over-expression of Tpm1p.
This suggests distinct functions for the two
tropomyosins, and indicates that the ratio between them
is important for correct morphogenesis. The family also
contains higher eukaryote Tmp3 members.
Length = 143
Score = 33.3 bits (77), Expect = 0.25
Identities = 19/73 (26%), Positives = 42/73 (57%), Gaps = 7/73 (9%)
Query: 440 AAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQ 499
A+++A+ELE + +L +KEQE+ ++ + L+E + +++++L+ EAK+
Sbjct: 11 NAQERAEELEEKLKELEQENLEKEQEITSLQKKNQQLEEEVEKLEEQLK-------EAKE 63
Query: 500 KVAELEDRISELK 512
K+ E E + +
Sbjct: 64 KLEESEKLATNAE 76
Score = 31.0 bits (71), Expect = 1.4
Identities = 30/102 (29%), Positives = 42/102 (41%), Gaps = 19/102 (18%)
Query: 431 LLAKEEEIVAAKKKADELEME----NVDLSNRLAKKEQELDQKTQEK----------EDL 476
L KE+EI + +KK +LE E L K E+ T + E+L
Sbjct: 30 NLEKEQEITSLQKKNQQLEEEVEKLEEQLKEAKEKLEESEKLATNAEALTRRIQLLEEEL 89
Query: 477 DESLSRIKDRLEK-----ETAGHLEAKQKVAELEDRISELKH 513
+ES R+K+ EK + A E K K E E E K+
Sbjct: 90 EESEKRLKETTEKLREADKKAEESERKVKALENERDEWEEKY 131
>gnl|CDD|214922 smart00935, OmpH, Outer membrane protein (OmpH-like). This family
includes outer membrane proteins such as OmpH among
others. Skp (OmpH) has been characterized as a molecular
chaperone that interacts with unfolded proteins as they
emerge in the periplasm from the Sec translocation
machinery.
Length = 140
Score = 33.3 bits (77), Expect = 0.25
Identities = 23/110 (20%), Positives = 47/110 (42%), Gaps = 19/110 (17%)
Query: 423 INVKKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSR 482
++V+K++ +E A K +LE E L K E+EL + ++ L +
Sbjct: 4 VDVQKIL------QESPAGKAAQKQLEKEFKKRQAELEKLEKELQKLKEK-------LQK 50
Query: 483 IKDRLEKETAGHLEAKQKVAELEDRISELKH-RADFEQDERKRLENLINS 531
L + ++K EL+ ++ E + + +QD +KR + +
Sbjct: 51 DAATLSEA-----AREKKEKELQKKVQEFQRKQQKLQQDLQKRQQEELQK 95
>gnl|CDD|235540 PRK05641, PRK05641, putative acetyl-CoA carboxylase biotin carboxyl
carrier protein subunit; Validated.
Length = 153
Score = 33.3 bits (76), Expect = 0.26
Identities = 15/58 (25%), Positives = 26/58 (44%), Gaps = 2/58 (3%)
Query: 537 DTKAVGLSRSGGREEPSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRP 594
+ K +G+ S +E+ +P AP+ P P + P P P ++ +T P P
Sbjct: 37 EAKGLGIDLSAVQEQVPTPAPAPAPAVPSAPTPVAPAAPAPAPASAGENV--VTAPMP 92
>gnl|CDD|237864 PRK14950, PRK14950, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 585
Score = 34.8 bits (80), Expect = 0.28
Identities = 20/92 (21%), Positives = 30/92 (32%), Gaps = 14/92 (15%)
Query: 525 LENLINSGSVPDDTKAVGLSRSGGREEPSSPRKPEAPSSCIPPP--------------PP 570
+E L+ P K + S R P+ +P+A ++ PP P
Sbjct: 356 IEALLVPVPAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPR 415
Query: 571 PASLPPPPPPRRSSDLPYLTCPRPLVRCRQPP 602
P + P P P + L P PP
Sbjct: 416 PVAPPVPHTPESAPKLTRAAIPVDEKPKYTPP 447
>gnl|CDD|185628 PTZ00449, PTZ00449, 104 kDa microneme/rhoptry antigen; Provisional.
Length = 943
Score = 34.7 bits (79), Expect = 0.33
Identities = 17/76 (22%), Positives = 26/76 (34%), Gaps = 2/76 (2%)
Query: 545 RSGGREEPSSPRKPEAPSSCIPPPPPPASLPPPPPPR-RSSDLP-YLTCPRPLVRCRQPP 602
+ ++P P+ PE P P P P D+P P ++PP
Sbjct: 578 KPEFPKDPKHPKDPEEPKKPKRPRSAQRPTRPKSPKLPELLDIPKSPKRPESPKSPKRPP 637
Query: 603 RSKLKSRIPRPYFPNV 618
+ S RP P +
Sbjct: 638 PPQRPSSPERPEGPKI 653
Score = 32.4 bits (73), Expect = 1.6
Identities = 20/69 (28%), Positives = 25/69 (36%), Gaps = 2/69 (2%)
Query: 550 EEPSSPRKPEAPSSCIPPPPP--PASLPPPPPPRRSSDLPYLTCPRPLVRCRQPPRSKLK 607
EEP P++P + P P P L P P+R P P R P R +
Sbjct: 592 EEPKKPKRPRSAQRPTRPKSPKLPELLDIPKSPKRPESPKSPKRPPPPQRPSSPERPEGP 651
Query: 608 SRIPRPYFP 616
I P P
Sbjct: 652 KIIKSPKPP 660
Score = 32.4 bits (73), Expect = 1.8
Identities = 13/30 (43%), Positives = 16/30 (53%)
Query: 550 EEPSSPRKPEAPSSCIPPPPPPASLPPPPP 579
+ PSSP +PE P P PP + PP P
Sbjct: 640 QRPSSPERPEGPKIIKSPKPPKSPKPPFDP 669
Score = 30.4 bits (68), Expect = 7.1
Identities = 59/261 (22%), Positives = 90/261 (34%), Gaps = 35/261 (13%)
Query: 395 HWLLLDRLAQQLTLQTETGEDYDHILLKINVKKVVHLLAKEEEIVAAKKKADELEMENVD 454
H + DR+ + ++Y + KI + + L K+ +KKK +E E+ D
Sbjct: 451 HLRIKDRIPANNDIYMLKFDEYWTRISKIQFTQEIKKLIKK-----SKKKLAPIEEEDSD 505
Query: 455 ----------LSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQKVAEL 504
S K + + + E ED ES + ET K+
Sbjct: 506 KHDEPPEGPEASGLPPKAPGDKEGEEGEHEDSKESDEPKEGGKPGETKEGEVGKKPGPAK 565
Query: 505 E---DRISELKHRADFEQDERKRLENLINSGSVPDDTKAVGLSRSGGRE-EPSSPRKPEA 560
E +I L + +F +D + P++ K RS R P SP+ PE
Sbjct: 566 EHKPSKIPTLSKKPEFPKDPK--------HPKDPEEPKKPKRPRSAQRPTRPKSPKLPEL 617
Query: 561 ---PSSCIPPPPPPASLPPPPP-----PRRSSDLPYLTCPRPLVRCRQPPRSKLKSRIPR 612
P S P P + PPPP P R + P+P + P K K +
Sbjct: 618 LDIPKSPKRPESPKSPKRPPPPQRPSSPERPEGPKIIKSPKPPKSPKPPFDPKFKEKFYD 677
Query: 613 PYFPNVSVQVNGTIWTELDES 633
Y + LDES
Sbjct: 678 DYLDAAAKSKETKTTVVLDES 698
>gnl|CDD|178768 PLN03229, PLN03229, acetyl-coenzyme A carboxylase carboxyl
transferase subunit alpha; Provisional.
Length = 762
Score = 34.4 bits (79), Expect = 0.33
Identities = 28/166 (16%), Positives = 60/166 (36%), Gaps = 26/166 (15%)
Query: 392 HPQHWLLLD---RLAQQLTLQTETGEDYDHILLKINVKKVVH----LLAKEEEIVAAKKK 444
H +L++ +L + + +Y + K+++ L K+ + K +
Sbjct: 503 QLMHPVLMEKIEKLKDEFNKRLSRAPNYLSLKYKLDMLNEFSRAKALSEKKSKAEKLKAE 562
Query: 445 ADELEMENVDLSNRLAKKEQELDQKTQEK--------EDLDESLSRIKDRLEKETAGHLE 496
++ E +D K E + +DL E + ++K +E E AG L+
Sbjct: 563 INKKFKEVMDRPEIKEKMEALKAEVASSGASSGDELDDDLKEKVEKMKKEIELELAGVLK 622
Query: 497 ----------AKQKVAELEDRISELKHR-ADFEQDERKRLENLINS 531
K K + L+ + ++ K++E +I S
Sbjct: 623 SMGLEVIGVTKKNKDTAEQTPPPNLQEKIESLNEEINKKIERVIRS 668
>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein. Function of MutS2 is
unknown. It should not be considered a DNA mismatch
repair protein. It is likely a DNA mismatch binding
protein of unknown cellular function [DNA metabolism,
Other].
Length = 771
Score = 34.4 bits (79), Expect = 0.34
Identities = 35/98 (35%), Positives = 50/98 (51%), Gaps = 5/98 (5%)
Query: 434 KEEEIVAAKKKAD---ELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKE 490
KEE V +K + ELE +N L L ++E+ + QE E+L E K LEKE
Sbjct: 510 KEEINVLIEKLSALEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKERERNKKLELEKE 569
Query: 491 TAGHLEAKQKVAELEDRISELKHRADFEQDERKRLENL 528
L+A +K E+E I ELK + + E K +E+L
Sbjct: 570 AQEALKALKK--EVESIIRELKEKKIHKAKEIKSIEDL 605
Score = 30.6 bits (69), Expect = 5.8
Identities = 22/101 (21%), Positives = 42/101 (41%), Gaps = 11/101 (10%)
Query: 410 TETGEDYDHILLKIN--VKKVVHLLAKEEEIV-AAKKKADELEMENVDLSNRLAKKEQEL 466
E E+ + ++ K++ K++ E+++ +K ELE E +L R K+ EL
Sbjct: 507 GEFKEEINVLIEKLSALEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKERERNKKLEL 566
Query: 467 DQKTQEK--------EDLDESLSRIKDRLEKETAGHLEAKQ 499
+++ QE E + L K KE + +
Sbjct: 567 EKEAQEALKALKKEVESIIRELKEKKIHKAKEIKSIEDLVK 607
>gnl|CDD|216368 pfam01213, CAP_N, Adenylate cyclase associated (CAP) N terminal.
Length = 313
Score = 34.0 bits (78), Expect = 0.34
Identities = 13/40 (32%), Positives = 16/40 (40%)
Query: 548 GREEPSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLP 587
G + P SS PPPP PPP P S+ +
Sbjct: 213 GPVAAAKSALPAVSSSAPSAPPPPPPPPPPSVPTISNSVE 252
>gnl|CDD|183610 PRK12585, PRK12585, putative monovalent cation/H+ antiporter
subunit G; Reviewed.
Length = 197
Score = 33.5 bits (76), Expect = 0.35
Identities = 25/89 (28%), Positives = 49/89 (55%), Gaps = 4/89 (4%)
Query: 422 KINVKKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLS 481
K ++KK L+ ++E+I A+++ +ELE E ++ R K ++ DQ+ QE+E ++++
Sbjct: 112 KDDIKKKKSLIIRQEQIEKARQEREELE-ERMEWERREEKIDEREDQEEQEREREEQTIE 170
Query: 482 RIKDRLEKETAGHLEAKQKVAELEDRISE 510
D E E +E + E +D +E
Sbjct: 171 EQSDDSEHEI---IEQDESETESDDDKTE 196
>gnl|CDD|227512 COG5185, HEC1, Protein involved in chromosome segregation,
interacts with SMC proteins [Cell division and
chromosome partitioning].
Length = 622
Score = 34.6 bits (79), Expect = 0.36
Identities = 23/89 (25%), Positives = 37/89 (41%), Gaps = 12/89 (13%)
Query: 434 KEEEIVAAKKKADEL--EMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKET 491
KEEEI A + DEL ++ +S ++ + QE+E L L +I + +K T
Sbjct: 342 KEEEIKALQSNIDELHKQLRKQGIST------EQFELMNQEREKLTRELDKINIQSDKLT 395
Query: 492 AG----HLEAKQKVAELEDRISELKHRAD 516
LEA+ LE + +
Sbjct: 396 KSVKSRKLEAQGIFKSLEKTLRQYDSLIQ 424
>gnl|CDD|237871 PRK14965, PRK14965, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 576
Score = 34.3 bits (79), Expect = 0.37
Identities = 21/92 (22%), Positives = 28/92 (30%), Gaps = 12/92 (13%)
Query: 494 HLEAKQKVAELEDRISELKHRADFEQDERKRLENLINSGSVPDDTKAVGLSRSGGREEPS 553
L V+EL DR+ L+ A + P P
Sbjct: 360 TLAPGAPVSELLDRLEALERGAPAPPSAAWGAPTPAAPAAPPP-----------AAAPPV 408
Query: 554 SPRKPEAPSSCIPPPPPPASLPPPPPPRRSSD 585
P P P++ P P P PP RS+D
Sbjct: 409 PPAAPARPAAARPAPAPAPP-AAAAPPARSAD 439
Score = 32.8 bits (75), Expect = 1.1
Identities = 14/57 (24%), Positives = 18/57 (31%)
Query: 548 GREEPSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPLVRCRQPPRS 604
AP+ P PPPA+ PP PP + P P P +
Sbjct: 379 RGAPAPPSAAWGAPTPAAPAAPPPAAAPPVPPAAPARPAAARPAPAPAPPAAAAPPA 435
>gnl|CDD|223010 PHA03215, PHA03215, nuclear protein UL24; Provisional.
Length = 262
Score = 33.7 bits (77), Expect = 0.41
Identities = 14/52 (26%), Positives = 22/52 (42%)
Query: 533 SVPDDTKAVGLSRSGGREEPSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSS 584
+VP + + R G P S R+P ++ P P PA+ PP +
Sbjct: 188 AVPLEPRTTRRGRRRGARRPRSRRRPRGGAAGPPVPAEPAAPERPPADPTPT 239
>gnl|CDD|219358 pfam07271, Cytadhesin_P30, Cytadhesin P30/P32. This family
consists of several Mycoplasma species specific
Cytadhesin P32 and P30 proteins. P30 has been found to
be membrane associated and localised on the tip
organelle. It is thought that it is important in
cytadherence and virulence.
Length = 279
Score = 33.5 bits (76), Expect = 0.47
Identities = 24/171 (14%), Positives = 54/171 (31%), Gaps = 23/171 (13%)
Query: 461 KKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQKVAELEDRISELKHRADFEQD 520
+KE+ L ++ + +E + E L RI ++ E++ ++ + + + +
Sbjct: 97 RKEKRLLEEKERQEQMAEQLQRISEQNEQQAIEIDPTEEVNTQEPTQPA--------GVN 148
Query: 521 ERKRLENLINSGSVPDDTKAVGLSRSGGREEPSSPRKP-----------EAPSSCIPPPP 569
+ + P+ + + R G +P+ +P P+ P
Sbjct: 149 VANNPQPQVQPQFGPNPQQRINPQRFGFPMQPNMGMRPGFNQMPPHMPGMPPNQMRPGFN 208
Query: 570 PPASLPP----PPPPRRSSDLPYLTCPRPLVRCRQPPRSKLKSRIPRPYFP 616
P +PP P ++ P + RP F
Sbjct: 209 PMPGMPPRPGFNQNPNMMPNMNRPGFRPQPGGFNHPGTPMGPNMQQRPGFN 259
>gnl|CDD|222426 pfam13870, DUF4201, Domain of unknown function (DUF4201). This is
a family of coiled-coil proteins from eukaryotes. The
function is not known.
Length = 177
Score = 32.9 bits (76), Expect = 0.48
Identities = 24/110 (21%), Positives = 47/110 (42%), Gaps = 15/110 (13%)
Query: 421 LKINVKKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESL 480
LK+ VH+L K+K LE E L + ++EL +K + L + L
Sbjct: 68 LKLKYTNTVHILT------HLKEKLHMLEEE-------LERLKKELKEKEERLAKLRKEL 114
Query: 481 SRIKDRLEKETAGHLEAKQKVAEL--EDRISELKHRADFEQDERKRLENL 528
++K +K + + +Q+ L + + + +++RK +E L
Sbjct: 115 YKVKLERDKLRKQNKKLRQQGGLLHVPALLYDYDKTVEKVEEKRKSVEKL 164
Score = 30.3 bits (69), Expect = 3.4
Identities = 26/118 (22%), Positives = 56/118 (47%), Gaps = 4/118 (3%)
Query: 416 YDHILLKINVKKVVHLLAKEEEIVAAKKKAD--ELEMENVDLSNRLAKKEQEL-DQKTQE 472
I LK + K+ L ++EE+ D +L++EN L+ ++ ++ +EL K +
Sbjct: 13 LKLITLKHTLAKIQEKLEQKEELGEGLHLIDFEQLQIENQALNEKIEERNKELKRLKLKY 72
Query: 473 KEDLDESLSRIKDRLEKETAGHLEAKQKVAELEDRISELKHRADFEQDERKRLENLIN 530
+ L+ +K++L K+++ E E+R+++L+ + ER +L
Sbjct: 73 TNTVHI-LTHLKEKLHMLEEELERLKKELKEKEERLAKLRKELYKVKLERDKLRKQNK 129
>gnl|CDD|146930 pfam04534, Herpes_UL56, Herpesvirus UL56 protein. In herpes
simplex virus type 2, UL56 is thought to be a
tail-anchored type II membrane protein involved in
vesicular trafficking. The C terminal hydrophobic region
is required for association with the cytoplasmic
membrane, and the N terminal proline-rich region is
important for the translocation of UL56 to the Golgi
apparatus and cytoplasmic vesicles.
Length = 198
Score = 33.2 bits (75), Expect = 0.48
Identities = 19/74 (25%), Positives = 26/74 (35%), Gaps = 5/74 (6%)
Query: 535 PDDTKAVGLSRSGGREEPSSPRKPEAPSSC---IPPPPPPASLPPPPPPRRSSDL--PYL 589
P D ++ +S E S P + C P S P P +SS L P
Sbjct: 123 PPDVRSDLVSGLPTYAEAPSDHPPTYATVCAARTTEQPMGLSWPFDQPRSQSSGLWRPPT 182
Query: 590 TCPRPLVRCRQPPR 603
R L R ++ R
Sbjct: 183 VNSRELYRAQRAAR 196
>gnl|CDD|217803 pfam03938, OmpH, Outer membrane protein (OmpH-like). This family
includes outer membrane proteins such as OmpH among
others. Skp (OmpH) has been characterized as a molecular
chaperone that interacts with unfolded proteins as they
emerge in the periplasm from the Sec translocation
machinery.
Length = 157
Score = 32.6 bits (75), Expect = 0.50
Identities = 27/109 (24%), Positives = 49/109 (44%), Gaps = 19/109 (17%)
Query: 423 INVKKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSR 482
++V+KV+ E A K +LE E L L KKE+EL ++ Q+ + +LS
Sbjct: 21 VDVQKVL------SESPAGKAAQKQLEKEFKKLQAELQKKEKELQKEEQKLQKQAATLSE 74
Query: 483 IKDRLEKETAGHLEAKQKVAELEDRISELKH-RADFEQDERKRLENLIN 530
K K EL+ + EL+ + +Q+ +++ + L+
Sbjct: 75 ------------EARKAKQQELQQKQQELQQKQQAAQQELQQKQQELLQ 111
>gnl|CDD|171499 PRK12438, PRK12438, hypothetical protein; Provisional.
Length = 991
Score = 34.1 bits (78), Expect = 0.52
Identities = 16/51 (31%), Positives = 18/51 (35%), Gaps = 3/51 (5%)
Query: 538 TKAVGLSRSGGREEPSSP-RKPEAPSSCIPPPPPPASLPPPPPPRRSSDLP 587
T V + G P P AP PPP + PP PPR P
Sbjct: 899 TGRVATAPGGDAASAPPPGAGPPAPPQ--AVPPPRTTQPPAAPPRGPDVPP 947
>gnl|CDD|202833 pfam03962, Mnd1, Mnd1 family. This family of proteins includes
MND1 from S. cerevisiae. The mnd1 protein forms a
complex with hop2 to promote homologous chromosome
pairing and meiotic double-strand break repair.
Length = 188
Score = 33.0 bits (76), Expect = 0.53
Identities = 19/73 (26%), Positives = 40/73 (54%), Gaps = 8/73 (10%)
Query: 456 SNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEK------ETAGHLEAKQKVAELEDRIS 509
S L K + L++ +E E+L + ++ ++ ++EK ET E +++ +LE +
Sbjct: 61 SQALNKLKTRLEKLKKELEELKQRIAELQAQIEKLKKGREETEERTELLEELKQLEKELK 120
Query: 510 ELKHRADFEQDER 522
+LK A+ E+ E+
Sbjct: 121 KLK--AELEKYEK 131
>gnl|CDD|172376 PRK13855, PRK13855, type IV secretion system protein VirB10;
Provisional.
Length = 376
Score = 33.7 bits (77), Expect = 0.53
Identities = 19/61 (31%), Positives = 22/61 (36%), Gaps = 13/61 (21%)
Query: 545 RSGGREEPSSPR-------KPEAPS----SCIPPPPPPASLPPPPPPRRSSDLPYLTCPR 593
RS EP+ P KP P+ PP A P PP +S P PR
Sbjct: 52 RSKKENEPAPPSTMIATNTKPFHPAPIDVPPDPPAAQEAVQPTAPPSAQSE--PERNEPR 109
Query: 594 P 594
P
Sbjct: 110 P 110
>gnl|CDD|149160 pfam07926, TPR_MLP1_2, TPR/MLP1/MLP2-like protein. The sequences
featured in this family are similar to a region of human
TPR protein and to yeast myosin-like proteins 1 (MLP1)
and 2 (MLP2). These proteins share a number of features;
for example, they all have coiled-coil regions and all
three are associated with nuclear pores. TPR is thought
to be a component of nuclear pore complex- attached
intra-nuclear filaments, and is implicated in nuclear
protein import. Moreover, its N-terminal region is
involved in the activation of oncogenic kinases,
possibly by mediating the dimerisation of kinase domains
or by targeting these kinases to the nuclear pore
complex. MLP1 and MLP2 are involved in the process of
telomere length regulation, where they are thought to
interact with proteins such as Tel1p and modulate their
activity.
Length = 132
Score = 32.3 bits (74), Expect = 0.53
Identities = 18/78 (23%), Positives = 34/78 (43%), Gaps = 5/78 (6%)
Query: 440 AAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQ 499
A +K EL +++ + L ++L++ +E L + L + E K+
Sbjct: 43 AQQKYEREL-VKHAEDIEELQALRKQLNELKKEIAQLKAEAESAQAELSEAEESWEEQKK 101
Query: 500 ----KVAELEDRISELKH 513
+++ELE RI EL
Sbjct: 102 MLEDELSELEKRIEELNE 119
>gnl|CDD|205921 pfam13747, DUF4164, Domain of unknown function (DUF4164). This is
a family of short, approx 100 residue-long, bacterial
proteins of unknown function. There is several conserved
LE/LD sequence pairs.
Length = 89
Score = 31.0 bits (71), Expect = 0.55
Identities = 17/58 (29%), Positives = 24/58 (41%), Gaps = 10/58 (17%)
Query: 471 QEKEDLDESLSRIK---DRLEKETAGHLEAKQKVAELEDRISELKHRADFEQDERKRL 525
+L+ +L R+ DRLE LE + + ELE I L +R RL
Sbjct: 1 TVTTELEAALQRLDAALDRLEAALDRRLERDRDIDELEAEIQRL-------GADRSRL 51
>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
envelope biogenesis, outer membrane].
Length = 387
Score = 33.8 bits (77), Expect = 0.56
Identities = 29/134 (21%), Positives = 50/134 (37%), Gaps = 21/134 (15%)
Query: 426 KKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKD 485
KK AK E AK KA+ AKK+ E + K ++ E+ ++ K
Sbjct: 170 KKKAEEAAKAAE--EAKAKAEAAA----------AKKKAEAEAKAAAEKAKAEAEAKAKA 217
Query: 486 RLEKETAGHLEAKQKVAELEDRISELKHRADFEQDERKR----LENLINSGSVPDDTKAV 541
+ E A +A + + + K A + ERK L+++ S
Sbjct: 218 EKKAEAAAEEKAAAEKKKAAAKAKADKAAAAAKAAERKAAAAALDDIFGGLSS-----GK 272
Query: 542 GLSRSGGREEPSSP 555
++GG + +S
Sbjct: 273 NAPKTGGGAKNASS 286
>gnl|CDD|218636 pfam05557, MAD, Mitotic checkpoint protein. This family consists
of several eukaryotic mitotic checkpoint (Mitotic arrest
deficient or MAD) proteins. The mitotic spindle
checkpoint monitors proper attachment of the bipolar
spindle to the kinetochores of aligned sister chromatids
and causes a cell cycle arrest in prometaphase when
failures occur. Multiple components of the mitotic
spindle checkpoint have been identified in yeast and
higher eukaryotes. In S.cerevisiae, the existence of a
Mad1-dependent complex containing Mad2, Mad3, Bub3 and
Cdc20 has been demonstrated.
Length = 722
Score = 33.7 bits (77), Expect = 0.57
Identities = 29/112 (25%), Positives = 52/112 (46%), Gaps = 3/112 (2%)
Query: 418 HILLKINVKKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLD 477
H +I +++ LA+ E + ELE+ L K E E + +E + L
Sbjct: 84 HKRAQIELERKASTLAENYEREL--DRNLELEVRLKALEELEKKAENEAAEAEEEAKLLK 141
Query: 478 ESLSRIKDRLEKETAGHL-EAKQKVAELEDRISELKHRADFEQDERKRLENL 528
+ L +L+ E L EAK+ ++ +++ +SE++ RA E K LE+
Sbjct: 142 DKLDAESLKLQNEKEDQLKEAKESISRIKNDLSEMQCRAQNADTELKLLESE 193
>gnl|CDD|224755 COG1842, PspA, Phage shock protein A (IM30), suppresses
sigma54-dependent transcription [Transcription / Signal
transduction mechanisms].
Length = 225
Score = 33.1 bits (76), Expect = 0.58
Identities = 24/114 (21%), Positives = 45/114 (39%), Gaps = 17/114 (14%)
Query: 402 LAQQLTLQTETGEDYDHILLKINVKKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAK 461
LAQ + Q + + + EE+ A + +E DL+ +
Sbjct: 47 LAQAIARQKQLERKLEEAQARAE--------KLEEKAELALQAGNE------DLAREALE 92
Query: 462 KEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQKVAELEDRISELKHRA 515
++Q L+ + E + ++L+K+ A +QK+AEL + LK R
Sbjct: 93 EKQSLEDLAKALEAELQQAEEQVEKLKKQLA---ALEQKIAELRAKKEALKARK 143
>gnl|CDD|236090 PRK07764, PRK07764, DNA polymerase III subunits gamma and tau;
Validated.
Length = 824
Score = 33.8 bits (78), Expect = 0.60
Identities = 21/91 (23%), Positives = 27/91 (29%), Gaps = 14/91 (15%)
Query: 518 EQDER---KRLENLINSGSVPDDTKAVGLSRSGGREEPSSPRKPEA-----------PSS 563
DER RLE L V A + + A P++
Sbjct: 368 SDDERGLLARLERLERRLGVAGGAGAPAAAAPSAAAAAPAAAPAPAAAAPAAAAAPAPAA 427
Query: 564 CIPPPPPPASLPPPPPPRRSSDLPYLTCPRP 594
P P PA P PP P ++ P P
Sbjct: 428 APQPAPAPAPAPAPPSPAGNAPAGGAPSPPP 458
>gnl|CDD|237622 PRK14140, PRK14140, heat shock protein GrpE; Provisional.
Length = 191
Score = 32.7 bits (75), Expect = 0.63
Identities = 20/73 (27%), Positives = 36/73 (49%), Gaps = 8/73 (10%)
Query: 461 KKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQKVAELEDRISELKHR-----A 515
+E+ + + +E + + +++ E E E + K+AELE ++ EL+ R A
Sbjct: 8 VEEEVEETEVEEAVEDEVEEETVEEESEAELLD--EEQAKIAELEAKLDELEERYLRLQA 65
Query: 516 DFE-QDERKRLEN 527
DFE R + EN
Sbjct: 66 DFENYKRRIQKEN 78
>gnl|CDD|219111 pfam06625, DUF1151, Protein of unknown function (DUF1151). This
family consists of several hypothetical eukaryotic
proteins of unknown function.
Length = 122
Score = 31.7 bits (72), Expect = 0.63
Identities = 13/46 (28%), Positives = 25/46 (54%)
Query: 434 KEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDES 479
K ++++ +K+ +E + +L L K+ Q+L+Q EK DE
Sbjct: 61 KRDQVLKEQKEEEEAKRLQSELERELMKRAQKLEQLELEKAKEDEE 106
>gnl|CDD|235899 PRK06975, PRK06975, bifunctional uroporphyrinogen-III
synthetase/uroporphyrin-III C-methyltransferase;
Reviewed.
Length = 656
Score = 33.5 bits (77), Expect = 0.69
Identities = 10/43 (23%), Positives = 13/43 (30%)
Query: 552 PSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRP 594
P+ R + S P A+ P PP P R
Sbjct: 277 PAPSRMTDTNDSKSVTSQPAAAAAAPAPPPNPPATPPEPPARR 319
Score = 32.8 bits (75), Expect = 1.2
Identities = 13/34 (38%), Positives = 18/34 (52%)
Query: 550 EEPSSPRKPEAPSSCIPPPPPPASLPPPPPPRRS 583
+ S +P A ++ PPP P + PP PP RR
Sbjct: 287 DSKSVTSQPAAAAAAPAPPPNPPATPPEPPARRG 320
Score = 30.8 bits (70), Expect = 4.4
Identities = 11/35 (31%), Positives = 14/35 (40%)
Query: 550 EEPSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSS 584
+ S A ++ P PPP PP PP R
Sbjct: 286 NDSKSVTSQPAAAAAAPAPPPNPPATPPEPPARRG 320
Score = 30.1 bits (68), Expect = 8.6
Identities = 10/35 (28%), Positives = 12/35 (34%)
Query: 552 PSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDL 586
+S A + PP PP PP R S
Sbjct: 291 VTSQPAAAAAAPAPPPNPPATPPEPPARRGRGSAA 325
>gnl|CDD|221745 pfam12737, Mating_C, C-terminal domain of homeodomain 1. Mating in
fungi is controlled by the loci that determine the
mating type of an individual, and only individuals with
differing mating types can mate. Basidiomycete fungi
have evolved a unique mating system, termed tetrapolar
or bifactorial incompatibility, in which mating type is
determined by two unlinked loci; compatibility at both
loci is required for mating to occur. The multi-allelic
tetrapolar mating system is considered to be a novel
innovation that could have only evolved once, and is
thus unique to the mushroom fungi. This domain is
C-terminal to the homeodomain transcription factor
region.
Length = 418
Score = 33.2 bits (76), Expect = 0.71
Identities = 16/59 (27%), Positives = 16/59 (27%), Gaps = 5/59 (8%)
Query: 551 EPSSPRKPEAPSSCIPPP--PPPASLPPPPPPRRSSDLP---YLTCPRPLVRCRQPPRS 604
E S E S P P P RR S P RP R R S
Sbjct: 75 ERSPALSSERLLSPSPSVLDLSPVLASPQTGKRRRSSSPSDDEDEAERPSKRPRSDSIS 133
Score = 30.2 bits (68), Expect = 7.2
Identities = 23/114 (20%), Positives = 35/114 (30%), Gaps = 24/114 (21%)
Query: 511 LKHRADFEQDE-------RKRLENLINSGSVPDDTKAVGLSRSGGREEPSSPRKPEAPSS 563
+ R+ D+ KR + S S P++ + E +
Sbjct: 106 KRRRSSSPSDDEDEAERPSKRPRSDSISSSSSPAKPP-----EACLPSPAASTQDELSEA 160
Query: 564 CIPPPPPPASLPPPPPP---------RRSSDLPYLTCPRPLVRCRQPPRSKLKS 608
P P P+ PP P RR SD L P+ R + R + S
Sbjct: 161 SAAPLPTPSLSPPHTPTDTAPSGKRKRRLSDGFQLPAPK---RPQTSSRPQTVS 211
>gnl|CDD|227505 COG5178, PRP8, U5 snRNP spliceosome subunit [RNA processing and
modification].
Length = 2365
Score = 33.8 bits (77), Expect = 0.73
Identities = 14/26 (53%), Positives = 15/26 (57%), Gaps = 4/26 (15%)
Query: 561 PSSCIPPPPPPA----SLPPPPPPRR 582
P + PPPPPP S PPPPPP
Sbjct: 6 PGNPPPPPPPPGFEPPSQPPPPPPPG 31
Score = 29.9 bits (67), Expect = 9.2
Identities = 13/30 (43%), Positives = 14/30 (46%), Gaps = 4/30 (13%)
Query: 552 PSSPRKPEAPSSCIPPPPPPASLPPPPPPR 581
P +P P P PP PP PPPPP
Sbjct: 6 PGNPPPPPPPPGFEPPSQPP----PPPPPG 31
>gnl|CDD|220260 pfam09483, HpaP, Type III secretion protein (HpaP). This entry
represents proteins encoded by genes which are always
found in type III secretion operons, although their
function in the processes of secretion and virulence is
unclear. Hpa stands for Hrp-associated gene, where Hrp
stands for hypersensitivity response and virulence. see
also PMID:18584024.
Length = 185
Score = 32.4 bits (74), Expect = 0.73
Identities = 13/70 (18%), Positives = 15/70 (21%), Gaps = 1/70 (1%)
Query: 534 VPDDTKAVGLSRSGGREEPSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPR 593
P RS P PP P P P R P P
Sbjct: 1 APAGAAPRAARRSFDYARLMRRAARAGPPGTPAPPGPAEDAHPEFPERPRDA-PAPPAPP 59
Query: 594 PLVRCRQPPR 603
+ P+
Sbjct: 60 RATDGDRDPQ 69
>gnl|CDD|130529 TIGR01462, greA, transcription elongation factor GreA. The GreA
and GreB transcription elongation factors enable to
continuation of RNA transcription past template-encoded
arresting sites. Among the Proteobacteria, distinct
clades of GreA and GreB are found. GreA differs
functionally in that it releases smaller
oligonucleotides. Because members of the family outside
the Proteobacteria resemble GreA more closely than GreB,
the GreB clade (TIGR01461) forms a plausible outgroup
and the remainder of the GreA/B family, included in this
model, is designated GreA. In the Chlamydias and some
spirochetes, the region described by This model is found
as the C-terminal region of a much larger protein
[Transcription, Transcription factors].
Length = 151
Score = 31.8 bits (73), Expect = 0.74
Identities = 23/87 (26%), Positives = 39/87 (44%), Gaps = 22/87 (25%)
Query: 461 KKEQELDQ-KTQEKEDLDESLSRIKDRLE----KETAGHLEAKQKVAELEDRISELKHRA 515
K ++EL+ KT ++ ++ E I + E E A + AK++ E RI+E
Sbjct: 9 KLKEELEYLKTVKRPEISE---EIAEAREHGDLSENAEYHAAKEEQGFNEGRIAE----- 60
Query: 516 DFEQDERKRLENLINSGSVPDDTKAVG 542
LE+L+ + V DD+K
Sbjct: 61 ---------LEDLLANAQVIDDSKLST 78
>gnl|CDD|237865 PRK14951, PRK14951, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 618
Score = 33.5 bits (77), Expect = 0.77
Identities = 13/57 (22%), Positives = 22/57 (38%), Gaps = 3/57 (5%)
Query: 551 EPSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSD---LPYLTCPRPLVRCRQPPRS 604
P + AP++ P +L P PP + + + +P P P V P +
Sbjct: 424 APVAAPAAAAPAAAPAAAPAAVALAPAPPAQAAPETVAIPVRVAPEPAVASAAPAPA 480
Score = 30.5 bits (69), Expect = 6.2
Identities = 7/34 (20%), Positives = 13/34 (38%)
Query: 547 GGREEPSSPRKPEAPSSCIPPPPPPASLPPPPPP 580
P++ +A ++ P P A+ P P
Sbjct: 384 PEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAP 417
Score = 30.1 bits (68), Expect = 8.7
Identities = 10/51 (19%), Positives = 16/51 (31%)
Query: 552 PSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPLVRCRQPP 602
++P A + P PP A+ P R + P + P
Sbjct: 435 AAAPAAAPAAVALAPAPPAQAAPETVAIPVRVAPEPAVASAAPAPAAAPAA 485
>gnl|CDD|223043 PHA03323, PHA03323, nuclear egress membrane protein UL34;
Provisional.
Length = 272
Score = 32.7 bits (75), Expect = 0.79
Identities = 17/64 (26%), Positives = 25/64 (39%), Gaps = 5/64 (7%)
Query: 549 REEPSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPLVRCRQPPRSKLKS 608
+ RK +S PP PP + PP P ++ + + PL + PRS
Sbjct: 193 DGSIAGDRKNGDAASPGCPPIPPRAASPPLPAAGAAR-GWASLASPL----RAPRSVPTV 247
Query: 609 RIPR 612
RI
Sbjct: 248 RISA 251
>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584). This
protein is found in bacteria and eukaryotes. Proteins in
this family are typically between 943 to 1234 amino
acids in length. This family contains a P-loop motif
suggesting it is a nucleotide binding protein. It may be
involved in replication.
Length = 1198
Score = 33.5 bits (77), Expect = 0.81
Identities = 30/121 (24%), Positives = 48/121 (39%), Gaps = 9/121 (7%)
Query: 431 LLAKEEEIVAAKKKADELEME----NVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDR 486
L + E+ + A+EL+ E D R K EQE ++ + LDE L ++D
Sbjct: 821 LAIQLREL---ESSAEELQQELTRLIKDTKLRRKKLEQERKALEKQLDQLDELLRGLRDE 877
Query: 487 LEKETAGHLEAKQKVAELEDRISELKHRADFEQDERKRLENLINSGSVPDDTKAVGLSRS 546
+ + A + E ISE + + + +RKRL + D S S
Sbjct: 878 MRQLAELKEPA--NANQAEGSISERLDQLEEFKRKRKRLSGDLKKFLERFDGVIKDHSGS 935
Query: 547 G 547
G
Sbjct: 936 G 936
Score = 32.7 bits (75), Expect = 1.3
Identities = 29/84 (34%), Positives = 45/84 (53%), Gaps = 4/84 (4%)
Query: 446 DELEMENVDLSNRLAK-KEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQKVAEL 504
++LE + + RLA +E++ QK +EDL S+++ +LE AG LE ++ EL
Sbjct: 391 EQLERDLEKNNERLAAIREEKDRQKAAIEEDLQALESQLRQQLE---AGKLEFNEEEYEL 447
Query: 505 EDRISELKHRADFEQDERKRLENL 528
E R+ LK R D + LE L
Sbjct: 448 ELRLGRLKQRLDSATATPEELEQL 471
Score = 31.2 bits (71), Expect = 4.1
Identities = 31/126 (24%), Positives = 52/126 (41%), Gaps = 15/126 (11%)
Query: 408 LQTETGEDYDHILLKINVKKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELD 467
LQ + E + L++ L+A EE + A+++ + E +N L +L E +L
Sbjct: 251 LQEDF-EQLLSLELRLQHLHG-ELVADEERL--AEEQEERQEAKN-RLRQQLRTLEDQLK 305
Query: 468 QK----TQEKEDLDESLSRIKDRLEKETAGHLEAKQKVAELEDRISELKHRADFEQDERK 523
+ QE + L+ + LE LE QK A + I +L+ D R
Sbjct: 306 EARDELNQELSAANAKLAADRSELE-----LLED-QKGAFEDADIEQLQADLDQLPSIRS 359
Query: 524 RLENLI 529
LE +
Sbjct: 360 ELEEVE 365
>gnl|CDD|215496 PLN02918, PLN02918, pyridoxine (pyridoxamine) 5'-phosphate oxidase.
Length = 544
Score = 33.4 bits (76), Expect = 0.82
Identities = 11/57 (19%), Positives = 16/57 (28%), Gaps = 6/57 (10%)
Query: 563 SCIPPPPPPASLPPPPP-----PRRSSDLPYLTCPRPLVRCRQPPRSKLKSRIPRPY 614
+ + P + PPPP S P P R L ++ P
Sbjct: 11 TFLLQSLLPLPISPPPPHSSSLSSSPSPTQRFLTPSQG-SRLPPRRRALCTKSQDPR 66
>gnl|CDD|234398 TIGR03921, T7SS_mycosin, type VII secretion-associated serine
protease mycosin. Members of this family are
subtilisin-related serine proteases, found strictly in
the Actinobacteria and associated with type VII
secretion operons. The designation mycosin is used for
members from Mycobacterium [Protein fate, Protein and
peptide secretion and trafficking, Protein fate, Protein
modification and repair].
Length = 350
Score = 33.1 bits (76), Expect = 0.82
Identities = 10/32 (31%), Positives = 13/32 (40%)
Query: 555 PRKPEAPSSCIPPPPPPASLPPPPPPRRSSDL 586
P + P P P P + P PPPP +
Sbjct: 287 PPEDGRPLRPAPAPARPVAAPAPPPPPDDTPR 318
>gnl|CDD|234252 TIGR03545, TIGR03545, TIGR03545 family protein. This model
represents a relatively rare but broadly distributed
uncharacterized protein family, distributed in 1-2
percent of bacterial genomes, all of which have outer
membranes. In many of these genomes, it is part of a
two-gene pair.
Length = 555
Score = 33.2 bits (76), Expect = 0.83
Identities = 25/118 (21%), Positives = 54/118 (45%), Gaps = 15/118 (12%)
Query: 426 KKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKD 485
KK L ++++ KK+ + ++ +++ L K ++E D+ +E + + + K+
Sbjct: 181 KKRKKDLPNKQDLEEYKKRLEAIKKKDIKNPLELQKIKEEFDKLKKEGKADKQKIKSAKN 240
Query: 486 RLEKETAGHLEAKQKVAELEDRISELKHRADFEQDERKRLENL--INSGSVPDDTKAV 541
L+ + + K +AEL+ Q++ KRLEN I SG + + +
Sbjct: 241 DLQNDKK---QLKADLAELKKAP----------QNDLKRLENKYAIKSGDLKNFAVDL 285
>gnl|CDD|223202 COG0124, HisS, Histidyl-tRNA synthetase [Translation, ribosomal
structure and biogenesis].
Length = 429
Score = 33.0 bits (76), Expect = 0.90
Identities = 15/85 (17%), Positives = 33/85 (38%), Gaps = 10/85 (11%)
Query: 415 DYDHILLKINV---KKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQ 471
+ L + + + ++ L K ++I + D + LD K
Sbjct: 168 ILEGRLEYLGIDQREALLRYLDKLDKIGKLELDEDSKRRLKTNPLR-------VLDSKKD 220
Query: 472 EKEDLDESLSRIKDRLEKETAGHLE 496
++L ++ + D L++E+ HLE
Sbjct: 221 SDQELLKNAPELLDYLDEESLEHLE 245
>gnl|CDD|200948 pfam00038, Filament, Intermediate filament protein.
Length = 312
Score = 32.6 bits (75), Expect = 0.95
Identities = 29/102 (28%), Positives = 55/102 (53%), Gaps = 5/102 (4%)
Query: 435 EEEIVAAKKKADELEMENVDLSNRLAKKEQELD-QKTQEKEDLDESLSRIKDR--LEKET 491
E +IV +K DE + VDL ++ ++EL K +E++ E S+I+ + +E +
Sbjct: 102 ENDIVGLRKDLDEATLARVDLEMKIESLQEELAFLKKNHEEEVRELQSQIQGQVNVEMDA 161
Query: 492 AGHLEAKQKVAELEDRISEL--KHRADFEQDERKRLENLINS 531
A L+ + +AE+ + EL K+R + E+ + +LE L +
Sbjct: 162 ARKLDLTKALAEIRAQYEELAEKNRQEAEEWYKSKLEELQQA 203
>gnl|CDD|219791 pfam08317, Spc7, Spc7 kinetochore protein. This domain is found in
cell division proteins which are required for
kinetochore-spindle association.
Length = 321
Score = 32.7 bits (75), Expect = 0.97
Identities = 21/97 (21%), Positives = 43/97 (44%), Gaps = 7/97 (7%)
Query: 435 EEEIVAAKKKADELEMENVD----LSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKE 490
+EEI + ++ ADEL + + L ++ +K ++ E+L + L + +E
Sbjct: 188 KEEIASLRQLADELNLCDPLELEKARQELRSLSVKISEKRKQLEELQQELQELTIAIEAL 247
Query: 491 TAGHLEAKQKVAELEDRISELKHRADFEQDERKRLEN 527
T E +++AE E E + + E +L+
Sbjct: 248 TNKKSELLEEIAEAEKIREECRG---WSAKEISKLKA 281
Score = 31.2 bits (71), Expect = 3.2
Identities = 20/99 (20%), Positives = 37/99 (37%), Gaps = 13/99 (13%)
Query: 435 EEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGH 494
EE + + ++ + L ++ L Q E D LEK
Sbjct: 163 EELLNKDLNLINSIKPKLRKKLQALKEEIASLRQLADELNLCD------PLELEK----- 211
Query: 495 LEAKQKVAELEDRISELKHRADFEQDERKRLENLINSGS 533
A+Q++ L +ISE + + + Q E + L I + +
Sbjct: 212 --ARQELRSLSVKISEKRKQLEELQQELQELTIAIEALT 248
>gnl|CDD|147144 pfam04834, Adeno_E3_14_5, Early E3 14.5 kDa protein. The E3B 14.5
kDa was first identified in Human adenovirus type 5. It
is an integral membrane protein oriented with its C
terminus in the cytoplasm. It functions to down-regulate
the epidermal growth factor receptor and prevent tumour
necrosis factor cytolysis. It achieves this through the
interaction with E3 10.4 kDa protein.
Length = 97
Score = 30.5 bits (69), Expect = 1.00
Identities = 9/26 (34%), Positives = 11/26 (42%)
Query: 555 PRKPEAPSSCIPPPPPPASLPPPPPP 580
PR P+ + PPPP P P
Sbjct: 60 PRYPDPQHIPLQQPPPPPEPQPRAPS 85
>gnl|CDD|193407 pfam12934, FTO_CTD, FTO C-terminal domain. This domain is found at
the C-terminus of the FTO protein which was shown to be
associated with increased BMI and obesity risk in
humans. The N-terminal domain of this protein is a DNA
demethylase and this domain is found to associate with
the N-terminal domain in the crystal structure. This
domain is alpha helical with three helices that form a
bundle.
Length = 172
Score = 32.0 bits (73), Expect = 1.0
Identities = 31/133 (23%), Positives = 55/133 (41%), Gaps = 19/133 (14%)
Query: 868 LCHVKEASKVSMKELENDMTQLRNNLKEVSREIEFQRV-QPLVPGDRFLPVMKEFLTSAT 926
L +K+++ VS+K LE + Q L+E+ E+EF+ + Q + G R + T
Sbjct: 17 LPDLKDSAAVSLKSLEAAVIQQ---LEEIHNEVEFEWLRQFWLQGSRHAKCHDYW----T 69
Query: 927 CRLSEIEDLFQDMKTRFDRA-VRLFGEDNSSIQPEDFFVIFDTFLTTLNEAKLDNENMKR 985
++E+ + + M+ L ++ QP +D L L E +
Sbjct: 70 QPIAELTEAWDKMELMTRLVLDALKRASLTTEQPNR---AYDMLLYLLTE-------RQE 119
Query: 986 RREEEEKRARQDA 998
R+E R R A
Sbjct: 120 LRDEWTARCRSGA 132
>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
Length = 646
Score = 33.0 bits (76), Expect = 1.0
Identities = 34/136 (25%), Positives = 58/136 (42%), Gaps = 37/136 (27%)
Query: 773 FHFWISDIEPKIKSVMEASREVTRSRRLRKLLELVLALGNYMNKGARGNASGFRLTSLNK 832
H + K + E R++ L KL E + L +Y+ K L L +
Sbjct: 31 VHI----EDLKEELSNERLRKL--RSLLTKLSEALDKLRSYLPK----------LNPLRE 74
Query: 833 LVDTKSSSSRGVTLLHYLVELAEKRFKEVLKLEEDLCHVKEASKVSMKELENDMTQLRNN 892
S S L EL + +E+ K+E+++ KELE ++++L N
Sbjct: 75 EKKKVSVKS--------LEELIKDVEEELEKIEKEI-----------KELEEEISELENE 115
Query: 893 LKEVSREIEFQRVQPL 908
+KE+ +EIE R++P
Sbjct: 116 IKELEQEIE--RLEPW 129
Score = 32.6 bits (75), Expect = 1.2
Identities = 25/117 (21%), Positives = 51/117 (43%), Gaps = 20/117 (17%)
Query: 438 IVAAKKKADE-------LEMENVDLSN------RLAKKEQELDQKTQEKEDLDESLSRIK 484
+V K+ +DE L E ++L + + ++EL++ +E+E L E L +
Sbjct: 183 VVVLKELSDEVEEELKKLGFERLELEEEGTPSELIREIKEELEEIEKERESLLEELKELA 242
Query: 485 DRLEKE---TAGHLEAKQKVAELEDRISELKHRADFE----QDERKRLENLINSGSV 534
+ +E +LE + + AE + + E +D K+L+ LI+ +
Sbjct: 243 KKYLEELLALYEYLEIELERAEALSKFLKTDKTFAIEGWVPEDRVKKLKELIDKATG 299
Score = 32.2 bits (74), Expect = 1.9
Identities = 29/112 (25%), Positives = 50/112 (44%), Gaps = 19/112 (16%)
Query: 428 VVHLLAKEEEIVAAK-----KKADELEMENVDLSNRLAKKEQELD--QKTQEKEDLDESL 480
VVH+ +EE+ + +L E +D K L +K + L+E +
Sbjct: 30 VVHIEDLKEELSNERLRKLRSLLTKL-SEALDKLRSYLPKLNPLREEKKKVSVKSLEELI 88
Query: 481 SRIKDRLEK--ETAGHLEAKQKVAELEDRISELKHRADFEQDERKRLENLIN 530
+++ LEK + LE ++++ELE+ I EL+ E +RLE N
Sbjct: 89 KDVEEELEKIEKEIKELE--EEISELENEIKELEQ-------EIERLEPWGN 131
Score = 30.3 bits (69), Expect = 7.5
Identities = 20/105 (19%), Positives = 41/105 (39%), Gaps = 13/105 (12%)
Query: 435 EEEIVAAKKKADEL----EMENVDLSNRLAKKEQELDQ-KTQEKEDLDESLSRIKDRLEK 489
+ + D+L N + + L++ +E+L++ IK+ E+
Sbjct: 49 RSLLTKLSEALDKLRSYLPKLNPLREEKKKVSVKSLEELIKDVEEELEKIEKEIKELEEE 108
Query: 490 ETAGHLEAKQKVAELEDRISELKHRADFEQDERKRLENLINSGSV 534
+ E + ++ ELE I L+ +F+ D L L+ V
Sbjct: 109 IS----ELENEIKELEQEIERLEPWGNFDLD----LSLLLGFKYV 145
>gnl|CDD|218762 pfam05816, TelA, Toxic anion resistance protein (TelA). This
family consists of several prokaryotic TelA like
proteins. TelA and KlA are associated with tellurite
resistance and plasmid fertility inhibition.
Length = 333
Score = 32.6 bits (75), Expect = 1.1
Identities = 20/67 (29%), Positives = 31/67 (46%), Gaps = 5/67 (7%)
Query: 421 LKINVKKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESL 480
L + L E+ I A K K ELE + L L +K + QE DL+++L
Sbjct: 124 LYDKNLEYYEEL--EKYIAAGKLKLAELEAK---LLPELEEKAEGDQMAAQELNDLEQAL 178
Query: 481 SRIKDRL 487
R++ R+
Sbjct: 179 DRLEQRV 185
>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
This is a family of fungal proteins of unknown function.
Length = 182
Score = 32.0 bits (73), Expect = 1.1
Identities = 16/86 (18%), Positives = 31/86 (36%), Gaps = 3/86 (3%)
Query: 491 TAGHLEAKQKVAELEDRISELKHRADFEQDERKRLENLINSGSVPDDTKAVGLSRSGGRE 550
A + EAK+K EL + I ++K ++E+ ++K S D K + +
Sbjct: 54 DAEYTEAKKKKKELAEEIEKVK--KEYEE-KQKWKWKKKKSKKKKDKDKDKKDDKKDDKS 110
Query: 551 EPSSPRKPEAPSSCIPPPPPPASLPP 576
E ++ E +
Sbjct: 111 EKKDEKEAEDKLEDLTKSYSETLSTL 136
Score = 32.0 bits (73), Expect = 1.1
Identities = 19/77 (24%), Positives = 34/77 (44%), Gaps = 5/77 (6%)
Query: 436 EEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHL 495
E AKKK EL E + +K++ +K + K+ D+ KD+ + +
Sbjct: 55 AEYTEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKD----KDK-KDDKKDDK 109
Query: 496 EAKQKVAELEDRISELK 512
K+ E ED++ +L
Sbjct: 110 SEKKDEKEAEDKLEDLT 126
>gnl|CDD|220403 pfam09788, Tmemb_55A, Transmembrane protein 55A. Members of this
family catalyze the hydrolysis of the 4-position
phosphate of phosphatidylinositol 4,5-bisphosphate, in
the reaction: 1-phosphatidyl-myo-inositol
4,5-bisphosphate + H(2)O =
1-phosphatidyl-1D-myo-inositol 5-phosphate + phosphate.
Length = 258
Score = 32.5 bits (74), Expect = 1.1
Identities = 20/64 (31%), Positives = 25/64 (39%), Gaps = 4/64 (6%)
Query: 536 DDTKAVGLSRSGGREEPSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPL 595
+ G S G E P KP + P P + LPPP P +S D P+
Sbjct: 12 SEDIDGGTGVSPGDEGAGPPSKPPYDQNFAPFPVLLSELPPPYSPAQSPDSG----SAPV 67
Query: 596 VRCR 599
V CR
Sbjct: 68 VNCR 71
>gnl|CDD|153280 cd07596, BAR_SNX, The Bin/Amphiphysin/Rvs (BAR) domain of Sorting
Nexins. BAR domains are dimerization, lipid binding and
curvature sensing modules found in many different
proteins with diverse functions. Sorting nexins (SNXs)
are Phox homology (PX) domain containing proteins that
are involved in regulating membrane traffic and protein
sorting in the endosomal system. SNXs differ from each
other in their lipid-binding specificity, subcellular
localization and specific function in the endocytic
pathway. A subset of SNXs also contain BAR domains. The
PX-BAR structural unit determines the specific membrane
targeting of SNXs. BAR domains form dimers that bind to
membranes, induce membrane bending and curvature, and
may also be involved in protein-protein interactions.
Length = 218
Score = 31.9 bits (73), Expect = 1.2
Identities = 17/100 (17%), Positives = 45/100 (45%), Gaps = 7/100 (7%)
Query: 398 LLDRLAQQLTLQ-TETGEDYDHIL------LKINVKKVVHLLAKEEEIVAAKKKADELEM 450
L + A Q ++ E ++Y L ++ L + ++++ + K + ++L+
Sbjct: 79 LSEAQANQELVKLLEPLKEYLRYCQAVKETLDDRADALLTLQSLKKDLASKKAQLEKLKA 138
Query: 451 ENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKE 490
++ + E+EL++ E+ + I +RL++E
Sbjct: 139 APGIKPAKVEELEEELEEAESALEEARKRYEEISERLKEE 178
>gnl|CDD|219780 pfam08286, Spc24, Spc24 subunit of Ndc80. Spc24 is a component of
the evolutionarily conserved kinetochore-associated
Ndc80 complex and is involved in chromosome segregation.
Length = 117
Score = 30.8 bits (70), Expect = 1.2
Identities = 18/61 (29%), Positives = 27/61 (44%), Gaps = 4/61 (6%)
Query: 447 ELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQKVAELED 506
ELE E L+ L E EL+ E E+L E L + LE++ E + ++D
Sbjct: 3 ELENEKFRLAKELNDLESELNNLQSELEELKERL----EELEEQEVEGDEVDEDETPVDD 58
Query: 507 R 507
Sbjct: 59 S 59
>gnl|CDD|215397 PLN02744, PLN02744, dihydrolipoyllysine-residue acetyltransferase
component of pyruvate dehydrogenase complex.
Length = 539
Score = 32.5 bits (74), Expect = 1.3
Identities = 16/41 (39%), Positives = 20/41 (48%), Gaps = 7/41 (17%)
Query: 551 EPSSPRKPEAPSSCIPPPPP-------PASLPPPPPPRRSS 584
+PSS P AP + PPPP PAS P P + S+
Sbjct: 201 KPSSSAAPAAPKAKPSPPPPKEEEVEKPASSPEPKASKPSA 241
Score = 30.2 bits (68), Expect = 7.8
Identities = 14/44 (31%), Positives = 17/44 (38%)
Query: 561 PSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPLVRCRQPPRS 604
PSS P P A PPPP + P + + PP S
Sbjct: 202 PSSSAAPAAPKAKPSPPPPKEEEVEKPASSPEPKASKPSAPPSS 245
>gnl|CDD|237866 PRK14952, PRK14952, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 584
Score = 32.5 bits (74), Expect = 1.4
Identities = 21/104 (20%), Positives = 33/104 (31%), Gaps = 10/104 (9%)
Query: 548 GREEPSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPR-PLVRCRQPP-RSK 605
++P+ APS P P P P P + P P VR R K
Sbjct: 389 ANLLHNAPQAAPAPS---AAAPEPKHQPAPEPRPVLAPTPASGEPNAAAVRSMWSTVRDK 445
Query: 606 LKSRIPRPYFPNVSVQVNGTIWTELDESKLYASMELEVVDKLFC 649
++ R V + G L+ + L + E + +
Sbjct: 446 VRQRS-----RTTEVMLAGATVRALEGNTLVLTHESAPLARRLS 484
>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3. This
protein, which interacts with both microtubules and
TRAF3 (tumour necrosis factor receptor-associated factor
3), is conserved from worms to humans. The N-terminal
region is the microtubule binding domain and is
well-conserved; the C-terminal 100 residues, also
well-conserved, constitute the coiled-coil region which
binds to TRAF3. The central region of the protein is
rich in lysine and glutamic acid and carries KKE motifs
which may also be necessary for tubulin-binding, but
this region is the least well-conserved.
Length = 506
Score = 32.6 bits (74), Expect = 1.4
Identities = 31/169 (18%), Positives = 55/169 (32%), Gaps = 18/169 (10%)
Query: 458 RLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQKVAELEDRISELKHRADF 517
K+E+ + +K + ++ K+ E+E + + E+ + +
Sbjct: 162 EEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKGKPEEPDVNEE--REK 219
Query: 518 EQDERKRLENLINSGSVPDDTKAVGLSRSGGREEPSSPRKPEAPSSCIPPPPPPASLPPP 577
E+D+ K E + + + S R SS +KP+ S P +S
Sbjct: 220 EEDDGKDRETTTSPMEEDESRQ----SSEISRRSSSSLKKPDPSPSMASPETRESSKRTE 275
Query: 578 PPPRRSSDLPYLTCPRPLVRCRQPPRSKLKSRIPRPYFPNVSVQVNGTI 626
PR S P RP PPR K K + +
Sbjct: 276 TRPRTSLRPP---SARPASARPAPPRVKRK---------EIVTVLQDAQ 312
Score = 31.8 bits (72), Expect = 2.5
Identities = 30/152 (19%), Positives = 56/152 (36%), Gaps = 3/152 (1%)
Query: 433 AKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEK--E 490
KE+++ + + +E + E V +R KK + ++KE +E R R +
Sbjct: 149 EKEKKVEEPRDREEEKKRERVRAKSR-PKKPPKKKPPNKKKEPPEEEKQRQAAREAVKGK 207
Query: 491 TAGHLEAKQKVAELEDRISELKHRADFEQDERKRLENLINSGSVPDDTKAVGLSRSGGRE 550
+++ E +D + E+DE ++ + S S +
Sbjct: 208 PEEPDVNEEREKEEDDGKDRETTTSPMEEDESRQSSEISRRSSSSLKKPDPSPSMASPET 267
Query: 551 EPSSPRKPEAPSSCIPPPPPPASLPPPPPPRR 582
SS R P + + PP + P PPR
Sbjct: 268 RESSKRTETRPRTSLRPPSARPASARPAPPRV 299
>gnl|CDD|223037 PHA03301, PHA03301, envelope glycoprotein L; Provisional.
Length = 226
Score = 31.8 bits (72), Expect = 1.5
Identities = 13/59 (22%), Positives = 19/59 (32%), Gaps = 8/59 (13%)
Query: 548 GREEPSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPLVRCRQPPRSKL 606
R S+P +P PPP + P PR +P +P R +
Sbjct: 176 TRRPLSAPDDEASPQPKSLATPPPVAAPSRRTPRPR--------RKPRGNRTRPSRPRN 226
>gnl|CDD|221952 pfam13166, AAA_13, AAA domain. This family of domains contain a
P-loop motif that is characteristic of the AAA
superfamily. Many of the proteins in this family are
conjugative transfer proteins. This family includes the
PrrC protein that is thought to be the active component
of the anticodon nuclease.
Length = 713
Score = 32.3 bits (74), Expect = 1.5
Identities = 14/58 (24%), Positives = 34/58 (58%), Gaps = 1/58 (1%)
Query: 451 ENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLE-AKQKVAELEDR 507
EN+++ ++ + ++EL + ++ E L+ + + ++ LEK L+ A +K+A+ D
Sbjct: 89 ENIEIEAQIEELKKELKKLEEKIEQLEAEIEKKEEELEKAKNKFLDKAWKKLAKKYDS 146
Score = 32.3 bits (74), Expect = 1.6
Identities = 25/108 (23%), Positives = 43/108 (39%), Gaps = 8/108 (7%)
Query: 434 KEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQE-KEDLDESLSRIKDRLEKETA 492
E +I KK+ +LE + L + KKE+EL++ + + + L++ D E
Sbjct: 93 IEAQIEELKKELKKLEEKIEQLEAEIEKKEEELEKAKNKFLDKAWKKLAKKYDSNLSEAL 152
Query: 493 GHLEAKQK-----VAELEDRISELKHRADFEQDERKRLENLINSGSVP 535
L K+ + EL+ I E E K + S + P
Sbjct: 153 KGLNYKKNFKEKLLKELKSVILNASSLLSLE--ELKAKIKTLFSSNKP 198
>gnl|CDD|236692 PRK10431, PRK10431, N-acetylmuramoyl-l-alanine amidase II;
Provisional.
Length = 445
Score = 32.1 bits (73), Expect = 1.6
Identities = 13/37 (35%), Positives = 18/37 (48%), Gaps = 3/37 (8%)
Query: 568 PPPPASLPPPPPPRRSSDLPYLTCPRPLVRCRQPPRS 604
PPPP PPPP + + P + PR R P ++
Sbjct: 130 PPPP---PPPPVVAKRVETPAVVAPRVSEPARNPFKT 163
>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
and chromosome partitioning].
Length = 420
Score = 32.4 bits (74), Expect = 1.6
Identities = 18/74 (24%), Positives = 32/74 (43%), Gaps = 3/74 (4%)
Query: 459 LAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQKVAELEDRISELKHRADFE 518
LA E+ E+ +L LS + + K E K+ +A+L +S + + +
Sbjct: 173 LAAVRAEI---AAEQAELTTLLSEQRAQQAKLAQLLEERKKTLAQLNSELSADQKKLEEL 229
Query: 519 QDERKRLENLINSG 532
+ RL+N I S
Sbjct: 230 RANESRLKNEIASA 243
Score = 31.6 bits (72), Expect = 2.6
Identities = 32/172 (18%), Positives = 50/172 (29%), Gaps = 13/172 (7%)
Query: 386 PLEFGCHPQHWLLLDRLAQQL-TLQTETGEDYDHILLKINVKKVVHLLAKEEEIVAAKKK 444
P P+ RLA L E D + + L A EI A + +
Sbjct: 132 PPALLVSPEDAQRSVRLAIYYGALNPARAERIDAL-----KATLKQLAAVRAEIAAEQAE 186
Query: 445 ADELEMENVDLSNRLAKK-------EQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEA 497
L E +LA+ +L+ + + E L + RL+ E A A
Sbjct: 187 LTTLLSEQRAQQAKLAQLLEERKKTLAQLNSELSADQKKLEELRANESRLKNEIASAEAA 246
Query: 498 KQKVAELEDRISELKHRADFEQDERKRLENLINSGSVPDDTKAVGLSRSGGR 549
K E RA + +R + + G G+
Sbjct: 247 AAKAREAAAAAEAAAARARAAEAKRTGETYKPTAPEKMLISSTGGFGALRGQ 298
Score = 30.8 bits (70), Expect = 4.0
Identities = 15/72 (20%), Positives = 34/72 (47%)
Query: 433 AKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETA 492
A E++I + + +LE + L +A E +L + + + L + ++ + RL
Sbjct: 49 ALEKKIREQQDQRAKLEKQLKSLETEIASLEAQLIETADDLKKLRKQIADLNARLNALEV 108
Query: 493 GHLEAKQKVAEL 504
E ++++AE
Sbjct: 109 QEREQRRRLAEQ 120
>gnl|CDD|117486 pfam08919, F_actin_bind, F-actin binding. The F-actin binding
domain forms a compact bundle of four antiparallel
alpha-helices, which are arranged in a left-handed
topology. Binding of F-actin to the F-actin binding
domain may result in cytoplasmic retention and
subcellular distribution of the protein, as well as
possible inhibition of protein function.
Length = 179
Score = 31.2 bits (70), Expect = 1.6
Identities = 20/67 (29%), Positives = 27/67 (40%), Gaps = 12/67 (17%)
Query: 548 GREEPSSPRKPEAPSSCIPP--PPPPASLP---PPPPPRRSSDLPYLTCPRPLV------ 596
G ++P P P+ S+ P PP P LP P P + P PL+
Sbjct: 2 GLKKPVPPAVPKPQSTAKPVGTPPSPVPLPSTSPSPSKMANGTQPSSAAFIPLISTRVSL 61
Query: 597 -RCRQPP 602
+ RQPP
Sbjct: 62 RKTRQPP 68
>gnl|CDD|191840 pfam07749, ERp29, Endoplasmic reticulum protein ERp29, C-terminal
domain. ERp29 is a ubiquitously expressed endoplasmic
reticulum protein found in mammals. ERp29 is comprised
of two domains. This domain, the C-terminal domain, has
an all helical fold. ERp29 is thought to form part of
the thyroglobulin folding complex.
Length = 95
Score = 29.9 bits (68), Expect = 1.6
Identities = 14/62 (22%), Positives = 27/62 (43%), Gaps = 2/62 (3%)
Query: 477 DESLSRIKDRLEKETAGHLEAKQKVAELEDRISE--LKHRADFEQDERKRLENLINSGSV 534
D+ + +E+ A ++ K A+ ++ E L+ D+ E RLE L+
Sbjct: 18 DDERKELLSEVEEAVAKLKDSAAKYAKYYVKVMEKILEKGEDYVAKELARLERLLEGSLA 77
Query: 535 PD 536
P+
Sbjct: 78 PE 79
>gnl|CDD|147296 pfam05038, Cytochrom_B558a, Cytochrome Cytochrome b558
alpha-subunit. Cytochrome b-245 light chain (p22-phox)
is one of the key electron transfer elements of the
NADPH oxidase in phagocytes.
Length = 186
Score = 31.4 bits (71), Expect = 1.7
Identities = 15/35 (42%), Positives = 16/35 (45%), Gaps = 1/35 (2%)
Query: 546 SGGREEPSSPRKPEAPSSCIPPPPPPASLPPPPPP 580
G + P PRK E P PP S PPP PP
Sbjct: 126 RGEQWTPILPRKKERPPVGGTIKQPP-SNPPPRPP 159
>gnl|CDD|218621 pfam05518, Totivirus_coat, Totivirus coat protein.
Length = 753
Score = 32.5 bits (74), Expect = 1.7
Identities = 13/56 (23%), Positives = 16/56 (28%), Gaps = 3/56 (5%)
Query: 552 PSSPRKPEAPS---SCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPLVRCRQPPRS 604
P +PR P PPP PA+ P P P Q +
Sbjct: 695 PQAPRPGGPPGGGGGLPPPPDLPAAAGPAPCGSSLIASPTAPPEPEPPGAEQADGA 750
>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin. Trichoplein
or mitostatin, was first defined as a meiosis-specific
nuclear structural protein. It has since been linked
with mitochondrial movement. It is associated with the
mitochondrial outer membrane, and over-expression leads
to reduction in mitochondrial motility whereas lack of
it enhances mitochondrial movement. The activity appears
to be mediated through binding the mitochondria to the
actin intermediate filaments (IFs).
Length = 349
Score = 31.8 bits (73), Expect = 1.7
Identities = 23/110 (20%), Positives = 58/110 (52%), Gaps = 2/110 (1%)
Query: 422 KINVKKVVHLLAKEEEIVAAKKKADELE-MENVDLSNRLAKKEQELDQKTQEKEDLDESL 480
+ + +++ + +E+E A +K+ + + E +D N + +E +K +E+E+ + L
Sbjct: 98 REQMDEIIERIQEEDEAEAQEKREKQKKLREEIDEFNEERIERKEE-EKEREREEELKIL 156
Query: 481 SRIKDRLEKETAGHLEAKQKVAELEDRISELKHRADFEQDERKRLENLIN 530
+++ E+E E +++ E E ++ L+ + + +DER+ L+ L
Sbjct: 157 EYQREKAEREEEREAERRERKEEKEREVARLRAQQEEAEDEREELDELRA 206
Score = 30.6 bits (70), Expect = 4.3
Identities = 26/133 (19%), Positives = 56/133 (42%), Gaps = 5/133 (3%)
Query: 398 LLDRLAQQLTLQTETGEDYDHILLKINVKKVVHLLAKEEEIVAAKKKADELEMENVDLSN 457
+ RL Q + E+ D + + ++ ++E+ A K++ + E++
Sbjct: 183 EVARLRAQQEEAEDEREELDELRADLYQEEYERKERQKEKEEAEKRRRQKQELQRAREEQ 242
Query: 458 RLAKKEQELDQKTQEKEDLDESLS--RIKDRLEKETAGHLEAKQKV--AELEDRISELKH 513
K+E+ +++ +E+ + + L + LE+E A K+ ELE +I E K
Sbjct: 243 IEEKEERLQEERAEEEAERERMLEKQAEDEELEQENAEKRRMKRLEHRRELEQQI-EEKE 301
Query: 514 RADFEQDERKRLE 526
+ E + E
Sbjct: 302 ERRAAEREEELEE 314
Score = 30.3 bits (69), Expect = 5.2
Identities = 27/106 (25%), Positives = 50/106 (47%), Gaps = 12/106 (11%)
Query: 435 EEEIVAAKKKADELEMENVDLSNRLAK------------KEQELDQKTQEKEDLDESLSR 482
EEE + A + +E E + + +++E +++ QE+E +DE + R
Sbjct: 48 EEERLKALAEEEERERKRKEERREGRAVLQEQIEEREKRRQEEYEERLQEREQMDEIIER 107
Query: 483 IKDRLEKETAGHLEAKQKVAELEDRISELKHRADFEQDERKRLENL 528
I++ E E E ++K+ E D +E + E+ ER+R E L
Sbjct: 108 IQEEDEAEAQEKREKQKKLREEIDEFNEERIERKEEEKEREREEEL 153
>gnl|CDD|193253 pfam12777, MT, Microtubule-binding stalk of dynein motor. the 380
kDa motor unit of dynein belongs to the AAA class of
chaperone-like ATPases. The core of the 380 kDa motor
unit contains a concatenated chain of six AAA modules,
of which four correspond to the ATP binding sites with
P-loop signatures described previously, and two are
modules in which the P loop has been lost in evolution.
This family is the region between D4 and D5 and is the
two predicted alpha-helical coiled coil segments that
form the stalk supporting the ATP-sensitive microtubule
binding component.
Length = 344
Score = 31.9 bits (72), Expect = 1.9
Identities = 17/57 (29%), Positives = 32/57 (56%)
Query: 454 DLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQKVAELEDRISE 510
DL +LA +E EL QK ++ + L + + +++ KE A + + KVA + ++E
Sbjct: 19 DLKAKLAIQEVELKQKNEDADKLIQVVGVETEKVSKEKAIADQEEVKVAVINKEVTE 75
>gnl|CDD|236912 PRK11448, hsdR, type I restriction enzyme EcoKI subunit R;
Provisional.
Length = 1123
Score = 32.2 bits (74), Expect = 1.9
Identities = 22/99 (22%), Positives = 39/99 (39%), Gaps = 25/99 (25%)
Query: 399 LDRLAQQLTLQTETGEDYDHILLKINVKKVVHLLAKEEEIVAAKKKADELEMENVDLSNR 458
+ L QQL LQ A+++E+VA + A ELE + +L +
Sbjct: 151 VLTLKQQLELQAREKAQSQ-----------ALAEAQQQELVALEGLAAELEEKQQELEAQ 199
Query: 459 LAK-KEQELDQKTQEKE-------------DLDESLSRI 483
L + +E+ + + K+ +L E +RI
Sbjct: 200 LEQLQEKAAETSQERKQKRKEITDQAAKRLELSEEETRI 238
Score = 29.9 bits (68), Expect = 8.7
Identities = 16/70 (22%), Positives = 30/70 (42%)
Query: 454 DLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQKVAELEDRISELKH 513
L + +Q+L+ + +EK + L E ++K ELE ++ +L+
Sbjct: 146 ALQQEVLTLKQQLELQAREKAQSQALAEAQQQELVALEGLAAELEEKQQELEAQLEQLQE 205
Query: 514 RADFEQDERK 523
+A ERK
Sbjct: 206 KAAETSQERK 215
>gnl|CDD|220614 pfam10174, Cast, RIM-binding protein of the cytomatrix active zone.
This is a family of proteins that form part of the CAZ
(cytomatrix at the active zone) complex which is
involved in determining the site of synaptic vesicle
fusion. The C-terminus is a PDZ-binding motif that binds
directly to RIM (a small G protein Rab-3A effector). The
family also contains four coiled-coil domains.
Length = 774
Score = 31.9 bits (72), Expect = 2.0
Identities = 23/110 (20%), Positives = 52/110 (47%), Gaps = 7/110 (6%)
Query: 429 VHLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLE 488
L K+ E++ + + + L ++ D+ L K +++L + QEK L + ++ LE
Sbjct: 294 QELSRKKTELLGLQTELETLANQDSDMRQHLDKLKEDLTRAEQEKAILQTEVDALRYELE 353
Query: 489 -------KETAGHLEAKQKVAELEDRISELKHRADFEQDERKRLENLINS 531
K+TA A+++ A I +++ R + + + + L+ I +
Sbjct: 354 RKHNTLTKKTASLQAAQEEKATYAGEIEDMRDRYEKTERKLRVLQKKIEN 403
Score = 30.8 bits (69), Expect = 5.3
Identities = 29/137 (21%), Positives = 55/137 (40%), Gaps = 16/137 (11%)
Query: 402 LAQQLTLQTETGEDYDHILLKINVKKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAK 461
L Q L+T +D D HL +E++ A+++ L+ E L L +
Sbjct: 304 LGLQTELETLANQDSDM---------RQHLDKLKEDLTRAEQEKAILQTEVDALRYELER 354
Query: 462 KEQELDQKTQEKEDLDESLSR-------IKDRLEKETAGHLEAKQKVAELEDRISELKHR 514
K L +KT + E + ++DR EK ++K+ L++ + R
Sbjct: 355 KHNTLTKKTASLQAAQEEKATYAGEIEDMRDRYEKTERKLRVLQKKIENLQETFRRKERR 414
Query: 515 ADFEQDERKRLENLINS 531
E++ + L+ N+
Sbjct: 415 LKEEKERLRSLQTDTNT 431
>gnl|CDD|221389 pfam12037, DUF3523, Domain of unknown function (DUF3523). This
presumed domain is functionally uncharacterized. This
domain is found in eukaryotes. This domain is typically
between 257 to 277 amino acids in length. This domain is
found associated with pfam00004. This domain has a
conserved LER sequence motif.
Length = 276
Score = 31.7 bits (72), Expect = 2.0
Identities = 20/66 (30%), Positives = 35/66 (53%), Gaps = 2/66 (3%)
Query: 433 AKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKT-QEKEDLDESLSRIKDRLEKET 491
A EEEI+ +++ E E E ++ N AK E E + +E+E+ D + +K + +E
Sbjct: 151 ATEEEILEMRRETIEEEAE-LERENIRAKIEAEARGRAKEERENEDINREMLKLKANEER 209
Query: 492 AGHLEA 497
LE+
Sbjct: 210 ETVLES 215
>gnl|CDD|172947 PRK14472, PRK14472, F0F1 ATP synthase subunit B; Provisional.
Length = 175
Score = 30.9 bits (70), Expect = 2.1
Identities = 24/75 (32%), Positives = 39/75 (52%), Gaps = 8/75 (10%)
Query: 441 AKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQK 500
AK +A+ + +N +L LAK + E D+ +E ++ E L R E H EAK+
Sbjct: 64 AKDEAEAILRKNREL---LAKADAEADKIIREGKEYAEKL-----RAEITEKAHTEAKKM 115
Query: 501 VAELEDRISELKHRA 515
+A ++ I + K RA
Sbjct: 116 IASAKEEIEQEKRRA 130
>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain. This domain is
found at the N terminus of SMC proteins. The SMC
(structural maintenance of chromosomes) superfamily
proteins have ATP-binding domains at the N- and
C-termini, and two extended coiled-coil domains
separated by a hinge in the middle. The eukaryotic SMC
proteins form two kind of heterodimers: the SMC1/SMC3
and the SMC2/SMC4 types. These heterodimers constitute
an essential part of higher order complexes, which are
involved in chromatin and DNA dynamics. This family also
includes the RecF and RecN proteins that are involved in
DNA metabolism and recombination.
Length = 1162
Score = 32.2 bits (73), Expect = 2.1
Identities = 24/126 (19%), Positives = 44/126 (34%), Gaps = 6/126 (4%)
Query: 405 QLTLQTETGEDYDHILLKINVKKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQ 464
+L + E+ K K L ++EEI +K EL+ + ++EQ
Sbjct: 305 KLERRKVDDEEKLKESEKELKKLEKELKKEKEEIEELEK---ELKELEIKREAEEEEEEQ 361
Query: 465 ELDQKTQE---KEDLDESLSRIKDRLEKETAGHLEAKQKVAELEDRISELKHRADFEQDE 521
+ + +E+L +RL E + E E L ++ E+D
Sbjct: 362 LEKLQEKLEQLEEELLAKKKLESERLSSAAKLKEEELELKNEEEKEAKLLLELSEQEEDL 421
Query: 522 RKRLEN 527
K +
Sbjct: 422 LKEEKK 427
>gnl|CDD|218115 pfam04502, DUF572, Family of unknown function (DUF572). Family of
eukaryotic proteins with undetermined function.
Length = 321
Score = 31.6 bits (72), Expect = 2.2
Identities = 35/165 (21%), Positives = 62/165 (37%), Gaps = 24/165 (14%)
Query: 430 HLLAKEEEIVAAKKKADELE---MENVDLSNRLAKKEQELDQKTQEKEDLDE--SLSRIK 484
L +E+E K++ +EL M+ L NR A ++E++ E L+E L +
Sbjct: 111 DKLDEEQEERVEKEREEELAGDAMKK--LENRTADSKREME----VLERLEELKELQSRR 164
Query: 485 DRLEKET---AGHLEAKQKVAELEDR----ISELKHRADFEQDERKRLENLINSGSVPDD 537
++ + A K++ E E+ I L + E+D R+ + +D
Sbjct: 165 ADVDVNSMLEALFRREKKEEEEEEEEDEALIKSLSFGPETEEDRRRADDEDSEDDEEDND 224
Query: 538 TKAVGLSRSGGR------EEPSSPRKPEAPSSCIPPPPPPASLPP 576
S S + S+ ++ EAPSS P
Sbjct: 225 NTPSPKSGSSSPAKPTSILKKSAAKRSEAPSSSKAKKNSRGIPKP 269
>gnl|CDD|236795 PRK10920, PRK10920, putative uroporphyrinogen III
C-methyltransferase; Provisional.
Length = 390
Score = 31.6 bits (72), Expect = 2.2
Identities = 15/74 (20%), Positives = 35/74 (47%), Gaps = 10/74 (13%)
Query: 441 AKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQK 500
K++A N L+N+L ++ + +K++L+ L + L++ A ++
Sbjct: 58 GKQQAQNQTATNDALANQLTALQK---AQESQKQELEGILKQQAKALDQ-------ANRQ 107
Query: 501 VAELEDRISELKHR 514
A L ++ EL+ +
Sbjct: 108 QAALAKQLDELQQK 121
>gnl|CDD|237046 PRK12297, obgE, GTPase CgtA; Reviewed.
Length = 424
Score = 31.6 bits (73), Expect = 2.2
Identities = 30/131 (22%), Positives = 53/131 (40%), Gaps = 24/131 (18%)
Query: 414 EDYDHILLKIN--VKKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQK-- 469
EDY+ KIN +K L + +IV A K M+ + L + +++L K
Sbjct: 256 EDYE----KINKELKLYNPRLLERPQIVVANK------MDLPEAEENLEEFKEKLGPKVF 305
Query: 470 ---TQEKEDLDESLSRIKDRLEKETAGHLEAKQKVAELEDRISELKHRADFEQDER---- 522
+ LDE L + + LE+ LE ++ E+ + E + +DE
Sbjct: 306 PISALTGQGLDELLYAVAELLEETPEFPLEEEEVEEEVYYKFEEEEKDFTITRDEDGVFV 365
Query: 523 ---KRLENLIN 530
+++E L
Sbjct: 366 VSGEKIERLFK 376
>gnl|CDD|236768 PRK10819, PRK10819, transport protein TonB; Provisional.
Length = 246
Score = 31.2 bits (71), Expect = 2.3
Identities = 19/67 (28%), Positives = 29/67 (43%), Gaps = 1/67 (1%)
Query: 551 EPSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPLVRCRQPPRSKLKSRI 610
P +PE IP PP A + P P + P P+P+ + + P+ ++K
Sbjct: 67 PPEPVVEPEPEPEPIPEPPKEAPVVIPKPEPKPKPKPKPK-PKPVKKVEEQPKREVKPVE 125
Query: 611 PRPYFPN 617
PRP P
Sbjct: 126 PRPASPF 132
>gnl|CDD|221818 pfam12868, DUF3824, Domain of unknwon function (DUF3824). This is
a repeating domain found in fungal proteins. It is
proline-rich, and the function is not known.
Length = 135
Score = 30.2 bits (68), Expect = 2.3
Identities = 18/73 (24%), Positives = 20/73 (27%), Gaps = 1/73 (1%)
Query: 545 RSGGREEPSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTC-PRPLVRCRQPPR 603
+ RE +A S P P PP PP P P P PP
Sbjct: 30 KKAERERERYRHDHDAYSDSYEEPYDPTPYPPSPPVSDPRYYPNSNYFPPPPGSTPVPPP 89
Query: 604 SKLKSRIPRPYFP 616
P Y P
Sbjct: 90 GPQPGYNPADYPP 102
Score = 28.7 bits (64), Expect = 8.7
Identities = 15/46 (32%), Positives = 16/46 (34%), Gaps = 4/46 (8%)
Query: 553 SSPRKPEAPSSC----IPPPPPPASLPPPPPPRRSSDLPYLTCPRP 594
S+P P P PPPP A PP P PRP
Sbjct: 83 STPVPPPGPQPGYNPADYPPPPGAVPPPQNYPYPPGPGQDPYAPRP 128
>gnl|CDD|129661 TIGR00570, cdk7, CDK-activating kinase assembly factor MAT1. All
proteins in this family for which functions are known
are cyclin dependent protein kinases that are components
of TFIIH, a complex that is involved in nucleotide
excision repair and transcription initiation. Also known
as MAT1 (menage a trois 1). This family is based on the
phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
Stanford University) [DNA metabolism, DNA replication,
recombination, and repair].
Length = 309
Score = 31.3 bits (71), Expect = 2.4
Identities = 31/156 (19%), Positives = 63/156 (40%), Gaps = 21/156 (13%)
Query: 425 VKKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIK 484
V+ +V+ L ++ KKK + + EN D+ + ++ T+E+E+L+E
Sbjct: 102 VEDIVYNLTNNIDLENTKKKIETYQKENKDVIQK------NKEKSTREQEELEE------ 149
Query: 485 DRLEKETAGHLEAKQKVAELEDRISELKHRADFEQDERKRLENLINSGSVPDDTKAVGLS 544
LE E + + + + E+ + + + ++++ L+ L S + A
Sbjct: 150 -ALEFEKEEEEQRRLLLQKEEE-----EQQMNKRKNKQALLDELETSTLPAAELIAQHKK 203
Query: 545 RSGGREEPSSPRKPEAP---SSCIPPPPPPASLPPP 577
S E KPE P S+ I + +P
Sbjct: 204 NSVKLEMQVEKPKPEKPNTFSTGIKMGYQISLVPVQ 239
>gnl|CDD|220777 pfam10486, PI3K_1B_p101, Phosphoinositide 3-kinase gamma adapter
protein p101 subunit. Class I PI3Ks are dual-specific
lipid and protein kinases involved in numerous
intracellular signaling pathways. Class IB PI3K,
p110gamma, is mainly activated by seven-transmembrane
G-protein-coupled receptors (GPCRs), through its
regulatory subunit p101 and G-protein beta-gamma
subunits.
Length = 856
Score = 31.9 bits (72), Expect = 2.4
Identities = 29/115 (25%), Positives = 42/115 (36%), Gaps = 10/115 (8%)
Query: 474 EDLDESLSRIKDRLE-KETAGHLEAKQKVAELEDRISELKHRADFEQDERKRLENLINSG 532
ED DE + R E +E H + KQK+ +L +L R D L L +G
Sbjct: 384 EDSDEGSDELPARGEPQEERAHNKLKQKIYKLFKSKGQLVKRRDLSPCIDWPLP-LRRAG 442
Query: 533 SV--PDDTKAVGLSRSGGREEPSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSD 585
S+ P ++ G SR S A + + P + P S D
Sbjct: 443 SLCGPQLSRIPGRSRR------SKSLPQPALGTQLLDLQLPQKVAFKRRPFLSGD 491
>gnl|CDD|114855 pfam06160, EzrA, Septation ring formation regulator, EzrA. During
the bacterial cell cycle, the tubulin-like cell-division
protein FtsZ polymerises into a ring structure that
establishes the location of the nascent division site.
EzrA modulates the frequency and position of FtsZ ring
formation.
Length = 559
Score = 31.8 bits (73), Expect = 2.4
Identities = 32/117 (27%), Positives = 56/117 (47%), Gaps = 17/117 (14%)
Query: 399 LDRLAQQLTLQTETGEDYDHILLKIN-VKKVVHLLAKEEEIVAAKKKADELEMENVDLSN 457
LDRL Q TL + E + ++ ++ L + +A KK A +L
Sbjct: 329 LDRLQQSYTLNEDELETVRELEKQLEELEAQYDQLVER---IAEKKVAYS------ELQE 379
Query: 458 RLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQKVAELEDRISELKHR 514
L + E++L++ +E+E+L ESL L K+ LEA++K+ E ++ E+K
Sbjct: 380 ELEEIEKQLEEIEKEQEELSESLQG----LRKD---ELEAREKLQEYRQKLHEIKRY 429
Score = 31.0 bits (71), Expect = 3.9
Identities = 23/83 (27%), Positives = 37/83 (44%), Gaps = 15/83 (18%)
Query: 421 LKINVKKVVHLLAKEEEIVAA-KKKADELEMENVDL-------SNRLAKKEQELDQKTQE 472
+ IN+ V L + + V K+K +EL ++N L +NR +E+ + +E
Sbjct: 463 VPINMDAVNRQLEEATDDVETLKEKTEEL-VDNATLAEQLIQYANRYRSDNEEVAEALKE 521
Query: 473 KEDL------DESLSRIKDRLEK 489
E L E+L I LEK
Sbjct: 522 AERLFENYDYKEALEIIATALEK 544
>gnl|CDD|224617 COG1703, ArgK, Putative periplasmic protein kinase ArgK and related
GTPases of G3E family [Amino acid transport and
metabolism].
Length = 323
Score = 31.5 bits (72), Expect = 2.5
Identities = 15/70 (21%), Positives = 29/70 (41%), Gaps = 3/70 (4%)
Query: 970 LTTLNEAKLDNENMKRRREEEEKRARQDAEMKKRTTDRKQGAKENGVVISNGSVKGEFDD 1029
+ L +A D+ ++ R RT R + + +N +VK + D
Sbjct: 242 IDELWDAIEDHRKFLTESGLFTEKRRTQYVEWIRTLVRDEVLDR---LEANPAVKKVYAD 298
Query: 1030 LISALRTGDV 1039
L +A+R G++
Sbjct: 299 LEAAVRKGEL 308
>gnl|CDD|203371 pfam06005, DUF904, Protein of unknown function (DUF904). This
family consists of several bacterial and archaeal
hypothetical proteins of unknown function.
Length = 72
Score = 28.8 bits (65), Expect = 2.5
Identities = 13/55 (23%), Positives = 26/55 (47%)
Query: 436 EEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKE 490
E I + + +EL+ EN LS + E+E + +E+ L + +L++
Sbjct: 18 ETIALLQMEIEELKEENEQLSEENEELEEENKKLKEERNAWQSRLRALLGKLDEV 72
>gnl|CDD|215641 PLN03237, PLN03237, DNA topoisomerase 2; Provisional.
Length = 1465
Score = 31.8 bits (72), Expect = 2.6
Identities = 36/154 (23%), Positives = 62/154 (40%), Gaps = 13/154 (8%)
Query: 433 AKEEEIVAAKKKADELEMENVD---LSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEK 489
A E E + +E ENV A +++ +EKE+ DE L +KDRL
Sbjct: 1214 ASESETTEETYGSSAMETENVAEVVKPKGRAGAKKKAPAAAKEKEEEDEILD-LKDRLAA 1272
Query: 490 ETAGHLEAKQKVAELEDRISELKHRADFEQDERKRLENLINSGSVPDDTKAVGLS----- 544
A + A++E+ + + R + + ++I+ DD AV +S
Sbjct: 1273 YNLD--SAPAQSAKMEETVKAVPARRAAARKKPLASVSVISDSDDDDDDFAVEVSLAERL 1330
Query: 545 -RSGGREEPSSPRKPEAPSSCIPPPPPPASLPPP 577
+ GGR+ P++ K A PA++
Sbjct: 1331 KKKGGRK-PAAANKKAAKPPAAAKKRGPATVQSG 1363
>gnl|CDD|223163 COG0085, RpoB, DNA-directed RNA polymerase, beta subunit/140 kD
subunit [Transcription].
Length = 1060
Score = 31.8 bits (73), Expect = 2.6
Identities = 35/160 (21%), Positives = 55/160 (34%), Gaps = 21/160 (13%)
Query: 672 SKSKILSVIDGRRAQNCTILLSKLKM-TDEEISKAI-------LSMDSADQLPLDMLEQL 723
K + ID +R TILL L + TDEEI +A L ++ L++ E+
Sbjct: 177 PKDNLYVRIDRKRKIPVTILLRALGLETDEEIIEAFGGDELTDLVPPEGEEALLEIYEEA 236
Query: 724 LKFTPNTEEAALLEEHSTETLARADRFL--YEISKIPHYEQRLRCLHYKKRFHFWISDIE 781
A L + R D Y+ +K ++ L L D+
Sbjct: 237 KGEKITARNALELIGSRVFVVKRYDAKEGRYKRAKYVLDKELLPHLGEAGE----RYDLS 292
Query: 782 PKIKSVMEASREVTRSRRLRKLLELVLALGNYMNKGARGN 821
K+ ++ L+EL L G + GN
Sbjct: 293 RVGKAKDII-------AMIKYLIELRLGKGEEDDIDHLGN 325
>gnl|CDD|218107 pfam04484, DUF566, Family of unknown function (DUF566). Family of
related proteins that is plant specific.
Length = 313
Score = 31.5 bits (71), Expect = 2.7
Identities = 20/88 (22%), Positives = 29/88 (32%)
Query: 544 SRSGGREEPSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPLVRCRQPPR 603
RS R SS + SS PAS P P R +S + R
Sbjct: 17 PRSSSRRRLSSSFLSTSASSRPRRLNAPASPPSSSPARNTSSSSSFGLSKQRPSSLSRGR 76
Query: 604 SKLKSRIPRPYFPNVSVQVNGTIWTELD 631
+ P P+ + +NG++ T
Sbjct: 77 LSSRFVSPSRGSPSAAASLNGSLATAST 104
>gnl|CDD|237192 PRK12758, PRK12758, DNA topoisomerase IV subunit A; Provisional.
Length = 869
Score = 31.5 bits (72), Expect = 2.7
Identities = 28/121 (23%), Positives = 47/121 (38%), Gaps = 34/121 (28%)
Query: 420 LLKINVKKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAKK--EQELDQKTQEKEDLD 477
+L+ + V LL +E EI + ELE E L K E+ + ++ +E E +
Sbjct: 333 ILRRSTDHTVSLLKQELEI-----RLSELE-EQW-HFASLEKIFIEERIYKEIEEAETWE 385
Query: 478 ESLSRIKDRLE-----------KETAGHL--------------EAKQKVAELEDRISELK 512
+ I LE ++ L +A + +A LE I+E+K
Sbjct: 386 AVIEAIDKGLEPFKKQFIREVTEDDIVRLTEIKIKRISKFDSDKADELIARLEAEIAEVK 445
Query: 513 H 513
H
Sbjct: 446 H 446
>gnl|CDD|217453 pfam03251, Tymo_45kd_70kd, Tymovirus 45/70Kd protein. Tymoviruses
are single stranded RNA viruses. This family includes a
protein of unknown function that has been named based on
its molecular weight. Tymoviruses such as the ononis
yellow mosaic tymovirus encode only three proteins. Of
these two are overlapping this protein overlaps a larger
ORF that is thought to be the polymerase.
Length = 458
Score = 31.6 bits (72), Expect = 2.7
Identities = 16/71 (22%), Positives = 19/71 (26%), Gaps = 8/71 (11%)
Query: 552 PSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPLVRCRQPPRSKLKSRIP 611
+P P P P P P R LP PR R L
Sbjct: 247 HPHTTRPSPPRPAFSRSPSSPLSPLPRPSTRRGLLPNPRLPR-------ASRGHLPPPTS 299
Query: 612 -RPYFPNVSVQ 621
P PN ++
Sbjct: 300 SAPPRPNGGLR 310
Score = 30.4 bits (69), Expect = 6.1
Identities = 19/53 (35%), Positives = 20/53 (37%), Gaps = 7/53 (13%)
Query: 549 REEPSSPRKPEAPSSCIPP-PPPPAS-----LPPPPPPRRS-SDLPYLTCPRP 594
PS PR + S P P P S LP P PR S LP T P
Sbjct: 250 TTRPSPPRPAFSRSPSSPLSPLPRPSTRRGLLPNPRLPRASRGHLPPPTSSAP 302
Score = 30.1 bits (68), Expect = 7.0
Identities = 20/68 (29%), Positives = 26/68 (38%), Gaps = 6/68 (8%)
Query: 548 GREEPSSPRKPEAPSSCIPPPPPPASLPPPPPP------RRSSDLPYLTCPRPLVRCRQP 601
E+P PR P+ PS P L RRS LP L+ RP+ +P
Sbjct: 106 PGEQPKLPRAPQLPSHPRRHRSLPHHLHHSSKHRNRLHARRSDVLPSLSNSRPVHELPKP 165
Query: 602 PRSKLKSR 609
L+ R
Sbjct: 166 STPVLQPR 173
>gnl|CDD|217565 pfam03449, GreA_GreB_N, Transcription elongation factor,
N-terminal. This domain adopts a long alpha-hairpin
structure.
Length = 71
Score = 28.6 bits (65), Expect = 2.7
Identities = 18/61 (29%), Positives = 33/61 (54%), Gaps = 8/61 (13%)
Query: 459 LAKKEQELDQ-KTQEKEDLDESLSRIKDRLE----KETAGHLEAKQKVAELEDRISELKH 513
K ++EL++ K E+ ++ ++ I + E E A + AK++ E+E RI EL+
Sbjct: 8 YEKLQEELERLKNVERPEIIKA---IAEAREHGDLSENAEYHAAKEEQREIEARIRELED 64
Query: 514 R 514
R
Sbjct: 65 R 65
>gnl|CDD|153364 cd07680, F-BAR_PACSIN1, The F-BAR (FES-CIP4 Homology and
Bin/Amphiphysin/Rvs) domain of Protein kinase C and
Casein kinase Substrate in Neurons 1 (PACSIN1). F-BAR
domains are dimerization modules that bind and bend
membranes and are found in proteins involved in membrane
dynamics and actin reorganization. Protein kinase C and
Casein kinase Substrate in Neurons (PACSIN) proteins,
also called Synaptic dynamin-associated proteins
(Syndapins), act as regulators of cytoskeletal and
membrane dynamics. Vetebrates harbor three isoforms with
distinct expression patterns and specific functions.
PACSIN 1 or Syndapin I is expressed specifically in the
brain and is localized in neurites and synaptic boutons.
It binds the brain-specific proteins dynamin I,
synaptojanin, synapsin I, and neural Wiskott-Aldrich
syndrome protein (nWASP), and functions as a link
between the cytoskeletal machinery and synaptic vesicle
endocytosis. PACSIN 1 interacts with huntingtin and may
be implicated in the neuropathology of Huntington's
disease. It contains an N-terminal F-BAR domain and a
C-terminal SH3 domain. F-BAR domains form banana-shaped
dimers with a positively-charged concave surface that
binds to negatively-charged lipid membranes. They can
induce membrane deformation in the form of long tubules.
Length = 258
Score = 31.2 bits (70), Expect = 2.7
Identities = 18/78 (23%), Positives = 37/78 (47%), Gaps = 2/78 (2%)
Query: 426 KKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKD 485
KK HL KEE++ ++ + E + K + ++D+ Q+ + E ++ D
Sbjct: 139 KKAYHLACKEEKLAMTREANSKAEQSVT--PEQQKKLQDKVDKCKQDVQKTQEKYEKVLD 196
Query: 486 RLEKETAGHLEAKQKVAE 503
+ K T ++E ++V E
Sbjct: 197 DVGKTTPQYMENMEQVFE 214
>gnl|CDD|197891 smart00818, Amelogenin, Amelogenins, cell adhesion proteins, play a
role in the biomineralisation of teeth. They seem to
regulate formation of crystallites during the secretory
stage of tooth enamel development and are thought to
play a major role in the structural organisation and
mineralisation of developing enamel. The extracellular
matrix of the developing enamel comprises two major
classes of protein: the hydrophobic amelogenins and the
acidic enamelins. Circular dichroism studies of porcine
amelogenin have shown that the protein consists of 3
discrete folding units: the N-terminal region appears to
contain beta-strand structures, while the C-terminal
region displays characteristics of a random coil
conformation. Subsequent studies on the bovine protein
have indicated the amelogenin structure to contain a
repetitive beta-turn segment and a "beta-spiral" between
Gln112 and Leu138, which sequester a (Pro, Leu, Gln)
rich region. The beta-spiral offers a probable site for
interactions with Ca2+ ions. Muatations in the human
amelogenin gene (AMGX) cause X-linked hypoplastic
amelogenesis imperfecta, a disease characterised by
defective enamel. A 9bp deletion in exon 2 of AMGX
results in the loss of codons for Ile5, Leu6, Phe7 and
Ala8, and replacement by a new threonine codon,
disrupting the 16-residue (Met1-Ala16) amelogenin signal
peptide.
Length = 165
Score = 30.5 bits (69), Expect = 2.7
Identities = 16/45 (35%), Positives = 20/45 (44%)
Query: 551 EPSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPL 595
+P P++P P + P PP PP PP LP L PL
Sbjct: 106 QPPQPQQPMQPQPPVHPIPPLPPQPPLPPMFPMQPLPPLLPDLPL 150
>gnl|CDD|222689 pfam14335, DUF4391, Domain of unknown function (DUF4391). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria and archaea.
Proteins in this family are typically between 220 and
257 amino acids in length.
Length = 221
Score = 30.7 bits (70), Expect = 2.8
Identities = 20/89 (22%), Positives = 37/89 (41%), Gaps = 10/89 (11%)
Query: 396 WLLLDRLAQQLTLQTETGEDYDHILLKINVKKV-----VHLLAKEEEIVAAKKKADELEM 450
W LD L+ L Y+ I+ +I V+ +E V ++ +ELE
Sbjct: 130 WAFLDSLSLSLFSGLNLDALYESIVNQIIKLNASSNTGVYEKEDLKERVERLEQIEELEK 189
Query: 451 ENVDLSNRLAK-----KEQELDQKTQEKE 474
E L +L K ++ EL+ + ++ +
Sbjct: 190 EIAKLKKKLKKEKQFNRKVELNLEIKKLK 218
>gnl|CDD|204405 pfam10146, zf-C4H2, Zinc finger-containing protein. This is a
family of proteins which appears to have a highly
conserved zinc finger domain at the C terminal end,
described as -C-X2-CH-X3-H-X5-C-X2-C-. The structure is
predicted to contain a coiled coil. Members are
annotated as being tumour-associated antigen HCA127 in
humans but this could not confirmed.
Length = 215
Score = 31.0 bits (70), Expect = 2.9
Identities = 31/151 (20%), Positives = 55/151 (36%), Gaps = 20/151 (13%)
Query: 426 KKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKD 485
K + A+EE + + +K E + E +L + +EL Q + D++ + + K
Sbjct: 15 KLKDEIKAEEEALESEEKHLKEYDKEMEELLEEKMQHVEELRQIHADINDMETEIKQSKS 74
Query: 486 RLEKETAGHLEAKQKVAELEDRISELKHRADFEQDERKRLENLINSGSVPDDTKAVGLSR 545
LE+ K+A + + LK + + LE L P +
Sbjct: 75 ELER-------RMGKIARMHGEYNPLKESINEMRKLELGLEEL------PQLHEEERQKI 121
Query: 546 SGGREEPSSPRKPEAPSSCIPPPPPPASLPP 576
+SP P PP P++LP
Sbjct: 122 ----SPATSPVPPVPLP---DPPAFPSTLPA 145
>gnl|CDD|220237 pfam09428, DUF2011, Fungal protein of unknown function (DUF2011).
This is a family of fungal proteins whose function is
unknown.
Length = 130
Score = 29.9 bits (68), Expect = 2.9
Identities = 6/29 (20%), Positives = 16/29 (55%)
Query: 986 RREEEEKRARQDAEMKKRTTDRKQGAKEN 1014
R +E ++ ++ + +++ R+Q KE
Sbjct: 102 RTKERAEKEKRTRKNREKKFKRRQKEKEK 130
>gnl|CDD|216989 pfam02354, Latrophilin, Latrophilin Cytoplasmic C-terminal region.
This family consists of the cytoplasmic C-terminal
region in latrophilin. Latrophilin is a synaptic Ca2+
independent alpha- latrotoxin (LTX) receptor and is a
novel member of the secretin family of G-protein coupled
receptors that are involved in secretion. Latrophilin
mRNA is present only in neuronal tissue. Lactrophillin
interacts with G-alpha O.
Length = 366
Score = 31.2 bits (70), Expect = 3.0
Identities = 14/56 (25%), Positives = 19/56 (33%), Gaps = 4/56 (7%)
Query: 549 REEPSSPR-KPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPLVRCRQPPR 603
R+ P+ P P + +PPPPP +P L L Q R
Sbjct: 285 RDSPTYPSSSPSTQTEALPPPPPA---KCGDAEIYYKSMPNLGARNQLQTYYQLGR 337
>gnl|CDD|236658 PRK10153, PRK10153, DNA-binding transcriptional activator CadC;
Provisional.
Length = 517
Score = 31.6 bits (72), Expect = 3.0
Identities = 12/46 (26%), Positives = 20/46 (43%)
Query: 567 PPPPPASLPPPPPPRRSSDLPYLTCPRPLVRCRQPPRSKLKSRIPR 612
PPP P ++P P S++ + P C+ P +SK +
Sbjct: 120 PPPIPEAVPATAEPSESANAQFPAPPLTRAVCQSPAKSKRFTTFWV 165
>gnl|CDD|221533 pfam12329, TMF_DNA_bd, TATA element modulatory factor 1 DNA
binding. This is the middle region of a family of TATA
element modulatory factor 1 proteins conserved in
eukaryotes that contains at its N-terminal section a
number of leucine zippers that could potentially form
coiled coil structures. The whole proteins bind to the
TATA element of some RNA polymerase II promoters and
repress their activity. by competing with the binding of
TATA binding protein. TMFs are evolutionarily conserved
golgins that bind Rab6, a ubiquitous ras-like
GTP-binding Golgi protein, and contribute to Golgi
organisation in animal and plant cells.
Length = 74
Score = 28.8 bits (65), Expect = 3.1
Identities = 13/46 (28%), Positives = 21/46 (45%)
Query: 432 LAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLD 477
L I + K ELE E +L +L K E+EL+ + + +
Sbjct: 29 LKHNNTIKKLRAKNKELEKEIAELKKKLEKLEKELENLEERLKRAE 74
Score = 28.4 bits (64), Expect = 3.3
Identities = 17/83 (20%), Positives = 40/83 (48%), Gaps = 13/83 (15%)
Query: 430 HLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEK 489
LA+++E +A + E E LS + K + + + ++L++ ++ +K +LEK
Sbjct: 5 KKLAEKDEQIAQLME----EGEK--LSKKELKHNNTIKKLRAKNKELEKEIAELKKKLEK 58
Query: 490 ETAGHLEAKQKVAELEDRISELK 512
+++ LE+R+ +
Sbjct: 59 LE-------KELENLEERLKRAE 74
>gnl|CDD|234749 PRK00405, rpoB, DNA-directed RNA polymerase subunit beta; Reviewed.
Length = 1112
Score = 31.6 bits (73), Expect = 3.1
Identities = 15/33 (45%), Positives = 17/33 (51%), Gaps = 1/33 (3%)
Query: 671 ISKSKILSV-IDGRRAQNCTILLSKLKMTDEEI 702
IL V ID RR T+LL L +DEEI
Sbjct: 181 FDPKDILYVRIDRRRKLPVTVLLRALGYSDEEI 213
>gnl|CDD|193472 pfam12999, PRKCSH-like, Glucosidase II beta subunit-like. The
sequences found in this family are similar to a region
found in the beta-subunit of glucosidase II, which is
also known as protein kinase C substrate 80K-H (PRKCSH).
The enzyme catalyzes the sequential removal of two
alpha-1,3-linked glucose residues in the second step of
N-linked oligosaccharide processing. The beta subunit is
required for the solubility and stability of the
heterodimeric enzyme, and is involved in retaining the
enzyme within the endoplasmic reticulum.
Length = 176
Score = 30.5 bits (69), Expect = 3.1
Identities = 19/63 (30%), Positives = 26/63 (41%), Gaps = 7/63 (11%)
Query: 411 ETGEDYDHILLKINVKKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKT 470
E ++ L + N L KE ++AA+KK DE L RL + E L K
Sbjct: 121 EIARQFEEYLTEHNNSVKNGLKIKEGLLLAAQKKRDE-------LKKRLKELEDSLPAKQ 173
Query: 471 QEK 473
Q
Sbjct: 174 QRL 176
>gnl|CDD|217817 pfam03961, DUF342, Protein of unknown function (DUF342). This
family of bacterial proteins has no known function. The
proteins are in the region of 500-600 amino acid
residues in length.
Length = 450
Score = 31.5 bits (72), Expect = 3.1
Identities = 20/89 (22%), Positives = 40/89 (44%), Gaps = 12/89 (13%)
Query: 447 ELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDR------------LEKETAGH 494
E+ ++ +L L + E+EL + +E E + + L ++ + LEK
Sbjct: 324 EVGVDFPELKEELKELEEELKELEEELEKIKKLLKKLPKKARGQLPPEKREQLEKLLETK 383
Query: 495 LEAKQKVAELEDRISELKHRADFEQDERK 523
+ +++ ELE+ + ELK + E K
Sbjct: 384 EKLSEELEELEEELKELKEELESLYSEGK 412
>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein. TolA couples the inner
membrane complex of itself with TolQ and TolR to the
outer membrane complex of TolB and OprL (also called
Pal). Most of the length of the protein consists of
low-complexity sequence that may differ in both length
and composition from one species to another,
complicating efforts to discriminate TolA (the most
divergent gene in the tol-pal system) from paralogs such
as TonB. Selection of members of the seed alignment and
criteria for setting scoring cutoffs are based largely
conserved operon struction. //The Tol-Pal complex is
required for maintaining outer membrane integrity. Also
involved in transport (uptake) of colicins and
filamentous DNA, and implicated in pathogenesis.
Transport is energized by the proton motive force. TolA
is an inner membrane protein that interacts with
periplasmic TolB and with outer membrane porins ompC,
phoE and lamB [Transport and binding proteins, Other,
Cellular processes, Pathogenesis].
Length = 346
Score = 31.0 bits (70), Expect = 3.1
Identities = 25/101 (24%), Positives = 36/101 (35%), Gaps = 18/101 (17%)
Query: 426 KKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKD 485
KK K+ E A K A E AKK+ +K E E ++ ++ K
Sbjct: 142 KKAKEEAKKQAEEEAKAKAAAE------------AKKKAAEAKKKAEAEAKAKAEAKAKA 189
Query: 486 RLEKETAGHLEAKQKVAELEDRISELKHRADFEQDERKRLE 526
+ E+ A AK K A E +A+ E E
Sbjct: 190 KAEEAKAKAEAAKAKAAA------EAAAKAEAEAAAAAAAE 224
>gnl|CDD|216108 pfam00769, ERM, Ezrin/radixin/moesin family. This family of
proteins contain a band 4.1 domain (pfam00373), at their
amino terminus. This family represents the rest of these
proteins.
Length = 244
Score = 30.9 bits (70), Expect = 3.2
Identities = 22/69 (31%), Positives = 39/69 (56%)
Query: 435 EEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGH 494
EEE +KKADELE EN L A E+E ++ E ++ ++++++ EK+ A
Sbjct: 46 EEEAQLLEKKADELEEENRRLEEEAAASEEERERLEAEVDEATAEVAKLEEEREKKEAET 105
Query: 495 LEAKQKVAE 503
+ +Q++ E
Sbjct: 106 RQLQQELRE 114
Score = 30.1 bits (68), Expect = 5.9
Identities = 25/86 (29%), Positives = 45/86 (52%), Gaps = 7/86 (8%)
Query: 441 AKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQK 500
A+++ ELE + + + ++EL++ + +L+E L + E+E A LE K
Sbjct: 3 AEREQQELEERMEQMEEDMRRAQKELEEYEETALELEEKLKQ-----EEEEAQLLEKKAD 57
Query: 501 VAELEDRISELKHRADFEQDERKRLE 526
ELE+ L+ A ++ER+RLE
Sbjct: 58 --ELEEENRRLEEEAAASEEERERLE 81
>gnl|CDD|235124 PRK03427, PRK03427, cell division protein ZipA; Provisional.
Length = 333
Score = 31.2 bits (71), Expect = 3.2
Identities = 23/87 (26%), Positives = 36/87 (41%), Gaps = 7/87 (8%)
Query: 523 KRLENLINSGSVPDDTKAVGLSR-SGGREEPSSPRKPEA--PSSCIPPPPPPASLPPPPP 579
KR ++ + D+ VG R P++ ++ EA PS PP AS P P
Sbjct: 47 KRDDDSYDEDVEDDE--GVGEVRVHRVNHAPANAQEHEAARPSPQHQYQPPYASAQPRQP 104
Query: 580 PRRSSD--LPYLTCPRPLVRCRQPPRS 604
++ + +P PRP QP +
Sbjct: 105 VQQPPEAQVPPQHAPRPAQPAPQPVQQ 131
>gnl|CDD|218825 pfam05956, APC_basic, APC basic domain. This region of the APC
family of proteins is known as the basic domain. It
contains a high proportion of positively charged amino
acids and interacts with microtubules.
Length = 359
Score = 31.2 bits (70), Expect = 3.3
Identities = 19/56 (33%), Positives = 24/56 (42%), Gaps = 3/56 (5%)
Query: 549 REEPSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPLVRCRQPPRS 604
R + ++P K + P P PP S P P +RS L P L PPRS
Sbjct: 17 RSQSTTPSK-KGPPLKTQPSDPPKS--PSPGQQRSRSLHRPAKPSELAELSPPPRS 69
>gnl|CDD|225499 COG2948, VirB10, Type IV secretory pathway, VirB10 components
[Intracellular trafficking and secretion].
Length = 360
Score = 30.9 bits (70), Expect = 3.3
Identities = 12/44 (27%), Positives = 18/44 (40%), Gaps = 4/44 (9%)
Query: 540 AVGLSRSGGREEPSSPRKP----EAPSSCIPPPPPPASLPPPPP 579
L G ++ + +P E + +PP P LPPP P
Sbjct: 40 GFALIALQGEKKRINNTQPPSNVERGTPPLPPLPDDPPLPPPLP 83
>gnl|CDD|216319 pfam01134, GIDA, Glucose inhibited division protein A.
Length = 391
Score = 31.0 bits (71), Expect = 3.3
Identities = 9/27 (33%), Positives = 15/27 (55%)
Query: 630 LDESKLYASMELEVVDKLFCAYQKNGV 656
+D +L ++E + + LF A Q NG
Sbjct: 340 IDPPQLLPTLETKKIPGLFFAGQINGT 366
>gnl|CDD|218545 pfam05300, DUF737, Protein of unknown function (DUF737). This
family consists of several uncharacterized mammalian
proteins of unknown function.
Length = 187
Score = 30.5 bits (69), Expect = 3.3
Identities = 20/69 (28%), Positives = 36/69 (52%), Gaps = 8/69 (11%)
Query: 458 RLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQKVAELEDRISELKHRADF 517
R+AK+E+E ++ + L E S ++R + AK +LE++ +ELK F
Sbjct: 101 RIAKREREAAEEQLSRAVLREKASAEQERQK--------AKHLARQLEEKEAELKRLDAF 152
Query: 518 EQDERKRLE 526
+++ RLE
Sbjct: 153 YKEQLARLE 161
>gnl|CDD|223033 PHA03291, PHA03291, envelope glycoprotein I; Provisional.
Length = 401
Score = 31.1 bits (70), Expect = 3.3
Identities = 12/59 (20%), Positives = 19/59 (32%), Gaps = 4/59 (6%)
Query: 550 EEPSSPRKPEAPSSCIPPP----PPPASLPPPPPPRRSSDLPYLTCPRPLVRCRQPPRS 604
+SP PS+ PP P P++ P + + P PP +
Sbjct: 213 RTTASPETTPTPSTTTSPPSTTIPAPSTTIAAPQAGTTPEAEGTPAPPTPGGGEAPPAN 271
Score = 30.3 bits (68), Expect = 6.8
Identities = 8/36 (22%), Positives = 11/36 (30%)
Query: 552 PSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLP 587
P + A P P S P P S+ +
Sbjct: 208 PRPTPRTTASPETTPTPSTTTSPPSTTIPAPSTTIA 243
>gnl|CDD|217476 pfam03286, Pox_Ag35, Pox virus Ag35 surface protein.
Length = 198
Score = 30.5 bits (69), Expect = 3.3
Identities = 26/108 (24%), Positives = 49/108 (45%), Gaps = 6/108 (5%)
Query: 440 AAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQ 499
A KK+ + + E KK + D KT+E E+ ++ D E +A L
Sbjct: 63 ATTKKSKKKDKE---KLTEEEKKPESDDDKTEENENDPDNNEESGDSQESASANSLSDID 119
Query: 500 KVAELEDRISELKHRADFEQDERKRLENLINS-GSVPDDTKAVGLSRS 546
+++D S+LK + + K+L +++ +V +D +A +SR
Sbjct: 120 NEDDMDD--SDLKLATENIIKDLKKLNARVSAVSTVLEDVQAASISRQ 165
>gnl|CDD|226400 COG3883, COG3883, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 265
Score = 30.8 bits (70), Expect = 3.4
Identities = 16/78 (20%), Positives = 37/78 (47%), Gaps = 7/78 (8%)
Query: 454 DLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQKVAELEDRISELKH 513
LS+++ ++ +L + +EK+++ + + +++E E + K+ EL+ I + K
Sbjct: 28 LLSDKIQNQDSKLSELQKEKKNIQNEIESLDNQIE-------EIQSKIDELQKEIDQSKA 80
Query: 514 RADFEQDERKRLENLINS 531
Q E L+ I
Sbjct: 81 EIKKLQKEIAELKENIVE 98
Score = 30.1 bits (68), Expect = 6.0
Identities = 22/101 (21%), Positives = 42/101 (41%), Gaps = 1/101 (0%)
Query: 426 KKVVHLLAKEEEIVAAKKKADELEMEN-VDLSNRLAKKEQELDQKTQEKEDLDESLSRIK 484
KK++ ++++ + K+ A E ++E V L N L + L+ + EK L +L+ +
Sbjct: 147 KKILEQQKEDKKSLEEKQAALEDKLETLVALQNELETQLNSLNSQKAEKNALIAALAAKE 206
Query: 485 DRLEKETAGHLEAKQKVAELEDRISELKHRADFEQDERKRL 525
E A E K ++ + A E+ L
Sbjct: 207 ASALGEKAALEEQKALAEAAAAEAAKQEAAAKAAAQEQAAL 247
>gnl|CDD|234938 PRK01297, PRK01297, ATP-dependent RNA helicase RhlB; Provisional.
Length = 475
Score = 31.0 bits (70), Expect = 3.5
Identities = 13/51 (25%), Positives = 16/51 (31%), Gaps = 3/51 (5%)
Query: 551 EPSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPLVRCRQP 601
P SP AP P P ++ P PR R R+P
Sbjct: 20 APPSPAAAPAPPPPAKTAAPATKAAAPAAAAPRAEKPKKDKPR---RERKP 67
Score = 31.0 bits (70), Expect = 4.0
Identities = 18/60 (30%), Positives = 24/60 (40%), Gaps = 1/60 (1%)
Query: 557 KPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPLVRCRQPPRSKLKSRIPRPYFP 616
K EA PP P A+ PPPP + ++ P + P+ K K R R P
Sbjct: 11 KGEAEQPAPAPPSPAAAPAPPPPAKTAAPATKAAAPAAAAPRAEKPK-KDKPRRERKPKP 69
Score = 30.7 bits (69), Expect = 4.6
Identities = 12/55 (21%), Positives = 20/55 (36%)
Query: 548 GREEPSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPLVRCRQPP 602
G E +P P ++ PPPP + P ++ P P+ R+
Sbjct: 12 GEAEQPAPAPPSPAAAPAPPPPAKTAAPATKAAAPAAAAPRAEKPKKDKPRRERK 66
Score = 30.7 bits (69), Expect = 4.9
Identities = 14/55 (25%), Positives = 17/55 (30%), Gaps = 3/55 (5%)
Query: 532 GSVPDDTKAVGLSRSGGREEPSSPRKPEAPSSCIPPP---PPPASLPPPPPPRRS 583
G + A P P K AP++ P P A P PRR
Sbjct: 10 GKGEAEQPAPAPPSPAAAPAPPPPAKTAAPATKAAAPAAAAPRAEKPKKDKPRRE 64
>gnl|CDD|221518 pfam12301, CD99L2, CD99 antigen like protein 2. This family of
proteins is found in eukaryotes. Proteins in this family
are typically between 165 and 237 amino acids in length.
CD99L2 and CD99 are involved in trans-endothelial
migration of neutrophils in vitro and in the recruitment
of neutrophils into inflamed peritoneum.
Length = 154
Score = 30.1 bits (68), Expect = 3.5
Identities = 10/47 (21%), Positives = 15/47 (31%), Gaps = 12/47 (25%)
Query: 551 EPSSPRKPEA------------PSSCIPPPPPPASLPPPPPPRRSSD 585
+P++P+KP + P P P P P D
Sbjct: 18 KPATPKKPSKGDDDFDLEDALDGGNDGGAGPKPKPRPQPDPGNPGGD 64
>gnl|CDD|149849 pfam08912, Rho_Binding, Rho Binding. Rho Binding Domain is
responsible for the recognition and binding of Rho
binding domain-containing proteins (such as ROCK) to
Rho, resulting in activation of the GTPase which in turn
modulates the phosphorylation of various signalling
proteins. This domain is within an amphipathic
alpha-helical coiled-coil and interacts with Rho through
predominantly hydrophobic interactions.
Length = 68
Score = 28.2 bits (63), Expect = 3.5
Identities = 21/70 (30%), Positives = 36/70 (51%), Gaps = 8/70 (11%)
Query: 442 KKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEK----ETAGHLEA 497
K L E +L+N+L K ++EL + +E+ +S++ IK EK E +A
Sbjct: 1 TKDVANLANEKEELNNKLKKAQEELQKLKEEE----QSINNIKAAFEKQLQTERTLKTQA 56
Query: 498 KQKVAELEDR 507
K+AE+ +R
Sbjct: 57 VNKLAEIMNR 66
>gnl|CDD|177614 PHA03377, PHA03377, EBNA-3C; Provisional.
Length = 1000
Score = 31.2 bits (70), Expect = 3.6
Identities = 15/48 (31%), Positives = 22/48 (45%), Gaps = 9/48 (18%)
Query: 544 SRSGGREEPSSPRKPEAP-----SSCI----PPPPPPASLPPPPPPRR 582
R G ++PS P +P ++ I P PP ++ P PPP R
Sbjct: 451 ERPGPSDQPSVPVEPAHLTPVEHTTVILHQPPQSPPTVAIKPAPPPSR 498
>gnl|CDD|236138 PRK07994, PRK07994, DNA polymerase III subunits gamma and tau;
Validated.
Length = 647
Score = 31.4 bits (72), Expect = 3.6
Identities = 16/72 (22%), Positives = 27/72 (37%), Gaps = 2/72 (2%)
Query: 536 DDTKAVGLSRSGGREEPSSPRKPEAPSSCIPPPPPPASLPP--PPPPRRSSDLPYLTCPR 593
+ + + + ++P AP PPPPAS P P P + L +
Sbjct: 368 PEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQ 427
Query: 594 PLVRCRQPPRSK 605
L R + ++K
Sbjct: 428 QLQRAQGATKAK 439
>gnl|CDD|184538 PRK14149, PRK14149, heat shock protein GrpE; Provisional.
Length = 191
Score = 30.2 bits (68), Expect = 3.7
Identities = 24/81 (29%), Positives = 35/81 (43%), Gaps = 9/81 (11%)
Query: 446 DELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQKVAELE 505
DE E+ LS KE E ++K ++ + E E E K K E+
Sbjct: 3 DEHNQEHDHLSQ----KEPESEEKACACKEQQGEEKQEASEKEGEIKEDFELKYK--EMH 56
Query: 506 DRISELKHRADFEQDERKRLE 526
+ L+ ADFE + +KRLE
Sbjct: 57 E--KYLRVHADFE-NVKKRLE 74
>gnl|CDD|148061 pfam06229, FRG1, FRG1-like family. The human FRG1 gene maps to
human chromosome 4q35 and has been identified as a
candidate for facioscapulohumeral muscular dystrophy.
Currently, the function of FRG1 is unknown.
Length = 189
Score = 30.5 bits (69), Expect = 3.7
Identities = 19/93 (20%), Positives = 34/93 (36%), Gaps = 15/93 (16%)
Query: 435 EEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLD----------ESLSRIK 484
+IVA K A E EM + S+ + + + + + + + K
Sbjct: 95 AGDIVAKSKTAGEGEMVEI-RSDASREFKTTDRIRMEARFKPRIKESKYVKKFQKIQDKK 153
Query: 485 DRLEKETAGHLEAKQKV----AELEDRISELKH 513
RL + L+ + EL DR ++KH
Sbjct: 154 LRLSDDEVKRLKKARAEGNLHEELLDRRVKMKH 186
>gnl|CDD|237862 PRK14948, PRK14948, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 620
Score = 31.1 bits (71), Expect = 3.7
Identities = 15/60 (25%), Positives = 18/60 (30%)
Query: 552 PSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPLVRCRQPPRSKLKSRIP 611
S P P P P PPPPPP + P PP + + P
Sbjct: 529 KSPPPPAPTPPLPQPTATAPPPTPPPPPPTATQASSNAPAQIPADSSPPPPIPEEPTPSP 588
>gnl|CDD|192111 pfam08647, BRE1, BRE1 E3 ubiquitin ligase. BRE1 is an E3 ubiquitin
ligase that has been shown to act as a transcriptional
activator through direct activator interactions.
Length = 96
Score = 28.8 bits (65), Expect = 3.7
Identities = 19/57 (33%), Positives = 31/57 (54%), Gaps = 4/57 (7%)
Query: 435 EEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKET 491
+++ AA + D L+ EN L N K EL ++ ++ E +E RIK+ LEK+
Sbjct: 44 DQKYFAAMRSKDALDAENKKL-NTQLNKSSELIEQLKDLE--NELRRRIKE-LEKQL 96
>gnl|CDD|239122 cd02657, Peptidase_C19A, A subfamily of Peptidase C19. Peptidase
C19 contains ubiquitinyl hydrolases. They are
intracellular peptidases that remove ubiquitin molecules
from polyubiquinated peptides by cleavage of isopeptide
bonds. They hydrolyse bonds involving the carboxyl group
of the C-terminal Gly residue of ubiquitin. The purpose
of the de-ubiquitination is thought to be editing of the
ubiquitin conjugates, which could rescue them from
degradation, as well as recycling of the ubiquitin. The
ubiquitin/proteasome system is responsible for most
protein turnover in the mammalian cell, and with over 50
members, family C19 is one of the largest families of
peptidases in the human genome.
Length = 305
Score = 30.8 bits (70), Expect = 3.8
Identities = 17/47 (36%), Positives = 19/47 (40%), Gaps = 1/47 (2%)
Query: 800 LRKLLELVLALGNYMNKGARGNASGFRLT-SLNKLVDTKSSSSRGVT 845
LR + EL AL NY N S LT +L L DT V
Sbjct: 18 LRSVPELRDALKNYNPARRGANQSSDNLTNALRDLFDTMDKKQEPVP 64
>gnl|CDD|148614 pfam07106, TBPIP, Tat binding protein 1(TBP-1)-interacting protein
(TBPIP). This family consists of several eukaryotic
TBP-1 interacting protein (TBPIP) sequences. TBP-1 has
been demonstrated to interact with the human
immunodeficiency virus type 1 (HIV-1) viral protein Tat,
then modulate the essential replication process of HIV.
In addition, TBP-1 has been shown to be a component of
the 26S proteasome, a basic multiprotein complex that
degrades ubiquitinated proteins in an ATP-dependent
fashion. Human TBPIP interacts with human TBP-1 then
modulates the inhibitory action of human TBP-1 on
HIV-Tat-mediated transactivation.
Length = 169
Score = 30.0 bits (68), Expect = 3.8
Identities = 17/82 (20%), Positives = 40/82 (48%), Gaps = 6/82 (7%)
Query: 447 ELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQKVAELED 506
+L+ME +L + +Q+ E + L L+ + L++E E K++V E+E+
Sbjct: 76 KLDMEIEELREEVQLLKQDCSTLEIELKSLTSDLT--TEELQEEIQ---ELKKEVREIEE 130
Query: 507 RISELKH-RADFEQDERKRLEN 527
++ L+ +E ++++
Sbjct: 131 KLESLEEGWKPVTPEEMEKVKK 152
>gnl|CDD|225950 COG3416, COG3416, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 233
Score = 30.6 bits (69), Expect = 3.8
Identities = 35/141 (24%), Positives = 52/141 (36%), Gaps = 43/141 (30%)
Query: 468 QKTQEKEDLDESLSRIKDRLEKE-TAGHL-------------EAKQKVAELEDRISELKH 513
+K + E ++ + I + + K+ A + +A ++ ELE RI+ L
Sbjct: 17 KKAEANERDPQAEALIAEAVAKQPDAAYYLAQRVLIQEQALKKASTQIKELEKRIAIL-- 74
Query: 514 RADFEQDERKRLENLINSGSVPDDTKAVGLSRSGGREEPSSPRKPEAPSSCIPPPPPPAS 573
+ E SGS LS + P P AP + PPP PA+
Sbjct: 75 ----QAGEAG-------SGSF--------LSNAFKWGTPQEP----APPANAPPPKEPAA 111
Query: 574 LPPPPPPRRSSDLPYLTCPRP 594
PP RSS T P P
Sbjct: 112 ----PPSWRSSPAGPTTQPSP 128
>gnl|CDD|234173 TIGR03346, chaperone_ClpB, ATP-dependent chaperone ClpB. Members
of this protein family are the bacterial ATP-dependent
chaperone ClpB. This protein belongs to the AAA family,
ATPases associated with various cellular activities
(pfam00004). This molecular chaperone does not act as a
protease, but rather serves to disaggregate misfolded
and aggregated proteins [Protein fate, Protein folding
and stabilization].
Length = 852
Score = 31.1 bits (71), Expect = 3.9
Identities = 24/98 (24%), Positives = 45/98 (45%), Gaps = 20/98 (20%)
Query: 434 KEEEIVAAKKKADELEMENVDLS--------NRLAKKEQELDQKTQEKEDLDE------- 478
K EE+ ++ +LE+E L RL E+EL + +E DL+E
Sbjct: 403 KPEELDELDRRIIQLEIEREALKKEKDEASKERLEDLEKELAELEEEYADLEEQWKAEKA 462
Query: 479 ---SLSRIKDRLEKETAGHLEAKQKVAELEDRISELKH 513
+ +IK+ +E+ LE ++ +L + +EL++
Sbjct: 463 AIQGIQQIKEEIEQ-VRLELEQAEREGDLA-KAAELQY 498
>gnl|CDD|116500 pfam07888, CALCOCO1, Calcium binding and coiled-coil domain
(CALCOCO1) like. Proteins found in this family are
similar to the coiled-coil transcriptional coactivator
protein coexpressed by Mus musculus (CoCoA/CALCOCO1).
This protein binds to a highly conserved N-terminal
domain of p160 coactivators, such as GRIP1, and thus
enhances transcriptional activation by a number of
nuclear receptors. CALCOCO1 has a central coiled-coil
region with three leucine zipper motifs, which is
required for its interaction with GRIP1 and may regulate
the autonomous transcriptional activation activity of
the C-terminal region.
Length = 546
Score = 31.1 bits (70), Expect = 4.0
Identities = 20/102 (19%), Positives = 42/102 (41%), Gaps = 8/102 (7%)
Query: 421 LKINVKKVVHLLAKEEEIVAAKKKA----DELEMENVDLSNRLAKKEQE---LDQKTQEK 473
L+ + +K L + +E ++ + + L + + R+ + EQ+ L QK QE
Sbjct: 180 LRHSTEKTEELEEQHKEAQSSSESMSAERNALLAQRAENQQRILELEQDIQTLTQKKQEN 239
Query: 474 EDLDESLSRIKDRLEKETAGHLE-AKQKVAELEDRISELKHR 514
+ + E I+ LE+ + K+ + D +E
Sbjct: 240 DRVLEGTQDIEAELERMKGELKQRLKKMTIQRRDEETERIDL 281
>gnl|CDD|217310 pfam02993, MCPVI, Minor capsid protein VI. This minor capsid
protein may act as a link between the external capsid
and the internal DNA-protein core. The C-terminal 11
residues may function as a protease cofactor leading to
enzyme activation.
Length = 238
Score = 30.5 bits (69), Expect = 4.0
Identities = 15/74 (20%), Positives = 23/74 (31%), Gaps = 3/74 (4%)
Query: 514 RADFEQDERKRLENLINSGSVPDDTKAVGLSRSGGREEPSSPRKPEAPSSCIPPPPPPAS 573
R E+D K LE ++ + V + +P PR P PP+
Sbjct: 95 RQKVEKDLEKLLEKVLGEEEPAPQEETV--ADPIQALQPR-PRPDVEEVLVPAAPEPPSY 151
Query: 574 LPPPPPPRRSSDLP 587
P + P
Sbjct: 152 EETIKPGPAPVEEP 165
>gnl|CDD|222417 pfam13851, GAS, Growth-arrest specific micro-tubule binding. This
family is the highly conserved central region of a
number of metazoan proteins referred to as growth-arrest
proteins. In mouse, Gas8 is predominantly a testicular
protein, whose expression is developmentally regulated
during puberty and spermatogenesis. In humans, it is
absent in infertile males who lack the ability to
generate gametes. The localisation of Gas8 in the
motility apparatus of post-meiotic gametocytes and
mature spermatozoa, together with the detection of Gas8
also in cilia at the apical surfaces of epithelial cells
lining the pulmonary bronchi and Fallopian tubes
suggests that the Gas8 protein may have a role in the
functioning of motile cellular appendages. Gas8 is a
microtubule-binding protein localised to regions of
dynein regulation in mammalian cells.
Length = 201
Score = 30.3 bits (69), Expect = 4.1
Identities = 28/103 (27%), Positives = 47/103 (45%), Gaps = 17/103 (16%)
Query: 435 EEEIVAAKKKAD-------ELEMENVDLSNRLAKKEQE---LDQKTQEKEDLDESLSRIK 484
+EEI KK + E+ EN L L K E+E L +K ++ E +SL +K
Sbjct: 33 KEEIAEMKKNEEHNEKLMAEIAQENKRLVEPLKKAEEEVEELRKKLKDYEKDKQSLKNLK 92
Query: 485 DRLEKETAGHLEAKQKVAELEDRISELKHRADFEQDERKRLEN 527
RL+ E ++++ L+ L+ R + + ER L +
Sbjct: 93 ARLK-------ELEKELKNLKWESEVLEQRFEKVERERDELYD 128
>gnl|CDD|223831 COG0760, SurA, Parvulin-like peptidyl-prolyl isomerase
[Posttranslational modification, protein turnover,
chaperones].
Length = 320
Score = 30.8 bits (69), Expect = 4.2
Identities = 22/170 (12%), Positives = 48/170 (28%), Gaps = 8/170 (4%)
Query: 876 KVSMKELENDMTQLRNNLKEVSREIEF----QRVQPLVPGDRFLPVMKEFLTSATCRLSE 931
K+S + + + ++ K + Q Q + F V +F A +
Sbjct: 81 KISQQVVAQLIAEIAVLDKLAKELLLGLSDEQGKQLIAEKPAFQDVEGKFDARAYLKRLR 140
Query: 932 IEDLFQDMKTRFDRAVRLFGEDNSSIQPEDFFVIFDTFLTTLNEAKLDNENMKRRREEEE 991
I+ L ++ + R + + L + + ++ E
Sbjct: 141 IKGLTEEQEAEALR--DKLQNKQQGKKVTEVQARH--ILVKAEAKAKEALALLKKGVREA 196
Query: 992 KRARQDAEMKKRTTDRKQGAKENGVVISNGSVKGEFDDLISALRTGDVFG 1041
K + K+ + G + EF L+ G+V
Sbjct: 197 KADFAELAKKQSEDPSSKNGGGLLGWNKKGQLVPEFRKAAFILKVGEVSA 246
>gnl|CDD|203593 pfam07200, Mod_r, Modifier of rudimentary (Mod(r)) protein. This
family represents a conserved region approximately 150
residues long within a number of eukaryotic proteins
that show homology with Drosophila melanogaster Modifier
of rudimentary (Mod(r)) proteins. The N-terminal half of
Mod(r) proteins is acidic, whereas the C-terminal half
is basic, and both of these regions are represented in
this family. Members of this family include the Vps37
subunit of the endosomal sorting complex ESCRT-I, a
complex involved in recruiting transport machinery for
protein sorting at the multivesicular body (MVB). The
yeast ESCRT-I complex consists of three proteins (Vps23,
Vps28 and Vps37). The mammalian homologue of Vps37
interacts with Tsg101 (Pfam: PF05743) through its mod(r)
domain and its function is essential for lysosomal
sorting of EGF receptors.
Length = 150
Score = 29.6 bits (67), Expect = 4.2
Identities = 14/77 (18%), Positives = 34/77 (44%), Gaps = 5/77 (6%)
Query: 414 EDYDHILLKINVKKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEK 473
+ +L++ + + E A + + L EN++ +L + + +L K +E
Sbjct: 17 DALLEFVLELPQVQDL-----NLEKDALLAQNESLAEENLEKEPQLEELKAQLRDKYEEL 71
Query: 474 EDLDESLSRIKDRLEKE 490
+ L+E K +L++
Sbjct: 72 KKLEEKYEEKKQQLDEL 88
>gnl|CDD|192662 pfam10738, Lpp-LpqN, Probable lipoprotein LpqN. This family is
conserved in Mycobacteriaceae and is likely to be a
lipoprotein.
Length = 175
Score = 30.0 bits (68), Expect = 4.2
Identities = 16/50 (32%), Positives = 20/50 (40%)
Query: 531 SGSVPDDTKAVGLSRSGGREEPSSPRKPEAPSSCIPPPPPPASLPPPPPP 580
VP T A L +G EP +P A + +P PP P P P
Sbjct: 1 PVPVPSGTLAQYLEENGVTGEPVAPGDLGALTISLPTPPGWEPYPDPNVP 50
>gnl|CDD|223542 COG0466, Lon, ATP-dependent Lon protease, bacterial type
[Posttranslational modification, protein turnover,
chaperones].
Length = 782
Score = 31.0 bits (71), Expect = 4.3
Identities = 28/100 (28%), Positives = 49/100 (49%), Gaps = 18/100 (18%)
Query: 435 EEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGH 494
E+ + +K+ D L++E R+ KK +E +K+Q + L E L I+ L ++
Sbjct: 195 EKLLDLLEKEIDLLQLEK-----RIRKKVKEQMEKSQREYYLREQLKAIQKELGEDDDDK 249
Query: 495 LEAKQKVAELEDRISEL------KHRADFEQDERKRLENL 528
E V EL ++I +L K +A+ E K+LE +
Sbjct: 250 DE----VEELREKIEKLKLPKEAKEKAE---KELKKLETM 282
>gnl|CDD|237000 PRK11855, PRK11855, dihydrolipoamide acetyltransferase; Reviewed.
Length = 547
Score = 30.9 bits (71), Expect = 4.3
Identities = 7/47 (14%), Positives = 12/47 (25%), Gaps = 3/47 (6%)
Query: 551 EPSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPLVR 597
+ A + P + P P+ + P VR
Sbjct: 207 AAAPAAAAAAAPAPAPAAAAAPAAAAPAAAAAPGKAPHAS---PAVR 250
>gnl|CDD|219490 pfam07627, PSCyt3, Protein of unknown function (DUF1588). A region
of similarity shared by several Rhodopirellula baltica
cytochrome-like proteins that are predicted to be
secreted. These proteins also match pfam07626 and
pfam07624.
Length = 102
Score = 29.0 bits (66), Expect = 4.4
Identities = 8/19 (42%), Positives = 10/19 (52%)
Query: 564 CIPPPPPPASLPPPPPPRR 582
PPPPPP ++P P
Sbjct: 36 GDPPPPPPPNVPALEPDDP 54
>gnl|CDD|219897 pfam08549, SWI-SNF_Ssr4, Fungal domain of unknown function
(DUF1750). This is a fungal domain of unknown function.
Length = 669
Score = 30.8 bits (69), Expect = 4.5
Identities = 29/131 (22%), Positives = 47/131 (35%), Gaps = 13/131 (9%)
Query: 426 KKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKD 485
+V +A + KK ME + ++ L EQ L + D + R++
Sbjct: 367 DRVAKKVADMTAEIEKLKKRHARRMEKFNRTSALKDAEQRLRDAAADPADTGSEIWRLEG 426
Query: 486 RLEKETAGHLEAKQKVAELEDRISELKHRADFEQDERKR-LENLINSGSVPDDTKAVGLS 544
RLE +T R +H+A ++ D+ R +E VP+ V
Sbjct: 427 RLEIDTEDD----------SSRQGPREHKAKYKVDDIVREVEKSWGKPIVPE--PDVSCV 474
Query: 545 RSGGREEPSSP 555
GG E P
Sbjct: 475 EKGGLLEKIEP 485
>gnl|CDD|184923 PRK14959, PRK14959, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 624
Score = 30.8 bits (69), Expect = 4.5
Identities = 20/80 (25%), Positives = 28/80 (35%), Gaps = 3/80 (3%)
Query: 540 AVGLSRSGG---REEPSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPLV 596
A G++ S PS+ P P PP PP + +PP P PR P P +
Sbjct: 411 AAGMTPSSAAPATPAPSAAPSPRVPWDDAPPAPPRSGIPPRPAPRMPEASPVPGAPDSVA 470
Query: 597 RCRQPPRSKLKSRIPRPYFP 616
P + + P
Sbjct: 471 SASDAPPTLGDPSDTAEHTP 490
>gnl|CDD|223250 COG0172, SerS, Seryl-tRNA synthetase [Translation, ribosomal
structure and biogenesis].
Length = 429
Score = 30.6 bits (70), Expect = 4.5
Identities = 16/56 (28%), Positives = 27/56 (48%), Gaps = 5/56 (8%)
Query: 432 LAKE--EEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKD 485
L+KE + + A+EL E +L +L + E LD+ + +LD L I +
Sbjct: 55 LSKEIGRALKRGEDDAEELIAEVKELKEKLKELEAALDEL---EAELDTLLLTIPN 107
>gnl|CDD|114474 pfam05750, Rubella_Capsid, Rubella capsid protein. Rubella virus
is an enveloped positive-strand RNA virus of the family
Togaviridae. Virions are composed of three structural
proteins: a capsid and two membrane-spanning
glycoproteins, E2 and E1. During virus assembly, the
capsid interacts with genomic RNA to form nucleocapsids.
It has been discovered that capsid phosphorylation
serves to negatively regulate binding of viral genomic
RNA. This may delay the initiation of nucleocapsid
assembly until sufficient amounts of virus glycoproteins
accumulate at the budding site and/or prevent
non-specific binding to cellular RNA when levels of
genomic RNA are low. It follows that at a late stage in
replication, the capsid may undergo dephosphorylation
before nucleocapsid assembly occurs.
Length = 300
Score = 30.6 bits (68), Expect = 4.5
Identities = 32/134 (23%), Positives = 50/134 (37%), Gaps = 15/134 (11%)
Query: 495 LEAKQKVAELEDRISELKHRADFEQDERKRLENLINSGSVPDDT--------KAVGLSRS 546
+E QK E + R + A Q R R +S + DD+ + G
Sbjct: 9 MEDLQKALEAQSRALRAELAAGASQSRRPRPPRQRDSSTSGDDSGRDSGGPRRRRGNRGR 68
Query: 547 GGREE----PSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPLVRCRQPP 602
G R++ P P + + S P P P + P P P R + PRP + PP
Sbjct: 69 GQRKDWSRAPPPPEERQESRSQTPAPKPSRAPPQQPQPPRMQTGRGGSAPRPEL---GPP 125
Query: 603 RSKLKSRIPRPYFP 616
+ ++ + R P
Sbjct: 126 TNPFQAAVARGLRP 139
>gnl|CDD|193414 pfam12941, HCV_NS5a_C, HCV NS5a protein C-terminal region. This is
a family of proteins found in the hepatitis C virus.
This family contains the C-terminal region of the NS5A
protein. CC The molecular function of the non-structural
5a protein is uncertain. The NS5a protein is
phosphorylated when expressed in mammalian cells. It is
thought to interact with the ds RNA dependent
(interferon inducible) kinase PKR.
Length = 244
Score = 30.5 bits (69), Expect = 4.5
Identities = 11/13 (84%), Positives = 11/13 (84%)
Query: 570 PPASLPPPPPPRR 582
PPA LPP PPPRR
Sbjct: 140 PPAGLPPVPPPRR 152
>gnl|CDD|240289 PTZ00144, PTZ00144, dihydrolipoamide succinyltransferase;
Provisional.
Length = 418
Score = 30.8 bits (70), Expect = 4.6
Identities = 18/72 (25%), Positives = 30/72 (41%), Gaps = 3/72 (4%)
Query: 539 KAVGLSRSGGREEPSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPLVRC 598
A ++ + ++ PE P++ P PP A P P P + P R R
Sbjct: 133 AAAKAEKTTPEKPKAAAPTPEPPAASKPTPPAAAKPPEPAPAAKP---PPTPVARADPRE 189
Query: 599 RQPPRSKLKSRI 610
+ P S+++ RI
Sbjct: 190 TRVPMSRMRQRI 201
>gnl|CDD|223448 COG0371, GldA, Glycerol dehydrogenase and related enzymes [Energy
production and conversion].
Length = 360
Score = 30.7 bits (70), Expect = 4.7
Identities = 15/65 (23%), Positives = 27/65 (41%), Gaps = 3/65 (4%)
Query: 650 AYQKNGVPADGSVEDLTKLGKISKSKILSVIDGRRAQNCTILLSKLKMTDEEISKAILSM 709
+K G+P ++ +L L + L+V R + TIL +T E A+ +
Sbjct: 298 FLKKVGLPT--TLAELG-LDDDEVIEALTVAHAIRPERETILGMPFGLTPEAARAALEAT 354
Query: 710 DSADQ 714
+ Q
Sbjct: 355 EVIGQ 359
>gnl|CDD|204784 pfam11932, DUF3450, Protein of unknown function (DUF3450). This
family of proteins are functionally uncharacterized.
This protein is found in bacteria and eukaryotes.
Proteins in this family are about 260 amino acids in
length.
Length = 250
Score = 30.3 bits (69), Expect = 4.8
Identities = 20/100 (20%), Positives = 39/100 (39%), Gaps = 18/100 (18%)
Query: 432 LAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKET 491
EI K+ + L + N L +A ++QE+ L + + +I ++ +
Sbjct: 53 QELLAEIRQLLKEIENLRVYNDQLQRLVANQQQEIA-------SLQQQIEQI-EKTRQGL 104
Query: 492 AGHLEAKQKVAELEDRISELKHRAD--FEQDER-KRLENL 528
+ + + LE + D F ++ER RL L
Sbjct: 105 VPLMY--RMIDGLEQFV-----ALDLPFLKEERLARLARL 137
>gnl|CDD|236892 PRK11281, PRK11281, hypothetical protein; Provisional.
Length = 1113
Score = 30.7 bits (70), Expect = 5.0
Identities = 18/69 (26%), Positives = 31/69 (44%), Gaps = 8/69 (11%)
Query: 443 KKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQKVA 502
K D + E L +LA+ +L Q E L +KD ++ET L + +
Sbjct: 73 DKIDRQKEETEQLKQQLAQAPAKLRQAQAE-------LEALKDDNDEETRETL-STLSLR 124
Query: 503 ELEDRISEL 511
+LE R+++
Sbjct: 125 QLESRLAQT 133
>gnl|CDD|218661 pfam05622, HOOK, HOOK protein. This family consists of several
HOOK1, 2 and 3 proteins from different eukaryotic
organisms. The different members of the human gene
family are HOOK1, HOOK2 and HOOK3. Different domains
have been identified in the three human HOOK proteins,
and it was demonstrated that the highly conserved
NH2-domain mediates attachment to microtubules, whereas
the central coiled-coil motif mediates homodimerisation
and the more divergent C-terminal domains are involved
in binding to specific organelles (organelle-binding
domains). It has been demonstrated that endogenous HOOK3
binds to Golgi membranes, whereas both HOOK1 and HOOK2
are localised to discrete but unidentified cellular
structures. In mice the Hook1 gene is predominantly
expressed in the testis. Hook1 function is necessary for
the correct positioning of microtubular structures
within the haploid germ cell. Disruption of Hook1
function in mice causes abnormal sperm head shape and
fragile attachment of the flagellum to the sperm head.
Length = 713
Score = 30.9 bits (70), Expect = 5.0
Identities = 35/154 (22%), Positives = 70/154 (45%), Gaps = 26/154 (16%)
Query: 397 LLLDRLAQQLTLQTETGEDYDHILLKINVKKVVHLLAKEEEIVAAKKKADELEMENVDL- 455
L++L Q+ + E D I + K++ L + +E+ + ++ L+ E L
Sbjct: 246 SQLEQL-QEENFRLEAARDDYRIKCEELEKELAELQHRNDELTSLAAESQALKDEIDVLR 304
Query: 456 --SNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEA---------------- 497
S++ K E +++ ++ EDL++ ++K LE+ A +++
Sbjct: 305 ESSDKAKKLEAQVETYKKKLEDLNDLRRQVK-LLEERNAMYMQNTVQLEEELKKANAARG 363
Query: 498 -----KQKVAELEDRISELKHRADFEQDERKRLE 526
K++V EL ++SE +AD + E KRLE
Sbjct: 364 QLETYKRQVQELHAKLSEESKKADKLEFEYKRLE 397
>gnl|CDD|218307 pfam04880, NUDE_C, NUDE protein, C-terminal conserved region. This
family represents the C-terminal conserved region of the
NUDE proteins. NUDE proteins are involved in nuclear
migration.
Length = 166
Score = 29.8 bits (66), Expect = 5.1
Identities = 30/128 (23%), Positives = 46/128 (35%), Gaps = 20/128 (15%)
Query: 457 NRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQKVAELEDRISELKHRAD 516
N+ ++ L+ + EKE L R++D EL D EL +
Sbjct: 10 NQALERNALLEMEYDEKESLRIENQRLRD-----------------ELRDLKQELIVQER 52
Query: 517 FEQDERK-RLENLINSGSVPDDTKAVGLSRSGGREEPSSP-RKPEAPSSCIPPPPPPASL 574
+ RK R ++N GS P T S + R P+ P P + + A+
Sbjct: 53 LRNNNRKSRPAPVVNLGSSP-STPHTNSSMNSPRSPPNGTVSSPLTPPTKLSLTLASATA 111
Query: 575 PPPPPPRR 582
P PP
Sbjct: 112 TDPAPPMS 119
>gnl|CDD|177328 PHA01929, PHA01929, putative scaffolding protein.
Length = 306
Score = 30.4 bits (68), Expect = 5.1
Identities = 10/30 (33%), Positives = 13/30 (43%), Gaps = 3/30 (10%)
Query: 554 SPRKPEAPSSCIPPPP---PPASLPPPPPP 580
+ P P+ PP P +L PPPP
Sbjct: 70 VQQAPAQPAPAAPPAAGAALPEALEVPPPP 99
>gnl|CDD|218434 pfam05103, DivIVA, DivIVA protein. The Bacillus subtilis divIVA1
mutation causes misplacement of the septum during cell
division, resulting in the formation of small, circular,
anucleate mini-cells. Inactivation of divIVA produces a
mini-cell phenotype, whereas overproduction of DivIVA
results in a filamentation phenotype. These proteins
appear to contain coiled-coils.
Length = 131
Score = 29.2 bits (66), Expect = 5.3
Identities = 20/86 (23%), Positives = 40/86 (46%), Gaps = 8/86 (9%)
Query: 451 ENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKE-TAGHLEAKQKVAELEDRIS 509
EN +L + + E++L++ + +E L ++L ++ E+ EA+ + E E +
Sbjct: 40 ENEELKEEIERLEEKLEEYKELEETLQKTLVVAQETAEEVKANAQKEAELIIKEAEAKAE 99
Query: 510 ELKHRAD-------FEQDERKRLENL 528
+ + A+ E DE KR L
Sbjct: 100 RIVNDANEEAKKLATEYDELKRQSRL 125
>gnl|CDD|217020 pfam02403, Seryl_tRNA_N, Seryl-tRNA synthetase N-terminal domain.
This domain is found associated with the Pfam tRNA
synthetase class II domain (pfam00587) and represents
the N-terminal domain of seryl-tRNA synthetase.
Length = 108
Score = 28.7 bits (65), Expect = 5.3
Identities = 20/81 (24%), Positives = 39/81 (48%), Gaps = 11/81 (13%)
Query: 431 LLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKE 490
LL +EE + + +EL+ E N L+K+ + +K ++ E L + +KD L+
Sbjct: 31 LLELDEERRELQVELEELQAE----RNELSKEIGKAKKKKEDAEALIAEVKELKDELK-- 84
Query: 491 TAGHLEAKQKVAELEDRISEL 511
+ ++ ELE + +L
Sbjct: 85 -----ALEAELRELEAELDKL 100
>gnl|CDD|225986 COG3455, COG3455, Type VI protein secretion system component VasF
[Intracellular trafficking, secretion, and vesicular
transport].
Length = 262
Score = 30.0 bits (68), Expect = 5.4
Identities = 12/37 (32%), Positives = 17/37 (45%)
Query: 349 HIETKSATSMFNILRRKLGHTAAYPHFLSLLHHCILL 385
H E +F +L R L A Y L L++ C+ L
Sbjct: 128 HNEAWGGEKVFELLERLLREPARYRDLLELIYLCLSL 164
>gnl|CDD|215918 pfam00435, Spectrin, Spectrin repeat. Spectrin repeat-domains are
found in several proteins involved in cytoskeletal
structure. These include spectrin, alpha-actinin and
dystrophin. The sequence repeat used in this family is
taken from the structural repeat in reference. The
spectrin domain- repeat forms a three helix bundle. The
second helix is interrupted by proline in some
sequences. The repeats are defined by a characteristic
tryptophan (W) residue at position 17 in helix A and a
leucine (L) at 2 residues from the carboxyl end of helix
C. Although the domain occurs in ultiple repeats along
sequences, the domains are actually stable on their own
- ie they act, biophysically, like domains rather than
repeats that along function when aggregated.
Length = 105
Score = 28.8 bits (65), Expect = 5.5
Identities = 21/102 (20%), Positives = 44/102 (43%), Gaps = 19/102 (18%)
Query: 442 KKKADEL-----EMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEK------- 489
+ AD+L E E + S K + + ++ + L+ L+ +DR+E
Sbjct: 7 FRDADDLESWIEEKEALLSSEDYGKDLESVQALLKKHKALEAELAAHQDRVEALNELAEK 66
Query: 490 -ETAGHLEAKQ---KVAELEDRISELKHRADFEQDERKRLEN 527
GH +++ ++ EL +R +L A + +++LE
Sbjct: 67 LIAEGHYASEEIQERLEELNERWEQLLELA---AERKQKLEE 105
>gnl|CDD|237378 PRK13406, bchD, magnesium chelatase subunit D; Provisional.
Length = 584
Score = 30.4 bits (69), Expect = 5.6
Identities = 12/36 (33%), Positives = 13/36 (36%), Gaps = 3/36 (8%)
Query: 552 PSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLP 587
P + R P P PPPP PPPP
Sbjct: 253 PRATRLPAPPQPPEEEPPPP---PPPPEDDDDPPED 285
Score = 30.0 bits (68), Expect = 9.3
Identities = 10/36 (27%), Positives = 13/36 (36%)
Query: 552 PSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLP 587
P+ P+ PE PPPP PP +
Sbjct: 259 PAPPQPPEEEPPPPPPPPEDDDDPPEDEEEQDDAED 294
>gnl|CDD|221459 pfam12200, DUF3597, Domain of unknown function (DUF3597). This
family of proteins is found in bacteria, eukaryotes and
viruses. Proteins in this family are typically between
126 and 281 amino acids in length. The function of this
domain is unknown. The structure of this domain has been
found to contain five helices with a long flexible loop
between helices one and two.
Length = 124
Score = 28.8 bits (65), Expect = 5.6
Identities = 10/40 (25%), Positives = 15/40 (37%), Gaps = 1/40 (2%)
Query: 547 GGREEPSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDL 586
G ++P P A + P A+ PP S D+
Sbjct: 10 FGEAAAAAP-APAAAPATAPAAAAAAAPAATPPAAASVDV 48
>gnl|CDD|227751 COG5464, COG5464, Uncharacterized conserved protein [Function
unknown].
Length = 289
Score = 30.1 bits (68), Expect = 5.6
Identities = 33/169 (19%), Positives = 59/169 (34%), Gaps = 21/169 (12%)
Query: 366 LGHTAAYPHFLSLLHHCILLPLEFGCHPQHWLLLDRLAQQLTLQTETGEDYDHILLKINV 425
Y L+ ++ E H + L L L Q+ Q + + L + V
Sbjct: 133 FDDRKLYSSAFPLVDLTVVPDDEIMQHRR--LALLELLQKHIRQRDLMGLVE-QLATLLV 189
Query: 426 KKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKD 485
+L + EI + N L ++ + + +E E+L I
Sbjct: 190 YG--YLNDSQLEI-----------LFNYGLQTDTLRETRFYQEIAEEGPQHGEALMTIAG 236
Query: 486 RLEKETAGH---LEAKQKVAELEDRISELKHRADFEQDERKRLENLINS 531
RLE+E G LE + EL RI+++ +++ R+ L
Sbjct: 237 RLEQE--GKQEGLEIVPGLLELGLRIAQIMLELGLDRELVLRVTGLSEE 283
>gnl|CDD|217197 pfam02711, Pap_E4, E4 protein. This is is a family of
Papillomavirus proteins, E4, coded for by ORF4. A splice
variant, E1--E4, exists but neither the function of E4
or E1--E4 is known.
Length = 95
Score = 28.4 bits (64), Expect = 5.6
Identities = 13/50 (26%), Positives = 16/50 (32%), Gaps = 11/50 (22%)
Query: 563 SCIPPPPPPASLPPPPPPRR-----------SSDLPYLTCPRPLVRCRQP 601
+ PP PP PP P +R SS + P P P
Sbjct: 18 TYTTPPRPPPPPPPWAPKKRCRRRLESDSDSSSTTSPSSPPTPQPTTECP 67
>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein. This family
consists of several Borrelia P83/P100 antigen proteins.
Length = 489
Score = 30.4 bits (68), Expect = 5.7
Identities = 22/108 (20%), Positives = 49/108 (45%), Gaps = 4/108 (3%)
Query: 423 INVKKVVHLLAKEEE--IVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESL 480
I+ KKVV L ++ E + + D E E+ + + R + ++ELD+K + + +
Sbjct: 177 ISDKKVVEALREDNEKGVNFRRDMTDLKERESQEDAKRAQQLKEELDKKQIDADKAQQKA 236
Query: 481 SRIKDRLEKETAGHLEAKQKVAELED--RISELKHRADFEQDERKRLE 526
+D +K+ + +Q+ L S K +++++ +E
Sbjct: 237 DFAQDNADKQRDEVRQKQQEAKNLPKPADTSSPKEDKQVAENQKREIE 284
>gnl|CDD|218806 pfam05911, DUF869, Plant protein of unknown function (DUF869).
This family consists of a number of sequences found in
Arabidopsis thaliana, Oryza sativa and Lycopersicon
esculentum (Tomato). The function of this family is
unknown.
Length = 767
Score = 30.7 bits (69), Expect = 5.8
Identities = 26/83 (31%), Positives = 37/83 (44%), Gaps = 13/83 (15%)
Query: 440 AAKKKADELEMENVDLSNRLAKKEQELDQKT----------QEKEDLDESLSRIKDRLEK 489
AA +K E E +L +L + ++EL + QEK + LS K R E
Sbjct: 75 AATEKTKEWEKVKAELEKKLLELQKELARAAAENDALSRSLQEKSKMIMELSEEKSRAES 134
Query: 490 ETAGHLEAKQKVAELEDRISELK 512
E L+A+ + AE E S LK
Sbjct: 135 EIE-DLKARLESAEKE--NSSLK 154
Score = 30.3 bits (68), Expect = 6.9
Identities = 31/112 (27%), Positives = 50/112 (44%), Gaps = 9/112 (8%)
Query: 417 DHILLKINVKKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAKKE--QELDQKTQEKE 474
+ ++ + + V + A E + K+ E E+ + D +EL+ EKE
Sbjct: 545 EWVVNQEKSLQDVSVEASEIKKNFLGLKSSEKEINSPDEVKGAVCISTLEELETLKSEKE 604
Query: 475 DLDESLSRIKDRLEKETAGHLEAKQKVAELEDRISELKHRADFEQDERKRLE 526
+LD LS+ KD LE E+K K+ E E ++ ELK D Q+ E
Sbjct: 605 NLDGELSKCKDDLE-------ESKNKLQETEKKLEELKSELDASQESNSLAE 649
>gnl|CDD|217835 pfam03999, MAP65_ASE1, Microtubule associated protein (MAP65/ASE1
family).
Length = 619
Score = 30.6 bits (69), Expect = 6.0
Identities = 22/104 (21%), Positives = 41/104 (39%), Gaps = 9/104 (8%)
Query: 430 HLLAKEEEIVAAKKKADELE------MENVDLSNRLAKKEQELDQKTQEKEDLD---ESL 480
L EE V ++ D L+ + EQ++ + D E++
Sbjct: 150 RLGELREEKVRRLEEVDSLKQSIKSLCSLLGTPPARTDFEQDVLSYGEIPNDHCVTRETI 209
Query: 481 SRIKDRLEKETAGHLEAKQKVAELEDRISELKHRADFEQDERKR 524
++ LE A + K+ +L ++I EL +R +E+KR
Sbjct: 210 DKLDKMLENLAAQKKQRADKIDDLREQIQELWNRLQISDEEQKR 253
>gnl|CDD|237057 PRK12323, PRK12323, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 700
Score = 30.6 bits (69), Expect = 6.1
Identities = 8/44 (18%), Positives = 14/44 (31%), Gaps = 3/44 (6%)
Query: 544 SRSGGREEPSSPRKPEAPSSCIPPPPPPASLPP---PPPPRRSS 584
+R G +P AP++ P P R++
Sbjct: 441 ARGPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAAPARAA 484
>gnl|CDD|114709 pfam06003, SMN, Survival motor neuron protein (SMN). This family
consists of several eukaryotic survival motor neuron
(SMN) proteins. The Survival of Motor Neurons (SMN)
protein, the product of the spinal muscular
atrophy-determining gene, is part of a large
macromolecular complex (SMN complex) that functions in
the assembly of spliceosomal small nuclear
ribonucleoproteins (snRNPs). The SMN complex functions
as a specificity factor essential for the efficient
assembly of Sm proteins on U snRNAs and likely protects
cells from illicit, and potentially deleterious,
non-specific binding of Sm proteins to RNAs.
Length = 264
Score = 30.0 bits (67), Expect = 6.2
Identities = 24/116 (20%), Positives = 28/116 (24%), Gaps = 16/116 (13%)
Query: 474 EDLDESLSRIKDRLEKETAGHLEAKQKVAELEDRISELKHRADFEQDERKRLENLINSGS 533
D +K+ ET + + DR S
Sbjct: 121 PPPDVDEDALKEANVNET-------ESSTDESDRSSHSHEVRSKSNFPMGPPSPWNPRFP 173
Query: 534 VPDDTKAVGLSRSGGREEPSSPRKPEAPSSCIPPPPPPASLPP--PPPPRRSSDLP 587
G R G P S PPP P PP PPPP S D
Sbjct: 174 PGPPPPPPGFGRHG-----EKPSGWPPFLSGWPPPFPLG--PPMIPPPPPMSPDFG 222
>gnl|CDD|214775 smart00689, DM6, Cysteine-rich domain currently specific to
Drosophila.
Length = 157
Score = 29.4 bits (66), Expect = 6.2
Identities = 18/68 (26%), Positives = 21/68 (30%), Gaps = 11/68 (16%)
Query: 550 EEPSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLT---CPRP-LVRCRQPPRSK 605
E P KP+ C P P P PRR P CP + C+
Sbjct: 35 ECPPIMIKPKR--ICCPDKEPR-----IPRPRRRRRKPRCPSAACPTGNPMPCKPEAGKS 87
Query: 606 LKSRIPRP 613
RI P
Sbjct: 88 PCPRIRMP 95
Score = 29.0 bits (65), Expect = 7.4
Identities = 17/70 (24%), Positives = 21/70 (30%), Gaps = 10/70 (14%)
Query: 549 REEPSSPRKPEAPSSCIPPP-PPPASLPPPPPPRRSSDLPYLTCPR-PLVRCRQ---PPR 603
+EP PR P P P P P CPR + C+ PP
Sbjct: 50 DKEPRIPRPRRRRRKPRCPSAACPTGNPMPCKPEAGKSP----CPRIRMPCCKPARRPPS 105
Query: 604 SKLKSRIPRP 613
+ R P
Sbjct: 106 CR-AVRRPSD 114
>gnl|CDD|215379 PLN02708, PLN02708, Probable pectinesterase/pectinesterase
inhibitor.
Length = 553
Score = 30.2 bits (68), Expect = 6.2
Identities = 9/24 (37%), Positives = 10/24 (41%)
Query: 553 SSPRKPEAPSSCIPPPPPPASLPP 576
S+ S PPPP S PP
Sbjct: 22 SNRHHHHHTPSPSPPPPSSPSTPP 45
>gnl|CDD|223649 COG0576, GrpE, Molecular chaperone GrpE (heat shock protein)
[Posttranslational modification, protein turnover,
chaperones].
Length = 193
Score = 29.6 bits (67), Expect = 6.3
Identities = 22/97 (22%), Positives = 40/97 (41%), Gaps = 18/97 (18%)
Query: 435 EEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGH 494
EE ++A++ E E + + E E +Q QE +L+ L +KD+
Sbjct: 9 EEPDAEETEEAEKSEEEEAEEEEPEEENELEEEQ--QEIAELEAQLEELKDKY------- 59
Query: 495 LEAKQKVAELEDRISELKHRADFEQDE--RKRLENLI 529
+ AE E L+ R + E++E + +E
Sbjct: 60 ---LRAQAEFE----NLRKRTEREREEAKKYAIEKFA 89
>gnl|CDD|235906 PRK07003, PRK07003, DNA polymerase III subunits gamma and tau;
Validated.
Length = 830
Score = 30.6 bits (69), Expect = 6.3
Identities = 12/59 (20%), Positives = 20/59 (33%), Gaps = 4/59 (6%)
Query: 548 GREEPSSPRKPEAPSSCIPPPPPPASLPPPPP--PRRSSDLPYLTCPR--PLVRCRQPP 602
G ++ + AP++ P P ++ P P + D P R R P
Sbjct: 568 GARAAAAAKPAAAPAAAPKPAAPRVAVQVPTPRARAATGDAPPNGAARAEQAAESRGAP 626
>gnl|CDD|216840 pfam02005, TRM, N2,N2-dimethylguanosine tRNA methyltransferase.
This enzyme EC:2.1.1.32 used S-AdoMet to methylate tRNA.
The TRM1 gene of Saccharomyces cerevisiae is necessary
for the N2,N2-dimethylguanosine modification of both
mitochondrial and cytoplasmic tRNAs. The enzyme is found
in both eukaryotes and archaebacteria.
Length = 375
Score = 30.0 bits (68), Expect = 6.7
Identities = 13/58 (22%), Positives = 26/58 (44%), Gaps = 8/58 (13%)
Query: 760 YEQRLRCLHYKKRFHF----WISDI--EPKIKSVMEA--SREVTRSRRLRKLLELVLA 809
+ + C H RFH W+ + + ++ ++E S S+R+ LL+L+
Sbjct: 255 AKFKAECPHCGGRFHLAGPLWLGPLHDKEFVEEMLEIAESPPEEFSKRILGLLKLIKE 312
>gnl|CDD|221130 pfam11527, ARL2_Bind_BART, The ARF-like 2 binding protein BART.
BART binds specifically to ARL2.GTP with a high affinity
however it does not bind to ARL2.GDP. It is thought that
this specific interaction is due to BART being the first
identified ARL2-specific effector. The function is not
completely characterized. BART is predominantly
cytosolic but can also be found to be associated with
mitochondria. BART is also involved in binding to the
adenine nucleotide transporter ANT1.
Length = 120
Score = 28.8 bits (65), Expect = 6.7
Identities = 15/43 (34%), Positives = 21/43 (48%), Gaps = 3/43 (6%)
Query: 688 CTILLSKLKM---TDEEISKAILSMDSADQLPLDMLEQLLKFT 727
+L S LK + E+ +A LS S +Q + EQLL T
Sbjct: 58 EKLLESFLKEVGISPEQFEEACLSQLSKNQQAKVIFEQLLAAT 100
>gnl|CDD|220610 pfam10165, Ric8, Guanine nucleotide exchange factor synembryn.
Ric8 is involved in the EGL-30 neurotransmitter
signalling pathway. It is a guanine nucleotide exchange
factor that regulates neurotransmitter secretion.
Length = 439
Score = 30.4 bits (69), Expect = 6.7
Identities = 18/86 (20%), Positives = 33/86 (38%), Gaps = 13/86 (15%)
Query: 309 QPIIDKLRTHEN--ETLDRHLDFFEMLRHEDESELARKYETDHI-ETKSATSMFNILRRK 365
+ +ID+ L+RHL + + LA E + ET +FN+
Sbjct: 110 KQLIDEHHGLSLLLAALERHLKQASK---DKQEPLAPPMELLALSETLKL--LFNVTFFY 164
Query: 366 LGH-----TAAYPHFLSLLHHCILLP 386
+A+ PH +++L +P
Sbjct: 165 PRDVEEEFSASLPHIVAILRRLPTIP 190
>gnl|CDD|233692 TIGR02031, BchD-ChlD, magnesium chelatase ATPase subunit D. This
model represents one of two ATPase subunits of the
trimeric magnesium chelatase responsible for insertion
of magnesium ion into protoporphyrin IX. This is an
essential step in the biosynthesis of both chlorophyll
and bacteriochlorophyll. This subunit is found in green
plants, photosynthetic algae, cyanobacteria and other
photosynthetic bacteria. Unlike subunit I (TIGR02030),
this subunit is not found in archaea [Biosynthesis of
cofactors, prosthetic groups, and carriers, Chlorophyll
and bacteriochlorphyll].
Length = 589
Score = 30.2 bits (68), Expect = 6.9
Identities = 19/51 (37%), Positives = 20/51 (39%), Gaps = 4/51 (7%)
Query: 552 PSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPLVRCRQPP 602
P + R PE P PPPP P PP P D P T P Q P
Sbjct: 261 PRATRLPEPE----PQPPPPPPPPEPPEPEEEPDEPDQTDPDDGEETDQIP 307
>gnl|CDD|222604 pfam14222, MOR2-PAG1_N, Cell morphogenesis N-terminal. This family
is the conserved N-terminal region of proteins that are
involved in cell morphogenesis.
Length = 538
Score = 30.3 bits (69), Expect = 6.9
Identities = 16/66 (24%), Positives = 19/66 (28%), Gaps = 11/66 (16%)
Query: 566 PPPPPPASLPPPPPPRRSSDLPYLTCPRPLVRCRQPPRSKLKSRIPRPYFPNVSVQVNGT 625
PPPPPP S P R L I + Y+P + G
Sbjct: 351 PPPPPPTSPPSSNNLRVKKTPSSEYLTNE---------VALSIGI-KEYYPEFCKAL-GK 399
Query: 626 IWTELD 631
I D
Sbjct: 400 ILILCD 405
>gnl|CDD|218332 pfam04929, Herpes_DNAp_acc, Herpes DNA replication accessory
factor. Replicative DNA polymerases are capable of
polymerising tens of thousands of nucleotides without
dissociating from their DNA templates. The high
processivity of these polymerases is dependent upon
accessory proteins that bind to the catalytic subunit of
the polymerase or to the substrate. The Epstein-Barr
virus (EBV) BMRF1 protein is an essential component of
the viral DNA polymerase and is absolutely required for
lytic virus replication. BMRF1 is also a transactivator.
This family is predicted to have a UL42 like structure.
Length = 381
Score = 30.0 bits (68), Expect = 7.1
Identities = 18/68 (26%), Positives = 24/68 (35%), Gaps = 7/68 (10%)
Query: 553 SSPRKPEAPSSCIPPPPPPASLPPP---PPPRRSSDLPYLTCPRPL----VRCRQPPRSK 605
S R S P PP + P PPP S P RPL R R+ + +
Sbjct: 306 VSDRPRHLSSDSSPSPPDTSDSDPSTETPPPASLSHSPPAAFERPLALSPKRKREGDKKQ 365
Query: 606 LKSRIPRP 613
K + +
Sbjct: 366 KKKKSKKL 373
>gnl|CDD|215180 PLN02316, PLN02316, synthase/transferase.
Length = 1036
Score = 30.2 bits (68), Expect = 7.1
Identities = 21/65 (32%), Positives = 27/65 (41%), Gaps = 13/65 (20%)
Query: 961 DFFVI---------FDTFLT--TLNEA-KLDNENMKRRREEEEKRARQDAEMKKRTTDRK 1008
DF V F+ FL E KL E +R R+ EE+R R++ E DR
Sbjct: 232 DFCVEIEGGMDEHSFEDFLLEEKRRELEKLAKEEAERERQAEEQR-RREEEKAAMEADRA 290
Query: 1009 QGAKE 1013
Q E
Sbjct: 291 QAKAE 295
>gnl|CDD|165539 PHA03282, PHA03282, envelope glycoprotein E; Provisional.
Length = 540
Score = 30.3 bits (68), Expect = 7.1
Identities = 9/30 (30%), Positives = 10/30 (33%)
Query: 552 PSSPRKPEAPSSCIPPPPPPASLPPPPPPR 581
P E + P P P PPP P
Sbjct: 172 TPPPDDDEGDEAGAPATPAPPLHPPPAPHP 201
>gnl|CDD|224259 COG1340, COG1340, Uncharacterized archaeal coiled-coil protein
[Function unknown].
Length = 294
Score = 30.0 bits (68), Expect = 7.2
Identities = 23/93 (24%), Positives = 48/93 (51%), Gaps = 7/93 (7%)
Query: 436 EEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHL 495
EE +K+ADEL E V+LS ++ + +E E +L++ + ++ + +
Sbjct: 200 EEADELRKEADELHEEFVELSKKIDELHEEFRNLQNELRELEKKIKALRAKEKA-----A 254
Query: 496 EAKQKVAELEDRISELKHRADFEQDERKRLENL 528
+ ++K EL++R E+ + F++ E+ E L
Sbjct: 255 KRREKREELKERAEEIYEK--FKRGEKLTTEEL 285
>gnl|CDD|177618 PHA03381, PHA03381, tegument protein VP22; Provisional.
Length = 290
Score = 30.0 bits (67), Expect = 7.2
Identities = 14/60 (23%), Positives = 18/60 (30%), Gaps = 4/60 (6%)
Query: 550 EEPSSPRKPEAPSSC--IPPPPPPASLPPPPPPRRSSDLPYLTCPR--PLVRCRQPPRSK 605
E P+ PR P + P P P R PR P + PR +
Sbjct: 82 ERPADPRPSRRPHAQPEASGPGPARGARGPAGSRGRGRRAESPSPRDPPNPKGASAPRGR 141
>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota). DNA
Topoisomerase I (eukaryota), DNA topoisomerase V,
Vaccina virus topoisomerase, Variola virus
topoisomerase, Shope fibroma virus topoisomeras.
Length = 391
Score = 30.0 bits (68), Expect = 7.3
Identities = 17/81 (20%), Positives = 40/81 (49%), Gaps = 3/81 (3%)
Query: 432 LAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKET 491
+ K +E + A K + + + L ++ +++L + E+++ E L +KE
Sbjct: 279 MEKLQEKIKALKYQLKRLKKMILLFEMISDLKRKL-KSKFERDN--EKLDAEVKEKKKEK 335
Query: 492 AGHLEAKQKVAELEDRISELK 512
+ K+++ LE+RI +L+
Sbjct: 336 KKEEKKKKQIERLEERIEKLE 356
>gnl|CDD|236995 PRK11824, PRK11824, polynucleotide phosphorylase/polyadenylase;
Provisional.
Length = 693
Score = 30.0 bits (69), Expect = 7.9
Identities = 15/84 (17%), Positives = 40/84 (47%), Gaps = 5/84 (5%)
Query: 436 EEIV--AAKKKADELEMENVDLSNRLAKK-EQELDQ--KTQEKEDLDESLSRIKDRLEKE 490
EE+ A K + + +L + + E +L + + +K++ + +L IK+ + +
Sbjct: 220 EELAAEAGPKWEWQPPEVDEELKAAVKELAEAKLKEAYQITDKQEREAALDAIKEEVLEA 279
Query: 491 TAGHLEAKQKVAELEDRISELKHR 514
A E ++ E+++ +L+ +
Sbjct: 280 LAAEEEEEEDEKEIKEAFKKLEKK 303
>gnl|CDD|236081 PRK07735, PRK07735, NADH dehydrogenase subunit C; Validated.
Length = 430
Score = 29.9 bits (67), Expect = 8.0
Identities = 21/84 (25%), Positives = 35/84 (41%), Gaps = 2/84 (2%)
Query: 433 AKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETA 492
EEE AAK KA + +K + ++ T+E+E+ D+ ++ K +
Sbjct: 119 VTEEEKAAAKAKAAAAAKAKAAALAK--QKREGTEEVTEEEEETDKEKAKAKAAAAAKAK 176
Query: 493 GHLEAKQKVAELEDRISELKHRAD 516
AKQK AE + E+
Sbjct: 177 AAALAKQKAAEAGEGTEEVTEEEK 200
>gnl|CDD|226447 COG3937, COG3937, Uncharacterized conserved protein [Function
unknown].
Length = 108
Score = 28.2 bits (63), Expect = 8.2
Identities = 20/87 (22%), Positives = 45/87 (51%), Gaps = 7/87 (8%)
Query: 441 AKKKADELEMENVDLSNRLAKK--EQELDQKTQEKEDLDESLSR-IKDRLEKETAGHLEA 497
+K DEL ++ +L+ AK+ + L Q + + +L+E + R I++ L +
Sbjct: 26 VQKLVDEL-VKKGELNAEEAKRFVDDLLRQAKEAQGELEEKIPRKIEEMLSDLE---VAR 81
Query: 498 KQKVAELEDRISELKHRADFEQDERKR 524
+ ++ EL +R+ L+ + +++ KR
Sbjct: 82 QSEMDELTERVDALERQVADLENKLKR 108
>gnl|CDD|182002 PRK09627, oorA, 2-oxoglutarate-acceptor oxidoreductase subunit
OorA; Reviewed.
Length = 375
Score = 29.7 bits (67), Expect = 8.2
Identities = 10/31 (32%), Positives = 18/31 (58%)
Query: 490 ETAGHLEAKQKVAELEDRISELKHRADFEQD 520
ET GH+ K + +LE+ + +R +F+ D
Sbjct: 170 ETVGHMYGKAVIPDLEEVQKMIINRKEFDGD 200
>gnl|CDD|227556 COG5231, VMA13, Vacuolar H+-ATPase V1 sector, subunit H [Energy
production and conversion].
Length = 432
Score = 29.9 bits (67), Expect = 8.3
Identities = 31/132 (23%), Positives = 52/132 (39%), Gaps = 11/132 (8%)
Query: 405 QLTLQTETGEDYDHILLKIN-VKKVVHLLAKEEE-------IVAAKKKADELEMENVDLS 456
LT E +D D + IN + +V AKE+ + K+ + + + L
Sbjct: 219 ILTFSKECAQDIDKMDDLINDLIAIVKERAKEKVLRLCCGIVANVLDKSPKGYIFSPLLL 278
Query: 457 NRLAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQKVAELEDRI---SELKH 513
N ++K Q L ++ E+L + RI+ RL + T + EL+ S H
Sbjct: 279 NDISKCVQVLLERKYSDEELVIDIERIRSRLVQNTKKLCIFDNYLNELDSGRLEWSPYHH 338
Query: 514 RADFEQDERKRL 525
+ DF L
Sbjct: 339 KKDFWSTNLDML 350
>gnl|CDD|220603 pfam10152, DUF2360, Predicted coiled-coil domain-containing protein
(DUF2360). This is the conserved 140 amino acid region
of a family of proteins conserved from nematodes to
humans. One C. elegans member is annotated as a
Daf-16-dependent longevity protein 1 but this could not
be confirmed. The function is unknown.
Length = 147
Score = 28.9 bits (65), Expect = 8.3
Identities = 20/88 (22%), Positives = 28/88 (31%), Gaps = 15/88 (17%)
Query: 542 GLSRSGGREEPSSPRKPEAPSS-CIPPPPPPASLPPPPPPRRSS--DLPYLTCPRPL--- 595
GL + P P A ++ PPPPP + PPP + P
Sbjct: 49 GLEDVTVQTTPPPP--ASAITNGGPPPPPPARAEAASPPPPEAPAEPPAEPEPEAPAENT 106
Query: 596 VRCRQPPRSKLKSRIPRPYFPNVSVQVN 623
V + PR YF + + V
Sbjct: 107 VTVAKDPRY-------AKYFKMLKLGVP 127
>gnl|CDD|224644 COG1730, GIM5, Predicted prefoldin, molecular chaperone implicated
in de novo protein folding [Posttranslational
modification, protein turnover, chaperones].
Length = 145
Score = 28.9 bits (65), Expect = 8.4
Identities = 12/39 (30%), Positives = 23/39 (58%)
Query: 473 KEDLDESLSRIKDRLEKETAGHLEAKQKVAELEDRISEL 511
++ DE++ +K R+E+ + +Q +AEL RI +L
Sbjct: 89 EKSADEAIEFLKKRIEELEKAIEKLQQALAELAQRIEQL 127
>gnl|CDD|220261 pfam09484, Cas_TM1802, CRISPR-associated protein TM1802
(cas_TM1802). Clusters of short DNA repeats with
non-homologous spacers, which are found at regular
intervals in the genomes of phylogenetically distinct
prokaryotic species, comprise a family with recognisable
features. This family is known as CRISPR (short for
Clustered, Regularly Interspaced Short Palindromic
Repeats). A number of protein families appear only in
association with these repeats and are designated Cas
(CRISPR-Associated) proteins. This minor cas protein is
found in at least five prokaryotic genomes:
Methanosarcina mazei, Sulfurihydrogenibium azorense,
Thermotoga maritima, Carboxydothermus hydrogenoformans,
and Dictyoglomus thermophilum, the first of which is
archaeal while the rest are bacterial.
Length = 584
Score = 30.0 bits (68), Expect = 8.4
Identities = 10/44 (22%), Positives = 22/44 (50%)
Query: 473 KEDLDESLSRIKDRLEKETAGHLEAKQKVAELEDRISELKHRAD 516
K++L++ L I + L + +K +A LE+ I ++ +
Sbjct: 272 KKNLEKELEEILESLNDIKSLSKGSKNSLAGLEEEILDILSDEN 315
>gnl|CDD|220716 pfam10359, Fmp27_WPPW, RNA pol II promoter Fmp27 protein domain.
Fmp27_WPPW is a conserved domain of a family of proteins
involved in RNA polymerase II transcription initiation.
It contains characteristic HQR and WPPW sequence motifs.
and is towards the C-terminal in members which contain
Fmp27_SW pfam10305.
Length = 470
Score = 30.0 bits (68), Expect = 8.4
Identities = 17/69 (24%), Positives = 33/69 (47%), Gaps = 1/69 (1%)
Query: 473 KEDLDESLSRIKDRLEK-ETAGHLEAKQKVAELEDRISELKHRADFEQDERKRLENLINS 531
+ L+E + +++++L+ E E + + ELE+ +S LK R +F + + LE S
Sbjct: 170 LKQLEEQIKKLEEKLDDLELNDTEELQSDLEELEEELSVLKERLEFLEKLLEDLERSEES 229
Query: 532 GSVPDDTKA 540
T
Sbjct: 230 SDRSSSTDT 238
>gnl|CDD|235549 PRK05658, PRK05658, RNA polymerase sigma factor RpoD; Validated.
Length = 619
Score = 30.1 bits (69), Expect = 8.5
Identities = 20/80 (25%), Positives = 37/80 (46%), Gaps = 1/80 (1%)
Query: 433 AKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEK-EDLDESLSRIKDRLEKET 491
A + ++ E E + + LA E EL +K EK + L + +++ EK+
Sbjct: 183 AHVGSELEELDDDEDEEEEEDENDDSLAADESELPEKVLEKFKALAKQYKKLRKAQEKKV 242
Query: 492 AGHLEAKQKVAELEDRISEL 511
G L +K A+L +++ E
Sbjct: 243 EGRLAQHKKYAKLREKLKEE 262
>gnl|CDD|235309 PRK04596, minC, septum formation inhibitor; Reviewed.
Length = 248
Score = 29.5 bits (66), Expect = 8.5
Identities = 13/27 (48%), Positives = 13/27 (48%), Gaps = 6/27 (22%)
Query: 568 PPPPASLPPPPPPRRSSDLPYLTCPRP 594
PPPP PPPPP R P RP
Sbjct: 119 PPPP---PPPPPARAEPAPP---VARP 139
>gnl|CDD|192987 pfam12325, TMF_TATA_bd, TATA element modulatory factor 1 TATA
binding. This is the C-terminal conserved coiled coil
region of a family of TATA element modulatory factor 1
proteins conserved in eukaryotes. The proteins bind to
the TATA element of some RNA polymerase II promoters and
repress their activity. by competing with the binding of
TATA binding protein. TMF1_TATA_bd is the most conserved
part of the TMFs. TMFs are evolutionarily conserved
golgins that bind Rab6, a ubiquitous ras-like
GTP-binding Golgi protein, and contribute to Golgi
organisation in animal and plant cells. The Rab6-binding
domain appears to be the same region as this C-terminal
family.
Length = 121
Score = 28.3 bits (64), Expect = 8.5
Identities = 14/58 (24%), Positives = 30/58 (51%), Gaps = 4/58 (6%)
Query: 459 LAKKEQELDQKTQEKEDLDESLSRIKDRLEKETAGHLEAKQKVAELEDRISELKHRAD 516
LA + EL + E+++ + + + L +E E K+++ ELE + +L+ R +
Sbjct: 33 LASLKDELARLEAERDEARQEIVK----LTEENEELRELKKEIEELEKELEDLEQRYE 86
>gnl|CDD|236012 PRK07406, PRK07406, RNA polymerase sigma factor RpoD; Validated.
Length = 373
Score = 29.8 bits (67), Expect = 8.5
Identities = 22/63 (34%), Positives = 34/63 (53%), Gaps = 10/63 (15%)
Query: 429 VHLLAKEEEIVAAKKKADELEMENVDLSNRLAKK-EQELDQKTQEKE---DLDESLSRIK 484
+ LL +EEI A+K AD LE+E L ++ E EL ++ +KE +D L + +
Sbjct: 75 IRLLRPDEEIELARKIADLLELE------ELREQFESELGREPSDKEWAELVDMPLPKFR 128
Query: 485 DRL 487
RL
Sbjct: 129 RRL 131
>gnl|CDD|222878 PHA02562, 46, endonuclease subunit; Provisional.
Length = 562
Score = 30.0 bits (68), Expect = 8.7
Identities = 28/133 (21%), Positives = 53/133 (39%), Gaps = 34/133 (25%)
Query: 426 KKVVHLLAKEEEIVAAKKKAD----ELEMENVDLSNRLAK----------------KEQE 465
K L+ + + I A ++ L M+ D S L K K +
Sbjct: 220 NKYDELVEEAKTIKAEIEELTDELLNLVMDIEDPSAALNKLNTAAAKIKSKIEQFQKVIK 279
Query: 466 LDQK-------TQEKEDLDESLSRIKDRLEKETAGHLEAKQKVAELEDRISELKHRADFE 518
+ +K TQ+ + + +++IKD+L+ E + + +L+ I EL+ D
Sbjct: 280 MYEKGGVCPTCTQQISEGPDRITKIKDKLK-------ELQHSLEKLDTAIDELEEIMDEF 332
Query: 519 QDERKRLENLINS 531
++ K+L L N
Sbjct: 333 NEQSKKLLELKNK 345
>gnl|CDD|133382 cd01107, HTH_BmrR, Helix-Turn-Helix DNA binding domain of the BmrR
transcription regulator. Helix-turn-helix (HTH)
multidrug-efflux transporter transcription regulator,
BmrR and YdfL of Bacillus subtilis, and related
proteins; N-terminal domain. Bmr is a membrane protein
which causes the efflux of a variety of toxic substances
and antibiotics. BmrR is comprised of two distinct
domains that harbor a regulatory (effector-binding) site
and an active (DNA-binding) site. The conserved
N-terminal domain contains a winged HTH motif that
mediates DNA binding, while the C-terminal domain binds
coactivating, toxic compounds. BmrR shares the
N-terminal DNA binding domain with other transcription
regulators of the MerR superfamily that promote
transcription by reconfiguring the spacer between the
-35 and -10 promoter elements.
Length = 108
Score = 28.3 bits (64), Expect = 8.8
Identities = 18/58 (31%), Positives = 27/58 (46%), Gaps = 9/58 (15%)
Query: 475 DLDESLSRIKDRLEKETAGHLEA--KQKVAELEDRISELKHRADFEQDERKRLENLIN 530
DL L IK+ L+ + L ++K+AELE I EL Q + LE+ +
Sbjct: 56 DLGFPLEEIKEILDADNDDELRKLLREKLAELEAEIEEL-------QRILRLLEDRLK 106
>gnl|CDD|223079 PHA03419, PHA03419, E4 protein; Provisional.
Length = 200
Score = 29.1 bits (65), Expect = 9.1
Identities = 13/45 (28%), Positives = 16/45 (35%)
Query: 561 PSSCIPPPPPPASLPPPPPPRRSSDLPYLTCPRPLVRCRQPPRSK 605
C P P P+S PPP PP + + PP K
Sbjct: 49 YPFCPPTTPHPSSQPPPCPPSPGHPPQTNDTHEKDLALQPPPGGK 93
>gnl|CDD|182123 PRK09866, PRK09866, hypothetical protein; Provisional.
Length = 741
Score = 29.8 bits (67), Expect = 9.4
Identities = 11/43 (25%), Positives = 22/43 (51%)
Query: 873 EASKVSMKELENDMTQLRNNLKEVSREIEFQRVQPLVPGDRFL 915
E + ++ ++E + L+ N +VS EI+ + L + FL
Sbjct: 449 EQLRQNIHQVEESLQLLQLNQAQVSGEIKHEIELALTSANHFL 491
>gnl|CDD|200432 TIGR04181, NHT_00031, aminotransferase, LLPSF_NHT_00031 family.
This clade of aminotransferases is a member of the
pfam01041 (DegT/DnrJ/EryC1/StrS) superfamily. The family
is named after the instance in Leptospira interrogans
serovar Lai, str. 56601, where it is the 31st gene in
the 91-gene lipopolysaccharide biosynthesis locus.
Members of this family are generally found within a
subcluster of seven or more genes including an
epimerase/dehydratase, four genes homologous to the
elements of the neuraminic (sialic) acid biosynthesis
cluster (NeuABCD) and a nucleotidyl transferase.
Together it is very likely that these enzymes direct the
biosynthesis of a nine-carbon sugar analogous to
CMP-neuraminic acid. These seven genes form the core of
the cassette, although they are often accompanied by
additional genes that may further modify the product
sugar.
Length = 359
Score = 29.4 bits (67), Expect = 9.5
Identities = 15/56 (26%), Positives = 18/56 (32%), Gaps = 3/56 (5%)
Query: 360 NILRRKLGHTAAYPHFLSLLHHCILLPLEFGCHPQHWL---LLDRLAQQLTLQTET 412
L RK Y F S + LP G +WL LLD + L
Sbjct: 260 EFLARKRELAEIYKEFFSGIPGVEFLPEPAGARSNYWLNALLLDSKLDRDELLEAL 315
>gnl|CDD|227352 COG5019, CDC3, Septin family protein [Cell division and chromosome
partitioning / Cytoskeleton].
Length = 373
Score = 29.6 bits (67), Expect = 9.6
Identities = 28/98 (28%), Positives = 46/98 (46%), Gaps = 15/98 (15%)
Query: 437 EIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLSRIKDRL-EKETAGHL 495
E ++ K + E ++ + RL ++E+EL +K EK E RL E E
Sbjct: 290 EKLSGLKNSGEPSLKEIH-EARLNEEERELKKKFTEKIREKE------KRLEELEQNLIE 342
Query: 496 EAKQKVAELEDRISELKHRADFEQDERKRLENLINSGS 533
E K+ ++LE+ +L+ D KRLE L ++ S
Sbjct: 343 ERKELNSKLEEIQKKLE-------DLEKRLEKLKSNKS 373
>gnl|CDD|151656 pfam11214, Med2, Mediator complex subunit 2. This family of mediator
complex subunit 2 proteins is conserved in fungi.
Cyclin-dependent kinase CDK8 or Srb10 interacts with and
phosphorylates Med2. Post-translational modifications of
Mediator subunits are important for regulation of gene
expression.
Length = 99
Score = 27.8 bits (62), Expect = 9.6
Identities = 13/43 (30%), Positives = 25/43 (58%), Gaps = 3/43 (6%)
Query: 962 FFVIFDTFLTTLNEAKLDNENM---KRRREEEEKRARQDAEMK 1001
F I D + LN+ K + + K+++EEEE++ +++ E K
Sbjct: 57 FHSILDDTESKLNDTKWYVDTLVENKKKQEEEERKKKEEEEKK 99
>gnl|CDD|148701 pfam07246, Phlebovirus_NSM, Phlebovirus nonstructural protein NS-M.
This family consists of several Phlebovirus
nonstructural NS-M proteins which represent the
N-terminal region of the M polyprotein precursor. The
function of this family is unknown.
Length = 264
Score = 29.5 bits (65), Expect = 9.7
Identities = 19/89 (21%), Positives = 37/89 (41%), Gaps = 12/89 (13%)
Query: 455 LSNRLAKKEQELDQKTQEKED------LDESLSRIKDRLEKETAG------HLEAKQKVA 502
L +K+ + +KT+E E+ + + K+ ++K H +AK +V
Sbjct: 146 LKKNAEEKDATIQRKTKEMEEDSRNQIAHHEIQQKKNEIQKLRNDLKRGQEHRDAKLRVE 205
Query: 503 ELEDRISELKHRADFEQDERKRLENLINS 531
E R+ LK DE + ++ + S
Sbjct: 206 ESMARVKGLKEELKQLTDELQHAKSEMQS 234
>gnl|CDD|183876 PRK13169, PRK13169, DNA replication intiation control protein YabA;
Reviewed.
Length = 110
Score = 27.9 bits (63), Expect = 9.8
Identities = 25/88 (28%), Positives = 40/88 (45%), Gaps = 22/88 (25%)
Query: 399 LDRLAQQL-TLQTETGEDYDHILLKINVKKVVHLLAKEEEIVAAKKKADELEMENVDLSN 457
LD L Q L L E G LK + +++ EE A L +EN L
Sbjct: 10 LDDLEQNLGVLLKELGA------LKKQLAELL------EENTA-------LRLENDKLRE 50
Query: 458 RLAKKEQELDQKTQEKEDLD--ESLSRI 483
RL + E E K ++K++ + ++L+R+
Sbjct: 51 RLEELEAEEPAKEKKKKEGEGKDNLARL 78
>gnl|CDD|236839 PRK11086, PRK11086, sensory histidine kinase DcuS; Provisional.
Length = 542
Score = 29.9 bits (68), Expect = 9.8
Identities = 23/72 (31%), Positives = 35/72 (48%), Gaps = 9/72 (12%)
Query: 118 DDTEIKSRTRILDSLRT---ALRTQPHSFVLRFIECEGLLTLLEYLKNLDHYSGQTSLH- 173
D TE++ + LD + ALR Q H F+ + GLL L Y L+ Y +T+ +
Sbjct: 322 DKTEVRQLAQRLDGMVNYADALRAQSHEFMNKLHVILGLLHLKSY-DQLEDYILKTANNY 380
Query: 174 ----SSIIGCIK 181
S++G IK
Sbjct: 381 QEEIGSLLGKIK 392
>gnl|CDD|238574 cd01169, HMPP_kinase, 4-amino-5-hydroxymethyl-2-methyl-pyrimidine
phosphate kinase (HMPP-kinase) catalyzes two consecutive
phosphorylation steps in the thiamine phosphate
biosynthesis pathway, leading to the synthesis of
vitamin B1. The first step is the phosphorylation of the
hydroxyl group of HMP to form
4-amino-5-hydroxymethyl-2-methyl-pyrimidine phosphate
(HMP-P) and then the phophorylation of HMP-P to form
4-amino-5-hydroxymethyl-2-methyl-pyrimidine
pyrophosphate (HMP-PP), which is the substrate for the
thiamine synthase coupling reaction.
Length = 242
Score = 29.4 bits (67), Expect = 9.8
Identities = 17/54 (31%), Positives = 23/54 (42%), Gaps = 10/54 (18%)
Query: 707 LSMDSADQLPLDMLEQLLKF----TPNTEEAALL---EEHSTETLARADRFLYE 753
L D A + + E LL TPN EA LL E + E + +A + L
Sbjct: 112 LLDDDAIEA---LRELLLPLATLITPNLPEAELLTGLEIATEEDMMKAAKALLA 162
>gnl|CDD|191511 pfam06409, NPIP, Nuclear pore complex interacting protein (NPIP).
This family consists of a series of primate specific
nuclear pore complex interacting protein (NPIP)
sequences. The function of this family is unknown but is
well conserved from African apes to humans.
Length = 350
Score = 29.4 bits (65), Expect = 9.8
Identities = 53/238 (22%), Positives = 85/238 (35%), Gaps = 32/238 (13%)
Query: 422 KINVKKVVHLLAKEEEIVAAKKKADELEMENVDLSNRLAKKEQELDQKTQEKEDLDESLS 481
KI KV + ++I +K A E +L+ KE E +K ++ + +E+
Sbjct: 120 KICKMKVTTKINHHDKINGKRKTAKEH-------LRKLSMKECEHAEKERQVSEAEEN-- 170
Query: 482 RIKDRLE-KETAGHLEAKQKVAELEDRISELKHRADFEQDERKRLENLINSGSV------ 534
+L+ KE + E Q+ L R +E H+ RK L N + +
Sbjct: 171 ---GKLDMKEIHTYKEMFQRAQALRRR-AEDYHKCKIPPSARKPLCNRVRMAAAEHRHSS 226
Query: 535 -----PDDTKAVGLSRSGGREEPSSPRKPEAPSSCIPPPPPPASLPPPPPPRRSSDLPYL 589
P T +R G + P + + +S PP L P PP S L
Sbjct: 227 GLPYWPYLTAETLKNRMGHQPPPPTQQHSIIDNSLSLKTPPECLLTPLPPSALPSADDNL 286
Query: 590 TCPRPLVRCRQPP-------RSKLKSRIPRPYFPNVSVQVNGTIWTELDESKLYASME 640
P + PP L + P P P+V + + L S L S++
Sbjct: 287 KTPAECLLRPLPPSADDNIKECPLALKPPPPLPPSVDDNLKECLLAPLPPSALPPSVD 344
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.316 0.133 0.380
Gapped
Lambda K H
0.267 0.0677 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 55,221,044
Number of extensions: 5637388
Number of successful extensions: 13914
Number of sequences better than 10.0: 1
Number of HSP's gapped: 11878
Number of HSP's successfully gapped: 843
Length of query: 1074
Length of database: 10,937,602
Length adjustment: 107
Effective length of query: 967
Effective length of database: 6,191,724
Effective search space: 5987397108
Effective search space used: 5987397108
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 64 (28.5 bits)