RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy16862
(303 letters)
>gnl|CDD|215646 pfam00001, 7tm_1, 7 transmembrane receptor (rhodopsin family).
This family contains, amongst other G-protein-coupled
receptors (GCPRs), members of the opsin family, which
have been considered to be typical members of the
rhodopsin superfamily. They share several motifs, mainly
the seven transmembrane helices, GCPRs of the rhodopsin
superfamily. All opsins bind a chromophore, such as
11-cis-retinal. The function of most opsins other than
the photoisomerases is split into two steps: light
absorption and G-protein activation. Photoisomerases, on
the other hand, are not coupled to G-proteins - they are
thought to generate and supply the chromophore that is
used by visual opsins.
Length = 251
Score = 145 bits (368), Expect = 4e-42
Identities = 53/190 (27%), Positives = 91/190 (47%), Gaps = 11/190 (5%)
Query: 66 VIKHRKLRIITNYFVVSLAFADLLVALCVMPFNAIVSLTDEWYFGYFMCDVWNSFDVYFS 125
+++ +KLR TN F+++LA ADLL L + P+ + +W FG +C + V
Sbjct: 3 ILRTKKLRTPTNIFLLNLAVADLLFLLTLPPWALYYLVGGDWPFGDALCKLVGFLFVVNG 62
Query: 126 TASILHLCCISVDRYYAIVKPLQYPIIMNQNTVLLMLSNVWILPGIISFTPIMLGWYTTP 185
ASIL L IS+DRY AIV PL+Y I +++ VW+L ++S P++ W T
Sbjct: 63 YASILLLTAISIDRYLAIVHPLRYRRIRTPRRAKVLILVVWVLALLLSLPPLLFSWLRTV 122
Query: 186 DHKQYRKHHPNVCIFVVNVYYALISSSISFWIPGFVM-----------KKKKKRKKKKKK 234
+ + Y L+S+ + F +P V+ +K+ + + +
Sbjct: 123 EEGNVTTCLIDFPEESTKRSYTLLSTLLGFVLPLLVILVCYTLILRTLRKRARSGASQAR 182
Query: 235 KKKKKRKKKK 244
K+ K++K
Sbjct: 183 AKRSSSKERK 192
>gnl|CDD|222976 PHA03087, PHA03087, G protein-coupled chemokine receptor-like
protein; Provisional.
Length = 335
Score = 71.7 bits (176), Expect = 3e-14
Identities = 50/188 (26%), Positives = 87/188 (46%), Gaps = 15/188 (7%)
Query: 3 NTIYNSLNYTYNINYTSTNFS---NTSLFPTNSTIHIHIFPFIIKGMLMCFIIITAILGN 59
NT + Y Y +++ + TNSTI I ++ I ++GN
Sbjct: 8 NTTIENTTDYYYDTYYDEDYADCDLNIGYDTNSTILI---------VVYSTIFFFGLVGN 58
Query: 60 LLVIISVIKHRKLRIITNYFVVSLAFADLLVALCVMPFNAIVSLTDEWYFGYFMCDVWNS 119
++VI V+ K++ + ++++LA +DLL + +PF + +W FG F C + +
Sbjct: 59 IIVIY-VLTKTKIKTPMDIYLLNLAVSDLLFVM-TLPFQIYYYILFQWSFGEFACKIVSG 116
Query: 120 FDVYFSTASILHLCCISVDRYYAIVKPLQYPIIMNQNTVLLMLSNVWILPGIISFTPIML 179
S+ + +SVDRY AIV P++ I ++ +WI+ II TPI+
Sbjct: 117 LYYIGFYNSMNFITVMSVDRYIAIVHPVKSNKINTVKYGYIVSLVIWII-SIIETTPILF 175
Query: 180 GWYTTPDH 187
+ T DH
Sbjct: 176 VYTTKKDH 183
>gnl|CDD|165021 PHA02638, PHA02638, CC chemokine receptor-like protein;
Provisional.
Length = 417
Score = 67.0 bits (163), Expect = 1e-12
Identities = 52/165 (31%), Positives = 87/165 (52%), Gaps = 16/165 (9%)
Query: 11 YTYNINYTSTNFSNTSLFPTNSTIHIHIFPFIIKGMLMCFIIITAILGNLLVIISVIKHR 70
Y NI Y N N ++P+ S +I IF II I + GN II ++ +
Sbjct: 77 YENNITYELINIKNKCMYPSISE-YIKIFYIII--------FILGLFGNAA-IIMILFCK 126
Query: 71 KLRIITNYFVVSLAFADLLVALCVMPFNAIVSLTDEWYFGYFMCDVWNS--FDVYFSTAS 128
K++ IT+ ++ +LA +DL+ + PF I + D+W FG FMC V ++ + +FS
Sbjct: 127 KIKTITDIYIFNLAISDLIFVI-DFPF-IIYNEFDQWIFGDFMCKVISASYYIGFFSNMF 184
Query: 129 ILHLCCISVDRYYAIVKPLQYPIIMNQNTVLLMLSNVWILPGIIS 173
++ L +S+DRY+AI+ P+ + N +++ WIL II+
Sbjct: 185 LITL--MSIDRYFAILYPISFQKYRTFNIGIILCIISWILSLIIT 227
>gnl|CDD|236877 PRK11192, PRK11192, ATP-dependent RNA helicase SrmB; Provisional.
Length = 434
Score = 53.8 bits (130), Expect = 4e-08
Identities = 19/67 (28%), Positives = 26/67 (38%), Gaps = 19/67 (28%)
Query: 225 KKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKK 284
+K+ K KK KR +KK EK+K+K K KK+ K K
Sbjct: 384 SEKKTGKPSKKVLAKRAEKK-------------------EKEKEKPKVKKRHRDTKNIGK 424
Query: 285 KRKKKKN 291
+RK
Sbjct: 425 RRKPSGT 431
Score = 52.3 bits (126), Expect = 1e-07
Identities = 20/70 (28%), Positives = 29/70 (41%), Gaps = 16/70 (22%)
Query: 224 KKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRK 283
K K +KK K KK K R E++E++K+K K KK+ + K
Sbjct: 379 KTKAPSEKKTGKPSKKVLAK--------------RAEKKEKEKEKPKVKKRH--RDTKNI 422
Query: 284 KKRKKKKNQH 293
KR+K
Sbjct: 423 GKRRKPSGTS 432
Score = 48.8 bits (117), Expect = 1e-06
Identities = 17/66 (25%), Positives = 26/66 (39%), Gaps = 20/66 (30%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
KK K KK K+ +K++K+K +K K KK+ + T KR
Sbjct: 386 KKTGKPSKKVLAKRAEKKEKEK--------------------EKPKVKKRHRDTKNIGKR 425
Query: 283 KKKRKK 288
+K
Sbjct: 426 RKPSGT 431
Score = 40.7 bits (96), Expect = 6e-04
Identities = 14/37 (37%), Positives = 18/37 (48%)
Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQHHRD 296
E+K K KK +KK K+K K K + HRD
Sbjct: 382 APSEKKTGKPSKKVLAKRAEKKEKEKEKPKVKKRHRD 418
Score = 35.7 bits (83), Expect = 0.024
Identities = 7/25 (28%), Positives = 12/25 (48%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQH 247
+K K +K+ + K KR+K
Sbjct: 408 EKPKVKKRHRDTKNIGKRRKPSGTS 432
Score = 34.5 bits (80), Expect = 0.052
Identities = 8/23 (34%), Positives = 14/23 (60%)
Query: 222 MKKKKKRKKKKKKKKKKKRKKKK 244
K+K+K K KK+ + K K++
Sbjct: 404 EKEKEKPKVKKRHRDTKNIGKRR 426
Score = 31.5 bits (72), Expect = 0.59
Identities = 11/30 (36%), Positives = 14/30 (46%)
Query: 261 EEEEKKKKKKKKKKKTTKKKKRKKKRKKKK 290
+E K K +KK KK KR +KK
Sbjct: 374 DELRPKTKAPSEKKTGKPSKKVLAKRAEKK 403
Score = 31.1 bits (71), Expect = 0.68
Identities = 6/24 (25%), Positives = 12/24 (50%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQ 246
K KKR + K K+++ ++
Sbjct: 410 PKVKKRHRDTKNIGKRRKPSGTSE 433
Score = 29.9 bits (68), Expect = 1.6
Identities = 6/24 (25%), Positives = 11/24 (45%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQ 246
K KK+ + K K++K +
Sbjct: 411 KVKKRHRDTKNIGKRRKPSGTSEE 434
>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3. This
protein, which interacts with both microtubules and
TRAF3 (tumour necrosis factor receptor-associated factor
3), is conserved from worms to humans. The N-terminal
region is the microtubule binding domain and is
well-conserved; the C-terminal 100 residues, also
well-conserved, constitute the coiled-coil region which
binds to TRAF3. The central region of the protein is
rich in lysine and glutamic acid and carries KKE motifs
which may also be necessary for tubulin-binding, but
this region is the least well-conserved.
Length = 506
Score = 53.4 bits (128), Expect = 6e-08
Identities = 19/70 (27%), Positives = 37/70 (52%), Gaps = 2/70 (2%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKK--KKKKKKTTKKK 280
+ K+KR K+K+K+K+K+ ++ + R ++ KKK KKK+ ++K
Sbjct: 136 EAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEEEK 195
Query: 281 KRKKKRKKKK 290
+R+ R+ K
Sbjct: 196 QRQAAREAVK 205
Score = 53.0 bits (127), Expect = 7e-08
Identities = 22/79 (27%), Positives = 44/79 (55%), Gaps = 11/79 (13%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
K + ++++K+K++ K+ KKKK + +EE + +K K++ K+K K+K
Sbjct: 99 KNESGKEEEKEKEQVKEEKKKKK-----------EKPKEEPKDRKPKEEAKEKRPPKEKE 147
Query: 283 KKKRKKKKNQHHRDVWQKK 301
K+K KK + R+ +K+
Sbjct: 148 KEKEKKVEEPRDREEEKKR 166
Score = 52.6 bits (126), Expect = 1e-07
Identities = 19/70 (27%), Positives = 34/70 (48%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
K++ K K+ K+K+K+K KK + R+ R + KK KKK K + +
Sbjct: 134 KEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSRPKKPPKKKPPNKKKEPPEE 193
Query: 283 KKKRKKKKNQ 292
+K+R+ +
Sbjct: 194 EKQRQAAREA 203
Score = 50.3 bits (120), Expect = 6e-07
Identities = 25/78 (32%), Positives = 45/78 (57%), Gaps = 9/78 (11%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKK---------KK 273
++K+K + K++KKKKK++ K++ + + + R +E+EK+K+KK KK
Sbjct: 106 EEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKK 165
Query: 274 KKTTKKKKRKKKRKKKKN 291
++ + K R KK KKK
Sbjct: 166 RERVRAKSRPKKPPKKKP 183
Score = 49.9 bits (119), Expect = 9e-07
Identities = 21/74 (28%), Positives = 46/74 (62%), Gaps = 5/74 (6%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
K+++++K++ K++KKK+K+K + +D EE +EK+ K+K+K+K K ++
Sbjct: 103 GKEEEKEKEQVKEEKKKKKEKPKEEPKD-----RKPKEEAKEKRPPKEKEKEKEKKVEEP 157
Query: 283 KKKRKKKKNQHHRD 296
+ + ++KK + R
Sbjct: 158 RDREEEKKRERVRA 171
Score = 38.3 bits (89), Expect = 0.004
Identities = 13/47 (27%), Positives = 28/47 (59%)
Query: 255 RTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQHHRDVWQKK 301
+T +E + + K+++K+K+ K++K+KKK K K+ R ++
Sbjct: 91 KTKPAKEPKNESGKEEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEA 137
Score = 38.3 bits (89), Expect = 0.004
Identities = 14/74 (18%), Positives = 30/74 (40%), Gaps = 2/74 (2%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
+ K R KK KKK +KK+ + + R + + EE ++++K++ K
Sbjct: 169 VRAKSRPKKPPKKKPPNKKKEPPEEEKQRQAAREAVKGKPEEPDVNEEREKEE--DDGKD 226
Query: 283 KKKRKKKKNQHHRD 296
++ +
Sbjct: 227 RETTTSPMEEDESR 240
Score = 36.8 bits (85), Expect = 0.011
Identities = 16/75 (21%), Positives = 35/75 (46%), Gaps = 14/75 (18%)
Query: 227 KRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKR 286
KR +K K + K E + E K+++K+K++ ++KK+KK++
Sbjct: 78 KRVEKGGSKGPAAKTKPAK--------------EPKNESGKEEEKEKEQVKEEKKKKKEK 123
Query: 287 KKKKNQHHRDVWQKK 301
K++ + + + K
Sbjct: 124 PKEEPKDRKPKEEAK 138
Score = 34.5 bits (79), Expect = 0.067
Identities = 15/77 (19%), Positives = 33/77 (42%), Gaps = 13/77 (16%)
Query: 221 VMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKK 280
+++ + K + KK KKK KK + EEEK+++ ++ K ++
Sbjct: 164 KKRERVRAKSRPKKPPKKKPPNKKKE-------------PPEEEKQRQAAREAVKGKPEE 210
Query: 281 KRKKKRKKKKNQHHRDV 297
+ ++K+ +D
Sbjct: 211 PDVNEEREKEEDDGKDR 227
Score = 33.7 bits (77), Expect = 0.10
Identities = 14/76 (18%), Positives = 31/76 (40%), Gaps = 15/76 (19%)
Query: 224 KKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRK 283
+ + + + KK KKK KKK E EE+K+++ ++ K ++
Sbjct: 168 RVRAKSRPKKPPKKKPPNKKK---------------EPPEEEKQRQAAREAVKGKPEEPD 212
Query: 284 KKRKKKKNQHHRDVWQ 299
+++K + +
Sbjct: 213 VNEEREKEEDDGKDRE 228
>gnl|CDD|215521 PLN02967, PLN02967, kinase.
Length = 581
Score = 52.4 bits (125), Expect = 1e-07
Identities = 20/82 (24%), Positives = 39/82 (47%), Gaps = 10/82 (12%)
Query: 219 GFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKK----- 273
G V KKK R K+ KK + + ++ +N + ++E +K ++ ++K
Sbjct: 66 GAVSKKKPTRSVKRATKKTVVEISEPLEEGSELVVNEDAALDKESKKTPRRTRRKAAAAS 125
Query: 274 -----KKTTKKKKRKKKRKKKK 290
+KT KK ++++K KK
Sbjct: 126 SDVEEEKTEKKVRKRRKVKKMD 147
Score = 30.4 bits (68), Expect = 1.2
Identities = 15/73 (20%), Positives = 26/73 (35%), Gaps = 11/73 (15%)
Query: 219 GFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTK 278
GF + ++ + + K +KK S +EE + KKK T+
Sbjct: 27 GFNKFRGERWRLQNKFLLAAGSRKK-----------IESALAVDEEPDENGAVSKKKPTR 75
Query: 279 KKKRKKKRKKKKN 291
KR K+ +
Sbjct: 76 SVKRATKKTVVEI 88
>gnl|CDD|219746 pfam08208, RNA_polI_A34, DNA-directed RNA polymerase I subunit
RPA34.5. This is a family of proteins conserved from
yeasts to human. Subunit A34.5 of RNA polymerase I is a
non-essential subunit which is thought to help Pol I
overcome topological constraints imposed on ribosomal
DNA during the process of transcription.
Length = 193
Score = 50.5 bits (121), Expect = 2e-07
Identities = 24/69 (34%), Positives = 38/69 (55%), Gaps = 15/69 (21%)
Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
K K+ + ++++KK KKKK +E +++KK+KK KK+K + K
Sbjct: 138 ETTAKVEKEAEVEEEEKKEKKKK---------------KEVKKEKKEKKDKKEKMVEPKG 182
Query: 282 RKKKRKKKK 290
KKK+KKKK
Sbjct: 183 SKKKKKKKK 191
Score = 48.2 bits (115), Expect = 1e-06
Identities = 20/66 (30%), Positives = 39/66 (59%), Gaps = 9/66 (13%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
K+ + +K+ + +++++K+KK + +E++EKK KK+K + KKK+
Sbjct: 137 KETTAKVEKEAEVEEEEKKEKKKKKEVK---------KEKKEKKDKKEKMVEPKGSKKKK 187
Query: 283 KKKRKK 288
KKK+KK
Sbjct: 188 KKKKKK 193
Score = 41.6 bits (98), Expect = 2e-04
Identities = 22/63 (34%), Positives = 35/63 (55%), Gaps = 18/63 (28%)
Query: 221 VMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKK 280
V +++KK KKKKK+ KK+K++KK ++++K + K KK KKK
Sbjct: 149 VEEEEKKEKKKKKEVKKEKKEKK------------------DKKEKMVEPKGSKKKKKKK 190
Query: 281 KRK 283
K+K
Sbjct: 191 KKK 193
Score = 40.9 bits (96), Expect = 2e-04
Identities = 20/44 (45%), Positives = 30/44 (68%)
Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQHHRDVWQKKYT 303
E E E+++KK+KKKKK KK+K++KK KK+K + +KK
Sbjct: 146 EAEVEEEEKKEKKKKKEVKKEKKEKKDKKEKMVEPKGSKKKKKK 189
>gnl|CDD|220237 pfam09428, DUF2011, Fungal protein of unknown function (DUF2011).
This is a family of fungal proteins whose function is
unknown.
Length = 130
Score = 47.7 bits (114), Expect = 5e-07
Identities = 24/67 (35%), Positives = 43/67 (64%), Gaps = 7/67 (10%)
Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
++K+ R+K+KKKK+K+ KK+ R R R +E EK+K+ +K ++K K K+
Sbjct: 71 VEKELLREKEKKKKRKRPGKKR-----RIALRLRRERTKERAEKEKRTRKNREK--KFKR 123
Query: 282 RKKKRKK 288
R+K+++K
Sbjct: 124 RQKEKEK 130
Score = 42.3 bits (100), Expect = 4e-05
Identities = 16/69 (23%), Positives = 40/69 (57%), Gaps = 13/69 (18%)
Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
+ ++K++KKK+K+ KK+R + E +++ +K+K+ +K +KK
Sbjct: 75 LLREKEKKKKRKRPGKKRRIALR-------------LRRERTKERAEKEKRTRKNREKKF 121
Query: 282 RKKKRKKKK 290
++++++K+K
Sbjct: 122 KRRQKEKEK 130
Score = 30.7 bits (70), Expect = 0.41
Identities = 14/52 (26%), Positives = 28/52 (53%), Gaps = 12/52 (23%)
Query: 253 INRTSRGEEEEEKKKKKKKKK------------KKTTKKKKRKKKRKKKKNQ 292
+ + E+E++KK+K+ KK K+ +K+KR +K ++KK +
Sbjct: 71 VEKELLREKEKKKKRKRPGKKRRIALRLRRERTKERAEKEKRTRKNREKKFK 122
Score = 29.9 bits (68), Expect = 0.72
Identities = 12/38 (31%), Positives = 24/38 (63%)
Query: 253 INRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKK 290
+ + + E+E ++K+KKKK+K KK+R R +++
Sbjct: 64 LKKHNAKVEKELLREKEKKKKRKRPGKKRRIALRLRRE 101
Score = 28.0 bits (63), Expect = 3.6
Identities = 12/39 (30%), Positives = 26/39 (66%), Gaps = 2/39 (5%)
Query: 260 EEEEEKKKKKKKKKKKTTKKK--KRKKKRKKKKNQHHRD 296
E+E ++K+KKKK+K+ KK+ + +R++ K + ++
Sbjct: 72 EKELLREKEKKKKRKRPGKKRRIALRLRRERTKERAEKE 110
>gnl|CDD|150884 pfam10278, Med19, Mediator of RNA pol II transcription subunit 19.
Med19 represents a family of conserved proteins which
are members of the multi-protein co-activator Mediator
complex. Mediator is required for activation of RNA
polymerase II transcription by DNA binding
transactivators.
Length = 178
Score = 48.7 bits (116), Expect = 5e-07
Identities = 25/71 (35%), Positives = 36/71 (50%), Gaps = 5/71 (7%)
Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
M + +KK K K KK + + + S G + EKK KKKK + +KKK
Sbjct: 104 MHIQPPKKKHKHKHKKHRTQDPLPEETPS-----DSEGLKGHEKKHKKKKHEDDKERKKK 158
Query: 282 RKKKRKKKKNQ 292
+K+K+KKKK
Sbjct: 159 KKEKKKKKKRH 169
Score = 46.7 bits (111), Expect = 2e-06
Identities = 29/72 (40%), Positives = 37/72 (51%), Gaps = 2/72 (2%)
Query: 228 RKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKK--KKTTKKKKRKKK 285
+ KKK K K KK + Q S G + EKK KKKK + K+ KKKK KKK
Sbjct: 105 HIQPPKKKHKHKHKKHRTQDPLPEETPSDSEGLKGHEKKHKKKKHEDDKERKKKKKEKKK 164
Query: 286 RKKKKNQHHRDV 297
+KK+ + H V
Sbjct: 165 KKKRHSPEHPGV 176
Score = 39.0 bits (91), Expect = 0.001
Identities = 15/27 (55%), Positives = 19/27 (70%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHR 249
KKKK + K++KKKKK KKKK + H
Sbjct: 144 HKKKKHEDDKERKKKKKEKKKKKKRHS 170
Score = 30.2 bits (68), Expect = 0.80
Identities = 21/66 (31%), Positives = 28/66 (42%), Gaps = 25/66 (37%)
Query: 218 PGFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTT 277
+ K +KK KKKK E++K++KKKKK+KK
Sbjct: 129 ETPSDSEGLKGHEKKHKKKKH-----------------------EDDKERKKKKKEKK-- 163
Query: 278 KKKKRK 283
KKKKR
Sbjct: 164 KKKKRH 169
>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
Provisional.
Length = 482
Score = 49.9 bits (120), Expect = 7e-07
Identities = 20/72 (27%), Positives = 46/72 (63%), Gaps = 2/72 (2%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK- 281
KK +K KK +K +K+++++ + + + EEEEE+K+KK+++K++ ++ +
Sbjct: 405 SKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEE 464
Query: 282 -RKKKRKKKKNQ 292
++++ +KKK Q
Sbjct: 465 EKEEEEEKKKKQ 476
Score = 45.7 bits (109), Expect = 1e-05
Identities = 23/78 (29%), Positives = 41/78 (52%), Gaps = 15/78 (19%)
Query: 224 KKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRK 283
KK KK KK +K KK++ EE++EKKKK KKK ++++ K
Sbjct: 404 GSKKATKKIKKIVEKAEKKRE---------------EEKKEKKKKAFAGKKKEEEEEEEK 448
Query: 284 KKRKKKKNQHHRDVWQKK 301
+K++++K + + ++K
Sbjct: 449 EKKEEEKEEEEEEAEEEK 466
Score = 38.7 bits (91), Expect = 0.003
Identities = 17/57 (29%), Positives = 36/57 (63%), Gaps = 1/57 (1%)
Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTS-RGEEEEEKKKKKKKKKKKTT 277
++++K++KKKK KK+++++ + + E EEEK+++++KKKK+ T
Sbjct: 422 KREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAEEEKEEEEEKKKKQAT 478
Score = 28.7 bits (65), Expect = 4.6
Identities = 12/36 (33%), Positives = 19/36 (52%), Gaps = 3/36 (8%)
Query: 261 EEEEKK---KKKKKKKKKTTKKKKRKKKRKKKKNQH 293
EEE + KK KK +K +KKR+++K +
Sbjct: 395 TEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEK 430
Score = 28.0 bits (63), Expect = 7.7
Identities = 10/37 (27%), Positives = 20/37 (54%)
Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQHHRD 296
EEE E KK KK K ++ +K+++++ + +
Sbjct: 396 EEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKK 432
>gnl|CDD|115057 pfam06375, BLVR, Bovine leukaemia virus receptor (BLVR). This
family consists of several bovine specific leukaemia
virus receptors which are thought to function as
transmembrane proteins, although their exact function is
unknown.
Length = 561
Score = 48.9 bits (116), Expect = 1e-06
Identities = 31/107 (28%), Positives = 45/107 (42%), Gaps = 33/107 (30%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHR-DVWINRT------------------------- 256
K KKK KK+K+K++ K +KK+ + ++ +
Sbjct: 201 KPKKKEKKEKEKERDKDKKKEVEGFKSLLLALDDSPASAASVAEADEASLANTVSGTAPD 260
Query: 257 -----SRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKK--KKNQHHRD 296
+ E EE KK K KKKK K+K+ KKK+KK HH D
Sbjct: 261 SEPDEPKDAEAEETKKSPKHKKKKQRKEKEEKKKKKKHHHHRCHHSD 307
Score = 39.7 bits (92), Expect = 0.001
Identities = 21/52 (40%), Positives = 29/52 (55%), Gaps = 8/52 (15%)
Query: 246 QHHRDVWINRTSRGEEEE-----EKKKKKKKKKKKTTKKKKRKKKRKKKKNQ 292
Q HR+ TS+ E+ EKK KK KKK+K K+K+R K +KK+
Sbjct: 176 QKHRNA---ETSKSPEKGDVPAVEKKSKKPKKKEKKEKEKERDKDKKKEVEG 224
Score = 35.0 bits (80), Expect = 0.044
Identities = 12/35 (34%), Positives = 23/35 (65%), Gaps = 3/35 (8%)
Query: 262 EEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQHHRD 296
EEE++ +++ +K K KKK+ K+KR +++ H
Sbjct: 85 EEERRHRQRLEKDKREKKKREKEKRGRRR---HHS 116
Score = 34.7 bits (79), Expect = 0.051
Identities = 15/61 (24%), Positives = 32/61 (52%), Gaps = 5/61 (8%)
Query: 209 ISSSISFWIPGFVM-----KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEE 263
I S+ +PG M K +++R+ +++ +K K+ KKK+ + R + + E +E
Sbjct: 64 IDLSVPLKVPGLPMSDQYVKLEEERRHRQRLEKDKREKKKREKEKRGRRRHHSLGTESDE 123
Query: 264 E 264
+
Sbjct: 124 D 124
Score = 34.3 bits (78), Expect = 0.073
Identities = 8/34 (23%), Positives = 23/34 (67%)
Query: 261 EEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQHH 294
EEE + +++ +K K+ KK++++K+ +++ +
Sbjct: 85 EEERRHRQRLEKDKREKKKREKEKRGRRRHHSLG 118
Score = 33.9 bits (77), Expect = 0.094
Identities = 9/34 (26%), Positives = 21/34 (61%)
Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQH 293
EEE +++ +K K++ K++K K+ R++ +
Sbjct: 85 EEERRHRQRLEKDKREKKKREKEKRGRRRHHSLG 118
Score = 33.1 bits (75), Expect = 0.18
Identities = 12/32 (37%), Positives = 23/32 (71%)
Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKN 291
E++ +K KKK+KK+K+ + K +KK+ + K+
Sbjct: 196 EKKSKKPKKKEKKEKEKERDKDKKKEVEGFKS 227
Score = 32.7 bits (74), Expect = 0.21
Identities = 18/42 (42%), Positives = 23/42 (54%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEE 264
KKKK+RK+K++KKKKKK + H G EEE
Sbjct: 281 KKKKQRKEKEEKKKKKKHHHHRCHHSDGGAEQPVQNGAVEEE 322
Score = 31.6 bits (71), Expect = 0.54
Identities = 26/127 (20%), Positives = 48/127 (37%), Gaps = 47/127 (37%)
Query: 222 MKKKKKRKKKKKKKKKKKRKKKK------------------------------------N 245
++K K+ KKK++K+K+ +R+ N
Sbjct: 94 LEKDKREKKKREKEKRGRRRHHSLGTESDEDIAPAQMVDIVTEEMPENALPSDEDDKDPN 153
Query: 246 QHHR--DVWINRTSRGEEE---------EEKKKKKKKKKKKTTKKKKRKKKRKKKKNQHH 294
+R D+ +++ E+ E K +K KK K+ KK++KK+ +
Sbjct: 154 DPYRALDIDLDKPLADSEKLPVQKHRNAETSKSPEKGDVPAVEKKSKKPKKKEKKEKEKE 213
Query: 295 RDVWQKK 301
RD +KK
Sbjct: 214 RDKDKKK 220
Score = 30.8 bits (69), Expect = 1.1
Identities = 9/27 (33%), Positives = 17/27 (62%)
Query: 258 RGEEEEEKKKKKKKKKKKTTKKKKRKK 284
R + EK K++KKK++K + ++R
Sbjct: 89 RHRQRLEKDKREKKKREKEKRGRRRHH 115
Score = 30.0 bits (67), Expect = 1.6
Identities = 10/31 (32%), Positives = 21/31 (67%)
Query: 265 KKKKKKKKKKKTTKKKKRKKKRKKKKNQHHR 295
K +++++ +++ K K+ KKKR+K+K R
Sbjct: 83 KLEEERRHRQRLEKDKREKKKREKEKRGRRR 113
Score = 29.6 bits (66), Expect = 2.3
Identities = 9/28 (32%), Positives = 19/28 (67%)
Query: 255 RTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
R R E++K++KKK++K+K +++
Sbjct: 88 RRHRQRLEKDKREKKKREKEKRGRRRHH 115
Score = 29.6 bits (66), Expect = 2.4
Identities = 8/28 (28%), Positives = 17/28 (60%)
Query: 258 RGEEEEEKKKKKKKKKKKTTKKKKRKKK 285
R + +K K++KKK++ K+ +R+
Sbjct: 88 RRHRQRLEKDKREKKKREKEKRGRRRHH 115
Score = 29.6 bits (66), Expect = 2.5
Identities = 17/59 (28%), Positives = 25/59 (42%), Gaps = 16/59 (27%)
Query: 224 KKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
K + ++ KK K KK+K++K E+EEKKKKKK +
Sbjct: 267 KDAEAEETKKSPKHKKKKQRK----------------EKEEKKKKKKHHHHRCHHSDGG 309
>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168). This
family consists of several hypothetical eukaryotic
proteins of unknown function.
Length = 142
Score = 46.6 bits (111), Expect = 1e-06
Identities = 21/58 (36%), Positives = 35/58 (60%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKK 280
KK+ KR+KKK+KKKKKK+ KK N+ + +EEE+ ++ K+++ +K
Sbjct: 85 KKRAKRQKKKQKKKKKKKAKKGNKKEEKEGSKSSEESSDEEEEGEEDKQEEPVEIMEK 142
Score = 38.5 bits (90), Expect = 0.001
Identities = 23/70 (32%), Positives = 39/70 (55%), Gaps = 19/70 (27%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
+K KK + ++ ++K++ KK+K +EEK KK+ K++K KK+K+
Sbjct: 57 EKWKKETEDEEFQQKREEKKRK-----------------DEEKTAKKRAKRQK--KKQKK 97
Query: 283 KKKRKKKKNQ 292
KKK+K KK
Sbjct: 98 KKKKKAKKGN 107
Score = 36.9 bits (86), Expect = 0.003
Identities = 22/68 (32%), Positives = 36/68 (52%), Gaps = 20/68 (29%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
+K++++K+K ++K KKR K+ +KKK+KKKKKKK K K+
Sbjct: 70 QKREEKKRKDEEKTAKKRAKR--------------------QKKKQKKKKKKKAKKGNKK 109
Query: 283 KKKRKKKK 290
++K K
Sbjct: 110 EEKEGSKS 117
Score = 35.8 bits (83), Expect = 0.007
Identities = 22/72 (30%), Positives = 35/72 (48%), Gaps = 20/72 (27%)
Query: 220 FVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKK 279
F K+++K++K ++K KK+ K++K KK+KKKKKKK
Sbjct: 68 FQQKREEKKRKDEEKTAKKRAKRQK--------------------KKQKKKKKKKAKKGN 107
Query: 280 KKRKKKRKKKKN 291
KK +K+ K
Sbjct: 108 KKEEKEGSKSSE 119
Score = 35.4 bits (82), Expect = 0.012
Identities = 18/63 (28%), Positives = 32/63 (50%), Gaps = 2/63 (3%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
K KKR K++KKK+KK KKKK + + EE ++++ ++ K + +
Sbjct: 82 KTAKKRAKRQKKKQKK--KKKKKAKKGNKKEEKEGSKSSEESSDEEEEGEEDKQEEPVEI 139
Query: 283 KKK 285
+K
Sbjct: 140 MEK 142
Score = 34.3 bits (79), Expect = 0.024
Identities = 19/72 (26%), Positives = 34/72 (47%), Gaps = 18/72 (25%)
Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
K ++K KK+ K++KKK+KKKK + K KK KK++K K
Sbjct: 77 RKDEEKTAKKRAKRQKKKQKKKKKK------------------KAKKGNKKEEKEGSKSS 118
Query: 282 RKKKRKKKKNQH 293
+ ++++ +
Sbjct: 119 EESSDEEEEGEE 130
Score = 33.5 bits (77), Expect = 0.046
Identities = 13/37 (35%), Positives = 28/37 (75%)
Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQHHRD 296
EE ++K+++KK+K ++ T KK+ K+++KK+K + +
Sbjct: 66 EEFQQKREEKKRKDEEKTAKKRAKRQKKKQKKKKKKK 102
Score = 29.2 bits (66), Expect = 1.5
Identities = 13/45 (28%), Positives = 28/45 (62%), Gaps = 12/45 (26%)
Query: 258 RGEEEE----------EKKKKKKKKK--KKTTKKKKRKKKRKKKK 290
+E+ ++K+++KK+K +KT KK+ +++K+K+KK
Sbjct: 53 ELMDEKWKKETEDEEFQQKREEKKRKDEEKTAKKRAKRQKKKQKK 97
Score = 27.3 bits (61), Expect = 6.1
Identities = 13/41 (31%), Positives = 27/41 (65%), Gaps = 2/41 (4%)
Query: 253 INRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRK--KKKN 291
++ + E E+E+ ++K+++KK+ ++K KK+ K KKK
Sbjct: 55 MDEKWKKETEDEEFQQKREEKKRKDEEKTAKKRAKRQKKKQ 95
>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
This is a family of fungal proteins of unknown function.
Length = 182
Score = 45.8 bits (109), Expect = 4e-06
Identities = 19/62 (30%), Positives = 34/62 (54%)
Query: 229 KKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKK 288
KKKKK+ ++ K KK + W + + +++++K K KK KK +KK +K+ +
Sbjct: 61 KKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEKEAED 120
Query: 289 KK 290
K
Sbjct: 121 KL 122
Score = 42.0 bits (99), Expect = 1e-04
Identities = 20/68 (29%), Positives = 41/68 (60%), Gaps = 4/68 (5%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
KKKKK ++ +K KK+ ++K+ + W + S+ +++++K KK KK K+ KK ++
Sbjct: 61 KKKKKELAEEIEKVKKEYEEKQ----KWKWKKKKSKKKKDKDKDKKDDKKDDKSEKKDEK 116
Query: 283 KKKRKKKK 290
+ + K +
Sbjct: 117 EAEDKLED 124
Score = 40.1 bits (94), Expect = 4e-04
Identities = 19/72 (26%), Positives = 32/72 (44%), Gaps = 15/72 (20%)
Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
++K+K K KKKK KKKK K K +++++K K +KK +K + K
Sbjct: 78 YEEKQKWKWKKKKSKKKKDKDKD---------------KKDDKKDDKSEKKDEKEAEDKL 122
Query: 282 RKKKRKKKKNQH 293
+ +
Sbjct: 123 EDLTKSYSETLS 134
Score = 38.1 bits (89), Expect = 0.002
Identities = 22/84 (26%), Positives = 35/84 (41%), Gaps = 15/84 (17%)
Query: 216 WIPGFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKK 275
W KKKK K K KK KK K +K ++E+E + K + K
Sbjct: 84 WKWKKKKSKKKKDKDKDKKDDKKDDKSEK---------------KDEKEAEDKLEDLTKS 128
Query: 276 TTKKKKRKKKRKKKKNQHHRDVWQ 299
++ + K +K H+D++Q
Sbjct: 129 YSETLSTLSELKPRKYALHKDIYQ 152
Score = 35.8 bits (83), Expect = 0.011
Identities = 18/72 (25%), Positives = 37/72 (51%), Gaps = 14/72 (19%)
Query: 221 VMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKK 280
V K+ ++++K K KKKK K+KK K+ +++++ KK K +KK + +
Sbjct: 74 VKKEYEEKQKWKWKKKKSKKKKDKD--------------KDKKDDKKDDKSEKKDEKEAE 119
Query: 281 KRKKKRKKKKNQ 292
+ + K ++
Sbjct: 120 DKLEDLTKSYSE 131
Score = 27.0 bits (60), Expect = 9.6
Identities = 17/39 (43%), Positives = 21/39 (53%), Gaps = 7/39 (17%)
Query: 260 EEEEEKKKKK-------KKKKKKTTKKKKRKKKRKKKKN 291
E E KKKKK K KK+ K+K + KK+K KK
Sbjct: 56 EYTEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKK 94
>gnl|CDD|236944 PRK11642, PRK11642, exoribonuclease R; Provisional.
Length = 813
Score = 47.8 bits (114), Expect = 5e-06
Identities = 24/93 (25%), Positives = 39/93 (41%), Gaps = 3/93 (3%)
Query: 203 NVYYALISSSISFWIPGFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEE 262
+ ++LISS + G ++K K+ KK K+++ KK D + + +
Sbjct: 719 KIDFSLISSERAPRNVGKTAREKAKKGDAGKKGGKRRQVGKKVNFEPDSAFRGEKKAKPK 778
Query: 263 EEKKKKKKKKKKKTTKKKKR---KKKRKKKKNQ 292
KK +K KK +K K KR KK
Sbjct: 779 AAKKDARKAKKPSAKTQKIAAATKAKRAAKKKV 811
Score = 32.0 bits (73), Expect = 0.44
Identities = 16/63 (25%), Positives = 21/63 (33%), Gaps = 20/63 (31%)
Query: 226 KKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKK 285
+ KK K K KK +K K K K +K TK K+ KK
Sbjct: 770 RGEKKAKPKAAKKDARKAK--------------------KPSAKTQKIAAATKAKRAAKK 809
Query: 286 RKK 288
+
Sbjct: 810 KVA 812
>gnl|CDD|217502 pfam03343, SART-1, SART-1 family. SART-1 is a protein involved in
cell cycle arrest and pre-mRNA splicing. It has been
shown to be a component of U4/U6 x U5 tri-snRNP complex
in human, Schizosaccharomyces pombe and Saccharomyces
cerevisiae. SART-1 is a known tumour antigen in a range
of cancers recognised by T cells.
Length = 603
Score = 47.1 bits (112), Expect = 6e-06
Identities = 23/96 (23%), Positives = 42/96 (43%), Gaps = 19/96 (19%)
Query: 220 FVMKKKKKRKKKKKKKKKKKRKK----KKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKK 275
M K KK KKKKKKKKK+++ + + + SR + EEE + + KK+
Sbjct: 270 SEMVKFKKPKKKKKKKKKRRKDLDEDELEPEAEGLGSSDSGSRKDVEEENARLEDSPKKR 329
Query: 276 TTKKKK---------------RKKKRKKKKNQHHRD 296
+++ ++++ +KK + R
Sbjct: 330 KEEQEDDDFVEDDDDLQASLAKQRRLAQKKRKKLRP 365
Score = 41.3 bits (97), Expect = 5e-04
Identities = 21/70 (30%), Positives = 30/70 (42%), Gaps = 3/70 (4%)
Query: 219 GFVMKKKKKRKKKKKKKKKKKRKKKKNQ---HHRDVWINRTSRGEEEEEKKKKKKKKKKK 275
F + + +KK+++ +KK K D S + E K KK KKKK
Sbjct: 223 LFTLDSGGSTDDEAEKKRQEVKKKLKINNVSLDDDSTETPASDYYDVSEMVKFKKPKKKK 282
Query: 276 TTKKKKRKKK 285
KKK+RK
Sbjct: 283 KKKKKRRKDL 292
Score = 39.7 bits (93), Expect = 0.001
Identities = 19/57 (33%), Positives = 28/57 (49%)
Query: 230 KKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKR 286
+KK+++ KK+ K N D + + + K KK KKK KKKKR+K
Sbjct: 236 AEKKRQEVKKKLKINNVSLDDDSTETPASDYYDVSEMVKFKKPKKKKKKKKKRRKDL 292
Score = 38.6 bits (90), Expect = 0.004
Identities = 18/84 (21%), Positives = 40/84 (47%), Gaps = 17/84 (20%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEE---------KKKKKKKKK 273
K+++ R+K K ++K++R K + GE++++ K KK++KKK
Sbjct: 54 KREELREKIAKAREKRERNSKLGG--------IKTLGEDDDDDDDTKAWLKKSKKRQKKK 105
Query: 274 KKTTKKKKRKKKRKKKKNQHHRDV 297
+ KK +++K++ +
Sbjct: 106 EAERKKALLLDEKEKERAAEYTSE 129
Score = 37.0 bits (86), Expect = 0.010
Identities = 22/77 (28%), Positives = 35/77 (45%), Gaps = 12/77 (15%)
Query: 228 RKKKKKKKKKKKRKKKKNQHHRDVWI--NRTSRGEEEEEKKKKKKKKKKKTT-------- 277
+K+K KK + +KKK + D R+ + +EE + KKKK T
Sbjct: 174 VEKEKDKKNLELKKKKPDYDPDDDDKFNKRSILSKYDEEIEGKKKKSDNLFTLDSGGSTD 233
Query: 278 --KKKKRKKKRKKKKNQ 292
+KKR++ +KK K
Sbjct: 234 DEAEKKRQEVKKKLKIN 250
Score = 32.8 bits (75), Expect = 0.21
Identities = 28/121 (23%), Positives = 45/121 (37%), Gaps = 44/121 (36%)
Query: 222 MKKKKKRKKKKKKKKKKK--------RKKKK------------NQHHRDVWINRTSRG-- 259
+ +K+K KK + KKKK + K+ + D S G
Sbjct: 173 LVEKEKDKKNLELKKKKPDYDPDDDDKFNKRSILSKYDEEIEGKKKKSDNLFTLDSGGST 232
Query: 260 -EEEEEKKKKKKKKKKKT---------------------TKKKKRKKKRKKKKNQHHRDV 297
+E E+K+++ KKK K K K+ KK+KKKK + +D+
Sbjct: 233 DDEAEKKRQEVKKKLKINNVSLDDDSTETPASDYYDVSEMVKFKKPKKKKKKKKKRRKDL 292
Query: 298 W 298
Sbjct: 293 D 293
Score = 28.6 bits (64), Expect = 5.0
Identities = 9/33 (27%), Positives = 20/33 (60%)
Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQ 292
EE E K+K+++ ++K ++KR++ K +
Sbjct: 47 EEAEAKRKREELREKIAKAREKRERNSKLGGIK 79
Score = 27.8 bits (62), Expect = 7.8
Identities = 20/94 (21%), Positives = 34/94 (36%), Gaps = 36/94 (38%)
Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRG---------------------- 259
KK++K+K+ ++KK +K+K + + G
Sbjct: 98 SKKRQKKKEAERKKALLLDEKEKERAAEYT--SEDLAGLKVGHKVEEFEEGEDVILTLKD 155
Query: 260 ----EEEEEK--------KKKKKKKKKKTTKKKK 281
E+E+E +K+K KK KKKK
Sbjct: 156 TGVLEDEDEGDELENVELVEKEKDKKNLELKKKK 189
>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein. TolA couples the inner
membrane complex of itself with TolQ and TolR to the
outer membrane complex of TolB and OprL (also called
Pal). Most of the length of the protein consists of
low-complexity sequence that may differ in both length
and composition from one species to another,
complicating efforts to discriminate TolA (the most
divergent gene in the tol-pal system) from paralogs such
as TonB. Selection of members of the seed alignment and
criteria for setting scoring cutoffs are based largely
conserved operon struction. //The Tol-Pal complex is
required for maintaining outer membrane integrity. Also
involved in transport (uptake) of colicins and
filamentous DNA, and implicated in pathogenesis.
Transport is energized by the proton motive force. TolA
is an inner membrane protein that interacts with
periplasmic TolB and with outer membrane porins ompC,
phoE and lamB [Transport and binding proteins, Other,
Cellular processes, Pathogenesis].
Length = 346
Score = 46.4 bits (110), Expect = 8e-06
Identities = 18/68 (26%), Positives = 31/68 (45%), Gaps = 7/68 (10%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
+ K K+ + K K + + +KK + + + EEE + K + KKK KKK
Sbjct: 123 EAKAKQAAEAKAKAEAEAEKKAKEEAK-------KQAEEEAKAKAAAEAKKKAAEAKKKA 175
Query: 283 KKKRKKKK 290
+ + K K
Sbjct: 176 EAEAKAKA 183
Score = 43.7 bits (103), Expect = 7e-05
Identities = 14/68 (20%), Positives = 30/68 (44%), Gaps = 8/68 (11%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
+K+ +++ +K K+ + K + + EE K K+ + K K + ++
Sbjct: 91 QKELEQRAAAEKAAKQAEQAAKQA--------EEKQKQAEEAKAKQAAEAKAKAEAEAEK 142
Query: 283 KKKRKKKK 290
K K + KK
Sbjct: 143 KAKEEAKK 150
Score = 41.7 bits (98), Expect = 2e-04
Identities = 11/68 (16%), Positives = 33/68 (48%), Gaps = 10/68 (14%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
+ +K+R ++ ++K+ +++ + + + E+ K+ ++K+K+ K K+
Sbjct: 79 EAEKQRAAEQARQKELEQRAAAEKAAK----------QAEQAAKQAEEKQKQAEEAKAKQ 128
Query: 283 KKKRKKKK 290
+ K K
Sbjct: 129 AAEAKAKA 136
Score = 41.4 bits (97), Expect = 4e-04
Identities = 15/70 (21%), Positives = 25/70 (35%), Gaps = 9/70 (12%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
+ K K + KKK + +KK + + E + K K ++ K K K K
Sbjct: 155 EAKAKAAAEAKKKAAEAKKKAEAE---------AKAKAEAKAKAKAEEAKAKAEAAKAKA 205
Query: 283 KKKRKKKKNQ 292
+ K
Sbjct: 206 AAEAAAKAEA 215
Score = 40.6 bits (95), Expect = 6e-04
Identities = 19/78 (24%), Positives = 39/78 (50%), Gaps = 12/78 (15%)
Query: 223 KKKKKRKKKKKKKKKKKRK---KKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKK 279
+K K+ ++ K+ ++K+K + K + + ++ E E EKK K++ KK+ +
Sbjct: 101 EKAAKQAEQAAKQAEEKQKQAEEAKAKQAAE----AKAKAEAEAEKKAKEEAKKQAEEEA 156
Query: 280 KKR-----KKKRKKKKNQ 292
K + KKK + K +
Sbjct: 157 KAKAAAEAKKKAAEAKKK 174
Score = 37.9 bits (88), Expect = 0.004
Identities = 17/69 (24%), Positives = 29/69 (42%), Gaps = 12/69 (17%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
KK K+ KK+ +++ K + + + + E KKK + + K K K K
Sbjct: 142 KKAKEEAKKQAEEEAKAKAAAEAKK------------KAAEAKKKAEAEAKAKAEAKAKA 189
Query: 283 KKKRKKKKN 291
K + K K
Sbjct: 190 KAEEAKAKA 198
Score = 37.5 bits (87), Expect = 0.005
Identities = 13/71 (18%), Positives = 33/71 (46%), Gaps = 2/71 (2%)
Query: 223 KKKKKRKKKKKKKKKK-KRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
K+ ++R +K K+ + K+ + + + ++ E + K + + +KK + KK
Sbjct: 92 KELEQRAAAEKAAKQAEQAAKQAEEKQKQAEEAK-AKQAAEAKAKAEAEAEKKAKEEAKK 150
Query: 282 RKKKRKKKKNQ 292
+ ++ K K
Sbjct: 151 QAEEEAKAKAA 161
Score = 35.2 bits (81), Expect = 0.030
Identities = 14/68 (20%), Positives = 23/68 (33%), Gaps = 15/68 (22%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
K + +KK + + K K K K + EE K K + K K +
Sbjct: 167 KAAEAKKKAEAEAKAKAEAKAKAK---------------AEEAKAKAEAAKAKAAAEAAA 211
Query: 283 KKKRKKKK 290
K + +
Sbjct: 212 KAEAEAAA 219
Score = 35.2 bits (81), Expect = 0.036
Identities = 13/70 (18%), Positives = 33/70 (47%), Gaps = 11/70 (15%)
Query: 224 KKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKK---K 280
+KK KK+++++KK ++ + R E+ +K+ +++ + K +
Sbjct: 58 QKKPAAKKEQERQKKLEQQAEEA--------EKQRAAEQARQKELEQRAAAEKAAKQAEQ 109
Query: 281 KRKKKRKKKK 290
K+ +K+K
Sbjct: 110 AAKQAEEKQK 119
Score = 34.8 bits (80), Expect = 0.042
Identities = 9/69 (13%), Positives = 32/69 (46%), Gaps = 13/69 (18%)
Query: 225 KKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKK 284
KK ++++KK +++ + +K + E+ ++K+ +++ K+ ++
Sbjct: 63 AKKEQERQKKLEQQAEEAEKQR-------------AAEQARQKELEQRAAAEKAAKQAEQ 109
Query: 285 KRKKKKNQH 293
K+ + +
Sbjct: 110 AAKQAEEKQ 118
Score = 34.4 bits (79), Expect = 0.051
Identities = 11/67 (16%), Positives = 24/67 (35%), Gaps = 12/67 (17%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
K + + K K + K K K ++ K ++ E + K + K +
Sbjct: 174 KAEAEAKAKAEAKAKAKAEEAK------------AKAEAAKAKAAAEAAAKAEAEAAAAA 221
Query: 283 KKKRKKK 289
+ ++K
Sbjct: 222 AAEAERK 228
>gnl|CDD|220695 pfam10328, 7TM_GPCR_Srx, Serpentine type 7TM GPCR chemoreceptor
Srx. Chemoreception is mediated in Caenorhabditis
elegans by members of the seven-transmembrane
G-protein-coupled receptor class (7TM GPCRs) of proteins
which are of the serpentine type. Srx is part of the Srg
superfamily of chemoreceptors. Chemoperception is one of
the central senses of soil nematodes like C. elegans
which are otherwise 'blind' and 'deaf'.
Length = 275
Score = 45.6 bits (109), Expect = 1e-05
Identities = 37/208 (17%), Positives = 83/208 (39%), Gaps = 17/208 (8%)
Query: 52 IITAILG---NLLVIISVIKHRKLRIITNYFVVSLAFADLLVALCVMPFNAIVSLTDEWY 108
+ ++ G N L+ +K L+ + A ++ ++ + + ++L D +
Sbjct: 1 FLISLFGIILNWLIFYIFLKLPSLKNSFGILCANKAISNAIICTIFLFYVVPMTLLDLSF 60
Query: 109 FGYFMCDVWNSFDVYFS-TASILHLCCISVDRYYAIVKPLQYPIIMNQNTVLLMLSNVWI 167
+ + S L IS++R+ A+ P +Y I + ++++ +WI
Sbjct: 61 LPELLNSHIGGLILLGLYEISPLTHLLISLNRFCAVFFPFKYKKIFSIKNTKIIITFIWI 120
Query: 168 LPGIISFTP-IMLGWYTT--PDHKQYRKHHPNVC-------IFVVNVYYALISSSISFWI 217
+ IIS LG + P+ + C F+ N+ +I++ I+
Sbjct: 121 IAIIISTLFYFPLGCHFYYSPESLTWSFDEDPPCAEIGWYGDFLKNLVLVIITNIINVIT 180
Query: 218 PGFVMKKKKKRKKKKKKKKKKKRKKKKN 245
+K +K KK ++ K+++KK
Sbjct: 181 ---FIKLRKFSKKSSLSSEESKKRRKKE 205
>gnl|CDD|218336 pfam04935, SURF6, Surfeit locus protein 6. The surfeit locus
protein SURF-6 is shown to be a component of the
nucleolar matrix and has a strong binding capacity for
nucleic acids.
Length = 206
Score = 44.2 bits (105), Expect = 2e-05
Identities = 28/71 (39%), Positives = 42/71 (59%), Gaps = 15/71 (21%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKK---KKKKTTKK 279
KK KRK+K+KKK KK+ W R + E+++ +++KK++ KK+K KK
Sbjct: 145 KKALKRKEKQKKKSKKE------------WKERKEKVEKKKAERQKKREENLKKRKDDKK 192
Query: 280 KKRKKKRKKKK 290
K+KKK KKK
Sbjct: 193 NKKKKKAKKKG 203
Score = 41.5 bits (98), Expect = 2e-04
Identities = 26/68 (38%), Positives = 39/68 (57%), Gaps = 11/68 (16%)
Query: 223 KKKKKRKKKKKKK-KKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
K+K+K+KKK KK+ K++K K +K + R ++ E+ KK+K KK KKKK
Sbjct: 149 KRKEKQKKKSKKEWKERKEKVEKKKAERQ----------KKREENLKKRKDDKKNKKKKK 198
Query: 282 RKKKRKKK 289
KKK +
Sbjct: 199 AKKKGRIL 206
Score = 37.7 bits (88), Expect = 0.003
Identities = 22/85 (25%), Positives = 39/85 (45%), Gaps = 20/85 (23%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKK--------- 273
+K+++RK +KK+K+K+ +KK+ Q + EE + KKK +
Sbjct: 13 RKREQRKARKKQKRKEAKKKEDAQKSEA----EEVKNEENKSKKKAAPIENAEGNIVFSK 68
Query: 274 -------KKTTKKKKRKKKRKKKKN 291
+ K +KKK+KKK +
Sbjct: 69 VEFADGEQAKKDLKLKKKKKKKKTD 93
Score = 36.5 bits (85), Expect = 0.007
Identities = 23/80 (28%), Positives = 42/80 (52%), Gaps = 13/80 (16%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEK----------KKKKKKK 272
KK + RKKK ++ + K + + ++ W ++ E + K K+K+K+K
Sbjct: 99 KKLEARKKKLEELDEDKAAEIEE---KEKWTKALAKAEGVKVKDDEKLLKKALKRKEKQK 155
Query: 273 KKKTTKKKKRKKKRKKKKNQ 292
KK + K+RK+K +KKK +
Sbjct: 156 KKSKKEWKERKEKVEKKKAE 175
Score = 35.4 bits (82), Expect = 0.020
Identities = 11/32 (34%), Positives = 24/32 (75%)
Query: 261 EEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQ 292
E +++++K++++K KK+KRK+ +KK+ Q
Sbjct: 5 EALLEQRRRKREQRKARKKQKRKEAKKKEDAQ 36
Score = 35.4 bits (82), Expect = 0.022
Identities = 20/79 (25%), Positives = 36/79 (45%), Gaps = 11/79 (13%)
Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKK-----------KKK 270
K +KK+K+K+ KKK+ +K + + + ++ E + ++
Sbjct: 18 RKARKKQKRKEAKKKEDAQKSEAEEVKNEENKSKKKAAPIENAEGNIVFSKVEFADGEQA 77
Query: 271 KKKKKTTKKKKRKKKRKKK 289
KK K KKKK+KK K+
Sbjct: 78 KKDLKLKKKKKKKKTDYKQ 96
Score = 34.2 bits (79), Expect = 0.046
Identities = 26/68 (38%), Positives = 37/68 (54%), Gaps = 14/68 (20%)
Query: 221 VMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKK 280
KKK K++ K++K+K +K+K ++ Q R EE KK+K KK KK KKK
Sbjct: 153 KQKKKSKKEWKERKEKVEKKKAER-QKKR-----------EENLKKRKDDKKNKK--KKK 198
Query: 281 KRKKKRKK 288
+KK R
Sbjct: 199 AKKKGRIL 206
Score = 31.5 bits (72), Expect = 0.33
Identities = 26/91 (28%), Positives = 41/91 (45%), Gaps = 22/91 (24%)
Query: 221 VMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTK-- 278
++ +K K KKKKKK+K Q + + R + EE +E K + ++K+K TK
Sbjct: 71 FADGEQAKKDLKLKKKKKKKKTDYKQLLKKL-EARKKKLEELDEDKAAEIEEKEKWTKAL 129
Query: 279 -------------------KKKRKKKRKKKK 290
K+K K+K+K KK
Sbjct: 130 AKAEGVKVKDDEKLLKKALKRKEKQKKKSKK 160
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 45.9 bits (108), Expect = 2e-05
Identities = 16/79 (20%), Positives = 38/79 (48%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
+ KK + K + + + + + + + +++ + KKK ++KKK + KK+
Sbjct: 1340 EAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKK 1399
Query: 283 KKKRKKKKNQHHRDVWQKK 301
++ KKK ++ + KK
Sbjct: 1400 AEEDKKKADELKKAAAAKK 1418
Score = 42.8 bits (100), Expect = 2e-04
Identities = 20/80 (25%), Positives = 38/80 (47%), Gaps = 7/80 (8%)
Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
KKK + KK + K + + ++ + + E +KKK++ KKK KKK
Sbjct: 1334 AKKKAEEAKKAAEAAKAEAEAAADEA-------EAAEEKAEAAEKKKEEAKKKADAAKKK 1386
Query: 282 RKKKRKKKKNQHHRDVWQKK 301
++K+K + + + +KK
Sbjct: 1387 AEEKKKADEAKKKAEEDKKK 1406
Score = 42.4 bits (99), Expect = 2e-04
Identities = 15/66 (22%), Positives = 33/66 (50%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
+ + ++K + ++KK++ + D + ++ +E KKK ++ KKK + KK
Sbjct: 1354 AAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKA 1413
Query: 283 KKKRKK 288
+KK
Sbjct: 1414 AAAKKK 1419
Score = 42.4 bits (99), Expect = 3e-04
Identities = 16/74 (21%), Positives = 30/74 (40%), Gaps = 3/74 (4%)
Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
KKK + KK + KKK + KK D + ++ E K + + + +
Sbjct: 1307 AKKKAEEAKKADEAKKKAEEAKKKA---DAAKKKAEEAKKAAEAAKAEAEAAADEAEAAE 1363
Query: 282 RKKKRKKKKNQHHR 295
K + +KK + +
Sbjct: 1364 EKAEAAEKKKEEAK 1377
Score = 41.7 bits (97), Expect = 4e-04
Identities = 24/83 (28%), Positives = 41/83 (49%), Gaps = 4/83 (4%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK- 281
+KK KK+ ++ KK ++ K + + + EEE K K ++ KK+ KKK
Sbjct: 1687 EKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKA 1746
Query: 282 ---RKKKRKKKKNQHHRDVWQKK 301
+K + +KKK H + +KK
Sbjct: 1747 EEAKKDEEEKKKIAHLKKEEEKK 1769
Score = 41.3 bits (96), Expect = 6e-04
Identities = 21/76 (27%), Positives = 41/76 (53%), Gaps = 4/76 (5%)
Query: 221 VMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKK----KKKKKKKKKT 276
KK ++ KKK+ ++KKK + KK + + ++ EE++KK KK ++ +KK
Sbjct: 1631 EKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKA 1690
Query: 277 TKKKKRKKKRKKKKNQ 292
+ K++ + KK +
Sbjct: 1691 AEALKKEAEEAKKAEE 1706
Score = 40.9 bits (95), Expect = 7e-04
Identities = 17/67 (25%), Positives = 31/67 (46%), Gaps = 7/67 (10%)
Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
KKK + KKK KKK ++ K ++ E E + + ++K +KK
Sbjct: 1320 AKKKAEEAKKKADAAKKKAEEAKKA-------AEAAKAEAEAAADEAEAAEEKAEAAEKK 1372
Query: 282 RKKKRKK 288
+++ +KK
Sbjct: 1373 KEEAKKK 1379
Score = 40.5 bits (94), Expect = 0.001
Identities = 15/79 (18%), Positives = 37/79 (46%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
+ + ++K + KKK+ + + + +++ ++ KKK ++ KK + K+
Sbjct: 1353 EAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKK 1412
Query: 283 KKKRKKKKNQHHRDVWQKK 301
KKK ++ + +KK
Sbjct: 1413 AAAAKKKADEAKKKAEEKK 1431
Score = 40.1 bits (93), Expect = 0.001
Identities = 24/78 (30%), Positives = 39/78 (50%), Gaps = 8/78 (10%)
Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
+KK KKK + KKK +KKK + + ++ +E KKK ++ KK KKK
Sbjct: 1410 LKKAAAAKKKADEAKKKAEEKKKADEAK----KKAEEAKKADEAKKKAEEAKKAEEAKKK 1465
Query: 282 RKKKRK----KKKNQHHR 295
++ +K KKK + +
Sbjct: 1466 AEEAKKADEAKKKAEEAK 1483
Score = 40.1 bits (93), Expect = 0.001
Identities = 23/73 (31%), Positives = 41/73 (56%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
KK ++ KKK+ ++KKK + KK + + + EE++KK ++ KK ++ KK
Sbjct: 1702 KKAEELKKKEAEEKKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAH 1761
Query: 283 KKKRKKKKNQHHR 295
KK ++KK + R
Sbjct: 1762 LKKEEEKKAEEIR 1774
Score = 39.7 bits (92), Expect = 0.002
Identities = 24/82 (29%), Positives = 41/82 (50%), Gaps = 9/82 (10%)
Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEE--EKKKKKKKKKKKTTKK 279
KKK + KK + KKK + KK + + EE + ++ KKK ++ KK +
Sbjct: 1436 AKKKAEEAKKADEAKKKAEEAKKAE-------EAKKKAEEAKKADEAKKKAEEAKKADEA 1488
Query: 280 KKRKKKRKKKKNQHHRDVWQKK 301
KK+ ++ KKK ++ + KK
Sbjct: 1489 KKKAEEAKKKADEAKKAAEAKK 1510
Score = 39.7 bits (92), Expect = 0.002
Identities = 15/70 (21%), Positives = 32/70 (45%), Gaps = 4/70 (5%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
KK + KKK ++ KK KKK + + + +++ E+ KK + K +
Sbjct: 1302 KKADEAKKKAEEAKKADEAKKKAEEAK----KKADAAKKKAEEAKKAAEAAKAEAEAAAD 1357
Query: 283 KKKRKKKKNQ 292
+ + ++K +
Sbjct: 1358 EAEAAEEKAE 1367
Score = 39.4 bits (91), Expect = 0.002
Identities = 24/81 (29%), Positives = 36/81 (44%), Gaps = 11/81 (13%)
Query: 223 KKKKKRKKKK--KKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKK 280
KKK KKK + KK + K + + D E E ++K + +KKK KK
Sbjct: 1328 KKKADAAKKKAEEAKKAAEAAKAEAEAAAD---------EAEAAEEKAEAAEKKKEEAKK 1378
Query: 281 KRKKKRKKKKNQHHRDVWQKK 301
K +KK + + D +KK
Sbjct: 1379 KADAAKKKAEEKKKADEAKKK 1399
Score = 39.4 bits (91), Expect = 0.002
Identities = 24/80 (30%), Positives = 40/80 (50%), Gaps = 4/80 (5%)
Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
KKK + KKK + KKK + KK + + ++ EE KKK ++ KK KKK
Sbjct: 1423 AKKKAEEKKKADEAKKKAEEAKKADEAK----KKAEEAKKAEEAKKKAEEAKKADEAKKK 1478
Query: 282 RKKKRKKKKNQHHRDVWQKK 301
++ +K + + + +KK
Sbjct: 1479 AEEAKKADEAKKKAEEAKKK 1498
Score = 39.4 bits (91), Expect = 0.002
Identities = 17/71 (23%), Positives = 31/71 (43%), Gaps = 3/71 (4%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
KK ++ KK + KKK + KKK + E + + + + + +K +
Sbjct: 1309 KKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEA 1368
Query: 283 KKKRK---KKK 290
+K+K KKK
Sbjct: 1369 AEKKKEEAKKK 1379
Score = 39.4 bits (91), Expect = 0.002
Identities = 21/84 (25%), Positives = 44/84 (52%), Gaps = 3/84 (3%)
Query: 221 VMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEE---EEKKKKKKKKKKKTT 277
+ K ++++KK ++ KKK+ +KKK + + + EE E+ KKK ++ KK
Sbjct: 1625 LKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAE 1684
Query: 278 KKKKRKKKRKKKKNQHHRDVWQKK 301
+ +K+ + KK+ + + + K
Sbjct: 1685 EDEKKAAEALKKEAEEAKKAEELK 1708
Score = 39.0 bits (90), Expect = 0.003
Identities = 20/81 (24%), Positives = 40/81 (49%), Gaps = 1/81 (1%)
Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEE-EEKKKKKKKKKKKTTKKK 280
+ +++ + +KKK++ +KK + + +++ EE KKK + KK KK
Sbjct: 1359 AEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKK 1418
Query: 281 KRKKKRKKKKNQHHRDVWQKK 301
K + +KK + + D +KK
Sbjct: 1419 KADEAKKKAEEKKKADEAKKK 1439
Score = 38.6 bits (89), Expect = 0.004
Identities = 25/81 (30%), Positives = 38/81 (46%), Gaps = 8/81 (9%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEE----EEKKKKKKKKKKKTTK 278
KKK KKK ++KKK + KK + + +E KKK ++KKK
Sbjct: 1377 KKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEA 1436
Query: 279 KKKRKKKRK----KKKNQHHR 295
KKK ++ +K KKK + +
Sbjct: 1437 KKKAEEAKKADEAKKKAEEAK 1457
Score = 38.6 bits (89), Expect = 0.004
Identities = 23/79 (29%), Positives = 38/79 (48%), Gaps = 5/79 (6%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
KK + KK KKK KKK + + + +++ E+ KK + KKK + KK
Sbjct: 1405 KKADELKKAAAAKKKADEAKKKAEEKK-----KADEAKKKAEEAKKADEAKKKAEEAKKA 1459
Query: 283 KKKRKKKKNQHHRDVWQKK 301
++ +KK + D +KK
Sbjct: 1460 EEAKKKAEEAKKADEAKKK 1478
Score = 38.6 bits (89), Expect = 0.004
Identities = 21/72 (29%), Positives = 34/72 (47%), Gaps = 6/72 (8%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKK----KKKKKKKKKTTK 278
KKK KK ++KKK KKK + + + + EE +KK KKK ++ KK +
Sbjct: 1289 KKKADEAKKAEEKKKADEAKKKAEEAKKA--DEAKKKAEEAKKKADAAKKKAEEAKKAAE 1346
Query: 279 KKKRKKKRKKKK 290
K + + +
Sbjct: 1347 AAKAEAEAAADE 1358
Score = 38.6 bits (89), Expect = 0.004
Identities = 22/74 (29%), Positives = 36/74 (48%)
Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
KK + KKK + KK + KK ++ + + ++ EEKKK + KK + KK +
Sbjct: 1502 AKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAE 1561
Query: 282 RKKKRKKKKNQHHR 295
KKK ++ K
Sbjct: 1562 EKKKAEEAKKAEED 1575
Score = 38.2 bits (88), Expect = 0.005
Identities = 21/67 (31%), Positives = 35/67 (52%), Gaps = 4/67 (5%)
Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
KKK + KK ++ KKK + KK + + ++ +E KKK ++ KKK + KK
Sbjct: 1449 AKKKAEEAKKAEEAKKKAEEAKKADEAK----KKAEEAKKADEAKKKAEEAKKKADEAKK 1504
Query: 282 RKKKRKK 288
+ +KK
Sbjct: 1505 AAEAKKK 1511
Score = 37.8 bits (87), Expect = 0.006
Identities = 25/82 (30%), Positives = 41/82 (50%), Gaps = 2/82 (2%)
Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGE--EEEEKKKKKKKKKKKTTKK 279
KK++ +KK KKK + KKK ++ + ++ E + KKK + KKK +K
Sbjct: 1371 KKKEEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEK 1430
Query: 280 KKRKKKRKKKKNQHHRDVWQKK 301
KK + +KK + D +KK
Sbjct: 1431 KKADEAKKKAEEAKKADEAKKK 1452
Score = 37.8 bits (87), Expect = 0.007
Identities = 23/77 (29%), Positives = 38/77 (49%), Gaps = 8/77 (10%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
KK ++ KKK + KK KKK + + ++ +E KKK ++ KK KKK
Sbjct: 1398 KKAEEDKKKADELKKAAAAKKKADEAK----KKAEEKKKADEAKKKAEEAKKADEAKKKA 1453
Query: 283 KKKRK----KKKNQHHR 295
++ +K KKK + +
Sbjct: 1454 EEAKKAEEAKKKAEEAK 1470
Score = 37.4 bits (86), Expect = 0.009
Identities = 24/83 (28%), Positives = 42/83 (50%), Gaps = 8/83 (9%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKK-- 280
+ ++KK++ KKK KKK + + ++ + EE+KKK + KK KKK
Sbjct: 1366 AEAAEKKKEEAKKKADAAKKKAEEKKK----ADEAKKKAEEDKKKADELKKAAAAKKKAD 1421
Query: 281 --KRKKKRKKKKNQHHRDVWQKK 301
K+K + KKK ++ + + K
Sbjct: 1422 EAKKKAEEKKKADEAKKKAEEAK 1444
Score = 37.0 bits (85), Expect = 0.011
Identities = 21/75 (28%), Positives = 39/75 (52%), Gaps = 4/75 (5%)
Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKK- 280
+K ++ RK + KK ++K+K + + + ++ + EE KK + KKK + KKK
Sbjct: 1272 IKAEEARKADELKKAEEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKA 1331
Query: 281 ---KRKKKRKKKKNQ 292
K+K + KK +
Sbjct: 1332 DAAKKKAEEAKKAAE 1346
Score = 36.7 bits (84), Expect = 0.017
Identities = 21/77 (27%), Positives = 38/77 (49%), Gaps = 4/77 (5%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKK-- 280
KKK + KK ++ +KK + K + + E EE+KK ++ KK ++ K K
Sbjct: 1674 KKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELKKAEEENKIKAE 1733
Query: 281 --KRKKKRKKKKNQHHR 295
K++ + KKK + +
Sbjct: 1734 EAKKEAEEDKKKAEEAK 1750
Score = 36.7 bits (84), Expect = 0.018
Identities = 22/77 (28%), Positives = 39/77 (50%), Gaps = 4/77 (5%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKK-- 280
KKK + KK +++ K K ++ + D ++ EE+EKK + KK+ KK
Sbjct: 1646 KKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAE 1705
Query: 281 --KRKKKRKKKKNQHHR 295
K+K+ +KKK + +
Sbjct: 1706 ELKKKEAEEKKKAEELK 1722
Score = 36.3 bits (83), Expect = 0.021
Identities = 25/84 (29%), Positives = 40/84 (47%), Gaps = 8/84 (9%)
Query: 223 KKKKKRKKKKKKKKKKKRK--KKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKK 280
+ KK + KKK ++ KK + KKK + + ++ + EE KK + KKK + KKK
Sbjct: 1442 EAKKADEAKKKAEEAKKAEEAKKKAEEAKKA---DEAKKKAEEAKKADEAKKKAEEAKKK 1498
Query: 281 K---RKKKRKKKKNQHHRDVWQKK 301
+K KKK + + K
Sbjct: 1499 ADEAKKAAEAKKKADEAKKAEEAK 1522
Score = 35.9 bits (82), Expect = 0.028
Identities = 18/66 (27%), Positives = 33/66 (50%), Gaps = 4/66 (6%)
Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
KKK + KK + KKK + KK + + +++ ++ KK + KKK + KK
Sbjct: 1462 AKKKAEEAKKADEAKKKAEEAKKADEAK----KKAEEAKKKADEAKKAAEAKKKADEAKK 1517
Query: 282 RKKKRK 287
++ +K
Sbjct: 1518 AEEAKK 1523
Score = 35.5 bits (81), Expect = 0.035
Identities = 15/78 (19%), Positives = 34/78 (43%), Gaps = 6/78 (7%)
Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
KK +++KK + KKK + KK ++ + ++ + + KKK ++ KK K +
Sbjct: 1295 AKKAEEKKKADEAKKKAEEAKKADEAKKKA---EEAKKKADAAKKKAEEAKKAAEAAKAE 1351
Query: 282 ---RKKKRKKKKNQHHRD 296
+ + + +
Sbjct: 1352 AEAAADEAEAAEEKAEAA 1369
Score = 35.5 bits (81), Expect = 0.039
Identities = 20/69 (28%), Positives = 39/69 (56%), Gaps = 3/69 (4%)
Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
+K ++ KK ++ KKK + KK + + + +E EE KK ++ KKK+ +KKK
Sbjct: 1661 IKAAEEAKKAEEDKKKAEEAKKAEEDEKK---AAEALKKEAEEAKKAEELKKKEAEEKKK 1717
Query: 282 RKKKRKKKK 290
++ +K ++
Sbjct: 1718 AEELKKAEE 1726
Score = 35.5 bits (81), Expect = 0.040
Identities = 23/86 (26%), Positives = 38/86 (44%), Gaps = 12/86 (13%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
KK ++ KKK + KK KKK + + ++ +E KK ++ KK KK +
Sbjct: 1490 KKAEEAKKKADEAKKAAEAKKKADEAK-----KAEEAKKADEAKKAEEAKKADEAKKAEE 1544
Query: 283 KKKRKK-------KKNQHHRDVWQKK 301
KKK + KK + + + K
Sbjct: 1545 KKKADELKKAEELKKAEEKKKAEEAK 1570
Score = 34.7 bits (79), Expect = 0.064
Identities = 21/73 (28%), Positives = 35/73 (47%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
KKK KK ++ KK KK + + + ++ +E KK ++ KK + KK +
Sbjct: 1509 KKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEE 1568
Query: 283 KKKRKKKKNQHHR 295
KK ++ KN R
Sbjct: 1569 AKKAEEDKNMALR 1581
Score = 34.3 bits (78), Expect = 0.075
Identities = 20/77 (25%), Positives = 38/77 (49%), Gaps = 3/77 (3%)
Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEE---EEEKKKKKKKKKKKTTK 278
KK ++ K K ++ KK + +KKK + + + EE EE+ K K ++ K +
Sbjct: 1612 AKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAE 1671
Query: 279 KKKRKKKRKKKKNQHHR 295
+ K+K + KK + +
Sbjct: 1672 EDKKKAEEAKKAEEDEK 1688
Score = 34.3 bits (78), Expect = 0.080
Identities = 21/79 (26%), Positives = 34/79 (43%), Gaps = 4/79 (5%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
+ +K + KK ++KKK + KK + + ++ EE KK + KKK KKK
Sbjct: 1276 EARKADELKKAEEKKKADEAKKAEEKKKA----DEAKKKAEEAKKADEAKKKAEEAKKKA 1331
Query: 283 KKKRKKKKNQHHRDVWQKK 301
+KK + K
Sbjct: 1332 DAAKKKAEEAKKAAEAAKA 1350
Score = 34.3 bits (78), Expect = 0.094
Identities = 23/81 (28%), Positives = 39/81 (48%), Gaps = 6/81 (7%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKK-- 280
K ++ +KK ++ KK + KKK + + ++ + EE KKK + KK KKK
Sbjct: 1458 KAEEAKKKAEEAKKADEAKKKAEEAKK----ADEAKKKAEEAKKKADEAKKAAEAKKKAD 1513
Query: 281 KRKKKRKKKKNQHHRDVWQKK 301
+ KK + KK + + K
Sbjct: 1514 EAKKAEEAKKADEAKKAEEAK 1534
Score = 34.3 bits (78), Expect = 0.095
Identities = 21/76 (27%), Positives = 36/76 (47%), Gaps = 2/76 (2%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK- 281
KK + KK + KKK KK + + + ++ +E KK ++KKK KK +
Sbjct: 1497 KKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEE 1556
Query: 282 -RKKKRKKKKNQHHRD 296
+K + KKK + +
Sbjct: 1557 LKKAEEKKKAEEAKKA 1572
Score = 34.3 bits (78), Expect = 0.097
Identities = 23/84 (27%), Positives = 40/84 (47%), Gaps = 10/84 (11%)
Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEE----EEKKKKKKKKKKKTT 277
KK + KKK ++ KK + +K + EE EE KKK+ ++KKK
Sbjct: 1666 EAKKAEEDKKKAEEAKKAEEDEKKAAE------ALKKEAEEAKKAEELKKKEAEEKKKAE 1719
Query: 278 KKKKRKKKRKKKKNQHHRDVWQKK 301
+ KK +++ K K + ++ + K
Sbjct: 1720 ELKKAEEENKIKAEEAKKEAEEDK 1743
Score = 34.0 bits (77), Expect = 0.13
Identities = 18/70 (25%), Positives = 38/70 (54%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
KK K + KK ++ K K ++ K + + + E EE+KK ++ KK ++ K K
Sbjct: 1605 KKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAA 1664
Query: 283 KKKRKKKKNQ 292
++ +K ++++
Sbjct: 1665 EEAKKAEEDK 1674
Score = 32.4 bits (73), Expect = 0.35
Identities = 19/78 (24%), Positives = 37/78 (47%)
Query: 224 KKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRK 283
KK + KK + KK + KK ++ + + ++ EE KK ++KKK + KK +
Sbjct: 1516 KKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEED 1575
Query: 284 KKRKKKKNQHHRDVWQKK 301
K +K + + + +
Sbjct: 1576 KNMALRKAEEAKKAEEAR 1593
Score = 32.4 bits (73), Expect = 0.36
Identities = 18/67 (26%), Positives = 32/67 (47%)
Query: 224 KKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRK 283
KK + KK ++++ + +K + R R + EE +K + KK + KK
Sbjct: 1236 KKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEA 1295
Query: 284 KKRKKKK 290
KK ++KK
Sbjct: 1296 KKAEEKK 1302
Score = 32.4 bits (73), Expect = 0.39
Identities = 16/66 (24%), Positives = 37/66 (56%)
Query: 225 KKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKK 284
KK + KK ++KKK + KK + +++ + + ++ EE + ++ K + KK K ++
Sbjct: 1552 KKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEE 1611
Query: 285 KRKKKK 290
+K ++
Sbjct: 1612 AKKAEE 1617
Score = 31.6 bits (71), Expect = 0.58
Identities = 26/84 (30%), Positives = 44/84 (52%), Gaps = 8/84 (9%)
Query: 226 KKRKKKKKKKKKKKRKKKKNQHHRDVWINRT---SRGEE-----EEEKKKKKKKKKKKTT 277
KK ++KKK ++ KK ++ KN R + +R EE EEEKK K ++ KK
Sbjct: 1558 KKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEE 1617
Query: 278 KKKKRKKKRKKKKNQHHRDVWQKK 301
K K ++ +K ++ + + +KK
Sbjct: 1618 AKIKAEELKKAEEEKKKVEQLKKK 1641
Score = 31.3 bits (70), Expect = 0.76
Identities = 20/73 (27%), Positives = 34/73 (46%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
KK KK ++ KK KK + + + + ++ EEKKK ++ KK + K
Sbjct: 1521 AKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMAL 1580
Query: 283 KKKRKKKKNQHHR 295
+K + KK + R
Sbjct: 1581 RKAEEAKKAEEAR 1593
Score = 31.3 bits (70), Expect = 0.85
Identities = 19/79 (24%), Positives = 42/79 (53%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
++ + + K +++KK K ++ + + I + EEEKKK ++ KKK+ +KKK
Sbjct: 1590 EEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKA 1649
Query: 283 KKKRKKKKNQHHRDVWQKK 301
++ +K ++ + + K
Sbjct: 1650 EELKKAEEENKIKAAEEAK 1668
Score = 30.5 bits (68), Expect = 1.5
Identities = 17/70 (24%), Positives = 37/70 (52%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
K RK ++ KK ++ R ++ + + + + ++ EE K K ++ KK +KKK
Sbjct: 1576 KNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKV 1635
Query: 283 KKKRKKKKNQ 292
++ +KK+ +
Sbjct: 1636 EQLKKKEAEE 1645
Score = 30.1 bits (67), Expect = 1.7
Identities = 20/79 (25%), Positives = 39/79 (49%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
++KKK ++ KK ++ K +K + + R + E++KK K ++ K ++ K
Sbjct: 1561 EEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKI 1620
Query: 283 KKKRKKKKNQHHRDVWQKK 301
K + KK + + V Q K
Sbjct: 1621 KAEELKKAEEEKKKVEQLK 1639
Score = 30.1 bits (67), Expect = 1.7
Identities = 19/79 (24%), Positives = 36/79 (45%), Gaps = 1/79 (1%)
Query: 224 KKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRK 283
KK + KK ++ KK +++ N+ R R + + K ++ +K KK + K
Sbjct: 1230 KKAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEK 1289
Query: 284 KKRKK-KKNQHHRDVWQKK 301
KK + KK + + + K
Sbjct: 1290 KKADEAKKAEEKKKADEAK 1308
Score = 30.1 bits (67), Expect = 1.8
Identities = 22/85 (25%), Positives = 43/85 (50%), Gaps = 6/85 (7%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
KKK + KK +++ K K ++ K + D ++ +EEE+KK KK+++ ++ R
Sbjct: 1715 KKKAEELKKAEEENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIR 1774
Query: 283 KKKRK------KKKNQHHRDVWQKK 301
K+K ++++ R KK
Sbjct: 1775 KEKEAVIEEELDEEDEKRRMEVDKK 1799
Score = 30.1 bits (67), Expect = 2.1
Identities = 23/87 (26%), Positives = 43/87 (49%), Gaps = 12/87 (13%)
Query: 221 VMKKKKKRKKKKKKKKKKKRKK----KKNQHHRDVWINRTSRGEEEEEKKK----KKKKK 272
V K ++ +K ++ KK ++ + +K + R R + EE +K KK ++
Sbjct: 1229 VKKAEEAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEE 1288
Query: 273 KKKTTKKKKRKKKRK----KKKNQHHR 295
KKK + KK ++K+K KKK + +
Sbjct: 1289 KKKADEAKKAEEKKKADEAKKKAEEAK 1315
Score = 29.7 bits (66), Expect = 2.2
Identities = 19/74 (25%), Positives = 39/74 (52%), Gaps = 4/74 (5%)
Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
MK ++ +K ++ K K ++ KK + + + + + +E EEKKK ++ KK + K K
Sbjct: 1607 MKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKK----KEAEEKKKAEELKKAEEENKIK 1662
Query: 282 RKKKRKKKKNQHHR 295
++ KK + +
Sbjct: 1663 AAEEAKKAEEDKKK 1676
Score = 29.7 bits (66), Expect = 2.4
Identities = 18/75 (24%), Positives = 43/75 (57%), Gaps = 11/75 (14%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
K+ ++ KKK ++ KK + +KKK H + +EEEKK ++ +K+K+ +++
Sbjct: 1737 KEAEEDKKKAEEAKKDEEEKKKIAHLK-----------KEEEKKAEEIRKEKEAVIEEEL 1785
Query: 283 KKKRKKKKNQHHRDV 297
++ +K++ + + +
Sbjct: 1786 DEEDEKRRMEVDKKI 1800
Score = 29.3 bits (65), Expect = 2.9
Identities = 18/59 (30%), Positives = 31/59 (52%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
KKK + KK +++KKK KK + + I + EEE ++ +K++ + KK K
Sbjct: 1743 KKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVDKKIK 1801
Score = 29.3 bits (65), Expect = 3.0
Identities = 18/78 (23%), Positives = 38/78 (48%)
Query: 224 KKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRK 283
KK + KK + KK + KKK ++ + + + ++ EE KK ++ K K ++ K
Sbjct: 1528 KKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAK 1587
Query: 284 KKRKKKKNQHHRDVWQKK 301
K + + + + ++K
Sbjct: 1588 KAEEARIEEVMKLYEEEK 1605
Score = 29.0 bits (64), Expect = 4.5
Identities = 16/70 (22%), Positives = 37/70 (52%), Gaps = 2/70 (2%)
Query: 226 KKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKK 285
KK ++ KK ++KKK ++ K R + EE ++ ++ + ++ K +++K+ K
Sbjct: 1552 KKAEELKKAEEKKKAEEAKKAEEDKNMALR--KAEEAKKAEEARIEEVMKLYEEEKKMKA 1609
Query: 286 RKKKKNQHHR 295
+ KK + +
Sbjct: 1610 EEAKKAEEAK 1619
Score = 29.0 bits (64), Expect = 4.6
Identities = 18/68 (26%), Positives = 37/68 (54%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
+KKK + KK ++ KK +KKK + + ++ + EE KK ++ + ++ K +
Sbjct: 1544 EKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEE 1603
Query: 283 KKKRKKKK 290
+KK K ++
Sbjct: 1604 EKKMKAEE 1611
>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein. The proteins in this family
are designated YL1. These proteins have been shown to be
DNA-binding and may be a transcription factor.
Length = 238
Score = 44.3 bits (105), Expect = 3e-05
Identities = 24/85 (28%), Positives = 40/85 (47%), Gaps = 16/85 (18%)
Query: 220 FVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVW--------INRTSRG------EEEEEK 265
KKKK K K R KKK++ R W ++SR E E+
Sbjct: 99 TKKKKKKDPTAAKSPKAAAPRPKKKSE--RISWAPTLLDSPRRKSSRSSTVQNKEATHER 156
Query: 266 KKKKKKKKKKTTKKKKRKKKRKKKK 290
K+++ ++KK K +++K++KK+K
Sbjct: 157 LKEREIRRKKIQAKARKRKEKKKEK 181
Score = 43.5 bits (103), Expect = 5e-05
Identities = 23/109 (21%), Positives = 39/109 (35%), Gaps = 35/109 (32%)
Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKK--------- 272
+KKKK+ K K K+ KK+KKK + + KKK ++
Sbjct: 84 LKKKKRVKTKAYKEPTKKKKKKDPT--------AAKSPKAAAPRPKKKSERISWAPTLLD 135
Query: 273 ------KKKTTKKKK------------RKKKRKKKKNQHHRDVWQKKYT 303
+ +T + K R+KK + K + +K+ T
Sbjct: 136 SPRRKSSRSSTVQNKEATHERLKEREIRRKKIQAKARKRKEKKKEKELT 184
Score = 42.4 bits (100), Expect = 1e-04
Identities = 17/71 (23%), Positives = 27/71 (38%), Gaps = 20/71 (28%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
++++R KKKK+ K K K+ KKKKKK + K
Sbjct: 78 LQREERLKKKKRVKTKAYKEPT--------------------KKKKKKDPTAAKSPKAAA 117
Query: 283 KKKRKKKKNQH 293
+ +KK +
Sbjct: 118 PRPKKKSERIS 128
Score = 41.2 bits (97), Expect = 3e-04
Identities = 22/68 (32%), Positives = 29/68 (42%), Gaps = 19/68 (27%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
+K+ +R+++ KKKK+ K K K E KKKKKK K K
Sbjct: 75 EKELQREERLKKKKRVKTKAYK-------------------EPTKKKKKKDPTAAKSPKA 115
Query: 283 KKKRKKKK 290
R KKK
Sbjct: 116 AAPRPKKK 123
Score = 39.3 bits (92), Expect = 0.001
Identities = 15/32 (46%), Positives = 23/32 (71%)
Query: 259 GEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKK 290
GE+E +++++ KKKK+ TK K K+KKKK
Sbjct: 74 GEKELQREERLKKKKRVKTKAYKEPTKKKKKK 105
Score = 39.3 bits (92), Expect = 0.001
Identities = 24/90 (26%), Positives = 40/90 (44%), Gaps = 15/90 (16%)
Query: 221 VMKKKKK-------RKKKKKKKKKKKRKKKKNQHHRDVWINRTSRG------EEEEEKKK 267
KKKKK K + KKK + D ++SR E E+ K
Sbjct: 99 TKKKKKKDPTAAKSPKAAAPRPKKKSERISWAPTLLDSPRRKSSRSSTVQNKEATHERLK 158
Query: 268 KKKKKKKKTTKK--KKRKKKRKKKKNQHHR 295
+++ ++KK K K+++KK++K+ Q R
Sbjct: 159 EREIRRKKIQAKARKRKEKKKEKELTQEER 188
Score = 38.1 bits (89), Expect = 0.003
Identities = 14/31 (45%), Positives = 21/31 (67%)
Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRKKKK 290
E E+E +++++ KKKK K K K+ KKKK
Sbjct: 73 EGEKELQREERLKKKKRVKTKAYKEPTKKKK 103
Score = 37.0 bits (86), Expect = 0.007
Identities = 17/38 (44%), Positives = 22/38 (57%), Gaps = 5/38 (13%)
Query: 260 EEEEEK-----KKKKKKKKKKTTKKKKRKKKRKKKKNQ 292
EEE EK ++ KKKK+ KT K+ KK+KKK
Sbjct: 71 EEEGEKELQREERLKKKKRVKTKAYKEPTKKKKKKDPT 108
Score = 35.4 bits (82), Expect = 0.023
Identities = 15/62 (24%), Positives = 27/62 (43%), Gaps = 11/62 (17%)
Query: 229 KKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKK 288
+K+ +++++ +KKK+ + +E KKKKKK K + KK
Sbjct: 74 GEKELQREERLKKKKRVK-----------TKAYKEPTKKKKKKDPTAAKSPKAAAPRPKK 122
Query: 289 KK 290
K
Sbjct: 123 KS 124
Score = 34.7 bits (80), Expect = 0.040
Identities = 22/77 (28%), Positives = 40/77 (51%), Gaps = 17/77 (22%)
Query: 221 VMKKKKKRKKKKKKKKKKKRKKKKNQHHRDV-------------WINRTS--RGEEEEEK 265
++ K+R+ ++KK + K RK+K+ + +++ IN S R EE+EE+
Sbjct: 153 THERLKEREIRRKKIQAKARKRKEKKKEKELTQEERLAEAKETERINLKSLERYEEQEEE 212
Query: 266 KKKKKKK--KKKTTKKK 280
KKK K + KK+ +
Sbjct: 213 KKKAKIQALKKRRLYEG 229
>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain. This domain is
found at the N terminus of SMC proteins. The SMC
(structural maintenance of chromosomes) superfamily
proteins have ATP-binding domains at the N- and
C-termini, and two extended coiled-coil domains
separated by a hinge in the middle. The eukaryotic SMC
proteins form two kind of heterodimers: the SMC1/SMC3
and the SMC2/SMC4 types. These heterodimers constitute
an essential part of higher order complexes, which are
involved in chromatin and DNA dynamics. This family also
includes the RecF and RecN proteins that are involved in
DNA metabolism and recombination.
Length = 1162
Score = 45.3 bits (107), Expect = 3e-05
Identities = 12/79 (15%), Positives = 35/79 (44%)
Query: 224 KKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRK 283
K+KK+++ KK ++ + + ++ + E+ ++ + + K+K +++
Sbjct: 169 KRKKKERLKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEENLL 228
Query: 284 KKRKKKKNQHHRDVWQKKY 302
K N+ D+ Q+
Sbjct: 229 YLDYLKLNEERIDLLQELL 247
Score = 37.6 bits (87), Expect = 0.007
Identities = 15/82 (18%), Positives = 38/82 (46%), Gaps = 5/82 (6%)
Query: 224 KKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRK 283
K+K+ K++++++ KR + +V + + EE+EE+ K + KK++ ++KK
Sbjct: 949 KEKEEDNKEEEEERNKRLLLAKEELGNVNLMAIAEFEEKEERYNKDELKKERLEEEKKEL 1008
Query: 284 KK-----RKKKKNQHHRDVWQK 300
+ ++ +
Sbjct: 1009 LREIIEETCQRFKEFLELFVSI 1030
Score = 36.9 bits (85), Expect = 0.014
Identities = 18/81 (22%), Positives = 33/81 (40%), Gaps = 9/81 (11%)
Query: 222 MKKKKKRKKKKKKKKKK-------KRKKKKNQHHRDVWINRTSRGEEEEEKKKKK--KKK 272
KK + K+K + K N+ D+ +EE E K++ K++
Sbjct: 206 QAKKALEYYQLKEKLELEEENLLYLDYLKLNEERIDLLQELLRDEQEEIESSKQELEKEE 265
Query: 273 KKKTTKKKKRKKKRKKKKNQH 293
+ K+ K++ K+KK Q
Sbjct: 266 EILAQVLKENKEEEKEKKLQE 286
Score = 33.8 bits (77), Expect = 0.12
Identities = 16/78 (20%), Positives = 44/78 (56%)
Query: 224 KKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRK 283
+K+ +K +K+ KK+K+ ++ + +++ I R + EEEE+ +K ++K ++ + +K
Sbjct: 321 EKELKKLEKELKKEKEEIEELEKELKELEIKREAEEEEEEQLEKLQEKLEQLEEELLAKK 380
Query: 284 KKRKKKKNQHHRDVWQKK 301
K ++ + + ++
Sbjct: 381 KLESERLSSAAKLKEEEL 398
Score = 33.4 bits (76), Expect = 0.17
Identities = 15/71 (21%), Positives = 41/71 (57%), Gaps = 3/71 (4%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
+K K+++++++K + KK ++++ + + EEEEK +K K +++K K K +
Sbjct: 744 QKIKEKEEEEEKSRLKKEEEEEEKSELSLKEKEL---AEEEEKTEKLKVEEEKEEKLKAQ 800
Query: 283 KKKRKKKKNQH 293
+++ + + +
Sbjct: 801 EEELRALEEEL 811
Score = 31.9 bits (72), Expect = 0.45
Identities = 17/72 (23%), Positives = 37/72 (51%), Gaps = 4/72 (5%)
Query: 220 FVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKK-KKKKKKKKTTK 278
++K+++ + K+ K++ K+KK Q + ++ EEE + + K +++K +
Sbjct: 259 QELEKEEEILAQVLKENKEEEKEKKLQE---EELKLLAKEEEELKSELLKLERRKVDDEE 315
Query: 279 KKKRKKKRKKKK 290
K K +K KK
Sbjct: 316 KLKESEKELKKL 327
Score = 31.9 bits (72), Expect = 0.49
Identities = 9/72 (12%), Positives = 38/72 (52%), Gaps = 1/72 (1%)
Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRD-VWINRTSRGEEEEEKKKKKKKKKKKTTKKK 280
++K K ++++ + +++ K++ + + I + + +EEE ++ + K+++ +K
Sbjct: 793 KEEKLKAQEEELRALEEELKEEAELLEEEQLLIEQEEKIKEEELEELALELKEEQKLEKL 852
Query: 281 KRKKKRKKKKNQ 292
++ + ++
Sbjct: 853 AEEELERLEEEI 864
Score = 30.3 bits (68), Expect = 1.5
Identities = 12/71 (16%), Positives = 36/71 (50%), Gaps = 2/71 (2%)
Query: 226 KKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTK--KKKRK 283
++ + + K++ +K ++ Q ++ + +EEE K K++++ ++ K +R+
Sbjct: 250 EQEEIESSKQELEKEEEILAQVLKENKEEEKEKKLQEEELKLLAKEEEELKSELLKLERR 309
Query: 284 KKRKKKKNQHH 294
K ++K +
Sbjct: 310 KVDDEEKLKES 320
Score = 29.9 bits (67), Expect = 2.0
Identities = 22/89 (24%), Positives = 37/89 (41%), Gaps = 18/89 (20%)
Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKK------- 274
KK K +K++ K + + K + + EEEEEK + KK++++
Sbjct: 712 ELKKLKLEKEELLADKVQEAQDKINEELKLLEQKIKEKEEEEEKSRLKKEEEEEEKSELS 771
Query: 275 -----------KTTKKKKRKKKRKKKKNQ 292
KT K K ++K +K K Q
Sbjct: 772 LKEKELAEEEEKTEKLKVEEEKEEKLKAQ 800
Score = 28.4 bits (63), Expect = 6.0
Identities = 11/70 (15%), Positives = 30/70 (42%), Gaps = 2/70 (2%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKK--KKKKKKKKTTKKK 280
++ + + +++K+ K ++ + + EE E+ K+ K ++ + +
Sbjct: 396 EELELKNEEEKEAKLLLELSEQEEDLLKEEKKEELKIVEELEESLETKQGKLTEEKEELE 455
Query: 281 KRKKKRKKKK 290
K+ K K K
Sbjct: 456 KQALKLLKDK 465
Score = 28.4 bits (63), Expect = 6.2
Identities = 9/77 (11%), Positives = 31/77 (40%), Gaps = 3/77 (3%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKN---QHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKK 279
+++++R K+ K++ + ++E +++KK+ ++ +
Sbjct: 957 EEEEERNKRLLLAKEELGNVNLMAIAEFEEKEERYNKDELKKERLEEEKKELLREIIEET 1016
Query: 280 KKRKKKRKKKKNQHHRD 296
+R K+ + +R
Sbjct: 1017 CQRFKEFLELFVSINRG 1033
>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
Length = 509
Score = 45.0 bits (107), Expect = 3e-05
Identities = 17/66 (25%), Positives = 26/66 (39%)
Query: 225 KKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKK 284
K++ K+ + KKK + + + T E + KKK K K K KK
Sbjct: 35 KEEIKEALESKKKTPEQIDQVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKAPAKK 94
Query: 285 KRKKKK 290
K K +
Sbjct: 95 KLKDEL 100
Score = 40.7 bits (96), Expect = 7e-04
Identities = 17/68 (25%), Positives = 28/68 (41%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
K K K++ K+ + KKK + V I + ++ ++ + KKK T K
Sbjct: 27 SKSKGFITKEEIKEALESKKKTPEQIDQVLIFLSGMVKDTDDATESDIPKKKTKTAAKAA 86
Query: 283 KKKRKKKK 290
K KK
Sbjct: 87 AAKAPAKK 94
Score = 36.5 bits (85), Expect = 0.016
Identities = 12/71 (16%), Positives = 24/71 (33%)
Query: 221 VMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKK 280
+ K K + K++ K+ + KK + + G ++ + KK TK
Sbjct: 23 LAAKSKSKGFITKEEIKEALESKKKTPEQIDQVLIFLSGMVKDTDDATESDIPKKKTKTA 82
Query: 281 KRKKKRKKKKN 291
+ K
Sbjct: 83 AKAAAAKAPAK 93
Score = 36.1 bits (84), Expect = 0.017
Identities = 17/81 (20%), Positives = 28/81 (34%), Gaps = 9/81 (11%)
Query: 219 GFVMKKKKKRKKKKKKKKKKKRKK---------KKNQHHRDVWINRTSRGEEEEEKKKKK 269
GF+ K++ K + KKK ++ + K + I + + K
Sbjct: 31 GFITKEEIKEALESKKKTPEQIDQVLIFLSGMVKDTDDATESDIPKKKTKTAAKAAAAKA 90
Query: 270 KKKKKKTTKKKKRKKKRKKKK 290
KKK + KK KK
Sbjct: 91 PAKKKLKDELDSSKKAEKKNA 111
Score = 30.0 bits (68), Expect = 1.6
Identities = 14/79 (17%), Positives = 23/79 (29%), Gaps = 21/79 (26%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
+ KKK K K K KKK K + ++KK ++
Sbjct: 70 TESDIPKKKTKTAAKAAAAKAPA---------------------KKKLKDELDSSKKAEK 108
Query: 283 KKKRKKKKNQHHRDVWQKK 301
K K + ++
Sbjct: 109 KNALDKDDDLNYVKDIDVL 127
Score = 29.6 bits (67), Expect = 2.6
Identities = 13/68 (19%), Positives = 21/68 (30%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
K KK K K K K + K+ + + + K ++ KK+
Sbjct: 19 KLKKLAAKSKSKGFITKEEIKEALESKKKTPEQIDQVLIFLSGMVKDTDDATESDIPKKK 78
Query: 283 KKKRKKKK 290
K K
Sbjct: 79 TKTAAKAA 86
Score = 29.2 bits (66), Expect = 2.8
Identities = 11/78 (14%), Positives = 24/78 (30%), Gaps = 14/78 (17%)
Query: 224 KKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRK 283
K + +++ K+K KK + + K K++ K + K+K
Sbjct: 2 TTASTKAELAAEEEAKKKLKKLA--------------AKSKSKGFITKEEIKEALESKKK 47
Query: 284 KKRKKKKNQHHRDVWQKK 301
+ + K
Sbjct: 48 TPEQIDQVLIFLSGMVKD 65
>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
subunit (TFIIF-alpha). Transcription initiation factor
IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
II-associating protein 74 (RAP74) is the large subunit
of transcription factor IIF (TFIIF), which is essential
for accurate initiation and stimulates elongation by RNA
polymerase II.
Length = 528
Score = 45.0 bits (106), Expect = 3e-05
Identities = 19/71 (26%), Positives = 32/71 (45%), Gaps = 8/71 (11%)
Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWI--NRTSRGEEEEEKKKKKKKKKKKTTKK 279
+ +K + K K K + D + + +G E+ +++K KKKKKK K
Sbjct: 199 ETEGEKGGGGRGKDLKIKDLEG------DDEDDGDESDKGGEDGDEEKSKKKKKKLAKNK 252
Query: 280 KKRKKKRKKKK 290
KK +K K+
Sbjct: 253 KKLDDDKKGKR 263
Score = 36.1 bits (83), Expect = 0.018
Identities = 28/114 (24%), Positives = 41/114 (35%), Gaps = 40/114 (35%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEE-------------------- 262
KKKK+K K KKK KK K D + G++E
Sbjct: 241 SKKKKKKLAKNKKKLDDDKKGKRGGDDDADEYDSDDGDDEGREEDYISDSSASGNDPEER 300
Query: 263 --------------------EEKKKKKKKKKKKTTKKKKRKKKRKKKKNQHHRD 296
EE +++K +++ +KK K+ KK K KKN +D
Sbjct: 301 EDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKD 354
Score = 31.9 bits (72), Expect = 0.41
Identities = 24/114 (21%), Positives = 42/114 (36%), Gaps = 40/114 (35%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKK---------------NQHHRDVWINRTS---------- 257
KKKKK+ K KKK +K K+ ++ + +I+ +S
Sbjct: 242 KKKKKKLAKNKKKLDDDKKGKRGGDDDADEYDSDDGDDEGREEDYISDSSASGNDPEERE 301
Query: 258 ---------------RGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQHHRD 296
+ EE +++K +++ + K KK KK + KK D
Sbjct: 302 DKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKDD 355
Score = 31.1 bits (70), Expect = 0.70
Identities = 17/67 (25%), Positives = 33/67 (49%), Gaps = 4/67 (5%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
K +++ KK KK KK K KKN +D + G++ ++ + T KK++
Sbjct: 327 KNEEEGGLSKKGKKLKKLKGKKNGLDKD----DSDSGDDSDDSDIDGEDSVSLVTAKKQK 382
Query: 283 KKKRKKK 289
+ K+++
Sbjct: 383 EPKKEEP 389
>gnl|CDD|221641 pfam12569, NARP1, NMDA receptor-regulated protein 1. This domain
family is found in eukaryotes, and is approximately 40
amino acids in length. The family is found in
association with pfam07719, pfam00515. There is a single
completely conserved residue L that may be functionally
important. NARP1 is the mammalian homologue of a yeast
N-terminal acetyltransferase that regulates entry into
the G(0) phase of the cell cycle.
Length = 516
Score = 44.9 bits (107), Expect = 3e-05
Identities = 19/68 (27%), Positives = 29/68 (42%), Gaps = 17/68 (25%)
Query: 225 KKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKK 284
+RKK +KK++K ++K +K EE +K KKK + KK K
Sbjct: 408 PAERKKLRKKQRKAEKKAEK-----------------EEAEKAAAKKKAEAAAKKAKGPD 450
Query: 285 KRKKKKNQ 292
KK +
Sbjct: 451 GETKKVDP 458
Score = 34.5 bits (80), Expect = 0.061
Identities = 10/43 (23%), Positives = 21/43 (48%)
Query: 259 GEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQHHRDVWQKK 301
G ++KK +KK++K KK ++++ K + +K
Sbjct: 404 GNLSPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAAKKA 446
Score = 34.1 bits (79), Expect = 0.090
Identities = 15/69 (21%), Positives = 28/69 (40%), Gaps = 20/69 (28%)
Query: 229 KKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKK 288
++KK +KK+RK E+K +K++ +K KK + K+ K
Sbjct: 408 PAERKKLRKKQRK--------------------AEKKAEKEEAEKAAAKKKAEAAAKKAK 447
Query: 289 KKNQHHRDV 297
+ + V
Sbjct: 448 GPDGETKKV 456
Score = 30.3 bits (69), Expect = 1.3
Identities = 10/46 (21%), Positives = 19/46 (41%)
Query: 256 TSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQHHRDVWQKK 301
GE ++KK +KK K +K+ +K + +K +
Sbjct: 398 EEEGENGNLSPAERKKLRKKQRKAEKKAEKEEAEKAAAKKKAEAAA 443
Score = 29.9 bits (68), Expect = 1.9
Identities = 9/47 (19%), Positives = 19/47 (40%), Gaps = 8/47 (17%)
Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKK 268
+K +K+ +K++ +K +KK + + E KK
Sbjct: 418 QRKAEKKAEKEEAEKAAAKKKAEAAA--------KKAKGPDGETKKV 456
>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis). This nucleolar
family of proteins are involved in 60S ribosomal
biogenesis. They are specifically involved in the
processing beyond the 27S stage of 25S rRNA maturation.
This family contains sequences that bear similarity to
the glioma tumour suppressor candidate region gene 2
protein (p60). This protein has been found to interact
with herpes simplex type 1 regulatory proteins.
Length = 387
Score = 44.7 bits (106), Expect = 3e-05
Identities = 20/75 (26%), Positives = 41/75 (54%), Gaps = 9/75 (12%)
Query: 222 MKKKKKRKKKKKKKKKKKR------KKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKK 275
K+K K ++ K+K++K+ K+ K + + + R +E +K+K + +KK+
Sbjct: 275 PKRKTKAQRNKEKRRKELEREAKEEKQLKKKLAQ---LARLKEIAKEVAQKEKARARKKE 331
Query: 276 TTKKKKRKKKRKKKK 290
K++ KKK K++K
Sbjct: 332 QRKERGEKKKLKRRK 346
Score = 40.8 bits (96), Expect = 5e-04
Identities = 23/92 (25%), Positives = 43/92 (46%), Gaps = 29/92 (31%)
Query: 220 FVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKK--------- 270
+ K + ++K K ++ K+KR+K+ E +EEK+ KKK
Sbjct: 268 PINKPVRPKRKTKAQRNKEKRRKEL-------------EREAKEEKQLKKKLAQLARLKE 314
Query: 271 -------KKKKKTTKKKKRKKKRKKKKNQHHR 295
K+K + KK++RK++ +KKK + +
Sbjct: 315 IAKEVAQKEKARARKKEQRKERGEKKKLKRRK 346
Score = 39.3 bits (92), Expect = 0.002
Identities = 20/101 (19%), Positives = 43/101 (42%), Gaps = 22/101 (21%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRG----------------------E 260
+K+ K +KK+++ ++ + KK + ++ S G E
Sbjct: 204 EKEVKAEKKRQELERVEEKKLEKMAPEASRLDEMSEGLLEESDDDGEEESDDESAWEGFE 263
Query: 261 EEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQHHRDVWQKK 301
E E K + K+KT ++ ++K+RK+ + + + KK
Sbjct: 264 SEYEPINKPVRPKRKTKAQRNKEKRRKELEREAKEEKQLKK 304
Score = 34.3 bits (79), Expect = 0.068
Identities = 23/97 (23%), Positives = 38/97 (39%), Gaps = 20/97 (20%)
Query: 220 FVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTT-- 277
FV+ KKK++KK +K K+ D + S+ KK+ KKK KK
Sbjct: 43 FVVDTTGDEATIKKKRRKKLKKPLKS----DEILENRSKVPAPSVKKRHVKKKSKKKKSK 98
Query: 278 --------------KKKKRKKKRKKKKNQHHRDVWQK 300
+ KK K + ++ + D+W +
Sbjct: 99 QVSRKELLRLKKLARGKKGGKAKLLNASKDNYDLWGE 135
Score = 32.4 bits (74), Expect = 0.28
Identities = 19/106 (17%), Positives = 39/106 (36%), Gaps = 27/106 (25%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHR----------------------DVWINRTSRGE 260
K +KKR++ ++ ++KK K W S E
Sbjct: 208 KAEKKRQELERVEEKKLEKMAPEASRLDEMSEGLLEESDDDGEEESDDESAWEGFESEYE 267
Query: 261 EEE-----EKKKKKKKKKKKTTKKKKRKKKRKKKKNQHHRDVWQKK 301
++K K ++ K+K K+ +R+ K +K+ + + + K
Sbjct: 268 PINKPVRPKRKTKAQRNKEKRRKELEREAKEEKQLKKKLAQLARLK 313
Score = 31.2 bits (71), Expect = 0.62
Identities = 16/66 (24%), Positives = 27/66 (40%), Gaps = 6/66 (9%)
Query: 220 FVMKKKKKRKKK--KKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKK----KKKKKKK 273
V +K+ R KK + KK K + ++ + D+W + KK ++
Sbjct: 99 QVSRKELLRLKKLARGKKGGKAKLLNASKDNYDLWGEEEPPDTPKWPKKDLAEHTLEQVA 158
Query: 274 KKTTKK 279
KKT K
Sbjct: 159 KKTVKT 164
>gnl|CDD|237753 PRK14552, PRK14552, C/D box methylation guide ribonucleoprotein
complex aNOP56 subunit; Provisional.
Length = 414
Score = 43.8 bits (104), Expect = 5e-05
Identities = 24/68 (35%), Positives = 36/68 (52%), Gaps = 18/68 (26%)
Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
++ KR ++ K+K K KKK+ EEKK +K+KKKKK KK K
Sbjct: 365 KEELNKRIEEIKEKYPKPPKKKR------------------EEKKPQKRKKKKKRKKKGK 406
Query: 282 RKKKRKKK 289
++KK+ +K
Sbjct: 407 KRKKKGRK 414
Score = 40.0 bits (94), Expect = 0.001
Identities = 17/28 (60%), Positives = 23/28 (82%)
Query: 265 KKKKKKKKKKKTTKKKKRKKKRKKKKNQ 292
KKK+++KK +K KKKKRKKK KK+K +
Sbjct: 384 KKKREEKKPQKRKKKKKRKKKGKKRKKK 411
Score = 38.0 bits (89), Expect = 0.005
Identities = 18/38 (47%), Positives = 27/38 (71%), Gaps = 5/38 (13%)
Query: 258 RGEEEEEK-----KKKKKKKKKKTTKKKKRKKKRKKKK 290
R EE +EK KKK+++KK + KKKK++KK+ KK+
Sbjct: 371 RIEEIKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKKR 408
Score = 37.3 bits (87), Expect = 0.008
Identities = 17/35 (48%), Positives = 25/35 (71%), Gaps = 5/35 (14%)
Query: 262 EEEKKK-----KKKKKKKKTTKKKKRKKKRKKKKN 291
EE K+K KKK+++KK K+KK+KK++KK K
Sbjct: 373 EEIKEKYPKPPKKKREEKKPQKRKKKKKRKKKGKK 407
Score = 30.3 bits (69), Expect = 1.3
Identities = 13/37 (35%), Positives = 23/37 (62%)
Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQHHRD 296
EE ++ ++ K+K K KKK+ +KK +K+K + R
Sbjct: 366 EELNKRIEEIKEKYPKPPKKKREEKKPQKRKKKKKRK 402
Score = 28.4 bits (64), Expect = 4.7
Identities = 13/38 (34%), Positives = 22/38 (57%), Gaps = 2/38 (5%)
Query: 261 EEEEKKKK--KKKKKKKTTKKKKRKKKRKKKKNQHHRD 296
EE K+ + K+K K KK++ KK +K+KK + +
Sbjct: 366 EELNKRIEEIKEKYPKPPKKKREEKKPQKRKKKKKRKK 403
>gnl|CDD|235370 PRK05244, PRK05244, Der GTPase activator; Provisional.
Length = 177
Score = 42.2 bits (100), Expect = 8e-05
Identities = 21/70 (30%), Positives = 30/70 (42%), Gaps = 14/70 (20%)
Query: 226 KKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKK------KTTKK 279
KK+K K+ K + KKK + D E E K+KKK K K + +
Sbjct: 1 KKKKSSPKRSKGMAKSKKKTREELDA--------EARERKRKKKHKGLKSGSRHNEGNTQ 52
Query: 280 KKRKKKRKKK 289
K K + +KK
Sbjct: 53 SKGKGQAQKK 62
Score = 40.7 bits (96), Expect = 3e-04
Identities = 27/93 (29%), Positives = 38/93 (40%), Gaps = 25/93 (26%)
Query: 222 MKKKKKRKKKKKKKKKKK-----------RKKKKNQHHRDVWINRTSRGEEEEEKKKKKK 270
KK ++ K K KKK RK+KK +H +R + G + + K + +
Sbjct: 2 KKKSSPKRSKGMAKSKKKTREELDAEARERKRKK-KHKGLKSGSRHNEGNTQSKGKGQAQ 60
Query: 271 KK------KKK-------TTKKKKRKKKRKKKK 290
KK KK K KK+K K KK K
Sbjct: 61 KKDPRIGSKKPIPLGVEEKVKPKKKKPKSKKPK 93
Score = 27.2 bits (61), Expect = 7.3
Identities = 20/72 (27%), Positives = 29/72 (40%), Gaps = 23/72 (31%)
Query: 222 MKKKKKRKKKK---------KKKKKKKRKKKKNQHHRDVWINRTSR-------GEEEEEK 265
K+KKK K K + K K + +KK+ S+ E+ + K
Sbjct: 31 RKRKKKHKGLKSGSRHNEGNTQSKGKGQAQKKDPRI-------GSKKPIPLGVEEKVKPK 83
Query: 266 KKKKKKKKKKTT 277
KKK K KK K +
Sbjct: 84 KKKPKSKKPKLS 95
>gnl|CDD|220431 pfam09831, DUF2058, Uncharacterized protein conserved in bacteria
(DUF2058). This domain, found in various prokaryotic
proteins, has no known function.
Length = 177
Score = 42.2 bits (100), Expect = 8e-05
Identities = 17/68 (25%), Positives = 37/68 (54%), Gaps = 5/68 (7%)
Query: 225 KKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKK 284
KKK KK KK+K+K++++ +K D + EE K +K ++ ++ ++++ +
Sbjct: 14 KKKAKKAKKEKRKQRKQARKGADDGD-----DELKQAAEEAKAEKAERDRELNRQRQAEA 68
Query: 285 KRKKKKNQ 292
++K Q
Sbjct: 69 EQKAIVAQ 76
>gnl|CDD|220600 pfam10147, CR6_interact, Growth arrest and DNA-damage-inducible
proteins-interacting protein 1. Members of this family
of proteins act as negative regulators of G1 to S cell
cycle phase progression by inhibiting cyclin-dependent
kinases. Inhibitory effects are additive with GADD45
proteins but occur also in the absence of GADD45
proteins. Furthermore, they act as a repressor of the
orphan nuclear receptor NR4A1 by inhibiting AB
domain-mediated transcriptional activity.
Length = 217
Score = 42.5 bits (100), Expect = 1e-04
Identities = 21/72 (29%), Positives = 46/72 (63%), Gaps = 1/72 (1%)
Query: 216 WIPGFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWIN-RTSRGEEEEEKKKKKKKKKK 274
I + +K+K+ +K + K++K+R + + H W++ R R +E ++K+K++KKK
Sbjct: 136 MIADWRAQKRKREQKARAAKERKERLVAEAREHFGYWVDPRDPRFQEMLQQKEKEEKKKV 195
Query: 275 KTTKKKKRKKKR 286
K K++++++KR
Sbjct: 196 KEAKRREKEEKR 207
Score = 40.2 bits (94), Expect = 5e-04
Identities = 20/68 (29%), Positives = 43/68 (63%), Gaps = 5/68 (7%)
Query: 224 KKKKRKKKKKKKKKKKRKKK---KNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKK 280
+ +KRK+++K + K+RK++ + + H W++ R +E ++K+K++KK K+
Sbjct: 141 RAQKRKREQKARAAKERKERLVAEAREHFGYWVD--PRDPRFQEMLQQKEKEEKKKVKEA 198
Query: 281 KRKKKRKK 288
KR++K +K
Sbjct: 199 KRREKEEK 206
Score = 30.5 bits (69), Expect = 0.73
Identities = 14/82 (17%), Positives = 34/82 (41%), Gaps = 20/82 (24%)
Query: 216 WIPGF-VMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKK 274
W P M ++ + ++K+K+ +++ R E E K K +
Sbjct: 97 WYPSLNQMLEENREQQKEKEARRQAR-------------------EAEIAKNMAKMPQMI 137
Query: 275 KTTKKKKRKKKRKKKKNQHHRD 296
+ +KRK+++K + + ++
Sbjct: 138 ADWRAQKRKREQKARAAKERKE 159
>gnl|CDD|218684 pfam05672, MAP7, MAP7 (E-MAP-115) family. The organisation of
microtubules varies with the cell type and is presumably
controlled by tissue-specific microtubule-associated
proteins (MAPs). The 115-kDa epithelial MAP
(E-MAP-115/MAP7) has been identified as a
microtubule-stabilising protein predominantly expressed
in cell lines of epithelial origin. The binding of this
microtubule associated protein is nucleotide
independent.
Length = 171
Score = 41.6 bits (97), Expect = 1e-04
Identities = 16/81 (19%), Positives = 55/81 (67%), Gaps = 2/81 (2%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTS--RGEEEEEKKKKKKKKKKKTTKKK 280
K+++ R+++++++++++ ++++++ R+ R + R EEE +++++++ ++ +K
Sbjct: 31 KRRQAREQREQEEQERREQEEQDRLEREELKRRAAEERLRREEEARRQEEERAREKEEKA 90
Query: 281 KRKKKRKKKKNQHHRDVWQKK 301
KRK + ++K+ Q ++ QK+
Sbjct: 91 KRKAEEEEKQEQEEQERIQKQ 111
Score = 33.9 bits (77), Expect = 0.043
Identities = 11/79 (13%), Positives = 48/79 (60%), Gaps = 8/79 (10%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
++++ R++++++ ++K+ K K+ + +E+EE+++ +K+K++ + ++
Sbjct: 71 REEEARRQEEERAREKEEKAKRKAEEEEK--------QEQEEQERIQKQKEEAEARAREE 122
Query: 283 KKKRKKKKNQHHRDVWQKK 301
++ + ++ +H + + Q++
Sbjct: 123 AERMRLEREKHFQQIEQER 141
Score = 33.1 bits (75), Expect = 0.094
Identities = 12/81 (14%), Positives = 50/81 (61%), Gaps = 11/81 (13%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEE-----------EEEKKKKKKK 271
+++++++++++++++++ + ++ + R R R EE +EEK K+K +
Sbjct: 36 REQREQEEQERREQEEQDRLEREELKRRAAEERLRREEEARRQEEERAREKEEKAKRKAE 95
Query: 272 KKKKTTKKKKRKKKRKKKKNQ 292
+++K ++++ + +++K++ +
Sbjct: 96 EEEKQEQEEQERIQKQKEEAE 116
Score = 30.8 bits (69), Expect = 0.47
Identities = 9/73 (12%), Positives = 42/73 (57%)
Query: 224 KKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRK 283
K+++ ++++ +K+K++ + + + + + R ++ E+++ ++KK+ ++ K+ ++
Sbjct: 99 KQEQEEQERIQKQKEEAEARAREEAERMRLEREKHFQQIEQERLERKKRLEEIMKRTRKS 158
Query: 284 KKRKKKKNQHHRD 296
+ + K + +
Sbjct: 159 EVSPQVKKEDPKV 171
>gnl|CDD|218797 pfam05890, Ebp2, Eukaryotic rRNA processing protein EBP2. This
family consists of several Eukaryotic rRNA processing
protein EBP2 sequences. Ebp2p is required for the
maturation of 25S rRNA and 60S subunit assembly. Ebp2p
may be one of the target proteins of Rrs1p for executing
the signal to regulate ribosome biogenesis. This family
also plays a role in chromosome segregation.
Length = 270
Score = 42.3 bits (100), Expect = 1e-04
Identities = 22/73 (30%), Positives = 31/73 (42%), Gaps = 14/73 (19%)
Query: 224 KKKKRKKKKKK---KKKKKRKKKKNQHHR----DVWINRTSRGEEEEEKKKKKKKKKKKT 276
K ++R K+KK K KK +KK+K D ++ K KKK +K
Sbjct: 154 KLQERAKEKKDMLEKIKKLKKKRKGGGDEGDLFDF-------LVDDAAKGGSKKKGRKGG 206
Query: 277 TKKKKRKKKRKKK 289
+ K KRK K
Sbjct: 207 AARGKPNAKRKAK 219
Score = 34.2 bits (79), Expect = 0.058
Identities = 18/73 (24%), Positives = 22/73 (30%), Gaps = 12/73 (16%)
Query: 219 GFVMKKKKKRKKKKKKKK----KKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKK 274
G + K K+K K K KKR K N S + KK K
Sbjct: 205 GGAARGKPNAKRKAKDAKFGFGGKKRGSKSN--------TAESSNDLSGFSVKKMKGGGG 256
Query: 275 KTTKKKKRKKKRK 287
KR K +
Sbjct: 257 AGKGGNKRPGKSR 269
Score = 31.9 bits (73), Expect = 0.30
Identities = 19/70 (27%), Positives = 34/70 (48%), Gaps = 19/70 (27%)
Query: 222 MKKKKKRKKKKKKKKKKKRKK--KKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKK 279
+ + +K ++ +K+++ KK K+ Q EK +++ K+KK +K
Sbjct: 125 LAEAAAKKASEEARKQRELKKFGKQVQV----------------EKLQERAKEKKDMLEK 168
Query: 280 -KKRKKKRKK 288
KK KKKRK
Sbjct: 169 IKKLKKKRKG 178
>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27. This protein forms
the C subunit of DNA polymerase delta. It carries the
essential residues for binding to the Pol1 subunit of
polymerase alpha, from residues 293-332, which are
characterized by the motif D--G--VT, referred to as the
DPIM motif. The first 160 residues of the protein form
the minimal domain for binding to the B subunit, Cdc1,
of polymerase delta, the final 10 C-terminal residues,
362-372, being the DNA sliding clamp, PCNA, binding
motif.
Length = 427
Score = 42.5 bits (100), Expect = 2e-04
Identities = 19/78 (24%), Positives = 34/78 (43%), Gaps = 9/78 (11%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDV---------WINRTSRGEEEEEKKKKKKKKK 273
KKK K KK+KK+ + K++ + E+E+E K ++
Sbjct: 223 KKKTKEKKEKKEASESTVKEESEEESGKRDVILEDESAEPTGLDEDEDEDEPKPSGERSD 282
Query: 274 KKTTKKKKRKKKRKKKKN 291
+ ++K K+KRK+ K
Sbjct: 283 SEEETEEKEKEKRKRLKK 300
Score = 34.8 bits (80), Expect = 0.047
Identities = 21/89 (23%), Positives = 44/89 (49%), Gaps = 14/89 (15%)
Query: 218 PGFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRD----VWINRTSRGEEEEEKKK------ 267
P ++ ++ ++K+K+K+++ KK D + I S EEEE ++
Sbjct: 274 PKPSGERSDSEEETEEKEKEKRKRLKKMMEDEDEDEEMEIVPESPVEEEESEEPEPPPLP 333
Query: 268 KKKKKKKKTTKK----KKRKKKRKKKKNQ 292
KK+++K++ T ++R ++R KK
Sbjct: 334 KKEEEKEEVTVSPDGGRRRGRRRVMKKKT 362
Score = 33.7 bits (77), Expect = 0.11
Identities = 20/73 (27%), Positives = 27/73 (36%), Gaps = 13/73 (17%)
Query: 228 RKKKKKKKKKKKRKK--------KKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKK 279
R++ +++ KKK K KK W + S E E K K K
Sbjct: 350 RRRGRRRVMKKKTFKDEEGYLVTKKVY----EWES-FSEDEAEPPPTKPKPKVSTPAVPA 404
Query: 280 KKRKKKRKKKKNQ 292
+K K KKK Q
Sbjct: 405 AAKKPKAPKKKKQ 417
Score = 32.5 bits (74), Expect = 0.23
Identities = 16/72 (22%), Positives = 25/72 (34%), Gaps = 2/72 (2%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKK--K 280
K K + K K KK ++ +T + K+ KK
Sbjct: 163 PSSKPPKSIMSPEVKVKSAKKTQDTSKETTTEKTEGKTSVKAASLKRNPPKKSNIMSSFF 222
Query: 281 KRKKKRKKKKNQ 292
K+K K KK+K +
Sbjct: 223 KKKTKEKKEKKE 234
Score = 30.6 bits (69), Expect = 1.1
Identities = 19/82 (23%), Positives = 31/82 (37%), Gaps = 3/82 (3%)
Query: 221 VMKKKKKRKKKKK-KKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKK 279
+M + K K KK + K+ +K + V R + K KK TK+
Sbjct: 171 IMSPEVKVKSAKKTQDTSKETTTEKTEGKTSVKAASLKRNPPK--KSNIMSSFFKKKTKE 228
Query: 280 KKRKKKRKKKKNQHHRDVWQKK 301
KK KK+ + + + K
Sbjct: 229 KKEKKEASESTVKEESEEESGK 250
Score = 30.6 bits (69), Expect = 1.1
Identities = 16/68 (23%), Positives = 31/68 (45%), Gaps = 7/68 (10%)
Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
+K K K KR K N S +++ K+KK+KK+ ++T K++
Sbjct: 191 TTTEKTEGKTSVKAASLKRNPPKKS-------NIMSSFFKKKTKEKKEKKEASESTVKEE 243
Query: 282 RKKKRKKK 289
+++ K+
Sbjct: 244 SEEESGKR 251
>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin. Trichoplein
or mitostatin, was first defined as a meiosis-specific
nuclear structural protein. It has since been linked
with mitochondrial movement. It is associated with the
mitochondrial outer membrane, and over-expression leads
to reduction in mitochondrial motility whereas lack of
it enhances mitochondrial movement. The activity appears
to be mediated through binding the mitochondria to the
actin intermediate filaments (IFs).
Length = 349
Score = 42.2 bits (100), Expect = 2e-04
Identities = 19/81 (23%), Positives = 48/81 (59%), Gaps = 6/81 (7%)
Query: 223 KKKKKRKKKKKKKKKKKRKKK------KNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKT 276
++K+R+K+K++ +K++R+K+ + Q + R EEE E+++ +K+ +
Sbjct: 213 YERKERQKEKEEAEKRRRQKQELQRAREEQIEEKEERLQEERAEEEAERERMLEKQAEDE 272
Query: 277 TKKKKRKKKRKKKKNQHHRDV 297
+++ +KR+ K+ +H R++
Sbjct: 273 ELEQENAEKRRMKRLEHRREL 293
Score = 39.9 bits (94), Expect = 0.001
Identities = 15/88 (17%), Positives = 47/88 (53%), Gaps = 17/88 (19%)
Query: 222 MKKKKKRKKKKK--------KKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKK 273
K+K++R++ + ++K+++ + ++ R+ R +EE+E + ++K++K
Sbjct: 63 RKRKEERREGRAVLQEQIEEREKRRQEEYEERLQEREQMDEIIERIQEEDEAEAQEKREK 122
Query: 274 KKTTK---------KKKRKKKRKKKKNQ 292
+K + + +RK++ K+++ +
Sbjct: 123 QKKLREEIDEFNEERIERKEEEKERERE 150
Score = 39.1 bits (92), Expect = 0.002
Identities = 14/83 (16%), Positives = 45/83 (54%), Gaps = 5/83 (6%)
Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWI----NRTSRGEEEEEKKKKKKKKKKKTT 277
M +K+ ++ +++ +KR+ K+ +H R++ R E EE+ ++ ++ +++
Sbjct: 264 MLEKQAEDEELEQENAEKRRMKRLEHRRELEQQIEEKEERRAAEREEELEEGERLREEEA 323
Query: 278 KKKKR-KKKRKKKKNQHHRDVWQ 299
+++ R +++R++ +H +
Sbjct: 324 ERQARIEEERQRLLKEHAEALLG 346
Score = 38.7 bits (91), Expect = 0.003
Identities = 14/72 (19%), Positives = 39/72 (54%), Gaps = 4/72 (5%)
Query: 225 KKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKK----KKKKKKKKKTTKKK 280
+ + ++K++K+KK + + + R E EEE K +++K ++++ + +
Sbjct: 113 EAEAQEKREKQKKLREEIDEFNEERIERKEEEKEREREEELKILEYQREKAEREEEREAE 172
Query: 281 KRKKKRKKKKNQ 292
+R++K +K++
Sbjct: 173 RRERKEEKEREV 184
Score = 36.0 bits (84), Expect = 0.018
Identities = 13/82 (15%), Positives = 44/82 (53%), Gaps = 10/82 (12%)
Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWI----NRTSRGEEEEEKKKKKKKKKKKTT 277
M++++ + +++++++KRK+++ + + R EE EE+ +++++ +
Sbjct: 47 MEEERLKALAEEEERERKRKEERREGRAVLQEQIEEREKRRQEEYEERLQEREQMDEIIE 106
Query: 278 K------KKKRKKKRKKKKNQH 293
+ + ++K+ K+KK +
Sbjct: 107 RIQEEDEAEAQEKREKQKKLRE 128
Score = 36.0 bits (84), Expect = 0.019
Identities = 14/88 (15%), Positives = 48/88 (54%), Gaps = 8/88 (9%)
Query: 223 KKKKKRK--------KKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKK 274
K+KK R+ + ++K+++K+R++++ + + R EE E +++++K++K+
Sbjct: 122 KQKKLREEIDEFNEERIERKEEEKEREREEELKILEYQREKAEREEEREAERRERKEEKE 181
Query: 275 KTTKKKKRKKKRKKKKNQHHRDVWQKKY 302
+ + + +++ + + + ++ Y
Sbjct: 182 REVARLRAQQEEAEDEREELDELRADLY 209
Score = 34.5 bits (80), Expect = 0.052
Identities = 13/81 (16%), Positives = 45/81 (55%), Gaps = 20/81 (24%)
Query: 222 MKKKKKRKKKKKKKKK-------KKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKK 274
+++KK+ K ++K++++ ++R K + EEE E+K+K+++++
Sbjct: 26 IEEKKRIKAEEKEEERRIDEMMEEERLKALAE-------------EEERERKRKEERREG 72
Query: 275 KTTKKKKRKKKRKKKKNQHHR 295
+ +++ +++ K+++ ++
Sbjct: 73 RAVLQEQIEEREKRRQEEYEE 93
Score = 33.3 bits (77), Expect = 0.13
Identities = 10/79 (12%), Positives = 44/79 (55%), Gaps = 7/79 (8%)
Query: 222 MKKKKKRKKKKKKK-----KKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKT 276
+ +++ +K++K++ + ++ + + + D +EE E+K+++K+K++
Sbjct: 169 REAERRERKEEKEREVARLRAQQEEAEDEREELDELRAD--LYQEEYERKERQKEKEEAE 226
Query: 277 TKKKKRKKKRKKKKNQHHR 295
+++++++ ++ ++ Q
Sbjct: 227 KRRRQKQELQRAREEQIEE 245
>gnl|CDD|240246 PTZ00053, PTZ00053, methionine aminopeptidase 2; Provisional.
Length = 470
Score = 42.0 bits (99), Expect = 2e-04
Identities = 24/68 (35%), Positives = 36/68 (52%)
Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
K++K++ K+K KKK +K KK+ D ++ +E E K+ KKKKKK KKKK
Sbjct: 9 EVKQQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKKKKKKKKKK 68
Query: 282 RKKKRKKK 289
+
Sbjct: 69 KNLGEAYD 76
Score = 39.7 bits (93), Expect = 0.001
Identities = 23/69 (33%), Positives = 35/69 (50%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
+ K+++++ K+K KKK KK K D E +E + K+ KKKKK KKKK+
Sbjct: 9 EVKQQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKKKKKKKKKK 68
Query: 283 KKKRKKKKN 291
K +
Sbjct: 69 KNLGEAYDL 77
Score = 37.0 bits (86), Expect = 0.011
Identities = 21/74 (28%), Positives = 37/74 (50%), Gaps = 9/74 (12%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWI---------NRTSRGEEEEEKKKKKKKKK 273
+ + K++K++ K+K K+K KK++ D N+ + ++ +KKKKKKKKK
Sbjct: 7 ENEVKQQKQQNKQKGTKKKNKKSKKDVDDDDAFLAELISENQEAENKQNNKKKKKKKKKK 66
Query: 274 KKTTKKKKRKKKRK 287
KK +
Sbjct: 67 KKKNLGEAYDLAYD 80
>gnl|CDD|192535 pfam10320, 7TM_GPCR_Srsx, Serpentine type 7TM GPCR chemoreceptor
Srsx. Chemoreception is mediated in Caenorhabditis
elegans by members of the seven-transmembrane
G-protein-coupled receptor class (7TM GPCRs) of proteins
which are of the serpentine type. Srsx is a solo family
amongst the superfamilies of chemoreceptors.
Chemoperception is one of the central senses of soil
nematodes like C. elegans which are otherwise 'blind'
and 'deaf'.
Length = 257
Score = 41.4 bits (98), Expect = 2e-04
Identities = 41/196 (20%), Positives = 75/196 (38%), Gaps = 21/196 (10%)
Query: 52 IITAILGNLLVIISVIKHRKLRIITNYFVVSLAFADLLVALCVMPFNAIVSLTDEWYFGY 111
+ I GN+++II K +KLR +Y + ADLL + F ++ T
Sbjct: 1 SVIGIFGNVIMIILTFKKKKLRSKCSYLICVQCLADLLCLSGEIIFVVLL-FTGTQLTRN 59
Query: 112 FMCDVWNSFDVYFSTASILHLCCISVDRYYAIVKPLQYPIIMNQNTVLLMLSNVWILPGI 171
C + ++ TA + I +DR A+ P+ Y ++ + + + L I P I
Sbjct: 60 E-CFLIIIPYIFGQTAQSPLMLMIGIDRLIAVKFPIFYRLLSSSKYLFIQL----IFPVI 114
Query: 172 ISFTPIMLGWYTTPDHKQYRKHHPNVC---------IFVVNVYYALISSSISFWIPGFVM 222
S ++ G+ D VC F + ++I + I + ++
Sbjct: 115 YSSFILVYGFLERDDETII------VCAPPLALNGTAFRIFTLSSVIINVIVLIVYIILI 168
Query: 223 KKKKKRKKKKKKKKKK 238
KK+ + KK
Sbjct: 169 IIFKKKGQTTNSDSKK 184
>gnl|CDD|219547 pfam07741, BRF1, Brf1-like TBP-binding domain. This region covers
both the Brf homology II and III regions. This region is
involved in binding TATA binding protein.
Length = 95
Score = 39.2 bits (92), Expect = 3e-04
Identities = 15/33 (45%), Positives = 25/33 (75%)
Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQ 292
EE+EEK+ K+K + + KKK+K+K KKK+++
Sbjct: 30 EEQEEKELKQKADEGNNSGKKKKKRKAKKKRDE 62
Score = 35.3 bits (82), Expect = 0.005
Identities = 13/33 (39%), Positives = 21/33 (63%)
Query: 261 EEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQH 293
EE+E+K+ K+K + KK+KK++ KKK
Sbjct: 30 EEQEEKELKQKADEGNNSGKKKKKRKAKKKRDE 62
Score = 34.9 bits (81), Expect = 0.007
Identities = 13/33 (39%), Positives = 21/33 (63%)
Query: 262 EEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQHH 294
EE+++K+ K+K + K+KKKRK KK +
Sbjct: 30 EEQEEKELKQKADEGNNSGKKKKKRKAKKKRDE 62
Score = 29.9 bits (68), Expect = 0.42
Identities = 16/54 (29%), Positives = 28/54 (51%)
Query: 221 VMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKK 274
+K+ K+K + KK+KK+K + RD ++ E ++ +KKK KK
Sbjct: 32 QEEKELKQKADEGNNSGKKKKKRKAKKKRDEAGPASTAAEAVKKMLQKKKFSKK 85
Score = 28.0 bits (63), Expect = 2.0
Identities = 12/32 (37%), Positives = 18/32 (56%), Gaps = 6/32 (18%)
Query: 223 KKKKKRKKKKK------KKKKKKRKKKKNQHH 248
++++K K+K KKKKKRK KK +
Sbjct: 31 EQEEKELKQKADEGNNSGKKKKKRKAKKKRDE 62
Score = 26.1 bits (58), Expect = 8.4
Identities = 9/25 (36%), Positives = 13/25 (52%)
Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKK 284
+ +E KKKKK+K KK+
Sbjct: 40 KADEGNNSGKKKKKRKAKKKRDEAG 64
>gnl|CDD|218188 pfam04641, Rtf2, Replication termination factor 2. It is vital for
effective cell-replication that replication is not
stalled at any point by, for instance, damaged bases.
Rtf2 stabilizes the replication fork stalled at the
site-specific replication barrier RTS1 by preventing
replication restart until completion of DNA synthesis by
a converging replication fork initiated at a flanking
origin. The RTS1 element terminates replication forks
that are moving in the cen2-distal direction while
allowing forks moving in the cen2-proximal direction to
pass through the region. Rtf2 contains a C2HC2 motif
related to the C3HC4 RING-finger motif, and would appear
to fold up, creating a RING finger-like structure but
forming only one functional Zn2+ ion-binding site.
Length = 254
Score = 41.2 bits (97), Expect = 3e-04
Identities = 20/74 (27%), Positives = 31/74 (41%), Gaps = 10/74 (13%)
Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKK---KKKKKKKKKTTK 278
+K + + ++ KKKKKKKK+K KKN S E + +
Sbjct: 172 LKARLEEERAKKKKKKKKKKTKKNN-------ATGSSAEATVSSAVPTELSSGAGQVGEA 224
Query: 279 KKKRKKKRKKKKNQ 292
KK +KK+ N+
Sbjct: 225 KKLKKKRSIAPDNE 238
Score = 38.1 bits (89), Expect = 0.003
Identities = 17/64 (26%), Positives = 26/64 (40%)
Query: 225 KKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKK 284
KKK+KKKKKK KK + + + + KK KKK+ ++ +
Sbjct: 182 KKKKKKKKKKTKKNNATGSSAEATVSSAVPTELSSGAGQVGEAKKLKKKRSIAPDNEKSE 241
Query: 285 KRKK 288
K
Sbjct: 242 VYKS 245
Score = 33.9 bits (78), Expect = 0.080
Identities = 15/31 (48%), Positives = 22/31 (70%)
Query: 262 EEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQ 292
EEE + K + +++ KKKK+KKK+K KKN
Sbjct: 166 EEEVELLKARLEEERAKKKKKKKKKKTKKNN 196
Score = 32.7 bits (75), Expect = 0.15
Identities = 18/69 (26%), Positives = 30/69 (43%), Gaps = 5/69 (7%)
Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
++ K R ++++ KKKKK+KKKK N T E + + +
Sbjct: 169 VELLKARLEEERAKKKKKKKKKK-----TKKNNATGSSAEATVSSAVPTELSSGAGQVGE 223
Query: 282 RKKKRKKKK 290
KK +KK+
Sbjct: 224 AKKLKKKRS 232
Score = 32.3 bits (74), Expect = 0.26
Identities = 11/35 (31%), Positives = 14/35 (40%)
Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQHH 294
E ++KKKKKKKK KK +
Sbjct: 179 ERAKKKKKKKKKKTKKNNATGSSAEATVSSAVPTE 213
Score = 31.5 bits (72), Expect = 0.42
Identities = 14/33 (42%), Positives = 22/33 (66%)
Query: 259 GEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKN 291
EEE + K + ++++ KKKK+KKK+ KK N
Sbjct: 164 PTEEEVELLKARLEEERAKKKKKKKKKKTKKNN 196
Score = 30.0 bits (68), Expect = 1.4
Identities = 12/34 (35%), Positives = 19/34 (55%)
Query: 258 RGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKN 291
EE E K + ++++ K KKKK+KK +K
Sbjct: 165 TEEEVELLKARLEEERAKKKKKKKKKKTKKNNAT 198
Score = 29.2 bits (66), Expect = 2.2
Identities = 14/68 (20%), Positives = 20/68 (29%), Gaps = 12/68 (17%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
KKKKK+ KK S E + + KK KK+
Sbjct: 186 KKKKKKTKKNNATGSSAEATVS------------SAVPTELSSGAGQVGEAKKLKKKRSI 233
Query: 283 KKKRKKKK 290
+K +
Sbjct: 234 APDNEKSE 241
>gnl|CDD|222636 pfam14265, DUF4355, Domain of unknown function (DUF4355). This
family of proteins is found in bacteria and viruses.
Proteins in this family are typically between 180 and
214 amino acids in length.
Length = 125
Score = 39.5 bits (93), Expect = 3e-04
Identities = 16/81 (19%), Positives = 33/81 (40%), Gaps = 10/81 (12%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
+K K+ K K+K K +K Q + E EK K ++K + +K
Sbjct: 5 EKTFTDKEVDKAIAKEKAKWEKKQE----------EKKSEAEKLAKMSAEEKAEYELEKL 54
Query: 283 KKKRKKKKNQHHRDVWQKKYT 303
+K+ ++ + + R + +
Sbjct: 55 EKELEELEAELARRELKAEAK 75
Score = 38.0 bits (89), Expect = 0.001
Identities = 15/71 (21%), Positives = 33/71 (46%), Gaps = 13/71 (18%)
Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
K+K K +KK+++KK +K + EEK + + +K +K ++ +
Sbjct: 16 AIAKEKAKWEKKQEEKKSEAEKLAK-------------MSAEEKAEYELEKLEKELEELE 62
Query: 282 RKKKRKKKKNQ 292
+ R++ K +
Sbjct: 63 AELARRELKAE 73
Score = 36.9 bits (86), Expect = 0.003
Identities = 15/68 (22%), Positives = 35/68 (51%), Gaps = 7/68 (10%)
Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
+ K+K + +KK+++KK + +K + + E E EK +K+ ++ + +++
Sbjct: 17 IAKEKAKWEKKQEEKKSEAEKLAKM-------SAEEKAEYELEKLEKELEELEAELARRE 69
Query: 282 RKKKRKKK 289
K + KK
Sbjct: 70 LKAEAKKM 77
>gnl|CDD|240576 cd12932, RRP7_like, RRP7 domain ribosomal RNA-processing protein 7
(Rrp7p), ribosomal RNA-processing protein 7 homolog A
(Rrp7A), and similar proteins. This CD corresponds to
the RRP7 domain of Rrp7p and Rrp7A. Rrp7p is encoded by
YCL031C gene from Saccharomyces cerevisiae. It is an
essential yeast protein involved in pre-rRNA processing
and ribosome assembly, and is speculated to be required
for correct assembly of rpS27 into the pre-ribosomal
particle. Rrp7A, also termed gastric cancer antigen
Zg14, is the Rrp7p homolog mainly found in Metazoans.
The cellular function of Rrp7A remains unclear
currently. Both Rrp7p and Rrp7A harbor an N-terminal RNA
recognition motif (RRM), also termed RBD (RNA binding
domain) or RNP (ribonucleoprotein domain), and a
C-terminal RRP7 domain.
Length = 118
Score = 39.2 bits (92), Expect = 3e-04
Identities = 22/72 (30%), Positives = 43/72 (59%), Gaps = 10/72 (13%)
Query: 231 KKKKKKKKKRKKKKNQHHRDVWI--------NRTSRGEEEEEKKKKKKKKKKKTTKKKK- 281
K+++++K++ K+ +N+ D ++ +T+R E E K K+K+KKKKK + +
Sbjct: 32 KREEEEKEEAKEARNEPDEDGFVTVTRGGRKGKTAREEAVEAKAKEKEKKKKKKKELEDF 91
Query: 282 -RKKKRKKKKNQ 292
R + R+KKK +
Sbjct: 92 YRFQIREKKKEE 103
Score = 32.2 bits (74), Expect = 0.099
Identities = 18/62 (29%), Positives = 32/62 (51%), Gaps = 11/62 (17%)
Query: 218 PGFVMKKKKKRKKKKKKKKKKKRKKKKN----QHHRDVWINRTSRGEEEEEKKKKKKKKK 273
++ + K K+K+KKKKK+K+ ++ Q R + EE E +KK ++ K
Sbjct: 63 GKTAREEAVEAKAKEKEKKKKKKKELEDFYRFQI-------REKKKEELAELRKKFEEDK 115
Query: 274 KK 275
K+
Sbjct: 116 KR 117
>gnl|CDD|218899 pfam06102, DUF947, Domain of unknown function (DUF947). Family of
eukaryotic proteins with unknown function.
Length = 168
Score = 40.0 bits (94), Expect = 4e-04
Identities = 20/70 (28%), Positives = 38/70 (54%), Gaps = 2/70 (2%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTK--KK 280
K K R+++ K+ KK+ K+ + + ++ ++ + +KK + KK K+ K +K
Sbjct: 98 LKNKDREREILKEHKKQEKELIKEGKKPYYLKKSEIKKLVLKKKFDELKKSKQLDKALEK 157
Query: 281 KRKKKRKKKK 290
KRKK K+K
Sbjct: 158 KRKKNAGKEK 167
Score = 32.6 bits (75), Expect = 0.13
Identities = 20/72 (27%), Positives = 35/72 (48%), Gaps = 1/72 (1%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEE-EEKKKKKKKKKKKTTKKKK 281
KK K ++K++ K+ + K + + ++ R E + +EK+ K+ KK KK +
Sbjct: 73 KKTKDSEEKEELKRTLQSMKSRLKTLKNKDREREILKEHKKQEKELIKEGKKPYYLKKSE 132
Query: 282 RKKKRKKKKNQH 293
KK KKK
Sbjct: 133 IKKLVLKKKFDE 144
Score = 28.8 bits (65), Expect = 2.3
Identities = 10/23 (43%), Positives = 15/23 (65%)
Query: 222 MKKKKKRKKKKKKKKKKKRKKKK 244
+KK K+ K +KK+KK K+K
Sbjct: 145 LKKSKQLDKALEKKRKKNAGKEK 167
Score = 28.4 bits (64), Expect = 3.0
Identities = 14/78 (17%), Positives = 35/78 (44%), Gaps = 7/78 (8%)
Query: 224 KKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRK 283
++ K KK K +++++ K + + K K ++++ K KK++++
Sbjct: 65 IEELEKALKKTKDSEEKEELKRTLQS-------MKSRLKTLKNKDREREILKEHKKQEKE 117
Query: 284 KKRKKKKNQHHRDVWQKK 301
++ KK + + KK
Sbjct: 118 LIKEGKKPYYLKKSEIKK 135
Score = 28.4 bits (64), Expect = 3.3
Identities = 16/62 (25%), Positives = 26/62 (41%), Gaps = 19/62 (30%)
Query: 220 FVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKK 279
+ +KK + +K KKK + +K K+ K +KK+KK K+
Sbjct: 126 YYLKKSEIKKLVLKKKFDELKKSKQ-------------------LDKALEKKRKKNAGKE 166
Query: 280 KK 281
KK
Sbjct: 167 KK 168
Score = 27.2 bits (61), Expect = 8.2
Identities = 11/63 (17%), Positives = 31/63 (49%)
Query: 230 KKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKK 289
+K+ ++ +K KK K+ ++ + + K K ++++ + KK++K+ K+
Sbjct: 62 EKEIEELEKALKKTKDSEEKEELKRTLQSMKSRLKTLKNKDREREILKEHKKQEKELIKE 121
Query: 290 KNQ 292
+
Sbjct: 122 GKK 124
>gnl|CDD|165177 PHA02834, PHA02834, chemokine receptor-like protein; Provisional.
Length = 323
Score = 41.0 bits (96), Expect = 5e-04
Identities = 46/157 (29%), Positives = 77/157 (49%), Gaps = 15/157 (9%)
Query: 46 MLMCFIIITAILGNLLVIISVIKHRKLRIITNYFVVSLAFADLLVALCVMPFNAIVSLTD 105
+ + I ++GN+LVI +I R + ++ Y + ++A +DL++ PF I + +
Sbjct: 33 VFYILLFIFGLIGNVLVIAVLIVKRFMFVVDVY-LFNIAMSDLMLVFS-FPF-IIHNDLN 89
Query: 106 EWYFGYFMCDVWNS--FDVYFSTASILHLCCISVDRYYAIVKPLQYPIIMNQNTVLLMLS 163
EW FG FMC + F +FS + L IS+DRY +V + +VLL ++
Sbjct: 90 EWIFGEFMCKLVLGVYFVGFFSNMFFVTL--ISIDRYILVVNATKIKNKSISLSVLLSVA 147
Query: 164 NVWILPGIISFTPIMLGWYTTPDHKQYRKHHPNVCIF 200
W+ I+S P M+ +Y D+ K CIF
Sbjct: 148 -AWVCSVILS-MPAMVLYYV--DNTDNLKQ----CIF 176
>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
Length = 413
Score = 40.9 bits (96), Expect = 5e-04
Identities = 19/78 (24%), Positives = 35/78 (44%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
++ K KK +K+ KN + + + +E++E K + + K T KK +
Sbjct: 41 TFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLGFKTPKKSK 100
Query: 283 KKKRKKKKNQHHRDVWQK 300
K K+K K + + DV
Sbjct: 101 KTKKKPPKPKPNEDVDNA 118
Score = 37.8 bits (88), Expect = 0.005
Identities = 17/65 (26%), Positives = 30/65 (46%), Gaps = 7/65 (10%)
Query: 226 KKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKK 285
+++ K KK K ++ + +++ EKKKKKKK+KK+ + + K
Sbjct: 40 STFSEEENKVATTSTKKDKK-------EDKNNESKKKSEKKKKKKKEKKEPKSEGETKLG 92
Query: 286 RKKKK 290
K K
Sbjct: 93 FKTPK 97
Score = 33.9 bits (78), Expect = 0.084
Identities = 27/78 (34%), Positives = 33/78 (42%), Gaps = 13/78 (16%)
Query: 221 VMKKKKKRKK----KKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKT 276
KK KK K KKK +KKKK+KK+K S GE + K KK KK K
Sbjct: 53 STKKDKKEDKNNESKKKSEKKKKKKKEKK--------EPKSEGETKLGFKTPKKSKKTKK 104
Query: 277 TK-KKKRKKKRKKKKNQH 293
K K + N+
Sbjct: 105 KPPKPKPNEDVDNAFNKI 122
Score = 32.4 bits (74), Expect = 0.27
Identities = 19/77 (24%), Positives = 34/77 (44%), Gaps = 14/77 (18%)
Query: 225 KKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKK 284
+++ K KK K++ K N E +++ +KKKKKKK+K K + +
Sbjct: 44 EEENKVATTSTKKDKKEDKNN--------------ESKKKSEKKKKKKKEKKEPKSEGET 89
Query: 285 KRKKKKNQHHRDVWQKK 301
K K + + +K
Sbjct: 90 KLGFKTPKKSKKTKKKP 106
Score = 29.7 bits (67), Expect = 2.1
Identities = 18/70 (25%), Positives = 30/70 (42%), Gaps = 8/70 (11%)
Query: 231 KKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKK--------KKKTTKKKKR 282
+K K + ++ S E + KK K KKK+ KKKK+
Sbjct: 17 QKSKLQPISYIYSNVLVLSKEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKK 76
Query: 283 KKKRKKKKNQ 292
KK++K+ K++
Sbjct: 77 KKEKKEPKSE 86
Score = 28.1 bits (63), Expect = 6.8
Identities = 17/69 (24%), Positives = 31/69 (44%)
Query: 224 KKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRK 283
+K K + K+ N+ + +++KK+ K + KK ++KKK+K
Sbjct: 17 QKSKLQPISYIYSNVLVLSKEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKK 76
Query: 284 KKRKKKKNQ 292
KK KK+
Sbjct: 77 KKEKKEPKS 85
>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207). This
family is found in eukaryotes; it has several conserved
tryptophan residues. The function is not known.
Length = 261
Score = 40.1 bits (94), Expect = 7e-04
Identities = 15/86 (17%), Positives = 46/86 (53%), Gaps = 7/86 (8%)
Query: 224 KKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKK-------KKKKKKKKT 276
++K ++ K++ K K++ ++ + R +EE KK KK K++++
Sbjct: 139 RQKAQQAAKQRTPKHKKEAAESASSSLSGSAKPERNVSQEEAKKRLQEWELKKLKQQQQK 198
Query: 277 TKKKKRKKKRKKKKNQHHRDVWQKKY 302
++++RK+++K+++ + + ++ +
Sbjct: 199 REEERRKQRKKQQEEEERKQKAEEAW 224
Score = 34.7 bits (80), Expect = 0.044
Identities = 18/64 (28%), Positives = 39/64 (60%), Gaps = 16/64 (25%)
Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
+KK K++++K++++++K+RKK++ EEEE+K+K ++ +K K K
Sbjct: 189 LKKLKQQQQKREEERRKQRKKQQ----------------EEEERKQKAEEAWQKWMKNVK 232
Query: 282 RKKK 285
++ K
Sbjct: 233 QRPK 236
Score = 33.5 bits (77), Expect = 0.10
Identities = 9/83 (10%), Positives = 40/83 (48%), Gaps = 4/83 (4%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKK----KKKKKTTK 278
++K++++++K++++ + +++ + + W + ++ ++ K KK +
Sbjct: 108 LEEKQKQEREKEREEAELRQRLAKEKYEEWCRQKAQQAAKQRTPKHKKEAAESASSSLSG 167
Query: 279 KKKRKKKRKKKKNQHHRDVWQKK 301
K ++ +++ + W+ K
Sbjct: 168 SAKPERNVSQEEAKKRLQEWELK 190
Score = 33.2 bits (76), Expect = 0.13
Identities = 17/79 (21%), Positives = 41/79 (51%), Gaps = 17/79 (21%)
Query: 216 WIPGFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKK 275
W+ +++KK +K ++K+K++R+K++ EE E +++ K+K ++
Sbjct: 92 WLSAKQAQRQKKLQKLLEEKQKQEREKER---------------EEAELRQRLAKEKYEE 136
Query: 276 TTKKKKR--KKKRKKKKNQ 292
++K + K+R K +
Sbjct: 137 WCRQKAQQAAKQRTPKHKK 155
Score = 28.2 bits (63), Expect = 5.3
Identities = 11/69 (15%), Positives = 37/69 (53%), Gaps = 1/69 (1%)
Query: 234 KKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKK-KKNQ 292
+ K+ + +++ Q + W++ +++ +K ++K+K++ K+++ + R++ K +
Sbjct: 74 RSLKEVKLERQAQEAYENWLSAKQAQRQKKLQKLLEEKQKQEREKEREEAELRQRLAKEK 133
Query: 293 HHRDVWQKK 301
+ QK
Sbjct: 134 YEEWCRQKA 142
>gnl|CDD|172341 PRK13808, PRK13808, adenylate kinase; Provisional.
Length = 333
Score = 39.9 bits (93), Expect = 0.001
Identities = 19/68 (27%), Positives = 25/68 (36%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
K +K K K K+ KK V + ++ + KK KK KK K
Sbjct: 208 AKSGAKKASAKAKSAAKKVSKKKAAKTAVSAKKAAKTAAKAAKKAKKTAKKALKKAAKAV 267
Query: 283 KKKRKKKK 290
KK KK
Sbjct: 268 KKAAKKAA 275
Score = 37.2 bits (86), Expect = 0.007
Identities = 19/69 (27%), Positives = 22/69 (31%), Gaps = 18/69 (26%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
K K+ K K KK KKK K KK KT K +
Sbjct: 209 KSGAKKASAKAKSAAKKVSKKKA------------------AKTAVSAKKAAKTAAKAAK 250
Query: 283 KKKRKKKKN 291
K K+ KK
Sbjct: 251 KAKKTAKKA 259
Score = 36.0 bits (83), Expect = 0.018
Identities = 21/68 (30%), Positives = 26/68 (38%), Gaps = 3/68 (4%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
K K KK KK K K K ++ +++ KK K K T K KR
Sbjct: 262 KAAKAVKKAAKKAAKAAAKAAKGAA---KATKGKAKAKKKAGKKAAAGSKAKATAKAPKR 318
Query: 283 KKKRKKKK 290
K KK K
Sbjct: 319 GAKGKKAK 326
Score = 35.2 bits (81), Expect = 0.031
Identities = 20/70 (28%), Positives = 25/70 (35%), Gaps = 10/70 (14%)
Query: 221 VMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKK 280
K KK KK KK K +K K + ++ + K K K K KKK
Sbjct: 249 AKKAKKTAKKALKKAAKAVKKAAK----------KAAKAAAKAAKGAAKATKGKAKAKKK 298
Query: 281 KRKKKRKKKK 290
KK K
Sbjct: 299 AGKKAAAGSK 308
Score = 35.2 bits (81), Expect = 0.032
Identities = 15/58 (25%), Positives = 24/58 (41%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKK 280
K + K K K + K K + + ++ + K+ K KK KK TKK+
Sbjct: 275 AKAAAKAAKGAAKATKGKAKAKKKAGKKAAAGSKAKATAKAPKRGAKGKKAKKVTKKR 332
Score = 35.2 bits (81), Expect = 0.033
Identities = 17/73 (23%), Positives = 25/73 (34%), Gaps = 8/73 (10%)
Query: 224 KKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRK 283
KK K K KK K+ KK + + ++ KK K K K
Sbjct: 238 AKKAAKTAAKAAKKAKKTAKKAL--------KKAAKAVKKAAKKAAKAAAKAAKGAAKAT 289
Query: 284 KKRKKKKNQHHRD 296
K + K K + +
Sbjct: 290 KGKAKAKKKAGKK 302
Score = 34.1 bits (78), Expect = 0.078
Identities = 16/61 (26%), Positives = 23/61 (37%), Gaps = 2/61 (3%)
Query: 224 KKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKK--KK 281
KK + K K K K K + + + + + K K+ K K KK KK
Sbjct: 272 KKAAKAAAKAAKGAAKATKGKAKAKKKAGKKAAAGSKAKATAKAPKRGAKGKKAKKVTKK 331
Query: 282 R 282
R
Sbjct: 332 R 332
Score = 32.6 bits (74), Expect = 0.26
Identities = 20/70 (28%), Positives = 27/70 (38%), Gaps = 8/70 (11%)
Query: 221 VMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKK 280
V KK + K KK KK KK + + ++ ++ KK K K K
Sbjct: 236 VSAKKAAKTAAKAAKKAKKTAKKA--------LKKAAKAVKKAAKKAAKAAAKAAKGAAK 287
Query: 281 KRKKKRKKKK 290
K K K KK
Sbjct: 288 ATKGKAKAKK 297
>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381). This
domain is functionally uncharacterized. This domain is
found in eukaryotes. This presumed domain is typically
between 156 to 174 amino acids in length. This domain is
found associated with pfam07780, pfam01728.
Length = 154
Score = 38.4 bits (90), Expect = 0.001
Identities = 12/73 (16%), Positives = 37/73 (50%), Gaps = 12/73 (16%)
Query: 221 VMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKK 280
+ KK+++++++++ + + ++ Q +E EK+ K K++K+ ++
Sbjct: 94 LGLDKKEKEEEEEEEVEVEELDEEEQI------------DELLEKELAKLKREKRRENER 141
Query: 281 KRKKKRKKKKNQH 293
K+K+ K++
Sbjct: 142 KQKEILKEQMKML 154
Score = 32.3 bits (74), Expect = 0.16
Identities = 17/76 (22%), Positives = 37/76 (48%), Gaps = 15/76 (19%)
Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKK--KKKKKTTKK 279
+K +KK +K KK+K ++++ + E EE ++++ + +K K
Sbjct: 84 LKWRKKVRKLLGLDKKEKEEEEEEE-------------VEVEELDEEEQIDELLEKELAK 130
Query: 280 KKRKKKRKKKKNQHHR 295
KR+K+R+ ++ Q
Sbjct: 131 LKREKRRENERKQKEI 146
>gnl|CDD|221857 pfam12923, RRP7, Ribosomal RNA-processing protein 7 (RRP7). RRP7
is an essential protein in yeast that is involved in
pre-rRNA processing and ribosome assembly. It is
speculated to be required for correct assembly of rpS27
into the pre-ribosomal particle.
Length = 131
Score = 38.0 bits (89), Expect = 0.001
Identities = 22/71 (30%), Positives = 36/71 (50%), Gaps = 11/71 (15%)
Query: 218 PGFVMKKKKKRKKKKKKKKKKKRKKKKN----QHHRDVWINRTSRGEEEEEKKKKKKKKK 273
G K + ++K K+KKKKK+K+ +N Q R + EE E +KK ++ K
Sbjct: 64 AGASRNKAAEERRKLKEKKKKKKKELENFYRFQI-------REKKKEELAELRKKFEEDK 116
Query: 274 KKTTKKKKRKK 284
K+ + K +K
Sbjct: 117 KRIEQLKAARK 127
Score = 33.3 bits (77), Expect = 0.046
Identities = 15/60 (25%), Positives = 34/60 (56%)
Query: 232 KKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKN 291
K++++ K+ K +++ D W T G + + + K +++ K+K+KKK+K+ +N
Sbjct: 32 KREEEAKEEAKARSEPDEDGWTTVTRGGRKRKAGASRNKAAEERRKLKEKKKKKKKELEN 91
>gnl|CDD|235322 PRK04950, PRK04950, ProP expression regulator; Provisional.
Length = 213
Score = 39.1 bits (92), Expect = 0.001
Identities = 13/64 (20%), Positives = 29/64 (45%), Gaps = 14/64 (21%)
Query: 229 KKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKK 288
K K + ++ +++ KK R E++K ++++K K +K++K R +
Sbjct: 107 KAKVQAQRAEQQAKK--------------REAAGEKEKAPRRERKPKPKAPRKKRKPRAQ 152
Query: 289 KKNQ 292
K
Sbjct: 153 KPEP 156
Score = 38.8 bits (91), Expect = 0.001
Identities = 17/73 (23%), Positives = 33/73 (45%), Gaps = 17/73 (23%)
Query: 225 KKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKK 284
K K + ++ +++ KKR+ E+EK ++++K K +KKRK
Sbjct: 107 KAKVQAQRAEQQAKKREA-----------------AGEKEKAPRRERKPKPKAPRKKRKP 149
Query: 285 KRKKKKNQHHRDV 297
+ +K + QH
Sbjct: 150 RAQKPEPQHTPVS 162
Score = 37.2 bits (87), Expect = 0.005
Identities = 12/68 (17%), Positives = 28/68 (41%), Gaps = 15/68 (22%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
K + +R +++ KK++ +K+K E+K K K +KK + ++
Sbjct: 109 KVQAQRAEQQAKKREAAGEKEK---------------APRRERKPKPKAPRKKRKPRAQK 153
Query: 283 KKKRKKKK 290
+ +
Sbjct: 154 PEPQHTPV 161
Score = 36.4 bits (85), Expect = 0.009
Identities = 11/71 (15%), Positives = 28/71 (39%), Gaps = 17/71 (23%)
Query: 221 VMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKK 280
V ++ +++ KK++ +K K + + K K K +KK+ + +
Sbjct: 110 VQAQRAEQQAKKREAAGEKEKAPRRER-----------------KPKPKAPRKKRKPRAQ 152
Query: 281 KRKKKRKKKKN 291
K + + +
Sbjct: 153 KPEPQHTPVSD 163
Score = 34.9 bits (81), Expect = 0.029
Identities = 11/59 (18%), Positives = 22/59 (37%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
+ KK+ +K+K ++ +K K + R R + E + + K K
Sbjct: 118 QAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQKPEPQHTPVSDISELTVGQAVKVK 176
Score = 31.8 bits (73), Expect = 0.28
Identities = 7/36 (19%), Positives = 18/36 (50%)
Query: 265 KKKKKKKKKKKTTKKKKRKKKRKKKKNQHHRDVWQK 300
+++ KK++ K+K +++RK K + +
Sbjct: 115 AEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPR 150
Score = 31.4 bits (72), Expect = 0.43
Identities = 6/37 (16%), Positives = 19/37 (51%)
Query: 265 KKKKKKKKKKKTTKKKKRKKKRKKKKNQHHRDVWQKK 301
+++ KK++ +K R++++ K K + + +
Sbjct: 116 EQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQ 152
Score = 28.0 bits (63), Expect = 6.0
Identities = 7/34 (20%), Positives = 19/34 (55%)
Query: 262 EEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQHHR 295
EE K K + ++ ++ KK++ +++K + +
Sbjct: 104 EEAKAKVQAQRAEQQAKKREAAGEKEKAPRRERK 137
>gnl|CDD|177089 CHL00189, infB, translation initiation factor 2; Provisional.
Length = 742
Score = 39.4 bits (92), Expect = 0.002
Identities = 20/68 (29%), Positives = 26/68 (38%)
Query: 225 KKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKK 284
KK + K+KKK KKK + D N +K K K + KKK
Sbjct: 73 KKTKLKQKKKIKKKLHIDDDYDNFFDSKNNSKQFAGPLAISLMRKPKPKTEKLKKKITVN 132
Query: 285 KRKKKKNQ 292
K KK +
Sbjct: 133 KSTNKKKK 140
Score = 38.7 bits (90), Expect = 0.003
Identities = 20/69 (28%), Positives = 26/69 (37%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
KK K ++KKK KKK N + + +K K K +K K
Sbjct: 73 KKTKLKQKKKIKKKLHIDDDYDNFFDSKNNSKQFAGPLAISLMRKPKPKTEKLKKKITVN 132
Query: 283 KKKRKKKKN 291
K KKKK
Sbjct: 133 KSTNKKKKK 141
Score = 37.5 bits (87), Expect = 0.009
Identities = 22/69 (31%), Positives = 30/69 (43%), Gaps = 2/69 (2%)
Query: 226 KKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSR--GEEEEEKKKKKKKKKKKTTKKKKRK 283
KK K K+KKK KKK + + N + + G +K K K +K KK
Sbjct: 73 KKTKLKQKKKIKKKLHIDDDYDNFFDSKNNSKQFAGPLAISLMRKPKPKTEKLKKKITVN 132
Query: 284 KKRKKKKNQ 292
K KKK +
Sbjct: 133 KSTNKKKKK 141
Score = 36.0 bits (83), Expect = 0.025
Identities = 25/75 (33%), Positives = 32/75 (42%), Gaps = 10/75 (13%)
Query: 224 KKKKRKKKKKKKKK----------KKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKK 273
KK K K+KKK KKK K Q + I+ + + + EK KKK
Sbjct: 73 KKTKLKQKKKIKKKLHIDDDYDNFFDSKNNSKQFAGPLAISLMRKPKPKTEKLKKKITVN 132
Query: 274 KKTTKKKKRKKKRKK 288
K T KKKK+ K
Sbjct: 133 KSTNKKKKKVLSSKD 147
Score = 28.6 bits (64), Expect = 5.5
Identities = 21/70 (30%), Positives = 33/70 (47%), Gaps = 2/70 (2%)
Query: 227 KRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKR 286
K K K + + K + + D IN+ + +++ KK K K +KK K K+K+
Sbjct: 24 KNLKHSSYKIRLESDIKDSLLNLD--INKKLHEKLDKKNKKFNKTDDLKDSKKTKLKQKK 81
Query: 287 KKKKNQHHRD 296
K KK H D
Sbjct: 82 KIKKKLHIDD 91
>gnl|CDD|235401 PRK05306, infB, translation initiation factor IF-2; Validated.
Length = 746
Score = 39.4 bits (93), Expect = 0.002
Identities = 15/75 (20%), Positives = 41/75 (54%), Gaps = 2/75 (2%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
+ + ++ K KK +KKK + + + +RG + + K +++++ + +++++
Sbjct: 104 ARPAEAAARRPKAKKAAKKKKGPKPKKKKPKRKAARGGKRGKGGKGRRRRRGR--RRRRK 161
Query: 283 KKKRKKKKNQHHRDV 297
KKK++K + R+V
Sbjct: 162 KKKKQKPTEKIPREV 176
Score = 31.0 bits (71), Expect = 0.84
Identities = 14/73 (19%), Positives = 31/73 (42%), Gaps = 5/73 (6%)
Query: 222 MKKKKKRKKKKKKKKKKKR--KKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKK 279
M K + + K+ K +K K + + S EEEE +K++ K++ + K
Sbjct: 1 MSKVRVYELAKELGVSSKELLEKLKEL---GIEVKSHSSTVEEEEARKEEAKREAEEEAK 57
Query: 280 KKRKKKRKKKKNQ 292
+ ++ + +
Sbjct: 58 AEAEEAAAAEAEE 70
Score = 28.3 bits (64), Expect = 5.6
Identities = 11/34 (32%), Positives = 24/34 (70%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRT 256
K +++R+ +++++KKKK++K + R+V I T
Sbjct: 148 KGRRRRRGRRRRRKKKKKQKPTEKIPREVVIPET 181
>gnl|CDD|220267 pfam09494, Slx4, Slx4 endonuclease. The Slx4 protein is a
heteromeric structure-specific endonuclease found in
fungi. Slx4 with Slx1 acts as a nuclease on branched DNA
substrates, particularly simple-Y, 5'-flap, or
replication fork structures by cleaving the strand
bearing the 5' non-homologous arm at the branch junction
and thus generating ligatable nicked products from
5'-flap or replication fork substrates.
Length = 627
Score = 38.8 bits (90), Expect = 0.003
Identities = 15/75 (20%), Positives = 31/75 (41%), Gaps = 5/75 (6%)
Query: 226 KKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKK 285
++ ++ + ++ R + N + + KKKK KK K K+ KRK K
Sbjct: 37 EEENQRLQLRQVISRFTSFESDDQANSGNVSG-----KRVPKKKKIKKPKLRKRTKRKNK 91
Query: 286 RKKKKNQHHRDVWQK 300
+ K + + ++
Sbjct: 92 KIKSLTAFNEENFET 106
Score = 31.5 bits (71), Expect = 0.63
Identities = 20/63 (31%), Positives = 31/63 (49%), Gaps = 4/63 (6%)
Query: 219 GFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTK 278
G V K+ +KKK KK K +KR K+KN+ + + T+ EE E + + K
Sbjct: 64 GNVSGKRVPKKKKIKKPKLRKRTKRKNKKIK----SLTAFNEENFETDRAPSLLSYLSGK 119
Query: 279 KKK 281
+ K
Sbjct: 120 QSK 122
>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
subunit. This is a family of proteins which are
subunits of the eukaryotic translation initiation factor
3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
cerevisiae protein eIF3j (HCR1) has been shown to be
required for processing of 20S pre-rRNA and binds to 18S
rRNA and eIF3 subunits Rpg1p and Prt1p.
Length = 242
Score = 38.1 bits (89), Expect = 0.003
Identities = 17/71 (23%), Positives = 35/71 (49%), Gaps = 17/71 (23%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKK----TTK 278
+K++++ K K K KK K K + E+E+ K++K++K ++ T +
Sbjct: 44 EKEEEKAKVAAKAKAKKALKAKIE-------------EKEKAKREKEEKGLRELEEDTPE 90
Query: 279 KKKRKKKRKKK 289
+ +K R +K
Sbjct: 91 DELAEKLRLRK 101
Score = 37.3 bits (87), Expect = 0.005
Identities = 13/40 (32%), Positives = 24/40 (60%), Gaps = 6/40 (15%)
Query: 260 EEEEEKKKKKK------KKKKKTTKKKKRKKKRKKKKNQH 293
EEE+E+K+++K K KK K K +K++ K++ +
Sbjct: 39 EEEDEEKEEEKAKVAAKAKAKKALKAKIEEKEKAKREKEE 78
Score = 35.4 bits (82), Expect = 0.021
Identities = 10/39 (25%), Positives = 20/39 (51%)
Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQHHRDVW 298
+ K K KK K K +K+K K+++++K + +
Sbjct: 50 AKVAAKAKAKKALKAKIEEKEKAKREKEEKGLRELEEDT 88
Score = 35.0 bits (81), Expect = 0.031
Identities = 9/31 (29%), Positives = 18/31 (58%)
Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRKKKK 290
+ + K K KK K ++K++ K+ K++K
Sbjct: 49 KAKVAAKAKAKKALKAKIEEKEKAKREKEEK 79
Score = 34.6 bits (80), Expect = 0.040
Identities = 11/31 (35%), Positives = 19/31 (61%)
Query: 262 EEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQ 292
+EE+ ++K+++K K K K KK K K +
Sbjct: 38 DEEEDEEKEEEKAKVAAKAKAKKALKAKIEE 68
Score = 31.6 bits (72), Expect = 0.36
Identities = 9/32 (28%), Positives = 11/32 (34%)
Query: 261 EEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQ 292
+ EK K KKKK K K +
Sbjct: 195 LKAEKAAKGGKKKKGKAKAKLNVGGANDDDDD 226
Score = 31.6 bits (72), Expect = 0.37
Identities = 10/38 (26%), Positives = 16/38 (42%)
Query: 255 RTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQ 292
+ K KK K K + +K KR+K+ K +
Sbjct: 47 EEKAKVAAKAKAKKALKAKIEEKEKAKREKEEKGLREL 84
Score = 30.8 bits (70), Expect = 0.62
Identities = 9/36 (25%), Positives = 11/36 (30%)
Query: 261 EEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQHHRD 296
E+ K KKKK K K + D
Sbjct: 197 AEKAAKGGKKKKGKAKAKLNVGGANDDDDDYDGYDD 232
Score = 30.8 bits (70), Expect = 0.72
Identities = 12/65 (18%), Positives = 27/65 (41%), Gaps = 21/65 (32%)
Query: 224 KKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRK 283
+ ++++++K K K + KK K K ++K+K ++K+ K
Sbjct: 41 EDEEKEEEKAKVAAKAKAKK---------------------ALKAKIEEKEKAKREKEEK 79
Query: 284 KKRKK 288
R+
Sbjct: 80 GLREL 84
Score = 29.6 bits (67), Expect = 1.6
Identities = 10/36 (27%), Positives = 14/36 (38%)
Query: 250 DVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKK 285
+V IN + E+ + KKKK K K
Sbjct: 188 NVLINEKLKAEKAAKGGKKKKGKAKAKLNVGGANDD 223
Score = 29.2 bits (66), Expect = 2.3
Identities = 11/23 (47%), Positives = 13/23 (56%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKN 245
K K ++ K KKKK K K K N
Sbjct: 194 KLKAEKAAKGGKKKKGKAKAKLN 216
Score = 29.2 bits (66), Expect = 2.5
Identities = 10/25 (40%), Positives = 10/25 (40%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQH 247
K K KKKK K K K N
Sbjct: 199 KAAKGGKKKKGKAKAKLNVGGANDD 223
Score = 28.9 bits (65), Expect = 3.0
Identities = 12/52 (23%), Positives = 24/52 (46%), Gaps = 9/52 (17%)
Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKK 273
K K + ++K+K K++K +K + D E+E +K + +K
Sbjct: 59 KKALKAKIEEKEKAKREKEEKGLRELEED---------TPEDELAEKLRLRK 101
Score = 28.9 bits (65), Expect = 3.1
Identities = 11/40 (27%), Positives = 13/40 (32%)
Query: 264 EKKKKKKKKKKKTTKKKKRKKKRKKKKNQHHRDVWQKKYT 303
EK K +K K KK K K K D +
Sbjct: 193 EKLKAEKAAKGGKKKKGKAKAKLNVGGANDDDDDYDGYDD 232
Score = 28.9 bits (65), Expect = 3.1
Identities = 8/28 (28%), Positives = 9/28 (32%)
Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRK 287
E+ K KKKK K K
Sbjct: 197 AEKAAKGGKKKKGKAKAKLNVGGANDDD 224
Score = 28.1 bits (63), Expect = 5.8
Identities = 10/27 (37%), Positives = 15/27 (55%)
Query: 221 VMKKKKKRKKKKKKKKKKKRKKKKNQH 247
++ +K K +K K KKKK K K +
Sbjct: 190 LINEKLKAEKAAKGGKKKKGKAKAKLN 216
Score = 27.7 bits (62), Expect = 6.8
Identities = 8/31 (25%), Positives = 13/31 (41%)
Query: 261 EEEEKKKKKKKKKKKTTKKKKRKKKRKKKKN 291
E+ + +K K KKK K K + +
Sbjct: 193 EKLKAEKAAKGGKKKKGKAKAKLNVGGANDD 223
>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain. This is a family of proteins of
approximately 300 residues, found in plants and
vertebrates. They contain a highly conserved DDRGK
motif.
Length = 189
Score = 37.4 bits (87), Expect = 0.003
Identities = 17/65 (26%), Positives = 40/65 (61%), Gaps = 9/65 (13%)
Query: 226 KKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKK 285
KK KK+ K ++K+ +++ + + EE EE+KK ++K++ + ++++ +++
Sbjct: 1 KKIGAKKRAKLEEKQARRQQREAEE---------EEREERKKLEEKREGERKEEEELEEE 51
Query: 286 RKKKK 290
R+KKK
Sbjct: 52 REKKK 56
Score = 37.4 bits (87), Expect = 0.004
Identities = 15/69 (21%), Positives = 48/69 (69%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
KK+ K ++K+ ++++++ ++++ + + + R +EEEE +++++KKK++ +K++
Sbjct: 6 KKRAKLEEKQARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEERKERE 65
Query: 283 KKKRKKKKN 291
++ RK+++
Sbjct: 66 EQARKEQEE 74
Score = 35.8 bits (83), Expect = 0.013
Identities = 15/75 (20%), Positives = 43/75 (57%), Gaps = 2/75 (2%)
Query: 225 KKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK--R 282
KK KK+ K ++K+ ++++ + + R E+ E ++K++++ +++ KKK+
Sbjct: 1 KKIGAKKRAKLEEKQARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEE 60
Query: 283 KKKRKKKKNQHHRDV 297
+K+R+++ + +
Sbjct: 61 RKEREEQARKEQEEY 75
Score = 35.4 bits (82), Expect = 0.014
Identities = 16/69 (23%), Positives = 38/69 (55%), Gaps = 10/69 (14%)
Query: 224 KKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRK 283
KK KK+ K ++K+ R++++ E EE KK ++K++ ++ +++ +
Sbjct: 1 KKIGAKKRAKLEEKQARRQQREAEEE----------EREERKKLEEKREGERKEEEELEE 50
Query: 284 KKRKKKKNQ 292
++ KKK+ +
Sbjct: 51 EREKKKEEE 59
Score = 32.7 bits (75), Expect = 0.15
Identities = 19/79 (24%), Positives = 46/79 (58%), Gaps = 8/79 (10%)
Query: 222 MKKKKKRKKKKKKKKKKKR--------KKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKK 273
+ KK+ K ++K+ ++++R ++KK + R+ EEE EKKK+++++K
Sbjct: 3 IGAKKRAKLEEKQARRQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEERK 62
Query: 274 KKTTKKKKRKKKRKKKKNQ 292
++ + +K +++ +K K+
Sbjct: 63 EREEQARKEQEEYEKLKSS 81
>gnl|CDD|222665 pfam14303, NAM-associated, No apical meristem-associated C-terminal
domain. This domain is found in a number of different
types of plant proteins including NAM-like proteins.
Length = 147
Score = 37.0 bits (86), Expect = 0.003
Identities = 20/79 (25%), Positives = 37/79 (46%)
Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
+ KK KKKKK+ N+ + D T+ + E +KK K+K ++ K KK
Sbjct: 22 RSELKKASKKKKKRSNSSPGSTSNEENEDEDDESTAESKRPEGRKKAKEKLRRDKLKAKK 81
Query: 282 RKKKRKKKKNQHHRDVWQK 300
+ +++K+K + +
Sbjct: 82 EEAEKEKEKEERFMKALAE 100
Score = 32.7 bits (75), Expect = 0.089
Identities = 19/63 (30%), Positives = 33/63 (52%), Gaps = 8/63 (12%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
K+K +R K K KK++ +++K+K + E EK++ + +KKK K K
Sbjct: 69 KEKLRRDKLKAKKEEAEKEKEKEERFMKAL--------AEAEKERAELEKKKAEAKLMKE 120
Query: 283 KKK 285
+KK
Sbjct: 121 EKK 123
Score = 32.3 bits (74), Expect = 0.12
Identities = 19/63 (30%), Positives = 33/63 (52%), Gaps = 15/63 (23%)
Query: 228 RKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRK 287
RKK K+K ++ K K KK EE E++K+K+++ K + +K + + +
Sbjct: 65 RKKAKEKLRRDKLKAKK---------------EEAEKEKEKEERFMKALAEAEKERAELE 109
Query: 288 KKK 290
KKK
Sbjct: 110 KKK 112
Score = 30.0 bits (68), Expect = 0.84
Identities = 18/67 (26%), Positives = 33/67 (49%), Gaps = 13/67 (19%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
K K+K ++ K K KK++ +K+K + EE K + +K++ +KKK
Sbjct: 67 KAKEKLRRDKLKAKKEEAEKEKEK-------------EERFMKALAEAEKERAELEKKKA 113
Query: 283 KKKRKKK 289
+ K K+
Sbjct: 114 EAKLMKE 120
Score = 29.3 bits (66), Expect = 1.6
Identities = 19/65 (29%), Positives = 34/65 (52%), Gaps = 10/65 (15%)
Query: 225 KKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKK 284
+ KR + +KK K+K R+ K + + E E+EK+K+++ K +K+R +
Sbjct: 58 ESKRPEGRKKAKEKLRRDKL----------KAKKEEAEKEKEKEERFMKALAEAEKERAE 107
Query: 285 KRKKK 289
KKK
Sbjct: 108 LEKKK 112
Score = 28.9 bits (65), Expect = 1.9
Identities = 19/72 (26%), Positives = 34/72 (47%), Gaps = 19/72 (26%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKK----KKKKKTTK 278
KK K++ ++ K K KK+ +K+ +E+EE+ K K++ + K
Sbjct: 66 KKAKEKLRRDKLKAKKEEAEKE---------------KEKEERFMKALAEAEKERAELEK 110
Query: 279 KKKRKKKRKKKK 290
KK K K++K
Sbjct: 111 KKAEAKLMKEEK 122
>gnl|CDD|219868 pfam08496, Peptidase_S49_N, Peptidase family S49 N-terminal. This
domain is found to the N-terminus of bacterial signal
peptidases of the S49 family (pfam01343).
Length = 154
Score = 37.1 bits (87), Expect = 0.004
Identities = 19/67 (28%), Positives = 34/67 (50%), Gaps = 5/67 (7%)
Query: 229 KKKKKKKKKKKRKKKKNQHHRDVWINRTSRGE---EEEEKKKKKKKKKKKTTKKKKRKKK 285
+KKK KK + N+ ++D + + ++E K +K +KK + K K KKK
Sbjct: 32 AQKKKGKKGELEITDLNEEYKD--LKESLEAALLDKKELKAWEKAEKKAEKAKAKAEKKK 89
Query: 286 RKKKKNQ 292
KK++ +
Sbjct: 90 AKKEEPK 96
Score = 32.9 bits (76), Expect = 0.079
Identities = 14/63 (22%), Positives = 22/63 (34%), Gaps = 25/63 (39%)
Query: 220 FVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKK 279
++ KK+ + +K +KK +K K K KKK K +
Sbjct: 61 ALLDKKELKAWEKAEKKAEKAKAKAE-------------------------KKKAKKEEP 95
Query: 280 KKR 282
K R
Sbjct: 96 KPR 98
Score = 30.6 bits (70), Expect = 0.46
Identities = 11/27 (40%), Positives = 16/27 (59%)
Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKR 286
E+ E+K +K K K +K KK+ K R
Sbjct: 72 EKAEKKAEKAKAKAEKKKAKKEEPKPR 98
>gnl|CDD|219572 pfam07780, Spb1_C, Spb1 C-terminal domain. This presumed domain is
found at the C-terminus of a family of FtsJ-like
methyltransferases. Members of this family are involved
in 60S ribosomal biogenesis.
Length = 212
Score = 37.7 bits (88), Expect = 0.004
Identities = 25/71 (35%), Positives = 37/71 (52%), Gaps = 6/71 (8%)
Query: 226 KKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEE---KKKKKKKKKKKT---TKK 279
K RKK + K+ +K KKK D I+ + ++ ++ K KKKKKK+KKT KK
Sbjct: 117 KARKKMRALKRLEKAKKKAEAIAEDEDISEREKAKQIKKLYKKAKKKKKKRKKTYVVAKK 176
Query: 280 KKRKKKRKKKK 290
+ K + K
Sbjct: 177 GNKGKAGRPKG 187
Score = 33.4 bits (77), Expect = 0.087
Identities = 24/77 (31%), Positives = 29/77 (37%), Gaps = 27/77 (35%)
Query: 221 VMKKKKKRKKKKKKK----KKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKK-- 274
+ KK KK+KKK+KK KK + K + K K K K
Sbjct: 156 LYKKAKKKKKKRKKTYVVAKKGNKGKAG--------------------RPKGVKGKYKVV 195
Query: 275 -KTTKKKKRKKKRKKKK 290
KK R KRK KK
Sbjct: 196 DPRMKKDLRALKRKAKK 212
>gnl|CDD|240317 PTZ00217, PTZ00217, flap endonuclease-1; Provisional.
Length = 393
Score = 38.1 bits (89), Expect = 0.004
Identities = 19/67 (28%), Positives = 31/67 (46%), Gaps = 1/67 (1%)
Query: 232 KKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKK-KKRKKKRKKKK 290
K ++ KK K KK Q D + T + ++ K K KKKKKK + + ++ K
Sbjct: 325 KYIERLKKAKTKKTQTRLDSFFTATKKPIKKSNSKAKLKKKKKKAGASAVPKSETSQEAK 384
Query: 291 NQHHRDV 297
+ + V
Sbjct: 385 SSGKKKV 391
Score = 35.4 bits (82), Expect = 0.032
Identities = 19/55 (34%), Positives = 22/55 (40%), Gaps = 5/55 (9%)
Query: 219 GFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKK 273
F KK KK K K KK+KKK S +E + KKK KK
Sbjct: 344 SFFTATKKPIKKSNSKAKLKKKKKKAGASAVP-----KSETSQEAKSSGKKKVKK 393
>gnl|CDD|217884 pfam04086, SRP-alpha_N, Signal recognition particle, alpha subunit,
N-terminal. SRP is a complex of six distinct
polypeptides and a 7S RNA that is essential for
transferring nascent polypeptide chains that are
destined for export from the cell to the translocation
apparatus of the endoplasmic reticulum (ER) membrane.
SRP binds hydrophobic signal sequences as they emerge
from the ribosome, and arrests translation.
Length = 272
Score = 37.8 bits (88), Expect = 0.004
Identities = 15/84 (17%), Positives = 27/84 (32%), Gaps = 11/84 (13%)
Query: 217 IPGFVMKKKKKRKKKKKKKKKKKRKKKK-----------NQHHRDVWINRTSRGEEEEEK 265
I + KRK++KK ++ + H + R ++ +
Sbjct: 128 IERKPKEPGLKRKQRKKAQESATSPESSPSSTPNSSRPSTPHLLKAKEGPSRRAKKAAKL 187
Query: 266 KKKKKKKKKKTTKKKKRKKKRKKK 289
+K+ K K KK KK
Sbjct: 188 SSTASSGDEKSPKSKAAPKKAGKK 211
Score = 30.5 bits (69), Expect = 0.85
Identities = 17/89 (19%), Positives = 27/89 (30%), Gaps = 23/89 (25%)
Query: 218 PGFVMKKKKKRKKKKKKKKKKKRKK-----------------------KKNQHHRDVWIN 254
K K+ K+K++KK ++ K+ R
Sbjct: 126 SMIERKPKEPGLKRKQRKKAQESATSPESSPSSTPNSSRPSTPHLLKAKEGPSRRAKKAA 185
Query: 255 RTSRGEEEEEKKKKKKKKKKKTTKKKKRK 283
+ S ++K K K K KK RK
Sbjct: 186 KLSSTASSGDEKSPKSKAAPKKAGKKMRK 214
>gnl|CDD|221733 pfam12720, DUF3807, Protein of unknown function (DUF3807). This is
a family of conserved fungal proteins of unknown
function.
Length = 169
Score = 37.0 bits (86), Expect = 0.005
Identities = 7/59 (11%), Positives = 24/59 (40%)
Query: 238 KKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQHHRD 296
++ K++ + + ++ + + +K+ ++ +KRK R K++
Sbjct: 68 ERELKEEAEAEEEGEVDASPDAGAVAGESSADRKEAEQQGAAQKRKSCRDKERKSAKDP 126
>gnl|CDD|236048 PRK07561, PRK07561, DNA topoisomerase I subunit omega; Validated.
Length = 859
Score = 38.2 bits (90), Expect = 0.005
Identities = 10/62 (16%), Positives = 16/62 (25%), Gaps = 19/62 (30%)
Query: 228 RKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRK 287
K +K + K KK K +KT ++ +K
Sbjct: 816 EKPEKLRYLADAPAKDP-------------------AGKKAAVKFSRKTKQQYVASEKDG 856
Query: 288 KK 289
K
Sbjct: 857 KA 858
Score = 37.1 bits (87), Expect = 0.011
Identities = 11/62 (17%), Positives = 16/62 (25%), Gaps = 20/62 (32%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
K +K R K KK K +K K++ +K
Sbjct: 817 KPEKLRYLADAPAKDPAGKKAA--------------------VKFSRKTKQQYVASEKDG 856
Query: 283 KK 284
K
Sbjct: 857 KA 858
Score = 35.5 bits (83), Expect = 0.030
Identities = 7/26 (26%), Positives = 9/26 (34%)
Query: 265 KKKKKKKKKKKTTKKKKRKKKRKKKK 290
K KK K RK K++
Sbjct: 826 DAPAKDPAGKKAAVKFSRKTKQQYVA 851
Score = 34.8 bits (81), Expect = 0.050
Identities = 7/26 (26%), Positives = 12/26 (46%)
Query: 265 KKKKKKKKKKKTTKKKKRKKKRKKKK 290
K KK K ++K K++ +K
Sbjct: 830 KDPAGKKAAVKFSRKTKQQYVASEKD 855
Score = 34.4 bits (80), Expect = 0.066
Identities = 7/54 (12%), Positives = 13/54 (24%), Gaps = 17/54 (31%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKT 276
K KK + +K + ++ +K K T
Sbjct: 823 YLADAPAKDPAGKKAAVKFSRKTK-----------------QQYVASEKDGKAT 859
Score = 33.6 bits (78), Expect = 0.12
Identities = 9/33 (27%), Positives = 13/33 (39%)
Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQ 292
E+ E+ + K KK K +K K Q
Sbjct: 816 EKPEKLRYLADAPAKDPAGKKAAVKFSRKTKQQ 848
Score = 32.1 bits (74), Expect = 0.35
Identities = 9/34 (26%), Positives = 12/34 (35%)
Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQH 293
E+ EK + K KK K +K Q
Sbjct: 815 AEKPEKLRYLADAPAKDPAGKKAAVKFSRKTKQQ 848
Score = 30.5 bits (70), Expect = 1.3
Identities = 5/27 (18%), Positives = 10/27 (37%)
Query: 264 EKKKKKKKKKKKTTKKKKRKKKRKKKK 290
+ K KK K ++ K++
Sbjct: 826 DAPAKDPAGKKAAVKFSRKTKQQYVAS 852
Score = 29.4 bits (67), Expect = 2.5
Identities = 4/26 (15%), Positives = 8/26 (30%)
Query: 265 KKKKKKKKKKKTTKKKKRKKKRKKKK 290
K K K +K +++
Sbjct: 825 ADAPAKDPAGKKAAVKFSRKTKQQYV 850
>gnl|CDD|178945 PRK00247, PRK00247, putative inner membrane protein translocase
component YidC; Validated.
Length = 429
Score = 37.9 bits (88), Expect = 0.005
Identities = 16/71 (22%), Positives = 30/71 (42%), Gaps = 2/71 (2%)
Query: 215 FWIPGFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKK 274
+KK + +K + K +KK+ +K+ R+ INR +R E + + ++
Sbjct: 325 LHAENAEIKKTRTAEKNEAKARKKEIAQKRRAAERE--INREARQERAAAMARARARRAA 382
Query: 275 KTTKKKKRKKK 285
KKK
Sbjct: 383 VKAKKKGLIDA 393
Score = 37.1 bits (86), Expect = 0.009
Identities = 16/96 (16%), Positives = 45/96 (46%), Gaps = 14/96 (14%)
Query: 220 FVMKKKKKRKKKKKKKKKKKRKKKKNQ-------------HHRDVWINRTSRGEEEEEKK 266
+++ + ++K+K+KK ++N+ H + I +T E+ E K
Sbjct: 286 HHAEQRAQYREKQKEKKAFLWTLRRNRLRMIITPWRAPELHAENAEIKKTRTAEKNEAKA 345
Query: 267 KKKK-KKKKKTTKKKKRKKKRKKKKNQHHRDVWQKK 301
+KK+ +K++ +++ ++ R+++ R ++
Sbjct: 346 RKKEIAQKRRAAEREINREARQERAAAMARARARRA 381
>gnl|CDD|233352 TIGR01310, L7, 60S ribosomal protein L7, eukaryotic. This model
describes the eukaryotic 60S (cytosolic) ribosomal
protein L7 and paralogs that may or may not also be L7.
Human, Drosophila, and Arabidopsis all have both a
typical L7 and an L7-related paralog. This family is
designated subfamily rather than equivalog to reflect
these uncharacterized paralogs. Members of this family
average ~ 250 residues in length, somewhat longer than
the archaeal L30P/L7E homolog (~ 155 residues) and much
longer than the related bacterial/organellar form (~ 60
residues).
Length = 235
Score = 37.0 bits (86), Expect = 0.006
Identities = 17/62 (27%), Positives = 33/62 (53%), Gaps = 1/62 (1%)
Query: 221 VMKKKKKRKKKKKK-KKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKK 279
++KK+K R++ + K+ K KKK N+ R ++ R +E K +++ + K+ KK
Sbjct: 3 LLKKRKARQELAVQVAKQAKAKKKANKKKRKIYFKRAESFVKEYRKAEREIIRLKRLAKK 62
Query: 280 KK 281
Sbjct: 63 PG 64
Score = 30.8 bits (70), Expect = 0.63
Identities = 8/30 (26%), Positives = 16/30 (53%)
Query: 265 KKKKKKKKKKKTTKKKKRKKKRKKKKNQHH 294
KK+K +++ K+ + KK+ KK +
Sbjct: 5 KKRKARQELAVQVAKQAKAKKKANKKKRKI 34
Score = 30.4 bits (69), Expect = 0.89
Identities = 10/31 (32%), Positives = 18/31 (58%)
Query: 265 KKKKKKKKKKKTTKKKKRKKKRKKKKNQHHR 295
K+K +++ + K+ K KKK KKK + +
Sbjct: 6 KRKARQELAVQVAKQAKAKKKANKKKRKIYF 36
>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein. This family
consists of several Borrelia P83/P100 antigen proteins.
Length = 489
Score = 37.3 bits (86), Expect = 0.007
Identities = 16/79 (20%), Positives = 38/79 (48%), Gaps = 6/79 (7%)
Query: 224 KKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKK------TT 277
+K++ +K + + KK + K + H+ + + S+ E+E + K+ + +KK+
Sbjct: 279 QKREIEKAQIEIKKNDEEALKAKDHKAFDLKQESKASEKEAEDKELEAQKKREPVAEDLQ 338
Query: 278 KKKKRKKKRKKKKNQHHRD 296
K K + + + N+ D
Sbjct: 339 KTKPQVEAQPTSLNEDAID 357
Score = 31.1 bits (70), Expect = 0.81
Identities = 16/82 (19%), Positives = 37/82 (45%), Gaps = 7/82 (8%)
Query: 225 KKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKK------KKKKKKTTK 278
K++ KK+ K ++K Q + D R +++E K K K+ K+ +
Sbjct: 219 KEELDKKQIDADKAQQKADFAQDNADK-QRDEVRQKQQEAKNLPKPADTSSPKEDKQVAE 277
Query: 279 KKKRKKKRKKKKNQHHRDVWQK 300
+KR+ ++ + + + + + K
Sbjct: 278 NQKREIEKAQIEIKKNDEEALK 299
Score = 30.4 bits (68), Expect = 1.2
Identities = 12/81 (14%), Positives = 37/81 (45%), Gaps = 2/81 (2%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTT--KKK 280
K+ ++ K++ KK+ K ++ ++ +++++ K K T+ K+
Sbjct: 213 KRAQQLKEELDKKQIDADKAQQKADFAQDNADKQRDEVRQKQQEAKNLPKPADTSSPKED 272
Query: 281 KRKKKRKKKKNQHHRDVWQKK 301
K+ + +K++ + + +K
Sbjct: 273 KQVAENQKREIEKAQIEIKKN 293
Score = 28.0 bits (62), Expect = 7.4
Identities = 11/75 (14%), Positives = 33/75 (44%)
Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
++K + K++ + ++K+ + TS +E+++ + +K++ +K + K
Sbjct: 232 AQQKADFAQDNADKQRDEVRQKQQEAKNLPKPADTSSPKEDKQVAENQKREIEKAQIEIK 291
Query: 282 RKKKRKKKKNQHHRD 296
+ + K H
Sbjct: 292 KNDEEALKAKDHKAF 306
>gnl|CDD|225606 COG3064, TolA, Membrane protein involved in colicin uptake [Cell
envelope biogenesis, outer membrane].
Length = 387
Score = 37.2 bits (86), Expect = 0.008
Identities = 15/67 (22%), Positives = 24/67 (35%)
Query: 224 KKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRK 283
K K KKK + + K + + + + E ++K +KKK K K
Sbjct: 184 KAKAEAAAAKKKAEAEAKAAAEKAKAEAEAKAKAEKKAEAAAEEKAAAEKKKAAAKAKAD 243
Query: 284 KKRKKKK 290
K K
Sbjct: 244 KAAAAAK 250
Score = 35.7 bits (82), Expect = 0.025
Identities = 19/90 (21%), Positives = 51/90 (56%), Gaps = 6/90 (6%)
Query: 218 PGFVMK--KKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGE----EEEEKKKKKKK 271
PG V++ + + ++ KK +++RKKK+ Q ++ + + E E+E+ K +++
Sbjct: 57 PGAVVQQYGRIQSQQSSAKKGEQQRKKKEEQVAEELKPKQAAEQERLKQLEKERLKAQEQ 116
Query: 272 KKKKTTKKKKRKKKRKKKKNQHHRDVWQKK 301
+K+ +K+ + ++K+++ Q + ++K
Sbjct: 117 QKQAEEAEKQAQLEQKQQEEQARKAAAEQK 146
Score = 35.7 bits (82), Expect = 0.026
Identities = 17/84 (20%), Positives = 31/84 (36%), Gaps = 5/84 (5%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGE-----EEEEKKKKKKKKKKKTT 277
+KKK K K + + K + + + E E KKK + + K
Sbjct: 145 QKKKAEAAKAKAAAEAAKLKAAAEAKKKAEEAAKAAEEAKAKAEAAAAKKKAEAEAKAAA 204
Query: 278 KKKKRKKKRKKKKNQHHRDVWQKK 301
+K K + + K K + ++K
Sbjct: 205 EKAKAEAEAKAKAEKKAEAAAEEK 228
Score = 34.5 bits (79), Expect = 0.052
Identities = 15/68 (22%), Positives = 28/68 (41%), Gaps = 2/68 (2%)
Query: 225 KKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKK 284
KKK + + K +K + + + + + + EE+ +KKK K K K K
Sbjct: 193 KKKAEAEAKAAAEKAKAEAEAKAKAEKKAEAAA--EEKAAAEKKKAAAKAKADKAAAAAK 250
Query: 285 KRKKKKNQ 292
++K
Sbjct: 251 AAERKAAA 258
Score = 34.2 bits (78), Expect = 0.080
Identities = 15/73 (20%), Positives = 26/73 (35%), Gaps = 7/73 (9%)
Query: 223 KKKK-----KRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTT 277
K K K+K ++ K ++ K K + + EK K + + K K
Sbjct: 161 AKLKAAAEAKKKAEEAAKAAEEAKAKAEAAAAK--KKAEAEAKAAAEKAKAEAEAKAKAE 218
Query: 278 KKKKRKKKRKKKK 290
KK + + K
Sbjct: 219 KKAEAAAEEKAAA 231
Score = 33.8 bits (77), Expect = 0.096
Identities = 15/79 (18%), Positives = 44/79 (55%), Gaps = 7/79 (8%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
K +++RKKK+++ ++ + K+ + R + E+E K ++++K+ + +K+ +
Sbjct: 76 KGEQQRKKKEEQVAEELKPKQAAEQER-------LKQLEKERLKAQEQQKQAEEAEKQAQ 128
Query: 283 KKKRKKKKNQHHRDVWQKK 301
+++++++ QKK
Sbjct: 129 LEQKQQEEQARKAAAEQKK 147
Score = 33.4 bits (76), Expect = 0.12
Identities = 11/67 (16%), Positives = 34/67 (50%), Gaps = 1/67 (1%)
Query: 224 KKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRK 283
+++K+ ++ +K+ + ++K+++ Q + + + E + K + K K + KK+
Sbjct: 115 EQQKQAEEAEKQAQLEQKQQEEQARKAA-AEQKKKAEAAKAKAAAEAAKLKAAAEAKKKA 173
Query: 284 KKRKKKK 290
++ K
Sbjct: 174 EEAAKAA 180
Score = 31.8 bits (72), Expect = 0.41
Identities = 16/69 (23%), Positives = 32/69 (46%)
Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
+K+ +K + K ++++K+ + +K +R E+KKK + K K + K
Sbjct: 103 LKQLEKERLKAQEQQKQAEEAEKQAQLEQKQQEEQARKAAAEQKKKAEAAKAKAAAEAAK 162
Query: 282 RKKKRKKKK 290
K + KK
Sbjct: 163 LKAAAEAKK 171
Score = 31.5 bits (71), Expect = 0.59
Identities = 14/74 (18%), Positives = 33/74 (44%), Gaps = 8/74 (10%)
Query: 225 KKKRKKKKKKKKKKKRK---KKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
+K+ + ++K+++++ RK ++K + ++ E K K + KKK + K
Sbjct: 124 EKQAQLEQKQQEEQARKAAAEQKKKAEAAK-----AKAAAEAAKLKAAAEAKKKAEEAAK 178
Query: 282 RKKKRKKKKNQHHR 295
++ K K
Sbjct: 179 AAEEAKAKAEAAAA 192
Score = 30.7 bits (69), Expect = 0.93
Identities = 9/73 (12%), Positives = 37/73 (50%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
++++ ++ +K++ K ++++K+ + + + + + E+ + ++KKK + K
Sbjct: 99 EQERLKQLEKERLKAQEQQKQAEEAEKQAQLEQKQQEEQARKAAAEQKKKAEAAKAKAAA 158
Query: 283 KKKRKKKKNQHHR 295
+ + K + +
Sbjct: 159 EAAKLKAAAEAKK 171
>gnl|CDD|219901 pfam08555, DUF1754, Eukaryotic family of unknown function
(DUF1754). This is a eukaryotic protein family of
unknown function.
Length = 90
Score = 34.7 bits (80), Expect = 0.008
Identities = 18/74 (24%), Positives = 39/74 (52%), Gaps = 7/74 (9%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKK----KKTTK 278
KKKKK+KKKK K K++ +K+ + + E EE++ +K +++ T
Sbjct: 20 KKKKKKKKKKNKSKEEVVTEKEEEEKSS---AESDLKEGEEDEDNEKIEQEEDGMNLTEA 76
Query: 279 KKKRKKKRKKKKNQ 292
++ ++ ++K+ +
Sbjct: 77 ERAFEEAQRKRLKE 90
Score = 31.2 bits (71), Expect = 0.12
Identities = 18/64 (28%), Positives = 31/64 (48%), Gaps = 14/64 (21%)
Query: 220 FVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKK 279
V+ K K K KK KKKK+KKKK + +E+ +K++++K++ +
Sbjct: 4 NVVGGKLKLKGKKIDVKKKKKKKKKK--------------NKSKEEVVTEKEEEEKSSAE 49
Query: 280 KKRK 283
K
Sbjct: 50 SDLK 53
Score = 28.2 bits (63), Expect = 1.5
Identities = 14/25 (56%), Positives = 16/25 (64%)
Query: 266 KKKKKKKKKKTTKKKKRKKKRKKKK 290
K K K KK KKK+KKK+KK K
Sbjct: 7 GGKLKLKGKKIDVKKKKKKKKKKNK 31
Score = 28.2 bits (63), Expect = 1.9
Identities = 11/29 (37%), Positives = 16/29 (55%)
Query: 266 KKKKKKKKKKTTKKKKRKKKRKKKKNQHH 294
K KK KKK KKKK+ K +++ +
Sbjct: 13 KGKKIDVKKKKKKKKKKNKSKEEVVTEKE 41
Score = 27.8 bits (62), Expect = 2.3
Identities = 11/31 (35%), Positives = 18/31 (58%)
Query: 263 EEKKKKKKKKKKKTTKKKKRKKKRKKKKNQH 293
++KKKKKKKK K + K++ +K +
Sbjct: 20 KKKKKKKKKKNKSKEEVVTEKEEEEKSSAES 50
Score = 26.2 bits (58), Expect = 8.8
Identities = 15/30 (50%), Positives = 17/30 (56%), Gaps = 2/30 (6%)
Query: 265 KKKKKKKKKKKTTKKKKRKKKRKKKKNQHH 294
K K KK KK KKKK+KKK K K+
Sbjct: 11 KLKGKKIDVKK--KKKKKKKKNKSKEEVVT 38
>gnl|CDD|223880 COG0810, TonB, Periplasmic protein TonB, links inner and outer
membranes [Cell envelope biogenesis, outer membrane].
Length = 244
Score = 36.7 bits (85), Expect = 0.008
Identities = 15/81 (18%), Positives = 28/81 (34%), Gaps = 5/81 (6%)
Query: 213 ISFWIPGFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKK 272
P + + + +++ K + + + + E++ KK K K
Sbjct: 47 KVLEAPTEEPQPEPEPPEEQPKPPTEPETPPE-----PTPPKPKEKPKPEKKPKKPKPKP 101
Query: 273 KKKTTKKKKRKKKRKKKKNQH 293
K K K K K + K KK
Sbjct: 102 KPKPKPKPKVKPQPKPKKPPS 122
Score = 32.5 bits (74), Expect = 0.18
Identities = 18/70 (25%), Positives = 24/70 (34%), Gaps = 17/70 (24%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
+ K K+K K +KK KK K + K K K K K K K K+
Sbjct: 77 PEPTPPKPKEKPKPEKKPKKPKPKP-----------------KPKPKPKPKVKPQPKPKK 119
Query: 283 KKKRKKKKNQ 292
+ K
Sbjct: 120 PPSKTAAKAP 129
>gnl|CDD|113514 pfam04747, DUF612, Protein of unknown function, DUF612. This
family includes several uncharacterized proteins from
Caenorhabditis elegans.
Length = 517
Score = 37.3 bits (85), Expect = 0.008
Identities = 22/74 (29%), Positives = 42/74 (56%), Gaps = 8/74 (10%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKK------KKKKKKKKT 276
KK +++ ++ + + KKR ++ +H + W R ++E+EKK+ + +KKK+K
Sbjct: 95 KKAAEKEARRAEAEAKKRAAQEEEHKQ--WKAEQERIQKEQEKKEADLKKLQAEKKKEKA 152
Query: 277 TKKKKRKKKRKKKK 290
K +K +K K KK
Sbjct: 153 VKAEKAEKAEKTKK 166
Score = 33.5 bits (75), Expect = 0.14
Identities = 19/73 (26%), Positives = 39/73 (53%)
Query: 218 PGFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTT 277
P V K KK KKK +K+ K + ++ + + R E E +K+ ++++ K+
Sbjct: 64 PQQVEKVKKSEKKKAQKQIAKDHEAEQKVNAKKAAEKEARRAEAEAKKRAAQEEEHKQWK 123
Query: 278 KKKKRKKKRKKKK 290
+++R +K ++KK
Sbjct: 124 AEQERIQKEQEKK 136
Score = 30.8 bits (68), Expect = 0.91
Identities = 19/83 (22%), Positives = 43/83 (51%), Gaps = 2/83 (2%)
Query: 221 VMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKK 280
+ ++ ++ +K KK +KKK +K+ H + +N E+E + + + KK+ ++
Sbjct: 60 LTEQPQQVEKVKKSEKKKAQKQIAKDHEAEQKVNAKKAAEKEARRAEAEAKKRAAQEEEH 119
Query: 281 KRKKKRKKK--KNQHHRDVWQKK 301
K+ K +++ K Q ++ KK
Sbjct: 120 KQWKAEQERIQKEQEKKEADLKK 142
>gnl|CDD|185429 PTZ00074, PTZ00074, 60S ribosomal protein L34; Provisional.
Length = 135
Score = 35.8 bits (83), Expect = 0.008
Identities = 16/30 (53%), Positives = 22/30 (73%)
Query: 263 EEKKKKKKKKKKKTTKKKKRKKKRKKKKNQ 292
+EK K+KK+KKKK KKKK+ K+ KK +
Sbjct: 106 KEKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135
Score = 34.3 bits (79), Expect = 0.022
Identities = 15/25 (60%), Positives = 19/25 (76%)
Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQ 246
KK+KK+KKKKKKKK K+ KK +
Sbjct: 111 QKKQKKKKKKKKKKKTSKKAAKKKK 135
Score = 34.3 bits (79), Expect = 0.026
Identities = 19/30 (63%), Positives = 23/30 (76%)
Query: 261 EEEEKKKKKKKKKKKTTKKKKRKKKRKKKK 290
+E+ K+KK+KKKKKK KKK KK KKKK
Sbjct: 106 KEKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135
Score = 33.9 bits (78), Expect = 0.029
Identities = 17/32 (53%), Positives = 23/32 (71%), Gaps = 2/32 (6%)
Query: 263 EEKK--KKKKKKKKKTTKKKKRKKKRKKKKNQ 292
EE+K K+ K+K K K+KK+KKK+KKKK
Sbjct: 96 EEQKIVKQVLKEKAKQKKQKKKKKKKKKKKTS 127
Score = 32.7 bits (75), Expect = 0.074
Identities = 16/29 (55%), Positives = 22/29 (75%)
Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRKK 288
E+ ++KK+KKKKKKKK K K+ K+KK
Sbjct: 107 EKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135
Score = 32.7 bits (75), Expect = 0.078
Identities = 16/26 (61%), Positives = 22/26 (84%)
Query: 221 VMKKKKKRKKKKKKKKKKKRKKKKNQ 246
V+K+K K+KK+KKKKKKKK+KK +
Sbjct: 104 VLKEKAKQKKQKKKKKKKKKKKTSKK 129
Score = 32.4 bits (74), Expect = 0.11
Identities = 15/26 (57%), Positives = 19/26 (73%)
Query: 265 KKKKKKKKKKKTTKKKKRKKKRKKKK 290
K+K K+KK+K KKKK+KKK KK
Sbjct: 105 LKEKAKQKKQKKKKKKKKKKKTSKKA 130
Score = 31.6 bits (72), Expect = 0.18
Identities = 16/30 (53%), Positives = 23/30 (76%)
Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRKKK 289
+E+ ++KK+KKKKKKK KK +K +KKK
Sbjct: 106 KEKAKQKKQKKKKKKKKKKKTSKKAAKKKK 135
Score = 31.6 bits (72), Expect = 0.20
Identities = 21/61 (34%), Positives = 32/61 (52%), Gaps = 18/61 (29%)
Query: 220 FVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKK 279
F+++++K K+ K+K K+K++KKK + KKKKKK KK KK
Sbjct: 93 FLVEEQKIVKQVLKEKAKQKKQKKKKK------------------KKKKKKTSKKAAKKK 134
Query: 280 K 280
K
Sbjct: 135 K 135
>gnl|CDD|221028 pfam11208, DUF2992, Protein of unknown function (DUF2992). This
bacterial family of proteins has no known function.
However, the cis-regulatory yjdF motif, just upstream
from the gene encoding the proteins for this family, is
a small non-coding RNA, Rfam:RF01764. The yjdF motif is
found in many Firmicutes, including Bacillus subtilis.
In most cases, it resides in potential 5' UTRs of
homologues of the yjdF gene whose function is unknown.
However, in Streptococcus thermophilus, a yjdF RNA motif
is associated with an operon whose protein products
synthesise nicotinamide adenine dinucleotide (NAD+).
Also, the S. thermophilus yjdF RNA lacks typical yjdF
motif consensus features downstream of and including the
P4 stem. Thus, if yjdF RNAs are riboswitch aptamers, the
S. thermophilus RNAs might sense a distinct compound
that structurally resembles the ligand bound by other
yjdF RNAs. On the ohter hand, perhaps these RNAs have an
alternative solution forming a similar binding site, as
is observed with some SAM riboswitches.
Length = 132
Score = 35.7 bits (83), Expect = 0.008
Identities = 23/75 (30%), Positives = 40/75 (53%), Gaps = 4/75 (5%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKK----KKKTTK 278
K +K+KK K+ +R+ K + + E E+ K++KKK KK+ K
Sbjct: 55 VKVSIKKQKKINPKRLQRQAAKEVKKPGISTKAQQALKLEHERNKQEKKKRSKEKKEEEK 114
Query: 279 KKKRKKKRKKKKNQH 293
++KR+ K++KKK +H
Sbjct: 115 ERKRQLKQQKKKAKH 129
Score = 31.5 bits (72), Expect = 0.23
Identities = 14/55 (25%), Positives = 30/55 (54%), Gaps = 14/55 (25%)
Query: 221 VMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKK 275
+K + +R K++KKK+ K++K+++ + R + K++KKK K +
Sbjct: 90 ALKLEHERNKQEKKKRSKEKKEEEKERKR--------------QLKQQKKKAKHR 130
Score = 27.2 bits (61), Expect = 6.2
Identities = 23/94 (24%), Positives = 37/94 (39%), Gaps = 25/94 (26%)
Query: 220 FVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTK- 278
F++K K + K K++KK N R R + K+ KK +TK
Sbjct: 41 FILKHLSTLKFTEAVKVSIKKQKKIN-------PKRLQR------QAAKEVKKPGISTKA 87
Query: 279 -----------KKKRKKKRKKKKNQHHRDVWQKK 301
K+++KK+ K+KK + Q K
Sbjct: 88 QQALKLEHERNKQEKKKRSKEKKEEEKERKRQLK 121
>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
Length = 434
Score = 37.1 bits (86), Expect = 0.010
Identities = 15/68 (22%), Positives = 39/68 (57%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
K+++ K+ KK + +K + + D ++ +EE+ K+++ +KK+K+ + K
Sbjct: 366 KRQELLKEYNKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKE 425
Query: 283 KKKRKKKK 290
K+++ ++K
Sbjct: 426 KRQKDERK 433
Score = 33.2 bits (76), Expect = 0.14
Identities = 15/63 (23%), Positives = 31/63 (49%), Gaps = 10/63 (15%)
Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
KK + K + ++ + K K+ + + EE E+K+K++ + K+ +K +
Sbjct: 382 TKKLGEVKDETDASEEAEAKAKEEK----------LKQEENEKKQKEQADEDKEKRQKDE 431
Query: 282 RKK 284
RKK
Sbjct: 432 RKK 434
Score = 31.3 bits (71), Expect = 0.69
Identities = 20/74 (27%), Positives = 34/74 (45%), Gaps = 6/74 (8%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
K+++ K+ KK + KK + T EE E K K++K K+++ KK+K
Sbjct: 365 DKRQELLKEYNKKLQDYTKKLGEVK------DETDASEEAEAKAKEEKLKQEENEKKQKE 418
Query: 283 KKKRKKKKNQHHRD 296
+ K+K Q
Sbjct: 419 QADEDKEKRQKDER 432
>gnl|CDD|217970 pfam04220, YihI, Der GTPase activator (YihI). YihI activates the
GTPase activity of Der, a 50S ribosomal subunit
stability factor. The stimulation is specific to Der as
YihI does not stimulate the GTPase activity of Era or
ObgE. The interaction of YihI with Der requires only the
C-terminal 78 amino acids of YihI. A yihI deletion
mutant is viable and shows a shorter lag period, but the
same post-lag growth rate as a wild-type strain. yihI is
expressed during the lag period. Overexpression of yihI
inhibits cell growth and biogenesis of the 50S ribosomal
subunit. YihI is an unusual, highly hydrophilic protein
with an uneven distribution of charged residues,
resulting in an N-terminal region with high pI and a
C-terminal region with low pI.
Length = 169
Score = 35.8 bits (83), Expect = 0.011
Identities = 20/69 (28%), Positives = 31/69 (44%), Gaps = 16/69 (23%)
Query: 228 RKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKT-------TKKK 280
RK K K + KKK R +E E+K+KKK+K K+ ++ +
Sbjct: 1 RKSGKNGPKLAPKGKKKT---------RYELDQEARERKRKKKRKGLKSGSRHNEESESQ 51
Query: 281 KRKKKRKKK 289
K+K +KK
Sbjct: 52 KQKGAAQKK 60
Score = 30.8 bits (70), Expect = 0.45
Identities = 28/97 (28%), Positives = 34/97 (35%), Gaps = 32/97 (32%)
Query: 222 MKKKKKRKKKKKKKKKK------------KRKKKKNQHHRDVWINRTSRGEEEEEKKKKK 269
K K K K KKK KRKKK+ SR EE E +K+K
Sbjct: 1 RKSGKNGPKLAPKGKKKTRYELDQEARERKRKKKRKGL------KSGSRHNEESESQKQK 54
Query: 270 KKKKKK--------------TTKKKKRKKKRKKKKNQ 292
+KK K K K+ KKKK +
Sbjct: 55 GAAQKKDPRIGSKKPVPLIVEFKVKPEKQAPKKKKVK 91
Score = 29.2 bits (66), Expect = 1.8
Identities = 19/75 (25%), Positives = 30/75 (40%), Gaps = 26/75 (34%)
Query: 223 KKKKKRKKKK-------KKKKKKKRKKKKNQHHR---------DVWINRTSRGEEEEEKK 266
K+KKKRK K + + +K++ + + R V E + K
Sbjct: 30 KRKKKRKGLKSGSRHNEESESQKQKGAAQKKDPRIGSKKPVPLIV----------EFKVK 79
Query: 267 KKKKKKKKKTTKKKK 281
+K+ KKK KK K
Sbjct: 80 PEKQAPKKKKVKKPK 94
>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein.
Length = 529
Score = 36.7 bits (85), Expect = 0.012
Identities = 20/73 (27%), Positives = 41/73 (56%), Gaps = 6/73 (8%)
Query: 210 SSSISFWIPGFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKK 269
S I++ I +V K+ +K ++++++K+ KK++ + D E +EE++K++
Sbjct: 16 SEFINYPIYLWVEKEVEKEVPDEEEEEEKEEKKEEEEKTTDK------EEEVDEEEEKEE 69
Query: 270 KKKKKKTTKKKKR 282
KKKK K K+
Sbjct: 70 KKKKTKKVKETTT 82
>gnl|CDD|219564 pfam07771, TSGP1, Tick salivary peptide group 1. This contains a
group of peptides derived from a salivary gland cDNA
library of the tick Ixodes scapularis. Also present are
peptides from a related tick species, Ixodes ricinus.
They are characterized by a putative signal peptide
indicative of secretion and conserved cysteine residues.
Length = 120
Score = 34.9 bits (80), Expect = 0.013
Identities = 17/26 (65%), Positives = 17/26 (65%)
Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKK 285
E EK KKKKKK KKT K KK KK
Sbjct: 94 PEPTEKPKKKKKKSKKTKKPKKSSKK 119
Score = 33.7 bits (77), Expect = 0.035
Identities = 14/27 (51%), Positives = 17/27 (62%)
Query: 261 EEEEKKKKKKKKKKKTTKKKKRKKKRK 287
E +K KKKKKK K TKK K+ K+
Sbjct: 94 PEPTEKPKKKKKKSKKTKKPKKSSKKD 120
Score = 32.9 bits (75), Expect = 0.050
Identities = 13/22 (59%), Positives = 16/22 (72%)
Query: 224 KKKKRKKKKKKKKKKKRKKKKN 245
K KK+KKK KK KK K+ KK+
Sbjct: 99 KPKKKKKKSKKTKKPKKSSKKD 120
Score = 32.2 bits (73), Expect = 0.093
Identities = 14/22 (63%), Positives = 15/22 (68%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKK 244
K KKK+KK KK KK KK KK
Sbjct: 99 KPKKKKKKSKKTKKPKKSSKKD 120
Score = 31.8 bits (72), Expect = 0.15
Identities = 15/27 (55%), Positives = 17/27 (62%)
Query: 263 EEKKKKKKKKKKKTTKKKKRKKKRKKK 289
E +K KKKKKK+ K KK KK KK
Sbjct: 94 PEPTEKPKKKKKKSKKTKKPKKSSKKD 120
Score = 29.5 bits (66), Expect = 0.80
Identities = 13/22 (59%), Positives = 14/22 (63%)
Query: 229 KKKKKKKKKKKRKKKKNQHHRD 250
K KKKKKK KK KK K +D
Sbjct: 99 KPKKKKKKSKKTKKPKKSSKKD 120
Score = 29.5 bits (66), Expect = 0.88
Identities = 12/27 (44%), Positives = 15/27 (55%)
Query: 262 EEEKKKKKKKKKKKTTKKKKRKKKRKK 288
E +K KKKKKK KK +K +K
Sbjct: 94 PEPTEKPKKKKKKSKKTKKPKKSSKKD 120
Score = 29.5 bits (66), Expect = 1.0
Identities = 12/21 (57%), Positives = 14/21 (66%)
Query: 226 KKRKKKKKKKKKKKRKKKKNQ 246
K +KKKKK KK KK KK +
Sbjct: 99 KPKKKKKKSKKTKKPKKSSKK 119
Score = 27.5 bits (61), Expect = 4.6
Identities = 11/21 (52%), Positives = 11/21 (52%)
Query: 218 PGFVMKKKKKRKKKKKKKKKK 238
P KK KK KK KK KK
Sbjct: 100 PKKKKKKSKKTKKPKKSSKKD 120
>gnl|CDD|218337 pfam04939, RRS1, Ribosome biogenesis regulatory protein (RRS1).
This family consists of several eukaryotic ribosome
biogenesis regulatory (RRS1) proteins. RRS1 is a nuclear
protein that is essential for the maturation of 25 S
rRNA and the 60 S ribosomal subunit assembly in
Saccharomyces cerevisiae.
Length = 164
Score = 35.3 bits (82), Expect = 0.014
Identities = 21/80 (26%), Positives = 29/80 (36%), Gaps = 26/80 (32%)
Query: 225 KKKRKKKKKKKKK--------------KKRKKKKNQHHRDVWI---------NRTSRGEE 261
KKK KK+KK K KR K + WI ++
Sbjct: 88 KKKGIKKRKKSKLVYDEATGEWVPRWGYKRANKDLE---KDWIIEVKPNDDPKEDPFAKK 144
Query: 262 EEEKKKKKKKKKKKTTKKKK 281
EKK++ K +K+ K KK
Sbjct: 145 RREKKERVAKNEKRELKNKK 164
Score = 29.6 bits (67), Expect = 1.4
Identities = 21/69 (30%), Positives = 31/69 (44%), Gaps = 7/69 (10%)
Query: 231 KKKKKKKKKRKKKKNQHHRDVWINR--TSRGEEEEEKK-----KKKKKKKKKTTKKKKRK 283
KKK KK+K+ K W+ R R ++ EK K K+ KK+R+
Sbjct: 88 KKKGIKKRKKSKLVYDEATGEWVPRWGYKRANKDLEKDWIIEVKPNDDPKEDPFAKKRRE 147
Query: 284 KKRKKKKNQ 292
KK + KN+
Sbjct: 148 KKERVAKNE 156
>gnl|CDD|217348 pfam03064, U79_P34, HSV U79 / HCMV P34. This family represents
herpes virus protein U79 and cytomegalovirus early
phosphoprotein P34 (UL112).
Length = 238
Score = 36.0 bits (83), Expect = 0.015
Identities = 15/74 (20%), Positives = 34/74 (45%), Gaps = 10/74 (13%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
K+K+KR+ + +K K+ R+KK+ + R +E+++ + +
Sbjct: 165 KQKEKRRVEDSQKHKEDRRKKQEE----------KRRNDEDKRPGGGGGSSGGQSGLSTK 214
Query: 283 KKKRKKKKNQHHRD 296
+ K+K+ +HH
Sbjct: 215 DEPPKEKRQKHHDP 228
Score = 27.9 bits (62), Expect = 6.4
Identities = 13/43 (30%), Positives = 28/43 (65%), Gaps = 2/43 (4%)
Query: 256 TSRGEEEEEKKKK--KKKKKKKTTKKKKRKKKRKKKKNQHHRD 296
SR + ++E +K+ K+K+K++ +K K+ R+KK+ + R+
Sbjct: 150 LSRKKSDDEHRKRSGKQKEKRRVEDSQKHKEDRRKKQEEKRRN 192
>gnl|CDD|220129 pfam09159, Ydc2-catalyt, Mitochondrial resolvase Ydc2 / RNA
splicing MRS1. Members of this family adopt a secondary
structure consisting of two beta sheets and one alpha
helix, arranged as a beta-alpha-beta motif. Each beta
sheet has five strands, arranged in a 32145 order, with
the second strand being antiparallel to the rest.
Mitochondrial resolvase Ydc2 is capable of resolving
Holliday junctions and cleaves DNA after 5'-CT-3' and
5'-TT-3' sequences. This family also contains the
mitochondrial RNA-splicing protein MRS1 which is
involved in the excision of group I introns.
Length = 254
Score = 35.8 bits (83), Expect = 0.016
Identities = 16/89 (17%), Positives = 28/89 (31%), Gaps = 11/89 (12%)
Query: 213 ISFWIPGFVMKKKKKRKKKKKKKKKKKRKKKKN-----------QHHRDVWINRTSRGEE 261
++W + +KK+ + KKK K +K + +
Sbjct: 142 ANYWCERTEILAEKKKPRSKKKSSKNSKKLRIDLVGEWLSTAISPPDSSPVSLDAREKLR 201
Query: 262 EEEKKKKKKKKKKKTTKKKKRKKKRKKKK 290
K + K K KK+ K+ K K
Sbjct: 202 AYLPKWEGKTKNKKSLKEYKDLIKLLDSG 230
>gnl|CDD|220413 pfam09805, Nop25, Nucleolar protein 12 (25kDa). Members of this
family of proteins are part of the yeast nuclear pore
complex-associated pre-60S ribosomal subunit. The family
functions as a highly conserved exonuclease that is
required for the 5'-end maturation of 5.8S and 25S
rRNAs, demonstrating that 5'-end processing also has a
redundant pathway. Nop25 binds late pre-60S ribosomes,
accompanying them from the nucleolus to the nuclear
periphery; and there is evidence for both physical and
functional links between late 60S subunit processing and
export.
Length = 134
Score = 34.6 bits (80), Expect = 0.017
Identities = 19/79 (24%), Positives = 43/79 (54%), Gaps = 22/79 (27%)
Query: 225 KKKRKKKKKKKK----KKKRK-------KKKNQHHRDVWINRTSRGEEEEEKKKKKKKKK 273
KKK+ KKK ++ K+ RK K+K Q R ++ +E+ K+K++++
Sbjct: 1 KKKKAKKKGVEEVVFDKEARKEYLTGFHKRKQQ-----------RRKKAQEEAKEKEREE 49
Query: 274 KKTTKKKKRKKKRKKKKNQ 292
+ +K+ R++++++ + Q
Sbjct: 50 RIEERKRIREERKQELEKQ 68
Score = 30.4 bits (69), Expect = 0.46
Identities = 14/63 (22%), Positives = 37/63 (58%), Gaps = 17/63 (26%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
K+K++R+KK +++ K+K ++++ EE+K+ ++++K++ K+ K
Sbjct: 29 KRKQQRRKKAQEEAKEKEREER-----------------IEERKRIREERKQELEKQLKE 71
Query: 283 KKK 285
+K+
Sbjct: 72 RKE 74
>gnl|CDD|215565 PLN03083, PLN03083, E3 UFM1-protein ligase 1 homolog; Provisional.
Length = 803
Score = 36.3 bits (84), Expect = 0.017
Identities = 17/64 (26%), Positives = 24/64 (37%), Gaps = 8/64 (12%)
Query: 236 KKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR--------KKKRK 287
K +KKK + +EE+ KK KK +KK K K KK+
Sbjct: 417 DKGSKKKKGKSTSTKGGTAESIPDDEEDAPKKGKKNQKKGRDKSSKVPSDSKAGGKKESV 476
Query: 288 KKKN 291
K +
Sbjct: 477 KSQE 480
>gnl|CDD|217834 pfam03998, Utp11, Utp11 protein. This protein is found to be part
of a large ribonucleoprotein complex containing the U3
snoRNA. Depletion of the Utp proteins impedes production
of the 18S rRNA, indicating that they are part of the
active pre-rRNA processing complex. This large RNP
complex has been termed the small subunit (SSU)
processome.
Length = 239
Score = 35.4 bits (82), Expect = 0.019
Identities = 28/72 (38%), Positives = 38/72 (52%), Gaps = 6/72 (8%)
Query: 221 VMKKKKKRKKKKKKKKKK--KRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTK 278
V +K+KK+ KKK+K K K +K++ + + V R E + E KK K KKKK K
Sbjct: 168 VDEKQKKKSAKKKRKLYKELKERKEREKKLKKV----EQRLELQRELMKKGKGKKKKIVK 223
Query: 279 KKKRKKKRKKKK 290
K K K KK
Sbjct: 224 DKDGKVVYKWKK 235
Score = 34.7 bits (80), Expect = 0.035
Identities = 26/73 (35%), Positives = 39/73 (53%), Gaps = 8/73 (10%)
Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
+KKK KKK+K K+ K +K++ + + V R E + E KK K KKKK K K
Sbjct: 171 KQKKKSAKKKRKLYKELKERKEREKKLKKV----EQRLELQRELMKKGKGKKKKIVKDKD 226
Query: 282 R----KKKRKKKK 290
K K+++K+
Sbjct: 227 GKVVYKWKKERKR 239
Score = 33.1 bits (76), Expect = 0.11
Identities = 18/79 (22%), Positives = 39/79 (49%), Gaps = 6/79 (7%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
+++KKK KKK+K K+ + R + ++ E++ + +++ KK K KK+
Sbjct: 166 SLVDEKQKKKSAKKKRKLYKELKER-----KEREKKLKKVEQRLELQRELMKKG-KGKKK 219
Query: 283 KKKRKKKKNQHHRDVWQKK 301
K + K ++ ++K
Sbjct: 220 KIVKDKDGKVVYKWKKERK 238
Score = 33.1 bits (76), Expect = 0.12
Identities = 21/76 (27%), Positives = 32/76 (42%), Gaps = 10/76 (13%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEK------KKKKKKKKKKT 276
K+KKK KKK++ K+ + R + ++ E++ KK K KKK
Sbjct: 165 TSLVDEKQKKKSAKKKRKLYKELKER----KEREKKLKKVEQRLELQRELMKKGKGKKKK 220
Query: 277 TKKKKRKKKRKKKKNQ 292
K K K K K +
Sbjct: 221 IVKDKDGKVVYKWKKE 236
Score = 27.3 bits (61), Expect = 9.1
Identities = 27/97 (27%), Positives = 37/97 (38%), Gaps = 21/97 (21%)
Query: 219 GFVMKKK--KKRKKKKKKKKKK-KRKKKK-----------NQHHRDVWINRTSRGEEEEE 264
G + KKK K R K KK+K K K+K EE+E
Sbjct: 17 GLLEKKKDYKLRAKDYHKKEKTLKALKEKAAERNPDEFYFGMISSKTDDGVHILKREEKE 76
Query: 265 KKKKKKKKKKKT-------TKKKKRKKKRKKKKNQHH 294
K++ K KT TK++ KK +K K + H
Sbjct: 77 VLSKEQVKLLKTQDLNYVRTKRQAEAKKIEKLKEELH 113
>gnl|CDD|223939 COG1007, NuoN, NADH:ubiquinone oxidoreductase subunit 2 (chain N)
[Energy production and conversion].
Length = 475
Score = 36.1 bits (84), Expect = 0.021
Identities = 23/114 (20%), Positives = 37/114 (32%), Gaps = 17/114 (14%)
Query: 50 FIIITAILGNLLVIISVIKHRKLRIITNYFVVSLAFADLLVALCVMPFNAIVSLTDEWYF 109
+ ITA++ LLV + + + R + LA LL AL + F A +
Sbjct: 13 ILAITALVL-LLVGLFIRRSRS-------LLAYLAVLGLLAALASLLFIAYRGS---AFA 61
Query: 110 GYFMCDVWNSFDVYFSTASILHLCCISVDRYYAIVKPLQYPIIMNQNTVLLMLS 163
G F+ D + L +S Y LL+ +
Sbjct: 62 GAFIVDALALLMKIVVLLASLAALILS--LPYLERAGTFKGEFY----ALLLFA 109
>gnl|CDD|220121 pfam09135, Alb1, Alb1. Alb1 is a nuclear shuttling factor involved
in ribosome biogenesis.
Length = 112
Score = 34.0 bits (78), Expect = 0.021
Identities = 17/55 (30%), Positives = 28/55 (50%), Gaps = 1/55 (1%)
Query: 232 KKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKR 286
K + K+ R + S+ E +KK++KKK K+ T K++KR+KK
Sbjct: 17 DKSSLTSLPRAKRTDRKRPGILA-ASKNEGVSKKKQRKKKPKRLTRKQRKRQKKG 70
Score = 31.7 bits (72), Expect = 0.13
Identities = 19/60 (31%), Positives = 32/60 (53%), Gaps = 8/60 (13%)
Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
KK++K+K K+ +K++KR+KK + V + S KK+ K + K KK+K
Sbjct: 48 KKKQRKKKPKRLTRKQRKRQKKGLERAEIV-TDTLS-------KKRAKSIGRAKYVKKRK 99
Score = 31.3 bits (71), Expect = 0.20
Identities = 14/64 (21%), Positives = 25/64 (39%)
Query: 227 KRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKR 286
+R K + R + E KKK++KKK + ++++KR
Sbjct: 7 RRAPSPSIDLDKSSLTSLPRAKRTDRKRPGILAASKNEGVSKKKQRKKKPKRLTRKQRKR 66
Query: 287 KKKK 290
+KK
Sbjct: 67 QKKG 70
>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
Length = 1388
Score = 36.2 bits (84), Expect = 0.021
Identities = 19/72 (26%), Positives = 37/72 (51%)
Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
K K RK K KKK+KKK+K ++ + + + R + +E++K K KK+
Sbjct: 1164 GKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKPDNKKSNSSGS 1223
Query: 282 RKKKRKKKKNQH 293
++ +++K +
Sbjct: 1224 DQEDDEEQKTKP 1235
Score = 33.9 bits (78), Expect = 0.11
Identities = 19/75 (25%), Positives = 33/75 (44%)
Query: 217 IPGFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKT 276
K +K K KKK+KKKKK K++ V ++ +E+ + K KK +
Sbjct: 1161 KTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKPDNKKSNS 1220
Query: 277 TKKKKRKKKRKKKKN 291
+ + + +K K
Sbjct: 1221 SGSDQEDDEEQKTKP 1235
Score = 32.3 bits (74), Expect = 0.32
Identities = 23/77 (29%), Positives = 42/77 (54%), Gaps = 7/77 (9%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSR-------GEEEEEKKKKKKKKKKK 275
K+K ++ + +KK+K+ +K KN +D+W+ + EE EEK+ K+++ K
Sbjct: 1101 KEKVEKLNAELEKKEKELEKLKNTTPKDMWLEDLDKFEEALEEQEEVEEKEIAKEQRLKS 1160
Query: 276 TTKKKKRKKKRKKKKNQ 292
TK K K ++ K K +
Sbjct: 1161 KTKGKASKLRKPKLKKK 1177
Score = 31.9 bits (73), Expect = 0.53
Identities = 15/75 (20%), Positives = 30/75 (40%), Gaps = 14/75 (18%)
Query: 218 PGFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTT 277
P KK+ KKKKK +KK +++ K + K+ +++
Sbjct: 1305 PTKKKVKKRLEGSLAALKKKKKSEKK--------------TARKKKSKTRVKQASASQSS 1350
Query: 278 KKKKRKKKRKKKKNQ 292
+ +R +K+K +
Sbjct: 1351 RLLRRPRKKKSDSSS 1365
Score = 31.2 bits (71), Expect = 0.86
Identities = 16/70 (22%), Positives = 32/70 (45%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
K KKK KKKKK K +K + + V + + +++ + KK + ++++
Sbjct: 1173 KLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKPDNKKSNSSGSDQEDDEEQK 1232
Query: 283 KKKRKKKKNQ 292
K +K +
Sbjct: 1233 TKPKKSSVKR 1242
Score = 30.8 bits (70), Expect = 0.99
Identities = 12/61 (19%), Positives = 29/61 (47%)
Query: 230 KKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKK 289
+ ++K+ K+++ K + + + ++E++KKK K KK + K+ +
Sbjct: 1146 EVEEKEIAKEQRLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDE 1205
Query: 290 K 290
K
Sbjct: 1206 K 1206
Score = 29.6 bits (67), Expect = 2.8
Identities = 18/66 (27%), Positives = 25/66 (37%), Gaps = 1/66 (1%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
K+ K K K+ + D N S+ KKK KK+ + KK+
Sbjct: 1266 SKEGKPKNAPKRVSAVQYSPPPPSKRPDGESNGGSK-PSSPTKKKVKKRLEGSLAALKKK 1324
Query: 283 KKKRKK 288
KK KK
Sbjct: 1325 KKSEKK 1330
Score = 29.2 bits (66), Expect = 3.2
Identities = 16/77 (20%), Positives = 26/77 (33%), Gaps = 14/77 (18%)
Query: 220 FVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKK 279
+ K+++ K K K K + +K +++EKKKKK K
Sbjct: 1148 EEKEIAKEQRLKSKTKGKASKLRKPKL--------------KKKEKKKKKSSADKSKKAS 1193
Query: 280 KKRKKKRKKKKNQHHRD 296
KR + D
Sbjct: 1194 VVGNSKRVDSDEKRKLD 1210
Score = 29.2 bits (66), Expect = 3.2
Identities = 19/78 (24%), Positives = 29/78 (37%), Gaps = 15/78 (19%)
Query: 224 KKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRK 283
K+++ K K K K K RK K +++E+KKKK K K
Sbjct: 1154 KEQRLKSKTKGKASKLRKPKL---------------KKKEKKKKKSSADKSKKASVVGNS 1198
Query: 284 KKRKKKKNQHHRDVWQKK 301
K+ + + D K
Sbjct: 1199 KRVDSDEKRKLDDKPDNK 1216
Score = 27.7 bits (62), Expect = 9.6
Identities = 26/83 (31%), Positives = 33/83 (39%), Gaps = 14/83 (16%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRT--------------SRGEEEEEKKKK 268
KKK+K+KKK K KK N D R S E++EE+K K
Sbjct: 1175 KKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKPDNKKSNSSGSDQEDDEEQKTK 1234
Query: 269 KKKKKKKTTKKKKRKKKRKKKKN 291
KK K K KK + + N
Sbjct: 1235 PKKSSVKRLKSKKNNSSKSSEDN 1257
>gnl|CDD|218941 pfam06217, GAGA_bind, GAGA binding protein-like family. This
family includes gbp a protein from Soybean that binds to
GAGA element dinucleotide repeat DNA. It seems likely
that the this domain mediates DNA binding. This putative
domain contains several conserved cysteines and a
histidine suggesting this may be a zinc-binding DNA
interaction domain.
Length = 301
Score = 35.6 bits (82), Expect = 0.022
Identities = 15/47 (31%), Positives = 22/47 (46%)
Query: 246 QHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQ 292
HH +V T+ E +E KK KK + K K +K K+K +
Sbjct: 129 MHHLEVPPISTAPPEAKEVKKPKKGQSPKVPKAPKPKKPKKKGSVSN 175
Score = 30.2 bits (68), Expect = 1.2
Identities = 11/53 (20%), Positives = 18/53 (33%)
Query: 217 IPGFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKK 269
P + KK +K + K K + KK + + G + K K
Sbjct: 140 APPEAKEVKKPKKGQSPKVPKAPKPKKPKKKGSVSNRSVKMPGIDPRSKPDWK 192
Score = 28.3 bits (63), Expect = 5.0
Identities = 18/70 (25%), Positives = 22/70 (31%), Gaps = 21/70 (30%)
Query: 225 KKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKK 284
K+ KK KK + K K K KK KKK + K
Sbjct: 144 AKEVKKPKKGQSPKVPKAPK-------------------PKKPKKKGSVSNRSVKMPGID 184
Query: 285 KRKK--KKNQ 292
R K K+Q
Sbjct: 185 PRSKPDWKSQ 194
>gnl|CDD|220252 pfam09468, RNase_H2-Ydr279, Ydr279p protein family (RNase H2
complex component). RNases H are enzymes that
specifically hydrolyse RNA when annealed to a
complementary DNA and are present in all living
organisms. In yeast RNase H2 is composed of a complex of
three proteins (Rnh2Ap, Ydr279p and Ylr154p), this
family represents the homologues of Ydr279p. It is not
known whether non yeast proteins in this family fulfil
the same function.
Length = 287
Score = 35.8 bits (83), Expect = 0.022
Identities = 22/86 (25%), Positives = 33/86 (38%), Gaps = 13/86 (15%)
Query: 203 NVYYALISSSISFWIPGFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEE 262
+ +L+ KKKKR+ ++ + + R +KK +E
Sbjct: 210 ELLKSLLIPEFKPLDKYLKESKKKKRETEEDVEAAESRAEKKR-------------KSKE 256
Query: 263 EEKKKKKKKKKKKTTKKKKRKKKRKK 288
E KKKK K+ K KK K KK
Sbjct: 257 EIKKKKPKESKGVKALKKVVAKGMKK 282
Score = 35.4 bits (82), Expect = 0.029
Identities = 22/80 (27%), Positives = 36/80 (45%), Gaps = 3/80 (3%)
Query: 213 ISFWIPGFVMKKKKKRKKKKKKKK---KKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKK 269
+ ++P + K+ K + K K KKK + + SR E++ + K++
Sbjct: 199 LCSYLPPDLYKELLKSLLIPEFKPLDKYLKESKKKKRETEEDVEAAESRAEKKRKSKEEI 258
Query: 270 KKKKKKTTKKKKRKKKRKKK 289
KKKK K +K K KK K
Sbjct: 259 KKKKPKESKGVKALKKVVAK 278
>gnl|CDD|216581 pfam01577, Peptidase_S30, Potyvirus P1 protease. The potyviridae
family positive stand RNA viruses with genome encoding a
polyprotein. members include zucchini yellow mosaic
virus, and turnip mosaic viruses which cause
considerable losses of crops worldwide. This family
consists of a C terminus region from various plant
potyvirus P1 proteins (found at the N terminus of the
polyprotein). The C terminus of P1 is a serine-type
protease responsible for autocatalytic cleavage between
P1 and the helper component protease pfam00851. The
entire P1 protein may be involved in virus-host
interactions.
Length = 245
Score = 35.4 bits (82), Expect = 0.023
Identities = 18/77 (23%), Positives = 34/77 (44%), Gaps = 5/77 (6%)
Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKK-----KKKKKKKKT 276
+ KK++K K+++R++++ I ++ K + K+T
Sbjct: 31 KLSPAQVAKKREKLKREEREERQFLQGAYASIVSKITPIGTDKVSKTESVSFRTPYYKRT 90
Query: 277 TKKKKRKKKRKKKKNQH 293
TKK K+KKK+KK
Sbjct: 91 TKKMKKKKKKKKVVMSD 107
>gnl|CDD|240578 cd12951, RRP7_Rrp7A, RRP7 domain ribosomal RNA-processing protein 7
homolog A (Rrp7A) and similar proteins. The family
corresponds to the RRP7 domain of Rrp7A, also termed
gastric cancer antigen Zg14, and similar proteins which
are yeast ribosomal RNA-processing protein 7 (Rrp7p)
homologs mainly found in Metazoans. The cellular
function of Rrp7A remains unclear currently. Rrp7A
harbors an N-terminal RNA recognition motif (RRM), also
termed RBD (RNA binding domain) or RNP
(ribonucleoprotein domain), and a C-terminal RRP7
domain.
Length = 129
Score = 34.2 bits (79), Expect = 0.024
Identities = 17/63 (26%), Positives = 35/63 (55%), Gaps = 6/63 (9%)
Query: 232 KKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKN 291
K+++++K+ K+K+ + D W+ T +G ++ K +K+ K K+K+KKKK
Sbjct: 32 KEEEEEKEEKEKEAEPDEDGWVTVTKKG------RRPKTARKESVAAKAAEKEKKKKKKK 85
Query: 292 QHH 294
+
Sbjct: 86 ELK 88
Score = 34.2 bits (79), Expect = 0.027
Identities = 25/73 (34%), Positives = 35/73 (47%), Gaps = 12/73 (16%)
Query: 218 PGFVMKKKKKRKKKKKKKKKKKRKKKKN----QHHRDVWINRTSRGEEEEEKKKKKKKKK 273
P K+ K +K+KKKKK+K+ KN Q R S+ E E +KK ++ K
Sbjct: 62 PKTARKESVAAKAAEKEKKKKKKKELKNFYRFQI-------RESKRNELAELRKKFEEDK 114
Query: 274 KKTTK-KKKRKKK 285
K+ K RK K
Sbjct: 115 KRIALLKAARKFK 127
Score = 32.6 bits (75), Expect = 0.076
Identities = 20/70 (28%), Positives = 39/70 (55%), Gaps = 7/70 (10%)
Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
M++ K ++++ K+ K+K+ + D W+ T +G K +K+ K +K+K
Sbjct: 27 MEEYDKEEEEE-----KEEKEKEAEPDEDGWVTVTKKG--RRPKTARKESVAAKAAEKEK 79
Query: 282 RKKKRKKKKN 291
+KKK+K+ KN
Sbjct: 80 KKKKKKELKN 89
>gnl|CDD|218148 pfam04557, tRNA_synt_1c_R2, Glutaminyl-tRNA synthetase,
non-specific RNA binding region part 2. This is a
region found N terminal to the catalytic domain of
glutaminyl-tRNA synthetase (EC 6.1.1.18) in eukaryotes
but not in Escherichia coli. This region is thought to
bind RNA in a non-specific manner, enhancing
interactions between the tRNA and enzyme, but is not
essential for enzyme function.
Length = 83
Score = 33.1 bits (76), Expect = 0.026
Identities = 12/34 (35%), Positives = 15/34 (44%)
Query: 259 GEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQ 292
G + E KKKKKKKK + + KK
Sbjct: 15 GPKTEADLVKKKKKKKKKKAEDTAATAKAKKATA 48
Score = 31.9 bits (73), Expect = 0.057
Identities = 13/32 (40%), Positives = 15/32 (46%)
Query: 261 EEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQ 292
E + KKKKKKKKKK K +K
Sbjct: 19 EADLVKKKKKKKKKKAEDTAATAKAKKATAED 50
Score = 30.4 bits (69), Expect = 0.21
Identities = 12/22 (54%), Positives = 14/22 (63%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKK 244
KKKKK+KKKK + K KK
Sbjct: 24 KKKKKKKKKKAEDTAATAKAKK 45
Score = 30.0 bits (68), Expect = 0.34
Identities = 11/23 (47%), Positives = 14/23 (60%)
Query: 222 MKKKKKRKKKKKKKKKKKRKKKK 244
+ KKKK+KKKKK + K K
Sbjct: 22 LVKKKKKKKKKKAEDTAATAKAK 44
Score = 28.9 bits (65), Expect = 0.71
Identities = 10/24 (41%), Positives = 13/24 (54%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQ 246
KKKKK+KKK + + KK
Sbjct: 25 KKKKKKKKKAEDTAATAKAKKATA 48
Score = 28.9 bits (65), Expect = 0.76
Identities = 10/23 (43%), Positives = 13/23 (56%)
Query: 224 KKKKRKKKKKKKKKKKRKKKKNQ 246
KK+KKKKKKK + K +
Sbjct: 22 LVKKKKKKKKKKAEDTAATAKAK 44
Score = 27.7 bits (62), Expect = 2.1
Identities = 13/59 (22%), Positives = 17/59 (28%), Gaps = 20/59 (33%)
Query: 225 KKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRK 283
K + KKKKKKKK+K + K KK +
Sbjct: 17 KTEADLVKKKKKKKKKKAED--------------------TAATAKAKKATAEDVSEGA 55
Score = 26.2 bits (58), Expect = 7.5
Identities = 8/24 (33%), Positives = 11/24 (45%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQ 246
KKKKK+K + K +K
Sbjct: 27 KKKKKKKAEDTAATAKAKKATAED 50
Score = 26.2 bits (58), Expect = 7.8
Identities = 10/31 (32%), Positives = 16/31 (51%)
Query: 255 RTSRGEEEEEKKKKKKKKKKKTTKKKKRKKK 285
+T +++KKKKKKK + K +K
Sbjct: 17 KTEADLVKKKKKKKKKKAEDTAATAKAKKAT 47
>gnl|CDD|221654 pfam12589, WBS_methylT, Methyltransferase involved in
Williams-Beuren syndrome. This domain family is found
in eukaryotes, and is typically between 72 and 83 amino
acids in length. The family is found in association with
pfam08241. This family is made up of
S-adenosylmethionine-dependent methyltransferases. The
proteins are deleted in Williams-Beuren syndrome (WBS),
a complex developmental disorder with multisystemic
manifestations including supravalvular aortic stenosis
(SVAS) and a specific cognitive phenotype.
Length = 85
Score = 33.0 bits (76), Expect = 0.027
Identities = 20/66 (30%), Positives = 38/66 (57%), Gaps = 12/66 (18%)
Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
+++ +R ++KKKKKKK +KK K WI R ++E+ +++ K+ K +K
Sbjct: 32 VRRISQRNRRKKKKKKKLKKKSKE------WILR------KKEQMRRRGKEVKPDSKYTG 79
Query: 282 RKKKRK 287
RK++ +
Sbjct: 80 RKRRPR 85
Score = 29.2 bits (66), Expect = 0.64
Identities = 15/41 (36%), Positives = 25/41 (60%), Gaps = 5/41 (12%)
Query: 254 NRTSRGEEEEEKKKKKKKKKKKTTK-----KKKRKKKRKKK 289
++ R + +KKKKKKK KK +K KK++ ++R K+
Sbjct: 30 SKVRRISQRNRRKKKKKKKLKKKSKEWILRKKEQMRRRGKE 70
Score = 28.0 bits (63), Expect = 1.8
Identities = 13/41 (31%), Positives = 23/41 (56%), Gaps = 3/41 (7%)
Query: 253 INRTSRGEEEEEKKKKKKKKKKKTTKKK---KRKKKRKKKK 290
I+ + + ++KKKKKKK K K RKK++ +++
Sbjct: 27 IDASKVRRISQRNRRKKKKKKKLKKKSKEWILRKKEQMRRR 67
Score = 27.6 bits (62), Expect = 2.3
Identities = 11/32 (34%), Positives = 20/32 (62%)
Query: 259 GEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKK 290
+E+ + K ++ ++ KKKK+KK +KK K
Sbjct: 23 DDEQIDASKVRRISQRNRRKKKKKKKLKKKSK 54
Score = 26.5 bits (59), Expect = 5.0
Identities = 13/42 (30%), Positives = 21/42 (50%)
Query: 254 NRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQHHR 295
+ + ++ ++KKKKK KKK K+ +KK Q R
Sbjct: 25 EQIDASKVRRISQRNRRKKKKKKKLKKKSKEWILRKKEQMRR 66
Score = 26.5 bits (59), Expect = 5.0
Identities = 11/32 (34%), Positives = 22/32 (68%)
Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKN 291
E++E+ K ++ + ++KK+KKK+ KKK+
Sbjct: 22 EDDEQIDASKVRRISQRNRRKKKKKKKLKKKS 53
>gnl|CDD|203848 pfam08079, Ribosomal_L30_N, Ribosomal L30 N-terminal domain. This
presumed domain is found at the N-terminus of Ribosomal
L30 proteins and has been termed RL30NT or NUC018.
Length = 71
Score = 32.6 bits (75), Expect = 0.027
Identities = 15/58 (25%), Positives = 29/58 (50%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKK 280
K+K+ K + K+ KK+ KK + R + R + +E ++ + + K+ KKK
Sbjct: 5 KRKRNEKLRAKRAKKRAAKKAARKKKRKLIFKRAEKYVKEYRAAERDEIRLKREAKKK 62
Score = 28.3 bits (64), Expect = 0.92
Identities = 9/39 (23%), Positives = 21/39 (53%)
Query: 262 EEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQHHRDVWQK 300
E+ + K+ KK+ K +KK++K K+ ++ ++
Sbjct: 10 EKLRAKRAKKRAAKKAARKKKRKLIFKRAEKYVKEYRAA 48
Score = 27.6 bits (62), Expect = 1.8
Identities = 10/36 (27%), Positives = 19/36 (52%)
Query: 266 KKKKKKKKKKTTKKKKRKKKRKKKKNQHHRDVWQKK 301
KK+K+ +K K+ +K+ KK + R + K+
Sbjct: 2 LLKKRKRNEKLRAKRAKKRAAKKAARKKKRKLIFKR 37
>gnl|CDD|224415 COG1498, SIK1, Protein implicated in ribosomal biogenesis, Nop56p
homolog [Translation, ribosomal structure and
biogenesis].
Length = 395
Score = 35.5 bits (82), Expect = 0.028
Identities = 17/70 (24%), Positives = 31/70 (44%), Gaps = 17/70 (24%)
Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
++ +KR +K K+K K K K E KK++ + ++ K+KK
Sbjct: 343 REELEKRIEKLKEKPPKPPTKAK-----------------PERDKKERPGRYRRKKKEKK 385
Query: 282 RKKKRKKKKN 291
K +R+ +N
Sbjct: 386 AKSERRGLQN 395
Score = 32.8 bits (75), Expect = 0.21
Identities = 10/41 (24%), Positives = 19/41 (46%)
Query: 255 RTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQHHR 295
+ + + K K ++ KK + R+KK++KK R
Sbjct: 351 EKLKEKPPKPPTKAKPERDKKERPGRYRRKKKEKKAKSERR 391
>gnl|CDD|221185 pfam11719, Drc1-Sld2, DNA replication and checkpoint protein.
Genome duplication is precisely regulated by
cyclin-dependent kinases CDKs, which bring about the
onset of S phase by activating replication origins and
then prevent relicensing of origins until mitosis is
completed. The optimum sequence motif for CDK
phosphorylation is S/T-P-K/R-K/R, and Drc1-Sld2 is found
to have at least 11 potential phosphorylation sites.
Drc1 is required for DNA synthesis and S-M replication
checkpoint control. Drc1 associates with Cdc2 and is
phosphorylated at the onset of S phase when Cdc2 is
activated. Thus Cdc2 promotes DNA replication by
phosphorylating Drc1 and regulating its association with
Cut5. Sld2 and Sld3 represent the minimal set of S-CDK
substrates required for DNA replication.
Length = 397
Score = 35.5 bits (82), Expect = 0.030
Identities = 21/89 (23%), Positives = 41/89 (46%), Gaps = 16/89 (17%)
Query: 218 PGFVMKKK-KKRKKKK-------KKKKKKKRKKKKNQHHRDVWINRTSR--------GEE 261
P V KKK +KR ++ K + + + H ++ S G +
Sbjct: 280 PRRVFKKKGQKRTTRRVKMRPVRAKPSDEPSLPESDIHEEIPKLDEKSLSEFLGYMGGID 339
Query: 262 EEEKKKKKKKKKKKTTKKKKRKKKRKKKK 290
E+++ + ++ K++ KK+K KKK +K+K
Sbjct: 340 EDDEDEDDEESKEEVEKKQKVKKKPRKRK 368
Score = 28.2 bits (63), Expect = 6.6
Identities = 18/71 (25%), Positives = 25/71 (35%), Gaps = 10/71 (14%)
Query: 232 KKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKK----------K 281
K +K + D E +EE +KK+K KKK +K K
Sbjct: 321 PKLDEKSLSEFLGYMGGIDEDDEDEDDEESKEEVEKKQKVKKKPRKRKVNPVSNNFRRLK 380
Query: 282 RKKKRKKKKNQ 292
+ KRKK
Sbjct: 381 LRNKRKKGFKG 391
>gnl|CDD|227492 COG5163, NOP7, Protein required for biogenesis of the 60S ribosomal
subunit [Translation, ribosomal structure and
biogenesis].
Length = 591
Score = 35.4 bits (81), Expect = 0.032
Identities = 18/63 (28%), Positives = 35/63 (55%)
Query: 224 KKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRK 283
K KKRK +++++KK + + + ++ ++EE+ + KKKKK+ K+KK
Sbjct: 529 KNKKRKVDEEEEEKKLKMIMMSNKQKKLYKKMKYSNAKKEEQAENLKKKKKQIAKQKKLD 588
Query: 284 KKR 286
K+
Sbjct: 589 SKK 591
Score = 31.2 bits (70), Expect = 0.81
Identities = 19/64 (29%), Positives = 32/64 (50%), Gaps = 3/64 (4%)
Query: 230 KKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKR--- 286
K K K KKRK + + + + + S +++ KK K KK+ + +KKK+
Sbjct: 523 KDVNKSKNKKRKVDEEEEEKKLKMIMMSNKQKKLYKKMKYSNAKKEEQAENLKKKKKQIA 582
Query: 287 KKKK 290
K+KK
Sbjct: 583 KQKK 586
Score = 29.7 bits (66), Expect = 2.1
Identities = 15/64 (23%), Positives = 35/64 (54%)
Query: 226 KKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKK 285
K K K KK+K ++++ + + N+ + ++ + KK+++ + KKKK++
Sbjct: 523 KDVNKSKNKKRKVDEEEEEKKLKMIMMSNKQKKLYKKMKYSNAKKEEQAENLKKKKKQIA 582
Query: 286 RKKK 289
++KK
Sbjct: 583 KQKK 586
>gnl|CDD|237035 PRK12280, rplW, 50S ribosomal protein L23; Reviewed.
Length = 158
Score = 34.4 bits (79), Expect = 0.033
Identities = 21/68 (30%), Positives = 33/68 (48%), Gaps = 9/68 (13%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
K++K+ K+ ++K+ K KK+K + + E+ KKK K K T KK
Sbjct: 97 KEQKEVSKETEEKEAIKAKKEKKE---------KKEKKVAEKLAKKKSTKTTKNTTKKAT 147
Query: 283 KKKRKKKK 290
KK KK+
Sbjct: 148 KKTTTKKE 155
Score = 31.3 bits (71), Expect = 0.32
Identities = 20/71 (28%), Positives = 34/71 (47%), Gaps = 1/71 (1%)
Query: 221 VMKKKKKRKKKKKKKKKKKRKKKKNQHHRD-VWINRTSRGEEEEEKKKKKKKKKKKTTKK 279
+ KK K+ + KK + IN E+E+K+ K+ ++K+ K
Sbjct: 55 IFNVDKKPKRLGRFPGFTNSYKKAYVTLAEGYSINLFPEESEKEQKEVSKETEEKEAIKA 114
Query: 280 KKRKKKRKKKK 290
KK KK++K+KK
Sbjct: 115 KKEKKEKKEKK 125
Score = 30.9 bits (70), Expect = 0.48
Identities = 18/72 (25%), Positives = 33/72 (45%), Gaps = 17/72 (23%)
Query: 229 KKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKK 288
+K++K+ K+ +K+ + KK+KK+KK+KK +K +KK K
Sbjct: 96 EKEQKEVSKETEEKEA-----------------IKAKKEKKEKKEKKVAEKLAKKKSTKT 138
Query: 289 KKNQHHRDVWQK 300
KN + +
Sbjct: 139 TKNTTKKATKKT 150
Score = 30.1 bits (68), Expect = 0.74
Identities = 13/44 (29%), Positives = 21/44 (47%)
Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQHHRDVWQKKYT 303
+E E+K+ K KK+K KK+K+ ++ KK K
Sbjct: 103 SKETEEKEAIKAKKEKKEKKEKKVAEKLAKKKSTKTTKNTTKKA 146
Score = 29.7 bits (67), Expect = 0.98
Identities = 19/62 (30%), Positives = 31/62 (50%), Gaps = 10/62 (16%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
++K+ K KK+KK+KK++K + + + + K KK KKTT KK+
Sbjct: 107 EEKEAIKAKKEKKEKKEKKVAEKLAKK----------KSTKTTKNTTKKATKKTTTKKEE 156
Query: 283 KK 284
K
Sbjct: 157 GK 158
>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex. This
entry is characterized by proteins with alternating
conserved and low-complexity regions. Bud13 together
with Snu17p and a newly identified factor,
Pml1p/Ylr016c, form a novel trimeric complex. called The
RES complex, pre-mRNA retention and splicing complex.
Subunits of this complex are not essential for viability
of yeasts but they are required for efficient splicing
in vitro and in vivo. Furthermore, inactivation of this
complex causes pre-mRNA leakage from the nucleus. Bud13
contains a unique, phylogenetically conserved C-terminal
region of unknown function.
Length = 141
Score = 33.8 bits (78), Expect = 0.033
Identities = 16/90 (17%), Positives = 46/90 (51%), Gaps = 17/90 (18%)
Query: 223 KKKKKRKKKKKKK---------KKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKK- 272
+K++K +K++K+K ++++++ ++ + ++ + R + E+ +E+ K++++
Sbjct: 24 EKEEKERKEEKEKEWGKGLVQKEEREKRLEELEKAKNKPLARYADDEDYDEELKEQERWD 83
Query: 273 -------KKKTTKKKKRKKKRKKKKNQHHR 295
+KK K K+ K K +R
Sbjct: 84 DPMAQFLRKKKEKTDKKGKPLYKGPFPPNR 113
Score = 33.4 bits (77), Expect = 0.051
Identities = 14/62 (22%), Positives = 38/62 (61%), Gaps = 8/62 (12%)
Query: 219 GFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTK 278
G ++ ++KR++K+++K++K+RK++K +G ++E+++K+ ++ +K
Sbjct: 9 GRIIDIEEKREEKEREKEEKERKEEK--------EKEWGKGLVQKEEREKRLEELEKAKN 60
Query: 279 KK 280
K
Sbjct: 61 KP 62
Score = 30.7 bits (70), Expect = 0.37
Identities = 17/77 (22%), Positives = 37/77 (48%), Gaps = 18/77 (23%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKK---KKKKTTKK 279
K + ++K+++K++ K++K E +EEK+K+ K +K++ K+
Sbjct: 7 KSGRIIDIEEKREEKEREKEEK---------------ERKEEKEKEWGKGLVQKEEREKR 51
Query: 280 KKRKKKRKKKKNQHHRD 296
+ +K K K + D
Sbjct: 52 LEELEKAKNKPLARYAD 68
Score = 30.0 bits (68), Expect = 0.86
Identities = 12/54 (22%), Positives = 30/54 (55%), Gaps = 8/54 (14%)
Query: 248 HRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKK--------KRKKKRKKKKNQH 293
+RD E+ EEK+++K++K++K K+K K +++++ ++ +
Sbjct: 4 YRDKSGRIIDIEEKREEKEREKEEKERKEEKEKEWGKGLVQKEEREKRLEELEK 57
>gnl|CDD|220102 pfam09073, BUD22, BUD22. BUD22 has been shown in yeast to be a
nuclear protein involved in bud-site selection. It plays
a role in positioning the proximal bud pole signal. More
recently it has been shown to be involved in ribosome
biogenesis.
Length = 424
Score = 35.2 bits (81), Expect = 0.033
Identities = 18/74 (24%), Positives = 29/74 (39%), Gaps = 8/74 (10%)
Query: 229 KKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKK 288
KK K+KK + G E+E+ + + K KRK +R +
Sbjct: 258 PPSKKPKEKKTSSTFLPSLMGGYF-----SGSEDEDDDDEDIDPDQVVKKPVKRKNRRGQ 312
Query: 289 KKNQHHRDVWQKKY 302
+ Q +W+KKY
Sbjct: 313 RARQ---AIWEKKY 323
Score = 35.2 bits (81), Expect = 0.037
Identities = 16/42 (38%), Positives = 22/42 (52%)
Query: 221 VMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEE 262
++ + K KK K KKK KK KKK+ + D S E+E
Sbjct: 139 ILGIETKAKKGKAKKKTKKSKKKEAKESSDKDDEEESESEDE 180
Score = 33.3 bits (76), Expect = 0.17
Identities = 17/55 (30%), Positives = 23/55 (41%), Gaps = 1/55 (1%)
Query: 239 KRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQH 293
K KN + G E + KK K KKK K +KKK+ K+ K +
Sbjct: 121 NSKPVKNVLESIMEGLDKILGIETK-AKKGKAKKKTKKSKKKEAKESSDKDDEEE 174
Score = 31.7 bits (72), Expect = 0.50
Identities = 14/43 (32%), Positives = 21/43 (48%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEK 265
KK K KKK KK KKK+ K+ ++ + + EE +
Sbjct: 146 AKKGKAKKKTKKSKKKEAKESSDKDDEEESESEDESKSEESAE 188
Score = 29.8 bits (67), Expect = 1.7
Identities = 17/67 (25%), Positives = 33/67 (49%), Gaps = 1/67 (1%)
Query: 224 KKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRK 283
KK K KK+++K +K++ + W R ++ E + K + + T++K +
Sbjct: 321 KKYGSGAKHVKKEREKEQKEREGRQSE-WEARQAKREGGDAKAGRAAEPTGSRTQQKGDR 379
Query: 284 KKRKKKK 290
KR +KK
Sbjct: 380 PKRGEKK 386
Score = 28.6 bits (64), Expect = 4.5
Identities = 13/41 (31%), Positives = 18/41 (43%)
Query: 228 RKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKK 268
K KK K KKK KK K + ++ E E++ K
Sbjct: 143 ETKAKKGKAKKKTKKSKKKEAKESSDKDDEEESESEDESKS 183
Score = 28.6 bits (64), Expect = 4.8
Identities = 13/53 (24%), Positives = 24/53 (45%), Gaps = 5/53 (9%)
Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKK 274
+ K + K KK K KK+ KK + S +++EE+ + + + K
Sbjct: 136 LDKILGIETKAKKGKAKKKTKKSKKKEAK-----ESSDKDDEEESESEDESKS 183
Score = 27.5 bits (61), Expect = 9.6
Identities = 17/74 (22%), Positives = 28/74 (37%), Gaps = 17/74 (22%)
Query: 218 PGFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKK----------- 266
+ ++ K+ K + + + Q D R RGE+++ KK
Sbjct: 346 SEWEARQAKREGGDAKAGRAAEPTGSRTQQKGD----RPKRGEKKKPKKPSVDKPLHPSW 401
Query: 267 --KKKKKKKKKTTK 278
KKK K+KK K
Sbjct: 402 EAKKKAKEKKANAK 415
>gnl|CDD|219947 pfam08639, SLD3, DNA replication regulator SLD3. The SLD3 DNA
replication regulator is required for loading and
maintenance of Cdc45 on chromatin during DNA
replication.
Length = 437
Score = 35.2 bits (81), Expect = 0.036
Identities = 12/36 (33%), Positives = 22/36 (61%)
Query: 262 EEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQHHRDV 297
E+E KKK K+ + + K+RK++++KK + V
Sbjct: 127 EKEYKKKLLKRSQNLDRSKRRKRRKRKKNKKQDLPV 162
Score = 31.3 bits (71), Expect = 0.55
Identities = 14/37 (37%), Positives = 23/37 (62%)
Query: 255 RTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKN 291
R + E++ KKK K+ + + KR+K+RK+KKN
Sbjct: 119 ELDRFTKFEKEYKKKLLKRSQNLDRSKRRKRRKRKKN 155
Score = 29.8 bits (67), Expect = 1.9
Identities = 13/38 (34%), Positives = 21/38 (55%)
Query: 256 TSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQH 293
E+E KKK K+ + K+++++KRKK K Q
Sbjct: 122 RFTKFEKEYKKKLLKRSQNLDRSKRRKRRKRKKNKKQD 159
>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein. This protein is found to be part
of a large ribonucleoprotein complex containing the U3
snoRNA. Depletion of the Utp proteins impedes production
of the 18S rRNA, indicating that they are part of the
active pre-rRNA processing complex. This large RNP
complex has been termed the small subunit (SSU)
processome.
Length = 728
Score = 35.4 bits (82), Expect = 0.037
Identities = 14/59 (23%), Positives = 22/59 (37%), Gaps = 7/59 (11%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
K K KK+ K+K ++ + + K KKKKKK+K+
Sbjct: 497 LKTTSSVGKSAKKQDSKKKSSSKL-------DKAANKISKAAVKVKKKKKKEKSIDLDD 548
Score = 34.3 bits (79), Expect = 0.080
Identities = 19/51 (37%), Positives = 27/51 (52%), Gaps = 14/51 (27%)
Query: 221 VMKKKKKRKKKKKKK--KKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKK 269
K+K+KR+ K + KK+KRK KK ++ V IN EK+ KK
Sbjct: 630 KKKRKRKRRFLTKIEGVKKEKRKDKKLKN---VIIN---------EKRNKK 668
Score = 32.7 bits (75), Expect = 0.22
Identities = 13/32 (40%), Positives = 23/32 (71%), Gaps = 2/32 (6%)
Query: 264 EKKKKKKKKKKKTTKKKKR--KKKRKKKKNQH 293
+K+KKK+K+K++ K + K+KRK KK ++
Sbjct: 627 KKRKKKRKRKRRFLTKIEGVKKEKRKDKKLKN 658
Score = 32.7 bits (75), Expect = 0.26
Identities = 22/81 (27%), Positives = 37/81 (45%), Gaps = 16/81 (19%)
Query: 221 VMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKK-------- 272
V K KR +K +K+++++ ++N W+ TS + KK+ KKK
Sbjct: 470 VANKLLKRSEKAQKEEEEEELDEENP-----WLKTTSS-VGKSAKKQDSKKKSSSKLDKA 523
Query: 273 --KKKTTKKKKRKKKRKKKKN 291
K K +KKK+K+K
Sbjct: 524 ANKISKAAVKVKKKKKKEKSI 544
Score = 31.6 bits (72), Expect = 0.51
Identities = 21/79 (26%), Positives = 34/79 (43%), Gaps = 4/79 (5%)
Query: 224 KKKKRKKKKKKKK----KKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKK 279
K KR KK K KK KK K K+ + + EE +K ++++ +++ T K
Sbjct: 218 AKAKRIKKIKSKKYHRVHKKEKLKEELKEFEELVKADPEAALEELEKLERRRAEERMTLK 277
Query: 280 KKRKKKRKKKKNQHHRDVW 298
K K K + R +
Sbjct: 278 HKNTSKWAKSMLKTGRAKY 296
Score = 31.6 bits (72), Expect = 0.62
Identities = 19/100 (19%), Positives = 36/100 (36%), Gaps = 21/100 (21%)
Query: 218 PGFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKK--- 274
P K+ + +K KK+ K + K KK+ ++ + + K+ +K +K
Sbjct: 427 PENGEKEAESKKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQKEEE 486
Query: 275 ------------------KTTKKKKRKKKRKKKKNQHHRD 296
K+ KK+ KKK K ++
Sbjct: 487 EEELDEENPWLKTTSSVGKSAKKQDSKKKSSSKLDKAANK 526
Score = 31.2 bits (71), Expect = 0.74
Identities = 12/32 (37%), Positives = 21/32 (65%)
Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKN 291
++ ++K+K+K++ K KK K+K KK KN
Sbjct: 627 KKRKKKRKRKRRFLTKIEGVKKEKRKDKKLKN 658
Score = 30.0 bits (68), Expect = 1.7
Identities = 20/60 (33%), Positives = 28/60 (46%), Gaps = 18/60 (30%)
Query: 226 KKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKK 285
KKRKKK+K+K++ K E +K+K+K KK K +KR KK
Sbjct: 627 KKRKKKRKRKRRFLTKI------------------EGVKKEKRKDKKLKNVIINEKRNKK 668
Score = 29.3 bits (66), Expect = 3.0
Identities = 25/124 (20%), Positives = 36/124 (29%), Gaps = 55/124 (44%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEE------------------- 263
K K KKKKKK+K + + I EE+E
Sbjct: 527 ISKAAVKVKKKKKKEKSIDLDDDLIDEEDSIKLDVDDEEDEDDEELPFLFKQKDLIKEAF 586
Query: 264 -----------EKKK-------------------------KKKKKKKKTTKKKKRKKKRK 287
EKK+ KK+KKK+K ++ K +
Sbjct: 587 AGDDVVAEFEKEKKEVIEEEDPKEIDLTLPGWGSWAGDGIKKRKKKRKRKRRFLTKIEGV 646
Query: 288 KKKN 291
KK+
Sbjct: 647 KKEK 650
Score = 28.9 bits (65), Expect = 4.6
Identities = 17/77 (22%), Positives = 38/77 (49%)
Query: 225 KKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKK 284
K ++ + +KK++ + + R++ S EE EE KK ++K + +++
Sbjct: 375 SMKFMQRAEARKKEENDAEIEELRRELEGEEESDEEENEEPSKKNVGRRKFGPENGEKEA 434
Query: 285 KRKKKKNQHHRDVWQKK 301
+ KK K ++ + +KK
Sbjct: 435 ESKKLKKENKNEFKEKK 451
Score = 27.7 bits (62), Expect = 9.0
Identities = 11/27 (40%), Positives = 16/27 (59%)
Query: 262 EEEKKKKKKKKKKKTTKKKKRKKKRKK 288
EE K K+ KK K K + +K+K K+
Sbjct: 216 EEAKAKRIKKIKSKKYHRVHKKEKLKE 242
>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
Length = 1066
Score = 35.3 bits (81), Expect = 0.037
Identities = 19/66 (28%), Positives = 28/66 (42%), Gaps = 17/66 (25%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
KKKK+ K K+K+ KK K +K E + K + ++ KK
Sbjct: 21 KKKKEEKAKEKELKKLKAAQK-----------------EAKAKLQAQQASDGTNVPKKSE 63
Query: 283 KKKRKK 288
KK RK+
Sbjct: 64 KKSRKR 69
Score = 34.1 bits (78), Expect = 0.091
Identities = 19/77 (24%), Positives = 36/77 (46%), Gaps = 16/77 (20%)
Query: 226 KKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKK- 284
+ +KK +++ +RKKKK + + +E+E KK K +K+ K + ++
Sbjct: 6 SEAEKKILTEEELERKKKKEE-----------KAKEKELKKLKAAQKEAKAKLQAQQASD 54
Query: 285 ----KRKKKKNQHHRDV 297
+K +K RDV
Sbjct: 55 GTNVPKKSEKKSRKRDV 71
Score = 29.5 bits (66), Expect = 2.5
Identities = 16/67 (23%), Positives = 35/67 (52%), Gaps = 7/67 (10%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
KK ++ ++KKKK+++ K+K + + ++E + K + ++ T KK
Sbjct: 10 KKILTEEELERKKKKEEKAKEKE-------LKKLKAAQKEAKAKLQAQQASDGTNVPKKS 62
Query: 283 KKKRKKK 289
+KK +K+
Sbjct: 63 EKKSRKR 69
>gnl|CDD|220818 pfam10595, UPF0564, Uncharacterized protein family UPF0564. This
family of proteins has no known function. However, one
of the members is annotated as an EF-hand family
protein.
Length = 349
Score = 35.1 bits (81), Expect = 0.039
Identities = 12/76 (15%), Positives = 43/76 (56%), Gaps = 1/76 (1%)
Query: 217 IPGFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKT 276
+ F+ ++K+++ K++++KK+ ++KK + + + +K ++++K++
Sbjct: 274 LVKFLRTERKEKEAKEQQEKKELEQRKKKKKEMAP-KVKQRFEANDPAQKLQEERKEQLA 332
Query: 277 TKKKKRKKKRKKKKNQ 292
+K+ K++ K+ + +
Sbjct: 333 KLRKEEKEREKEYEQE 348
Score = 30.5 bits (69), Expect = 1.0
Identities = 16/91 (17%), Positives = 38/91 (41%), Gaps = 17/91 (18%)
Query: 220 FVMKKKKKRKKKKKK-----KKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKK 274
F ++++K+ +K +++ +K K + ++I +EEE K+K + + +
Sbjct: 94 FYEREEQKKAILPRKLRSSTSEREPKKFKAKPVPKSIYIPLLKDKMQEEELKRKIRVQMR 153
Query: 275 K------------TTKKKKRKKKRKKKKNQH 293
K + +++ KKKK
Sbjct: 154 AQELLQSSRLPPRMAKHEAQERLTKKKKRGQ 184
Score = 30.5 bits (69), Expect = 1.0
Identities = 11/62 (17%), Positives = 33/62 (53%)
Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
++ + ++K K+ + + R + ++ ++E+++ +++KKKKK+ K K
Sbjct: 252 AGEENLKPTRRKLPPSTKKWESLVKFLRTERKEKEAKEQQEKKELEQRKKKKKEMAPKVK 311
Query: 282 RK 283
++
Sbjct: 312 QR 313
Score = 29.7 bits (67), Expect = 1.9
Identities = 15/63 (23%), Positives = 33/63 (52%)
Query: 227 KRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKR 286
++ K ++K + V RT R E+E +++++KK+ +++ KKK+ K
Sbjct: 251 SAGEENLKPTRRKLPPSTKKWESLVKFLRTERKEKEAKEQQEKKELEQRKKKKKEMAPKV 310
Query: 287 KKK 289
K++
Sbjct: 311 KQR 313
Score = 29.0 bits (65), Expect = 3.0
Identities = 20/74 (27%), Positives = 35/74 (47%), Gaps = 10/74 (13%)
Query: 217 IPGFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKK------ 270
IP K +++ K+K + + + ++ ++ R ++ E +E KKKK
Sbjct: 132 IPLLKDKMQEEELKRKIRVQMRAQELLQSSR----LPPRMAKHEAQERLTKKKKRGQKKS 187
Query: 271 KKKKKTTKKKKRKK 284
K KKKT K K+ K
Sbjct: 188 KYKKKTFKPKRAKS 201
Score = 27.8 bits (62), Expect = 7.1
Identities = 13/72 (18%), Positives = 33/72 (45%)
Query: 221 VMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKK 280
+ ++ ++ K ++K W + E ++K+ K++++KK +++
Sbjct: 240 SSRTYLDQENISAGEENLKPTRRKLPPSTKKWESLVKFLRTERKEKEAKEQQEKKELEQR 299
Query: 281 KRKKKRKKKKNQ 292
K+KKK K +
Sbjct: 300 KKKKKEMAPKVK 311
>gnl|CDD|222581 pfam14181, YqfQ, YqfQ-like protein. The YqfQ-like protein family
includes the B. subtilis YqfQ protein, also known as
VrrA, which is functionally uncharacterized. This family
of proteins is found in bacteria. Proteins in this
family are typically between 146 and 237 amino acids in
length. There are two conserved sequence motifs: QYGP
and PKLY.
Length = 155
Score = 34.0 bits (78), Expect = 0.041
Identities = 9/33 (27%), Positives = 17/33 (51%)
Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQ 292
+E E++ + K + KKK+ K K +K +
Sbjct: 106 TDETEQEDPPETKTESKEKKKREVPKPKTEKEK 138
Score = 33.6 bits (77), Expect = 0.050
Identities = 9/38 (23%), Positives = 20/38 (52%)
Query: 256 TSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQH 293
EE ++ +++ + KT K+K+K++ K K +
Sbjct: 99 EETEEESTDETEQEDPPETKTESKEKKKREVPKPKTEK 136
Score = 32.8 bits (75), Expect = 0.084
Identities = 12/39 (30%), Positives = 22/39 (56%)
Query: 254 NRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQ 292
+ T + + E K + K+KKK++ K K K+K K + +
Sbjct: 107 DETEQEDPPETKTESKEKKKREVPKPKTEKEKPKTEPKK 145
Score = 31.6 bits (72), Expect = 0.21
Identities = 16/63 (25%), Positives = 26/63 (41%), Gaps = 18/63 (28%)
Query: 225 KKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKK 284
+ +++ + K + K KKK+ E K K +K+K KT KK +
Sbjct: 108 ETEQEDPPETKTESKEKKKR------------------EVPKPKTEKEKPKTEPKKPKPS 149
Query: 285 KRK 287
K K
Sbjct: 150 KPK 152
Score = 31.3 bits (71), Expect = 0.28
Identities = 13/51 (25%), Positives = 21/51 (41%), Gaps = 8/51 (15%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKK 273
+++ + K + K+KKKR+ K + E KK K K K
Sbjct: 110 EQEDPPETKTESKEKKKREVPK--------PKTEKEKPKTEPKKPKPSKPK 152
Score = 30.1 bits (68), Expect = 0.87
Identities = 13/31 (41%), Positives = 20/31 (64%)
Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRKKKK 290
+ E K+KKK++ K T+K+K K + KK K
Sbjct: 117 TKTESKEKKKREVPKPKTEKEKPKTEPKKPK 147
Score = 29.3 bits (66), Expect = 1.4
Identities = 10/39 (25%), Positives = 21/39 (53%)
Query: 254 NRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQ 292
+S EEEE +++ + +++ + K + K KKK+
Sbjct: 91 LSSSDDEEEETEEESTDETEQEDPPETKTESKEKKKREV 129
Score = 29.0 bits (65), Expect = 2.1
Identities = 11/31 (35%), Positives = 20/31 (64%)
Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRKKKK 290
+ E ++KKK++ K KT K+K + + +K K
Sbjct: 118 KTESKEKKKREVPKPKTEKEKPKTEPKKPKP 148
>gnl|CDD|214395 CHL00204, ycf1, Ycf1; Provisional.
Length = 1832
Score = 35.1 bits (81), Expect = 0.044
Identities = 13/61 (21%), Positives = 27/61 (44%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
+ + + +K+ + + + K+K + V N+ EE+ + KK+K KK K
Sbjct: 1521 ENRNQEEKEPAGQGELESDKEKKGNLESVLSNQEKNIEEDYAESDIKKRKNKKQYKSNTE 1580
Query: 283 K 283
Sbjct: 1581 A 1581
Score = 32.4 bits (74), Expect = 0.40
Identities = 16/74 (21%), Positives = 30/74 (40%), Gaps = 5/74 (6%)
Query: 223 KKKKKRKKKKKK-----KKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTT 277
KKK +KK + ++ K+ Q + + E ++K ++ +
Sbjct: 1505 KKKGFIPSNEKKSIEIENRNQEEKEPAGQGELESDKEKKGNLESVLSNQEKNIEEDYAES 1564
Query: 278 KKKKRKKKRKKKKN 291
KKRK K++ K N
Sbjct: 1565 DIKKRKNKKQYKSN 1578
Score = 30.1 bits (68), Expect = 1.6
Identities = 13/72 (18%), Positives = 30/72 (41%)
Query: 219 GFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTK 278
GF+ +KK + + + +++K + + D ++K ++ + K
Sbjct: 1508 GFIPSNEKKSIEIENRNQEEKEPAGQGELESDKEKKGNLESVLSNQEKNIEEDYAESDIK 1567
Query: 279 KKKRKKKRKKKK 290
K+K KK+ K
Sbjct: 1568 KRKNKKQYKSNT 1579
Score = 28.1 bits (63), Expect = 7.7
Identities = 14/28 (50%), Positives = 21/28 (75%)
Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRK 287
+ EEK KKKKKK+KK ++ KR++K +
Sbjct: 739 DSVEEKTKKKKKKEKKKEEEYKREEKAR 766
>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
TolA; Provisional.
Length = 387
Score = 34.8 bits (80), Expect = 0.045
Identities = 15/72 (20%), Positives = 23/72 (31%), Gaps = 11/72 (15%)
Query: 225 KKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKK 284
KKK + + KKK + KKK + + K + K + K +
Sbjct: 200 KKKAEAEAKKKAAAEAKKKAA-----------AEAKAAAAKAAAEAKAAAEKAAAAKAAE 248
Query: 285 KRKKKKNQHHRD 296
K K D
Sbjct: 249 KAAAAKAAAEVD 260
Score = 34.4 bits (79), Expect = 0.057
Identities = 14/79 (17%), Positives = 43/79 (54%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
++K ++ ++++KKK++++ ++ Q + R + E+E +++KK+ ++ K+
Sbjct: 70 QQKSAKRAEEQRKKKEQQQAEELQQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAAL 129
Query: 283 KKKRKKKKNQHHRDVWQKK 301
K+K+ ++ + K
Sbjct: 130 KQKQAEEAAAKAAAAAKAK 148
Score = 33.2 bits (76), Expect = 0.14
Identities = 18/74 (24%), Positives = 23/74 (31%), Gaps = 19/74 (25%)
Query: 222 MKKKKKRKKKKK-----KKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKT 276
KK + KKK K KKK E E +KK + KKK
Sbjct: 175 AAKKAAAEAKKKAEAEAAAKAAAEAKKK--------------AEAEAKKKAAAEAKKKAA 220
Query: 277 TKKKKRKKKRKKKK 290
+ K K +
Sbjct: 221 AEAKAAAAKAAAEA 234
Score = 30.2 bits (68), Expect = 1.6
Identities = 15/67 (22%), Positives = 24/67 (35%), Gaps = 8/67 (11%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
K+K+ ++ K K K + R +K + KKK + KK
Sbjct: 128 ALKQKQAEEAAAKAAAAAKAKAEAEAK--------RAAAAAKKAAAEAKKKAEAEAAKKA 179
Query: 283 KKKRKKK 289
+ KKK
Sbjct: 180 AAEAKKK 186
>gnl|CDD|240577 cd12950, RRP7_Rrp7p, RRP7 domain ribosomal RNA-processing protein 7
(Rrp7p) and similar proteins. This CD corresponds to
the RRP7 domain of Rrp7p. Rrp7p is encoded by YCL031C
gene from Saccharomyces cerevisiae. It is an essential
yeast protein involved in pre-rRNA processing and
ribosome assembly. Rrp7p contains an N-terminal RNA
recognition motif (RRM), also termed RBD (RNA binding
domain) or RNP (ribonucleoprotein domain), and a
C-terminal RRP7 domain.
Length = 128
Score = 33.4 bits (77), Expect = 0.047
Identities = 21/75 (28%), Positives = 41/75 (54%), Gaps = 13/75 (17%)
Query: 231 KKKKKKKKKRKKKKNQHHRDVWI-----NRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKK 285
+++++K++RK+ +N+ D ++ R EE K +++KK+K KKKK+K+
Sbjct: 32 AREEERKRERKRLRNEPDEDGFVTVVRGGRKGPAAGEEAGKAAEEEKKEKEKKKKKKKEL 91
Query: 286 --------RKKKKNQ 292
R+KKK +
Sbjct: 92 EDFYRFQLREKKKEE 106
Score = 32.6 bits (75), Expect = 0.090
Identities = 24/76 (31%), Positives = 38/76 (50%), Gaps = 26/76 (34%)
Query: 219 GFVMKKKKKRKKKKKKKKKKK--------RKKKKNQHHRDVWINRTSRGEEEEEKKKKKK 270
G +++KK K+KKKKKKK+ R+KKK EE+ + KK +
Sbjct: 71 GKAAEEEKKEKEKKKKKKKELEDFYRFQLREKKK---------------EEQADLLKKFE 115
Query: 271 KKKKKTTKKKKRKKKR 286
+ K++ +K K+KR
Sbjct: 116 EDKERV---EKMKEKR 128
>gnl|CDD|236978 PRK11778, PRK11778, putative inner membrane peptidase; Provisional.
Length = 330
Score = 34.8 bits (81), Expect = 0.047
Identities = 15/84 (17%), Positives = 37/84 (44%), Gaps = 15/84 (17%)
Query: 212 SISFWIPGFVMKKKKKRKKKKKKK-----KKKKRKKKKNQHHRDVWINRTSRGEEEEEKK 266
+I+ + V ++K+ +K + + ++ K K++ + +++E K
Sbjct: 18 AIAAVVALIVSLAQRKKSQKGELEVTNLNEQYKEMKEELKAAL----------LDKKELK 67
Query: 267 KKKKKKKKKTTKKKKRKKKRKKKK 290
K +KKK ++ K K + K +
Sbjct: 68 AWHKAQKKKEKQEAKAAKAKSKPR 91
Score = 33.7 bits (78), Expect = 0.097
Identities = 10/36 (27%), Positives = 20/36 (55%), Gaps = 2/36 (5%)
Query: 220 FVMKKK--KKRKKKKKKKKKKKRKKKKNQHHRDVWI 253
++ KK K K +KKK+K++ K K + +++
Sbjct: 59 ALLDKKELKAWHKAQKKKEKQEAKAAKAKSKPRLFV 94
Score = 32.9 bits (76), Expect = 0.18
Identities = 14/70 (20%), Positives = 32/70 (45%), Gaps = 6/70 (8%)
Query: 229 KKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKK 288
++KK +K + N+ ++++ + E + KK+ K +KKK K++ K
Sbjct: 30 AQRKKSQKGELEVTNLNEQYKEM------KEELKAALLDKKELKAWHKAQKKKEKQEAKA 83
Query: 289 KKNQHHRDVW 298
K + ++
Sbjct: 84 AKAKSKPRLF 93
>gnl|CDD|218482 pfam05178, Kri1, KRI1-like family. The yeast member of this family
(Kri1p) is found to be required for 40S ribosome
biogenesis in the nucleolus.
Length = 99
Score = 32.6 bits (75), Expect = 0.049
Identities = 9/31 (29%), Positives = 23/31 (74%)
Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRKKKK 290
+EEE+ +++++ K+ K K+++ ++K +K K
Sbjct: 4 KEEEKAQREEELKRLKNLKREEIEEKLEKIK 34
Score = 29.5 bits (67), Expect = 0.57
Identities = 10/33 (30%), Positives = 21/33 (63%)
Query: 258 RGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKK 290
R EEE+ +++++ K+ K +++ +K K KK
Sbjct: 3 RKEEEKAQREEELKRLKNLKREEIEEKLEKIKK 35
Score = 28.0 bits (63), Expect = 2.1
Identities = 7/20 (35%), Positives = 17/20 (85%)
Query: 226 KKRKKKKKKKKKKKRKKKKN 245
K+RK+++K +++++ K+ KN
Sbjct: 1 KERKEEEKAQREEELKRLKN 20
Score = 27.6 bits (62), Expect = 3.3
Identities = 6/25 (24%), Positives = 18/25 (72%)
Query: 263 EEKKKKKKKKKKKTTKKKKRKKKRK 287
+E+K+++K ++++ K+ K K+ +
Sbjct: 1 KERKEEEKAQREEELKRLKNLKREE 25
Score = 27.6 bits (62), Expect = 3.3
Identities = 5/25 (20%), Positives = 18/25 (72%)
Query: 264 EKKKKKKKKKKKTTKKKKRKKKRKK 288
+++K+++K +++ K+ + KR++
Sbjct: 1 KERKEEEKAQREEELKRLKNLKREE 25
Score = 27.2 bits (61), Expect = 3.8
Identities = 4/23 (17%), Positives = 16/23 (69%)
Query: 224 KKKKRKKKKKKKKKKKRKKKKNQ 246
+K++ K +++++ K+ + K+ +
Sbjct: 3 RKEEEKAQREEELKRLKNLKREE 25
Score = 27.2 bits (61), Expect = 4.2
Identities = 6/21 (28%), Positives = 15/21 (71%)
Query: 222 MKKKKKRKKKKKKKKKKKRKK 242
++K +R+++ K+ K KR++
Sbjct: 5 EEEKAQREEELKRLKNLKREE 25
Score = 26.5 bits (59), Expect = 6.4
Identities = 5/22 (22%), Positives = 17/22 (77%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKK 244
K+++K +++++ K+ K K+++
Sbjct: 4 KEEEKAQREEELKRLKNLKREE 25
Score = 26.5 bits (59), Expect = 7.8
Identities = 6/21 (28%), Positives = 15/21 (71%)
Query: 224 KKKKRKKKKKKKKKKKRKKKK 244
K++K ++K +++++ KR K
Sbjct: 1 KERKEEEKAQREEELKRLKNL 21
Score = 26.1 bits (58), Expect = 9.1
Identities = 6/28 (21%), Positives = 18/28 (64%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRD 250
K++K+ +K +++++ K+ K K + +
Sbjct: 1 KERKEEEKAQREEELKRLKNLKREEIEE 28
>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682). The
members of this family are all hypothetical eukaryotic
proteins of unknown function. One member is described as
being an adipocyte-specific protein, but no evidence of
this was found.
Length = 322
Score = 34.5 bits (80), Expect = 0.050
Identities = 16/75 (21%), Positives = 46/75 (61%), Gaps = 9/75 (12%)
Query: 221 VMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKK 280
V++K K ++++++K K ++++ + ++ ++EE+KK++++ K K + ++
Sbjct: 257 VLRKVDKTREEEEEKILKAAEEERQE---------EAQEKKEEKKKEEREAKLAKLSPEE 307
Query: 281 KRKKKRKKKKNQHHR 295
+RK + K++K Q +
Sbjct: 308 QRKLEEKERKKQARK 322
Score = 31.1 bits (71), Expect = 0.69
Identities = 12/60 (20%), Positives = 34/60 (56%), Gaps = 9/60 (15%)
Query: 220 FVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKK 279
+ +++R+++ ++KK++K+K+++ EE++K ++K++KK +K
Sbjct: 272 ILKAAEEERQEEAQEKKEEKKKEER---------EAKLAKLSPEEQRKLEEKERKKQARK 322
>gnl|CDD|184900 PRK14907, rplD, 50S ribosomal protein L4; Provisional.
Length = 295
Score = 34.2 bits (78), Expect = 0.058
Identities = 20/69 (28%), Positives = 28/69 (40%)
Query: 221 VMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKK 280
K KK+ ++KK KK K TS ++ K KK K K TTKK
Sbjct: 4 TKKTTKKKTTEEKKPAAKKATTSKETAKTKKTAKTTSTKAAKKAAKVKKTKSVKTTTKKV 63
Query: 281 KRKKKRKKK 289
K ++ +
Sbjct: 64 TVKFEKTES 72
Score = 33.4 bits (76), Expect = 0.12
Identities = 19/69 (27%), Positives = 31/69 (44%)
Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
KK K+K ++KK K+ + + +T+ + ++ K KK K KTT KK
Sbjct: 4 TKKTTKKKTTEEKKPAAKKATTSKETAKTKKTAKTTSTKAAKKAAKVKKTKSVKTTTKKV 63
Query: 282 RKKKRKKKK 290
K K +
Sbjct: 64 TVKFEKTES 72
Score = 31.5 bits (71), Expect = 0.51
Identities = 18/80 (22%), Positives = 30/80 (37%), Gaps = 9/80 (11%)
Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
KK +K K+ K +K K +++ ++ K KK K K T K
Sbjct: 15 EKKPAAKKATTSKETAKTKKTAKTT---------STKAAKKAAKVKKTKSVKTTTKKVTV 65
Query: 282 RKKKRKKKKNQHHRDVWQKK 301
+ +K + K + KK
Sbjct: 66 KFEKTESVKKESVAKKTVKK 85
Score = 29.9 bits (67), Expect = 1.7
Identities = 16/67 (23%), Positives = 23/67 (34%)
Query: 221 VMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKK 280
K + KK K KK + K V +T ++E KK KK+ +
Sbjct: 35 TAKTTSTKAAKKAAKVKKTKSVKTTTKKVTVKFEKTESVKKESVAKKTVKKEAVSAEVFE 94
Query: 281 KRKKKRK 287
K K
Sbjct: 95 ASNKLFK 101
Score = 28.4 bits (63), Expect = 4.7
Identities = 17/71 (23%), Positives = 26/71 (36%), Gaps = 4/71 (5%)
Query: 223 KKKKKRK----KKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTK 278
K KK K K KK K K+ K + V + ++E KK KK+ +
Sbjct: 31 KTKKTAKTTSTKAAKKAAKVKKTKSVKTTTKKVTVKFEKTESVKKESVAKKTVKKEAVSA 90
Query: 279 KKKRKKKRKKK 289
+ + K
Sbjct: 91 EVFEASNKLFK 101
>gnl|CDD|221177 pfam11708, Slu7, Pre-mRNA splicing Prp18-interacting factor. The
spliceosome, an assembly of snRNAs (U1, U2, U4/U6, and
U5) and proteins, catalyzes the excision of introns from
pre-mRNAs in two successive trans-esterification
reactions. Step 2 depends upon integral spliceosome
constituents such as U5 snRNA and Prp8 and
non-spliceosomal proteins Prp16, Slu7, Prp18, and Prp22.
ATP hydrolysis by the DEAH-box enzyme Prp16 promotes a
conformational change in the spliceosome that leads to
protection of the 3'ss from targeted RNase H cleavage.
This change, which probably reflects binding of the 3'ss
PyAG in the catalytic centre of the spliceosome,
requires the ordered recruitment of Slu7, Prp18, and
Prp22 to the spliceosome. There is a close functional
relationship between Prp8, Prp18, and Slu7, and Prp18
interacts with Slu7, so that together they recruit Prp22
to the spliceosome. Most members of the family carry a
zinc-finger of the CCHC-type upstream of this domain.
Length = 236
Score = 33.9 bits (78), Expect = 0.064
Identities = 16/62 (25%), Positives = 31/62 (50%), Gaps = 9/62 (14%)
Query: 222 MKKKKKRKKKKKKKKKKKRK----KKKNQHH-----RDVWINRTSRGEEEEEKKKKKKKK 272
+KK K+KK++ K +K++ K + H +++ + ++ E + KKKK K
Sbjct: 159 ELLRKKEKEKKEQLKIQKKQSLLEKYGGEEHLDKPPKELLLGQSEDYVEYDRAGKKKKAK 218
Query: 273 KK 274
K
Sbjct: 219 SK 220
Score = 30.9 bits (70), Expect = 0.64
Identities = 12/40 (30%), Positives = 22/40 (55%)
Query: 264 EKKKKKKKKKKKTTKKKKRKKKRKKKKNQHHRDVWQKKYT 303
E +KK+K+KK+ K +K++ +K + H D K+
Sbjct: 159 ELLRKKEKEKKEQLKIQKKQSLLEKYGGEEHLDKPPKELL 198
Score = 30.9 bits (70), Expect = 0.69
Identities = 13/66 (19%), Positives = 32/66 (48%)
Query: 232 KKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKN 291
+KK+K++K++ + + + E ++ K+ + + + R K+KK K+
Sbjct: 160 LLRKKEKEKKEQLKIQKKQSLLEKYGGEEHLDKPPKELLLGQSEDYVEYDRAGKKKKAKS 219
Query: 292 QHHRDV 297
++ DV
Sbjct: 220 KYEEDV 225
>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
Length = 520
Score = 34.4 bits (80), Expect = 0.064
Identities = 20/109 (18%), Positives = 49/109 (44%), Gaps = 15/109 (13%)
Query: 199 IFVVNVYYALISSSISFWIPGFVMKKKKKRKKKKKKKKKKK-----------RKKKKNQH 247
+ ++ + AL++ + I FV KK + K K+ +++ K+ KK+
Sbjct: 2 MLLIIILIALVALVVGAVIGYFVRKKIAEAKIKEAEEEAKRILEEAKKEAEAIKKEALLE 61
Query: 248 HRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKK----KRKKKRKKKKNQ 292
++ + E+E +++ + +K +K +K RK + +K+ +
Sbjct: 62 AKEEIHKLRNEFEKELRERRNELQKLEKRLLQKEENLDRKLELLEKREE 110
Score = 34.0 bits (79), Expect = 0.085
Identities = 18/91 (19%), Positives = 50/91 (54%), Gaps = 14/91 (15%)
Query: 222 MKKKKKRKKKKKKKK-----KKKRKKKKNQHHRDVWINRTSRGE-EEEEKKKKKK----- 270
+ ++ K++ + KK+ K++ K +N+ +++ R R E ++ EK+ +K
Sbjct: 43 ILEEAKKEAEAIKKEALLEAKEEIHKLRNEFEKEL---RERRNELQKLEKRLLQKEENLD 99
Query: 271 KKKKKTTKKKKRKKKRKKKKNQHHRDVWQKK 301
+K + K+++ +K++K+ Q +++ +K+
Sbjct: 100 RKLELLEKREEELEKKEKELEQKQQELEKKE 130
Score = 32.1 bits (74), Expect = 0.43
Identities = 17/91 (18%), Positives = 41/91 (45%), Gaps = 23/91 (25%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTS-----------------------RG 259
K+ + KK+ + K++ K +N+ +++ R +
Sbjct: 49 KEAEAIKKEALLEAKEEIHKLRNEFEKELRERRNELQKLEKRLLQKEENLDRKLELLEKR 108
Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRKKKK 290
EEE EKK+K+ ++K++ +KK+ + + ++
Sbjct: 109 EEELEKKEKELEQKQQELEKKEEELEELIEE 139
>gnl|CDD|220231 pfam09420, Nop16, Ribosome biogenesis protein Nop16. Nop16 is a
protein involved in ribosome biogenesis.
Length = 173
Score = 33.6 bits (77), Expect = 0.072
Identities = 10/33 (30%), Positives = 17/33 (51%)
Query: 264 EKKKKKKKKKKKTTKKKKRKKKRKKKKNQHHRD 296
+KKKK + +K+ K+K +KKK +
Sbjct: 1 VRKKKKNRSSNYKVNRKRLKRKDRKKKINIIGN 33
Score = 28.6 bits (64), Expect = 2.7
Identities = 12/31 (38%), Positives = 17/31 (54%), Gaps = 3/31 (9%)
Query: 223 KKKKKRK---KKKKKKKKKKRKKKKNQHHRD 250
KKKK R K +K+ K+K +KKK +
Sbjct: 3 KKKKNRSSNYKVNRKRLKRKDRKKKINIIGN 33
Score = 28.6 bits (64), Expect = 3.2
Identities = 8/25 (32%), Positives = 14/25 (56%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQH 247
+KKKK + K +K+ K+K +
Sbjct: 2 RKKKKNRSSNYKVNRKRLKRKDRKK 26
Score = 27.8 bits (62), Expect = 4.6
Identities = 8/27 (29%), Positives = 16/27 (59%)
Query: 224 KKKKRKKKKKKKKKKKRKKKKNQHHRD 250
+KKK+ + K +KR K+K++ +
Sbjct: 2 RKKKKNRSSNYKVNRKRLKRKDRKKKI 28
Score = 27.0 bits (60), Expect = 8.2
Identities = 5/27 (18%), Positives = 14/27 (51%)
Query: 225 KKKRKKKKKKKKKKKRKKKKNQHHRDV 251
+KK+K + K +++ K+ + +
Sbjct: 2 RKKKKNRSSNYKVNRKRLKRKDRKKKI 28
Score = 27.0 bits (60), Expect = 8.6
Identities = 15/71 (21%), Positives = 23/71 (32%), Gaps = 18/71 (25%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVW------------------INRTSRGEEEEE 264
K RK+ K+K +KKK N W N +R + +
Sbjct: 11 NYKVNRKRLKRKDRKKKINIIGNPIIAKNWDKKKTLAQNYKRLGLAADPNTAARTDLLIQ 70
Query: 265 KKKKKKKKKKK 275
K + KK +
Sbjct: 71 DKPIPELKKGE 81
>gnl|CDD|146486 pfam03879, Cgr1, Cgr1 family. Members of this family are
coiled-coil proteins that are involved in pre-rRNA
processing.
Length = 105
Score = 32.4 bits (74), Expect = 0.073
Identities = 18/69 (26%), Positives = 36/69 (52%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
K+ +KR +++ K ++K K + + R I +E+K++ +K K KK
Sbjct: 32 KRMEKRLEQQAIKAREKELKDEKEAERQRRIQAIKERRAAKEEKERYEKMAAKMHAKKVE 91
Query: 283 KKKRKKKKN 291
+ KR++K+N
Sbjct: 92 RLKRREKRN 100
>gnl|CDD|221952 pfam13166, AAA_13, AAA domain. This family of domains contain a
P-loop motif that is characteristic of the AAA
superfamily. Many of the proteins in this family are
conjugative transfer proteins. This family includes the
PrrC protein that is thought to be the active component
of the anticodon nuclease.
Length = 713
Score = 34.2 bits (79), Expect = 0.074
Identities = 19/77 (24%), Positives = 35/77 (45%), Gaps = 10/77 (12%)
Query: 217 IPG-FVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKK 275
I G F + ++ + + ++ KK+ KK + + + E E EKK+++ +K K
Sbjct: 80 IKGVFTLGEENIEIEAQIEELKKELKKLEEKI---------EQLEAEIEKKEEELEKAKN 130
Query: 276 TTKKKKRKKKRKKKKNQ 292
K KK KK +
Sbjct: 131 KFLDKAWKKLAKKYDSN 147
Score = 31.9 bits (73), Expect = 0.39
Identities = 14/74 (18%), Positives = 39/74 (52%), Gaps = 3/74 (4%)
Query: 231 KKKKKKKKKRKKKKNQHHRDVWINRTSRGEEE---EEKKKKKKKKKKKTTKKKKRKKKRK 287
++ +K KK+KN+ + +W++ + +E+ +K+KK +K + +K+ ++ + +
Sbjct: 378 REHNEKIDNLKKEKNKAKKKLWLHLVAELKEDIDAYQKEKKGLEKAINSLEKEIKQLEAE 437
Query: 288 KKKNQHHRDVWQKK 301
K + +K+
Sbjct: 438 IKALEKEIKELEKQ 451
Score = 27.7 bits (62), Expect = 9.9
Identities = 11/44 (25%), Positives = 26/44 (59%)
Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQHHRDVWQKKYT 303
E ++E KK ++K ++ + +K++++ +K KN+ W+K
Sbjct: 99 ELKKELKKLEEKIEQLEAEIEKKEEELEKAKNKFLDKAWKKLAK 142
>gnl|CDD|215641 PLN03237, PLN03237, DNA topoisomerase 2; Provisional.
Length = 1465
Score = 34.5 bits (79), Expect = 0.077
Identities = 18/63 (28%), Positives = 24/63 (38%), Gaps = 4/63 (6%)
Query: 225 KKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEE----EEEKKKKKKKKKKKTTKKK 280
KK ++ KK KK KK ++ +S E E K K + KKK
Sbjct: 1195 KKVSRQAPKKPAPKKTTKKASESETTEETYGSSAMETENVAEVVKPKGRAGAKKKAPAAA 1254
Query: 281 KRK 283
K K
Sbjct: 1255 KEK 1257
Score = 33.7 bits (77), Expect = 0.14
Identities = 13/66 (19%), Positives = 21/66 (31%)
Query: 219 GFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTK 278
G K ++ KK KK K+ + + E K K + KK
Sbjct: 1192 GAAKKVSRQAPKKPAPKKTTKKASESETTEETYGSSAMETENVAEVVKPKGRAGAKKKAP 1251
Query: 279 KKKRKK 284
++K
Sbjct: 1252 AAAKEK 1257
Score = 32.1 bits (73), Expect = 0.47
Identities = 6/31 (19%), Positives = 16/31 (51%)
Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRKKKK 290
+ EE ++K ++ + + K+ ++ KK
Sbjct: 1174 KAEEAREKLQRAAARGESGAAKKVSRQAPKK 1204
Score = 28.3 bits (63), Expect = 6.6
Identities = 14/60 (23%), Positives = 27/60 (45%), Gaps = 10/60 (16%)
Query: 221 VMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKK 280
+ K K+ K ++ ++K +R + + S ++ ++ KK KKTTKK
Sbjct: 1165 LDKLDKEDAKAEEAREKLQRAAARGE----------SGAAKKVSRQAPKKPAPKKTTKKA 1214
Score = 27.9 bits (62), Expect = 7.6
Identities = 8/41 (19%), Positives = 18/41 (43%), Gaps = 1/41 (2%)
Query: 257 SRGEEEEEKKKKKKKKKK-KTTKKKKRKKKRKKKKNQHHRD 296
++ EE EK ++ + + KK R+ +K + +
Sbjct: 1173 AKAEEAREKLQRAAARGESGAAKKVSRQAPKKPAPKKTTKK 1213
>gnl|CDD|218538 pfam05285, SDA1, SDA1. This family consists of several SDA1
protein homologues. SDA1 is a Saccharomyces cerevisiae
protein which is involved in the control of the actin
cytoskeleton. The protein is essential for cell
viability and is localised in the nucleus.
Length = 317
Score = 33.9 bits (78), Expect = 0.079
Identities = 23/94 (24%), Positives = 36/94 (38%), Gaps = 25/94 (26%)
Query: 222 MKKKKKRKKKKKKKKK--------------KKRKKKKNQHHRDVWINRTSRGEEEEEKKK 267
+ K KR+ K ++ KK+K+ K + R R + K K
Sbjct: 204 LGGKLKRRDKDAPERHSDELVDADDIEGPAKKKKQTKEERIATAKEGREDREKFGSRKGK 263
Query: 268 KKKKKKKKTTKKKKRKK-----------KRKKKK 290
K K+ K T K+K RKK + K+K+
Sbjct: 264 KDKEGKSTTNKEKARKKNFMMTLHKKRVRGKQKR 297
Score = 33.5 bits (77), Expect = 0.10
Identities = 22/84 (26%), Positives = 32/84 (38%), Gaps = 20/84 (23%)
Query: 226 KKRKKKKKKKKKKKRKKKKNQHHRDVWIN--------------RTSR------GEEEEEK 265
+K K K K+R K + H D ++ + R G E+ EK
Sbjct: 197 EKGVDKALGGKLKRRDKDAPERHSDELVDADDIEGPAKKKKQTKEERIATAKEGREDREK 256
Query: 266 KKKKKKKKKKTTKKKKRKKKRKKK 289
+K KK K K K+K +KK
Sbjct: 257 FGSRKGKKDKEGKSTTNKEKARKK 280
Score = 32.7 bits (75), Expect = 0.19
Identities = 17/69 (24%), Positives = 32/69 (46%), Gaps = 15/69 (21%)
Query: 224 KKKKRKKKKKKKKKKKRKKKKN----QHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKK 279
+K K+ K+ K K++ +KKN H + V RG K+K+ + K+K +
Sbjct: 260 RKGKKDKEGKSTTNKEKARKKNFMMTLHKKRV------RG-----KQKRSLRDKQKVLRA 308
Query: 280 KKRKKKRKK 288
++K+
Sbjct: 309 HILRQKKGG 317
Score = 31.9 bits (73), Expect = 0.40
Identities = 20/85 (23%), Positives = 34/85 (40%), Gaps = 20/85 (23%)
Query: 221 VMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKK----KKKKKT 276
+ ++ R+K +K KK + + +EK +KK KK+
Sbjct: 247 AKEGREDREKFGSRK-----GKKDKEG----------KSTTNKEKARKKNFMMTLHKKRV 291
Query: 277 TKKKKRKKKRKKKKNQHHRDVWQKK 301
K+KR + K+K + H QKK
Sbjct: 292 RGKQKRSLRDKQKVLRAHILR-QKK 315
Score = 30.4 bits (69), Expect = 1.2
Identities = 21/96 (21%), Positives = 40/96 (41%), Gaps = 30/96 (31%)
Query: 217 IPGFVMKKKKKRK---------KKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKK 267
I G KKK+ ++ ++ ++K ++ KK + S +E+ +KK
Sbjct: 229 IEGPAKKKKQTKEERIATAKEGREDREKFGSRKGKKDKEGK--------STTNKEKARKK 280
Query: 268 -------KKK---KKKKKTTKKKKRKKK---RKKKK 290
KK+ K+K+ K+K + R+KK
Sbjct: 281 NFMMTLHKKRVRGKQKRSLRDKQKVLRAHILRQKKG 316
>gnl|CDD|219882 pfam08524, rRNA_processing, rRNA processing. This is a family of
proteins that are involved in rRNA processing. In a
localisation study they were found to localise to the
nucleus and nucleolus. The family also includes other
metazoa members from plants to mammals where the protein
has been named BR22 and is associated with TTF-1,
thyroid transcription factor 1. In the lungs, the family
binds TTF-1 to form a complex which influences the
expression of the key lung surfactant protein-B (SP-B)
and -C (SP-C), the small hydrophobic surfactant proteins
that maintain surface tension in alveoli.
Length = 150
Score = 32.9 bits (75), Expect = 0.082
Identities = 20/68 (29%), Positives = 43/68 (63%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
K+ +++ K K+ +K KKKK +++ R E+E K++K+ +K + +KKK++
Sbjct: 52 KESAEKQVKSSKEDRKFEKKKKLDEKKEIAKQRKREQREKELAKRQKELEKIELSKKKQK 111
Query: 283 KKKRKKKK 290
+++R++KK
Sbjct: 112 ERERRRKK 119
Score = 30.6 bits (69), Expect = 0.48
Identities = 15/61 (24%), Positives = 37/61 (60%), Gaps = 8/61 (13%)
Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
+ +KK+ K++K+++++K K+ + + E ++K+K++++++KK TKK K
Sbjct: 74 LDEKKEIAKQRKREQREKELAKRQK--------ELEKIELSKKKQKERERRRKKLTKKTK 125
Query: 282 R 282
Sbjct: 126 S 126
Score = 30.6 bits (69), Expect = 0.58
Identities = 20/70 (28%), Positives = 39/70 (55%), Gaps = 11/70 (15%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
KKKK +KK+ K++K+ +++K R ++E +K + KKK+K +++++
Sbjct: 70 KKKKLDEKKEIAKQRKREQREKELAKR-----------QKELEKIELSKKKQKERERRRK 118
Query: 283 KKKRKKKKNQ 292
K +K K Q
Sbjct: 119 KLTKKTKSGQ 128
Score = 30.2 bits (68), Expect = 0.75
Identities = 20/66 (30%), Positives = 38/66 (57%), Gaps = 5/66 (7%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
+K +K+KK +KK+ K++K++ + R +E E+ + KKK+K++ ++KK
Sbjct: 66 RKFEKKKKLDEKKEIAKQRKREQREKEL-----AKRQKELEKIELSKKKQKERERRRKKL 120
Query: 283 KKKRKK 288
KK K
Sbjct: 121 TKKTKS 126
Score = 29.1 bits (65), Expect = 1.7
Identities = 19/78 (24%), Positives = 42/78 (53%), Gaps = 8/78 (10%)
Query: 226 KKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKK 285
K+ +K+ K K+ RK +K + + +E +++K+++++K+ ++K+ +K
Sbjct: 52 KESAEKQVKSSKEDRKFEKKKKLDEK--------KEIAKQRKREQREKELAKRQKELEKI 103
Query: 286 RKKKKNQHHRDVWQKKYT 303
KK Q R+ +KK T
Sbjct: 104 ELSKKKQKERERRRKKLT 121
Score = 27.5 bits (61), Expect = 6.0
Identities = 20/91 (21%), Positives = 42/91 (46%), Gaps = 12/91 (13%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHR---------DVWINRTSRGEEEEEKKKKKK-KK 272
+ +K+ KK ++ K ++ ++N + + + E+E +K+ K K+
Sbjct: 5 DQNQKKNGKKFTREYKVKEIQRNLTKKARLKKEYLKLLEKEGYAVPEKESAEKQVKSSKE 64
Query: 273 KKKTTKKKK--RKKKRKKKKNQHHRDVWQKK 301
+K KKKK KK+ K++ + R+ K
Sbjct: 65 DRKFEKKKKLDEKKEIAKQRKREQREKELAK 95
>gnl|CDD|222977 PHA03089, PHA03089, late transcription factor VLTF-4; Provisional.
Length = 191
Score = 33.2 bits (76), Expect = 0.087
Identities = 21/64 (32%), Positives = 28/64 (43%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
KKKK +KKK KK K+KKK+ + ++ S EE EE KK + K
Sbjct: 52 KKKKTTPRKKKTTKKTKKKKKEKEEVPELAAEELSDSEENEENDKKVDYELPKVQNTAAE 111
Query: 283 KKKR 286
Sbjct: 112 VNHE 115
Score = 31.3 bits (71), Expect = 0.36
Identities = 18/35 (51%), Positives = 21/35 (60%)
Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQHH 294
E+ E KK KKKK T +KKK KK KKKK +
Sbjct: 41 EDVEIPSKKTSKKKKTTPRKKKTTKKTKKKKKEKE 75
Score = 27.1 bits (60), Expect = 9.3
Identities = 13/26 (50%), Positives = 19/26 (73%)
Query: 265 KKKKKKKKKKKTTKKKKRKKKRKKKK 290
KKKK +KK T KK +KKK++K++
Sbjct: 51 SKKKKTTPRKKKTTKKTKKKKKEKEE 76
>gnl|CDD|218435 pfam05104, Rib_recp_KP_reg, Ribosome receptor lysine/proline rich
region. This highly conserved region is found towards
the C-terminus of the transmembrane domain. The function
is unclear.
Length = 151
Score = 32.6 bits (74), Expect = 0.093
Identities = 12/31 (38%), Positives = 22/31 (70%)
Query: 261 EEEEKKKKKKKKKKKTTKKKKRKKKRKKKKN 291
+ + +K KKKK+K ++KK + KK+++K N
Sbjct: 18 GKTQSQKSDKKKKEKVSEKKGKSKKKEEKPN 48
Score = 30.3 bits (68), Expect = 0.71
Identities = 13/33 (39%), Positives = 21/33 (63%)
Query: 258 RGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKK 290
R E + + +K KKKK+ +KK K K+K++K
Sbjct: 14 RKESGKTQSQKSDKKKKEKVSEKKGKSKKKEEK 46
Score = 29.9 bits (67), Expect = 0.87
Identities = 12/22 (54%), Positives = 16/22 (72%)
Query: 224 KKKKRKKKKKKKKKKKRKKKKN 245
KKKK K +KK K KK+++K N
Sbjct: 27 KKKKEKVSEKKGKSKKKEEKPN 48
Score = 29.6 bits (66), Expect = 1.0
Identities = 15/52 (28%), Positives = 24/52 (46%), Gaps = 13/52 (25%)
Query: 237 KKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKK 288
K++++ K Q + +KKKK+K +KK KKK +K K
Sbjct: 12 KQRKESGKTQSQK-------------SDKKKKEKVSEKKGKSKKKEEKPNGK 50
Score = 28.4 bits (63), Expect = 2.7
Identities = 18/50 (36%), Positives = 25/50 (50%), Gaps = 15/50 (30%)
Query: 226 KKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKK 275
K+RK+ K + +K KKKK E+ EKK K KKK++K
Sbjct: 12 KQRKESGKTQSQKSDKKKK---------------EKVSEKKGKSKKKEEK 46
Score = 26.9 bits (59), Expect = 9.6
Identities = 11/20 (55%), Positives = 15/20 (75%)
Query: 223 KKKKKRKKKKKKKKKKKRKK 242
KKKK++ +KK K KKK +K
Sbjct: 27 KKKKEKVSEKKGKSKKKEEK 46
>gnl|CDD|227519 COG5192, BMS1, GTP-binding protein required for 40S ribosome
biogenesis [Translation, ribosomal structure and
biogenesis].
Length = 1077
Score = 33.9 bits (77), Expect = 0.098
Identities = 14/80 (17%), Positives = 36/80 (45%), Gaps = 6/80 (7%)
Query: 217 IPGFVMKKKKKRKKKKKKKKKK------KRKKKKNQHHRDVWINRTSRGEEEEEKKKKKK 270
+P + K K+ ++ ++K++ + R EE+E + ++
Sbjct: 981 LPREIESKLPLDKRSIAVVSRRIELPVPPECREKHEIKDRIVKERIKDQEEKERMESLQR 1040
Query: 271 KKKKKTTKKKKRKKKRKKKK 290
K+++ KK+K +++R +K
Sbjct: 1041 AKEEEIGKKEKEREQRIRKT 1060
Score = 33.2 bits (75), Expect = 0.17
Identities = 14/63 (22%), Positives = 30/63 (47%), Gaps = 3/63 (4%)
Query: 225 KKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKK 284
K + K++ K +++K + + Q + + E+E E++ +K KKR K
Sbjct: 1018 KDRIVKERIKDQEEKERMESLQRAK---EEEIGKKEKEREQRIRKTIHDNYKEMAKKRLK 1074
Query: 285 KRK 287
K++
Sbjct: 1075 KKR 1077
Score = 32.0 bits (72), Expect = 0.46
Identities = 19/80 (23%), Positives = 37/80 (46%), Gaps = 14/80 (17%)
Query: 217 IPGFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKT 276
+P ++K K + K++ K +++K + S +EE+ KK+K++++
Sbjct: 1005 LPVPPECREKHEIKDRIVKERIKDQEEKER--------MESLQRAKEEEIGKKEKEREQR 1056
Query: 277 TKKKKRK------KKRKKKK 290
+K KKR KKK
Sbjct: 1057 IRKTIHDNYKEMAKKRLKKK 1076
Score = 31.2 bits (70), Expect = 0.88
Identities = 10/72 (13%), Positives = 37/72 (51%)
Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
+ K+ ++ + + ++ +D + + +EE+E+ + ++ K++ KK+
Sbjct: 991 LDKRSIAVVSRRIELPVPPECREKHEIKDRIVKERIKDQEEKERMESLQRAKEEEIGKKE 1050
Query: 282 RKKKRKKKKNQH 293
++++++ +K H
Sbjct: 1051 KEREQRIRKTIH 1062
>gnl|CDD|219978 pfam08701, GN3L_Grn1, GNL3L/Grn1 putative GTPase. Grn1 (yeast) and
GNL3L (human) are putative GTPases which are required
for growth and play a role in processing of nucleolar
pre-rRNA. This family contains a potential nuclear
localisation signal.
Length = 80
Score = 31.5 bits (72), Expect = 0.10
Identities = 18/69 (26%), Positives = 34/69 (49%), Gaps = 12/69 (17%)
Query: 223 KKKKKRKKKKKKKKKKKRKKK----------KNQHHRDVWINRTSRGEEEEEKKKKKKKK 272
+K RK+ KK K +KKK K + + I R +EEE++++K+ +K
Sbjct: 13 HHRKLRKEAKKNPTWKSKKKKDPGIPNSFPFKEEILEE--IEEKKRKQEEEKERRKEARK 70
Query: 273 KKKTTKKKK 281
++ +K+
Sbjct: 71 AERAEARKR 79
Score = 28.0 bits (63), Expect = 1.4
Identities = 20/79 (25%), Positives = 39/79 (49%), Gaps = 16/79 (20%)
Query: 226 KKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRG--------------EEEEEKKKKKKK 271
+ R K +KK + RK +K W ++ + EE EEKK+K+++
Sbjct: 1 RLRYKIEKKVAEHHRKLRKEAKKNPTWKSKKKKDPGIPNSFPFKEEILEEIEEKKRKQEE 60
Query: 272 KKKKTTKKKKRKKKRKKKK 290
+K++ +K+ RK +R + +
Sbjct: 61 EKER--RKEARKAERAEAR 77
Score = 27.2 bits (61), Expect = 3.2
Identities = 14/79 (17%), Positives = 36/79 (45%), Gaps = 17/79 (21%)
Query: 223 KKKKKRKKKKKKKKKKKRK--------KKKNQH---------HRDVWINRTSRGEEEEEK 265
+ + K +KK + +K RK K K + ++ + + ++E+
Sbjct: 1 RLRYKIEKKVAEHHRKLRKEAKKNPTWKSKKKKDPGIPNSFPFKEEILEEIEEKKRKQEE 60
Query: 266 KKKKKKKKKKTTKKKKRKK 284
+K+++K+ +K + + RK+
Sbjct: 61 EKERRKEARKAERAEARKR 79
>gnl|CDD|152713 pfam12278, SDP_N, Sex determination protein N terminal. This
family of proteins is found in eukaryotes. Proteins in
this family are typically between 168 and 410 amino
acids in length. This family is the N terminal end of
the sex determination protein of many different animals.
It plays a role in the gender determination of around
20% of all animals.
Length = 168
Score = 32.7 bits (74), Expect = 0.11
Identities = 19/72 (26%), Positives = 35/72 (48%), Gaps = 8/72 (11%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKK--------KKKKKK 274
K K+ R + + + + + ++ QH R+ W + R E E+ KKK +K++ +
Sbjct: 14 KFKQLRSEDSESGLRSRTEDERLQHRREEWFIQQEREREHEKLKKKMILEYELRRKREIE 73
Query: 275 KTTKKKKRKKKR 286
K K+ R K R
Sbjct: 74 KLLSKRSRTKSR 85
>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota). DNA
Topoisomerase I (eukaryota), DNA topoisomerase V,
Vaccina virus topoisomerase, Variola virus
topoisomerase, Shope fibroma virus topoisomeras.
Length = 391
Score = 33.5 bits (77), Expect = 0.11
Identities = 20/72 (27%), Positives = 39/72 (54%)
Query: 221 VMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKK 280
+ K ++K K K + K+ K+ + D+ S+ E + EK + K+KKK KK+
Sbjct: 279 MEKLQEKIKALKYQLKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKE 338
Query: 281 KRKKKRKKKKNQ 292
++KKK+ ++ +
Sbjct: 339 EKKKKQIERLEE 350
Score = 32.7 bits (75), Expect = 0.22
Identities = 20/70 (28%), Positives = 34/70 (48%), Gaps = 2/70 (2%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKK--KKKKKTTKKK 280
K + KR KK + K+ + N E +E+KK+KKK KKKK+ + +
Sbjct: 290 KYQLKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKKQIERLE 349
Query: 281 KRKKKRKKKK 290
+R +K + +
Sbjct: 350 ERIEKLEVQA 359
Score = 30.8 bits (70), Expect = 0.84
Identities = 13/59 (22%), Positives = 31/59 (52%)
Query: 229 KKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRK 287
+ K+K K K +++ D + + +++EEKKKK+ ++ ++ +K + + K
Sbjct: 304 EMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKKQIERLEERIEKLEVQATDK 362
Score = 28.9 bits (65), Expect = 3.4
Identities = 16/68 (23%), Positives = 34/68 (50%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
K+ K+ + ++K K++ RD E+++EKKK++KKKK+ +++
Sbjct: 293 LKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKKQIERLEERI 352
Query: 283 KKKRKKKK 290
+K +
Sbjct: 353 EKLEVQAT 360
Score = 28.1 bits (63), Expect = 6.7
Identities = 18/73 (24%), Positives = 41/73 (56%), Gaps = 4/73 (5%)
Query: 222 MKKKKKRKKKKKKKKKKKRKK----KKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTT 277
M+K +++ K K + K+ +K + + ++ R E+ + + K+KKK+KK
Sbjct: 279 MEKLQEKIKALKYQLKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKE 338
Query: 278 KKKKRKKKRKKKK 290
+KKK++ +R +++
Sbjct: 339 EKKKKQIERLEER 351
>gnl|CDD|237855 PRK14900, valS, valyl-tRNA synthetase; Provisional.
Length = 1052
Score = 33.8 bits (77), Expect = 0.11
Identities = 21/68 (30%), Positives = 27/68 (39%), Gaps = 7/68 (10%)
Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
MKKK +K KK KK KK + ++ KK +K KK KK
Sbjct: 992 MKKKVAKKAPAKKAAAKKAAAKKA-------AAKKKVAKKAPAKKVARKPAAKKAAKKPA 1044
Query: 282 RKKKRKKK 289
RK +K
Sbjct: 1045 RKAAGRKA 1052
>gnl|CDD|233294 TIGR01159, DRP1, density-regulated protein DRP1. This protein
family shows weak but suggestive similarity to
translation initiation factor SUI1 and its prokaryotic
homologs.
Length = 173
Score = 32.9 bits (75), Expect = 0.11
Identities = 14/68 (20%), Positives = 25/68 (36%), Gaps = 16/68 (23%)
Query: 219 GFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTK 278
+ + +++K +K E E KKKKK +K T K
Sbjct: 51 AEAVTIGEAQEEKGEKDLLGI----------------QKAQEAREGGKKKKKLPQKVTIK 94
Query: 279 KKKRKKKR 286
++ R K++
Sbjct: 95 REPRTKRK 102
>gnl|CDD|223058 PHA03366, PHA03366, FGAM-synthase; Provisional.
Length = 1304
Score = 33.8 bits (78), Expect = 0.12
Identities = 23/118 (19%), Positives = 36/118 (30%), Gaps = 21/118 (17%)
Query: 72 LRIITNYFVVSLAFADLLVALCVMPFNAIVSLTDEWYFGYFM-CDVWNSFDVYFSTASI- 129
L + N + L + + + + + T +Y C S + S+
Sbjct: 195 LDKVCNGVMGDLPLGGDALPVAIQASSPLPGYTVAYYLSLTPTCTSVRSNYLTRGAHSVE 254
Query: 130 ---LHLCCISVDRYYAIVKPLQYPIIMNQNTVLLMLSNVWILPGIISFTPIMLGWYTT 184
LH +Q Q T LL + +LPG LGWY T
Sbjct: 255 TGFLHTHG-----------SIQTMG--PQLTGLLFQPS--LLPGG-LPRASALGWYVT 296
>gnl|CDD|240709 cd12263, RRM_ABT1_like, RNA recognition motif found in activator of
basal transcription 1 (ABT1) and similar proteins. This
subfamily corresponds to the RRM of novel nuclear
proteins termed ABT1 and its homologous counterpart,
pre-rRNA-processing protein ESF2 (eighteen S factor 2),
from yeast. ABT1 associates with the TATA-binding
protein (TBP) and enhances basal transcription activity
of class II promoters. Meanwhile, ABT1 could be a
transcription cofactor that can bind to DNA in a
sequence-independent manner. The yeast ABT1 homolog,
ESF2, is a component of 90S preribosomes and 5'
ETS-based RNPs. It is previously identified as a
putative partner of the TATA-element binding protein.
However, it is primarily localized to the nucleolus and
physically associates with pre-rRNA processing factors.
ESF2 may play a role in ribosome biogenesis. It is
required for normal pre-rRNA processing, as well as for
SSU processome assembly and function. Both ABT1 and ESF2
contain an RNA recognition motif (RRM), also termed RBD
(RNA binding domain) or RNP (ribonucleoprotein domain).
.
Length = 98
Score = 31.4 bits (72), Expect = 0.12
Identities = 16/76 (21%), Positives = 34/76 (44%), Gaps = 12/76 (15%)
Query: 227 KRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKR 286
+ + K+K++KK+ K + + W+ E + KK K+ ++
Sbjct: 33 QPEDPAKRKRRKKKGGNKKKKFTEGWV---------EFEDKKVAKRVAESLNNTP---IG 80
Query: 287 KKKKNQHHRDVWQKKY 302
KK++++ D+W KY
Sbjct: 81 GKKRSRYRDDLWNLKY 96
>gnl|CDD|227931 COG5644, COG5644, Uncharacterized conserved protein [Function
unknown].
Length = 869
Score = 33.9 bits (77), Expect = 0.13
Identities = 19/91 (20%), Positives = 39/91 (42%), Gaps = 9/91 (9%)
Query: 215 FWIPGFVMKKKKKRKKKKKKKKKKKRKKKK--NQHHRDV--WINRTSRGEEEEEKKKKKK 270
F P ++K R +KK+ + + + + W++R R K+ KK
Sbjct: 595 FVTPSTSLEKSMDRILHGQKKRAEGAVVFEKPLEATENFNPWLDRKMR-----RIKRIKK 649
Query: 271 KKKKKTTKKKKRKKKRKKKKNQHHRDVWQKK 301
K ++ + K+ KKK +++N + +K
Sbjct: 650 KAYRRIRRDKRLKKKMPEEENTQENHLGSEK 680
Score = 33.1 bits (75), Expect = 0.21
Identities = 16/74 (21%), Positives = 39/74 (52%), Gaps = 2/74 (2%)
Query: 220 FVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKK 279
+ ++++K K+ K K K RK +KN+ +++ + S E+ E +K ++ + ++ T++
Sbjct: 384 LMFREERKAKRVAKIKSKTYRKIRKNRKEKEMALIPKS--EDLENEKSEEARALERMTQR 441
Query: 280 KKRKKKRKKKKNQH 293
K +K +
Sbjct: 442 HKNTSSWTRKMLER 455
Score = 28.5 bits (63), Expect = 5.2
Identities = 21/57 (36%), Positives = 22/57 (38%), Gaps = 14/57 (24%)
Query: 218 PGFVMKKKKKRKKKKKKK--KKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKK 272
PG + KKK KK K KRK H V IN EK KK KK
Sbjct: 768 PGLKVIKKKSNTVKKVVGGIKYSKRKDFNLSH---VIIN---------EKVNKKDKK 812
>gnl|CDD|234707 PRK00270, rpsU, 30S ribosomal protein S21; Reviewed.
Length = 64
Score = 30.6 bits (70), Expect = 0.13
Identities = 15/61 (24%), Positives = 30/61 (49%), Gaps = 15/61 (24%)
Query: 227 KRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKR 286
+R K+K +K R+ ++ + + EK +K+K+KK +K++RKK
Sbjct: 17 RRFKRKVEKAGILRELRRREFY---------------EKPSEKRKRKKAAARKRRRKKLA 61
Query: 287 K 287
+
Sbjct: 62 R 62
Score = 27.1 bits (61), Expect = 2.4
Identities = 10/21 (47%), Positives = 15/21 (71%)
Query: 223 KKKKKRKKKKKKKKKKKRKKK 243
K +KRK+KK +K++RKK
Sbjct: 40 KPSEKRKRKKAAARKRRRKKL 60
Score = 26.4 bits (59), Expect = 4.7
Identities = 7/21 (33%), Positives = 15/21 (71%)
Query: 225 KKKRKKKKKKKKKKKRKKKKN 245
K+KRKK +K+++K+ ++
Sbjct: 44 KRKRKKAAARKRRRKKLAREE 64
Score = 26.0 bits (58), Expect = 5.7
Identities = 9/22 (40%), Positives = 14/22 (63%)
Query: 225 KKKRKKKKKKKKKKKRKKKKNQ 246
K K+K+KK +KR++KK
Sbjct: 40 KPSEKRKRKKAAARKRRRKKLA 61
>gnl|CDD|217840 pfam04006, Mpp10, Mpp10 protein. This family includes proteins
related to Mpp10 (M phase phosphoprotein 10). The U3
small nucleolar ribonucleoprotein (snoRNP) is required
for three cleavage events that generate the mature 18S
rRNA from the pre-rRNA. In Saccharomyces cerevisiae,
depletion of Mpp10, a U3 snoRNP-specific protein, halts
18S rRNA production and impairs cleavage at the three U3
snoRNP-dependent sites.
Length = 613
Score = 33.4 bits (76), Expect = 0.14
Identities = 19/79 (24%), Positives = 30/79 (37%), Gaps = 19/79 (24%)
Query: 218 PGFVMKKKKKRKKKKKKK----KKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKK 273
P + KK K +K K + K ++R+KKK + KK K K
Sbjct: 537 PEEIYKKMKAIEKSKTELDRTDKNRERRKKKRKR---------------RAAKKAVTKAK 581
Query: 274 KKTTKKKKRKKKRKKKKNQ 292
K+ K++ KK +
Sbjct: 582 KERKIGKEKVDGVAKKSSS 600
Score = 32.7 bits (74), Expect = 0.23
Identities = 13/57 (22%), Positives = 26/57 (45%), Gaps = 3/57 (5%)
Query: 224 KKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKK 280
K ++R+KKK+K++ K+ K + R + + + +K K + TK
Sbjct: 559 KNRERRKKKRKRRAAKKAVTKAKKERKI---GKEKVDGVAKKSSSAFFAKLQETKAV 612
Score = 31.9 bits (72), Expect = 0.40
Identities = 14/60 (23%), Positives = 25/60 (41%), Gaps = 7/60 (11%)
Query: 226 KKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKK 285
K R+++KKK+K++ KK + + R +E+ KK K + K
Sbjct: 559 KNRERRKKKRKRRAAKKAVTK-------AKKERKIGKEKVDGVAKKSSSAFFAKLQETKA 611
Score = 28.4 bits (63), Expect = 4.9
Identities = 13/36 (36%), Positives = 20/36 (55%), Gaps = 2/36 (5%)
Query: 262 EEEKKKKKKKKKKKT--TKKKKRKKKRKKKKNQHHR 295
EE KK K +K KT + K +++RKKK+ +
Sbjct: 538 EEIYKKMKAIEKSKTELDRTDKNRERRKKKRKRRAA 573
>gnl|CDD|220838 pfam10659, Trypan_glycop_C, Trypanosome variant surface
glycoprotein C-terminal domain. The trypanosome
parasite expresses these proteins to evade the immune
response.
Length = 98
Score = 31.3 bits (71), Expect = 0.15
Identities = 17/63 (26%), Positives = 23/63 (36%), Gaps = 9/63 (14%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
KK+ K K K+ K K+ + G E +K K KK KK K K
Sbjct: 19 DKKEDDGKCKPKEGKAKKNGAPVT---------QTAGTETTTEKCKGKKDKKDCKKGCKW 69
Query: 283 KKK 285
+
Sbjct: 70 EGN 72
Score = 30.9 bits (70), Expect = 0.20
Identities = 18/63 (28%), Positives = 25/63 (39%), Gaps = 7/63 (11%)
Query: 229 KKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKK 288
K+ K K K+ K KKN ++ E +K K KK K KK K +
Sbjct: 21 KEDDGKCKPKEGKAKKNG-------APVTQTAGTETTTEKCKGKKDKKDCKKGCKWEGNT 73
Query: 289 KKN 291
K+
Sbjct: 74 CKD 76
Score = 27.4 bits (61), Expect = 3.7
Identities = 19/69 (27%), Positives = 27/69 (39%), Gaps = 19/69 (27%)
Query: 231 KKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKK---------TTKKKK 281
KK K K + K+K + +E++ K K K+ K KK T+
Sbjct: 2 NKKNKTKTECKEKGCKWD----------KKEDDGKCKPKEGKAKKNGAPVTQTAGTETTT 51
Query: 282 RKKKRKKKK 290
K K KK K
Sbjct: 52 EKCKGKKDK 60
>gnl|CDD|221821 pfam12871, PRP38_assoc, Pre-mRNA-splicing factor 38-associated
hydrophilic C-term. This domain is a hydrophilic region
found at the C-terminus of plant and metazoan
pre-mRNA-splicing factor 38 proteins. The function is
not known.
Length = 97
Score = 31.3 bits (71), Expect = 0.16
Identities = 8/37 (21%), Positives = 20/37 (54%)
Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQHHRD 296
E EEE+ ++ ++K + + R+ R++ + + R
Sbjct: 15 ESEEEEDDEEIRRKAERDVDRGRRSPRRRTRRRSRRR 51
>gnl|CDD|205206 pfam13025, DUF3886, Protein of unknown function (DUF3886). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria. Proteins in
this family are approximately 90 amino acids in length.
There are two completely conserved L residues that may
be functionally important.
Length = 70
Score = 30.4 bits (69), Expect = 0.18
Identities = 8/26 (30%), Positives = 20/26 (76%)
Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKK 285
+ EEEK++++++ +K+ +K++ K K
Sbjct: 30 KAEEEKREEEEEARKREERKEREKNK 55
>gnl|CDD|187751 cd09270, RNase_H2-B, Ribonuclease H2-B is a subunit of the
eukaryotic RNase H complex which cleaves RNA-DNA
hybrids. Ribonuclease H2B is one of the three proteins
of eukaryotic RNase H2 complex that is required for
nucleic acid binding and hydrolysis. RNase H is
classified into two families, type I (prokaryotic RNase
HI, eukaryotic RNase H1 and viral RNase H) and type II
(prokaryotic RNase HII and HIII, and eukaryotic RNase
H2/HII). RNase H endonucleolytically hydrolyzes an RNA
strand when it is annealed to a complementary DNA strand
in the presence of divalent cations, in DNA replication
and repair. The enzyme can be found in bacteria,
archaea, and eukaryotes. Most prokaryotic and eukaryotic
genomes contain multiple RNase H genes. Despite a lack
of evidence for homology from sequence comparisons, type
I and type II RNase H share a common fold and similar
steric configurations of the four acidic active-site
residues, suggesting identical or very similar catalytic
mechanisms. Eukaryotic RNase HII is active during
replication and is believed to play a role in removal of
Okazaki fragment primers and single ribonucleotides in
DNA-DNA duplexes. Eukaryotic RNase HII is functional
when it forms a complex with RNase H2B and RNase H2C
proteins. It is speculated that the two accessory
subunits are required for correct folding of the
catalytic subunit of RNase HII. Mutations in the three
subunits of human RNase HII cause neurological disorder.
Length = 211
Score = 32.3 bits (74), Expect = 0.18
Identities = 26/69 (37%), Positives = 33/69 (47%), Gaps = 15/69 (21%)
Query: 222 MKKKKKRKKKKK---KKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTK 278
+ KK +R KKK+ K+ + R KKK S EEE+KKKKKK KKK K
Sbjct: 149 LLKKVERLKKKELDIKEAEAARDKKK------------SNNAEEEKKKKKKKSAKKKKLK 196
Query: 279 KKKRKKKRK 287
K +
Sbjct: 197 KVAAVGMKA 205
Score = 29.2 bits (66), Expect = 2.0
Identities = 18/57 (31%), Positives = 27/57 (47%), Gaps = 7/57 (12%)
Query: 235 KKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKN 291
KK +R KKK ++ ++KK +++KK KKK KKK+ KK
Sbjct: 150 LKKVERLKKKE-------LDIKEAEAARDKKKSNNAEEEKKKKKKKSAKKKKLKKVA 199
Score = 27.3 bits (61), Expect = 10.0
Identities = 13/24 (54%), Positives = 16/24 (66%)
Query: 221 VMKKKKKRKKKKKKKKKKKRKKKK 244
KK +++KKKKKKK KKKK
Sbjct: 171 DKKKSNNAEEEKKKKKKKSAKKKK 194
>gnl|CDD|235778 PRK06319, PRK06319, DNA topoisomerase I/SWI domain fusion protein;
Validated.
Length = 860
Score = 33.3 bits (76), Expect = 0.20
Identities = 14/36 (38%), Positives = 18/36 (50%)
Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQHHR 295
E++ + KKK K K K K+K K K KK R
Sbjct: 748 EKKTKAKKKSASTKGKAAKTVKKKSKAKSKKTTKKR 783
Score = 30.6 bits (69), Expect = 1.3
Identities = 13/35 (37%), Positives = 18/35 (51%)
Query: 267 KKKKKKKKKTTKKKKRKKKRKKKKNQHHRDVWQKK 301
+KK K KKK+ K + K KKK++ KK
Sbjct: 748 EKKTKAKKKSASTKGKAAKTVKKKSKAKSKKTTKK 782
Score = 29.0 bits (65), Expect = 4.1
Identities = 22/84 (26%), Positives = 28/84 (33%), Gaps = 24/84 (28%)
Query: 198 CIFVVNVYYALISSSISFWIPGFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTS 257
C + N A+I+ + K K K+K K K K
Sbjct: 725 CSVIGNSIDAVITKYAGTPKTPYEKKTKAKKKSASTKGKAAK------------------ 766
Query: 258 RGEEEEEKKKKKKKKKKKTTKKKK 281
KKK K K KKTTKK+
Sbjct: 767 ------TVKKKSKAKSKKTTKKRA 784
Score = 28.6 bits (64), Expect = 4.7
Identities = 9/25 (36%), Positives = 11/25 (44%)
Query: 267 KKKKKKKKKTTKKKKRKKKRKKKKN 291
K K K K K + KK KK+
Sbjct: 760 TKGKAAKTVKKKSKAKSKKTTKKRA 784
Score = 27.9 bits (62), Expect = 8.7
Identities = 15/37 (40%), Positives = 16/37 (43%), Gaps = 8/37 (21%)
Query: 264 EKKKKKKKK--------KKKTTKKKKRKKKRKKKKNQ 292
EKK K KKK K KK K K K+ KK
Sbjct: 748 EKKTKAKKKSASTKGKAAKTVKKKSKAKSKKTTKKRA 784
>gnl|CDD|218612 pfam05501, DUF755, Domain of unknown function (DUF755). This
family is predominated by ORFs from Circoviridae. The
function of this family remains to be determined.
Length = 122
Score = 31.2 bits (71), Expect = 0.22
Identities = 14/60 (23%), Positives = 26/60 (43%), Gaps = 19/60 (31%)
Query: 233 KKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQ 292
+ ++K+K ++K +Q H KKK+KK + + KK KK+ +
Sbjct: 63 QPQRKRKEQQKSSQTH-------------------KKKRKKSRHVSSRSAKKISAKKRRR 103
Score = 29.3 bits (66), Expect = 1.0
Identities = 14/56 (25%), Positives = 25/56 (44%), Gaps = 15/56 (26%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTK 278
++K+K ++K + KKKRKK ++ R KK KK++ +
Sbjct: 65 QRKRKEQQKSSQTHKKKRKKSRHVSSR---------------SAKKISAKKRRRSS 105
Score = 27.4 bits (61), Expect = 5.3
Identities = 14/55 (25%), Positives = 28/55 (50%), Gaps = 11/55 (20%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTT 277
+ ++KRK+++K + K+K+KK R V KK KK+++ ++
Sbjct: 63 QPQRKRKEQQKSSQTHKKKRKK---SRHVSSRSA--------KKISAKKRRRSSS 106
>gnl|CDD|227577 COG5252, COG5252, Uncharacterized conserved protein, contains
CCCH-type Zn-finger protein [General function prediction
only].
Length = 299
Score = 32.4 bits (73), Expect = 0.22
Identities = 16/74 (21%), Positives = 37/74 (50%), Gaps = 4/74 (5%)
Query: 218 PGFVMKKKKKRKKKKKKKKKKKRKKK----KNQHHRDVWINRTSRGEEEEEKKKKKKKKK 273
P + KK+++ KK + +K+ + K KN++ + E K++ +KK+K
Sbjct: 3 PKKMAKKQQESGKKATRDMRKELEDKTFGLKNKNRSTKVQAIIKQIETLNLKEQLEKKEK 62
Query: 274 KKTTKKKKRKKKRK 287
+ +K++ +K+
Sbjct: 63 MRMEEKRREPEKQV 76
Score = 31.6 bits (71), Expect = 0.49
Identities = 17/76 (22%), Positives = 35/76 (46%)
Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
M KK KK+++ KK R +K + + +R + + K+ + K +KK
Sbjct: 1 MPPKKMAKKQQESGKKATRDMRKELEDKTFGLKNKNRSTKVQAIIKQIETLNLKEQLEKK 60
Query: 282 RKKKRKKKKNQHHRDV 297
K + ++K+ + + V
Sbjct: 61 EKMRMEEKRREPEKQV 76
>gnl|CDD|183610 PRK12585, PRK12585, putative monovalent cation/H+ antiporter
subunit G; Reviewed.
Length = 197
Score = 32.0 bits (72), Expect = 0.23
Identities = 12/57 (21%), Positives = 38/57 (66%)
Query: 224 KKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKK 280
KKKK ++++ +K R++++ R W R + +E E++++++++++++T +++
Sbjct: 116 KKKKSLIIRQEQIEKARQEREELEERMEWERREEKIDEREDQEEQEREREEQTIEEQ 172
>gnl|CDD|219621 pfam07890, Rrp15p, Rrp15p. Rrp15p is required for the formation of
60S ribosomal subunits.
Length = 132
Score = 31.6 bits (72), Expect = 0.23
Identities = 8/34 (23%), Positives = 19/34 (55%)
Query: 254 NRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRK 287
++ +++ K +K +KK K+ + +KR+ K
Sbjct: 27 SKKLLKAKKKLKSEKLEKKAKRQLRAEKRQALEK 60
Score = 31.2 bits (71), Expect = 0.24
Identities = 17/71 (23%), Positives = 30/71 (42%), Gaps = 18/71 (25%)
Query: 221 VMKKKKKRKKKKKK---KKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTT 277
++ K K+K + KK K KK + + EK +KK K++ +
Sbjct: 9 ILASKLPASKRKDPILSRSKKLLKAKK---------------KLKSEKLEKKAKRQLRAE 53
Query: 278 KKKKRKKKRKK 288
K++ +K R K
Sbjct: 54 KRQALEKGRVK 64
Score = 28.1 bits (63), Expect = 2.8
Identities = 8/38 (21%), Positives = 21/38 (55%)
Query: 257 SRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQHH 294
SR ++ + KKK K +K + K++ + ++++ +
Sbjct: 25 SRSKKLLKAKKKLKSEKLEKKAKRQLRAEKRQALEKGR 62
Score = 26.9 bits (60), Expect = 7.7
Identities = 11/35 (31%), Positives = 20/35 (57%)
Query: 263 EEKKKKKKKKKKKTTKKKKRKKKRKKKKNQHHRDV 297
+ KKK K +K +K K++ R +KR+ + + V
Sbjct: 32 KAKKKLKSEKLEKKAKRQLRAEKRQALEKGRVKPV 66
>gnl|CDD|149438 pfam08374, Protocadherin, Protocadherin. The structure of
protocadherins is similar to that of classic cadherins
(pfam00028), but particularly on the cytoplasmic domains
they also have some unique features. They are expressed
in a variety of organisms and are found in high
concentrations in the brain where they seem to be
localised mainly at cell-cell contact sites. Their
expression seems to be developmentally regulated.
Length = 223
Score = 32.1 bits (73), Expect = 0.24
Identities = 14/40 (35%), Positives = 22/40 (55%)
Query: 230 KKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKK 269
K+KKKKK KK+K K+ V + + +E E+K +
Sbjct: 90 KQKKKKKDKKKKSPKSLLLNFVTVEESKPDDEVHEQKSET 129
>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
Length = 880
Score = 32.7 bits (75), Expect = 0.25
Identities = 13/66 (19%), Positives = 32/66 (48%), Gaps = 1/66 (1%)
Query: 224 KKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRK 283
+ + + ++ +KR + + + R EE+EE+ ++ KKK K+ K+ +
Sbjct: 299 SEFYEEYLDELREIEKRLSRLEEEINGI-EERIKELEEKEERLEELKKKLKELEKRLEEL 357
Query: 284 KKRKKK 289
++R +
Sbjct: 358 EERHEL 363
Score = 30.4 bits (69), Expect = 1.5
Identities = 13/72 (18%), Positives = 34/72 (47%), Gaps = 11/72 (15%)
Query: 219 GFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTK 278
+ +++ +K+ ++ +K K+K K K + ++E +K ++ KKK +
Sbjct: 514 KYNLEELEKKAEEYEKLKEKLIKLKGEI-----------KSLKKELEKLEELKKKLAELE 562
Query: 279 KKKRKKKRKKKK 290
KK + + + +
Sbjct: 563 KKLDELEEELAE 574
Score = 30.0 bits (68), Expect = 1.9
Identities = 14/68 (20%), Positives = 33/68 (48%), Gaps = 3/68 (4%)
Query: 225 KKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSR---GEEEEEKKKKKKKKKKKTTKKKK 281
+K+ + ++ KKK K+ +K+ + + ++ E E KK+ +K K+ +
Sbjct: 335 EKEERLEELKKKLKELEKRLEELEERHELYEEAKAKKEELERLKKRLTGLTPEKLEKELE 394
Query: 282 RKKKRKKK 289
+K K++
Sbjct: 395 ELEKAKEE 402
Score = 29.6 bits (67), Expect = 2.2
Identities = 14/84 (16%), Positives = 43/84 (51%), Gaps = 4/84 (4%)
Query: 222 MKKKKKR----KKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTT 277
+K+K + + ++ + R+ +K + IN +E E+K+++ ++ KK
Sbjct: 288 LKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEKEERLEELKKKL 347
Query: 278 KKKKRKKKRKKKKNQHHRDVWQKK 301
K+ +++ + +++++ + + KK
Sbjct: 348 KELEKRLEELEERHELYEEAKAKK 371
Score = 29.6 bits (67), Expect = 2.3
Identities = 14/69 (20%), Positives = 40/69 (57%), Gaps = 3/69 (4%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKK-KK 281
+ +K+ ++ +KK +++ ++ + ++ + E E+ +K++++ KKT +K K+
Sbjct: 644 ELRKELEELEKKYSEEEYEELREEYLE--LSRELAGLRAELEELEKRREEIKKTLEKLKE 701
Query: 282 RKKKRKKKK 290
++R+K K
Sbjct: 702 ELEEREKAK 710
Score = 29.6 bits (67), Expect = 2.3
Identities = 10/72 (13%), Positives = 33/72 (45%), Gaps = 8/72 (11%)
Query: 224 KKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRK 283
++ ++K+K+ ++ + + + ++ EE EK +K+ K+ ++ ++ +
Sbjct: 192 EELIKEKEKELEEVLREINEISSELPEL--------REELEKLEKEVKELEELKEEIEEL 243
Query: 284 KKRKKKKNQHHR 295
+K + R
Sbjct: 244 EKELESLEGSKR 255
Score = 29.3 bits (66), Expect = 3.1
Identities = 14/72 (19%), Positives = 31/72 (43%), Gaps = 7/72 (9%)
Query: 224 KKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKK----KKKKKKTTKK 279
+ +K + + K+K +K + R I + EE E+K K+ K+K ++ K
Sbjct: 242 ELEKELESLEGSKRKLEEKIRELEER---IEELKKEIEELEEKVKELKELKEKAEEYIKL 298
Query: 280 KKRKKKRKKKKN 291
+ ++ +
Sbjct: 299 SEFYEEYLDELR 310
Score = 28.9 bits (65), Expect = 3.7
Identities = 13/81 (16%), Positives = 37/81 (45%), Gaps = 3/81 (3%)
Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
+++ K+ ++ +K+ + K+ + I EE +K+ ++ ++K K K+ K
Sbjct: 233 LEELKEEIEELEKELESLEGSKRKLEEK---IRELEERIEELKKEIEELEEKVKELKELK 289
Query: 282 RKKKRKKKKNQHHRDVWQKKY 302
K + K ++ + + +
Sbjct: 290 EKAEEYIKLSEFYEEYLDELR 310
Score = 28.1 bits (63), Expect = 7.3
Identities = 7/34 (20%), Positives = 16/34 (47%)
Query: 262 EEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQHHR 295
++E +K ++ KKK +KK + ++
Sbjct: 545 KKELEKLEELKKKLAELEKKLDELEEELAELLKE 578
Score = 28.1 bits (63), Expect = 7.3
Identities = 14/73 (19%), Positives = 37/73 (50%), Gaps = 4/73 (5%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
+ +++ + ++ + + ++ + R+ + +EE E+++K KK+ +K K +R
Sbjct: 663 ELREEYLELSRELAGLRAELEELEKRREEIKKTLEKLKEELEEREKAKKELEKLEKALER 722
Query: 283 ----KKKRKKKKN 291
++K KK K
Sbjct: 723 VEELREKVKKYKA 735
Score = 28.1 bits (63), Expect = 7.9
Identities = 12/65 (18%), Positives = 26/65 (40%)
Query: 226 KKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKK 285
KK+ + +KK + ++ + S E EE K+ + + K K+
Sbjct: 555 KKKLAELEKKLDELEEELAELLKELEELGFESVEELEERLKELEPFYNEYLELKDAEKEL 614
Query: 286 RKKKK 290
+++K
Sbjct: 615 EREEK 619
Score = 27.7 bits (62), Expect = 9.6
Identities = 16/70 (22%), Positives = 36/70 (51%), Gaps = 7/70 (10%)
Query: 221 VMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKK 280
+ ++ K+ ++K+++ ++ K+K K+ + R EE E ++ K KK++ + K
Sbjct: 326 IEERIKELEEKEERLEELKKKLKELE-------KRLEELEERHELYEEAKAKKEELERLK 378
Query: 281 KRKKKRKKKK 290
KR +K
Sbjct: 379 KRLTGLTPEK 388
>gnl|CDD|218896 pfam06098, Radial_spoke_3, Radial spoke protein 3. This family
consists of several radial spoke protein 3 (RSP3)
sequences. Eukaryotic cilia and flagella present in
diverse types of cells perform motile, sensory, and
developmental functions in organisms from protists to
humans. They are centred by precisely organised,
microtubule-based structures, the axonemes. The axoneme
consists of two central singlet microtubules, called the
central pair, and nine outer doublet microtubules. These
structures are well-conserved during evolution. The
outer doublet microtubules, each composed of A and B
sub-fibres, are connected to each other by nexin links,
while the central pair is held at the centre of the
axoneme by radial spokes. The radial spokes are T-shaped
structures extending from the A-tubule of each outer
doublet microtubule to the centre of the axoneme. Radial
spoke protein 3 (RSP3), is present at the proximal end
of the spoke stalk and helps in anchoring the radial
spoke to the outer doublet. It is thought that radial
spokes regulate the activity of inner arm dynein through
protein phosphorylation and dephosphorylation.
Length = 288
Score = 32.3 bits (74), Expect = 0.25
Identities = 13/52 (25%), Positives = 36/52 (69%)
Query: 239 KRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKK 290
++++++ + R+ + T R EE E +++++K+++KK K++K+++K +K
Sbjct: 158 RQQQRQFEQRRNAELAETQRLEEAERRRREEKERRKKQDKERKQREKETAEK 209
>gnl|CDD|185768 cd09245, BRO1_UmRIM23-like, Protein-interacting, Bro1-like domain
of Ustilago maydis Rim23 (PalC), and related domains.
This family contains the Bro1-like domain of Ustilago
maydis Rim23 (also known as PalC), and related proteins.
It belongs to the BRO1_Alix_like superfamily which
includes the Bro1-like domains of mammalian Alix
(apoptosis-linked gene-2 interacting protein X),
His-Domain type N23 protein tyrosine phosphatase
(HD-PTP, also known as PTPN23), RhoA-binding proteins
Rhophilin-1 and -2, Brox, Bro1 and Rim20 (also known as
PalA) from Saccharomyces cerevisiae, and related
domains. Alix, HD-PTP, Brox, Bro1, Rim20, and Rim23
interact with the ESCRT (Endosomal Sorting Complexes
Required for Transport) system. Rim20 and Rim23
participate in the response to the external pH via the
Rim101 pathway. Through its Bro1-like domain, Rim23
allows the interaction between the endosomal and plasma
membrane complexes. Bro1-like domains are
boomerang-shape, and part of the domain is a
tetratricopeptide repeat (TPR)-like structure.
Intermediates in the Rim101 pathway may play roles in
the pathogenesis of fungal corneal infection during
Candida albicans keratitis. This family lacks the
V-shaped (V) domain found in many members of the
BRO1_Alix_like superfamily.
Length = 413
Score = 32.4 bits (74), Expect = 0.26
Identities = 14/58 (24%), Positives = 24/58 (41%), Gaps = 8/58 (13%)
Query: 252 WINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQHHR-------DVWQKKY 302
W+ R ++ E E+ K K K K K K++ +K + ++ KKY
Sbjct: 318 WL-RAAKKELEDLKSPSGVASKAKLKKSWKEKREDRKVEKGAGVEEELRTLEMLLKKY 374
>gnl|CDD|224668 COG1754, COG1754, Uncharacterized C-terminal domain of
topoisomerase IA [General function prediction only].
Length = 298
Score = 32.4 bits (74), Expect = 0.26
Identities = 15/60 (25%), Positives = 19/60 (31%), Gaps = 18/60 (30%)
Query: 231 KKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKK 290
++ K K+ KK K KK KK KK K K+ KK
Sbjct: 255 RRAKGGPGKKPAKKA------------------TAAKAKKTTAKKAAAKKAAKTKKAAKK 296
Score = 30.8 bits (70), Expect = 0.74
Identities = 13/28 (46%), Positives = 13/28 (46%)
Query: 265 KKKKKKKKKKKTTKKKKRKKKRKKKKNQ 292
KK K KKTT KK KK K K
Sbjct: 266 AKKATAAKAKKTTAKKAAAKKAAKTKKA 293
Score = 28.5 bits (64), Expect = 4.5
Identities = 11/29 (37%), Positives = 12/29 (41%)
Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRKK 288
E K KK KK T K +K KK
Sbjct: 253 AERRAKGGPGKKPAKKATAAKAKKTTAKK 281
Score = 28.1 bits (63), Expect = 5.1
Identities = 12/33 (36%), Positives = 14/33 (42%), Gaps = 2/33 (6%)
Query: 260 EEEEEKKKKKKKKKKKTTKKKKR--KKKRKKKK 290
E + KK KK T K K+ KK KK
Sbjct: 254 ERRAKGGPGKKPAKKATAAKAKKTTAKKAAAKK 286
>gnl|CDD|222571 pfam14153, Spore_coat_CotO, Spore coat protein CotO. Bacillus
spores are protected by a protein shell consisting of
over 50 different polypeptides, known as the coat. This
family of proteins has an important morphogenetic role
in coat assembly, it is involved in the assembly of at
least 5 different coat proteins including CotB, CotG,
CotS, CotSA and CotW. It is likely to act at a late
stage of coat assembly.
Length = 185
Score = 31.7 bits (72), Expect = 0.28
Identities = 13/82 (15%), Positives = 43/82 (52%)
Query: 220 FVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKK 279
F++KK +++++K+ + + K++ Q + E E + ++++K++ ++
Sbjct: 31 FIIKKADEKEEEKENSDEHVKSKEEEQKIEYEEAEKEKEAGEPEREDIAEQQEKEEIAQE 90
Query: 280 KKRKKKRKKKKNQHHRDVWQKK 301
++++++ + K Q +KK
Sbjct: 91 EEKEEEAEDVKQQEVFSFKRKK 112
Score = 29.0 bits (65), Expect = 1.9
Identities = 15/84 (17%), Positives = 35/84 (41%), Gaps = 13/84 (15%)
Query: 221 VMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGE-------------EEEEKKK 267
KK +K+++K+ + K K + + + E E+EE +
Sbjct: 30 TFIIKKADEKEEEKENSDEHVKSKEEEQKIEYEEAEKEKEAGEPEREDIAEQQEKEEIAQ 89
Query: 268 KKKKKKKKTTKKKKRKKKRKKKKN 291
+++K+++ K++ K+KK
Sbjct: 90 EEEKEEEAEDVKQQEVFSFKRKKP 113
>gnl|CDD|151665 pfam11223, DUF3020, Protein of unknown function (DUF3020). This
family of fungal proteins is conserved towards the
C-terminus of HMG domains. The function is not known.
Length = 49
Score = 29.5 bits (66), Expect = 0.28
Identities = 11/38 (28%), Positives = 19/38 (50%)
Query: 254 NRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKN 291
NR + + E ++ K ++ KKK KKK K+ +
Sbjct: 1 NRERKKKWREANSERNKDNDLRSRVKKKAKKKFGKEDS 38
>gnl|CDD|219913 pfam08576, DUF1764, Eukaryotic protein of unknown function
(DUF1764). This is a family of eukaryotic proteins of
unknown function. This family contains many hypothetical
proteins.
Length = 98
Score = 30.5 bits (69), Expect = 0.29
Identities = 13/28 (46%), Positives = 18/28 (64%)
Query: 265 KKKKKKKKKKKTTKKKKRKKKRKKKKNQ 292
KK+KKKKK+ T + K KK +KK +
Sbjct: 22 KKRKKKKKRTAKTARPKATKKGQKKDKK 49
Score = 30.1 bits (68), Expect = 0.45
Identities = 12/61 (19%), Positives = 27/61 (44%)
Query: 231 KKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKK 290
K++KK +K K+ + ++ + + + + K KK +K KKK + ++
Sbjct: 1 KEEKKNEKTDKRDIDDIFSNIKKRKKKKKRTAKTARPKATKKGQKKDKKKDEFPEFPEES 60
Query: 291 N 291
Sbjct: 61 K 61
Score = 29.0 bits (65), Expect = 1.1
Identities = 13/56 (23%), Positives = 28/56 (50%), Gaps = 11/56 (19%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTK 278
KK+KK+KK+ K + K KK + ++++E + ++ K++ T+
Sbjct: 22 KKRKKKKKRTAKTARPKATKKG-----------QKKDKKKDEFPEFPEESKRRRTE 66
Score = 28.2 bits (63), Expect = 1.9
Identities = 18/69 (26%), Positives = 35/69 (50%), Gaps = 4/69 (5%)
Query: 235 KKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQHH 294
K++KK +K + D++ N R ++KKK+ K + KK +KK +KK +
Sbjct: 1 KEEKKNEKTDKRDIDDIFSNIKKR----KKKKKRTAKTARPKATKKGQKKDKKKDEFPEF 56
Query: 295 RDVWQKKYT 303
+ +++ T
Sbjct: 57 PEESKRRRT 65
Score = 27.0 bits (60), Expect = 4.4
Identities = 18/72 (25%), Positives = 27/72 (37%), Gaps = 21/72 (29%)
Query: 217 IPGFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKT 276
I KK++KKKK+ K + K KK +KK KKK
Sbjct: 14 IDDIFSNIKKRKKKKKRTAKTARPKAT---------------------KKGQKKDKKKDE 52
Query: 277 TKKKKRKKKRKK 288
+ + KR++
Sbjct: 53 FPEFPEESKRRR 64
Score = 26.7 bits (59), Expect = 6.5
Identities = 10/40 (25%), Positives = 17/40 (42%), Gaps = 1/40 (2%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINR-TSRGEE 261
KK+ + + K KK ++K KK + R E+
Sbjct: 28 KKRTAKTARPKATKKGQKKDKKKDEFPEFPEESKRRRTED 67
>gnl|CDD|227476 COG5147, REB1, Myb superfamily proteins, including transcription
factors and mRNA splicing factors [Transcription / RNA
processing and modification / Cell division and
chromosome partitioning].
Length = 512
Score = 32.5 bits (74), Expect = 0.30
Identities = 14/79 (17%), Positives = 26/79 (32%), Gaps = 6/79 (7%)
Query: 224 KKKKRK-KKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
+ + K KK R KK H N + + K+++++ K +
Sbjct: 255 DRDEDKFWPNIYKKLPYRDKKSIYKHLRRKYNIF--EQRGKWTKEEEQELAKLVVEHGGS 312
Query: 283 KKKRKK---KKNQHHRDVW 298
+ K + RD W
Sbjct: 313 WTEIGKLLGRMPNDCRDRW 331
>gnl|CDD|185611 PTZ00428, PTZ00428, 60S ribosomal protein L4; Provisional.
Length = 381
Score = 32.3 bits (74), Expect = 0.30
Identities = 17/67 (25%), Positives = 31/67 (46%), Gaps = 3/67 (4%)
Query: 230 KKKKKKKKKKRKKKKN---QHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKR 286
+ KKK K+R +KKN + +N ++ + ++K K K K R+ ++
Sbjct: 302 RPAKKKPAKRRVQKKNPLKNRRVRLRLNPAAKKLRKLAVLAQEKATAKGAQKVKNRRARK 361
Query: 287 KKKKNQH 293
K KK +
Sbjct: 362 KAKKARL 368
Score = 29.7 bits (67), Expect = 1.9
Identities = 16/70 (22%), Positives = 30/70 (42%), Gaps = 1/70 (1%)
Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
K K+R +KK K ++ + + N + + E+ K +K K ++ KK K
Sbjct: 306 KKPAKRRVQKKNPLKNRRVRLRLNPAAKKLRKLAVLAQEKATAKGAQK-VKNRRARKKAK 364
Query: 282 RKKKRKKKKN 291
+ + K K
Sbjct: 365 KARLAKVAKA 374
Score = 27.7 bits (62), Expect = 8.2
Identities = 8/44 (18%), Positives = 19/44 (43%), Gaps = 3/44 (6%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKK 266
++K K +K K +R +KK + + + ++ + K
Sbjct: 341 LAQEKATAKGAQKVKNRRARKKAKK---ARLAKVAKALYKSIKA 381
>gnl|CDD|119392 cd06174, MFS, The Major Facilitator Superfamily (MFS) is a large
and diverse group of secondary transporters that
includes uniporters, symporters, and antiporters. MFS
proteins facilitate the transport across cytoplasmic or
internal membranes of a variety of substrates including
ions, sugar phosphates, drugs, neurotransmitters,
nucleosides, amino acids, and peptides. They do so using
the electrochemical potential of the transported
substrates. Uniporters transport a single substrate,
while symporters and antiporters transport two
substrates in the same or in opposite directions,
respectively, across membranes. MFS proteins are
typically 400 to 600 amino acids in length, and the
majority contain 12 transmembrane alpha helices (TMs)
connected by hydrophilic loops. The N- and C-terminal
halves of these proteins display weak similarity and may
be the result of a gene duplication/fusion event. Based
on kinetic studies and the structures of a few bacterial
superfamily members, GlpT (glycerol-3-phosphate
transporter), LacY (lactose permease), and EmrD
(multidrug transporter), MFS proteins are thought to
function through a single substrate binding site,
alternating-access mechanism involving a rocker-switch
type of movement. Bacterial members function primarily
for nutrient uptake, and as drug-efflux pumps to confer
antibiotic resistance. Some MFS proteins have medical
significance in humans such as the glucose transporter
Glut4, which is impaired in type II diabetes, and
glucose-6-phosphate transporter (G6PT), which causes
glycogen storage disease when mutated.
Length = 352
Score = 32.3 bits (74), Expect = 0.30
Identities = 18/97 (18%), Positives = 35/97 (36%), Gaps = 6/97 (6%)
Query: 11 YTYNINYTSTNFSNTSLFPTNSTIHIHIFPFIIK--GMLMCFIIITAILGNLLVIISV-- 66
+ + +F L + G+L+ + ILG LL +
Sbjct: 178 LLLALAFFLLSFGYYGLLTYLPLYLQEVLGLSAAEAGLLLSLFGLGGILGALLGGLLSDR 237
Query: 67 IKHRKLRIITNYFVVSLAFADLLVALCVMPFNAIVSL 103
+ R+L ++ + +L LL+AL +V+L
Sbjct: 238 LGRRRLLLLIGLLLAALGL--LLLALAPSLALLLVAL 272
Score = 30.0 bits (68), Expect = 1.5
Identities = 15/107 (14%), Positives = 34/107 (31%), Gaps = 7/107 (6%)
Query: 22 FSNTSLFPTNSTIHIHIFPFIIKGMLMCFIIITAILGNLL--VIISVIKHRKLRIITNYF 79
+L+P + + FP +G + LG LL ++ ++
Sbjct: 99 LGGGALYPAAAALIAEWFPPKERGRALGLFSAGFGLGALLGPLLGGLLAESLGWRWLFLI 158
Query: 80 VVSLAFADLLVALCVMPFNAIVSLTDEWYFGYFMCDVWNSFDVYFST 126
+ L L+ L ++ +++L +F + Y
Sbjct: 159 LAILGLLLALLLLFLLRLLLLLALA---FFLLSF--GYYGLLTYLPL 200
>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
recombination, and repair].
Length = 908
Score = 32.4 bits (74), Expect = 0.30
Identities = 7/79 (8%), Positives = 35/79 (44%), Gaps = 1/79 (1%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
+ ++ ++ + + + ++ + + R + EE+ EK + + ++ + + +
Sbjct: 299 EIEELEEELEGLRALLEELEELLEKLKS-LEERLEKLEEKLEKLESELEELAEEKNELAK 357
Query: 283 KKKRKKKKNQHHRDVWQKK 301
+ + K+ + + +K+
Sbjct: 358 LLEERLKELEERLEELEKE 376
>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain. Axonemal
dynein light chain proteins play a dynamic role in
flagellar and cilia motility. Eukaryotic cilia and
flagella are complex organelles consisting of a core
structure, the axoneme, which is composed of nine
microtubule doublets forming a cylinder that surrounds a
pair of central singlet microtubules. This
ultra-structural arrangement seems to be one of the most
stable micro-tubular assemblies known and is responsible
for the flagellar and ciliary movement of a large number
of organisms ranging from protozoan to mammals. This
light chain interacts directly with the N-terminal half
of the heavy chains.
Length = 189
Score = 31.4 bits (72), Expect = 0.32
Identities = 19/94 (20%), Positives = 42/94 (44%), Gaps = 17/94 (18%)
Query: 205 YYALISSSISFWIPGFVMKKKKKRKKKK----KKKKKKKRKKKKNQHHRDVWINR----T 256
Y L SS+ F M+K + ++ K ++ KK + +K++ + +
Sbjct: 100 YQTLYESSV-----AFGMRKALQAEQGKSELEQEIKKLEEEKEELEKRVAELEAKLEAIE 154
Query: 257 SRGEEEEEKKKKKKKKK----KKTTKKKKRKKKR 286
R EEE + ++K+ + KK ++ K + ++
Sbjct: 155 KREEEERQIEEKRHADEIAFLKKQNQQLKSQLEQ 188
>gnl|CDD|192292 pfam09429, Wbp11, WW domain binding protein 11. The WW domain is a
small protein module with a triple-stranded beta-sheet
fold. This is a family of WW domain binding proteins.
Length = 78
Score = 29.9 bits (68), Expect = 0.32
Identities = 15/66 (22%), Positives = 36/66 (54%), Gaps = 5/66 (7%)
Query: 225 KKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKK 284
+K++KKK+ KK K +R+ + R+ + + + + E K K +++ + +++K
Sbjct: 11 RKEQKKKELKKNKAERQAR-----REAKLAKKNPDRLQRELDKLKAQEESGKLRPVEKQK 65
Query: 285 KRKKKK 290
+K +K
Sbjct: 66 LKKLEK 71
Score = 27.6 bits (62), Expect = 1.9
Identities = 16/66 (24%), Positives = 34/66 (51%), Gaps = 10/66 (15%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
KK+ K+ K +++ +++ + KKN R + E +K K +++ K +K++
Sbjct: 16 KKELKKNKAERQARREAKLAKKN----------PDRLQRELDKLKAQEESGKLRPVEKQK 65
Query: 283 KKKRKK 288
KK +K
Sbjct: 66 LKKLEK 71
Score = 26.5 bits (59), Expect = 5.2
Identities = 10/31 (32%), Positives = 20/31 (64%)
Query: 261 EEEEKKKKKKKKKKKTTKKKKRKKKRKKKKN 291
+ K++KKK+ KK +++ R++ + KKN
Sbjct: 8 DAYRKEQKKKELKKNKAERQARREAKLAKKN 38
>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
Length = 651
Score = 32.3 bits (74), Expect = 0.32
Identities = 10/31 (32%), Positives = 21/31 (67%)
Query: 262 EEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQ 292
E+E+K+ K++K+ KK+ +KK+K+ +
Sbjct: 556 EKEEKEALKEQKRLRKLKKQEEKKKKELEKL 586
Score = 31.2 bits (71), Expect = 0.76
Identities = 14/51 (27%), Positives = 28/51 (54%), Gaps = 16/51 (31%)
Query: 225 KKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKK 275
++++++K+ K++K+ +K K Q EEKKKK+ +K +K
Sbjct: 554 QREKEEKEALKEQKRLRKLKKQ----------------EEKKKKELEKLEK 588
Score = 30.8 bits (70), Expect = 0.95
Identities = 12/30 (40%), Positives = 23/30 (76%)
Query: 261 EEEEKKKKKKKKKKKTTKKKKRKKKRKKKK 290
E+EEK+ K++K+ + KK++ KKK++ +K
Sbjct: 556 EKEEKEALKEQKRLRKLKKQEEKKKKELEK 585
Score = 30.8 bits (70), Expect = 1.0
Identities = 11/30 (36%), Positives = 23/30 (76%)
Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRKKK 289
E+EE++ K++K+ +K K++++KKK +K
Sbjct: 556 EKEEKEALKEQKRLRKLKKQEEKKKKELEK 585
Score = 30.4 bits (69), Expect = 1.2
Identities = 12/31 (38%), Positives = 21/31 (67%)
Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRKKKK 290
+E +++K+ +K KK+ KKKK +K +K K
Sbjct: 560 KEALKEQKRLRKLKKQEEKKKKELEKLEKAK 590
Score = 30.4 bits (69), Expect = 1.3
Identities = 11/31 (35%), Positives = 23/31 (74%)
Query: 258 RGEEEEEKKKKKKKKKKKTTKKKKRKKKRKK 288
R +EE+E K++K+ +K +++K+KK+ +K
Sbjct: 555 REKEEKEALKEQKRLRKLKKQEEKKKKELEK 585
Score = 30.4 bits (69), Expect = 1.5
Identities = 12/31 (38%), Positives = 22/31 (70%)
Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRKKKK 290
EE+E K++K+ +K K ++KK+K+ K +K
Sbjct: 558 EEKEALKEQKRLRKLKKQEEKKKKELEKLEK 588
Score = 29.6 bits (67), Expect = 2.5
Identities = 11/32 (34%), Positives = 23/32 (71%)
Query: 258 RGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKK 289
+ E+E K++K+ +K KK +KKK++ ++ +K
Sbjct: 557 KEEKEALKEQKRLRKLKKQEEKKKKELEKLEK 588
Score = 28.5 bits (64), Expect = 5.8
Identities = 10/41 (24%), Positives = 28/41 (68%)
Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQHHRDVWQK 300
+++EE +++K++K+ +K+ RK K++++K + + +K
Sbjct: 548 DDKEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEK 588
Score = 28.1 bits (63), Expect = 7.1
Identities = 19/90 (21%), Positives = 36/90 (40%), Gaps = 22/90 (24%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWIN----------------------RTSRGE 260
+ K++K+ +K KK++++KKK+ + I + GE
Sbjct: 561 EALKEQKRLRKLKKQEEKKKKELEKLEKAKIPPAEFFKRQEDKYSAFDETGLPTHDADGE 620
Query: 261 EEEEKKKKKKKKKKKTTKKKKRKKKRKKKK 290
E +K++KK K+ K + K K
Sbjct: 621 EISKKERKKLSKEYDKQAKLHEEYLAKGGK 650
Score = 27.7 bits (62), Expect = 9.9
Identities = 18/81 (22%), Positives = 33/81 (40%), Gaps = 13/81 (16%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEE---EEEKKKKK--------KK 271
+K+ +++K+ +K KK+ +KKK + + E +E K
Sbjct: 559 EKEALKEQKRLRKLKKQEEKKKKELEK--LEKAKIPPAEFFKRQEDKYSAFDETGLPTHD 616
Query: 272 KKKKTTKKKKRKKKRKKKKNQ 292
+ KK+RKK K+ Q
Sbjct: 617 ADGEEISKKERKKLSKEYDKQ 637
>gnl|CDD|240254 PTZ00069, PTZ00069, 60S ribosomal protein L5; Provisional.
Length = 300
Score = 32.0 bits (73), Expect = 0.33
Identities = 17/59 (28%), Positives = 29/59 (49%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
KK + K KKK+KKKK H+ + + + + K KK +++++ KK K
Sbjct: 242 KKAHAAIRANPSKVKKKKKKKKKVVHKKYKTKKLTGKQRKARVKAKKAQRRERLQKKIK 300
Score = 29.7 bits (67), Expect = 1.7
Identities = 19/44 (43%), Positives = 25/44 (56%), Gaps = 3/44 (6%)
Query: 261 EEEEKKKKKKKKKKKTTKKK---KRKKKRKKKKNQHHRDVWQKK 301
+++KKKKKK KK KK K++K R K K R+ QKK
Sbjct: 255 VKKKKKKKKKVVHKKYKTKKLTGKQRKARVKAKKAQRRERLQKK 298
Score = 29.7 bits (67), Expect = 2.1
Identities = 15/27 (55%), Positives = 17/27 (62%)
Query: 264 EKKKKKKKKKKKTTKKKKRKKKRKKKK 290
K KKKKKKKKK KK + KK K+
Sbjct: 253 SKVKKKKKKKKKVVHKKYKTKKLTGKQ 279
Score = 28.5 bits (64), Expect = 4.2
Identities = 16/66 (24%), Positives = 31/66 (46%), Gaps = 7/66 (10%)
Query: 225 KKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKK 284
KK + K K+KKKK + + + ++ K++K + K K ++R++
Sbjct: 242 KKAHAAIRANPSKVKKKKKKKKK-------VVHKKYKTKKLTGKQRKARVKAKKAQRRER 294
Query: 285 KRKKKK 290
+KK K
Sbjct: 295 LQKKIK 300
>gnl|CDD|236081 PRK07735, PRK07735, NADH dehydrogenase subunit C; Validated.
Length = 430
Score = 31.9 bits (72), Expect = 0.37
Identities = 17/67 (25%), Positives = 22/67 (32%)
Query: 230 KKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKK 289
K K K K K T EEEE+ K+K K K K + K+
Sbjct: 124 KAAAKAKAAAAAKAKAAALAKQKREGTEEVTEEEEETDKEKAKAKAAAAAKAKAAALAKQ 183
Query: 290 KNQHHRD 296
K +
Sbjct: 184 KAAEAGE 190
Score = 30.7 bits (69), Expect = 0.86
Identities = 18/61 (29%), Positives = 21/61 (34%)
Query: 229 KKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKK 288
K K K K K + EEEEE K+K K K K K K+
Sbjct: 124 KAAAKAKAAAAAKAKAAALAKQKREGTEEVTEEEEETDKEKAKAKAAAAAKAKAAALAKQ 183
Query: 289 K 289
K
Sbjct: 184 K 184
Score = 30.3 bits (68), Expect = 1.3
Identities = 14/63 (22%), Positives = 17/63 (26%)
Query: 225 KKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKK 284
K K K K K Q E ++EK K K K K+
Sbjct: 124 KAAAKAKAAAAAKAKAAALAKQKREGTEEVTEEEEETDKEKAKAKAAAAAKAKAAALAKQ 183
Query: 285 KRK 287
K
Sbjct: 184 KAA 186
Score = 28.4 bits (63), Expect = 4.5
Identities = 15/67 (22%), Positives = 23/67 (34%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
+++K K K K K + + EEE +K+K K K K
Sbjct: 120 TEEEKAAAKAKAAAAAKAKAAALAKQKREGTEEVTEEEEETDKEKAKAKAAAAAKAKAAA 179
Query: 283 KKKRKKK 289
K+K
Sbjct: 180 LAKQKAA 186
>gnl|CDD|216782 pfam01914, MarC, MarC family integral membrane protein. Integral
membrane protein family that includes the antibiotic
resistance protein MarC. These proteins may be
transporters.
Length = 203
Score = 31.5 bits (72), Expect = 0.39
Identities = 19/76 (25%), Positives = 28/76 (36%), Gaps = 13/76 (17%)
Query: 39 FPFIIKGMLMCFIIITAILGNLLVIISVIKH--RKLRIITNYFVVSLAFADLLVALCVMP 96
FI L F II I GN+ V IS+ ++ + R +AF LL+ L
Sbjct: 1 LKFIFSAFLSLFAIINPI-GNVPVFISLTENYPAEERKRIILRASIIAFLILLLFL---- 55
Query: 97 FNAIVSLTDEWYFGYF 112
+ + F
Sbjct: 56 ------VFGKLILKLF 65
>gnl|CDD|177433 PHA02608, 67, prohead core protein; Provisional.
Length = 80
Score = 29.8 bits (67), Expect = 0.39
Identities = 7/44 (15%), Positives = 11/44 (25%)
Query: 247 HHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKK 290
V I R+ E EE + + K +
Sbjct: 33 EEEKVEIARSVMIEGEEPEDDDDDEDDDDDDDKDDKDDDDDDDD 76
>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
unknown].
Length = 652
Score = 32.0 bits (73), Expect = 0.41
Identities = 11/71 (15%), Positives = 38/71 (53%)
Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
+K++ + K++ +K + + ++ + + V +R R + ++ +K+ ++KK ++
Sbjct: 441 LKRELEELKREIEKLESELERFRREVRDKVRKDREIRARDRRIERLEKELEEKKKRVEEL 500
Query: 282 RKKKRKKKKNQ 292
+K + +K +
Sbjct: 501 ERKLAELRKMR 511
>gnl|CDD|235040 PRK02463, PRK02463, OxaA-like protein precursor; Provisional.
Length = 307
Score = 31.6 bits (72), Expect = 0.41
Identities = 15/66 (22%), Positives = 30/66 (45%), Gaps = 4/66 (6%)
Query: 217 IPGFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKT 276
I +++K + +++ ++ K + K +DV T + KKKKK +
Sbjct: 246 ITTYILKPRLRKQIAEEFAKNPPKAYKSPSGRKDV----TPSQNQAIASLPKKKKKNRNA 301
Query: 277 TKKKKR 282
K++KR
Sbjct: 302 GKQRKR 307
>gnl|CDD|215239 PLN02436, PLN02436, cellulose synthase A.
Length = 1094
Score = 32.1 bits (73), Expect = 0.41
Identities = 15/26 (57%), Positives = 19/26 (73%)
Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQH 247
+KKKK+KK K+KKKKK R+ K H
Sbjct: 683 SRKKKKKKKSKEKKKKKNREASKQIH 708
Score = 31.4 bits (71), Expect = 0.74
Identities = 14/25 (56%), Positives = 19/25 (76%)
Query: 224 KKKKRKKKKKKKKKKKRKKKKNQHH 248
+KKK+KKK K+KKKKK ++ Q H
Sbjct: 684 RKKKKKKKSKEKKKKKNREASKQIH 708
Score = 30.6 bits (69), Expect = 1.2
Identities = 14/25 (56%), Positives = 19/25 (76%)
Query: 269 KKKKKKKTTKKKKRKKKRKKKKNQH 293
+KKKKKK +K+KK+KK R+ K H
Sbjct: 684 RKKKKKKKSKEKKKKKNREASKQIH 708
Score = 29.1 bits (65), Expect = 4.2
Identities = 12/25 (48%), Positives = 17/25 (68%)
Query: 270 KKKKKKTTKKKKRKKKRKKKKNQHH 294
+KKKKK K+K+KKK ++ Q H
Sbjct: 684 RKKKKKKKSKEKKKKKNREASKQIH 708
Score = 28.3 bits (63), Expect = 6.6
Identities = 12/23 (52%), Positives = 18/23 (78%)
Query: 268 KKKKKKKKTTKKKKRKKKRKKKK 290
+KKKKKKK+ +KKK+K + K+
Sbjct: 684 RKKKKKKKSKEKKKKKNREASKQ 706
Score = 28.3 bits (63), Expect = 7.0
Identities = 12/23 (52%), Positives = 17/23 (73%)
Query: 267 KKKKKKKKKTTKKKKRKKKRKKK 289
+KKKKKKK KKKK+ ++ K+
Sbjct: 684 RKKKKKKKSKEKKKKKNREASKQ 706
Score = 28.3 bits (63), Expect = 7.4
Identities = 13/23 (56%), Positives = 17/23 (73%)
Query: 266 KKKKKKKKKKTTKKKKRKKKRKK 288
+KKKKKKK K KKKK ++ K+
Sbjct: 684 RKKKKKKKSKEKKKKKNREASKQ 706
>gnl|CDD|223683 COG0610, COG0610, Type I site-specific restriction-modification
system, R (restriction) subunit and related helicases
[Defense mechanisms].
Length = 962
Score = 32.0 bits (73), Expect = 0.42
Identities = 11/71 (15%), Positives = 25/71 (35%), Gaps = 3/71 (4%)
Query: 229 KKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEK---KKKKKKKKKKTTKKKKRKKK 285
+ K + K+ + WI + ++ E+ + + KK T +K +
Sbjct: 823 QTFNDKNGAYESLKELIERIIKEWIEDLRQKKKLIERLIEAINQYRAKKLDTAEKLEELY 882
Query: 286 RKKKKNQHHRD 296
KK + +
Sbjct: 883 ILAKKEEEFKQ 893
>gnl|CDD|221673 pfam12626, PolyA_pol_arg_C, Polymerase A arginine-rich C-terminus.
The C-terminus of polymerase A in E coli is
arginine-rich and is necessary for full functioning of
the enzyme.
Length = 123
Score = 30.2 bits (69), Expect = 0.45
Identities = 7/31 (22%), Positives = 20/31 (64%)
Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRKKKK 290
+EE+++ + + + K++R++ R++KK
Sbjct: 93 ADEEQRRAMIEALQGREGGKRRRRRPRRRKK 123
>gnl|CDD|217476 pfam03286, Pox_Ag35, Pox virus Ag35 surface protein.
Length = 198
Score = 31.3 bits (71), Expect = 0.46
Identities = 11/24 (45%), Positives = 15/24 (62%)
Query: 267 KKKKKKKKKTTKKKKRKKKRKKKK 290
K+ KKK+ TT +K K+ KKK
Sbjct: 48 PKQPKKKRPTTPRKPATTKKSKKK 71
Score = 29.0 bits (65), Expect = 2.5
Identities = 11/24 (45%), Positives = 14/24 (58%)
Query: 266 KKKKKKKKKKTTKKKKRKKKRKKK 289
KKK+ +K KK KKK K+K
Sbjct: 52 KKKRPTTPRKPATTKKSKKKDKEK 75
Score = 27.8 bits (62), Expect = 6.0
Identities = 10/24 (41%), Positives = 15/24 (62%)
Query: 265 KKKKKKKKKKKTTKKKKRKKKRKK 288
KKK+ +K T KK +KK ++K
Sbjct: 52 KKKRPTTPRKPATTKKSKKKDKEK 75
Score = 27.8 bits (62), Expect = 6.4
Identities = 11/24 (45%), Positives = 15/24 (62%)
Query: 264 EKKKKKKKKKKKTTKKKKRKKKRK 287
+KK+ +K TTKK K+K K K
Sbjct: 52 KKKRPTTPRKPATTKKSKKKDKEK 75
>gnl|CDD|177120 MTH00045, ND6, NADH dehydrogenase subunit 6; Validated.
Length = 162
Score = 30.6 bits (70), Expect = 0.48
Identities = 31/138 (22%), Positives = 57/138 (41%), Gaps = 44/138 (31%)
Query: 46 MLMCFIIITAILGNLLVIISVIKHRKLRIITNYF------VVSLAFADLLVALCVMPFNA 99
M+ +I + G+ LV S ++ Y+ +VSL+ LL++L + F A
Sbjct: 1 MVFYLFLILMLFGSTLVFYS---------LSPYYSALGLVLVSLSGC-LLLSLLGLSFLA 50
Query: 100 IVSLTDEWYFGYFMCDVWNSFDVY-FSTASILHLCCISVDRYYAIVKPLQYPIIMNQN-- 156
++ L Y G + V+ +S+A +S +R YP + N
Sbjct: 51 LLLLL--IYMGGMLV-------VFVYSSA-------LSAER---------YPSVSNLGEV 85
Query: 157 TVLLMLSNVWILPGIISF 174
VL +L ++W+ +F
Sbjct: 86 LVLSLLLSLWVFISFDNF 103
>gnl|CDD|225620 COG3078, COG3078, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 169
Score = 31.0 bits (70), Expect = 0.50
Identities = 17/71 (23%), Positives = 30/71 (42%), Gaps = 11/71 (15%)
Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
MK+ KK ++ + K KK R+K R E ++K++KK+K + +
Sbjct: 1 MKRSKKTRRPRSKADKKARRK-----------TREELDAEARDRKRQKKRKGLASGSRHS 49
Query: 282 RKKKRKKKKNQ 292
+ K Q
Sbjct: 50 GGNENSGNKQQ 60
Score = 27.5 bits (61), Expect = 6.5
Identities = 9/36 (25%), Positives = 22/36 (61%)
Query: 266 KKKKKKKKKKTTKKKKRKKKRKKKKNQHHRDVWQKK 301
K+ KK ++ ++ KK ++K +++ + RD ++K
Sbjct: 2 KRSKKTRRPRSKADKKARRKTREELDAEARDRKRQK 37
>gnl|CDD|219111 pfam06625, DUF1151, Protein of unknown function (DUF1151). This
family consists of several hypothetical eukaryotic
proteins of unknown function.
Length = 122
Score = 30.1 bits (68), Expect = 0.51
Identities = 12/56 (21%), Positives = 39/56 (69%), Gaps = 1/56 (1%)
Query: 221 VMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKT 276
V++K+K+ + K++K++++ K+ +++ R++ + R + E+ E +K K+ +++++
Sbjct: 56 VLEKRKRDQVLKEQKEEEEAKRLQSELEREL-MKRAQKLEQLELEKAKEDEEQEQI 110
>gnl|CDD|216985 pfam02349, MSG, Major surface glycoprotein. This is a novel repeat
in Pneumocystis carinii Major surface glycoprotein (MSG)
some members of the alignment have up to nine repeats of
this family, the repeats containing several conserved
cysteines. The MSG of P. carinii is an important protein
in host-pathogen interactions. Surface glycoprotein A
from Pneumocystis carinii is a main target for the host
immune system, this protein is implicated in the
attachment of Pneumocystis carinii to the host alveolar
epithelial cells, alveolar macrophages, host surfactant
and possibly accounts in part for the hypoxia seen in
Pneumocystis carinii pneumonia (PCP).
Length = 81
Score = 29.3 bits (66), Expect = 0.53
Identities = 18/62 (29%), Positives = 33/62 (53%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
++ K+K +K K+K K K + + + +GE +++KK KK K+K T KK+
Sbjct: 20 EEACPLLKEKCEKLKEKCYKLKKDLKLEELLLKELKGELKKKKKCKKALKEKCTKLKKES 79
Query: 283 KK 284
+
Sbjct: 80 NE 81
>gnl|CDD|224748 COG1835, COG1835, Predicted acyltransferases [Lipid metabolism].
Length = 386
Score = 31.3 bits (71), Expect = 0.54
Identities = 34/177 (19%), Positives = 75/177 (42%), Gaps = 12/177 (6%)
Query: 38 IFPFIIKGMLMCFIIITAILGNLLVIISVIKHRKLRIITNYFVVSLAFADLLVALCVMPF 97
+F F++ G L+ ++ + ++ + + + R RI+ VV L A + L ++P
Sbjct: 55 LF-FVLSGFLITRSLLRSAAAPVISLAAFLARRLRRILPALLVVLLLTA--ALLLALLPP 111
Query: 98 NAIVSLTDEWYFGYFMCDVWNSFDV--YFSTASILHLCC----ISVDRYYAIVKPLQYPI 151
A +L + +++ ++ + V YF+ + +L +SV+ + ++ PL +
Sbjct: 112 GASAALWADLAALFYLSNLLLTLAVGAYFAASGLLSPLLHLWSLSVEEQFYLLFPLLLLL 171
Query: 152 IMNQNTVLLMLSNVWILPGIISFTPIMLGWYTTPDHKQYRKHHPNVCIFVVNVYYAL 208
+ L +L + L F + LG + T Y + F++ AL
Sbjct: 172 LRRVRFALALLLLLLAL--ASFFLLLALGPFATGAALYYGTATR-LWEFLLGALAAL 225
>gnl|CDD|241486 cd13332, FERM_C_JAK1, Janus kinase 1 FERM domain C-lobe. JAK1 is a
tyrosine kinase protein essential in signaling type I
and type II cytokines. It interacts with the gamma chain
of type I cytokine receptors to elicit signals from the
IL-2 receptor family, the IL-4 receptor family, the
gp130 receptor family, ciliary neurotrophic factor
receptor (CNTF-R), neurotrophin-1 receptor (NNT-1R) and
Leptin-R). It also is involved in transducing a signal
by type I (IFN-alpha/beta) and type II (IFN-gamma)
interferons, and members of the IL-10 family via type II
cytokine receptors. JAK (also called Just Another
Kinase) is a family of intracellular, non-receptor
tyrosine kinases that transduce cytokine-mediated
signals via the JAK-STAT pathway. The JAK family in
mammals consists of 4 members: JAK1, JAK2, JAK3 and
TYK2. JAKs are composed of seven JAK homology (JH)
domains (JH1-JH7) . The C-terminal JH1 domain is the
main catalytic domain, followed by JH2, which is often
referred to as a pseudokinase domain, followed by
JH3-JH4 which is homologous to the SH2 domain, and
lastly JH5-JH7 which is a FERM domain. Named after
Janus, the two-faced Roman god of doorways, JAKs possess
two near-identical phosphate-transferring domains; one
which displays the kinase activity (JH1), while the
other negatively regulates the kinase activity of the
first (JH2). The FERM domain has a cloverleaf tripart
structure (FERM_N, FERM_M, FERM_C/N, alpha-, and
C-lobe/A-lobe,A-lobe, B-lobe, C-lobe/F1, F2, F3). The
C-lobe/F3 within the FERM domain is part of the PH
domain family. The FERM domain is found in the
cytoskeletal-associated proteins such as ezrin, moesin,
radixin, 4.1R, and merlin. These proteins provide a link
between the membrane and cytoskeleton and are involved
in signal transduction pathways. The FERM domain is also
found in protein tyrosine phosphatases (PTPs) , the
tyrosine kinases FAK and JAK, in addition to other
proteins involved in signaling. This domain is
structurally similar to the PH and PTB domains and
consequently is capable of binding to both peptides and
phospholipids at different sites.
Length = 198
Score = 31.0 bits (70), Expect = 0.55
Identities = 17/34 (50%), Positives = 19/34 (55%), Gaps = 3/34 (8%)
Query: 221 VMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWIN 254
V KKKK + KK K K KK KKK R+ W N
Sbjct: 89 VEKKKKGKSKKNKLKGKKDEDKKKA---REGWNN 119
Score = 29.4 bits (66), Expect = 1.6
Identities = 13/25 (52%), Positives = 15/25 (60%)
Query: 264 EKKKKKKKKKKKTTKKKKRKKKRKK 288
EKKKK K KK K KK KK+ +
Sbjct: 90 EKKKKGKSKKNKLKGKKDEDKKKAR 114
Score = 28.6 bits (64), Expect = 2.7
Identities = 13/35 (37%), Positives = 19/35 (54%)
Query: 252 WINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKR 286
W + + E++KK K KK K K K + +KK R
Sbjct: 80 WRRKPATTAVEKKKKGKSKKNKLKGKKDEDKKKAR 114
Score = 27.9 bits (62), Expect = 4.9
Identities = 11/28 (39%), Positives = 16/28 (57%)
Query: 267 KKKKKKKKKTTKKKKRKKKRKKKKNQHH 294
KKKK K KK K K+ + +KK + +
Sbjct: 91 KKKKGKSKKNKLKGKKDEDKKKAREGWN 118
Score = 27.9 bits (62), Expect = 6.0
Identities = 12/24 (50%), Positives = 15/24 (62%)
Query: 266 KKKKKKKKKKTTKKKKRKKKRKKK 289
KKKK K KK K KK + K+K +
Sbjct: 91 KKKKGKSKKNKLKGKKDEDKKKAR 114
Score = 27.1 bits (60), Expect = 9.3
Identities = 13/25 (52%), Positives = 16/25 (64%)
Query: 263 EEKKKKKKKKKKKTTKKKKRKKKRK 287
E+KKK K KK K KK + KKK +
Sbjct: 90 EKKKKGKSKKNKLKGKKDEDKKKAR 114
>gnl|CDD|222592 pfam14204, Ribosomal_L18_c, Ribosomal L18 C-terminal region. This
domain is the C-terminal end of ribosomal L18/L5
proteins.
Length = 93
Score = 29.4 bits (67), Expect = 0.58
Identities = 9/21 (42%), Positives = 14/21 (66%)
Query: 264 EKKKKKKKKKKKTTKKKKRKK 284
+KKK+KK+ K +K+ KK
Sbjct: 60 SRKKKEKKEVKAESKRYNAKK 80
Score = 28.7 bits (65), Expect = 1.0
Identities = 8/20 (40%), Positives = 14/20 (70%)
Query: 223 KKKKKRKKKKKKKKKKKRKK 242
+KKK++K+ K + K+ KK
Sbjct: 61 RKKKEKKEVKAESKRYNAKK 80
Score = 27.5 bits (62), Expect = 2.5
Identities = 9/20 (45%), Positives = 13/20 (65%)
Query: 224 KKKKRKKKKKKKKKKKRKKK 243
+KKK KK+ K + K+ KK
Sbjct: 61 RKKKEKKEVKAESKRYNAKK 80
Score = 27.1 bits (61), Expect = 3.7
Identities = 9/24 (37%), Positives = 14/24 (58%)
Query: 266 KKKKKKKKKKTTKKKKRKKKRKKK 289
+KKK+KK+ K K+ KK +
Sbjct: 61 RKKKEKKEVKAESKRYNAKKLTLE 84
Score = 26.4 bits (59), Expect = 6.8
Identities = 9/25 (36%), Positives = 14/25 (56%)
Query: 268 KKKKKKKKTTKKKKRKKKRKKKKNQ 292
+KKK+KK+ + KR +K Q
Sbjct: 61 RKKKEKKEVKAESKRYNAKKLTLEQ 85
Score = 26.4 bits (59), Expect = 8.4
Identities = 8/24 (33%), Positives = 14/24 (58%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQ 246
KKK+K++ K + K+ +K Q
Sbjct: 62 KKKEKKEVKAESKRYNAKKLTLEQ 85
>gnl|CDD|219640 pfam07916, TraG_N, TraG-like protein, N-terminal region. The
bacterial sequences found in this family are similar to
the N-terminal region of the TraG protein. This is a
membrane-spanning protein, with three predicted
transmembrane segments and two periplasmic regions. TraG
protein is known to be essential for DNA transfer in the
process of conjugation, with the N-terminal portion
being required for F pilus assembly. The protein is
thought to interact with the periplasmic domain of TraN
to stabilise mating-cell interactions.
Length = 462
Score = 31.5 bits (72), Expect = 0.58
Identities = 20/61 (32%), Positives = 28/61 (45%), Gaps = 5/61 (8%)
Query: 50 FIIITAILGNLLVIISVIKHRKLRIITNYFVVSLAFADLLVALCVMPFNAIVSLTDEWYF 109
I I A LG L I LR I N+ +V+L L+V ++P V +TD +
Sbjct: 30 LIRIAAWLGVLAEGIDEGNKGDLRRIENWLLVAL----LVVMFLLVPKTT-VQITDRTFS 84
Query: 110 G 110
G
Sbjct: 85 G 85
>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon.
Length = 431
Score = 31.5 bits (71), Expect = 0.59
Identities = 15/60 (25%), Positives = 39/60 (65%), Gaps = 7/60 (11%)
Query: 225 KKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKK 284
KKKR++++K +++++++K+ + R EEEE+++ K++ ++++ +KR+K
Sbjct: 220 KKKREERRKVLEEEEQRRKQEEADRKS-------REEEEKRRLKEEIERRRAEAAEKRQK 272
>gnl|CDD|225664 COG3122, COG3122, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 215
Score = 31.1 bits (70), Expect = 0.60
Identities = 13/55 (23%), Positives = 27/55 (49%)
Query: 238 KKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQ 292
KKR K + + R + EE K ++ ++ K+ ++++K+ K+ K Q
Sbjct: 58 KKRAKVQRLAKKSRVDAREAYRIAAEENKAEQLERDKQLSEQQKQAALAKEYKAQ 112
>gnl|CDD|177779 PLN00185, PLN00185, 60S ribosomal protein L4-1; Provisional.
Length = 405
Score = 31.3 bits (71), Expect = 0.63
Identities = 17/66 (25%), Positives = 21/66 (31%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
KK KR KK K K + E K KK+K KK+ K+
Sbjct: 311 KKDVKRAPLKKNPLKNLNAMLKLNPYAKTARRMALLAEAARVKAKKEKLAKKRKQVSKEE 370
Query: 283 KKKRKK 288
K
Sbjct: 371 AAAIKA 376
Score = 30.2 bits (68), Expect = 1.6
Identities = 17/67 (25%), Positives = 24/67 (35%), Gaps = 2/67 (2%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
K K+ KK K K N + + R + E K KK+K KK + K
Sbjct: 312 KDVKRAPLKKNPLKNLNAMLKLNPYAKTA--RRMALLAEAARVKAKKEKLAKKRKQVSKE 369
Query: 283 KKKRKKK 289
+ K
Sbjct: 370 EAAAIKA 376
>gnl|CDD|216108 pfam00769, ERM, Ezrin/radixin/moesin family. This family of
proteins contain a band 4.1 domain (pfam00373), at their
amino terminus. This family represents the rest of these
proteins.
Length = 244
Score = 30.9 bits (70), Expect = 0.69
Identities = 8/78 (10%), Positives = 36/78 (46%), Gaps = 4/78 (5%)
Query: 221 VMKKKK----KRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKT 276
++++ +R ++ ++ ++ +K+ + + + + EEEE + +KK + +
Sbjct: 2 EAEREQQELEERMEQMEEDMRRAQKELEEYEETALELEEKLKQEEEEAQLLEKKADELEE 61
Query: 277 TKKKKRKKKRKKKKNQHH 294
++ ++ ++ +
Sbjct: 62 ENRRLEEEAAASEEERER 79
>gnl|CDD|226159 COG3633, SstT, Na+/serine symporter [Amino acid transport and
metabolism].
Length = 407
Score = 31.2 bits (71), Expect = 0.70
Identities = 21/59 (35%), Positives = 33/59 (55%), Gaps = 1/59 (1%)
Query: 38 IFPFIIKGMLMCFIIITAILGNLLVIISVIKHRKLRIITNYFVVSL-AFADLLVALCVM 95
+F + +G L+ I+I +LG LL IIS + ++ FV +L A A +LV + VM
Sbjct: 7 LFRLLARGSLVKQILIGLVLGILLAIISPPAAEAVGLLGTLFVGALKAVAPVLVFILVM 65
>gnl|CDD|179580 PRK03449, PRK03449, putative inner membrane protein translocase
component YidC; Provisional.
Length = 304
Score = 30.8 bits (70), Expect = 0.70
Identities = 13/30 (43%), Positives = 17/30 (56%)
Query: 261 EEEEKKKKKKKKKKKTTKKKKRKKKRKKKK 290
EEE KK+ K +++ K K KR KKK
Sbjct: 273 EEEAKKQAKAERRAANAPKPGAKPKRSKKK 302
Score = 28.5 bits (64), Expect = 4.9
Identities = 11/34 (32%), Positives = 18/34 (52%), Gaps = 5/34 (14%)
Query: 260 EEEEEKKKKKKKKKKKTT-----KKKKRKKKRKK 288
++EEE KK+ K +++ K KR KK+
Sbjct: 271 DKEEEAKKQAKAERRAANAPKPGAKPKRSKKKAP 304
Score = 27.7 bits (62), Expect = 6.8
Identities = 10/31 (32%), Positives = 17/31 (54%)
Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRKKKK 290
EEE +K+ K +++ K + K+ KKK
Sbjct: 273 EEEAKKQAKAERRAANAPKPGAKPKRSKKKA 303
>gnl|CDD|224212 COG1293, COG1293, Predicted RNA-binding protein homologous to
eukaryotic snRNP [Transcription].
Length = 564
Score = 31.2 bits (71), Expect = 0.71
Identities = 15/60 (25%), Positives = 28/60 (46%)
Query: 220 FVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKK 279
+ ++ + K+ + K +K + + + R EE K KKKK+KKK+ +K
Sbjct: 381 NLDRQLSELKEAIAYYESAKTALEKAEGKKAIEEIREELIEEGLLKSKKKKRKKKEWFEK 440
Score = 29.3 bits (66), Expect = 2.6
Identities = 18/74 (24%), Positives = 32/74 (43%), Gaps = 6/74 (8%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRD----VW--INRTSRGEEEEEKKKKKKKKKKKT 276
+ KK KK K K R+ + + + + + EE +++ ++
Sbjct: 367 RYFKKYKKLKGAKVNLDRQLSELKEAIAYYESAKTALEKAEGKKAIEEIREELIEEGLLK 426
Query: 277 TKKKKRKKKRKKKK 290
+KKKKRKKK +K
Sbjct: 427 SKKKKRKKKEWFEK 440
>gnl|CDD|227891 COG5604, COG5604, Uncharacterized conserved protein [Function
unknown].
Length = 523
Score = 31.0 bits (70), Expect = 0.71
Identities = 17/61 (27%), Positives = 22/61 (36%), Gaps = 1/61 (1%)
Query: 219 GFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKK-KKKKKKKKKTT 277
G K KK K K +RK+ NR S E + E K K+K +
Sbjct: 2 GKASKATKKFTKNHLKNTIDRRKQLARSKKVYGTKNRNSHTENKMESGTNDNNKNKEKLS 61
Query: 278 K 278
K
Sbjct: 62 K 62
>gnl|CDD|206228 pfam14058, PcfK, PcfK-like protein. The PcfK-like protein family
includes the Enterococcus faecalis PcfK protein, which
is functionally uncharacterized. This family of proteins
is found in bacteria and viruses. Proteins in this
family are typically between 137 and 257 amino acids in
length. There are two completely conserved residues (D
and L) that may be functionally important.
Length = 136
Score = 30.0 bits (68), Expect = 0.71
Identities = 14/65 (21%), Positives = 25/65 (38%), Gaps = 5/65 (7%)
Query: 228 RKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRK 287
+ K K + N + T + E K+ K ++++ K +KR KK K
Sbjct: 68 DEDDIKVGKPINCRVTVNHT-----VELTEEEKAEARKEALKAYQQEELRKIQKRSKKSK 122
Query: 288 KKKNQ 292
K +
Sbjct: 123 KAEPV 127
>gnl|CDD|204792 pfam11947, DUF3464, Protein of unknown function (DUF3464). This
family of proteins are functionally uncharacterized.
This protein is found in bacteria and eukaryotes.
Proteins in this family are typically between 137 to 196
amino acids in length.
Length = 149
Score = 30.0 bits (68), Expect = 0.76
Identities = 9/51 (17%), Positives = 20/51 (39%), Gaps = 6/51 (11%)
Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKK 272
M +KR + K+ +KK K + +++++ K K+
Sbjct: 1 MASSRKRLPFEPKRSRKKGVKALRK------AAVAKSKDKQKKPKSKRAAS 45
Score = 28.1 bits (63), Expect = 4.1
Identities = 13/39 (33%), Positives = 18/39 (46%)
Query: 254 NRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQ 292
+SR E K+ +KK K K K K K+KK +
Sbjct: 1 MASSRKRLPFEPKRSRKKGVKALRKAAVAKSKDKQKKPK 39
Score = 26.9 bits (60), Expect = 9.1
Identities = 10/31 (32%), Positives = 17/31 (54%)
Query: 262 EEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQ 292
E ++ +KK K + K K K+KK K++
Sbjct: 11 EPKRSRKKGVKALRKAAVAKSKDKQKKPKSK 41
>gnl|CDD|215822 pfam00257, Dehydrin, Dehydrin.
Length = 137
Score = 29.8 bits (67), Expect = 0.76
Identities = 15/59 (25%), Positives = 21/59 (35%), Gaps = 6/59 (10%)
Query: 223 KKKKKRKKK--KKKKKKK----KRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKK 275
+ +KK K+K K+K K + EEKK K K K+K
Sbjct: 72 DGQGGERKKGIKEKIKEKLPGGHGGKGQATTTTTTGTGGGYGQHGHEEKKGKMDKIKEK 130
>gnl|CDD|217756 pfam03839, Sec62, Translocation protein Sec62.
Length = 217
Score = 30.5 bits (69), Expect = 0.77
Identities = 21/77 (27%), Positives = 30/77 (38%), Gaps = 20/77 (25%)
Query: 221 VMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKK-------KK 273
+ +K K K K + K +Q ++ EK K K KK
Sbjct: 14 RALESEKYKANKDKGNPEIYNKINSQ-------------DKAIEKFKLLIKAQMAERVKK 60
Query: 274 KKTTKKKKRKKKRKKKK 290
+ +KK+ KKK KKKK
Sbjct: 61 LHSQEKKEEKKKPKKKK 77
Score = 29.4 bits (66), Expect = 2.0
Identities = 17/78 (21%), Positives = 28/78 (35%), Gaps = 4/78 (5%)
Query: 225 KKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKK 284
+K + K K + K + + E KK ++KK++ K KK+K
Sbjct: 19 EKYKANKDKGNPEIYNKINSQDKAIEKFKLLIKAQMAERVKKLHSQEKKEEKKKPKKKKV 78
Query: 285 KRKKKKNQHHRD----VW 298
+ Q D VW
Sbjct: 79 PLQVNPAQLFVDDEYYVW 96
>gnl|CDD|240377 PTZ00352, PTZ00352, 60S ribosomal protein L13; Provisional.
Length = 212
Score = 30.5 bits (69), Expect = 0.81
Identities = 13/72 (18%), Positives = 26/72 (36%)
Query: 219 GFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTK 278
GF K++ K+ K + + +R +EE+ + ++
Sbjct: 134 GFAGIPADTSKEEVVALPVKQNKNSEVIPFQRTPKREKARVITKEERAFNAYRTLRQAKL 193
Query: 279 KKKRKKKRKKKK 290
K K++KKK
Sbjct: 194 NAKFVGKKEKKK 205
>gnl|CDD|129521 TIGR00427, TIGR00427, membrane protein, MarC family. MarC is a
protein that spans the plasma membrane multiple times
and once was thought to be a multiple antibiotic
resistance protein. The function for this family is
unknown [Unknown function, General].
Length = 201
Score = 30.4 bits (69), Expect = 0.81
Identities = 17/79 (21%), Positives = 28/79 (35%), Gaps = 13/79 (16%)
Query: 36 IHIFPFIIKGMLMCFIIITAILGNLLVIISVIKH--RKLRIITNYFVVSLAFADLLVALC 93
+ I F K + F II I GN+ + IS+ ++ R +F LL+ L
Sbjct: 1 MDILSFYFKFFISLFAIINPI-GNIPIFISLTEYYTAAERNKIAKKANISSFIILLIFL- 58
Query: 94 VMPFNAIVSLTDEWYFGYF 112
+ + F
Sbjct: 59 ---------VFGDTILKLF 68
>gnl|CDD|240274 PTZ00112, PTZ00112, origin recognition complex 1 protein;
Provisional.
Length = 1164
Score = 31.1 bits (70), Expect = 0.82
Identities = 17/73 (23%), Positives = 26/73 (35%)
Query: 225 KKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKK 284
+ K K+ KK+ TS + K K+K K+K KK R
Sbjct: 180 RNKEISNKQLKKQTNVTHTTCYDKMRRSPRNTSTIKNNTNDKNKEKNKEKDKNIKKDRDG 239
Query: 285 KRKKKKNQHHRDV 297
++ K+N V
Sbjct: 240 DKQTKRNSEKSKV 252
Score = 29.6 bits (66), Expect = 2.9
Identities = 18/65 (27%), Positives = 29/65 (44%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
K KK + K+ K+ K K H DV I R+ E ++++K + K+K
Sbjct: 231 KNIKKDRDGDKQTKRNSEKSKVQNSHFDVRILRSYTKENKKDEKNVVSGIRSSVLLKRKS 290
Query: 283 KKKRK 287
+ RK
Sbjct: 291 QCLRK 295
>gnl|CDD|240235 PTZ00032, PTZ00032, 60S ribosomal protein L18; Provisional.
Length = 211
Score = 30.6 bits (69), Expect = 0.84
Identities = 17/70 (24%), Positives = 22/70 (31%)
Query: 224 KKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRK 283
K + K+ K K K + H W E EE K K K K+R
Sbjct: 47 TGKNKLLLTKRSKLKGIPKPRKLHKHGFWAEIFEEKVEREELGNPCKDLDKDIIKGKRRP 106
Query: 284 KKRKKKKNQH 293
+ K N
Sbjct: 107 RLTLKNTNNQ 116
>gnl|CDD|237276 PRK13024, PRK13024, bifunctional preprotein translocase subunit
SecD/SecF; Reviewed.
Length = 755
Score = 31.0 bits (71), Expect = 0.84
Identities = 15/36 (41%), Positives = 23/36 (63%)
Query: 210 SSSISFWIPGFVMKKKKKRKKKKKKKKKKKRKKKKN 245
SSI P ++ +K++ KKKKK+KK KK + +K
Sbjct: 719 YSSIFIAAPLWLDLEKRRLKKKKKRKKVKKWEVEKI 754
Score = 27.9 bits (63), Expect = 9.2
Identities = 10/22 (45%), Positives = 16/22 (72%)
Query: 225 KKKRKKKKKKKKKKKRKKKKNQ 246
+K+R KKKKK+KK K+ + +
Sbjct: 733 EKRRLKKKKKRKKVKKWEVEKI 754
>gnl|CDD|234941 PRK01315, PRK01315, putative inner membrane protein translocase
component YidC; Provisional.
Length = 329
Score = 30.5 bits (69), Expect = 0.99
Identities = 11/59 (18%), Positives = 27/59 (45%), Gaps = 5/59 (8%)
Query: 237 KKKRKKKKNQHHRD-----VWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKK 290
+++R KK + H + V E ++ K++ +++ K++ R K++K
Sbjct: 269 REERLAKKGKDHGESEGKVVAPEGAVAQTTEVREQTKRQTVQRQQPKRQSRAKRQKGGA 327
>gnl|CDD|216298 pfam01101, HMG14_17, HMG14 and HMG17.
Length = 93
Score = 29.0 bits (65), Expect = 1.00
Identities = 14/30 (46%), Positives = 16/30 (53%)
Query: 263 EEKKKKKKKKKKKTTKKKKRKKKRKKKKNQ 292
E K KK KKKK K K+ K KK+ Q
Sbjct: 35 EPKPKKAAAKKKKPAVKGKKGAKGKKETKQ 64
Score = 28.6 bits (64), Expect = 1.2
Identities = 21/73 (28%), Positives = 33/73 (45%), Gaps = 12/73 (16%)
Query: 231 KKKKKKKKKRKKKKNQHHRDVWINRTSR-------GEEEEEKKKKKKKKKKKTTKKKKRK 283
K+K + K K K + R R++R + E + KK KKKK K KK
Sbjct: 2 KRKGEAKDAAKVTKQEPKR-----RSARLSAKPAPPKPEPKPKKAAAKKKKPAVKGKKGA 56
Query: 284 KKRKKKKNQHHRD 296
K +K+ K + ++
Sbjct: 57 KGKKETKQEEAKE 69
>gnl|CDD|219783 pfam08297, U3_snoRNA_assoc, U3 snoRNA associated. This family of
proteins is associated with U3 snoRNA. U3 snoRNA is
required for nucleolar processing of pre-18S ribosomal
RNA.
Length = 87
Score = 28.5 bits (64), Expect = 1.1
Identities = 14/29 (48%), Positives = 19/29 (65%)
Query: 261 EEEEKKKKKKKKKKKTTKKKKRKKKRKKK 289
EEE KK+KKK K + KK +++ K KK
Sbjct: 24 YEEELKKQKKKNKLRFLKKLEKRPKDVKK 52
Score = 27.4 bits (61), Expect = 2.6
Identities = 15/35 (42%), Positives = 18/35 (51%)
Query: 254 NRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKK 288
EEE KK+KKK K + K +KR K KK
Sbjct: 18 TPPPESYEEELKKQKKKNKLRFLKKLEKRPKDVKK 52
Score = 27.4 bits (61), Expect = 3.3
Identities = 14/31 (45%), Positives = 18/31 (58%)
Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRKKKK 290
E EE+ KK+KKK K KK K+ + KK
Sbjct: 22 ESYEEELKKQKKKNKLRFLKKLEKRPKDVKK 52
>gnl|CDD|114629 pfam05917, DUF874, Helicobacter pylori protein of unknown function
(DUF874). This family consists of several hypothetical
proteins specific to Helicobacter pylori. The function
of this family is unknown.
Length = 417
Score = 30.6 bits (68), Expect = 1.1
Identities = 16/83 (19%), Positives = 45/83 (54%)
Query: 219 GFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTK 278
G + + + +++K+K +++++K + + + I + E E+EK+K + +K+ +
Sbjct: 169 GIELANSQIKAEQEKQKTEQEKQKTEQEKQKTSNIANKNAIELEQEKQKTENEKQDLIKE 228
Query: 279 KKKRKKKRKKKKNQHHRDVWQKK 301
+K K+ ++ ++H + KK
Sbjct: 229 QKDFIKEAEQNCQENHNQFFIKK 251
>gnl|CDD|222892 PHA02586, 68, prohead core protein; Provisional.
Length = 140
Score = 29.3 bits (66), Expect = 1.1
Identities = 19/90 (21%), Positives = 42/90 (46%), Gaps = 4/90 (4%)
Query: 202 VNVYYALISSSISFWIPGFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEE 261
++ A++S + + + ++K R + + K +K R++ Q +++ R +
Sbjct: 50 PDLALAMLSINENMSLNEAIVKHVSSRGEVTRTKDRKTRERNAYQ---TTGLSKAKRRQI 106
Query: 262 EEEKKKKKKKKKKKTTK-KKKRKKKRKKKK 290
+ K K+ K ++KRKK KK+K
Sbjct: 107 ARKAAKTKRANPSIQRKAQRKRKKALKKRK 136
>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
Rpc31. RNA polymerase III contains seventeen subunits
in yeasts and in human cells. Twelve of these are akin
to RNA polymerase I or II and the other five are RNA pol
III-specific, and form the functionally distinct groups
(i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
Rpc34 and Rpc82 form a cluster of enzyme-specific
subunits that contribute to transcription initiation in
S.cerevisiae and H.sapiens. There is evidence that these
subunits are anchored at or near the N-terminal Zn-fold
of Rpc1, itself prolonged by a highly conserved but RNA
polymerase III-specific domain.
Length = 221
Score = 30.1 bits (68), Expect = 1.2
Identities = 16/69 (23%), Positives = 26/69 (37%), Gaps = 11/69 (15%)
Query: 221 VMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKK 280
VM KK KK K K++ T E+ +EK +KK K+ +
Sbjct: 119 VMGINKKAGKKLALSKFKRKVGLF-----------TEEEEDIDEKLSMLEKKLKELEAED 167
Query: 281 KRKKKRKKK 289
++ K +
Sbjct: 168 VDEEDEKDE 176
>gnl|CDD|223562 COG0488, Uup, ATPase components of ABC transporters with duplicated
ATPase domains [General function prediction only].
Length = 530
Score = 30.3 bits (69), Expect = 1.2
Identities = 16/70 (22%), Positives = 31/70 (44%), Gaps = 15/70 (21%)
Query: 222 MKKKKKRKKKKKKKKKKKRKK-KKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKK 280
+++K +R +++ +K++K+ K Q WI R K K KK +
Sbjct: 239 LEQKAERLRQEAAAYEKQQKELAKEQE----WIRRG----------KAAASKAKKAKSRI 284
Query: 281 KRKKKRKKKK 290
KR +K + +
Sbjct: 285 KRLEKLEARL 294
>gnl|CDD|236768 PRK10819, PRK10819, transport protein TonB; Provisional.
Length = 246
Score = 30.0 bits (68), Expect = 1.2
Identities = 9/28 (32%), Positives = 15/28 (53%)
Query: 263 EEKKKKKKKKKKKTTKKKKRKKKRKKKK 290
+ K K K K KK ++ KR+ K + +
Sbjct: 100 KPKPKPKPKPVKKVEEQPKREVKPVEPR 127
Score = 28.9 bits (65), Expect = 2.8
Identities = 12/31 (38%), Positives = 16/31 (51%)
Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRKKKK 290
+ E K K K K K K KK + + KR+ K
Sbjct: 93 PKPEPKPKPKPKPKPKPVKKVEEQPKREVKP 123
Score = 28.1 bits (63), Expect = 4.8
Identities = 11/28 (39%), Positives = 14/28 (50%)
Query: 217 IPGFVMKKKKKRKKKKKKKKKKKRKKKK 244
IP K K K K K K KK + + K+
Sbjct: 92 IPKPEPKPKPKPKPKPKPVKKVEEQPKR 119
Score = 28.1 bits (63), Expect = 5.3
Identities = 9/22 (40%), Positives = 13/22 (59%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKK 244
K K K K KK +++ KR+ K
Sbjct: 102 KPKPKPKPVKKVEEQPKREVKP 123
Score = 27.3 bits (61), Expect = 9.2
Identities = 12/31 (38%), Positives = 17/31 (54%)
Query: 221 VMKKKKKRKKKKKKKKKKKRKKKKNQHHRDV 251
V+ K + + K K K K K KK + Q R+V
Sbjct: 91 VIPKPEPKPKPKPKPKPKPVKKVEEQPKREV 121
>gnl|CDD|185575 PTZ00361, PTZ00361, 26 proteosome regulatory subunit 4-like
protein; Provisional.
Length = 438
Score = 30.5 bits (69), Expect = 1.2
Identities = 14/39 (35%), Positives = 24/39 (61%)
Query: 254 NRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQ 292
N +G +++K KKK+KKKK++ K++KK+K
Sbjct: 3 NAQGQGNNQKDKNKKKEKKKKESPPPPHEIKRKKKRKGP 41
>gnl|CDD|151203 pfam10710, DUF2512, Protein of unknown function (DUF2512).
Proteins in this family are predicted to be integral
membrane proteins, and many of them are annotated as
being YndM protein. They are all found in Firmicutes.
The true function is not known.
Length = 136
Score = 29.2 bits (66), Expect = 1.3
Identities = 26/123 (21%), Positives = 47/123 (38%), Gaps = 39/123 (31%)
Query: 37 HIFPFIIKGMLMCFIIITAILGNL-------LVIISVIKHRKLRIITNYFVVSLAFADLL 89
H+ +IK + M ++ +LG +++IS+ ++T +VS DL
Sbjct: 3 HVKALLIKFI-MTLAVLYIVLGLFFGVSFGDVLLISL-------VLT---LVSYLLGDLF 51
Query: 90 VALCVMPF--NAIVSLTD-------EWYFGYFMCDVWNSFDVYFS--TASILHLCCISVD 138
V+P N ++ D W G + D Y AS++ I+V
Sbjct: 52 ----VLPRTGNTTATIADFGLAFLVVWLMGRIL------TDNYLPIAWASLISALVIAVG 101
Query: 139 RYY 141
Y+
Sbjct: 102 EYF 104
>gnl|CDD|220477 pfam09933, DUF2165, Predicted small integral membrane protein
(DUF2165). This domain, found in various hypothetical
prokaryotic proteins, has no known function.
Length = 160
Score = 29.6 bits (67), Expect = 1.3
Identities = 27/119 (22%), Positives = 44/119 (36%), Gaps = 7/119 (5%)
Query: 7 NSLNYTYNINYTSTNFSNTSLFPTNSTIHIHIFPFIIKGMLMCFIIITAILGNLLVIISV 66
N +Y N + S + FP N+ ++ I + II+ +L LL I
Sbjct: 25 NITDYDTNFQFVQHVLSMDTTFPDNALMYRAITSPWLHHAAYALIIVLELLTGLLCWIGA 84
Query: 67 I---KHRKLRIITNYFVVSLAFADL-LVALCVMPFNAIVSLTDEWYFGYFMCDVWNSFD 121
+ F + A+A L L ++ F + + EW FG + WN D
Sbjct: 85 WRLLRALNASAAA--FNRAKAWAILGLTLGFLVWFVGFMVVGGEW-FGMWQSKEWNGQD 140
>gnl|CDD|153271 cd07307, BAR, The Bin/Amphiphysin/Rvs (BAR) domain, a dimerization
module that binds membranes and detects membrane
curvature. BAR domains are dimerization, lipid binding
and curvature sensing modules found in many different
proteins with diverse functions including organelle
biogenesis, membrane trafficking or remodeling, and cell
division and migration. Mutations in BAR containing
proteins have been linked to diseases and their
inactivation in cells leads to altered membrane
dynamics. A BAR domain with an additional N-terminal
amphipathic helix (an N-BAR) can drive membrane
curvature. These N-BAR domains are found in amphiphysins
and endophilins, among others. BAR domains are also
frequently found alongside domains that determine lipid
specificity, such as the Pleckstrin Homology (PH) and
Phox Homology (PX) domains which are present in beta
centaurins (ACAPs and ASAPs) and sorting nexins,
respectively. A FES-CIP4 Homology (FCH) domain together
with a coiled coil region is called the F-BAR domain and
is present in Pombe/Cdc15 homology (PCH) family
proteins, which include Fes/Fes tyrosine kinases, PACSIN
or syndapin, CIP4-like proteins, and srGAPs, among
others. The Inverse (I)-BAR or IRSp53/MIM homology
Domain (IMD) is found in multi-domain proteins, such as
IRSp53 and MIM, that act as scaffolding proteins and
transducers of a variety of signaling pathways that link
membrane dynamics and the underlying actin cytoskeleton.
BAR domains form dimers that bind to membranes, induce
membrane bending and curvature, and may also be involved
in protein-protein interactions. The I-BAR domain
induces membrane protrusions in the opposite direction
compared to classical BAR and F-BAR domains, which
produce membrane invaginations. BAR domains that also
serve as protein interaction domains include those of
arfaptin and OPHN1-like proteins, among others, which
bind to Rac and Rho GAP domains, respectively.
Length = 194
Score = 29.7 bits (67), Expect = 1.3
Identities = 15/62 (24%), Positives = 31/62 (50%), Gaps = 7/62 (11%)
Query: 226 KKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKK 285
K+ K+ KK++KK K + + +R + ++ +KKKK K +++ ++ K
Sbjct: 89 LKKDLKEIKKRRKKLDKARLDYDA-------AREKLKKLRKKKKDSSKLAEAEEELQEAK 141
Query: 286 RK 287
K
Sbjct: 142 EK 143
>gnl|CDD|222791 PHA00431, PHA00431, internal virion protein C.
Length = 746
Score = 30.5 bits (69), Expect = 1.3
Identities = 7/50 (14%), Positives = 27/50 (54%), Gaps = 2/50 (4%)
Query: 229 KKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTK 278
K ++ ++K ++ + + +D N+ + + +++ ++K+K+T+
Sbjct: 691 SKMYQENQQKLAEEAREKALKDA--NKRAPIVAATQAREEVREKRKQTSG 738
>gnl|CDD|216652 pfam01698, FLO_LFY, Floricaula / Leafy protein. This family
consists of various plant development proteins which are
homologues of floricaula (FLO) and Leafy (LFY) proteins
which are floral meristem identity proteins. Mutations
in the sequences of these proteins affect flower and
leaf development.
Length = 382
Score = 30.4 bits (69), Expect = 1.3
Identities = 13/49 (26%), Positives = 27/49 (55%)
Query: 217 IPGFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEK 265
+PG +KK+++KK+++K+ K ++ + D + GE EE+
Sbjct: 174 VPGHSSDSEKKKQRKKQRRKRSKELREDDDDDEDEDDDGEGGGEGGEER 222
Score = 28.8 bits (65), Expect = 4.1
Identities = 13/51 (25%), Positives = 20/51 (39%)
Query: 201 VVNVYYALISSSISFWIPGFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDV 251
V A S G V +KKK++KK+R+K+ + D
Sbjct: 152 VQQEKEAAGSGGGGGGTWGLVGVPGHSSDSEKKKQRKKQRRKRSKELREDD 202
>gnl|CDD|218391 pfam05029, TIMELESS_C, Timeless protein C terminal region. The
timeless (tim) gene is essential for circadian function
in Drosophila. Putative homologues of Drosophila tim
have been identified in both mice and humans (mTim and
hTIM, respectively). Mammalian TIM is not the true
orthologue of Drosophila TIM, but is the likely
orthologue of a fly gene, timeout (also called tim-2).
mTim has been shown to be essential for embryonic
development, but does not have substantiated circadian
function. Some family members contain a SANT domain in
this region.
Length = 507
Score = 30.1 bits (67), Expect = 1.5
Identities = 16/57 (28%), Positives = 30/57 (52%), Gaps = 3/57 (5%)
Query: 219 GFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKK 275
G V +K++ R KK++KK K ++ + D + EEEE+ + K ++ +K
Sbjct: 215 GLVSEKRELRGGKKRRKKLKPKQPNGEESGEDDF---QEDPEEEEQLPESKPEETEK 268
>gnl|CDD|148820 pfam07432, Hc1, Histone H1-like protein Hc1. This family consists
of several bacterial histone H1-like Hc1 proteins. In
Chlamydia, Hc1 is expressed in the late stages of the
life cycle, concomitant with the reorganisation of
chlamydial reticulate bodies into elementary bodies.
This suggests that Hc1 protein plays a role in the
condensation of chromatin during intracellular
differentiation.
Length = 123
Score = 28.9 bits (64), Expect = 1.5
Identities = 20/68 (29%), Positives = 25/68 (36%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
+K KKK KK K K K + ++ + K K KK K KK
Sbjct: 56 EKSGLMKKKPAKKAPKAAKAPKKAAKAPAKKAAAAPAKKAKAVKAKPATKKTAAKKVKKM 115
Query: 283 KKKRKKKK 290
KKR K
Sbjct: 116 SKKRSTAK 123
>gnl|CDD|205480 pfam13300, DUF4078, Domain of unknown function (DUF4078). This
family is found from fungi to humans, but its exact
function is not known.
Length = 88
Score = 28.0 bits (63), Expect = 1.5
Identities = 16/70 (22%), Positives = 38/70 (54%), Gaps = 11/70 (15%)
Query: 234 KKKKKKRKKKKNQHHRDVWINRTSRG-------EEEEEKKKKK---KKKKKKTTKKKKRK 283
+K+ + H+ RT RG ++EEE+K++ +K +++T +++K +
Sbjct: 8 RKRDRSPTPPPEVHYDADAEVRT-RGTGFYAFSKDEEERKEQMEELEKAREETERERKER 66
Query: 284 KKRKKKKNQH 293
++RK+K+ +
Sbjct: 67 EERKEKRKRA 76
>gnl|CDD|219084 pfam06550, DUF1119, Protein of unknown function (DUF1119). This
family consists of several hypothetical archaeal
proteins of unknown function.
Length = 283
Score = 29.8 bits (67), Expect = 1.5
Identities = 34/158 (21%), Positives = 71/158 (44%), Gaps = 31/158 (19%)
Query: 50 FIIITAILGNLLVIISVIKHRKLRI--ITNYFVVSLAFADLLVALCVMPFNA----IVSL 103
F I+ A++ L V+I + +++K I + VV++ F L+ L + NA I S+
Sbjct: 37 FYIVAALVFTLFVLIYIKRNKKWIIFLVIYSAVVAVIFYVFLILLSIFLVNAGAINIASI 96
Query: 104 T------------DEWYF----GYFM----CDVWN-SFDVYFSTASILHLCCISVDRYYA 142
EWY G+F+ +W SF V+ +++ L ++V +
Sbjct: 97 LLAIGLFYLLYFYPEWYVIDLAGFFLAVGISALWGISFGVW---PAVVLLIALAVYDAIS 153
Query: 143 IVKPLQYPIIMNQNTVLLMLSNVWILPGIISFTPIMLG 180
+ K ++ I + + + L L ++++P + ++ I
Sbjct: 154 VYKT-KHMITLARGVMDLDLPILFVIPENLDYSFIEES 190
>gnl|CDD|176075 cd08693, C2_PI3K_class_I_beta_delta, C2 domain present in class I
beta and delta phosphatidylinositol 3-kinases (PI3Ks).
PI3Ks (AKA phosphatidylinositol (PtdIns) 3-kinases)
regulate cell processes such as cell growth,
differentiation, proliferation, and motility. PI3Ks
work on phosphorylation of phosphatidylinositol,
phosphatidylinositide (4)P (PtdIns (4)P),2 or
PtdIns(4,5)P2. Specifically they phosphorylate the D3
hydroxyl group of phosphoinositol lipids on the inositol
ring. There are 3 classes of PI3Ks based on structure,
regulation, and specificity. All classes contain a C2
domain, a PIK domain, and a kinase catalytic domain.
The members here are class I, beta and delta isoforms of
PI3Ks and contain both a Ras-binding domain and a
p85-binding domain. Class II PI3Ks contain both of
these as well as a PX domain, and a C-terminal C2 domain
containing a nuclear localization signal. C2 domains
fold into an 8-standed beta-sandwich that can adopt 2
structural arrangements: Type I and Type II,
distinguished by a circular permutation involving their
N- and C-terminal beta strands. Many C2 domains are
Ca2+-dependent membrane-targeting modules that bind a
wide variety of substances including bind phospholipids,
inositol polyphosphates, and intracellular proteins.
Most C2 domain proteins are either signal transduction
enzymes that contain a single C2 domain, such as protein
kinase C, or membrane trafficking proteins which contain
at least two C2 domains, such as synaptotagmin 1.
However, there are a few exceptions to this including
RIM isoforms and some splice variants of piccolo/aczonin
and intersectin which only have a single C2 domain. C2
domains with a calcium binding region have negatively
charged residues, primarily aspartates, that serve as
ligands for calcium ions. Members have a type-I
topology.
Length = 173
Score = 29.2 bits (66), Expect = 1.6
Identities = 10/21 (47%), Positives = 15/21 (71%)
Query: 221 VMKKKKKRKKKKKKKKKKKRK 241
V KK K ++ +K + KKKK+K
Sbjct: 85 VSKKAKGKRSRKNQTKKKKKK 105
Score = 27.7 bits (62), Expect = 4.9
Identities = 9/21 (42%), Positives = 14/21 (66%)
Query: 267 KKKKKKKKKTTKKKKRKKKRK 287
KK K K++ K + +KKK+K
Sbjct: 85 VSKKAKGKRSRKNQTKKKKKK 105
>gnl|CDD|227446 COG5116, RPN2, 26S proteasome regulatory complex component
[Posttranslational modification, protein turnover,
chaperones].
Length = 926
Score = 30.3 bits (68), Expect = 1.6
Identities = 16/65 (24%), Positives = 22/65 (33%), Gaps = 6/65 (9%)
Query: 224 KKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKK------KKKKKKKKKTT 277
K R K+K K+K K+ K + ++ EEK KKKK
Sbjct: 792 KAAARAKQKPKEKGPNDKEIKIESPSVETEGERCTIKQREEKGIDAPAILNVKKKKPYKV 851
Query: 278 KKKKR 282
R
Sbjct: 852 DNMTR 856
Score = 29.5 bits (66), Expect = 2.4
Identities = 14/59 (23%), Positives = 27/59 (45%)
Query: 232 KKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKK 290
K + K++ K+K + +++ I S E E K++++K KK+K K
Sbjct: 792 KAAARAKQKPKEKGPNDKEIKIESPSVETEGERCTIKQREEKGIDAPAILNVKKKKPYK 850
Score = 28.8 bits (64), Expect = 4.4
Identities = 15/64 (23%), Positives = 25/64 (39%), Gaps = 8/64 (12%)
Query: 225 KKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKK 284
K + K+K K+K K+ V E E E+ K++++K
Sbjct: 792 KAAARAKQKPKEKGPNDKEIKIESPSV--------ETEGERCTIKQREEKGIDAPAILNV 843
Query: 285 KRKK 288
K+KK
Sbjct: 844 KKKK 847
>gnl|CDD|233034 TIGR00584, mug, mismatch-specific thymine-DNA glycosylate (mug).
All proteins in this family for whcih functions are
known are G-T or G-U mismatch glycosylases that function
in base excision repair. This family is based on the
phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
Stanford University). Used 2pf model [DNA metabolism,
DNA replication, recombination, and repair].
Length = 328
Score = 30.0 bits (67), Expect = 1.6
Identities = 16/78 (20%), Positives = 34/78 (43%), Gaps = 3/78 (3%)
Query: 218 PGFVMKKKKKRKKKKKKKKKKKRKKKKNQHHR--DVWINRTSRGEEEEEKKKKKKKK-KK 274
V K ++ ++ K +KRK + N+ +R + S+ + K K+K++K
Sbjct: 40 TSGVPAKAPTQEPSEEAPKFRKRKPRSNEPYRPVEPKKPSDSKKSGKSTKSKEKQEKITD 99
Query: 275 KTTKKKKRKKKRKKKKNQ 292
K K+K + + +
Sbjct: 100 KFKVKRKVDRFNGVSEAE 117
>gnl|CDD|226138 COG3611, DnaB, Replication initiation/membrane attachment protein
[DNA replication, recombination, and repair].
Length = 417
Score = 30.1 bits (68), Expect = 1.7
Identities = 15/65 (23%), Positives = 30/65 (46%), Gaps = 1/65 (1%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVW-INRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
K+ RK+K+ KK+ K + + W + +EE+ K + K ++ + +
Sbjct: 353 AIKENRKRKELALAKKQSSTTKRKENLPKWSNPEYKKTTSQEEQAKLEFIKLEELKRLEN 412
Query: 282 RKKKR 286
K+KR
Sbjct: 413 GKQKR 417
>gnl|CDD|220376 pfam09745, DUF2040, Coiled-coil domain-containing protein 55
(DUF2040). This entry is a conserved domain of
approximately 130 residues of proteins conserved from
fungi to humans. The proteins do contain a coiled-coil
domain, but the function is unknown.
Length = 128
Score = 28.9 bits (65), Expect = 1.7
Identities = 18/78 (23%), Positives = 39/78 (50%), Gaps = 11/78 (14%)
Query: 222 MKKKKKRKKKKKKKKKKKRKK-----------KKNQHHRDVWINRTSRGEEEEEKKKKKK 270
+K K+ KK+ K ++K+ +K ++ + R++ R + E E+E +
Sbjct: 27 IKAAKEEKKQAKLSERKENRKPKYIGSLLEAAERRKREREIAEERKLQKEREKEGDEFAD 86
Query: 271 KKKKKTTKKKKRKKKRKK 288
K+K T+ KK+ ++ +K
Sbjct: 87 KEKFVTSAYKKQLEENRK 104
>gnl|CDD|220611 pfam10169, Laps, Learning-associated protein. This is a family of
121-amino acid secretory proteins. Laps functions in the
regulation of neuronal cell adhesion and/or movement and
synapse attachment. Laps binds to the ApC/EBP (Aplysia
CCAAT/enhancer binding protein) promoter and activates
the transcription of ApC/EBP mRNA.
Length = 124
Score = 28.7 bits (64), Expect = 1.7
Identities = 13/28 (46%), Positives = 22/28 (78%)
Query: 218 PGFVMKKKKKRKKKKKKKKKKKRKKKKN 245
P ++ +++ K+ KKK++KKKKK+K KN
Sbjct: 95 PAWMNQRQAKKLKKKREKKKKKKKGAKN 122
Score = 28.3 bits (63), Expect = 2.9
Identities = 17/60 (28%), Positives = 27/60 (45%), Gaps = 12/60 (20%)
Query: 231 KKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKK 290
K + KK + ++ + W+N +++ KK KKK KKKK+KK K
Sbjct: 77 NKAAVRNKKTLRDQHGQY-PAWMN-----------QRQAKKLKKKREKKKKKKKGAKNLA 124
Score = 27.5 bits (61), Expect = 4.6
Identities = 24/100 (24%), Positives = 40/100 (40%), Gaps = 22/100 (22%)
Query: 207 ALISSSISFWIPGFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKK 266
A + ++ +MK K K ++ K++KKK+ + +E+ E
Sbjct: 25 ARLKKALGLIDGNVLMKDVKDIATVKPAEEVKEKKKKEGT--------ESEEDDEKMEVD 76
Query: 267 KKKKKKKKKT--------------TKKKKRKKKRKKKKNQ 292
K + KKT + KK KKKR+KKK +
Sbjct: 77 NKAAVRNKKTLRDQHGQYPAWMNQRQAKKLKKKREKKKKK 116
Score = 26.7 bits (59), Expect = 7.7
Identities = 13/20 (65%), Positives = 14/20 (70%)
Query: 223 KKKKKRKKKKKKKKKKKRKK 242
K KKKR+KKKKKKK K
Sbjct: 105 KLKKKREKKKKKKKGAKNLA 124
Score = 26.7 bits (59), Expect = 8.0
Identities = 12/22 (54%), Positives = 16/22 (72%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKK 244
+ KK +KK++KKKKKKK K
Sbjct: 102 QAKKLKKKREKKKKKKKGAKNL 123
Score = 26.7 bits (59), Expect = 8.2
Identities = 11/24 (45%), Positives = 16/24 (66%)
Query: 226 KKRKKKKKKKKKKKRKKKKNQHHR 249
+R+ KK KKK++K+KKKK
Sbjct: 99 NQRQAKKLKKKREKKKKKKKGAKN 122
>gnl|CDD|223898 COG0828, RpsU, Ribosomal protein S21 [Translation, ribosomal
structure and biogenesis].
Length = 67
Score = 27.7 bits (62), Expect = 1.7
Identities = 11/26 (42%), Positives = 19/26 (73%)
Query: 264 EKKKKKKKKKKKTTKKKKRKKKRKKK 289
EK +K+K+KK +K+K K+ RK++
Sbjct: 39 EKPSEKRKRKKAAARKRKFKRLRKEQ 64
Score = 26.9 bits (60), Expect = 2.7
Identities = 11/22 (50%), Positives = 16/22 (72%)
Query: 225 KKKRKKKKKKKKKKKRKKKKNQ 246
K+KRKK +K+K KR +K+ Q
Sbjct: 44 KRKRKKAAARKRKFKRLRKEQQ 65
>gnl|CDD|236078 PRK07726, PRK07726, DNA topoisomerase III; Provisional.
Length = 658
Score = 30.1 bits (69), Expect = 1.7
Identities = 15/73 (20%), Positives = 20/73 (27%), Gaps = 12/73 (16%)
Query: 210 SSSISFWIPGFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKK 269
SS + K K K K K K Q E ++K
Sbjct: 598 QSSEKYKFDNLTGPKCPDCGKPMLKVKGKNGKMLVCQ------------DRECGKRKNVS 645
Query: 270 KKKKKKTTKKKKR 282
KK + KK+
Sbjct: 646 KKTNARCPNCKKK 658
>gnl|CDD|235971 PRK07219, PRK07219, DNA topoisomerase I; Validated.
Length = 822
Score = 30.0 bits (68), Expect = 1.7
Identities = 9/64 (14%), Positives = 17/64 (26%), Gaps = 15/64 (23%)
Query: 227 KRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKR 286
K+KK+K+ K + + K + +K K +T
Sbjct: 765 TEKQKKEKESKSELEALKG---------------VGAKTAEKLKDAGVETVTDLTAADPD 809
Query: 287 KKKK 290
Sbjct: 810 AVAA 813
Score = 28.0 bits (63), Expect = 7.5
Identities = 7/63 (11%), Positives = 11/63 (17%), Gaps = 13/63 (20%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
+ K + + K K K K G E
Sbjct: 773 ESKSELEALKGVGAKTAEKLK-------------DAGVETVTDLTAADPDAVAAKVDGVS 819
Query: 283 KKK 285
+
Sbjct: 820 ADR 822
>gnl|CDD|203922 pfam08377, MAP2_projctn, MAP2/Tau projection domain. This domain is
found in the MAP2/Tau family of proteins which includes
MAP2, MAP4, Tau, and their homologs. All isoforms contain
a conserved C-terminal domain containing tubulin-binding
repeats (pfam00418), and a N-terminal projection domain
of varying size. This domain has a net negative charge
and exerts a long-range repulsive force. This provides a
mechanism that can regulate microtubule spacing which
might facilitate efficient organelle transport.
Length = 1134
Score = 30.1 bits (67), Expect = 1.7
Identities = 15/59 (25%), Positives = 31/59 (52%)
Query: 232 KKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKK 290
K+++ +K+ ++ + HR +T RG ++K KK+ ++ + R+KK KK
Sbjct: 1038 KEERAEKEARRPSLEKHRKEKPFKTGRGRISTPERKVAKKEPSTVSRDEVRRKKAVYKK 1096
>gnl|CDD|227594 COG5269, ZUO1, Ribosome-associated chaperone zuotin [Translation,
ribosomal structure and biogenesis / Posttranslational
modification, protein turnover, chaperones].
Length = 379
Score = 30.0 bits (67), Expect = 1.8
Identities = 18/75 (24%), Positives = 26/75 (34%), Gaps = 20/75 (26%)
Query: 223 KKKKKRKK--------KKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKK 274
K+ KK +K K K + + KN + E E E KK
Sbjct: 240 KEMKKIRKWEREAGARLKALAALKGKAEAKN------------KAEIEAEALASATAVKK 287
Query: 275 KTTKKKKRKKKRKKK 289
K + K+ K +KK
Sbjct: 288 KAKEVMKKALKMEKK 302
Score = 29.2 bits (65), Expect = 2.6
Identities = 13/62 (20%), Positives = 29/62 (46%)
Query: 231 KKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKK 290
K+++K+ KK +K + + + +G+ E + K + + + + K+K K KK
Sbjct: 236 KEQEKEMKKIRKWEREAGARLKALAALKGKAEAKNKAEIEAEALASATAVKKKAKEVMKK 295
Query: 291 NQ 292
Sbjct: 296 AL 297
Score = 28.8 bits (64), Expect = 4.0
Identities = 19/77 (24%), Positives = 36/77 (46%), Gaps = 9/77 (11%)
Query: 228 RKKKKKKKKKKKRKKKKNQHHR------DVWINRTSR---GEEEEEKKKKKKKKKKKTTK 278
RK+ + K ++KR K KNQ + + R R +E+E++ KK +K +++
Sbjct: 195 RKRYSEAKNREKRAKLKNQDNARLKRLVQIAKKRDPRIKSFKEQEKEMKKIRKWEREAGA 254
Query: 279 KKKRKKKRKKKKNQHHR 295
+ K K K ++
Sbjct: 255 RLKALAALKGKAEAKNK 271
Score = 28.5 bits (63), Expect = 4.6
Identities = 14/69 (20%), Positives = 26/69 (37%), Gaps = 8/69 (11%)
Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
MKK +K +++ + K K ++ E E + KK K+
Sbjct: 242 MKKIRKWEREAGARLKALAALKGKAEAKNKA--------EIEAEALASATAVKKKAKEVM 293
Query: 282 RKKKRKKKK 290
+K + +KK
Sbjct: 294 KKALKMEKK 302
>gnl|CDD|219555 pfam07753, DUF1609, Protein of unknown function (DUF1609). This
region is found in a number of hypothetical proteins
thought to be expressed by the eukaryote Encephalitozoon
cuniculi, an obligate intracellular microsporidial
parasite. It is approximately 200 residues long.
Length = 230
Score = 29.4 bits (66), Expect = 1.8
Identities = 19/65 (29%), Positives = 31/65 (47%), Gaps = 2/65 (3%)
Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDV--WINRTSRGEEEEEKKKKKKKKKKKTTKK 279
M R KKK KKK K + + H+ V W + +EE +K ++K K + +
Sbjct: 15 MAVGGARAKKKGGKKKSKGGRHCYKIHKRVLRWRKSPEKIKEELDKGSEEKWKGRSIEEI 74
Query: 280 KKRKK 284
K++K
Sbjct: 75 KEQKV 79
>gnl|CDD|221144 pfam11595, DUF3245, Protein of unknown function (DUF3245). This is
a family of proteins conserved in fungi. The function is
not known, and there is no S. cerevisiae member.
Length = 145
Score = 28.8 bits (64), Expect = 1.9
Identities = 14/62 (22%), Positives = 29/62 (46%), Gaps = 4/62 (6%)
Query: 221 VMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKK 280
VM +K+K + K+K K+ Q + V ++++E + + KK + KK
Sbjct: 88 VMAEKEKAEGGPVKRKAAVVAKEAKQSSKGV----GDDDDDDDEDESRSAAFGKKGSNKK 143
Query: 281 KR 282
++
Sbjct: 144 RK 145
>gnl|CDD|222466 pfam13945, NST1, Splicing factor, salt tolerance regulator. NST1
is a family of proteins that seem to be involved,
directly or indirectly, in the salt sensitivity of some
cellular functions in yeast. These proteins also
interact with the splicing factor Msl1p.
Length = 189
Score = 29.1 bits (65), Expect = 1.9
Identities = 9/21 (42%), Positives = 11/21 (52%)
Query: 231 KKKKKKKKKRKKKKNQHHRDV 251
K KKKKKK+ K H+
Sbjct: 32 KSKKKKKKRSKATSPSHNASD 52
Score = 28.7 bits (64), Expect = 3.1
Identities = 10/25 (40%), Positives = 12/25 (48%)
Query: 254 NRTSRGEEEEEKKKKKKKKKKKTTK 278
N + K KKKKKK+ K T
Sbjct: 21 NTVIHNDSSSSKSKKKKKKRSKATS 45
>gnl|CDD|236295 PRK08570, rpl19e, 50S ribosomal protein L19e; Reviewed.
Length = 150
Score = 29.1 bits (66), Expect = 1.9
Identities = 13/49 (26%), Positives = 22/49 (44%), Gaps = 5/49 (10%)
Query: 253 INRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQHHRDVWQKK 301
SRG E +K+KK +++ +K +K R KK + W +
Sbjct: 55 KKGISRGRARERHEKRKKGRRRGPGSRKGKKGARTPKK-----ERWINR 98
>gnl|CDD|130559 TIGR01495, ETRAMP, Plasmodium ring stage membrane protein ETRAMP.
This model describes a family of proteins from the
malaria parasite Plasmodium falciparum, several of which
have been shown to be expressed specifically in the ring
stage as well as the rident parasite Plasmodium yoelii.
A homolog from Plasmodium chabaudi was localized to the
parasitophorous vacuole membrane. Members have an
initial hydrophobic, Phe/Tyr-rich stretch long enough to
span the membrane, a highly charged region rich in Lys,
a second putative transmembrane region, and a second
highly charged, low complexity sequence region. Some
members have up to 100 residues of additional C-terminal
sequence. These genes have been shown to be found in the
sub-telomeric regions of both P. falciparum and P.
yoelii chromosomes.
Length = 85
Score = 27.7 bits (62), Expect = 1.9
Identities = 17/43 (39%), Positives = 23/43 (53%)
Query: 204 VYYALISSSISFWIPGFVMKKKKKRKKKKKKKKKKKRKKKKNQ 246
+Y+ +I+F PG+ KK KK KK K KKKKN+
Sbjct: 7 LYFFAALLAINFIAPGYCNNGKKVNKKGNLKKLTKAEKKKKNK 49
>gnl|CDD|129694 TIGR00606, rad50, rad50. All proteins in this family for which
functions are known are involvedin recombination,
recombinational repair, and/or non-homologous end
joining.They are components of an exonuclease complex
with MRE11 homologs. This family is distantly related to
the SbcC family of bacterial proteins.This family is
based on the phylogenomic analysis of JA Eisen (1999,
Ph.D. Thesis, Stanford University).
Length = 1311
Score = 30.0 bits (67), Expect = 2.0
Identities = 14/65 (21%), Positives = 29/65 (44%), Gaps = 3/65 (4%)
Query: 221 VMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKK 280
+ +K ++ + + K K+ K+K RD TS+ + E ++ K + + K
Sbjct: 195 RQTQGQKVQEHQMELKYLKQYKEKACEIRD---QITSKEAQLESSREIVKSYENELDPLK 251
Query: 281 KRKKK 285
R K+
Sbjct: 252 NRLKE 256
>gnl|CDD|173965 cd08045, TAF4, TATA Binding Protein (TBP) Associated Factor 4
(TAF4) is one of several TAFs that bind TBP and is
involved in forming Transcription Factor IID (TFIID)
complex. The TATA Binding Protein (TBP) Associated
Factor 4 (TAF4) is one of several TAFs that bind TBP and
are involved in forming the Transcription Factor IID
(TFIID) complex. TFIID is one of seven General
Transcription Factors (GTF) (TFIIA, TFIIB, TFIID, TFIIE,
TFIIF, and TFIID) that are involved in accurate
initiation of transcription by RNA polymerase II in
eukaryote. TFIID plays an important role in the
recognition of promoter DNA and assembly of the
pre-initiation complex. TFIID complex is composed of the
TBP and at least 13 TAFs. TAFs from various species were
originally named by their predicted molecular weight or
their electrophoretic mobility in polyacrylamide gels. A
new, unified nomenclature for the pol II TAFs has been
suggested to show the relationship between TAF orthologs
and paralogs. Several hypotheses are proposed for TAFs
functions such as serving as activator-binding sites,
core-promoter recognition or a role in essential
catalytic activity. Each TAF, with the help of a
specific activator, is required only for the expression
of subset of genes and is not universally involved for
transcription as are GTFs. In yeast and human cells,
TAFs have been found as components of other complexes
besides TFIID. Several TAFs interact via histone-fold
(HFD) motifs; HFD is the interaction motif involved in
heterodimerization of the core histones and their
assembly into nucleosome octamers. The minimal HFD
contains three alpha-helices linked by two loops and is
found in core histones, TAFS and many other
transcription factors. TFIID has a histone octamer-like
substructure. TAF4 domain interacts with TAF12 and makes
a novel histone-like heterodimer that binds DNA and has
a core promoter function of a subset of genes.
Length = 212
Score = 29.2 bits (66), Expect = 2.0
Identities = 15/82 (18%), Positives = 42/82 (51%), Gaps = 16/82 (19%)
Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
++K+ + ++ ++++++KR +++ + R ++ E+ + K+K K+ +K ++
Sbjct: 112 VRKQLRFLEQLEREEEEKRDEEERERLL-----RAAKSRSEQSRLKQKAKEMQKEEDEEM 166
Query: 282 RKK-----------KRKKKKNQ 292
R + RKKKK +
Sbjct: 167 RHRAANATALAAIGGRKKKKRR 188
>gnl|CDD|236698 PRK10475, PRK10475, 23S rRNA pseudouridine synthase F; Provisional.
Length = 290
Score = 29.3 bits (66), Expect = 2.1
Identities = 14/58 (24%), Positives = 21/58 (36%), Gaps = 11/58 (18%)
Query: 229 KKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKR 286
K K K K K K+ + E+ EK + K+ T ++KK R
Sbjct: 244 KPKAKAKPKTAGIKRP-----------VVKMEKTAEKGGRPASNGKRFTSPGRKKKGR 290
>gnl|CDD|211393 cd11381, NSA2, pre-ribosomal protein NSA2 (Nop seven-associated 2).
NSA2 appears to be a protein required for the
maturation of 27S pre-rRNA in yeast; it has been
characterized in mammalian cells as a nucleolar protein
that might play a role in the regulation of the cell
cycle and in cell proliferation.
Length = 257
Score = 29.5 bits (67), Expect = 2.1
Identities = 18/73 (24%), Positives = 38/73 (52%), Gaps = 3/73 (4%)
Query: 226 KKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKK---KKTTKKKKR 282
+KR ++ +++KRKK+ + H+ + RG + + KK+ K+K KKT K +
Sbjct: 8 RKRHGRRLDHEERKRKKEAREPHKRSKKAQKLRGLKAKLYNKKRYKEKIQMKKTIKMHEE 67
Query: 283 KKKRKKKKNQHHR 295
+ ++K ++
Sbjct: 68 RNVKQKVDDKVPE 80
>gnl|CDD|234229 TIGR03490, Mycoplas_LppA, mycoides cluster lipoprotein, LppA/P72
family. Members of this protein family occur in
Mycoplasma mycoides, Mycoplasma hyopneumoniae, and
related Mycoplasmas in small paralogous families that
may also include truncated forms and/or pseudogenes.
Members are predicted lipoproteins with a conserved
signal peptidase II processing and lipid attachment
site. Note that the name for certain characterized
members, p72, reflects an anomalous apparent molecular
weight, given a theoretical MW of about 61 kDa.
Length = 541
Score = 29.8 bits (67), Expect = 2.1
Identities = 15/69 (21%), Positives = 31/69 (44%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
+ +KK + K + K KK N+ + N+++ ++EE +KK K K
Sbjct: 33 QPEKKPEIKPNENTPKIPKKPDNKEPSENNNNKSNNENKDEENPSSTNPEKKPDPSKNKE 92
Query: 283 KKKRKKKKN 291
+ ++ K +
Sbjct: 93 EIEKPKDEP 101
>gnl|CDD|220207 pfam09368, Sas10_Utp3_C, Sas10 C-terminal domain. This family
contains a C-terminal presumed domain in Sas10 which
hash been identified as a regulator of chromatin
silencing.
Length = 76
Score = 27.6 bits (62), Expect = 2.2
Identities = 17/59 (28%), Positives = 24/59 (40%), Gaps = 2/59 (3%)
Query: 219 GFVMKKKKKRK--KKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKK 275
G K+KK R+ + KK+KK +K K+ R V G E K + K
Sbjct: 18 GLTRKRKKDRRNPRVKKRKKYEKAVKRLKSQRRVVREEDGGYGGETTGIKTNVVRSVKL 76
Score = 26.1 bits (58), Expect = 7.5
Identities = 10/29 (34%), Positives = 19/29 (65%)
Query: 265 KKKKKKKKKKKTTKKKKRKKKRKKKKNQH 293
+K+KK ++ + K+KK +K K+ K+Q
Sbjct: 21 RKRKKDRRNPRVKKRKKYEKAVKRLKSQR 49
>gnl|CDD|225924 COG3389, COG3389, Uncharacterized protein conserved in archaea
[Function unknown].
Length = 277
Score = 29.4 bits (66), Expect = 2.2
Identities = 38/201 (18%), Positives = 80/201 (39%), Gaps = 36/201 (17%)
Query: 45 GMLMCFIIITAILGNLLVIISVIKHRKLRIITNYFVVSLAFADLLVALCVMPFNA----I 100
+ F+ I +L+ I + + R I + V S+ F L+ L + I
Sbjct: 34 NSVYYFVYILVATLFILLAIKLGRKWIFRGIYSVAVASVLFYVFLILLSIFLVLVYAINI 93
Query: 101 VSLT------------DEWYF----GYFM----CDVWN-SFDVYFSTASILHLCCISVDR 139
S+ EWY G+F+ ++ SF V +++ L ++V
Sbjct: 94 ASIGLAIGLVYLLYKYPEWYVIDLAGFFLAVGIAAIFGISFGVL---PAVVLLIALAVYD 150
Query: 140 YYAIVKPLQYPIIMNQNTVLLMLSNVWILPGIISFTPIMLGWYTTPDHKQYRKH-----H 194
++ K ++ I + + + L L ++++P ++++ + + D Y
Sbjct: 151 AISVYKT-RHMISLAEGVMDLDLPMLFVIPENLAYSFVEDAFENRGDGNAYMIGLGDAVM 209
Query: 195 PNVCIFVVNVYYALISSSISF 215
P+ I VV+ + LIS ++F
Sbjct: 210 PS--ILVVSAAFFLISPILAF 228
>gnl|CDD|218738 pfam05766, NinG, Bacteriophage Lambda NinG protein. NinG or Rap is
involved in recombination. Rap (recombination adept with
plasmid) increases lambda-by-plasmid recombination
catalyzed by Escherichia coli's RecBCD pathway.
Length = 188
Score = 28.8 bits (65), Expect = 2.2
Identities = 11/36 (30%), Positives = 23/36 (63%), Gaps = 1/36 (2%)
Query: 265 KKKKKKKKKKKTTKKKKRKKKRKKKKNQHHRDVWQK 300
++K ++KK+K + ++R+ K +K+K + D W K
Sbjct: 36 REKAQEKKRKAEAQAERRELKARKEKLKTRSD-WLK 70
Score = 27.7 bits (62), Expect = 5.6
Identities = 7/26 (26%), Positives = 18/26 (69%)
Query: 219 GFVMKKKKKRKKKKKKKKKKKRKKKK 244
+K++K ++KK+K + + +R++ K
Sbjct: 31 ALALKREKAQEKKRKAEAQAERRELK 56
>gnl|CDD|218440 pfam05110, AF-4, AF-4 proto-oncoprotein. This family consists of
AF4 (Proto-oncogene AF4) and FMR2 (Fragile X E mental
retardation syndrome) nuclear proteins. These proteins
have been linked to human diseases such as acute
lymphoblastic leukaemia and mental retardation. The
family also contains a Drosophila AF4 protein homologue
Lilliputian which contains an AT-hook domain.
Lilliputian represents a novel pair-rule gene that acts
in cytoskeleton regulation, segmentation and
morphogenesis in Drosophila.
Length = 1154
Score = 29.9 bits (67), Expect = 2.3
Identities = 14/70 (20%), Positives = 29/70 (41%), Gaps = 3/70 (4%)
Query: 224 KKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKK---KKKKKKTTKKK 280
KK+ K +K K ++K KN D ++ R EE+ + K+
Sbjct: 733 KKQTSKTASEKSSSKGKRKHKNDEEADKIESKKQRLEEKSSSCSPSSSSSHHHSSSNKES 792
Query: 281 KRKKKRKKKK 290
++ + K+++
Sbjct: 793 RKSSRNKEEE 802
>gnl|CDD|233605 TIGR01865, cas_Csn1, CRISPR-associated protein Cas9/Csn1, subtype
II/NMEMI. CRISPR loci appear to be mobile elements with
a wide host range. This model represents a protein found
only in CRISPR-containing species, near other
CRISPR-associated proteins (cas), as part of the NMENI
subtype of CRISPR/Cas locus. The species range so far
for this protein is animal pathogens and commensals only
[Mobile and extrachromosomal element functions, Other].
Length = 805
Score = 29.7 bits (67), Expect = 2.3
Identities = 10/51 (19%), Positives = 18/51 (35%), Gaps = 4/51 (7%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKK 273
++ K++ KK + K K+ + + EE E K K
Sbjct: 528 GTNFGKRNSKERYKKNEDKIKE----FASALGKEILKEEPTENSSKNILKL 574
>gnl|CDD|240253 PTZ00068, PTZ00068, 60S ribosomal protein L13a; Provisional.
Length = 202
Score = 28.9 bits (65), Expect = 2.3
Identities = 14/43 (32%), Positives = 21/43 (48%), Gaps = 3/43 (6%)
Query: 261 EEEEKKKKKKKKKKKTTKKKKRKKKRKK---KKNQHHRDVWQK 300
EE+ K++ KKK +K K+ RKK K + V +K
Sbjct: 156 EEKRKERAAAYYKKKVKLRKAWKEARKKALAKLPKAIVAVLKK 198
>gnl|CDD|240341 PTZ00272, PTZ00272, heat shock protein 83 kDa (Hsp83); Provisional.
Length = 701
Score = 29.6 bits (66), Expect = 2.3
Identities = 20/77 (25%), Positives = 40/77 (51%), Gaps = 3/77 (3%)
Query: 226 KKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKK 285
+K +K+ + ++ KK ++ + + G+E KKKK KK K+ TK+ + + K
Sbjct: 208 EKTTEKEVTDEDEEDTKKADEDGEEPKVEEVKEGDEG---KKKKTKKVKEVTKEYEVQNK 264
Query: 286 RKKKKNQHHRDVWQKKY 302
K + +DV +++Y
Sbjct: 265 HKPLWTRDPKDVTKEEY 281
>gnl|CDD|204935 pfam12474, PKK, Polo kinase kinase. This domain family is found in
eukaryotes, and is approximately 140 amino acids in
length. The family is found in association with
pfam00069. Polo-like kinase 1 (Plx1) is essential during
mitosis for the activation of Cdc25C, for spindle
assembly, and for cyclin B degradation. This family is
Polo kinase kinase (PKK) which phosphorylates Polo
kinase and Polo-like kinase to activate them. PKK is a
serine/threonine kinase.
Length = 142
Score = 28.4 bits (64), Expect = 2.4
Identities = 13/68 (19%), Positives = 35/68 (51%), Gaps = 5/68 (7%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
++ ++ K+ + ++K + K K ++E EK + ++++KK K +K
Sbjct: 48 QELRRLPKRIRAEQKTRLKMFKESLK-----IEKKELKQEVEKLPRFQEQEKKRMKAEKE 102
Query: 283 KKKRKKKK 290
++++K +K
Sbjct: 103 EQEQKHQK 110
>gnl|CDD|185507 PTZ00191, PTZ00191, 60S ribosomal protein L23a; Provisional.
Length = 145
Score = 28.5 bits (64), Expect = 2.4
Identities = 13/31 (41%), Positives = 16/31 (51%)
Query: 265 KKKKKKKKKKKTTKKKKRKKKRKKKKNQHHR 295
K KKK KK K KK + KKRK + +
Sbjct: 6 KAKKKAKKAAKAAKKGVKVKKRKVRTSVRFF 36
Score = 28.1 bits (63), Expect = 3.2
Identities = 11/31 (35%), Positives = 15/31 (48%)
Query: 265 KKKKKKKKKKKTTKKKKRKKKRKKKKNQHHR 295
KKK KK K K K KK++ + + R
Sbjct: 7 AKKKAKKAAKAAKKGVKVKKRKVRTSVRFFR 37
>gnl|CDD|148491 pfam06898, YqfD, Putative stage IV sporulation protein YqfD. This
family consists of several putative bacterial stage IV
sporulation (SpoIV) proteins. YqfD of Bacillus subtilis
is known to be essential for efficient sporulation
although its exact function is unknown.
Length = 383
Score = 29.3 bits (66), Expect = 2.5
Identities = 16/72 (22%), Positives = 34/72 (47%)
Query: 217 IPGFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKT 276
I G + K ++ + +K K++ K + + I R + E +++ K K++ +
Sbjct: 277 IGGKKLPLKFRKNEFEKYDKEEDEKPLFLKWKLPISIVRETYYEVQDKVVKLTKEEAVEK 336
Query: 277 TKKKKRKKKRKK 288
KK +KK K+
Sbjct: 337 GKKLAKKKLEKE 348
>gnl|CDD|217010 pfam02387, IncFII_repA, IncFII RepA protein family. This protein
is plasmid encoded and found to be essential for plasmid
replication.
Length = 279
Score = 29.2 bits (66), Expect = 2.6
Identities = 18/72 (25%), Positives = 28/72 (38%), Gaps = 7/72 (9%)
Query: 228 RKKKKKKKKKKKRKKKKNQ-----HHRDVWINRTSRGEEEEEKKKKKKKKKKKTTK--KK 280
RK++ + + KK +KK R R ++K KK+++ K K
Sbjct: 172 RKQQLEWENKKLKKKGLIPLTLDEARRRAKEFHIERAFSYRTERKAFGKKRRRARKLAKL 231
Query: 281 KRKKKRKKKKNQ 292
K RKK N
Sbjct: 232 DEKDIRKKILNA 243
>gnl|CDD|217315 pfam03006, HlyIII, Haemolysin-III related. Members of this family
are integral membrane proteins. This family includes a
protein with hemolytic activity from Bacillus cereus. It
has been proposed that YOL002c encodes a Saccharomyces
cerevisiae protein that plays a key role in metabolic
pathways that regulate lipid and phosphate metabolism.
In eukaryotes, members are seven-transmembrane pass
molecules found to encode functional receptors with a
broad range of apparent ligand specificities, including
progestin and adipoQ receptors, and hence have been
named PAQR proteins. The mammalian members include
progesterone binding proteins. Unlike the case with GPCR
receptor proteins, the evolutionary ancestry of the
members of this family can be traced back to the
Archaea.
Length = 207
Score = 28.8 bits (65), Expect = 2.8
Identities = 22/95 (23%), Positives = 40/95 (42%), Gaps = 7/95 (7%)
Query: 47 LMCFIIITAILGNLLVIISVIKHRKLRI---ITNYFVVSLAFADLLVALCVMPFNAIVSL 103
L+ FI A+LG LL + + + R LR + ++ + L++AL +V L
Sbjct: 107 LLIFIWGLALLGILLKLFWLKRFRWLRTVLYLLMGWLGIIPIKHLILALGGGGLVLLV-L 165
Query: 104 TDEWYF---GYFMCDVWNSFDVYFSTASILHLCCI 135
Y ++ FD++ + I HL +
Sbjct: 166 GGVLYTLGAIFYALRFPGPFDIWGHSHQIFHLFVV 200
>gnl|CDD|238706 cd01418, Ribosomal_L19e_A, Ribosomal protein L19e, archaeal. L19e
is found in the large ribosomal subunit of eukaryotes
and archaea. L19e is distinct from the ribosomal subunit
L19, which is found in prokaryotes. It consists of two
small globular domains connected by an extended segment.
It is located toward the surface of the large subunit,
with one exposed end involved in forming the
intersubunit bridge with the small subunit. The other
exposed end is involved in forming the translocon
binding site, along with L22, L23, L24, L29, and L31e
subunits.
Length = 145
Score = 28.4 bits (64), Expect = 2.8
Identities = 22/90 (24%), Positives = 37/90 (41%), Gaps = 14/90 (15%)
Query: 219 GFVMKKKKK-----RKKKKKKKKKKKRKKK---------KNQHHRDVWINRTSRGEEEEE 264
G + K KK R K++ +K+KK R++ ++ WI +
Sbjct: 45 GVIKAKPKKGISRGRLKERHEKRKKGRRRGPGSRKGKKGARTPKKERWIKTIRALRRYLK 104
Query: 265 KKKKKKKKKKKTTKKKKRKKKRKKKKNQHH 294
+ + K K KKT +K RK K ++ H
Sbjct: 105 ELRDKGKIDKKTYRKLYRKAKGGSFRSLSH 134
>gnl|CDD|240370 PTZ00342, PTZ00342, acyl-CoA synthetase; Provisional.
Length = 746
Score = 29.3 bits (66), Expect = 2.8
Identities = 15/68 (22%), Positives = 29/68 (42%)
Query: 221 VMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKK 280
++K K+ K++K N+++++ E E+ + K+K K K
Sbjct: 210 LIKSKEININKEEKNNGSNVNNNGNKNNKEEQKGNDLSNELEDISLGPLEYDKEKLEKIK 269
Query: 281 KRKKKRKK 288
K+K KK
Sbjct: 270 DLKEKAKK 277
>gnl|CDD|235202 PRK04023, PRK04023, DNA polymerase II large subunit; Validated.
Length = 1121
Score = 29.5 bits (67), Expect = 2.9
Identities = 17/44 (38%), Positives = 20/44 (45%), Gaps = 6/44 (13%)
Query: 236 KKKKRKKKKNQHHRDVW------INRTSRGEEEEEKKKKKKKKK 273
K K K ++ D W I+ EEEEEKKK K K K
Sbjct: 252 KAPKILKYVDKLKIDGWEWLEELISGKKEKEEEEEKKKIKPKDK 295
>gnl|CDD|200340 TIGR03927, T7SS_EssA_Firm, type VII secretion protein EssA.
Members of this family are associated with type VII
secretion of WXG100 family targets in the Firmicutes,
but not in the Actinobacteria. This highly divergent
protein family consists largely of a central region of
highly polar low-complexity sequence containing
occasional LF motifs in weak repeats about 17 residues
in length, flanked by hydrophobic N- and C-terminal
regions [Protein fate, Protein and peptide secretion and
trafficking].
Length = 150
Score = 28.5 bits (64), Expect = 2.9
Identities = 10/55 (18%), Positives = 25/55 (45%), Gaps = 9/55 (16%)
Query: 238 KKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQ 292
KK + + ++ E E +K+ +++KK T +K ++K +++
Sbjct: 33 KKDIEINTDYLQE---------ETELDKELFTPEEQKKITFQKHKEKPEQEELKN 78
>gnl|CDD|224259 COG1340, COG1340, Uncharacterized archaeal coiled-coil protein
[Function unknown].
Length = 294
Score = 28.9 bits (65), Expect = 2.9
Identities = 12/77 (15%), Positives = 33/77 (42%)
Query: 225 KKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKK 284
+ + K ++ + +K+ ++ H + E EE + + + ++ K K +
Sbjct: 190 EYHEEMIKLFEEADELRKEADELHEEFVELSKKIDELHEEFRNLQNELRELEKKIKALRA 249
Query: 285 KRKKKKNQHHRDVWQKK 301
K K K + R+ +++
Sbjct: 250 KEKAAKRREKREELKER 266
>gnl|CDD|203489 pfam06644, ATP11, ATP11 protein. This family consists of several
eukaryotic ATP11 proteins. In Saccharomyces cerevisiae,
expression of functional F1-ATPase requires two proteins
encoded by the ATP11 and ATP12 genes. Atp11p is a
molecular chaperone of the mitochondrial matrix that
participates in the biogenesis pathway to form F1, the
catalytic unit of the ATP synthase.
Length = 250
Score = 28.9 bits (65), Expect = 2.9
Identities = 14/60 (23%), Positives = 21/60 (35%), Gaps = 10/60 (16%)
Query: 234 KKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQH 293
K + K +K K + E KK K K +KK KK K+ ++
Sbjct: 3 KYRSKLLQKAKESGLEFI----------ERLKKALKDKIEKKEFSAKKPPTGPSKQASKF 52
>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
protein; Reviewed.
Length = 782
Score = 29.4 bits (67), Expect = 3.0
Identities = 14/71 (19%), Positives = 36/71 (50%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKR 282
K + +K+ ++ K+ KK+ ++ +++ + + + + +K+ +KK
Sbjct: 565 DKLLEEAEKEAQQAIKEAKKEADEIIKELRQLQKGGYASVKAHELIEARKRLNKANEKKE 624
Query: 283 KKKRKKKKNQH 293
KKK+K+K+ Q
Sbjct: 625 KKKKKQKEKQE 635
Score = 28.3 bits (64), Expect = 5.7
Identities = 26/87 (29%), Positives = 42/87 (48%), Gaps = 14/87 (16%)
Query: 225 KKKRKKKKKKKKKKKRKKKKNQHHRDVWI-NRTSRGE-----EEEE--------KKKKKK 270
K +KK+KKKKK+K K+++ + +V + +GE +++E K K
Sbjct: 617 NKANEKKEKKKKKQKEKQEELKVGDEVKYLSLGQKGEVLSIPDDKEAIVQAGIMKMKVPL 676
Query: 271 KKKKKTTKKKKRKKKRKKKKNQHHRDV 297
+K K KK+KKK+ K R V
Sbjct: 677 SDLEKIQKPKKKKKKKPKTVKPKPRTV 703
Score = 27.9 bits (63), Expect = 8.0
Identities = 18/54 (33%), Positives = 27/54 (50%), Gaps = 4/54 (7%)
Query: 213 ISFWIPGFVMKKKKKRKKKKKKKKKKKRKKKKN-QHHRDVWINRTSRGEEEEEK 265
+ +P ++K +K KKKKKKK K + K + D+ R R EE E+
Sbjct: 670 MKMKVPLSDLEKIQKPKKKKKKKPKTVKPKPRTVSLELDL---RGMRYEEALER 720
>gnl|CDD|201540 pfam00992, Troponin, Troponin. Troponin (Tn) contains three
subunits, Ca2+ binding (TnC), inhibitory (TnI), and
tropomyosin binding (TnT). this Pfam contains members of
the TnT subunit. Troponin is a complex of three
proteins, Ca2+ binding (TnC), inhibitory (TnI), and
tropomyosin binding (TnT). The troponin complex
regulates Ca++ induced muscle contraction. This family
includes troponin T and troponin I. Troponin I binds to
actin and troponin T binds to tropomyosin.
Length = 131
Score = 27.9 bits (63), Expect = 3.0
Identities = 15/40 (37%), Positives = 17/40 (42%), Gaps = 6/40 (15%)
Query: 261 EEEEKKKK---KKKKKKKTTKKKKRKKK---RKKKKNQHH 294
E E+ KKK + K KK T KK RK K H
Sbjct: 78 EIEDLKKKVNDLRGKFKKPTLKKVRKSADAMLKALLGSKH 117
>gnl|CDD|203462 pfam06476, DUF1090, Protein of unknown function (DUF1090). This
family consists of several bacterial proteins of unknown
function and is known as YqjC in E. coli.
Length = 115
Score = 28.0 bits (63), Expect = 3.0
Identities = 10/68 (14%), Positives = 30/68 (44%), Gaps = 20/68 (29%)
Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
++ +++ K ++K+++ +R E+E K+ + K K KK++
Sbjct: 64 LRAERQAKVEEKEQEVAER--------------------EQELKEAQAKGDADKIEKKQR 103
Query: 282 RKKKRKKK 289
+ + + +
Sbjct: 104 KLAEAQAE 111
>gnl|CDD|227925 COG5638, COG5638, Uncharacterized conserved protein [Function
unknown].
Length = 622
Score = 29.4 bits (65), Expect = 3.0
Identities = 17/68 (25%), Positives = 31/68 (45%), Gaps = 12/68 (17%)
Query: 225 KKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKK 284
K RK +++++ +K+R+ KK + DV + K KK+ KK K K +
Sbjct: 436 KLDRKLRERQENRKERQLKKTKDDSDV------------DLKDKKESINKKNKKGKHAIE 483
Query: 285 KRKKKKNQ 292
+ K +
Sbjct: 484 RTAASKEE 491
Score = 28.6 bits (63), Expect = 5.6
Identities = 18/66 (27%), Positives = 29/66 (43%), Gaps = 8/66 (12%)
Query: 225 KKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKK 284
KK KK +K+KR K I +T G+ E + KK+K +K + + + K
Sbjct: 564 KKTGGMKKIMDEKRKRLKNN--------IEQTQDGKPELKIKKRKAEKGDQRQELDRIVK 615
Query: 285 KRKKKK 290
K+
Sbjct: 616 SIKRSG 621
>gnl|CDD|227935 COG5648, NHP6B, Chromatin-associated proteins containing the HMG
domain [Chromatin structure and dynamics].
Length = 211
Score = 28.7 bits (64), Expect = 3.0
Identities = 18/71 (25%), Positives = 27/71 (38%), Gaps = 1/71 (1%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHR-DVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
++K++ KK K + R V + EE K K + +KKKK
Sbjct: 133 REKEEYNKKLPNKAPIGPFIENEPKIRPKVEGPSPDKALVEETKIISKAWSELDESKKKK 192
Query: 282 RKKKRKKKKNQ 292
K KK K +
Sbjct: 193 YIDKYKKLKEE 203
>gnl|CDD|219953 pfam08648, DUF1777, Protein of unknown function (DUF1777). This is
a family of eukaryotic proteins of unknown function.
Some of the proteins in this family are putative nucleic
acid binding proteins.
Length = 158
Score = 28.3 bits (63), Expect = 3.1
Identities = 13/73 (17%), Positives = 41/73 (56%), Gaps = 6/73 (8%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWI-----NRTSRGEEEEEKKKKKKKKKKKTT 277
+++++R++ + +++ ++R+ + HR +R SR ++ +K+++ K
Sbjct: 23 RRERRRERSRSRERDRRRRSRSRSPHRSRRSRSPRRHR-SRSRSPSRRRDRKRERDKDAR 81
Query: 278 KKKKRKKKRKKKK 290
+ KKR++++ K+
Sbjct: 82 EPKKRERQKLIKE 94
>gnl|CDD|219761 pfam08243, SPT2, SPT2 chromatin protein. This family includes the
Saccharomyces cerevisiae protein SPT2 which is a
chromatin protein involved in transcriptional
regulation.
Length = 116
Score = 27.9 bits (62), Expect = 3.1
Identities = 13/19 (68%), Positives = 14/19 (73%)
Query: 255 RTSRGEEEEEKKKKKKKKK 273
R EEEEEK+KKKKK K
Sbjct: 98 RELAREEEEEKRKKKKKNK 116
Score = 27.5 bits (61), Expect = 3.8
Identities = 15/62 (24%), Positives = 29/62 (46%), Gaps = 8/62 (12%)
Query: 236 KKKKRKKKKNQHHRDVWINRTSRGEE--EEEKKKKK------KKKKKKTTKKKKRKKKRK 287
K +KR D N + E +EE++ + +++ + +++KRKKK+K
Sbjct: 55 KGRKRSYYDRYDEDDALDNMEATFMEIQKEERRSARMARLEDERELAREEEEEKRKKKKK 114
Query: 288 KK 289
K
Sbjct: 115 NK 116
>gnl|CDD|237744 PRK14521, rpsP, 30S ribosomal protein S16; Provisional.
Length = 186
Score = 28.6 bits (64), Expect = 3.1
Identities = 13/49 (26%), Positives = 21/49 (42%), Gaps = 4/49 (8%)
Query: 247 HHRDVWINRTSRGEEEEEKK----KKKKKKKKKTTKKKKRKKKRKKKKN 291
H + + + E + E K K++K+ K K K K K+ KK
Sbjct: 83 KHLLGGVAKGAFTEAQAEAKFEAWKEEKEGKVNAKKDKLSKAKKAAKKA 131
Score = 27.8 bits (62), Expect = 4.8
Identities = 16/58 (27%), Positives = 20/58 (34%), Gaps = 15/58 (25%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKK 280
K+ K KK K K KK KK E EKK + + + KK
Sbjct: 110 KEGKVNAKKDKLSKAKKAAKKAAL---------------EAEKKVNEARAEAVAEKKA 152
>gnl|CDD|223587 COG0513, SrmB, Superfamily II DNA and RNA helicases [DNA
replication, recombination, and repair / Transcription /
Translation, ribosomal structure and biogenesis].
Length = 513
Score = 29.0 bits (65), Expect = 3.2
Identities = 6/88 (6%), Positives = 17/88 (19%), Gaps = 8/88 (9%)
Query: 211 SSISFWIPGFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKK---- 266
+ P K R +++ KK + + R +
Sbjct: 401 VLLPLDEPEDAKLLKTTRPGLEEESDISDEIKKLKSSKKALLRGLGVRFTLSKLLANLGK 460
Query: 267 ----KKKKKKKKKTTKKKKRKKKRKKKK 290
+++ +
Sbjct: 461 EIPGAGDAVTIDPELERRSPNSADDIEY 488
>gnl|CDD|227458 COG5129, MAK16, Nuclear protein with HMG-like acidic region
[General function prediction only].
Length = 303
Score = 28.9 bits (64), Expect = 3.2
Identities = 9/37 (24%), Positives = 19/37 (51%)
Query: 257 SRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQH 293
S +E+E+ K K +K+K KK ++ + + +
Sbjct: 256 SDEDEDEDNKGKIRKRKTDDAKKSRKPHIHIEYEQER 292
Score = 28.5 bits (63), Expect = 4.6
Identities = 12/35 (34%), Positives = 17/35 (48%)
Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQHH 294
E+E+E K K +K+K KK RK + Q
Sbjct: 258 EDEDEDNKGKIRKRKTDDAKKSRKPHIHIEYEQER 292
>gnl|CDD|233390 TIGR01385, TFSII, transcription elongation factor S-II. This model
represents eukaryotic transcription elongation factor
S-II. This protein allows stalled RNA transcription
complexes to perform a cleavage of the nascent RNA and
restart at the newly generated 3-prime end.
Length = 299
Score = 29.0 bits (65), Expect = 3.2
Identities = 18/66 (27%), Positives = 27/66 (40%)
Query: 226 KKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKK 285
K KK K KK K K+ H ++T+ GE K++ K + K + K
Sbjct: 61 SKLAKKIIKSWKKVVDKNKSDHPGGNPEDKTTVGESVNSVKQEAKSQSDKIEQPKYVSSS 120
Query: 286 RKKKKN 291
+ KN
Sbjct: 121 PRNAKN 126
>gnl|CDD|185616 PTZ00436, PTZ00436, 60S ribosomal protein L19-like protein;
Provisional.
Length = 357
Score = 28.8 bits (63), Expect = 3.4
Identities = 14/70 (20%), Positives = 37/70 (52%), Gaps = 4/70 (5%)
Query: 221 VMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKK 280
+ K K ++KK+++ ++ K+ K++ HR ++ + E + +K +++ +++
Sbjct: 142 IHKVKNEKKKERQLAEQLAAKRLKDEQHR----HKARKQELRKREKDRERARREDAAAAA 197
Query: 281 KRKKKRKKKK 290
K+K KK
Sbjct: 198 AAKQKAAAKK 207
>gnl|CDD|204122 pfam09072, TMA7, Translation machinery associated TMA7. TMA7 plays
a role in protein translation. Deletions of the TMA7
gene results in altered protein synthesis rates.
Length = 63
Score = 26.7 bits (59), Expect = 3.4
Identities = 18/61 (29%), Positives = 26/61 (42%)
Query: 228 RKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRK 287
R+ KKK K +K+KK D+ + + E + K+ K K K KK K
Sbjct: 3 RQGGKKKPLKAPKKQKKELDEEDLAFKQKQKEEAKALKELAAKAKGKGPLGGGGIKKSGK 62
Query: 288 K 288
K
Sbjct: 63 K 63
>gnl|CDD|224977 COG2066, GlsA, Glutaminase [Amino acid transport and metabolism].
Length = 309
Score = 28.8 bits (65), Expect = 3.5
Identities = 11/29 (37%), Positives = 13/29 (44%), Gaps = 6/29 (20%)
Query: 109 FGYFMCDVWNSFDVYFSTASILHLCCISV 137
FG DV + DVYF H C I +
Sbjct: 179 FGNLEHDVEEALDVYF------HQCAIEM 201
>gnl|CDD|234043 TIGR02871, spore_ylbJ, sporulation integral membrane protein YlbJ.
Members of this protein family are found exclusively in
Firmicutes (low-GC Gram-positive bacterial) and are
known from studies in Bacillus subtilis to be part of
the sigma-E regulon. Mutation leads to a sporulation
defect, confirming that members of this protein family,
YlbJ, are sporulation proteins. This protein appears to
be universal among endospore-forming bacteria, but is
encoded by a pair ORFs distant from eash other in
Symbiobacterium thermophilum IAM14863 [Cellular
processes, Sporulation and germination].
Length = 362
Score = 28.8 bits (65), Expect = 3.5
Identities = 14/34 (41%), Positives = 20/34 (58%)
Query: 205 YYALISSSISFWIPGFVMKKKKKRKKKKKKKKKK 238
Y LIS +S G +++ K+RK+K KKKK
Sbjct: 142 YILLISHYLSCITMGLILRFYKRRKRKILSKKKK 175
>gnl|CDD|219461 pfam07543, PGA2, Protein trafficking PGA2. A Saccharomyces
cerevisiae member of this family (PGA2) is an ER protein
which has been implicated in protein trafficking.
Length = 139
Score = 27.9 bits (62), Expect = 3.6
Identities = 22/130 (16%), Positives = 48/130 (36%), Gaps = 33/130 (25%)
Query: 200 FVVNVYYALISSSISFWI-------------PGFVMKKKKKRKKKKKKKKKKKRKKKKNQ 246
F N+Y + S WI P F+ K ++K+ +K++ ++ + ++ +
Sbjct: 1 FRNNLYGSFAQMSAKDWIRIIVIVGGYLLIRPYFIKLGAKAQEKEHEKERAEREEAREKK 60
Query: 247 HHRDVWINRTSRG----------EEEEEKK----------KKKKKKKKKTTKKKKRKKKR 286
R EE+EE KK +K+++K +K +++
Sbjct: 61 AKISPNALRGGATAGHGEEDTDDEEDEEDFATPSAVPQWGKKARKRQRKVIRKLLEAEEQ 120
Query: 287 KKKKNQHHRD 296
++ D
Sbjct: 121 LREDQYDDED 130
>gnl|CDD|219392 pfam07382, HC2, Histone H1-like nucleoprotein HC2. This family
contains the bacterial histone H1-like nucleoprotein HC2
(approximately 200 residues long), which seems to be
found mostly in Chlamydia. HC2 functions in DNA
condensation, although it has been suggested that it
also has other roles.
Length = 187
Score = 28.6 bits (63), Expect = 3.6
Identities = 22/77 (28%), Positives = 29/77 (37%)
Query: 224 KKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRK 283
KK KK KK K+ K + + + +K KK KKT KK
Sbjct: 51 KKTVAAKKPAAKKAAKKAVAKKVVAKKPVAKKAVAKKATAKKVAAKKVVAKKTVAKKAAA 110
Query: 284 KKRKKKKNQHHRDVWQK 300
KK KK + V +K
Sbjct: 111 KKPAAKKAVAKKAVARK 127
>gnl|CDD|216919 pfam02181, FH2, Formin Homology 2 Domain.
Length = 372
Score = 28.8 bits (65), Expect = 3.6
Identities = 25/76 (32%), Positives = 34/76 (44%), Gaps = 11/76 (14%)
Query: 228 RKKKKKKKKKKKR----KKKKNQHHRDVW--INRTSRGEEE-----EEKKKKKKKKKKKT 276
+KK+ K KKK K K Q VW ++ S ++ EE K KKKK
Sbjct: 1 KKKEPKPKKKLKPLHWDKVNPAQDRGTVWDELDEESFEKDLDLSELEELFSAKAKKKKSK 60
Query: 277 TKKKKRKKKRKKKKNQ 292
+KK K+KKK+
Sbjct: 61 KSEKKSSSKKKKKEIS 76
Score = 28.4 bits (64), Expect = 5.6
Identities = 13/28 (46%), Positives = 17/28 (60%)
Query: 219 GFVMKKKKKRKKKKKKKKKKKRKKKKNQ 246
F K KKK+ KK +KK K+KKK+
Sbjct: 49 LFSAKAKKKKSKKSEKKSSSKKKKKEIS 76
>gnl|CDD|146145 pfam03357, Snf7, Snf7. This family of proteins are involved in
protein sorting and transport from the endosome to the
vacuole/lysosome in eukaryotic cells. Vacuoles/lysosomes
play an important role in the degradation of both lipids
and cellular proteins. In order to perform this
degradative function, vacuoles/lysosomes contain
numerous hydrolases which have been transported in the
form of inactive precursors via the biosynthetic pathway
and are proteolytically activated upon delivery to the
vacuole/lysosome. The delivery of transmembrane
proteins, such as activated cell surface receptors to
the lumen of the vacuole/lysosome, either for
degradation/downregulation, or in the case of
hydrolases, for proper localisation, requires the
formation of multivesicular bodies (MVBs). These late
endosomal structures are formed by invaginating and
budding of the limiting membrane into the lumen of the
compartment. During this process, a subset of the
endosomal membrane proteins is sorted into the forming
vesicles. Mature MVBs fuse with the vacuole/lysosome,
thereby releasing cargo containing vesicles into its
hydrolytic lumen for degradation. Endosomal proteins
that are not sorted into the intralumenal MVB vesicles
are either recycled back to the plasma membrane or Golgi
complex, or remain in the limiting membrane of the MVB
and are thereby transported to the limiting membrane of
the vacuole/lysosome as a consequence of fusion.
Therefore, the MVB sorting pathway plays a critical role
in the decision between recycling and degradation of
membrane proteins. A few archaeal sequences are also
present within this family.
Length = 169
Score = 28.4 bits (64), Expect = 3.6
Identities = 9/45 (20%), Positives = 16/45 (35%)
Query: 248 HRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQ 292
+ E++ +K + + KK K K K+KK
Sbjct: 9 AIRELDKKQESLEKKIKKLEAEIKKLAKKGNKDAALILLKQKKRY 53
Score = 27.2 bits (61), Expect = 8.8
Identities = 13/51 (25%), Positives = 23/51 (45%), Gaps = 3/51 (5%)
Query: 253 INRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQHHRDVWQKKYT 303
+ + R +++++ +KK KK + KK KK K + QKK
Sbjct: 6 LRKAIRELDKKQESLEKKIKKLEAEIKKLAKKGNKDAALILLK---QKKRY 53
>gnl|CDD|214922 smart00935, OmpH, Outer membrane protein (OmpH-like). This family
includes outer membrane proteins such as OmpH among
others. Skp (OmpH) has been characterized as a molecular
chaperone that interacts with unfolded proteins as they
emerge in the periplasm from the Sec translocation
machinery.
Length = 140
Score = 27.9 bits (63), Expect = 3.6
Identities = 18/81 (22%), Positives = 47/81 (58%), Gaps = 6/81 (7%)
Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
++K+ K+++ + +K +K+ +K K + +D + E EKK+K+ +KK + ++K+
Sbjct: 23 LEKEFKKRQAELEKLEKELQKLKEKLQKD----AATLSEAAREKKEKELQKKVQEFQRKQ 78
Query: 282 RKKKRK--KKKNQHHRDVWQK 300
+K ++ K++ + + + K
Sbjct: 79 QKLQQDLQKRQQEELQKILDK 99
>gnl|CDD|226400 COG3883, COG3883, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 265
Score = 28.5 bits (64), Expect = 3.7
Identities = 12/73 (16%), Positives = 25/73 (34%), Gaps = 11/73 (15%)
Query: 223 KKKKKR-KKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKK-----KKKKKT 276
K+ KK ++K+ + K Q+ T +K +K K+ +
Sbjct: 154 KEDKKSLEEKQAALEDKLETLVALQNE-----LETQLNSLNSQKAEKNALIAALAAKEAS 208
Query: 277 TKKKKRKKKRKKK 289
+K + +K
Sbjct: 209 ALGEKAALEEQKA 221
>gnl|CDD|227615 COG5296, COG5296, Transcription factor involved in TATA site
selection and in elongation by RNA polymerase II
[Transcription].
Length = 521
Score = 28.9 bits (64), Expect = 3.8
Identities = 10/56 (17%), Positives = 29/56 (51%), Gaps = 1/56 (1%)
Query: 230 KKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKK 285
KKKK + KK ++++ + + + I R ++ +E ++ ++ +++ T +
Sbjct: 144 KKKKLLELKKTREREERLYSERHIEL-QRFKDYKELEESEQGLQEEYTPSYAEEAV 198
>gnl|CDD|223836 COG0765, HisM, ABC-type amino acid transport system, permease
component [Amino acid transport and metabolism].
Length = 222
Score = 28.6 bits (65), Expect = 3.8
Identities = 13/49 (26%), Positives = 27/49 (55%), Gaps = 6/49 (12%)
Query: 38 IFPFIIKGMLM------CFIIITAILGNLLVIISVIKHRKLRIITNYFV 80
PF++KG+L+ I++ +LG LL ++ + ++ LR + +V
Sbjct: 14 YLPFLLKGLLVTLLLTLLSIVLGLVLGLLLALMRLSGNKPLRWLARAYV 62
>gnl|CDD|235132 PRK03577, PRK03577, acid shock protein precursor; Provisional.
Length = 102
Score = 27.3 bits (60), Expect = 3.8
Identities = 14/54 (25%), Positives = 24/54 (44%)
Query: 241 KKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQHH 294
K HH + + + KK K KK++K ++K + K+ KK+ H
Sbjct: 38 APAKTTHHHKKQHKKAPEQKAQAAKKHHKNKKEQKAPEQKAQAAKKHAKKHSHK 91
Score = 26.5 bits (58), Expect = 7.1
Identities = 13/56 (23%), Positives = 24/56 (42%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTK 278
K KK+ KK +++ + +HH++ + + + KK KK K K
Sbjct: 40 AKTTHHHKKQHKKAPEQKAQAAKKHHKNKKEQKAPEQKAQAAKKHAKKHSHKTAAK 95
>gnl|CDD|149343 pfam08229, SHR3_chaperone, ER membrane protein SH3. This family of
proteins are membrane localised chaperones that are
required for correct plasma membrane localisation of
amino acid permeases (AAPs). SH3 prevents AAPs proteins
from aggregating and assists in their correct folding.
In the absence of SH3, AAPs are retained in the ER.
Length = 196
Score = 28.5 bits (64), Expect = 3.9
Identities = 9/26 (34%), Positives = 14/26 (53%)
Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKK 285
EE E K+++ +KK+ KKK
Sbjct: 170 AEEAEAAAAAKEEESAEGEKKESKKK 195
Score = 27.3 bits (61), Expect = 7.7
Identities = 8/27 (29%), Positives = 14/27 (51%)
Query: 261 EEEEKKKKKKKKKKKTTKKKKRKKKRK 287
EE + K+++ +KK KK+K
Sbjct: 170 AEEAEAAAAAKEEESAEGEKKESKKKK 196
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
primarily archaeal type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. It is found
in a single copy and is homodimeric in prokaryotes, but
six paralogs (excluded from this family) are found in
eukarotes, where SMC proteins are heterodimeric. This
family represents the SMC protein of archaea and a few
bacteria (Aquifex, Synechocystis, etc); the SMC of other
bacteria is described by TIGR02168. The N- and
C-terminal domains of this protein are well conserved,
but the central hinge region is skewed in composition
and highly divergent [Cellular processes, Cell division,
DNA metabolism, Chromosome-associated proteins].
Length = 1164
Score = 28.9 bits (65), Expect = 4.0
Identities = 15/91 (16%), Positives = 33/91 (36%), Gaps = 19/91 (20%)
Query: 225 KKKRKKKKKKKKKKKRKKKKNQHHRDVWINR----TSRGEE---------------EEEK 265
+ + K ++K +K KR+ + + D + + EEEK
Sbjct: 384 RDELKDYREKLEKLKREINELKRELDRLQEELQRLSEELADLNAAIAGIEAKINELEEEK 443
Query: 266 KKKKKKKKKKTTKKKKRKKKRKKKKNQHHRD 296
+ K + KK+ K ++ K + + +
Sbjct: 444 EDKALEIKKQEWKLEQLAADLSKYEQELYDL 474
>gnl|CDD|227509 COG5182, CUS1, Splicing factor 3b, subunit 2 [RNA processing and
modification].
Length = 429
Score = 28.8 bits (64), Expect = 4.1
Identities = 16/77 (20%), Positives = 26/77 (33%), Gaps = 12/77 (15%)
Query: 224 KKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRK 283
K +KR + K K + ++ ++KKK T K K
Sbjct: 5 KSRKRSGNNQNKNASVVNNKAEIAAMI------------DARRLEQKKKGGVTNSKGKTN 52
Query: 284 KKRKKKKNQHHRDVWQK 300
K K + +DV Q+
Sbjct: 53 KVVDAKLEKEFKDVLQR 69
>gnl|CDD|149515 pfam08492, SRP72, SRP72 RNA-binding domain. This region has been
identified as the binding site of the SRP72 protein to
SRP RNA.
Length = 57
Score = 26.3 bits (58), Expect = 4.1
Identities = 13/21 (61%), Positives = 15/21 (71%)
Query: 223 KKKKKRKKKKKKKKKKKRKKK 243
KKK ++KKKK KKKRK K
Sbjct: 10 KKKPGTEQKKKKSAKKKRKPK 30
Score = 25.9 bits (57), Expect = 6.2
Identities = 14/29 (48%), Positives = 19/29 (65%)
Query: 262 EEEKKKKKKKKKKKTTKKKKRKKKRKKKK 290
E +K KK KKK T +KKK+ K+K+K
Sbjct: 1 EAKKPKKIAKKKPGTEQKKKKSAKKKRKP 29
Score = 25.1 bits (55), Expect = 9.1
Identities = 14/25 (56%), Positives = 15/25 (60%)
Query: 265 KKKKKKKKKKKTTKKKKRKKKRKKK 289
KK KKK + KKK KKKRK K
Sbjct: 6 KKIAKKKPGTEQKKKKSAKKKRKPK 30
>gnl|CDD|193409 pfam12936, Kri1_C, KRI1-like family C-terminal. The yeast member
of this family (Kri1p) is found to be required for 40S
ribosome biogenesis in the nucleolus. This is the
C-terminal domain of the family.
Length = 93
Score = 27.2 bits (61), Expect = 4.2
Identities = 10/26 (38%), Positives = 16/26 (61%)
Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKK 285
++EE +K KKK KK ++ K+K
Sbjct: 68 DKEERRKDKKKYGKKARLREWKKKVF 93
Score = 26.8 bits (60), Expect = 5.0
Identities = 9/22 (40%), Positives = 16/22 (72%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKK 244
K+++++ KKK KK + R+ KK
Sbjct: 69 KEERRKDKKKYGKKARLREWKK 90
Score = 26.8 bits (60), Expect = 5.1
Identities = 10/26 (38%), Positives = 17/26 (65%)
Query: 261 EEEEKKKKKKKKKKKTTKKKKRKKKR 286
++EE++K KKK KK ++ +KK
Sbjct: 68 DKEERRKDKKKYGKKARLREWKKKVF 93
Score = 26.8 bits (60), Expect = 5.9
Identities = 11/27 (40%), Positives = 15/27 (55%)
Query: 258 RGEEEEEKKKKKKKKKKKTTKKKKRKK 284
R +EE K KKK KK + + KK+
Sbjct: 67 RDKEERRKDKKKYGKKARLREWKKKVF 93
Score = 26.4 bits (59), Expect = 7.0
Identities = 9/23 (39%), Positives = 15/23 (65%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKN 245
++++K KKK KK + + KKK
Sbjct: 70 EERRKDKKKYGKKARLREWKKKV 92
Score = 26.0 bits (58), Expect = 8.9
Identities = 9/25 (36%), Positives = 17/25 (68%)
Query: 266 KKKKKKKKKKTTKKKKRKKKRKKKK 290
K++++K KKK KK + ++ +KK
Sbjct: 69 KEERRKDKKKYGKKARLREWKKKVF 93
>gnl|CDD|220654 pfam10254, Pacs-1, PACS-1 cytosolic sorting protein. PACS-1 is a
cytosolic sorting protein that directs the localisation
of membrane proteins in the trans-Golgi network
(TGN)/endosomal system. PACS-1 connects the clathrin
adaptor AP-1 to acidic cluster sorting motifs contained
in the cytoplasmic domain of cargo proteins such as
furin, the cation-independent mannose-6-phosphate
receptor and in viral proteins such as human
immunodeficiency virus type 1 Nef.
Length = 413
Score = 28.6 bits (64), Expect = 4.3
Identities = 17/73 (23%), Positives = 30/73 (41%), Gaps = 12/73 (16%)
Query: 232 KKKKKKKKRKKKKNQ---HHRDVWINRTSRGEEEEEK---------KKKKKKKKKKTTKK 279
++K+ K+ KN R + ++R +E + K+K KK
Sbjct: 285 RRKEGSKRDASGKNTLKSTFRSLQVSRLPSSGQEAQMTNTMSMTVVTKEKNKKVPVMFLG 344
Query: 280 KKRKKKRKKKKNQ 292
KK K+K + K+Q
Sbjct: 345 KKPKEKEVESKSQ 357
>gnl|CDD|149048 pfam07768, PVL_ORF50, PVL ORF-50-like family. This is a family of
sequences found in both bacteria and bacteriophages.
This region is approximately 130 residues long and in
some cases is found as part of the PVL (Panton-Valentine
leukocidin) group of genes, which encode a member of the
leukocidin group of bacterial toxins that kill
leukocytes by creation of pores in the cell membrane.
PVL appears to be a virulence factor associated with a
number of human diseases.
Length = 118
Score = 27.5 bits (61), Expect = 4.4
Identities = 12/47 (25%), Positives = 28/47 (59%), Gaps = 3/47 (6%)
Query: 257 SRGEEEEEKKKKKKKKKKKTTKKKKRKKKRK---KKKNQHHRDVWQK 300
R +E E KK + ++++ ++ +RK++++ ++K H +V QK
Sbjct: 49 MRLKEYREIKKSENIEQERKERELERKRRKEAELRRKKPHLFNVPQK 95
>gnl|CDD|218611 pfam05499, DMAP1, DNA methyltransferase 1-associated protein 1
(DMAP1). DNA methylation can contribute to
transcriptional silencing through several
transcriptionally repressive complexes, which include
methyl-CpG binding domain proteins (MBDs) and histone
deacetylases (HDACs). The chief enzyme that maintains
mammalian DNA methylation, DNMT1, can also establish a
repressive transcription complex. The non-catalytic
amino terminus of DNMT1 binds to HDAC2 and DMAP1 (for
DNMT1 associated protein), and can mediate
transcriptional repression. DMAP1 has intrinsic
transcription repressive activity, and binds to the
transcriptional co-repressor TSG101. DMAP1 is targeted
to replication foci through interaction with the far N
terminus of DNMT1 throughout S phase, whereas HDAC2
joins DNMT1 and DMAP1 only during late S phase,
providing a platform for how histones may become
deacetylated in heterochromatin following replication.
Length = 175
Score = 27.9 bits (62), Expect = 4.4
Identities = 15/67 (22%), Positives = 29/67 (43%), Gaps = 10/67 (14%)
Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
++K + RKK+++KK + +K I + + +K++KK KK
Sbjct: 9 LRKIEARKKEREKKAQDLQK----------LITAADQQAAGFSTAEARKREKKLPKKKIP 58
Query: 282 RKKKRKK 288
+K K
Sbjct: 59 QKPGPSK 65
Score = 27.5 bits (61), Expect = 6.6
Identities = 17/65 (26%), Positives = 32/65 (49%), Gaps = 5/65 (7%)
Query: 224 KKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRK 283
+K + +KK+++KK + +K + T+ E +K++KK KKK +K
Sbjct: 10 RKIEARKKEREKKAQDLQKLITAADQQAAGFSTA-----EARKREKKLPKKKIPQKPGPS 64
Query: 284 KKRKK 288
K+ K
Sbjct: 65 KEDSK 69
>gnl|CDD|222911 PHA02616, PHA02616, VP2/VP3; Provisional.
Length = 259
Score = 28.5 bits (63), Expect = 4.4
Identities = 16/73 (21%), Positives = 30/73 (41%), Gaps = 10/73 (13%)
Query: 217 IPGFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKT 276
IP +++ ++ K K + K+K+++ H +K KK K
Sbjct: 184 IPDWILYVLEELNKDIYKIPTQAVKRKQDELH----------PVSPTKKAALSKKSKWTG 233
Query: 277 TKKKKRKKKRKKK 289
TK + KKR+ +
Sbjct: 234 TKSSQSSKKRRGR 246
>gnl|CDD|215059 PLN00111, PLN00111, accumulation of photosystem one; Provisional.
Length = 399
Score = 28.4 bits (64), Expect = 4.5
Identities = 8/27 (29%), Positives = 15/27 (55%)
Query: 266 KKKKKKKKKKTTKKKKRKKKRKKKKNQ 292
K++KK + K +R+ + KKK +
Sbjct: 48 SKREKKPYPRPMKLLRREAREKKKLRK 74
>gnl|CDD|233873 TIGR02454, CbiQ_TIGR, cobalt ABC transporter, permease protein
CbiQ. This model represents the permease component of
the cobalt-specific ABC transporter. This model finds
permeases which are generally next to the other subunits
of the complex (CbiN and CbiO) or the cobalamin
biosynthesis protein CbiM which is a transmembrane
protein which likely interacts with the complex in some
manner. In genomes which possess all of these subunits
the ATPase is most likely running in the direction of
import (for the biosynthesis of coenzyme B12). In other
genomes, this subunit may be involved in the export of
cobalt and/or other closely related heavy metals
[Transport and binding proteins, Cations and iron
carrying compounds].
Length = 198
Score = 28.0 bits (63), Expect = 4.5
Identities = 11/74 (14%), Positives = 27/74 (36%), Gaps = 7/74 (9%)
Query: 46 MLMCFIIITAILGN-----LLVIISVIKHRKLRI-ITNYFV-VSLAFADLLVALCVMPFN 98
+ +++ +L N +L I +++ I + +++ LLV L ++ F+
Sbjct: 9 SALALLVLAVLLPNPIALLILAIFALLLVGLAGIPLRILLKRLAVPIGFLLVGLLLLLFS 68
Query: 99 AIVSLTDEWYFGYF 112
G
Sbjct: 69 NGTPGEPLLVLGPL 82
>gnl|CDD|218747 pfam05786, Cnd2, Condensin complex subunit 2. This family consists
of several Barren protein homologues from several
eukaryotic organisms. In Drosophila Barren (barr) is
required for sister-chromatid segregation in mitosis.
barr encodes a novel protein that is present in
proliferating cells and has homologues in yeast and
human. Mitotic defects in barr embryos become apparent
during cycle 16, resulting in a loss of PNS and CNS
neurons. Centromeres move apart at the
metaphase-anaphase transition and Cyclin B is degraded,
but sister chromatids remain connected, resulting in
chromatin bridging. Barren protein localises to
chromatin throughout mitosis. Colocalisation and
biochemical experiments indicate that Barren associates
with Topoisomerase II throughout mitosis and alters the
activity of Topoisomerase II. It has been suggested that
this association is required for proper chromosomal
segregation by facilitating the decatenation of
chromatids at anaphase. This family forms one of the
three non-structural maintenance of chromosomes (SMC)
subunits of the mitotic condensation complex along with
Cnd1 and Cnd3.
Length = 719
Score = 28.6 bits (64), Expect = 4.5
Identities = 15/65 (23%), Positives = 25/65 (38%), Gaps = 6/65 (9%)
Query: 226 KKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKK 285
K +K+KKR K+ I+ + EE + + + K T K K +KK
Sbjct: 435 AAADTKSAAEKQKKRAKE------PFEIDFGAPLEEIDFEVIFQPPKANSTLSKSKLQKK 488
Query: 286 RKKKK 290
+
Sbjct: 489 WDTRT 493
>gnl|CDD|71622 pfam08188, Protamine_3, Spermatozal protamine family. This family
consists of the spermatozal protamines. Spermatozal
protamines play an important role in remodelling of the
sperm chromatin during mammalian spermiogenesis. Nuclear
elongation and chromatin condensation are concomitant
with modifications in the basic protein complement
associated with DNA. Somatic histones are initially
replaced by testis -specific histone variants, then by
transitional proteins, and ultimately by protamines.
Length = 48
Score = 26.0 bits (56), Expect = 4.6
Identities = 10/38 (26%), Positives = 23/38 (60%)
Query: 265 KKKKKKKKKKKTTKKKKRKKKRKKKKNQHHRDVWQKKY 302
+++ KKK+K+ +++K +K ++K+KN R +
Sbjct: 2 RRRHSMKKKRKSVRRRKTRKNQRKRKNSLGRSFKAHGF 39
>gnl|CDD|235222 PRK04125, PRK04125, murein hydrolase regulator LrgA; Provisional.
Length = 141
Score = 27.7 bits (62), Expect = 4.6
Identities = 9/27 (33%), Positives = 18/27 (66%)
Query: 220 FVMKKKKKRKKKKKKKKKKKRKKKKNQ 246
F++ K +K K+ KK+KK K + ++ +
Sbjct: 115 FILGKTEKEKEDKKEKKVKIKGRQHGK 141
>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family. Emg1 and Nop14 are novel
proteins whose interaction is required for the
maturation of the 18S rRNA and for 40S ribosome
production.
Length = 809
Score = 28.8 bits (65), Expect = 4.8
Identities = 12/23 (52%), Positives = 18/23 (78%)
Query: 259 GEEEEEKKKKKKKKKKKTTKKKK 281
EEEEE+K+KKKKK ++T+ +
Sbjct: 369 EEEEEEEKEKKKKKSAESTRSEL 391
>gnl|CDD|233042 TIGR00598, rad14, DNA repair protein. All proteins in this family
for which functions are known are used for the
recognition of DNA damage as part of nucleotide excision
repair. This family is based on the phylogenomic
analysis of JA Eisen (1999, Ph.D. Thesis, Stanford
University) [DNA metabolism, DNA replication,
recombination, and repair].
Length = 172
Score = 27.8 bits (62), Expect = 4.9
Identities = 13/41 (31%), Positives = 23/41 (56%), Gaps = 4/41 (9%)
Query: 258 RGEEEEEKKKKKKKKKKKTTKKKKR----KKKRKKKKNQHH 294
R E +EE K+KK +KK K ++ R K++ + ++H
Sbjct: 103 REESKEEMKEKKFEKKLKELRRAVRSSEYTNKKEGRVHEHE 143
>gnl|CDD|202833 pfam03962, Mnd1, Mnd1 family. This family of proteins includes
MND1 from S. cerevisiae. The mnd1 protein forms a
complex with hop2 to promote homologous chromosome
pairing and meiotic double-strand break repair.
Length = 188
Score = 28.0 bits (63), Expect = 5.0
Identities = 18/71 (25%), Positives = 37/71 (52%), Gaps = 2/71 (2%)
Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
+KK+ + K++ + + + +K K R+ RT EE ++ +K+ KK K + K +K
Sbjct: 74 LKKELEELKQRIAELQAQIEKLKKG--REETEERTELLEELKQLEKELKKLKAELEKYEK 131
Query: 282 RKKKRKKKKNQ 292
+R +K +
Sbjct: 132 NDPERIEKLKE 142
>gnl|CDD|219589 pfam07808, RED_N, RED-like protein N-terminal region. This family
contains sequences that are similar to the N-terminal
region of Red protein. This and related proteins contain
a RED repeat which consists of a number of RE and RD
sequence elements. The region in question has several
conserved NLS sequences and a putative trimeric
coiled-coil region, suggesting that these proteins are
expressed in the nucleus. The function of Red protein is
unknown, but efficient sequestration to nuclear bodies
suggests that its expression may be tightly regulated of
that the protein self-aggregates extremely efficiently.
Length = 238
Score = 28.3 bits (63), Expect = 5.0
Identities = 9/26 (34%), Positives = 17/26 (65%)
Query: 225 KKKRKKKKKKKKKKKRKKKKNQHHRD 250
KKK+K +K+++ +K+ N +RD
Sbjct: 1 KKKKKYAYLRKQEENAEKEINPKYRD 26
>gnl|CDD|153128 cd02437, CCC1_like_1, CCC1-related protein family. CCC1_like_1:
This is a protein family closely related to CCC1, a
family of proteins involved in iron and manganese
transport. Yeast CCC1 is a vacuole transmembrane protein
responsible for the iron and manganese accumulation in
vacuole. .
Length = 175
Score = 27.8 bits (62), Expect = 5.2
Identities = 10/62 (16%), Positives = 24/62 (38%), Gaps = 1/62 (1%)
Query: 36 IHIFPFIIKGMLMCFIIITAILGNLLVIISVIKHRKLRIITNYFVVSLAFADLLV-ALCV 94
+ + PF+ + + I AI+ +L I+ ++ + +I V + +L
Sbjct: 110 LPLLPFLPFDLWLAAGIAVAIVLAILFILGLVIGKISKINVAISFVWAVGMVITGISLVF 169
Query: 95 MP 96
Sbjct: 170 SV 171
Score = 27.0 bits (60), Expect = 8.3
Identities = 6/33 (18%), Positives = 9/33 (27%)
Query: 204 VYYALISSSISFWIPGFVMKKKKKRKKKKKKKK 236
A+ +I G V+ K K
Sbjct: 124 AGIAVAIVLAILFILGLVIGKISKINVAISFVW 156
>gnl|CDD|236172 PRK08173, PRK08173, DNA topoisomerase III; Validated.
Length = 862
Score = 28.5 bits (64), Expect = 5.2
Identities = 16/76 (21%), Positives = 29/76 (38%), Gaps = 7/76 (9%)
Query: 217 IPGFVMKKKKK-------RKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKK 269
+PGF + + ++ K + + ++ K + T+ + EK K
Sbjct: 785 LPGFKSSRTGRNFKAYLVKQPDGKIGFEFEPREPKAAAAKKTAAKATAAAATKAEKAAAK 844
Query: 270 KKKKKKTTKKKKRKKK 285
K KKT KK +K
Sbjct: 845 KAPAKKTAAKKTAARK 860
>gnl|CDD|219137 pfam06687, SUR7, SUR7/PalI family. This family consists of several
fungal-specific SUR7 proteins. Its activity regulates
expression of RVS161, a homologue of human endophilin,
suggesting a function for both in endocytosis. The
protein carries four transmembrane domains and is thus
likely to act as an anchoring protein for the eisosome
to the plasma membrane. Eisosomes are the immobile
protein complexes, that include the proteins Pil1 and
Lsp1, which co-localise with sites of protein and lipid
endocytosis at the plasma membrane. SUR7 protein may
play a role in sporulation. This family also includes
PalI which is part of a pH signal transduction cascade.
Based on the similarity of PalI to the yeast Rim9
meiotic signal transduction component it has been
suggested that PalI might be a membrane sensor for
ambient pH.
Length = 205
Score = 28.1 bits (63), Expect = 5.3
Identities = 13/90 (14%), Positives = 33/90 (36%), Gaps = 8/90 (8%)
Query: 13 YNINYTS---TNFSNTSLFPTNSTIHIHIFPFIIKGMLMCFIIITAILGNLLVIISVI-- 67
Y + T P++ +++ + ++ + M I + +I++ I
Sbjct: 76 YPFDPVDNFGTESGENLNLPSSFRDNLNTYYYLSRFMF-IVHPIALFFTVIALILAGILA 134
Query: 68 --KHRKLRIITNYFVVSLAFADLLVALCVM 95
+ + N+ + LAF L+A +
Sbjct: 135 HFSSPRRGSLVNFLLSLLAFLFTLLAAALD 164
>gnl|CDD|218550 pfam05311, Baculo_PP31, Baculovirus 33KDa late protein (PP31).
Autographa californica nuclear polyhedrosis virus
(AcMNPV) pp31 is a nuclear phosphoprotein that
accumulates in the virogenic stroma, which is the viral
replication centre in the infected-cell nucleus, binds
to DNA, and serves as a late expression factor.
Length = 267
Score = 28.1 bits (63), Expect = 5.3
Identities = 9/38 (23%), Positives = 15/38 (39%)
Query: 256 TSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQH 293
S + +K+K K KK K++K+K
Sbjct: 212 ASVVPTPQSSSPSRKRKAPKKVAKKVAAAKKRKQKRSP 249
>gnl|CDD|237748 PRK14537, PRK14537, 50S ribosomal protein L20/unknown domain fusion
protein; Provisional.
Length = 230
Score = 28.0 bits (62), Expect = 5.3
Identities = 17/93 (18%), Positives = 35/93 (37%), Gaps = 8/93 (8%)
Query: 193 HHPNVCIFVVNVYYALISSSISFWIPGFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVW 252
P V Y L +S + + K KK + ++K + ++K +
Sbjct: 101 LQPEV----FVQYVELAKKFLSQFQSTTITKPKKVLINEVLQEKTINQNEEKTSLQNE-- 154
Query: 253 INRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKK 285
+ E EEK + + + + T+ K++K
Sbjct: 155 --KVLSPELSEEKSDSELETQPQKTQLKEKKPS 185
>gnl|CDD|220684 pfam10310, DUF2413, Protein of unknown function (DUF2413). This is
a family of proteins conserved in fungi. The function is
not known.
Length = 436
Score = 28.2 bits (63), Expect = 5.4
Identities = 15/64 (23%), Positives = 23/64 (35%), Gaps = 7/64 (10%)
Query: 229 KKKKKKKKKKKRKKKKNQHHRDVWINRTSRGE-EEEEKKKKKKKKKKKTTKKKKRKKKRK 287
K KK KK K + D+ +E E+ +K K KK + +
Sbjct: 12 KAPTKKPKKGDASKDSTEDDEDIL------EFLDELEQSEKAKPPKKPKEASRPGTPRNP 65
Query: 288 KKKN 291
KK +
Sbjct: 66 KKSS 69
>gnl|CDD|215329 PLN02610, PLN02610, probable methionyl-tRNA synthetase.
Length = 801
Score = 28.6 bits (64), Expect = 5.4
Identities = 15/45 (33%), Positives = 21/45 (46%), Gaps = 3/45 (6%)
Query: 255 RTSRGEEEEEKK---KKKKKKKKKTTKKKKRKKKRKKKKNQHHRD 296
R +R E E KK + KKK KKK+ KK K++ +
Sbjct: 591 RAARAEAAEAKKLAKQLKKKALSDGGKKKQGKKAGGGGKSKAAAE 635
>gnl|CDD|220505 pfam09988, DUF2227, Uncharacterized metal-binding protein
(DUF2227). Members of this family of hypothetical
bacterial proteins possess metal binding properties;
however, their exact function has not, as yet, been
determined.
Length = 169
Score = 27.6 bits (62), Expect = 5.4
Identities = 17/79 (21%), Positives = 35/79 (44%), Gaps = 8/79 (10%)
Query: 158 VLLMLSNVWILPGIISFTPIMLGWYTTPDHKQ-------YRKHHPNVCIFVVNVYYALIS 210
+L +L V +L ++++ +LG P +Q R+H + F++ +
Sbjct: 92 LLYLLLWVLLLALLLAWILGLLGGLNWPTPEQAPWLQQLLRQHPAELLAFLIGLELGSWL 151
Query: 211 SSISFWIPGFVMKKKKKRK 229
I+ +P K+ +KRK
Sbjct: 152 HLIADGVPSPF-KRSRKRK 169
>gnl|CDD|148288 pfam06589, CRA, Circumsporozoite-related antigen (CRA). This
family consists of several circumsporozoite-related
antigen (CRA) or exported protein-1 (EXP1) sequences
found specifically in Plasmodium species. The function
of this family is unknown.
Length = 157
Score = 27.6 bits (61), Expect = 5.4
Identities = 20/56 (35%), Positives = 22/56 (39%), Gaps = 2/56 (3%)
Query: 224 KKKKRKKKKKKKKKKKRKKKKNQHHRDV--WINRTSRGEEEEEKKKKKKKKKKKTT 277
K K K KKK KK + DV I+ R EEE K K K K T
Sbjct: 22 KNKNGKYGSGNVIKKKNKKGSGEPLIDVHDLISDMVRKEEELVKLTKNKSSYKLAT 77
>gnl|CDD|223596 COG0522, RpsD, Ribosomal protein S4 and related proteins
[Translation, ribosomal structure and biogenesis].
Length = 205
Score = 28.0 bits (63), Expect = 5.5
Identities = 15/53 (28%), Positives = 26/53 (49%), Gaps = 6/53 (11%)
Query: 223 KKKKKRK------KKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKK 269
KKK R+ K++ K++K K QH ++ W + G + EK+K +
Sbjct: 8 KKKLSRRLGFNPWLKERLCKERKLPYKPGQHGQERWKKLSDYGLQLREKQKLR 60
>gnl|CDD|221643 pfam12572, DUF3752, Protein of unknown function (DUF3752). This
domain family is found in eukaryotes, and is typically
between 140 and 163 amino acids in length.
Length = 148
Score = 27.4 bits (61), Expect = 5.6
Identities = 18/74 (24%), Positives = 35/74 (47%), Gaps = 8/74 (10%)
Query: 223 KKKKKRKKKKKKKKKKKRKKK-KNQHHRDVWINRTSRGEEEEEKKKKKK---KKKKKTTK 278
K+ K+ +K K+ +KKR K QH + + + +EEE ++ ++K
Sbjct: 69 KRDKEMARKVKEYNEKKRGKSLVEQHQKK----QKKKKKEEENDDPSRRPFDREKDLAVG 124
Query: 279 KKKRKKKRKKKKNQ 292
K K +R++ N+
Sbjct: 125 GKISKAQRRELINK 138
>gnl|CDD|163538 TIGR03826, YvyF, flagellar operon protein TIGR03826. This gene is
found in flagellar operons of Bacillus-related
organisms. Its function has not been determined and an
official gene symbol has not been assigned, although the
gene is designated yvyF in B. subtilus. A tentative
assignment as a regulator is suggested in the NCBI
record GI:16080597.
Length = 137
Score = 27.3 bits (61), Expect = 5.6
Identities = 13/29 (44%), Positives = 19/29 (65%)
Query: 257 SRGEEEEEKKKKKKKKKKKTTKKKKRKKK 285
S GE+E+ KKK+K+ +KK T + K K
Sbjct: 109 SAGEQEKTKKKEKETEKKDTYYIQDTKNK 137
>gnl|CDD|218215 pfam04696, Pinin_SDK_memA, pinin/SDK/memA/ protein conserved
region. Members of this family have very varied
localisations within the eukaryotic cell. pinin is known
to localise at the desmosomes and is implicated in
anchoring intermediate filaments to the desmosomal
plaque. SDK2/3 is a dynamically localised nuclear
protein thought to be involved in modulation of
alternative pre-mRNA splicing. memA is a tumour marker
preferentially expressed in human melanoma cell lines. A
common feature of the members of this family is that
they may all participate in regulating protein-protein
interactions.
Length = 131
Score = 27.4 bits (61), Expect = 5.6
Identities = 11/64 (17%), Positives = 36/64 (56%), Gaps = 7/64 (10%)
Query: 229 KKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKK 288
K +++ + ++K++ + I + +E++E+++ +K+K++ +++RK+ +
Sbjct: 22 KFSQEESRLTSKEKRRAE------IEQKLEEQEKQEREELRKEKRELFE-ERRRKQLELR 74
Query: 289 KKNQ 292
K Q
Sbjct: 75 KLEQ 78
>gnl|CDD|218078 pfam04419, 4F5, 4F5 protein family. Members of this family are
short proteins that are rich in aspartate, glutamate,
lysine and arginine. Although the function of these
proteins is unknown, they are found to be ubiquitously
expressed.
Length = 38
Score = 25.2 bits (56), Expect = 5.7
Identities = 9/24 (37%), Positives = 11/24 (45%)
Query: 222 MKKKKKRKKKKKKKKKKKRKKKKN 245
KK K+KKK K + K K
Sbjct: 15 AKKAAKKKKKGAKSQLKAAAKALE 38
>gnl|CDD|217803 pfam03938, OmpH, Outer membrane protein (OmpH-like). This family
includes outer membrane proteins such as OmpH among
others. Skp (OmpH) has been characterized as a molecular
chaperone that interacts with unfolded proteins as they
emerge in the periplasm from the Sec translocation
machinery.
Length = 157
Score = 27.6 bits (62), Expect = 5.8
Identities = 25/105 (23%), Positives = 47/105 (44%), Gaps = 26/105 (24%)
Query: 199 IFVVNVYYALISSSISFWIPGFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSR 258
I VV+V L S K +K+ +K+ KK + + +KK
Sbjct: 18 IGVVDVQKVLSESP--------AGKAAQKQLEKEFKKLQAELQKK--------------- 54
Query: 259 GEEEEEKKKKKKKKKKKTTKKKKRKKKRKK--KKNQHHRDVWQKK 301
E+E +K+++K +K+ T ++ RK K+++ +K Q + Q
Sbjct: 55 -EKELQKEEQKLQKQAATLSEEARKAKQQELQQKQQELQQKQQAA 98
>gnl|CDD|233048 TIGR00605, rad4, DNA repair protein rad4. All proteins in this
family for which functions are known are involved in
targeting nucleotide excision repair to specific regions
of the genome.This family is based on the phylogenomic
analysis of JA Eisen (1999, Ph.D. Thesis, Stanford
University) [DNA metabolism, DNA replication,
recombination, and repair].
Length = 713
Score = 28.3 bits (63), Expect = 5.8
Identities = 9/33 (27%), Positives = 19/33 (57%)
Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQ 292
E E++ K +++K +++ +R+KKR K
Sbjct: 21 EAEKQPKSRRRKVRRENEPSLRRRKKRFKTGLN 53
Score = 28.3 bits (63), Expect = 6.5
Identities = 7/34 (20%), Positives = 21/34 (61%)
Query: 260 EEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQH 293
+E E++ K +++K ++ + R++K++ K +
Sbjct: 20 KEAEKQPKSRRRKVRRENEPSLRRRKKRFKTGLN 53
>gnl|CDD|241262 cd01228, PH_BCR-related, Breakpoint Cluster Region-related
pleckstrin homology (PH) domain. The BCR gene is one of
the two genes in the BCR-ABL complex, which is
associated with the Philadelphia chromosome, a product
of a reciprocal translocation between chromosomes 22 and
9. BCR is a GTPase-activating protein (GAP) for RAC1
(primarily) and CDC42. The Dbl region of BCR has the
most RhoGEF activity for Cdc42, and less activity
towards Rac and Rho. Since BCR possesses both GAP and
GEF activities, it may function to temporally regulate
the activity of these GTPases. It also displays
serine/threonine kinase activity. The BCR protein
contains multiple domains including an N-terminal kinase
domain, a RhoGEF domain, a PH domain, a C1 domain, a C2
domain, and a C-terminal RhoGAP domain. ABR, a related
smaller protein, is structurally similar to BCR, but
lacks the N-terminal kinase domain and has GAP activity
for both Rac and Cdc42. PH domains have diverse
functions, but in general are involved in targeting
proteins to the appropriate cellular location or in the
interaction with a binding partner. They share little
sequence conservation, but all have a common fold, which
is electrostatically polarized. Less than 10% of PH
domains bind phosphoinositide phosphates (PIPs) with
high affinity and specificity. PH domains are
distinguished from other PIP-binding domains by their
specific high-affinity binding to PIPs with two vicinal
phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or
PtdIns(3,4,5)P3 which results in targeting some PH
domain proteins to the plasma membrane. A few display
strong specificity in lipid binding. Any specificity is
usually determined by loop regions or insertions in the
N-terminus of the domain, which are not conserved across
all PH domains. PH domains are found in cellular
signaling proteins such as serine/threonine kinase,
tyrosine kinases, regulators of G-proteins, endocytotic
GTPases, adaptors, as well as cytoskeletal associated
molecules and in lipid associated enzymes.
Length = 179
Score = 27.7 bits (62), Expect = 5.9
Identities = 15/53 (28%), Positives = 23/53 (43%), Gaps = 3/53 (5%)
Query: 217 IPGFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKK 269
P V K ++ + KK K+K K++ R + SR EK +KK
Sbjct: 80 APDVVPKSSEELVQMKKAISALKQKIKRDSAERKKAASSGSRA---IEKLRKK 129
>gnl|CDD|218437 pfam05107, DUF694, Family of unknown function (DUF694). Family of
hypothetical bacterial proteins.
Length = 278
Score = 28.0 bits (63), Expect = 6.0
Identities = 18/43 (41%), Positives = 28/43 (65%), Gaps = 1/43 (2%)
Query: 239 KRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
+ K+K N H +++ R R EE ++KKKKKKKKKK + ++
Sbjct: 63 QEKEKLNDEHGELY-ERADRAEELLKEKKKKKKKKKKKEELRE 104
Score = 27.6 bits (62), Expect = 7.5
Identities = 11/43 (25%), Positives = 27/43 (62%)
Query: 250 DVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQ 292
++++ + +E + ++ + ++ K+KK+KKK+KKKK +
Sbjct: 59 EIFVQEKEKLNDEHGELYERADRAEELLKEKKKKKKKKKKKEE 101
>gnl|CDD|215278 PLN02502, PLN02502, lysyl-tRNA synthetase.
Length = 553
Score = 28.0 bits (63), Expect = 6.2
Identities = 8/32 (25%), Positives = 14/32 (43%)
Query: 255 RTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKR 286
R + EEEK K++ K K + ++
Sbjct: 16 RLKAKQAEEEKAAKEEAKAAAAAAAAKGRSRK 47
>gnl|CDD|198139 smart01071, CDC37_N, Cdc37 N terminal kinase binding. Cdc37 is a
molecular chaperone required for the activity of
numerous eukaryotic protein kinases. This domain
corresponds to the N terminal domain which binds
predominantly to protein kinases.and is found N terminal
to the Hsp (Heat shocked protein) 90-binding domain.
Expression of a construct consisting of only the
N-terminal domain of Saccharomyces pombe Cdc37 results
in cellular viability. This indicates that interactions
with the cochaperone Hsp90 may not be essential for
Cdc37 function.
Length = 154
Score = 27.4 bits (61), Expect = 6.4
Identities = 8/50 (16%), Positives = 25/50 (50%)
Query: 224 KKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKK 273
+ + +K+ ++ + ++ + HRD ++ ++ +K++KKK
Sbjct: 92 QDQLKKELEEANGDSEGLLEELKKHRDKLKKEQKELRKKLDELEKEEKKK 141
>gnl|CDD|227474 COG5145, RAD14, DNA excision repair protein [DNA replication,
recombination, and repair].
Length = 292
Score = 28.0 bits (62), Expect = 6.4
Identities = 15/49 (30%), Positives = 30/49 (61%), Gaps = 6/49 (12%)
Query: 259 GEEEEEKKKKKKKKKKKTTKKKKRKKK----RKKKKNQHH--RDVWQKK 301
EE +++K++++K K K+KK +KK R+K + ++ DV +K+
Sbjct: 210 SLEELDREKQRREKMKDDRKEKKLEKKIKELRRKTRTSNYSRMDVREKE 258
Score = 27.7 bits (61), Expect = 7.5
Identities = 19/69 (27%), Positives = 32/69 (46%), Gaps = 19/69 (27%)
Query: 228 RKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRK 287
R+K++++K K RK+KK +KK K+ ++K T R R+
Sbjct: 216 REKQRREKMKDDRKEKK-------------------LEKKIKELRRKTRTSNYSRMDVRE 256
Query: 288 KKKNQHHRD 296
K+K+ H D
Sbjct: 257 KEKHVHVFD 265
>gnl|CDD|235501 PRK05559, PRK05559, DNA topoisomerase IV subunit B; Reviewed.
Length = 631
Score = 28.1 bits (64), Expect = 6.5
Identities = 7/24 (29%), Positives = 12/24 (50%)
Query: 220 FVMKKKKKRKKKKKKKKKKKRKKK 243
+K + R + KK K+KK+
Sbjct: 375 KAIKAAQARLRAAKKVKRKKKTSG 398
>gnl|CDD|215525 PLN02972, PLN02972, Histidyl-tRNA synthetase.
Length = 763
Score = 28.3 bits (63), Expect = 6.5
Identities = 12/26 (46%), Positives = 13/26 (50%), Gaps = 4/26 (15%)
Query: 258 RGEEEEEK----KKKKKKKKKKTTKK 279
+ E E E KK KK KKKK K
Sbjct: 237 QAEAENEAGKSDKKDKKSKKKKVLGK 262
>gnl|CDD|225743 COG3202, COG3202, ATP/ADP translocase [Energy production and
conversion].
Length = 509
Score = 28.0 bits (63), Expect = 6.5
Identities = 37/200 (18%), Positives = 67/200 (33%), Gaps = 34/200 (17%)
Query: 50 FIIITAILGNLLVIISVIKHRKLRIITNYFVVSLAFADLLVALCVMPFNAIVSLTDEWYF 109
F II + + + + + F+ L A M + + EW +
Sbjct: 95 FYIILGFFLGFFALFAFV----IYPYKDILHPDPEFSRDLFADLPMFLKWFILIVGEWSY 150
Query: 110 G--YFMCDVWNSFDV---YFSTASILHLCCIS-VDRYYAIVKPLQYPIIMNQNTVLLMLS 163
Y M ++W S + ++ A+ + I R+Y + N LL+
Sbjct: 151 SLFYIMAELWGSLVLSLLFWQFAN--EITTIEEAKRFYP-------LFGLGANISLLLSG 201
Query: 164 NVWILPGIISFTPIMLGWYTTPDHKQYRKHHPNVC--IFVVNVYYALISSSISFWIPGFV 221
V W + + + +F+V V +I+ + +I V
Sbjct: 202 EV-------------TSWLSKHRDALASEFKDLLLQILFIVIVILGIIAILLYRYINRNV 248
Query: 222 MKKKKKRKKKKKKKKKKKRK 241
+ + KKKKKKKK K
Sbjct: 249 LTDPLFYLRAKKKKKKKKLK 268
>gnl|CDD|218332 pfam04929, Herpes_DNAp_acc, Herpes DNA replication accessory
factor. Replicative DNA polymerases are capable of
polymerising tens of thousands of nucleotides without
dissociating from their DNA templates. The high
processivity of these polymerases is dependent upon
accessory proteins that bind to the catalytic subunit of
the polymerase or to the substrate. The Epstein-Barr
virus (EBV) BMRF1 protein is an essential component of
the viral DNA polymerase and is absolutely required for
lytic virus replication. BMRF1 is also a transactivator.
This family is predicted to have a UL42 like structure.
Length = 381
Score = 28.1 bits (63), Expect = 6.6
Identities = 10/19 (52%), Positives = 14/19 (73%)
Query: 224 KKKKRKKKKKKKKKKKRKK 242
K+K+ KK+KKKK K+ K
Sbjct: 356 KRKREGDKKQKKKKSKKLK 374
Score = 27.7 bits (62), Expect = 8.1
Identities = 10/20 (50%), Positives = 15/20 (75%)
Query: 226 KKRKKKKKKKKKKKRKKKKN 245
K++++ KK+KKKK KK K
Sbjct: 356 KRKREGDKKQKKKKSKKLKL 375
>gnl|CDD|226814 COG4379, COG4379, Mu-like prophage tail protein gpP [General
function prediction only].
Length = 386
Score = 27.9 bits (62), Expect = 6.6
Identities = 9/42 (21%), Positives = 17/42 (40%), Gaps = 1/42 (2%)
Query: 248 HRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKK 289
+ + E +K+K +K + K KK KK + +
Sbjct: 341 EDGAFDVDAYPKKAEA-ARKRKGDRKGVSHKAKKGGKKNQAE 381
>gnl|CDD|197664 smart00338, BRLZ, basic region leucin zipper.
Length = 65
Score = 26.0 bits (58), Expect = 6.7
Identities = 6/33 (18%), Positives = 24/33 (72%)
Query: 261 EEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQH 293
EE+EK+++++++ ++ ++ + +KK + ++ +
Sbjct: 1 EEDEKRRRRRERNREAARRSRERKKAEIEELER 33
Score = 25.6 bits (57), Expect = 7.9
Identities = 5/34 (14%), Positives = 23/34 (67%)
Query: 262 EEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQHHR 295
EE++K+++++++ + ++ R++K+ + + +
Sbjct: 1 EEDEKRRRRRERNREAARRSRERKKAEIEELERK 34
>gnl|CDD|147574 pfam05462, Dicty_CAR, Slime mold cyclic AMP receptor. This family
consists of cyclic AMP receptor (CAR) proteins from
slime molds. CAR proteins are responsible for
controlling development in Dictyostelium discoideum.
Length = 305
Score = 27.8 bits (62), Expect = 6.8
Identities = 27/98 (27%), Positives = 46/98 (46%), Gaps = 14/98 (14%)
Query: 53 ITAILGNLLVIISVIKHRKLR-----IITNYFVVSLAFADLLVALCVMPFNAIVSLTDEW 107
T+I+G LV+I + + LR +I+ + + SL DL+ + I++LT+
Sbjct: 19 FTSIIGCFLVLIGFWRLKLLRNHITKVISCFCLTSL-LKDLI--------STILTLTNSA 69
Query: 108 YFGYFMCDVWNSFDVYFSTASILHLCCISVDRYYAIVK 145
+ F C ++ Y S A L C++ Y IVK
Sbjct: 70 QYNGFPCYLYAIVITYGSLACWLWTLCLAFSIYNLIVK 107
>gnl|CDD|226889 COG4487, COG4487, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 438
Score = 27.9 bits (62), Expect = 6.8
Identities = 13/74 (17%), Positives = 31/74 (41%), Gaps = 4/74 (5%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKT----TK 278
K++K ++ K+ + + + + +N E E +K+ + K+
Sbjct: 78 KEQKNLFNEQIKQFELALQDEIAKLEALELLNLEKDKELELLEKELDELSKELQKQLQNT 137
Query: 279 KKKRKKKRKKKKNQ 292
+ +KKR+ KN+
Sbjct: 138 AEIIEKKRENNKNE 151
>gnl|CDD|215590 PLN03123, PLN03123, poly [ADP-ribose] polymerase; Provisional.
Length = 981
Score = 28.2 bits (63), Expect = 6.8
Identities = 15/77 (19%), Positives = 36/77 (46%), Gaps = 11/77 (14%)
Query: 225 KKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKK 284
K+++ +++K++ KK K+KK+ G+++ +K K + T +K+
Sbjct: 191 KEEKAEERKQESKKGAKRKKDAS-----------GDDKSKKAKTDRDVSTSTAASQKKSS 239
Query: 285 KRKKKKNQHHRDVWQKK 301
+ K +++W K
Sbjct: 240 DLESKLEAQSKELWSLK 256
>gnl|CDD|217785 pfam03901, Glyco_transf_22, Alg9-like mannosyltransferase family.
Members of this family are mannosyltransferase enzymes.
At least some members are localised in endoplasmic
reticulum and involved in GPI anchor biosynthesis.
Length = 412
Score = 28.1 bits (63), Expect = 7.0
Identities = 35/207 (16%), Positives = 70/207 (33%), Gaps = 13/207 (6%)
Query: 12 TYNINYTSTNFSNTSLFPTNSTIHIHIFPFIIKGMLMCFIIIT-AILGNLLVIISVIKHR 70
+ P S + ++ ++ L T A+L LV+ +++
Sbjct: 139 MILTSIALYYLLEYGSVPNTSVSYKYLKAVLLIAALAILGRPTSALLWLPLVLYHLLRLP 198
Query: 71 KLRIITNYFVVSLAFADLLVALCVMPFNAIVS---LTDEWYFGYFMCDVWNSFDVYFSTA 127
R+ F L ++L ++ ++ L D +++G F+ N F Y +
Sbjct: 199 GKRLKL--------FLFLAISLGLLVLLLVLGAVILIDSYFYGRFVFTPLN-FLKYNVLS 249
Query: 128 SILHLCCISVDRYYAIVKPLQYPIIMNQNTVLLMLSNVWILPGIISFTPIMLGWYTTPDH 187
+ L +Y +L L V L +++ I L Y+ H
Sbjct: 250 NGSSLYGTHPWYWYFSNGLPNILGPFLLAFILGPLVLVSRLLLLLAPILIWLFVYSLQPH 309
Query: 188 KQYRKHHPNVCIFVVNVYYALISSSIS 214
K+ R +P + ++ AL S
Sbjct: 310 KEERFLYPIYPLICLSAAIALDRLSRK 336
>gnl|CDD|222928 PHA02778, PHA02778, major capsid L1 protein; Provisional.
Length = 503
Score = 27.9 bits (63), Expect = 7.1
Identities = 5/25 (20%), Positives = 17/25 (68%)
Query: 265 KKKKKKKKKKKTTKKKKRKKKRKKK 289
+ +++K+K ++ + K++R+K+
Sbjct: 479 RTRRRKRKSTSSSSSRSTKRRRRKR 503
Score = 27.9 bits (63), Expect = 8.0
Identities = 5/25 (20%), Positives = 17/25 (68%)
Query: 266 KKKKKKKKKKTTKKKKRKKKRKKKK 290
+ +++K+K ++ + K+R++K+
Sbjct: 479 RTRRRKRKSTSSSSSRSTKRRRRKR 503
Score = 27.5 bits (62), Expect = 9.1
Identities = 7/25 (28%), Positives = 16/25 (64%)
Query: 267 KKKKKKKKKTTKKKKRKKKRKKKKN 291
+ +++K+K T+ R KR+++K
Sbjct: 479 RTRRRKRKSTSSSSSRSTKRRRRKR 503
>gnl|CDD|201707 pfam01280, Ribosomal_L19e, Ribosomal protein L19e.
Length = 148
Score = 27.1 bits (61), Expect = 7.2
Identities = 12/46 (26%), Positives = 23/46 (50%), Gaps = 5/46 (10%)
Query: 256 TSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKKNQHHRDVWQKK 301
SRG + K+K++K + + +K K R KK ++W ++
Sbjct: 57 HSRGRARKRKEKRRKGRHRGPGSRKGTKGARMPKK-----ELWIRR 97
Score = 26.7 bits (60), Expect = 8.8
Identities = 23/90 (25%), Positives = 38/90 (42%), Gaps = 22/90 (24%)
Query: 219 GFVMKKKKK-----RKKKKKKKKKK-------KRKKKKN--QHHRDVWIN--RTSRGEEE 262
G ++KK KK R +K+K+K++K RK K +++WI R R
Sbjct: 47 GLIIKKPKKGHSRGRARKRKEKRRKGRHRGPGSRKGTKGARMPKKELWIRRIRALR---- 102
Query: 263 EEKKKKKKKKKKKTTKKKKRKKKRKKKKNQ 292
+ ++ + K K R+ RK K
Sbjct: 103 --RLLRELRDAGKIDKHTYRELYRKAKGGV 130
>gnl|CDD|234402 TIGR03928, T7_EssCb_Firm, type VII secretion protein EssC,
C-terminal domain. This model describes the C-terminal
domain, or longer subunit, of the Firmicutes type VII
secretion protein EssC. This protein (homologous to EccC
in Actinobacteria) and the WXG100 target proteins are
the only homologous parts of type VII secretion between
Firmicutes and Actinobacteria [Protein fate, Protein and
peptide secretion and trafficking].
Length = 1296
Score = 28.0 bits (63), Expect = 7.2
Identities = 15/53 (28%), Positives = 25/53 (47%), Gaps = 12/53 (22%)
Query: 207 ALISSSISFWIPGFVMKKKKKRKKKKKKKKK-------KKRKK----KKNQHH 248
A+ +I F + ++KKK KK +K+ + KKRK+ + Q H
Sbjct: 69 AMSLVTIIFSTTTY-FREKKKYKKDVEKRNRSYRLYLDKKRKELQALSEKQRH 120
>gnl|CDD|233223 TIGR00990, 3a0801s09, mitochondrial precursor proteins import
receptor (72 kDa mitochondrial outermembrane protein)
(mitochondrial import receptor for the ADP/ATP carrier)
(translocase of outermembrane tom70). [Transport and
binding proteins, Amino acids, peptides and amines].
Length = 615
Score = 28.0 bits (62), Expect = 7.4
Identities = 22/99 (22%), Positives = 39/99 (39%), Gaps = 8/99 (8%)
Query: 204 VYYALISSSISFWIPGFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEE 263
VYY P K+++KRK+ +K+ + K +KK E +E
Sbjct: 53 VYYQRKGQQQRESKPKISKKERRKRKQAEKETEGKTEEKKSTAPKNAPVEPADELPEIDE 112
Query: 264 EKKKKKKKKKKKTTKKKKRKKKRKKKKNQHHRDVWQKKY 302
++ ++KK K K+K N+ +R K +
Sbjct: 113 SSVANLSEE-----ERKKYAAKLKEKGNKAYR---NKDF 143
>gnl|CDD|223390 COG0313, COG0313, Predicted methyltransferases [General function
prediction only].
Length = 275
Score = 27.6 bits (62), Expect = 7.4
Identities = 11/37 (29%), Positives = 20/37 (54%), Gaps = 3/37 (8%)
Query: 207 ALISSSI---SFWIPGFVMKKKKKRKKKKKKKKKKKR 240
AL +S + F GF+ +K K+R+K+ + + R
Sbjct: 121 ALSASGLPSQRFLFEGFLPRKSKERRKRLEALANEPR 157
>gnl|CDD|241400 cd13246, PH_Scd1, Shape and Conjugation Deficiency 1 Pleckstrin
homology (PH) domain. Fission yeast Scd1 is an exchange
factor for Cdc42 and an effector of Ras1, the homolog of
the human H-Ras. Scd2/Bem1 mediates Cdc42 activation by
binding to Scd1/Cdc24 and to Cdc42. Ras1 regulates
Scd1/Cdc24/Ral1, which is a putative guanine nucleotide
exchange factor for Cdc42, a member of the Rho family of
Ras-like proteins. Cdc42 then activates the Shk1/Orb2
protein kinase. Scd1 interacts with Klp5 and Klp6
kinesins to mediate cytokinesis. PH domains have diverse
functions, but in general are involved in targeting
proteins to the appropriate cellular location or in the
interaction with a binding partner. They share little
sequence conservation, but all have a common fold, which
is electrostatically polarized. Less than 10% of PH
domains bind phosphoinositide phosphates (PIPs) with
high affinity and specificity. PH domains are
distinguished from other PIP-binding domains by their
specific high-affinity binding to PIPs with two vicinal
phosphate groups: PtdIns(3,4)P2, PtdIns(4,5)P2 or
PtdIns(3,4,5)P3 which results in targeting some PH
domain proteins to the plasma membrane. A few display
strong specificity in lipid binding. Any specificity is
usually determined by loop regions or insertions in the
N-terminus of the domain, which are not conserved across
all PH domains. PH domains are found in cellular
signaling proteins such as serine/threonine kinase,
tyrosine kinases, regulators of G-proteins, endocytotic
GTPases, adaptors, as well as cytoskeletal associated
molecules and in lipid associated enzymes.
Length = 148
Score = 27.2 bits (61), Expect = 7.4
Identities = 12/25 (48%), Positives = 15/25 (60%)
Query: 263 EEKKKKKKKKKKKTTKKKKRKKKRK 287
E KKKKKK K + +KKK+K
Sbjct: 62 EVKKKKKKSSLKSKSSSSSKKKKKK 86
>gnl|CDD|119394 cd06261, TM_PBP2, Transmembrane subunit (TM) found in Periplasmic
Binding Protein (PBP)-dependent ATP-Binding Cassette
(ABC) transporters which generally bind type 2 PBPs.
These types of transporters consist of a PBP, two TMs,
and two cytoplasmic ABC ATPase subunits, and are mainly
involved in importing solutes from the environment. The
solute is captured by the PBP which delivers it to a
gated translocation pathway formed by the two TMs. The
two ABCs bind and hydrolyze ATP and drive the transport
reaction. For these transporters the ABCs and TMs are on
independent polypeptide chains. These systems transport
a diverse range of substrates. Most are specific for a
single substrate or a group of related substrates;
however some transporters are more promiscuous,
transporting structurally diverse substrates such as the
histidine/lysine and arginine transporter in
Enterobacteriaceae. In the latter case, this is achieved
through binding different PBPs with different
specificities to the TMs. For other promiscuous
transporters such as the multiple-sugar transporter Msm
of Streptococcus mutans, the PBP has a wide substrate
specificity. These transporters include the
maltose-maltodextrin, phosphate and sulfate
transporters, among others.
Length = 190
Score = 27.2 bits (61), Expect = 7.6
Identities = 13/74 (17%), Positives = 23/74 (31%), Gaps = 3/74 (4%)
Query: 33 TIHIHIFPFIIKGMLMCFIIITAILGNLLVIISVI---KHRKLRIITNYFVVSLAFADLL 89
I P + +L ++ A ++S + + L DL
Sbjct: 117 IFRRIILPLALPPILTGLVLAFARALGEFALVSFLGGGEAPGPGTGLLLIFAILFPGDLG 176
Query: 90 VALCVMPFNAIVSL 103
VA V ++SL
Sbjct: 177 VAAAVALILLLLSL 190
>gnl|CDD|220400 pfam09785, Prp31_C, Prp31 C terminal domain. This is the C
terminal domain of the pre-mRNA processing factor Prp31.
Prp31 is required for U4/U6.U5 tri-snRNP formation. In
humans this protein has been linked to autosomal
dominant retinitis pigmentosa.
Length = 124
Score = 26.9 bits (60), Expect = 7.6
Identities = 21/90 (23%), Positives = 31/90 (34%), Gaps = 30/90 (33%)
Query: 223 KKKKKRKKKKKKKKKKK------RKKKKNQHHRDVWINRTSRGEEEEE------------ 264
K KKKR ++ +K K++ RK NR G+EEEE
Sbjct: 14 KPKKKRGGRRFRKMKERFAMTELRKA----------QNRMEFGKEEEEVGYDFDEGVGLG 63
Query: 265 --KKKKKKKKKKKTTKKKKRKKKRKKKKNQ 292
+ + K + K KK K +
Sbjct: 64 MLGSGGGGRIRLSQKDAKTKAKLSKKMKKR 93
>gnl|CDD|183377 PRK11910, PRK11910, amidase; Provisional.
Length = 615
Score = 28.1 bits (62), Expect = 7.6
Identities = 19/74 (25%), Positives = 32/74 (43%), Gaps = 4/74 (5%)
Query: 221 VMKKKKKRKKKKKKKKKKKRKK----KKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKT 276
V KKKK KK+K K+ K+N H + N+ + E ++ + K +
Sbjct: 39 VSKKKKPTVKKEKPKQSSNNLTLGKNKENFHLEKGFGNKQLQVERIIDRIFQSSLKNRTE 98
Query: 277 TKKKKRKKKRKKKK 290
K K + +KK+
Sbjct: 99 IKVKPKNNPQKKQN 112
Score = 27.7 bits (61), Expect = 8.4
Identities = 13/35 (37%), Positives = 16/35 (45%)
Query: 256 TSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKKK 290
TS+ E K KKKK T KK+K K+
Sbjct: 26 TSQQTTTSENKSAVSKKKKPTVKKEKPKQSSNNLT 60
>gnl|CDD|150406 pfam09727, CortBP2, Cortactin-binding protein-2. This entry is the
first approximately 250 residues of cortactin-binding
protein 2. In addition to being a positional candidate
for autism this protein is expressed at highest levels
in the brain in humans. The human protein has six
associated ankyrin repeat domains pfam00023 towards the
C-terminus which act as protein-protein interaction
domains.
Length = 193
Score = 27.5 bits (61), Expect = 7.6
Identities = 14/72 (19%), Positives = 35/72 (48%), Gaps = 4/72 (5%)
Query: 226 KKRKKKKKKKKKKKRKKKKNQHHRDVWIN----RTSRGEEEEEKKKKKKKKKKKTTKKKK 281
K+++K + +++KRK + D + N R ++ E++K ++ KK++ +K
Sbjct: 110 KRQRKTVLELEEEKRKHIRYMKKSDDFTNLLEQERERLKKLLEQEKSQQAKKEQEHRKLL 169
Query: 282 RKKKRKKKKNQH 293
+ + K +
Sbjct: 170 ATLEEELGKLKS 181
>gnl|CDD|180007 PRK05309, PRK05309, 30S ribosomal protein S11; Validated.
Length = 128
Score = 26.7 bits (60), Expect = 7.6
Identities = 9/14 (64%), Positives = 10/14 (71%)
Query: 232 KKKKKKKKRKKKKN 245
KKKK K+K KKN
Sbjct: 1 AKKKKTVKKKVKKN 14
>gnl|CDD|219668 pfam07964, Red1, Rec10 / Red1. Rec10 / Red1 is involved in meiotic
recombination and chromosome segregation during
homologous chromosome formation. This protein localises
to the synaptonemal complex in S. cerevisiae and the
analogous structures (linear elements) in S. pombe. This
family is currently only found in fungi.
Length = 706
Score = 27.9 bits (62), Expect = 7.9
Identities = 18/67 (26%), Positives = 28/67 (41%)
Query: 225 KKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKK 284
+K K K K K K++ RD+ + T ++ +KK K K+T K +
Sbjct: 421 SEKINNPVKVVKVSKYKGNKSEKKRDINVLDTIFASPVSKELRKKVGKSKQTKLKNFKPV 480
Query: 285 KRKKKKN 291
K KK
Sbjct: 481 PNKSKKQ 487
>gnl|CDD|218581 pfam05416, Peptidase_C37, Southampton virus-type processing
peptidase. Corresponds to Merops family C37.
Norwalk-like viruses (NLVs), including the Southampton
virus, cause acute non-bacterial gastroenteritis in
humans. The NLV genome encodes three open reading frames
(ORFs). ORF1 encodes a polyprotein, which is processed
by the viral protease into six proteins.
Length = 535
Score = 27.9 bits (62), Expect = 7.9
Identities = 18/57 (31%), Positives = 28/57 (49%), Gaps = 5/57 (8%)
Query: 219 GFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKK 275
G+ ++ + + KK K KK R KK N R R EE +E KK ++++ K
Sbjct: 220 GWYIEPQDATPEGKKGKNKKGRGKKHNAFSR-----RGLSDEEYDEYKKIREERGGK 271
>gnl|CDD|215871 pfam00346, Complex1_49kDa, Respiratory-chain NADH dehydrogenase, 49
Kd subunit.
Length = 272
Score = 27.7 bits (62), Expect = 8.0
Identities = 12/49 (24%), Positives = 23/49 (46%), Gaps = 8/49 (16%)
Query: 23 SNTSLFP------TNSTIHIHIFPFIIKGMLMCFIIITAILGNLLVIIS 65
S+ S FP H+ I + +G L+ ++ AILG++ ++
Sbjct: 222 SDGSTFPYRWKIRAPGFAHLQILDHLSRGHLLADLV--AILGSIDIVFG 268
>gnl|CDD|153342 cd07658, F-BAR_NOSTRIN, The F-BAR (FES-CIP4 Homology and
Bin/Amphiphysin/Rvs) domain of Nitric Oxide Synthase
TRaffic INducer (NOSTRIN). F-BAR domains are
dimerization modules that bind and bend membranes and
are found in proteins involved in membrane dynamics and
actin reorganization. Nitric Oxide Synthase TRaffic
INducer (NOSTRIN) is expressed in endothelial and
epithelial cells and is involved in the regulation,
trafficking and targeting of endothelial NOS (eNOS).
NOSTRIN facilitates the endocytosis of eNOS by
coordinating the functions of dynamin and the
Wiskott-Aldrich syndrome protein (WASP). Increased
expression of NOSTRIN may be correlated to preeclampsia.
NOSTRIN contains an N-terminal F-BAR domain and a
C-terminal SH3 domain. F-BAR domains form banana-shaped
dimers with a positively-charged concave surface that
binds to negatively-charged lipid membranes. They can
induce membrane deformation in the form of long tubules.
The F-BAR domain of NOSTRIN is necessary and sufficient
for its membrane association and is responsible for its
subcellular localization.
Length = 239
Score = 27.3 bits (61), Expect = 8.2
Identities = 20/74 (27%), Positives = 31/74 (41%), Gaps = 7/74 (9%)
Query: 230 KKKKKKKKKKRKKKKNQHH-RDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKK 288
K KKK R+ +K Q D + T +K K KK + K+ K + K+
Sbjct: 127 KVKKKLHGLARENEKLQDQVEDNKQSCTK------QKMLNKLKKSAEVQDKEDEKLEAKR 180
Query: 289 KKNQHHRDVWQKKY 302
KK + R + +Y
Sbjct: 181 KKGEESRLKAENEY 194
>gnl|CDD|201461 pfam00829, Ribosomal_L21p, Ribosomal prokaryotic L21 protein.
Length = 96
Score = 26.3 bits (59), Expect = 8.3
Identities = 11/29 (37%), Positives = 15/29 (51%), Gaps = 2/29 (6%)
Query: 267 KKKKKKKKKTTKKKKRKKKRKKKKNQHHR 295
+ + KK K KR+K +KK Q HR
Sbjct: 64 LEHGRGKKVIVFKYKRRKNYRKK--QGHR 90
>gnl|CDD|225816 COG3277, GAR1, RNA-binding protein involved in rRNA processing
[Translation, ribosomal structure and biogenesis].
Length = 98
Score = 26.2 bits (58), Expect = 8.4
Identities = 10/25 (40%), Positives = 16/25 (64%)
Query: 220 FVMKKKKKRKKKKKKKKKKKRKKKK 244
++ K RKK+K +KK++ KK K
Sbjct: 74 YIPPDKLIRKKRKLPRKKRRPKKPK 98
Score = 26.2 bits (58), Expect = 8.7
Identities = 10/37 (27%), Positives = 16/37 (43%), Gaps = 2/37 (5%)
Query: 253 INRTSRGEEEE--EKKKKKKKKKKKTTKKKKRKKKRK 287
+ + K +KK+K +KK+R KK K
Sbjct: 62 DVKLESLVGDTLYIPPDKLIRKKRKLPRKKRRPKKPK 98
>gnl|CDD|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding
[General function prediction only].
Length = 239
Score = 27.3 bits (61), Expect = 8.4
Identities = 13/69 (18%), Positives = 29/69 (42%), Gaps = 6/69 (8%)
Query: 224 KKKKRKKKKKKKKKKKRKKKKNQHHR--DVWINRTSRGEEEEEKKKKKKKKKKKTTKKKK 281
+K R + + K+ +K KK K + E E E + + + + ++ +
Sbjct: 17 LEKDRLEPRIKEIRKALKKAKAELEALNKALEAL----EIELEDLENQVSQLESEIQEIR 72
Query: 282 RKKKRKKKK 290
+ KR ++K
Sbjct: 73 ERIKRAEEK 81
>gnl|CDD|216205 pfam00937, Corona_nucleoca, Coronavirus nucleocapsid protein.
Length = 346
Score = 27.7 bits (62), Expect = 8.5
Identities = 9/28 (32%), Positives = 13/28 (46%)
Query: 266 KKKKKKKKKKTTKKKKRKKKRKKKKNQH 293
K+K KK KK + KK+ N+
Sbjct: 206 KQKSSSKKPSRVTKKSAAEAAKKQLNKP 233
>gnl|CDD|223334 COG0256, RplR, Ribosomal protein L18 [Translation, ribosomal
structure and biogenesis].
Length = 125
Score = 26.8 bits (60), Expect = 8.6
Identities = 7/21 (33%), Positives = 13/21 (61%)
Query: 222 MKKKKKRKKKKKKKKKKKRKK 242
K K R++++ K+ + RKK
Sbjct: 2 RYKVKFRRRRRGKRAYRIRKK 22
>gnl|CDD|234533 TIGR04285, nucleoid_noc, nucleoid occlusion protein. This model
describes nucleoid occlusion protein, a close homolog to
ParB chromosome partitioning proteins including Spo0J in
Bacillus subtilis. Its gene often is located near the
gene for the Spo0J ortholog. This protein bind a
specific DNA sequence and blocks cytokinesis from
happening until chromosome segregation is complete.
Length = 255
Score = 27.5 bits (62), Expect = 8.6
Identities = 11/19 (57%), Positives = 13/19 (68%)
Query: 224 KKKKRKKKKKKKKKKKRKK 242
KK K +K KKKKK+RK
Sbjct: 192 KKLLEKPEKPKKKKKRRKG 210
>gnl|CDD|224596 COG1682, TagG, ABC-type polysaccharide/polyol phosphate export
systems, permease component [Carbohydrate transport and
metabolism / Cell envelope biogenesis, outer membrane].
Length = 263
Score = 27.5 bits (62), Expect = 8.7
Identities = 13/83 (15%), Positives = 36/83 (43%), Gaps = 5/83 (6%)
Query: 36 IHIFPFIIKGMLMCFIIITAILGNLLVII---SVIKHRKLR--IITNYFVVSLAFADLLV 90
++ +++ G+++ F AI ++ ++IK I+ +S F L+
Sbjct: 66 LNFLAYLLAGLILWFFFSEAISEGAGSVVANAALIKKINFPPLILPVARTLSRLFNFLIH 125
Query: 91 ALCVMPFNAIVSLTDEWYFGYFM 113
+ ++ F I+ + W++ +
Sbjct: 126 LIIILIFLIILGVEPSWHWLLLL 148
>gnl|CDD|225381 COG2825, HlpA, Outer membrane protein [Cell envelope biogenesis,
outer membrane].
Length = 170
Score = 27.0 bits (60), Expect = 9.0
Identities = 16/74 (21%), Positives = 40/74 (54%), Gaps = 3/74 (4%)
Query: 222 MKKKKKRKKKKKKKKKKKRKKKKNQHHRDVW---INRTSRGEEEEEKKKKKKKKKKKTTK 278
++ + K+++K+ +K +K+ K K+ + D ++ ++ E E +K+K KK +
Sbjct: 49 LESEFKKRQKELQKMQKELKAKEAKLQDDGKMEALSDRAKAEAEIKKEKLVNAFNKKQQE 108
Query: 279 KKKRKKKRKKKKNQ 292
+K +R+ ++ Q
Sbjct: 109 YEKDLNRREAEEEQ 122
>gnl|CDD|227360 COG5027, SAS2, Histone acetyltransferase (MYST family) [Chromatin
structure and dynamics].
Length = 395
Score = 27.4 bits (61), Expect = 9.1
Identities = 17/91 (18%), Positives = 36/91 (39%), Gaps = 3/91 (3%)
Query: 204 VYYALISSSISFWIPGFVMKKKKKR---KKKKKKKKKKKRKKKKNQHHRDVWINRTSRGE 260
V+Y ++ + WI ++ K+KK+ +K KK KK K D+
Sbjct: 39 VHYVELNRRLDEWITADLINLGAAISIPKRKKQTEKGKKEKKPKVSDRMDLDNENVQLEM 98
Query: 261 EEEEKKKKKKKKKKKTTKKKKRKKKRKKKKN 291
+++ ++ + K + + + KN
Sbjct: 99 LYSISNEREIRQLRFGGSKVQNPHEGARVKN 129
>gnl|CDD|129661 TIGR00570, cdk7, CDK-activating kinase assembly factor MAT1. All
proteins in this family for which functions are known
are cyclin dependent protein kinases that are components
of TFIIH, a complex that is involved in nucleotide
excision repair and transcription initiation. Also known
as MAT1 (menage a trois 1). This family is based on the
phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
Stanford University) [DNA metabolism, DNA replication,
recombination, and repair].
Length = 309
Score = 27.5 bits (61), Expect = 9.1
Identities = 16/63 (25%), Positives = 34/63 (53%)
Query: 224 KKKKRKKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRK 283
KKK +K+ K ++ K+K+ ++ +EEEE+++ +K+++ + KRK
Sbjct: 119 KKKIETYQKENKDVIQKNKEKSTREQEELEEALEFEKEEEEQRRLLLQKEEEEQQMNKRK 178
Query: 284 KKR 286
K+
Sbjct: 179 NKQ 181
>gnl|CDD|240520 cd13156, KOW_RPL6, KOW motif of Ribosomal Protein L6. RPL6
contains KOW motif that has an extra ribosomal role as
an oncogenic. KOW domain is known as an RNA-binding
motif that is shared so far among some families of
ribosomal proteins, the essential bacterial
transcriptional elongation factor NusG, the eukaryotic
chromatin elongation factor Spt5, the higher eukaryotic
KIN17 proteins and Mtr4. .
Length = 152
Score = 26.7 bits (60), Expect = 9.4
Identities = 16/33 (48%), Positives = 21/33 (63%), Gaps = 5/33 (15%)
Query: 265 KKKKKKKKKKKT-----TKKKKRKKKRKKKKNQ 292
K+KKKKKKKKK KKKK ++K++Q
Sbjct: 82 KRKKKKKKKKKEGEFFEEKKKKYVVSEERKEDQ 114
>gnl|CDD|215343 PLN02638, PLN02638, cellulose synthase A (UDP-forming), catalytic
subunit.
Length = 1079
Score = 27.6 bits (61), Expect = 9.6
Identities = 13/33 (39%), Positives = 16/33 (48%)
Query: 218 PGFVMKKKKKRKKKKKKKKKKKRKKKKNQHHRD 250
PGF+ +KK K KK KKK+ H D
Sbjct: 652 PGFLSSLCGGSRKKSSKSSKKGSDKKKSGKHVD 684
>gnl|CDD|225058 COG2147, RPL19A, Ribosomal protein L19E [Translation, ribosomal
structure and biogenesis].
Length = 150
Score = 26.9 bits (60), Expect = 9.6
Identities = 20/74 (27%), Positives = 32/74 (43%), Gaps = 2/74 (2%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKN--QHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKK 280
+K+ +KKK +++ RK K ++ WI R E K + K K T +K
Sbjct: 64 RKRHAQKKKGRRRGPGSRKGTKGARMPSKERWIKRIRALRRELRKLRDDGKIDKHTYRKL 123
Query: 281 KRKKKRKKKKNQHH 294
R K K++ H
Sbjct: 124 YRMAKGGAFKSKSH 137
>gnl|CDD|219256 pfam06991, Prp19_bind, Splicing factor, Prp19-binding domain. This
family represents the C-terminus (approximately 300
residues) of proteins that are involved as binding
partners for Prp19 as part of the nuclear pore complex.
The family in Drosophila is necessary for pre-mRNA
splicing, and the human protein has been found in
purifications of the spliceosome. In the past this
family was thought, erroneously, to be associated with
microfibrillin.
Length = 277
Score = 27.2 bits (60), Expect = 9.6
Identities = 17/65 (26%), Positives = 34/65 (52%), Gaps = 1/65 (1%)
Query: 229 KKKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKK 288
K ++ K+ K+ R++++ I + R EEE++ + +K K +T K K+ K +
Sbjct: 121 KLRELKRIKRDREEREEMEREKAEIEKM-RNMTEEERRAELRKNPKVSTNKAKKGKYKFL 179
Query: 289 KKNQH 293
+K H
Sbjct: 180 QKYYH 184
>gnl|CDD|220417 pfam09812, MRP-L28, Mitochondrial ribosomal protein L28. Members
of this family are components of the mitochondrial large
ribosomal subunit. Mature mitochondrial ribosomes
consist of a small (37S) and a large (54S) subunit. The
37S subunit contains at least 33 different proteins and
1 molecule of RNA (15S). The 54S subunit contains at
least 45 different proteins and 1 molecule of RNA (21S).
Length = 146
Score = 26.6 bits (59), Expect = 9.7
Identities = 12/32 (37%), Positives = 17/32 (53%), Gaps = 7/32 (21%)
Query: 222 MKKKKK-------RKKKKKKKKKKKRKKKKNQ 246
+KKKKK RK+K+ +K +KR K
Sbjct: 6 LKKKKKVDPQREQRKRKRLEKFIRKRTKSARG 37
>gnl|CDD|215124 PLN02195, PLN02195, cellulose synthase A.
Length = 977
Score = 27.6 bits (61), Expect = 9.8
Identities = 13/30 (43%), Positives = 16/30 (53%), Gaps = 3/30 (10%)
Query: 251 VWINRTSRGEEEEEKKKKKKKKKKKTTKKK 280
+W NR E ++KK KKKK KK K
Sbjct: 113 IWKNRV---ESWKDKKNKKKKSAKKKEAHK 139
>gnl|CDD|221012 pfam11169, DUF2956, Protein of unknown function (DUF2956). This
family of proteins with unknown function appears to be
restricted to Gammaproteobacteria.
Length = 103
Score = 26.1 bits (58), Expect = 9.9
Identities = 8/26 (30%), Positives = 16/26 (61%)
Query: 223 KKKKKRKKKKKKKKKKKRKKKKNQHH 248
KK++K K ++ K +K++ K K +
Sbjct: 40 KKQQKAKAREADKARKQQLKAKQRQA 65
>gnl|CDD|192718 pfam11176, DUF2962, Protein of unknown function (DUF2962). This
eukaryotic family of proteins has no known function.
Length = 155
Score = 26.8 bits (60), Expect = 10.0
Identities = 11/42 (26%), Positives = 14/42 (33%), Gaps = 11/42 (26%)
Query: 265 KKKKKKKKKKKT-----------TKKKKRKKKRKKKKNQHHR 295
K K KKKK + R K +KKK +
Sbjct: 1 KVTKNIAKKKKVLHPKSRKAKQLNRATLRDDKLEKKKAERAE 42
>gnl|CDD|224023 COG1098, VacB, Predicted RNA binding protein (contains ribosomal
protein S1 domain) [Translation, ribosomal structure and
biogenesis].
Length = 129
Score = 26.5 bits (59), Expect = 10.0
Identities = 13/62 (20%), Positives = 29/62 (46%), Gaps = 6/62 (9%)
Query: 230 KKKKKKKKKKRKKKKNQHHRDVWINRTSRGEEEEEKKKKKKKKKKKTTKKKKRKKKRKKK 289
+K +++ +K+ +K + +R G E + K ++ +T K++ KKKR +
Sbjct: 74 RKLEEEPEKQHRKPRFSK------SRPKEGFETLLSRLLKWIEESLSTLKRQTKKKRGGR 127
Query: 290 KN 291
Sbjct: 128 GA 129
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.324 0.136 0.424
Gapped
Lambda K H
0.267 0.0677 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 15,607,133
Number of extensions: 1581873
Number of successful extensions: 23841
Number of sequences better than 10.0: 1
Number of HSP's gapped: 14804
Number of HSP's successfully gapped: 1652
Length of query: 303
Length of database: 10,937,602
Length adjustment: 96
Effective length of query: 207
Effective length of database: 6,679,618
Effective search space: 1382680926
Effective search space used: 1382680926
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 15 ( 7.0 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 59 (26.6 bits)