RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= 015100
(413 letters)
>gnl|CDD|216061 pfam00687, Ribosomal_L1, Ribosomal protein L1p/L10e family. This
family includes prokaryotic L1 and eukaryotic L10.
Length = 200
Score = 142 bits (360), Expect = 2e-40
Identities = 60/188 (31%), Positives = 88/188 (46%), Gaps = 18/188 (9%)
Query: 48 EQDDFVYLILTLKKIPQVSRTN-AFKIPLPHSLLGNDSDNPPEICLIMDDRPKSNLTKDA 106
+ D+ V L + LKK P+ N + LPH L ++C+I D
Sbjct: 11 KFDETVELQINLKKDPRKKDQNVRGTVVLPHGL-----GKDVKVCVIAKDP--------- 56
Query: 107 VMKKIKNDNLPITKVIKITKLKTDYRPFEAKRKLCDSYDIFLADKRVVPLLPKLLGKHFF 166
K K V+ LK Y + RKL +D+FLAD ++PLLPKLLGK
Sbjct: 57 -EAKAKEAKDAGADVVGGEDLKEKY-KIKKGRKLAKDFDVFLADPDIMPLLPKLLGKVLG 114
Query: 167 KKKKIPVPVDLKHQNWKEQIEKVCGSALLYLR-TGTCSVLKVGKVSMGAEDIAENVIAAI 225
+ K+P P + + IE+ + + + G C +KVGKVSM E++ EN+ A I
Sbjct: 115 PRGKMPNPKTTVTPDVAKAIEEAKSGTVEFRKDKGGCIHVKVGKVSMSPEELVENIEAVI 174
Query: 226 NGLLESLA 233
N L++ L
Sbjct: 175 NALVKKLP 182
>gnl|CDD|238235 cd00403, Ribosomal_L1, Ribosomal protein L1. The L1 protein,
located near the E-site of the ribosome, forms part of
the L1 stalk along with 23S rRNA. In bacteria and
archaea, L1 functions both as a ribosomal protein that
binds rRNA, and as a translation repressor that binds
its own mRNA. Like several other large ribosomal
subunit proteins, L1 displays RNA chaperone activity.
L1 is one of the largest ribosomal proteins. It is
composed of two domains that cycle between open and
closed conformations via a hinge motion. The RNA-binding
site of L1 is highly conserved, with both mRNA and rRNA
binding the same binding site.
Length = 208
Score = 124 bits (313), Expect = 1e-33
Identities = 66/211 (31%), Positives = 95/211 (45%), Gaps = 28/211 (13%)
Query: 24 VERAVKALLKWLKSNSQTQKPQLLEQDDFVYLILTLKKIPQVSRTN-AFKIPLPHSLLGN 82
+E A+KAL K + + D+ V L + LKK + + LPH L +
Sbjct: 1 LEEAIKALKKTSV--------KKRKFDETVELQINLKKDDRKKDQRIRGTVILPHGLGKD 52
Query: 83 DSDNPPEICLIMDDRPKSNLTKDAVMKKIKNDNLPITKVIKITKLKTDYRPFEAKRKLCD 142
+C+ KD K+ K V+ LK + EAK KL
Sbjct: 53 VK-----VCVF---------AKDEQAKEAKAA---GADVVGGEDLKKKIKNGEAK-KLAK 94
Query: 143 SYDIFLADKRVVPLLPKLLGKHFFKKKKIPVPVDLK-HQNWKEQIEKVCGSALLYLRTGT 201
+D+FLAD R++ LLPKLLGK + K+P P ++ + IE+ S L G
Sbjct: 95 DFDLFLADPRIMMLLPKLLGKVLGPRGKMPNPKTGTVTEDLAKAIEEAKSSVEFRLDKGG 154
Query: 202 CSVLKVGKVSMGAEDIAENVIAAINGLLESL 232
C + VGKVSM E + EN+ A IN L++ L
Sbjct: 155 CIHVPVGKVSMSPEQLVENIEAVINALVKKL 185
>gnl|CDD|185405 PTZ00029, PTZ00029, 60S ribosomal protein L10a; Provisional.
Length = 216
Score = 51.3 bits (123), Expect = 2e-07
Identities = 29/95 (30%), Positives = 51/95 (53%), Gaps = 1/95 (1%)
Query: 138 RKLCDSYDIFLADKRVVPLLPKLLGKHFFKKKKIPVPVDLKHQNWKEQIEKVCGSALLYL 197
+KL YD FLA + ++P +P+LLG K K P + + + +++I ++ S L
Sbjct: 100 KKLAKKYDAFLASQSLLPQIPRLLGPGLNKAGKFPTLI-THNDDIEDKINELKSSVKFQL 158
Query: 198 RTGTCSVLKVGKVSMGAEDIAENVIAAINGLLESL 232
+ C + VG V M E + +N++ +IN L+ L
Sbjct: 159 KKVLCLGVAVGNVEMTEEQLRQNIVLSINFLVSLL 193
>gnl|CDD|140252 PTZ00225, PTZ00225, 60S ribosomal protein L10a; Provisional.
Length = 214
Score = 47.3 bits (112), Expect = 4e-06
Identities = 51/217 (23%), Positives = 99/217 (45%), Gaps = 28/217 (12%)
Query: 17 SRVSPKTVERAVKALLKWLKSNSQTQKPQLLEQDDFVYLILTLKKI-PQVSRTNAFKIPL 75
S++ P+T+ A++A+LK K + + L + LK PQ + + + L
Sbjct: 2 SKIPPQTLSEAIQAVLKVDKERKFKES---------IDLQVNLKNYDPQKDKRFSGSLKL 52
Query: 76 PHSLLGNDSDNPPEICLIMDDRPKSNLTKDAVMKKIKNDNLPITKVIKITKLKTDYRPFE 135
P+ + +CL+ D L + + KK + +P ++ KL + +
Sbjct: 53 PNVCRPRMT-----VCLLCD------LVHEDIAKK---EGVPTMNQEELKKLNKNKKLV- 97
Query: 136 AKRKLCDSYDIFLADKRVVPLLPKLLGKHFFKKKKIPVPVDLKHQNWKEQIEKVCGSALL 195
+K+C+ YD FL + ++ +P+L+G H + K P V ++ +++ ++ +
Sbjct: 98 --KKMCNQYDAFLCSESIIKTVPRLVGPHMHRMGKFPT-VCSPSESLPDKVVELRSTVKF 154
Query: 196 YLRTGTCSVLKVGKVSMGAEDIAENVIAAINGLLESL 232
L+ C VG V M E + +NV+ AIN L+ L
Sbjct: 155 QLKKVLCLGTCVGHVEMTEEQLRQNVVMAINFLVSLL 191
>gnl|CDD|235254 PRK04203, rpl1P, 50S ribosomal protein L1P; Reviewed.
Length = 215
Score = 41.4 bits (98), Expect = 3e-04
Identities = 25/93 (26%), Positives = 51/93 (54%), Gaps = 1/93 (1%)
Query: 136 AKRKLCDSYDIFLADKRVVPLLPKLLGKHFFKKKKIPVPVDLKHQNWKEQIEKVCGSALL 195
A +KL + YD F+A+ ++PL+ + LG + K+P P+ + + K +E++ + +
Sbjct: 96 AAKKLANEYDFFIAEADLMPLIGRYLGPVLGPRGKMPTPLP-PNADIKPLVERLKNTVRI 154
Query: 196 YLRTGTCSVLKVGKVSMGAEDIAENVIAAINGL 228
+ ++VG M E++AEN+ A +N +
Sbjct: 155 RTKDQPTFHVRVGTEDMSPEELAENIDAVLNRI 187
>gnl|CDD|233148 TIGR00844, c_cpa1, na(+)/h(+) antiporter. The Monovalent
Cation:Proton Antiporter-1 (CPA1) Family (TC 2.A.36) The
CPA1 family is a large family of proteins derived from
Gram-positive and Gram-negative bacteria, blue green
bacteria, yeast, plants and animals. Transporters from
eukaryotes have been functionally characterized, and all
of these catalyze Na+:H+ exchange. Their primary
physiological functions may be in (1) cytoplasmic pH
regulation, extruding the H+ generated during
metabolism, and (2) salt tolerance (in plants), due to
Na+ uptake into vacuoles. This model is specific for the
fungal members of this family [Transport and binding
proteins, Cations and iron carrying compounds].
Length = 810
Score = 41.1 bits (96), Expect = 0.001
Identities = 22/74 (29%), Positives = 36/74 (48%), Gaps = 5/74 (6%)
Query: 258 QDKDSEKENAEDVNDHGSKKKLKKKKGRIHEVRYMD-----STIGEVLDEDELGSDDDGE 312
+D+ E ++ HG K+K+KK+ + V S + E + D L S+D+
Sbjct: 724 EDESGEVFRRYRISPHGGKRKIKKRNDSVVSVDEEKAIEGPSRVPERGNHDLLHSEDEMA 783
Query: 313 GDVGESEDGEDSED 326
D ESE+ +D ED
Sbjct: 784 DDEAESENMDDYED 797
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 39.2 bits (91), Expect = 0.004
Identities = 23/89 (25%), Positives = 37/89 (41%), Gaps = 1/89 (1%)
Query: 238 QAVPDLKLKIEGVKENEGEGQDKDSEKENAEDVNDHGSKKKLKKKKGRIHEVRYMDSTIG 297
QA D ++K + E + E + + E E ++ + K E G
Sbjct: 784 QAGEDGEMKGDEGAEGKVEHEGETEAGEKDEHEGQSETQADDTEVKDETGEQELNAENQG 843
Query: 298 EVLDEDELGSDDDGEGDVGESEDGEDSED 326
E +DE G D G D G+SE+ E+ E+
Sbjct: 844 EA-KQDEKGVDGGGGSDGGDSEEEEEEEE 871
Score = 35.4 bits (81), Expect = 0.070
Identities = 27/131 (20%), Positives = 54/131 (41%), Gaps = 10/131 (7%)
Query: 252 ENEGEGQDKDSEKENAEDVNDHGSKKKLKKKKGRIHEVRYMDSTIGEVLDEDELGSDDDG 311
E+EGE + + +E ++ K + +G E + T E G +D+
Sbjct: 727 EDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEHEGETEAE-------GKEDED 779
Query: 312 EGDVGESEDGE-DSEDGKMSSGDILGKKRKKGDKERAQKLPKKVA--KVKKDELSSDMKN 368
EG++ EDGE ++G + G+ E + + +VK + ++
Sbjct: 780 EGEIQAGEDGEMKGDEGAEGKVEHEGETEAGEKDEHEGQSETQADDTEVKDETGEQELNA 839
Query: 369 EDVGKQKKQKK 379
E+ G+ K+ +K
Sbjct: 840 ENQGEAKQDEK 850
Score = 31.9 bits (72), Expect = 0.67
Identities = 35/166 (21%), Positives = 68/166 (40%), Gaps = 19/166 (11%)
Query: 248 EGVKENEGEGQDKDSEKENAEDVNDHGSKKKLKKKKGRIH-EVRYMDSTIGEV------- 299
EG + E EG+ E + K + +G I E + GE+
Sbjct: 648 EGERPTEAEGE-NGEESGGEAEQEGETETKGENESEGEIPAERKGEQEGEGEIEAKEADH 706
Query: 300 -------LDEDELGSDDDGEGDVGESEDGEDSED-GKMSSGDILGKK--RKKGDKERAQK 349
E E ++ +G D GE E GE+ E+ G+ GK +GD++ +
Sbjct: 707 KGETEAEEVEHEGETEAEGTEDEGEIETGEEGEEVEDEGEGEAEGKHEVETEGDRKETEH 766
Query: 350 LPKKVAKVKKDELSSDMKNEDVGKQKKQKKVGLSLKNDEEKSSGKE 395
+ A+ K+DE +++ + G+ K + ++++ E +G++
Sbjct: 767 EGETEAEGKEDEDEGEIQAGEDGEMKGDEGAEGKVEHEGETEAGEK 812
>gnl|CDD|223159 COG0081, RplA, Ribosomal protein L1 [Translation, ribosomal
structure and biogenesis].
Length = 228
Score = 36.1 bits (84), Expect = 0.019
Identities = 26/109 (23%), Positives = 47/109 (43%), Gaps = 8/109 (7%)
Query: 135 EAKRKLCDSYDIFLADKRVVPLLPKLLGKHFFKKKKIPVPVDLKH-QNWKEQIEKVCGSA 193
K +D+F+A ++PL+ LGK + +P P + + +E++
Sbjct: 103 LIKNGRAKDFDVFIATPDMMPLV-GKLGKVLGPRGLMPNPKTGTVTDDVAKAVEELKKGT 161
Query: 194 LLYL--RTGTCSVLKVGKVSMGAEDIAEN---VIAAINGLLESLALPVY 237
+ + + G V +GKVS E +AEN ++ AI + A Y
Sbjct: 162 VEFRADKAGVIHVP-IGKVSFDDEKLAENIEALLNAIVKAKPAGAKGQY 209
>gnl|CDD|218177 pfam04615, Utp14, Utp14 protein. This protein is found to be part
of a large ribonucleoprotein complex containing the U3
snoRNA. Depletion of the Utp proteins impedes production
of the 18S rRNA, indicating that they are part of the
active pre-rRNA processing complex. This large RNP
complex has been termed the small subunit (SSU)
processome.
Length = 728
Score = 36.2 bits (84), Expect = 0.032
Identities = 41/202 (20%), Positives = 74/202 (36%), Gaps = 33/202 (16%)
Query: 242 DLKLKIEGVKENEGEGQDKDSEKENAED---VNDHGSKKKLKKKKGRIHEVRYMDSTIGE 298
+L+ KIEG +E + + +E ED +D G + +KK + D
Sbjct: 313 ELRRKIEGKSVSEEDEDEDSDSEEEDEDDDEDDDDGENPWMLRKKLGKLKEGEDDEENSG 372
Query: 299 VL------------------DEDELGSDDDGEGDVGESEDGEDSED--GKMSSGDILGKK 338
+L + +EL + +GE + E E+ E S+ G+ G G+K
Sbjct: 373 LLSMKFMQRAEARKKEENDAEIEELRRELEGEEESDEEENEEPSKKNVGRRKFGPENGEK 432
Query: 339 RKKGDKERAQKLPKKVAKVKKDELSSDMKNED----------VGKQKKQKKVGLSLKNDE 388
+ K + + + K + DE E+ + + +K +K + DE
Sbjct: 433 EAESKKLKKENKNEFKEKKESDEEEELEDEEEAKVEKVANKLLKRSEKAQKEEEEEELDE 492
Query: 389 EKSSGKEKKKSMLGKVKKEKKK 410
E K K++ KK
Sbjct: 493 ENPWLKTTSSVGKSAKKQDSKK 514
>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
subunit (TFIIF-alpha). Transcription initiation factor
IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
II-associating protein 74 (RAP74) is the large subunit
of transcription factor IIF (TFIIF), which is essential
for accurate initiation and stimulates elongation by RNA
polymerase II.
Length = 528
Score = 36.1 bits (83), Expect = 0.036
Identities = 35/159 (22%), Positives = 56/159 (35%), Gaps = 24/159 (15%)
Query: 245 LKIEGVKENEGEGQDKDSEKENAEDVNDHGSKKKLKKKKGRI------------------ 286
+ +G + ++G + + + + KK KKG+
Sbjct: 222 DEDDGDESDKGGEDGDEEKSKKKKKKLAKNKKKLDDDKKGKRGGDDDADEYDSDDGDDEG 281
Query: 287 HEVRYM--DSTIGEVLDEDE-LGSDDDGE-GDVGESEDGEDSEDGKMSSGDILGKKRKK- 341
E Y+ S G +E E S + ++ + ED E+SE+ K L KK KK
Sbjct: 282 REEDYISDSSASGNDPEEREDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLSKKGKKL 341
Query: 342 -GDKERAQKLPKKVAKVKKDELSSDMKNEDVGKQKKQKK 379
K + L K + D SD+ ED KK
Sbjct: 342 KKLKGKKNGLDKDDSDSGDDSDDSDIDGEDSVSLVTAKK 380
Score = 34.2 bits (78), Expect = 0.14
Identities = 32/163 (19%), Positives = 63/163 (38%), Gaps = 16/163 (9%)
Query: 244 KLKIEGVKENEGEGQDKDSEKENAEDVNDHGSKKKLKKKKGRIHEVRYMDSTIGEVLDED 303
+ K +K+ EG+ +D E + + D KK KKK + + ++ D+
Sbjct: 209 RGKDLKIKDLEGDDEDDGDESDKGGEDGDEEKSKKKKKKLAKNKK---------KLDDDK 259
Query: 304 ELGSDDDGEGDVGESEDGEDSE-------DGKMSSGDILGKKRKKGDKERAQKLPKKVAK 356
+ D + D +S+DG+D D S D ++ K + A+ ++
Sbjct: 260 KGKRGGDDDADEYDSDDGDDEGREEDYISDSSASGNDPEEREDKLSPEIPAKPEIEQDED 319
Query: 357 VKKDELSSDMKNEDVGKQKKQKKVGLSLKNDEEKSSGKEKKKS 399
++ E + + + K+ K+ K KN +K S
Sbjct: 320 SEESEEEKNEEEGGLSKKGKKLKKLKGKKNGLDKDDSDSGDDS 362
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 36.3 bits (83), Expect = 0.037
Identities = 37/170 (21%), Positives = 69/170 (40%), Gaps = 14/170 (8%)
Query: 246 KIEGVKENEGEGQDKDSEKENAEDVNDHGSKKKLKKKKGRIHEVRYMDSTIGEVLDEDEL 305
K E VK+ E E + E + AE+ ++ +K ++ + R E DEL
Sbjct: 1225 KAEAVKKAE-EAKKDAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADEL 1283
Query: 306 GSDDDGEGDVGESEDGEDSEDGKMSSGDILGKKRKKGDKERAQKLPKKV--AKVKKDEL- 362
++ ++++ + +E+ K + K+K + ++A + KK AK K D
Sbjct: 1284 KKAEE----KKKADEAKKAEEKKKADE----AKKKAEEAKKADEAKKKAEEAKKKADAAK 1335
Query: 363 --SSDMKNEDVGKQKKQKKVGLSLKNDEEKSSGKEKKKSMLGKVKKEKKK 410
+ + K + + + + EEK+ EKKK K KK
Sbjct: 1336 KKAEEAKKAAEAAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKK 1385
Score = 34.7 bits (79), Expect = 0.092
Identities = 37/169 (21%), Positives = 63/169 (37%), Gaps = 17/169 (10%)
Query: 242 DLKLKIEGVKENEGEGQDKDSEKENAEDVNDHGSKKKLKKKKGRIHEVRYMDSTIGEVLD 301
+ K K E K+ + K E + A + K + E + E
Sbjct: 1319 EAKKKAEEAKKKADAAKKKAEEAKKAAEA-----AKAEAEAAADEAEAAEEKAEAAEKKK 1373
Query: 302 EDELGSDDDGEGDVGESEDGEDSEDGKMSSGDILGKKRKKGDKERAQKLPKKVAKVKKDE 361
E+ D + E + +++ KK+ + DK++A +L K A KK +
Sbjct: 1374 EEAKKKADAAKKKAEEKKKADEA------------KKKAEEDKKKADELKKAAAAKKKAD 1421
Query: 362 LSSDMKNEDVGKQKKQKKVGLSLKNDEEKSSGKEKKKSMLGKVKKEKKK 410
+ E + +KK + K DE K +E KK+ K K E+ K
Sbjct: 1422 EAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAK 1470
Score = 34.3 bits (78), Expect = 0.13
Identities = 37/170 (21%), Positives = 70/170 (41%), Gaps = 11/170 (6%)
Query: 242 DLKLKIEGVKENEGEGQDKDSEKENAEDVND-HGSKKKLKKKKGRIHEVRYMDSTIGEVL 300
K K E K+ + + + +K+ A+++ +KKK + K + E + D +
Sbjct: 1382 AAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKA- 1440
Query: 301 DEDELGSDDDGEGDVGESEDGEDSEDGKMSSGDILGKKRKKGDKERAQKLPKKVAKVKKD 360
+E D+ + E++ E+++ + K+K + ++A + KK + KK
Sbjct: 1441 --EEAKKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKK- 1497
Query: 361 ELSSDMKNEDVGKQKKQKKVGLSLKNDEEKSSGKEKKKSMLGKVKKEKKK 410
K ++ K + KK K EE E KK+ K E KK
Sbjct: 1498 ------KADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKK 1541
Score = 34.3 bits (78), Expect = 0.14
Identities = 41/174 (23%), Positives = 76/174 (43%), Gaps = 14/174 (8%)
Query: 243 LKLKIEGVKENEGEGQDKDSEKENAEDVN--DHGSKKKLKKKKGRIHEVRYMDSTIGEVL 300
+KL E K E + + K AE++ + KK + KK E + + E+
Sbjct: 1598 MKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAE----ELK 1653
Query: 301 DEDELGSDDDGEGDVGESEDGEDSEDGKMSSGDILGKKRKKGDKERAQKLPKKVAKVKKD 360
+E E ED + +E+ K + D +K+ ++ + KK ++KK
Sbjct: 1654 KAEEENKIKAAEEAKKAEEDKKKAEEAKKAEED---EKKAAEALKKEAEEAKKAEELKKK 1710
Query: 361 ELSSDMKNEDVGKQKKQKKVGL-SLKNDEEKSSGKEKKKSMLGKVKKEKKKLVA 413
E K E++ K +++ K+ K + E ++KKK+ K +E+KK +A
Sbjct: 1711 EAEEKKKAEELKKAEEENKIKAEEAKKEAE----EDKKKAEEAKKDEEEKKKIA 1760
Score = 32.8 bits (74), Expect = 0.45
Identities = 38/157 (24%), Positives = 72/157 (45%), Gaps = 9/157 (5%)
Query: 254 EGEGQDKDSEKENAEDVNDHGSKKKLKKKKGRIHEVRYMDSTIGEVLDEDELGSDDDGEG 313
+ E + K E + AE++ KKK ++ K + E + M E + E ++
Sbjct: 1541 KAEEKKKADELKKAEELKKAEEKKKAEEAK-KAEEDKNMALRKAEEAKKAEEARIEEVMK 1599
Query: 314 DVGESEDGEDSEDGKMSSGDILGKKRKKGDKERAQKLPKKVAKVKKDELSSDMKNEDVGK 373
E + + E K I ++ KK ++E+ KKV ++KK E K E++ K
Sbjct: 1600 LYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEK-----KKVEQLKKKEAEEKKKAEELKK 1654
Query: 374 QKKQKKVGLSLKNDEEKSSGKEKKKSMLGKVKKEKKK 410
+++ K+ +E K + ++KKK+ K +E +K
Sbjct: 1655 AEEENKIK---AAEEAKKAEEDKKKAEEAKKAEEDEK 1688
Score = 32.4 bits (73), Expect = 0.59
Identities = 34/152 (22%), Positives = 63/152 (41%), Gaps = 15/152 (9%)
Query: 259 DKDSEKENAEDVNDHGSKKKLKKKKGRIHEVRYMDSTIGEVLDEDELGSDDDGEGDVGES 318
+ EK A + +KKK K + E + D + E D ++
Sbjct: 1360 EAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADE------------AKKKAEEDKKKA 1407
Query: 319 EDGEDSEDGKMSSGDILGKKRKKGDKERAQKLPKKVAKVKKDELSSDMKNEDVGKQKKQK 378
++ + + K + + K+K +K++A + KK + KK + + E ++ +K
Sbjct: 1408 DELKKAAAAKKKADEA---KKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKK 1464
Query: 379 KVGLSLKNDEEKSSGKEKKKSMLGKVKKEKKK 410
K + K DE K +E KK+ K K E+ K
Sbjct: 1465 KAEEAKKADEAKKKAEEAKKADEAKKKAEEAK 1496
Score = 30.9 bits (69), Expect = 1.9
Identities = 40/170 (23%), Positives = 65/170 (38%), Gaps = 7/170 (4%)
Query: 246 KIEGVKENEGEGQDKDSEKENAEDVNDHGSKKK----LKKKKGRIHEVRYMDSTIGEVLD 301
K K+ E + K EK+ A++ + K KKK + E
Sbjct: 1412 KAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEEAKKKAEEAKK 1471
Query: 302 EDELGSDDDGEGDVGESEDGEDSEDGKMSSGDILGKKRKKGDKERAQKLPKKVAKVKKDE 361
DE + E++ + K + +KK D+ + + KK + KK E
Sbjct: 1472 ADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKADEAKKAE 1531
Query: 362 LSSDMKNEDVGKQKKQKKVGLSLKNDEEKSSGKEKKKSMLGKVKKEKKKL 411
+ K D K+ ++KK LK EE +EKKK+ K +E K +
Sbjct: 1532 ---EAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNM 1578
>gnl|CDD|215601 PLN03142, PLN03142, Probable chromatin-remodeling complex ATPase
chain; Provisional.
Length = 1033
Score = 35.2 bits (81), Expect = 0.065
Identities = 24/121 (19%), Positives = 55/121 (45%), Gaps = 15/121 (12%)
Query: 292 MDSTIGEVLDEDEL--------GSDDDGEGDVGESEDGEDSEDGKMSSGDI-LGKKRKKG 342
+++ E DE+EL DD E + ++ E+ ++ S + K+ K
Sbjct: 5 VNTQANEEEDEEELEAVARSAGSDSDDDEVPAEDEDEDEEDDEEAESPAKAEISKREKAR 64
Query: 343 DKERAQKLPKKVAKVKKDE---LSSDMKNEDVGKQK---KQKKVGLSLKNDEEKSSGKEK 396
KE ++ +++ K+ + + + +DM N+ G+ K +Q ++ ++ +S K+
Sbjct: 65 LKELKKQKKQEIQKILEQQNAAIDADMNNKGKGRLKYLLQQTEIFAHFAKGDQSASAKKA 124
Query: 397 K 397
K
Sbjct: 125 K 125
>gnl|CDD|217392 pfam03153, TFIIA, Transcription factor IIA, alpha/beta subunit.
Transcription initiation factor IIA (TFIIA) is a
heterotrimer, the three subunits being known as alpha,
beta, and gamma, in order of molecular weight. The N and
C-terminal domains of the gamma subunit are represented
in pfam02268 and pfam02751, respectively. This family
represents the precursor that yields both the alpha and
beta subunits. The TFIIA heterotrimer is an essential
general transcription initiation factor for the
expression of genes transcribed by RNA polymerase II.
Together with TFIID, TFIIA binds to the promoter region;
this is the first step in the formation of a
pre-initiation complex (PIC). Binding of the rest of the
transcription machinery follows this step. After
initiation, the PIC does not completely dissociate from
the promoter. Some components, including TFIIA, remain
attached and re-initiate a subsequent round of
transcription.
Length = 332
Score = 34.7 bits (80), Expect = 0.076
Identities = 20/76 (26%), Positives = 27/76 (35%), Gaps = 4/76 (5%)
Query: 251 KENEGEGQDKDSEKENAEDVNDHGSKKKLKKKKGRIHEVRYMDSTIGEVLDEDELGSDDD 310
K EG G K + + S K+ + + + DED + SD D
Sbjct: 215 KGAEGGGAMKVLK----QPKKQAKSSKRRTIAQIDGIDSDDEGDGSDDDDDEDAIESDLD 270
Query: 311 GEGDVGESEDGEDSED 326
D EDGED D
Sbjct: 271 DSDDDVSDEDGEDLFD 286
>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
Length = 1388
Score = 34.6 bits (80), Expect = 0.10
Identities = 33/180 (18%), Positives = 62/180 (34%), Gaps = 11/180 (6%)
Query: 239 AVPDLKLKIEGVKENEGEGQDKDSEKENAEDVNDHGSKKKLKKKKGRIHEVRYMDSTIGE 298
P LK K + K++ + K S N++ V+ K+KL K +
Sbjct: 1170 RKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSD-EKRKLDDKPDNKKSNSSGSDQEDD 1228
Query: 299 VLDEDELGSDDDGEGDVGESEDGEDSEDGKMSSGDILGKKRKKGDK-------ERAQKLP 351
+ + ++ + SED S D L K+ K + + + P
Sbjct: 1229 EEQKTKPKKSSVKRLKSKKNNSSKSSEDNDEFSSDDLSKEGKPKNAPKRVSAVQYSPPPP 1288
Query: 352 KKVAKVKKD--ELSSDMKNEDVGKQKKQKKVGLSLKNDEEKSSG-KEKKKSMLGKVKKEK 408
K + + S + V K+ + L K EK + K+K K+ + + +
Sbjct: 1289 SKRPDGESNGGSKPSSPTKKKVKKRLEGSLAALKKKKKSEKKTARKKKSKTRVKQASASQ 1348
Score = 33.5 bits (77), Expect = 0.22
Identities = 25/153 (16%), Positives = 47/153 (30%), Gaps = 3/153 (1%)
Query: 258 QDKDSEKENAEDVNDHGSKKKLKKKKGRIHEVRYMDSTIGEVLDEDELGSDDDGEGDVGE 317
+ K+ +K+ + + K+ E R +D S D E D +
Sbjct: 1175 KKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDKPDNKKSN---SSGSDQEDDEEQ 1231
Query: 318 SEDGEDSEDGKMSSGDILGKKRKKGDKERAQKLPKKVAKVKKDELSSDMKNEDVGKQKKQ 377
+ S ++ S K + + E + K K K K+
Sbjct: 1232 KTKPKKSSVKRLKSKKNNSSKSSEDNDEFSSDDLSKEGKPKNAPKRVSAVQYSPPPPSKR 1291
Query: 378 KKVGLSLKNDEEKSSGKEKKKSMLGKVKKEKKK 410
+ + + K+ KK + G + KKK
Sbjct: 1292 PDGESNGGSKPSSPTKKKVKKRLEGSLAALKKK 1324
>gnl|CDD|217927 pfam04147, Nop14, Nop14-like family. Emg1 and Nop14 are novel
proteins whose interaction is required for the
maturation of the 18S rRNA and for 40S ribosome
production.
Length = 809
Score = 33.8 bits (78), Expect = 0.20
Identities = 25/93 (26%), Positives = 42/93 (45%), Gaps = 4/93 (4%)
Query: 244 KLKIEGVKENEGEGQDKDSE---KENAEDVNDHGSKKKLKKKKGRIHEVRYMDSTIGEVL 300
KL+ E ++ GE +D + E KE+A+D++D + G + V
Sbjct: 273 KLEAERLRRMRGEEEDDEEEEDSKESADDLDD-EFEPDDDDNFGLGQGEEDEEEEEDGVD 331
Query: 301 DEDELGSDDDGEGDVGESEDGEDSEDGKMSSGD 333
DEDE DDD E + + + ++ ED + D
Sbjct: 332 DEDEEDDDDDLEEEEEDVDLSDEEEDEEDEDSD 364
>gnl|CDD|218737 pfam05764, YL1, YL1 nuclear protein. The proteins in this family
are designated YL1. These proteins have been shown to be
DNA-binding and may be a transcription factor.
Length = 238
Score = 32.7 bits (75), Expect = 0.27
Identities = 25/117 (21%), Positives = 49/117 (41%), Gaps = 8/117 (6%)
Query: 301 DEDELGSDDDGEGDVGESEDGEDSEDGKMSSGDILGKKRKKGDKERAQKLPKKVAKVKKD 360
+E+E D D + + + +D E+G+ K+KK K +A K P K K KKD
Sbjct: 48 EEEEEEVDSDFDDSEDDEPESDDEEEGEKELQREERLKKKKRVKTKAYKEPTK-KKKKKD 106
Query: 361 ELSSDMKNEDVGKQKKQKKVGLSLKNDEEK-------SSGKEKKKSMLGKVKKEKKK 410
++ + KK+ + + SS + K++ ++K+ + +
Sbjct: 107 PTAAKSPKAAAPRPKKKSERISWAPTLLDSPRRKSSRSSTVQNKEATHERLKEREIR 163
>gnl|CDD|220303 pfam09592, DUF2031, Protein of unknown function (DUF2031). This
protein is expressed in Plasmodium; its function is
unknown. It may be the product of gene family pyst-b.
Length = 228
Score = 32.3 bits (74), Expect = 0.35
Identities = 22/55 (40%), Positives = 33/55 (60%), Gaps = 1/55 (1%)
Query: 357 VKKDELSSDMKNEDVGKQKKQKKVGLSLKNDEEKSSGKEKKKSMLGKVKKEKKKL 411
+KKDE SS ++ED KQ + ++ L + +E SS KK + K+KK +KKL
Sbjct: 147 IKKDENSSVSEHEDF-KQLENEENFLETEYNEITSSNNYKKLKINRKLKKAEKKL 200
>gnl|CDD|217203 pfam02724, CDC45, CDC45-like protein. CDC45 is an essential gene
required for initiation of DNA replication in S.
cerevisiae, forming a complex with MCM5/CDC46.
Homologues of CDC45 have been identified in human, mouse
and smut fungus among others.
Length = 583
Score = 31.9 bits (73), Expect = 0.67
Identities = 8/60 (13%), Positives = 27/60 (45%), Gaps = 1/60 (1%)
Query: 298 EVLDEDELGSDDDGEGDVGESEDGEDSEDGKMSSGDILGKKRKKGDKERAQKLPKKVAKV 357
++ ++D+ + D E + + ++ +D DI ++R ++ R ++ ++
Sbjct: 122 DLEEDDDDDEESDEEDEESSKSEDDEDDDDDDDDDDIATRERSL-ERRRRRREWEEKRAE 180
>gnl|CDD|219293 pfam07093, SGT1, SGT1 protein. This family consists of several
eukaryotic SGT1 proteins. Human SGT1 or hSGT1 is known
to suppress GCR2 and is highly expressed in the muscle
and heart. The function of this family is unknown
although it has been speculated that SGT1 may be
functionally analogous to the Gcr2p protein of
Saccharomyces cerevisiae which is known to be a
regulatory factor of glycolytic gene expression.
Length = 557
Score = 32.0 bits (73), Expect = 0.70
Identities = 18/109 (16%), Positives = 35/109 (32%), Gaps = 7/109 (6%)
Query: 248 EGVKENEGEGQDKDSEKENAEDVNDHGS--KKKLKKKKGRIHEVRYMDSTIGEVLDEDEL 305
++ + + D +K+ + D ++ K L K I DS + D+++
Sbjct: 437 ADDEDEDDDEPDDSEDKDVSFDEDEFFEFLKNMLGLKDDEIDNDLPDDSDDADEDDDEDD 496
Query: 306 GSDDDGEGDVGESEDGE-----DSEDGKMSSGDILGKKRKKGDKERAQK 349
D+D D E E D+E + S +
Sbjct: 497 DEDEDSSSDSTLEELEEYMDQMDAELKQTDSSNNADISNSGSSGAEDDD 545
>gnl|CDD|226568 COG4083, COG4083, Predicted membrane protein [Function unknown].
Length = 239
Score = 30.9 bits (70), Expect = 0.92
Identities = 17/43 (39%), Positives = 24/43 (55%), Gaps = 2/43 (4%)
Query: 217 IAENVIAAINGL--LESLALPVYQAVPDLKLKIEGVKENEGEG 257
IA NVIA + L L +A V++ VP+L+ KI + GE
Sbjct: 195 IAHNVIAQLGSLAALILIAYAVFKIVPELEDKIRELIPWLGEN 237
>gnl|CDD|225661 COG3119, AslA, Arylsulfatase A and related enzymes [Inorganic ion
transport and metabolism].
Length = 475
Score = 31.3 bits (71), Expect = 0.92
Identities = 14/45 (31%), Positives = 21/45 (46%), Gaps = 6/45 (13%)
Query: 272 DHGSKKKLKKKKGRIHE-----VRYMDSTIGEVLDE-DELGSDDD 310
G + ++ R+ VRY+D IG +LD ELG D+
Sbjct: 244 ADGLPAEEEEDGARLMTVYAACVRYLDDQIGRLLDALKELGLLDN 288
>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
Length = 413
Score = 31.2 bits (71), Expect = 1.1
Identities = 19/73 (26%), Positives = 30/73 (41%)
Query: 340 KKGDKERAQKLPKKVAKVKKDELSSDMKNEDVGKQKKQKKVGLSLKNDEEKSSGKEKKKS 399
+E + K KK++ +++ K + K+KK+K+ E K K KKS
Sbjct: 40 STFSEEENKVATTSTKKDKKEDKNNESKKKSEKKKKKKKEKKEPKSEGETKLGFKTPKKS 99
Query: 400 MLGKVKKEKKKLV 412
K K K K
Sbjct: 100 KKTKKKPPKPKPN 112
>gnl|CDD|179786 PRK04207, PRK04207, glyceraldehyde-3-phosphate dehydrogenase;
Provisional.
Length = 341
Score = 31.0 bits (71), Expect = 1.1
Identities = 17/69 (24%), Positives = 27/69 (39%), Gaps = 18/69 (26%)
Query: 105 DAVMKKIKNDNLPITKVIKITKLKTDYRPFEAKRKLCDSYDIFLADKRVVPLLPKLLGKH 164
DAV + P +++ + K K DY A K Y +++AD
Sbjct: 18 DAVAAQ------PDMELVGVAKTKPDYEARVAVEK---GYPLYVADPEREKA-------- 60
Query: 165 FFKKKKIPV 173
F++ IPV
Sbjct: 61 -FEEAGIPV 68
>gnl|CDD|221237 pfam11810, DUF3332, Domain of unknown function (DUF3332). This
family of proteins are functionally uncharacterized.
This family is only found in bacteria.
Length = 175
Score = 30.0 bits (68), Expect = 1.3
Identities = 16/71 (22%), Positives = 21/71 (29%), Gaps = 5/71 (7%)
Query: 242 DLKLKIEGVKENEGEGQDKD----SEKENAEDVNDH-GSKKKLKKKKGRIHEVRYMDSTI 296
D L VK EG+ E V + GS L K Y+D +
Sbjct: 100 DKSLTEAPVKITNREGEGATVEQIDENTLQMWVTYNDGSSATLLGMKEDGTVDFYLDGEL 159
Query: 297 GEVLDEDELGS 307
V L +
Sbjct: 160 VTVTSLQGLEA 170
>gnl|CDD|217943 pfam04180, LTV, Low temperature viability protein. The
low-temperature viability protein LTV1 is involved in
ribosome biogenesis 40S subunit production.
Length = 426
Score = 30.7 bits (69), Expect = 1.5
Identities = 28/162 (17%), Positives = 51/162 (31%), Gaps = 16/162 (9%)
Query: 241 PDLKLKIEGVKENEGEGQDKDSEKENAEDVNDHGSKKKLKKKKGRIHEVRYMDSTIGEVL 300
P L+ +E +++ D++SE+++ E+ + E D + L
Sbjct: 157 PRLREVLEALEDEAYVRNDEESEEDDGEEWEEMDD------------EEGSDDRSSAGFL 204
Query: 301 DE--DELGSDDDGEGDVGESEDGEDSEDGKMSSGDILGKKRKKGDKERAQKLPKKVAKVK 358
DE D E E++E + R+ D + KK K
Sbjct: 205 DEFEDLSAPGSPQEAMKKHLFVEEETESRNTEYSETSEVMRRNEDWTLHFERFKKFEKKN 264
Query: 359 KDELSSDMKNEDVGKQKKQKKVGL--SLKNDEEKSSGKEKKK 398
D N ++ + L L + + K K K
Sbjct: 265 DDVEIGAADNSELEGSIEVDSNRLQEVLNDYYKTEKAKSKVK 306
>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
(vWA) domain [General function prediction only].
Length = 4600
Score = 31.1 bits (70), Expect = 1.6
Identities = 27/105 (25%), Positives = 45/105 (42%), Gaps = 7/105 (6%)
Query: 298 EVLDEDELGSDDDGEGDVGESEDGEDSEDGKMSSGDILGKKRKKGD-KERAQKLPKKVAK 356
+ L+E D + D+ ESE E D + D + + + D +E Q L ++V
Sbjct: 3835 DDLEELANEEDTANQSDLDESEARELESDMNGVTKDSVVSENENSDSEEENQDLDEEVND 3894
Query: 357 VKKDELSS------DMKNEDVGKQKKQKKVGLSLKNDEEKSSGKE 395
+ +D +S D NE+ + +QK S N+E KE
Sbjct: 3895 IPEDLSNSLNEKLWDEPNEEDLLETEQKSNEQSAANNESDLVSKE 3939
Score = 30.0 bits (67), Expect = 3.5
Identities = 29/152 (19%), Positives = 58/152 (38%), Gaps = 26/152 (17%)
Query: 248 EGVKENEGE-----GQDKDSEKENAEDVN---------DHGSKKKLKKKKGRIHEVRYMD 293
KE+E E G D + + + E+ + D KL +K+G + + ++
Sbjct: 3950 RQEKEDEEEMSDDVGIDDEIQPDIQENNSQPPPENEDLDLPEDLKLDEKEGDVSKDSDLE 4009
Query: 294 STIGEVLDEDELGSDDDG-----EGDVGESEDGEDSEDGKMSSGDIL---GKKRKKGDKE 345
E DE++ +D + + D E + D + + D+ K + G +E
Sbjct: 4010 DMDMEAADENKEEADAEKDEPMQDEDPLEENNTLDEDIQQDDFSDLAEDDEKMNEDGFEE 4069
Query: 346 RAQKLPKKVAKVKKDELSSDMKNEDVGKQKKQ 377
Q+ + +D + SD + E + Q
Sbjct: 4070 NVQENEEST----EDGVKSDEELEQGEVPEDQ 4097
>gnl|CDD|227492 COG5163, NOP7, Protein required for biogenesis of the 60S ribosomal
subunit [Translation, ribosomal structure and
biogenesis].
Length = 591
Score = 30.8 bits (69), Expect = 1.7
Identities = 32/149 (21%), Positives = 54/149 (36%), Gaps = 21/149 (14%)
Query: 262 SEKENAEDVNDHGSKKKLKKKKGRIHEVRYMDSTIGEVLDEDELGSDDDGEGDVGESEDG 321
E + ++ + + + D E+ + EL + G + SE
Sbjct: 463 MTMEETQRHSEEDLVNRFEDVRYEHVAGEEDDDDDEELQAQKELELEAQG---IKYSETS 519
Query: 322 EDSEDGKMSSGDILGKKRKKGDKERAQKLPKKVAKVKKDELSSDMKNEDVGKQKKQKKVG 381
E +D S KKRK ++E +KL + K+ +L MK + K
Sbjct: 520 EADKDVNKSKN----KKRKVDEEEEEKKLKMIMMSNKQKKLYKKMKYSNAKK-------- 567
Query: 382 LSLKNDEEKSSGKEKKKSMLGKVKKEKKK 410
EE++ +KKK + K KK K
Sbjct: 568 ------EEQAENLKKKKKQIAKQKKLDSK 590
>gnl|CDD|227458 COG5129, MAK16, Nuclear protein with HMG-like acidic region
[General function prediction only].
Length = 303
Score = 30.4 bits (68), Expect = 1.8
Identities = 28/103 (27%), Positives = 44/103 (42%), Gaps = 21/103 (20%)
Query: 251 KENEGEGQDKDSEKENA--EDVNDHGSKKKLKKKKGRIHEVRYMDSTIGEVLDEDELGSD 308
+++E E ++ E+ + E V D K+K KKK E LGSD
Sbjct: 196 RQDEKERYVEEEEESDTELEAVTDDSEKEKTKKKDL-----------------EKWLGSD 238
Query: 309 DDGEGDVGESEDGEDSEDGKMSSGDILG--KKRKKGDKERAQK 349
E E E+ +SE + D G +KRK D ++++K
Sbjct: 239 QSMETSESEEEESSESESDEDEDEDNKGKIRKRKTDDAKKSRK 281
>gnl|CDD|221490 pfam12253, CAF1A, Chromatin assembly factor 1 subunit A. The CAF-1
or chromatin assembly factor-1 consists of three
subunits, and this is the first, or A. The A domain is
uniquely required for the progression of S phase in
mouse cells, independent of its ability to promote
histone deposition but dependent on its ability to
interact with HP1 - heterochromatin protein 1-rich
heterochromatin domains next to centromeres that are
crucial for chromosome segregation during mitosis. This
HP1-CAF-1 interaction module functions as a built-in
replication control for heterochromatin, which, like a
control barrier, has an impact on S-phase progression in
addition to DNA-based checkpoints.
Length = 76
Score = 28.0 bits (63), Expect = 1.9
Identities = 11/27 (40%), Positives = 16/27 (59%)
Query: 301 DEDELGSDDDGEGDVGESEDGEDSEDG 327
+E+E G D + E + E ED +D DG
Sbjct: 49 EEEEEGEDLESEDEEDEEEDDDDDMDG 75
>gnl|CDD|180071 PRK05424, rplA, 50S ribosomal protein L1; Validated.
Length = 230
Score = 29.7 bits (68), Expect = 2.0
Identities = 10/21 (47%), Positives = 14/21 (66%)
Query: 206 KVGKVSMGAEDIAENVIAAIN 226
+GKVS AE + EN+ A I+
Sbjct: 173 PIGKVSFDAEKLKENLKALID 193
>gnl|CDD|235033 PRK02363, PRK02363, DNA-directed RNA polymerase subunit delta;
Reviewed.
Length = 129
Score = 28.8 bits (65), Expect = 2.3
Identities = 16/63 (25%), Positives = 28/63 (44%), Gaps = 8/63 (12%)
Query: 264 KENAEDVNDHGSKKKLKKKKGRIHEVRYMDSTIGEVLDEDELGSDDDGEGDVGESEDGED 323
E E++ K KKKK +++D+D L DD E D+ E +D ++
Sbjct: 74 DEIDEEIIPLEEKFDKKKKKF--------MDGDDDIIDDDILPDDDFDEEDLDEEDDEDE 125
Query: 324 SED 326
++
Sbjct: 126 EDE 128
>gnl|CDD|236231 PRK08308, PRK08308, acyl-CoA synthetase; Validated.
Length = 414
Score = 29.6 bits (67), Expect = 3.3
Identities = 12/37 (32%), Positives = 18/37 (48%), Gaps = 2/37 (5%)
Query: 63 PQVSRTNAFKIPLPHSLL--GNDSDNPPEICLIMDDR 97
P + PLPH + G+D + P EI + M D+
Sbjct: 255 PDMKSHLDLGNPLPHVSVSAGSDENAPEEIVVKMGDK 291
>gnl|CDD|215565 PLN03083, PLN03083, E3 UFM1-protein ligase 1 homolog; Provisional.
Length = 803
Score = 29.8 bits (67), Expect = 3.5
Identities = 20/88 (22%), Positives = 33/88 (37%), Gaps = 3/88 (3%)
Query: 312 EGDVGESEDGEDSEDGKMSSGDILGKKRKKGDKERAQKLPKKVAKVKKDELSSDMKNEDV 371
G +G SE S + ++ + +KK K + K + DE D +
Sbjct: 394 AGLIGSSEKSLGSNESSPAASNSDKGSKKKKGKSTSTK-GGTAESIPDDE--EDAPKKGK 450
Query: 372 GKQKKQKKVGLSLKNDEEKSSGKEKKKS 399
QKK + + +D + KE KS
Sbjct: 451 KNQKKGRDKSSKVPSDSKAGGKKESVKS 478
>gnl|CDD|235626 PRK05853, PRK05853, hypothetical protein; Validated.
Length = 161
Score = 28.6 bits (64), Expect = 3.7
Identities = 11/49 (22%), Positives = 19/49 (38%)
Query: 285 RIHEVRYMDSTIGEVLDEDELGSDDDGEGDVGESEDGEDSEDGKMSSGD 333
RI + RY + GE G+ D + ++ D D ++G
Sbjct: 106 RIEKPRYTGPSAGEAPAATGTGAADAADAPASAADSVSDVVDDDAATGH 154
>gnl|CDD|217830 pfam03986, Autophagy_N, Autophagocytosis associated protein (Atg3),
N-terminal domain. Autophagocytosis is a
starvation-induced process responsible for transport of
cytoplasmic proteins to the lysosome/vacuole. Atg3 is a
ubiquitin like modifier that is topologically similar to
the canonical E2 enzyme. It catalyzes the conjugation of
Atg8 and phosphatidylethanolamine.
Length = 146
Score = 28.5 bits (64), Expect = 3.9
Identities = 17/78 (21%), Positives = 33/78 (42%), Gaps = 12/78 (15%)
Query: 250 VKENE-GEGQDKDSEKENAEDVNDHGSKKKLKKKKGRIHEVRYMDSTIGEVLDEDELGSD 308
K+ E G+G ++ E E+ +D + K+ I + D E+G D
Sbjct: 79 AKQMEYGDGAEEIVEDEDEDDGWVTTHGNRDKQ-----------KDDIADEEDIPEIGDD 127
Query: 309 DDGEGDVGESEDGEDSED 326
DD D ++++ +D +
Sbjct: 128 DDDVVDSSDADEDDDDDI 145
>gnl|CDD|227693 COG5406, COG5406, Nucleosome binding factor SPN, SPT16 subunit
[Transcription / DNA replication, recombination, and
repair / Chromatin structure and dynamics].
Length = 1001
Score = 29.6 bits (66), Expect = 4.2
Identities = 16/50 (32%), Positives = 24/50 (48%), Gaps = 4/50 (8%)
Query: 293 DSTIGEVLDEDELGSDDDGEGDVGESEDGED--SEDGKMSSGDILGKKRK 340
+ + ED S+D+ E D + EDGED + K + GK+RK
Sbjct: 954 EDEESDESSED--LSEDESENDSSDEEDGEDWDELESKAAYDSRPGKRRK 1001
>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3. This
protein, which interacts with both microtubules and
TRAF3 (tumour necrosis factor receptor-associated factor
3), is conserved from worms to humans. The N-terminal
region is the microtubule binding domain and is
well-conserved; the C-terminal 100 residues, also
well-conserved, constitute the coiled-coil region which
binds to TRAF3. The central region of the protein is
rich in lysine and glutamic acid and carries KKE motifs
which may also be necessary for tubulin-binding, but
this region is the least well-conserved.
Length = 506
Score = 29.5 bits (66), Expect = 4.3
Identities = 17/74 (22%), Positives = 33/74 (44%), Gaps = 4/74 (5%)
Query: 337 KKRKKGDKERAQKLPKKVAKVKKDELSSDMKNEDVGKQKKQKKVGLSLKNDEEKSSGKEK 396
+++KK ++ ++ + K + E + E +K+KKV +EEK + +
Sbjct: 115 EEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEK----EKEKKVEEPRDREEEKKRERVR 170
Query: 397 KKSMLGKVKKEKKK 410
KS K K+K
Sbjct: 171 AKSRPKKPPKKKPP 184
>gnl|CDD|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457). This is
a family of uncharacterized proteins.
Length = 449
Score = 29.2 bits (65), Expect = 4.4
Identities = 16/67 (23%), Positives = 31/67 (46%)
Query: 260 KDSEKENAEDVNDHGSKKKLKKKKGRIHEVRYMDSTIGEVLDEDELGSDDDGEGDVGESE 319
+ + +ND K+ +K G+ E M+ + D+D+ +D+ + D + E
Sbjct: 18 GSLDYKEKLTLNDTMKKENAIRKLGKEAEEEAMEEEDDDEEDDDDDDDEDEDDDDDDDDE 77
Query: 320 DGEDSED 326
D ED +D
Sbjct: 78 DDEDEDD 84
>gnl|CDD|215590 PLN03123, PLN03123, poly [ADP-ribose] polymerase; Provisional.
Length = 981
Score = 29.0 bits (65), Expect = 6.1
Identities = 18/58 (31%), Positives = 26/58 (44%), Gaps = 4/58 (6%)
Query: 340 KKGDKERAQKLPKKVAKVKKDELSSDMKNEDVGKQKKQKKVGLSLKNDEEKSSGKEKK 397
K+ E ++ KK AK KKD ++ K K + V S ++KSS E K
Sbjct: 191 KEEKAEERKQESKKGAKRKKDA----SGDDKSKKAKTDRDVSTSTAASQKKSSDLESK 244
>gnl|CDD|131944 TIGR02898, spore_YhcN_YlaJ, sporulation lipoprotein, YhcN/YlaJ
family. YhcN and YlaJ are predicted lipoproteins that
have been detected as spore proteins but not vegetative
proteins in Bacillus subtilis. Both appear to be
expressed under control of the RNA polymerase sigma-G
factor. The YlaJ-like members of this family have a
low-complexity, strongly acidic 40-residue C-terminal
domain that is not included in the seed alignment for
this model. A portion of the low-complexity region
between the lipoprotein signal sequence and the main
conserved region of the protein family was also excised
from the seed alignment [Cellular processes, Sporulation
and germination].
Length = 158
Score = 28.1 bits (63), Expect = 6.1
Identities = 24/99 (24%), Positives = 41/99 (41%), Gaps = 12/99 (12%)
Query: 314 DVGESEDGEDSEDGKMSSGDILGKKRKKGDK-ERAQKLPKKVAKVKKDELSSDMKNEDVG 372
+VGE+ + +G ++ D GD + A ++ + AKVK +K+ V
Sbjct: 27 NVGETNVMSKNNNGMNTTND-----TNDGDLYDVADEIASEAAKVK------GVKDATVV 75
Query: 373 KQKKQKKVGLSLKNDEEKSSGKEKKKSMLGKVKKEKKKL 411
VG+ L N E S E K+ + VK ++
Sbjct: 76 ITGNYAYVGVDLTNGLEGSVTDELKEKVAETVKSTDNRI 114
>gnl|CDD|148051 pfam06213, CobT, Cobalamin biosynthesis protein CobT. This family
consists of several bacterial cobalamin biosynthesis
(CobT) proteins. CobT is involved in the transformation
of precorrin-3 into cobyrinic acid.
Length = 282
Score = 28.2 bits (63), Expect = 7.2
Identities = 8/33 (24%), Positives = 16/33 (48%)
Query: 301 DEDELGSDDDGEGDVGESEDGEDSEDGKMSSGD 333
DED+ +++ G + D+ +M SG+
Sbjct: 233 DEDDDQGEEEESGSSDSLSEDSDASSEEMESGE 265
>gnl|CDD|218538 pfam05285, SDA1, SDA1. This family consists of several SDA1
protein homologues. SDA1 is a Saccharomyces cerevisiae
protein which is involved in the control of the actin
cytoskeleton. The protein is essential for cell
viability and is localised in the nucleus.
Length = 317
Score = 28.1 bits (63), Expect = 7.8
Identities = 20/106 (18%), Positives = 34/106 (32%), Gaps = 18/106 (16%)
Query: 301 DEDELGSDDDGEGDVGESEDGEDSEDGKMSSGDILGKKRKKGDKERAQKLPKKVAKVKKD 360
E E G + D + D E + E+ ED I V K+
Sbjct: 86 KEAEQGLESDDDDDEEEEWEVEEDEDSDDEGEWI------------------DVESDKEI 127
Query: 361 ELSSDMKNEDVGKQKKQKKVGLSLKNDEEKSSGKEKKKSMLGKVKK 406
E S E+ + K+ K + EE +++ + +K
Sbjct: 128 ESSDSEDEEEKDEAAKKAKEDSDEELSEEDEEEAAEEEEAEAEKEK 173
>gnl|CDD|219240 pfam06957, COPI_C, Coatomer (COPI) alpha subunit C-terminus. This
family represents the C-terminus (approximately 500
residues) of the eukaryotic coatomer alpha subunit.
Coatomer (COPI) is a large cytosolic protein complex
which forms a coat around vesicles budding from the
Golgi apparatus. Such coatomer-coated vesicles have been
proposed to play a role in many distinct steps of
intracellular transport. Note that many family members
also contain the pfam04053 domain.
Length = 421
Score = 28.3 bits (63), Expect = 8.0
Identities = 12/28 (42%), Positives = 14/28 (50%)
Query: 300 LDEDELGSDDDGEGDVGESEDGEDSEDG 327
LDED +DG D E +G D E G
Sbjct: 49 LDEDGYLEGEDGLLDDEEGPEGGDEEGG 76
>gnl|CDD|216546 pfam01516, Orbi_VP6, Orbivirus helicase VP6. The VP6 protein a
minor protein in the core of the virion is probably the
viral helicase.
Length = 322
Score = 28.1 bits (62), Expect = 8.1
Identities = 21/79 (26%), Positives = 30/79 (37%), Gaps = 5/79 (6%)
Query: 248 EGVKENEGEGQDKDSEKENAEDVNDHGSKKKLKKKK--GRIH-EVRYMDST--IGEVLDE 302
E E+ E + D E E + S K+ + R H EV S GE D+
Sbjct: 37 EPKDEDGQEQRISDGEGEQKQKGGKEESDKETEDASVDRRTHTEVGSGSSAKGSGERADK 96
Query: 303 DELGSDDDGEGDVGESEDG 321
G D G G+++ G
Sbjct: 97 HADGGDGKTRGGGGDADKG 115
>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
This is a family of fungal proteins of unknown function.
Length = 182
Score = 27.7 bits (62), Expect = 8.5
Identities = 18/76 (23%), Positives = 35/76 (46%), Gaps = 7/76 (9%)
Query: 337 KKRKKGDKERAQKLPKKVAKVK--KDELSSDMKNEDVGKQKKQKKVGLSLKNDEEKSSGK 394
KK+KK E +K+ K+ + + K + K +D K KK K K+D+ + +
Sbjct: 61 KKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKKKKDKDKDKKDDK-----KDDKSEKKDE 115
Query: 395 EKKKSMLGKVKKEKKK 410
++ + L + K +
Sbjct: 116 KEAEDKLEDLTKSYSE 131
>gnl|CDD|215521 PLN02967, PLN02967, kinase.
Length = 581
Score = 28.5 bits (63), Expect = 8.8
Identities = 19/87 (21%), Positives = 34/87 (39%)
Query: 323 DSEDGKMSSGDILGKKRKKGDKERAQKLPKKVAKVKKDELSSDMKNEDVGKQKKQKKVGL 382
+ D + + KK+ +RA K +E S + NED K+ KK
Sbjct: 56 LAVDEEPDENGAVSKKKPTRSVKRATKKTVVEISEPLEEGSELVVNEDAALDKESKKTPR 115
Query: 383 SLKNDEEKSSGKEKKKSMLGKVKKEKK 409
+ +S +++ KV+K +K
Sbjct: 116 RTRRKAAAASSDVEEEKTEKKVRKRRK 142
>gnl|CDD|226920 COG4547, CobT, Cobalamin biosynthesis protein CobT
(nicotinate-mononucleotide:5, 6-dimethylbenzimidazole
phosphoribosyltransferase) [Coenzyme metabolism].
Length = 620
Score = 28.3 bits (63), Expect = 8.8
Identities = 17/72 (23%), Positives = 34/72 (47%), Gaps = 5/72 (6%)
Query: 259 DKDSEKENAEDVNDHGSKKKLKKKKGRIHEVRYMDSTIG---EVLDEDELGSDDDGEGDV 315
+ E + ED++ ++ ++ R+ VR M ++ E D+ D+ +GD
Sbjct: 181 RDEIEDKAGEDLDGLAAEIDDQQAFARV--VRDMLGSMDMAEETGDDGIEEDADEEDGDD 238
Query: 316 GESEDGEDSEDG 327
+ ++ EDSE G
Sbjct: 239 DQPDNNEDSEAG 250
>gnl|CDD|218747 pfam05786, Cnd2, Condensin complex subunit 2. This family consists
of several Barren protein homologues from several
eukaryotic organisms. In Drosophila Barren (barr) is
required for sister-chromatid segregation in mitosis.
barr encodes a novel protein that is present in
proliferating cells and has homologues in yeast and
human. Mitotic defects in barr embryos become apparent
during cycle 16, resulting in a loss of PNS and CNS
neurons. Centromeres move apart at the
metaphase-anaphase transition and Cyclin B is degraded,
but sister chromatids remain connected, resulting in
chromatin bridging. Barren protein localises to
chromatin throughout mitosis. Colocalisation and
biochemical experiments indicate that Barren associates
with Topoisomerase II throughout mitosis and alters the
activity of Topoisomerase II. It has been suggested that
this association is required for proper chromosomal
segregation by facilitating the decatenation of
chromatids at anaphase. This family forms one of the
three non-structural maintenance of chromosomes (SMC)
subunits of the mitotic condensation complex along with
Cnd1 and Cnd3.
Length = 719
Score = 28.2 bits (63), Expect = 8.9
Identities = 15/39 (38%), Positives = 20/39 (51%)
Query: 251 KENEGEGQDKDSEKENAEDVNDHGSKKKLKKKKGRIHEV 289
E EG D + + E D + G+KKK KKK+ R E
Sbjct: 158 PETGREGDDGEDDDEEGSDGEEDGAKKKAKKKRQRKPES 196
>gnl|CDD|215633 PLN03210, PLN03210, Resistant to P. syringae 6; Provisional.
Length = 1153
Score = 28.3 bits (63), Expect = 9.4
Identities = 12/51 (23%), Positives = 25/51 (49%), Gaps = 5/51 (9%)
Query: 32 LKWLKSNSQTQKPQLLEQDDFVYLILTLKKIP-----QVSRTNAFKIPLPH 77
+ ++ + Q+ L +Q F LIL+ +++P + + + IPL H
Sbjct: 935 INFINCFNLDQEALLQQQSIFKQLILSGEEVPSYFTHRTTGASLTNIPLLH 985
>gnl|CDD|234340 TIGR03744, traC_PFL_4706, conjugative transfer ATPase, PFL_4706
family. Members of this protein family are predicted
ATP-binding proteins apparently associated with DNA
conjugal transfer. Members are found both in plasmids
and in bacterial chromosomal regions that appear to
derive from integrative elements such as conjugative
transposons. More distant homologs, outside the scope of
this family, include type IV secretion/conjugal transfer
proteins such as TraC, VirB4 and TrsE. The granularity
of this protein family definition is chosen so as to
represent one distinctive clade and act as a marker
through which to define and recognize the class of
mobile element it serves [Mobile and extrachromosomal
element functions, Plasmid functions].
Length = 893
Score = 28.5 bits (64), Expect = 9.7
Identities = 19/58 (32%), Positives = 28/58 (48%), Gaps = 11/58 (18%)
Query: 287 HEVRYMDSTIGEVLDEDELGSDDDGEG--DV-GESE--------DGEDSEDGKMSSGD 333
+V+ +D+ E D D+ DD+ E D+ GE E GE+ ED +MS D
Sbjct: 557 EDVKALDADELEEEDADDEDPDDEDEEQRDILGEMEIAARLMITGGEEKEDARMSRAD 614
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.309 0.131 0.360
Gapped
Lambda K H
0.267 0.0710 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 20,759,704
Number of extensions: 2069838
Number of successful extensions: 2922
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2489
Number of HSP's successfully gapped: 240
Length of query: 413
Length of database: 10,937,602
Length adjustment: 99
Effective length of query: 314
Effective length of database: 6,546,556
Effective search space: 2055618584
Effective search space used: 2055618584
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.7 bits)
S2: 60 (26.9 bits)