RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy9550
(271 letters)
>gnl|CDD|217803 pfam03938, OmpH, Outer membrane protein (OmpH-like). This family
includes outer membrane proteins such as OmpH among
others. Skp (OmpH) has been characterized as a molecular
chaperone that interacts with unfolded proteins as they
emerge in the periplasm from the Sec translocation
machinery.
Length = 157
Score = 44.5 bits (106), Expect = 9e-06
Identities = 20/88 (22%), Positives = 44/88 (50%), Gaps = 7/88 (7%)
Query: 183 YKKLYEESQAELSRLRETLSKTEEELNQVKATYSKN--SLSEQEKRERRTMERKLSEMEV 240
K ++ + E +L+ L K E+EL + + K +LSE+ ++ + +++L + +
Sbjct: 33 GKAAQKQLEKEFKKLQAELQKKEKELQKEEQKLQKQAATLSEEARKAK---QQELQQKQQ 89
Query: 241 ELKELDQFKKENQRLKDENGALIRVISK 268
EL++ Q Q L+ + L++ I
Sbjct: 90 ELQQKQQ--AAQQELQQKQQELLQPIYD 115
>gnl|CDD|214922 smart00935, OmpH, Outer membrane protein (OmpH-like). This family
includes outer membrane proteins such as OmpH among
others. Skp (OmpH) has been characterized as a molecular
chaperone that interacts with unfolded proteins as they
emerge in the periplasm from the Sec translocation
machinery.
Length = 140
Score = 44.1 bits (105), Expect = 9e-06
Identities = 19/98 (19%), Positives = 47/98 (47%), Gaps = 11/98 (11%)
Query: 182 DYKKLYEESQAELSRLRETLSKTEEELNQVKATYSKN--SLSEQEKRERRTMERKLSEME 239
K ++ + E + + L K E+EL ++K K+ +LSE + ++ E++L +
Sbjct: 15 AGKAAQKQLEKEFKKRQAELEKLEKELQKLKEKLQKDAATLSEAAREKK---EKELQKKV 71
Query: 240 VELKEL-DQFKKENQRLKDEN-----GALIRVISKLSK 271
E + + +++ Q+ + E + + I +++K
Sbjct: 72 QEFQRKQQKLQQDLQKRQQEELQKILDKINKAIKEVAK 109
>gnl|CDD|223039 PHA03307, PHA03307, transcriptional regulator ICP4; Provisional.
Length = 1352
Score = 45.5 bits (108), Expect = 2e-05
Identities = 43/177 (24%), Positives = 63/177 (35%), Gaps = 6/177 (3%)
Query: 4 STPPRRSSGRTTSRPRSLYSSSPSPQSRNLSSYSSGLSSSTPYLNSSYPSYGSTYGGTS- 62
STPP +S R R + +S+ SP S + +S +SS S G +G +
Sbjct: 196 STPPAAASPRPPRRSSPISASASSPAPAPGRSAADDAGAS-SSDSSSSESSGCGWGPENE 254
Query: 63 ---SGYGGLTLPSSYSRPSQSSPRSYHAPPSSYRSSYAPTTTPLPPVRPRRSKSSTRSAN 119
+TLP+ S + S P+S SS + P P + +
Sbjct: 255 CPLPRPAPITLPTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRA 314
Query: 120 YTPEGGGRMSRSGSTTSLNSLRSEAPSEGYESGGDNLSTSRSRVNSESGGSSRKLPN 176
+ R S S ST+S + A S + S SR S RK P
Sbjct: 315 SSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPP-PPADPSSPRKRPR 370
Score = 37.8 bits (88), Expect = 0.005
Identities = 40/153 (26%), Positives = 53/153 (34%), Gaps = 10/153 (6%)
Query: 2 SYSTPPRRSSGRTT-SRPRSLYSSSPSPQSRNLSSYSSGLSSSTPYLNSSYPSYGSTYGG 60
S S+ PR S + S P S + S S + SS SSST +SS S G+
Sbjct: 286 SSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSSST--SSSSESSRGA---A 340
Query: 61 TSSGYGGLTLPSSYSRPSQSSPRSYHAPPSSYRSSYAPTTTPLPPVRP-RRSKSSTRSAN 119
S G PS SRP + S +P R S AP++ RP RR + +
Sbjct: 341 VSPGPSPSRSPSP-SRPPPPADPS--SPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGR 397
Query: 120 YTPEGGGRMSRSGSTTSLNSLRSEAPSEGYESG 152
+G A Y
Sbjct: 398 ARRRDATGRFPAGRPRPSPLDAGAASGAFYARY 430
Score = 35.9 bits (83), Expect = 0.021
Identities = 31/142 (21%), Positives = 47/142 (33%), Gaps = 3/142 (2%)
Query: 5 TPPRRSSGRTTSRPRSLYSSSPSPQSRNLSSYSSGLSSSTPYLNSSYPSYGSTYGGTSSG 64
T +SG R +SS S S S S + SS + S+ S
Sbjct: 267 TRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPS-SPGSGPAPSSPRASSSSSSSRESS 325
Query: 65 YGGLTLPSSYSRPSQSSPRSYHAPPSSYRSSYAPTTTPLPPVRPRRSKSSTRSANYTPEG 124
+ S SR + SP + S P P RPR S++ + A G
Sbjct: 326 SSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSSPA--ASAG 383
Query: 125 GGRMSRSGSTTSLNSLRSEAPS 146
R+ + + + R +A
Sbjct: 384 RPTRRRARAAVAGRARRRDATG 405
Score = 31.7 bits (72), Expect = 0.49
Identities = 36/167 (21%), Positives = 50/167 (29%), Gaps = 7/167 (4%)
Query: 1 MSYSTPPRRSSGRTTSRPRSLYSSSPSPQSRNLSSYSSGLSSSTPYLNSSYPSYGSTYGG 60
S S G P + P +R + SS P SS S
Sbjct: 239 SSSSESSGCGWGPENECPLPRPAPITLP-TRIWEASGWNGPSSRPGPASSSSSPRERSPS 297
Query: 61 TSSGYGGLTLPSSYSRPSQSSPRSYHAPPSSYRSSY------APTTTPLPPVRPRRSKSS 114
S G S R S SS S + SS SS A + P P P S+
Sbjct: 298 PSPSSPGSGPAPSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPP 357
Query: 115 TRSANYTPEGGGRMSRSGSTTSLNSLRSEAPSEGYESGGDNLSTSRS 161
+ +P R SR+ S+ + ++ R G +
Sbjct: 358 PPADPSSPRKRPRPSRAPSSPAASAGRPTRRRARAAVAGRARRRDAT 404
Score = 29.4 bits (66), Expect = 2.3
Identities = 34/152 (22%), Positives = 50/152 (32%), Gaps = 6/152 (3%)
Query: 1 MSYSTPPRRSSGRTTSRPRSLYSSSPSPQSRNLSSYSSGLSSSTPYLNSSYPSYGSTYGG 60
TPP SS S PSP LS + S P +S P+ G++
Sbjct: 103 EGSPTPPGPSSPDPPPPTPPPASPPPSPAPD-LSEMLRPVGSPGPPPAASPPAAGASPAA 161
Query: 61 TSSGYGGLTLPSSYSRPSQSSPRSYHAPPSSYRSSYAPTTTPLPPVRPRRSKSSTRSANY 120
+S + P S + AP S + P +TP PR + S+ +
Sbjct: 162 VASDAAS---SRQAALPLSSPEETARAPSSP--PAEPPPSTPPAAASPRPPRRSSPISAS 216
Query: 121 TPEGGGRMSRSGSTTSLNSLRSEAPSEGYESG 152
RS + + S + SE G
Sbjct: 217 ASSPAPAPGRSAADDAGASSSDSSSSESSGCG 248
>gnl|CDD|222417 pfam13851, GAS, Growth-arrest specific micro-tubule binding. This
family is the highly conserved central region of a
number of metazoan proteins referred to as growth-arrest
proteins. In mouse, Gas8 is predominantly a testicular
protein, whose expression is developmentally regulated
during puberty and spermatogenesis. In humans, it is
absent in infertile males who lack the ability to
generate gametes. The localisation of Gas8 in the
motility apparatus of post-meiotic gametocytes and
mature spermatozoa, together with the detection of Gas8
also in cilia at the apical surfaces of epithelial cells
lining the pulmonary bronchi and Fallopian tubes
suggests that the Gas8 protein may have a role in the
functioning of motile cellular appendages. Gas8 is a
microtubule-binding protein localised to regions of
dynein regulation in mammalian cells.
Length = 201
Score = 44.1 bits (105), Expect = 2e-05
Identities = 30/93 (32%), Positives = 44/93 (47%), Gaps = 8/93 (8%)
Query: 180 ELDYKKLYEESQAELSRLRETLSKTEEE---LNQVKATYSKNSLSEQEKRERRTM-ERKL 235
E +KL E E RL E L K EEE L + Y K+ S + + R E++L
Sbjct: 43 EEHNEKLMAEIAQENKRLVEPLKKAEEEVEELRKKLKDYEKDKQSLKNLKARLKELEKEL 102
Query: 236 SEMEVELKELDQ-FKK---ENQRLKDENGALIR 264
++ E + L+Q F+K E L D+ A I+
Sbjct: 103 KNLKWESEVLEQRFEKVERERDELYDKFEAAIQ 135
>gnl|CDD|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding
[General function prediction only].
Length = 239
Score = 43.5 bits (103), Expect = 4e-05
Identities = 22/87 (25%), Positives = 39/87 (44%)
Query: 183 YKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVEL 242
+K ++++AEL L + L E EL ++ S+ QE RER + +
Sbjct: 29 IRKALKKAKAELEALNKALEALEIELEDLENQVSQLESEIQEIRERIKRAEEKLSAVKDE 88
Query: 243 KELDQFKKENQRLKDENGALIRVISKL 269
+EL E Q K+ +L +++L
Sbjct: 89 RELRALNIEIQIAKERINSLEDELAEL 115
Score = 36.2 bits (84), Expect = 0.010
Identities = 18/81 (22%), Positives = 43/81 (53%), Gaps = 4/81 (4%)
Query: 182 DYKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLS---EQEKRERRTMERKLSEM 238
D + + ++E+ +RE + + EE+L+ VK +L+ + K ++E +L+E+
Sbjct: 56 DLENQVSQLESEIQEIRERIKRAEEKLSAVKDERELRALNIEIQIAKERINSLEDELAEL 115
Query: 239 EVELKEL-DQFKKENQRLKDE 258
E+++L + + +RL+
Sbjct: 116 MEEIEKLEKEIEDLKERLERL 136
Score = 29.3 bits (66), Expect = 2.0
Identities = 17/84 (20%), Positives = 36/84 (42%), Gaps = 8/84 (9%)
Query: 185 KLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELKE 244
+ E+ + E+ L+E L + E+ L + +A L E+ R + S+ E ++
Sbjct: 117 EEIEKLEKEIEDLKERLERLEKNLAEAEA-----RLEEEVAEIREEGQELSSKREELKEK 171
Query: 245 LDQ---FKKENQRLKDENGALIRV 265
LD + E R + ++ +
Sbjct: 172 LDPELLSEYERIRKNKKGVGVVPL 195
>gnl|CDD|217817 pfam03961, DUF342, Protein of unknown function (DUF342). This
family of bacterial proteins has no known function. The
proteins are in the region of 500-600 amino acid
residues in length.
Length = 450
Score = 40.7 bits (96), Expect = 5e-04
Identities = 23/79 (29%), Positives = 35/79 (44%), Gaps = 10/79 (12%)
Query: 191 QAELSRLRETLSKTEEELNQVKATYSKNS----------LSEQEKRERRTMERKLSEMEV 240
+ EL L E L + EEEL ++K K EQ ++ T E+ E+E
Sbjct: 333 KEELKELEEELKELEEELEKIKKLLKKLPKKARGQLPPEKREQLEKLLETKEKLSEELEE 392
Query: 241 ELKELDQFKKENQRLKDEN 259
+EL + K+E + L E
Sbjct: 393 LEEELKELKEELESLYSEG 411
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 40.1 bits (94), Expect = 0.001
Identities = 22/84 (26%), Positives = 41/84 (48%), Gaps = 3/84 (3%)
Query: 182 DYKKLYEESQAELSRLRETLSKTEEELNQVKATY--SKNSLSEQEKRERRTMERKLSEME 239
+ ++ EE Q EL L+E + + E E++ ++ +N L E E+R E+ + E
Sbjct: 278 ELREELEELQEELLELKEEIEELEGEISLLRERLEELENELEELEERLEELKEKIEALKE 337
Query: 240 VELKELDQFKKENQRLKDENGALI 263
EL+E + +E ++L E
Sbjct: 338 -ELEERETLLEELEQLLAELEEAK 360
Score = 38.9 bits (91), Expect = 0.002
Identities = 32/111 (28%), Positives = 54/111 (48%), Gaps = 11/111 (9%)
Query: 167 SGGSSRKLPNENGELDYKKLYEESQAELSRLRETLSKTEEELNQVKA--TYSKNSLSEQE 224
+GGS K L K+ +E + EL+ L L K EEEL +K ++ L E
Sbjct: 653 TGGSRNK----RSSLAQKRELKELEEELAELEAQLEKLEEELKSLKNELRSLEDLLEE-L 707
Query: 225 KRERRTMERKLSEMEVELKELD----QFKKENQRLKDENGALIRVISKLSK 271
+R+ +ER+L E++ EL L+ Q + + L++E L + +L +
Sbjct: 708 RRQLEELERQLEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQE 758
Score = 38.5 bits (90), Expect = 0.003
Identities = 29/97 (29%), Positives = 53/97 (54%), Gaps = 6/97 (6%)
Query: 179 GELDYKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQ-EKRER-RTMERKLS 236
G YK+ EE++ +L R E L + E+ L +++ K L Q EK ER + ++ +L
Sbjct: 166 GVSKYKERKEEAERKLERTEENLERLEDLLEELEKQLEK--LERQAEKAERYQELKAELR 223
Query: 237 EMEVEL--KELDQFKKENQRLKDENGALIRVISKLSK 271
E+E+ L +L + +KE + L++E L + +L +
Sbjct: 224 ELELALLLAKLKELRKELEELEEELSRLEEELEELQE 260
Score = 36.2 bits (84), Expect = 0.015
Identities = 22/105 (20%), Positives = 42/105 (40%), Gaps = 12/105 (11%)
Query: 177 ENGELDYKKLYEESQAELSRLRETLSKTEEELNQVKATY------------SKNSLSEQE 224
E + ++ EE Q L L E L EE L ++K L E+
Sbjct: 743 EEELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELEELEEEL 802
Query: 225 KRERRTMERKLSEMEVELKELDQFKKENQRLKDENGALIRVISKL 269
+ R ++ E+E + ++ ++E + L++E L + +L
Sbjct: 803 EEAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDEL 847
Score = 35.8 bits (83), Expect = 0.022
Identities = 17/87 (19%), Positives = 35/87 (40%), Gaps = 8/87 (9%)
Query: 184 KKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELK 243
EE++ EL L + EEL + +L E+ + +E+E +
Sbjct: 353 LAELEEAKEELEEKLSALLEELEELFE--------ALREELAELEAELAEIRNELEELKR 404
Query: 244 ELDQFKKENQRLKDENGALIRVISKLS 270
E++ ++ +RL + L + +L
Sbjct: 405 EIESLEERLERLSERLEDLKEELKELE 431
Score = 33.9 bits (78), Expect = 0.087
Identities = 27/121 (22%), Positives = 50/121 (41%), Gaps = 11/121 (9%)
Query: 154 DNLSTSRSRVNSESGGSSRKLPNENGELDYKKLYEESQAELSRLRETLSKTEEELNQVKA 213
+ L R+L + E + E+ L E + + EE+L++++
Sbjct: 796 EELEEELEEAERRLDALERELESLEQRR------ERLEQEIEELEEEIEELEEKLDELEE 849
Query: 214 TYSKNSLSEQE-KRERRTMERKLSEMEVELK----ELDQFKKENQRLKDENGALIRVISK 268
+ +E K E +E + E+E ELK E ++ ++E + L+ E L I K
Sbjct: 850 ELEELEKELEELKEELEELEAEKEELEDELKELEEEKEELEEELRELESELAELKEEIEK 909
Query: 269 L 269
L
Sbjct: 910 L 910
Score = 32.4 bits (74), Expect = 0.25
Identities = 23/99 (23%), Positives = 46/99 (46%), Gaps = 9/99 (9%)
Query: 182 DYKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRER--------RTMER 233
+ +K EE + ELSRL E L + +EEL + + + +E RE ++
Sbjct: 236 ELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREELEELQEELLELKE 295
Query: 234 KLSEMEVELKEL-DQFKKENQRLKDENGALIRVISKLSK 271
++ E+E E+ L ++ ++ L++ L + K+
Sbjct: 296 EIEELEGEISLLRERLEELENELEELEERLEELKEKIEA 334
Score = 31.2 bits (71), Expect = 0.60
Identities = 30/99 (30%), Positives = 48/99 (48%), Gaps = 3/99 (3%)
Query: 173 KLPNENGELDY-KKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTM 231
+L EL+ + EE + EL L E + EEEL ++++ ++ ++ RER
Sbjct: 857 ELEELKEELEELEAEKEELEDELKELEEEKEELEEELRELESELAELKEEIEKLRERLEE 916
Query: 232 -ERKLSEMEVELKELDQFKKENQRLKDENGALIRVISKL 269
E KL +EVEL EL + + E + L R I +L
Sbjct: 917 LEAKLERLEVELPEL-EEELEEEYEDTLETELEREIERL 954
Score = 30.5 bits (69), Expect = 1.0
Identities = 23/104 (22%), Positives = 42/104 (40%), Gaps = 3/104 (2%)
Query: 170 SSRKLPNENGELDYKKLYEESQAELSRLRETLSKTEEELNQVKATYS--KNSLSEQEKRE 227
R D K+ +E +AEL L+ L + EEL +++ ++ L E E+
Sbjct: 410 EERLERLSERLEDLKEELKELEAELEELQTELEELNEELEELEEQLEELRDRLKELEREL 469
Query: 228 RRTMERKLSEMEVELKELDQFKKENQRLKDENGALIRVISKLSK 271
E +L +E EL L+ + + + + V+ L
Sbjct: 470 AELQE-ELQRLEKELSSLEARLDRLEAEQRASQGVRAVLEALES 512
Score = 30.1 bits (68), Expect = 1.3
Identities = 18/82 (21%), Positives = 43/82 (52%), Gaps = 5/82 (6%)
Query: 177 ENGELDYKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLS 236
+ ++ EE +A+L RL L + EEEL +++L + +RE +E ++
Sbjct: 904 KEEIEKLRERLEELEAKLERLEVELPELEEELE----EEYEDTLETELEREIERLEEEIE 959
Query: 237 EME-VELKELDQFKKENQRLKD 257
+ V L+ ++++++ +R ++
Sbjct: 960 ALGPVNLRAIEEYEEVEERYEE 981
Score = 29.3 bits (66), Expect = 2.7
Identities = 18/84 (21%), Positives = 39/84 (46%), Gaps = 2/84 (2%)
Query: 188 EESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELKELDQ 247
E + EL L+ + EE L ++ +E +E +E+E +EL++
Sbjct: 393 AEIRNELEELKREIESLEERLERLSERLEDLKEELKELEAE--LEELQTELEELNEELEE 450
Query: 248 FKKENQRLKDENGALIRVISKLSK 271
+++ + L+D L R +++L +
Sbjct: 451 LEEQLEELRDRLKELERELAELQE 474
>gnl|CDD|234501 TIGR04211, SH3_and_anchor, SH3 domain protein. Members of this
protein family have a signal peptide, a strongly
conserved SH3 domain, a variable region, and then a
C-terminal hydrophobic transmembrane alpha helix region.
Length = 198
Score = 38.5 bits (90), Expect = 0.001
Identities = 23/85 (27%), Positives = 42/85 (49%), Gaps = 1/85 (1%)
Query: 188 EESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELK-ELD 246
E Q EL+ L+E L++ ++E ++K S + ++ ++LS +EL E
Sbjct: 76 AELQEELAELQEQLAELQQENQELKQELSTLEAELERLQKELARIKQLSANAIELDEENR 135
Query: 247 QFKKENQRLKDENGALIRVISKLSK 271
+ ++E LK EN AL +L +
Sbjct: 136 ELREELAELKQENEALEAENERLQE 160
Score = 35.0 bits (81), Expect = 0.018
Identities = 24/87 (27%), Positives = 48/87 (55%), Gaps = 8/87 (9%)
Query: 173 KLPNENGELDYKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTME 232
+L + EL ++ +E + ELS L L + ++EL ++K S N++ + E R +
Sbjct: 84 ELQEQLAEL--QQENQELKQELSTLEAELERLQKELARIKQL-SANAI--ELDEENRELR 138
Query: 233 RKLSEMEVELKELDQFKKENQRLKDEN 259
+L+E++ E + L+ EN+RL++
Sbjct: 139 EELAELKQENEALEA---ENERLQENE 162
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
primarily archaeal type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. It is found
in a single copy and is homodimeric in prokaryotes, but
six paralogs (excluded from this family) are found in
eukarotes, where SMC proteins are heterodimeric. This
family represents the SMC protein of archaea and a few
bacteria (Aquifex, Synechocystis, etc); the SMC of other
bacteria is described by TIGR02168. The N- and
C-terminal domains of this protein are well conserved,
but the central hinge region is skewed in composition
and highly divergent [Cellular processes, Cell division,
DNA metabolism, Chromosome-associated proteins].
Length = 1164
Score = 39.3 bits (92), Expect = 0.002
Identities = 31/103 (30%), Positives = 48/103 (46%), Gaps = 18/103 (17%)
Query: 171 SRKLPNENGELDYKKLYEESQAELSRLRETLSKTEEELNQVKATYS-------KNSLSEQ 223
+++ N EL E +A + L E L K EE LN ++A S + LS+
Sbjct: 750 EQEIENVKSELK------ELEARIEELEEDLHKLEEALNDLEARLSHSRIPEIQAELSKL 803
Query: 224 EKRERRTMERKLSEMEVELKELDQFK----KENQRLKDENGAL 262
E+ E +E +L E+E +L L K KE Q L+++ L
Sbjct: 804 EE-EVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDL 845
Score = 35.8 bits (83), Expect = 0.021
Identities = 24/101 (23%), Positives = 45/101 (44%), Gaps = 19/101 (18%)
Query: 188 EESQAELSRLRETLSKTEEELNQVKATY--SKNSLSEQEKR-------------ERRTME 232
E + EL E L+K E E++++ A + + E+ KR E +
Sbjct: 311 AEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAELKEELEDLR 370
Query: 233 RKLSEMEVELK----ELDQFKKENQRLKDENGALIRVISKL 269
+L E++ E EL ++++ ++LK E L R + +L
Sbjct: 371 AELEEVDKEFAETRDELKDYREKLEKLKREINELKRELDRL 411
Score = 34.3 bits (79), Expect = 0.073
Identities = 28/103 (27%), Positives = 49/103 (47%), Gaps = 10/103 (9%)
Query: 169 GSSRKLPNENGELDYKKLYEESQAELSRLRETLSKTEEELNQVKATYSK-NSLSEQEKRE 227
G SR G L + E Q RLRE L + EL+ +++ + + ++ +E
Sbjct: 657 GGSRAP--RGGILFSRSEPAELQ----RLRERLEGLKRELSSLQSELRRIENRLDELSQE 710
Query: 228 RRTMERKLSEMEVELKELDQFKKENQRLKDENGALIRVISKLS 270
RK+ E+E KE++Q ++E ++LK+ L +S L
Sbjct: 711 LSDASRKIGEIE---KEIEQLEQEEEKLKERLEELEEDLSSLE 750
Score = 33.9 bits (78), Expect = 0.10
Identities = 26/102 (25%), Positives = 50/102 (49%), Gaps = 4/102 (3%)
Query: 172 RKLPNENGELDYKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRT- 230
L E E+D K + E++ EL RE L K + E+N++K + Q E
Sbjct: 367 EDLRAELEEVD--KEFAETRDELKDYREKLEKLKREINELKRELDRLQEELQRLSEELAD 424
Query: 231 MERKLSEMEVELKELDQFKKENQ-RLKDENGALIRVISKLSK 271
+ ++ +E ++ EL++ K++ +K + L ++ + LSK
Sbjct: 425 LNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQLAADLSK 466
Score = 32.0 bits (73), Expect = 0.34
Identities = 20/76 (26%), Positives = 37/76 (48%), Gaps = 7/76 (9%)
Query: 188 EESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELKELD- 246
E + L E L + E L +++ K+ER +E +L E+E +++EL+
Sbjct: 857 ENLNGKKEELEEELEELEAALRDLESRLGD------LKKERDELEAQLRELERKIEELEA 910
Query: 247 QFKKENQRLKDENGAL 262
Q +K+ +RL + L
Sbjct: 911 QIEKKRKRLSELKAKL 926
Score = 31.2 bits (71), Expect = 0.60
Identities = 17/77 (22%), Positives = 40/77 (51%), Gaps = 6/77 (7%)
Query: 188 EESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELKELDQ 247
++ + +L +L LSK E+EL +K Y + +E ++R+L+E E + + ++
Sbjct: 451 KKQEWKLEQLAADLSKYEQELYDLKEEYDRVE------KELSKLQRELAEAEAQARASEE 504
Query: 248 FKKENQRLKDENGALIR 264
+ + +++ A I+
Sbjct: 505 RVRGGRAVEEVLKASIQ 521
Score = 28.9 bits (65), Expect = 3.2
Identities = 15/56 (26%), Positives = 26/56 (46%), Gaps = 3/56 (5%)
Query: 182 DYKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSE 237
Y++ + + E R+ + LSK + EL + +A SE+ R R +E L
Sbjct: 466 KYEQELYDLKEEYDRVEKELSKLQRELAEAEAQ---ARASEERVRGGRAVEEVLKA 518
Score = 28.9 bits (65), Expect = 3.4
Identities = 19/95 (20%), Positives = 41/95 (43%), Gaps = 13/95 (13%)
Query: 188 EESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRE-------RRTMERKLSEMEV 240
E + R+ L + E KA + L E+ + E + +ER+ +E
Sbjct: 187 ERLDLIIDEKRQQLERLRRERE--KAERYQALLKEKREYEGYELLKEKEALERQKEAIER 244
Query: 241 EL----KELDQFKKENQRLKDENGALIRVISKLSK 271
+L +EL++ +E L+ + +++ +L+K
Sbjct: 245 QLASLEEELEKLTEEISELEKRLEEIEQLLEELNK 279
Score = 28.5 bits (64), Expect = 4.9
Identities = 29/112 (25%), Positives = 51/112 (45%), Gaps = 14/112 (12%)
Query: 136 SLNSLRSEAPSEGYESGGDNLSTSRSRVNSESGGSSRKLPNENGELDYKKLYEESQAELS 195
L+SL+SE +N S+ S++ ++ E +L+ ++ E+ + L
Sbjct: 689 ELSSLQSELRRI------ENRLDELSQELSDASRKIGEIEKEIEQLEQEE--EKLKERLE 740
Query: 196 RLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELKELDQ 247
L E LS E+E+ VK+ L E E R +E L ++E L +L+
Sbjct: 741 ELEEDLSSLEQEIENVKS-----ELKELEARIEE-LEEDLHKLEEALNDLEA 786
Score = 28.5 bits (64), Expect = 5.1
Identities = 16/72 (22%), Positives = 37/72 (51%), Gaps = 1/72 (1%)
Query: 184 KKLYEESQAELSRLRETLSKTEEELNQVKATYSKNS-LSEQEKRERRTMERKLSEMEVEL 242
+ +E ELS + + E+E+ Q++ K E+ + + ++E+++ ++ EL
Sbjct: 701 ENRLDELSQELSDASRKIGEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQEIENVKSEL 760
Query: 243 KELDQFKKENQR 254
KEL+ +E +
Sbjct: 761 KELEARIEELEE 772
Score = 27.7 bits (62), Expect = 9.4
Identities = 17/75 (22%), Positives = 28/75 (37%), Gaps = 4/75 (5%)
Query: 188 EESQAELSRLRETLSKTEEELNQVKATYSKNSLSE----QEKRERRTMERKLSEMEVELK 243
E+ E+S L + L + E+ L ++ E +EK E E + K
Sbjct: 254 EKLTEEISELEKRLEEIEQLLEELNKKIKDLGEEEQLRVKEKIGELEAEIASLERSIAEK 313
Query: 244 ELDQFKKENQRLKDE 258
E + E + K E
Sbjct: 314 ERELEDAEERLAKLE 328
>gnl|CDD|217372 pfam03105, SPX, SPX domain. We have named this region the SPX
domain after (SYG1, Pho81 and XPR1). This 180 residue
length domain is found at the amino terminus of a
variety of proteins. In the yeast protein SYG1, the
N-terminus directly binds to the G- protein beta subunit
and inhibits transduction of the mating pheromone
signal. This finding suggests that all the members of
this family are involved in G-protein associated signal
transduction. The N-termini of several proteins involved
in the regulation of phosphate transport, including the
putative phosphate level sensors PHO81 from
Saccharomyces cerevisiae and NUC-2 from Neurospora
crassa, are also members of this family. The SPX domain
of S. cerevisiae low-affinity phosphate transporters
Pho87 and Pho90 auto-regulates uptake and prevents
efflux. This SPX dependent inhibition is mediated by the
physical interaction with Spl2 NUC-2 contains several
ankyrin repeats pfam00023. Several members of this
family are annotated as XPR1 proteins: the xenotropic
and polytropic retrovirus receptor confers
susceptibility to infection with murine leukaemia
viruses (MLV). The similarity between SYG1, phosphate
regulators and XPR1 sequences has been previously noted,
as has the additional similarity to several predicted
proteins, of unknown function, from Drosophila
melanogaster, Arabidopsis thaliana, Caenorhabditis
elegans, Schizosaccharomyces pombe, and Saccharomyces
cerevisiae. In addition, given the similarities between
XPR1 and SYG1 and phosphate regulatory proteins, it has
been proposed that XPR1 might be involved in G-protein
associated signal transduction and may itself function
as a phosphate sensor.
Length = 176
Score = 37.4 bits (87), Expect = 0.003
Identities = 18/76 (23%), Positives = 32/76 (42%), Gaps = 6/76 (7%)
Query: 178 NGELD-----YKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTME 232
+ EL+ YK+ E L L + L + EE ++ + S S K ++
Sbjct: 70 DKELEKVNSFYKEKEAELLERLEELEKQLEELEERRDETSSAKSSPSDKNLLK-AFEELK 128
Query: 233 RKLSEMEVELKELDQF 248
+ L E+ EL+ L +
Sbjct: 129 KALLELYRELQLLKSY 144
>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
Length = 880
Score = 38.1 bits (89), Expect = 0.004
Identities = 21/73 (28%), Positives = 39/73 (53%), Gaps = 3/73 (4%)
Query: 184 KKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELK 243
+ YEE EL + + LS+ EEE+N ++ + E+ E +++KL E+E L+
Sbjct: 299 SEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEKEERLEE---LKKKLKELEKRLE 355
Query: 244 ELDQFKKENQRLK 256
EL++ + + K
Sbjct: 356 ELEERHELYEEAK 368
Score = 35.4 bits (82), Expect = 0.031
Identities = 23/83 (27%), Positives = 38/83 (45%), Gaps = 6/83 (7%)
Query: 182 DYKKLYEESQAELSRLRETLSKTEEELNQVKATYS--KNSLSEQEKR----ERRTMERKL 235
D +K E + EL +L E L K EEL + + + L E EK+ E + +
Sbjct: 609 DAEKELEREEKELKKLEEELDKAFEELAETEKRLEELRKELEELEKKYSEEEYEELREEY 668
Query: 236 SEMEVELKELDQFKKENQRLKDE 258
E+ EL L +E ++ ++E
Sbjct: 669 LELSRELAGLRAELEELEKRREE 691
Score = 33.9 bits (78), Expect = 0.082
Identities = 34/103 (33%), Positives = 56/103 (54%), Gaps = 14/103 (13%)
Query: 172 RKLPNENGELDYKKLYEESQAELSRLRET---LSKTEEELNQVKATYSKNSLSEQEKRER 228
RKL E EL+ K L +ES EL +L+E L + EE+L + Y+ L E++ E
Sbjct: 476 RKLRKELRELE-KVLKKES--ELIKLKELAEQLKELEEKLKK----YNLEEL-EKKAEEY 527
Query: 229 RTMERKLSEMEVELKELDQFKKENQRLKDENGALIRVISKLSK 271
++ KL +++ E+K L KKE ++L++ L + KL +
Sbjct: 528 EKLKEKLIKLKGEIKSL---KKELEKLEELKKKLAELEKKLDE 567
Score = 33.5 bits (77), Expect = 0.13
Identities = 20/91 (21%), Positives = 42/91 (46%), Gaps = 6/91 (6%)
Query: 177 ENGELDYKKLYEESQAELSRLRETLSKTEEELNQVKA-----TYSKNSLSEQEKRERRTM 231
EN + ++ +E + + RL + + +TE +K ++E
Sbjct: 161 ENAYKNLGEVIKEIKRRIERLEKFIKRTENIEELIKEKEKELEEVLREINEISSELPEL- 219
Query: 232 ERKLSEMEVELKELDQFKKENQRLKDENGAL 262
+L ++E E+KEL++ K+E + L+ E +L
Sbjct: 220 REELEKLEKEVKELEELKEEIEELEKELESL 250
Score = 31.6 bits (72), Expect = 0.55
Identities = 24/71 (33%), Positives = 38/71 (53%), Gaps = 11/71 (15%)
Query: 193 ELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRER------RTMERKLSEMEVELKELD 246
EL LRE L K E+E+ +++ K + E EK R +E K+ E+E ++EL
Sbjct: 215 ELPELREELEKLEKEVKELEEL--KEEIEELEKELESLEGSKRKLEEKIRELEERIEEL- 271
Query: 247 QFKKENQRLKD 257
KKE + L++
Sbjct: 272 --KKEIEELEE 280
Score = 31.6 bits (72), Expect = 0.56
Identities = 18/75 (24%), Positives = 41/75 (54%), Gaps = 3/75 (4%)
Query: 184 KKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELK 243
++ Y E EL+ LR L + E+ ++K T K ++E ER +++L ++E L+
Sbjct: 665 REEYLELSRELAGLRAELEELEKRREEIKKTLEK---LKEELEEREKAKKELEKLEKALE 721
Query: 244 ELDQFKKENQRLKDE 258
+++ +++ ++ K
Sbjct: 722 RVEELREKVKKYKAL 736
Score = 31.6 bits (72), Expect = 0.56
Identities = 27/100 (27%), Positives = 52/100 (52%), Gaps = 8/100 (8%)
Query: 180 ELDYKKLYEESQAELSRLRETLSKTEEELNQVKATYSK--NSLSEQEK-RERRTMERKLS 236
E K+L EE AEL R+ + L + EE+ +++ + L ++ + + + + +L
Sbjct: 447 EEHRKELLEEYTAELKRIEKELKEIEEKERKLRKELRELEKVLKKESELIKLKELAEQLK 506
Query: 237 EMEVELK-----ELDQFKKENQRLKDENGALIRVISKLSK 271
E+E +LK EL++ +E ++LK++ L I L K
Sbjct: 507 ELEEKLKKYNLEELEKKAEEYEKLKEKLIKLKGEIKSLKK 546
Score = 30.8 bits (70), Expect = 0.79
Identities = 21/86 (24%), Positives = 41/86 (47%), Gaps = 12/86 (13%)
Query: 184 KKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKR----ERRTMERKLS--- 236
+ YE+ + +L +L+ + ++EL +++ K + E++ E + ++L
Sbjct: 524 AEEYEKLKEKLIKLKGEIKSLKKELEKLEELKKKLAELEKKLDELEEELAELLKELEELG 583
Query: 237 -----EMEVELKELDQFKKENQRLKD 257
E+E LKEL+ F E LKD
Sbjct: 584 FESVEELEERLKELEPFYNEYLELKD 609
Score = 30.4 bits (69), Expect = 1.0
Identities = 24/97 (24%), Positives = 43/97 (44%), Gaps = 14/97 (14%)
Query: 184 KKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTME--------RKL 235
+K E + +L E + + EE + ++K + E E++ + E KL
Sbjct: 244 EKELESLEGSKRKLEEKIRELEERIEELKK-----EIEELEEKVKELKELKEKAEEYIKL 298
Query: 236 SEMEVE-LKELDQFKKENQRLKDENGALIRVISKLSK 271
SE E L EL + +K RL++E + I +L +
Sbjct: 299 SEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEE 335
Score = 30.4 bits (69), Expect = 1.0
Identities = 26/107 (24%), Positives = 46/107 (42%), Gaps = 11/107 (10%)
Query: 171 SRKLPNENGELD----YKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKR 226
S +LP EL+ K EE + E+ L + L E +++ + E E+R
Sbjct: 213 SSELPELREELEKLEKEVKELEELKEEIEELEKELESLEGSKRKLEE-----KIRELEER 267
Query: 227 --ERRTMERKLSEMEVELKELDQFKKENQRLKDENGALIRVISKLSK 271
E + +L E ELKEL + +E +L + + + ++ K
Sbjct: 268 IEELKKEIEELEEKVKELKELKEKAEEYIKLSEFYEEYLDELREIEK 314
Score = 30.0 bits (68), Expect = 1.5
Identities = 20/87 (22%), Positives = 42/87 (48%), Gaps = 3/87 (3%)
Query: 188 EESQAELSRLRETLSKTEEELNQVKATYSKNS---LSEQEKRERRTMERKLSEMEVELKE 244
+++ EL+ + L + +EL +++ YS+ L E+ R + +E+E K
Sbjct: 629 DKAFEELAETEKRLEELRKELEELEKKYSEEEYEELREEYLELSRELAGLRAELEELEKR 688
Query: 245 LDQFKKENQRLKDENGALIRVISKLSK 271
++ KK ++LK+E + +L K
Sbjct: 689 REEIKKTLEKLKEELEEREKAKKELEK 715
Score = 29.3 bits (66), Expect = 2.5
Identities = 22/84 (26%), Positives = 42/84 (50%), Gaps = 6/84 (7%)
Query: 182 DYKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVE 241
+Y E + LSRL E ++ EE + +++ K E+ K++ + +E++L E+E
Sbjct: 304 EYLDELREIEKRLSRLEEEINGIEERIKELE---EKEERLEELKKKLKELEKRLEELEER 360
Query: 242 LKELD---QFKKENQRLKDENGAL 262
+ + K+E +RLK L
Sbjct: 361 HELYEEAKAKKEELERLKKRLTGL 384
Score = 28.5 bits (64), Expect = 4.8
Identities = 20/88 (22%), Positives = 39/88 (44%), Gaps = 8/88 (9%)
Query: 184 KKLYEESQAELSRLRETLSKTEE------ELNQVKATYSKNSLSEQEKRERRTMERKLSE 237
KK +E + L L E EE EL ++K + + E+ ++E +E+ E
Sbjct: 344 KKKLKELEKRLEELEERHELYEEAKAKKEELERLKKRLTGLTP-EKLEKELEELEKAKEE 402
Query: 238 MEVELKELDQFKKE-NQRLKDENGALIR 264
+E E+ ++ E + +K+ A+
Sbjct: 403 IEEEISKITARIGELKKEIKELKKAIEE 430
>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
protein; Reviewed.
Length = 782
Score = 38.3 bits (90), Expect = 0.004
Identities = 18/62 (29%), Positives = 35/62 (56%), Gaps = 9/62 (14%)
Query: 198 RETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELKELDQFKKENQRLKD 257
++ + + +E+LN++ A SL E E+ +E+K E E LKE ++ K+E + K+
Sbjct: 508 KKLIGEDKEKLNELIA-----SLEELERE----LEQKAEEAEALLKEAEKLKEELEEKKE 558
Query: 258 EN 259
+
Sbjct: 559 KL 560
Score = 29.0 bits (66), Expect = 3.0
Identities = 20/80 (25%), Positives = 36/80 (45%), Gaps = 3/80 (3%)
Query: 188 EESQAELSRLRETLSKTEEELNQVKATYS--KNSLSEQEKRERRTMERKLSEMEVELK-E 244
EE + EL + E +E ++K K L E+E + E++ + E K E
Sbjct: 526 EELERELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEEAEKEAQQAIKEAKKE 585
Query: 245 LDQFKKENQRLKDENGALIR 264
D+ KE ++L+ A ++
Sbjct: 586 ADEIIKELRQLQKGGYASVK 605
>gnl|CDD|222426 pfam13870, DUF4201, Domain of unknown function (DUF4201). This is
a family of coiled-coil proteins from eukaryotes. The
function is not known.
Length = 177
Score = 36.8 bits (86), Expect = 0.005
Identities = 24/95 (25%), Positives = 47/95 (49%), Gaps = 13/95 (13%)
Query: 176 NENGELDYKKLYEESQAELSRLRETLSKTEEELNQVK------ATYSKNSLSEQEKRERR 229
EN L+ +K+ EE EL RL+ + T L +K + E +++E R
Sbjct: 49 IENQALN-EKI-EERNKELKRLKLKYTNTVHILTHLKEKLHMLEEELERLKKELKEKEER 106
Query: 230 TME--RKLSEMEVELKELDQFKKENQRLKDENGAL 262
+ ++L +++ E D+ +K+N++L+ + G L
Sbjct: 107 LAKLRKELYKVK---LERDKLRKQNKKLRQQGGLL 138
Score = 34.5 bits (80), Expect = 0.026
Identities = 24/90 (26%), Positives = 44/90 (48%), Gaps = 5/90 (5%)
Query: 187 YEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTM-ERKLSEMEVELKEL 245
+E+ Q E L E + + +EL ++K Y+ +E+ M E +L ++ ELKE
Sbjct: 44 FEQLQIENQALNEKIEERNKELKRLKLKYTNTVHILTHLKEKLHMLEEELERLKKELKEK 103
Query: 246 DQ----FKKENQRLKDENGALIRVISKLSK 271
++ +KE ++K E L + KL +
Sbjct: 104 EERLAKLRKELYKVKLERDKLRKQNKKLRQ 133
Score = 28.7 bits (65), Expect = 2.2
Identities = 13/77 (16%), Positives = 33/77 (42%), Gaps = 15/77 (19%)
Query: 184 KKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVE-- 241
KK +E + L++LR+ L K + E ++++ K + ++ + V
Sbjct: 97 KKELKEKEERLAKLRKELYKVKLERDKLRKQNKK-------------LRQQGGLLHVPAL 143
Query: 242 LKELDQFKKENQRLKDE 258
L + D+ ++ + +
Sbjct: 144 LYDYDKTVEKVEEKRKS 160
>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin. Trichoplein
or mitostatin, was first defined as a meiosis-specific
nuclear structural protein. It has since been linked
with mitochondrial movement. It is associated with the
mitochondrial outer membrane, and over-expression leads
to reduction in mitochondrial motility whereas lack of
it enhances mitochondrial movement. The activity appears
to be mediated through binding the mitochondria to the
actin intermediate filaments (IFs).
Length = 349
Score = 37.2 bits (87), Expect = 0.006
Identities = 21/86 (24%), Positives = 37/86 (43%), Gaps = 6/86 (6%)
Query: 183 YKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERK----LSEM 238
+++ EE +AE RE K EE+++ + E+EK R E K E
Sbjct: 105 IERIQEEDEAEAQEKREKQKKLREEIDEFNE--ERIERKEEEKEREREEELKILEYQREK 162
Query: 239 EVELKELDQFKKENQRLKDENGALIR 264
+E + ++E + K+ A +R
Sbjct: 163 AEREEEREAERRERKEEKEREVARLR 188
Score = 29.1 bits (66), Expect = 2.8
Identities = 13/71 (18%), Positives = 27/71 (38%), Gaps = 4/71 (5%)
Query: 188 EESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELKELDQ 247
+E E +E + E E Y E+ +RE + E + +E+ +
Sbjct: 131 DEFNEERIERKEEEKEREREEELKILEY----QREKAEREEEREAERRERKEEKEREVAR 186
Query: 248 FKKENQRLKDE 258
+ + + +DE
Sbjct: 187 LRAQQEEAEDE 197
Score = 28.3 bits (64), Expect = 3.8
Identities = 17/78 (21%), Positives = 32/78 (41%), Gaps = 10/78 (12%)
Query: 188 EESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRER-------RTMERKLSEMEV 240
E A L +E EEL++++A E E++ER R+ E++
Sbjct: 181 EREVARLRAQQEEAEDEREELDELRA---DLYQEEYERKERQKEKEEAEKRRRQKQELQR 237
Query: 241 ELKELDQFKKENQRLKDE 258
+E + K+E + +
Sbjct: 238 AREEQIEEKEERLQEERA 255
>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
Length = 520
Score = 37.5 bits (88), Expect = 0.006
Identities = 24/93 (25%), Positives = 48/93 (51%), Gaps = 12/93 (12%)
Query: 183 YKKLYE-ESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERR---TMERKLSEM 238
KK E++ E+ +LR ++ E+EL + +N L + EKR + ++RKL +
Sbjct: 54 IKKEALLEAKEEIHKLR---NEFEKELRE-----RRNELQKLEKRLLQKEENLDRKLELL 105
Query: 239 EVELKELDQFKKENQRLKDENGALIRVISKLSK 271
E +EL++ +KE ++ + E + +L +
Sbjct: 106 EKREEELEKKEKELEQKQQELEKKEEELEELIE 138
Score = 31.3 bits (72), Expect = 0.61
Identities = 23/95 (24%), Positives = 46/95 (48%), Gaps = 13/95 (13%)
Query: 181 LDYKK-LYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEME 239
LD K L E+ + EL + + L + ++EL + + + + ++ ER +S +
Sbjct: 98 LDRKLELLEKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQELER------ISGLT 151
Query: 240 V-ELKE--LDQFKKENQRLKDENGALIRVISKLSK 271
E KE L++ ++E + E LI+ I + +K
Sbjct: 152 AEEAKEILLEKVEEE---ARHEAAVLIKEIEEEAK 183
>gnl|CDD|224705 COG1792, MreC, Cell shape-determining protein [Cell envelope
biogenesis, outer membrane].
Length = 284
Score = 37.0 bits (86), Expect = 0.007
Identities = 12/50 (24%), Positives = 26/50 (52%)
Query: 209 NQVKATYSKNSLSEQEKRERRTMERKLSEMEVELKELDQFKKENQRLKDE 258
V + E ++++L+E+E L+E++ ++EN+RLK+
Sbjct: 56 EFVDGVLEFLKSLKDLALENEELKKELAELEQLLEEVESLEEENKRLKEL 105
>gnl|CDD|202427 pfam02841, GBP_C, Guanylate-binding protein, C-terminal domain.
Transcription of the anti-viral guanylate-binding
protein (GBP) is induced by interferon-gamma during
macrophage induction. This family contains GBP1 and
GPB2, both GTPases capable of binding GTP, GDP and GMP.
Length = 297
Score = 36.9 bits (86), Expect = 0.007
Identities = 26/81 (32%), Positives = 40/81 (49%), Gaps = 12/81 (14%)
Query: 188 EESQAELSRLRETLSKTEEELNQVKATYSKN--SLSEQEKRER--------RTMERKLSE 237
E ++AE LRE + E+ + + +Y ++ L E+ + ER R +E KL E
Sbjct: 214 EAAEAEQELLREKQKEEEQMMEAQERSYQEHVKQLIEKMEAEREKLLAEQERMLEHKLQE 273
Query: 238 MEVELKELDQFKKENQRLKDE 258
E LKE FK E + L+ E
Sbjct: 274 QEELLKE--GFKTEAESLQKE 292
>gnl|CDD|202833 pfam03962, Mnd1, Mnd1 family. This family of proteins includes
MND1 from S. cerevisiae. The mnd1 protein forms a
complex with hop2 to promote homologous chromosome
pairing and meiotic double-strand break repair.
Length = 188
Score = 36.1 bits (84), Expect = 0.007
Identities = 18/80 (22%), Positives = 40/80 (50%), Gaps = 10/80 (12%)
Query: 184 KKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELK 243
K E+ + EL L++ +++ + ++ ++K +E ER + +L ++E ELK
Sbjct: 68 KTRLEKLKKELEELKQRIAELQAQIEKLKK-------GREETEERTELLEELKQLEKELK 120
Query: 244 ELDQFKKENQRLKDENGALI 263
+L K E ++ + + I
Sbjct: 121 KL---KAELEKYEKNDPERI 137
>gnl|CDD|206034 pfam13863, DUF4200, Domain of unknown function (DUF4200). This
family is found in eukaryotes. It is a coiled-coil
domain of unknwon function.
Length = 126
Score = 35.3 bits (82), Expect = 0.008
Identities = 23/91 (25%), Positives = 42/91 (46%), Gaps = 4/91 (4%)
Query: 184 KKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMER--KLSEMEVE 241
++ +E + L + E L K EEEL + + K L E E + RR ++ + ++ E
Sbjct: 20 REEFERREELLKQREEELEKKEEELQESLIKFDK-FLKENEAKRRRAEKKAEEEKKLRKE 78
Query: 242 L-KELDQFKKENQRLKDENGALIRVISKLSK 271
+E+ + K E + LK E L + +
Sbjct: 79 KEEEIKELKAELEELKAEIEKLEEKLEEYQP 109
Score = 34.9 bits (81), Expect = 0.011
Identities = 18/85 (21%), Positives = 41/85 (48%), Gaps = 11/85 (12%)
Query: 182 DYKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLS--------EQEKRERRTMER 233
++++ E + L + + +E L + +N E+EK+ R+ E
Sbjct: 22 EFERREELLKQREEELEKKEEELQESLIKFDKFLKENEAKRRRAEKKAEEEKKLRKEKEE 81
Query: 234 KLSEMEVELKELDQFKKENQRLKDE 258
++ E++ EL+EL K E ++L+++
Sbjct: 82 EIKELKAELEEL---KAEIEKLEEK 103
>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain. Axonemal
dynein light chain proteins play a dynamic role in
flagellar and cilia motility. Eukaryotic cilia and
flagella are complex organelles consisting of a core
structure, the axoneme, which is composed of nine
microtubule doublets forming a cylinder that surrounds a
pair of central singlet microtubules. This
ultra-structural arrangement seems to be one of the most
stable micro-tubular assemblies known and is responsible
for the flagellar and ciliary movement of a large number
of organisms ranging from protozoan to mammals. This
light chain interacts directly with the N-terminal half
of the heavy chains.
Length = 189
Score = 36.0 bits (84), Expect = 0.008
Identities = 30/93 (32%), Positives = 40/93 (43%), Gaps = 24/93 (25%)
Query: 183 YKKLYEES---------QAE--LSRLRETLSKTEEE----LNQVKATYSKNSLSEQEKR- 226
Y+ LYE S QAE S L + + K EEE +V + L EKR
Sbjct: 100 YQTLYESSVAFGMRKALQAEQGKSELEQEIKKLEEEKEELEKRVAEL--EAKLEAIEKRE 157
Query: 227 -ERRTMERKLSEMEVELKELDQFKKENQRLKDE 258
E R +E K E+ KK+NQ+LK +
Sbjct: 158 EEERQIEEKRHADEI-----AFLKKQNQQLKSQ 185
>gnl|CDD|235505 PRK05563, PRK05563, DNA polymerase III subunits gamma and tau;
Validated.
Length = 559
Score = 36.8 bits (86), Expect = 0.010
Identities = 18/84 (21%), Positives = 39/84 (46%), Gaps = 4/84 (4%)
Query: 189 ESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSE---MEVELKEL 245
E EL L + + + E+EL Q+KA + ++EK++ + +K K L
Sbjct: 366 EYDTELEVLLQRVEQLEQELKQLKAQPVGVAPEQKEKKKEKKKNKKKKYKVPRGKIYKVL 425
Query: 246 DQFKKEN-QRLKDENGALIRVISK 268
+ +++ + LK+ G ++ +
Sbjct: 426 KEATRQDLELLKNVWGEILESLKA 449
>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
recombination, and repair].
Length = 908
Score = 37.0 bits (86), Expect = 0.010
Identities = 21/101 (20%), Positives = 46/101 (45%), Gaps = 5/101 (4%)
Query: 171 SRKLPNENGELDYKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRT 230
K+ E+ + E++ +L E L + EEEL Q++ + L ++ +
Sbjct: 660 EEKVEELEAEIRRELQRIENEEQLEEKLEELEQLEEELEQLREELEE--LLKKLGEIEQL 717
Query: 231 MER---KLSEMEVELKELDQFKKENQRLKDENGALIRVISK 268
+E + +E+E KEL++ +K + L++ L + +
Sbjct: 718 IEELESRKAELEELKKELEKLEKALELLEELREKLGKAGLR 758
Score = 34.0 bits (78), Expect = 0.075
Identities = 25/106 (23%), Positives = 49/106 (46%), Gaps = 15/106 (14%)
Query: 180 ELDYKKLYEESQAELSRLRE---TLSKTEEELNQVKATYSKNSLSEQEKRER-----RTM 231
E+ ++ EE + E+ L E L + +E L ++KA + E E + R +
Sbjct: 220 EIQEEQEEEELEQEIEALEERLAELEEEKERLEELKARLLEIESLELEALKIREEELREL 279
Query: 232 ERKLSEMEVELKELD-------QFKKENQRLKDENGALIRVISKLS 270
ER L E+E +++ L+ + ++E + L+ L ++ KL
Sbjct: 280 ERLLEELEEKIERLEELEREIEELEEELEGLRALLEELEELLEKLK 325
Score = 33.6 bits (77), Expect = 0.12
Identities = 17/78 (21%), Positives = 35/78 (44%), Gaps = 4/78 (5%)
Query: 182 DYKKLYEESQAE-LSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEV 240
+ ++L E+ Q + L L +EL ++ ++E E R +L E++
Sbjct: 544 ELEELKEKLQLQQLKEELRQLEDRLQELKELLEELRLLRTRKEELEELR---ERLKELKK 600
Query: 241 ELKELDQFKKENQRLKDE 258
+LKEL++ + + L
Sbjct: 601 KLKELEERLSQLEELLQS 618
Score = 33.6 bits (77), Expect = 0.12
Identities = 23/90 (25%), Positives = 40/90 (44%), Gaps = 7/90 (7%)
Query: 182 DYKKLYEESQAELSRLRETLSKTEEELNQ----VKATYSKNSLSEQEKRER---RTMERK 234
+ ++L EE + ++ RL E + EE + L E+ K +E K
Sbjct: 278 ELERLLEELEEKIERLEELEREIEELEEELEGLRALLEELEELLEKLKSLEERLEKLEEK 337
Query: 235 LSEMEVELKELDQFKKENQRLKDENGALIR 264
L ++E EL+EL + K E +L +E +
Sbjct: 338 LEKLESELEELAEEKNELAKLLEERLKELE 367
Score = 31.3 bits (71), Expect = 0.55
Identities = 20/106 (18%), Positives = 40/106 (37%), Gaps = 14/106 (13%)
Query: 180 ELDYKKLYEESQAELSRLRETLSKTEEEL--------------NQVKATYSKNSLSEQEK 225
L+ + E + L L K + +V+ ++ Q
Sbjct: 618 SLELSEAENELEEAEEELESELEKLNLQAELEELLQAALEELEEKVEELEAEIRRELQRI 677
Query: 226 RERRTMERKLSEMEVELKELDQFKKENQRLKDENGALIRVISKLSK 271
+E KL E+E +EL+Q ++E + L + G + ++I +L
Sbjct: 678 ENEEQLEEKLEELEQLEEELEQLREELEELLKKLGEIEQLIEELES 723
Score = 30.9 bits (70), Expect = 0.92
Identities = 18/80 (22%), Positives = 41/80 (51%), Gaps = 3/80 (3%)
Query: 182 DYKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKR---ERRTMERKLSEM 238
+ ++ +E + +L L E LS+ EE L ++ + ++N L E E+ E + +
Sbjct: 590 ELRERLKELKKKLKELEERLSQLEELLQSLELSEAENELEEAEEELESELEKLNLQAELE 649
Query: 239 EVELKELDQFKKENQRLKDE 258
E+ L++ +++ + L+ E
Sbjct: 650 ELLQAALEELEEKVEELEAE 669
Score = 29.3 bits (66), Expect = 2.6
Identities = 23/106 (21%), Positives = 43/106 (40%), Gaps = 17/106 (16%)
Query: 183 YKKLYEESQAELSRL--------------RETLSKTEEELNQVKATY---SKNSLSEQEK 225
++ EE + EL L +E L + E+L + + +Q K
Sbjct: 499 LREEIEELEKELRELEEELIELLELEEALKEELEEKLEKLENLLEELEELKEKLQLQQLK 558
Query: 226 RERRTMERKLSEMEVELKELDQFKKENQRLKDENGALIRVISKLSK 271
E R +E +L E++ L+EL + + L++ L + KL +
Sbjct: 559 EELRQLEDRLQELKELLEELRLLRTRKEELEELRERLKELKKKLKE 604
Score = 29.0 bits (65), Expect = 3.7
Identities = 21/77 (27%), Positives = 45/77 (58%), Gaps = 1/77 (1%)
Query: 183 YKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRER-RTMERKLSEMEVE 241
++ EE + EL + E L + EE + ++K ++ S + +E +E +E++L E+E E
Sbjct: 366 LEERLEELEKELEKALERLKQLEEAIQELKEELAELSAALEEIQEELEELEKELEELERE 425
Query: 242 LKELDQFKKENQRLKDE 258
L+EL++ K+ + ++
Sbjct: 426 LEELEEEIKKLEEQINQ 442
Score = 28.2 bits (63), Expect = 6.4
Identities = 30/148 (20%), Positives = 56/148 (37%), Gaps = 30/148 (20%)
Query: 154 DNLSTSRSRVNSESGGSSRKLPNENGELDY-KKLYEESQAELSRLRETLSKTEEELNQVK 212
L + + E S L EL+ +K EE + EL L E + K EE++NQ++
Sbjct: 385 KQLEEAIQELKEELAELSAALEEIQEELEELEKELEELERELEELEEEIKKLEEQINQLE 444
Query: 213 AT------------------------YSKNSLSEQEK-----RERRTMERKLSEMEVELK 243
+ + K L E E + E++ +E+ E++
Sbjct: 445 SKELMIAELAGAGEKCPVCGQELPEEHEKELLELYELELEELEEELSREKEEAELREEIE 504
Query: 244 ELDQFKKENQRLKDENGALIRVISKLSK 271
EL++ +E + E L + + +
Sbjct: 505 ELEKELRELEEELIELLELEEALKEELE 532
>gnl|CDD|149160 pfam07926, TPR_MLP1_2, TPR/MLP1/MLP2-like protein. The sequences
featured in this family are similar to a region of human
TPR protein and to yeast myosin-like proteins 1 (MLP1)
and 2 (MLP2). These proteins share a number of features;
for example, they all have coiled-coil regions and all
three are associated with nuclear pores. TPR is thought
to be a component of nuclear pore complex- attached
intra-nuclear filaments, and is implicated in nuclear
protein import. Moreover, its N-terminal region is
involved in the activation of oncogenic kinases,
possibly by mediating the dimerisation of kinase domains
or by targeting these kinases to the nuclear pore
complex. MLP1 and MLP2 are involved in the process of
telomere length regulation, where they are thought to
interact with proteins such as Tel1p and modulate their
activity.
Length = 132
Score = 34.6 bits (80), Expect = 0.014
Identities = 21/86 (24%), Positives = 39/86 (45%), Gaps = 2/86 (2%)
Query: 187 YEESQAELSRLRETLSKTEEELNQVKATYSK-NSLSEQEKRERRTMERKLSEMEVELK-E 244
YE + + E L ++LN++K ++ + +E + E E E + L+ E
Sbjct: 47 YERELVKHAEDIEELQALRKQLNELKKEIAQLKAEAESAQAELSEAEESWEEQKKMLEDE 106
Query: 245 LDQFKKENQRLKDENGALIRVISKLS 270
L + +K + L ++N L I LS
Sbjct: 107 LSELEKRIEELNEQNKLLHDQIELLS 132
Score = 31.1 bits (71), Expect = 0.24
Identities = 20/75 (26%), Positives = 40/75 (53%), Gaps = 12/75 (16%)
Query: 183 YKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVEL 242
+K E + E+++L+ + EL++ ++ S EQ+K +E +LSE+E +
Sbjct: 64 LRKQLNELKKEIAQLKAEAESAQAELSE-----AEESWEEQKKM----LEDELSELEKRI 114
Query: 243 KELDQFKKENQRLKD 257
+EL ++N+ L D
Sbjct: 115 EEL---NEQNKLLHD 126
>gnl|CDD|130673 TIGR01612, 235kDa-fam, reticulocyte binding/rhoptry protein. This
model represents a group of paralogous families in
plasmodium species alternately annotated as reticulocyte
binding protein, 235-kDa family protein and rhoptry
protein. Rhoptry protein is localized on the cell surface
and is extremely large (although apparently lacking in
repeat structure) and is important for the process of
invasion of the RBCs by the parasite. These proteins are
found in P. falciparum, P. vivax and P. yoelii.
Length = 2757
Score = 36.2 bits (83), Expect = 0.017
Identities = 27/104 (25%), Positives = 55/104 (52%), Gaps = 8/104 (7%)
Query: 164 NSESGGSSRKLPNENGELDYKK-LYEESQAELSRLRETLSKTEEELNQVKA---TYSKNS 219
N + +K+ N ++D KK +Y+E + L+ + E + K + L +VK +Y KN
Sbjct: 1165 NDDPEEIEKKIENIVTKIDKKKNIYDEIKKLLNEIAE-IEKDKTSLEEVKGINLSYGKNL 1223
Query: 220 ---LSEQEKRERRTMERKLSEMEVELKELDQFKKENQRLKDENG 260
E+ E++ E + ME +++LD+ K+++ +++E G
Sbjct: 1224 GKLFLEKIDEEKKKSEHMIKAMEAYIEDLDEIKEKSPEIENEMG 1267
>gnl|CDD|218191 pfam04652, DUF605, Vta1 like. Vta1 (VPS20-associated protein 1) is
a positive regulator of Vps4. Vps4 is an ATPase that is
required in the multivesicular body (MVB) sorting
pathway to dissociate the endosomal sorting complex
required for transport (ESCRT). Vta1 promotes correct
assembly of Vps4 and stimulates its ATPase activity
through its conserved Vta1/SBP1/LIP5 region.
Length = 315
Score = 35.8 bits (83), Expect = 0.017
Identities = 24/107 (22%), Positives = 30/107 (28%), Gaps = 14/107 (13%)
Query: 2 SYSTPPRRSSGRTTSRPRSLYSSSPSPQSRNLSSYSSGLSSSTPYLNSSYPSYGSTYGGT 61
S+ + + S P S PS S S SS P PS +
Sbjct: 171 SFPGEDADPASASPSDPPSSSPGVPSFPSPPEDPSSPSDSSLPPA-----PSSFQSDT-- 223
Query: 62 SSGYGGLTLPSSYSRPSQSSPRSYHAPPSSYRSSYAPTTTPLPPVRP 108
PS S + S P APP P +T P
Sbjct: 224 -------PPPSPESPTNPSPPPGPAAPPPPPVQQVPPLSTAKPTPPS 263
>gnl|CDD|129323 TIGR00219, mreC, rod shape-determining protein MreC. MreC (murein
formation C) is involved in the rod shape determination
in E. coli, and more generally in cell shape
determination of bacteria whether or not they are
rod-shaped. Cells defective in MreC are round. Species
with MreC include many of the Proteobacteria,
Gram-positives, and spirochetes [Cell envelope,
Biosynthesis and degradation of murein sacculus and
peptidoglycan].
Length = 283
Score = 35.6 bits (82), Expect = 0.019
Identities = 16/70 (22%), Positives = 30/70 (42%), Gaps = 9/70 (12%)
Query: 195 SRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELKELDQFKKENQR 254
+R RE E L V +N + R+ + +K ++E+ + L Q EN R
Sbjct: 52 NRPREVFDGISENLKDVNNLEYEN------YKLRQELLKKNQQLEILTQNLKQ---ENVR 102
Query: 255 LKDENGALIR 264
L++ + +
Sbjct: 103 LRELLNSPLS 112
>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
unknown].
Length = 652
Score = 35.8 bits (83), Expect = 0.023
Identities = 17/81 (20%), Positives = 37/81 (45%), Gaps = 7/81 (8%)
Query: 188 EESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELKE--- 244
E + EL L+ + K E EL + + ++ RE R +R++ +E EL+E
Sbjct: 439 SELKRELEELKREIEKLESELERFRREVRD---KVRKDREIRARDRRIERLEKELEEKKK 495
Query: 245 -LDQFKKENQRLKDENGALIR 264
+++ +++ L+ +
Sbjct: 496 RVEELERKLAELRKMRKLELS 516
Score = 28.5 bits (64), Expect = 4.4
Identities = 24/73 (32%), Positives = 38/73 (52%), Gaps = 7/73 (9%)
Query: 197 LRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELKELDQFKKENQRLK 256
L E LSK +EE + + E+E+RE E+++ ++E ++ L ++EN LK
Sbjct: 390 LAEALSKVKEE-ERPREKEGT---EEEERREITVYEKRIKKLEETVERL---EEENSELK 442
Query: 257 DENGALIRVISKL 269
E L R I KL
Sbjct: 443 RELEELKREIEKL 455
>gnl|CDD|237560 PRK13922, PRK13922, rod shape-determining protein MreC;
Provisional.
Length = 276
Score = 35.0 bits (81), Expect = 0.025
Identities = 12/36 (33%), Positives = 23/36 (63%)
Query: 225 KRERRTMERKLSEMEVELKELDQFKKENQRLKDENG 260
+ E ++++L E+E L+EL+Q + EN RL++
Sbjct: 75 REENEELKKELLELESRLQELEQLEAENARLRELLN 110
>gnl|CDD|235600 PRK05771, PRK05771, V-type ATP synthase subunit I; Validated.
Length = 646
Score = 35.3 bits (82), Expect = 0.035
Identities = 19/90 (21%), Positives = 36/90 (40%), Gaps = 6/90 (6%)
Query: 175 PNENGELDYKKLYEESQAELSRLRETLSKTEEELNQVKA---TYSKNSLSEQEKRERRT- 230
E + + +LS + L +LN ++ S SL E K
Sbjct: 36 LKEELSNERLRKLRSLLTKLSEALDKLRSYLPKLNPLREEKKKVSVKSLEELIKDVEEEL 95
Query: 231 --MERKLSEMEVELKELDQFKKENQRLKDE 258
+E+++ E+E E+ EL+ KE ++ +
Sbjct: 96 EKIEKEIKELEEEISELENEIKELEQEIER 125
Score = 29.5 bits (67), Expect = 2.0
Identities = 15/85 (17%), Positives = 37/85 (43%), Gaps = 7/85 (8%)
Query: 165 SESGGSSRKLPNENGELDYKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQE 224
R+ + ++L ++ + EL ++ + + + EEE++++ +N + E E
Sbjct: 66 LPKLNPLREEKKKVSVKSLEELIKDVEEELEKIEKEIKELEEEISEL-----ENEIKELE 120
Query: 225 KRERRTMERKLSEMEVELKELDQFK 249
+ R +++L L FK
Sbjct: 121 QEIERLE--PWGNFDLDLSLLLGFK 143
Score = 29.1 bits (66), Expect = 3.0
Identities = 21/97 (21%), Positives = 40/97 (41%), Gaps = 15/97 (15%)
Query: 172 RKLPNENGELDYKK----LYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRE 227
+KL E EL+ + L E + EL + + EEL ++ Y + L+ E E
Sbjct: 198 KKLGFERLELEEEGTPSELIREIKEELEEIEKERESLLEELKELAKKYLEELLALYEYLE 257
Query: 228 RRTMERKLSEMEVELKELDQF--------KKENQRLK 256
+E + +E + + D+ + ++LK
Sbjct: 258 ---IELERAEALSKFLKTDKTFAIEGWVPEDRVKKLK 291
>gnl|CDD|226889 COG4487, COG4487, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 438
Score = 34.8 bits (80), Expect = 0.035
Identities = 16/74 (21%), Positives = 28/74 (37%), Gaps = 2/74 (2%)
Query: 188 EESQAELSRLRETLSKTEEELNQVKATYSKNSLS--EQEKRERRTMERKLSEMEVELKEL 245
+ E L K E Q + + ++ +E +E++L E+ EL++
Sbjct: 74 INQKKEQKNLFNEQIKQFELALQDEIAKLEALELLNLEKDKELELLEKELDELSKELQKQ 133
Query: 246 DQFKKENQRLKDEN 259
Q E K EN
Sbjct: 134 LQNTAEIIEKKREN 147
Score = 27.9 bits (62), Expect = 7.1
Identities = 21/84 (25%), Positives = 35/84 (41%), Gaps = 11/84 (13%)
Query: 184 KKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEK---------RERRTMERK 234
+ E E + E L K +EL++ +N+ EK R + E+K
Sbjct: 102 LEALELLNLEKDKELELLEKELDELSKELQKQLQNTAEIIEKKRENNKNEERLKFENEKK 161
Query: 235 LSE-MEVE-LKELDQFKKENQRLK 256
L E +E+E K +Q + N L+
Sbjct: 162 LEESLELEREKFEEQLHEANLDLE 185
Score = 27.5 bits (61), Expect = 7.9
Identities = 14/61 (22%), Positives = 27/61 (44%), Gaps = 4/61 (6%)
Query: 189 ESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELKELDQF 248
++ AE+ + +K EE L + SL + ER E +L E ++L+ +
Sbjct: 135 QNTAEIIEKKRENNKNEERLKFENEKKLEESL----ELEREKFEEQLHEANLDLEFKENE 190
Query: 249 K 249
+
Sbjct: 191 E 191
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
bacterial type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. This family
represents the SMC protein of most bacteria. The smc
gene is often associated with scpB (TIGR00281) and scpA
genes, where scp stands for segregation and condensation
protein. SMC was shown (in Caulobacter crescentus) to be
induced early in S phase but present and bound to DNA
throughout the cell cycle [Cellular processes, Cell
division, DNA metabolism, Chromosome-associated
proteins].
Length = 1179
Score = 35.0 bits (81), Expect = 0.038
Identities = 32/136 (23%), Positives = 53/136 (38%), Gaps = 16/136 (11%)
Query: 127 RMSRSGSTTSLNSLRSEAPSEGYESGGDNLSTSRSRVNSESGGSSRKLPNENGEL----- 181
+SR S + R EA E E LS + + +E +L EL
Sbjct: 723 ELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEAEEELAEAEA 782
Query: 182 ---DYKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRT-MERKLSE 237
+ + E+ + EL LRE L + EL + RER +ER+++
Sbjct: 783 EIEELEAQIEQLKEELKALREALDELRAELTLLNEEA-------ANLRERLESLERRIAA 835
Query: 238 MEVELKELDQFKKENQ 253
E L++L++ +E
Sbjct: 836 TERRLEDLEEQIEELS 851
Score = 34.6 bits (80), Expect = 0.051
Identities = 19/83 (22%), Positives = 34/83 (40%), Gaps = 4/83 (4%)
Query: 188 EESQAELSRLRETLSKTEEELNQVKATY-SKNSLSEQEKRERRTMERKLSEMEVELKELD 246
E AE+ L E + + E EL + S + E + +L E+E + EL
Sbjct: 855 ESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELSEELRELESKRSELR 914
Query: 247 QFKKENQRLKDENGALIRVISKL 269
+ E + L+++ L + L
Sbjct: 915 R---ELEELREKLAQLELRLEGL 934
Score = 31.6 bits (72), Expect = 0.55
Identities = 18/65 (27%), Positives = 29/65 (44%), Gaps = 7/65 (10%)
Query: 185 KLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRT----MERKLSEMEV 240
+ L+ LR L + EEL +++ SK S +E E R +E +L +EV
Sbjct: 880 NERASLEEALALLRSELEELSEELRELE---SKRSELRRELEELREKLAQLELRLEGLEV 936
Query: 241 ELKEL 245
+ L
Sbjct: 937 RIDNL 941
Score = 31.2 bits (71), Expect = 0.63
Identities = 22/92 (23%), Positives = 43/92 (46%), Gaps = 9/92 (9%)
Query: 187 YEESQAELSRLRETLSKTEEELNQVKATY--SKNSLSEQEK------RERRTMERKLSEM 238
E + L+ LR+ L + EEEL Q++ +S K E +E +++++
Sbjct: 693 IAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQL 752
Query: 239 EVELKELDQFKKE-NQRLKDENGALIRVISKL 269
EL EL+ +E +RL++ L +++
Sbjct: 753 SKELTELEAEIEELEERLEEAEEELAEAEAEI 784
Score = 30.0 bits (68), Expect = 1.7
Identities = 23/107 (21%), Positives = 45/107 (42%), Gaps = 8/107 (7%)
Query: 173 KLPNENGEL-DYKKLYEESQAELSRLRETLSKTEEELNQVKAT--YSKNSLSEQEKRERR 229
+L EL + + EE + +L LR +++ E ++ + + L E R R
Sbjct: 359 ELEELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLERLEDRRER 418
Query: 230 TMERKLSEM----EVELKELD-QFKKENQRLKDENGALIRVISKLSK 271
+ + E ELKEL + ++ + L++ L R+ L +
Sbjct: 419 LQQEIEELLKKLEEAELKELQAELEELEEELEELQEELERLEEALEE 465
Score = 30.0 bits (68), Expect = 1.8
Identities = 28/89 (31%), Positives = 43/89 (48%), Gaps = 16/89 (17%)
Query: 183 YKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVEL 242
EE + EL L+E L + EEEL E+ E + +E KL E+ +E+
Sbjct: 230 LVLRLEELREELEELQEELKEAEEEL-------------EELTAELQELEEKLEELRLEV 276
Query: 243 KELDQFKKENQRLKDENGALIRVISKLSK 271
EL+ +E + L+ E AL IS+L +
Sbjct: 277 SELE---EEIEELQKELYALANEISRLEQ 302
Score = 28.9 bits (65), Expect = 3.5
Identities = 27/111 (24%), Positives = 50/111 (45%), Gaps = 2/111 (1%)
Query: 160 RSRVNSESGGSSRKLPNENGELD-YKKLYEESQAELSRLRETLSKTEEELNQVKATYSKN 218
+ E + +L EL ++ EE + E+S L E + + ++EL + S+
Sbjct: 241 LEELQEELKEAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEISRL 300
Query: 219 SLSEQEKRERR-TMERKLSEMEVELKELDQFKKENQRLKDENGALIRVISK 268
+Q RER +ER+L E+E +L+EL+ E E + + +
Sbjct: 301 EQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKE 351
>gnl|CDD|225199 COG2317, COG2317, Zn-dependent carboxypeptidase [Amino acid
transport and metabolism].
Length = 497
Score = 35.0 bits (81), Expect = 0.041
Identities = 18/61 (29%), Positives = 28/61 (45%), Gaps = 4/61 (6%)
Query: 189 ESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEM----EVELKE 244
E+ AELS LR L + E ++ + LSE E R ++R+ + E +KE
Sbjct: 45 EAIAELSVLRHELLTSPEFAELLEKAEEEKDLSEIEAGVVRELKREYEKAKKIPEELVKE 104
Query: 245 L 245
Sbjct: 105 Y 105
>gnl|CDD|216531 pfam01496, V_ATPase_I, V-type ATPase 116kDa subunit family. This
family consists of the 116kDa V-type ATPase (vacuolar
(H+)-ATPases) subunits, as well as V-type ATP synthase
subunit i. The V-type ATPases family are proton pumps
that acidify intracellular compartments in eukaryotic
cells for example yeast central vacuoles,
clathrin-coated and synaptic vesicles. They have
important roles in membrane trafficking processes. The
116kDa subunit (subunit a) in the V-type ATPase is part
of the V0 functional domain responsible for proton
transport. The a subunit is a transmembrane glycoprotein
with multiple putative transmembrane helices it has a
hydrophilic amino terminal and a hydrophobic carboxy
terminal. It has roles in proton transport and assembly
of the V-type ATPase complex. This subunit is encoded by
two homologous gene in yeast VPH1 and STV1.
Length = 707
Score = 35.0 bits (81), Expect = 0.042
Identities = 27/118 (22%), Positives = 42/118 (35%), Gaps = 16/118 (13%)
Query: 163 VNSESGGSSRKLPNENGELD--YKKL-YEESQAELSRLRETLSKTE--------EELNQV 211
+N + RK NE D +KL ES ++ +L L T +E +
Sbjct: 15 LNEDVSAFQRKFVNEVRRCDEVERKLRKLES--KIKKLGIPLKDTGGKPDVPPSKEFLDL 72
Query: 212 KATYSKNSLSEQEKRERRTMERKLSEMEVELKELDQFKKENQRLKDENGALIRVISKL 269
+ E E +E L + EL+E E + DEN + +S L
Sbjct: 73 EEEILD---LEAEIKEVEENLESLEKEINELEEWLNVLDEEKSFLDENLEELSELSNL 127
>gnl|CDD|223783 COG0711, AtpF, F0F1-type ATP synthase, subunit b [Energy production
and conversion].
Length = 161
Score = 33.8 bits (78), Expect = 0.044
Identities = 20/96 (20%), Positives = 44/96 (45%), Gaps = 7/96 (7%)
Query: 177 ENGELDYKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSE-QEKRE--RRTMER 233
+ + ++L EE+QA L+ + L + E+ +++ K + +E + +ER
Sbjct: 43 ADDLAEAERLKEEAQALLAEYEQELEEAREQASEIIEQAKKEAEQIAEEIKAEAEEELER 102
Query: 234 KLSEMEVELK-ELDQFKKENQRLKDENGALIRVISK 268
E E++ E ++ +E L+ E L I++
Sbjct: 103 IKEAAEAEIEAEKERALEE---LRAEVAELAVAIAE 135
>gnl|CDD|237833 PRK14845, PRK14845, translation initiation factor IF-2;
Provisional.
Length = 1049
Score = 34.9 bits (80), Expect = 0.048
Identities = 37/134 (27%), Positives = 49/134 (36%), Gaps = 17/134 (12%)
Query: 136 SLNSLRSEAPSEGYE-----SGGDNLSTSRSRVNSESGGSSRKLPNENGELDYKKLYEES 190
SL LR S+ Y SG NL R + G S K E E +K +
Sbjct: 265 SLLLLRFGIVSKIYRSTLIISGKRNLENFRKYI-----GFSVKEKAEALEKIIEKSKKSE 319
Query: 191 ----QAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELKELD 246
EL RLR T EL+ YSK E E + L+ +E K LD
Sbjct: 320 RYPINEELKRLRLLFGFTRNELSSNIPFYSKYESEEAPSYE--ILMEILNSIERGSKNLD 377
Query: 247 -QFKKENQRLKDEN 259
+ +++D N
Sbjct: 378 KKIAVLEGKIRDHN 391
>gnl|CDD|179877 PRK04778, PRK04778, septation ring formation regulator EzrA;
Provisional.
Length = 569
Score = 34.4 bits (80), Expect = 0.049
Identities = 16/64 (25%), Positives = 34/64 (53%), Gaps = 3/64 (4%)
Query: 184 KKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELK 243
+K + L +E + +EE+++VK +Y +L+E E R +E++L +E +
Sbjct: 309 EKNSDTLPDFLEHAKEQNKELKEEIDRVKQSY---TLNESELESVRQLEKQLESLEKQYD 365
Query: 244 ELDQ 247
E+ +
Sbjct: 366 EITE 369
Score = 28.6 bits (65), Expect = 4.2
Identities = 24/100 (24%), Positives = 40/100 (40%), Gaps = 24/100 (24%)
Query: 182 DYKKLYEESQAELSRLRETLSKTEEELNQVKATY--SKNSLSEQEKRER-----RTMERK 234
++ E+ EL L E+ K EE+ Q+K Y + SL R +E++
Sbjct: 116 LIEEDIEQILEELQELLESEEKNREEVEQLKDLYRELRKSLL--ANRFSFGPALDELEKQ 173
Query: 235 LSEMEVELKELDQFKKEN------------QRLKDENGAL 262
L +E +E QF + +L++E AL
Sbjct: 174 LENLE---EEFSQFVELTESGDYVEAREILDQLEEELAAL 210
>gnl|CDD|217330 pfam03035, RNA_capsid, Calicivirus putative RNA polymerase/capsid
protein.
Length = 226
Score = 33.9 bits (78), Expect = 0.057
Identities = 24/90 (26%), Positives = 36/90 (40%)
Query: 28 PQSRNLSSYSSGLSSSTPYLNSSYPSYGSTYGGTSSGYGGLTLPSSYSRPSQSSPRSYHA 87
P S +SYS G +SS + S S S+ S+ GL+ S S + S S+
Sbjct: 108 PNSMATTSYSGGFTSSPVPVPPSSSSSASSVSSQSTQSTGLSSSSYSSSSASSRTSSWVR 167
Query: 88 PPSSYRSSYAPTTTPLPPVRPRRSKSSTRS 117
+S + P V P S +S+
Sbjct: 168 SQNSNLEPFMPGALQTAWVTPPSSTASSSG 197
>gnl|CDD|218107 pfam04484, DUF566, Family of unknown function (DUF566). Family of
related proteins that is plant specific.
Length = 313
Score = 34.1 bits (78), Expect = 0.057
Identities = 37/117 (31%), Positives = 51/117 (43%), Gaps = 5/117 (4%)
Query: 6 PPRRSSGRTTSRPRSLYSSSPSPQSRNLSSYSSGLSSSTPYLNSSYPSYGSTYGGTSSGY 65
PR SS R S S+S P R L++ +S SSS SS S+G + SS
Sbjct: 16 SPRSSSRRRLSSSFLSTSASSRP--RRLNAPASPPSSSPARNTSSSSSFGLSKQRPSSLS 73
Query: 66 GGLTLPSSYSRPSQSSPRSYHAPPSSYRSSYAPTTTPLPPVRPRRSKSSTRSANYTP 122
G L S + PS+ SP + + S ++ P R RR+ SS S+ P
Sbjct: 74 RGR-LSSRFVSPSRGSPSAAASLNGSLATASTS--GSSSPSRSRRTTSSDLSSGNGP 127
>gnl|CDD|179699 PRK03992, PRK03992, proteasome-activating nucleotidase;
Provisional.
Length = 389
Score = 34.0 bits (79), Expect = 0.061
Identities = 13/60 (21%), Positives = 28/60 (46%), Gaps = 17/60 (28%)
Query: 222 EQEKRERRTMERKLSEMEVE----LKELDQFKKENQRLK-------------DENGALIR 264
+ + + R +E KL ++E E +EL++ K E ++LK D+ +++
Sbjct: 11 SELEEQIRQLELKLRDLEAENEKLERELERLKSELEKLKSPPLIVATVLEVLDDGRVVVK 70
>gnl|CDD|215145 PLN02258, PLN02258, 9-cis-epoxycarotenoid dioxygenase NCED.
Length = 590
Score = 33.9 bits (78), Expect = 0.077
Identities = 16/55 (29%), Positives = 23/55 (41%)
Query: 69 TLPSSYSRPSQSSPRSYHAPPSSYRSSYAPTTTPLPPVRPRRSKSSTRSANYTPE 123
+ S S PS +SPR PS+ + P+ PLP + S T T +
Sbjct: 19 SSSSQSSPPSSTSPRPRRRKPSASSLLHTPSILPLPKLSSPSPPSVTLPPAATTQ 73
>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein. Function of MutS2 is
unknown. It should not be considered a DNA mismatch
repair protein. It is likely a DNA mismatch binding
protein of unknown cellular function [DNA metabolism,
Other].
Length = 771
Score = 34.0 bits (78), Expect = 0.081
Identities = 16/66 (24%), Positives = 29/66 (43%), Gaps = 8/66 (12%)
Query: 196 RLRETLSKTEEELNQVKATYSKNSLSEQEKR---ERRTMERKLSEMEVELKELDQFKKEN 252
+ + + +EE+N + LS EK + +E+ L E E KEL+Q +E
Sbjct: 501 QAKTFYGEFKEEINVLIE-----KLSALEKELEQKNEHLEKLLKEQEKLKKELEQEMEEL 555
Query: 253 QRLKDE 258
+ +
Sbjct: 556 KERERN 561
Score = 32.5 bits (74), Expect = 0.28
Identities = 20/92 (21%), Positives = 36/92 (39%), Gaps = 5/92 (5%)
Query: 170 SSRKLPNENGELDYK-----KLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQE 224
++L +N L+ KL +E + E+ L+E + EL + K E E
Sbjct: 523 LEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKERERNKKLELEKEAQEALKALKKEVE 582
Query: 225 KRERRTMERKLSEMEVELKELDQFKKENQRLK 256
R E+K+ + + D K + + K
Sbjct: 583 SIIRELKEKKIHKAKEIKSIEDLVKLKETKQK 614
Score = 30.6 bits (69), Expect = 1.0
Identities = 18/74 (24%), Positives = 33/74 (44%), Gaps = 4/74 (5%)
Query: 182 DYKKLYEESQAELSRLRETLSKTEEELNQVKATYS---KNSLSEQEKRERRTMERKLSEM 238
+K E+ L +L + K ++EL Q +N E EK + ++ E+
Sbjct: 522 ALEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKERERNKKLELEKEAQEALKALKKEV 581
Query: 239 EVELKELDQFKKEN 252
E ++EL + KK +
Sbjct: 582 ESIIREL-KEKKIH 594
>gnl|CDD|234141 TIGR03185, DNA_S_dndD, DNA sulfur modification protein DndD. This
model describes the DndB protein encoded by an operon
associated with a sulfur-containing modification to DNA.
The operon is sporadically distributed in bacteria, much
like some restriction enzyme operons. DndD is described
as a putative ATPase. The small number of examples known
so far include species from among the Firmicutes,
Actinomycetes, Proteobacteria, and Cyanobacteria [DNA
metabolism, Restriction/modification].
Length = 650
Score = 33.9 bits (78), Expect = 0.098
Identities = 19/83 (22%), Positives = 38/83 (45%), Gaps = 1/83 (1%)
Query: 170 SSRKLPNENGELDYKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERR 229
R+L E E+D K S+ ++++L E L + + EL + +A + + +E
Sbjct: 399 ELRELEEELAEVDKKISTIPSEEQIAQLLEELGEAQNELFRSEAEIEELLRQLETLKEAI 458
Query: 230 TMERKLSEMEVELKELDQFKKEN 252
RK + + + K + F+ E
Sbjct: 459 EALRKTLDEKTKQKI-NAFELER 480
>gnl|CDD|218545 pfam05300, DUF737, Protein of unknown function (DUF737). This
family consists of several uncharacterized mammalian
proteins of unknown function.
Length = 187
Score = 32.8 bits (75), Expect = 0.099
Identities = 39/150 (26%), Positives = 62/150 (41%), Gaps = 12/150 (8%)
Query: 127 RMSRSGSTTSLNSLRSEAP------SEGYESGGDNLSTSRSRVNSESGGSSRKLPNENGE 180
RM +S SE L + S ++ L + +
Sbjct: 22 RMKEPSQPSSKPQPSPGFGPSSVPESELRRLIAGELKGALEDAKRPSEETAGGLQSSEVK 81
Query: 181 LDYKKLYEESQA----ELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLS 236
D K YE+ QA EL+R+ + + EE K S +EQE+++ + + R+L
Sbjct: 82 EDLLKRYEQEQAIVQEELARIAKREREAAEEQLSRAVLREKAS-AEQERQKAKHLARQLE 140
Query: 237 EMEVELKELDQFKKEN-QRLKDENGALIRV 265
E E ELK LD F KE RL+++N ++
Sbjct: 141 EKEAELKRLDAFYKEQLARLEEKNSEFYKL 170
>gnl|CDD|217933 pfam04156, IncA, IncA protein. Chlamydia trachomatis is an
obligate intracellular bacterium that develops within a
parasitophorous vacuole termed an inclusion. The
inclusion is non-fusogenic with lysosomes but intercepts
lipids from a host cell exocytic pathway. Initiation of
chlamydial development is concurrent with modification
of the inclusion membrane by a set of C.
trachomatis-encoded proteins collectively designated
Incs. One of these Incs, IncA, is functionally
associated with the homotypic fusion of inclusions. This
family probably includes members of the wider Inc family
rather than just IncA.
Length = 186
Score = 32.8 bits (75), Expect = 0.10
Identities = 23/92 (25%), Positives = 43/92 (46%), Gaps = 24/92 (26%)
Query: 184 KKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELK 243
+ EE Q ELS L++ LS+ +EEL +E +++E+E EL+
Sbjct: 71 PQKLEELQGELSELKQQLSELQEELED--------------------LEERIAELESELE 110
Query: 244 EL----DQFKKENQRLKDENGALIRVISKLSK 271
+L ++ + L++ +L I +L+K
Sbjct: 111 DLKEDLQLLRELLKSLEERLESLEESIKELAK 142
>gnl|CDD|224346 COG1429, CobN, Cobalamin biosynthesis protein CobN and related
Mg-chelatases [Coenzyme metabolism].
Length = 1388
Score = 33.9 bits (78), Expect = 0.10
Identities = 19/76 (25%), Positives = 29/76 (38%), Gaps = 2/76 (2%)
Query: 97 APTTTPLPPVRPRRSKSSTRSANYTPEGGGRMSRSGSTTSLNSLRSEAPSEGYESGGDNL 156
+ TP P + ST S+ + G + SG+ ++ SL + G G
Sbjct: 1294 PASATPGAPESVGTTAVSTASSASSATVTGSDAGSGADSTGPSLGAAGSVTGAGEG--YE 1351
Query: 157 STSRSRVNSESGGSSR 172
T + SES G S
Sbjct: 1352 MTKEAVSGSESTGMSF 1367
>gnl|CDD|220133 pfam09177, Syntaxin-6_N, Syntaxin 6, N-terminal. Members of this
family, which are found in the amino terminus of various
SNARE proteins, adopt a structure consisting of an
antiparallel three-helix bundle. Their exact function
has not been determined, though it is known that they
regulate the SNARE motif, as well as mediate various
protein-protein interactions involved in
membrane-transport.
Length = 96
Score = 31.1 bits (71), Expect = 0.14
Identities = 16/74 (21%), Positives = 36/74 (48%), Gaps = 4/74 (5%)
Query: 170 SSRKLPNENGELDYKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERR 229
S ++ +++ + +L +E + L + L EE ++ V+A K L E+E RR
Sbjct: 23 SWLRIRSDSNSPELDELTKELRNALESIEWDLEDLEEAVSVVEANPEKFGLDEEEISRRR 82
Query: 230 ----TMERKLSEME 239
+ +++E++
Sbjct: 83 RFVAEIRNQINEVK 96
>gnl|CDD|203462 pfam06476, DUF1090, Protein of unknown function (DUF1090). This
family consists of several bacterial proteins of unknown
function and is known as YqjC in E. coli.
Length = 115
Score = 31.5 bits (72), Expect = 0.14
Identities = 17/61 (27%), Positives = 34/61 (55%), Gaps = 8/61 (13%)
Query: 185 KLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELKE 244
L E QA++ + +++ E+EL + +A + +K E++ +RKL+E + EL+E
Sbjct: 63 GLRAERQAKVEEKEQEVAEREQELKEAQAK------GDADKIEKK--QRKLAEAQAELQE 114
Query: 245 L 245
Sbjct: 115 A 115
Score = 27.6 bits (62), Expect = 2.9
Identities = 20/81 (24%), Positives = 40/81 (49%), Gaps = 12/81 (14%)
Query: 180 ELDYKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEME 239
+++Y K + +Q ++ L E+ L +V+A + L + + + E++++E E
Sbjct: 32 QIEYAKAHG-NQHRIAGL-------EKALKEVQAHCTDAGLRAERQAKVEEKEQEVAERE 83
Query: 240 VELKEL----DQFKKENQRLK 256
ELKE D K E ++ K
Sbjct: 84 QELKEAQAKGDADKIEKKQRK 104
>gnl|CDD|233508 TIGR01649, hnRNP-L_PTB, hnRNP-L/PTB/hephaestus splicing factor
family. Included in this family of heterogeneous
ribonucleoproteins are PTB (polypyrimidine tract binding
protein ) and hnRNP-L. These proteins contain four RNA
recognition motifs (rrm: pfam00067).
Length = 481
Score = 33.2 bits (76), Expect = 0.14
Identities = 15/57 (26%), Positives = 21/57 (36%)
Query: 50 SYPSYGSTYGGTSSGYGGLTLPSSYSRPSQSSPRSYHAPPSSYRSSYAPTTTPLPPV 106
+ Y S G + GG + + PS+ P AP + SSY P V
Sbjct: 221 GHDGYSSHGGPLAPLAGGDRMGPPHGPPSRYRPAYEAAPLAPAISSYGPAGGGPGSV 277
>gnl|CDD|226513 COG4026, COG4026, Uncharacterized protein containing TOPRIM domain,
potential nuclease [General function prediction only].
Length = 290
Score = 32.6 bits (74), Expect = 0.16
Identities = 16/76 (21%), Positives = 34/76 (44%), Gaps = 8/76 (10%)
Query: 181 LDYKKLYEESQAELSRLRETLSKTEEELNQV----KATYSKNSLSEQEKRER----RTME 232
+ K+ EE Q E L + L + E E +V K +NS E+ ++ ++
Sbjct: 138 EELKEKLEELQKEKEELLKELEELEAEYEEVQERLKRLEVENSRLEEMLKKLPGEVYDLK 197
Query: 233 RKLSEMEVELKELDQF 248
++ E+E ++ ++
Sbjct: 198 KRWDELEPGVELPEEE 213
>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y. Members of this family are
RNase Y, an endoribonuclease. The member from Bacillus
subtilis, YmdA, has been shown to be involved in
turnover of yitJ riboswitch [Transcription, Degradation
of RNA].
Length = 514
Score = 33.0 bits (76), Expect = 0.18
Identities = 26/88 (29%), Positives = 41/88 (46%), Gaps = 2/88 (2%)
Query: 183 YKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVEL 242
KL E + EL R L + E L Q + T + S +K E +E+K E+ +
Sbjct: 60 VHKLRAELERELKERRNELQRLERRLLQREETLDRKMESLDKKEE--NLEKKEKELSNKE 117
Query: 243 KELDQFKKENQRLKDENGALIRVISKLS 270
K LD+ ++E + L E + IS L+
Sbjct: 118 KNLDEKEEELEELIAEQREELERISGLT 145
>gnl|CDD|218899 pfam06102, DUF947, Domain of unknown function (DUF947). Family of
eukaryotic proteins with unknown function.
Length = 168
Score = 31.9 bits (73), Expect = 0.18
Identities = 25/101 (24%), Positives = 44/101 (43%), Gaps = 18/101 (17%)
Query: 162 RVNSESGGSSRKLPNENGELDYKKLYEESQAELSRLRETL--SKTEEELNQVKATYSKNS 219
R +S SG + +N YK L + + E+ L + L +K EE ++K T S
Sbjct: 37 RFDSLSGKLNLDKFRKN----YKFLDDYREKEIEELEKALKKTKDSEEKEELKRTL--QS 90
Query: 220 LSEQEK-RERRTMERKLSEMEVELKELDQFKKENQRLKDEN 259
+ + K + + ER++ L + KK+ + L E
Sbjct: 91 MKSRLKTLKNKDREREI---------LKEHKKQEKELIKEG 122
>gnl|CDD|184697 PRK14475, PRK14475, F0F1 ATP synthase subunit B; Provisional.
Length = 167
Score = 31.8 bits (72), Expect = 0.18
Identities = 21/74 (28%), Positives = 38/74 (51%), Gaps = 13/74 (17%)
Query: 180 ELD-YKKLYEESQAELSRLR-----------ETLSKTEEELNQVKATYSKNSLSEQEKRE 227
ELD ++L EE+QA L+ ++ L+ + + +++A +K L EQ KR
Sbjct: 49 ELDEAQRLREEAQALLADVKAEREEAERQAAAMLAAAKADARRMEAE-AKEKLEEQIKRR 107
Query: 228 RRTMERKLSEMEVE 241
ERK+++ E +
Sbjct: 108 AEMAERKIAQAEAQ 121
>gnl|CDD|143630 cd07892, PolyPPase_VTC2-3_like, Polyphosphate(polyP) polymerase
domain of yeast vacuolar transport chaperone (VTC)
proteins VTC-2, and -3 , and similar proteins.
Saccharomyces cerevisiae VTC-1, -2, -3, and -4 comprise
the membrane-integral VTC complex. VTC-2, -3, and -4
contain polyP polymerase domains. S. cerevisiae
VTC-2,and -3 belong to this subgroup. For VTC4 it has
been shown that this domain generates polyP from ATP by
a phosphotransfer reaction releasing ADP. This activity
is metal ion-dependent. The ATP gamma phosphate may be
cleaved and then transferred to an acceptor phosphate to
form polyP. PolyP is ubiquitous. In prokaryotes, it is a
store of phosphate and energy. In eukaryotes, polyPs
have roles in bone calcification, and osmoregulation,
and in phosphate transport in the symbiosis of
mycorrhizal fungi and plants. This subgroup belongs to
the CYTH/triphosphate tunnel metalloenzyme (TTM)-like
superfamily, whose enzymes have a unique active site
located within an eight-stranded beta barrel.
Length = 303
Score = 32.3 bits (74), Expect = 0.19
Identities = 26/101 (25%), Positives = 42/101 (41%), Gaps = 24/101 (23%)
Query: 176 NENGELDYKKLYEESQAELSRLR-------------------ETLSKTEEELNQVKATYS 216
N N +L KL + ++A RLR E S EE+ Q+K Y
Sbjct: 63 NPNFDLYNDKLLKLNEAPTLRLRWTGKLSDKPDIFVEKKTFDENTSSFEEDKLQLKEKYI 122
Query: 217 KNSLSEQEKRERRTMERKLSEMEVELKELDQFKKENQRLKD 257
+ + K E+KL +ME +L+ KK+ + ++D
Sbjct: 123 NGFIFGKYK-----FEKKLQKMEKRGADLENLKKDVENIQD 158
>gnl|CDD|197874 smart00787, Spc7, Spc7 kinetochore protein. This domain is found
in cell division proteins which are required for
kinetochore-spindle association.
Length = 312
Score = 32.3 bits (74), Expect = 0.20
Identities = 17/70 (24%), Positives = 30/70 (42%), Gaps = 13/70 (18%)
Query: 191 QAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKL-------SEMEVELK 243
EL R +E L K +E+ L E E E + +E K+ SE+ E+
Sbjct: 203 PTELDRAKEKLKKLLQEI-----MIKVKKLEELE-EELQELESKIEDLTNKKSELNTEIA 256
Query: 244 ELDQFKKENQ 253
E ++ ++ +
Sbjct: 257 EAEKKLEQCR 266
>gnl|CDD|225939 COG3404, COG3404, Methenyl tetrahydrofolate cyclohydrolase [Amino
acid transport and metabolism].
Length = 208
Score = 31.6 bits (72), Expect = 0.32
Identities = 17/54 (31%), Positives = 27/54 (50%)
Query: 177 ENGELDYKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRT 230
E+ + + K++ EE Q + L + K EE N + A Y +E+EK RR
Sbjct: 53 EDYDDEMKEILEELQKIKAELLALIDKDEEAFNLIMAAYKLPKSTEEEKAARRK 106
>gnl|CDD|222636 pfam14265, DUF4355, Domain of unknown function (DUF4355). This
family of proteins is found in bacteria and viruses.
Proteins in this family are typically between 180 and
214 amino acids in length.
Length = 125
Score = 30.7 bits (70), Expect = 0.33
Identities = 14/74 (18%), Positives = 31/74 (41%), Gaps = 4/74 (5%)
Query: 190 SQAELSRLRETLSKTEEELNQVKAT--YSKNSLSEQEKR--ERRTMERKLSEMEVELKEL 245
+ E+ + E+ + K + +S +EK E +E++L E+E EL
Sbjct: 9 TDKEVDKAIAKEKAKWEKKQEEKKSEAEKLAKMSAEEKAEYELEKLEKELEELEAELARR 68
Query: 246 DQFKKENQRLKDEN 259
+ + + L ++
Sbjct: 69 ELKAEAKKMLSEKG 82
>gnl|CDD|240430 PTZ00473, PTZ00473, Plasmodium Vir superfamily; Provisional.
Length = 420
Score = 31.7 bits (72), Expect = 0.33
Identities = 26/91 (28%), Positives = 33/91 (36%), Gaps = 5/91 (5%)
Query: 10 SSGRTTSRPRSLYSSSPSPQSRNLSSYSSGLSSSTPYLNSSYPSYGSTYGGT-----SSG 64
SGRT S + S + SSY + + ST S ST GG+ S+
Sbjct: 329 RSGRTGSSESIRGFTYDSSTTYGGSSYGTSQTDSTSTYGSRSTFDSSTGGGSQSGGGSTY 388
Query: 65 YGGLTLPSSYSRPSQSSPRSYHAPPSSYRSS 95
G T S S S SY P + S
Sbjct: 389 GGSSTFDGSSRGSSDSFGVSYFGPQQTVGFS 419
>gnl|CDD|163064 TIGR02894, DNA_bind_RsfA, transcription factor, RsfA family. In a
subset of endospore-forming members of the Firmcutes,
members of this protein family are found, several to a
genome. Two very strongly conserved sequences regions
are separated by a highly variable linker region. Much
of the linker region was excised from the seed alignment
for this model. A characterized member is the
prespore-specific transcription RsfA from Bacillus
subtilis, previously called YwfN, which is controlled by
sigma factor F and seems to fine-tune expression of some
genes in the sigma-F regulon. A paralog in Bacillus
subtilis is designated YlbO [Regulatory functions, DNA
interactions, Cellular processes, Sporulation and
germination].
Length = 161
Score = 30.9 bits (70), Expect = 0.34
Identities = 14/78 (17%), Positives = 34/78 (43%), Gaps = 3/78 (3%)
Query: 184 KKLYEESQAELSRLRETLSKTEEELNQVKATY---SKNSLSEQEKRERRTMERKLSEMEV 240
+K YEE+ + R+ L + L + + + ++ ++ ER ++ E
Sbjct: 56 RKQYEEAIELAKKQRKELKREAGSLTLQDVISFLQNLKTTNPSDQALQKENERLKNQNES 115
Query: 241 ELKELDQFKKENQRLKDE 258
K ++ +KE ++L+
Sbjct: 116 LQKRNEELEKELEKLRQR 133
Score = 28.2 bits (63), Expect = 3.0
Identities = 14/62 (22%), Positives = 30/62 (48%), Gaps = 12/62 (19%)
Query: 222 EQEKRERRTMERKLSEMEVE------------LKELDQFKKENQRLKDENGALIRVISKL 269
E K++R+ ++R+ + ++ +KEN+RLK++N +L + +L
Sbjct: 64 ELAKKQRKELKREAGSLTLQDVISFLQNLKTTNPSDQALQKENERLKNQNESLQKRNEEL 123
Query: 270 SK 271
K
Sbjct: 124 EK 125
>gnl|CDD|237629 PRK14160, PRK14160, heat shock protein GrpE; Provisional.
Length = 211
Score = 31.3 bits (71), Expect = 0.37
Identities = 19/83 (22%), Positives = 30/83 (36%), Gaps = 5/83 (6%)
Query: 187 YEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELKELD 246
E A+ + E K E + K + E E E E +++
Sbjct: 3 KECKDAKHENMEEDCCKENENKEE-----DKGKEEDLEFEEIEKEEIIEDSEESNEVKIE 57
Query: 247 QFKKENQRLKDENGALIRVISKL 269
+ K EN +LK+EN L + L
Sbjct: 58 ELKDENNKLKEENKKLENELEAL 80
>gnl|CDD|220614 pfam10174, Cast, RIM-binding protein of the cytomatrix active zone.
This is a family of proteins that form part of the CAZ
(cytomatrix at the active zone) complex which is
involved in determining the site of synaptic vesicle
fusion. The C-terminus is a PDZ-binding motif that binds
directly to RIM (a small G protein Rab-3A effector). The
family also contains four coiled-coil domains.
Length = 774
Score = 31.9 bits (72), Expect = 0.43
Identities = 19/80 (23%), Positives = 38/80 (47%), Gaps = 4/80 (5%)
Query: 183 YKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVEL 242
+ E+ Q + RL++TL +EE+ ++ LS + + E ++ E E +
Sbjct: 133 LRDTLEKLQLRMERLQQTLDARDEEIKKLLEMLQSKGLSAKAEEEDSERLGRIREAEDQQ 192
Query: 243 KEL----DQFKKENQRLKDE 258
L +Q +KE+ L++E
Sbjct: 193 SHLEVLLEQKEKEHMMLREE 212
Score = 30.4 bits (68), Expect = 1.0
Identities = 19/75 (25%), Positives = 36/75 (48%), Gaps = 1/75 (1%)
Query: 188 EESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELKELDQ 247
E +Q E L++ L + + +L + K S + KRER + + + + V ++L
Sbjct: 6 ENAQRENELLKKELERKQSKLGSSMNSI-KTFWSPELKRERALRKDEAARITVWKEQLRV 64
Query: 248 FKKENQRLKDENGAL 262
++ENQ + AL
Sbjct: 65 TQEENQHAQLTVQAL 79
>gnl|CDD|204032 pfam08703, PLC-beta_C, PLC-beta C terminal. This domain
corresponds to the alpha helical C terminal domain of
phospholipase C beta.
Length = 181
Score = 30.9 bits (70), Expect = 0.47
Identities = 15/56 (26%), Positives = 29/56 (51%), Gaps = 1/56 (1%)
Query: 184 KKLYEESQ-AELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEM 238
+L E Q A+L +L+E K ++EL + + E + +E+ ER+ +E+
Sbjct: 43 TELAREKQAAQLKKLKEISEKEKKELKKKLDRKRLERIQEAKTKEKHAQEREKTEI 98
>gnl|CDD|200948 pfam00038, Filament, Intermediate filament protein.
Length = 312
Score = 31.4 bits (72), Expect = 0.47
Identities = 21/80 (26%), Positives = 41/80 (51%), Gaps = 6/80 (7%)
Query: 183 YKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLS---EQEKRERRTMERKLSEME 239
YK EE Q +R + L +EE+ +++ SL + K+++ ++ER+L+E+E
Sbjct: 193 YKSKLEELQQAAARNGDALRSAKEEITELRRQI--QSLEIELQSLKKQKASLERQLAELE 250
Query: 240 VEL-KELDQFKKENQRLKDE 258
EL ++ L++E
Sbjct: 251 ERYELELADYQDTISELEEE 270
>gnl|CDD|215532 PLN02982, PLN02982, galactinol-raffinose
galactosyltransferase/ghydrolase, hydrolyzing O-glycosyl
compounds.
Length = 865
Score = 31.3 bits (71), Expect = 0.52
Identities = 17/81 (20%), Positives = 34/81 (41%), Gaps = 5/81 (6%)
Query: 171 SRKLPNENGELDYKKLYEESQAELSRLRETLSKTEEELNQV-----KATYSKNSLSEQEK 225
+ + E+ E KK E +LS + + ++EL+ + K+ S++ S K
Sbjct: 327 YKAIEREHAEKARKKAIESGVTDLSEFDAKIKQLKKELDAMFDGEEKSVSSESESSGSCK 386
Query: 226 RERRTMERKLSEMEVELKELD 246
M+ ++ + K LD
Sbjct: 387 VSGSGMKAFTRDLRTKFKGLD 407
>gnl|CDD|184923 PRK14959, PRK14959, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 624
Score = 31.6 bits (71), Expect = 0.52
Identities = 23/92 (25%), Positives = 31/92 (33%), Gaps = 7/92 (7%)
Query: 52 PSYGSTYGGTSSGYGGLTLPSSYSRPSQSSPRSYHAPPSSYRSSYAPTTTPLP------- 104
GS G +SG P ++P + P S+ ++ AP+ P P
Sbjct: 380 APSGSAAEGPASGGAATIPTPGTQGPQGTAPAAGMTPSSAAPATPAPSAAPSPRVPWDDA 439
Query: 105 PVRPRRSKSSTRSANYTPEGGGRMSRSGSTTS 136
P P RS R A PE S S
Sbjct: 440 PPAPPRSGIPPRPAPRMPEASPVPGAPDSVAS 471
>gnl|CDD|220664 pfam10267, Tmemb_cc2, Predicted transmembrane and coiled-coil 2
protein. This family of transmembrane coiled-coil
containing proteins is conserved from worms to humans.
Its function is unknown.
Length = 387
Score = 31.3 bits (71), Expect = 0.56
Identities = 34/169 (20%), Positives = 67/169 (39%), Gaps = 28/169 (16%)
Query: 122 PEGGGR-MSRSGSTTSLNSLRSEAPSEGYESGGDNLSTSRSRVNSESGGSSRKLPNENGE 180
PE R +S S + S+ SE +S + ++ + +S P
Sbjct: 150 PEENPRNLSGSATLVLSPKYGSDDSSE---------CSSATSSSAGANSNSGSGPGSAVA 200
Query: 181 LDYKKLYEESQAELSRLRETLSKTEEELNQVKATYSKN----SLSEQEKRERRT-MERKL 235
++ EL ++E S+ EE ++K ++ + S QE+R R +E +L
Sbjct: 201 QQQSLNLDKLLEELREIKEGQSRLEESYERLKEQLQRDYQYITQSLQEERYRYERLEEQL 260
Query: 236 SE-MEVELKELDQFKKE------------NQRLKDENGALIRVISKLSK 271
++ E+ E+ K+E +R +D A+ +++SK
Sbjct: 261 NDLTELHQNEIANLKQELASMEEKVAYQSYERARDIQEAMESCQTRISK 309
>gnl|CDD|227352 COG5019, CDC3, Septin family protein [Cell division and chromosome
partitioning / Cytoskeleton].
Length = 373
Score = 31.1 bits (71), Expect = 0.58
Identities = 15/71 (21%), Positives = 31/71 (43%), Gaps = 6/71 (8%)
Query: 176 NENGELDYKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQE-KRERRTMERK 234
+GE K+++E E R + +++ + EQ ER+ + K
Sbjct: 296 KNSGEPSLKEIHEARLNEEERELKKKFTE-----KIREKEKRLEELEQNLIEERKELNSK 350
Query: 235 LSEMEVELKEL 245
L E++ +L++L
Sbjct: 351 LEEIQKKLEDL 361
Score = 30.0 bits (68), Expect = 1.5
Identities = 21/89 (23%), Positives = 39/89 (43%), Gaps = 10/89 (11%)
Query: 184 KKLYEESQAE-LSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVEL 242
LYE + E LS L+ + + +E+++ + + L ++ + R E++L E+E L
Sbjct: 281 NLLYENYRTEKLSGLKNSGEPSLKEIHEARLNEEERELKKKFTEKIREKEKRLEELEQNL 340
Query: 243 KELDQFKKENQRLKDENGALIRVISKLSK 271
E K+ N L + KL
Sbjct: 341 IE---------ERKELNSKLEEIQKKLED 360
>gnl|CDD|193205 pfam12729, 4HB_MCP_1, Four helix bundle sensory module for signal
transduction. This family is a four helix bundle that
operates as a ubiquitous sensory module in prokaryotic
signal-transduction. The 4HB_MCP is always found between
two predicted transmembrane helices indicating that it
detects only extracellular signals. In many cases the
domain is associated with a cytoplasmic HAMP domain
suggesting that most proteins carrying the bundle might
share the mechanism of transmembrane signalling which is
well-characterized in E coli chemoreceptors.
Length = 181
Score = 30.3 bits (69), Expect = 0.64
Identities = 14/76 (18%), Positives = 31/76 (40%), Gaps = 2/76 (2%)
Query: 183 YKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVEL 242
+ + AE L + + + E++++ Y K L+E+EK+ + +L
Sbjct: 66 LELILTTDPAERDELLKDIEELRAEIDKLLKKYEKTILTEEEKKLFNEFKEQLKAYRKVR 125
Query: 243 KELDQFKKENQRLKDE 258
++ K + DE
Sbjct: 126 NKVLDLAKAGK--NDE 139
>gnl|CDD|225950 COG3416, COG3416, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 233
Score = 30.6 bits (69), Expect = 0.70
Identities = 23/67 (34%), Positives = 27/67 (40%), Gaps = 1/67 (1%)
Query: 59 GGTSSGYGGLTLPSSYSRPSQSSPRSYHAPPSSYRSSYA-PTTTPLPPVRPRRSKSSTRS 117
G S P + P+ + P A P S+RSS A PTT P P RS S
Sbjct: 82 GSFLSNAFKWGTPQEPAPPANAPPPKEPAAPPSWRSSPAGPTTQPSPAAANTRSSSFLGG 141
Query: 118 ANYTPEG 124
A T G
Sbjct: 142 ALQTAAG 148
>gnl|CDD|191367 pfam05761, 5_nucleotid, 5' nucleotidase family. This family of
eukaryotic proteins includes 5' nucleotidase enzymes,
such as purine 5'-nucleotidase EC:3.1.3.5.
Length = 448
Score = 31.1 bits (71), Expect = 0.71
Identities = 11/59 (18%), Positives = 22/59 (37%)
Query: 180 ELDYKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEM 238
E++ + EL RL L + + L+ + + E R + R + E+
Sbjct: 329 EIEIWNTEQYRFEELQRLDILLERLYDHLDVHAELSTLRPDLSELDAEIRELRRAMKEL 387
>gnl|CDD|130902 TIGR01843, type_I_hlyD, type I secretion membrane fusion protein,
HlyD family. Type I secretion is an ABC transport
process that exports proteins, without cleavage of any
signal sequence, from the cytosol to extracellular
medium across both inner and outer membranes. The
secretion signal is found in the C-terminus of the
transported protein. This model represents the adaptor
protein between the ATP-binding cassette (ABC) protein
of the inner membrane and the outer membrane protein,
and is called the membrane fusion protein. This model
selects a subfamily closely related to HlyD; it is
defined narrowly and excludes, for example, colicin V
secretion protein CvaA and multidrug efflux proteins
[Protein fate, Protein and peptide secretion and
trafficking].
Length = 423
Score = 30.7 bits (70), Expect = 0.71
Identities = 20/62 (32%), Positives = 29/62 (46%), Gaps = 4/62 (6%)
Query: 188 EESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQE----KRERRTMERKLSEMEVELK 243
QA+L LR+ L EEL + K +S E +RER + +L +E EL+
Sbjct: 161 AGLQAQLQALRQQLEVISEELEARRKLKEKGLVSRLELLELERERAEAQGELGRLEAELE 220
Query: 244 EL 245
L
Sbjct: 221 VL 222
>gnl|CDD|197664 smart00338, BRLZ, basic region leucin zipper.
Length = 65
Score = 28.3 bits (64), Expect = 0.75
Identities = 22/59 (37%), Positives = 31/59 (52%), Gaps = 10/59 (16%)
Query: 221 SEQEKRERRTM----------ERKLSEMEVELKELDQFKKENQRLKDENGALIRVISKL 269
E EKR RR ERK +E+E ++++Q + EN+RLK E L R + KL
Sbjct: 1 EEDEKRRRRRERNREAARRSRERKKAEIEELERKVEQLEAENERLKKEIERLRRELEKL 59
Score = 26.4 bits (59), Expect = 3.6
Identities = 15/52 (28%), Positives = 30/52 (57%), Gaps = 9/52 (17%)
Query: 217 KNSLSEQEKRERRT-----MERKLSEMEVE----LKELDQFKKENQRLKDEN 259
+N + + RER+ +ERK+ ++E E KE+++ ++E ++LK E
Sbjct: 12 RNREAARRSRERKKAEIEELERKVEQLEAENERLKKEIERLRRELEKLKSEL 63
>gnl|CDD|214378 CHL00160, rpl9, ribosomal protein L9; Provisional.
Length = 153
Score = 29.7 bits (67), Expect = 0.77
Identities = 19/76 (25%), Positives = 34/76 (44%), Gaps = 3/76 (3%)
Query: 197 LRETLSK--TEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELKELDQFKKENQR 254
L+E + K + +VK+ Y++N L + + T L + ++ K LD KE +
Sbjct: 10 LKENIQKLGKSGTVIKVKSGYARNYLIPNKMAKVATQGS-LKQQKMYQKILDLKLKEAKE 68
Query: 255 LKDENGALIRVISKLS 270
+ L+ I K S
Sbjct: 69 KCLKVKQLLEEIQKFS 84
>gnl|CDD|221408 pfam12072, DUF3552, Domain of unknown function (DUF3552). This
presumed domain is functionally uncharacterized. This
domain is found in bacteria, archaea and eukaryotes.
This domain is about 200 amino acids in length. This
domain is found associated with pfam00013, pfam01966.
This domain has a single completely conserved residue A
that may be functionally important.
Length = 201
Score = 30.2 bits (69), Expect = 0.78
Identities = 26/90 (28%), Positives = 46/90 (51%), Gaps = 6/90 (6%)
Query: 184 KKLYEESQAELSRLRETLSKTEEELNQVKATYSK--NSLSEQEKRERRTMERKLSEMEVE 241
KL E++ EL R L + E+ L Q + T + SL EK+E ++E K E+
Sbjct: 63 HKLRAEAERELKERRNELQRQEKRLLQKEETLDRKDESL---EKKEE-SLEEKEKELAAR 118
Query: 242 LKELDQFKKENQRLKDENGALIRVISKLSK 271
++L++ ++E + L +E + IS L+
Sbjct: 119 QQQLEEKEEELEELIEEQQQELERISGLTA 148
>gnl|CDD|218915 pfam06156, DUF972, Protein of unknown function (DUF972). This
family consists of several hypothetical bacterial
sequences. The function of this family is unknown.
Length = 106
Score = 29.2 bits (66), Expect = 0.79
Identities = 13/48 (27%), Positives = 25/48 (52%), Gaps = 10/48 (20%)
Query: 231 MERKLSEMEVELKELDQFKK-------ENQRLKDENGALIRVISKLSK 271
+E++L+++ L EL + K+ EN L+ EN L + +L +
Sbjct: 13 LEQQLNQL---LAELGELKQQLAELLEENAELRIENEHLRERLEELEQ 57
>gnl|CDD|221833 pfam12889, DUF3829, Protein of unknown function (DUF3829). This is
a small family of proteins from several bacterial
species, whose function is not known. It may, however,
be related to the GvpL_GvpF family of proteins,
pfam06386.
Length = 277
Score = 30.5 bits (69), Expect = 0.83
Identities = 17/81 (20%), Positives = 36/81 (44%), Gaps = 4/81 (4%)
Query: 184 KKLYEESQAELSRLRETLSK----TEEELNQVKATYSKNSLSEQEKRERRTMERKLSEME 239
K E +L L++ K T E L + + + S + K++ ++ L+E+E
Sbjct: 141 KINDERLLEQLEELKKEGKKIAYYTLELLLEAEQLLNLFSSKDVAKKDAADLDAALAELE 200
Query: 240 VELKELDQFKKENQRLKDENG 260
L++L + KE+ + +
Sbjct: 201 KALEDLKKLTKEDGDEEKASS 221
>gnl|CDD|238303 cd00542, Ntn_PVA, Penicillin V acylase (PVA), also known as
conjugated bile salt acid hydrolase (CBAH), catalyzes
the hydrolysis of penicillin V to yield 6-amino
penicillanic acid (6-APA), an important key intermediate
of semisynthetic penicillins. PVA has an N-terminal
nucleophilic cysteine, as do other members of the Ntn
hydrolase family to which PVA belongs. This
nucleophilic cysteine is exposed by post-translational
prossessing of the PVA precursor. PVA forms a
homotetramer.
Length = 303
Score = 30.6 bits (70), Expect = 0.84
Identities = 15/47 (31%), Positives = 18/47 (38%), Gaps = 4/47 (8%)
Query: 32 NLSSYSSGLSSSTPYLNSSYPSYGSTYGGTSSGYGGLTLPSSYSRPS 78
NL +Y LS P + G G G L LP Y+ PS
Sbjct: 184 NLRNYI-NLSPEPP---KNVKLGGVNLTAFGQGSGTLGLPGDYTPPS 226
>gnl|CDD|148617 pfam07111, HCR, Alpha helical coiled-coil rod protein (HCR). This
family consists of several mammalian alpha helical
coiled-coil rod HCR proteins. The function of HCR is
unknown but it has been implicated in psoriasis in
humans and is thought to affect keratinocyte
proliferation.
Length = 739
Score = 30.8 bits (69), Expect = 0.85
Identities = 26/94 (27%), Positives = 51/94 (54%), Gaps = 1/94 (1%)
Query: 162 RVNSESGGSSRKLPNENGELDYKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLS 221
+ ++E S R+ + E+ + L E+ +RLRE LS TE LN+ + ++K +S
Sbjct: 552 QESTEEAASLRQELTQQQEVYGQALQEKVAEVETRLREQLSDTERRLNEARREHAKAVVS 611
Query: 222 EQEKRERRTMERKLSEMEVELKELDQFKKENQRL 255
++ + + E++ ++ L+E + K+E QRL
Sbjct: 612 LRQIQRQAAQEKERNQELRRLQE-EARKEEGQRL 644
>gnl|CDD|114015 pfam05266, DUF724, Protein of unknown function (DUF724). This
family contains several uncharacterized proteins found
in Arabidopsis thaliana and other plants. This region is
often found associated with Agenet domains and may
contain coiled-coil.
Length = 190
Score = 30.1 bits (68), Expect = 0.91
Identities = 14/78 (17%), Positives = 33/78 (42%), Gaps = 2/78 (2%)
Query: 181 LDYKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEV 240
L K + E L + +++ E ++ + ++ E + + + ++ E
Sbjct: 99 LSLKDDQTKKLEERKGLEKEIAEKEISRQELDSEIAELERKILELQRQAALLKEK--KEA 156
Query: 241 ELKELDQFKKENQRLKDE 258
E KE+ + K E ++K E
Sbjct: 157 EDKEIARLKSEASKIKQE 174
>gnl|CDD|144972 pfam01576, Myosin_tail_1, Myosin tail. The myosin molecule is a
multi-subunit complex made up of two heavy chains and
four light chains it is a fundamental contractile
protein found in all eukaryote cell types. This family
consists of the coiled-coil myosin heavy chain tail
region. The coiled-coil is composed of the tail from two
molecules of myosin. These can then assemble into the
macromolecular thick filament. The coiled-coil region
provides the structural backbone the thick filament.
Length = 859
Score = 30.8 bits (70), Expect = 0.98
Identities = 22/87 (25%), Positives = 44/87 (50%), Gaps = 4/87 (4%)
Query: 172 RKLPNENGELDYKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRE--RR 229
R+L E EL + EE++A L + + + EL+Q+++ + ++E+ E R+
Sbjct: 464 RRLEAEKDEL--QAALEEAEAALELEESKVLRAQVELSQIRSEIERRLAEKEEEFENTRK 521
Query: 230 TMERKLSEMEVELKELDQFKKENQRLK 256
+R + ++ L+ + K E RLK
Sbjct: 522 NHQRAIESLQATLEAEAKGKAEASRLK 548
Score = 28.9 bits (65), Expect = 3.3
Identities = 32/133 (24%), Positives = 53/133 (39%), Gaps = 21/133 (15%)
Query: 147 EGYESGGDNLSTSRSRVNSESGGSSRKLPNE------------NGELDYKKL---YEESQ 191
E E+ NLS +S++ S+ + R L E E D L EE
Sbjct: 225 EEAEAQVSNLSKLKSQLESQLEEAKRSLEEESRERANLQAQLRQLEHDLDSLREQLEEES 284
Query: 192 AELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRE--RRTMERKLSEMEVELKEL---- 245
+ L LSK E+ Q ++ + E+ E ++ + +K+SE+E +
Sbjct: 285 EAKAELERQLSKANAEIQQWRSKFESEGALRAEELEELKKKLNQKISELEEAAEAANAKC 344
Query: 246 DQFKKENQRLKDE 258
D +K RL+ E
Sbjct: 345 DSLEKTKSRLQSE 357
Score = 27.7 bits (62), Expect = 8.4
Identities = 24/93 (25%), Positives = 42/93 (45%), Gaps = 14/93 (15%)
Query: 154 DNLSTSRSRVNSESGGSSRKLPNENGELDYKKLYEESQA-ELSRLRETLSKTEEELNQVK 212
D L + + S+ +L +EN +L + E+Q LS+L+ L EE +
Sbjct: 197 DELQRQLNDLTSQKS----RLQSENSDLTRQLEEAEAQVSNLSKLKSQLESQLEEAKR-- 250
Query: 213 ATYSKNSLSEQEKRERRTMERKLSEMEVELKEL 245
E+E RER ++ +L ++E +L L
Sbjct: 251 -------SLEEESRERANLQAQLRQLEHDLDSL 276
>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
Length = 3151
Score = 30.7 bits (69), Expect = 0.99
Identities = 10/50 (20%), Positives = 14/50 (28%)
Query: 77 PSQSSPRSYHAPPSSYRSSYAPTTTPLPPVRPRRSKSSTRSANYTPEGGG 126
P + PR+ R P+ P P S A P+
Sbjct: 2593 PQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPPPSPSPAANEPDPHP 2642
>gnl|CDD|129694 TIGR00606, rad50, rad50. All proteins in this family for which
functions are known are involvedin recombination,
recombinational repair, and/or non-homologous end
joining.They are components of an exonuclease complex
with MRE11 homologs. This family is distantly related to
the SbcC family of bacterial proteins.This family is
based on the phylogenomic analysis of JA Eisen (1999,
Ph.D. Thesis, Stanford University).
Length = 1311
Score = 30.4 bits (68), Expect = 1.1
Identities = 28/100 (28%), Positives = 43/100 (43%), Gaps = 14/100 (14%)
Query: 173 KLPNENGELDYKKL-YEESQAELSRLRE-TLSKTEEELNQVKATYSKNSLSEQEKRERRT 230
KL NE L +K E+ +EL E T+E+LN L +R R
Sbjct: 266 KLDNEIKALKSRKKQMEKDNSELELKMEKVFQGTDEQLND---------LYHNHQRTVRE 316
Query: 231 MERKLSEMEVELKELDQFKKENQRLKDENGALIRVISKLS 270
ER+L + + +EL++ KE + L E L+ +L
Sbjct: 317 KERELVDCQ---RELEKLNKERRLLNQEKTELLVEQGRLQ 353
Score = 29.2 bits (65), Expect = 3.1
Identities = 17/104 (16%), Positives = 39/104 (37%), Gaps = 9/104 (8%)
Query: 177 ENGELDYKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLS 236
E +++ + + EL L+ L + E L+++ ++ + K++ +L
Sbjct: 230 EAQLESSREIVKSYENELDPLKNRLKEIEHNLSKIMKLDNEIKALKSRKKQMEKDNSELE 289
Query: 237 EMEV--------ELKELDQFKKENQRLKDENGALI-RVISKLSK 271
+L +L + R K+ R + KL+K
Sbjct: 290 LKMEKVFQGTDEQLNDLYHNHQRTVREKERELVDCQRELEKLNK 333
>gnl|CDD|114855 pfam06160, EzrA, Septation ring formation regulator, EzrA. During
the bacterial cell cycle, the tubulin-like cell-division
protein FtsZ polymerises into a ring structure that
establishes the location of the nascent division site.
EzrA modulates the frequency and position of FtsZ ring
formation.
Length = 559
Score = 30.2 bits (69), Expect = 1.1
Identities = 21/103 (20%), Positives = 44/103 (42%), Gaps = 30/103 (29%)
Query: 182 DYKKLYEESQAELSRLRETLSKTEEELNQVKATYS--KNSLSEQEKRERRT--------M 231
++ E+ EL+ L+E+ K +E+ ++K Y + +L + +
Sbjct: 112 TIEEDIEQILEELNELKESEEKNRKEVEELKDKYRELRKTLLAK-----SFSYGPALDEL 166
Query: 232 ERKLSEMEVELKELDQFKKEN------------QRLKDENGAL 262
E++L E+E +E +QF + +L++E AL
Sbjct: 167 EKQLDELE---EEFEQFVELTESGDYLEAREVLLKLEEETDAL 206
Score = 30.2 bits (69), Expect = 1.2
Identities = 16/64 (25%), Positives = 34/64 (53%), Gaps = 3/64 (4%)
Query: 184 KKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELK 243
+K ++ L RE + EL++++ +Y+ L+E E R +E++L E+E +
Sbjct: 305 EKNIDKLTDFLEHAREQNKQLLLELDRLQQSYT---LNEDELETVRELEKQLEELEAQYD 361
Query: 244 ELDQ 247
+L +
Sbjct: 362 QLVE 365
>gnl|CDD|220401 pfam09786, CytochromB561_N, Cytochrome B561, N terminal. Members
of this family are found in the N terminal region of
cytochrome B561, as well as in various other putative
uncharacterized proteins.
Length = 559
Score = 30.5 bits (69), Expect = 1.1
Identities = 28/163 (17%), Positives = 53/163 (32%), Gaps = 13/163 (7%)
Query: 20 SLYSSSPSPQSRNLSSYSSGLSSSTPYLNSSYPSYGSTYGGTSSGYGGLTLPSSYSRPSQ 79
S + +S S S+ +++S P + S S + P
Sbjct: 109 QFTVVSQAKKSPPASKTSTPMNTSEPLVPGHSSFSDSPSRSASPSRKFSPSSTIQQSPQL 168
Query: 80 SSPRSYHAPPSSYRSSYAPTTTPLPPVRPRRSKSSTRSANYTPEGGGRMSRSGSTTSLNS 139
+ +P SSY+S + L PV ++S+ RS+ + G + TT
Sbjct: 169 TPSNKPASPSSSYQSPSYS--SSLGPVNSSGNRSNLRSSPWALRSSG--DKKDITTDEKY 224
Query: 140 LRSEAPSEGYESGGDNLSTSRSRVNSESGGSSRKLPNENGELD 182
L E+ + + + S +G ++ N
Sbjct: 225 L---------ETFLAEVDEEQHMITSSAGKNATPPETINSFGS 258
>gnl|CDD|197361 cd09803, UBAN, polyubiquitin binding domain of NEMO and related
proteins. NEMO (NF-kappaB essential modulator) is a
regulatory subunit of the kinase complex IKK, which is
involved in the activation of NF-kappaB via
phosporylation of inhibitory IkappaBs. This mechanism
requires the binding of NEMO to ubiquinated substrates.
Binding is achieved via the UBAN motif (ubiquitin
binding in ABIN and NEMO), which is described in this
model. This region of NEMO has also been named CoZi (for
coiled-coil 2 and leucine zipper). ABINs (A20-binding
inhibitors of NF-kappaB) are sensors for ubiquitin that
are involved in regulation of apoptosis, ABIN-1 is
presumed to inhibit signalling via the NF-kappaB route.
The UBAN motif is also found in optineurin, the product
of a gene associated with glaucoma, which has been
characterized as a negative regulator of NF-kappaB as
well.
Length = 87
Score = 28.5 bits (64), Expect = 1.1
Identities = 19/73 (26%), Positives = 36/73 (49%), Gaps = 13/73 (17%)
Query: 191 QAELSRLRETLSKTEEELNQVKA------TYSKNSLSEQEKRERRTMERKLSEMEVELKE 244
Q ++ L+E +++ E +L + Y + +E+ RE KL + + +L E
Sbjct: 21 QEDIDELKEEIAQQEADLETIPVLKAQAEIYKSDFEAERAARE------KLHQEKEQLAE 74
Query: 245 -LDQFKKENQRLK 256
L+ ++ENQ LK
Sbjct: 75 QLEYLQRENQELK 87
>gnl|CDD|153307 cd07623, BAR_SNX1_2, The Bin/Amphiphysin/Rvs (BAR) domain of
Sorting Nexins 1 and 2. BAR domains are dimerization,
lipid binding and curvature sensing modules found in
many different proteins with diverse functions. Sorting
nexins (SNXs) are Phox homology (PX) domain containing
proteins that are involved in regulating membrane
traffic and protein sorting in the endosomal system.
SNXs differ from each other in their lipid-binding
specificity, subcellular localization and specific
function in the endocytic pathway. A subset of SNXs also
contain BAR domains. The PX-BAR structural unit
determines the specific membrane targeting of SNXs. This
subfamily consists of SNX1, SNX2, and similar proteins.
SNX1 and SNX2 are components of the retromer complex, a
membrane coat multimeric complex required for endosomal
retrieval of lysosomal hydrolase receptors to the Golgi.
The retromer consists of a cargo-recognition subcomplex
and a subcomplex formed by a dimer of sorting nexins
(SNX1 and/or SNX2), which ensures efficient cargo
sorting by facilitating proper membrane localization of
the cargo-recognition subcomplex. BAR domains form
dimers that bind to membranes, induce membrane bending
and curvature, and may also be involved in
protein-protein interactions.
Length = 224
Score = 29.9 bits (68), Expect = 1.1
Identities = 20/81 (24%), Positives = 41/81 (50%), Gaps = 14/81 (17%)
Query: 190 SQAELSRLRETLSKTE-----EELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELK- 243
+Q L++ RE +K E ++L+Q + + E E + R +++ E+ +K
Sbjct: 128 AQQTLTKKREAKAKLELSGRTDKLDQA-----QQEIKEWEAKVDRG-QKEFEEISKTIKK 181
Query: 244 ELDQFKKENQRLKDENGALIR 264
E+++F E R+KD +I+
Sbjct: 182 EIERF--EKNRVKDFKDIIIK 200
>gnl|CDD|132534 TIGR03495, phage_LysB, phage lysis regulatory protein, LysB family.
Members of this protein family are phage lysis
regulatory protein, including the well-studied protein
LysB (lysis protein B) of Enterobacteria phage P2. For
members of this family, genes are found in phage or in
prophage regions of bacterial genomes, typically near a
phage lysozyme or phage holin.
Length = 135
Score = 29.3 bits (66), Expect = 1.2
Identities = 17/80 (21%), Positives = 26/80 (32%), Gaps = 5/80 (6%)
Query: 190 SQAELSRLRETLSKTEEELNQVKATYSK-----NSLSEQEKRERRTMERKLSEMEVELKE 244
L R L + L +A + L KR + ++
Sbjct: 17 QSQRLRNARADLERANRVLKAQQAELASKANQLIVLLALAKRNEEAQAQLRQQLAQARAL 76
Query: 245 LDQFKKENQRLKDENGALIR 264
L Q ++ +RLK EN L R
Sbjct: 77 LAQREQRIERLKRENEDLRR 96
>gnl|CDD|131526 TIGR02473, flagell_FliJ, flagellar export protein FliJ. Members of
this family are the FliJ protein found, in nearly every
case, in the midst of other flagellar biosynthesis genes
in bacgterial genomes. Typically the fliJ gene is found
adjacent to the gene for the flagellum-specific ATPase
FliI. Sequence scoring in the gray zone between trusted
and noise cutoffs include both probable FliJ proteins
and components of bacterial type III secretion systems.
Length = 141
Score = 29.2 bits (66), Expect = 1.2
Identities = 14/53 (26%), Positives = 24/53 (45%), Gaps = 3/53 (5%)
Query: 183 YKKLYEESQAELSRLRETLSKTEEELNQV---KATYSKNSLSEQEKRERRTME 232
++ Q E+ RE L + EL + K K +E+ KRE++ M+
Sbjct: 80 QQQELALLQQEVEAKRERLLEARRELKALEKLKEKKQKEYRAEEAKREQKEMD 132
>gnl|CDD|219791 pfam08317, Spc7, Spc7 kinetochore protein. This domain is found in
cell division proteins which are required for
kinetochore-spindle association.
Length = 321
Score = 30.0 bits (68), Expect = 1.2
Identities = 17/83 (20%), Positives = 42/83 (50%), Gaps = 7/83 (8%)
Query: 172 RKLPNENGELDYKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTM 231
R+L +E D +L E+++ EL L +S+ ++L +++ + +++ +
Sbjct: 195 RQLADELNLCDPLEL-EKARQELRSLSVKISEKRKQLEELQQELQELTIAIEALTN---- 249
Query: 232 ERKLSEMEVELKELDQFKKENQR 254
K SE+ E+ E ++ ++E +
Sbjct: 250 --KKSELLEEIAEAEKIREECRG 270
>gnl|CDD|223599 COG0525, ValS, Valyl-tRNA synthetase [Translation, ribosomal
structure and biogenesis].
Length = 877
Score = 30.3 bits (69), Expect = 1.3
Identities = 17/60 (28%), Positives = 37/60 (61%), Gaps = 7/60 (11%)
Query: 192 AELSRLRETLSKTEEELNQVKATYS-KNSLS---EQEKRERRTMERKLSEMEVELKELDQ 247
AEL+RL + L K E+E+++++ S + ++ E+ + + KL+E +V+L +L++
Sbjct: 814 AELARLEKELEKLEKEIDRIEKKLSNEGFVAKAPEEVVEKEK---EKLAEYQVKLAKLEE 870
>gnl|CDD|227278 COG4942, COG4942, Membrane-bound metallopeptidase [Cell division
and chromosome partitioning].
Length = 420
Score = 30.1 bits (68), Expect = 1.3
Identities = 13/81 (16%), Positives = 29/81 (35%), Gaps = 9/81 (11%)
Query: 184 KKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELK 243
Y + + L T ++L V+A + E+ + LSE +
Sbjct: 149 AIYYGALNPARAERIDALKATLKQLAAVRAEIA---------AEQAELTTLLSEQRAQQA 199
Query: 244 ELDQFKKENQRLKDENGALIR 264
+L Q +E ++ + + +
Sbjct: 200 KLAQLLEERKKTLAQLNSELS 220
>gnl|CDD|191675 pfam07047, OPA3, Optic atrophy 3 protein (OPA3). This family
consists of several optic atrophy 3 (OPA3) proteins.
OPA3 deficiency causes type III 3-methylglutaconic
aciduria (MGA) in humans. This disease manifests with
early bilateral optic atrophy, spasticity,
extrapyramidal dysfunction, ataxia, and cognitive
deficits, but normal longevity.
Length = 134
Score = 29.2 bits (66), Expect = 1.3
Identities = 9/38 (23%), Positives = 18/38 (47%), Gaps = 5/38 (13%)
Query: 215 YSKNSLSEQEKRERR-----TMERKLSEMEVELKELDQ 247
Y + S E +K E +E ++ E+E+E++
Sbjct: 96 YQRQSRKEAKKEEELQQELEELEARVGELELEIERQKA 133
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 30.3 bits (68), Expect = 1.3
Identities = 38/173 (21%), Positives = 69/173 (39%), Gaps = 10/173 (5%)
Query: 17 RPRSLYS-----SSPSP---QSRNLSSYSSGLSSSTPYLNSSYPSYGSTYGGTSSGYGGL 68
RP+ L S SS P SR+LS+ + SS P +SS + G
Sbjct: 43 RPQGLPSLWAAVSSQQPIKLASRDLSNDEMMMVSSDPPKSSSEMEGEMLAPQATVGRDEA 102
Query: 69 TLPSSYSRPSQSSPRSYHAPPSSYRSSYAPTTTPLPPVRPRRSKSSTRSANYTPEGGGR- 127
T + R+ P++ +++Y+PT V+ + +R+ N+ GR
Sbjct: 103 TPSIAMENTPSPPRRTAKITPTTPKNNYSPTAAGTERVKEDTPATPSRALNHYISTSGRQ 162
Query: 128 MSRSGSTTSLNSLRSEAPSEGYESGGDNLSTSRSR-VNSESGGSSRKLPNENG 179
+S + ++S +P++ E + R VNS + + +P +G
Sbjct: 163 RVKSYTPKPRGEVKSSSPTQTREKVRKYTPSPLGRMVNSYAPSTFMTMPRSHG 215
>gnl|CDD|225256 COG2382, Fes, Enterochelin esterase and related enzymes [Inorganic
ion transport and metabolism].
Length = 299
Score = 29.7 bits (67), Expect = 1.4
Identities = 14/58 (24%), Positives = 20/58 (34%), Gaps = 9/58 (15%)
Query: 45 PYLNSSYPS---------YGSTYGGTSSGYGGLTLPSSYSRPSQSSPRSYHAPPSSYR 93
PY+ YP+ G + GG S Y GL P + S + P +
Sbjct: 164 PYVEERYPTSADADGRVLAGDSLGGLVSLYAGLRHPERFGHVLSQSGSFWWTPLDTQP 221
>gnl|CDD|183704 PRK12726, PRK12726, flagellar biosynthesis regulator FlhF;
Provisional.
Length = 407
Score = 30.1 bits (67), Expect = 1.4
Identities = 19/70 (27%), Positives = 41/70 (58%), Gaps = 1/70 (1%)
Query: 178 NGELDYKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSE 237
+G +D+++ +++S + + L K +E+L ++ A ++ + S ER+T E +LS
Sbjct: 63 DGRMDFQETFQKSVVKNRPKTDDLLK-KEKLLEMLAAGAELAQSTPLLEERKTQEEELSA 121
Query: 238 MEVELKELDQ 247
M +EL L++
Sbjct: 122 MRLELAALNR 131
>gnl|CDD|226371 COG3853, TelA, Uncharacterized protein involved in tellurite
resistance [Inorganic ion transport and metabolism].
Length = 386
Score = 29.7 bits (67), Expect = 1.5
Identities = 16/56 (28%), Positives = 31/56 (55%), Gaps = 3/56 (5%)
Query: 199 ETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELKE-LDQFKKENQ 253
ETL KT++ + V A + + E+ +++R R+L ++E E+K+ L K+
Sbjct: 331 ETLKKTQQSV--VAAIEETSQIQEEGRKKRADTIRELDKLEEEIKKKLILLKQNIS 384
>gnl|CDD|218351 pfam04961, FTCD_C, Formiminotransferase-cyclodeaminase. Members of
this family are thought to be Formiminotransferase-
cyclodeaminase enzymes EC:4.3.1.4. This domain is found
in the C-terminus of the bifunctional animal members of
the family.
Length = 176
Score = 29.1 bits (66), Expect = 1.6
Identities = 17/59 (28%), Positives = 24/59 (40%), Gaps = 13/59 (22%)
Query: 184 KKLYEESQAELSRLRETLSKTEEEL-----------NQVKATY--SKNSLSEQEKRERR 229
KK YE+ + E+ + E + EEL N V A Y K + E+ RE
Sbjct: 44 KKKYEDVEEEMKEILEKAEELREELLALADEDAEAFNAVMAAYKLPKETEEEKAAREAA 102
>gnl|CDD|236798 PRK10929, PRK10929, putative mechanosensitive channel protein;
Provisional.
Length = 1109
Score = 30.0 bits (68), Expect = 1.6
Identities = 27/83 (32%), Positives = 45/83 (54%), Gaps = 10/83 (12%)
Query: 180 ELDYKKLYEESQAELSRLR-ETLSKTEEEL-NQVKATYSKNSLSEQEKRE-RRTMER--K 234
EL+ +L ++ EL+RLR E K ++L ++A +N L+ Q +RE R +E
Sbjct: 191 ELELAQLSANNRQELARLRSELAKKRSQQLDAYLQAL--RNQLNSQRQREAERALESTEL 248
Query: 235 LSEMEVELKE--LDQFKKENQRL 255
L+E +L + + QFK N+ L
Sbjct: 249 LAEQSGDLPKSIVAQFKI-NREL 270
>gnl|CDD|223023 PHA03249, PHA03249, DNA packaging tegument protein UL25;
Provisional.
Length = 653
Score = 30.0 bits (67), Expect = 1.7
Identities = 21/114 (18%), Positives = 38/114 (33%), Gaps = 10/114 (8%)
Query: 71 PSSYSRPSQSSPRSYHAPPSSYRSSYAPTTTPLPPVRPRRSKSSTRS----ANYTPEGG- 125
SS+ S++ S R+ P RR+K++ A T
Sbjct: 59 KSSFEVVSETDSGSEAEAERGRRAGMGGRNKATKP--SRRNKTTQCRPTSLALATAATMP 116
Query: 126 --GRMSRSGSTTSLNSLRSEAPSEGYESGGDNLSTSRSRVNSESGGSSRKLPNE 177
+S +S S+ S E + ++ S ++ES S + +E
Sbjct: 117 ATPSSGKSPKVSSPPSIPS-LSEEDEGAERNSGGDDSSHTDNESTQSQPEADDE 169
>gnl|CDD|203738 pfam07716, bZIP_2, Basic region leucine zipper.
Length = 54
Score = 27.2 bits (61), Expect = 1.7
Identities = 12/46 (26%), Positives = 26/46 (56%), Gaps = 5/46 (10%)
Query: 217 KNSLSEQEKRERRTMERKLSEMEVELKELDQFKKENQRLKDENGAL 262
+N+ + + RE++ + E+E +KEL ++EN +L+ + L
Sbjct: 11 RNNEAARRSREKKKQREE--ELEERVKEL---EEENAQLRQKVEQL 51
>gnl|CDD|224234 COG1315, COG1315, Uncharacterized conserved protein [Function
unknown].
Length = 543
Score = 29.7 bits (67), Expect = 1.8
Identities = 13/71 (18%), Positives = 24/71 (33%)
Query: 188 EESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELKELDQ 247
E L L E +S EE L ++ + E ++ K S +
Sbjct: 406 PEIVERLKELTEEISLHEERLKKLTKLLVALVKVKIESKKNILPPDKESLLTAVNNTKIT 465
Query: 248 FKKENQRLKDE 258
+ +++K E
Sbjct: 466 LRNSIEKIKAE 476
>gnl|CDD|227355 COG5022, COG5022, Myosin heavy chain [Cytoskeleton].
Length = 1463
Score = 30.0 bits (68), Expect = 1.8
Identities = 15/80 (18%), Positives = 32/80 (40%), Gaps = 2/80 (2%)
Query: 182 DYKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRT-MERKLSEMEV 240
Y + Q + R + L +TEE +KA K ++R + +K +
Sbjct: 814 SYLACIIKLQKTIKREK-KLRETEEVEFSLKAEVLIQKFGRSLKAKKRFSLLKKETIYLQ 872
Query: 241 ELKELDQFKKENQRLKDENG 260
+ ++ +++ Q LK +
Sbjct: 873 SAQRVELAERQLQELKIDVK 892
Score = 29.3 bits (66), Expect = 3.0
Identities = 21/101 (20%), Positives = 41/101 (40%), Gaps = 8/101 (7%)
Query: 175 PNENGELDYKKLYEESQAELSR----LRETLSKTEEELNQVKATYSKNSLSEQEKRERRT 230
+ KKL E + E S L + ++ + + K ++ Q +
Sbjct: 821 KLQKTIKREKKLRETEEVEFSLKAEVLIQKFGRSLKAKKRFSLL-KKETIYLQSAQRVEL 879
Query: 231 MERKLSEMEVELKELDQFKKENQRLKDENGALIRVISKLSK 271
ER+L E+++++K + K N L+ E +I + LS
Sbjct: 880 AERQLQELKIDVKSISSLKLVNLELESE---IIELKKSLSS 917
>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
Length = 434
Score = 29.8 bits (67), Expect = 1.8
Identities = 19/73 (26%), Positives = 32/73 (43%), Gaps = 4/73 (5%)
Query: 188 EESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELKELDQ 247
++ Q L + L ++L +VK SE+ + + + + K E E + KE
Sbjct: 365 DKRQELLKEYNKKLQDYTKKLGEVKDETDA---SEEAEAKAKEEKLKQEENEKKQKEQAD 421
Query: 248 FKKENQRLKDENG 260
KE +R KDE
Sbjct: 422 EDKE-KRQKDERK 433
>gnl|CDD|219404 pfam07415, Herpes_LMP2, Gammaherpesvirus latent membrane protein
(LMP2) protein. This family consists of several
Gammaherpesvirus latent membrane protein (LMP2)
proteins. Epstein-Barr virus is a human Gammaherpesvirus
that infects and establishes latency in B lymphocytes in
vivo. The latent membrane protein 2 (LMP2) gene is
expressed in latently infected B cells and encodes two
protein isoforms, LMP2A and LMP2B, that are identical
except for an additional N-terminal 119 aa cytoplasmic
domain which is present in the LMP2A isoform. LMP2A is
thought to play a key role in either the establishment
or the maintenance of latency and/or the reactivation of
productive infection from the latent state. The
significance of LMP2B and its role in pathogenesis
remain unclear.
Length = 489
Score = 29.8 bits (67), Expect = 1.8
Identities = 18/59 (30%), Positives = 21/59 (35%), Gaps = 3/59 (5%)
Query: 71 PSSYSRPSQSSPRSYHAPPSSYRSSYAPTTTPLPPVRPRRSKSSTRSANYTPEGGGRMS 129
P S+ P S PSS+ SS+ P PPV S R Y G R
Sbjct: 14 PRSHGGPDGDEGDSNPYYPSSFGSSW---DRPGPPVPEDYDAPSHRPPPYGGSNGDRHG 69
Score = 29.4 bits (66), Expect = 1.9
Identities = 25/103 (24%), Positives = 32/103 (31%), Gaps = 26/103 (25%)
Query: 23 SSSPSPQSRNLSSYSSGLSSSTPYLNSSY-------PSYGSTYGGTSSGYGGLTL--PSS 73
S+P S S+ S P + Y P YG + G GY L PS
Sbjct: 26 DSNPYYPS----SFGSSWDRPGPPVPEDYDAPSHRPPPYGGSNGDRHGGYQPLGQQDPSL 81
Query: 74 YSR----------PSQSSPR---SYHAPPSSYRSSYAPTTTPL 103
Y+ P SPR S H + P P+
Sbjct: 82 YAGLGQNGGGGLPPPPYSPRDQGSEHVYEEPRDARMVPPWLPV 124
>gnl|CDD|215130 PLN02217, PLN02217, probable pectinesterase/pectinesterase
inhibitor.
Length = 670
Score = 29.7 bits (66), Expect = 1.9
Identities = 28/117 (23%), Positives = 52/117 (44%), Gaps = 13/117 (11%)
Query: 36 YSSGLSSSTPYLNSSYPSYGSTYGGTSSGYGGLTLPSSYSRPSQSSPRSY-HAPPSSYRS 94
Y GL + P +S P+ + T+ PS+ PS S P + +PP++
Sbjct: 557 YIPGLFAGNPGSTNSTPTGSAASSNTT---FSSDSPSTVVAPSTSPPAGHLGSPPATPSK 613
Query: 95 SYAPTTTP------LPPVRPRRSKSSTRSA---NYTPEGGGRMSRSGSTTSLNSLRS 142
+P+T+P P P +SS + A +PE +++ + S+ S+ S+ +
Sbjct: 614 IVSPSTSPPASHLGSPSTTPSSPESSIKVASTETASPESSIKVASTESSVSMVSMST 670
>gnl|CDD|219083 pfam06548, Kinesin-related, Kinesin-related. This family
represents a region within kinesin-related proteins from
higher plants. Many family members also contain the
pfam00225 domain. Kinesins are ATP-driven microtubule
motor proteins that produce directed force. Some family
members are associated with the phragmoplast, a
structure composed mainly of microtubules that executes
cytokinesis in higher plants.
Length = 488
Score = 29.5 bits (66), Expect = 2.1
Identities = 20/73 (27%), Positives = 33/73 (45%), Gaps = 1/73 (1%)
Query: 187 YEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVEL-KEL 245
Y E Q + + L E + E + VK +K + R + +LS + E KE
Sbjct: 346 YAELQEKYNDLLEKHRRIMEGIEDVKKAAAKAGVKGAGSRFANALAAELSALRAEREKER 405
Query: 246 DQFKKENQRLKDE 258
+ KKEN+ L+ +
Sbjct: 406 ELLKKENKSLQIQ 418
Score = 29.1 bits (65), Expect = 3.1
Identities = 17/84 (20%), Positives = 34/84 (40%), Gaps = 1/84 (1%)
Query: 186 LYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELKEL 245
+ EE + ++ + S E +K T + E+ + + + E EV L+E+
Sbjct: 173 IDEEFLSLMNEHKLLKSDYENHPEVLKETIELKRIQEELESFKNFY-FDMGEKEVLLEEI 231
Query: 246 DQFKKENQRLKDENGALIRVISKL 269
K + Q + + + R S L
Sbjct: 232 QDLKSQLQCMLSSSKSTARTRSSL 255
>gnl|CDD|219342 pfam07227, DUF1423, Protein of unknown function (DUF1423). This
family represents a conserved region approximately 500
residues long within a number of Arabidopsis thaliana
proteins of unknown function.
Length = 446
Score = 29.3 bits (66), Expect = 2.1
Identities = 30/137 (21%), Positives = 52/137 (37%), Gaps = 10/137 (7%)
Query: 107 RPRRSKSSTRSANYTPEGGGRM---SRSGSTTSLNSLRSEAPSEGYESGGDNLSTSRSRV 163
+ + R E RM +RS L +A E + R +
Sbjct: 295 IAEVVQEAVRKMESVAEEKMRMLKKARSALDACDRELEEKAR----EVSELKMERQRKKP 350
Query: 164 NSESGGSSRKLPNENGELDYKKLYEESQAELSRL-RETLSKTEEELNQVKATYSKNSLSE 222
+ S +L ++ ++ +E++ E RL R L+KTE+ + + Y K LSE
Sbjct: 351 QIDELESIVRLKQAEADM-FQLKADEARREAERLQRIALAKTEKSEEEYASKYLKLRLSE 409
Query: 223 QEKRERRTMERKLSEME 239
E+ + E +L E
Sbjct: 410 AEEERQYLFE-ELKLQE 425
>gnl|CDD|220252 pfam09468, RNase_H2-Ydr279, Ydr279p protein family (RNase H2
complex component). RNases H are enzymes that
specifically hydrolyse RNA when annealed to a
complementary DNA and are present in all living
organisms. In yeast RNase H2 is composed of a complex of
three proteins (Rnh2Ap, Ydr279p and Ylr154p), this
family represents the homologues of Ydr279p. It is not
known whether non yeast proteins in this family fulfil
the same function.
Length = 287
Score = 29.2 bits (66), Expect = 2.1
Identities = 10/64 (15%), Positives = 25/64 (39%), Gaps = 3/64 (4%)
Query: 171 SRKLPNENGELDYKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRT 230
++L ++K L + + + RET E ++ + K E+ K+++
Sbjct: 208 YKELLKSLLIPEFKPLDKYLKESKKKKRETEEDVEAAESRAEK---KRKSKEEIKKKKPK 264
Query: 231 MERK 234
+
Sbjct: 265 ESKG 268
>gnl|CDD|222465 pfam13942, Lipoprotein_20, YfhG lipoprotein. This family includes
the YfhG protein from E. coli. Members of this family
have an N-terminal lipoprotein attachment site. The
members of this family are functionally uncharacterized.
Length = 180
Score = 28.8 bits (65), Expect = 2.2
Identities = 18/55 (32%), Positives = 27/55 (49%), Gaps = 12/55 (21%)
Query: 183 YKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRT-MERKLS 236
Y++L + S +EL RLRE ++ + EL K E T +ER+LS
Sbjct: 121 YQRLQQSSDSELDRLREQQARLQYELETTT-----------RKLENLTDIERQLS 164
>gnl|CDD|151681 pfam11239, DUF3040, Protein of unknown function (DUF3040). Some
members in this family of proteins with unknown function
are annotated as membrane proteins however this cannot
be confirmed.
Length = 82
Score = 27.6 bits (62), Expect = 2.3
Identities = 8/19 (42%), Positives = 10/19 (52%)
Query: 219 SLSEQEKRERRTMERKLSE 237
LSE E+R +ER L
Sbjct: 2 PLSEHEQRRLEEIERALYA 20
>gnl|CDD|206035 pfam13864, Enkurin, Calmodulin-binding. This is a family of
apparent calmodulin-binding proteins found at high
levels in the testis and vomeronasal organ and at lower
levels in certain other tissues. Enkurin is a scaffold
protein that binds PI3 kinase to sperm transient
receptor potential (canonical) (TRPC) channels. The
mammalian transient receptor potential (canonical)
channels are the primary candidates for the Ca(2+) entry
pathway activated by the hormones, growth factors, and
neurotransmitters that exert their effect through
activation of PLC. Calmodulin binds to the C-terminus of
all TRPC channels, and dissociation of calmodulin from
TRPC4 results in profound activation of the channel.
Length = 98
Score = 27.9 bits (63), Expect = 2.3
Identities = 18/68 (26%), Positives = 36/68 (52%), Gaps = 4/68 (5%)
Query: 184 KKLYEESQAELSRLRETLSKTEEELNQ--VKATYSKNSLSEQEKRERRTMERKLSEMEVE 241
S+ E L L K +ELN+ K ++ S++ ++E +E++L+E+E +
Sbjct: 32 PGHRLLSEEERLELLNGLKKNWDELNKEYQKLPVVIDTPSKKRRKEE--LEKELAEIEED 89
Query: 242 LKELDQFK 249
+K+L + K
Sbjct: 90 IKKLSRPK 97
>gnl|CDD|218806 pfam05911, DUF869, Plant protein of unknown function (DUF869).
This family consists of a number of sequences found in
Arabidopsis thaliana, Oryza sativa and Lycopersicon
esculentum (Tomato). The function of this family is
unknown.
Length = 767
Score = 29.5 bits (66), Expect = 2.3
Identities = 27/101 (26%), Positives = 40/101 (39%), Gaps = 19/101 (18%)
Query: 150 ESGGDNLSTSRSRVNSESGGSSRKLPNENGELDYKK---LYEESQA---ELSRLRETLSK 203
ES G N +RSR + S+ +N + K+ L A E L+E LSK
Sbjct: 224 ESLGSNFVDNRSRRSPSPASSAFHSSPQNLQKAQKENEFLTARLLAMEDENKTLKEALSK 283
Query: 204 TEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELKE 244
EL + Y+K + KLS +E +L+
Sbjct: 284 RNSELQFSRNMYAKTA-------------SKLSVLEAQLES 311
>gnl|CDD|215364 PLN02678, PLN02678, seryl-tRNA synthetase.
Length = 448
Score = 29.3 bits (66), Expect = 2.4
Identities = 14/61 (22%), Positives = 33/61 (54%), Gaps = 3/61 (4%)
Query: 185 KLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELKE 244
K + + Q EL LR+ +K +E+ ++K + + E + ++++++E E E++E
Sbjct: 40 KEWRQRQFELDSLRKEFNKLNKEVAKLKI---AKEDATELIAETKELKKEITEKEAEVQE 96
Query: 245 L 245
Sbjct: 97 A 97
>gnl|CDD|235229 PRK04156, gltX, glutamyl-tRNA synthetase; Provisional.
Length = 567
Score = 29.4 bits (67), Expect = 2.4
Identities = 21/64 (32%), Positives = 33/64 (51%), Gaps = 13/64 (20%)
Query: 195 SRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELKELDQFKKENQ- 253
S+ +E + +E + +V NSLS +E+RER L E+ EL E ++ KKE +
Sbjct: 42 SKAKEIIPIVKEVVEEV------NSLSLEEQRER------LEELAPELLEEEEEKKEEKK 89
Query: 254 RLKD 257
L
Sbjct: 90 GLPP 93
>gnl|CDD|131935 TIGR02889, spore_YpeB, germination protein YpeB. Members of this
family are YpeB, a protein usually encoded with the
putative spore-cortex-lytic enzyme SleB and required,
together with SleB, for normal germination. This family
is retricted to endospore-forming species in the
Firmicutes lineage of bacteria, and found in all such
species to date except Clostridium perfringens. The
matching phenotypes of mutants in SleB (called a lytic
transglycosylase) and YpeB suggests that YpeB is
necessary to allow SleB to function [Cellular processes,
Sporulation and germination].
Length = 435
Score = 29.3 bits (66), Expect = 2.5
Identities = 14/80 (17%), Positives = 29/80 (36%), Gaps = 6/80 (7%)
Query: 177 ENGELDYKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSL------SEQEKRERRT 230
+ + K L ++ L+ L K E +L +V+ + + E +
Sbjct: 113 SVKDAEGKSLSDKEYKTLTTLYNQAVKLENQLRKVQNIVMQGGVRWGEIRKLYSGDEAQM 172
Query: 231 MERKLSEMEVELKELDQFKK 250
E L++ + K + QF
Sbjct: 173 PEAILNDFKDVEKNVTQFPT 192
>gnl|CDD|226655 COG4192, COG4192, Signal transduction histidine kinase regulating
phosphoglycerate transport system [Signal transduction
mechanisms].
Length = 673
Score = 29.1 bits (65), Expect = 2.6
Identities = 13/51 (25%), Positives = 25/51 (49%), Gaps = 1/51 (1%)
Query: 193 ELSRLRETLSKTEEELN-QVKATYSKNSLSEQEKRERRTMERKLSEMEVEL 242
EL R+ + L + +N + ++ +N + E ER+ E L + + EL
Sbjct: 392 ELGRIAQLLRLLLQRVNEKTQSLRERNQELQAEVEERQEAEAHLKKTQGEL 442
>gnl|CDD|220368 pfam09730, BicD, Microtubule-associated protein Bicaudal-D. BicD
proteins consist of three coiled-coiled domains and are
involved in dynein-mediated minus end-directed transport
from the Golgi apparatus to the endoplasmic reticulum
(ER). For full functioning they bind with GSK-3beta
pfam05350 to maintain the anchoring of microtubules to
the centromere. It appears that amino-acid residues
437-617 of BicD and the kinase activity of GSK-3 are
necessary for the formation of a complex between BicD
and GSK-3beta in intact cells.
Length = 711
Score = 29.0 bits (65), Expect = 2.7
Identities = 27/112 (24%), Positives = 55/112 (49%), Gaps = 12/112 (10%)
Query: 170 SSRKLPNENGELDYKKLYEESQAELSRLRETLSKTEEELNQVKATYSK--------NSLS 221
SS + +GE + L +ES ++ + + + + + EL Q++A S +SLS
Sbjct: 5 SSHRKVAADGEEREESLLQESASKEAYYLQRILELQAELKQLRAELSNVQAENERLSSLS 64
Query: 222 EQEKRERRTMERKLSEMEVELKELD----QFKKENQRLKDENGALIRVISKL 269
++ K E +E + + E+KE + ++ L++EN +L + +S L
Sbjct: 65 QELKEENEMLELQRGRLRDEIKEYKFREARLLQDYSELEEENISLQKQVSVL 116
>gnl|CDD|224259 COG1340, COG1340, Uncharacterized archaeal coiled-coil protein
[Function unknown].
Length = 294
Score = 28.9 bits (65), Expect = 2.7
Identities = 15/79 (18%), Positives = 38/79 (48%), Gaps = 4/79 (5%)
Query: 184 KKLYEESQAELSRLRETLSKTEE---ELNQVKATYSKNSL-SEQEKRERRTMERKLSEME 239
++L E+Q + + + +E E +++ + + S ++ E R ++ +L E+E
Sbjct: 182 QELANEAQEYHEEMIKLFEEADELRKEADELHEEFVELSKKIDELHEEFRNLQNELRELE 241
Query: 240 VELKELDQFKKENQRLKDE 258
++K L +K +R +
Sbjct: 242 KKIKALRAKEKAAKRREKR 260
>gnl|CDD|179385 PRK02224, PRK02224, chromosome segregation protein; Provisional.
Length = 880
Score = 29.2 bits (66), Expect = 2.7
Identities = 29/100 (29%), Positives = 52/100 (52%), Gaps = 6/100 (6%)
Query: 162 RVNSESGGSSRKLPNENGELDYKKLYE---ESQAELSRLRETLSKTEEELNQVKATYSKN 218
RV S+ GS +L + E + K L+E ++EL+ L E + + EE+ Q + T +
Sbjct: 180 RVLSDQRGSLDQLKAQIEEKEEKDLHERLNGLESELAELDEEIERYEEQREQARETRDEA 239
Query: 219 SLSEQEKRERRTMERKLSEMEVELKELDQFKKENQRLKDE 258
+E ERR +L +E E+++L + E +R ++E
Sbjct: 240 DEVLEEHEERRE---ELETLEAEIEDLRETIAETEREREE 276
Score = 28.9 bits (65), Expect = 3.5
Identities = 18/58 (31%), Positives = 26/58 (44%), Gaps = 7/58 (12%)
Query: 188 EESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKR-ERRTMERKLSEMEVELKE 244
EE +AEL L E + + EE L + + L E E R ER R+ E + +
Sbjct: 478 EELEAELEDLEEEVEEVEERLERAE------DLVEAEDRIERLEERREDLEELIAERR 529
Score = 28.5 bits (64), Expect = 4.3
Identities = 20/76 (26%), Positives = 38/76 (50%), Gaps = 15/76 (19%)
Query: 188 EESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRT-MERKLSEMEVELKELD 246
EE++ + R E L + EE+L E RE R ++ ++ +E EL+EL+
Sbjct: 649 EEAREDKERAEEYLEQVEEKL--------------DELREERDDLQAEIGAVENELEELE 694
Query: 247 QFKKENQRLKDENGAL 262
+ ++ + L++ AL
Sbjct: 695 ELRERREALENRVEAL 710
Score = 28.1 bits (63), Expect = 6.3
Identities = 19/94 (20%), Positives = 42/94 (44%), Gaps = 10/94 (10%)
Query: 173 KLPNENGELDYKKLYEESQAELSRLRETLSKTEEELNQVKATYSK--------NSLSEQE 224
+L E EL+ EE++ + RE + + EEE+ +++ + E+
Sbjct: 360 ELREEAAELE--SELEEAREAVEDRREEIEELEEEIEELRERFGDAPVDLGNAEDFLEEL 417
Query: 225 KRERRTMERKLSEMEVELKELDQFKKENQRLKDE 258
+ ER + + +E+E L+ + +E + L +
Sbjct: 418 REERDELREREAELEATLRTARERVEEAEALLEA 451
>gnl|CDD|191382 pfam05837, CENP-H, Centromere protein H (CENP-H). This family
consists of several eukaryotic centromere protein H
(CENP-H) sequences. Macromolecular
centromere-kinetochore complex plays a critical role in
sister chromatid separation, but its complete protein
composition as well as its precise dynamic function
during mitosis has not yet been clearly determined.
CENP-H contains a coiled-coil structure and a nuclear
localisation signal. CENP-H is specifically and
constitutively localised in kinetochores throughout the
cell cycle. CENP-H may play a role in kinetochore
organisation and function throughout the cell cycle.
This the C-terminus of the region, which is conserved
from fungi to humans.
Length = 106
Score = 27.7 bits (62), Expect = 2.7
Identities = 10/40 (25%), Positives = 17/40 (42%)
Query: 232 ERKLSEMEVELKELDQFKKENQRLKDENGALIRVISKLSK 271
+EL +KE +LK +N L + +L+K
Sbjct: 6 LNLQQASRETREELSDLEKERLQLKQKNVELALELLELTK 45
>gnl|CDD|131264 TIGR02209, ftsL_broad, cell division protein FtsL. This model
represents FtsL, both forms similar to that in E. coli
and similar to that in B. subtilis. FtsL is one of the
later proteins active in cell division septum formation.
FtsL is small, low in complexity, and highly divergent.
The scope of this model is broader than that of the
pfam04999.3 for FtsL, as this one includes FtsL from
Bacillus subtilis and related species [Cellular
processes, Cell division].
Length = 85
Score = 27.2 bits (61), Expect = 2.9
Identities = 8/28 (28%), Positives = 16/28 (57%)
Query: 244 ELDQFKKENQRLKDENGALIRVISKLSK 271
EL + + E +L+ E L +++LS+
Sbjct: 32 ELQKLQLEIDKLQKEWRDLQLEVAELSR 59
>gnl|CDD|224114 COG1193, COG1193, Mismatch repair ATPase (MutS family) [DNA
replication, recombination, and repair].
Length = 753
Score = 29.2 bits (66), Expect = 3.0
Identities = 17/95 (17%), Positives = 36/95 (37%), Gaps = 10/95 (10%)
Query: 177 ENGELDYKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNS-LSEQEKRERRTMERK- 234
+ ++L E+ + L E L + E+ L++V+ NS + +
Sbjct: 500 GEEKELLEELIEKLEEVRKELEEELEEVEKLLDEVELLTGANSGGKTSLLELKAQIVVLA 559
Query: 235 -------LSEMEVELKELDQFKKENQRLKDENGAL 262
E +VE + +F K+ ++ + GA
Sbjct: 560 HMGLPVPAEEAKVEAVDEVKFLKK-KKGILDAGAF 593
>gnl|CDD|237616 PRK14127, PRK14127, cell division protein GpsB; Provisional.
Length = 109
Score = 27.7 bits (62), Expect = 3.0
Identities = 10/30 (33%), Positives = 19/30 (63%)
Query: 242 LKELDQFKKENQRLKDENGALIRVISKLSK 271
+K+ + F+KE + L+ EN L + +L+K
Sbjct: 36 IKDYEAFQKEIEELQQENARLKAQVDELTK 65
>gnl|CDD|238453 cd00890, Prefoldin, Prefoldin is a hexameric molecular chaperone
complex, found in both eukaryotes and archaea, that
binds and stabilizes newly synthesized polypeptides
allowing them to fold correctly. The complex contains
two alpha and four beta subunits, the two subunits being
evolutionarily related. In archaea, there is usually
only one gene for each subunit while in eukaryotes there
two or more paralogous genes encoding each subunit
adding heterogeneity to the structure of the hexamer.
The structure of the complex consists of a double beta
barrel assembly with six protruding coiled-coils.
Length = 129
Score = 28.1 bits (63), Expect = 3.0
Identities = 5/37 (13%), Positives = 22/37 (59%)
Query: 222 EQEKRERRTMERKLSEMEVELKELDQFKKENQRLKDE 258
++ + + ++++L ++ +L++L+ E ++ K+
Sbjct: 2 QELAAQLQQLQQQLEALQQQLQKLEAQLTEYEKAKET 38
>gnl|CDD|185594 PTZ00395, PTZ00395, Sec24-related protein; Provisional.
Length = 1560
Score = 29.3 bits (65), Expect = 3.0
Identities = 47/237 (19%), Positives = 87/237 (36%), Gaps = 36/237 (15%)
Query: 3 YSTPPRRSSGRTTSRPRSLYSSSPSPQSRNLSSYSSGLSSSTPYLNSSY--PSYGSTYGG 60
YS P + G + S++P Y++ +S+TPY N P Y +
Sbjct: 414 YSNPGNSNPGYNNAP----NSNTP---------YNNPPNSNTPYSNPPNSNPPYSNL--- 457
Query: 61 TSSGYGGLTLPSSYSRPSQSSPRSYHAPPSS---YRSSYAPTTTPLPPVRPRRSKSSTRS 117
YS S+ +APPSS + S+Y +P + +
Sbjct: 458 ------------PYSNTPYSNAPLSNAPPSSAKDHHSAYHAAYQHRAANQPAANLPTANQ 505
Query: 118 ANYTPEGGGRMSRSGSTTSLNSLRSEAPSEGYESGGDNLSTSRSRVNSESGGSSRKLPNE 177
G + G+ + S AP G + + + R + GG++R+ +
Sbjct: 506 PAANNFHGAAGNSVGNPFASRPFGS-APYGGNAATTADPNGIAKREDHPEGGTNRQKYEQ 564
Query: 178 NGELDYKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERK 234
+ E + E+ +E E K EE + +K T ++ +++ + T E+K
Sbjct: 565 SDEESVESSSSENSSE--NENEVTDKGEEIYSLLKKTINRIDMNKIPRPIINTQEKK 619
>gnl|CDD|236912 PRK11448, hsdR, type I restriction enzyme EcoKI subunit R;
Provisional.
Length = 1123
Score = 29.1 bits (66), Expect = 3.1
Identities = 24/123 (19%), Positives = 46/123 (37%), Gaps = 25/123 (20%)
Query: 143 EAPSEGYESGGDNLSTSRSRVNSESGGSSRKLPNENGELDYKKLYEESQAELSRL----- 197
E P + + T + ++ E + + L E Q EL L
Sbjct: 138 EDPENLLHALQQEVLTLKQQL--ELQAREKA--------QSQALAEAQQQELVALEGLAA 187
Query: 198 --RETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELKELDQFKKENQRL 255
E + E +L Q++ ++ S ++KR+ T + + +EL E +E + L
Sbjct: 188 ELEEKQQELEAQLEQLQEKAAETSQERKQKRKEITDQ---AAKRLELSE-----EETRIL 239
Query: 256 KDE 258
D+
Sbjct: 240 IDQ 242
>gnl|CDD|180941 PRK07352, PRK07352, F0F1 ATP synthase subunit B; Validated.
Length = 174
Score = 28.4 bits (64), Expect = 3.2
Identities = 22/81 (27%), Positives = 41/81 (50%), Gaps = 10/81 (12%)
Query: 189 ESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEK-----RERRTMERKLSEMEVELK 243
+++AE R E + E++ ++K T + + +EQE+ R R E +++ E +L
Sbjct: 95 KARAEAIRA-EIEKQAIEDMARLKQTAAADLSAEQERVIAQLR-REAAELAIAKAESQLP 152
Query: 244 -ELDQFKKENQRLKDENGALI 263
LD+ QRL D + A +
Sbjct: 153 GRLDE--DAQQRLIDRSIANL 171
>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein. This entry is a highly
conserved protein present in eukaryotes.
Length = 680
Score = 29.1 bits (65), Expect = 3.2
Identities = 31/117 (26%), Positives = 45/117 (38%), Gaps = 6/117 (5%)
Query: 3 YSTPPRRSSGRTTSRPRSL-YSSSPSPQSRNLSSYSSGLSSSTPYLNSSYPSYGSTYGGT 61
+S R SS T + SSS + S S S S+ NSS S+ S G
Sbjct: 296 HSNKSRPSSSSTNGSKETTSNSSSAAAGSIGSKSSKSAKHSNRNKSNSSPKSHSSANGSV 355
Query: 62 SSGYGGLTLPSSYSRPSQSSPRSYHAPPSSYRSSYAPTTTPLPPVRPRRSKSSTRSA 118
S ++ + S+ ++S S A S +S + P +K ST SA
Sbjct: 356 PSS----SVSDNESKQKRASKSSSGARDSKKDASGMSANGTVENCIP-ENKISTPSA 407
>gnl|CDD|223022 PHA03248, PHA03248, DNA packaging tegument protein UL25;
Provisional.
Length = 583
Score = 28.8 bits (65), Expect = 3.4
Identities = 13/32 (40%), Positives = 19/32 (59%), Gaps = 3/32 (9%)
Query: 111 SKSSTRSANYTPEGGGRMSRSGSTTSLNSLRS 142
+ RS +Y GRMSR+ TT++ SL+S
Sbjct: 194 RPLAARSIDY---RDGRMSRTFMTTAITSLQS 222
>gnl|CDD|224227 COG1308, EGD2, Transcription factor homologous to NACalpha-BTF3
[Transcription].
Length = 122
Score = 27.7 bits (62), Expect = 3.4
Identities = 8/32 (25%), Positives = 20/32 (62%)
Query: 228 RRTMERKLSEMEVELKELDQFKKENQRLKDEN 259
R +++ + +M ++++ELD ++ +LKD
Sbjct: 8 PRKLKKLMKQMGIDVEELDGVERVIIKLKDTE 39
>gnl|CDD|188098 TIGR00957, MRP_assoc_pro, multi drug resistance-associated protein
(MRP). This model describes multi drug
resistance-associated protein (MRP) in eukaryotes. The
multidrug resistance-associated protein is an integral
membrane protein that causes multidrug resistance when
overexpressed in mammalian cells. It belongs to ABC
transporter superfamily. The protein topology and
function was experimentally demonstrated by epitope
tagging and immunofluorescence. Insertion of tags in the
critical regions associated with drug efflux, abrogated
its function. The C-terminal domain seem to highly
conserved [Transport and binding proteins, Other].
Length = 1522
Score = 29.1 bits (65), Expect = 3.4
Identities = 32/139 (23%), Positives = 48/139 (34%), Gaps = 27/139 (19%)
Query: 115 TRSANYTPE-------GGGRMSRSGSTTSL--------NSLRSEAPSEGYESGGDNLSTS 159
T +Y P+ GG++S GS L LR+ AP E
Sbjct: 819 THGISYLPQVDVIIVMSGGKISEMGSYQELLQRDGAFAEFLRTYAPDE------QQGHLE 872
Query: 160 RSRVNSESGGSSRKLPNENGELDYKKLYEESQAELSRLR---ETLSK---TEEELNQVKA 213
S SG ENG L + ++ Q +LS S+ + EL + +A
Sbjct: 873 DSWTALVSGEGKEAKLIENGMLVTDVVGKQLQRQLSASSSDSGDQSRHHGSSAELQKAEA 932
Query: 214 TYSKNSLSEQEKRERRTME 232
L E +K + +E
Sbjct: 933 KEETWKLMEADKAQTGQVE 951
>gnl|CDD|153280 cd07596, BAR_SNX, The Bin/Amphiphysin/Rvs (BAR) domain of Sorting
Nexins. BAR domains are dimerization, lipid binding and
curvature sensing modules found in many different
proteins with diverse functions. Sorting nexins (SNXs)
are Phox homology (PX) domain containing proteins that
are involved in regulating membrane traffic and protein
sorting in the endosomal system. SNXs differ from each
other in their lipid-binding specificity, subcellular
localization and specific function in the endocytic
pathway. A subset of SNXs also contain BAR domains. The
PX-BAR structural unit determines the specific membrane
targeting of SNXs. BAR domains form dimers that bind to
membranes, induce membrane bending and curvature, and
may also be involved in protein-protein interactions.
Length = 218
Score = 28.5 bits (64), Expect = 3.5
Identities = 17/72 (23%), Positives = 30/72 (41%), Gaps = 7/72 (9%)
Query: 184 KKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELK 243
KK +A+L +L+ ++ ++ + L E E +R E+ LK
Sbjct: 123 KKDLASKKAQLEKLKAAPGIKPAKVEEL-----EEELEEAESALEEARKR-YEEISERLK 176
Query: 244 -ELDQFKKENQR 254
EL +F +E R
Sbjct: 177 EELKRFHEERAR 188
>gnl|CDD|236861 PRK11147, PRK11147, ABC transporter ATPase component; Reviewed.
Length = 635
Score = 28.8 bits (65), Expect = 3.5
Identities = 19/76 (25%), Positives = 33/76 (43%), Gaps = 7/76 (9%)
Query: 184 KKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVEL- 242
KKL + Q EL +L + L E E+ ++A + Q + + + L++ E EL
Sbjct: 560 KKLSYKLQRELEQLPQLLEDLEAEIEALQAQVADADFFSQPHEQTQKVLADLADAEQELE 619
Query: 243 ------KELDQFKKEN 252
+EL+ K
Sbjct: 620 VAFERWEELEALKNGG 635
Score = 28.4 bits (64), Expect = 4.4
Identities = 18/58 (31%), Positives = 30/58 (51%), Gaps = 3/58 (5%)
Query: 191 QAELSRLRETLSKTEEELNQVKATYSKNS---LSEQEKRERRTMERKLSEMEVELKEL 245
QA+ L++ K +EE KA K S LS + +RE + + L ++E E++ L
Sbjct: 530 QAQYLALKQPAVKKKEEAAAPKAETVKRSSKKLSYKLQRELEQLPQLLEDLEAEIEAL 587
>gnl|CDD|218704 pfam05701, DUF827, Plant protein of unknown function (DUF827).
This family consists of several plant proteins of
unknown function. Several sequences in this family are
described as being "myosin heavy chain-like".
Length = 484
Score = 28.8 bits (64), Expect = 3.5
Identities = 23/94 (24%), Positives = 35/94 (37%), Gaps = 7/94 (7%)
Query: 171 SRKLPNENGELDYKKLYEESQAELSRLRETLSKTEEELNQVKATYS--KNSLSEQEKRER 228
S +L + E+ EL + L K EE ++ K L E+EK+E
Sbjct: 238 SAELESLQKEISIMASVASVLKELEEAKANLEKAAEEEKSLRNLVESLKQEL-EEEKKEL 296
Query: 229 RTMERKLSEMEVEL----KELDQFKKENQRLKDE 258
+ K E E EL++ K E + E
Sbjct: 297 EELREKEGEAEEAASSLEAELNRTKSEKESKARE 330
>gnl|CDD|218434 pfam05103, DivIVA, DivIVA protein. The Bacillus subtilis divIVA1
mutation causes misplacement of the septum during cell
division, resulting in the formation of small, circular,
anucleate mini-cells. Inactivation of divIVA produces a
mini-cell phenotype, whereas overproduction of DivIVA
results in a filamentation phenotype. These proteins
appear to contain coiled-coils.
Length = 131
Score = 27.6 bits (62), Expect = 3.6
Identities = 13/46 (28%), Positives = 21/46 (45%), Gaps = 7/46 (15%)
Query: 223 QEKRERRTMERKLSEMEVE------LKELDQFKKENQRLKDENGAL 262
K ++ M R EV+ +K+ + KEN+ LK+E L
Sbjct: 7 LNKEFKKKM-RGYDPDEVDEFLDQVIKDYEALYKENEELKEEIERL 51
Score = 26.9 bits (60), Expect = 8.0
Identities = 23/103 (22%), Positives = 39/103 (37%), Gaps = 32/103 (31%)
Query: 182 DYKKLYEESQAELSRLRETLSKTEEELN---------------------QVKATYSK--- 217
DY+ LY+E++ L+E + + EE+L +VKA K
Sbjct: 33 DYEALYKENE----ELKEEIERLEEKLEEYKELEETLQKTLVVAQETAEEVKANAQKEAE 88
Query: 218 NSLSEQEKRERRTME---RKLSEMEVELKEL-DQFKKENQRLK 256
+ E E + R + + ++ E EL Q + R K
Sbjct: 89 LIIKEAEAKAERIVNDANEEAKKLATEYDELKRQSRLFRTRFK 131
>gnl|CDD|220366 pfam09728, Taxilin, Myosin-like coiled-coil protein. Taxilin
contains an extraordinarily long coiled-coil domain in
its C-terminal half and is ubiquitously expressed. It is
a novel binding partner of several syntaxin family
members and is possibly involved in Ca2+-dependent
exocytosis in neuroendocrine cells. Gamma-taxilin,
described as leucine zipper protein Factor Inhibiting
ATF4-mediated Transcription (FIAT), localises to the
nucleus in osteoblasts and dimerises with ATF4 to form
inactive dimers, thus inhibiting ATF4-mediated
transcription.
Length = 309
Score = 28.5 bits (64), Expect = 3.7
Identities = 21/83 (25%), Positives = 41/83 (49%), Gaps = 6/83 (7%)
Query: 189 ESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELKELDQF 248
E + L ++LSK + +++ + L+E K E + +E+ L +++ K +Q
Sbjct: 2 EVKKAARLLNQSLSKLDTPEEKLETLIKR--LAELLK-EMKRLEKDLKKLQ---KRGEQL 55
Query: 249 KKENQRLKDENGALIRVISKLSK 271
+KE +L+ E+ I SKL
Sbjct: 56 QKEKDKLQSEHSKAILARSKLES 78
>gnl|CDD|226127 COG3599, DivIVA, Cell division initiation protein [Cell division
and chromosome partitioning].
Length = 212
Score = 28.2 bits (63), Expect = 3.7
Identities = 15/102 (14%), Positives = 33/102 (32%), Gaps = 2/102 (1%)
Query: 170 SSRKLPNENGELDYKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERR 229
+ R L + + + L ++Q E RLR + Q S ++ +
Sbjct: 112 ADRYLEDAKIAQELEDLKRQAQVERQRLRSDIEAQLASAKQEDWDEILRSTVDEVEAANE 171
Query: 230 TMERKLSEMEVELKELDQFKKENQRLKDENGALIRVISKLSK 271
ER + + L + + + + L +S +
Sbjct: 172 EAERLADAAQADADRL--RDECDIYVDTKLAELETRLSGTRR 211
>gnl|CDD|220373 pfam09738, DUF2051, Double stranded RNA binding protein (DUF2051).
This is a novel protein identified as interacting with
the leucine-rich repeat domain of human flightless-I,
FliI protein.
Length = 294
Score = 28.3 bits (63), Expect = 4.0
Identities = 26/129 (20%), Positives = 53/129 (41%), Gaps = 18/129 (13%)
Query: 147 EGYESGGDNLSTSRSRVNSESGGSSRKLPNENGELDYKKLYEESQAELSRLRETLSKTEE 206
+ + S NS G SSR+ + +++A + L+++L + EE
Sbjct: 37 DFDRKDSSSTSAVSIATNSLGGSSSRRSSGDTSSS------VDTEASIRELKDSLIEVEE 90
Query: 207 ELNQVKATYS-----KNSLSEQEKRERRTMERKLSEMEVELKELDQFKKENQRLKDENGA 261
++ + + + K +L E ++ KL E+E + L + ++E + + A
Sbjct: 91 KVKKAMVSNAQLDNEKTNL----MYEVDLLKDKLEELE---ESLAELQREFEEKTRDLEA 143
Query: 262 LIRVISKLS 270
L R I L
Sbjct: 144 LKRAIDILQ 152
>gnl|CDD|183876 PRK13169, PRK13169, DNA replication intiation control protein YabA;
Reviewed.
Length = 110
Score = 27.1 bits (61), Expect = 4.0
Identities = 18/57 (31%), Positives = 25/57 (43%), Gaps = 16/57 (28%)
Query: 231 MERK-----LSEME----VELKELDQFKK-------ENQRLKDENGALIRVISKLSK 271
M++K L ++E V LKEL KK EN L+ EN L + +L
Sbjct: 1 MDKKEIFDALDDLEQNLGVLLKELGALKKQLAELLEENTALRLENDKLRERLEELEA 57
>gnl|CDD|202045 pfam01920, Prefoldin_2, Prefoldin subunit. This family includes
prefoldin subunits that are not detected by pfam02996.
Length = 106
Score = 27.2 bits (61), Expect = 4.0
Identities = 16/59 (27%), Positives = 32/59 (54%), Gaps = 12/59 (20%)
Query: 197 LRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELKELDQFKKENQRL 255
+++ + +EEL + K T K +T+E++L ++E KEL++ K+E +L
Sbjct: 56 VKQDKEEVKEELEERKETLEKEI---------KTLEKQLEKLE---KELEELKEELYKL 102
>gnl|CDD|216108 pfam00769, ERM, Ezrin/radixin/moesin family. This family of
proteins contain a band 4.1 domain (pfam00373), at their
amino terminus. This family represents the rest of these
proteins.
Length = 244
Score = 28.2 bits (63), Expect = 4.1
Identities = 22/82 (26%), Positives = 36/82 (43%), Gaps = 9/82 (10%)
Query: 188 EESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELKELDQ 247
+Q EL ET + EE+L Q + + L E++ E R+L E E
Sbjct: 22 RRAQKELEEYEETALELEEKLKQEE---EEAQLLEKKADELEEENRRLEE------EAAA 72
Query: 248 FKKENQRLKDENGALIRVISKL 269
++E +RL+ E ++KL
Sbjct: 73 SEEERERLEAEVDEATAEVAKL 94
>gnl|CDD|233547 TIGR01730, RND_mfp, RND family efflux transporter, MFP subunit.
This model represents the MFP (membrane fusion protein)
component of the RND family of transporters. RND refers
to Resistance, Nodulation, and cell Division. It is, in
part, a subfamily of pfam00529 (Pfam release 7.5) but
hits substantial numbers of proteins missed by that
model. The related HlyD secretion protein, for which
pfam00529 is named, is outside the scope of this model.
Attributed functions imply outward transport. These
functions include nodulation, acriflavin resistance,
heavy metal efflux, and multidrug resistance proteins.
Most members of this family are found in Gram-negative
bacteria. The proposed function of MFP proteins is to
bring the inner and outer membranes together and enable
transport to the outside of the outer membrane. Note,
however, that a few members of this family are found in
Gram-positive bacteria, where there is no outer membrane
[Transport and binding proteins, Unknown substrate].
Length = 322
Score = 28.4 bits (64), Expect = 4.1
Identities = 10/71 (14%), Positives = 29/71 (40%), Gaps = 1/71 (1%)
Query: 182 DYKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQE-KRERRTMERKLSEMEV 240
DY+ + + A+L+ L + + + +N++S+ + + +E +++E
Sbjct: 61 DYQLALQAALAQLAAAEAQLELAQRSFERAERLVKRNAVSQADLDDAKAAVEAAQADLEA 120
Query: 241 ELKELDQFKKE 251
L +
Sbjct: 121 AKASLASAQLN 131
>gnl|CDD|220749 pfam10428, SOG2, RAM signalling pathway protein. SOG2 proteins in
Saccharomyces cerevisiae are involved in cell separation
and cytokinesis.
Length = 419
Score = 28.6 bits (64), Expect = 4.2
Identities = 24/85 (28%), Positives = 33/85 (38%), Gaps = 6/85 (7%)
Query: 76 RPSQSSPRSYHAPPSSYRSSYAPTTTPLPPVRPR---RSKSSTRSANYTPEGGGRMSRSG 132
R SSPRS P + S TT +PP RS + + SAN + RSG
Sbjct: 189 RRPSSSPRSLSNPTTLESPSNLQVTTDVPPPYSNGTSRSSTMSSSANLSIISSLATPRSG 248
Query: 133 ST---TSLNSLRSEAPSEGYESGGD 154
+ T + S P G + +
Sbjct: 249 ESFRSTPTSGSSSINPVSGLDEAEE 273
>gnl|CDD|236745 PRK10722, PRK10722, hypothetical protein; Provisional.
Length = 247
Score = 28.0 bits (63), Expect = 4.2
Identities = 16/55 (29%), Positives = 25/55 (45%), Gaps = 12/55 (21%)
Query: 183 YKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRT-MERKLS 236
Y+KL + S +EL LR+ + + +L K E T +ER+LS
Sbjct: 167 YQKLQQSSDSELDALRQQQQRLQYQLELT-----------TRKLENLTDIERQLS 210
>gnl|CDD|237178 PRK12705, PRK12705, hypothetical protein; Provisional.
Length = 508
Score = 28.5 bits (64), Expect = 4.5
Identities = 18/69 (26%), Positives = 30/69 (43%), Gaps = 5/69 (7%)
Query: 188 EESQAELSRLRETLSKTEEELNQVK----ATYSK-NSLSEQEKRERRTMERKLSEMEVEL 242
+ + E R RE L + EE L Q + A K ++L Q + + + + E+E
Sbjct: 66 NQQRQEARREREELQREEERLVQKEEQLDARAEKLDNLENQLEEREKALSARELELEELE 125
Query: 243 KELDQFKKE 251
K+LD
Sbjct: 126 KQLDNELYR 134
>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex. This
entry is characterized by proteins with alternating
conserved and low-complexity regions. Bud13 together
with Snu17p and a newly identified factor,
Pml1p/Ylr016c, form a novel trimeric complex. called The
RES complex, pre-mRNA retention and splicing complex.
Subunits of this complex are not essential for viability
of yeasts but they are required for efficient splicing
in vitro and in vivo. Furthermore, inactivation of this
complex causes pre-mRNA leakage from the nucleus. Bud13
contains a unique, phylogenetically conserved C-terminal
region of unknown function.
Length = 141
Score = 27.7 bits (62), Expect = 4.5
Identities = 7/37 (18%), Positives = 22/37 (59%)
Query: 222 EQEKRERRTMERKLSEMEVELKELDQFKKENQRLKDE 258
++EK + E++ + V+ +E ++ +E ++ K++
Sbjct: 25 KEEKERKEEKEKEWGKGLVQKEEREKRLEELEKAKNK 61
>gnl|CDD|227512 COG5185, HEC1, Protein involved in chromosome segregation,
interacts with SMC proteins [Cell division and
chromosome partitioning].
Length = 622
Score = 28.4 bits (63), Expect = 4.6
Identities = 26/91 (28%), Positives = 43/91 (47%), Gaps = 4/91 (4%)
Query: 154 DNLSTSRSRVNSESGGSSRKLPNENGELDYKKLYEESQAELSRLRETLSKTEEELNQVKA 213
NL + + +L N + + K EE++ EL R + K E+ELN +
Sbjct: 490 KNLKHDINELTQILEKLELELSEANSKFELSK--EENERELVAQRIEIEKLEKELNDLNL 547
Query: 214 TYSKNSLSEQEKRERRTMERKLSEMEVELKE 244
SK S+ + E+ + T E KL E++V+L
Sbjct: 548 L-SKTSILDAEQLVQST-EIKLDELKVDLNR 576
>gnl|CDD|216525 pfam01486, K-box, K-box region. The K-box region is commonly found
associated with SRF-type transcription factors see
pfam00319. The K-box is a possible coiled-coil
structure. Possible role in multimer formation.
Length = 100
Score = 26.7 bits (60), Expect = 4.7
Identities = 20/85 (23%), Positives = 43/85 (50%), Gaps = 9/85 (10%)
Query: 187 YEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKR--ERRTMERKLS-----EME 239
YE Q E ++L++ + E + SLS +E + E++ +E L + E
Sbjct: 14 YENLQQEAAKLKKQNENLQREQRHLLGE-DLGSLSLKELQQLEQQ-LENGLKHIRSRKNE 71
Query: 240 VELKELDQFKKENQRLKDENGALIR 264
+ L ++++ +K+ + L++EN L +
Sbjct: 72 LLLNQIEELQKKERELQEENRNLRQ 96
>gnl|CDD|169428 PRK08404, PRK08404, V-type ATP synthase subunit H; Validated.
Length = 103
Score = 27.1 bits (60), Expect = 4.9
Identities = 22/66 (33%), Positives = 36/66 (54%), Gaps = 2/66 (3%)
Query: 188 EESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERK-LSEMEVELKELD 246
EE++ + + +E K EEE+ + KA L E++K+E +K L E E E++EL
Sbjct: 24 EEAKKIIRKAKEEAKKIEEEIIK-KAEEEAQKLIEKKKKEGEEEAKKILEEGEKEIEELK 82
Query: 247 QFKKEN 252
+EN
Sbjct: 83 VKAEEN 88
>gnl|CDD|220404 pfam09789, DUF2353, Uncharacterized coiled-coil protein (DUF2353).
Members of this family of uncharacterized proteins have
no known function.
Length = 319
Score = 28.3 bits (63), Expect = 4.9
Identities = 26/133 (19%), Positives = 50/133 (37%), Gaps = 15/133 (11%)
Query: 141 RSEAPSEGYESGGDNLSTSRSRVNSESGGSSRKLPNENGEL-----DYKKLYEESQAELS 195
+SE E + ++ + K EN +L + ++ E Q ++
Sbjct: 44 KSERELELVAACQGDMKIKPGTKRQDLSQLLEKYREENQQLSTDVQELRQRKAELQGDIK 103
Query: 196 RLRETLSKTEEELNQVKATYSKNS----------LSEQEKRERRTMERKLSEMEVELKEL 245
LR+T+++ E S L E+ K+ + L E E L E
Sbjct: 104 LLRQTIAQQRVEFMGGGVRESFEERNDLVSQLEKLREKCKQLESDLRSLLDEKEEVLMER 163
Query: 246 DQFKKENQRLKDE 258
D+++++ RL +E
Sbjct: 164 DEYQRKAHRLNNE 176
>gnl|CDD|129705 TIGR00618, sbcc, exonuclease SbcC. All proteins in this family for
which functions are known are part of an exonuclease
complex with sbcD homologs. This complex is involved in
the initiation of recombination to regulate the levels
of palindromic sequences in DNA. This family is based on
the phylogenomic analysis of JA Eisen (1999, Ph.D.
Thesis, Stanford University) [DNA metabolism, DNA
replication, recombination, and repair].
Length = 1042
Score = 28.4 bits (63), Expect = 5.0
Identities = 13/72 (18%), Positives = 25/72 (34%), Gaps = 2/72 (2%)
Query: 182 DYKKLYEESQ--AELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEME 239
Q EL+ L + L Q + S+ K + + L +M+
Sbjct: 627 LQDVRLHLQQCSQELALKLTALHALQLTLTQERVREHALSIRVLPKELLASRQLALQKMQ 686
Query: 240 VELKELDQFKKE 251
E ++L +K+
Sbjct: 687 SEKEQLTYWKEM 698
>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain. This is a family of proteins of
approximately 300 residues, found in plants and
vertebrates. They contain a highly conserved DDRGK
motif.
Length = 189
Score = 27.7 bits (62), Expect = 5.1
Identities = 19/80 (23%), Positives = 37/80 (46%), Gaps = 2/80 (2%)
Query: 177 ENGELDYKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLS 236
+ G KL EE QA + RE + EE +++ E+E E R +++
Sbjct: 2 KIGAKKRAKL-EEKQAR-RQQREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEE 59
Query: 237 EMEVELKELDQFKKENQRLK 256
E + ++ + ++E ++LK
Sbjct: 60 ERKEREEQARKEQEEYEKLK 79
>gnl|CDD|114270 pfam05539, Pneumo_att_G, Pneumovirinae attachment membrane
glycoprotein G.
Length = 408
Score = 28.1 bits (62), Expect = 5.1
Identities = 28/123 (22%), Positives = 38/123 (30%), Gaps = 10/123 (8%)
Query: 4 STPPRRSSGRTTSRPRSLYSSSPSPQSRNLSSYSSGLSSSTPYLNSSYPSYGSTYGGTSS 63
ST P + G TTS S+P PQ+ S G S S + S+ ST G
Sbjct: 215 STEPVGTQGTTTS-------SNPEPQTEPPPS-QRGPSGSPQHPPSTTSQDQSTTGDGQE 266
Query: 64 GYGGLTLPSSYSRPSQSSPRSYHAPPSSYRSSYAPTTTPL--PPVRPRRSKSSTRSANYT 121
P + S + P + + + PT P SS
Sbjct: 267 HTQRRKTPPATSNRRSPHSTATPPPTTKRQETGRPTPRPTATTQSGSSPPHSSPPGVQAN 326
Query: 122 PEG 124
P
Sbjct: 327 PTT 329
>gnl|CDD|225437 COG2882, FliJ, Flagellar biosynthesis chaperone [Cell motility and
secretion / Intracellular trafficking and secretion /
Posttranslational modification, protein turnover,
chaperones].
Length = 148
Score = 27.3 bits (61), Expect = 5.3
Identities = 15/68 (22%), Positives = 34/68 (50%), Gaps = 1/68 (1%)
Query: 191 QAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELKELDQFKK 250
Q +S+L + + + +L++++ + QEK+ KL E + L++ ++
Sbjct: 70 QQFISQLEVAIDQQQSQLSKLRKQVEQKREIWQEKQIELKALEKLKERQKTEFLLEENRR 129
Query: 251 ENQRLKDE 258
E Q++ DE
Sbjct: 130 E-QKIMDE 136
>gnl|CDD|176962 CHL00019, atpF, ATP synthase CF0 B subunit.
Length = 184
Score = 27.5 bits (62), Expect = 5.4
Identities = 23/97 (23%), Positives = 45/97 (46%), Gaps = 19/97 (19%)
Query: 186 LYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELKEL 245
EE+ +L + R L + E E ++++ N SE E RE+ + +++ + +L+ L
Sbjct: 70 RREEAIEKLEKARARLRQAELEADEIR----VNGYSEIE-REKENL---INQAKEDLERL 121
Query: 246 DQFKKEN-----QRLKDE------NGALIRVISKLSK 271
+ +K E QR ++ AL R + L+
Sbjct: 122 ENYKNETIRFEQQRAINQVRQQVFQLALQRALGTLNS 158
>gnl|CDD|227569 COG5244, NIP100, Dynactin complex subunit involved in mitotic
spindle partitioning in anaphase B [Cell division and
chromosome partitioning].
Length = 669
Score = 28.1 bits (62), Expect = 5.4
Identities = 17/98 (17%), Positives = 30/98 (30%), Gaps = 22/98 (22%)
Query: 165 SESGGSSRKLPNENGELDYK-----KLYEESQAELSRLRETLSKTEEELNQVKATYSKNS 219
E L NG LD E + +L L S +
Sbjct: 211 CERSNIHDVLFLVNGILDGVIDELNGELERLRRQLVSLM-----------------SSHG 253
Query: 220 LSEQEKRERRTMERKLSEMEVELKELDQFKKENQRLKD 257
+ +E + K +E+++ L + +N+ LK
Sbjct: 254 IEVEENSRLKATLEKFQSLELKVNTLQEELYQNKLLKK 291
Score = 27.7 bits (61), Expect = 8.9
Identities = 42/183 (22%), Positives = 61/183 (33%), Gaps = 27/183 (14%)
Query: 106 VRPRRSKSSTRSANYTPEGGGRMSRSGSTTSLNSLRSEA---PSEGYESGGDNLSTSRSR 162
+RP +A Y GG + S ++ E ES L +
Sbjct: 64 IRPDDDSLLNGNAAYEKIKGGLVCESKGMDKDGEIKQENHEDRIHFEESKIRRLEETIEA 123
Query: 163 VNSESGGSSRKLPNENGELDYKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSE 222
+ S +L EN ELD L LRE +S E ELN+ + S + L E
Sbjct: 124 LKSTEKEEIVELRRENEELDKINL---------SLRERISSEEPELNKDGSKLSYDELKE 174
Query: 223 QEKRERRTMERKLSEMEVELKELDQFKKENQR--------------LKDENGALIRVISK 268
+ R + E+ ++ E QR L NG L VI +
Sbjct: 175 FVEESRV-QVYDMVELVSDISETLNRNGSIQRSSVRECERSNIHDVLFLVNGILDGVIDE 233
Query: 269 LSK 271
L+
Sbjct: 234 LNG 236
>gnl|CDD|227453 COG5124, COG5124, Protein predicted to be involved in meiotic
recombination [Cell division and chromosome partitioning
/ General function prediction only].
Length = 209
Score = 27.6 bits (61), Expect = 5.5
Identities = 18/66 (27%), Positives = 29/66 (43%), Gaps = 5/66 (7%)
Query: 199 ETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVEL-----KELDQFKKENQ 253
E L K +E+ Q ATY + E+ R ++ E + + L KE D+ KK+
Sbjct: 85 ELLKKKIQEVKQDIATYKEEIDKEKATRRKKFTEGQKNYNREALLEKRKKEQDEIKKKLN 144
Query: 254 RLKDEN 259
L+
Sbjct: 145 SLQKIE 150
>gnl|CDD|221514 pfam12297, EVC2_like, Ellis van Creveld protein 2 like protein.
This family of proteins is found in eukaryotes. Proteins
in this family are typically between 571 and 1310 amino
acids in length. There are two conserved sequence
motifs: LPA and ELH. EVC2 is implicated in Ellis van
Creveld chondrodysplastic dwarfism in humans. Mutations
in this protein can give rise to this congenital
condition. LIMBIN is a protein which shares around 80%
sequence homology with EVC2 and it is implicated in a
similar condition in bovine chondrodysplastic dwarfism.
Length = 429
Score = 27.9 bits (62), Expect = 6.0
Identities = 19/85 (22%), Positives = 29/85 (34%), Gaps = 6/85 (7%)
Query: 161 SRVNSESGGSSRKLPNENGELDYKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSL 220
+R E+ E E K+ E E S L +TL E+E Q +
Sbjct: 230 TRKKMEAQHQREMAEMEQAEELLKRAPEREAVECSSLLDTLHGLEQEHLQRSLLLQQEED 289
Query: 221 SEQEKRERRTMERKLSEMEVELKEL 245
+ R+ +R VEL +
Sbjct: 290 FAKAHRQLAVFQR------VELHSI 308
>gnl|CDD|215598 PLN03138, PLN03138, Protein TOC75; Provisional.
Length = 796
Score = 28.3 bits (63), Expect = 6.1
Identities = 21/100 (21%), Positives = 32/100 (32%), Gaps = 20/100 (20%)
Query: 55 GSTYGGTSSGYGGLTLPSSYSRPSQSSPRSYHAPPSSYRSSYAPTTTPLPPVRPRRSKSS 114
G + S +L SS + S S RS S+ RS R K S
Sbjct: 1 GRSSSTMVSAAASTSLSSSRPQLSSFSSRSP---QSATRSP-----------RASSIKCS 46
Query: 115 TRSANYTPEGGGRMSRSGSTTSLNSLRSEAPSEGYESGGD 154
++ + + S S + + + S SGG
Sbjct: 47 ASASASSS------ATSSSASLVANGAVALLSASAISGGG 80
>gnl|CDD|233245 TIGR01042, V-ATPase_V1_A, V-type (H+)-ATPase V1, A subunit. This
models eukaryotic vacuolar (H+)-ATPase that is
responsible for acidifying cellular compartments. This
enzyme shares extensive sequence similarity with
archaeal ATP synthase [Transport and binding proteins,
Cations and iron carrying compounds].
Length = 591
Score = 28.2 bits (63), Expect = 6.1
Identities = 14/43 (32%), Positives = 26/43 (60%), Gaps = 1/43 (2%)
Query: 183 YKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEK 225
Y+K Y E ++ +E L + EE+LN++ K++L+E +K
Sbjct: 448 YEKFYPEFVPLRTKAKEILQE-EEDLNEIVQLVGKDALAETDK 489
>gnl|CDD|148679 pfam07218, RAP1, Rhoptry-associated protein 1 (RAP-1). This family
consists of several rhoptry-associated protein 1 (RAP-1)
sequences which appear to be specific to Plasmodium
falciparum.
Length = 790
Score = 28.1 bits (62), Expect = 6.2
Identities = 14/57 (24%), Positives = 22/57 (38%), Gaps = 6/57 (10%)
Query: 7 PRRSSGRTTSRPRSLYSSSPSPQSRNLSSYSSGLSSSTPYLNSSYPSYGSTYGGTSS 63
P + G L SSS +S SG S++ + S+ +T G +S
Sbjct: 132 PGKPKGNQGE---GLASSSDGKSK---ASAKSGSKSASKHGESNSSDESATDSGKAS 182
>gnl|CDD|236892 PRK11281, PRK11281, hypothetical protein; Provisional.
Length = 1113
Score = 28.0 bits (63), Expect = 6.5
Identities = 14/82 (17%), Positives = 33/82 (40%), Gaps = 6/82 (7%)
Query: 182 DYKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVE 241
K+ E+ + +L++ L + + EL +K + + R +E +L++ +
Sbjct: 77 RQKEETEQLKQQLAQAPAKLRQAQAELEALKDDNDEETRETLSTLSLRQLESRLAQTLDQ 136
Query: 242 LKELDQFKKENQRLKDENGALI 263
L+ L + N L+
Sbjct: 137 LQNA------QNDLAEYNSQLV 152
>gnl|CDD|218657 pfam05615, THOC7, Tho complex subunit 7. The Tho complex is
involved in transcription elongation and mRNA export
from the nucleus.
Length = 132
Score = 26.9 bits (60), Expect = 6.5
Identities = 8/40 (20%), Positives = 20/40 (50%)
Query: 176 NENGELDYKKLYEESQAELSRLRETLSKTEEELNQVKATY 215
NE +Y++ EE + + ++ + + ++EL + K
Sbjct: 65 NERERENYEQEKEEIETTIENAKDEIEELKKELEEAKRIR 104
>gnl|CDD|219133 pfam06682, DUF1183, Protein of unknown function (DUF1183). This
family consists of several eukaryotic proteins of around
360 residues in length. The function of this family is
unknown.
Length = 317
Score = 27.8 bits (62), Expect = 6.6
Identities = 15/45 (33%), Positives = 20/45 (44%)
Query: 23 SSSPSPQSRNLSSYSSGLSSSTPYLNSSYPSYGSTYGGTSSGYGG 67
S + S S S S S ++S S S+ TSSG+GG
Sbjct: 269 SRRNNNSSYGRSYGSGSPSYSPSSSSNSSSSSSSSSTRTSSGFGG 313
>gnl|CDD|153341 cd07657, F-BAR_Fes_Fer, The F-BAR (FES-CIP4 Homology and
Bin/Amphiphysin/Rvs) domain of Fes (feline sarcoma) and
Fer (Fes related) tyrosine kinases. F-BAR domains are
dimerization modules that bind and bend membranes and
are found in proteins involved in membrane dynamics and
actin reorganization. Fes (feline sarcoma), also called
Fps (Fujinami poultry sarcoma), and Fer (Fes related)
are cytoplasmic (or nonreceptor) tyrosine kinases that
play roles in haematopoiesis, inflammation and immunity,
growth factor signaling, cytoskeletal regulation, cell
migration and adhesion, and the regulation of cell-cell
interactions. Although Fes and Fer show redundancy in
their biological functions, they show differences in
their expression patterns. Fer is ubiquitously expressed
while Fes is expressed predominantly in myeloid and
endothelial cells. Fes and Fer contain an N-terminal
F-BAR domain, an SH2 domain, and a C-terminal catalytic
kinase domain. F-BAR domains form banana-shaped dimers
with a positively-charged concave surface that binds to
negatively-charged lipid membranes. They can induce
membrane deformation in the form of long tubules. The
F-BAR domain of Fes is critical in its role in
microtubule nucleation and bundling.
Length = 237
Score = 27.3 bits (61), Expect = 6.9
Identities = 18/71 (25%), Positives = 35/71 (49%), Gaps = 5/71 (7%)
Query: 184 KKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELK 243
KK Y+E E ++ E K +E+ ++K+ Y K L E K + E + + +
Sbjct: 107 KKAYQE---ERQQIDEQYKKLTDEVEKLKSEYQK--LLEDYKAAKSKFEEAVVKGGRGGR 161
Query: 244 ELDQFKKENQR 254
+LD+ + + Q+
Sbjct: 162 KLDKARDKYQK 172
>gnl|CDD|149741 pfam08775, ParB, ParB family. ParB is a component of the par
system which mediates accurate DNA partition during cell
division. It recognises A-box and B-box DNA motifs. ParB
forms an asymmetric dimer with 2 extended
helix-turn-helix (HTH) motifs that bind to A-boxes. The
HTH motifs emanate from a beta sheet coiled coil DNA
binding module. Both DNA binding elements are free to
rotate around a flexible linker, this enables them to
bind to complex arrays of A- and B-box elements on
adjacent DNA arms of the looped partition site.
Length = 127
Score = 26.9 bits (60), Expect = 7.0
Identities = 15/79 (18%), Positives = 36/79 (45%), Gaps = 7/79 (8%)
Query: 182 DYK---KLYEESQAELSRLRETLSKTEEELNQVKATYS----KNSLSEQEKRERRTMERK 234
DYK K+ E+ + + L E + EE+ + + S K + + +E + ++ K
Sbjct: 11 DYKLLLKVQEQLEKKNISLDELIDNVSEEIKDIDSELSPDEQKKEIMKLITKELKLLKDK 70
Query: 235 LSEMEVELKELDQFKKENQ 253
+ + + L +F +++
Sbjct: 71 PVKDKAVVTPLAEFDDKDK 89
>gnl|CDD|227369 COG5036, COG5036, SPX domain-containing protein involved in
vacuolar polyphosphate accumulation [Inorganic ion
transport and metabolism].
Length = 509
Score = 27.8 bits (62), Expect = 7.1
Identities = 19/97 (19%), Positives = 46/97 (47%), Gaps = 4/97 (4%)
Query: 176 NENGELDYKKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTM-ERK 234
NEN +L +L + AE RLR + +++ + T+ ++ E + R + E+
Sbjct: 242 NENFDLYNGRLEKLEGAEALRLRWYGKLSPKDIFVERKTHREDWTGESSFKARFQLKEKF 301
Query: 235 LSEMEVELKELDQFKKENQRLKDENGALIRVISKLSK 271
+++ +K F+++ +++ E G + +L +
Sbjct: 302 VNKF---IKGDYTFEEQFAKVRKEGGTTAEDLERLER 335
>gnl|CDD|224241 COG1322, COG1322, Predicted nuclease of restriction
endonuclease-like fold, RmuC family [General function
prediction only].
Length = 448
Score = 27.7 bits (62), Expect = 7.6
Identities = 24/89 (26%), Positives = 40/89 (44%), Gaps = 6/89 (6%)
Query: 184 KKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVE-- 241
+L E Q + + E L++ ELNQ L E ++ R +E+++ E E
Sbjct: 101 AQLSSEFQELANEIFEELNRRLAELNQQNLKQLLKPLREVLEKFREQLEQRIHE-SAEER 159
Query: 242 ---LKELDQFKKENQRLKDENGALIRVIS 267
L+E+D+ E Q+L E G L +
Sbjct: 160 STLLEEIDRLLGEIQQLAQEAGNLTAALK 188
>gnl|CDD|223630 COG0556, UvrB, Helicase subunit of the DNA excision repair complex
[DNA replication, recombination, and repair].
Length = 663
Score = 27.6 bits (62), Expect = 7.7
Identities = 12/35 (34%), Positives = 20/35 (57%), Gaps = 6/35 (17%)
Query: 228 RRTMERKLSEMEVELKE-LDQFKKEN-----QRLK 256
R +E+ + ++ EL+E L F+K N QRL+
Sbjct: 253 RERLEQAIESIKEELEERLKFFEKNNKLLEAQRLE 287
>gnl|CDD|181619 PRK09039, PRK09039, hypothetical protein; Validated.
Length = 343
Score = 27.6 bits (62), Expect = 7.7
Identities = 13/64 (20%), Positives = 26/64 (40%), Gaps = 6/64 (9%)
Query: 191 QAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELKELDQFKK 250
Q ++ LR +LS E E ++++A ++ + + E + E+ EL Q
Sbjct: 80 QDSVANLRASLSAAEAERSRLQALLAELAGAGAA------AEGRAGELAQELDSEKQVSA 133
Query: 251 ENQR 254
Sbjct: 134 RALA 137
>gnl|CDD|220785 pfam10498, IFT57, Intra-flagellar transport protein 57. Eukaryotic
cilia and flagella are specialised organelles found at
the periphery of cells of diverse organisms.
Intra-flagellar transport (IFT) is required for the
assembly and maintenance of eukaryotic cilia and
flagella, and consists of the bidirectional movement of
large protein particles between the base and the distal
tip of the organelle. IFT particles contain multiple
copies of two distinct protein complexes, A and B, which
contain at least 6 and 11 protein subunits. IFT57 is
part of complex B but is not, however, required for the
core subunits to stay associated. This protein is known
as Huntington-interacting protein-1 in humans.
Length = 355
Score = 27.7 bits (62), Expect = 7.8
Identities = 22/77 (28%), Positives = 37/77 (48%), Gaps = 6/77 (7%)
Query: 184 KKLYEESQAELSRLRETLSKTEEELNQVKATYSKNSLSE--QEKRERRTMERKLSEMEVE 241
K E++ L +L + LSK E++ + + N L + QE RE R R LS+++ +
Sbjct: 229 SKSLPETKQYLKKLSQDLSKALEKI-SSREKHLNNQLEQLVQEYREAR---RTLSQVQEK 284
Query: 242 LKELDQFKKENQRLKDE 258
+ Q E R +E
Sbjct: 285 YNQASQGVSELTRELNE 301
Score = 27.4 bits (61), Expect = 8.0
Identities = 19/81 (23%), Positives = 35/81 (43%), Gaps = 16/81 (19%)
Query: 188 EESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELKE--- 244
E+ E R TLS+ +E+ NQ + RE + +L +++ E++E
Sbjct: 265 EQLVQEYREARRTLSQVQEKYNQASQG------VSELTRELNEISEELEQVKQEMEERGA 318
Query: 245 -------LDQFKKENQRLKDE 258
L + K+ +LK+E
Sbjct: 319 SMSDGSPLVKIKQAITKLKEE 339
>gnl|CDD|213400 cd12193, B_zip1, basic leucine zipper DNA-binding and
multimerization region of GCN4 and related proteins.
Basic leucine zipper (bZIP) transcription factors act in
networks of homo- and hetero-dimers in the regulation in
a diverse set of cellular pathways. Classical leucine
zippers have alpha helices with leucine residues 7 amino
acids apart, which stabilize dimerization with a
parallel leucine zipper domain. Dimerization creates a
pair of basic regions that bind DNA and undergo
conformational change. GCN4 was identified in
Saccharomyces cerevisiae from mutations in a deficiency
in activation with the general amino acid control
pathway. GCN4 encodes a trans-activator of amino acid
biosynthetic genes containing 2 acidic activation
domains and a C-terminal bZIP domain, comprised of a
basic alpha-helical DNA-binding region and a coiled-coil
dimerization region.
Length = 54
Score = 25.3 bits (56), Expect = 7.8
Identities = 13/48 (27%), Positives = 22/48 (45%), Gaps = 2/48 (4%)
Query: 217 KNSLSEQEKRE--RRTMERKLSEMEVELKELDQFKKENQRLKDENGAL 262
+ RE RR+ RK+ M+ ++++ EN L+DE L
Sbjct: 2 PRARKRARNREAARRSRARKMERMKQLEDKVEELTSENSDLEDEVDRL 49
>gnl|CDD|237853 PRK14897, PRK14897, unknown domain/DNA-directed RNA polymerase
subunit A'' fusion protein; Provisional.
Length = 509
Score = 27.9 bits (62), Expect = 7.8
Identities = 21/90 (23%), Positives = 43/90 (47%), Gaps = 5/90 (5%)
Query: 177 ENGELDYKKLYEESQAELSRLRETLSKTEEE----LNQVKATYSKNSLSEQEKRERRTME 232
EN + Y +E ++ TL K E ++++ + N + +++K +T+
Sbjct: 17 ENVPVYYAVDFE-VPKNITDGYITLKKKEFTYHVIISKIAPYSNSNGIIKKKKPVLKTVL 75
Query: 233 RKLSEMEVELKELDQFKKENQRLKDENGAL 262
SE ++E +L +FK+ R+ DEN +
Sbjct: 76 EIESEEKIEAIDLMEFKRLFGRILDENMSF 105
>gnl|CDD|216102 pfam00756, Esterase, Putative esterase. This family contains
Esterase D. However it is not clear if all members of
the family have the same function. This family is
related to the pfam00135 family.
Length = 245
Score = 27.4 bits (61), Expect = 8.1
Identities = 12/52 (23%), Positives = 21/52 (40%), Gaps = 7/52 (13%)
Query: 45 PYLNSSYPSY-------GSTYGGTSSGYGGLTLPSSYSRPSQSSPRSYHAPP 89
P L++++P+ G + GG + Y L P + S SP +
Sbjct: 98 PLLDANFPTAPDGRALAGQSMGGLGALYLALKYPDLFGSVSSFSPILNPSNS 149
>gnl|CDD|191268 pfam05405, Mt_ATP-synt_B, Mitochondrial ATP synthase B chain
precursor (ATP-synt_B). The Fo sector of the ATP
synthase is a membrane bound complex which mediates
proton transport. It is composed of nine different
polypeptide subunits (a, b, c, d, e, f, g F6, A6L).
Length = 163
Score = 26.9 bits (60), Expect = 8.3
Identities = 17/56 (30%), Positives = 27/56 (48%), Gaps = 6/56 (10%)
Query: 197 LRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELKELDQFKKEN 252
L + + K ++ELNQ + L E+ +ER +KL + E K L + KE
Sbjct: 42 LDKRIEKIQDELNQSRN------LHEKALKERIEYVKKLQSLVEETKVLFEVSKET 91
>gnl|CDD|218661 pfam05622, HOOK, HOOK protein. This family consists of several
HOOK1, 2 and 3 proteins from different eukaryotic
organisms. The different members of the human gene
family are HOOK1, HOOK2 and HOOK3. Different domains
have been identified in the three human HOOK proteins,
and it was demonstrated that the highly conserved
NH2-domain mediates attachment to microtubules, whereas
the central coiled-coil motif mediates homodimerisation
and the more divergent C-terminal domains are involved
in binding to specific organelles (organelle-binding
domains). It has been demonstrated that endogenous HOOK3
binds to Golgi membranes, whereas both HOOK1 and HOOK2
are localised to discrete but unidentified cellular
structures. In mice the Hook1 gene is predominantly
expressed in the testis. Hook1 function is necessary for
the correct positioning of microtubular structures
within the haploid germ cell. Disruption of Hook1
function in mice causes abnormal sperm head shape and
fragile attachment of the flagellum to the sperm head.
Length = 713
Score = 27.5 bits (61), Expect = 8.5
Identities = 19/78 (24%), Positives = 36/78 (46%), Gaps = 6/78 (7%)
Query: 194 LSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELKELDQFKKENQ 253
+ L+ L + +EE +++A + + + +E++L+E++ EL E+Q
Sbjct: 241 YNLLQSQLEQLQEENFRLEAA------RDDYRIKCEELEKELAELQHRNDELTSLAAESQ 294
Query: 254 RLKDENGALIRVISKLSK 271
LKDE L K K
Sbjct: 295 ALKDEIDVLRESSDKAKK 312
>gnl|CDD|232965 TIGR00414, serS, seryl-tRNA synthetase. This model represents the
seryl-tRNA synthetase found in most organisms. This
protein is a class II tRNA synthetase, and is recognized
by the pfam model tRNA-synt_2b. The seryl-tRNA
synthetases of two archaeal species, Methanococcus
jannaschii and Methanobacterium thermoautotrophicum,
differ considerably and are included in a different
model [Protein synthesis, tRNA aminoacylation].
Length = 418
Score = 27.3 bits (61), Expect = 8.7
Identities = 23/103 (22%), Positives = 43/103 (41%), Gaps = 16/103 (15%)
Query: 171 SRKLPNENGELDYKKLYEESQAELS----------RLRETLSKTEEELNQVKATYSKNSL 220
RKL N +L + L + ++ LS+ EE L + SK +
Sbjct: 3 DRKLLRNNPDLVKESLKARGLSVDIDLEKLIALDDERKKLLSEIEE-LQAKRNELSK-QI 60
Query: 221 SEQEKRERRTME---RKLSEMEVELKELD-QFKKENQRLKDEN 259
+ + +++ +E ++L E++ EL EL K L+D+
Sbjct: 61 GKAKGQKKDKIEEIKKELKELKEELTELSAALKALEAELQDKL 103
>gnl|CDD|206365 pfam14197, Cep57_CLD_2, Centrosome localisation domain of PPC89.
The N-terminal region of the fission yeast spindle pole
body protein PPC89 has low similarity to the human Cep57
protein. The CLD or centrosome localisation domain of
Cep57 and PPC89 is found at the N-terminus. This region
localises to the centrosome internally to gamma-tubulin,
suggesting that it is either on both centrioles or on a
centromatrix component. This N-terminal region can also
multimerise with the N-terminus of other Cep57
molecules. The C-terminal part, Family Cep57_MT_bd,
pfam06657, is the microtubule-binding region of Cep57
and PPC89.
Length = 69
Score = 25.3 bits (56), Expect = 8.7
Identities = 18/68 (26%), Positives = 26/68 (38%), Gaps = 10/68 (14%)
Query: 196 RLRETLSKTEEELNQVKATYSKNSLSEQEK----RERRTMERKLSEMEVELKELDQFKKE 251
RL L+ K ++ E E RER + R+L V E + K E
Sbjct: 2 RLEAENLTLRNRLDSAT---RKVAVHEIELKRLNRERDSALRQLG---VAYLECQELKAE 55
Query: 252 NQRLKDEN 259
N+ L+ E
Sbjct: 56 NEALRKEL 63
>gnl|CDD|205988 pfam13815, Dzip-like_N, Iguana/Dzip1-like DAZ-interacting protein
N-terminal. The DAZ gene-product - Deleted in
Azoospermia - and a closely related sequence are
required early in germ-cell development in order to
maintain germ-cell populations. This family is the
N-terminal region that is the only part of the protein
in some fungi and lower metazoa.
Length = 118
Score = 26.5 bits (59), Expect = 8.8
Identities = 11/46 (23%), Positives = 22/46 (47%)
Query: 214 TYSKNSLSEQEKRERRTMERKLSEMEVELKELDQFKKENQRLKDEN 259
+S+ L++ ++ L E E +EL + +E + LK E+
Sbjct: 72 LHSQECLTQSNSELEERLQEALQEAEEREQELSKQSEELKSLKRES 117
>gnl|CDD|189762 pfam00901, Orbi_VP5, Orbivirus outer capsid protein VP5.
cryoelectron microscopy indicates that VP5 is a trimer
implying that there are 360 copies of VP5 per virion.
Length = 507
Score = 27.3 bits (61), Expect = 9.0
Identities = 13/43 (30%), Positives = 26/43 (60%)
Query: 229 RTMERKLSEMEVELKELDQFKKENQRLKDENGALIRVISKLSK 271
+ ++RKL E+E E KE + +K N+++ ++ G + + K K
Sbjct: 87 QGLQRKLKELEREQKEEEVREKHNKKIIEKFGEDLEEVYKFMK 129
>gnl|CDD|182330 PRK10246, PRK10246, exonuclease subunit SbcC; Provisional.
Length = 1047
Score = 27.5 bits (61), Expect = 9.3
Identities = 16/70 (22%), Positives = 32/70 (45%), Gaps = 5/70 (7%)
Query: 191 QAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELKELDQFKK 250
Q L++L+ + +E Q A ++ +EK ++ + + E E +K+L
Sbjct: 439 QKRLAQLQVAIQNVTQEQTQRNAALNEMRQRYKEKTQQLADVKTICEQEARIKDL----- 493
Query: 251 ENQRLKDENG 260
E QR + + G
Sbjct: 494 EAQRAQLQAG 503
>gnl|CDD|221371 pfam12004, DUF3498, Domain of unknown function (DUF3498). This
presumed domain is functionally uncharacterized. This
domain is found in eukaryotes. This domain is typically
between 433 to 538 amino acids in length. This domain is
found associated with pfam00616, pfam00168. This domain
has two conserved sequence motifs: DLQ and PLSFQNP.
Length = 489
Score = 27.4 bits (60), Expect = 9.4
Identities = 44/234 (18%), Positives = 69/234 (29%), Gaps = 18/234 (7%)
Query: 23 SSSPSPQSRNLSSYSSGLSSSTPYLNSSYPSYGSTYGGTSSGYGGLTLPSSY-SRPSQSS 81
S S + S + S S S S + NS + + L + P Q+S
Sbjct: 196 SGSENHSSASSHSNSEDCSLSAGFSNSGGEEFTRRSTDFTRRQLSLPDRQHQPALPRQNS 255
Query: 82 PRSYHAPPSSYRSSYAPTTTPLPPVRPRRSKSSTRSANYTPEGGG---RMSRSGSTTSLN 138
+PP S + G R S S
Sbjct: 256 AGPQRRVDQPSPPGGGSHRGRIPPSLLSSLPSEGSMLSSEWPQSGARPRQQSSSSKGDSP 315
Query: 139 SLRSEAPSEGYESGGDNLSTSRSRVNSESGGSSRKLPNENGE-LDYKKLYEESQAELSRL 197
LR A S + + R+ + E E + E+ + E++RL
Sbjct: 316 ELRPAAGHLQQPSPVNMSALERTAAWVLMNSQYEDIEYETEENREEGTQAEKYEQEIARL 375
Query: 198 RETLS-------------KTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEM 238
+E L +E+ Q + L + E+R RR E K S+M
Sbjct: 376 KERLRVSVRRLEEYERRLLGQEQQMQKLLQEYQARLEDSEERLRRQQEEKDSQM 429
>gnl|CDD|177739 PLN00128, PLN00128, Succinate dehydrogenase [ubiquinone]
flavoprotein subunit.
Length = 635
Score = 27.5 bits (61), Expect = 9.5
Identities = 18/60 (30%), Positives = 21/60 (35%), Gaps = 1/60 (1%)
Query: 12 GRTTSRPRSLYSSSPSPQSRNLSSYSS-GLSSSTPYLNSSYPSYGSTYGGTSSGYGGLTL 70
R L SSS S + S ++ ST SSY TY G GG L
Sbjct: 4 RCVARGLRLLASSSASSSLASASLRTALSRFFSTGGGRSSYTIVDHTYDAVVVGAGGAGL 63
>gnl|CDD|183610 PRK12585, PRK12585, putative monovalent cation/H+ antiporter
subunit G; Reviewed.
Length = 197
Score = 27.0 bits (59), Expect = 9.5
Identities = 18/71 (25%), Positives = 36/71 (50%), Gaps = 3/71 (4%)
Query: 188 EESQAELSRLRETLSKTEEELNQVKATYSKNSLSEQEKRERRTMERKLSEMEVELKELDQ 247
E+++ E L E + E E + K ++ ++ +RE +T+E + + E E+ E D+
Sbjct: 129 EKARQEREELEE---RMEWERREEKIDEREDQEEQEREREEQTIEEQSDDSEHEIIEQDE 185
Query: 248 FKKENQRLKDE 258
+ E+ K E
Sbjct: 186 SETESDDDKTE 196
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.300 0.118 0.312
Gapped
Lambda K H
0.267 0.0630 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 12,910,933
Number of extensions: 1162772
Number of successful extensions: 2694
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2349
Number of HSP's successfully gapped: 614
Length of query: 271
Length of database: 10,937,602
Length adjustment: 95
Effective length of query: 176
Effective length of database: 6,723,972
Effective search space: 1183419072
Effective search space used: 1183419072
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 17 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 43 (21.7 bits)
S2: 58 (26.3 bits)