RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy4869
(2435 letters)
>gnl|CDD|238121 cd00200, WD40, WD40 domain, found in a number of eukaryotic
proteins that cover a wide variety of functions
including adaptor/regulatory modules in signal
transduction, pre-mRNA processing and cytoskeleton
assembly; typically contains a GH dipeptide 11-24
residues from its N-terminus and the WD dipeptide at its
C-terminus and is 40 residues long, hence the name WD40;
between GH and WD lies a conserved core; serves as a
stable propeller-like platform to which proteins can
bind either stably or reversibly; forms a propeller-like
structure with several blades where each blade is
composed of a four-stranded anti-parallel b-sheet;
instances with few detectable copies are hypothesized to
form larger structures by dimerization; each WD40
sequence repeat forms the first three strands of one
blade and the last strand in the next blade; the last
C-terminal WD40 repeat completes the blade structure of
the first WD40 repeat to create the closed ring
propeller-structure; residues on the top and bottom
surface of the propeller are proposed to coordinate
interactions with other proteins and/or small ligands; 7
copies of the repeat are present in this alignment.
Length = 289
Score = 235 bits (601), Expect = 2e-69
Identities = 97/293 (33%), Positives = 157/293 (53%), Gaps = 11/293 (3%)
Query: 358 LEGHGGEIFCSKYHPDGQYIASSGYDRQIFIWSVYGECENIGVMSGHTGAVMDLKFSTDG 417
L+GH G + C + PDG+ +A+ D I +W + E + + GHTG V D+ S DG
Sbjct: 5 LKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLET-GELLRTLKGHTGPVRDVAASADG 63
Query: 418 CHIFTCSTDQTLAVWDLEKGQRIKKMKGHSTFVNSCDPVRRGQLLIASGSDDCTVKVWDP 477
++ + S+D+T+ +WDLE G+ ++ + GH+++V+S G++ ++S S D T+KVWD
Sbjct: 64 TYLASGSSDKTIRLWDLETGECVRTLTGHTSYVSSVAFSPDGRI-LSSSSRDKTIKVWDV 122
Query: 478 RKKNQAVSMNN-TYQVTSVAFNDTAECVLTGGIDNDIKMWDLRTNSVVQKLRGHSDTVTG 536
++ T V SVAF+ V + D IK+WDLRT V L GH+ V
Sbjct: 123 ETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTIKLWDLRTGKCVATLTGHTGEVNS 182
Query: 537 LSLSPDGSYILSNAMDNTVRIWDIRPYVPGERCVKVMSGHQHNFEKNLLRCAWSVSGLYV 596
++ SPDG +LS++ D T+++WD+ +C+ + GH E + A+S G +
Sbjct: 183 VAFSPDGEKLLSSSSDGTIKLWDLS----TGKCLGTLRGH----ENGVNSVAFSPDGYLL 234
Query: 597 TAGSADKCVYIWDTTTRRIAYKLPGHNGSVNDVQFHPKEPIIMSASSDKTIYL 649
+GS D + +WD T L GH SV + + P + S S+D TI +
Sbjct: 235 ASGSEDGTIRVWDLRTGECVQTLSGHTNSVTSLAWSPDGKRLASGSADGTIRI 287
Score = 204 bits (521), Expect = 1e-58
Identities = 105/349 (30%), Positives = 154/349 (44%), Gaps = 64/349 (18%)
Query: 1078 LYGHKLPVLSLDMSYDSTLIATGSGDRTVKVWGLDYGDCHKSLLAHEDSVTGVTFVPKTH 1137
L GH V + S D L+ATGSGD T+KVW L+ G+ ++L H V V
Sbjct: 5 LKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASADGT 64
Query: 1138 YFFTTSKDGRVKQWDADNFERIVTLHFFISLYGHKLPVLSLDMSYDSTLIATGSGDRTVK 1197
Y + S D ++ WD + E + TL GH V S+ S D ++++ S D+T+K
Sbjct: 65 YLASGSSDKTIRLWDLETGECVRTLT------GHTSYVSSVAFSPDGRILSSSSRDKTIK 118
Query: 1198 VWGLDYGDCHKSLLAHEDSVTGVTFVPKTHYFFTTSKDGRVKQWDADNFERIVTLHFNPN 1257
VW ++ G C +L H D V V F S DG
Sbjct: 119 VWDVETGKCLTTLRGHTDWVNSVAF----------SPDGTF------------------- 149
Query: 1258 VYLPLQIQVVTGGGDKSVKLWQLELVSVNREADEETKDVSRSHKVLSLLHTRTLKLEEQV 1317
V + D ++KLW L R+ K ++ L T +V
Sbjct: 150 --------VASSSQDGTIKLWDL-----------------RTGKCVATLTGHT----GEV 180
Query: 1318 LCARVSPDSKLLAVSLLDTTVKIFFLDTFKFFISLYGHKLPVLSLDMSYDSTLIATGSGD 1377
SPD + L S D T+K++ L T K +L GH+ V S+ S D L+A+GS D
Sbjct: 181 NSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNSVAFSPDGYLLASGSED 240
Query: 1378 RTVKVWGLDYGDCHKSLLAHEDSVTGVTFVPKTHYFFTTSKDGRVKQWD 1426
T++VW L G+C ++L H +SVT + + P + S DG ++ WD
Sbjct: 241 GTIRVWDLRTGECVQTLSGHTNSVTSLAWSPDGKRLASGSADGTIRIWD 289
Score = 192 bits (489), Expect = 2e-54
Identities = 83/254 (32%), Positives = 132/254 (51%), Gaps = 10/254 (3%)
Query: 397 NIGVMSGHTGAVMDLKFSTDGCHIFTCSTDQTLAVWDLEKGQRIKKMKGHSTFVNSCDPV 456
+ GHTG V + FS DG + T S D T+ VWDLE G+ ++ +KGH+ V
Sbjct: 1 LRRTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAAS 60
Query: 457 RRGQLLIASGSDDCTVKVWDPRKKNQAVSMNN-TYQVTSVAFNDTAECVLTGGIDNDIKM 515
G L + SD T+++WD ++ T V+SVAF+ + + D IK+
Sbjct: 61 ADGTYLASGSSDK-TIRLWDLETGECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKV 119
Query: 516 WDLRTNSVVQKLRGHSDTVTGLSLSPDGSYILSNAMDNTVRIWDIRPYVPGERCVKVMSG 575
WD+ T + LRGH+D V ++ SPDG+++ S++ D T+++WD+R +CV ++G
Sbjct: 120 WDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTIKLWDLR----TGKCVATLTG 175
Query: 576 HQHNFEKNLLRCAWSVSGLYVTAGSADKCVYIWDTTTRRIAYKLPGHNGSVNDVQFHPKE 635
H + A+S G + + S+D + +WD +T + L GH VN V F P
Sbjct: 176 H----TGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNSVAFSPDG 231
Query: 636 PIIMSASSDKTIYL 649
++ S S D TI +
Sbjct: 232 YLLASGSEDGTIRV 245
Score = 191 bits (486), Expect = 5e-54
Identities = 91/253 (35%), Positives = 138/253 (54%), Gaps = 11/253 (4%)
Query: 358 LEGHGGEIFCSKYHPDGQYIASSGYDRQIFIWSVYGECENIGVMSGHTGAVMDLKFSTDG 417
L+GH G + DG Y+AS D+ I +W + E + ++GHT V + FS DG
Sbjct: 47 LKGHTGPVRDVAASADGTYLASGSSDKTIRLWDL-ETGECVRTLTGHTSYVSSVAFSPDG 105
Query: 418 CHIFTCSTDQTLAVWDLEKGQRIKKMKGHSTFVNSCDPVRRGQLLIASGSDDCTVKVWDP 477
+ + S D+T+ VWD+E G+ + ++GH+ +VNS +AS S D T+K+WD
Sbjct: 106 RILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVA-FSPDGTFVASSSQDGTIKLWDL 164
Query: 478 RKKN-QAVSMNNTYQVTSVAFNDTAECVLTGGIDNDIKMWDLRTNSVVQKLRGHSDTVTG 536
R A +T +V SVAF+ E +L+ D IK+WDL T + LRGH + V
Sbjct: 165 RTGKCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNS 224
Query: 537 LSLSPDGSYILSNAMDNTVRIWDIRPYVPGERCVKVMSGHQHNFEKNLLRCAWSVSGLYV 596
++ SPDG + S + D T+R+WD+R CV+ +SGH + ++ AWS G +
Sbjct: 225 VAFSPDGYLLASGSEDGTIRVWDLRT----GECVQTLSGHTN----SVTSLAWSPDGKRL 276
Query: 597 TAGSADKCVYIWD 609
+GSAD + IWD
Sbjct: 277 ASGSADGTIRIWD 289
Score = 185 bits (471), Expect = 6e-52
Identities = 87/320 (27%), Positives = 139/320 (43%), Gaps = 39/320 (12%)
Query: 882 QGHHSEVRALAFSSDNLALVSACA-SQVKIWNRPSLSCLRTIDTGSYALSVC-FVPGDRH 939
+GH V +AFS D L + +K+W+ + LRT+ + + +
Sbjct: 6 KGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASADGTY 65
Query: 940 VLVGTKDGRLLIVDIGAGEILEDIPAHSQELWSVAMLPDQFNPNVYLPLQIQVVTGGGDK 999
+ G+ D + + D+ GE + + H+ + SVA PD +I + + DK
Sbjct: 66 LASGSSDKTIRLWDLETGECVRTLTGHTSYVSSVAFSPDG---------RI-LSSSSRDK 115
Query: 1000 SVKLWQLELVSVNREADEETKDVSRSHKVLSLLHTRTLKLEEQVLCARVSPDSKLLAVSL 1059
++K+W +E + R H + V SPD +A S
Sbjct: 116 TIKVWDVE--------TGKCLTTLRGH-------------TDWVNSVAFSPDGTFVASSS 154
Query: 1060 LDTTVKIFFLDTFKFFISLYGHKLPVLSLDMSYDSTLIATGSGDRTVKVWGLDYGDCHKS 1119
D T+K++ L T K +L GH V S+ S D + + S D T+K+W L G C +
Sbjct: 155 QDGTIKLWDLRTGKCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGT 214
Query: 1120 LLAHEDSVTGVTFVPKTHYFFTTSKDGRVKQWDADNFERIVTLHFFISLYGHKLPVLSLD 1179
L HE+ V V F P + + S+DG ++ WD E + T L GH V SL
Sbjct: 215 LRGHENGVNSVAFSPDGYLLASGSEDGTIRVWDLRTGECVQT------LSGHTNSVTSLA 268
Query: 1180 MSYDSTLIATGSGDRTVKVW 1199
S D +A+GS D T+++W
Sbjct: 269 WSPDGKRLASGSADGTIRIW 288
Score = 184 bits (469), Expect = 1e-51
Identities = 82/279 (29%), Positives = 123/279 (44%), Gaps = 36/279 (12%)
Query: 1168 LYGHKLPVLSLDMSYDSTLIATGSGDRTVKVWGLDYGDCHKSLLAHEDSVTGVTFVPKTH 1227
L GH V + S D L+ATGSGD T+KVW L+ G+ ++L H V V
Sbjct: 5 LKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASADGT 64
Query: 1228 YFFTTSKDGRVKQWDADNFERIVTL----------HFNPNVYLPLQIQVVTGGGDKSVKL 1277
Y + S D ++ WD + E + TL F+P+ + + + DK++K+
Sbjct: 65 YLASGSSDKTIRLWDLETGECVRTLTGHTSYVSSVAFSPDGRI-----LSSSSRDKTIKV 119
Query: 1278 WQLELVSVNREADEETKDVSRSHKVLSLLHTRTLKLEEQVLCARVSPDSKLLAVSLLDTT 1337
W +E + R H + V SPD +A S D T
Sbjct: 120 WDVE--------TGKCLTTLRGH-------------TDWVNSVAFSPDGTFVASSSQDGT 158
Query: 1338 VKIFFLDTFKFFISLYGHKLPVLSLDMSYDSTLIATGSGDRTVKVWGLDYGDCHKSLLAH 1397
+K++ L T K +L GH V S+ S D + + S D T+K+W L G C +L H
Sbjct: 159 IKLWDLRTGKCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGH 218
Query: 1398 EDSVTGVTFVPKTHYFFTTSKDGRVKQWDADNFERIVTL 1436
E+ V V F P + + S+DG ++ WD E + TL
Sbjct: 219 ENGVNSVAFSPDGYLLASGSEDGTIRVWDLRTGECVQTL 257
Score = 136 bits (344), Expect = 5e-35
Identities = 83/274 (30%), Positives = 131/274 (47%), Gaps = 35/274 (12%)
Query: 882 QGHHSEVRALAFSSDNLALVSACA-SQVKIWNRPSLSCLRTIDTG--SYALSVCFVPGDR 938
+GH VR +A S+D L S + +++W+ + C+RT+ TG SY SV F P R
Sbjct: 48 KGHTGPVRDVAASADGTYLASGSSDKTIRLWDLETGECVRTL-TGHTSYVSSVAFSPDGR 106
Query: 939 HVLVGTKDGRLLIVDIGAGEILEDIPAHSQELWSVAMLPDQFNPNVYLPLQIQVVTGGGD 998
+ ++D + + D+ G+ L + H+ + SVA PD V + D
Sbjct: 107 ILSSSSRDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDG----------TFVASSSQD 156
Query: 999 KSVKLWQLELVSVNREADEETKDVSRSHKVLSLLHTRTLKLEEQVLCARVSPDSKLLAVS 1058
++KLW L R+ K ++ L T +V SPD + L S
Sbjct: 157 GTIKLWDL-----------------RTGKCVATLTGHT----GEVNSVAFSPDGEKLLSS 195
Query: 1059 LLDTTVKIFFLDTFKFFISLYGHKLPVLSLDMSYDSTLIATGSGDRTVKVWGLDYGDCHK 1118
D T+K++ L T K +L GH+ V S+ S D L+A+GS D T++VW L G+C +
Sbjct: 196 SSDGTIKLWDLSTGKCLGTLRGHENGVNSVAFSPDGYLLASGSEDGTIRVWDLRTGECVQ 255
Query: 1119 SLLAHEDSVTGVTFVPKTHYFFTTSKDGRVKQWD 1152
+L H +SVT + + P + S DG ++ WD
Sbjct: 256 TLSGHTNSVTSLAWSPDGKRLASGSADGTIRIWD 289
Score = 130 bits (330), Expect = 4e-33
Identities = 67/207 (32%), Positives = 99/207 (47%), Gaps = 45/207 (21%)
Query: 354 PIMLLEGHGGEIFCSKYHPDGQYIASSGYDRQIFIWSV-YGECENIGVMSGHTGAVMDLK 412
+ L GH + + PDG ++ASS D I +W + G+C + ++GHTG V +
Sbjct: 127 CLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTIKLWDLRTGKC--VATLTGHTGEVNSVA 184
Query: 413 FSTDGCHIFTCSTDQTLAVWDLEKGQRIKKMKGHSTFVNSCDPVRRGQLLIASGSDDCTV 472
FS DG + + S+D T+ +WDL G+ + ++GH VNS L+ASGS+D T+
Sbjct: 185 FSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNSVA-FSPDGYLLASGSEDGTI 243
Query: 473 KVWDPRKKNQAVSMNNTYQVTSVAFNDTAECVLTGGIDNDIKMWDLRTNSVVQKLRGHSD 532
+V WDLRT VQ L GH++
Sbjct: 244 RV-----------------------------------------WDLRTGECVQTLSGHTN 262
Query: 533 TVTGLSLSPDGSYILSNAMDNTVRIWD 559
+VT L+ SPDG + S + D T+RIWD
Sbjct: 263 SVTSLAWSPDGKRLASGSADGTIRIWD 289
Score = 112 bits (281), Expect = 1e-26
Identities = 56/202 (27%), Positives = 96/202 (47%), Gaps = 13/202 (6%)
Query: 36 GRFLATGASED-VIIWDLRLAEKALLLPGEKHEALLLPGEKHEVCQLSPNHDSSQLAVAY 94
G +LA+G+S+ + +WDL E L G V ++ + D L+ +
Sbjct: 63 GTYLASGSSDKTIRLWDLETGECV-------RT---LTGHTSYVSSVAFSPDGRILSSSS 112
Query: 95 TNGSLKTFSLDTTDVISTFTGHKSAITVIQYDPLGHRLATGSKDTDIVLWDVVAECGLHR 154
+ ++K + ++T ++T GH + + + P G +A+ S+D I LWD+ +
Sbjct: 113 RDKTIKVWDVETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTIKLWDLRTGKCVAT 172
Query: 155 LSGHKGVITDIRFMSQPGHHFVVSSAKDTFVKIWDADTGDCFKTMAAHLTEVWGVCVMRE 214
L+GH G + + F S G + SS+ D +K+WD TG C T+ H V V +
Sbjct: 173 LTGHTGEVNSVAF-SPDGEKLLSSSS-DGTIKLWDLSTGKCLGTLRGHENGVNSVAFSPD 230
Query: 215 DSYLISGSNDAELKVWNVRDRS 236
L SGS D ++VW++R
Sbjct: 231 GYLLASGSEDGTIRVWDLRTGE 252
Score = 111 bits (280), Expect = 1e-26
Identities = 47/163 (28%), Positives = 82/163 (50%), Gaps = 2/163 (1%)
Query: 71 LPGEKHEVCQLSPNHDSSQLAVAYTNGSLKTFSLDTTDVISTFTGHKSAITVIQYDPLGH 130
L G V ++ + D LA +G++K + L+T +++ T GH + + G
Sbjct: 5 LKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASADGT 64
Query: 131 RLATGSKDTDIVLWDVVAECGLHRLSGHKGVITDIRFMSQPGHHFVVSSAKDTFVKIWDA 190
LA+GS D I LWD+ + L+GH ++ + F P + SS++D +K+WD
Sbjct: 65 YLASGSSDKTIRLWDLETGECVRTLTGHTSYVSSVAFS--PDGRILSSSSRDKTIKVWDV 122
Query: 191 DTGDCFKTMAAHLTEVWGVCVMREDSYLISGSNDAELKVWNVR 233
+TG C T+ H V V + +++ S S D +K+W++R
Sbjct: 123 ETGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTIKLWDLR 165
Score = 109 bits (275), Expect = 8e-26
Identities = 49/197 (24%), Positives = 94/197 (47%), Gaps = 14/197 (7%)
Query: 37 RFLATGASED--VIIWDLRLAEKALLLPGEKHEALLLPGEKHEVCQLSPNHDSSQLAVAY 94
+ + +S D + +WD+ + L G H V ++ + D + +A +
Sbjct: 105 GRILSSSSRDKTIKVWDVETGKCLTTLRG--HTDW--------VNSVAFSPDGTFVASSS 154
Query: 95 TNGSLKTFSLDTTDVISTFTGHKSAITVIQYDPLGHRLATGSKDTDIVLWDVVAECGLHR 154
+G++K + L T ++T TGH + + + P G +L + S D I LWD+ L
Sbjct: 155 QDGTIKLWDLRTGKCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGT 214
Query: 155 LSGHKGVITDIRFMSQPGHHFVVSSAKDTFVKIWDADTGDCFKTMAAHLTEVWGVCVMRE 214
L GH+ + + F P + + S ++D +++WD TG+C +T++ H V + +
Sbjct: 215 LRGHENGVNSVAFS--PDGYLLASGSEDGTIRVWDLRTGECVQTLSGHTNSVTSLAWSPD 272
Query: 215 DSYLISGSNDAELKVWN 231
L SGS D +++W+
Sbjct: 273 GKRLASGSADGTIRIWD 289
Score = 104 bits (262), Expect = 4e-24
Identities = 41/131 (31%), Positives = 64/131 (48%), Gaps = 2/131 (1%)
Query: 110 ISTFTGHKSAITVIQYDPLGHRLATGSKDTDIVLWDVVAECGLHRLSGHKGVITDIRFMS 169
T GH +T + + P G LATGS D I +WD+ L L GH G + D+ +
Sbjct: 2 RRTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASA 61
Query: 170 QPGHHFVVSSAKDTFVKIWDADTGDCFKTMAAHLTEVWGVCVMREDSYLISGSNDAELKV 229
++ S + D +++WD +TG+C +T+ H + V V + L S S D +KV
Sbjct: 62 D--GTYLASGSSDKTIRLWDLETGECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKV 119
Query: 230 WNVRDRSDIDT 240
W+V + T
Sbjct: 120 WDVETGKCLTT 130
Score = 102 bits (255), Expect = 3e-23
Identities = 50/140 (35%), Positives = 77/140 (55%), Gaps = 1/140 (0%)
Query: 1307 HTRTLKL-EEQVLCARVSPDSKLLAVSLLDTTVKIFFLDTFKFFISLYGHKLPVLSLDMS 1365
RTLK V C SPD KLLA D T+K++ L+T + +L GH PV + S
Sbjct: 1 LRRTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAAS 60
Query: 1366 YDSTLIATGSGDRTVKVWGLDYGDCHKSLLAHEDSVTGVTFVPKTHYFFTTSKDGRVKQW 1425
D T +A+GS D+T+++W L+ G+C ++L H V+ V F P ++S+D +K W
Sbjct: 61 ADGTYLASGSSDKTIRLWDLETGECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVW 120
Query: 1426 DADNFERIVTLHICSCSLNS 1445
D + + + TL + +NS
Sbjct: 121 DVETGKCLTTLRGHTDWVNS 140
Score = 74.3 bits (183), Expect = 7e-14
Identities = 43/158 (27%), Positives = 80/158 (50%), Gaps = 13/158 (8%)
Query: 33 NQEGRFLATGASED-VIIWDLRLAEKALLLPGEKHEALLLPGEKHEVCQLSPNHDSSQLA 91
+ +G F+A+ + + + +WDLR K A L G EV ++ + D +L
Sbjct: 144 SPDGTFVASSSQDGTIKLWDLR---------TGKCVATL-TGHTGEVNSVAFSPDGEKLL 193
Query: 92 VAYTNGSLKTFSLDTTDVISTFTGHKSAITVIQYDPLGHRLATGSKDTDIVLWDVVAECG 151
+ ++G++K + L T + T GH++ + + + P G+ LA+GS+D I +WD+
Sbjct: 194 SSSSDGTIKLWDLSTGKCLGTLRGHENGVNSVAFSPDGYLLASGSEDGTIRVWDLRTGEC 253
Query: 152 LHRLSGHKGVITDIRFMSQPGHHFVVSSAKDTFVKIWD 189
+ LSGH +T + + P + S + D ++IWD
Sbjct: 254 VQTLSGHTNSVTSLAWS--PDGKRLASGSADGTIRIWD 289
Score = 65.0 bits (159), Expect = 9e-11
Identities = 26/82 (31%), Positives = 40/82 (48%), Gaps = 1/82 (1%)
Query: 352 FAPIMLLEGHGGEIFCSKYHPDGQYIASSGYDRQIFIWSVYGECENIGVMSGHTGAVMDL 411
+ L GH + + PDG +AS D I +W + E + +SGHT +V L
Sbjct: 209 GKCLGTLRGHENGVNSVAFSPDGYLLASGSEDGTIRVWDL-RTGECVQTLSGHTNSVTSL 267
Query: 412 KFSTDGCHIFTCSTDQTLAVWD 433
+S DG + + S D T+ +WD
Sbjct: 268 AWSPDGKRLASGSADGTIRIWD 289
Score = 58.5 bits (142), Expect = 1e-08
Identities = 30/123 (24%), Positives = 56/123 (45%), Gaps = 10/123 (8%)
Query: 23 NCNVVFVTLKNQEGRFLATGASEDVIIWDLRLAEKALLLPGEKHEALLLPGEKHEVCQLS 82
V V + L++ + + +WDL L G ++ V ++
Sbjct: 177 TGEVNSVAFSPDGEKLLSSSSDGTIKLWDLS----------TGKCLGTLRGHENGVNSVA 226
Query: 83 PNHDSSQLAVAYTNGSLKTFSLDTTDVISTFTGHKSAITVIQYDPLGHRLATGSKDTDIV 142
+ D LA +G+++ + L T + + T +GH +++T + + P G RLA+GS D I
Sbjct: 227 FSPDGYLLASGSEDGTIRVWDLRTGECVQTLSGHTNSVTSLAWSPDGKRLASGSADGTIR 286
Query: 143 LWD 145
+WD
Sbjct: 287 IWD 289
Score = 32.7 bits (75), Expect = 2.0
Identities = 12/46 (26%), Positives = 20/46 (43%)
Query: 195 CFKTMAAHLTEVWGVCVMREDSYLISGSNDAELKVWNVRDRSDIDT 240
+T+ H V V + L +GS D +KVW++ + T
Sbjct: 1 LRRTLKGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRT 46
>gnl|CDD|215726 pfam00112, Peptidase_C1, Papain family cysteine protease.
Length = 213
Score = 133 bits (338), Expect = 5e-35
Identities = 71/268 (26%), Positives = 96/268 (35%), Gaps = 89/268 (33%)
Query: 2174 IPETFDAREEWPQCKDVIGKVWDQGACQSCWVSHQPRTAGLKGLFSFIKYGQGQERTLSV 2233
+PE+FD RE K + V DQG C SCW
Sbjct: 1 LPESFDWRE-----KGAVTPVKDQGQCGSCW----------------------------- 26
Query: 2234 WDKAISAASVMSDRICIQSKGQVKPILSPQHLICSCTNCTRMHTKTPMSMCMGGDSAAAW 2293
A SA + R CI++ V LS Q L+ C T + C GG A+
Sbjct: 27 ---AFSAVGALEGRYCIKTGKLV--SLSEQQLV-DC--DTGNNG------CNGGLPDNAF 72
Query: 2294 MYWI-NAGLVDGGDY---GTH-----DVSMGRYIEGIGHAASVMGSSNPE---------- 2334
Y N G+V DY S +Y + I V N E
Sbjct: 73 EYIKKNGGIVTESDYPYTAHDGTCKFKKSNSKYAK-IKGYGDV--PYNDEEALQAALAKN 129
Query: 2335 ------VNNFEKVIRLYS--------CEGSINPRYIHSVKIIGWGKSSQNEPYWLCTNSY 2380
++ +E +LY C G ++ H+V I+G+G + PYW+ NS+
Sbjct: 130 GPVSVAIDAYEDDFQLYKSGVYKHTECSGELD----HAVLIVGYGTEN-GVPYWIVKNSW 184
Query: 2381 NQGWGEQGLFKIRRGVNMCSIEDSVMAG 2408
WGE G F+I RGVN C I
Sbjct: 185 GTDWGENGYFRIARGVNECGIASEASYP 212
>gnl|CDD|225201 COG2319, COG2319, FOG: WD40 repeat [General function prediction
only].
Length = 466
Score = 137 bits (346), Expect = 7e-34
Identities = 104/309 (33%), Positives = 159/309 (51%), Gaps = 20/309 (6%)
Query: 349 SNLFAPIMLLEGHGGEIFCSKYHPDGQYIAS-SGYDRQIFIWSVYGECENIGVMSGHTGA 407
S I LEGH + + PDG+ +AS S D I +W + + + ++GHT
Sbjct: 142 STPGKLIRTLEGHSESVTSLAFSPDGKLLASGSSLDGTIKLWDLRTG-KPLSTLAGHTDP 200
Query: 408 VMDLKFSTDG-CHIFTCSTDQTLAVWDLEKGQRIK-KMKGHSTFVNSCDPVRRGQLLIAS 465
V L FS DG I + S+D T+ +WDL G+ ++ + GHS V S L+AS
Sbjct: 201 VSSLAFSPDGGLLIASGSSDGTIRLWDLSTGKLLRSTLSGHSDSVVSS--FSPDGSLLAS 258
Query: 466 GSDDCTVKVWDPRKKNQAVSMN--NTYQVTSVAFNDTAECVLTGGIDNDIKMWDLRTNSV 523
GS D T+++WD R + + ++ V SVAF+ + + +G D +++WDL T +
Sbjct: 259 GSSDGTIRLWDLRSSSSLLRTLSGHSSSVLSVAFSPDGKLLASGSSDGTVRLWDLETGKL 318
Query: 524 VQ--KLRGHSDTVTGLSLSPDGSYILSNAM-DNTVRIWDIRPYVPGERCVKVMSGHQHNF 580
+ L+GH V+ LS SPDGS ++S D T+R+WD+R + +K + GH
Sbjct: 319 LSSLTLKGHEGPVSSLSFSPDGSLLVSGGSDDGTIRLWDLR----TGKPLKTLEGH---- 370
Query: 581 EKNLLRCAWSVSGLYVTAGSADKCVYIWDTTTRRIAYKLPGHNGSVNDVQFHPKEPIIMS 640
N+L ++S G V++GS D V +WD +T + L GH V + F P + S
Sbjct: 371 -SNVLSVSFSPDGRVVSSGSTDGTVRLWDLSTGSLLRNLDGHTSRVTSLDFSPDGKSLAS 429
Query: 641 ASSDKTIYL 649
SSD TI L
Sbjct: 430 GSSDNTIRL 438
Score = 135 bits (341), Expect = 3e-33
Identities = 117/427 (27%), Positives = 178/427 (41%), Gaps = 41/427 (9%)
Query: 1023 SRSHKVLSLLHTRTLKLEEQVLCARVSPDSKLLAVSLLDTTVKIFFLDTFKFFISLY--G 1080
+ L E+ + SPD +LL D T+K++ LD + I
Sbjct: 48 DSLVSLPDLSSLLLRGHEDSITSIAFSPDGELLLSGSSDGTIKLWDLDNGEKLIKSLEGL 107
Query: 1081 HKLPVLSLDMSY---DSTLIATGSGDRTVKVWGL-DYGDCHKSLLAHEDSVTGVTFVPKT 1136
H V L +S +S L+A+ S D TVK+W L G ++L H +SVT + F P
Sbjct: 108 HDSSVSKLALSSPDGNSILLASSSLDGTVKLWDLSTPGKLIRTLEGHSESVTSLAFSPDG 167
Query: 1137 HYFFTTS-KDGRVKQWDADNFERIVTLHFFISLYGHKLPVLSLDMSYDST-LIATGSGDR 1194
+ S DG +K WD + + TL GH PV SL S D LIA+GS D
Sbjct: 168 KLLASGSSLDGTIKLWDLRTGKPLSTLA------GHTDPVSSLAFSPDGGLLIASGSSDG 221
Query: 1195 TVKVWGLDYGDCHKSLLAHEDSVTGVTFVPKTHYFFTTSKDGRVKQWDADNFERIV---- 1250
T+++W L G +S L+ +F P + S DG ++ WD + ++
Sbjct: 222 TIRLWDLSTGKLLRSTLSGHSDSVVSSFSPDGSLLASGSSDGTIRLWDLRSSSSLLRTLS 281
Query: 1251 --TLHFNPNVYLPLQIQVVTGGGDKSVKLWQLELVSVNREADEETKDVSRSHKVLSLLHT 1308
+ + P + +G D +V+LW LE + +
Sbjct: 282 GHSSSVLSVAFSPDGKLLASGSSDGTVRLWDLETGKLLSSLTLKGH-------------- 327
Query: 1309 RTLKLEEQVLCARVSPDSKLLAVSL-LDTTVKIFFLDTFKFFISLYGHKLPVLSLDMSYD 1367
E V SPD LL D T++++ L T K +L GH VLS+ S D
Sbjct: 328 -----EGPVSSLSFSPDGSLLVSGGSDDGTIRLWDLRTGKPLKTLEGHS-NVLSVSFSPD 381
Query: 1368 STLIATGSGDRTVKVWGLDYGDCHKSLLAHEDSVTGVTFVPKTHYFFTTSKDGRVKQWDA 1427
++++GS D TV++W L G ++L H VT + F P + S D ++ WD
Sbjct: 382 GRVVSSGSTDGTVRLWDLSTGSLLRNLDGHTSRVTSLDFSPDGKSLASGSSDNTIRLWDL 441
Query: 1428 DNFERIV 1434
+ V
Sbjct: 442 KTSLKSV 448
Score = 129 bits (324), Expect = 4e-31
Identities = 114/471 (24%), Positives = 194/471 (41%), Gaps = 43/471 (9%)
Query: 102 FSLDTTDVISTFTGHKSAITVIQYDPLGHRLATGSKDTDIVLWDVVAECGLHRLSGHKGV 161
S + + ++ + G L D+ + L D+ L GH+
Sbjct: 12 KSKLLKKSELGPSLNSLSLLSLGSSESGILLLALLSDSLVSLPDL----SSLLLRGHEDS 67
Query: 162 ITDIRFMSQPGHHFVVSSAKDTFVKIWDADTGDCFKTM--AAHLTEVWGVCVMREDSYLI 219
IT I F P ++S + D +K+WD D G+ H + V + + D I
Sbjct: 68 ITSIAF--SPDGELLLSGSSDGTIKLWDLDNGEKLIKSLEGLHDSSVSKLALSSPDGNSI 125
Query: 220 ---SGSNDAELKVWNVRDRSDIDTEDKDKLSEQLNQLLLSEDEPDLTVSKIEVQIINELK 276
S S D +K+W++ + L + PD + + +K
Sbjct: 126 LLASSSLDGTVKLWDLST----PGKLIRTLEGHSESVTSLAFSPDGKLLASGSSLDGTIK 181
Query: 277 NLSTGKKKWLQVFRLALCISSITLNIDDFAFGIDTTQELRTRSMKRKNDEVTVYDREKNY 336
K L + T + AF D + + S + ++D
Sbjct: 182 LWDLRTGKPLSTL------AGHTDPVSSLAFSPDGGLLIASGSSDGT---IRLWDLSTGK 232
Query: 337 KVQKVQKDVGRTSNLFAPIMLLEGHGGEIFCSKYHPDGQYIASSGYDRQIFIWSVYGECE 396
++ L GH + S + PDG +AS D I +W +
Sbjct: 233 LLRST----------------LSGHSDSV-VSSFSPDGSLLASGSSDGTIRLWDLRSSSS 275
Query: 397 NIGVMSGHTGAVMDLKFSTDGCHIFTCSTDQTLAVWDLEKGQRIKKMK--GHSTFVNSCD 454
+ +SGH+ +V+ + FS DG + + S+D T+ +WDLE G+ + + GH V+S
Sbjct: 276 LLRTLSGHSSSVLSVAFSPDGKLLASGSSDGTVRLWDLETGKLLSSLTLKGHEGPVSSLS 335
Query: 455 PVRRGQLLIASGSDDCTVKVWDPRKKNQAVSMNNTYQVTSVAFNDTAECVLTGGIDNDIK 514
G LL++ GSDD T+++WD R ++ V SV+F+ V +G D ++
Sbjct: 336 FSPDGSLLVSGGSDDGTIRLWDLRTGKPLKTLEGHSNVLSVSFSPDGRVVSSGSTDGTVR 395
Query: 515 MWDLRTNSVVQKLRGHSDTVTGLSLSPDGSYILSNAMDNTVRIWDIRPYVP 565
+WDL T S+++ L GH+ VT L SPDG + S + DNT+R+WD++ +
Sbjct: 396 LWDLSTGSLLRNLDGHTSRVTSLDFSPDGKSLASGSSDNTIRLWDLKTSLK 446
Score = 127 bits (319), Expect = 2e-30
Identities = 118/431 (27%), Positives = 182/431 (42%), Gaps = 60/431 (13%)
Query: 978 DQFNPNVYLPLQIQVVTGGGDKSVKLWQLELVSVNREADEETKDVSRSHKVLSLLHTRTL 1037
D + P +++G D ++KLW L+ + + SL
Sbjct: 66 DSITSIAFSPDGELLLSGSSDGTIKLWDLD---------------NGEKLIKSLEGLHDS 110
Query: 1038 KLEEQVLCARVSPDSK--LLAVSLLDTTVKIFFLDT-FKFFISLYGHKLPVLSLDMS-YD 1093
+ + A SPD LLA S LD TVK++ L T K +L GH V SL S
Sbjct: 111 SVSK---LALSSPDGNSILLASSSLDGTVKLWDLSTPGKLIRTLEGHSESVTSLAFSPDG 167
Query: 1094 STLIATGSGDRTVKVWGLDYGDCHKSLLAHEDSVTGVTFVPKTHYFFTT-SKDGRVKQWD 1152
L + S D T+K+W L G +L H D V+ + F P + S DG ++ WD
Sbjct: 168 KLLASGSSLDGTIKLWDLRTGKPLSTLAGHTDPVSSLAFSPDGGLLIASGSSDGTIRLWD 227
Query: 1153 ADNFERIVTLHFFISLYGHKLPVLSLDMSYDSTLIATGSGDRTVKVWGLDYGD-CHKSLL 1211
+ + + +L GH V+S S D +L+A+GS D T+++W L ++L
Sbjct: 228 LSTGKLLRS-----TLSGHSDSVVSS-FSPDGSLLASGSSDGTIRLWDLRSSSSLLRTLS 281
Query: 1212 AHEDSVTGVTFVPKTHYFFTTSKDGRVKQWDADNFERIVTLHFNPN-------VYLPLQI 1264
H SV V F P + S DG V+ WD + + + +L + + P
Sbjct: 282 GHSSSVLSVAFSPDGKLLASGSSDGTVRLWDLETGKLLSSLTLKGHEGPVSSLSFSPDGS 341
Query: 1265 QVVTGGG-DKSVKLWQLELVSVNREADEETKDVSRSHKVLSLLHTRTLKLEEQVLCARVS 1323
+V+GG D +++LW L +TL+ VL S
Sbjct: 342 LLVSGGSDDGTIRLWDLRTGK----------------------PLKTLEGHSNVLSVSFS 379
Query: 1324 PDSKLLAVSLLDTTVKIFFLDTFKFFISLYGHKLPVLSLDMSYDSTLIATGSGDRTVKVW 1383
PD ++++ D TV+++ L T +L GH V SLD S D +A+GS D T+++W
Sbjct: 380 PDGRVVSSGSTDGTVRLWDLSTGSLLRNLDGHTSRVTSLDFSPDGKSLASGSSDNTIRLW 439
Query: 1384 GLDYGDCHKSL 1394
L S
Sbjct: 440 DLKTSLKSVSF 450
Score = 126 bits (316), Expect = 4e-30
Identities = 100/406 (24%), Positives = 168/406 (41%), Gaps = 44/406 (10%)
Query: 852 LLLNNNSLELHSLSLGGSTDSVRHLRSIHAQGHHSEVRALAFSSDNLALVSACASQVKIW 911
LL ++ + L ++ L +H + + L S+ VK+W
Sbjct: 80 LLSGSSDGTIKLWDLDNGEKLIKSLEGLHDSSVSKLALSSPDGNSILLASSSLDGTVKLW 139
Query: 912 NRPSLSCLRTIDTGSYA--LSVCFVPGDRHVLVG-TKDGRLLIVDIGAGEILEDIPAHSQ 968
+ + L G S+ F P + + G + DG + + D+ G+ L + H+
Sbjct: 140 DLSTPGKLIRTLEGHSESVTSLAFSPDGKLLASGSSLDGTIKLWDLRTGKPLSTLAGHTD 199
Query: 969 ELWSVAMLPDQFNPNVYLPLQIQVVTGGGDKSVKLWQLELVSVNREADEETKDVSRSHKV 1028
+ S+A PD + + +G D +++LW D T + RS
Sbjct: 200 PVSSLAFSPD---------GGLLIASGSSDGTIRLW-----------DLSTGKLLRS--- 236
Query: 1029 LSLLHTRTLKLEEQVLCARVSPDSKLLAVSLLDTTVKIFFLDTF-KFFISLYGHKLPVLS 1087
TL + + SPD LLA D T++++ L + +L GH VLS
Sbjct: 237 -------TLSGHSDSVVSSFSPDGSLLASGSSDGTIRLWDLRSSSSLLRTLSGHSSSVLS 289
Query: 1088 LDMSYDSTLIATGSGDRTVKVWGLDYGDC--HKSLLAHEDSVTGVTFVPKTHYFFTT-SK 1144
+ S D L+A+GS D TV++W L+ G +L HE V+ ++F P + S
Sbjct: 290 VAFSPDGKLLASGSSDGTVRLWDLETGKLLSSLTLKGHEGPVSSLSFSPDGSLLVSGGSD 349
Query: 1145 DGRVKQWDADNFERIVTLHFFISLYGHKLPVLSLDMSYDSTLIATGSGDRTVKVWGLDYG 1204
DG ++ WD + + TL VLS+ S D ++++GS D TV++W L G
Sbjct: 350 DGTIRLWDLRTGKPLKTLE-------GHSNVLSVSFSPDGRVVSSGSTDGTVRLWDLSTG 402
Query: 1205 DCHKSLLAHEDSVTGVTFVPKTHYFFTTSKDGRVKQWDADNFERIV 1250
++L H VT + F P + S D ++ WD + V
Sbjct: 403 SLLRNLDGHTSRVTSLDFSPDGKSLASGSSDNTIRLWDLKTSLKSV 448
Score = 122 bits (305), Expect = 1e-28
Identities = 117/429 (27%), Positives = 181/429 (42%), Gaps = 45/429 (10%)
Query: 1023 SRSHKVLSLLHTRTLKLEEQVLCARVSPDSKLLAVSLLDTTVKIFFLDTFKFFISLYGHK 1082
K+L +L S LL L D+ V + L + + L GH+
Sbjct: 10 ENKSKLLKKSELGPSLNSLSLLSLGSSESGILLLALLSDSLVSLPDLSS----LLLRGHE 65
Query: 1083 LPVLSLDMSYDSTLIATGSGDRTVKVWGLDYGDCHKSLL--AHEDSVTGVTFVPKTHYFF 1140
+ S+ S D L+ +GS D T+K+W LD G+ L H+ SV+ +
Sbjct: 66 DSITSIAFSPDGELLLSGSSDGTIKLWDLDNGEKLIKSLEGLHDSSVSKLALSSPDGNSI 125
Query: 1141 ---TTSKDGRVKQWDADNFERIVTLHFFISLYGHKLPVLSLDMS-YDSTLIATGSGDRTV 1196
++S DG VK WD +++ +L GH V SL S L + S D T+
Sbjct: 126 LLASSSLDGTVKLWDLSTPGKLIR-----TLEGHSESVTSLAFSPDGKLLASGSSLDGTI 180
Query: 1197 KVWGLDYGDCHKSLLAHEDSVTGVTFVPKTHYFFTT-SKDGRVKQWDADNFERIVTLHFN 1255
K+W L G +L H D V+ + F P + S DG ++ WD + + +
Sbjct: 181 KLWDLRTGKPLSTLAGHTDPVSSLAFSPDGGLLIASGSSDGTIRLWDLSTGKLLRSTLSG 240
Query: 1256 P-----NVYLPLQIQVVTGGGDKSVKLWQLELVSVNREADEETKDVSRSHKVLSLLHTRT 1310
+ + P + +G D +++LW D+ S +L L +
Sbjct: 241 HSDSVVSSFSPDGSLLASGSSDGTIRLW----------------DLRSSSSLLRTLSGHS 284
Query: 1311 LKLEEQVLCARVSPDSKLLAVSLLDTTVKIFFLDT--FKFFISLYGHKLPVLSLDMSYDS 1368
VL SPD KLLA D TV+++ L+T ++L GH+ PV SL S D
Sbjct: 285 ----SSVLSVAFSPDGKLLASGSSDGTVRLWDLETGKLLSSLTLKGHEGPVSSLSFSPDG 340
Query: 1369 TLIATG-SGDRTVKVWGLDYGDCHKSLLAHEDSVTGVTFVPKTHYFFTTSKDGRVKQWDA 1427
+L+ +G S D T+++W L G K+L H +V V+F P + S DG V+ WD
Sbjct: 341 SLLVSGGSDDGTIRLWDLRTGKPLKTLEGH-SNVLSVSFSPDGRVVSSGSTDGTVRLWDL 399
Query: 1428 DNFERIVTL 1436
+ L
Sbjct: 400 STGSLLRNL 408
Score = 116 bits (290), Expect = 9e-27
Identities = 118/486 (24%), Positives = 206/486 (42%), Gaps = 61/486 (12%)
Query: 169 SQPGHHFVVSSAKDTFVKIWDADTGDCFKTMAAHLTEVWGVCVMREDSYLISGSNDAELK 228
S +++ D+ V + D + + H + + + L+SGS+D +K
Sbjct: 35 SSESGILLLALLSDSLVSLPDLSSL----LLRGHEDSITSIAFSPDGELLLSGSSDGTIK 90
Query: 229 VWNVRDRSD-IDTEDKDKLSEQLNQLLLSEDEPDLTVSKIEVQIINELKNLSTGKKKWLQ 287
+W++ + I + + S L S D + ++ + +L +LST
Sbjct: 91 LWDLDNGEKLIKSLEGLHDSSVSKLALSSPDGNSILLASSSLDGTVKLWDLST------- 143
Query: 288 VFRLALCISSITLNIDDFAFGIDTTQELRTRSMKRKNDEVTVYDREKNYKVQKVQKDVGR 347
+L + + ++ AF + S + + ++D
Sbjct: 144 PGKLIRTLEGHSESVTSLAF---SPDGKLLASGSSLDGTIKLWDLRTG------------ 188
Query: 348 TSNLFAPIMLLEGHGGEIFCSKYHPDGQ-YIASSGYDRQIFIWSVYGECENIGVMSGHTG 406
P+ L GH + + PDG IAS D I +W + +SGH+
Sbjct: 189 -----KPLSTLAGHTDPVSSLAFSPDGGLLIASGSSDGTIRLWDLSTGKLLRSTLSGHSD 243
Query: 407 AVMDLKFSTDGCHIFTCSTDQTLAVWDLE-KGQRIKKMKGHSTFVNSCDPVRRGQLLIAS 465
+V+ FS DG + + S+D T+ +WDL ++ + GHS+ V S G+LL AS
Sbjct: 244 SVVSS-FSPDGSLLASGSSDGTIRLWDLRSSSSLLRTLSGHSSSVLSVAFSPDGKLL-AS 301
Query: 466 GSDDCTVKVWDPRKKNQAVSMNNTY---QVTSVAFNDTAECVLTGG-IDNDIKMWDLRTN 521
GS D TV++WD S+ V+S++F+ +++GG D I++WDLRT
Sbjct: 302 GSSDGTVRLWDLETGKLLSSLTLKGHEGPVSSLSFSPDGSLLVSGGSDDGTIRLWDLRTG 361
Query: 522 SVVQKLRGHSDTVTGLSLSPDGSYILSNAMDNTVRIWDIRPYVPGERCVKVMSGHQHNFE 581
++ L GHS V +S SPDG + S + D TVR+WD+ ++ + GH
Sbjct: 362 KPLKTLEGHS-NVLSVSFSPDGRVVSSGSTDGTVRLWDLSTG----SLLRNLDGHTSR-- 414
Query: 582 KNLLRCAWSVSGLYVTAGSADKCVYIWDTTTRRIAYKLPGHNGSVNDVQFHPKEPIIMSA 641
+ +S G + +GS+D + +WD T S+ V F P ++ S
Sbjct: 415 --VTSLDFSPDGKSLASGSSDNTIRLWDLKT------------SLKSVSFSPDGKVLASK 460
Query: 642 SSDKTI 647
SSD ++
Sbjct: 461 SSDLSV 466
Score = 115 bits (289), Expect = 1e-26
Identities = 111/395 (28%), Positives = 171/395 (43%), Gaps = 40/395 (10%)
Query: 1057 VSLLDTTVKIFFLDTFKFFISLYGHKLPVLSLDMSYDSTLIATGSGDRTVKVWGLDYGDC 1116
V T+ + K + + L +LSL S L+ D V + L
Sbjct: 2 VDNSSTSSENKSKLLKKSELGPSLNSLSLLSLGSSESGILLLALLSDSLVSLPDLSSL-- 59
Query: 1117 HKSLLAHEDSVTGVTFVPKTHYFFTTSKDGRVKQWDADNFERIV-TLHFFISLYGHKLPV 1175
L HEDS+T + F P + S DG +K WD DN E+++ +L KL +
Sbjct: 60 --LLRGHEDSITSIAFSPDGELLLSGSSDGTIKLWDLDNGEKLIKSLEGLHDSSVSKLAL 117
Query: 1176 LSLDMSYDSTLIATGSGDRTVKVWGL-DYGDCHKSLLAHEDSVTGVTFVPKTHYFFTTS- 1233
S D +S L+A+ S D TVK+W L G ++L H +SVT + F P + S
Sbjct: 118 SSPD--GNSILLASSSLDGTVKLWDLSTPGKLIRTLEGHSESVTSLAFSPDGKLLASGSS 175
Query: 1234 KDGRVKQWDADNFERIVTLHFNPNVYLPL------QIQVVTGGGDKSVKLWQLELVSVNR 1287
DG +K WD + + TL + + L + + +G D +++LW
Sbjct: 176 LDGTIKLWDLRTGKPLSTLAGHTDPVSSLAFSPDGGLLIASGSSDGTIRLW--------- 226
Query: 1288 EADEETKDVSRSHKVLSLLHTRTLKLEEQVLCARVSPDSKLLAVSLLDTTVKIFFLDTF- 1346
D T + RS TL + + SPD LLA D T++++ L +
Sbjct: 227 --DLSTGKLLRS----------TLSGHSDSVVSSFSPDGSLLASGSSDGTIRLWDLRSSS 274
Query: 1347 KFFISLYGHKLPVLSLDMSYDSTLIATGSGDRTVKVWGLDYGDC--HKSLLAHEDSVTGV 1404
+L GH VLS+ S D L+A+GS D TV++W L+ G +L HE V+ +
Sbjct: 275 SLLRTLSGHSSSVLSVAFSPDGKLLASGSSDGTVRLWDLETGKLLSSLTLKGHEGPVSSL 334
Query: 1405 TFVPKTHYFFTT-SKDGRVKQWDADNFERIVTLHI 1438
+F P + S DG ++ WD + + TL
Sbjct: 335 SFSPDGSLLVSGGSDDGTIRLWDLRTGKPLKTLEG 369
Score = 100 bits (249), Expect = 1e-21
Identities = 88/408 (21%), Positives = 164/408 (40%), Gaps = 39/408 (9%)
Query: 37 RFLATGASEDVIIWDLRLAEKALLLPGEKHEALLLPGEKHEVCQLSPNHDSSQLAVAYTN 96
L+ + + +WDL EK + H+ ++ SP+ +S LA + +
Sbjct: 79 LLLSGSSDGTIKLWDLDNGEKLIKSLEGLHD-----SSVSKLALSSPDGNSILLASSSLD 133
Query: 97 GSLKTFSLDTTDV-ISTFTGHKSAITVIQYDPLGHRLATGS-KDTDIVLWDVVAECGLHR 154
G++K + L T I T GH ++T + + P G LA+GS D I LWD+ L
Sbjct: 134 GTVKLWDLSTPGKLIRTLEGHSESVTSLAFSPDGKLLASGSSLDGTIKLWDLRTGKPLST 193
Query: 155 LSGHKGVITDIRFMSQPGHHFVVSSAKDTFVKIWDADTGDCFKTMAAHLTEVWGVCVMRE 214
L+GH ++ + F S G + S + D +++WD TG ++ + ++ +
Sbjct: 194 LAGHTDPVSSLAF-SPDGGLLIASGSSDGTIRLWDLSTGKLLRSTLSGHSDSVVSSFSPD 252
Query: 215 DSYLISGSNDAELKVWNVRDRSDIDTEDKDKLSEQLNQLLLSEDEPDLTVSKIEVQIINE 274
S L SGS+D +++W++R S S + + S D L + +
Sbjct: 253 GSLLASGSSDGTIRLWDLRS-SSSLLRTLSGHSSSVLSVAFSPDGKLLASGSSDGTV--R 309
Query: 275 LKNLSTGKKKWLQVFRLALCISSITLNIDDFAFGIDTTQELRTRSMKRKNDEVTVYDREK 334
L +L TGK + + +F D + + S + + ++D
Sbjct: 310 LWDLETGKLLSSLTLK------GHEGPVSSLSFSPDGSLLVSGGS---DDGTIRLWDLRT 360
Query: 335 NYKVQKVQKDVGRTSNLFAPIMLLEGHGGEIFCSKYHPDGQYIASSGYDRQIFIWSVYGE 394
++ + + + PDG+ ++S D + +W +
Sbjct: 361 GKPLKTL------------------EGHSNVLSVSFSPDGRVVSSGSTDGTVRLWDLSTG 402
Query: 395 CENIGVMSGHTGAVMDLKFSTDGCHIFTCSTDQTLAVWDLEKGQRIKK 442
+ + GHT V L FS DG + + S+D T+ +WDL+ +
Sbjct: 403 -SLLRNLDGHTSRVTSLDFSPDGKSLASGSSDNTIRLWDLKTSLKSVS 449
Score = 97.5 bits (241), Expect = 1e-20
Identities = 100/439 (22%), Positives = 182/439 (41%), Gaps = 42/439 (9%)
Query: 69 LLLPGEKHEVCQLSPNHDSSQLAVAYTNGSLKTFSLDT-TDVISTFTGH--KSAITVIQY 125
LLL G + + ++ + D L ++G++K + LD +I + G S +
Sbjct: 59 LLLRGHEDSITSIAFSPDGELLLSGSSDGTIKLWDLDNGEKLIKSLEGLHDSSVSKLALS 118
Query: 126 DPLGHRLATGSKDTD--IVLWDVV-AECGLHRLSGHKGVITDIRFMSQPGHHFVVSSAKD 182
P G+ + S D + LWD+ + L GH +T + F G S+ D
Sbjct: 119 SPDGNSILLASSSLDGTVKLWDLSTPGKLIRTLEGHSESVTSLAFSPD-GKLLASGSSLD 177
Query: 183 TFVKIWDADTGDCFKTMAAHLTEVWGVCVMREDSYLI-SGSNDAELKVWNVRDRSDIDTE 241
+K+WD TG T+A H V + + LI SGS+D +++W++ + +
Sbjct: 178 GTIKLWDLRTGKPLSTLAGHTDPVSSLAFSPDGGLLIASGSSDGTIRLWDLSTGKLLRST 237
Query: 242 DKDKLSEQLNQLLLSEDEPDLTVSKIEVQIINELKNLSTGKKKWLQVFRLALCISSITLN 301
++ S D L + I L +L + + + + S+ +
Sbjct: 238 LSGHSDSVVS--SFSPDGSLLASGSSDGTI--RLWDLRSSSSLLRTLSGHSSSVLSVAFS 293
Query: 302 IDDFAFGIDTTQELRTRSMKRKNDEVTVYDREKNYKVQKVQKDVGRTSNLFAPIMLLEGH 361
D + + V ++D E + + L+GH
Sbjct: 294 PDGKLL-----------ASGSSDGTVRLWDLE---------------TGKLLSSLTLKGH 327
Query: 362 GGEIFCSKYHPDGQYIASSG-YDRQIFIWSVYGECENIGVMSGHTGAVMDLKFSTDGCHI 420
G + + PDG + S G D I +W + + + GH+ V+ + FS DG +
Sbjct: 328 EGPVSSLSFSPDGSLLVSGGSDDGTIRLWD-LRTGKPLKTLEGHSN-VLSVSFSPDGRVV 385
Query: 421 FTCSTDQTLAVWDLEKGQRIKKMKGHSTFVNSCDPVRRGQLLIASGSDDCTVKVWDPRKK 480
+ STD T+ +WDL G ++ + GH++ V S D G+ +ASGS D T+++WD +
Sbjct: 386 SSGSTDGTVRLWDLSTGSLLRNLDGHTSRVTSLDFSPDGKS-LASGSSDNTIRLWDLKTS 444
Query: 481 NQAVSMNNTYQVTSVAFND 499
++VS + +V + +D
Sbjct: 445 LKSVSFSPDGKVLASKSSD 463
Score = 88.6 bits (218), Expect = 8e-18
Identities = 84/346 (24%), Positives = 140/346 (40%), Gaps = 42/346 (12%)
Query: 825 TIKTASKTGKIKSVDVILGGGGEIRLALLLNNNSLELHSLSLGG-----STDSVRHLRSI 879
T+K + K + + G + + L SL G + + L ++
Sbjct: 135 TVKLWDLSTPGKLIRTLEGHSESVTSLAFSPDGKLLASGSSLDGTIKLWDLRTGKPLSTL 194
Query: 880 HAQGHHSEVRALAFSSDNLALVSACAS--QVKIWNRPSLSCLRTIDTGSYALSV-CFVPG 936
GH V +LAFS D L+++ +S +++W+ + LR+ +G V F P
Sbjct: 195 A--GHTDPVSSLAFSPDGGLLIASGSSDGTIRLWDLSTGKLLRSTLSGHSDSVVSSFSPD 252
Query: 937 DRHVLVGTKDGRLLIVDIGAG-EILEDIPAHSQELWSVAMLPDQFNPNVYLPLQIQVVTG 995
+ G+ DG + + D+ + +L + HS + SVA PD + +G
Sbjct: 253 GSLLASGSSDGTIRLWDLRSSSSLLRTLSGHSSSVLSVAFSPDGKL----------LASG 302
Query: 996 GGDKSVKLWQLELVSVNREADEETKDVSRSHKVLSLLHTRTLKLEEQVLCARVSPDSKLL 1055
D +V+LW LE + + E V SPD LL
Sbjct: 303 SSDGTVRLWDLETGKLLSSLTLKGH-------------------EGPVSSLSFSPDGSLL 343
Query: 1056 AVSL-LDTTVKIFFLDTFKFFISLYGHKLPVLSLDMSYDSTLIATGSGDRTVKVWGLDYG 1114
D T++++ L T K +L GH VLS+ S D ++++GS D TV++W L G
Sbjct: 344 VSGGSDDGTIRLWDLRTGKPLKTLEGHS-NVLSVSFSPDGRVVSSGSTDGTVRLWDLSTG 402
Query: 1115 DCHKSLLAHEDSVTGVTFVPKTHYFFTTSKDGRVKQWDADNFERIV 1160
++L H VT + F P + S D ++ WD + V
Sbjct: 403 SLLRNLDGHTSRVTSLDFSPDGKSLASGSSDNTIRLWDLKTSLKSV 448
Score = 74.0 bits (180), Expect = 4e-13
Identities = 58/221 (26%), Positives = 101/221 (45%), Gaps = 16/221 (7%)
Query: 29 VTLKNQEGRFLATGASEDVI-IWDLRLAEKALLLPGEKHEALLLPGEKHEVCQLSPNHDS 87
V+ + +G LA+G+S+ I +WDLR L G V ++ + D
Sbjct: 246 VSSFSPDGSLLASGSSDGTIRLWDLR---------SSSSLLRTLSGHSSSVLSVAFSPDG 296
Query: 88 SQLAVAYTNGSLKTFSLDTTDVIS--TFTGHKSAITVIQYDPLGHRLATG-SKDTDIVLW 144
LA ++G+++ + L+T ++S T GH+ ++ + + P G L +G S D I LW
Sbjct: 297 KLLASGSSDGTVRLWDLETGKLLSSLTLKGHEGPVSSLSFSPDGSLLVSGGSDDGTIRLW 356
Query: 145 DVVAECGLHRLSGHKGVITDIRFMSQPGHHFVVSSAKDTFVKIWDADTGDCFKTMAAHLT 204
D+ L L GH V + F P V S + D V++WD TG + + H +
Sbjct: 357 DLRTGKPLKTLEGHSNV-LSVSFS--PDGRVVSSGSTDGTVRLWDLSTGSLLRNLDGHTS 413
Query: 205 EVWGVCVMREDSYLISGSNDAELKVWNVRDRSDIDTEDKDK 245
V + + L SGS+D +++W+++ + D
Sbjct: 414 RVTSLDFSPDGKSLASGSSDNTIRLWDLKTSLKSVSFSPDG 454
Score = 53.9 bits (128), Expect = 8e-07
Identities = 44/156 (28%), Positives = 68/156 (43%), Gaps = 7/156 (4%)
Query: 1297 SRSHKVLSLLHTRTLKLEEQVLCARVSPDSKLLAVSLLDTTVKIFFLDTFKFFISLY--G 1354
+ L E+ + SPD +LL D T+K++ LD + I
Sbjct: 48 DSLVSLPDLSSLLLRGHEDSITSIAFSPDGELLLSGSSDGTIKLWDLDNGEKLIKSLEGL 107
Query: 1355 HKLPVLSLDMSY---DSTLIATGSGDRTVKVWGL-DYGDCHKSLLAHEDSVTGVTFVPKT 1410
H V L +S +S L+A+ S D TVK+W L G ++L H +SVT + F P
Sbjct: 108 HDSSVSKLALSSPDGNSILLASSSLDGTVKLWDLSTPGKLIRTLEGHSESVTSLAFSPDG 167
Query: 1411 HYFFTTS-KDGRVKQWDADNFERIVTLHICSCSLNS 1445
+ S DG +K WD + + TL + ++S
Sbjct: 168 KLLASGSSLDGTIKLWDLRTGKPLSTLAGHTDPVSS 203
>gnl|CDD|214761 smart00645, Pept_C1, Papain family cysteine protease.
Length = 175
Score = 122 bits (309), Expect = 1e-31
Identities = 63/242 (26%), Positives = 89/242 (36%), Gaps = 75/242 (30%)
Query: 2174 IPETFDAREEWPQCKDVIGKVWDQGACQSCWVSHQPRTAGLKGLFSFIKYGQGQERTLSV 2233
+PE+FD R++ + V DQG C SCW
Sbjct: 1 LPESFDWRKKG-----AVTPVKDQGQCGSCW----------------------------- 26
Query: 2234 WDKAISAASVMSDRICIQSKGQVKPILSPQHLICSCTNCTRMHTKTPMSMCMGGDSAAAW 2293
A SA + R CI++ V LS Q L+ +C+ C GG A+
Sbjct: 27 ---AFSATGALEGRYCIKTGKLV--SLSEQQLV----DCSGGGNCG----CNGGLPDNAF 73
Query: 2294 MYWI-NAGLVDGGDYGTHDVSMGRYIEGIGHAASVMGSSNPEVNNFEK----VIRLYSCE 2348
Y N GL Y Y + AS +F+ + C
Sbjct: 74 EYIKKNGGLETESCY--------PYTGSVAIDAS----------DFQFYKSGIYDHPGCG 115
Query: 2349 GSINPRYIHSVKIIGWGKSSQN-EPYWLCTNSYNQGWGEQGLFKIRRGV-NMCSIEDSVM 2406
+ H+V I+G+G +N + YW+ NS+ WGE G F+I RG N C IE SV
Sbjct: 116 ---SGTLDHAVLIVGYGTEVENGKDYWIVKNSWGTDWGENGYFRIARGKNNECGIEASVA 172
Query: 2407 AG 2408
+
Sbjct: 173 SY 174
>gnl|CDD|239111 cd02620, Peptidase_C1A_CathepsinB, Cathepsin B group; composed of
cathepsin B and similar proteins, including
tubulointerstitial nephritis antigen (TIN-Ag). Cathepsin
B is a lysosomal papain-like cysteine peptidase which is
expressed in all tissues and functions primarily as an
exopeptidase through its carboxydipeptidyl activity.
Together with other cathepsins, it is involved in the
degradation of proteins, proenzyme activation, Ag
processing, metabolism and apoptosis. Cathepsin B has
been implicated in a number of human diseases such as
cancer, rheumatoid arthritis, osteoporosis and
Alzheimer's disease. The unique carboxydipeptidyl
activity of cathepsin B is attributed to the presence of
an occluding loop in its active site which favors the
binding of the C-termini of substrate proteins. Some
members of this group do not possess the occluding loop.
TIN-Ag is an extracellular matrix basement protein which
was originally identified as a target Ag involved in
anti-tubular basement membrane antibody-mediated
interstitial nephritis. It plays a role in renal
tubulogenesis and is defective in hereditary
tubulointerstitial disorders. TIN-Ag is exclusively
expressed in kidney tissues. .
Length = 236
Score = 113 bits (285), Expect = 1e-27
Identities = 47/131 (35%), Positives = 57/131 (43%), Gaps = 41/131 (31%)
Query: 2175 PETFDAREEWPQCKDVIGKVWDQGACQSCWVSHQPRTAGLKGLFSFIKYGQGQERTLSVW 2234
PE+FDARE+WP C IG++ DQG C SCW
Sbjct: 1 PESFDAREKWPNCI-SIGEIRDQGNCGSCW------------------------------ 29
Query: 2235 DKAISAASVMSDRICIQSKGQVKPILSPQHLICSCTNCTRMHTKTPMSMCMGGDSAAAWM 2294
A SA SDR+CIQS G+ +LS Q L+ C+ C C GG AAW
Sbjct: 30 --AFSAVEAFSDRLCIQSNGKENVLLSAQDLLSCCSGCGD--------GCNGGYPDAAWK 79
Query: 2295 YWINAGLVDGG 2305
Y G+V GG
Sbjct: 80 YLTTTGVVTGG 90
Score = 95.4 bits (238), Expect = 2e-21
Identities = 29/52 (55%), Positives = 33/52 (63%), Gaps = 1/52 (1%)
Query: 2357 HSVKIIGWGKSSQNEPYWLCTNSYNQGWGEQGLFKIRRGVNMCSIEDSVMAG 2408
H+VKIIGWG PYWL NS+ WGE G F+I RG N C IE V+AG
Sbjct: 186 HAVKIIGWG-VENGVPYWLAANSWGTDWGENGYFRILRGSNECGIESEVVAG 236
Score = 48.4 bits (116), Expect = 2e-05
Identities = 16/29 (55%), Positives = 18/29 (62%)
Query: 1562 GSKVYYVNNSTTDIQKEIMQHGPVQAKFY 1590
G Y V + TDI KEIM +GPVQA F
Sbjct: 134 GKSAYSVPSDETDIMKEIMTNGPVQAAFT 162
>gnl|CDD|239068 cd02248, Peptidase_C1A, Peptidase C1A subfamily (MEROPS database
nomenclature); composed of cysteine peptidases (CPs)
similar to papain, including the mammalian CPs
(cathepsins B, C, F, H, L, K, O, S, V, X and W). Papain
is an endopeptidase with specific substrate preferences,
primarily for bulky hydrophobic or aromatic residues at
the S2 subsite, a hydrophobic pocket in papain that
accommodates the P2 sidechain of the substrate (the
second residue away from the scissile bond). Most members
of the papain subfamily are endopeptidases. Some
exceptions to this rule can be explained by specific
details of the catalytic domains like the occluding loop
in cathepsin B which confers an additional
carboxydipeptidyl activity and the mini-chain of
cathepsin H resulting in an N-terminal exopeptidase
activity. Papain-like CPs have different functions in
various organisms. Plant CPs are used to mobilize storage
proteins in seeds. Parasitic CPs act extracellularly to
help invade tissues and cells, to hatch or to evade the
host immune system. Mammalian CPs are primarily lysosomal
enzymes with the exception of cathepsin W, which is
retained in the endoplasmic reticulum. They are
responsible for protein degradation in the lysosome.
Papain-like CPs are synthesized as inactive proenzymes
with N-terminal propeptide regions, which are removed
upon activation. In addition to its inhibitory role, the
propeptide is required for proper folding of the newly
synthesized enzyme and its stabilization in denaturing pH
conditions. Residues within the propeptide region also
play a role in the transport of the proenzyme to
lysosomes or acidified vesicles. Also included in this
subfamily are proteins classified as non-peptidase
homologs, which lack peptidase activity or have missing
active site residues.
Length = 210
Score = 98.9 bits (247), Expect = 7e-23
Identities = 58/253 (22%), Positives = 83/253 (32%), Gaps = 72/253 (28%)
Query: 2175 PETFDAREEWPQCKDVIGKVWDQGACQSCWVSHQPRTAGLKGLFSFIKYGQGQERTLSVW 2234
PE+ D RE K + V DQG+C SCW
Sbjct: 1 PESVDWRE-----KGAVTPVKDQGSCGSCW------------------------------ 25
Query: 2235 DKAISAASVMSDRICIQSKGQVKPILSPQHLICSCTNCTRMHTKTPMSMCMGGDSAAAWM 2294
A S + I++ LS Q L+ +C+ C GG+ A+
Sbjct: 26 --AFSTVGALEGAYAIKTG--KLVSLSEQQLV----DCSTSGNNG----CNGGNPDNAFE 73
Query: 2295 YWINAGLVDGGDYGTHDVSMGRYIEGIGHAASVMGSSNPEVNNFEKVIR----------- 2343
Y N GL DY A + G SN + E +
Sbjct: 74 YVKNGGLASESDYPYTGKDGTCKYNSSKVGAKITGYSNVPPGDEEALKAALANYGPVSVA 133
Query: 2344 -------------LYSCEGSINPRYIHSVKIIGWGKSSQNEPYWLCTNSYNQGWGEQGLF 2390
+YS N H+V ++G+G + YW+ NS+ WGE+G
Sbjct: 134 IDASSSFQFYKGGIYSGPCCSNTNLNHAVLLVGYG-TENGVDYWIVKNSWGTSWGEKGYI 192
Query: 2391 KIRRGVNMCSIED 2403
+I RG N+C I
Sbjct: 193 RIARGSNLCGIAS 205
>gnl|CDD|218439 pfam05109, Herpes_BLLF1, Herpes virus major outer envelope
glycoprotein (BLLF1). This family consists of the BLLF1
viral late glycoprotein, also termed gp350/220. It is the
most abundantly expressed glycoprotein in the viral
envelope of the Herpesviruses and is the major antigen
responsible for stimulating the production of
neutralising antibodies in vivo.
Length = 830
Score = 96.0 bits (238), Expect = 2e-19
Identities = 69/316 (21%), Positives = 131/316 (41%), Gaps = 28/316 (8%)
Query: 1854 LAATAVAISVIDNYSEIIFTTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESE 1913
L T A + +++F ++ +V+ + + TT P TT + P +
Sbjct: 404 LIITRTATNATTTTHKVVFHKAPDTTKSVIFVYTLVHVEPHKTTAVP----TTPSLPPAS 459
Query: 1914 STTTSSPESESTTTSSLVSESTTT--SSPESESTTTSSPESESTTTSSLVSESTTTSSPE 1971
+ T S ++ T + + ST +SP S +T+ + + T + + ++ T+
Sbjct: 460 TGPTVSTADPTSGTPTGTTSSTLPEDTSPTSRTTSATPNATSPTPAVTTPNATSPTTQKT 519
Query: 1972 SESTTTSSPESE-STTTSSLVSESTTTSSPESEST---TTISPVSEST--TTSSPVSEST 2025
S++ +SP T++ S T T+S + ++ T SPV+ + +S S T
Sbjct: 520 SDTPNATSPTPIVIGVTTTATSPPTGTTSVPNATSPQVTEESPVNNTNTPVVTSAPSVLT 579
Query: 2026 TTISPESESTTTSS----PASESTTTNNPKSESTTTNNP-------ASESITSSSPASES 2074
+ ++ T +S P S++ + P+S ST+T E+IT +P+ S
Sbjct: 580 SAVTTGQHGTGSSPTSQQPGIPSSSHSTPRSNSTSTTPLLTSAHPTGGENITEETPSVPS 639
Query: 2075 T---TTSSPASESTTTS--SPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEE 2129
T +T SP TTS S S+T+ P T P +T+ S+P+ + T +
Sbjct: 640 TTHVSTLSPGPGPGTTSQVSGPGNSSTSRYPGEVHVTEGMPNPNATSPSAPSGQKTAVPT 699
Query: 2130 QGVSPHSEKLSANEDP 2145
+ + E
Sbjct: 700 VTSTGGKANSTTKETS 715
Score = 87.1 bits (215), Expect = 8e-17
Identities = 64/304 (21%), Positives = 111/304 (36%), Gaps = 12/304 (3%)
Query: 1875 NNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSES 1934
+ S ++ S + S +P + S TT TS T++ S
Sbjct: 484 EDTSPTSRTTSATPNATSPTPAVTTPNATSPTTQKTSDTPNATSPTPIVIGVTTTATSPP 543
Query: 1935 TTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVS-E 1993
T T+S + T+ ES ++ T+ S + + TT + S+ TS
Sbjct: 544 TGTTSVPN--ATSPQVTEESPVNNTNTPVVTSAPSVLTSAVTTGQHGTGSSPTSQQPGIP 601
Query: 1994 STTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNN--PK 2051
S++ S+P S ST+T ++ + T T P + +T SP TT+
Sbjct: 602 SSSHSTPRSNSTSTTPLLTSAHPTGGENITEETPSVPSTTHVSTLSPGPGPGTTSQVSGP 661
Query: 2052 SESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPE 2111
S+T+ P +T P +T+ S+P+ + T P ST + ++ T+ S
Sbjct: 662 GNSSTSRYPGEVHVTEGMPNPNATSPSAPSGQKTAV--PTVTSTGGKANSTTKETSGSTL 719
Query: 2112 SESTTTSSPASESTTIEEQGVSPHSEKLSANEDPEEFPNEDVFEHTFAEIP-----NIDH 2166
ST+ + T + S+ P A +P + DH
Sbjct: 720 MASTSPHTNEGAFRTTPYNATTYLPPSTSSKLRPRWTFTSPPVTTKQATVPVPPTQHPDH 779
Query: 2167 SNQT 2170
SN +
Sbjct: 780 SNLS 783
Score = 85.2 bits (210), Expect = 2e-16
Identities = 66/267 (24%), Positives = 111/267 (41%), Gaps = 15/267 (5%)
Query: 1880 STVVMSTLNSLLSENTTTNSPESESTTTN-NPESESTTTSSPESESTTTSSLVSESTTTS 1938
S T +S L E+T SP S +T+ N S + ++P + S TT TS
Sbjct: 471 SGTPTGTTSSTLPEDT---SPTSRTTSATPNATSPTPAVTTPNATSPTTQKTSDTPNATS 527
Query: 1939 SPESESTTTS---SPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSEST 1995
T+ SP + +T+ + S T SP + + T + S TS++ +
Sbjct: 528 PTPIVIGVTTTATSPPTGTTSVPNATSPQVTEESPVNNTNTPVVTSAPSVLTSAVTTGQH 587
Query: 1996 -TTSSPESE-----STTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNN 2049
T SSP S+ S++ +P S ST+T+ ++ + T T S P++ +T +
Sbjct: 588 GTGSSPTSQQPGIPSSSHSTPRSNSTSTTPLLTSAHPTGGENITEETPSVPSTTHVSTLS 647
Query: 2050 PKSESTTTNNPAS--ESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTT 2107
P TT+ + S TS P T P +T+ S+P+ + T + S
Sbjct: 648 PGPGPGTTSQVSGPGNSSTSRYPGEVHVTEGMPNPNATSPSAPSGQKTAVPTVTSTGGKA 707
Query: 2108 SSPESESTTTSSPASESTTIEEQGVSP 2134
+S E++ ++ AS S E
Sbjct: 708 NSTTKETSGSTLMASTSPHTNEGAFRT 734
Score = 82.9 bits (204), Expect = 1e-15
Identities = 59/271 (21%), Positives = 108/271 (39%), Gaps = 11/271 (4%)
Query: 1859 VAISVIDNYSEIIFTTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTS 1918
VA V D + II T N+ +T + + +TT + + P + +
Sbjct: 394 VANPVADAKTLIITRTATNATTTTHKVVFHK--APDTTKSVIFVYTLVHVEPHKTTAVPT 451
Query: 1919 SPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTS 1978
+P +T VS + TS + +T+++ PE S T+ + ++ T + S + +
Sbjct: 452 TPSLPPASTGPTVSTADPTSGTPTGTTSSTLPEDTSPTSRT----TSATPNATSPTPAVT 507
Query: 1979 SPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTS 2038
+P + S TT TS T + S T T+S + ++ ++ ES T+
Sbjct: 508 TPNATSPTTQKTSDTPNATSPTPIVIGVTTTATSPPTGTTSVPNATSPQVTEESPVNNTN 567
Query: 2039 SPASESTTTNNPKSESTTTNNPASESITSSS--PASESTTTSSPASESTTTSSPASESTT 2096
+P S + + +T + S + P+S S+P S ST+T+ + +
Sbjct: 568 TPVVTSAPSVLTSAVTTGQHGTGSSPTSQQPGIPSSSH---STPRSNSTSTTPLLTSAHP 624
Query: 2097 TSSPASESTTTSSPESESTTTSSPASESTTI 2127
T T S P + +T SP T
Sbjct: 625 TGGENITEETPSVPSTTHVSTLSPGPGPGTT 655
Score = 80.6 bits (198), Expect = 7e-15
Identities = 58/262 (22%), Positives = 102/262 (38%), Gaps = 6/262 (2%)
Query: 1873 TTNNNSESTVVMSTLNSLLSENT---TTNSPESESTTTNNPESESTTTSSPESESTTTSS 1929
TT N + T ++ + T + + S T + T+ ES ++
Sbjct: 507 TTPNATSPTTQKTSDTPNATSPTPIVIGVTTTATSPPTGTTSVPNATSPQVTEESPVNNT 566
Query: 1930 LVSESTTTSSPESESTTTSSPESESTTTSSLVS-ESTTTSSPESESTTTSSPESESTTTS 1988
T+ S + + TT + S+ TS S++ S+P S ST+T+ + + T
Sbjct: 567 NTPVVTSAPSVLTSAVTTGQHGTGSSPTSQQPGIPSSSHSTPRSNSTSTTPLLTSAHPTG 626
Query: 1989 SLVSESTTTSSPESESTTTISPVSESTTTS--SPVSESTTTISPESESTTTSSPASESTT 2046
T S P + +T+SP TTS S S+T+ P T P +T+
Sbjct: 627 GENITEETPSVPSTTHVSTLSPGPGPGTTSQVSGPGNSSTSRYPGEVHVTEGMPNPNATS 686
Query: 2047 TNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTT 2106
+ P + T S ++S E++ ++ AS S T+ A +T ++ +
Sbjct: 687 PSAPSGQKTAVPTVTSTGGKANSTTKETSGSTLMASTSPHTNEGAFRTTPYNATTYLPPS 746
Query: 2107 TSSPESESTTTSSPASESTTIE 2128
TSS T +SP +
Sbjct: 747 TSSKLRPRWTFTSPPVTTKQAT 768
Score = 79.8 bits (196), Expect = 1e-14
Identities = 62/257 (24%), Positives = 103/257 (40%), Gaps = 9/257 (3%)
Query: 1873 TTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTT-SSPESESTTTSSLV 1931
+ N+ S T + S T S +T+ TTT +SP + +T+ +
Sbjct: 494 SATPNATSPTPAVTTPNATSPTTQKTSDTPNATSPTPIVIGVTTTATSPPTGTTSVPNAT 553
Query: 1932 SESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLV 1991
S T SP + + T + S TS++ + T S S T+ P S++ S+
Sbjct: 554 SPQVTEESPVNNTNTPVVTSAPSVLTSAVTTGQHGTGS----SPTSQQPGIPSSSHSTPR 609
Query: 1992 SESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTS--SPASESTTTNN 2049
S ST+T+ + + T T S P + +T+SP TTS S S+T+
Sbjct: 610 SNSTSTTPLLTSAHPTGGENITEETPSVPSTTHVSTLSPGPGPGTTSQVSGPGNSSTSRY 669
Query: 2050 PKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSS 2109
P T P + + S+P+ + T P ST + ++ T+ S ST+ +
Sbjct: 670 PGEVHVTEGMPNPNATSPSAPSGQKTAV--PTVTSTGGKANSTTKETSGSTLMASTSPHT 727
Query: 2110 PESESTTTSSPASESTT 2126
E TT A+
Sbjct: 728 NEGAFRTTPYNATTYLP 744
Score = 72.1 bits (176), Expect = 3e-12
Identities = 48/239 (20%), Positives = 90/239 (37%), Gaps = 8/239 (3%)
Query: 1870 IIFTTNNNSESTVVMSTLNSLLSENTTTNSPESEST--TTNNPESESTTTSSPESESTTT 1927
+I T + +++ + S T SP + + + S T+ + T +
Sbjct: 532 VIGVTTTATSPPTGTTSVPNATSPQVTEESPVNNTNTPVVTSAPSVLTSAVTTGQHGTGS 591
Query: 1928 SSLVSES----TTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESE 1983
S + ++ S+P S ST+T+ + + T T S P + +T SP
Sbjct: 592 SPTSQQPGIPSSSHSTPRSNSTSTTPLLTSAHPTGGENITEETPSVPSTTHVSTLSPGPG 651
Query: 1984 STTTSSLVSESTTTSSPESEST--TTISPVSESTTTSSPVSESTTTISPESESTTTSSPA 2041
TTS + +++S T P +T+ S+P + T + S +S
Sbjct: 652 PGTTSQVSGPGNSSTSRYPGEVHVTEGMPNPNATSPSAPSGQKTAVPTVTSTGGKANSTT 711
Query: 2042 SESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSP 2100
E++ + S S TN A + ++ +TSS T +SP + + P
Sbjct: 712 KETSGSTLMASTSPHTNEGAFRTTPYNATTYLPPSTSSKLRPRWTFTSPPVTTKQATVP 770
Score = 64.4 bits (156), Expect = 7e-10
Identities = 55/246 (22%), Positives = 92/246 (37%), Gaps = 10/246 (4%)
Query: 1845 ITNNLLISMLAATAVAISVIDNYSEIIFTTNNNSESTVVMSTLNSLLSENTTTNSPESES 1904
+T T+V + +E + NN+ + VV S + L S TT S
Sbjct: 535 VTTTATSPPTGTTSVPNATSPQVTEE--SPVNNTNTPVVTSAPSVLTSAVTTGQHGTGSS 592
Query: 1905 TTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSES 1964
T+ P S++ S+P S ST+T+ L++ + T T S P + +T S
Sbjct: 593 PTSQQPGIPSSSHSTPRSNSTSTTPLLTSAHPTGGENITEETPSVPSTTHVSTLSPGPGP 652
Query: 1965 TTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSES 2024
TTS +++S T P +T+ +P + T + S
Sbjct: 653 GTTSQVSGPGNSSTSRYPGEV--------HVTEGMPNPNATSPSAPSGQKTAVPTVTSTG 704
Query: 2025 TTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASES 2084
S E++ ++ AS S TN +T N ++SS T +SP +
Sbjct: 705 GKANSTTKETSGSTLMASTSPHTNEGAFRTTPYNATTYLPPSTSSKLRPRWTFTSPPVTT 764
Query: 2085 TTTSSP 2090
+ P
Sbjct: 765 KQATVP 770
Score = 48.2 bits (114), Expect = 6e-05
Identities = 33/189 (17%), Positives = 68/189 (35%), Gaps = 11/189 (5%)
Query: 1981 ESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSP 2040
+ +L+ T T++ + + TT S + + P TT+ P
Sbjct: 395 ANPVADAKTLIITRTATNATTTTHKVVFHKAPD-TTKSVIFVYTLVHVEP---HKTTAVP 450
Query: 2041 ASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSP 2100
+ S + +T +P S + T ++ S +SP S +T+ + A+ T +
Sbjct: 451 TTPSLPPA-STGPTVSTADPTSGTPTGTTS-STLPEDTSPTSRTTSATPNATSPTPAVTT 508
Query: 2101 ASESTTTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSANEDP-----EEFPNEDVFE 2155
+ ++ T+ S++ +SP + SP + S EE P +
Sbjct: 509 PNATSPTTQKTSDTPNATSPTPIVIGVTTTATSPPTGTTSVPNATSPQVTEESPVNNTNT 568
Query: 2156 HTFAEIPNI 2164
P++
Sbjct: 569 PVVTSAPSV 577
Score = 47.5 bits (112), Expect = 1e-04
Identities = 45/223 (20%), Positives = 83/223 (37%), Gaps = 13/223 (5%)
Query: 1840 SVSPYITNNLLISMLAATAVAISVIDNYSEIIFTTNNNSESTVVMSTLNSLLSENTTTNS 1899
SP NN ++ + ++ + ++ S+ + S+ +S N+T+ +
Sbjct: 559 EESP--VNNTNTPVVTSAPSVLTSAVTTGQHGTGSSPTSQQPGIPSSSHSTPRSNSTSTT 616
Query: 1900 PESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTS--SPESESTTT 1957
P T+ + E+ T +P STT +T SP TTS S S+T+
Sbjct: 617 P--LLTSAHPTGGENITEETPSVPSTT-------HVSTLSPGPGPGTTSQVSGPGNSSTS 667
Query: 1958 SSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTT 2017
T P +T+ S+P + T ++ S +S E++ + S S T
Sbjct: 668 RYPGEVHVTEGMPNPNATSPSAPSGQKTAVPTVTSTGGKANSTTKETSGSTLMASTSPHT 727
Query: 2018 SSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNP 2060
+ +T + +TSS T +P + P
Sbjct: 728 NEGAFRTTPYNATTYLPPSTSSKLRPRWTFTSPPVTTKQATVP 770
>gnl|CDD|223039 PHA03307, PHA03307, transcriptional regulator ICP4; Provisional.
Length = 1352
Score = 85.6 bits (212), Expect = 3e-16
Identities = 44/234 (18%), Positives = 81/234 (34%), Gaps = 9/234 (3%)
Query: 1900 PESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSS 1959
+ + S SS ++ S E+ S S+P + ++
Sbjct: 150 ASPPAAGASPAAVASDAASSRQAALP--LSSPEETARAPSSPPAEPPPSTPPAAASPRPP 207
Query: 1960 LVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSS 2019
S + S+ S ++ +SS S S ++ P T +
Sbjct: 208 RRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENECPLPRPAPITLPT 267
Query: 2020 PVSESTTTISPESESTTTSSPASES-----TTTNNPKSESTTTNNPASESITSSSPASES 2074
+ E++ P S SS +S + ++P S ++ AS S +SS +S S
Sbjct: 268 RIWEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSSRESSSS 327
Query: 2075 TTTSSPASESTTTSSPASESTT--TSSPASESTTTSSPESESTTTSSPASESTT 2126
+T+SS S SP + + S SSP + +P+S + +
Sbjct: 328 STSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSSPAAS 381
Score = 72.9 bits (179), Expect = 2e-12
Identities = 50/250 (20%), Positives = 87/250 (34%), Gaps = 27/250 (10%)
Query: 1892 SENTTTNSPESESTTTNN---------PESESTTTSSPESE---STTTSSLVSESTTTSS 1939
+ S ++ PE + SSP +E ST ++ SS
Sbjct: 152 PPAAGASPAAVASDAASSRQAALPLSSPEETARAPSSPPAEPPPSTPPAAASPRPPRRSS 211
Query: 1940 PESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESE----STTTSSLVSEST 1995
P S S ++ +P + + S+ +SS ES S PE+E +L +
Sbjct: 212 PISASASSPAPAPGRSAADDAGASSSDSSSSES-SGCGWGPENECPLPRPAPITLPTRIW 270
Query: 1996 TTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSEST 2055
S S+ P S S++ SP ++ S + S+ + S S+
Sbjct: 271 EASGWNGPSSRP-GPASSSSSPRER--------SPSPSPSSPGSGPAPSSPRASSSSSSS 321
Query: 2056 TTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESEST 2115
++ +S S +S S + + S S + S P SSP + +P S +
Sbjct: 322 RESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRP-PPPADPSSPRKRPRPSRAPSSPAA 380
Query: 2116 TTSSPASEST 2125
+ P
Sbjct: 381 SAGRPTRRRA 390
Score = 71.0 bits (174), Expect = 7e-12
Identities = 55/282 (19%), Positives = 82/282 (29%), Gaps = 42/282 (14%)
Query: 1897 TNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTT--TSSPESES 1954
P T +T + S S S T P S T P S
Sbjct: 68 PTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPP 127
Query: 1955 TTTSSLVSE-------STTTSSPESESTTTSSPESESTTTSS--------LVSESTTTSS 1999
+ + +SE + + S S SS E+ S
Sbjct: 128 PSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPEETARAPS 187
Query: 2000 PESESTTTI--------------SPVSESTTTSSPVSESTTTISPESESTTTSSPAS--- 2042
SP+S S ++ +P + + S+ +SS S
Sbjct: 188 SPPAEPPPSTPPAAASPRPPRRSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGC 247
Query: 2043 -ESTTTNNPKSESTTTNNPAS--ESITSSSPASESTTTSSPASES-----TTTSSPASES 2094
P P E+ + P+S SS +S + SSP S
Sbjct: 248 GWGPENECPLPRPAPITLPTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGP 307
Query: 2095 TTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVSPHS 2136
+S AS S+++S S S+T+SS S G SP
Sbjct: 308 APSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSR 349
Score = 63.3 bits (154), Expect = 2e-09
Identities = 46/241 (19%), Positives = 81/241 (33%), Gaps = 20/241 (8%)
Query: 1910 PESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSS 1969
P S S ++ +P + + S+ +SS ES S PE+E T +
Sbjct: 212 PISASASSPAPAPGRSAADDAGASSSDSSSSES-SGCGWGPENE---CPLPRPAPITLPT 267
Query: 1970 PESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTIS 2029
E++ + P S SS SS E SP ++ S + S+ S
Sbjct: 268 RIWEASGWNGPSSRPGPASS--------SSSPRER----SPSPSPSSPGSGPAPSSPRAS 315
Query: 2030 PESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSS 2089
S S+ SS +S S+++ + + + + S S + S P + +S S
Sbjct: 316 SSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPRKRP---RPS 372
Query: 2090 PASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSANEDPEEFP 2149
A S S+ + + A+ SP ++ +P
Sbjct: 373 RAPSSPAASAGRPTRRRARAAVAGRARRRD-ATGRFPAGRPRPSPLDAGAASGAFYARYP 431
Query: 2150 N 2150
Sbjct: 432 L 432
Score = 55.6 bits (134), Expect = 4e-07
Identities = 40/231 (17%), Positives = 76/231 (32%), Gaps = 17/231 (7%)
Query: 1898 NSPESESTTTNNPESESTTTSSPESESTTTSSLVS-------ESTTTSSPESESTTTSSP 1950
+SP S S ++ P + + S+ +SS S E+ + T +
Sbjct: 210 SSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPENECPLPRPAPITLPTRI 269
Query: 1951 ESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISP 2010
S S SS S SP ++ S + S+ +S S S+ S
Sbjct: 270 WEASGWNGP-SSRPGPASSSSSPR--ERSPSPSPSSPGSGPAPSSPRASSSSSSSRESS- 325
Query: 2011 VSESTTTSSPVSEST-TTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSS 2069
S ST++SS S + P + + S ++P+ + P+S + ++
Sbjct: 326 -SSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSSPAASAGR 384
Query: 2070 PASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSP 2120
P ++ A + + + S + S + P
Sbjct: 385 PTRR-RARAAVAGRARRRDATGR---FPAGRPRPSPLDAGAASGAFYARYP 431
Score = 53.6 bits (129), Expect = 2e-06
Identities = 35/189 (18%), Positives = 60/189 (31%), Gaps = 7/189 (3%)
Query: 1892 SENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPE 1951
P T E++ + P S SS S S SP ++ S
Sbjct: 250 GPENECPLPRPAPITLPTRIWEASGWNGPSSRPGPASS--SSSPRERSPSPSPSSPGSGP 307
Query: 1952 SESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPV 2011
+ S+ +S S S+ SS S S+++ S + + S S + S P SP
Sbjct: 308 APSSPRASSSSSSSRESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRP-PPPADPSSPR 366
Query: 2012 SESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPA 2071
+ +P S + + P ++ A + + S + A
Sbjct: 367 KRPRPSRAPSSPAASAGRPTRR-RARAAVAGRARRRDAT---GRFPAGRPRPSPLDAGAA 422
Query: 2072 SESTTTSSP 2080
S + P
Sbjct: 423 SGAFYARYP 431
Score = 53.3 bits (128), Expect = 2e-06
Identities = 42/244 (17%), Positives = 80/244 (32%), Gaps = 15/244 (6%)
Query: 1897 TNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTT 1956
P + ++ S S +S ++V+ + E + P +E+
Sbjct: 22 PRPPATPGDAADDLLSGSQGQLVSDSAELAAVTVVAGAAACDRFEPPTGPPPGPGTEAPA 81
Query: 1957 ----TSSLVSESTTT--SSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISP 2010
++ S ST S S T P S + S SP + + + P
Sbjct: 82 NESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASPP-PSPAPDLSEMLRP 140
Query: 2011 VSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSP 2070
V + + S SS + + ++P+ + ++P +E S+ P
Sbjct: 141 VGSPGPPPAASPPAAGASPAAVASDAASSRQA-ALPLSSPEETARAPSSPPAEPPPSTPP 199
Query: 2071 ASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQ 2130
A+ + P S+ S+ AS PA + + S+ +SS S +
Sbjct: 200 AA---ASPRPPRRSSPISASASSPA----PAPGRSAADDAGASSSDSSSSESSGCGWGPE 252
Query: 2131 GVSP 2134
P
Sbjct: 253 NECP 256
Score = 51.3 bits (123), Expect = 7e-06
Identities = 30/175 (17%), Positives = 56/175 (32%), Gaps = 13/175 (7%)
Query: 1884 MSTLNSLLSENTTTNSPESESTTTNN--------PESESTTTSSPESESTTTSSLVSEST 1935
TL + + E + N P S ++ P ++ S + S+ +S S S+
Sbjct: 262 PITLPTRIWEASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRASSSSSSS 321
Query: 1936 TTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSEST 1995
SS S S+++ S + + S S + S P + +S + S S
Sbjct: 322 RESSSSSTSSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPRKRP---RPSRAPSSP 378
Query: 1996 TTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNP 2050
S+ + V+ + S + AS + P
Sbjct: 379 AASAGRPTRRRARAAVAGRARRRD--ATGRFPAGRPRPSPLDAGAASGAFYARYP 431
Score = 48.2 bits (115), Expect = 7e-05
Identities = 45/225 (20%), Positives = 73/225 (32%), Gaps = 21/225 (9%)
Query: 1930 LVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSS 1989
LVS+S ++ + + E T T + +T + S S
Sbjct: 43 LVSDSAELAAVTVVAGAAACDRFEPPTGPP--PGPGTEAPANESRSTPTWSLSTLAPASP 100
Query: 1990 LVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNN 2049
S T P S +P S SP + + + P + + + +
Sbjct: 101 AREGSPTPPGPSSPDPPPPTPPPASPP-PSPAPDLSEMLRPVGSPGPPPAASPPAAGASP 159
Query: 2050 PKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTT--- 2106
S +S A SSP + SSP +E ++ PA+ S
Sbjct: 160 AAVASDAA--------SSRQAALPL---SSPEETARAPSSPPAEPPPSTPPAAASPRPPR 208
Query: 2107 TSSPESESTTTSSPASESTTIEEQGVSP----HSEKLSANEDPEE 2147
SSP S S ++ +PA + ++ G S SE PE
Sbjct: 209 RSSPISASASSPAPAPGRSAADDAGASSSDSSSSESSGCGWGPEN 253
>gnl|CDD|236304 PRK08581, PRK08581, N-acetylmuramoyl-L-alanine amidase; Validated.
Length = 619
Score = 82.1 bits (203), Expect = 2e-15
Identities = 42/301 (13%), Positives = 99/301 (32%), Gaps = 21/301 (6%)
Query: 1881 TVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESE----STTTSSLVSESTT 1936
T + + ++T + + ++ S+ T++ + ++ + + +T
Sbjct: 21 TSPTAYADDPQKDSTAKTTSHDSKKSNDDETSKDTSSKDTDKADNNNTSNQDNNDKKFST 80
Query: 1937 TSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTT 1996
S S+S ++ +++ T ++ S TT + + E
Sbjct: 81 IDSSTSDSNNIIDFIYKNLPQTNINQLLTKNKYDDNYSLTTLIQNLFNLNSDISDYEQPR 140
Query: 1997 TSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTT 2056
S + + S S T + S+ + ++ S+ + P++ + N+PK
Sbjct: 141 NSEKSTNDSNKNSDSSIKNDTDTQSSKQDKADNQKAPSSNNTKPSTSNKQPNSPKPTQPN 200
Query: 2057 TNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTT 2116
+N S T + +S S S + SE + + S +
Sbjct: 201 QSNSQPAS---------DDTANQKSSSKDNQSMSDSALDSILDQYSEDAKKTQKDYASQS 251
Query: 2117 TSSPASESTTIEEQGVSPHSEKLSANEDPEEFPNEDVFE------HTFAEIPNIDHSNQT 2170
S T Q P ++L P + DV + F P++ +++ +
Sbjct: 252 KKDKTETSNTKNPQ--LPTQDELKHKSKPAQSFENDVNQSNTRSTSLFETGPSLSNNDDS 309
Query: 2171 D 2171
Sbjct: 310 G 310
Score = 79.1 bits (195), Expect = 1e-14
Identities = 40/258 (15%), Positives = 90/258 (34%), Gaps = 4/258 (1%)
Query: 1904 STTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSE 1963
T T+ +++T+ S S TS S T + + ++ + +
Sbjct: 18 PTLTSPTAYADDPQKDSTAKTTSHDSKKSNDDETSKDTSSKDTDKADNNNTSNQDNNDKK 77
Query: 1964 STTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSE 2023
+T S S+S ++ +++ T ++ S TT+ + + E
Sbjct: 78 FSTIDSSTSDSNNIIDFIYKNLPQTNINQLLTKNKYDDNYSLTTLIQNLFNLNSDISDYE 137
Query: 2024 STTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASE 2083
+ + +S +S T+ S+ +N + S ++ P++ + +SP
Sbjct: 138 QPRNSEKSTNDSNKNSDSSIKNDTDTQSSKQDKADNQKAPSSNNTKPSTSNKQPNSPKPT 197
Query: 2084 STTTSSPASESTTTS-SPASESTTTSSPESESTTTSSPASESTTIEEQ---GVSPHSEKL 2139
S+ S T+ +S S +S + SE ++ S +
Sbjct: 198 QPNQSNSQPASDDTANQKSSSKDNQSMSDSALDSILDQYSEDAKKTQKDYASQSKKDKTE 257
Query: 2140 SANEDPEEFPNEDVFEHT 2157
++N + P +D +H
Sbjct: 258 TSNTKNPQLPTQDELKHK 275
Score = 66.7 bits (163), Expect = 1e-10
Identities = 36/231 (15%), Positives = 84/231 (36%), Gaps = 16/231 (6%)
Query: 1957 TSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTT 2016
+++LV + T+ + P+ +ST ++ + S+ T++
Sbjct: 12 STTLVLPTLTSPT-----AYADDPQKDSTAKTTSHDSKKSNDDETSKDTSSKDTDKADNN 66
Query: 2017 TSSPVSESTTTISPESESTTTSSPASE-------STTTNNPKSESTTTNNPASESITSSS 2069
+S + S ST+ S+ + T N +++ +N + ++ +
Sbjct: 67 NTSNQDNNDKKFSTIDSSTSDSNNIIDFIYKNLPQTNINQLLTKNKYDDNYSLTTLIQNL 126
Query: 2070 PASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEE 2129
S + ++ S+ S + SS +++ T SS + ++ +P+S +T
Sbjct: 127 FNLNSDISDYEQPRNSEKSTNDSNKNSDSSIKNDTDTQSSKQDKADNQKAPSSNNTKPST 186
Query: 2130 QGVSPHSEKLSANEDPEEFPNEDVFEHTFAEIPNIDHSN-QTDEAIPETFD 2179
P+S K + P D T + + + +D A+ D
Sbjct: 187 SNKQPNSPKPTQPNQSNSQPASD---DTANQKSSSKDNQSMSDSALDSILD 234
Score = 41.3 bits (97), Expect = 0.008
Identities = 47/292 (16%), Positives = 95/292 (32%), Gaps = 46/292 (15%)
Query: 1765 NSVSPNVTSKILTTDNYSEIIFTTNNNSESTVVMSTLNSLLSENEKLFKPHAKTPGAEFL 1824
S N K T D+ + +NN + +K +T + L
Sbjct: 68 TSNQDNNDKKFSTIDSSTS---DSNNIIDFI----------------YKNLPQTNINQLL 108
Query: 1825 IQCQYCDFDSSMNLLSVSPYITNNLLISMLAATAVAISVIDNYSEIIFTTNNNSESTVVM 1884
+ +Y D S L+ NL ++ + S+ N+ +
Sbjct: 109 TKNKYDDNYSLTTLI-------QNLF-----------NLNSDISDYEQPRNSEKSTNDSN 150
Query: 1885 STLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESES 1944
+S + +T T S + + S + T S ++ + + + S P S+
Sbjct: 151 KNSDSSIKNDTDTQSSKQDKADNQKAPSSNNTKPSTSNKQPNSPKPTQPNQSNSQPASDD 210
Query: 1945 TT---TSSPESESTTTSSLVS--ESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSS 1999
T +SS +++S + S+L S + + + +++ S + + T TS+ + T
Sbjct: 211 TANQKSSSKDNQSMSDSALDSILDQYSEDAKKTQKDYASQSKKDKTETSNTKNPQLPTQD 270
Query: 2000 PESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSP----ASESTTT 2047
+ S+ S S P + S S+ T
Sbjct: 271 ELKHKSKPAQSFENDVNQSNTRSTSLFETGPSLSNNDDSGSFNVVDSKDTRQ 322
>gnl|CDD|239112 cd02621, Peptidase_C1A_CathepsinC, Cathepsin C; also known as
Dipeptidyl Peptidase I (DPPI), an atypical papain-like
cysteine peptidase with chloride dependency and
dipeptidyl aminopeptidase activity, resulting from its
tetrameric structure which limits substrate access. Each
subunit of the tetramer is composed of three peptides:
the heavy and light chains, which together adopts the
papain fold and forms the catalytic domain; and the
residual propeptide region, which forms a beta barrel and
points towards the substrate's N-terminus. The subunit
composition is the result of the unique characteristic of
procathepsin C maturation involving the cleavage of the
catalytic domain and the non-autocatalytic excision of an
activation peptide within its propeptide region. By
removing N-terminal dipeptide extensions, cathepsin C
activates granule serine peptidases (granzymes) involved
in cell-mediated apoptosis, inflammation and tissue
remodelling. Loss-of-function mutations in cathepsin C
are associated with Papillon-Lefevre and Haim-Munk
syndromes, rare diseases characterized by hyperkeratosis
and early-onset periodontitis. Cathepsin C is widely
expressed in many tissues with high levels in lung,
kidney and placenta. It is also highly expressed in
cytotoxic lymphocytes and mature myeloid cells.
Length = 243
Score = 72.8 bits (179), Expect = 1e-13
Identities = 70/289 (24%), Positives = 92/289 (31%), Gaps = 107/289 (37%)
Query: 2175 PETFDAREEWPQCKDVIGKVWDQGACQSCWVSHQPRTAGLKGLFSFIKYGQGQERTLSVW 2234
P++FD + V V +QG C SC+
Sbjct: 2 PKSFDWGDVNNGFNYVSP-VRNQGGCGSCY------------------------------ 30
Query: 2235 DKAISAASVMSDRICIQS-----KGQVKPILSPQH-LICS-------------------- 2268
A ++ + RI I S GQ +PILSPQH L CS
Sbjct: 31 --AFASVYALEARIMIASNKTDPLGQ-QPILSPQHVLSCSQYSQGCDGGFPFLVGKFAED 87
Query: 2269 ------------------CT----NCTRMHTKT--PMSMCMGGDSAAAWMYW--INAGLV 2302
C C R + + C G + M W G +
Sbjct: 88 FGIVTEDYFPYTADDDRPCKASPSECRRYYFSDYNYVGGCYGCTNEDE-MKWEIYRNGPI 146
Query: 2303 DGGDYGTHDVSMGRYIEGIGHAAS---VMGSSNPEVNNFEKVIRLYSCEGSINPRYIHSV 2359
D Y EG+ H V N N FE N H+V
Sbjct: 147 VVAFEVYSDFDF--YKEGVYHHTDNDEVSDGDNDNFNPFELT----------N----HAV 190
Query: 2360 KIIGWGKSSQN-EPYWLCTNSYNQGWGEQGLFKIRRGVNMCSIEDSVMA 2407
++GWG+ E YW+ NS+ WGE+G FKIRRG N C IE +
Sbjct: 191 LLVGWGEDEIKGEKYWIVKNSWGSSWGEKGYFKIRRGTNECGIESQAVF 239
>gnl|CDD|239149 cd02698, Peptidase_C1A_CathepsinX, Cathepsin X; the only papain-like
lysosomal cysteine peptidase exhibiting
carboxymonopeptidase activity. It can also act as a
carboxydipeptidase, like cathepsin B, but has been shown
to preferentially cleave substrates through a
monopeptidyl carboxypeptidase pathway. The propeptide
region of cathepsin X, the shortest among papain-like
peptidases, is covalently attached to the active site
cysteine in the inactive form of the enzyme. Little is
known about the biological function of cathepsin X. Some
studies point to a role in early tumorigenesis. A more
recent study indicates that cathepsin X expression is
restricted to immune cells suggesting a role in
phagocytosis and the regulation of the immune response.
Length = 239
Score = 72.8 bits (179), Expect = 1e-13
Identities = 49/209 (23%), Positives = 71/209 (33%), Gaps = 63/209 (30%)
Query: 2232 SVWDKAISAASVMSDRICIQSKGQVKPI-LSPQHLI-CSCTNCTRMHTKTPMSMCMGGDS 2289
S W A + S ++DRI I KG + LS Q +I C+ C GGD
Sbjct: 30 SCW--AHGSTSALADRINIARKGAWPSVYLSVQVVIDCAGGGS-----------CHGGDP 76
Query: 2290 AAAWMYWINAGLVDG--------------------------------------GDYGTHD 2311
+ Y G+ D DYG+
Sbjct: 77 GGVYEYAHKHGIPDETCNPYQAKDGECNPFNRCGTCNPFGECFAIKNYTLYFVSDYGS-- 134
Query: 2312 VS----MGRYIEGIGHAASVMGSSNPEVNNFEKVIRLYSCEGSINPRYIHSVKIIGWGKS 2367
VS M I G + + ++ N V + Y + IN H + + GWG
Sbjct: 135 VSGRDKMMAEIYARGPISCGIMATEALENYTGGVYKEYVQDPLIN----HIISVAGWGVD 190
Query: 2368 SQNEPYWLCTNSYNQGWGEQGLFKIRRGV 2396
YW+ NS+ + WGE+G F+I
Sbjct: 191 ENGVEYWIVRNSWGEPWGERGWFRIVTSS 219
>gnl|CDD|177776 PLN00181, PLN00181, protein SPA1-RELATED; Provisional.
Length = 793
Score = 73.6 bits (180), Expect = 9e-13
Identities = 63/249 (25%), Positives = 118/249 (47%), Gaps = 20/249 (8%)
Query: 373 DGQYIASSGYDRQIFIWSVYGECENI-----------GVMSGHTGAVMDLKFSTDGCHIF 421
DG++ A++G +++I I+ ECE+I ++ + S +
Sbjct: 494 DGEFFATAGVNKKIKIF----ECESIIKDGRDIHYPVVELASRSKLSGICWNSYIKSQVA 549
Query: 422 TCSTDQTLAVWDLEKGQRIKKMKGHSTFVNSCDPVRRGQLLIASGSDDCTVKVWDPRKKN 481
+ + + + VWD+ + Q + +MK H V S D L+ASGSDD +VK+W +
Sbjct: 550 SSNFEGVVQVWDVARSQLVTEMKEHEKRVWSIDYSSADPTLLASGSDDGSVKLWSINQGV 609
Query: 482 QAVSMNNTYQVTSVAF-NDTAECVLTGGIDNDIKMWDLRTNSV-VQKLRGHSDTVTGLSL 539
++ + V F +++ + G D+ + +DLR + + + GHS TV+ +
Sbjct: 610 SIGTIKTKANICCVQFPSESGRSLAFGSADHKVYYYDLRNPKLPLCTMIGHSKTVSYVRF 669
Query: 540 SPDGSYILSNAMDNTVRIWDIRPYVPGERCVKVMSGHQHNFEKNLLRCAWSVSGLYVTAG 599
D S ++S++ DNT+++WD+ + G + S H KN + SVS Y+ G
Sbjct: 670 V-DSSTLVSSSTDNTLKLWDLSMSISGINETPLHSFMGHTNVKNFV--GLSVSDGYIATG 726
Query: 600 SADKCVYIW 608
S V+++
Sbjct: 727 SETNEVFVY 735
Score = 37.4 bits (86), Expect = 0.11
Identities = 46/211 (21%), Positives = 95/211 (45%), Gaps = 24/211 (11%)
Query: 994 TGGGDKSVKLWQLE-LVSVNREADEETKDVSRSHKVLSLLHTRTLKLEEQVLCARVSPDS 1052
T G +K +K+++ E ++ R+ +++ K+ + +K +
Sbjct: 500 TAGVNKKIKIFECESIIKDGRDIHYPVVELASRSKLSGICWNSYIKSQ------------ 547
Query: 1053 KLLAVSLLDTTVKIFFLDTFKFFISLYGHKLPVLSLDMSY-DSTLIATGSGDRTVKVWGL 1111
+A S + V+++ + + + H+ V S+D S D TL+A+GS D +VK+W +
Sbjct: 548 --VASSNFEGVVQVWDVARSQLVTEMKEHEKRVWSIDYSSADPTLLASGSDDGSVKLWSI 605
Query: 1112 DYGDCHKSLLAHEDSVTGVTFVPKTHYFFT-TSKDGRVKQWDADNFERIVTLHFFISLYG 1170
+ G + + ++ V F ++ S D +V +D N + + ++ G
Sbjct: 606 NQG-VSIGTIKTKANICCVQFPSESGRSLAFGSADHKVYYYDLRNPKLPLC-----TMIG 659
Query: 1171 HKLPVLSLDMSYDSTLIATGSGDRTVKVWGL 1201
H V + STL+++ S D T+K+W L
Sbjct: 660 HSKTVSYVRFVDSSTLVSS-STDNTLKLWDL 689
Score = 35.8 bits (82), Expect = 0.39
Identities = 20/66 (30%), Positives = 30/66 (45%), Gaps = 1/66 (1%)
Query: 176 VVSSAKDTFVKIWDADTGDCFKTMAAHLTEVWGVCVMREDSYLI-SGSNDAELKVWNVRD 234
V SS + V++WD M H VW + D L+ SGS+D +K+W++
Sbjct: 548 VASSNFEGVVQVWDVARSQLVTEMKEHEKRVWSIDYSSADPTLLASGSDDGSVKLWSINQ 607
Query: 235 RSDIDT 240
I T
Sbjct: 608 GVSIGT 613
Score = 33.9 bits (77), Expect = 1.2
Identities = 27/123 (21%), Positives = 58/123 (47%), Gaps = 16/123 (13%)
Query: 1268 TGGGDKSVKLWQLE-LVSVNREADEETKDVSRSHKVLSLLHTRTLKLEEQVLCARVSPDS 1326
T G +K +K+++ E ++ R+ +++ K+ + +K +
Sbjct: 500 TAGVNKKIKIFECESIIKDGRDIHYPVVELASRSKLSGICWNSYIKSQ------------ 547
Query: 1327 KLLAVSLLDTTVKIFFLDTFKFFISLYGHKLPVLSLDMSY-DSTLIATGSGDRTVKVWGL 1385
+A S + V+++ + + + H+ V S+D S D TL+A+GS D +VK+W +
Sbjct: 548 --VASSNFEGVVQVWDVARSQLVTEMKEHEKRVWSIDYSSADPTLLASGSDDGSVKLWSI 605
Query: 1386 DYG 1388
+ G
Sbjct: 606 NQG 608
Score = 33.5 bits (76), Expect = 1.7
Identities = 22/61 (36%), Positives = 35/61 (57%), Gaps = 7/61 (11%)
Query: 1145 DGRVKQWDADNFERIVTLHFFISLYGHKLPVLSLDMSY-DSTLIATGSGDRTVKVWGLDY 1203
+G V+ WD ++VT + H+ V S+D S D TL+A+GS D +VK+W ++
Sbjct: 554 EGVVQVWDVAR-SQLVT-----EMKEHEKRVWSIDYSSADPTLLASGSDDGSVKLWSINQ 607
Query: 1204 G 1204
G
Sbjct: 608 G 608
Score = 32.4 bits (73), Expect = 4.1
Identities = 39/149 (26%), Positives = 66/149 (44%), Gaps = 9/149 (6%)
Query: 88 SQLAVAYTNGSLKTFSLDTTDVISTFTGHKSAITVIQY---DPLGHRLATGSKDTDIVLW 144
SQ+A + G ++ + + + +++ H+ + I Y DP LA+GS D + LW
Sbjct: 546 SQVASSNFEGVVQVWDVARSQLVTEMKEHEKRVWSIDYSSADPT--LLASGSDDGSVKLW 603
Query: 145 DVVAECGLHRLSGHKGVITDIRFMSQPGHHFVVSSAKDTFVKIWDADTGDC-FKTMAAHL 203
+ + + K I ++F S+ G SA D V +D TM H
Sbjct: 604 SINQGVSIGTIKT-KANICCVQFPSESGRSLAFGSA-DHKVYYYDLRNPKLPLCTMIGHS 661
Query: 204 TEVWGVCVMREDSYLISGSNDAELKVWNV 232
V V + + S L+S S D LK+W++
Sbjct: 662 KTVSYVRFV-DSSTLVSSSTDNTLKLWDL 689
>gnl|CDD|237555 PRK13914, PRK13914, invasion associated secreted endopeptidase;
Provisional.
Length = 481
Score = 69.8 bits (170), Expect = 1e-11
Identities = 44/229 (19%), Positives = 100/229 (43%), Gaps = 17/229 (7%)
Query: 1914 STTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESE 1973
T + E+TT + + T T ++ TT +P+ T + +V ++ TT + +S
Sbjct: 148 VAPTQEVKKETTTQQAAPAAETKTEVKQTTQATTPAPKVAETKETPVVDQNATTHAVKSG 207
Query: 1974 STTTSSPESESTTTSSLVSESTTTSSP--------ESESTTTISPVSESTTTSSPVSEST 2025
T + + ++S + +SS ++ T +P +E T + +
Sbjct: 208 DTIWALSVKYGVSVQDIMSWNNLSSSSIYVGQKLAIKQTANTATPKAEVKTEAPAAEKQA 267
Query: 2026 TTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSP-----ASESTTTSSP 2080
+ E+ +T T++ + TTT + T P + + +P A+++ T ++
Sbjct: 268 APVVKENTNTNTATTEKKETTTQ----QQTAPKAPTEAAKPAPAPSTNTNANKTNTNTNT 323
Query: 2081 ASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEE 2129
+ +T TS+P+ + T ++ + + + ++ S+ +S +S S I E
Sbjct: 324 NTNNTNTSTPSKNTNTNTNSNTNTNSNTNANQGSSNNNSNSSASAIIAE 372
Score = 66.0 bits (160), Expect = 1e-10
Identities = 43/220 (19%), Positives = 96/220 (43%), Gaps = 17/220 (7%)
Query: 1893 ENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPES 1952
E TT + + T T ++ TT +P+ T + +V ++ TT + +S T +
Sbjct: 157 ETTTQQAAPAAETKTEVKQTTQATTPAPKVAETKETPVVDQNATTHAVKSGDTIWALSVK 216
Query: 1953 ESTTTSSLVSESTTTSSP--------ESESTTTSSPESESTTTSSLVSESTTTSSPESES 2004
+ ++S + +SS ++ T++P++E T + + E+ +
Sbjct: 217 YGVSVQDIMSWNNLSSSSIYVGQKLAIKQTANTATPKAEVKTEAPAAEKQAAPVVKENTN 276
Query: 2005 TTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASES 2064
T T + + TTT + T +P T + PA +T N +++ T N + +
Sbjct: 277 TNTATTEKKETTTQ----QQTAPKAP----TEAAKPAPAPSTNTN-ANKTNTNTNTNTNN 327
Query: 2065 ITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASES 2104
+S+P+ + T ++ + + + ++ S+ +S +S S
Sbjct: 328 TNTSTPSKNTNTNTNSNTNTNSNTNANQGSSNNNSNSSAS 367
Score = 54.0 bits (129), Expect = 8e-07
Identities = 52/229 (22%), Positives = 103/229 (44%), Gaps = 38/229 (16%)
Query: 1947 TSSPESESTTTSSLVSESTTT-SSPESESTT---------TSSPESESTTTSSLVSESTT 1996
TS+P T + E+TT ++P +E+ T T +P+ T + +V ++ T
Sbjct: 144 TSTP---VAPTQEVKKETTTQQAAPAAETKTEVKQTTQATTPAPKVAETKETPVVDQNAT 200
Query: 1997 TSSPESEST------------------TTISPVSESTTTSSPVSESTTTISPESESTTTS 2038
T + +S T +S S + ++ T +P++E T
Sbjct: 201 THAVKSGDTIWALSVKYGVSVQDIMSWNNLSSSSIYVGQKLAIKQTANTATPKAE-VKTE 259
Query: 2039 SPASESTTTNNPKSESTTTNNPAS---ESITSSSPASESTTTSSPASESTTTSSPASEST 2095
+PA+E K E+T TN + E+ T A ++ T ++ + + +T++ A+++
Sbjct: 260 APAAEKQAAPVVK-ENTNTNTATTEKKETTTQQQTAPKAPTEAAKPAPAPSTNTNANKTN 318
Query: 2096 TTSSPASESTTTSSP--ESESTTTSSPASESTTIEEQGVSPHSEKLSAN 2142
T ++ + +T TS+P + + T S+ + S T QG S ++ SA+
Sbjct: 319 TNTNTNTNNTNTSTPSKNTNTNTNSNTNTNSNTNANQGSSNNNSNSSAS 367
Score = 51.3 bits (122), Expect = 6e-06
Identities = 34/189 (17%), Positives = 81/189 (42%), Gaps = 14/189 (7%)
Query: 1892 SENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSP--------ESE 1943
T ++ TT+ +S T + + ++S + +SS +
Sbjct: 186 VAETKETPVVDQNATTHAVKSGDTIWALSVKYGVSVQDIMSWNNLSSSSIYVGQKLAIKQ 245
Query: 1944 STTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESE 2003
+ T++P++E T + + E+ +T T++ E + TTT + T +P
Sbjct: 246 TANTATPKAEVKTEAPAAEKQAAPVVKENTNTNTATTEKKETTT----QQQTAPKAPTEA 301
Query: 2004 STTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASE 2063
+ +P + T ++ + +T T + + ++T S + +T +N + +T N +S
Sbjct: 302 AKP--APAPSTNTNANKTNTNTNTNTNNTNTSTPSKNTNTNTNSNTNTNSNTNANQGSSN 359
Query: 2064 SITSSSPAS 2072
+ ++SS ++
Sbjct: 360 NNSNSSASA 368
Score = 45.2 bits (106), Expect = 4e-04
Identities = 44/206 (21%), Positives = 85/206 (41%), Gaps = 20/206 (9%)
Query: 1967 TSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTT 2026
TS+P T + E+TT + + T T ++ TT +P T + V ++ T
Sbjct: 144 TSTP---VAPTQEVKKETTTQQAAPAAETKTEVKQTTQATTPAPKVAETKETPVVDQNAT 200
Query: 2027 TISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPAS--ESTTTSSPASES 2084
T + +S T + + + S NN +S SI + ++ T++P +E
Sbjct: 201 THAVKSGDTIWALSVKYGVSVQDIMS----WNNLSSSSIYVGQKLAIKQTANTATPKAE- 255
Query: 2085 TTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSANED 2144
T +PA+E P + T ++ T ++ E+TT +Q +P + +A
Sbjct: 256 VKTEAPAAEKQAA--PVVKENTNTN------TATTEKKETTT--QQQTAPKAPTEAAKPA 305
Query: 2145 PEEFPNEDVFEHTFAEIPNIDHSNQT 2170
P N + + N +++N +
Sbjct: 306 PAPSTNTNANKTNTNTNTNTNNTNTS 331
Score = 44.4 bits (104), Expect = 7e-04
Identities = 29/159 (18%), Positives = 74/159 (46%), Gaps = 14/159 (8%)
Query: 1871 IFTTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSL 1930
I + NN S S++ + + + T +P++E T + E+ +T T++
Sbjct: 224 IMSWNNLSSSSIYVGQ-KLAIKQTANTATPKAEVKTEAPAAEKQAAPVVKENTNTNTATT 282
Query: 1931 VSESTTTSSPESESTTTSSPE-----SESTTTSSLVSESTTTSSPESESTTTSSPESEST 1985
+ TTT + T +P + + +T++ +++ T ++ + +T TS+P +
Sbjct: 283 EKKETTTQ----QQTAPKAPTEAAKPAPAPSTNTNANKTNTNTNTNTNNTNTSTPSKNTN 338
Query: 1986 TTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSES 2024
T ++ S T ++ + + S + +++ S+ ++E+
Sbjct: 339 TNTN----SNTNTNSNTNANQGSSNNNSNSSASAIIAEA 373
>gnl|CDD|217837 pfam04003, Utp12, Dip2/Utp12 Family. This domain is found at the
C-terminus of proteins containing WD40 repeats. These
proteins are part of the U3 ribonucleoprotein the yeast
protein is called Utp12 or DIP2.
Length = 109
Score = 61.8 bits (151), Expect = 3e-11
Identities = 28/78 (35%), Positives = 43/78 (55%), Gaps = 1/78 (1%)
Query: 1630 SSELEEVLLVLSLSQVTDLLTHLSSLL-DSSHHRCELVIRVAVFLVRIHHGPLTASKELL 1688
S++E LL L S V LL L+ L E ++R FL+RIH L ++ LL
Sbjct: 15 PSDIELTLLSLPFSYVLRLLEFLAERLQAERSPHLEFLLRWLKFLLRIHGKYLVSNPNLL 74
Query: 1689 PVLQRLEQLASRRVEEIR 1706
P L+ L+++ RRV+++R
Sbjct: 75 PQLRSLQKVLRRRVKDLR 92
>gnl|CDD|227430 COG5099, COG5099, RNA-binding protein of the Puf family,
translational repressor [Translation, ribosomal structure
and biogenesis].
Length = 777
Score = 64.0 bits (156), Expect = 9e-10
Identities = 51/257 (19%), Positives = 104/257 (40%), Gaps = 22/257 (8%)
Query: 1894 NTTTN-SPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPES 1952
+T N P +S ++ +S ++T+S E + ++ + + S S S T +
Sbjct: 4 DTMNNLLPSIKSQLHHSKKSPPSSTTSQELMNGNSTP--NSFSPIPSKASSSATFTLNLP 61
Query: 1953 ESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVS 2012
+ + + ++ S++ S + + + + S ++ + SL+ E +++ +P +
Sbjct: 62 INNSVNHKITSSSS-SRRKPSGSWSVAISSSTSGSQSLLMEL---------PSSSFNPST 111
Query: 2013 ESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPAS 2072
S S+ ST + S T +SS AS +N + +N A+ + + SS +
Sbjct: 112 SSRNKSNSALSSTQQGNANSSVTLSSSTASSMFNSNKLPLPNPNHSNSATTNQSGSSFIN 171
Query: 2073 ESTTTSSPASESTTTSSPASESTTTS---------SPASESTTTSSPESESTTTSSPASE 2123
++SS + SS TS P+S+S T S+ S S S
Sbjct: 172 TPASSSSQPLTNLVVSSIKRFPYLTSLSPFFNYLIDPSSDSATASADTSPSFNPPPNLSP 231
Query: 2124 STTIEEQGVSPHSEKLS 2140
+ +SP + S
Sbjct: 232 NNLFSTSDLSPLPDTQS 248
Score = 61.3 bits (149), Expect = 5e-09
Identities = 42/225 (18%), Positives = 88/225 (39%), Gaps = 6/225 (2%)
Query: 1924 STTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESE 1983
S T ++L+ + +S +S+ E +S + + S S S T +
Sbjct: 3 SDTMNNLLPSIKSQLHHSKKSPPSSTTSQELMNGNSTPNSFSPIPSKASSSATFTLNLPI 62
Query: 1984 STTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVS---ESTTTISPESESTTTSSP 2040
+ + + ++ S SS + + + S S+T+ S +++ +P + S S+
Sbjct: 63 NNSVNHKITSS---SSSRRKPSGSWSVAISSSTSGSQSLLMELPSSSFNPSTSSRNKSNS 119
Query: 2041 ASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSP 2100
A ST N S T +++ AS S+ + S+ A+ + + SS + ++SS
Sbjct: 120 ALSSTQQGNANSSVTLSSSTASSMFNSNKLPLPNPNHSNSATTNQSGSSFINTPASSSSQ 179
Query: 2101 ASESTTTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSANEDP 2145
+ SS + TS + I+ S + ++
Sbjct: 180 PLTNLVVSSIKRFPYLTSLSPFFNYLIDPSSDSATASADTSPSFN 224
Score = 57.1 bits (138), Expect = 1e-07
Identities = 50/320 (15%), Positives = 99/320 (30%), Gaps = 17/320 (5%)
Query: 1829 YCDFDSSMNLLSVSPYITNNLLISMLAATAVAISVIDNYSEIIFTTNNNSESTVVMSTLN 1888
F + S S T NL I+ + S + S ST + +
Sbjct: 40 PNSFSPIPSKASSSATFTLNLPINNSVNHKITSSSSSRRKPSGSWSVAISSSTS--GSQS 97
Query: 1889 SLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTS 1948
L+ +++ +P + S +N S+T + S T SS + S S+
Sbjct: 98 LLMELPSSSFNPSTSSRNKSNSA-LSSTQQGNANSSVTLSSSTASSMFNSNKLP------ 150
Query: 1949 SPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTI 2008
+ S+ + + + SS + ++SS + SS+ TS + I
Sbjct: 151 ---LPNPNHSNSATTNQSGSSFINTPASSSSQPLTNLVVSSIKRFPYLTSLSPFFN-YLI 206
Query: 2009 SPVSESTTTSSPVSESTTTISPESESTTTSSPASEST---TTNNPKSESTTTNNPASESI 2065
P S+S T S+ S + P S + S T + + +++ +E
Sbjct: 207 DPSSDSATASADTS-PSFNPPPNLSPNNLFSTSDLSPLPDTQSVENNIILNSSSSINELT 265
Query: 2066 TSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASEST 2125
+ S + + +S S S+ + + + S + S
Sbjct: 266 SIYGSVPSIRNLRGLNSALVSFLNVSSSSLAFSALNGKEVSPTGSPSTRSFARVLPKSSP 325
Query: 2126 TIEEQGVSPHSEKLSANEDP 2145
+ +
Sbjct: 326 NNLLTEILTTGVNPPQSLPS 345
Score = 53.6 bits (129), Expect = 1e-06
Identities = 39/234 (16%), Positives = 86/234 (36%), Gaps = 12/234 (5%)
Query: 1891 LSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSP 1950
++ +++ E N+ + + S S S T + + + + + + S+++
Sbjct: 20 SKKSPPSSTTSQELMNGNSTPNSFSPIPSKASSSATFTLNLPINNSVNHKITSSSSSRRK 79
Query: 1951 ESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISP 2010
S S + + S S + S +++ +P + S S S ++T + S+ T+S
Sbjct: 80 PSGSWSVAISSSTSGSQSLLMELPSSSFNPSTSSRNKS--NSALSSTQQGNANSSVTLSS 137
Query: 2011 VSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSP 2070
+ S+ +S S +T S + +T S+++ + ++S
Sbjct: 138 STASSMFNSNKLPLPNPNHSNSATTNQSGSSFINT------PASSSSQPLTNLVVSSIKR 191
Query: 2071 ASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASES 2124
T+ S + SS S T S+ S + P S + S
Sbjct: 192 FPYLTSLSPFFNYLIDPSSD---SATASADTS-PSFNPPPNLSPNNLFSTSDLS 241
>gnl|CDD|227709 COG5422, ROM1, RhoGEF, Guanine nucleotide exchange factor for
Rho/Rac/Cdc42-like GTPases [Signal transduction
mechanisms].
Length = 1175
Score = 63.8 bits (155), Expect = 1e-09
Identities = 49/232 (21%), Positives = 87/232 (37%), Gaps = 18/232 (7%)
Query: 1903 ESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPE-SESTTTSSPESESTTTSSLV 1961
+ N +++ + S ES S+ +SSP+ + ++ P + S + +S
Sbjct: 43 PISIRNGADNDIINSESKESFGKYALGHQIFSSFSSSPKLFQRRNSAGPITHSPSATSS- 101
Query: 1962 SESTTTSSPESESTTTSSPESES-----TTTSSLVSESTTTSSPESESTTTISPVSESTT 2016
TSS S SP S+S ++T S SP + + P S +
Sbjct: 102 -----TSSLNSNDGDQFSPASDSLSFNPSSTQSRKDSGPGDGSPVQKRKNPLLPSSSTHG 156
Query: 2017 TSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTT 2076
T P+ + S S S + + + S S S TS+ + S
Sbjct: 157 THPPIVFTDNNGSHAGAPNARSRKEIPSLGSQSMQLPSPHFRQKFSSSDTSNGFSYPSIR 216
Query: 2077 TSSPASESTTTSSPASEST------TTSSPASESTTTSSPESESTTTSSPAS 2122
+S S ++ S P S + + SS AS ++ +P S ++ S +S
Sbjct: 217 KNSRHSSNSMPSFPHSSTAVLLKRHSGSSGASLISSNITPSSSNSEAMSTSS 268
Score = 63.0 bits (153), Expect = 2e-09
Identities = 49/228 (21%), Positives = 86/228 (37%), Gaps = 18/228 (7%)
Query: 1892 SENTTTNSPESESTTTNNPESESTTTSSPESESTTTSS-----LVSESTTTSSPESESTT 1946
++ + S ES + S+ +SSP+ S+ S +++TSS S
Sbjct: 52 NDIINSESKESFGKYALGHQIFSSFSSSPKLFQRRNSAGPITHSPSATSSTSSLNSNDGD 111
Query: 1947 TSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTT 2006
SP S+S + + S+T S +S S + + L+ S+T + T
Sbjct: 112 QFSPASDSLS----FNPSSTQSRKDSGPGDGSPVQKRK---NPLLPSSSTHGTHPPIVFT 164
Query: 2007 --TISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASES 2064
S S S + S + S S S T+N S N+ S +
Sbjct: 165 DNNGSHAGAPNARSRKEIPSLGSQSMQLPSPHFRQKFSSSDTSNGFSYPSIRKNSRHSSN 224
Query: 2065 ITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPES 2112
S P S +++ + + SS AS ++ +P+S ++ S S
Sbjct: 225 SMPSFPHS----STAVLLKRHSGSSGASLISSNITPSSSNSEAMSTSS 268
Score = 56.4 bits (136), Expect = 2e-07
Identities = 43/249 (17%), Positives = 87/249 (34%), Gaps = 13/249 (5%)
Query: 1911 ESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSP 1970
+S++ + ++ + + +++ + S ES S+ +SSP
Sbjct: 22 KSDAFVSKQLLPPRRL-QRKLNPISIRNGADNDIINSESKESFGKYALGHQIFSSFSSSP 80
Query: 1971 E-SESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSES-----TTTSSPVSES 2024
+ + ++ P + S + +S TSS S SP S+S ++T S
Sbjct: 81 KLFQRRNSAGPITHSPSATSS------TSSLNSNDGDQFSPASDSLSFNPSSTQSRKDSG 134
Query: 2025 TTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASES 2084
SP + P+S + T+ P + + A S S + S S
Sbjct: 135 PGDGSPVQKRKNPLLPSSSTHGTHPPIVFTDNNGSHAGAPNARSRKEIPSLGSQSMQLPS 194
Query: 2085 TTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSANED 2144
S S T++ + S +S S ++ S P S + + ++ L ++
Sbjct: 195 PHFRQKFSSSDTSNGFSYPSIRKNSRHSSNSMPSFPHSSTAVLLKRHSGSSGASLISSNI 254
Query: 2145 PEEFPNEDV 2153
N +
Sbjct: 255 TPSSSNSEA 263
Score = 44.9 bits (106), Expect = 7e-04
Identities = 42/186 (22%), Positives = 70/186 (37%), Gaps = 6/186 (3%)
Query: 1876 NNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSEST 1935
+ S S+ +SL S + SP S+S + N ++T S + S V +
Sbjct: 91 PITHSPSATSSTSSLNSNDGDQFSPASDSLSFNP-----SSTQSRKDSGPGDGSPVQKRK 145
Query: 1936 TTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSEST 1995
P S + T P + S S E S + S + S S S
Sbjct: 146 NPLLPSSSTHGTHPPIVFTDNNGSHAGAPNARSRKEIPSLGSQSMQLPSPHFRQKFSSSD 205
Query: 1996 TTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTN-NPKSES 2054
T++ S S S ++ S P S + + S S+ S +S T ++ N ++ S
Sbjct: 206 TSNGFSYPSIRKNSRHSSNSMPSFPHSSTAVLLKRHSGSSGASLISSNITPSSSNSEAMS 265
Query: 2055 TTTNNP 2060
T++ P
Sbjct: 266 TSSKRP 271
Score = 35.3 bits (81), Expect = 0.51
Identities = 40/201 (19%), Positives = 69/201 (34%), Gaps = 14/201 (6%)
Query: 1759 HHSRDINSVSPNVTSKILTTDNYSEIIFTTNNNSESTVVMSTLNSLLSENEKLFKPHAKT 1818
+ NS P S T+ T++ NS S + LS N + +
Sbjct: 81 KLFQRRNSAGPITHSPSATS-------STSSLNSNDGDQFSPASDSLSFNPSSTQSRKDS 133
Query: 1819 -PGAEFLIQCQYCDFDSSMNLLSVSPYITNNLLISMLAATAVAISVIDNYSEIIFTTNNN 1877
PG +Q + + L S S + T+ ++ + A + + I + +
Sbjct: 134 GPGDGSPVQKR-----KNPLLPSSSTHGTHPPIVFTDNNGSHAGAPNARSRKEIPSLGSQ 188
Query: 1878 SESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTT 1937
S S + T+N S N+ S ++ S P S + S S+
Sbjct: 189 SMQLPSPHF-RQKFSSSDTSNGFSYPSIRKNSRHSSNSMPSFPHSSTAVLLKRHSGSSGA 247
Query: 1938 SSPESESTTTSSPESESTTTS 1958
S S T +SS +T+S
Sbjct: 248 SLISSNITPSSSNSEAMSTSS 268
Score = 34.5 bits (79), Expect = 0.92
Identities = 32/169 (18%), Positives = 63/169 (37%), Gaps = 12/169 (7%)
Query: 2009 SPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSS 2068
SES + + S S S + + + S +++T++ S
Sbjct: 54 IINSESKESFGKYALGHQIFSSFSSSPKLFQRRNSAGPITHSPSATSSTSSLNSNDGDQF 113
Query: 2069 SPASES-----TTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESES----TTTSS 2119
SPAS+S ++T S SP + P+S + T P + + +
Sbjct: 114 SPASDSLSFNPSSTQSRKDSGPGDGSPVQKRKNPLLPSSSTHGTHPPIVFTDNNGSHAGA 173
Query: 2120 PASESTTIEEQGVSPHSEKLSANEDPEEFPNEDVFEHTFAEIPNIDHSN 2168
P + S E + S +L + ++F + D + F+ P+I ++
Sbjct: 174 PNARSRK-EIPSLGSQSMQLPSPHFRQKFSSSD-TSNGFS-YPSIRKNS 219
>gnl|CDD|118131 pfam09595, Metaviral_G, Metaviral_G glycoprotein. This is a viral
attachment glycoprotein from region G of metaviruses. It
is high in serine and threonine suggesting it is highly
glycosylated.
Length = 183
Score = 58.5 bits (141), Expect = 3e-09
Identities = 36/154 (23%), Positives = 66/154 (42%), Gaps = 8/154 (5%)
Query: 1977 TSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTT 2036
TSSP +ES+ + ++P S+ T S S + ++ S T + ++T
Sbjct: 32 TSSPPTESSKKTPTTPTDNPDTNPNSQHPTQQSTESSTLPAATSESHLETEPTSTPDTTN 91
Query: 2037 TSSPASESTT----TNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPAS 2092
TT + +++ + + +P + ++T + + +T TSS
Sbjct: 92 RQQTVDRHTTPPSSSRTQTTQAVHEKKNTRTTSRTQTPPT-TSTAAVQTTTTTNTSSTGK 150
Query: 2093 ESTTTSS-PASESTTTSSPESESTTTSSPASEST 2125
E TTTS P S +TT S E++ + +S ST
Sbjct: 151 EPTTTSVQPRSSATTQSH--EETSQANPQSSAST 182
Score = 55.4 bits (133), Expect = 3e-08
Identities = 42/170 (24%), Positives = 71/170 (41%), Gaps = 12/170 (7%)
Query: 1895 TTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESES 1954
T+ S S+ ++ TT + ++ S ++ +T SS +T+ S E+E
Sbjct: 24 NTSESEHHTSSPPTESSKKTPTTPTDNPDTNPNSQHPTQQSTESSTLPAATSESHLETEP 83
Query: 1955 TTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSES 2014
T+T TT+ ++ T+ P S T T+ V E T + T +
Sbjct: 84 TSTPD------TTNRQQTVDRHTTPPSSSRTQTTQAVHEKKNTRTTSRTQTP-----PTT 132
Query: 2015 TTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASES 2064
+T + + +T T S E TTTS S TT + E++ N +S S
Sbjct: 133 STAAVQTTTTTNTSSTGKEPTTTSVQPRSSATTQS-HEETSQANPQSSAS 181
Score = 54.6 bits (131), Expect = 5e-08
Identities = 54/188 (28%), Positives = 79/188 (42%), Gaps = 15/188 (7%)
Query: 1858 AVAISVIDNYSEIIFTTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTT 1917
A+ I +I NY+ + SE S S+ T T ++ T NP S+ T
Sbjct: 10 ALNIYLIINYA--TQKNTSESEHHTSSPPTES--SKKTPTTPTDNPDT---NPNSQHPTQ 62
Query: 1918 SSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTT 1977
S ES + ++ S T + ++T TT S TT + E ++T T
Sbjct: 63 QSTESSTLPAATSESHLETEPTSTPDTTNRQQTVDRHTTPPSSSRTQTTQAVHEKKNTRT 122
Query: 1978 SSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTT 2037
+S TTS+ ++TTT T T S E TTTS S TT S E E++
Sbjct: 123 TSRTQTPPTTSTAAVQTTTT-------TNTSSTGKEPTTTSVQPRSSATTQSHE-ETSQA 174
Query: 2038 SSPASEST 2045
+ +S ST
Sbjct: 175 NPQSSAST 182
Score = 53.9 bits (129), Expect = 1e-07
Identities = 41/171 (23%), Positives = 74/171 (43%), Gaps = 12/171 (7%)
Query: 1925 TTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESES 1984
T+ S S+ + ++ TT + ++ S ++ +T SS +T+ S E+E
Sbjct: 24 NTSESEHHTSSPPTESSKKTPTTPTDNPDTNPNSQHPTQQSTESSTLPAATSESHLETEP 83
Query: 1985 TTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASES 2044
T+T TT+ ++ T P S T T+ V E T ++ T +P + S
Sbjct: 84 TSTPD------TTNRQQTVDRHTTPPSSSRTQTTQAVHEKKNT----RTTSRTQTPPTTS 133
Query: 2045 TTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASEST 2095
T + + T++ + TS P S +TT S E++ + +S ST
Sbjct: 134 TAAVQTTTTTNTSSTGKEPTTTSVQPRSSATTQSH--EETSQANPQSSAST 182
Score = 47.7 bits (113), Expect = 1e-05
Identities = 34/152 (22%), Positives = 61/152 (40%), Gaps = 10/152 (6%)
Query: 1997 TSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTT 2056
TSSP +ES+ ++P S+ T S ES + ++ S T ++T
Sbjct: 32 TSSPPTESSKKTPTTPTDNPDTNPNSQHPTQQSTESSTLPAATSESHLETEPTSTPDTT- 90
Query: 2057 TNNPASESITSSSPASES-------TTTSSPASESTTTSSPASESTTTSSPASESTTTSS 2109
N ++P S S ++ T +P + ST + + T+S+
Sbjct: 91 --NRQQTVDRHTTPPSSSRTQTTQAVHEKKNTRTTSRTQTPPTTSTAAVQTTTTTNTSST 148
Query: 2110 PESESTTTSSPASESTTIEEQGVSPHSEKLSA 2141
+ +TT+ P S +TT + S + + SA
Sbjct: 149 GKEPTTTSVQPRSSATTQSHEETSQANPQSSA 180
>gnl|CDD|218440 pfam05110, AF-4, AF-4 proto-oncoprotein. This family consists of AF4
(Proto-oncogene AF4) and FMR2 (Fragile X E mental
retardation syndrome) nuclear proteins. These proteins
have been linked to human diseases such as acute
lymphoblastic leukaemia and mental retardation. The
family also contains a Drosophila AF4 protein homologue
Lilliputian which contains an AT-hook domain. Lilliputian
represents a novel pair-rule gene that acts in
cytoskeleton regulation, segmentation and morphogenesis
in Drosophila.
Length = 1154
Score = 62.2 bits (151), Expect = 3e-09
Identities = 64/317 (20%), Positives = 115/317 (36%), Gaps = 53/317 (16%)
Query: 1873 TTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVS 1932
++ ++ S S L L +++ +S E ++T + + S E +SS
Sbjct: 342 SSKTSTNSQSGTSMLEDDLKLSSSEDSDEEQATEKPPSRNTPPSAPSSNPEPAASSS--- 398
Query: 1933 ESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTS-------------- 1978
+++SS SES++ S ESES+++ S +E T+SPE E +T+
Sbjct: 399 -GSSSSSSGSESSSGSDSESESSSSDSEENEPPRTASPEPEPPSTNKWQLDNWLNKVNPH 457
Query: 1979 --SPES------------ESTTTSSLVSESTTTSSPESESTTTISPVSESTTT------- 2017
SP E S E ++ T
Sbjct: 458 KVSPAESVSSNPPIKQPMEKEGKVKSSGSQYHPESKEPPPKSSSKEKRRPRTAQKGPESG 517
Query: 2018 -----SSPVSESTT---TISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSS 2069
S SE+ T+ + + A + T P+SE T +S
Sbjct: 518 RGKQKSPAQSEAPPQRRTVGKKQPKKPEKASAGDERTGLRPESEPGTLPYGSSVQTPPDR 577
Query: 2070 PASESTTTSSPA--SESTTTSSPASESTTTSSPA-SESTTTSSPESESTTTSSPASES-- 2124
P + + + P+ E ++ PA+E SP+ + E++S+++ SP ES
Sbjct: 578 PKAATKGSRKPSPRKEPKSSVPPAAEKRKYKSPSKIVPKSREFIETDSSSSDSPEDESLP 637
Query: 2125 TTIEEQGVSPHSEKLSA 2141
+ + G + S K S
Sbjct: 638 PSSQSPG-NTESSKESC 653
Score = 60.7 bits (147), Expect = 1e-08
Identities = 54/262 (20%), Positives = 94/262 (35%), Gaps = 25/262 (9%)
Query: 1896 TTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESEST 1955
T +S T N + + ++ +S S+S T S++ + SS E ++ + S
Sbjct: 323 TKDSQHVSPGTQNQKQYDPSSKTSTNSQSGT--SMLEDDLKLSSSEDSDEEQATEKPPSR 380
Query: 1956 TTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSEST 2015
T S + S +++SS SES++ S SES+++ S E+E T SP E
Sbjct: 381 NTPPSAPSSNPEPAASSSGSSSSSSGSESSSGSDSESESSSSDSEENEPPRTASPEPEPP 440
Query: 2016 TT---------------SSPVSESTTTISP-----ESESTTTSSPASESTTTNNPKSEST 2055
+T +ES ++ P E E SS + + P +S+
Sbjct: 441 STNKWQLDNWLNKVNPHKVSPAESVSSNPPIKQPMEKEGKVKSSGSQYHPESKEPPPKSS 500
Query: 2056 TTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSS---PASESTTTSSPES 2112
+ + + S SE+ + A + T PES
Sbjct: 501 SKEKRRPRTAQKGPESGRGKQKSPAQSEAPPQRRTVGKKQPKKPEKASAGDERTGLRPES 560
Query: 2113 ESTTTSSPASESTTIEEQGVSP 2134
E T +S T + +
Sbjct: 561 EPGTLPYGSSVQTPPDRPKAAT 582
Score = 49.5 bits (118), Expect = 2e-05
Identities = 58/242 (23%), Positives = 95/242 (39%), Gaps = 23/242 (9%)
Query: 1910 PESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSS 1969
E++S+++ SPE ES SS +T +S S T S + + L + +
Sbjct: 621 IETDSSSSDSPEDESLPPSSQSPGNTESSKESCASLRTPVCRSSVGSQNDLSKDRLLSPM 680
Query: 1970 PESESTTTSSPESESTTTSSLVSESTT---TSSPESESTTTISP-VSESTTTSSPVSEST 2025
E+E SP +S SL + + P + P +E + S+P +++
Sbjct: 681 RETE---LLSPLRDSEERYSLWVKIDLDLLSRIPGHPYKKGVPPKPAEKDSLSAPKKQTS 737
Query: 2026 TTIS--PESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASE 2083
T S S+ E+ + K ++ S S +SS S S S +S
Sbjct: 738 KTASEKSSSKGKRKHKNDEEADKIESKKQRLEEKSSSCSPSSSSSHHHSSSNKESRKSSR 797
Query: 2084 ST------TTSSPASESTTTSS-------PASESTTTSS-PESESTTTSSPASESTTIEE 2129
+ + SSP S S+ E T++SS P S S+T SS S ST+
Sbjct: 798 NKEEEMLPSPSSPLSSSSPKPEHPSRKRPRRQEDTSSSSGPFSASSTKSSSKSSSTSKHR 857
Query: 2130 QG 2131
+
Sbjct: 858 KT 859
Score = 46.5 bits (110), Expect = 2e-04
Identities = 52/246 (21%), Positives = 92/246 (37%), Gaps = 17/246 (6%)
Query: 1911 ESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSP 1970
E +S+ + E + S + + E++S+++ SPE ES SS +T +S
Sbjct: 593 EPKSSVPPAAEKRKYKSPSKIV-PKSREFIETDSSSSDSPEDESLPPSSQSPGNTESSKE 651
Query: 1971 ESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTI-- 2028
S T S + + L + + E+E +SP+ +S S + +
Sbjct: 652 SCASLRTPVCRSSVGSQNDLSKDRLLSPMRETE---LLSPLRDSEERYSLWVKIDLDLLS 708
Query: 2029 -SPESESTTTSSPA-SESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTT 2086
P P +E + + PK +++ T S +S+ E+
Sbjct: 709 RIPGHPYKKGVPPKPAEKDSLSAPKKQTSKT--------ASEKSSSKGKRKHKNDEEADK 760
Query: 2087 TSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSANEDPE 2146
S SS S S+++S S S S +S + EE SP S S++ PE
Sbjct: 761 IESKKQRLEEKSSSCSPSSSSSHHHSSSNKESRKSSRNKE-EEMLPSPSSPLSSSSPKPE 819
Query: 2147 EFPNED 2152
+
Sbjct: 820 HPSRKR 825
Score = 44.9 bits (106), Expect = 6e-04
Identities = 43/211 (20%), Positives = 69/211 (32%), Gaps = 23/211 (10%)
Query: 1952 SESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPV 2011
ES S + ES S T + L S S + + +
Sbjct: 236 DESPELKSSIEESYGQQS--FGKTMDELKSPAKAKLTKLKIPSQPVEQSYSGDVSCVEEI 293
Query: 2012 SESTTTSSPVSESTTTISPESESTTTSSPASES----TTTNNPKSESTTTNNPA------ 2061
+ T S P + ++E + P +S T N K ++
Sbjct: 294 LKEMTHSWPPPLTAIHTPGKTEPSKFPFPTKDSQHVSPGTQNQKQYDPSSKTSTNSQSGT 353
Query: 2062 ---------SESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPES 2112
S S S + S S +S+P ++++ S +S S + SS S
Sbjct: 354 SMLEDDLKLSSSEDSDEEQATEKPPSRNTPPSAPSSNPEPAASSSGSSSSSSGSESSSGS 413
Query: 2113 ESTTTSSPASESTTIEEQGV-SPHSEKLSAN 2142
+S + SS +S+S E SP E S N
Sbjct: 414 DSESESS-SSDSEENEPPRTASPEPEPPSTN 443
Score = 44.1 bits (104), Expect = 0.001
Identities = 55/274 (20%), Positives = 98/274 (35%), Gaps = 47/274 (17%)
Query: 1900 PESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTS----------- 1948
E++S+++++PE ES SS +T +S S T S + +
Sbjct: 621 IETDSSSSDSPEDESLPPSSQSPGNTESSKESCASLRTPVCRSSVGSQNDLSKDRLLSPM 680
Query: 1949 ------SPESESTTTSSLV---------------SESTTTSSPESESTTTSSPESESTTT 1987
SP +S SL + P + + ++ + S T
Sbjct: 681 RETELLSPLRDSEERYSLWVKIDLDLLSRIPGHPYKKGVPPKPAEKDSLSAPKKQTSKTA 740
Query: 1988 SSLVS-ESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASE--- 2043
S S + + E+ S SS S S+++ S S S +S
Sbjct: 741 SEKSSSKGKRKHKNDEEADKIESKKQRLEEKSSSCSPSSSSSHHHSSSNKESRKSSRNKE 800
Query: 2044 ---------STTTNNPKSESTTTNNPASESITSSS--PASESTTTSSPASESTTTSSPAS 2092
++++PK E + P + TSSS P S S+T SS S ST+
Sbjct: 801 EEMLPSPSSPLSSSSPKPEHPSRKRPRRQEDTSSSSGPFSASSTKSSSKSSSTSKHRKTE 860
Query: 2093 ESTTTSSPASESTTTSSPESESTTTSSPASESTT 2126
+++S + ++ +P S+ P S ++
Sbjct: 861 GKGSSTSKEHKGSSGDTPNKASSFPVPPLSNGSS 894
Score = 42.2 bits (99), Expect = 0.004
Identities = 50/260 (19%), Positives = 96/260 (36%), Gaps = 27/260 (10%)
Query: 1892 SENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTS--- 1948
+++++++SPE ES ++ +T +S S T S + + + +
Sbjct: 623 TDSSSSDSPEDESLPPSSQSPGNTESSKESCASLRTPVCRSSVGSQNDLSKDRLLSPMRE 682
Query: 1949 ----SPESESTTTSSLV---------------SESTTTSSPESESTTTSSPESESTTTSS 1989
SP +S SL + P + + ++ + S T S
Sbjct: 683 TELLSPLRDSEERYSLWVKIDLDLLSRIPGHPYKKGVPPKPAEKDSLSAPKKQTSKTASE 742
Query: 1990 LVS-ESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTN 2048
S + + E+ S SS S S+++ S S S +S +
Sbjct: 743 KSSSKGKRKHKNDEEADKIESKKQRLEEKSSSCSPSSSSSHHHSSSNKESRKSSRNKEEE 802
Query: 2049 --NPKSESTTTNNPASESITSSSPASESTTTSS--PASESTTTSSPASESTTTSSPASES 2104
S ++++P E + P + T+SS P S S+T SS S ST+
Sbjct: 803 MLPSPSSPLSSSSPKPEHPSRKRPRRQEDTSSSSGPFSASSTKSSSKSSSTSKHRKTEGK 862
Query: 2105 TTTSSPESESTTTSSPASES 2124
+++S E + ++ +P S
Sbjct: 863 GSSTSKEHKGSSGDTPNKAS 882
Score = 35.7 bits (82), Expect = 0.46
Identities = 32/129 (24%), Positives = 55/129 (42%), Gaps = 6/129 (4%)
Query: 1898 NSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTT 1957
+S S S+++++ S S S S + L S S+ SS S P +
Sbjct: 772 SSSCSPSSSSSHHHSSSNKESRKSSRNKEEEMLPSPSSPLSS---SSPKPEHPSRKRPRR 828
Query: 1958 SSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTT 2017
S S S P S S+T SS +S ST+ +++S E + ++ +P S+
Sbjct: 829 QEDTSSS---SGPFSASSTKSSSKSSSTSKHRKTEGKGSSTSKEHKGSSGDTPNKASSFP 885
Query: 2018 SSPVSESTT 2026
P+S ++
Sbjct: 886 VPPLSNGSS 894
Score = 31.4 bits (71), Expect = 7.3
Identities = 29/130 (22%), Positives = 53/130 (40%), Gaps = 6/130 (4%)
Query: 1878 SESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTT 1937
E + S +S ++++N +S + E SP S +++S +
Sbjct: 769 EEKSSSCSPSSSSSHHHSSSNKESRKS----SRNKEEEMLPSPSSPLSSSSPKPEHPSRK 824
Query: 1938 SSPESESTTTSS-PESESTTTSSLVSESTTTSSP-ESESTTTSSPESESTTTSSLVSEST 1995
E T++SS P S S+T SS S ST+ E + ++TS S+ + + S
Sbjct: 825 RPRRQEDTSSSSGPFSASSTKSSSKSSSTSKHRKTEGKGSSTSKEHKGSSGDTPNKASSF 884
Query: 1996 TTSSPESEST 2005
+ S+
Sbjct: 885 PVPPLSNGSS 894
>gnl|CDD|165513 PHA03255, PHA03255, BDLF3; Provisional.
Length = 234
Score = 59.5 bits (143), Expect = 3e-09
Identities = 40/160 (25%), Positives = 77/160 (48%), Gaps = 9/160 (5%)
Query: 1967 TSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTT 2026
TS + S ++++ T T+++ + S + S P + +TT++ S TT++ +S +TT
Sbjct: 20 TSLIWTSSGSSTASAGNVTGTTAVTTPSPSASGPSTNQSTTLTTTSAPITTTAILSTNTT 79
Query: 2027 TISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASEST-TTSSPASEST 2085
T+ T+T + + TT+N + + TT A + + ST TS+ + S+
Sbjct: 80 TV------TSTGTTVTPVPTTSNASTINVTTKVTAQNITATEAGTGTSTGVTSNVTTRSS 133
Query: 2086 TTSSPASEST--TTSSPASESTTTSSPESESTTTSSPASE 2123
+T+S + T TT +P S TS+ + + E
Sbjct: 134 STTSATTRITNATTLAPTLSSKGTSNATKTTAELPTVPDE 173
Score = 56.5 bits (135), Expect = 3e-08
Identities = 37/158 (23%), Positives = 68/158 (43%), Gaps = 5/158 (3%)
Query: 1914 STTTSSPESESTTTSS--LVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPE 1971
+++ SS S T + + + S + S P + +TT + S TT++++S +TTT +
Sbjct: 25 TSSGSSTASAGNVTGTTAVTTPSPSASGPSTNQSTTLTTTSAPITTTAILSTNTTTVTST 84
Query: 1972 SESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPE 2031
+ T S ++T + + + T T + V+ + TT S STT+ +
Sbjct: 85 GTTVTPVPTTSNASTINVTTKVTAQNITATEAGTGTSTGVTSNVTTR---SSSTTSATTR 141
Query: 2032 SESTTTSSPASESTTTNNPKSESTTTNNPASESITSSS 2069
+ TT +P S T+N + E S S
Sbjct: 142 ITNATTLAPTLSSKGTSNATKTTAELPTVPDERQPSLS 179
Score = 54.5 bits (130), Expect = 2e-07
Identities = 38/155 (24%), Positives = 77/155 (49%), Gaps = 7/155 (4%)
Query: 1937 TSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTT 1996
TS + S ++++ T T+++ + S + S P + +TT + S TT++++S +TT
Sbjct: 20 TSLIWTSSGSSTASAGNVTGTTAVTTPSPSASGPSTNQSTTLTTTSAPITTTAILSTNTT 79
Query: 1997 TSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASEST-TTNNPKSEST 2055
T + + TT++PV TTS+ + + TT T T + ST T+N + S+
Sbjct: 80 TVT---STGTTVTPVP---TTSNASTINVTTKVTAQNITATEAGTGTSTGVTSNVTTRSS 133
Query: 2056 TTNNPASESITSSSPASESTTTSSPASESTTTSSP 2090
+T + + +++ A ++ + + TT P
Sbjct: 134 STTSATTRITNATTLAPTLSSKGTSNATKTTAELP 168
Score = 53.0 bits (126), Expect = 5e-07
Identities = 43/154 (27%), Positives = 74/154 (48%), Gaps = 10/154 (6%)
Query: 1897 TNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTT 1956
T+S S ++ N + + TT SP + +T+ + TTTS+P + + S+ + T+
Sbjct: 25 TSSGSSTASAGNVTGTTAVTTPSPSASGPSTNQSTTL-TTTSAPITTTAILSTNTTTVTS 83
Query: 1957 TSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTT 2016
T + V+ TTS+ + + TT T T + T TS+ + + TT S STT
Sbjct: 84 TGTTVTPVPTTSNASTINVTTKVTAQNITATEA----GTGTSTGVTSNVTT---RSSSTT 136
Query: 2017 TSSPVSESTTTISPESESTTTSSPASESTTTNNP 2050
+++ + TT++P S TS + TT P
Sbjct: 137 SATTRITNATTLAPTLSSKGTS--NATKTTAELP 168
Score = 49.1 bits (116), Expect = 9e-06
Identities = 37/144 (25%), Positives = 66/144 (45%), Gaps = 8/144 (5%)
Query: 1896 TTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESEST 1955
TT SP + +TN + TTTS+P TT++++S +TTT + + T S ++
Sbjct: 44 TTPSPSASGPSTNQSTTL-TTTSAP----ITTTAILSTNTTTVTSTGTTVTPVPTTSNAS 98
Query: 1956 TTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSEST 2015
T + + + T TS+ + + TT S STT+++ + TT++P S
Sbjct: 99 TINVTTKVTAQNITATEAGTGTSTGVTSNVTTR---SSSTTSATTRITNATTLAPTLSSK 155
Query: 2016 TTSSPVSESTTTISPESESTTTSS 2039
TS+ + + E + S
Sbjct: 156 GTSNATKTTAELPTVPDERQPSLS 179
Score = 49.1 bits (116), Expect = 9e-06
Identities = 42/169 (24%), Positives = 74/169 (43%), Gaps = 8/169 (4%)
Query: 1921 ESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSP 1980
E+ TSS S ++ + + + TT SP + +T+ + TTTS+P + + S+
Sbjct: 19 ETSLIWTSSGSSTASAGNVTGTTAVTTPSPSASGPSTNQSTTL-TTTSAPITTTAILSTN 77
Query: 1981 ESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSP 2040
+ T+T + V+ TTS+ + + TT T T + ST S + TT S
Sbjct: 78 TTTVTSTGTTVTPVPTTSNASTINVTTKVTAQNITATEAGTGTSTGVTS----NVTTRSS 133
Query: 2041 ASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSS 2089
++ S TT + + TT P S +S+ + + E + S
Sbjct: 134 STTSATT---RITNATTLAPTLSSKGTSNATKTTAELPTVPDERQPSLS 179
Score = 47.6 bits (112), Expect = 3e-05
Identities = 36/159 (22%), Positives = 68/159 (42%), Gaps = 10/159 (6%)
Query: 1987 TSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTT 2046
TS + + S ++++ T T + + S + S P + +TT++ S TT++ S +TT
Sbjct: 20 TSLIWTSSGSSTASAGNVTGTTAVTTPSPSASGPSTNQSTTLTTTSAPITTTAILSTNTT 79
Query: 2047 TNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTT 2106
T T+T + T+S+ ++ + TT A T T + TS+ + + T
Sbjct: 80 T------VTSTGTTVTPVPTTSNASTINVTTKVTAQNITATEAGTG----TSTGVTSNVT 129
Query: 2107 TSSPESESTTTSSPASESTTIEEQGVSPHSEKLSANEDP 2145
T S + S TT + + + + E P
Sbjct: 130 TRSSSTTSATTRITNATTLAPTLSSKGTSNATKTTAELP 168
Score = 44.9 bits (105), Expect = 2e-04
Identities = 30/148 (20%), Positives = 65/148 (43%), Gaps = 10/148 (6%)
Query: 1873 TTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVS 1932
T + + + T S + +TN + +TT+ + TTT+ +T ++ V+
Sbjct: 31 TASAGNVTGTTAVTTPSPSASGPSTNQSTTLTTTS----APITTTAI----LSTNTTTVT 82
Query: 1933 ESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVS 1992
+ TT +P ++ S+ + T+ ++ + + + T+ + S STT+++
Sbjct: 83 STGTTVTPVPTTSNASTINVTTKVTAQNITATEAGTGTSTGVTSNVTTRSSSTTSATTRI 142
Query: 1993 ESTTTSSPESESTTTISPVSESTTTSSP 2020
+ TT +P S T + + TT P
Sbjct: 143 TNATTLAPTLSSKGTSN--ATKTTAELP 168
Score = 41.8 bits (97), Expect = 0.002
Identities = 34/134 (25%), Positives = 60/134 (44%), Gaps = 10/134 (7%)
Query: 1866 NYSEIIFTTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESEST 1925
N S + TT+ +T ++ST + ++ TT +P TT+N + + TT T
Sbjct: 56 NQSTTLTTTSAPITTTAILSTNTTTVTSTGTTVTPVP---TTSNASTINVTTKVTAQNIT 112
Query: 1926 TTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESEST 1985
T + T TS+ + + TT S STT+++ + TT +P S TS+ +
Sbjct: 113 ATEA----GTGTSTGVTSNVTT---RSSSTTSATTRITNATTLAPTLSSKGTSNATKTTA 165
Query: 1986 TTSSLVSESTTTSS 1999
++ E + S
Sbjct: 166 ELPTVPDERQPSLS 179
Score = 41.0 bits (95), Expect = 0.004
Identities = 21/104 (20%), Positives = 37/104 (35%)
Query: 1846 TNNLLISMLAATAVAISVIDNYSEIIFTTNNNSESTVVMSTLNSLLSENTTTNSPESEST 1905
TN ++ T + N S I TT +++ + T+ + S ST
Sbjct: 76 TNTTTVTSTGTTVTPVPTTSNASTINVTTKVTAQNITATEAGTGTSTGVTSNVTTRSSST 135
Query: 1906 TTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSS 1949
T+ + TT +P S TS+ + + E + S
Sbjct: 136 TSATTRITNATTLAPTLSSKGTSNATKTTAELPTVPDERQPSLS 179
Score = 38.7 bits (89), Expect = 0.025
Identities = 25/98 (25%), Positives = 49/98 (50%), Gaps = 1/98 (1%)
Query: 2031 ESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSP 2090
E+ TSS +S ++ N + + TT +P++ +++ + TTTS+P + + S+
Sbjct: 19 ETSLIWTSSGSSTASAGNVTGTTAVTTPSPSASGPSTNQSTTL-TTTSAPITTTAILSTN 77
Query: 2091 ASESTTTSSPASESTTTSSPESESTTTSSPASESTTIE 2128
+ T+T + + TTS+ + + TT A T E
Sbjct: 78 TTTVTSTGTTVTPVPTTSNASTINVTTKVTAQNITATE 115
>gnl|CDD|234368 TIGR03835, termin_org_DnaJ, terminal organelle assembly protein TopJ.
This model describes TopJ (MG_200, CbpA), a DnaJ homolog
and probable assembly protein of the Mycoplasma terminal
organelle. The terminal organelle is involved in both
cytadherence and gliding motility [Cellular processes,
Chemotaxis and motility].
Length = 871
Score = 61.8 bits (149), Expect = 4e-09
Identities = 57/316 (18%), Positives = 97/316 (30%), Gaps = 31/316 (9%)
Query: 1880 STVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSS 1939
S + T + + + E+ E E T + E++ T+ E
Sbjct: 260 SPTLEVTAPKEVEQPLQPEPVDEETVAETKAEEEPQPTQTVETKPTSAPESTVEENL--- 316
Query: 1940 PESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSS 1999
PE T + + S T S+ E T P+ E T V E T +
Sbjct: 317 PEINQ-PTQAVQPTSETISTTPVEPTDQLKPKEVDQIQ---EELKKTKEIEVEELPTKKN 372
Query: 2000 PESESTTTISPVSESTTTSSPVSESTTTISPES--ESTTTSSPASESTTTN-------NP 2050
E + + ++ PE E+ T E T +N +P
Sbjct: 373 DLVEIN-----FDDLEELKFELVQTNQEKEPEKAVENWATDYQLDEPTQSNIDWYKQEDP 427
Query: 2051 KSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASE---STTT 2107
K + A+ IT + S P+ EST E+ E +
Sbjct: 428 KDLEQLVQDQATLEITEENQISPEPVEEQPSVESTAPEDQVVEAIKEEEELLEQKKAAEF 487
Query: 2108 SSPESESTTTSSPASESTTIEEQGVSPHSEKLSANEDPEEFPNEDVFE-----HTFAEI- 2161
+ + T T+S + Q + N D ++ ++ F I
Sbjct: 488 AELFGQPTPTTSIEELLNPEQTQPTEFDEIIIENNLDNVSVADDQNYQLKDDNKKFINIS 547
Query: 2162 -PNIDHSNQTDEAIPE 2176
P I SN++D+ I +
Sbjct: 548 LPTIVSSNESDDLIYD 563
Score = 59.8 bits (144), Expect = 2e-08
Identities = 51/316 (16%), Positives = 92/316 (29%), Gaps = 26/316 (8%)
Query: 1893 ENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPES 1952
+ + E S+ T + L E + E+ + E
Sbjct: 238 RELEPQDDSEDDYVIPDAEIISSPTLEVTAPKEVEQPLQPEPV-----DEETVAETKAEE 292
Query: 1953 ESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVS 2012
E T ++ E+ TS+PES + + PE T + + T +++P E T + P
Sbjct: 293 EPQPTQTV--ETKPTSAPES-TVEENLPEINQPTQAVQPTSETISTTPV-EPTDQLKP-K 347
Query: 2013 ESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNP----ASESITSS 2068
E + ++ E + K E TN + ++
Sbjct: 348 EVDQIQEELKKTKEIEVEELPTKKNDLVEINFDDLEELKFELVQTNQEKEPEKAVENWAT 407
Query: 2069 SPASESTTTSS--------PASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSP 2120
+ T S+ P A+ T + S P EST
Sbjct: 408 DYQLDEPTQSNIDWYKQEDPKDLEQLVQDQATLEITEENQISPEPVEEQPSVESTAPEDQ 467
Query: 2121 ASESTTIEEQGVSPHSEKLSANEDPEEFPN---EDVFEHTFAEIPNIDHSNQTDEAIPET 2177
E+ EE+ + A + P E++ + P + +
Sbjct: 468 VVEAIKEEEELLEQKKAAEFAELFGQPTPTTSIEELLNPEQTQ-PTEFDEIIIENNLDNV 526
Query: 2178 FDAREEWPQCKDVIGK 2193
A ++ Q KD K
Sbjct: 527 SVADDQNYQLKDDNKK 542
>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein. This entry is a highly
conserved protein present in eukaryotes.
Length = 680
Score = 60.7 bits (147), Expect = 8e-09
Identities = 42/214 (19%), Positives = 76/214 (35%), Gaps = 19/214 (8%)
Query: 1933 ESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVS 1992
E+ T S + E + SS ++T + +++ + S+S+ + +PE E +
Sbjct: 205 ENHTLSVTDKEKSEASSK-GLTSTKELVPVQNSGGNHSLSKSSNSQTPELEYSEKGKDHH 263
Query: 1993 ESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKS 2052
S + + S TI + S P+S ST + +
Sbjct: 264 HSHNHQHHS---------IGINNHHSKHADSKLQTIEVIENHSNKSRPSSSSTNGSKETT 314
Query: 2053 ESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPES 2112
++++ S SS A S S SSP S S+ S S++ S ES
Sbjct: 315 SNSSSAAAGSIGSKSSKSAKHSNRNKSN-------SSPKSHSSANGS--VPSSSVSDNES 365
Query: 2113 ESTTTSSPASESTTIEEQGVSPHSEKLSANEDPE 2146
+ S +S + ++ + N PE
Sbjct: 366 KQKRASKSSSGARDSKKDASGMSANGTVENCIPE 399
Score = 53.0 bits (127), Expect = 2e-06
Identities = 41/229 (17%), Positives = 85/229 (37%), Gaps = 16/229 (6%)
Query: 1900 PESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSS 1959
P+ E+ T + + E + SS ++T + +++ + S+S+ + +PE E +
Sbjct: 202 PKEENHTLSVTDKEKSEASSK-GLTSTKELVPVQNSGGNHSLSKSSNSQTPELEYSEKGK 260
Query: 1960 LVSESTTTSSPE-SESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTS 2018
S + S T ++ + S P S ST ++ +S
Sbjct: 261 DHHHSHNHQHHSIGINNHHSKHADSKLQTIEVIENHSNKSRPSSSSTN--GSKETTSNSS 318
Query: 2019 SPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTS 2078
S + S + SS +++ + N S + ++ +SS +ES
Sbjct: 319 SAAAGSIGS---------KSSKSAKHSNRNKSNSSPKSHSSANGSVPSSSVSDNESKQKR 369
Query: 2079 SPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTI 2127
+ S S S S +++ E+ PE++ +T S+ I
Sbjct: 370 ASKSSSGARDSKKDASGMSANGTVENCI---PENKISTPSAIERLEQDI 415
Score = 48.0 bits (114), Expect = 7e-05
Identities = 34/166 (20%), Positives = 67/166 (40%), Gaps = 11/166 (6%)
Query: 1980 PESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSS 2039
P+ E+ T S E + SS ++T +++ + +S+S+ + +PE E +
Sbjct: 202 PKEENHTLSVTDKEKSEASSK-GLTSTKELVPVQNSGGNHSLSKSSNSQTPELEYSEKGK 260
Query: 2040 PASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSS 2099
S + NN S+ S E S S +++S+ S+ TT++S
Sbjct: 261 DHHHSHNHQHHSIGI---NNHHSKHADSKLQTIEVIENHSNKSRPSSSSTNGSKETTSNS 317
Query: 2100 PASESTTTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSANEDP 2145
++ + + S S+S S+ ++ SP S + P
Sbjct: 318 SSAAAGSIGSKSSKSAKHSNRNKSNS-------SPKSHSSANGSVP 356
Score = 45.3 bits (107), Expect = 4e-04
Identities = 36/187 (19%), Positives = 71/187 (37%), Gaps = 7/187 (3%)
Query: 1880 STVVMSTLNSLL-SENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTS 1938
S+ +++ L+ +N+ N S+S+ + PE E + S S +
Sbjct: 220 SSKGLTSTKELVPVQNSGGNHSLSKSSNSQTPELEYSEKGKDHHHSHNHQH---HSIGIN 276
Query: 1939 SPESESTTTSSPESE---STTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSEST 1995
+ S+ + E + + S S S+T S E+ S ++S+ + SS ++ +
Sbjct: 277 NHHSKHADSKLQTIEVIENHSNKSRPSSSSTNGSKETTSNSSSAAAGSIGSKSSKSAKHS 336
Query: 1996 TTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSEST 2055
+ S + S ++S +ES + +S S S S + N E+
Sbjct: 337 NRNKSNSSPKSHSSANGSVPSSSVSDNESKQKRASKSSSGARDSKKDASGMSANGTVENC 396
Query: 2056 TTNNPAS 2062
N S
Sbjct: 397 IPENKIS 403
Score = 37.6 bits (87), Expect = 0.10
Identities = 47/240 (19%), Positives = 91/240 (37%), Gaps = 27/240 (11%)
Query: 1787 TTNNNSESTVVMSTLNSLLSENEKLFK-PHAKTPGAEFLIQCQYCDFDSSMNLLSVSPYI 1845
++ + + ++ NS N L K +++TP E+ + + + S+
Sbjct: 220 SSKGLTSTKELVPVQNS--GGNHSLSKSSNSQTPELEYSEKGKDHHHSHNHQHHSIG--- 274
Query: 1846 TNNLLISMLAATAVAISVIDNYSEIIFTTNNNSESTVVMSTLNSLLSENTTTNSPESEST 1905
NN + I VI+N+S N S + + + + N++ S + S
Sbjct: 275 INNHHSKHADSKLQTIEVIENHS-------NKSRPSSSSTNGSKETTSNSS--SAAAGSI 325
Query: 1906 TTNNPESESTTT-----SSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSL 1960
+ + +S + SSP+S S+ S S++ S ES+ S S + +
Sbjct: 326 GSKSSKSAKHSNRNKSNSSPKSHSSANGS--VPSSSVSDNESKQKRASKSSSGARDSKKD 383
Query: 1961 VSESTTTSS-----PESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSEST 2015
S + + PE++ +T S+ E L +E ESE IS ++
Sbjct: 384 ASGMSANGTVENCIPENKISTPSAIERLEQDIKKLQAELQQARQNESELRNQISLLTSLE 443
>gnl|CDD|114270 pfam05539, Pneumo_att_G, Pneumovirinae attachment membrane
glycoprotein G.
Length = 408
Score = 59.7 bits (144), Expect = 9e-09
Identities = 38/207 (18%), Positives = 76/207 (36%), Gaps = 2/207 (0%)
Query: 1910 PESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSS 1969
P + + T S + +T+ ++ + + S+ T S
Sbjct: 139 PICQRDYNPRDRPKCRCTLRGKDVSCCKEPKTAVTTSKTTSWPTEVSHPTYPSQVTPQSQ 198
Query: 1970 PESESTTTSSPESESTTTSSLVSESTTTSS-PESESTTTISPVSESTTTSSPVSESTTTI 2028
P ++ T++ ++T + ++ TTTSS PE ++ S S + P S ++
Sbjct: 199 PATQGHQTATANQRLSSTEPVGTQGTTTSSNPEPQTEPPPSQRGPSGSPQHPPSTTSQDQ 258
Query: 2029 SPESESTTTSSPASESTTTNNPKS-ESTTTNNPASESITSSSPASESTTTSSPASESTTT 2087
S + + T+N +S ST T P ++ + P T T+ S +
Sbjct: 259 STTGDGQEHTQRRKTPPATSNRRSPHSTATPPPTTKRQETGRPTPRPTATTQSGSSPPHS 318
Query: 2088 SSPASESTTTSSPASESTTTSSPESES 2114
S P ++ T+ + P+ S
Sbjct: 319 SPPGVQANPTTQNLVDCKELDPPKPNS 345
Score = 58.9 bits (142), Expect = 2e-08
Identities = 37/187 (19%), Positives = 70/187 (37%), Gaps = 10/187 (5%)
Query: 1909 NPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTS 1968
+ +T+ ++ + + S+ T S P ++ T++ ++T + ++ TTTS
Sbjct: 168 PKTAVTTSKTTSWPTEVSHPTYPSQVTPQSQPATQGHQTATANQRLSSTEPVGTQGTTTS 227
Query: 1969 S-PESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTT 2027
S PE ++ S S + +T+S + STT + P + + +
Sbjct: 228 SNPEPQTEPPPSQRGPSGSP----QHPPSTTSQDQ-STTGDGQEHTQRRKTPPATSNRRS 282
Query: 2028 ISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTT 2087
ST T P ++ T P T T S SS P ++ T+ +
Sbjct: 283 P----HSTATPPPTTKRQETGRPTPRPTATTQSGSSPPHSSPPGVQANPTTQNLVDCKEL 338
Query: 2088 SSPASES 2094
P S
Sbjct: 339 DPPKPNS 345
Score = 58.5 bits (141), Expect = 2e-08
Identities = 41/174 (23%), Positives = 68/174 (39%), Gaps = 4/174 (2%)
Query: 1896 TTNSPESESTTTNNP--ESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSS-PES 1952
TT+ S T ++P S+ T S P ++ T++ ++T ++ TTTSS PE
Sbjct: 173 TTSKTTSWPTEVSHPTYPSQVTPQSQPATQGHQTATANQRLSSTEPVGTQGTTTSSNPEP 232
Query: 1953 ESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPES-ESTTTISPV 2011
++ S S + P S ++ S + + TS+ S ST T P
Sbjct: 233 QTEPPPSQRGPSGSPQHPPSTTSQDQSTTGDGQEHTQRRKTPPATSNRRSPHSTATPPPT 292
Query: 2012 SESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESI 2065
++ T P T T S +S P ++ T + + P SI
Sbjct: 293 TKRQETGRPTPRPTATTQSGSSPPHSSPPGVQANPTTQNLVDCKELDPPKPNSI 346
Score = 55.1 bits (132), Expect = 3e-07
Identities = 52/295 (17%), Positives = 101/295 (34%), Gaps = 29/295 (9%)
Query: 1851 ISMLAATAVAISVIDNYSEIIFTTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNP 1910
++ A++IS+ + + T T S N TTT++ +TTT +
Sbjct: 39 LTGTTTIALSISISVEQAVLSDCTTYLRNGTTSGSLSNP---TRTTTST----ATTTRDI 91
Query: 1911 ESESTTTSSPESESTTTSSLV-----SESTTT--------------SSPESESTTTSSPE 1951
TT + + ES + + S S P +
Sbjct: 92 RGLQTTRTR-KLESCSNVQIAYGDMHDRSNPVLGGIDCLGLLALCESGPICQRDYNPRDR 150
Query: 1952 SESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPV 2011
+ T S + +T+ ++ + + S+ T S P ++ T +
Sbjct: 151 PKCRCTLRGKDVSCCKEPKTAVTTSKTTSWPTEVSHPTYPSQVTPQSQPATQGHQTATAN 210
Query: 2012 SESTTTSSPVSESTTT-ISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSP 2070
++T ++ TTT +PE ++ S S + +P S ++ + + +
Sbjct: 211 QRLSSTEPVGTQGTTTSSNPEPQTEPPPSQRGPSGSPQHPPSTTSQDQSTTGDGQEHTQR 270
Query: 2071 ASESTTTSSPAS-ESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASES 2124
TS+ S ST T P ++ T P T T+ S +S P ++
Sbjct: 271 RKTPPATSNRRSPHSTATPPPTTKRQETGRPTPRPTATTQSGSSPPHSSPPGVQA 325
Score = 53.1 bits (127), Expect = 1e-06
Identities = 36/175 (20%), Positives = 60/175 (34%), Gaps = 4/175 (2%)
Query: 1954 STTTSSLVSESTTTSSP--ESESTTTSSPES--ESTTTSSLVSESTTTSSPESESTTTIS 2009
+ TTS S T S P S+ T S P + T T++ ST + +T++
Sbjct: 171 AVTTSKTTSWPTEVSHPTYPSQVTPQSQPATQGHQTATANQRLSSTEPVGTQGTTTSSNP 230
Query: 2010 PVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSS 2069
S + P + S S+ T K+ T+N + S +
Sbjct: 231 EPQTEPPPSQRGPSGSPQHPPSTTSQDQSTTGDGQEHTQRRKTPPATSNRRSPHSTATPP 290
Query: 2070 PASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASES 2124
P ++ T P T T+ S +S P ++ T+ + P S
Sbjct: 291 PTTKRQETGRPTPRPTATTQSGSSPPHSSPPGVQANPTTQNLVDCKELDPPKPNS 345
Score = 49.3 bits (117), Expect = 2e-05
Identities = 41/177 (23%), Positives = 63/177 (35%), Gaps = 8/177 (4%)
Query: 1991 VSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNP 2050
V+ S TTS P S T S+ T S P ++ T + ++T ++ TTT
Sbjct: 172 VTTSKTTSWPTEVSHPTYP--SQVTPQSQPATQGHQTATANQRLSSTEPVGTQGTTT--S 227
Query: 2051 KSESTTTNNPASESITSSSPASESTTTS----SPASESTTTSSPASESTTTSSPASESTT 2106
+ T P S+ S SP +TTS + T + T++ + ST
Sbjct: 228 SNPEPQTEPPPSQRGPSGSPQHPPSTTSQDQSTTGDGQEHTQRRKTPPATSNRRSPHSTA 287
Query: 2107 TSSPESESTTTSSPASESTTIEEQGVSPHSEKLSANEDPEEFPNEDVFEHTFAEIPN 2163
T P ++ T P T + G SP + N + PN
Sbjct: 288 TPPPTTKRQETGRPTPRPTATTQSGSSPPHSSPPGVQANPTTQNLVDCKELDPPKPN 344
Score = 43.5 bits (102), Expect = 0.001
Identities = 25/131 (19%), Positives = 50/131 (38%), Gaps = 2/131 (1%)
Query: 2018 SSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTT 2077
S P+ + + T S + +T+ ++ + S+ T
Sbjct: 137 SGPICQRDYNPRDRPKCRCTLRGKDVSCCKEPKTAVTTSKTTSWPTEVSHPTYPSQVTPQ 196
Query: 2078 SSPASESTTTSSPASESTTTSSPASESTTTSS-PESESTTTSSPASESTTIEEQGVSPHS 2136
S PA++ T++ ++T ++ TTTSS PE ++ S S + + + S
Sbjct: 197 SQPATQGHQTATANQRLSSTEPVGTQGTTTSSNPEPQTEPPPSQRGPSGSPQHPPSTT-S 255
Query: 2137 EKLSANEDPEE 2147
+ S D +E
Sbjct: 256 QDQSTTGDGQE 266
Score = 42.3 bits (99), Expect = 0.002
Identities = 32/127 (25%), Positives = 51/127 (40%), Gaps = 8/127 (6%)
Query: 2009 SPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASE-SITS 2067
+ V+ S TTS P S T S+ T S PA++ T ++T ++ + TS
Sbjct: 170 TAVTTSKTTSWPTEVSHPTY--PSQVTPQSQPATQGHQTATANQRLSSTEPVGTQGTTTS 227
Query: 2068 SSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTI 2127
S+P ++ S S + P S T+S STT E + PA+ +
Sbjct: 228 SNPEPQTEPPPSQRGPSGSPQHPPS----TTSQDQ-STTGDGQEHTQRRKTPPATSNRRS 282
Query: 2128 EEQGVSP 2134
+P
Sbjct: 283 PHSTATP 289
Score = 40.4 bits (94), Expect = 0.010
Identities = 30/161 (18%), Positives = 51/161 (31%), Gaps = 10/161 (6%)
Query: 1994 STTTSSPESESTTTISPV--SESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPK 2051
+ TTS S T P S+ T S P + + T T++ ST +
Sbjct: 171 AVTTSKTTSWPTEVSHPTYPSQVTPQSQP--------ATQGHQTATANQRLSSTEPVGTQ 222
Query: 2052 SESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPE 2111
+T++N S + P++ S S+ T + T++
Sbjct: 223 GTTTSSNPEPQTEPPPSQRGPSGSPQHPPSTTSQDQSTTGDGQEHTQRRKTPPATSNRRS 282
Query: 2112 SESTTTSSPASESTTIEEQGVSPHSEKLSANEDPEEFPNED 2152
ST T P ++ P + S + P P
Sbjct: 283 PHSTATPPPTTKRQETGRPTPRPTATTQSGSSPPHSSPPGV 323
Score = 39.3 bits (91), Expect = 0.023
Identities = 30/130 (23%), Positives = 54/130 (41%), Gaps = 8/130 (6%)
Query: 1878 SESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTT 1937
+ + +S+ + ++ TTT+S T P S+ + SP+ +TTS + +TT
Sbjct: 207 ATANQRLSSTEPVGTQGTTTSSNPEPQTEP--PPSQRGPSGSPQHPPSTTSQ---DQSTT 261
Query: 1938 SSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTT 1997
+ + +P + S S ST T P ++ T P T T+ S +
Sbjct: 262 GDGQEHTQRRKTPPATSNRRSPH---STATPPPTTKRQETGRPTPRPTATTQSGSSPPHS 318
Query: 1998 SSPESESTTT 2007
S P ++ T
Sbjct: 319 SPPGVQANPT 328
Score = 33.1 bits (75), Expect = 1.8
Identities = 28/157 (17%), Positives = 52/157 (33%), Gaps = 10/157 (6%)
Query: 1888 NSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTT 1947
+ T T + ST + +T++ +PE ++ S S + P +T
Sbjct: 199 PATQGHQTATANQRLSSTEPVGTQGTTTSS-NPEPQTEPPPSQRGPSGSPQHP----PST 253
Query: 1948 SSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTT 2007
+S + +T T ++ T++ ST T ++ T P T T
Sbjct: 254 TSQDQSTTGDG-----QEHTQRRKTPPATSNRRSPHSTATPPPTTKRQETGRPTPRPTAT 308
Query: 2008 ISPVSESTTTSSPVSESTTTISPESESTTTSSPASES 2044
S +S P ++ T + P S
Sbjct: 309 TQSGSSPPHSSPPGVQANPTTQNLVDCKELDPPKPNS 345
Score = 32.3 bits (73), Expect = 3.6
Identities = 20/104 (19%), Positives = 38/104 (36%), Gaps = 2/104 (1%)
Query: 2084 STTTSSPASESTTTSSPA--SESTTTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSA 2141
+ TTS S T S P S+ T S P ++ T++ ++ E G + +
Sbjct: 171 AVTTSKTTSWPTEVSHPTYPSQVTPQSQPATQGHQTATANQRLSSTEPVGTQGTTTSSNP 230
Query: 2142 NEDPEEFPNEDVFEHTFAEIPNIDHSNQTDEAIPETFDAREEWP 2185
E P++ + P+ +Q+ + R + P
Sbjct: 231 EPQTEPPPSQRGPSGSPQHPPSTTSQDQSTTGDGQEHTQRRKTP 274
>gnl|CDD|218597 pfam05466, BASP1, Brain acid soluble protein 1 (BASP1 protein). This
family consists of several brain acid soluble protein 1
(BASP1) or neuronal axonal membrane protein NAP-22. The
BASP1 is a neuron enriched Ca(2+)-dependent
calmodulin-binding protein of unknown function.
Length = 233
Score = 56.8 bits (136), Expect = 2e-08
Identities = 39/201 (19%), Positives = 85/201 (42%), Gaps = 17/201 (8%)
Query: 1938 SSPESESTTTSSPESESTTTSSLVSEST-TTSSPESESTTTSSPESESTTTSSLVSESTT 1996
++ E E T + E+++ ++ V E+ +++ T + E E ++ E
Sbjct: 28 AATEEEGTPKENEEAQAAAETTEVKEAKEEKPDKDAQDTANKTEEKEGEKEAAAAKEEAP 87
Query: 1997 TSSPESE---STTTISPVSESTTTSSPVSESTTTISPE----SESTTTSSPASESTTTNN 2049
+ PE + P S P + E SE+++ + ++
Sbjct: 88 KAEPEKTEGAAEAKAEPPKASDPEQEPAAAPGPAAGGEAPKASEASSQPAESAAPAKEEE 147
Query: 2050 PKSE---STTTNNPAS---ESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASE 2103
E + T PA+ E+ + ++PAS+S +SS A+ S+ + A+E+ S ++
Sbjct: 148 KSKEEGEAKKTEAPAAAAQETKSDAAPASDSKPSSSEAAPSSKETPAATEA---PSSTAK 204
Query: 2104 STTTSSPESESTTTSSPASES 2124
++ ++P E + +PA+ S
Sbjct: 205 ASAPAAPAEEVKPSEAPAANS 225
Score = 53.3 bits (127), Expect = 3e-07
Identities = 37/201 (18%), Positives = 83/201 (41%), Gaps = 3/201 (1%)
Query: 1909 NPESESTTTSSPESESTTTSSLVSEST-TTSSPESESTTTSSPESESTTTSSLVSESTTT 1967
E E T + E+++ ++ V E+ +++ T + E E ++ E
Sbjct: 29 ATEEEGTPKENEEAQAAAETTEVKEAKEEKPDKDAQDTANKTEEKEGEKEAAAAKEEAPK 88
Query: 1968 SSPE-SESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTT 2026
+ PE +E + E + + + E+ S+ +++P E
Sbjct: 89 AEPEKTEGAAEAKAEPPKASDPEQEPAAAPGPAAGGEAPKASEASSQPAESAAPAKEEEK 148
Query: 2027 TISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTT 2086
+ E E+ T +PA+ + T + + ++ + +SE+ SS +T S ++++
Sbjct: 149 S-KEEGEAKKTEAPAAAAQETKSDAAPASDSKPSSSEAAPSSKETPAATEAPSSTAKASA 207
Query: 2087 TSSPASESTTTSSPASESTTT 2107
++PA E + +PA+ S T
Sbjct: 208 PAAPAEEVKPSEAPAANSDQT 228
Score = 43.7 bits (102), Expect = 5e-04
Identities = 34/186 (18%), Positives = 74/186 (39%), Gaps = 14/186 (7%)
Query: 1968 SSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTT 2027
++ E E T + E+++ ++ V E+ P+ ++ T + E +
Sbjct: 28 AATEEEGTPKENEEAQAAAETTEVKEAKE-EKPDKDAQDTANKTEEKEGEKEAAAAKEEA 86
Query: 2028 ISPESESTTTSSPA-SESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTT 2086
E E T ++ A +E ++P+ E PA+ + ASE+++ + ++
Sbjct: 87 PKAEPEKTEGAAEAKAEPPKASDPEQEPAAAPGPAAGG--EAPKASEASSQPAESAAPAK 144
Query: 2087 TSSPASE---STTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSANE 2143
+ E + T +PA+ + E+ + ++PAS+S + E +A E
Sbjct: 145 EEEKSKEEGEAKKTEAPAAAA-------QETKSDAAPASDSKPSSSEAAPSSKETPAATE 197
Query: 2144 DPEEFP 2149
P
Sbjct: 198 APSSTA 203
>gnl|CDD|237863 PRK14949, PRK14949, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 944
Score = 59.4 bits (144), Expect = 3e-08
Identities = 53/343 (15%), Positives = 113/343 (32%), Gaps = 49/343 (14%)
Query: 1887 LNSLLSENTTTN----SPESESTTTNNPESESTTTS--SPESESTTTSSLVSEST----- 1935
+ L+E TT + +E+ + +E T + + ES ++L +E
Sbjct: 407 KKTALTEQTTAQQQVQAANAEAVAEADASAEPADTVEQALDDESELLAALNAEQAVILSQ 466
Query: 1936 ---------------TTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSP 1980
++ PE +T + T + V +++ +++ +++T +
Sbjct: 467 AQSQGFEASSSLDADNSAVPEQIDSTAEQSVVNPSVTDTQVDDTSASNNSAADNTVDDNY 526
Query: 1981 ESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSP 2040
+E T S+ + E V+ S+ + + +S+ + + + +
Sbjct: 527 SAEDTLESNGLDEGDYAQDSAPLDAYQDDYVAFSSESYNALSDDEQHSANVQSAQSAAEA 586
Query: 2041 ASESTTTNNPKSESTTTNNPASESITS---------------SSPASESTTTSSPASEST 2085
S + + + +T + A + I SP SS +
Sbjct: 587 QPSSQSLSPISAVTTAAASLADDDILDAVLAARDSLLSDLDALSPKEGDGKKSSADRKPK 646
Query: 2086 TTSS---PASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSAN 2142
T S PAS S SSP + T+ S ++ S G +P +
Sbjct: 647 TPPSRAPPASLSKPASSPDASQTSASFDLDPDFELATHQSVPEAALASGSAPAPPPVPDP 706
Query: 2143 ED--PEEFPNEDVFEHTFAEIPNIDHSNQTDEAIPETFDAREE 2183
D P E E + PN E++ + ++ +
Sbjct: 707 YDRPPWEEAPEVASA---NDGPNNAAEGNLSESVEDASNSELQ 746
Score = 54.7 bits (132), Expect = 6e-07
Identities = 32/233 (13%), Positives = 83/233 (35%), Gaps = 21/233 (9%)
Query: 1912 SESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESEST-TTSSLVSESTTTSSP 1970
E T S+ + + + + E ++ T ++ ++ + + +S
Sbjct: 377 PEGQTPSALAAAVQAPHANEPQFVNAAPAEKKTALTEQTTAQQQVQAANAEAVAEADASA 436
Query: 1971 ESESTTTSSPESESTTTSSLVSEST--------------------TTSSPESESTTTISP 2010
E T + + ES ++L +E ++ PE +T
Sbjct: 437 EPADTVEQALDDESELLAALNAEQAVILSQAQSQGFEASSSLDADNSAVPEQIDSTAEQS 496
Query: 2011 VSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSP 2070
V + T + V +++ + + +++T + ++E T +N E + A
Sbjct: 497 VVNPSVTDTQVDDTSASNNSAADNTVDDNYSAEDTLESNGLDEGDYAQDSAPLDAYQDDY 556
Query: 2071 ASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASE 2123
+ S+ + + S+ S+ + + + S + S + +T +S A +
Sbjct: 557 VAFSSESYNALSDDEQHSANVQSAQSAAEAQPSSQSLSPISAVTTAAASLADD 609
Score = 37.0 bits (86), Expect = 0.18
Identities = 23/193 (11%), Positives = 58/193 (30%), Gaps = 13/193 (6%)
Query: 1983 ESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPAS 2042
+ SL T ++ + +E ++ +E T TT+
Sbjct: 369 DDPAEISLPEGQTPSALAAAVQAP---HANEPQFVNAAPAEKKT----ALTEQTTAQQQV 421
Query: 2043 ESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPAS 2102
++ + + + ES ++ +E S A +S +
Sbjct: 422 QAANAEAVAEADASAEPADTVE---QALDDESELLAALNAEQAVILSQAQSQGFEASSSL 478
Query: 2103 ESTTTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSANEDPEEFPNEDVFEHTFAEIP 2162
++ ++ PE +T + + Q + S N + +++ E
Sbjct: 479 DADNSAVPEQIDSTAEQSVVNPSVTDTQVDDTSA---SNNSAADNTVDDNYSAEDTLESN 535
Query: 2163 NIDHSNQTDEAIP 2175
+D + ++ P
Sbjct: 536 GLDEGDYAQDSAP 548
>gnl|CDD|173611 PTZ00421, PTZ00421, coronin; Provisional.
Length = 493
Score = 58.8 bits (142), Expect = 3e-08
Identities = 46/189 (24%), Positives = 84/189 (44%), Gaps = 12/189 (6%)
Query: 346 GRTSNLFAPIMLLEGHGGEIFCSKYHPDGQ-YIASSGYDRQIFIWSV-YGECENIGVMSG 403
G T N+ PI+ L+GH ++ +HP +AS+G D + +W V G+ V+
Sbjct: 109 GLTQNISDPIVHLQGHTKKVGIVSFHPSAMNVLASAGADMVVNVWDVERGKAVE--VIKC 166
Query: 404 HTGAVMDLKFSTDGCHIFTCSTDQTLAVWDLEKGQRIKKMKGH-STFVNSCDPVRRGQLL 462
H+ + L+++ DG + T S D+ L + D G + ++ H S C +R L+
Sbjct: 167 HSDQITSLEWNLDGSLLCTTSKDKKLNIIDPRDGTIVSSVEAHASAKSQRCLWAKRKDLI 226
Query: 463 IASG---SDDCTVKVWDPRKKNQAVSMNNTYQVTSVAF----NDTAECVLTGGIDNDIKM 515
I G S + +WD RK S + Q +++ DT + + +I+
Sbjct: 227 ITLGCSKSQQRQIMLWDTRKMASPYSTVDLDQSSALFIPFFDEDTNLLYIGSKGEGNIRC 286
Query: 516 WDLRTNSVV 524
++L +
Sbjct: 287 FELMNERLT 295
Score = 54.9 bits (132), Expect = 4e-07
Identities = 54/215 (25%), Positives = 86/215 (40%), Gaps = 50/215 (23%)
Query: 403 GHTGAVMDLKFST-DGCHIFTCSTDQTLAVWDL-EKGQRIKKMKGHSTFVNSCDPVRRGQ 460
G G ++D+ F+ D +FT S D T+ W + E+G
Sbjct: 73 GQEGPIIDVAFNPFDPQKLFTASEDGTIMGWGIPEEG----------------------- 109
Query: 461 LLIASGSDDCTVKVWDPRKKNQAVSMNNTYQVTSVAFNDTAECVL-TGGIDNDIKMWDLR 519
+ D V + KK V V+F+ +A VL + G D + +WD+
Sbjct: 110 --LTQNISDPIVHLQGHTKK-----------VGIVSFHPSAMNVLASAGADMVVNVWDVE 156
Query: 520 TNSVVQKLRGHSDTVTGLSLSPDGSYILSNAMDNTVRIWDIRPYVPGERCVKVMSGHQHN 579
V+ ++ HSD +T L + DGS + + + D + I D R V S H
Sbjct: 157 RGKAVEVIKCHSDQITSLEWNLDGSLLCTTSKDKKLNIIDPR------DGTIVSSVEAHA 210
Query: 580 FEKNLLRCAWSV-SGLYVTAG---SADKCVYIWDT 610
K+ RC W+ L +T G S + + +WDT
Sbjct: 211 SAKS-QRCLWAKRKDLIITLGCSKSQQRQIMLWDT 244
Score = 46.0 bits (109), Expect = 2e-04
Identities = 44/180 (24%), Positives = 78/180 (43%), Gaps = 17/180 (9%)
Query: 354 PIMLLEGHGGEIFCSKYHP-DGQYIASSGYDRQIFIWSVYGE------CENIGVMSGHTG 406
PI+L G G I ++P D Q + ++ D I W + E + I + GHT
Sbjct: 69 PILL--GQEGPIIDVAFNPFDPQKLFTASEDGTIMGWGIPEEGLTQNISDPIVHLQGHTK 126
Query: 407 AVMDLKFSTDGCHIF-TCSTDQTLAVWDLEKGQRIKKMKGHSTFVNSCDPVRRGQLLIAS 465
V + F ++ + D + VWD+E+G+ ++ +K HS + S + G LL +
Sbjct: 127 KVGIVSFHPSAMNVLASAGADMVVNVWDVERGKAVEVIKCHSDQITSLEWNLDGSLL-CT 185
Query: 466 GSDDCTVKVWDPRKKN--QAVSMNNTYQVTSVAFNDTAECVLTGGIDN----DIKMWDLR 519
S D + + DPR +V + + + + + ++T G I +WD R
Sbjct: 186 TSKDKKLNIIDPRDGTIVSSVEAHASAKSQRCLWAKRKDLIITLGCSKSQQRQIMLWDTR 245
Score = 43.3 bits (102), Expect = 0.002
Identities = 34/128 (26%), Positives = 57/128 (44%), Gaps = 7/128 (5%)
Query: 1120 LLAHEDSVTGVTFVP-KTHYFFTTSKDGRVKQWD--ADNFERIVTLHFFISLYGH--KLP 1174
LL E + V F P FT S+DG + W + + ++ + L GH K+
Sbjct: 71 LLGQEGPIIDVAFNPFDPQKLFTASEDGTIMGWGIPEEGLTQNISDPI-VHLQGHTKKVG 129
Query: 1175 VLSLDMSYDSTLIATGSGDRTVKVWGLDYGDCHKSLLAHEDSVTGVTFVPKTHYFFTTSK 1234
++S S + L + G+ D V VW ++ G + + H D +T + + TTSK
Sbjct: 130 IVSFHPSAMNVLASAGA-DMVVNVWDVERGKAVEVIKCHSDQITSLEWNLDGSLLCTTSK 188
Query: 1235 DGRVKQWD 1242
D ++ D
Sbjct: 189 DKKLNIID 196
Score = 39.1 bits (91), Expect = 0.031
Identities = 33/130 (25%), Positives = 58/130 (44%), Gaps = 13/130 (10%)
Query: 115 GHKSAITVIQYDPL-GHRLATGSKDTDIVLWDVVAECGLHR--------LSGHKGVITDI 165
G + I + ++P +L T S+D I+ W + E GL + L GH + +
Sbjct: 73 GQEGPIIDVAFNPFDPQKLFTASEDGTIMGWGIPEE-GLTQNISDPIVHLQGHTKKVGIV 131
Query: 166 RFMSQPGHHFVVSSA-KDTFVKIWDADTGDCFKTMAAHLTEVWGVCVMREDSYLISGSND 224
F P V++SA D V +WD + G + + H ++ + + S L + S D
Sbjct: 132 SF--HPSAMNVLASAGADMVVNVWDVERGKAVEVIKCHSDQITSLEWNLDGSLLCTTSKD 189
Query: 225 AELKVWNVRD 234
+L + + RD
Sbjct: 190 KKLNIIDPRD 199
Score = 32.6 bits (74), Expect = 3.2
Identities = 22/79 (27%), Positives = 38/79 (48%), Gaps = 3/79 (3%)
Query: 1076 ISLYGH--KLPVLSLDMSYDSTLIATGSGDRTVKVWGLDYGDCHKSLLAHEDSVTGVTFV 1133
+ L GH K+ ++S S + L + G+ D V VW ++ G + + H D +T + +
Sbjct: 119 VHLQGHTKKVGIVSFHPSAMNVLASAGA-DMVVNVWDVERGKAVEVIKCHSDQITSLEWN 177
Query: 1134 PKTHYFFTTSKDGRVKQWD 1152
TTSKD ++ D
Sbjct: 178 LDGSLLCTTSKDKKLNIID 196
Score = 32.6 bits (74), Expect = 3.2
Identities = 22/79 (27%), Positives = 38/79 (48%), Gaps = 3/79 (3%)
Query: 1350 ISLYGH--KLPVLSLDMSYDSTLIATGSGDRTVKVWGLDYGDCHKSLLAHEDSVTGVTFV 1407
+ L GH K+ ++S S + L + G+ D V VW ++ G + + H D +T + +
Sbjct: 119 VHLQGHTKKVGIVSFHPSAMNVLASAGA-DMVVNVWDVERGKAVEVIKCHSDQITSLEWN 177
Query: 1408 PKTHYFFTTSKDGRVKQWD 1426
TTSKD ++ D
Sbjct: 178 LDGSLLCTTSKDKKLNIID 196
>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family. Atrophin-1 is the protein
product of the dentatorubral-pallidoluysian atrophy
(DRPLA) gene. DRPLA OMIM:125370 is a progressive
neurodegenerative disorder. It is caused by the expansion
of a CAG repeat in the DRPLA gene on chromosome 12p. This
results in an extended polyglutamine region in
atrophin-1, that is thought to confer toxicity to the
protein, possibly through altering its interactions with
other proteins. The expansion of a CAG repeat is also the
underlying defect in six other neurodegenerative
disorders, including Huntington's disease. One
interaction of expanded polyglutamine repeats that is
thought to be pathogenic is that with the short glutamine
repeat in the transcriptional coactivator CREB binding
protein, CBP. This interaction draws CBP away from its
usual nuclear location to the expanded polyglutamine
repeat protein aggregates that are characteristic of the
polyglutamine neurodegenerative disorders. This
interferes with CBP-mediated transcription and causes
cytotoxicity.
Length = 979
Score = 58.9 bits (142), Expect = 3e-08
Identities = 51/256 (19%), Positives = 99/256 (38%), Gaps = 20/256 (7%)
Query: 1884 MSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESE 1943
MSTL S T SP+ ++ TN + S+ +SP + ST+++ +EST + + +
Sbjct: 14 MSTLRS--GRKKQTASPDGRASPTNE-DQRSSGRNSPSAASTSSNDSKAESTKKPNKKIK 70
Query: 1944 STTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESE 2003
TS +S + E + + E E T +++ + + SE E E
Sbjct: 71 EEATSPLKS-----TKRQREKPASDTEEPERVTAKKSKTQELSRPNSPSEGEGEGEGEGE 125
Query: 2004 STTTISPVSESTTTSSPVSESTTTISPESESTTTSSPA---SESTTTNNPKSESTTTNNP 2060
S S+S + + S I ++ S++ S P+ +ES + ++ + + P
Sbjct: 126 S-------SDSRSVNEEGSSDPKDIDQDNRSSSPSIPSPQDNESDSDSSAQQQLLQPQGP 178
Query: 2061 ASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTT--SSPESESTTTS 2118
S + + + S +P++++ + P S + S+P
Sbjct: 179 PSIQVPPGAALAPSAPPPTPSAQAVPPQGSPIAAQPAPQPQQPSPLSLISAPSLHPQRLP 238
Query: 2119 SPASESTTIEEQGVSP 2134
SP SP
Sbjct: 239 SPHPPLQPQTASQQSP 254
Score = 50.5 bits (120), Expect = 1e-05
Identities = 44/218 (20%), Positives = 87/218 (39%), Gaps = 7/218 (3%)
Query: 1937 TSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTT 1996
T+SP+ ++ T+ + S S + ST+++ ++EST + + + TS L S
Sbjct: 25 TASPDGRASPTNEDQRSSGRNSP-SAASTSSNDSKAESTKKPNKKIKEEATSPLKSTKRQ 83
Query: 1997 TSSPESESTTTISPVSESTTT---SSPVSESTTTISPESESTTTSSPASESTTTNNPKSE 2053
P S++ ++ + T S P S S E E ++ S + +++PK +
Sbjct: 84 REKPASDTEEPERVTAKKSKTQELSRPNSPSEGEGEGEGEGESSDSRSVNEEGSSDPK-D 142
Query: 2054 STTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESE 2113
N +S SI S +ES + SS + P S + + S +P ++
Sbjct: 143 IDQDNRSSSPSIPSPQD-NESDSDSSAQQQLLQPQGPPSIQVPPGAALAPSAPPPTPSAQ 201
Query: 2114 STTTSSPASESTTIEEQGVSPHSEKLSA-NEDPEEFPN 2150
+ + + +SA + P+ P+
Sbjct: 202 AVPPQGSPIAAQPAPQPQQPSPLSLISAPSLHPQRLPS 239
Score = 47.0 bits (111), Expect = 1e-04
Identities = 45/242 (18%), Positives = 86/242 (35%), Gaps = 14/242 (5%)
Query: 1895 TTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESES 1954
+T E ++ T PE + S + S S SE E ES+ + S E
Sbjct: 79 STKRQREKPASDTEEPERVTAKKSKTQELSRPNSP--SEGEGEGEGEGESSDSRSVNEEG 136
Query: 1955 TTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSES 2014
++ + + +SSP + S ++ES + SS + P S + P +
Sbjct: 137 SSDPKDIDQDNRSSSP----SIPSPQDNESDSDSSAQQQLLQPQGPPSIQ---VPPGAAL 189
Query: 2015 TTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASES 2074
++ P + S + P+ S + PA + + S + +P + + S P +
Sbjct: 190 APSAPPPTPSAQAVPPQG-SPIAAQPAPQPQQPSPLSLISAPSLHP--QRLPSPHPPLQP 246
Query: 2075 TTTS--SPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGV 2132
T S SP + ++ P S P + + +++ P +
Sbjct: 247 QTASQQSPQPPAPSSRHPQSSHHGPGPPMPHALQQGPVFLQHPSSNPPQPFGLAQSQVPP 306
Query: 2133 SP 2134
P
Sbjct: 307 LP 308
Score = 44.7 bits (105), Expect = 7e-04
Identities = 42/253 (16%), Positives = 77/253 (30%), Gaps = 19/253 (7%)
Query: 1893 ENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSP-- 1950
E T +++ + N SE E ES+ + S+ E ++ + +SSP
Sbjct: 95 ERVTAKKSKTQELSRPNSPSEGEGEGEGEGESSDSRSVNEEGSSDPKDIDQDNRSSSPSI 154
Query: 1951 ----ESESTTTSSLVSESTTTSSPESE---STTTSSPESESTTTSSLVSESTTTSSPESE 2003
++ES + SS + P S +P + T S+ + P
Sbjct: 155 PSPQDNESDSDSSAQQQLLQPQGPPSIQVPPGAALAPSAPPPTPSA-------QAVPPQG 207
Query: 2004 STTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASE 2063
S P + S S ++ P+ + P T + ++
Sbjct: 208 SPIAAQPAPQPQQPSPLSLISAPSLHPQ-RLPSPHPPLQPQTASQQSPQPPAPSSRHPQS 266
Query: 2064 SITSSSPASESTTTSSPASESTTTSSPAS--ESTTTSSPASESTTTSSPESESTTTSSPA 2121
S P P +S+P + P + + P S + + S
Sbjct: 267 SHHGPGPPMPHALQQGPVFLQHPSSNPPQPFGLAQSQVPPLPLPSQAQPHSHTPPSQSAL 326
Query: 2122 SESTTIEEQGVSP 2134
EQ + P
Sbjct: 327 QPQQPPREQPLPP 339
Score = 33.5 bits (76), Expect = 1.8
Identities = 25/146 (17%), Positives = 42/146 (28%), Gaps = 10/146 (6%)
Query: 1910 PESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSS 1969
P+S+ + + T S + +T + +S +
Sbjct: 410 PQSQPLQSVPAQPPVLTQSQSLPPKASTHPHSGLHSGPPQSPFAQHPFTSGGLPAIGPPP 469
Query: 1970 PESESTTTSSPESESTTTSSL-VSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTI 2028
ST + P + S + S+ + I E P+ E+
Sbjct: 470 SLPTSTPAAPPRASSGSQPPGSALPSSGGCAGPGPPLPPIQIKEE------PLDEAE--- 520
Query: 2029 SPESESTTTSSPASESTTTNNPKSES 2054
PES SP+ E T N P S
Sbjct: 521 EPESPPPPPRSPSPEPTVVNTPSHAS 546
>gnl|CDD|226406 COG3889, COG3889, Predicted solute binding protein [General function
prediction only].
Length = 872
Score = 59.1 bits (143), Expect = 3e-08
Identities = 29/115 (25%), Positives = 52/115 (45%), Gaps = 3/115 (2%)
Query: 1895 TTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESES 1954
+ P E+ + T TS S + T P+S + T ++ +
Sbjct: 732 SLEVFPAGENWGFIPTTKRVKVRIMDPASGTGTSITTSGTFTAEVPQSPTKTETTLSYSA 791
Query: 1955 TTTSSLVSESTTTSSPESE---STTTSSPESESTTTSSLVSESTTTSSPESESTT 2006
+ +S++ E+T+ ++ TTTSSP TT+ + S STTT++ S++TT
Sbjct: 792 YSNTSILIETTSVVITKTVTQTQTTTSSPSPTQTTSPTQTSTSTTTTTSPSQTTT 846
Score = 57.6 bits (139), Expect = 8e-08
Identities = 27/97 (27%), Positives = 52/97 (53%), Gaps = 9/97 (9%)
Query: 1951 ESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISP 2010
+ T TS S + T P+S + T ++ + + +S++ E+T+ ++ + T
Sbjct: 758 PASGTGTSITTSGTFTAEVPQSPTKTETTLSYSAYSNTSILIETTSVVITKTVTQT---- 813
Query: 2011 VSESTTTSSP-VSESTTTISPESESTTTSSPASESTT 2046
TTTSSP +++T+ + +TTT+SP S++TT
Sbjct: 814 ---QTTTSSPSPTQTTSPTQTSTSTTTTTSP-SQTTT 846
Score = 57.6 bits (139), Expect = 9e-08
Identities = 37/146 (25%), Positives = 61/146 (41%), Gaps = 11/146 (7%)
Query: 1986 TTSSLVSESTTTSSPESESTTTIS----PVSESTTTSSPVSESTTTISPESESTTTSSPA 2041
TT S +++ T T S P E+ I + T TS
Sbjct: 709 TTLSSEAKNPDTVKIGQALTVYGSLEVFPAGENWGFIPTTKRVKVRIMDPASGTGTSITT 768
Query: 2042 SESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPA 2101
S + T P+S + T + + +++S E+T+ + + T TTTSSP+
Sbjct: 769 SGTFTAEVPQSPTKTETTLSYSAYSNTSILIETTSVVITKTVTQT-------QTTTSSPS 821
Query: 2102 SESTTTSSPESESTTTSSPASESTTI 2127
TT+ + S STTT++ S++TT
Sbjct: 822 PTQTTSPTQTSTSTTTTTSPSQTTTG 847
Score = 56.4 bits (136), Expect = 2e-07
Identities = 29/94 (30%), Positives = 48/94 (51%), Gaps = 7/94 (7%)
Query: 1933 ESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVS 1992
T TS S + T P+S + T ++L + + +S E+T+ ++ + T
Sbjct: 760 SGTGTSITTSGTFTAEVPQSPTKTETTLSYSAYSNTSILIETTSVVITKTVTQT------ 813
Query: 1993 ESTTTSSPESESTTTISPVSESTTTSSPVSESTT 2026
TTTSSP TT+ + S STTT++ S++TT
Sbjct: 814 -QTTTSSPSPTQTTSPTQTSTSTTTTTSPSQTTT 846
Score = 54.9 bits (132), Expect = 6e-07
Identities = 28/98 (28%), Positives = 49/98 (50%), Gaps = 7/98 (7%)
Query: 1901 ESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSL 1960
+ T T+ S + T P+S + T ++L + + +S E+T+ ++ + T
Sbjct: 758 PASGTGTSITTSGTFTAEVPQSPTKTETTLSYSAYSNTSILIETTSVVITKTVTQT---- 813
Query: 1961 VSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTS 1998
TTTSSP TT+ + S STTT++ S++TT
Sbjct: 814 ---QTTTSSPSPTQTTSPTQTSTSTTTTTSPSQTTTGG 848
Score = 53.7 bits (129), Expect = 1e-06
Identities = 26/89 (29%), Positives = 47/89 (52%), Gaps = 3/89 (3%)
Query: 1893 ENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESE---STTTSS 1949
T T+ S + T P+S + T ++ + + +S++ E+T+ ++ TTTSS
Sbjct: 760 SGTGTSITTSGTFTAEVPQSPTKTETTLSYSAYSNTSILIETTSVVITKTVTQTQTTTSS 819
Query: 1950 PESESTTTSSLVSESTTTSSPESESTTTS 1978
P TT+ + S STTT++ S++TT
Sbjct: 820 PSPTQTTSPTQTSTSTTTTTSPSQTTTGG 848
Score = 53.3 bits (128), Expect = 2e-06
Identities = 31/143 (21%), Positives = 59/143 (41%), Gaps = 8/143 (5%)
Query: 1914 STTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESE 1973
S+ +P++ + V S P E+ + T TS S
Sbjct: 712 SSEAKNPDTVKIGQALTVYGSLEVF-PAGENWGFIPTTKRVKVRIMDPASGTGTSITTSG 770
Query: 1974 STTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESE 2033
+ T P+S T T + +S S +++ TT++ T T + S + T +
Sbjct: 771 TFTAEVPQSP-TKTETTLSYSAYSNTSILIETTSVVITKTVTQTQTTTSSPSPT-----Q 824
Query: 2034 STTTSSPASESTTTNNPKSESTT 2056
+T+ + ++ +TTT +P S++TT
Sbjct: 825 TTSPTQTSTSTTTTTSP-SQTTT 846
Score = 51.8 bits (124), Expect = 5e-06
Identities = 32/145 (22%), Positives = 56/145 (38%), Gaps = 8/145 (5%)
Query: 1974 STTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESE 2033
S+ +P++ + V S P E+ I + T T S
Sbjct: 712 SSEAKNPDTVKIGQALTVYGSLEVF-PAGENWGFIPTTKRVKVRIMDPASGTGTSITTSG 770
Query: 2034 STTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASE 2093
+ T P S + T S +N TS T T + T++ SP ++
Sbjct: 771 TFTAEVPQSPTKTETTLSY-SAYSNTSILIETTSVVITKTVTQTQTT----TSSPSP-TQ 824
Query: 2094 STTTSSPASESTTTSSPESESTTTS 2118
+T+ + ++ +TTT+SP S++TT
Sbjct: 825 TTSPTQTSTSTTTTTSP-SQTTTGG 848
Score = 50.2 bits (120), Expect = 1e-05
Identities = 33/159 (20%), Positives = 58/159 (36%), Gaps = 10/159 (6%)
Query: 1940 PESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSS 1999
P + S ++ S V + S + T+ V
Sbjct: 700 PYTNSLYKATTLSSEAKNPDTVKIGQALTVYGSLEVFPAGENWGFIPTTKRV---KVRIM 756
Query: 2000 PESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNN 2059
+ T T S + T P S + T + + + +S E+T+ K+ + T
Sbjct: 757 DPASGTGTSITTSGTFTAEVPQSPTKTETTLSYSAYSNTSILIETTSVVITKTVTQT--- 813
Query: 2060 PASESITSSSPASESTTTSSPASESTTTSSPASESTTTS 2098
T+SSP+ TT+ + S STTT++ S++TT
Sbjct: 814 ----QTTTSSPSPTQTTSPTQTSTSTTTTTSPSQTTTGG 848
Score = 47.2 bits (112), Expect = 1e-04
Identities = 36/151 (23%), Positives = 60/151 (39%), Gaps = 18/151 (11%)
Query: 1966 TTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTT-------ISPVSESTTTS 2018
TT S E+++ T T SL + T I + T TS
Sbjct: 709 TTLSSEAKNPDTVKIGQALTVYGSL---EVFPAGENWGFIPTTKRVKVRIMDPASGTGTS 765
Query: 2019 SPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTS 2078
S + T P+S T T + S S +N TT+ + + T++
Sbjct: 766 ITTSGTFTAEVPQSP-TKTETTLSYSAYSNTSILIETTSVVITKTVTQTQTT----TSSP 820
Query: 2079 SPASESTTTSSPASESTTTSSPASESTTTSS 2109
SP +++T+ + ++ +TTT+SP+ TTT
Sbjct: 821 SP-TQTTSPTQTSTSTTTTTSPS--QTTTGG 848
Score = 41.8 bits (98), Expect = 0.005
Identities = 29/135 (21%), Positives = 50/135 (37%), Gaps = 15/135 (11%)
Query: 2006 TTISPVSESTTTSSPVSESTTTIS---PESESTTTSSPASESTTTNNPKSESTTTNNPAS 2062
TT+S +++ T T S + P ++ S T
Sbjct: 709 TTLSSEAKNPDTVKIGQALTVYGSLEVFPAGENWGFIPTTKRVKVRIMDPASGTGT---- 764
Query: 2063 ESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESE---STTTSS 2119
SIT+S T P S + T ++ + + + +S E+T+ ++ TTTSS
Sbjct: 765 -SITTSGT----FTAEVPQSPTKTETTLSYSAYSNTSILIETTSVVITKTVTQTQTTTSS 819
Query: 2120 PASESTTIEEQGVSP 2134
P+ TT Q +
Sbjct: 820 PSPTQTTSPTQTSTS 834
Score = 41.4 bits (97), Expect = 0.008
Identities = 23/77 (29%), Positives = 36/77 (46%), Gaps = 7/77 (9%)
Query: 1873 TTNNNSEST-VVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLV 1931
+ +E+T + N+ + TT+ T T TTTSSP TT+ +
Sbjct: 778 QSPTKTETTLSYSAYSNTSILIETTSVVITKTVTQT------QTTTSSPSPTQTTSPTQT 831
Query: 1932 SESTTTSSPESESTTTS 1948
S STTT++ S++TT
Sbjct: 832 STSTTTTTSPSQTTTGG 848
Score = 41.0 bits (96), Expect = 0.009
Identities = 18/74 (24%), Positives = 34/74 (45%), Gaps = 6/74 (8%)
Query: 1889 SLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTS 1948
+ LS + +N+ TT+ T T + S + T +T+ + + +TTT+
Sbjct: 785 TTLSYSAYSNTSILIETTSVVITKTVTQTQTTTSSPSPTQ-----TTSPTQTSTSTTTTT 839
Query: 1949 SPESESTTTSSLVS 1962
SP S++TT +
Sbjct: 840 SP-SQTTTGGGICG 852
>gnl|CDD|221121 pfam11489, DUF3210, Protein of unknown function (DUF3210). This is a
family of proteins conserved in yeasts. The function is
not known. The Schizosaccharomyces pombe member is
SPBC18E5.07 and the Saccharomyces cerevisiae member is
AIM21.
Length = 671
Score = 58.0 bits (140), Expect = 6e-08
Identities = 50/254 (19%), Positives = 83/254 (32%), Gaps = 21/254 (8%)
Query: 1885 STLNSLLSENTTTNSPESESTTTNNPESESTTTS--------------SPESESTTTSSL 1930
S L+S ++++ +SP ES E S SP E +
Sbjct: 284 SRLSSPAPDSSSFSSPSGESGLEEREAEEPILASDEVAKEPAGESPAVSPSFEREKSEKS 343
Query: 1931 VSESTTTSSPESESTTTSSPESESTTTSSLVS-ESTTTSSPESESTTTSSPESESTTTSS 1989
ES S S+ + + + L E PE ES P +E ++
Sbjct: 344 RHESDPKSRENSKPASIYGSVPDLIRHTPLEDVEEYEPLFPEDESEIAVKPPTEESSRRP 403
Query: 1990 LVSESTTTSSPESESTTTISPVSESTTTSSP-VSESTTTISPESESTTTSSPASESTTTN 2048
+ S E + S + ++ T S+P + +PE E++ +SS S +
Sbjct: 404 EEEKHRFPSEDVWEDSP--SSLQDTATVSTPSNPPPRASETPEQETSRSSSEVSLDPHQS 461
Query: 2049 NPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTS 2108
KSE S+ S E S TT E ++S ++ S
Sbjct: 462 ELKSEKKKARPEVSKQRFPSRDVWEDAPESQELV---TTEETPEEVKSSSPGVTKPAIPS 518
Query: 2109 SPESESTTTSSPAS 2122
P+ T+
Sbjct: 519 RPKKGKPTSEKRKP 532
Score = 49.9 bits (119), Expect = 2e-05
Identities = 48/245 (19%), Positives = 77/245 (31%), Gaps = 29/245 (11%)
Query: 1899 SPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPES-ESTTTSSPESESTTT 1957
SP E + ES S S+ + V + + E E PE ES
Sbjct: 332 SPSFEREKSEKSRHESDPKSRENSKPASIYGSVPDLIRHTPLEDVEEYEPLFPEDESEIA 391
Query: 1958 SSLVSESTTTSSPESESTTTSSPESESTTTSSLVSE--STTTSSPESESTTTISPVSEST 2015
+E ++ E + S E + +S + ST ++ P S T P E++
Sbjct: 392 VKPPTEESSRRPEEEKHRFPSEDVWEDSPSSLQDTATVSTPSNPPPRASET---PEQETS 448
Query: 2016 TTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASEST 2075
+SS VS +SE + A + S + P S+ + ++ E
Sbjct: 449 RSSSEVSLD----PHQSELKSEKKKARPEVSKQRFPSRDVWEDAPESQELVTTEETPEEV 504
Query: 2076 TTSSPASESTTT-SSPASESTTTSS-----------------PASESTTTSSPES-ESTT 2116
+SSP S P T+ PA + E+ S
Sbjct: 505 KSSSPGVTKPAIPSRPKKGKPTSEKRKPPPVPKKPKPQIPARPAKLQKQQAGEEANSSAF 564
Query: 2117 TSSPA 2121
P
Sbjct: 565 KPKPR 569
Score = 49.9 bits (119), Expect = 2e-05
Identities = 58/267 (21%), Positives = 88/267 (32%), Gaps = 42/267 (15%)
Query: 1908 NNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTT 1967
N E+ +T S S E ++PE ++ SSP +S++ SS ES
Sbjct: 247 NKIVRETASTGSGLGTSPEVDGTPEEQVGYTAPEEYTSRLSSPAPDSSSFSSPSGESGLE 306
Query: 1968 SSPESESTTTS---SPESES-------TTTSSLVSESTTTSSPESESTTTISPVSESTTT 2017
E S + E + +S S P+S + + + S
Sbjct: 307 EREAEEPILASDEVAKEPAGESPAVSPSFEREKSEKSRHESDPKSRENSKPASIYGSVPD 366
Query: 2018 SSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPA-----SESITSSSPAS 2072
T E SE P ES+ SE + SP
Sbjct: 367 LI---RHTPLEDVEEYEPLFPEDESEIAV-KPPTEESSRRPEEEKHRFPSEDVWEDSP-- 420
Query: 2073 ESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGV 2132
S ++ T S+P++ P S T PE E++ +SS S E
Sbjct: 421 ------SSLQDTATVSTPSNP------PPRASET---PEQETSRSSSEVSLDPHQSEL-- 463
Query: 2133 SPHSEKLSANEDP--EEFPNEDVFEHT 2157
SEK A + + FP+ DV+E
Sbjct: 464 --KSEKKKARPEVSKQRFPSRDVWEDA 488
Score = 49.5 bits (118), Expect = 2e-05
Identities = 47/200 (23%), Positives = 66/200 (33%), Gaps = 27/200 (13%)
Query: 1892 SENTTTNSPESESTTTNNPE-----SESTTTSSPESESTTTSSLVSESTTTSSPESESTT 1946
P ES+ E SE SP S T + ST ++ P S T
Sbjct: 387 ESEIAVKPPTEESSRRPEEEKHRFPSEDVWEDSPSSLQDTATV----STPSNPPPRASET 442
Query: 1947 TSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESEST- 2005
PE E++ +SS VS S +SE S+ S V E +PES+
Sbjct: 443 ---PEQETSRSSSEVSLDPHQSELKSEKKKARPEVSKQRFPSRDVWE----DAPESQELV 495
Query: 2006 TTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESI 2065
TT E ++S V++ P+ T+ PK + PA +
Sbjct: 496 TTEETPEEVKSSSPGVTKPAIPSRPKKGKPTSEKRKP-PPVPKKPKPQI-----PARPAK 549
Query: 2066 TSSSPASE----STTTSSPA 2081
A E S P
Sbjct: 550 LQKQQAGEEANSSAFKPKPR 569
Score = 38.8 bits (90), Expect = 0.042
Identities = 60/328 (18%), Positives = 106/328 (32%), Gaps = 27/328 (8%)
Query: 1878 SESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSL----VSE 1933
+E V S+ + E E+ S+P L
Sbjct: 28 NEPPDVPQRPPSVTLPSLGEEGAEYEALEEAELSDSHH--STPAQTRNVGEDLKLHAPKP 85
Query: 1934 STTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTS--SPESESTTTSSLV 1991
S +SS +++ + +S+ + L S+ P +ST+ S S S S+ SS
Sbjct: 86 SLPSSSAKAKVQAVTRTDSQQAAAAGLGRPSSPEQRPVRKSTSRSLHSVASASSQDSSAS 145
Query: 1992 SESTTTSS--------PESESTTTISPVSESTTTSSPVSESTTTISPES------ESTTT 2037
S TSS PE + P + SP + ++ P S
Sbjct: 146 STLRPTSSAVDDEHGIPEIGQRVPMYPNAGDVQAPSPAPYA-NSLPPGSYGLHGHGVFPQ 204
Query: 2038 SSPASEST--TTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASEST 2095
PK E PA + A S + E+ +T S S
Sbjct: 205 EKFEKAWYEKHPEEPKKEEQGEYGPAVGTERPIDWALSSDDLNKIVRETASTGSGLGTSP 264
Query: 2096 TTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSANEDPEEFPNEDVFE 2155
E ++PE ++ SSPA +S++ E+ A E +++V +
Sbjct: 265 EVDGTPEEQVGYTAPEEYTSRLSSPAPDSSSFSSPSGESGLEEREAEEP--ILASDEVAK 322
Query: 2156 HTFAEIPNIDHSNQTDEAIPETFDAREE 2183
E P + S + +++ ++ +
Sbjct: 323 EPAGESPAVSPSFEREKSEKSRHESDPK 350
Score = 36.8 bits (85), Expect = 0.19
Identities = 39/183 (21%), Positives = 68/183 (37%), Gaps = 17/183 (9%)
Query: 2003 ESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPAS 2062
E+ +T S + S E +PE ++ SSPA +S++ ++P ES A
Sbjct: 252 ETASTGSGLGTSPEVDGTPEEQVGYTAPEEYTSRLSSPAPDSSSFSSPSGESGLEEREAE 311
Query: 2063 ESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPAS 2122
E I +S ++ SPA SP+ E + ES P+S + +
Sbjct: 312 EPILASDEVAKEPAGESPAV------SPSFEREKSEKSRHESD----PKSRENSKPASIY 361
Query: 2123 ESTTIEEQGVSPHSEKLSANEDPEEFPNEDVFEHTFAEIPNIDHSNQTDEAIPETFDARE 2182
S + H+ E FP ++ + P + S++ E F + +
Sbjct: 362 GSVPDLIR----HTPLEDVEEYEPLFPEDE--SEIAVKPPT-EESSRRPEEEKHRFPSED 414
Query: 2183 EWP 2185
W
Sbjct: 415 VWE 417
Score = 32.6 bits (74), Expect = 3.4
Identities = 33/184 (17%), Positives = 61/184 (33%), Gaps = 16/184 (8%)
Query: 1970 PESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTIS 2029
P + SP +S S L P S + S E+
Sbjct: 7 PRRRGDRSVSPNPDSFAPSPLNEPPDVPQRPPSVTL-------PSLGEEGAEYEALEEAE 59
Query: 2030 PESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTT--T 2087
S+PA + K + + P+S + + + + T S A+ +
Sbjct: 60 LSDSHH--STPAQTRNVGEDLKLHAPKPSLPSSSAK--AKVQAVTRTDSQQAAAAGLGRP 115
Query: 2088 SSPASESTTTSSPAS--ESTTTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSANEDP 2145
SSP S+ S + SS +S +++T P S S +E G+ +++ +
Sbjct: 116 SSPEQRPVRKSTSRSLHSVASASSQDSSASSTLRPTS-SAVDDEHGIPEIGQRVPMYPNA 174
Query: 2146 EEFP 2149
+
Sbjct: 175 GDVQ 178
>gnl|CDD|197651 smart00320, WD40, WD40 repeats. Note that these repeats are
permuted with respect to the structural repeats (blades)
of the beta propeller domain.
Length = 40
Score = 50.4 bits (121), Expect = 7e-08
Identities = 15/40 (37%), Positives = 28/40 (70%)
Query: 520 TNSVVQKLRGHSDTVTGLSLSPDGSYILSNAMDNTVRIWD 559
+ +++ L+GH+ VT ++ SPDG Y+ S + D T+++WD
Sbjct: 1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
Score = 44.6 bits (106), Expect = 6e-06
Identities = 13/37 (35%), Positives = 20/37 (54%)
Query: 354 PIMLLEGHGGEIFCSKYHPDGQYIASSGYDRQIFIWS 390
+ L+GH G + + PDG+Y+AS D I +W
Sbjct: 4 LLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
Score = 44.6 bits (106), Expect = 7e-06
Identities = 15/38 (39%), Positives = 23/38 (60%)
Query: 396 ENIGVMSGHTGAVMDLKFSTDGCHIFTCSTDQTLAVWD 433
E + + GHTG V + FS DG ++ + S D T+ +WD
Sbjct: 3 ELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
Score = 43.8 bits (104), Expect = 1e-05
Identities = 15/39 (38%), Positives = 23/39 (58%)
Query: 1071 TFKFFISLYGHKLPVLSLDMSYDSTLIATGSGDRTVKVW 1109
+ + +L GH PV S+ S D +A+GS D T+K+W
Sbjct: 1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLW 39
Score = 43.8 bits (104), Expect = 1e-05
Identities = 15/39 (38%), Positives = 23/39 (58%)
Query: 1345 TFKFFISLYGHKLPVLSLDMSYDSTLIATGSGDRTVKVW 1383
+ + +L GH PV S+ S D +A+GS D T+K+W
Sbjct: 1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLW 39
Score = 43.8 bits (104), Expect = 1e-05
Identities = 15/40 (37%), Positives = 23/40 (57%)
Query: 106 TTDVISTFTGHKSAITVIQYDPLGHRLATGSKDTDIVLWD 145
+ +++ T GH +T + + P G LA+GS D I LWD
Sbjct: 1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
Score = 42.7 bits (101), Expect = 3e-05
Identities = 15/33 (45%), Positives = 21/33 (63%)
Query: 1167 SLYGHKLPVLSLDMSYDSTLIATGSGDRTVKVW 1199
+L GH PV S+ S D +A+GS D T+K+W
Sbjct: 7 TLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLW 39
Score = 41.9 bits (99), Expect = 5e-05
Identities = 16/39 (41%), Positives = 20/39 (51%)
Query: 1114 GDCHKSLLAHEDSVTGVTFVPKTHYFFTTSKDGRVKQWD 1152
G+ K+L H VT V F P Y + S DG +K WD
Sbjct: 2 GELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
Score = 41.9 bits (99), Expect = 5e-05
Identities = 16/39 (41%), Positives = 20/39 (51%)
Query: 1204 GDCHKSLLAHEDSVTGVTFVPKTHYFFTTSKDGRVKQWD 1242
G+ K+L H VT V F P Y + S DG +K WD
Sbjct: 2 GELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
Score = 41.9 bits (99), Expect = 5e-05
Identities = 16/39 (41%), Positives = 20/39 (51%)
Query: 1388 GDCHKSLLAHEDSVTGVTFVPKTHYFFTTSKDGRVKQWD 1426
G+ K+L H VT V F P Y + S DG +K WD
Sbjct: 2 GELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
Score = 40.8 bits (96), Expect = 1e-04
Identities = 14/38 (36%), Positives = 17/38 (44%)
Query: 612 TRRIAYKLPGHNGSVNDVQFHPKEPIIMSASSDKTIYL 649
+ + L GH G V V F P + S S D TI L
Sbjct: 1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKL 38
Score = 40.4 bits (95), Expect = 2e-04
Identities = 19/41 (46%), Positives = 26/41 (63%), Gaps = 1/41 (2%)
Query: 436 KGQRIKKMKGHSTFVNSCDPVRRGQLLIASGSDDCTVKVWD 476
G+ +K +KGH+ V S G+ L ASGSDD T+K+WD
Sbjct: 1 SGELLKTLKGHTGPVTSVAFSPDGKYL-ASGSDDGTIKLWD 40
Score = 37.3 bits (87), Expect = 0.002
Identities = 14/40 (35%), Positives = 22/40 (55%)
Query: 192 TGDCFKTMAAHLTEVWGVCVMREDSYLISGSNDAELKVWN 231
+G+ KT+ H V V + YL SGS+D +K+W+
Sbjct: 1 SGELLKTLKGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
Score = 36.5 bits (85), Expect = 0.005
Identities = 13/38 (34%), Positives = 20/38 (52%), Gaps = 2/38 (5%)
Query: 152 LHRLSGHKGVITDIRFMSQPGHHFVVSSAKDTFVKIWD 189
L L GH G +T + F P ++ S + D +K+WD
Sbjct: 5 LKTLKGHTGPVTSVAFS--PDGKYLASGSDDGTIKLWD 40
Score = 33.8 bits (78), Expect = 0.038
Identities = 12/42 (28%), Positives = 20/42 (47%), Gaps = 4/42 (9%)
Query: 568 RCVKVMSGHQHNFEKNLLRCAWSVSGLYVTAGSADKCVYIWD 609
+K + GH + A+S G Y+ +GS D + +WD
Sbjct: 3 ELLKTLKGHTGP----VTSVAFSPDGKYLASGSDDGTIKLWD 40
Score = 33.8 bits (78), Expect = 0.049
Identities = 13/29 (44%), Positives = 18/29 (62%)
Query: 489 TYQVTSVAFNDTAECVLTGGIDNDIKMWD 517
T VTSVAF+ + + +G D IK+WD
Sbjct: 12 TGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
Score = 33.1 bits (76), Expect = 0.083
Identities = 11/32 (34%), Positives = 17/32 (53%), Gaps = 1/32 (3%)
Query: 882 QGHHSEVRALAFSSDNLALVSACA-SQVKIWN 912
+GH V ++AFS D L S +K+W+
Sbjct: 9 KGHTGPVTSVAFSPDGKYLASGSDDGTIKLWD 40
Score = 29.2 bits (66), Expect = 2.0
Identities = 14/48 (29%), Positives = 23/48 (47%), Gaps = 10/48 (20%)
Query: 957 GEILEDIPAHSQELWSVAMLPDQFNPNVYLPLQIQVVTGGGDKSVKLW 1004
GE+L+ + H+ + SVA PD + +G D ++KLW
Sbjct: 2 GELLKTLKGHTGPVTSVAFSPDGK----------YLASGSDDGTIKLW 39
>gnl|CDD|201208 pfam00400, WD40, WD domain, G-beta repeat.
Length = 39
Score = 50.0 bits (120), Expect = 7e-08
Identities = 16/38 (42%), Positives = 27/38 (71%)
Query: 522 SVVQKLRGHSDTVTGLSLSPDGSYILSNAMDNTVRIWD 559
+++ L+GH+ VT ++ SPDG+ + S + D TVR+WD
Sbjct: 2 KLLRTLKGHTGPVTSVAFSPDGNLLASGSDDGTVRVWD 39
Score = 45.8 bits (109), Expect = 2e-06
Identities = 15/39 (38%), Positives = 22/39 (56%)
Query: 395 CENIGVMSGHTGAVMDLKFSTDGCHIFTCSTDQTLAVWD 433
+ + + GHTG V + FS DG + + S D T+ VWD
Sbjct: 1 GKLLRTLKGHTGPVTSVAFSPDGNLLASGSDDGTVRVWD 39
Score = 44.7 bits (106), Expect = 6e-06
Identities = 17/33 (51%), Positives = 22/33 (66%)
Query: 1077 SLYGHKLPVLSLDMSYDSTLIATGSGDRTVKVW 1109
+L GH PV S+ S D L+A+GS D TV+VW
Sbjct: 6 TLKGHTGPVTSVAFSPDGNLLASGSDDGTVRVW 38
Score = 44.7 bits (106), Expect = 6e-06
Identities = 17/33 (51%), Positives = 22/33 (66%)
Query: 1167 SLYGHKLPVLSLDMSYDSTLIATGSGDRTVKVW 1199
+L GH PV S+ S D L+A+GS D TV+VW
Sbjct: 6 TLKGHTGPVTSVAFSPDGNLLASGSDDGTVRVW 38
Score = 44.7 bits (106), Expect = 6e-06
Identities = 17/33 (51%), Positives = 22/33 (66%)
Query: 1351 SLYGHKLPVLSLDMSYDSTLIATGSGDRTVKVW 1383
+L GH PV S+ S D L+A+GS D TV+VW
Sbjct: 6 TLKGHTGPVTSVAFSPDGNLLASGSDDGTVRVW 38
Score = 43.5 bits (103), Expect = 2e-05
Identities = 13/37 (35%), Positives = 22/37 (59%)
Query: 109 VISTFTGHKSAITVIQYDPLGHRLATGSKDTDIVLWD 145
++ T GH +T + + P G+ LA+GS D + +WD
Sbjct: 3 LLRTLKGHTGPVTSVAFSPDGNLLASGSDDGTVRVWD 39
Score = 42.7 bits (101), Expect = 3e-05
Identities = 14/39 (35%), Positives = 19/39 (48%)
Query: 1114 GDCHKSLLAHEDSVTGVTFVPKTHYFFTTSKDGRVKQWD 1152
G ++L H VT V F P + + S DG V+ WD
Sbjct: 1 GKLLRTLKGHTGPVTSVAFSPDGNLLASGSDDGTVRVWD 39
Score = 42.7 bits (101), Expect = 3e-05
Identities = 14/39 (35%), Positives = 19/39 (48%)
Query: 1204 GDCHKSLLAHEDSVTGVTFVPKTHYFFTTSKDGRVKQWD 1242
G ++L H VT V F P + + S DG V+ WD
Sbjct: 1 GKLLRTLKGHTGPVTSVAFSPDGNLLASGSDDGTVRVWD 39
Score = 42.7 bits (101), Expect = 3e-05
Identities = 14/39 (35%), Positives = 19/39 (48%)
Query: 1388 GDCHKSLLAHEDSVTGVTFVPKTHYFFTTSKDGRVKQWD 1426
G ++L H VT V F P + + S DG V+ WD
Sbjct: 1 GKLLRTLKGHTGPVTSVAFSPDGNLLASGSDDGTVRVWD 39
Score = 42.7 bits (101), Expect = 3e-05
Identities = 11/37 (29%), Positives = 18/37 (48%)
Query: 354 PIMLLEGHGGEIFCSKYHPDGQYIASSGYDRQIFIWS 390
+ L+GH G + + PDG +AS D + +W
Sbjct: 3 LLRTLKGHTGPVTSVAFSPDGNLLASGSDDGTVRVWD 39
Score = 42.3 bits (100), Expect = 4e-05
Identities = 20/40 (50%), Positives = 26/40 (65%), Gaps = 1/40 (2%)
Query: 437 GQRIKKMKGHSTFVNSCDPVRRGQLLIASGSDDCTVKVWD 476
G+ ++ +KGH+ V S G LL ASGSDD TV+VWD
Sbjct: 1 GKLLRTLKGHTGPVTSVAFSPDGNLL-ASGSDDGTVRVWD 39
Score = 41.2 bits (97), Expect = 9e-05
Identities = 12/37 (32%), Positives = 18/37 (48%)
Query: 613 RRIAYKLPGHNGSVNDVQFHPKEPIIMSASSDKTIYL 649
++ L GH G V V F P ++ S S D T+ +
Sbjct: 1 GKLLRTLKGHTGPVTSVAFSPDGNLLASGSDDGTVRV 37
Score = 36.2 bits (84), Expect = 0.006
Identities = 12/39 (30%), Positives = 20/39 (51%)
Query: 193 GDCFKTMAAHLTEVWGVCVMREDSYLISGSNDAELKVWN 231
G +T+ H V V + + L SGS+D ++VW+
Sbjct: 1 GKLLRTLKGHTGPVTSVAFSPDGNLLASGSDDGTVRVWD 39
Score = 36.2 bits (84), Expect = 0.007
Identities = 13/38 (34%), Positives = 20/38 (52%), Gaps = 2/38 (5%)
Query: 152 LHRLSGHKGVITDIRFMSQPGHHFVVSSAKDTFVKIWD 189
L L GH G +T + F P + + S + D V++WD
Sbjct: 4 LRTLKGHTGPVTSVAFS--PDGNLLASGSDDGTVRVWD 39
Score = 34.6 bits (80), Expect = 0.023
Identities = 11/43 (25%), Positives = 20/43 (46%), Gaps = 4/43 (9%)
Query: 567 ERCVKVMSGHQHNFEKNLLRCAWSVSGLYVTAGSADKCVYIWD 609
+ ++ + GH + A+S G + +GS D V +WD
Sbjct: 1 GKLLRTLKGH----TGPVTSVAFSPDGNLLASGSDDGTVRVWD 39
Score = 33.9 bits (78), Expect = 0.035
Identities = 11/29 (37%), Positives = 17/29 (58%)
Query: 489 TYQVTSVAFNDTAECVLTGGIDNDIKMWD 517
T VTSVAF+ + +G D +++WD
Sbjct: 11 TGPVTSVAFSPDGNLLASGSDDGTVRVWD 39
Score = 33.5 bits (77), Expect = 0.049
Identities = 11/32 (34%), Positives = 17/32 (53%), Gaps = 1/32 (3%)
Query: 882 QGHHSEVRALAFSSDNLALVSACA-SQVKIWN 912
+GH V ++AFS D L S V++W+
Sbjct: 8 KGHTGPVTSVAFSPDGNLLASGSDDGTVRVWD 39
Score = 30.4 bits (69), Expect = 0.65
Identities = 12/48 (25%), Positives = 22/48 (45%), Gaps = 10/48 (20%)
Query: 957 GEILEDIPAHSQELWSVAMLPDQFNPNVYLPLQIQVVTGGGDKSVKLW 1004
G++L + H+ + SVA PD + +G D +V++W
Sbjct: 1 GKLLRTLKGHTGPVTSVAFSPDGN----------LLASGSDDGTVRVW 38
Score = 28.1 bits (63), Expect = 5.1
Identities = 10/28 (35%), Positives = 13/28 (46%)
Query: 1040 EEQVLCARVSPDSKLLAVSLLDTTVKIF 1067
V SPD LLA D TV+++
Sbjct: 11 TGPVTSVAFSPDGNLLASGSDDGTVRVW 38
Score = 28.1 bits (63), Expect = 5.1
Identities = 10/28 (35%), Positives = 13/28 (46%)
Query: 1314 EEQVLCARVSPDSKLLAVSLLDTTVKIF 1341
V SPD LLA D TV+++
Sbjct: 11 TGPVTSVAFSPDGNLLASGSDDGTVRVW 38
>gnl|CDD|217503 pfam03344, Daxx, Daxx Family. The Daxx protein (also known as the
Fas-binding protein) is thought to play a role in
apoptosis, but precise role played by Daxx remains to be
determined. Daxx forms a complex with Axin.
Length = 715
Score = 57.2 bits (138), Expect = 1e-07
Identities = 58/272 (21%), Positives = 76/272 (27%), Gaps = 26/272 (9%)
Query: 1899 SPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTS 1958
S E ES E E ESE E + SE S E +
Sbjct: 439 SEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGSEEEMEGSSEGDGDGEE 498
Query: 1959 SLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTT-- 2016
S S E + SS+ ES + ES S ES
Sbjct: 499 PEEDAERRNSEMAGISRM---SEGQQPRGSSVQPESPQEEPLQPESMDAESVGEESDEEL 555
Query: 2017 -TSSPVSESTTTISPESESTTTS-------SPASESTTTNNPKSESTTTN---NPASESI 2065
S T + + T P ST+ N + T+T N + +
Sbjct: 556 LAEESPLSSHTELEGVATPVETKISSSRKLPPPPVSTSLENDSATVTSTTRNGNVSPHTP 615
Query: 2066 TSSSPASESTTTSSPASESTTTSS----PASESTTTSSPASESTTTSSP--ESESTTTSS 2119
P S ES + + S PA TS ST +
Sbjct: 616 QDEQPPSGRKRKRKEEVESEPLGNQYLRHHNGSEKDGLPAPMDPVTSCSPVADSSTRVDT 675
Query: 2120 PASESTTIEEQ--GVSPHSEKLSANE--DPEE 2147
P+ E T Q G P K++ DPEE
Sbjct: 676 PSHELVTSSPQTPGDPPKKNKVNVATQCDPEE 707
Score = 54.5 bits (131), Expect = 7e-07
Identities = 45/249 (18%), Positives = 75/249 (30%), Gaps = 20/249 (8%)
Query: 1892 SENTTTNSPESESTTTNNPE---SESTTTSSPESESTTTSSLVSESTTTSSPESESTTTS 1948
+ + +++ ++ ST+ +P ES S E E E ESE
Sbjct: 414 TSSRSSDPSKASSTSGESPSMASQESEEEESVEEEEEEEEEEEEEEQ-----ESEEEEGE 468
Query: 1949 SPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTI 2008
E E + SE S E + E SE S
Sbjct: 469 DEEEEEEVEADNGSEEEMEGSSEGD----GDGEEPEEDAERRNSEMAGISRMSEGQQPRG 524
Query: 2009 SPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSS 2068
S V + P+ + E + A ES +++ + E T S +
Sbjct: 525 SSVQPESPQEEPLQPESMDAESVGEESDEELLAEESPLSSHTELEGVATPVETKISSSRK 584
Query: 2069 -SPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTI 2127
P ST+ + ++ T+T+ + S T P S ES +
Sbjct: 585 LPPPPVSTSLENDSATVTSTTRNGNVSPHTPQD-------EQPPSGRKRKRKEEVESEPL 637
Query: 2128 EEQGVSPHS 2136
Q + H+
Sbjct: 638 GNQYLRHHN 646
Score = 51.8 bits (124), Expect = 5e-06
Identities = 51/266 (19%), Positives = 77/266 (28%), Gaps = 25/266 (9%)
Query: 1902 SESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLV 1961
+ S +++ ++ ST+ SP S ES S E E E E
Sbjct: 414 TSSRSSDPSKASSTSGESPSMAS-------QESEEEESVEEEEE-----EEEEEEEEEQE 461
Query: 1962 SESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPV 2021
SE E E + SE S S E SE S
Sbjct: 462 SEEEEGEDEEEEEEVEADNGSEEEMEGS----SEGDGDGEEPEEDAERRNSEMAGISRMS 517
Query: 2022 SESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPA 2081
S + P + E + A ES SS E T
Sbjct: 518 EGQQPRGSSVQPESPQEEPLQPESMDAESVGEESDEELLAEESPLSSHTELEGVATPVET 577
Query: 2082 SESTTTS-SPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVSPHSEKLS 2140
S++ P ST+ + ++ T+T+ + S T +EQ S K
Sbjct: 578 KISSSRKLPPPPVSTSLENDSATVTSTTRNGNVSP--------HTPQDEQPPSGRKRKRK 629
Query: 2141 ANEDPEEFPNEDVFEHTFAEIPNIDH 2166
+ E N+ + H +E +
Sbjct: 630 EEVESEPLGNQYLRHHNGSEKDGLPA 655
Score = 48.8 bits (116), Expect = 4e-05
Identities = 49/249 (19%), Positives = 73/249 (29%), Gaps = 18/249 (7%)
Query: 1892 SENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPE 1951
SE S E + + E SE S + S + P
Sbjct: 482 SEEEMEGSSEGDG----DGEEPEEDAERRNSEMAGISRMSEGQQPRGSSVQPESPQEEPL 537
Query: 1952 SESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPV 2011
+ + V E + ES +S E E T S++ P PV
Sbjct: 538 QPESMDAESVGEESDEELLAEESPLSSHTELEGVATPVETKISSSRKLPP-------PPV 590
Query: 2012 SESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNN-PASESITSSSP 2070
S S S STT + + + +P E + + + P
Sbjct: 591 STSLENDSATVTSTT----RNGNVSPHTPQDEQPPSGRKRKRKEEVESEPLGNQYLRHHN 646
Query: 2071 ASESTTTSSPASESTTTSSPASESTTTSSPASESTTTS--SPESESTTTSSPASESTTIE 2128
SE +P T+ S A ST +P+ E T+S +P + E
Sbjct: 647 GSEKDGLPAPMDPVTSCSPVADSSTRVDTPSHELVTSSPQTPGDPPKKNKVNVATQCDPE 706
Query: 2129 EQGVSPHSE 2137
E V SE
Sbjct: 707 EVIVLSDSE 715
Score = 44.5 bits (105), Expect = 7e-04
Identities = 38/211 (18%), Positives = 65/211 (30%), Gaps = 1/211 (0%)
Query: 1859 VAISVIDNYSEIIFTTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTS 1918
S + S + S S L + E + ES +S
Sbjct: 505 RRNSEMAGISRMSEGQQPRGSSVQPESPQEEPLQPESMDAESVGEESDEELLAEESPLSS 564
Query: 1919 SPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTS 1978
E E T S++ P +TS +T TS+ + + + +P+ E +
Sbjct: 565 HTELEGVATPVETKISSSRKLPPP-PVSTSLENDSATVTSTTRNGNVSPHTPQDEQPPSG 623
Query: 1979 SPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTS 2038
S + + + T+ SPV++S+T + S TS
Sbjct: 624 RKRKRKEEVESEPLGNQYLRHHNGSEKDGLPAPMDPVTSCSPVADSSTRVDTPSHELVTS 683
Query: 2039 SPASESTTTNNPKSESTTTNNPASESITSSS 2069
SP + K T +P + S S
Sbjct: 684 SPQTPGDPPKKNKVNVATQCDPEEVIVLSDS 714
Score = 33.0 bits (75), Expect = 2.6
Identities = 27/128 (21%), Positives = 42/128 (32%), Gaps = 9/128 (7%)
Query: 2061 ASESITSSSPASESTTTSSP--ASESTTTSSPASESTTTSSPASESTTTSSPESESTTTS 2118
+ S +S + ST+ SP AS+ + E E S E
Sbjct: 413 GTSSRSSDPSKASSTSGESPSMASQESEEEESVEEEEEEEEEEEEEEQESEEEEGEDEEE 472
Query: 2119 SPASESTTIEEQGVSPHSEKLSANEDPEEFPNEDVFEHTFAEIPNIDHSNQTDEAIPETF 2178
E+ E+ + SE E+PEE E +E+ I ++ P
Sbjct: 473 EEEVEADNGSEEEMEGSSEGDGDGEEPEED-----AERRNSEMAGISRMSEGQ--QPRGS 525
Query: 2179 DAREEWPQ 2186
+ E PQ
Sbjct: 526 SVQPESPQ 533
>gnl|CDD|185594 PTZ00395, PTZ00395, Sec24-related protein; Provisional.
Length = 1560
Score = 56.6 bits (136), Expect = 2e-07
Identities = 42/236 (17%), Positives = 92/236 (38%), Gaps = 14/236 (5%)
Query: 1940 PESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSS 1999
P++ P S ++ + S + +++ +S + +++ S ++ + + +++
Sbjct: 373 PDARGAWAGGPHSNASYNCAAYSNAAQSNAAQSNAGFSNAGYSNPGNSNPGYNNAPNSNT 432
Query: 2000 PESESTTTISPVSESTTTSSPVSE---STTTISPESESTTTSSPASESTTTNNPKSESTT 2056
P + + +P S ++ P S S T S S S A + + + +
Sbjct: 433 PYNNPPNSNTPYSNPPNSNPPYSNLPYSNTPYSNAPLSNAPPSSAKDHHSAYHAAYQHRA 492
Query: 2057 TNNPASESITSSSPASE-------STTTSSPASESTTTSSPASESTTTSSPASESTTTSS 2109
N PA+ T++ PA+ ++ + AS ++ + TT+ P +
Sbjct: 493 ANQPAANLPTANQPAANNFHGAAGNSVGNPFASRPFGSAPYGGNAATTADPNGIAKREDH 552
Query: 2110 PESESTTTSSPASESTTIEEQGVSPHSEKLSANEDPEEFPNEDVFEHTFAEIPNID 2165
PE + S+ ++E S SE S NE+ E+++ I ID
Sbjct: 553 PEGGTNRQKYEQSDEESVE----SSSSENSSENENEVTDKGEEIYSLLKKTINRID 604
Score = 51.6 bits (123), Expect = 6e-06
Identities = 39/224 (17%), Positives = 84/224 (37%), Gaps = 15/224 (6%)
Query: 1970 PESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTIS 2029
P++ P S ++ + S + +++ +S + + + S ++ + + + +
Sbjct: 373 PDARGAWAGGPHSNASYNCAAYSNAAQSNAAQSNAGFSNAGYSNPGNSNPGYNNAPNSNT 432
Query: 2030 PESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSS---PASESTT 2086
P + +++P S +N P S +N P S + S++P S + S A +
Sbjct: 433 PYNNPPNSNTPYSNPPNSNPPYSNLPYSNTPYSNAPLSNAPPSSAKDHHSAYHAAYQHRA 492
Query: 2087 TSSPASESTTTSSPASE-------STTTSSPESESTTTSSPASESTTIEEQGVSPHSEKL 2139
+ PA+ T + PA+ ++ + S ++ + T + E
Sbjct: 493 ANQPAANLPTANQPAANNFHGAAGNSVGNPFASRPFGSAPYGGNAATTADPNGIAKRE-- 550
Query: 2140 SANEDPEEFPNEDVFEHTFAEIPNIDHSNQTDEAIPETFDAREE 2183
+ PE N +E + E S + E E D EE
Sbjct: 551 ---DHPEGGTNRQKYEQSDEESVESSSSENSSENENEVTDKGEE 591
Score = 39.7 bits (92), Expect = 0.026
Identities = 32/189 (16%), Positives = 70/189 (37%), Gaps = 17/189 (8%)
Query: 1875 NNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSES 1934
+N ++S S N+ S +N S N P S + + P S + ++ S
Sbjct: 395 SNAAQSNAAQS--NAGFSNAGYSNPGNSNPGYNNAPNSNTPYNNPPNSNTPYSNPPNSNP 452
Query: 1935 TTTSSPESESTTTSSPESESTTTSSLVSEST--------TTSSPESESTTTSSPESE--- 1983
++ P S + +++P S + +S+ S + P + T + P +
Sbjct: 453 PYSNLPYSNTPYSNAPLSNAPPSSAKDHHSAYHAAYQHRAANQPAANLPTANQPAANNFH 512
Query: 1984 ----STTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSS 2039
++ + S ++ + TT P + P + +S+ + S
Sbjct: 513 GAAGNSVGNPFASRPFGSAPYGGNAATTADPNGIAKREDHPEGGTNRQKYEQSDEESVES 572
Query: 2040 PASESTTTN 2048
+SE+++ N
Sbjct: 573 SSSENSSEN 581
Score = 38.9 bits (90), Expect = 0.049
Identities = 28/196 (14%), Positives = 78/196 (39%), Gaps = 9/196 (4%)
Query: 1888 NSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTT 1947
N+ + +N+ +S + +N S + ++ S ++ S + + P S + +
Sbjct: 386 NASYNCAAYSNAAQSNAAQSNAGFSNAGYSNPGNSNPGYNNAPNSNTPYNNPPNSNTPYS 445
Query: 1948 SSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTT 2007
+ P S ++ S + +++P S + +S+ + S ++ + + P + T
Sbjct: 446 NPPNSNPPYSNLPYSNTPYSNAPLSNAPPSSAKDHHSAYHAAY--QHRAANQPAANLPTA 503
Query: 2008 ISPVSE-------STTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNP 2060
P + ++ + S + + TT+ P + ++P+ +
Sbjct: 504 NQPAANNFHGAAGNSVGNPFASRPFGSAPYGGNAATTADPNGIAKREDHPEGGTNRQKYE 563
Query: 2061 ASESITSSSPASESTT 2076
S+ + S +SE+++
Sbjct: 564 QSDEESVESSSSENSS 579
Score = 34.3 bits (78), Expect = 1.0
Identities = 40/281 (14%), Positives = 104/281 (37%), Gaps = 22/281 (7%)
Query: 1894 NTTTNSPESESTT-TNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSP-E 1951
N+ T+ P +E+ T + ++ +S +ES S + + ++ ++P
Sbjct: 244 NSATSPPANENNAVTLSCSNDQQRGASSAAESGYAHHRGSNIASHTPNDNIMHAANNPLN 303
Query: 1952 SESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPV 2011
+ + + + +P +++ E + + SP + S
Sbjct: 304 NTNDAQRNAIQGDLVRGAPNDKNSFDRGNEK-----TYQIYGGFHDGSPNAASAGAPFNG 358
Query: 2012 SESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPA 2071
+ +++ + P++ P S ++ S + +N S + S++
Sbjct: 359 LGNQADGGHINQ----VHPDARGAWAGGPHSNASYNCAAYSNAAQSNAAQSNAGFSNAGY 414
Query: 2072 SESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQG 2131
S ++ + + +++P + +++P S ++ P S +++P S +
Sbjct: 415 SNPGNSNPGYNNAPNSNTPYNNPPNSNTPYSNPPNSNPPYSNLPYSNTPYSNAPLSN--- 471
Query: 2132 VSPHSEKLSANEDPEEFPNEDVFEHTFAEIP--NIDHSNQT 2170
+P S SA + + ++H A P N+ +NQ
Sbjct: 472 -APPS---SAKDHHSAY--HAAYQHRAANQPAANLPTANQP 506
>gnl|CDD|146273 pfam03546, Treacle, Treacher Collins syndrome protein Treacle.
Length = 519
Score = 55.7 bits (133), Expect = 2e-07
Identities = 46/244 (18%), Positives = 87/244 (35%), Gaps = 9/244 (3%)
Query: 1893 ENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLV-SESTTTSSPESESTTTSSPE 1951
E +S E + P + + TTS +++ +S V ST T P + P+
Sbjct: 87 EEEAKSSEEESDSEGETPTAATLTTSPAQAKPLGKNSQVRPASTVTPGPSGKGANLPCPQ 146
Query: 1952 SESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPV 2011
+ + + + SS E ES + +S + ++ S P PV
Sbjct: 147 KAGSAAVQVGKQEDSESSSEEESDSDGPGAPAQAKSSGKLLQARPASGPAKGPPQKAGPV 206
Query: 2012 SESTTTSSPV--SESTTTIS-PESESTTTSSPASESTTTNNPKSEST----TTNNPASES 2064
+ SES+ S E E+ + A P+++++ T P S
Sbjct: 207 ATQVKAERGKEDSESSEESSDSEEEAPAAMTAAQAKPALKTPQTKASPRKGTPITPTSAK 266
Query: 2065 ITSSSPASESTTTSSPASESTTTSSPA-SESTTTSSPASESTTTSSPESESTTTSSPASE 2123
+ + + + + SSPA + T S S+ S E E T ++ +
Sbjct: 267 VPPVRVGTPAPRKAGAVTSPACASSPALARGTQRPDEDSSSSEESESEEEGTAPATARGQ 326
Query: 2124 STTI 2127
+ ++
Sbjct: 327 AKSV 330
Score = 49.1 bits (116), Expect = 3e-05
Identities = 43/243 (17%), Positives = 82/243 (33%), Gaps = 26/243 (10%)
Query: 1914 STTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESE 1973
+ E++S+ S T T++ + S + P ++ S + ST T P +
Sbjct: 82 AAQAGEEEAKSSEEESDSEGETPTAATLTTSPAQAKPLGKN---SQVRPASTVTPGPSGK 138
Query: 1974 STTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESE 2033
P+ + + + + SS E ES + +S + ++ P
Sbjct: 139 GANLPCPQKAGSAAVQVGKQEDSESSSEEESDSDGPGAPAQAKSSGKLLQARPASGPAKG 198
Query: 2034 STTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASE 2093
+ P + K +S ++ +S+S + A + T +SP
Sbjct: 199 PPQKAGPVATQVKAERGKEDSESSEE-SSDSEEEAPAAMTAAQAKPALKTPQTKASPRKG 257
Query: 2094 STTTSSPASES---TTTSSP-----ESESTTTSSPA--------------SESTTIEEQG 2131
+ T + A T +P + SSPA SE + EE+G
Sbjct: 258 TPITPTSAKVPPVRVGTPAPRKAGAVTSPACASSPALARGTQRPDEDSSSSEESESEEEG 317
Query: 2132 VSP 2134
+P
Sbjct: 318 TAP 320
Score = 41.8 bits (97), Expect = 0.005
Identities = 43/256 (16%), Positives = 82/256 (32%), Gaps = 34/256 (13%)
Query: 1900 PESESTTTNNPESESTTT---SSPESESTTTSSLVSESTTTSSPESESTTTS-------- 1948
P + PE +S ++ S E E + + SP+ ++ +
Sbjct: 10 PAATQAKAEKPEEDSESSSEDSDSEEEMPAAKNPPQAKPSGKSPQVKAASAPAKESPQKG 69
Query: 1949 ----SPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESES 2004
+P + E +S ES+S + + TT+ + S S
Sbjct: 70 APPVTPGKAGPAAAQAGEEEAKSSEEESDSEGETPTAATLTTSPAQAKPLGKNSQVRPAS 129
Query: 2005 TTTISPVSESTTTSSPV--------------SESTTTISPESESTTTSSPASESTTTNNP 2050
T T P + P SES++ +S+ + A S
Sbjct: 130 TVTPGPSGKGANLPCPQKAGSAAVQVGKQEDSESSSEEESDSDGPGAPAQAKSSGKLLQA 189
Query: 2051 KSESTTTNNPASE----SITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASE-ST 2105
+ S P + + + + + SS S + +PA+ + + PA +
Sbjct: 190 RPASGPAKGPPQKAGPVATQVKAERGKEDSESSEESSDSEEEAPAAMTAAQAKPALKTPQ 249
Query: 2106 TTSSPESESTTTSSPA 2121
T +SP + T + A
Sbjct: 250 TKASPRKGTPITPTSA 265
Score = 37.9 bits (87), Expect = 0.063
Identities = 40/218 (18%), Positives = 75/218 (34%), Gaps = 15/218 (6%)
Query: 1933 ESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESE-----STTT 1987
E + +SS +S+S E E + + SP+ ++ + + ES T
Sbjct: 22 EDSESSSEDSDS------EEEMPAAKNPPQAKPSGKSPQVKAASAPAKESPQKGAPPVTP 75
Query: 1988 SSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTT 2047
+ E++S+ S T T++ ++ S P ++ S ST T
Sbjct: 76 GKAGPAAAQAGEEEAKSSEEESDSEGETPTAATLTTSPAQAKPLGKN---SQVRPASTVT 132
Query: 2048 NNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTT 2107
P + P + + + SS ES + A +S ++
Sbjct: 133 PGPSGKGANLPCPQKAGSAAVQVGKQEDSESSSEEESDSDGPGAPAQAKSSGKLLQARPA 192
Query: 2108 SSPESESTTTSSPASESTTIEE-QGVSPHSEKLSANED 2144
S P + P + E + S SE+ S +E+
Sbjct: 193 SGPAKGPPQKAGPVATQVKAERGKEDSESSEESSDSEE 230
Score = 36.8 bits (84), Expect = 0.16
Identities = 43/239 (17%), Positives = 79/239 (33%), Gaps = 28/239 (11%)
Query: 1911 ESESTTTSSPE-SESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLV---SESTT 1966
+ SSP + T S S+ S E E T ++ ++ + + + S
Sbjct: 283 VTSPACASSPALARGTQRPDEDSSSSEESESEEEGTAPATARGQAKSVGKGLQVKAASVP 342
Query: 1967 TSSPESESTTTSSPESESTTTSSL---VSESTTTSSPESESTTTISPVSESTT----TSS 2019
T P + T P + + V E + +S ES+S + ++ T +
Sbjct: 343 TKGPLGQGTAPVPPGKTGPAVAQVKAEVQEDSESSEEESDSEEAAATPAQVKTSVKTPQA 402
Query: 2020 PVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSS 2079
+ + T P S+P K S P ++ +S+ + +
Sbjct: 403 KANPAPTRAPPAK--GAASAPGKVVAAAAQAKQRSPAKVKPPVRTLQNSTVSVRGQRSVP 460
Query: 2080 PASESTTTSSPA-----------SESTTTSSPASESTTTSSPESEST----TTSSPASE 2123
++ ++ A SES+ S + E T S T S+PA E
Sbjct: 461 AVGKAVAAAAQAQPGPVKGTEEDSESSEEESDSEEETPAQIKPSGKTPQVRAASAPAKE 519
Score = 33.3 bits (75), Expect = 1.6
Identities = 48/246 (19%), Positives = 84/246 (34%), Gaps = 26/246 (10%)
Query: 1914 STTTSSPE--SESTTTSSLVSESTTTSSPESESTTTSSPESES-------TTTSSLVSES 1964
T +SP + T TS+ V + ++ +SP S T S S
Sbjct: 248 PQTKASPRKGTPITPTSAKVPPVRVGTPAPRKAGAVTSPACASSPALARGTQRPDEDSSS 307
Query: 1965 TTTSSPESESTTTSSP--ESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPV- 2021
+ S E E T ++ +++S V ++ + T P ++ + V
Sbjct: 308 SEESESEEEGTAPATARGQAKSVGKGLQVKAASVPTKGPLGQGTAPVPPGKTGPAVAQVK 367
Query: 2022 ------SESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSS----PA 2071
SES+ S E+ T + S T K+ T P ++ S+ A
Sbjct: 368 AEVQEDSESSEEESDSEEAAATPAQVKTSVKTPQAKANPAPTRAPPAKGAASAPGKVVAA 427
Query: 2072 SESTTTSSPAS----ESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTI 2127
+ SPA T +S S S PA ++ +++ +S +
Sbjct: 428 AAQAKQRSPAKVKPPVRTLQNSTVSVRGQRSVPAVGKAVAAAAQAQPGPVKGTEEDSESS 487
Query: 2128 EEQGVS 2133
EE+ S
Sbjct: 488 EEESDS 493
>gnl|CDD|178748 PLN03209, PLN03209, translocon at the inner envelope of chloroplast
subunit 62; Provisional.
Length = 576
Score = 55.3 bits (133), Expect = 3e-07
Identities = 54/276 (19%), Positives = 92/276 (33%), Gaps = 38/276 (13%)
Query: 1878 SESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTT 1937
+E+T ++ + LL++ + P ES + P+ T +PE+ S
Sbjct: 307 AETTAPLTPMEELLAKIPSQRVPPKESDAADGPKPVPTKPVTPEAPSPPIEE-------- 358
Query: 1938 SSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTT 1997
P P S T L +SP ++S S+S + +E
Sbjct: 359 -EPPQPKAVVPRPLSPYTAYEDL----KPPTSPIPTPPSSSPASSKSVDAVAKPAEPDVV 413
Query: 1998 SSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTT 2057
SP S S P E E++ T SP + P S S T
Sbjct: 414 PSPGSASNV-------------PEVEPAQV---EAKKTRPLSPYARYEDLKPPTSPSPTA 457
Query: 2058 NNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSP--------ASESTTTSS 2109
S S++S+S T+ PA+ +T ++P + SP S + ++
Sbjct: 458 PTGVSPSVSSTSSVPAVPDTA-PATAATDAAAPPPANMRPLSPYAVYDDLKPPTSPSPAA 516
Query: 2110 PESESTTTSSPASESTTIEEQGVSPHSEKLSANEDP 2145
P + +S+ + E+ A P
Sbjct: 517 PVGKVAPSSTNEVVKVGNSAPPTALADEQHHAQPKP 552
Score = 43.0 bits (101), Expect = 0.002
Identities = 33/172 (19%), Positives = 53/172 (30%), Gaps = 18/172 (10%)
Query: 1994 STTTSSPESESTTTISPV-SESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKS 2052
S ES++ PV ++ T +P P P S T + K
Sbjct: 325 SQRVPPKESDAADGPKPVPTKPVTPEAPSPPIEEE--PPQPKAVVPRPLSPYTAYEDLKP 382
Query: 2053 ESTTTNNPASESITSSSP----ASESTTTSSPASESTTT---SSPASESTTTSSPASEST 2105
++ P S S SS A + P+ S + PA + P S
Sbjct: 383 PTSPIPTPPSSSPASSKSVDAVAKPAEPDVVPSPGSASNVPEVEPAQVEAKKTRPLSPYA 442
Query: 2106 --------TTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSANEDPEEFP 2149
T+ SP + + + S +S S+ +P + A P
Sbjct: 443 RYEDLKPPTSPSPTAPTGVSPSVSSTSSVPAVPDTAPATAATDAAAPPPANM 494
>gnl|CDD|237015 PRK11901, PRK11901, hypothetical protein; Reviewed.
Length = 327
Score = 54.3 bits (131), Expect = 4e-07
Identities = 40/215 (18%), Positives = 72/215 (33%), Gaps = 39/215 (18%)
Query: 1939 SPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTS 1998
SP + SS + + L S+ +S +S + +T+ S T+
Sbjct: 60 SPTEHESQQSSNNAGAEKNIDLSGSSSLSSG-----NQSSPSAANNTSDGHDASGVKNTA 114
Query: 1999 SPESESTTTISPVSESTTTSSPVSESTTT--------IS-----------------PESE 2033
P+ + P+S + T ++P IS +
Sbjct: 115 PPQ---DISAPPISPTPTQAAPPQTPNGQQRIELPGNISDALSQQQGQVNAASQNAQGNT 171
Query: 2034 STTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASE 2093
ST ++PA+ + + + T+ + + PA + +T PA+
Sbjct: 172 STLPTAPATVAPSKGAKVPATAETHPTPPQKPATKKPAV------NHHKTATVAVPPATS 225
Query: 2094 STTTSSPASESTTTSSPESESTTTSSPASESTTIE 2128
S AS +S+P S T S AS S T+
Sbjct: 226 GKPKSGAASARALSSAPASHYTLQLSSASRSDTLN 260
Score = 48.9 bits (117), Expect = 2e-05
Identities = 41/207 (19%), Positives = 75/207 (36%), Gaps = 17/207 (8%)
Query: 1897 TNSPESESTTTNNPESE-STTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESEST 1955
T +S+ E + SS S +S + +T+ S T+ P+
Sbjct: 62 TEHESQQSSNNAGAEKNIDLSGSSSLSSGNQSSPSAANNTSDGHDASGVKNTAPPQ---D 118
Query: 1956 TTSSLVSESTTTSSPESESTTTSSPESESTTTSSL------VSESTTTSSPESESTTTIS 2009
++ +S + T ++P E + +L V+ ++ + + + T +
Sbjct: 119 ISAPPISPTPTQAAPPQTPNGQQRIELPGNISDALSQQQGQVNAASQNAQGNTSTLPT-A 177
Query: 2010 PVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSS 2069
P + + + + V + T + T PA N +T PA+ S
Sbjct: 178 PATVAPSKGAKVPATAETHPTPPQKPATKKPAV------NHHKTATVAVPPATSGKPKSG 231
Query: 2070 PASESTTTSSPASESTTTSSPASESTT 2096
AS +S+PAS T S AS S T
Sbjct: 232 AASARALSSAPASHYTLQLSSASRSDT 258
>gnl|CDD|115650 pfam07010, Endomucin, Endomucin. This family consists of several
mammalian endomucin proteins. Endomucin is an early
endothelial-specific antigen that is also expressed on
putative hematopoietic progenitor cells.
Length = 259
Score = 53.2 bits (127), Expect = 5e-07
Identities = 47/180 (26%), Positives = 83/180 (46%), Gaps = 5/180 (2%)
Query: 1890 LLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSS 1949
L N+ NS + N+ + STT +S + +T + V++ TT + P+ +T+
Sbjct: 11 FLLSNSLCNSEGVKEAANNSLVTTSTTKASITTPNTVSLKNVNKPTTGTPPKGTTTSELL 70
Query: 1950 PESESTTTSSLVSE----STTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESEST 2005
S +T +SL + TTT+ ++TS + T S+ VS + +S ++E+
Sbjct: 71 KTSLMSTATSLTTPKHELKTTTTGVRKNESSTSKVTVTNVTLSNAVS-TLQSSQNKTENQ 129
Query: 2006 TTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESI 2065
++I S T+ S S S TT+ S+S T + K ST++ P+ SI
Sbjct: 130 SSIRTTEISPTSVLQPDASPKKTGTTSASLTTAETTSQSQDTEDGKIASTSSTTPSYSSI 189
Score = 45.9 bits (108), Expect = 1e-04
Identities = 50/182 (27%), Positives = 86/182 (47%), Gaps = 22/182 (12%)
Query: 1958 SSLVSESTTTSSPESESTTTSSPESESTTTSSLVS-----ESTTTSSPESESTTTISPVS 2012
S+ + S + S T+S S TT + VS + TT + P+ +T+ + S
Sbjct: 14 SNSLCNSEGVKEAANNSLVTTSTTKASITTPNTVSLKNVNKPTTGTPPKGTTTSELLKTS 73
Query: 2013 ESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPAS 2072
+T +S TT E ++TTT +ES+T+ + T +N A ++ SS +
Sbjct: 74 LMSTATSL-----TTPKHELKTTTTGVRKNESSTSKVTVTNVTLSN--AVSTLQSSQNKT 126
Query: 2073 ES-----TTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESE-----STTTSSPAS 2122
E+ TT SP S +SP TT++S + TT+ S ++E ST++++P+
Sbjct: 127 ENQSSIRTTEISPTSVLQPDASPKKTGTTSASLTTAETTSQSQDTEDGKIASTSSTTPSY 186
Query: 2123 ES 2124
S
Sbjct: 187 SS 188
Score = 45.1 bits (106), Expect = 2e-04
Identities = 32/139 (23%), Positives = 59/139 (42%), Gaps = 9/139 (6%)
Query: 1873 TTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVS 1932
TT + T +MST SL T TTT ++TS + T S+ VS
Sbjct: 64 TTTSELLKTSLMSTATSL------TTPKHELKTTTTGVRKNESSTSKVTVTNVTLSNAVS 117
Query: 1933 ESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVS 1992
T S ++++ SS + + +S++ + + S + ++ E+ S + +
Sbjct: 118 ---TLQSSQNKTENQSSIRTTEISPTSVLQPDASPKKTGTTSASLTTAETTSQSQDTEDG 174
Query: 1993 ESTTTSSPESESTTTISPV 2011
+ +TSS ++ I PV
Sbjct: 175 KIASTSSTTPSYSSIILPV 193
Score = 45.1 bits (106), Expect = 3e-04
Identities = 52/193 (26%), Positives = 87/193 (45%), Gaps = 18/193 (9%)
Query: 1922 SESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPE 1981
S S S V E+ S T+T+ + T SL + + T+ + TTTS
Sbjct: 14 SNSLCNSEGVKEAANNSLVT---TSTTKASITTPNTVSLKNVNKPTTGTPPKGTTTSE-- 68
Query: 1982 SESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPA 2041
+SL+S +T+ ++P+ E TT + V ++ +++S V+ + T+S + +S
Sbjct: 69 ---LLKTSLMSTATSLTTPKHELKTTTTGVRKNESSTSKVTVTNVTLSNAVSTLQSSQNK 125
Query: 2042 SESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPA 2101
+E N S TT +P S +SP TT S S TT+ S+S T
Sbjct: 126 TE-----NQSSIRTTEISPTSVLQPDASPKKTGTT-----SASLTTAETTSQSQDTEDGK 175
Query: 2102 SESTTTSSPESES 2114
ST++++P S
Sbjct: 176 IASTSSTTPSYSS 188
Score = 39.7 bits (92), Expect = 0.012
Identities = 39/156 (25%), Positives = 79/156 (50%), Gaps = 13/156 (8%)
Query: 1979 SPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTS 2038
S + +SLV+ STT +S + +T ++ V++ TT + P +T+ + S +T +
Sbjct: 20 SEGVKEAANNSLVTTSTTKASITTPNTVSLKNVNKPTTGTPPKGTTTSELLKTSLMSTAT 79
Query: 2039 SPAS-----ESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASE 2093
S + ++TTT K+ES+T+ +T ++ + ++ +S++ T + +S
Sbjct: 80 SLTTPKHELKTTTTGVRKNESSTSK------VTVTNVTLSNAVSTLQSSQNKTENQ-SSI 132
Query: 2094 STTTSSPASESTTTSSPESESTTTSSPASESTTIEE 2129
TT SP S +SP ++ TTS+ + + T +
Sbjct: 133 RTTEISPTSVLQPDASP-KKTGTTSASLTTAETTSQ 167
>gnl|CDD|233045 TIGR00601, rad23, UV excision repair protein Rad23. All proteins in
this family for which functions are known are components
of a multiprotein complex used for targeting nucleotide
excision repair to specific parts of the genome. In
humans, Rad23 complexes with the XPC protein. This family
is based on the phylogenomic analysis of JA Eisen (1999,
Ph.D. Thesis, Stanford University) [DNA metabolism, DNA
replication, recombination, and repair].
Length = 378
Score = 53.7 bits (129), Expect = 6e-07
Identities = 43/180 (23%), Positives = 67/180 (37%), Gaps = 37/180 (20%)
Query: 1968 SSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTT-SSPVSESTT 2026
S P++ + + P + T S T T SP + + +S S SP ES T
Sbjct: 75 SKPKTGTGKVAPPAATPT------SAPTPTPSPPASPASGMSAAPASAVEEKSPSEESAT 128
Query: 2027 TISPESESTTTSSPASESTTTNNPKSESTTT----------------------NNP--AS 2062
+PES ST+ S S++ +T SE TT NNP A
Sbjct: 129 ATAPESPSTSVPSSGSDAASTLVVGSERETTIEEIMEMGYEREEVERALRAAFNNPDRAV 188
Query: 2063 ESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPAS 2122
E + + P P T +S A + TT +P S + + + ++ A+
Sbjct: 189 EYLLTGIPED----PEQPEPVQQTAASTA--AATTETPQHGSVFEQAAQGGTEQPATEAA 242
Score = 51.8 bits (124), Expect = 3e-06
Identities = 25/78 (32%), Positives = 38/78 (48%), Gaps = 7/78 (8%)
Query: 2050 PKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTT-SSPASESTTTS 2108
PK+ + PA+ +P S T T SP + + S A S SP+ ES T +
Sbjct: 77 PKTGTGKVAPPAA------TPTSAPTPTPSPPASPASGMSAAPASAVEEKSPSEESATAT 130
Query: 2109 SPESESTTTSSPASESTT 2126
+PES ST+ S S++ +
Sbjct: 131 APESPSTSVPSSGSDAAS 148
Score = 49.5 bits (118), Expect = 1e-05
Identities = 40/177 (22%), Positives = 67/177 (37%), Gaps = 25/177 (14%)
Query: 1921 ESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSP 1980
+ ++ T +T TS+P T S P S ++ S+ + + SP ES T ++P
Sbjct: 76 KPKTGTGKVAPPAATPTSAPTP---TPSPPASPASGMSAAPASAVEEKSPSEESATATAP 132
Query: 1981 ESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSEST--------------- 2025
ES ST+ S S++ +T SE TTI + E V +
Sbjct: 133 ESPSTSVPSSGSDAASTLVVGSERETTIEEIMEMGYEREEVERALRAAFNNPDRAVEYLL 192
Query: 2026 TTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPAS 2082
T I + E + ST + TT P S+ + + ++ A+
Sbjct: 193 TGIPEDPEQPEPVQQTAASTA-------AATTETPQHGSVFEQAAQGGTEQPATEAA 242
Score = 43.0 bits (101), Expect = 0.002
Identities = 22/75 (29%), Positives = 35/75 (46%), Gaps = 3/75 (4%)
Query: 2062 SESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPA 2121
++ T +T TS+P T S PAS ++ S+ + + SP ES T ++P
Sbjct: 77 PKTGTGKVAPPAATPTSAPTP---TPSPPASPASGMSAAPASAVEEKSPSEESATATAPE 133
Query: 2122 SESTTIEEQGVSPHS 2136
S ST++ G S
Sbjct: 134 SPSTSVPSSGSDAAS 148
Score = 43.0 bits (101), Expect = 0.002
Identities = 22/79 (27%), Positives = 34/79 (43%), Gaps = 3/79 (3%)
Query: 1882 VVMSTLNSLLSENTTTN--SPESESTTTNNPESESTTTSSPESESTTT-SSLVSESTTTS 1938
VVM + + +P S T T +P + + S S S ES T +
Sbjct: 71 VVMVSKPKTGTGKVAPPAATPTSAPTPTPSPPASPASGMSAAPASAVEEKSPSEESATAT 130
Query: 1939 SPESESTTTSSPESESTTT 1957
+PES ST+ S S++ +T
Sbjct: 131 APESPSTSVPSSGSDAAST 149
Score = 41.4 bits (97), Expect = 0.005
Identities = 31/181 (17%), Positives = 63/181 (34%), Gaps = 26/181 (14%)
Query: 1900 PESESTTTNNPES--ESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTT 1957
P++ + P + S T +P ++ S + + + + + S ES + T
Sbjct: 77 PKTGTGKVAPPAATPTSAPTPTPSPPASPASGMSAAPASAVEEK-----SPSEESATATA 131
Query: 1958 SSLVSESTTTSSPESESTTTSSPESESTTT--------SSLVSESTTTS--SPESESTTT 2007
S S +S ++ ST E E+T V + + +P+
Sbjct: 132 PESPSTSVPSSGSDAASTLVVGSERETTIEEIMEMGYEREEVERALRAAFNNPDRAVEYL 191
Query: 2008 ISPVSESTTTSSPVSE------STTTISPESESTTTSSPASESTTTNNPKSESTTTNNPA 2061
++ + E PV + + TT +P+ S + + P +E+ NP
Sbjct: 192 LTGIPEDPEQPEPVQQTAASTAAATTETPQHGSVFEQAAQGGTE---QPATEAAQGGNPL 248
Query: 2062 S 2062
Sbjct: 249 E 249
Score = 34.5 bits (79), Expect = 0.80
Identities = 14/69 (20%), Positives = 24/69 (34%), Gaps = 8/69 (11%)
Query: 2078 SSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVSPHSE 2137
S P + + + PA+ +P S T T SP + + S A S E+
Sbjct: 75 SKPKTGTGKVAPPAA------TPTSAPTPTPSPPASPASGMSAAPASAVEEKS--PSEES 126
Query: 2138 KLSANEDPE 2146
+ +
Sbjct: 127 ATATAPESP 135
>gnl|CDD|115579 pfam06933, SSP160, Special lobe-specific silk protein SSP160. This
family consists of several special lobe-specific silk
protein SSP160 sequences which appear to be specific to
Chironomus (Midge) species.
Length = 758
Score = 54.0 bits (129), Expect = 9e-07
Identities = 56/271 (20%), Positives = 121/271 (44%), Gaps = 27/271 (9%)
Query: 1880 STVVMSTLN--SLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTT 1937
S ++ ++ N +++S N S S + N S S+ S+ S STT+++ + S +T
Sbjct: 78 SGIIKASFNLIAMISANIQAIQSGSGSASGN---SSSSANSTSNSNSTTSNNSTTSSNST 134
Query: 1938 SSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTT 1997
++ + +++++S S T+ +S+VS T + +S+ + S +L + +
Sbjct: 135 TTTSNSTSSSNSTSSGLTSGASVVSLIDTCAWVYQDSSVGIAYLMVSIL--ALFYGQSVS 192
Query: 1998 SSPESE-------STTTISPVSESTTTSSPVSESTTTIS---------PESESTTTSSPA 2041
+ P ++ + + + V +S + ++ TI+ + + +
Sbjct: 193 APPYADLGIPALPANCSGAGVPQSVQIKAAIAYINITINFINLTGQQFEDLQGPVATDCG 252
Query: 2042 SESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPA 2101
+TT+ P A E+ + S ++ ST+ S+ S STT S+ STTT++
Sbjct: 253 CPNTTSVAPLVAEWEAILAALEAFANGSASANSTSNSNSTSNSTTNSN----STTTTNST 308
Query: 2102 SESTTTSSPESESTTTSSPASESTTIEEQGV 2132
+ + +TSS S + + + TI Q +
Sbjct: 309 TSTNSTSSSNSSTIAGCIDIAANFTIALQNL 339
Score = 46.7 bits (110), Expect = 2e-04
Identities = 61/250 (24%), Positives = 95/250 (38%), Gaps = 46/250 (18%)
Query: 1850 LISMLAATAVAISVIDNYSEIIFTTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNN 1909
LI+M++A AI S + N+S S S NS S N+TT+S + +TTT+N
Sbjct: 87 LIAMISANIQAIQ-----SGSGSASGNSSSSANSTSNSNSTTSNNSTTSS--NSTTTTSN 139
Query: 1910 PESESTTTSSPESESTTTSSLVSESTTTSSPES--------------ESTTTSSPESEST 1955
S S +TSS + + SL+ S + S+P
Sbjct: 140 STSSSNSTSSGLTSGASVVSLIDTCAWVYQDSSVGIAYLMVSILALFYGQSVSAPPYADL 199
Query: 1956 TTSSLVSESTTTSSPES-----------------ESTTTSSPESESTTTSSLVSESTTTS 1998
+L + + P+S T + + + +TT+
Sbjct: 200 GIPALPANCSGAGVPQSVQIKAAIAYINITINFINLTGQQFEDLQGPVATDCGCPNTTSV 259
Query: 1999 SPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTN 2058
+P I E+ S + ST+ S ST+ S+ S STTT N STT+
Sbjct: 260 APLVAEWEAILAALEAFANGSASANSTSN----SNSTSNSTTNSNSTTTTN----STTST 311
Query: 2059 NPASESITSS 2068
N S S +S+
Sbjct: 312 NSTSSSNSST 321
Score = 37.4 bits (86), Expect = 0.11
Identities = 18/69 (26%), Positives = 42/69 (60%)
Query: 2065 ITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASES 2124
I+++ A +S + S+ + S++ +S ++ ++TTS+ ++ S+ +++ S ST++S+ S
Sbjct: 91 ISANIQAIQSGSGSASGNSSSSANSTSNSNSTTSNNSTTSSNSTTTTSNSTSSSNSTSSG 150
Query: 2125 TTIEEQGVS 2133
T VS
Sbjct: 151 LTSGASVVS 159
Score = 37.1 bits (85), Expect = 0.16
Identities = 27/134 (20%), Positives = 61/134 (45%), Gaps = 2/134 (1%)
Query: 1881 TVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSP 1940
T S+L + L+ T + + + NN E +S+ + ES +++++
Sbjct: 607 TKAESSLTAFLASFNATINATIAAASANNSEVQSSEAACIESSLADAAAILAMFEAAYQN 666
Query: 1941 ESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSP 2000
+ + + P + +TTTSS + +TTT++ + TTT++ + + T L + + +
Sbjct: 667 CTAPGSVTVPAAANTTTSS--TTTTTTTTTTAAPTTTTTKAANAPFTYPLCNLIMSAACS 724
Query: 2001 ESESTTTISPVSES 2014
+ T +S +
Sbjct: 725 AGGAGCTYPFISSA 738
Score = 34.4 bits (78), Expect = 0.86
Identities = 18/78 (23%), Positives = 38/78 (48%)
Query: 1943 ESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPES 2002
E+ S + ST+ S+ S STT S+ + + +T+S S S++ SS ++ ++ +
Sbjct: 274 EAFANGSASANSTSNSNSTSNSTTNSNSTTTTNSTTSTNSTSSSNSSTIAGCIDIAANFT 333
Query: 2003 ESTTTISPVSESTTTSSP 2020
+ + + T +P
Sbjct: 334 IALQNLQALLLQEATCAP 351
Score = 34.0 bits (77), Expect = 1.2
Identities = 25/118 (21%), Positives = 58/118 (49%), Gaps = 11/118 (9%)
Query: 1911 ESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVS-------- 1962
++ES+ T+ S + T ++ ++ ++ +S E +S+ + ES +++++
Sbjct: 608 KAESSLTAFLASFNATINATIAAASANNS-EVQSSEAACIESSLADAAAILAMFEAAYQN 666
Query: 1963 --ESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTS 2018
+ + P + +TTTSS + +TTT++ +TTT + + T + + S S
Sbjct: 667 CTAPGSVTVPAAANTTTSSTTTTTTTTTTAAPTTTTTKAANAPFTYPLCNLIMSAACS 724
Score = 34.0 bits (77), Expect = 1.4
Identities = 24/109 (22%), Positives = 46/109 (42%), Gaps = 4/109 (3%)
Query: 1963 ESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVS 2022
E+ S + ST+ S+ S STT S+ + + +T+S S S++ S ++ ++ +
Sbjct: 274 EAFANGSASANSTSNSNSTSNSTTNSNSTTTTNSTTSTNSTSSSNSSTIAGCIDIAANFT 333
Query: 2023 ESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPA 2071
+ + T +PA + N K P + T+S A
Sbjct: 334 IALQNLQALLLQEATCAPALAA----NAKKSGVRDFGPCKAAKTASGCA 378
Score = 33.6 bits (76), Expect = 1.6
Identities = 14/60 (23%), Positives = 35/60 (58%), Gaps = 1/60 (1%)
Query: 1879 ESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTS 1938
E +++ L + + + + NS + ++T+N+ + S +T++ S ++T S+ S S+T +
Sbjct: 265 EWEAILAALEAFANGSASANSTSNSNSTSNS-TTNSNSTTTTNSTTSTNSTSSSNSSTIA 323
Score = 31.3 bits (70), Expect = 7.5
Identities = 29/100 (29%), Positives = 51/100 (51%), Gaps = 9/100 (9%)
Query: 2031 ESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSP 2090
++ES+ T+ AS + T N +T A+ S SS A+ ++ + A+
Sbjct: 608 KAESSLTAFLASFNATIN-----ATIAAASANNSEVQSSEAACIESSLADAAAILAMFEA 662
Query: 2091 ASESTT----TSSPASESTTTSSPESESTTTSSPASESTT 2126
A ++ T + PA+ +TTTSS + +TTT++ A +TT
Sbjct: 663 AYQNCTAPGSVTVPAAANTTTSSTTTTTTTTTTAAPTTTT 702
>gnl|CDD|215621 PLN03188, PLN03188, kinesin-12 family protein; Provisional.
Length = 1320
Score = 53.8 bits (129), Expect = 1e-06
Identities = 45/215 (20%), Positives = 70/215 (32%), Gaps = 19/215 (8%)
Query: 1898 NSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTS----SPESE 1953
S + + + + E + E T T + T P E
Sbjct: 555 QSIIKQGSEDTDVDMEEAISEQEEKHEITIVDCAEPVRNTQNSLQIDTLDHESSEQPLEE 614
Query: 1954 STTTSSLVSESTTTSSP-ESESTTTSSPESESTT-TSSLVSESTTTSSPE-------SES 2004
S VS+ T SP + S +S S + S+ VS + ++ E S S
Sbjct: 615 KNALHSSVSKLNTEESPSKMVEIRPSCQDSVSESGVSTGVSVADESNDSENELVNCASPS 674
Query: 2005 TTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASES 2064
+ +I PV S SP + I +S TSS + S + +S + E
Sbjct: 675 SLSIVPVEVSPVLKSPTLSVSPRIRNSRKSLRTSSMLTAS------QKDSEDESKLTPED 728
Query: 2065 ITSSSPASESTTTSSPASESTTTSSPASESTTTSS 2099
S S +SS S + S A +S
Sbjct: 729 AEPSFAKSMKNNSSSALSTQKSKSFLAPTEHLAAS 763
Score = 46.9 bits (111), Expect = 2e-04
Identities = 50/225 (22%), Positives = 78/225 (34%), Gaps = 18/225 (8%)
Query: 1928 SSLVSESTTTSSPESESTTTSSPES---ESTTTSSLVSESTTTSSPESESTTTSSPESES 1984
+ +E ES +S +S + + + + E + E T
Sbjct: 532 PAGAAEGNNVDMGRVESIHSSDQQSIIKQGSEDTDVDMEEAISEQEEKHEITIVDCAEPV 591
Query: 1985 TTTSSLVSESTTTS----SPESESTTTISPVSESTTTSSP-----VSESTTTISPESEST 2035
T + + T P E S VS+ T SP + S ES +
Sbjct: 592 RNTQNSLQIDTLDHESSEQPLEEKNALHSSVSKLNTEESPSKMVEIRPSCQDSVSESGVS 651
Query: 2036 TTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASEST 2095
T S A ES N+ ++E N AS S S P S SP + + +S
Sbjct: 652 TGVSVADES---NDSENELV---NCASPSSLSIVPVEVSPVLKSPTLSVSPRIRNSRKSL 705
Query: 2096 TTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVSPHSEKLS 2140
TSS + S S ES+ T + S + +++ S S + S
Sbjct: 706 RTSSMLTASQKDSEDESKLTPEDAEPSFAKSMKNNSSSALSTQKS 750
Score = 36.5 bits (84), Expect = 0.21
Identities = 35/194 (18%), Positives = 68/194 (35%), Gaps = 27/194 (13%)
Query: 1833 DSSMNLLSVSPYITNNLLISMLAATAVAISVIDNYSEIIFTTNNNSESTVVMSTLNSLLS 1892
D+ +++ I+++ + ++ ++ + + L+S +S
Sbjct: 564 DTDVDMEEAISEQEEKHEITIVDCAEPVRNTQNSLQIDTLDHESSEQPLEEKNALHSSVS 623
Query: 1893 ENTTTNSPE----------------------SESTTTNNPESESTTTSSPESESTTTSSL 1930
+ T SP S + +N+ E+E +SP S S +
Sbjct: 624 KLNTEESPSKMVEIRPSCQDSVSESGVSTGVSVADESNDSENELVNCASPSSLSIVPVEV 683
Query: 1931 VSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSL 1990
S SP + +S TSS+++ S S E ES T S S
Sbjct: 684 ---SPVLKSPTLSVSPRIRNSRKSLRTSSMLTASQKDS--EDESKLTPEDAEPSFAKSMK 738
Query: 1991 VSESTTTSSPESES 2004
+ S+ S+ +S+S
Sbjct: 739 NNSSSALSTQKSKS 752
>gnl|CDD|240244 PTZ00049, PTZ00049, cathepsin C-like protein; Provisional.
Length = 693
Score = 53.4 bits (128), Expect = 1e-06
Identities = 20/49 (40%), Positives = 30/49 (61%), Gaps = 3/49 (6%)
Query: 2357 HSVKIIGWGKSSQN---EPYWLCTNSYNQGWGEQGLFKIRRGVNMCSIE 2402
H++ ++GWG+ N YW+ NS+ + WG++G FKI RG N IE
Sbjct: 620 HAIVLVGWGEEEINGKLYKYWIGRNSWGKNWGKEGYFKIIRGKNFSGIE 668
>gnl|CDD|216257 pfam01034, Syndecan, Syndecan domain. Syndecans are transmembrane
heparin sulfate proteoglycans which are implicated in the
binding of extracellular matrix components and growth
factors.
Length = 207
Score = 50.9 bits (122), Expect = 2e-06
Identities = 38/144 (26%), Positives = 61/144 (42%), Gaps = 8/144 (5%)
Query: 1929 SLVSESTTTSSPESESTTTSSPESE-STTTSSLVSESTTTSSPESESTTTSS--PESEST 1985
+L ++ + +E + E S + + S S T S +SE
Sbjct: 12 ALSAQPALAAQAAAEYPDERYLDEEGSGDDDEFIDDEMDDEYSGSGSGATPSDDEDSEPV 71
Query: 1986 TTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASEST 2045
TTS+ + TTTSS S TTT S ++++ T S +TT SP SE+ T + + ST
Sbjct: 72 TTSATPPKLTTTSSSPSNDTTTASTSTKTSPTVSTTVTTTT--SP-SETDTEEATTTVST 128
Query: 2046 TTNNPKSESTTTNNPASESITSSS 2069
T S T+ S+++
Sbjct: 129 ETPTEGGSSAATD--PSKNLLERK 150
Score = 50.1 bits (120), Expect = 2e-06
Identities = 31/91 (34%), Positives = 51/91 (56%), Gaps = 3/91 (3%)
Query: 2052 SESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPE 2111
S +T +++ SE +T+S+ + TTTSS S TTT+S +++++ T S +TT SP
Sbjct: 58 SGATPSDDEDSEPVTTSATPPKLTTTSSSPSNDTTTASTSTKTSPTVSTTVTTTT--SP- 114
Query: 2112 SESTTTSSPASESTTIEEQGVSPHSEKLSAN 2142
SE+ T + + ST +G S + S N
Sbjct: 115 SETDTEEATTTVSTETPTEGGSSAATDPSKN 145
Score = 47.0 bits (112), Expect = 3e-05
Identities = 31/121 (25%), Positives = 55/121 (45%), Gaps = 4/121 (3%)
Query: 2022 SESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPA 2081
S + S S +T +++ SE TT+ + T+SS S TTT+S +
Sbjct: 38 SGDDDEFIDDEMDDEYSGSGSGATPSDDEDSEPVTTSATPPKLTTTSSSPSNDTTTASTS 97
Query: 2082 SESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPA-SESTTIEEQGVSPHSEKLS 2140
++++ T S +TT+ SE+ T + + ST T + S + T + + E L+
Sbjct: 98 TKTSPTVSTTVTTTTS---PSETDTEEATTTVSTETPTEGGSSAATDPSKNLLERKEVLA 154
Query: 2141 A 2141
A
Sbjct: 155 A 155
Score = 47.0 bits (112), Expect = 4e-05
Identities = 30/89 (33%), Positives = 49/89 (55%), Gaps = 4/89 (4%)
Query: 1902 SESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLV 1961
S +T +++ +SE TTS+ + TTTSS S TTT+S ++++ T S +TT+
Sbjct: 58 SGATPSDDEDSEPVTTSATPPKLTTTSSSPSNDTTTASTSTKTSPTVSTTVTTTTSP--- 114
Query: 1962 SESTTTSSPESESTTTSS-PESESTTTSS 1989
SE+ T + + ST T + S + T S
Sbjct: 115 SETDTEEATTTVSTETPTEGGSSAATDPS 143
Score = 46.7 bits (111), Expect = 4e-05
Identities = 31/91 (34%), Positives = 48/91 (52%), Gaps = 3/91 (3%)
Query: 1912 SESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPE 1971
S +T + +SE TTS+ + TTTSS S TTT+S ++++ T S +TT SP
Sbjct: 58 SGATPSDDEDSEPVTTSATPPKLTTTSSSPSNDTTTASTSTKTSPTVSTTVTTTT--SP- 114
Query: 1972 SESTTTSSPESESTTTSSLVSESTTTSSPES 2002
SE+ T + + ST T + S T ++
Sbjct: 115 SETDTEEATTTVSTETPTEGGSSAATDPSKN 145
Score = 44.7 bits (106), Expect = 2e-04
Identities = 30/94 (31%), Positives = 48/94 (51%), Gaps = 1/94 (1%)
Query: 1892 SENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPE 1951
S T ++ +SE TT+ + TTTSS S TTT+S ++++ T S +TT+ S E
Sbjct: 58 SGATPSDDEDSEPVTTSATPPKLTTTSSSPSNDTTTASTSTKTSPTVSTTVTTTTSPS-E 116
Query: 1952 SESTTTSSLVSESTTTSSPESESTTTSSPESEST 1985
+++ ++ VS T T S +T S E
Sbjct: 117 TDTEEATTTVSTETPTEGGSSAATDPSKNLLERK 150
Score = 44.3 bits (105), Expect = 2e-04
Identities = 31/104 (29%), Positives = 46/104 (44%), Gaps = 11/104 (10%)
Query: 1992 SESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPK 2051
S +T + +SE TT + + TTTSS S TTT S ++++ T S +TT+ P
Sbjct: 58 SGATPSDDEDSEPVTTSATPPKLTTTSSSPSNDTTTASTSTKTSPTVSTTVTTTTS--P- 114
Query: 2052 SESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASEST 2095
SE+ T + S T T +S +T S E
Sbjct: 115 SETDTEEATTT--------VSTETPTEGGSSAATDPSKNLLERK 150
Score = 43.2 bits (102), Expect = 6e-04
Identities = 25/86 (29%), Positives = 40/86 (46%)
Query: 2067 SSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTT 2126
S S +T + SE TTS+ + TTTSS S TTT+S ++++ T S +TT
Sbjct: 53 YSGSGSGATPSDDEDSEPVTTSATPPKLTTTSSSPSNDTTTASTSTKTSPTVSTTVTTTT 112
Query: 2127 IEEQGVSPHSEKLSANEDPEEFPNED 2152
+ + + + E P E +
Sbjct: 113 SPSETDTEEATTTVSTETPTEGGSSA 138
Score = 39.3 bits (92), Expect = 0.012
Identities = 26/86 (30%), Positives = 41/86 (47%), Gaps = 3/86 (3%)
Query: 2012 SESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPA 2071
S +T + SE TT + + TTTSS S TTT + ++++ T + + TS
Sbjct: 58 SGATPSDDEDSEPVTTSATPPKLTTTSSSPSNDTTTASTSTKTSPTVSTTVTTTTS---P 114
Query: 2072 SESTTTSSPASESTTTSSPASESTTT 2097
SE+ T + + ST T + S T
Sbjct: 115 SETDTEEATTTVSTETPTEGGSSAAT 140
Score = 35.1 bits (81), Expect = 0.28
Identities = 21/84 (25%), Positives = 42/84 (50%), Gaps = 4/84 (4%)
Query: 1892 SENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTT----TSSPESESTTT 1947
SE TT++ + TTT++ S TTT+S ++++ T S +TT T + E+ +T +
Sbjct: 68 SEPVTTSATPPKLTTTSSSPSNDTTTASTSTKTSPTVSTTVTTTTSPSETDTEEATTTVS 127
Query: 1948 SSPESESTTTSSLVSESTTTSSPE 1971
+ +E ++++ E
Sbjct: 128 TETPTEGGSSAATDPSKNLLERKE 151
>gnl|CDD|191716 pfam07263, DMP1, Dentin matrix protein 1 (DMP1). This family
consists of several mammalian dentin matrix protein 1
(DMP1) sequences. The dentin matrix acidic phosphoprotein
1 (DMP1) gene has been mapped to human chromosome 4q21.
DMP1 is a bone and teeth specific protein initially
identified from mineralised dentin. DMP1 is primarily
localised in the nuclear compartment of undifferentiated
osteoblasts. In the nucleus, DMP1 acts as a
transcriptional component for activation of
osteoblast-specific genes like osteocalcin. During the
early phase of osteoblast maturation, Ca(2+) surges into
the nucleus from the cytoplasm, triggering the
phosphorylation of DMP1 by a nuclear isoform of casein
kinase II. This phosphorylated DMP1 is then exported out
into the extracellular matrix, where it regulates
nucleation of hydroxyapatite. DMP1 is a unique molecule
that initiates osteoblast differentiation by
transcription in the nucleus and orchestrates mineralised
matrix formation extracellularly, at later stages of
osteoblast maturation. The DMP1 gene has been found to be
ectopically expressed in lung cancer although the reason
for this is unknown.
Length = 514
Score = 52.7 bits (126), Expect = 2e-06
Identities = 58/213 (27%), Positives = 96/213 (45%), Gaps = 9/213 (4%)
Query: 1874 TNNNSESTVVMSTLNSLLSENTTTNSPES-ESTTTNNPESESTTTSSPESESTTTSSLVS 1932
++N+ ST N+ LS++ + ES E + N + +S P SES+ + L S
Sbjct: 284 DDSNTMEVKSDSTENAGLSQSREHSRSESQEDSEENQSQEDSQEVQDPSSESSQEADLPS 343
Query: 1933 ESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVS 1992
+ ++ S E E + S ++ TTS + + SS E T SS ES+ST
Sbjct: 344 QENSSESQE-EVVSESRGDNPDNTTSHSEDQEDSESSEEDSLDTPSSSESQST------E 396
Query: 1993 ESTTTSSPESESTTTISPVS-ESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPK 2051
E + S ES S++ SP S E +SS + + S ES S + S + ++
Sbjct: 397 EQADSESNESLSSSEESPESTEDENSSSQEGLQSHSASTESRSQESQSEQDSRSEEDDSD 456
Query: 2052 SESTTTNNPASESITSSSPASESTTTSSPASES 2084
S+ ++ + S S S+S + E + ES
Sbjct: 457 SQDSSRSKEDSNSTESASSSEEDGQPKNTEIES 489
Score = 52.0 bits (124), Expect = 3e-06
Identities = 59/252 (23%), Positives = 102/252 (40%), Gaps = 11/252 (4%)
Query: 1901 ESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSL 1960
E E + ES + P + S + E +S + S +E+ S
Sbjct: 245 EDEEQASTQDSGESQSVEYPSRKFFRKSRISEEDGRGELDDSNTMEVKSDSTENAGLSQS 304
Query: 1961 VSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSES-----T 2015
S + S +SE + E SS S+ S E+ S + VSES
Sbjct: 305 REHSRSESQEDSEENQSQEDSQEVQDPSSESSQEADLPSQENSSESQEEVVSESRGDNPD 364
Query: 2016 TTSSPVSESTTTISPESESTTTSSPASESTTTNNP-KSESTTTNNPASESITSS----SP 2070
T+S + + S E +S T S +SES +T SES + + + ES S+ S
Sbjct: 365 NTTSHSEDQEDSESSEEDSLDTPS-SSESQSTEEQADSESNESLSSSEESPESTEDENSS 423
Query: 2071 ASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQ 2130
+ E + S ++ES + S + + + + S+S +S + +S +T S +S + +
Sbjct: 424 SQEGLQSHSASTESRSQESQSEQDSRSEEDDSDSQDSSRSKEDSNSTESASSSEEDGQPK 483
Query: 2131 GVSPHSEKLSAN 2142
S KL+ +
Sbjct: 484 NTEIESRKLTVD 495
Score = 51.6 bits (123), Expect = 4e-06
Identities = 62/242 (25%), Positives = 104/242 (42%), Gaps = 20/242 (8%)
Query: 1892 SENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPE 1951
E +N+ E +S +T E+ S S + S +S S E +S P
Sbjct: 281 GELDDSNTMEVKSDST-----ENAGLSQSREHSRSESQ--EDSEENQSQE-DSQEVQDPS 332
Query: 1952 SESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPV 2011
SES+ + L S+ ++ S E E + S ++ TTS + + SS E T S
Sbjct: 333 SESSQEADLPSQENSSESQE-EVVSESRGDNPDNTTSHSEDQEDSESSEEDSLDTPSS-- 389
Query: 2012 SESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKS-ESTTTNNPASESITSSSP 2070
SES +T S +ES ++S + EST N S E +++ ++ES + S
Sbjct: 390 SESQSTEEQAD------SESNESLSSSEESPESTEDENSSSQEGLQSHSASTESRSQESQ 443
Query: 2071 ASESTTTSSPASESTTTSSPASESTTTSSPAS--ESTTTSSPESESTTTSSPASESTTIE 2128
+ + + + S+S +S +S +T S +S E + E ES + A + I
Sbjct: 444 SEQDSRSEEDDSDSQDSSRSKEDSNSTESASSSEEDGQPKNTEIESRKLTVDAYHNKPIG 503
Query: 2129 EQ 2130
+Q
Sbjct: 504 DQ 505
Score = 49.3 bits (117), Expect = 2e-05
Identities = 59/263 (22%), Positives = 102/263 (38%), Gaps = 14/263 (5%)
Query: 1892 SENTTTNSPESESTTTNNPESESTTTSSPESEST--TTSSLVSESTTTSSPESESTTTSS 1949
+E+ + PE +T ++ E E ES+ S E + PES T S
Sbjct: 170 NEDEVDSRPEGGDSTQDSESEEHWVGGGSEGESSHGDGSEFDDEGMQSDDPES----TRS 225
Query: 1950 PESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTIS 2009
S +S+ + + E +++T S ES+S S + S E
Sbjct: 226 ERGNSRMSSAGLKSKESKGEDEEQASTQDSGESQSVEYPSRKFFRKSRISEEDGRGELDD 285
Query: 2010 PVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSS 2069
S + S +E+ S + S E + N + +S +P+SES S
Sbjct: 286 --SNTMEVKSDSTENAGLSQSREHSRSESQ---EDSEENQSQEDSQEVQDPSSES---SQ 337
Query: 2070 PASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEE 2129
A + +S S+ S ++ ++ SE S E + + +SES + EE
Sbjct: 338 EADLPSQENSSESQEEVVSESRGDNPDNTTSHSEDQEDSESSEEDSLDTPSSSESQSTEE 397
Query: 2130 QGVSPHSEKLSANEDPEEFPNED 2152
Q S +E LS++E+ E ++
Sbjct: 398 QADSESNESLSSSEESPESTEDE 420
Score = 47.0 bits (111), Expect = 1e-04
Identities = 60/260 (23%), Positives = 98/260 (37%), Gaps = 21/260 (8%)
Query: 1893 ENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPE- 1951
E ++ PES + N S S ES+ +++T S ES+S S +
Sbjct: 213 EGMQSDDPESTRSERGNSRMSSAGLKSKESKGEDEE----QASTQDSGESQSVEYPSRKF 268
Query: 1952 ------SESTTTSSLVSESTT-TSSPESESTTTSSPESESTTTSSLVSESTTTSSPESES 2004
SE L +T S +E+ S S + S +S S E +S
Sbjct: 269 FRKSRISEEDGRGELDDSNTMEVKSDSTENAGLSQSREHSRSESQ--EDSEENQSQE-DS 325
Query: 2005 TTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASES 2064
P SES S + S ES+ S ++ SE + + E
Sbjct: 326 QEVQDPSSES---SQEADLPSQENSSESQEEVVSESRGDNPDNTTSHSEDQEDSESSEED 382
Query: 2065 ITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASES 2124
+ +SES +T A + S +SE + S+ S++ +S S +T S + ES
Sbjct: 383 SLDTPSSSESQSTEEQADSESNESLSSSEESPESTEDENSSSQEGLQSHSASTESRSQES 442
Query: 2125 TTIEEQGVSPHSEKLSANED 2144
+ ++ S E S ++D
Sbjct: 443 QSEQD---SRSEEDDSDSQD 459
Score = 42.0 bits (98), Expect = 0.004
Identities = 55/258 (21%), Positives = 101/258 (39%), Gaps = 21/258 (8%)
Query: 1901 ESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSL 1960
+ E +++PES T S S +S+ + + E +++T S ES+S S
Sbjct: 211 DDEGMQSDDPES----TRSERGNSRMSSAGLKSKESKGEDEEQASTQDSGESQSVEYPSR 266
Query: 1961 VSESTTTSSPE---SESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTT 2017
+ S E E +++ E +S +T E+ S S S E +
Sbjct: 267 KFFRKSRISEEDGRGELDDSNTMEVKSDST-----ENAGLSQSREHSR---SESQEDSEE 318
Query: 2018 SSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTT 2077
+ +S P SES S A + N+ +S+ + ++ +++ SE
Sbjct: 319 NQSQEDSQEVQDPSSES---SQEADLPSQENSSESQEEVVSESRGDNPDNTTSHSEDQED 375
Query: 2078 SSPASESTTTSSPASESTTTSSPA---SESTTTSSPESESTTTSSPASESTTIEEQGVSP 2134
S + E + + +SES +T A S + +SS ES +T +S ++ S
Sbjct: 376 SESSEEDSLDTPSSSESQSTEEQADSESNESLSSSEESPESTEDENSSSQEGLQSHSAST 435
Query: 2135 HSEKLSANEDPEEFPNED 2152
S + + + ED
Sbjct: 436 ESRSQESQSEQDSRSEED 453
Score = 41.6 bits (97), Expect = 0.005
Identities = 50/236 (21%), Positives = 89/236 (37%), Gaps = 14/236 (5%)
Query: 1911 ESESTTTSSPESESTTTSSLVSESTTTSSPE---SESTTTSSPESESTTTSSLVSESTTT 1967
E +++T S ES+S S + S E E +++ E +S +T E+
Sbjct: 247 EEQASTQDSGESQSVEYPSRKFFRKSRISEEDGRGELDDSNTMEVKSDST-----ENAGL 301
Query: 1968 SSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTT 2027
S S + S +SE + +S P SES+ + +S S+
Sbjct: 302 SQSREHSRSESQEDSEENQSQE---DSQEVQDPSSESS---QEADLPSQENSSESQEEVV 355
Query: 2028 ISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTT 2087
++ ++ SE + E + +SES ++ A + S +SE +
Sbjct: 356 SESRGDNPDNTTSHSEDQEDSESSEEDSLDTPSSSESQSTEEQADSESNESLSSSEESPE 415
Query: 2088 SSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSANE 2143
S+ S++ S S +T S ES + SE + Q S E ++ E
Sbjct: 416 STEDENSSSQEGLQSHSASTESRSQESQSEQDSRSEEDDSDSQDSSRSKEDSNSTE 471
Score = 34.2 bits (78), Expect = 1.0
Identities = 60/291 (20%), Positives = 99/291 (34%), Gaps = 41/291 (14%)
Query: 1932 SESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLV 1991
S S +S TT SS +S +S ++ + ++E S PE +T S
Sbjct: 130 GNSRLGSDEDSADTTQSSEDSTPQGENSAQDTTSESRDLDNEDEVDSRPEGGDSTQDSES 189
Query: 1992 SESTTTSSPESEST------------------TTISPVSESTTTSSPVSESTTTISPESE 2033
E E ES+ +T S S +S+ + + E +
Sbjct: 190 EEHWVGGGSEGESSHGDGSEFDDEGMQSDDPESTRSERGNSRMSSAGLKSKESKGEDEEQ 249
Query: 2034 STTTSSPASESTTTNNPK------------------SESTTTNNPASESITSSSPASEST 2075
++T S S+S + K S + + ++E+ S S
Sbjct: 250 ASTQDSGESQSVEYPSRKFFRKSRISEEDGRGELDDSNTMEVKSDSTENAGLSQSREHSR 309
Query: 2076 TTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVSPH 2135
+ S SE + + E SS +S+ S E+ S + SES + H
Sbjct: 310 SESQEDSEENQSQEDSQEVQDPSSESSQEADLPSQENSSESQEEVVSESRGDNPDNTTSH 369
Query: 2136 SEKLSANEDPEEFPNEDVFEHTFAEIPNIDHSNQTDEAIPETFDAREEWPQ 2186
SE +E E ED T + + Q D E+ + EE P+
Sbjct: 370 SEDQEDSESSE----EDSL-DTPSSSESQSTEEQADSESNESLSSSEESPE 415
Score = 31.6 bits (71), Expect = 6.6
Identities = 27/113 (23%), Positives = 51/113 (45%), Gaps = 3/113 (2%)
Query: 1865 DNYSEIIFTTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESE---STTTSSPE 1921
++ E T ++SES +S +E+ +++ EST N S+ + ++S E
Sbjct: 377 ESSEEDSLDTPSSSESQSTEEQADSESNESLSSSEESPESTEDENSSSQEGLQSHSASTE 436
Query: 1922 SESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESES 1974
S S + S + +S+ ++ S +S ST ++S E + E ES
Sbjct: 437 SRSQESQSEQDSRSEEDDSDSQDSSRSKEDSNSTESASSSEEDGQPKNTEIES 489
>gnl|CDD|239110 cd02619, Peptidase_C1, C1 Peptidase family (MEROPS database
nomenclature), also referred to as the papain family;
composed of two subfamilies of cysteine peptidases (CPs),
C1A (papain) and C1B (bleomycin hydrolase). Papain-like
enzymes are mostly endopeptidases with some exceptions
like cathepsins B, C, H and X, which are exopeptidases.
Papain-like CPs have different functions in various
organisms. Plant CPs are used to mobilize storage
proteins in seeds while mammalian CPs are primarily
lysosomal enzymes responsible for protein degradation in
the lysosome. Papain-like CPs are synthesized as inactive
proenzymes with N-terminal propeptide regions, which are
removed upon activation. Bleomycin hydrolase (BH) is a CP
that detoxifies bleomycin by hydrolysis of an amide
group. It acts as a carboxypeptidase on its C-terminus to
convert itself into an aminopeptidase and peptide ligase.
BH is found in all tissues in mammals as well as in many
other eukaryotes. It forms a hexameric ring barrel
structure with the active sites imbedded in the central
channel. Some members of the C1 family are proteins
classified as non-peptidase homologs which lack peptidase
activity or have missing active site residues.
Length = 223
Score = 50.6 bits (121), Expect = 3e-06
Identities = 42/245 (17%), Positives = 67/245 (27%), Gaps = 71/245 (28%)
Query: 2187 CKDVIGKVWDQGACQSCWVSHQPRTAGLKGLFSFIKYGQGQERTLSVWDKAISAASVMSD 2246
+ V +QG+ SCW A ++A +
Sbjct: 5 RPLRLTPVKNQGSRGSCW--------------------------------AFASAYALES 32
Query: 2247 RICIQSKGQVKPILSPQHLICSCTNCTRMHTKTPMSMCMGGDSAAAW-MYWINAGLVDGG 2305
I+ LSPQ+L C C GG +A G+
Sbjct: 33 AYRIKGGEDEYVDLSPQYLY----ICANDECLGINGSCDGGGPLSALLKLVALKGIPPEE 88
Query: 2306 D--YGTHDVSMGRYIEGIGHAASV-------------------MGSSNPEVNNFEKVIRL 2344
D YG E +AA V + P V F+
Sbjct: 89 DYPYGAESDGEEPKSEAALNAAKVKLKDYRRVLKNNIEDIKEALAKGGPVVAGFDVYSGF 148
Query: 2345 YSCEGSINPRYI------------HSVKIIGWGKS-SQNEPYWLCTNSYNQGWGEQGLFK 2391
+ I I H+V I+G+ + + + ++ NS+ WG+ G +
Sbjct: 149 DRLKEGIIYEEIVYLLYEDGDLGGHAVVIVGYDDNYVEGKGAFIVKNSWGTDWGDNGYGR 208
Query: 2392 IRRGV 2396
I
Sbjct: 209 ISYED 213
>gnl|CDD|216860 pfam02063, MARCKS, MARCKS family.
Length = 296
Score = 51.4 bits (122), Expect = 3e-06
Identities = 45/236 (19%), Positives = 78/236 (33%), Gaps = 31/236 (13%)
Query: 1900 PESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSS 1959
E + E +S E + + +E + + E E+ T S+ ++E T S
Sbjct: 69 TGKEEAASAAAAEEKEAAASTEPDKEPAEAEPAEPASPAEAEGEAAT-STEKAEDGATPS 127
Query: 1960 LVSES----------------TTTSSPESESTTTSSPESESTTTSSL-VSESTTTSSPES 2002
SE+ + S +++ E+E E ++PE+
Sbjct: 128 PSSETPKKKKKRFSFKKSFKLSGFSFKKNKKEAGEGAEAEGAAAEKEGAKEEAAAAAPEA 187
Query: 2003 ESTTTISPVSE----STTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTN 2058
S + E + E PE A E P++E
Sbjct: 188 GSGEEAAAPGEEAGAAGAEGEAGEEPAADAEPEQPEAKPEEAAPEK-----PQAEEAK-- 240
Query: 2059 NPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESES 2114
A E PA E+ SS A E+ A+ + ++P E+ + SSPE+
Sbjct: 241 -AAEEQKAEEKPAEEAGA-SSAAQEAPAAEQEAAPAEEPAAPPQEACSESSPEAPP 294
Score = 50.2 bits (119), Expect = 6e-06
Identities = 45/241 (18%), Positives = 81/241 (33%), Gaps = 29/241 (12%)
Query: 1904 STTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSP---ESESTTTSSL 1960
S E +++ E +S + E+E +SP E E+ T++
Sbjct: 63 SAPAEETGKEEAASAAAAEEKEAAASTEPDK---EPAEAEPAEPASPAEAEGEAATSTE- 118
Query: 1961 VSESTTTSSPESES----------------TTTSSPESESTTTSSLVSESTTTSSPES-E 2003
+E T SP SE+ + S +++ +E + E
Sbjct: 119 KAEDGATPSPSSETPKKKKKRFSFKKSFKLSGFSFKKNKKEAGEGAEAEGAAAEKEGAKE 178
Query: 2004 STTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASE 2063
+P + S ++ E E E+ + +E K E P +E
Sbjct: 179 EAAAAAPEAGSGEEAAAPGEEAGAAGAEGEAGEEPAADAEPEQPE-AKPEEAAPEKPQAE 237
Query: 2064 SITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASE 2123
+ A E PA E+ SS A E+ A+ + ++P E+ + SSP +
Sbjct: 238 E---AKAAEEQKAEEKPAEEAGA-SSAAQEAPAAEQEAAPAEEPAAPPQEACSESSPEAP 293
Query: 2124 S 2124
Sbjct: 294 P 294
Score = 45.2 bits (106), Expect = 3e-04
Identities = 49/253 (19%), Positives = 84/253 (33%), Gaps = 13/253 (5%)
Query: 1903 ESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESE-STTTSSLV 1961
E+ T P E+ SSP + + V + S +E+ ++ S
Sbjct: 12 EAATAERP-GEAAVASSPSKANGQENGHVKVNGDASPAAAEAGAKEELQANGSAPAEETG 70
Query: 1962 SESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPV 2021
E +++ E +S E + + +E + + E E+ T+ + +E T SP
Sbjct: 71 KEEAASAAAAEEKEAAASTEPDKEPAEAEPAEPASPAEAEGEAATS-TEKAEDGATPSPS 129
Query: 2022 SESTTTISPESESTTTSSPASESTTTNNPK--SESTTTNNPASESITSSSPASESTTTSS 2079
SE T + S S S + N K E A+E + A+ + +
Sbjct: 130 SE-TPKKKKKRFSFKKSFKLSGFSFKKNKKEAGEGAEAEGAAAEKEGAKEEAAAAAPEAG 188
Query: 2080 PASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVSPHSEKL 2139
E+ A + E + PE A E EE K
Sbjct: 189 SGEEAAAPGEEAGAAGAEGEAGEEPAADAEPEQPEAKPEEAAPEKPQAEEA-------KA 241
Query: 2140 SANEDPEEFPNED 2152
+ + EE P E+
Sbjct: 242 AEEQKAEEKPAEE 254
>gnl|CDD|233044 TIGR00600, rad2, DNA excision repair protein (rad2). All proteins in
this family for which functions are known are flap
endonucleases that generate the 3' incision next to DNA
damage as part of nucleotide excision repair. This family
is related to many other flap endonuclease families
including the fen1 family. This family is based on the
phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis,
Stanford University) [DNA metabolism, DNA replication,
recombination, and repair].
Length = 1034
Score = 52.6 bits (126), Expect = 3e-06
Identities = 53/289 (18%), Positives = 94/289 (32%), Gaps = 21/289 (7%)
Query: 1896 TTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESEST 1955
++ + + T+S E+ SL+ +T S SE T S
Sbjct: 457 LSSVNSKPEAVASTKIAREVTSSGHEAVPKAVQSLLLGATNDSPIPSEFTILDRKSELSI 516
Query: 1956 TTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSEST 2015
V ++ S+ + +E T +S + E +SP+
Sbjct: 517 --ERTVKPVSSEFGLPSQREDKLAIPTEGTQNLQGIS----DHPEQFEFQNELSPLETKN 570
Query: 2016 TTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASEST 2075
S+ S++ T SP E + SS S +N T NP S A E
Sbjct: 571 NESNLSSDAETEGSPNPEMPSWSSVTVPSEALDN-----YETTNP--------SNAKEVR 617
Query: 2076 TTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTS-SPASESTTIEEQGVSP 2134
+ ++T A ++ E + ESES + S S+T+E Q S
Sbjct: 618 NFAETGIQTTNVGESADLLLISNPMEVEPMESEKEESESDGSFIEVDSVSSTLELQVPSK 677
Query: 2135 HSEKLSANEDPEEFPNEDVFEHTFAEIPNIDHSNQTDEAIPETFDAREE 2183
+ E+ E + EI ++ ++ I + ++
Sbjct: 678 SQPTDESEENAEN-KVASIEGEHRKEIEDLLFDESEEDNIVGMIEEEKD 725
Score = 31.4 bits (71), Expect = 7.7
Identities = 29/133 (21%), Positives = 47/133 (35%), Gaps = 4/133 (3%)
Query: 1874 TNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSE 1933
+ N + T+ S +N T +P + N E+ TT+ ES S E
Sbjct: 584 SPNPEMPSWSSVTVPSEALDNYETTNPSNAKEVRNFAETGIQTTNVGESADLLLISNPME 643
Query: 1934 STTTSSPESESTTT--SSPESES-TTTSSLVSESTTTSSPESESTTTSSPESESTTTSSL 1990
S E E + + S E +S ++T L S + + ESE + S
Sbjct: 644 VEPMES-EKEESESDGSFIEVDSVSSTLELQVPSKSQPTDESEENAENKVASIEGEHRKE 702
Query: 1991 VSESTTTSSPESE 2003
+ + S E
Sbjct: 703 IEDLLFDESEEDN 715
>gnl|CDD|183756 PRK12799, motB, flagellar motor protein MotB; Reviewed.
Length = 421
Score = 51.3 bits (122), Expect = 5e-06
Identities = 29/118 (24%), Positives = 54/118 (45%), Gaps = 4/118 (3%)
Query: 1977 TSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESEST- 2035
+ S + T SS ++ S+ + ++++ S +TT +S V+ S+ + P +
Sbjct: 302 AAVTPSSAVTQSSAITPSSAAIPSPAVIPSSVTTQSATTTQASAVALSSAGVLPSDVTLP 361
Query: 2036 -TTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPAS 2092
T + PA+E +T T ++ +ITS+ A+ TT+ A S SP S
Sbjct: 362 GTVALPAAEPVNMQPQPMSTTETQQSSTGNITST--ANGPTTSLPAAPASNIPVSPTS 417
Score = 49.3 bits (117), Expect = 2e-05
Identities = 29/142 (20%), Positives = 60/142 (42%), Gaps = 4/142 (2%)
Query: 1893 ENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTT-TSSPESESTTTSSPE 1951
+N + ++ + + S + T SS ++ S+ SP ++ ++
Sbjct: 278 DNRALDIEKATGLKQIDTHGTVPVAAVTPSSAVTQSSAITPSSAAIPSPAVIPSSVTTQ- 336
Query: 1952 SESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPV 2011
S +TT +S V+ S+ P S+ T + + ++ + +T+ + ST I+
Sbjct: 337 SATTTQASAVALSSAGVLP-SDVTLPGTVALPAAEPVNMQPQPMSTTETQQSSTGNITST 395
Query: 2012 SESTTTSSPVS-ESTTTISPES 2032
+ TTS P + S +SP S
Sbjct: 396 ANGPTTSLPAAPASNIPVSPTS 417
Score = 48.9 bits (116), Expect = 3e-05
Identities = 25/131 (19%), Positives = 47/131 (35%), Gaps = 11/131 (8%)
Query: 1972 SESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPE 2031
+ +P S T +S++ S SP P S +T +++ S +S
Sbjct: 298 TVPVAAVTPSSAVTQSSAITPSSAAIPSPAV------IPSSVTTQSATTTQASAVALS-- 349
Query: 2032 SESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPA 2091
+ S + T P +E + + ++ + T++ A+ TT+ A
Sbjct: 350 -SAGVLPSDVTLPGTVALPAAEPVNMQPQPMSTTETQQSSTGNITST--ANGPTTSLPAA 406
Query: 2092 SESTTTSSPAS 2102
S SP S
Sbjct: 407 PASNIPVSPTS 417
Score = 44.7 bits (105), Expect = 5e-04
Identities = 31/132 (23%), Positives = 53/132 (40%), Gaps = 9/132 (6%)
Query: 2005 TTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASES 2064
T PV+ T +S+ S T S + + P+S +T + S S +
Sbjct: 295 THGTVPVAAVTPSSAVTQSSAITPSSAAIPSPAVIPSSVTTQSATTTQASAVA---LSSA 351
Query: 2065 ITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASES 2124
S + T + PA+E +T T ++ + T+++ TTS PA+ +
Sbjct: 352 GVLPSDVTLPGTVALPAAEPVNMQPQPMSTTETQQSSTGNITSTA---NGPTTSLPAAPA 408
Query: 2125 TTIEEQGVSPHS 2136
+ I VSP S
Sbjct: 409 SNIP---VSPTS 417
Score = 44.7 bits (105), Expect = 5e-04
Identities = 25/122 (20%), Positives = 46/122 (37%), Gaps = 3/122 (2%)
Query: 1932 SESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLV 1991
+ +P S T +S+ S S +S +TTT + ++ L
Sbjct: 298 TVPVAAVTPSSAVTQSSAITPSSAAIPS--PAVIPSSVTTQSATTTQASAVALSSAGVLP 355
Query: 1992 SESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPAS-ESTTTNNP 2050
S+ T + + ++ + +T+ ST I+ + TTS PA+ S +P
Sbjct: 356 SDVTLPGTVALPAAEPVNMQPQPMSTTETQQSSTGNITSTANGPTTSLPAAPASNIPVSP 415
Query: 2051 KS 2052
S
Sbjct: 416 TS 417
Score = 40.1 bits (93), Expect = 0.016
Identities = 19/95 (20%), Positives = 40/95 (42%), Gaps = 1/95 (1%)
Query: 2051 KSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSP 2110
+ P+S SS+ S SPA ++ ++ ++ +T S+ A S
Sbjct: 297 GTVPVAAVTPSSAVTQSSAITPSSAAIPSPAVIPSSVTTQSATTTQASAVALSSAGVLPS 356
Query: 2111 E-SESTTTSSPASESTTIEEQGVSPHSEKLSANED 2144
+ + T + PA+E ++ Q +S + S+ +
Sbjct: 357 DVTLPGTVALPAAEPVNMQPQPMSTTETQQSSTGN 391
>gnl|CDD|221173 pfam11702, DUF3295, Protein of unknown function (DUF3295). This
family is conserved in fungi but the function is not
known.
Length = 509
Score = 50.7 bits (121), Expect = 8e-06
Identities = 49/242 (20%), Positives = 85/242 (35%), Gaps = 26/242 (10%)
Query: 1889 SLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTS 1948
+L N +P S T P SEST T +P + + +EST+T+S + + S
Sbjct: 80 ALPMPNLAPITPPSSEPTPAPPSSESTATRTP--DPNQQALESTESTSTTSADCNDSEQS 137
Query: 1949 SPESESTTTSSLVSESTTTSSPESESTTTS----SPESESTTTSSLVSESTTTSSPESES 2004
S + + S T+TSS + +T+ SP S++ ST +
Sbjct: 138 STPNLN-------SSDTSTSSSGALPSTSVVRGFSPSHISSSYR-----STAQLNKAPSP 185
Query: 2005 TTTISPVSESTTTSSPVSE--STTTISPESESTTTSSPASESTTTNNPKSESTTTNNPAS 2062
T + P + + + T+ S S + ++ +PK S P
Sbjct: 186 TKSAEPTAAPQAKPELPKKKQAMFTLGGSSGDDDEDS-FEDRMSSQDPKRSSLPKPKPKM 244
Query: 2063 ESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPAS 2122
+ S +S + + T + + T + P T+ E T
Sbjct: 245 FQLGGSDELGKSLPSLMSPRKKTASFK--EQVVTRTFPER---TSDDDEDAIETEEDDVD 299
Query: 2123 ES 2124
ES
Sbjct: 300 ES 301
Score = 49.6 bits (118), Expect = 2e-05
Identities = 44/187 (23%), Positives = 75/187 (40%), Gaps = 9/187 (4%)
Query: 1944 STTTSSPESESTTTSSLVSESTTTSSPES----ESTTTSSPESESTTTSSLVSESTTTSS 1999
S + S SE ++S+ T + + +S +S E S
Sbjct: 11 SASVDSAASEEAVDIEHHTDSSPTDISRPRIVRQDSCSSRSRGRERHITSDDLEKMVLSI 70
Query: 2000 PESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTT--T 2057
E + ++ + +P S T P SEST T +P + +S STT
Sbjct: 71 KEKKDLEPLALPMPNLAPITPPSSEPTPAPPSSESTATRTPDPNQQALESTESTSTTSAD 130
Query: 2058 NNPASESITSSSPASESTTTSSPASESTTTS---SPASESTTTSSPASESTTTSSPESES 2114
N + +S T + +S+++T+SS A ST+ SP+ S++ S A + S +S
Sbjct: 131 CNDSEQSSTPNLNSSDTSTSSSGALPSTSVVRGFSPSHISSSYRSTAQLNKAPSPTKSAE 190
Query: 2115 TTTSSPA 2121
T + A
Sbjct: 191 PTAAPQA 197
Score = 46.5 bits (110), Expect = 1e-04
Identities = 46/232 (19%), Positives = 85/232 (36%), Gaps = 16/232 (6%)
Query: 1929 SLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSP---ESEST 1985
+L + +P S T + P SEST T + + +EST+T+S +SE +
Sbjct: 80 ALPMPNLAPITPPSSEPTPAPPSSESTATR--TPDPNQQALESTESTSTTSADCNDSEQS 137
Query: 1986 TTSSLVSESTTTSSPESESTTTI----SPVSESTTTSSPVSESTTTI-SPESESTTTSSP 2040
+T +L S T+TSS + +T++ SP S++ S + + +E T
Sbjct: 138 STPNLNSSDTSTSSSGALPSTSVVRGFSPSHISSSYRSTAQLNKAPSPTKSAEPTAAPQA 197
Query: 2041 ASESTTTNNPK-----SESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASEST 2095
E S + + ++S P S P S +S
Sbjct: 198 KPELPKKKQAMFTLGGSSGDDDEDSFEDRMSSQDPKRSSLPKPKPKMFQLGGSDELGKSL 257
Query: 2096 TTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSANEDPEE 2147
+ + T + + + T S+ + ++ SA ED ++
Sbjct: 258 PSLMSPRKKTASFKEQVVTRTFPERTSDDDEDAIETEEDDVDE-SAIEDDDD 308
Score = 44.9 bits (106), Expect = 5e-04
Identities = 48/242 (19%), Positives = 88/242 (36%), Gaps = 19/242 (7%)
Query: 1928 SSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTT 1987
+S E S E + + + + S T + P SEST T +P +
Sbjct: 59 TSDDLEKMVLSIKEKKDLEPLALPMPNLAPITPPSSEPTPAPPSSESTATRTP--DPNQQ 116
Query: 1988 SSLVSESTTTSSPESESTTTISPV---SESTTTSSPVSESTTTI----SPESESTTTSSP 2040
+ +EST+T+S + + S S T+TSS + +T++ SP S++ S
Sbjct: 117 ALESTESTSTTSADCNDSEQSSTPNLNSSDTSTSSSGALPSTSVVRGFSPSHISSSYRST 176
Query: 2041 ASESTTTNNPKSESTTTNNPASESITSSSP-------ASESTTTSSPASESTTTSSPASE 2093
A + +P + T P ++ S + ++ P
Sbjct: 177 AQLNKAP-SPTKSAEPTAAPQAKPELPKKKQAMFTLGGSSGDDDEDSFEDRMSSQDPKRS 235
Query: 2094 STTTSSPASESTTTSSPESESTTTS-SPASESTTIEEQGVS-PHSEKLSANEDPEEFPNE 2151
S P S +S + SP ++ + +EQ V+ E+ S +++ E
Sbjct: 236 SLPKPKPKMFQLGGSDELGKSLPSLMSPRKKTASFKEQVVTRTFPERTSDDDEDAIETEE 295
Query: 2152 DV 2153
D
Sbjct: 296 DD 297
Score = 43.0 bits (101), Expect = 0.002
Identities = 50/232 (21%), Positives = 81/232 (34%), Gaps = 32/232 (13%)
Query: 1892 SENTTTNSPESESTTTNNPE--------SESTTTSSP---ESESTTTSSLVSESTTTSSP 1940
S T P SEST T P+ +EST+T+S +SE ++T +L S T+TSS
Sbjct: 93 SSEPTPAPPSSESTATRTPDPNQQALESTESTSTTSADCNDSEQSSTPNLNSSDTSTSSS 152
Query: 1941 ESESTTTS----SPESESTTTSSLVSESTT---TSSPESESTTTSSPE------------ 1981
+ +T+ SP S++ S + T S E + + PE
Sbjct: 153 GALPSTSVVRGFSPSHISSSYRSTAQLNKAPSPTKSAEPTAAPQAKPELPKKKQAMFTLG 212
Query: 1982 -SESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSP 2040
S + ++ P+ S P S + +S ++ + T +
Sbjct: 213 GSSGDDDEDSFEDRMSSQDPKRSSLPKPKPKMFQLGGSDELGKSLPSLMSPRKKTASFKE 272
Query: 2041 ASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPAS 2092
+ T S+ E S A E S +S S +S
Sbjct: 273 QVVTRTFPERTSDDDEDAIETEEDDVDES-AIEDDDDDSDWEDSVEESGRSS 323
>gnl|CDD|222010 pfam13254, DUF4045, Domain of unknown function (DUF4045). This
presumed domain is functionally uncharacterized. This
domain family is found in bacteria and eukaryotes, and is
typically between 384 and 430 amino acids in length.
Length = 414
Score = 50.2 bits (120), Expect = 9e-06
Identities = 53/268 (19%), Positives = 76/268 (28%), Gaps = 32/268 (11%)
Query: 1878 SESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTT 1937
SE+T+V + + S P S S + S+ +
Sbjct: 79 SEATIVRQAKEG-------ERPATPPEARPDEGFVRPSLPSHPRSRSASVSNSKDGDRPS 131
Query: 1938 SSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTT- 1996
P S S T T + L S SP+ + PE + + S ++
Sbjct: 132 DLPPSPSKTMDPRRWSPTKATWLESALNKPESPKHKPQPPQQPEWKKDLSRLRQSRASVD 191
Query: 1997 ---TSSPESEST----TTISPVSESTTTSSPVSEST------------TTISPESESTTT 2037
T+S + + T P S S + S S T
Sbjct: 192 LGRTNSFKEVTPVGLMRTPPPGSHSKSPSKSGIPDLPSSRDSEKTKPEKPQQETSSMDTE 251
Query: 2038 SSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTT 2097
S A + T +PKS +E S S S AS + S S S
Sbjct: 252 KSSAPKPRETLDPKSPEKAPPIDTTEEELKSP--EASPKESEEASARKRSPSLLSPSPKA 309
Query: 2098 SSP---ASESTTTSSPESESTTTSSPAS 2122
SP AS + P S SP
Sbjct: 310 ESPKPLASPGKSPRDPLSPRPKPQSPPV 337
Score = 38.7 bits (90), Expect = 0.034
Identities = 37/163 (22%), Positives = 59/163 (36%), Gaps = 18/163 (11%)
Query: 1897 TNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTT 1956
TNS + + T P S S + S +S P S + + PE
Sbjct: 195 TNSFKEVTPVG------LMRTPPPGSHSKSPS----KSGIPDLPSSRDSEKTKPEKPQQE 244
Query: 1957 TSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTT 2016
TSS+ +E ++ P S ++ T+ +S S ESE S
Sbjct: 245 TSSMDTEKSSAPKPRETLDPKSPEKAPPIDTTEEELKSPEASPKESE------EASARKR 298
Query: 2017 TSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNN 2059
+ S +S S SP+ ++ SP + PK +S N+
Sbjct: 299 SPSLLSPSPKAESPKPLASPGKSP--RDPLSPRPKPQSPPVND 339
Score = 36.4 bits (84), Expect = 0.22
Identities = 46/280 (16%), Positives = 85/280 (30%), Gaps = 36/280 (12%)
Query: 1928 SSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTT 1987
S VS+ + P S S + ++ + ++S + + SS S +
Sbjct: 21 SDSVSKRWSAQLPSGLSRGNSFLSNRNSDAAPSGTDSLSGRPASRLNREPSSRPGSSHSE 80
Query: 1988 SSLVSESTTTSSPESESTTTISPV-SESTTTSSPVSESTTTISPESESTTTSSPASESTT 2046
+++V ++ P + + S P S S + + + + P S S T
Sbjct: 81 ATIVRQAKEGERPATPPEARPDEGFVRPSLPSHPRSRSASVSNSKDGDRPSDLPPSPSKT 140
Query: 2047 TNNPKSESTTT--------NNPAS---------------------ESITSSSPASESTTT 2077
+ P+ S T N P S +S S ++
Sbjct: 141 MD-PRRWSPTKATWLESALNKPESPKHKPQPPQQPEWKKDLSRLRQSRASVDLGRTNSFK 199
Query: 2078 SSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVSPHSE 2137
T P S S + S SS +SE T P E+++++ + S
Sbjct: 200 EVTPVGLMRTPPPGSHSKSPSKSGIPD-LPSSRDSEKTKPEKPQQETSSMDTE---KSSA 255
Query: 2138 KLSANEDPEEFPNEDVFEHTFAEIPNI-DHSNQTDEAIPE 2176
+ P + T E + S + E
Sbjct: 256 PKPRETLDPKSPEKAPPIDTTEEELKSPEASPKESEEASA 295
Score = 35.6 bits (82), Expect = 0.34
Identities = 49/256 (19%), Positives = 77/256 (30%), Gaps = 19/256 (7%)
Query: 1876 NNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSEST 1935
S +S NS + + T + ++ N E S SS SE+T
Sbjct: 35 GLSRGNSFLSNRNSDAAPSGTDSLSGRPASRLNR-EPSSRPGSSH-SEATIVRQAKEGER 92
Query: 1936 TTSSPESE-------STTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTS 1988
+ PE+ + S P S S + S+ + P S S T T +
Sbjct: 93 PATPPEARPDEGFVRPSLPSHPRSRSASVSNSKDGDRPSDLPPSPSKTMDPRRWSPTKAT 152
Query: 1989 SLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTN 2048
L S SP+ + P E S + +S ++ ++ T
Sbjct: 153 WLESALNKPESPKHKPQPPQQP--EWKKDLSRLRQSRASVDLGRTNSFKEVTPVGLMRTP 210
Query: 2049 NPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTS 2108
P S S + S P S+ S +S T S A + T
Sbjct: 211 PPGSHSKS-------PSKSGIPDLPSSRDSEKTKPEKPQQETSSMDT-EKSSAPKPRETL 262
Query: 2109 SPESESTTTSSPASES 2124
P+S +E
Sbjct: 263 DPKSPEKAPPIDTTEE 278
>gnl|CDD|236792 PRK10905, PRK10905, cell division protein DamX; Validated.
Length = 328
Score = 49.6 bits (118), Expect = 1e-05
Identities = 40/243 (16%), Positives = 86/243 (35%), Gaps = 39/243 (16%)
Query: 1910 PESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSS 1969
P + S+ ++ +S + ++ P + T+S E + T VS +S+
Sbjct: 23 PSTSSSDQTASGEKSIDLAGNATDQANGVQP---APGTTSAEQTAGNTQQDVSLPPISST 79
Query: 1970 PESESTTTSSPESESTT------------------TSSLVSESTTTSSPESESTTTISPV 2011
P ++ T + + + +++ ST + P T++PV
Sbjct: 80 P-TQGQTPVATDGQQRVEVQGDLNNALTQPQNQQQLNNVAVNSTLPTEP-----ATVAPV 133
Query: 2012 ----SESTTTSSPVSESTTTISP-------ESESTTTSSPASESTTTNNPK-SESTTTNN 2059
+ T + +E T P E + ++ PK +E
Sbjct: 134 RNGNASRQTAKTQTAERPATTRPARKQAVIEPKKPQATAKTEPKPVAQTPKRTEPAAPVA 193
Query: 2060 PASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSS 2119
+S+PA + T T++P ++ + A+ + + + + S+P S T S
Sbjct: 194 STKAPAATSTPAPKETATTAPVQTASPAQTTATPAAGGKTAGNVGSLKSAPSSHYTLQLS 253
Query: 2120 PAS 2122
+S
Sbjct: 254 SSS 256
Score = 42.2 bits (99), Expect = 0.002
Identities = 31/218 (14%), Positives = 70/218 (32%), Gaps = 17/218 (7%)
Query: 1939 SPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTS 1998
+P + S+ ++ +S + ++ P +T+ + SL S+T +
Sbjct: 22 APSTSSSDQTASGEKSIDLAGNATDQANGVQPAPGTTSAEQTAGNTQQDVSLPPISSTPT 81
Query: 1999 SPESESTT--------------TISPVSESTTTSSPVSESTTTISPESESTTTSSPASES 2044
++ T ++ ++ ST P + + + AS
Sbjct: 82 QGQTPVATDGQQRVEVQGDLNNALTQPQNQQQLNNVAVNSTLPTEPATVAPVRNGNASRQ 141
Query: 2045 TTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASES 2104
T TT PA + ++T + P + + +E +
Sbjct: 142 TAKTQTAERPATTR-PARKQAVIEPKKPQATAKTEP--KPVAQTPKRTEPAAPVASTKAP 198
Query: 2105 TTTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSAN 2142
TS+P + T T++P ++ + K + N
Sbjct: 199 AATSTPAPKETATTAPVQTASPAQTTATPAAGGKTAGN 236
Score = 36.1 bits (83), Expect = 0.22
Identities = 28/154 (18%), Positives = 52/154 (33%), Gaps = 17/154 (11%)
Query: 1908 NNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSP------ESESTTTSSLV 1961
NN ST + P + + + S T + TT E + ++
Sbjct: 115 NNVAVNSTLPTEPATVAPVRNGNASRQTAKTQTAERPATTRPARKQAVIEPKKPQATAKT 174
Query: 1962 SESTTTSSPE-SESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSP 2020
+P+ +E + TS+ + T T++P T SP +++T T +
Sbjct: 175 EPKPVAQTPKRTEPAAPVASTKAPAATSTPAPKETATTAP----VQTASP-AQTTATPAA 229
Query: 2021 VSESTTTIS-----PESESTTTSSPASESTTTNN 2049
++ + P S T S +S N
Sbjct: 230 GGKTAGNVGSLKSAPSSHYTLQLSSSSNYDNLNG 263
Score = 35.7 bits (82), Expect = 0.28
Identities = 16/102 (15%), Positives = 35/102 (34%), Gaps = 8/102 (7%)
Query: 1892 SENTTTNSPESEST--------TTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESE 1943
+E T P + T E + + +E + TS+P +
Sbjct: 148 AERPATTRPARKQAVIEPKKPQATAKTEPKPVAQTPKRTEPAAPVASTKAPAATSTPAPK 207
Query: 1944 STTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESEST 1985
T T++P ++ + + + + + + S+P S T
Sbjct: 208 ETATTAPVQTASPAQTTATPAAGGKTAGNVGSLKSAPSSHYT 249
Score = 33.0 bits (75), Expect = 1.9
Identities = 14/87 (16%), Positives = 30/87 (34%)
Query: 1896 TTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESEST 1955
T E + +E + TS+ + T T++P ++ + + +
Sbjct: 170 ATAKTEPKPVAQTPKRTEPAAPVASTKAPAATSTPAPKETATTAPVQTASPAQTTATPAA 229
Query: 1956 TTSSLVSESTTTSSPESESTTTSSPES 1982
+ + + S+P S T S S
Sbjct: 230 GGKTAGNVGSLKSAPSSHYTLQLSSSS 256
>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
Length = 3151
Score = 50.7 bits (121), Expect = 1e-05
Identities = 34/242 (14%), Positives = 68/242 (28%), Gaps = 23/242 (9%)
Query: 1897 TNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTT 1956
T PE + + + + SSP ++ + +
Sbjct: 2636 NEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRAARPTVGSL 2695
Query: 1957 TSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSES-- 2014
TS P T +P + S+ ++ ++ +P +
Sbjct: 2696 TSL-------ADPPPPPPTPEPAPHA---LVSATPLPPGPAAARQASPALPAAPAPPAVP 2745
Query: 2015 TTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASES 2074
++P + P TT+ P + + PA S++ S + S
Sbjct: 2746 AGPATPGGPARPARPP-----TTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPS 2800
Query: 2075 TTTSSPASESTTTSSPASESTTTSSPASES--TTTSSPESESTTTSSPASESTTIEEQGV 2132
PA +PA+ +SPA T++ P + + V
Sbjct: 2801 P--WDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPP--PGPPPPSLPLGGSV 2856
Query: 2133 SP 2134
+P
Sbjct: 2857 AP 2858
Score = 44.9 bits (106), Expect = 8e-04
Identities = 32/287 (11%), Positives = 67/287 (23%), Gaps = 33/287 (11%)
Query: 1895 TTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESES 1954
+P + P + P + + + + P + + SES
Sbjct: 2736 PAAPAPPAVPAGPATPGGP-ARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSES 2794
Query: 1955 TTTSSLVSE----STTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTI-- 2008
+ + +P + +SP +S + + +
Sbjct: 2795 RESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGG 2854
Query: 2009 -----SPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASE 2063
V + SP ++ P + + + + P + P +
Sbjct: 2855 SVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAP 2914
Query: 2064 SITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTT----------------- 2106
P P P TT + A E +
Sbjct: 2915 PPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAV 2974
Query: 2107 ----TSSPESESTTTSSPASESTTIEEQGVSPHSEKLSANEDPEEFP 2149
P +S T VS + L+ +E+ + P
Sbjct: 2975 PRFRVPQPAPSREAPASSTPPLTGHSLSRVSSWASSLALHEETDPPP 3021
Score = 42.6 bits (100), Expect = 0.004
Identities = 26/254 (10%), Positives = 63/254 (24%), Gaps = 13/254 (5%)
Query: 1910 PESESTTTSSPESESTTTSSLVSES---TTTSSPESESTTTSSPESESTTTSSLVSESTT 1966
PE S ++ S +P + ++P + +
Sbjct: 2708 PEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPP 2767
Query: 1967 TSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTI---SPVSESTTTSSPVSE 2023
+P + +SL + SP + +P + +SP
Sbjct: 2768 APAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGP 2827
Query: 2024 STTTISPESESTTTSSPASESTTT-------NNPKSESTTTNNPASESITSSSPASESTT 2076
S + + + + +PA++ + P
Sbjct: 2828 LPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLA 2887
Query: 2077 TSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVSPHS 2136
+ + + + + P + P + P+ P + ++P +
Sbjct: 2888 RPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTT 2947
Query: 2137 EKLSANEDPEEFPN 2150
+ A E P
Sbjct: 2948 DPAGAGEPSGAVPQ 2961
Score = 41.8 bits (98), Expect = 0.006
Identities = 28/231 (12%), Positives = 58/231 (25%), Gaps = 11/231 (4%)
Query: 1910 PESESTTTSSPES-ESTTTSSLVSESTTTSSPESESTTTSSPESESTTT---SSLVSEST 1965
P T +P + S T + +SP + T + +
Sbjct: 2702 PPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPP 2761
Query: 1966 TTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSEST 2025
TT+ P + + + + S + S S + + +
Sbjct: 2762 TTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPA 2821
Query: 2026 TTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASEST 2085
+ + T++ P + P S S+ + SPA++
Sbjct: 2822 ASPAGPLPPPTSAQPTAPPPPPG-PPPPSLPLGG----SVAPGGDVRRRPPSRSPAAKPA 2876
Query: 2086 TTSSPASESTTTS--SPASESTTTSSPESESTTTSSPASESTTIEEQGVSP 2134
+ P S ++ES + E + P
Sbjct: 2877 APARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPP 2927
Score = 40.3 bits (94), Expect = 0.019
Identities = 28/214 (13%), Positives = 57/214 (26%), Gaps = 13/214 (6%)
Query: 1937 TSSPESESTTTSSPESESTTTSSLVSESTT-TSSPESESTTTSSPESESTTTSSLVSEST 1995
T P + +++P + S + +P + ++P + +
Sbjct: 2707 TPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPAR-----PARPP 2761
Query: 1996 TTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSEST 2055
TT+ P + + + P S + S + +
Sbjct: 2762 TTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPA 2821
Query: 2056 TTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESEST 2115
+ TS+ P + P S P S + SP ++
Sbjct: 2822 ASPAGPLPPPTSAQPTA-----PPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPA 2876
Query: 2116 TTSSPASESTTIEEQGVSPHSEKLSANEDPEEFP 2149
+ P + VS +E + D E P
Sbjct: 2877 APARPPVRR--LARPAVSRSTESFALPPDQPERP 2908
>gnl|CDD|218673 pfam05642, Sporozoite_P67, Sporozoite P67 surface antigen. This
family consists of several Theileria P67 surface
antigens. A stage specific surface antigen of Theileria
parva, p67, is the basis for the development of an
anti-sporozoite vaccine for the control of East Coast
fever (ECF) in cattle. The antigen has been shown to
contain five distinct linear peptide sequences recognised
by sporozoite-neutralising murine monoclonal antibodies.
Length = 727
Score = 50.1 bits (119), Expect = 2e-05
Identities = 57/284 (20%), Positives = 91/284 (32%), Gaps = 25/284 (8%)
Query: 1873 TTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPES-----ESTTT 1927
T S++T V + S ++ T +P SE T + + + S + + T
Sbjct: 52 TVGALSKATKVWKSAVSSSDDSKTVPTPVSEPNITRSFQEPVSQESEVQDNTEQNQDTKG 111
Query: 1928 SSLVSESTTTSSPESESTTTSSP-ESESTTTSSLVSESTT-TSSPESESTTTS----SPE 1981
S SE S E ++ +TSS S T VS S+ T+S +T S
Sbjct: 112 SKTDSEEDDDDSEEEDNKSTSSKDGKGSKKTQPGVSTSSGSTTSGTDLNTKQSQTGLGAS 171
Query: 1982 SESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPA 2041
VS+S P P V +SP
Sbjct: 172 GSHAQQDPAVSQSGVVGVPGLGVPGVGVPGGGGAGALPGVGVGRAGVSPGVGVGGLGGVP 231
Query: 2042 SESTTTNNPKSESTTTNNPASESITSS--------------SPASESTTTSSPASESTTT 2087
+N E T ++ + S +S STT S ++ +TT
Sbjct: 232 GVGILASNTSREGQTQDDQERDGDGRVIEPGVGLPGVRVGDSTSSPSTTRPSGSTTTTTP 291
Query: 2088 SSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQG 2131
+S + +S + T S +S S SP + + G
Sbjct: 292 ASSGPSAPGGPGSSSRNAVTRSTDSISGPIPSPGAPRAITGQMG 335
Score = 43.1 bits (101), Expect = 0.002
Identities = 39/241 (16%), Positives = 72/241 (29%), Gaps = 21/241 (8%)
Query: 1913 ESTTTSSPESESTTTSSLVSESTTTSSPES-----ESTTTSSPESESTTTSSLVSESTTT 1967
+S T +P SE T S + S + + T S +SE S ++ +T
Sbjct: 72 DSKTVPTPVSEPNITRSFQEPVSQESEVQDNTEQNQDTKGSKTDSEEDDDDSEEEDNKST 131
Query: 1968 SSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTT 2027
SS + + + + P +++ S+ T +S T + VS+S
Sbjct: 132 SSKDGKGSKKTQPGVSTSSGSTTSGTDLNTK----QSQTGLGASGSHAQQDPAVSQSGVV 187
Query: 2028 ISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTT 2087
P P +P S+ + E T
Sbjct: 188 GVPGLGVPGVGVPGGGGAGALPGVGVGRAGVSPGVGVGGLGGVPGVGILASNTSREGQTQ 247
Query: 2088 SSPASESTTTSS------------PASESTTTSSPESESTTTSSPASESTTIEEQGVSPH 2135
+ ++ S +T+ P +TTT+ +S + G S
Sbjct: 248 DDQERDGDGRVIEPGVGLPGVRVGDSTSSPSTTRPSGSTTTTTPASSGPSAPGGPGSSSR 307
Query: 2136 S 2136
+
Sbjct: 308 N 308
Score = 43.1 bits (101), Expect = 0.002
Identities = 44/193 (22%), Positives = 76/193 (39%), Gaps = 28/193 (14%)
Query: 1940 PESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSS 1999
P ES TS P S LV+ + + P + T S++T V +S +SS
Sbjct: 22 PAGESPRTSKP-------SPLVTLESAITQPSKDPFKTVGALSKATK----VWKSAVSSS 70
Query: 2000 PESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNN 2059
+S T +PVSE T S +S ESE N +++ T +
Sbjct: 71 --DDSKTVPTPVSEPNITRS----FQEPVSQESE-----------VQDNTEQNQDTKGSK 113
Query: 2060 PASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSS 2119
SE S ++ +TSS + + + P +++ S+ + T ++ + S
Sbjct: 114 TDSEEDDDDSEEEDNKSTSSKDGKGSKKTQPGVSTSSGSTTSGTDLNTKQSQTGLGASGS 173
Query: 2120 PASESTTIEEQGV 2132
A + + + GV
Sbjct: 174 HAQQDPAVSQSGV 186
Score = 41.2 bits (96), Expect = 0.008
Identities = 52/295 (17%), Positives = 86/295 (29%), Gaps = 35/295 (11%)
Query: 1898 NSPESESTTTNNPESESTTTSS---PESESTTTSSLVSEST----TTSSPESESTTTSSP 1950
P ES T+ P T S+ P + T +S++T + S +S T +P
Sbjct: 20 KMPAGESPRTSKPSPLVTLESAITQPSKDPFKTVGALSKATKVWKSAVSSSDDSKTVPTP 79
Query: 1951 ESESTTTSSLVSESTTTSSPES-----ESTTTSSPESESTTTSSLVSESTTTSSPESEST 2005
SE T S + S + + T S +SE S ++ +TSS + + +
Sbjct: 80 VSEPNITRSFQEPVSQESEVQDNTEQNQDTKGSKTDSEEDDDDSEEEDNKSTSSKDGKGS 139
Query: 2006 TTISPVSESTTTSSPVSESTTT------ISPESESTTTSSPASESTTTNNPKSESTTTNN 2059
P +++ S+ T + S+S P
Sbjct: 140 KKTQPGVSTSSGSTTSGTDLNTKQSQTGLGASGSHAQQDPAVSQSGVVGVPGLGVPGVGV 199
Query: 2060 PASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESE------ 2113
P + SP S+ + E T E +
Sbjct: 200 PGGGGAGALPGVGVGRAGVSPGVGVGGLGGVPGVGILASNTSREGQTQDDQERDGDGRVI 259
Query: 2114 -----------STTTSSPASESTTIEEQGVSPHSEKLSANEDPEEFPNEDVFEHT 2157
+TSSP++ + +P S SA P V T
Sbjct: 260 EPGVGLPGVRVGDSTSSPSTTRPSGSTTTTTPASSGPSAPGGPGSSSRNAVTRST 314
>gnl|CDD|139494 PRK13335, PRK13335, superantigen-like protein; Reviewed.
Length = 356
Score = 49.4 bits (117), Expect = 2e-05
Identities = 32/168 (19%), Positives = 64/168 (38%), Gaps = 8/168 (4%)
Query: 1986 TTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASEST 2045
TT S+ +E ++ + T ++ T+ S +T + E T A +
Sbjct: 24 TTQSVKAEKIQSTKVDKVPTLKAERLAMINITAGANSATTQAANTRQERTPKLEKAPNTN 83
Query: 2046 TTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASEST 2105
S+ + P E S + ++ + +T +++P ++ TT P S +T
Sbjct: 84 EEKTSASKIEKISQPKQEEQKSLNISATPAPKQEQSQTTTESTTPKTKVTT---PPSTNT 140
Query: 2106 TTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSA--NEDPEEFPNE 2151
++S T SP + + ++P E L A + EF +
Sbjct: 141 PQPMQSTKSDTPQSPTIKQAQTD---MTPKYEDLRAYYTKPSFEFEKQ 185
Score = 41.7 bits (97), Expect = 0.004
Identities = 31/175 (17%), Positives = 70/175 (40%), Gaps = 26/175 (14%)
Query: 1851 ISMLAATAVAISVIDNYSEIIFTTNNNSES-----TVVMSTLN---------SLLSENTT 1896
+ +A T++A+ ++ + + T + +E + TL + + + T
Sbjct: 3 MRTIAKTSLALGLLTTGAITVTTQSVKAEKIQSTKVDKVPTLKAERLAMINITAGANSAT 62
Query: 1897 TNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTT 1956
T + + T E +P + TS+ S+ S P+ E + + +
Sbjct: 63 TQAANTRQERTPKLE------KAPNTNEEKTSA--SKIEKISQPKQEEQKSLNISATPAP 114
Query: 1957 TSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPES-ESTTTISP 2010
+T +++P+++ TT P S +T ++S T SP ++ T ++P
Sbjct: 115 KQEQSQTTTESTTPKTKVTT---PPSTNTPQPMQSTKSDTPQSPTIKQAQTDMTP 166
Score = 40.5 bits (94), Expect = 0.008
Identities = 29/144 (20%), Positives = 54/144 (37%), Gaps = 7/144 (4%)
Query: 1946 TTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESEST 2005
TT S ++E ++ + T + + T+ + S TT + + T E
Sbjct: 24 TTQSVKAEKIQSTKVDKVPTLKAERLAMINITAG--ANSATTQAANTRQERTPKLEKAPN 81
Query: 2006 TTISPVSES--TTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASE 2063
T S S S P E +++ + + +T + PK++ TT P S
Sbjct: 82 TNEEKTSASKIEKISQPKQEEQKSLNISATPAPKQEQSQTTTESTTPKTKVTT---PPST 138
Query: 2064 SITSSSPASESTTTSSPASESTTT 2087
+ +++S T SP + T
Sbjct: 139 NTPQPMQSTKSDTPQSPTIKQAQT 162
Score = 40.5 bits (94), Expect = 0.010
Identities = 29/151 (19%), Positives = 58/151 (38%), Gaps = 18/151 (11%)
Query: 1896 TTNSPESESTTTNNPESESTTTS--------SPESESTTTSSLVSESTTTSSPESESTTT 1947
TT S ++E + + T + + + S TT + + T E T
Sbjct: 24 TTQSVKAEKIQSTKVDKVPTLKAERLAMINITAGANSATTQAANTRQERTPKLEKAPNT- 82
Query: 1948 SSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTT-----TSSPES 2002
+E T++S + + + E +S S+ + S +ESTT T+ P +
Sbjct: 83 ----NEEKTSASKIEKISQPKQEEQKSLNISATPAPKQEQSQTTTESTTPKTKVTTPPST 138
Query: 2003 ESTTTISPVSESTTTSSPVSESTTTISPESE 2033
+ + T S + ++ T ++P+ E
Sbjct: 139 NTPQPMQSTKSDTPQSPTIKQAQTDMTPKYE 169
Score = 37.8 bits (87), Expect = 0.058
Identities = 25/151 (16%), Positives = 51/151 (33%), Gaps = 8/151 (5%)
Query: 1926 TTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESEST 1985
TT S+ +E ++ + T + + T+ + S TT + + T E
Sbjct: 24 TTQSVKAEKIQSTKVDKVPTLKAERLAMINITAG--ANSATTQAANTRQERTPKLEKAPN 81
Query: 1986 TTSSLVSES---TTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPAS 2042
T S S + + E + + + TTT S ++ T+ P++
Sbjct: 82 TNEEKTSASKIEKISQPKQEEQKSLNISATPAPKQEQS---QTTTESTTPKTKVTTPPST 138
Query: 2043 ESTTTNNPKSESTTTNNPASESITSSSPASE 2073
+ T + ++ T +P E
Sbjct: 139 NTPQPMQSTKSDTPQSPTIKQAQTDMTPKYE 169
>gnl|CDD|185513 PTZ00203, PTZ00203, cathepsin L protease; Provisional.
Length = 348
Score = 48.9 bits (116), Expect = 2e-05
Identities = 20/60 (33%), Positives = 33/60 (55%), Gaps = 4/60 (6%)
Query: 2344 LYSCEGSINPRYIHSVKIIGWGKSSQNEPYWLCTNSYNQGWGEQGLFKIRRGVNMCSIED 2403
L SC G + H V ++G+ + + PYW+ NS+ + WGE+G ++ GVN C +
Sbjct: 278 LTSCIGE---QLNHGVLLVGYNMTGE-VPYWVIKNSWGEDWGEKGYVRVTMGVNACLLTG 333
Score = 32.0 bits (72), Expect = 4.4
Identities = 14/32 (43%), Positives = 18/32 (56%), Gaps = 5/32 (15%)
Query: 2173 AIPETFDAREEWPQCKDVIGKVWDQGACQSCW 2204
A+P+ D RE K + V +QGAC SCW
Sbjct: 125 AVPDAVDWRE-----KGAVTPVKNQGACGSCW 151
>gnl|CDD|220749 pfam10428, SOG2, RAM signalling pathway protein. SOG2 proteins in
Saccharomyces cerevisiae are involved in cell separation
and cytokinesis.
Length = 419
Score = 49.0 bits (117), Expect = 3e-05
Identities = 39/233 (16%), Positives = 80/233 (34%), Gaps = 22/233 (9%)
Query: 1861 ISVIDNYSEII---------FTTNNNSE--STVVMSTLNSLLSENTTTNSPESESTTTNN 1909
++ + + II F N + T+++ S++ +S
Sbjct: 98 LTCVSAFRHIISLLRKNLDAFFDNGDVRYIRTLLLMLYGSIMELRNAWSSLGPPLQHRKR 157
Query: 1910 PESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSS 1969
++ +S + + L S T + S++ S + +T S + TT
Sbjct: 158 DAVTASPSSMIARNTPISDRLRPRSVTPTRGRRPSSSPRSLSNPTTLESPSNLQVTTDVP 217
Query: 1970 PESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTIS 2029
P + T+ S S+ S++S T S ES +T T+ SS ++ +
Sbjct: 218 PPYSNGTSRSSTMSSSANLSIISSLATPRSGESFRST-------PTSGSSSINPVSGLDE 270
Query: 2030 PESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPAS 2082
E + T T+ + +E + S AS ++ +P+
Sbjct: 271 AEEDRIDEQLFLKLRTATDM----ALRVLPQLTEQFSKSLIASTTSRNITPSL 319
Score = 38.2 bits (89), Expect = 0.058
Identities = 30/152 (19%), Positives = 50/152 (32%), Gaps = 20/152 (13%)
Query: 1956 TTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSEST 2015
SSL ++ +S + + L S T + S++ S + +T
Sbjct: 144 AWSSLGPPLQHRKRDAVTASPSSMIARNTPISDRLRPRSVTPTRGRRPSSSPRSLSNPTT 203
Query: 2016 TTSSPVSESTTTI-SPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASES 2074
S + TT + P S T+ SS S S + S +T P S
Sbjct: 204 LESPSNLQVTTDVPPPYSNGTSRSSTMSSSANLSIISSLAT--------------PRSGE 249
Query: 2075 TTTSSPASESTTTSSPASESTTTSSPASESTT 2106
+ S+P T+ S + + A E
Sbjct: 250 SFRSTP-----TSGSSSINPVSGLDEAEEDRI 276
Score = 37.0 bits (86), Expect = 0.12
Identities = 29/155 (18%), Positives = 58/155 (37%), Gaps = 17/155 (10%)
Query: 2004 STTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASE 2063
+ +++ P + + + ++ I+ + + P S + T S S + + +
Sbjct: 144 AWSSLGPPLQHRKRDAVTASPSSMIARNTPISDRLRPRSVTPTRGRRPSSSPRSLSNPT- 202
Query: 2064 SITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASE 2123
++ S S +T P S T+ SS S S S +S +T S ES +T +S
Sbjct: 203 TLESPSNLQVTTDVPPPYSNGTSRSSTMSSSANLSIISSLATP-RSGESFRSTPTS---- 257
Query: 2124 STTIEEQGVSPHSEKLSANEDPEEFPNEDVFEHTF 2158
S ++ +E + + E F
Sbjct: 258 -----------GSSSINPVSGLDEAEEDRIDEQLF 281
>gnl|CDD|215130 PLN02217, PLN02217, probable pectinesterase/pectinesterase inhibitor.
Length = 670
Score = 49.3 bits (117), Expect = 3e-05
Identities = 34/112 (30%), Positives = 51/112 (45%), Gaps = 5/112 (4%)
Query: 1897 TNSPESESTTTNNPESESTTTSSPESESTTTSSLVS-ESTTTSSPESESTTTSSPESEST 1955
+P S ++T + S TT S +S ST + S + SP + + SP + S
Sbjct: 563 AGNPGSTNSTPTGSAASSNTTFSSDSPSTVVAPSTSPPAGHLGSPPATPSKIVSPST-SP 621
Query: 1956 TTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTT 2007
S L S STT SSPES S+ + + S + ++T SS S +T
Sbjct: 622 PASHLGSPSTTPSSPESSIKVASTE---TASPESSIKVASTESSVSMVSMST 670
Score = 48.9 bits (116), Expect = 4e-05
Identities = 36/122 (29%), Positives = 52/122 (42%), Gaps = 17/122 (13%)
Query: 1907 TNNPESESTTTSSPESESTTTSSLVSESTTTS---SPESESTTTSSPESESTTTSSLVSE 1963
NP S ++T + + S TT S S ST + SP + S P + S S
Sbjct: 563 AGNPGSTNSTPTGSAASSNTTFSSDSPSTVVAPSTSPPA-GHLGSPPATPSKIVSP---- 617
Query: 1964 STTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSE 2023
+TS P S S STT SS S S+ + ++I V+ + ++ S VS
Sbjct: 618 --STSPPAS------HLGSPSTTPSSPESSIKVASTETASPESSIK-VASTESSVSMVSM 668
Query: 2024 ST 2025
ST
Sbjct: 669 ST 670
Score = 48.5 bits (115), Expect = 4e-05
Identities = 33/104 (31%), Positives = 48/104 (46%), Gaps = 2/104 (1%)
Query: 1932 SESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLV 1991
+ ST T S S +TT SS + S + SP + + SP + S S L
Sbjct: 569 TNSTPTGSAASSNTTFSSDSPSTVVAPSTSPPAGHLGSPPATPSKIVSPST-SPPASHLG 627
Query: 1992 SESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESEST 2035
S STT SSPES + + S +S V+ + +++S S ST
Sbjct: 628 SPSTTPSSPESSIKVASTE-TASPESSIKVASTESSVSMVSMST 670
Score = 46.2 bits (109), Expect = 2e-04
Identities = 35/114 (30%), Positives = 47/114 (41%), Gaps = 12/114 (10%)
Query: 1937 TSSPESESTTTSSPESESTTTSSLVSESTTTS---SPESESTTTSSPESESTTTSSLVSE 1993
+P S ++T + + S TT S S ST + SP + S P + S S S
Sbjct: 563 AGNPGSTNSTPTGSAASSNTTFSSDSPSTVVAPSTSPPA-GHLGSPPATPSKIVSP--ST 619
Query: 1994 STTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTT 2047
S S S STT SP S S+ T SPES S+ +S S +
Sbjct: 620 SPPASHLGSPSTTPSSPESSIKVASTE------TASPESSIKVASTESSVSMVS 667
Score = 45.9 bits (108), Expect = 3e-04
Identities = 32/99 (32%), Positives = 44/99 (44%), Gaps = 3/99 (3%)
Query: 2047 TNNPKSESTTTNNPASESITSSSPASESTTTSSPAS-ESTTTSSPASESTTTSSPASEST 2105
NP S ++T A+ S T+ S S ST + S + SP + + SP S S
Sbjct: 563 AGNPGSTNSTPTGSAASSNTTFSSDSPSTVVAPSTSPPAGHLGSPPATPSKIVSP-STSP 621
Query: 2106 TTSSPESESTTTSSPASESTTIEEQGVSPHSE-KLSANE 2143
S S STT SSP S + SP S K+++ E
Sbjct: 622 PASHLGSPSTTPSSPESSIKVASTETASPESSIKVASTE 660
Score = 45.5 bits (107), Expect = 3e-04
Identities = 39/125 (31%), Positives = 52/125 (41%), Gaps = 14/125 (11%)
Query: 1867 YSEIIFTTNNNSESTVVMSTLNSLLSENTT--TNSPESESTTTNNPESESTTTSSPESES 1924
Y +F N S ++ S S NTT ++SP + + +P + S P + S
Sbjct: 557 YIPGLFAGNPGSTNSTPTG---SAASSNTTFSSDSPSTVVAPSTSPPA-GHLGSPPATPS 612
Query: 1925 TTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESES 1984
S S S S S STT SSPES ST T+SPES S+ S S
Sbjct: 613 KIVSP--STSPPASHLGSPSTTPSSPESSIK------VASTETASPESSIKVASTESSVS 664
Query: 1985 TTTSS 1989
+ S
Sbjct: 665 MVSMS 669
Score = 45.5 bits (107), Expect = 4e-04
Identities = 36/106 (33%), Positives = 48/106 (45%), Gaps = 16/106 (15%)
Query: 2029 SPESESTTTSSPASESTTTNNPKSESTTTNNPAS--ESITSSSPASEST----TTSSPA- 2081
+P S ++T + A+ S TT + S ST S S PA+ S +TS PA
Sbjct: 565 NPGSTNSTPTGSAASSNTTFSSDSPSTVVAPSTSPPAGHLGSPPATPSKIVSPSTSPPAS 624
Query: 2082 ---SESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASES 2124
S STT SSP S S+ T+SPES S+ +S S
Sbjct: 625 HLGSPSTTPSSPESSIKVASTE------TASPESSIKVASTESSVS 664
Score = 43.5 bits (102), Expect = 0.002
Identities = 31/111 (27%), Positives = 45/111 (40%), Gaps = 6/111 (5%)
Query: 2009 SPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSS 2068
+P S ++T + + S TT S +S ST + S + + S S +S
Sbjct: 565 NPGSTNSTPTGSAASSNTTFSSDSPSTVVAPSTSPPAGHLGSPPATPSKIVSPSTSPPAS 624
Query: 2069 SPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSS 2119
S STT SSP S S+ T+SP S S+ S S + S
Sbjct: 625 HLGSPSTTPSSPESSIKVASTE------TASPESSIKVASTESSVSMVSMS 669
Score = 43.5 bits (102), Expect = 0.002
Identities = 35/125 (28%), Positives = 57/125 (45%), Gaps = 20/125 (16%)
Query: 1977 TSSPESESTTTSSLVSESTTTSSPESESTTTISPVS--ESTTTSSPVSESTTTISPESES 2034
+P S ++T + + S TT S +S ST ++P + + SP + + +SP
Sbjct: 563 AGNPGSTNSTPTGSAASSNTTFSSDSPSTV-VAPSTSPPAGHLGSPPATPSKIVSPS--- 618
Query: 2035 TTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASES 2094
TS PAS + S STT ++P S +S+ T+SP S S+ +S S
Sbjct: 619 --TSPPAS------HLGSPSTTPSSPESSIKVASTE------TASPESSIKVASTESSVS 664
Query: 2095 TTTSS 2099
+ S
Sbjct: 665 MVSMS 669
Score = 40.1 bits (93), Expect = 0.017
Identities = 36/118 (30%), Positives = 49/118 (41%), Gaps = 17/118 (14%)
Query: 1962 SESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPV 2021
+ ST T S S +TT SS + S + SP + + +SP +TS P
Sbjct: 569 TNSTPTGSAASSNTTFSSDSPSTVVAPSTSPPAGHLGSPPATPSKIVSP-----STSPPA 623
Query: 2022 SESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSS 2079
S S STT SSP S K ST T +P S +S+ +S S + S
Sbjct: 624 S------HLGSPSTTPSSPESSI------KVASTETASPESSIKVASTESSVSMVSMS 669
Score = 33.1 bits (75), Expect = 2.1
Identities = 21/67 (31%), Positives = 32/67 (47%), Gaps = 3/67 (4%)
Query: 2081 ASESTTTSSPASESTTTS--SPASESTTTSSPESESTTTSSPASESTTIEEQGVSPHSEK 2138
++ ST T S AS +TT S SP++ ++SP + S PA+ S + P S
Sbjct: 568 STNSTPTGSAASSNTTFSSDSPSTVVAPSTSPPA-GHLGSPPATPSKIVSPSTSPPASHL 626
Query: 2139 LSANEDP 2145
S + P
Sbjct: 627 GSPSTTP 633
>gnl|CDD|237019 PRK11907, PRK11907, bifunctional 2',3'-cyclic nucleotide
2'-phosphodiesterase/3'-nucleotidase precursor protein;
Reviewed.
Length = 814
Score = 49.5 bits (118), Expect = 3e-05
Identities = 28/114 (24%), Positives = 49/114 (42%), Gaps = 7/114 (6%)
Query: 2021 VSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSP 2080
S+S ++ + + A + ST S E+ T +P
Sbjct: 6 FSKSAVALTLALLTASNPKLAQAEEIVTTTPATSTEAEQTTP---VESDATEEADNTETP 62
Query: 2081 ASESTTTSSPASESTT-TSSPASESTTTSSPESESTTTSSPASESTT-IEEQGV 2132
+ +T +P+S T TS P SE+T T++ SE+ T + A+E++ +E Q V
Sbjct: 63 VAATTAAEAPSSSETAETSDPTSEATDTTT--SEARTVTPAATETSKPVEGQTV 114
Score = 49.1 bits (117), Expect = 3e-05
Identities = 21/105 (20%), Positives = 40/105 (38%), Gaps = 4/105 (3%)
Query: 1952 SESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPV 2011
S+S +L + + ++ + ST S E+ T +PV
Sbjct: 7 SKSAVALTLALLTASNPKLAQAEEIVTTTPATSTEAEQTT---PVESDATEEADNTETPV 63
Query: 2012 SESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTT 2056
+ +T +P S S T + + S T + SE+ T +E++
Sbjct: 64 AATTAAEAP-SSSETAETSDPTSEATDTTTSEARTVTPAATETSK 107
Score = 47.9 bits (114), Expect = 7e-05
Identities = 18/93 (19%), Positives = 34/93 (36%), Gaps = 4/93 (4%)
Query: 1984 STTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASE 2043
+ + L ++ + ST ++T S +E +TT + S
Sbjct: 19 TASNPKLAQAEEIVTTTPATSTEA----EQTTPVESDATEEADNTETPVAATTAAEAPSS 74
Query: 2044 STTTNNPKSESTTTNNPASESITSSSPASESTT 2076
S T S T+ SE+ T + A+E++
Sbjct: 75 SETAETSDPTSEATDTTTSEARTVTPAATETSK 107
Score = 46.8 bits (111), Expect = 2e-04
Identities = 19/86 (22%), Positives = 38/86 (44%), Gaps = 7/86 (8%)
Query: 1892 SENTTTNSPESESTTTNNPESESTTTSSPESE-STTTSSLVSESTTTSSPESESTTTSSP 1950
+E T +P + + +T S +E + T + V+ +T +P S S T +
Sbjct: 28 AEEIVTTTPATSTEAEQ-----TTPVESDATEEADNTETPVAATTAAEAP-SSSETAETS 81
Query: 1951 ESESTTTSSLVSESTTTSSPESESTT 1976
+ S T + SE+ T + +E++
Sbjct: 82 DPTSEATDTTTSEARTVTPAATETSK 107
Score = 46.4 bits (110), Expect = 2e-04
Identities = 19/94 (20%), Positives = 40/94 (42%), Gaps = 1/94 (1%)
Query: 1915 TTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESES 1974
T+S + + + T++ E + S E+ T + V+ +T +P S
Sbjct: 17 LLTASNPKLAQAEEIVTTTPATSTEAEQTTPVESDATEEADNTETPVAATTAAEAPSSSE 76
Query: 1975 TTTSSPESESTTTSSLVSESTTTSSPESESTTTI 2008
T +S + S T + SE+ T + +E++ +
Sbjct: 77 TAETSDPT-SEATDTTTSEARTVTPAATETSKPV 109
Score = 46.0 bits (109), Expect = 3e-04
Identities = 29/97 (29%), Positives = 40/97 (41%), Gaps = 12/97 (12%)
Query: 1981 ESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTT-TSS 2039
E TT+ S ++P T E+ T +PV+ +T +P S T TS
Sbjct: 28 AEEIVTTTPATSTEAEQTTPVESDAT-----EEADNTETPVAATTAAEAPSSSETAETSD 82
Query: 2040 PASESTTTNNPKSESTTTNNPASESITSSSPASESTT 2076
P SE+T T SE+ T A T +S E T
Sbjct: 83 PTSEATDTTT--SEARTVTPAA----TETSKPVEGQT 113
Score = 45.2 bits (107), Expect = 5e-04
Identities = 25/102 (24%), Positives = 48/102 (47%), Gaps = 10/102 (9%)
Query: 2016 TTSSPV---SESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPAS 2072
T S+P +E T +P + + + ES T ++++T T A+ + + S S
Sbjct: 19 TASNPKLAQAEEIVTTTPATSTEAEQTTPVESDATE--EADNTETPVAATTAAEAPSS-S 75
Query: 2073 ESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESES 2114
E+ TS P SE+T T++ S + + + T+ E ++
Sbjct: 76 ETAETSDPTSEATDTTT----SEARTVTPAATETSKPVEGQT 113
Score = 45.2 bits (107), Expect = 5e-04
Identities = 17/87 (19%), Positives = 32/87 (36%), Gaps = 8/87 (9%)
Query: 2001 ESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNP 2060
E TT S ++PV T E+ T +P + +T P S T
Sbjct: 28 AEEIVTTTPATSTEAEQTTPVESDAT-----EEADNTETPVAATTAAEAPSSSETAET-- 80
Query: 2061 ASESITSSSPASESTTTSSPASESTTT 2087
S+ + ++ + S + + + T+
Sbjct: 81 -SDPTSEATDTTTSEARTVTPAATETS 106
Score = 44.8 bits (106), Expect = 6e-04
Identities = 25/110 (22%), Positives = 41/110 (37%), Gaps = 9/110 (8%)
Query: 1922 SESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPE 1981
S+S +L + + ++ + ST S E+ T +P
Sbjct: 7 SKSAVALTLALLTASNPKLAQAEEIVTTTPATSTEAEQTT---PVESDATEEADNTETPV 63
Query: 1982 SEST-TTSSLVSESTTTSSPESESTTTI-----SPVSESTTTSSPVSEST 2025
+ +T + SE+ TS P SE+T T + +T TS PV T
Sbjct: 64 AATTAAEAPSSSETAETSDPTSEATDTTTSEARTVTPAATETSKPVEGQT 113
Score = 44.5 bits (105), Expect = 9e-04
Identities = 19/92 (20%), Positives = 37/92 (40%), Gaps = 1/92 (1%)
Query: 1945 TTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESES 2004
T+S + + + T++ E + S E+ T + V+ +T +P S
Sbjct: 17 LLTASNPKLAQAEEIVTTTPATSTEAEQTTPVESDATEEADNTETPVAATTAAEAPSSSE 76
Query: 2005 TTTISPVSESTTTSSPVSESTTTISPESESTT 2036
T S + S T + SE+ T +E++
Sbjct: 77 TAETSDPT-SEATDTTTSEARTVTPAATETSK 107
Score = 43.7 bits (103), Expect = 0.001
Identities = 24/89 (26%), Positives = 38/89 (42%), Gaps = 9/89 (10%)
Query: 2052 SESTTTNNPASESITSSSPASESTTTSSPASE---STTTSSPASESTTTSSPASESTTTS 2108
+E T PA+ + + T S A+E +T T A+ + S SE+ TS
Sbjct: 28 AEEIVTTTPATSTEAEQT-----TPVESDATEEADNTETPVAATTAAEAPSS-SETAETS 81
Query: 2109 SPESESTTTSSPASESTTIEEQGVSPHSE 2137
P SE+T T++ + + T S E
Sbjct: 82 DPTSEATDTTTSEARTVTPAATETSKPVE 110
Score = 43.7 bits (103), Expect = 0.002
Identities = 22/99 (22%), Positives = 41/99 (41%), Gaps = 3/99 (3%)
Query: 1849 LLISMLAATAVAISVIDNYSEIIFTTNNNS-ESTVVMSTLNSLLSENTTTNSPESESTTT 1907
+ +++ TA + EI+ TT S E+ + E T +P + +T
Sbjct: 11 VALTLALLTASN-PKLAQAEEIVTTTPATSTEAEQTTPVESDATEEADNTETPVAATTAA 69
Query: 1908 NNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTT 1946
P S T +S + S T + SE+ T + +E++
Sbjct: 70 EAPSSSETAETSDPT-SEATDTTTSEARTVTPAATETSK 107
Score = 41.0 bits (96), Expect = 0.009
Identities = 15/64 (23%), Positives = 28/64 (43%), Gaps = 6/64 (9%)
Query: 2071 ASESTTTSSPASESTTTSSPASESTTTSSPASE-STTTSSPESESTTTSSPASESTTIEE 2129
+E T++PA+ + + T S A+E + T +P + +T +P+S T
Sbjct: 27 QAEEIVTTTPATSTEAEQT-----TPVESDATEEADNTETPVAATTAAEAPSSSETAETS 81
Query: 2130 QGVS 2133
S
Sbjct: 82 DPTS 85
Score = 40.2 bits (94), Expect = 0.016
Identities = 18/82 (21%), Positives = 35/82 (42%), Gaps = 6/82 (7%)
Query: 1906 TTNNPESESTTTSSPESESTTTSSLVSE-STTTSSPESESTTTSSPESESTTTSSLVSES 1964
+ ST ++T S +E + T +P + +T +P S T +S + S
Sbjct: 31 IVTTTPATSTEAE----QTTPVESDATEEADNTETPVAATTAAEAPSSSETAETSDPT-S 85
Query: 1965 TTTSSPESESTTTSSPESESTT 1986
T + SE+ T + +E++
Sbjct: 86 EATDTTTSEARTVTPAATETSK 107
Score = 36.4 bits (84), Expect = 0.27
Identities = 14/80 (17%), Positives = 35/80 (43%), Gaps = 1/80 (1%)
Query: 1878 SESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTT 1937
+E V + S +E TT ++ N + TT++ S++ ++ S+ T+
Sbjct: 28 AEEIVTTTPATSTEAEQTTPVESDATEEADNTETPVAATTAAEAP-SSSETAETSDPTSE 86
Query: 1938 SSPESESTTTSSPESESTTT 1957
++ + S + + + T+
Sbjct: 87 ATDTTTSEARTVTPAATETS 106
Score = 33.7 bits (77), Expect = 1.5
Identities = 16/79 (20%), Positives = 31/79 (39%), Gaps = 11/79 (13%)
Query: 2076 TTSSP---ASESTTTSSPASESTTTSSPASESTTTSSPESE-STTTSSPASESTTIEEQG 2131
T S+P +E T++PA+ + + T S +E + T +P + +T E
Sbjct: 19 TASNPKLAQAEEIVTTTPATSTEAEQT-----TPVESDATEEADNTETPVAATTAAEAP- 72
Query: 2132 VSPHSEKLSANEDPEEFPN 2150
S +++ E
Sbjct: 73 -SSSETAETSDPTSEATDT 90
>gnl|CDD|240412 PTZ00420, PTZ00420, coronin; Provisional.
Length = 568
Score = 48.8 bits (116), Expect = 4e-05
Identities = 54/233 (23%), Positives = 88/233 (37%), Gaps = 68/233 (29%)
Query: 415 TDGCHIFTCSTDQTLAVWDLEKGQRIK--------------KMKGHSTFV-----NSCDP 455
D C I CS+ W++E G I K+KGH++ + N C
Sbjct: 29 IDSCGI-ACSSGFVAVPWEVEGGGLIGAIRLENQMRKPPVIKLKGHTSSILDLQFNPC-- 85
Query: 456 VRRGQLLIASGSDDCTVKVWDPRKKNQAVSMNNTYQVTSVAFNDTAECVLTGGIDNDIKM 515
++ASGS+D T++VW+ +++V Q C+L
Sbjct: 86 ---FSEILASGSEDLTIRVWEIPHNDESVKEIKDPQ-----------CIL---------- 121
Query: 516 WDLRTNSVVQKLRGHSDTVTGLSLSPDGSYIL-SNAMDNTVRIWDIRPYVPGERCVKVMS 574
+GH ++ + +P YI+ S+ D+ V IWDI E+
Sbjct: 122 ------------KGHKKKISIIDWNPMNYYIMCSSGFDSFVNIWDIE----NEK-----R 160
Query: 575 GHQHNFEKNLLRCAWSVSGLYVTAGSADKCVYIWDTTTRRIAYKLPGHNGSVN 627
Q N K L W++ G ++ K ++I D + IA H+G N
Sbjct: 161 AFQINMPKKLSSLKWNIKGNLLSGTCVGKHMHIIDPRKQEIASSFHIHDGGKN 213
Score = 43.8 bits (103), Expect = 0.001
Identities = 41/179 (22%), Positives = 78/179 (43%), Gaps = 18/179 (10%)
Query: 401 MSGHTGAVMDLKFSTDGCHIF-TCSTDQTLAVWDL----EKGQRIKK----MKGHSTFVN 451
+ GHT +++DL+F+ I + S D T+ VW++ E + IK +KGH ++
Sbjct: 70 LKGHTSSILDLQFNPCFSEILASGSEDLTIRVWEIPHNDESVKEIKDPQCILKGHKKKIS 129
Query: 452 SCDPVRRGQLLIASGSDDCTVKVWDPRKKNQAVSMNNTYQVTSVAFNDTAECVLTGGIDN 511
D ++ S D V +WD + +A +N +++S+ +N + +
Sbjct: 130 IIDWNPMNYYIMCSSGFDSFVNIWDIENEKRAFQINMPKKLSSLKWNIKGNLLSGTCVGK 189
Query: 512 DIKMWDLRTNSVVQKLRGHSDTVTGLSLSPDG-----SYILSNAMD-NTVR---IWDIR 561
+ + D R + H ++ DG +YILS N +R +WD++
Sbjct: 190 HMHIIDPRKQEIASSFHIHDGGKNTKNIWIDGLGGDDNYILSTGFSKNNMREMKLWDLK 248
Score = 39.9 bits (93), Expect = 0.018
Identities = 29/85 (34%), Positives = 47/85 (55%), Gaps = 12/85 (14%)
Query: 115 GHKSAITVIQYDP-LGHRLATGSKDTDIVLWDV------VAECG--LHRLSGHKGVITDI 165
GH S+I +Q++P LA+GS+D I +W++ V E L GHK I+ I
Sbjct: 72 GHTSSILDLQFNPCFSEILASGSEDLTIRVWEIPHNDESVKEIKDPQCILKGHKKKISII 131
Query: 166 RFMSQPGHHFVVSSAK-DTFVKIWD 189
+ P +++++ S+ D+FV IWD
Sbjct: 132 DW--NPMNYYIMCSSGFDSFVNIWD 154
Score = 36.9 bits (85), Expect = 0.14
Identities = 25/95 (26%), Positives = 46/95 (48%), Gaps = 14/95 (14%)
Query: 1076 ISLYGHKLPVLSLDMSYD---STLIATGSGDRTVKVWGLDYGDCHKS--------LLAHE 1124
I L GH +L D+ ++ S ++A+GS D T++VW + + D L H+
Sbjct: 68 IKLKGHTSSIL--DLQFNPCFSEILASGSEDLTIRVWEIPHNDESVKEIKDPQCILKGHK 125
Query: 1125 DSVTGVTFVPKTHYFF-TTSKDGRVKQWDADNFER 1158
++ + + P +Y ++ D V WD +N +R
Sbjct: 126 KKISIIDWNPMNYYIMCSSGFDSFVNIWDIENEKR 160
Score = 36.9 bits (85), Expect = 0.14
Identities = 25/95 (26%), Positives = 46/95 (48%), Gaps = 14/95 (14%)
Query: 1166 ISLYGHKLPVLSLDMSYD---STLIATGSGDRTVKVWGLDYGDCHKS--------LLAHE 1214
I L GH +L D+ ++ S ++A+GS D T++VW + + D L H+
Sbjct: 68 IKLKGHTSSIL--DLQFNPCFSEILASGSEDLTIRVWEIPHNDESVKEIKDPQCILKGHK 125
Query: 1215 DSVTGVTFVPKTHYFF-TTSKDGRVKQWDADNFER 1248
++ + + P +Y ++ D V WD +N +R
Sbjct: 126 KKISIIDWNPMNYYIMCSSGFDSFVNIWDIENEKR 160
Score = 36.9 bits (85), Expect = 0.14
Identities = 25/95 (26%), Positives = 46/95 (48%), Gaps = 14/95 (14%)
Query: 1350 ISLYGHKLPVLSLDMSYD---STLIATGSGDRTVKVWGLDYGDCHKS--------LLAHE 1398
I L GH +L D+ ++ S ++A+GS D T++VW + + D L H+
Sbjct: 68 IKLKGHTSSIL--DLQFNPCFSEILASGSEDLTIRVWEIPHNDESVKEIKDPQCILKGHK 125
Query: 1399 DSVTGVTFVPKTHYFF-TTSKDGRVKQWDADNFER 1432
++ + + P +Y ++ D V WD +N +R
Sbjct: 126 KKISIIDWNPMNYYIMCSSGFDSFVNIWDIENEKR 160
Score = 32.2 bits (73), Expect = 4.4
Identities = 28/86 (32%), Positives = 45/86 (52%), Gaps = 10/86 (11%)
Query: 618 KLPGHNGSVNDVQFHP-KEPIIMSASSDKTIYLGESPLHCDKAGS-------ILRSGKGR 669
KL GH S+ D+QF+P I+ S S D TI + E P H D++ IL+ K +
Sbjct: 69 KLKGHTSSILDLQFNPCFSEILASGSEDLTIRVWEIP-HNDESVKEIKDPQCILKGHKKK 127
Query: 670 VHTMV-NDKHRQILCCHGNDNVVDLF 694
+ + N + I+C G D+ V+++
Sbjct: 128 ISIIDWNPMNYYIMCSSGFDSFVNIW 153
>gnl|CDD|118064 pfam09528, Ehrlichia_rpt, Ehrlichia tandem repeat (Ehrlichia_rpt).
This entry represents 77 residues of an 80 amino acid
(240 nucleotide) tandem repeat, found in a variable
number of copies in an immunodominant outer membrane
protein of Ehrlichia chaffeensis, a tick-borne obligate
intracellular pathogen.
Length = 707
Score = 48.9 bits (115), Expect = 4e-05
Identities = 54/279 (19%), Positives = 80/279 (28%), Gaps = 18/279 (6%)
Query: 1892 SENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPE 1951
E+ + P S PE ++ S SS E + + E S E
Sbjct: 234 HESEVGDKPAETSKEEETPEVKAEDLQPAVDGSVEHSSSEIEEHQGETEKEEGIPESHAE 293
Query: 1952 SESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPV 2011
+V +S P S E E L + + ES + + V
Sbjct: 294 DLQPAVDDIVEHP--SSEPFVAEEEVSETEKEENNPEVLAEDLQDAADGESGVSDQPAQV 351
Query: 2012 SESTTT------SSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTT---NNPA- 2061
E + E S + S P+ E + K S T +NP
Sbjct: 352 VEERESEIEEHQGETEKEEGIPESHAEDDEIASDPSIEHFSAEVGKEVSETEKEESNPEV 411
Query: 2062 -----SESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTT 2116
++ ES PA S SP E+ PA + S + E
Sbjct: 412 KAEDLQPAVDGDVAHHESEVGDKPAETSKEEESPEIEA-EDGEPAKDGGIEESHQEEDEI 470
Query: 2117 TSSPASESTTIEEQGVSPHSEKLSANEDPEEFPNEDVFE 2155
S P+ E T E + + E E+V E
Sbjct: 471 VSEPSKEEFTAEVKAEDLQPAVDGSVEHSSSEVGEEVSE 509
Score = 44.6 bits (104), Expect = 8e-04
Identities = 58/302 (19%), Positives = 90/302 (29%), Gaps = 27/302 (8%)
Query: 1906 TTNNPESE-STTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESEST--------- 1955
NP SE ++PE ++ V+ES SS E + + + ES
Sbjct: 57 NVGNPSSEVGKEENAPEVKAEDLEPAVAESVEHSSSEVGKEVSETEKEESNPEVKAEDLQ 116
Query: 1956 ---TTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVS 2012
ES P S +PE E+ + S E +S S
Sbjct: 117 PAVDGDIAHHESEVGDKPAKTSKEEENPEIEAEDGEPAKDDGIEESH--QEEDEIVSESS 174
Query: 2013 ESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPAS 2072
+ T+ +E S ++S E + T +S ++
Sbjct: 175 KEEFTAEVKAEDLQPAVDGSIEHSSSEVGEEVSKTEKEESNPEVKAEDLQPAVDDDVAHH 234
Query: 2073 ESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPE---------SESTTTSSPASE 2123
ES PA S +P ++ S SS E E S A +
Sbjct: 235 ESEVGDKPAETSKEEETPEVKAEDLQPAVDGSVEHSSSEIEEHQGETEKEEGIPESHAED 294
Query: 2124 STTIEEQGVS-PHSEKLSANEDPEEFPNEDVFEHTFAE--IPNIDHSNQTDEAIPETFDA 2180
+ V P SE A E+ E E+ AE D + + + +
Sbjct: 295 LQPAVDDIVEHPSSEPFVAEEEVSETEKEENNPEVLAEDLQDAADGESGVSDQPAQVVEE 354
Query: 2181 RE 2182
RE
Sbjct: 355 RE 356
Score = 44.3 bits (103), Expect = 8e-04
Identities = 60/343 (17%), Positives = 98/343 (28%), Gaps = 38/343 (11%)
Query: 1858 AVAISVIDNYSEIIFTTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTT 1917
AVA SV + SE+ + + L + ES + P S
Sbjct: 82 AVAESVEHSSSEVGKEVSETEKEESNPEVKAEDLQPAVDGDIAHHESEVGDKPAKTSKEE 141
Query: 1918 SSPESESTTTSSLVSESTTTSSPESESTTTSS------PESESTTTSSLVSESTTTSSPE 1971
+PE E+ + S E + + S E ++ V S SS E
Sbjct: 142 ENPEIEAEDGEPAKDDGIEESHQEEDEIVSESSKEEFTAEVKAEDLQPAVDGSIEHSSSE 201
Query: 1972 SESTTTSSPESEST-----------------TTSSLVSESTTTSSPESESTTTISPVSES 2014
+ + + ES S V + +S E E T +
Sbjct: 202 VGEEVSKTEKEESNPEVKAEDLQPAVDDDVAHHESEVGDKPAETSKE-EETPEVKAEDLQ 260
Query: 2015 TTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASE- 2073
V S++ I T ES + + +P+SE + SE
Sbjct: 261 PAVDGSVEHSSSEIEEHQGETEKEEGIPESHAEDLQPAVDDIVEHPSSEPFVAEEEVSET 320
Query: 2074 STTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVS 2133
++P + A + S ++ E E + E
Sbjct: 321 EKEENNPEVLAEDLQDAADGESGVSDQPAQVVEERESEIEEHQGETEKEEGIP------E 374
Query: 2134 PHSEKLSANEDPEEFPNEDVFEHTFAEIPNIDHSNQTDEAIPE 2176
H+E DP EH AE+ + +E+ PE
Sbjct: 375 SHAEDDEIASDPS-------IEHFSAEVGKEVSETEKEESNPE 410
Score = 41.2 bits (95), Expect = 0.007
Identities = 52/287 (18%), Positives = 88/287 (30%), Gaps = 12/287 (4%)
Query: 1898 NSPESESTTTNNPESESTTTS----SPESESTTTSSLVSESTTTSSPESESTTTSSPESE 1953
+ E S+ E E + T +PE + + S ++ E E
Sbjct: 301 DIVEHPSSEPFVAEEEVSETEKEENNPEVLAEDLQDAADGESGVSDQPAQVVEERESEIE 360
Query: 1954 STTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSE 2013
E S + S P E ++ + E + T ES +
Sbjct: 361 E-HQGETEKEEGIPESHAEDDEIASDPSIEH-FSAEVGKEVSETEKEESNPEVKAEDLQP 418
Query: 2014 STTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASE 2073
+ ES P S SP E+ P + + E S P+ E
Sbjct: 419 AVDGDVAHHESEVGDKPAETSKEEESPEIEAEDG-EPAKDGGIEESHQEEDEIVSEPSKE 477
Query: 2074 STTTSSPASESTTTSSPASESTTTSSPASESTTTSSPES--ESTTTSSPASESTTIEEQG 2131
T A E + S ++S E + T ES E P + ++ E
Sbjct: 478 EFTAEVKA-EDLQPAVDGSVEHSSSEVGEEVSETEKEESNPEIKAEDLPPAVDDSL-EHS 535
Query: 2132 VSPHSEKLSANEDPEEFPNEDVFEHTFAEIPNIDH-SNQTDEAIPET 2177
+ EK+ E P + A +++H S++ + + ET
Sbjct: 536 IPEVGEKVDEMFAEEFNPEVIAEDLQPAVDGSVEHSSSEVGDKVCET 582
Score = 40.4 bits (93), Expect = 0.014
Identities = 50/254 (19%), Positives = 75/254 (29%), Gaps = 12/254 (4%)
Query: 1932 SESTTTSSPESESTTTSSPESESTTTSSLVSESTT-TSSPESESTTTSSPESESTTTSSL 1990
ES P S +PE E+ + + E E + SS E + +
Sbjct: 126 HESEVGDKPAKTSKEEENPEIEAEDGEPAKDDGIEESHQEEDEIVSESSKEEFTAEVKAE 185
Query: 1991 VSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNP 2050
+ S E S+ VS++ S + P + ES + P
Sbjct: 186 DLQPAVDGSIEHSSSEVGEEVSKTEKEESNPEVKAEDLQPAVDDDVAHH---ESEVGDKP 242
Query: 2051 KSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSS----PASESTT 2106
S P ++ S SS E + E S +
Sbjct: 243 AETSKEEETPEVKAEDLQPAVDGSVEHSSSEIEEHQGETEKEEGIPESHAEDLQPAVDDI 302
Query: 2107 TSSPESESTTTSSPASESTTIEEQGVSPHSEKLSANEDPEEFPNE---DVFEHTFAEIPN 2163
P SE SE T EE +E L D E ++ V E +EI
Sbjct: 303 VEHPSSEPFVAEEEVSE-TEKEENNPEVLAEDLQDAADGESGVSDQPAQVVEERESEIEE 361
Query: 2164 IDHSNQTDEAIPET 2177
+ +E IPE+
Sbjct: 362 HQGETEKEEGIPES 375
Score = 39.6 bits (91), Expect = 0.022
Identities = 56/303 (18%), Positives = 94/303 (31%), Gaps = 13/303 (4%)
Query: 1893 ENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSE-STTTSSPESESTTTSSPE 1951
E + +S E + + + S E S+ VS+ S+PE ++
Sbjct: 168 EIVSESSKEEFTAEVKAEDLQPAVDGSIEHSSSEVGEEVSKTEKEESNPEVKAEDLQPAV 227
Query: 1952 SEST-TTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISP 2010
+ S V + +S E E+ + + + S+ S+ + E T
Sbjct: 228 DDDVAHHESEVGDKPAETSKEEETPEVKAEDLQPAVDGSVEHSSSEIEEHQGE-TEKEEG 286
Query: 2011 VSESTTTSSPVSESTTTISPESESTTTSSPASE-STTTNNPKSESTTTNNPAS-ESITSS 2068
+ ES + P SE SE NNP+ + + A ES S
Sbjct: 287 IPESHAEDLQPAVDDIVEHPSSEPFVAEEEVSETEKEENNPEVLAEDLQDAADGESGVSD 346
Query: 2069 SPASESTTTSS--------PASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSP 2120
PA S E S A + S P+ E + + S T
Sbjct: 347 QPAQVVEERESEIEEHQGETEKEEGIPESHAEDDEIASDPSIEHFSAEVGKEVSETEKEE 406
Query: 2121 ASESTTIEEQGVSPHSEKLSANEDPEEFPNEDVFEHTFAEIPNIDHSNQTDEAIPETFDA 2180
++ E+ + + + + P E E EI D D I E+
Sbjct: 407 SNPEVKAEDLQPAVDGDVAHHESEVGDKPAETSKEEESPEIEAEDGEPAKDGGIEESHQE 466
Query: 2181 REE 2183
+E
Sbjct: 467 EDE 469
Score = 38.5 bits (88), Expect = 0.054
Identities = 53/250 (21%), Positives = 89/250 (35%), Gaps = 27/250 (10%)
Query: 1966 TTSSPESE-STTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSES 2024
+P SE ++PE ++ V+ES SS SE +S + + +E
Sbjct: 57 NVGNPSSEVGKEENAPEVKAEDLEPAVAESVEHSS--SEVGKEVSETEKEESNPEVKAED 114
Query: 2025 TTTISP----ESESTTTSSPASESTTTNNPKSESTTTNNPASESITSS--------SPAS 2072
ES PA S NP+ E+ + I S S +S
Sbjct: 115 LQPAVDGDIAHHESEVGDKPAKTSKEEENPEIEAEDGEPAKDDGIEESHQEEDEIVSESS 174
Query: 2073 ESTTTSSPASESTTTSSPASESTTTSSPASESTTT----SSPESESTTTSSPASESTTIE 2128
+ T+ +E + S ++S E + T S+PE ++ +
Sbjct: 175 KEEFTAEVKAEDLQPAVDGSIEHSSSEVGEEVSKTEKEESNPEVKAEDLQPAVDDDVAHH 234
Query: 2129 EQGVSPHSEKLSANEDPEEFPNEDV-------FEHTFAEIPNIDHSNQTDEAIPETFDAR 2181
E V + S E+ E ED+ EH+ +EI + +E IPE+ A
Sbjct: 235 ESEVGDKPAETSKEEETPEVKAEDLQPAVDGSVEHSSSEIEEHQGETEKEEGIPES-HAE 293
Query: 2182 EEWPQCKDVI 2191
+ P D++
Sbjct: 294 DLQPAVDDIV 303
Score = 35.4 bits (80), Expect = 0.42
Identities = 38/232 (16%), Positives = 63/232 (27%), Gaps = 9/232 (3%)
Query: 1889 SLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSE-STTTSSPES----- 1942
S + E+ E ++ + E + S E S VSE S+PE
Sbjct: 357 SEIEEHQGETEKEEGIPESHAEDDEIASDPSIEHFSAEVGKEVSETEKEESNPEVKAEDL 416
Query: 1943 ESTTTSSPESESTTTSSLVSE-STTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPE 2001
+ + +E S SPE E+ P + S E S P
Sbjct: 417 QPAVDGDVAHHESEVGDKPAETSKEEESPEIEA-EDGEPAKDGGIEESHQEEDEIVSEPS 475
Query: 2002 SES-TTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNP 2060
E T + V S++ + E T E + P + + +
Sbjct: 476 KEEFTAEVKAEDLQPAVDGSVEHSSSEVGEEVSETEKEESNPEIKAEDLPPAVDDSLEHS 535
Query: 2061 ASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPES 2112
E +E + S ++S + T E
Sbjct: 536 IPEVGEKVDEMFAEEFNPEVIAEDLQPAVDGSVEHSSSEVGDKVCETCEEEF 587
>gnl|CDD|184900 PRK14907, rplD, 50S ribosomal protein L4; Provisional.
Length = 295
Score = 47.3 bits (112), Expect = 6e-05
Identities = 24/116 (20%), Positives = 43/116 (37%), Gaps = 5/116 (4%)
Query: 2045 TTTNNPKSESTTTNNPASE-SITSSSPASESTTTSSP----ASESTTTSSPASESTTTSS 2099
T K ++T PA++ + TS A T + A ++ S TTT
Sbjct: 3 ETKKTTKKKTTEEKKPAAKKATTSKETAKTKKTAKTTSTKAAKKAAKVKKTKSVKTTTKK 62
Query: 2100 PASESTTTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSANEDPEEFPNEDVFE 2155
+ T S + ES + E+ + E S K ++ + F +E ++
Sbjct: 63 VTVKFEKTESVKKESVAKKTVKKEAVSAEVFEASNKLFKNTSKLPKKLFASEKIYS 118
Score = 39.5 bits (92), Expect = 0.014
Identities = 21/108 (19%), Positives = 40/108 (37%), Gaps = 5/108 (4%)
Query: 2015 TTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASES 2074
T + ++T P ++ TTS +++ T K+ ST A++ S
Sbjct: 3 ETKKTTKKKTTEEKKPAAKKATTSKETAKTKKTA--KTTSTKAAKKAAKV---KKTKSVK 57
Query: 2075 TTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPAS 2122
TTT + T S ES + E+ + E+ + + +
Sbjct: 58 TTTKKVTVKFEKTESVKKESVAKKTVKKEAVSAEVFEASNKLFKNTSK 105
Score = 37.6 bits (87), Expect = 0.055
Identities = 22/106 (20%), Positives = 41/106 (38%), Gaps = 11/106 (10%)
Query: 1920 PESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSS 1979
E++ TT +T P ++ TTS E+ T ++ + +
Sbjct: 2 AETKKTTKKK----TTEEKKPAAKKATTSK-ETAKTKKTAKTTSTKAAKKAAK----VKK 52
Query: 1980 PESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSEST 2025
+S TTT + + T S + ES + E+ S+ V E++
Sbjct: 53 TKSVKTTTKKVTVKFEKTESVKKESVAKKTVKKEA--VSAEVFEAS 96
Score = 37.6 bits (87), Expect = 0.058
Identities = 20/101 (19%), Positives = 41/101 (40%), Gaps = 7/101 (6%)
Query: 1895 TTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESES 1954
T + + ++T P ++ TTS E+ T ++ + + +S
Sbjct: 3 ETKKTTKKKTTEEKKPAAKKATTSK-ETAKTKKTAKTTSTKAAKKAAK----VKKTKSVK 57
Query: 1955 TTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSEST 1995
TTT + + T S + ES + + E+ S+ V E++
Sbjct: 58 TTTKKVTVKFEKTESVKKESVAKKTVKKEA--VSAEVFEAS 96
Score = 36.5 bits (84), Expect = 0.14
Identities = 20/113 (17%), Positives = 39/113 (34%), Gaps = 9/113 (7%)
Query: 1950 PESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTIS 2009
E++ TT +T P ++ TTS E+ T ++ + +
Sbjct: 2 AETKKTTKKK----TTEEKKPAAKKATTSK-ETAKTKKTAKTTSTKAAKKAAKVK----K 52
Query: 2010 PVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPAS 2062
S TTT + T S + ES + E+ + ++ + N +
Sbjct: 53 TKSVKTTTKKVTVKFEKTESVKKESVAKKTVKKEAVSAEVFEASNKLFKNTSK 105
Score = 36.1 bits (83), Expect = 0.21
Identities = 19/98 (19%), Positives = 36/98 (36%), Gaps = 7/98 (7%)
Query: 1900 PESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSS 1959
E++ TT ++T P ++ TTS + T ++ + + T S
Sbjct: 2 AETKKTTK----KKTTEEKKPAAKKATTSK-ETAKTKKTAKTTSTKAAKKAAKVKKTKS- 55
Query: 1960 LVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTT 1997
V +T + + E T + ES + T + S
Sbjct: 56 -VKTTTKKVTVKFEKTESVKKESVAKKTVKKEAVSAEV 92
Score = 36.1 bits (83), Expect = 0.22
Identities = 17/100 (17%), Positives = 35/100 (35%), Gaps = 11/100 (11%)
Query: 1910 PESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSS 1969
E++ TT E + ++TT+ T + +++ ++
Sbjct: 2 AETKKTTKKKTTEEKKPAA---KKATTSKETAKTKKTAKTTSTKAA------KKAAKVKK 52
Query: 1970 PESESTTTS--SPESESTTTSSLVSESTTTSSPESESTTT 2007
+S TTT + + E T + S + T E+ S
Sbjct: 53 TKSVKTTTKKVTVKFEKTESVKKESVAKKTVKKEAVSAEV 92
Score = 34.9 bits (80), Expect = 0.45
Identities = 18/98 (18%), Positives = 35/98 (35%), Gaps = 5/98 (5%)
Query: 1905 TTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSES 1964
T + ++T P ++ TTS + T ++ + + T S
Sbjct: 3 ETKKTTKKKTTEEKKPAAKKATTSK-ETAKTKKTAKTTSTKAAKKAAKVKKTKS----VK 57
Query: 1965 TTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPES 2002
TTT + T S + ES ++ E+ + E+
Sbjct: 58 TTTKKVTVKFEKTESVKKESVAKKTVKKEAVSAEVFEA 95
Score = 34.5 bits (79), Expect = 0.53
Identities = 19/113 (16%), Positives = 36/113 (31%), Gaps = 11/113 (9%)
Query: 1995 TTTSSPESESTTTISPVSESTTTSS-PVSESTTTISPES----ESTTTSSPASESTTTNN 2049
T + + ++T P ++ TTS T + + ++ S TTT
Sbjct: 3 ETKKTTKKKTTEEKKPAAKKATTSKETAKTKKTAKTTSTKAAKKAAKVKKTKSVKTTTKK 62
Query: 2050 PKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPAS 2102
+ T + ES+ + E+ S +S T+ P
Sbjct: 63 VTVKFEKTESVKKESVAKKTVKKEAV------SAEVFEASNKLFKNTSKLPKK 109
Score = 34.5 bits (79), Expect = 0.62
Identities = 17/95 (17%), Positives = 34/95 (35%), Gaps = 7/95 (7%)
Query: 1935 TTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSES 1994
T + + ++T P ++ TTS + T ++ + + T S
Sbjct: 3 ETKKTTKKKTTEEKKPAAKKATTSK-ETAKTKKTAKTTSTKAAKKAAKVKKTKS----VK 57
Query: 1995 TTTS--SPESESTTTISPVSESTTTSSPVSESTTT 2027
TTT + + E T ++ S + T + S
Sbjct: 58 TTTKKVTVKFEKTESVKKESVAKKTVKKEAVSAEV 92
>gnl|CDD|216513 pfam01456, Mucin, Mucin-like glycoprotein. This family of
trypanosomal proteins resemble vertebrate mucins. The
protein consists of three regions. The N and C terminii
are conserved between all members of the family, whereas
the central region is not well conserved and contains a
large number of threonine residues which can be
glycosylated. Indirect evidence suggested that these
genes might encode the core protein of parasite mucins,
glycoproteins that were proposed to be involved in the
interaction with, and invasion of, mammalian host cells.
This family contains an N-terminal signal peptide.
Length = 143
Score = 44.9 bits (105), Expect = 6e-05
Identities = 27/92 (29%), Positives = 48/92 (52%), Gaps = 4/92 (4%)
Query: 2011 VSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSP 2070
V E+ S + +TTT +P + +TTT++ + TTT +TTT + + T+ +P
Sbjct: 35 VVEAAEGQSQTTTTTTTTTPPTTTTTTTTTTTTITTT--TTKTTTTTTTTTTTTTTTEAP 92
Query: 2071 ASESTTTSSPASESTTTSSPASESTTTSSPAS 2102
+ +TT+ +P +T T +P+S S S
Sbjct: 93 SKNTTTSEAPT--TTDTRAPSSIREIDGSLGS 122
Score = 44.9 bits (105), Expect = 6e-05
Identities = 22/92 (23%), Positives = 43/92 (46%), Gaps = 2/92 (2%)
Query: 2031 ESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSP 2090
+ + +TTT +TTT + + +++ +TTT++ + +TTT +P
Sbjct: 33 AAVVEAAEGQSQTTTTTTTTTPPTTTTTTTTTTTTITTTTTKTTTTTTTTTTTTTTTEAP 92
Query: 2091 ASESTTTSSPASESTTTSSPESESTTTSSPAS 2122
+ +TT+ +P +T T +P S S S
Sbjct: 93 SKNTTTSEAPT--TTDTRAPSSIREIDGSLGS 122
Score = 43.7 bits (102), Expect = 2e-04
Identities = 27/92 (29%), Positives = 47/92 (51%), Gaps = 4/92 (4%)
Query: 1991 VSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNP 2050
V E+ S + +TTT +P + +TTT++ + TTT +TTT++ + +TTT P
Sbjct: 35 VVEAAEGQSQTTTTTTTTTPPTTTTTTTTTTTTITTT--TTKTTTTTTTTTTTTTTTEAP 92
Query: 2051 KSESTTTNNPASESITSSSPASESTTTSSPAS 2082
+TT+ P + + +P+S S S
Sbjct: 93 SKNTTTSEAPTTT--DTRAPSSIREIDGSLGS 122
Score = 43.7 bits (102), Expect = 2e-04
Identities = 27/90 (30%), Positives = 48/90 (53%), Gaps = 4/90 (4%)
Query: 2003 ESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPAS 2062
E+ S + +TTT++P + +TTT + + +T T++ +TTT + +TTT P+
Sbjct: 37 EAAEGQSQTTTTTTTTTPPTTTTTTTT--TTTTITTTTTKTTTTTTTTTTTTTTTEAPSK 94
Query: 2063 ESITSSSPASESTTTSSPASESTTTSSPAS 2092
+ TS +P +T T +P+S S S
Sbjct: 95 NTTTSEAPT--TTDTRAPSSIREIDGSLGS 122
Score = 42.9 bits (100), Expect = 3e-04
Identities = 22/78 (28%), Positives = 50/78 (64%), Gaps = 2/78 (2%)
Query: 1915 TTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESES 1974
+ +S++TTT++ + TTT++ + +TT ++ +++TTT++ + +TTT++ E+ S
Sbjct: 36 VEAAEGQSQTTTTTTTTTPPTTTTTTTTTTTTITTTTTKTTTTTT--TTTTTTTTTEAPS 93
Query: 1975 TTTSSPESESTTTSSLVS 1992
T++ E+ +TT + S
Sbjct: 94 KNTTTSEAPTTTDTRAPS 111
Score = 42.5 bits (99), Expect = 4e-04
Identities = 21/85 (24%), Positives = 44/85 (51%), Gaps = 2/85 (2%)
Query: 2041 ASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSP 2100
A + E+ + + + T+++P + +TTT++ + TTT +TTT++
Sbjct: 25 AQGEGQYDAAVVEAAEGQSQTTTTTTTTTPPTTTTTTTTTTTTITTT--TTKTTTTTTTT 82
Query: 2101 ASESTTTSSPESESTTTSSPASEST 2125
+ +TTT +P +TT+ +P + T
Sbjct: 83 TTTTTTTEAPSKNTTTSEAPTTTDT 107
Score = 42.5 bits (99), Expect = 4e-04
Identities = 21/83 (25%), Positives = 43/83 (51%)
Query: 1911 ESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSP 1970
+ +TTT++ +TTT++ + +T T++ +TTT++ + +TTT +P
Sbjct: 33 AAVVEAAEGQSQTTTTTTTTTPPTTTTTTTTTTTTITTTTTKTTTTTTTTTTTTTTTEAP 92
Query: 1971 ESESTTTSSPESESTTTSSLVSE 1993
+TT+ +P + T S + E
Sbjct: 93 SKNTTTSEAPTTTDTRAPSSIRE 115
Score = 42.5 bits (99), Expect = 4e-04
Identities = 21/89 (23%), Positives = 45/89 (50%)
Query: 1941 ESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSP 2000
+ +TTT++ +TTT++ + +T T++ +TTT++ + +TTT +P
Sbjct: 33 AAVVEAAEGQSQTTTTTTTTTPPTTTTTTTTTTTTITTTTTKTTTTTTTTTTTTTTTEAP 92
Query: 2001 ESESTTTISPVSESTTTSSPVSESTTTIS 2029
+TT+ +P + T S + E ++
Sbjct: 93 SKNTTTSEAPTTTDTRAPSSIREIDGSLG 121
Score = 42.5 bits (99), Expect = 5e-04
Identities = 26/88 (29%), Positives = 48/88 (54%), Gaps = 1/88 (1%)
Query: 1895 TTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESES 1954
+S++TTT + TTT++ + +TT ++ +++TTT++ + +TTT+ S++
Sbjct: 36 VEAAEGQSQTTTTTTTTTPPTTTTTTTTTTTTITTTTTKTTTTTTTTTTTTTTTEAPSKN 95
Query: 1955 TTTSSLVSESTTTSSPESESTTTSSPES 1982
TTTS +T T +P S S S
Sbjct: 96 TTTSE-APTTTDTRAPSSIREIDGSLGS 122
Score = 42.2 bits (98), Expect = 5e-04
Identities = 23/92 (25%), Positives = 44/92 (47%), Gaps = 2/92 (2%)
Query: 1971 ESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISP 2030
+ +TTT++ +TTT++ + +T T + +TTT++ + +TTT +P
Sbjct: 33 AAVVEAAEGQSQTTTTTTTTTPPTTTTTTTTTTTTITTTTTKTTTTTTTTTTTTTTTEAP 92
Query: 2031 ESESTTTSSPASESTTTNNPKSESTTTNNPAS 2062
+TT+ +P +T T P S + S
Sbjct: 93 SKNTTTSEAPT--TTDTRAPSSIREIDGSLGS 122
Score = 42.2 bits (98), Expect = 6e-04
Identities = 27/92 (29%), Positives = 52/92 (56%), Gaps = 4/92 (4%)
Query: 1961 VSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSP 2020
V E+ S + +TTT++P + +TTT++ + +T T++ +TTT + + +TTT +P
Sbjct: 35 VVEAAEGQSQTTTTTTTTTPPTTTTTTTT--TTTTITTTTTKTTTTTTTTTTTTTTTEAP 92
Query: 2021 VSESTTTISPESESTTTSSPASESTTTNNPKS 2052
+TT+ +P +T T +P+S + S
Sbjct: 93 SKNTTTSEAP--TTTDTRAPSSIREIDGSLGS 122
Score = 41.8 bits (97), Expect = 9e-04
Identities = 30/108 (27%), Positives = 55/108 (50%), Gaps = 9/108 (8%)
Query: 1965 TTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSES 2024
T + ++ + E +S TT++ +TTT+ P + +TTT + +T T++ +
Sbjct: 24 TAQGEGQYDAAVVEAAEGQSQTTTT----TTTTTPPTTTTTTT---TTTTTITTTTTKTT 76
Query: 2025 TTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPAS 2072
TTT + + +TTT +P+ +TT+ P +T T P+S S S
Sbjct: 77 TTTTTTTTTTTTTEAPSKNTTTSEAPT--TTDTRAPSSIREIDGSLGS 122
Score = 40.2 bits (93), Expect = 0.003
Identities = 31/109 (28%), Positives = 57/109 (52%), Gaps = 11/109 (10%)
Query: 1905 TTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSES 1964
T + ++ + E +S TT++ +TTT+ P + +TTT++ + +TTT+
Sbjct: 24 TAQGEGQYDAAVVEAAEGQSQTTTT----TTTTTPPTTTTTTTTTTTTITTTTT------ 73
Query: 1965 TTTSSPESESTTTSSPESES-TTTSSLVSESTTTSSPESESTTTISPVS 2012
TT++ + +TTT++ E+ S TT+S +T T +P S S S
Sbjct: 74 KTTTTTTTTTTTTTTTEAPSKNTTTSEAPTTTDTRAPSSIREIDGSLGS 122
Score = 40.2 bits (93), Expect = 0.003
Identities = 31/108 (28%), Positives = 58/108 (53%), Gaps = 9/108 (8%)
Query: 1935 TTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSES 1994
T + ++ + E +S TT++ TTT++P + +TTT++ + TTT++ +
Sbjct: 24 TAQGEGQYDAAVVEAAEGQSQTTTT-----TTTTTPPTTTTTTTTTTTTITTTTT--KTT 76
Query: 1995 TTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPAS 2042
TTT++ + +TTT +P +TT+ +P +T T +P S S S
Sbjct: 77 TTTTTTTTTTTTTEAPSKNTTTSEAPT--TTDTRAPSSIREIDGSLGS 122
Score = 39.8 bits (92), Expect = 0.004
Identities = 22/77 (28%), Positives = 41/77 (53%), Gaps = 2/77 (2%)
Query: 2052 SESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPE 2111
+ E+ S + +TTT++P + +TTT++ + TTT +TTT++
Sbjct: 26 QGEGQYDAAVVEAAEGQSQTTTTTTTTTPPTTTTTTTTTTTTITTT--TTKTTTTTTTTT 83
Query: 2112 SESTTTSSPASESTTIE 2128
+ +TTT +P+ +TT E
Sbjct: 84 TTTTTTEAPSKNTTTSE 100
Score = 39.5 bits (91), Expect = 0.005
Identities = 22/71 (30%), Positives = 44/71 (61%), Gaps = 2/71 (2%)
Query: 1892 SENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPE 1951
S+ TTT + + TTT + +TT ++ +++TTT++ + +TTT++ E+ S T++ E
Sbjct: 43 SQTTTTTTTTTPPTTTTTTTTTTTTITTTTTKTTTTTT--TTTTTTTTTEAPSKNTTTSE 100
Query: 1952 SESTTTSSLVS 1962
+ +TT + S
Sbjct: 101 APTTTDTRAPS 111
Score = 39.1 bits (90), Expect = 0.007
Identities = 24/89 (26%), Positives = 47/89 (52%), Gaps = 9/89 (10%)
Query: 2035 TTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASES 2094
+ S++TTT +TTT P + + T++ + +T T++ +TTT++ + +
Sbjct: 36 VEAAEGQSQTTTT------TTTTTPPTTTTTTTT---TTTTITTTTTKTTTTTTTTTTTT 86
Query: 2095 TTTSSPASESTTTSSPESESTTTSSPASE 2123
TTT +P+ +TT+ +P + T S E
Sbjct: 87 TTTEAPSKNTTTSEAPTTTDTRAPSSIRE 115
Score = 33.3 bits (75), Expect = 0.58
Identities = 24/78 (30%), Positives = 43/78 (55%), Gaps = 1/78 (1%)
Query: 1885 STLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESES 1944
+T + + TTT + + +TT +++TTT++ + +TTT+ S++TTTS + +
Sbjct: 46 TTTTTTTTPPTTTTTTTTTTTTITTTTTKTTTTTTTTTTTTTTTEAPSKNTTTSEAPT-T 104
Query: 1945 TTTSSPESESTTTSSLVS 1962
T T +P S SL S
Sbjct: 105 TDTRAPSSIREIDGSLGS 122
Score = 33.3 bits (75), Expect = 0.71
Identities = 15/55 (27%), Positives = 34/55 (61%)
Query: 2075 TTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEE 2129
+ S++TTT++ + TTT++ + +TT ++ +++TTT++ + +TT E
Sbjct: 36 VEAAEGQSQTTTTTTTTTPPTTTTTTTTTTTTITTTTTKTTTTTTTTTTTTTTTE 90
Score = 32.9 bits (74), Expect = 0.75
Identities = 20/77 (25%), Positives = 34/77 (44%), Gaps = 1/77 (1%)
Query: 1877 NSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESES-TTTSSLVSEST 1935
+ +T + + TTT + + TT + +TTT++ E+ S TT+S +T
Sbjct: 46 TTTTTTTTPPTTTTTTTTTTTTITTTTTKTTTTTTTTTTTTTTTEAPSKNTTTSEAPTTT 105
Query: 1936 TTSSPESESTTTSSPES 1952
T +P S S S
Sbjct: 106 DTRAPSSIREIDGSLGS 122
Score = 31.0 bits (69), Expect = 4.2
Identities = 15/56 (26%), Positives = 30/56 (53%)
Query: 2071 ASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTT 2126
A + E+ S + +TTT++P + +TTT++ + TTT++ + +TT
Sbjct: 25 AQGEGQYDAAVVEAAEGQSQTTTTTTTTTPPTTTTTTTTTTTTITTTTTKTTTTTT 80
Score = 29.8 bits (66), Expect = 7.9
Identities = 15/61 (24%), Positives = 27/61 (44%)
Query: 1873 TTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVS 1932
T + +T + + TTT + + +TTT P +TT+ +P + T S +
Sbjct: 55 PTTTTTTTTTTTTITTTTTKTTTTTTTTTTTTTTTEAPSKNTTTSEAPTTTDTRAPSSIR 114
Query: 1933 E 1933
E
Sbjct: 115 E 115
Score = 29.8 bits (66), Expect = 8.9
Identities = 16/60 (26%), Positives = 29/60 (48%)
Query: 1873 TTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVS 1932
TT + +T +T + TT + + +TTT E+ S T++ E+ +TT + S
Sbjct: 52 TTPPTTTTTTTTTTTTITTTTTKTTTTTTTTTTTTTTTEAPSKNTTTSEAPTTTDTRAPS 111
>gnl|CDD|113514 pfam04747, DUF612, Protein of unknown function, DUF612. This family
includes several uncharacterized proteins from
Caenorhabditis elegans.
Length = 517
Score = 47.7 bits (112), Expect = 7e-05
Identities = 52/283 (18%), Positives = 101/283 (35%), Gaps = 17/283 (6%)
Query: 1911 ESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSP 1970
+++ +T +P E + ++ + +PE + T T++P + + + +
Sbjct: 163 KTKKASTPAPVEEEIVVKKVANDRSAAPAPEPK-TPTNTPAEPAEQVQEITGKKNKKNKK 221
Query: 1971 ESESTTTSSPESESTTTSS---LVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTT 2027
+SES T++P S + E ++P+ + SES + S T
Sbjct: 222 KSESEATAAPASVEQVVEQPKVVTEEPHQQAAPQEKKNKKNKRKSESENVPAA---SETP 278
Query: 2028 ISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPA---SES 2084
+ P E T+ PASE+ N + + + E + + +P S+ T
Sbjct: 279 VEPVVE---TTPPASENQKKNKKDKKKSESEKVVEEPVQAEAPKSKKPTADDNMDFLDFV 335
Query: 2085 TTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSANED 2144
T P E T + E + E+ +++P + + + SE E
Sbjct: 336 TAKEEPKDEPAETPAAPVEEVVENVVENVVEKSTTPPATENKKKNKKDKKKSESEKVTEQ 395
Query: 2145 PEEF----PNEDVFEHTFAEIPNIDHSNQTDEAIPETFDAREE 2183
P E P + N+ D+ E+ A EE
Sbjct: 396 PVESAPAPPQVEQVVEKTPPASENKKKNKKDKKKSESEKAVEE 438
Score = 47.4 bits (111), Expect = 8e-05
Identities = 49/233 (21%), Positives = 90/233 (38%), Gaps = 25/233 (10%)
Query: 1908 NNPESESTTTSSPESESTTTSS---LVSESTTTSSPESESTTTSSPESES----TTTSSL 1960
N +SES T++P S + E ++P+ + + +SES + +
Sbjct: 219 NKKKSESEATAAPASVEQVVEQPKVVTEEPHQQAAPQEKKNKKNKRKSESENVPAASETP 278
Query: 1961 VSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESES-------------TTT 2007
V T+ P SE+ + + + + + +V E +P+S+ T
Sbjct: 279 VEPVVETTPPASENQKKNKKDKKKSESEKVVEEPVQAEAPKSKKPTADDNMDFLDFVTAK 338
Query: 2008 ISPVSE-STTTSSPVSESTTTISPESESTTTSSPASESTTTNNP---KSESTTTNNPASE 2063
P E + T ++PV E + +T+ PA+E+ N KSES E
Sbjct: 339 EEPKDEPAETPAAPVEEVVENVVENVVEKSTTPPATENKKKNKKDKKKSESEKVTEQPVE 398
Query: 2064 SITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTT 2116
S + + + PASE+ + + + S A E ++P S+ T
Sbjct: 399 SAPAPPQVEQVVEKTPPASENKKKNKK-DKKKSESEKAVEEPVQAAPSSKKPT 450
Score = 43.5 bits (101), Expect = 0.002
Identities = 37/203 (18%), Positives = 76/203 (37%), Gaps = 20/203 (9%)
Query: 1961 VSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSP 2020
V + +++ +T +P E + ++ + +PE + T T +P +
Sbjct: 153 VKAEKAEKAEKTKKASTPAPVEEEIVVKKVANDRSAAPAPEPK-TPTNTPAEPAEQVQEI 211
Query: 2021 VSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSP 2080
+ +SES T++PAS PK T P ++ ++ S
Sbjct: 212 TGKKNKKNKKKSESEATAAPASVEQVVEQPK---VVTEEPHQQAAPQEKKNKKNKRKSES 268
Query: 2081 ASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVSPHSEK-- 2138
+ + +P T+ PASE+ + + + + + E Q +P S+K
Sbjct: 269 ENVPAASETPVEPVVETTPPASENQKKNKKDKKKSESEKVVEEPV----QAEAPKSKKPT 324
Query: 2139 ----------LSANEDPEEFPNE 2151
++A E+P++ P E
Sbjct: 325 ADDNMDFLDFVTAKEEPKDEPAE 347
Score = 35.0 bits (79), Expect = 0.62
Identities = 34/227 (14%), Positives = 76/227 (33%), Gaps = 11/227 (4%)
Query: 1892 SENTTTNSPESESTTTNNPESESTTTSSPESESTTTSS---LVSESTTTSSPESESTTTS 1948
SEN N + + + + E +P+S+ T + T P+ E T
Sbjct: 290 SENQKKNKKDKKKSESEKVVEEPVQAEAPKSKKPTADDNMDFLDFVTAKEEPKDEPAETP 349
Query: 1949 SPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTI 2008
+ E + + + +++P + + + + + S V+E S+P
Sbjct: 350 AAPVEEVVENVVENVVEKSTTPPATENKKKNKKDKKKSESEKVTEQPVESAPAP------ 403
Query: 2009 SPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPAS-ESITS 2067
P E +P + + + + + S A E P S+ T ++ +
Sbjct: 404 -PQVEQVVEKTPPASENKKKNKKDKKKSESEKAVEEPVQAAPSSKKPTADDNMDFLDFVT 462
Query: 2068 SSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESES 2114
+ P + + + + A E T ++ + + ES
Sbjct: 463 AKPDKSESAEEHIEAPAIVEPAHADEETAAAAEGKKKNKKDKKKKES 509
>gnl|CDD|227578 COG5253, MSS4, Phosphatidylinositol-4-phosphate 5-kinase [Signal
transduction mechanisms].
Length = 612
Score = 46.9 bits (111), Expect = 1e-04
Identities = 42/247 (17%), Positives = 86/247 (34%), Gaps = 9/247 (3%)
Query: 1915 TTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESES 1974
T P S S T S+ + +T + S S +S + ++ S + ++
Sbjct: 1 TEERPPISRSGTGISMTHDKSTRPNDRSMSNDSSLCGLNQASDANGNEYSPNNKVSKKDT 60
Query: 1975 TTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESES 2034
+ ++ S + + + + ++ + S
Sbjct: 61 FSDQLHDALSKEFTLERERDRLQLNKRKYQAIRLQTSTPIVEIFKNNKDAVDPPNHTRSS 120
Query: 2035 TTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTS---SPASESTTTSSPA 2091
S A+ T + S + N P + + P S + S T +S P+
Sbjct: 121 GNNLSNANVKTLSAPVGEHSRSNNPPNLDQNLDTEPESSISQWGELQLNPSGKTLSSQPS 180
Query: 2092 SESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSANEDPEEFPNE 2151
+ T+ + P SES + P T+ +SP + + ++ + +E+ S N P +P+
Sbjct: 181 RKPTSEN-PKSESDNSKLP----TSVNSPLPDKSLLKRTLSNFWAERNSYNWKPLVYPSC 235
Query: 2152 DVFEHTF 2158
EH F
Sbjct: 236 PS-EHIF 241
Score = 39.5 bits (92), Expect = 0.026
Identities = 35/216 (16%), Positives = 64/216 (29%), Gaps = 10/216 (4%)
Query: 1875 NNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSES 1934
N S+ L+ LS+ T N + ++
Sbjct: 53 NKVSKKDTFSDQLHDALSKEFT--LERERDRLQLNKRKYQAIRLQTSTPIVEIFKNNKDA 110
Query: 1935 TTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLV--- 1991
+ S S + T ++ + S + + P + + PES + L
Sbjct: 111 VDPPNHTRSSGNNLSNANVKTLSAPVGEHSRSNNPPNLDQNLDTEPESSISQWGELQLNP 170
Query: 1992 SESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPK 2051
S T +S P + T+ +P SES + P S ++ T S+ +E + N
Sbjct: 171 SGKTLSSQPSRKPTSE-NPKSESDNSKLPTSVNSPLPDKSLLKRTLSNFWAERNSYNWKP 229
Query: 2052 SESTT----TNNPASESITSSSPASESTTTSSPASE 2083
+ S+ I S S+
Sbjct: 230 LVYPSCPSEHIFSDSDVIIREDEPSSLIAFCLSTSD 265
Score = 33.0 bits (75), Expect = 2.2
Identities = 25/171 (14%), Positives = 50/171 (29%), Gaps = 17/171 (9%)
Query: 2009 SPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSS 2068
P+S S T S + +T + S S +S + N S ++ +
Sbjct: 5 PPISRSGTGISMTHDKSTRPNDRSMSNDSSLCGLNQASDANGNEYSPNNKVSKKDTFSDQ 64
Query: 2069 SPASESTTTSSPASES-------------TTTSSPASESTTTSSPASESTTTSSPESEST 2115
+ S + TS+P E + A + + +
Sbjct: 65 LHDALSKEFTLERERDRLQLNKRKYQAIRLQTSTPIVEIFKNNKDAVDPPNHTRSSGNNL 124
Query: 2116 TTSSPASESTTIEEQGVSPHSEKLSANEDPEEFPNEDVFEHTFAEIPNIDH 2166
+ ++ + S + E S + L N D E + + E+
Sbjct: 125 SNANVKTLSAPVGEHSRSNNPPNLDQNLDTE----PESSISQWGELQLNPS 171
>gnl|CDD|225828 COG3291, COG3291, FOG: PKD repeat [General function prediction only].
Length = 297
Score = 46.0 bits (109), Expect = 2e-04
Identities = 36/242 (14%), Positives = 81/242 (33%), Gaps = 15/242 (6%)
Query: 1896 TTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPE-SES 1954
+T +P S + E+ + + + V+ + ++ T T+ E SE+
Sbjct: 21 STGTPTSWIWDFGDGENSTEQNPIHTYKKVGNYT-VNLTVENAAGSDTETKTNYIEVSEA 79
Query: 1955 TTTSSLVSESTTTSSPESES-TTTSSPESES---------TTTSSLVSESTTTSSPESES 2004
+ + T+ +P + + T TS+ E+ S TTS+ + T + + +
Sbjct: 80 PPVADFTANPTSGYAPLTVNFTDTSTNEATSWSWDFGDGGVTTSTEQNPVHTYTDAGTYT 139
Query: 2005 TTTISPVSEST-TTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASE 2063
T VS ST + S ++ T E + ++ T +++ N +S
Sbjct: 140 VTLT--VSNSTGSDSKTKTDYVTVSEEGIEEAVPEAASTVVTKPLTVSGTESSSGNLSSW 197
Query: 2064 SITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASE 2123
++T +P + S T ++ + + +
Sbjct: 198 VYVFEDDKGTNSTVKTPLLGGVIKVTLGSPLPDTVVYPTDKEGKGYYITLTGNGEFSFVD 257
Query: 2124 ST 2125
Sbjct: 258 VV 259
Score = 45.3 bits (107), Expect = 3e-04
Identities = 44/267 (16%), Positives = 88/267 (32%), Gaps = 27/267 (10%)
Query: 1872 FTTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTS-SL 1930
FT + T + + + + + ++ T T+
Sbjct: 17 FTDGSTGTPTSWIWDFGDGENSTEQNPIHTYKKVGNYT-VNLTVENAAGSDTETKTNYIE 75
Query: 1931 VSESTTTSSPESESTTTSSPESES-TTTSSLVSES---------TTTSSPESESTTTSSP 1980
VSE+ + + T+ +P + + T TS+ + S TTS+ ++ T +
Sbjct: 76 VSEAPPVADFTANPTSGYAPLTVNFTDTSTNEATSWSWDFGDGGVTTSTEQNPVHTYTDA 135
Query: 1981 ESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSP 2040
+ + T + VS ST + S T ++ E + P + ST P + S T SS
Sbjct: 136 GTYTVTLT--VSNSTGSDS--KTKTDYVTVSEEGIEEAVPEAASTVVTKPLTVSGTESSS 191
Query: 2041 ASESTTT--------NNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPAS 2092
+ S+ N ++ ++ S P + T E +
Sbjct: 192 GNLSSWVYVFEDDKGTNSTVKTPLLGGVIKVTLGSPLPDTVVYPTD---KEGKGYYITLT 248
Query: 2093 ESTTTSSPASESTTTSSPESESTTTSS 2119
+ S + + SE+ + S
Sbjct: 249 GNGEFSFVDVVAYVKNGDWSENNSPSE 275
Score = 41.4 bits (97), Expect = 0.005
Identities = 38/218 (17%), Positives = 69/218 (31%), Gaps = 25/218 (11%)
Query: 1861 ISVIDNYSEIIFTTNNNSES---TVVMSTLNSLLSEN--------TTTNSPE-SESTTTN 1908
I V + FT N S TV + ++ + + T S E + T
Sbjct: 74 IEVSEAPPVADFTANPTSGYAPLTVNFTDTSTNEATSWSWDFGDGGVTTSTEQNPVHTYT 133
Query: 1909 NPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTS 1968
+ + + T + S + + + T + E + PE+ ST + ++ S T S
Sbjct: 134 DAGTYTVTLTVSNSTGSDSKT----KTDYVTVSEEGIEEAVPEAASTVVTKPLTVSGTES 189
Query: 1969 SPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTI 2028
S + S+ E + T S T +P ++ S T ++
Sbjct: 190 SSGNLSSWVYVFEDDKGTNS-------TVKTPLLGGVIKVTLGSPLPDTVVYPTDKEGKG 242
Query: 2029 SPESESTTTSSPASESTTTNNPKSESTTTNNPASESIT 2066
+ + + S NN SE I
Sbjct: 243 YYITLTGNGEFSFVDVVAYVKNGDWS--ENNSPSEYID 278
>gnl|CDD|218107 pfam04484, DUF566, Family of unknown function (DUF566). Family of
related proteins that is plant specific.
Length = 313
Score = 45.7 bits (108), Expect = 2e-04
Identities = 38/132 (28%), Positives = 54/132 (40%), Gaps = 7/132 (5%)
Query: 1987 TSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTT 2046
+ S S S SSP S S +S ST+ SS SP S S ++ +S S+
Sbjct: 4 SVSSGSTSGDASSPRSSSRRRLSSSFLSTSASSRPRRLNAPASPPSSSPARNT-SSSSSF 62
Query: 2047 TNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTT 2106
+ + S+ + S S S S S S S +T ++S +S SP+ T
Sbjct: 63 GLSKQRPSSLSRGRLSSRFVSPSRGSPSAAASLNGSLATASTSGSS------SPSRSRRT 116
Query: 2107 TSSPESESTTTS 2118
TSS S S
Sbjct: 117 TSSDLSSGNGPS 128
Score = 39.9 bits (93), Expect = 0.013
Identities = 33/149 (22%), Positives = 49/149 (32%), Gaps = 11/149 (7%)
Query: 1910 PESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSS 1969
S +TS S S S SS ++ +S P + S S +S
Sbjct: 3 ASVSSGSTSGDAS-----SPRSSSRRRLSSSFLSTSASSRPRRLNAPASPPSSSPARNTS 57
Query: 1970 PESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTT----SSPVSEST 2025
S S+ S + S+ + +S + S S S S +T + SSP
Sbjct: 58 --SSSSFGLSKQRPSSLSRGRLSSRFVSPSRGSPSAAASLNGSLATASTSGSSSPSRSRR 115
Query: 2026 TTISPESESTTTSSPASESTTTNNPKSES 2054
TT S S S + + K S
Sbjct: 116 TTSSDLSSGNGPSVLSFMADVKRGKKGPS 144
Score = 37.6 bits (87), Expect = 0.064
Identities = 40/152 (26%), Positives = 52/152 (34%), Gaps = 27/152 (17%)
Query: 1937 TSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTT 1996
+ S S S SSP S S S ST+ SS +SP S S ++ S S
Sbjct: 4 SVSSGSTSGDASSPRSSSRRRLSSSFLSTSASSRPRRLNAPASPPSSSPARNTSSSSSFG 63
Query: 1997 TSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTT 2056
S S S S+ +SP S S S + S +T
Sbjct: 64 LSKQRPSS-------------LSRGRLSSRFVSP--------SRGSPSAAASLNGSLATA 102
Query: 2057 TNNPASESITSSSPASESTTTSSPASESTTTS 2088
+ + SSSP+ TTSS S S
Sbjct: 103 STSG------SSSPSRSRRTTSSDLSSGNGPS 128
Score = 35.3 bits (81), Expect = 0.32
Identities = 27/108 (25%), Positives = 42/108 (38%), Gaps = 1/108 (0%)
Query: 2027 TISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTT 2086
++S S S SSP S S + ST+ ++ +SP S S ++ +S S+
Sbjct: 4 SVSSGSTSGDASSPRSSSRRRLSSSFLSTSASSRPRRLNAPASPPSSSPARNT-SSSSSF 62
Query: 2087 TSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVSP 2134
S S+ + S + S S S S S +T SP
Sbjct: 63 GLSKQRPSSLSRGRLSSRFVSPSRGSPSAAASLNGSLATASTSGSSSP 110
Score = 33.4 bits (76), Expect = 1.3
Identities = 21/75 (28%), Positives = 31/75 (41%), Gaps = 1/75 (1%)
Query: 2067 SSSPASESTTTSSPASES-TTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASEST 2125
S S S S SSP S S SS ++ +S P + S P S +S +S
Sbjct: 4 SVSSGSTSGDASSPRSSSRRRLSSSFLSTSASSRPRRLNAPASPPSSSPARNTSSSSSFG 63
Query: 2126 TIEEQGVSPHSEKLS 2140
+++ S +LS
Sbjct: 64 LSKQRPSSLSRGRLS 78
Score = 33.0 bits (75), Expect = 2.0
Identities = 34/131 (25%), Positives = 53/131 (40%), Gaps = 15/131 (11%)
Query: 1897 TNSPESESTTTNNPESESTTTSSPESESTTTSS----LVSESTTTSSPESESTTTSSPES 1952
+ S S S ++P S S S ST+ SS L + ++ SS + +T++SS
Sbjct: 4 SVSSGSTSGDASSPRSSSRRRLSSSFLSTSASSRPRRLNAPASPPSSSPARNTSSSSSFG 63
Query: 1953 -----ESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTT 2007
S+ + +S + S S S S S +T ++S SSP TT
Sbjct: 64 LSKQRPSSLSRGRLSSRFVSPSRGSPSAAASLNGSLATASTSGS------SSPSRSRRTT 117
Query: 2008 ISPVSESTTTS 2018
S +S S
Sbjct: 118 SSDLSSGNGPS 128
Score = 31.5 bits (71), Expect = 5.9
Identities = 29/119 (24%), Positives = 47/119 (39%), Gaps = 2/119 (1%)
Query: 1874 TNNNSESTVVMSTLNSLLSENTTTNSPESES-TTTNNPESESTTTSSPESESTTTSSLVS 1932
+ +S + L+S + ++ P + + S + TSS S + S
Sbjct: 12 GDASSPRSSSRRRLSSSFLSTSASSRPRRLNAPASPPSSSPARNTSSSSSFGLSKQRPSS 71
Query: 1933 ESTTTSSPESESTTTSSPESESTTTSSLVSESTT-TSSPESESTTTSSPESESTTTSSL 1990
S S S + SP + ++ SL + ST+ +SSP TTSS S S L
Sbjct: 72 LSRGRLSSRFVSPSRGSPSAAASLNGSLATASTSGSSSPSRSRRTTSSDLSSGNGPSVL 130
>gnl|CDD|114205 pfam05467, Herpes_U47, Herpesvirus glycoprotein U47.
Length = 627
Score = 46.4 bits (109), Expect = 2e-04
Identities = 51/249 (20%), Positives = 100/249 (40%), Gaps = 11/249 (4%)
Query: 1895 TTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESES 1954
T +++P S S + +P ST +PE V+++ T ++ T ++P +
Sbjct: 240 TPSSTPSSTSASITSPHIPSTNIPTPE------PPPVTKNFTELHTDTIKVTPNTPTITA 293
Query: 1955 TTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSES 2014
TT S+ + P T T P +++ +E T + E+ + E+
Sbjct: 294 QTTESIKKIVKRSDFPRPMYTPTDIPTLTIRLNATIKTEQNTENPTENPKSPPKPTNFEN 353
Query: 2015 TTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASES 2074
TT P + + T+ + + T ++ TT + + + SI S + +S
Sbjct: 354 TTIRIPETFESATV---ATNATQKIESTTFATTIGIEEINDNIYSSPKNSIYLKSKSQQS 410
Query: 2075 TTTSSPASESTTTSSPASESTTTSSPASESTTTSS-PESESTTTSSPASESTTIEEQGVS 2133
TT + A +T + + S +T + + TT ++E TI+ V+
Sbjct: 411 TTKFTDAEHTTPILKFTTWQDAARTYMSHNTEVQNMTDRFQRTTLKSSNELPTIQTLSVT 470
Query: 2134 PHSEKLSAN 2142
P +KL +N
Sbjct: 471 P-KKKLPSN 478
Score = 43.3 bits (101), Expect = 0.002
Identities = 64/279 (22%), Positives = 107/279 (38%), Gaps = 24/279 (8%)
Query: 1839 LSVSPYITNNLLISMLAATAVAISVIDNYSEIIFTTNNNSESTVVMSTLNSLLSENTTTN 1898
+++S + +NL S+ T + NY+ ++ N+ S + S N
Sbjct: 166 MAISKFSNSNLTRSLTPFTP---EIFFNYTSFVYFLLYNTTS-CIPSNDQYFEHSPKPIN 221
Query: 1899 SPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTS 1958
S N +S TTT S ST S + +SP ST +PE
Sbjct: 222 VTTSFGRAIVNFDSILTTTPSSTPSST--------SASITSPHIPSTNIPTPE------P 267
Query: 1959 SLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTS 2018
V+++ T ++ T ++P + TT S+ + P T T P +
Sbjct: 268 PPVTKNFTELHTDTIKVTPNTPTITAQTTESIKKIVKRSDFPRPMYTPTDIPTLTIRLNA 327
Query: 2019 SPVSESTTTISPESESTTTSSPASESTTTNNPKS-ESTTTNNPASESITSSSPASESTTT 2077
+ +E T E+ + E+TT P++ ES T A++ I S++ A TT
Sbjct: 328 TIKTEQNTENPTENPKSPPKPTNFENTTIRIPETFESATVATNATQKIESTTFA---TTI 384
Query: 2078 SSPASESTTTSSPASESTTTSSPASESTTTSSPESESTT 2116
SSP + S S+ +TT ++E TT
Sbjct: 385 GIEEINDNIYSSPKNSIYLKSK--SQQSTTKFTDAEHTT 421
Score = 42.9 bits (100), Expect = 0.002
Identities = 60/292 (20%), Positives = 121/292 (41%), Gaps = 25/292 (8%)
Query: 1877 NSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTT------TSSL 1930
N +S + + ++ S + + SP ST PE T + E + T T ++
Sbjct: 232 NFDSILTTTPSSTPSSTSASITSPHIPSTNIPTPEPPPVTKNFTELHTDTIKVTPNTPTI 291
Query: 1931 VSESTTTSSPESESTTTSSPESESTTTSSL-VSESTTTSSPESESTTTSSPESESTTTSS 1989
+++T + + + P T +L + + T + ++ T +P+S T+
Sbjct: 292 TAQTTESIKKIVKRSDFPRPMYTPTDIPTLTIRLNATIKTEQNTENPTENPKSPPKPTN- 350
Query: 1990 LVSESTTTSSPESESTTTISPVS----ESTT--TSSPVSESTTTISPESESTTTSSPASE 2043
E+TT PE+ + T++ + ESTT T+ + E I +++ S+
Sbjct: 351 --FENTTIRIPETFESATVATNATQKIESTTFATTIGIEEINDNIYSSPKNSIYLKSKSQ 408
Query: 2044 STTTNNPKSEST------TTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTT 2097
+TT +E T TT A+ + S + ++ T + +++ + T +
Sbjct: 409 QSTTKFTDAEHTTPILKFTTWQDAARTYMSHNTEVQNMTDRFQRTTLKSSNELPTIQTLS 468
Query: 2098 SSPASE--STTTSSPESESTTTSSPASEST-TIEEQGVSPHSEKLSANEDPE 2146
+P + S T+ E T + P+S S+ +I E P ++SA+ E
Sbjct: 469 VTPKKKLPSNVTAKTEVHITNNALPSSNSSHSITEVTEEPKHNRMSASTHEE 520
Score = 40.2 bits (93), Expect = 0.015
Identities = 52/220 (23%), Positives = 94/220 (42%), Gaps = 18/220 (8%)
Query: 1925 TTTSSLVSESTTTSSPESESTTTSSPESESTTTS--SLVSESTTTSSPESESTTTSSPES 1982
T S ++ S ++S + S T +PE TS + +TT+ P ++ SP+
Sbjct: 160 TDESLQMAISKFSNSNLTRSLTPFTPEIFFNYTSFVYFLLYNTTSCIPSNDQYFEHSPKP 219
Query: 1983 ESTTTS--------SLVSESTTTSSPESESTTTISPVSESTTTS----SPVSESTTTISP 2030
+ TTS + +T +S+P S S + SP ST PV+++ T +
Sbjct: 220 INVTTSFGRAIVNFDSILTTTPSSTPSSTSASITSPHIPSTNIPTPEPPPVTKNFTELHT 279
Query: 2031 ESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSP 2090
++ T ++P + TT + K ++ P + P ++ +E T +
Sbjct: 280 DTIKVTPNTPTITAQTTESIKKIVKRSDFPRPMYTPTDIPTLTIRLNATIKTEQNTENPT 339
Query: 2091 ASESTTTSSPASESTTTSSPES-ESTTTSSPAS---ESTT 2126
+ + E+TT PE+ ES T ++ A+ ESTT
Sbjct: 340 ENPKSPPKPTNFENTTIRIPETFESATVATNATQKIESTT 379
Score = 39.4 bits (91), Expect = 0.023
Identities = 52/270 (19%), Positives = 108/270 (40%), Gaps = 29/270 (10%)
Query: 1893 ENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVS----ESTTTSSPE------- 1941
ENTT PE+ + T T ++ + ESTT ++ + SSP+
Sbjct: 352 ENTTIRIPETFESAT------VATNATQKIESTTFATTIGIEEINDNIYSSPKNSIYLKS 405
Query: 1942 -SESTTTSSPESESTT------TSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSES 1994
S+ +TT ++E TT T + + + + E ++ T + +++ L +
Sbjct: 406 KSQQSTTKFTDAEHTTPILKFTTWQDAARTYMSHNTEVQNMTDRFQRTTLKSSNELPTIQ 465
Query: 1995 TTTSSPESESTTTISPVSESTTT-----SSPVSESTTTISPESESTTTSSPASESTTTNN 2049
T + +P+ + + ++ +E T SS S S T ++ E + S+ E
Sbjct: 466 TLSVTPKKKLPSNVTAKTEVHITNNALPSSNSSHSITEVTEEPKHNRMSASTHEEINHTE 525
Query: 2050 PKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSS 2109
+ N SE T+ + T + +S+ + STT P + ++ S+
Sbjct: 526 IAQITPILNAHTSEKSTTPQRPFTAETFLTTSSKPAILTWSNLLSTTPKEPLTNTSLRST 585
Query: 2110 PESESTTTSSPASESTTIEEQGVSPHSEKL 2139
+ T+S ++S + + +S + +
Sbjct: 586 DHITTQLTTSNRTQSAKLTKAHISSQTTNI 615
Score = 39.1 bits (90), Expect = 0.032
Identities = 51/232 (21%), Positives = 99/232 (42%), Gaps = 18/232 (7%)
Query: 1904 STTTNNPESESTTTSSPESESTTTS--SLVSESTTTSSPESESTTTSSPESESTTTS--- 1958
S +N+ + S T +PE TS + +TT+ P ++ SP+ + TTS
Sbjct: 169 SKFSNSNLTRSLTPFTPEIFFNYTSFVYFLLYNTTSCIPSNDQYFEHSPKPINVTTSFGR 228
Query: 1959 SLVS-ESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTT 2017
++V+ +S T++P S ++TS+ S T+ + S + T P PV+++ T
Sbjct: 229 AIVNFDSILTTTPSSTPSSTSA----SITSPHIPSTNIPTPEP--------PPVTKNFTE 276
Query: 2018 SSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTT 2077
+ T +P + TT S ++ P+ T T+ P +++ +E T
Sbjct: 277 LHTDTIKVTPNTPTITAQTTESIKKIVKRSDFPRPMYTPTDIPTLTIRLNATIKTEQNTE 336
Query: 2078 SSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEE 2129
+ + + E+TT P + + T + + S+ + + IEE
Sbjct: 337 NPTENPKSPPKPTNFENTTIRIPETFESATVATNATQKIESTTFATTIGIEE 388
Score = 31.7 bits (71), Expect = 6.4
Identities = 25/96 (26%), Positives = 47/96 (48%), Gaps = 11/96 (11%)
Query: 2054 STTTNNPASESITSSSPASESTTTSSPAS--------ESTTTSSPASESTTTSSPASEST 2105
+TT+ P+++ SP + TTS + +T +S+P+S S + +SP ST
Sbjct: 201 NTTSCIPSNDQYFEHSPKPINVTTSFGRAIVNFDSILTTTPSSTPSSTSASITSPHIPST 260
Query: 2106 TTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSA 2141
+PE T + + TI+ V+P++ ++A
Sbjct: 261 NIPTPEPPPVTKNFTELHTDTIK---VTPNTPTITA 293
Score = 31.7 bits (71), Expect = 6.5
Identities = 35/156 (22%), Positives = 68/156 (43%), Gaps = 7/156 (4%)
Query: 1875 NNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSES 1934
+N + T V T N+L S N++ + E +N S ST +E + +++
Sbjct: 477 SNVTAKTEVHITNNALPSSNSSHSITEVTEEPKHNRMSASTHEEINHTEIAQITPILNAH 536
Query: 1935 TTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSES 1994
T+ S + T+ +++ ++++ S S+ E T +S S T+ L
Sbjct: 537 TSEKSTTPQRPFTAETFLTTSSKPAILTWSNLLSTTPKEPLTNTSLRSTDHITTQL---- 592
Query: 1995 TTTSSPESESTTTISPVSESTTTSSP--VSESTTTI 2028
TTS+ + T + +S TT P ++E +T +
Sbjct: 593 -TTSNRTQSAKLTKAHISSQTTNIYPQTITERSTDV 627
>gnl|CDD|217835 pfam03999, MAP65_ASE1, Microtubule associated protein (MAP65/ASE1
family).
Length = 619
Score = 46.4 bits (110), Expect = 2e-04
Identities = 34/163 (20%), Positives = 51/163 (31%), Gaps = 13/163 (7%)
Query: 1962 SESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPV 2021
ST +S P + ST + S T S + + T SS S+ + IS + +T S
Sbjct: 461 YGSTESSVPSTPSTRRNDRNITSNTPSLKRTPNLTKSS-LSQEASLISKSTGNTHKHSTP 519
Query: 2022 SESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPA 2081
TT ++ S N + S SS + S +
Sbjct: 520 RRLTTL------PKLPAASRSSKGNLIRSG------ANGNASSDLSSPGSINSKSPEHSV 567
Query: 2082 SESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASES 2124
STT ++ ST + SP ES
Sbjct: 568 PLVRVFDIHLRASTTKGRHSTPSTNEKKKRLLKRSPLSPPKES 610
Score = 44.1 bits (104), Expect = 0.001
Identities = 33/178 (18%), Positives = 62/178 (34%), Gaps = 9/178 (5%)
Query: 1936 TTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSEST 1995
+S E S+ S +T S+ ++ TS+ T S + + T SSL S+
Sbjct: 451 NKTSTVMEPPYGSTESSVPSTPSTRRNDRNITSN------TPSLKRTPNLTKSSL-SQEA 503
Query: 1996 TTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSEST 2055
+ S + +T S TT + S ++ + ++ + S ++ S
Sbjct: 504 SLISKSTGNTHKHSTPRRLTTLPKLPAASRSS-KGNLIRSG-ANGNASSDLSSPGSINSK 561
Query: 2056 TTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESE 2113
+ + STT ++ ST + SP ES T+ +
Sbjct: 562 SPEHSVPLVRVFDIHLRASTTKGRHSTPSTNEKKKRLLKRSPLSPPKESVATTPRLNS 619
Score = 40.2 bits (94), Expect = 0.013
Identities = 36/162 (22%), Positives = 59/162 (36%), Gaps = 9/162 (5%)
Query: 1885 STLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESES 1944
ST+ +T ++ P + ST N+ S T S + + T SSL S+ + S + +
Sbjct: 454 STVMEPPYGSTESSVPSTPSTRRNDRNITSNTPSLKRTPNLTKSSL-SQEASLISKSTGN 512
Query: 1945 TTTSSPESESTTTSSLVSEST--------TTSSPESESTTTSSPESESTTTSSLVSESTT 1996
T S TT L + S + ++ + S +S S + V
Sbjct: 513 THKHSTPRRLTTLPKLPAASRSSKGNLIRSGANGNASSDLSSPGSINSKSPEHSVPLVRV 572
Query: 1997 TSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTS 2038
STT + ST + +SP ES T+
Sbjct: 573 FDIHLRASTTKGRHSTPSTNEKKKRLLKRSPLSPPKESVATT 614
Score = 40.2 bits (94), Expect = 0.013
Identities = 35/189 (18%), Positives = 55/189 (29%), Gaps = 24/189 (12%)
Query: 1900 PESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSS 1959
ST P ST +S P + ST + S T S + + T SS S+ + S
Sbjct: 450 ANKTSTVMEPP-YGSTESSVPSTPSTRRNDRNITSNTPSLKRTPNLTKSS-LSQEASLIS 507
Query: 1960 LVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSS 2019
+ +T S TT + S S I ++
Sbjct: 508 KSTGNTHKHSTPRRLTTLPKLPAASR----------------SSKGNLIRS------GAN 545
Query: 2020 PVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSS 2079
+ S + S + + STT ++ S + S
Sbjct: 546 GNASSDLSSPGSINSKSPEHSVPLVRVFDIHLRASTTKGRHSTPSTNEKKKRLLKRSPLS 605
Query: 2080 PASESTTTS 2088
P ES T+
Sbjct: 606 PPKESVATT 614
Score = 34.8 bits (80), Expect = 0.75
Identities = 27/129 (20%), Positives = 47/129 (36%), Gaps = 12/129 (9%)
Query: 2041 ASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSP 2100
A++++T P ST ++ P++ S + S T S + + T SS S+ + S
Sbjct: 450 ANKTSTVMEPPYGSTESSVPSTPSTRRNDRNITSNTPSLKRTPNLTKSS-LSQEASLISK 508
Query: 2101 ASESTTTSSPESESTTTSSPA----SESTTIEEQGVSP-HSEKLSANED------PEEFP 2149
++ +T S TT S + G + S LS+ P
Sbjct: 509 STGNTHKHSTPRRLTTLPKLPAASRSSKGNLIRSGANGNASSDLSSPGSINSKSPEHSVP 568
Query: 2150 NEDVFEHTF 2158
VF+
Sbjct: 569 LVRVFDIHL 577
Score = 34.1 bits (78), Expect = 1.1
Identities = 25/141 (17%), Positives = 47/141 (33%), Gaps = 9/141 (6%)
Query: 2006 TTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESI 2065
T + P ST +S P + ST S T S + + T ++ S+ + + ++ +
Sbjct: 455 TVMEPPYGSTESSVPSTPSTRRNDRNITSNTPSLKRTPNLTKSS-LSQEASLISKSTGNT 513
Query: 2066 TSSSPASESTTTSSPASEST--------TTSSPASESTTTSSPASESTTTSSPESESTTT 2117
S TT + S + ++ + S +S + S +
Sbjct: 514 HKHSTPRRLTTLPKLPAASRSSKGNLIRSGANGNASSDLSSPGSINSKSPEHSVPLVRVF 573
Query: 2118 SSPASESTTIEEQGVSPHSEK 2138
STT +EK
Sbjct: 574 DIHLRASTTKGRHSTPSTNEK 594
>gnl|CDD|236776 PRK10856, PRK10856, cytoskeletal protein RodZ; Provisional.
Length = 331
Score = 45.8 bits (109), Expect = 2e-04
Identities = 17/115 (14%), Positives = 39/115 (33%), Gaps = 7/115 (6%)
Query: 1988 SSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTT 2047
+++ +S S+ S+++ P+ STTT +TT TT ++ + + T
Sbjct: 144 TTMADQS---SAELSQNSGQSVPLDTSTTT----DPATTPAPAAPVDTTPTNSQTPAVAT 196
Query: 2048 NNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPAS 2102
+ N + S + A+ + + +T +
Sbjct: 197 APAPAVDPQQNAVVAPSQANVDTAATPAPAAPATPDGAAPLPTDQAGVSTPAADP 251
Score = 45.0 bits (107), Expect = 3e-04
Identities = 17/101 (16%), Positives = 37/101 (36%), Gaps = 4/101 (3%)
Query: 2008 ISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITS 2067
+ +S+++ S P+ STTT + +T + ++T TN+ T PA +
Sbjct: 151 SAELSQNSGQSVPLDTSTTT---DPATTPAPAAPVDTTPTNSQTPAVATAPAPAVDP-QQ 206
Query: 2068 SSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTS 2108
++ + S A+ + + +T
Sbjct: 207 NAVVAPSQANVDTAATPAPAAPATPDGAAPLPTDQAGVSTP 247
Score = 42.7 bits (101), Expect = 0.002
Identities = 17/105 (16%), Positives = 36/105 (34%), Gaps = 4/105 (3%)
Query: 2018 SSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTT 2077
S+ +S+++ P STTT + + +T TN+ T+ +PA +
Sbjct: 151 SAELSQNSGQSVPLDTSTTTDPATTPAPAAPVD---TTPTNSQTPAVATAPAPAVDPQQN 207
Query: 2078 SSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPAS 2122
+ A S A+ + + + +T +
Sbjct: 208 AVVAP-SQANVDTAATPAPAAPATPDGAAPLPTDQAGVSTPAADP 251
Score = 37.3 bits (87), Expect = 0.087
Identities = 14/98 (14%), Positives = 33/98 (33%), Gaps = 2/98 (2%)
Query: 1875 NNNSESTVVMSTLNSLLSENTTTNSPESESTT-TNNPESESTTTSSPESESTTTSSLV-S 1932
+ + V ++ TT TT TN+ T +P + + + S
Sbjct: 154 LSQNSGQSVPLDTSTTTDPATTPAPAAPVDTTPTNSQTPAVATAPAPAVDPQQNAVVAPS 213
Query: 1933 ESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSP 1970
++ ++ ++P+ + + ST + P
Sbjct: 214 QANVDTAATPAPAAPATPDGAAPLPTDQAGVSTPAADP 251
Score = 35.0 bits (81), Expect = 0.49
Identities = 29/132 (21%), Positives = 47/132 (35%), Gaps = 12/132 (9%)
Query: 2067 SSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTT 2126
SS+ S+++ S P ST ++ TT +PA+ TT + ++PA
Sbjct: 150 SSAELSQNSGQSVPLDTST-----TTDPATTPAPAAPVDTTPTNSQTPAVATAPAPAVDP 204
Query: 2127 IEEQGVSPHSEKLSANEDPEEFPNEDVFEHTFAEIPNIDHSNQTDEAIPE----TFDARE 2182
+ V+P + + P A +P T A P F A +
Sbjct: 205 QQNAVVAPSQA--NVDTAATPAPAAPATPDGAAPLPTDQAGVSTPAADPNALVMNFTA-D 261
Query: 2183 EWPQCKDVIGKV 2194
W + D GK
Sbjct: 262 CWLEVTDATGKK 273
>gnl|CDD|217495 pfam03326, Herpes_TAF50, Herpesvirus transcription activation factor
(transactivator). This family includes EBV BRLF1 and
similar ORF 50 proteins from other herpesviruses.
Length = 500
Score = 45.9 bits (109), Expect = 2e-04
Identities = 46/252 (18%), Positives = 78/252 (30%), Gaps = 24/252 (9%)
Query: 1909 NPESESTTTSSPESE--STTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTT 1966
NPE T +SP S+ T + + + P + SS + + + S V ++T
Sbjct: 198 NPEEILETRASPLSQFHGFTPHPSLPQPQSPLKP-----SPSSARPQQSESFSDVWPAST 252
Query: 1967 TSSPESESTTTSSPESESTTTSSLVSE--STTTSSPESESTTTISPVSESTTTSSPVSES 2024
S E S +P S S+ S E +S S V +S+ + +
Sbjct: 253 QSPREETSAEPLAPASPSSRRPSTAQEEQIACSSPQAEPEQGVQSYVPQSSDSRPSCFPA 312
Query: 2025 TTTISP--------ESESTTTSSPASESTTTNNPKSESTTTNNP--ASESITSSSPASES 2074
+T P + A S S S
Sbjct: 313 PSTTQPTFLPPNTNKKAKRDRRPQMVTPKQEGGAAVSQNHDGGTVRAPRGRPSGSGQSPP 372
Query: 2075 TTTSSPASESTTTSSPASESTTTSSPA--SESTTTSSPESESTTTSSPASESTTIEEQGV 2132
+ + +S + T S A + + PA + +S + T SS + EQ +
Sbjct: 373 SNSPLLSSLADTPSGAAHQPASLLPPAVVQQQLEDASDKQPPTPGSSLVPQPD---EQEL 429
Query: 2133 SPHSEKLSANED 2144
P L +
Sbjct: 430 GPSVMALLDRDQ 441
Score = 32.4 bits (74), Expect = 3.5
Identities = 49/211 (23%), Positives = 77/211 (36%), Gaps = 24/211 (11%)
Query: 1970 PESESTTTSSPESESTTTSSLVSESTTTSS---PESESTTTISPVSESTTTSSP-VSEST 2025
+ + T S E + + + S + S PE + +SE ++ S
Sbjct: 139 DDVKLCTQGSAERKRPPHTGIFSGLVSQQSFVLPEP----LLLEISEPGLLAASDADLSE 194
Query: 2026 TTISPESESTTTSSPASE--STTTNNPKSESTTTNNPASESITSSSPA-SESTTTSSPAS 2082
+PE T +SP S+ T + + + P S +S+ P SES + PAS
Sbjct: 195 LLQNPEEILETRASPLSQFHGFTPHPSLPQPQS---PLKPSPSSARPQQSESFSDVWPAS 251
Query: 2083 ESTTTSSPASESTTTS-SPASESTTTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSA 2141
T SP E++ +PAS S+ S E S Q P S+
Sbjct: 252 ----TQSPREETSAEPLAPASPSSRRPSTAQEEQIACSSPQAEPEQGVQSYVP----QSS 303
Query: 2142 NEDPEEFPNEDVFEHTFAEIPNIDHSNQTDE 2172
+ P FP + TF PN + + D
Sbjct: 304 DSRPSCFPAPSTTQPTFLP-PNTNKKAKRDR 333
>gnl|CDD|219916 pfam08580, KAR9, Yeast cortical protein KAR9. The KAR9 protein in
Saccharomyces cerevisiae is a cytoskeletal protein
required for karyogamy, correct positioning of the
mitotic spindle and for orientation of cytoplasmic
microtubules. KAR9 localises at the shmoo tip in mating
cells and at the tip of the growing bud in anaphase.
Length = 626
Score = 46.0 bits (109), Expect = 2e-04
Identities = 43/282 (15%), Positives = 82/282 (29%), Gaps = 25/282 (8%)
Query: 1863 VIDNYSEIIFTTNNNS--ESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTT---- 1916
V+D+ ++ +S V + S + T S S+ P
Sbjct: 360 VVDHVLRDSQSSKIQQIRDSISVSGSDYSNPGSSIDTPSSSPSSSVIMTPPDSGPGSNVS 419
Query: 1917 ----TSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPES 1972
+ + L+ + S S S + + + ST P
Sbjct: 420 SRRVGTPGSKSDRVGAVLLRRMNIKPTLASIPDEKPSNISVFEDSETSPNSSTLLRDPPP 479
Query: 1973 ESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPES 2032
+ S + + + + ++ P S S ++ T + S+ ++ S
Sbjct: 480 KKCGEESGHLPNNPFFNKLKLTLSSIPPLSPRQ---SIITLPTPSRPASRISSLSLRLGS 536
Query: 2033 ESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPAS 2092
S + SP T + + + N S++ P + + P
Sbjct: 537 YSGSIVSPPPYPTLVSRKGAAGLSFN----RSVSDIEGERIGRYNLLP---TRIPALPFK 589
Query: 2093 ESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVSP 2134
+TTSS S SS S + P S E + P
Sbjct: 590 AESTTSSRRS-----SSLPSPTGVIGFPGSVPRFDHENLLPP 626
Score = 44.9 bits (106), Expect = 6e-04
Identities = 43/254 (16%), Positives = 73/254 (28%), Gaps = 9/254 (3%)
Query: 1864 IDNYSEIIFTTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESE 1923
I++ S+ I T + S L+ ++ T + + S
Sbjct: 315 IESKSKTISKTFTLIYKALEESILDKGVASRTNREL-APKWLSLKTVVDHVLRDSQSSKI 373
Query: 1924 STTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESE 1983
S+ + S+P S T SS S S + S + S T
Sbjct: 374 QQIRDSISVSGSDYSNPGSSIDTPSSSPSSSVIMTPPDSGPGSNVSSRRVGT---PGSKS 430
Query: 1984 STTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASE 2043
+ L+ + S S +S + + + ST P + S
Sbjct: 431 DRVGAVLLRRMNIKPTLASIPDEKPSNISVFEDSETSPNSSTLLRDPPPKKCGEESGHLP 490
Query: 2044 STTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPA-SESTTTSSPAS 2102
NNP + ++ T S PAS ++ S S S + SP
Sbjct: 491 ----NNPFFNKLKLTLSSIPPLSPRQSIITLPTPSRPASRISSLSLRLGSYSGSIVSPPP 546
Query: 2103 ESTTTSSPESESTT 2116
T S + +
Sbjct: 547 YPTLVSRKGAAGLS 560
Score = 44.5 bits (105), Expect = 7e-04
Identities = 57/308 (18%), Positives = 99/308 (32%), Gaps = 38/308 (12%)
Query: 1870 IIFTTNN---NSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTS-------S 1919
++FT N V +L L + TT ++ +T T+ ES+S T S
Sbjct: 272 LLFTNLNHELQKMLDSVERSLQKLQNNKTTGMHLDNRTTMTDQIESKSKTISKTFTLIYK 331
Query: 1920 PESESTTTSSLVSESTTTSSP--------ESESTTTSSPESESTTTSSLVSESTTTSSPE 1971
ES + S + +P S S+ + S+P
Sbjct: 332 ALEESILDKGVASRTNRELAPKWLSLKTVVDHVLRDSQSSKIQQIRDSISVSGSDYSNPG 391
Query: 1972 SESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPE 2031
S T SS S S +++ + S +P S+S + + I P
Sbjct: 392 SSIDTPSSSPSSSV----IMTPPDSGPGSNVSSRRVGTPGSKSDRVGAVLL-RRMNIKPT 446
Query: 2032 SESTTTSSP---ASESTTTNNPKSESTTTNNPASESITSSSPASES---------TTTSS 2079
S P + + +P + ST +P + S + T +S
Sbjct: 447 LASIPDEKPSNISVFEDSETSP-NSSTLLRDPPPKKCGEESGHLPNNPFFNKLKLTLSSI 505
Query: 2080 PASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVSPHSEKL 2139
P S + T + + S+ + S S + SP T + +G + S
Sbjct: 506 PPLSPR--QSIITLPTPSRPASRISSLSLRLGSYSGSIVSPPPYPTLVSRKGAAGLSFNR 563
Query: 2140 SANEDPEE 2147
S ++ E
Sbjct: 564 SVSDIEGE 571
>gnl|CDD|215570 PLN03091, PLN03091, hypothetical protein; Provisional.
Length = 459
Score = 45.7 bits (108), Expect = 2e-04
Identities = 54/220 (24%), Positives = 91/220 (41%), Gaps = 16/220 (7%)
Query: 1875 NNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSES 1934
++ S+VV + LN L ++N S + S S E ES+++S + + +
Sbjct: 145 KSDKASSVVSNELNLLKADN----SKPLAALQEKRSSSISPAGYQLEVESSSSSKINNSN 200
Query: 1935 TTTSSPESESTTTSSPES--ESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVS 1992
S + T T + + + TTS ES+TTS S+ + + +++ +S
Sbjct: 201 NNNHSNSNLMTPTPNKDFFLDRFTTSH---ESSTTSCRPSDLVGHFPFQQLNYASNARLS 257
Query: 1993 ESTTTS---SPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNN 2049
+ + S S+S S S S T S T++ P T S S S ++N
Sbjct: 258 TNPNPTLWFSQNSKSFEMNSEFSSSMTPSILPPSVTSSFLP----TPMSYKPSISLPSDN 313
Query: 2050 PKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSS 2089
P S T N + + S S S+ SS + E + SS
Sbjct: 314 PSIPSFTVNGVRNWEAGAFSNNSNSSNGSSSSIELQSNSS 353
Score = 31.9 bits (72), Expect = 4.4
Identities = 41/206 (19%), Positives = 72/206 (34%), Gaps = 19/206 (9%)
Query: 1937 TSSPESESTTTSSPESESTTTSSLVSESTTTSSPESE-STTTSSPESESTTTSSLVSEST 1995
T ++++ S E + + S S E ES+++S + + +
Sbjct: 142 TDDKSDKASSVVSNELNLLKADNSKPLAALQEKRSSSISPAGYQLEVESSSSSKINNSNN 201
Query: 1996 TTSSPESESTTT---------ISPVSESTTTSSPVSESTTTISPES---ESTTTSSPASE 2043
S + T T + ES+TTS S+ + S S
Sbjct: 202 NNHSNSNLMTPTPNKDFFLDRFTTSHESSTTSCRPSDLVGHFPFQQLNYASNARLSTNPN 261
Query: 2044 STTTNNPKSESTTTNNPASESITSSSPASESTTT--SSP----ASESTTTSSPASESTTT 2097
T + S+S N+ S S+T S T++ +P S S + +P+ S T
Sbjct: 262 PTLWFSQNSKSFEMNSEFSSSMTPSILPPSVTSSFLPTPMSYKPSISLPSDNPSIPSFTV 321
Query: 2098 SSPASESTTTSSPESESTTTSSPASE 2123
+ + S S S+ SS + E
Sbjct: 322 NGVRNWEAGAFSNNSNSSNGSSSSIE 347
>gnl|CDD|234504 TIGR04216, halo_surf_glyco, major cell surface glycoprotein. Members
of this family are the S-layer-forming halobacterial
major cell surface glycoprotein. The highest scores below
model cutoffs are fragmentary paralogs to actual members
of the family. Modifications include at N-linked and
O-linked glycosylation, a C-terminal diphytanylglyceryl
modification, and probable cleavage of the PGF-CTERM
tail.
Length = 782
Score = 46.0 bits (109), Expect = 2e-04
Identities = 31/151 (20%), Positives = 57/151 (37%), Gaps = 15/151 (9%)
Query: 1869 EIIFTTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTS 1928
E ++ + V + N +NT T N + S T S + ++
Sbjct: 623 ESVYNPVEAGGTLEVAGSTNRKPDDNTIT-------VELLNEDDTSVTLESTDEWNSDGQ 675
Query: 1929 SLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTS 1988
V + + + ++ +V E+ ++TT+ P + +T T+
Sbjct: 676 WSVEVDLSDVETGNYTVEADDGDNTDRVNVEVVEET-----ERPDTTTSEDPTTTTTPTT 730
Query: 1989 SLVSESTTTSSPESESTTTISPVSESTTTSS 2019
+ E+T T+ P TTT P E+TT SS
Sbjct: 731 TGPEETTETAEPT---TTTEEPTEETTTGSS 758
Score = 39.1 bits (91), Expect = 0.031
Identities = 52/266 (19%), Positives = 97/266 (36%), Gaps = 27/266 (10%)
Query: 1855 AATAVAISVIDNYSEIIFTTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNN-PESE 1913
V++ D + E ++ TV L+S + +T + +
Sbjct: 519 NYQEVSVDSDDTFDEEDIDIGGLTQGTVTAHILSSGRDGEIGDTGTSNGATLNDLIGYLD 578
Query: 1914 STTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESE 1973
+ S E L + T+S + T T TT S+ + + E
Sbjct: 579 TYAGGSNTGEQIREQILSNTVDDTASDDLIVTETFRLADGLTTIESVYNPVEAGGTLEVA 638
Query: 1974 STTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESE 2033
+T P+ + T L + T S EST + S+ + V + + E+
Sbjct: 639 GSTNRKPDDNTITVELLNEDDT---SVTLESTDEWN--SDGQWS---VEVDLSDV--ETG 688
Query: 2034 STTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASE 2093
+ T + ++T N + + + ++TT+ P + +T T++ E
Sbjct: 689 NYTVEADDGDNTDRVNVE-------------VVEETERPDTTTSEDPTTTTTPTTTGPEE 735
Query: 2094 STTTSSPASESTTTSSPESESTTTSS 2119
+T T+ P +TTT P E+TT SS
Sbjct: 736 TTETAEP---TTTTEEPTEETTTGSS 758
Score = 39.1 bits (91), Expect = 0.039
Identities = 37/160 (23%), Positives = 63/160 (39%), Gaps = 16/160 (10%)
Query: 1977 TSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTT 2036
T+S + T T L TT S + + +T P TI+ E +
Sbjct: 602 TASDDLIVTETFRLADGLTTIESVYNPVEAGGTLEVAGSTNRKP---DDNTITVELLNED 658
Query: 2037 TSSPASESTTTNNPK---SESTTTNNPASESITSSSPASESTTTSS-------PASESTT 2086
+S EST N S ++ + + T + ++T + ++TT
Sbjct: 659 DTSVTLESTDEWNSDGQWSVEVDLSDVETGNYTVEADDGDNTDRVNVEVVEETERPDTTT 718
Query: 2087 TSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTT 2126
+ P + +T T++ E+T T+ P TTT P E+TT
Sbjct: 719 SEDPTTTTTPTTTGPEETTETAEPT---TTTEEPTEETTT 755
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 45.8 bits (108), Expect = 4e-04
Identities = 61/249 (24%), Positives = 105/249 (42%), Gaps = 31/249 (12%)
Query: 1897 TNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTT 1956
T SP + P ST + P S T + V +S T++ + T S + TT
Sbjct: 191 TPSPLGRMVNSYAP---STFMTMPRSHGITPRTTVKDSEITATYKMLETNPSKRTAGKTT 247
Query: 1957 TSSL--VSESTTTS-SPESESTTTSSPES---ESTTTSSLVSESTTTSSP-----ESEST 2005
+ L ++++T T + E E+ +SP S ++T T+ ES ++++ ++ T
Sbjct: 248 PTPLKGMTDNTPTFLTREVETDLLTSPRSVVEKNTLTTPRRVESNSSTNHWGLVGKNNLT 307
Query: 2006 TTISPVSESTTTSSPVSESTTTISPESESTTTSSPASEST-TTNNPKSESTTTN---NPA 2061
T V E T SE TIS + S+ + AS + NP S ++ A
Sbjct: 308 TPQGTVLEHT---PATSEGQVTISIMTGSSPAETKASTAAWKIRNPLSRTSAPAVRIASA 364
Query: 2062 SESITSSSPASESTTTSSPASESTTTSS--------PASESTTTSSPASESTTTSSPESE 2113
+ +P++ +T ++P + T+ PA TT SP+ TT PE+
Sbjct: 365 TFRGLEKNPSTAPSTPATPRVRAVLTTQVHHCVVVKPAPAVPTTPSPS--LTTALFPEAP 422
Query: 2114 STTTSSPAS 2122
S + S+
Sbjct: 423 SPSPSALPP 431
Score = 41.1 bits (96), Expect = 0.008
Identities = 63/277 (22%), Positives = 110/277 (39%), Gaps = 35/277 (12%)
Query: 1868 SEIIFTTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTT- 1926
++I TT N+ S T E ++P + S N+ S T+ +S T
Sbjct: 119 AKITPTTPKNNYSPTAAGT------ERVKEDTPATPSRALNHYIS---TSGRQRVKSYTP 169
Query: 1927 -TSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTT--TSSPESE 1983
V S+ T + E T SP + + ST + P S T T+ +SE
Sbjct: 170 KPRGEVKSSSPTQTREKVRKYTPSPLGRMVNS---YAPSTFMTMPRSHGITPRTTVKDSE 226
Query: 1984 STTTSSLVSESTTTSSPESESTTTISPVSESTTT--SSPVSESTTTISPES---ESTTTS 2038
T T ++ + + + + T + ++++T T + V E+ SP S ++T T+
Sbjct: 227 ITATYKMLETNPSKRTAGKTTPTPLKGMTDNTPTFLTREV-ETDLLTSPRSVVEKNTLTT 285
Query: 2039 SPASESTTTNNPKSESTTTN--NPASESITSSSPASESTTTSSPASESTTTSSPASE--- 2093
ES ++ N N P + + SE T S + S+ + AS
Sbjct: 286 PRRVESNSSTNHWGLVGKNNLTTPQGTVLEHTPATSEGQVTISIMTGSSPAETKASTAAW 345
Query: 2094 -----STTTSSPA---SESTTTSSPESESTTTSSPAS 2122
+ TS+PA + +T ++ ST S+PA+
Sbjct: 346 KIRNPLSRTSAPAVRIASATFRGLEKNPSTAPSTPAT 382
Score = 33.4 bits (76), Expect = 2.2
Identities = 36/178 (20%), Positives = 59/178 (33%), Gaps = 15/178 (8%)
Query: 1990 LVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNN 2049
+VS SS E E ++P + + S + T +P TT N
Sbjct: 74 MVSSDPPKSSSEME-GEMLAPQATVGRDEATPSIAMENTPSPPRRTAKITP----TTPKN 128
Query: 2050 PKSESTTTNNPASESI--TSSSPASESTTTSSPASESTTTSSPASE---STTTSSPASES 2104
S + E T S + +TS + T P E S+ T +
Sbjct: 129 NYSPTAAGTERVKEDTPATPSRALNHYISTSGRQRVKSYTPKPRGEVKSSSPTQTREKVR 188
Query: 2105 TTTSSPESESTTTSSPASESTTIEEQGVSPH-----SEKLSANEDPEEFPNEDVFEHT 2157
T SP + +P++ T G++P SE + + E P++ T
Sbjct: 189 KYTPSPLGRMVNSYAPSTFMTMPRSHGITPRTTVKDSEITATYKMLETNPSKRTAGKT 246
>gnl|CDD|227952 COG5665, NOT5, CCR4-NOT transcriptional regulation complex, NOT5
subunit [Transcription].
Length = 548
Score = 45.4 bits (107), Expect = 4e-04
Identities = 33/197 (16%), Positives = 63/197 (31%), Gaps = 26/197 (13%)
Query: 1971 ESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISP 2030
E + +++++ + + S S+ SS + E SP ++ +S+ TT P
Sbjct: 199 EIQPSSSNNEAPKEGNNQT--SLSSIRSSKKQER----SPKKKAPQRDVSISDRATT--P 250
Query: 2031 ESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSP 2090
+ ++S + ST T T S +S+ + +T S S
Sbjct: 251 IAPGVESASQSISSTPTPVSTDTPLHTVKDDSIKFDNSTLGTPTT----HVSMKKKESEN 306
Query: 2091 ASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEE--------------QGVSPHS 2136
SE S + + + T ++ + E Q +SP
Sbjct: 307 DSEQQLNFPKDSTDEIRKTIQHDVETNAAFQNPLFNDELKWWLASKRYLTQPLQEMSPSM 366
Query: 2137 EKLSANEDPEEFPNEDV 2153
N + DV
Sbjct: 367 VSTLENSLLNCPDSLDV 383
Score = 42.0 bits (98), Expect = 0.004
Identities = 31/182 (17%), Positives = 62/182 (34%), Gaps = 16/182 (8%)
Query: 1909 NPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTS 1968
E + +++++ + + S S+ SS + E + + S +TT
Sbjct: 197 GCEIQPSSSNNEAPKEGNNQT--SLSSIRSSKKQERSPKKKAPQRDVSISD---RATTPI 251
Query: 1969 SPESESTT---TSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSEST 2025
+P ES + +S+P ST T + + S T + VS S SE
Sbjct: 252 APGVESASQSISSTPTPVSTDTPLHTVKDDSIKFDNSTLGTPTTHVSMKKKESENDSEQQ 311
Query: 2026 TTISPESESTTTSSPASESTTTNNPKSESTT------TNNP--ASESITSSSPASESTTT 2077
+S + + T ++ + ++ + SP+ ST
Sbjct: 312 LNFPKDSTDEIRKTIQHDVETNAAFQNPLFNDELKWWLASKRYLTQPLQEMSPSMVSTLE 371
Query: 2078 SS 2079
+S
Sbjct: 372 NS 373
Score = 40.8 bits (95), Expect = 0.010
Identities = 42/228 (18%), Positives = 72/228 (31%), Gaps = 14/228 (6%)
Query: 1872 FTTNNNSESTVVMSTLNSLLSENTTTNSPESES-TTTNNPESESTTTSSPESESTTTS-- 1928
+ NN+ + T+ + +S +E+ NN S S+ SS + E +
Sbjct: 177 YVENNDDPDFIEYDTIYEDMGCEIQPSSSNNEAPKEGNNQTSLSSIRSSKKQERSPKKKA 236
Query: 1929 -----SLVSESTTTSSPESESTT---TSSPESESTTTSSLVSESTTTSSPESESTTTSSP 1980
S+ +TT +P ES + +S+P ST T + + S T ++
Sbjct: 237 PQRDVSISDRATTPIAPGVESASQSISSTPTPVSTDTPLHTVKDDSIKFDNSTLGTPTTH 296
Query: 1981 ESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSP 2040
S S SE +S + + T ++ + E + S
Sbjct: 297 VSMKKKESENDSEQQLNFPKDSTDEIRKTIQHDVETNAAFQNPLFN---DELKWWLASKR 353
Query: 2041 ASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTS 2088
S +T N S S + P S TS
Sbjct: 354 YLTQPLQEMSPSMVSTLENSLLNCPDSLDVDSPICLYTKPLSLPHPTS 401
Score = 35.4 bits (81), Expect = 0.45
Identities = 33/189 (17%), Positives = 62/189 (32%), Gaps = 26/189 (13%)
Query: 1923 ESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPES 1982
E +SS ++ S S+ SS + E + + S +TT +P
Sbjct: 199 EIQPSSSNNEAPKEGNNQTSLSSIRSSKKQERSPKKKAPQRDVSISD---RATTPIAPGV 255
Query: 1983 ESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTI-SPESESTTT---S 2038
ES + S ++T +P S T T + +T+ +P + + S
Sbjct: 256 ESASQS-----ISSTPTPVSTDTPLH------TVKDDSIKFDNSTLGTPTTHVSMKKKES 304
Query: 2039 SPASESTTTNNPKSESTTTNNPASESITSSSPASESTT------TSSP--ASESTTTSSP 2090
SE S + T+++ + +S ++ SP
Sbjct: 305 ENDSEQQLNFPKDSTDEIRKTIQHDVETNAAFQNPLFNDELKWWLASKRYLTQPLQEMSP 364
Query: 2091 ASESTTTSS 2099
+ ST +S
Sbjct: 365 SMVSTLENS 373
>gnl|CDD|216421 pfam01299, Lamp, Lysosome-associated membrane glycoprotein (Lamp).
Length = 305
Score = 44.7 bits (106), Expect = 4e-04
Identities = 22/108 (20%), Positives = 41/108 (37%), Gaps = 3/108 (2%)
Query: 1984 STTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASE 2043
S T + + T+ + ++ + V+ ST T +P + + +S + T +
Sbjct: 2 SVTELTFSYNLSDTTLFPNATSKGVKTVTSSTDTKAPTNTTYRCVSSTTVPMTNVTVTLH 61
Query: 2044 STTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPA 2091
T S T + + SP + +T + SP SSPA
Sbjct: 62 DVTLQAYLSNGTFSKTETRCEADTPSPTTVATPSPSPTP---VPSSPA 106
Score = 38.6 bits (90), Expect = 0.032
Identities = 25/112 (22%), Positives = 47/112 (41%), Gaps = 7/112 (6%)
Query: 2010 PVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSS 2069
V+E T + + S TT+ P + ++ + ST T P + + + + +T+ +
Sbjct: 2 SVTELTFSYNL---SDTTLFPNA-TSKGVKTVTSSTDTKAPTNTTYRCVSSTTVPMTNVT 57
Query: 2070 PASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPA 2121
T + S T + + T SP + +T + SP + SSPA
Sbjct: 58 VTLHDVTLQAYLSNGTFSKTETRCEADTPSPTTVATPSPSP---TPVPSSPA 106
Score = 38.2 bits (89), Expect = 0.041
Identities = 20/108 (18%), Positives = 41/108 (37%), Gaps = 3/108 (2%)
Query: 2004 STTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASE 2063
S T ++ + T+ + ++ + + ST T +P + + + + T
Sbjct: 2 SVTELTFSYNLSDTTLFPNATSKGVKTVTSSTDTKAPTNTTYRCVSSTTVPMTNVTVTLH 61
Query: 2064 SITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPE 2111
+T + S T + + T SP + +T + SP SSP
Sbjct: 62 DVTLQAYLSNGTFSKTETRCEADTPSPTTVATPSPSPTP---VPSSPA 106
Score = 37.0 bits (86), Expect = 0.10
Identities = 26/109 (23%), Positives = 44/109 (40%), Gaps = 4/109 (3%)
Query: 1933 ESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVS 1992
T + + S TT P + S + V+ ST T +P + + S + T ++
Sbjct: 2 SVTELTFSYNLSDTTLFPNATSKGVKT-VTSSTDTKAPTNTTYRCVSSTTVPMTNVTVTL 60
Query: 1993 ESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPA 2041
T + S T + + T SP + +T + SP + SSPA
Sbjct: 61 HDVTLQAYLSNGTFSKTETRCEADTPSPTTVATPSPSP---TPVPSSPA 106
Score = 35.5 bits (82), Expect = 0.32
Identities = 27/115 (23%), Positives = 50/115 (43%), Gaps = 17/115 (14%)
Query: 1922 SESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPE 1981
+E T + +L S TT P +S T TSS +++ T ++ S+TT
Sbjct: 4 TELTFSYNL---SDTTLFP-----NATSKGV-KTVTSSTDTKAPTNTTYRCVSSTTVPMT 54
Query: 1982 SESTTTSSLVSESTTTSSPESESTT-----TISPVSESTTTSSPVSESTTTISPE 2031
+ + T + ++ ++ S++ T T SP + +T + SP + SP
Sbjct: 55 NVTVTLHDVTLQAYLSNGTFSKTETRCEADTPSPTTVATPSPSP---TPVPSSPA 106
Score = 34.7 bits (80), Expect = 0.56
Identities = 22/114 (19%), Positives = 47/114 (41%), Gaps = 10/114 (8%)
Query: 1899 SPESESTTTN-NPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTT 1957
S + + N + + +S T TSS +++ T ++ S+TT + + T
Sbjct: 2 SVTELTFSYNLSDTTLFPNATSKGV-KTVTSSTDTKAPTNTTYRCVSSTTVPMTNVTVTL 60
Query: 1958 SSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPV 2011
+ ++ ++ S++ T ++ S TT +T + SP + SP
Sbjct: 61 HDVTLQAYLSNGTFSKTETRCEADTPSPTTV-----ATPSPSP---TPVPSSPA 106
Score = 32.8 bits (75), Expect = 1.9
Identities = 18/103 (17%), Positives = 41/103 (39%), Gaps = 1/103 (0%)
Query: 1952 SESTTTSSLVSESTTTSSPESESTTTSSPES-ESTTTSSLVSESTTTSSPESESTTTISP 2010
+E T + +L + ++ T +S ++ T ++ S+TT + + T
Sbjct: 4 TELTFSYNLSDTTLFPNATSKGVKTVTSSTDTKAPTNTTYRCVSSTTVPMTNVTVTLHDV 63
Query: 2011 VSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSE 2053
++ ++ S++ T ++ S TT + S S T
Sbjct: 64 TLQAYLSNGTFSKTETRCEADTPSPTTVATPSPSPTPVPSSPA 106
Score = 31.2 bits (71), Expect = 6.4
Identities = 29/120 (24%), Positives = 47/120 (39%), Gaps = 17/120 (14%)
Query: 1902 SESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLV 1961
+E T + N S TT P + S + V+ ST T +P + + S + T ++
Sbjct: 4 TELTFSYNL---SDTTLFPNATSKGVKT-VTSSTDTKAPTNTTYRCVSSTTVPMTNVTVT 59
Query: 1962 SESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPV 2021
T + S + T S E+ T SP + +T + SP + SSP
Sbjct: 60 LHDVTLQAYLS-NGTFSKTETRC---------EADTPSPTTVATPSPSP---TPVPSSPA 106
>gnl|CDD|222447 pfam13904, DUF4207, Domain of unknown function (DUF4207). This
family is found in eukaryotes; it has several conserved
tryptophan residues. The function is not known.
Length = 261
Score = 43.9 bits (104), Expect = 5e-04
Identities = 26/78 (33%), Positives = 36/78 (46%), Gaps = 5/78 (6%)
Query: 1921 ESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSP 1980
S S +T SL+S SP S S T EST + S + E S S S P
Sbjct: 1 LSCSDSTRSLLSPLGNELSP-SSSDETEDCSEESTDSWSDMYEGLKDSESSSNSV----P 55
Query: 1981 ESESTTTSSLVSESTTTS 1998
++T+S +S+S+T S
Sbjct: 56 SLSLSSTASSLSDSSTYS 73
Score = 41.6 bits (98), Expect = 0.003
Identities = 25/77 (32%), Positives = 35/77 (45%), Gaps = 5/77 (6%)
Query: 1951 ESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISP 2010
S S +T SL+S SP S S T EST + S + E S S S P
Sbjct: 1 LSCSDSTRSLLSPLGNELSP-SSSDETEDCSEESTDSWSDMYEGLKDSESSSNSV----P 55
Query: 2011 VSESTTTSSPVSESTTT 2027
++T+S +S+S+T
Sbjct: 56 SLSLSSTASSLSDSSTY 72
Score = 37.0 bits (86), Expect = 0.097
Identities = 26/79 (32%), Positives = 37/79 (46%), Gaps = 5/79 (6%)
Query: 1885 STLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESES 1944
+ SLLS SP S S T + EST + S E SES++ S P
Sbjct: 5 DSTRSLLSPLGNELSP-SSSDETEDCSEESTDSWSDMYEG----LKDSESSSNSVPSLSL 59
Query: 1945 TTTSSPESESTTTSSLVSE 1963
++T+S S+S+T S + E
Sbjct: 60 SSTASSLSDSSTYSRSLKE 78
Score = 35.9 bits (83), Expect = 0.19
Identities = 23/77 (29%), Positives = 31/77 (40%), Gaps = 5/77 (6%)
Query: 1971 ESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISP 2030
S S +T S S S S S T EST + S + E S S S P
Sbjct: 1 LSCSDSTRSLLSPLGNELS-PSSSDETEDCSEESTDSWSDMYEGLKDSESSSNSV----P 55
Query: 2031 ESESTTTSSPASESTTT 2047
++T+S S+S+T
Sbjct: 56 SLSLSSTASSLSDSSTY 72
Score = 35.1 bits (81), Expect = 0.37
Identities = 24/77 (31%), Positives = 37/77 (48%), Gaps = 5/77 (6%)
Query: 2022 SESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPA 2081
S S +T S S SP+S S T + EST + + E + S +S S P+
Sbjct: 2 SCSDSTRSLLSPLGNELSPSS-SDETEDCSEESTDSWSDMYEGLKDSESSSNSV----PS 56
Query: 2082 SESTTTSSPASESTTTS 2098
++T+S S+S+T S
Sbjct: 57 LSLSSTASSLSDSSTYS 73
Score = 33.5 bits (77), Expect = 1.2
Identities = 24/78 (30%), Positives = 35/78 (44%), Gaps = 5/78 (6%)
Query: 2031 ESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSP 2090
S S +T S S +P S S T + + ES S S E S +S S P
Sbjct: 1 LSCSDSTRSLLSPLGNELSP-SSSDETEDCSEESTDSWSDMYEGLKDSESSSNSV----P 55
Query: 2091 ASESTTTSSPASESTTTS 2108
+ ++T+S S+S+T S
Sbjct: 56 SLSLSSTASSLSDSSTYS 73
Score = 32.4 bits (74), Expect = 2.2
Identities = 22/78 (28%), Positives = 32/78 (41%), Gaps = 5/78 (6%)
Query: 2001 ESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNP 2060
S S +T S +S SP S T EST + S E + S S P
Sbjct: 1 LSCSDSTRSLLSPLGNELSPSSSDETE-DCSEESTDSWSDMYEGLKDSESSSNSV----P 55
Query: 2061 ASESITSSSPASESTTTS 2078
+ +++S S+S+T S
Sbjct: 56 SLSLSSTASSLSDSSTYS 73
Score = 32.0 bits (73), Expect = 3.1
Identities = 27/97 (27%), Positives = 41/97 (42%), Gaps = 16/97 (16%)
Query: 2062 SESITSSSPASESTTTSSPASESTTTSSPASESTT------TSSPASESTTTSSPESEST 2115
S S ++ S S SP+S S T + EST SES++ S P +
Sbjct: 2 SCSDSTRSLLSPLGNELSPSS-SDETEDCSEESTDSWSDMYEGLKDSESSSNSVPSLSLS 60
Query: 2116 TTSSPASESTT---------IEEQGVSPHSEKLSANE 2143
+T+S S+S+T +E Q + LSA +
Sbjct: 61 STASSLSDSSTYSRSLKEVKLERQAQEAYENWLSAKQ 97
Score = 32.0 bits (73), Expect = 3.3
Identities = 26/83 (31%), Positives = 33/83 (39%), Gaps = 4/83 (4%)
Query: 1901 ESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSL 1960
S S +T + S SP S S T EST + S E S S S + SL
Sbjct: 1 LSCSDSTRSLLSPLGNELSP-SSSDETEDCSEESTDSWSDMYEGLKDSESSSNSVPSLSL 59
Query: 1961 VSESTTTSSPESESTTTSSPESE 1983
S+T SS ST + S +
Sbjct: 60 ---SSTASSLSDSSTYSRSLKEV 79
>gnl|CDD|177546 PHA03151, PHA03151, hypothetical protein; Provisional.
Length = 259
Score = 44.0 bits (103), Expect = 6e-04
Identities = 32/198 (16%), Positives = 63/198 (31%), Gaps = 16/198 (8%)
Query: 1901 ESESTTTNNPESESTTTSSPESESTTTSSLVSESTT---------TSSPESESTTTSSPE 1951
E +ST + N ++ES++ +++ S V ST T + + S+ +
Sbjct: 42 EDDSTPSENTKAESSSIDEDGLLTSSGSDSVFNSTDYESTPEPSKTPGFSDSNVSDSNND 101
Query: 1952 SESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPV 2011
+ S+ SS T+ S + E+ SS + S + TI
Sbjct: 102 KDFDFKPQDEDTSSDDSSAPDFITSLVSSDCEARGLSSSEEDGEPYSKQKMSQPLTIDAK 161
Query: 2012 SESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPA 2071
+E T+ + E + K + + + + P
Sbjct: 162 TEEITSEEDCCVQEDSSDSEEDVVEAFIRQRAQMAGKKKKGKRSIST-------SDDEPP 214
Query: 2072 SESTTTSSPASESTTTSS 2089
+S S++T S
Sbjct: 215 RKSRRKRHSHRISSSTDS 232
Score = 41.3 bits (96), Expect = 0.003
Identities = 33/199 (16%), Positives = 69/199 (34%), Gaps = 6/199 (3%)
Query: 1920 PESESTTTSSLVSESTTTSSPESESTTTSSPES--ESTTTSSLVSESTTTSSPESESTTT 1977
P E +T S +++ ++S E T+S +S ST S E + T + +
Sbjct: 39 PTDEDDSTPSENTKAESSSIDEDGLLTSSGSDSVFNSTDYEST-PEPSKTPGFSDSNVSD 97
Query: 1978 SSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTT 2037
S+ + + S+ SS T+ +S E+ SS + + T
Sbjct: 98 SNNDKDFDFKPQDEDTSSDDSSAPDFITSLVSSDCEARGLSSSEEDGEPYSKQKMSQPLT 157
Query: 2038 SSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSS---PASES 2094
+E T+ +++ + + + + + ++S P +S
Sbjct: 158 IDAKTEEITSEEDCCVQEDSSDSEEDVVEAFIRQRAQMAGKKKKGKRSISTSDDEPPRKS 217
Query: 2095 TTTSSPASESTTTSSPESE 2113
S++T S + E
Sbjct: 218 RRKRHSHRISSSTDSDDEE 236
Score = 40.9 bits (95), Expect = 0.005
Identities = 36/217 (16%), Positives = 73/217 (33%), Gaps = 13/217 (5%)
Query: 1919 SPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTS 1978
+P + T ++ ++ E+ +SS + + TSS ++ ES +
Sbjct: 27 APREKLTNVFKFPTDEDDSTPSENTKAESSSIDEDGLLTSSGSDSVFNSTDYESTPEPSK 86
Query: 1979 SPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTS 2038
+P + S ++ P+ E T S+ SS T+ +S + E+ S
Sbjct: 87 TPGFSDSNVSDSNNDKDFDFKPQDEDT--------SSDDSSAPDFITSLVSSDCEARGLS 138
Query: 2039 SPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTS 2098
S + + K T + +E ITS +S +
Sbjct: 139 SSEEDGEPYSKQKMSQPLTIDAKTEEITSEEDCCVQEDSSDSEEDVVEAFIRQRAQMAGK 198
Query: 2099 SPASESTTTSSPE-----SESTTTSSPASESTTIEEQ 2130
+ + ++S + S S S ST +++
Sbjct: 199 KKKGKRSISTSDDEPPRKSRRKRHSHRISSSTDSDDE 235
Score = 39.4 bits (91), Expect = 0.016
Identities = 30/211 (14%), Positives = 66/211 (31%), Gaps = 13/211 (6%)
Query: 1875 NNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSES 1934
++++ S + +S+ + T+S + + ES + +P + S ++
Sbjct: 43 DDSTPSENTKAESSSIDEDGLLTSSGSDSVFNSTDYESTPEPSKTPGFSDSNVSDSNNDK 102
Query: 1935 TTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSES 1994
P+ E T++ + TS S + E+ SS E + S
Sbjct: 103 DFDFKPQDEDTSSDDSSAPDFITS--------LVSSDCEARGLSSSEEDGEPYSKQKMSQ 154
Query: 1995 TTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNP---- 2050
T ++E T+ +S + + + + + +
Sbjct: 155 PLTIDAKTEEITSEEDCCVQEDSSDSEEDVVEAFIRQRAQMAGKKKKGKRSISTSDDEPP 214
Query: 2051 -KSESTTTNNPASESITSSSPASESTTTSSP 2080
KS ++ S S S T +P
Sbjct: 215 RKSRRKRHSHRISSSTDSDDEEPRHKMTGTP 245
Score = 35.9 bits (82), Expect = 0.22
Identities = 32/149 (21%), Positives = 57/149 (38%), Gaps = 4/149 (2%)
Query: 2010 PVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSS 2069
P E +T S +++ ++ S + + TSS + + + +S + P S
Sbjct: 39 PTDEDDSTPSENTKAESS-SIDEDGLLTSSGSDSVFNSTDYESTPEPSKTPGFSDSNVSD 97
Query: 2070 PASESTTTSSPASE--STTTSSPASESTTTSSPASESTTTSSPESESTTTS-SPASESTT 2126
++ P E S+ SS T+ S E+ SS E + S S+ T
Sbjct: 98 SNNDKDFDFKPQDEDTSSDDSSAPDFITSLVSSDCEARGLSSSEEDGEPYSKQKMSQPLT 157
Query: 2127 IEEQGVSPHSEKLSANEDPEEFPNEDVFE 2155
I+ + SE+ ++ EDV E
Sbjct: 158 IDAKTEEITSEEDCCVQEDSSDSEEDVVE 186
>gnl|CDD|165021 PHA02638, PHA02638, CC chemokine receptor-like protein; Provisional.
Length = 417
Score = 44.6 bits (105), Expect = 6e-04
Identities = 24/76 (31%), Positives = 40/76 (52%), Gaps = 5/76 (6%)
Query: 1921 ESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSP 1980
++ STT SS++ S+T S TT + E+ + S++S T E ++ + SP
Sbjct: 2 DNSSTTLSSIILSSSTLSP-----TTFFTIETSMDESKSIISTFTEIIPTEIPTSESPSP 56
Query: 1981 ESESTTTSSLVSESTT 1996
S S+++SS S S T
Sbjct: 57 NSNSSSSSSSSSSSIT 72
Score = 44.2 bits (104), Expect = 6e-04
Identities = 26/86 (30%), Positives = 44/86 (51%), Gaps = 5/86 (5%)
Query: 1981 ESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSP 2040
++ STT SS++ S+T S + T + + ES + S +E T P SE + SP
Sbjct: 2 DNSSTTLSSIILSSSTLSP--TTFFTIETSMDESKSIISTFTEIIPTEIPTSE---SPSP 56
Query: 2041 ASESTTTNNPKSESTTTNNPASESIT 2066
S S+++++ S S T + +IT
Sbjct: 57 NSNSSSSSSSSSSSITYDYEYENNIT 82
Score = 44.2 bits (104), Expect = 6e-04
Identities = 28/79 (35%), Positives = 39/79 (49%), Gaps = 11/79 (13%)
Query: 1971 ESESTTTSSPESESTTTSSLVSESTTTSSPES---ESTTTISPVSESTTTSSPVSESTTT 2027
++ STT SS SS TT + E+ ES + IS +E T P SES +
Sbjct: 2 DNSSTTLSS-----IILSSSTLSPTTFFTIETSMDESKSIISTFTEIIPTEIPTSESPS- 55
Query: 2028 ISPESESTTTSSPASESTT 2046
P S S+++SS +S S T
Sbjct: 56 --PNSNSSSSSSSSSSSIT 72
Score = 43.1 bits (101), Expect = 0.002
Identities = 24/79 (30%), Positives = 35/79 (44%), Gaps = 12/79 (15%)
Query: 1911 ESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSP 1970
++ STT SS SS TT + E+ ES + S +E T P
Sbjct: 2 DNSSTTLSS-----IILSSSTLSPTTFFTIETSM-------DESKSIISTFTEIIPTEIP 49
Query: 1971 ESESTTTSSPESESTTTSS 1989
SES + +S S S+++SS
Sbjct: 50 TSESPSPNSNSSSSSSSSS 68
Score = 42.3 bits (99), Expect = 0.003
Identities = 25/68 (36%), Positives = 38/68 (55%), Gaps = 1/68 (1%)
Query: 2004 STTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASE 2063
S+TT+S + S++T SP + T S ES + S +E T P SES + N+ +S
Sbjct: 4 SSTTLSSIILSSSTLSPTTFFTIETS-MDESKSIISTFTEIIPTEIPTSESPSPNSNSSS 62
Query: 2064 SITSSSPA 2071
S +SSS +
Sbjct: 63 SSSSSSSS 70
Score = 41.5 bits (97), Expect = 0.004
Identities = 23/68 (33%), Positives = 40/68 (58%), Gaps = 1/68 (1%)
Query: 2014 STTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASE 2073
S+TT S + S++T+SP + T +S ES + + +E T P SES + +S +S
Sbjct: 4 SSTTLSSIILSSSTLSPTTFFTIETS-MDESKSIISTFTEIIPTEIPTSESPSPNSNSSS 62
Query: 2074 STTTSSPA 2081
S+++SS +
Sbjct: 63 SSSSSSSS 70
Score = 40.8 bits (95), Expect = 0.009
Identities = 27/86 (31%), Positives = 37/86 (43%), Gaps = 15/86 (17%)
Query: 1941 ESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSP 2000
++ STT SS SS TT + E+ ES + S +E T P
Sbjct: 2 DNSSTTLSS-----IILSSSTLSPTTFFTIETSM-------DESKSIISTFTEIIPTEIP 49
Query: 2001 ESESTTTISPVSESTTTSSPVSESTT 2026
SES + P S S+++SS S S T
Sbjct: 50 TSESPS---PNSNSSSSSSSSSSSIT 72
Score = 40.4 bits (94), Expect = 0.010
Identities = 22/68 (32%), Positives = 39/68 (57%), Gaps = 1/68 (1%)
Query: 2024 STTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASE 2083
S+TT+S S++T SP + T + ES + + +E I + P SES + +S +S
Sbjct: 4 SSTTLSSIILSSSTLSPTTFFTIETS-MDESKSIISTFTEIIPTEIPTSESPSPNSNSSS 62
Query: 2084 STTTSSPA 2091
S+++SS +
Sbjct: 63 SSSSSSSS 70
Score = 40.4 bits (94), Expect = 0.012
Identities = 27/86 (31%), Positives = 40/86 (46%), Gaps = 15/86 (17%)
Query: 1951 ESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISP 2010
++ STT SS++ S+T S TT + E+ ES + S +E T P
Sbjct: 2 DNSSTTLSSIILSSSTLSP-----TTFFTIETSM-------DESKSIISTFTEIIPTEIP 49
Query: 2011 VSESTTTSSPVSESTTTISPESESTT 2036
SE + SP S S+++ S S S T
Sbjct: 50 TSE---SPSPNSNSSSSSSSSSSSIT 72
Score = 40.0 bits (93), Expect = 0.015
Identities = 24/66 (36%), Positives = 37/66 (56%), Gaps = 1/66 (1%)
Query: 1934 STTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSE 1993
S+TT S S++T SP + T +S+ ES + S +E T P SES + +S S
Sbjct: 4 SSTTLSSIILSSSTLSPTTFFTIETSM-DESKSIISTFTEIIPTEIPTSESPSPNSNSSS 62
Query: 1994 STTTSS 1999
S+++SS
Sbjct: 63 SSSSSS 68
Score = 39.2 bits (91), Expect = 0.022
Identities = 28/77 (36%), Positives = 38/77 (49%), Gaps = 9/77 (11%)
Query: 2052 SESTTTNNPASESITSSSPASEST--TTSSPASESTTTSSPASESTTTSSPASESTTTSS 2109
+ STT S I SSS S +T T + ES + S +E T P SE + S
Sbjct: 3 NSSTTL----SSIILSSSTLSPTTFFTIETSMDESKSIISTFTEIIPTEIPTSE---SPS 55
Query: 2110 PESESTTTSSPASESTT 2126
P S S+++SS +S S T
Sbjct: 56 PNSNSSSSSSSSSSSIT 72
Score = 38.8 bits (90), Expect = 0.032
Identities = 19/70 (27%), Positives = 31/70 (44%), Gaps = 3/70 (4%)
Query: 2067 SSSPASESTTTSSPASESTTTSSPAS---ESTTTSSPASESTTTSSPESESTTTSSPASE 2123
+SS S SS TT + + ES + S +E T P SES + +S +S
Sbjct: 3 NSSTTLSSIILSSSTLSPTTFFTIETSMDESKSIISTFTEIIPTEIPTSESPSPNSNSSS 62
Query: 2124 STTIEEQGVS 2133
S++ ++
Sbjct: 63 SSSSSSSSIT 72
Score = 38.8 bits (90), Expect = 0.033
Identities = 27/95 (28%), Positives = 45/95 (47%), Gaps = 12/95 (12%)
Query: 1901 ESESTTTNNPESESTTTSSPESESTTTSSLVS---ESTTTSSPESESTTTSSPESESTTT 1957
++ STT + S SS TT ++ + ES + S +E T P SES +
Sbjct: 2 DNSSTTLS-----SIILSSSTLSPTTFFTIETSMDESKSIISTFTEIIPTEIPTSESPSP 56
Query: 1958 SSLVSESTTTSSPESESTTTSSPESESTTTSSLVS 1992
+S ++++SS S S+ T E E+ T L++
Sbjct: 57 NS----NSSSSSSSSSSSITYDYEYENNITYELIN 87
Score = 38.5 bits (89), Expect = 0.047
Identities = 26/91 (28%), Positives = 45/91 (49%), Gaps = 7/91 (7%)
Query: 1875 NNNSESTVVMSTLNSLLSENTTTNSPES---ESTTTNNPESESTTTSSPESESTTTSSLV 1931
+NS +T+ L+S TT + E+ ES + + +E T P SES + +S
Sbjct: 1 MDNSSTTLSSIILSSSTLSPTTFFTIETSMDESKSIISTFTEIIPTEIPTSESPSPNS-- 58
Query: 1932 SESTTTSSPESESTTTSSPESESTTTSSLVS 1962
++++SS S S+ T E E+ T L++
Sbjct: 59 --NSSSSSSSSSSSITYDYEYENNITYELIN 87
Score = 38.1 bits (88), Expect = 0.052
Identities = 18/70 (25%), Positives = 32/70 (45%)
Query: 1897 TNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTT 1956
+S S ++ TT + E+ + S++S T E ++ + SP S S++
Sbjct: 3 NSSTTLSSIILSSSTLSPTTFFTIETSMDESKSIISTFTEIIPTEIPTSESPSPNSNSSS 62
Query: 1957 TSSLVSESTT 1966
+SS S S T
Sbjct: 63 SSSSSSSSIT 72
Score = 37.7 bits (87), Expect = 0.083
Identities = 26/76 (34%), Positives = 40/76 (52%), Gaps = 5/76 (6%)
Query: 2031 ESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSP 2090
++ STT SS S+T + + T+ S+SI S+ +E T P SE + SP
Sbjct: 2 DNSSTTLSSIILSSSTLSPTTFFTIETSMDESKSIIST--FTEIIPTEIPTSE---SPSP 56
Query: 2091 ASESTTTSSPASESTT 2106
S S+++SS +S S T
Sbjct: 57 NSNSSSSSSSSSSSIT 72
Score = 37.3 bits (86), Expect = 0.11
Identities = 21/66 (31%), Positives = 36/66 (54%), Gaps = 1/66 (1%)
Query: 2044 STTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASE 2103
S+TT + S++T +P + T + ES + S +E T P SES + +S +S
Sbjct: 4 SSTTLSSIILSSSTLSPTTFF-TIETSMDESKSIISTFTEIIPTEIPTSESPSPNSNSSS 62
Query: 2104 STTTSS 2109
S+++SS
Sbjct: 63 SSSSSS 68
Score = 36.1 bits (83), Expect = 0.20
Identities = 25/82 (30%), Positives = 36/82 (43%), Gaps = 1/82 (1%)
Query: 2074 STTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVS 2133
S+TT S S++T SP + T +S ES + S +E T P SES + S
Sbjct: 4 SSTTLSSIILSSSTLSPTTFFTIETS-MDESKSIISTFTEIIPTEIPTSESPSPNSNSSS 62
Query: 2134 PHSEKLSANEDPEEFPNEDVFE 2155
S S+ E+ N +E
Sbjct: 63 SSSSSSSSITYDYEYENNITYE 84
Score = 32.7 bits (74), Expect = 2.8
Identities = 18/64 (28%), Positives = 32/64 (50%), Gaps = 3/64 (4%)
Query: 1873 TTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVS 1932
++++ S T+ + + E+ + S +E T P SE + SP S S+++SS S
Sbjct: 12 ILSSSTLSPTTFFTIETSMDESKSIISTFTEIIPTEIPTSE---SPSPNSNSSSSSSSSS 68
Query: 1933 ESTT 1936
S T
Sbjct: 69 SSIT 72
>gnl|CDD|236333 PRK08691, PRK08691, DNA polymerase III subunits gamma and tau;
Validated.
Length = 709
Score = 44.7 bits (105), Expect = 6e-04
Identities = 45/200 (22%), Positives = 68/200 (34%), Gaps = 27/200 (13%)
Query: 1993 ESTTTSSPESESTTTI-SPVSESTTTSSPVS-ESTTTISPESESTTTSSPASESTTTNNP 2050
+S + + E E+ P E+ T +PV S + E + T+ P S + P
Sbjct: 378 QSPSAQTAEKETAAKKPQPRPEAETAQTPVQTASAAAMPSEGK---TAGPVSNQENNDVP 434
Query: 2051 -----KSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASEST 2105
E+ T A S S ASE+ T S ++ S SE+
Sbjct: 435 PWEDAPDEAQTAAGTAQTSAKSIQTASEA-ETPPENQVSKNKAADNETDAPLSEVPSENP 493
Query: 2106 TTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSANEDPEEFPNEDVFEHTFAEIPNID 2165
++P E+ T + A E+ G FP+ D AEIP D
Sbjct: 494 IQATPNDEAVETETFAHEAPAEPFYGYG--------------FPDNDCPPEDGAEIPPPD 539
Query: 2166 --HSNQTDEAIPETFDAREE 2183
H+ D A + E
Sbjct: 540 WEHAAPADTAGGGADEEAEA 559
>gnl|CDD|240323 PTZ00233, PTZ00233, variable surface protein Vir18; Provisional.
Length = 509
Score = 44.2 bits (104), Expect = 8e-04
Identities = 58/329 (17%), Positives = 111/329 (33%), Gaps = 54/329 (16%)
Query: 1894 NTTTNSPESESTTTNN-PESESTTTSSPES-ESTTTSSLVSESTTTSS---PESESTTTS 1948
P + TT + T +P+ ++ + S L + S + + + +
Sbjct: 104 FPAKKPPLIKPTTQEPCKGGKGCKTETPQRVDTKSQSKLRPVPSKAKSLEIKDPQEQSQN 163
Query: 1949 SPESESTTTSSLV--SESTTTSSPESESTTTSSPES---ESTTTSSLVSESTT--TSSPE 2001
+++ + S+V +S + SP S T P+S +TTS + T +S +
Sbjct: 164 QADAQESNKESVVLQPQSDSMPSPSSIGTEDKEPQSIVNHHSTTSGMGETQTQQLNASGD 223
Query: 2002 SESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPAS-----ESTTTNNPKSESTT 2056
S S + + ES T + E+ TS + ++ N+P+++ +
Sbjct: 224 SPIRELDSSAGDPPSECVSGKESDLTCTSTGENLDTSLFQTNLSSGKTLDANHPETQDSA 283
Query: 2057 TNNPASE---------------SITSSSPASE--------STTTSSPAS-------ESTT 2086
N + S +P S T T ++ E+T
Sbjct: 284 GNVIEVQTHGDKDIITEAADNLSSLEGTPGSVQLADEDSVDTDTDRGSTGAVASDPENTG 343
Query: 2087 TSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVSPH-SEKL------ 2139
T +SES ++ S + S + ES +E H SE +
Sbjct: 344 TEETSSESLVSAPSGDVSNGGITEVDISNDDKAVDGESNGVEISHDQEHDSETICNESTC 403
Query: 2140 SANEDPEEFPNEDVFEHTFAEIPNIDHSN 2168
++ E + A+I N+ SN
Sbjct: 404 REEQNGELTDDGGDKLDILAQIFNVIQSN 432
Score = 34.9 bits (80), Expect = 0.52
Identities = 39/211 (18%), Positives = 75/211 (35%), Gaps = 15/211 (7%)
Query: 1865 DNYSEIIFTTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESES 1924
D SE + ++ T L++ L + TN ++ N+PE++ + + E ++
Sbjct: 235 DPPSECVSGKESDLTCTSTGENLDTSLFQ---TNLSSGKTLDANHPETQDSAGNVIEVQT 291
Query: 1925 TTTSSLVSE-STTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESE 1983
+++E + SS E + + +S T T ST + + E
Sbjct: 292 HGDKDIITEAADNLSSLEGTPGSVQLADEDSV--------DTDTDRG---STGAVASDPE 340
Query: 1984 STTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASE 2043
+T T SES ++ S I+ V S + ES + + + +E
Sbjct: 341 NTGTEETSSESLVSAPSGDVSNGGITEVDISNDDKAVDGESNGVEISHDQEHDSETICNE 400
Query: 2044 STTTNNPKSESTTTNNPASESITSSSPASES 2074
ST E T + + +S
Sbjct: 401 STCREEQNGELTDDGGDKLDILAQIFNVIQS 431
>gnl|CDD|240381 PTZ00364, PTZ00364, dipeptidyl-peptidase I precursor; Provisional.
Length = 548
Score = 44.1 bits (104), Expect = 8e-04
Identities = 21/51 (41%), Positives = 27/51 (52%), Gaps = 2/51 (3%)
Query: 2357 HSVKIIGWGKSSQNEPYWLCTNSY--NQGWGEQGLFKIRRGVNMCSIEDSV 2405
H+V IIGWG YWL + + + W + G KI RGVN +IE V
Sbjct: 404 HTVLIIGWGTDENGGDYWLVLDPWGSRRSWCDGGTRKIARGVNAYNIESEV 454
>gnl|CDD|224343 COG1426, COG1426, Predicted transcriptional regulator contains
Xre-like HTH domain [Function unknown].
Length = 284
Score = 43.2 bits (102), Expect = 9e-04
Identities = 31/168 (18%), Positives = 58/168 (34%), Gaps = 21/168 (12%)
Query: 1980 PESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSS 2039
P + + + VS+++ S + STT SE TT++SP S +T S T +
Sbjct: 134 PPTLPDQSVASVSQNSQDVSLATSSTTP----SEGTTSASPSSATT--------SFTPTV 181
Query: 2040 PASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSS 2099
A K + A + +S++PA++ T A T+ + S
Sbjct: 182 TAIAPVVAPTAKPVTVPKQPAADLAASSTAPAAKEMATGQEAVP---TAGSGVTTVAGKS 238
Query: 2100 PASESTTTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSANEDPEE 2147
A T+ + + G++ + L+
Sbjct: 239 AALVINFTAD------CWIEVTDANGKVLFSGLTKKGDSLTLTGKAPY 280
Score = 42.4 bits (100), Expect = 0.002
Identities = 27/132 (20%), Positives = 49/132 (37%), Gaps = 17/132 (12%)
Query: 1970 PESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTIS 2029
P + + + S+++ SL + STT S E TT+ SP S +T+ + V+ ++
Sbjct: 134 PPTLPDQSVASVSQNSQDVSLATSSTTPS----EGTTSASPSSATTSFTPTVTAIAPVVA 189
Query: 2030 P-ESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTS 2088
P T PA++ + ST + + P + S +
Sbjct: 190 PTAKPVTVPKQPAADLAAS------STAPAAKEMATGQEAVPTAGS------GVTTVAGK 237
Query: 2089 SPASESTTTSSP 2100
S A T+
Sbjct: 238 SAALVINFTADC 249
Score = 35.9 bits (83), Expect = 0.19
Identities = 19/75 (25%), Positives = 38/75 (50%), Gaps = 4/75 (5%)
Query: 2059 NPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTS 2118
P + S + S+++ S A+ STT S E TT++SP+S +T+ + +
Sbjct: 133 EPPTLPDQSVASVSQNSQDVSLATSSTTPS----EGTTSASPSSATTSFTPTVTAIAPVV 188
Query: 2119 SPASESTTIEEQGVS 2133
+P ++ T+ +Q +
Sbjct: 189 APTAKPVTVPKQPAA 203
>gnl|CDD|219927 pfam08601, PAP1, Transcription factor PAP1. The transcription factor
Pap1 regulates antioxidant-gene transcription in response
to H2O2. This region is cysteine rich. Alkylation of
cysteine residues following treatment with a cysteine
alkylating agent can mask the accessibility of the
nuclear exporter Crm1, triggering nuclear accumulation
and Pap1 dependent transcriptional expression.
Length = 344
Score = 43.8 bits (103), Expect = 0.001
Identities = 28/164 (17%), Positives = 56/164 (34%), Gaps = 8/164 (4%)
Query: 1999 SPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASEST-TTNNPKSESTTT 2057
+ T + +S VS + +ES +S + + T+ + + + +
Sbjct: 31 GKLPGACGTKNCPIPKLAKNSSVSSPVPGLLNSTESNVSSPNNNPNGYTSPSSAAMNNKS 90
Query: 2058 NNPASESITSSSPASESTTTSSP------ASESTTTSSPASESTTTSSPASESTTTSSPE 2111
NN A + ++S AS ++ S ++S S + + S+ +SPE
Sbjct: 91 NNRAVDPSANASAASTNSPNGLQSSATQYNSNDNSSSDSPSSGSDGFTNQLLSSLGTSPE 150
Query: 2112 SESTTTSSPASEST-TIEEQGVSPHSEKLSANEDPEEFPNEDVF 2154
+ + AS + +S SA P D
Sbjct: 151 PSTESPPQLASVNNFAAIRNNAESNSNVPSAASSTPNIPGIDFL 194
Score = 38.4 bits (89), Expect = 0.039
Identities = 31/159 (19%), Positives = 60/159 (37%), Gaps = 4/159 (2%)
Query: 1906 TTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSEST 1965
T N P + SS S + S + SSP + +SP S + S
Sbjct: 39 TKNCPIPKLAKNSSVSSPVPGLLN--STESNVSSPNNNPNGYTSPSSAAMNNKSNNR--A 94
Query: 1966 TTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSEST 2025
S + + +T+SP ++ + S ++S S + + S+ +SP +
Sbjct: 95 VDPSANASAASTNSPNGLQSSATQYNSNDNSSSDSPSSGSDGFTNQLLSSLGTSPEPSTE 154
Query: 2026 TTISPESESTTTSSPASESTTTNNPKSESTTTNNPASES 2064
+ S + + + + +N P + S+T N P +
Sbjct: 155 SPPQLASVNNFAAIRNNAESNSNVPSAASSTPNIPGIDF 193
Score = 38.4 bits (89), Expect = 0.042
Identities = 37/191 (19%), Positives = 74/191 (38%), Gaps = 10/191 (5%)
Query: 1894 NTTTNSPESESTTTNNPESE---STTTSSPESESTTTSSLVSESTTTSSPESESTTTSSP 1950
N N S S NN P + T + + +S +S
Sbjct: 5 NQLHNDCSSMSNFNNNNFDFDFPKFCGKLPGACGTKNCPIPKLAKNSSVSSPVPGLLNST 64
Query: 1951 ESESTTTSSLVSESTTTSS------PESESTTTSSPESESTTTSSLVSESTTTSSPESES 2004
ES ++ ++ + T+ SS + + S+ S ++T S +S+ T +++
Sbjct: 65 ESNVSSPNNNPNGYTSPSSAAMNNKSNNRAVDPSANASAASTNSPNGLQSSATQYNSNDN 124
Query: 2005 TTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASES 2064
+++ SP S S ++ + S T SPE + + AS + + + +N P++ S
Sbjct: 125 SSSDSPSSGSDGFTNQLLSSLGT-SPEPSTESPPQLASVNNFAAIRNNAESNSNVPSAAS 183
Query: 2065 ITSSSPASEST 2075
T + P +
Sbjct: 184 STPNIPGIDFL 194
Score = 37.6 bits (87), Expect = 0.076
Identities = 23/166 (13%), Positives = 64/166 (38%), Gaps = 5/166 (3%)
Query: 1950 PESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTIS 2009
P + T + + +S +S ES ++ ++ + T+ S + + + +
Sbjct: 34 PGACGTKNCPIPKLAKNSSVSSPVPGLLNSTESNVSSPNNNPNGYTS-PSSAAMNNKSNN 92
Query: 2010 PVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSS 2069
+ + +S S + SP ++ + S ++++ S + S +S
Sbjct: 93 RAVDPSANASAASTN----SPNGLQSSATQYNSNDNSSSDSPSSGSDGFTNQLLSSLGTS 148
Query: 2070 PASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESEST 2115
P + + AS + + + + ++ P++ S+T + P +
Sbjct: 149 PEPSTESPPQLASVNNFAAIRNNAESNSNVPSAASSTPNIPGIDFL 194
Score = 37.2 bits (86), Expect = 0.088
Identities = 30/151 (19%), Positives = 62/151 (41%), Gaps = 11/151 (7%)
Query: 2020 PVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSS 2079
P + T + +S + N+ +S ++ NN + TS S A+ + +++
Sbjct: 34 PGACGTKNCPIPKLAKNSSVSSPVPGLLNSTESNVSSPNNNPN-GYTSPSSAAMNNKSNN 92
Query: 2080 PASESTTTSSPASESTTTSS--PASESTTTSSPESESTTTSSPASESTTIEEQGVSPHSE 2137
A + + +S AS ++ A++ + + S+S ++ S + + G SP
Sbjct: 93 RAVDPSANASAASTNSPNGLQSSATQYNSNDNSSSDSPSSGSDGFTNQLLSSLGTSP--- 149
Query: 2138 KLSANEDPEEFPNEDVFEHTFAEIPNIDHSN 2168
+ E P + + + FA I N SN
Sbjct: 150 -EPSTESPPQLASVN----NFAAIRNNAESN 175
Score = 34.9 bits (80), Expect = 0.55
Identities = 19/114 (16%), Positives = 49/114 (42%)
Query: 1873 TTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVS 1932
N N ++ + +N+ + S + + +TN+P ++ + S ++S S
Sbjct: 72 NNNPNGYTSPSSAAMNNKSNNRAVDPSANASAASTNSPNGLQSSATQYNSNDNSSSDSPS 131
Query: 1933 ESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTT 1986
+ + + S+ +SPE + + L S + + + + ++ P + S+T
Sbjct: 132 SGSDGFTNQLLSSLGTSPEPSTESPPQLASVNNFAAIRNNAESNSNVPSAASST 185
Score = 34.1 bits (78), Expect = 0.80
Identities = 24/140 (17%), Positives = 54/140 (38%), Gaps = 7/140 (5%)
Query: 1876 NNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSEST 1935
N++ES V N T+ S + + +NN + + +S S ++ S +
Sbjct: 62 NSTESNVSSPNNNPNGY---TSPSSAAMNNKSNNRAVDPSANASAASTNSPNGLQSSATQ 118
Query: 1936 TTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSEST 1995
S+ ++S S + + S+ +SPE + + S + + + +
Sbjct: 119 YNSN----DNSSSDSPSSGSDGFTNQLLSSLGTSPEPSTESPPQLASVNNFAAIRNNAES 174
Query: 1996 TTSSPESESTTTISPVSEST 2015
++ P + S+T P +
Sbjct: 175 NSNVPSAASSTPNIPGIDFL 194
>gnl|CDD|236766 PRK10811, rne, ribonuclease E; Reviewed.
Length = 1068
Score = 44.3 bits (105), Expect = 0.001
Identities = 20/185 (10%), Positives = 46/185 (24%), Gaps = 7/185 (3%)
Query: 1940 PESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSS 1999
+ E+E +V+E ++ E + +V
Sbjct: 849 RPQDVQVEEQREAEEVQVQPVVAEVPVAAAVEPVVSAPVVEAVAEVVEEPVVVAEPQPEE 908
Query: 2000 PESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPA-SESTTTNNPKSESTTTN 2058
TT ++ PV+E I+ + +E ++
Sbjct: 909 VVVVETTHPEVIAA------PVTEQPQVITESDVAVAQEVAEHAEPVVEPQDETADIEEA 962
Query: 2059 NPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTS 2118
+E + + A + + + + T + + T
Sbjct: 963 AETAEVVVAEPEVVAQPAAPVVAEVAAEVETVTAVEPEVAPAQVPEATVEHNHATAPMTR 1022
Query: 2119 SPASE 2123
+PA E
Sbjct: 1023 APAPE 1027
>gnl|CDD|144411 pfam00802, Glycoprotein_G, Pneumovirus attachment glycoprotein G.
This family includes attachment proteins from respiratory
synctial virus. Glycoprotein G has not been shown to have
any neuraminidase or hemagglutinin activity. The amino
terminus is thought to be cytoplasmic, and the carboxyl
terminus extracellular. The extracellular region contains
four completely conserved cysteine residues.
Length = 263
Score = 43.1 bits (101), Expect = 0.001
Identities = 40/210 (19%), Positives = 80/210 (38%), Gaps = 9/210 (4%)
Query: 1910 PESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSS 1969
P + T + + ++ T++ L + ++SP ++STTT + T+ + ++
Sbjct: 62 PTTTPTQQITNQIQNHTSTYLTQHNQLSTSPSNQSTTTPLIHTILDDTTPGTKSTYQHTT 121
Query: 1970 PESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISP-VSESTTTSSPVSESTTTI 2028
++ TT+ ++ T +S P+ + + V S ++P S
Sbjct: 122 VGTKGRTTTPAQTNKPPTKP--RQSNPPEKPQDDFHFEVFNFVPCSICENNPACLSICKR 179
Query: 2029 SPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTS 2088
PE P TT K + TT T S A+ TS P +T T+
Sbjct: 180 IPE------KKPGKAPTTKPTKKPKPKTTKKDTKTQTTKSKEATTHHPTSEPTKLTTKTN 233
Query: 2089 SPASESTTTSSPASESTTTSSPESESTTTS 2118
+ + T S+ + + +S +T+
Sbjct: 234 TTTPQFTPLSTTTTRNPELTSQMETFHSTN 263
Score = 42.4 bits (99), Expect = 0.002
Identities = 39/206 (18%), Positives = 75/206 (36%), Gaps = 24/206 (11%)
Query: 1961 VSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSP 2020
+S + +P + T + + ++ T++ L + ++SP ++STTT + T+
Sbjct: 53 ISSANHKVTPTTTPTQQITNQIQNHTSTYLTQHNQLSTSPSNQSTTTPLIHTILDDTTPG 112
Query: 2021 VSESTTTISPESESTTTSSPASESTTTNNPKSE-------------------STTTNNPA 2061
+ + ++ TT+ + T +S S NNPA
Sbjct: 113 TKSTYQHTTVGTKGRTTTPAQTNKPPTKPRQSNPPEKPQDDFHFEVFNFVPCSICENNPA 172
Query: 2062 SESI----TSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTT 2117
SI P TT + + TT TT S A+ + +S ++ TT
Sbjct: 173 CLSICKRIPEKKPGKAPTTKPTKKPKPKTTKKDTKTQTTKSKEAT-THHPTSEPTKLTTK 231
Query: 2118 SSPASESTTIEEQGVSPHSEKLSANE 2143
++ + T + + E S E
Sbjct: 232 TNTTTPQFTPLSTTTTRNPELTSQME 257
Score = 41.2 bits (96), Expect = 0.004
Identities = 42/232 (18%), Positives = 82/232 (35%), Gaps = 11/232 (4%)
Query: 1854 LAATAVAISVIDNYSEIIFTTNNNSESTVVMSTLNSLLS--ENTTTNSPESESTTTNNPE 1911
L+ A+ IS + IIF ++ N + T + + + +N T+ + + +P
Sbjct: 34 LSILAMIISTSLIIAAIIFISSANHKVTPTTTPTQQITNQIQNHTSTYLTQHNQLSTSPS 93
Query: 1912 SESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPE 1971
++STTT + T+ + ++ ++ TT+ ++ T +S P+
Sbjct: 94 NQSTTTPLIHTILDDTTPGTKSTYQHTTVGTKGRTTTPAQTNKPPTKP--RQSNPPEKPQ 151
Query: 1972 SESTTTSSP-----ESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTT 2026
+ E+ + + P TT + + TT TT
Sbjct: 152 DDFHFEVFNFVPCSICENNPACLSICKRIPEKKPGKAPTTKPTKKPKPKTTKKDTKTQTT 211
Query: 2027 TISPESESTTTSSPASESTTTN--NPKSESTTTNNPASESITSSSPASESTT 2076
+ TS P +T TN P+ +T + +TS ST
Sbjct: 212 KSKEATTHHPTSEPTKLTTKTNTTTPQFTPLSTTTTRNPELTSQMETFHSTN 263
>gnl|CDD|219426 pfam07489, Tir_receptor_C, Translocated intimin receptor (Tir)
C-terminus. Intimin and its translocated intimin
receptor (Tir) are bacterial proteins that mediate
adhesion between mammalian cells and attaching and
effacing (A/E) pathogens. A unique and essential feature
of A/E bacterial pathogens is the formation of actin-rich
pedestals beneath the intimately adherent bacteria and
localised destruction of the intestinal brush border. The
bacterial outer membrane adhesin, intimin, is necessary
for the production of the A/E lesion and diarrhoea. The
A/E bacteria translocate their own receptor for intimin,
Tir, into the membrane of mammalian cells using the type
III secretion system. The translocated Tir triggers
additional host signalling events and actin nucleation,
which are essential for lesion formation. This family
represents the Tir C-terminal domain which has been
reported to bind uninfected host cells and beta-1
integrins although the role of intimin binding to
integrins is unclear. This intimin C-terminal domain has
also been shown to be sufficient for Tir recognition.
Length = 222
Score = 42.6 bits (100), Expect = 0.001
Identities = 23/102 (22%), Positives = 46/102 (45%), Gaps = 11/102 (10%)
Query: 2070 PASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPE-------SESTTTSSPAS 2122
P ++TTT++ +TTT + ++PA +T TS E S +T+S
Sbjct: 56 PVEQTTTTTT---TTTTTHTTVENKPANNTPAQGNTDTSGAEETASSRRSSQASTASTTW 112
Query: 2123 ESTTIEEQGVSPHSE-KLSANEDPEEFPNEDVFEHTFAEIPN 2163
T+ + +P+++ +S N+ E +++ A+ P
Sbjct: 113 SDTSSIDTVDNPYADVGMSRNDSQARNSEEPIYDEVAADSPI 154
Score = 39.9 bits (93), Expect = 0.007
Identities = 30/117 (25%), Positives = 52/117 (44%), Gaps = 5/117 (4%)
Query: 1909 NPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPE-SESTTTSSLVSESTTT 1967
N E TTT++ + +TTT + V ++P +T TS E + S+ SS S ++TT
Sbjct: 54 NQPVEQTTTTT--TTTTTTHTTVENKPANNTPAQGNTDTSGAEETASSRRSSQASTASTT 111
Query: 1968 SSPESESTTTSSPESESTTT--SSLVSESTTTSSPESESTTTISPVSESTTTSSPVS 2022
S S T +P ++ + S S E + + I V + + +P +
Sbjct: 112 WSDTSSIDTVDNPYADVGMSRNDSQARNSEEPIYDEVAADSPIYSVIQHFSGDTPDT 168
Score = 37.6 bits (87), Expect = 0.046
Identities = 26/99 (26%), Positives = 47/99 (47%), Gaps = 10/99 (10%)
Query: 2048 NNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPA-SESTTTSSPASESTT 2106
N P ++TTT + + T+ + ++PA +T TS + S+ SS AS ++T
Sbjct: 54 NQPVEQTTTTT---TTTTTTHTTVENKPANNTPAQGNTDTSGAEETASSRRSSQASTAST 110
Query: 2107 TSSPESESTTTSSPASESTTIEEQGVSPHSEKLSANEDP 2145
T S S T +P ++ G+S + + +E+P
Sbjct: 111 TWSDTSSIDTVDNPYADV------GMSRNDSQARNSEEP 143
Score = 34.9 bits (80), Expect = 0.31
Identities = 24/80 (30%), Positives = 41/80 (51%), Gaps = 7/80 (8%)
Query: 2031 ESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTT--SSPASESTTTS 2088
E +TTT++ TTT + E+ NN ++ T +S A E+ ++ SS AS ++TT
Sbjct: 58 EQTTTTTTT-----TTTTHTTVENKPANNTPAQGNTDTSGAEETASSRRSSQASTASTTW 112
Query: 2089 SPASESTTTSSPASESTTTS 2108
S S T +P ++ +
Sbjct: 113 SDTSSIDTVDNPYADVGMSR 132
Score = 33.4 bits (76), Expect = 1.1
Identities = 24/77 (31%), Positives = 40/77 (51%), Gaps = 3/77 (3%)
Query: 1993 ESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPE-SESTTTSSPASESTTTNNPK 2051
E TTT++ + +TTT + V ++P +T T E + S+ SS AS ++TT +
Sbjct: 58 EQTTTTT--TTTTTTHTTVENKPANNTPAQGNTDTSGAEETASSRRSSQASTASTTWSDT 115
Query: 2052 SESTTTNNPASESITSS 2068
S T +NP ++ S
Sbjct: 116 SSIDTVDNPYADVGMSR 132
Score = 33.0 bits (75), Expect = 1.5
Identities = 21/89 (23%), Positives = 42/89 (47%), Gaps = 12/89 (13%)
Query: 2010 PVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSS 2069
PV ++TTT++ +TTT + ++PA +T T+ + +++ + S
Sbjct: 56 PVEQTTTTTT---TTTTTHTTVENKPANNTPAQGNTDTSGAEETASSRRS---------S 103
Query: 2070 PASESTTTSSPASESTTTSSPASESTTTS 2098
AS ++TT S S T +P ++ +
Sbjct: 104 QASTASTTWSDTSSIDTVDNPYADVGMSR 132
Score = 32.2 bits (73), Expect = 2.4
Identities = 21/66 (31%), Positives = 31/66 (46%), Gaps = 2/66 (3%)
Query: 1895 TTTNSPESESTTTNN-PESESTTTSSPE-SESTTTSSLVSESTTTSSPESESTTTSSPES 1952
TTT E+ NN P +T TS E + S+ SS S ++TT S S T +P +
Sbjct: 67 TTTTHTTVENKPANNTPAQGNTDTSGAEETASSRRSSQASTASTTWSDTSSIDTVDNPYA 126
Query: 1953 ESTTTS 1958
+ +
Sbjct: 127 DVGMSR 132
>gnl|CDD|217330 pfam03035, RNA_capsid, Calicivirus putative RNA polymerase/capsid
protein.
Length = 226
Score = 42.3 bits (100), Expect = 0.001
Identities = 32/111 (28%), Positives = 50/111 (45%), Gaps = 16/111 (14%)
Query: 1915 TTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESES 1974
T +P S +TT+ S S+ P S S++ SS S+ST ++ L S S ++SS
Sbjct: 103 TRYWAPNSMATTSYSGGFTSSPVPVPPSSSSSASSVSSQSTQSTGLSSSSYSSSSA---- 158
Query: 1975 TTTSSPESESTTTSSLVSESTTTSSPESES---TTTISPVSESTTTSSPVS 2022
S+ TSS V + P T ++P S + ++S VS
Sbjct: 159 ---------SSRTSSWVRSQNSNLEPFMPGALQTAWVTPPSSTASSSGTVS 200
Score = 41.2 bits (97), Expect = 0.003
Identities = 30/104 (28%), Positives = 45/104 (43%), Gaps = 16/104 (15%)
Query: 1892 SENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPE 1951
S TT+ S S+ P S S++ SS S+ST ++ L S S ++SS
Sbjct: 110 SMATTSYSGGFTSSPVPVPPSSSSSASSVSSQSTQSTGLSSSSYSSSSA----------- 158
Query: 1952 SESTTTSSLVSESTTTSSPESES---TTTSSPESESTTTSSLVS 1992
S+ TSS V + P T +P S + ++S VS
Sbjct: 159 --SSRTSSWVRSQNSNLEPFMPGALQTAWVTPPSSTASSSGTVS 200
Score = 37.3 bits (87), Expect = 0.063
Identities = 28/101 (27%), Positives = 48/101 (47%), Gaps = 16/101 (15%)
Query: 2035 TTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASES 2094
T +P S +TT+ + S+ P S S ++SS +S+ST ++ +S S ++SS +S
Sbjct: 103 TRYWAPNSMATTSYSGGFTSSPVPVPPSSSSSASSVSSQSTQSTGLSSSSYSSSSASS-- 160
Query: 2095 TTTSS-------------PASESTTTSSPESESTTTSSPAS 2122
TSS P + T +P S + ++S S
Sbjct: 161 -RTSSWVRSQNSNLEPFMPGALQTAWVTPPSSTASSSGTVS 200
Score = 35.8 bits (83), Expect = 0.18
Identities = 29/107 (27%), Positives = 48/107 (44%), Gaps = 16/107 (14%)
Query: 1945 TTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESES 2004
T +P S +TT+ S S+ P S S++ SS S+ST ++ L S S ++SS
Sbjct: 103 TRYWAPNSMATTSYSGGFTSSPVPVPPSSSSSASSVSSQSTQSTGLSSSSYSSSSA---- 158
Query: 2005 TTTISPVSESTTTSSPVSESTTTISPESES---TTTSSPASESTTTN 2048
S+ TSS V + + P T +P S + +++
Sbjct: 159 ---------SSRTSSWVRSQNSNLEPFMPGALQTAWVTPPSSTASSS 196
Score = 35.0 bits (81), Expect = 0.31
Identities = 29/109 (26%), Positives = 50/109 (45%), Gaps = 3/109 (2%)
Query: 1910 PESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSS 1969
P S +TT+ S S+ S S++ SS S+ST ++ S S ++SS S+ TSS
Sbjct: 108 PNSMATTSYSGGFTSSPVPVPPSSSSSASSVSSQSTQSTGLSSSSYSSSSA---SSRTSS 164
Query: 1970 PESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTS 2018
+ P ++ V+ ++T+S +T V +S T +
Sbjct: 165 WVRSQNSNLEPFMPGALQTAWVTPPSSTASSSGTVSTVPKGVLDSWTPA 213
Score = 33.5 bits (77), Expect = 1.0
Identities = 30/105 (28%), Positives = 46/105 (43%), Gaps = 8/105 (7%)
Query: 2015 TTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASES 2074
T +P S +TT+ S S+ P S S++ ++ S+ST S +S S +S S
Sbjct: 103 TRYWAPNSMATTSYSGGFTSSPVPVPPSSSSSASSVSSQSTQ---STGLSSSSYSSSSAS 159
Query: 2075 TTTSSPASESTTTSSPASES---TTTSSPASESTTTSSPESESTT 2116
+ TSS + P T +P S+T SS + ST
Sbjct: 160 SRTSSWVRSQNSNLEPFMPGALQTAWVTPP--SSTASSSGTVSTV 202
Score = 32.3 bits (74), Expect = 2.6
Identities = 23/76 (30%), Positives = 34/76 (44%), Gaps = 7/76 (9%)
Query: 2060 PASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSS 2119
P S + TS S S+ P S S++ SS +S+ST +ST SS S++ SS
Sbjct: 108 PNSMATTSYSGGFTSSPVPVPPSSSSSASSVSSQST-------QSTGLSSSSYSSSSASS 160
Query: 2120 PASESTTIEEQGVSPH 2135
S + + P
Sbjct: 161 RTSSWVRSQNSNLEPF 176
Score = 31.9 bits (73), Expect = 3.4
Identities = 19/86 (22%), Positives = 36/86 (41%), Gaps = 9/86 (10%)
Query: 1880 STVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSS 1939
++ + S S ++ +S ++ST ++ S++ SS TSS V +
Sbjct: 121 TSSPVPVPPSSSSSASSVSSQSTQSTGLSSSSYSSSSASS------RTSSWVRSQNSNLE 174
Query: 1940 PESES---TTTSSPESESTTTSSLVS 1962
P T +P S + ++S VS
Sbjct: 175 PFMPGALQTAWVTPPSSTASSSGTVS 200
>gnl|CDD|183558 PRK12495, PRK12495, hypothetical protein; Provisional.
Length = 226
Score = 42.2 bits (99), Expect = 0.002
Identities = 20/129 (15%), Positives = 46/129 (35%), Gaps = 5/129 (3%)
Query: 2016 TTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASEST 2075
T PV+E ++ ++ S++ + +P ++ PA+E+ + A
Sbjct: 63 TCQQPVTEDGAA-GDDAGDGAEATAPSDAGSQASPDDDAQ----PAAEAEAADQSAPPEA 117
Query: 2076 TTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVSPH 2135
+++S E+ T + + +P + + E S P S +
Sbjct: 118 SSTSATDEAATDPPATAAARDGPTPDPTAQPATPDERRSPRQRPPVSGEPPTPSTPDAHV 177
Query: 2136 SEKLSANED 2144
+ L A +
Sbjct: 178 AGTLQAARE 186
Score = 41.8 bits (98), Expect = 0.002
Identities = 22/147 (14%), Positives = 55/147 (37%), Gaps = 16/147 (10%)
Query: 2001 ESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNP 2060
++ + S++ + +SP ++ + E+E+ S+P S+T+ ++ +
Sbjct: 77 DAGDGAEATAPSDAGSQASPDDDAQP--AAEAEAADQSAPPEASSTSATDEAATDPPATA 134
Query: 2061 ASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSP 2120
A+ + P ++ T S T ++ A + T +
Sbjct: 135 AARDGPTPDPTAQPATPDERRSPRQRPPVSGEPPTPSTPDAHVAGTLQA----------- 183
Query: 2121 ASESTTIEEQGVSPHSEKLSANEDPEE 2147
A ES + ++ + + +A +DP
Sbjct: 184 ARESL---VETLARFARRAAATDDPRR 207
Score = 41.0 bits (96), Expect = 0.004
Identities = 26/121 (21%), Positives = 54/121 (44%), Gaps = 5/121 (4%)
Query: 1992 SESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPK 2051
S++ + +SP+ ++ + +E+ S+P S+T+ + E+ + ++ A+ T +P
Sbjct: 88 SDAGSQASPDDDAQP--AAEAEAADQSAPPEASSTSATDEAATDPPATAAARDGPTPDPT 145
Query: 2052 SESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASES--TTTSSPASESTTTSS 2109
++ T + S + E T S+P + T A ES T + A + T
Sbjct: 146 AQPATPDERRSPR-QRPPVSGEPPTPSTPDAHVAGTLQAARESLVETLARFARRAAATDD 204
Query: 2110 P 2110
P
Sbjct: 205 P 205
Score = 39.5 bits (92), Expect = 0.013
Identities = 22/111 (19%), Positives = 44/111 (39%), Gaps = 9/111 (8%)
Query: 2046 TTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASEST 2105
T P +E + A + +++P S++ + +SP ++ PA+E+ A
Sbjct: 63 TCQQPVTEDGAAGDDAGDGAEATAP-SDAGSQASPDDDAQ----PAAEAEAADQSAPPEA 117
Query: 2106 TTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSANEDPEEFPNEDVFEH 2156
+++S E+ T PA+ + G +P A D P +
Sbjct: 118 SSTSATDEA-ATDPPATAAA---RDGPTPDPTAQPATPDERRSPRQRPPVS 164
Score = 38.3 bits (89), Expect = 0.032
Identities = 24/138 (17%), Positives = 47/138 (34%), Gaps = 9/138 (6%)
Query: 1962 SESTTTSSPESESTTTSSPES-ESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSP 2020
S++ + +SP+ ++ + E+ + + S S T + T + T P
Sbjct: 88 SDAGSQASPDDDAQPAAEAEAADQSAPPEASSTSATDEAATDPPATA---AARDGPTPDP 144
Query: 2021 VSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITS--SSPASESTTTS 2078
++ T S T + + T A ES+ + A + T
Sbjct: 145 TAQPATPDERRSPRQRPPVSGEPPTPSTPDAHVAGTLQA-ARESLVETLARFARRAAATD 203
Query: 2079 SP--ASESTTTSSPASES 2094
P A E + A+E+
Sbjct: 204 DPRRAREYLEAAREAAEA 221
>gnl|CDD|227651 COG5347, COG5347, GTPase-activating protein that regulates ARFs
(ADP-ribosylation factors), involved in ARF-mediated
vesicular transport [Intracellular trafficking and
secretion].
Length = 319
Score = 42.8 bits (101), Expect = 0.002
Identities = 27/174 (15%), Positives = 67/174 (38%), Gaps = 6/174 (3%)
Query: 1923 ESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTS-SPESESTTTSSPE 1981
+S++ S S S +++ ES+S ++S+ + S ES ++ +
Sbjct: 125 DSSSPSDFSSFSASSTRTVDSVDDRLDSESQSRSSSASLGNSNRPDDELNVESFQSTGSK 184
Query: 1982 SESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPA 2041
S T++ ++ S + ++ S + T S S++ S ++ + P
Sbjct: 185 PRSLTSTKSNKDNLLNSELLTLNSLLSSNSEVGSGTKSR-SDAQEKSSTKATESVKPGPV 243
Query: 2042 SESTTTNNPKSESTTTNNPASESITSSSPA----SESTTTSSPASESTTTSSPA 2091
+ S+T++ P + + T+ + + + S+ T++ A
Sbjct: 244 NTSSTSSLPPAIKRSPVQQLESFTTTPVYFPVNTPATFDATLKSYYSSLTANIA 297
Score = 42.1 bits (99), Expect = 0.002
Identities = 31/173 (17%), Positives = 66/173 (38%), Gaps = 6/173 (3%)
Query: 1943 ESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTS-SLVSESTTTSSPE 2001
+S++ S S S +++ V ES+S ++S+ S L ES ++ +
Sbjct: 125 DSSSPSDFSSFSASSTRTVDSVDDRLDSESQSRSSSASLGNSNRPDDELNVESFQSTGSK 184
Query: 2002 SESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPA 2061
S T+ ++ S ++ ++ S + T S + ++ K+ + P
Sbjct: 185 PRSLTSTKSNKDNLLNSELLTLNSLLSSNSEVGSGTKSRSDAQEKSST-KATESVKPGPV 243
Query: 2062 SESITSSSPASESTTTSSPASESTTTSSPA----SESTTTSSPASESTTTSSP 2110
+ S TSS P + + TTT + + + S+ T++
Sbjct: 244 NTSSTSSLPPAIKRSPVQQLESFTTTPVYFPVNTPATFDATLKSYYSSLTANI 296
Score = 40.9 bits (96), Expect = 0.007
Identities = 33/172 (19%), Positives = 66/172 (38%), Gaps = 7/172 (4%)
Query: 1896 TTNSPESESTTTNNPESESTTTSSP-ESESTTTSSLVSESTTTS-SPESESTTTSSPESE 1953
++ S S + ++ +S ES+S ++S+ + S ES ++ +
Sbjct: 127 SSPSDFSSFSASSTRTVDSVDDRLDSESQSRSSSASLGNSNRPDDELNVESFQSTGSKPR 186
Query: 1954 STTTSSLVSESTTTSSPESESTTTSSPESESTTTSS--LVSESTTTSSPESESTTTISPV 2011
S T++ ++ S + ++ SS + T S E ++T + ES ++
Sbjct: 187 SLTSTKSNKDNLLNSELLTLNSLLSSNSEVGSGTKSRSDAQEKSSTKATESVKPGPVNTS 246
Query: 2012 SESTTTSSPVSESTTTISPESESTTTSSPA-SESTTTNNPKSE-STTTNNPA 2061
S S + + S +T P + +T KS S+ T N A
Sbjct: 247 STS-SLPPAIKRSPVQQLESFTTTPVYFPVNTPATFDATLKSYYSSLTANIA 297
Score = 36.7 bits (85), Expect = 0.14
Identities = 26/141 (18%), Positives = 52/141 (36%), Gaps = 5/141 (3%)
Query: 1885 STLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESES 1944
S+ SL + N + ES + + S T++ ++ S L++ ++ SS
Sbjct: 158 SSSASLGNSNRPDDELNVESFQSTGSKPRSLTSTKSNKDNLLNSELLTLNSLLSSNSEVG 217
Query: 1945 TTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSP---- 2000
+ T S SS + + P + S+T+S P + + + TTT
Sbjct: 218 SGTKSRSDAQ-EKSSTKATESVKPGPVNTSSTSSLPPAIKRSPVQQLESFTTTPVYFPVN 276
Query: 2001 ESESTTTISPVSESTTTSSPV 2021
+ S+ T++
Sbjct: 277 TPATFDATLKSYYSSLTANIA 297
Score = 35.9 bits (83), Expect = 0.25
Identities = 29/157 (18%), Positives = 58/157 (36%), Gaps = 4/157 (2%)
Query: 1973 ESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTS-SPVSESTTTISPE 2031
+S++ S S S +++ V ES+S ++ + + S ES + +
Sbjct: 125 DSSSPSDFSSFSASSTRTVDSVDDRLDSESQSRSSSASLGNSNRPDDELNVESFQSTGSK 184
Query: 2032 SESTTTSSPASESTTTNNPKS-ESTTTNNPASESITSS-SPASESTTTSSPASESTTTSS 2089
S T++ ++ + + S ++N S T S S A E ++T + S +
Sbjct: 185 PRSLTSTKSNKDNLLNSELLTLNSLLSSNSEVGSGTKSRSDAQEKSSTKATESVKPGPVN 244
Query: 2090 PASESTTTSSPASESTTTSSPESESTTTSSPASESTT 2126
+S S + S +T P + T
Sbjct: 245 TSSTS-SLPPAIKRSPVQQLESFTTTPVYFPVNTPAT 280
>gnl|CDD|236652 PRK10118, PRK10118, flagellar hook-length control protein;
Provisional.
Length = 408
Score = 42.9 bits (101), Expect = 0.002
Identities = 36/216 (16%), Positives = 64/216 (29%), Gaps = 22/216 (10%)
Query: 1901 ESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSL 1960
TT S + ++ V E+ + E ++ +P + TS+L
Sbjct: 63 SKGLLTTKGEPLVSDKLADLLAQQANLLIPVDETLPVITDEQSLSSPLTP---ALKTSAL 119
Query: 1961 VSESTTTSSPESESTTTSSPESESTTTSSLV------SESTTTSSPESESTTTISPVSES 2014
+ S E + + + + S+L +T + S P +
Sbjct: 120 AALSKNAQKDEKADDLS---DEDLASLSALFAMLPGQDNTTPVADAPSTVLPAEKPTLLT 176
Query: 2015 TTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASES 2074
S + T T S S TT P T P + + +E
Sbjct: 177 KDMPSAPQDETHT---LSSDEHEKGLTSAQLTTAQPDDAPGTPAQPLTPLAAEAQAKAEV 233
Query: 2075 TTTSSPASESTTTSSPASESTTTSSPASESTTTSSP 2110
+T SP + A+ T T T ++P
Sbjct: 234 ISTPSP-------VTAAASPTITPHQTQPLPTAAAP 262
Score = 39.5 bits (92), Expect = 0.020
Identities = 34/213 (15%), Positives = 67/213 (31%), Gaps = 11/213 (5%)
Query: 1911 ESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSP 1970
TT S + L+++ P E+ + E + SS ++ + TS+
Sbjct: 63 SKGLLTTKGEPLVSDKLADLLAQQANLLIPVDETLPVITDEQ---SLSSPLTPALKTSAL 119
Query: 1971 ESESTTTSSPESESTTTSS-LVSESTTTSSPESESTTTISPVSESTT--TSSPVSESTTT 2027
+ S E + L S S + + TT + ST P +
Sbjct: 120 AALSKNAQKDEKADDLSDEDLASLSALFAMLPGQDNTTPVADAPSTVLPAEKPTLLTKDM 179
Query: 2028 ISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTT 2087
S + T T S + E S + + + + +++
Sbjct: 180 PSAPQDETHTLS-SDEHEKGLT----SAQLTTAQPDDAPGTPAQPLTPLAAEAQAKAEVI 234
Query: 2088 SSPASESTTTSSPASESTTTSSPESESTTTSSP 2120
S+P+ + S + T P + + S+P
Sbjct: 235 STPSPVTAAASPTITPHQTQPLPTAAAPVLSAP 267
Score = 33.3 bits (76), Expect = 1.8
Identities = 26/138 (18%), Positives = 46/138 (33%), Gaps = 13/138 (9%)
Query: 1877 NSESTVVMSTLNSLLSENTTTNSPESESTTTNN---PESESTTTSSPESESTTTSSLVSE 1933
+ E +S L ++L T +T P + S + T T +S
Sbjct: 136 SDEDLASLSALFAMLPGQDNTTPVADAPSTVLPAEKPTLLTKDMPSAPQDETHT---LSS 192
Query: 1934 STTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSE 1993
S TT+ P+ T + ++ + ++E +T SP T++
Sbjct: 193 DEHEKGLTSAQLTTAQPDDAPGTPAQPLTPLAAEAQAKAEVISTPSP-----VTAAASP- 246
Query: 1994 STTTSSPESESTTTISPV 2011
T T T +PV
Sbjct: 247 -TITPHQTQPLPTAAAPV 263
>gnl|CDD|218056 pfam04388, Hamartin, Hamartin protein. This family includes the
hamartin protein which is thought to function as a tumour
suppressor. The hamartin protein interacts with the
tuberin protein pfam03542. Tuberous sclerosis complex
(TSC) is an autosomal dominant disorder and is
characterized by the presence of hamartomas in many
organs, such as brain, skin, heart, lung, and kidney. It
is caused by mutation either TSC1 or TSC2 tumour
suppressor gene. TSC1 encodes a protein, hamartin,
containing two coiled-coil regions, which have been shown
to mediate binding to tuberin. The TSC2 gene codes for
tuberin pfam03542. These two proteins function within the
same pathway(s) regulating cell cycle, cell growth,
adhesion, and vesicular trafficking.
Length = 667
Score = 43.0 bits (101), Expect = 0.002
Identities = 44/259 (16%), Positives = 84/259 (32%), Gaps = 17/259 (6%)
Query: 1878 SESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTT--TSSLVSEST 1935
+ S + L +NT+T+ + T+ P + +S ++ + SSL +T
Sbjct: 284 NSSPRQALPPSISLPQNTSTSGSLHSAQTSRRPNTTFDKAASSGTKDSLWSPSSLCGMAT 343
Query: 1936 TTSS-PESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSES 1994
SS S + SP S + ES +T+ P + + +
Sbjct: 344 PPSSIGMSPLILSLSPSHLSGRAPGTTGSGKGEPASESTPSTSPPPPGLADDIVRAIFAT 403
Query: 1995 TTTSSPESESTTTIS--PVSESTTTSSPVSESTT------TISPESESTTTSSPASESTT 2046
++ S+P E S P + +S ++ E T S
Sbjct: 404 SSRSAPRKEELQNESSFPKLVRQENLQNIEKSAEGGILDAAVTEELLKLTNEKDDLGSRG 463
Query: 2047 TNNPKSESTTT----NNPASESITSSSPASESTTTSSPASEST--TTSSPASESTTTSSP 2100
++P S T N E + S+ + + S+ + T ++
Sbjct: 464 LDSPFSRDTLLGSQRNKAQPELLVSTPDKGPAESQSAANLRVSWFTPIENPMREEKSAPA 523
Query: 2101 ASESTTTSSPESESTTTSS 2119
+ E TS ES + +
Sbjct: 524 SEEDEQTSLEESLISPSPC 542
Score = 36.9 bits (85), Expect = 0.14
Identities = 63/332 (18%), Positives = 117/332 (35%), Gaps = 34/332 (10%)
Query: 1886 TLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESEST 1945
+L+ + + S S N+ ++ S ++T+TS + + T+ P +
Sbjct: 262 SLDPTETSSEDGYSFSRSSAYPNSSPRQALPPSISLPQNTSTSGSLHSAQTSRRPNTTFD 321
Query: 1946 TTSSPESESTT--TSSLVSESTTTSSPE------SESTTTSSPESESTTTSS---LVSES 1994
+S ++ + SSL +T SS S S + S + TT S SES
Sbjct: 322 KAASSGTKDSLWSPSSLCGMATPPSSIGMSPLILSLSPSHLSGRAPGTTGSGKGEPASES 381
Query: 1995 T--TTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTN---- 2048
T T+ P + + + +++ S+P E S + + + +
Sbjct: 382 TPSTSPPPPGLADDIVRAIFATSSRSAPRKEELQNESSFPKLVRQENLQNIEKSAEGGIL 441
Query: 2049 ----NPKSESTTTNNPASESITSSSPASESTTTSSPASE-------STTTSSPASESTTT 2097
+ T S SP S T S ++ ST PA +
Sbjct: 442 DAAVTEELLKLTNEKDDLGSRGLDSPFSRDTLLGSQRNKAQPELLVSTPDKGPAESQSAA 501
Query: 2098 SSPASESTTTSSPESESTTTS-SPASESTTIEEQGVSPHSEKLSANEDPEEFPNEDVFEH 2156
+ S T +P E + S E T++EE +SP P + P + +F+
Sbjct: 502 NLRVSWFTPIENPMREEKSAPASEEDEQTSLEESLISPSPCSR-----PPQPPYDRLFDI 556
Query: 2157 TFAEIPNIDHSNQTDEAIPETFDAREEWPQCK 2188
+ + S +T EA+ + R + +
Sbjct: 557 ALPKTACLFLSRKTYEALLKEAGQRLSQEEGE 588
Score = 32.6 bits (74), Expect = 3.3
Identities = 38/184 (20%), Positives = 65/184 (35%), Gaps = 11/184 (5%)
Query: 1988 SSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTT 2047
+ L + T TSS + S S + SSP +IS ++T+ S S T+
Sbjct: 259 AKLSLDPTETSSEDGYSF----SRSSAYPNSSPRQALPPSISLPQNTSTSGSLHSAQTSR 314
Query: 2048 N-NPKSESTTTNNPASESITSSSPASESTTTSS-PASESTTTSSPASESTTTSSPASEST 2105
N + ++ + SS +T SS S + SP+ S
Sbjct: 315 RPNTTFDKAASSGTKDSLWSPSSLCGMATPPSSIGMSPLILSLSPSHLSGRAPGTTGSGK 374
Query: 2106 TTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSANEDPEEFPNEDVFE--HTFAEIPN 2163
+ ES +T+ P + ++ + + + EE NE F + N
Sbjct: 375 GEPASESTPSTSPPPPGLA---DDIVRAIFATSSRSAPRKEELQNESSFPKLVRQENLQN 431
Query: 2164 IDHS 2167
I+ S
Sbjct: 432 IEKS 435
>gnl|CDD|114172 pfam05432, BSP_II, Bone sialoprotein II (BSP-II). Bone sialoprotein
(BSP) is a major structural protein of the bone matrix
that is specifically expressed by fully-differentiated
osteoblasts. The expression of bone sialoprotein (BSP) is
normally restricted to mineralised connective tissues of
bones and teeth where it has been associated with mineral
crystal formation. However, it has been found that
ectopic expression of BSP occurs in various lesions,
including oral and extraoral carcinomas, in which it has
been associated with the formation of microcrystalline
deposits and the metastasis of cancer cells to bone.
Length = 291
Score = 42.4 bits (99), Expect = 0.002
Identities = 49/224 (21%), Positives = 74/224 (33%), Gaps = 31/224 (13%)
Query: 1937 TSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTT 1996
+ S E SS E TS+ ++ +S+ E+E+TT S T
Sbjct: 48 SDSSEENGDGDSSEEEGEEETSN-----EEENNEDSDGNEDEEAEAENTTLS------TV 96
Query: 1997 TSSPESESTTTIS---------PVSESTTTSSPVSESTTTISPESESTTTSSPAS---ES 2044
T ++T P E + E E A
Sbjct: 97 TLGYGGDATPGTGNIGLAALQLPKKAGNAGKKATKEDESDEDEEEEEEEEEEEAEVEENE 156
Query: 2045 TTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASES 2104
TN + ST ++ S + E + + +E TT + P TT+SP
Sbjct: 157 QGTNGTSTNSTEVDHGNGSSGGDNGEEGEEESVTEAEAEGTTVAGP-----TTTSPNGGF 211
Query: 2105 TTTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSANEDPEEF 2148
T+ P+ TT P + TT E QG E+ ANE +
Sbjct: 212 QPTTPPQEVYGTTDPPFGKVTTPEYQG---EYEQTGANEYDGGY 252
Score = 37.4 bits (86), Expect = 0.086
Identities = 44/230 (19%), Positives = 71/230 (30%), Gaps = 7/230 (3%)
Query: 1917 TSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTT 1976
+ S E SS TS+ E + + E E + + T +
Sbjct: 48 SDSSEENGDGDSSEEEGEEETSNEEENNEDSDGNEDEEAEAENTTLSTVTLGYGGDATPG 107
Query: 1977 TSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTT 2036
T + + + ++ E ES E + V E+ + S ++T
Sbjct: 108 TGNIGLAALQLPKKAGNAGKKATKEDESDEDEEEEEEEEEEEAEVEENEQGTNGTSTNST 167
Query: 2037 -TSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASEST 2095
S N + E + +E T + P TT+SP T+ P
Sbjct: 168 EVDHGNGSSGGDNGEEGEEESVTEAEAEGTTVAGP-----TTTSPNGGFQPTTPPQEVYG 222
Query: 2096 TTSSPASESTTTSSP-ESESTTTSSPASESTTIEEQGVSPHSEKLSANED 2144
TT P + TT E E T + E + P + A ED
Sbjct: 223 TTDPPFGKVTTPEYQGEYEQTGANEYDGGYEIYESENGEPRGDSYRAYED 272
Score = 36.6 bits (84), Expect = 0.14
Identities = 37/200 (18%), Positives = 69/200 (34%), Gaps = 12/200 (6%)
Query: 1901 ESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTT-TSSPESESTTTSS 1959
E+ + NN +S+ E+E+TT S T T ++T T + +
Sbjct: 67 ETSNEEENNEDSDGNEDEEAEAENTTLS------TVTLGYGGDATPGTGNIGLAALQLPK 120
Query: 1960 LVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSS 2019
+ ++ E ES E E + V E+ ++ S ++T + + ++
Sbjct: 121 KAGNAGKKATKEDESDEDEEEEEEEEEEEAEVEENEQGTNGTSTNSTEVD--HGNGSSGG 178
Query: 2020 PVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSS 2079
E S ++ A +TT+ N + TT P E ++ P TT
Sbjct: 179 DNGEEGEEESVTEAEAEGTTVAGPTTTSPNGGFQPTT---PPQEVYGTTDPPFGKVTTPE 235
Query: 2080 PASESTTTSSPASESTTTSS 2099
E T + +
Sbjct: 236 YQGEYEQTGANEYDGGYEIY 255
>gnl|CDD|213932 TIGR04319, SerAla_Lrha_rpt, surface protein repeat Ser-Ala-175. This
serine and alanine-rich surface protein repeat, about 175
amino acids long, occurs up to nine times in surface
proteins of some Lactobacillus strains, particularly in
Lactobacillus rhamnosus. Members proteins have the
N-terminal variant signal sequence described by TIGR03715
and C-terminal LPXTG signals for surface attachment by
sortase.
Length = 175
Score = 41.0 bits (96), Expect = 0.002
Identities = 46/187 (24%), Positives = 79/187 (42%), Gaps = 16/187 (8%)
Query: 1934 STTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSE 1993
ST +S S + SS S SL S + T SS + S T+S S S S+ S
Sbjct: 2 STASSVASSANAVASSAASRFPDNQSLASLAKTASS--ANSVTSSYAASASADASAASSL 59
Query: 1994 STTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSE 2053
+T SS SS S++ + ++ + +S S +N S
Sbjct: 60 ATKVSSANK-------------AASSAASQANSALAAGNLDAASSYANQASKAASNASSL 106
Query: 2054 STTTNNPASESITSSSPASESTT-TSSPASESTTTSSPASESTTTSSPASESTTTSSPES 2112
+ N+ AS++++ + AS + SS AS + T + + + S AS ++ +S S
Sbjct: 107 ADKANSAASKALSEALQASSAAAIASSAASSAATLAGSLASANDAKSDASAASDAASSAS 166
Query: 2113 ESTTTSS 2119
+++S
Sbjct: 167 VVASSAS 173
Score = 40.6 bits (95), Expect = 0.003
Identities = 44/178 (24%), Positives = 86/178 (48%), Gaps = 9/178 (5%)
Query: 1944 STTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESE 2003
ST +S S + SS S S S + T SS S TSS + ++ +S S
Sbjct: 2 STASSVASSANAVASSAASRFPDNQSLASLAKTASSANS---VTSSYAASASADASAASS 58
Query: 2004 STTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPA-- 2061
T +S S + SS S++ + ++ + +S S +N S + N+ A
Sbjct: 59 LATKVS--SANKAASSAASQANSALAAGNLDAASSYANQASKAASNASSLADKANSAASK 116
Query: 2062 --SESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTT 2117
SE++ +SS A+ +++ +S A+ + + A+++ + +S AS++ +++S + S +T
Sbjct: 117 ALSEALQASSAAAIASSAASSAATLAGSLASANDAKSDASAASDAASSASVVASSAST 174
Score = 39.1 bits (91), Expect = 0.009
Identities = 42/167 (25%), Positives = 75/167 (44%), Gaps = 9/167 (5%)
Query: 1964 STTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSE 2023
ST +S S + SS S SL S + T SS + S T+ S S S+ S
Sbjct: 2 STASSVASSANAVASSAASRFPDNQSLASLAKTASS--ANSVTSSYAASASADASAASSL 59
Query: 2024 STTTISPESESTTTSSPASESTTTNNPK--SESTTTNNPASE--SITSSSPASESTTTSS 2079
+T S SS AS++ + +++ N AS+ S SS ++ S
Sbjct: 60 ATKVSSANK---AASSAASQANSALAAGNLDAASSYANQASKAASNASSLADKANSAASK 116
Query: 2080 PASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTT 2126
SE+ SS A+ +++ +S A+ + + +++ + +S AS++ +
Sbjct: 117 ALSEALQASSAAAIASSAASSAATLAGSLASANDAKSDASAASDAAS 163
Score = 31.4 bits (71), Expect = 3.6
Identities = 42/197 (21%), Positives = 76/197 (38%), Gaps = 26/197 (13%)
Query: 1904 STTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSE 1963
ST ++ S + SS S SL S + T SS + S T+S S
Sbjct: 2 STASSVASSANAVASSAASRFPDNQSLASLAKTASS------------ANSVTSSYAASA 49
Query: 1964 STTTSSPESESTTTSSPESESTTTSSLV-SESTTTSSPESESTTTISPVSESTTTSSPVS 2022
S S+ S +T SS +++ +S S + + S + + S +S
Sbjct: 50 SADASAASSLATKVSSANKAASSAASQANSALAAGNLDAASSYANQASKAASNASSLADK 109
Query: 2023 ESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPAS 2082
++ SE+ SS A+ ++ + AS + T + + + S AS
Sbjct: 110 ANSAASKALSEALQASSAAAIAS-------------SAASSAATLAGSLASANDAKSDAS 156
Query: 2083 ESTTTSSPASESTTTSS 2099
++ +S AS +++S
Sbjct: 157 AASDAASSASVVASSAS 173
>gnl|CDD|223031 PHA03273, PHA03273, envelope glycoprotein C; Provisional.
Length = 486
Score = 42.7 bits (100), Expect = 0.002
Identities = 25/99 (25%), Positives = 45/99 (45%), Gaps = 8/99 (8%)
Query: 1912 SESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPE 1971
S ++T+SS E+ +T+ + S T + S T+ ++++T ++ +T S P
Sbjct: 26 SGASTSSSIENSDNSTAEMQSTPATPTHTTSNLTSPFGTGTDNSTNANGTESTTQASQPH 85
Query: 1972 SESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISP 2010
S TT T T SL+S S + TT++
Sbjct: 86 SHETTI-------TCTKSLIS-VPYYKSVDMNCTTSVGV 116
Score = 42.7 bits (100), Expect = 0.003
Identities = 27/97 (27%), Positives = 43/97 (44%), Gaps = 7/97 (7%)
Query: 1932 SESTTTSSPESESTT----TSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTT 1987
S ++T+SS E+ + S+P + + TTS+L S T + + + T S S
Sbjct: 26 SGASTSSSIENSDNSTAEMQSTPATPTHTTSNLTSPFGTGTDNSTNANGTESTTQASQPH 85
Query: 1988 SSLVSESTTTSSPESESTTTISPVSESTTTSSPVSES 2024
S E+T T + S V + TTS V+ S
Sbjct: 86 S---HETTITCTKSLISVPYYKSVDMNCTTSVGVNYS 119
Score = 42.3 bits (99), Expect = 0.003
Identities = 19/67 (28%), Positives = 32/67 (47%)
Query: 2061 ASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSP 2120
AS + TSSS + +T+ S T + S T+ ++++T ++ +T S P
Sbjct: 25 ASGASTSSSIENSDNSTAEMQSTPATPTHTTSNLTSPFGTGTDNSTNANGTESTTQASQP 84
Query: 2121 ASESTTI 2127
S TTI
Sbjct: 85 HSHETTI 91
Score = 42.3 bits (99), Expect = 0.003
Identities = 22/94 (23%), Positives = 42/94 (44%), Gaps = 8/94 (8%)
Query: 1942 SESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPE 2001
S ++T+SS E+ +T+ + S T + S T+ ++++T ++ +T S P
Sbjct: 26 SGASTSSSIENSDNSTAEMQSTPATPTHTTSNLTSPFGTGTDNSTNANGTESTTQASQPH 85
Query: 2002 SESTTTISPVSESTTTSSPVSES-----TTTISP 2030
S TT + S P +S TT++
Sbjct: 86 SHETTI---TCTKSLISVPYYKSVDMNCTTSVGV 116
Score = 41.9 bits (98), Expect = 0.004
Identities = 23/87 (26%), Positives = 42/87 (48%), Gaps = 3/87 (3%)
Query: 1962 SESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPV 2021
S ++T+SS E+ +T+ +S T + S T+ ++++T + +ESTT +S
Sbjct: 26 SGASTSSSIENSDNSTAEMQSTPATPTHTTSNLTSPFGTGTDNSTNAN-GTESTTQASQP 84
Query: 2022 SESTTTISPESESTTTSSPASESTTTN 2048
TTI+ + S P +S N
Sbjct: 85 HSHETTIT--CTKSLISVPYYKSVDMN 109
Score = 41.5 bits (97), Expect = 0.006
Identities = 21/88 (23%), Positives = 40/88 (45%), Gaps = 1/88 (1%)
Query: 2022 SESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPA 2081
S ++T+ S E+ +T+ S T + S T+ +++ T+++ +T S P
Sbjct: 26 SGASTSSSIENSDNSTAEMQSTPATPTHTTSNLTSPFGTGTDNSTNANGTESTTQASQPH 85
Query: 2082 S-ESTTTSSPASESTTTSSPASESTTTS 2108
S E+T T + + S + TTS
Sbjct: 86 SHETTITCTKSLISVPYYKSVDMNCTTS 113
Score = 40.8 bits (95), Expect = 0.010
Identities = 24/95 (25%), Positives = 43/95 (45%), Gaps = 3/95 (3%)
Query: 1892 SENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPE 1951
S +T++S E+ +T +S T + S T+ ++++T ++ +T S P
Sbjct: 26 SGASTSSSIENSDNSTAEMQSTPATPTHTTSNLTSPFGTGTDNSTNANGTESTTQASQPH 85
Query: 1952 SESTT---TSSLVSESTTTSSPESESTTTSSPESE 1983
S TT T SL+S S + +T+ SE
Sbjct: 86 SHETTITCTKSLISVPYYKSVDMNCTTSVGVNYSE 120
Score = 40.8 bits (95), Expect = 0.010
Identities = 21/88 (23%), Positives = 39/88 (44%), Gaps = 1/88 (1%)
Query: 2012 SESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPA 2071
S ++T+SS + +T +S T + S T+ ++++T N + +S P
Sbjct: 26 SGASTSSSIENSDNSTAEMQSTPATPTHTTSNLTSPFGTGTDNSTNANGTESTTQASQPH 85
Query: 2072 S-ESTTTSSPASESTTTSSPASESTTTS 2098
S E+T T + + S + TTS
Sbjct: 86 SHETTITCTKSLISVPYYKSVDMNCTTS 113
Score = 38.4 bits (89), Expect = 0.045
Identities = 18/71 (25%), Positives = 33/71 (46%)
Query: 1982 SESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPA 2041
S ++T+SS+ + +T+ +S T S T+ ++++T + +T S P
Sbjct: 26 SGASTSSSIENSDNSTAEMQSTPATPTHTTSNLTSPFGTGTDNSTNANGTESTTQASQPH 85
Query: 2042 SESTTTNNPKS 2052
S TT KS
Sbjct: 86 SHETTITCTKS 96
Score = 38.4 bits (89), Expect = 0.054
Identities = 21/88 (23%), Positives = 40/88 (45%), Gaps = 1/88 (1%)
Query: 1992 SESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPK 2051
S ++T+SS E+ +T S T + S T+ ++++T ++ +T + P
Sbjct: 26 SGASTSSSIENSDNSTAEMQSTPATPTHTTSNLTSPFGTGTDNSTNANGTESTTQASQPH 85
Query: 2052 S-ESTTTNNPASESITSSSPASESTTTS 2078
S E+T T + S+ + TTS
Sbjct: 86 SHETTITCTKSLISVPYYKSVDMNCTTS 113
Score = 37.3 bits (86), Expect = 0.11
Identities = 21/82 (25%), Positives = 40/82 (48%), Gaps = 10/82 (12%)
Query: 2041 ASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSP 2100
AS ++T+++ ++ +T + S+PA+ + TTS+ S T + ++ + T S
Sbjct: 25 ASGASTSSSIENSDNST------AEMQSTPATPTHTTSNLTSPFGTGTDNSTNANGTES- 77
Query: 2101 ASESTTTSSPESESTTTSSPAS 2122
+T S P S TT + S
Sbjct: 78 ---TTQASQPHSHETTITCTKS 96
Score = 36.1 bits (83), Expect = 0.25
Identities = 22/65 (33%), Positives = 34/65 (52%), Gaps = 12/65 (18%)
Query: 2081 ASESTTTSSPASESTT----TSSPASESTTTSSPESESTT-----TSSPASESTTIEEQG 2131
AS ++T+SS + + S+PA+ + TTS+ S T T++ +ESTT Q
Sbjct: 25 ASGASTSSSIENSDNSTAEMQSTPATPTHTTSNLTSPFGTGTDNSTNANGTESTT---QA 81
Query: 2132 VSPHS 2136
PHS
Sbjct: 82 SQPHS 86
Score = 35.7 bits (82), Expect = 0.30
Identities = 23/102 (22%), Positives = 41/102 (40%), Gaps = 11/102 (10%)
Query: 2032 SESTTTSSPASESTT----TNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTT 2087
S ++T+SS + + + P + + TT+N S T + ++ + T S +T
Sbjct: 26 SGASTSSSIENSDNSTAEMQSTPATPTHTTSNLTSPFGTGTDNSTNANGTES----TTQA 81
Query: 2088 SSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEE 2129
S P S TT + + S P +S + S E
Sbjct: 82 SQPHSHETTIT---CTKSLISVPYYKSVDMNCTTSVGVNYSE 120
Score = 31.1 bits (70), Expect = 8.2
Identities = 27/100 (27%), Positives = 42/100 (42%), Gaps = 8/100 (8%)
Query: 1855 AATAVAISVIDNYSEIIFTTNNNSESTVVMSTLNSLLSENTTTNSPESESTTT-NNPESE 1913
A+T+ +I DN + + +T T T + +TN+ +ESTT + P S
Sbjct: 28 ASTSSSIENSDNSTAEMQSTPATPTHTTSNLTSPFGTGTDNSTNANGTESTTQASQPHSH 87
Query: 1914 STTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESE 1953
TT T T SL+S S + +T+ SE
Sbjct: 88 ETTI-------TCTKSLISVPYYKSVDMNCTTSVGVNYSE 120
>gnl|CDD|113196 pfam04415, DUF515, Protein of unknown function (DUF515). Family of
hypothetical Archaeal proteins.
Length = 416
Score = 42.6 bits (100), Expect = 0.002
Identities = 26/80 (32%), Positives = 36/80 (45%), Gaps = 3/80 (3%)
Query: 2051 KSESTTTNN--PASESITSSSPASESTTTSSPASESTTTSSPASESTTTS-SPASESTTT 2107
K T + I SS SES + S+ S S++TSS S ST+ S AS +
Sbjct: 257 KQNGTIFYEIVDSGYVILSSISVSESQSQSTSTSSSSSTSSSESSSTSYSPGDASIQNSQ 316
Query: 2108 SSPESESTTTSSPASESTTI 2127
S ST+ S S S++
Sbjct: 317 RSQLQSSTSQSESESASSSY 336
Score = 41.8 bits (98), Expect = 0.004
Identities = 24/70 (34%), Positives = 36/70 (51%), Gaps = 1/70 (1%)
Query: 1910 PESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPE-SESTTTSSLVSESTTTS 1968
+S SES + S+ S S++TSS ES ST+ S + S + S + ST+ S
Sbjct: 268 DSGYVILSSISVSESQSQSTSTSSSSSTSSSESSSTSYSPGDASIQNSQRSQLQSSTSQS 327
Query: 1969 SPESESTTTS 1978
ES S++ S
Sbjct: 328 ESESASSSYS 337
Score = 41.8 bits (98), Expect = 0.004
Identities = 20/80 (25%), Positives = 36/80 (45%), Gaps = 2/80 (2%)
Query: 1932 SESTTTSS--PESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSS 1989
T +S SES + S+ S S++TSS ES ST+ S ++ +
Sbjct: 258 QNGTIFYEIVDSGYVILSSISVSESQSQSTSTSSSSSTSSSESSSTSYSPGDASIQNSQR 317
Query: 1990 LVSESTTTSSPESESTTTIS 2009
+S+T+ S ++++ S
Sbjct: 318 SQLQSSTSQSESESASSSYS 337
Score = 41.5 bits (97), Expect = 0.005
Identities = 25/82 (30%), Positives = 35/82 (42%), Gaps = 1/82 (1%)
Query: 1908 NNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTS-SLVSESTT 1966
+S VSES + S+ S S++TSS ES ST+ S S +
Sbjct: 256 LKQNGTIFYEIVDSGYVILSSISVSESQSQSTSTSSSSSTSSSESSSTSYSPGDASIQNS 315
Query: 1967 TSSPESESTTTSSPESESTTTS 1988
S ST+ S ES S++ S
Sbjct: 316 QRSQLQSSTSQSESESASSSYS 337
Score = 38.8 bits (90), Expect = 0.036
Identities = 24/81 (29%), Positives = 38/81 (46%), Gaps = 7/81 (8%)
Query: 1922 SESTTTSSLVSES----TTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTT 1977
T +V ++ S ES+S +TS+ S ST++S S+T+ SP S
Sbjct: 258 QNGTIFYEIVDSGYVILSSISVSESQSQSTSTSSSSSTSSSES---SSTSYSPGDASIQN 314
Query: 1978 SSPESESTTTSSLVSESTTTS 1998
S ++TS SES ++S
Sbjct: 315 SQRSQLQSSTSQSESESASSS 335
Score = 38.4 bits (89), Expect = 0.043
Identities = 24/84 (28%), Positives = 40/84 (47%), Gaps = 3/84 (3%)
Query: 1876 NNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTS-SLVSES 1934
T+ ++S ++ + ES+S +T+ S ST++S ES ST+ S S
Sbjct: 256 LKQNGTIFYEIVDSGYVILSSISVSESQSQSTSTSSSSSTSSS--ESSSTSYSPGDASIQ 313
Query: 1935 TTTSSPESESTTTSSPESESTTTS 1958
+ S ST+ S ES S++ S
Sbjct: 314 NSQRSQLQSSTSQSESESASSSYS 337
Score = 38.4 bits (89), Expect = 0.044
Identities = 25/81 (30%), Positives = 37/81 (45%), Gaps = 5/81 (6%)
Query: 1962 SESTTTSS--PESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISP--VSESTTT 2017
T +S SES + S+ S S++TSS ES S+T+ SP S +
Sbjct: 258 QNGTIFYEIVDSGYVILSSISVSESQSQSTSTSSSSSTSSSES-SSTSYSPGDASIQNSQ 316
Query: 2018 SSPVSESTTTISPESESTTTS 2038
S + ST+ ES S++ S
Sbjct: 317 RSQLQSSTSQSESESASSSYS 337
Score = 38.0 bits (88), Expect = 0.061
Identities = 23/81 (28%), Positives = 39/81 (48%), Gaps = 7/81 (8%)
Query: 2012 SESTTTSSPVSESTTTISP----ESESTTTSSPASESTTTNNPKSESTTTNNPASESITS 2067
T V +S ES+S +TS+ +S ST++ S+T+ +P SI +
Sbjct: 258 QNGTIFYEIVDSGYVILSSISVSESQSQSTSTSSSSSTSS---SESSSTSYSPGDASIQN 314
Query: 2068 SSPASESTTTSSPASESTTTS 2088
S + ++TS SES ++S
Sbjct: 315 SQRSQLQSSTSQSESESASSS 335
Score = 37.6 bits (87), Expect = 0.071
Identities = 32/89 (35%), Positives = 46/89 (51%), Gaps = 3/89 (3%)
Query: 2062 SESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPA 2121
SES + S+ S S++TSS S S+T+ SP S S + ++TS ESES SS
Sbjct: 280 SESQSQSTSTSSSSSTSSSES-SSTSYSPGDASIQNSQRSQLQSSTSQSESES--ASSSY 336
Query: 2122 SESTTIEEQGVSPHSEKLSANEDPEEFPN 2150
S S + E + + KL A+E + N
Sbjct: 337 SYSVNLPEILKAIAAGKLDADEIKAQLQN 365
Score = 37.2 bits (86), Expect = 0.10
Identities = 23/79 (29%), Positives = 35/79 (44%), Gaps = 3/79 (3%)
Query: 1992 SESTTTSS--PESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNN 2049
T + VSES + S+ S S++T S ES S+T+ SP S +
Sbjct: 258 QNGTIFYEIVDSGYVILSSISVSESQSQSTSTSSSSSTSSSES-SSTSYSPGDASIQNSQ 316
Query: 2050 PKSESTTTNNPASESITSS 2068
++T+ SES +SS
Sbjct: 317 RSQLQSSTSQSESESASSS 335
Score = 37.2 bits (86), Expect = 0.11
Identities = 20/82 (24%), Positives = 39/82 (47%), Gaps = 6/82 (7%)
Query: 1952 SESTTTSSLVSES----TTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTT 2007
T +V ++ S ES+S +TS+ S ST++S S ST+ S ++ +
Sbjct: 258 QNGTIFYEIVDSGYVILSSISVSESQSQSTSTSSSSSTSSSE--SSSTSYSPGDASIQNS 315
Query: 2008 ISPVSESTTTSSPVSESTTTIS 2029
+S+T+ S ++++ S
Sbjct: 316 QRSQLQSSTSQSESESASSSYS 337
Score = 36.4 bits (84), Expect = 0.18
Identities = 26/85 (30%), Positives = 39/85 (45%), Gaps = 5/85 (5%)
Query: 1942 SESTTTSS--PESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSS 1999
T +S VSES + S+ S S++TSS ES ST+ S + +
Sbjct: 258 QNGTIFYEIVDSGYVILSSISVSESQSQSTSTSSSSSTSSSESSSTSYSPGDASIQNSQR 317
Query: 2000 PESESTTTISPVSESTTTSSPVSES 2024
+ +S+T+ SES + SS S S
Sbjct: 318 SQLQSSTS---QSESESASSSYSYS 339
Score = 36.1 bits (83), Expect = 0.21
Identities = 21/69 (30%), Positives = 38/69 (55%), Gaps = 1/69 (1%)
Query: 2044 STTTNNPKSESTTTNNPASESITSSSPASES-TTTSSPASESTTTSSPASESTTTSSPAS 2102
S + + +S+ST+T++ +S S + SS S S S S+ + S S+S + S+ +S
Sbjct: 276 SISVSESQSQSTSTSSSSSTSSSESSSTSYSPGDASIQNSQRSQLQSSTSQSESESASSS 335
Query: 2103 ESTTTSSPE 2111
S + + PE
Sbjct: 336 YSYSVNLPE 344
Score = 31.8 bits (72), Expect = 4.5
Identities = 24/86 (27%), Positives = 37/86 (43%), Gaps = 7/86 (8%)
Query: 1972 SESTTTSS--PESESTTTSSLVSESTTTSSPESESTTTISPVSESTT-----TSSPVSES 2024
T +S VSES + S+ S S++T S S ST+ S S+
Sbjct: 258 QNGTIFYEIVDSGYVILSSISVSESQSQSTSTSSSSSTSSSESSSTSYSPGDASIQNSQR 317
Query: 2025 TTTISPESESTTTSSPASESTTTNNP 2050
+ S S+S + S+ +S S + N P
Sbjct: 318 SQLQSSTSQSESESASSSYSYSVNLP 343
Score = 31.4 bits (71), Expect = 6.7
Identities = 24/86 (27%), Positives = 35/86 (40%), Gaps = 4/86 (4%)
Query: 2058 NNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPE-SESTT 2116
N I + +S SES + S+ S S++TSS S ST+ S + S +
Sbjct: 259 NGTIFYEIV---DSGYVILSSISVSESQSQSTSTSSSSSTSSSESSSTSYSPGDASIQNS 315
Query: 2117 TSSPASESTTIEEQGVSPHSEKLSAN 2142
S ST+ E + S S N
Sbjct: 316 QRSQLQSSTSQSESESASSSYSYSVN 341
>gnl|CDD|220401 pfam09786, CytochromB561_N, Cytochrome B561, N terminal. Members of
this family are found in the N terminal region of
cytochrome B561, as well as in various other putative
uncharacterized proteins.
Length = 559
Score = 42.8 bits (101), Expect = 0.002
Identities = 32/231 (13%), Positives = 67/231 (29%), Gaps = 17/231 (7%)
Query: 1950 PESESTTTSSLVSESTTTSSPESESTT-TSSPESESTTTSSLVSESTTTSSPESESTTTI 2008
+ T S +S S + T S+ + S ++ + ST
Sbjct: 105 AKDSQFTVVSQAKKSPPASKTSTPMNTSEPLVPGHSSFSDSPSRSASPSRKFSPSSTIQQ 164
Query: 2009 SPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSS 2068
SP + + S S + S S +S + S ++P + ++ + T
Sbjct: 165 SPQLTPSNKPASPSSSYQSPSYSSSLGPVNSSGNRSNLRSSPWALRSSG--DKKDITTDE 222
Query: 2069 SP-----ASESTTTSSPASEST--TTSSPASESTTTSSPASESTTTSSPESESTTTSSPA 2121
A S + T S +SSP+ + + ++ ++ +
Sbjct: 223 KYLETFLAEVDEEQHMITSSAGKNATPPETINSFGSSSPSFWNYSRNASDAARSLKKRSY 282
Query: 2122 SESTTIEEQGVSPHSEKLSANEDPEEFPNEDVFEHTFAEIPNIDHSNQTDE 2172
S P +K A+ P++ E + +
Sbjct: 283 QLS-----PSPVPSKQK--ASTSPKKGEGEPPNMSLESASEVFKRVGVLPQ 326
Score = 40.9 bits (96), Expect = 0.008
Identities = 48/228 (21%), Positives = 89/228 (39%), Gaps = 20/228 (8%)
Query: 1920 PESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTS-SPESESTTTS 1978
+ T S S P S+++T + S S+S + S SP + + +S
Sbjct: 105 AKDSQFTVVS----QAKKSPPASKTSTPMNTSEPLVPGHSSFSDSPSRSASPSRKFSPSS 160
Query: 1979 SPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTS 2038
+ + T S ++ +SS +S S ++ S+ S SP + S
Sbjct: 161 TIQQSPQLTPSN-KPASPSSSYQSPSYSSSLGPVNSSGNRSN-----LRSSPWALR---S 211
Query: 2039 SPASESTTTNNPKSESTTTN-NPASESITSSSPASESTTTSSPASESTTTSSPASESTTT 2097
S + TT+ E+ + ITSS+ + T S +SSP+ + +
Sbjct: 212 SGDKKDITTDEKYLETFLAEVDEEQHMITSSAGKN---ATPPETINSFGSSSPSFWNYSR 268
Query: 2098 SSP-ASESTTTSSPESESTTTSSPASESTTIEE-QGVSPHSEKLSANE 2143
++ A+ S S + + S ST+ ++ +G P+ SA+E
Sbjct: 269 NASDAARSLKKRSYQLSPSPVPSKQKASTSPKKGEGEPPNMSLESASE 316
Score = 35.5 bits (82), Expect = 0.35
Identities = 33/198 (16%), Positives = 72/198 (36%), Gaps = 14/198 (7%)
Query: 1872 FTTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTS---SPESESTTTS 1928
++ + V + + S+ +T + ++ S+S + S S + ++T
Sbjct: 103 VKAKDSQFTVVSQAKKSPPASKTSTPMNTSEPLVPGHSSFSDSPSRSASPSRKFSPSSTI 162
Query: 1929 SLVSESTTTSSPESESTTTSSPESESTTTSSLVS--ESTTTSSPESESTTTSSPESESTT 1986
+ T ++ P S S++ SP S+ S S SSP + +S + + TT
Sbjct: 163 QQSPQLTPSNKPASPSSSYQSPSYSSSLGPVNSSGNRSNLRSSPWALR--SSGDKKDITT 220
Query: 1987 TSSL-------VSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSS 2039
V E + + T S +SSP + + + ++ +
Sbjct: 221 DEKYLETFLAEVDEEQHMITSSAGKNATPPETINSFGSSSPSFWNYSRNASDAARSLKKR 280
Query: 2040 PASESTTTNNPKSESTTT 2057
S + K +++T+
Sbjct: 281 SYQLSPSPVPSKQKASTS 298
>gnl|CDD|221548 pfam12361, DBP, Duffy-antigen binding protein. This family of
proteins is found in eukaryotes. Proteins in this family
are typically between 449 and 1061 amino acids in length.
The family is found in association with pfam05424. There
are two conserved sequence motifs: NKNGG and QKHDF. This
family is part of the Duffy-antigen binding protein of
Plasmodium spp. This protein is an antigen on these
parasites which enable them to invade erythrocytes.
Length = 318
Score = 42.0 bits (98), Expect = 0.002
Identities = 42/196 (21%), Positives = 73/196 (37%), Gaps = 7/196 (3%)
Query: 1894 NTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESE 1953
T+NS SEST N + T S+ ++ + LV++ P +++ S S
Sbjct: 95 LGTSNSRPSESTVEANSPGDGTVNSASIPVVSSENPLVTKHKGLE-PSKDNSDNSGSASH 153
Query: 1954 STTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTIS---- 2009
+ + S + S+ + E T E T + S TSS +STT++
Sbjct: 154 ALAGENGESMAGPDSNSKGE-TADPQDNIEVKATKDSSNRSDGTSSATGDSTTSVDRAIN 212
Query: 2010 -PVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSS 2068
V E S + S + T + S + N ++ N P S + +
Sbjct: 213 KGVPEDGDKSVGSKRAENEDSSAEKDGATVAGGSTNDPEQNVSVDTDNGNVPGSGNKQNE 272
Query: 2069 SPASESTTTSSPASES 2084
+ S S ++ES
Sbjct: 273 GATALSGAESLESNES 288
Score = 41.6 bits (97), Expect = 0.004
Identities = 56/276 (20%), Positives = 101/276 (36%), Gaps = 32/276 (11%)
Query: 1892 SENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPE 1951
S + NS +STT E+ T T S V S S +++ P
Sbjct: 20 SAHGNVNSGAGKSTT-----GEAVTGDGQNGNQTPAESNVQRSDIVESLSAKNVDPQKPV 74
Query: 1952 SESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSP------ESEST 2005
SE + +S V++ + + T++S SEST ++ + T S+ E+
Sbjct: 75 SERSADTSSVTD--IAEAGKENLGTSNSRPSESTVEANSPGDGTVNSASIPVVSSENPLV 132
Query: 2006 TT---ISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPAS 2062
T + P +++ S S + + ES + S+ E T E T + ++
Sbjct: 133 TKHKGLEPSKDNSDNSGSASHALAGENGESMAGPDSNSKGE-TADPQDNIEVKATKDSSN 191
Query: 2063 ESITSSSPASESTTT---------------SSPASESTTTSSPASESTTTSSPASESTTT 2107
S +SS +STT+ S + + S A + T + S +
Sbjct: 192 RSDGTSSATGDSTTSVDRAINKGVPEDGDKSVGSKRAENEDSSAEKDGATVAGGSTNDPE 251
Query: 2108 SSPESESTTTSSPASESTTIEEQGVSPHSEKLSANE 2143
+ ++ + P S + E +E L +NE
Sbjct: 252 QNVSVDTDNGNVPGSGNKQNEGATALSGAESLESNE 287
Score = 40.5 bits (94), Expect = 0.009
Identities = 38/211 (18%), Positives = 77/211 (36%), Gaps = 8/211 (3%)
Query: 1938 SSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTT 1997
S+P ++ +S E ++ S S E+ T T S V S
Sbjct: 1 SNPIIQAVDSSKAEKVQGDSAHGNVNSGAGKSTTGEAVTGDGQNGNQTPAESNVQRSDIV 60
Query: 1998 SSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTT 2057
S +++ PVSE + +S V++ + T++S SEST N + T
Sbjct: 61 ESLSAKNVDPQKPVSERSADTSSVTDIAEA--GKENLGTSNSRPSESTVEANSPGDGTVN 118
Query: 2058 NNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTT 2117
+ ++S +P P+ +++ S AS + + S + S+ + E+
Sbjct: 119 SASIP-VVSSENPLVTKHKGLEPSKDNSDNSGSASHALAGENGESMAGPDSNSKGETADP 177
Query: 2118 SSPAS-----ESTTIEEQGVSPHSEKLSANE 2143
+S+ + S + ++ +
Sbjct: 178 QDNIEVKATKDSSNRSDGTSSATGDSTTSVD 208
Score = 39.0 bits (90), Expect = 0.025
Identities = 39/202 (19%), Positives = 71/202 (35%), Gaps = 6/202 (2%)
Query: 1873 TTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVS 1932
S S ST+ + + T NS ++ NP P +++ S S
Sbjct: 93 ENLGTSNSRPSESTVEANSPGDGTVNSASIPVVSSENPLVTKHKGLEPSKDNSDNSGSAS 152
Query: 1933 ESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVS 1992
+ + ES + S+ + E+ + T S S TSS +STT+ V
Sbjct: 153 HALAGENGESMAGPDSNSKGETADPQDNIEVKATKDSSNR-SDGTSSATGDSTTS---VD 208
Query: 1993 ESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKS 2052
+ PE + S + S + T++ S + + + ++ N P S
Sbjct: 209 RAINKGVPEDGDKSVGS--KRAENEDSSAEKDGATVAGGSTNDPEQNVSVDTDNGNVPGS 266
Query: 2053 ESTTTNNPASESITSSSPASES 2074
+ + S S ++ES
Sbjct: 267 GNKQNEGATALSGAESLESNES 288
Score = 37.4 bits (86), Expect = 0.076
Identities = 47/239 (19%), Positives = 86/239 (35%), Gaps = 9/239 (3%)
Query: 1908 NNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTT 1967
+NP ++ +S E ++ S S E+ T T S V S
Sbjct: 1 SNPIIQAVDSSKAEKVQGDSAHGNVNSGAGKSTTGEAVTGDGQNGNQTPAESNVQRSDIV 60
Query: 1968 SSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTT 2027
S +++ P SE + +S V++ + + T+ S SEST ++ + T
Sbjct: 61 ESLSAKNVDPQKPVSERSADTSSVTD--IAEAGKENLGTSNSRPSESTVEANSPGDGTVN 118
Query: 2028 ISPESESTTTSSPASESTTTN--NPKSESTTTNNPASESITSSSPASESTTTSSPASEST 2085
+ SS T P +++ + AS ++ + S + S+ E+
Sbjct: 119 SAS---IPVVSSENPLVTKHKGLEPSKDNSDNSGSASHALAGENGESMAGPDSNSKGETA 175
Query: 2086 TTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSANED 2144
T S S + +S + +TTS + + + E G K + NED
Sbjct: 176 DPQDNIEVKATKDS--SNRSDGTSSATGDSTTSVDRAINKGVPEDGDKSVGSKRAENED 232
Score = 37.4 bits (86), Expect = 0.090
Identities = 39/182 (21%), Positives = 70/182 (38%), Gaps = 13/182 (7%)
Query: 1877 NSESTVVMSTLNSLLS-----ENTTTNSPESEST----TTNNPESESTTTSSPESESTTT 1927
NS S V+S+ N L++ E + NS S S N ES + S+ + E+
Sbjct: 118 NSASIPVVSSENPLVTKHKGLEPSKDNSDNSGSASHALAGENGESMAGPDSNSKGETADP 177
Query: 1928 SSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTT 1987
+ T S S TSS +STT+ V + PE + S +E+ +
Sbjct: 178 QDNIEVKATKDSSNR-SDGTSSATGDSTTS---VDRAINKGVPEDGDKSVGSKRAENEDS 233
Query: 1988 SSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTT 2047
S+ +T ++ +S +++ ++ + S + + S S T
Sbjct: 234 SAEKDGATVAGGSTNDPEQNVSVDTDNGNVPGSGNKQNEGATALSGAESLESNESVHKTI 293
Query: 2048 NN 2049
+N
Sbjct: 294 DN 295
Score = 35.9 bits (82), Expect = 0.23
Identities = 38/232 (16%), Positives = 78/232 (33%), Gaps = 4/232 (1%)
Query: 1897 TNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTT 1956
++ ES S +P+ + S+ S T + E+ TS+ +T + T
Sbjct: 57 SDIVESLSAKNVDPQKPVSERSADTSSVTDIAEAGKENLGTSNSRPSESTVEANSPGDGT 116
Query: 1957 TSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTT 2016
+S ++ +P P +++ S S + + ES + + E+
Sbjct: 117 VNSASIPVVSSENPLVTKHKGLEPSKDNSDNSGSASHALAGENGESMAGPDSNSKGETAD 176
Query: 2017 TSSPVSESTT----TISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPAS 2072
+ T S + S T S S N E + + + S A
Sbjct: 177 PQDNIEVKATKDSSNRSDGTSSATGDSTTSVDRAINKGVPEDGDKSVGSKRAENEDSSAE 236
Query: 2073 ESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASES 2124
+ T + S + + + ++ + P S + + S S ++ES
Sbjct: 237 KDGATVAGGSTNDPEQNVSVDTDNGNVPGSGNKQNEGATALSGAESLESNES 288
>gnl|CDD|235906 PRK07003, PRK07003, DNA polymerase III subunits gamma and tau;
Validated.
Length = 830
Score = 42.5 bits (100), Expect = 0.003
Identities = 19/123 (15%), Positives = 41/123 (33%), Gaps = 2/123 (1%)
Query: 2014 STTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASE 2073
+T +P + + + A N S + + ++ S AS
Sbjct: 420 ATRAEAPPAAPAPPATADRGDDAADGDAPVPAKANARASADSRCDERDAQPPADSGSASA 479
Query: 2074 STTTSSPAS--ESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQG 2131
+ + P + E ++ S +T + P + + +S E + PA E+
Sbjct: 480 PASDAPPDAAFEPAPRAAAPSAATPAAVPDARAPAAASREDAPAAAAPPAPEARPPTPAA 539
Query: 2132 VSP 2134
+P
Sbjct: 540 AAP 542
Score = 36.8 bits (85), Expect = 0.20
Identities = 21/159 (13%), Positives = 57/159 (35%), Gaps = 5/159 (3%)
Query: 1995 TTTSSPESESTTTISPVSESTTTS-SPVSESTTTISPESESTTTSSPASESTTTNNPKSE 2053
++ + + ++ V+ + + +P + + + +P + + ++
Sbjct: 386 RAAAAVGASAVPAVTAVTGAAGAALAPKAAAAAAATRAEAPPAAPAPPATADRGDDAADG 445
Query: 2054 STTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESE 2113
PA + +S+ + + P ++S + S+PA S A E ++ S
Sbjct: 446 DAPV--PAKANARASADSRCDERDAQPPADSGSASAPA--SDAPPDAAFEPAPRAAAPSA 501
Query: 2114 STTTSSPASESTTIEEQGVSPHSEKLSANEDPEEFPNED 2152
+T + P + + + +P + A E P
Sbjct: 502 ATPAAVPDARAPAAASREDAPAAAAPPAPEARPPTPAAA 540
Score = 36.0 bits (83), Expect = 0.31
Identities = 18/186 (9%), Positives = 58/186 (31%), Gaps = 19/186 (10%)
Query: 1940 PESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSS 1999
P + ++ + + + V+ ++ ++ ++ + + ++
Sbjct: 381 PAPGARAAAAVGASAVPAVTAVT-GAAGAALAPKAAAAAAATRAEAPP---AAPAPPATA 436
Query: 2000 PESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNN 2059
+ ++ S + +++ S AS + P +
Sbjct: 437 DRGDDAAD-GDAPVPAKANARASADSRCDERDAQPPADSGSASAPASDAPP--------D 487
Query: 2060 PASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSS 2119
A E ++ S +T + P + + +S + PA E+ + ++
Sbjct: 488 AAFEPAPRAAAPSAATPAAVPDARAPAAASREDAPAAAAPPAPEAR------PPTPAAAA 541
Query: 2120 PASEST 2125
PA+ +
Sbjct: 542 PAARAG 547
Score = 33.7 bits (77), Expect = 1.4
Identities = 15/168 (8%), Positives = 58/168 (34%), Gaps = 3/168 (1%)
Query: 1920 PESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSS 1979
P + +++ + + + + ++ ++ ++ ++P +T
Sbjct: 381 PAPGARAAAAVGASAVPAVTAVT-GAAGAALAPKAAAAAAATRAEAPPAAPAPPATADRG 439
Query: 1980 PESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSS 2039
++ + S+ P ++S + S+P S++ + E +
Sbjct: 440 DDAADGDAPVPAKANARASADSRCDERDAQPPADSGSASAPASDAPPDAAFEPAPRAAAP 499
Query: 2040 PASESTTTNNPKSESTTTNN--PASESITSSSPASESTTTSSPASEST 2085
A+ + ++ + + PA+ + + + ++PA+ +
Sbjct: 500 SAATPAAVPDARAPAAASREDAPAAAAPPAPEARPPTPAAAAPAARAG 547
>gnl|CDD|237284 PRK13108, PRK13108, prolipoprotein diacylglyceryl transferase;
Reviewed.
Length = 460
Score = 41.9 bits (98), Expect = 0.004
Identities = 27/146 (18%), Positives = 44/146 (30%), Gaps = 4/146 (2%)
Query: 2031 ESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSP 2090
E E ++ A S + N P + + +E T + S
Sbjct: 297 EREPAELAAAAVASAASAVGPVGPGEPNQPDDVAEAVKAEVAEVTDEVAAESVVQVADRD 356
Query: 2091 ASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSANEDPEEFPN 2150
EST SE+ + +PA+ E +P A+E +E
Sbjct: 357 G-ESTPAVEETSEADIER-EQPGDLAGQAPAAHQVDAEAASAAPEEPAALASEAHDE--T 412
Query: 2151 EDVFEHTFAEIPNIDHSNQTDEAIPE 2176
E A IP+ ++ A P
Sbjct: 413 EPEVPEKAAPIPDPAKPDELAVAGPG 438
Score = 41.9 bits (98), Expect = 0.004
Identities = 23/151 (15%), Positives = 47/151 (31%), Gaps = 4/151 (2%)
Query: 1981 ESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSP 2040
E E ++ S ++ P + + V+E T ++ ES
Sbjct: 297 EREPAELAAAAVASAASAVGPVGPGEPNQPDDVAEAVKAEVAEVTDEVAAESVVQVADRD 356
Query: 2041 ASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSP 2100
EST SE+ + +PA+ + ++ ++ ASE+ + P
Sbjct: 357 G-ESTPAVEETSEADIER-EQPGDLAGQAPAAHQVDAEAASAAPEEPAALASEAHDETEP 414
Query: 2101 ASESTTTSSPESESTTTSSPASESTTIEEQG 2131
E ++P + A +
Sbjct: 415 --EVPEKAAPIPDPAKPDELAVAGPGDDPAE 443
Score = 40.0 bits (93), Expect = 0.016
Identities = 23/151 (15%), Positives = 46/151 (30%), Gaps = 5/151 (3%)
Query: 1921 ESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSP 1980
E E ++ S ++ + P+ + + V+E T + ES +
Sbjct: 297 EREPAELAAAAVASAASAVGPVGPGEPNQPDDVAEAVKAEVAEVTDEVAAESVVQV-ADR 355
Query: 1981 ESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSP 2040
+ EST SE+ + +P + + + + SE+ + P
Sbjct: 356 DGESTPAVEETSEADIER-EQPGDLAGQAPAAHQVDAEAASAAPEEPAALASEAHDETEP 414
Query: 2041 ASESTTTNNPKSESTTTNNPASESITSSSPA 2071
E P + + A PA
Sbjct: 415 --EVPEKAAPIPDPAKPDELAVAG-PGDDPA 442
Score = 39.2 bits (91), Expect = 0.030
Identities = 21/153 (13%), Positives = 48/153 (31%), Gaps = 4/153 (2%)
Query: 1971 ESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISP 2030
E E ++ S ++ + P+ + + V+E T + S
Sbjct: 297 EREPAELAAAAVASAASAVGPVGPGEPNQPDDVAEAVKAEVAEVTDEVAAESVVQVA-DR 355
Query: 2031 ESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSP 2090
+ EST SE+ + PA+ + + + ++ ++ ASE+ + P
Sbjct: 356 DGESTPAVEETSEADIE-REQPGDLAGQAPAAHQVDAEAASAAPEEPAALASEAHDETEP 414
Query: 2091 ASESTTTSSPASESTTTSSPESESTTTSSPASE 2123
E ++P + +
Sbjct: 415 --EVPEKAAPIPDPAKPDELAVAGPGDDPAEPD 445
Score = 38.8 bits (90), Expect = 0.036
Identities = 20/164 (12%), Positives = 46/164 (28%), Gaps = 16/164 (9%)
Query: 1951 ESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISP 2010
E E ++ S ++ + P+ + + V+E T + ES
Sbjct: 297 EREPAELAAAAVASAASAVGPVGPGEPNQPDDVAEAVKAEVAEVTDEVAAESVVQVA--- 353
Query: 2011 VSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSP 2070
++ V E++ + + A + + ++ + S P
Sbjct: 354 -DRDGESTPAVEETSEADIEREQPGDLAGQAPAA-----HQVDAE------AASAAPEEP 401
Query: 2071 ASE-STTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESE 2113
A+ S E ++P + A E +
Sbjct: 402 AALASEAHDETEPEVPEKAAPIPDPAKPDELAVAGPGDDPAEPD 445
Score = 37.3 bits (86), Expect = 0.10
Identities = 21/151 (13%), Positives = 43/151 (28%), Gaps = 4/151 (2%)
Query: 2001 ESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNP 2060
E E + S ++ P+ + + +E T +S +
Sbjct: 297 EREPAELAAAAVASAASAVGPVGPGEPNQPDDVAEAVKAEVAEVTDEVAAESVVQVADRD 356
Query: 2061 ASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSP 2120
ES + SE+ +PA+ + ++ ++ SE+ + P
Sbjct: 357 G-ESTPAVEETSEADIER-EQPGDLAGQAPAAHQVDAEAASAAPEEPAALASEAHDETEP 414
Query: 2121 ASESTTIEEQGVSPHSEKLSANEDPEEFPNE 2151
+ E A P + P E
Sbjct: 415 EVPEKAAPIPDPAKPDE--LAVAGPGDDPAE 443
Score = 35.3 bits (81), Expect = 0.47
Identities = 27/170 (15%), Positives = 50/170 (29%), Gaps = 11/170 (6%)
Query: 2013 ESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPAS 2072
E ++ S + + P + +E T S +
Sbjct: 299 EPAELAAAAVASAASAVGPVGPGEPNQPDDVAEAVKAEVAEVTDEVAAESVVQVADRDG- 357
Query: 2073 ESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGV 2132
EST SE+ +PA+ E+ S PA+ ++ ++
Sbjct: 358 ESTPAVEETSEADIE-REQPGDLAGQAPAAHQ---VDAEAASAAPEEPAALASEAHDETE 413
Query: 2133 SPHSEKLSANEDPEEFPNEDVFEHTFAEIPNIDHSNQTDEAIPETFDARE 2182
EK + DP + P+E + D + D+ F +R
Sbjct: 414 PEVPEKAAPIPDPAK-PDELAVAGPGDDPAEPDGIRRQDD-----FSSRR 457
Score = 34.6 bits (79), Expect = 0.81
Identities = 16/138 (11%), Positives = 36/138 (26%), Gaps = 14/138 (10%)
Query: 1896 TTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESEST 1955
N P+ + +E T + ES +T + E+ +
Sbjct: 322 EPNQPDDVAEAVKAEVAEVTDEVAAESVVQVADRD--GESTPAVEETSEADIEREQPGDL 379
Query: 1956 TTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSEST 2015
++ + ++ S+ E +S + T PE +P+ +
Sbjct: 380 -------AGQAPAAHQVDAEAASAAPEEPAALASEAHDETEPEVPEKA-----APIPDPA 427
Query: 2016 TTSSPVSESTTTISPESE 2033
E +
Sbjct: 428 KPDELAVAGPGDDPAEPD 445
>gnl|CDD|221866 pfam12935, Sec16_N, Vesicle coat trafficking protein Sec16
N-terminus. Sec16 is a multi-domain vesicle coat
protein. The overall function of Sec16 is in mediating
the movement of protein-cargo between the organelles of
the secretory pathway. Over-expression of truncated
mutants of only the N-terminus are lethal, and this
portion does not appear to be essential for function so
may act as a stabilising region.
Length = 246
Score = 41.0 bits (96), Expect = 0.004
Identities = 46/230 (20%), Positives = 65/230 (28%), Gaps = 31/230 (13%)
Query: 1921 ESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSP 1980
ST T + S E + + E S S + S
Sbjct: 20 NQLSTQTKPIYLPPENESRFEEGAPLLDNGEQNEPVEESAPQTVAIDSVFVEDEDDEGSD 79
Query: 1981 ESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSP 2040
S V E +ST S V +S + S S E T +
Sbjct: 80 FFNSLHEGEAVEEQQPPPHLTRKST---SQVLDSLGLNPDSLSS--PASAEPLDPTAQNE 134
Query: 2041 ASES---TTTNNPKSESTTTNNPASE-----SITSSSPASESTTTSSPASE--------- 2083
S +T NP ES + + P+SE S SEST T +E
Sbjct: 135 FSNVLAASTDGNP--ESESQSEPSSEEELAARAELSDDESESTPTEDDLAERWQAFLDND 192
Query: 2084 -----STTTSSPASESTTTSSPASESTTTSSP--ESESTTTSSPASESTT 2126
T+ + T + + SP E + P +E TT
Sbjct: 193 DDLLLDDETALAEGPNGDTPENSQNTLNDDSPFGTPEFPSPVRPKAEPTT 242
>gnl|CDD|221242 pfam11816, DUF3337, Domain of unknown function (DUF3337). This
family of proteins are functionally uncharacterized. This
family is only found in eukaryotes. This presumed domain
is typically between 285 to 342 amino acids in length.
Length = 320
Score = 41.4 bits (97), Expect = 0.004
Identities = 27/171 (15%), Positives = 51/171 (29%), Gaps = 11/171 (6%)
Query: 1876 NNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSEST 1935
NN S + + S+ + + E+ + E E S S + T
Sbjct: 5 NNKRSILSKDSSGSV-TLWDIPSGKVVETPGEVSEEEEIKELESVYSPNWFTVDSKEGKL 63
Query: 1936 TTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESES-------TTTS 1988
++S + +SS + S+ E S ++ +S E + +
Sbjct: 64 SSSLFGKKFRMSSSLLKKCGAAST---EGKPQKSEKAIDLKSSKAEKDPEINLGGLLLRA 120
Query: 1989 SLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSS 2039
L +P +T P ++ T + TT I E
Sbjct: 121 LLEYWKELKCNPRVLVFSTFLPSLDNETPYLKLPPDTTIIISEESPDLGGG 171
Score = 39.1 bits (91), Expect = 0.024
Identities = 31/195 (15%), Positives = 52/195 (26%), Gaps = 8/195 (4%)
Query: 1936 TTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSEST 1995
S S+ ++ S + S V E+ S E E S S + T
Sbjct: 6 NKRSILSKDSSGSV--TLWDIPSGKVVETPGEVSEEEEIKELESVYSPNWFTVDSKEGKL 63
Query: 1996 TTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSEST 2055
++S + + S + + S+ E SS A + N
Sbjct: 64 SSSLFGKKFRMSSSLLKKCGAASTEGKPQK----SEKAIDLKSSKAEKDPEINLGGLLLR 119
Query: 2056 TTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESEST 2115
E + ST S +E+ P TT +
Sbjct: 120 ALLEYWKELKCNPRVLVFSTFLPSLDNETPYLKLPPD--TTIIISEESPDLGGGRDLYRG 177
Query: 2116 TTSSPASESTTIEEQ 2130
S + + +EE
Sbjct: 178 LVGSTSGDEELLEEN 192
Score = 32.1 bits (73), Expect = 3.7
Identities = 25/155 (16%), Positives = 49/155 (31%), Gaps = 21/155 (13%)
Query: 1996 TTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSEST 2055
S S+ ++ V+ S V E+ +S E E S S + T + K
Sbjct: 6 NKRSILSKDSSG--SVTLWDIPSGKVVETPGEVSEEEEIKELESVYSPNWFTVDSK---- 59
Query: 2056 TTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESEST 2115
++SS + +SS + S+ + + +S S E +
Sbjct: 60 ------EGKLSSSLFGKKFRMSSSLLKKCGAASTEGKPQKSEKAIDLKS---SKAEKD-- 108
Query: 2116 TTSSPASESTTIEEQGVSPHSEKLSANEDPEEFPN 2150
P + + + + ++L N F
Sbjct: 109 ----PEINLGGLLLRALLEYWKELKCNPRVLVFST 139
>gnl|CDD|225805 COG3266, DamX, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 292
Score = 41.5 bits (97), Expect = 0.004
Identities = 33/174 (18%), Positives = 64/174 (36%), Gaps = 12/174 (6%)
Query: 1958 SSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTT 2017
S+L + ST++S S S + +T ++ TS+ + ++ P+S + T
Sbjct: 29 SALKAPSTSSSEA-PASAEKSIDLNGATQANAQQPAPGPTSAENTSQDLSLPPISSTPTQ 87
Query: 2018 --SSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESIT--SSSPASE 2073
+ + + + + + NN ST PA+ + +S P +E
Sbjct: 88 GQEPLAQDGQQRVEVQGDLNNAAVQPQNLSQLNNVAVTSTLPTEPATVAPVRNASVPTAE 147
Query: 2074 STTTSSPASESTTTS---SPASESTTTSSPASESTTTSSPESESTTTSSPASES 2124
+ P + P + T T++ A T+SP T PA +
Sbjct: 148 RPAITRPVRAQAVSEPAVEPKAAKTATATEAK--VQTASPAQTPAT--PPAGKG 197
Score = 36.5 bits (84), Expect = 0.13
Identities = 34/185 (18%), Positives = 62/185 (33%), Gaps = 12/185 (6%)
Query: 1917 TSSPESESTTTSSL-VSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESEST 1975
TSS E+ ++ S+ ++ +T ++ + TS+ + + +S + T
Sbjct: 36 TSSSEAPASAEKSIDLNGATQANAQQPAPGPTSAENTSQDLSLPPISSTPTQGQEPLAQD 95
Query: 1976 TTSSPESESTTTSSLVSESTTTS----SPESESTTTISPVSESTTTSSPVSESTTTISPE 2031
E + ++ V + + S T + V+ S P +E P
Sbjct: 96 GQQRVEVQGDLNNAAVQPQNLSQLNNVAVTSTLPTEPATVAPVRNASVPTAERPAITRPV 155
Query: 2032 SESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPA 2091
+ S PA E PK+ T T A S + + A+ S S
Sbjct: 156 -RAQAVSEPAVE------PKAAKTATATEAKVQTASPAQTPATPPAGKGAAASGQLKSAP 208
Query: 2092 SESTT 2096
S T
Sbjct: 209 SSHYT 213
Score = 33.8 bits (77), Expect = 1.1
Identities = 27/178 (15%), Positives = 59/178 (33%), Gaps = 18/178 (10%)
Query: 1947 TSSPESESTTTSSL-VSESTTTSSPESESTTTSSPESESTTTSSLVSES-TTTSSPES-- 2002
TSS E+ ++ S+ ++ +T ++ + TS+ + + +S + T P +
Sbjct: 36 TSSSEAPASAEKSIDLNGATQANAQQPAPGPTSAENTSQDLSLPPISSTPTQGQEPLAQD 95
Query: 2003 -----ESTTTISPVSESTTTSSPV------SESTTTISPESESTTTSSPASESTTTNNPK 2051
E ++ + S + S T + + S P +E P
Sbjct: 96 GQQRVEVQGDLNNAAVQPQNLSQLNNVAVTSTLPTEPATVAPVRNASVPTAERPAITRPV 155
Query: 2052 SESTTTN---NPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTT 2106
+ P + +++ A T + + + A+ S S S T
Sbjct: 156 RAQAVSEPAVEPKAAKTATATEAKVQTASPAQTPATPPAGKGAAASGQLKSAPSSHYT 213
>gnl|CDD|222274 pfam13634, Nucleoporin_FG, Nucleoporin FG repeat region. This family
includes a number of FG repeats that are found in
nucleoporin proteins. This family includes the yeast
nucleoporins Nup116, Nup100, Nup49, Nup57 and Nup 145.
Length = 106
Score = 38.6 bits (90), Expect = 0.004
Identities = 13/100 (13%), Positives = 38/100 (38%), Gaps = 4/100 (4%)
Query: 1907 TNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTT--TSSPESESTTTSSLVSES 1964
+++ + +++ S T L ++ ++P S +SS ++ + L +
Sbjct: 1 SSSTTAGASSGGLFGSAPATGGGLFGQNAANTTPTSGGGLFGSSSSQATQPSGGGLFGSA 60
Query: 1965 TTTSSPESESTT--TSSPESESTTTSSLVSESTTTSSPES 2002
TS+ + +++ + + T L +T +
Sbjct: 61 AQTSATTTGGGLFGSTTATTTTATGGGLFGNATAAQPATT 100
Score = 33.2 bits (76), Expect = 0.39
Identities = 18/108 (16%), Positives = 38/108 (35%), Gaps = 10/108 (9%)
Query: 1937 TSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTT--TSSPESESTTTSSLVSES 1994
+SS + +++ S T L ++ ++P S +SS ++ + L +
Sbjct: 1 SSSTTAGASSGGLFGSAPATGGGLFGQNAANTTPTSGGGLFGSSSSQATQPSGGGLFGSA 60
Query: 1995 TTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPAS 2042
TS+ TT +T T++ T +T +
Sbjct: 61 AQTSAT----TTGGGLFGSTTATTTT----ATGGGLFGNATAAQPATT 100
Score = 32.5 bits (74), Expect = 0.74
Identities = 17/103 (16%), Positives = 37/103 (35%), Gaps = 1/103 (0%)
Query: 1977 TSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTT 2036
+SS + +++ S T ++ +P S S S++T +
Sbjct: 1 SSSTTAGASSGGLFGSAPATGGGLFGQNAANTTPTSGGGLFGSSSSQATQPSGGGLFGSA 60
Query: 2037 TSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSS 2079
+ A+ +T S + TT + ++ A++ TT
Sbjct: 61 AQTSAT-TTGGGLFGSTTATTTTATGGGLFGNATAAQPATTGG 102
Score = 30.9 bits (70), Expect = 2.4
Identities = 15/105 (14%), Positives = 38/105 (36%), Gaps = 5/105 (4%)
Query: 1918 SSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTT- 1976
SS + ++ L + T ++ + T+ L S++ ++ S
Sbjct: 1 SSSTTAGASSGGLFGSAPATGGG---LFGQNAANTTPTSGGGLFGSSSSQATQPSGGGLF 57
Query: 1977 -TSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSP 2020
+++ S +TT L +T T++ + + + T+
Sbjct: 58 GSAAQTSATTTGGGLFGSTTATTTTATGGGLFGNATAAQPATTGG 102
Score = 30.5 bits (69), Expect = 2.8
Identities = 17/103 (16%), Positives = 40/103 (38%), Gaps = 6/103 (5%)
Query: 1894 NTTTNSPESESTTTNNPESESTT--TSSPESESTTTSSLVSESTTTSSPESESTT--TSS 1949
++TT S + P + ++ + T+ L S++ ++ S +++
Sbjct: 2 SSTTAGASSGGLFGSAPATGGGLFGQNAANTTPTSGGGLFGSSSSQATQPSGGGLFGSAA 61
Query: 1950 PESESTTTSSLVSESTTTSSPESESTT--TSSPESESTTTSSL 1990
S +TT L +T T++ + ++ +TT L
Sbjct: 62 QTSATTTGGGLFGSTTATTTTATGGGLFGNATAAQPATTGGGL 104
Score = 30.1 bits (68), Expect = 4.7
Identities = 13/74 (17%), Positives = 26/74 (35%), Gaps = 3/74 (4%)
Query: 2054 STTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESE 2113
S+TT +S + S+PA T + T+ + SS + + +
Sbjct: 2 SSTTAGASSGGLFGSAPA---TGGGLFGQNAANTTPTSGGGLFGSSSSQATQPSGGGLFG 58
Query: 2114 STTTSSPASESTTI 2127
S +S + +
Sbjct: 59 SAAQTSATTTGGGL 72
Score = 29.8 bits (67), Expect = 5.7
Identities = 13/109 (11%), Positives = 34/109 (31%), Gaps = 7/109 (6%)
Query: 1948 SSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTT 2007
SS + ++ L + T ++ + T+ L S++ ++ +
Sbjct: 1 SSSTTAGASSGGLFGSAPATGGG---LFGQNAANTTPTSGGGLFGSSSSQATQ----PSG 53
Query: 2008 ISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTT 2056
+ TS+ + S + +TT + T + +
Sbjct: 54 GGLFGSAAQTSATTTGGGLFGSTTATTTTATGGGLFGNATAAQPATTGG 102
>gnl|CDD|213844 TIGR03657, IsdB, heme uptake protein IsdB. Isd proteins are
iron-regulated surface proteins found in Bacillus,
Staphylococcus and Listeria species and are responsible
for heme scavenging from hemoproteins. The IsdB protein
is only observed in Staphylococcus and consists of an
N-terminal hydrophobic signal sequence, a pair of tandem
NEAT (NEAr Transporter, pfam05031) domains which confers
the ability to bind heme and a C-terminal sortase
processing signal which targets the protein to the cell
wall. IsdB is believed to make a direct contact with
methemoglobin facilitating transfer of heme to IsdB. The
heme is then transferred to other cell wall-bound NEAT
domain proteins such as IsdA and IsdC.
Length = 644
Score = 41.8 bits (97), Expect = 0.005
Identities = 36/174 (20%), Positives = 64/174 (36%), Gaps = 19/174 (10%)
Query: 1901 ESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSL 1960
+ E+ T N + + S T+ TT E ES S + ++ + S+
Sbjct: 449 DKEAFTKANADKTNKKEQQDNSAKKETTPATPSKPTTPPVEKESQKQDSQKDDNKQSPSV 508
Query: 1961 VSESTTTSSPESESTTTSSP-----ESESTTTSSLVSESTTTSSPESESTTTISPVSEST 2015
E+ +S + T + P ES STT + +VS + + P T
Sbjct: 509 EKENDASSESGKDKTPATKPAKGEVESSSTTPTKVVSTTQNVAKP--------------T 554
Query: 2016 TTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSS 2069
T SS ++ S S S+P ++ N + + NN ++ + S
Sbjct: 555 TASSETTKDVVQTSAGSSEAKDSAPLQKANIKNTNDGHTQSQNNKNTQENKAKS 608
Score = 40.3 bits (93), Expect = 0.016
Identities = 39/166 (23%), Positives = 70/166 (42%), Gaps = 13/166 (7%)
Query: 1894 NTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESE 1953
N + + + + E+ T S P TT + ES S + ++ + S E E
Sbjct: 457 NADKTNKKEQQDNSAKKETTPATPSKP-----TTPPVEKESQKQDSQKDDNKQSPSVEKE 511
Query: 1954 STTTSSLVSESTTTSSP---ESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISP 2010
+ +S + T + P E ES++T+ + STT + V++ TT SS ++ S
Sbjct: 512 NDASSESGKDKTPATKPAKGEVESSSTTPTKVVSTTQN--VAKPTTASSETTKDVVQTSA 569
Query: 2011 VSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTT 2056
S S+P+ ++ + T S +++T N KS T
Sbjct: 570 GSSEAKDSAPLQKANIK---NTNDGHTQSQNNKNTQENKAKSLPQT 612
Score = 38.0 bits (87), Expect = 0.064
Identities = 26/126 (20%), Positives = 54/126 (42%), Gaps = 2/126 (1%)
Query: 1987 TSSLVSESTTTSSPESESTTTISPVSESTTTSSPVS-ESTTTISPESESTTTSSPASEST 2045
T + ++ ++ + +P + S T+ PV ES S + ++ + S E+
Sbjct: 454 TKANADKTNKKEQQDNSAKKETTPATPSKPTTPPVEKESQKQDSQKDDNKQSPSVEKEND 513
Query: 2046 TTNNPKSESTTTNNPASESITSSSPA-SESTTTSSPASESTTTSSPASESTTTSSPASES 2104
++ + T PA + SSS ++ +T+ ++ TT SS ++ +S S
Sbjct: 514 ASSESGKDKTPATKPAKGEVESSSTTPTKVVSTTQNVAKPTTASSETTKDVVQTSAGSSE 573
Query: 2105 TTTSSP 2110
S+P
Sbjct: 574 AKDSAP 579
Score = 36.5 bits (83), Expect = 0.21
Identities = 22/102 (21%), Positives = 48/102 (47%), Gaps = 2/102 (1%)
Query: 1880 STVVMSTLNSLLSENTTTNSPESESTTTN-NPESESTTTSSPESESTTTSSLVSESTTTS 1938
++V +STL L+S + E+ T T P++E+ + + +E + V+ + + S
Sbjct: 22 ASVAISTLLLLMSNGEAQAAEETGGTNTEAQPKTEAVASPTTTTEKAPEAKPVANAVSVS 81
Query: 1939 SPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSP 1980
+ E E+ T+ + E++ E T P +++ + P
Sbjct: 82 NKEVEAPTSETKEAKEVKEVKAPKE-TKEVKPAAKADNNTYP 122
Score = 36.5 bits (83), Expect = 0.21
Identities = 30/146 (20%), Positives = 65/146 (44%), Gaps = 9/146 (6%)
Query: 1968 SSPESESTTTSSP-ESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTT 2026
++P + S T+ P E ES S ++ + S E E+ + SES +P ++
Sbjct: 475 TTPATPSKPTTPPVEKESQKQDSQKDDNKQSPSVEKENDAS----SESGKDKTPATKPAK 530
Query: 2027 TISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTT 2086
E ES++T+ P +TT N +T ++ + + +S+ +SE+ ++ +
Sbjct: 531 G---EVESSSTT-PTKVVSTTQNVAKPTTASSETTKDVVQTSAGSSEAKDSAPLQKANIK 586
Query: 2087 TSSPASESTTTSSPASESTTTSSPES 2112
++ + + E+ S P++
Sbjct: 587 NTNDGHTQSQNNKNTQENKAKSLPQT 612
Score = 36.1 bits (82), Expect = 0.29
Identities = 33/128 (25%), Positives = 56/128 (43%), Gaps = 8/128 (6%)
Query: 1872 FTTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLV 1931
FT N ++ NS E TT +P TT E ES S + ++ + S+
Sbjct: 453 FTKANADKTNKKEQQDNSAKKE-TTPATP--SKPTTPPVEKESQKQDSQKDDNKQSPSVE 509
Query: 1932 SESTTTSSPESESTTTSSP-----ESESTTTSSLVSESTTTSSPESESTTTSSPESESTT 1986
E+ +S + T + P ES STT + +VS + + P + S+ T+ +++
Sbjct: 510 KENDASSESGKDKTPATKPAKGEVESSSTTPTKVVSTTQNVAKPTTASSETTKDVVQTSA 569
Query: 1987 TSSLVSES 1994
SS +S
Sbjct: 570 GSSEAKDS 577
Score = 35.3 bits (80), Expect = 0.48
Identities = 37/170 (21%), Positives = 67/170 (39%), Gaps = 8/170 (4%)
Query: 1963 ESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVS 2022
E+ T ++ + + S T+ TT E ES S ++ + S
Sbjct: 451 EAFTKANADKTNKKEQQDNSAKKETTPATPSKPTTPPVEKESQKQDSQKDDNKQSPSVEK 510
Query: 2023 ESTTTISPESESTTTSSPAS---ESTTTNNPKSESTTTNNPASESITSSSPASESTTTSS 2079
E+ + + T + PA ES++T K STT N ++ T+SS ++ +S
Sbjct: 511 ENDASSESGKDKTPATKPAKGEVESSSTTPTKVVSTTQN--VAKPTTASSETTKDVVQTS 568
Query: 2080 PASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEE 2129
S S+P ++ + + T S +++T + S T EE
Sbjct: 569 AGSSEAKDSAPLQKANIKN---TNDGHTQSQNNKNTQENKAKSLPQTGEE 615
Score = 35.3 bits (80), Expect = 0.50
Identities = 35/179 (19%), Positives = 68/179 (37%), Gaps = 9/179 (5%)
Query: 1942 SESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPE 2001
+++ + + E S+ + TT ++P TT E ES S ++ + S E
Sbjct: 454 TKANADKTNKKEQQDNSA--KKETTPATP--SKPTTPPVEKESQKQDSQKDDNKQSPSVE 509
Query: 2002 SESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPA 2061
E+ + SES +P ++ ES STT + S + P + S+ T
Sbjct: 510 KENDAS----SESGKDKTPATKPAKG-EVESSSTTPTKVVSTTQNVAKPTTASSETTKDV 564
Query: 2062 SESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSP 2120
++ SS A +S ++T S++ + + + E + + P
Sbjct: 565 VQTSAGSSEAKDSAPLQKANIKNTNDGHTQSQNNKNTQENKAKSLPQTGEESNKDMTLP 623
Score = 34.1 bits (77), Expect = 1.0
Identities = 33/181 (18%), Positives = 74/181 (40%), Gaps = 21/181 (11%)
Query: 2011 VSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNP-KSESTTTNNPASESITSSS 2069
V + T + ++ ++ + ++PA+ S T P + ES ++ ++ S S
Sbjct: 448 VDKEAFTKANADKTNKKEQQDNSAKKETTPATPSKPTTPPVEKESQKQDSQKDDNKQSPS 507
Query: 2070 PASESTTTSSPASESTTTSSPA-----SESTTTSSPASESTTTSSPESESTTT------- 2117
E+ +S + T + PA S STT + S + + P + S+ T
Sbjct: 508 VEKENDASSESGKDKTPATKPAKGEVESSSTTPTKVVSTTQNVAKPTTASSETTKDVVQT 567
Query: 2118 ---SSPASESTTIEEQGVSPHSEKLSANEDPEEFPNEDVFEHTFAEIPNIDHSNQTDEAI 2174
SS A +S +++ + ++ + +++ N++ E+ +P + D +
Sbjct: 568 SAGSSEAKDSAPLQKANIKNTNDGHTQSQN-----NKNTQENKAKSLPQTGEESNKDMTL 622
Query: 2175 P 2175
P
Sbjct: 623 P 623
Score = 33.4 bits (75), Expect = 1.8
Identities = 25/120 (20%), Positives = 53/120 (44%), Gaps = 8/120 (6%)
Query: 1888 NSLLSENTTTNSPESESTTTNNPESESTTTSSP-----ESESTTTSSLVSESTTTSSPES 1942
+S +N + S E E+ ++ + T + P ES STT + +VS + + P +
Sbjct: 496 DSQKDDNKQSPSVEKENDASSESGKDKTPATKPAKGEVESSSTTPTKVVSTTQNVAKPTT 555
Query: 1943 ESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPES 2002
S+ T+ +++ SS +S ++T +S++ + E+ S P++
Sbjct: 556 ASSETTKDVVQTSAGSSEAKDSAPLQKANIKNTNDGHTQSQNNKNTQ---ENKAKSLPQT 612
>gnl|CDD|219947 pfam08639, SLD3, DNA replication regulator SLD3. The SLD3 DNA
replication regulator is required for loading and
maintenance of Cdc45 on chromatin during DNA replication.
Length = 437
Score = 41.7 bits (98), Expect = 0.005
Identities = 35/206 (16%), Positives = 67/206 (32%), Gaps = 14/206 (6%)
Query: 1913 ESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPES 1972
S S + + T T T S +SS +S + ++++ +SS
Sbjct: 233 PSMKISPLKKKKTGTLKSSKPEPGTPLKRQTSPASSSQKSRRRSLQRVLTDERKSSSR-- 290
Query: 1973 ESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPES 2032
+P + T+S + E S E+ + S S + + + +S S
Sbjct: 291 -----RTPSLLRSRTNSSLIEFLKRESSENLLPSLSSRTSSDLLKNKRLQKRQVDLSDSS 345
Query: 2033 ESTTTSSPASESTTTNNPKSESTTTN----NPASESITSSSPASESTTTSSPASESTTTS 2088
+ N K E E + + +S T+
Sbjct: 346 RQHEEK--LKKKQMLNEQKKELKRAISALKKSNRELSSKDIVETAEKRSSQFGQGVQVTA 403
Query: 2089 SPASESTTTSSPASESTTTSSPESES 2114
+PA + +E+T++S P S+S
Sbjct: 404 TPAGNRKKDAGL-TEATSSSFPSSDS 428
>gnl|CDD|234977 PRK01741, PRK01741, cell division protein ZipA; Provisional.
Length = 332
Score = 41.3 bits (97), Expect = 0.005
Identities = 34/178 (19%), Positives = 61/178 (34%), Gaps = 13/178 (7%)
Query: 1942 SESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPE 2001
S + T + S S+ ++ T +P+S TT P + T S
Sbjct: 35 SNANTFTRTRPPSRPISNEEADQPNTLNPQSYVETTPPPFQQPQTEESESENEVQIQQEV 94
Query: 2002 SESTTTIS---PVSEST-TTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTT 2057
+S I P E + SE P+ +S T ++ AS + E T +
Sbjct: 95 EQSVDEIKITLPNQEPAYYMQNHRSEPIQPTQPQYQSPTQTNVASMTI-------EETQS 147
Query: 2058 NNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESEST 2115
N E I SSS + + + + + + ++ T + PE+ +
Sbjct: 148 PNVPIEGINSSSE--QLRVELAELAAEIYSDASHRVELAKNFMEPQAETEAQPEATTN 203
>gnl|CDD|165527 PHA03269, PHA03269, envelope glycoprotein C; Provisional.
Length = 566
Score = 41.3 bits (96), Expect = 0.006
Identities = 22/128 (17%), Positives = 45/128 (35%), Gaps = 5/128 (3%)
Query: 2003 ESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPAS 2062
+ T P+ E TS+ + +P ++ PA T+ + K + PA+
Sbjct: 20 ANLNTNIPIPE-LHTSAATQKPDPAPAPHQAASRAPDPAVAPTSAASRKPDLAQAPTPAA 78
Query: 2063 ESITSSSPASESTTTSSP----ASESTTTSSPASESTTTSSPASESTTTSSPESESTTTS 2118
+PA + +P A + P + TS+ + + S ++
Sbjct: 79 SEKFDPAPAPHQAASRAPDPAVAPQLAAAPKPDAAEAFTSAAQAHEAPADAGTSAASKKP 138
Query: 2119 SPASESTT 2126
PA+ +
Sbjct: 139 DPAAHTQH 146
Score = 40.5 bits (94), Expect = 0.013
Identities = 22/172 (12%), Positives = 58/172 (33%), Gaps = 19/172 (11%)
Query: 1957 TSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTT 2016
+++ + + +T PE TS+ + +P ++ P T+
Sbjct: 6 IILIITIACINLIIANLNTNIPIPE---LHTSAATQKPDPAPAPHQAASRAPDPAVAPTS 62
Query: 2017 TSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTT 2076
+S + +P ++ PA ++ +PA +++P
Sbjct: 63 AASRKPD--LAQAPTPAASEKFDPAPAPH------QAASRAPDPAVAPQLAAAP------ 108
Query: 2077 TSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIE 2128
P + TS+ + + S ++ P + + + P + + ++E
Sbjct: 109 --KPDAAEAFTSAAQAHEAPADAGTSAASKKPDPAAHTQHSPPPFAYTRSME 158
Score = 38.6 bits (89), Expect = 0.046
Identities = 17/142 (11%), Positives = 44/142 (30%), Gaps = 13/142 (9%)
Query: 1911 ESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSP 1970
+ +T PE ++ + +P + +P+ TS+ + +P
Sbjct: 20 ANLNTNIPIPELHTSAATQK-----PDPAPAPHQAASRAPDPAVAPTSAASRKPDLAQAP 74
Query: 1971 ESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTT--TSSPVSESTTTI 2028
++ P ++ P +P ++ TS+ +
Sbjct: 75 TPAASEKFDPAPAPH------QAASRAPDPAVAPQLAAAPKPDAAEAFTSAAQAHEAPAD 128
Query: 2029 SPESESTTTSSPASESTTTNNP 2050
+ S ++ PA+ + + P
Sbjct: 129 AGTSAASKKPDPAAHTQHSPPP 150
Score = 35.1 bits (80), Expect = 0.62
Identities = 25/137 (18%), Positives = 50/137 (36%), Gaps = 16/137 (11%)
Query: 2006 TTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESI 2065
I+ + + T+ P+ E T S +T PA ++ +PA
Sbjct: 13 ACINLIIANLNTNIPIPELHT-----SAATQKPDPAPAPH------QAASRAPDPAVAPT 61
Query: 2066 TSSSPASESTTTSSPASESTTTSSPASESTTTSSP----ASESTTTSSPESESTTTSSPA 2121
+++S + +PA+ +PA + +P A + P++ TS+
Sbjct: 62 SAASRKPDLAQAPTPAASEKFDPAPAPHQAASRAPDPAVAPQLAAAPKPDAAEAFTSAAQ 121
Query: 2122 SESTTIEEQGVSPHSEK 2138
+ + G S S+K
Sbjct: 122 AHEAPA-DAGTSAASKK 137
Score = 34.3 bits (78), Expect = 0.88
Identities = 20/133 (15%), Positives = 45/133 (33%), Gaps = 11/133 (8%)
Query: 1889 SLLSENTTTNSPE---SESTTTNNPESESTTTSS--PESESTTTSSLVSESTTTSSPESE 1943
+ + NT PE S +T +P +S P+ TS+ + +P
Sbjct: 18 IIANLNTNIPIPELHTSAATQKPDPAPAPHQAASRAPDPAVAPTSAASRKPDLAQAPTPA 77
Query: 1944 STTTSSPESESTTTSS------LVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTT 1997
++ P +S + + P++ TS+ ++ + S ++
Sbjct: 78 ASEKFDPAPAPHQAASRAPDPAVAPQLAAAPKPDAAEAFTSAAQAHEAPADAGTSAASKK 137
Query: 1998 SSPESESTTTISP 2010
P + + + P
Sbjct: 138 PDPAAHTQHSPPP 150
>gnl|CDD|218908 pfam06136, DUF966, Domain of unknown function (DUF966). Family of
plant proteins with unknown function.
Length = 308
Score = 40.9 bits (96), Expect = 0.006
Identities = 26/132 (19%), Positives = 48/132 (36%), Gaps = 10/132 (7%)
Query: 1995 TTTSSPESESTTTISPVSESTTTSSPV---SESTTTISPESESTTTSS-PASESTTTNNP 2050
++SS + + E + T ++S ++ + PA ST T++
Sbjct: 90 DSSSSKGDPEEASSRKLQEESDTPPVNRRANQSWSSSDLAEYKVYKAEEPADASTQTDDR 149
Query: 2051 KSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSP 2110
+S ++ S SP SS +S S+++S ES + + S
Sbjct: 150 RSRDSSEAESTELSREEISPP------SSSSSPSSSSSPETLESLIKADGRLSLSFRSLE 203
Query: 2111 ESESTTTSSPAS 2122
E ES +S
Sbjct: 204 EDESAGRVRASS 215
Score = 40.9 bits (96), Expect = 0.007
Identities = 23/132 (17%), Positives = 51/132 (38%), Gaps = 13/132 (9%)
Query: 1895 TTTNSPESESTTTNNPESESTTTSSPE---SESTTTSSL----VSESTTTSSPESESTTT 1947
+++S ++ E + T ++S ++S L V ++ + +++
Sbjct: 90 DSSSSKGDPEEASSRKLQEESDTPPVNRRANQSWSSSDLAEYKVYKAEEPADASTQTDDR 149
Query: 1948 SSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTT 2007
S +S ++ L E + S S +++SSPE T S + ++ + S S
Sbjct: 150 RSRDSSEAESTELSREEISPPSSSSSPSSSSSPE---TLESLIKADGRLSLSFRSLEE-- 204
Query: 2008 ISPVSESTTTSS 2019
+ SS
Sbjct: 205 -DESAGRVRASS 215
Score = 37.4 bits (87), Expect = 0.068
Identities = 21/126 (16%), Positives = 45/126 (35%), Gaps = 5/126 (3%)
Query: 2012 SESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPA 2071
SE +SS + S + + + + P + N S S ++ + +
Sbjct: 86 SEILDSSSSKGDPEEASSRKLQEESDTPPVNR--RANQSWSSSDLAEYKVYKAEEPADAS 143
Query: 2072 SESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQG 2131
+++ S S ++ + E + S +S +++SSPE T S ++
Sbjct: 144 TQTDDRRSRDSSEAESTELSREEISPPSSSSSPSSSSSPE---TLESLIKADGRLSLSFR 200
Query: 2132 VSPHSE 2137
E
Sbjct: 201 SLEEDE 206
Score = 35.5 bits (82), Expect = 0.32
Identities = 28/140 (20%), Positives = 49/140 (35%), Gaps = 17/140 (12%)
Query: 1945 TTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESES 2004
++SS +S + E + T + S S V ++ + S
Sbjct: 90 DSSSSKGDPEEASSRKLQEESDTPPVNR---RANQSWSSSDLAEYKVYKAEEPAD---AS 143
Query: 2005 TTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASES 2064
T T S ++ + S ISP S ++SSP+S S E+ + A
Sbjct: 144 TQTDDRRSRDSSEAESTELSREEISPPS---SSSSPSSSS------SPETLESLIKADGR 194
Query: 2065 ITSSSPASE--STTTSSPAS 2082
++ S + E + AS
Sbjct: 195 LSLSFRSLEEDESAGRVRAS 214
Score = 33.5 bits (77), Expect = 1.2
Identities = 27/119 (22%), Positives = 44/119 (36%), Gaps = 5/119 (4%)
Query: 1932 SESTTTSSP--ESESTTTSSPESESTTTSSLVSESTTTSSPESES---TTTSSPESESTT 1986
SE +SS + E ++ + ES T + + SS + P ST
Sbjct: 86 SEILDSSSSKGDPEEASSRKLQEESDTPPVNRRANQSWSSSDLAEYKVYKAEEPADASTQ 145
Query: 1987 TSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASEST 2045
T S ++ + S ISP S S++ SS S T +++ + S S
Sbjct: 146 TDDRRSRDSSEAESTELSREEISPPSSSSSPSSSSSPETLESLIKADGRLSLSFRSLEE 204
Score = 32.0 bits (73), Expect = 3.3
Identities = 18/132 (13%), Positives = 37/132 (28%), Gaps = 9/132 (6%)
Query: 1871 IFTTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSL 1930
I ++++ S+ + N S S ++ +
Sbjct: 88 ILDSSSSKGDPEEASSRKLQEES-----DTPPVNRRANQSWSSSDLAEYKVYKAEEPADA 142
Query: 1931 VSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSL 1990
+++ S +S ++ E + S S +++SSPE T S S
Sbjct: 143 STQTDDRRSRDSSEAESTELSREEISPPSSSSSPSSSSSPE----TLESLIKADGRLSLS 198
Query: 1991 VSESTTTSSPES 2002
S
Sbjct: 199 FRSLEEDESAGR 210
>gnl|CDD|221577 pfam12440, MAGE_N, Melanoma associated antigen family N terminal.
This domain family is found in eukaryotes, and is
typically between 82 and 96 amino acids in length. The
family is found in association with pfam01454. This
family is the N terminal of various melanoma associated
antigens. These are tumour rejection antigens which are
expressed on HLA-A1 of tumour cells and they are
recognised by cytotoxic T lymphocytes (CTLs).
Length = 96
Score = 37.9 bits (88), Expect = 0.007
Identities = 22/61 (36%), Positives = 33/61 (54%), Gaps = 1/61 (1%)
Query: 2063 ESITSSSPASESTTTSSPASESTTTS-SPASESTTTSSPASESTTTSSPESESTTTSSPA 2121
ES +SSSP T S PA+ S + SP S+++++ A+ S + S S S SP+
Sbjct: 35 ESPSSSSPLIPGTPESVPAAGSPSPPQSPQGASSSSTAVAATSWSQSDEGSSSQEEESPS 94
Query: 2122 S 2122
S
Sbjct: 95 S 95
Score = 36.8 bits (85), Expect = 0.018
Identities = 19/62 (30%), Positives = 30/62 (48%), Gaps = 1/62 (1%)
Query: 2073 ESTTTSSPASESTTTSSPASESTTTS-SPASESTTTSSPESESTTTSSPASESTTIEEQG 2131
ES ++SSP T S PA+ S + SP S+++++ + S + S S S E
Sbjct: 35 ESPSSSSPLIPGTPESVPAAGSPSPPQSPQGASSSSTAVAATSWSQSDEGSSSQEEESPS 94
Query: 2132 VS 2133
S
Sbjct: 95 SS 96
Score = 35.2 bits (81), Expect = 0.059
Identities = 21/73 (28%), Positives = 39/73 (53%), Gaps = 6/73 (8%)
Query: 1917 TSSPESESTTTSSLVSESTTTSSPESESTTTS-SPESESTTTSSLVSESTTTSSPESEST 1975
++ E ES ++SS + T S P + S + SP+ S++++++ + S + S S
Sbjct: 29 PAAEEEESPSSSSPLIPGTPESVPAAGSPSPPQSPQGASSSSTAVAATSWSQSDEGS--- 85
Query: 1976 TTSSPESESTTTS 1988
SS E ES ++S
Sbjct: 86 --SSQEEESPSSS 96
Score = 33.3 bits (76), Expect = 0.28
Identities = 19/73 (26%), Positives = 37/73 (50%), Gaps = 6/73 (8%)
Query: 1947 TSSPESESTTTSSLVSESTTTSSPESESTTTS-SPESESTTTSSLVSESTTTSSPESEST 2005
++ E ES ++SS + T S P + S + SP+ S++++++ + S + S S
Sbjct: 29 PAAEEEESPSSSSPLIPGTPESVPAAGSPSPPQSPQGASSSSTAVAATSWSQSDEGSS-- 86
Query: 2006 TTISPVSESTTTS 2018
S ES ++S
Sbjct: 87 ---SQEEESPSSS 96
Score = 31.8 bits (72), Expect = 0.86
Identities = 21/68 (30%), Positives = 35/68 (51%), Gaps = 4/68 (5%)
Query: 2051 KSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSP 2110
+ ES ++++P S PA+ S S P S +SS + + T+ S + E +SS
Sbjct: 33 EEESPSSSSPLIPGTPESVPAAGSP--SPPQSPQGASSSSTAVAATSWSQSDEG--SSSQ 88
Query: 2111 ESESTTTS 2118
E ES ++S
Sbjct: 89 EEESPSSS 96
Score = 30.6 bits (69), Expect = 2.2
Identities = 18/67 (26%), Positives = 37/67 (55%), Gaps = 1/67 (1%)
Query: 1997 TSSPESESTTTISPVSESTTTSSPVSESTTTI-SPESESTTTSSPASESTTTNNPKSEST 2055
++ E ES ++ SP+ T S P + S + SP+ S+++++ A+ S + ++ S S
Sbjct: 29 PAAEEEESPSSSSPLIPGTPESVPAAGSPSPPQSPQGASSSSTAVAATSWSQSDEGSSSQ 88
Query: 2056 TTNNPAS 2062
+P+S
Sbjct: 89 EEESPSS 95
Score = 30.6 bits (69), Expect = 2.5
Identities = 20/70 (28%), Positives = 35/70 (50%), Gaps = 2/70 (2%)
Query: 1937 TSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTT 1996
++ E ES ++SSP T S + S S P+S +SS + + T+ S E ++
Sbjct: 29 PAAEEEESPSSSSPLIPGTPESVPAAGSP--SPPQSPQGASSSSTAVAATSWSQSDEGSS 86
Query: 1997 TSSPESESTT 2006
+ ES S++
Sbjct: 87 SQEEESPSSS 96
Score = 30.2 bits (68), Expect = 3.3
Identities = 18/61 (29%), Positives = 33/61 (54%), Gaps = 3/61 (4%)
Query: 2081 ASESTT--TSSPASESTTTSSPASESTTTS-SPESESTTTSSPASESTTIEEQGVSPHSE 2137
A E + +SSP T S PA+ S + SP+ S+++++ A+ S + ++G S E
Sbjct: 31 AEEEESPSSSSPLIPGTPESVPAAGSPSPPQSPQGASSSSTAVAATSWSQSDEGSSSQEE 90
Query: 2138 K 2138
+
Sbjct: 91 E 91
>gnl|CDD|185628 PTZ00449, PTZ00449, 104 kDa microneme/rhoptry antigen; Provisional.
Length = 943
Score = 41.2 bits (96), Expect = 0.007
Identities = 42/279 (15%), Positives = 69/279 (24%), Gaps = 18/279 (6%)
Query: 1903 ESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESE------STTTSSPESESTT 1956
+ +S + P+ + E P E T + PE
Sbjct: 526 DKEGEEGEHEDSKESDEPKEGGKPGETKEGEVGKKPGPAKEHKPSKIPTLSKKPEFPKDP 585
Query: 1957 TSSLVSESTTTS-SPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSEST 2015
E P S T + + SP+S P
Sbjct: 586 KHPKDPEEPKKPKRPRSAQRPTRPKSPKLPELLDIPKSPKRPESPKSPK----RPPPPQR 641
Query: 2016 TTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASEST 2075
+S E I + P +PK + ++ + S +
Sbjct: 642 PSSPERPEGPKIIK-------SPKPPKSPKPPFDPKFKEKFYDDYLDAAAKSKETKTTVV 694
Query: 2076 TTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVSPH 2135
S S T + T+ E P +E E P
Sbjct: 695 LDESFESILKETLPETPGTPFTTPRPLPPKLPRDEEFPFEPIGDPDAEQPDDIEFFTPPE 754
Query: 2136 SEKLSANEDPEEFPNEDVFEHTFAEIPNIDHSNQTDEAI 2174
E+ +E P + P D+ F E + + DEA+
Sbjct: 755 EERTFFHETPADTPLPDILAEEFKEEDIHAETGEPDEAM 793
Score = 33.1 bits (75), Expect = 2.4
Identities = 36/273 (13%), Positives = 70/273 (25%), Gaps = 19/273 (6%)
Query: 1897 TNSPESESTTTNNPESESTTTSSP--ESESTTTSSLVSESTTTSSPESESTTTSSPESES 1954
++ P+ E E P E + + +L + P+ + +
Sbjct: 540 SDEPKEGGKPGETKEGEVGKKPGPAKEHKPSKIPTLSKKPEFPKDPKHPKDPEEPKKPKR 599
Query: 1955 TTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSES 2014
++ + + PE S ES + SSPE I +
Sbjct: 600 PRSAQRPTRPKSPKLPELLDIPKSPKRPESPKSPKRPPPPQRPSSPERPEGPKIIKSPKP 659
Query: 2015 TTTSSP-----------------VSESTTTISPESESTTTSSPASESTTTNNPKSESTTT 2057
+ P ++S T + + S E+ +T
Sbjct: 660 PKSPKPPFDPKFKEKFYDDYLDAAAKSKETKTTVVLDESFESILKETLPETPGTPFTTPR 719
Query: 2058 NNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTT 2117
P P + + +P E T T +E
Sbjct: 720 PLPPKLPRDEEFPFEPIGDPDAEQPDDIEFFTPPEEERTFFHETPADTPLPDILAEEFKE 779
Query: 2118 SSPASESTTIEEQGVSPHSEKLSANEDPEEFPN 2150
+E+ +E P S ++ P + P+
Sbjct: 780 EDIHAETGEPDEAMKRPDSPSEHEDKPPGDHPS 812
>gnl|CDD|237874 PRK14971, PRK14971, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 614
Score = 41.3 bits (97), Expect = 0.007
Identities = 20/138 (14%), Positives = 40/138 (28%), Gaps = 17/138 (12%)
Query: 2032 SESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPA 2091
+ S ++ P++ + S SP+ S A +S T +
Sbjct: 364 QKGDDASGGRGPKQHIKPVFTQPAAAPQPSAAAAASPSPSQSSAAAQPSAPQSATQPAGT 423
Query: 2092 SESTTTSSPASESTTTSSPESESTTTSSPASE---------STTIE-EQGVSPHSEKLSA 2141
+ + PA+ S ++ + E S + + +E+ +
Sbjct: 424 PPTVSVDPPAAVPVNPPSTAPQAVRPAQFKEEKKIPVSKVSSLGPSTLRPIQEKAEQATG 483
Query: 2142 NE-------DPEEFPNED 2152
N E F ED
Sbjct: 484 NIKEAPTGTQKEIFTEED 501
Score = 34.0 bits (78), Expect = 1.3
Identities = 17/124 (13%), Positives = 33/124 (26%), Gaps = 4/124 (3%)
Query: 1942 SESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPE 2001
+ S + ++ P + + + SP S +S T +
Sbjct: 364 QKGDDASGGRGPKQHIKPVFTQPAAAPQPSAAAAASPSPSQSSAAAQPSAPQSATQPAGT 423
Query: 2002 SESTTTISPVSEST-TTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSEST---TT 2057
+ + P + S+ E + S +S +T P E T
Sbjct: 424 PPTVSVDPPAAVPVNPPSTAPQAVRPAQFKEEKKIPVSKVSSLGPSTLRPIQEKAEQATG 483
Query: 2058 NNPA 2061
N
Sbjct: 484 NIKE 487
>gnl|CDD|203922 pfam08377, MAP2_projctn, MAP2/Tau projection domain. This domain is
found in the MAP2/Tau family of proteins which includes
MAP2, MAP4, Tau, and their homologs. All isoforms contain
a conserved C-terminal domain containing tubulin-binding
repeats (pfam00418), and a N-terminal projection domain
of varying size. This domain has a net negative charge
and exerts a long-range repulsive force. This provides a
mechanism that can regulate microtubule spacing which
might facilitate efficient organelle transport.
Length = 1134
Score = 41.3 bits (96), Expect = 0.008
Identities = 60/300 (20%), Positives = 112/300 (37%), Gaps = 36/300 (12%)
Query: 1878 SESTVVMSTLNS-LLSENTTTNSPESESTTTNNPESESTTTS--SPESESTTTSSLVSES 1934
SE+T V+ ++S + N E TT+ + E++T S P T + + E+
Sbjct: 9 SEATTVLGDVHSPAVEGFVGENISGEEKGTTDQEKKETSTPSVQEPTLTETEPQTKLEET 68
Query: 1935 TTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPES---ESTTTSSPESESTTTS--S 1989
+ S E+ + S + + + + + + S E + T + + +S S
Sbjct: 69 SKVSIEETVAKEEESLKLKDDKAGVIQTSTEHSFSKEDQKGQEQTIEALKQDSFPISLEQ 128
Query: 1990 LVSESTTTSSPESESTTTISPVSE----------------------STTTSS---PVSES 2024
V+++ + + T+ VSE S T + P E
Sbjct: 129 AVTDAAMATKTLEKVTSEPEAVSEKREIQGLFEEDIADKSKLEGAGSATVAEVEMPFYED 188
Query: 2025 TTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASES 2084
+ +S E++ + ST + E + + A ES+ + SP ++ A S
Sbjct: 189 KSGMSKYFETSALKEDVTRSTGLGSDYYELSDSRGNAQESLDTVSPKNQQDEKELLAKAS 248
Query: 2085 TTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSANED 2144
S PA E+ S ++S T+ P SSP TI+ + + S N+D
Sbjct: 249 -QPSPPAHEAGY--STLAQSYTSDHPSELPEEPSSPQERMFTIDPKVYGEKRDLHSKNKD 305
>gnl|CDD|227680 COG5391, COG5391, Phox homology (PX) domain protein [Intracellular
trafficking and secretion / General function prediction
only].
Length = 524
Score = 40.9 bits (96), Expect = 0.009
Identities = 29/136 (21%), Positives = 51/136 (37%), Gaps = 11/136 (8%)
Query: 1909 NPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSP--------ESESTTTSSL 1960
+P++ES+ + S S S++ S S S+ S+P ES+
Sbjct: 8 SPKNESSASDSGPSGSSSESQESSTVKNNDGSPVNSSIKSTPLDIQKRYSGFESSAKLPR 67
Query: 1961 VSESTTTSSPESESTTTSSPESESTTTSSLVSE--STTTSSPESESTTTISPVSESTTTS 2018
+S++ + P T + + + S SE S T+ P+S S T
Sbjct: 68 ISDAPSFVPPPGGHTISYTIAIHDSKIHSRASEFRSLRDMLSLLLPTSLQPPLSTSHTIL 127
Query: 2019 SPVSESTTTISPESES 2034
ST + P+S +
Sbjct: 128 DYFISSTVSN-PQSLT 142
Score = 39.4 bits (92), Expect = 0.025
Identities = 33/142 (23%), Positives = 55/142 (38%), Gaps = 11/142 (7%)
Query: 1938 SSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSP--------ESESTTTSS 1989
SSP++ES+ + S S S++ S S S+ S+P ES+
Sbjct: 7 SSPKNESSASDSGPSGSSSESQESSTVKNNDGSPVNSSIKSTPLDIQKRYSGFESSAKLP 66
Query: 1990 LVSESTTTSSPESESTTTISPVSESTTTSSPVSE--STTTISPESESTTTSSPASESTTT 2047
+S++ + P T + + + S SE S + T+ P S S T
Sbjct: 67 RISDAPSFVPPPGGHTISYTIAIHDSKIHSRASEFRSLRDMLSLLLPTSLQPPLSTS-HT 125
Query: 2048 NNPKSESTTTNNPASESITSSS 2069
S+T +NP S ++ S
Sbjct: 126 ILDYFISSTVSNPQSLTLLVDS 147
Score = 39.4 bits (92), Expect = 0.027
Identities = 32/139 (23%), Positives = 55/139 (39%), Gaps = 5/139 (3%)
Query: 1968 SSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTT 2027
SSP++ES+ + S S S++ S S S+ +P+ S ES+
Sbjct: 7 SSPKNESSASDSGPSGSSSESQESSTVKNNDGSPVNSSIKSTPLDIQKRYSG--FESSAK 64
Query: 2028 ISPESESTTTSSPASESTTTNNPKSESTTTNNPASE--SITSSSPASESTTTSSPASEST 2085
+ S++ + P T + + ++ ASE S+ T+ P S S
Sbjct: 65 LPRISDAPSFVPPPGGHTISYTIAIHDSKIHSRASEFRSLRDMLSLLLPTSLQPPLSTS- 123
Query: 2086 TTSSPASESTTTSSPASES 2104
T S+T S+P S +
Sbjct: 124 HTILDYFISSTVSNPQSLT 142
Score = 34.4 bits (79), Expect = 0.97
Identities = 27/122 (22%), Positives = 44/122 (36%), Gaps = 6/122 (4%)
Query: 2009 SPVSESTTTSSPVSESTTTIS-PESESTTTSSPASESTTTNNPKSESTTTNNPASESITS 2067
SP +ES+ + S S S++ + SP + S + + + ES
Sbjct: 8 SPKNESSASDSGPSGSSSESQESSTVKNNDGSPVNSSIKSTPLDIQKRYSG---FESSAK 64
Query: 2068 SSPASESTTTSSPASESTTTSSPASESTTTSSPASE--STTTSSPESESTTTSSPASEST 2125
S++ + P T + + A + S ASE S T+ P S S
Sbjct: 65 LPRISDAPSFVPPPGGHTISYTIAIHDSKIHSRASEFRSLRDMLSLLLPTSLQPPLSTSH 124
Query: 2126 TI 2127
TI
Sbjct: 125 TI 126
Score = 34.0 bits (78), Expect = 1.3
Identities = 28/138 (20%), Positives = 54/138 (39%), Gaps = 3/138 (2%)
Query: 1918 SSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTT 1977
SSP++ES+ + S S S++ S S S+ S+ + ES+
Sbjct: 7 SSPKNESSASDSGPSGSSSESQESSTVKNNDGSPVNSSIKSTPLDIQKRY--SGFESSAK 64
Query: 1978 SSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSE-STTTISPESESTT 2036
S++ + T + + + S SE + +S T++ P ++
Sbjct: 65 LPRISDAPSFVPPPGGHTISYTIAIHDSKIHSRASEFRSLRDMLSLLLPTSLQPPLSTSH 124
Query: 2037 TSSPASESTTTNNPKSES 2054
T S+T +NP+S +
Sbjct: 125 TILDYFISSTVSNPQSLT 142
Score = 31.7 bits (72), Expect = 6.4
Identities = 22/101 (21%), Positives = 35/101 (34%), Gaps = 12/101 (11%)
Query: 2059 NPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPAS--------ESTTTSSP 2110
+P +ES S S S S++ S +S S+ S+P ES+
Sbjct: 8 SPKNESSASDSGPSGSSSESQESSTVKNNDGSPVNSSIKSTPLDIQKRYSGFESSAKLPR 67
Query: 2111 ESESTTTSSP---ASESTTIEEQGVSPHSEKLSANEDPEEF 2148
S++ + P + S TI HS S +
Sbjct: 68 ISDAPSFVPPPGGHTISYTIAIHDSKIHSRA-SEFRSLRDM 107
Score = 31.3 bits (71), Expect = 7.7
Identities = 13/45 (28%), Positives = 21/45 (46%)
Query: 2078 SSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPAS 2122
SSP +ES+ + S S S++ S +S S+ S+P
Sbjct: 7 SSPKNESSASDSGPSGSSSESQESSTVKNNDGSPVNSSIKSTPLD 51
>gnl|CDD|227400 COG5068, ARG80, Regulator of arginine metabolism and related MADS
box-containing transcription factors [Transcription].
Length = 412
Score = 40.8 bits (95), Expect = 0.009
Identities = 37/253 (14%), Positives = 72/253 (28%), Gaps = 24/253 (9%)
Query: 1912 SESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTS-SP 1970
+E E+ T + + S E +S S + + +S S S + S P
Sbjct: 121 TEVLLLVISENGLVHTFTTPKLESVVKSLEGKSLIQSPCSNAPSDSSEEPSSSASFSVDP 180
Query: 1971 ESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISP-----------------VSE 2013
+ S + S T+ + ++ T + S+ P + E
Sbjct: 181 NDNNPMGSFQHNGSPQTNFIPLQNPQTQQYQQHSSRKDHPTVPHSNTNNGRPPAKFMIPE 240
Query: 2014 STTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASE 2073
++ S + + IS +S+ + + NNP E E
Sbjct: 241 LHSSHSTLDLPSDFISDSGFPNQSSTSIFPLDSAIIQITPPHLPNNPPQE------NRHE 294
Query: 2074 STTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVS 2133
+ S T + SP + + + S + E + + S
Sbjct: 295 LYSNDSSMVSETPPPKNLPNGSPNQSPLNNLSRGNPASPNSIIRENNQVEDESFNGRQGS 354
Query: 2134 PHSEKLSANEDPE 2146
L + P
Sbjct: 355 AIWNALISTTQPN 367
>gnl|CDD|173171 PRK14708, PRK14708, flagellin; Provisional.
Length = 888
Score = 41.1 bits (96), Expect = 0.009
Identities = 42/268 (15%), Positives = 106/268 (39%), Gaps = 12/268 (4%)
Query: 1872 FTTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLV 1931
+ T +N +T+ +T + L + +N+ + + + ++T S ++ S+
Sbjct: 105 YATKSNVSATIAGATADDLRGTQSFSNAVATSNVIFDGTAGGTSTASGTDTLGGGIVSIA 164
Query: 1932 SESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTT---S 1988
+ + T +++T S S T ++ +S + T + P + + T
Sbjct: 165 AGTAVTVLGAADATALGSVLSVGTAAATATGADLISSLTNGSTATATGPAAGDSITVNGK 224
Query: 1989 SLVSESTTTSSPESESTTTI---SPVSESTTTSSPVSESTTTISPESESTTTSSPASEST 2045
++ + ++ +S TI ++ T ++ +TT + S T+ +
Sbjct: 225 TITFTTAGAATADSNGNYTIGLDQTLTALLATIDTINGNTT-----NPSVVTAGKLELHS 279
Query: 2046 TTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASEST 2105
TN+P + S + + + + T++ A+ S TT + +++ ++ T
Sbjct: 280 GTNSPLTISDNAGGAVLAKLGLGAQVTTTAGTTAAANISATTQLFNTHGGLSTTAIADGT 339
Query: 2106 T-TSSPESESTTTSSPASESTTIEEQGV 2132
T T + ++ + TS + + GV
Sbjct: 340 TLTVNGKTITFKTSDAPQGNNILTGSGV 367
Score = 38.0 bits (88), Expect = 0.083
Identities = 42/253 (16%), Positives = 94/253 (37%), Gaps = 6/253 (2%)
Query: 1885 STLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESES 1944
S N L N + + S T ++ + S + TS+++ + T + +
Sbjct: 93 SIANQALQTNVGYATKSNVSATIAGATADDLRGTQSFSNAVATSNVIFDGTAGGTSTASG 152
Query: 1945 TTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSE-----STTTSS 1999
T T S + V+ + S + + + T + L+S + T +
Sbjct: 153 TDTLGGGIVSIAAGTAVTVLGAADATALGSVLSVGTAAATATGADLISSLTNGSTATATG 212
Query: 2000 PESESTTTISPVSES-TTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTN 2058
P + + T++ + + TT + ++S + + T T+ A+ T N + S T
Sbjct: 213 PAAGDSITVNGKTITFTTAGAATADSNGNYTIGLDQTLTALLATIDTINGNTTNPSVVTA 272
Query: 2059 NPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTS 2118
++SP + S + + + T++ A+ S TT + ++
Sbjct: 273 GKLELHSGTNSPLTISDNAGGAVLAKLGLGAQVTTTAGTTAAANISATTQLFNTHGGLST 332
Query: 2119 SPASESTTIEEQG 2131
+ ++ TT+ G
Sbjct: 333 TAIADGTTLTVNG 345
Score = 34.5 bits (79), Expect = 0.84
Identities = 40/217 (18%), Positives = 78/217 (35%), Gaps = 6/217 (2%)
Query: 1926 TTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESEST 1985
T S VS + ++ + T S S + TS+++ + T + + T T S
Sbjct: 106 ATKSNVSATIAGATADDLRGTQSF--SNAVATSNVIFDGTAGGTSTASGTDTLGGGIVSI 163
Query: 1986 TTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASEST 2045
+ V+ + S ++ + + T + +S T + T T A +S
Sbjct: 164 AAGTAVTVLGAADATALGSVLSVGTAAATATGADLISSLTNGSTA----TATGPAAGDSI 219
Query: 2046 TTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASEST 2105
T N TT ++S + + + T T+ A+ T + + S T+ +
Sbjct: 220 TVNGKTITFTTAGAATADSNGNYTIGLDQTLTALLATIDTINGNTTNPSVVTAGKLELHS 279
Query: 2106 TTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSAN 2142
T+SP + S + Q + +AN
Sbjct: 280 GTNSPLTISDNAGGAVLAKLGLGAQVTTTAGTTAAAN 316
>gnl|CDD|220779 pfam10488, PP1c_bdg, Phosphatase-1 catalytic subunit binding region.
This conserved C-terminus appears to be a protein
phosphatase-1 catalytic subunit (PP1C) binding region,
which may in some circumstances also be retroviral in
origin since it is found in both herpes simplex virus and
in mouse and man. This domain is found in Gadd-34
apoptosis-associated proteins as well as the constitutive
repressor of eIF2-alpha phosphorylation/protein
phosphatase 1, regulatory (inhibitor) subunit 15b,
otherwise known as CReP. Diverse stressful conditions are
associated with phosphorylation of the {alpha} subunit of
eukaryotic translation initiation factor 2 (eIF2{alpha})
on serine 51. This signaling event, which is conserved
from yeast to mammals, negatively regulates the guanine
nucleotide exchange factor, eIF2-B and inhibits the
recycling of eIF2 to its active GTP bound form. In
mammalian cells eIF2{alpha} phosphorylation emerges as an
important event in stress signaling that impacts on gene
expression at both the translational and transcriptional
levels.
Length = 307
Score = 40.4 bits (94), Expect = 0.010
Identities = 36/177 (20%), Positives = 66/177 (37%), Gaps = 9/177 (5%)
Query: 1932 SESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESES-----TTTSSPESESTT 1986
S+ ++S ES S S + + SSL SES E T + P +
Sbjct: 24 SDLESSSDVESISWDEESEDDGFDSDSSL-SESDREQDDEGLHLWNSFTKSVDPYNPLNF 82
Query: 1987 TSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTT 2046
T+++ + +T P S + S ++ P+ + S E +S +S+ ES
Sbjct: 83 TATIQTAATIKPKPPSSE-SDWSGEENVSSQEGPLPSTPEHSSSEDDSWESSADEEESLK 141
Query: 2047 TNNPKSESTTTNNPAS--ESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPA 2101
N ++ NP + +S + + S + + + +ST S A
Sbjct: 142 LWNSFCQNDDPYNPLNFKAPFQTSGKNPKGSKHDSKTNSEQNVAIRSLKSTRLSCKA 198
Score = 36.2 bits (83), Expect = 0.18
Identities = 37/185 (20%), Positives = 58/185 (31%), Gaps = 35/185 (18%)
Query: 1962 SESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPES--------ESTTTISPVSE 2013
S+ ++S ES S S + + SSL SES E +S +P++
Sbjct: 24 SDLESSSDVESISWDEESEDDGFDSDSSL-SESDREQDDEGLHLWNSFTKSVDPYNPLNF 82
Query: 2014 STTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPA-SESITSSSPAS 2072
+ T + TI P+ S+ + E N E + P S S S +S
Sbjct: 83 TATIQTAA-----TIKPKPPSSESDWSGEE----NVSSQEGPLPSTPEHSSSEDDSWESS 133
Query: 2073 ESTTTSSPASES----------------TTTSSPASESTTTSSPASESTTTSSPESESTT 2116
S S TS + + S + + +ST
Sbjct: 134 ADEEESLKLWNSFCQNDDPYNPLNFKAPFQTSGKNPKGSKHDSKTNSEQNVAIRSLKSTR 193
Query: 2117 TSSPA 2121
S A
Sbjct: 194 LSCKA 198
Score = 35.0 bits (80), Expect = 0.42
Identities = 38/188 (20%), Positives = 64/188 (34%), Gaps = 21/188 (11%)
Query: 1902 SESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPES--------ESTTTSSPESE 1953
S+ ++++ ES S S + + SSL SES E +S P +
Sbjct: 24 SDLESSSDVESISWDEESEDDGFDSDSSL-SESDREQDDEGLHLWNSFTKSVD---PYNP 79
Query: 1954 STTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSE 2013
T+++ + +T P S + S E+ S+ L S +SS + S
Sbjct: 80 LNFTATIQTAATIKPKPPSSESDWSGEENVSSQEGPLPSTPEHSSSEDDS-----WESSA 134
Query: 2014 STTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASE 2073
S + S +P S + TN+ + +I S +
Sbjct: 135 DEEESLKLWNSFCQNDDPYNPLNFKAPFQTSGKNPKGSKHDSKTNSEQNVAIRS----LK 190
Query: 2074 STTTSSPA 2081
ST S A
Sbjct: 191 STRLSCKA 198
Score = 30.8 bits (69), Expect = 8.9
Identities = 28/134 (20%), Positives = 51/134 (38%), Gaps = 5/134 (3%)
Query: 2012 SESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPA 2071
S+ ++S S S ESE S +S S + E N ++S+ +P
Sbjct: 24 SDLESSSDVESISWDE---ESEDDGFDSDSSLSESDREQDDEGLHLWNSFTKSVDPYNPL 80
Query: 2072 SESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQG 2131
+ + T + A+ P+SES + S P + ++S S ++ +E+
Sbjct: 81 NFTATIQTAATIK--PKPPSSESDWSGEENVSSQEGPLPSTPEHSSSEDDSWESSADEEE 138
Query: 2132 VSPHSEKLSANEDP 2145
N+DP
Sbjct: 139 SLKLWNSFCQNDDP 152
>gnl|CDD|219929 pfam08604, Nup153, Nucleoporin Nup153-like. This family contains
both the nucleoporin Nup153 from human and Nup153 from
fission yeast. These have been demonstrated to be
functionally equivalent.
Length = 519
Score = 40.8 bits (95), Expect = 0.010
Identities = 49/236 (20%), Positives = 89/236 (37%), Gaps = 23/236 (9%)
Query: 1910 PESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSS 1969
P + T S ST S + S T P + S E+ + + +
Sbjct: 253 PPVQRLVTPKSRSVSTNRSGYIKPSLT---PSGVFSAVSRRLDEACEDDVRKN-ALPKQN 308
Query: 1970 PESESTT---TSSPESESTTTSS--LVSESTT---TSSPESESTTTISPVSESTTTSSPV 2021
P+SE + S+P + ++ + E + + E E + P +SP
Sbjct: 309 PKSERFSYPIFSTPAANGLSSGGGKMTRERPSFASSKPHEEELEAPVLPKISLPIKTSPA 368
Query: 2022 SESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPA 2081
+ T SPE +T + SP S+ T + + + T+ S T SSP +ST +
Sbjct: 369 LPTFTFSSPEDTATFSHSPISKDTPAKSQEVKITS----PSPQFTFSSPIVKST----ES 420
Query: 2082 SESTTTSSPASESTTTSSPASESTTTSS---PESESTTTSSPASESTTIEEQGVSP 2134
+ + S + + +E+T S P+ E T + +ST +++ P
Sbjct: 421 NVEPPSPSKEFTFSVPVAKFTEATGDKSLVVPKFEFKPTHTATVQSTNLKDNEPKP 476
Score = 39.3 bits (91), Expect = 0.030
Identities = 40/232 (17%), Positives = 83/232 (35%), Gaps = 13/232 (5%)
Query: 1893 ENTTTNSPESESTTTNNPESESTT---TSSPESESTTTSS--LVSESTT---TSSPESES 1944
+ + + NP+SE + S+P + ++ + E + + E E
Sbjct: 292 DEACEDDVRKNALPKQNPKSERFSYPIFSTPAANGLSSGGGKMTRERPSFASSKPHEEEL 351
Query: 1945 TTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESES 2004
P+ +S + T SSPE +T + SP S+ T S + T+ S + S
Sbjct: 352 EAPVLPKISLPIKTSPALPTFTFSSPEDTATFSHSPISKDTPAKSQEVKITSPSPQFTFS 411
Query: 2005 TTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASES 2064
+ + + SP E T ++ P ++ T + + PK E T+ +S
Sbjct: 412 SPIVKSTESNVEPPSPSKEFTFSV-PVAKFTEATG----DKSLVVPKFEFKPTHTATVQS 466
Query: 2065 ITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTT 2116
+ T + +++ S + +S S + + + +
Sbjct: 467 TNLKDNEPKPTFGAFKPAKTLKEGSVLDLLKSPGFFSSPSPKREATQKTANS 518
Score = 35.8 bits (82), Expect = 0.27
Identities = 33/210 (15%), Positives = 77/210 (36%), Gaps = 10/210 (4%)
Query: 1947 TSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTT 2006
+ +P + S + E+ + +P+SE + + + S T
Sbjct: 277 SLTPSGVFSAVSRRLDEACEDDV-RKNALPKQNPKSERFSYPIFSTPAANGLSSGGGKMT 335
Query: 2007 TISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESIT 2066
P S+ E + P+ +SPA + T ++P+ +T +++P S+
Sbjct: 336 RERPSFASSKPHE--EELEAPVLPKISLPIKTSPALPTFTFSSPEDTATFSHSPISKDTP 393
Query: 2067 SSSPASESTTTSSPASESTTT--SSPASESTTTSSPASESTTTSSPESESTTTSSPASES 2124
+ S + T+ S + S+ S+ ++ + S T S P ++ T + +S
Sbjct: 394 AKSQEVKITSPSPQFTFSSPIVKSTESNVEPPSPSK---EFTFSVPVAKFTEATG--DKS 448
Query: 2125 TTIEEQGVSPHSEKLSANEDPEEFPNEDVF 2154
+ + P + + ++ + F
Sbjct: 449 LVVPKFEFKPTHTATVQSTNLKDNEPKPTF 478
Score = 32.8 bits (74), Expect = 2.8
Identities = 34/148 (22%), Positives = 58/148 (39%), Gaps = 14/148 (9%)
Query: 1886 TLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESEST 1945
T +S T ++SP S+ T + E + T+ S + S+ + SP S+
Sbjct: 373 TFSSPEDTATFSHSPISKDTPAKSQEVKITSPSPQFTFSSPIVKSTESNVEPPSP-SKEF 431
Query: 1946 TTSSPE---SESTTTSSLVS---ESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSS 1999
T S P +E+T SLV E T + +ST E + T + +++ S
Sbjct: 432 TFSVPVAKFTEATGDKSLVVPKFEFKPTHTATVQSTNLKDNEPKPTFGAFKPAKTLKEGS 491
Query: 2000 -------PESESTTTISPVSESTTTSSP 2020
P S+ + + T +SP
Sbjct: 492 VLDLLKSPGFFSSPSPKREATQKTANSP 519
>gnl|CDD|219106 pfam06614, Neuromodulin, Neuromodulin. This family consists of
several neuromodulin (Axonal membrane protein GAP-43)
sequences and is found in conjunction with pfam00612.
GAP-43 is a neuronal calmodulin-binding phosphoprotein
that is concentrated in growth cones and pre-synaptic
terminals.
Length = 174
Score = 39.1 bits (90), Expect = 0.011
Identities = 34/159 (21%), Positives = 62/159 (38%), Gaps = 9/159 (5%)
Query: 1895 TTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESES 1954
T T + E+E+ T+ P + ++ + E + +S E +P S +S E+ES
Sbjct: 21 TATEATEAETPKTDEPTKDGSSPAE-EKKGEGSSDKPQEQPAPQAPASSEEKQASAETES 79
Query: 1955 TTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSES 2014
T +S T +SP S++ E V+ + T+ ++T +P E
Sbjct: 80 ATKAS------TDNSPSSKADVAPLKEESKKADVPAVTAAAATTPAAEDATAKAAPQPEQ 133
Query: 2015 TTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSE 2053
T S S+ E+ + S E K++
Sbjct: 134 ETAES--SQEEEKKDAVEETKPSESAQQEEAKEEEAKAD 170
Score = 37.1 bits (85), Expect = 0.044
Identities = 33/161 (20%), Positives = 60/161 (37%), Gaps = 11/161 (6%)
Query: 1971 ESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISP 2030
+ E+ T + T + ++ ++ + E + + E +P S S
Sbjct: 16 KGEAKTATEATEAETPKTDEPTKDGSSPAEEKKGEGSSDKPQEQPAPQAPASSEEKQASA 75
Query: 2031 ESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSS-PASESTTTSSPASESTTTSS 2089
E+ES T +S T N+P S++ P E + PA + ++PA+E T +
Sbjct: 76 ETESATKAS------TDNSPSSKADVA--PLKEESKKADVPAVTAAAATTPAAEDATAKA 127
Query: 2090 PASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQ 2130
T+ + E E T S A + EE+
Sbjct: 128 APQPEQETAESSQEEEKKD--AVEETKPSESAQQEEAKEEE 166
Score = 34.5 bits (78), Expect = 0.39
Identities = 26/138 (18%), Positives = 52/138 (37%), Gaps = 6/138 (4%)
Query: 1909 NPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTS 1968
N + E+ T + T + ++ ++ + E + +S E + S +
Sbjct: 14 NKKGEAKTATEATEAETPKTDEPTKDGSSPAEEKKGEGSSDKPQEQPAPQAPASSEEKQA 73
Query: 1969 SPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTI 2028
S E+ES T +S T +S S++ E + V+ + T+ ++T
Sbjct: 74 SAETESATKAS------TDNSPSSKADVAPLKEESKKADVPAVTAAAATTPAAEDATAKA 127
Query: 2029 SPESESTTTSSPASESTT 2046
+P+ E T S E
Sbjct: 128 APQPEQETAESSQEEEKK 145
Score = 32.5 bits (73), Expect = 1.5
Identities = 40/174 (22%), Positives = 62/174 (35%), Gaps = 18/174 (10%)
Query: 2020 PVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSS 2079
P E+ E+++ T ++ A T K S+ E SS E +
Sbjct: 7 PSEEAVENKKGEAKTATEATEAETPKTDEPTKDGSSPAEEKKGE--GSSDKPQEQPAPQA 64
Query: 2080 PASESTTTSSPASESTTT-------SSPASESTTTSSPESESTTTSSPASESTTIEE--- 2129
PAS +S +ES T SS A + + + A+ +T E
Sbjct: 65 PASSEEKQASAETESATKASTDNSPSSKADVAPLKEESKKADVPAVTAAAATTPAAEDAT 124
Query: 2130 QGVSPHSEKLSANEDPEEFPNEDVFEHTFAEIPNIDHSNQTDEAIPETFDAREE 2183
+P E+ +A EE + V E +E S Q +EA E A +E
Sbjct: 125 AKAAPQPEQETAESSQEEEKKDAVEETKPSE------SAQQEEAKEEEAKADQE 172
>gnl|CDD|215964 pfam00513, Late_protein_L2, Late Protein L2.
Length = 466
Score = 40.4 bits (95), Expect = 0.013
Identities = 21/120 (17%), Positives = 39/120 (32%), Gaps = 10/120 (8%)
Query: 1986 TTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASEST 2045
+ SLV ES+ S TTSS + + ++P + + S T
Sbjct: 100 SIVSLVEESSIIESGAPIPPIPGDGSGFPITTSSTTTPAILDVTPTTRTVHVS-----RT 154
Query: 2046 TTNNPKSESTTTNNPASES-----ITSSSPASESTTTSSPASESTTTSSPASESTTTSSP 2100
NNP + P + + S + + ++ S + +S+P
Sbjct: 155 QYNNPLFTDPSVLQPPQPAEVSGHVLVSGQTIGTHSYEEIPMDTFAVSEGTTPPPISSTP 214
Score = 38.0 bits (89), Expect = 0.069
Identities = 25/115 (21%), Positives = 35/115 (30%), Gaps = 3/115 (2%)
Query: 1934 STTTSSPESE-STTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVS 1992
S + E E S L + E T S TT
Sbjct: 327 SPIAPAEEIELQPLGEHSGDTSPVEDGLYDIYADPDPLDVELDTYSDDLLLDETTE--DF 384
Query: 1993 ESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTT 2047
++ S S ++TT + + S+T PV + PES TT P T
Sbjct: 385 STSQLVSSSSRTSTTNTTIPLSSTPDVPVYYGPDIVLPESPGTTPIVPVPPDLPT 439
Score = 36.9 bits (86), Expect = 0.15
Identities = 26/117 (22%), Positives = 36/117 (30%), Gaps = 7/117 (5%)
Query: 1928 SSLVSESTTTSSPESESTTTSSPESESTT-----TSSLVSESTTTSSPESESTTTSSPES 1982
S + P E + +SP + L E T S TT +
Sbjct: 327 SPIAPAEEIELQPLGEHSGDTSPVEDGLYDIYADPDPLDVELDTYSDDLLLDETTEDFST 386
Query: 1983 ESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSS 2039
+SS + +T T+ P S + PV P S TT I P T
Sbjct: 387 SQLVSSSSRTSTTNTTIPLSSTPDV--PVYYGPDIVLPESPGTTPIVPVPPDLPTVI 441
Score = 36.9 bits (86), Expect = 0.15
Identities = 30/130 (23%), Positives = 43/130 (33%), Gaps = 30/130 (23%)
Query: 1926 TTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTS-----SPESESTTTSSP 1980
+ SLV ES+ S TTS STTT +P + + S
Sbjct: 100 SIVSLVEESSIIESGAPIPPIPGDGSGFPITTS-----STTTPAILDVTPTTRTVHVSR- 153
Query: 1981 ESESTTTSSLVSE-STTTSSPESE-------STTTISPVS--ESTTTSSPVSESTTTISP 2030
+ + L ++ S +E S TI S E + VSE TT
Sbjct: 154 ---TQYNNPLFTDPSVLQPPQPAEVSGHVLVSGQTIGTHSYEEIPMDTFAVSEGTTP--- 207
Query: 2031 ESESTTTSSP 2040
+S+P
Sbjct: 208 ---PPISSTP 214
Score = 36.5 bits (85), Expect = 0.17
Identities = 28/152 (18%), Positives = 44/152 (28%), Gaps = 17/152 (11%)
Query: 1998 SSPESESTTTISPVSESTTTSSPVSESTTTISP-ESESTTTSSPASESTTTNNPKSESTT 2056
+ T +PV S V + +I ES+ S A + T
Sbjct: 71 GTRPVRVVGTGTPVRPPVVVESTVGPTDPSIVSLVEESSIIESGAPIPPIPGDGSGFPIT 130
Query: 2057 TNNPASESITSSSPASESTTTSSPASE-------STTTSSPASE-------STTTSSPAS 2102
T++ + +I +P + + S S +E S T S
Sbjct: 131 TSSTTTPAILDVTPTTRTVHVSRTQYNNPLFTDPSVLQPPQPAEVSGHVLVSGQTIGTHS 190
Query: 2103 ESTTTSSPESESTTTSSPASESTTIEEQGVSP 2134
+ S T+ P ST I GV
Sbjct: 191 YEEIPMDTFAVSEGTTPPPISSTPI--PGVRR 220
Score = 36.5 bits (85), Expect = 0.19
Identities = 22/101 (21%), Positives = 29/101 (28%), Gaps = 2/101 (1%)
Query: 1919 SPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTS 1978
S L + E T S TT + +SS + +T T+
Sbjct: 343 HSGDTSPVEDGLYDIYADPDPLDVELDTYSDDLLLDETTEDFSTSQLVSSSSRTSTTNTT 402
Query: 1979 SPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSS 2019
P S + PES TT I PV T
Sbjct: 403 IPLSSTPDVPVYYGPDIV--LPESPGTTPIVPVPPDLPTVI 441
Score = 34.6 bits (80), Expect = 0.83
Identities = 26/126 (20%), Positives = 42/126 (33%), Gaps = 11/126 (8%)
Query: 1994 STTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSE 2053
S + E E + P+ E + +SPV + I + +
Sbjct: 327 SPIAPAEEIE----LQPLGEHSGDTSPVEDGLYDIYADPDPLDVELDTYSDDLL-----L 377
Query: 2054 STTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESE 2113
TT + ++ + SSS + +T T+ P S +T P P S TT P
Sbjct: 378 DETTEDFSTSQLVSSSSRTSTTNTTIPLS--STPDVPVYYGPDIVLPESPGTTPIVPVPP 435
Query: 2114 STTTSS 2119
T
Sbjct: 436 DLPTVI 441
Score = 34.2 bits (79), Expect = 0.90
Identities = 20/125 (16%), Positives = 37/125 (29%), Gaps = 20/125 (16%)
Query: 1956 TTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSEST 2015
+ SLV ES+ S TTSS TT+ + ++P + +
Sbjct: 100 SIVSLVEESSIIESGAPIPPIPGDGSGFPITTSS------TTTPAILD----VTPTTRTV 149
Query: 2016 TTSSPVSE-------STTTISPESEST---TTSSPASESTTTNNPKSESTTTNNPASESI 2065
S S +E + S + + ++ + +
Sbjct: 150 HVSRTQYNNPLFTDPSVLQPPQPAEVSGHVLVSGQTIGTHSYEEIPMDTFAVSEGTTPPP 209
Query: 2066 TSSSP 2070
SS+P
Sbjct: 210 ISSTP 214
Score = 33.4 bits (77), Expect = 1.6
Identities = 28/165 (16%), Positives = 54/165 (32%), Gaps = 33/165 (20%)
Query: 1870 IIFTTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSS 1929
++ +T ++ ++V SL+ E++ S + TTSS TT+
Sbjct: 89 VVESTVGPTDPSIV-----SLVEESSIIESGAPIPPIPGDGSGFPITTSS------TTTP 137
Query: 1930 LVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSS 1989
+ + T TT + VS + + ++ + P+ +
Sbjct: 138 AILDVT------------------PTTRTVHVSRTQYNNPLFTDPSVLQPPQPAEVSGHV 179
Query: 1990 LVSEST--TTSSPESESTTTISPVSEST--TTSSPVSESTTTISP 2030
LVS T T S E T + +S+P+
Sbjct: 180 LVSGQTIGTHSYEEIPMDTFAVSEGTTPPPISSTPIPGVRRVARL 224
Score = 33.4 bits (77), Expect = 1.6
Identities = 22/116 (18%), Positives = 33/116 (28%), Gaps = 3/116 (2%)
Query: 1877 NSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTT 1936
+ + L + + + +P T S + T+ S S
Sbjct: 330 APAEEIELQPLGEHSGDTSPVEDGLYDIYADPDPLDVELDTYSDDLLLDETTEDFSTSQL 389
Query: 1937 TSSPESESTT-TSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLV 1991
SS STT T+ P S + PES TT P T +
Sbjct: 390 VSSSSRTSTTNTTIPLSSTPDVPVYYGPDIV--LPESPGTTPIVPVPPDLPTVIIH 443
Score = 32.3 bits (74), Expect = 4.2
Identities = 22/119 (18%), Positives = 34/119 (28%), Gaps = 13/119 (10%)
Query: 2018 SSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTT 2077
S + P E + +SP + + E T S TT
Sbjct: 327 SPIAPAEEIELQPLGEHSGDTSPVEDGLYDIYADPDPLDV-----ELDTYSDDLLLDETT 381
Query: 2078 SSPASESTTTSSPASESTTTSSPASESTTT--------SSPESESTTTSSPASESTTIE 2128
++ +SS + +T T+ P S + PES TT P
Sbjct: 382 EDFSTSQLVSSSSRTSTTNTTIPLSSTPDVPVYYGPDIVLPESPGTTPIVPVPPDLPTV 440
>gnl|CDD|219594 pfam07816, DUF1645, Protein of unknown function (DUF1645). These
sequences are derived from a number of hypothetical plant
proteins. The region in question is approximately 270
amino acids long. Some members of this family are
annotated as yeast pheromone receptor proteins AR781 but
no literature was found to support this.
Length = 191
Score = 39.1 bits (91), Expect = 0.013
Identities = 26/134 (19%), Positives = 51/134 (38%), Gaps = 23/134 (17%)
Query: 2009 SPVSESTTTSSPVSESTTTISPESESTTTSSPASEST--TTNNPKSESTTTNNPASESIT 2066
SP SES + + + ++SPE ++ S +++ P S +++ +S +
Sbjct: 36 SPRSESAFAPARLRRALRSLSPERGGGSSDSESTDEGELEGVPPSSYCVSSSPASSSRKS 95
Query: 2067 SSSPASE---------------------STTTSSPASESTTTSSPASESTTTSSPASEST 2105
SS+ +S+ P + + SSPAS S+ + ES+
Sbjct: 96 SSTGSSKRWRLSDLLLFRSASDGKDAFVFDAAKDPLLKYSPLSSPASPVKPASAKSRESS 155
Query: 2106 TTSSPESESTTTSS 2119
+ T S+
Sbjct: 156 ASKGKRRGKTVASA 169
Score = 36.0 bits (83), Expect = 0.13
Identities = 31/156 (19%), Positives = 56/156 (35%), Gaps = 19/156 (12%)
Query: 1979 SPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTS 2038
SP SES + + + + SPE ++ S ++ P S ++S
Sbjct: 36 SPRSESAFAPARLRRALRSLSPERGGGSSDSESTDEGELEGV--------PPSSYCVSSS 87
Query: 2039 SPASESTTTNNPKSE-----------STTTNNPASESITSSSPASESTTTSSPASESTTT 2087
+S +++ S+ S + A + P + + SSPAS
Sbjct: 88 PASSSRKSSSTGSSKRWRLSDLLLFRSASDGKDAFVFDAAKDPLLKYSPLSSPASPVKPA 147
Query: 2088 SSPASESTTTSSPASESTTTSSPESESTTTSSPASE 2123
S+ + ES+ + T S+ E T + A E
Sbjct: 148 SAKSRESSASKGKRRGKTVASAHELLYATNRAAAEE 183
Score = 33.3 bits (76), Expect = 0.85
Identities = 31/139 (22%), Positives = 48/139 (34%), Gaps = 23/139 (16%)
Query: 1919 SPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTS 1978
SP SES + + + + SPE ++ S ++ E SS S+ S
Sbjct: 36 SPRSESAFAPARLRRALRSLSPERGGGSSDSESTDEGE-----LEGVPPSSYCVSSSPAS 90
Query: 1979 SPESESTTTSS-------LV---SES--------TTTSSPESESTTTISPVSESTTTSSP 2020
S S+T SS L+ S S P + + SP S S+
Sbjct: 91 SSRKSSSTGSSKRWRLSDLLLFRSASDGKDAFVFDAAKDPLLKYSPLSSPASPVKPASAK 150
Query: 2021 VSESTTTISPESESTTTSS 2039
ES+ + T S+
Sbjct: 151 SRESSASKGKRRGKTVASA 169
Score = 32.5 bits (74), Expect = 1.9
Identities = 25/134 (18%), Positives = 44/134 (32%), Gaps = 13/134 (9%)
Query: 1899 SPESESTTTNNPESESTTTSSPESESTTTSSLVSEST--TTSSPESESTTTSSPESESTT 1956
SP SES + + SPE ++ S ++ P S ++S S +
Sbjct: 36 SPRSESAFAPARLRRALRSLSPERGGGSSDSESTDEGELEGVPPSSYCVSSSPASSSRKS 95
Query: 1957 TSS-----------LVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESEST 2005
+S+ L+ S + P + + SS S S+ ES+
Sbjct: 96 SSTGSSKRWRLSDLLLFRSASDGKDAFVFDAAKDPLLKYSPLSSPASPVKPASAKSRESS 155
Query: 2006 TTISPVSESTTTSS 2019
+ T S+
Sbjct: 156 ASKGKRRGKTVASA 169
Score = 31.3 bits (71), Expect = 4.1
Identities = 31/142 (21%), Positives = 55/142 (38%), Gaps = 9/142 (6%)
Query: 1949 SPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSES-TTTSSPESESTTT 2007
SP SES + + + + SPE ++ S ++ + S +SSP S
Sbjct: 36 SPRSESAFAPARLRRALRSLSPERGGGSSDSESTDEGELEGVPPSSYCVSSSPAS----- 90
Query: 2008 ISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITS 2067
S S+T SS + + S S A +P + + ++PAS +
Sbjct: 91 -SSRKSSSTGSSKRWRLSDLLLFRSAS--DGKDAFVFDAAKDPLLKYSPLSSPASPVKPA 147
Query: 2068 SSPASESTTTSSPASESTTTSS 2089
S+ + ES+ + T S+
Sbjct: 148 SAKSRESSASKGKRRGKTVASA 169
>gnl|CDD|221745 pfam12737, Mating_C, C-terminal domain of homeodomain 1. Mating in
fungi is controlled by the loci that determine the mating
type of an individual, and only individuals with
differing mating types can mate. Basidiomycete fungi have
evolved a unique mating system, termed tetrapolar or
bifactorial incompatibility, in which mating type is
determined by two unlinked loci; compatibility at both
loci is required for mating to occur. The multi-allelic
tetrapolar mating system is considered to be a novel
innovation that could have only evolved once, and is thus
unique to the mushroom fungi. This domain is C-terminal
to the homeodomain transcription factor region.
Length = 418
Score = 39.8 bits (93), Expect = 0.016
Identities = 52/298 (17%), Positives = 97/298 (32%), Gaps = 38/298 (12%)
Query: 1899 SPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTS 1958
SPE +P S SP S S V S T S+ + + +
Sbjct: 73 SPER------SPALSSERLLSPSP-SVLDLSPVLASPQTGKRRRSSSPSDDEDEAERPSK 125
Query: 1959 SLVSESTTTSSPESESTTTSSPESESTTTSSLVSES-----TTTSSPESESTTTISPVSE 2013
S+S ++SS ++ P ++T L S T + SP T T +P +
Sbjct: 126 RPRSDSISSSSSPAKPPEACLPSPAASTQDELSEASAAPLPTPSLSPPHTPTDT-APSGK 184
Query: 2014 STTTSSPVSESTTTISPESES-TTTSS---PASESTTTNNPKSESTTTNNPASESITSSS 2069
S + P++ S T S P +T + + +++ +
Sbjct: 185 RKRRLSDGFQLPAPKRPQTSSRPQTVSDPLPLHATTDWDTWFQATVSSSPSLLLTGDIPP 244
Query: 2070 PASESTTTSS----------PASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSS 2119
P S S P + +P + S+++S+ + T+SS
Sbjct: 245 PVSVFAPDDSTPLDISLFNFPLIPLLPPEALDL-----PAPTAVSSSSSTFAVPALTSSS 299
Query: 2120 PASESTTIEEQ----GVSPHSEKL-SANEDPEE-FPNEDVFEHTFAEIPNIDHSNQTD 2171
+T +++ G + +SE L N+ P+ P ++ +
Sbjct: 300 VDQSATPLDQGFSNFGSNMYSEPLNPTNDSLLYGLPSSSSLYANRTIFPAWASTSVSP 357
Score = 39.0 bits (91), Expect = 0.032
Identities = 49/266 (18%), Positives = 93/266 (34%), Gaps = 46/266 (17%)
Query: 1900 PESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTT--- 1956
P S+S ++++ ++ P ++T L SE++ P + +P + +
Sbjct: 127 PRSDSISSSSSPAKPPEACLPSPAASTQDEL-SEASAAPLPTPSLSPPHTPTDTAPSGKR 185
Query: 1957 --TSSLVSESTTTSSPESES--TTTSSPESESTTTSSLVSESTT-TSSPESESTTTIS-P 2010
S + P++ S T S P TT T +SSP T I P
Sbjct: 186 KRRLSDGFQLPAPKRPQTSSRPQTVSDPLPLHATTDWDTWFQATVSSSPSLLLTGDIPPP 245
Query: 2011 VS-------------------------ESTTTSSPVSESTTTISPESESTTTSSPASEST 2045
VS E+ +P + S+++ + + T+SS +T
Sbjct: 246 VSVFAPDDSTPLDISLFNFPLIPLLPPEALDLPAPTAVSSSSSTFAVPALTSSSVDQSAT 305
Query: 2046 TTNNPKSESTTTNNPASESITSSSPASESTT---TSSPASEST------TTSSPASESTT 2096
+ S + N SE + ++ + +S A+ + T+ SP ST
Sbjct: 306 PLDQGFSNFGS--NMYSEPLNPTNDSLLYGLPSSSSLYANRTIFPAWASTSVSPLDFSTL 363
Query: 2097 TSSPASESTTTSSPESESTTTSSPAS 2122
+ P+ + S + + TS
Sbjct: 364 FNQPSPSPMASQSILAPAQPTSPSPV 389
Score = 36.3 bits (84), Expect = 0.18
Identities = 42/263 (15%), Positives = 83/263 (31%), Gaps = 28/263 (10%)
Query: 1878 SESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTT-----TSSLVS 1932
S S+ L S +T SE++ P + +P + + S
Sbjct: 134 SSSSPAKPPEACLPSPAASTQDELSEASAAPLPTPSLSPPHTPTDTAPSGKRKRRLSDGF 193
Query: 1933 ESTTTSSPESES----------TTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPES 1982
+ P++ S ++ + S S + + +P+
Sbjct: 194 QLPAPKRPQTSSRPQTVSDPLPLHATTDWDTWFQATVSSSPSLLLTGDIPPPVSVFAPDD 253
Query: 1983 ESTTTSSL----VSESTTTSSPESESTTTISPVSES----TTTSSPVSESTTTISPESES 2034
+ SL + + + + T +S S + TSS V +S T + S
Sbjct: 254 STPLDISLFNFPLIPLLPPEALDLPAPTAVSSSSSTFAVPALTSSSVDQSATPLDQ-GFS 312
Query: 2035 TTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASES 2094
S+ SE N + +S + + ++T+ SP ST + P+
Sbjct: 313 NFGSNMYSEPLNPTNDSLLYGLPS-SSSLYANRTIFPAWASTSVSPLDFSTLFNQPSPSP 371
Query: 2095 TTTSS---PASESTTTSSPESES 2114
+ S PA ++ + S
Sbjct: 372 MASQSILAPAQPTSPSPVALPSS 394
>gnl|CDD|221391 pfam12042, RP1-2, Tubuliform egg casing silk strands structural
domain. Spiders use fibroins to make silk strands. This
family includes tubuliform silk fibroins which are used
to protect egg cases. This domain is a structural domain
which is found in repeats of up to 20 in many individuals
(although this is not necessarily the case). RP1 makes up
structural domains in the N terminal while RP2 makes up
structural domains in the C terminal.
Length = 167
Score = 38.3 bits (89), Expect = 0.017
Identities = 34/166 (20%), Positives = 75/166 (45%), Gaps = 12/166 (7%)
Query: 1952 SESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPES--ESTTTIS 2009
+ S SS S S+ ++ +S S+ +S S+ SS S S S + +S
Sbjct: 2 AASQAASSASSSSSASAFAQSLSSALASSSQFSSAFSSATSASAAGSLAYALGQSAARSL 61
Query: 2010 PVSESTTTSSPVSESTTTI----SPESESTTTSSPASESTTTN------NPKSESTTTNN 2059
+S ++ +S V+++ +++ S + + S+ + N S +++ +
Sbjct: 62 GLSNASALASAVAQAVSSVGVGASASAYANAISNAIGQFLAGQGVLNASNASSLASSFAS 121
Query: 2060 PASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASEST 2105
S S SS+ + S + ++ A++S +S S++ + SS S ++
Sbjct: 122 ALSASAASSAAQAASASAAAAAAQSQAAASAFSQAASQSSSQSAAS 167
>gnl|CDD|221145 pfam11596, DUF3246, Protein of unknown function (DUF3246). This is a
small family of fungal proteins one of whose members from
Pichia stipitis is described as being an extremely serine
rich protein-mucin-like protein.
Length = 208
Score = 38.6 bits (89), Expect = 0.018
Identities = 43/213 (20%), Positives = 80/213 (37%), Gaps = 12/213 (5%)
Query: 1926 TTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESEST 1985
T + S TT + + T+ S +TT+ + T S+ + + + E
Sbjct: 1 TVDPITSNDITTIGSSTVTITSGGSGSSVSTTAGSSTILPTGSATDDDDYDDEETDCEGQ 60
Query: 1986 TTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSE-STTTISPESESTTTSSPASES 2044
TT++ T T+ P ++ T+ P +TT + TTIS + TT +
Sbjct: 61 TTAN--PTGTVTTDPTGTTSQTVVPTKPTTTDDDDDTTCVETTISDPTTITTPTG----- 113
Query: 2045 TTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSP---ASESTTTSSPA 2101
T N + + TTN A+ ++ ++ T T + + +T + E+TT ++
Sbjct: 114 -TVNGNPTGTVTTNGTATTTVITTVEGVAVTYTGTGQTFTTDGTEDDEDCDETTTYTTTY 172
Query: 2102 SESTTTSSPESESTTTSSPASESTTIEEQGVSP 2134
TT T + T+ V
Sbjct: 173 YTPYTTVIHGGTVYTNGVTVIATHTVYPTDVED 205
Score = 35.1 bits (80), Expect = 0.28
Identities = 30/162 (18%), Positives = 58/162 (35%), Gaps = 4/162 (2%)
Query: 1876 NNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSEST 1935
+ S+ ++ T ++ ++ + E TT NP TT + + T + + +
Sbjct: 31 TTAGSSTILPTGSATDDDDYDDEETDCEGQTTANPTGTVTTDPTGTTSQTVVPTKPTTTD 90
Query: 1936 TTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSEST 1995
TT S P + +T T ++ T T + +TTT E + + T
Sbjct: 91 DDDDTTCVETTISDPTTITTPTGTVNGNPTGTVTTNGTATTTVITTVEGVAVTYTGTGQT 150
Query: 1996 TTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTT 2037
T+ + + +TT +P TT+ T
Sbjct: 151 FTTDGTEDDEDCDETTTYTTTYYTP----YTTVIHGGTVYTN 188
Score = 35.1 bits (80), Expect = 0.28
Identities = 40/202 (19%), Positives = 83/202 (41%), Gaps = 14/202 (6%)
Query: 1886 TLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESEST 1945
T++ + S + TT + + T+ S +TT+ + T S+ + + E
Sbjct: 1 TVDPITSNDITTIGSSTVTITSGGSGSSVSTTAGSSTILPTGSATDDDDYDDEETDCEGQ 60
Query: 1946 TTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESEST 2005
TT++P T T + TT+ + TT+ + ++T + +S+ TT ++P T
Sbjct: 61 TTANP----TGTVTTDPTGTTSQTVVPTKPTTTDDDDDTTCVETTISDPTTITTP----T 112
Query: 2006 TTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESI 2065
T++ T T++ + +TT I+ T + ++ TT+ + + E+
Sbjct: 113 GTVNGNPTGTVTTNGTA-TTTVITTVEGVAVTYTGTGQTFTTDGTEDDED-----CDETT 166
Query: 2066 TSSSPASESTTTSSPASESTTT 2087
T ++ TT T
Sbjct: 167 TYTTTYYTPYTTVIHGGTVYTN 188
Score = 30.5 bits (68), Expect = 7.7
Identities = 38/184 (20%), Positives = 66/184 (35%), Gaps = 15/184 (8%)
Query: 1879 ESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTS 1938
STV +++ S S +TT S T + + + + TT + T T+
Sbjct: 15 SSTVTITSGGSGSSVSTTAGSSTILPTGSATDDDDYDDEETDCEGQTTANPT---GTVTT 71
Query: 1939 SPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTS 1998
P ++ T P +TT + T+ + + TT + T ++ + T T
Sbjct: 72 DPTGTTSQTVVPTKPTTTDDDDDTTCVETTISDPTTITTPTGTVNGNPTGTVTTNGTAT- 130
Query: 1999 SPESESTTTISPVSESTTTSSPVSESTTTISPES-----ESTTTSSPASESTTTNNPKSE 2053
TT I+ V T + ++ TT E E+TT ++ TT
Sbjct: 131 ------TTVITTVEGVAVTYTGTGQTFTTDGTEDDEDCDETTTYTTTYYTPYTTVIHGGT 184
Query: 2054 STTT 2057
T
Sbjct: 185 VYTN 188
>gnl|CDD|218191 pfam04652, DUF605, Vta1 like. Vta1 (VPS20-associated protein 1) is a
positive regulator of Vps4. Vps4 is an ATPase that is
required in the multivesicular body (MVB) sorting pathway
to dissociate the endosomal sorting complex required for
transport (ESCRT). Vta1 promotes correct assembly of Vps4
and stimulates its ATPase activity through its conserved
Vta1/SBP1/LIP5 region.
Length = 315
Score = 39.3 bits (92), Expect = 0.021
Identities = 19/132 (14%), Positives = 44/132 (33%), Gaps = 8/132 (6%)
Query: 1981 ESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSP 2040
E E ++ S+++ ++ + S S+ P S S ++ P
Sbjct: 157 EDEDADVATTNSDNSFPGEDADPASASPSDPPSSSPGVPSFPSPPE--DPSSPSDSSLPP 214
Query: 2041 ASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSP 2100
A S ++ P + NP S P + +++ + + +++P
Sbjct: 215 APSSFQSDTPPPSPESPTNP------SPPPGPAAPPPPPVQQVPPLSTAKPTPPSASATP 268
Query: 2101 ASESTTTSSPES 2112
A T ++
Sbjct: 269 APIGGITLDDDA 280
>gnl|CDD|240430 PTZ00473, PTZ00473, Plasmodium Vir superfamily; Provisional.
Length = 420
Score = 39.4 bits (92), Expect = 0.024
Identities = 22/85 (25%), Positives = 31/85 (36%), Gaps = 3/85 (3%)
Query: 1915 TTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESES 1974
S S T S SES + +S +T S S T S+ S +T +
Sbjct: 321 NYGGQFNSRSGRTGS--SESIRGFTYDSSTTYGGSSYGTSQTDSTSTYGSRSTFDSSTGG 378
Query: 1975 TTTSSPESESTTTSSLVSESTTTSS 1999
+ S S + SS S+ SS
Sbjct: 379 GSQSGGGS-TYGGSSTFDGSSRGSS 402
Score = 38.7 bits (90), Expect = 0.039
Identities = 22/89 (24%), Positives = 33/89 (37%), Gaps = 4/89 (4%)
Query: 1932 SESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLV 1991
S S T S SES + +S +T S S T S+ S +T + + S
Sbjct: 328 SRSGRTGS--SESIRGFTYDSSTTYGGSSYGTSQTDSTSTYGSRSTFDSSTGGGSQSG-- 383
Query: 1992 SESTTTSSPESESTTTISPVSESTTTSSP 2020
ST S + ++ S S + P
Sbjct: 384 GGSTYGGSSTFDGSSRGSSDSFGVSYFGP 412
Score = 37.5 bits (87), Expect = 0.075
Identities = 24/105 (22%), Positives = 37/105 (35%), Gaps = 6/105 (5%)
Query: 1895 TTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESES 1954
S S T + SES + +S +T S S T S+ S +T +
Sbjct: 321 NYGGQFNSRSGRTGS--SESIRGFTYDSSTTYGGSSYGTSQTDSTSTYGSRSTFDSSTGG 378
Query: 1955 TTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSS 1999
+ S ST S + ++ S S+S S + T S
Sbjct: 379 GSQSG--GGSTYGGSSTFDGSSRGS--SDSFGVSYFGPQQTVGFS 419
Score = 37.1 bits (86), Expect = 0.11
Identities = 23/87 (26%), Positives = 33/87 (37%), Gaps = 5/87 (5%)
Query: 2052 SESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPE 2111
S S T +SESI + S +T S S T S+ S +T ++ + S
Sbjct: 328 SRSGRTG--SSESIRGFTYDSSTTYGGSSYGTSQTDSTSTYGSRSTFDSSTGGGSQSGGG 385
Query: 2112 SESTTTSSPASESTTIEEQ--GVSPHS 2136
S + SS S+ GVS
Sbjct: 386 S-TYGGSSTFDGSSRGSSDSFGVSYFG 411
Score = 37.1 bits (86), Expect = 0.12
Identities = 21/98 (21%), Positives = 33/98 (33%), Gaps = 4/98 (4%)
Query: 1945 TTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESES 2004
S S T S SES + +S +T S S T S+ S +T +
Sbjct: 321 NYGGQFNSRSGRTGS--SESIRGFTYDSSTTYGGSSYGTSQTDSTSTYGSRSTFDSSTGG 378
Query: 2005 TTTISPVSESTTTSSPVSESTTTISPESESTTTSSPAS 2042
+ ST S + ++ S +S + P
Sbjct: 379 GSQSG--GGSTYGGSSTFDGSSRGSSDSFGVSYFGPQQ 414
Score = 37.1 bits (86), Expect = 0.13
Identities = 21/91 (23%), Positives = 34/91 (37%), Gaps = 4/91 (4%)
Query: 1892 SENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPE 1951
S + T S ES T +S +T S S T S+ S +T + + S
Sbjct: 328 SRSGRTGSSESIRGFTY--DSSTTYGGSSYGTSQTDSTSTYGSRSTFDSSTGGGSQSG-- 383
Query: 1952 SESTTTSSLVSESTTTSSPESESTTTSSPES 1982
ST S + ++ S +S + P+
Sbjct: 384 GGSTYGGSSTFDGSSRGSSDSFGVSYFGPQQ 414
Score = 36.0 bits (83), Expect = 0.25
Identities = 22/91 (24%), Positives = 34/91 (37%), Gaps = 4/91 (4%)
Query: 1962 SESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPV 2021
S S T S SES + +S +T S S T S+ S +T + + S
Sbjct: 328 SRSGRTGS--SESIRGFTYDSSTTYGGSSYGTSQTDSTSTYGSRSTFDSSTGGGSQSG-- 383
Query: 2022 SESTTTISPESESTTTSSPASESTTTNNPKS 2052
ST S + ++ S S + P+
Sbjct: 384 GGSTYGGSSTFDGSSRGSSDSFGVSYFGPQQ 414
Score = 35.6 bits (82), Expect = 0.36
Identities = 21/85 (24%), Positives = 29/85 (34%), Gaps = 3/85 (3%)
Query: 1935 TTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSES 1994
S S T S SES + S +T S S T S+ S +T +
Sbjct: 321 NYGGQFNSRSGRTGS--SESIRGFTYDSSTTYGGSSYGTSQTDSTSTYGSRSTFDSSTGG 378
Query: 1995 TTTSSPESESTTTISPVSESTTTSS 2019
+ S S + S S+ SS
Sbjct: 379 GSQSGGGS-TYGGSSTFDGSSRGSS 402
Score = 34.8 bits (80), Expect = 0.55
Identities = 19/85 (22%), Positives = 29/85 (34%), Gaps = 3/85 (3%)
Query: 2035 TTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASES 2094
S S T + +S T S + S S T S+ S +T ++
Sbjct: 321 NYGGQFNSRSGRTGSSESIRGFTY--DSSTTYGGSSYGTSQTDSTSTYGSRSTFDSSTGG 378
Query: 2095 TTTSSPASESTTTSSPESESTTTSS 2119
+ S S + SS S+ SS
Sbjct: 379 GSQSGGGS-TYGGSSTFDGSSRGSS 402
Score = 32.9 bits (75), Expect = 2.2
Identities = 26/88 (29%), Positives = 35/88 (39%), Gaps = 13/88 (14%)
Query: 2012 SESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPA 2071
S S T S SES + +S +T S S T +ST+T S S SS
Sbjct: 328 SRSGRTGS--SESIRGFTYDSSTTYGGSSYGTSQT------DSTSTYG--SRSTFDSS-- 375
Query: 2072 SESTTTSSPASESTTTSSPASESTTTSS 2099
+ + S S + SS S+ SS
Sbjct: 376 TGGGSQSGGGS-TYGGSSTFDGSSRGSS 402
Score = 32.9 bits (75), Expect = 2.2
Identities = 28/98 (28%), Positives = 38/98 (38%), Gaps = 6/98 (6%)
Query: 1992 SESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPK 2051
S S T S ES T S +T S S T +S ST S +S+T +
Sbjct: 328 SRSGRTGSSESIRGFTYD--SSTTYGGSSYGTSQT----DSTSTYGSRSTFDSSTGGGSQ 381
Query: 2052 SESTTTNNPASESITSSSPASESTTTSSPASESTTTSS 2089
S +T +S SS +S+S S + T S
Sbjct: 382 SGGGSTYGGSSTFDGSSRGSSDSFGVSYFGPQQTVGFS 419
Score = 32.9 bits (75), Expect = 2.5
Identities = 21/95 (22%), Positives = 31/95 (32%), Gaps = 2/95 (2%)
Query: 1995 TTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSES 2054
S S T S SES + S +T S S T S+ S +T + +
Sbjct: 321 NYGGQFNSRSGRTGS--SESIRGFTYDSSTTYGGSSYGTSQTDSTSTYGSRSTFDSSTGG 378
Query: 2055 TTTNNPASESITSSSPASESTTTSSPASESTTTSS 2089
+ + S SS+ S +S S
Sbjct: 379 GSQSGGGSTYGGSSTFDGSSRGSSDSFGVSYFGPQ 413
>gnl|CDD|221734 pfam12722, Hid1, High-temperature-induced dauer-formation protein.
Hid1 (high-temperature-induced dauer-formation protein 1)
represents proteins of approximately 800 residues long
and is conserved from fungi to humans. It contains up to
seven potential transmembrane domains separated by
regions of low complexity. Functionally it might be
involved in vesicle secretion or be an inter-cellular
signalling protein or be a novel insulin receptor.
Length = 813
Score = 39.7 bits (93), Expect = 0.025
Identities = 24/143 (16%), Positives = 48/143 (33%), Gaps = 14/143 (9%)
Query: 1972 SESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPE 2031
SS E + + S + S +++S + + S V E +++
Sbjct: 572 RNLILDSSQEEDERSNQSASGSLSDNPSNDNDS---------RSPSLSEVPEENKSLAIT 622
Query: 2032 SESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPA 2091
+ PAS +T+ + + + P S ++ S S + + P
Sbjct: 623 DDF----DPASRENSTSEAAAPPSVNSVPLQLQGPSEKDRGKNPAGSLAFSRLNSATRPK 678
Query: 2092 SESTTTSSPASESTTTSSPESES 2114
S +S + E +S ES
Sbjct: 679 WPSGLSSK-SKEKFPPTSDWVES 700
>gnl|CDD|144541 pfam00985, MSA_2, Merozoite Surface Antigen 2 (MSA-2) family.
Length = 171
Score = 37.6 bits (86), Expect = 0.033
Identities = 26/142 (18%), Positives = 53/142 (37%), Gaps = 8/142 (5%)
Query: 2042 SESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPA 2101
+ +TTT N ST+T++ + + SP +++ + S S
Sbjct: 1 TTTTTTTNDAEASTSTSSENPNHNNAETNPKGEGEVQSP-NQANKETQNNSNVQQDSQTK 59
Query: 2102 SESTTTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSANEDPEEFPNEDVFEHTFAEI 2161
S T +++S T +E++ +P +E+ + E N+ +H
Sbjct: 60 SNVPETQDADTKSPTAQPEQAENS-------APTAEQTESPELQSAPENKGTGQHGHMHG 112
Query: 2162 PNIDHSNQTDEAIPETFDAREE 2183
+H T ++ E D +E
Sbjct: 113 SRNNHPQNTSDSQKECTDGNKE 134
Score = 36.4 bits (83), Expect = 0.080
Identities = 18/93 (19%), Positives = 44/93 (47%), Gaps = 2/93 (2%)
Query: 1892 SENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPE 1951
+ TTTN E+ ++T++ + + ++P+ E S +++ + S S +
Sbjct: 2 TTTTTTNDAEASTSTSSENPNHNNAETNPKGEGEVQSP--NQANKETQNNSNVQQDSQTK 59
Query: 1952 SESTTTSSLVSESTTTSSPESESTTTSSPESES 1984
S T ++S T ++E++ ++ ++ES
Sbjct: 60 SNVPETQDADTKSPTAQPEQAENSAPTAEQTES 92
Score = 36.4 bits (83), Expect = 0.088
Identities = 22/105 (20%), Positives = 45/105 (42%), Gaps = 3/105 (2%)
Query: 1902 SESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLV 1961
+ +TTTN+ E+ ++T+S + + ++ E S ++ + S S
Sbjct: 2 TTTTTTNDAEASTSTSSENPNHNNAETNPKGEGEVQSPN--QANKETQNNSNVQQDSQTK 59
Query: 1962 SESTTTSSPESESTTTSSPESE-STTTSSLVSESTTTSSPESEST 2005
S T +++S T ++E S T+ S+PE++ T
Sbjct: 60 SNVPETQDADTKSPTAQPEQAENSAPTAEQTESPELQSAPENKGT 104
Score = 36.0 bits (82), Expect = 0.11
Identities = 24/123 (19%), Positives = 47/123 (38%), Gaps = 1/123 (0%)
Query: 1942 SESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPE 2001
+ +TTT++ ST+TSS + SP +++ + S S +
Sbjct: 1 TTTTTTTNDAEASTSTSSENPNHNNAETNPKGEGEVQSP-NQANKETQNNSNVQQDSQTK 59
Query: 2002 SESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPA 2061
S T ++S T +E++ + ++ES S T + + N+P
Sbjct: 60 SNVPETQDADTKSPTAQPEQAENSAPTAEQTESPELQSAPENKGTGQHGHMHGSRNNHPQ 119
Query: 2062 SES 2064
+ S
Sbjct: 120 NTS 122
Score = 34.9 bits (79), Expect = 0.26
Identities = 22/105 (20%), Positives = 45/105 (42%), Gaps = 2/105 (1%)
Query: 1912 SESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPE 1971
+ +TTT++ ST+TSS + SP +++ + S S +
Sbjct: 1 TTTTTTTNDAEASTSTSSENPNHNNAETNPKGEGEVQSP-NQANKETQNNSNVQQDSQTK 59
Query: 1972 SESTTTSSPESESTTTSSLVSESTTTSSPESESTTTIS-PVSEST 2015
S T +++S T +E++ ++ ++ES S P ++ T
Sbjct: 60 SNVPETQDADTKSPTAQPEQAENSAPTAEQTESPELQSAPENKGT 104
Score = 34.1 bits (77), Expect = 0.46
Identities = 28/113 (24%), Positives = 51/113 (45%), Gaps = 13/113 (11%)
Query: 1873 TTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVS 1932
TT N++E++ S+ N + T E E + N E+ S+ + +S T S++
Sbjct: 5 TTTNDAEASTSTSSENPNHNNAETNPKGEGEVQSPNQANKETQNNSNVQQDSQTKSNV-- 62
Query: 1933 ESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESEST 1985
PE++ T SP ++ + + T SPE + S+PE++ T
Sbjct: 63 -------PETQDADTKSPTAQPEQAENSAPTAEQTESPELQ----SAPENKGT 104
Score = 33.3 bits (75), Expect = 0.93
Identities = 21/105 (20%), Positives = 45/105 (42%), Gaps = 2/105 (1%)
Query: 1992 SESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPK 2051
+ +TTT++ ST+T S + SP +++ + S + K
Sbjct: 1 TTTTTTTNDAEASTSTSSENPNHNNAETNPKGEGEVQSP-NQANKETQNNSNVQQDSQTK 59
Query: 2052 SESTTTNNPASESITSSSPASESTTTSSPASEST-TTSSPASEST 2095
S T + ++S T+ +E++ ++ +ES S+P ++ T
Sbjct: 60 SNVPETQDADTKSPTAQPEQAENSAPTAEQTESPELQSAPENKGT 104
Score = 32.6 bits (73), Expect = 1.7
Identities = 22/105 (20%), Positives = 42/105 (40%), Gaps = 2/105 (1%)
Query: 1982 SESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPA 2041
+ +TTT++ ST+TSS + SP +++ S S
Sbjct: 1 TTTTTTTNDAEASTSTSSENPNHNNAETNPKGEGEVQSP-NQANKETQNNSNVQQDSQTK 59
Query: 2042 SESTTTNNPKSESTTTNNPASESITSSSPASEST-TTSSPASEST 2085
S T + ++S T +E+ ++ +ES S+P ++ T
Sbjct: 60 SNVPETQDADTKSPTAQPEQAENSAPTAEQTESPELQSAPENKGT 104
Score = 31.8 bits (71), Expect = 2.7
Identities = 21/105 (20%), Positives = 43/105 (40%), Gaps = 2/105 (1%)
Query: 1922 SESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPE 1981
+ +TTT++ ST+TSS + S +++ + S S +
Sbjct: 1 TTTTTTTNDAEASTSTSSENPNHNNAETNPKGEGEVQS-PNQANKETQNNSNVQQDSQTK 59
Query: 1982 SESTTTSSLVSESTTTSSPESESTTTISPVSEST-TTSSPVSEST 2025
S T ++S T ++E++ + +ES S+P ++ T
Sbjct: 60 SNVPETQDADTKSPTAQPEQAENSAPTAEQTESPELQSAPENKGT 104
Score = 31.0 bits (69), Expect = 4.8
Identities = 25/106 (23%), Positives = 48/106 (45%), Gaps = 4/106 (3%)
Query: 2012 SESTTTSSPVSESTTTISP-ESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSP 2070
+ +TTT++ ST+T S + + ++P E + ++ T NN S S
Sbjct: 1 TTTTTTTNDAEASTSTSSENPNHNNAETNPKGEGEVQSPNQANKETQNN--SNVQQDSQT 58
Query: 2071 ASESTTTSSPASESTTTSSPASESTTTSSPASEST-TTSSPESEST 2115
S T ++S T +E++ ++ +ES S+PE++ T
Sbjct: 59 KSNVPETQDADTKSPTAQPEQAENSAPTAEQTESPELQSAPENKGT 104
>gnl|CDD|220267 pfam09494, Slx4, Slx4 endonuclease. The Slx4 protein is a
heteromeric structure-specific endonuclease found in
fungi. Slx4 with Slx1 acts as a nuclease on branched DNA
substrates, particularly simple-Y, 5'-flap, or
replication fork structures by cleaving the strand
bearing the 5' non-homologous arm at the branch junction
and thus generating ligatable nicked products from
5'-flap or replication fork substrates.
Length = 627
Score = 39.2 bits (91), Expect = 0.034
Identities = 53/288 (18%), Positives = 90/288 (31%), Gaps = 18/288 (6%)
Query: 1895 TTTNSPESESTTTNNPESESTTTSSPESESTT----TSSLVSESTTTSSPESESTTTSSP 1950
+ + P T + + + + T S E + ES+S S P
Sbjct: 197 SASQLPPDTELTDEDLQWLYDLDDEQMANDNSPLVMTLSQTMEDQSAIEKESDSYIDSEP 256
Query: 1951 ESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISP 2010
S T + + S SS + ST S + + S+S + IS
Sbjct: 257 NSSITEPYDHDIQVKNSEPEFKPSNEISSHQVNSTDNESSIISFPLHIADSSDSVSEISL 316
Query: 2011 VSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSS-- 2068
T S P S +T P S+P ++ K + + S S S+
Sbjct: 317 ----TEPSRPQSIDSTIEPPIEIPRKMSTPFFTPRSSILDKHIELSQD---SFSAVSTAT 369
Query: 2069 SPASESTTTS-----SPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASE 2123
SP S+ T+TS P +S T + + TS E S E
Sbjct: 370 SPFKVSSAQIINSDGDVPLTRTSTSIPTRQSGTAAYKKRKKLNTSRYEISSKLRVKDYQE 429
Query: 2124 STTIEEQGVSPHSEKLSANEDPEEFPNEDVFEHTFAEIPNIDHSNQTD 2171
T + + K ++ E + + + + I I ++
Sbjct: 430 DKTNNKAKLLKEETKRLPVDNLNEIADSESDDDSSLSIIEIVDTSVLQ 477
>gnl|CDD|220271 pfam09507, CDC27, DNA polymerase subunit Cdc27. This protein forms
the C subunit of DNA polymerase delta. It carries the
essential residues for binding to the Pol1 subunit of
polymerase alpha, from residues 293-332, which are
characterized by the motif D--G--VT, referred to as the
DPIM motif. The first 160 residues of the protein form
the minimal domain for binding to the B subunit, Cdc1, of
polymerase delta, the final 10 C-terminal residues,
362-372, being the DNA sliding clamp, PCNA, binding
motif.
Length = 427
Score = 38.7 bits (90), Expect = 0.036
Identities = 30/205 (14%), Positives = 56/205 (27%), Gaps = 20/205 (9%)
Query: 1900 PESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSS 1959
+ +PE + + + S T++ E T + ++ +P +S SS
Sbjct: 164 SSKPPKSIMSPEVKVKSAKKTQDTSKETTT---EKTEGKTSVKAASLKRNPPKKSNIMSS 220
Query: 1960 LVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSS 2019
+ T + E++ ++ E ES ES + + + + E
Sbjct: 221 FFKKKTKEKKEKKEASESTVKE-ESE------EESGKRDVILEDESAEPTGLDEDEDEDE 273
Query: 2020 PVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTT-TS 2078
P + S E + E I SP E +
Sbjct: 274 PKPSGERSDSEEETEEKEKEKRKRLKKMMEDEDED------EEMEIVPESPVEEEESEEP 327
Query: 2079 SPASESTTTSSPASESTTTSSPASE 2103
P + T SP
Sbjct: 328 EPPPLPKK---EEEKEEVTVSPDGG 349
Score = 37.5 bits (87), Expect = 0.076
Identities = 28/155 (18%), Positives = 53/155 (34%), Gaps = 13/155 (8%)
Query: 1998 SSPESESTTT--ISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSEST 2055
++P + T + PV+ + + + + + S + SP + + + S
Sbjct: 131 TNPNVKRRTGVGLPPVAPAASPALKPTANGKRPS-SKPPKSIMSPEVKVKSAKKTQDTSK 189
Query: 2056 TTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSP-ESES 2114
T +E TS AS +P +S SS + T E++ ++ ESE
Sbjct: 190 ETTTEKTEGKTSVKAAS---LKRNPPKKSNIMSSFFKKKTKEKKEKKEASESTVKEESEE 246
Query: 2115 TTTSSPASESTTIEEQGVSPHSEKLSANEDPEEFP 2149
+ S L +ED +E
Sbjct: 247 ESGKRDVILEDE------SAEPTGLDEDEDEDEPK 275
Score = 37.5 bits (87), Expect = 0.085
Identities = 28/151 (18%), Positives = 50/151 (33%), Gaps = 6/151 (3%)
Query: 1998 SSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTT 2057
P + + + + + S + +SPE + + S T K+E T+
Sbjct: 143 LPPVAPAASPALKPTANGKRPS-SKPPKSIMSPEVKVKSAKKTQDTSKETTTEKTEGKTS 201
Query: 2058 NNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTT 2117
AS +P +S SS + T E++ ++ ES S
Sbjct: 202 VKAAS---LKRNPPKKSNIMSSFFKKKTKEKKEKKEASESTVKE-ESEEESGKRDVILED 257
Query: 2118 SSPASESTTIEEQGVSPH-SEKLSANEDPEE 2147
S +E P S + S +E+ E
Sbjct: 258 ESAEPTGLDEDEDEDEPKPSGERSDSEEETE 288
>gnl|CDD|191251 pfam05283, MGC-24, Multi-glycosylated core protein 24 (MGC-24). This
family consists of several MGC-24 (or Cd164 antigen)
proteins from eukaryotic organisms. MGC-24/CD164 is a
sialomucin expressed in many normal and cancerous
tissues. In humans, soluble and transmembrane forms of
MGC-24 are produced by alternative splicing.
Length = 187
Score = 37.7 bits (87), Expect = 0.038
Identities = 31/151 (20%), Positives = 49/151 (32%), Gaps = 23/151 (15%)
Query: 1990 LVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPES------------ESTTT 2037
L + S P I TS V T PE +
Sbjct: 18 LAAGSNWAQLPNVTKGARIFG-----RTSLLVLNVWLTTYPEGCEHLNSCVSCVNRTHNN 72
Query: 2038 SSPASESTTTNNPK---SESTTTNNPASESITSSSPASESTT---TSSPASESTTTSSPA 2091
S+ + P S++ + T+ S + +TT T+S A + T S
Sbjct: 73 STCVWQQCGPEEPGYCSSQAEVVKSGCQIYNTTDSCSVATTTPVPTNSTAKPTITPSPTT 132
Query: 2092 SESTTTSSPASESTTTSSPESESTTTSSPAS 2122
S TS P + +T T + + + +T AS
Sbjct: 133 SHHHVTSEPKTNTTVTPTSQPDRKSTFDAAS 163
Score = 36.9 bits (85), Expect = 0.060
Identities = 36/151 (23%), Positives = 56/151 (37%), Gaps = 13/151 (8%)
Query: 1960 LVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPES--ESTTTISPVSESTTT 2017
L + S P TS LV T+ PE + +S V+ +
Sbjct: 18 LAAGSNWAQLPNVTKGARIFG-----RTSLLVLNVWLTTYPEGCEHLNSCVSCVNRTHNN 72
Query: 2018 SSPVSESTTTISPE---SESTTTSSPASESTTTNNPKSESTT---TNNPASESITSSSPA 2071
S+ V + P S++ S TT++ +TT TN+ A +IT S
Sbjct: 73 STCVWQQCGPEEPGYCSSQAEVVKSGCQIYNTTDSCSVATTTPVPTNSTAKPTITPSPTT 132
Query: 2072 SESTTTSSPASESTTTSSPASESTTTSSPAS 2102
S TS P + +T T + + +T AS
Sbjct: 133 SHHHVTSEPKTNTTVTPTSQPDRKSTFDAAS 163
Score = 31.1 bits (70), Expect = 4.5
Identities = 23/84 (27%), Positives = 36/84 (42%), Gaps = 15/84 (17%)
Query: 1922 SESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPE 1981
S++ S TT S S T++P ++T ++ S TTS TS P+
Sbjct: 90 SQAEVVKSGCQIYNTTDSC---SVATTTPVPTNSTAKPTITPSPTTSH----HHVTSEPK 142
Query: 1982 SESTTTSSLVSESTTTSSPESEST 2005
+ +T T TS P+ +ST
Sbjct: 143 TNTTVTP--------TSQPDRKST 158
>gnl|CDD|215299 PLN02543, PLN02543, pfkB-type carbohydrate kinase family protein.
Length = 496
Score = 38.7 bits (90), Expect = 0.039
Identities = 18/93 (19%), Positives = 33/93 (35%), Gaps = 8/93 (8%)
Query: 2060 PASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSS 2119
SI S P ST ++ + + T+S P +++T + +++
Sbjct: 40 SLHPSIKRSRPGRCSTNGAAVPESPKPSRRGRKKKPTSSPPKAKTTRRRTKKTDQELDPE 99
Query: 2120 PASESTTIEEQGVSPHSEKLSANEDPEEFPNED 2152
A E E G +D +FP +D
Sbjct: 100 GAEEDQEAAEDG--------EDYDDGIDFPYDD 124
>gnl|CDD|234371 TIGR03839, termin_org_P1, adhesin P1. Members of this protein family
are the major adhesin of the Mycoplasma terminal
organelle. The protein is called adhesin P1, cytadhesin
P1, P140, attachment protein, and MgPa, with locus names
MG191 in Mycoplasma genitalium and MPN141 in M.
pneumoniae. A conserved C-terminal region is shared by
additional paralogs in M. pneumoniae and M.
gallisepticum, as well as by the member of this family
[Cell envelope, Surface structures, Cellular processes,
Pathogenesis].
Length = 1425
Score = 38.6 bits (89), Expect = 0.048
Identities = 40/256 (15%), Positives = 76/256 (29%), Gaps = 15/256 (5%)
Query: 1819 PGAEFLIQCQYCDFDSSMNLLSVSPYITNNLLISMLAATAVAISVIDNYSEIIFTTN-NN 1877
PGA L++ + + S ++ A+T AI D Y ++ +
Sbjct: 69 PGALVLVRSK-SAKGITAGSGSQQTTYPTRTEAALTASTTFAIRRYDLYGRALYDFDPGK 127
Query: 1878 SESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTT----TSSPESESTTTSSLVSE 1933
L + N T S N E +S T PE + LV +
Sbjct: 128 LNPQTPTRDLTGKVGFNPFTGFGLSGDAPFNWNELKSKVPVEVTQDPEDPNVFYVLLVPD 187
Query: 1934 STTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSE 1993
+ E + + + S++ T +E T +S+
Sbjct: 188 AAVQYEQLQRGLQEQKTEDQVFESYFGAMFGLKVKNAMSDAPKTGEKLAEGTASSAGSGS 247
Query: 1994 STTTSSPESESTTTISPV---------SESTTTSSPVSESTTTISPESESTTTSSPASES 2044
S++ + + + T + S T + T I S+S +
Sbjct: 248 SSSAAGGGAVAPTAAKALKREVEEGSSSGMGTMLPKNDTAETPIKYNSDSGKIVKLKALL 307
Query: 2045 TTTNNPKSESTTTNNP 2060
+T + +S + P
Sbjct: 308 DSTESSESINGGRWRP 323
>gnl|CDD|227270 COG4934, COG4934, Predicted protease [Posttranslational modification,
protein turnover, chaperones].
Length = 1174
Score = 38.6 bits (90), Expect = 0.050
Identities = 46/380 (12%), Positives = 103/380 (27%), Gaps = 50/380 (13%)
Query: 1764 INSVSPNVTSKILTTDNYSEIIFTTNNNSESTVVMSTLNSLLSENEKLFKPHAKTPGAEF 1823
+ S +P T+ + ++ + ++ N + T ++
Sbjct: 759 VISYAPPFTTGLFLSNGTAYTVYWNGNLIAESNGTLTPQTIQFNTTYSGSNTVT-----N 813
Query: 1824 LIQCQYCDFDSSMNLLSVSPYITNNLLISMLAATAVAIS----VIDNYSEIIFTTNNNSE 1879
Q + Y + I + + + F +
Sbjct: 814 QTIPQVGLLIPLFKFVYGYYYSSAIATIDAKYVFNEGNGPGAYIYVGSTPLYFFSAIIYP 873
Query: 1880 STVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSS--LVSESTTT 1937
+++ N + + +T +S S T + ++
Sbjct: 874 NSLS---YNIYVIGSIAIIPLPYNATLLEWVGPAIIPLTSSGSNFTFSFGYYVIQFPPGI 930
Query: 1938 SSPESESTTTSSPESESTTTSSLVSESTT---TSSPESEST-------TTSSPESESTTT 1987
+ + S + S VS +S P S T + T
Sbjct: 931 YTINTSIPGLDPYSSLINSKSGTVSNLQIYFLSSVPTSGLTGKSSDGGIKNFVIDVLVNT 990
Query: 1988 SSLVSESTTTSSP---ESESTTTISPVS---------------ESTTTSSPVSESTTTIS 2029
+ + + + T + S S TIS S T+ +S +S ++S
Sbjct: 991 NGISAINNGTGNYYVIASVSNGTISFSSQIYGKDVYNITVAEGNITSVNSALSNLIVSLS 1050
Query: 2030 ----PESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASEST 2085
P ++ S E N+ T + + S+SP+ +T ++
Sbjct: 1051 STTVPIIKNVLPSLVYGEYNIINS----YTGNDFGVITIVISNSPSGSYPSTLYNTDQTQ 1106
Query: 2086 TTSSPASESTTTSSPASEST 2105
T+S +S + +
Sbjct: 1107 TSSYISSTLPAHNYIINLIL 1126
>gnl|CDD|221188 pfam11725, AvrE, Pathogenicity factor. This family is secreted by
gram-negative Gammaproteobacteria such as Pseudomonas
syringae of tomato and the fire blight plant pathogen
Erwinia amylovora, amongst others. It is an essential
pathogenicity factor of approximately 198 kDa. Its
injection into the host-plant is dependent upon the
bacterial type III or Hrp secretion system. The family is
long and carries a number of predicted functional
regions, including an ERMS or endoplasmic reticulum
membrane retention signal at both the C- and the
N-termini, a leucine-zipper motif from residues 539-560,
and a nuclear localisation signal at 1358-1361. this
conserved AvrE-family of effectors is among the few that
are required for full virulence of many phytopathogenic
pseudomonads, erwinias and pantoeas.
Length = 1771
Score = 38.6 bits (90), Expect = 0.056
Identities = 33/253 (13%), Positives = 65/253 (25%), Gaps = 17/253 (6%)
Query: 1886 TLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESEST 1945
L S+ + T PE + + P ++ SP ++ SL SE +
Sbjct: 2 QLISINTATKTAVQPE-ATPSAGAPTGLQQSSESPTQRASH--SLASEGKKNRKKMPKVF 58
Query: 1946 TTSSPESESTTTSSLVSESTTTSSPESESTTTSS----PES-------ESTTTSSLVSES 1994
SS + T + S T PE ES+ ++ ++ S
Sbjct: 59 QKSSAPRQIQAAPPQALNPTAAAPQSSRGPTLRELLALPEDDGETQAPESSPSARRLTRS 118
Query: 1995 TTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSES 2054
+ E E V + S + S S +
Sbjct: 119 EGVARHEMEDLAGRPVVKPDADRQLRQDILNKSSSSRRPPVSKEEGTSSKMPATALASAA 178
Query: 2055 TTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESES 2114
++ + + ++ + + S + + P + E E
Sbjct: 179 LFKDDEIRQEVDAARS--DQASQSRLSRSRGNPPAI-PPDAAPRQPMLTRSAGGRFEGED 235
Query: 2115 TTTSSPASESTTI 2127
+ I
Sbjct: 236 ENLERNLQPQSPI 248
>gnl|CDD|234383 TIGR03895, protease_PatA, cyanobactin maturation protease, PatA/PatG
family. This model describes a protease domain
associated with the maturation of various members of the
cyanobactin family of ribosomally produced, heavily
modified bioactive metabolites. Members include the PatA
protein and C-terminal domain of the PatG protein of
Prochloron didemni, TenA and a region of TenG from Nostoc
spongiaeforme var. tenue, etc.
Length = 602
Score = 38.2 bits (89), Expect = 0.058
Identities = 22/109 (20%), Positives = 34/109 (31%), Gaps = 13/109 (11%)
Query: 2007 TISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESIT 2066
T+S ++ S + ES+T+ P + PA SI
Sbjct: 230 TMSEGLVTSEQDGVEEASGCGVQGTIESSTSVIPPGRAAE-------------PAPVSIP 276
Query: 2067 SSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESEST 2115
++P +T ++ S A T PAS T S S
Sbjct: 277 VAAPGEGATPAAAQIELSAGVLPNAISPATPVRPASNGVTPSQAPSAEP 325
Score = 33.5 bits (77), Expect = 1.5
Identities = 19/99 (19%), Positives = 31/99 (31%), Gaps = 3/99 (3%)
Query: 1977 TSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTT 2036
T S ++ + S ES+T++ P + P S +P +T
Sbjct: 230 TMSEGLVTSEQDGVEEASGCGVQGTIESSTSVIPPGR---AAEPAPVSIPVAAPGEGATP 286
Query: 2037 TSSPASESTTTNNPKSESTTTNNPASESITSSSPASEST 2075
++ S T PAS +T S S
Sbjct: 287 AAAQIELSAGVLPNAISPATPVRPASNGVTPSQAPSAEP 325
Score = 31.2 bits (71), Expect = 7.5
Identities = 18/99 (18%), Positives = 26/99 (26%), Gaps = 3/99 (3%)
Query: 1987 TSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTT 2046
T S ++ E S + ES+T+ P + A+ +
Sbjct: 230 TMSEGLVTSEQDGVEEASGCGVQGTIESSTSVIPPGRAAEPAPVSIPVAAPGEGATPAAA 289
Query: 2047 TNNPKSESTTTNNPASESITSSSPASESTTTSSPASEST 2085
S A T PAS T S S
Sbjct: 290 QI---ELSAGVLPNAISPATPVRPASNGVTPSQAPSAEP 325
>gnl|CDD|218902 pfam06121, DUF959, Domain of Unknown Function (DUF959). This
N-terminal domain is not expressed in the 'Short' isoform
of Collagen A.
Length = 202
Score = 37.1 bits (85), Expect = 0.061
Identities = 39/177 (22%), Positives = 63/177 (35%), Gaps = 17/177 (9%)
Query: 1884 MSTLNSL--LSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPE 1941
+ST L + T SP + S + +T S + ESTT +S E
Sbjct: 16 LSTPKKPTWLWKPYTELSPTASSAAVPQASTPVQSTESTTTHVVPRPGETEESTTPASSE 75
Query: 1942 SES------TTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSS------ 1989
P + +TT + SSP+ + +E +
Sbjct: 76 EPKEIVEKGKQNVVPGTVATTPTVTPVAMDVASSPDLSEENIAGVGAEILNVAEGIRSFV 135
Query: 1990 -LVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASEST 2045
L + T +S ++ T P+ +T SS TTT+ P S SSP++ +T
Sbjct: 136 QLWEDKVTNASAQTPVPDTEMPLVLATPISSLPQNDTTTLWPSSH--IPSSPSANTT 190
Score = 34.4 bits (78), Expect = 0.48
Identities = 40/204 (19%), Positives = 68/204 (33%), Gaps = 20/204 (9%)
Query: 1899 SPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTS 1958
+ T + + + T P +E + T+S + ++ +S T S +
Sbjct: 7 TSADAETASLSTPKKPTWLWKPYTELSPTASSAAVPQASTPVQS----TESTTTHVVPRP 62
Query: 1959 SLVSESTTTSSPESES------TTTSSPESESTTTSSLVSESTTTSSPESESTTT----- 2007
ESTT +S E P + +TT + SSP+
Sbjct: 63 GETEESTTPASSEEPKEIVEKGKQNVVPGTVATTPTVTPVAMDVASSPDLSEENIAGVGA 122
Query: 2008 -ISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESIT 2066
I V+E + + E T + S T P +E S N+ + +
Sbjct: 123 EILNVAEGIRSFVQLWEDKVT----NASAQTPVPDTEMPLVLATPISSLPQNDTTTLWPS 178
Query: 2067 SSSPASESTTTSSPASESTTTSSP 2090
S P+S S T+ + S T P
Sbjct: 179 SHIPSSPSANTTEAGTLSGPTKLP 202
>gnl|CDD|223065 PHA03378, PHA03378, EBNA-3B; Provisional.
Length = 991
Score = 38.5 bits (89), Expect = 0.062
Identities = 39/196 (19%), Positives = 56/196 (28%), Gaps = 18/196 (9%)
Query: 1954 STTTSSLVSES---TTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISP 2010
S TTS L S + T P + T P TT S + S P + P
Sbjct: 579 SPTTSQLASSAPSYAQTPWPVPHPSQTPEP---PTTQSHIPETSAPRQWPMPLRPIPMRP 635
Query: 2011 VSESTTTSSPVSESTTTISPESESTTTSS--------PASESTTTNN----PKSESTTTN 2058
+ T + + T P+ E T P S T N + T
Sbjct: 636 LRMQPITFNVLVFPTPHQPPQVEITPYKPTWTQIGHIPYQPSPTGANTMLPIQWAPGTMQ 695
Query: 2059 NPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTS 2118
P PA+ PA+ + PA+ PA+ P +
Sbjct: 696 PPPRAPTPMRPPAAPPGRAQRPAAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRAR 755
Query: 2119 SPASESTTIEEQGVSP 2134
PA+ +P
Sbjct: 756 PPAAAPGRARPPAAAP 771
Score = 35.4 bits (81), Expect = 0.45
Identities = 39/198 (19%), Positives = 60/198 (30%), Gaps = 16/198 (8%)
Query: 1924 STTTSSLVSES---TTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSP 1980
S TTS L S + T P + T P TT S + S P P
Sbjct: 579 SPTTSQLASSAPSYAQTPWPVPHPSQTPEP---PTTQSHIPETSAPRQWPMPLRPIPMRP 635
Query: 1981 ESESTTTSSLVSESTTTSSPESEST------TTISPVSESTTTSSPVSESTTTISPESES 2034
T +++ T P+ E T T I + + + T + +
Sbjct: 636 LRMQPITFNVLVFPTPHQPPQVEITPYKPTWTQIGHIPYQPSPTGA----NTMLPIQWAP 691
Query: 2035 TTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASES 2094
T P T P + PA+ + + PA+ PA+ PA+
Sbjct: 692 GTMQPPPRAPTPMRPPAAPPGRAQRPAAATGRARPPAAAPGRARPPAAAPGRARPPAAAP 751
Query: 2095 TTTSSPASESTTTSSPES 2112
PA+ P +
Sbjct: 752 GRARPPAAAPGRARPPAA 769
Score = 34.7 bits (79), Expect = 0.74
Identities = 37/197 (18%), Positives = 56/197 (28%), Gaps = 17/197 (8%)
Query: 1938 SSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTT 1997
+SP + +S+P T V + T P TT S S +
Sbjct: 578 TSPTTSQLASSAPSY--AQTPWPVPHPSQTPEP---PTTQSHIPETSAPRQWPMPLRPIP 632
Query: 1998 SSPESESTTTISPVSESTTTSSPVSESTTTIS--------PESESTTTSS----PASEST 2045
P T + + T P E T P S T ++
Sbjct: 633 MRPLRMQPITFNVLVFPTPHQPPQVEITPYKPTWTQIGHIPYQPSPTGANTMLPIQWAPG 692
Query: 2046 TTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASEST 2105
T P T PA+ + PA+ + PA+ PA+ PA+
Sbjct: 693 TMQPPPRAPTPMRPPAAPPGRAQRPAAATGRARPPAAAPGRARPPAAAPGRARPPAAAPG 752
Query: 2106 TTSSPESESTTTSSPAS 2122
P + PA+
Sbjct: 753 RARPPAAAPGRARPPAA 769
>gnl|CDD|220888 pfam10846, DUF2722, Protein of unknown function (DUF2722). This
eukaryotic family of proteins has no known function.
Length = 373
Score = 37.9 bits (88), Expect = 0.063
Identities = 53/255 (20%), Positives = 89/255 (34%), Gaps = 27/255 (10%)
Query: 1897 TNSPESESTTTNNPESESTTTSS--PESESTTTSSLVSESTTTSSPESESTTTSSPE--- 1951
+S ST PE T S P S SS S T + +SP
Sbjct: 123 ALPTKSNSTGLLAPEQNGTNASPVPPSSYKFPPSS--SGLTPRHTVLPTHRRPNSPARIG 180
Query: 1952 -----SESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTT 2006
+ +T T+ ES SSP S + + S S T+S + T
Sbjct: 181 AAAVANLATPTTPYKEESLGASSPLRRKKFGSQLHQRNMSLPSNTPTSGNTNSNIPKPAT 240
Query: 2007 TISPVSESTTTSSPVSESTTTISPESESTTTSSPAS-----ESTTTNNPKSESTTTNNPA 2061
++ S P+ + + + S+ + TS ES + +++S+++
Sbjct: 241 SVLNFKPSPA--QPLHKQSKSAPQPSQESMTSFQHIIQWKPESQQKKHRRTKSSSSFG-- 296
Query: 2062 SESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPA 2121
I +S + S + + S ++ S P EST + + ++ + SS
Sbjct: 297 --VIDLNSISEASQVN--EDDDPPDSDSKERKNEENSDP--ESTPSDDNDDKTCSESSSR 350
Query: 2122 SESTTIEEQGVSPHS 2136
SES G PH
Sbjct: 351 SESPNRTNTGRYPHD 365
Score = 31.4 bits (71), Expect = 7.3
Identities = 33/139 (23%), Positives = 51/139 (36%), Gaps = 25/139 (17%)
Query: 1998 SSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSS--PASE----STTTNNPK 2051
S +EST S ST ++PE T S P+S S++ P+
Sbjct: 111 SGGLAEST-------NPRQALPTKSNSTGLLAPEQNGTNASPVPPSSYKFPPSSSGLTPR 163
Query: 2052 SESTTTN-NPASES-ITSSSPASESTTTSSPASESTTTSSPASESTTTS----------S 2099
T+ P S + I +++ A+ +T T+ ES SSP S S
Sbjct: 164 HTVLPTHRRPNSPARIGAAAVANLATPTTPYKEESLGASSPLRRKKFGSQLHQRNMSLPS 223
Query: 2100 PASESTTTSSPESESTTTS 2118
S T+S + T+
Sbjct: 224 NTPTSGNTNSNIPKPATSV 242
>gnl|CDD|221185 pfam11719, Drc1-Sld2, DNA replication and checkpoint protein. Genome
duplication is precisely regulated by cyclin-dependent
kinases CDKs, which bring about the onset of S phase by
activating replication origins and then prevent
relicensing of origins until mitosis is completed. The
optimum sequence motif for CDK phosphorylation is
S/T-P-K/R-K/R, and Drc1-Sld2 is found to have at least 11
potential phosphorylation sites. Drc1 is required for DNA
synthesis and S-M replication checkpoint control. Drc1
associates with Cdc2 and is phosphorylated at the onset
of S phase when Cdc2 is activated. Thus Cdc2 promotes DNA
replication by phosphorylating Drc1 and regulating its
association with Cut5. Sld2 and Sld3 represent the
minimal set of S-CDK substrates required for DNA
replication.
Length = 397
Score = 37.9 bits (88), Expect = 0.063
Identities = 45/183 (24%), Positives = 73/183 (39%), Gaps = 14/183 (7%)
Query: 1929 SLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTS 1988
L+S T SP+ ++ ES+ST + S+ SP S + SS ++E T
Sbjct: 46 KLLSAKTIEPSPKKRKHSSPDGESQSTPRKRIPSDVDPYDSP-SALRSPSSLKTELGPTP 104
Query: 1989 S----------LVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTS 2038
L+S ST S S+ S V+ +T S+P S+ T+ E E
Sbjct: 105 QRDGKVLSLFDLLSSSTPPESTPSKRKLA-SSVASATPFSTP-SKRRETLDAEDEDRPEY 162
Query: 2039 SPASESTTTNNPKSESTTTNN-PASESITSSSPASESTTTSSPASESTTTSSPASESTTT 2097
P SE T ++ K P S +S +P+ + ++ S +S + +
Sbjct: 163 GPRSERTPLSSGKKVMLDLFFTPTSWRYSSETPSFLRRSNQDVSATSNPLNSAEPDFGVS 222
Query: 2098 SSP 2100
SP
Sbjct: 223 PSP 225
Score = 33.6 bits (77), Expect = 1.3
Identities = 40/184 (21%), Positives = 65/184 (35%), Gaps = 16/184 (8%)
Query: 1899 SPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTS 1958
S T +P+ ++ ES+ST + S+ SP S + SS ++E T
Sbjct: 46 KLLSAKTIEPSPKKRKHSSPDGESQSTPRKRIPSDVDPYDSP-SALRSPSSLKTELGPTP 104
Query: 1959 S-----------LVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTT 2007
L S + S+P +S + +T S E T E E
Sbjct: 105 QRDGKVLSLFDLLSSSTPPESTPSKRKLASSVASATPFSTPSKRRE---TLDAEDEDRPE 161
Query: 2008 ISPVSESTTTSSPVSES-TTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESIT 2066
P SE T SS +P S ++ +P+ + + + S N+ +
Sbjct: 162 YGPRSERTPLSSGKKVMLDLFFTPTSWRYSSETPSFLRRSNQDVSATSNPLNSAEPDFGV 221
Query: 2067 SSSP 2070
S SP
Sbjct: 222 SPSP 225
>gnl|CDD|148679 pfam07218, RAP1, Rhoptry-associated protein 1 (RAP-1). This family
consists of several rhoptry-associated protein 1 (RAP-1)
sequences which appear to be specific to Plasmodium
falciparum.
Length = 790
Score = 38.1 bits (88), Expect = 0.068
Identities = 23/151 (15%), Positives = 51/151 (33%), Gaps = 6/151 (3%)
Query: 1865 DNYSEIIFTTNNNS-ESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESE 1923
D +S+ F N S + + T S + + +S + + S + E
Sbjct: 62 DEFSDESFLENKASKDDGNINLTDTSENGDASKKGHGKSRVRSASAAAILEEDDSKDDME 121
Query: 1924 STTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESE 1983
+ + + E +SS + S + ++S ES ++ ++
Sbjct: 122 FKANPNEAGKPGKPKGNQGEGLASSSDGKSKASAKS----GSKSASKHGESNSSDESATD 177
Query: 1984 STTTSSLVSESTTTSSPESES-TTTISPVSE 2013
S S+ V+ + T++P+ E
Sbjct: 178 SGKASASVAGIVGADEEAPPAPKNTLTPLEE 208
Score = 38.1 bits (88), Expect = 0.075
Identities = 33/169 (19%), Positives = 56/169 (33%), Gaps = 17/169 (10%)
Query: 1894 NTTTNSPESESTTTNNPESES----TTTSSPESESTTTSSLVSESTTTSSPESESTTTSS 1949
N+ + ES N + T +S +++ S + S+ S
Sbjct: 58 NSWEDEFSDESFLENKASKDDGNINLTDTSENGDASKKGHGKSRVRSASAAAILEEDDSK 117
Query: 1950 PESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTIS 2009
+ E + + + E +SS + S + ++S ES ++
Sbjct: 118 DDMEFKANPNEAGKPGKPKGNQGEGLASSSDGKSKASAKS----GSKSASKHGESNSS-- 171
Query: 2010 PVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTN 2058
ES T S S S I E + PA ++T T P E TN
Sbjct: 172 --DESATDSGKASASVAGIVGADEE---APPAPKNTLT--PLEELYETN 213
Score = 37.4 bits (86), Expect = 0.14
Identities = 28/170 (16%), Positives = 62/170 (36%), Gaps = 5/170 (2%)
Query: 1988 SSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTT 2047
+S E + S E++++ ++ + T+ + + +S S+ A+
Sbjct: 58 NSWEDEFSDESFLENKASKDDGNINLTDTSENGDASKKGH----GKSRVRSASAAAILEE 113
Query: 2048 NNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTT 2107
++ K + NP +E+ P + +S+ + +S S S + S +++
Sbjct: 114 DDSKDDMEFKANP-NEAGKPGKPKGNQGEGLASSSDGKSKASAKSGSKSASKHGESNSSD 172
Query: 2108 SSPESESTTTSSPASESTTIEEQGVSPHSEKLSANEDPEEFPNEDVFEHT 2157
S ++S A EE +P + E E N +H
Sbjct: 173 ESATDSGKASASVAGIVGADEEAPPAPKNTLTPLEELYETNVNLFALKHP 222
>gnl|CDD|236138 PRK07994, PRK07994, DNA polymerase III subunits gamma and tau;
Validated.
Length = 647
Score = 37.9 bits (89), Expect = 0.074
Identities = 18/116 (15%), Positives = 36/116 (31%), Gaps = 2/116 (1%)
Query: 2010 PVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSS 2069
P + P + S ++ + T++ A P S PA ++S
Sbjct: 361 PAAPLPEPEVPPQSAAPAASAQATAAPTAAVAPPQAPAVPPPPASAPQQAPAVPLPETTS 420
Query: 2070 PASESTT--TSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASE 2123
+ + + S PA+ S ++ S + S +PA +
Sbjct: 421 QLLAARQQLQRAQGATKAKKSEPAAASRARPVNSALERLASVRPAPSALEKAPAKK 476
>gnl|CDD|132697 TIGR03658, IsdH_HarA, haptoglobin-binding heme uptake protein HarA.
HarA is a heme-binding NEAT-domain (NEAr Transporter,
pfam05031) protein which has been shown to bind to the
haptoglobin-hemoglobin complex in order to extract heme
from it. HarA has also been reported to bind hemoglobin
directly. HarA (also known as IsdH) contains three NEAT
domains as well as a sortase A C-terminal signal for
localization to the cell wall. The heme bound at the
third of these NEAT domains has been shown to be
transferred to the IsdA protein also localized at the
cell wall, presumably through an additional specific
protein-protein interaction. Haptoglobin is a hemoglobin
carrier protein involved in scavenging hemoglobin in the
blood following red blood cell lysis and targetting it to
the liver.
Length = 895
Score = 37.9 bits (87), Expect = 0.080
Identities = 25/119 (21%), Positives = 56/119 (47%), Gaps = 2/119 (1%)
Query: 1866 NYSEIIFTTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESEST 1925
+Y++++F ++ ++V S N + N ++S S T TN ++T ++ ++
Sbjct: 213 DYTKLVFAKPIYNDPSLVKSDTNDAVVTNDQSSSDASNQTNTNTSNQNTSTINNANNQPQ 272
Query: 1926 TTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESES 1984
T+++ + SS ++ SS + T ++ ++ T SS +S+ P ES
Sbjct: 273 ATTNMSQPAQPKSSANADQ--ASSQPAHETNSNGNTNDKTNESSNQSDVNQQYPPADES 329
Score = 37.5 bits (86), Expect = 0.11
Identities = 32/147 (21%), Positives = 60/147 (40%), Gaps = 7/147 (4%)
Query: 1980 PESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTT--ISPESESTTT 2037
P S+ T +VS + E+ T ++ + +S T + +S++
Sbjct: 188 PVSDGTQELKIVSSTQIDDGEETNYDYTKLVFAKPIYNDPSLVKSDTNDAVVTNDQSSSD 247
Query: 2038 SSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTT 2097
+S + + T+N S NN + S PA ++ ++ +S PA E T +
Sbjct: 248 ASNQTNTNTSNQNTSTINNANNQPQATTNMSQPAQPKSSANA----DQASSQPAHE-TNS 302
Query: 2098 SSPASESTTTSSPESESTTTSSPASES 2124
+ ++ T SS +S+ PA ES
Sbjct: 303 NGNTNDKTNESSNQSDVNQQYPPADES 329
Score = 32.1 bits (72), Expect = 4.4
Identities = 28/131 (21%), Positives = 56/131 (42%), Gaps = 2/131 (1%)
Query: 1944 STTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESE 2003
ST E + + LV + P + T+ + +SS S T T++ ++
Sbjct: 201 STQIDDGEETNYDYTKLVFAKPIYNDPSLVKSDTNDAVVTNDQSSSDASNQTNTNT-SNQ 259
Query: 2004 STTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASE 2063
+T+TI+ + ++ +S+ S + +S PA E+ + N ++ ++N S+
Sbjct: 260 NTSTINNANNQPQATTNMSQPAQPKSSANADQASSQPAHETNSNGNTNDKTNESSN-QSD 318
Query: 2064 SITSSSPASES 2074
PA ES
Sbjct: 319 VNQQYPPADES 329
>gnl|CDD|152561 pfam12126, DUF3583, Protein of unknown function (DUF3583). This
domain is found in eukaryotes, and is typically between
302 and 338 amino acids in length. It is found in
association with pfam00097 and pfam00643. Most members
are promyelocytic leukemia proteins, and this family lies
towards the C-terminus.
Length = 284
Score = 37.3 bits (86), Expect = 0.082
Identities = 35/147 (23%), Positives = 52/147 (35%), Gaps = 12/147 (8%)
Query: 1904 STTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSE 1963
S T ++ + +SPE+ ST T+ E+ +TTTS S T
Sbjct: 149 SCITQGIDAAVSKKASPEAAST------PRDPVTTDTEASNTTTSQKRKCSQTDCPRKII 202
Query: 1964 STTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSE 2023
+ + TSSPE +TS VS P ES + P S
Sbjct: 203 KMESEEGNEDRLATSSPEQPRPSTSKAVSPPHLDGPPSPESPVPEKEI------LLPNSN 256
Query: 2024 STTTISPESESTTTSSPASESTTTNNP 2050
T+ + E+E +SE + N
Sbjct: 257 HVTSDTGETEERVVVISSSEDSDAENL 283
>gnl|CDD|223020 PHA03246, PHA03246, large tegument protein UL36; Provisional.
Length = 3095
Score = 38.0 bits (88), Expect = 0.085
Identities = 41/244 (16%), Positives = 84/244 (34%), Gaps = 26/244 (10%)
Query: 1925 TTTSSLVSESTTTSS----PESESTTTSSPESESTTTSSLV---SESTTTSSPESESTTT 1977
T ++ + + + SEST + E S+LV S + S P + +
Sbjct: 329 WQTKIVIGTADSYADSSPKLHSESTDLTPHEHGEYDPSTLVGGASTNINISDPPARTDCR 388
Query: 1978 SSPESEST--TTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISP-ESES 2034
E + S + + T +S + + S +SE + + T ++
Sbjct: 389 RYSEGSVIHESVDSHIEDVTEATSVVAAWSDAFSDISEDYSHLTRPDLPATAHDVSKNGH 448
Query: 2035 TTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASES 2094
T S S + + + + + T +SE+++S P + S S+ S
Sbjct: 449 DTKSDRRSRGSNSRHKRRRPSWTPPSSSENVSSDGPTFSQSRKPSRKSKRALDLDYGHLS 508
Query: 2095 TTTSSPASE-----STTTSSPESESTTTSSPASE---STTIEEQGV------SPHSEKLS 2140
S E + S+ + +S+ +IE + +PH+ ++
Sbjct: 509 NEPSDVDGENSDSPAGAISNIPDNVSFNEFISSQARAEDSIEHLSLRNRPVFNPHT--VT 566
Query: 2141 ANED 2144
N D
Sbjct: 567 GNLD 570
Score = 35.0 bits (80), Expect = 0.77
Identities = 41/220 (18%), Positives = 80/220 (36%), Gaps = 20/220 (9%)
Query: 1899 SPESESTTTNNPESESTTTSSPESESTTTSSLVS-ESTTTSSPESESTTTSSPESESTTT 1957
+ +S + ++ SEST + E S+LV ST + + + T SE +
Sbjct: 337 TADSYADSSPKLHSESTDLTPHEHGEYDPSTLVGGASTNINISDPPARTDCRRYSEGSVI 396
Query: 1958 SSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTT 2017
+ S E + TS + S S + + + + P+ +T VS++
Sbjct: 397 -----HESVDSHIEDVTEATSVVAAWSDAFSDISEDYSHLTRPDLPATA--HDVSKNGHD 449
Query: 2018 SSPVSESTTTIS----------PESESTTTSSPASESTTTNNP--KSESTTTNNPASESI 2065
+ S + S P S S SS + + P KS+ + S
Sbjct: 450 TKSDRRSRGSNSRHKRRRPSWTPPSSSENVSSDGPTFSQSRKPSRKSKRALDLDYGHLSN 509
Query: 2066 TSSSPASESTTTSSPASESTTTSSPASESTTTSSPASEST 2105
S E++ + + A + + +E ++ + A +S
Sbjct: 510 EPSDVDGENSDSPAGAISNIPDNVSFNEFISSQARAEDSI 549
Score = 31.5 bits (71), Expect = 8.2
Identities = 23/128 (17%), Positives = 47/128 (36%), Gaps = 7/128 (5%)
Query: 1892 SENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSE-----STTTSSPESESTT 1946
SEN +++ P + + +S+ S S + E + S+ +
Sbjct: 476 SENVSSDGPTFSQSRKPSRKSKRALDLDYGHLSNEPSDVDGENSDSPAGAISNIPDNVSF 535
Query: 1947 TSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTT 2006
S++ S+ S + T T ++T SL ++ + S P S+ +
Sbjct: 536 NEFISSQARAEDSIEHLSLRNRPVFNPHTVTG--NLDNTLRDSLWNDEYSGSYPLSDISD 593
Query: 2007 TISPVSES 2014
I ++ES
Sbjct: 594 MIDDITES 601
>gnl|CDD|183854 PRK13042, PRK13042, superantigen-like protein; Reviewed.
Length = 291
Score = 37.3 bits (86), Expect = 0.090
Identities = 26/99 (26%), Positives = 43/99 (43%), Gaps = 7/99 (7%)
Query: 1954 STTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSE 2013
S L + TT++ + +TT SS + E+ ++ ST +P+S+ T P
Sbjct: 10 SLALGLLTTGVITTTTQAANATTPSSTKVEAPQSTP---PSTKVEAPQSKPNATTPP--- 63
Query: 2014 STTTSSPVSESTTTISPESESTTTSSPASESTTTN-NPK 2051
ST +P T ++ T SP ++ T NPK
Sbjct: 64 STKVEAPQQTPNATTPSSTKVETPQSPTTKQVPTEINPK 102
Score = 36.1 bits (83), Expect = 0.18
Identities = 23/87 (26%), Positives = 42/87 (48%), Gaps = 7/87 (8%)
Query: 1946 TTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESEST 2005
TT++ + +TT SS E+ ++ P ST +P+S+ T+ ST +P+
Sbjct: 22 TTTTQAANATTPSSTKVEAPQSTPP---STKVEAPQSKPNATTP---PSTKVEAPQQTPN 75
Query: 2006 TTISPVSESTTTSSPVSEST-TTISPE 2031
T ++ T SP ++ T I+P+
Sbjct: 76 ATTPSSTKVETPQSPTTKQVPTEINPK 102
Score = 34.2 bits (78), Expect = 0.71
Identities = 21/72 (29%), Positives = 36/72 (50%), Gaps = 5/72 (6%)
Query: 2076 TTSSPASESTTTSSPASESTTTSSPASESTTTSSPESE--STTTSSPASESTTIEEQGVS 2133
TT++ A+ +TT SS E+ ++ P ST +P+S+ +TT S E+ +
Sbjct: 22 TTTTQAANATTPSSTKVEAPQSTPP---STKVEAPQSKPNATTPPSTKVEAPQQTPNATT 78
Query: 2134 PHSEKLSANEDP 2145
P S K+ + P
Sbjct: 79 PSSTKVETPQSP 90
Score = 33.5 bits (76), Expect = 1.2
Identities = 31/128 (24%), Positives = 51/128 (39%), Gaps = 13/128 (10%)
Query: 2026 TTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASEST 2085
TTI+ S + + +TTT ++ P+S + + ST +P S+
Sbjct: 4 TTIAKTSLALGLLTTGVITTTT-----QAANATTPSSTKVEAPQSTPPSTKVEAPQSKPN 58
Query: 2086 TTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSA--NE 2143
T+ P ST +P T+ ST +P S +T ++P + L A +
Sbjct: 59 ATTPP---STKVEAPQQTPNATTPS---STKVETPQSPTTKQVPTEINPKFKDLRAYYTK 112
Query: 2144 DPEEFPNE 2151
EF NE
Sbjct: 113 PSLEFKNE 120
Score = 33.5 bits (76), Expect = 1.2
Identities = 23/80 (28%), Positives = 38/80 (47%), Gaps = 9/80 (11%)
Query: 1976 TTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESEST 2035
TT++ + +TT SS E+ ++ P ST +P S+ T+ P ST +P+
Sbjct: 22 TTTTQAANATTPSSTKVEAPQSTPP---STKVEAPQSKPNATTPP---STKVEAPQQTPN 75
Query: 2036 TTSSPASESTTTNNPKSEST 2055
T+ ST P+S +T
Sbjct: 76 ATTPS---STKVETPQSPTT 92
Score = 32.7 bits (74), Expect = 2.1
Identities = 20/82 (24%), Positives = 38/82 (46%), Gaps = 6/82 (7%)
Query: 1916 TTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESEST 1975
TT++ + +TT SS E+ ++ P ST +P+S+ T+ ST +P+
Sbjct: 22 TTTTQAANATTPSSTKVEAPQSTPP---STKVEAPQSKPNATTP---PSTKVEAPQQTPN 75
Query: 1976 TTSSPESESTTTSSLVSESTTT 1997
T+ ++ T S ++ T
Sbjct: 76 ATTPSSTKVETPQSPTTKQVPT 97
Score = 30.8 bits (69), Expect = 9.5
Identities = 18/89 (20%), Positives = 35/89 (39%), Gaps = 5/89 (5%)
Query: 1989 SLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTN 2048
L++ T++ ++ + TT S S+P S + +TT S E+
Sbjct: 14 GLLTTGVITTTTQAANATTPSSTKVEAPQSTPPSTKVEAPQSKPNATTPPSTKVEA---- 69
Query: 2049 NPKSESTTTNNPASESITSSSPASESTTT 2077
P+ T +++ T SP ++ T
Sbjct: 70 -PQQTPNATTPSSTKVETPQSPTTKQVPT 97
>gnl|CDD|218825 pfam05956, APC_basic, APC basic domain. This region of the APC
family of proteins is known as the basic domain. It
contains a high proportion of positively charged amino
acids and interacts with microtubules.
Length = 359
Score = 37.4 bits (86), Expect = 0.091
Identities = 38/162 (23%), Positives = 57/162 (35%), Gaps = 8/162 (4%)
Query: 1970 PESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTIS 2029
S+STT S T S +S + S S + SE S P S T +
Sbjct: 16 NRSQSTTPSKKGPPLKTQPSDPPKSPSPGQQRSRSLHRPAKPSELAELS-PPPRSATPPA 74
Query: 2030 PESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASEST--TTSSPASESTTT 2087
+++ ++SS + + + P+ T + SI S S TSSPA
Sbjct: 75 RLAKTPSSSSSQTSTPSQPLPRPLPRPTQSAGRNSILPGPGNSLSQVPRTSSPA--RALL 132
Query: 2088 SSPASESTTTSSPA---SESTTTSSPESESTTTSSPASESTT 2126
+S S+ T SP P +S P E +
Sbjct: 133 ASSGSQHKTQKSPVRIPFMQNPAKPPPLSKNASSRPRPEPGS 174
Score = 37.4 bits (86), Expect = 0.094
Identities = 50/254 (19%), Positives = 78/254 (30%), Gaps = 32/254 (12%)
Query: 1904 STTTNNP-ESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVS 1962
S +P + + S + + +S +++P + T S S T+T
Sbjct: 35 SDPPKSPSPGQQRSRSLHRPAKPSELAELSPPPRSATPPARLAKTPSSSSSQTSTP---- 90
Query: 1963 ESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPV- 2021
S P T ++ S + +S+ TSSP S S+ T SPV
Sbjct: 91 -SQPLPRPLPRPTQSAGRNSILPGPGNSLSQVPRTSSP--ARALLASSGSQHKTQKSPVR 147
Query: 2022 --SESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSS 2079
P +S P E + + P + S + S
Sbjct: 148 IPFMQNPAKPPPLSKNASSRPRPEPG----SRGRAGMNGGPGARG--SRLELVRMASAKS 201
Query: 2080 PASESTTT----------SSPASESTTTSSPASESTTTSSPESESTTTSSPASE-----S 2124
SES + SP + S +S + SS + S S PA S
Sbjct: 202 SGSESDRSGFRRQLTFIKESPGTLRRRRSELSSAESLASSSQPASPRRSRPALPAVFLCS 261
Query: 2125 TTIEEQGVSPHSEK 2138
+ E S HS
Sbjct: 262 SRCPELRASTHSSV 275
Score = 33.9 bits (77), Expect = 1.0
Identities = 32/200 (16%), Positives = 52/200 (26%), Gaps = 10/200 (5%)
Query: 1935 TTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSES 1994
T P + + S+ S+ S + + S + + +S
Sbjct: 7 TVIYIPGPANRSQSTTPSKKGPPLKTQPSDPPKSPSPGQQRSRSLHRPAKPSELAELSPP 66
Query: 1995 TTTSSPESESTTTISPVSESTTTSS---PVSESTTTISPESESTTTSSPASESTTTNNPK 2051
+++P + T S S T+T S P T S S S S
Sbjct: 67 PRSATPPARLAKTPSSSSSQTSTPSQPLPRPLPRPTQSAGRNSILPGPGNSLSQVPRTSS 126
Query: 2052 SESTTTNNPASESITSSSPA---SESTTTSSPASESTTTSSPASESTTTSSPASESTTTS 2108
+ S+ T SP P +S P E + +
Sbjct: 127 PARALLASSGSQHKTQKSPVRIPFMQNPAKPPPLSKNASSRPRPEPGSRGRAGMNGGPGA 186
Query: 2109 SPE----SESTTTSSPASES 2124
+ S SES
Sbjct: 187 RGSRLELVRMASAKSSGSES 206
Score = 32.0 bits (72), Expect = 4.1
Identities = 46/240 (19%), Positives = 66/240 (27%), Gaps = 28/240 (11%)
Query: 1917 TSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSL-----VSESTTTSSPE 1971
P + S +T+ + P S SP + SL SE S P
Sbjct: 11 IPGPANRSQSTTPSKKGPPLKTQP---SDPPKSPSPGQQRSRSLHRPAKPSELAELSPPP 67
Query: 1972 SESTTTSSPESESTTTSSLVSESTTTSS---PESESTTTISPVSESTTTSSPVSESTTTI 2028
+T P + T S S T+T S P T S S S S
Sbjct: 68 RSAT----PPARLAKTPSSSSSQTSTPSQPLPRPLPRPTQSAGRNSILPGPGNSLSQVPR 123
Query: 2029 SPESESTTTSSPASESTTTNNP---KSESTTTNNPASESITSSSPASESTTTSSPASEST 2085
+ +S S+ T +P P SS P E +
Sbjct: 124 TSSPARALLASSGSQHKTQKSPVRIPFMQNPAKPPPLSKNASSRPRPEPGSRGRAGMNGG 183
Query: 2086 TTSSPA----SESTTTSSPASESTTTSSPESESTTTSSPA------SESTTIEEQGVSPH 2135
+ + + S SES + + SP SE ++ E S
Sbjct: 184 PGARGSRLELVRMASAKSSGSESDRSGFRRQLTFIKESPGTLRRRRSELSSAESLASSSQ 243
Score = 30.9 bits (69), Expect = 9.6
Identities = 56/336 (16%), Positives = 94/336 (27%), Gaps = 56/336 (16%)
Query: 1895 TTTNSPESESTTTNNPESESTTTSSPESESTTTS------------SLVSESTT------ 1936
+ + +P + S+ + SP + + + S S T
Sbjct: 18 SQSTTPSKKGPPLKTQPSDPPKSPSPGQQRSRSLHRPAKPSELAELSPPPRSATPPARLA 77
Query: 1937 -TSSPESESTTTSS---PESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVS 1992
T S S T+T S P T S S S S + +S S
Sbjct: 78 KTPSSSSSQTSTPSQPLPRPLPRPTQSAGRNSILPGPGNSLSQVPRTSSPARALLASSGS 137
Query: 1993 ESTTTSSP---ESESTTTISPVSESTTTSSPVSESTTTISPESES--------------T 2035
+ T SP P +S P E +
Sbjct: 138 QHKTQKSPVRIPFMQNPAKPPPLSKNASSRPRPEPGSRGRAGMNGGPGARGSRLELVRMA 197
Query: 2036 TTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASEST 2095
+ S SES + + + +P + S +S + SS S S PA +
Sbjct: 198 SAKSSGSESDRSGFRRQLTFIKESPGTLRRRRSELSSAESLASSSQPASPRRSRPALPAV 257
Query: 2096 TTSSPASE--STTTSSPESESTTTSSPASESTTIEEQ--------GVSPHSEKLSANEDP 2145
S +T S P + IE ++ + +++E P
Sbjct: 258 FLCSSRCPELRASTHSSVQAGGWRKLPPRQGPAIEYNQRRPAARPDIAERYGRRTSSESP 317
Query: 2146 EEFP------NEDVFEHTFAEIPNIDHSNQTDEAIP 2175
P + + +A +P+I +TD A
Sbjct: 318 SRLPVRAGPGKPETVKR-YASLPHISVWRRTDSASS 352
>gnl|CDD|223044 PHA03325, PHA03325, nuclear-egress-membrane-like protein;
Provisional.
Length = 418
Score = 37.6 bits (87), Expect = 0.093
Identities = 28/169 (16%), Positives = 51/169 (30%), Gaps = 11/169 (6%)
Query: 1962 SESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPV 2021
S + +S S ++ + SE + P + P
Sbjct: 259 SSAFMLNSSLPTSAPKRRSRRAGAMRAAAGETADLADDDGSEHS---DPEPLPASLPPPP 315
Query: 2022 SESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPA 2081
PE+ E N +++ ++ S SSS ++ + ++ P
Sbjct: 316 VRRPRVKHPEA-------GKEEPDGARNAEAKEPAQPATSTSSKGSSSAQNKDSGSTGPG 368
Query: 2082 SESTTTSSPASESTTTSSPASESTTTSSPESESTTTS-SPASESTTIEE 2129
S SS + S P +T+ S S T++ P S T
Sbjct: 369 SSLAAASSFLEDDDFGSPPLDLTTSLRHMPSPSVTSAPEPPSIPLTYLS 417
Score = 36.8 bits (85), Expect = 0.14
Identities = 33/171 (19%), Positives = 65/171 (38%), Gaps = 11/171 (6%)
Query: 1953 ESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVS--ESTTTSSPESESTTTISP 2010
+ T+++ +++ S TS+P+ S + + + T+ L S + ++ P
Sbjct: 256 QLTSSAFMLNSSLPTSAPKRRSRRAGAMRAAAGETADLADDDGSEHSDPEPLPASLPPPP 315
Query: 2011 VSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSP 2070
V + + +E+ + PA T+T++ S S ++ S+ P
Sbjct: 316 VRRPRVKHPEAGKEEPDGARNAEAKEPAQPA---TSTSSKGSSSA-----QNKDSGSTGP 367
Query: 2071 ASESTTTSSPASESTTTSSPASESTTTSSPASESTTTS-SPESESTTTSSP 2120
S SS + S P +T+ S S T++ P S T S
Sbjct: 368 GSSLAAASSFLEDDDFGSPPLDLTTSLRHMPSPSVTSAPEPPSIPLTYLSD 418
Score = 34.1 bits (78), Expect = 1.1
Identities = 18/92 (19%), Positives = 35/92 (38%), Gaps = 1/92 (1%)
Query: 1899 SPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSS-PESESTTT 1957
+ + E N E++ + + S +SS ++ + ++ P S SS E + +
Sbjct: 326 AGKEEPDGARNAEAKEPAQPATSTSSKGSSSAQNKDSGSTGPGSSLAAASSFLEDDDFGS 385
Query: 1958 SSLVSESTTTSSPESESTTTSSPESESTTTSS 1989
L ++ P T+ P S T S
Sbjct: 386 PPLDLTTSLRHMPSPSVTSAPEPPSIPLTYLS 417
>gnl|CDD|217602 pfam03535, Paxillin, Paxillin family.
Length = 193
Score = 36.4 bits (84), Expect = 0.094
Identities = 23/71 (32%), Positives = 38/71 (53%), Gaps = 3/71 (4%)
Query: 2060 PASESITSSS--PASESTTTSSPASESTTTSSPASESTTTSSPASESTT-TSSPESESTT 2116
P++E++ SS AS S +S P ES S +++ ++ S PA E S P + +
Sbjct: 10 PSAEALNGSSWVEASSSYHSSQPQQESPKYRSSSAKPSSPSPPAGEEEHVYSFPNKQKSA 69
Query: 2117 TSSPASESTTI 2127
SSPA S+++
Sbjct: 70 ESSPAVMSSSL 80
Score = 35.7 bits (82), Expect = 0.17
Identities = 22/69 (31%), Positives = 33/69 (47%), Gaps = 1/69 (1%)
Query: 2056 TTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTT-TSSPASESTTTSSPESES 2114
++ AS S SS P ES S +++ ++ S PA E S P + + SSP S
Sbjct: 18 SSWVEASSSYHSSQPQQESPKYRSSSAKPSSPSPPAGEEEHVYSFPNKQKSAESSPAVMS 77
Query: 2115 TTTSSPASE 2123
++ S SE
Sbjct: 78 SSLGSNLSE 86
Score = 34.5 bits (79), Expect = 0.35
Identities = 22/79 (27%), Positives = 38/79 (48%), Gaps = 3/79 (3%)
Query: 2038 SSPASEST--TTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTT-TSSPASES 2094
P++E+ ++ S S ++ P ES S +++ ++ S PA E S P +
Sbjct: 8 PPPSAEALNGSSWVEASSSYHSSQPQQESPKYRSSSAKPSSPSPPAGEEEHVYSFPNKQK 67
Query: 2095 TTTSSPASESTTTSSPESE 2113
+ SSPA S++ S SE
Sbjct: 68 SAESSPAVMSSSLGSNLSE 86
Score = 33.4 bits (76), Expect = 0.92
Identities = 36/181 (19%), Positives = 67/181 (37%), Gaps = 13/181 (7%)
Query: 1894 NTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTT-TSSPESESTTTSSPES 1952
N ++ S S ++ P+ ES S ++ ++ S E S P + + SSP
Sbjct: 16 NGSSWVEASSSYHSSQPQQESPKYRSSSAKPSSPSPPAGEEEHVYSFPNKQKSAESSPAV 75
Query: 1953 ESTTTSSLVSE---------STTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESE 2003
S++ S +SE + S P + +SP S++ + E+ + ++
Sbjct: 76 MSSSLGSNLSELDRLLLELNAVQHSPPSFPADEEASPPLPSSSIPHYIPENGGSPGGKAA 135
Query: 2004 STTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASE 2063
T P V S ++ E ES S P+ T + S+ +S+
Sbjct: 136 PPTKEKPKRNGGRGIEDVRPSVESLLDELES---SVPSPVPAITVSQGETSSPQQVNSSQ 192
Query: 2064 S 2064
Sbjct: 193 Q 193
>gnl|CDD|206007 pfam13836, DUF4195, Domain of unknown function (DUF4195). This
family is found at the N-terminus of metazoan proteins
that carry PHD-like zinc-finger domains. The function is
not known.
Length = 184
Score = 36.2 bits (83), Expect = 0.099
Identities = 40/148 (27%), Positives = 65/148 (43%), Gaps = 10/148 (6%)
Query: 1936 TTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESE-STTTSSPESESTTTSSLVSES 1994
TS ++ + S S+S+ P S+ T +SP+ +S +S L+ +S
Sbjct: 44 PTSQHYRNPSSNPVAALPNFHPESKSSDSSVIVQPFSKPDFTKNSPQVDSNNSSELLFDS 103
Query: 1995 TTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSP-ASESTTTNNPKSE 2053
T + P S+ T+S T+ ST+ ++ S P SES + NP S
Sbjct: 104 TQDTLPHSQGGPTLSRAGMDETSFLLKHPSTSKVN----SVNPKKPKTSESVSGINPSSS 159
Query: 2054 STTTNNPASESITSSSPA-SESTTTSSP 2080
++ +P S+TSS S+ T TSS
Sbjct: 160 LSSQKSP---SVTSSQVVLSKGTNTSSQ 184
Score = 36.2 bits (83), Expect = 0.10
Identities = 32/141 (22%), Positives = 56/141 (39%), Gaps = 6/141 (4%)
Query: 1956 TTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSE-STTTSSPESESTTTISPVSES 2014
TS ++ + S S+S+ S+ T +SP+ +S + + +S
Sbjct: 44 PTSQHYRNPSSNPVAALPNFHPESKSSDSSVIVQPFSKPDFTKNSPQVDSNNSSELLFDS 103
Query: 2015 TTTSSPVSESTTTISPESESTTT---SSPASESTTTNNPKSESTTTNNPASESITSSSPA 2071
T + P S+ T+S T+ P++ + NPK T+ + +S S
Sbjct: 104 TQDTLPHSQGGPTLSRAGMDETSFLLKHPSTSKVNSVNPKKPKTSESVSGINPSSSLSSQ 163
Query: 2072 SESTTTSSPA--SESTTTSSP 2090
+ TSS S+ T TSS
Sbjct: 164 KSPSVTSSQVVLSKGTNTSSQ 184
Score = 35.0 bits (80), Expect = 0.24
Identities = 32/118 (27%), Positives = 54/118 (45%), Gaps = 6/118 (5%)
Query: 1899 SPESESTTTNNPESE-STTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTT 1957
S S+S+ P S+ T +SP+ +S +S L+ +ST + P S+ T S T+
Sbjct: 67 SKSSDSSVIVQPFSKPDFTKNSPQVDSNNSSELLFDSTQDTLPHSQGGPTLSRAGMDETS 126
Query: 1958 SSLVSESTT---TSSPESESTTTSSPESESTTTSSLVSESTTTSSPE--SESTTTISP 2010
L ST+ + +P+ T+ S +++ S + TSS S+ T T S
Sbjct: 127 FLLKHPSTSKVNSVNPKKPKTSESVSGINPSSSLSSQKSPSVTSSQVVLSKGTNTSSQ 184
Score = 33.5 bits (76), Expect = 0.80
Identities = 39/147 (26%), Positives = 61/147 (41%), Gaps = 8/147 (5%)
Query: 1966 TTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESE-STTTISPVSESTTTSSPVSES 2024
TS ++ + S S+S+ P S+ T SP +S +S + +S
Sbjct: 44 PTSQHYRNPSSNPVAALPNFHPESKSSDSSVIVQPFSKPDFTKNSPQVDSNNSSELLFDS 103
Query: 2025 TTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSP-ASESTTTSSPASE 2083
T P S+ T S A T+ K ST+ N S+ P SES + +P+S
Sbjct: 104 TQDTLPHSQGGPTLSRAGMDETSFLLKHPSTSKVN----SVNPKKPKTSESVSGINPSSS 159
Query: 2084 STTTSSPASESTTTSSPASESTTTSSP 2110
++ SP+ S+ S+ T TSS
Sbjct: 160 LSSQKSPSVTSSQVV--LSKGTNTSSQ 184
Score = 31.5 bits (71), Expect = 3.7
Identities = 28/119 (23%), Positives = 54/119 (45%), Gaps = 18/119 (15%)
Query: 1877 NSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTT--------S 1928
+S+S+V++ + + T NSP+ +S ++ +ST + P S+ T +
Sbjct: 69 SSDSSVIVQPFSKP---DFTKNSPQVDSNNSSELLFDSTQDTLPHSQGGPTLSRAGMDET 125
Query: 1929 SLVSESTTTS-----SPESESTTTSSPESESTTTSSLVSESTTTSSPE--SESTTTSSP 1980
S + + +TS +P+ T+ S +++ S + TSS S+ T TSS
Sbjct: 126 SFLLKHPSTSKVNSVNPKKPKTSESVSGINPSSSLSSQKSPSVTSSQVVLSKGTNTSSQ 184
>gnl|CDD|227549 COG5224, HAP2, CCAAT-binding factor, subunit B [Transcription].
Length = 248
Score = 36.7 bits (84), Expect = 0.099
Identities = 26/136 (19%), Positives = 52/136 (38%), Gaps = 8/136 (5%)
Query: 1878 SESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTT 1937
+ S+ V T N + S S S+ + P + +T + SP + ++ +
Sbjct: 31 TVSSEVTHTSEGYADSNDSRPSSISNSSESPAPINSATASMSPANNTSGNNIT------- 83
Query: 1938 SSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTT 1997
SP S +T ++S T P+++S T++ S S S +
Sbjct: 84 -SPNVRGELDMSSGPTNTASTSGPVPHDMTVLPQTDSNTSNLMSSGSQLGSFATQSTNGN 142
Query: 1998 SSPESESTTTISPVSE 2013
+S + +++ P S
Sbjct: 143 NSTTTTTSSAAHPGSF 158
Score = 35.9 bits (82), Expect = 0.18
Identities = 31/155 (20%), Positives = 62/155 (40%), Gaps = 9/155 (5%)
Query: 1897 TNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTT 1956
TN+ ++ T + E T+ +S + SS +S S+ + +P + +T + SP + ++
Sbjct: 21 TNANDATVPATVSSEVTHTSEGYADSNDSRPSS-ISNSSESPAPINSATASMSPANNTSG 79
Query: 1957 TSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTT 2016
+ SP S +T ++S T P+++S T+ S S
Sbjct: 80 NNIT--------SPNVRGELDMSSGPTNTASTSGPVPHDMTVLPQTDSNTSNLMSSGSQL 131
Query: 2017 TSSPVSESTTTISPESESTTTSSPASESTTTNNPK 2051
S + S + +++ + P S N K
Sbjct: 132 GSFATQSTNGNNSTTTTTSSAAHPGSFQPDYVNAK 166
Score = 32.5 bits (73), Expect = 2.1
Identities = 29/156 (18%), Positives = 68/156 (43%), Gaps = 3/156 (1%)
Query: 1901 ESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTS-SPESESTTTSS 1959
E+ N + ++ +++T +++ SE T TS ++S + S S S+ + +
Sbjct: 3 EAAEAAANGGSTGDDVNATNANDATVPATVSSEVTHTSEGYADSNDSRPSSISNSSESPA 62
Query: 1960 LVSESTTTSSPESEST--TTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTT 2017
++ +T + SP + ++ +SP S +T ++S T+ P ++S T+
Sbjct: 63 PINSATASMSPANNTSGNNITSPNVRGELDMSSGPTNTASTSGPVPHDMTVLPQTDSNTS 122
Query: 2018 SSPVSESTTTISPESESTTTSSPASESTTTNNPKSE 2053
+ S S + +S + +++ +P S
Sbjct: 123 NLMSSGSQLGSFATQSTNGNNSTTTTTSSAAHPGSF 158
Score = 32.5 bits (73), Expect = 2.7
Identities = 29/146 (19%), Positives = 56/146 (38%), Gaps = 4/146 (2%)
Query: 1942 SESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPE 2001
S ++ + T + VS T +S + S P S S ++ S ++ T+S
Sbjct: 13 STGDDVNATNANDATVPATVSSEVTHTSEGYADSNDSRPSSISNSSESPAPINSATASMS 72
Query: 2002 SESTTTI----SPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTT 2057
+ T+ SP S +T + S T P ++S T+N S S
Sbjct: 73 PANNTSGNNITSPNVRGELDMSSGPTNTASTSGPVPHDMTVLPQTDSNTSNLMSSGSQLG 132
Query: 2058 NNPASESITSSSPASESTTTSSPASE 2083
+ + ++S + +++ + P S
Sbjct: 133 SFATQSTNGNNSTTTTTSSAAHPGSF 158
Score = 30.5 bits (68), Expect = 9.1
Identities = 25/116 (21%), Positives = 51/116 (43%), Gaps = 8/116 (6%)
Query: 2024 STTTISPESESTTTSSPASESTTTNNPKSESTTTNN--PASESITSSSPASESTTTSS-- 2079
ST + + + PA+ S+ + +N+ P+S S +S SPA ++ T+S
Sbjct: 13 STGDDVNATNANDATVPATVSSEVTHTSEGYADSNDSRPSSISNSSESPAPINSATASMS 72
Query: 2080 PASES----TTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQG 2131
PA+ + T+ + E +S P + ++T+ + T S ++ + G
Sbjct: 73 PANNTSGNNITSPNVRGELDMSSGPTNTASTSGPVPHDMTVLPQTDSNTSNLMSSG 128
Score = 30.5 bits (68), Expect = 9.2
Identities = 32/142 (22%), Positives = 59/142 (41%), Gaps = 11/142 (7%)
Query: 1997 TSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTT------NNP 2050
T++ ++ T+S T+ S + P S S ++ SPA ++ T NN
Sbjct: 21 TNANDATVPATVSSEVTHTSEGYADSNDS---RPSSISNSSESPAPINSATASMSPANNT 77
Query: 2051 KSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASE--STTTS 2108
+ T+ N E SS P + ++T+ + T S ++ S S+ S T
Sbjct: 78 SGNNITSPNVRGELDMSSGPTNTASTSGPVPHDMTVLPQTDSNTSNLMSSGSQLGSFATQ 137
Query: 2109 SPESESTTTSSPASESTTIEEQ 2130
S ++TT++ +S + Q
Sbjct: 138 STNGNNSTTTTTSSAAHPGSFQ 159
>gnl|CDD|237864 PRK14950, PRK14950, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 585
Score = 37.5 bits (87), Expect = 0.10
Identities = 14/104 (13%), Positives = 31/104 (29%), Gaps = 12/104 (11%)
Query: 2053 ESTTTNNPASESITSSSPA--------SESTTTSSPASESTTTSSP----ASESTTTSSP 2100
E+ PA + ++ A + ST + A+ + P A+ P
Sbjct: 357 EALLVPVPAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRP 416
Query: 2101 ASESTTTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSANED 2144
+ + + T ++ + P E+ + D
Sbjct: 417 VAPPVPHTPESAPKLTRAAIPVDEKPKYTPPAPPKEEEKALIAD 460
>gnl|CDD|220684 pfam10310, DUF2413, Protein of unknown function (DUF2413). This is a
family of proteins conserved in fungi. The function is
not known.
Length = 436
Score = 37.1 bits (86), Expect = 0.11
Identities = 23/104 (22%), Positives = 37/104 (35%), Gaps = 11/104 (10%)
Query: 2038 SSPASESTTTNNPKSESTTTNNPASESITS------SSPASESTTTSSPASESTTTSSPA 2091
S P ++ T K +++ + E I S ++ AS T +P
Sbjct: 7 SLPDEKAPTKKPKKGDASKDSTEDDEDILEFLDELEQSEKAKPPKKPKEASRPGTPRNPK 66
Query: 2092 SESTTTSSPASESTTTS-----SPESESTTTSSPASESTTIEEQ 2130
S T S A+ S S ES ++ + ST EE+
Sbjct: 67 KSSKPTESSAASSEEKPAKPRKSAESTRSSHPKSKAPSTESEEE 110
Score = 34.0 bits (78), Expect = 1.1
Identities = 29/147 (19%), Positives = 47/147 (31%), Gaps = 25/147 (17%)
Query: 1958 SSLVSESTTTSSPESESTTTSSPESE-----------------STTTSSLVSESTTTSSP 2000
SL E T P+ + S E + S T +P
Sbjct: 6 DSLPDEKAPTKKPKKGDASKDSTEDDEDILEFLDELEQSEKAKPPKKPKEASRPGTPRNP 65
Query: 2001 ESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNP 2060
+ S T ES+ SS + S ES +S P S++ +T + + E
Sbjct: 66 KKSSKPT-----ESSAASSEEKPAKPRKSAESTR--SSHPKSKAPSTESEEEEEPEETPD 118
Query: 2061 ASESITSS-SPASESTTTSSPASESTT 2086
SI S T+T++ + +
Sbjct: 119 PIASIGGWWSLWGSITSTATSTASAAV 145
Score = 31.3 bits (71), Expect = 6.9
Identities = 19/108 (17%), Positives = 34/108 (31%), Gaps = 6/108 (5%)
Query: 2009 SPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSS 2068
S ++ S T +P+ S T S A+ S +S S +
Sbjct: 44 SEKAKPPKKPKEASRPGTPRNPKKSSKPTESSAASSEEKPAKPRKSAE-----STRSSHP 98
Query: 2069 SPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTT 2116
+ ST + E T P + S T+T++ + +
Sbjct: 99 KSKAPSTESEEE-EEPEETPDPIASIGGWWSLWGSITSTATSTASAAV 145
>gnl|CDD|221143 pfam11593, Med3, Mediator complex subunit 3 fungal. Mediator is a
large complex of up to 33 proteins that is conserved from
plants to fungi to humans - the number and representation
of individual subunits varying with species. It is
arranged into four different sections, a core, a head, a
tail and a kinase-activity part, and the number of
subunits within each of these is what varies with
species. Overall, Mediator regulates the transcriptional
activity of RNA polymerase II but it would appear that
each of the four different sections has a slightly
different function. Mediator subunit Hrs1/Med3 is a
physical target for Cyc8-Tup1, a yeast transcriptional
co-repressor.
Length = 381
Score = 36.9 bits (85), Expect = 0.11
Identities = 17/117 (14%), Positives = 42/117 (35%), Gaps = 11/117 (9%)
Query: 2010 PVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSS 2069
+ + + ++ T + + S A+ S+T N P + N A S+
Sbjct: 115 TLGTYNQLGNAGASASITKT-----SNGSDAATTSSTANTPAAAKVLKANAA------SA 163
Query: 2070 PASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTT 2126
P + + S+ + + + ++ + +TT P T + + + + A
Sbjct: 164 PNTTTGVGSAATTAAISATTATTPTTTQKKPRKPRQTKKTGPAAAAKAQASAQAQAQ 220
Score = 36.5 bits (84), Expect = 0.15
Identities = 31/206 (15%), Positives = 66/206 (32%), Gaps = 18/206 (8%)
Query: 1878 SESTVVMSTLNSLLSENTTTNSPESESTTTNN----PESESTTTSSPESESTTTSSLVSE 1933
S S S + + ++T N+P + N P + + S+ + + + ++ +
Sbjct: 128 SASITKTSNGSDAATTSSTANTPAAAKVLKANAASAPNTTTGVGSAATTAAISATTATTP 187
Query: 1934 STTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESEST-----TTSSPESESTTTS 1988
+TT P T + + + + S+ + TS T
Sbjct: 188 TTTQKKPRKPRQTKKTGPAAAAKAQASAQAQAQASAYNQMGSLGVPQNTSMLAQIPNPTP 247
Query: 1989 SL-----VSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASE 2043
+ VS + +SP +SP+ + + T S + + S +
Sbjct: 248 LMQLLNGVSPNNAMASP----LNNMSPMRNLNQMGNQNNGGQMTPSANNGNMNNQSRENS 303
Query: 2044 STTTNNPKSESTTTNNPASESITSSS 2069
P + NN +I + S
Sbjct: 304 MNQGMTPSASMINLNNITPANILNMS 329
Score = 36.1 bits (83), Expect = 0.22
Identities = 27/172 (15%), Positives = 65/172 (37%), Gaps = 2/172 (1%)
Query: 1887 LNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTT 1946
L +L + N N+ S S T + S++ TTSS + +T ++ V ++ S+P + +
Sbjct: 113 LETLGTYNQLGNAGASASITKTSNGSDAATTSS--TANTPAAAKVLKANAASAPNTTTGV 170
Query: 1947 TSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTT 2006
S+ + + + ++ + +TT P T + + + + S+ +
Sbjct: 171 GSAATTAAISATTATTPTTTQKKPRKPRQTKKTGPAAAAKAQASAQAQAQASAYNQMGSL 230
Query: 2007 TISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTN 2058
+ + + ++ S + +SP + + N N
Sbjct: 231 GVPQNTSMLAQIPNPTPLMQLLNGVSPNNAMASPLNNMSPMRNLNQMGNQNN 282
Score = 34.6 bits (79), Expect = 0.68
Identities = 20/85 (23%), Positives = 39/85 (45%), Gaps = 2/85 (2%)
Query: 2050 PKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSS 2109
T N + +S+ ++++ S A+ S+T ++PA+ ++ AS TT+
Sbjct: 112 VLETLGTYNQLGNAG--ASASITKTSNGSDAATTSSTANTPAAAKVLKANAASAPNTTTG 169
Query: 2110 PESESTTTSSPASESTTIEEQGVSP 2134
S +TT + A+ +TT P
Sbjct: 170 VGSAATTAAISATTATTPTTTQKKP 194
Score = 32.7 bits (74), Expect = 2.6
Identities = 23/191 (12%), Positives = 63/191 (32%), Gaps = 12/191 (6%)
Query: 1894 NTTTNSPESESTTTNNPESESTTT----SSPESESTTTSSLVSESTTTSSPESESTTTSS 1949
+ + + S+T N P + S+P + + S+ + + + ++ + +TT
Sbjct: 134 TSNGSDAATTSSTANTPAAAKVLKANAASAPNTTTGVGSAATTAAISATTATTPTTTQKK 193
Query: 1950 PESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTIS 2009
P T + + + + S+ + + + ++
Sbjct: 194 PRKPRQTKKTGPAAAAKAQASAQAQAQASAYNQMGSLGVPQNTSMLAQIPNPTPLMQLLN 253
Query: 2010 PVSESTTTSSPV-----SESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASES 2064
VS + +SP+ + + ++ + A+ N + S S S
Sbjct: 254 GVSPNNAMASPLNNMSPMRNLNQMGNQNNGGQMTPSANNGNMNNQSRENSMNQGMTPSAS 313
Query: 2065 ITSS---SPAS 2072
+ + +PA+
Sbjct: 314 MINLNNITPAN 324
Score = 31.9 bits (72), Expect = 4.0
Identities = 24/134 (17%), Positives = 49/134 (36%), Gaps = 5/134 (3%)
Query: 2002 SESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSEST-TTNNP 2060
S S T S S++ TTSS + T + ++ A +TT + + +
Sbjct: 128 SASITKTSNGSDAATTSS----TANTPAAAKVLKANAASAPNTTTGVGSAATTAAISATT 183
Query: 2061 ASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSP 2120
A+ T+ + T + + ++++ +S ++ + P++ S P
Sbjct: 184 ATTPTTTQKKPRKPRQTKKTGPAAAAKAQASAQAQAQASAYNQMGSLGVPQNTSMLAQIP 243
Query: 2121 ASESTTIEEQGVSP 2134
GVSP
Sbjct: 244 NPTPLMQLLNGVSP 257
>gnl|CDD|178666 PLN03119, PLN03119, putative ADP-ribosylation factor
GTPase-activating protein AGD14; Provisional.
Length = 648
Score = 37.5 bits (86), Expect = 0.11
Identities = 51/197 (25%), Positives = 78/197 (39%), Gaps = 30/197 (15%)
Query: 1966 TTSSPESESTTTSSPESESTTTSSL---VSES-TTTSSPESESTTTISPVSESTTT---- 2017
TTSS S ++ +S T+ L VSES T S + +++ + V+EST
Sbjct: 269 TTSSGSVRSVDSNFMSIKSYTSGGLGEAVSESRQNTGSQQGKTSNHVPLVAESTKAPIDL 328
Query: 2018 ----SSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITS--SSPA 2071
+PV++S T P S A S N ++ T + PA+ + P
Sbjct: 329 FQLPGAPVAQSVDTFQP--------SIAPRSPPVNLQQAPQTYSFTPANSFAGNLGQQPT 380
Query: 2072 SESTTTSSPASE---STTTSSPASESTTT-SSPASESTTTSSPESESTTTSSPASESTTI 2127
S + S+P +E S PA++ST +SP E +TS +
Sbjct: 381 SRPSELSAPKNEGWASFDNPMPAAKSTNVITSPGDFQLELKIEEILQPSTSMQLPPYPST 440
Query: 2128 EEQGV----SPHSEKLS 2140
+Q SP E LS
Sbjct: 441 VDQHALSIPSPWQEDLS 457
>gnl|CDD|185274 PRK15376, PRK15376, pathogenicity island 1 effector protein SipA;
Provisional.
Length = 670
Score = 37.3 bits (86), Expect = 0.13
Identities = 32/155 (20%), Positives = 55/155 (35%), Gaps = 9/155 (5%)
Query: 2031 ESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSP 2090
ES +T SS S S + + + T T + AS A + T+ +E+ T +S
Sbjct: 336 ESHHSTNSSNVSHSHSRVDSTTHQTETAHSASTGAIDHGIAGKIDVTAHATAEAVTNASS 395
Query: 2091 ASESTT--TSSPASESTTTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSANE----- 2143
S+ TS + TTS E + T+ S + GV + ++ E
Sbjct: 396 ESKDGKVVTSEKGTTGETTSFDEVDGVTSKSIIGKPVQATVHGVDDNKQQSQTAEIVNVK 455
Query: 2144 --DPEEFPNEDVFEHTFAEIPNIDHSNQTDEAIPE 2176
+ E+V T + N+ +
Sbjct: 456 PLASQLAGVENVKTDTLQSDTTVITGNKAGTTDND 490
>gnl|CDD|179334 PRK01770, PRK01770, sec-independent translocase; Provisional.
Length = 171
Score = 35.6 bits (82), Expect = 0.13
Identities = 27/101 (26%), Positives = 43/101 (42%), Gaps = 8/101 (7%)
Query: 2024 STTTISPE-SESTTTSSPASESTTT----NNPKSESTTTNNPASESITSSSPASESTTTS 2078
S T +SPE S A+ES N+P+ S + + + + A E T
Sbjct: 70 SLTNLSPELKASVDELKQAAESMKRSYAANDPEKASDEAHTIHNPVVKDNEAAHEGVT-- 127
Query: 2079 SPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSS 2119
PA+ T SSP + TT P + + P++ + + SS
Sbjct: 128 -PAAAQTQASSPEQKPETTPEPVVKPAADAEPKTAAPSPSS 167
Score = 31.7 bits (72), Expect = 2.5
Identities = 20/86 (23%), Positives = 35/86 (40%), Gaps = 3/86 (3%)
Query: 2061 ASESITSSSPASESTTTSSPASESTTTSSPASESTTTS-SPASESTTTSSPESESTTTSS 2119
A+ES+ S A++ S A +E+ +PA+ T SSPE + TT
Sbjct: 88 AAESMKRSYAANDPEKASDEAHTIHNPVVKDNEAAHEGVTPAAAQTQASSPEQKPETTPE 147
Query: 2120 PASESTTIEEQGVSPHSEKLSANEDP 2145
P + + + S+++ P
Sbjct: 148 PVVKPA--ADAEPKTAAPSPSSSDKP 171
>gnl|CDD|237868 PRK14960, PRK14960, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 702
Score = 37.3 bits (86), Expect = 0.13
Identities = 32/190 (16%), Positives = 54/190 (28%), Gaps = 7/190 (3%)
Query: 1900 PESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSS 1959
++ N ++++ +P S + + PE E PE E
Sbjct: 374 QNGQAEVGLNSQAQTAQEITPVSAVQPVEVISQPAMVEPEPEPEPEPEPEPEPEPEPEPE 433
Query: 1960 LVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSS 2019
E P + E S V + T + E PV E
Sbjct: 434 PEPEPEPEPQPNQDLMVFDPNHHELIGLESAVVQETVSVLEED-----FIPVPEQKLVQV 488
Query: 2020 PVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSP--ASESTTT 2077
I PE ST E+++ ++ T+ + SE + +E T
Sbjct: 489 QAETQVKQIEPEPASTAEPIGLFEASSAEFSLAQDTSAYDLVSEPVIEQQSLVQAEIVET 548
Query: 2078 SSPASESTTT 2087
+ E T
Sbjct: 549 VAVVKEPNAT 558
Score = 33.1 bits (75), Expect = 2.1
Identities = 29/152 (19%), Positives = 44/152 (28%), Gaps = 14/152 (9%)
Query: 1990 LVSESTTTS-------SPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPAS 2042
LVSE + + ++++ I+PVS + PE E P
Sbjct: 367 LVSEPVQQNGQAEVGLNSQAQTAQEITPVSAVQPVEVISQPAMVEPEPEPEPEPEPEPEP 426
Query: 2043 ESTTTNNPKSESTTTNNPASESITSSSPAS------ESTTTSSPASESTTTSSPASESTT 2096
E P+ E P ++ + P ES S P E
Sbjct: 427 EPEPEPEPEPEPEPEPQP-NQDLMVFDPNHHELIGLESAVVQETVSVLEEDFIPVPEQKL 485
Query: 2097 TSSPASESTTTSSPESESTTTSSPASESTTIE 2128
A PE ST E+++ E
Sbjct: 486 VQVQAETQVKQIEPEPASTAEPIGLFEASSAE 517
Score = 33.1 bits (75), Expect = 2.5
Identities = 19/110 (17%), Positives = 35/110 (31%), Gaps = 12/110 (10%)
Query: 2067 SSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTT 2126
++ + +++ +P S + + PE E P E
Sbjct: 371 PVQQNGQAEVGLNSQAQTAQEITPVSAVQPVEVISQPAMVEPEPEPEPEPEPEPEPEPEP 430
Query: 2127 IEEQGVSPHSEKLSANEDPEEFPNED--VFEHTFAEIPNIDHSNQTDEAI 2174
E P +PE PN+D VF+ E+ ++ S E +
Sbjct: 431 EPEPEPEP---------EPEPQPNQDLMVFDPNHHELIGLE-SAVVQETV 470
>gnl|CDD|234351 TIGR03773, anch_rpt_wall, putative ABC transporter-associated repeat
protein. Members of this protein family occur in genomes
that contain a three-gene ABC transporter operon
associated with the presence of domain TIGR03769. That
domain occurs as a single-copy insert in the
substrate-binding protein, and occurs in two or more
copies in members of this protein family. Members of this
family typically are encoded adjacent to the said
transporter operon and may serve as a substrate receptor.
Length = 513
Score = 36.9 bits (85), Expect = 0.14
Identities = 18/127 (14%), Positives = 36/127 (28%), Gaps = 9/127 (7%)
Query: 1975 TTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESES 2034
T T++ + S T T I P +T P +++ P ++
Sbjct: 142 TVTATADLADGGAKS--KPETYTVVVGKVEVDKIDPARCATGAGKPQNDA---NGPAADK 196
Query: 2035 TTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASES 2094
PAS + + S + PA + ++P++ S
Sbjct: 197 PLFDDPASGVQALGDESAFSPGQQATVQIGKSVRLPAD----APLGVAAVVVKAAPSTGS 252
Query: 2095 TTTSSPA 2101
+
Sbjct: 253 SDAEGGL 259
Score = 36.9 bits (85), Expect = 0.17
Identities = 21/123 (17%), Positives = 36/123 (29%), Gaps = 11/123 (8%)
Query: 2005 TTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASES 2064
T T + S T T+ PA +T P++++ N PA++
Sbjct: 142 TVTATADLADGGAKS--KPETYTVVVGKVEVDKIDPARCATGAGKPQNDA---NGPAADK 196
Query: 2065 ITSSSPASESTTTSSPASES----TTTSSPASESTTTSSPASESTTTSS--PESESTTTS 2118
PAS ++ S T S +P + P + S+
Sbjct: 197 PLFDDPASGVQALGDESAFSPGQQATVQIGKSVRLPADAPLGVAAVVVKAAPSTGSSDAE 256
Query: 2119 SPA 2121
Sbjct: 257 GGL 259
Score = 32.2 bits (73), Expect = 3.5
Identities = 19/118 (16%), Positives = 35/118 (29%), Gaps = 11/118 (9%)
Query: 2015 TTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASES 2074
T T++ S T T +P +T P +++ PA++
Sbjct: 142 TVTATADLADGGAKS--KPETYTVVVGKVEVDKIDPARCATGAGKPQNDAN---GPAADK 196
Query: 2075 TTTSSPASESTTTSSPASES----TTTSSPASESTTTSSPESESTTTSS--PASESTT 2126
PAS ++ S T S +P + P++ S+
Sbjct: 197 PLFDDPASGVQALGDESAFSPGQQATVQIGKSVRLPADAPLGVAAVVVKAAPSTGSSD 254
>gnl|CDD|185641 PTZ00462, PTZ00462, Serine-repeat antigen protein; Provisional.
Length = 1004
Score = 37.3 bits (86), Expect = 0.14
Identities = 15/40 (37%), Positives = 25/40 (62%), Gaps = 4/40 (10%)
Query: 2357 HSVKIIGWGKSSQNE----PYWLCTNSYNQGWGEQGLFKI 2392
H+V I+G+G +E YW+ NS+ + WG++G FK+
Sbjct: 723 HAVNIVGYGNYINDEDEKKSYWIVRNSWGKYWGDEGYFKV 762
Score = 32.3 bits (73), Expect = 4.5
Identities = 13/83 (15%), Positives = 27/83 (32%), Gaps = 5/83 (6%)
Query: 2053 ESTTTNNPASESITSSSPASESTTTSSP-----ASESTTTSSPASESTTTSSPASESTTT 2107
E N + + SP A+ + + ES+ + P +
Sbjct: 26 EDDDNGNIGGGQAGGTGGDNAGNIDGSPIGNLDANIHASFGADPKESSGANLPGKKEKKK 85
Query: 2108 SSPESESTTTSSPASESTTIEEQ 2130
++S + S++IE+Q
Sbjct: 86 KEIRGHDIMSNSDSQNSSSIEKQ 108
>gnl|CDD|218881 pfam06070, Herpes_UL32, Herpesvirus large structural phosphoprotein
UL32. The large phosphorylated protein (UL32-like) of
herpes viruses is the polypeptide most frequently
reactive in immuno-blotting analyses with antisera when
compared with other viral proteins.
Length = 777
Score = 37.2 bits (86), Expect = 0.14
Identities = 32/241 (13%), Positives = 67/241 (27%), Gaps = 16/241 (6%)
Query: 1917 TSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTT 1976
S E + S E S T + ++ S ES S
Sbjct: 272 EDSLEYDDPGLES-TDEDDDDDGDSSLQTFKPLLDLTGSSLWSDDEESGDEDGDGSGFAP 330
Query: 1977 TSSPESESTTTSSLV-----SESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPE 2031
+++S + +LV S S ++ T++ + + + E ++ E
Sbjct: 331 EPLIKTDSRSNDTLVDLGRGGGSLKLDSVDAPGTSSYLFEPGLSPSPNSGKEMPGILTTE 390
Query: 2032 SESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPA 2091
+ +S S + + + NN + +P ++ + + +
Sbjct: 391 NLDLPLASTDSTEMDPEDKRGGAVKINNSGILAWGLKTPGLAV-------NDERSIAVSS 443
Query: 2092 SESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSANEDPEEFPNE 2151
T P S SS + + S P+ + + E
Sbjct: 444 DGITDVLDPPSPLRLHSSDKV-IDSVSPPSKRRVSAPASRLDDAKRP--EVTATPESSGS 500
Query: 2152 D 2152
D
Sbjct: 501 D 501
>gnl|CDD|215598 PLN03138, PLN03138, Protein TOC75; Provisional.
Length = 796
Score = 37.1 bits (86), Expect = 0.14
Identities = 22/95 (23%), Positives = 40/95 (42%), Gaps = 15/95 (15%)
Query: 2030 PESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSS 2089
S ST S+ AS S +++ P+ S S S + T SP + S S
Sbjct: 1 GRSSSTMVSAAASTSLSSSRPQLSS-------------FSSRSPQSATRSPRASSIKCS- 46
Query: 2090 PASESTTTSSPASESTTTSSPESESTTTSSPASES 2124
AS S ++S+ +S ++ ++ + S+ +
Sbjct: 47 -ASASASSSATSSSASLVANGAVALLSASAISGGG 80
Score = 36.4 bits (84), Expect = 0.24
Identities = 15/81 (18%), Positives = 41/81 (50%), Gaps = 2/81 (2%)
Query: 2014 STTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASE 2073
++++ + ++T++S ++ S S + T +P++ S + AS S +SS+ +S
Sbjct: 2 RSSSTMVSAAASTSLSSSRPQLSSFSSRSPQSATRSPRASSIKCS--ASASASSSATSSS 59
Query: 2074 STTTSSPASESTTTSSPASES 2094
++ ++ A + S+ +
Sbjct: 60 ASLVANGAVALLSASAISGGG 80
Score = 35.6 bits (82), Expect = 0.42
Identities = 14/81 (17%), Positives = 36/81 (44%), Gaps = 2/81 (2%)
Query: 1914 STTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESE 1973
++++ + ++T+ S ++ S S + T SP + S S S S ++S+ S
Sbjct: 2 RSSSTMVSAAASTSLSSSRPQLSSFSSRSPQSATRSPRASSIKCS--ASASASSSATSSS 59
Query: 1974 STTTSSPESESTTTSSLVSES 1994
++ ++ + S++
Sbjct: 60 ASLVANGAVALLSASAISGGG 80
Score = 35.6 bits (82), Expect = 0.45
Identities = 14/81 (17%), Positives = 34/81 (41%), Gaps = 2/81 (2%)
Query: 1944 STTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESE 2003
++++ + ++T+ S ++ S S + T SP + S S S S ++S+ S
Sbjct: 2 RSSSTMVSAAASTSLSSSRPQLSSFSSRSPQSATRSPRASSIKCS--ASASASSSATSSS 59
Query: 2004 STTTISPVSESTTTSSPVSES 2024
++ + + S+
Sbjct: 60 ASLVANGAVALLSASAISGGG 80
Score = 34.4 bits (79), Expect = 0.90
Identities = 24/80 (30%), Positives = 33/80 (41%), Gaps = 8/80 (10%)
Query: 2070 PASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEE 2129
S ST S+ AS S ++S P S S S + T SP + S S+ AS S++
Sbjct: 1 GRSSSTMVSAAASTSLSSSRPQLSS---FSSRSPQSATRSPRASSIKCSASASASSS--- 54
Query: 2130 QGVSPHSEKLSANEDPEEFP 2149
+ S L AN
Sbjct: 55 --ATSSSASLVANGAVALLS 72
Score = 31.0 bits (70), Expect = 9.7
Identities = 16/81 (19%), Positives = 39/81 (48%), Gaps = 2/81 (2%)
Query: 1994 STTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSE 2053
++++ + ++T++S ++ S S + T SP + S S AS S +++ S
Sbjct: 2 RSSSTMVSAAASTSLSSSRPQLSSFSSRSPQSATRSPRASSIKCS--ASASASSSATSSS 59
Query: 2054 STTTNNPASESITSSSPASES 2074
++ N A +++S+ +
Sbjct: 60 ASLVANGAVALLSASAISGGG 80
>gnl|CDD|227416 COG5084, YTH1, Cleavage and polyadenylation specificity factor (CPSF)
Clipper subunit and related makorin family Zn-finger
proteins [General function prediction only].
Length = 285
Score = 36.4 bits (84), Expect = 0.14
Identities = 32/175 (18%), Positives = 50/175 (28%), Gaps = 10/175 (5%)
Query: 1874 TNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTS----SPESESTTTSS 1929
T NN + V+ S++ S S S S + ++ S
Sbjct: 92 TPNNHVNPVLSSSVVCKFFLRGLCKSGFSCEFLHEYDLRSSQGPPCRSFSLKGSCSSGPS 151
Query: 1930 LVS--ESTTTSSPESE----STTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESE 1983
+ + + +T P S S + + SSP T SP
Sbjct: 152 CGYSHIDPDSFAGNCDQYSGATYGFCPLGASCKFSHTLKRVSYGSSPCGNYTPPFSPPGT 211
Query: 1984 STTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTS 2038
+ + S TS S + I T S S T I SE + +
Sbjct: 212 PSESVSSWGYGKGTSCSLSHPSLNIDIQQPQTAPSRKDSGGTNPIGASSEIGSEA 266
Score = 35.6 bits (82), Expect = 0.25
Identities = 50/296 (16%), Positives = 78/296 (26%), Gaps = 39/296 (13%)
Query: 1862 SVIDNYSEIIFTTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPE 1921
Y+ +T+ N V S SP S T N + T+
Sbjct: 16 GSGCTYNHSNYTSLN-DGLQSVSSKYMGA-----KQISPSLSSPTFKNKANLMQNTNDN- 68
Query: 1922 SESTTTSSLVSESTTTSSPESESTTTSSPE---SESTTTSSLVSESTTTSSPESESTTTS 1978
T + +S + S S +T ++ S+ S E
Sbjct: 69 FVPGNTVACISRNFN-SIRGSRLSTPNNHVNPVLSSSVVCKFFLRGLCKSGFSCEFLHEY 127
Query: 1979 SPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTS 2038
S S + SS S + I P + + + S +T
Sbjct: 128 DLRSSQGPPCRSFSLKGSCSSGPSCGYSHIDP-----DSFAGNCDQ------YSGATYGF 176
Query: 2039 SPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTS 2098
P + + T S SSP T SP + + S TS
Sbjct: 177 CP-------LGASCKFSHTLKRVS---YGSSPCGNYTPPFSPPGTPSESVSSWGYGKGTS 226
Query: 2099 SPASESTTTSSPESESTTTSS-------PASESTTIEEQGVSPHSEKLSANEDPEE 2147
S + + T S P S+ I + +S + D EE
Sbjct: 227 CSLSHPSLNIDIQQPQTAPSRKDSGGTNPIGASSEIGSEADGNMQNSISGSGDSEE 282
>gnl|CDD|149648 pfam08662, eIF2A, Eukaryotic translation initiation factor eIF2A.
This is a family of eukaryotic translation initiation
factors.
Length = 194
Score = 35.7 bits (83), Expect = 0.15
Identities = 19/74 (25%), Positives = 31/74 (41%), Gaps = 11/74 (14%)
Query: 494 SVAFNDTAECVLTGGIDN---DIKMWDLRTNSVVQKLRGHSDTVTGLSLSPDGSYILSNA 550
++ ++ VL G N I+ WD++ + S+ T SPDG Y L+
Sbjct: 105 TIFWSPFGRLVLLAGFGNLAGQIEFWDVKNKKKIATAE-ASNA-TDCEWSPDGRYFLTAT 162
Query: 551 ------MDNTVRIW 558
+DN +IW
Sbjct: 163 TSPRLRVDNGFKIW 176
>gnl|CDD|221825 pfam12877, DUF3827, Domain of unknown function (DUF3827). This
family contains the human KIAA1549 protein which has been
found to be fused fused to BRAF gene in many cases of
pilocytic astrocytomas. The fusion is due mainly to a
tandem duplication of 2 Mb at 7q34. Although nothing is
known about the function of KIAA1549 protein, the BRAF
protein is a well characterized oncoprotein. It is a
serine/threonine protein kinase which is implicated in
MAP/ERK signalling, a critical pathway for the regulation
of cell division, differentiation and secretion.
Length = 684
Score = 36.8 bits (85), Expect = 0.15
Identities = 30/178 (16%), Positives = 53/178 (29%), Gaps = 11/178 (6%)
Query: 1878 SESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTT 1937
+ + SL E+ +P+S+S+ + + S+ + S S +
Sbjct: 343 EPAPLPPLKKESLPIEDAEVPTPKSKSSQDGSSNKKRRRGRKSPSDGDSEGS--SVISNR 400
Query: 1938 SSPESESTTTSSPESESTTTSSLVSESTTTSSPES--ESTTTSSPESESTTTSSLVSEST 1995
SS E +S S+ S + E +P S + +S+ E S S
Sbjct: 401 SSRE-KSGRPSTTPSVTAQQKPTKEEGRKKPAPPSGTDEQLSSASIFEHVDRLSRPSSDP 459
Query: 1996 TTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSE 2053
S + P +P S + + E KSE
Sbjct: 460 YDRSSGKIQLIAMQP------MPAPPVPPRFEPSRDDRAAENGKVNKEIQVALRHKSE 511
Score = 33.8 bits (77), Expect = 1.4
Identities = 31/154 (20%), Positives = 48/154 (31%), Gaps = 27/154 (17%)
Query: 1972 SESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPE 2031
E +P+S+S S SS + SP + SS +S S
Sbjct: 357 IEDAEVPTPKSKS---------SQDGSSNKKRRRGRKSPSDGDSEGSSVISNR----SSR 403
Query: 2032 SESTTTSSPASESTTTNNPKSESTTTNNPAS--ESITSSSPASESTT-TSSPASESTTTS 2088
+S S+ S + K E P S + SS+ E S P+S+ S
Sbjct: 404 EKSGRPSTTPSVTAQQKPTKEEGRKKPAPPSGTDEQLSSASIFEHVDRLSRPSSDPYDRS 463
Query: 2089 S-----------PASESTTTSSPASESTTTSSPE 2111
S PA P+ + + +
Sbjct: 464 SGKIQLIAMQPMPAPPVPPRFEPSRDDRAAENGK 497
Score = 32.2 bits (73), Expect = 4.3
Identities = 32/145 (22%), Positives = 49/145 (33%), Gaps = 17/145 (11%)
Query: 1929 SLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESES--TT 1986
SL E +P+S+S+ S S S S S S + SS E +T
Sbjct: 354 SLPIEDAEVPTPKSKSSQDGS--SNKKRRRGRKSPSDGDSEGSSVISNRSSREKSGRPST 411
Query: 1987 TSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTIS-PESESTTTSS------ 2039
T S+ ++ T + S E +++S + E +S P S+ SS
Sbjct: 412 TPSVTAQQKPTKEEGRKKPAPPSGTDEQLSSAS-IFEHVDRLSRPSSDPYDRSSGKIQLI 470
Query: 2040 -----PASESTTTNNPKSESTTTNN 2059
PA P + N
Sbjct: 471 AMQPMPAPPVPPRFEPSRDDRAAEN 495
>gnl|CDD|224346 COG1429, CobN, Cobalamin biosynthesis protein CobN and related
Mg-chelatases [Coenzyme metabolism].
Length = 1388
Score = 37.0 bits (86), Expect = 0.15
Identities = 14/77 (18%), Positives = 25/77 (32%), Gaps = 5/77 (6%)
Query: 1898 NSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESE----STTTSSPESE 1953
+ +T +T S+ S S+ T + + S + T + E
Sbjct: 1291 AFAPASATPGAPESVGTTAVSTASSASSATVTGSDAGSGADSTGPSLGAAGSVTGAGEGY 1350
Query: 1954 STTTSSLVSESTTTSSP 1970
T + VS S +T
Sbjct: 1351 EMTKEA-VSGSESTGMS 1366
Score = 36.6 bits (85), Expect = 0.24
Identities = 20/83 (24%), Positives = 32/83 (38%), Gaps = 6/83 (7%)
Query: 1923 ESTTTSSLVSESTTTSSPESESTT-TSSPESESTTTSSLVSESTTTSSPESE----STTT 1977
+T ++ S T +PES TT S+ S S+ T + + S + T
Sbjct: 1285 AATRYAAFAPASATPGAPESVGTTAVSTASSASSATVTGSDAGSGADSTGPSLGAAGSVT 1344
Query: 1978 SSPESESTTTSSLVSESTTTSSP 2000
+ E T + VS S +T
Sbjct: 1345 GAGEGYEMTKEA-VSGSESTGMS 1366
Score = 35.8 bits (83), Expect = 0.35
Identities = 18/91 (19%), Positives = 32/91 (35%), Gaps = 10/91 (10%)
Query: 1953 ESTTTSSLVSESTTTSSPESESTT-TSSPESESTTTSSLVSESTTTSSPESESTTTISPV 2011
+T ++ S T +PES TT S+ S S+ T + + S
Sbjct: 1285 AATRYAAFAPASATPGAPESVGTTAVSTASSASSATVTGSDAGSGADSTGPSL------- 1337
Query: 2012 SESTTTSSPVSESTTTISPESESTTTSSPAS 2042
S + ++ E+ S + S+ S
Sbjct: 1338 --GAAGSVTGAGEGYEMTKEAVSGSESTGMS 1366
Score = 34.3 bits (79), Expect = 1.0
Identities = 13/85 (15%), Positives = 32/85 (37%), Gaps = 5/85 (5%)
Query: 2059 NPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTS 2118
+ + ++ + +T + + +T S+ +S S+ T + + + S
Sbjct: 1282 ATYAATRYAAFAPASATPGAPESVGTTAVSTASSASSATVTGSDAGSGADSTGPSL---G 1338
Query: 2119 SPASESTTIEEQGVSPHSEKLSANE 2143
+ S + E G E +S +E
Sbjct: 1339 AAGSVTGAGE--GYEMTKEAVSGSE 1361
>gnl|CDD|165564 PHA03309, PHA03309, transcriptional regulator ICP4; Provisional.
Length = 2033
Score = 37.1 bits (85), Expect = 0.16
Identities = 37/152 (24%), Positives = 63/152 (41%), Gaps = 14/152 (9%)
Query: 1982 SESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTS--- 2038
S S+++SS S S+ +S P +T ++SP S S +PV S + E + + +
Sbjct: 1817 SSSSSSSSSSSSSSPSSRPSRSATPSLSP-SPSPPRRAPVDRSRSGRRRERDRPSANPFR 1875
Query: 2039 -SPASESTTTNNPKSES------TTTNNPA-SESITSSSPASESTTTSSP--ASESTTTS 2088
+P S ++P + + P I + S A+ + S P + + T T
Sbjct: 1876 WAPRQRSRADHSPDGTAPGDAPLNLEDGPGRGRPIWTPSSATTLPSRSGPEDSVDETETE 1935
Query: 2089 SPASESTTTSSPASESTTTSSPESESTTTSSP 2120
A + SP S S +SE S+P
Sbjct: 1936 DSAPPARLAPSPLETSRAEDSEDSEYPEYSNP 1967
Score = 36.8 bits (84), Expect = 0.22
Identities = 36/186 (19%), Positives = 67/186 (36%), Gaps = 7/186 (3%)
Query: 1969 SPES-----ESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSE 2023
SPE +S S P + ++ ++ S S+++ S S S+ +S P
Sbjct: 1779 SPERVLGRRQSRRDSVPVRRRSGAANCGGRWMISAGRSSSSSSSSSSSSSSSPSSRPSRS 1838
Query: 2024 STTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASE 2083
+T ++SP S S +P S + + + NP + S A S ++P
Sbjct: 1839 ATPSLSP-SPSPPRRAPVDRSRSGRR-RERDRPSANPFRWAPRQRSRADHSPDGTAPGDA 1896
Query: 2084 STTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSANE 2143
+P+S +T S E + + +S SP + +
Sbjct: 1897 PLNLEDGPGRGRPIWTPSSATTLPSRSGPEDSVDETETEDSAPPARLAPSPLETSRAEDS 1956
Query: 2144 DPEEFP 2149
+ E+P
Sbjct: 1957 EDSEYP 1962
Score = 35.6 bits (81), Expect = 0.46
Identities = 46/199 (23%), Positives = 72/199 (36%), Gaps = 20/199 (10%)
Query: 1938 SSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTT 1997
S+ S S+++SS S S++ SS S S T S S S +P S + + +
Sbjct: 1812 SAGRSSSSSSSSSSSSSSSPSSRPSRSATPSLSPSPSPPRRAPVDRSRSGRRRERDRPSA 1871
Query: 1998 S----SPESESTTTISPVSESTTTSSPVS------ESTTTISPESESTTTSSPASESTTT 2047
+ +P S SP + +P++ +P S +T S E +
Sbjct: 1872 NPFRWAPRQRSRADHSP-DGTAPGDAPLNLEDGPGRGRPIWTPSSATTLPSRSGPEDSV- 1929
Query: 2048 NNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTT 2107
+ T T + A + + SP S S SE S+P S PA +S
Sbjct: 1930 -----DETETEDSAPPARLAPSPLETSRAEDSEDSEYPEYSNP---RLGKSPPALKSREA 1981
Query: 2108 SSPESESTTTSSPASESTT 2126
P S+ S T
Sbjct: 1982 RRPSSKQPRRPSSGKNGHT 2000
Score = 34.1 bits (77), Expect = 1.3
Identities = 45/197 (22%), Positives = 73/197 (37%), Gaps = 23/197 (11%)
Query: 1922 SESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTS------------- 1968
S S+++SS S S+ +S P +T + SP S S + V S +
Sbjct: 1817 SSSSSSSSSSSSSSPSSRPSRSATPSLSP-SPSPPRRAPVDRSRSGRRRERDRPSANPFR 1875
Query: 1969 -SPESESTTTSSPESESTTTSSL-VSESTTTSSP-ESESTTTISPVSESTTTSSPVSEST 2025
+P S SP+ + + L + + P + S+ T P S +E+
Sbjct: 1876 WAPRQRSRADHSPDGTAPGDAPLNLEDGPGRGRPIWTPSSATTLPSRSGPEDSVDETETE 1935
Query: 2026 TTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASEST 2085
+ P + SP S ++ SE +NP + S PA +S P+S+
Sbjct: 1936 DSAPP---ARLAPSPLETSRAEDSEDSEYPEYSNP---RLGKSPPALKSREARRPSSKQP 1989
Query: 2086 TTSSPASESTTTSSPAS 2102
S T S AS
Sbjct: 1990 RRPSSGKNGHTDVSAAS 2006
>gnl|CDD|227928 COG5641, GAT1, GATA Zn-finger-containing transcription factor
[Transcription].
Length = 498
Score = 36.8 bits (85), Expect = 0.16
Identities = 54/265 (20%), Positives = 93/265 (35%), Gaps = 26/265 (9%)
Query: 1885 STLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTS--SPES 1942
+L S ++ ++ S S N+ E+ T ES +++S +++S +
Sbjct: 204 ISLKSDSIKSRSSRS----SHNNNDSNGENANT---ESIGNSSASKLTKSWEERPQGRQL 256
Query: 1943 ESTTTSSPESESTTTSSLVSESTTTSSPESEST---TTSSPESESTTTSSLVSESTTTSS 1999
S S + S L+ ++S + ST S + ST T+S + +S
Sbjct: 257 LSDAGSLSPRSNNPKSPLLEGLMGSTSLQPVSTPKLVLPSDKKRSTLTTSTATPLWRRTS 316
Query: 2000 PESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNN 2059
+S + S + S P +S S +T N ST TN
Sbjct: 317 DKSSFSCNASGSALKPPGS---------KRPLLPKPDPNSKRSNATCMNC---SSTPTNK 364
Query: 2060 PASESITSSSPASESTTTSSPASESTT-TSSPASESTTTSSPASESTTTSSPESESTTTS 2118
S TS+SP ++ + S T + PA S+ +P + +
Sbjct: 365 ILSPPTTSNSPGAQVKLPNQTRSTGATKKKITRRRMNSGKIPALSSSMK-NPVPKEFSPL 423
Query: 2119 SPASESTTIEEQGVSPHSEKLSANE 2143
P S + Q S + KL E
Sbjct: 424 IPQSTESETPSQSKSSLTSKLEEFE 448
>gnl|CDD|223033 PHA03291, PHA03291, envelope glycoprotein I; Provisional.
Length = 401
Score = 36.5 bits (84), Expect = 0.20
Identities = 33/139 (23%), Positives = 55/139 (39%), Gaps = 15/139 (10%)
Query: 2041 ASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSP 2100
+E T P E + + ++ S+P PA+ T + AS TT +
Sbjct: 167 PAEGTLAAPPLGE-GSADGSCDPALPLSAPRLGPADVFVPATPRPTPRTTASPETTPTPS 225
Query: 2101 ASESTTTSSPESESTTTSSPASESTTIEEQGVSPHS----EKLSANEDPEEFPNEDVFEH 2156
+ S +++ + STT ++P + +T E +P + E AN P P +E
Sbjct: 226 TTTSPPSTTIPAPSTTIAAPQAGTTPEAEGTPAPPTPGGGEAPPANATPA--PEASRYEL 283
Query: 2157 TFAEIPNIDHSNQTDEAIP 2175
T +I I AIP
Sbjct: 284 TVTQIIQI--------AIP 294
Score = 34.5 bits (79), Expect = 0.80
Identities = 28/139 (20%), Positives = 48/139 (34%), Gaps = 7/139 (5%)
Query: 1983 ESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPAS 2042
E T +SL E T +P + + +P PA+
Sbjct: 151 EGATNASLFPLGLAAFPAEG---TLAAPPLGEGSADGSCDPALPLSAPRLGPADVFVPAT 207
Query: 2043 ESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPAS----ESTTTSSPASESTTTS 2098
T S TT + S S++ + STT ++P + E+ T +P + +
Sbjct: 208 PRPTPRTTASPETTPTPSTTTSPPSTTIPAPSTTIAAPQAGTTPEAEGTPAPPTPGGGEA 267
Query: 2099 SPASESTTTSSPESESTTT 2117
PA+ + + E T T
Sbjct: 268 PPANATPAPEASRYELTVT 286
Score = 31.5 bits (71), Expect = 6.6
Identities = 20/97 (20%), Positives = 35/97 (36%), Gaps = 4/97 (4%)
Query: 1996 TTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSEST 2055
++P TT SP + T S+ S +TTI S + + P
Sbjct: 204 VPATPRPTPRTTASP-ETTPTPSTTTSPPSTTIPAPSTTIAAPQAGTTPEAEGTPA---P 259
Query: 2056 TTNNPASESITSSSPASESTTTSSPASESTTTSSPAS 2092
T +++PA E++ ++ + PAS
Sbjct: 260 PTPGGGEAPPANATPAPEASRYELTVTQIIQIAIPAS 296
>gnl|CDD|240310 PTZ00200, PTZ00200, cysteine proteinase; Provisional.
Length = 448
Score = 36.2 bits (84), Expect = 0.21
Identities = 19/60 (31%), Positives = 30/60 (50%), Gaps = 8/60 (13%)
Query: 2346 SCEGSINPRYIHSVKIIGWGKSSQ-NEPYWLCTNSYNQGWGEQGLFKIRR---GVNMCSI 2401
C S+N H+V ++G G + + YW+ NS+ WGE G ++ R G + C I
Sbjct: 381 ECGKSLN----HAVLLVGEGYDEKTKKRYWIIKNSWGTDWGENGYMRLERTNEGTDKCGI 436
>gnl|CDD|165099 PHA02732, PHA02732, hypothetical protein; Provisional.
Length = 1467
Score = 36.7 bits (84), Expect = 0.22
Identities = 37/202 (18%), Positives = 77/202 (38%), Gaps = 17/202 (8%)
Query: 1931 VSESTTTSSPESESTTTSSPESESTTTSSLVSES---TTTSSPESESTTT------SSPE 1981
VS + P + + S ++ SS V+ + + P + T+ + P+
Sbjct: 1058 VSYFAASQGPSPFTFVSPSYIFLNSWASSYVAPGFLGSPYALPYFMNQTSALVGNTALPK 1117
Query: 1982 SESTTTSSLVSESTTTSSPESESTTTISPV---SESTTTSSPVSESTTTISPESESTTTS 2038
+ + + T S+ ++T SPV + S + + S +++ S
Sbjct: 1118 GLNVFSGYMFGAGTVASAFLYMNSTPQSPVLALLLAPYISYKFNALSLGFSITADAAIFS 1177
Query: 2039 S---PASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTS-SPASES 2094
PA + ++ P + S +P I T T + + S + + +
Sbjct: 1178 LFGIPAPQLLSSYIP-TGSVLYQDPIFTYIPPGIIGMSGTNTFTFKAAQLQLSAASSPPA 1236
Query: 2095 TTTSSPASESTTTSSPESESTT 2116
TT +P S+++SS +S ST+
Sbjct: 1237 ATTPTPPPSSSSSSSAQSISTS 1258
Score = 34.3 bits (78), Expect = 0.99
Identities = 36/186 (19%), Positives = 64/186 (34%), Gaps = 21/186 (11%)
Query: 1925 TTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESES 1984
TS+LV + S T S+ + ++T SP +P S
Sbjct: 1104 NQTSALVGNTALPKGLNVFSGYMFGA---GTVASAFLYMNSTPQSPVL--ALLLAPYI-S 1157
Query: 1985 TTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPAS-- 2042
++L + T+ S I P + ++ P T ++ + T P
Sbjct: 1158 YKFNALSLGFSITADAAIFSLFGI-PAPQLLSSYIP----TGSVLYQDPIFTYIPPGIIG 1212
Query: 2043 -ESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPA 2101
T T K+ + +SSP + +T T P+S S++++ S S
Sbjct: 1213 MSGTNTFTFKAAQLQLSA-------ASSPPAATTPTPPPSSSSSSSAQSISTSPGQIQIV 1265
Query: 2102 SESTTT 2107
+TT
Sbjct: 1266 LNGSTT 1271
Score = 34.3 bits (78), Expect = 1.1
Identities = 20/92 (21%), Positives = 34/92 (36%), Gaps = 3/92 (3%)
Query: 1917 TSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTT 1976
S T S L + T P + + T + + + SSP + +T
Sbjct: 1184 PQLLSSYIPTGSVLYQDPIFTYIP---PGIIGMSGTNTFTFKAAQLQLSAASSPPAATTP 1240
Query: 1977 TSSPESESTTTSSLVSESTTTSSPESESTTTI 2008
T P S S++++ +S S +TTI
Sbjct: 1241 TPPPSSSSSSSAQSISTSPGQIQIVLNGSTTI 1272
Score = 32.4 bits (73), Expect = 3.6
Identities = 24/141 (17%), Positives = 47/141 (33%), Gaps = 2/141 (1%)
Query: 1888 NSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTT 1947
++ L N+T SP P + S T + + +P+ S+
Sbjct: 1134 SAFLYMNSTPQSPVLALLLA--PYISYKFNALSLGFSITADAAIFSLFGIPAPQLLSSYI 1191
Query: 1948 SSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTT 2007
+ + + T + + + + SS + +T T P S S+++
Sbjct: 1192 PTGSVLYQDPIFTYIPPGIIGMSGTNTFTFKAAQLQLSAASSPPAATTPTPPPSSSSSSS 1251
Query: 2008 ISPVSESTTTSSPVSESTTTI 2028
+S S V +TTI
Sbjct: 1252 AQSISTSPGQIQIVLNGSTTI 1272
Score = 32.4 bits (73), Expect = 3.8
Identities = 40/211 (18%), Positives = 71/211 (33%), Gaps = 36/211 (17%)
Query: 1838 LLSVSPYITNNLLISMLAATAVAISVIDNYSEIIFTTNNNSESTVVMSTLNSLLSENTTT 1897
+ S+ LL S + +V Y + IFT + MS N+ +
Sbjct: 1175 IFSLFGIPAPQLLSSYIPTGSVL------YQDPIFTYI--PPGIIGMSGTNTFTFKAAQL 1226
Query: 1898 NSPESESTTTNNPESESTTTSSPESESTT-TSSLVSESTTTSSPESESTTT--------- 1947
S +++ +TT + P S S++ ++ +S S +TT
Sbjct: 1227 QL--SAASSP----PAATTPTPPPSSSSSSSAQSISTSPGQIQIVLNGSTTIHINFLFFP 1280
Query: 1948 --SSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESEST 2005
S+P+ +V+ S S S+ + T V + T ++
Sbjct: 1281 ALSTPKIGQILAMPIVNSSGAFIS----LYVNSAISANFNVTIEYVFSNGTVIKRFTDEP 1336
Query: 2006 TTISPVSESTTTSSPVSESTTTISPESESTT 2036
I P+ + IS E+ESTT
Sbjct: 1337 GQIFPLP---LINGDEE---VIISVENESTT 1361
Score = 31.6 bits (71), Expect = 6.4
Identities = 16/75 (21%), Positives = 28/75 (37%), Gaps = 1/75 (1%)
Query: 2060 PASESITSSSP-ASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTS 2118
PA + ++S P S + +T T A + +S +TT +
Sbjct: 1182 PAPQLLSSYIPTGSVLYQDPIFTYIPPGIIGMSGTNTFTFKAAQLQLSAASSPPAATTPT 1241
Query: 2119 SPASESTTIEEQGVS 2133
P S S++ Q +S
Sbjct: 1242 PPPSSSSSSSAQSIS 1256
>gnl|CDD|177577 PHA03292, PHA03292, envelope glycoprotein I; Provisional.
Length = 413
Score = 36.1 bits (83), Expect = 0.24
Identities = 24/148 (16%), Positives = 53/148 (35%), Gaps = 13/148 (8%)
Query: 1936 TTSSPESESTTTSSPESESTTTSSLVSESTTTSS-PESESTTTSSPESESTTTSSLVSES 1994
TT+ PE + ++P + + S + SS P +T T +P ++ T+
Sbjct: 178 TTARPEPAAGYVATPTPRYLNAVTTSTYSRSMSSQPAGAATATPTPTLDTGLTTVAPPNE 237
Query: 1995 TTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSES 2054
T + + P + T + +T ++ + T S P +
Sbjct: 238 TVVTGETALLCHWFQPSTRVPTLYLHLLGTTGNLTEDVLLTED------SEILRTPPPDP 291
Query: 2055 TTTNNPASESITSSSPASESTTTSSPAS 2082
+++ +P + + T ++SP
Sbjct: 292 SSSRSPGAGD------DFKQTNSTSPKR 313
Score = 34.5 bits (79), Expect = 0.63
Identities = 27/142 (19%), Positives = 51/142 (35%), Gaps = 11/142 (7%)
Query: 1966 TTSSPESESTTTSSPESESTTTSSLVSESTTTSS-PESESTTTISPVSESTTTSSPVSES 2024
TT+ PE + ++P + + S + SS P +T T +P ++ T+
Sbjct: 178 TTARPEPAAGYVATPTPRYLNAVTTSTYSRSMSSQPAGAATATPTPTLDTGLTTVAPPNE 237
Query: 2025 TTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASES 2084
T + P++ T TT N +T S +P +
Sbjct: 238 TVVTGETALLCHWFQPSTRVPTLYL-HLLGTTGNLTEDVLLTEDSEI-----LRTPPPDP 291
Query: 2085 TTTSSPASES----TTTSSPAS 2102
+++ SP + T ++SP
Sbjct: 292 SSSRSPGAGDDFKQTNSTSPKR 313
Score = 33.8 bits (77), Expect = 1.2
Identities = 22/89 (24%), Positives = 35/89 (39%), Gaps = 2/89 (2%)
Query: 2029 SPESESTT-TSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTT 2087
P+ E TT PA+ T P+ + T + S S SS PA +T T +P ++ T
Sbjct: 172 VPDPEPTTARPEPAAGYVATPTPRYLNAVTTSTYSRS-MSSQPAGAATATPTPTLDTGLT 230
Query: 2088 SSPASESTTTSSPASESTTTSSPESESTT 2116
+ T + + P + T
Sbjct: 231 TVAPPNETVVTGETALLCHWFQPSTRVPT 259
>gnl|CDD|152349 pfam11914, DUF3432, Domain of unknown function (DUF3432). This
presumed domain is functionally uncharacterized. This
domain is found in eukaryotes. This domain is about 100
amino acids in length. This domain is found associated
with pfam00096. This domain has two conserved sequence
motifs: YPSPV and PSP.
Length = 100
Score = 33.6 bits (76), Expect = 0.24
Identities = 29/99 (29%), Positives = 43/99 (43%), Gaps = 8/99 (8%)
Query: 1977 TSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSES------TTTISP 2030
++P S ++ S+ S S +S P +T+ SPV T+ SSPVS T+ SP
Sbjct: 3 KAAPVSTASPNISIYSSSPVSSYPSPIATSYPSPV--PTSYSSPVSSCYPSPVHTSFPSP 60
Query: 2031 ESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSS 2069
+T S + T S TN+ +S T S
Sbjct: 61 SIATTYPSVSPTFQTQVATSFPSSVVTNSFSSPVTTPLS 99
Score = 33.6 bits (76), Expect = 0.25
Identities = 27/100 (27%), Positives = 45/100 (45%), Gaps = 6/100 (6%)
Query: 2017 TSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTT 2076
++PVS ++ IS S S +S P+ +T+ +P T+ + S SP T+
Sbjct: 3 KAAPVSTASPNISIYSSSPVSSYPSPIATSYPSP----VPTSYSSPVSSCYPSPV--HTS 56
Query: 2077 TSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTT 2116
SP+ +T S + T ++ S T+S S TT
Sbjct: 57 FPSPSIATTYPSVSPTFQTQVATSFPSSVVTNSFSSPVTT 96
Score = 31.7 bits (71), Expect = 1.2
Identities = 27/99 (27%), Positives = 45/99 (45%), Gaps = 3/99 (3%)
Query: 1947 TSSPESESTTTSSLVSESTTTSSPESESTTTSSP--ESESTTTSSLVSESTTTSSPESES 2004
++P S ++ S+ S S +S P +T+ SP S S+ SS TS P
Sbjct: 3 KAAPVSTASPNISIYSSSPVSSYPSPIATSYPSPVPTSYSSPVSSCYPSPVHTSFPSPSI 62
Query: 2005 TTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASE 2043
TT VS + T S ++ ++ S S+ ++P S+
Sbjct: 63 ATTYPSVSPTFQTQVATSFPSSVVT-NSFSSPVTTPLSD 100
Score = 30.9 bits (69), Expect = 1.8
Identities = 26/99 (26%), Positives = 45/99 (45%), Gaps = 3/99 (3%)
Query: 1917 TSSPESESTTTSSLVSESTTTSSPESESTTTSSP--ESESTTTSSLVSESTTTSSPESES 1974
++P S ++ S+ S S +S P +T+ SP S S+ SS TS P S S
Sbjct: 3 KAAPVSTASPNISIYSSSPVSSYPSPIATSYPSPVPTSYSSPVSSCYPSPVHTSFP-SPS 61
Query: 1975 TTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSE 2013
T+ P T + + + ++ S S+ +P+S+
Sbjct: 62 IATTYPSVSPTFQTQVATSFPSSVVTNSFSSPVTTPLSD 100
Score = 29.3 bits (65), Expect = 6.8
Identities = 27/100 (27%), Positives = 44/100 (44%), Gaps = 6/100 (6%)
Query: 1997 TSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTT 2056
++P S ++ IS S S +S P +T+ SP T+ SSP S P T+
Sbjct: 3 KAAPVSTASPNISIYSSSPVSSYPSPIATSYPSP--VPTSYSSPVSSCY----PSPVHTS 56
Query: 2057 TNNPASESITSSSPASESTTTSSPASESTTTSSPASESTT 2096
+P+ + S + T ++ S T+S +S TT
Sbjct: 57 FPSPSIATTYPSVSPTFQTQVATSFPSSVVTNSFSSPVTT 96
>gnl|CDD|219938 pfam08618, Opi1, Transcription factor Opi1. Opi1 is a leucine zipper
containing yeast transcription factor that negatively
regulates phospholipid biosynthesis. It represses the
expression of several UAS(INO) cis acting element
containing genes and its activity is mediated by
phosphorylations catalyzed by protein kinase A, protein
kinase C and casein kinase II.
Length = 387
Score = 35.8 bits (82), Expect = 0.25
Identities = 24/163 (14%), Positives = 39/163 (23%), Gaps = 5/163 (3%)
Query: 1906 TTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSEST 1965
T + S+ T STT P + S + S ++
Sbjct: 48 TVAAVGRATGVESNNRWALNTPVRTTPSSTT--MPSALSKRSLDAASIHMASNGAPPLIQ 105
Query: 1966 TTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSEST 2025
+S + T S+ S S ++ +PV S
Sbjct: 106 KSSEKVNGGIDAIRNSETEGTLYSVDVGSQGLRMRIQTQGYAPSG---NSNRGAPVRTSA 162
Query: 2026 TTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSS 2068
+ S SP S+ + T N S
Sbjct: 163 LSTSTLPGYDDHRSPRYSSSPVPQQPQTAVTANGGPRPPQPRS 205
Score = 33.5 bits (76), Expect = 1.5
Identities = 28/182 (15%), Positives = 53/182 (29%), Gaps = 10/182 (5%)
Query: 1942 SESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPE 2001
ST + + S+ T STT S S+ + ++ + ++ + P
Sbjct: 45 VASTVAAVG-RATGVESNNRWALNTPVRTTPSSTTMPSALSKRSLDAASIHMASNGAPPL 103
Query: 2002 SESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPA 2061
+ + S + SE+ T+ + ++ + +
Sbjct: 104 IQKS---SEKVNGGIDAIRNSETEGTLYSVDVGSQGLRMRIQTQGYAPSGNSNRGAPVRT 160
Query: 2062 SESITSSSPASESTTTSSPASESTTTSSPASESTT----TSSPASESTTTSSPESESTTT 2117
S TS+ P + SP S+ + T P S S T
Sbjct: 161 SALSTSTLPGYDDH--RSPRYSSSPVPQQPQTAVTANGGPRPPQPRSAWQSGNGRVLITA 218
Query: 2118 SS 2119
SS
Sbjct: 219 SS 220
Score = 31.6 bits (71), Expect = 5.4
Identities = 26/163 (15%), Positives = 51/163 (31%), Gaps = 5/163 (3%)
Query: 1883 VMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPES 1942
V ST+ ++ ++ T STT S S+ + ++ + ++ + P
Sbjct: 45 VASTVAAVGRATGVESNNRWALNTPVRTTPSSTTMPSALSKRSLDAASIHMASNGAPPLI 104
Query: 1943 ESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPES 2002
+ +S + T S + S S S +
Sbjct: 105 QK--SSEKVNGGIDAIRNSETEGTLYSVDVGSQGLRMRIQTQGYAPSGNSNRGAPVRTSA 162
Query: 2003 ESTTTISPVSESTT---TSSPVSESTTTISPESESTTTSSPAS 2042
ST+T+ + + +SSPV + T + P S
Sbjct: 163 LSTSTLPGYDDHRSPRYSSSPVPQQPQTAVTANGGPRPPQPRS 205
Score = 31.2 bits (70), Expect = 6.9
Identities = 18/155 (11%), Positives = 49/155 (31%), Gaps = 13/155 (8%)
Query: 1958 SSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTT 2017
S + ++T ++ + S+ T STT S S+ + + + ++
Sbjct: 40 RSAIPVASTVAAVGRATGVESNNRWALNTPVRTTPSSTTMPSALSKRSLDAASIHMASNG 99
Query: 2018 SSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTT 2077
+ P+ + ++ E + + + T + + +P+ S
Sbjct: 100 APPLIQKSS----EKVNGGIDAIRNSETEGTLYSVDVGSQGLRMRIQTQGYAPSGNSNRG 155
Query: 2078 SSPASESTTTSSPASESTTTSSPASESTTTSSPES 2112
+ S +T++ P + + S
Sbjct: 156 APV---------RTSALSTSTLPGYDDHRSPRYSS 181
Score = 30.8 bits (69), Expect = 9.4
Identities = 29/178 (16%), Positives = 54/178 (30%), Gaps = 10/178 (5%)
Query: 1958 SSLVSESTTTSSPE----SESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSE 2013
SS S+ + SP +E S+ ST + + S+ T +
Sbjct: 17 SSHAYTSSKSYSPRFRYGAEIVERSAIPVASTVAAVG-RATGVESNNRWALNTPVRTTPS 75
Query: 2014 STTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASE 2073
STT P + S ++ S ++ ++ + + T S
Sbjct: 76 STT--MPSALSKRSLDAASIHMASNGAPPLIQKSSEKVNGGIDAIRNSETEGTLYSVDVG 133
Query: 2074 STTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQG 2131
S S S + S +T++ P + SP S+ + +Q
Sbjct: 134 SQGLRMRIQTQGYAPSGNS-NRGAPVRTSALSTSTLPGYDD--HRSPRYSSSPVPQQP 188
>gnl|CDD|227498 COG5170, CDC55, Serine/threonine protein phosphatase 2A, regulatory
subunit [Signal transduction mechanisms].
Length = 460
Score = 35.8 bits (82), Expect = 0.28
Identities = 52/301 (17%), Positives = 104/301 (34%), Gaps = 64/301 (21%)
Query: 380 SGYDRQIFIWSVYGECENIGVMS------GHTGAVMDLKFSTDGCHIFTCST-DQTLAVW 432
S D+ I +W +Y + N+ V++ + ST + S D+ +A
Sbjct: 105 STNDKTIKLWKIYEK--NLKVVAENNLSDSFHSPMGGPLTSTKELLLPRLSEHDEIIAAK 162
Query: 433 DLEKGQRIKKMKGHSTFVNSCDPVRRGQLLIASGSDDCTVKVWDPRKKNQAVSMNN---- 488
H +NS + L+++ DD + +W+ + + ++ +
Sbjct: 163 -----PCRVYANAHPYHINSISFNSDKETLLSA--DDLRINLWNLEIIDGSFNIVDIKPH 215
Query: 489 -----TYQVTSVAFNDTAECVLTG--GIDNDIKMWDLRTNSV----------------VQ 525
T +TS F+ C + +IK+ DLR +++ V
Sbjct: 216 NMEELTEVITSAEFH-PEMCNVFMYSSSKGEIKLNDLRQSALCDNSKKLFELTIDGVDVD 274
Query: 526 KLRGHSDTVTGLSLSPDGSYILSNAMDNTVRIWDIRPYVPGERCVKVMSGHQHNFEK--- 582
+++ S +G YILS TV+IWD+ + +K + H ++
Sbjct: 275 FFEEIVSSISDFKFSDNGRYILSRDY-LTVKIWDVN---MAKNPIKTIPMHCDLMDELND 330
Query: 583 --------NLLRCAWSVSGLYVTAGSADKCVYIWDTTTRRIAYKLPGHNGSVNDVQFHPK 634
+ ++S +V +GS I+ T + G V ++
Sbjct: 331 VYENDAIFDKFEISFSGDDKHVLSGSYSNNFGIYPTDS-----SGFKDVGHVVNLADGSA 385
Query: 635 E 635
E
Sbjct: 386 E 386
>gnl|CDD|177555 PHA03193, PHA03193, tegument protein VP11/12; Provisional.
Length = 594
Score = 35.8 bits (82), Expect = 0.29
Identities = 18/128 (14%), Positives = 43/128 (33%), Gaps = 3/128 (2%)
Query: 2001 ESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNP 2060
+S + + ++ + I PE S A + + + + TT +
Sbjct: 440 DSPFQRKRAMPEDGGEIHEALANNGQAIFPECFSGDLPPIAQALLSAD--ELPNDTTAST 497
Query: 2061 ASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSP 2120
++E + + + + + A++ + + PA ++ ES ST +P
Sbjct: 498 SNEMKGDAECPAAQDAAAILPASFQIENGGAADGSGLAIPA-AMCDATAVESPSTVAETP 556
Query: 2121 ASESTTIE 2128
E
Sbjct: 557 PERLLAAE 564
Score = 34.3 bits (78), Expect = 0.95
Identities = 27/159 (16%), Positives = 53/159 (33%), Gaps = 12/159 (7%)
Query: 1981 ESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSP 2040
+S ++ + + + I P S + E + TT+S
Sbjct: 440 DSPFQRKRAMPEDGGEIHEALANNGQAIFPECFSGDLPPIAQALLSAD--ELPNDTTAST 497
Query: 2041 ASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSP 2100
++E PA++ + PAS A++ + + PA+ T+
Sbjct: 498 SNEMKGDAEC---------PAAQDAAAILPASFQIENGG-AADGSGLAIPAAMCDATAVE 547
Query: 2101 ASESTTTSSPESESTTTSSPASESTTIEEQGVSPHSEKL 2139
+ + + PE S P ++T + G S E L
Sbjct: 548 SPSTVAETPPERLLAAESGPRCKATAKHKGGSSKVEEIL 586
Score = 33.2 bits (75), Expect = 2.1
Identities = 15/135 (11%), Positives = 42/135 (31%), Gaps = 9/135 (6%)
Query: 1898 NSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTT 1957
+SP PE + + ++ E + P S+ E + TT
Sbjct: 440 DSPFQRKRAM--PEDGGEIHEALANNG---QAIFPECFSGDLPPIAQALLSADELPNDTT 494
Query: 1958 SSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTT 2017
+S +E + + + + + ++ + + P + + + +
Sbjct: 495 ASTSNEMKGDAECPAAQDAAAILPASFQIENGGAADGSGLAIPAA----MCDATAVESPS 550
Query: 2018 SSPVSESTTTISPES 2032
+ + ++ ES
Sbjct: 551 TVAETPPERLLAAES 565
Score = 32.0 bits (72), Expect = 5.0
Identities = 20/134 (14%), Positives = 46/134 (34%), Gaps = 7/134 (5%)
Query: 1897 TNSPESESTTTNNPESESTTTSSPESESTTTSSLV---SESTTTSSPESESTTTSSP--- 1950
+ E NN ++ S + + L + TT+S +E +
Sbjct: 451 EDGGEIHEALANNGQAIFPECFSGDLPPIAQALLSADELPNDTTASTSNEMKGDAECPAA 510
Query: 1951 -ESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTIS 2009
++ + +S E+ + + + ++ + + S V+E+ +ES
Sbjct: 511 QDAAAILPASFQIENGGAADGSGLAIPAAMCDATAVESPSTVAETPPERLLAAESGPRCK 570
Query: 2010 PVSESTTTSSPVSE 2023
++ SS V E
Sbjct: 571 ATAKHKGGSSKVEE 584
>gnl|CDD|227502 COG5175, MOT2, Transcriptional repressor [Transcription].
Length = 480
Score = 35.8 bits (82), Expect = 0.29
Identities = 26/180 (14%), Positives = 62/180 (34%), Gaps = 9/180 (5%)
Query: 1920 PESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSS 1979
PE +S T L + E + ++T T +P + +
Sbjct: 227 PEKDSLTKDELCNSQHKLHGSEVRNKNKKRIHRSTSTARYDTDLLNFTGTPSPAAM--EA 284
Query: 1980 PESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSS 2039
T+ + + +T + +PV+ S ++S + ++ +E+TTT++
Sbjct: 285 QFKHKTSRVFKAPDKILFPPLDFTNTQSATPVTLSNSSSINLPTLNDSLGHHTETTTTTN 344
Query: 2040 PASESTTTNNPKSES--TTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTT 2097
+ S + + K +S +++ ++ + S + S + E T
Sbjct: 345 TNATSHSHGSKKKQSLAAEEYKDPYDALGNA-----ARLHSLSNYQKRPISIKSDEETYK 399
Score = 31.6 bits (71), Expect = 6.3
Identities = 21/144 (14%), Positives = 46/144 (31%), Gaps = 17/144 (11%)
Query: 1877 NSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTS-------- 1928
N T + + + T+ + + +T +++P + S ++S
Sbjct: 272 NFTGTPSPAAMEAQFKHKTSRVFKAPDKILFPPLDFTNTQSATPVTLSNSSSINLPTLND 331
Query: 1929 SLVSESTTTSSPESESTTTSSP---------ESESTTTSSLVSESTTTSSPESESTTTSS 1979
SL + TT++ + +T+ S E +L + + S + S
Sbjct: 332 SLGHHTETTTTTNTNATSHSHGSKKKQSLAAEEYKDPYDALGNAARLHSLSNYQKRPISI 391
Query: 1980 PESESTTTSSLVSESTTTSSPESE 2003
E T T ++ E
Sbjct: 392 KSDEETYKKWDKKSDNTLANKLVE 415
Score = 31.2 bits (70), Expect = 8.2
Identities = 30/150 (20%), Positives = 55/150 (36%), Gaps = 12/150 (8%)
Query: 1980 PESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSS 2039
PE +S T L + SE + S+ + T + T T S
Sbjct: 227 PEKDSLTKDELCNSQHK--LHGSEVRNK---NKKRIHRSTSTARYDTDLLN---FTGTPS 278
Query: 2040 PASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSS 2099
PA+ + T+ A + I +T +++P + S ++S +
Sbjct: 279 PAAMEAQFKH----KTSRVFKAPDKILFPPLDFTNTQSATPVTLSNSSSINLPTLNDSLG 334
Query: 2100 PASESTTTSSPESESTTTSSPASESTTIEE 2129
+E+TTT++ + S + S +S EE
Sbjct: 335 HHTETTTTTNTNATSHSHGSKKKQSLAAEE 364
>gnl|CDD|227600 COG5275, COG5275, BRCT domain type II [General function prediction
only].
Length = 276
Score = 35.5 bits (81), Expect = 0.31
Identities = 20/90 (22%), Positives = 35/90 (38%), Gaps = 5/90 (5%)
Query: 1925 TTTSSLVSESTTTSSPE-----SESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSS 1979
T +ST + S E+TT+ T S+S + +
Sbjct: 16 TPDEYFEQQSTRSRSKPRIISNKETTTSKDVVHPVKTELDTTSDSKPVVHQTRATRKPAQ 75
Query: 1980 PESESTTTSSLVSESTTTSSPESESTTTIS 2009
P++E +TTS S +TT ++ S S+ +
Sbjct: 76 PKAEKSTTSKSKSHTTTATTHTSRSSKSKG 105
Score = 33.6 bits (76), Expect = 1.1
Identities = 23/94 (24%), Positives = 38/94 (40%)
Query: 1984 STTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASE 2043
S S+ E S S S I E+TT+ V T + S+S
Sbjct: 10 SDGVSTTPDEYFEQQSTRSRSKPRIISNKETTTSKDVVHPVKTELDTTSDSKPVVHQTRA 69
Query: 2044 STTTNNPKSESTTTNNPASESITSSSPASESTTT 2077
+ PK+E +TT+ S + T+++ S S+ +
Sbjct: 70 TRKPAQPKAEKSTTSKSKSHTTTATTHTSRSSKS 103
Score = 33.6 bits (76), Expect = 1.2
Identities = 25/139 (17%), Positives = 48/139 (34%), Gaps = 7/139 (5%)
Query: 1935 TTTSSPESESTTTSSPES-ESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSE 1993
T E +ST + S S ++ + E ++T+ S P T + ++
Sbjct: 16 TPDEYFEQQSTRSRSKPRIISNKETTTSKDVVHPVKTELDTTSDSKPVVHQTRATRKPAQ 75
Query: 1994 STTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSE 2053
P++E +TT S +TT ++ S S+ + S S +
Sbjct: 76 ------PKAEKSTTSKSKSHTTTATTHTSRSSKSKGLPRFSDEVSQALKNVPLIDVDSMG 129
Query: 2054 STTTNNPASESITSSSPAS 2072
+ T+ +P S
Sbjct: 130 VMAPGTFYERAATTQTPGS 148
Score = 33.2 bits (75), Expect = 1.7
Identities = 20/88 (22%), Positives = 36/88 (40%), Gaps = 5/88 (5%)
Query: 1895 TTTNSPESESTTTNNPE-----SESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSS 1949
T E +ST + + E+TT+ T S+S + +
Sbjct: 16 TPDEYFEQQSTRSRSKPRIISNKETTTSKDVVHPVKTELDTTSDSKPVVHQTRATRKPAQ 75
Query: 1950 PESESTTTSSLVSESTTTSSPESESTTT 1977
P++E +TTS S +TT ++ S S+ +
Sbjct: 76 PKAEKSTTSKSKSHTTTATTHTSRSSKS 103
Score = 32.8 bits (74), Expect = 2.2
Identities = 18/93 (19%), Positives = 39/93 (41%), Gaps = 4/93 (4%)
Query: 1909 NPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTT- 1967
+ S + + + + S S ++ + E ++T+ S V T
Sbjct: 11 DGVSTTPDEYFEQQSTRSRSKPRIISNKETTTSKDVVHPVKTELDTTSDSKPVVHQTRAT 70
Query: 1968 ---SSPESESTTTSSPESESTTTSSLVSESTTT 1997
+ P++E +TTS +S +TT ++ S S+ +
Sbjct: 71 RKPAQPKAEKSTTSKSKSHTTTATTHTSRSSKS 103
Score = 32.0 bits (72), Expect = 3.4
Identities = 22/94 (23%), Positives = 34/94 (36%)
Query: 1964 STTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSE 2023
S S+ E S S S E+TT+ T + S+S
Sbjct: 10 SDGVSTTPDEYFEQQSTRSRSKPRIISNKETTTSKDVVHPVKTELDTTSDSKPVVHQTRA 69
Query: 2024 STTTISPESESTTTSSPASESTTTNNPKSESTTT 2057
+ P++E +TTS S +TT S S+ +
Sbjct: 70 TRKPAQPKAEKSTTSKSKSHTTTATTHTSRSSKS 103
Score = 32.0 bits (72), Expect = 3.5
Identities = 21/85 (24%), Positives = 37/85 (43%), Gaps = 5/85 (5%)
Query: 1911 ESESTTTSSPE-----SESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSEST 1965
E +ST + S E+TT+ +V T S+S + + +E +
Sbjct: 22 EQQSTRSRSKPRIISNKETTTSKDVVHPVKTELDTTSDSKPVVHQTRATRKPAQPKAEKS 81
Query: 1966 TTSSPESESTTTSSPESESTTTSSL 1990
TTS +S +TT ++ S S+ + L
Sbjct: 82 TTSKSKSHTTTATTHTSRSSKSKGL 106
Score = 31.6 bits (71), Expect = 4.2
Identities = 21/93 (22%), Positives = 35/93 (37%), Gaps = 3/93 (3%)
Query: 1975 TTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESES 2034
+TT E +T S ++ + S + PV T+S + +
Sbjct: 14 STTPDEYFEQQSTRSRSKPRIISNKETTTSKDVVHPVKTELDTTSDSKP---VVHQTRAT 70
Query: 2035 TTTSSPASESTTTNNPKSESTTTNNPASESITS 2067
+ P +E +TT+ KS +TT S S S
Sbjct: 71 RKPAQPKAEKSTTSKSKSHTTTATTHTSRSSKS 103
Score = 31.6 bits (71), Expect = 4.5
Identities = 25/133 (18%), Positives = 44/133 (33%), Gaps = 5/133 (3%)
Query: 1955 TTTSSLVSESTTTSSPE-----SESTTTSSPESESTTTSSLVSESTTTSSPESESTTTIS 2009
T +ST + S E+TT+ T S+S +
Sbjct: 16 TPDEYFEQQSTRSRSKPRIISNKETTTSKDVVHPVKTELDTTSDSKPVVHQTRATRKPAQ 75
Query: 2010 PVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSS 2069
P +E +TTS S +TT + S S+ + S + + + + +
Sbjct: 76 PKAEKSTTSKSKSHTTTATTHTSRSSKSKGLPRFSDEVSQALKNVPLIDVDSMGVMAPGT 135
Query: 2070 PASESTTTSSPAS 2082
+ TT +P S
Sbjct: 136 FYERAATTQTPGS 148
Score = 31.2 bits (70), Expect = 6.3
Identities = 21/86 (24%), Positives = 35/86 (40%)
Query: 2012 SESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPA 2071
E S S S I E+TT+ T + S+S + + + P
Sbjct: 18 DEYFEQQSTRSRSKPRIISNKETTTSKDVVHPVKTELDTTSDSKPVVHQTRATRKPAQPK 77
Query: 2072 SESTTTSSPASESTTTSSPASESTTT 2097
+E +TTS S +TT ++ S S+ +
Sbjct: 78 AEKSTTSKSKSHTTTATTHTSRSSKS 103
Score = 30.9 bits (69), Expect = 8.8
Identities = 21/90 (23%), Positives = 37/90 (41%), Gaps = 7/90 (7%)
Query: 2035 TTTSSPASESTTTNNPKSESTTTNNPASESITSSSPAS---ESTTTSSPASESTTT---- 2087
+TT E +T + +N + S P ++T+ S P T
Sbjct: 14 STTPDEYFEQQSTRSRSKPRIISNKETTTSKDVVHPVKTELDTTSDSKPVVHQTRATRKP 73
Query: 2088 SSPASESTTTSSPASESTTTSSPESESTTT 2117
+ P +E +TTS S +TT ++ S S+ +
Sbjct: 74 AQPKAEKSTTSKSKSHTTTATTHTSRSSKS 103
>gnl|CDD|148682 pfam07222, PBP_sp32, Proacrosin binding protein sp32. This family
consists of several mammalian specific proacrosin binding
protein sp32 sequences. sp32 is a sperm specific protein
which is known to bind with with 55- and 53-kDa
proacrosins and the 49-kDa acrosin intermediate. The
exact function of sp32 is unclear, it is thought however
that the binding of sp32 to proacrosin may be involved in
packaging the acrosin zymogen into the acrosomal matrix.
Length = 243
Score = 35.0 bits (80), Expect = 0.33
Identities = 17/56 (30%), Positives = 25/56 (44%), Gaps = 1/56 (1%)
Query: 2090 PASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVS-PHSEKLSANED 2144
P S+ + SP + S E + TT + P +E TI E P E+L N +
Sbjct: 122 PCSQPVSILSPNTLKEAEPSAEVQPTTMTLPIAEHPTITENQSFQPWPERLHNNVE 177
>gnl|CDD|215187 PLN02328, PLN02328, lysine-specific histone demethylase 1 homolog.
Length = 808
Score = 35.7 bits (82), Expect = 0.34
Identities = 28/91 (30%), Positives = 44/91 (48%), Gaps = 4/91 (4%)
Query: 1895 TTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTS---SPE 1951
T T PE + N+ SE+++ + S S + S E+ +SPE++S T SP
Sbjct: 3 TETKEPEDPADNVNDVVSEASSPETDLSLSPSQSEQNIENDGQNSPETQSPLTELQPSPL 62
Query: 1952 SESTTTSSLVSEST-TTSSPESESTTTSSPE 1981
+TT + VS+S SS E + +S E
Sbjct: 63 PPNTTLDAPVSDSQGDESSSEQQPQNPNSTE 93
Score = 35.0 bits (80), Expect = 0.62
Identities = 27/99 (27%), Positives = 43/99 (43%), Gaps = 9/99 (9%)
Query: 1934 STTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSE 1993
T T PE + + SE+++ + +S S + S E+ +SPE++S T
Sbjct: 2 ETETKEPEDPADNVNDVVSEASSPETDLSLSPSQSEQNIENDGQNSPETQSPLTE----- 56
Query: 1994 STTTSSPESESTTTISPVSESTTTSSPVSESTTTISPES 2032
SP +TT +PVS+S S S +P S
Sbjct: 57 --LQPSPLPPNTTLDAPVSDSQGDES--SSEQQPQNPNS 91
Score = 34.6 bits (79), Expect = 0.89
Identities = 26/103 (25%), Positives = 44/103 (42%), Gaps = 13/103 (12%)
Query: 2034 STTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASE 2093
T T P + N+ SE+++ E+ S SP+ + S T SP +E
Sbjct: 2 ETETKEPEDPADNVNDVVSEASSP-----ETDLSLSPSQSEQNIENDGQNSPETQSPLTE 56
Query: 2094 STTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVSPHS 2136
SP +TT +P S+S + ++ E+Q +P+S
Sbjct: 57 --LQPSPLPPNTTLDAPVSDSQ------GDESSSEQQPQNPNS 91
Score = 33.0 bits (75), Expect = 2.4
Identities = 27/107 (25%), Positives = 49/107 (45%), Gaps = 12/107 (11%)
Query: 1904 STTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSE 1963
T T PE + + SE+++ + +S S + S E+ +SPE++S T
Sbjct: 2 ETETKEPEDPADNVNDVVSEASSPETDLSLSPSQSEQNIENDGQNSPETQSPLTE----- 56
Query: 1964 STTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISP 2010
SP +TT +P S+ S ES++ P++ ++T +P
Sbjct: 57 --LQPSPLPPNTTLDAPVSD-----SQGDESSSEQQPQNPNSTEPAP 96
Score = 31.5 bits (71), Expect = 7.4
Identities = 26/117 (22%), Positives = 46/117 (39%), Gaps = 22/117 (18%)
Query: 1994 STTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSE 2053
T T PE + VSE+++ E+ ++SP SE N+ ++
Sbjct: 2 ETETKEPEDPADNVNDVVSEASSP-----ETDLSLSPSQ---------SEQNIENDGQNS 47
Query: 2054 STTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSP 2110
T + SP +TT +P S+S ES++ P + ++T +P
Sbjct: 48 PETQSPLTEL---QPSPLPPNTTLDAPVSDSQ-----GDESSSEQQPQNPNSTEPAP 96
>gnl|CDD|233186 TIGR00920, 2A060605, 3-hydroxy-3-methylglutaryl-coenzyme A reductase.
[Transport and binding proteins, Carbohydrates, organic
alcohols, and acids].
Length = 889
Score = 36.0 bits (83), Expect = 0.35
Identities = 19/120 (15%), Positives = 38/120 (31%), Gaps = 14/120 (11%)
Query: 1867 YSEIIFTTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTT 1926
S+ IF + +ESTV + + +++ +T+ + E + T
Sbjct: 330 ASKYIFFSQGETESTVSLKNGDPVVNPV-----------STDKKQLEYCCRRELTVSADT 378
Query: 1927 TSSLVSESTTTSSPESESTTTSSP---ESESTTTSSLVSESTTTSSPESESTTTSSPESE 1983
+ E S P S+S +S + + + + PE E
Sbjct: 379 IVVSILEEALASKFVFFEVIKPLPTETGSDSWVEASFPVGHKYSGTEQPSCSAPKEPEEE 438
>gnl|CDD|219971 pfam08690, GET2, GET complex subunit GET2. This family corresponds
to the GET complex subunit GET2. The GET complex is
involved in the retrieval of ER resident proteins from
the Golgi.
Length = 298
Score = 35.1 bits (81), Expect = 0.36
Identities = 22/118 (18%), Positives = 42/118 (35%), Gaps = 10/118 (8%)
Query: 1916 TTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESEST 1975
T + + S L ++ + + + S+PE + + E+ ESES
Sbjct: 32 TGQGSSVKLVSKSVLDAKPEDNTGSTTSAHDQSTPEIQD------ILEAIDPPKDESESP 85
Query: 1976 TTS-SPESE--STTTSSLVSESTTTSSPESESTTTI-SPVSESTTTSSPVSESTTTIS 2029
+ PE E + + + P +ST + S + + P SES +
Sbjct: 86 AENIDPEVEMFQQLAKMQQQGNGSDNPPADDSTADLFSMLLQMGGGDGPDSESPASAQ 143
>gnl|CDD|234428 TIGR03979, His_Ser_Rich, His-Xaa-Ser repeat protein HxsA. Members of
this protein share two defining regions. One is a
histidine/serine-rich cluster, typically
H-R-S-H-S-S-H-R-S-H-S-S-H. Members are found always in
the context of a pair of radical SAM proteins, HxsB and
HxsC, and a fourth protein HxsD. The system is predicted
to perform peptide modifications, likely in the
His-Xaa-Ser region, to produce some uncharacterized
natural product.
Length = 186
Score = 34.5 bits (79), Expect = 0.37
Identities = 18/69 (26%), Positives = 32/69 (46%), Gaps = 6/69 (8%)
Query: 2068 SSPASESTTTSSPASESTTTSSPASESTTTS----SPASESTTTSSPESESTTT--SSPA 2121
SS S S+ +S + + S P+ +++T S SP+ + SS +S +TT +
Sbjct: 57 SSHRSHSSHSSHYSGAGGSYSVPSGDTSTYSYPVPSPSYSPSPGSSIQSLPSTTGVRPQS 116
Query: 2122 SESTTIEEQ 2130
S E+
Sbjct: 117 SAENANSEK 125
Score = 31.0 bits (70), Expect = 4.8
Identities = 16/71 (22%), Positives = 30/71 (42%), Gaps = 4/71 (5%)
Query: 2029 SPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTS 2088
S S S+ +S + + + P +++T + P S SP+ S+ S P+ +T
Sbjct: 58 SHRSHSSHSSHYSGAGGSYSVPSGDTSTYSYPVPSP--SYSPSPGSSIQSLPS--TTGVR 113
Query: 2089 SPASESTTTSS 2099
+S S
Sbjct: 114 PQSSAENANSE 124
Score = 30.6 bits (69), Expect = 7.0
Identities = 15/59 (25%), Positives = 23/59 (38%), Gaps = 3/59 (5%)
Query: 2061 ASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSS 2119
S + S S S T+T S + + SP+ S+ S P+ +T S S
Sbjct: 69 YSGAGGSYSVPSGDTSTYS-YPVPSPSYSPSPGSSIQSLPS--TTGVRPQSSAENANSE 124
>gnl|CDD|221509 pfam12287, Caprin-1_C, Cytoplasmic
activation/proliferation-associated protein-1 C term.
This family of proteins is found in eukaryotes. Proteins
in this family are typically between 343 and 708 amino
acids in length. This family is the C terminal region of
caprin-1. Caprin-1 is a protein involved in regulating
cellular proliferation. In mutated phenotypes, the G1
phase of the cell cycle is greatly lengthened, impairing
normal proliferation. The C terminal region of caprin-1
contains RGG motifs which are characteristic of RNA
binding domains. It is possible that caprin-1 functions
through an RNA binding mechanism.
Length = 319
Score = 35.3 bits (81), Expect = 0.37
Identities = 20/86 (23%), Positives = 38/86 (44%), Gaps = 1/86 (1%)
Query: 2060 PASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSS 2119
P + SP SE T+S P + + T+ P + T P S + +S ++ ++++
Sbjct: 61 PEPTQVPMVSPTSEGYTSSPPLYQPSHTAEPRPQ-TDPIDPIQASMSLNSEQTPTSSSLP 119
Query: 2120 PASESTTIEEQGVSPHSEKLSANEDP 2145
AS+ + HS ++ N P
Sbjct: 120 AASQPQVFQTGSKPLHSSGINVNAAP 145
>gnl|CDD|240410 PTZ00418, PTZ00418, Poly(A) polymerase; Provisional.
Length = 593
Score = 35.5 bits (82), Expect = 0.40
Identities = 22/74 (29%), Positives = 34/74 (45%), Gaps = 5/74 (6%)
Query: 1885 STLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESES 1944
S L + + T ++++ T N S +T+ S S ST+ S+ + SSP S
Sbjct: 525 SQLPAFVLSQTPEEPVKTKANTKTNTSSATTSGQSGSSGSTSNSN-----SNESSPTMSS 579
Query: 1945 TTTSSPESESTTTS 1958
T + S STT S
Sbjct: 580 TELLNVSSTSTTGS 593
Score = 32.8 bits (75), Expect = 2.8
Identities = 18/53 (33%), Positives = 26/53 (49%), Gaps = 4/53 (7%)
Query: 2069 SPASESTTTSSPASESTTTSSPASESTTTSSPASE---STTTSSPESESTTTS 2118
+ A+ T TSS + + SS S S + S+ +S ST + S STT S
Sbjct: 542 TKANTKTNTSSATTSGQSGSSG-STSNSNSNESSPTMSSTELLNVSSTSTTGS 593
Score = 32.5 bits (74), Expect = 3.1
Identities = 15/69 (21%), Positives = 30/69 (43%), Gaps = 8/69 (11%)
Query: 2020 PVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSS 2079
+ + + T TSS + + ++ S S + +N +S +++S T +
Sbjct: 533 SQTPEEPVKTKANTKTNTSSATTSGQSGSSG-STSNSNSNESSPTMSS-------TELLN 584
Query: 2080 PASESTTTS 2088
+S STT S
Sbjct: 585 VSSTSTTGS 593
Score = 31.7 bits (72), Expect = 6.8
Identities = 16/62 (25%), Positives = 26/62 (41%), Gaps = 4/62 (6%)
Query: 2000 PESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASE---STTTNNPKSESTT 2056
++ + + T TSS + + S S S + S+ +S ST N S STT
Sbjct: 533 SQTPEEPVKTKANTKTNTSSATTSGQSGSSG-STSNSNSNESSPTMSSTELLNVSSTSTT 591
Query: 2057 TN 2058
+
Sbjct: 592 GS 593
>gnl|CDD|219833 pfam08418, Pol_alpha_B_N, DNA polymerase alpha subunit B N-terminal.
This is the eukaryotic DNA polymerase alpha subunit B
N-terminal domain which is involved in complex formation.
Also see pfam04058.
Length = 239
Score = 34.7 bits (80), Expect = 0.40
Identities = 28/134 (20%), Positives = 46/134 (34%), Gaps = 12/134 (8%)
Query: 2001 ESESTTTISPVSESTTTSSPVSESTTTISPESESTTTS---SPASESTTTNNPKSESTTT 2057
E T S + P +E + S TT S +T PK + + +
Sbjct: 63 EKRVRTPASIKTSKRLIEVPEAEESLLDS----YTTPSDKGGMLRILSTPELPKRKRSFS 118
Query: 2058 NNPASESITSSSPASES----TTTSSPASESTTTSSPASESTTTSSPA-SESTTTSSPES 2112
+ SPAS S +T SP S ++ S E T +P ++ P+S
Sbjct: 119 ASSLESPSLFFSPASFSPSAAPSTPSPNSAKFSSRSNPGEVVETLNPHLGQTPEGGGPDS 178
Query: 2113 ESTTTSSPASESTT 2126
+ S ++
Sbjct: 179 DPKVKLSANFDAKK 192
>gnl|CDD|220633 pfam10214, Rrn6, RNA polymerase I-specific transcription-initiation
factor. RNA polymerase I-specific
transcription-initiation factor Rrn6 and Rrn7 represent
components of a multisubunit transcription factor
essential for the initiation of rDNA transcription by Pol
I. These proteins are found in fungi.
Length = 753
Score = 35.5 bits (82), Expect = 0.44
Identities = 37/258 (14%), Positives = 79/258 (30%), Gaps = 38/258 (14%)
Query: 1761 SRDINSVSPNVTSKILTTDNYSEIIFTTNNNSESTVVM-----STLNSLLSENEKLFKPH 1815
+ P++ + +I +++ +S + + LN L+ + +
Sbjct: 505 VLSLLDELPSLPDHDQNITEFDSLISQLSSHYQSEDLTFSSLINFLNQLIHVLSEESRTS 564
Query: 1816 AKTPGAEFLIQCQYCDFDSSMNLLSVSPYITNNLLISM-LAATAVAISVIDNYSEIIFTT 1874
+ L+QC + +L + + + L+ V+ ID+ E
Sbjct: 565 LDDI-YDKLLQCWESNLPH--DLPGTKEKLIRKIAAEIGLSLIKVSKKEIDSRLEEFLDE 621
Query: 1875 NNNSESTVVMSTLNSLLS--ENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVS 1932
N NS S V ++L + S +S T T SS + ++ S
Sbjct: 622 NTNSLSEEV----KNILDHWDPGDDPSDVDDSQATQPDV----TDSSQLESQSQIPTIRS 673
Query: 1933 ESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVS 1992
+ + + S S S+P +S P + +++ S
Sbjct: 674 SQQVSQTRKGGS-------------------SVVPSAPAPRLAQSSQPPTSQSSSDLPPS 714
Query: 1993 ESTTTSSPESESTTTISP 2010
S S + +
Sbjct: 715 SSQAFSLSDLPMQSQSES 732
Score = 34.4 bits (79), Expect = 0.88
Identities = 23/112 (20%), Positives = 42/112 (37%), Gaps = 3/112 (2%)
Query: 2022 SESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPA 2081
SE I + S +S T ++S+ S+S + +S+ + +
Sbjct: 627 SEEVKNILDHWDPGDDPSDVDDSQATQPDVTDSS---QLESQSQIPTIRSSQQVSQTRKG 683
Query: 2082 SESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVS 2133
S S+PA +S P + +++ P S S S + E G+S
Sbjct: 684 GSSVVPSAPAPRLAQSSQPPTSQSSSDLPPSSSQAFSLSDLPMQSQSESGLS 735
Score = 32.0 bits (73), Expect = 4.3
Identities = 22/115 (19%), Positives = 42/115 (36%), Gaps = 4/115 (3%)
Query: 1944 STTTSSPESESTTTSSLVSESTTTSSPESESTTTS--SPESESTTTSSLVSESTTTSSPE 2001
S S + + + E+T + S E ++ + S S ++ T S +
Sbjct: 602 SLIKVSKKEIDSRLEEFLDENTNSLSEEVKNILDHWDPGDDPSDVDDSQATQPDVTDSSQ 661
Query: 2002 SESTTTISPVSESTTTSSP--VSESTTTISPESESTTTSSPASESTTTNNPKSES 2054
ES + I + S S S +P +S P + ++++ P S S
Sbjct: 662 LESQSQIPTIRSSQQVSQTRKGGSSVVPSAPAPRLAQSSQPPTSQSSSDLPPSSS 716
Score = 32.0 bits (73), Expect = 5.2
Identities = 15/74 (20%), Positives = 29/74 (39%)
Query: 2063 ESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPAS 2122
+ SS S+S + +S+ + + S S+PA +S P + +++ P S
Sbjct: 655 DVTDSSQLESQSQIPTIRSSQQVSQTRKGGSSVVPSAPAPRLAQSSQPPTSQSSSDLPPS 714
Query: 2123 ESTTIEEQGVSPHS 2136
S + S
Sbjct: 715 SSQAFSLSDLPMQS 728
>gnl|CDD|221067 pfam11301, DUF3103, Protein of unknown function (DUF3103). This
family of proteins with unknown function appear to be
restricted to Proteobacteria.
Length = 344
Score = 35.0 bits (81), Expect = 0.44
Identities = 15/67 (22%), Positives = 19/67 (28%), Gaps = 6/67 (8%)
Query: 1456 PALEASNDIAELLMECLATMKEYEVECKEMTAMGKPPPSLPPIMKALNVTSPRDYLMTVL 1515
P D + L LA M+ E K P + TVL
Sbjct: 125 PVFVVDLDSKKELKAGLAVMRA------EFAQAAKQMQLQPRSAASSAAAETAPISTTVL 178
Query: 1516 SRIRSTD 1522
+IR D
Sbjct: 179 KKIRLKD 185
>gnl|CDD|236782 PRK10871, nlpD, lipoprotein NlpD; Provisional.
Length = 319
Score = 34.8 bits (80), Expect = 0.51
Identities = 18/74 (24%), Positives = 33/74 (44%), Gaps = 3/74 (4%)
Query: 2032 SESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPA---SESTTTS 2088
+E PA ST + T + + +S P ++ T+ A + + +T+
Sbjct: 125 AEQGVVIKPAQNSTVAVASQPTITYSESSGEQSANKMLPNNKPAATTVTAPVTAPTASTT 184
Query: 2089 SPASESTTTSSPAS 2102
P + ST+TS+P S
Sbjct: 185 EPTASSTSTSTPIS 198
Score = 34.0 bits (78), Expect = 1.0
Identities = 22/76 (28%), Positives = 37/76 (48%), Gaps = 7/76 (9%)
Query: 2012 SESTTTSSPVSESTTTISPESESTTTSSPASESTTTN----NPKSESTTTNNP-ASESIT 2066
+E P ST ++ S+ T T S +S + N N K +TT P + + +
Sbjct: 125 AEQGVVIKPAQNSTVAVA--SQPTITYSESSGEQSANKMLPNNKPAATTVTAPVTAPTAS 182
Query: 2067 SSSPASESTTTSSPAS 2082
++ P + ST+TS+P S
Sbjct: 183 TTEPTASSTSTSTPIS 198
Score = 32.9 bits (75), Expect = 2.2
Identities = 24/101 (23%), Positives = 39/101 (38%), Gaps = 9/101 (8%)
Query: 2022 SESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPA 2081
+ S T I+ + T A+E P ST S S +S P
Sbjct: 107 NASGTPITGGNAITQAD--AAEQGVVIKPAQNSTVAVASQPTITYSESSGEQSANKMLPN 164
Query: 2082 SESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPAS 2122
++ T T T+ + + +T+ P + ST+TS+P S
Sbjct: 165 NKPAAT-------TVTAPVTAPTASTTEPTASSTSTSTPIS 198
Score = 31.7 bits (72), Expect = 4.3
Identities = 24/81 (29%), Positives = 41/81 (50%), Gaps = 7/81 (8%)
Query: 1967 TSSPESESTTTSSPESESTTTSSLVSESTTTSSPES--ESTTTISPVSE--STTTSSPVS 2022
T + +E P ST + S+ T T S S +S + P ++ +TT ++PV+
Sbjct: 120 TQADAAEQGVVIKPAQNSTVA--VASQPTITYSESSGEQSANKMLPNNKPAATTVTAPVT 177
Query: 2023 ESTTTIS-PESESTTTSSPAS 2042
T + + P + ST+TS+P S
Sbjct: 178 APTASTTEPTASSTSTSTPIS 198
Score = 31.3 bits (71), Expect = 6.7
Identities = 19/92 (20%), Positives = 39/92 (42%), Gaps = 1/92 (1%)
Query: 1932 SESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLV 1991
+ S T + + T + E + S S P + +S +S + +
Sbjct: 107 NASGTPITGGNAITQADAAEQGVVIKPAQNSTVAVASQPTITYSESSGEQSANKMLPNNK 166
Query: 1992 SESTTTSSPESESTTTISPVSESTT-TSSPVS 2022
+TT ++P + T + + + S+T TS+P+S
Sbjct: 167 PAATTVTAPVTAPTASTTEPTASSTSTSTPIS 198
Score = 31.0 bits (70), Expect = 8.0
Identities = 19/86 (22%), Positives = 37/86 (43%), Gaps = 7/86 (8%)
Query: 1927 TSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTT 1986
T + +E P ST + + T + S +S P ++ T T
Sbjct: 120 TQADAAEQGVVIKPAQNSTVAVASQPTITYSESSGEQSANKMLPNNKPAAT-------TV 172
Query: 1987 TSSLVSESTTTSSPESESTTTISPVS 2012
T+ + + + +T+ P + ST+T +P+S
Sbjct: 173 TAPVTAPTASTTEPTASSTSTSTPIS 198
>gnl|CDD|147982 pfam06112, Herpes_capsid, Gammaherpesvirus capsid protein. This
family consists of several Gammaherpesvirus capsid
proteins. The exact function of this family is unknown.
Length = 148
Score = 33.7 bits (77), Expect = 0.51
Identities = 16/65 (24%), Positives = 28/65 (43%), Gaps = 2/65 (3%)
Query: 1920 PESESTTTSSLVSESTTTSSPESESTT--TSSPESESTTTSSLVSESTTTSSPESESTTT 1977
P++ S+ S+L + S++ S + SS + S+ SL S S+ + S T
Sbjct: 82 PQTSSSIGSALSASSSSASGVPGGANQLSGSSGSALSSGPGSLSSSSSLSGSGAGAGDTA 141
Query: 1978 SSPES 1982
S
Sbjct: 142 PSSSK 146
Score = 32.9 bits (75), Expect = 0.74
Identities = 17/60 (28%), Positives = 28/60 (46%), Gaps = 2/60 (3%)
Query: 2060 PASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSS 2119
S SI S+ AS S+ + P + + S S S +S P S S+++S S + +
Sbjct: 83 QTSSSIGSALSASSSSASGVPGGANQLSGS--SGSALSSGPGSLSSSSSLSGSGAGAGDT 140
Score = 31.7 bits (72), Expect = 2.3
Identities = 16/63 (25%), Positives = 28/63 (44%), Gaps = 2/63 (3%)
Query: 1950 PESESTTTSSLVSESTTTSSPESESTT--TSSPESESTTTSSLVSESTTTSSPESESTTT 2007
P++ S+ S+L + S++ S + SS + S+ SL S S+ + S T
Sbjct: 82 PQTSSSIGSALSASSSSASGVPGGANQLSGSSGSALSSGPGSLSSSSSLSGSGAGAGDTA 141
Query: 2008 ISP 2010
S
Sbjct: 142 PSS 144
Score = 31.4 bits (71), Expect = 3.1
Identities = 15/65 (23%), Positives = 31/65 (47%), Gaps = 1/65 (1%)
Query: 2050 PKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSS 2109
P++ S+ + ++ S ++S + S +S S +S P S S+++S S + +
Sbjct: 82 PQTSSSIGSALSASSSSASGVPGGANQLSG-SSGSALSSGPGSLSSSSSLSGSGAGAGDT 140
Query: 2110 PESES 2114
S S
Sbjct: 141 APSSS 145
Score = 30.6 bits (69), Expect = 4.6
Identities = 14/64 (21%), Positives = 32/64 (50%), Gaps = 1/64 (1%)
Query: 2063 ESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPAS 2122
+++ + P + S+ S+ ++ S++ S + S +S S +S P S S+++S S
Sbjct: 75 QALRGAGPQTSSSIGSALSASSSSASGVPGGANQLSG-SSGSALSSGPGSLSSSSSLSGS 133
Query: 2123 ESTT 2126
+
Sbjct: 134 GAGA 137
>gnl|CDD|222819 PHA01077, PHA01077, putative lower collar protein.
Length = 251
Score = 34.7 bits (79), Expect = 0.51
Identities = 18/102 (17%), Positives = 35/102 (34%)
Query: 1978 SSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTT 2037
SS E E S +E ++ ++ T+ + S +T + + P+SE
Sbjct: 114 SSSEVEKYLQSQGFTEHNEDTTNNTDETSNQNATSLDNSTGMTANRNAYVSLPQSEVNID 173
Query: 2038 SSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSS 2079
+ NN T N ++ES ++ +
Sbjct: 174 VDNTTLRFADNNTIDNGKTVNKSSNESNQNAKRNQNQKGNAK 215
>gnl|CDD|112562 pfam03753, HHV6-IE, Human herpesvirus 6 immediate early protein. The
proteins in this family are poorly characterized, but an
investigation has indicated that the immediate early
protein is required the down-regulation of MHC class I
expression in dendritic cells. Human herpesvirus 6
immediate early protein is also referred to as U90.
Length = 993
Score = 35.1 bits (80), Expect = 0.58
Identities = 43/261 (16%), Positives = 92/261 (35%), Gaps = 7/261 (2%)
Query: 1862 SVIDNYSEIIFTTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPE 1921
S++D ++ + T + + +L +L T S T+N + + + +
Sbjct: 477 SLLDTQADSVVTQTVSKNNEAFNMSLYNLKRNEETYQDKNSRDKKTDNQAGPTFSRTDKK 536
Query: 1922 SESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTT--TSS 1979
+ S + + + E ++ T + L+SE + S + S
Sbjct: 537 TNSPAGILMERSIFNKDTQDKEQYFELFTMTDGTLDNPLISEMLSFGYETDHSAPYESES 596
Query: 1980 PESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSS 2039
++ + V T++ +T +P S+S + V+ S T + + +++
Sbjct: 597 DNNDEIDYIASVDSGNRTNNIHMNNTNENTPFSKSGKSPPEVTPSKTFYKRDKKKDISTN 656
Query: 2040 PASESTTTNNPKSESTTTNNPASESI-TSSSPASESTTTSSPASESTTTSSPA-SESTTT 2097
+ T K ++ S+ I + S P + S SE +S
Sbjct: 657 RKVKKRTA---KRKTVGYKTDKSKKIKSDSLPTDTNVIVISSESEDEEDGFNIIKKSQLK 713
Query: 2098 SSPASESTTTSSPESESTTTS 2118
SE + SS ES+ T+
Sbjct: 714 KKIKSELKSESSSESDDCTSE 734
>gnl|CDD|240388 PTZ00372, PTZ00372, endonuclease 4-like protein; Provisional.
Length = 413
Score = 34.7 bits (80), Expect = 0.59
Identities = 16/95 (16%), Positives = 28/95 (29%), Gaps = 2/95 (2%)
Query: 1997 TSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTT-TNNPKSEST 2055
+ + + IS + + S +T E++ TTS+ + N K +S
Sbjct: 13 SGTTQKSKLQPISYIYSNVLVLS-KEILSTFSEEENKVATTSTKKDKKEDKNNESKKKSE 71
Query: 2056 TTNNPASESITSSSPASESTTTSSPASESTTTSSP 2090
E S +P T P
Sbjct: 72 KKKKKKKEKKEPKSEGETKLGFKTPKKSKKTKKKP 106
Score = 34.3 bits (79), Expect = 0.82
Identities = 18/95 (18%), Positives = 26/95 (27%), Gaps = 3/95 (3%)
Query: 2018 SSPVSESTTT-ISPESESTTTSSPASESTTT-NNPKSESTTTNNPASESITSSSPASEST 2075
S +S IS + S ST + K +T+T E + S +S
Sbjct: 13 SGTTQKSKLQPISYIYSNVLVLSKEILSTFSEEENKVATTSTKKDKKEDKNNESK-KKSE 71
Query: 2076 TTSSPASESTTTSSPASESTTTSSPASESTTTSSP 2110
E S +P T P
Sbjct: 72 KKKKKKKEKKEPKSEGETKLGFKTPKKSKKTKKKP 106
Score = 33.5 bits (77), Expect = 1.3
Identities = 16/96 (16%), Positives = 26/96 (27%), Gaps = 4/96 (4%)
Query: 1968 SSPESESTTTSSPESESTTTSSLVSESTTTSS---PESESTTTISPVSESTTTSSPVSES 2024
S ++ + + L E +T S + +T+T E S +S
Sbjct: 12 FSGTTQKSKLQPISYIYSNVLVLSKEILSTFSEEENKVATTSTKKDKKEDKNNESK-KKS 70
Query: 2025 TTTISPESESTTTSSPASESTTTNNPKSESTTTNNP 2060
+ E S PK T P
Sbjct: 71 EKKKKKKKEKKEPKSEGETKLGFKTPKKSKKTKKKP 106
>gnl|CDD|235895 PRK06945, flgK, flagellar hook-associated protein FlgK; Validated.
Length = 651
Score = 35.0 bits (81), Expect = 0.61
Identities = 30/137 (21%), Positives = 53/137 (38%), Gaps = 15/137 (10%)
Query: 1974 STTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESE 2033
S T+ + + + S +T + T IS S S+ P+ TTT++ ++
Sbjct: 425 SVATTDGSAIAAASPVRASAGSTNTG-----TGAISQGSVSSGY--PLPSGTTTLTYDAA 477
Query: 2034 STTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTT--------TSSPASEST 2085
+ T S + +T T ++ T PA+ + +S A S + +PA T
Sbjct: 478 TGTLSGFPAGTTVTVAGTPPTSVTITPATTPVPYTSGAGISLVFNGVSVTLSGTPADGDT 537
Query: 2086 TTSSPASESTTTSSPAS 2102
T P + T A
Sbjct: 538 FTIGPNTGGTNDGRNAL 554
>gnl|CDD|178677 PLN03131, PLN03131, hypothetical protein; Provisional.
Length = 705
Score = 34.8 bits (79), Expect = 0.73
Identities = 28/184 (15%), Positives = 58/184 (31%), Gaps = 10/184 (5%)
Query: 1914 STTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESE 1973
T+ +P + L + +++ SS + T S SPE
Sbjct: 368 PATSPAPPVDLFEIPPLDPAPAINAYQPPQTSLPSSIDLFGGITQQQSINSLDEKSPEL- 426
Query: 1974 STTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTIS-PES 2032
S P++E T + +T E+ + +I P + V + P
Sbjct: 427 ----SIPKNEGWATFDGIQPIASTPGNENLTPFSIGPSMAGSANFDQVPSLDKGMQWPPF 482
Query: 2033 ESTTTSSPASESTTTNNPKSESTTTNNPASESITS----SSPASESTTTSSPASESTTTS 2088
++++ AS +N ++++ + S A +SE T +
Sbjct: 483 QNSSDEESASGPAPWLGDLHNVEAPDNTSAQNWNAFEFDDSVAGIPLEGIKQSSEPQTAA 542
Query: 2089 SPAS 2092
+
Sbjct: 543 NMPP 546
>gnl|CDD|180536 PRK06347, PRK06347, autolysin; Reviewed.
Length = 592
Score = 34.7 bits (79), Expect = 0.74
Identities = 55/276 (19%), Positives = 104/276 (37%), Gaps = 55/276 (19%)
Query: 1874 TNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSS---- 1929
T + T + LN L+S N + +S T S ST SS S + TS+
Sbjct: 277 TGTYATDTAYATKLNDLIS---RYNLTQYDSGKTTGGNSGSTGNSSNSSNTGNTSNAKIY 333
Query: 1930 -LVSESTTTSSPESESTTTSSPESESTTTSSL--------VSESTTTSSPESESTTTSSP 1980
+V + + T ++ ++ + S VS +TTS + +T +
Sbjct: 334 TVVKGDSLWRIANNHKVTVANLKAWNNLKSDFIYPGQKLKVSAGSTTSDTNTSKPSTGTS 393
Query: 1981 ESESTTTSSLVSESTTTSSPES------ESTTTIS-------------------PVSEST 2015
S+ +T +S ++ T +S + TI+ VS +
Sbjct: 394 TSKPSTGTSTNAKVYTVVKGDSLWRIANNNKVTIANLKSWNNLKSDFIYPGQKLKVSAGS 453
Query: 2016 TTSSPVSESTTTISPESESTTTSSPASEST----------TTNNPKSEST--TTNNPASE 2063
T+++ S+ +T + ST T++ A T NN + + + NN S+
Sbjct: 454 TSNTNTSKPSTNTNTSKPSTNTNTNAKVYTVAKGDSLWRIANNNKVTIANLKSWNNLKSD 513
Query: 2064 SITSSS--PASESTTTSSPASESTTTSSPASESTTT 2097
I S +TT++ + +T+ P++ + T
Sbjct: 514 FIYPGQKLKVSAGSTTNNTNTAKPSTNKPSNSTVKT 549
Score = 34.3 bits (78), Expect = 0.90
Identities = 24/118 (20%), Positives = 48/118 (40%), Gaps = 6/118 (5%)
Query: 1972 SESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPE 2031
S T + E+ + ++ E+ T++PE+ + T+ P T + E + +
Sbjct: 51 SADETAPADEASKSAEANTTKEAPATATPENTTEPTVEPKQTETKEQTKTPEEKQPAAKQ 110
Query: 2032 SESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSS 2089
E +E T +NP +T+++ PA+ ++ S T S +SS
Sbjct: 111 VEK-----APAEPATVSNP-DNATSSSTPATYNLLQKSALRSGATVQSFIQTIQASSS 162
Score = 33.9 bits (77), Expect = 1.4
Identities = 31/145 (21%), Positives = 69/145 (47%), Gaps = 12/145 (8%)
Query: 1986 TTSSLVSESTTTSSPESE---STTTISPVSESTTTSSPVS--ESTTTISPESESTTTSSP 2040
T + + + +T+ + P E S +P E++ ++ + E+ T +PE+ + T P
Sbjct: 30 TIAGVTAIATSITVPGIEVIVSADETAPADEASKSAEANTTKEAPATATPENTTEPTVEP 89
Query: 2041 ASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSP 2100
T ++++ PA++ + +E T S+P +T++S+PA+ + S
Sbjct: 90 KQTETKE---QTKTPEEKQPAAKQV--EKAPAEPATVSNP-DNATSSSTPATYNLLQKSA 143
Query: 2101 -ASESTTTSSPESESTTTSSPASES 2124
S +T S ++ ++S A+E+
Sbjct: 144 LRSGATVQSFIQTIQASSSQIAAEN 168
Score = 33.1 bits (75), Expect = 2.0
Identities = 23/133 (17%), Positives = 52/133 (39%), Gaps = 16/133 (12%)
Query: 2014 STTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASE 2073
S ++P E++ + + ++ E+TT + + T T +
Sbjct: 51 SADETAPADEASKSAEANTTKEAPATATPENTTEPTVEPKQTETKEQT-----------K 99
Query: 2074 STTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEE--QG 2131
+ PA++ +E T S+P +T++S+P + + S T++ Q
Sbjct: 100 TPEEKQPAAKQVEK--APAEPATVSNP-DNATSSSTPATYNLLQKSALRSGATVQSFIQT 156
Query: 2132 VSPHSEKLSANED 2144
+ S +++A D
Sbjct: 157 IQASSSQIAAEND 169
Score = 33.1 bits (75), Expect = 2.3
Identities = 27/109 (24%), Positives = 50/109 (45%), Gaps = 12/109 (11%)
Query: 2074 STTTSSPASE---STTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQ 2130
+T+ + P E S ++PA E++ ++ + ++ E+TT + + T +EQ
Sbjct: 38 ATSITVPGIEVIVSADETAPADEASKSAEANTTKEAPATATPENTTEPTVEPKQTETKEQ 97
Query: 2131 GVSPHSEKLSANEDPEEFPNEDVFEHTFAEIPNIDHSNQTDEAIPETFD 2179
+P EK A + E+ P E A + N D N T + P T++
Sbjct: 98 TKTP-EEKQPAAKQVEKAPAEP------ATVSNPD--NATSSSTPATYN 137
Score = 32.0 bits (72), Expect = 5.1
Identities = 22/124 (17%), Positives = 50/124 (40%), Gaps = 4/124 (3%)
Query: 1880 STVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSS 1939
+++ + + ++S + T + E+ + N E+ T++PE+ + T T +
Sbjct: 39 TSITVPGIEVIVSADETAPADEASKSAEANTTKEAPATATPENTTEPTVEPKQTETKEQT 98
Query: 1940 PESESTTTSSPESESTT----TSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSEST 1995
E ++ + E T S +T++S+P + + S T S +
Sbjct: 99 KTPEEKQPAAKQVEKAPAEPATVSNPDNATSSSTPATYNLLQKSALRSGATVQSFIQTIQ 158
Query: 1996 TTSS 1999
+SS
Sbjct: 159 ASSS 162
Score = 31.6 bits (71), Expect = 5.8
Identities = 15/85 (17%), Positives = 34/85 (40%), Gaps = 6/85 (7%)
Query: 1964 STTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSE 2023
S ++P E++ ++ + ++ E+TT + E + T T +
Sbjct: 51 SADETAPADEASKSAEANTTKEAPATATPENTTEPTVEPKQTETKEQTKTPEEKQPAAKQ 110
Query: 2024 ------STTTISPESESTTTSSPAS 2042
T+S +T++S+PA+
Sbjct: 111 VEKAPAEPATVSNPDNATSSSTPAT 135
>gnl|CDD|114299 pfam05568, ASFV_J13L, African swine fever virus J13L protein. This
family consists of several African swine fever virus J13L
proteins.
Length = 189
Score = 33.7 bits (76), Expect = 0.75
Identities = 27/115 (23%), Positives = 49/115 (42%), Gaps = 8/115 (6%)
Query: 2001 ESESTTTISPVSESTTTSSPVSESTTTISPESESTTTS-SPASESTTTNNPKSESTTTNN 2059
E E I+P + + V+ P ST ++ P + TN ++ TN
Sbjct: 64 EEEDIQFINPYQDQQW--AEVTPQPGIAKPAGASTASAGKPVMDRPATNRLVADKPATNK 121
Query: 2060 PASES--ITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPES 2112
P ++ + + PA+ S S+ AS+ + PA TT ++ + S T + E+
Sbjct: 122 PVMDNLGMAAGGPAAASAPASAAASDP---AHPAELYTTATTQNTASQTMPADEN 173
Score = 32.9 bits (74), Expect = 1.4
Identities = 17/92 (18%), Positives = 35/92 (38%), Gaps = 7/92 (7%)
Query: 2037 TSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTT 2096
+ PA ST + + + PA+ + + PA+ + PA+ S
Sbjct: 88 IAKPAGASTAS----AGKPVMDRPATNRLVADKPATNKPVMDNLG---MAAGGPAAASAP 140
Query: 2097 TSSPASESTTTSSPESESTTTSSPASESTTIE 2128
S+ AS+ + + +TT ++ + E
Sbjct: 141 ASAAASDPAHPAELYTTATTQNTASQTMPADE 172
>gnl|CDD|227568 COG5243, HRD1, HRD ubiquitin ligase complex, ER membrane component
[Posttranslational modification, protein turnover,
chaperones].
Length = 491
Score = 34.6 bits (79), Expect = 0.84
Identities = 31/125 (24%), Positives = 49/125 (39%), Gaps = 25/125 (20%)
Query: 2029 SPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTS 2088
SP S + +T NP + TTT P IT+SS + ++ + +S
Sbjct: 350 SPTPASPNVRN-TQIATQVPNPDNTPTTTAVPG---ITNSSNQGDPQASTFNGVPNANSS 405
Query: 2089 SPASESTTTSSPA--------------SESTTTSSPESESTTTSSPASEST-----TIEE 2129
A+ + SS S+ST+T++P +T T+ S ST T
Sbjct: 406 GFAAHTQDLSSVIPGWTMLPIPGTRRISQSTSTTNP--SATPTTGDPSNSTYGGPQTFPN 463
Query: 2130 QGVSP 2134
G +P
Sbjct: 464 SGNNP 468
Score = 32.6 bits (74), Expect = 2.9
Identities = 25/109 (22%), Positives = 43/109 (39%), Gaps = 6/109 (5%)
Query: 1899 SPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTS 1958
SP S N +T +P++ TTT+ T SS + + ++ + +S
Sbjct: 350 SPTPASPNVRN-TQIATQVPNPDNTPTTTAV---PGITNSSNQGDPQASTFNGVPNANSS 405
Query: 1959 SLVSESTTTSSPESESTTTSSPESE--STTTSSLVSESTTTSSPESEST 2005
+ + SS T P + S +TS+ +T T+ S ST
Sbjct: 406 GFAAHTQDLSSVIPGWTMLPIPGTRRISQSTSTTNPSATPTTGDPSNST 454
Score = 31.9 bits (72), Expect = 5.0
Identities = 30/127 (23%), Positives = 52/127 (40%), Gaps = 11/127 (8%)
Query: 1928 SSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTT 1987
SS S + +T +P++ TTT+ T SS + + ++ + +
Sbjct: 349 SSPTPASPNVRN-TQIATQVPNPDNTPTTTAV---PGITNSSNQGDPQASTFNGVPNANS 404
Query: 1988 SSLVSESTTTSSPESESTTTISP----VSESTTTSSPVSESTTTISPESESTTTSSPASE 2043
S + + SS T P +S+ST+T++P +T T S S T P +
Sbjct: 405 SGFAAHTQDLSSVIPGWTMLPIPGTRRISQSTSTTNP--SATPTTGDPSNS-TYGGPQTF 461
Query: 2044 STTTNNP 2050
+ NNP
Sbjct: 462 PNSGNNP 468
>gnl|CDD|227268 COG4932, COG4932, Predicted outer membrane protein [Cell envelope
biogenesis, outer membrane].
Length = 1531
Score = 34.8 bits (80), Expect = 0.87
Identities = 28/149 (18%), Positives = 47/149 (31%), Gaps = 18/149 (12%)
Query: 1850 LISMLAATAVAISVIDNYSEIIFTTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNN 1909
IS + ATA +I ST + + N ++ + N
Sbjct: 21 NISPVLATANDEKTETTTLKITKEDK---------STKEKINGSSFEKNKETGKTISLNI 71
Query: 1910 PESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSS 1969
P TTT S + TT + + T + E T+TS+ E
Sbjct: 72 PSEGLTTTDSLLVGDYEVKEKSAGLGTTLDEATYNVTLALKEEVITSTSTKTQE------ 125
Query: 1970 PESESTTTSSPESESTTTSSLVSESTTTS 1998
E T +PE + ++++ T
Sbjct: 126 ---EKTEIVTPEPSKKKLKAEITDNIFTP 151
Score = 32.5 bits (74), Expect = 3.6
Identities = 37/182 (20%), Positives = 60/182 (32%), Gaps = 21/182 (11%)
Query: 1784 IIFTTNNNSESTVVMSTLNSLLSENEKLFKPHAKT----PGAEFLIQCQYCDFDSSMNLL 1839
+ FT N E V ++ N + + L K + + GAEF + D N+L
Sbjct: 1319 VNFTIEFNQEEAVKVTKENDAKTGSVVLTKLDSSSGVTLEGAEFELL------DEEGNIL 1372
Query: 1840 --SVSPYITNNLLISMLAA-------TAVAISVIDNYSEIIFTTNNNSES--TVVMSTLN 1888
+ LL+ LA T + + + FT N E V +
Sbjct: 1373 KEGLVTDENGQLLVDDLAPGDYQFVETKAPTGYELDATPVDFTIEFNQEEALKVTKTNKL 1432
Query: 1889 SLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTS 1948
+ E++ N +E T N E +P + E T P+ E S
Sbjct: 1433 FIEFEDSIGNQLNAEEHTGNVGEEYVFKAKNPGHYKEGDQPITFEPTEPPKPDPEKRLDS 1492
Query: 1949 SP 1950
+
Sbjct: 1493 NN 1494
Score = 32.5 bits (74), Expect = 3.8
Identities = 26/146 (17%), Positives = 53/146 (36%), Gaps = 1/146 (0%)
Query: 1896 TTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESEST 1955
T N ++E+TT + + +T S + ++ + + P TTT S
Sbjct: 28 TANDEKTETTTLKITKEDKSTKEKINGSSFEKNKETGKTISLNIPSEGLTTTDSLLVGDY 87
Query: 1956 TTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTT-TSSPESESTTTISPVSES 2014
+ TT + + T + E T+TS+ E T +PE + ++++
Sbjct: 88 EVKEKSAGLGTTLDEATYNVTLALKEEVITSTSTKTQEEKTEIVTPEPSKKKLKAEITDN 147
Query: 2015 TTTSSPVSESTTTISPESESTTTSSP 2040
T + + + + SP
Sbjct: 148 IFTPVTLKDGNGYEANTTNRIPNGSP 173
Score = 32.1 bits (73), Expect = 4.5
Identities = 23/124 (18%), Positives = 49/124 (39%), Gaps = 4/124 (3%)
Query: 1936 TTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSEST 1995
T + ++E+TT + + +T + S + ++ + + P TTT SL+
Sbjct: 28 TANDEKTETTTLKITKEDKSTKEKINGSSFEKNKETGKTISLNIPSEGLTTTDSLLVGDY 87
Query: 1996 TTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSEST 2055
+ TT+ + T + E T+ S +++ T E + K ++
Sbjct: 88 EVKEKSAGLGTTLDE-ATYNVTLALKEEVITSTSTKTQEEKTEIVTPEPSKK---KLKAE 143
Query: 2056 TTNN 2059
T+N
Sbjct: 144 ITDN 147
>gnl|CDD|114140 pfam05399, EVI2A, Ectropic viral integration site 2A protein (EVI2A).
This family contains several mammalian ectropic viral
integration site 2A (EVI2A) proteins. The function of
this protein is unknown although it is thought to be a
membrane protein and may function as an oncogene in
retrovirus induced myeloid tumours.
Length = 227
Score = 33.5 bits (76), Expect = 0.95
Identities = 26/114 (22%), Positives = 38/114 (33%), Gaps = 1/114 (0%)
Query: 2005 TTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASES 2064
TT S +TT + + + I TS E+ TN P E + P +E
Sbjct: 15 TTVFSLSLGTTTNYTDLWAVSNEIWYSICQNLTSRNIPETNNTNPPTPEVNGKSTPTAEP 74
Query: 2065 ITSSSPASESTTTS-SPASESTTTSSPASESTTTSSPASESTTTSSPESESTTT 2117
TS+ ST+ S S S TS E+ E ++
Sbjct: 75 QTSTPVPLYSTSGSNFFTPSSAQNSPDTGGPGNTSKSKGETFKKEVCEENTSNF 128
Score = 32.7 bits (74), Expect = 1.9
Identities = 25/103 (24%), Positives = 38/103 (36%), Gaps = 1/103 (0%)
Query: 1985 TTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASE- 2043
TT SL +TT + + I TS + E+ T P E S+P +E
Sbjct: 15 TTVFSLSLGTTTNYTDLWAVSNEIWYSICQNLTSRNIPETNNTNPPTPEVNGKSTPTAEP 74
Query: 2044 STTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTT 2086
T+T P ++ +N S +S TS E+
Sbjct: 75 QTSTPVPLYSTSGSNFFTPSSAQNSPDTGGPGNTSKSKGETFK 117
Score = 32.3 bits (73), Expect = 2.1
Identities = 22/103 (21%), Positives = 37/103 (35%), Gaps = 1/103 (0%)
Query: 1955 TTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSE- 2013
TT SL +TT + + TS + E+ T+ P E +P +E
Sbjct: 15 TTVFSLSLGTTTNYTDLWAVSNEIWYSICQNLTSRNIPETNNTNPPTPEVNGKSTPTAEP 74
Query: 2014 STTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTT 2056
T+T P+ ++ + S S T+ K E+
Sbjct: 75 QTSTPVPLYSTSGSNFFTPSSAQNSPDTGGPGNTSKSKGETFK 117
Score = 32.0 bits (72), Expect = 3.2
Identities = 20/103 (19%), Positives = 35/103 (33%), Gaps = 1/103 (0%)
Query: 1945 TTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESE- 2003
TT S +TT + + + TS E+ T+ E S+P +E
Sbjct: 15 TTVFSLSLGTTTNYTDLWAVSNEIWYSICQNLTSRNIPETNNTNPPTPEVNGKSTPTAEP 74
Query: 2004 STTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTT 2046
T+T P+ ++ ++ S TS E+
Sbjct: 75 QTSTPVPLYSTSGSNFFTPSSAQNSPDTGGPGNTSKSKGETFK 117
Score = 31.2 bits (70), Expect = 6.2
Identities = 21/103 (20%), Positives = 39/103 (37%), Gaps = 3/103 (2%)
Query: 2027 TISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASE-ST 2085
++S + + T A + + T+ N P E+ ++ P E S+P +E T
Sbjct: 19 SLSLGTTTNYTDLWAVSNEIWYSICQNLTSRNIP--ETNNTNPPTPEVNGKSTPTAEPQT 76
Query: 2086 TTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIE 2128
+T P ++ ++ S S TS E+ E
Sbjct: 77 STPVPLYSTSGSNFFTPSSAQNSPDTGGPGNTSKSKGETFKKE 119
Score = 30.8 bits (69), Expect = 8.2
Identities = 24/115 (20%), Positives = 45/115 (39%), Gaps = 1/115 (0%)
Query: 1873 TTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVS 1932
+ +M+T+ SL TT + + TS E+ T+
Sbjct: 3 HKGHYLHLAFLMTTVFSLSLGTTTNYTDLWAVSNEIWYSICQNLTSRNIPETNNTNPPTP 62
Query: 1933 ESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPES-ESTTTSSPESESTT 1986
E S+P +E T++ ST+ S+ + S+ +SP++ TS + E+
Sbjct: 63 EVNGKSTPTAEPQTSTPVPLYSTSGSNFFTPSSAQNSPDTGGPGNTSKSKGETFK 117
>gnl|CDD|110602 pfam01611, Filo_glycop, Filovirus glycoprotein. This family includes
an extracellular region from the envelope glycoprotein of
Ebola and Marburg viruses. This region is also produced
as a separate transcript that gives rise to a
non-structural, secreted glycoprotein, which is produced
in large amounts and has an unknown function. Processing
of this protein may be involved in viral pathogenicity.
Length = 364
Score = 34.0 bits (78), Expect = 1.1
Identities = 32/155 (20%), Positives = 61/155 (39%), Gaps = 15/155 (9%)
Query: 1866 NYSEIIFTTNNNSESTVVMS-------TLN-SLLSENTTTNSPESESTTTNNPESESTTT 1917
N +F NN + + S LN ++ NT +N+ T + P +S
Sbjct: 214 NEFGTLFEVNNTTYVQLDPSHTPQFLPQLNETIYLTNTLSNTTGKLIWTVD-PSIDSG-- 270
Query: 1918 SSPESESTTTSSLVSE---STTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESES 1974
S E T V++ S+T S S S T++ ++ T S S+ + +
Sbjct: 271 -SGEWAFWETKKNVTKQGQSSTCLSTPSLSPRTTNHSRQAVTELDKNRTSLQPSTNNTTT 329
Query: 1975 TTTSSPESESTTTSSLVSESTTTSSPESESTTTIS 2009
+T++ + +T S+ ++ T + +S T
Sbjct: 330 ISTNNTSKHNFSTQSIPLQNFTNDNSQSTLTENEQ 364
Score = 33.3 bits (76), Expect = 1.9
Identities = 35/191 (18%), Positives = 79/191 (41%), Gaps = 16/191 (8%)
Query: 1894 NTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESE 1953
N T +S T+TN ++ + + +L + TT S T P+
Sbjct: 190 NMTLDSTSYYWTSTNEYQTNNFGCN-------EFGTLFEVNNTTYVQLDPSHT---PQFL 239
Query: 1954 STTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSE 2013
++ +T +++ T P +S + E+ + + +S+T +S S
Sbjct: 240 PQLNETIYLTNTLSNTTGKLIWTVD-PSIDSGSGEWAFWETKKNVTKQGQSSTCLSTPSL 298
Query: 2014 STTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASE 2073
S T++ ++ T E + TS S + TT + +T+ +N +++SI + ++
Sbjct: 299 SPRTTNHSRQAVT----ELDKNRTSLQPSTNNTTTISTN-NTSKHNFSTQSIPLQNFTND 353
Query: 2074 STTTSSPASES 2084
++ ++ +E
Sbjct: 354 NSQSTLTENEQ 364
Score = 31.4 bits (71), Expect = 6.8
Identities = 28/162 (17%), Positives = 58/162 (35%), Gaps = 12/162 (7%)
Query: 1968 SSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTT 2027
S P E + + TS+ ++ E + ++ + S +
Sbjct: 182 SRPGQEPRNMTLDSTSYYWTSTNEYQTNNFGCNEFGTLFEVNNTTYVQLDPSHTPQFLPQ 241
Query: 2028 ISPESESTTTSSPASESTT-TNNPKSEST-------TTNNPASESITSSSPASESTTTSS 2079
++ T T S + T +P +S T ++ SS+ S T + S
Sbjct: 242 LNETIYLTNTLSNTTGKLIWTVDPSIDSGSGEWAFWETKKNVTKQGQSSTCLS--TPSLS 299
Query: 2080 PASESTTTSSPAS--ESTTTSSPASESTTTSSPESESTTTSS 2119
P + + + + ++ T+ P++ +TTT S + S S
Sbjct: 300 PRTTNHSRQAVTELDKNRTSLQPSTNNTTTISTNNTSKHNFS 341
>gnl|CDD|148635 pfam07139, DUF1387, Protein of unknown function (DUF1387). This
family represents a conserved region approximately 300
residues long within a number of hypothetical proteins of
unknown function that seem to be restricted to mammals.
Length = 301
Score = 33.8 bits (77), Expect = 1.1
Identities = 27/143 (18%), Positives = 49/143 (34%), Gaps = 8/143 (5%)
Query: 2030 PESESTTTSSPASESTTTNNPKSESTTTN----NPASESITSSSPASESTTTSSPASEST 2085
PE+ + + S + P E N N +++ S SE ++S +
Sbjct: 18 PEAPAKSASKEETTPEEQAAPGDEKDEVNGFHANGSADDTESVDSLSEGLDSASLDAREP 77
Query: 2086 TTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSAN-ED 2144
+ S +S + S +S+ SSP S + +++K +
Sbjct: 78 EAVTL---DAPPSPSSSLTNGLSDLQSKLELQSSPHSSAKPHPSSDQHKNAKKYVSKPSQ 134
Query: 2145 PEEFPNEDVFEHTFAEIPNIDHS 2167
P N + A PNI+ S
Sbjct: 135 PVTPNNSAHHDAPAALGPNIEKS 157
Score = 32.7 bits (74), Expect = 2.6
Identities = 29/153 (18%), Positives = 63/153 (41%), Gaps = 14/153 (9%)
Query: 1938 SSPESESTTTSSPESESTTTSSLVSESTTTSSPESEST-----TTSSPESESTTTSSLVS 1992
S P+ E+ S+ + E+T E E + S+ ++ES + S
Sbjct: 14 SKPKPEAPAKSASKEETT------PEEQAAPGDEKDEVNGFHANGSADDTESVDSLSEGL 67
Query: 1993 ESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKS 2052
+S + + E E+ T +P S S++ ++ +S+ + + +S +++ P S N K
Sbjct: 68 DSASLDAREPEAVTLDAPPSPSSSLTNGLSDLQSKLELQSSPHSSAKPHPSSDQHKNAKK 127
Query: 2053 ESTTTNNPASESITSSSPASESTTTSSPASEST 2085
+ P+ ++S ++ P E +
Sbjct: 128 YVS---KPSQPVTPNNSAHHDAPAALGPNIEKS 157
>gnl|CDD|185638 PTZ00459, PTZ00459, mucin-associated surface protein (MASP);
Provisional.
Length = 291
Score = 33.6 bits (76), Expect = 1.1
Identities = 50/227 (22%), Positives = 83/227 (36%), Gaps = 26/227 (11%)
Query: 1918 SSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSS--PESEST 1975
S PES+ TSS ++ + ++ + P E SE E E
Sbjct: 41 SPPESKGLETSSQGTQDLKGGAAGAKENSPPLPTEEDDEDVDDDSEEGDDDDGGAEDEEE 100
Query: 1976 TTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSE----STTTISPE 2031
+S T +L S ST SE T +S S + + S E T T
Sbjct: 101 EKVRGQSGQEGTVALGSGSTEKKLIGSEKQTELSISSAESISPSGSRELNVNLTQTEVEG 160
Query: 2032 SESTTTSSPASEST-TTNNPKS-------ESTTTNNPASESITSSSPASESTTT------ 2077
+ T ++PA E+ TT N ++ E + P + I S E TT+
Sbjct: 161 KKETDKNTPAVENPLTTGNGENTLPAGIVEGNPSPPPPQDGIHSREQDGEGTTSEGQKNV 220
Query: 2078 ------SSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTS 2118
++P S S E T ++ + +T T++ ++ +T+
Sbjct: 221 PLPETAATPQSHHDKGSEGTGEDTKATTVTANTTDTTNTQNSDGSTA 267
Score = 33.2 bits (75), Expect = 1.4
Identities = 41/168 (24%), Positives = 66/168 (39%), Gaps = 19/168 (11%)
Query: 1952 SESTTTSSLVSESTTTSSPESESTTTSSPESE----STTTSSLVSESTTTSSPESESTTT 2007
S ST + SE T S S + + S E T T + T ++P E+ T
Sbjct: 117 SGSTEKKLIGSEKQTELSISSAESISPSGSRELNVNLTQTEVEGKKETDKNTPAVENPLT 176
Query: 2008 ISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITS 2067
+ T + + E + P + + E TT+ K N P E T+
Sbjct: 177 T--GNGENTLPAGIVEGNPSPPPPQDGIHSREQDGEGTTSEGQK------NVPLPE--TA 226
Query: 2068 SSPASESTTTSSPASEST-TTSSPASESTTTSSPASESTT----TSSP 2110
++P S S E T T+ A+ + TT++ S+ +T T+SP
Sbjct: 227 ATPQSHHDKGSEGTGEDTKATTVTANTTDTTNTQNSDGSTAVSHTTSP 274
Score = 32.1 bits (72), Expect = 3.8
Identities = 28/118 (23%), Positives = 52/118 (44%), Gaps = 5/118 (4%)
Query: 1895 TTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESES 1954
T T + T N P E+ T+ E+T + +V E + P + + + E
Sbjct: 154 TQTEVEGKKETDKNTPAVENPLTTG-NGENTLPAGIV-EGNPSPPPPQDGIHSREQDGEG 211
Query: 1955 TTTSSL--VSESTTTSSPESESTTTSSPESESTTTSSLVSEST-TTSSPESESTTTIS 2009
TT+ V T ++P+S S E T +++ + +T TT++ S+ +T +S
Sbjct: 212 TTSEGQKNVPLPETAATPQSHHDKGSEGTGEDTKATTVTANTTDTTNTQNSDGSTAVS 269
>gnl|CDD|191179 pfam05053, Menin, Menin. MEN1, the gene responsible for multiple
endocrine neoplasia type 1, is a tumour suppressor gene
that encodes a protein called Menin which may be an
atypical GTPase stimulated by nm23.
Length = 618
Score = 34.2 bits (78), Expect = 1.2
Identities = 19/128 (14%), Positives = 46/128 (35%), Gaps = 11/128 (8%)
Query: 2021 VSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSP 2080
+ + PE E+ + A E + P ES + ES P
Sbjct: 451 IRQKVVIKLPEKEAKESKEAAGEEAREGRRRG-------PRRESKSQEPSGGESPNPELP 503
Query: 2081 A-SESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQG---VSPHS 2136
A + ++ +++ + A+ + ++ + S T+ S + + ++ +S
Sbjct: 504 ANNNNSNSNNNNNNGADRKEAAATTGNATTTSNGSGTSVPLPVSSEPPQHKEGPVITFYS 563
Query: 2137 EKLSANED 2144
EK+ ++
Sbjct: 564 EKMKGMKE 571
>gnl|CDD|223061 PHA03369, PHA03369, capsid maturational protease; Provisional.
Length = 663
Score = 33.8 bits (77), Expect = 1.2
Identities = 27/184 (14%), Positives = 55/184 (29%), Gaps = 12/184 (6%)
Query: 1943 ESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPES 2002
E + + E E+T S + + + + + T ++ + + + + T
Sbjct: 490 EEQESLAKELEATAHKSEIKKIAESEFKNAGAKTAAANIEPNCSADA---AAPATKRARP 546
Query: 2003 ESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPAS 2062
E+ T + V + S S + ++ T + T
Sbjct: 547 ETKTELEAVVRFPYQIRNMESPAFVHSFTSTTLAAAAGQGSDTAEALAGAIETLLTQ--- 603
Query: 2063 ESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPAS 2122
S+ PA S + S+PAS + TS+P E++
Sbjct: 604 ---ASAQPAGLSLPA---PAVPVNASTPASTPPPLAPQEPPQPGTSAPSLETSLPQQKPV 657
Query: 2123 ESTT 2126
S
Sbjct: 658 LSKG 661
Score = 32.7 bits (74), Expect = 3.4
Identities = 27/179 (15%), Positives = 60/179 (33%), Gaps = 14/179 (7%)
Query: 1953 ESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVS 2012
E + + E+T S + + + + T ++ + + + + + T
Sbjct: 490 EEQESLAKELEATAHKSEIKKIAESEFKNAGAKTAAANIEPNCSADAA---APATKRARP 546
Query: 2013 ESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTN-NPASESITSSSPA 2071
E+ T V I S S + + T A E++ + +
Sbjct: 547 ETKTELEAVVRFPYQIRNMESPAFVHSFTSTTLAAAAGQGSDTAEALAGAIETLLTQA-- 604
Query: 2072 SESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQ 2130
++ PA S + S+PAS + E TS+P+ E++ +++
Sbjct: 605 -----SAQPAGLSLPA---PAVPVNASTPASTPPPLAPQEPPQPGTSAPSLETSLPQQK 655
Score = 31.1 bits (70), Expect = 9.3
Identities = 20/132 (15%), Positives = 31/132 (23%), Gaps = 10/132 (7%)
Query: 2023 ESTTTISPESESTTTSSPAS--ESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSP 2080
++ + +P + A T P I S PA T P
Sbjct: 349 KTASLTAPSRVLAAAAKVAVIAAPQTHTGPADRQRPQRPDG---IPYSVPARSPMTAYPP 405
Query: 2081 ASESTTTSSPASES-----TTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVSPH 2135
+ S T+ P P + S A+ Q
Sbjct: 406 VPQFCGDPGLVSPYNPQSPGTSYGPEPVGPVPPQPTNPYVMPISMANMVYPGHPQEHGHE 465
Query: 2136 SEKLSANEDPEE 2147
++ E EE
Sbjct: 466 RKRKRGGELKEE 477
>gnl|CDD|111090 pfam02158, Neuregulin, Neuregulin family.
Length = 406
Score = 33.7 bits (76), Expect = 1.3
Identities = 56/276 (20%), Positives = 103/276 (37%), Gaps = 23/276 (8%)
Query: 1865 DNYSEIIFTTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESE- 1923
++ S I+ ++ NS + L + P+ S + ++ + SP SE
Sbjct: 136 ESNSVIMMSSVENSRHSSPAGGPRGRL--HGIGGPPDDCSFLRHARDTPDSYRDSPHSER 193
Query: 1924 ---STTTSSLVS--ESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTS 1978
+ TT + +S + T SP+S S PES + V+ S E S
Sbjct: 194 YVSAMTTPARMSPVDFHTPISPKSPCLEMSPPESSLAVSMPSVAVSPFIEE-ERPLLLVS 252
Query: 1979 SPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTS 2038
P + T + ++ +P +S +S P + E E+T
Sbjct: 253 PPRLREKKYDHKTPQKT---HHKQHNSFHHNPAHDS--SSLPPNPLRIVEDEEYETTQEY 307
Query: 2039 SPASEST--TTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASEST- 2095
P+ E TN+ +++ T N + + S +++ S SES T E T
Sbjct: 308 EPSLEPAKKLTNSRRAKRTKPNGHIANRLELDS----DSSSESSNSESETEDERIGEDTP 363
Query: 2096 --TTSSPASESTTTSSPESESTTTSSPASESTTIEE 2129
+P + S ++ + + ++PA +T EE
Sbjct: 364 FLGIQNPLAASLESAPAFRHADSRTNPAGRFSTQEE 399
Score = 33.3 bits (75), Expect = 1.9
Identities = 38/164 (23%), Positives = 64/164 (39%), Gaps = 18/164 (10%)
Query: 1971 ESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISP 2030
E+E++ ++S + + S+ V+++ + S S + IS S S S V S
Sbjct: 96 ETETSFSTSHYTSTAHHSTTVTQTPSHSWSNGHSESMISEESNSVIMMSSVENS------ 149
Query: 2031 ESESTTTSSPASESTTTNNPKSESTTTNNPASESITS--SSPASES--TTTSSPASESTT 2086
S+ P P + + + A ++ S SP SE + ++PA S
Sbjct: 150 -RHSSPAGGPRGRLHGIGGPPDDCSFLRH-ARDTPDSYRDSPHSERYVSAMTTPARMSPV 207
Query: 2087 TSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQ 2130
+ T SP S S PES + + S IEE+
Sbjct: 208 ------DFHTPISPKSPCLEMSPPESSLAVSMPSVAVSPFIEEE 245
>gnl|CDD|234055 TIGR02907, spore_VI_D, stage VI sporulation protein D. SpoVID, the
stage VI sporulation protein D, is restricted to
endospore-forming members of the bacteria, all of which
are found among the Firmicutes. It is widely distributed
but not quite universal in this group. Between
well-conserved N-terminal and C-terminal domains is a
poorly conserved, low-complexity region of variable
length, rich enough in glutamic acid to cause spurious
BLAST search results unless a filter is used. The seed
alignment for this model was trimmed, in effect, by
choosing member sequences in which these regions are
relatively short. SpoVID is involved in spore coat
assembly by the mother cell compartment late in the
process of sporulation [Cellular processes, Sporulation
and germination].
Length = 338
Score = 33.3 bits (76), Expect = 1.4
Identities = 20/115 (17%), Positives = 32/115 (27%)
Query: 2072 SESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQG 2131
S S PA E T ++ A E + + S + E
Sbjct: 156 SFSAEFEHPAQEETAGEEERTDEPKVEHEAHEQHEQPADDDPDEWKISASEPFQLESEVE 215
Query: 2132 VSPHSEKLSANEDPEEFPNEDVFEHTFAEIPNIDHSNQTDEAIPETFDAREEWPQ 2186
SP E ED E ED + + + + + + EE +
Sbjct: 216 ASPEEENYEEYEDETELEVEDEEKALDEQTEDPQQEDALAGDAKKALEEEEEKGE 270
>gnl|CDD|217490 pfam03318, ETX_MTX2, Clostridium epsilon toxin ETX/Bacillus
mosquitocidal toxin MTX2. This family appears to be
distantly related to pfam01117.
Length = 228
Score = 33.2 bits (76), Expect = 1.4
Identities = 46/196 (23%), Positives = 73/196 (37%), Gaps = 12/196 (6%)
Query: 1874 TNNNSESTVVMSTLNSLLSENTTTNSP-ESESTTTNNPESESTTTSSPESESTTTSSLVS 1932
TN T+ + T +T TNN +S T + S+ TT +
Sbjct: 3 TNGFPSYINFNVTVLDEETTVKTLTPLYTGSNTLTNNTDSTQTLQTQSFSKKVTT----T 58
Query: 1933 ESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVS 1992
STTT+ S V+E T S E +S+ + ++TT + +
Sbjct: 59 TSTTTTHGFKIGAKAS-----GKFGIPFVAEGGITLSVTGEYNFSSTTTNTTSTTETYTA 113
Query: 1993 ESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKS 2052
S + P +T T++ TT S PV + TT+S + SS + P S
Sbjct: 114 PSQKVTVP-PHTTVTVTLYLYKTTYSGPV-DLYTTLSGTFFISIVSSVSFTRDGYVEPAS 171
Query: 2053 ESTTTNNPASESITSS 2068
T + P ++I S
Sbjct: 172 YVLTASWPLYDTIFLS 187
>gnl|CDD|117051 pfam08474, MYT1, Myelin transcription factor 1. This domain is found
in the myelin transcription factor 1 (MYT1) of chordates.
MYT1 contains C2HC zinc finger domains (pfam01530) and is
expressed in developing neurons of the central nervous
system where it is involved in the selection of neuronal
precursor cells.
Length = 257
Score = 33.2 bits (75), Expect = 1.5
Identities = 35/185 (18%), Positives = 57/185 (30%), Gaps = 11/185 (5%)
Query: 1968 SSPESESTTTSSPESES-----TTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVS 2022
SPES ++ S S T S V S+ +SE+ + + S+
Sbjct: 65 PSPESSHFSSYVKSSSSLPSAGAHTQSTVRASSFDYGQDSEAAHMAA--TAILNLSTRCR 122
Query: 2023 ESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPAS 2082
E +S + + E N S N +SI +S + T SSP S
Sbjct: 123 EMPDNLSTKPQDLRAKGADIE-VDENGTLDLSMKKNRIRDKSIPPTSSCTTIATPSSPMS 181
Query: 2083 ESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSAN 2142
+S + + + P + ES + + P E L
Sbjct: 182 PQKASSLLVNAA---FYQLCDQDGWDVPIDYTKPHRKTEEESKEKDPVNLDPSLENLEEK 238
Query: 2143 EDPEE 2147
+ E
Sbjct: 239 KFAGE 243
>gnl|CDD|233787 TIGR02223, ftsN, cell division protein FtsN. FtsN is a poorly
conserved protein active in cell division in a number of
Proteobacteria. The N-terminal 30 residue region tends to
by Lys/Arg-rich, and is followed by a membrane-spanning
region. This is followed by an acidic low-complexity
region of variable length and a well-conserved C-terminal
domain of two tandem regions matched by pfam05036
(Sporulation related repeat), found in several cell
division and sporulation proteins. The role of FtsN as a
suppressor for other cell division mutations is poorly
understood; it may involve cell wall hydrolysis [Cellular
processes, Cell division].
Length = 298
Score = 33.1 bits (75), Expect = 1.5
Identities = 27/190 (14%), Positives = 52/190 (27%), Gaps = 20/190 (10%)
Query: 1890 LLSENTTTNSPESE--STTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTT 1947
LL+E+ N PE+ T N E+ + PE + E
Sbjct: 47 LLTESKQANEPETLQPKNQTENGETAADLPPKPEERWS-----YIEELEAREVLINDPEE 101
Query: 1948 SSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTT 2007
S ++ L +E + + + S T + E+ T
Sbjct: 102 PSNGGGVEESAQLTAEQRQLLEQM-------QADMRAAEKVLATAPSEQTVAVEARKQTA 154
Query: 2008 ISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITS 2067
++ T E+E + ++ PK + T +N
Sbjct: 155 EKKPQKARTA------EAQKTPVETEKIASKVKEAKQKQKALPKQTAETQSNSKPIETAP 208
Query: 2068 SSPASESTTT 2077
+ ++ T
Sbjct: 209 KADKADKTKP 218
>gnl|CDD|222011 pfam13257, DUF4048, Domain of unknown function (DUF4048). This
presumed domain is functionally uncharacterized. This
domain family is found in eukaryotes, and is typically
between 228 and 257 amino acids in length.
Length = 242
Score = 32.8 bits (75), Expect = 1.6
Identities = 17/81 (20%), Positives = 31/81 (38%), Gaps = 5/81 (6%)
Query: 2071 ASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIE-- 2128
A+ES T P S + + S + + S+ +S + TS S+S I+
Sbjct: 116 ATESRTVPPPRSRRSGSRSTSRSRLRLQGGSLSSSRSSRSSTSKGATSGKDSKSADIDVS 175
Query: 2129 ---EQGVSPHSEKLSANEDPE 2146
E G+ +K + +
Sbjct: 176 FWSEFGIDTPGQKSKSPQKAS 196
Score = 32.4 bits (74), Expect = 2.6
Identities = 20/105 (19%), Positives = 40/105 (38%), Gaps = 7/105 (6%)
Query: 2032 SESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASEST-TTSSP 2090
+ES T P S + + + + S SS ++ TS S+S S
Sbjct: 117 TESRTVPPPRSRRSGSRSTSRSRLRLQGGSLSSSRSSRSSTSKGATSGKDSKSADIDVSF 176
Query: 2091 ASE------STTTSSPASESTTTSSPESESTTTSSPASESTTIEE 2129
SE + SP S+T + ++ + ++ +S +++
Sbjct: 177 WSEFGIDTPGQKSKSPQKASSTPAGNTNQGQSQNAQSSNLLDVDD 221
Score = 30.9 bits (70), Expect = 6.6
Identities = 23/107 (21%), Positives = 45/107 (42%), Gaps = 3/107 (2%)
Query: 2041 ASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASEST-TTSS 2099
A+ES T P+S + + + + + + S+ +S ++ TS S+S S
Sbjct: 116 ATESRTVPPPRSRRSGSRSTSRSRLRLQGGSLSSSRSSRSSTSKGATSGKDSKSADIDVS 175
Query: 2100 PASE-STTTSSPESEST-TTSSPASESTTIEEQGVSPHSEKLSANED 2144
SE T +S+S SS + +T + + S L +++
Sbjct: 176 FWSEFGIDTPGQKSKSPQKASSTPAGNTNQGQSQNAQSSNLLDVDDN 222
>gnl|CDD|184920 PRK14956, PRK14956, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 484
Score = 33.4 bits (76), Expect = 1.6
Identities = 14/74 (18%), Positives = 25/74 (33%)
Query: 1896 TTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESEST 1955
+ N PE + + + TTS S S+ + + + S+S
Sbjct: 392 SKNIPEDVEPVKKISTPPPLQQEASKKKDPTTSDQKLNSQFESNQQDSNLDNNPLPSKSE 451
Query: 1956 TTSSLVSESTTTSS 1969
+ S S TS+
Sbjct: 452 SQSEPPSSKFDTST 465
Score = 33.4 bits (76), Expect = 1.6
Identities = 22/106 (20%), Positives = 35/106 (33%), Gaps = 15/106 (14%)
Query: 2046 TTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASEST 2105
+ N P+ + + + TTS S S+ + + S+S
Sbjct: 392 SKNIPEDVEPVKKISTPPPLQQEASKKKDPTTSDQKLNSQFESNQQDSNLDNNPLPSKSE 451
Query: 2106 TTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSANE-DPEEFPN 2150
+ S P SS ST I+ +K E DP +FP
Sbjct: 452 SQSEP------PSSKFDTSTEIK--------KKFLGTEVDPNQFPK 483
Score = 33.0 bits (75), Expect = 2.0
Identities = 14/78 (17%), Positives = 28/78 (35%)
Query: 1916 TTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESEST 1975
+ + PE + + + TTS + S S+ + + S+S
Sbjct: 392 SKNIPEDVEPVKKISTPPPLQQEASKKKDPTTSDQKLNSQFESNQQDSNLDNNPLPSKSE 451
Query: 1976 TTSSPESESTTTSSLVSE 1993
+ S P S TS+ + +
Sbjct: 452 SQSEPPSSKFDTSTEIKK 469
Score = 33.0 bits (75), Expect = 2.2
Identities = 14/82 (17%), Positives = 27/82 (32%)
Query: 1882 VVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPE 1941
+V + N + + TTS + S S+ + +
Sbjct: 388 MVQGSKNIPEDVEPVKKISTPPPLQQEASKKKDPTTSDQKLNSQFESNQQDSNLDNNPLP 447
Query: 1942 SESTTTSSPESESTTTSSLVSE 1963
S+S + S P S TS+ + +
Sbjct: 448 SKSESQSEPPSSKFDTSTEIKK 469
Score = 31.1 bits (70), Expect = 8.1
Identities = 16/74 (21%), Positives = 27/74 (36%)
Query: 1906 TTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSEST 1965
+ N PE + + TTS + S S+ + + + L S+S
Sbjct: 392 SKNIPEDVEPVKKISTPPPLQQEASKKKDPTTSDQKLNSQFESNQQDSNLDNNPLPSKSE 451
Query: 1966 TTSSPESESTTTSS 1979
+ S P S TS+
Sbjct: 452 SQSEPPSSKFDTST 465
>gnl|CDD|171664 PRK12688, PRK12688, flagellin; Reviewed.
Length = 751
Score = 33.7 bits (77), Expect = 1.7
Identities = 36/247 (14%), Positives = 99/247 (40%), Gaps = 13/247 (5%)
Query: 1872 FTTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLV 1931
++T +N +T+ +T + L + ++ S + + +T + + T SL
Sbjct: 105 YSTKSNVSTTISGATADDLRGTTSYASATASSNVLYDGAAGGATAATGATTLGGTAGSLA 164
Query: 1932 SESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESEST--------TTSSPESE 1983
T + T T + + + TT++ + + + ++ + + ++P S
Sbjct: 165 GTGATAGDGTTALTGTITLIATNGTTATGLLGNAQPADGDTLTVNGKTITFRSGAAPAST 224
Query: 1984 STTTSSLVSESTTT----SSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSS 2039
+ + S VS + T +S + T++ + + +S V ++ T S + ++S
Sbjct: 225 AVPSGSGVSGNLVTDGNGNSTVYLGSATVNDLLSAIDLASGV-QTVTISSGAATIAVSAS 283
Query: 2040 PASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSS 2099
+ S + ++T S + + + TT++ A +T ++ + + + +
Sbjct: 284 GGAVSAAAAGAVTLKSSTGADLSVTGKADLLKALGLTTATGAGNATVNANRTTSAGSLGA 343
Query: 2100 PASESTT 2106
+ +T
Sbjct: 344 LIQDGST 350
>gnl|CDD|216289 pfam01080, Presenilin, Presenilin. Mutations in presenilin-1 are a
major cause of early onset Alzheimer's disease. It has
been found that presenilin-1 binds to beta-catenin
in-vivo. This family also contains SPE proteins from
C.elegans.
Length = 403
Score = 33.3 bits (76), Expect = 1.7
Identities = 27/93 (29%), Positives = 39/93 (41%), Gaps = 3/93 (3%)
Query: 2056 TTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASEST-TTSSPESES 2114
+ +E S+ + +T S+ +S TS E SS S + S E+ES
Sbjct: 230 SNQEETNEGTPSTIRRTSKSTRSAANPDSAPTSHSTLELPEKSSTPELSDDESDSSETES 289
Query: 2115 TTTSSPASESTTIEEQGVSPHSEKLSANEDPEE 2147
+ SS A E E+ V S L +NE EE
Sbjct: 290 QSDSSLAPEEDAAEQPEVQ--SNSLPSNEKREE 320
Score = 32.5 bits (74), Expect = 2.8
Identities = 31/102 (30%), Positives = 50/102 (49%), Gaps = 6/102 (5%)
Query: 1880 STVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSS 1939
STVV+ T+ S E T +P + T+ + + ++P+S T+ S+L +S+
Sbjct: 221 STVVVLTVGSN-QEETNEGTPSTIRRTSK----STRSAANPDSAPTSHSTL-ELPEKSST 274
Query: 1940 PESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPE 1981
PE + S E+ES + SSL E PE +S + S E
Sbjct: 275 PELSDDESDSSETESQSDSSLAPEEDAAEQPEVQSNSLPSNE 316
Score = 31.3 bits (71), Expect = 6.7
Identities = 19/69 (27%), Positives = 30/69 (43%), Gaps = 1/69 (1%)
Query: 1926 TTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESEST-TTSSPESES 1984
+ +E T ++ + +T S+ +S TS E SS S + S E+ES
Sbjct: 230 SNQEETNEGTPSTIRRTSKSTRSAANPDSAPTSHSTLELPEKSSTPELSDDESDSSETES 289
Query: 1985 TTTSSLVSE 1993
+ SSL E
Sbjct: 290 QSDSSLAPE 298
>gnl|CDD|221429 pfam12118, SprA-related, SprA-related family. This protein is found
in bacteria. Proteins in this family are typically
between 234 to 465 amino acids in length. There is a
conserved GEV sequence motif.Most members are annotated
as being SprA-related.
Length = 261
Score = 32.8 bits (75), Expect = 1.8
Identities = 16/88 (18%), Positives = 42/88 (47%), Gaps = 4/88 (4%)
Query: 2064 SITSSSPASESTTTSSP---ASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSP 2120
+I++S + S +++ A T T + A + ++S ++ + +S +++ +S
Sbjct: 1 NISNSLSSISSGSSAPIGTSALRGTNTPAAAKPAPSSSEASNAGSGSSEQKAKLKGQAST 60
Query: 2121 ASESTTIEEQGVSP-HSEKLSANEDPEE 2147
A+ S + E Q + +++ E+ E
Sbjct: 61 AAGSASQELQKQASESNDEEVVGEEEPE 88
>gnl|CDD|219081 pfam06546, Vert_HS_TF, Vertebrate heat shock transcription factor.
This family represents the C-terminal region of
vertebrate heat shock transcription factors. Heat shock
transcription factors regulate the expression of heat
shock proteins - a set of proteins that protect the cell
from damage caused by stress and aid the cell's recovery
after the removal of stress. This C-terminal region is
found with the N-terminal pfam00447, and may contain a
three-stranded coiled-coil trimerisation domain and a CE2
regulatory region, the latter of which is involved in
sustained heat shock response.
Length = 252
Score = 32.8 bits (74), Expect = 1.8
Identities = 18/96 (18%), Positives = 35/96 (36%)
Query: 2035 TTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASES 2094
+++ S + +S ++ P T +S + E +SSP S+S
Sbjct: 3 SSSGSYSPDSVASSGPIISDVTELAESSPVASPDGSIEERAVSSSPLVRIKEEPPSPSQS 62
Query: 2095 TTTSSPASESTTTSSPESESTTTSSPASESTTIEEQ 2130
S S +P S +T S +E + ++
Sbjct: 63 PEQSEAVPGSDLVDTPLSPTTFIDSILNEEEPVSQE 98
Score = 31.2 bits (70), Expect = 5.3
Identities = 21/94 (22%), Positives = 35/94 (37%)
Query: 1925 TTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESES 1984
+++ S +S +S P T + S + + E +SSP S+S
Sbjct: 3 SSSGSYSPDSVASSGPIISDVTELAESSPVASPDGSIEERAVSSSPLVRIKEEPPSPSQS 62
Query: 1985 TTTSSLVSESTTTSSPESESTTTISPVSESTTTS 2018
S V S +P S +T S ++E S
Sbjct: 63 PEQSEAVPGSDLVDTPLSPTTFIDSILNEEEPVS 96
>gnl|CDD|185219 PRK15319, PRK15319, AIDA autotransporter-like protein ShdA;
Provisional.
Length = 2039
Score = 33.5 bits (76), Expect = 1.9
Identities = 61/292 (20%), Positives = 120/292 (41%), Gaps = 50/292 (17%)
Query: 1865 DNYSEIIFTTNNNSESTVVMSTLNSLLSENTTTNSP--ESESTTTNNPESESTTTSSPES 1922
Y ++ T +++ T L+ + +TT N P ++ST T + + ++ ++
Sbjct: 341 TGYEDLNALTVSDANVTSDTVALH--VDGSTTINDPIELTDSTFTAPTAIKLGSKATIQA 398
Query: 1923 ESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPES 1982
E+TT + + ++ +SS S S ST T S+ + TT S ++ + P
Sbjct: 399 ENTTLTGNIVQTDASSSSLSLS-------QGSTLTGSVDAMFTTLSLDDTSQWNMTDP-- 449
Query: 1983 ESTTTSSLVSESTTTSSPESESTTTISPVSES--------------TTTSSPV------- 2021
+T +L ++ T S ST T+ V + T SSP+
Sbjct: 450 --STVGNLTNDGDITLGNASGSTGTLLTVDNTLTLQDGSQINATLDTANSSPIIKAANVT 507
Query: 2022 -------SESTTTISPESES-----TTTSSPASESTTTNNPKSESTTTNNPASESITSSS 2069
S + T ++PE++ T S + +T ++ ++ T+ P +I +
Sbjct: 508 LDGTLNLSSTATFVAPETDEHFGSITLIDSQTAITTDFDSVTLDADTSAMPDYLTINAGV 567
Query: 2070 PASESTT--TSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSS 2119
A+++T S+ S +S + T + A + T +S E+T TS+
Sbjct: 568 DANDNTNYELSTGLSWYAGANSARAAHGTFTVDAGSTFTVTSELDETTATSN 619
>gnl|CDD|219210 pfam06873, SerH, Cell surface immobilisation antigen SerH. This
family consists of several cell surface immobilisation
antigen SerH proteins which seem to be specific to
Tetrahymena thermophila. The SerH locus of Tetrahymena
thermophila is one of several paralogous loci with genes
encoding variants of the major cell surface protein known
as the immobilisation antigen (i-ag).
Length = 407
Score = 33.4 bits (76), Expect = 1.9
Identities = 41/208 (19%), Positives = 86/208 (41%), Gaps = 17/208 (8%)
Query: 1928 SSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTS---SPESES 1984
++ V+ S + S+ + T S + TT + VS + S + T S + + +
Sbjct: 100 TACVASSASCSNRRRGAWTDSDCTLCNPTTPAAVSGACQACSSITSGWTDSNCNACATTA 159
Query: 1985 TTTSSLVSESTTTSS--PESESTTTISPVSESTTTS--SPVSESTTTISPESESTTTSSP 2040
+ +S V ++ S+ S S + S + + T + + +T + + S SS
Sbjct: 160 SPKNSNVFANSAGSACVASSASCGSTSRGTTAWTDADCLLCNPTTPYLVGDKSSCAASSC 219
Query: 2041 ASESTTTN----------NPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSP 2090
A+ S++T+ N + +T N A+ + +S +S S +SS + + T
Sbjct: 220 AACSSSTSGWTDSDCNACNTTASPSTKNIFANAAGSSCVASSASCGSSSRGTTAWTDGDC 279
Query: 2091 ASESTTTSSPASESTTTSSPESESTTTS 2118
+ +T + + S +S S T+
Sbjct: 280 TLCTPSTPAVYASSDGSSCVACSSITSG 307
Score = 32.2 bits (73), Expect = 3.6
Identities = 46/230 (20%), Positives = 82/230 (35%), Gaps = 16/230 (6%)
Query: 1914 STTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPES- 1972
S +++ + T S S ST++ T S TT ++ S TS P +
Sbjct: 18 SVISATAGNNVQCTGSGNSCSTSSCCTVPTITGCSWGTGTDATTCAITDCSCLTSGPATG 77
Query: 1973 ------ESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTT 2026
+S S+ + + + S+ + S T S + T+
Sbjct: 78 LTDLFCQSCKGSNQNVFANSAGTACVASSASCSNRRRGAWTDSDCTLCNPTTPAAVSGAC 137
Query: 2027 TISPESESTTTSSPASESTTTNNPKSESTTTNN------PASESITSSSPASESTTTSSP 2080
S T S + TT +PK+ + N+ +S S S+S + + T +
Sbjct: 138 QACSSITSGWTDSNCNACATTASPKNSNVFANSAGSACVASSASCGSTSRGTTAWTDADC 197
Query: 2081 ASESTTT---SSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTI 2127
+ TT S +S A S+T+ +S+ ++ AS ST
Sbjct: 198 LLCNPTTPYLVGDKSSCAASSCAACSSSTSGWTDSDCNACNTTASPSTKN 247
>gnl|CDD|240274 PTZ00112, PTZ00112, origin recognition complex 1 protein;
Provisional.
Length = 1164
Score = 33.4 bits (76), Expect = 1.9
Identities = 53/343 (15%), Positives = 106/343 (30%), Gaps = 44/343 (12%)
Query: 1892 SENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPE 1951
++N TT ++ N S S++ SS + + SS S + S+ S + +
Sbjct: 105 NDNVTTPIKANKKEKHNLDSSSSSSISSSLTNISFFSSPTSIYSCLSNSLSSKHSPKVIK 164
Query: 1952 SESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPV 2011
+T ++ S+ +SP ++ + + ++ T + SP + ST
Sbjct: 165 ENQSTHVNISSD----NSPRNKEISNKQLKKQTNVTHT-TCYDKMRRSPRNTST---IKN 216
Query: 2012 SESTTTSSPVSESTTTISPESESTTTSSPASE---------------STTTNNPKSESTT 2056
+ + E I + + + SE S T N K E
Sbjct: 217 NTNDKNKEKNKEKDKNIKKDRDGDKQTKRNSEKSKVQNSHFDVRILRSYTKENKKDEKNV 276
Query: 2057 TNNPASESITSSSPASESTTTSSPAS-----------ESTTTSSPASESTTTSSPASEST 2105
+ S + S+ S S + + S S
Sbjct: 277 VSGIRSSVLLKRK--SQCLRKDSYVYSNHQKKAKTGDPKNIIHRNNGSSNSNNDDTSSSN 334
Query: 2106 TTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSANEDPEEFPN-----EDVFEHTFAE 2160
S + SSP + TT + + + K + + ++F + + + +
Sbjct: 335 HLGSNRISNRNPSSPYKKQTTT-KHTNNTKNNKYNKTKTTQKFNHPLRHHATINKRSSML 393
Query: 2161 IPNIDHSNQTDEAIPE--TFDAREEWPQCKDVIGKVWDQGACQ 2201
+ E F E KD K+ ++ +CQ
Sbjct: 394 PMSEQKGRGASEKSEYIKEFTMEEVAKLTKDTTIKLVEENSCQ 436
Score = 33.0 bits (75), Expect = 2.3
Identities = 38/158 (24%), Positives = 67/158 (42%), Gaps = 29/158 (18%)
Query: 2084 STTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVSPHSEKLSANE 2143
TTTSS A + T + ++S T+ + E ST+ + + PH N
Sbjct: 670 QTTTSSKAKTHSKTKNDHNKSKTSKNKEPSSTSFLQDVKKKSD-------PH------NV 716
Query: 2144 DPEEFPNEDVFEHTFAEIPNIDHSNQTDEAI--------PETFDAREEWPQCKDVIG--- 2192
D + F +D + + NI ++ TD+AI P+ RE+ + K+V G
Sbjct: 717 DFKSFIKQDQENYYVNLLRNI--TDPTDKAIRMMQLDVVPKYLPCREK--EIKEVHGFLE 772
Query: 2193 -KVWDQGACQSCWVSHQPRTAGLKGLFSFIKYGQGQER 2229
+ G+ Q ++S P T ++S I+ Q + +
Sbjct: 773 SGIKQSGSNQILYISGMPGTGKTATVYSVIQLLQHKTK 810
Score = 33.0 bits (75), Expect = 2.4
Identities = 32/158 (20%), Positives = 57/158 (36%), Gaps = 30/158 (18%)
Query: 2048 NNPKSESTTTNN---PASESITSSS-----PASESTTTSSPASESTTTSSPASESTTTSS 2099
N P+ E N P I +++ +E + T +++ TT A++ +
Sbjct: 63 NTPRKEEKKKKNLNLPDYNQIQNNTHDFYIDLNERSKTPIKNNDNVTTPIKANKKEKHNL 122
Query: 2100 PASESTTTSSPESESTTTSSPASESTTIEEQGVSPHSEK-----------LSANEDPEEF 2148
+S S++ SS + + SSP S + + S HS K +S++ P
Sbjct: 123 DSSSSSSISSSLTNISFFSSPTSIYSCLSNSLSSKHSPKVIKENQSTHVNISSDNSPRN- 181
Query: 2149 PNEDVFEHTFAEIPNIDHSNQTDEAIPETFDAREEWPQ 2186
EI N QT+ +D P+
Sbjct: 182 ----------KEISNKQLKKQTNVTHTTCYDKMRRSPR 209
Score = 33.0 bits (75), Expect = 2.5
Identities = 25/142 (17%), Positives = 53/142 (37%), Gaps = 5/142 (3%)
Query: 1910 PESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSS 1969
+E + T +++ TT ++ + S S++ SS + + SS S + S+
Sbjct: 93 DLNERSKTPIKNNDNVTTPIKANKKEKHNLDSSSSSSISSSLTNISFFSSPTSIYSCLSN 152
Query: 1970 PESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTIS 2029
S + + S+ V+ S+ S E + + + T + S
Sbjct: 153 SLSSKHSPKVIKENQ---STHVNISSDNSPRNKEISNKQ--LKKQTNVTHTTCYDKMRRS 207
Query: 2030 PESESTTTSSPASESTTTNNPK 2051
P + ST ++ ++ N K
Sbjct: 208 PRNTSTIKNNTNDKNKEKNKEK 229
>gnl|CDD|236555 PRK09537, pylS, pyrolysyl-tRNA synthetase; Reviewed.
Length = 417
Score = 33.3 bits (76), Expect = 1.9
Identities = 13/88 (14%), Positives = 25/88 (28%)
Query: 1953 ESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVS 2012
+ T V + T + +P+ + S + P +T
Sbjct: 89 DKTQVKVKVVSAPTKKKKAMPKSVVRAPKPLENPVPAQAESSGSKPVPSIPVSTPEVKAP 148
Query: 2013 ESTTTSSPVSESTTTISPESESTTTSSP 2040
T S T +SP+ + + S
Sbjct: 149 APALTPSQKDRLETLLSPKDKISLNSEK 176
Score = 31.7 bits (72), Expect = 4.6
Identities = 15/87 (17%), Positives = 25/87 (28%)
Query: 2043 ESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPAS 2102
+ T + T A +P + A S + P+ +T A
Sbjct: 89 DKTQVKVKVVSAPTKKKKAMPKSVVRAPKPLENPVPAQAESSGSKPVPSIPVSTPEVKAP 148
Query: 2103 ESTTTSSPESESTTTSSPASESTTIEE 2129
T S + T SP + + E
Sbjct: 149 APALTPSQKDRLETLLSPKDKISLNSE 175
Score = 31.3 bits (71), Expect = 6.6
Identities = 14/88 (15%), Positives = 24/88 (27%)
Query: 2033 ESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPAS 2092
+ T + T + P + A S + P+ +T A
Sbjct: 89 DKTQVKVKVVSAPTKKKKAMPKSVVRAPKPLENPVPAQAESSGSKPVPSIPVSTPEVKAP 148
Query: 2093 ESTTTSSPASESTTTSSPESESTTTSSP 2120
T S T SP+ + + S
Sbjct: 149 APALTPSQKDRLETLLSPKDKISLNSEK 176
>gnl|CDD|233909 TIGR02520, pilus_B_mal_scr, type IVB pilus formation outer membrane
protein, R64 PilN family. Several related protein
families encode outer membrane pore proteins for type II
secretion, type III secretion, and type IV pilus
formation. This protein family appears to encode a
secretin for pilus formation, although it is quite
different from PilQ. Members include the PilN lipoprotein
of the plasmid R64 thin pilus, a type IV pilus. Scoring
between the trusted and noise cutoffs are examples of
bundle-forming pilus B (bfpB) [Cell envelope, Surface
structures, Protein fate, Protein and peptide secretion
and trafficking].
Length = 497
Score = 33.3 bits (76), Expect = 1.9
Identities = 45/222 (20%), Positives = 80/222 (36%), Gaps = 15/222 (6%)
Query: 1880 STVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSS 1939
++ V ST +S ++++ S +T + + ++ + + + S L S + S
Sbjct: 169 NSSVTSTSSSTAGSGSSSSGGSGNSGSTQSTAVKLESSVHNDIQQSIKSMLSSSGSWHLS 228
Query: 1940 PESES-TTTSSPES----ESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSES 1994
+ S T PE S S + + ++ SLV +S
Sbjct: 229 GSTGSLVVTDVPEVLDRVASYIDSQNRRLTRQVLLNVKVLSVQFKGSDQTGVDWSLVYKS 288
Query: 1995 TTTS--SPESESTTTISPVSEST---TTSSPVSESTTTISPESESTTTSSPASESTTTNN 2049
+ S + T+T + S P + +T I S S S S TT N
Sbjct: 289 LSRFGLSLANAGTSTAATAGSSAGINVVDGPFAGTTALIRALSTQGKVSVVTSPSVTTLN 348
Query: 2050 ----PKSESTTTNNPASESITSSSPASESTTTSSPASESTTT 2087
P +T T AS+S T+ + S+T P + +T
Sbjct: 349 LQPAPFQIATQTGYLASQS-TTVTANVGSSTDLEPGTITTGF 389
Score = 31.8 bits (72), Expect = 4.8
Identities = 25/97 (25%), Positives = 42/97 (43%), Gaps = 8/97 (8%)
Query: 1959 SLVSESTTTSSPESESTTTSSPESESTTTSSLV---SESTTTSSPESESTTTIS----PV 2011
SL + T+T++ S + + T++L+ S S S S TT++ P
Sbjct: 295 SLANAGTSTAATAGSSAGINVVDGPFAGTTALIRALSTQGKVSVVTSPSVTTLNLQPAPF 354
Query: 2012 SESTTTSSPVSESTTTISPESESTTTSSPASESTTTN 2048
+T T S+STT + S+T P + +T N
Sbjct: 355 QIATQTGYLASQSTTV-TANVGSSTDLEPGTITTGFN 390
>gnl|CDD|147777 pfam05808, Podoplanin, Podoplanin. This family consists of several
mammalian podoplanin like proteins which are thought to
control specifically the unique shape of podocytes.
Length = 162
Score = 32.2 bits (73), Expect = 1.9
Identities = 26/104 (25%), Positives = 45/104 (43%), Gaps = 11/104 (10%)
Query: 1912 SESTTTSSPESESTTTSSLVSESTTTSSPESESTTT-SSPESESTTTSSLVSESTTTSS- 1969
++ +T PE + T ++ T E TTT ++ E + + LV T +
Sbjct: 20 AQGASTVRPEDDVVTPG--TTDGMVTPGVEDYITTTGATEELNESGLAPLVPTGTENVTK 77
Query: 1970 ------PESESTTTSSPESESTTTSSLV-SESTTTSSPESESTT 2006
P +E T E +STTT ++V S S + E+++T
Sbjct: 78 DHLEDLPTAEGTDHDGEEHKSTTTVTVVTSHSQDKTGDETQTTD 121
>gnl|CDD|236733 PRK10672, PRK10672, rare lipoprotein A; Provisional.
Length = 361
Score = 32.7 bits (75), Expect = 2.2
Identities = 17/83 (20%), Positives = 33/83 (39%), Gaps = 1/83 (1%)
Query: 2034 STTTSSPASESTTTNNPKSESTTTNNPASESITSSSP-ASESTTTSSPASESTTTSSPAS 2092
T + PA P S ST + + + +SS TT + E + + A
Sbjct: 205 GTPSVQPAPAPQGDVLPVSNSTLKSEDPTGAPVTSSGFLGAPTTLAPGVLEGSEPTPTAP 264
Query: 2093 ESTTTSSPASESTTTSSPESEST 2115
S ++PA+ + ++ S ++
Sbjct: 265 SSAPATAPAAAAPQAAATSSSAS 287
>gnl|CDD|148051 pfam06213, CobT, Cobalamin biosynthesis protein CobT. This family
consists of several bacterial cobalamin biosynthesis
(CobT) proteins. CobT is involved in the transformation
of precorrin-3 into cobyrinic acid.
Length = 282
Score = 32.9 bits (75), Expect = 2.2
Identities = 15/76 (19%), Positives = 31/76 (40%)
Query: 2024 STTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASE 2083
S+ ++ E S+ + ++ ++PK + ES +S S + +S +S
Sbjct: 204 SSMDMAEELGDEPESADSEDNEDEDDPKEDEDDDQGEEEESGSSDSLSEDSDASSEEMES 263
Query: 2084 STTTSSPASESTTTSS 2099
++ AS T S
Sbjct: 264 GEMEAAEASADDTPDS 279
>gnl|CDD|218752 pfam05793, TFIIF_alpha, Transcription initiation factor IIF, alpha
subunit (TFIIF-alpha). Transcription initiation factor
IIF, alpha subunit (TFIIF-alpha) or RNA polymerase
II-associating protein 74 (RAP74) is the large subunit of
transcription factor IIF (TFIIF), which is essential for
accurate initiation and stimulates elongation by RNA
polymerase II.
Length = 528
Score = 33.0 bits (75), Expect = 2.3
Identities = 38/191 (19%), Positives = 61/191 (31%), Gaps = 22/191 (11%)
Query: 1952 SESTTTSSLVSESTTTSSPESES-----TTTSSPESESTTTSSLVSESTTTSSPESESTT 2006
S+S+ + + E SPE + S ESE S +
Sbjct: 288 SDSSASGNDPEEREDKLSPEIPAKPEIEQDEDSEESEEEKNEEEGGLS----KKGKKLKK 343
Query: 2007 TISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESIT 2066
+ S + + + E S + PK E +NP+S
Sbjct: 344 LKGKKNGLDKDDSDSGDDSDDSDIDGE---DSVSLVTAKKQKEPKKEEPVDSNPSSP--G 398
Query: 2067 SSSPASESTTTSSPAS-------ESTTTSSPASESTTTSSPASES-TTTSSPESESTTTS 2118
+S PA S + + S PA + T ++P S S +T S S ++S
Sbjct: 399 NSGPARPSPESKDKGKRKAANEVSKSPASVPAKKLKTENAPKSSSGKSTPQTFSGSKSSS 458
Query: 2119 SPASESTTIEE 2129
+ A T E
Sbjct: 459 NAADGGVTEEA 469
>gnl|CDD|223003 PHA03169, PHA03169, hypothetical protein; Provisional.
Length = 413
Score = 32.6 bits (74), Expect = 2.5
Identities = 44/219 (20%), Positives = 63/219 (28%), Gaps = 17/219 (7%)
Query: 1910 PESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSS 1969
P + TTS P+ + + T + E + + S S
Sbjct: 46 PAPPAPTTSGPQVRAVAEQGHRQTESDTETAEESRHGEKEERGQGGPSGS--------GS 97
Query: 1970 PESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTIS 2029
S T S S S L E+T+ SSPES P S S S P +
Sbjct: 98 ESVGSPTPSPSGSAEELASGLSPENTSGSSPES-------PASHSPPPSPPSHPGPHEPA 150
Query: 2030 PESESTTTSSPASESTTTNNPKSESTTTNNPASESITSS--SPASESTTTSSPASESTTT 2087
P + + S + + P SE S P SE+ T+S P
Sbjct: 151 PPESHNPSPNQQPSSFLQPSHEDSPEEPEPPTSEPEPDSPGPPQSETPTSSPPPQSPPDE 210
Query: 2088 SSPASESTTTSSPASESTTTSSPESESTTTSSPASESTT 2126
T +P+ + E E T
Sbjct: 211 PGEPQSPTPQQAPSPNTQQAVEHEDEPTEPEREGPPFPG 249
Score = 31.5 bits (71), Expect = 6.3
Identities = 43/217 (19%), Positives = 72/217 (33%), Gaps = 17/217 (7%)
Query: 1937 TSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTT 1996
+ P + TTS P+ + V+E + T S E E
Sbjct: 43 AAKPAPPAPTTSGPQVRA------VAEQGHRQTESDTETAEESRHGEK--------EERG 88
Query: 1997 TSSPESESTTTISPVSESTTTSSPVSESTTTISPESES-TTTSSPASESTTTNNPKSEST 2055
P + ++ + S + S+ S +SPE+ S ++ SPAS S + P
Sbjct: 89 QGGPSGSGSESVGSPTPSPSGSAEELASG--LSPENTSGSSPESPASHSPPPSPPSHPGP 146
Query: 2056 TTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESEST 2115
P S + S S P SE S +S T +S +
Sbjct: 147 HEPAPPESHNPSPNQQPSSFLQPSHEDSPEEPEPPTSEPEPDSPGPPQSETPTSSPPPQS 206
Query: 2116 TTSSPASESTTIEEQGVSPHSEKLSANEDPEEFPNED 2152
P + +Q SP++++ +ED P +
Sbjct: 207 PPDEPGEPQSPTPQQAPSPNTQQAVEHEDEPTEPERE 243
>gnl|CDD|213230 cd03263, ABC_subfamily_A, ATP-binding cassette domain of the lipid
transporters, subfamily A. The ABCA subfamily mediates
the transport of a variety of lipid compounds. Mutations
of members of ABCA subfamily are associated with human
genetic diseases, such as, familial high-density
lipoprotein (HDL) deficiency, neonatal surfactant
deficiency, degenerative retinopathies, and congenital
keratinization disorders. The ABCA1 protein is involved
in disorders of cholesterol transport and high-density
lipoprotein (HDL) biosynthesis. The ABCA4 (ABCR) protein
transports vitamin A derivatives in the outer segments of
photoreceptor cells, and therefore, performs a crucial
step in the visual cycle. The ABCA genes are not present
in yeast. However, evolutionary studies of ABCA genes
indicate that they arose as transporters that
subsequently duplicated and that certain sets of ABCA
genes were lost in different eukaryotic lineages.
Length = 220
Score = 32.1 bits (74), Expect = 2.6
Identities = 12/25 (48%), Positives = 16/25 (64%)
Query: 2241 ASVMSDRICIQSKGQVKPILSPQHL 2265
A + DRI I S G+++ I SPQ L
Sbjct: 195 AEALCDRIAIMSDGKLRCIGSPQEL 219
>gnl|CDD|218307 pfam04880, NUDE_C, NUDE protein, C-terminal conserved region. This
family represents the C-terminal conserved region of the
NUDE proteins. NUDE proteins are involved in nuclear
migration.
Length = 166
Score = 31.8 bits (71), Expect = 2.7
Identities = 25/114 (21%), Positives = 43/114 (37%), Gaps = 3/114 (2%)
Query: 1891 LSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSES---TTTSSPESESTTT 1947
L + + + P SSP + T +S S T SSP + T
Sbjct: 43 LKQELIVQERLRNNNRKSRPAPVVNLGSSPSTPHTNSSMNSPRSPPNGTVSSPLTPPTKL 102
Query: 1948 SSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPE 2001
S + +T T S T+SS S + + P +++ + S + S + P+
Sbjct: 103 SLTLASATATDPAPPMSETSSSVNSLTAASGFPLQKASASESFGTRSLYGNRPQ 156
>gnl|CDD|173701 cd05610, STKc_MASTL, Catalytic domain of the Protein Serine/Threonine
Kinase, Microtubule-associated serine/threonine-like
kinase. Serine/Threonine Kinases (STKs),
Microtubule-associated serine/threonine (MAST) kinase
subfamily, MAST-like (MASTL) kinases, catalytic (c)
domain. STKs catalyze the transfer of the
gamma-phosphoryl group from ATP to serine/threonine
residues on protein substrates. The MAST kinase subfamily
is part of a larger superfamily that includes the
catalytic domains of other protein STKs, protein tyrosine
kinases, RIO kinases, aminoglycoside phosphotransferase,
choline kinase, and phosphoinositide 3-kinase. MAST
kinases contain an N-terminal domain of unknown function,
a central catalytic domain, and a C-terminal PDZ domain
that mediates protein-protein interactions. The MASTL
kinases in this group carry only a catalytic domain,
which contains a long insertion relative to MAST kinases.
The human MASTL gene has also been labelled FLJ14813. A
missense mutation in FLJ14813 is associated with
autosomal dominant thrombocytopenia. To date, the
function of MASTL is unknown.
Length = 669
Score = 32.9 bits (75), Expect = 2.8
Identities = 30/141 (21%), Positives = 46/141 (32%), Gaps = 7/141 (4%)
Query: 1979 SPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTS 2038
+P E S ++ TSS T T P+ S P+S + E+ ++ S
Sbjct: 195 TPVGEKDQGSVNSGQNNGTSS---VRTGTSHPLLMINKESLPMSLKLSKSCLETSESSPS 251
Query: 2039 SPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTS 2098
P T P + AS S T S + ++ S S + S + S
Sbjct: 252 LPVRSLT----PNLLKSRKRPEASTSSTHSCMTNSLSSCESECCSSNLKLLEQASSPSQS 307
Query: 2099 SPASESTTTSSPESESTTTSS 2119
S E E + S
Sbjct: 308 PRWSVDEGNIISEGEKSEKGS 328
Score = 31.4 bits (71), Expect = 7.5
Identities = 27/113 (23%), Positives = 42/113 (37%), Gaps = 4/113 (3%)
Query: 2009 SPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSS 2068
+PV E S ++ T S T TS P + P S + + + + S
Sbjct: 195 TPVGEKDQGSVNSGQNNGT---SSVRTGTSHPLLMINKESLPMSLKLSKSCLETSESSPS 251
Query: 2069 SPASESTTTSSPASESTTTSSPASESTTTSSPAS-ESTTTSSPESESTTTSSP 2120
P T + + S+ ++ S T+S +S ES SS SSP
Sbjct: 252 LPVRSLTPNLLKSRKRPEASTSSTHSCMTNSLSSCESECCSSNLKLLEQASSP 304
>gnl|CDD|203570 pfam07058, Myosin_HC-like, Myosin II heavy chain-like. This family
represents a conserved region within a number of myosin
II heavy chain-like proteins that seem to be specific to
Arabidopsis thaliana.
Length = 351
Score = 32.7 bits (74), Expect = 2.8
Identities = 23/126 (18%), Positives = 43/126 (34%), Gaps = 7/126 (5%)
Query: 2034 STTTSSPASES-TTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPAS 2092
+++ P + + +N P + S TS+ S+ SS S T
Sbjct: 163 NSSFVRPTTVGRSESNGPTRRQSLGGAETSPKFTSNGGLSKKRP-SSQLRGSLTGRISTV 221
Query: 2093 ESTTTSSPASESTTTSSPESESTTTSSPAS-----ESTTIEEQGVSPHSEKLSANEDPEE 2147
+ S T S + + P++ + +G SP SE+ + ED
Sbjct: 222 LKHAKGTSISFDGGTRSMDRSKILANGPSNFPLNDKHEEGTSRGESPDSERKTEEEDGNA 281
Query: 2148 FPNEDV 2153
+ + V
Sbjct: 282 YSEDSV 287
>gnl|CDD|221931 pfam13136, DUF3984, Protein of unknown function (DUF3984). This
family of proteins is functionally uncharacterized. This
family of proteins is found in eukaryotes. Proteins in
this family are typically between 393 and 442 amino acids
in length.
Length = 301
Score = 32.4 bits (74), Expect = 2.8
Identities = 31/135 (22%), Positives = 59/135 (43%), Gaps = 13/135 (9%)
Query: 2006 TTISPVSESTTTS----SPVSESTTTISPESESTTTS--SPASESTTTNNPKSESTTTNN 2059
T+ P+ + +P +T+ +S +S TT S + + + K + ++ +
Sbjct: 18 TSRFPLDDDDEERDYSYAPHPPTTSYLSSKSVPTTPGILSHSRSPSRSRLHKRKKSSRRS 77
Query: 2060 PASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSP-ASESTTTSSPESESTTTS 2118
P S+++ +S +++ +T S S+S TTS S S +SE +
Sbjct: 78 PMSDTLL------KSKSSAHLLHHQSTRSHRRSKSGTTSPRKPSSSAHRRRNDSEWLLRA 131
Query: 2119 SPASESTTIEEQGVS 2133
A S+T EE+G S
Sbjct: 132 GAALASSTREEKGQS 146
>gnl|CDD|140307 PTZ00284, PTZ00284, protein kinase; Provisional.
Length = 467
Score = 32.6 bits (74), Expect = 2.8
Identities = 26/115 (22%), Positives = 44/115 (38%), Gaps = 3/115 (2%)
Query: 2055 TTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESES 2114
T+ P + +S A+ S +T S ST ++ S S A+ + + + E
Sbjct: 18 YTSGAPVNALSGNSPKANNSASTGQTTSRSTNSAR-RSGSKRDRETATSTDSGRTKSHEG 76
Query: 2115 TTTSSPASESTTIEEQGVSPHSEKLSANEDPEEFPNEDVFEHTFAEIPNIDHSNQ 2169
T+ A+ + T + P +K P + E F E +ID S Q
Sbjct: 77 AATTKQATTTPTTNVEVAPPPKKKKVTYALPNQSREEGHFYVVLGE--DIDVSTQ 129
>gnl|CDD|222890 PHA02584, 34, long tail fiber, proximal subunit; Provisional.
Length = 1229
Score = 32.8 bits (75), Expect = 2.9
Identities = 26/199 (13%), Positives = 71/199 (35%), Gaps = 12/199 (6%)
Query: 1872 FTTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLV 1931
FT N N + +V S+ + T ++ +++T+ T+ + S++ TT ++V
Sbjct: 913 FTKNTNLSAPLVSSSTATFGGSVTANSTLTTQNTSNGTVVVVDETSIAFYSQNNTTGNIV 972
Query: 1932 SESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLV 1991
T + + T ++ V+ + E ++ + + T +
Sbjct: 973 FNIDGT-------VDPINVNANGTLNATGVATNGRAVYAEGGGIARTNNAARAITGGFTI 1025
Query: 1992 SESTTTSSPESESTTTISPVSESTTTSSPVSEST-TTISPESESTTTSSPASESTTTNNP 2050
+T+ + + + + + TI+ + S T N+
Sbjct: 1026 RNDGSTTVFLLTAAGDQTGGFNGLKSLIINNANGQVTINDNYIINAGGTIMSGGLTVNSR 1085
Query: 2051 ----KSESTTTNNPASESI 2065
++++ T P ++++
Sbjct: 1086 IRSQGTKASYTRAPTADTV 1104
>gnl|CDD|183064 PRK11267, PRK11267, biopolymer transport protein ExbD; Provisional.
Length = 141
Score = 31.2 bits (71), Expect = 2.9
Identities = 8/34 (23%), Positives = 20/34 (58%)
Query: 630 QFHPKEPIIMSASSDKTIYLGESPLHCDKAGSIL 663
Q P++P+ +S +D ++++G P+ + + L
Sbjct: 57 QPRPEKPVYLSVKADNSMFIGNDPVTDETMITAL 90
>gnl|CDD|221583 pfam12449, DUF3684, Protein of unknown function (DUF3684). This
domain family is found in eukaryotes, and is typically
between 1072 and 1090 amino acids in length.
Length = 1084
Score = 32.7 bits (75), Expect = 2.9
Identities = 15/94 (15%), Positives = 28/94 (29%), Gaps = 13/94 (13%)
Query: 1874 TNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSE 1933
T+ + +T ++ S E + + + S S
Sbjct: 49 TSVTRTVAQIDATWMKVVEWKPPAGSARREGQRVPDT-------------TGSLRSFFSR 95
Query: 1934 STTTSSPESESTTTSSPESESTTTSSLVSESTTT 1967
T +SSP T + E+ L ST++
Sbjct: 96 LTGSSSPPKPKTPEPAKVEENLDAEDLTEISTSS 129
Score = 31.9 bits (73), Expect = 5.7
Identities = 14/69 (20%), Positives = 22/69 (31%), Gaps = 3/69 (4%)
Query: 1929 SLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTS 1988
+V S E + + S S T +SSP T + E+
Sbjct: 64 KVVEWKPPAGSARREGQRVPDT---TGSLRSFFSRLTGSSSPPKPKTPEPAKVEENLDAE 120
Query: 1989 SLVSESTTT 1997
L ST++
Sbjct: 121 DLTEISTSS 129
Score = 31.1 bits (71), Expect = 9.1
Identities = 25/112 (22%), Positives = 44/112 (39%), Gaps = 13/112 (11%)
Query: 1884 MSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSL--VSESTTTS--- 1938
+L S S T ++SP T E+ ST++ L + + TS
Sbjct: 86 TGSLRSFFSRLTGSSSPPKPKTPEPAKVEENLDAEDLTEISTSSVFLHIFTANIQTSVSQ 145
Query: 1939 --SPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTS 1988
+ E E T P TT +++ T+S E +++ S +S ++T
Sbjct: 146 SFAAELERATKKPP--PKTTKLAIL----TSSYDEYDASKASDSKSSASTGD 191
>gnl|CDD|217443 pfam03234, CDC37_N, Cdc37 N terminal kinase binding. Cdc37 is a
molecular chaperone required for the activity of
numerous eukaryotic protein kinases. This domain
corresponds to the N terminal domain which binds
predominantly to protein kinases and is found N terminal
to the Hsp (Heat shocked protein) 90-binding domain
pfam08565. Expression of a construct consisting of only
the N-terminal domain of Saccharomyces pombe Cdc37
results in cellular viability. This indicates that
interactions with the cochaperone Hsp90 may not be
essential for Cdc37 function.
Length = 172
Score = 31.7 bits (72), Expect = 3.1
Identities = 21/91 (23%), Positives = 36/91 (39%), Gaps = 10/91 (10%)
Query: 705 RCRKRLRKLKKKEKYESPLHCDKAGSILRSGKGRVHTMVNDKHRQILCCHGNDNVVDLFY 764
R K L +LK++ S ++++S N + Q N+ V DLF
Sbjct: 64 RVDKLLSELKEESLDSSQ-------AVMKSLNENFTDKENVEPEQPTY---NEMVEDLFD 113
Query: 765 FCTKDESSTRCRKRLRKLKKKEKKLQEEQME 795
+ + +L+K KL++EQ E
Sbjct: 114 QVKDEVDEKNGAALIEELQKHRDKLKKEQKE 144
>gnl|CDD|220402 pfam09787, Golgin_A5, Golgin subfamily A member 5. Members of this
family of proteins are involved in maintaining Golgi
structure. They stimulate the formation of Golgi stacks
and ribbons, and are involved in intra-Golgi retrograde
transport. Two main interactions have been characterized:
one with RAB1A that has been activated by GTP-binding and
another with isoform CASP of CUTL1.
Length = 509
Score = 32.5 bits (74), Expect = 3.1
Identities = 25/97 (25%), Positives = 40/97 (41%), Gaps = 6/97 (6%)
Query: 2019 SPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTS 2078
+E S+TTSSP S + + ++ S +N A +P + S
Sbjct: 18 RKATEEDDDEDLLEVSSTTSSPVG-SISWSVRETAS---SNKARSRSEKWNPDQPGSRVS 73
Query: 2079 SPASESTTTSSPASESTTTSSPASESTTTSSPESEST 2115
SP+S+ TS S S+ AS ++ SS + E
Sbjct: 74 SPSSKKDGTS--RSLSSQVDDLASAVSSQSSSDLEDE 108
>gnl|CDD|221480 pfam12238, MSA-2c, Merozoite surface antigen 2c. This family of
proteins is found in eukaryotes. Proteins in this family
are typically between 263 and 318 amino acids in length.
There is a conserved SFT sequence motif. MSA-2 is a
plasma membrane glycoprotein which can be found in
Babesia bovis species.
Length = 201
Score = 31.7 bits (72), Expect = 3.3
Identities = 12/60 (20%), Positives = 26/60 (43%), Gaps = 6/60 (10%)
Query: 2061 ASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSP-ESESTTTSS 2119
+E + +S+ T+T+ P+ S T + ++ +P + + P E+ SS
Sbjct: 138 PAEYYSPKHSSSQGTSTTRPSDGSATPN-----TSAPPTPGNPAAQPEKPAETPKGNGSS 192
>gnl|CDD|236504 PRK09418, PRK09418, bifunctional 2',3'-cyclic nucleotide
2'-phosphodiesterase/3'-nucleotidase precursor protein;
Reviewed.
Length = 780
Score = 32.8 bits (74), Expect = 3.3
Identities = 36/145 (24%), Positives = 56/145 (38%), Gaps = 13/145 (8%)
Query: 1994 STTTSSPESESTTT----ISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNN 2049
+T SSP ++ IS V S + + T + + + T +P + T +
Sbjct: 617 TTFDSSPNAQKYIKKDGNISYVGPSENEFAKYAIDITKKNDDDKETGGENPTTPPTGEGD 676
Query: 2050 PKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPA-SESTTTS 2108
TT P E +P T+ P E +P + ST + A S TTT
Sbjct: 677 NGENPTTP--PTGEGNNGENP------TTPPTGEGNNGGNPTTPSTDEGNNAGSGQTTTD 728
Query: 2109 SPESESTTTSSPASESTTIEEQGVS 2133
+ S+ TTT S E + + G S
Sbjct: 729 NQNSKETTTVSENKEERDLPKTGTS 753
Score = 31.2 bits (70), Expect = 7.8
Identities = 23/96 (23%), Positives = 35/96 (36%), Gaps = 4/96 (4%)
Query: 2003 ESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPAS 2062
T P E +P + T + TT P E NP + ST N A
Sbjct: 664 GENPTTPPTGEGDNGENPTTPPTGEGNNGENPTT--PPTGEGNNGGNPTTPSTDEGNNAG 721
Query: 2063 --ESITSSSPASESTTTSSPASESTTTSSPASESTT 2096
++ T + + E+TT S E + S ++T
Sbjct: 722 SGQTTTDNQNSKETTTVSENKEERDLPKTGTSVAST 757
>gnl|CDD|144451 pfam00859, CTF_NFI, CTF/NF-I family transcription modulation region.
Length = 295
Score = 32.0 bits (72), Expect = 3.3
Identities = 54/274 (19%), Positives = 92/274 (33%), Gaps = 31/274 (11%)
Query: 1855 AATAVAISVIDNYSEIIFTTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESES 1914
A T S+ D S + N + + + +S S+ T S E E T+ E
Sbjct: 25 AGTGPNFSLADLSSSSYYDLNPGAGLRRSLPSTSSSSSKRPKTVSMEEEMDTSPGGEDFY 84
Query: 1915 TTTSSPESESTTTSSLVS--ESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPES 1972
T+ SSP S S + S T P+ ++ SP+ S S+ + S
Sbjct: 85 TSPSSPSSSSANWHEVEGGMSSPTMKKPDKSLFSSPSPQDSSPRLSAFTQHHRPVITGHS 144
Query: 1973 ESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPE- 2031
+ + P T SP T+ I P S+ + P+
Sbjct: 145 GISASPHP----------------TPSPLHFPTSPILPQQPSSYFPHTAIRYPPHLHPQD 188
Query: 2032 ---SESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTS 2088
P+S+ N + N+ + + P P +
Sbjct: 189 PLKEFVQLVCDPSSQQAGQPNGSGQGKVPNHFLPTPMLAPPP-------PPPMARPVPLP 241
Query: 2089 SPASESTTTSSPASESTTTSSPESESTTTSSPAS 2122
P ++ TTS+ ++ TS + ST ++SPA+
Sbjct: 242 MPDTKPPTTSTEGGATSPTSP--TYSTPSTSPAN 273
>gnl|CDD|225249 COG2374, COG2374, Predicted extracellular nuclease [General function
prediction only].
Length = 798
Score = 32.5 bits (74), Expect = 3.3
Identities = 14/111 (12%), Positives = 33/111 (29%), Gaps = 5/111 (4%)
Query: 2027 TISPESESTTTSSPASESTTT-NNPKSESTTTNNPASESITSSSPASESTTTSSPASEST 2085
+ + T S + T +++ T ++ S + + + + +
Sbjct: 112 YVLLNKDGGYTDSLGVQGGTPLTRWNTDAQQTLTTSAVKEDSFDGSVKESVNFEETATPS 171
Query: 2086 TTSSPA----SESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGV 2132
T + E +TT TS + + S + +GV
Sbjct: 172 TYPGLSHVNIGELSTTQYGNEALVLTSIGQIQGEGHRSGPLGGGVVTIEGV 222
>gnl|CDD|234520 TIGR04246, nitrous_NosZ_Gp, nitrous-oxide reductase, Sec-dependent.
This model represents the nitrous-oxide reductase
protein NosZ as characterized in Geobacillus
thermodenitrificans. In contrast to the related form in
Pseudomonas stutzeri, this version lacks a recognizable
twin-arginine translocation (TAT) signal at the
N-terminus. Consequently, its accessory protein may
differ. Some members of this family have an additional
cytochrome c-like domain at the C-terminus.
Length = 578
Score = 32.3 bits (74), Expect = 3.3
Identities = 15/45 (33%), Positives = 25/45 (55%), Gaps = 5/45 (11%)
Query: 509 IDNDIKMWDLRTNSVVQKLR-----GHSDTVTGLSLSPDGSYILS 548
+D+++ W+L T VV K+ GH G ++ PDG Y++S
Sbjct: 356 VDSEVVKWNLDTWEVVDKVPVHYSVGHLMAPEGDTVKPDGKYLVS 400
>gnl|CDD|236090 PRK07764, PRK07764, DNA polymerase III subunits gamma and tau;
Validated.
Length = 824
Score = 32.7 bits (75), Expect = 3.4
Identities = 13/134 (9%), Positives = 41/134 (30%), Gaps = 5/134 (3%)
Query: 1996 TTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSEST 2055
+ + + ++P + + + + + + + + +
Sbjct: 386 GVAGGAGAPAAAAPSAAAAAPAAAPAPAAAAPAAAAAPAPAAAPQPAPAPAP--APAPPS 443
Query: 2056 TTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESEST 2115
N + S PA+ + +PA + + A ++PA + +
Sbjct: 444 PAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPA---AAPAAPAAP 500
Query: 2116 TTSSPASESTTIEE 2129
+ A ++ T+ E
Sbjct: 501 AAPAGADDAATLRE 514
>gnl|CDD|215448 PLN02834, PLN02834, 3-dehydroquinate synthase.
Length = 433
Score = 32.4 bits (74), Expect = 3.4
Identities = 14/68 (20%), Positives = 24/68 (35%), Gaps = 1/68 (1%)
Query: 2051 KSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTS-SPASESTTTSS 2109
KS S + + ++ S SP+ +S S S A++S T +
Sbjct: 2 KSSSADNSESNTPTVLSRSPSDAFFDQNSSIESSKEGDLTEVIHEKCPVSGANKSEVTKT 61
Query: 2110 PESESTTT 2117
+ TT
Sbjct: 62 ASATVTTV 69
>gnl|CDD|237624 PRK14143, PRK14143, heat shock protein GrpE; Provisional.
Length = 238
Score = 32.0 bits (73), Expect = 3.6
Identities = 17/81 (20%), Positives = 32/81 (39%), Gaps = 3/81 (3%)
Query: 1918 SSPESESTTT-SSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTT 1976
S+PE + +++SES + S + E+E T + SSP+S S
Sbjct: 1 STPEQDPLEVKLAVISESEAEDNSPESSEEVTEQEAELTNPE--GDAAEAESSPDSGSAA 58
Query: 1977 TSSPESESTTTSSLVSESTTT 1997
+ + + + L E +
Sbjct: 59 SETAADNAARLAQLEQELESL 79
>gnl|CDD|183582 PRK12543, PRK12543, RNA polymerase sigma factor; Provisional.
Length = 179
Score = 31.2 bits (71), Expect = 3.7
Identities = 20/106 (18%), Positives = 43/106 (40%), Gaps = 24/106 (22%)
Query: 708 KRLRKLKKKEKYESPLHCDKAGSILRSGKGR-----VHTMVNDKHRQILCCHGNDNVVDL 762
+R R +K E+ P+ D + +L + +H + K RQ++ ++
Sbjct: 79 RRFRIFEKAEEQRKPVSIDFSEDVLSKESNQELIELIHKL-PYKLRQVI-------ILRY 130
Query: 763 FYFCTKDESST-----------RCRKRLRKLKKKEKKLQEEQMEVV 797
+ +++E + R L+KL++KE+ + EV
Sbjct: 131 LHDYSQEEIAQLLQIPIGTVKSRIHAALKKLRQKEQIEEIFLGEVG 176
>gnl|CDD|233367 TIGR01349, PDHac_trf_mito, pyruvate dehydrogenase complex
dihydrolipoamide acetyltransferase, long form. This
model represents one of several closely related clades of
the dihydrolipoamide acetyltransferase subunit of the
pyruvate dehydrogenase complex. It includes sequences
from mitochondria and from alpha and beta branches of the
proteobacteria, as well as from some other bacteria.
Sequences from Gram-positive bacteria are not included.
The non-enzymatic homolog protein X, which serves as an
E3 component binding protein, falls within the clade
phylogenetically but is rejected by its low score [Energy
metabolism, Pyruvate dehydrogenase].
Length = 436
Score = 32.1 bits (73), Expect = 3.8
Identities = 16/82 (19%), Positives = 33/82 (40%), Gaps = 2/82 (2%)
Query: 2065 ITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASES 2124
+ + + ES+ + +P ++P S + +P+ +S SSPA S
Sbjct: 73 VLVEEKEDVADAFKNYKLESSASPAPKPSEIAPTAPPSAPKPSPAPQKQSPEPSSPAPLS 132
Query: 2125 TTIEEQGV--SPHSEKLSANED 2144
+ SP ++KL+ +
Sbjct: 133 DKESGDRIFASPLAKKLAKEKG 154
Score = 32.1 bits (73), Expect = 4.7
Identities = 23/92 (25%), Positives = 37/92 (40%), Gaps = 15/92 (16%)
Query: 2067 SSSPASESTTT--SSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASES 2124
S+SPA + + ++P S + +P +S SSPA S ES +SP ++
Sbjct: 93 SASPAPKPSEIAPTAPPSAPKPSPAPQKQSPEPSSPAP----LSDKESGDRIFASPLAKK 148
Query: 2125 TTIEE-------QGVSPHSEKLSANEDPEEFP 2149
E+ G P+ + D E F
Sbjct: 149 LAKEKGIDLSAVAGSGPNGRIVKK--DIESFV 178
Score = 31.3 bits (71), Expect = 7.3
Identities = 15/59 (25%), Positives = 23/59 (38%)
Query: 2036 TTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASES 2094
+ N K ES+ + P I ++P S + +P +S SSPA S
Sbjct: 74 LVEEKEDVADAFKNYKLESSASPAPKPSEIAPTAPPSAPKPSPAPQKQSPEPSSPAPLS 132
>gnl|CDD|216784 pfam01917, Arch_flagellin, Archaebacterial flagellin. Members of
this family are the proteins that form the flagella in
archaebacteria.
Length = 151
Score = 31.0 bits (71), Expect = 3.8
Identities = 25/107 (23%), Positives = 39/107 (36%), Gaps = 10/107 (9%)
Query: 1849 LLISMLAATAVAISVIDNYSEIIFTTNNNSESTVVMSTLNSLLSEN-TTTNSPESESTTT 1907
+ I+M+ AVA V+ N S + ST L LS + ST+T
Sbjct: 10 VFIAMVLVAAVAAGVLINTS---GFLQQKASSTG--EELTEQLSTDLEIIGVVGDSSTST 64
Query: 1908 NNPESE---STTTSSPESESTTTSSLVSESTT-TSSPESESTTTSSP 1950
+ S+P S T +++ + + STT S P
Sbjct: 65 TIDKLTIYVKNAGSTPIDLSQTKITVLYDGGIVVINDTDYSTTVSDP 111
>gnl|CDD|227931 COG5644, COG5644, Uncharacterized conserved protein [Function
unknown].
Length = 869
Score = 32.4 bits (73), Expect = 4.0
Identities = 57/297 (19%), Positives = 108/297 (36%), Gaps = 27/297 (9%)
Query: 1876 NNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSEST 1935
N S+S +L + + + +S ++ E+E + +S E+E +L+
Sbjct: 76 NASKSGKSNKDHKNLNNTKEISLNDSDDSVNSDKLENEGSVSSIDENELVDLDTLLDNDQ 135
Query: 1936 TT----------SSPES--ESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESE 1983
+ E+ ES +SS +SES + S ++ S + E++ +
Sbjct: 136 PEKNESGNNDHATDKENLLESDASSSNDSESEESDSESEIESSDSDHDDENSDSKLDNLR 195
Query: 1984 STTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASE 2043
+ S E+ S S+ +I + ++ S S+ TI S P +
Sbjct: 196 NYIVSLKKDEADAESVLSSDDNDSIEEIKYDPHETNKESGSSETID--ITDLLDSIPMEQ 253
Query: 2044 STTTNNP-KSESTTTNNPASESITS---SSPASESTTTSSPASESTTTSSPASE------ 2093
+ P SES+ + P ++SI A E T + + S+
Sbjct: 254 LKVSLKPLVSESSKLDAPLAKSIQDRLERQAAYEQTKNDLEKWKPIVADNRKSDQLIFPM 313
Query: 2094 -STTTSSPASESTTTS-SPESESTTTSSPASESTTIEEQGVSPHSEKLSANE-DPEE 2147
T P++ +S P +ES A +E + E+L+ N+ EE
Sbjct: 314 NETARPVPSNNGLASSFEPRTESERKMHQALLDAGLENESALKKQEELALNKLSVEE 370
>gnl|CDD|216194 pfam00922, Phosphoprotein, Vesiculovirus phosphoprotein.
Length = 283
Score = 31.8 bits (72), Expect = 4.0
Identities = 16/96 (16%), Positives = 30/96 (31%), Gaps = 9/96 (9%)
Query: 2100 PASESTTTSSPESESTTTSSPASESTTIEEQGVSPH---SEKLSANEDPEEFPNEDVFEH 2156
P + T + E E ++ E+ SP +E+LS +E ++
Sbjct: 11 PRLDQTLSEIEEMEEQRADKSSTFQEDSVEEHTSPSYYLAEELSDSETEPSIEDDQGLYT 70
Query: 2157 TFAEIPNIDHSNQT------DEAIPETFDAREEWPQ 2186
++ Q D+ I F+ W
Sbjct: 71 QLPPAEQVEGFIQGPLDDIADDDIDVVFEEDRPWKP 106
>gnl|CDD|200219 TIGR02927, SucB_Actino, 2-oxoglutarate dehydrogenase, E2 component,
dihydrolipoamide succinyltransferase. This model
represents an Actinobacterial clade of E2 enzyme, a
component of the 2-oxoglutarate dehydrogenase complex
involved in the TCA cycle. These proteins have multiple
domains including the catalytic domain (pfam00198), one
or two biotin domains (pfam00364) and an E3-component
binding domain (pfam02817).
Length = 579
Score = 32.3 bits (73), Expect = 4.0
Identities = 35/192 (18%), Positives = 60/192 (31%), Gaps = 13/192 (6%)
Query: 1932 SESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPE-----SESTT 1986
SE + + +P T + + + + E+T PE +E T
Sbjct: 84 SEPAPAAPEPEAAPEPEAPAPAPTPAAEAPAPAAPQAGGSGEATEVKMPELGESVTEGTV 143
Query: 1987 TSSL--VSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASES 2044
TS L V ++ P E +T T SPV+ + I + T
Sbjct: 144 TSWLKAVGDTVEVDEPLLEVSTD----KVDTEIPSPVAGTLLEIRAPEDDTVEVGTVLAI 199
Query: 2045 TTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASES 2104
N + S S PA + + + +PA T++PA+ +
Sbjct: 200 IGDANAAPAEPAEEEAPAPSEAGSEPAPD--PAARAPHAAPDPPAPAPAPAKTAAPAAAA 257
Query: 2105 TTTSSPESESTT 2116
+S T
Sbjct: 258 PVSSGDSGPYVT 269
>gnl|CDD|227358 COG5025, COG5025, Transcription factor of the Forkhead/HNF3 family
[Transcription].
Length = 610
Score = 32.1 bits (73), Expect = 4.2
Identities = 31/225 (13%), Positives = 69/225 (30%), Gaps = 9/225 (4%)
Query: 1910 PESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSS-LVSESTTTS 1968
+ +S + S + S P S +
Sbjct: 375 RHKPTAWQNSIRHNLSLNKSFEKVPRSASQPGKGCFWKIDYSYIYEKESKRNPRSPKKSP 434
Query: 1969 SPESESTTTS---SPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSEST 2025
S S S + +S TS + S S+ +S +T I + + ++E
Sbjct: 435 SAHSVHQKLSLHVNDLYQSPATSDIASSSSQVNSQPEFISTQIHSSKGVS--NVDLTEQD 492
Query: 2026 TTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASEST 2085
+ + S + T TT++ +S + ++P + S + + ++
Sbjct: 493 SQKEASKGNFLDDSGSLSPNTNEINSFSLNTTDSQQKQSPSHNAPTNNSLNEMASKNSNS 552
Query: 2086 TTSSPASESTTTSSPASESTTTSSPESESTT---TSSPASESTTI 2127
T + S + A + + + T + A+ES ++
Sbjct: 553 QTQASNSNENVAAVKAILDASAQMEKPYDLSQAATPTKATESASV 597
>gnl|CDD|225372 COG2815, COG2815, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 303
Score = 32.0 bits (73), Expect = 4.2
Identities = 16/116 (13%), Positives = 29/116 (25%), Gaps = 2/116 (1%)
Query: 1963 ESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVS 2022
E ++ PE E + S P + S + + + + + + V
Sbjct: 190 EYVSSDRPEGEVISQSPPAGTTVNVGSKIEIVVSKGAFVAPDLSGMFTVEAEPHPREEGD 249
Query: 2023 ESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTS 2078
S I + T S S P T + + S
Sbjct: 250 TSQEVIRDKDADVTASGTDSSVNIQPPP--GGTIVLKGSEITSGIYQVVVNDKVIS 303
>gnl|CDD|152115 pfam11679, DUF3275, Protein of unknown function (DUF3275). This
family of proteins with unknown function appear to be
restricted to Proteobacteria.
Length = 211
Score = 31.4 bits (71), Expect = 4.3
Identities = 16/71 (22%), Positives = 25/71 (35%), Gaps = 3/71 (4%)
Query: 2059 NPASESITSSSPASESTTTSSPASESTTTSSPASE-STTTSSPASESTTTSSPESES--T 2115
+ + + P SPAS + S+PA S + PAS +
Sbjct: 84 KLSRDEPRRTEPQEPDPLDESPASAAPVASAPAPAPSPQSPKPASRRASRDMRRIAPFGM 143
Query: 2116 TTSSPASESTT 2126
S+PA E+
Sbjct: 144 NASAPAQEAAQ 154
>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
unknown].
Length = 652
Score = 32.0 bits (73), Expect = 4.4
Identities = 28/88 (31%), Positives = 39/88 (44%), Gaps = 9/88 (10%)
Query: 776 RKRLRKLKKKEKKLQEEQMEV------VEENPVDPDDTEGGKGKPELVDVVKRLPTIKTA 829
R R R++++ EK+L+E++ V + E GKG P V L I+ A
Sbjct: 477 RARDRRIERLEKELEEKKKRVEELERKLAELRKMRKLELSGKGTPVKVVEKLTLEAIEEA 536
Query: 830 SKTGKIKSVDVIL---GGGGEIRLALLL 854
+ IK DVIL GG R A L
Sbjct: 537 EEEYGIKEGDVILVEDPSGGGARTAEEL 564
>gnl|CDD|215601 PLN03142, PLN03142, Probable chromatin-remodeling complex ATPase
chain; Provisional.
Length = 1033
Score = 32.1 bits (73), Expect = 4.5
Identities = 17/48 (35%), Positives = 30/48 (62%), Gaps = 3/48 (6%)
Query: 767 TKDESSTRCRKRLRKLKKKEKKLQEEQMEVVEENP-VDPDDTEGGKGK 813
K E S R + RL++LKK++K+ ++ +E ++N +D D GKG+
Sbjct: 53 AKAEISKREKARLKELKKQKKQEIQKILE--QQNAAIDADMNNKGKGR 98
>gnl|CDD|152451 pfam12016, Stonin2_N, Stonin 2. Stonin 2 is involved in clathrin
mediated endocytosis. It binds to Eps15 by its highly
conserved NPF motif. The complex formed has been shown to
directly associate with the clathrin adaptor complex
AP-2, and to localize to clathrin-coated pits (CCPs). In
addition, stonin2 was recently identified as a specific
sorting adaptor for synaptotagmin, and may thus regulate
synaptic vesicle recycling.
Length = 341
Score = 31.8 bits (71), Expect = 4.5
Identities = 38/205 (18%), Positives = 71/205 (34%), Gaps = 17/205 (8%)
Query: 1895 TTTNSPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESES 1954
+T+ P E+ T P + T P +S L SES+ T+ SE T++ S
Sbjct: 112 ASTSPPHKETAETALPLTMPCWTC-PSFDSLGRCPLTSESSWTT--HSEDTSSPSFACSY 168
Query: 1955 TTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVS-- 2012
T + +E + +T +S + + + S SP PV+
Sbjct: 169 TDLQLINAEEQASGQASGADSTDNSSSLQEDEEVEMEAISWQAGSPAMNGHPAAPPVTSA 228
Query: 2013 --------ESTTTSSPVSESTTTISPESESTTTSSP----ASESTTTNNPKSESTTTNNP 2060
+ P+ + P + +++P S + + +ST N P
Sbjct: 229 RFPSWVTFDDNEVGCPLPPVPSPKKPNTPPAASAAPDVPFNSMGSFKKRDRPKSTLMNFP 288
Query: 2061 ASESITSSSPASESTTTSSPASEST 2085
+ + SS + +P +T
Sbjct: 289 KVQKLDISSLNRPPSVIEAPPWRAT 313
>gnl|CDD|218803 pfam05904, DUF863, Plant protein of unknown function (DUF863). This
family consists of a number of hypothetical proteins from
Arabidopsis thaliana and Oryza sativa. The function of
this family is unknown.
Length = 766
Score = 32.3 bits (73), Expect = 4.5
Identities = 48/314 (15%), Positives = 105/314 (33%), Gaps = 34/314 (10%)
Query: 1868 SEIIFTTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESESTTTSSPESESTTT 1927
S + N S + + T N E S S + S ++
Sbjct: 103 SNGLADLNEPSPTWGLTETANVQGQEVEERASDTSRDFLGRYGSNISHVQDQSLEKNLNH 162
Query: 1928 SSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPE--SEST 1985
+S++ S+P+S S V + S T S + E T
Sbjct: 163 NSVLEAGKEKSTPKSSLDLPSQEGQ--------VLSNKAFQPRYSLLTDQSKCKYVRERT 214
Query: 1986 TTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASEST 2045
+++ V S + SP+ S ++ P P+S + + +S
Sbjct: 215 SSNLEVQNK-------SPGVSYQSPLESSVASNLP--RLNPFYRPDSAKSWSHWSSSWEN 265
Query: 2046 TTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASEST 2105
++ +ST N+ ++ + + E+++T++P+ ++ S+ ++ S S+ +
Sbjct: 266 MSSGLDQKSTPLNSAQTQPV----LSFETSSTANPSFGTSCCSTNSNGFYNGFSSGSKES 321
Query: 2106 T--TSSPESESTTTSSPASESTTIEEQGVSPHSEKLS---------ANEDPEEFPNEDVF 2154
S+ + +S + + E E S + P + F
Sbjct: 322 PFFASTGFNYPNISSGEEATEHSFVELQGPKSEECSSGLPWLRKKPTCKGPLDLNASSAF 381
Query: 2155 EHTFAEIPNIDHSN 2168
+ A + +++ SN
Sbjct: 382 YSSNANVIDVEPSN 395
>gnl|CDD|227520 COG5193, LHP1, La protein, small RNA-binding pol III transcript
stabilizing protein and related La-motif-containing
proteins involved in translation [Posttranslational
modification, protein turnover, chaperones / Translation,
ribosomal structure and biogenesis].
Length = 438
Score = 31.9 bits (72), Expect = 4.8
Identities = 39/263 (14%), Positives = 76/263 (28%), Gaps = 23/263 (8%)
Query: 1914 STTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESE 1973
S T E + TSSL S T+S E S +S + ++ES+ + +
Sbjct: 9 SNTEHQAEDKKKQTSSLKLASEPTTSEEKS----KSQDSNTVIPVEELTESSKSKKEDKN 64
Query: 1974 STTTSSPESES-TTTSSLVSES--TTTSSPESESTTTISPVSESTTTSSPVSESTTTISP 2030
+ +S + S S T ++ P+ + T +P ++ P+ T +
Sbjct: 65 PSKLTSNTKWTLKQVEFYFSGSKDTDSNFPKDKFLKTTAPKNKKRDKWVPIKTI-ATFNR 123
Query: 2031 ESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSP 2090
S S + + + + E +S S + + S ++ST+
Sbjct: 124 MKNSG--------SPVSAVSGALRKSLDARVLEVSSSGSNKNRTEKLISNNNKSTSQM-- 173
Query: 2091 ASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGV---SPHSEKLSANEDPEE 2147
+ E +S + + K E
Sbjct: 174 -QRDVYQNGFGKEDVNNASRPEQQEDLEIQFPPHYHAPPSQIRNRRDWLNKNFRGSVFVE 232
Query: 2148 FPNEDVFE-HTFAEIPNIDHSNQ 2169
F + N + N
Sbjct: 233 FKYFREAQRFNNGFYRNKKYPND 255
>gnl|CDD|215361 PLN02673, PLN02673, quinolinate synthetase A.
Length = 724
Score = 31.9 bits (72), Expect = 5.0
Identities = 21/82 (25%), Positives = 31/82 (37%), Gaps = 14/82 (17%)
Query: 2032 SESTTTSSPASESTTTNNPKSESTTTN------------NPASESIT--SSSPASESTTT 2077
S S T+SS +S + NP TT+ NP +S S P + +
Sbjct: 2 SSSPTSSSSSSFLSLLPNPSPNFRTTHPNFGSQRRIGTINPLFKSFKCIQSPPPDSAPSN 61
Query: 2078 SSPASESTTTSSPASESTTTSS 2099
+SP S S SP+ +
Sbjct: 62 ASPFSCSAVAFSPSQTTELVPC 83
>gnl|CDD|115071 pfam06390, NESP55, Neuroendocrine-specific golgi protein P55
(NESP55). This family consists of several mammalian
neuroendocrine-specific golgi protein P55 (NESP55)
sequences. NESP55 is a novel member of the chromogranin
family and is a soluble, acidic, heat-stable secretory
protein that is expressed exclusively in endocrine and
nervous tissues, although less widely than chromogranins.
Length = 261
Score = 31.4 bits (70), Expect = 5.2
Identities = 31/127 (24%), Positives = 45/127 (35%), Gaps = 9/127 (7%)
Query: 1910 PESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSS 1969
PE S S E E E + ++ T S E ES + SE+ +
Sbjct: 81 PEP-SEPESDHEDEDFEPELARPECLEYDEDDFDTETDSETEPES----DIESETEFETE 135
Query: 1970 PESESTT--TSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTT 2027
PE+E T T+ PE+E V T T + + + +SP +T
Sbjct: 136 PETEPDTAPTTEPETEPEDEPGPVVPKGATFH--QSLTERLHALKLQSADASPRRAPPST 193
Query: 2028 ISPESES 2034
PES
Sbjct: 194 QEPESAR 200
>gnl|CDD|221581 pfam12446, DUF3682, Protein of unknown function (DUF3682). This
domain family is found in eukaryotes, and is typically
between 125 and 136 amino acids in length.
Length = 133
Score = 30.2 bits (68), Expect = 5.4
Identities = 21/83 (25%), Positives = 35/83 (42%), Gaps = 2/83 (2%)
Query: 2037 TSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTT 2096
SS +S P PA+ + SS+ +S SS ++ S TT ++ +
Sbjct: 10 VSSGSSAPAPPAGPGPGPNAPPAPAAPGVDSSAGSSGGEAGSSGSNSSNTTGDSSTGDQS 69
Query: 2097 TSSPASESTTTSSPESESTTTSS 2119
A+ + +S PE + TTS
Sbjct: 70 --PAAAAAHNSSPPEGPAGTTSG 90
>gnl|CDD|233366 TIGR01348, PDHac_trf_long, pyruvate dehydrogenase complex
dihydrolipoamide acetyltransferase, long form. This
model describes a subset of pyruvate dehydrogenase
complex dihydrolipoamide acetyltransferase specifically
close by both phylogenetic and per cent identity (UPGMA)
trees. Members of this set include two or three copies of
the lipoyl-binding domain. E. coli AceF is a member of
this model, while mitochondrial and some other bacterial
forms belong to a separate model [Energy metabolism,
Pyruvate dehydrogenase].
Length = 546
Score = 31.8 bits (72), Expect = 5.5
Identities = 23/96 (23%), Positives = 38/96 (39%), Gaps = 1/96 (1%)
Query: 2021 VSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSP 2080
VS + I+ ES+ + PA S + K + + +T S S T +P
Sbjct: 143 VSADQSLITLESDKASMEVPAPASGVVKSVKVKVGDSVPTGDLILTLSVAGSTPATAPAP 202
Query: 2081 ASESTTTSSPASES-TTTSSPASESTTTSSPESEST 2115
AS SPA+ ++PA+ +P+ T
Sbjct: 203 ASAQPAAQSPAATQPEPAAAPAAAKAQAPAPQQAGT 238
>gnl|CDD|177475 PHA02693, PHA02693, hypothetical protein; Provisional.
Length = 710
Score = 31.9 bits (72), Expect = 5.5
Identities = 29/116 (25%), Positives = 46/116 (39%), Gaps = 10/116 (8%)
Query: 2013 ESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPAS 2072
+ + T+ S S +T S S +T S+ + T T +P + ES P +
Sbjct: 271 DESDTADSCSRSFSTQSTRSTRSTRSTRSGAETDTTDPDLDP-----DDDESFDEVGPLT 325
Query: 2073 ESTTTSSPASESTTTSSPAS--ESTTTSSPASESTTTSS---PESESTTTSSPASE 2123
T +S A ++ SS AS S+ SE +S+ P+ T A E
Sbjct: 326 RRFTATSFAPRASVRSSSASMRLHARGSTRISEPLMSSAARVPKVSMAPTLDTAEE 381
>gnl|CDD|218190 pfam04651, Pox_A12, Poxvirus A12 protein.
Length = 188
Score = 30.9 bits (70), Expect = 5.6
Identities = 17/73 (23%), Positives = 29/73 (39%), Gaps = 1/73 (1%)
Query: 2046 TTNNPKSESTTTNNPASESITSSS-PASESTTTSSPASESTTTSSPASESTTTSSPASES 2104
N + + NNP+ + + S S++ S S ST+ + P S S+ + S A
Sbjct: 36 QANRGGNLAGPENNPSDNEVKAGKRVTSASSSKSKRCSTSTSKTKPCSRSSRSRSGAPRR 95
Query: 2105 TTTSSPESESTTT 2117
T+ E
Sbjct: 96 RGTAFGSMEDPQI 108
>gnl|CDD|218744 pfam05781, MRVI1, MRVI1 protein. This family consists of mammalian
MRVI1 proteins which are related to the
lymphoid-restricted membrane protein (JAW1) and the IP3
receptor associated cGMP kinase substrates A and B (IRAGA
and IRAGB). The function of MRVI1 is unknown although
mutations in the Mrvi1 gene induces myeloid leukaemia by
altering the expression of a gene important for myeloid
cell growth and/or differentiation so it has been
speculated that Mrvi1 is a tumour suppressor gene. IRAG
is very similar in sequence to MRVI1 and is an essential
NO/cGKI-dependent regulator of IP3-induced calcium
release. Activation of cGKI decreases IP3-stimulated
elevations in intracellular calcium, induces smooth
muscle relaxation and contributes to the
antiproliferative and pro-apoptotic effects of NO/cGMP.
Jaw1 is a member of a class of proteins with
COOH-terminal hydrophobic membrane anchors and is
structurally similar to proteins involved in vesicle
targeting and fusion. This suggests that the function
and/or the structure of the ER in lymphocytes may be
modified by lymphoid-restricted resident ER proteins.
Length = 538
Score = 31.9 bits (72), Expect = 5.6
Identities = 24/123 (19%), Positives = 36/123 (29%), Gaps = 7/123 (5%)
Query: 2030 PESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSS 2089
PA ES + +S + + + T TSSP + +
Sbjct: 40 ASQGENGVGEPAGES-----VGQKRELWPPTSSPPLLRGTSSDSGTETSSPRGQKILAMA 94
Query: 2090 PASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQ-GVSPHSEKLSANEDPEEF 2148
ES +SP + T S A E + V +L A E+ E
Sbjct: 95 SLDLDEKRLCGKEESKRAASPGLKQQGT-SLAEEHILLRNSNLVGKKLPELEAAEEQETS 153
Query: 2149 PNE 2151
E
Sbjct: 154 EIE 156
>gnl|CDD|152863 pfam12429, DUF3676, Protein of unknown function (DUF3676). This
domain family is found in eukaryotes, and is
approximately 230 amino acids in length.
Length = 230
Score = 31.0 bits (70), Expect = 5.6
Identities = 54/217 (24%), Positives = 82/217 (37%), Gaps = 18/217 (8%)
Query: 1892 SENTTTNSPESESTTTNNPESESTTTSSPESESTTTS--SLVSESTTTSSPESESTTTSS 1949
SE +T + E T+ E ES P + S+T S VSE T + ES S
Sbjct: 11 SEESTASHEELTEDDTDKQEEESVHDPVPAAPSSTVVAGSSVSEPATAA----ESAENSR 66
Query: 1950 PE-----SESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLV---SESTTTSSPE 2001
PE SE T+ S +SE T + V SES T PE
Sbjct: 67 PEDNAQLSEGETSQQATLNEDNESMQRDSDVQPQDLQSEELTEVTDVEGSSESNDTEQPE 126
Query: 2002 SESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPA 2061
E + S +T+ S S T + + ++E + N+ + T A
Sbjct: 127 EEGEA--NDRSGGSTSPVAASLSMETATAPVDGEHQVQQSTELSAENDDVRSTGTGTTGA 184
Query: 2062 SESITSSSPASESTTTSSPASESTTTSSPASESTTTS 2098
ES+ S A + + + S+S+ T S + T++
Sbjct: 185 EESL--SLEAGDGNSERTMGSDSSLTPSKSDAEPTSA 219
>gnl|CDD|221321 pfam11928, DUF3446, Domain of unknown function (DUF3446). This
presumed domain is functionally uncharacterized. This
domain is found in eukaryotes. This domain is typically
between 80 to 99 amino acids in length. This domain is
found associated with pfam00096. This domain has a single
completely conserved residue P that may be functionally
important.
Length = 84
Score = 29.1 bits (65), Expect = 5.7
Identities = 16/50 (32%), Positives = 25/50 (50%), Gaps = 3/50 (6%)
Query: 2077 TSSPASESTTTSSPASESTTTSSPASESTTTSSPE---SESTTTSSPASE 2123
++ P S S ++SS +S S++ S P S S S P S + SS +
Sbjct: 31 SNPPPSSSPSSSSSSSSSSSQSPPLSCSVHQSEPSPIYSAAPPYSSACGD 80
>gnl|CDD|227507 COG5180, PBP1, Protein interacting with poly(A)-binding protein [RNA
processing and modification].
Length = 654
Score = 31.6 bits (71), Expect = 5.9
Identities = 22/127 (17%), Positives = 42/127 (33%), Gaps = 6/127 (4%)
Query: 1994 STTTSSPESESTTTISPVSESTTTSSPVSESTTTISPE---SESTTTSSPASESTTTNNP 2050
PE+ S + S E I + S + S
Sbjct: 281 LLENRKPEAVSAPEAVSPQSKSEGPSSGQEKEKQIKEKKSFSYGWKHTKFDSSKNLLEVI 340
Query: 2051 KSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSP 2110
KS+ + + +S + S S A++ S P ES + S A++ + ++
Sbjct: 341 KSKFKSLFDISSGELKWGSKPPWEAKAVSIATK---VSKPKKESVRSGSKAAKKSPSTKH 397
Query: 2111 ESESTTT 2117
+ S+T+
Sbjct: 398 TTRSSTS 404
>gnl|CDD|227478 COG5149, TOA1, Transcription initiation factor IIA, large chain
[Transcription].
Length = 293
Score = 31.2 bits (70), Expect = 6.1
Identities = 27/126 (21%), Positives = 43/126 (34%), Gaps = 17/126 (13%)
Query: 1856 ATAVAISVIDNYSEIIFTTNNNSESTVVMSTLNSLLSENTTTNSPESESTTTNNPESEST 1915
A AVA S I N S TN + +S+ + + + +P ++TN
Sbjct: 77 APAVANSPILNQSA----TNISFDSSAIPNV-------QSNNTAPFPSYSSTN------Q 119
Query: 1916 TTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESEST 1975
T SP +T++L + S E E + S ++ T E
Sbjct: 120 TADSPIINDHSTANLKIYGDIIAEVISLPNRLEQVEDELSIGKSAITTLRNTDWRERLID 179
Query: 1976 TTSSPE 1981
T S
Sbjct: 180 DTQSEW 185
>gnl|CDD|146285 pfam03566, Peptidase_A21, Peptidase family A21.
Length = 628
Score = 31.8 bits (72), Expect = 6.4
Identities = 28/173 (16%), Positives = 56/173 (32%), Gaps = 17/173 (9%)
Query: 1900 PESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSS 1959
P SE+ +T P + + S+ V S P S + +P ++
Sbjct: 275 PISETQNAVPDIVAGSTFVGPSNVTRPGSATVVTLVWASLPPGGSAPSGTPTWTPNSSGQ 334
Query: 1960 LVSES-----------TTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTI 2008
T E ++P T + T T + + + T +
Sbjct: 335 FGQWRHGGFDASVILPTVPRGYTMEYGDFANPGDTLTFGQTGGDNVTITITAPTVTVTVL 394
Query: 2009 SPVSESTTTSSPVS-ESTTTISPESESTTTSS----PAS-ESTTTNNPKSEST 2055
+ ++ S V+ +S ++ ++ + S P + T N PK+E
Sbjct: 395 ASLTSSNGVFRGVTADSGARLNLDTAALNRLSIPLPPLTFGQTMQNTPKTEQF 447
>gnl|CDD|227404 COG5072, ALK1, Serine/threonine kinase of the haspin family [Cell
division and chromosome partitioning].
Length = 488
Score = 31.4 bits (71), Expect = 6.6
Identities = 17/84 (20%), Positives = 35/84 (41%), Gaps = 1/84 (1%)
Query: 1972 SESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPE 2031
S + S P S++ + + + E + ++ S V + P E++ TI +
Sbjct: 49 SLESIHSKPSKTSSSKWNFWKKKGSYPENELLAKSSFSSVHTVIFPAGPRDEASKTIVSK 108
Query: 2032 SESTT-TSSPASESTTTNNPKSES 2054
E T + A S+ +N+ K +
Sbjct: 109 KEVTNLLNHKALSSSLSNSLKHKP 132
>gnl|CDD|216095 pfam00748, Calpain_inhib, Calpain inhibitor. This region is found
multiple times in calpain inhibitor proteins.
Length = 131
Score = 29.8 bits (67), Expect = 7.0
Identities = 19/82 (23%), Positives = 32/82 (39%), Gaps = 2/82 (2%)
Query: 2074 STTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEEQGVS 2133
S T S + + T+ E ++ + E + S S + P + + + +
Sbjct: 6 SDFTCSASPPPSPTAKKKKEEAEKTAASGEVVSAQSAPSVRSAAPPPEKKRDKMSDDALD 65
Query: 2134 PHSEKLSANE-DPEE-FPNEDV 2153
S+ L E DPEE P ED
Sbjct: 66 ALSDSLGQREPDPEEKKPVEDK 87
>gnl|CDD|240232 PTZ00021, PTZ00021, falcipain-2; Provisional.
Length = 489
Score = 31.3 bits (71), Expect = 7.2
Identities = 15/56 (26%), Positives = 27/56 (48%), Gaps = 13/56 (23%)
Query: 2346 SCEGSINPRYIHSVKIIGWG-------KSSQNEP--YWLCTNSYNQGWGEQGLFKI 2392
C N H+V ++G+G + + E Y++ NS+ + WGE+G +I
Sbjct: 415 ECGEEPN----HAVILVGYGMEEIYNSDTKKMEKRYYYIIKNSWGESWGEKGFIRI 466
>gnl|CDD|177303 PHA00735, PHA00735, hypothetical protein.
Length = 808
Score = 31.4 bits (71), Expect = 7.3
Identities = 17/36 (47%), Positives = 21/36 (58%)
Query: 87 SSQLAVAYTNGSLKTFSLDTTDVISTFTGHKSAITV 122
S+QL V Y NG+LKTFS+ VI+ S TV
Sbjct: 198 SNQLYVYYYNGTLKTFSITPGQVINNQFYPLSLNTV 233
>gnl|CDD|218883 pfam06075, DUF936, Plant protein of unknown function (DUF936). This
family consists of several hypothetical proteins from
Arabidopsis thaliana and Oryza sativa. The function of
this family is unknown.
Length = 564
Score = 31.3 bits (71), Expect = 7.3
Identities = 45/285 (15%), Positives = 77/285 (27%), Gaps = 64/285 (22%)
Query: 1901 ESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSL 1960
S S T+ P S SS S S+ + + S + + SS S S
Sbjct: 192 PSPSGGTSCPSSSGGRRSSIGSRRLRGSASLRKKVAVLSAPRKPGSRSSDCKSSPRARSS 251
Query: 1961 VSESTTTSSPESESTTTSSPESE----STTTSSLVSESTTTSSPESESTTTISPVSESTT 2016
++S SS + ++T S S T+ S SE E++ ++ ++
Sbjct: 252 SAKSPFKSSIQRKATKALSKLSLRASPKDTSKSSKSEVAPPKKSEAKVPSSSKKWTDGNV 311
Query: 2017 TSSPVSESTTTISPE-----------------------------------SESTTTSSPA 2041
+ + S + + E S S +P
Sbjct: 312 SWDSLPSSLSKLGKEALRQRDVAQKAALEALREASATESLIRCLSTFSELSSSAKEDNPL 371
Query: 2042 ---------------------SESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSP 2080
S S + + A I ++ S + S
Sbjct: 372 PCIEKFLKFHQELDQAIKIAESLSKSRSPDAECRLERKKSALSWIRAALATDLSPFSLSG 431
Query: 2081 ASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASEST 2125
+TS TS E + S + S ES+
Sbjct: 432 KESKRSTSLKKLVPPKTSRSNDEGRS----SSVGSIKGSGLKESS 472
Score = 31.3 bits (71), Expect = 7.5
Identities = 36/202 (17%), Positives = 63/202 (31%), Gaps = 8/202 (3%)
Query: 1977 TSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTT 2036
++ + + + +S + S ++ SSP S + T
Sbjct: 116 VAADSLAFFSDAVIQVIKRKKASSAPRRGSWDSSSKSASIDSSPTVIGPRPRSFSELNLT 175
Query: 2037 TSSPA--SESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASES 2094
+PA S + S S T+ P+S SS S S AS + ++
Sbjct: 176 DRTPAKVRSSRSELGAPSPSGGTSCPSSSGGRRSSIGSRRLRGS--ASLRKKVAVLSAPR 233
Query: 2095 TTTSSPASESTTTSSPESESTTTSSPA-SESTTIEEQGVSPHSEKLSANEDPEEFPNEDV 2153
S S SSP + S++ SP S + +S S + S + + +E
Sbjct: 234 KPGSRS---SDCKSSPRARSSSAKSPFKSSIQRKATKALSKLSLRASPKDTSKSSKSEVA 290
Query: 2154 FEHTFAEIPNIDHSNQTDEAIP 2175
TD +
Sbjct: 291 PPKKSEAKVPSSSKKWTDGNVS 312
>gnl|CDD|218115 pfam04502, DUF572, Family of unknown function (DUF572). Family of
eukaryotic proteins with undetermined function.
Length = 321
Score = 31.3 bits (71), Expect = 7.3
Identities = 17/100 (17%), Positives = 35/100 (35%), Gaps = 14/100 (14%)
Query: 2011 VSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSS- 2069
++ T SP S S++ P S +++ SE+ P S N+ +
Sbjct: 220 EEDNDNTPSPKSGSSSPAKPTSILKKSAAKRSEA-----PSSSKAKKNSRGIPKPRDALS 274
Query: 2070 --------PASESTTTSSPASESTTTSSPASESTTTSSPA 2101
++ + S A ++ + A S+ +S
Sbjct: 275 SLVVRKKAAPESTSQSPSSAEPTSESPQTAGNSSLSSLGD 314
>gnl|CDD|205996 pfam13825, Paramyxo_PNT, Paramyxovirus structural protein V/P
N-terminus. This family consists of several
Paramyxoviridae structural protein P and V sequences.
From a structural point of view, P is the
best-characterized protein of the replicative complex. P
is organised into two moieties that are functionally and
structurally distinct: a C-terminal moiety (PCT) and an
N-terminal moiety (PNT). PCT is the most conserved in
sequence and contains all regions required for virus
transcription, whereas PNT, which is poorly conserved,
provides several additional functions required for
replication. P protein plays a crucial role in the enzyme
by positioning L onto the N/RNA template through an
interaction with the C-terminal domain of N. Without P, L
is not functional. The N, P, and L proteins of SeV and
measles and mumps viruses are functionally equivalent.
However, sequence identity between proteins from these
viruses is limited, and the viruses have been placed in
different genera (Respirovirus, Morbilivirus, and
Rubulavirus, respectively). SeV P protein (568 aa) is a
modular protein with distinct functional domains. The
N-terminal part of P (PNT) is a chaperone for N and
prevents it from binding to non-viral RNA in the infected
cell.
Length = 309
Score = 31.0 bits (70), Expect = 7.6
Identities = 24/91 (26%), Positives = 37/91 (40%), Gaps = 12/91 (13%)
Query: 2046 TTNNPKSESTTTNNPASESI--------TSSSPASESTTTSS---PASESTTTSS-PASE 2093
T P +P+ + I SS +ES +T A +ST SS P +
Sbjct: 206 TLQVPPIPDVKRGDPSCKPIKKGTEERSASSGTETESLSTGGATQSALKSTWGSSEPNAS 265
Query: 2094 STTTSSPASESTTTSSPESESTTTSSPASES 2124
+ AS + + ES TT+SP S++
Sbjct: 266 AGNVRQSASNAKMIQKCKQESGTTASPRSQN 296
Score = 31.0 bits (70), Expect = 8.3
Identities = 20/96 (20%), Positives = 32/96 (33%), Gaps = 11/96 (11%)
Query: 1939 SPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTS 1998
P+ + S + T S T T S + T S+ +S +S
Sbjct: 212 IPDVKRGDPSCKPIKKGTEERSASSGTETESLSTGGATQSALKSTW-----------GSS 260
Query: 1999 SPESESTTTISPVSESTTTSSPVSESTTTISPESES 2034
P + + S + ES TT SP S++
Sbjct: 261 EPNASAGNVRQSASNAKMIQKCKQESGTTASPRSQN 296
>gnl|CDD|219865 pfam08493, AflR, Aflatoxin regulatory protein. This domain is found
in the aflatoxin regulatory protein (AflR) which is
involved in the regulation of the biosynthesis of
aflatoxin in the fungal genus Aspergillus. It occurs
together with the fungal Zn(2)-Cys(6) binuclear cluster
domain (pfam00172).
Length = 275
Score = 31.1 bits (70), Expect = 7.7
Identities = 32/160 (20%), Positives = 53/160 (33%), Gaps = 8/160 (5%)
Query: 1961 VSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSP 2020
+ T SSP + TT+ TTSS + S P S +P + + T+S
Sbjct: 2 LETPNTASSPTIPANTTA------NTTSSSHPQPPVQSGPSSIQPPVATPHTPNGTSSPS 55
Query: 2021 VSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTT--TS 2078
S + E E + + S S + N + S SP+
Sbjct: 56 PKFSHQSPPAEPELWGSILSPNASNQDQGDLSSLLSVNTDFGQLFASLSPSPLFDGNDAD 115
Query: 2079 SPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTS 2118
A + S E ++ ++ S P S T+ +
Sbjct: 116 LHAEATGELSVADLEVSSPMQDLFLTSALSPPSSARTSHT 155
>gnl|CDD|132198 TIGR03154, sulfolob_CbsA, cytochrome b558/566, subunit A. Members of
this protein family are CbsA, one subunit of a highly
glycosylated, heterodimeric, mono-heme cytochrome
b558/566, found in Sulfolobus acidocaldarius and several
other members of the Sulfolobales, a branch of the
Crenarchaeota.
Length = 465
Score = 31.1 bits (70), Expect = 8.1
Identities = 16/47 (34%), Positives = 27/47 (57%), Gaps = 1/47 (2%)
Query: 1983 ESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTTIS 2029
+ + TSS ++ T+ P ++TT S S STTTSS + +T ++
Sbjct: 400 DKSITSSFLTLELVTTPPTPPTSTTTS-TSPSTTTSSAIPSTTLYVT 445
Score = 31.1 bits (70), Expect = 8.6
Identities = 16/40 (40%), Positives = 24/40 (60%)
Query: 2063 ESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPAS 2102
+SITSS E TT STTTS+ S +T+++ P++
Sbjct: 401 KSITSSFLTLELVTTPPTPPTSTTTSTSPSTTTSSAIPST 440
>gnl|CDD|221093 pfam11359, gpUL132, Glycoprotein UL132. Glycoprotein UL132 is a
low-abundance structural component of Human
cytomegalovirus (HCMV). The function of this protein is
not fully understood.
Length = 235
Score = 30.8 bits (69), Expect = 8.1
Identities = 15/67 (22%), Positives = 32/67 (47%), Gaps = 2/67 (2%)
Query: 1956 TTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSEST 2015
TSS + + TT++ T+++ + T++L ++TT+ P S T + +
Sbjct: 1 MTSSTTTPANTTATVTVTVATSNTTSVSTNVTTAL--TASTTAEPGSVLTELLGIIIYCV 58
Query: 2016 TTSSPVS 2022
+ S +S
Sbjct: 59 SGVSILS 65
>gnl|CDD|220102 pfam09073, BUD22, BUD22. BUD22 has been shown in yeast to be a
nuclear protein involved in bud-site selection. It plays
a role in positioning the proximal bud pole signal. More
recently it has been shown to be involved in ribosome
biogenesis.
Length = 424
Score = 31.0 bits (70), Expect = 8.3
Identities = 33/140 (23%), Positives = 52/140 (37%), Gaps = 11/140 (7%)
Query: 1893 ENTTTNSPE-SESTTTNNPESESTTTSSPESESTTTSS--------LVSESTTTSSPESE 1943
E++ + E SES + E + S E E + S LV S E+
Sbjct: 165 ESSDKDDEEESESEDESKSEESAEDDSDDEEEEDSDSEDYSQYDGMLVDSSDEEEGEEAP 224
Query: 1944 STT--TSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPE 2001
S + ESES + S +SES + S E S + P+ + T+++ L S S
Sbjct: 225 SINYNEDTSESESDESDSEISESRSVSDSEESSPPSKKPKEKKTSSTFLPSLMGGYFSGS 284
Query: 2002 SESTTTISPVSESTTTSSPV 2021
+ + PV
Sbjct: 285 EDEDDDDEDIDPDQVVKKPV 304
>gnl|CDD|234665 PRK00145, PRK00145, putative inner membrane protein translocase
component YidC; Provisional.
Length = 223
Score = 30.5 bits (69), Expect = 8.4
Identities = 12/30 (40%), Positives = 20/30 (66%), Gaps = 4/30 (13%)
Query: 779 LRKLKKKEK----KLQEEQMEVVEENPVDP 804
++KL+ K K KLQ+E M++ +E V+P
Sbjct: 67 IKKLQAKYKNDPQKLQQEMMKLYKEKGVNP 96
>gnl|CDD|216269 pfam01056, Myc_N, Myc amino-terminal region. The myc family belongs
to the basic helix-loop-helix leucine zipper class of
transcription factors, see pfam00010. Myc forms a
heterodimer with Max, and this complex regulates cell
growth through direct activation of genes involved in
cell replication. Mutations in the C-terminal 20 residues
of this domain cause unique changes in the induction of
apoptosis, transformation, and G2 arrest.
Length = 329
Score = 31.1 bits (70), Expect = 8.8
Identities = 23/84 (27%), Positives = 33/84 (39%), Gaps = 1/84 (1%)
Query: 1909 NPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVS-ESTTT 1967
N S+S+ +SP + S S++ S ESE E E +V+ E +
Sbjct: 195 NERSKSSKVASPTPRLGLRTPPNSSSSSGSDSESEEDEEEEEEEEEEEEIDVVTVEKRRS 254
Query: 1968 SSPESESTTTSSPESESTTTSSLV 1991
SS ST+ S S LV
Sbjct: 255 SSNRKASTSESITVPSRRHHSPLV 278
>gnl|CDD|215592 PLN03126, PLN03126, Elongation factor Tu; Provisional.
Length = 478
Score = 31.1 bits (70), Expect = 8.8
Identities = 20/70 (28%), Positives = 36/70 (51%), Gaps = 7/70 (10%)
Query: 1922 SESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPE 1981
S ++++SSL+ S+++SSP S + + S S + T +SS S +TT++
Sbjct: 6 SAASSSSSLLLPSSSSSSPSSSTFSFKST-------SGKLKSLTLSSSFLSPFSTTTTST 58
Query: 1982 SESTTTSSLV 1991
S+ S V
Sbjct: 59 SQRRRRSFTV 68
>gnl|CDD|219094 pfam06583, Neogenin_C, Neogenin C-terminus. This family represents
the C-terminus of eukaryotic neogenin precursor proteins,
which contains several potential phosphorylation sites.
Neogenin is a member of the N-CAM family of cell adhesion
molecules (and therefore contains multiple copies of
pfam00047 and pfam00041) and is closely related to the
DCC tumour suppressor gene product - these proteins may
play an integral role in regulating differentiation
programmes and/or cell migration events within many adult
and embryonic tissues.
Length = 295
Score = 30.7 bits (69), Expect = 9.0
Identities = 42/259 (16%), Positives = 76/259 (29%), Gaps = 23/259 (8%)
Query: 1899 SPESESTTTNNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTS 1958
SP + T+ P S + S + + + ESE + +S
Sbjct: 23 SPHPNPSGTDTPIRSSQDITPVSSSAQSEPQSGQRRNSYRGHESEDSMSSLAARRGMRPK 82
Query: 1959 SLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTS 2018
++ + P + + ++ L S + ++ P+ ST T
Sbjct: 83 MMIPMDSQPPQPVVSAHPIHTLDN-PQYPGILPSPRCGYLHHQF----SLRPMPFSTLTV 137
Query: 2019 SPVSESTTTISPESESTTT--SSPASESTTTNNPKSESTTTNNPASESITS----SSPAS 2072
+ +ES + +P +S + P+ T+ + P
Sbjct: 138 ----QRLYQHGDRAESVESVRQTPEPPYLPAAQSESSNAAEEAPSRSIPTAHVRPTHPLK 193
Query: 2073 ESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTT----SSPASESTTIE 2128
+ PAS ST P ST + + S ++ S T SP T
Sbjct: 194 SFAVPALPASMSTI--EPKLPSTPLLTQQGPTLPKHSVKTASVGTLGRARSPLLPVTVPS 251
Query: 2129 EQGVSPHSEKLSANEDPEE 2147
V ED +E
Sbjct: 252 APDVL--ETGGKMLEDTDE 268
>gnl|CDD|113413 pfam04642, DUF601, Protein of unknown function, DUF601. This family
represents a conserved region found in several
uncharacterized plant proteins.
Length = 311
Score = 30.8 bits (69), Expect = 9.4
Identities = 19/85 (22%), Positives = 33/85 (38%), Gaps = 8/85 (9%)
Query: 2069 SPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIE 2128
+P SES T S ++T++ P+ +SE S P S E
Sbjct: 31 PSTLAGKNPDAPTSESRTPS----KATSSKDPSKRYADKKRKQSEKDARSPPRSSRPRTE 86
Query: 2129 EQGVSPHSEKLSANEDPEEFPNEDV 2153
E+ P +K E ++ ++D+
Sbjct: 87 EKDAGPSQQK----EKGKKGDSQDL 107
>gnl|CDD|227911 COG5624, TAF61, Transcription initiation factor TFIID, subunit TAF12
(also component of histone acetyltransferase SAGA)
[Transcription].
Length = 505
Score = 30.8 bits (69), Expect = 9.4
Identities = 18/126 (14%), Positives = 28/126 (22%), Gaps = 13/126 (10%)
Query: 1993 ESTTTSSPESESTTTISPVSESTTTSSPVSESTTTISPESESTTTSSPASESTTTNNPKS 2052
E++ P + + V P S T N +S
Sbjct: 259 EASGMPPPAEWAGSNGLHVLPGRREEVPRGIFRCPSPESSRGEPTHLDYRNGMANNAQRS 318
Query: 2053 E-----STTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTT 2107
S NP ++ P T T A+ ++P
Sbjct: 319 RFPGTCSIYPENPGKRWCSTKYPQP----LVHKGDRDTETGGCAAPDGGLATPG----RD 370
Query: 2108 SSPESE 2113
P E
Sbjct: 371 KGPLYE 376
>gnl|CDD|227625 COG5309, COG5309, Exo-beta-1,3-glucanase [Carbohydrate transport and
metabolism].
Length = 305
Score = 30.6 bits (69), Expect = 9.6
Identities = 16/47 (34%), Positives = 25/47 (53%)
Query: 2072 SESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTS 2118
S S+ S S S ++ +S S+ SS ASE +++SS S S +
Sbjct: 1 STSSMQFSSTSSSAALATLSSSSSALSSSASEVSSSSSRASASGFLA 47
>gnl|CDD|173135 PRK14672, uvrC, excinuclease ABC subunit C; Provisional.
Length = 691
Score = 31.2 bits (70), Expect = 9.6
Identities = 26/114 (22%), Positives = 52/114 (45%), Gaps = 8/114 (7%)
Query: 2037 TSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTT 2096
+S+ +E ++ ++ T T P T +P+S + TT++P ++ S+ +S
Sbjct: 316 SSAGLAEHWLSHKAGTQCTVTLIPLHTFPTPQTPSS-TVTTNAPTLAASQNSNAVQDSGL 374
Query: 2097 TSSPASESTTTSSPESESTTTSSPASESTTIEEQGVSPHSE------KLSANED 2144
S + ST + ++ T+S + T E +PH +L+A+ED
Sbjct: 375 RSC-SETSTMHTLQKAHDACTASEGTRENTPHESAHTPHHRAILAMAQLNAHED 427
>gnl|CDD|215180 PLN02316, PLN02316, synthase/transferase.
Length = 1036
Score = 31.0 bits (70), Expect = 9.6
Identities = 18/107 (16%), Positives = 32/107 (29%), Gaps = 1/107 (0%)
Query: 2047 TNNPKSESTTTNNPASESITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTT 2106
+ K + + A + SS ST+TSS + + +S A E
Sbjct: 1 MSTSKPKGSAPRGFAPRTTVESSQKRIQQNNGDKEDSSTSTSSLSVSAVEKTSNAKEEIQ 60
Query: 2107 TSSPESESTTTSSPASESTTIEEQGVSPHSEKLSANEDPEEFPNEDV 2153
+ + +E IE + K S+ E +
Sbjct: 61 VDFQHNSESAVEEVEAE-DEIEVEQNQSDVLKSSSIVKEESISTDMD 106
>gnl|CDD|218549 pfam05308, Mito_fiss_reg, Mitochondrial fission regulator. In
eukaryotes, this family of proteins induces mitochondrial
fission.
Length = 248
Score = 30.5 bits (69), Expect = 9.8
Identities = 19/86 (22%), Positives = 31/86 (36%), Gaps = 4/86 (4%)
Query: 1968 SSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVSESTTTSSPVSESTTT 2027
+S S+ ++ S+TTS +S T E P + +ST+
Sbjct: 149 NSTTSDLLSSDESVPSSSTTSFPISPPT----EEPVLEVPPPPPPPPPPPPPSLQQSTSA 204
Query: 2028 ISPESESTTTSSPASESTTTNNPKSE 2053
I E S A ++ + PKS
Sbjct: 205 IDLIKERKGQRSAAGKTLVLSKPKSP 230
>gnl|CDD|234229 TIGR03490, Mycoplas_LppA, mycoides cluster lipoprotein, LppA/P72
family. Members of this protein family occur in
Mycoplasma mycoides, Mycoplasma hyopneumoniae, and
related Mycoplasmas in small paralogous families that may
also include truncated forms and/or pseudogenes. Members
are predicted lipoproteins with a conserved signal
peptidase II processing and lipid attachment site. Note
that the name for certain characterized members, p72,
reflects an anomalous apparent molecular weight, given a
theoretical MW of about 61 kDa.
Length = 541
Score = 31.0 bits (70), Expect = 9.9
Identities = 24/119 (20%), Positives = 35/119 (29%), Gaps = 11/119 (9%)
Query: 2064 SITSSSPASESTTTSSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASE 2123
SI+ S S STT+S+ S ++P + SE+ S
Sbjct: 15 SISFLSVVSCSTTSSN----SKQPEKKPEIKPNENTPKIPKKPDNKEPSENNNNKSNNEN 70
Query: 2124 STTIEEQGVSPHSEKLSANEDPEEFPNEDVFEHTFAEIPNIDHSNQTDEAIPETFDARE 2182
P S DP + N++ E E D Q D+ D
Sbjct: 71 KDEEN-----PSSTNPEKKPDPSK--NKEEIEKPKDEPKKPDKKPQADQPNNVHADQPN 122
>gnl|CDD|114648 pfam05937, EB1_binding, EB-1 Binding Domain. This region at the
C-terminus of the APC proteins binds the
microtubule-associating protein EB-1. At the C-terminus
of the alignment is also a pfam00595 binding domain. A
short motif in the middle of the region appears to be
found in the APC2 proteins.
Length = 174
Score = 30.2 bits (67), Expect = 10.0
Identities = 20/97 (20%), Positives = 43/97 (44%), Gaps = 4/97 (4%)
Query: 1908 NNPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTSSPESESTTTSSLVSESTTT 1967
NNP T + +E T SS S S+ SSP +P + + + +++++
Sbjct: 74 NNPVPVQETNENSIAERTAFSS--SSSSKHSSPSGTVAARVTPFNYNPSPRKSNADNSSA 131
Query: 1968 SSPESESTTTSSPESESTTTSSLVSESTTTSSPESES 2004
+ + ++ + + T S ++S+ + SP+ S
Sbjct: 132 RPSQIPTPVNNNTKKRDSKTDS--TDSSGSQSPKRHS 166
>gnl|CDD|177952 PLN02318, PLN02318, phosphoribulokinase/uridine kinase.
Length = 656
Score = 31.0 bits (70), Expect = 10.0
Identities = 35/172 (20%), Positives = 66/172 (38%), Gaps = 17/172 (9%)
Query: 1962 SESTTTSSPESESTTTSSPESESTTTSSLVSESTTTSSPESESTTTISPVS----ESTTT 2017
S S E+ + +S + + S +S S +T ++ S T V+ + +
Sbjct: 429 SLDDDLVSSPKEALSRASADRRNKNLKSGLSHSYSTQRDKNLSKLTGLAVTNRRFDERNS 488
Query: 2018 SSPVSESTTTISPESESTTTSSPASESTTTNNPKSESTTTNNPASESITSSSPASESTTT 2077
SP + + I+ SE ++ + + T+ + S + S S + + +E+
Sbjct: 489 ESPAALNQGAITQLSEQISSLNERMDEFTSRIEELNSKLSIKKNSPSQQNLALQAEACNG 548
Query: 2078 SSPASESTTTSSPASESTTTSSPASESTTTSSPESESTTTSSPASESTTIEE 2129
S+P S + S T + P S S+ S A ES +EE
Sbjct: 549 SAPTSYFVSGLGNGS-----------LTGSILPLSSSS--SQLAKESPLMEE 587
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.315 0.129 0.379
Gapped
Lambda K H
0.267 0.0789 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 117,439,458
Number of extensions: 11348795
Number of successful extensions: 22542
Number of sequences better than 10.0: 1
Number of HSP's gapped: 16441
Number of HSP's successfully gapped: 1401
Length of query: 2435
Length of database: 10,937,602
Length adjustment: 113
Effective length of query: 2322
Effective length of database: 5,925,600
Effective search space: 13759243200
Effective search space used: 13759243200
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 67 (29.5 bits)