RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy5490
(352 letters)
>gnl|CDD|238125 cd00204, ANK, ankyrin repeats; ankyrin repeats mediate
protein-protein interactions in very diverse families of
proteins. The number of ANK repeats in a protein can
range from 2 to over 20 (ankyrins, for example). ANK
repeats may occur in combinations with other types of
domains. The structural repeat unit contains two
antiparallel helices and a beta-hairpin, repeats are
stacked in a superhelical arrangement; this alignment
contains 4 consecutive repeats.
Length = 126
Score = 124 bits (313), Expect = 4e-35
Identities = 54/120 (45%), Positives = 77/120 (64%), Gaps = 4/120 (3%)
Query: 63 ESPLTLAACGGHVDLALLLIERGANIEEVNDEGYTPLMEAAREGHEEMVCVLLQKGAKIN 122
+PL LAA GH+++ LL+E GA++ +++G TPL AA+ GH E+V +LL+KGA +N
Sbjct: 8 RTPLHLAASNGHLEVVKLLLENGADVNAKDNDGRTPLHLAAKNGHLEIVKLLLEKGADVN 67
Query: 123 AHTEETQETALTLACCGGFLDVADFLLKNGANIELGAS---TPLMEAAQEGHLELVRYLL 179
A ++ T L LA G LDV LLK+GA++ TPL AA+ GHLE+V+ LL
Sbjct: 68 A-RDKDGNTPLHLAARNGNLDVVKLLLKHGADVNARDKDGRTPLHLAAKNGHLEVVKLLL 126
Score = 117 bits (296), Expect = 1e-32
Identities = 49/123 (39%), Positives = 76/123 (61%), Gaps = 4/123 (3%)
Query: 27 ESPLTLAACGGHVDLALLLIERGANIEEVNMPTDSFESPLTLAACGGHVDLALLLIERGA 86
+PL LAA GH+++ LL+E GA++ N + +PL LAA GH+++ LL+E+GA
Sbjct: 8 RTPLHLAASNGHLEVVKLLLENGADV---NAKDNDGRTPLHLAAKNGHLEIVKLLLEKGA 64
Query: 87 NIEEVNDEGYTPLMEAAREGHEEMVCVLLQKGAKINAHTEETQETALTLACCGGFLDVAD 146
++ + +G TPL AAR G+ ++V +LL+ GA +NA ++ T L LA G L+V
Sbjct: 65 DVNARDKDGNTPLHLAARNGNLDVVKLLLKHGADVNARDKD-GRTPLHLAAKNGHLEVVK 123
Query: 147 FLL 149
LL
Sbjct: 124 LLL 126
Score = 100 bits (250), Expect = 7e-26
Identities = 38/97 (39%), Positives = 56/97 (57%), Gaps = 4/97 (4%)
Query: 90 EVNDEGYTPLMEAAREGHEEMVCVLLQKGAKINAHTEETQETALTLACCGGFLDVADFLL 149
+++G TPL AA GH E+V +LL+ GA +NA + T L LA G L++ LL
Sbjct: 2 ARDEDGRTPLHLAASNGHLEVVKLLLENGADVNAKDND-GRTPLHLAAKNGHLEIVKLLL 60
Query: 150 KNGANIELGAS---TPLMEAAQEGHLELVRYLLDSGA 183
+ GA++ TPL AA+ G+L++V+ LL GA
Sbjct: 61 EKGADVNARDKDGNTPLHLAARNGNLDVVKLLLKHGA 97
Score = 96.7 bits (241), Expect = 1e-24
Identities = 44/118 (37%), Positives = 65/118 (55%), Gaps = 16/118 (13%)
Query: 7 ESPLTLAACGGH-------------VNMPTDSFESPLTLAACGGHVDLALLLIERGANIE 53
+PL LAA GH VN + +PL LAA GH+++ LL+E+GA+
Sbjct: 8 RTPLHLAASNGHLEVVKLLLENGADVNAKDNDGRTPLHLAAKNGHLEIVKLLLEKGAD-- 65
Query: 54 EVNMPTDSFESPLTLAACGGHVDLALLLIERGANIEEVNDEGYTPLMEAAREGHEEMV 111
VN +PL LAA G++D+ LL++ GA++ + +G TPL AA+ GH E+V
Sbjct: 66 -VNARDKDGNTPLHLAARNGNLDVVKLLLKHGADVNARDKDGRTPLHLAAKNGHLEVV 122
Score = 58.5 bits (142), Expect = 7e-11
Identities = 30/88 (34%), Positives = 44/88 (50%), Gaps = 16/88 (18%)
Query: 8 SPLTLAACGGH-------------VNMPTDSFESPLTLAACGGHVDLALLLIERGANIEE 54
+PL LAA GH VN +PL LAA G++D+ LL++ GA++
Sbjct: 42 TPLHLAAKNGHLEIVKLLLEKGADVNARDKDGNTPLHLAARNGNLDVVKLLLKHGADV-- 99
Query: 55 VNMPTDSFESPLTLAACGGHVDLALLLI 82
N +PL LAA GH+++ LL+
Sbjct: 100 -NARDKDGRTPLHLAAKNGHLEVVKLLL 126
>gnl|CDD|205076 pfam12796, Ank_2, Ankyrin repeats (3 copies).
Length = 91
Score = 93.1 bits (232), Expect = 1e-23
Identities = 37/94 (39%), Positives = 58/94 (61%), Gaps = 5/94 (5%)
Query: 30 LTLAACGGHVDLALLLIERGANIEEVNMPTDSFESPLTLAACGGHVDLALLLIERGANIE 89
L LAA G+++L LL+E+GA++ + T L LAA G++++ LL+E GA++
Sbjct: 1 LHLAAKNGNLELVKLLLEKGADVNLGDTDT-----ALHLAARNGNLEIVKLLLEHGADVN 55
Query: 90 EVNDEGYTPLMEAAREGHEEMVCVLLQKGAKINA 123
+ +G T L AAR G+ E+V +LL+ GA IN
Sbjct: 56 AKDKDGNTALHLAARNGNLEIVKLLLEHGADINL 89
Score = 90.8 bits (226), Expect = 7e-23
Identities = 35/88 (39%), Positives = 53/88 (60%), Gaps = 6/88 (6%)
Query: 99 LMEAAREGHEEMVCVLLQKGAKINAHTEETQETALTLACCGGFLDVADFLLKNGANIELG 158
L AA+ G+ E+V +LL+KGA +N +T AL LA G L++ LL++GA++
Sbjct: 1 LHLAAKNGNLELVKLLLEKGADVNLGDTDT---ALHLAARNGNLEIVKLLLEHGADVNAK 57
Query: 159 AS---TPLMEAAQEGHLELVRYLLDSGA 183
T L AA+ G+LE+V+ LL+ GA
Sbjct: 58 DKDGNTALHLAARNGNLEIVKLLLEHGA 85
Score = 85.4 bits (212), Expect = 6e-21
Identities = 38/92 (41%), Positives = 58/92 (63%), Gaps = 3/92 (3%)
Query: 66 LTLAACGGHVDLALLLIERGANIEEVNDEGYTPLMEAAREGHEEMVCVLLQKGAKINAHT 125
L LAA G+++L LL+E+GA++ + T L AAR G+ E+V +LL+ GA +NA
Sbjct: 1 LHLAAKNGNLELVKLLLEKGADVNLGDT--DTALHLAARNGNLEIVKLLLEHGADVNA-K 57
Query: 126 EETQETALTLACCGGFLDVADFLLKNGANIEL 157
++ TAL LA G L++ LL++GA+I L
Sbjct: 58 DKDGNTALHLAARNGNLEIVKLLLEHGADINL 89
Score = 60.3 bits (147), Expect = 8e-12
Identities = 22/65 (33%), Positives = 38/65 (58%), Gaps = 3/65 (4%)
Query: 28 SPLTLAACGGHVDLALLLIERGANIEEVNMPTDSFESPLTLAACGGHVDLALLLIERGAN 87
+ L LAA G++++ LL+E GA++ N + L LAA G++++ LL+E GA+
Sbjct: 30 TALHLAARNGNLEIVKLLLEHGADV---NAKDKDGNTALHLAARNGNLEIVKLLLEHGAD 86
Query: 88 IEEVN 92
I +
Sbjct: 87 INLKD 91
Score = 43.8 bits (104), Expect = 5e-06
Identities = 18/62 (29%), Positives = 28/62 (45%), Gaps = 13/62 (20%)
Query: 8 SPLTLAACGGH-------------VNMPTDSFESPLTLAACGGHVDLALLLIERGANIEE 54
+ L LAA G+ VN + L LAA G++++ LL+E GA+I
Sbjct: 30 TALHLAARNGNLEIVKLLLEHGADVNAKDKDGNTALHLAARNGNLEIVKLLLEHGADINL 89
Query: 55 VN 56
+
Sbjct: 90 KD 91
>gnl|CDD|222984 PHA03100, PHA03100, ankyrin repeat protein; Provisional.
Length = 422
Score = 84.7 bits (210), Expect = 3e-18
Identities = 45/168 (26%), Positives = 68/168 (40%), Gaps = 32/168 (19%)
Query: 44 LLIERGANIEEVNMPTDSFESPLTLAACGGHV-----DLALLLIERGANIEEVNDEGYTP 98
+L++ GA+I N T + +PL + + ++ LL+E GAN+ ++ G TP
Sbjct: 53 ILLDNGADI---NSSTKNNSTPLHYLSNIKYNLTDVKEIVKLLLEYGANVNAPDNNGITP 109
Query: 99 LMEAARE--GHEEMVCVLLQKGAKINAHTEETQETALTLA--CCGGFLDVADFLLKNGAN 154
L+ A + +V LL GA +N E L L L + L+ G +
Sbjct: 110 LLYAISKKSNSYSIVEYLLDNGANVNIKN-SDGENLLHLYLESNKIDLKILKLLIDKGVD 168
Query: 155 I----------ELGA---------STPLMEAAQEGHLELVRYLLDSGA 183
I G TPL A + E V+YLLD GA
Sbjct: 169 INAKNRVNYLLSYGVPINIKDVYGFTPLHYAVYNNNPEFVKYLLDLGA 216
Score = 75.1 bits (185), Expect = 5e-15
Identities = 42/161 (26%), Positives = 67/161 (41%), Gaps = 24/161 (14%)
Query: 16 GGHVNMPTDSFESPLTLAACG--GHVDLALLLIERGANIEEVNMPTDSFESPLTLAACGG 73
G +VN P ++ +PL A + L++ GAN+ N+ E+ L L
Sbjct: 96 GANVNAPDNNGITPLLYAISKKSNSYSIVEYLLDNGANV---NIKNSDGENLLHLYLESN 152
Query: 74 HVDLAL--LLIERGAN----------------IEEVNDEGYTPLMEAAREGHEEMVCVLL 115
+DL + LLI++G + I + G+TPL A + E V LL
Sbjct: 153 KIDLKILKLLIDKGVDINAKNRVNYLLSYGVPINIKDVYGFTPLHYAVYNNNPEFVKYLL 212
Query: 116 QKGAKINAHTEETQETALTLACCGGFLDVADFLLKNGANIE 156
GA N + +T L +A ++ LL NG +I+
Sbjct: 213 DLGANPNL-VNKYGDTPLHIAILNNNKEIFKLLLNNGPSIK 252
Score = 59.3 bits (144), Expect = 8e-10
Identities = 35/123 (28%), Positives = 53/123 (43%), Gaps = 15/123 (12%)
Query: 16 GGHVNMPTDSFESPLTLAACGGHVDLAL--LLIERGANIE---EVNM------PTDS--- 61
G +VN+ E+ L L +DL + LLI++G +I VN P +
Sbjct: 131 GANVNIKNSDGENLLHLYLESNKIDLKILKLLIDKGVDINAKNRVNYLLSYGVPINIKDV 190
Query: 62 -FESPLTLAACGGHVDLALLLIERGANIEEVNDEGYTPLMEAAREGHEEMVCVLLQKGAK 120
+PL A + + L++ GAN VN G TPL A ++E+ +LL G
Sbjct: 191 YGFTPLHYAVYNNNPEFVKYLLDLGANPNLVNKYGDTPLHIAILNNNKEIFKLLLNNGPS 250
Query: 121 INA 123
I
Sbjct: 251 IKT 253
Score = 39.6 bits (93), Expect = 0.002
Identities = 21/110 (19%), Positives = 42/110 (38%), Gaps = 21/110 (19%)
Query: 16 GGHVNMPTDSFESPLTLAACGGHVDLALLLIERGANIEEVNMPTDSFESPLTLAACGGHV 75
G +N+ +PL A + + L++ GAN N+ ++PL +A +
Sbjct: 182 GVPINIKDVYGFTPLHYAVYNNNPEFVKYLLDLGANP---NLVNKYGDTPLHIAILNNNK 238
Query: 76 DLALLLIERGANIEEVNDEGYTPLMEAAREGHEEMVCVLLQKGAKINAHT 125
++ LL+ G +I+ + + +L K +N T
Sbjct: 239 EIFKLLLNNGPSIKTIIE------------------TLLYFKDKDLNTIT 270
>gnl|CDD|223738 COG0666, Arp, FOG: Ankyrin repeat [General function prediction
only].
Length = 235
Score = 70.6 bits (172), Expect = 4e-14
Identities = 46/138 (33%), Positives = 65/138 (47%), Gaps = 10/138 (7%)
Query: 64 SPLTLAACGGHVDLALLLIERGANIEEVNDEGYTPLMEAAREGHE-----EMVCVLLQKG 118
PL AA G + LL+ GA++ + +G TPL AA G+ E+ +LL+ G
Sbjct: 75 LPLHSAASKGDDKIVKLLLASGADVNAKDADGDTPLHLAALNGNPPEGNIEVAKLLLEAG 134
Query: 119 A--KINAHTEETQETALTLACCGGFLDVADFLLKNGANIELGAS---TPLMEAAQEGHLE 173
A +N +E T L A G D+ + LL+ GA+ S T L AA+ G +E
Sbjct: 135 ADLDVNNLRDEDGNTPLHWAALNGDADIVELLLEAGADPNSRNSYGVTALDPAAKNGRIE 194
Query: 174 LVRYLLDSGAQEMQTKHI 191
LV+ LLD G K
Sbjct: 195 LVKLLLDKGLHLSLLKFN 212
Score = 62.5 bits (151), Expect = 2e-11
Identities = 42/171 (24%), Positives = 70/171 (40%), Gaps = 12/171 (7%)
Query: 24 DSFESPLTLAACGGHVDLALLLIERGANIEEVNMPTDSFESPLTLAACGGHVDLALLLIE 83
S L L L + L+ + + LL +
Sbjct: 2 KPSLSALLLINKCFLDLLLVALLLLLSLDLSNPSDKKLNLYLELALLPAASLSELLLKLI 61
Query: 84 RGANIEEVNDEGYTPLMEAAREGHEEMVCVLLQKGAKINAHTEETQETALTLAC-----C 138
++ + +G PL AA +G +++V +LL GA +NA + +T L LA
Sbjct: 62 VDRHLAARDLDGRLPLHSAASKGDDKIVKLLLASGADVNA-KDADGDTPLHLAALNGNPP 120
Query: 139 GGFLDVADFLLKNGANIELGAS------TPLMEAAQEGHLELVRYLLDSGA 183
G ++VA LL+ GA++++ TPL AA G ++V LL++GA
Sbjct: 121 EGNIEVAKLLLEAGADLDVNNLRDEDGNTPLHWAALNGDADIVELLLEAGA 171
Score = 60.6 bits (146), Expect = 1e-10
Identities = 42/112 (37%), Positives = 61/112 (54%), Gaps = 5/112 (4%)
Query: 12 LAACGGHVNMPTDSFESPLTLAA-----CGGHVDLALLLIERGANIEEVNMPTDSFESPL 66
L A G VN ++PL LAA G++++A LL+E GA+++ N+ + +PL
Sbjct: 92 LLASGADVNAKDADGDTPLHLAALNGNPPEGNIEVAKLLLEAGADLDVNNLRDEDGNTPL 151
Query: 67 TLAACGGHVDLALLLIERGANIEEVNDEGYTPLMEAAREGHEEMVCVLLQKG 118
AA G D+ LL+E GA+ N G T L AA+ G E+V +LL KG
Sbjct: 152 HWAALNGDADIVELLLEAGADPNSRNSYGVTALDPAAKNGRIELVKLLLDKG 203
Score = 59.1 bits (142), Expect = 3e-10
Identities = 45/158 (28%), Positives = 70/158 (44%), Gaps = 12/158 (7%)
Query: 1 MPTDSFESPLTLAACGGHVNMPTDSFESPLTLAACGGHVDLALLLIERGANIEEVNMPTD 60
+P S L H+ PL AA G + LL+ GA++ +
Sbjct: 48 LPAASLSELLLKLIVDRHLAARDLDGRLPLHSAASKGDDKIVKLLLASGADVNAKDAD-- 105
Query: 61 SFESPLTLAA-----CGGHVDLALLLIERGA---NIEEVNDEGYTPLMEAAREGHEEMVC 112
++PL LAA G++++A LL+E GA +++G TPL AA G ++V
Sbjct: 106 -GDTPLHLAALNGNPPEGNIEVAKLLLEAGADLDVNNLRDEDGNTPLHWAALNGDADIVE 164
Query: 113 VLLQKGAKINAHTEETQETALTLACCGGFLDVADFLLK 150
+LL+ GA N+ TAL A G +++ LL
Sbjct: 165 LLLEAGADPNSRNSYGV-TALDPAAKNGRIELVKLLLD 201
>gnl|CDD|165206 PHA02875, PHA02875, ankyrin repeat protein; Provisional.
Length = 413
Score = 59.2 bits (143), Expect = 8e-10
Identities = 43/127 (33%), Positives = 63/127 (49%), Gaps = 3/127 (2%)
Query: 28 SPLTLAACGGHVDLALLLIERGANIEEVNMPTDSFESPLTLAACGGHVDLALLLIERGAN 87
+PL LA +D+ LLI RGA+ + N TD F SPL LA G + LLI+ A
Sbjct: 104 TPLHLATILKKLDIMKLLIARGADPDIPN--TDKF-SPLHLAVMMGDIKGIELLIDHKAC 160
Query: 88 IEEVNDEGYTPLMEAAREGHEEMVCVLLQKGAKINAHTEETQETALTLACCGGFLDVADF 147
++ + G TPL+ A +G + +LL GA I+ + AL A +D+
Sbjct: 161 LDIEDCCGCTPLIIAMAKGDIAICKMLLDSGANIDYFGKNGCVAALCYAIENNKIDIVRL 220
Query: 148 LLKNGAN 154
+K GA+
Sbjct: 221 FIKRGAD 227
Score = 57.7 bits (139), Expect = 3e-09
Identities = 48/161 (29%), Positives = 79/161 (49%), Gaps = 10/161 (6%)
Query: 28 SPLTLAACGGHVDLALLLIERGANIEEVNMPTDSFESPLTLAACGGHVDLALLLIERGAN 87
SP+ LA + LL++ GA I +V P ES L A G V L++ G
Sbjct: 37 SPIKLAMKFRDSEAIKLLMKHGA-IPDVKYP--DIESELHDAVEEGDVKAVEELLDLGKF 93
Query: 88 IEEV-NDEGYTPLMEAAREGHEEMVCVLLQKGAKIN-AHTEETQETALTLACCGGFLDVA 145
++V +G TPL A +++ +L+ +GA + +T++ + L LA G +
Sbjct: 94 ADDVFYKDGMTPLHLATILKKLDIMKLLIARGADPDIPNTDKF--SPLHLAVMMGDIKGI 151
Query: 146 DFLLKNGA--NIELG-ASTPLMEAAQEGHLELVRYLLDSGA 183
+ L+ + A +IE TPL+ A +G + + + LLDSGA
Sbjct: 152 ELLIDHKACLDIEDCCGCTPLIIAMAKGDIAICKMLLDSGA 192
Score = 42.7 bits (100), Expect = 2e-04
Identities = 35/128 (27%), Positives = 57/128 (44%), Gaps = 8/128 (6%)
Query: 11 TLAACGGHVNMPTDSFESPLTLAACGGHVDLALLLIERGA--NIEEVNMPTDSFESPLTL 68
L A G ++P SPL LA G + LLI+ A +IE+ T PL +
Sbjct: 120 LLIARGADPDIPNTDKFSPLHLAVMMGDIKGIELLIDHKACLDIEDCCGCT-----PLII 174
Query: 69 AACGGHVDLALLLIERGANIEEVNDEGYTPLMEAAREGHE-EMVCVLLQKGAKINAHTEE 127
A G + + +L++ GANI+ G + A E ++ ++V + +++GA N
Sbjct: 175 AMAKGDIAICKMLLDSGANIDYFGKNGCVAALCYAIENNKIDIVRLFIKRGADCNIMFMI 234
Query: 128 TQETALTL 135
E L
Sbjct: 235 EGEECTIL 242
Score = 29.2 bits (65), Expect = 4.0
Identities = 28/115 (24%), Positives = 51/115 (44%), Gaps = 5/115 (4%)
Query: 73 GHVDLALLLIERGANIEEVNDEGYTPLMEAAREGHEEMVCVLLQKGAKINAHTEETQETA 132
G +D+A L++ G N +G +P+ A + E + +L++ GA + + E+
Sbjct: 13 GELDIARRLLDIGINPNFEIYDGISPIKLAMKFRDSEAIKLLMKHGAIPDVKYPDI-ESE 71
Query: 133 LTLACCGGFLDVADFLLKNGANIE----LGASTPLMEAAQEGHLELVRYLLDSGA 183
L A G + + LL G + TPL A L++++ L+ GA
Sbjct: 72 LHDAVEEGDVKAVEELLDLGKFADDVFYKDGMTPLHLATILKKLDIMKLLIARGA 126
>gnl|CDD|222277 pfam13637, Ank_4, Ankyrin repeats (many copies).
Length = 54
Score = 53.0 bits (128), Expect = 1e-09
Identities = 18/52 (34%), Positives = 31/52 (59%)
Query: 64 SPLTLAACGGHVDLALLLIERGANIEEVNDEGYTPLMEAAREGHEEMVCVLL 115
+ L AA G ++L L+E+G +I +++G T L AA G+ E++ +LL
Sbjct: 3 TALHKAAISGRLELVKYLLEKGVDINRTDEDGNTALHIAAENGNLEVLKLLL 54
Score = 53.0 bits (128), Expect = 1e-09
Identities = 24/55 (43%), Positives = 30/55 (54%), Gaps = 1/55 (1%)
Query: 95 GYTPLMEAAREGHEEMVCVLLQKGAKINAHTEETQETALTLACCGGFLDVADFLL 149
G T L +AA G E+V LL+KG IN E+ TAL +A G L+V LL
Sbjct: 1 GRTALHKAAISGRLELVKYLLEKGVDINRTDED-GNTALHIAAENGNLEVLKLLL 54
Score = 43.0 bits (102), Expect = 4e-06
Identities = 16/55 (29%), Positives = 30/55 (54%), Gaps = 3/55 (5%)
Query: 28 SPLTLAACGGHVDLALLLIERGANIEEVNMPTDSFESPLTLAACGGHVDLALLLI 82
+ L AA G ++L L+E+G +I N + + L +AA G++++ LL+
Sbjct: 3 TALHKAAISGRLELVKYLLEKGVDI---NRTDEDGNTALHIAAENGNLEVLKLLL 54
Score = 35.3 bits (82), Expect = 0.003
Identities = 13/24 (54%), Positives = 16/24 (66%)
Query: 161 TPLMEAAQEGHLELVRYLLDSGAQ 184
T L +AA G LELV+YLL+ G
Sbjct: 3 TALHKAAISGRLELVKYLLEKGVD 26
>gnl|CDD|222939 PHA02878, PHA02878, ankyrin repeat protein; Provisional.
Length = 477
Score = 54.5 bits (131), Expect = 4e-08
Identities = 43/156 (27%), Positives = 69/156 (44%), Gaps = 9/156 (5%)
Query: 4 DSFESPLT--LAACGGHVNMPT-DSFESPLTLAACGGHVDLALLLIERGANIEEVNMPTD 60
D E+ +T L + G +NM + L A L LL+ GAN VN+P
Sbjct: 143 DIIEAEITKLLLSYGADINMKDRHKGNTALHYATENKDQRLTELLLSYGAN---VNIPDK 199
Query: 61 SFESPLTLAACGGHVDLALLLIERGANIEEVNDEGYTPL-MEAAREGHEEMVCVLLQKGA 119
+ SPL A + + +L+E GA+ + + G TPL + +++ +LL+ G
Sbjct: 200 TNNSPLHHAVKHYNKPIVHILLENGASTDARDKCGNTPLHISVGYCKDYDILKLLLEHGV 259
Query: 120 KINAHTEETQETALTLACCGGFLDVADFLLKNGANI 155
+NA + TAL + LL+ GA+I
Sbjct: 260 DVNAKSYILGLTALHSSIKSE--RKLKLLLEYGADI 293
Score = 48.3 bits (115), Expect = 3e-06
Identities = 40/163 (24%), Positives = 72/163 (44%), Gaps = 22/163 (13%)
Query: 44 LLIERGANIEEVNMPTDSFESPLTLAACGGHVDLALLLIERGANIEEVNDEGYTPLMEAA 103
LL+ GA+I + + + L A L LL+ GAN+ + +PL A
Sbjct: 152 LLLSYGADINMKDRHKGN--TALHYATENKDQRLTELLLSYGANVNIPDKTNNSPLHHAV 209
Query: 104 REGHEEMVCVLLQKGAKINAHTEETQETALTLACCGGFL---DVADFLLKNGANIELGAS 160
+ ++ +V +LL+ GA + ++ T L ++ G+ D+ LL++G ++ +
Sbjct: 210 KHYNKPIVHILLENGASTD-ARDKCGNTPLHISV--GYCKDYDILKLLLEHGVDVNAKSY 266
Query: 161 ----TPLMEAAQ-EGHLELVRYLLDSGAQEMQTKHIDILEFKK 198
T L + + E L+L LL+ GA I+ L K
Sbjct: 267 ILGLTALHSSIKSERKLKL---LLEYGAD------INSLNSYK 300
Score = 41.4 bits (97), Expect = 5e-04
Identities = 26/104 (25%), Positives = 47/104 (45%), Gaps = 9/104 (8%)
Query: 83 ERGANIEEVNDEGYTPLMEAAREGHEEMVCVLLQKGAKINAHTEETQETALTLACCGGFL 142
+ ++ ++ + ++EA E+ +LL GA IN TAL A
Sbjct: 128 IQTIDLVYIDKKSKDDIIEA------EITKLLLSYGADINMKDRHKGNTALHYATENKDQ 181
Query: 143 DVADFLLKNGANI---ELGASTPLMEAAQEGHLELVRYLLDSGA 183
+ + LL GAN+ + ++PL A + + +V LL++GA
Sbjct: 182 RLTELLLSYGANVNIPDKTNNSPLHHAVKHYNKPIVHILLENGA 225
Score = 38.3 bits (89), Expect = 0.004
Identities = 35/128 (27%), Positives = 62/128 (48%), Gaps = 18/128 (14%)
Query: 16 GGHVNMPTDSFESPLTLAACGGHVDLALLLIERGANIEEVNMPTDSFESPLTLAACGGHV 75
G +VN+P + SPL A + + +L+E GA+ + + + +PL ++ G+
Sbjct: 191 GANVNIPDKTNNSPLHHAVKHYNKPIVHILLENGASTDARDKCGN---TPLHISV--GYC 245
Query: 76 ---DLALLLIERGANIEEVNDE----GYTPLMEAAREGHEEMVCVLLQKGAKINAHTEET 128
D+ LL+E G + VN + G T L + + E + +LL+ GA IN+
Sbjct: 246 KDYDILKLLLEHGVD---VNAKSYILGLTALHSSIKS--ERKLKLLLEYGADINSLNSY- 299
Query: 129 QETALTLA 136
+ T L+ A
Sbjct: 300 KLTPLSSA 307
>gnl|CDD|165207 PHA02876, PHA02876, ankyrin repeat protein; Provisional.
Length = 682
Score = 53.9 bits (129), Expect = 7e-08
Identities = 56/176 (31%), Positives = 86/176 (48%), Gaps = 13/176 (7%)
Query: 16 GGHVNMPTDSFESPLTLAACGGH-VDLALLLIERGANIEEVNMPTDSFESPLTLAAC-GG 73
G VN E+PL L A G+ + LI GA +VN + +PL A+
Sbjct: 297 GADVNAKNIKGETPLYLMAKNGYDTENIRTLIMLGA---DVNAADRLYITPLHQASTLDR 353
Query: 74 HVDLALLLIERGANIEEVNDEGYTPLMEAAREGHEEMVCVLLQKGAKINAHTEETQETAL 133
+ D+ + L+E GAN+ + TP+ AA + ++ LL GA I A +++ TAL
Sbjct: 354 NKDIVITLLELGANVNARDYCDKTPIHYAAVRNNVVIINTLLDYGADIEALSQKIG-TAL 412
Query: 134 TLACCGG--FLDVADFLLKNGANIELG---ASTPLMEAAQEG-HLELVRYLLDSGA 183
A CG ++ V L+ GAN+ STPL A ++ L+++ LLD+GA
Sbjct: 413 HFALCGTNPYMSVKT-LIDRGANVNSKNKDLSTPLHYACKKNCKLDVIEMLLDNGA 467
Score = 47.4 bits (112), Expect = 9e-06
Identities = 45/152 (29%), Positives = 76/152 (50%), Gaps = 10/152 (6%)
Query: 39 VDLALLLIERGANIEEVNMPTDSFESPLTLAACGGHVD-LALLLIERGANIEEVNDEGYT 97
++ +LLL + G ++ ++ D +PL A+ + L L+ERGA++ N +G T
Sbjct: 253 LETSLLLYDAGFSVNSID---DCKNTPLHHASQAPSLSRLVPKLLERGADVNAKNIKGET 309
Query: 98 PLMEAAREGHE-EMVCVLLQKGAKINAHTEETQETALTLACC-GGFLDVADFLLKNGANI 155
PL A+ G++ E + L+ GA +NA + T L A D+ LL+ GAN+
Sbjct: 310 PLYLMAKNGYDTENIRTLIMLGADVNA-ADRLYITPLHQASTLDRNKDIVITLLELGANV 368
Query: 156 ---ELGASTPLMEAAQEGHLELVRYLLDSGAQ 184
+ TP+ AA ++ ++ LLD GA
Sbjct: 369 NARDYCDKTPIHYAAVRNNVVIINTLLDYGAD 400
Score = 46.6 bits (110), Expect = 2e-05
Identities = 41/148 (27%), Positives = 74/148 (50%), Gaps = 8/148 (5%)
Query: 10 LTLAACGGHVNMPTDSFESPLTLAACGGHVDLALLLIERGANIEEVNMPTDSFESPLTLA 69
+TL G +VN ++P+ AA +V + L++ GA+IE + + + L A
Sbjct: 359 ITLLELGANVNARDYCDKTPIHYAAVRNNVVIINTLLDYGADIEAL---SQKIGTALHFA 415
Query: 70 ACGGHVDLAL-LLIERGANIEEVNDEGYTPLMEAAREGHE-EMVCVLLQKGAKINAHTEE 127
CG + +++ LI+RGAN+ N + TPL A ++ + +++ +LL GA +NA +
Sbjct: 416 LCGTNPYMSVKTLIDRGANVNSKNKDLSTPLHYACKKNCKLDVIEMLLDNGADVNAINIQ 475
Query: 128 TQETALTLACCGGFLDVADFLLKNGANI 155
Q L + + + LL GA +
Sbjct: 476 NQYPLLIAL---EYHGIVNILLHYGAEL 500
Score = 38.5 bits (89), Expect = 0.005
Identities = 30/106 (28%), Positives = 51/106 (48%), Gaps = 2/106 (1%)
Query: 77 LALLLIERGANIEEVNDEGYTPLMEAAREGHEEMVCVLLQKGAKINAHTEETQETALTLA 136
+A +L+E GA++ + TP+ AA G+ +MV +LL GA +N + + L A
Sbjct: 160 IAEMLLEGGADVNAKDIYCITPIHYAAERGNAKMVNLLLSYGADVNIIALDDL-SVLECA 218
Query: 137 CCGGFLDVADFLLKNGANIELGASTPLMEAAQEGHLELVRYLLDSG 182
+D ++ N +NI L++A + LE L D+G
Sbjct: 219 VDSKNIDTIKAIIDNRSNINKN-DLSLLKAIRNEDLETSLLLYDAG 263
Score = 30.8 bits (69), Expect = 1.4
Identities = 41/160 (25%), Positives = 72/160 (45%), Gaps = 23/160 (14%)
Query: 41 LALLLIERGANIEEVNMPTDSFESPLTLAACGGHVDLALLLIERGANIEEVNDEGYTPLM 100
+A +L+E GA +VN +P+ AA G+ + LL+ GA++ + + + L
Sbjct: 160 IAEMLLEGGA---DVNAKDIYCITPIHYAAERGNAKMVNLLLSYGADVNIIALDDLSVLE 216
Query: 101 EAAREGHEEMVCVLLQKGAKIN--------AHTEETQETALTLACCGGFLDVADFLLKNG 152
A + + + ++ + IN A E ET+L L G ++
Sbjct: 217 CAVDSKNIDTIKAIIDNRSNINKNDLSLLKAIRNEDLETSLLLYDAGFSVN--------- 267
Query: 153 ANIELGASTPLMEAAQEGHL-ELVRYLLDSGAQEMQTKHI 191
+I+ +TPL A+Q L LV LL+ GA ++ K+I
Sbjct: 268 -SIDDCKNTPLHHASQAPSLSRLVPKLLERGA-DVNAKNI 305
Score = 30.0 bits (67), Expect = 2.2
Identities = 16/45 (35%), Positives = 28/45 (62%), Gaps = 3/45 (6%)
Query: 142 LDVADFLLKNGANI---ELGASTPLMEAAQEGHLELVRYLLDSGA 183
L +A+ LL+ GA++ ++ TP+ AA+ G+ ++V LL GA
Sbjct: 158 LLIAEMLLEGGADVNAKDIYCITPIHYAAERGNAKMVNLLLSYGA 202
>gnl|CDD|215625 PLN03192, PLN03192, Voltage-dependent potassium channel;
Provisional.
Length = 823
Score = 53.3 bits (128), Expect = 1e-07
Identities = 34/123 (27%), Positives = 55/123 (44%), Gaps = 6/123 (4%)
Query: 64 SPLTLAACGGHVDLALLLIERGANIEEVNDEGYTPLMEAAREGHEEMVCVLLQKGAKINA 123
+PL +AA G+ D L+L++ N+ + G T L A H ++ +L + +
Sbjct: 560 TPLHIAASKGYEDCVLVLLKHACNVHIRDANGNTALWNAISAKHHKIFRILYHFASISDP 619
Query: 124 HTEETQETALTLACCGGFLDVADFLLKNGANIEL---GASTPLMEAAQEGHLELVRYLLD 180
H L A L LLK G N++ +T L A E H+++VR L+
Sbjct: 620 H---AAGDLLCTAAKRNDLTAMKELLKQGLNVDSEDHQGATALQVAMAEDHVDMVRLLIM 676
Query: 181 SGA 183
+GA
Sbjct: 677 NGA 679
Score = 49.9 bits (119), Expect = 1e-06
Identities = 48/198 (24%), Positives = 79/198 (39%), Gaps = 22/198 (11%)
Query: 28 SPLTLAACGGHVDLALLLIERGAN--IEEVNMPTDSFESPLTLAACGGHVDLALLLIE-- 83
+PL +AA G+ D L+L++ N I + N T + A H + +L
Sbjct: 560 TPLHIAASKGYEDCVLVLLKHACNVHIRDANGNTALWN-----AISAKHHKIFRILYHFA 614
Query: 84 RGANIEEVNDEGYTPLMEAAREGHEEMVCVLLQKGAKINAHTEETQ-ETALTLACCGGFL 142
++ D L AA+ + LL++G +N +E+ Q TAL +A +
Sbjct: 615 SISDPHAAGD----LLCTAAKRNDLTAMKELLKQG--LNVDSEDHQGATALQVAMAEDHV 668
Query: 143 DVADFLLKNGANI------ELGASTPLMEAAQEGHLELVRYLLDSGAQEMQTKHIDILEF 196
D+ L+ NGA++ + + T L E Q+ L ++DS + D
Sbjct: 669 DMVRLLIMNGADVDKANTDDDFSPTELRELLQKRELGHSITIVDSVPADEPDLGRDGGSR 728
Query: 197 KKRKQEESKAAFCRGSES 214
R Q S CR S
Sbjct: 729 PGRLQGTSSDNQCRPRVS 746
>gnl|CDD|200936 pfam00023, Ank, Ankyrin repeat. Ankyrins are multifunctional
adaptors that link specific proteins to the
membrane-associated, spectrin- actin cytoskeleton. This
repeat-domain is a 'membrane-binding' domain of up to 24
repeated units, and it mediates most of the protein's
binding activities. Repeats 13-24 are especially active,
with known sites of interaction for the Na/K ATPase,
Cl/HCO(3) anion exchanger, voltage-gated sodium channel,
clathrin heavy chain and L1 family cell adhesion
molecules. The ANK repeats are found to form a
contiguous spiral stack such that ion transporters like
the anion exchanger associate in a large central cavity
formed by the ANK repeat spiral, while clathrin and cell
adhesion molecules associate with specific regions
outside this cavity.
Length = 33
Score = 47.2 bits (113), Expect = 1e-07
Identities = 17/33 (51%), Positives = 23/33 (69%)
Query: 94 EGYTPLMEAAREGHEEMVCVLLQKGAKINAHTE 126
+G TPL AAR GH E+V +LL+ GA +NA +
Sbjct: 1 DGNTPLHLAARNGHLEVVKLLLEAGADVNARDK 33
Score = 40.2 bits (95), Expect = 3e-05
Identities = 14/23 (60%), Positives = 19/23 (82%)
Query: 161 TPLMEAAQEGHLELVRYLLDSGA 183
TPL AA+ GHLE+V+ LL++GA
Sbjct: 4 TPLHLAARNGHLEVVKLLLEAGA 26
Score = 37.9 bits (89), Expect = 2e-04
Identities = 12/31 (38%), Positives = 20/31 (64%)
Query: 63 ESPLTLAACGGHVDLALLLIERGANIEEVND 93
+PL LAA GH+++ LL+E GA++ +
Sbjct: 3 NTPLHLAARNGHLEVVKLLLEAGADVNARDK 33
Score = 37.1 bits (87), Expect = 3e-04
Identities = 12/30 (40%), Positives = 20/30 (66%)
Query: 27 ESPLTLAACGGHVDLALLLIERGANIEEVN 56
+PL LAA GH+++ LL+E GA++ +
Sbjct: 3 NTPLHLAARNGHLEVVKLLLEAGADVNARD 32
Score = 35.6 bits (83), Expect = 0.001
Identities = 11/26 (42%), Positives = 15/26 (57%)
Query: 130 ETALTLACCGGFLDVADFLLKNGANI 155
T L LA G L+V LL+ GA++
Sbjct: 3 NTPLHLAARNGHLEVVKLLLEAGADV 28
>gnl|CDD|222931 PHA02798, PHA02798, ankyrin-like protein; Provisional.
Length = 489
Score = 52.5 bits (126), Expect = 1e-07
Identities = 35/126 (27%), Positives = 60/126 (47%), Gaps = 17/126 (13%)
Query: 39 VDLALLLIERGANIEEVNMPTDSFESPLTLAACGGHV---DLALLLIERGANIEEVNDEG 95
+D+ +LIE GA+I N E+PL G++ ++ L +IE GA+ ++ +G
Sbjct: 89 LDIVKILIENGADI---NKKNSDGETPLYCLLSNGYINNLEILLFMIENGADTTLLDKDG 145
Query: 96 YTPLMEAAREGHE---EMVCVLLQKGAKINAHTEETQETALTLACCGGF------LDVAD 146
+T L + H E++ +LL+KG IN H + + TL C + D+
Sbjct: 146 FTMLQVYLQSNHHIDIEIIKLLLEKGVDINTHNNKEKYD--TLHCYFKYNIDRIDADILK 203
Query: 147 FLLKNG 152
+ NG
Sbjct: 204 LFVDNG 209
Score = 41.4 bits (97), Expect = 5e-04
Identities = 42/155 (27%), Positives = 65/155 (41%), Gaps = 45/155 (29%)
Query: 39 VDLALLLIERGANIEEVNMPTDSFESPL-TLAA----CGGHVDLALLLIERGANIEEVND 93
D+ L I GAN VN + + +PL T+ + +D+ +LIE GA+I + N
Sbjct: 51 TDIVKLFINLGAN---VNGLDNEYSTPLCTILSNIKDYKHMLDIVKILIENGADINKKNS 107
Query: 94 EGYTPLMEAAREGHEEMVCVLLQKGAKINAHTEETQETALTLACCGGFLDVADFLLKNGA 153
+G TPL LL G IN L++ F+++NGA
Sbjct: 108 DGETPL------------YCLLSNGY-INN------------------LEILLFMIENGA 136
Query: 154 NIELGAS---TPLMEAAQEGH---LELVRYLLDSG 182
+ L T L Q H +E+++ LL+ G
Sbjct: 137 DTTLLDKDGFTMLQVYLQSNHHIDIEIIKLLLEKG 171
Score = 31.7 bits (72), Expect = 0.67
Identities = 22/91 (24%), Positives = 37/91 (40%), Gaps = 3/91 (3%)
Query: 39 VDLALLLIERGANIEEVNMPTDS--FESPLTLAACGGHVDLALL-LIERGANIEEVNDEG 95
D+ L ++ G I + N E +L +L I +I +V++ G
Sbjct: 199 ADILKLFVDNGFIINKENKSHKKKFMEYLNSLLYDNKRFKKNILDFIFSYIDINQVDELG 258
Query: 96 YTPLMEAAREGHEEMVCVLLQKGAKINAHTE 126
+ PL + + ++ LLQ G IN TE
Sbjct: 259 FNPLYYSVSHNNRKIFEYLLQLGGDINIITE 289
>gnl|CDD|165205 PHA02874, PHA02874, ankyrin repeat protein; Provisional.
Length = 434
Score = 50.0 bits (119), Expect = 8e-07
Identities = 47/196 (23%), Positives = 78/196 (39%), Gaps = 28/196 (14%)
Query: 16 GGHVNMPTDSFESPLTLAACGGHVDLALLLIERGANIEEVNMPTDSFESPLTLAACGGHV 75
G +N+ D +PL A G + L I+ GA+I +N T LT G H
Sbjct: 25 GNCINISVDETTTPLIDAIRSGDAKIVELFIKHGADINHIN--TKIPHPLLTAIKIGAHD 82
Query: 76 DLALLLI----------------------ERGANIEEVNDEGYTPLMEAAREGHEEMVCV 113
+ LL+ + G ++ + E T L A ++G E + +
Sbjct: 83 IIKLLIDNGVDTSILPIPCIEKDMIKTILDCGIDVNIKDAELKTFLHYAIKKGDLESIKM 142
Query: 114 LLQKGAKINAHTEETQETALTLACCGGFLDVADFLLKNGANIELG---ASTPLMEAAQEG 170
L + GA +N ++ + +A F D+ LL+ GA + +PL AA+ G
Sbjct: 143 LFEYGADVNIE-DDNGCYPIHIAIKHNFFDIIKLLLEKGAYANVKDNNGESPLHNAAEYG 201
Query: 171 HLELVRYLLDSGAQEM 186
++ L+D G M
Sbjct: 202 DYACIKLLIDHGNHIM 217
Score = 44.6 bits (105), Expect = 5e-05
Identities = 49/155 (31%), Positives = 68/155 (43%), Gaps = 11/155 (7%)
Query: 6 FESPLTLAACGGHVNMPTDSFESPLTLAACGGHVDLALLLIERGANIEEVNMPTDSFESP 65
ES L G VN+ D+ P+ +A D+ LL+E+GA N+ ++ ESP
Sbjct: 137 LESIKMLFEYGADVNIEDDNGCYPIHIAIKHNFFDIIKLLLEKGAYA---NVKDNNGESP 193
Query: 66 LTLAACGGHVDLALLLIERGANIEEVNDEGYTPLMEAAREGHEEMVCVLLQKGAKINAHT 125
L AA G LLI+ G +I G+TPL A H LL A IN
Sbjct: 194 LHNAAEYGDYACIKLLIDHGNHIMNKCKNGFTPLHNAII--HNRSAIELLINNASIND-Q 250
Query: 126 EETQETALTLAC---CGGFLDVADFLLKNGANIEL 157
+ T L A C +D+ D LL + A+I +
Sbjct: 251 DIDGSTPLHHAINPPCD--IDIIDILLYHKADISI 283
Score = 40.3 bits (94), Expect = 0.001
Identities = 44/170 (25%), Positives = 70/170 (41%), Gaps = 27/170 (15%)
Query: 11 TLAACGGHVNMPTDSFESPLTLAACGGHVDLALLLIERGANIEEVNMPTDSFESPLTLAA 70
T+ CG VN+ ++ L A G ++ +L E GA +VN+ D+ P+ +A
Sbjct: 109 TILDCGIDVNIKDAELKTFLHYAIKKGDLESIKMLFEYGA---DVNIEDDNGCYPIHIAI 165
Query: 71 CGGHVDLALLLIERGANIEEVNDEGYTPLMEAAREGHEEMVCVLLQKGAKINAHTEETQE 130
D+ LL+E+GA ++ G +PL AA G + +L+ G I
Sbjct: 166 KHNFFDIIKLLLEKGAYANVKDNNGESPLHNAAEYGDYACIKLLIDHGNHI--------- 216
Query: 131 TALTLACCGGFLDVADFLLKNGANIEL------------GASTPLMEAAQ 168
C GF + + ++ N + IEL STPL A
Sbjct: 217 ---MNKCKNGFTPLHNAIIHNRSAIELLINNASINDQDIDGSTPLHHAIN 263
Score = 32.2 bits (73), Expect = 0.35
Identities = 29/99 (29%), Positives = 46/99 (46%), Gaps = 5/99 (5%)
Query: 4 DSFESPLTLAACGGHVNMPTDSFESPLTLAACGGHVDLALLLIERGANIEEVNMPTDSFE 63
+ F+ L G + N+ ++ ESPL AA G LLI+ G +I +N + F
Sbjct: 168 NFFDIIKLLLEKGAYANVKDNNGESPLHNAAEYGDYACIKLLIDHGNHI--MNKCKNGF- 224
Query: 64 SPLTLAACGGHVDLALLLIERGANIEEVNDEGYTPLMEA 102
+PL A + LL+ N ++++ G TPL A
Sbjct: 225 TPLHNAIIHNRSAIELLINNASINDQDID--GSTPLHHA 261
>gnl|CDD|197603 smart00248, ANK, ankyrin repeats. Ankyrin repeats are about 33
amino acids long and occur in at least four consecutive
copies. They are involved in protein-protein
interactions. The core of the repeat seems to be an
helix-loop-helix structure.
Length = 30
Score = 43.7 bits (104), Expect = 2e-06
Identities = 17/30 (56%), Positives = 21/30 (70%)
Query: 94 EGYTPLMEAAREGHEEMVCVLLQKGAKINA 123
+G TPL AA G+ E+V +LL KGA INA
Sbjct: 1 DGRTPLHLAAENGNLEVVKLLLDKGADINA 30
Score = 37.6 bits (88), Expect = 3e-04
Identities = 14/23 (60%), Positives = 18/23 (78%)
Query: 161 TPLMEAAQEGHLELVRYLLDSGA 183
TPL AA+ G+LE+V+ LLD GA
Sbjct: 4 TPLHLAAENGNLEVVKLLLDKGA 26
Score = 34.5 bits (80), Expect = 0.003
Identities = 11/27 (40%), Positives = 20/27 (74%)
Query: 28 SPLTLAACGGHVDLALLLIERGANIEE 54
+PL LAA G++++ LL+++GA+I
Sbjct: 4 TPLHLAAENGNLEVVKLLLDKGADINA 30
Score = 34.5 bits (80), Expect = 0.003
Identities = 11/27 (40%), Positives = 20/27 (74%)
Query: 64 SPLTLAACGGHVDLALLLIERGANIEE 90
+PL LAA G++++ LL+++GA+I
Sbjct: 4 TPLHLAAENGNLEVVKLLLDKGADINA 30
Score = 34.5 bits (80), Expect = 0.003
Identities = 12/27 (44%), Positives = 14/27 (51%)
Query: 131 TALTLACCGGFLDVADFLLKNGANIEL 157
T L LA G L+V LL GA+I
Sbjct: 4 TPLHLAAENGNLEVVKLLLDKGADINA 30
>gnl|CDD|205784 pfam13606, Ank_3, Ankyrin repeat. Ankyrins are multifunctional
adaptors that link specific proteins to the
membrane-associated, spectrin- actin cytoskeleton. This
repeat-domain is a 'membrane-binding' domain of up to 24
repeated units, and it mediates most of the protein's
binding activities.
Length = 30
Score = 43.0 bits (102), Expect = 3e-06
Identities = 17/30 (56%), Positives = 22/30 (73%)
Query: 94 EGYTPLMEAAREGHEEMVCVLLQKGAKINA 123
+G TPL AAR G+ E+V +LL+ GA INA
Sbjct: 1 DGNTPLHLAARNGNLELVKLLLEHGADINA 30
Score = 36.1 bits (84), Expect = 0.001
Identities = 14/23 (60%), Positives = 18/23 (78%)
Query: 161 TPLMEAAQEGHLELVRYLLDSGA 183
TPL AA+ G+LELV+ LL+ GA
Sbjct: 4 TPLHLAARNGNLELVKLLLEHGA 26
Score = 33.8 bits (78), Expect = 0.005
Identities = 11/26 (42%), Positives = 16/26 (61%)
Query: 130 ETALTLACCGGFLDVADFLLKNGANI 155
T L LA G L++ LL++GA+I
Sbjct: 3 NTPLHLAARNGNLELVKLLLEHGADI 28
Score = 32.6 bits (75), Expect = 0.015
Identities = 13/26 (50%), Positives = 19/26 (73%)
Query: 27 ESPLTLAACGGHVDLALLLIERGANI 52
+PL LAA G+++L LL+E GA+I
Sbjct: 3 NTPLHLAARNGNLELVKLLLEHGADI 28
Score = 32.6 bits (75), Expect = 0.015
Identities = 13/26 (50%), Positives = 19/26 (73%)
Query: 63 ESPLTLAACGGHVDLALLLIERGANI 88
+PL LAA G+++L LL+E GA+I
Sbjct: 3 NTPLHLAARNGNLELVKLLLEHGADI 28
>gnl|CDD|222980 PHA03095, PHA03095, ankyrin-like protein; Provisional.
Length = 471
Score = 47.7 bits (114), Expect = 5e-06
Identities = 44/161 (27%), Positives = 72/161 (44%), Gaps = 35/161 (21%)
Query: 44 LLIERGANIEEVNMPTDSFESPLTL---AACGGHVDLALLLIERGANIEEVNDEGYTPL- 99
L+ GA+ VN + ++PL L + D+ LL+E GA++ G+TPL
Sbjct: 32 RLLAAGAD---VNFRGEYGKTPLHLYLHYSSEKVKDIVRLLLEAGADVNAPERCGFTPLH 88
Query: 100 --MEAAREGHEEMVCVLLQKGAKINAHTEETQETALTLACCGGF---LDVADFLLKNGAN 154
+ A +++ +L++ GA +NA ++ T L GF V LL+ GA+
Sbjct: 89 LYLYNATT--LDVIKLLIKAGADVNA-KDKVGRTPLH-VYLSGFNINPKVIRLLLRKGAD 144
Query: 155 IE---LGASTPLMEAAQEGH---------LELVRYLLDSGA 183
+ L TPL +EL+R L+D+GA
Sbjct: 145 VNALDLYGMTPL-------AVLLKSRNANVELLRLLIDAGA 178
Score = 43.5 bits (103), Expect = 1e-04
Identities = 28/94 (29%), Positives = 39/94 (41%), Gaps = 6/94 (6%)
Query: 45 LIERGANIEEVNMPTDSFESPLTLAACGGHVDLALL--LIERGANIEEVNDEGYTPLMEA 102
LI G + +M +PL A G +L+ L+ G +I N G TPL A
Sbjct: 208 LIRAGCDPAATDM---LGNTPLHSMATGSSCKRSLVLPLLIAGISINARNRYGQTPLHYA 264
Query: 103 AREGHEEMVCVLLQKGAKINAHTEETQETALTLA 136
A + L+ GA INA + T L+L
Sbjct: 265 AVFNNPRACRRLIALGADINAVS-SDGNTPLSLM 297
Score = 41.5 bits (98), Expect = 4e-04
Identities = 25/93 (26%), Positives = 41/93 (44%), Gaps = 6/93 (6%)
Query: 10 LTLAACGGHVNMPTDSFESPLTLAACGGHV-DLALLLIERGANIEEVNMPTDSFESPLTL 68
L G VN P +PL L D+ LLI+ GA++ + + PL +
Sbjct: 67 RLLLEAGADVNAPERCGFTPLHLYLYNATTLDVIKLLIKAGADVNAKDKVGRT---PLHV 123
Query: 69 AACGG--HVDLALLLIERGANIEEVNDEGYTPL 99
G + + LL+ +GA++ ++ G TPL
Sbjct: 124 YLSGFNINPKVIRLLLRKGADVNALDLYGMTPL 156
Score = 33.5 bits (77), Expect = 0.17
Identities = 29/109 (26%), Positives = 41/109 (37%), Gaps = 5/109 (4%)
Query: 11 TLAACGGHVNMPTDSFESPLTLAACGGHVDLALL--LIERGANIEEVNMPTDSFESPLTL 68
L G +PL A G +L+ L+ G +I N ++PL
Sbjct: 207 ELIRAGCDPAATDMLGNTPLHSMATGSSCKRSLVLPLLIAGISINARN---RYGQTPLHY 263
Query: 69 AACGGHVDLALLLIERGANIEEVNDEGYTPLMEAAREGHEEMVCVLLQK 117
AA + LI GA+I V+ +G TPL R + V L K
Sbjct: 264 AAVFNNPRACRRLIALGADINAVSSDGNTPLSLMVRNNNGRAVRAALAK 312
>gnl|CDD|206028 pfam13857, Ank_5, Ankyrin repeats (many copies).
Length = 56
Score = 42.3 bits (100), Expect = 8e-06
Identities = 21/57 (36%), Positives = 30/57 (52%), Gaps = 2/57 (3%)
Query: 81 LIERG-ANIEEVNDEGYTPLMEAAREGHEEMVCVLLQKGAKINAHTEETQETALTLA 136
L+E G ++ + G TPL AA+ G E+V LL+ G +N + TAL LA
Sbjct: 1 LLEHGPIDLNATDGNGNTPLHLAAKYGALELVQWLLKPGVDLNLRDSDGL-TALDLA 56
Score = 36.9 bits (86), Expect = 7e-04
Identities = 17/58 (29%), Positives = 30/58 (51%), Gaps = 2/58 (3%)
Query: 45 LIERGANIEEVNMPTDSFESPLTLAACGGHVDLALLLIERGANIEEVNDEGYTPLMEA 102
L+E G ++N + +PL LAA G ++L L++ G ++ + +G T L A
Sbjct: 1 LLEHGP--IDLNATDGNGNTPLHLAAKYGALELVQWLLKPGVDLNLRDSDGLTALDLA 56
Score = 35.8 bits (83), Expect = 0.002
Identities = 15/44 (34%), Positives = 22/44 (50%)
Query: 114 LLQKGAKINAHTEETQETALTLACCGGFLDVADFLLKNGANIEL 157
LL+ G T+ T L LA G L++ +LLK G ++ L
Sbjct: 1 LLEHGPIDLNATDGNGNTPLHLAAKYGALELVQWLLKPGVDLNL 44
Score = 31.9 bits (73), Expect = 0.042
Identities = 16/40 (40%), Positives = 23/40 (57%), Gaps = 4/40 (10%)
Query: 148 LLKNGAN----IELGASTPLMEAAQEGHLELVRYLLDSGA 183
LL++G + +TPL AA+ G LELV++LL G
Sbjct: 1 LLEHGPIDLNATDGNGNTPLHLAAKYGALELVQWLLKPGV 40
Score = 30.8 bits (70), Expect = 0.096
Identities = 15/58 (25%), Positives = 26/58 (44%), Gaps = 3/58 (5%)
Query: 12 LAACGGHVNMPTDSFESPLTLAACGGHVDLALLLIERGANIEEVNMPTDSFESPLTLA 69
L +N + +PL LAA G ++L L++ G ++ N+ + L LA
Sbjct: 2 LEHGPIDLNATDGNGNTPLHLAAKYGALELVQWLLKPGVDL---NLRDSDGLTALDLA 56
>gnl|CDD|236776 PRK10856, PRK10856, cytoskeletal protein RodZ; Provisional.
Length = 331
Score = 46.6 bits (111), Expect = 9e-06
Identities = 21/95 (22%), Positives = 31/95 (32%), Gaps = 6/95 (6%)
Query: 257 DQSSVVNVATDST---LSHVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQN 313
DQSS L T+T + ++ + ++A + QN
Sbjct: 148 DQSSAELSQNSGQSVPLDTSTTTDPATTPAPAAPVDTTPTNSQTPAVATAPAPAVDPQQN 207
Query: 314 ASPATQSSNVATQNTPV---TNLNNAVATLKTAVA 345
A A +NV T TP + A L T A
Sbjct: 208 AVVAPSQANVDTAATPAPAAPATPDGAAPLPTDQA 242
Score = 37.3 bits (87), Expect = 0.009
Identities = 15/91 (16%), Positives = 30/91 (32%)
Query: 235 PSGKPSDPPNSGKTKEPPNSGDDQSSVVNVATDSTLSHVTSTPSTSAATTTQTTSPPVVS 294
+ S P ++ T +P + + V T+S V + P+ + P +
Sbjct: 156 QNSGQSVPLDTSTTTDPATTPAPAAPVDTTPTNSQTPAVATAPAPAVDPQQNAVVAPSQA 215
Query: 295 AASQNTSSATQSSTAATQNASPATQSSNVAT 325
+ A + A T + V+T
Sbjct: 216 NVDTAATPAPAAPATPDGAAPLPTDQAGVST 246
Score = 34.2 bits (79), Expect = 0.072
Identities = 13/85 (15%), Positives = 22/85 (25%), Gaps = 5/85 (5%)
Query: 256 DDQSSVVNVATDSTLSHV-TSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNA 314
++ T T S + A T + Q + S A
Sbjct: 166 TSTTTDPATTPAPAAPVDTTPTNSQTPAVATAPAP----AVDPQQNAVVAPSQANVDTAA 221
Query: 315 SPATQSSNVATQNTPVTNLNNAVAT 339
+PA + P+ V+T
Sbjct: 222 TPAPAAPATPDGAAPLPTDQAGVST 246
Score = 33.8 bits (78), Expect = 0.11
Identities = 22/105 (20%), Positives = 35/105 (33%), Gaps = 10/105 (9%)
Query: 211 GSESTP-TVSSKQAVKTTFPPVSTEPSGKPSDPPNSGKTKEPPNSGDDQSSVVNVATDST 269
+S P S+ TT P + + + + T P Q++VV + +
Sbjct: 158 SGQSVPLDTSTTTDPATTPAPAAPVDTTPTNSQTPAVATAPAPAVDPQQNAVVAPSQANV 217
Query: 270 LSHVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNA 314
+ T P+ A P T A S+ AA NA
Sbjct: 218 DTAATPAPAAPATPDGAAPLP---------TDQAGVSTPAADPNA 253
Score = 33.1 bits (76), Expect = 0.21
Identities = 16/93 (17%), Positives = 29/93 (31%), Gaps = 4/93 (4%)
Query: 230 PVSTEPSGKPSDPPNSGKTKEPPNSGDDQSSVVNVATDSTLSHVTSTPS-TSAATTTQTT 288
P+ T + P+ P P ++ S VAT + + + + T
Sbjct: 163 PLDTSTTTDPATTPAPA---APVDTTPTNSQTPAVATAPAPAVDPQQNAVVAPSQANVDT 219
Query: 289 SPPVVSAASQNTSSATQSSTAATQNASPATQSS 321
+ AA A T ++PA +
Sbjct: 220 AATPAPAAPATPDGAAPLPTDQAGVSTPAADPN 252
Score = 29.2 bits (66), Expect = 3.4
Identities = 21/69 (30%), Positives = 31/69 (44%), Gaps = 2/69 (2%)
Query: 273 VTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNASPA-TQSSNVATQNTP-V 330
+T+ S+A +Q + V S T AT + AA + +P +Q+ VAT P V
Sbjct: 143 ITTMADQSSAELSQNSGQSVPLDTSTTTDPATTPAPAAPVDTTPTNSQTPAVATAPAPAV 202
Query: 331 TNLNNAVAT 339
NAV
Sbjct: 203 DPQQNAVVA 211
>gnl|CDD|233161 TIGR00870, trp, transient-receptor-potential calcium channel
protein. The Transient Receptor Potential Ca2+ Channel
(TRP-CC) Family (TC. 1.A.4)The TRP-CC family has also
been called the store-operated calcium channel (SOC)
family. The prototypical members include the Drosophila
retinal proteinsTRP and TRPL (Montell and Rubin, 1989;
Hardie and Minke, 1993). SOC members of the family
mediate the entry of extracellular Ca2+ into cells in
responseto depletion of intracellular Ca2+ stores
(Clapham, 1996) and agonist stimulated production of
inositol-1,4,5 trisphosphate (IP3). One member of the
TRP-CCfamily, mammalian Htrp3, has been shown to form a
tight complex with the IP3 receptor (TC #1.A.3.2.1).
This interaction is apparently required for IP3
tostimulate Ca2+ release via Htrp3. The vanilloid
receptor subtype 1 (VR1), which is the receptor for
capsaicin (the ?hot? ingredient in chili peppers) and
servesas a heat-activated ion channel in the pain
pathway (Caterina et al., 1997), is also a member of
this family. The stretch-inhibitable non-selective
cation channel(SIC) is identical to the vanilloid
receptor throughout all of its first 700 residues, but
it exhibits a different sequence in its last 100
residues. VR1 and SICtransport monovalent cations as
well as Ca2+. VR1 is about 10x more permeable to Ca2+
than to monovalent ions. Ca2+ overload probably causes
cell deathafter chronic exposure to capsaicin.
(McCleskey and Gold, 1999) [Transport and binding
proteins, Cations and iron carrying compounds].
Length = 743
Score = 42.8 bits (101), Expect = 2e-04
Identities = 50/176 (28%), Positives = 72/176 (40%), Gaps = 39/176 (22%)
Query: 23 TDSF---ESPLTLAACGGHVDLALLLIERGANI-------EEVNMP-TDSF---ESPLTL 68
T F + L LAA + ++ LL+ERGA++ V DSF ESPL
Sbjct: 122 TSEFTPGITALHLAAHRQNYEIVKLLLERGASVPARACGDFFVKSQGVDSFYHGESPLNA 181
Query: 69 AACGGHVDLALLLIERGANIEEVNDEGYTPL----MEAA-REGHEEMVCVLLQKGAKINA 123
AAC G + LL E A+I + G T L ME + +EE+ C + +
Sbjct: 182 AACLGSPSIVALLSEDPADILTADSLGNTLLHLLVMENEFKAEYEELSCQMYNFALSLLD 241
Query: 124 HTEETQETALTLACCGGFLDVADFLLKNGANIELGASTPLMEAAQEGHLELVRYLL 179
+++E + L N + TPL AA+EG + L R L
Sbjct: 242 KLRDSKELEVIL---------------NHQGL-----TPLKLAAKEGRIVLFRLKL 277
Score = 38.1 bits (89), Expect = 0.006
Identities = 36/159 (22%), Positives = 59/159 (37%), Gaps = 36/159 (22%)
Query: 27 ESPLTLAACGG-HVDLALLLIERGANIEEVNMPTDSFESPLTLAACGGHVDL--ALLLI- 82
S L +AA +++L LL+ L A +VD A+LL
Sbjct: 53 RSALFVAAIENENLELTELLLNLSCRGAV--------GDTLLHAISLEYVDAVEAILLHL 104
Query: 83 ----ERGANIEEVNDE-------GYTPLMEAAREGHEEMVCVLLQKGAKIN--AHTEETQ 129
+ +E ND+ G T L AA + E+V +LL++GA + A +
Sbjct: 105 LAAFRKSGPLELANDQYTSEFTPGITALHLAAHRQNYEIVKLLLERGASVPARACGDFFV 164
Query: 130 ETA-----------LTLACCGGFLDVADFLLKNGANIEL 157
++ L A C G + L ++ A+I
Sbjct: 165 KSQGVDSFYHGESPLNAAACLGSPSIVALLSEDPADILT 203
Score = 36.6 bits (85), Expect = 0.020
Identities = 34/122 (27%), Positives = 53/122 (43%), Gaps = 20/122 (16%)
Query: 23 TDSF---ESPLTLAACGGHVDLALLLIERGANIE-------EV----NMPTDSFESPLTL 68
DSF ESPL AAC G + LL E A+I + M + F++
Sbjct: 169 VDSFYHGESPLNAAACLGSPSIVALLSEDPADILTADSLGNTLLHLLVMENE-FKAEYEE 227
Query: 69 AACGGHVDLALLLIERGANIEEV----NDEGYTPLMEAAREGHEEMVCVLLQKGAKINAH 124
+C + + AL L+++ + +E+ N +G TPL AA+EG + + L K
Sbjct: 228 LSCQMY-NFALSLLDKLRDSKELEVILNHQGLTPLKLAAKEGRIVLFRLKLAIKYKQKKF 286
Query: 125 TE 126
Sbjct: 287 VA 288
>gnl|CDD|114270 pfam05539, Pneumo_att_G, Pneumovirinae attachment membrane
glycoprotein G.
Length = 408
Score = 41.6 bits (97), Expect = 4e-04
Identities = 33/120 (27%), Positives = 45/120 (37%), Gaps = 3/120 (2%)
Query: 212 SESTPTVSSKQAVKTTFPPVSTEPSGKPSDPPNSGKTKEPPNSGDDQSSVVNVATDSTLS 271
ST V ++ ++ P TEP PS SG + PP S Q
Sbjct: 213 LSSTEPVGTQGTTTSSNPEPQTEP--PPSQRGPSGSPQHPP-STTSQDQSTTGDGQEHTQ 269
Query: 272 HVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNASPATQSSNVATQNTPVT 331
+ P+TS + +T+ P + Q T T TA TQ+ S SS Q P T
Sbjct: 270 RRKTPPATSNRRSPHSTATPPPTTKRQETGRPTPRPTATTQSGSSPPHSSPPGVQANPTT 329
Score = 38.5 bits (89), Expect = 0.004
Identities = 29/113 (25%), Positives = 43/113 (38%), Gaps = 5/113 (4%)
Query: 217 TVSSKQAVKTTFPPVSTEPSGKPSDPPNSGKTKEPPNSGDDQSSVVNVATDSTLSHVTST 276
T ++ Q + +T PV T+ + S+P +T+ PP+ S ST S ST
Sbjct: 206 TATANQRLSST-EPVGTQGTTTSSNPEP--QTEPPPSQRGPSGS--PQHPPSTTSQDQST 260
Query: 277 PSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNASPATQSSNVATQNTP 329
T + +PP S S+AT T Q T TQ+
Sbjct: 261 TGDGQEHTQRRKTPPATSNRRSPHSTATPPPTTKRQETGRPTPRPTATTQSGS 313
Score = 28.1 bits (62), Expect = 8.9
Identities = 21/126 (16%), Positives = 41/126 (32%), Gaps = 10/126 (7%)
Query: 207 AFCRGSESTPTVSSKQAVKTTFPPVSTEPSG-KPSDP--PNSGKTKEPPNSGDDQSSVVN 263
CR + VS + KT T + S P P+ + P + Q++ N
Sbjct: 151 PKCRCTLRGKDVSCCKEPKTAVTTSKTTSWPTEVSHPTYPSQVTPQSQPATQGHQTATAN 210
Query: 264 VATDSTLSHVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNASPATQSSNV 323
++ST TT +++P + + + S S ++
Sbjct: 211 QR-------LSSTEPVGTQGTTTSSNPEPQTEPPPSQRGPSGSPQHPPSTTSQDQSTTGD 263
Query: 324 ATQNTP 329
++T
Sbjct: 264 GQEHTQ 269
>gnl|CDD|215130 PLN02217, PLN02217, probable pectinesterase/pectinesterase
inhibitor.
Length = 670
Score = 40.5 bits (94), Expect = 0.001
Identities = 27/102 (26%), Positives = 43/102 (42%), Gaps = 2/102 (1%)
Query: 232 STEPSGKPSDPPNSGKTKEPPNSGDDQSSVVNVATDSTLSHVTSTPSTSAATTTQTTSPP 291
+ P S P S + S D S+VV +T H+ S P+T + + +TSPP
Sbjct: 563 AGNPGSTNSTPTGSAASSNTTFSSDSPSTVVAPSTSPPAGHLGSPPATPSKIVSPSTSPP 622
Query: 292 V--VSAASQNTSSATQSSTAATQNASPATQSSNVATQNTPVT 331
+ + S SS S A+ + S VA+ + V+
Sbjct: 623 ASHLGSPSTTPSSPESSIKVASTETASPESSIKVASTESSVS 664
Score = 36.2 bits (83), Expect = 0.023
Identities = 25/104 (24%), Positives = 44/104 (42%), Gaps = 8/104 (7%)
Query: 211 GSESTPTVSSKQAVKTTFPPVSTEPSGKPSDPPNSGKTKEPPNSGDDQSSVVNVATDSTL 270
GS ++ S + TTF S PS P +G PP + S +V+ +T
Sbjct: 567 GSTNSTPTGSAASSNTTFSSDSPSTVVAPSTSPPAGHLGSPPAT---PSKIVSPSTSPPA 623
Query: 271 SHVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNA 314
SH+ S TT + + A+++ S + A+T+++
Sbjct: 624 SHLGSPS-----TTPSSPESSIKVASTETASPESSIKVASTESS 662
>gnl|CDD|140343 PTZ00322, PTZ00322,
6-phosphofructo-2-kinase/fructose-2,6-biphosphatase;
Provisional.
Length = 664
Score = 39.9 bits (93), Expect = 0.002
Identities = 28/84 (33%), Positives = 42/84 (50%), Gaps = 4/84 (4%)
Query: 32 LAACGGHVDLALLLIERGANIEEVNMPTDSFESPLTLAACGGHVDLALLLIERGANIEEV 91
LAA G V +LL GA+ + +PL +A GHV + +L+E GA+ +
Sbjct: 89 LAASGDAVGARILL-TGGADPNCRDYDG---RTPLHIACANGHVQVVRVLLEFGADPTLL 144
Query: 92 NDEGYTPLMEAAREGHEEMVCVLL 115
+ +G TPL A G E+V +L
Sbjct: 145 DKDGKTPLELAEENGFREVVQLLS 168
Score = 39.9 bits (93), Expect = 0.002
Identities = 36/109 (33%), Positives = 52/109 (47%), Gaps = 4/109 (3%)
Query: 68 LAACGGHVDLALLLIERGANIEEVNDEGYTPLMEAAREGHEEMVCVLLQKGAKINAHTEE 127
LAA G V +LL GA+ + +G TPL A GH ++V VLL+ GA ++
Sbjct: 89 LAASGDAVGARILL-TGGADPNCRDYDGRTPLHIACANGHVQVVRVLLEFGADPTL-LDK 146
Query: 128 TQETALTLACCGGFLDVADFLLKNG-ANIELGASTPLME-AAQEGHLEL 174
+T L LA GF +V L ++ + ELGA+ + LE
Sbjct: 147 DGKTPLELAEENGFREVVQLLSRHSQCHFELGANAKPDSFTGKPPSLED 195
>gnl|CDD|165212 PHA02884, PHA02884, ankyrin repeat protein; Provisional.
Length = 300
Score = 37.7 bits (87), Expect = 0.006
Identities = 34/110 (30%), Positives = 50/110 (45%), Gaps = 7/110 (6%)
Query: 38 HVDLALLLIERGANIEEVNMPT--DSFESPLTLAA-CGGHVDLALLLIERGANIEEVNDE 94
+ D+ +++ GA+ E P +S +PL A C D A LLI GA++ +E
Sbjct: 45 YTDIIDAILKLGAD-PEAPFPLSENSKTNPLIYAIDCDND-DAAKLLIRYGADVNRYAEE 102
Query: 95 G-YTPLMEAAREGHEEMVCVLLQKGAKINAHTEETQET-ALTLACCGGFL 142
TPL + G + + +LL GA IN T + L L C FL
Sbjct: 103 AKITPLYISVLHGCLKCLEILLSYGADINIQTNDMVTPIELALMICNNFL 152
Score = 34.6 bits (79), Expect = 0.065
Identities = 23/86 (26%), Positives = 46/86 (53%), Gaps = 4/86 (4%)
Query: 74 HVDLALLLIERGANIE---EVNDEGYT-PLMEAAREGHEEMVCVLLQKGAKINAHTEETQ 129
+ D+ +++ GA+ E +++ T PL+ A +++ +L++ GA +N + EE +
Sbjct: 45 YTDIIDAILKLGADPEAPFPLSENSKTNPLIYAIDCDNDDAAKLLIRYGADVNRYAEEAK 104
Query: 130 ETALTLACCGGFLDVADFLLKNGANI 155
T L ++ G L + LL GA+I
Sbjct: 105 ITPLYISVLHGCLKCLEILLSYGADI 130
>gnl|CDD|237019 PRK11907, PRK11907, bifunctional 2',3'-cyclic nucleotide
2'-phosphodiesterase/3'-nucleotidase precursor protein;
Reviewed.
Length = 814
Score = 37.9 bits (88), Expect = 0.007
Identities = 15/70 (21%), Positives = 28/70 (40%)
Query: 248 TKEPPNSGDDQSSVVNVATDSTLSHVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSS 307
P + + V +D+T + +A T + S + S TS AT ++
Sbjct: 32 VTTTPATSTEAEQTTPVESDATEEADNTETPVAATTAAEAPSSSETAETSDPTSEATDTT 91
Query: 308 TAATQNASPA 317
T+ + +PA
Sbjct: 92 TSEARTVTPA 101
Score = 36.0 bits (83), Expect = 0.035
Identities = 21/87 (24%), Positives = 41/87 (47%), Gaps = 5/87 (5%)
Query: 265 ATDSTLSHVTSTPSTSAATTTQT-TSPPVVSAASQ---NTSSATQSSTAATQNASPAT-Q 319
A++ L+ +T+ AT+T+ + PV S A++ NT + ++TAA +S T +
Sbjct: 20 ASNPKLAQAEEIVTTTPATSTEAEQTTPVESDATEEADNTETPVAATTAAEAPSSSETAE 79
Query: 320 SSNVATQNTPVTNLNNAVATLKTAVAT 346
+S+ ++ T T T +
Sbjct: 80 TSDPTSEATDTTTSEARTVTPAATETS 106
Score = 31.7 bits (72), Expect = 0.60
Identities = 30/138 (21%), Positives = 47/138 (34%), Gaps = 16/138 (11%)
Query: 198 KRKQEESKAAFCRGSESTPTVSS--KQAVKTTFPPVSTEPSGKPSDPPNSGKTKEPPNSG 255
K SK+A + QA + +T + + P S T+E N+
Sbjct: 1 MMKHYFSKSAVALTLALLTASNPKLAQAEEIVTTTPATSTEAEQTTPVESDATEEADNT- 59
Query: 256 DDQSSVVNVATDSTLSHVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNAS 315
T + PS+S T TS P S A+ T+S ++ T A S
Sbjct: 60 ---------ETPVAATTAAEAPSSS---ETAETSDP-TSEATDTTTSEARTVTPAATETS 106
Query: 316 PATQSSNVATQNTPVTNL 333
+ V + T+L
Sbjct: 107 KPVEGQTVDVRILSTTDL 124
Score = 31.0 bits (70), Expect = 1.2
Identities = 11/86 (12%), Positives = 29/86 (33%), Gaps = 5/86 (5%)
Query: 259 SSVVNVATDSTLSHVTSTPSTSAATTTQTTSP----PVVSAASQNTSSATQSSTAATQNA 314
S+ + ++ +T + + TT + T++A S++ T
Sbjct: 21 SNPKLAQAEEIVTTTPATSTEAEQTTPVESDATEEADNTETPVAATTAAEAPSSSETAET 80
Query: 315 S-PATQSSNVATQNTPVTNLNNAVAT 339
S P +++++ T +
Sbjct: 81 SDPTSEATDTTTSEARTVTPAATETS 106
Score = 29.1 bits (65), Expect = 4.0
Identities = 12/77 (15%), Positives = 26/77 (33%), Gaps = 1/77 (1%)
Query: 272 HVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNASPATQSSNVATQNTPVT 331
H S + + T S P ++ A + + T +++ + +P + NT
Sbjct: 4 HYFSKSAVALTLALLTASNPKLAQA-EEIVTTTPATSTEAEQTTPVESDATEEADNTETP 62
Query: 332 NLNNAVATLKTAVATLN 348
A ++ T
Sbjct: 63 VAATTAAEAPSSSETAE 79
>gnl|CDD|165195 PHA02859, PHA02859, ankyrin repeat protein; Provisional.
Length = 209
Score = 36.3 bits (84), Expect = 0.011
Identities = 22/92 (23%), Positives = 41/92 (44%), Gaps = 19/92 (20%)
Query: 19 VNMPTDSFESPLTLAACGGHVDLALL--LIERGANIEEVNMPTDSFESPLTLAACGGH-- 74
VN D +E+P+ V++ +L LIE GA++ ++ S L H
Sbjct: 44 VNDCNDLYETPIFSCLEKDKVNVEILKFLIENGADVNFK-TRDNNL-SAL------HHYL 95
Query: 75 -------VDLALLLIERGANIEEVNDEGYTPL 99
++ +LI+ G++I E +++G L
Sbjct: 96 SFNKNVEPEILKILIDSGSSITEEDEDGKNLL 127
>gnl|CDD|221173 pfam11702, DUF3295, Protein of unknown function (DUF3295). This
family is conserved in fungi but the function is not
known.
Length = 509
Score = 37.2 bits (86), Expect = 0.012
Identities = 29/110 (26%), Positives = 49/110 (44%), Gaps = 1/110 (0%)
Query: 218 VSSKQAVKTTFPPVSTEPSGKPSDPPNSGKTKEPPNSGDDQSSVVNVATDSTLS-HVTST 276
V S + K P P+ P PP+S T PP+S + + + S TST
Sbjct: 67 VLSIKEKKDLEPLALPMPNLAPITPPSSEPTPAPPSSESTATRTPDPNQQALESTESTST 126
Query: 277 PSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNASPATQSSNVATQ 326
S + Q+++P + S+ + +SS ST+ + SP+ SS+ +
Sbjct: 127 TSADCNDSEQSSTPNLNSSDTSTSSSGALPSTSVVRGFSPSHISSSYRST 176
Score = 28.0 bits (62), Expect = 8.6
Identities = 17/88 (19%), Positives = 32/88 (36%), Gaps = 3/88 (3%)
Query: 247 KTKEPPNSGDDQSSVVNVATDSTLSHVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQS 306
K EP + + + S+ ST+ T + ++ TS+
Sbjct: 74 KDLEPLALPMPNLAPITPPSSEPTPAPPSSESTATRTPDPNQQALESTESTSTTSADCND 133
Query: 307 STAAT---QNASPATQSSNVATQNTPVT 331
S ++ N+S + SS+ A +T V
Sbjct: 134 SEQSSTPNLNSSDTSTSSSGALPSTSVV 161
>gnl|CDD|215397 PLN02744, PLN02744, dihydrolipoyllysine-residue acetyltransferase
component of pyruvate dehydrogenase complex.
Length = 539
Score = 36.8 bits (85), Expect = 0.017
Identities = 28/111 (25%), Positives = 49/111 (44%), Gaps = 21/111 (18%)
Query: 155 IELGASTPLMEAAQEGHLELVRYLLDSGAQEMQTKHI---------DILEFKKRKQEESK 205
+E +T ME +EG+L + + GA+E++ + DI +FK K S
Sbjct: 149 VETDKATVEMECMEEGYL--AKIVKGDGAKEIKVGEVIAITVEEEEDIGKFKDYKPSSSA 206
Query: 206 AAFCRGSESTPTVSSKQAVKTTFPPVSTEPSGKPSDPPNSGKTKEPPNSGD 256
A ++ +P ++ V+ +P+ P P + K PP+SGD
Sbjct: 207 APAAPKAKPSPPPPKEEEVE--------KPASSPE--PKASKPSAPPSSGD 247
>gnl|CDD|165157 PHA02795, PHA02795, ankyrin-like protein; Provisional.
Length = 437
Score = 36.5 bits (84), Expect = 0.018
Identities = 20/55 (36%), Positives = 27/55 (49%)
Query: 52 IEEVNMPTDSFESPLTLAACGGHVDLALLLIERGANIEEVNDEGYTPLMEAAREG 106
IE++N + L A G++DL L+E GAN+ V GYT L A G
Sbjct: 211 IEDINQLDAGGRTLLYRAIYAGYIDLVSWLLENGANVNAVMSNGYTCLDVAVDRG 265
>gnl|CDD|218439 pfam05109, Herpes_BLLF1, Herpes virus major outer envelope
glycoprotein (BLLF1). This family consists of the BLLF1
viral late glycoprotein, also termed gp350/220. It is
the most abundantly expressed glycoprotein in the viral
envelope of the Herpesviruses and is the major antigen
responsible for stimulating the production of
neutralising antibodies in vivo.
Length = 830
Score = 36.7 bits (84), Expect = 0.019
Identities = 33/124 (26%), Positives = 43/124 (34%), Gaps = 2/124 (1%)
Query: 205 KAAFCRGSESTPTVSSKQAVKTTFPPVSTEPSGKPSDPPNSGKTKEPPNSGDDQSSVVNV 264
K F + ++T +V + P +T PS PP S D S
Sbjct: 419 KVVFHKAPDTTKSVIFVYTLVHVEPHKTTAVPTTPSLPPASTGPTVST--ADPTSGTPTG 476
Query: 265 ATDSTLSHVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNASPATQSSNVA 324
T STL TS S + + T TSP TS TQ ++ SP V
Sbjct: 477 TTSSTLPEDTSPTSRTTSATPNATSPTPAVTTPNATSPTTQKTSDTPNATSPTPIVIGVT 536
Query: 325 TQNT 328
T T
Sbjct: 537 TTAT 540
Score = 31.7 bits (71), Expect = 0.72
Identities = 27/141 (19%), Positives = 51/141 (36%), Gaps = 6/141 (4%)
Query: 202 EESKAAFCRGSESTPTVSSKQAVKTTFPPVSTEPSGKPSDPPNSGKTKEPPNSGDDQSSV 261
+ + S++ S V ++ P+G S + T ++
Sbjct: 510 NATSPTTQKTSDTPNATSPTPIVIGVTTTATSPPTGTTS--VPNA-TSPQVTEESPVNNT 566
Query: 262 VNVATDSTLSHVTSTPSTSAATTTQ--TTSPPVVSAASQNTSSATQSSTAATQNASPATQ 319
S S +TS +T T T+ P + ++S +T + +ST ++ T
Sbjct: 567 NTPVVTSAPSVLTSAVTTGQHGTGSSPTSQQPGIPSSSHSTPRSNSTSTTPLLTSAHPTG 626
Query: 320 SSNVATQNTPVTNLNNAVATL 340
N+ T+ TP V+TL
Sbjct: 627 GENI-TEETPSVPSTTHVSTL 646
Score = 30.5 bits (68), Expect = 1.8
Identities = 29/132 (21%), Positives = 44/132 (33%), Gaps = 1/132 (0%)
Query: 200 KQEESKAAFCRGSESTPTVSSKQAVKTTFPPVSTEPSGKPSDPPNSGKTKEPPNSGDDQS 259
A + +E +P ++ V T+ P V T S T + P
Sbjct: 546 TTSVPNATSPQVTEESPVNNTNTPVVTSAPSVLTSAVTTGQHGTGSSPTSQQPGIPSSSH 605
Query: 260 SVVNVATDSTLSHVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNASPATQ 319
S + ST +TS T T+ T P V S +T S +Q + P
Sbjct: 606 STPRSNSTSTTPLLTSAHPTGGENITEET-PSVPSTTHVSTLSPGPGPGTTSQVSGPGNS 664
Query: 320 SSNVATQNTPVT 331
S++ VT
Sbjct: 665 STSRYPGEVHVT 676
Score = 29.8 bits (66), Expect = 2.4
Identities = 23/126 (18%), Positives = 48/126 (38%), Gaps = 2/126 (1%)
Query: 214 STPTVSSKQAVKTTFPPVSTEPSGKPSDPPNSGKTKEPPNSGDDQSSVVNVATDSTLSHV 273
++ T ++ P +T P+ + + + + P G ++ +++ +
Sbjct: 493 TSATPNATSPTPAVTTPNATSPTTQKTSDTPNATSPTPIVIGVTTTATSPPTGTTSVPNA 552
Query: 274 TSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNASPATQSSNVATQNTPVTNL 333
TS T + T +P V SA S TS+ T +SP +Q + + +
Sbjct: 553 TSPQVTEESPVNNTNTPVVTSAPSVLTSAVTTGQHGTG--SSPTSQQPGIPSSSHSTPRS 610
Query: 334 NNAVAT 339
N+ T
Sbjct: 611 NSTSTT 616
Score = 29.8 bits (66), Expect = 2.9
Identities = 20/148 (13%), Positives = 39/148 (26%)
Query: 202 EESKAAFCRGSESTPTVSSKQAVKTTFPPVSTEPSGKPSDPPNSGKTKEPPNSGDDQSSV 261
+ S + P S+ + + + + +G + +
Sbjct: 573 SAPSVLTSAVTTGQHGTGSSPTSQQPGIPSSSHSTPRSNSTSTTPLLTSAHPTGGENITE 632
Query: 262 VNVATDSTLSHVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNASPATQSS 321
+ ST T +P TT+Q + P S + SP+ S
Sbjct: 633 ETPSVPSTTHVSTLSPGPGPGTTSQVSGPGNSSTSRYPGEVHVTEGMPNPNATSPSAPSG 692
Query: 322 NVATQNTPVTNLNNAVATLKTAVATLNR 349
T + A +T K +
Sbjct: 693 QKTAVPTVTSTGGKANSTTKETSGSTLM 720
Score = 29.4 bits (65), Expect = 3.9
Identities = 23/120 (19%), Positives = 44/120 (36%), Gaps = 2/120 (1%)
Query: 212 SESTPTVSSKQAVKTTFPPVSTEPSGKPSDPPNSGKTKEPPNSGDDQSSVVNVATDSTLS 271
S + PT + ++ P T P+ + + + + P + + +S T T +
Sbjct: 465 STADPTSGTPTGTTSSTLPEDTSPTSRTTSATPNATSPTPAVTTPNATSPTTQKTSDTPN 524
Query: 272 HVTSTPSTSAATTTQTTSPPVVSAASQNTSS--ATQSSTAATQNASPATQSSNVATQNTP 329
+ TP TTT T+ P ++ TS +S T + S + + T
Sbjct: 525 ATSPTPIVIGVTTTATSPPTGTTSVPNATSPQVTEESPVNNTNTPVVTSAPSVLTSAVTT 584
>gnl|CDD|237592 PRK14040, PRK14040, oxaloacetate decarboxylase; Provisional.
Length = 593
Score = 35.7 bits (83), Expect = 0.036
Identities = 14/71 (19%), Positives = 29/71 (40%), Gaps = 2/71 (2%)
Query: 228 FPPVSTEPSGKPSDPPNSGKTKEPPNSGDDQSSVVNVATDSTLSHVTSTPSTSAATTTQT 287
F PV + +P+ ++ + ++ VV V+ +S + TP+ AA
Sbjct: 457 FEPVPQAEAAQPAAKAEPAGSETYTVEVEGKAYVVKVSEGGDISQI--TPAAPAAAPAAA 514
Query: 288 TSPPVVSAASQ 298
+ +AA +
Sbjct: 515 AAAAPAAAAGE 525
>gnl|CDD|237015 PRK11901, PRK11901, hypothetical protein; Reviewed.
Length = 327
Score = 35.0 bits (81), Expect = 0.052
Identities = 23/152 (15%), Positives = 45/152 (29%), Gaps = 22/152 (14%)
Query: 201 QEESKAAFCRGSESTPTVSSKQAVKTTFPPVSTEPSGKPSDPPNSG--KTKEPPNSGD-- 256
+ + + + Q + + PP+S P+ +G + + P N D
Sbjct: 96 SAANNTSDGHDASGVKNTAPPQDI--SAPPISPTPTQAAPPQTPNGQQRIELPGNISDAL 153
Query: 257 -DQSSVVNVATDSTLSHVTSTPSTSA--------ATTTQTTSPPVVSAASQNTSSATQSS 307
Q VN A+ + + ++ P+ A + P A
Sbjct: 154 SQQQGQVNAASQNAQGNTSTLPTAPATVAPSKGAKVPATAETHPTPPQKPATKKPAVNHH 213
Query: 308 TAATQNASPATQSSNVAT-------QNTPVTN 332
AT PAT + + P ++
Sbjct: 214 KTATVAVPPATSGKPKSGAASARALSSAPASH 245
>gnl|CDD|222954 PHA02989, PHA02989, ankyrin repeat protein; Provisional.
Length = 494
Score = 35.1 bits (81), Expect = 0.055
Identities = 30/113 (26%), Positives = 56/113 (49%), Gaps = 3/113 (2%)
Query: 39 VDLALLLIERGANIEEVNMPTDS-FESPLTLAACGGHVDLALL-LIERGANIEEVNDEGY 96
+ + LI++G NIE N ++S ES L + +L I + I + + +G+
Sbjct: 198 IKVIKYLIKKGVNIETNNNGSESVLESFLDNNKILSKKEFKVLNFILKYIKINKKDKKGF 257
Query: 97 TPLMEAAREGHEEMVCVLLQKGAKINAHTEETQETALTLACCGGFLDVADFLL 149
PL+ +A+ + E LL+ G I +++ +T LT A G +D+ + +L
Sbjct: 258 NPLLISAKVDNYEAFNYLLKLGDDIYNVSKDG-DTVLTYAIKHGNIDMLNRIL 309
Score = 32.8 bits (75), Expect = 0.31
Identities = 29/92 (31%), Positives = 41/92 (44%), Gaps = 25/92 (27%)
Query: 80 LLIERGANIEEVNDEGY--TPL------MEAAREGHEEMVCVLLQKGAKINAHTEETQET 131
LLI+ GA+ VN +GY TPL E +++V +LL+ GA IN +T
Sbjct: 55 LLIDNGAD---VNYKGYIETPLCAVLRNREITSNKIKKIVKLLLKFGADINL---KTFNG 108
Query: 132 ALTLACCGGF--------LDVADFLLKNGANI 155
+ C F D+ FLL G N+
Sbjct: 109 VSPIVC---FIYNSNINNCDMLRFLLSKGINV 137
>gnl|CDD|226406 COG3889, COG3889, Predicted solute binding protein [General
function prediction only].
Length = 872
Score = 35.2 bits (81), Expect = 0.059
Identities = 27/100 (27%), Positives = 41/100 (41%), Gaps = 17/100 (17%)
Query: 215 TPTVSSKQ---AVKTTFPPVSTEPSGKPSDPPNSGKTKEPPNSGDDQSSVVNVATDSTLS 271
T +S V + P TE + S N+ E +SVV T +
Sbjct: 764 TSITTSGTFTAEVPQS--PTKTETTLSYSAYSNTSILIE-------TTSVVITKTVTQTQ 814
Query: 272 HVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAAT 311
T+T S + TQTTSP S ++ T+S +Q++T
Sbjct: 815 --TTTSSP---SPTQTTSPTQTSTSTTTTTSPSQTTTGGG 849
Score = 34.1 bits (78), Expect = 0.12
Identities = 24/105 (22%), Positives = 37/105 (35%), Gaps = 9/105 (8%)
Query: 223 AVKTTFPPVSTEPSGKPSDPPNSGKTK---------EPPNSGDDQSSVVNVATDSTLSHV 273
F P + + DP + T E P S + ++ + S S +
Sbjct: 739 GENWGFIPTTKRVKVRIMDPASGTGTSITTSGTFTAEVPQSPTKTETTLSYSAYSNTSIL 798
Query: 274 TSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNASPAT 318
T S T T S + T+S TQ+ST+ T SP+
Sbjct: 799 IETTSVVITKTVTQTQTTTSSPSPTQTTSPTQTSTSTTTTTSPSQ 843
>gnl|CDD|220749 pfam10428, SOG2, RAM signalling pathway protein. SOG2 proteins in
Saccharomyces cerevisiae are involved in cell separation
and cytokinesis.
Length = 419
Score = 34.3 bits (79), Expect = 0.086
Identities = 20/113 (17%), Positives = 39/113 (34%), Gaps = 2/113 (1%)
Query: 214 STPTVSSKQAVKTTFPPVSTEPSGKPS--DPPNSGKTKEPPNSGDDQSSVVNVATDSTLS 271
T + SS A T S P+ P+S + + S + V TD
Sbjct: 160 VTASPSSMIARNTPISDRLRPRSVTPTRGRRPSSSPRSLSNPTTLESPSNLQVTTDVPPP 219
Query: 272 HVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNASPATQSSNVA 324
+ T +S +++ S A ++ S + T+ + + +P +
Sbjct: 220 YSNGTSRSSTMSSSANLSIISSLATPRSGESFRSTPTSGSSSINPVSGLDEAE 272
>gnl|CDD|213844 TIGR03657, IsdB, heme uptake protein IsdB. Isd proteins are
iron-regulated surface proteins found in Bacillus,
Staphylococcus and Listeria species and are responsible
for heme scavenging from hemoproteins. The IsdB protein
is only observed in Staphylococcus and consists of an
N-terminal hydrophobic signal sequence, a pair of tandem
NEAT (NEAr Transporter, pfam05031) domains which confers
the ability to bind heme and a C-terminal sortase
processing signal which targets the protein to the cell
wall. IsdB is believed to make a direct contact with
methemoglobin facilitating transfer of heme to IsdB. The
heme is then transferred to other cell wall-bound NEAT
domain proteins such as IsdA and IsdC.
Length = 644
Score = 34.5 bits (78), Expect = 0.096
Identities = 31/136 (22%), Positives = 58/136 (42%), Gaps = 7/136 (5%)
Query: 205 KAAFCRGSESTPTVSSKQAVKTTFPPVSTEPSGKPSDPPNSGKTKEPPNSGDDQSSVVNV 264
K AF + + +Q PS KP+ PP ++++ + DD +V
Sbjct: 450 KEAFTKANADKTNKKEQQDNSAKKETTPATPS-KPTTPPVEKESQKQDSQKDDNKQSPSV 508
Query: 265 ATD---STLSHVTSTPSTSAA---TTTQTTSPPVVSAASQNTSSATQSSTAATQNASPAT 318
+ S+ S TP+T A + +T+P V + +QN + T +S+ T++ +
Sbjct: 509 EKENDASSESGKDKTPATKPAKGEVESSSTTPTKVVSTTQNVAKPTTASSETTKDVVQTS 568
Query: 319 QSSNVATQNTPVTNLN 334
S+ A + P+ N
Sbjct: 569 AGSSEAKDSAPLQKAN 584
>gnl|CDD|240026 cd04669, Nudix_Hydrolase_11, Members of the Nudix hydrolase
superfamily catalyze the hydrolysis of NUcleoside
DIphosphates linked to other moieties, X. Enzymes
belonging to this superfamily require a divalent cation,
such as Mg2+ or Mn2+, for their activity and contain a
highly conserved 23-residue nudix motif
(GX5EX7REUXEEXGU, where U = I, L or V), which functions
as a metal binding and catalytic site. Substrates of
nudix hydrolases include intact and oxidatively damaged
nucleoside triphosphates, dinucleoside polyphosphates,
nucleotide-sugars and dinucleotide enzymes. These
substrates are metabolites or cell signaling molecules
that require regulation during different stages of the
cell cycle or during periods of stress. In general, the
role of the nudix hydrolase is to sanitize the
nucleotide pools and to maintain cell viability, thereby
serving as surveillance & "house-cleaning" enzymes.
Substrate specificity is used to define families within
the superfamily. Differences in substrate specificity
are determined by the N-terminal extension or by
residues in variable loop regions. Mechanistically,
substrate hydrolysis occurs by a nucleophilic
substitution reaction, with variation in the numbers and
roles of divalent cations required.
Length = 121
Score = 32.0 bits (73), Expect = 0.13
Identities = 18/55 (32%), Positives = 24/55 (43%), Gaps = 16/55 (29%)
Query: 79 LLLIER-----------GANIEEVNDEGYTPLMEAAREGHEEM-VCVLLQKGAKI 121
+LLI R G IEE G TP A RE EE+ + V +++ I
Sbjct: 14 ILLIRRIKPGKTYYVFPGGGIEE----GETPEEAAKREALEELGLDVRVEEIFLI 64
>gnl|CDD|223023 PHA03249, PHA03249, DNA packaging tegument protein UL25;
Provisional.
Length = 653
Score = 33.8 bits (77), Expect = 0.15
Identities = 26/131 (19%), Positives = 51/131 (38%), Gaps = 10/131 (7%)
Query: 202 EESKAAFCRGSESTPTVSSKQAVKTTFPPVSTEPS-GKPSDPPNSGKTKEPPNSGDDQSS 260
+ +A S S+ SS + V T E G+ + K +P +
Sbjct: 43 DRMEAGLSSYSSSSDNKSSFEVVSETDSGSEAEAERGRRAGMGGRNKATKPSRR---NKT 99
Query: 261 VVNVATDSTLSHVTSTPST-SAATTTQTTSPPVVSAASQNTSSATQSSTAATQNASPATQ 319
T L+ + P+T S+ + + +SPP + + S+ A ++S ++
Sbjct: 100 TQCRPTSLALATAATMPATPSSGKSPKVSSPPSIPSLSEEDEGAERNS-----GGDDSSH 154
Query: 320 SSNVATQNTPV 330
+ N +TQ+ P
Sbjct: 155 TDNESTQSQPE 165
>gnl|CDD|224343 COG1426, COG1426, Predicted transcriptional regulator contains
Xre-like HTH domain [Function unknown].
Length = 284
Score = 33.2 bits (76), Expect = 0.15
Identities = 23/98 (23%), Positives = 33/98 (33%), Gaps = 2/98 (2%)
Query: 251 PPNSGDDQSSVVNVATDSTLSHVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAA 310
+VA+ S S S ++S + TTS SA + T + T +
Sbjct: 129 LAQEEPPTLPDQSVASVSQNSQDVSLATSSTTPSEGTTSASPSSATTSFTPTVTAIAPVV 188
Query: 311 TQNASPATQSSNVATQNT-PVTNLN-NAVATLKTAVAT 346
A P T A T +AT + AV T
Sbjct: 189 APTAKPVTVPKQPAADLAASSTAPAAKEMATGQEAVPT 226
>gnl|CDD|183854 PRK13042, PRK13042, superantigen-like protein; Reviewed.
Length = 291
Score = 33.1 bits (75), Expect = 0.17
Identities = 21/73 (28%), Positives = 33/73 (45%), Gaps = 5/73 (6%)
Query: 264 VATDSTLSHVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSST----AATQNASPAT- 318
V T +T + +TPS++ Q+T P A Q+ +AT + A Q + T
Sbjct: 20 VITTTTQAANATTPSSTKVEAPQSTPPSTKVEAPQSKPNATTPPSTKVEAPQQTPNATTP 79
Query: 319 QSSNVATQNTPVT 331
S+ V T +P T
Sbjct: 80 SSTKVETPQSPTT 92
>gnl|CDD|240410 PTZ00418, PTZ00418, Poly(A) polymerase; Provisional.
Length = 593
Score = 32.8 bits (75), Expect = 0.24
Identities = 23/75 (30%), Positives = 34/75 (45%), Gaps = 4/75 (5%)
Query: 247 KTKEPPNSGDDQSSVVNVATDSTLSHVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQS 306
K + P Q+ V T + T T ++SA T+ Q+ S S ++ N SS T S
Sbjct: 523 KKSQLPAFVLSQTPEEPVKTKAN----TKTNTSSATTSGQSGSSGSTSNSNSNESSPTMS 578
Query: 307 STAATQNASPATQSS 321
ST +S +T S
Sbjct: 579 STELLNVSSTSTTGS 593
Score = 28.2 bits (63), Expect = 8.8
Identities = 11/59 (18%), Positives = 23/59 (38%)
Query: 285 TQTTSPPVVSAASQNTSSATQSSTAATQNASPATQSSNVATQNTPVTNLNNAVATLKTA 343
T + NTSSAT S + + ++ + S+ + + LN + + +
Sbjct: 535 TPEEPVKTKANTKTNTSSATTSGQSGSSGSTSNSNSNESSPTMSSTELLNVSSTSTTGS 593
>gnl|CDD|220684 pfam10310, DUF2413, Protein of unknown function (DUF2413). This is
a family of proteins conserved in fungi. The function is
not known.
Length = 436
Score = 32.8 bits (75), Expect = 0.26
Identities = 29/141 (20%), Positives = 44/141 (31%), Gaps = 33/141 (23%)
Query: 192 DILEF-----KKRKQEESKAAFCRGSESTPTVSSKQAVKTTFPPVSTEPSGKPSDPPNSG 246
DILEF + K + K TP K + T + + P
Sbjct: 33 DILEFLDELEQSEKAKPPKKPKEASRPGTPRNPKKSSKPT-----ESSAASSEEKPAKPR 87
Query: 247 KTKEPPNSGDDQSSVVNVATDSTLSHVTSTPSTSAATTTQTTSPPVVSAASQN------- 299
K+ E S +S PST + + P A+
Sbjct: 88 KSAESTRSSHPKSK---------------APSTESEEEEEPEETPDPIASIGGWWSLWGS 132
Query: 300 -TSSATQSSTAATQNASPATQ 319
TS+AT +++AA + A A
Sbjct: 133 ITSTATSTASAAVKQAEQAVN 153
>gnl|CDD|227430 COG5099, COG5099, RNA-binding protein of the Puf family,
translational repressor [Translation, ribosomal
structure and biogenesis].
Length = 777
Score = 32.8 bits (75), Expect = 0.32
Identities = 26/143 (18%), Positives = 53/143 (37%), Gaps = 9/143 (6%)
Query: 211 GSESTPTVSSKQAVKTTFPPVSTEPSGKPSDPPNSGKTKEPPNSGDDQSSVVNVAT---D 267
P+ +S A T P++ + K +S ++P S S +
Sbjct: 42 SFSPIPSKASSSATFTLNLPINNSVNHK--ITSSSSSRRKPSGSWSVAISSSTSGSQSLL 99
Query: 268 STLSHVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNAS----PATQSSNV 323
L + PSTS+ + + + ++ + + S+ ++ N++ P SN
Sbjct: 100 MELPSSSFNPSTSSRNKSNSALSSTQQGNANSSVTLSSSTASSMFNSNKLPLPNPNHSNS 159
Query: 324 ATQNTPVTNLNNAVATLKTAVAT 346
AT N ++ N A+ + T
Sbjct: 160 ATTNQSGSSFINTPASSSSQPLT 182
>gnl|CDD|236304 PRK08581, PRK08581, N-acetylmuramoyl-L-alanine amidase; Validated.
Length = 619
Score = 32.5 bits (74), Expect = 0.39
Identities = 14/90 (15%), Positives = 31/90 (34%), Gaps = 10/90 (11%)
Query: 250 EPPNSGDDQSSVVNVATDSTLSHVTSTPSTSAATTTQTTSPPVVSAA----------SQN 299
E P + + ++ N +DS++ + T T S+ +P + +
Sbjct: 137 EQPRNSEKSTNDSNKNSDSSIKNDTDTQSSKQDKADNQKAPSSNNTKPSTSNKQPNSPKP 196
Query: 300 TSSATQSSTAATQNASPATQSSNVATQNTP 329
T +S A+ + + SS +
Sbjct: 197 TQPNQSNSQPASDDTANQKSSSKDNQSMSD 226
Score = 28.2 bits (63), Expect = 7.7
Identities = 17/138 (12%), Positives = 36/138 (26%), Gaps = 6/138 (4%)
Query: 197 KKRKQEESKAAFCRGSESTPTVSSKQAVKTTFPPVSTEPSGKPSDPPNSGKTKEPPNSGD 256
K E SK + ++ ++ + + S S+ K P +
Sbjct: 45 KSNDDETSKDTSSKDTDKADNNNTSNQDNNDKKFSTIDSSTSDSNNIIDFIYKNLPQTNI 104
Query: 257 DQSSVVNVATDSTLSHVTSTP------STSAATTTQTTSPPVVSAASQNTSSATQSSTAA 310
+Q N D+ S + + + + SS +
Sbjct: 105 NQLLTKNKYDDNYSLTTLIQNLFNLNSDISDYEQPRNSEKSTNDSNKNSDSSIKNDTDTQ 164
Query: 311 TQNASPATQSSNVATQNT 328
+ A ++ NT
Sbjct: 165 SSKQDKADNQKAPSSNNT 182
Score = 27.8 bits (62), Expect = 10.0
Identities = 13/95 (13%), Positives = 27/95 (28%), Gaps = 6/95 (6%)
Query: 239 PSDPPNSGKTKEPPNSGDDQSSV--VNVATDSTLSHVTSTPSTSAATTTQTTSPPVVSAA 296
+ S DD++S + TD ++ TS + + S ++
Sbjct: 32 KDSTAKTTSHDSK-KSNDDETSKDTSSKDTDKADNNNTSNQDNNDKKFSTIDSS---TSD 87
Query: 297 SQNTSSATQSSTAATQNASPATQSSNVATQNTPVT 331
S N + T T++ +
Sbjct: 88 SNNIIDFIYKNLPQTNINQLLTKNKYDDNYSLTTL 122
>gnl|CDD|233273 TIGR01108, oadA, oxaloacetate decarboxylase alpha subunit. This
model describes the bacterial oxaloacetate decarboxylase
alpha subunit and its equivalents in archaea. The
oxaloacetate decarboxylase Na+ pump is the paradigm of
the family of Na+ transport decarboxylases that present
in bacteria and archaea. It a multi subunit enzyme
consisting of a peripheral alpha-subunit and integral
membrane subunits beta and gamma. The energy released by
the decarboxylation reaction of oxaloacetate is coupled
to Na+ ion pumping across the membrane [Transport and
binding proteins, Cations and iron carrying compounds,
Energy metabolism, Other].
Length = 582
Score = 32.5 bits (74), Expect = 0.41
Identities = 30/139 (21%), Positives = 46/139 (33%), Gaps = 21/139 (15%)
Query: 173 ELVRYLLDSGAQEMQTKHIDIL--EFKKRKQEESKAAFCRGSEST-------PTVSSK-- 221
EL R +L + + D+L E K + E +A + S P V K
Sbjct: 378 ELQRKILGDEKPIVDCRPADLLEPELDKLRAEVREAGAEKNSIEDVLTYALFPQVGLKFL 437
Query: 222 --QAVKTTFPPVSTEPSGKPSDPPNSGKTKEPPNSG------DDQSSVVNVATDSTLSHV 273
+ F P E + GK +E SG + ++ VV V +
Sbjct: 438 ENRHNPAAFEPKPEEKVIEQEHAQVVGKYEETHASGSYTVEVEGKAFVVKV-SPGGD-VS 495
Query: 274 TSTPSTSAATTTQTTSPPV 292
T S A T+ T +
Sbjct: 496 QITASAPANTSGGTVAAKA 514
>gnl|CDD|183756 PRK12799, motB, flagellar motor protein MotB; Reviewed.
Length = 421
Score = 32.0 bits (72), Expect = 0.47
Identities = 27/154 (17%), Positives = 49/154 (31%), Gaps = 10/154 (6%)
Query: 189 KHIDILEFKKRKQE-------ESKAAFCRGSESTPTVSSKQAVKTTFPPVSTEPSGKPSD 241
+ I IL K+ Q +++A + + + V S+ + +
Sbjct: 257 RRISILVLNKQSQHDIEHENLDNRALDIEKATGLKQIDTHGTVPVAAVTPSSAVTQSSAI 316
Query: 242 PPNSGKTKEP---PNSGDDQSSVVNVATDSTLSHVTSTPSTSAATTTQTTSPPVVSAASQ 298
P+S P P+S QS+ A+ LS PS T
Sbjct: 317 TPSSAAIPSPAVIPSSVTTQSATTTQASAVALSSAGVLPSDVTLPGTVALPAAEPVNMQP 376
Query: 299 NTSSATQSSTAATQNASPATQSSNVATQNTPVTN 332
S T++ ++T N + + P +N
Sbjct: 377 QPMSTTETQQSSTGNITSTANGPTTSLPAAPASN 410
>gnl|CDD|165513 PHA03255, PHA03255, BDLF3; Provisional.
Length = 234
Score = 31.4 bits (70), Expect = 0.57
Identities = 26/79 (32%), Positives = 39/79 (49%)
Query: 268 STLSHVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNASPATQSSNVATQN 327
+T S S PST+ +TT TTS P+ + A +T++ T +ST T P T +++
Sbjct: 44 TTPSPSASGPSTNQSTTLTTTSAPITTTAILSTNTTTVTSTGTTVTPVPTTSNASTINVT 103
Query: 328 TPVTNLNNAVATLKTAVAT 346
T VT N T +T
Sbjct: 104 TKVTAQNITATEAGTGTST 122
Score = 30.6 bits (68), Expect = 0.88
Identities = 31/121 (25%), Positives = 48/121 (39%), Gaps = 7/121 (5%)
Query: 212 SESTPTVSSKQAVKTTFPPVSTEPSGKPSDPP-NSGKTKEPPNSGDDQSSVVNVATDSTL 270
S S P+ + + TT P++T + S T P +S +NV T T
Sbjct: 49 SASGPSTNQSTTLTTTSAPITTTAILSTNTTTVTSTGTTVTPVPTTSNASTINVTTKVTA 108
Query: 271 SHVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNASPATQSSNVATQNTPV 330
++T+T TS V S + +SS T ++T T + A S+ T N
Sbjct: 109 QNITAT------EAGTGTSTGVTSNVTTRSSSTTSATTRITNATTLAPTLSSKGTSNATK 162
Query: 331 T 331
T
Sbjct: 163 T 163
Score = 29.5 bits (65), Expect = 2.5
Identities = 29/135 (21%), Positives = 56/135 (41%), Gaps = 6/135 (4%)
Query: 214 STPTVSSKQAVKTTFPPVSTEPSGKPSDPPNSGKTKEPPNSGDDQSSVVNVATDSTLSHV 273
++ S+ A T T PS S P + T S ++ + ++T++T
Sbjct: 25 TSSGSSTASAGNVTGTTAVTTPSPSASGPSTNQSTTLTTTSAPITTTAI-LSTNTTTVTS 83
Query: 274 TSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNASPATQSSNVATQNTPVTNL 333
T T T TT+ ++ V + + +AT++ T + +SNV T+++ T+
Sbjct: 84 TGTTVTPVPTTSNASTINVTTKVTAQNITATEAGTGTSTGV-----TSNVTTRSSSTTSA 138
Query: 334 NNAVATLKTAVATLN 348
+ T TL+
Sbjct: 139 TTRITNATTLAPTLS 153
>gnl|CDD|180536 PRK06347, PRK06347, autolysin; Reviewed.
Length = 592
Score = 31.6 bits (71), Expect = 0.60
Identities = 22/83 (26%), Positives = 31/83 (37%), Gaps = 2/83 (2%)
Query: 233 TEPSGKPSDPPNSGKTKEPPNSGDDQSSVVNVAT--DSTLSHVTSTPSTSAATTTQTTSP 290
T P+ + S + TKE P + +++ + T TP Q
Sbjct: 55 TAPADEASKSAEANTTKEAPATATPENTTEPTVEPKQTETKEQTKTPEEKQPAAKQVEKA 114
Query: 291 PVVSAASQNTSSATQSSTAATQN 313
P A N +AT SST AT N
Sbjct: 115 PAEPATVSNPDNATSSSTPATYN 137
>gnl|CDD|234541 TIGR04320, Surf_Exclu_PgrA, SEC10/PgrA surface exclusion domain.
This model describes a conserved domain found in surface
proteins of a number of Firmutes. Many members have
LPXTG C-terminal anchoring motifs and a substantial
number have the KxYKxGKxW putative sorting signal at the
N-terminus. The tetracycline resistance plasmid pCF10 in
Enterococcus faecalis promotes conjugal plasmid transfer
in response to sex pheromones, but PgrA/Sec10 encoded by
that plasmid, a member of this family, specifically
inhibits the ability of cells to receive homologous
plasmids. The phenomenon is called surface exclusion.
Length = 356
Score = 31.2 bits (71), Expect = 0.66
Identities = 17/54 (31%), Positives = 25/54 (46%)
Query: 295 AASQNTSSATQSSTAATQNASPATQSSNVATQNTPVTNLNNAVATLKTAVATLN 348
AA+Q + Q A Q + T +N+AT + N +A K A+A LN
Sbjct: 292 AAAQAALATAQKELANAQAQALQTAQNNLATAQAALANAEARLAKAKEALANLN 345
Score = 28.5 bits (64), Expect = 5.0
Identities = 13/51 (25%), Positives = 23/51 (45%)
Query: 295 AASQNTSSATQSSTAATQNASPATQSSNVATQNTPVTNLNNAVATLKTAVA 345
A +Q + T + AT A+ A + +A + NLN +A + A+
Sbjct: 306 ANAQAQALQTAQNNLATAQAALANAEARLAKAKEALANLNADLAKKQAALD 356
>gnl|CDD|165256 PHA02946, PHA02946, ankyin-like protein; Provisional.
Length = 446
Score = 31.6 bits (71), Expect = 0.69
Identities = 16/43 (37%), Positives = 25/43 (58%)
Query: 81 LIERGANIEEVNDEGYTPLMEAAREGHEEMVCVLLQKGAKINA 123
L+ RG + E +D+G PL A++ + +V +LL GA NA
Sbjct: 58 LLHRGYSPNETDDDGNYPLHIASKINNNRIVAMLLTHGADPNA 100
Score = 30.0 bits (67), Expect = 1.8
Identities = 28/95 (29%), Positives = 45/95 (47%), Gaps = 6/95 (6%)
Query: 45 LIERGANIEEVNMPTDSFESPLTLAACGGHVDLALLLIERGANIEEVNDEGYTPLMEAAR 104
L+ RG + E + D PL +A+ + + +L+ GA+ + + TPL +
Sbjct: 58 LLHRGYSPNETD---DDGNYPLHIASKINNNRIVAMLLTHGADPNACDKQHKTPLYYLSG 114
Query: 105 EGHE--EMVCVLLQKGAKINAHTEETQETALTLAC 137
E E + +L+Q GAKIN +E L LAC
Sbjct: 115 TDDEVIERINLLVQYGAKINNSVDEEGCGPL-LAC 148
>gnl|CDD|236652 PRK10118, PRK10118, flagellar hook-length control protein;
Provisional.
Length = 408
Score = 31.4 bits (71), Expect = 0.70
Identities = 23/107 (21%), Positives = 42/107 (39%), Gaps = 4/107 (3%)
Query: 227 TFPPVSTEPSGKPSDPPNSGKTKEPPNSGDDQSSVVNVATDSTLSHVTSTPSTSAATTTQ 286
PV+ PS + TK+ P++ D +++D +TS T+A
Sbjct: 155 NTTPVADAPSTVLPAEKPTLLTKDMPSAPQD--ETHTLSSDEHEKGLTSAQLTTAQPDDA 212
Query: 287 TTSP--PVVSAASQNTSSATQSSTAATQNASPATQSSNVATQNTPVT 331
+P P+ A++ + A ST + A+ + + TQ P
Sbjct: 213 PGTPAQPLTPLAAEAQAKAEVISTPSPVTAAASPTITPHQTQPLPTA 259
>gnl|CDD|215964 pfam00513, Late_protein_L2, Late Protein L2.
Length = 466
Score = 31.5 bits (72), Expect = 0.73
Identities = 23/123 (18%), Positives = 40/123 (32%), Gaps = 31/123 (25%)
Query: 211 GSESTPTVSSKQAVKTTFPPVSTEPSGKPSDP---------------------PNSGKTK 249
G+ V + V+ PPV E + P+DP P G
Sbjct: 71 GTRPVRVVGTGTPVR---PPVVVESTVGPTDPSIVSLVEESSIIESGAPIPPIPGDGSGF 127
Query: 250 EPPNSGDDQSSVVNVATDSTLSHVTST----PSTSAATTTQTTSPPVVSAASQNTSSATQ 305
S ++++V + HV+ T P + + Q PP + S + + Q
Sbjct: 128 PITTSSTTTPAILDVTPTTRTVHVSRTQYNNPLFTDPSVLQ---PPQPAEVSGHVLVSGQ 184
Query: 306 SST 308
+
Sbjct: 185 TIG 187
>gnl|CDD|225805 COG3266, DamX, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 292
Score = 30.7 bits (69), Expect = 0.92
Identities = 22/103 (21%), Positives = 39/103 (37%), Gaps = 5/103 (4%)
Query: 246 GKTKEPPNSGDDQSSVVNVATDSTLSHVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQ 305
+ + ++S +N AT + P+++ T+ + PP+ +S T
Sbjct: 35 STSSSEAPASAEKSIDLNGATQANAQQPAPGPTSAENTSQDLSLPPI---SSTPTQGQEP 91
Query: 306 SSTAATQNASPATQSSNVATQNTPVTNLNNAVA--TLKTAVAT 346
+ Q +N A Q ++ LNN TL T AT
Sbjct: 92 LAQDGQQRVEVQGDLNNAAVQPQNLSQLNNVAVTSTLPTEPAT 134
Score = 29.1 bits (65), Expect = 2.9
Identities = 28/137 (20%), Positives = 45/137 (32%), Gaps = 25/137 (18%)
Query: 214 STPTVSSKQAVKTTFPPVS-TEPSGKPSDPPNSGKTKEPPNS-------GDDQSSVVNVA 265
PT + + + PP+S T G+ + + E + S + NVA
Sbjct: 64 PGPTSAENTSQDLSLPPISSTPTQGQEPLAQDGQQRVEVQGDLNNAAVQPQNLSQLNNVA 123
Query: 266 TDSTLSHVTST-----PSTSAATTTQTTSPPVVS------AASQNTSSATQSSTAATQNA 314
STL +T ++ + PV + A + ++ A Q A
Sbjct: 124 VTSTLPTEPATVAPVRNASVPTAERPAITRPVRAQAVSEPAVEPKAAKTATATEAKVQTA 183
Query: 315 SPATQSSNVATQNTPVT 331
SPA T TP
Sbjct: 184 SPA------QTPATPPA 194
>gnl|CDD|233045 TIGR00601, rad23, UV excision repair protein Rad23. All proteins
in this family for which functions are known are
components of a multiprotein complex used for targeting
nucleotide excision repair to specific parts of the
genome. In humans, Rad23 complexes with the XPC protein.
This family is based on the phylogenomic analysis of JA
Eisen (1999, Ph.D. Thesis, Stanford University) [DNA
metabolism, DNA replication, recombination, and repair].
Length = 378
Score = 31.0 bits (70), Expect = 0.97
Identities = 15/86 (17%), Positives = 32/86 (37%), Gaps = 2/86 (2%)
Query: 256 DDQSSVVNVATDSTLSHVTSTPSTSAATTTQTTSPPV--VSAASQNTSSATQSSTAATQN 313
D V+ + V +T + T T SPP S S +SA + + + ++
Sbjct: 67 KDFVVVMVSKPKTGTGKVAPPAATPTSAPTPTPSPPASPASGMSAAPASAVEEKSPSEES 126
Query: 314 ASPATQSSNVATQNTPVTNLNNAVAT 339
A+ S + + ++ + +
Sbjct: 127 ATATAPESPSTSVPSSGSDAASTLVV 152
Score = 30.6 bits (69), Expect = 1.3
Identities = 15/66 (22%), Positives = 28/66 (42%), Gaps = 3/66 (4%)
Query: 274 TSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNASPATQSSNVATQNTPVT-- 331
T TPS A+ + ++ P SA + + S ++ A ++ S + SS +T V
Sbjct: 96 TPTPSPPASPASGMSAAP-ASAVEEKSPSEESATATAPESPSTSVPSSGSDAASTLVVGS 154
Query: 332 NLNNAV 337
+
Sbjct: 155 ERETTI 160
>gnl|CDD|118131 pfam09595, Metaviral_G, Metaviral_G glycoprotein. This is a viral
attachment glycoprotein from region G of metaviruses. It
is high in serine and threonine suggesting it is highly
glycosylated.
Length = 183
Score = 30.4 bits (68), Expect = 0.97
Identities = 20/105 (19%), Positives = 37/105 (35%), Gaps = 4/105 (3%)
Query: 232 STEPSGKPSDPPNSGKTKEP----PNSGDDQSSVVNVATDSTLSHVTSTPSTSAATTTQT 287
ST P+ + T P D+ + ++ + + + T+
Sbjct: 68 STLPAATSESHLETEPTSTPDTTNRQQTVDRHTTPPSSSRTQTTQAVHEKKNTRTTSRTQ 127
Query: 288 TSPPVVSAASQNTSSATQSSTAATQNASPATQSSNVATQNTPVTN 332
T P +AA Q T++ SST + S+ TQ+ T+
Sbjct: 128 TPPTTSTAAVQTTTTTNTSSTGKEPTTTSVQPRSSATTQSHEETS 172
>gnl|CDD|218191 pfam04652, DUF605, Vta1 like. Vta1 (VPS20-associated protein 1) is
a positive regulator of Vps4. Vps4 is an ATPase that is
required in the multivesicular body (MVB) sorting
pathway to dissociate the endosomal sorting complex
required for transport (ESCRT). Vta1 promotes correct
assembly of Vps4 and stimulates its ATPase activity
through its conserved Vta1/SBP1/LIP5 region.
Length = 315
Score = 30.8 bits (70), Expect = 0.97
Identities = 19/125 (15%), Positives = 32/125 (25%), Gaps = 10/125 (8%)
Query: 228 FPPVSTEPSGKPSDPPNSGKTKEPPNSGDDQSSVVNVATDSTLSHVTSTPSTSAATTTQT 287
S + G P D S + + S + TP S + T
Sbjct: 175 EDADPASASPSDPPSSSPGVPSFPSPPEDPSSPSDSSLPPAPSSFQSDTPPPSPESPTNP 234
Query: 288 TSPPVVSAASQNTSSATQSSTAATQNASPATQSSNVATQNTPVTNLNNAVATLKTAVATL 347
+ PP A P S++ P+ + A+A
Sbjct: 235 SPPP----GPAAPPPPPVQQVPPLSTAKPTPPSASATPA--PIGGITLDDD----AIAKA 284
Query: 348 NRHCR 352
+H +
Sbjct: 285 QKHAK 289
>gnl|CDD|234383 TIGR03895, protease_PatA, cyanobactin maturation protease,
PatA/PatG family. This model describes a protease
domain associated with the maturation of various members
of the cyanobactin family of ribosomally produced,
heavily modified bioactive metabolites. Members include
the PatA protein and C-terminal domain of the PatG
protein of Prochloron didemni, TenA and a region of TenG
from Nostoc spongiaeforme var. tenue, etc.
Length = 602
Score = 30.8 bits (70), Expect = 1.0
Identities = 22/109 (20%), Positives = 37/109 (33%), Gaps = 7/109 (6%)
Query: 245 SGKTKEPPNSGDDQSSVVNVATDSTLSHVTSTPSTSAATTTQTTSP---PVVSA---ASQ 298
+G+T +Q V + + S+ S +P PV + A+
Sbjct: 227 TGETMSEGLVTSEQDGVEEASGCGVQGTIESSTSVIPPGRAAEPAPVSIPVAAPGEGATP 286
Query: 299 NTSSATQSSTAATQNASPATQSSNVATQNTPVTNLNNAVATLKTAVATL 347
+ S+ SPAT + TP + +A L A+ TL
Sbjct: 287 AAAQIELSAGVLPNAISPATPVRPASNGVTP-SQAPSAEPQLVYALGTL 334
>gnl|CDD|217495 pfam03326, Herpes_TAF50, Herpesvirus transcription activation
factor (transactivator). This family includes EBV BRLF1
and similar ORF 50 proteins from other herpesviruses.
Length = 500
Score = 30.9 bits (70), Expect = 1.2
Identities = 23/115 (20%), Positives = 33/115 (28%), Gaps = 13/115 (11%)
Query: 227 TFPPVSTEPSGKPSDPPNSGKTKEPPNSGDDQSSVVNVATDSTLSHVT---------STP 277
P S P P S P + S V +T S + S+
Sbjct: 216 FTPHPSLPQPQSPLKPSPSSA---RPQQSESFSDVWPASTQSPREETSAEPLAPASPSSR 272
Query: 278 STSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNASPATQSSNVATQNTPVTN 332
S A Q + Q S S+ + + PA ++ T P TN
Sbjct: 273 RPSTAQEEQIACSSPQAEPEQGVQSYVPQSSDSRPSCFPAPSTTQ-PTFLPPNTN 326
Score = 29.8 bits (67), Expect = 2.3
Identities = 25/135 (18%), Positives = 40/135 (29%), Gaps = 19/135 (14%)
Query: 215 TPTVSSKQAVKTTFPPV-STEPSGKPSDPPNSGKTKEPPNSGDDQSSVVNVATDSTLSHV 273
TP AV P G+PS S PP++ SS+
Sbjct: 339 TPKQEGGAAVSQNHDGGTVRAPRGRPSGSGQS-----PPSNSPLLSSL------------ 381
Query: 274 TSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNASPATQSSNVATQNTPVTNL 333
TPS +A P VV ++ +S Q T + + + +
Sbjct: 382 ADTPSGAAHQPASLLPPAVVQQQLED-ASDKQPPTPGSSLVPQPDEQELGPSVMALLDRD 440
Query: 334 NNAVATLKTAVATLN 348
+A + + L
Sbjct: 441 QGILALILAPIPGLY 455
>gnl|CDD|234351 TIGR03773, anch_rpt_wall, putative ABC transporter-associated
repeat protein. Members of this protein family occur in
genomes that contain a three-gene ABC transporter operon
associated with the presence of domain TIGR03769. That
domain occurs as a single-copy insert in the
substrate-binding protein, and occurs in two or more
copies in members of this protein family. Members of
this family typically are encoded adjacent to the said
transporter operon and may serve as a substrate
receptor.
Length = 513
Score = 30.7 bits (69), Expect = 1.2
Identities = 29/173 (16%), Positives = 49/173 (28%), Gaps = 17/173 (9%)
Query: 139 GGFLDVADFLLKNGANIELGASTPLMEAAQEGHLELVRYLLDSGAQEMQTKHIDILEFKK 198
GG L L G + + + +A H+ + G E+ L
Sbjct: 98 GGTLGKVTPLFTIGDSSKPAS----FKAEGPTHVHGNWGFTEPGIYELTVTATADLADGG 153
Query: 199 RKQEESKAAFCRGSESTPTVSSKQAVKTTFPPVSTEPSGKPSDPPNSGKTKEPPNSGDDQ 258
K + G + P +GKP + N +P D
Sbjct: 154 AKSKPETYTVVVGKVEVDKID---------PARCATGAGKPQNDANGPAADKP-LFDDP- 202
Query: 259 SSVVNVATDSTLSHVTSTPSTSA--ATTTQTTSPPVVSAASQNTSSATQSSTA 309
+S V D + + + +P V+A + +T SS A
Sbjct: 203 ASGVQALGDESAFSPGQQATVQIGKSVRLPADAPLGVAAVVVKAAPSTGSSDA 255
>gnl|CDD|171499 PRK12438, PRK12438, hypothetical protein; Provisional.
Length = 991
Score = 31.0 bits (70), Expect = 1.2
Identities = 17/79 (21%), Positives = 24/79 (30%), Gaps = 4/79 (5%)
Query: 208 FCRGSESTPTVSSKQAVKTTFPPVSTEPSGKPSD--PPNSGKTKEPPNSGDDQSSVVNVA 265
F G+ T A + PP P P PP + + P G D
Sbjct: 895 FGPGTGRVATAPGGDA--ASAPPPGAGPPAPPQAVPPPRTTQPPAAPPRGPDVPPAAVAE 952
Query: 266 TDSTLSHVTSTPSTSAATT 284
TL+ + S + T
Sbjct: 953 LRETLADLRSAQRSGDFTA 971
Score = 29.4 bits (66), Expect = 3.2
Identities = 25/135 (18%), Positives = 39/135 (28%), Gaps = 38/135 (28%)
Query: 216 PTVSSKQAVKTTFPP----VSTEPSGKPSDPPNSGKTKEPPNSGDDQSSVVNVATDSTLS 271
PT++ +A+ F P V+T P G + P G PP
Sbjct: 885 PTLA--EALDQVFGPGTGRVATAPGGDAASAPPPGAG--PP------------------- 921
Query: 272 HVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNASPATQSSNVATQNTPVT 331
TT +PP + A + A S A ++ T
Sbjct: 922 -APPQAVPPPRTTQPPAAPP--------RGPDVPPAAVAELRETLADLRS--AQRSGDFT 970
Query: 332 NLNNAVATLKTAVAT 346
A+ L+ A+
Sbjct: 971 AYGAALDRLEKAIDA 985
>gnl|CDD|236782 PRK10871, nlpD, lipoprotein NlpD; Provisional.
Length = 319
Score = 30.2 bits (68), Expect = 1.4
Identities = 25/88 (28%), Positives = 36/88 (40%), Gaps = 3/88 (3%)
Query: 262 VNVATDSTLSHVTSTPST--SAATTTQTTSPPVVSAASQNTSSATQSSTAATQNASPATQ 319
+NV + + + TP T +A T VV +QN++ A S T + S Q
Sbjct: 97 LNVGQTLQVGNASGTPITGGNAITQADAAEQGVVIKPAQNSTVAVASQPTITYSESSGEQ 156
Query: 320 SSN-VATQNTPVTNLNNAVATLKTAVAT 346
S+N + N P A T TA T
Sbjct: 157 SANKMLPNNKPAATTVTAPVTAPTASTT 184
>gnl|CDD|220944 pfam11018, Cuticle_3, Pupal cuticle protein C1. Insect cuticles
are composite structures whose mechanical properties are
optimised for biological function. The major components
are the chitin filament system and the cuticular
proteins, and the cuticle's properties are determined
largely by the interactions between these two sets of
molecules. The proteins can be ordered by species.
Length = 164
Score = 29.6 bits (66), Expect = 1.7
Identities = 18/58 (31%), Positives = 26/58 (44%), Gaps = 1/58 (1%)
Query: 289 SPPVVSAASQNTS-SATQSSTAATQNASPATQSSNVATQNTPVTNLNNAVATLKTAVA 345
S P V + QNT S T + + + T S+V +T ++N A KTA A
Sbjct: 1 SAPAVGSTHQNTVRSFGGLGTVSHYSKAVDTPYSSVRKSDTRISNNAYQPAYAKTAYA 58
>gnl|CDD|165527 PHA03269, PHA03269, envelope glycoprotein C; Provisional.
Length = 566
Score = 30.1 bits (67), Expect = 1.9
Identities = 18/112 (16%), Positives = 33/112 (29%), Gaps = 7/112 (6%)
Query: 214 STPTVSSKQAVKTTFPPVSTEPSGKPSDP-------PNSGKTKEPPNSGDDQSSVVNVAT 266
P + QA P S P P + + +P + +S
Sbjct: 40 PDPAPAPHQAASRAPDPAVAPTSAASRKPDLAQAPTPAASEKFDPAPAPHQAASRAPDPA 99
Query: 267 DSTLSHVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNASPAT 318
+ P + A T+ + + A + +S A TQ++ P
Sbjct: 100 VAPQLAAAPKPDAAEAFTSAAQAHEAPADAGTSAASKKPDPAAHTQHSPPPF 151
>gnl|CDD|221480 pfam12238, MSA-2c, Merozoite surface antigen 2c. This family of
proteins is found in eukaryotes. Proteins in this family
are typically between 263 and 318 amino acids in length.
There is a conserved SFT sequence motif. MSA-2 is a
plasma membrane glycoprotein which can be found in
Babesia bovis species.
Length = 201
Score = 29.4 bits (66), Expect = 2.0
Identities = 12/45 (26%), Positives = 16/45 (35%)
Query: 285 TQTTSPPVVSAASQNTSSATQSSTAATQNASPATQSSNVATQNTP 329
++SQ TS+ S +AT N S N A Q
Sbjct: 137 APAEYYSPKHSSSQGTSTTRPSDGSATPNTSAPPTPGNPAAQPEK 181
Score = 28.6 bits (64), Expect = 3.6
Identities = 15/56 (26%), Positives = 22/56 (39%), Gaps = 8/56 (14%)
Query: 274 TSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNASPATQSSNVATQNTP 329
+P S++ T TT P S SAT +++A +PA Q A
Sbjct: 141 YYSPKHSSSQGTSTTRP------SDG--SATPNTSAPPTPGNPAAQPEKPAETPKG 188
>gnl|CDD|225010 COG2099, CobK, Precorrin-6x reductase [Coenzyme metabolism].
Length = 257
Score = 29.6 bits (67), Expect = 2.1
Identities = 14/50 (28%), Positives = 22/50 (44%), Gaps = 11/50 (22%)
Query: 139 GGFLDV---ADFLLKNGANIELGASTP--------LMEAAQEGHLELVRY 177
GGFL A FL + G ++ + A+ P AA+E + +R
Sbjct: 50 GGFLGAEGLAAFLREEGIDLLIDATHPYAARISQNAARAAKETGIPYLRL 99
>gnl|CDD|227578 COG5253, MSS4, Phosphatidylinositol-4-phosphate 5-kinase [Signal
transduction mechanisms].
Length = 612
Score = 29.9 bits (67), Expect = 2.3
Identities = 23/112 (20%), Positives = 36/112 (32%), Gaps = 17/112 (15%)
Query: 212 SESTPTVSSKQAVKTTFPPVSTEPSGKPSDPPNSGKTKEPPNSGDDQSSVVNVATDSTLS 271
+ S+ S VKT + P G+ S N + D +++ L
Sbjct: 117 TRSSGNNLSNANVKT-----LSAPVGEHSRSNNP----PNLDQNLDTEPESSISQWGELQ 167
Query: 272 HVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNASPATQSSNV 323
S + S S P S+N S + +S T SP S +
Sbjct: 168 LNPSGKTLS--------SQPSRKPTSENPKSESDNSKLPTSVNSPLPDKSLL 211
>gnl|CDD|220430 pfam09829, DUF2057, Uncharacterized protein conserved in bacteria
(DUF2057). This domain, found in various prokaryotic
proteins, has no known function.
Length = 189
Score = 29.2 bits (66), Expect = 2.4
Identities = 10/47 (21%), Positives = 15/47 (31%)
Query: 295 AASQNTSSATQSSTAATQNASPATQSSNVATQNTPVTNLNNAVATLK 341
A N S ++ A A + +VA N A L+
Sbjct: 126 LADYNQKSGAAAAVPAPATAVTTAAAVSVAATTASAQTQNEAEQMLQ 172
>gnl|CDD|223039 PHA03307, PHA03307, transcriptional regulator ICP4; Provisional.
Length = 1352
Score = 29.8 bits (67), Expect = 2.5
Identities = 24/113 (21%), Positives = 33/113 (29%), Gaps = 6/113 (5%)
Query: 210 RGSESTPTVSSKQAVKTTFPPVSTEPSGKPSDPPNSGKTKEPPNSGDDQSSVVNVATDST 269
S S ++ P PS PS P + P S SS ++ ST
Sbjct: 271 EASGWNGPSSRPGPASSSSSPRERSPSPSPSSPGSGPAPSSPRAS-SSSSSSRESSSSST 329
Query: 270 LSHVTSTPSTSAATTTQTTSPPVVSAASQNTS-SATQSSTAATQNASPATQSS 321
+S+ +S P S + A SS S A S
Sbjct: 330 ----SSSSESSRGAAVSPGPSPSRSPSPSRPPPPADPSSPRKRPRPSRAPSSP 378
Score = 29.0 bits (65), Expect = 5.5
Identities = 21/121 (17%), Positives = 34/121 (28%), Gaps = 4/121 (3%)
Query: 211 GSESTPTVSSKQAVKTTFPPVSTEPS-GKPSDPPNSGKTKEPPNSGDDQSSVVNVATDST 269
GS + P SS T PP S PS P +
Sbjct: 104 GSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVA 163
Query: 270 LSHVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNASPATQSSNVATQNTP 329
+S + ++ + T+ S + + AA P S A+ ++P
Sbjct: 164 SDAASSRQAALPLSSPEETARAPSSPPA---EPPPSTPPAAASPRPPRRSSPISASASSP 220
Query: 330 V 330
Sbjct: 221 A 221
>gnl|CDD|235467 PRK05441, murQ, N-acetylmuramic acid-6-phosphate etherase;
Reviewed.
Length = 299
Score = 29.4 bits (67), Expect = 2.6
Identities = 15/49 (30%), Positives = 20/49 (40%), Gaps = 9/49 (18%)
Query: 31 TLAACGGHVDLALLLIERGANIEEVNMPTDSFESPLTLAACGGHVDLAL 79
L A G V LA+++I G + E LA GG + AL
Sbjct: 256 ALEAADGSVKLAIVMILTGLDAAEAKA---------LLARHGGFLRKAL 295
Score = 29.0 bits (66), Expect = 3.8
Identities = 16/57 (28%), Positives = 26/57 (45%), Gaps = 7/57 (12%)
Query: 39 VDLALL---LIERGANI--EEVNMPTDSFESPLTLAACGGHVDLALLLIERGANIEE 90
VD+ L++R I E + + E+ L A G V LA+++I G + E
Sbjct: 225 VDVKATNEKLVDRAVRIVMEATGVSREEAEA--ALEAADGSVKLAIVMILTGLDAAE 279
>gnl|CDD|227502 COG5175, MOT2, Transcriptional repressor [Transcription].
Length = 480
Score = 29.6 bits (66), Expect = 2.6
Identities = 30/177 (16%), Positives = 60/177 (33%), Gaps = 15/177 (8%)
Query: 177 YLLDSGAQEMQTKHIDILEFKKRKQEESKAAFCRGSESTPTVSSKQAVKTTFPPVSTEPS 236
YL + G ++ ++ + + + T +++ T + PS
Sbjct: 221 YLHEPGPEKDSLTKDELCNSQHKLHGSEVRNKNKKRIHRSTSTAR--YDTDLLNFTGTPS 278
Query: 237 GKPSDPPNSGKTK-----------EPPNSGDDQSSVVNVATDSTLSHVTSTPSTSAATTT 285
+ KT P + + QS+ V ++ S T + S T
Sbjct: 279 PAAMEAQFKHKTSRVFKAPDKILFPPLDFTNTQSA-TPVTLSNSSSINLPTLNDSLGHHT 337
Query: 286 QTTSPPVVSAASQNTSSATQSSTAATQNASPATQSSNVATQNTPVTNLNNAVATLKT 342
+TT+ +A S + S + S AA + P N A ++N ++K+
Sbjct: 338 ETTTTTNTNATSHSHGSKKKQSLAAEEYKDPYDALGNAARL-HSLSNYQKRPISIKS 393
>gnl|CDD|223031 PHA03273, PHA03273, envelope glycoprotein C; Provisional.
Length = 486
Score = 29.6 bits (66), Expect = 2.7
Identities = 16/73 (21%), Positives = 37/73 (50%), Gaps = 3/73 (4%)
Query: 259 SSVVNVATDSTLSHVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNASPAT 318
+S +D++ + + STP+T TT+ TSP + N+++A + + TQ + P +
Sbjct: 30 TSSSIENSDNSTAEMQSTPATPTHTTSNLTSP--FGTGTDNSTNANGTE-STTQASQPHS 86
Query: 319 QSSNVATQNTPVT 331
+ + + ++
Sbjct: 87 HETTITCTKSLIS 99
>gnl|CDD|234022 TIGR02813, omega_3_PfaA, polyketide-type polyunsaturated fatty acid
synthase PfaA. Members of the seed for this alignment
are involved in omega-3 polyunsaturated fatty acid
biosynthesis, such as the protein PfaA from the
eicosapentaenoic acid biosynthesis operon in
Photobacterium profundum strain SS9. PfaA is encoded
together with PfaB, PfaC, and PfaD, and the functions of
the individual polypeptides have not yet been described.
More distant homologs of PfaA, also included with the
reach of this model, appear to be involved in
polyketide-like biosynthetic mechanisms of
polyunsaturated fatty acid biosynthesis, an alternative
to the more familiar iterated mechanism of chain
extension and desaturation, and in most cases are encoded
near genes for homologs of PfaB, PfaC, and/or PfaD.
Length = 2582
Score = 30.0 bits (67), Expect = 2.8
Identities = 18/59 (30%), Positives = 26/59 (44%), Gaps = 2/59 (3%)
Query: 264 VATDSTLSHVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSST--AATQNASPATQS 320
V T + + VT + + +A SPPVVSAA + + + A A P QS
Sbjct: 1132 VVTQAPVVQVTISVAPAAPVLPAVVSPPVVSAAPAQSVATAVAMAPVAEVPIAVPVQQS 1190
>gnl|CDD|147982 pfam06112, Herpes_capsid, Gammaherpesvirus capsid protein. This
family consists of several Gammaherpesvirus capsid
proteins. The exact function of this family is unknown.
Length = 148
Score = 28.7 bits (64), Expect = 2.9
Identities = 15/65 (23%), Positives = 29/65 (44%), Gaps = 2/65 (3%)
Query: 265 ATDSTLSHVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNASPATQSSNVA 324
A T S + S + SA++++ + P + S ++ SA S + ++S + S A
Sbjct: 80 AGPQTSSSIGS--ALSASSSSASGVPGGANQLSGSSGSALSSGPGSLSSSSSLSGSGAGA 137
Query: 325 TQNTP 329
P
Sbjct: 138 GDTAP 142
>gnl|CDD|191179 pfam05053, Menin, Menin. MEN1, the gene responsible for multiple
endocrine neoplasia type 1, is a tumour suppressor gene
that encodes a protein called Menin which may be an
atypical GTPase stimulated by nm23.
Length = 618
Score = 29.6 bits (66), Expect = 2.9
Identities = 23/93 (24%), Positives = 34/93 (36%), Gaps = 9/93 (9%)
Query: 232 STEPSGKPSDPP-------NSGKTKEPPNSGDDQS--SVVNVATDSTLSHVTSTPSTSAA 282
S EPSG S P NS N D + + AT ++ TS P ++
Sbjct: 489 SQEPSGGESPNPELPANNNNSNSNNNNNNGADRKEAAATTGNATTTSNGSGTSVPLPVSS 548
Query: 283 TTTQTTSPPVVSAASQNTSSATQSSTAATQNAS 315
Q PV++ S+ + A N+S
Sbjct: 549 EPPQHKEGPVITFYSEKMKGMKELLLAEKLNSS 581
>gnl|CDD|237864 PRK14950, PRK14950, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 585
Score = 29.4 bits (66), Expect = 3.1
Identities = 17/110 (15%), Positives = 27/110 (24%), Gaps = 1/110 (0%)
Query: 224 VKTTFPPVSTEPSGKPSDPPNSGKTKEPPNSGDDQSSVVNVATDSTLSHVTSTPSTSAAT 283
++ PV KP+ S P S +++ T+TP
Sbjct: 356 IEALLVPVPAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPR 415
Query: 284 TTQTTSPPVVSAASQNTSSATQSSTAATQNASPATQSSNVATQNTPVTNL 333
P +A + T +A PA L
Sbjct: 416 PVAPPVPHTPESAPKLTRAAIPVDEKPKYTP-PAPPKEEEKALIADGDVL 464
>gnl|CDD|172358 PRK13831, PRK13831, conjugal transfer protein TrbI; Provisional.
Length = 432
Score = 29.3 bits (66), Expect = 3.2
Identities = 14/60 (23%), Positives = 21/60 (35%), Gaps = 3/60 (5%)
Query: 265 ATDSTLSHVTSTPSTSAATTTQTTSPPVVSAASQNTS---SATQSSTAATQNASPATQSS 321
A DS L+ A T T + + ++ S +A + QN P Q S
Sbjct: 158 AYDSPLAVDIGKVEKRTADTNDTATQTTNATSTAGASDLYAAAMRAGLGGQNVDPNGQKS 217
>gnl|CDD|236766 PRK10811, rne, ribonuclease E; Reviewed.
Length = 1068
Score = 29.6 bits (67), Expect = 3.4
Identities = 27/133 (20%), Positives = 40/133 (30%), Gaps = 9/133 (6%)
Query: 213 ESTPTVSSKQAVKTTFPPVSTE-PSGK-----PSDPPNSGKTKEPPNSGDDQSSVVNVAT 266
E PT S + T S E SGK P P + +E + + Q V
Sbjct: 817 ERYPTQS---PMPLTVACASPEMASGKVWIRYPVVRPQDVQVEEQREAEEVQVQPVVAEV 873
Query: 267 DSTLSHVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNASPATQSSNVATQ 326
+ + + PVV A Q +T A+P T+ V T+
Sbjct: 874 PVAAAVEPVVSAPVVEAVAEVVEEPVVVAEPQPEEVVVVETTHPEVIAAPVTEQPQVITE 933
Query: 327 NTPVTNLNNAVAT 339
+ A
Sbjct: 934 SDVAVAQEVAEHA 946
>gnl|CDD|237874 PRK14971, PRK14971, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 614
Score = 29.4 bits (66), Expect = 3.5
Identities = 15/80 (18%), Positives = 24/80 (30%), Gaps = 10/80 (12%)
Query: 252 PNSGDDQSSVVNVATDSTLSHVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTA-- 309
P + S+ + + S + PS + T +PP VS STA
Sbjct: 386 PAAAPQPSAAAAASPSPSQSSAAAQPSAPQSATQPAGTPPTVSVDPPAAVPVNPPSTAPQ 445
Query: 310 --------ATQNASPATQSS 321
+ + SS
Sbjct: 446 AVRPAQFKEEKKIPVSKVSS 465
Score = 28.6 bits (64), Expect = 5.9
Identities = 17/56 (30%), Positives = 25/56 (44%), Gaps = 3/56 (5%)
Query: 284 TTQTTSPPVVSAASQNTSSATQSSTAATQNASPATQSSNVATQNTPVTNLNNAVAT 339
TQ + P SAA+ + S +QSS AA A QS+ P +++ A
Sbjct: 383 FTQPAAAPQPSAAAAASPSPSQSSAAAQ---PSAPQSATQPAGTPPTVSVDPPAAV 435
>gnl|CDD|144411 pfam00802, Glycoprotein_G, Pneumovirus attachment glycoprotein G.
This family includes attachment proteins from
respiratory synctial virus. Glycoprotein G has not been
shown to have any neuraminidase or hemagglutinin
activity. The amino terminus is thought to be
cytoplasmic, and the carboxyl terminus extracellular.
The extracellular region contains four completely
conserved cysteine residues.
Length = 263
Score = 28.9 bits (64), Expect = 3.6
Identities = 19/86 (22%), Positives = 29/86 (33%), Gaps = 3/86 (3%)
Query: 212 SESTPTVSSKQAVKTTFPPVSTEPSGKPSDPPNSGKTKEPPNSGDDQSSVVNVATDSTLS 271
+TPT ++ T+ + + P N T ++ D ++ T ST
Sbjct: 62 PTTTPTQQITNQIQNHTSTYLTQHNQLSTSPSNQSTTTPLIHTILDDTT---PGTKSTYQ 118
Query: 272 HVTSTPSTSAATTTQTTSPPVVSAAS 297
H T T QT PP S
Sbjct: 119 HTTVGTKGRTTTPAQTNKPPTKPRQS 144
>gnl|CDD|165089 PHA02716, PHA02716, CPXV016; CPX019; EVM010; Provisional.
Length = 764
Score = 29.5 bits (66), Expect = 3.7
Identities = 13/25 (52%), Positives = 16/25 (64%)
Query: 75 VDLALLLIERGANIEEVNDEGYTPL 99
+D+ LI GA+I VN GYTPL
Sbjct: 379 LDVIQCLISLGADITAVNCLGYTPL 403
>gnl|CDD|219419 pfam07462, MSP1_C, Merozoite surface protein 1 (MSP1) C-terminus.
This family represents the C-terminal region of
merozoite surface protein 1 (MSP1) which are found in a
number of Plasmodium species. MSP-1 is a 200-kDa protein
expressed on the surface of the P. vivax merozoite.
MSP-1 of Plasmodium species is synthesised as a
high-molecular-weight precursor and then processed into
several fragments. At the time of red cell invasion by
the merozoite, only the 19-kDa C-terminal fragment
(MSP-119), which contains two epidermal growth
factor-like domains, remains on the surface. Antibodies
against MSP-119 inhibit merozoite entry into red cells,
and immunisation with MSP-119 protects monkeys from
challenging infections. Hence, MSP-119 is considered a
promising vaccine candidate.
Length = 574
Score = 29.1 bits (65), Expect = 3.7
Identities = 20/85 (23%), Positives = 31/85 (36%), Gaps = 1/85 (1%)
Query: 253 NSGDDQSSVVNVATDSTLSHVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQ 312
N+ + + VN A + + + A TT T P +A S + SS +A+
Sbjct: 238 NTNPENNPEVNEALEQYKELLPKGTTQEAKVTTVVTPPQADAAPSPLSVRPAGSSGSASG 297
Query: 313 NASPATQSSNVATQNTPVTNLNNAV 337
+ T S V T L V
Sbjct: 298 STQIPTSGS-VLGPGAAATELQQVV 321
>gnl|CDD|183757 PRK12800, fliF, flagellar MS-ring protein; Reviewed.
Length = 574
Score = 29.2 bits (65), Expect = 3.8
Identities = 13/54 (24%), Positives = 21/54 (38%)
Query: 283 TTTQTTSPPVVSAASQNTSSATQSSTAATQNASPATQSSNVATQNTPVTNLNNA 336
T+T T P A+ N+ + AA +PA + A P + +A
Sbjct: 307 TSTSATGPQGPPGATSNSPGQPPAPAAAGAPGTPAAANGQAAAAAAPTESSKSA 360
>gnl|CDD|237555 PRK13914, PRK13914, invasion associated secreted endopeptidase;
Provisional.
Length = 481
Score = 29.0 bits (64), Expect = 4.3
Identities = 18/111 (16%), Positives = 40/111 (36%)
Query: 218 VSSKQAVKTTFPPVSTEPSGKPSDPPNSGKTKEPPNSGDDQSSVVNVATDSTLSHVTSTP 277
++ KQ T P + ++ + KE N+ + T + T
Sbjct: 241 LAIKQTANTATPKAEVKTEAPAAEKQAAPVVKENTNTNTATTEKKETTTQQQTAPKAPTE 300
Query: 278 STSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNASPATQSSNVATQNT 328
+ A T + + + NT++ +++ ++N + T S+ NT
Sbjct: 301 AAKPAPAPSTNTNANKTNTNTNTNTNNTNTSTPSKNTNTNTNSNTNTNSNT 351
Score = 29.0 bits (64), Expect = 4.6
Identities = 23/115 (20%), Positives = 45/115 (39%), Gaps = 3/115 (2%)
Query: 200 KQEESKAAFCRGSESTPTV---SSKQAVKTTFPPVSTEPSGKPSDPPNSGKTKEPPNSGD 256
K E A ++ P V ++ T +T+ P P + K P++
Sbjct: 253 KAEVKTEAPAAEKQAAPVVKENTNTNTATTEKKETTTQQQTAPKAPTEAAKPAPAPSTNT 312
Query: 257 DQSSVVNVATDSTLSHVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAAT 311
+ + +T + TSTPS + T T + + + + SS S+++A+
Sbjct: 313 NANKTNTNTNTNTNNTNTSTPSKNTNTNTNSNTNTNSNTNANQGSSNNNSNSSAS 367
Score = 28.6 bits (63), Expect = 5.0
Identities = 21/118 (17%), Positives = 43/118 (36%), Gaps = 6/118 (5%)
Query: 231 VSTEPSGKPSDPPNSGKTKEPPNSGDDQSSVVNVATDSTLSHVTSTPSTSAATTTQTTSP 290
++ + + + P KT+ P + V ++T ++ +T T QT
Sbjct: 241 LAIKQTANTATPKAEVKTEAPAA----EKQAAPVVKENTNTNTATTEKKETTTQQQTAPK 296
Query: 291 PVVSAASQNTSSATQSSTAATQNASPATQSSNVATQNTPVTNLNNAVATLKTAVATLN 348
AA + A ++T A + + ++N +TP N N + + N
Sbjct: 297 APTEAA--KPAPAPSTNTNANKTNTNTNTNTNNTNTSTPSKNTNTNTNSNTNTNSNTN 352
>gnl|CDD|148679 pfam07218, RAP1, Rhoptry-associated protein 1 (RAP-1). This family
consists of several rhoptry-associated protein 1 (RAP-1)
sequences which appear to be specific to Plasmodium
falciparum.
Length = 790
Score = 28.9 bits (64), Expect = 4.8
Identities = 17/138 (12%), Positives = 42/138 (30%), Gaps = 7/138 (5%)
Query: 218 VSSKQAVKTTFPPVSTEPSGKPSDPPNSGKTKEPPNSG------DDQSSVVNVATDSTLS 271
+ +K + ++ + GK++ S DD + + +
Sbjct: 70 LENKASKDDGNINLTDTSENGDASKKGHGKSRVRSASAAAILEEDDSKDDMEFKANPNEA 129
Query: 272 HVTSTPSTSAATTTQTTSPPVVSAASQ-NTSSATQSSTAATQNASPATQSSNVATQNTPV 330
P + ++S A+++ + SA++ + + + S A+ V
Sbjct: 130 GKPGKPKGNQGEGLASSSDGKSKASAKSGSKSASKHGESNSSDESATDSGKASASVAGIV 189
Query: 331 TNLNNAVATLKTAVATLN 348
A K + L
Sbjct: 190 GADEEAPPAPKNTLTPLE 207
>gnl|CDD|217503 pfam03344, Daxx, Daxx Family. The Daxx protein (also known as the
Fas-binding protein) is thought to play a role in
apoptosis, but precise role played by Daxx remains to be
determined. Daxx forms a complex with Axin.
Length = 715
Score = 28.7 bits (64), Expect = 4.9
Identities = 25/115 (21%), Positives = 36/115 (31%), Gaps = 12/115 (10%)
Query: 220 SKQAVKTTFPPVSTEPSGKPSDPP--NSGKTKEPPNSGDDQSSVVNVATDSTLSHVTSTP 277
S TT + + + PP K KE S + + H S
Sbjct: 598 SATVTSTTRNGNVSPHTPQDEQPPSGRKRKRKEEVESEPLGNQYL-------RHHNGSEK 650
Query: 278 STSAATTTQTTSPPVVSAASQNTSSATQ---SSTAATQNASPATQSSNVATQNTP 329
A TS V+ +S + + +S+ T P NVATQ P
Sbjct: 651 DGLPAPMDPVTSCSPVADSSTRVDTPSHELVTSSPQTPGDPPKKNKVNVATQCDP 705
>gnl|CDD|216257 pfam01034, Syndecan, Syndecan domain. Syndecans are transmembrane
heparin sulfate proteoglycans which are implicated in
the binding of extracellular matrix components and
growth factors.
Length = 207
Score = 28.2 bits (63), Expect = 5.1
Identities = 18/75 (24%), Positives = 30/75 (40%), Gaps = 1/75 (1%)
Query: 251 PPNSGDDQSSVVNVATDSTLSHVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAA 310
DD+ S V T +T +T+T S+ + TT ++ S T + T S +
Sbjct: 59 GATPSDDEDSEP-VTTSATPPKLTTTSSSPSNDTTTASTSTKTSPTVSTTVTTTTSPSET 117
Query: 311 TQNASPATQSSNVAT 325
+ T S+ T
Sbjct: 118 DTEEATTTVSTETPT 132
Score = 27.4 bits (61), Expect = 9.5
Identities = 16/75 (21%), Positives = 31/75 (41%), Gaps = 7/75 (9%)
Query: 241 DPPNSGKTKEPPNSGDDQSSVVNVATDSTLSHVTSTPSTSAATTTQTTSPPVVSAASQNT 300
PP T P++ +S + + + VT+T S S T + T + +T
Sbjct: 76 TPPKLTTTSSSPSNDTTTASTSTKTSPTVSTTVTTTTSPSETDTEEAT-------TTVST 128
Query: 301 SSATQSSTAATQNAS 315
+ T+ ++A + S
Sbjct: 129 ETPTEGGSSAATDPS 143
>gnl|CDD|218673 pfam05642, Sporozoite_P67, Sporozoite P67 surface antigen. This
family consists of several Theileria P67 surface
antigens. A stage specific surface antigen of Theileria
parva, p67, is the basis for the development of an
anti-sporozoite vaccine for the control of East Coast
fever (ECF) in cattle. The antigen has been shown to
contain five distinct linear peptide sequences
recognised by sporozoite-neutralising murine monoclonal
antibodies.
Length = 727
Score = 28.9 bits (64), Expect = 5.5
Identities = 20/49 (40%), Positives = 26/49 (53%), Gaps = 2/49 (4%)
Query: 267 DSTLSHVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNAS 315
DST S T+ PS S TTT +S P SA SS+ + T +T + S
Sbjct: 272 DSTSSPSTTRPSGSTTTTTPASSGP--SAPGGPGSSSRNAVTRSTDSIS 318
>gnl|CDD|222812 PHA00727, PHA00727, hypothetical protein.
Length = 278
Score = 28.3 bits (63), Expect = 5.5
Identities = 13/29 (44%), Positives = 19/29 (65%)
Query: 272 HVTSTPSTSAATTTQTTSPPVVSAASQNT 300
HVT+T +T ATTT TT+ P + S ++
Sbjct: 248 HVTNTTTTHLATTTTTTTSPSFTIPSNSS 276
>gnl|CDD|185214 PRK15314, PRK15314, outer membrane protein RatB; Provisional.
Length = 2435
Score = 29.0 bits (64), Expect = 5.5
Identities = 22/82 (26%), Positives = 38/82 (46%), Gaps = 1/82 (1%)
Query: 252 PNSGDDQSSVVNVATDSTLSHVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAAT 311
PN+G Q +N T S T+T + +T++P ++ S SA ++ A
Sbjct: 2117 PNTGYYQFINLNSGGKGTASSSTATGAQVCLVEPRTSTPASITLTSTAMDSAKNAAVVAK 2176
Query: 312 QNASPATQSSNVATQNTPVTNL 333
+A P T + ++ N PV N+
Sbjct: 2177 GSAMPLTVTVKDSSGN-PVANV 2197
>gnl|CDD|173184 PRK14721, flhF, flagellar biosynthesis regulator FlhF; Provisional.
Length = 420
Score = 28.4 bits (63), Expect = 6.0
Identities = 16/59 (27%), Positives = 25/59 (42%)
Query: 268 STLSHVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNASPATQSSNVATQ 326
S+L+ T +T TT + T+PP AS ATQ + S+N+ +
Sbjct: 49 SSLTSDTPEEATIPETTVKPTAPPRQKPASGQPQPPAIHKQPATQPPAADIPSANIMQE 107
>gnl|CDD|222925 PHA02743, PHA02743, Viral ankyrin protein; Provisional.
Length = 166
Score = 27.9 bits (62), Expect = 6.1
Identities = 17/55 (30%), Positives = 26/55 (47%), Gaps = 1/55 (1%)
Query: 113 VLLQKGAKINAHTEETQETALTLACCGGFLDVADFLLKNGANIELGASTPLMEAA 167
+L+ GA INA T T L +A ++A++L + + LGA E A
Sbjct: 78 LLVNMGADINARELGTGNTLLHIAASTKNYELAEWLCRQ-LGVNLGAINYQHETA 131
>gnl|CDD|185616 PTZ00436, PTZ00436, 60S ribosomal protein L19-like protein;
Provisional.
Length = 357
Score = 28.4 bits (62), Expect = 6.1
Identities = 20/108 (18%), Positives = 34/108 (31%), Gaps = 1/108 (0%)
Query: 212 SESTPTVSSKQAVKTTFPPV-STEPSGKPSDPPNSGKTKEPPNSGDDQSSVVNVATDSTL 270
+ + P ++ K PP + P K + PP + + A +
Sbjct: 239 AAAAPAKAAAAPAKAAAPPAKAAAPPAKAAAPPAKAAAPPAKAAAPPAKAAAPPAKAAAA 298
Query: 271 SHVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNASPAT 318
+ AA + P AA+ +AT + AA A A
Sbjct: 299 PAKAAAAPAKAAAAPAKAAAPPAKAAAPPAKAATPPAKAAAPPAKAAA 346
>gnl|CDD|115579 pfam06933, SSP160, Special lobe-specific silk protein SSP160. This
family consists of several special lobe-specific silk
protein SSP160 sequences which appear to be specific to
Chironomus (Midge) species.
Length = 758
Score = 28.6 bits (63), Expect = 6.1
Identities = 15/57 (26%), Positives = 34/57 (59%)
Query: 254 SGDDQSSVVNVATDSTLSHVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAA 310
SG +S + ++ ++ S+ ST S ++ T++ +T+ S +S N++S+ +S A+
Sbjct: 100 SGSGSASGNSSSSANSTSNSNSTTSNNSTTSSNSTTTTSNSTSSSNSTSSGLTSGAS 156
>gnl|CDD|178748 PLN03209, PLN03209, translocon at the inner envelope of chloroplast
subunit 62; Provisional.
Length = 576
Score = 28.4 bits (63), Expect = 6.4
Identities = 23/103 (22%), Positives = 36/103 (34%), Gaps = 2/103 (1%)
Query: 229 PPVSTEPSGKPSDPPNSGKTKEPPNSGDDQSSV--VNVATDSTLSHVTSTPSTSAATTTQ 286
PP S+ S K D + S S+V V A +P
Sbjct: 390 PPSSSPASSKSVDAVAKPAEPDVVPSPGSASNVPEVEPAQVEAKKTRPLSPYARYEDLKP 449
Query: 287 TTSPPVVSAASQNTSSATQSSTAATQNASPATQSSNVATQNTP 329
TSP + + S ++ SS A + +PAT +++ A
Sbjct: 450 PTSPSPTAPTGVSPSVSSTSSVPAVPDTAPATAATDAAAPPPA 492
>gnl|CDD|218908 pfam06136, DUF966, Domain of unknown function (DUF966). Family of
plant proteins with unknown function.
Length = 308
Score = 28.2 bits (63), Expect = 6.4
Identities = 20/114 (17%), Positives = 39/114 (34%), Gaps = 17/114 (14%)
Query: 210 RGSESTPTVSSKQAVKTTFPPVSTEPSGKPSDPPNSGKTKEPP-NSGDDQSSVVNVATDS 268
+GSE + SSK + S+ + SD P + +S D V
Sbjct: 84 KGSEILDSSSSKGDPEEA----SSRKLQEESDTPPVNRRANQSWSSSDLAEYKV------ 133
Query: 269 TLSHVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNASPATQSSN 322
A +TQT ++ ++ ++ + ++S + SS+
Sbjct: 134 ------YKAEEPADASTQTDDRRSRDSSEAESTELSREEISPPSSSSSPSSSSS 181
>gnl|CDD|219071 pfam06525, SoxE, Sulfocyanin (SoxE). This family consists of
several archaeal sulfocyanin (or blue copper protein)
sequences from a number of Sulfolobus species.
Length = 195
Score = 27.9 bits (62), Expect = 6.9
Identities = 10/46 (21%), Positives = 22/46 (47%)
Query: 272 HVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNASPA 317
+ S+P++++ T++ TT PP ++S+ T + P
Sbjct: 29 SMLSSPTSTSTTSSSTTLPPGAITLPYDSSNKTVFIYLVVLSTGPT 74
>gnl|CDD|139494 PRK13335, PRK13335, superantigen-like protein; Reviewed.
Length = 356
Score = 28.2 bits (62), Expect = 7.1
Identities = 11/56 (19%), Positives = 20/56 (35%), Gaps = 7/56 (12%)
Query: 274 TSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNASPATQSSNVATQNTP 329
T+ TSA+ + + P + S S+T A + T + +T
Sbjct: 82 TNEEKTSASKIEKISQPK-----QEEQKSLNISATPAPKQEQSQTTTE--STTPKT 130
>gnl|CDD|224680 COG1766, fliF, Flagellar basal body M-ring protein [Cell motility
and secretion].
Length = 545
Score = 28.1 bits (63), Expect = 7.2
Identities = 18/70 (25%), Positives = 25/70 (35%), Gaps = 7/70 (10%)
Query: 273 VTSTPSTSAATTTQTTSPPV---VSAASQNTSSATQSSTAATQNASPATQSSNVATQNTP 329
VT+ SA +T+ P + S+ T ++ S TQ P SN P
Sbjct: 267 VTAELDFSAKESTEEKYDPNGREPAVRSEQTVTSLSQSGQGTQGGVPGALESNQ----PP 322
Query: 330 VTNLNNAVAT 339
V N T
Sbjct: 323 VAGNQNGKDT 332
>gnl|CDD|236792 PRK10905, PRK10905, cell division protein DamX; Validated.
Length = 328
Score = 28.0 bits (62), Expect = 7.4
Identities = 17/73 (23%), Positives = 29/73 (39%), Gaps = 2/73 (2%)
Query: 273 VTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNASPATQSSNVATQNTPVTN 332
T A T + T P A+++ ++ + + T +P +S T TP
Sbjct: 171 TAKTEPKPVAQTPKRTEPAAPVASTKAPAATSTPAPKETATTAPVQTASPAQTTATPAAG 230
Query: 333 LNNA--VATLKTA 343
A V +LK+A
Sbjct: 231 GKTAGNVGSLKSA 243
>gnl|CDD|184285 PRK13733, PRK13733, conjugal transfer protein TraV; Provisional.
Length = 171
Score = 27.4 bits (61), Expect = 7.4
Identities = 14/93 (15%), Positives = 27/93 (29%), Gaps = 3/93 (3%)
Query: 197 KKRKQEESKAAFCRGSESTPTVSSKQAVKTTFPPVST--EPSGKPSDPPNSGKT-KEPPN 253
+K K+ E + + S P ++ V+ +P+ + P
Sbjct: 44 EKAKKLEQSSDAKPAAASLPRLAEGNFRTMPVQTVTATTPSGSRPAVTATPEQKLLAPRP 103
Query: 254 SGDDQSSVVNVATDSTLSHVTSTPSTSAATTTQ 286
V V S+++ VT T
Sbjct: 104 LFTAAREVKTVVPVSSVTPVTPPRPLRTGEQTA 136
>gnl|CDD|202716 pfam03646, FlaG, FlaG protein. Although important for flagella the
exact function of this protein is unknown.
Length = 107
Score = 26.8 bits (60), Expect = 7.5
Identities = 11/58 (18%), Positives = 23/58 (39%)
Query: 295 AASQNTSSATQSSTAATQNASPATQSSNVATQNTPVTNLNNAVATLKTAVATLNRHCR 352
AA ++ +A + +A Q + + + + L AV L + +LN +
Sbjct: 1 AAPSSSQTAAEKVVSAAQAVAEEAEETAAEEKEVSKEELEEAVEKLNKFLQSLNTNLE 58
>gnl|CDD|218597 pfam05466, BASP1, Brain acid soluble protein 1 (BASP1 protein).
This family consists of several brain acid soluble
protein 1 (BASP1) or neuronal axonal membrane protein
NAP-22. The BASP1 is a neuron enriched Ca(2+)-dependent
calmodulin-binding protein of unknown function.
Length = 233
Score = 27.9 bits (61), Expect = 7.5
Identities = 20/109 (18%), Positives = 48/109 (44%), Gaps = 5/109 (4%)
Query: 230 PVSTEPSGKPSDPPNSGKTKEPPNSGDDQSSVVNVATDSTLSHVTSTPSTSAATTTQTTS 289
P ++E S +P++ K +E + A + + + P++ + ++ +
Sbjct: 127 PKASEASSQPAESAAPAKEEEKSKEEGEAKKTEAPAAAAQETKSDAAPASDSKPSSSEAA 186
Query: 290 PPVVSAASQNTSSATQSSTAATQNASPATQSSNVATQNTPVTNLNNAVA 338
P +S+ T +AT++ ++ + ++PA + V P N + VA
Sbjct: 187 P-----SSKETPAATEAPSSTAKASAPAAPAEEVKPSEAPAANSDQTVA 230
>gnl|CDD|203841 pfam08030, NAD_binding_6, Ferric reductase NAD binding domain.
Length = 149
Score = 27.3 bits (61), Expect = 7.6
Identities = 14/70 (20%), Positives = 27/70 (38%), Gaps = 1/70 (1%)
Query: 170 GHLELVRYLLDSGAQEMQTKHIDILEFKKRKQEESKAAFCRGSESTPTVSSKQAVKTTFP 229
LE + +L+ + + +I+I + + E A+ SE SK + +
Sbjct: 45 SSLEWFKDVLNELEELKELGNIEIHIYLTGEYEAEDASDASDSEQISMFDSKNHEEISGT 104
Query: 230 PVSTEPSGKP 239
V G+P
Sbjct: 105 RVEFHF-GRP 113
>gnl|CDD|216513 pfam01456, Mucin, Mucin-like glycoprotein. This family of
trypanosomal proteins resemble vertebrate mucins. The
protein consists of three regions. The N and C terminii
are conserved between all members of the family, whereas
the central region is not well conserved and contains a
large number of threonine residues which can be
glycosylated. Indirect evidence suggested that these
genes might encode the core protein of parasite mucins,
glycoproteins that were proposed to be involved in the
interaction with, and invasion of, mammalian host cells.
This family contains an N-terminal signal peptide.
Length = 143
Score = 27.5 bits (60), Expect = 7.7
Identities = 17/61 (27%), Positives = 31/61 (50%)
Query: 268 STLSHVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNASPATQSSNVATQN 327
+T + T+ P+T+ TTT TT+ + + T++ T ++T T+ S T +S T
Sbjct: 46 TTTTTTTTPPTTTTTTTTTTTTITTTTTKTTTTTTTTTTTTTTTEAPSKNTTTSEAPTTT 105
Query: 328 T 328
Sbjct: 106 D 106
>gnl|CDD|236733 PRK10672, PRK10672, rare lipoprotein A; Provisional.
Length = 361
Score = 28.1 bits (63), Expect = 8.0
Identities = 31/118 (26%), Positives = 43/118 (36%), Gaps = 14/118 (11%)
Query: 211 GSESTPTVSSKQAVKTTF----PPVSTEPSGKPSDPPNSGKTKEPPNSGDDQ---SSVVN 263
GS S P + K ++ P + G PS P P GD +S +
Sbjct: 175 GSLSGPGTAGTTVAKQSYALPARPDLSGGMGTPSVQP------APAPQGDVLPVSNSTLK 228
Query: 264 VATDSTLSHVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNASPATQSS 321
+ D T + VTS+ A TT SSA ++ AA + AT SS
Sbjct: 229 -SEDPTGAPVTSSGFLGAPTTLAPGVLEGSEPTPTAPSSAPATAPAAAAPQAAATSSS 285
>gnl|CDD|235513 PRK05579, PRK05579, bifunctional phosphopantothenoylcysteine
decarboxylase/phosphopantothenate synthase; Validated.
Length = 399
Score = 28.2 bits (64), Expect = 8.1
Identities = 19/68 (27%), Positives = 30/68 (44%), Gaps = 26/68 (38%)
Query: 39 VDLALLLIERGANIEEVNMPTDS---FESPLTL-AACG----------------GHVDLA 78
++L L + GA++ V T++ F +PLT A G GH++LA
Sbjct: 23 LELVRRLRKAGADVRVV--MTEAAKKFVTPLTFQALSGNPVSTDLWDPAAEAAMGHIELA 80
Query: 79 ----LLLI 82
L+LI
Sbjct: 81 KWADLVLI 88
>gnl|CDD|237191 PRK12757, PRK12757, cell division protein FtsN; Provisional.
Length = 256
Score = 27.7 bits (62), Expect = 8.1
Identities = 11/55 (20%), Positives = 16/55 (29%)
Query: 268 STLSHVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNASPATQSSN 322
ST+ TT P + + TQ A+P TQ+
Sbjct: 117 STVQIQQQAQQQQPPATTAQPQPVTPPRQTTAPVQPQTPAPVRTQPAAPVTQAVE 171
>gnl|CDD|236709 PRK10531, PRK10531, acyl-CoA thioesterase; Provisional.
Length = 422
Score = 27.8 bits (62), Expect = 8.4
Identities = 17/64 (26%), Positives = 26/64 (40%), Gaps = 3/64 (4%)
Query: 264 VATDSTLSHVTSTP---STSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNASPATQS 320
+A TL+ +STP S T T S P++SA+ +A + A+P S
Sbjct: 13 LAFGVTLTACSSTPPDQRPSDQTAPGTASRPILSASEAKNFTAAHYFASLGPIAAPWNPS 72
Query: 321 SNVA 324
Sbjct: 73 PITL 76
>gnl|CDD|221825 pfam12877, DUF3827, Domain of unknown function (DUF3827). This
family contains the human KIAA1549 protein which has
been found to be fused fused to BRAF gene in many cases
of pilocytic astrocytomas. The fusion is due mainly to a
tandem duplication of 2 Mb at 7q34. Although nothing is
known about the function of KIAA1549 protein, the BRAF
protein is a well characterized oncoprotein. It is a
serine/threonine protein kinase which is implicated in
MAP/ERK signalling, a critical pathway for the
regulation of cell division, differentiation and
secretion.
Length = 684
Score = 28.0 bits (62), Expect = 8.4
Identities = 22/91 (24%), Positives = 31/91 (34%), Gaps = 9/91 (9%)
Query: 229 PPVSTEPSGKPSDPPNSGKTKEP------PNSGDDQ--SSVVNVATDSTLSHVTSTPSTS 280
S PS PS TKE P SG D+ SS LS +S P
Sbjct: 403 REKSGRPSTTPSVTAQQKPTKEEGRKKPAPPSGTDEQLSSASIFEHVDRLSRPSSDPYDR 462
Query: 281 AATTTQTTS-PPVVSAASQNTSSATQSSTAA 310
++ Q + P+ + ++ AA
Sbjct: 463 SSGKIQLIAMQPMPAPPVPPRFEPSRDDRAA 493
>gnl|CDD|233728 TIGR02102, pullulan_Gpos, pullulanase, extracellular, Gram-positive.
Pullulan is an unusual, industrially important
polysaccharide in which short alpha-1,4 chains
(maltotriose) are connected in alpha-1,6 linkages.
Enzymes that cleave alpha-1,6 linkages in pullulan and
release maltotriose are called pullulanases although
pullulan itself may not be the natural substrate. In
contrast, a glycogen debranching enzyme such GlgX,
homologous to this family, can release glucose at
alpha,1-6 linkages from glycogen first subjected to limit
degradation by phosphorylase. Characterized members of
this family include a surface-located pullulanase from
Streptococcus pneumoniae (PMID:11083842) and an
extracellular bifunctional amylase/pullulanase with
C-terminal pullulanase activity (PMID:8798645).
Length = 1111
Score = 28.3 bits (63), Expect = 8.4
Identities = 13/56 (23%), Positives = 18/56 (32%), Gaps = 4/56 (7%)
Query: 211 GSESTPTVSSKQAVKTTFPPVSTEPSGKPSDPPNSGKTKE---PPNSGDDQSSVVN 263
E TP + + PP KP D + K+ PP S D+
Sbjct: 1009 APEKTPP-PPEHEPQAPKPPTQDPDGSKPKDKVDPKDNKDPLTPPGSDDENGETPK 1063
>gnl|CDD|132697 TIGR03658, IsdH_HarA, haptoglobin-binding heme uptake protein HarA.
HarA is a heme-binding NEAT-domain (NEAr Transporter,
pfam05031) protein which has been shown to bind to the
haptoglobin-hemoglobin complex in order to extract heme
from it. HarA has also been reported to bind hemoglobin
directly. HarA (also known as IsdH) contains three NEAT
domains as well as a sortase A C-terminal signal for
localization to the cell wall. The heme bound at the
third of these NEAT domains has been shown to be
transferred to the IsdA protein also localized at the
cell wall, presumably through an additional specific
protein-protein interaction. Haptoglobin is a hemoglobin
carrier protein involved in scavenging hemoglobin in the
blood following red blood cell lysis and targetting it
to the liver.
Length = 895
Score = 28.3 bits (62), Expect = 8.6
Identities = 21/65 (32%), Positives = 30/65 (46%), Gaps = 4/65 (6%)
Query: 268 STLSHVTSTPSTSAATTTQTTSPPVVSAASQNTSSATQSSTAATQNASPATQSSNVATQN 327
STL +TS AA T T+ + +QN ++ T T PATQ +N A +N
Sbjct: 27 STLFLITSQHQAQAAENTNTSDK---ISENQNNNATTTQPPKDTNQTQPATQPANTA-KN 82
Query: 328 TPVTN 332
P +
Sbjct: 83 YPAAD 87
>gnl|CDD|223033 PHA03291, PHA03291, envelope glycoprotein I; Provisional.
Length = 401
Score = 28.0 bits (62), Expect = 9.4
Identities = 16/95 (16%), Positives = 30/95 (31%)
Query: 235 PSGKPSDPPNSGKTKEPPNSGDDQSSVVNVATDSTLSHVTSTPSTSAATTTQTTSPPVVS 294
G + PP + + S + T P+ + +TT P +
Sbjct: 168 AEGTLAAPPLGEGSADGSCDPALPLSAPRLGPADVFVPATPRPTPRTTASPETTPTPSTT 227
Query: 295 AASQNTSSATQSSTAATQNASPATQSSNVATQNTP 329
+ +T+ S+T A A ++ TP
Sbjct: 228 TSPPSTTIPAPSTTIAAPQAGTTPEAEGTPAPPTP 262
>gnl|CDD|219627 pfam07897, DUF1675, Protein of unknown function (DUF1675). The
members of this family are sequences derived from
hypothetical plant proteins of unknown function. One
member of this family is annotated as a putative
RNA-binding protein, but no evidence was found to
support this.
Length = 283
Score = 27.6 bits (61), Expect = 9.4
Identities = 29/146 (19%), Positives = 53/146 (36%), Gaps = 22/146 (15%)
Query: 185 EMQTKHIDILEFKKRKQEESKAAFCRGSESTPTVS------------SKQAVKTTFPPVS 232
EMQ+ KRK+ E + + +++ S + V+T P +
Sbjct: 77 EMQSLR---RLEAKRKRSEKEYNGVSNGDDMDSINAANGGGSGRDKESSKGVETGSPIAT 133
Query: 233 TEPSGKPS-----DPPNSGKTKEPPNSGDDQSSVVNVATDSTLSHVTSTPSTSAATTTQT 287
+ SG PS + + + +S V N+ T + + + P + + T
Sbjct: 134 SSSSGGPSSSGLSELDVGVIGGQSGQTKQVRSPVNNILTGTESTSHGTDPVDAETNKSNT 193
Query: 288 TSPPVVSAASQNTSSATQSSTAATQN 313
TSP + + S T S A+ N
Sbjct: 194 TSPT--KRSRRPPSVDTNSGGEASLN 217
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.308 0.122 0.340
Gapped
Lambda K H
0.267 0.0742 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 16,274,125
Number of extensions: 1487585
Number of successful extensions: 2642
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2228
Number of HSP's successfully gapped: 404
Length of query: 352
Length of database: 10,937,602
Length adjustment: 98
Effective length of query: 254
Effective length of database: 6,590,910
Effective search space: 1674091140
Effective search space used: 1674091140
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.7 bits)
S2: 59 (26.5 bits)