RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy16067
(1386 letters)
>gnl|CDD|188655 cd09823, peroxinectin_like, peroxinectin_like animal heme
peroxidases. Peroxinectin is an arthropod protein that
plays a role in invertebrate immunity mechanisms.
Specifically, peroxinectins are secreted as cell-adhesive
and opsonic peroxidases. The immunity mechanism appears
to involve an interaction between peroxinectin and a
transmembrane receptor of the integrin family. Human
myeloperoxidase, which is included in this wider family,
has also been reported to interact with integrins.
Length = 378
Score = 272 bits (698), Expect = 2e-81
Identities = 142/485 (29%), Positives = 198/485 (40%), Gaps = 136/485 (28%)
Query: 693 QIVQSTHFIDVSFMYGSHPDKARELRSLSRGKLRVREVNGRTFMPTGDPSNCVNASSNIC 752
Q+ Q T F+D S +YGS ++AR+L RTF
Sbjct: 3 QLNQVTSFLDGSQVYGSSEEEARKL---------------RTF----------------- 30
Query: 753 YDAGDIRVNQQLDLAVSQTVWLRFHNYVAEKLIQQNPSWSNRDELVYQEARRLVIGCFQH 812
G ++ + + L F N + S CF
Sbjct: 31 -KGGLLKTQR-----RNGRELLPFSNNPTDD--CSLSSAGKP--------------CF-- 66
Query: 813 IVYNEWLPILIGDIRVNQQLDLAVSQTVWLRFHNYVAEKLIQQNPSWSNRDELVYQEARR 872
L GD RVN+Q L T++LR HN +A++L + NP W DE ++QEAR+
Sbjct: 67 ---------LAGDGRVNEQPGLTSMHTLFLREHNRIADELKKLNPHWD--DERLFQEARK 115
Query: 873 LVIGCFQHIVYNEWLPILIGESYTKDVGLSPTSSGYFQGYDPEVNPSTLADFAAGAFRGE 932
+VI QHI YNE+LPIL+G + GL +SGYF GYDP V+PS L +FAA AFR
Sbjct: 116 IVIAQMQHITYNEFLPILLGRELMEKFGLYLLTSGYFNGYDPNVDPSILNEFAAAAFR-- 173
Query: 933 SYTKDVGLSPTSSGYFQGYDPEVNPSTLADFAAGAFRGLHSLIPGTIKLVNERRVTNQEL 992
+ HSL+PGT + ++E +
Sbjct: 174 --------------FG-----------------------HSLVPGTFERLDENYRPQGSV 196
Query: 993 PFSSTMQHPGDILEVMDVFDSTLRGLTNQLQQYQDRFITEQTIDFLFRVGSLSLEKVIVD 1052
+P + E D LRGL Q Q DRF T++ F G +D
Sbjct: 197 NLHDLFFNPDRLYEE-GGLDPLLRGLATQPAQKVDRFFTDELTTHFFFRGGNPFG---LD 252
Query: 1053 LVGLTL------GFF----YLWYCGG-----FLLQ--------VTRLSQVYRGVNDIDYY 1089
L L + G Y +CG F + +L ++Y+ V+DID Y
Sbjct: 253 LAALNIQRGRDHGLPGYNDYREFCGLPRATTFDDLLGIMSPETIQKLRRLYKSVDDIDLY 312
Query: 1090 IGGLLESVIRGTLSSPSFRCVFAEAFYRYKYGDRYFYEGPPATNPGAFTPGKQLTLTSLT 1149
+GGL E + G L P+F C+ E F R + GDR++YE P +FTP QL
Sbjct: 313 VGGLSEKPVPGGLVGPTFACIIGEQFRRLRRGDRFWYENGGQ--PSSFTP-AQLNEIRKV 369
Query: 1150 SLASL 1154
SLA +
Sbjct: 370 SLARI 374
Score = 158 bits (401), Expect = 7e-42
Identities = 60/130 (46%), Positives = 77/130 (59%), Gaps = 3/130 (2%)
Query: 277 RGEFGDDLAARDIQRGRDYGLRPYVDYRELCGLSTVRSFGDLNDFMDSEQVTRLSQVYRG 336
FG DLAA +IQRGRD+GL Y DYRE CGL +F DL M E + +L ++Y+
Sbjct: 246 GNPFGLDLAALNIQRGRDHGLPGYNDYREFCGLPRATTFDDLLGIMSPETIQKLRRLYKS 305
Query: 337 VNDIDYYIGGLLESVIRGTLSSPSFRCVFAEAFYRYKYGDRYFYEGPPATNPGAFTPGKQ 396
V+DID Y+GGL E + G L P+F C+ E F R + GDR++YE P +FTP Q
Sbjct: 306 VDDIDLYVGGLSEKPVPGGLVGPTFACIIGEQFRRLRRGDRFWYENGGQ--PSSFTP-AQ 362
Query: 397 LTLTSLTSLA 406
L SLA
Sbjct: 363 LNEIRKVSLA 372
Score = 30.6 bits (70), Expect = 6.6
Identities = 10/17 (58%), Positives = 14/17 (82%)
Query: 540 EAQLDAIKKVFLSQIIC 556
AQL+ I+KV L++IIC
Sbjct: 360 PAQLNEIRKVSLARIIC 376
>gnl|CDD|240341 PTZ00272, PTZ00272, heat shock protein 83 kDa (Hsp83); Provisional.
Length = 701
Score = 250 bits (639), Expect = 3e-70
Identities = 109/157 (69%), Positives = 138/157 (87%)
Query: 1227 IIPNKEEKTLTIIDSGIGMTKADLVNNLGTIAKSGTKAFMEALQAGADISMIGQFGVGFY 1286
++P+KE KTLT+ D+GIGMTKADLVNNLGTIA+SGTKAFMEAL+AG D+SMIGQFGVGFY
Sbjct: 65 VVPDKENKTLTVEDNGIGMTKADLVNNLGTIARSGTKAFMEALEAGGDMSMIGQFGVGFY 124
Query: 1287 SAYLVADKVTVTSKHNDDEQYIWESSAGGSFTIKPDNSEPLGRGTKIVLHMKEDQAEYIE 1346
SAYLVAD+VTVTSK+N DE Y+WESSAGG+FTI + RGT+I LH+KEDQ EY+E
Sbjct: 125 SAYLVADRVTVTSKNNSDESYVWESSAGGTFTITSTPESDMKRGTRITLHLKEDQMEYLE 184
Query: 1347 EKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEE 1383
+++KE++KKHS+FIGY I+L+VEK EKE+++++EE
Sbjct: 185 PRRLKELIKKHSEFIGYDIELMVEKTTEKEVTDEDEE 221
Score = 138 bits (349), Expect = 2e-33
Identities = 63/121 (52%), Positives = 88/121 (72%), Gaps = 6/121 (4%)
Query: 434 MTVTSKHNDDEQYIWESSAGGSFTIKPDNSEPLGRGTKIVLHMKEDQAEYIEEKKIKEIV 493
+TVTSK+N DE Y+WESSAGG+FTI + RGT+I LH+KEDQ EY+E +++KE++
Sbjct: 133 VTVTSKNNSDESYVWESSAGGTFTITSTPESDMKRGTRITLHLKEDQMEYLEPRRLKELI 192
Query: 494 KKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEEDKTPKLE-----DEAQLDAIKK 548
KKHS+FIGY I+L+VEK EKE++ DE+EE ++ E+ + PK+E DE + KK
Sbjct: 193 KKHSEFIGYDIELMVEKTTEKEVT-DEDEEDTKKADEDGEEPKVEEVKEGDEGKKKKTKK 251
Query: 549 V 549
V
Sbjct: 252 V 252
Score = 63.5 bits (154), Expect = 6e-10
Identities = 28/40 (70%), Positives = 35/40 (87%)
Query: 1 IVETLRQKADADKNDKAVKDLVNLLFETSLLSSGFTLEDP 40
I++ LR++ AD+NDKAVKDLV LLF+TSLL+SGF LEDP
Sbjct: 614 IIKELRRRVGADENDKAVKDLVFLLFDTSLLTSGFQLEDP 653
>gnl|CDD|217366 pfam03098, An_peroxidase, Animal haem peroxidase.
Length = 521
Score = 227 bits (580), Expect = 5e-64
Identities = 111/357 (31%), Positives = 163/357 (45%), Gaps = 79/357 (22%)
Query: 822 LIGDIRVNQQLDLAVSQTVWLRFHNYVAEKLIQQNPSWSNRDELVYQEARRLVIGCFQHI 881
L GD RVN+ L T++LR HN +A++L NP WS DE ++QEAR +VI +Q I
Sbjct: 204 LAGDSRVNENPGLTALHTLFLREHNRIADELKALNPHWS--DEKLFQEARLIVIAQYQKI 261
Query: 882 VYNEWLPILIGESYTKDVGLSPTSSGYFQGYDPEVNPSTLADFAAGAFRGESYTKDVGLS 941
YNE+LP L+G + L
Sbjct: 262 TYNEYLPALLGPDLMRANWLLL-------------------------------------- 283
Query: 942 PTSSGYFQGYDPEVNPSTLADFAAGAFRGLHSLIPGTIKLVNERRVTNQELPFSSTMQHP 1001
Y+ GYDP V+PS +FA A+R HSLIP ++ ++E R E+P T +P
Sbjct: 284 -----YYTGYDPNVDPSISNEFATAAYRFGHSLIPPGLERLDEFRTIAPEIPLHDTFFNP 338
Query: 1002 GDILEVMDVFDSTLRGLTNQLQQYQDRFITEQTIDFLFRVGSLSLEKVIVDLVGLT---- 1057
ILE + D LRGL +Q + D ++++ + LF G + +DL L
Sbjct: 339 SRILE--EGIDPLLRGLASQPAELLDNSLSDELRNRLF--GPRNFPGSGLDLAALNIQRG 394
Query: 1058 ----LGFFYLW--YCG-----------GFLLQ---VTRLSQVYRGVNDIDYYIGGLLESV 1097
L + + +CG + +L ++Y +DID ++GGL E
Sbjct: 395 RDHGLPPYNEYRRFCGLKPATSFEDLTDEIGDEELAEKLKELYGDPDDIDLWVGGLAEKP 454
Query: 1098 IRGTLSSPSFRCVFAEAFYRYKYGDRYFYEGPPATNPGAFTPGKQLTLTSLTSLASL 1154
+ G L P+F C+ AE F R + GDR++YE NPG FT +QL T+LA +
Sbjct: 455 VPGGLVGPTFACIIAEQFLRLRDGDRFWYE-----NPGVFTG-EQLEEIRKTTLARV 505
Score = 176 bits (449), Expect = 6e-47
Identities = 76/242 (31%), Positives = 108/242 (44%), Gaps = 55/242 (22%)
Query: 597 PKCCSDSGDRVRSTSGNCRPILIPSNDDFYSQFNRRCLEYKRSLAINCSVGALQPDKICP 656
CC+D+ C PI +P D ++S F R CL + RS P
Sbjct: 92 SPCCADN-------HPECFPIPVPPGDPYFSPFGR-CLPFFRSA----------PACGTG 133
Query: 657 GRESNPRPSAYKADVLATKLIRLKNFFFTDVETCTFQIVQSTHFIDVSFMYGSHPDKARE 716
+ PR QI Q T ++D+S +YGS ++A +
Sbjct: 134 PSCNLPRE----------------------------QINQLTSYLDLSQVYGSSEEEADK 165
Query: 717 LRSLSRGKLRV----REVNGRTFMPTGDPSNCVNASSNICYDAGDIRVNQQLDLAVSQTV 772
LR+ GKL+V NG+ +P P S C+ AGD RVN+ L T+
Sbjct: 166 LRTFKDGKLKVNGEFPPPNGKGLLPAPPPGPSGCLS---CFLAGDSRVNENPGLTALHTL 222
Query: 773 WLRFHNYVAEKLIQQNPSWSNRDELVYQEARRLVIGCFQHIVYNEWLPILIGDIRVNQQL 832
+LR HN +A++L NP WS DE ++QEAR +VI +Q I YNE+LP L+G +
Sbjct: 223 FLREHNRIADELKALNPHWS--DEKLFQEARLIVIAQYQKITYNEYLPALLGPDLMRANW 280
Query: 833 DL 834
L
Sbjct: 281 LL 282
Score = 141 bits (358), Expect = 4e-35
Identities = 59/156 (37%), Positives = 83/156 (53%), Gaps = 17/156 (10%)
Query: 253 FSQLVAHDITLNPQESTAVPTRGLRGEFGDDLAARDIQRGRDYGLRPYVDYRELCGLSTV 312
S + + + + DLAA +IQRGRD+GL PY +YR CGL
Sbjct: 365 LSDELRNRLFGPRNFPGSGL----------DLAALNIQRGRDHGLPPYNEYRRFCGLKPA 414
Query: 313 RSFGDLNDFMDSEQV-TRLSQVYRGVNDIDYYIGGLLESVIRGTLSSPSFRCVFAEAFYR 371
SF DL D + E++ +L ++Y +DID ++GGL E + G L P+F C+ AE F R
Sbjct: 415 TSFEDLTDEIGDEELAEKLKELYGDPDDIDLWVGGLAEKPVPGGLVGPTFACIIAEQFLR 474
Query: 372 YKYGDRYFYEGPPATNPGAFTPGKQLTLTSLTSLAS 407
+ GDR++YE NPG FT +QL T+LA
Sbjct: 475 LRDGDRFWYE-----NPGVFTG-EQLEEIRKTTLAR 504
Score = 102 bits (255), Expect = 3e-22
Identities = 41/92 (44%), Positives = 54/92 (58%), Gaps = 5/92 (5%)
Query: 179 YRTLDGTCNNLRRPWWGAVDTAHVRLMRPYYEDGVSSYRVSQVDASPLPNARVLSRVFLP 238
YR+ DG+CNNL+ P WGA T RL+ P YEDGVS+ R S V+ SPLP+ RV+S L
Sbjct: 1 YRSADGSCNNLQNPLWGAAGTPFTRLLPPVYEDGVSTPRGS-VNGSPLPSPRVVSNKLLA 59
Query: 239 DRNISDVH----TRMYLEFSQLVAHDITLNPQ 266
+N T M ++ Q + HD+T P
Sbjct: 60 RKNGFIPPDPGLTLMLAQWGQFIDHDLTFTPH 91
>gnl|CDD|235366 PRK05218, PRK05218, heat shock protein 90; Provisional.
Length = 613
Score = 226 bits (580), Expect = 5e-63
Identities = 78/151 (51%), Positives = 105/151 (69%), Gaps = 6/151 (3%)
Query: 1227 IIPNKEEKTLTIIDSGIGMTKADLVNNLGTIAKSGTKAFMEALQ--AGADISMIGQFGVG 1284
I +KE +TLTI D+GIGMT+ +++ NLGTIAKSGTK F+E L+ D +IGQFGVG
Sbjct: 66 ISFDKEARTLTISDNGIGMTREEVIENLGTIAKSGTKEFLEKLKGDQKKDSQLIGQFGVG 125
Query: 1285 FYSAYLVADKVTVTSKH--NDDEQYIWESSAGGSFTIKPDNSEPLGRGTKIVLHMKEDQA 1342
FYSA++VADKVTV ++ E WES G +TI+ E RGT+I LH+KED+
Sbjct: 126 FYSAFMVADKVTVITRSAGPAAEAVRWESDGEGEYTIEE--IEKEERGTEITLHLKEDED 183
Query: 1343 EYIEEKKIKEIVKKHSQFIGYPIKLLVEKER 1373
E+++E +I+ I+KK+S FI PIKL E+E
Sbjct: 184 EFLDEWRIRSIIKKYSDFIPVPIKLEKEEEE 214
Score = 97.5 bits (244), Expect = 1e-20
Identities = 37/103 (35%), Positives = 57/103 (55%), Gaps = 16/103 (15%)
Query: 435 TVTSKH--NDDEQYIWESSAGGSFTIKPDNSEPLGRGTKIVLHMKEDQAEYIEEKKIKEI 492
TV ++ E WES G +TI+ E RGT+I LH+KED+ E+++E +I+ I
Sbjct: 137 TVITRSAGPAAEAVRWESDGEGEYTIEE--IEKEERGTEITLHLKEDEDEFLDEWRIRSI 194
Query: 493 VKKHSQFIGYPIKLLVEKERE------------KELSEDEEEE 523
+KK+S FI PIKL E+E E++++E +E
Sbjct: 195 IKKYSDFIPVPIKLEKEEEETINSASALWTRSKSEITDEEYKE 237
Score = 38.2 bits (90), Expect = 0.031
Identities = 13/34 (38%), Positives = 22/34 (64%)
Query: 7 QKADADKNDKAVKDLVNLLFETSLLSSGFTLEDP 40
+K + ++ KDL LL++ +LL+ G +LEDP
Sbjct: 564 KKLADEADEAKFKDLAELLYDQALLAEGGSLEDP 597
>gnl|CDD|223403 COG0326, HtpG, Molecular chaperone, HSP90 family [Posttranslational
modification, protein turnover, chaperones].
Length = 623
Score = 224 bits (572), Expect = 6e-62
Identities = 84/158 (53%), Positives = 117/158 (74%), Gaps = 3/158 (1%)
Query: 1227 IIPNKEEKTLTIIDSGIGMTKADLVNNLGTIAKSGTKAFMEAL-QAGADISMIGQFGVGF 1285
I +K+ KTLTI D+GIGMTK +++ NLGTIAKSGTK F+E+L + D +IGQFGVGF
Sbjct: 67 ISFDKDNKTLTISDNGIGMTKDEVIENLGTIAKSGTKEFLESLSEDQKDSDLIGQFGVGF 126
Query: 1286 YSAYLVADKVTVTSKHND-DEQYIWESSAGGSFTIKPDNSEPLGRGTKIVLHMKEDQAEY 1344
YSA++VADKVTV ++ DE Y WES G +T++ + EP RGT+I LH+KE++ E+
Sbjct: 127 YSAFMVADKVTVITRSAGEDEAYHWESDGEGEYTVEDIDKEPR-RGTEITLHLKEEEDEF 185
Query: 1345 IEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEE 1382
+EE +++EIVKK+S I YPI + EKE+++E+ E E
Sbjct: 186 LEEWRLREIVKKYSDHIAYPIYIEGEKEKDEEVIEWET 223
Score = 104 bits (262), Expect = 9e-23
Identities = 42/107 (39%), Positives = 65/107 (60%), Gaps = 10/107 (9%)
Query: 435 TVTSKHND-DEQYIWESSAGGSFTIKPDNSEPLGRGTKIVLHMKEDQAEYIEEKKIKEIV 493
TV ++ DE Y WES G +T++ + EP RGT+I LH+KE++ E++EE +++EIV
Sbjct: 137 TVITRSAGEDEAYHWESDGEGEYTVEDIDKEPR-RGTEITLHLKEEEDEFLEEWRLREIV 195
Query: 494 KKHSQFIGYPIKLLVEKEREKELSEDEE--------EEKKEEEKEED 532
KK+S I YPI + EKE+++E+ E E K E +E+
Sbjct: 196 KKYSDHIAYPIYIEGEKEKDEEVIEWETINKAKALWTRNKSEITDEE 242
Score = 40.3 bits (95), Expect = 0.008
Identities = 14/34 (41%), Positives = 24/34 (70%)
Query: 7 QKADADKNDKAVKDLVNLLFETSLLSSGFTLEDP 40
+K + +++ +V DLV LL++ +LL+ G LEDP
Sbjct: 574 KKLASLEDEASVADLVELLYDQALLAEGGPLEDP 607
>gnl|CDD|185466 PTZ00130, PTZ00130, heat shock protein 90; Provisional.
Length = 814
Score = 180 bits (458), Expect = 2e-46
Identities = 90/156 (57%), Positives = 116/156 (74%), Gaps = 2/156 (1%)
Query: 1227 IIPNKEEKTLTIIDSGIGMTKADLVNNLGTIAKSGTKAFMEAL-QAGADISMIGQFGVGF 1285
I NKE+ L+I D+GIGMTK DL+NNLGTIAKSGT F+EA+ ++G D+S+IGQFGVGF
Sbjct: 128 ISANKEKNILSITDTGIGMTKEDLINNLGTIAKSGTSNFLEAISKSGGDMSLIGQFGVGF 187
Query: 1286 YSAYLVADKVTVTSKHNDDEQYIWESSAGGSFTIKPD-NSEPLGRGTKIVLHMKEDQAEY 1344
YSA+LVADKV V +K+N+DEQYIWES+A FTI D L RGT+I LH+KED
Sbjct: 188 YSAFLVADKVIVYTKNNNDEQYIWESTADAKFTIYKDPRGSTLKRGTRISLHLKEDATNL 247
Query: 1345 IEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSED 1380
+ +KK+ +++ K+SQFI YPI LL E +E+ D
Sbjct: 248 MNDKKLVDLISKYSQFIQYPIYLLHENVYTEEVLAD 283
Score = 82.0 bits (202), Expect = 1e-15
Identities = 42/89 (47%), Positives = 58/89 (65%), Gaps = 1/89 (1%)
Query: 436 VTSKHNDDEQYIWESSAGGSFTIKPD-NSEPLGRGTKIVLHMKEDQAEYIEEKKIKEIVK 494
V +K+N+DEQYIWES+A FTI D L RGT+I LH+KED + +KK+ +++
Sbjct: 199 VYTKNNNDEQYIWESTADAKFTIYKDPRGSTLKRGTRISLHLKEDATNLMNDKKLVDLIS 258
Query: 495 KHSQFIGYPIKLLVEKEREKELSEDEEEE 523
K+SQFI YPI LL E +E+ D +E
Sbjct: 259 KYSQFIQYPIYLLHENVYTEEVLADIAKE 287
>gnl|CDD|188647 cd05396, An_peroxidase_like, Animal heme peroxidases and related
proteins. A diverse family of enzymes, which includes
prostaglandin G/H synthase, thyroid peroxidase,
myeloperoxidase, linoleate diol synthase,
lactoperoxidase, peroxinectin, peroxidasin, and others.
Despite its name, this family is not restricted to
metazoans: members are found in fungi, plants, and
bacteria as well.
Length = 370
Score = 162 bits (413), Expect = 1e-43
Identities = 109/480 (22%), Positives = 162/480 (33%), Gaps = 141/480 (29%)
Query: 693 QIVQSTHFIDVSFMYGSHPDKARELRSLSRGKLRVREVNGRTFMPTGDPSNCVNASSNIC 752
Q+ T ++D S +YGS+PD AR LR+ G L+ N V S
Sbjct: 1 QLNARTPYLDGSSIYGSNPDVARALRTFKGGLLKT---------------NEVKGPSY-- 43
Query: 753 YDAGDIRVNQQLDLAVSQTVWLRFHNYVAEKLIQQNPSWSNRDELVYQEARRLVIGCFQH 812
T L F+N P CF
Sbjct: 44 -----------------GTELLPFNNPNPSMGTIGLPP----------------TRCF-- 68
Query: 813 IVYNEWLPILIGDIRVNQQLDLAVSQTVWLRFHNYVAEKLIQQNPSWSNRDELVYQEARR 872
+ GD RVN+ L L T++LR HN +A++L +++P W DE +YQEAR
Sbjct: 69 ---------IAGDPRVNENLLLLAVHTLFLREHNRLADRLKKEHPEWD--DERLYQEARL 117
Query: 873 LVIGCFQHIVYNEWLPILIGESYTKDVGLSPTSSGYFQGYDPEVNPSTLADFAAGAFRGE 932
+VI +Q I YNE+LP ++G+
Sbjct: 118 IVIAQYQLITYNEYLPAILGKFT------------------------------------- 140
Query: 933 SYTKDVGLSPTSSGYFQGYDPEVNPSTLADFAAGAFRGLHSLIPG--TIKLVNERRVTNQ 990
P DP+V P L++F A+R HSL+P N +
Sbjct: 141 --------DPRDDLVLLFPDPDVVPYVLSEFFTAAYRFGHSLVPEGVDRIDENGQPKEIP 192
Query: 991 ELPFSSTMQHPGDILEVMDVFDSTLRGLTNQLQQYQDRFITEQTIDFLFRVGSLSLEKVI 1050
++P + + D LRG Q I + D +F LE V
Sbjct: 193 DVPLKDFFFNTSRSILSDTGLDPLLRGFLRQPAG----LIDQNVDDVMF--LFGPLEGVG 246
Query: 1051 VDLVGLTL---------------GFFYLWYCGGFLLQVT-------RLSQVYRGVNDIDY 1088
+DL L + F L + +L+++Y +D+D
Sbjct: 247 LDLAALNIQRGRDLGLPSYNEVRRFIGL-KPPTSFQDILTDPELAKKLAELYGDPDDVDL 305
Query: 1089 YIGGLLESVIRGTLSSPSFRCVFAEAFYRYKYGDRYFYEGPPATNPGAFTPGKQLTLTSL 1148
++GGLLE + + E F R GDR++Y G + L SL
Sbjct: 306 WVGGLLEKKVPPARLGELLATIILEQFKRLVDGDRFYYVNYN--PFGKSGKEELEKLISL 363
Score = 99.4 bits (248), Expect = 5e-22
Identities = 47/172 (27%), Positives = 68/172 (39%), Gaps = 14/172 (8%)
Query: 233 SRVFLPDRNISDVHTRMYLEFSQLVAHDITLNPQESTAVPTRGLRGEFGD--DLAARDIQ 290
SR L D + + + Q ++ G DLAA +IQ
Sbjct: 204 SRSILSDTGLDPL---LRGFLRQPA-----GLIDQNVDDVMFLFGPLEGVGLDLAALNIQ 255
Query: 291 RGRDYGLRPYVDYRELCGLSTVRSFGDLNDFMDSEQVTRLSQVYRGVNDIDYYIGGLLES 350
RGRD GL Y + R GL SF D+ D E +L+++Y +D+D ++GGLLE
Sbjct: 256 RGRDLGLPSYNEVRRFIGLKPPTSFQDILT--DPELAKKLAELYGDPDDVDLWVGGLLEK 313
Query: 351 VIRGTLSSPSFRCVFAEAFYRYKYGDRYFYEGPPATNPGAFTPGKQLTLTSL 402
+ + E F R GDR++Y G + L SL
Sbjct: 314 KVPPARLGELLATIILEQFKRLVDGDRFYYVNYN--PFGKSGKEELEKLISL 363
>gnl|CDD|188658 cd09826, peroxidasin_like, Animal heme peroxidase domain of
peroxidasin and related proteins. Peroxidasin is a
secreted heme peroxidase which is involved in hydrogen
peroxide metabolism and peroxidative reactions in the
cardiovascular system. The domain co-occurs with
extracellular matrix domains and may play a role in the
formation of the extracellular matrix.
Length = 440
Score = 159 bits (404), Expect = 1e-41
Identities = 131/498 (26%), Positives = 194/498 (38%), Gaps = 159/498 (31%)
Query: 693 QIVQSTHFIDVSFMYGSHPDKARELRSLS--RGKLRV--REVNGRTFMP--TGDPSNCVN 746
QI Q T +ID S +YGS ++A ELR L+ RG LRV G+ +P P +C
Sbjct: 39 QINQLTSYIDASNVYGSSDEEALELRDLASDRGLLRVGIVSEAGKPLLPFERDSPMDCRR 98
Query: 747 ASSNICYDAGDIRVNQQLDLAVSQTVWLRFHNYVAEKLIQQNPSWSNRDELVYQEARRLV 806
+ ++P
Sbjct: 99 DPN-------------------------------------ESP----------------- 104
Query: 807 IGCFQHIVYNEWLPILIGDIRVNQQLDLAVSQTVWLRFHNYVAEKLIQQNPSWSNRDELV 866
I CF L GD R N+QL L T+WLR HN +A +L++ NP W E +
Sbjct: 105 IPCF-----------LAGDHRANEQLGLTSMHTLWLREHNRIASELLELNPHWD--GETI 151
Query: 867 YQEARRLVIGCFQHIVYNEWLPILIGESYTKDVGLSPTSSGYFQGYDPEVNPSTLADFAA 926
Y E R++V QHI Y+ WLP ++G G +
Sbjct: 152 YHETRKIVGAQMQHITYSHWLPKILGP----------------VGME------------- 182
Query: 927 GAFRGESYTKDVGLSPTSSGYFQGYDPEVNPSTLADFAAGAFRGLHSLIPGTIKLVNERR 986
G ++GY+P VNPS +FA AFR H+LI + ++E
Sbjct: 183 -----------------MLGEYRGYNPNVNPSIANEFATAAFRFGHTLINPILFRLDEDF 225
Query: 987 VTNQE--LPFSSTMQHPGDILEVMDVFDSTLRGL--TNQLQQYQDRFITEQTIDFLFRVG 1042
E LP P ++ + D LRGL T + D+ + + + LF +
Sbjct: 226 QPIPEGHLPLHKAFFAPYRLVNEGGI-DPLLRGLFATAAKDRVPDQLLNTELTEKLFEMA 284
Query: 1043 SLSLEKVIVDLVGLT--------LGFFYLW--YCGGFLLQ--------------VTRLSQ 1078
+V +DL L L + + +C + + +L +
Sbjct: 285 ----HEVALDLAALNIQRGRDHGLPGYNDYRKFCNLSVAETFEDLKNEIKNDDVREKLKR 340
Query: 1079 VYRGVNDIDYYIGGLLESVIRGTLSSPSFRCVFAEAFYRYKYGDRYFYEGPPATNPGAFT 1138
+Y +ID ++GG+LE ++ G P+ C+ AE F R + GDR++YE NPG F+
Sbjct: 341 LYGHPGNIDLFVGGILEDLLPGARVGPTLACLLAEQFRRLRDGDRFWYE-----NPGVFS 395
Query: 1139 PGKQLTLTSLTSLAS-LC 1155
P QLT TSLA LC
Sbjct: 396 P-AQLTQIKKTSLARVLC 412
Score = 117 bits (296), Expect = 1e-27
Identities = 55/126 (43%), Positives = 77/126 (61%), Gaps = 7/126 (5%)
Query: 283 DLAARDIQRGRDYGLRPYVDYRELCGLSTVRSFGDL-NDFMDSEQVTRLSQVYRGVNDID 341
DLAA +IQRGRD+GL Y DYR+ C LS +F DL N+ + + +L ++Y +ID
Sbjct: 290 DLAALNIQRGRDHGLPGYNDYRKFCNLSVAETFEDLKNEIKNDDVREKLKRLYGHPGNID 349
Query: 342 YYIGGLLESVIRGTLSSPSFRCVFAEAFYRYKYGDRYFYEGPPATNPGAFTPGKQLTLTS 401
++GG+LE ++ G P+ C+ AE F R + GDR++YE NPG F+P QLT
Sbjct: 350 LFVGGILEDLLPGARVGPTLACLLAEQFRRLRDGDRFWYE-----NPGVFSP-AQLTQIK 403
Query: 402 LTSLAS 407
TSLA
Sbjct: 404 KTSLAR 409
Score = 37.7 bits (88), Expect = 0.048
Identities = 14/45 (31%), Positives = 22/45 (48%)
Query: 540 EAQLDAIKKVFLSQIICIGSDRIVKMQLQSFYMPSERQFNVTLSD 584
AQL IKK L++++C D I ++Q F +P V+
Sbjct: 396 PAQLTQIKKTSLARVLCDNGDNITRVQEDVFLVPGNPHGYVSCES 440
>gnl|CDD|188654 cd09822, peroxinectin_like_bacterial, Uncharacterized family of heme
peroxidases, mostly bacterial. Animal heme peroxidases
are diverse family of enzymes which are not restricted to
animals. Members are also found in metazoans, fungi, and
plants, and also in bacteria - like most members of this
family of uncharacterized proteins.
Length = 420
Score = 146 bits (371), Expect = 1e-37
Identities = 95/343 (27%), Positives = 135/343 (39%), Gaps = 98/343 (28%)
Query: 822 LIGDIRVNQQLDLAVSQTVWLRFHNYVAEKLIQQNPSWSNRDELVYQEARRLVIGCFQHI 881
L GD+R N+ L T+++R HN +A++L ++NPS S DE +YQ AR +VI Q I
Sbjct: 114 LAGDVRANENPGLTALHTLFVREHNRLADELARRNPSLS--DEEIYQAARAIVIAEIQAI 171
Query: 882 VYNEWLPILIGESYTKDVGLSPTSSGYFQGYDPEVNPSTLADFAAGAFRGESYTKDVGLS 941
YNE+LP L+GE+ A A+
Sbjct: 172 TYNEFLPALLGEN------------------------------ALPAYS----------- 190
Query: 942 PTSSGYFQGYDPEVNPSTLADFAAGAFRGLHSLIPGTIKLVNERRVTNQELPFSSTMQHP 1001
GYD VNP +F+ A+R HS++ + +E L +P
Sbjct: 191 --------GYDETVNPGISNEFSTAAYRFGHSMLSSELLRGDEDGTEATSLALRDAFFNP 242
Query: 1002 GDILEVMDVFDSTLRGLTNQLQQYQDRFITEQTIDFLFRVGSLSLEKVIVDLVGLT---- 1057
++ E D LRGL +Q+ Q D FI + +FLF DL L
Sbjct: 243 DELEENG--IDPLLRGLASQVAQEIDTFIVDDVRNFLFGPPGAGG----FDLAALNIQRG 296
Query: 1058 --------------LGFFYLWYCGGFLLQVT-------------RLSQVYRGVNDIDYYI 1090
LG L VT RL+ VY V+ ID ++
Sbjct: 297 RDHGLPSYNQLREALG----------LPAVTSFSDITSDPDLAARLASVYGDVDQIDLWV 346
Query: 1091 GGLLESVIRGTLSSPSFRCVFAEAFYRYKYGDRYFYEGPPATN 1133
GGL E + G L +F + A+ F R + GDR+FYE
Sbjct: 347 GGLAEDHVNGGLVGETFSTIIADQFTRLRDGDRFFYENDDLLL 389
Score = 123 bits (310), Expect = 1e-29
Identities = 53/137 (38%), Positives = 77/137 (56%), Gaps = 7/137 (5%)
Query: 693 QIVQSTHFIDVSFMYGSHPDKARELRSLSRGKLRVREVNGRTFMPTGDPSNCVNASSNIC 752
QI T +ID S +YGS ++A LRS GKL+ N +P + N + +
Sbjct: 50 QINAITAYIDGSNVYGSDEERADALRSFGGGKLKTSVANAGDLLPFNEA-GLPNDNGGVP 108
Query: 753 YD----AGDIRVNQQLDLAVSQTVWLRFHNYVAEKLIQQNPSWSNRDELVYQEARRLVIG 808
D AGD+R N+ L T+++R HN +A++L ++NPS S DE +YQ AR +VI
Sbjct: 109 ADDLFLAGDVRANENPGLTALHTLFVREHNRLADELARRNPSLS--DEEIYQAARAIVIA 166
Query: 809 CFQHIVYNEWLPILIGD 825
Q I YNE+LP L+G+
Sbjct: 167 EIQAITYNEFLPALLGE 183
Score = 97.8 bits (244), Expect = 3e-21
Identities = 48/113 (42%), Positives = 62/113 (54%), Gaps = 2/113 (1%)
Query: 275 GLRGEFGDDLAARDIQRGRDYGLRPYVDYRELCGLSTVRSFGDLNDFMDSEQVTRLSQVY 334
G G G DLAA +IQRGRD+GL Y RE GL V SF D+ D + RL+ VY
Sbjct: 279 GPPGAGGFDLAALNIQRGRDHGLPSYNQLREALGLPAVTSFSDITS--DPDLAARLASVY 336
Query: 335 RGVNDIDYYIGGLLESVIRGTLSSPSFRCVFAEAFYRYKYGDRYFYEGPPATN 387
V+ ID ++GGL E + G L +F + A+ F R + GDR+FYE
Sbjct: 337 GDVDQIDLWVGGLAEDHVNGGLVGETFSTIIADQFTRLRDGDRFFYENDDLLL 389
>gnl|CDD|188657 cd09825, thyroid_peroxidase, Thyroid peroxidase (TPO). TPO is a
member of the animal heme peroxidase family, which is
expressed in the thyroid and involved in the processing
of iodine and iodine compounds. Specifically, TPO
oxidizes iodide via hydrogen peroxide to form active
iodine, which is then, for example, incorporated into the
tyrosine residues of thyroglobulin to yield mono- and
di-iodotyrosines.
Length = 565
Score = 140 bits (355), Expect = 2e-34
Identities = 138/584 (23%), Positives = 201/584 (34%), Gaps = 208/584 (35%)
Query: 614 CRPILIPSNDDFYSQFNRRCLEYKRSLAINCSVGALQPDKICPG-RESNPRPSAYKADVL 672
C PI +PS D R CL + RS A+ C G + +NPR
Sbjct: 103 CFPIQLPSEDPRILG--RACLPFFRSSAV-CGTGDTST--LFGNLSLANPRE-------- 149
Query: 673 ATKLIRLKNFFFTDVETCTFQIVQSTHFIDVSFMYGSHPDKARELRSLSR--GKLRVRE- 729
QI T FID S +YGS AR LR LS G LRV
Sbjct: 150 --------------------QINGLTSFIDASTVYGSTLALARSLRDLSSDDGLLRVNSK 189
Query: 730 --VNGRTFMPTGDPSNCVNASSNICYDAGDIRVNQQLDLAVSQTVWLRFHNYVAEKLIQQ 787
+GR ++P + E++
Sbjct: 190 FDDSGRDYLP-----------------------------------------FQPEEVSSC 208
Query: 788 NPSWSNRDELVYQEARRLVIGCFQHIVYNEWLPILIGDIRVNQQLDLAVSQTVWLRFHNY 847
NP + + + CF L GD R ++ L L S T+WLR HN
Sbjct: 209 NPDPNGGE----------RVPCF-----------LAGDGRASEVLTLTASHTLWLREHNR 247
Query: 848 VAEKLIQQNPSWSNRDELVYQEARRLVIGCFQHIVYNEWLPILIGESYTKDVGLSPTSSG 907
+A L NP W E +YQEAR++V Q I + +++P ++G P +
Sbjct: 248 LARALKSINPHWDG--EQIYQEARKIVGALHQIITFRDYIPKILG----------PEA-- 293
Query: 908 YFQGYDPEVNPSTLADFAAGAFRGESYTKDVGLSPTSSGYFQGYDPEVNPSTLADFAAGA 967
F Y GY++GYDP VNP+ F+ A
Sbjct: 294 -FDQY--------------------------------GGYYEGYDPTVNPTVSNVFSTAA 320
Query: 968 FRGLHSLIPGTIKLVNERRVTNQELPFSSTMQHPGDILEVMDVF------------DSTL 1015
FR H+ I T++ ++E F P L + D F D +
Sbjct: 321 FRFGHATIHPTVRRLDEN--------FQEHPVLPN--LALHDAFFSPWRLVREGGLDPVI 370
Query: 1016 RGLTNQLQ--QYQDRFITEQTIDFLF-RVGSLSLEKVIVDLVGLT--------LGFFYLW 1064
RGL D + E+ + LF S +L DL L L + W
Sbjct: 371 RGLIGGPAKLVTPDDLMNEELTEKLFVLSNSSTL-----DLASLNLQRGRDHGLPGYNDW 425
Query: 1065 --YCGGFLLQ--------------VTRLSQVYRGVNDIDYYIGGLLESVIRGTLSSPSFR 1108
+CG L ++ +Y+ ++ID ++GGL E + G + P F
Sbjct: 426 REFCGLPRLATPADLATAIADQAVADKILDLYKHPDNIDVWLGGLAEDFLPGARTGPLFA 485
Query: 1109 CVFAEAFYRYKYGDRYFYEGPPATNPGAFTPGKQLTLTSLTSLA 1152
C+ + + GDR+++E N FT Q SL+
Sbjct: 486 CLIGKQMKALRDGDRFWWE-----NSNVFTDA-QRRELRKHSLS 523
Score = 90.2 bits (224), Expect = 3e-18
Identities = 41/125 (32%), Positives = 67/125 (53%), Gaps = 7/125 (5%)
Query: 283 DLAARDIQRGRDYGLRPYVDYRELCGLSTVRSFGDLNDFMDSEQVT-RLSQVYRGVNDID 341
DLA+ ++QRGRD+GL Y D+RE CGL + + DL + + V ++ +Y+ ++ID
Sbjct: 405 DLASLNLQRGRDHGLPGYNDWREFCGLPRLATPADLATAIADQAVADKILDLYKHPDNID 464
Query: 342 YYIGGLLESVIRGTLSSPSFRCVFAEAFYRYKYGDRYFYEGPPATNPGAFTPGKQLTLTS 401
++GGL E + G + P F C+ + + GDR+++E N FT Q
Sbjct: 465 VWLGGLAEDFLPGARTGPLFACLIGKQMKALRDGDRFWWE-----NSNVFTDA-QRRELR 518
Query: 402 LTSLA 406
SL+
Sbjct: 519 KHSLS 523
Score = 30.5 bits (69), Expect = 8.4
Identities = 23/89 (25%), Positives = 37/89 (41%), Gaps = 8/89 (8%)
Query: 195 GAVDTAHVRLMRPYYEDGVSSYR----VSQVDASPLPNARVLSRVFLPDRNIS----DVH 246
GA +T R + P YEDG S + LP+ R +S + + + D++
Sbjct: 1 GASNTPLARWLPPIYEDGFSEPVGWNKERLYNGFTLPSVREVSNKIMRTSSTAVTPDDLY 60
Query: 247 TRMYLEFSQLVAHDITLNPQESTAVPTRG 275
+ M + Q + HDI PQ + G
Sbjct: 61 SHMLTVWGQYIDHDIDFTPQSVSRTMFIG 89
>gnl|CDD|188656 cd09824, myeloperoxidase_like, Myeloperoxidases, eosinophil
peroxidases, and lactoperoxidases. This well conserved
family of animal heme peroxidases contains members with
somewhat diverse functions. Myeloperoxidases are
lysosomal proteins found in azurophilic granules of
neutrophils and the lysosomes of monocytes. They are
involved in the formation of microbicidal agents upon
activation of activated neutrophils (neutrophils
undergoing respiratory bursts as a result of
phagocytosis), by catalyzing the conversion of hydrogen
peroxide to hypochlorous acid. As a heme protein,
myeloperoxidase is responsible for the greenish tint of
pus, which is rich in neutrophils. Eosinophil peroxidases
are haloperoxidases as well, preferring bromide over
chloride. Expressed by eosinophil granulocytes, they are
involved in attacking multicellular parasites and play
roles in various inflammatory diseases such as asthma.
The haloperoxidase lactoperoxidase is secreted from
mucosal glands and provides antibacterial activity by
oxidizing a variety of substrates such as bromide or
chloride in the presence of hydrogen peroxide.
Length = 411
Score = 120 bits (302), Expect = 1e-28
Identities = 118/495 (23%), Positives = 182/495 (36%), Gaps = 152/495 (30%)
Query: 693 QIVQSTHFIDVSFMYGSHPDKARELRSLS--RGKLRVRE---VNGRTFMPTGDPSNCVNA 747
QI T F+D S +YGS P A+ LR+L+ G L V + NG +P
Sbjct: 14 QINALTSFVDASMVYGSEPSLAKXLRNLTNQLGLLAVNQRFTDNGLALLP---------- 63
Query: 748 SSNICYDAGDIRVNQQLDLAVSQTVWLRFHNYVAEKLIQQNPSWSNRDELVYQEARRLVI 807
N D + N A I
Sbjct: 64 -----------FENLHNDPCALR-------NTSAN------------------------I 81
Query: 808 GCFQHIVYNEWLPILIGDIRVNQQLDLAVSQTVWLRFHNYVAEKLIQQNPSWSNRDELVY 867
CF L GD RV++ LA T+ LR HN +A +L + NP W E +Y
Sbjct: 82 PCF-----------LAGDTRVSENPGLAALHTLLLREHNRLARELHRLNPHWDG--ETLY 128
Query: 868 QEARRLVIGCFQHIVYNEWLPILIGESYTKDVGLSPTSSGYFQGYDPEVNPSTLADFAAG 927
QEAR++V Q I Y ++LP+++GE + GY+ V+P F
Sbjct: 129 QEARKIVGAMVQIITYRDYLPLILGEDAAARLPPYR-------GYNESVDPRIANVFTTA 181
Query: 928 AFRGESYTKDVGLSPTSSGYFQGYDP-EVNPSTLADFAAGAFRGLHSLIPGTIKLVNERR 986
RG + P + Y P NP LH + +++ E
Sbjct: 182 FRRGH-----TTVQPFVFRLDENYQPHPPNPQV----------PLHKAFFASWRIIRE-- 224
Query: 987 VTNQELPFSSTMQHPGDILEVMDVFDSTLRGL--TNQLQQYQDRFITEQTIDFLFRVGSL 1044
G I D LRGL T Q++ + ++ + LF+
Sbjct: 225 ---------------GGI-------DPILRGLMATPAKLNNQNQMLVDELRERLFQ---- 258
Query: 1045 SLEKVIVDLVGL--------TLGFFYLW--YCGGFLLQ-VTRLSQV-------------Y 1080
+++ +DL L L + W +CG Q + L+ V Y
Sbjct: 259 QTKRMGLDLAALNLQRGRDHGLPGYNAWRRFCGLSQPQNLAELAAVLNNTVLARKLLDLY 318
Query: 1081 RGVNDIDYYIGGLLESVIRGTLSSPSFRCVFAEAFYRYKYGDRYFYEGPPATNPGAFTPG 1140
++ID +IGG+ E ++ G P C+ + F R + GDR+++E NPG FT
Sbjct: 319 GTPDNIDIWIGGVAEPLVPGGRVGPLLACLISRQFRRIRDGDRFWWE-----NPGVFTEE 373
Query: 1141 KQLTLTSLTSLASLC 1155
++ +L S++ +C
Sbjct: 374 QRESLRSVSLSRIIC 388
Score = 80.9 bits (200), Expect = 1e-15
Identities = 44/126 (34%), Positives = 73/126 (57%), Gaps = 6/126 (4%)
Query: 281 GDDLAARDIQRGRDYGLRPYVDYRELCGLSTVRSFGDLNDFMDSEQVTR-LSQVYRGVND 339
G DLAA ++QRGRD+GL Y +R CGLS ++ +L +++ + R L +Y ++
Sbjct: 264 GLDLAALNLQRGRDHGLPGYNAWRRFCGLSQPQNLAELAAVLNNTVLARKLLDLYGTPDN 323
Query: 340 IDYYIGGLLESVIRGTLSSPSFRCVFAEAFYRYKYGDRYFYEGPPATNPGAFTPGKQLTL 399
ID +IGG+ E ++ G P C+ + F R + GDR+++E NPG FT ++ +L
Sbjct: 324 IDIWIGGVAEPLVPGGRVGPLLACLISRQFRRIRDGDRFWWE-----NPGVFTEEQRESL 378
Query: 400 TSLTSL 405
S++
Sbjct: 379 RSVSLS 384
>gnl|CDD|188652 cd09820, dual_peroxidase_like, Dual oxidase and related animal heme
peroxidases. Animal heme peroxidases of the dual-oxidase
like subfamily play vital roles in the innate mucosal
immunity of gut epithelia. They provide reactive oxygen
species which help control infection.
Length = 558
Score = 113 bits (285), Expect = 1e-25
Identities = 88/355 (24%), Positives = 144/355 (40%), Gaps = 100/355 (28%)
Query: 824 GDIRVNQQLDLAVSQTVWLRFHNYVAEKLIQQNPSWSNRDELVYQEARRLVIGCFQHIVY 883
G+ R N+ L +W R+HNY+A+++ +++P WS DE ++QEAR+ VI +Q+IV+
Sbjct: 204 GNPRGNENPFLLTFGILWFRYHNYLAQRIAREHPDWS--DEDIFQEARKWVIATYQNIVF 261
Query: 884 NEWLPILIGESYTKDVGLSPTSSGYFQGYDPEVNPSTLADFAAGAFRGESYTKDVGLSPT 943
EWLP L+G + YT
Sbjct: 262 YEWLPALLGTNVPP------------------------------------YT-------- 277
Query: 944 SSGYFQGYDPEVNPSTLADFAAGAFRGLHSLIPGTIKLVNERRVTNQELPFSSTMQH--- 1000
GY P V+P +F A AFR H+L+P + N + +E+ +S
Sbjct: 278 ------GYKPHVDPGISHEFQAAAFRFGHTLVPPGVYRRNRQC-NFREVLTTSGGSPALR 330
Query: 1001 -------PGDILEVMDVFDSTLRGLTNQLQQYQDRFITEQTIDFLFRVGSLSLEKVIVDL 1053
+ L D+ D L G+ +Q+ + +D I E D+LF G L + DL
Sbjct: 331 LCNTYWNSQEPLLKSDI-DELLLGMASQIAEREDNIIVEDLRDYLF--GPLEFSR--RDL 385
Query: 1054 VGLT-----------------------LGFFYLWYCGGFLLQ---VTRLSQVYRG-VNDI 1086
+ L + F + RL+++Y ++ +
Sbjct: 386 MALNIQRGRDHGLPDYNTAREAFGLPPRTTWSDINPDLFKKDPELLERLAELYGNDLSKL 445
Query: 1087 DYYIGGLLESVIRGTLSSPSFRCVFAEAFYRYKYGDRYFYEGPPATNPGAFTPGK 1141
D Y+GG+LES +G FR + + F R + GDR+++E G FT +
Sbjct: 446 DLYVGGMLES--KGGGPGELFRAIILDQFQRLRDGDRFWFEN---VKNGLFTAEE 495
Score = 105 bits (264), Expect = 3e-23
Identities = 49/142 (34%), Positives = 75/142 (52%), Gaps = 12/142 (8%)
Query: 693 QIVQSTHFIDVSFMYGSHPDKARELRSLSRGKLR-------VREVNGRTFM--PTGDPSN 743
Q+ + T +ID S +YGS + LRS S G+L R R + P PS
Sbjct: 133 QLNEVTSWIDGSSIYGSSKAWSDALRSFSGGRLASGDDGGFPRRNTNRLPLANPP-PPSY 191
Query: 744 CVNASSNICYDAGDIRVNQQLDLAVSQTVWLRFHNYVAEKLIQQNPSWSNRDELVYQEAR 803
+ G+ R N+ L +W R+HNY+A+++ +++P WS DE ++QEAR
Sbjct: 192 HGTRGPERLFKLGNPRGNENPFLLTFGILWFRYHNYLAQRIAREHPDWS--DEDIFQEAR 249
Query: 804 RLVIGCFQHIVYNEWLPILIGD 825
+ VI +Q+IV+ EWLP L+G
Sbjct: 250 KWVIATYQNIVFYEWLPALLGT 271
Score = 78.1 bits (193), Expect = 1e-14
Identities = 42/117 (35%), Positives = 64/117 (54%), Gaps = 9/117 (7%)
Query: 283 DLAARDIQRGRDYGLRPYVDYRELCGLSTVRSFGDLND---FMDSEQVTRLSQVYRG-VN 338
DL A +IQRGRD+GL Y RE GL ++ D+N D E + RL+++Y ++
Sbjct: 384 DLMALNIQRGRDHGLPDYNTAREAFGLPPRTTWSDINPDLFKKDPELLERLAELYGNDLS 443
Query: 339 DIDYYIGGLLESVIRGTLSSPSFRCVFAEAFYRYKYGDRYFYEGPPATNPGAFTPGK 395
+D Y+GG+LES +G FR + + F R + GDR+++E G FT +
Sbjct: 444 KLDLYVGGMLES--KGGGPGELFRAIILDQFQRLRDGDRFWFEN---VKNGLFTAEE 495
Score = 56.5 bits (137), Expect = 7e-08
Identities = 29/85 (34%), Positives = 37/85 (43%), Gaps = 14/85 (16%)
Query: 183 DGTCNNLRRPWWGAVDTAHVRLMRPYYEDGVSSYRVSQVDASPLPNARVLSRVF------ 236
DG NNL P WGA D+ R + +Y DGV Y S + PN R LS +
Sbjct: 2 DGWYNNLAHPEWGAADSRLTRRLPAHYSDGV--YAPSGEE---RPNPRSLSNLLMKGESG 56
Query: 237 LPDRNISDVHTRMYLEFSQLVAHDI 261
LP T + + F Q V +I
Sbjct: 57 LPSTRNR---TALLVFFGQHVVSEI 78
>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein.
Length = 529
Score = 97.1 bits (242), Expect = 1e-20
Identities = 38/62 (61%), Positives = 49/62 (79%), Gaps = 2/62 (3%)
Query: 482 EYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEE--EKEEDKTPKLED 539
EY+EE++IKE+VKKHS+FI YPI L VEKE EKE+ ++EEEE+KEE E+EE T K E+
Sbjct: 1 EYLEERRIKELVKKHSEFINYPIYLWVEKEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEE 60
Query: 540 EA 541
Sbjct: 61 VD 62
Score = 79.0 bits (195), Expect = 6e-15
Identities = 28/41 (68%), Positives = 36/41 (87%)
Query: 1343 EYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEE 1383
EY+EE++IKE+VKKHS+FI YPI L VEKE EKE+ ++EEE
Sbjct: 1 EYLEERRIKELVKKHSEFINYPIYLWVEKEVEKEVPDEEEE 41
Score = 73.3 bits (180), Expect = 4e-13
Identities = 31/64 (48%), Positives = 43/64 (67%), Gaps = 9/64 (14%)
Query: 1 IVETLRQKADADKNDKAVKDLVNLLFETSLLSSGFTLEDPQ---------VELTLSGRPD 51
I++ LR++ +ADKNDK VKDL LL+ET+LL+SGF+LEDP+ ++L LS D
Sbjct: 439 IIKELRKRVEADKNDKTVKDLARLLYETALLTSGFSLEDPKAFASRIYRMIKLGLSIDED 498
Query: 52 RSPE 55
E
Sbjct: 499 EEVE 502
>gnl|CDD|237603 PRK14083, PRK14083, HSP90 family protein; Provisional.
Length = 601
Score = 89.6 bits (223), Expect = 4e-18
Identities = 43/147 (29%), Positives = 79/147 (53%), Gaps = 4/147 (2%)
Query: 1228 IPNKEEKTLTIIDSGIGMTKADLVNNLGTIAKSGTKAFMEALQAGADISMIGQFGVGFYS 1287
+ + TL + D+GIG+T+ ++ L TI +S + E L + +GQFG+G S
Sbjct: 57 LTDAGGGTLIVEDNGIGLTEEEVHEFLATIGRSSKRD--ENLGFARN-DFLGQFGIGLLS 113
Query: 1288 AYLVADKVTVTSKHNDDEQYI-WESSAGGSFTIKPDNSEPLGRGTKIVLHMKEDQAEYIE 1346
+LVAD++ V S+ D + W A G+++++ +E GT + L + D E++E
Sbjct: 114 CFLVADEIVVVSRSAKDGPAVEWRGKADGTYSVRKLETERAEPGTTVYLRPRPDAEEWLE 173
Query: 1347 EKKIKEIVKKHSQFIGYPIKLLVEKER 1373
+ ++E+ KK+ + PI++ EK
Sbjct: 174 RETVEELAKKYGSLLPVPIRVEGEKGG 200
Score = 42.6 bits (101), Expect = 0.002
Identities = 17/65 (26%), Positives = 35/65 (53%)
Query: 448 WESSAGGSFTIKPDNSEPLGRGTKIVLHMKEDQAEYIEEKKIKEIVKKHSQFIGYPIKLL 507
W A G+++++ +E GT + L + D E++E + ++E+ KK+ + PI++
Sbjct: 136 WRGKADGTYSVRKLETERAEPGTTVYLRPRPDAEEWLERETVEELAKKYGSLLPVPIRVE 195
Query: 508 VEKER 512
EK
Sbjct: 196 GEKGG 200
>gnl|CDD|188648 cd09816, prostaglandin_endoperoxide_synthase, Animal prostaglandin
endoperoxide synthase and related bacterial proteins.
Animal prostaglandin endoperoxide synthases, including
prostaglandin H2 synthase and a set of similar bacterial
proteins which may function as cyclooxygenases.
Prostaglandin H2 synthase catalyzes the synthesis of
prostaglandin H2 from arachidonic acid. In two reaction
steps, arachidonic acid is converted to Prostaglandin
G2, a peroxide (cyclooxygenase activity) and
subsequently converted to the end product via the
enzyme's peroxidase activity. Prostaglandin H2 synthase
is the target of aspirin and other non-steroid
anti-inflammatory drugs such as ibuprofen, which block
the substrate's access to the active site and may
acetylate a conserved serine residue. In humans and
other mammals, prostaglandin H2 synthase (PGHS), also
called cyclooxygenase (COX) is present as at least two
isozymes, PGHS-1 (or COX-1) and PGHS-2 (or COX-2),
respectively. PGHS-1 is expressed constitutively in most
mammalian cells, while the expression of PGHS-2 is
induced via inflammation response in endothelial cells,
activated macrophages, and others. COX-3 is a splice
variant of COX-1.
Length = 490
Score = 70.0 bits (172), Expect = 4e-12
Identities = 39/138 (28%), Positives = 60/138 (43%), Gaps = 18/138 (13%)
Query: 697 STHFIDVSFMYGSHPDKARELRSLSRGKLRVREVNGRTFMPT-------------GDPSN 743
S H ID+S +YG + LR GKL+ + +NG + P P
Sbjct: 127 SNHGIDLSQIYGLTEARTHALRLFKDGKLKSQMINGEEYPPYLFEDGGVKMEFPPLVPPL 186
Query: 744 CVNASSNI---CYDAGDIRVNQQLDLAVSQTVWLRFHNYVAEKLIQQNPSWSNRDELVYQ 800
+ + G R N L + T+WLR HN V + L +++P W DE ++Q
Sbjct: 187 GDELTPEREAKLFAVGHERFNLTPGLFMLNTIWLREHNRVCDILKKEHPDWD--DERLFQ 244
Query: 801 EARRLVIGCFQHIVYNEW 818
AR ++IG IV ++
Sbjct: 245 TARNILIGELIKIVIEDY 262
Score = 51.5 bits (124), Expect = 3e-06
Identities = 25/77 (32%), Positives = 38/77 (49%), Gaps = 4/77 (5%)
Query: 275 GLR--GEFGDDLAARDIQRGRDYGLRPYVDYRELCGLSTVRSFGDLNDFMDSEQVTRLSQ 332
GLR F + R I++GR L + DYR+ GL SF +L D E L +
Sbjct: 349 GLRNTPPFLLPVEVRSIEQGRKLRLASFNDYRKRFGLPPYTSFEELTG--DPEVAAELEE 406
Query: 333 VYRGVNDIDYYIGGLLE 349
+Y V+ +++Y+G E
Sbjct: 407 LYGDVDAVEFYVGLFAE 423
Score = 50.3 bits (121), Expect = 5e-06
Identities = 23/64 (35%), Positives = 36/64 (56%), Gaps = 2/64 (3%)
Query: 823 IGDIRVNQQLDLAVSQTVWLRFHNYVAEKLIQQNPSWSNRDELVYQEARRLVIGCFQHIV 882
+G R N L + T+WLR HN V + L +++P W DE ++Q AR ++IG IV
Sbjct: 201 VGHERFNLTPGLFMLNTIWLREHNRVCDILKKEHPDWD--DERLFQTARNILIGELIKIV 258
Query: 883 YNEW 886
++
Sbjct: 259 IEDY 262
>gnl|CDD|188650 cd09818, PIOX_like, Animal heme oxidases similar to plant
pathogen-inducible oxygenases. This is a diverse family
of oxygenases related to the animal heme peroxidases,
with members from plants, animals, and bacteria. The
plant pathogen-inducible oxygenases (PIOX) oxygenate
fatty acids into 2R-hydroperoxides. They may be involved
in the hypersensitive reaction, rapid and localized cell
death induced by infection with pathogens, and the
rapidly induced expression of PIOX may be caused by the
oxidative burst that occurs in the process of cell
death.
Length = 484
Score = 65.0 bits (159), Expect = 2e-10
Identities = 38/162 (23%), Positives = 70/162 (43%), Gaps = 19/162 (11%)
Query: 698 THFIDVSFMYGSHPDKARELRSLSR-GKLRVREVNGRTFMPTGDPSNCVNASSNICYDAG 756
TH+ D S +YGS + + LR+ GKL++ +P D + + G
Sbjct: 91 THWWDGSQIYGSTEEAQKRLRTFPPDGKLKLDA---DGLLPV-DEHTGLPLT-------G 139
Query: 757 DIRVNQQLDLAVSQTVWLRFHNYVAEKLIQQNPSWSNRDELVYQEARRLVIGCFQHIVYN 816
N + L++ T+++R HN + + L ++ P WS DE ++ +AR + I
Sbjct: 140 FND-NWWVGLSLLHTLFVREHNAICDALRKEYPDWS--DEQLFDKARLVNAALMAKIHTV 196
Query: 817 EWLPILIGDIRVNQQLDLAVSQTVWLRFHNYVAEKLIQQNPS 858
EW P ++ + L++A+ W + L + S
Sbjct: 197 EWTPAILA----HPTLEIAMRANWWGLLGERLKRVLGRDGTS 234
Score = 57.3 bits (139), Expect = 3e-08
Identities = 38/99 (38%), Positives = 49/99 (49%), Gaps = 5/99 (5%)
Query: 283 DLAARDIQRGRDYGLRPYVDYRELCGLSTVRSFGDLNDFMDSEQVTRLSQVYRG-VNDID 341
DLAA DI R R+ G+ Y ++R L L +SF DL D E L +VY G V +D
Sbjct: 341 DLAAIDILRDRERGVPRYNEFRRLLHLPPAKSFEDLTG--DEEVAAELREVYGGDVEKVD 398
Query: 342 YYIGGLLESVIRG-TLSSPSFRCVFAEAFYRYKYGDRYF 379
+G L E + G S +FR A R K DR+F
Sbjct: 399 LLVGLLAEPLPPGFGFSDTAFRIFILMASRRLK-SDRFF 436
Score = 40.3 bits (95), Expect = 0.007
Identities = 22/82 (26%), Positives = 38/82 (46%), Gaps = 10/82 (12%)
Query: 818 WLP-------ILIGDIRVNQQLDLAVSQTVWLRFHNYVAEKLIQQNPSWSNRDELVYQEA 870
LP L G N + L++ T+++R HN + + L ++ P WS DE ++ +A
Sbjct: 126 LLPVDEHTGLPLTGFND-NWWVGLSLLHTLFVREHNAICDALRKEYPDWS--DEQLFDKA 182
Query: 871 RRLVIGCFQHIVYNEWLPILIG 892
R + I EW P ++
Sbjct: 183 RLVNAALMAKIHTVEWTPAILA 204
Score = 32.3 bits (74), Expect = 1.9
Identities = 19/58 (32%), Positives = 25/58 (43%), Gaps = 6/58 (10%)
Query: 180 RTLDGTCNNLRRPWWGAVDTAHVRLMRPYYEDGVSSYRVSQVDASPLPNARVLSRVFL 237
RT DG+ N+L P G+V T R + + D PN RV+SR L
Sbjct: 1 RTADGSYNDLDNPSMGSVGTRFGRNVPL------DATFPEDKDELLTPNPRVISRRLL 52
>gnl|CDD|188653 cd09821, An_peroxidase_bacterial_2, Uncharacterized bacterial
family of heme peroxidases. Animal heme peroxidases are
diverse family of enzymes which are not restricted to
metazoans; members are also found in fungi, and plants,
and in bacteria - like this family of uncharacterized
proteins.
Length = 570
Score = 60.1 bits (146), Expect = 6e-09
Identities = 49/178 (27%), Positives = 74/178 (41%), Gaps = 41/178 (23%)
Query: 824 GDIRVNQQLDLAVSQTVWLRFHNYVAEKLIQQ----------------NPSWSNRDELVY 867
GD RVN+ + L TV+ R HN + +++ N +W E ++
Sbjct: 181 GDGRVNENIGLTAVHTVFHREHNRLVDQIKDTLLQSADLAFANEAGGNNLAWDG--ERLF 238
Query: 868 QEARRLVIGCFQHIVYNEWLPILIGESYTKDVGLSPTSSGYFQGYDPEVNPSTLADFAAG 927
Q AR +QH+V+ E+ + G+ G F GY+PE+NPS A+FA
Sbjct: 239 QAARFANEMQYQHLVFEEFARRIQP-------GIDG--FGSFNGYNPEINPSISAEFAHA 289
Query: 928 AFR-GES-YTKDVGL--------SPTSSGYFQGY--DPEVNPSTL-ADFAAGAF-RGL 971
+R G S T+ V G + P+TL A+ AGA RG+
Sbjct: 290 VYRFGHSMLTETVTRIGPDADEGLDNQVGLIDAFLNPVAFLPATLYAEEGAGAILRGM 347
Score = 42.8 bits (101), Expect = 0.001
Identities = 44/186 (23%), Positives = 64/186 (34%), Gaps = 59/186 (31%)
Query: 696 QSTHFIDVSFMYGSHPDKARELR-SLSRGKLRVREVNGRT---------FM--------- 736
+T F+D + YGSH LR G R + G T F+
Sbjct: 86 VTTPFVDQNQTYGSHASHQVFLREYDGDGVATGRLLEGATGGSARTGHAFLDDIAHNAAP 145
Query: 737 ----------PTGDPSNCVNASSNICYD---------AGDIRVNQQLDLAVSQTVWLRFH 777
PT DP S YD AGD RVN+ + L TV+ R H
Sbjct: 146 KGGLGSLRDNPTEDPPGPGAPGS---YDNELLDAHFVAGDGRVNENIGLTAVHTVFHREH 202
Query: 778 NYVAEKLIQQ----------------NPSWSNRDELVYQEARRLVIGCFQHIVYNEWLPI 821
N + +++ N +W E ++Q AR +QH+V+ E+
Sbjct: 203 NRLVDQIKDTLLQSADLAFANEAGGNNLAWDG--ERLFQAARFANEMQYQHLVFEEFARR 260
Query: 822 LIGDIR 827
+ I
Sbjct: 261 IQPGID 266
Score = 39.7 bits (93), Expect = 0.013
Identities = 17/46 (36%), Positives = 28/46 (60%), Gaps = 1/46 (2%)
Query: 336 GVNDIDYYIGGLLES-VIRGTLSSPSFRCVFAEAFYRYKYGDRYFY 380
G++++D ++GGL E V G + +F VF E R + GDR++Y
Sbjct: 482 GLDNVDLWVGGLAEKQVPFGGMLGSTFNFVFEEQMDRLQDGDRFYY 527
Score = 39.7 bits (93), Expect = 0.013
Identities = 17/46 (36%), Positives = 28/46 (60%), Gaps = 1/46 (2%)
Query: 1082 GVNDIDYYIGGLLES-VIRGTLSSPSFRCVFAEAFYRYKYGDRYFY 1126
G++++D ++GGL E V G + +F VF E R + GDR++Y
Sbjct: 482 GLDNVDLWVGGLAEKQVPFGGMLGSTFNFVFEEQMDRLQDGDRFYY 527
>gnl|CDD|188651 cd09819, An_peroxidase_bacterial_1, Uncharacterized bacterial
family of heme peroxidases. Animal heme peroxidases are
diverse family of enzymes which are not restricted to
metazoans; members are also found in fungi, and plants,
and in bacteria - like this family of uncharacterized
proteins.
Length = 465
Score = 57.7 bits (140), Expect = 2e-08
Identities = 30/109 (27%), Positives = 52/109 (47%), Gaps = 7/109 (6%)
Query: 822 LIGDIRVNQQLDLAVSQTVWLRFHNYVAEKLIQQNPSWSNRDELVYQEARRLVIGCFQHI 881
LIGD R ++ L +A +LRFHN V + L +++EARRLV +Q +
Sbjct: 145 LIGDPRNDENLIVAQLHLAFLRFHNAVVDALRAHGTPGD----ELFEEARRLVRWHYQWL 200
Query: 882 VYNEWLPILIGESYTKDVGLSPTSSGYFQGYDPEVNPSTLADFAAGAFR 930
V N++LP + DV + +++ + + +F+ A+R
Sbjct: 201 VLNDFLPRICDPDVVDDVLANG--RRFYRFFREGKPFMPV-EFSVAAYR 246
Score = 50.8 bits (122), Expect = 4e-06
Identities = 38/136 (27%), Positives = 54/136 (39%), Gaps = 33/136 (24%)
Query: 706 MYGSHPDKARELRSLS----RGKLRVREVNGRTFMPTGDPSNCVNASSNICYDAGDIRVN 761
+YG PD + L + KLRV GR P G + A D+ N
Sbjct: 92 VYGGGPDGSPYLYDQATPNDGAKLRV----GRE-SPGGPGGLPGD-------GARDLPRN 139
Query: 762 QQLD-----------LAVSQT--VWLRFHNYVAEKLIQQNPSWSNRDELVYQEARRLVIG 808
Q L V+Q +LRFHN V + L +++EARRLV
Sbjct: 140 GQGTALIGDPRNDENLIVAQLHLAFLRFHNAVVDALRAHGTPGD----ELFEEARRLVRW 195
Query: 809 CFQHIVYNEWLPILIG 824
+Q +V N++LP +
Sbjct: 196 HYQWLVLNDFLPRICD 211
>gnl|CDD|217081 pfam02518, HATPase_c, Histidine kinase-, DNA gyrase B-, and
HSP90-like ATPase. This family represents the
structurally related ATPase domains of histidine kinase,
DNA gyrase B and HSP90.
Length = 111
Score = 52.7 bits (127), Expect = 4e-08
Identities = 21/114 (18%), Positives = 37/114 (32%), Gaps = 33/114 (28%)
Query: 1227 IIPNKEEKTLTIIDSGIGMTKADLVNNLGTIAKSGTKAFMEALQAGADISMIGQFGVGFY 1286
+ + +T+ D+GIG+ DL K F + + +G G+G
Sbjct: 31 LERDGGRLRITVEDNGIGIPPEDL-----------PKIFEPFFRTDSSSRKVGGTGLGLS 79
Query: 1287 SAYLVADKVTVTSKHNDDEQYIWESSAGGSFTIKPDNSEPLGRGTKIVLHMKED 1340
+ + GG+ T+ SEP G GT + +
Sbjct: 80 IVRKLVELH------------------GGTITV---ESEP-GGGTTFTFTLPLE 111
>gnl|CDD|222246 pfam13589, HATPase_c_3, Histidine kinase-, DNA gyrase B-, and
HSP90-like ATPase. This family represents, additionally,
the structurally related ATPase domains of histidine
kinase, DNA gyrase B and HSP90.
Length = 134
Score = 48.8 bits (117), Expect = 1e-06
Identities = 24/82 (29%), Positives = 42/82 (51%), Gaps = 13/82 (15%)
Query: 1227 IIPNKEEKTLTIIDSGIGMTKADLVNNL--GTIAKSGTKAFMEALQAGADISMIGQFGVG 1284
I P++ E ++I D+G GM+ +L N L G +K G + D + +G+ G+G
Sbjct: 26 IDPDRGEDGISIEDNGGGMSYEELRNALKLGRSSKEGER----------DSTTLGRKGIG 75
Query: 1285 -FYSAYLVADKVTVTSKHNDDE 1305
++ + K+TVTSK +
Sbjct: 76 MKLASLSLGRKLTVTSKKEGES 97
>gnl|CDD|214643 smart00387, HATPase_c, Histidine kinase-like ATPases. Histidine
kinase-, DNA gyrase B-, phytochrome-like ATPases.
Length = 111
Score = 44.9 bits (107), Expect = 2e-05
Identities = 16/96 (16%), Positives = 29/96 (30%), Gaps = 17/96 (17%)
Query: 1227 IIPNKEEKTLTIIDSGIGMTKADLVNNLGTIAKSGTKAFMEALQAGADISMIGQFGVGFY 1286
+ + + +T+ D+G G+ DL K F + IG G+G
Sbjct: 31 LERDGDHVEITVEDNGPGIPPEDL-----------EKIFEPFFRTDKRSRKIGGTGLGLS 79
Query: 1287 SAYLVADKVTVTSKHNDDEQYIWESSAGGSFTIKPD 1322
+ + + G +FTI
Sbjct: 80 IVKKLVELHGGEISVESEPG------GGTTFTITLP 109
>gnl|CDD|238030 cd00075, HATPase_c, Histidine kinase-like ATPases; This family
includes several ATP-binding proteins for example:
histidine kinase, DNA gyrase B, topoisomerases, heat
shock protein HSP90, phytochrome-like ATPases and DNA
mismatch repair proteins.
Length = 103
Score = 39.6 bits (93), Expect = 0.001
Identities = 10/80 (12%), Positives = 25/80 (31%), Gaps = 16/80 (20%)
Query: 1227 IIPNKEEKTLTIIDSGIGMTKADLVNNLGTIAKSGTKAFMEALQAGADISMIGQFGVGFY 1286
+ + + + + D+G G+ + DL + G G+G
Sbjct: 27 VERDGDHLEIRVEDNGPGIPEEDLERIFERFSDGSRS------------RKGGGTGLGLS 74
Query: 1287 SAYLVAD----KVTVTSKHN 1302
+ + ++ V S+
Sbjct: 75 IVKKLVELHGGRIEVESEPG 94
>gnl|CDD|219978 pfam08701, GN3L_Grn1, GNL3L/Grn1 putative GTPase. Grn1 (yeast) and
GNL3L (human) are putative GTPases which are required
for growth and play a role in processing of nucleolar
pre-rRNA. This family contains a potential nuclear
localisation signal.
Length = 80
Score = 38.0 bits (89), Expect = 0.002
Identities = 18/62 (29%), Positives = 27/62 (43%), Gaps = 7/62 (11%)
Query: 485 EEKKIKEIVKKHSQFIG----YPIK--LLVEKEREKELSEDEEEEKKEEEKEEDKTPKLE 538
E KK K + G +P K +L E E +K E+E+E +KE K E + +
Sbjct: 20 EAKKNPTWKSKKKKDPGIPNSFPFKEEILEEIEEKKRKQEEEKERRKEARKAE-RAEARK 78
Query: 539 DE 540
Sbjct: 79 RG 80
>gnl|CDD|177921 PLN02283, PLN02283, alpha-dioxygenase.
Length = 633
Score = 40.9 bits (96), Expect = 0.005
Identities = 36/133 (27%), Positives = 61/133 (45%), Gaps = 23/133 (17%)
Query: 679 LKNFFF---TDVETCTFQIVQS-----THFIDVSFMYGSHPDKARELRSLSRGKLRVREV 730
LK+F F +V T + I T + D S +YGS+ R +R+ GKL++ E
Sbjct: 184 LKSFKFYKTKEVPTGSPDIKTGSLNIRTPWWDGSVIYGSNEKGLRRVRTFKDGKLKISED 243
Query: 731 NGRTFMPTGDPSNCVNASSNICYDAGDIRVNQQLDLAVSQTVWLRFHNYVAEKLIQQNPS 790
G P +GD+R N +++ Q ++++ HN V + L ++ P
Sbjct: 244 GLLLHDEDGIPI------------SGDVR-NSWAGVSLLQALFVKEHNAVCDALKEEYPD 290
Query: 791 WSNRDELVYQEAR 803
+ DE +Y+ AR
Sbjct: 291 FD--DEELYRHAR 301
Score = 37.8 bits (88), Expect = 0.043
Identities = 30/99 (30%), Positives = 49/99 (49%), Gaps = 5/99 (5%)
Query: 283 DLAARDIQRGRDYGLRPYVDYRELCGLSTVRSFGDLNDFMDSEQVTRLSQVYRG-VNDID 341
D+AA +I R R+ G+ Y ++R + + + DL D D E + L +VY V +D
Sbjct: 472 DMAALEIYRDRERGVARYNEFRRNLLMIPISKWEDLTD--DEEAIEVLREVYGDDVEKLD 529
Query: 342 YYIGGLLESVIRG-TLSSPSFRCVFAEAFYRYKYGDRYF 379
+G + E I+G +S +F A R + DR+F
Sbjct: 530 LLVGLMAEKKIKGFAISETAFFIFLLMASRRLE-ADRFF 567
>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
Provisional.
Length = 482
Score = 38.0 bits (89), Expect = 0.034
Identities = 23/73 (31%), Positives = 38/73 (52%), Gaps = 5/73 (6%)
Query: 470 TKIVLHMKEDQAEYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEK 529
TK + + E EKK +E K+ + K E+E EKE E+E+EE++EE +
Sbjct: 409 TKKIKKIVEK-----AEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKEEEKEEEEEEAE 463
Query: 530 EEDKTPKLEDEAQ 542
EE + + + + Q
Sbjct: 464 EEKEEEEEKKKKQ 476
Score = 35.7 bits (83), Expect = 0.20
Identities = 16/75 (21%), Positives = 35/75 (46%), Gaps = 10/75 (13%)
Query: 474 LHMKEDQAEYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEEDK 533
L + E++ E++ K IK +VEK +K E +E++KK ++ +
Sbjct: 392 LELTEEEIEFLTGSKKATKK----------IKKIVEKAEKKREEEKKEKKKKAFAGKKKE 441
Query: 534 TPKLEDEAQLDAIKK 548
+ E++ + + K+
Sbjct: 442 EEEEEEKEKKEEEKE 456
>gnl|CDD|234767 PRK00448, polC, DNA polymerase III PolC; Validated.
Length = 1437
Score = 38.3 bits (90), Expect = 0.039
Identities = 19/90 (21%), Positives = 48/90 (53%), Gaps = 2/90 (2%)
Query: 462 NSEPLGRGTKIVLHMKEDQA-EYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDE 520
+ G K+++ + + +++++K + +++K++ +F G+ I + + + + ++
Sbjct: 116 KQKVEVEGNKLIIKVNNEIERDHLKKKHLPKLIKQYEKF-GFGILKIDFEIDDSKEELEK 174
Query: 521 EEEKKEEEKEEDKTPKLEDEAQLDAIKKVF 550
E +KEEE E+ LE +L+A KK
Sbjct: 175 FEAQKEEEDEKLAKEALEAMKKLEAEKKKQ 204
>gnl|CDD|100110 cd05832, Ribosomal_L12p, Ribosomal protein L12p. This subfamily
includes archaeal L12p, the protein that is functionally
equivalent to L7/L12 in bacteria and the P1 and P2
proteins in eukaryotes. L12p is homologous to P1 and P2
but is not homologous to bacterial L7/L12. It is located
in the L12 stalk, with proteins L10, L11, and 23S rRNA.
L12p is the only protein in the ribosome to occur as
multimers, always appearing as sets of dimers. Recent
data indicate that most archaeal species contain six
copies of L12p (three homodimers), while eukaryotes have
four copies (two heterodimers), and bacteria may have
four or six copies (two or three homodimers), depending
on the species. The organization of proteins within the
stalk has been characterized primarily in bacteria,
where L7/L12 forms either two or three homodimers and
each homodimer binds to the extended C-terminal helix of
L10. L7/L12 is attached to the ribosome through L10 and
is the only ribosomal protein that does not directly
interact with rRNA. Archaeal L12p is believed to
function in a similar fashion. However, hybrid ribosomes
containing the large subunit from E. coli with an
archaeal stalk are able to bind archaeal and eukaryotic
elongation factors but not bacterial elongation factors.
In several mesophilic and thermophilic archaeal species,
the binding of 23S rRNA to protein L11 and to the
L10/L12p pentameric complex was found to be
temperature-dependent and cooperative.
Length = 106
Score = 34.8 bits (80), Expect = 0.050
Identities = 14/28 (50%), Positives = 19/28 (67%)
Query: 511 EREKELSEDEEEEKKEEEKEEDKTPKLE 538
E++ E EEE+KKEEEKEE++ L
Sbjct: 73 AAEEKAEEKEEEKKKEEEKEEEEEEALA 100
Score = 32.1 bits (73), Expect = 0.47
Identities = 11/24 (45%), Positives = 20/24 (83%)
Query: 518 EDEEEEKKEEEKEEDKTPKLEDEA 541
E++ EEK+EE+K+E++ + E+EA
Sbjct: 75 EEKAEEKEEEKKKEEEKEEEEEEA 98
Score = 31.7 bits (72), Expect = 0.65
Identities = 11/24 (45%), Positives = 19/24 (79%)
Query: 509 EKEREKELSEDEEEEKKEEEKEED 532
E++ E++ E ++EE+KEEE+EE
Sbjct: 75 EEKAEEKEEEKKKEEEKEEEEEEA 98
Score = 30.9 bits (70), Expect = 1.4
Identities = 16/51 (31%), Positives = 27/51 (52%), Gaps = 2/51 (3%)
Query: 484 IEEKKIKEIVKKHSQFI--GYPIKLLVEKEREKELSEDEEEEKKEEEKEED 532
+EE I E +KK + P EK ++EE++K+EE++EE+
Sbjct: 45 LEEVNIDEAIKKAAVAAAAAAPAAAAAAAAEEKAEEKEEEKKKEEEKEEEE 95
>gnl|CDD|224969 COG2058, RPP1A, Ribosomal protein L12E/L44/L45/RPP1/RPP2
[Translation, ribosomal structure and biogenesis].
Length = 109
Score = 34.7 bits (80), Expect = 0.069
Identities = 16/55 (29%), Positives = 21/55 (38%), Gaps = 6/55 (10%)
Query: 484 IEEKKIKEIVKK-HSQFIGYPI-----KLLVEKEREKELSEDEEEEKKEEEKEED 532
+E I E++K E E E EEEEK+EE +EE
Sbjct: 45 LEGVDIDEVIKNAAEAPAAAAAAGAAAAAAAGAEAAAEADEAEEEEKEEEAEEES 99
>gnl|CDD|221122 pfam11490, DNA_pol3_alph_N, DNA polymerase III polC-type
N-terminus. This is an N-terminal domain of DNA
polymerase III polC subunit A that is found only in
Firmicutes. DNA polymerase polC-type III enzyme
functions as the 'replicase' in low G + C Gram-positive
bacteria. Purine asymmetry is a characteristic of
organisms with a heterodimeric DNA polymerase III
alpha-subunit constituted by polC which probably plays a
direct role in the maintenance of strand-biased gene
distribution; since, among prokaryotic genomes, the
distribution of genes on the leading and lagging strands
of the replication fork is known to be biased. The
domain is associated with DNA_pol3_alpha pfam07733.
Length = 180
Score = 35.4 bits (82), Expect = 0.090
Identities = 28/109 (25%), Positives = 60/109 (55%), Gaps = 14/109 (12%)
Query: 435 TVTSKHNDDE------QYIWESSAGGSFTIKP--DNSEPLGRGTKIVLH-MKEDQAEYIE 485
V ++ +E +YI E + S ++K +P G K+++ E +A +++
Sbjct: 51 EVENQTTTEELLQDYWEYIVEELSKKSPSLKSLLKKQKPELEGNKLIIKVPNEIEANFLK 110
Query: 486 EKKIKEIVKKHSQFIGYP-IKLLVE---KEREKELSEDEEEEKKEEEKE 530
+KK+ ++++++ +F G+ +K+ VE E +E E+ EE+K+EEE +
Sbjct: 111 KKKLDKLLEEYIKF-GFGKLKIDVEVDEDESSEEEIEEFEEQKEEEEAK 158
>gnl|CDD|177433 PHA02608, 67, prohead core protein; Provisional.
Length = 80
Score = 33.2 bits (76), Expect = 0.11
Identities = 14/65 (21%), Positives = 33/65 (50%), Gaps = 11/65 (16%)
Query: 476 MKEDQAEYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEEDKTP 535
M+ IEE+K+ EI + +++E E ++ +DE+++ +++ ++D
Sbjct: 24 MEARTEALIEEEKV-EIARS----------VMIEGEEPEDDDDDEDDDDDDDKDDKDDDD 72
Query: 536 KLEDE 540
+DE
Sbjct: 73 DDDDE 77
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 36.5 bits (84), Expect = 0.12
Identities = 15/27 (55%), Positives = 23/27 (85%)
Query: 509 EKEREKELSEDEEEEKKEEEKEEDKTP 535
E+E E+E E+EEEE++EEE+EE++ P
Sbjct: 867 EEEEEEEEEEEEEEEEEEEEEEENEEP 893
Score = 35.7 bits (82), Expect = 0.23
Identities = 14/25 (56%), Positives = 22/25 (88%)
Query: 509 EKEREKELSEDEEEEKKEEEKEEDK 533
E+E E+E E+EEEE++EEE+EE++
Sbjct: 865 EEEEEEEEEEEEEEEEEEEEEEEEE 889
Score = 35.0 bits (80), Expect = 0.33
Identities = 16/32 (50%), Positives = 23/32 (71%)
Query: 509 EKEREKELSEDEEEEKKEEEKEEDKTPKLEDE 540
E+E E+E E+EEEE++EEE+EE+ L E
Sbjct: 866 EEEEEEEEEEEEEEEEEEEEEEEENEEPLSLE 897
Score = 35.0 bits (80), Expect = 0.38
Identities = 15/36 (41%), Positives = 26/36 (72%)
Query: 509 EKEREKELSEDEEEEKKEEEKEEDKTPKLEDEAQLD 544
+ E E+E E+EEEE++EEE+EE++ + E+ L+
Sbjct: 862 DSEEEEEEEEEEEEEEEEEEEEEEEEEENEEPLSLE 897
>gnl|CDD|206039 pfam13868, Trichoplein, Tumour suppressor, Mitostatin. Trichoplein
or mitostatin, was first defined as a meiosis-specific
nuclear structural protein. It has since been linked
with mitochondrial movement. It is associated with the
mitochondrial outer membrane, and over-expression leads
to reduction in mitochondrial motility whereas lack of
it enhances mitochondrial movement. The activity appears
to be mediated through binding the mitochondria to the
actin intermediate filaments (IFs).
Length = 349
Score = 36.0 bits (84), Expect = 0.12
Identities = 15/79 (18%), Positives = 38/79 (48%), Gaps = 7/79 (8%)
Query: 475 HMKEDQAEYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEEDKT 534
++++ E +K + +H + + + + EKE + +EE E+ E +EE+
Sbjct: 268 QAEDEELEQENAEKRRMKRLEHRREL---EQQIEEKEERRAAEREEELEEGERLREEEA- 323
Query: 535 PKLEDEAQLDAIKKVFLSQ 553
E +A+++ ++ L +
Sbjct: 324 ---ERQARIEEERQRLLKE 339
Score = 34.1 bits (79), Expect = 0.49
Identities = 18/57 (31%), Positives = 33/57 (57%), Gaps = 7/57 (12%)
Query: 475 HMKEDQAEYIEEKKI--KEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEK 529
+ +++ IEEKK E ++ + I ++E+ER K L+E+EE E+K +E+
Sbjct: 17 KVNKERDAQIEEKKRIKAEEKEEERR-----IDEMMEEERLKALAEEEERERKRKEE 68
Score = 32.6 bits (75), Expect = 1.3
Identities = 21/81 (25%), Positives = 34/81 (41%), Gaps = 11/81 (13%)
Query: 474 LHMKEDQAEYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEEDK 533
M E EE + + K+ Q KL E + E + +EE+KE E+EE+
Sbjct: 99 EQMDEIIERIQEEDEAEAQEKREKQ-----KKLREEIDEFNEERIERKEEEKEREREEEL 153
Query: 534 ------TPKLEDEAQLDAIKK 548
K E E + +A ++
Sbjct: 154 KILEYQREKAEREEEREAERR 174
Score = 32.2 bits (74), Expect = 1.8
Identities = 16/76 (21%), Positives = 40/76 (52%), Gaps = 9/76 (11%)
Query: 475 HMKEDQAEYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEE--D 532
+K E E+K KE ++ + + + E+E+ ++ EE E++ +E+E+ +
Sbjct: 51 RLKALAEEEERERKRKEERREGRAVL---QEQIEEREKRRQ----EEYEERLQEREQMDE 103
Query: 533 KTPKLEDEAQLDAIKK 548
++++E + +A +K
Sbjct: 104 IIERIQEEDEAEAQEK 119
Score = 31.0 bits (71), Expect = 4.3
Identities = 19/59 (32%), Positives = 33/59 (55%), Gaps = 8/59 (13%)
Query: 476 MKEDQAEYIEEK-KIKEIVKKHSQFIGYPIKLL--VEKEREKELSEDEEEEKKEEEKEE 531
++E+ E+ EE+ + KE K+ + + L +E +REK E+E E ++ E KEE
Sbjct: 126 LREEIDEFNEERIERKEEEKERER-----EEELKILEYQREKAEREEEREAERRERKEE 179
Score = 30.3 bits (69), Expect = 6.9
Identities = 17/93 (18%), Positives = 40/93 (43%), Gaps = 12/93 (12%)
Query: 471 KIVLHMKEDQ---AEYIEEKKIKEIVK-KHSQFIGYPIKLLVEKEREKELSED-----EE 521
KI+ + +E E E++ ++ K + + E E E+E ++ +
Sbjct: 154 KILEYQREKAEREEEREAERRERKEEKEREVARLR---AQQEEAEDEREELDELRADLYQ 210
Query: 522 EEKKEEEKEEDKTPKLEDEAQLDAIKKVFLSQI 554
EE + +E++++K + Q +++ QI
Sbjct: 211 EEYERKERQKEKEEAEKRRRQKQELQRAREEQI 243
Score = 30.3 bits (69), Expect = 7.1
Identities = 21/71 (29%), Positives = 36/71 (50%), Gaps = 15/71 (21%)
Query: 476 MKEDQAEYIEEKKIKEIVKKH--SQFIGYPIKLLVEKEREKELSED-----------EEE 522
+ED+AE +EK+ K+ + +F I+ E+E+E+E E+ E E
Sbjct: 109 QEEDEAE-AQEKREKQKKLREEIDEFNEERIER-KEEEKEREREEELKILEYQREKAERE 166
Query: 523 EKKEEEKEEDK 533
E++E E+ E K
Sbjct: 167 EEREAERRERK 177
>gnl|CDD|221244 pfam11818, DUF3340, C-terminal domain of tail specific protease
(DUF3340). This presumed domain is found at the
C-terminus of tail specific proteases. Its function is
unknown. This family is found in bacteria and
eukaryotes. This presumed domain is typically between 88
to 187 amino acids in length.
Length = 139
Score = 34.5 bits (80), Expect = 0.13
Identities = 24/94 (25%), Positives = 37/94 (39%), Gaps = 19/94 (20%)
Query: 478 EDQAEYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEE-----KEED 532
ED A EE+ K + L E ER+ E EDE++ K E E+
Sbjct: 59 EDIAWLKEERDDKTVS-------------LNEAERKAEREEDEKKRLKRENERRKALGEE 105
Query: 533 KTPKLEDEAQLDAIKKVFLSQIICIGSDRIVKMQ 566
L+D + +L + I D ++K+Q
Sbjct: 106 PLKSLDDLPKDKEEPDAYLDEAANILLD-LIKLQ 138
>gnl|CDD|218435 pfam05104, Rib_recp_KP_reg, Ribosome receptor lysine/proline rich
region. This highly conserved region is found towards
the C-terminus of the transmembrane domain. The function
is unclear.
Length = 151
Score = 34.6 bits (79), Expect = 0.14
Identities = 21/78 (26%), Positives = 39/78 (50%), Gaps = 8/78 (10%)
Query: 476 MKEDQAEYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEEDKTP 535
MKE E K+ KE K SQ +K++++++SE + + KK+EEK K P
Sbjct: 1 MKETSYEEALAKQRKESGKTQSQ--------KSDKKKKEKVSEKKGKSKKKEEKPNGKIP 52
Query: 536 KLEDEAQLDAIKKVFLSQ 553
+ E ++ ++ + +
Sbjct: 53 EHEPNQEVTEVEVIIEKE 70
>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
This is a family of fungal proteins of unknown function.
Length = 182
Score = 35.1 bits (81), Expect = 0.15
Identities = 21/97 (21%), Positives = 47/97 (48%), Gaps = 17/97 (17%)
Query: 474 LHMKEDQ-------AEYIEEKK--------IKEIVKKHSQFIGYPIKLLVEKEREKELSE 518
H+++ AEY E KK I+++ K++ + + K K+ +K+ +
Sbjct: 41 SHLQDRHFATPIYDAEYTEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKK-KKDKDK 99
Query: 519 DEEEEKKEEEKEEDKTPKLEDEAQ-LDAIKKVFLSQI 554
D++++KK+++ E+ + ED+ + L LS +
Sbjct: 100 DKKDDKKDDKSEKKDEKEAEDKLEDLTKSYSETLSTL 136
>gnl|CDD|234311 TIGR03685, L12P_arch, 50S ribosomal protein L12P. This model
represents the L12P protein of the large (50S) subunit
of the archaeal ribosome.
Length = 105
Score = 33.5 bits (77), Expect = 0.15
Identities = 12/24 (50%), Positives = 17/24 (70%)
Query: 509 EKEREKELSEDEEEEKKEEEKEED 532
E+E E+EEEE++EEE EE+
Sbjct: 72 AAAAEEEEEEEEEEEEEEEESEEE 95
Score = 32.3 bits (74), Expect = 0.40
Identities = 11/24 (45%), Positives = 16/24 (66%)
Query: 509 EKEREKELSEDEEEEKKEEEKEED 532
E+E E+EEEE++EE +EE
Sbjct: 73 AAAEEEEEEEEEEEEEEEESEEEA 96
Score = 31.2 bits (71), Expect = 1.0
Identities = 13/29 (44%), Positives = 19/29 (65%), Gaps = 4/29 (13%)
Query: 513 EKELSEDEEEEKKEEEKEEDKTPKLEDEA 541
E+EEEE++EEE+EE+ E+EA
Sbjct: 72 AAAAEEEEEEEEEEEEEEEES----EEEA 96
Score = 30.0 bits (68), Expect = 2.1
Identities = 10/25 (40%), Positives = 16/25 (64%)
Query: 518 EDEEEEKKEEEKEEDKTPKLEDEAQ 542
EEEE++EEE+EE++ E+
Sbjct: 74 AAEEEEEEEEEEEEEEEESEEEAMA 98
>gnl|CDD|220763 pfam10453, NUFIP1, Nuclear fragile X mental retardation-interacting
protein 1 (NUFIP1). Proteins in this family have been
implicated in the assembly of the large subunit of the
ribosome and in telomere maintenance. Some proteins in
this family contain a CCCH zinc finger. This family
contains a protein called human fragile X mental
retardation-interacting protein 1, which is known to
bind RNA and is phosphorylated upon DNA damage.
Length = 57
Score = 32.0 bits (73), Expect = 0.16
Identities = 21/62 (33%), Positives = 29/62 (46%), Gaps = 16/62 (25%)
Query: 468 RGTKIVLHMKEDQAEYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEE 527
GT I L E+ A++IEE+K K YP K +EK+++ EEE K
Sbjct: 12 PGTSIKLQTPEEIAKWIEERK-----KN------YPTKANIEKKKKA-----EEERIKRG 55
Query: 528 EK 529
E
Sbjct: 56 EL 57
Score = 31.2 bits (71), Expect = 0.34
Identities = 18/57 (31%), Positives = 27/57 (47%), Gaps = 12/57 (21%)
Query: 1329 RGTKIVLHMKEDQAEYIEEKKIKEIVKKHSQFIGYPIKLLVE-KEREKELSEDEEEV 1384
GT I L E+ A++IEE+K K YP K +E K++ +E E+
Sbjct: 12 PGTSIKLQTPEEIAKWIEERK-----KN------YPTKANIEKKKKAEEERIKRGEL 57
>gnl|CDD|188649 cd09817, linoleate_diol_synthase_like, Linoleate (8R)-dioxygenase
and related enzymes. These fungal enzymes, related to
animal heme peroxidases, catalyze the oxygenation of
linoleate and similar targets. Linoleate
(8R)-dioxygenase, also called linoleate:oxygen
7S,8S-oxidoreductase, generates
(9Z,12Z)-(7S,8S)-dihydroxyoctadeca-9,12-dienoate as a
product. Other members are 5,8-linoleate dioxygenase
(LDS, ppoA) and linoleate 10R-dioxygenase (ppoC),
involved in the biosynthesis of oxylipins.
Length = 550
Score = 35.8 bits (83), Expect = 0.17
Identities = 15/62 (24%), Positives = 30/62 (48%), Gaps = 2/62 (3%)
Query: 288 DIQRGRDYGLRPYVDYRELCGLSTVRSFGDLNDFMDSEQVTRLSQVYRGVNDIDYYIGGL 347
I + R++ + ++R+ GL +F D+N D E L +Y ++++ Y G +
Sbjct: 379 GILQAREWNVATLNEFRKFFGLKPYETFEDIN--SDPEVAEALELLYGHPDNVELYPGLV 436
Query: 348 LE 349
E
Sbjct: 437 AE 438
Score = 35.4 bits (82), Expect = 0.24
Identities = 37/160 (23%), Positives = 62/160 (38%), Gaps = 55/160 (34%)
Query: 677 IRLKNFFFTDVETCTFQIVQSTHFIDVSFMYGSHPDKARELRSLSRGKLRVREVNGRTF- 735
I + + F TD I ++ ++D+S +YGS+ ++ ++R++ GKL+ TF
Sbjct: 102 IIIHDIFRTDHRDMN--INNTSSYLDLSPLYGSNQEEQNKVRTMKDGKLKPD-----TFS 154
Query: 736 ------MPTGDPSNCVNASSNICYDAGDIRVNQQLDLAVSQTVWLRFHNYVAEKLIQQN- 788
P P C + ++ RFHNYV E+L Q N
Sbjct: 155 DKRLLGQP---PGVCALL---VMFN--------------------RFHNYVVEQLAQINE 188
Query: 789 --------------PSWSNRDELVYQEARRLVIGCFQHIV 814
DE ++Q AR + G + +IV
Sbjct: 189 GGRFTPPGDKLDSSAKEEKLDEDLFQTARLITCGLYINIV 228
Score = 33.8 bits (78), Expect = 0.73
Identities = 18/55 (32%), Positives = 24/55 (43%), Gaps = 15/55 (27%)
Query: 843 RFHNYVAEKLIQQN---------------PSWSNRDELVYQEARRLVIGCFQHIV 882
RFHNYV E+L Q N DE ++Q AR + G + +IV
Sbjct: 174 RFHNYVVEQLAQINEGGRFTPPGDKLDSSAKEEKLDEDLFQTARLITCGLYINIV 228
>gnl|CDD|212510 cd11724, TP_methylase_like, Uncharacterized subfamily of S-AdoMet
dependent tetrapyrrole methylases. TP-methylases use
S-AdoMet (S-adenosyl-L-methionine or SAM) in the
methylation of diverse substrates. Most members catalyze
various methylation steps in cobalamin (vitamin B12)
biosynthesis, other members like Diphthine synthase and
Ribosomal RNA small subunit methyltransferase I (RsmI)
act on other substrates. The function of this subfamily
is not known.
Length = 255
Score = 35.4 bits (82), Expect = 0.18
Identities = 28/83 (33%), Positives = 36/83 (43%), Gaps = 9/83 (10%)
Query: 472 IVLHMKEDQ---AEYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEE 528
I+ KE A+Y++ K + E K F Y K E EK E EEEKK +
Sbjct: 29 IIFCPKEIVKRFADYLKGKPVLEDPWKL--FWHYGGKPWENLEAEKSPREAFEEEKKRQR 86
Query: 529 KEEDKTPKLEDEAQLDAIKKVFL 551
+E K + EA L K V L
Sbjct: 87 EEFVKVIR---EA-LAQGKNVAL 105
>gnl|CDD|217443 pfam03234, CDC37_N, Cdc37 N terminal kinase binding. Cdc37 is a
molecular chaperone required for the activity of
numerous eukaryotic protein kinases. This domain
corresponds to the N terminal domain which binds
predominantly to protein kinases and is found N terminal
to the Hsp (Heat shocked protein) 90-binding domain
pfam08565. Expression of a construct consisting of only
the N-terminal domain of Saccharomyces pombe Cdc37
results in cellular viability. This indicates that
interactions with the cochaperone Hsp90 may not be
essential for Cdc37 function.
Length = 172
Score = 34.0 bits (78), Expect = 0.26
Identities = 21/108 (19%), Positives = 43/108 (39%), Gaps = 14/108 (12%)
Query: 434 MTVTSKHNDDEQYIWESSAGGSFTIKPDNSEPLGRGTKIVLHMKEDQAEYIEEKKIKEIV 493
++ + + D S +FT K + ++V + + + ++EK ++
Sbjct: 69 LSELKEESLDSSQAVMKSLNENFTDKENVEPEQPTYNEMVEDLFDQVKDEVDEKNGAALI 128
Query: 494 KKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEE--KEEDKTPKLED 539
++ ++K R+K E +E KK +E KEE K D
Sbjct: 129 EE------------LQKHRDKLKKEQKELLKKLDELEKEEKKKIWSVD 164
>gnl|CDD|222636 pfam14265, DUF4355, Domain of unknown function (DUF4355). This
family of proteins is found in bacteria and viruses.
Proteins in this family are typically between 180 and
214 amino acids in length.
Length = 125
Score = 33.0 bits (76), Expect = 0.28
Identities = 12/80 (15%), Positives = 33/80 (41%), Gaps = 14/80 (17%)
Query: 477 KEDQAEYIEE-------KKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEK 529
E++ + ++ K+ + KK + E E+ ++S +E+ E + E+
Sbjct: 2 PEEEKTFTDKEVDKAIAKEKAKWEKKQEEKKS-------EAEKLAKMSAEEKAEYELEKL 54
Query: 530 EEDKTPKLEDEAQLDAIKKV 549
E++ + A+ + +
Sbjct: 55 EKELEELEAELARRELKAEA 74
>gnl|CDD|221275 pfam11861, DUF3381, Domain of unknown function (DUF3381). This
domain is functionally uncharacterized. This domain is
found in eukaryotes. This presumed domain is typically
between 156 to 174 amino acids in length. This domain is
found associated with pfam07780, pfam01728.
Length = 154
Score = 33.4 bits (77), Expect = 0.30
Identities = 22/88 (25%), Positives = 35/88 (39%), Gaps = 19/88 (21%)
Query: 487 KKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEE------EKEEDKTPKLEDE 540
K K+++K + EKE E+E + EE +EE EKE K K E
Sbjct: 78 KDFKKLLKWRKKVRKLLGLDKKEKEEEEEEEVEVEELDEEEQIDELLEKELAKL-KREKR 136
Query: 541 AQLDAIKKVFLSQIICIGSDRIVKMQLQ 568
+ + +K I+K Q++
Sbjct: 137 RENERKQK------------EILKEQMK 152
>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein. TolA couples the inner
membrane complex of itself with TolQ and TolR to the
outer membrane complex of TolB and OprL (also called
Pal). Most of the length of the protein consists of
low-complexity sequence that may differ in both length
and composition from one species to another,
complicating efforts to discriminate TolA (the most
divergent gene in the tol-pal system) from paralogs such
as TonB. Selection of members of the seed alignment and
criteria for setting scoring cutoffs are based largely
conserved operon struction. //The Tol-Pal complex is
required for maintaining outer membrane integrity. Also
involved in transport (uptake) of colicins and
filamentous DNA, and implicated in pathogenesis.
Transport is energized by the proton motive force. TolA
is an inner membrane protein that interacts with
periplasmic TolB and with outer membrane porins ompC,
phoE and lamB [Transport and binding proteins, Other,
Cellular processes, Pathogenesis].
Length = 346
Score = 34.8 bits (80), Expect = 0.33
Identities = 13/40 (32%), Positives = 23/40 (57%)
Query: 509 EKEREKELSEDEEEEKKEEEKEEDKTPKLEDEAQLDAIKK 548
E+ER+K+L + EE +K+ E+ + +LE A + K
Sbjct: 66 EQERQKKLEQQAEEAEKQRAAEQARQKELEQRAAAEKAAK 105
Score = 31.7 bits (72), Expect = 2.6
Identities = 21/76 (27%), Positives = 30/76 (39%), Gaps = 16/76 (21%)
Query: 477 KEDQAEYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSED----EEEEKKEEEKEED 532
K+ QAE + K+ E K E E EK+ E+ EEE K + E
Sbjct: 117 KQKQAEEAKAKQAAEAKAK------------AEAEAEKKAKEEAKKQAEEEAKAKAAAEA 164
Query: 533 KTPKLEDEAQLDAIKK 548
K E + + +A K
Sbjct: 165 KKKAAEAKKKAEAEAK 180
>gnl|CDD|184940 PRK14977, PRK14977, bifunctional DNA-directed RNA polymerase A'/A''
subunit; Provisional.
Length = 1321
Score = 35.0 bits (80), Expect = 0.35
Identities = 29/125 (23%), Positives = 56/125 (44%), Gaps = 8/125 (6%)
Query: 459 KPDNSEPLGRGTKIVLHMKEDQAEYIEEKKIKEIVKKHSQ-FIGYPIKLLVEKEREKELS 517
K D+ E I ED+ + + +I+E+ K++++ F KLL + EL
Sbjct: 896 KLDHGEAFNLERIIEKQKIEDRGKGASKDEIEELAKEYTKTFNANLPKLLADAIHGAELK 955
Query: 518 EDEEEEKKEEEKEEDKTPKLEDEAQLDAIKKVFLSQIICIGSDRIVKMQLQSFYMPSERQ 577
EDE E E KE ++A+++ + + + I + +M L++F+ +
Sbjct: 956 EDELEAICAEGKE------GFEKAKVEPGQAIGIISAQSIA-EPGTQMTLRTFHAAGIKA 1008
Query: 578 FNVTL 582
+VT
Sbjct: 1009 MDVTH 1013
>gnl|CDD|237799 PRK14715, PRK14715, DNA polymerase II large subunit; Provisional.
Length = 1627
Score = 35.2 bits (81), Expect = 0.35
Identities = 17/57 (29%), Positives = 34/57 (59%), Gaps = 6/57 (10%)
Query: 504 IKLLVEKEREKELS-----EDEEEEKKEEEKEEDKTPKLEDEAQLDAIKKVFLSQII 555
+K L EK+ EK+ + EE +++ EE+E+ +L ++ ++A KK F+ ++I
Sbjct: 276 LKELKEKKEEKDEEKSEEVKTEEVDEEFEEEEKGFYYELYEKVNIEANKK-FIKEVI 331
>gnl|CDD|129130 TIGR00019, prfA, peptide chain release factor 1. This model
describes peptide chain release factor 1 (PrfA, RF-1),
and excludes the related peptide chain release factor 2
(PrfB, RF-2). RF-1 helps recognize and terminate
translation at UAA and UAG stop codons. The
mitochondrial release factors are prfA-like, although
not included above the trusted cutoff for this model.
RF-1 does not have a translational frameshift [Protein
synthesis, Translation factors].
Length = 360
Score = 34.3 bits (79), Expect = 0.41
Identities = 17/74 (22%), Positives = 37/74 (50%), Gaps = 13/74 (17%)
Query: 479 DQAEYI----EEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEE--- 531
DQ + E +++EIV + ++ + + + KE+ E+ + E +E KEE
Sbjct: 31 DQDKLRKLSKEYSQLEEIVDCYREYQ----QAQEDIKEAKEILEESDPEMREMAKEELEE 86
Query: 532 --DKTPKLEDEAQL 543
+K +LE++ ++
Sbjct: 87 LEEKIEELEEQLKV 100
>gnl|CDD|224581 COG1667, COG1667, Uncharacterized protein conserved in archaea
[Function unknown].
Length = 254
Score = 34.1 bits (78), Expect = 0.42
Identities = 22/89 (24%), Positives = 35/89 (39%), Gaps = 14/89 (15%)
Query: 473 VLHMKEDQAEYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEED 532
VL + + E E + + ++ Y I++ +E E+ L E + EE EE
Sbjct: 113 VLSVPITEEEVENEMEELKSSEREELLEKYGIRVPEIEEIERILDEAVGSDYFSEEIEEP 172
Query: 533 -----------KTPKLEDEAQLDAIKKVF 550
K P DE ++ I KVF
Sbjct: 173 SREELLKKLGIKEP---DENWVETILKVF 198
>gnl|CDD|225368 COG2811, NtpF, Archaeal/vacuolar-type H+-ATPase subunit H [Energy
production and conversion].
Length = 108
Score = 32.0 bits (73), Expect = 0.46
Identities = 11/40 (27%), Positives = 23/40 (57%), Gaps = 1/40 (2%)
Query: 509 EKEREKELSEDEEEEKKEEEKEEDKTPKLEDEAQLDAIKK 548
++E E+ + E EE ++ E+ E++ KL E L+ ++
Sbjct: 27 KEEAEQIIKEAREEAREIIEEAEEEAEKLAQE-ILEEARE 65
>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
Length = 434
Score = 34.4 bits (79), Expect = 0.48
Identities = 15/57 (26%), Positives = 32/57 (56%), Gaps = 2/57 (3%)
Query: 486 EKKIKEIVKKHSQFIGYPIKLLVEKER--EKELSEDEEEEKKEEEKEEDKTPKLEDE 540
KK+++ KK + + + E++L ++E E+K++E+ +EDK + +DE
Sbjct: 375 NKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDE 431
>gnl|CDD|107059 PHA01746, PHA01746, hypothetical protein.
Length = 131
Score = 32.6 bits (74), Expect = 0.49
Identities = 16/57 (28%), Positives = 33/57 (57%), Gaps = 3/57 (5%)
Query: 487 KKIKEIVKKHSQFIGYPIKLLVEKE---REKELSEDEEEEKKEEEKEEDKTPKLEDE 540
K I E ++K+S F+ +K +E+E +++EEK+E+ +E+ + +E+E
Sbjct: 74 KLIAEFLEKYSDFLNEYVKFTPRNNNAIQEEEDDMEQQEEKEEKGREKGRKKNVEEE 130
>gnl|CDD|237629 PRK14160, PRK14160, heat shock protein GrpE; Provisional.
Length = 211
Score = 33.6 bits (77), Expect = 0.50
Identities = 25/78 (32%), Positives = 39/78 (50%), Gaps = 7/78 (8%)
Query: 476 MKEDQAEYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEE---EEKKEEEKEED 532
K+ + E +EE KE K G L E+ ++E+ ED E E K EE K+E+
Sbjct: 5 CKDAKHENMEEDCCKENENKEED-KGKEEDLEFEEIEKEEIIEDSEESNEVKIEELKDEN 63
Query: 533 KTPKLEDEA---QLDAIK 547
K E++ +L+A+K
Sbjct: 64 NKLKEENKKLENELEALK 81
Score = 30.1 bits (68), Expect = 7.1
Identities = 10/40 (25%), Positives = 23/40 (57%)
Query: 509 EKEREKELSEDEEEEKKEEEKEEDKTPKLEDEAQLDAIKK 548
EKE + E+ EE+ +E + +++ E++ + + I+K
Sbjct: 2 EKECKDAKHENMEEDCCKENENKEEDKGKEEDLEFEEIEK 41
>gnl|CDD|235795 PRK06402, rpl12p, 50S ribosomal protein L12P; Reviewed.
Length = 106
Score = 31.8 bits (73), Expect = 0.51
Identities = 11/24 (45%), Positives = 16/24 (66%)
Query: 509 EKEREKELSEDEEEEKKEEEKEED 532
E++ E+EEEE+KEE +EE
Sbjct: 74 AAAAEEKKEEEEEEEEKEESEEEA 97
Score = 31.1 bits (71), Expect = 1.1
Identities = 12/24 (50%), Positives = 17/24 (70%)
Query: 518 EDEEEEKKEEEKEEDKTPKLEDEA 541
EEKKEEE+EE++ + E+EA
Sbjct: 74 AAAAEEKKEEEEEEEEKEESEEEA 97
Score = 31.1 bits (71), Expect = 1.2
Identities = 11/24 (45%), Positives = 16/24 (66%)
Query: 518 EDEEEEKKEEEKEEDKTPKLEDEA 541
EE+K+EEE+EE+K E+ A
Sbjct: 75 AAAEEKKEEEEEEEEKEESEEEAA 98
>gnl|CDD|220383 pfam09756, DDRGK, DDRGK domain. This is a family of proteins of
approximately 300 residues, found in plants and
vertebrates. They contain a highly conserved DDRGK
motif.
Length = 189
Score = 33.1 bits (76), Expect = 0.53
Identities = 15/58 (25%), Positives = 34/58 (58%)
Query: 476 MKEDQAEYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEEDK 533
+E + E EE+K E ++ + ++ EK++E+E ++ EE+ ++E++E +K
Sbjct: 20 QREAEEEEREERKKLEEKREGERKEEEELEEEREKKKEEEERKEREEQARKEQEEYEK 77
Score = 30.4 bits (69), Expect = 4.2
Identities = 14/40 (35%), Positives = 25/40 (62%)
Query: 509 EKEREKELSEDEEEEKKEEEKEEDKTPKLEDEAQLDAIKK 548
+ER+K + E E K+EEE EE++ K E+E + + ++
Sbjct: 28 REERKKLEEKREGERKEEEELEEEREKKKEEEERKEREEQ 67
>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510). This
family consists of several hypothetical bacterial
proteins of around 200 residues in length. The function
of this family is unknown.
Length = 214
Score = 33.6 bits (77), Expect = 0.56
Identities = 12/32 (37%), Positives = 21/32 (65%)
Query: 510 KEREKELSEDEEEEKKEEEKEEDKTPKLEDEA 541
KE EKE + E++E K + ++ED+ + E+E
Sbjct: 66 KEEEKEAANSEDKEDKGDAEKEDEESEEENEE 97
>gnl|CDD|233035 TIGR00585, mutl, DNA mismatch repair protein MutL. All proteins in
this family for which the functions are known are
involved in the process of generalized mismatch repair.
This family is based on the phylogenomic analysis of JA
Eisen (1999, Ph.D. Thesis, Stanford University) [DNA
metabolism, DNA replication, recombination, and repair].
Length = 312
Score = 33.8 bits (78), Expect = 0.64
Identities = 23/142 (16%), Positives = 52/142 (36%), Gaps = 45/142 (31%)
Query: 1238 IIDSGIGMTKADLV-----------NNLGTIAKSGTKAFM-EALQAGADISMIGQFGVGF 1285
+ D+G G+ K DL + + + T F EAL + + +S
Sbjct: 56 VSDNGSGIDKEDLPLACERHATSKIQSFEDLERIETLGFRGEALASISSVS--------- 106
Query: 1286 YSAYLVADKVTVTSKHNDDEQYIWESSAGGSFTIKPDNSEPLGRGTKIVLH--------- 1336
++T+T+K + + +++ G + P GT + +
Sbjct: 107 --------RLTITTKTSAADGLAYQALLEGGMIESIKPA-PRPVGTTVEVRDLFYNLPVR 157
Query: 1337 ---MKEDQAEYIEEKKIKEIVK 1355
+K + E+ +KI ++++
Sbjct: 158 RKFLKSPKKEF---RKILDVLQ 176
>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3. This
protein, which interacts with both microtubules and
TRAF3 (tumour necrosis factor receptor-associated factor
3), is conserved from worms to humans. The N-terminal
region is the microtubule binding domain and is
well-conserved; the C-terminal 100 residues, also
well-conserved, constitute the coiled-coil region which
binds to TRAF3. The central region of the protein is
rich in lysine and glutamic acid and carries KKE motifs
which may also be necessary for tubulin-binding, but
this region is the least well-conserved.
Length = 506
Score = 33.7 bits (77), Expect = 0.71
Identities = 16/34 (47%), Positives = 27/34 (79%)
Query: 509 EKEREKELSEDEEEEKKEEEKEEDKTPKLEDEAQ 542
E+E+EKE ++E+++KKE+ KEE K K ++EA+
Sbjct: 105 EEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAK 138
Score = 31.0 bits (70), Expect = 5.4
Identities = 17/69 (24%), Positives = 29/69 (42%), Gaps = 5/69 (7%)
Query: 485 EEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKK-----EEEKEEDKTPKLED 539
EEKK K+ K P + EK KE +++E++ + EEEK+ ++
Sbjct: 115 EEKKKKKEKPKEEPKDRKPKEEAKEKRPPKEKEKEKEKKVEEPRDREEEKKRERVRAKSR 174
Query: 540 EAQLDAIKK 548
+ K
Sbjct: 175 PKKPPKKKP 183
>gnl|CDD|204404 pfam10145, PhageMin_Tail, Phage-related minor tail protein. Members
of this family are found in putative phage tail tape
measure proteins.
Length = 201
Score = 33.0 bits (76), Expect = 0.77
Identities = 14/59 (23%), Positives = 23/59 (38%), Gaps = 3/59 (5%)
Query: 1244 GMTKADLVNNLGTIAKSGTKAFMEALQAGADI--SMIGQFGVGFYSAYLVADKVTVTSK 1300
G++ +L+ K T A L ADI + + FG+ A AD + +
Sbjct: 30 GLSGKELLGFTPAALKLAT-ATGADLAEAADIAAAAMNAFGLSADDAERAADVLAYAAN 87
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 34.0 bits (77), Expect = 0.84
Identities = 25/72 (34%), Positives = 37/72 (51%), Gaps = 5/72 (6%)
Query: 471 KIVLHMKEDQAEYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKE 530
K V +K+ +AE EKK E +KK + IK E ++ +E + EE KK EE E
Sbjct: 1633 KKVEQLKKKEAE---EKKKAEELKKAEE--ENKIKAAEEAKKAEEDKKKAEEAKKAEEDE 1687
Query: 531 EDKTPKLEDEAQ 542
+ L+ EA+
Sbjct: 1688 KKAAEALKKEAE 1699
Score = 32.4 bits (73), Expect = 2.0
Identities = 22/82 (26%), Positives = 42/82 (51%), Gaps = 10/82 (12%)
Query: 477 KEDQAEYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREK----ELSEDEEEEKK------E 526
K ++A+ EE +I+E++K + + + + E K EL + EEE+KK +
Sbjct: 1582 KAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKK 1641
Query: 527 EEKEEDKTPKLEDEAQLDAIKK 548
E +E+ K +L+ + + IK
Sbjct: 1642 EAEEKKKAEELKKAEEENKIKA 1663
Score = 31.6 bits (71), Expect = 3.9
Identities = 28/81 (34%), Positives = 41/81 (50%), Gaps = 13/81 (16%)
Query: 476 MKEDQAEYIEEKKIK-EIVKKHSQFIGYPIKLLVEKEREKELSEDE--EEEKKEEE---- 528
MK ++A+ EE KIK E +KK + K VE+ ++KE E + EE KK EE
Sbjct: 1607 MKAEEAKKAEEAKIKAEELKKAEE-----EKKKVEQLKKKEAEEKKKAEELKKAEEENKI 1661
Query: 529 -KEEDKTPKLEDEAQLDAIKK 548
E+ ED+ + + KK
Sbjct: 1662 KAAEEAKKAEEDKKKAEEAKK 1682
Score = 31.3 bits (70), Expect = 5.2
Identities = 17/72 (23%), Positives = 36/72 (50%)
Query: 477 KEDQAEYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEEDKTPK 536
++A+ EE++ E ++K + + +E + +E +K EE+K+ D+ K
Sbjct: 1238 DAEEAKKAEEERNNEEIRKFEEARMAHFARRQAAIKAEEARKADELKKAEEKKKADEAKK 1297
Query: 537 LEDEAQLDAIKK 548
E++ + D KK
Sbjct: 1298 AEEKKKADEAKK 1309
Score = 31.3 bits (70), Expect = 5.6
Identities = 22/64 (34%), Positives = 36/64 (56%), Gaps = 1/64 (1%)
Query: 485 EEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEEDKTPKLEDEAQLD 544
EEKK E +KK + + +KE E++ + EE +K EEEK++ K E+E + +
Sbjct: 1713 EEKKKAEELKKAEEENKIKAEEA-KKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAE 1771
Query: 545 AIKK 548
I+K
Sbjct: 1772 EIRK 1775
Score = 30.5 bits (68), Expect = 8.9
Identities = 16/64 (25%), Positives = 26/64 (40%), Gaps = 2/64 (3%)
Query: 485 EEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEEDKTPKLEDEAQLD 544
EE K + KK ++ +K E + + E K E E D+ E++A+
Sbjct: 1312 EEAKKADEAKKKAEEAKKKADAAKKKAEEAK--KAAEAAKAEAEAAADEAEAAEEKAEAA 1369
Query: 545 AIKK 548
KK
Sbjct: 1370 EKKK 1373
>gnl|CDD|148630 pfam07133, Merozoite_SPAM, Merozoite surface protein (SPAM). This
family consists of several Plasmodium falciparum SPAM
(secreted polymorphic antigen associated with
merozoites) proteins. Variation among SPAM alleles is
the result of deletions and amino acid substitutions in
non-repetitive sequences within and flanking the alanine
heptad-repeat domain. Heptad repeats in which the a and
d position contain hydrophobic residues generate
amphipathic alpha-helices which give rise to helical
bundles or coiled-coil structures in proteins. SPAM is
an example of a P. falciparum antigen in which a
repetitive sequence has features characteristic of a
well-defined structural element.
Length = 164
Score = 32.5 bits (74), Expect = 0.84
Identities = 21/95 (22%), Positives = 40/95 (42%), Gaps = 16/95 (16%)
Query: 448 WESSAGGSFTIKPDNSEPLGRGTKIVLHMKEDQAEYIEEKKIKEIVKKHSQFIGYPIKLL 507
WE G P+ + + +L + I ++I+K++
Sbjct: 2 WEFGGGA-----PEKKPEEEKKDENLL-----EHVKITSWDKEDIIKENEDVKDEK---- 47
Query: 508 VEKEREKELSEDEEEEKKEEEKEEDKTPKLEDEAQ 542
+E ++E E++EEE +E E ED+ +EDE +
Sbjct: 48 --QEDDEEEEEEDEEEIEEPEDIEDEEEIVEDEEE 80
Score = 31.8 bits (72), Expect = 1.3
Identities = 19/68 (27%), Positives = 33/68 (48%), Gaps = 13/68 (19%)
Query: 473 VLHMKEDQAEYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEED 532
V K++ E EE+ +EI + + E E+E+ EDEEEE+ E+E++
Sbjct: 43 VKDEKQEDDEEEEEEDEEEIEEPE------------DIEDEEEIVEDEEEEE-EDEEDNV 89
Query: 533 KTPKLEDE 540
+E +
Sbjct: 90 DLKDIEKK 97
>gnl|CDD|205206 pfam13025, DUF3886, Protein of unknown function (DUF3886). This
family of proteins is functionally uncharacterized. This
family of proteins is found in bacteria. Proteins in
this family are approximately 90 amino acids in length.
There are two completely conserved L residues that may
be functionally important.
Length = 70
Score = 30.4 bits (69), Expect = 0.84
Identities = 15/38 (39%), Positives = 23/38 (60%), Gaps = 3/38 (7%)
Query: 510 KEREKELSEDEEEEKKEEEKEEDKTPKLE---DEAQLD 544
K E++ E+EE K+EE KE +K E +E++LD
Sbjct: 30 KAEEEKREEEEEARKREERKEREKNKSFEELLNESELD 67
>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
Length = 880
Score = 33.5 bits (77), Expect = 0.87
Identities = 22/63 (34%), Positives = 31/63 (49%), Gaps = 1/63 (1%)
Query: 477 KEDQAEYIE-EKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEEDKTP 535
KE + E E ++I EI + + KL E + +EL E+ EE +KE E E
Sbjct: 196 KEKEKELEEVLREINEISSELPELREELEKLEKEVKELEELKEEIEELEKELESLEGSKR 255
Query: 536 KLE 538
KLE
Sbjct: 256 KLE 258
Score = 32.3 bits (74), Expect = 2.3
Identities = 18/73 (24%), Positives = 40/73 (54%), Gaps = 15/73 (20%)
Query: 486 EKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEEDKTPKLED------ 539
E+++KE+ +++++ +L K+ EKEL +E+E KK EE+ + +L +
Sbjct: 591 EERLKELEPFYNEYL----EL---KDAEKELEREEKELKKLEEELDKAFEELAETEKRLE 643
Query: 540 --EAQLDAIKKVF 550
+L+ ++K +
Sbjct: 644 ELRKELEELEKKY 656
Score = 32.0 bits (73), Expect = 2.9
Identities = 22/77 (28%), Positives = 35/77 (45%), Gaps = 7/77 (9%)
Query: 477 KEDQAEYIEEKKIKEIVKKHSQFIGYPIK------LLVEKEREKELSEDEEEEKKEEEKE 530
KE++ E ++ KK+KE+ K+ + E ER K+ EK E+E E
Sbjct: 336 KEERLEELK-KKLKELEKRLEELEERHELYEEAKAKKEELERLKKRLTGLTPEKLEKELE 394
Query: 531 EDKTPKLEDEAQLDAIK 547
E + K E E ++ I
Sbjct: 395 ELEKAKEEIEEEISKIT 411
Score = 31.6 bits (72), Expect = 4.4
Identities = 23/83 (27%), Positives = 41/83 (49%), Gaps = 14/83 (16%)
Query: 478 EDQAEYIEEKKIKEIVKKHSQFIGYPIKLLVEKER-EKELSEDEEE-----------EKK 525
E++ + ++E +KE +++ + + + L E EK LS EEE E+K
Sbjct: 279 EEKVKELKE--LKEKAEEYIKLSEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEK 336
Query: 526 EEEKEEDKTPKLEDEAQLDAIKK 548
EE EE K E E +L+ +++
Sbjct: 337 EERLEELKKKLKELEKRLEELEE 359
>gnl|CDD|237762 PRK14598, PRK14598, peptide deformylase; Provisional.
Length = 187
Score = 32.4 bits (74), Expect = 0.90
Identities = 18/45 (40%), Positives = 26/45 (57%), Gaps = 1/45 (2%)
Query: 3 ETLRQKADADKN-DKAVKDLVNLLFETSLLSSGFTLEDPQVELTL 46
E LRQKA K D +++L+ +FE+ +SG L PQV +L
Sbjct: 11 EVLRQKAKPLKGVDAEIEELIASMFESMYNASGIGLAAPQVGRSL 55
>gnl|CDD|235850 PRK06669, fliH, flagellar assembly protein H; Validated.
Length = 281
Score = 32.7 bits (75), Expect = 1.1
Identities = 26/107 (24%), Positives = 50/107 (46%), Gaps = 18/107 (16%)
Query: 471 KIVLHMKEDQAEYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKE 530
K++ I ++K+K + +F K+L KE+E+ E+EE+ ++ E+
Sbjct: 3 KVIFKRSN----VINKEKLKTHEIQKYRF-----KVLSIKEKERLREEEEEQVEQLREEA 53
Query: 531 EDKTPKLEDEAQLDAIKKV---------FLSQIICIGSDRIVKMQLQ 568
D+ ++ +EA+ DA + V L + S I K+Q+Q
Sbjct: 54 NDEAKEIIEEAEEDAFEIVEAAEEEAKEELLKKTDEASSIIEKLQMQ 100
Score = 31.1 bits (71), Expect = 3.7
Identities = 20/56 (35%), Positives = 27/56 (48%), Gaps = 7/56 (12%)
Query: 476 MKEDQAEYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEE 531
E++A+ KK E S I KL ++ ERE+E E+E E EE K E
Sbjct: 74 AAEEEAKEELLKKTDEA----SSII---EKLQMQIEREQEEWEEELERLIEEAKAE 122
>gnl|CDD|217450 pfam03247, Prothymosin, Prothymosin/parathymosin family.
Prothymosin alpha and parathymosin are two ubiquitous
small acidic nuclear proteins that are thought to be
involved in cell cycle progression, proliferation, and
cell differentiation.
Length = 106
Score = 31.1 bits (70), Expect = 1.1
Identities = 15/38 (39%), Positives = 23/38 (60%), Gaps = 5/38 (13%)
Query: 509 EKEREKELSEDEEEEKKEEEKEEDKTPKLEDEAQLDAI 546
E E E+E+ ED+EEE+ E E+EE E+E + +
Sbjct: 50 EMEEEEEVDEDDEEEEGEGEEEEG-----EEEEETEGA 82
Score = 28.8 bits (64), Expect = 7.4
Identities = 9/34 (26%), Positives = 21/34 (61%)
Query: 509 EKEREKELSEDEEEEKKEEEKEEDKTPKLEDEAQ 542
+E + E+ E+EE ++ +EE+E + + +E +
Sbjct: 44 AQEGDDEMEEEEEVDEDDEEEEGEGEEEEGEEEE 77
>gnl|CDD|111875 pfam03032, Brevenin, Brevenin/esculentin/gaegurin/rugosin family.
This family contains a number of defence peptides
secreted from the skin of amphibians, including the
opiate-like dermorphins and deltorphins, and the
antimicrobial dermoseptins and temporins. The alignment
for this family includes the signal peptide.
Length = 46
Score = 29.2 bits (66), Expect = 1.1
Identities = 10/27 (37%), Positives = 19/27 (70%)
Query: 507 LVEKEREKELSEDEEEEKKEEEKEEDK 533
L E+E+ ++ E+E+EE+ EE+ E +
Sbjct: 20 LCEEEKREDEEENEDEEEGEEQSEVKR 46
>gnl|CDD|222896 PHA02592, 52, DNA topisomerase II medium subunit; Provisional.
Length = 439
Score = 32.7 bits (75), Expect = 1.3
Identities = 24/80 (30%), Positives = 38/80 (47%), Gaps = 3/80 (3%)
Query: 470 TKIVLHMKEDQAEYIEEKKIKEIVKKH-SQFIGYPIKLLVEKEREKELSEDEEEEKKEEE 528
+IVL K +E EE + ++ + + I + EREK L ++ EE +KE E
Sbjct: 362 GEIVLQGKTR-SELTEELSDNKDFGEYADKLVAMNIYSMTSDEREK-LQKEAEELEKEHE 419
Query: 529 KEEDKTPKLEDEAQLDAIKK 548
+ T K E L+ +KK
Sbjct: 420 YWKKTTAKKEYIKDLEELKK 439
>gnl|CDD|217840 pfam04006, Mpp10, Mpp10 protein. This family includes proteins
related to Mpp10 (M phase phosphoprotein 10). The U3
small nucleolar ribonucleoprotein (snoRNP) is required
for three cleavage events that generate the mature 18S
rRNA from the pre-rRNA. In Saccharomyces cerevisiae,
depletion of Mpp10, a U3 snoRNP-specific protein, halts
18S rRNA production and impairs cleavage at the three U3
snoRNP-dependent sites.
Length = 613
Score = 33.0 bits (75), Expect = 1.4
Identities = 24/130 (18%), Positives = 40/130 (30%), Gaps = 22/130 (16%)
Query: 434 MTVTSKHNDDEQY----IWESSAGGSFTIKPDNSEP----------LGRGTKIVLHMKED 479
+D E Y + F D + K D
Sbjct: 212 EAALGDEDDFEDYFQDDSEDGKDDEDFGSGEDEEDDEEGNIEYEDFFDPKEKDKKKDAGD 271
Query: 480 QAEYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEEDKTPKLED 539
AE +++ KE VKK + +E ++E E E+++ +EE E D
Sbjct: 272 DAELEDDEPDKEAVKKEAD--------SKPEEEDEEDDEQEDDQDEEEPPEAAMDKVKLD 323
Query: 540 EAQLDAIKKV 549
E L+ +
Sbjct: 324 EPVLEGVDLE 333
>gnl|CDD|237576 PRK13979, PRK13979, DNA topoisomerase IV subunit A; Provisional.
Length = 957
Score = 32.8 bits (75), Expect = 1.5
Identities = 31/128 (24%), Positives = 53/128 (41%), Gaps = 26/128 (20%)
Query: 1227 IIPNKEEKTLTIIDSGIGMT-KADLVNNLGTIAKSGTKAFMEALQAGADISMIGQFGV-- 1283
I ++E+ L I + G+ + K++ VN +G IA SG IS+ + V
Sbjct: 853 IKEHEEKNILLITEKGMAIRFKSEAVNPMGKIA-SGVTG----------ISLKDEDKVIY 901
Query: 1284 GFYSAYLVADKVTVTSKHNDDEQYIWESSAGGSFTIKPDNSEPLGRGTKIVLHMKEDQAE 1343
G K+ + SK D + + IK N GRG ++ + D
Sbjct: 902 GSIIDENDTRKIKLKSKAKDKGEIEIKD-------IKLQNRA--GRGNSLMPLLLND--- 949
Query: 1344 YIEEKKIK 1351
+I++ +IK
Sbjct: 950 FIKDVEIK 957
>gnl|CDD|215009 smart01069, CDC37_C, Cdc37 C terminal domain. Cdc37 is a protein
required for the activity of numerous eukaryotic protein
kinases. This domains corresponds to the C terminal
domain whose function is unclear. It is found C terminal
to the Hsp90 chaperone (Heat shocked protein 90) binding
domain pfam08565 and the N terminal kinase binding
domain of Cdc37.
Length = 93
Score = 30.4 bits (69), Expect = 1.5
Identities = 20/60 (33%), Positives = 33/60 (55%), Gaps = 9/60 (15%)
Query: 476 MKEDQAEYIEEKKIKEIVKKHSQFIGYPIKLLVE---KEREKELSEDEEEEKKEEEKEED 532
M ++AEY E+ I+ S G P + E KE +++ +EE EK++EE+E+D
Sbjct: 40 MPVEEAEYHLERCIE------SGLWGVPNAIEDETEFKELQEQYEVEEEAEKEDEEEEDD 93
>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis). This nucleolar
family of proteins are involved in 60S ribosomal
biogenesis. They are specifically involved in the
processing beyond the 27S stage of 25S rRNA maturation.
This family contains sequences that bear similarity to
the glioma tumour suppressor candidate region gene 2
protein (p60). This protein has been found to interact
with herpes simplex type 1 regulatory proteins.
Length = 387
Score = 32.4 bits (74), Expect = 1.6
Identities = 15/38 (39%), Positives = 20/38 (52%), Gaps = 2/38 (5%)
Query: 511 EREKELSEDEEEEKKEEEKEEDKTPKLEDEAQLDAIKK 548
+R KE E E + +EEK+ K KL A+L I K
Sbjct: 282 QRNKEKRRKELEREAKEEKQLKK--KLAQLARLKEIAK 317
Score = 31.2 bits (71), Expect = 3.7
Identities = 16/60 (26%), Positives = 29/60 (48%)
Query: 477 KEDQAEYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEEDKTPK 536
K + E +EEKK++++ + S+ LL E + + E D+E + E E + K
Sbjct: 212 KRQELERVEEKKLEKMAPEASRLDEMSEGLLEESDDDGEEESDDESAWEGFESEYEPINK 271
Score = 30.8 bits (70), Expect = 5.0
Identities = 10/59 (16%), Positives = 24/59 (40%)
Query: 484 IEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEEDKTPKLEDEAQ 542
E+K+ + + + + E + L E+ +++ +EE +E E E +
Sbjct: 209 AEKKRQELERVEEKKLEKMAPEASRLDEMSEGLLEESDDDGEEESDDESAWEGFESEYE 267
>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682). The
members of this family are all hypothetical eukaryotic
proteins of unknown function. One member is described as
being an adipocyte-specific protein, but no evidence of
this was found.
Length = 322
Score = 32.2 bits (74), Expect = 1.6
Identities = 17/58 (29%), Positives = 24/58 (41%), Gaps = 12/58 (20%)
Query: 476 MKEDQAEYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEEDK 533
+K + E EE + K+ KK E+ K EE++K EEKE K
Sbjct: 273 LKAAEEERQEEAQEKKEEKKK------------EEREAKLAKLSPEEQRKLEEKERKK 318
Score = 30.3 bits (69), Expect = 8.0
Identities = 13/39 (33%), Positives = 22/39 (56%), Gaps = 3/39 (7%)
Query: 508 VEKEREKELSED---EEEEKKEEEKEEDKTPKLEDEAQL 543
V+K RE+E + EEE++EE +E+ + K E+
Sbjct: 261 VDKTREEEEEKILKAAEEERQEEAQEKKEEKKKEEREAK 299
>gnl|CDD|226889 COG4487, COG4487, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 438
Score = 32.5 bits (74), Expect = 1.6
Identities = 16/71 (22%), Positives = 34/71 (47%), Gaps = 11/71 (15%)
Query: 484 IEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEEDKTPKLEDEAQL 543
+ EK++ E+ K+ + + E+++E +++EE K E EK+ E L
Sbjct: 118 LLEKELDELSKELQKQLQ---NTAEIIEKKRENNKNEERLKFENEKKL--------EESL 166
Query: 544 DAIKKVFLSQI 554
+ ++ F Q+
Sbjct: 167 ELEREKFEEQL 177
>gnl|CDD|237078 PRK12366, PRK12366, replication factor A; Reviewed.
Length = 637
Score = 32.7 bits (75), Expect = 1.7
Identities = 17/56 (30%), Positives = 31/56 (55%), Gaps = 8/56 (14%)
Query: 486 EKKIKEIVKKHSQFIG--YPIKLLVEKEREKELSEDEEEEKKEEEKEEDKTPKLED 539
EKKI E ++++ F+G + ++ +EL D E ++EEE E+ K +E+
Sbjct: 22 EKKINEKIEENGGFLGEEAALMMVA-----QELGIDIEYGQEEEE-EDFKISDIEE 71
>gnl|CDD|179537 PRK03095, prsA, peptidylprolyl isomerase; Reviewed.
Length = 287
Score = 32.3 bits (73), Expect = 1.8
Identities = 21/71 (29%), Positives = 39/71 (54%), Gaps = 2/71 (2%)
Query: 478 EDQAEYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEEDKTPKL 537
ED A +++ ++ E VK SQF + IK+ KE EK + + + KKE +++ + +
Sbjct: 192 EDAAYKLKKDEVSEPVK--SQFGYHIIKVTDIKEPEKSFEQSKADIKKELVQKKAQDGEF 249
Query: 538 EDEAQLDAIKK 548
++ + IKK
Sbjct: 250 MNDLMMKEIKK 260
>gnl|CDD|222571 pfam14153, Spore_coat_CotO, Spore coat protein CotO. Bacillus
spores are protected by a protein shell consisting of
over 50 different polypeptides, known as the coat. This
family of proteins has an important morphogenetic role
in coat assembly, it is involved in the assembly of at
least 5 different coat proteins including CotB, CotG,
CotS, CotSA and CotW. It is likely to act at a late
stage of coat assembly.
Length = 185
Score = 31.7 bits (72), Expect = 1.8
Identities = 28/103 (27%), Positives = 40/103 (38%), Gaps = 10/103 (9%)
Query: 456 FTIKPDNSEPLGRGTKIVLHMKEDQAEYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKE 515
+ ++P E K + K D+ E EEK+ + K + EKE+E
Sbjct: 14 YIVQPKLEEAKANMQKTFIIKKADEKE--EEKENSDEHVKSKEEEQKIEYEEAEKEKEAG 71
Query: 516 LSEDE--------EEEKKEEEKEEDKTPKLEDEAQLDAIKKVF 550
E E EE +EEEKEE+ + E KK F
Sbjct: 72 EPEREDIAEQQEKEEIAQEEEKEEEAEDVKQQEVFSFKRKKPF 114
Score = 31.3 bits (71), Expect = 2.0
Identities = 24/72 (33%), Positives = 38/72 (52%), Gaps = 2/72 (2%)
Query: 483 YIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEEDKTPKLEDEAQ 542
YI + K++E K + Q + IK EKE EKE S++ + K+EE+K E + + E EA
Sbjct: 14 YIVQPKLEE-AKANMQKT-FIIKKADEKEEEKENSDEHVKSKEEEQKIEYEEAEKEKEAG 71
Query: 543 LDAIKKVFLSQI 554
+ + Q
Sbjct: 72 EPEREDIAEQQE 83
>gnl|CDD|179712 PRK04019, rplP0, acidic ribosomal protein P0; Validated.
Length = 330
Score = 32.1 bits (74), Expect = 2.0
Identities = 10/20 (50%), Positives = 15/20 (75%)
Query: 513 EKELSEDEEEEKKEEEKEED 532
+E E+EEEE++EE EE+
Sbjct: 301 AEEEEEEEEEEEEEEPSEEE 320
Score = 30.6 bits (70), Expect = 5.4
Identities = 13/35 (37%), Positives = 21/35 (60%)
Query: 507 LVEKEREKELSEDEEEEKKEEEKEEDKTPKLEDEA 541
L E + + EEE++EEE+EE++ P E+ A
Sbjct: 288 LKEVLSAQAQAAAAEEEEEEEEEEEEEEPSEEEAA 322
>gnl|CDD|169428 PRK08404, PRK08404, V-type ATP synthase subunit H; Validated.
Length = 103
Score = 30.1 bits (68), Expect = 2.0
Identities = 24/77 (31%), Positives = 38/77 (49%), Gaps = 22/77 (28%)
Query: 471 KIVLHMKEDQAEYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKE 530
KI+ KE+ A+ IEE EI+KK +E ++L E +KK+E +E
Sbjct: 28 KIIRKAKEE-AKKIEE----EIIKK-------------AEEEAQKLIE----KKKKEGEE 65
Query: 531 EDKTPKLEDEAQLDAIK 547
E K E E +++ +K
Sbjct: 66 EAKKILEEGEKEIEELK 82
>gnl|CDD|216337 pfam01159, Ribosomal_L6e, Ribosomal protein L6e.
Length = 108
Score = 30.3 bits (69), Expect = 2.0
Identities = 20/47 (42%), Positives = 30/47 (63%), Gaps = 8/47 (17%)
Query: 509 EKEREKELSEDE----EEEKKE--EEKEEDKTPKLEDEAQLDAIKKV 549
EK+++K+ SE E ++EKKE E+++ D+ K D A L AIKK
Sbjct: 40 EKKKKKKKSEGEFFAEKKEKKEVSEQRKADQ--KAVDAALLAAIKKH 84
>gnl|CDD|219924 pfam08597, eIF3_subunit, Translation initiation factor eIF3
subunit. This is a family of proteins which are
subunits of the eukaryotic translation initiation factor
3 (eIF3). In yeast it is called Hcr1. The Saccharomyces
cerevisiae protein eIF3j (HCR1) has been shown to be
required for processing of 20S pre-rRNA and binds to 18S
rRNA and eIF3 subunits Rpg1p and Prt1p.
Length = 242
Score = 31.9 bits (73), Expect = 2.1
Identities = 16/71 (22%), Positives = 29/71 (40%), Gaps = 13/71 (18%)
Query: 477 KEDQAEYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEEDK-TP 535
+E + K K + K + EKE+ K E++ + EE+ ED+
Sbjct: 47 EEKAKVAAKAKAKKALKAK-----------IEEKEKAKREKEEKGLRELEEDTPEDELAE 95
Query: 536 KL-EDEAQLDA 545
KL + Q ++
Sbjct: 96 KLRLRKLQEES 106
>gnl|CDD|240520 cd13156, KOW_RPL6, KOW motif of Ribosomal Protein L6. RPL6
contains KOW motif that has an extra ribosomal role as
an oncogenic. KOW domain is known as an RNA-binding
motif that is shared so far among some families of
ribosomal proteins, the essential bacterial
transcriptional elongation factor NusG, the eukaryotic
chromatin elongation factor Spt5, the higher eukaryotic
KIN17 proteins and Mtr4. .
Length = 152
Score = 31.0 bits (71), Expect = 2.1
Identities = 18/46 (39%), Positives = 29/46 (63%), Gaps = 3/46 (6%)
Query: 505 KLLVEKEREKELSEDEEEEKK-EEEKEEDKTPKLEDEAQLDAIKKV 549
K +K++E E E+++++ EE++ED+ K D A L AIKKV
Sbjct: 85 KKKKKKKKEGEFFEEKKKKYVVSEERKEDQ--KAVDAALLAAIKKV 128
>gnl|CDD|150406 pfam09727, CortBP2, Cortactin-binding protein-2. This entry is the
first approximately 250 residues of cortactin-binding
protein 2. In addition to being a positional candidate
for autism this protein is expressed at highest levels
in the brain in humans. The human protein has six
associated ankyrin repeat domains pfam00023 towards the
C-terminus which act as protein-protein interaction
domains.
Length = 193
Score = 31.4 bits (71), Expect = 2.4
Identities = 27/93 (29%), Positives = 46/93 (49%), Gaps = 17/93 (18%)
Query: 465 PLGRGTKIVLHMKEDQ---------AEYIEEKKIKEIVKKHSQFIGYPIK-----LLVEK 510
P+ K++ KE Q AE + K + E+ ++ + I Y K L+E+
Sbjct: 83 PMSELDKVMAKQKETQRRMLAQLLAAEKRQRKTVLELEEEKRKHIRYMKKSDDFTNLLEQ 142
Query: 511 ERE--KELSEDEEEEKKEEEKEEDKT-PKLEDE 540
ERE K+L E E+ ++ ++E+E K LE+E
Sbjct: 143 ERERLKKLLEQEKSQQAKKEQEHRKLLATLEEE 175
>gnl|CDD|218538 pfam05285, SDA1, SDA1. This family consists of several SDA1
protein homologues. SDA1 is a Saccharomyces cerevisiae
protein which is involved in the control of the actin
cytoskeleton. The protein is essential for cell
viability and is localised in the nucleus.
Length = 317
Score = 31.9 bits (73), Expect = 2.5
Identities = 25/101 (24%), Positives = 42/101 (41%), Gaps = 19/101 (18%)
Query: 442 DDEQYIWESSAGGSFTIKPDNSEPLGRGTKIVLHMKEDQAEYIEEKKIKEIVKKHSQFIG 501
DDE+ WE D V KE ++ E+++ K+ K
Sbjct: 98 DDEEEEWEVEE--------DEDSDDEGEWIDVESDKEIESSDSEDEEEKDEAAKK----- 144
Query: 502 YPIKLLVEKEREKELSEDEEEEKKEEEKEEDKTPKLEDEAQ 542
+++ ++ELSE++EEE EEE+ E + K + A
Sbjct: 145 ------AKEDSDEELSEEDEEEAAEEEEAEAEKEKASELAT 179
>gnl|CDD|223496 COG0419, SbcC, ATPase involved in DNA repair [DNA replication,
recombination, and repair].
Length = 908
Score = 32.0 bits (73), Expect = 2.6
Identities = 23/87 (26%), Positives = 35/87 (40%), Gaps = 6/87 (6%)
Query: 474 LHMKEDQAEYIEEKK------IKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEE 527
L E E +EEK +EI + + G L +E ++L EE +K E
Sbjct: 276 LRELERLLEELEEKIERLEELEREIEELEEELEGLRALLEELEELLEKLKSLEERLEKLE 335
Query: 528 EKEEDKTPKLEDEAQLDAIKKVFLSQI 554
EK E +LE+ A+ L +
Sbjct: 336 EKLEKLESELEELAEEKNELAKLLEER 362
Score = 31.7 bits (72), Expect = 4.0
Identities = 22/69 (31%), Positives = 39/69 (56%), Gaps = 8/69 (11%)
Query: 486 EKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEEDKTPKLEDEAQLDA 545
+KK+KE+ ++ SQ L E + ELSE E E ++ EE+ E + KL +A+L+
Sbjct: 599 KKKLKELEERLSQ--------LEELLQSLELSEAENELEEAEEELESELEKLNLQAELEE 650
Query: 546 IKKVFLSQI 554
+ + L ++
Sbjct: 651 LLQAALEEL 659
Score = 30.5 bits (69), Expect = 8.2
Identities = 17/78 (21%), Positives = 29/78 (37%), Gaps = 6/78 (7%)
Query: 478 EDQAEYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEEDKTPKL 537
E Q + E + + +++E+E+E E E E +E E L
Sbjct: 191 EGQLSELLEDIEDLLEALEEELKELKKLEEIQEEQEEEELEQEIEALEERLAE------L 244
Query: 538 EDEAQLDAIKKVFLSQII 555
E+E + K L +I
Sbjct: 245 EEEKERLEELKARLLEIE 262
>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
Rpc31. RNA polymerase III contains seventeen subunits
in yeasts and in human cells. Twelve of these are akin
to RNA polymerase I or II and the other five are RNA pol
III-specific, and form the functionally distinct groups
(i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
Rpc34 and Rpc82 form a cluster of enzyme-specific
subunits that contribute to transcription initiation in
S.cerevisiae and H.sapiens. There is evidence that these
subunits are anchored at or near the N-terminal Zn-fold
of Rpc1, itself prolonged by a highly conserved but RNA
polymerase III-specific domain.
Length = 221
Score = 31.3 bits (71), Expect = 2.6
Identities = 12/37 (32%), Positives = 26/37 (70%)
Query: 504 IKLLVEKEREKELSEDEEEEKKEEEKEEDKTPKLEDE 540
KL + +++ KEL ++ +E+ E+++EE++ + EDE
Sbjct: 151 EKLSMLEKKLKELEAEDVDEEDEKDEEEEEEEEEEDE 187
Score = 30.5 bits (69), Expect = 4.7
Identities = 14/41 (34%), Positives = 27/41 (65%)
Query: 504 IKLLVEKEREKELSEDEEEEKKEEEKEEDKTPKLEDEAQLD 544
+ +L +K +E E + +EE++K+EE+EE++ + ED D
Sbjct: 153 LSMLEKKLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDD 193
>gnl|CDD|203446 pfam06428, Sec2p, GDP/GTP exchange factor Sec2p. In Saccharomyces
cerevisiae, Sec2p is a GDP/GTP exchange factor for
Sec4p, which is required for vesicular transport at the
post-Golgi stage of yeast secretion.
Length = 90
Score = 29.6 bits (67), Expect = 2.7
Identities = 15/31 (48%), Positives = 19/31 (61%), Gaps = 1/31 (3%)
Query: 513 EKELSEDEEEEKKEEEKE-EDKTPKLEDEAQ 542
EK+ E+EKK+ E+E ED T L DEA
Sbjct: 6 EKKKRLRAEKEKKKLEQEVEDLTASLFDEAN 36
>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
Length = 509
Score = 31.9 bits (73), Expect = 2.8
Identities = 17/68 (25%), Positives = 28/68 (41%), Gaps = 13/68 (19%)
Query: 508 VEKEREKELSEDEEEEKKEEEKEEDKTPK--------LEDEAQLDAIKKVFLSQIICIGS 559
+ E + + D+E+E+K+E KE +K + EA A K L+ +
Sbjct: 155 DDDEDDDDDDVDDEDEEKKEAKELEKLSDDDDFVWDEDDSEALRQARKDAKLTA-----T 209
Query: 560 DRIVKMQL 567
VK L
Sbjct: 210 ADPVKAYL 217
>gnl|CDD|215961 pfam00510, COX3, Cytochrome c oxidase subunit III.
Length = 258
Score = 31.5 bits (72), Expect = 2.9
Identities = 21/62 (33%), Positives = 26/62 (41%), Gaps = 13/62 (20%)
Query: 947 YFQGYDPEVNPSTLAD-------FAAGAFRGLHSLIPGTIKL-VNERRVTNQELPFSSTM 998
Q Y+ P T++D F A F GLH +I GTI L V R+ N T
Sbjct: 172 GLQAYEYYEAPFTISDGVYGSTFFMATGFHGLHVII-GTIFLLVCLLRLLNYHF----TS 226
Query: 999 QH 1000
H
Sbjct: 227 NH 228
>gnl|CDD|222160 pfam13476, AAA_23, AAA domain.
Length = 204
Score = 31.0 bits (70), Expect = 2.9
Identities = 19/100 (19%), Positives = 43/100 (43%), Gaps = 6/100 (6%)
Query: 455 SFTIKPDNSEPLGRGTKIVLHMKEDQAEYIEEKKIKEIVKKHSQFIGY------PIKLLV 508
S + + + + ++ + E Q E K + + ++G K
Sbjct: 95 SRELTKKKGKKVKKSILEIVEIDELQEFIDELLKSDKEILPLLLYLGQERLEELKFKRKE 154
Query: 509 EKEREKELSEDEEEEKKEEEKEEDKTPKLEDEAQLDAIKK 548
+KER +EL ++ EE + E++ E + E + +L+ +K+
Sbjct: 155 KKERLEELEKELEELEDEKDLLEKLLEEKEKKKELEELKE 194
>gnl|CDD|217503 pfam03344, Daxx, Daxx Family. The Daxx protein (also known as the
Fas-binding protein) is thought to play a role in
apoptosis, but precise role played by Daxx remains to be
determined. Daxx forms a complex with Axin.
Length = 715
Score = 31.8 bits (72), Expect = 2.9
Identities = 14/34 (41%), Positives = 23/34 (67%)
Query: 509 EKEREKELSEDEEEEKKEEEKEEDKTPKLEDEAQ 542
E+E E++ SE+EE E +EEE+E + E+E +
Sbjct: 454 EEEEEEQESEEEEGEDEEEEEEVEADNGSEEEME 487
Score = 31.4 bits (71), Expect = 4.2
Identities = 17/35 (48%), Positives = 25/35 (71%)
Query: 508 VEKEREKELSEDEEEEKKEEEKEEDKTPKLEDEAQ 542
VE+E E+E E+EEE++ EEE+ ED+ + E EA
Sbjct: 445 VEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEAD 479
>gnl|CDD|225572 COG3028, COG3028, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 187
Score = 30.8 bits (70), Expect = 3.0
Identities = 23/82 (28%), Positives = 38/82 (46%), Gaps = 16/82 (19%)
Query: 1293 DKVTVTSKHNDDEQYIWESSAGGSFTIKPDNSEPLGRGTKIV---------LHMKEDQAE 1343
D V + ++D++ IW S + IK D G ++V + + ED E
Sbjct: 10 DDVPGDDEEDEDDEIIWVSKS----QIKRDAEALQDLGEELVDLTKAALAKIPLDEDLLE 65
Query: 1344 YIEE-KKIK-EIVKK-HSQFIG 1362
IE ++IK EI ++ Q+IG
Sbjct: 66 AIELAQRIKSEIARRRQLQYIG 87
>gnl|CDD|185429 PTZ00074, PTZ00074, 60S ribosomal protein L34; Provisional.
Length = 135
Score = 30.0 bits (68), Expect = 3.4
Identities = 12/54 (22%), Positives = 30/54 (55%), Gaps = 14/54 (25%)
Query: 484 IEEKKI-KEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEEDKTPK 536
+EE+KI K+++K+ ++ ++K+ + ++++KK+ K+ K K
Sbjct: 95 VEEQKIVKQVLKEKAK-------------QKKQKKKKKKKKKKKTSKKAAKKKK 135
>gnl|CDD|237761 PRK14597, PRK14597, peptide deformylase; Provisional.
Length = 166
Score = 30.5 bits (69), Expect = 3.5
Identities = 14/43 (32%), Positives = 20/43 (46%), Gaps = 1/43 (2%)
Query: 5 LRQKADADKN-DKAVKDLVNLLFETSLLSSGFTLEDPQVELTL 46
LR+KA D +K + + ET G L PQV ++L
Sbjct: 11 LRKKAKPVTKFDDNLKKTIEEMIETMYHYDGVGLAAPQVGISL 53
>gnl|CDD|201540 pfam00992, Troponin, Troponin. Troponin (Tn) contains three
subunits, Ca2+ binding (TnC), inhibitory (TnI), and
tropomyosin binding (TnT). this Pfam contains members of
the TnT subunit. Troponin is a complex of three
proteins, Ca2+ binding (TnC), inhibitory (TnI), and
tropomyosin binding (TnT). The troponin complex
regulates Ca++ induced muscle contraction. This family
includes troponin T and troponin I. Troponin I binds to
actin and troponin T binds to tropomyosin.
Length = 131
Score = 30.2 bits (69), Expect = 3.5
Identities = 17/50 (34%), Positives = 24/50 (48%), Gaps = 8/50 (16%)
Query: 505 KLLVEKEREKELSEDEEEEKKEEEKE---EDKTPKLE---DEAQLDAIKK 548
K L+ + E E E+ KKEEEKE ++ P L A+L + K
Sbjct: 5 KSLLLL-KAAEELE-FEQRKKEEEKEKYLAERCPPLRLSLSRAELQELCK 52
>gnl|CDD|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding
[General function prediction only].
Length = 239
Score = 31.2 bits (71), Expect = 3.5
Identities = 18/68 (26%), Positives = 29/68 (42%), Gaps = 6/68 (8%)
Query: 486 EKKIKEIVKKHSQFIGYPIKLLVEKEREKELSE-----DEEEEKKEEEKEEDKTPKLEDE 540
E+K+ + K + I++ + KER L + EE EK E+E E+ K E
Sbjct: 79 EEKLSAV-KDERELRALNIEIQIAKERINSLEDELAELMEEIEKLEKEIEDLKERLERLE 137
Query: 541 AQLDAIKK 548
L +
Sbjct: 138 KNLAEAEA 145
>gnl|CDD|218552 pfam05316, VAR1, Mitochondrial ribosomal protein (VAR1). This family
consists of the yeast mitochondrial ribosomal proteins
VAR1. Mitochondria possess their own ribosomes
responsible for the synthesis of a small number of
proteins encoded by the mitochondrial genome. In yeast
the two ribosomal RNAs and a single ribosomal protein,
VAR1, are products of mitochondrial genes, and the
remaining approximately 80 ribosomal proteins are encoded
in the nucleus. VAR1 along with 15S rRNA are necessary
for the formation of mature 37S subunits.
Length = 350
Score = 31.5 bits (71), Expect = 3.6
Identities = 20/74 (27%), Positives = 30/74 (40%), Gaps = 17/74 (22%)
Query: 1196 VANVNNNNN-------NNNNNNITNVYFFLFPGFGQDPIIPNKEEKTLTIIDSGIGMT-- 1246
+ N+NNNNN NN+NNN+ + + + NK L + IG +
Sbjct: 230 INNINNNNNIKYNNMINNSNNNLNINNIY-------NNLNINKISMDLLMNKYLIGWSIL 282
Query: 1247 -KADLVNNLGTIAK 1259
K L N+ K
Sbjct: 283 FKGRLNKNISRSIK 296
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 31.6 bits (72), Expect = 3.7
Identities = 17/74 (22%), Positives = 29/74 (39%), Gaps = 3/74 (4%)
Query: 477 KEDQAEYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSE--DEEEEKKEEEKEEDKT 534
E+ E +EE K ++I + L ++ EL E +E EEK EE +
Sbjct: 318 LEELEERLEELK-EKIEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEE 376
Query: 535 PKLEDEAQLDAIKK 548
+L ++
Sbjct: 377 LFEALREELAELEA 390
>gnl|CDD|221446 pfam12158, DUF3592, Protein of unknown function (DUF3592). This
family of proteins is functionally uncharacterized.This
family of proteins is found in bacteria, archaea,
eukaryotes and viruses. Proteins in this family are
typically between 150 and 242 amino acids in length.
Length = 134
Score = 30.1 bits (68), Expect = 3.8
Identities = 18/95 (18%), Positives = 35/95 (36%), Gaps = 12/95 (12%)
Query: 1053 LVGLTLGFFYLWYCGGFLLQVTRLSQVYRGVNDIDYYIGGLLESVIRG--TLSSPSFRCV 1110
++ L +G L LLQ R G +++S R + ++ V
Sbjct: 9 VLFLLIGLVLLVVGVRLLLQAQRSRTWPE-------TPGTVVDSESRDSDGRRATTYYPV 61
Query: 1111 FAEAFYRYKYGDRYFYEGPPATNPGAFTPGKQLTL 1145
+RY G R + +NP + G+++ +
Sbjct: 62 VE---FRYDDGRRIRFRSGSGSNPPGYRVGERVPV 93
>gnl|CDD|219124 pfam06658, DUF1168, Protein of unknown function (DUF1168). This
family consists of several hypothetical eukaryotic
proteins of unknown function.
Length = 142
Score = 30.0 bits (68), Expect = 4.2
Identities = 12/32 (37%), Positives = 24/32 (75%), Gaps = 2/32 (6%)
Query: 507 LVEKEREKELSEDEEEEKKEE--EKEEDKTPK 536
L++++ +KE ++E ++K+EE K+E+KT K
Sbjct: 54 LMDEKWKKETEDEEFQQKREEKKRKDEEKTAK 85
>gnl|CDD|218482 pfam05178, Kri1, KRI1-like family. The yeast member of this family
(Kri1p) is found to be required for 40S ribosome
biogenesis in the nucleolus.
Length = 99
Score = 29.2 bits (66), Expect = 4.3
Identities = 17/41 (41%), Positives = 26/41 (63%), Gaps = 6/41 (14%)
Query: 510 KEREKELSEDEEEEKKEEEKEEDKTPKLED-EAQLDAIKKV 549
KER+ +EE+ ++EEE + K K E+ E +L+ IKKV
Sbjct: 1 KERK-----EEEKAQREEELKRLKNLKREEIEEKLEKIKKV 36
>gnl|CDD|237035 PRK12280, rplW, 50S ribosomal protein L23; Reviewed.
Length = 158
Score = 30.1 bits (68), Expect = 4.3
Identities = 14/42 (33%), Positives = 24/42 (57%), Gaps = 1/42 (2%)
Query: 501 GYPIKLLVEK-EREKELSEDEEEEKKEEEKEEDKTPKLEDEA 541
GY I L E+ E+E++ E EEK+ + +++K K E +
Sbjct: 85 GYSINLFPEESEKEQKEVSKETEEKEAIKAKKEKKEKKEKKV 126
>gnl|CDD|225288 COG2433, COG2433, Uncharacterized conserved protein [Function
unknown].
Length = 652
Score = 31.2 bits (71), Expect = 4.7
Identities = 13/45 (28%), Positives = 23/45 (51%), Gaps = 2/45 (4%)
Query: 501 GYPIKLLVEKEREKELSEDEEEEKKEEEKEEDKTPKLEDEAQLDA 545
GYP+ + K +E+E E+E EEE+ + T + +L+
Sbjct: 387 GYPLAEALSKVKEEE--RPREKEGTEEEERREITVYEKRIKKLEE 429
>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota). DNA
Topoisomerase I (eukaryota), DNA topoisomerase V,
Vaccina virus topoisomerase, Variola virus
topoisomerase, Shope fibroma virus topoisomeras.
Length = 391
Score = 30.8 bits (70), Expect = 4.9
Identities = 19/67 (28%), Positives = 37/67 (55%), Gaps = 1/67 (1%)
Query: 477 KEDQAEYIEEKKIKEIVKK-HSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEEDKTP 535
K + + + I ++ +K S+F KL E + +K+ + EE++KK+ E+ E++
Sbjct: 294 KRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKKQIERLEERIE 353
Query: 536 KLEDEAQ 542
KLE +A
Sbjct: 354 KLEVQAT 360
>gnl|CDD|224389 COG1472, BglX, Beta-glucosidase-related glycosidases [Carbohydrate
transport and metabolism].
Length = 397
Score = 30.8 bits (70), Expect = 5.2
Identities = 16/64 (25%), Positives = 27/64 (42%), Gaps = 4/64 (6%)
Query: 1242 GIGMTKADLVNNLGTIAKSGTKAFMEALQAGADISMIGQFGVGFYSAYL-VADKVTVTSK 1300
G+ ++ + + S AL+AG DI ++ Y AYL V + V ++
Sbjct: 243 GVVISDDLSMKAIAAAHGSAADRAEAALKAGVDIVLVCN---ELYEAYLVVLELVGLSEA 299
Query: 1301 HNDD 1304
DD
Sbjct: 300 RLDD 303
>gnl|CDD|204414 pfam10211, Ax_dynein_light, Axonemal dynein light chain. Axonemal
dynein light chain proteins play a dynamic role in
flagellar and cilia motility. Eukaryotic cilia and
flagella are complex organelles consisting of a core
structure, the axoneme, which is composed of nine
microtubule doublets forming a cylinder that surrounds a
pair of central singlet microtubules. This
ultra-structural arrangement seems to be one of the most
stable micro-tubular assemblies known and is responsible
for the flagellar and ciliary movement of a large number
of organisms ranging from protozoan to mammals. This
light chain interacts directly with the N-terminal half
of the heavy chains.
Length = 189
Score = 30.3 bits (69), Expect = 5.2
Identities = 19/41 (46%), Positives = 27/41 (65%), Gaps = 1/41 (2%)
Query: 511 EREKELSEDEEE-EKKEEEKEEDKTPKLEDEAQLDAIKKVF 550
+ E+ SE E+E +K EEEKEE + E EA+L+AI+K
Sbjct: 117 QAEQGKSELEQEIKKLEEEKEELEKRVAELEAKLEAIEKRE 157
>gnl|CDD|146486 pfam03879, Cgr1, Cgr1 family. Members of this family are
coiled-coil proteins that are involved in pre-rRNA
processing.
Length = 105
Score = 28.9 bits (65), Expect = 5.2
Identities = 12/33 (36%), Positives = 17/33 (51%), Gaps = 12/33 (36%)
Query: 510 KEREKELSEDEEEEKKE------------EEKE 530
K REKEL +++E E++ EEKE
Sbjct: 44 KAREKELKDEKEAERQRRIQAIKERRAAKEEKE 76
>gnl|CDD|226175 COG3649, COG3649, CRISPR system related protein [Defense
mechanisms].
Length = 283
Score = 30.7 bits (69), Expect = 5.3
Identities = 20/48 (41%), Positives = 25/48 (52%)
Query: 494 KKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEEDKTPKLEDEA 541
K+HS +GY +VEKE E EL D E K E DK ++E E
Sbjct: 225 KEHSSKLGYLSSAVVEKELESELKLDVTELKDVYESLLDKIERIEYEV 272
>gnl|CDD|145949 pfam03066, Nucleoplasmin, Nucleoplasmin. Nucleoplasmins are also
known as chromatin decondensation proteins. They bind to
core histones and transfer DNA to them in a reaction
that requires ATP. This is thought to play a role in the
assembly of regular nucleosomal arrays.
Length = 146
Score = 29.6 bits (67), Expect = 5.4
Identities = 11/28 (39%), Positives = 19/28 (67%)
Query: 509 EKEREKELSEDEEEEKKEEEKEEDKTPK 536
+ + E E ED+EE+ E+E EE+++P
Sbjct: 115 DDDEEDEEEEDDEEDDDEDESEEEESPV 142
>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
(vWA) domain [General function prediction only].
Length = 4600
Score = 31.1 bits (70), Expect = 5.4
Identities = 14/38 (36%), Positives = 24/38 (63%)
Query: 507 LVEKEREKELSEDEEEEKKEEEKEEDKTPKLEDEAQLD 544
LV KE + + ED++ ++KE+E+E ++DE Q D
Sbjct: 3935 LVSKEDDNKALEDKDRQEKEDEEEMSDDVGIDDEIQPD 3972
>gnl|CDD|227657 COG5354, COG5354, Uncharacterized protein, contains Trp-Asp (WD)
repeat [General function prediction only].
Length = 561
Score = 31.0 bits (70), Expect = 5.4
Identities = 18/77 (23%), Positives = 30/77 (38%), Gaps = 5/77 (6%)
Query: 459 KPDNSEPLGRGTKIVLHMKE---DQAEYIEEKKIKEIVKKHSQFIGYPIKLLV-EKEREK 514
K E KE EY +E KI+ ++KK ++ E+
Sbjct: 481 KAVIMEADSANRSSAPRKKELVEQWPEYSDEDKIRSLLKKLRAIEALKERMRSGEELEVI 540
Query: 515 ELSEDEEEEK-KEEEKE 530
++++ E EE+ E KE
Sbjct: 541 QVNKIETEEEVLSELKE 557
>gnl|CDD|204405 pfam10146, zf-C4H2, Zinc finger-containing protein. This is a
family of proteins which appears to have a highly
conserved zinc finger domain at the C terminal end,
described as -C-X2-CH-X3-H-X5-C-X2-C-. The structure is
predicted to contain a coiled coil. Members are
annotated as being tumour-associated antigen HCA127 in
humans but this could not confirmed.
Length = 215
Score = 30.2 bits (68), Expect = 5.5
Identities = 31/113 (27%), Positives = 48/113 (42%), Gaps = 13/113 (11%)
Query: 488 KIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKE-----EDKTPKLEDEAQ 542
IKEI K + KL E + E+E E EE+ KE +KE E+K +E+ Q
Sbjct: 2 GIKEIRNKTDELE----KLKDEIKAEEEALESEEKHLKEYDKEMEELLEEKMQHVEELRQ 57
Query: 543 LDAIKKVFLSQIICIGSDRIVKM----QLQSFYMPSERQFNVTLSDNLHLDLL 591
+ A ++I S+ +M ++ Y P + N L L+ L
Sbjct: 58 IHADINDMETEIKQSKSELERRMGKIARMHGEYNPLKESINEMRKLELGLEEL 110
>gnl|CDD|221756 pfam12757, DUF3812, Protein of unknown function (DUF3812). This is
a family of fungal proteins whose function is not known.
Length = 126
Score = 29.6 bits (67), Expect = 5.5
Identities = 10/28 (35%), Positives = 18/28 (64%)
Query: 509 EKEREKELSEDEEEEKKEEEKEEDKTPK 536
++ EK+L E+E + + EE KE ++ K
Sbjct: 98 ARDEEKKLDEEEAKRQHEEAKEREREKK 125
>gnl|CDD|235593 PRK05758, PRK05758, F0F1 ATP synthase subunit delta; Validated.
Length = 177
Score = 30.1 bits (69), Expect = 5.7
Identities = 21/75 (28%), Positives = 34/75 (45%), Gaps = 10/75 (13%)
Query: 485 EEKK--IKEIVKKHSQFIGYPIKLLVEKEREKELSE--DE-EEEKKEEEKEEDKT----P 535
E+KK + + K S+++ +K+L E R L E ++ E + E E D
Sbjct: 56 EQKKKLLAAVFKSLSEYVQNFLKVLAENGRLALLPEILEQFEALRAEHENIVDAEVTSAF 115
Query: 536 KLEDEAQLDAIKKVF 550
L +E QLD +K
Sbjct: 116 PLSEE-QLDKLKAAL 129
>gnl|CDD|240427 PTZ00470, PTZ00470, glycoside hydrolase family 47 protein;
Provisional.
Length = 522
Score = 30.8 bits (70), Expect = 5.8
Identities = 16/45 (35%), Positives = 18/45 (40%)
Query: 1198 NVNNNNNNNNNNNITNVYFFLFPGFGQDPIIPNKEEKTLTIIDSG 1242
N NNNNNNNN N N + P K+E ID
Sbjct: 17 NYNNNNNNNNQINSNNPNNNGNNQASKLPRGKKKQENPFNKIDEV 61
>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
Length = 520
Score = 30.9 bits (71), Expect = 5.9
Identities = 22/77 (28%), Positives = 38/77 (49%), Gaps = 17/77 (22%)
Query: 480 QAEYIEEKKI----KEIVKKHSQFIGYPIKLLVEKE---REKELSEDEEEEKKEEEKEED 532
+AE I+++ + +EI K ++F EKE R EL + E+ ++EE +
Sbjct: 50 EAEAIKKEALLEAKEEIHKLRNEF---------EKELRERRNELQKLEKRLLQKEENLDR 100
Query: 533 KTPKLED-EAQLDAIKK 548
K LE E +L+ +K
Sbjct: 101 KLELLEKREEELEKKEK 117
>gnl|CDD|150531 pfam09871, DUF2098, Uncharacterized protein conserved in archaea
(DUF2098). This domain, found in various hypothetical
prokaryotic proteins, has no known function.
Length = 91
Score = 28.4 bits (64), Expect = 6.0
Identities = 9/33 (27%), Positives = 18/33 (54%)
Query: 509 EKEREKELSEDEEEEKKEEEKEEDKTPKLEDEA 541
EK+ E+E ++E E+ ++E+E + L
Sbjct: 54 EKKEEREEDKEELIERIKKEEETFEDVDLGSAG 86
>gnl|CDD|220069 pfam08915, tRNA-Thr_ED, Archaea-specific editing domain of
threonyl-tRNA synthetase. Archaea-specific editing
domain of threonyl-tRNA synthetase, with marked
structural similarity to D-amino acids deacylases found
in eubacteria and eukaryotes. This domain can bind
D-amino acids, and ensures high fidelity during
translation. It is especially responsible for removing
incorrectly attached serine from tRNA-Thr. The domain
forms a fold that can be be defined as two layers of
beta-sheets (a three-stranded sheet and a five-stranded
sheet), with two alpha-helices located adjacent to the
five-stranded sheet.
Length = 137
Score = 29.5 bits (67), Expect = 6.2
Identities = 20/80 (25%), Positives = 33/80 (41%), Gaps = 23/80 (28%)
Query: 496 HSQFIGYPIKLLVEKEREKELSEDEEEEKKEE------------EKEEDKTPKLEDEAQL 543
H+ Y + ++K L EEEE KE EK +++ P+ E
Sbjct: 7 HADRFEY-------EPKKKTLKIAEEEEPKEGRLEEALVVFTAVEKGDEENPEEVIEKAA 59
Query: 544 DAIKKVFLSQIICIGSDRIV 563
+ I KV S+ + + R+V
Sbjct: 60 EEILKVA-SK---VKATRVV 75
>gnl|CDD|238356 cd00660, Topoisomer_IB_N, Topoisomer_IB_N: N-terminal DNA binding
fragment found in eukaryotic DNA topoisomerase (topo) IB
proteins similar to the monomeric yeast and human topo I
and heterodimeric topo I from Leishmania donvanni. Topo
I enzymes are divided into: topo type IA (bacterial)
and type IB (eukaryotic). Topo I relaxes superhelical
tension in duplex DNA by creating a single-strand nick,
the broken strand can then rotate around the unbroken
strand to remove DNA supercoils and, the nick is
religated, liberating topo I. These enzymes regulate the
topological changes that accompany DNA replication,
transcription and other nuclear processes. Human topo I
is the target of a diverse set of anticancer drugs
including camptothecins (CPTs). CPTs bind to the topo
I-DNA complex and inhibit re-ligation of the
single-strand nick, resulting in the accumulation of
topo I-DNA adducts. In addition to differences in
structure and some biochemical properties,
Trypanosomatid parasite topo I differ from human topo I
in their sensitivity to CPTs and other classical topo I
inhibitors. Trypanosomatid topos I play putative roles
in organizing the kinetoplast DNA network unique to
these parasites. This family may represent more than
one structural domain.
Length = 215
Score = 29.9 bits (68), Expect = 6.4
Identities = 17/50 (34%), Positives = 32/50 (64%), Gaps = 2/50 (4%)
Query: 486 EKKIKEIVKKHSQFIGYPIK--LLVEKEREKELSEDEEEEKKEEEKEEDK 533
K+ K I+KK S+ PI EKE++K +S++E++ KEE+++ ++
Sbjct: 72 TKEEKHIIKKLSKCDFTPIYQYFEEEKEKKKAMSKEEKKAIKEEKEKLEE 121
>gnl|CDD|200340 TIGR03927, T7SS_EssA_Firm, type VII secretion protein EssA.
Members of this family are associated with type VII
secretion of WXG100 family targets in the Firmicutes,
but not in the Actinobacteria. This highly divergent
protein family consists largely of a central region of
highly polar low-complexity sequence containing
occasional LF motifs in weak repeats about 17 residues
in length, flanked by hydrophobic N- and C-terminal
regions [Protein fate, Protein and peptide secretion and
trafficking].
Length = 150
Score = 29.7 bits (67), Expect = 6.6
Identities = 16/60 (26%), Positives = 32/60 (53%), Gaps = 4/60 (6%)
Query: 474 LHMKEDQAEYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEEDK 533
L +K ++ E+K I EI + Q K L E +K+++ + +EK E+E+ +++
Sbjct: 24 LQVKPNR---YEKKDI-EINTDYLQEETELDKELFTPEEQKKITFQKHKEKPEQEELKNQ 79
Score = 29.3 bits (66), Expect = 8.3
Identities = 10/61 (16%), Positives = 22/61 (36%), Gaps = 2/61 (3%)
Query: 512 REKELSEDEEEEKKEEEKEEDKTPKLEDEAQLDAIKKV--FLSQIICIGSDRIVKMQLQS 569
L E+ E +K+ EE K + + +++ L ++ + + Q
Sbjct: 39 NTDYLQEETELDKELFTPEEQKKITFQKHKEKPEQEELKNQLFSENATENNTVKATKKQL 98
Query: 570 F 570
F
Sbjct: 99 F 99
>gnl|CDD|220371 pfam09736, Bud13, Pre-mRNA-splicing factor of RES complex. This
entry is characterized by proteins with alternating
conserved and low-complexity regions. Bud13 together
with Snu17p and a newly identified factor,
Pml1p/Ylr016c, form a novel trimeric complex. called The
RES complex, pre-mRNA retention and splicing complex.
Subunits of this complex are not essential for viability
of yeasts but they are required for efficient splicing
in vitro and in vivo. Furthermore, inactivation of this
complex causes pre-mRNA leakage from the nucleus. Bud13
contains a unique, phylogenetically conserved C-terminal
region of unknown function.
Length = 141
Score = 29.2 bits (66), Expect = 6.7
Identities = 11/24 (45%), Positives = 21/24 (87%)
Query: 508 VEKEREKELSEDEEEEKKEEEKEE 531
+E++RE++ E EE+E+KEE+++E
Sbjct: 14 IEEKREEKEREKEEKERKEEKEKE 37
>gnl|CDD|239286 cd02988, Phd_like_VIAF, Phosducin (Phd)-like family, Viral
inhibitor of apoptosis (IAP)-associated factor (VIAF)
subfamily; VIAF is a Phd-like protein that functions in
caspase activation during apoptosis. It was identified
as an IAP binding protein through a screen of a human
B-cell library using a prototype IAP. VIAF lacks a
consensus IAP binding motif and while it does not
function as an IAP antagonist, it still plays a
regulatory role in the complete activation of caspases.
VIAF itself is a substrate for IAP-mediated
ubiquitination, suggesting that it may be a target of
IAPs in the prevention of cell death. The similarity of
VIAF to Phd points to a potential role distinct from
apoptosis regulation. Phd functions as a cytosolic
regulator of G protein by specifically binding to G
protein betagamma (Gbg)-subunits. The C-terminal domain
of Phd adopts a thioredoxin fold, but it does not
contain a CXXC motif. Phd interacts with G protein beta
mostly through the N-terminal helical domain.
Length = 192
Score = 29.9 bits (68), Expect = 6.8
Identities = 12/60 (20%), Positives = 22/60 (36%), Gaps = 12/60 (20%)
Query: 507 LVEKEREKELSEDEEEE----KKEEEKEEDKTPK--------LEDEAQLDAIKKVFLSQI 554
L K + E+E E + E E K ED+ L+ ++ L+++
Sbjct: 13 LPPKPPSPKEEEEEALELAIQEAHENALEKKLLDELDEELDEEEDDRFLEEYRRKRLAEM 72
>gnl|CDD|202427 pfam02841, GBP_C, Guanylate-binding protein, C-terminal domain.
Transcription of the anti-viral guanylate-binding
protein (GBP) is induced by interferon-gamma during
macrophage induction. This family contains GBP1 and
GPB2, both GTPases capable of binding GTP, GDP and GMP.
Length = 297
Score = 30.3 bits (69), Expect = 6.8
Identities = 21/91 (23%), Positives = 39/91 (42%), Gaps = 16/91 (17%)
Query: 462 NSEPLGRGTKI--VLHMKEDQAEYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSED 519
N P +G K VL E++ K+ E + + L KE+ E +E
Sbjct: 165 NQVP-RKGVKAEEVLQ------EFLNSKEAVEEAILQTD------QALTAKEKAIE-AER 210
Query: 520 EEEEKKEEEKEEDKTPKLEDEAQLDAIKKVF 550
+ E E E+E + + E+E ++A ++ +
Sbjct: 211 AKAEAAEAEQELLREKQKEEEQMMEAQERSY 241
>gnl|CDD|187774 cd09643, Csn1, CRISPR/Cas system-associated protein Cas9. CRISPR
(Clustered Regularly Interspaced Short Palindromic
Repeats) and associated Cas proteins comprise a system
for heritable host defense by prokaryotic cells against
phage and other foreign DNA; Very large protein
containing McrA/HNH-nuclease related domain and a
RuvC-like nuclease domain; signature gene for type II.
Length = 799
Score = 30.9 bits (70), Expect = 6.9
Identities = 24/81 (29%), Positives = 42/81 (51%), Gaps = 9/81 (11%)
Query: 487 KKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEEDKTPKLEDEAQLDAI 546
K + E+VKK+ G P K+++E RE ++ + KK ++K ED + A +
Sbjct: 501 KVVNELVKKY----GPPDKIVIEMARENGTNKGTKNRKKRQKKNEDNIKEAAS-ALEQKL 555
Query: 547 KKVFLSQIICIGSDRIVKMQL 567
K++ L I S I+K++L
Sbjct: 556 KELPLD----IKSKNILKLRL 572
>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain. This domain is
found at the N terminus of SMC proteins. The SMC
(structural maintenance of chromosomes) superfamily
proteins have ATP-binding domains at the N- and
C-termini, and two extended coiled-coil domains
separated by a hinge in the middle. The eukaryotic SMC
proteins form two kind of heterodimers: the SMC1/SMC3
and the SMC2/SMC4 types. These heterodimers constitute
an essential part of higher order complexes, which are
involved in chromatin and DNA dynamics. This family also
includes the RecF and RecN proteins that are involved in
DNA metabolism and recombination.
Length = 1162
Score = 30.7 bits (69), Expect = 6.9
Identities = 18/94 (19%), Positives = 42/94 (44%), Gaps = 6/94 (6%)
Query: 477 KEDQAEYIEEKKIKEIVKKHSQFI----GYPIKLLVEKEREKELSEDEEEEKKEEEKEED 532
+E + + KK+ E + ++ I ++ L KE+ K+ E + ++K E +EE+
Sbjct: 167 REKRKKKERLKKLIEETENLAELIIDLEELKLQELKLKEQAKKALEYYQLKEKLELEEEN 226
Query: 533 KTPKLEDEAQLDAIKKVFLSQIICIGSDRIVKMQ 566
D +L+ + L +++ + I +
Sbjct: 227 LL--YLDYLKLNEERIDLLQELLRDEQEEIESSK 258
>gnl|CDD|236080 PRK07734, motB, flagellar motor protein MotB; Reviewed.
Length = 259
Score = 30.1 bits (68), Expect = 7.1
Identities = 20/74 (27%), Positives = 34/74 (45%), Gaps = 13/74 (17%)
Query: 484 IEEKKIKEIVKK-HSQFIG--------YPIKLLVEKEREKELSEDEEEEKKEEEKEEDKT 534
I+ K K++ +S+F G K EKE E E+ +KKEE + + K
Sbjct: 43 IDAAKFKQMAVAFNSEFAGGTGVKDFLSDQKPEDEKELSASSLEAEQAKKKEEAEAKKK- 101
Query: 535 PKLEDEAQLDAIKK 548
++ +L A++K
Sbjct: 102 ---KEMEELKAVQK 112
>gnl|CDD|234083 TIGR03002, outer_YhbN_LptA, lipopolysaccharide transport periplasmic
protein LptA. Members of this protein family include
LptA (previously called YhbN). It was shown to be an
essential protein in E. coli, implicated in cell envelope
integrity, and to play a role in the delivery of LPS to
the outer leaflet of the outer membrane. It works with
LptB (formerly yhbG), a homolog of ABC transporter
ATP-binding proteins, encoded by an adjacent gene.
Numerous homologs in other Proteobacteria are found in a
conserved location near lipopolysaccharide inner core
biosynthesis genes. This family is related to organic
solvent tolerance protein (OstA), though distantly [Cell
envelope, Biosynthesis and degradation of surface
polysaccharides and lipopolysaccharides, Transport and
binding proteins, Other].
Length = 142
Score = 29.1 bits (66), Expect = 7.1
Identities = 16/65 (24%), Positives = 27/65 (41%), Gaps = 7/65 (10%)
Query: 1292 ADKVTVTSKHNDDEQYIWESSAGGS---FTIKPDNSEPL-GRGTKIVLHMKEDQAEYIEE 1347
ADKV VT + +A G F K DN + + G+ +I +D+
Sbjct: 43 ADKVVVTRNDAGGIEKA---TATGKPATFRQKLDNGKEIEGQANRIEYDPAKDEVVLTGN 99
Query: 1348 KKIKE 1352
++K+
Sbjct: 100 ARLKQ 104
>gnl|CDD|221185 pfam11719, Drc1-Sld2, DNA replication and checkpoint protein.
Genome duplication is precisely regulated by
cyclin-dependent kinases CDKs, which bring about the
onset of S phase by activating replication origins and
then prevent relicensing of origins until mitosis is
completed. The optimum sequence motif for CDK
phosphorylation is S/T-P-K/R-K/R, and Drc1-Sld2 is found
to have at least 11 potential phosphorylation sites.
Drc1 is required for DNA synthesis and S-M replication
checkpoint control. Drc1 associates with Cdc2 and is
phosphorylated at the onset of S phase when Cdc2 is
activated. Thus Cdc2 promotes DNA replication by
phosphorylating Drc1 and regulating its association with
Cut5. Sld2 and Sld3 represent the minimal set of S-CDK
substrates required for DNA replication.
Length = 397
Score = 30.5 bits (69), Expect = 7.2
Identities = 17/58 (29%), Positives = 30/58 (51%), Gaps = 2/58 (3%)
Query: 485 EEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEEDKTPKLEDEAQ 542
E K+ E K S+F+GY + + E E + EE EKK++ K++ + K+ +
Sbjct: 319 EIPKLDE--KSLSEFLGYMGGIDEDDEDEDDEESKEEVEKKQKVKKKPRKRKVNPVSN 374
>gnl|CDD|220102 pfam09073, BUD22, BUD22. BUD22 has been shown in yeast to be a
nuclear protein involved in bud-site selection. It plays
a role in positioning the proximal bud pole signal. More
recently it has been shown to be involved in ribosome
biogenesis.
Length = 424
Score = 30.6 bits (69), Expect = 7.4
Identities = 16/68 (23%), Positives = 30/68 (44%), Gaps = 4/68 (5%)
Query: 483 YIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEE-EKKEEEKEEDKTPKLEDEA 541
+ KK++ +KK F + L+ + + E ++ + EEE E KT L+ A
Sbjct: 3 HHGVKKLRRALKKAKGFE---RQKLIRRIKNAEKKSKNDKLARIEEEIEALKTLDLDQLA 59
Query: 542 QLDAIKKV 549
+ K+
Sbjct: 60 KKYLFSKL 67
>gnl|CDD|169396 PRK08349, PRK08349, hypothetical protein; Validated.
Length = 198
Score = 29.7 bits (67), Expect = 7.4
Identities = 12/44 (27%), Positives = 27/44 (61%), Gaps = 4/44 (9%)
Query: 466 LGRGTKIV-LHMKEDQAEYIEEKKIKEIVKKHSQFIGYPIKLLV 508
L RG ++ +H ++D+ +E+K++E+V++ + G +K V
Sbjct: 21 LRRGVEVYPVHFRQDE---KKEEKVRELVERLQELHGGKLKDPV 61
Score = 29.7 bits (67), Expect = 7.4
Identities = 12/44 (27%), Positives = 27/44 (61%), Gaps = 4/44 (9%)
Query: 1327 LGRGTKIV-LHMKEDQAEYIEEKKIKEIVKKHSQFIGYPIKLLV 1369
L RG ++ +H ++D+ +E+K++E+V++ + G +K V
Sbjct: 21 LRRGVEVYPVHFRQDE---KKEEKVRELVERLQELHGGKLKDPV 61
>gnl|CDD|235305 PRK04527, PRK04527, argininosuccinate synthase; Provisional.
Length = 400
Score = 30.2 bits (68), Expect = 7.6
Identities = 25/87 (28%), Positives = 37/87 (42%), Gaps = 21/87 (24%)
Query: 62 PIRQT-CQHEDNRVYERQANPEREPEVRGYGTRYRQSSYSSSYGYNRNSRESREYGRTET 120
PIR+ +H R YE++ ER G+G R +Q +Y+ N N G T +
Sbjct: 142 PIREIQKEHTQTRAYEQKYLEER-----GFGVRAKQKAYT----INENL-----LGVTMS 187
Query: 121 SGSRETYGRPDPSSGRPDQASGLCKPR 147
G + + P + A G C PR
Sbjct: 188 GGEIDRWEAPG------EGARGWCAPR 208
>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon.
Length = 431
Score = 30.4 bits (68), Expect = 7.6
Identities = 15/46 (32%), Positives = 23/46 (50%), Gaps = 8/46 (17%)
Query: 511 EREKELSEDEEEEKKEE------EKEEDK--TPKLEDEAQLDAIKK 548
ER K L E+E+ K+EE E+EE + ++E A K+
Sbjct: 225 ERRKVLEEEEQRRKQEEADRKSREEEEKRRLKEEIERRRAEAAEKR 270
>gnl|CDD|236603 PRK09640, PRK09640, RNA polymerase sigma factor SigX; Reviewed.
Length = 188
Score = 29.9 bits (67), Expect = 7.7
Identities = 14/36 (38%), Positives = 16/36 (44%)
Query: 510 KEREKELSEDEEEEKKEEEKEEDKTPKLEDEAQLDA 545
KER K D EE E+K PK E+ LD
Sbjct: 95 KERRKRRLMDALSLDPLEEASEEKAPKPEERGGLDR 130
>gnl|CDD|224418 COG1501, COG1501, Alpha-glucosidases, family 31 of glycosyl
hydrolases [Carbohydrate transport and metabolism].
Length = 772
Score = 30.4 bits (69), Expect = 7.8
Identities = 12/60 (20%), Positives = 26/60 (43%), Gaps = 8/60 (13%)
Query: 446 YIWESSAGGSFTIKPDNSEPLGRGTKIVLHMKEDQAEY--IEEKKIKEIVKKHSQFIGYP 503
++SA GSF + + + ++ Q +Y I K++++K++ G P
Sbjct: 205 LFVDNSAYGSFDVG------SEEYSYVQFSVEGGQLDYYVIAGPTPKDVLEKYTDLTGKP 258
Score = 30.4 bits (69), Expect = 7.8
Identities = 12/60 (20%), Positives = 26/60 (43%), Gaps = 8/60 (13%)
Query: 1307 YIWESSAGGSFTIKPDNSEPLGRGTKIVLHMKEDQAEY--IEEKKIKEIVKKHSQFIGYP 1364
++SA GSF + + + ++ Q +Y I K++++K++ G P
Sbjct: 205 LFVDNSAYGSFDVG------SEEYSYVQFSVEGGQLDYYVIAGPTPKDVLEKYTDLTGKP 258
>gnl|CDD|220377 pfam09747, DUF2052, Coiled-coil domain containing protein
(DUF2052). This entry is of sequences of two conserved
domains separated by a region of low complexity,
spanning some 200 residues. The function is unknown.
Length = 178
Score = 29.7 bits (67), Expect = 7.9
Identities = 13/29 (44%), Positives = 21/29 (72%)
Query: 504 IKLLVEKEREKELSEDEEEEKKEEEKEED 532
+L + +++E+E S +EEEE EEE EE+
Sbjct: 69 ARLELLEQQEEEESAEEEEEDPEEENEEE 97
>gnl|CDD|232942 TIGR00366, TIGR00366, TIGR00366 family protein. [Hypothetical
proteins, Conserved].
Length = 438
Score = 30.2 bits (68), Expect = 8.0
Identities = 14/36 (38%), Positives = 23/36 (63%), Gaps = 2/36 (5%)
Query: 510 KEREKELSEDEEEEKKEEEKEEDKTP--KLEDEAQL 543
K + +L + E+EE+K+ +K+ED T KLE+ L
Sbjct: 216 KSIDPKLLKKEKEEEKKLKKKEDATIAEKLENSRVL 251
>gnl|CDD|217286 pfam02919, Topoisom_I_N, Eukaryotic DNA topoisomerase I, DNA
binding fragment. Topoisomerase I promotes the
relaxation of DNA superhelical tension by introducing a
transient single-stranded break in duplex DNA and are
vital for the processes of replication, transcription,
and recombination. This family may be more than one
structural domain.
Length = 215
Score = 29.9 bits (68), Expect = 8.3
Identities = 13/34 (38%), Positives = 21/34 (61%), Gaps = 3/34 (8%)
Query: 503 PIKLLVEKERE--KELSEDEEEEKKEE-EKEEDK 533
PI E E+E K +S++E++ KEE +K E+
Sbjct: 90 PIYEYFEAEKEKKKAMSKEEKKAIKEEKDKLEEP 123
>gnl|CDD|184064 PRK13461, PRK13461, F0F1 ATP synthase subunit B; Provisional.
Length = 159
Score = 29.2 bits (66), Expect = 8.4
Identities = 11/48 (22%), Positives = 21/48 (43%)
Query: 487 KKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEEDKT 534
KIK ++ I I+ E +++ + + E + + KEE K
Sbjct: 28 DKIKAVIDSRQSEIDNKIEKADEDQKKARELKLKNERELKNAKEEGKK 75
>gnl|CDD|177121 MTH00046, CYTB, cytochrome b; Validated.
Length = 355
Score = 30.3 bits (69), Expect = 8.4
Identities = 23/76 (30%), Positives = 33/76 (43%), Gaps = 19/76 (25%)
Query: 1050 IVDLVGLTLGFFYLWYCGGFLLQVTRLSQVYRGVNDIDYYIGGLLESVIRGTLSSPSFRC 1109
++DL Y W C GF+L V + QV GV L S++ S SF C
Sbjct: 9 LIDL-PTNYSLNYYW-CSGFVLSVFMVIQVLTGV----------LLSLLYVADSLCSFFC 56
Query: 1110 VFA---EAFY----RY 1118
V + ++F+ RY
Sbjct: 57 VMSLSNDSFFTWCVRY 72
>gnl|CDD|223294 COG0216, PrfA, Protein chain release factor A [Translation,
ribosomal structure and biogenesis].
Length = 363
Score = 30.3 bits (69), Expect = 8.8
Identities = 21/73 (28%), Positives = 37/73 (50%), Gaps = 12/73 (16%)
Query: 479 DQAEYI----EEKKIKEIVKKHSQFIGY-----PIKLLVEKEREKELSEDEEEEKKEEEK 529
D EY E +++ IV+K+ ++ K ++ +E++ E+ E EEE KE E
Sbjct: 31 DPDEYRKLSKEYAELEPIVEKYREYKKAQEDLEDAKEMLAEEKDPEMREMAEEEIKELEA 90
Query: 530 EEDKTPKLEDEAQ 542
K +LE+E +
Sbjct: 91 ---KIEELEEELK 100
>gnl|CDD|218806 pfam05911, DUF869, Plant protein of unknown function (DUF869).
This family consists of a number of sequences found in
Arabidopsis thaliana, Oryza sativa and Lycopersicon
esculentum (Tomato). The function of this family is
unknown.
Length = 767
Score = 30.3 bits (68), Expect = 8.9
Identities = 14/69 (20%), Positives = 29/69 (42%), Gaps = 1/69 (1%)
Query: 485 EEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEEDKTPKLEDEAQLD 544
E + K++ +H L E E+E++ E K E +E+ + + E+ Q
Sbjct: 662 LELRAKDLEAEHKSLQEKISSLEDELEKERQNH-QELIAKCRELEEKIERAEQEENMQKL 720
Query: 545 AIKKVFLSQ 553
++ + Q
Sbjct: 721 DEEEQKIKQ 729
>gnl|CDD|232861 TIGR00181, pepF, oligoendopeptidase F. This family represents the
oligoendopeptidase F clade of the family of larger M3 or
thimet (for thiol-dependent metallopeptidase)
oligopeptidase family. Lactococcus lactis PepF hydrolyzed
peptides of 7 and 17 amino acids with fairly broad
specificity. The homolog of lactococcal PepF in group B
Streptococcus was named PepB (PMID:8757883), with the
name difference reflecting a difference in species of
origin rather activity; substrate profiles were quite
similar. Differences in substrate specificity should be
expected in other species. The gene is duplicated in
Lactococcus lactis on the plasmid that bears it. A
shortened second copy is found in Bacillus subtilis
[Protein fate, Degradation of proteins, peptides, and
glycopeptides].
Length = 591
Score = 30.3 bits (69), Expect = 9.0
Identities = 14/41 (34%), Positives = 23/41 (56%), Gaps = 1/41 (2%)
Query: 1345 IEEKKIKEIVKKHSQFIGYPIKL-LVEKEREKELSEDEEEV 1384
IEEK IKE +K + Y L + +++ LSE+ E++
Sbjct: 112 IEEKIIKEWLKDPEELADYKRALEEIFRDKPHILSEEVEKL 152
>gnl|CDD|218223 pfam04712, Radial_spoke, Radial spokehead-like protein. This
family includes the radial spoke head proteins RSP4 and
RSP6 from Chlamydomonas reinhardtii, and several
eukaryotic homologues, including mammalian RSHL1, the
protein product of a familial ciliary dyskinesia
candidate gene.
Length = 481
Score = 30.0 bits (68), Expect = 9.6
Identities = 12/34 (35%), Positives = 23/34 (67%)
Query: 510 KEREKELSEDEEEEKKEEEKEEDKTPKLEDEAQL 543
+K+ E++E+E++EEE+EE + P+ E+ L
Sbjct: 343 NPEQKDEEEEQEDEEEEEEEEEPEEPEPEEGPPL 376
>gnl|CDD|218336 pfam04935, SURF6, Surfeit locus protein 6. The surfeit locus
protein SURF-6 is shown to be a component of the
nucleolar matrix and has a strong binding capacity for
nucleic acids.
Length = 206
Score = 29.6 bits (67), Expect = 9.7
Identities = 19/63 (30%), Positives = 35/63 (55%), Gaps = 10/63 (15%)
Query: 476 MKEDQAEYIEEKKIKEIVKKHSQFIGYPIK----LLVE----KEREKELSEDEEEEKKEE 527
+ ED+A IEEK+ ++ G +K LL + KE++K+ S+ E +E+KE+
Sbjct: 111 LDEDKAAEIEEKEKWTKALAKAE--GVKVKDDEKLLKKALKRKEKQKKKSKKEWKERKEK 168
Query: 528 EKE 530
++
Sbjct: 169 VEK 171
>gnl|CDD|237496 PRK13766, PRK13766, Hef nuclease; Provisional.
Length = 773
Score = 30.2 bits (69), Expect = 9.9
Identities = 19/51 (37%), Positives = 24/51 (47%), Gaps = 11/51 (21%)
Query: 319 NDFMDS-------EQVTRLSQVY-RGVNDI---DYYIGGLLESVIRGTLSS 358
DF+DS EQV L + Y R V I Y I + + IRG L+S
Sbjct: 611 EDFVDSIIDRRLFEQVKDLKRAYERPVLIIEGDLYTIRNIHPNAIRGALAS 661
>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
Length = 651
Score = 30.0 bits (68), Expect = 9.9
Identities = 17/53 (32%), Positives = 29/53 (54%), Gaps = 4/53 (7%)
Query: 503 PIKLLVEKEREKELSEDEEE--EKKEEEKEEDKTPKLEDEAQLDAIKKVFLSQ 553
+L EKE ++ L E + KK+EEK++ + KLE +A++ + F Q
Sbjct: 550 KEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLE-KAKIPP-AEFFKRQ 600
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.318 0.135 0.403
Gapped
Lambda K H
0.267 0.0685 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 71,564,958
Number of extensions: 7210772
Number of successful extensions: 11647
Number of sequences better than 10.0: 1
Number of HSP's gapped: 10705
Number of HSP's successfully gapped: 325
Length of query: 1386
Length of database: 10,937,602
Length adjustment: 109
Effective length of query: 1277
Effective length of database: 6,103,016
Effective search space: 7793551432
Effective search space used: 7793551432
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 65 (28.9 bits)