RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy15682
(224 letters)
>gnl|CDD|240341 PTZ00272, PTZ00272, heat shock protein 83 kDa (Hsp83); Provisional.
Length = 701
Score = 238 bits (609), Expect = 9e-75
Identities = 126/221 (57%), Positives = 166/221 (75%), Gaps = 18/221 (8%)
Query: 1 ALDKIRYESLTDPSRLESEKNLHIKIIPNKEEKTLTIIDSGFIPNKEEKTLTIIDSGIGM 60
A DKIRY+SLTDPS L L I+++P+KE KTLT+ D+G IGM
Sbjct: 40 ACDKIRYQSLTDPSVLGESPRLCIRVVPDKENKTLTVEDNG----------------IGM 83
Query: 61 TKADLVNNLGTTAKSGTKASMEALQAGADISMIGQFGVGFYSAYMVADKVTVTSKHNDNE 120
TKADLVNNLGT A+SGTKA MEAL+AG D+SMIGQFGVGFYSAY+VAD+VTVTSK+N +E
Sbjct: 84 TKADLVNNLGTIARSGTKAFMEALEAGGDMSMIGQFGVGFYSAYLVADRVTVTSKNNSDE 143
Query: 121 QYIWESSAGGSFTIKPDNSQPLGRGTKIVLHMKEDQAEYIEEKKIKEIVKKHSQFIGYPI 180
Y+WESSAGG+FTI + RGT+I LH+KEDQ EY+E +++KE++KKHS+FIGY I
Sbjct: 144 SYVWESSAGGTFTITSTPESDMKRGTRITLHLKEDQMEYLEPRRLKELIKKHSEFIGYDI 203
Query: 181 KLLVEKEREKELSEDEEEE--KKEEEKEEDKTPKLEDEGED 219
+L+VEK EKE+++++EE+ K +E+ EE K ++++ E
Sbjct: 204 ELMVEKTTEKEVTDEDEEDTKKADEDGEEPKVEEVKEGDEG 244
>gnl|CDD|235366 PRK05218, PRK05218, heat shock protein 90; Provisional.
Length = 613
Score = 217 bits (555), Expect = 3e-67
Identities = 88/192 (45%), Positives = 123/192 (64%), Gaps = 22/192 (11%)
Query: 1 ALDKIRYESLTDPSRLESEKNLHIKIIPNKEEKTLTIIDSGFIPNKEEKTLTIIDSGIGM 60
A+DK+R+E+LTDP+ E + +L I+I +KE +TLTI D +GIGM
Sbjct: 41 AIDKLRFEALTDPALYEGDGDLKIRISFDKEARTLTISD----------------NGIGM 84
Query: 61 TKADLVNNLGTTAKSGTKASMEALQ--AGADISMIGQFGVGFYSAYMVADKVTVTSKH-- 116
T+ +++ NLGT AKSGTK +E L+ D +IGQFGVGFYSA+MVADKVTV ++
Sbjct: 85 TREEVIENLGTIAKSGTKEFLEKLKGDQKKDSQLIGQFGVGFYSAFMVADKVTVITRSAG 144
Query: 117 NDNEQYIWESSAGGSFTIKPDNSQPLGRGTKIVLHMKEDQAEYIEEKKIKEIVKKHSQFI 176
E WES G +TI+ + RGT+I LH+KED+ E+++E +I+ I+KK+S FI
Sbjct: 145 PAAEAVRWESDGEGEYTIEEIEKEE--RGTEITLHLKEDEDEFLDEWRIRSIIKKYSDFI 202
Query: 177 GYPIKLLVEKER 188
PIKL E+E
Sbjct: 203 PVPIKLEKEEEE 214
>gnl|CDD|223403 COG0326, HtpG, Molecular chaperone, HSP90 family [Posttranslational
modification, protein turnover, chaperones].
Length = 623
Score = 213 bits (544), Expect = 1e-65
Identities = 95/218 (43%), Positives = 138/218 (63%), Gaps = 27/218 (12%)
Query: 1 ALDKIRYESLTDPSRLESEKNLHIKIIPNKEEKTLTIIDSGFIPNKEEKTLTIIDSGIGM 60
A+DK+R+E+L+DP E + +L I+I +K+ KTLTI D+G IGM
Sbjct: 42 AIDKLRFEALSDPELGEGDSDLRIRISFDKDNKTLTISDNG----------------IGM 85
Query: 61 TKADLVNNLGTTAKSGTKASMEAL-QAGADISMIGQFGVGFYSAYMVADKVTVTSKHNDN 119
TK +++ NLGT AKSGTK +E+L + D +IGQFGVGFYSA+MVADKVTV ++
Sbjct: 86 TKDEVIENLGTIAKSGTKEFLESLSEDQKDSDLIGQFGVGFYSAFMVADKVTVITRSAGE 145
Query: 120 -EQYIWESSAGGSFTIKPDNSQPLGRGTKIVLHMKEDQAEYIEEKKIKEIVKKHSQFIGY 178
E Y WES G +T++ + +P RGT+I LH+KE++ E++EE +++EIVKK+S I Y
Sbjct: 146 DEAYHWESDGEGEYTVEDIDKEPR-RGTEITLHLKEEEDEFLEEWRLREIVKKYSDHIAY 204
Query: 179 PIKLLVEKEREKELSEDEE--------EEKKEEEKEED 208
PI + EKE+++E+ E E K E +E+
Sbjct: 205 PIYIEGEKEKDEEVIEWETINKAKALWTRNKSEITDEE 242
>gnl|CDD|185466 PTZ00130, PTZ00130, heat shock protein 90; Provisional.
Length = 814
Score = 163 bits (414), Expect = 2e-46
Identities = 101/201 (50%), Positives = 134/201 (66%), Gaps = 18/201 (8%)
Query: 1 ALDKIRYESLTDPSRLESEKNLHIKIIPNKEEKTLTIIDSGFIPNKEEKTLTIIDSGIGM 60
AL+KIR+ SL+D S L EK L I+I NKE+ L+I D+G IGM
Sbjct: 103 ALEKIRFLSLSDESVLGEEKKLEIRISANKEKNILSITDTG----------------IGM 146
Query: 61 TKADLVNNLGTTAKSGTKASMEAL-QAGADISMIGQFGVGFYSAYMVADKVTVTSKHNDN 119
TK DL+NNLGT AKSGT +EA+ ++G D+S+IGQFGVGFYSA++VADKV V +K+N++
Sbjct: 147 TKEDLINNLGTIAKSGTSNFLEAISKSGGDMSLIGQFGVGFYSAFLVADKVIVYTKNNND 206
Query: 120 EQYIWESSAGGSFTIKPD-NSQPLGRGTKIVLHMKEDQAEYIEEKKIKEIVKKHSQFIGY 178
EQYIWES+A FTI D L RGT+I LH+KED + +KK+ +++ K+SQFI Y
Sbjct: 207 EQYIWESTADAKFTIYKDPRGSTLKRGTRISLHLKEDATNLMNDKKLVDLISKYSQFIQY 266
Query: 179 PIKLLVEKEREKELSEDEEEE 199
PI LL E +E+ D +E
Sbjct: 267 PIYLLHENVYTEEVLADIAKE 287
>gnl|CDD|237603 PRK14083, PRK14083, HSP90 family protein; Provisional.
Length = 601
Score = 88.5 bits (220), Expect = 3e-20
Identities = 39/148 (26%), Positives = 78/148 (52%), Gaps = 4/148 (2%)
Query: 42 FIPNKEEKTLTIIDSGIGMTKADLVNNLGTTAKSGTKASMEALQAGADISMIGQFGVGFY 101
+ + TL + D+GIG+T+ ++ L T +S + E L + +GQFG+G
Sbjct: 56 ELTDAGGGTLIVEDNGIGLTEEEVHEFLATIGRSSKRD--ENLGFARN-DFLGQFGIGLL 112
Query: 102 SAYMVADKVTVTSKHNDNEQYI-WESSAGGSFTIKPDNSQPLGRGTKIVLHMKEDQAEYI 160
S ++VAD++ V S+ + + W A G+++++ ++ GT + L + D E++
Sbjct: 113 SCFLVADEIVVVSRSAKDGPAVEWRGKADGTYSVRKLETERAEPGTTVYLRPRPDAEEWL 172
Query: 161 EEKKIKEIVKKHSQFIGYPIKLLVEKER 188
E + ++E+ KK+ + PI++ EK
Sbjct: 173 ERETVEELAKKYGSLLPVPIRVEGEKGG 200
>gnl|CDD|215774 pfam00183, HSP90, Hsp90 protein.
Length = 529
Score = 82.9 bits (205), Expect = 2e-18
Identities = 39/66 (59%), Positives = 52/66 (78%), Gaps = 2/66 (3%)
Query: 158 EYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEE--EKEEDKTPKLED 215
EY+EE++IKE+VKKHS+FI YPI L VEKE EKE+ ++EEEE+KEE E+EE T K E+
Sbjct: 1 EYLEERRIKELVKKHSEFINYPIYLWVEKEVEKEVPDEEEEEEKEEKKEEEEKTTDKEEE 60
Query: 216 EGEDAD 221
E+ +
Sbjct: 61 VDEEEE 66
>gnl|CDD|217081 pfam02518, HATPase_c, Histidine kinase-, DNA gyrase B-, and
HSP90-like ATPase. This family represents the
structurally related ATPase domains of histidine kinase,
DNA gyrase B and HSP90.
Length = 111
Score = 44.6 bits (106), Expect = 2e-06
Identities = 16/95 (16%), Positives = 28/95 (29%), Gaps = 19/95 (20%)
Query: 44 PNKEEKTLTIIDSGIGMTKADLVNNLGTTAKSGTKASMEALQAGADISMIGQFGVGFYSA 103
+ +T+ D+GIG+ DL K + + +G G+G
Sbjct: 33 RDGGRLRITVEDNGIGIPPEDL-----------PKIFEPFFRTDSSSRKVGGTGLGLSIV 81
Query: 104 YMVADKVTVTSKHNDNEQYIWESSAGGS-FTIKPD 137
+ + T GG+ FT
Sbjct: 82 RKLVELHGGTITVESEPG-------GGTTFTFTLP 109
>gnl|CDD|222246 pfam13589, HATPase_c_3, Histidine kinase-, DNA gyrase B-, and
HSP90-like ATPase. This family represents,
additionally, the structurally related ATPase domains of
histidine kinase, DNA gyrase B and HSP90.
Length = 134
Score = 45.0 bits (107), Expect = 3e-06
Identities = 23/82 (28%), Positives = 41/82 (50%), Gaps = 13/82 (15%)
Query: 42 FIPNKEEKTLTIIDSGIGMTKADLVNNL--GTTAKSGTKASMEALQAGADISMIGQFGVG 99
P++ E ++I D+G GM+ +L N L G ++K G + D + +G+ G+G
Sbjct: 26 IDPDRGEDGISIEDNGGGMSYEELRNALKLGRSSKEGER----------DSTTLGRKGIG 75
Query: 100 -FYSAYMVADKVTVTSKHNDNE 120
++ + K+TVTSK
Sbjct: 76 MKLASLSLGRKLTVTSKKEGES 97
>gnl|CDD|214643 smart00387, HATPase_c, Histidine kinase-like ATPases. Histidine
kinase-, DNA gyrase B-, phytochrome-like ATPases.
Length = 111
Score = 38.8 bits (91), Expect = 3e-04
Identities = 21/114 (18%), Positives = 33/114 (28%), Gaps = 22/114 (19%)
Query: 24 IKIIPNKEEKTLTIIDSGFIPNKEEKTLTIIDSGIGMTKADLVNNLGTTAKSGTKASMEA 83
IK P T+T+ G + +T+ D+G G+ DL K
Sbjct: 18 IKYTPEGGRITVTLERDG-----DHVEITVEDNGPGIPPEDL-----------EKIFEPF 61
Query: 84 LQAGADISMIGQFGVGFYSAYMVADKVTVTSKHNDNEQYIWESSAGGSFTIKPD 137
+ IG G+G + + G +FTI
Sbjct: 62 FRTDKRSRKIGGTGLGLSIVKKLVELHGGEISVESEPG------GGTTFTITLP 109
>gnl|CDD|238030 cd00075, HATPase_c, Histidine kinase-like ATPases; This family
includes several ATP-binding proteins for example:
histidine kinase, DNA gyrase B, topoisomerases, heat
shock protein HSP90, phytochrome-like ATPases and DNA
mismatch repair proteins.
Length = 103
Score = 38.4 bits (90), Expect = 3e-04
Identities = 12/78 (15%), Positives = 26/78 (33%), Gaps = 16/78 (20%)
Query: 44 PNKEEKTLTIIDSGIGMTKADLVNNLGTTAKSGTKASMEALQAGADISMIGQFGVGFYSA 103
+ + + + D+G G+ + DL E G+ G G+G
Sbjct: 29 RDGDHLEIRVEDNGPGIPEEDLERIF------------ERFSDGSRSRKGGGTGLGLSIV 76
Query: 104 YMVAD----KVTVTSKHN 117
+ + ++ V S+
Sbjct: 77 KKLVELHGGRIEVESEPG 94
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 34.2 bits (78), Expect = 0.050
Identities = 15/27 (55%), Positives = 23/27 (85%)
Query: 185 EKEREKELSEDEEEEKKEEEKEEDKTP 211
E+E E+E E+EEEE++EEE+EE++ P
Sbjct: 867 EEEEEEEEEEEEEEEEEEEEEEENEEP 893
Score = 33.8 bits (77), Expect = 0.071
Identities = 15/31 (48%), Positives = 25/31 (80%)
Query: 193 SEDEEEEKKEEEKEEDKTPKLEDEGEDADCP 223
SE+EEEE++EEE+EE++ + E+E E+ + P
Sbjct: 863 SEEEEEEEEEEEEEEEEEEEEEEEEEENEEP 893
Score = 33.4 bits (76), Expect = 0.092
Identities = 14/25 (56%), Positives = 22/25 (88%)
Query: 185 EKEREKELSEDEEEEKKEEEKEEDK 209
E+E E+E E+EEEE++EEE+EE++
Sbjct: 865 EEEEEEEEEEEEEEEEEEEEEEEEE 889
Score = 33.0 bits (75), Expect = 0.13
Identities = 16/32 (50%), Positives = 23/32 (71%)
Query: 185 EKEREKELSEDEEEEKKEEEKEEDKTPKLEDE 216
E+E E+E E+EEEE++EEE+EE+ L E
Sbjct: 866 EEEEEEEEEEEEEEEEEEEEEEEENEEPLSLE 897
Score = 32.3 bits (73), Expect = 0.20
Identities = 14/31 (45%), Positives = 24/31 (77%)
Query: 189 EKELSEDEEEEKKEEEKEEDKTPKLEDEGED 219
+ E E+EEEE++EEE+EE++ + E+E E+
Sbjct: 862 DSEEEEEEEEEEEEEEEEEEEEEEEEEENEE 892
Score = 31.9 bits (72), Expect = 0.28
Identities = 14/30 (46%), Positives = 24/30 (80%)
Query: 187 EREKELSEDEEEEKKEEEKEEDKTPKLEDE 216
+ E+E E+EEEE++EEE+EE++ + E+E
Sbjct: 862 DSEEEEEEEEEEEEEEEEEEEEEEEEEENE 891
Score = 31.9 bits (72), Expect = 0.33
Identities = 14/25 (56%), Positives = 22/25 (88%)
Query: 185 EKEREKELSEDEEEEKKEEEKEEDK 209
E+E E+E E+EEEE++EEE+EE++
Sbjct: 864 EEEEEEEEEEEEEEEEEEEEEEEEE 888
Score = 30.0 bits (67), Expect = 1.1
Identities = 14/35 (40%), Positives = 24/35 (68%)
Query: 189 EKELSEDEEEEKKEEEKEEDKTPKLEDEGEDADCP 223
E+E E+EEEE++EEE+EE++ + +E + P
Sbjct: 865 EEEEEEEEEEEEEEEEEEEEEEEEENEEPLSLEWP 899
Score = 29.2 bits (65), Expect = 1.9
Identities = 13/38 (34%), Positives = 25/38 (65%)
Query: 184 VEKEREKELSEDEEEEKKEEEKEEDKTPKLEDEGEDAD 221
V+ + + EEEE++EEE+EE++ + E+E E+ +
Sbjct: 852 VDGGGGSDGGDSEEEEEEEEEEEEEEEEEEEEEEEEEE 889
>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
Provisional.
Length = 482
Score = 33.7 bits (78), Expect = 0.057
Identities = 21/76 (27%), Positives = 39/76 (51%), Gaps = 12/76 (15%)
Query: 146 TKIVLHMKEDQAEYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEK 205
TK + + E EKK +E K+ K ++++E E+E+E+K+EE++
Sbjct: 409 TKKIKKIVEK-----AEKKREEEKKEKK-------KKAFAGKKKEEEEEEEKEKKEEEKE 456
Query: 206 EEDKTPKLEDEGEDAD 221
EE++ + E E E+
Sbjct: 457 EEEEEAEEEKEEEEEK 472
Score = 32.2 bits (74), Expect = 0.18
Identities = 23/74 (31%), Positives = 37/74 (50%), Gaps = 11/74 (14%)
Query: 150 LHMKEDQAEYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDE----EEEKKEEEK 205
L + E++ E++ K K I EK+RE+E E + +KKEEE+
Sbjct: 392 LELTEEEIEFLTGSKKATKKIKK-------IVEKAEKKREEEKKEKKKKAFAGKKKEEEE 444
Query: 206 EEDKTPKLEDEGED 219
EE+K K E++ E+
Sbjct: 445 EEEKEKKEEEKEEE 458
>gnl|CDD|233035 TIGR00585, mutl, DNA mismatch repair protein MutL. All proteins in
this family for which the functions are known are
involved in the process of generalized mismatch repair.
This family is based on the phylogenomic analysis of JA
Eisen (1999, Ph.D. Thesis, Stanford University) [DNA
metabolism, DNA replication, recombination, and repair].
Length = 312
Score = 33.0 bits (76), Expect = 0.11
Identities = 26/137 (18%), Positives = 54/137 (39%), Gaps = 35/137 (25%)
Query: 53 IIDSGIGMTKADLVNNL--GTTAKSGTKASMEALQAGADISMIGQFGVGF-----YSAYM 105
+ D+G G+ K DL T+K +Q+ D+ I G F S
Sbjct: 56 VSDNGSGIDKEDLPLACERHATSK---------IQSFEDLERIETLG--FRGEALASISS 104
Query: 106 VADKVTVTSKHNDNEQYIWESSAGGSFTIKPDNSQPLGRGTKIVLH------------MK 153
V+ ++T+T+K + + +++ G + P GT + + +K
Sbjct: 105 VS-RLTITTKTSAADGLAYQALLEGGMIESIKPA-PRPVGTTVEVRDLFYNLPVRRKFLK 162
Query: 154 EDQAEYIEEKKIKEIVK 170
+ E+ +KI ++++
Sbjct: 163 SPKKEF---RKILDVLQ 176
>gnl|CDD|212510 cd11724, TP_methylase_like, Uncharacterized subfamily of S-AdoMet
dependent tetrapyrrole methylases. TP-methylases use
S-AdoMet (S-adenosyl-L-methionine or SAM) in the
methylation of diverse substrates. Most members catalyze
various methylation steps in cobalamin (vitamin B12)
biosynthesis, other members like Diphthine synthase and
Ribosomal RNA small subunit methyltransferase I (RsmI)
act on other substrates. The function of this subfamily
is not known.
Length = 255
Score = 32.3 bits (74), Expect = 0.15
Identities = 22/68 (32%), Positives = 30/68 (44%), Gaps = 5/68 (7%)
Query: 148 IVLHMKEDQ---AEYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEE 204
I+ KE A+Y++ K + E K F Y K E EK E EEEKK +
Sbjct: 29 IIFCPKEIVKRFADYLKGKPVLEDPWKL--FWHYGGKPWENLEAEKSPREAFEEEKKRQR 86
Query: 205 KEEDKTPK 212
+E K +
Sbjct: 87 EEFVKVIR 94
>gnl|CDD|221175 pfam11705, RNA_pol_3_Rpc31, DNA-directed RNA polymerase III subunit
Rpc31. RNA polymerase III contains seventeen subunits
in yeasts and in human cells. Twelve of these are akin
to RNA polymerase I or II and the other five are RNA pol
III-specific, and form the functionally distinct groups
(i) Rpc31-Rpc34-Rpc82, and (ii) Rpc37-Rpc53. Rpc31,
Rpc34 and Rpc82 form a cluster of enzyme-specific
subunits that contribute to transcription initiation in
S.cerevisiae and H.sapiens. There is evidence that these
subunits are anchored at or near the N-terminal Zn-fold
of Rpc1, itself prolonged by a highly conserved but RNA
polymerase III-specific domain.
Length = 221
Score = 31.3 bits (71), Expect = 0.31
Identities = 15/42 (35%), Positives = 29/42 (69%)
Query: 180 IKLLVEKEREKELSEDEEEEKKEEEKEEDKTPKLEDEGEDAD 221
+ +L +K +E E + +EE++K+EE+EE++ + ED +D D
Sbjct: 153 LSMLEKKLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDDDD 194
Score = 28.6 bits (64), Expect = 2.1
Identities = 14/42 (33%), Positives = 28/42 (66%)
Query: 180 IKLLVEKEREKELSEDEEEEKKEEEKEEDKTPKLEDEGEDAD 221
KL + +++ KEL ++ +E+ E+++EE++ + EDE D D
Sbjct: 151 EKLSMLEKKLKELEAEDVDEEDEKDEEEEEEEEEEDEDFDDD 192
>gnl|CDD|234767 PRK00448, polC, DNA polymerase III PolC; Validated.
Length = 1437
Score = 31.3 bits (72), Expect = 0.43
Identities = 13/71 (18%), Positives = 40/71 (56%), Gaps = 2/71 (2%)
Query: 138 NSQPLGRGTKIVLHMKEDQA-EYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDE 196
+ G K+++ + + +++++K + +++K++ +F G+ I + + + + ++
Sbjct: 116 KQKVEVEGNKLIIKVNNEIERDHLKKKHLPKLIKQYEKF-GFGILKIDFEIDDSKEELEK 174
Query: 197 EEEKKEEEKEE 207
E +KEEE E+
Sbjct: 175 FEAQKEEEDEK 185
>gnl|CDD|219408 pfam07423, DUF1510, Protein of unknown function (DUF1510). This
family consists of several hypothetical bacterial
proteins of around 200 residues in length. The function
of this family is unknown.
Length = 214
Score = 30.5 bits (69), Expect = 0.49
Identities = 14/36 (38%), Positives = 24/36 (66%)
Query: 186 KEREKELSEDEEEEKKEEEKEEDKTPKLEDEGEDAD 221
KE EKE + E++E K + ++ED+ + E+E ED +
Sbjct: 66 KEEEKEAANSEDKEDKGDAEKEDEESEEENEEEDEE 101
>gnl|CDD|218223 pfam04712, Radial_spoke, Radial spokehead-like protein. This
family includes the radial spoke head proteins RSP4 and
RSP6 from Chlamydomonas reinhardtii, and several
eukaryotic homologues, including mammalian RSHL1, the
protein product of a familial ciliary dyskinesia
candidate gene.
Length = 481
Score = 30.8 bits (70), Expect = 0.49
Identities = 18/45 (40%), Positives = 26/45 (57%), Gaps = 5/45 (11%)
Query: 185 EKEREKELSEDEEEEKKEEEKEEDKTPKL-----EDEGEDADCPA 224
++E E+E E+EEEE++ EE E ++ P L ED D PA
Sbjct: 348 DEEEEQEDEEEEEEEEEPEEPEPEEGPPLLTPISEDAPLPNDDPA 392
Score = 27.7 bits (62), Expect = 6.2
Identities = 11/29 (37%), Positives = 21/29 (72%)
Query: 186 KEREKELSEDEEEEKKEEEKEEDKTPKLE 214
+K+ E++E+E++EEE+EE + P+ E
Sbjct: 343 NPEQKDEEEEQEDEEEEEEEEEPEEPEPE 371
>gnl|CDD|235640 PRK05901, PRK05901, RNA polymerase sigma factor; Provisional.
Length = 509
Score = 30.7 bits (70), Expect = 0.54
Identities = 12/38 (31%), Positives = 21/38 (55%)
Query: 184 VEKEREKELSEDEEEEKKEEEKEEDKTPKLEDEGEDAD 221
+ E + + D+E+E+K+E KE +K +D D D
Sbjct: 155 DDDEDDDDDDVDDEDEEKKEAKELEKLSDDDDFVWDED 192
>gnl|CDD|217049 pfam02459, Adeno_terminal, Adenoviral DNA terminal protein. This
protein is covalently attached to the terminii of
replicating DNA in vivo.
Length = 548
Score = 30.8 bits (70), Expect = 0.55
Identities = 13/32 (40%), Positives = 21/32 (65%)
Query: 185 EKEREKELSEDEEEEKKEEEKEEDKTPKLEDE 216
E E+E E+EE ++EEE+EE++ E+E
Sbjct: 302 PPEPEEEEEEEEEVPEEEEEEEEEEERTFEEE 333
>gnl|CDD|224418 COG1501, COG1501, Alpha-glucosidases, family 31 of glycosyl
hydrolases [Carbohydrate transport and metabolism].
Length = 772
Score = 30.4 bits (69), Expect = 0.79
Identities = 12/60 (20%), Positives = 26/60 (43%), Gaps = 8/60 (13%)
Query: 122 YIWESSAGGSFTIKPDNSQPLGRGTKIVLHMKEDQAEY--IEEKKIKEIVKKHSQFIGYP 179
++SA GSF + + + ++ Q +Y I K++++K++ G P
Sbjct: 205 LFVDNSAYGSFDVG------SEEYSYVQFSVEGGQLDYYVIAGPTPKDVLEKYTDLTGKP 258
>gnl|CDD|232942 TIGR00366, TIGR00366, TIGR00366 family protein. [Hypothetical
proteins, Conserved].
Length = 438
Score = 30.2 bits (68), Expect = 0.98
Identities = 13/32 (40%), Positives = 22/32 (68%), Gaps = 2/32 (6%)
Query: 186 KEREKELSEDEEEEKKEEEKEEDKTP--KLED 215
K + +L + E+EE+K+ +K+ED T KLE+
Sbjct: 216 KSIDPKLLKKEKEEEKKLKKKEDATIAEKLEN 247
>gnl|CDD|220759 pfam10446, DUF2457, Protein of unknown function (DUF2457). This is
a family of uncharacterized proteins.
Length = 449
Score = 30.0 bits (67), Expect = 1.0
Identities = 14/41 (34%), Positives = 24/41 (58%)
Query: 181 KLLVEKEREKELSEDEEEEKKEEEKEEDKTPKLEDEGEDAD 221
KL E E E ED++EE +++ +ED+ +D+ ED +
Sbjct: 40 KLGKEAEEEAMEEEDDDEEDDDDDDDEDEDDDDDDDDEDDE 80
>gnl|CDD|165442 PHA03171, PHA03171, UL37 tegument protein; Provisional.
Length = 499
Score = 30.0 bits (67), Expect = 1.0
Identities = 13/37 (35%), Positives = 22/37 (59%), Gaps = 3/37 (8%)
Query: 184 VEKEREKELSEDEEEEKKEEEKEEDKTPK---LEDEG 217
E++ ++E + E EE+ EEE+ E P+ L+ EG
Sbjct: 87 EEEDNDRECPDTEAEEEDEEEEIEAPDPEVNPLDAEG 123
>gnl|CDD|233256 TIGR01063, gyrA, DNA gyrase, A subunit. This model describes the
common type II DNA topoisomerase (DNA gyrase). Two
apparently independently arising families, one in the
Proteobacteria and one in Gram-positive lineages, are
both designated toposisomerase IV [DNA metabolism, DNA
replication, recombination, and repair].
Length = 800
Score = 30.0 bits (68), Expect = 1.1
Identities = 22/75 (29%), Positives = 40/75 (53%)
Query: 147 KIVLHMKEDQAEYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKE 206
+ +L M+ + +E +K++E K+ + I +L +ER E+ +E EE KE+ +
Sbjct: 414 QAILDMRLQRLTGLEREKLQEEYKELLELIADLEDILASEERVLEIIREELEEIKEQFGD 473
Query: 207 EDKTPKLEDEGEDAD 221
+T + DE ED D
Sbjct: 474 PRRTEIVYDESEDID 488
>gnl|CDD|235949 PRK07159, PRK07159, F0F1 ATP synthase subunit C; Validated.
Length = 100
Score = 28.3 bits (63), Expect = 1.1
Identities = 14/67 (20%), Positives = 29/67 (43%), Gaps = 2/67 (2%)
Query: 61 TKADLVNNLGTTAKSGTKASMEALQA--GADISMIGQFGVGFYSAYMVADKVTVTSKHND 118
++ N T ++ S A GA ++MIG GVG Y V +++ +
Sbjct: 7 FVGQILQNTQTVSQKVAADSSGLKAAYIGAGLAMIGVIGVGLGQGYAFGKAVEAIARNPE 66
Query: 119 NEQYIWE 125
++ +++
Sbjct: 67 AQKQVFK 73
>gnl|CDD|226894 COG4499, COG4499, Predicted membrane protein [Function unknown].
Length = 434
Score = 29.4 bits (66), Expect = 1.5
Identities = 15/60 (25%), Positives = 33/60 (55%), Gaps = 2/60 (3%)
Query: 162 EKKIKEIVKKHSQFIGYPIKLLVEKER--EKELSEDEEEEKKEEEKEEDKTPKLEDEGED 219
KK+++ KK + + + E++L ++E E+K++E+ +EDK + +DE +
Sbjct: 375 NKKLQDYTKKLGEVKDETDASEEAEAKAKEEKLKQEENEKKQKEQADEDKEKRQKDERKK 434
>gnl|CDD|234752 PRK00413, thrS, threonyl-tRNA synthetase; Reviewed.
Length = 638
Score = 29.3 bits (67), Expect = 1.9
Identities = 23/61 (37%), Positives = 33/61 (54%), Gaps = 9/61 (14%)
Query: 162 EKKIKEIVKKHSQFIGYPI-KLLVEKEREKELSEDEEEEKKEE---EKEEDKTPKLEDEG 217
EK++KEI+K+ YPI + +V +E EL +D E K E E ED+ L +G
Sbjct: 121 EKEMKEIIKE-----NYPIEREVVSREEAIELFKDRGEPYKVELIEEIPEDEEISLYRQG 175
Query: 218 E 218
E
Sbjct: 176 E 176
>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682). The
members of this family are all hypothetical eukaryotic
proteins of unknown function. One member is described as
being an adipocyte-specific protein, but no evidence of
this was found.
Length = 322
Score = 28.8 bits (65), Expect = 2.3
Identities = 17/58 (29%), Positives = 24/58 (41%), Gaps = 12/58 (20%)
Query: 152 MKEDQAEYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEEDK 209
+K + E EE + K+ KK E+ K EE++K EEKE K
Sbjct: 273 LKAAEEERQEEAQEKKEEKKK------------EEREAKLAKLSPEEQRKLEEKERKK 318
>gnl|CDD|221324 pfam11933, DUF3451, Domain of unknown function (DUF3451). This
presumed domain is functionally uncharacterized. This
domain is found in eukaryotes. This domain is typically
between 199 to 238 amino acids in length. This domain is
found associated with pfam06512, pfam00520. This domain
has a conserved ADD sequence motif.
Length = 222
Score = 28.6 bits (64), Expect = 2.4
Identities = 11/31 (35%), Positives = 17/31 (54%), Gaps = 2/31 (6%)
Query: 186 KEREKELSEDEEEEK--KEEEKEEDKTPKLE 214
KER ++++EE EE + +K PK E
Sbjct: 12 KERRNRNDKNKKEEHSIGSEEGDSEKEPKSE 42
>gnl|CDD|227391 COG5058, LAG1, Protein transporter of the TRAM (translocating
chain-associating membrane) superfamily, longevity
assurance factor [Intracellular trafficking and
secretion].
Length = 395
Score = 28.7 bits (64), Expect = 2.4
Identities = 13/35 (37%), Positives = 21/35 (60%), Gaps = 2/35 (5%)
Query: 176 IGYPIKLLVEKEREKELSEDEEEEKKEEEKEEDKT 210
+ Y +++ E E + E S+DE EE+ + E EDK
Sbjct: 362 VAY--RVIWEGELKDERSDDESEEESDLESSEDKN 394
>gnl|CDD|129130 TIGR00019, prfA, peptide chain release factor 1. This model
describes peptide chain release factor 1 (PrfA, RF-1),
and excludes the related peptide chain release factor 2
(PrfB, RF-2). RF-1 helps recognize and terminate
translation at UAA and UAG stop codons. The
mitochondrial release factors are prfA-like, although
not included above the trusted cutoff for this model.
RF-1 does not have a translational frameshift [Protein
synthesis, Translation factors].
Length = 360
Score = 28.5 bits (64), Expect = 2.6
Identities = 17/71 (23%), Positives = 35/71 (49%), Gaps = 13/71 (18%)
Query: 155 DQAEYI----EEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEE--- 207
DQ + E +++EIV + ++ + + + KE+ E+ + E +E KEE
Sbjct: 31 DQDKLRKLSKEYSQLEEIVDCYREYQ----QAQEDIKEAKEILEESDPEMREMAKEELEE 86
Query: 208 --DKTPKLEDE 216
+K +LE++
Sbjct: 87 LEEKIEELEEQ 97
>gnl|CDD|237538 PRK13876, PRK13876, conjugal transfer coupling protein TraG;
Provisional.
Length = 663
Score = 28.7 bits (65), Expect = 2.8
Identities = 10/36 (27%), Positives = 16/36 (44%), Gaps = 1/36 (2%)
Query: 185 EKEREKELSEDEEEEKKEEEKEEDKTPKLEDEGEDA 220
R+ EL E EE + E + L+D+ +D
Sbjct: 601 GIRRQPELPEHEEIVPEPRPPENEFDL-LDDDDDDD 635
>gnl|CDD|237192 PRK12758, PRK12758, DNA topoisomerase IV subunit A; Provisional.
Length = 869
Score = 28.8 bits (65), Expect = 2.9
Identities = 14/65 (21%), Positives = 26/65 (40%)
Query: 155 DQAEYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEEDKTPKLE 214
D E+I K IK + + + I L E+ E EE ++ + ++ ++
Sbjct: 800 DLEEFIAVKGIKAKGNRLTTYKVKTINELEPLPYEEPEEETAEEPEEVDPEDVKSEDDID 859
Query: 215 DEGED 219
DE
Sbjct: 860 DEDTG 864
>gnl|CDD|185429 PTZ00074, PTZ00074, 60S ribosomal protein L34; Provisional.
Length = 135
Score = 27.7 bits (62), Expect = 3.0
Identities = 12/54 (22%), Positives = 30/54 (55%), Gaps = 14/54 (25%)
Query: 160 IEEKKI-KEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEEDKTPK 212
+EE+KI K+++K+ ++ ++K+ + ++++KK+ K+ K K
Sbjct: 95 VEEQKIVKQVLKEKAK-------------QKKQKKKKKKKKKKKTSKKAAKKKK 135
>gnl|CDD|133143 cd05476, pepsin_A_like_plant, Chroloplast Nucleoids DNA-binding
Protease and Nucellin, pepsin-like aspartic proteases
from plants. This family contains pepsin like aspartic
proteases from plants including Chloroplast Nucleoids
DNA-binding Protease and Nucellin. Chloroplast Nucleoids
DNA-binding Protease catalyzes the degradation of
ribulose-1,5-bisphosphate carboxylase/oxygenase
(Rubisco) in senescent leaves of tobacco and Nucellins
are important regulators of nucellar cell's progressive
degradation after ovule fertilization. Structurally,
aspartic proteases are bilobal enzymes, each lobe
contributing a catalytic Asp residue, with an extended
active site cleft localized between the two lobes of the
molecule. The N- and C-terminal domains, although
structurally related by a 2-fold axis, have only limited
sequence homology except the vicinity of the active
site. This suggests that the enzymes evolved by an
ancient duplication event. The enzymes specifically
cleave bonds in peptides which have at least six
residues in length with hydrophobic residues in both the
P1 and P1' positions. The active site is located at the
groove formed by the two lobes, with an extended loop
projecting over the cleft to form an 11-residue flap,
which encloses substrates and inhibitors in the active
site. Specificity is determined by nearest-neighbor
hydrophobic residues surrounding the catalytic
aspartates, and by three residues in the flap. The
enzymes are mostly secreted from cells as inactive
proenzymes that activate autocatalytically at acidic pH.
Length = 265
Score = 28.4 bits (64), Expect = 3.0
Identities = 11/26 (42%), Positives = 13/26 (50%)
Query: 32 EKTLTIIDSGFIPNKEEKTLTIIDSG 57
K L I S F + + TIIDSG
Sbjct: 159 GKRLPIPPSVFAIDSDGSGGTIIDSG 184
>gnl|CDD|100110 cd05832, Ribosomal_L12p, Ribosomal protein L12p. This subfamily
includes archaeal L12p, the protein that is functionally
equivalent to L7/L12 in bacteria and the P1 and P2
proteins in eukaryotes. L12p is homologous to P1 and P2
but is not homologous to bacterial L7/L12. It is located
in the L12 stalk, with proteins L10, L11, and 23S rRNA.
L12p is the only protein in the ribosome to occur as
multimers, always appearing as sets of dimers. Recent
data indicate that most archaeal species contain six
copies of L12p (three homodimers), while eukaryotes have
four copies (two heterodimers), and bacteria may have
four or six copies (two or three homodimers), depending
on the species. The organization of proteins within the
stalk has been characterized primarily in bacteria,
where L7/L12 forms either two or three homodimers and
each homodimer binds to the extended C-terminal helix of
L10. L7/L12 is attached to the ribosome through L10 and
is the only ribosomal protein that does not directly
interact with rRNA. Archaeal L12p is believed to
function in a similar fashion. However, hybrid ribosomes
containing the large subunit from E. coli with an
archaeal stalk are able to bind archaeal and eukaryotic
elongation factors but not bacterial elongation factors.
In several mesophilic and thermophilic archaeal species,
the binding of 23S rRNA to protein L11 and to the
L10/L12p pentameric complex was found to be
temperature-dependent and cooperative.
Length = 106
Score = 27.5 bits (61), Expect = 3.0
Identities = 14/28 (50%), Positives = 19/28 (67%)
Query: 187 EREKELSEDEEEEKKEEEKEEDKTPKLE 214
E++ E EEE+KKEEEKEE++ L
Sbjct: 73 AAEEKAEEKEEEKKKEEEKEEEEEEALA 100
>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family. Atrophin-1 is the
protein product of the dentatorubral-pallidoluysian
atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
neurodegenerative disorder. It is caused by the
expansion of a CAG repeat in the DRPLA gene on
chromosome 12p. This results in an extended
polyglutamine region in atrophin-1, that is thought to
confer toxicity to the protein, possibly through
altering its interactions with other proteins. The
expansion of a CAG repeat is also the underlying defect
in six other neurodegenerative disorders, including
Huntington's disease. One interaction of expanded
polyglutamine repeats that is thought to be pathogenic
is that with the short glutamine repeat in the
transcriptional coactivator CREB binding protein, CBP.
This interaction draws CBP away from its usual nuclear
location to the expanded polyglutamine repeat protein
aggregates that are characteristic of the polyglutamine
neurodegenerative disorders. This interferes with
CBP-mediated transcription and causes cytotoxicity.
Length = 979
Score = 28.5 bits (63), Expect = 3.0
Identities = 14/38 (36%), Positives = 22/38 (57%)
Query: 181 KLLVEKEREKELSEDEEEEKKEEEKEEDKTPKLEDEGE 218
KL ++E E ++ E E+K EE+E +K + E E E
Sbjct: 576 KLAKKREEAVEKAKREAEQKAREEREREKEKEKERERE 613
Score = 28.1 bits (62), Expect = 4.5
Identities = 13/32 (40%), Positives = 20/32 (62%), Gaps = 3/32 (9%)
Query: 181 KLLVEKEREKELSEDEEEEKKEEEKEEDKTPK 212
K E+EREK E E+E ++E E+E ++ K
Sbjct: 595 KAREEREREK---EKEKEREREREREAERAAK 623
>gnl|CDD|227446 COG5116, RPN2, 26S proteasome regulatory complex component
[Posttranslational modification, protein turnover,
chaperones].
Length = 926
Score = 28.8 bits (64), Expect = 3.2
Identities = 11/41 (26%), Positives = 20/41 (48%), Gaps = 1/41 (2%)
Query: 185 EKEREKELSEDEEE-EKKEEEKEEDKTPKLEDEGEDADCPA 224
+K +EK ++ E + E E E ++ + E + D PA
Sbjct: 799 QKPKEKGPNDKEIKIESPSVETEGERCTIKQREEKGIDAPA 839
>gnl|CDD|224536 COG1621, SacC, Beta-fructosidases (levanase/invertase)
[Carbohydrate transport and metabolism].
Length = 486
Score = 28.5 bits (64), Expect = 3.3
Identities = 13/50 (26%), Positives = 17/50 (34%), Gaps = 10/50 (20%)
Query: 99 GFYSAYMVADK------VTVTSKHNDN----EQYIWESSAGGSFTIKPDN 138
G YS V D T + ++ Q I S GG+F N
Sbjct: 100 GCYSGSAVVDDGNLSLFYTGNVRDSNGIRQQTQCIAYSEDGGTFEKYSGN 149
>gnl|CDD|111553 pfam02667, SCFA_trans, Short chain fatty acid transporter. This
family consists of two sequences annotated as short
chain fatty acid transporters, however, there are no
references giving details of experimental
characterisation of this function.
Length = 453
Score = 28.2 bits (63), Expect = 3.3
Identities = 11/32 (34%), Positives = 21/32 (65%), Gaps = 2/32 (6%)
Query: 186 KEREKELSEDEEEEKKEEEKEEDKTP--KLED 215
K + +L + E+EE+K+ + ++D T KLE+
Sbjct: 219 KSIDPKLLKKEKEEEKKLKSKKDATIAEKLEN 250
>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota). DNA
Topoisomerase I (eukaryota), DNA topoisomerase V,
Vaccina virus topoisomerase, Variola virus
topoisomerase, Shope fibroma virus topoisomeras.
Length = 391
Score = 28.5 bits (64), Expect = 3.3
Identities = 19/70 (27%), Positives = 38/70 (54%), Gaps = 1/70 (1%)
Query: 153 KEDQAEYIEEKKIKEIVKK-HSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEEDKTP 211
K + + + I ++ +K S+F KL E + +K+ + EE++KK+ E+ E++
Sbjct: 294 KRLKKMILLFEMISDLKRKLKSKFERDNEKLDAEVKEKKKEKKKEEKKKKQIERLEERIE 353
Query: 212 KLEDEGEDAD 221
KLE + D +
Sbjct: 354 KLEVQATDKE 363
>gnl|CDD|217503 pfam03344, Daxx, Daxx Family. The Daxx protein (also known as the
Fas-binding protein) is thought to play a role in
apoptosis, but precise role played by Daxx remains to be
determined. Daxx forms a complex with Axin.
Length = 715
Score = 28.3 bits (63), Expect = 3.5
Identities = 15/34 (44%), Positives = 23/34 (67%)
Query: 185 EKEREKELSEDEEEEKKEEEKEEDKTPKLEDEGE 218
E+E E++ SE+EE E +EEE+E + E+E E
Sbjct: 454 EEEEEEQESEEEEGEDEEEEEEVEADNGSEEEME 487
Score = 28.3 bits (63), Expect = 4.0
Identities = 14/37 (37%), Positives = 24/37 (64%)
Query: 185 EKEREKELSEDEEEEKKEEEKEEDKTPKLEDEGEDAD 221
E+E E E+EEEE++EEE+E ++ ++E E+
Sbjct: 440 EEEESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEV 476
Score = 28.0 bits (62), Expect = 5.1
Identities = 15/39 (38%), Positives = 24/39 (61%)
Query: 185 EKEREKELSEDEEEEKKEEEKEEDKTPKLEDEGEDADCP 223
E+ E+E E+EEEE++E+E EE++ E+E E
Sbjct: 442 EESVEEEEEEEEEEEEEEQESEEEEGEDEEEEEEVEADN 480
Score = 28.0 bits (62), Expect = 5.2
Identities = 11/35 (31%), Positives = 24/35 (68%)
Query: 185 EKEREKELSEDEEEEKKEEEKEEDKTPKLEDEGED 219
E+E E+E E+E+E ++EE ++E++ ++E +
Sbjct: 448 EEEEEEEEEEEEQESEEEEGEDEEEEEEVEADNGS 482
Score = 28.0 bits (62), Expect = 5.2
Identities = 18/38 (47%), Positives = 27/38 (71%), Gaps = 3/38 (7%)
Query: 184 VEKEREKELSEDEEEEKKEEEKEEDKTPKLEDEGEDAD 221
VE+E E+E E+EEE++ EEE+ ED+ E+E +AD
Sbjct: 445 VEEEEEEEEEEEEEEQESEEEEGEDEE---EEEEVEAD 479
>gnl|CDD|233817 TIGR02304, aden_form_hyp, putative adenylate-forming enzyme.
Members of this family form a distinct clade within a
larger family of proteins that also includes coenzyme
F390 synthetase, an enzyme known in Methanobacterium
thermoautotrophicum and a few other methanogenic
archaea. That enzyme adenylates coenzyme F420 to F390, a
reversible process, during oxygen stress. Other
informative homologies include domains of the
non-ribosomal peptide synthetases involved in activation
by adenylation. The family defined by this model is
likely to be of an adenylate-forming enzyme related to
but distinct from coenzyme F390 synthetase.
Length = 430
Score = 28.3 bits (63), Expect = 3.5
Identities = 15/81 (18%), Positives = 25/81 (30%), Gaps = 3/81 (3%)
Query: 47 EEKTLTIIDSGIGMTKADLVNNLGTTAKSGTKASMEALQAGADISMIGQFGVGFYSAYMV 106
++D + M D +N G +M + + +G VG S
Sbjct: 53 PFNQWPMMDKALMMEHFDELNTAGLKKDEALDCAMRSEKTRDFKPCVGNISVGLSSGTSG 112
Query: 107 ADKVTVTSKHNDNEQYIWESS 127
+ V S EQ +W
Sbjct: 113 RRGLFVVSPE---EQQMWAGG 130
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 28.6 bits (63), Expect = 3.6
Identities = 22/60 (36%), Positives = 31/60 (51%), Gaps = 2/60 (3%)
Query: 161 EEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEEDKTPKLEDEGEDA 220
EEKK E +KK + IK E ++ +E + EE KK EE E+ L+ E E+A
Sbjct: 1644 EEKKKAEELKKAEE--ENKIKAAEEAKKAEEDKKKAEEAKKAEEDEKKAAEALKKEAEEA 1701
>gnl|CDD|179712 PRK04019, rplP0, acidic ribosomal protein P0; Validated.
Length = 330
Score = 27.9 bits (63), Expect = 3.8
Identities = 10/20 (50%), Positives = 15/20 (75%)
Query: 189 EKELSEDEEEEKKEEEKEED 208
+E E+EEEE++EE EE+
Sbjct: 301 AEEEEEEEEEEEEEEPSEEE 320
Score = 27.5 bits (62), Expect = 6.3
Identities = 12/36 (33%), Positives = 20/36 (55%)
Query: 183 LVEKEREKELSEDEEEEKKEEEKEEDKTPKLEDEGE 218
L E + + EEE++EEE+EE++ P E+
Sbjct: 288 LKEVLSAQAQAAAAEEEEEEEEEEEEEEPSEEEAAA 323
>gnl|CDD|227657 COG5354, COG5354, Uncharacterized protein, contains Trp-Asp (WD)
repeat [General function prediction only].
Length = 561
Score = 28.3 bits (63), Expect = 4.1
Identities = 17/77 (22%), Positives = 30/77 (38%), Gaps = 5/77 (6%)
Query: 135 KPDNSQPLGRGTKIVLHMKE---DQAEYIEEKKIKEIVKKHSQFIGYPIKLLV-EKEREK 190
K + KE EY +E KI+ ++KK ++ E+
Sbjct: 481 KAVIMEADSANRSSAPRKKELVEQWPEYSDEDKIRSLLKKLRAIEALKERMRSGEELEVI 540
Query: 191 ELSEDEEEEK-KEEEKE 206
++++ E EE+ E KE
Sbjct: 541 QVNKIETEEEVLSELKE 557
>gnl|CDD|236740 PRK10712, PRK10712, PTS system fructose-specific transporter
subunits IIBC; Provisional.
Length = 563
Score = 28.2 bits (63), Expect = 4.3
Identities = 14/45 (31%), Positives = 22/45 (48%), Gaps = 2/45 (4%)
Query: 49 KTLTIIDSGIGMTKADLVNNLGTTAKSGTKASMEALQAGADISMI 93
KTL IID+ +G +A + L A + KA +E + D +
Sbjct: 2 KTLLIIDANLGQARAYMAKTLLGAAAA--KAGLEIIDNPNDAELA 44
>gnl|CDD|234311 TIGR03685, L12P_arch, 50S ribosomal protein L12P. This model
represents the L12P protein of the large (50S) subunit
of the archaeal ribosome.
Length = 105
Score = 26.9 bits (60), Expect = 4.3
Identities = 12/24 (50%), Positives = 17/24 (70%)
Query: 185 EKEREKELSEDEEEEKKEEEKEED 208
E+E E+EEEE++EEE EE+
Sbjct: 72 AAAAEEEEEEEEEEEEEEEESEEE 95
Score = 25.8 bits (57), Expect = 9.2
Identities = 11/24 (45%), Positives = 16/24 (66%)
Query: 185 EKEREKELSEDEEEEKKEEEKEED 208
E+E E+EEEE++EE +EE
Sbjct: 73 AAAEEEEEEEEEEEEEEEESEEEA 96
>gnl|CDD|185603 PTZ00415, PTZ00415, transmission-blocking target antigen s230;
Provisional.
Length = 2849
Score = 28.1 bits (62), Expect = 4.4
Identities = 11/33 (33%), Positives = 24/33 (72%)
Query: 187 EREKELSEDEEEEKKEEEKEEDKTPKLEDEGED 219
+ +++ +D+EE+ +EEE+EE++ +DE E+
Sbjct: 157 DEDEDEDDDDEEDDEEEEEEEEEIKGFDDEDEE 189
>gnl|CDD|237496 PRK13766, PRK13766, Hef nuclease; Provisional.
Length = 773
Score = 27.9 bits (63), Expect = 4.6
Identities = 20/68 (29%), Positives = 32/68 (47%), Gaps = 8/68 (11%)
Query: 162 EKKIKEIVK--------KHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEEDKTPKL 213
EKK+KE +K K + EK+ + L + + + KEEE+EE+K K
Sbjct: 493 EKKMKEELKNLKGILNKKLQELDEEQKGEEEEKDEQLSLDDFVKSKGKEEEEEEEKEEKD 552
Query: 214 EDEGEDAD 221
++ ED
Sbjct: 553 KETEEDEP 560
>gnl|CDD|227596 COG5271, MDN1, AAA ATPase containing von Willebrand factor type A
(vWA) domain [General function prediction only].
Length = 4600
Score = 28.0 bits (62), Expect = 4.6
Identities = 12/37 (32%), Positives = 23/37 (62%)
Query: 183 LVEKEREKELSEDEEEEKKEEEKEEDKTPKLEDEGED 219
LV KE + + ED++ ++KE+E+E ++DE +
Sbjct: 3935 LVSKEDDNKALEDKDRQEKEDEEEMSDDVGIDDEIQP 3971
>gnl|CDD|236498 PRK09401, PRK09401, reverse gyrase; Reviewed.
Length = 1176
Score = 28.0 bits (63), Expect = 5.2
Identities = 15/55 (27%), Positives = 26/55 (47%), Gaps = 2/55 (3%)
Query: 158 EYIEEKKIKEIVKKHSQFIGYPIKLLVE-KEREKELSEDEEEEKKEEEKEEDKTP 211
EY + K ++K + G ++ KE ++ + E +K EEKEE+ P
Sbjct: 828 EYKKSKGYVLVIKLENGG-GLELEGEFSEKEEAEKFYNNLIEVEKVEEKEEELNP 881
>gnl|CDD|226175 COG3649, COG3649, CRISPR system related protein [Defense
mechanisms].
Length = 283
Score = 27.6 bits (61), Expect = 5.3
Identities = 20/47 (42%), Positives = 25/47 (53%)
Query: 170 KKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEEDKTPKLEDE 216
K+HS +GY +VEKE E EL D E K E DK ++E E
Sbjct: 225 KEHSSKLGYLSSAVVEKELESELKLDVTELKDVYESLLDKIERIEYE 271
>gnl|CDD|235795 PRK06402, rpl12p, 50S ribosomal protein L12P; Reviewed.
Length = 106
Score = 26.5 bits (59), Expect = 5.6
Identities = 10/25 (40%), Positives = 15/25 (60%)
Query: 194 EDEEEEKKEEEKEEDKTPKLEDEGE 218
EE+K+EEE+EE+K E+
Sbjct: 75 AAAEEKKEEEEEEEEKEESEEEAAA 99
Score = 26.1 bits (58), Expect = 7.9
Identities = 11/24 (45%), Positives = 16/24 (66%)
Query: 185 EKEREKELSEDEEEEKKEEEKEED 208
E++ E+EEEE+KEE +EE
Sbjct: 74 AAAAEEKKEEEEEEEEKEESEEEA 97
>gnl|CDD|219978 pfam08701, GN3L_Grn1, GNL3L/Grn1 putative GTPase. Grn1 (yeast) and
GNL3L (human) are putative GTPases which are required
for growth and play a role in processing of nucleolar
pre-rRNA. This family contains a potential nuclear
localisation signal.
Length = 80
Score = 26.1 bits (58), Expect = 5.7
Identities = 17/62 (27%), Positives = 26/62 (41%), Gaps = 7/62 (11%)
Query: 161 EEKKIKEIVKKHSQFIGYPI------KLLVEKEREKELSEDEEEEKKEEEKEEDKTPKLE 214
E KK K + G P ++L E E +K E+E+E +KE K E + +
Sbjct: 20 EAKKNPTWKSKKKKDPGIPNSFPFKEEILEEIEEKKRKQEEEKERRKEARKAE-RAEARK 78
Query: 215 DE 216
Sbjct: 79 RG 80
>gnl|CDD|169396 PRK08349, PRK08349, hypothetical protein; Validated.
Length = 198
Score = 27.4 bits (61), Expect = 5.7
Identities = 12/44 (27%), Positives = 27/44 (61%), Gaps = 4/44 (9%)
Query: 142 LGRGTKIV-LHMKEDQAEYIEEKKIKEIVKKHSQFIGYPIKLLV 184
L RG ++ +H ++D+ +E+K++E+V++ + G +K V
Sbjct: 21 LRRGVEVYPVHFRQDE---KKEEKVRELVERLQELHGGKLKDPV 61
>gnl|CDD|107059 PHA01746, PHA01746, hypothetical protein.
Length = 131
Score = 26.8 bits (59), Expect = 5.9
Identities = 16/57 (28%), Positives = 33/57 (57%), Gaps = 3/57 (5%)
Query: 163 KKIKEIVKKHSQFIGYPIKLLVEKE---REKELSEDEEEEKKEEEKEEDKTPKLEDE 216
K I E ++K+S F+ +K +E+E +++EEK+E+ +E+ + +E+E
Sbjct: 74 KLIAEFLEKYSDFLNEYVKFTPRNNNAIQEEEDDMEQQEEKEEKGREKGRKKNVEEE 130
>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
Length = 651
Score = 27.7 bits (62), Expect = 6.0
Identities = 14/38 (36%), Positives = 22/38 (57%), Gaps = 2/38 (5%)
Query: 179 PIKLLVEKEREKELSEDEEE--EKKEEEKEEDKTPKLE 214
+L EKE ++ L E + KK+EEK++ + KLE
Sbjct: 550 KEELQREKEEKEALKEQKRLRKLKKQEEKKKKELEKLE 587
>gnl|CDD|236873 PRK11186, PRK11186, carboxy-terminal protease; Provisional.
Length = 667
Score = 27.6 bits (62), Expect = 6.1
Identities = 23/76 (30%), Positives = 29/76 (38%), Gaps = 18/76 (23%)
Query: 153 KEDQAEYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKK---EEEKEEDK 209
ED A Y EK + L EREKE ED+ + E K E K
Sbjct: 583 NEDIARYKAEKDKNIVS-------------LNYAEREKENDEDDAKRLARLNERFKREGK 629
Query: 210 TP--KLEDEGEDADCP 223
P L+D +D + P
Sbjct: 630 KPLKSLDDLPKDYEEP 645
>gnl|CDD|235229 PRK04156, gltX, glutamyl-tRNA synthetase; Provisional.
Length = 567
Score = 27.5 bits (62), Expect = 6.1
Identities = 16/59 (27%), Positives = 33/59 (55%), Gaps = 2/59 (3%)
Query: 163 KKIKEIVKKHSQFIGYPIKLLVEKEREK--ELSEDEEEEKKEEEKEEDKTPKLEDEGED 219
K KEI+ + + L +E++RE+ EL+ + EE++E+++E+ P L + +
Sbjct: 42 SKAKEIIPIVKEVVEEVNSLSLEEQRERLEELAPELLEEEEEKKEEKKGLPPLPNAEKG 100
>gnl|CDD|119380 cd06418, GH25_BacA-like, BacA is a bacterial lysin from
Enterococcus faecalis that degrades bacterial cell walls
by catalyzing the hydrolysis of 1,4-beta-linkages
between N-acetylmuramic acid and N-acetyl-D-glucosamine
residues. BacA is homologous to the YbfG and YkuG
lysins of Bacillus subtilis. BacA has a C-terminal
catalytic glycosyl hydrolase family 25 (GH25) domain and
an N-terminal peptidoglycan-binding domain comprised of
three alpha helices which is similar to a domain found
in matrixins.
Length = 212
Score = 27.3 bits (61), Expect = 6.2
Identities = 6/48 (12%), Positives = 13/48 (27%), Gaps = 2/48 (4%)
Query: 96 FGVGFYSAYMVADKVTVTSKHNDNEQYIWESSAGGSFTIKPDNSQPLG 143
+ +G Y + V +V + + F+ P
Sbjct: 142 YRIGIYGSRNVCSRVLA--AAAGLAVVSFVADMSTGFSGNLGYPLPKD 187
>gnl|CDD|238356 cd00660, Topoisomer_IB_N, Topoisomer_IB_N: N-terminal DNA binding
fragment found in eukaryotic DNA topoisomerase (topo) IB
proteins similar to the monomeric yeast and human topo I
and heterodimeric topo I from Leishmania donvanni. Topo
I enzymes are divided into: topo type IA (bacterial)
and type IB (eukaryotic). Topo I relaxes superhelical
tension in duplex DNA by creating a single-strand nick,
the broken strand can then rotate around the unbroken
strand to remove DNA supercoils and, the nick is
religated, liberating topo I. These enzymes regulate the
topological changes that accompany DNA replication,
transcription and other nuclear processes. Human topo I
is the target of a diverse set of anticancer drugs
including camptothecins (CPTs). CPTs bind to the topo
I-DNA complex and inhibit re-ligation of the
single-strand nick, resulting in the accumulation of
topo I-DNA adducts. In addition to differences in
structure and some biochemical properties,
Trypanosomatid parasite topo I differ from human topo I
in their sensitivity to CPTs and other classical topo I
inhibitors. Trypanosomatid topos I play putative roles
in organizing the kinetoplast DNA network unique to
these parasites. This family may represent more than
one structural domain.
Length = 215
Score = 27.2 bits (61), Expect = 6.2
Identities = 17/50 (34%), Positives = 32/50 (64%), Gaps = 2/50 (4%)
Query: 162 EKKIKEIVKKHSQFIGYPIK--LLVEKEREKELSEDEEEEKKEEEKEEDK 209
K+ K I+KK S+ PI EKE++K +S++E++ KEE+++ ++
Sbjct: 72 TKEEKHIIKKLSKCDFTPIYQYFEEEKEKKKAMSKEEKKAIKEEKEKLEE 121
>gnl|CDD|219838 pfam08432, DUF1742, Fungal protein of unknown function (DUF1742).
This is a family of fungal proteins of unknown function.
Length = 182
Score = 27.0 bits (60), Expect = 6.3
Identities = 20/87 (22%), Positives = 44/87 (50%), Gaps = 16/87 (18%)
Query: 150 LHMKEDQ-------AEYIEEKK--------IKEIVKKHSQFIGYPIKLLVEKEREKELSE 194
H+++ AEY E KK I+++ K++ + + K K+ +K+ +
Sbjct: 41 SHLQDRHFATPIYDAEYTEAKKKKKELAEEIEKVKKEYEEKQKWKWKKKKSKK-KKDKDK 99
Query: 195 DEEEEKKEEEKEEDKTPKLEDEGEDAD 221
D++++KK+++ E+ + ED+ ED
Sbjct: 100 DKKDDKKDDKSEKKDEKEAEDKLEDLT 126
>gnl|CDD|219563 pfam07767, Nop53, Nop53 (60S ribosomal biogenesis). This nucleolar
family of proteins are involved in 60S ribosomal
biogenesis. They are specifically involved in the
processing beyond the 27S stage of 25S rRNA maturation.
This family contains sequences that bear similarity to
the glioma tumour suppressor candidate region gene 2
protein (p60). This protein has been found to interact
with herpes simplex type 1 regulatory proteins.
Length = 387
Score = 27.4 bits (61), Expect = 6.9
Identities = 11/60 (18%), Positives = 24/60 (40%)
Query: 160 IEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEEDKTPKLEDEGED 219
E+K+ + + + + E + L E+ +++ +EE +E E E E
Sbjct: 209 AEKKRQELERVEEKKLEKMAPEASRLDEMSEGLLEESDDDGEEESDDESAWEGFESEYEP 268
>gnl|CDD|218734 pfam05758, Ycf1, Ycf1. The chloroplast genomes of most higher
plants contain two giant open reading frames designated
ycf1 and ycf2. Although the function of Ycf1 is unknown,
it is known to be an essential gene.
Length = 832
Score = 27.7 bits (62), Expect = 7.1
Identities = 17/50 (34%), Positives = 27/50 (54%), Gaps = 4/50 (8%)
Query: 165 IKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEEKEEDKTPKLE 214
I E++K F + +K+ K L EE+ KKEE+KE++K + E
Sbjct: 633 ISELMKIKQIFR----NWIEKKKEFKILDYTEEKTKKEEKKEKNKREEKE 678
>gnl|CDD|221244 pfam11818, DUF3340, C-terminal domain of tail specific protease
(DUF3340). This presumed domain is found at the
C-terminus of tail specific proteases. Its function is
unknown. This family is found in bacteria and
eukaryotes. This presumed domain is typically between 88
to 187 amino acids in length.
Length = 139
Score = 26.4 bits (59), Expect = 7.3
Identities = 20/71 (28%), Positives = 28/71 (39%), Gaps = 18/71 (25%)
Query: 154 EDQAEYIEEKKIKEIVKKHSQFIGYPIKLLVEKEREKELSEDEEEEKKEEE-----KEED 208
ED A EE+ K + L E ER+ E EDE++ K E E+
Sbjct: 59 EDIAWLKEERDDKTVS-------------LNEAERKAEREEDEKKRLKRENERRKALGEE 105
Query: 209 KTPKLEDEGED 219
L+D +D
Sbjct: 106 PLKSLDDLPKD 116
>gnl|CDD|237205 PRK12792, flhA, flagellar biosynthesis protein FlhA; Reviewed.
Length = 694
Score = 27.4 bits (61), Expect = 7.6
Identities = 12/46 (26%), Positives = 20/46 (43%), Gaps = 6/46 (13%)
Query: 175 FIGY--PIKLLVEKEREKELSEDEEEEKKEEE----KEEDKTPKLE 214
F+ Y P + E + EEE + E KE+ +T ++E
Sbjct: 318 FVAYTIPRRRAARAAAEAAKVKREEESAQAEAKDSVKEQLRTAEIE 363
>gnl|CDD|218333 pfam04931, DNA_pol_phi, DNA polymerase phi. This family includes
the fifth essential DNA polymerase in yeast EC:2.7.7.7.
Pol5p is localised exclusively to the nucleolus and
binds near or at the enhancer region of rRNA-encoding
DNA repeating units.
Length = 784
Score = 27.1 bits (60), Expect = 8.2
Identities = 15/42 (35%), Positives = 21/42 (50%), Gaps = 5/42 (11%)
Query: 185 EKEREKELSEDEEEEKK-----EEEKEEDKTPKLEDEGEDAD 221
E E + E ++D+E+E + E E E D EDE ED
Sbjct: 650 EDEDDLEETDDDEDECEAIEDSESESESDGEDGEEDEQEDDA 691
>gnl|CDD|220648 pfam10243, MIP-T3, Microtubule-binding protein MIP-T3. This
protein, which interacts with both microtubules and
TRAF3 (tumour necrosis factor receptor-associated factor
3), is conserved from worms to humans. The N-terminal
region is the microtubule binding domain and is
well-conserved; the C-terminal 100 residues, also
well-conserved, constitute the coiled-coil region which
binds to TRAF3. The central region of the protein is
rich in lysine and glutamic acid and carries KKE motifs
which may also be necessary for tubulin-binding, but
this region is the least well-conserved.
Length = 506
Score = 27.2 bits (60), Expect = 8.4
Identities = 15/35 (42%), Positives = 27/35 (77%)
Query: 185 EKEREKELSEDEEEEKKEEEKEEDKTPKLEDEGED 219
E+E+EKE ++E+++KKE+ KEE K K ++E ++
Sbjct: 105 EEEKEKEQVKEEKKKKKEKPKEEPKDRKPKEEAKE 139
>gnl|CDD|219900 pfam08553, VID27, VID27 cytoplasmic protein. This is a family of
fungal and plant proteins and contains many hypothetical
proteins. VID27 is a cytoplasmic protein that plays a
potential role in vacuolar protein degradation.
Length = 794
Score = 27.4 bits (61), Expect = 8.6
Identities = 15/37 (40%), Positives = 21/37 (56%)
Query: 185 EKEREKELSEDEEEEKKEEEKEEDKTPKLEDEGEDAD 221
ER+ E EDEEEE++E+E E +DE + D
Sbjct: 385 NTERDDEEEEDEEEEEEEDEDEGPSKEHSDDEEFEED 421
>gnl|CDD|117592 pfam09026, Cenp-B_dimeris, Centromere protein B dimerisation
domain. The centromere protein B (CENP-B) dimerisation
domain is composed of two alpha-helices, which are
folded into an antiparallel configuration. Dimerisation
of CENP-B is mediated by this domain, in which monomers
dimerise to form a symmetrical, antiparallel, four-helix
bundle structure with a large hydrophobic patch in which
23 residues of one monomer form van der Waals contacts
with the other monomer. This CENP-B dimer configuration
may be suitable for capturing two distant CENP-B boxes
during centromeric heterochromatin formation.
Length = 101
Score = 25.9 bits (56), Expect = 9.2
Identities = 10/36 (27%), Positives = 20/36 (55%)
Query: 189 EKELSEDEEEEKKEEEKEEDKTPKLEDEGEDADCPA 224
E S+ +EEE ++E EED +++ ++ P+
Sbjct: 10 EDSDSDSDEEEDDDDEDEEDDDEDDDEDDDEVPVPS 45
>gnl|CDD|237063 PRK12329, nusA, transcription elongation factor NusA; Provisional.
Length = 449
Score = 27.1 bits (60), Expect = 9.2
Identities = 12/38 (31%), Positives = 22/38 (57%)
Query: 184 VEKEREKELSEDEEEEKKEEEKEEDKTPKLEDEGEDAD 221
+++E E+ L ++ E +E+ + + P EDE ED D
Sbjct: 398 LQREAEERLEAEQAERAEEDARLRELYPLPEDEFEDED 435
>gnl|CDD|234083 TIGR03002, outer_YhbN_LptA, lipopolysaccharide transport
periplasmic protein LptA. Members of this protein
family include LptA (previously called YhbN). It was
shown to be an essential protein in E. coli, implicated
in cell envelope integrity, and to play a role in the
delivery of LPS to the outer leaflet of the outer
membrane. It works with LptB (formerly yhbG), a homolog
of ABC transporter ATP-binding proteins, encoded by an
adjacent gene. Numerous homologs in other Proteobacteria
are found in a conserved location near
lipopolysaccharide inner core biosynthesis genes. This
family is related to organic solvent tolerance protein
(OstA), though distantly [Cell envelope, Biosynthesis
and degradation of surface polysaccharides and
lipopolysaccharides, Transport and binding proteins,
Other].
Length = 142
Score = 26.4 bits (59), Expect = 9.4
Identities = 16/65 (24%), Positives = 27/65 (41%), Gaps = 7/65 (10%)
Query: 107 ADKVTVTSKHNDNEQYIWESSAGGS---FTIKPDNSQPL-GRGTKIVLHMKEDQAEYIEE 162
ADKV VT + +A G F K DN + + G+ +I +D+
Sbjct: 43 ADKVVVTRNDAGGIEKA---TATGKPATFRQKLDNGKEIEGQANRIEYDPAKDEVVLTGN 99
Query: 163 KKIKE 167
++K+
Sbjct: 100 ARLKQ 104
>gnl|CDD|100109 cd05831, Ribosomal_P1, Ribosomal protein P1. This subfamily
represents the eukaryotic large ribosomal protein P1.
Eukaryotic P1 and P2 are functionally equivalent to the
bacterial protein L7/L12, but are not homologous to
L7/L12. P1 is located in the L12 stalk, with proteins
P2, P0, L11, and 28S rRNA. P1 and P2 are the only
proteins in the ribosome to occur as multimers, always
appearing as sets of heterodimers. Recent data indicate
that eukaryotes have four copies (two heterodimers),
while most archaeal species contain six copies of L12p
(three homodimers) and bacteria may have four or six
copies (two or three homodimers), depending on the
species. Experiments using S. cerevisiae P1 and P2
indicate that P1 proteins are positioned more internally
with limited reactivity in the C-terminal domains, while
P2 proteins seem to be more externally located and are
more likely to interact with other cellular components.
In lower eukaryotes, P1 and P2 are further subdivided
into P1A, P1B, P2A, and P2B, which form P1A/P2B and
P1B/P2A heterodimers. Some plant species have a third
P-protein, called P3, which is not homologous to P1 and
P2. In humans, P1 and P2 are strongly autoimmunogenic.
They play a significant role in the etiology and
pathogenesis of systemic lupus erythema (SLE). In
addition, the ribosome-inactivating protein
trichosanthin (TCS) interacts with human P0, P1, and P2,
with its primary binding site located in the C-terminal
region of P2. TCS inactivates the ribosome by
depurinating a specific adenine in the sarcin-ricin loop
of 28S rRNA.
Length = 103
Score = 25.7 bits (57), Expect = 9.6
Identities = 10/15 (66%), Positives = 13/15 (86%)
Query: 194 EDEEEEKKEEEKEED 208
E ++EEKKEEE+EE
Sbjct: 80 EAKKEEKKEEEEEES 94
>gnl|CDD|111875 pfam03032, Brevenin, Brevenin/esculentin/gaegurin/rugosin family.
This family contains a number of defence peptides
secreted from the skin of amphibians, including the
opiate-like dermorphins and deltorphins, and the
antimicrobial dermoseptins and temporins. The alignment
for this family includes the signal peptide.
Length = 46
Score = 24.6 bits (54), Expect = 9.7
Identities = 10/27 (37%), Positives = 19/27 (70%)
Query: 183 LVEKEREKELSEDEEEEKKEEEKEEDK 209
L E+E+ ++ E+E+EE+ EE+ E +
Sbjct: 20 LCEEEKREDEEENEDEEEGEEQSEVKR 46
>gnl|CDD|217286 pfam02919, Topoisom_I_N, Eukaryotic DNA topoisomerase I, DNA
binding fragment. Topoisomerase I promotes the
relaxation of DNA superhelical tension by introducing a
transient single-stranded break in duplex DNA and are
vital for the processes of replication, transcription,
and recombination. This family may be more than one
structural domain.
Length = 215
Score = 26.4 bits (59), Expect = 10.0
Identities = 13/34 (38%), Positives = 21/34 (61%), Gaps = 3/34 (8%)
Query: 179 PIKLLVEKERE--KELSEDEEEEKKEE-EKEEDK 209
PI E E+E K +S++E++ KEE +K E+
Sbjct: 90 PIYEYFEAEKEKKKAMSKEEKKAIKEEKDKLEEP 123
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.306 0.128 0.345
Gapped
Lambda K H
0.267 0.0677 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 11,230,695
Number of extensions: 1080447
Number of successful extensions: 4751
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3635
Number of HSP's successfully gapped: 592
Length of query: 224
Length of database: 10,937,602
Length adjustment: 93
Effective length of query: 131
Effective length of database: 6,812,680
Effective search space: 892461080
Effective search space used: 892461080
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 43 (22.0 bits)
S2: 57 (25.8 bits)