RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy9780
(302 letters)
>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
bacterial type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. This family
represents the SMC protein of most bacteria. The smc
gene is often associated with scpB (TIGR00281) and scpA
genes, where scp stands for segregation and condensation
protein. SMC was shown (in Caulobacter crescentus) to be
induced early in S phase but present and bound to DNA
throughout the cell cycle [Cellular processes, Cell
division, DNA metabolism, Chromosome-associated
proteins].
Length = 1179
Score = 36.6 bits (85), Expect = 0.018
Identities = 50/209 (23%), Positives = 97/209 (46%), Gaps = 12/209 (5%)
Query: 2 AQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQR 61
+ E++++ RE +R LEE +LEE E K + L E K ++
Sbjct: 296 EISRLEQQKQILRERLANLERQLEELEAQLEELESKLDELAEELAELEEK-------LEE 348
Query: 62 REPGLTLSTAQLERNKTKEQIEEEKKIALNIRIKPLNVEGWSIHKLQAKATELWETIVKL 121
+ L A+LE + + + E + L +++ L + + +L+ + L I +L
Sbjct: 349 LKEELESLEAELEELEAELEELESRLEELEEQLETLRSK---VAQLELQIASLNNEIERL 405
Query: 122 ETDKYDLEERQKRQDYDLKELKERQKQQLRHKALKKGLDPEALTGKYPVKEEERRQREVE 181
E LE+R++R +++EL ++ ++ K L+ L+ + +E ER + +E
Sbjct: 406 EARLERLEDRRERLQQEIEEL-LKKLEEAELKELQAELEELEEELEELQEELERLEEALE 464
Query: 182 EKKQRDLEEKRQRLEEAEKKRQAMMAALK 210
E ++ LEE Q L+ AE++ + A L
Sbjct: 465 ELREE-LEEAEQALDAAERELAQLQARLD 492
Score = 35.4 bits (82), Expect = 0.035
Identities = 52/222 (23%), Positives = 100/222 (45%), Gaps = 11/222 (4%)
Query: 1 MAQRKKEEEERRQREVEEK---KQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNF 57
A+ + EE +E+EEK + ++ E + +EE +K+ A+ + + +
Sbjct: 250 EAEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRE 309
Query: 58 TIQRREPGLTLSTAQLERNKTKEQIEEEKKIALNIRIKPLNVEGWSIHKLQAKATELWET 117
+ E L AQLE ++K E+ L +++ L E + L+A+ EL
Sbjct: 310 RLANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEE---LESLEAELEELEAE 366
Query: 118 IVKLETDKYDLEE--RQKRQDYDLKELKERQ--KQQLRHKALKKGLDPEALTGKYPVKEE 173
+ +LE+ +LEE R EL+ + R +A + L+ + ++E
Sbjct: 367 LEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEARLERLEDRRERLQQEIEEL 426
Query: 174 ERRQREVEEKK-QRDLEEKRQRLEEAEKKRQAMMAALKEQTN 214
++ E E K+ Q +LEE + LEE +++ + + AL+E
Sbjct: 427 LKKLEEAELKELQAELEELEEELEELQEELERLEEALEELRE 468
Score = 34.6 bits (80), Expect = 0.059
Identities = 67/298 (22%), Positives = 127/298 (42%), Gaps = 36/298 (12%)
Query: 3 QRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRR 62
+R+ E+ ER + E ++ +L RLEE ++ + + LKE + + +Q
Sbjct: 206 ERQAEKAERYKELKAELRELELALLVLRLEELREELEELQEELKEAEEELEELTAELQEL 265
Query: 63 EPGLTLSTAQLERNKTKEQIEEEKKIALNIRIKPLNVEGWSIHKLQAKATELWETIVKLE 122
E L +LE ++ +E+IEE +K + + +E L+ + L + +LE
Sbjct: 266 E--EKLEELRLEVSELEEEIEELQKELYALANEISRLEQ-QKQILRERLANLERQLEELE 322
Query: 123 TDKYDLEERQKRQDYDLKELKERQKQQLRHKALKKGLDPEALTGKYPVKEEERRQREVEE 182
+LE + +L EL+E+ ++ +KEE E
Sbjct: 323 AQLEELESKLDELAEELAELEEKLEE---------------------LKEELESLEAELE 361
Query: 183 KKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREPGLTLSTAQLERNKT 242
+ + +LEE RLEE E++ + + + + + + N I+R E L + R +
Sbjct: 362 ELEAELEELESRLEELEEQLETLRSKVAQLELQIASLNNEIERLEA--RLERLEDRRERL 419
Query: 243 KEQIEEEKKIALNIRIKPLNVEGWSIHKLQAKATELWETIVKLETDKYDLEERQKRQD 300
+++IEE K +K +LQA+ EL E + +L+ + LEE +
Sbjct: 420 QQEIEELLKKLEEAELK----------ELQAELEELEEELEELQEELERLEEALEELR 467
Score = 33.9 bits (78), Expect = 0.11
Identities = 47/221 (21%), Positives = 91/221 (41%), Gaps = 13/221 (5%)
Query: 3 QRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRR 62
++ R+ E + LEE+ +L + + +A + L+E+ ++ +
Sbjct: 724 LSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAEIEELEERLEEA---EEELAEA 780
Query: 63 EPGLTLSTAQLERNKTKEQIEEEKKIALNIRIKPLNVEGWSIHKLQAKATELWETIVKLE 122
E + AQ+E+ K + + E L + LN E L+ + L I E
Sbjct: 781 EAEIEELEAQIEQLKEELKALREALDELRAELTLLNEE---AANLRERLESLERRIAATE 837
Query: 123 TDKYDLEERQKRQDYDLKEL-KERQKQQLRHKALKKGLDPEALTGKYPVKEEE----RRQ 177
DLEE+ + D++ L E ++ + + L+ L EAL + EE R +
Sbjct: 838 RRLEDLEEQIEELSEDIESLAAEIEELEELIEELESEL--EALLNERASLEEALALLRSE 895
Query: 178 REVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKG 218
E ++ R+LE KR L ++ + +A L+ + +
Sbjct: 896 LEELSEELRELESKRSELRRELEELREKLAQLELRLEGLEV 936
Score = 30.4 bits (69), Expect = 1.6
Identities = 45/250 (18%), Positives = 91/250 (36%), Gaps = 12/250 (4%)
Query: 7 EEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREPGL 66
EE+ E + ++ L E R+ LEE E++ + + L+E + + + R E +
Sbjct: 683 EEKIEELEEKIAELEKALAELRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEV 742
Query: 67 TLSTAQLERNKTKEQIEEEKKIALNIRIKPLNVEGWSIHKLQAKATELWETIVKLETDKY 126
++ + + E + L R + + + + E I +LE
Sbjct: 743 EQLEERIAQLSKELTELEAEIEELEER----------LEEAEEELAEAEAEIEELEAQIE 792
Query: 127 DLEERQKRQDYDLKELKERQKQQLRHKALKKGLDPEALTGKYPVKEEERRQREVE-EKKQ 185
L+E K L EL+ + L +A E+L + E E + E+
Sbjct: 793 QLKEELKALREALDELRA-ELTLLNEEAANLRERLESLERRIAATERRLEDLEEQIEELS 851
Query: 186 RDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREPGLTLSTAQLERNKTKEQ 245
D+E +EE E+ + + + L+ N+ + L + +L ++K
Sbjct: 852 EDIESLAAEIEELEELIEELESELEALLNERASLEEALALLRSELEELSEELRELESKRS 911
Query: 246 IEEEKKIALN 255
+ L
Sbjct: 912 ELRRELEELR 921
>gnl|CDD|178635 PLN03086, PLN03086, PRLI-interacting factor K; Provisional.
Length = 567
Score = 35.2 bits (81), Expect = 0.033
Identities = 18/46 (39%), Positives = 32/46 (69%)
Query: 3 QRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQ 48
+R +E+ ER QRE +++ + LE +R+ EEA K+R+A+ AA + +
Sbjct: 6 RRAREKLEREQRERKQRAKLKLERERKAKEEAAKQREAIEAAQRSR 51
Score = 31.8 bits (72), Expect = 0.42
Identities = 16/39 (41%), Positives = 27/39 (69%)
Query: 174 ERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQ 212
ER QRE +++ + LE +R+ EEA K+R+A+ AA + +
Sbjct: 13 EREQRERKQRAKLKLERERKAKEEAAKQREAIEAAQRSR 51
Score = 27.9 bits (62), Expect = 8.2
Identities = 21/84 (25%), Positives = 42/84 (50%), Gaps = 16/84 (19%)
Query: 127 DLEERQKRQDYDLKELKER-QKQQLRHKALKKGLDPEALTGKYPVKEEERRQREVEEKKQ 185
D E R+ R+ + +E +ER Q+ +L+ + +K KEE +QRE E Q
Sbjct: 2 DFELRRAREKLE-REQRERKQRAKLKLERERKA------------KEEAAKQREAIEAAQ 48
Query: 186 RD--LEEKRQRLEEAEKKRQAMMA 207
R L+ +++ ++ ++++ A
Sbjct: 49 RSRRLDAIEAQIKADQQMQESLQA 72
>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
chromosome partitioning].
Length = 1163
Score = 35.5 bits (82), Expect = 0.035
Identities = 71/318 (22%), Positives = 134/318 (42%), Gaps = 25/318 (7%)
Query: 4 RKKEEEERRQREVEEKKQR---DLEEKRQRLEEAEKKRQAMMAALKEQTNKS-------- 52
+KEE ER+ EE +R LEE ++LE+ E++ + + +
Sbjct: 172 ERKEEAERKLERTEENLERLEDLLEELEKQLEKLERQAEKAERYQELKAELRELELALLL 231
Query: 53 ---KGPNFTIQRREPGLTLSTAQLERNKTKEQIEEEKKIALNIRIKPLNVEGWSIHKLQA 109
K ++ E L+ +LE + + + E++ L ++ L E + +LQ
Sbjct: 232 AKLKELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREE---LEELQE 288
Query: 110 KATELWETIVKLETDKYDLEERQKRQDYDLKELKERQKQQL----RHKALKKGLDPEALT 165
+ EL E I +LE + L ER + + +L+EL+ER ++ K + +
Sbjct: 289 ELLELKEEIEELEGEISLLRERLEELENELEELEERLEELKEKIEALKEELEERETLLEE 348
Query: 166 GKYPVKEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQR 225
+ + E E + E+EEK LEE + E ++ + A L E N+ + I+
Sbjct: 349 LEQLLAELEEAKEELEEKLSALLEELEELFEALREELAELEAELAEIRNELEELKREIES 408
Query: 226 REPGLTLSTAQLERNKTKEQIEEEKKIALNIRIKPLNVEGWS----IHKLQAKATELWET 281
E L + +LE K + + E + L ++ LN E + +L+ + EL
Sbjct: 409 LEERLERLSERLEDLKEELKELEAELEELQTELEELNEELEELEEQLEELRDRLKELERE 468
Query: 282 IVKLETDKYDLEERQKRQ 299
+ +L+ + LE+
Sbjct: 469 LAELQEELQRLEKELSSL 486
Score = 33.1 bits (76), Expect = 0.19
Identities = 52/255 (20%), Positives = 110/255 (43%), Gaps = 12/255 (4%)
Query: 4 RKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPN------- 56
K EEE + + + LEE R++LEE E++ + + L + +
Sbjct: 684 EKLEEELKSLKNELRSLEDLLEELRRQLEELERQLEELKRELAALEEELEQLQSRLEELE 743
Query: 57 FTIQRREPGLTLSTAQLERNKTKEQIEEEKKIALNIRIKPL-NVEGWSIHKLQAKATELW 115
++ E L +LE + + + EE L I+ L +L+ EL
Sbjct: 744 EELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEEKRQALQEELEELEEELE 803
Query: 116 ETIVKLETDKYDLEERQKRQDYDLKELKERQKQQLRHKALKKGLDPEALTGKYPVKEEER 175
E +L+ + +LE ++R++ +E++E +++ + L+ E +++E
Sbjct: 804 EAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEE----LEKELE 859
Query: 176 RQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREPGLTLSTA 235
+E E+ + + EE L+E E++++ + L+E ++ I++ L A
Sbjct: 860 ELKEELEELEAEKEELEDELKELEEEKEELEEELRELESELAELKEEIEKLRERLEELEA 919
Query: 236 QLERNKTKEQIEEEK 250
+LER + + EE+
Sbjct: 920 KLERLEVELPELEEE 934
Score = 32.8 bits (75), Expect = 0.27
Identities = 43/228 (18%), Positives = 97/228 (42%), Gaps = 7/228 (3%)
Query: 3 QRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRR 62
+ E E RE E+ + +LEE +RLEE ++K +A+ L+E+ + +++
Sbjct: 297 IEELEGEISLLRERLEELENELEELEERLEELKEKIEALKEELEERETLLEE----LEQL 352
Query: 63 EPGLTLSTAQLERNKTKEQIEEEKKIALNIRIKPLNVEGWSIHKLQAKATELWETIVKLE 122
L + +LE +EE +++ +R + L + +++ + EL I LE
Sbjct: 353 LAELEEAKEELEEKL-SALLEELEELFEALREE-LAELEAELAEIRNELEELKREIESLE 410
Query: 123 TDKYDLEERQKRQDYDLKELKERQKQQLRHKALKKGLDPEALTGKYPVKEEERRQREVEE 182
L ER + +LKEL+ ++ + E L + + ++ E E
Sbjct: 411 ERLERLSERLEDLKEELKELEAELEELQTELEELN-EELEELEEQLEELRDRLKELEREL 469
Query: 183 KKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREPGL 230
+ ++ ++ ++ + + R + A + + + ++ PG+
Sbjct: 470 AELQEELQRLEKELSSLEARLDRLEAEQRASQGVRAVLEALESGLPGV 517
Score = 30.1 bits (68), Expect = 1.7
Identities = 43/214 (20%), Positives = 97/214 (45%), Gaps = 22/214 (10%)
Query: 4 RKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRRE 63
E E + E+ ++++EE + +EE E+K + L+E + E
Sbjct: 810 DALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKE---------LEE 860
Query: 64 PGLTLSTAQLERNKTKEQIEEEKKIALNIRIKPLNVEGWSIHKLQAKATELWETIVKLET 123
L + E+ + +++++E ++ + + + +L+++ EL E I KL
Sbjct: 861 LKEELEELEAEKEELEDELKELEEEKEELEEE--------LRELESELAELKEEIEKLRE 912
Query: 124 DKYDLEERQKRQDYDLKEL---KERQKQQLRHKALKKGLD--PEALTGKYPVKEEERRQR 178
+LE + +R + +L EL E + + L++ ++ E + PV +
Sbjct: 913 RLEELEAKLERLEVELPELEEELEEEYEDTLETELEREIERLEEEIEALGPVNLRAIEEY 972
Query: 179 EVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQ 212
E E++ +L+ +R+ LEEA++K ++ L ++
Sbjct: 973 EEVEERYEELKSQREDLEEAKEKLLEVIEELDKE 1006
Score = 29.7 bits (67), Expect = 2.3
Identities = 51/249 (20%), Positives = 102/249 (40%), Gaps = 24/249 (9%)
Query: 4 RKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRRE 63
+ EEE E E+ Q LEE + LE E+ + ++E K +
Sbjct: 740 EELEEELEELEEELEELQERLEELEEELESLEEALAKLKEEIEELEEKRQ---------- 789
Query: 64 PGLTLSTAQLERNKTKEQIEEEKKIALNIRIKPLNVEGWSIHKLQAKATELWETIVKLET 123
Q E + +E++EE ++ + + L +L+ + EL E I +LE
Sbjct: 790 ------ALQEELEELEEELEEAERRLDALE-RELESLEQRRERLEQEIEELEEEIEELEE 842
Query: 124 DKYDLEERQKRQDYDLKELKERQKQQLRHKALKKGLDPEALTGKYPVKEEERRQREVEEK 183
+LEE + + +L+ELKE ++ K + + L KEE + E
Sbjct: 843 KLDELEEELEELEKELEELKEELEELEAEKEELE----DELKELEEEKEELEEELRELES 898
Query: 184 KQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREPGLTLSTAQLERNKTK 243
+ +L+E+ ++L E ++ +A + L+ + + ++ + + E + +
Sbjct: 899 ELAELKEEIEKLRERLEELEAKLERLEVELPEL---EEELEEEYEDTLETELEREIERLE 955
Query: 244 EQIEEEKKI 252
E+IE +
Sbjct: 956 EEIEALGPV 964
>gnl|CDD|221432 pfam12128, DUF3584, Protein of unknown function (DUF3584). This
protein is found in bacteria and eukaryotes. Proteins in
this family are typically between 943 to 1234 amino
acids in length. This family contains a P-loop motif
suggesting it is a nucleotide binding protein. It may be
involved in replication.
Length = 1198
Score = 34.3 bits (79), Expect = 0.082
Identities = 56/319 (17%), Positives = 129/319 (40%), Gaps = 32/319 (10%)
Query: 2 AQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQR 61
E+ + + K++ + E QR E + + + L+ Q + G R
Sbjct: 487 QAEANVEQLQSELRQLRKRRDEALEALQRAERRLLQLRQALDELELQLSPQAGSLLHFLR 546
Query: 62 REP-------GLTLSTAQLERN----KTKEQIEEEKKIALNIRIKPLNVEGWSIHKLQAK 110
E G +S LER + E + + +++ ++ L+V + A
Sbjct: 547 NEAPGWEESIGKVISPELLERTDLDPQLVEGSDSDTLYGVSLDLQRLDVPDY-----AAN 601
Query: 111 ATELWETIVKLETDKYDLEERQKRQDYDL----KELKERQKQQLRHKALKKGLDPEALTG 166
TEL E + + E +QK+ + L EL+E+++ + + K +
Sbjct: 602 ETELRERLQQAEEALQSAVAKQKQAEEQLVQANAELEEQKRAEAEARTALKQARLDLQRL 661
Query: 167 KYPVKEEERRQREVEEKKQRDLEEKRQRLEEA----EKKRQAMMAALKEQ-----TNKSK 217
+ + + + ++++ E + ++L+ +++QA + ALK+ T +
Sbjct: 662 QNEQQSLKDKLELAIAERKQQAETQLRQLDAQLKQLLEQQQAFLEALKDDFRELRTERLA 721
Query: 218 GPNFTIQRREPGLTLSTAQLERNKT--KEQIEEEKKIALNIRIKPLNVEGWSIHKLQAKA 275
+ L +A +E +T K +++E KK + + L+V+ ++ +L+ +
Sbjct: 722 KWQVVEGELDNQLAQLSAAIEAARTQAKARLKELKK-QYDRELASLDVDPNTVKELKRQI 780
Query: 276 TELWETIVKLETDKYDLEE 294
EL TI ++ + ++ E
Sbjct: 781 EELETTIERIAVRRPEVRE 799
>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
Length = 1388
Score = 33.9 bits (78), Expect = 0.10
Identities = 41/251 (16%), Positives = 92/251 (36%), Gaps = 26/251 (10%)
Query: 1 MAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQ 60
+ +++KE E+ + ++ DL++ + LEE E+ + + + K
Sbjct: 1111 LEKKEKELEKLKNTTPKDMWLEDLDKFEEALEEQEEVEE----KEIAKEQRLKSKTKGKA 1166
Query: 61 RREPGLTLSTAQLERNKTKEQIEEEKKIALNIRIKPLNVEGWSIHKLQAKATELWETIVK 120
+ R ++ E++KK + + K +V G S ++ K
Sbjct: 1167 SK-----------LRKPKLKKKEKKKKKSSADKSKKASVVGNSKRV---------DSDEK 1206
Query: 121 LETDKYDLEERQKRQDYDLKELKERQKQQLRHKALKKGLDPEALTGKYPVKEEERRQREV 180
+ D ++ D ++ E QK + + ++K+ + + K +E ++
Sbjct: 1207 RKLDDKPDNKKSNSSGSDQED-DEEQKTKPKKSSVKRLKSKKNNSSKSSEDNDEFSSDDL 1265
Query: 181 EEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREPGLTLSTAQLERN 240
++ + KR + E SK + T ++ + L S A L++
Sbjct: 1266 SKEGKPKNAPKRVSAVQYSPP-PPSKRPDGESNGGSKPSSPTKKKVKKRLEGSLAALKKK 1324
Query: 241 KTKEQIEEEKK 251
K E+ KK
Sbjct: 1325 KKSEKKTARKK 1335
>gnl|CDD|216807 pfam01956, DUF106, Integral membrane protein DUF106. This
archaebacterial protein family has no known function.
Members are predicted to be integral membrane proteins.
Length = 168
Score = 32.6 bits (75), Expect = 0.11
Identities = 18/59 (30%), Positives = 36/59 (61%), Gaps = 5/59 (8%)
Query: 4 RKKEEEERRQREVEEKKQRDLEE--KRQRLEEAEKKRQAMMAALKE-QTNKSKGPNFTI 59
RK E+ ++R++E+ +K+ R+L + + ++ EK+++ +M KE + K PNF I
Sbjct: 40 RKMEKYQKREKEI-QKRARELRKNGDKLSPKKFEKRQEELMEDQKEMMMDMMK-PNFAI 96
>gnl|CDD|201540 pfam00992, Troponin, Troponin. Troponin (Tn) contains three
subunits, Ca2+ binding (TnC), inhibitory (TnI), and
tropomyosin binding (TnT). this Pfam contains members of
the TnT subunit. Troponin is a complex of three
proteins, Ca2+ binding (TnC), inhibitory (TnI), and
tropomyosin binding (TnT). The troponin complex
regulates Ca++ induced muscle contraction. This family
includes troponin T and troponin I. Troponin I binds to
actin and troponin T binds to tropomyosin.
Length = 131
Score = 31.8 bits (73), Expect = 0.15
Identities = 25/78 (32%), Positives = 43/78 (55%), Gaps = 1/78 (1%)
Query: 66 LTLSTAQLERNKTKEQIEEEKKIALNIRIKPLNVEGWSIHKLQAKATELWETIVKLETDK 125
L L A E + + EEEK+ L R PL + S +LQ +L I +L+ ++
Sbjct: 7 LLLLKAAEELEFEQRKKEEEKEKYLAERCPPLRLSL-SRAELQELCKKLHARIDRLDEER 65
Query: 126 YDLEERQKRQDYDLKELK 143
YD+EE+ ++D ++++LK
Sbjct: 66 YDIEEKVAKKDKEIEDLK 83
Score = 28.7 bits (65), Expect = 1.6
Identities = 23/73 (31%), Positives = 38/73 (52%), Gaps = 1/73 (1%)
Query: 230 LTLSTAQLERNKTKEQIEEEKKIALNIRIKPLNVEGWSIHKLQAKATELWETIVKLETDK 289
L L A E + + EEEK+ L R PL + S +LQ +L I +L+ ++
Sbjct: 7 LLLLKAAEELEFEQRKKEEEKEKYLAERCPPLRLSL-SRAELQELCKKLHARIDRLDEER 65
Query: 290 YDLEERQKRQDYD 302
YD+EE+ ++D +
Sbjct: 66 YDIEEKVAKKDKE 78
>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
Length = 2084
Score = 33.6 bits (76), Expect = 0.16
Identities = 52/253 (20%), Positives = 103/253 (40%), Gaps = 17/253 (6%)
Query: 2 AQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQR 61
A +KK EE ++ E + + E E AE+K +A +E K+ +
Sbjct: 1333 AAKKKAEEAKKAAEAAKAEA---EAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEE 1389
Query: 62 REPGLTLSTAQLERNKTKEQI---EEEKKIALNIRIKPLNVEGWSIHKLQAKATELWETI 118
++ E K +++ KK A + K + K +A E
Sbjct: 1390 KKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKA------EEA 1443
Query: 119 VKLETDKYDLEERQKRQDYDLKELKERQKQQLRHKALKKGLDPEALTGKYPVKEEERRQR 178
K + K EE +K ++ K + ++ + + KA + EA K EE +++
Sbjct: 1444 KKADEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKKADEAKK-----KAEEAKKK 1498
Query: 179 EVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREPGLTLSTAQLE 238
E KK + ++K ++AE+ ++A A E+ K+ ++++ +L+
Sbjct: 1499 ADEAKKAAEAKKKADEAKKAEEAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELK 1558
Query: 239 RNKTKEQIEEEKK 251
+ + K++ EE KK
Sbjct: 1559 KAEEKKKAEEAKK 1571
Score = 32.8 bits (74), Expect = 0.29
Identities = 43/223 (19%), Positives = 97/223 (43%), Gaps = 20/223 (8%)
Query: 2 AQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQR 61
A+ K+ EE+++ E +K + D ++ EEA+K +A + + + + K +
Sbjct: 1554 AEELKKAEEKKKAEEAKKAEEDKNMALRKAEEAKKAEEARIEEVMKLYEEEKKMKAEEAK 1613
Query: 62 REPGLTLSTAQLER--------NKTKEQIEEEKKIALNIRIKPLNVEGWSIHKLQAKATE 113
+ + +L++ + K++ EEKK A ++ E I +
Sbjct: 1614 KAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAE---EENKIKAAEEAKKA 1670
Query: 114 LWETIVKLETDKYDLEERQKRQDYDLKELKERQKQQLRHKALKKGLDPEALTGKYPVKEE 173
+ E K + +E++ + + + ++ ++L+ K ++ E L K E
Sbjct: 1671 EEDKKKAEEAKKAEEDEKKAAEALKKEAEEAKKAEELKKKEAEEKKKAEELK-----KAE 1725
Query: 174 ERRQREVEEKKQRDLEEKRQ----RLEEAEKKRQAMMAALKEQ 212
E + + EE K+ E+K++ + +E EKK+ A + +E+
Sbjct: 1726 EENKIKAEEAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEK 1768
Score = 30.1 bits (67), Expect = 1.8
Identities = 45/213 (21%), Positives = 95/213 (44%), Gaps = 14/213 (6%)
Query: 3 QRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRR 62
+ KK EEE+++ E +KK+ + ++K + L++AE++ + A ++ + K ++
Sbjct: 1624 ELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEENKIKAAEEAKKAEEDKKKAEEAKKA 1683
Query: 63 EPGLTLSTAQLERNKTKEQIEEEKKIALNIRIKPLNVEGWSIHKLQAKATELWETIVKLE 122
E + L++ EE K A ++ K E K + E +K E
Sbjct: 1684 EEDEKKAAEALKKE------AEEAKKAEELKKK----EAEEKKKAEELKKAEEENKIKAE 1733
Query: 123 TDKYDLEERQKRQDYDLKELKERQKQQLRHKALKKGLDPEALTG----KYPVKEEERRQR 178
K + EE +K+ + K+ +E++K K +K + + + EE+ ++R
Sbjct: 1734 EAKKEAEEDKKKAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRR 1793
Query: 179 EVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKE 211
+KK +D+ + + E K+ ++ KE
Sbjct: 1794 MEVDKKIKDIFDNFANIIEGGKEGNLVINDSKE 1826
Score = 29.7 bits (66), Expect = 2.6
Identities = 46/217 (21%), Positives = 94/217 (43%), Gaps = 18/217 (8%)
Query: 5 KKEEEERRQREVEEK--KQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRR 62
KK EE+++ E ++K + + +E +++ EEA+KK A +E ++
Sbjct: 1296 KKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAE--------- 1346
Query: 63 EPGLTLSTAQLERNKTKEQIEEEKKIALNIRIKPLNVEGWSIHKLQAKATELWETIVKLE 122
+ A+ E + + EEK A + + + + K + + E K E
Sbjct: 1347 -----AAKAEAEAAADEAEAAEEKAEAAEKKKEEAKKKADAAKKKAEEKKKADEAKKKAE 1401
Query: 123 TDKYDLEERQKRQDYDLK--ELKERQKQQLRHKALKKGLDPEALTGKYPVKEEERRQREV 180
DK +E +K K E K++ +++ + KK + + K EE ++ E
Sbjct: 1402 EDKKKADELKKAAAAKKKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKKAEE 1461
Query: 181 EEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSK 217
+KK + ++ + ++AE+ ++A A K + K K
Sbjct: 1462 AKKKAEEAKKADEAKKKAEEAKKADEAKKKAEEAKKK 1498
>gnl|CDD|217817 pfam03961, DUF342, Protein of unknown function (DUF342). This
family of bacterial proteins has no known function. The
proteins are in the region of 500-600 amino acid
residues in length.
Length = 450
Score = 33.0 bits (76), Expect = 0.20
Identities = 26/99 (26%), Positives = 54/99 (54%), Gaps = 10/99 (10%)
Query: 117 TIVKLETDKYDLEERQKRQDYDLKELKERQKQQLRHKALKKGLD--PEALTGKYPVKEEE 174
T +++ D +L+E K + +LKEL+E ++ +KK L P+ G+ P ++ E
Sbjct: 321 TEIEVGVDFPELKEELKELEEELKELEEELEK------IKKLLKKLPKKARGQLPPEKRE 374
Query: 175 RRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQT 213
+ ++ +E K++ L E+ + LEE K+ + + +L +
Sbjct: 375 QLEKLLETKEK--LSEELEELEEELKELKEELESLYSEG 411
>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
protein; Reviewed.
Length = 782
Score = 32.9 bits (76), Expect = 0.24
Identities = 23/96 (23%), Positives = 42/96 (43%), Gaps = 11/96 (11%)
Query: 2 AQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQR 61
A+ +E E+ + E+EEKK++ EE+ + LEEAEK+ Q + K++ ++ +Q+
Sbjct: 539 AEALLKEAEKLKEELEEKKEKLQEEEDKLLEEAEKEAQQAIKEAKKEADEIIKELRQLQK 598
Query: 62 REPGLTLSTAQLERNKTKEQIEEEKKIALNIRIKPL 97
K E + LN +
Sbjct: 599 GGYA-----------SVKAHELIEARKRLNKANEKK 623
Score = 29.0 bits (66), Expect = 3.3
Identities = 27/124 (21%), Positives = 48/124 (38%), Gaps = 24/124 (19%)
Query: 138 DLKELKERQKQQLRHKALKKGLDPEALTGKYPVKEEERRQREVEEKKQRDLEEKRQRLEE 197
L+EL+ +Q+ KE E+ + E+EEKK++ EE+ + LEE
Sbjct: 524 SLEELERELEQKAEEAEALL-------------KEAEKLKEELEEKKEKLQEEEDKLLEE 570
Query: 198 AEKKRQAMMAALKEQTNKSKGPNFTIQRREPGLTLSTAQLERNKTKEQIEEEKKIALNIR 257
AEK+ Q + K++ ++ +Q+ K E + LN
Sbjct: 571 AEKEAQQAIKEAKKEADEIIKELRQLQKGGYA-----------SVKAHELIEARKRLNKA 619
Query: 258 IKPL 261
+
Sbjct: 620 NEKK 623
>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein. This family
consists of several Borrelia P83/P100 antigen proteins.
Length = 489
Score = 32.7 bits (74), Expect = 0.26
Identities = 25/120 (20%), Positives = 54/120 (45%), Gaps = 1/120 (0%)
Query: 130 ERQKRQDYDLKELKERQKQ-QLRHKALKKGLDPEALTGKYPVKEEERRQREVEEKKQRDL 188
E ++ + +E+++ Q + + + K D +A K K E+ + E + Q+
Sbjct: 271 EDKQVAENQKREIEKAQIEIKKNDEEALKAKDHKAFDLKQESKASEKEAEDKELEAQKKR 330
Query: 189 EEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREPGLTLSTAQLERNKTKEQIEE 248
E + L++ + + +A +L E S P + ++ +P LS L KT+ ++ E
Sbjct: 331 EPVAEDLQKTKPQVEAQPTSLNEDAIDSSNPVYGLKVVDPITNLSELVLIDLKTEVRLRE 390
>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family. Atrophin-1 is the
protein product of the dentatorubral-pallidoluysian
atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
neurodegenerative disorder. It is caused by the
expansion of a CAG repeat in the DRPLA gene on
chromosome 12p. This results in an extended
polyglutamine region in atrophin-1, that is thought to
confer toxicity to the protein, possibly through
altering its interactions with other proteins. The
expansion of a CAG repeat is also the underlying defect
in six other neurodegenerative disorders, including
Huntington's disease. One interaction of expanded
polyglutamine repeats that is thought to be pathogenic
is that with the short glutamine repeat in the
transcriptional coactivator CREB binding protein, CBP.
This interaction draws CBP away from its usual nuclear
location to the expanded polyglutamine repeat protein
aggregates that are characteristic of the polyglutamine
neurodegenerative disorders. This interferes with
CBP-mediated transcription and causes cytotoxicity.
Length = 979
Score = 32.4 bits (73), Expect = 0.32
Identities = 15/46 (32%), Positives = 34/46 (73%), Gaps = 2/46 (4%)
Query: 1 MAQRKKEEEERRQREVEEK--KQRDLEEKRQRLEEAEKKRQAMMAA 44
+A++++E E+ +RE E+K ++R+ E+++++ E E++R+A AA
Sbjct: 577 LAKKREEAVEKAKREAEQKAREEREREKEKEKEREREREREAERAA 622
Score = 30.4 bits (68), Expect = 1.2
Identities = 15/39 (38%), Positives = 29/39 (74%), Gaps = 2/39 (5%)
Query: 2 AQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQA 40
A+R+ E++ R +RE E++K+++ E +R+R EAE+ +A
Sbjct: 588 AKREAEQKAREEREREKEKEKERERERER--EAERAAKA 624
Score = 27.7 bits (61), Expect = 9.0
Identities = 11/37 (29%), Positives = 27/37 (72%)
Query: 172 EEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAA 208
E+ +R+ E + +++R+ E+++++ E E++R+A AA
Sbjct: 586 EKAKREAEQKAREEREREKEKEKEREREREREAERAA 622
>gnl|CDD|221371 pfam12004, DUF3498, Domain of unknown function (DUF3498). This
presumed domain is functionally uncharacterized. This
domain is found in eukaryotes. This domain is typically
between 433 to 538 amino acids in length. This domain is
found associated with pfam00616, pfam00168. This domain
has two conserved sequence motifs: DLQ and PLSFQNP.
Length = 489
Score = 32.4 bits (73), Expect = 0.33
Identities = 14/37 (37%), Positives = 26/37 (70%)
Query: 1 MAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKK 37
++ R+ EE ERR E++ Q+ L+E + RLE++E++
Sbjct: 381 VSVRRLEEYERRLLGQEQQMQKLLQEYQARLEDSEER 417
Score = 27.8 bits (61), Expect = 9.4
Identities = 18/73 (24%), Positives = 38/73 (52%), Gaps = 4/73 (5%)
Query: 129 EERQKRQDYDLKELKERQKQQLRHKALKKGLDPEALTGKYPVKEEERRQREVEEKKQRDL 188
E+ + + + +E + +K + LK+ L ++E ERR E++ Q+ L
Sbjct: 349 EDIEYETEENREEGTQAEKYEQEIARLKERLRVSVRR----LEEYERRLLGQEQQMQKLL 404
Query: 189 EEKRQRLEEAEKK 201
+E + RLE++E++
Sbjct: 405 QEYQARLEDSEER 417
>gnl|CDD|219580 pfam07793, DUF1631, Protein of unknown function (DUF1631). The
members of this family are sequences derived from a
group of hypothetical proteins expressed by certain
bacterial species. The region concerned is approximately
440 amino acid residues in length.
Length = 729
Score = 31.1 bits (71), Expect = 0.84
Identities = 17/42 (40%), Positives = 27/42 (64%), Gaps = 3/42 (7%)
Query: 8 EEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQT 49
E+ERR+ E+ E++ R+ EE R RLE A RQ + L+++
Sbjct: 459 EQERRRSELVEQRTREAEEGRARLELA---RQQVQQELEQRL 497
Score = 31.1 bits (71), Expect = 0.84
Identities = 17/42 (40%), Positives = 27/42 (64%), Gaps = 3/42 (7%)
Query: 172 EEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQT 213
E+ERR+ E+ E++ R+ EE R RLE A RQ + L+++
Sbjct: 459 EQERRRSELVEQRTREAEEGRARLELA---RQQVQQELEQRL 497
>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
primarily archaeal type. SMC (structural maintenance of
chromosomes) proteins bind DNA and act in organizing and
segregating chromosomes for partition. SMC proteins are
found in bacteria, archaea, and eukaryotes. It is found
in a single copy and is homodimeric in prokaryotes, but
six paralogs (excluded from this family) are found in
eukarotes, where SMC proteins are heterodimeric. This
family represents the SMC protein of archaea and a few
bacteria (Aquifex, Synechocystis, etc); the SMC of other
bacteria is described by TIGR02168. The N- and
C-terminal domains of this protein are well conserved,
but the central hinge region is skewed in composition
and highly divergent [Cellular processes, Cell division,
DNA metabolism, Chromosome-associated proteins].
Length = 1164
Score = 30.8 bits (70), Expect = 0.95
Identities = 38/202 (18%), Positives = 82/202 (40%), Gaps = 10/202 (4%)
Query: 11 RRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREPGLTLST 70
R +E++ D EE+ +LE K A + L+ + + + +R + +
Sbjct: 307 ERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEER-----KRRDKLTEEYAE 361
Query: 71 AQLERNKTKEQIEEEKKIALNIRIKPLNVEGWSIHKLQAKATELWETIVKLETDKYDLEE 130
+ E + ++EE K R + + + KL+ + EL + +L+ + L E
Sbjct: 362 LKEELEDLRAELEEVDKEFAETRDELKDYRE-KLEKLKREINELKRELDRLQEELQRLSE 420
Query: 131 RQKRQDYDLKELKERQKQQLRHKALKKGLDPEALTGKYPVKEEERRQREVEEKKQRDLEE 190
+ + ++ + +L + K L+ + K E+ E++ DL+E
Sbjct: 421 ELADLNAAIAGIEAKIN-ELEEEKEDKALEIKKQEWKL---EQLAADLSKYEQELYDLKE 476
Query: 191 KRQRLEEAEKKRQAMMAALKEQ 212
+ R+E+ K Q +A + Q
Sbjct: 477 EYDRVEKELSKLQRELAEAEAQ 498
Score = 28.1 bits (63), Expect = 6.5
Identities = 39/220 (17%), Positives = 89/220 (40%), Gaps = 13/220 (5%)
Query: 7 EEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREPGL 66
E + E EK ++ E +RLEE E+ + + +K+ + ++ + L
Sbjct: 243 ERQLASLEEELEKLTEEISELEKRLEEIEQLLEELNKKIKD---LGEEEQLRVKEKIGEL 299
Query: 67 TLSTAQLERN------KTKEQIEEEKKIALNIRIKPLNVEGW--SIHKLQAKATELWETI 118
A LER+ + ++ E K+ I +E I + + + +L E
Sbjct: 300 EAEIASLERSIAEKERELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEY 359
Query: 119 VKLETDKYDLEERQKRQDYDLKELKERQKQ-QLRHKALKKGLDPEALTGKYPVKEEERRQ 177
+L+ + DL + D + E ++ K + + + LK+ ++ +E +R
Sbjct: 360 AELKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKREINELKRELDRLQEELQRLS 419
Query: 178 REVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSK 217
E+ + + ++ E E++++ +K+Q K +
Sbjct: 420 EELADLNA-AIAGIEAKINELEEEKEDKALEIKKQEWKLE 458
Score = 27.7 bits (62), Expect = 8.7
Identities = 36/203 (17%), Positives = 83/203 (40%), Gaps = 18/203 (8%)
Query: 3 QRKKEEEERRQREVE-EKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQR 61
+ + EE + E E +K ++EE + +EE K+R + E
Sbjct: 316 ELEDAEERLAKLEAEIDKLLAEIEELEREIEEERKRRDKLTEEYAE-------------- 361
Query: 62 REPGLTLSTAQLERNKTKEQIEEEKKIALNIRIKPLNVEGWSIHKLQAKATELWETIVKL 121
+ L A+LE + ++ +++ L E I++L+ + L E + +L
Sbjct: 362 LKEELEDLRAELEEVDKEFAETRDELKDYREKLEKLKRE---INELKRELDRLQEELQRL 418
Query: 122 ETDKYDLEERQKRQDYDLKELKERQKQQLRHKALKKGLDPEALTGKYPVKEEERRQREVE 181
+ DL + + EL+E ++ + ++ + ++E +E
Sbjct: 419 SEELADLNAAIAGIEAKINELEEEKEDKALEIKKQEWKLEQLAADLSKYEQELYDLKEEY 478
Query: 182 EKKQRDLEEKRQRLEEAEKKRQA 204
++ +++L + ++ L EAE + +A
Sbjct: 479 DRVEKELSKLQRELAEAEAQARA 501
Score = 27.7 bits (62), Expect = 9.7
Identities = 44/210 (20%), Positives = 75/210 (35%), Gaps = 26/210 (12%)
Query: 4 RKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRRE 63
K EEE R + ++ L E EK+ Q + + + I+ E
Sbjct: 801 SKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQELQEQRIDLKEQ-------IKSIE 853
Query: 64 PGLTLSTAQLERNKTKEQIEEEKKIALNIRIKPLNVEGWSIHKLQAKATELWETIVKLET 123
+ + E E+ EE + AL L +L+A+ EL I +LE
Sbjct: 854 KEIENLNGKKEE---LEEELEELEAALRDLESRLGDLKKERDELEAQLRELERKIEELEA 910
Query: 124 DKYDLEERQKRQDYDLKELKERQKQQLRHKALKKGLDPEALTGKYPVKEEERRQREVE-- 181
+R L+ L+E + K + + E L+ + E +R + E+
Sbjct: 911 QIEKKRKRLSELKAKLEALEEELSEIEDPKGEDEEIPEEELSLEDVQAELQRVEEEIRAL 970
Query: 182 --------------EKKQRDLEEKRQRLEE 197
K+ +L+EKR +LEE
Sbjct: 971 EPVNMLAIQEYEEVLKRLDELKEKRAKLEE 1000
>gnl|CDD|235362 PRK05192, PRK05192, tRNA uridine 5-carboxymethylaminomethyl
modification enzyme GidA; Validated.
Length = 618
Score = 30.8 bits (71), Expect = 1.0
Identities = 31/124 (25%), Positives = 56/124 (45%), Gaps = 20/124 (16%)
Query: 62 REPGLTLSTAQLER-NKTKEQIEEEKKIALNIRIKP--LNVEGWSIHKLQAKATEL---- 114
E GL + + R + KE IEEE + + R+ P LN G K + +L
Sbjct: 449 YELGL-VDDERWARFEEKKEAIEEEIERLKSTRVTPDELNELGGDPLKREVSLLDLLRRP 507
Query: 115 ---WETIVKLETDKYDLEERQKRQ--------DYDLKELKERQKQQ-LRHKALKKGLDPE 162
+E + KL+ + DL+ Q Y ++ +E +K + L +K + + +D +
Sbjct: 508 EITYEDLAKLDPELADLDPEVAEQVEIEIKYEGYIERQQEEIEKLKRLENKKIPEDIDYD 567
Query: 163 ALTG 166
A++G
Sbjct: 568 AISG 571
Score = 28.1 bits (64), Expect = 6.8
Identities = 23/84 (27%), Positives = 36/84 (42%), Gaps = 11/84 (13%)
Query: 226 REPGLTLSTAQLER-NKTKEQIEEEKKIALNIRIKP--LNVEGWSIHKLQAKATEL---- 278
E GL + + R + KE IEEE + + R+ P LN G K + +L
Sbjct: 449 YELGL-VDDERWARFEEKKEAIEEEIERLKSTRVTPDELNELGGDPLKREVSLLDLLRRP 507
Query: 279 ---WETIVKLETDKYDLEERQKRQ 299
+E + KL+ + DL+ Q
Sbjct: 508 EITYEDLAKLDPELADLDPEVAEQ 531
>gnl|CDD|219342 pfam07227, DUF1423, Protein of unknown function (DUF1423). This
family represents a conserved region approximately 500
residues long within a number of Arabidopsis thaliana
proteins of unknown function.
Length = 446
Score = 30.5 bits (69), Expect = 1.0
Identities = 18/51 (35%), Positives = 25/51 (49%), Gaps = 5/51 (9%)
Query: 171 KEEERRQREVEEKKQRDLEE-----KRQRLEEAEKKRQAMMAALKEQTNKS 216
+E ER QR K ++ EE + RL EAE++RQ + LK Q
Sbjct: 378 REAERLQRIALAKTEKSEEEYASKYLKLRLSEAEEERQYLFEELKLQEESH 428
Score = 30.5 bits (69), Expect = 1.2
Identities = 19/56 (33%), Positives = 26/56 (46%), Gaps = 5/56 (8%)
Query: 2 AQRKKEEEERRQREVEEKKQRDLEE-----KRQRLEEAEKKRQAMMAALKEQTNKS 52
A + E ER QR K ++ EE + RL EAE++RQ + LK Q
Sbjct: 373 ADEARREAERLQRIALAKTEKSEEEYASKYLKLRLSEAEEERQYLFEELKLQEESH 428
>gnl|CDD|222432 pfam13882, Bravo_FIGEY, C-terminal domain of Fibronectin type III.
This is the very C-terminal region of neural adhesion
molecule L1 proteins that are also known as Bravo or
NrCAM. It lies upstream of the IG and Fn3 domains and
has the highly conserved motif FIGEY. The function is
not known.
Length = 118
Score = 29.2 bits (66), Expect = 1.1
Identities = 12/48 (25%), Positives = 18/48 (37%), Gaps = 2/48 (4%)
Query: 166 GKYPVKEEERRQ--REVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKE 211
GKY VKE+E E + K E + + R + + E
Sbjct: 34 GKYSVKEKEDAPGDPEDQPMKDEGFGEYSRPEDNKPLPRGSQQSLDGE 81
>gnl|CDD|130141 TIGR01069, mutS2, MutS2 family protein. Function of MutS2 is
unknown. It should not be considered a DNA mismatch
repair protein. It is likely a DNA mismatch binding
protein of unknown cellular function [DNA metabolism,
Other].
Length = 771
Score = 30.6 bits (69), Expect = 1.2
Identities = 34/179 (18%), Positives = 68/179 (37%), Gaps = 20/179 (11%)
Query: 106 KLQAKATELWETIVKLETDKYDLEERQKRQDYDLKEL--------KERQKQQLRHKALKK 157
E+ I KL + +LE++ + + LKE +E ++ + R + K
Sbjct: 505 FYGEFKEEINVLIEKLSALEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKERERNKKL 564
Query: 158 GLDPEALTGKYPVKEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSK 217
L+ EA +E + ++ E R+L+EK+ K+ +++ +K + K K
Sbjct: 565 ELEKEA-------QEALKALKKEVESIIRELKEKK---IHKAKEIKSIEDLVKLKETKQK 614
Query: 218 GPN--FTIQRREPGLTLSTAQLERNKTKEQIEEEKKIALNIRIKPLNVEGWSIHKLQAK 274
P Q + G + + QI K + + + V G + K+
Sbjct: 615 IPQKPTNFQADKIGDKVRIRYFGQKGKIVQILGGNKWNVTVGGMRMKVHGSELEKINKA 673
>gnl|CDD|233503 TIGR01642, U2AF_lg, U2 snRNP auxilliary factor, large subunit,
splicing factor. These splicing factors consist of an
N-terminal arginine-rich low complexity domain followed
by three tandem RNA recognition motifs (pfam00076). The
well-characterized members of this family are auxilliary
components of the U2 small nuclear ribonuclearprotein
splicing factor (U2AF). These proteins are closely
related to the CC1-like subfamily of splicing factors
(TIGR01622). Members of this subfamily are found in
plants, metazoa and fungi.
Length = 509
Score = 30.2 bits (68), Expect = 1.4
Identities = 14/71 (19%), Positives = 27/71 (38%)
Query: 124 DKYDLEERQKRQDYDLKELKERQKQQLRHKALKKGLDPEALTGKYPVKEEERRQREVEEK 183
D+ ER+K + D ER +++ R ++ + + Y R +R + +
Sbjct: 2 DEEPDREREKSRGRDRDRSSERPRRRSRDRSRFRDRHRRSRERSYREDSRPRDRRRYDSR 61
Query: 184 KQRDLEEKRQR 194
R L R
Sbjct: 62 SPRSLRYSSVR 72
>gnl|CDD|240402 PTZ00399, PTZ00399, cysteinyl-tRNA-synthetase; Provisional.
Length = 651
Score = 30.0 bits (68), Expect = 1.5
Identities = 11/49 (22%), Positives = 26/49 (53%), Gaps = 5/49 (10%)
Query: 157 KGLDPEALTGKYPVKEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAM 205
K D E L +E+E ++ E+K+ R L+++ ++ ++ +K +
Sbjct: 546 KLDDKEEL-----QREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEKA 589
Score = 28.8 bits (65), Expect = 3.9
Identities = 12/37 (32%), Positives = 23/37 (62%)
Query: 3 QRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQ 39
QR+KEE+E + + +K + EEK+++ E +K +
Sbjct: 554 QREKEEKEALKEQKRLRKLKKQEEKKKKELEKLEKAK 590
>gnl|CDD|223522 COG0445, GidA, Flavin-dependent tRNA uridine
5-carboxymethylaminomethyl modification enzyme GidA
[Cell cycle control, cell division, chromosome
partitioning].
Length = 621
Score = 30.2 bits (69), Expect = 1.7
Identities = 35/143 (24%), Positives = 52/143 (36%), Gaps = 37/143 (25%)
Query: 62 REPGLTLSTAQLER-NKTKEQIEEEKKIALNIRIKPLNVEGWSIHKLQAKATELWETIVK 120
RE GL + + ER K KE IEEE + + + P V + L +K
Sbjct: 450 RELGL-VDDERYERFLKKKENIEEEIERLKSTWVTPSEV--------AKELLALGGQPLK 500
Query: 121 LETDKYDLEERQKRQDYDLKELKERQKQQLRHKALKKGLDPEALTGKYPVKEEERRQREV 180
+ YDL +R + +L L P + E Q E+
Sbjct: 501 RRSSLYDL---LRRPEISYDDLIS--------------LFPL----PADLDAEVLEQVEI 539
Query: 181 EEK------KQRDLEEKRQRLEE 197
E K +Q++ EK +RLE
Sbjct: 540 EIKYEGYIKRQQEQIEKLKRLEN 562
>gnl|CDD|235870 PRK06834, PRK06834, hypothetical protein; Provisional.
Length = 488
Score = 30.0 bits (68), Expect = 1.8
Identities = 36/143 (25%), Positives = 59/143 (41%), Gaps = 23/143 (16%)
Query: 157 KGLDPEALTGKY-----PVKEEERRQ-------REVEEKKQ--RDLEEKRQRLEEAEKKR 202
KG PE+L Y PV R +++ + RD+ + ++E K+
Sbjct: 304 KGTSPESLLDTYHAERHPVAARVLRNTMAQVALLRPDDRTEALRDIVAELLGMDEPRKRI 363
Query: 203 QAMMAALKEQTNKSKG-PNFTIQRREPGLTLSTAQLERNKTKEQIEEEKKIALNIRIKP- 260
AMM+ L + +G P + RR P L L TA R + + + + LN+ P
Sbjct: 364 AAMMSGLDIHYDLGEGHP--LLGRRMPDLDLVTADGPR-RVFTLLHNARPVLLNLG-APG 419
Query: 261 -LNVEGWS--IHKLQAKATELWE 280
++ WS + + AK WE
Sbjct: 420 AFDIAPWSDRVRLVDAKYAGPWE 442
Score = 28.8 bits (65), Expect = 3.4
Identities = 25/92 (27%), Positives = 41/92 (44%), Gaps = 9/92 (9%)
Query: 30 RLEEAEKKRQAMMAALKEQTNKSKG-PNFTIQRREPGLTLSTAQLERNKTKEQIEEEKKI 88
++E K+ AMM+ L + +G P + RR P L L TA R + + + +
Sbjct: 355 GMDEPRKRIAAMMSGLDIHYDLGEGHP--LLGRRMPDLDLVTADGPR-RVFTLLHNARPV 411
Query: 89 ALNIRIKP--LNVEGWS--IHKLQAKATELWE 116
LN+ P ++ WS + + AK WE
Sbjct: 412 LLNLG-APGAFDIAPWSDRVRLVDAKYAGPWE 442
>gnl|CDD|215064 PLN00122, PLN00122, serine/threonine protein phosphatase 2A;
Provisional.
Length = 170
Score = 29.3 bits (66), Expect = 1.8
Identities = 16/32 (50%), Positives = 21/32 (65%), Gaps = 4/32 (12%)
Query: 3 QRKKEEEERRQREVEEKKQRDLEEKRQRLEEA 34
RK EE+E + +EVEEK+ E +RLEEA
Sbjct: 135 LRKFEEDEAKAKEVEEKR----EATWKRLEEA 162
>gnl|CDD|218001 pfam04275, P-mevalo_kinase, Phosphomevalonate kinase.
Phosphomevalonate kinase (EC:2.7.4.2) catalyzes the
phosphorylation of 5-phosphomevalonate into
5-diphosphomevalonate, an essential step in isoprenoid
biosynthesis via the mevalonate pathway. This family
represents the animal type of the enzyme. The other is
the ERG8 type, found in plants and fungi, and some
bacteria (see pfam00288).
Length = 115
Score = 28.4 bits (64), Expect = 1.9
Identities = 17/39 (43%), Positives = 22/39 (56%), Gaps = 2/39 (5%)
Query: 151 RHKALKKGLDPEALTGKYPVKEEERRQ--REVEEKKQRD 187
A K GLD E L G P KE+ R+ + EEK+Q+D
Sbjct: 36 EEWARKHGLDLEELLGDGPYKEKYRKDMIKWGEEKRQKD 74
>gnl|CDD|234714 PRK00286, xseA, exodeoxyribonuclease VII large subunit; Reviewed.
Length = 438
Score = 29.8 bits (68), Expect = 2.0
Identities = 22/98 (22%), Positives = 48/98 (48%), Gaps = 5/98 (5%)
Query: 128 LEERQKRQDYDLKELKERQKQQLRHKALKKGLDPEALTGKYPVKEEERRQREVEEKKQRD 187
L ++Q+R D + L+ +++LR + + L + P + ER Q+ +E+ +QR
Sbjct: 307 LAQQQQRLDRLQQRLQRALERRLRLAKQRLERLSQRLQQQNPQRRIERAQQRLEQLEQRL 366
Query: 188 LEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQR 225
R++L+ ++ +A+ L ++ P T+ R
Sbjct: 367 RRAMRRQLKRKRQRLEALAQQL-----EALSPLATLAR 399
>gnl|CDD|227355 COG5022, COG5022, Myosin heavy chain [Cytoskeleton].
Length = 1463
Score = 30.0 bits (68), Expect = 2.1
Identities = 52/294 (17%), Positives = 100/294 (34%), Gaps = 19/294 (6%)
Query: 3 QRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRR 62
Q+ + ++R KK+ + QR+E AE++ Q + +K + K N ++
Sbjct: 849 QKFGRSLKAKKRFSLLKKETIYLQSAQRVELAERQLQELKIDVKS-ISSLKLVNLELESE 907
Query: 63 EPGLTLSTAQLERNKTK---EQIEEEKKIALNIRIKPLNVEGWSIHK-LQAKATELWETI 118
L S + + E I KK+ NI ++ EG SI + +L E
Sbjct: 908 IIELKKSLSSDLIENLEFKTELIARLKKLLNNIDLE----EGPSIEYVKLPELNKLHEVE 963
Query: 119 VKLETDKYDLEERQKRQDYDLKELKERQKQQLRHKALKKGLDPEALTGKYPVKEEERRQR 178
KL+ + E+ K+ ++E + + K L + + K+ +
Sbjct: 964 SKLKETSEEYEDLLKKSTILVREGNKANSELKNFKKELAELSKQYGALQESTKQLKELPV 1023
Query: 179 EVEEKKQ------RDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREPGL-T 231
EV E + + E + K + L+ +++ ++R L
Sbjct: 1024 EVAELQSASKIISSESTELSILKPLQKLKGLLL---LENNQLQARYKALKLRRENSLLDD 1080
Query: 232 LSTAQLERNKTKEQIEEEKKIALNIRIKPLNVEGWSIHKLQAKATELWETIVKL 285
QLE + + K + + R Q L + I K
Sbjct: 1081 KQLYQLESTENLLKTINVKDLEVTNRNLVKPANVLQFIVAQMIKLNLLQEISKF 1134
>gnl|CDD|178945 PRK00247, PRK00247, putative inner membrane protein translocase
component YidC; Validated.
Length = 429
Score = 29.4 bits (66), Expect = 2.2
Identities = 10/52 (19%), Positives = 20/52 (38%)
Query: 3 QRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKG 54
K E + R++ ++++ + E R+ +E A K KG
Sbjct: 338 AEKNEAKARKKEIAQKRRAAEREINREARQERAAAMARARARRAAVKAKKKG 389
Score = 28.7 bits (64), Expect = 4.0
Identities = 14/65 (21%), Positives = 27/65 (41%)
Query: 3 QRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRR 62
RKKE ++R+ E + +E+ + A +R A+ A K + S + +
Sbjct: 345 ARKKEIAQKRRAAEREINREARQERAAAMARARARRAAVKAKKKGLIDASPNEDTPSENE 404
Query: 63 EPGLT 67
E +
Sbjct: 405 ESKGS 409
>gnl|CDD|239571 cd03489, Topoisomer_IB_N_LdtopoI_like,
Topoisomer_IB_N_LdtopoI_like: N-terminal DNA binding
fragment found in eukaryotic DNA topoisomerase (topo) IB
proteins similar to the heterodimeric topo I from
Leishmania donvanni. Topo I enzymes are divided into:
topo type IA (bacterial) and type IB (eukaryotic). Topo
I relaxes superhelical tension in duplex DNA by creating
a single-strand nick, the broken strand can then rotate
around the unbroken strand to remove DNA supercoils and,
the nick is religated, liberating topo I. These enzymes
regulate the topological changes that accompany DNA
replication, transcription and other nuclear processes.
Human topo I is the target of a diverse set of
anticancer drugs including camptothecins (CPTs). CPTs
bind to the topo I-DNA complex and inhibit re-ligation
of the single-strand nick, resulting in the accumulation
of topo I-DNA adducts. In addition to differences in
structure and some biochemical properties,
Trypanosomatid parasite topo I differ from human topo I
in their sensitivity to CPTs and other classical topo I
inhibitors. Trypanosomatid topo I play putative roles in
organizing the kinetoplast DNA network unique to these
parasites. This family may represent more than one
structural domain.
Length = 212
Score = 29.1 bits (65), Expect = 2.2
Identities = 16/43 (37%), Positives = 22/43 (51%), Gaps = 4/43 (9%)
Query: 10 ERRQREVEEKKQRDLEE----KRQRLEEAEKKRQAMMAALKEQ 48
E RE E+KK R EE K ++ +EAE + +KEQ
Sbjct: 89 EWHLREKEKKKSRTKEEKKALKEEKDKEAEPYMWCVWDGVKEQ 131
Score = 29.1 bits (65), Expect = 2.2
Identities = 16/43 (37%), Positives = 22/43 (51%), Gaps = 4/43 (9%)
Query: 174 ERRQREVEEKKQRDLEE----KRQRLEEAEKKRQAMMAALKEQ 212
E RE E+KK R EE K ++ +EAE + +KEQ
Sbjct: 89 EWHLREKEKKKSRTKEEKKALKEEKDKEAEPYMWCVWDGVKEQ 131
>gnl|CDD|234780 PRK00488, pheS, phenylalanyl-tRNA synthetase subunit alpha;
Validated.
Length = 339
Score = 29.3 bits (67), Expect = 2.5
Identities = 23/72 (31%), Positives = 36/72 (50%), Gaps = 11/72 (15%)
Query: 138 DLKELKERQKQQLRHKAL-KKGLDPEALTGKYPVKEEERRQR-----EVEEKKQRDLEEK 191
DL+ L + LR K L KKG E L G + EER++ E+++ + LEE+
Sbjct: 21 DLEAL-----EALRVKYLGKKGELTELLKGLGKLPPEERKEAGALINELKQAIEAALEER 75
Query: 192 RQRLEEAEKKRQ 203
++ LE A +
Sbjct: 76 KEELEAAALNAR 87
>gnl|CDD|180664 PRK06705, PRK06705, argininosuccinate lyase; Provisional.
Length = 502
Score = 29.6 bits (66), Expect = 2.5
Identities = 25/86 (29%), Positives = 46/86 (53%), Gaps = 13/86 (15%)
Query: 128 LEERQKRQDYDLKE----LKERQKQQLRHKALKKGLDPEALTGKYPV------KEEER-- 175
LE++++ + K+ L+E+ K QL K ++ + PEA K V KE ER
Sbjct: 405 LEQKKELHELCFKDVNIYLQEKFKIQLLEKEWEEIISPEAFIQKRNVYGGPSKKEMERMI 464
Query: 176 -RQREVEEKKQRDLEEKRQRLEEAEK 200
++E+ K++ E+++QR+ +AE
Sbjct: 465 NNRKELFRKEEEVFEKEKQRILQAEN 490
>gnl|CDD|236861 PRK11147, PRK11147, ABC transporter ATPase component; Reviewed.
Length = 635
Score = 29.5 bits (67), Expect = 2.7
Identities = 27/105 (25%), Positives = 45/105 (42%), Gaps = 12/105 (11%)
Query: 146 QKQQLRHKALKKGLDPEALTGKYPVKEE-ERRQREVEEKKQRDLEEKRQRLEEAEKKRQA 204
++QQ ++ ALK+ + P E +R +++ K QR+LE+ Q LE+ E +
Sbjct: 527 RQQQAQYLALKQPAVKKKEEAAAPKAETVKRSSKKLSYKLQRELEQLPQLLEDLEAE--- 583
Query: 205 MMAALKEQTNKSKGPNFTIQRREPGLTLSTAQLERNKTKEQIEEE 249
+ AL+ Q +F Q E + L EQ E
Sbjct: 584 -IEALQAQVA---DADFFSQPHE----QTQKVLADLADAEQELEV 620
>gnl|CDD|218209 pfam04678, DUF607, Protein of unknown function, DUF607. This
family represents a conserved region found in several
uncharacterized eukaryotic proteins.
Length = 180
Score = 28.5 bits (64), Expect = 2.8
Identities = 8/61 (13%), Positives = 22/61 (36%)
Query: 155 LKKGLDPEALTGKYPVKEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTN 214
L+ +A+ P+ Q + ++ LE + L E+ ++ + +
Sbjct: 33 LRPSFVVDAVHRLLPLPNVSEHQLDRTHYLRKRLEALSEELAPLEEIKKEIDRLAHRRAR 92
Query: 215 K 215
+
Sbjct: 93 R 93
>gnl|CDD|235461 PRK05431, PRK05431, seryl-tRNA synthetase; Provisional.
Length = 425
Score = 28.9 bits (66), Expect = 3.2
Identities = 12/44 (27%), Positives = 24/44 (54%), Gaps = 1/44 (2%)
Query: 2 AQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAAL 45
A+RK E+ E EV+E K+ ++ L+E E + + ++ +
Sbjct: 61 AKRKGEDAEALIAEVKELKEE-IKALEAELDELEAELEELLLRI 103
>gnl|CDD|214661 smart00435, TOPEUc, DNA Topoisomerase I (eukaryota). DNA
Topoisomerase I (eukaryota), DNA topoisomerase V,
Vaccina virus topoisomerase, Variola virus
topoisomerase, Shope fibroma virus topoisomeras.
Length = 391
Score = 28.9 bits (65), Expect = 3.4
Identities = 18/84 (21%), Positives = 42/84 (50%), Gaps = 8/84 (9%)
Query: 120 KLETDKYDLEERQKRQDYDLKELKERQKQQLRHKALKKGLDPEALTGKYPVKEEERRQRE 179
K++ KY L+ +K ++K + + + + LD E +E++++++
Sbjct: 285 KIKALKYQLKRLKKMILLFEMISDLKRKLKSKFERDNEKLDAE--------VKEKKKEKK 336
Query: 180 VEEKKQRDLEEKRQRLEEAEKKRQ 203
EEKK++ +E +R+E+ E +
Sbjct: 337 KEEKKKKQIERLEERIEKLEVQAT 360
>gnl|CDD|218038 pfam04350, PilO, Pilus assembly protein, PilO. PilO proteins are
involved in the assembly of pilin. However, the precise
function of this family of proteins is not known.
Length = 144
Score = 27.9 bits (63), Expect = 4.2
Identities = 13/49 (26%), Positives = 29/49 (59%), Gaps = 2/49 (4%)
Query: 3 QRKKEEEERRQREVEEKKQR--DLEEKRQRLEEAEKKRQAMMAALKEQT 49
+R + +E ++E EEK ++ +L +++LEE E++ A++ L +
Sbjct: 2 ERAQAKEATLKQEFEEKARQAANLPAYKKQLEELEERFGALLKQLPSKK 50
Score = 27.1 bits (61), Expect = 8.0
Identities = 12/46 (26%), Positives = 26/46 (56%), Gaps = 2/46 (4%)
Query: 170 VKEEERRQREVEEKKQR--DLEEKRQRLEEAEKKRQAMMAALKEQT 213
+E ++E EEK ++ +L +++LEE E++ A++ L +
Sbjct: 5 QAKEATLKQEFEEKARQAANLPAYKKQLEELEERFGALLKQLPSKK 50
>gnl|CDD|222290 pfam13654, AAA_32, AAA domain. This family includes a wide variety
of AAA domains including some that have lost essential
nucleotide binding residues in the P-loop.
Length = 509
Score = 28.6 bits (65), Expect = 4.7
Identities = 28/119 (23%), Positives = 55/119 (46%), Gaps = 15/119 (12%)
Query: 106 KLQAKATELWETIVK-----LETDKYDLEERQKRQDYDLKELKERQKQQLRHKALKKGL- 159
+L+ EL E + K E+++Y E R++ + + +E +E ++L +A +KG
Sbjct: 88 ELKKDMEELVEELKKEIPAAFESEEY--EARKEEIEEEFQEKREEAFEELEEEAKEKGFA 145
Query: 160 ---DPEALTGKYPVKEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNK 215
P P+K+ E EE+ + EE+R+ LEE + + + + Q +
Sbjct: 146 LVRTPGGFVF-APLKDGEPLT---EEEFEALPEEEREELEEKIDELEEELQEILRQLRE 200
>gnl|CDD|116627 pfam08017, Fibrinogen_BP, Fibrinogen binding protein. Proteins in
this family bind to fibrinogen. Members of this family
includes the fibrinogen receptor, FbsA, which mediates
platelet aggregation.
Length = 393
Score = 28.7 bits (63), Expect = 4.8
Identities = 49/243 (20%), Positives = 103/243 (42%), Gaps = 20/243 (8%)
Query: 10 ERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREPGLTLS 69
ERRQR+ E + Q ++ E+RQR E + + ++ N+S+G ++R+
Sbjct: 32 ERRQRDAENRSQGNVLERRQRDAENRSQGNVLERRQRDAENRSQGNVLERRQRDAENRSQ 91
Query: 70 TAQLERNKTKEQIEEEKKIALNIRIKPLNVEGWSIHKLQAKATELWETIVKLETDKYDLE 129
LER + + + + + + N ++ + + + E LE + D E
Sbjct: 92 GNVLERRQRDVENKSQGNVLERRQRDVENKSQGNVLERRQRDAENRSQGNVLERRQRDAE 151
Query: 130 ERQKRQDYDLKELKERQKQQLRHKALKKGLDPEALTGKYPVKEEERRQREVEEKKQRDLE 189
+ + + ER+++ +++ L ERRQR+VE K Q ++
Sbjct: 152 NKSQ------GNVLERRQRDAENRSQGNVL--------------ERRQRDVENKSQGNVL 191
Query: 190 EKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREPGLTLSTAQLERNKTKEQIEEE 249
E+RQR E + + + ++ N+S+G ++R+ LER + + + +
Sbjct: 192 ERRQRDVENKSQGNVLERRQRDAENRSQGNVLERRQRDVENKSQGNVLERRQRDVENKSQ 251
Query: 250 KKI 252
+
Sbjct: 252 GNV 254
Score = 28.3 bits (62), Expect = 5.9
Identities = 49/232 (21%), Positives = 97/232 (41%), Gaps = 4/232 (1%)
Query: 10 ERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREPGLTLS 69
ERRQR+ E + Q ++ E+RQR E + + ++ N+S+G ++R+
Sbjct: 48 ERRQRDAENRSQGNVLERRQRDAENRSQGNVLERRQRDAENRSQGNVLERRQRDVENKSQ 107
Query: 70 TAQLERNKTKEQIEEEKKIALNIRIKPLNVEGWSIHKLQAKATELWETIVKLETDKYDLE 129
LER + + + + + + N ++ + + + E LE + D E
Sbjct: 108 GNVLERRQRDVENKSQGNVLERRQRDAENRSQGNVLERRQRDAENKSQGNVLERRQRDAE 167
Query: 130 ERQKRQDYDLKELKERQKQQLRHKALKKGLDPEALTGKYPVKEEERRQREVEEKKQRDLE 189
R + + ++ K Q ++ + K ERRQR+ E + Q ++
Sbjct: 168 NRSQGNVLERRQRDVENKSQGNVLERRQ----RDVENKSQGNVLERRQRDAENRSQGNVL 223
Query: 190 EKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREPGLTLSTAQLERNK 241
E+RQR E + + + ++ NKS+G ++R+ LER +
Sbjct: 224 ERRQRDVENKSQGNVLERRQRDVENKSQGNVLERRQRDAENRSQGNVLERRQ 275
Score = 28.3 bits (62), Expect = 6.1
Identities = 52/237 (21%), Positives = 105/237 (44%), Gaps = 10/237 (4%)
Query: 10 ERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREPGLTLS 69
ERRQR+VE K Q ++ E+RQR E + + + ++ N+S+G ++R+
Sbjct: 96 ERRQRDVENKSQGNVLERRQRDVENKSQGNVLERRQRDAENRSQGNVLERRQRDAENKSQ 155
Query: 70 TAQLERNKTKEQIEEEKKIALNIRIKPLNVEGWSIHKLQAKATELWETIVKLETDKYDLE 129
LER + + + + + N ++ + + + E LE + D E
Sbjct: 156 GNVLERRQRDAENRSQGNVLERRQRDVENKSQGNVLERRQRDVENKSQGNVLERRQRDAE 215
Query: 130 ERQKRQDYDLKELKERQKQQLRHKALKKGLDPEALTGKYPVKEE--ERRQREVEEKKQRD 187
R + + ER+++ + +K+ L+ + + ERRQR+ E + Q +
Sbjct: 216 NRSQ------GNVLERRQRDVENKSQGNVLERRQRDVENKSQGNVLERRQRDAENRSQGN 269
Query: 188 LEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREPGLTLSTAQLERNKTKE 244
+ E+RQR + E K Q + +++ ++K + + P L+ S E N + +
Sbjct: 270 VLERRQR--DVENKSQGNVLERRQRDAENKSQVGQLIGKNPLLSKSIISRENNHSSQ 324
Score = 27.9 bits (61), Expect = 7.2
Identities = 46/209 (22%), Positives = 91/209 (43%), Gaps = 4/209 (1%)
Query: 10 ERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREPGLTLS 69
ERRQR+ E + Q ++ E+RQR E + + + ++ NKS+G ++R+
Sbjct: 80 ERRQRDAENRSQGNVLERRQRDVENKSQGNVLERRQRDVENKSQGNVLERRQRDAENRSQ 139
Query: 70 TAQLERNKTKEQIEEEKKIALNIRIKPLNVEGWSIHKLQAKATELWETIVKLETDKYDLE 129
LER + + + + + + N ++ + + + E LE + D+E
Sbjct: 140 GNVLERRQRDAENKSQGNVLERRQRDAENRSQGNVLERRQRDVENKSQGNVLERRQRDVE 199
Query: 130 ERQKRQDYDLKELKERQKQQLRHKALKKGLDPEALTGKYPVKEEERRQREVEEKKQRDLE 189
+ + + ++ + Q ++ + K ERRQR+VE K Q ++
Sbjct: 200 NKSQGNVLERRQRDAENRSQGNVLERRQ----RDVENKSQGNVLERRQRDVENKSQGNVL 255
Query: 190 EKRQRLEEAEKKRQAMMAALKEQTNKSKG 218
E+RQR E + + ++ NKS+G
Sbjct: 256 ERRQRDAENRSQGNVLERRQRDVENKSQG 284
>gnl|CDD|191187 pfam05087, Rota_VP2, Rotavirus VP2 protein. Rotavirus particles
consist of three concentric proteinaceous capsid
layers. The innermost capsid (core) is made of VP2. The
genomic RNA and the two minor proteins VP1 and VP3 are
encapsidated within this layer. The N-terminus of
rotavirus VP2 is necessary for the encapsidation of VP1
and VP3.
Length = 887
Score = 28.4 bits (63), Expect = 5.7
Identities = 15/81 (18%), Positives = 34/81 (41%), Gaps = 6/81 (7%)
Query: 1 MAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQ 60
MA R + E + ++K + ++++ R+E EK K++ + + ++
Sbjct: 1 MAYRNRREANINNNDRMQEKDDEKQDQKNRMELKEKVLD------KKEEVVTDNVDSPVK 54
Query: 61 RREPGLTLSTAQLERNKTKEQ 81
+ L A + TKE+
Sbjct: 55 EQSSQENLKIADEVKKSTKEE 75
>gnl|CDD|236892 PRK11281, PRK11281, hypothetical protein; Provisional.
Length = 1113
Score = 28.3 bits (64), Expect = 5.8
Identities = 28/106 (26%), Positives = 49/106 (46%), Gaps = 16/106 (15%)
Query: 147 KQQLRHKALKKGLDPEALTGKYPVKEEERRQREVE--EKKQRDLEEKRQRLEEAEKK-RQ 203
+ QL AL K EA K ++ E+ ++ ++++ + E+ +Q+L +A K RQ
Sbjct: 42 QAQL--DALNKQKLLEA-EDKLVQQDLEQTLALLDKIDRQKEETEQLKQQLAQAPAKLRQ 98
Query: 204 AM--MAALKEQTNKSKGPNFTIQRREPGLTLSTAQLERNKTKEQIE 247
A + ALK+ ++ RE TLS QLE + +
Sbjct: 99 AQAELEALKDDNDEET--------RETLSTLSLRQLESRLAQTLDQ 136
>gnl|CDD|129242 TIGR00136, gidA, glucose-inhibited division protein A. GidA, the
longer of two forms of GidA-related proteins, appears to
be present in all complete eubacterial genomes so far,
as well as Saccharomyces cerevisiae. A subset of these
organisms have a closely related protein. GidA is absent
in the Archaea. It appears to act with MnmE, in an
alpha2/beta2 heterotetramer, in the
5-carboxymethylaminomethyl modification of uridine 34 in
certain tRNAs. The shorter, related protein, previously
called gid or gidA(S), is now called TrmFO (see model
TIGR00137) [Protein synthesis, tRNA and rRNA base
modification].
Length = 617
Score = 28.5 bits (64), Expect = 5.8
Identities = 28/131 (21%), Positives = 47/131 (35%), Gaps = 42/131 (32%)
Query: 76 NKTKEQIEEEKKIALNIRIKPLNVEGWSIHKLQAKATELWETIVKLET----DKYDLEER 131
K KE IEEE + + + P + E+ E + + E+
Sbjct: 462 LKKKENIEEEIQRLKSTWLTP--------------SKEVKEELKNHLQSPLKREASGEDL 507
Query: 132 QKRQDYDLKELKERQKQQLRHKALKKGLDPEALTGKYPVKEEERRQREVEEK------KQ 185
+R + +L++L + L P + EE Q E++ K KQ
Sbjct: 508 LRRPEMNLEKLTK--------------LTPF----LPALDEEVLEQVEIQIKYEGYIKKQ 549
Query: 186 RDLEEKRQRLE 196
+D +K RLE
Sbjct: 550 QDEIKKLDRLE 560
>gnl|CDD|235334 PRK05035, PRK05035, electron transport complex protein RnfC;
Provisional.
Length = 695
Score = 28.4 bits (64), Expect = 5.9
Identities = 22/99 (22%), Positives = 36/99 (36%), Gaps = 12/99 (12%)
Query: 2 AQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKK-----------RQAMMAALKEQTN 50
Q KK+ EE + R E ++ R EK R +K A +A +K +
Sbjct: 442 EQEKKKAEEAKAR-FEARQARLEREKAAREARHKKAAEARAAKDKDAVAAALARVKAKKA 500
Query: 51 KSKGPNFTIQRREPGLTLSTAQLERNKTKEQIEEEKKIA 89
+ P P + A E K + + + +K A
Sbjct: 501 AATQPIVIKAGARPDNSAVIAAREARKAQARARQAEKQA 539
>gnl|CDD|215214 PLN02381, PLN02381, valyl-tRNA synthetase.
Length = 1066
Score = 28.3 bits (63), Expect = 6.3
Identities = 10/51 (19%), Positives = 27/51 (52%)
Query: 1 MAQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNK 51
+ + E++ + E+E KK+++ + K + L++ + ++ A L+ Q
Sbjct: 3 RTESEAEKKILTEEELERKKKKEEKAKEKELKKLKAAQKEAKAKLQAQQAS 53
>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682). The
members of this family are all hypothetical eukaryotic
proteins of unknown function. One member is described as
being an adipocyte-specific protein, but no evidence of
this was found.
Length = 322
Score = 28.0 bits (63), Expect = 6.5
Identities = 15/39 (38%), Positives = 24/39 (61%), Gaps = 7/39 (17%)
Query: 2 AQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQA 40
++KKEE E + ++ ++QR LEEK E+K+QA
Sbjct: 289 EEKKKEEREAKLAKLSPEEQRKLEEK-------ERKKQA 320
>gnl|CDD|178867 PRK00106, PRK00106, hypothetical protein; Provisional.
Length = 535
Score = 27.9 bits (62), Expect = 6.9
Identities = 32/139 (23%), Positives = 65/139 (46%), Gaps = 16/139 (11%)
Query: 117 TIVKLETDKYDLEERQKRQDYDLKELKERQKQQLRHKALKKGLDPEALTGKYPVKEEERR 176
T++ E + +L + +R +K+ +R+ + L+ + L + KEE R+
Sbjct: 36 TLLNAEQEAVNLRGKAERDAEHIKKTAKRESKALKKELLLEA------------KEEARK 83
Query: 177 QREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREPGLTLSTAQ 236
RE E+ + + +RQ L++ E + +L + T++ +E LT +
Sbjct: 84 YREEIEQ---EFKSERQELKQIESRLTERATSLDRKDENLSSKEKTLESKEQSLTDKSKH 140
Query: 237 L-ERNKTKEQIEEEKKIAL 254
+ ER + E++EE+KK L
Sbjct: 141 IDEREEQVEKLEEQKKAEL 159
>gnl|CDD|233357 TIGR01322, scrB_fam, sucrose-6-phosphate hydrolase. [Energy
metabolism, Biosynthesis and degradation of
polysaccharides].
Length = 445
Score = 28.1 bits (63), Expect = 7.2
Identities = 26/108 (24%), Positives = 38/108 (35%), Gaps = 12/108 (11%)
Query: 168 YPVKE-EERRQREVEEKKQR-----DLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNF 221
P++E + R E L +K A L TNK +
Sbjct: 322 TPLRELKALRTEEHINVFGDQEHTLPGLNGEFELILDLEKDSAFELGLAL-TNKGEETLL 380
Query: 222 TIQRREPGLTL---STAQLERNKT--KEQIEEEKKIALNIRIKPLNVE 264
TI E +TL S+ LE + KK++L+I I +VE
Sbjct: 381 TIDADEGKVTLDRRSSGNLEDYGGTRSCPLPNTKKVSLHIFIDKSSVE 428
>gnl|CDD|240439 PTZ00491, PTZ00491, major vault protein; Provisional.
Length = 850
Score = 28.1 bits (63), Expect = 7.5
Identities = 32/151 (21%), Positives = 64/151 (42%), Gaps = 38/151 (25%)
Query: 71 AQLERNKTKEQI--EEEKKIALNIRIKPLNVE--GWSIHKLQAKATELWETIVKLETDKY 126
+LER K ++ EE++ L ++ + VE G S + A+A +++ E +
Sbjct: 681 GRLERQKMHDKAKAEEQRTKLLELQAESAAVESSGQSRAEALAEAEAR---LIEAEAEVE 737
Query: 127 DLEERQK----RQDYDLKELKERQKQQLRHKALKKGLDPEALTGKYPVKEEERRQREVEE 182
E R K + +L++L++RQ+ +L + E+ Q E+E
Sbjct: 738 QAELRAKALRIEAEAELEKLRKRQELELEY---------------------EQAQNELEI 776
Query: 183 KKQRDLEEKRQRLEEAEKKRQAMMAALKEQT 213
K ++L + K + ++ AL +T
Sbjct: 777 AKAKELADIEA------TKFERIVEALGRET 801
>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
Length = 520
Score = 27.8 bits (63), Expect = 7.6
Identities = 13/39 (33%), Positives = 26/39 (66%)
Query: 4 RKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMM 42
K+EEE ++ + E+KQ++LE+K + LEE +++ +
Sbjct: 106 EKREEELEKKEKELEQKQQELEKKEEELEELIEEQLQEL 144
>gnl|CDD|217051 pfam02463, SMC_N, RecF/RecN/SMC N terminal domain. This domain is
found at the N terminus of SMC proteins. The SMC
(structural maintenance of chromosomes) superfamily
proteins have ATP-binding domains at the N- and
C-termini, and two extended coiled-coil domains
separated by a hinge in the middle. The eukaryotic SMC
proteins form two kind of heterodimers: the SMC1/SMC3
and the SMC2/SMC4 types. These heterodimers constitute
an essential part of higher order complexes, which are
involved in chromatin and DNA dynamics. This family also
includes the RecF and RecN proteins that are involved in
DNA metabolism and recombination.
Length = 1162
Score = 28.0 bits (62), Expect = 7.6
Identities = 54/294 (18%), Positives = 108/294 (36%), Gaps = 14/294 (4%)
Query: 6 KEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREPG 65
EE + Q +++ + E Q E+ E + + ++ + N+ + R+
Sbjct: 192 DLEELKLQELKLKEQAKKALEYYQLKEKLELEEENLLYLDYLKLNEERIDLLQELLRDEQ 251
Query: 66 LTLSTAQLERNKTKEQIEEEKKIALNIRIKPLNVEGWSIHKLQAKATELWETIVKLETDK 125
+ +++ E K +E + + K K ++ + L + EL ++KLE K
Sbjct: 252 EEIESSKQELEKEEEILAQVLKE-NKEEEKEKKLQEEELKLLAKEEEELKSELLKLERRK 310
Query: 126 YDLEERQKRQDYDLKELKERQKQQLRHKALKKGLDPEALTGKYPVKEEERRQREVEEKKQ 185
D EE+ K + +LK+L++ K++ + L +E E + E EK Q
Sbjct: 311 VDDEEKLKESEKELKKLEKELKKEKEEIEELEK----ELKELEIKREAEEEEEEQLEKLQ 366
Query: 186 RDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREPGLTLSTAQLERNKTKEQ 245
LE+ + L +K +++ + + E + EQ
Sbjct: 367 EKLEQLEEELLAKKKLESERLSSAAKLKEEEL---------ELKNEEEKEAKLLLELSEQ 417
Query: 246 IEEEKKIALNIRIKPLNVEGWSIHKLQAKATELWETIVKLETDKYDLEERQKRQ 299
E+ K +K + S+ Q K TE E + K + K+
Sbjct: 418 EEDLLKEEKKEELKIVEELEESLETKQGKLTEEKEELEKQALKLLKDKLELKKS 471
>gnl|CDD|236766 PRK10811, rne, ribonuclease E; Reviewed.
Length = 1068
Score = 27.7 bits (62), Expect = 8.7
Identities = 13/63 (20%), Positives = 32/63 (50%), Gaps = 3/63 (4%)
Query: 2 AQRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQR 61
+ + ++E+++ E +++R+ +EKRQ +EA K + ++T + + R
Sbjct: 663 EKARTQDEQQQAPRRERQRRRN-DEKRQAQQEA--KALNVEEQSVQETEQEERVQQVQPR 719
Query: 62 REP 64
R+
Sbjct: 720 RKQ 722
>gnl|CDD|238356 cd00660, Topoisomer_IB_N, Topoisomer_IB_N: N-terminal DNA binding
fragment found in eukaryotic DNA topoisomerase (topo) IB
proteins similar to the monomeric yeast and human topo I
and heterodimeric topo I from Leishmania donvanni. Topo
I enzymes are divided into: topo type IA (bacterial)
and type IB (eukaryotic). Topo I relaxes superhelical
tension in duplex DNA by creating a single-strand nick,
the broken strand can then rotate around the unbroken
strand to remove DNA supercoils and, the nick is
religated, liberating topo I. These enzymes regulate the
topological changes that accompany DNA replication,
transcription and other nuclear processes. Human topo I
is the target of a diverse set of anticancer drugs
including camptothecins (CPTs). CPTs bind to the topo
I-DNA complex and inhibit re-ligation of the
single-strand nick, resulting in the accumulation of
topo I-DNA adducts. In addition to differences in
structure and some biochemical properties,
Trypanosomatid parasite topo I differ from human topo I
in their sensitivity to CPTs and other classical topo I
inhibitors. Trypanosomatid topos I play putative roles
in organizing the kinetoplast DNA network unique to
these parasites. This family may represent more than
one structural domain.
Length = 215
Score = 27.2 bits (61), Expect = 9.3
Identities = 10/44 (22%), Positives = 22/44 (50%), Gaps = 4/44 (9%)
Query: 5 KKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQ 48
++E+E+++ EEKK K ++ + E ++ KE+
Sbjct: 95 EEEKEKKKAMSKEEKK----AIKEEKEKLEEPYGYCLVDGHKEK 134
>gnl|CDD|188306 TIGR03319, RNase_Y, ribonuclease Y. Members of this family are
RNase Y, an endoribonuclease. The member from Bacillus
subtilis, YmdA, has been shown to be involved in
turnover of yitJ riboswitch [Transcription, Degradation
of RNA].
Length = 514
Score = 27.6 bits (62), Expect = 9.4
Identities = 19/86 (22%), Positives = 40/86 (46%), Gaps = 5/86 (5%)
Query: 6 KEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRREPG 65
KEE + + E+E R+L+E+R L+ E++ L + ++++E
Sbjct: 57 KEEVHKLRAELE----RELKERRNELQRLERRLLQREETLDRKMESLDKKEENLEKKEKE 112
Query: 66 LTLSTAQL-ERNKTKEQIEEEKKIAL 90
L+ L E+ + E++ E++ L
Sbjct: 113 LSNKEKNLDEKEEELEELIAEQREEL 138
>gnl|CDD|236545 PRK09510, tolA, cell envelope integrity inner membrane protein
TolA; Provisional.
Length = 387
Score = 27.5 bits (61), Expect = 9.6
Identities = 15/87 (17%), Positives = 34/87 (39%)
Query: 3 QRKKEEEERRQREVEEKKQRDLEEKRQRLEEAEKKRQAMMAALKEQTNKSKGPNFTIQRR 62
Q+K+ E+ R +++E+++ E+K+Q E A++ A + + +
Sbjct: 93 QQKQAAEQERLKQLEKERLAAQEQKKQAEEAAKQAALKQKQAEEAAAKAAAAAKAKAEAE 152
Query: 63 EPGLTLSTAQLERNKTKEQIEEEKKIA 89
+ + K+ E K A
Sbjct: 153 AKRAAAAAKKAAAEAKKKAEAEAAKKA 179
>gnl|CDD|183207 PRK11570, PRK11570, peptidyl-prolyl cis-trans isomerase;
Provisional.
Length = 206
Score = 27.1 bits (60), Expect = 9.7
Identities = 19/65 (29%), Positives = 25/65 (38%), Gaps = 12/65 (18%)
Query: 148 QQLRHKALKKGLDPEAL-----------TGKYPVKEEERRQREVEEKKQRDLEEKRQRLE 196
QQL L+ GL PEAL PV R RE+ E+ E++Q +
Sbjct: 23 QQLSESGLE-GLLPEALVAGLADALEGKHPAVPVDVVHRALREIHERADAVRRERQQAMA 81
Query: 197 EAEKK 201
K
Sbjct: 82 AEGVK 86
>gnl|CDD|224340 COG1422, COG1422, Predicted membrane protein [Function unknown].
Length = 201
Score = 26.9 bits (60), Expect = 9.8
Identities = 10/45 (22%), Positives = 20/45 (44%), Gaps = 5/45 (11%)
Query: 115 WETIVKL-----ETDKYDLEERQKRQDYDLKELKERQKQQLRHKA 154
E + +L E K E ++ LK+L+E+Q + + +
Sbjct: 71 QEKMKELQKMMKEFQKEFREAQESGDMKKLKKLQEKQMEMMDDQR 115
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.307 0.125 0.329
Gapped
Lambda K H
0.267 0.0677 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 15,484,844
Number of extensions: 1555708
Number of successful extensions: 9545
Number of sequences better than 10.0: 1
Number of HSP's gapped: 6370
Number of HSP's successfully gapped: 1373
Length of query: 302
Length of database: 10,937,602
Length adjustment: 96
Effective length of query: 206
Effective length of database: 6,679,618
Effective search space: 1376001308
Effective search space used: 1376001308
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.6 bits)
S2: 59 (26.6 bits)