RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy4747
(414 letters)
>gnl|CDD|227640 COG5333, CCL1, Cdk activating kinase (CAK)/RNA polymerase II
transcription initiation/nucleotide excision repair
factor TFIIH/TFIIK, cyclin H subunit [Cell division and
chromosome partitioning / Transcription / DNA
replication, recombination, and repair].
Length = 297
Score = 102 bits (255), Expect = 1e-24
Identities = 59/185 (31%), Positives = 96/185 (51%), Gaps = 13/185 (7%)
Query: 55 SFREGIDYETEQRYRREGARFIIDCGTKMDLGYNTMATGVVFFHRFYMYHSFKTFPRYVT 114
+ E E + I+D T+++L +AT ++FF RFY+ +S + Y
Sbjct: 31 VLEPELTLEKELNLVIYYLKLIMDLCTRLNLPQTVLATAILFFSRFYLKNSVEEISLYSV 90
Query: 115 ASCCLFLAGKVEETPKKCKDVIKTARMLVDD---KKFKTFGDDPREEVMTLERILLQTIK 171
+ C++LA KVE+TP+ I D ++ K+ RE ++ E LL+ +
Sbjct: 91 VTTCVYLACKVEDTPRD----ISIESFEARDLWSEEPKSS----RERILEYEFELLEALD 142
Query: 172 FDLQVDHPYSFLLKYAKSLKGDKTKLEKMLQMAWTFVNDSLCTTLCLQWEPEIIAVALMY 231
FDL V HPY +L + K L+ K+LQ+AW +ND+L T LCL + P IIA+A +
Sbjct: 143 FDLHVHHPYKYLEGFLKDLQEKDKY--KLLQIAWKIINDALRTDLCLLYPPHIIALAALL 200
Query: 232 LAGKL 236
+A ++
Sbjct: 201 IACEV 205
>gnl|CDD|238003 cd00043, CYCLIN, Cyclin box fold. Protein binding domain
functioning in cell-cycle and transcription control.
Present in cyclins, TFIIB and Retinoblastoma (RB).The
cyclins consist of 8 classes of cell cycle regulators
that regulate cyclin dependent kinases (CDKs). TFIIB is
a transcription factor that binds the TATA box. Cyclins,
TFIIB and RB contain 2 copies of the domain.
Length = 88
Score = 63.8 bits (156), Expect = 6e-13
Identities = 25/99 (25%), Positives = 40/99 (40%), Gaps = 12/99 (12%)
Query: 69 RREGARFIIDCGTKMDLGYNTMATGVVFFHRFYMYHSFKTFPRYVTASCCLFLAGKVEET 128
R F+ + L T+ V RF + +S + A+ L+LA KVEE
Sbjct: 2 RPTPLDFLRRVAKALGLSPETLTLAVNLLDRFLLDYSVLGRSPSLVAAAALYLAAKVEEI 61
Query: 129 PKKCKDVIKTARMLVDDKKFKTFGDDPREEVMTLERILL 167
P KD++ G EE++ +E++LL
Sbjct: 62 PPWLKDLVHVT------------GYATEEEILRMEKLLL 88
Score = 47.2 bits (113), Expect = 4e-07
Identities = 17/59 (28%), Positives = 27/59 (45%), Gaps = 4/59 (6%)
Query: 178 HPYSFLLKYAKSLKGDKTKLEKMLQMAWTFVNDSLCTTLCLQWEPEIIAVALMYLAGKL 236
P FL + AK+L + L +A ++ L L P ++A A +YLA K+
Sbjct: 4 TPLDFLRRVAKALGLSP----ETLTLAVNLLDRFLLDYSVLGRSPSLVAAAALYLAAKV 58
>gnl|CDD|214641 smart00385, CYCLIN, domain present in cyclins, TFIIB and
Retinoblastoma. A helical domain present in cyclins and
TFIIB (twice) and Retinoblastoma (once). A protein
recognition domain functioning in cell-cycle and
transcription control.
Length = 83
Score = 57.6 bits (140), Expect = 8e-11
Identities = 24/95 (25%), Positives = 40/95 (42%), Gaps = 12/95 (12%)
Query: 74 RFIIDCGTKMDLGYNTMATGVVFFHRFYMYHSFKTFPRYVTASCCLFLAGKVEETPKKCK 133
F+ ++L T+ V RF + F + + A+ L+LA K EETP K
Sbjct: 1 DFLRRVCKALNLDPETLNLAVNLLDRFLSDYKFLKYSPSLIAAAALYLASKTEETPPWTK 60
Query: 134 DVIKTARMLVDDKKFKTFGDDPREEVMTLERILLQ 168
++ G EE++ +ER+LL+
Sbjct: 61 EL------------VHYTGYFTEEEILRMERLLLE 83
Score = 41.8 bits (99), Expect = 3e-05
Identities = 17/55 (30%), Positives = 27/55 (49%), Gaps = 4/55 (7%)
Query: 182 FLLKYAKSLKGDKTKLEKMLQMAWTFVNDSLCTTLCLQWEPEIIAVALMYLAGKL 236
FL + K+L D + L +A ++ L L++ P +IA A +YLA K
Sbjct: 2 FLRRVCKALNLDP----ETLNLAVNLLDRFLSDYKFLKYSPSLIAAAALYLASKT 52
>gnl|CDD|215740 pfam00134, Cyclin_N, Cyclin, N-terminal domain. Cyclins regulate
cyclin dependent kinases (CDKs). Human cyclin-O is a
Uracil-DNA glycosylase that is related to other cyclins.
Cyclins contain two domains of similar all-alpha fold,
of which this family corresponds with the N-terminal
domain.
Length = 127
Score = 56.4 bits (137), Expect = 5e-10
Identities = 25/135 (18%), Positives = 51/135 (37%), Gaps = 13/135 (9%)
Query: 42 YWYFDKKDIRNSP-SFREGIDYETEQRYRREGARFIIDCGTKMDLGYNTMATGVVFFHRF 100
Y Y + + + P + + R ++++ + L T+ V + RF
Sbjct: 3 YAYLRELEEEDRPPPDYLDQQPDINPKMRAILIDWLVEVHEEFKLLPETLYLAVNYLDRF 62
Query: 101 YMYHSFKTFPRYVTASCCLFLAGKVEET-PKKCKDVIKTARMLVDDKKFKTFGDDPREEV 159
+ CL +A K EE P +D + + D + +EE+
Sbjct: 63 LSKQPVPRTKLQLVGVTCLLIAAKYEEIYPPSVEDFV-----YITDNAYT------KEEI 111
Query: 160 MTLERILLQTIKFDL 174
+ +E ++L T+ +DL
Sbjct: 112 LRMELLILSTLNWDL 126
>gnl|CDD|129660 TIGR00569, ccl1, cyclin ccl1. All proteins in this family for
which functions are known are cyclins that are
components of TFIIH, a complex that is involved in
nucleotide excision repair and transcription initiation.
This family is based on the phylogenomic analysis of JA
Eisen (1999, Ph.D. Thesis, StanfordUniversity) [DNA
metabolism, DNA replication, recombination, and repair].
Length = 305
Score = 55.6 bits (134), Expect = 1e-08
Identities = 40/150 (26%), Positives = 70/150 (46%), Gaps = 24/150 (16%)
Query: 91 ATGVVFFHRFYMYHSFKTF-PRYVTASCCLFLAGKVEE----TPKKCKDVIKTARMLVDD 145
T +++F RFY+ +S + P+ + +C +FLA KVEE + ++ +T ++
Sbjct: 80 GTAIMYFKRFYLNNSVMEYHPKIIMLTC-VFLACKVEEFNVSIDQFVGNLKETPLKALE- 137
Query: 146 KKFKTFGDDPREEVMTLERILLQTIKFDLQVDHPY----SFLLKYAKSLKGDKTKLEKML 201
+V+ E +L+Q + F L V +PY FL+ L G E +
Sbjct: 138 ------------QVLEYELLLIQQLNFHLIVHNPYRPLEGFLIDIKTRLPG-LENPEYLR 184
Query: 202 QMAWTFVNDSLCTTLCLQWEPEIIAVALMY 231
+ A F+N +L T L + P IA+A +
Sbjct: 185 KHADKFLNRTLLTDAYLLYTPSQIALAAIL 214
>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
Length = 3151
Score = 48.0 bits (114), Expect = 9e-06
Identities = 35/113 (30%), Positives = 38/113 (33%), Gaps = 5/113 (4%)
Query: 280 QSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAP-PHSAIPTLGAQYPQRVAP 338
Q Q PPQ QP P P PQ L P+ P P LGA P RVA
Sbjct: 2915 PPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPGRVAV 2974
Query: 339 P----PSGHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQT 387
P P PA P H + V S H P S +QT
Sbjct: 2975 PRFRVPQPAPSREAPASSTPPLTGHSLSRVSSWASSLALHEETDPPPVSLKQT 3027
Score = 44.2 bits (104), Expect = 1e-04
Identities = 29/131 (22%), Positives = 43/131 (32%), Gaps = 9/131 (6%)
Query: 283 QQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQ-YPQRVAPPPS 341
+ A + + P + PPQ PP PP P PQ P +
Sbjct: 2888 RPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTT 2947
Query: 342 GHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQTRSCSTTPLCHSY 401
P+ VP P + + PG R P A S + S+TP +
Sbjct: 2948 DPAGAGEPSGAVPQPWL----GALVPGRVAVPRFRVPQPAPSREA----PASSTPPLTGH 2999
Query: 402 PGSPISTTCST 412
S +S+ S+
Sbjct: 3000 SLSRVSSWASS 3010
Score = 40.7 bits (95), Expect = 0.002
Identities = 28/118 (23%), Positives = 37/118 (31%), Gaps = 5/118 (4%)
Query: 279 SQSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAP 338
++ +A+Q SPPQ +P P L P P A L + P P
Sbjct: 2667 ARRLGRAAQASSPPQ-RPRRRAARPTVGSLTSLADPPPPPPTPEPAPHALVSATPLPPGP 2725
Query: 339 PPSGHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQTRSCSTTP 396
+ PA P P P P G RPP AG + + P
Sbjct: 2726 AAARQASPALPAAPAPPAV--PAGPATPGGPARPA--RPPTTAGPPAPAPPAAPAAGP 2779
Score = 37.2 bits (86), Expect = 0.018
Identities = 30/120 (25%), Positives = 38/120 (31%), Gaps = 16/120 (13%)
Query: 290 SPPQTQPEPSTKPPPQSHLPPSKLKPDLVAP-----------PHSAIPTLGAQYP-QRVA 337
PP T +P+ PPP PPS VAP +A P A+ P +R+A
Sbjct: 2828 LPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPVRRLA 2887
Query: 338 PPPSGHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQTRSCSTTPL 397
P + P A PP Q +PPP PL
Sbjct: 2888 RPAVSRSTESFALPPDQPERPPQPQAPPPPQPQ----PQPPPPPQPQPPPPPPPRPQPPL 2943
Score = 36.8 bits (85), Expect = 0.022
Identities = 30/139 (21%), Positives = 43/139 (30%), Gaps = 17/139 (12%)
Query: 285 ASQQDSPPQTQPEPSTKPPPQSHLPPSKLK-----PDLVAPPHSAIPTLGAQYPQRVAPP 339
A+ + + P P+ P P P PP A P A P R
Sbjct: 2726 AAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTR 2785
Query: 340 PSG--------HYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQTR- 390
P+ P+P+ P + A +PP + G PP +A T
Sbjct: 2786 PAVASLSESRESLPSPWDPADPPAA-VLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPG 2844
Query: 391 -SCSTTPLCHSY-PGSPIS 407
+ PL S PG +
Sbjct: 2845 PPPPSLPLGGSVAPGGDVR 2863
Score = 36.5 bits (84), Expect = 0.035
Identities = 25/138 (18%), Positives = 30/138 (21%), Gaps = 14/138 (10%)
Query: 291 PPQTQPEPSTKPPPQSHLPPSKLKP------------DLVAPPHSAIPTLGAQYPQRVAP 338
PP P P P P P P A P + P
Sbjct: 2701 PPPPPPTPEPAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARP 2760
Query: 339 PPSGHYPNPYP--AHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQTRSCSTTP 396
P + P P P A P AV P A + + P
Sbjct: 2761 PTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPP 2820
Query: 397 LCHSYPGSPISTTCSTTP 414
P T+ T
Sbjct: 2821 AASPAGPLPPPTSAQPTA 2838
Score = 35.7 bits (82), Expect = 0.062
Identities = 23/124 (18%), Positives = 30/124 (24%), Gaps = 16/124 (12%)
Query: 290 SPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPP--------PS 341
SP P+ P + LPP+ + PP SA PT P P P
Sbjct: 2800 SPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPG 2859
Query: 342 GHY---PNPYPAHPVPIPNIHP-----VHAVVPPGSQYYGHHRPPPAAGSTQQTQTRSCS 393
G P P P V ++ + P Q
Sbjct: 2860 GDVRRRPPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQP 2919
Query: 394 TTPL 397
Sbjct: 2920 QPQP 2923
Score = 34.5 bits (79), Expect = 0.14
Identities = 25/113 (22%), Positives = 35/113 (30%), Gaps = 11/113 (9%)
Query: 282 QQQASQQDSPPQ-TQPEPSTKP--PPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAP 338
+A + D+PPQ +P P+ PPS L PD AP P
Sbjct: 2583 TSRARRPDAPPQSARPRAPVDDRGDPRGPAPPSPLPPDTHAPDPPP--------PSPSPA 2634
Query: 339 PPSGHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQTRS 391
P P P P V + R A+ Q+ + R+
Sbjct: 2635 ANEPDPHPPPTVPPPERPRDDPAPGRVSRPRRARRLGRAAQASSPPQRPRRRA 2687
Score = 33.0 bits (75), Expect = 0.40
Identities = 23/91 (25%), Positives = 25/91 (27%), Gaps = 7/91 (7%)
Query: 298 PSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSGHYPNPYPAHPVPIPN 357
PS P + P L P S A P + PP P P P P P
Sbjct: 2867 PSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQAPPPPQPQPQPPPP 2926
Query: 358 IHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQ 388
P PP RP P T
Sbjct: 2927 PQPQPPPPPPP-------RPQPPLAPTTDPA 2950
Score = 31.1 bits (70), Expect = 1.4
Identities = 23/99 (23%), Positives = 28/99 (28%), Gaps = 11/99 (11%)
Query: 289 DSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSGHYPNPY 348
D P P++ P P P+ APP A P A+ P P P
Sbjct: 403 DQTRPAAPVPASVPTPAPTPVPAS------APPPPATPLPSAEPGSDDGPAPP---PERQ 453
Query: 349 PAHPVPIPNIHPVHAVVPPGSQYYGHHRP--PPAAGSTQ 385
P P P RP PP A +
Sbjct: 454 PPAPATEPAPDDPDDATRKALDALRERRPPEPPGADLAE 492
Score = 28.8 bits (64), Expect = 7.8
Identities = 25/92 (27%), Positives = 35/92 (38%), Gaps = 7/92 (7%)
Query: 290 SPPQTQPEPSTKP-PPQSHLPPSKLKPDLVAPPHSAIPTL--GAQYPQRVAPPPSGHYPN 346
+P ++ P P P P + + +PD APP SA P + R PPS P+
Sbjct: 2563 APDRSVPPPRPAPRPSEPAVTSRARRPD--APPQSARPRAPVDDRGDPRGPAPPSPLPPD 2620
Query: 347 PYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPP 378
+ P P P+ P A P P
Sbjct: 2621 THAPDP-PPPSPSP-AANEPDPHPPPTVPPPE 2650
Score = 28.4 bits (63), Expect = 9.2
Identities = 29/150 (19%), Positives = 41/150 (27%), Gaps = 25/150 (16%)
Query: 281 SQQQASQQDSPPQTQPEPSTKPPPQ------SHLPPSKLKPDLVAPPHSAIPTLGAQYPQ 334
++ P P + PP + P + P PP A
Sbjct: 272 ARGATGPPPPPEAAAPNGAAAPPDGVWGAALAGAPLALPAPPDPPPPAPAGDAEEEDDED 331
Query: 335 RV------APPPSGHYPNPYPAH-------PVPIPNI-----HPVHAVVPPGSQYYGHHR 376
P P HYP +P P + ++ HP A +P + H
Sbjct: 332 GAMEVVSPLPRPRQHYPLGFPKRRRPTWTPPSSLEDLSAGRHHPKRASLPTRKRRSARHA 391
Query: 377 PPP-AAGSTQQTQTRSCSTTPLCHSYPGSP 405
P A G QTR + P P
Sbjct: 392 ATPFARGPGGDDQTRPAAPVPASVPTPAPT 421
>gnl|CDD|219339 pfam07223, DUF1421, Protein of unknown function (DUF1421). This
family represents a conserved region approximately 350
residues long within a number of plant proteins of
unknown function.
Length = 357
Score = 46.5 bits (110), Expect = 1e-05
Identities = 32/145 (22%), Positives = 44/145 (30%), Gaps = 9/145 (6%)
Query: 279 SQSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPP-HSAIPTLGAQYP---- 333
SQ Q +QQ Q QP P Q PP + + PP + P + YP
Sbjct: 126 SQPQPPPAQQPQAQQPQPPPQVPQQQQYQSPPQQPQYQQNPPPQAQSAPQVSGLYPEESP 185
Query: 334 -QRVAPPPSGHYPNPYPAHPVP---IPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQT 389
Q + PP+ P+ P P+ P Y G P + + Q
Sbjct: 186 YQPQSYPPNEPLPSSMAMQPPYSGAPPSQQFYGPPQPSPYMYGGPGGRPNSGFPSGQQPP 245
Query: 390 RSCSTTPLCHSYPGSPISTTCSTTP 414
S +S P S
Sbjct: 246 PSQGQEGYGYSGPPPSKGNHGSVAS 270
Score = 36.1 bits (83), Expect = 0.029
Identities = 33/134 (24%), Positives = 40/134 (29%), Gaps = 9/134 (6%)
Query: 281 SQQQASQQDSPPQTQPEPSTKPPP--QSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAP 338
Q QQ P + PS PP QS + + PP S AQ PQ P
Sbjct: 82 PQPDQQQQSQAPPSHQYPSQLPPQQVQSVPQQPTPQQEPYYPPPSQPQPPPAQQPQAQQP 141
Query: 339 PPSGHYPN-----PYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQTRSCS 393
P P P P N P P S Y P + S
Sbjct: 142 QPPPQVPQQQQYQSPPQQPQYQQNPPPQAQSAPQVSGLYPEESPYQP--QSYPPNEPLPS 199
Query: 394 TTPLCHSYPGSPIS 407
+ + Y G+P S
Sbjct: 200 SMAMQPPYSGAPPS 213
Score = 33.8 bits (77), Expect = 0.15
Identities = 30/126 (23%), Positives = 36/126 (28%), Gaps = 24/126 (19%)
Query: 279 SQSQQQASQQDSPPQTQPEPSTKPPPQSH---LPPSKLKPDLVAPPHSAIPTLGAQYPQR 335
S Q +P Q P Q LPP AP + Q Q
Sbjct: 41 SFHVDSTKQPPAPEQVAKHELADAPLQQVNAALPP--------APAPQSPQPDQQQQSQ- 91
Query: 336 VAPPPSGHYPNPYPAHPVPIPNIHPVHAVVP----------PGSQYYGHHRPPPAAGSTQ 385
PPS YP+ P V P P P +Q +P P Q
Sbjct: 92 --APPSHQYPSQLPPQQVQSVPQQPTPQQEPYYPPPSQPQPPPAQQPQAQQPQPPPQVPQ 149
Query: 386 QTQTRS 391
Q Q +S
Sbjct: 150 QQQYQS 155
>gnl|CDD|237865 PRK14951, PRK14951, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 618
Score = 44.3 bits (105), Expect = 9e-05
Identities = 18/108 (16%), Positives = 23/108 (21%), Gaps = 1/108 (0%)
Query: 291 PPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSGHYPNPYPA 350
P P P P+ APP +A P A P
Sbjct: 388 APAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAAPAAVAL 447
Query: 351 HPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQTRSCSTTPLC 398
P P P +P A + R + T
Sbjct: 448 APAPPAQAAPETVAIPVRVAPEPAVASAAPAPAAAPAAAR-LTPTEEG 494
Score = 34.7 bits (80), Expect = 0.088
Identities = 16/68 (23%), Positives = 21/68 (30%), Gaps = 3/68 (4%)
Query: 290 SPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSGHYPNPYP 349
P + P P+ + P +A T P RVAP P+ P P
Sbjct: 422 PPAPVAAPAAAAPAAAPAAAPAAVALAPAPPAQAAPET--VAIPVRVAPEPAVASAAPAP 479
Query: 350 A-HPVPIP 356
A P
Sbjct: 480 AAAPAAAR 487
Score = 34.3 bits (79), Expect = 0.11
Identities = 18/119 (15%), Positives = 27/119 (22%), Gaps = 2/119 (1%)
Query: 296 PEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSGHYPNPYPAHPVPI 355
E T P++ P + A P A A P + P PA P
Sbjct: 376 AEKKTPARPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPA 435
Query: 356 PNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQTRSCSTTPLCHSYPGSPISTTCSTTP 414
A V P + + +P + + T
Sbjct: 436 AAPAAAPAAVALAPAPP-AQAAPETVAIPVRVAPEPAVASAAPAP-AAAPAAARLTPTE 492
Score = 28.9 bits (65), Expect = 6.7
Identities = 22/99 (22%), Positives = 29/99 (29%), Gaps = 9/99 (9%)
Query: 280 QSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAP---PHSA------IPTLGA 330
+Q A+ + + PP + P P AP P +A P A
Sbjct: 394 VAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAAPAAVALAPAPPA 453
Query: 331 QYPQRVAPPPSGHYPNPYPAHPVPIPNIHPVHAVVPPGS 369
Q P P P A P P P A + P
Sbjct: 454 QAAPETVAIPVRVAPEPAVASAAPAPAAAPAAARLTPTE 492
Score = 28.1 bits (63), Expect = 10.0
Identities = 19/89 (21%), Positives = 28/89 (31%), Gaps = 6/89 (6%)
Query: 278 YSQSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVA 337
+ A+ + P P + + PP++ P+ VA P P A
Sbjct: 420 AAPPAPVAAPAAAAPAAAPAAAPAAVALAPAPPAQAAPETVAIPVRVAPEPAVASA---A 476
Query: 338 PPPSGHYPNPYPAHPVPIPNIHPVHAVVP 366
P P+ P A P HA V
Sbjct: 477 PAPAAA---PAAARLTPTEEGDVWHATVQ 502
>gnl|CDD|220392 pfam09770, PAT1, Topoisomerase II-associated protein PAT1. Members
of this family are necessary for accurate chromosome
transmission during cell division.
Length = 804
Score = 44.0 bits (104), Expect = 1e-04
Identities = 25/121 (20%), Positives = 28/121 (23%), Gaps = 4/121 (3%)
Query: 272 HQVLDLYSQSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQ 331
Q L QQ Q P Q P P P +P
Sbjct: 167 QQAPQLPQPPQQVLPQGMPPRQAAFPQQGPPEQPPGYPQPPQGHPEQVQPQQFLPAPSQA 226
Query: 332 YPQRVAPP--PSGHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGH--HRPPPAAGSTQQT 387
Q PP P P P P + P P Q +PPP T
Sbjct: 227 PAQPPLPPQLPQQPPPLQQPQFPGLSQQMPPPPPQPPQQQQQPPQPQAQPPPQNQPTPHP 286
Query: 388 Q 388
Sbjct: 287 G 287
Score = 42.1 bits (99), Expect = 5e-04
Identities = 27/112 (24%), Positives = 29/112 (25%), Gaps = 6/112 (5%)
Query: 278 YSQSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVA 337
Y Q Q +Q P Q P PS P P +L P L Q P
Sbjct: 203 YPQPPQGHPEQVQPQQFLPAPSQAPAQPPL--PPQLPQQPPPLQQPQFPGLSQQMPPPPP 260
Query: 338 PPPSGHYPNPYP-AHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQ 388
PP P P A P P P PP Q
Sbjct: 261 QPPQQQQQPPQPQAQPPPQNQPTPHP---GLPQGQNAPLPPPQQPQLLPLVQ 309
Score = 42.1 bits (99), Expect = 5e-04
Identities = 29/115 (25%), Positives = 36/115 (31%), Gaps = 8/115 (6%)
Query: 279 SQSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAP 338
Q SQ + P P+ +PPP L + PP P Q PQ A
Sbjct: 217 QQFLPAPSQAPAQPPLPPQLPQQPPPLQQPQFPGL-SQQMPPPPPQPPQQQQQPPQPQAQ 275
Query: 339 PPSGHYPNPYPAHPVPIPNIHPVHA-------VVPPGSQYYGHHRPPPAAGSTQQ 386
PP + P P+P P P V P Q G +QQ
Sbjct: 276 PPPQNQPTPHPGLPQGQNAPLPPPQQPQLLPLVQQPQGQQRGPQFREQLVQLSQQ 330
Score = 41.7 bits (98), Expect = 6e-04
Identities = 24/116 (20%), Positives = 26/116 (22%), Gaps = 5/116 (4%)
Query: 278 YSQSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQY--PQR 335
Q P Q P + P S P P P P PQ
Sbjct: 224 SQAPAQPPLPPQLPQQPPPLQQPQFPGLSQQMPP---PPPQPPQQQQQPPQPQAQPPPQN 280
Query: 336 VAPPPSGHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQTRS 391
P G P P P G Q R S QQ + S
Sbjct: 281 QPTPHPGLPQGQNAPLPPPQQPQLLPLVQQPQGQQRGPQFREQLVQLSQQQREALS 336
Score = 41.3 bits (97), Expect = 8e-04
Identities = 20/115 (17%), Positives = 22/115 (19%), Gaps = 8/115 (6%)
Query: 278 YSQSQQQASQQDSPPQTQP----EPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYP 333
Q SQ Q E Q P +P P P A
Sbjct: 134 PQPPQAPESQPQPQTPAQKMLSLEEVEAQLQQRQQAPQLPQPPQQVLPQGMPPRQAAFPQ 193
Query: 334 QRVAPPPSGHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQ 388
Q P G+ P P P PP Q
Sbjct: 194 QGPPEQPPGYPQPPQGHPEQVQPQQFLPA----PSQAPAQPPLPPQLPQQPPPLQ 244
Score = 38.2 bits (89), Expect = 0.007
Identities = 21/108 (19%), Positives = 23/108 (21%), Gaps = 15/108 (13%)
Query: 272 HQVLDLYSQSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQ 331
Q SQQ PPQ Q +P P PQ+ PP P
Sbjct: 243 LQQPQFPGLSQQMPPPPPQPPQQQQQP---PQPQAQPPPQNQPT---PHPGLPQGQNAPL 296
Query: 332 YPQRVAPPPSGHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPP 379
P P P V Q R
Sbjct: 297 PP----PQQPQLLPLVQQPQGQQRGPQFREQLV-----QLSQQQREAL 335
Score = 38.2 bits (89), Expect = 0.009
Identities = 21/100 (21%), Positives = 28/100 (28%), Gaps = 7/100 (7%)
Query: 280 QSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPP 339
+ + Q Q+ PQ PPQ LP P P YPQ
Sbjct: 158 EVEAQLQQRQQAPQLPQ------PPQQVLPQGMPPRQAAFPQQGP-PEQPPGYPQPPQGH 210
Query: 340 PSGHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPP 379
P P + P P P+ +P + P
Sbjct: 211 PEQVQPQQFLPAPSQAPAQPPLPPQLPQQPPPLQQPQFPG 250
Score = 37.4 bits (87), Expect = 0.012
Identities = 22/119 (18%), Positives = 32/119 (26%), Gaps = 15/119 (12%)
Query: 282 QQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDL-------VAPPHSAIPTLGAQYPQ 334
Q S + P+ +P+P P Q P + + L P L PQ
Sbjct: 120 QPDPSLWGTAPKPEPQPPQAPESQ-PQPQTPAQKMLSLEEVEAQLQQRQQAPQLPQP-PQ 177
Query: 335 RVAPPPSGHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHH----RPPPAAGSTQQTQT 389
+V P +P P P + P G P A +
Sbjct: 178 QVLPQGMPPRQAAFPQQGP--PEQPPGYPQPPQGHPEQVQPQQFLPAPSQAPAQPPLPP 234
>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family. Atrophin-1 is the
protein product of the dentatorubral-pallidoluysian
atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
neurodegenerative disorder. It is caused by the
expansion of a CAG repeat in the DRPLA gene on
chromosome 12p. This results in an extended
polyglutamine region in atrophin-1, that is thought to
confer toxicity to the protein, possibly through
altering its interactions with other proteins. The
expansion of a CAG repeat is also the underlying defect
in six other neurodegenerative disorders, including
Huntington's disease. One interaction of expanded
polyglutamine repeats that is thought to be pathogenic
is that with the short glutamine repeat in the
transcriptional coactivator CREB binding protein, CBP.
This interaction draws CBP away from its usual nuclear
location to the expanded polyglutamine repeat protein
aggregates that are characteristic of the polyglutamine
neurodegenerative disorders. This interferes with
CBP-mediated transcription and causes cytotoxicity.
Length = 979
Score = 43.5 bits (102), Expect = 2e-04
Identities = 37/129 (28%), Positives = 53/129 (41%), Gaps = 18/129 (13%)
Query: 274 VLDLYSQSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYP 333
L L SQ+Q + S QP+ +PP + LPP+ P + PP + IP L
Sbjct: 306 PLPLPSQAQPHSHTPPSQSALQPQ---QPPREQPLPPAPSMPHIKPPPTTPIPQLP---N 359
Query: 334 QRVAPPPSGHYPNPYPAHPV---PIPNIHPVHAVV---PPGSQYYGHHRPPPAAGSTQQT 387
Q PP P+P+P P P P + P+ ++ PP + PPP Q
Sbjct: 360 QSHKHPPHLQGPSPFPQMPSNLPPPPALKPLSSLPTHHPPSAH------PPPLQLMPQSQ 413
Query: 388 QTRSCSTTP 396
+S P
Sbjct: 414 PLQSVPAQP 422
Score = 40.1 bits (93), Expect = 0.002
Identities = 42/172 (24%), Positives = 55/172 (31%), Gaps = 40/172 (23%)
Query: 280 QSQQQASQQDSPPQTQPEPST------KPPPQSHLP----------------------PS 311
Q A Q PP+ QP P KPPP + +P PS
Sbjct: 320 PPSQSALQPQQPPREQPLPPAPSMPHIKPPPTTPIPQLPNQSHKHPPHLQGPSPFPQMPS 379
Query: 312 KLKPDLVAPPHSAIPTLGAQYPQRVAPPPSGHYPNPYPAHPVP--IPNIHPVHAVVPPGS 369
L P P S++PT +P PPP P P VP P + ++ P S
Sbjct: 380 NLPPPPALKPLSSLPT---HHPPSAHPPPLQLMPQSQPLQSVPAQPPVLTQSQSLPPKAS 436
Query: 370 ---QYYGHHRPP--PAAGSTQQTQTRSCSTTPLCH--SYPGSPISTTCSTTP 414
H PP P A + P S P +P + + P
Sbjct: 437 THPHSGLHSGPPQSPFAQHPFTSGGLPAIGPPPSLPTSTPAAPPRASSGSQP 488
Score = 37.7 bits (87), Expect = 0.011
Identities = 32/146 (21%), Positives = 41/146 (28%), Gaps = 19/146 (13%)
Query: 277 LYSQSQQQASQQDSPPQTQPEPSTKPPPQSH--LPP------SKLKPDLVAPPHSAIPTL 328
Q Q S Q PP PS PP S +PP ++ P P ++ +
Sbjct: 171 QLLQPQGPPSIQ-VPPGAALAPSAPPPTPSAQAVPPQGSPIAAQPAPQPQQPSPLSLISA 229
Query: 329 GAQYPQRVAPPPSGHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQ 388
+ +PQR+ P P P H HH P P Q
Sbjct: 230 PSLHPQRLPSPHPPLQPQTASQQSPQPPAPSSRHPQSS-------HHGPGPPMPHALQQG 282
Query: 389 TRSCSTTPLCHSYPGSPISTTCSTTP 414
S P P S P
Sbjct: 283 P---VFLQHPSSNPPQPFGLAQSQVP 305
Score = 34.7 bits (79), Expect = 0.10
Identities = 34/126 (26%), Positives = 45/126 (35%), Gaps = 16/126 (12%)
Query: 283 QQASQQDSPPQTQPEPS--TKPPPQSHLPPSKLKPD-LVAPPHSAIPTLGAQYPQRVAPP 339
Q S PP +P S T PP +H PP +L P A P + Q Q + P
Sbjct: 376 QMPSNLPPPPALKPLSSLPTHHPPSAHPPPLQLMPQSQPLQSVPAQPPVLTQ-SQSLPPK 434
Query: 340 PSGHYPN---------PYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQTR 390
S H + P+ HP P A+ PP S PP A S Q
Sbjct: 435 ASTHPHSGLHSGPPQSPFAQHPF-TSGGLP--AIGPPPSLPTSTPAAPPRASSGSQPPGS 491
Query: 391 SCSTTP 396
+ ++
Sbjct: 492 ALPSSG 497
Score = 34.3 bits (78), Expect = 0.14
Identities = 24/123 (19%), Positives = 33/123 (26%), Gaps = 6/123 (4%)
Query: 282 QQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPS 341
Q Q + Q SP P P P P+ PQ S
Sbjct: 245 QPQTASQQSPQPPAPSSRHPQSSHHGPGPPMPHALQQGPVFLQHPSSNP--PQPFGLAQS 302
Query: 342 GHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQTRSCSTTPLCHSY 401
P P P+ P + P + + P PP + + TTP+
Sbjct: 303 QVPPLPLPSQAQPHSHTPPSQSALQPQQPPREQPLPPAPSMP----HIKPPPTTPIPQLP 358
Query: 402 PGS 404
S
Sbjct: 359 NQS 361
Score = 28.5 bits (63), Expect = 7.7
Identities = 25/122 (20%), Positives = 31/122 (25%), Gaps = 17/122 (13%)
Query: 275 LDLYSQSQQQAS----------QQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSA 324
L L QSQ S Q PP+ P + P P +
Sbjct: 406 LQLMPQSQPLQSVPAQPPVLTQSQSLPPKASTHPHSGLHSGPPQSPFAQHPFTSGGLPAI 465
Query: 325 IPTLGAQYPQRVAPP-------PSGHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRP 377
P APP P G P P + P+ P + P
Sbjct: 466 GPPPSLPTSTPAAPPRASSGSQPPGSALPSSGGCAGPGPPLPPIQIKEEPLDEAEEPESP 525
Query: 378 PP 379
PP
Sbjct: 526 PP 527
>gnl|CDD|236669 PRK10263, PRK10263, DNA translocase FtsK; Provisional.
Length = 1355
Score = 41.6 bits (97), Expect = 8e-04
Identities = 22/106 (20%), Positives = 31/106 (29%), Gaps = 6/106 (5%)
Query: 280 QSQQQASQQDSPPQTQPEPSTKP-PPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAP 338
+ Q SQ P EP +P PQ + P ++ P AQ P
Sbjct: 378 EGYPQQSQYAQPAVQYNEPLQQPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPA 437
Query: 339 PPSGHYPNPYPAHPV-----PIPNIHPVHAVVPPGSQYYGHHRPPP 379
P N + A P P +Q + +P P
Sbjct: 438 PEQPVAGNAWQAEEQQSTFAPQSTYQTEQTYQQPAAQEPLYQQPQP 483
Score = 40.8 bits (95), Expect = 0.001
Identities = 28/89 (31%), Positives = 33/89 (37%), Gaps = 6/89 (6%)
Query: 279 SQSQQQASQQDSPPQT---QPEPSTKPPPQSHLP--PSKLKPDLVAPPHSAIPTLGAQYP 333
Q Q Q QQ PQ QP+ P PQ P P +P P P Q P
Sbjct: 761 PQQQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQP 820
Query: 334 QR-VAPPPSGHYPNPYPAHPVPIPNIHPV 361
Q+ VAP P P A +HP+
Sbjct: 821 QQPVAPQPQYQQPQQPVAPQPQDTLLHPL 849
Score = 35.8 bits (82), Expect = 0.052
Identities = 20/92 (21%), Positives = 27/92 (29%), Gaps = 9/92 (9%)
Query: 278 YSQSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVA 337
Y+ + +Q +QQ +P Q+ S P Y Q A
Sbjct: 421 YAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTFAPQSTYQTEQT-------YQQPAA 473
Query: 338 PPPSGHYPNPYPAHPV--PIPNIHPVHAVVPP 367
P P P PV P P + PP
Sbjct: 474 QEPLYQQPQPVEQQPVVEPEPVVEETKPARPP 505
Score = 32.0 bits (72), Expect = 0.73
Identities = 29/114 (25%), Positives = 36/114 (31%), Gaps = 3/114 (2%)
Query: 275 LDLYSQSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQ 334
LD + S +A D P + P +P Q P + + A Q Q
Sbjct: 724 LDDFEFSPMKALLDDGPHEPLFTPIVEPVQQPQQPVAPQQQYQQPQQPVAPQPQYQQPQQ 783
Query: 335 RVAPPPSGHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQ 388
VAP P P A P P V P QY +P QQ Q
Sbjct: 784 PVAPQPQYQQPQQPVA---PQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQ 834
Score = 32.0 bits (72), Expect = 0.81
Identities = 29/127 (22%), Positives = 41/127 (32%), Gaps = 13/127 (10%)
Query: 281 SQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPP 340
+Q A+ + QT P S PP P +P V P + P +AP P
Sbjct: 330 TQSWAAPVEPVTQTPPVASVDVPPAQ--PTVAWQP--VPGPQTGEPV--------IAPAP 377
Query: 341 SGHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQTRSCSTTPLCHS 400
G+ A P + P+ V P YY PA + P
Sbjct: 378 EGYPQQSQYAQP-AVQYNEPLQQPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAP 436
Query: 401 YPGSPIS 407
P P++
Sbjct: 437 APEQPVA 443
Score = 31.2 bits (70), Expect = 1.3
Identities = 26/146 (17%), Positives = 38/146 (26%), Gaps = 17/146 (11%)
Query: 281 SQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKL------------KPDLVAPPHSAIPTL 328
+Q + Q P EP P P+ + S+ +P P+ A
Sbjct: 354 AQPTVAWQPVPGPQTGEPVIAPAPEGYPQQSQYAQPAVQYNEPLQQPVQPQQPYYAPAAE 413
Query: 329 GAQYPQRVAPPPSGHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQ 388
AP P PY A P S + P + T+QT
Sbjct: 414 QPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTF-----APQSTYQTEQTY 468
Query: 389 TRSCSTTPLCHSYPGSPISTTCSTTP 414
+ + PL P
Sbjct: 469 QQPAAQEPLYQQPQPVEQQPVVEPEP 494
Score = 28.9 bits (64), Expect = 6.4
Identities = 26/110 (23%), Positives = 33/110 (30%), Gaps = 2/110 (1%)
Query: 280 QSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAP-PHSAIPTLGAQYPQRVAP 338
Q+ AS P Q P PQ+ P P+ A P + P +
Sbjct: 342 QTPPVASVDVPPAQPTVAWQPVPGPQTGEPVIAPAPEGYPQQSQYAQPAVQYNEPLQQPV 401
Query: 339 PPSGHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQ 388
P Y P P P P P YY P AG+ Q +
Sbjct: 402 QPQQPYYAPAAEQPAQQPYYAP-APEQPAQQPYYAPAPEQPVAGNAWQAE 450
>gnl|CDD|235124 PRK03427, PRK03427, cell division protein ZipA; Provisional.
Length = 333
Score = 40.4 bits (95), Expect = 0.001
Identities = 26/118 (22%), Positives = 39/118 (33%), Gaps = 12/118 (10%)
Query: 279 SQSQQQASQQDSPP---QTQPEPSTKPPPQSHLP-PSKLKPDLVAPPHSAIPTLGAQYPQ 334
++ + + Q P QP + PP++ +P +P AP P Q Q
Sbjct: 81 HEAARPSPQHQYQPPYASAQPRQPVQQPPEAQVPPQHAPRPAQPAPQPVQQPAYQPQPEQ 140
Query: 335 RVAPPPSGHY-PNPYPAHPVPIP---NIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQ 388
+ P S P P P H P P P V P + PA + +
Sbjct: 141 PLQQPVSPQVAPAPQPVHSAPQPAQQAFQPAEPVAAPQPE----PVAEPAPVMDKPKR 194
Score = 35.0 bits (81), Expect = 0.059
Identities = 26/118 (22%), Positives = 35/118 (29%), Gaps = 10/118 (8%)
Query: 272 HQVLDLYSQSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQ 331
HQ Y+ +Q + Q P P P Q P + P P
Sbjct: 90 HQYQPPYASAQPRQPVQQPPEAQVPPQHAPRPAQPAPQPVQQPAY--QPQPEQ-PLQQPV 146
Query: 332 YPQRVAPPPSGHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQT 389
PQ VAP P + P PA P V P + P P ++ +
Sbjct: 147 SPQ-VAPAPQPVHSAPQPAQQA----FQPAEPVAAPQPE--PVAEPAPVMDKPKRKEA 197
Score = 30.4 bits (69), Expect = 1.6
Identities = 12/97 (12%), Positives = 19/97 (19%), Gaps = 4/97 (4%)
Query: 300 TKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSGHYPNPYPAHPVPIPNIH 359
+ +P H P + P++ P P P P
Sbjct: 69 HRVNHAPANAQEHEAARP-SPQHQYQPPYASAQPRQPVQQPPEA---QVPPQHAPRPAQP 124
Query: 360 PVHAVVPPGSQYYGHHRPPPAAGSTQQTQTRSCSTTP 396
V P Q + + P
Sbjct: 125 APQPVQQPAYQPQPEQPLQQPVSPQVAPAPQPVHSAP 161
Score = 29.2 bits (66), Expect = 3.6
Identities = 16/97 (16%), Positives = 18/97 (18%), Gaps = 10/97 (10%)
Query: 271 CHQVLDLYSQSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGA 330
Q +P Q P Q P P A
Sbjct: 106 QQPPEAQVPPQHAPRPAQPAPQPVQQPAYQPQPEQPLQQP--------VSPQVAPAP--Q 155
Query: 331 QYPQRVAPPPSGHYPNPYPAHPVPIPNIHPVHAVVPP 367
P P A P P P P + P
Sbjct: 156 PVHSAPQPAQQAFQPAEPVAAPQPEPVAEPAPVMDKP 192
Score = 28.8 bits (65), Expect = 5.2
Identities = 17/109 (15%), Positives = 25/109 (22%), Gaps = 10/109 (9%)
Query: 283 QQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSG 342
+ + + Q P Q P + +P Q P+ PP
Sbjct: 69 HRVNHAPANAQEHEAARPSPQHQYQPPYASAQPRQPVQ----------QPPEAQVPPQHA 118
Query: 343 HYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQTRS 391
P PV P P P S Q ++
Sbjct: 119 PRPAQPAPQPVQQPAYQPQPEQPLQQPVSPQVAPAPQPVHSAPQPAQQA 167
>gnl|CDD|227665 COG5373, COG5373, Predicted membrane protein [Function unknown].
Length = 931
Score = 40.6 bits (95), Expect = 0.002
Identities = 20/99 (20%), Positives = 31/99 (31%), Gaps = 8/99 (8%)
Query: 272 HQVLDLYSQSQQQASQQDSPPQTQPE--PSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLG 329
++ +L S + A+ + Q + + P S P++ P A G
Sbjct: 30 RELRELRSLVAEGAAGPVAKAAEQMAAPEAAEAAPLPAAAESIASPEVPPPVPPAPAQEG 89
Query: 330 AQYPQRVAPPPSGHYPNPYPAHPVPIPNIHPVHAVVPPG 368
A PS P P A P P + P P
Sbjct: 90 ---EAPAAEQPSAV-PAP-SAAPAPAEPVEPS-LAANPF 122
>gnl|CDD|219569 pfam07777, MFMR, G-box binding protein MFMR. This region is found
to the N-terminus of the pfam00170 transcription factor
domain. It is between 150 and 200 amino acids in length.
The N-terminal half is rather rich in proline residues
and has been termed the PRD (proline rich domain),
whereas the C-terminal half is more polar and has been
called the MFMR (multifunctional mosaic region). It has
been suggested that this family is composed of three
sub-families called A, B and C, classified according to
motif composition. It has been suggested that some of
these motifs may be involved in mediating
protein-protein interactions. The MFMR region contains a
nuclear localisation signal in bZIP opaque and GBF-2.
The MFMR also contains a transregulatory activity in
TAF-1. The MFMR in CPRF-2 contains cytoplasmic retention
signals.
Length = 189
Score = 38.3 bits (89), Expect = 0.003
Identities = 27/118 (22%), Positives = 36/118 (30%), Gaps = 30/118 (25%)
Query: 289 DSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPP------SG 342
S + S K Q P + PD + A Y R PP S
Sbjct: 3 SSEEGKPSKSSPKTSVQEDTPTPTVYPDW--------SAMQAYYGPRPPPPYFNSSVASS 54
Query: 343 HYPNPY---PAH--------PVPIPNIHP-----VHAVVPPGSQYYGHHRPPPAAGST 384
P+PY P P P ++P H +PPGS + + P A
Sbjct: 55 PQPHPYMWGPQQPMMPPYGTPPPYAAMYPPGGVYAHPSMPPGSHPFSPYAMPSAEVPG 112
>gnl|CDD|219321 pfam07174, FAP, Fibronectin-attachment protein (FAP). This family
contains bacterial fibronectin-attachment proteins
(FAP). Family members are rich in alanine and proline,
are approximately 300 long, and seem to be restricted to
mycobacteria. These proteins contain a
fibronectin-binding motif that allows mycobacteria to
bind to fibronectin in the extracellular matrix.
Length = 297
Score = 38.3 bits (89), Expect = 0.005
Identities = 20/74 (27%), Positives = 24/74 (32%), Gaps = 4/74 (5%)
Query: 295 QPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSGHYPNPYPAHPVP 354
P P PP + P+ P PP + APPP PN P P P
Sbjct: 40 DPAPPPPPPSTAAAAPAPAAPPPPPPPAAPPAPQPD--DPNAAPPPPPADPNAPP--PPP 95
Query: 355 IPNIHPVHAVVPPG 368
+ P PG
Sbjct: 96 VDPNAPPPPAPEPG 109
Score = 37.2 bits (86), Expect = 0.013
Identities = 17/68 (25%), Positives = 21/68 (30%)
Query: 290 SPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSGHYPNPYP 349
PP T P PP+ P++A P A PP + P P
Sbjct: 46 PPPSTAAAAPAPAAPPPPPPPAAPPAPQPDDPNAAPPPPPADPNAPPPPPVDPNAPPPPA 105
Query: 350 AHPVPIPN 357
P I N
Sbjct: 106 PEPGRIDN 113
Score = 30.6 bits (69), Expect = 1.2
Identities = 16/57 (28%), Positives = 19/57 (33%), Gaps = 1/57 (1%)
Query: 291 PPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSGHYPNP 347
P P P+ P PQ P + P P P + P AP P G N
Sbjct: 59 APPPPPPPAAPPAPQPDDPNAAPPPPPADPNAPPPPPVDPNAPPPPAPEP-GRIDNA 114
>gnl|CDD|218191 pfam04652, DUF605, Vta1 like. Vta1 (VPS20-associated protein 1) is
a positive regulator of Vps4. Vps4 is an ATPase that is
required in the multivesicular body (MVB) sorting
pathway to dissociate the endosomal sorting complex
required for transport (ESCRT). Vta1 promotes correct
assembly of Vps4 and stimulates its ATPase activity
through its conserved Vta1/SBP1/LIP5 region.
Length = 315
Score = 38.1 bits (89), Expect = 0.005
Identities = 24/99 (24%), Positives = 30/99 (30%), Gaps = 2/99 (2%)
Query: 285 ASQQDSPPQTQPEPSTKPPPQ-SHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSGH 343
AS D P + PS PP+ P P + S P + P +PPP
Sbjct: 182 ASPSDPPSSSPGVPSFPSPPEDPSSPSDSSLPPAPSSFQSDTPPPSPESPTNPSPPPGPA 241
Query: 344 YPNPYPAHPVP-IPNIHPVHAVVPPGSQYYGHHRPPPAA 381
P P P VP + P G A
Sbjct: 242 APPPPPVQQVPPLSTAKPTPPSASATPAPIGGITLDDDA 280
Score = 36.6 bits (85), Expect = 0.019
Identities = 20/93 (21%), Positives = 24/93 (25%), Gaps = 3/93 (3%)
Query: 278 YSQSQQQASQQDSPPQTQPEPSTKPPPQSHLP--PSKLKPDLVAPPHSAIPTLGAQYPQR 335
S S +S P P P S LP PS + D P +
Sbjct: 182 ASPSDPPSSSPGVPSFPSPPEDPSSPSDSSLPPAPSSFQSDTPPPSPESPTNPSPPPGPA 241
Query: 336 VAPPPSGHYPNPYPAHPVPIPNIHPVHAVVPPG 368
PPP P + P P G
Sbjct: 242 APPPPPVQ-QVPPLSTAKPTPPSASATPAPIGG 273
Score = 35.4 bits (82), Expect = 0.042
Identities = 23/115 (20%), Positives = 31/115 (26%), Gaps = 9/115 (7%)
Query: 276 DLYSQSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVA--PPHSAIPTLGAQYP 333
+ + S + PP S PS P P S++P + +
Sbjct: 161 ADVATTNSDNSFPGEDADPASASPSDPPSSSPGVPSFPSPPEDPSSPSDSSLPPAPSSFQ 220
Query: 334 QRVAPPPSGHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQ 388
PP NP P P PV V P + P S T
Sbjct: 221 SDTPPPSPESPTNPSPPPGPAAPPPPPVQQVPPLST-------AKPTPPSASATP 268
Score = 34.3 bits (79), Expect = 0.095
Identities = 18/97 (18%), Positives = 26/97 (26%), Gaps = 1/97 (1%)
Query: 286 SQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSGHYP 345
+ S + P S+ P PP + A + + P P+ P
Sbjct: 177 ADPASASPSDPPSSSPGVPSFPSPPEDPSSPSDSSLPPAPSSFQSDTPPPSPESPTNPSP 236
Query: 346 NPYP-AHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAA 381
P P A P P P + P P
Sbjct: 237 PPGPAAPPPPPVQQVPPLSTAKPTPPSASATPAPIGG 273
Score = 33.5 bits (77), Expect = 0.19
Identities = 22/88 (25%), Positives = 28/88 (31%), Gaps = 6/88 (6%)
Query: 274 VLDLYSQSQQQASQQDSPPQTQP-EPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQY 332
V S + +S DS P + PP S P+ P P + P Q
Sbjct: 194 VPSFPSPPEDPSSPSDSSLPPAPSSFQSDTPPPSPESPT--NPSPPPGPAAPPPPPVQQV 251
Query: 333 PQRVAPPPSGHYPNPYPAHPVPIPNIHP 360
P P+ P A P PI I
Sbjct: 252 PPLSTAKPT---PPSASATPAPIGGITL 276
Score = 33.1 bits (76), Expect = 0.22
Identities = 20/109 (18%), Positives = 27/109 (24%), Gaps = 8/109 (7%)
Query: 283 QQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSG 342
+ S PP S P P +PP + P P PS
Sbjct: 167 NSDNSFPGEDADPASASPSDPP-SSSPGV---PSFPSPPEDPSSPSDSSLP----PAPSS 218
Query: 343 HYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQTRS 391
+ P P N P P P A T + + +
Sbjct: 219 FQSDTPPPSPESPTNPSPPPGPAAPPPPPVQQVPPLSTAKPTPPSASAT 267
>gnl|CDD|237864 PRK14950, PRK14950, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 585
Score = 38.6 bits (90), Expect = 0.006
Identities = 20/93 (21%), Positives = 30/93 (32%), Gaps = 8/93 (8%)
Query: 280 QSQQQASQQDSPPQTQPEPSTKPPPQSHL--PPSKLKPDLVAPPHSAIPTLGAQYPQRVA 337
Q + + SP + P PST+P + PP + + PP + P
Sbjct: 367 QPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRPVA---PPVPH 423
Query: 338 PPPSGHYPNPYPAHPVPIPNIHPVHAVVPPGSQ 370
P S A PV + PP +
Sbjct: 424 TPESAP-KLTRAAIPVDEKPKYT--PPAPPKEE 453
Score = 30.2 bits (68), Expect = 2.2
Identities = 20/97 (20%), Positives = 24/97 (24%), Gaps = 7/97 (7%)
Query: 289 DSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQY-PQRVAPPPSGHYPNP 347
P QP T P P + P P +A P R P P P
Sbjct: 361 VPVPAPQPAKPTAAAP----SPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRP 416
Query: 348 YPAHPVPIPNIHP--VHAVVPPGSQYYGHHRPPPAAG 382
P P A +P + PP
Sbjct: 417 VAPPVPHTPESAPKLTRAAIPVDEKPKYTPPAPPKEE 453
>gnl|CDD|217310 pfam02993, MCPVI, Minor capsid protein VI. This minor capsid
protein may act as a link between the external capsid
and the internal DNA-protein core. The C-terminal 11
residues may function as a protease cofactor leading to
enzyme activation.
Length = 238
Score = 37.5 bits (87), Expect = 0.007
Identities = 26/97 (26%), Positives = 32/97 (32%), Gaps = 16/97 (16%)
Query: 284 QASQQDSPPQT---QPEPSTKPP--PQSHLPPS-----KLKPDLVAPPH--SAIPTLGAQ 331
Q P Q +P P + P + PPS K P V P AI
Sbjct: 118 QEETVADPIQALQPRPRPDVEEVLVPAAPEPPSYEETIKPGPAPVEEPVDSMAIAVPAID 177
Query: 332 YPQRVAPPPSGHYPNPYPAHPVPIPNIHPVHAVVPPG 368
P + PP+ P P P VP P+ VH
Sbjct: 178 TPVTLELPPA---PQP-PPPVVPQPSTMVVHRRSRIK 210
>gnl|CDD|236090 PRK07764, PRK07764, DNA polymerase III subunits gamma and tau;
Validated.
Length = 824
Score = 38.4 bits (90), Expect = 0.007
Identities = 22/115 (19%), Positives = 28/115 (24%), Gaps = 17/115 (14%)
Query: 284 QASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSGH 343
A +P P+ P P + P + AP + P A P PPS
Sbjct: 392 GAPAAAAPSAAAAAPAAAPAPAAAAPAA---AAAPAPAAAPQP---APAPAPAPAPPSPA 445
Query: 344 YPNPYPAHPVPIPNIHP-----------VHAVVPPGSQYYGHHRPPPAAGSTQQT 387
P P P P P P P A +
Sbjct: 446 GNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAAAPAAPAAP 500
Score = 35.0 bits (81), Expect = 0.082
Identities = 16/120 (13%), Positives = 17/120 (14%), Gaps = 17/120 (14%)
Query: 281 SQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHS-----------AIPTLG 329
+ A E S P P P K V A P
Sbjct: 622 AAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGGAAPAAP 681
Query: 330 AQYPQRVAPPPSGHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQT 389
P AP PA S
Sbjct: 682 PPAPAPAAPAAPAGAAPAQPAPA------PAATPPAGQADDPAAQPPQAAQGASAPSPAA 735
Score = 34.6 bits (80), Expect = 0.11
Identities = 16/106 (15%), Positives = 24/106 (22%), Gaps = 3/106 (2%)
Query: 279 SQSQQQASQQDSPPQTQPEPSTKP-PPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVA 337
+ A + P P P + P P + + +
Sbjct: 680 APPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPV 739
Query: 338 PPPS--GHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAA 381
P P P+P A P P P A P +
Sbjct: 740 PLPPEPDDPPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPSEEEEM 785
Score = 33.8 bits (78), Expect = 0.18
Identities = 16/113 (14%), Positives = 22/113 (19%), Gaps = 14/113 (12%)
Query: 279 SQSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAP 338
+ A+ +P+ P A Q Q +
Sbjct: 682 PPAPAPAAPAAPAGAAPAQPAPAPAATPP-----------AGQADDPAAQPPQAAQGASA 730
Query: 339 PPSGHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQTRS 391
P P P P P A P P A + S
Sbjct: 731 PSPAADDPV-PLPPEPDD--PPDPAGAPAQPPPPPAPAPAAAPAAAPPPSPPS 780
Score = 32.7 bits (75), Expect = 0.42
Identities = 15/80 (18%), Positives = 19/80 (23%), Gaps = 6/80 (7%)
Query: 279 SQSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAP 338
+ Q + +P P P P A P A P + AP
Sbjct: 425 PAAAPQPAPAPAPAP-APPSPAGNAPAGGAPSPPPAAAPSAQPAPA-PAAAPEPTAAPAP 482
Query: 339 PPSGHYPNPY----PAHPVP 354
P PA P
Sbjct: 483 APPAAPAPAAAPAAPAAPAA 502
Score = 30.7 bits (70), Expect = 1.5
Identities = 21/131 (16%), Positives = 31/131 (23%), Gaps = 13/131 (9%)
Query: 284 QASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSGH 343
+ P + P + AP +A A P P
Sbjct: 386 GVAGGAGAPAAAAPSAAAAAP----AAA------PAPAAAAPAAAAAPAPAAAPQPAPA- 434
Query: 344 YPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQTRSCSTTPLCHSYPG 403
P P PA P P P +P PA + + P +
Sbjct: 435 -PAPAPAPP-SPAGNAPAGGAPSPPPAAAPSAQPAPAPAAAPEPTAAPAPAPPAAPAPAA 492
Query: 404 SPISTTCSTTP 414
+P + P
Sbjct: 493 APAAPAAPAAP 503
Score = 30.0 bits (68), Expect = 2.6
Identities = 22/104 (21%), Positives = 29/104 (27%), Gaps = 19/104 (18%)
Query: 283 QQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSG 342
A + PQ P P+ P P S + PP A + AP P+
Sbjct: 420 AAAPAPAAAPQPAPAPAPAPAPPSPAGNAPAGGAPSPPP-------AAAPSAQPAPAPAA 472
Query: 343 HYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQ 386
P P P A P + PAA +
Sbjct: 473 ------APEPTAAPAPAPPAAPAPAAAP------AAPAAPAAPA 504
Score = 30.0 bits (68), Expect = 2.9
Identities = 15/92 (16%), Positives = 21/92 (22%), Gaps = 4/92 (4%)
Query: 279 SQSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAP 338
+ + A+ P P P PP P + P
Sbjct: 420 AAAPAPAAAPQPAPAPAPAP---APPSPAGNAPAGGAPSPPPAAAPSAQPAPA-PAAAPE 475
Query: 339 PPSGHYPNPYPAHPVPIPNIHPVHAVVPPGSQ 370
P + P P A P P G+
Sbjct: 476 PTAAPAPAPPAAPAPAAAPAAPAAPAAPAGAD 507
Score = 28.8 bits (65), Expect = 6.1
Identities = 15/106 (14%), Positives = 21/106 (19%), Gaps = 7/106 (6%)
Query: 292 PQTQPEPSTKPPPQ------SHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSGHYP 345
P P+ P + S VA P +
Sbjct: 615 PAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAG 674
Query: 346 NPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQTRS 391
PA P P P A + P + Q +
Sbjct: 675 GAAPAAPPPAP-APAAPAAPAGAAPAQPAPAPAATPPAGQADDPAA 719
>gnl|CDD|237605 PRK14086, dnaA, chromosomal replication initiation protein;
Provisional.
Length = 617
Score = 37.9 bits (88), Expect = 0.009
Identities = 18/114 (15%), Positives = 25/114 (21%), Gaps = 14/114 (12%)
Query: 291 PPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSG------HY 344
P P P+ P + P P + Q P+
Sbjct: 95 PAPPPPHARRTSEPELPRPGRRPYEGYGGPRADDRPPGLPRQDQLPTARPAYPAYQQRPE 154
Query: 345 PNPYPAHPVPIPNIHPVHAVVP-----PGSQYYGHH---RPPPAAGSTQQTQTR 390
P +P P + Y R P AG + Q R
Sbjct: 155 PGAWPRAADDYGWQQQRLGFPPRAPYASPASYAPEQERDREPYDAGRPEYDQRR 208
Score = 33.6 bits (77), Expect = 0.18
Identities = 20/110 (18%), Positives = 27/110 (24%), Gaps = 22/110 (20%)
Query: 301 KPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQ--YPQRVAPPPSGHYPNPYPAHPVPIPNI 358
+P P +P+L P G + P P PA+P
Sbjct: 94 EPAPPPPHARRTSEPELPRPGRRPYEGYGGPRADDRPPGLPRQDQLPTARPAYP------ 147
Query: 359 HPVHAVVPPGSQYYGHHRP---PPAAGSTQQTQTRSCSTTPLCHSYPGSP 405
Y P P AA Q R ++ P S
Sbjct: 148 -----------AYQQRPEPGAWPRAADDYGWQQQRLGFPPRAPYASPASY 186
Score = 30.2 bits (68), Expect = 2.7
Identities = 28/119 (23%), Positives = 39/119 (32%), Gaps = 13/119 (10%)
Query: 289 DSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSGHYPNPY 348
D P P P + P + +P+ A P +A G Q QR+ PP Y +P
Sbjct: 127 DDRPPGLPRQDQLPTARPAYPAYQQRPEPGAWPRAADD-YGWQQ-QRLGFPPRAPYASPA 184
Query: 349 PAHPVPIPNIHPVHAVVPPGSQYYGHHRPP---PAAGSTQQTQTRSCSTTPLCHSYPGS 404
P + P A G Y R P + + R T PG+
Sbjct: 185 SYAPEQERDREPYDA----GRPEYDQRRRDYDHPRPDWDRPRRDR----TDRPEPPPGA 235
>gnl|CDD|237874 PRK14971, PRK14971, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 614
Score = 37.8 bits (88), Expect = 0.009
Identities = 17/79 (21%), Positives = 19/79 (24%), Gaps = 3/79 (3%)
Query: 283 QQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSG 342
Q A+ PS P QS P P PT+ P V P
Sbjct: 385 QPAAAPQPSAAAAASPS---PSQSSAAAQPSAPQSATQPAGTPPTVSVDPPAAVPVNPPS 441
Query: 343 HYPNPYPAHPVPIPNIHPV 361
P PV
Sbjct: 442 TAPQAVRPAQFKEEKKIPV 460
Score = 34.8 bits (80), Expect = 0.081
Identities = 16/67 (23%), Positives = 22/67 (32%), Gaps = 4/67 (5%)
Query: 279 SQSQQQASQQDSPPQTQP--EPSTKPPPQSHLP--PSKLKPDLVAPPHSAIPTLGAQYPQ 334
S S Q+S P Q +P+ PP S P + P AP +
Sbjct: 399 SPSPSQSSAAAQPSAPQSATQPAGTPPTVSVDPPAAVPVNPPSTAPQAVRPAQFKEEKKI 458
Query: 335 RVAPPPS 341
V+ S
Sbjct: 459 PVSKVSS 465
Score = 32.1 bits (73), Expect = 0.69
Identities = 12/75 (16%), Positives = 20/75 (26%), Gaps = 2/75 (2%)
Query: 281 SQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPP 340
+ Q S + + + S Q P S +P P S P P
Sbjct: 388 AAPQPSAAAAASPSPSQSSA--AAQPSAPQSATQPAGTPPTVSVDPPAAVPVNPPSTAPQ 445
Query: 341 SGHYPNPYPAHPVPI 355
+ +P+
Sbjct: 446 AVRPAQFKEEKKIPV 460
Score = 30.5 bits (69), Expect = 1.9
Identities = 25/109 (22%), Positives = 37/109 (33%), Gaps = 5/109 (4%)
Query: 292 PQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQR--VAPPPSGHYPNPYP 349
Q + S P+ H+ P +P A P + + P + A PS P
Sbjct: 363 TQKGDDASGGRGPKQHIKPVFTQP--AAAPQPSAAAAASPSPSQSSAAAQPSAPQSATQP 420
Query: 350 AHPVPIPNIHPVHAV-VPPGSQYYGHHRPPPAAGSTQQTQTRSCSTTPL 397
A P ++ P AV V P S RP + ++ S P
Sbjct: 421 AGTPPTVSVDPPAAVPVNPPSTAPQAVRPAQFKEEKKIPVSKVSSLGPS 469
>gnl|CDD|223061 PHA03369, PHA03369, capsid maturational protease; Provisional.
Length = 663
Score = 37.7 bits (87), Expect = 0.013
Identities = 15/69 (21%), Positives = 19/69 (27%), Gaps = 1/69 (1%)
Query: 289 DSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSGHYPNPY 348
PQ P +S + P P P P P G P P
Sbjct: 381 RQRPQRPDGIPYSVPARSPMTAYPPVPQFCGDPGLVSPYNPQSPGTSYGPEPVGPVP-PQ 439
Query: 349 PAHPVPIPN 357
P +P +P
Sbjct: 440 PTNPYVMPI 448
>gnl|CDD|220309 pfam09606, Med15, ARC105 or Med15 subunit of Mediator complex
non-fungal. The approx. 70 residue Med15 domain of the
ARC-Mediator co-activator is a three-helix bundle with
marked similarity to the KIX domain. The sterol
regulatory element binding protein (SREBP) family of
transcription activators use the ARC105 subunit to
activate target genes in the regulation of cholesterol
and fatty acid homeostasis. In addition, Med15 is a
critical transducer of gene activation signals that
control early metazoan development.
Length = 768
Score = 37.3 bits (86), Expect = 0.014
Identities = 26/144 (18%), Positives = 40/144 (27%), Gaps = 23/144 (15%)
Query: 280 QSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVA-------------PPH--SA 324
Q QQQ + P Q + + + + P S
Sbjct: 355 QQQQQQQGGNHPAAHQQQMNQQVGQGGQMVALGYLNIQGNQGGLGANPMQQGQPGMMSSP 414
Query: 325 IPTLGAQYPQRVAPPPSGHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGST 384
P Q Q + PP P P P P ++P P P+ +
Sbjct: 415 SPVPQVQTNQSMPQPPQPS--VPSPGGPGSQPPQSVSGGMIPSPP----ALMPSPSPQMS 468
Query: 385 QQTQTRSCSTTPLCHSYPGSPIST 408
Q ++ T PG P++T
Sbjct: 469 QSPASQ--RTIQQDMVSPGGPLNT 490
>gnl|CDD|217392 pfam03153, TFIIA, Transcription factor IIA, alpha/beta subunit.
Transcription initiation factor IIA (TFIIA) is a
heterotrimer, the three subunits being known as alpha,
beta, and gamma, in order of molecular weight. The N and
C-terminal domains of the gamma subunit are represented
in pfam02268 and pfam02751, respectively. This family
represents the precursor that yields both the alpha and
beta subunits. The TFIIA heterotrimer is an essential
general transcription initiation factor for the
expression of genes transcribed by RNA polymerase II.
Together with TFIID, TFIIA binds to the promoter region;
this is the first step in the formation of a
pre-initiation complex (PIC). Binding of the rest of the
transcription machinery follows this step. After
initiation, the PIC does not completely dissociate from
the promoter. Some components, including TFIIA, remain
attached and re-initiate a subsequent round of
transcription.
Length = 332
Score = 35.9 bits (83), Expect = 0.030
Identities = 19/106 (17%), Positives = 25/106 (23%), Gaps = 5/106 (4%)
Query: 287 QQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPT-LGAQYPQRVAPPPSGHYP 345
+PP P P PQ + PT A P P+G
Sbjct: 52 SPQAPPPVAQLPQ--PLPQPPPTQALQALPAGDQQQHNTPTGSPAANPPATFALPAGPAG 109
Query: 346 NPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQTRS 391
P V+ + P + QQ Q R
Sbjct: 110 PTI--QTEPGQLYPVQVPVMVTQNPANSPLDQPAQQRALQQLQQRY 153
Score = 33.9 bits (78), Expect = 0.13
Identities = 24/119 (20%), Positives = 29/119 (24%), Gaps = 20/119 (16%)
Query: 272 HQVLDLYSQSQQQASQQDSPPQTQP--EPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLG 329
+Q+ Q D P P+ PP LP P + P G
Sbjct: 65 PLPQPPPTQALQALPAGDQQQHNTPTGSPAANPPATFALPAGPAGPTIQTEP-------G 117
Query: 330 AQYPQRVAPPPSGHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQ 388
YP V P P Q YG PA+G Q
Sbjct: 118 QLYP--VQVPVMVTQNPANSPLDQPAQQRALQQLQ-----QRYGA----PASGQLPSQQ 165
Score = 29.7 bits (67), Expect = 3.2
Identities = 17/118 (14%), Positives = 21/118 (17%), Gaps = 19/118 (16%)
Query: 288 QDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSGHYPNP 347
+P PQ + PT P
Sbjct: 51 PSPQAPPPVAQLPQPLPQPPPTQALQALPAGDQQQHNTPT-----------GSPAANPPA 99
Query: 348 YPAHPVPIPNIHPVHAVVPPGSQYYGHHRPP----PAAGSTQQTQTRSCSTTPLCHSY 401
A P PG Q Y P A S + + L Y
Sbjct: 100 TFALP---AGPAGPTIQTEPG-QLYPVQVPVMVTQNPANSPLDQPAQQRALQQLQQRY 153
>gnl|CDD|223535 COG0459, GroL, Chaperonin GroEL (HSP60 family) [Posttranslational
modification, protein turnover, chaperones].
Length = 524
Score = 36.0 bits (84), Expect = 0.034
Identities = 7/35 (20%), Positives = 15/35 (42%), Gaps = 3/35 (8%)
Query: 208 VNDSLCTTLCLQWEPEIIA---VALMYLAGKLSKF 239
+ D+L E +I+ A + A +L ++
Sbjct: 380 IEDALNVVRAAVEEGKIVPGGGAAEIEAALRLREY 414
>gnl|CDD|237191 PRK12757, PRK12757, cell division protein FtsN; Provisional.
Length = 256
Score = 35.4 bits (82), Expect = 0.035
Identities = 16/45 (35%), Positives = 18/45 (40%)
Query: 280 QSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSA 324
Q QQQA QQ P T PP Q+ P P V +A
Sbjct: 120 QIQQQAQQQQPPATTAQPQPVTPPRQTTAPVQPQTPAPVRTQPAA 164
Score = 30.4 bits (69), Expect = 1.7
Identities = 13/61 (21%), Positives = 15/61 (24%)
Query: 280 QSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPP 339
+ Q Q QP +T P P P P A Q V P
Sbjct: 114 VPRSTVQIQQQAQQQQPPATTAQPQPVTPPRQTTAPVQPQTPAPVRTQPAAPVTQAVEAP 173
Query: 340 P 340
Sbjct: 174 K 174
Score = 29.6 bits (67), Expect = 2.9
Identities = 12/71 (16%), Positives = 18/71 (25%), Gaps = 15/71 (21%)
Query: 279 SQSQQQASQQ---------DSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLG 329
Q+ Q PP T +P PP+ P + + A P
Sbjct: 109 EQTPQVPRSTVQIQQQAQQQQPPATTAQPQPVTPPRQTTAPVQPQTPAPVRTQPAAPV-- 166
Query: 330 AQYPQRVAPPP 340
+ P
Sbjct: 167 ----TQAVEAP 173
Score = 28.9 bits (65), Expect = 4.4
Identities = 15/65 (23%), Positives = 21/65 (32%), Gaps = 2/65 (3%)
Query: 273 QVLDLYSQSQQQASQQDSPPQTQPEPSTKP-PPQSHLPPSKLKPDLVA-PPHSAIPTLGA 330
Q+ Q Q A+ P T P +T P PQ+ P V +
Sbjct: 120 QIQQQAQQQQPPATTAQPQPVTPPRQTTAPVQPQTPAPVRTQPAAPVTQAVEAPKVEAEK 179
Query: 331 QYPQR 335
+ QR
Sbjct: 180 EKEQR 184
>gnl|CDD|223039 PHA03307, PHA03307, transcriptional regulator ICP4; Provisional.
Length = 1352
Score = 35.9 bits (83), Expect = 0.048
Identities = 19/124 (15%), Positives = 30/124 (24%), Gaps = 10/124 (8%)
Query: 285 ASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSGHY 344
+PP P P PP PDL + + + G +
Sbjct: 102 REGSPTPPGPSSPDPPPPTPPPASPPPSPAPDL-SEMLRPVGSPGPPPAASPPAAGAS-- 158
Query: 345 PNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQTRSCSTTPLCHSYPGS 404
P + + R P + + T + +P S
Sbjct: 159 PAAVASDAASSRQAALPLSSPEE------TARAPSSPPAEPPPSTPPAAASPRPPR-RSS 211
Query: 405 PIST 408
PIS
Sbjct: 212 PISA 215
Score = 33.6 bits (77), Expect = 0.20
Identities = 24/108 (22%), Positives = 32/108 (29%), Gaps = 6/108 (5%)
Query: 279 SQSQQQASQQDSPPQTQPEPSTKPP---PQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQR 335
S +++ + P P P T PP P S P V P
Sbjct: 99 SPAREGSPTPPGPSSPDPPPPTPPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGAS 158
Query: 336 VAPPPSGHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGS 383
A S + A P+ P A P ++ PPAA S
Sbjct: 159 PAAVASDAASSRQAALPLSSPE-ETARAPSSPPAEPPP--STPPAAAS 203
Score = 31.3 bits (71), Expect = 1.3
Identities = 25/129 (19%), Positives = 40/129 (31%), Gaps = 6/129 (4%)
Query: 292 PQTQPEPSTKPPPQSHL--PPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSGHYPNPYP 349
+ EP T PPP P ++ + + P A+ P PS P P
Sbjct: 61 ACDRFEPPTGPPPGPGTEAPANESRSTPTWSLSTLAPASPAREGSPTPPGPSSPDPPPPT 120
Query: 350 AHPV-PIPNIHPVHAVVPPGSQYYGHH-RPPPAAGSTQQTQTRSCSTTPLCHSYPGS--P 405
P P P+ P + + G P A S + + + P S
Sbjct: 121 PPPASPPPSPAPDLSEMLRPVGSPGPPPAASPPAAGASPAAVASDAASSRQAALPLSSPE 180
Query: 406 ISTTCSTTP 414
+ ++P
Sbjct: 181 ETARAPSSP 189
>gnl|CDD|227357 COG5024, COG5024, Cyclin [Cell division and chromosome
partitioning].
Length = 440
Score = 34.4 bits (79), Expect = 0.11
Identities = 30/118 (25%), Positives = 49/118 (41%), Gaps = 18/118 (15%)
Query: 118 CLFLAGKVEETPKKCKDVIKTARMLVDDKKFKTFGDDPREEVMTLERILLQTIKFDLQVD 177
LF+A K EE C + D + T G R++++ ER +L+ + F++
Sbjct: 262 ALFIASKYEEV--NCP--------SIKDLVYATDGAFTRDDIIRAERYMLEVLDFNISWP 311
Query: 178 HPYSFLLKYAKSLKGD--KTKLEKMLQMAWTFVNDSLCTTLCLQWEPEIIAVALMYLA 233
P SFL + +K+ D K + V+ +Q P A A MYL+
Sbjct: 312 SPMSFLRRISKASDYDIFSRTPAKFS-SEISPVDYKF-----IQISPSWCAAAAMYLS 363
>gnl|CDD|220950 pfam11029, DAZAP2, DAZ associated protein 2 (DAZAP2). DAZ
associated protein 2 has a highly conserved sequence
throughout evolution including a conserved polyproline
region and several SH2/SH3 binding sites. It occurs as a
single copy gene with a four-exon organisation and is
located on chromosome 12. It encodes a ubiquitously
expressed protein and binds to DAZ and DAZL1 through DAZ
repeats.
Length = 136
Score = 32.5 bits (74), Expect = 0.14
Identities = 20/76 (26%), Positives = 29/76 (38%), Gaps = 8/76 (10%)
Query: 278 YSQSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVA 337
YS+ Q + Q + P P +LP +++ P S+ P + A YP
Sbjct: 8 YSELYQPSYVVPPQAQMPQASAPYPGPSMYLPMAQVMA---VGPQSSHPPM-AYYPIGAP 63
Query: 338 PPPSGHYPNPYPAHPV 353
PP YP P V
Sbjct: 64 PPV---YP-PGSTVLV 75
Score = 31.0 bits (70), Expect = 0.50
Identities = 19/71 (26%), Positives = 24/71 (33%), Gaps = 2/71 (2%)
Query: 320 PPHSAI-PTLGAQYPQRVAPPPSGHYPNPYPAHPVPIPNIHPVHAVVPPGSQY-YGHHRP 377
P +S + PQ P S YP P P+ + PP + Y G P
Sbjct: 6 PAYSELYQPSYVVPPQAQMPQASAPYPGPSMYLPMAQVMAVGPQSSHPPMAYYPIGAPPP 65
Query: 378 PPAAGSTQQTQ 388
GST Q
Sbjct: 66 VYPPGSTVLVQ 76
>gnl|CDD|237057 PRK12323, PRK12323, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 700
Score = 34.1 bits (78), Expect = 0.14
Identities = 28/102 (27%), Positives = 35/102 (34%), Gaps = 15/102 (14%)
Query: 290 SPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLG-AQYPQRVAPPP-------- 340
+ P QP P+ P + P+ A P +A A P R +P P
Sbjct: 379 AAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQ 438
Query: 341 -SGHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAA 381
S P PA P P P P A P + RP AA
Sbjct: 439 ASARGPGGAPA-PAPAPAAAPAAAARPAAAGP----RPVAAA 475
Score = 32.9 bits (75), Expect = 0.36
Identities = 23/108 (21%), Positives = 31/108 (28%), Gaps = 14/108 (12%)
Query: 285 ASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPP---- 340
A P P + +P P + A P A P A P PPP
Sbjct: 447 APAPAPAPAAAPAAAARPAAAGPRPVAAAAA---AAPARAAPA-AAPAPADDDPPPWEEL 502
Query: 341 SGHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQ 388
+ +P PA P P ++ P + P A T
Sbjct: 503 PPEFASPAPAQPDAAPAGWVAESIPDPAT------ADPDDAFETLAPA 544
Score = 32.5 bits (74), Expect = 0.44
Identities = 18/101 (17%), Positives = 23/101 (22%), Gaps = 8/101 (7%)
Query: 291 PPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTL---GAQYPQRVAPPPSGHYPNP 347
P P+ P + P +P A+ A+ P P P
Sbjct: 399 PAAPPAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGAPAPAPAPAAAP 458
Query: 348 YPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQ 388
A PV A R PAA
Sbjct: 459 AAAARPAAAGPRPVAAAAAAAPA-----RAAPAAAPAPADD 494
Score = 29.1 bits (65), Expect = 5.2
Identities = 13/84 (15%), Positives = 20/84 (23%), Gaps = 1/84 (1%)
Query: 284 QASQQDSPPQTQPE-PSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSG 342
A + + P+ + P L ++ A A P A P +
Sbjct: 408 AAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGAPAPAPAPAAAPAAAARPAAA 467
Query: 343 HYPNPYPAHPVPIPNIHPVHAVVP 366
A P A P
Sbjct: 468 GPRPVAAAAAAAPARAAPAAAPAP 491
Score = 29.1 bits (65), Expect = 5.3
Identities = 17/105 (16%), Positives = 29/105 (27%)
Query: 285 ASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSGHY 344
A+ +P + P + P P +L P+ +P + A + P P+
Sbjct: 474 AAAAAAPARAAPAAAPAPADDDPPPWEELPPEFASPAPAQPDAAPAGWVAESIPDPATAD 533
Query: 345 PNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQT 389
P+ P P P P +A
Sbjct: 534 PDDAFETLAPAPAAAPAPRAAAATEPVVAPRPPRASASGLPDMFD 578
Score = 29.1 bits (65), Expect = 5.8
Identities = 14/82 (17%), Positives = 22/82 (26%)
Query: 300 TKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSGHYPNPYPAHPVPIPNIH 359
P + P ++ P AP +A AP + A P
Sbjct: 372 AGPATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAPE 431
Query: 360 PVHAVVPPGSQYYGHHRPPPAA 381
+ A ++ G P A
Sbjct: 432 ALAAARQASARGPGGAPAPAPA 453
Score = 28.7 bits (64), Expect = 6.4
Identities = 15/98 (15%), Positives = 21/98 (21%), Gaps = 5/98 (5%)
Query: 293 QTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSGHYPNPYPAHP 352
P + P P + P + A A P P
Sbjct: 371 GAGPATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRSPAP 430
Query: 353 VPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQTR 390
+ A P G+ P PA + R
Sbjct: 431 EALAAARQASARGPGGAP-----APAPAPAAAPAAAAR 463
>gnl|CDD|237862 PRK14948, PRK14948, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 620
Score = 34.2 bits (79), Expect = 0.15
Identities = 23/80 (28%), Positives = 26/80 (32%), Gaps = 8/80 (10%)
Query: 281 SQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPP 340
Q S S P P K PP P +P APP + P P A
Sbjct: 511 ESQSGSA--SNTAKTPPPPQKSPPPPAPTPPLPQPTATAPPPTPPP------PPPTATQA 562
Query: 341 SGHYPNPYPAHPVPIPNIHP 360
S + P PA P P I
Sbjct: 563 SSNAPAQIPADSSPPPPIPE 582
Score = 33.8 bits (78), Expect = 0.16
Identities = 18/80 (22%), Positives = 25/80 (31%), Gaps = 1/80 (1%)
Query: 280 QSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPP 339
+SQ ++ + P+ S PP + P P PT A
Sbjct: 511 ESQSGSASNTAKTPPPPQKSPPPPAPTPPLPQPTATAPPPTPPPPPPTATQASSNAPAQI 570
Query: 340 PSGHYP-NPYPAHPVPIPNI 358
P+ P P P P P P
Sbjct: 571 PADSSPPPPIPEEPTPSPTK 590
Score = 31.9 bits (73), Expect = 0.74
Identities = 15/78 (19%), Positives = 21/78 (26%), Gaps = 3/78 (3%)
Query: 280 QSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPP 339
+ A+ T + PP K K A + P A P V
Sbjct: 364 FISEIANASAPANPTPAPNPSPPPAPIQPSAPKTKQ---AATTPSPPPAKASPPIPVPAE 420
Query: 340 PSGHYPNPYPAHPVPIPN 357
P+ P P P+
Sbjct: 421 PTEPSPTPPANAANAPPS 438
>gnl|CDD|217298 pfam02948, Amelogenin, Amelogenin. Amelogenins play a role in
biomineralisation. They seem to regulate the formation
of crystallites during the secretory stage of tooth
enamel development. thought to play a major role in the
structural organisation and mineralisation of developing
enamel. They are found in the extracellular matrix.
Mutations in X-chromosomal amelogenin can cause
Amelogenesis imperfecta.
Length = 174
Score = 33.0 bits (75), Expect = 0.16
Identities = 17/78 (21%), Positives = 23/78 (29%), Gaps = 2/78 (2%)
Query: 303 PPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSGHYPNPYPAHPVPIPNIHPV- 361
P +P + PH + L Q P P P + P HP+
Sbjct: 52 PLSPQMPQQQQSAHPKLTPHHQLLILPPQQPMMPVPGHHPMVPMTGQQPHLQPPAQHPLQ 111
Query: 362 -HAVVPPGSQYYGHHRPP 378
P Q H +PP
Sbjct: 112 PTYGQNPQPQQPTHTQPP 129
>gnl|CDD|234938 PRK01297, PRK01297, ATP-dependent RNA helicase RhlB; Provisional.
Length = 475
Score = 33.7 bits (77), Expect = 0.18
Identities = 16/76 (21%), Positives = 25/76 (32%), Gaps = 3/76 (3%)
Query: 279 SQSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAP 338
+++Q A SP P PP ++ P +K A P + P +R
Sbjct: 12 GEAEQPAPAPPSPAAAPAPP---PPAKTAAPATKAAAPAAAAPRAEKPKKDKPRRERKPK 68
Query: 339 PPSGHYPNPYPAHPVP 354
P S + P
Sbjct: 69 PASLWKLEDFVVEPQE 84
>gnl|CDD|180523 PRK06305, PRK06305, DNA polymerase III subunits gamma and tau;
Validated.
Length = 451
Score = 33.6 bits (77), Expect = 0.19
Identities = 26/137 (18%), Positives = 52/137 (37%), Gaps = 16/137 (11%)
Query: 216 LCLQWEPEIIAVALMYLAGKLSKFEVVDWIGRNSKHLR---WWEMFVEDMTMDLLEDICH 272
L Q+ ++ +VA Y + +L E++D++G ++KH++ + + F+E + + L+ I
Sbjct: 295 LLKQYNKQLSSVATKYSSEQL--LEIIDFLGESAKHIQLTIFEKTFLETVIIHLIR-IYQ 351
Query: 273 QVLDLYSQSQQQASQQDSPPQTQPEPSTKPPPQSHLPPS-KLKPD-------LVAPPHSA 324
+ SQ ++ P K S + +P L+
Sbjct: 352 RPTLSQLVSQIKSPAFTQLRLESCPPQIKSSNSQQTVSSPQPQPVAKLEQGSLLTAASP- 410
Query: 325 IPTLGAQYPQRVAPPPS 341
T Q V P
Sbjct: 411 -QTKKETIHQEVPSPSL 426
>gnl|CDD|166942 PRK00404, tatB, sec-independent translocase; Provisional.
Length = 141
Score = 31.7 bits (72), Expect = 0.26
Identities = 26/110 (23%), Positives = 41/110 (37%), Gaps = 4/110 (3%)
Query: 244 WIGRNSKHLRWWEMFVE-DMTMDLLEDICHQVLDLYSQSQQQASQQDSPPQTQPEPSTKP 302
WIGR + + VE ++ D +I Q+ + + S +Q +++ P T P P
Sbjct: 35 WIGRLKRSFNAIKQEVEREIGAD---EIRRQLHNEHILSMEQEARKILAPLTPPAPPEPV 91
Query: 303 PPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSGHYPNPYPAHP 352
P + P+ P A+P A A P P P P
Sbjct: 92 TPPTAQSPAPAVPTPPPTSTPAVPPAPAAAVPAPAAAPPPSDPPQPPRAP 141
>gnl|CDD|184923 PRK14959, PRK14959, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 624
Score = 33.1 bits (75), Expect = 0.32
Identities = 24/94 (25%), Positives = 28/94 (29%), Gaps = 12/94 (12%)
Query: 322 HSAIPTLGAQYPQRVAP-----PPSGHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHR 376
+ IPT G Q PQ AP P S P P+ P P + P P + R
Sbjct: 394 AATIPTPGTQGPQGTAPAAGMTPSSAAPATPAPS-AAPSPRV-PWDDAPPAPPRSGIPPR 451
Query: 377 PPPAAGSTQQT-----QTRSCSTTPLCHSYPGSP 405
P P S S P P
Sbjct: 452 PAPRMPEASPVPGAPDSVASASDAPPTLGDPSDT 485
>gnl|CDD|130706 TIGR01645, half-pint, poly-U binding splicing factor, half-pint
family. The proteins represented by this model contain
three RNA recognition motifs (rrm: pfam00076) and have
been characterized as poly-pyrimidine tract binding
proteins associated with RNA splicing factors. In the
case of PUF60 (GP|6176532), in complex with p54, and in
the presence of U2AF, facilitates association of U2
snRNP with pre-mRNA.
Length = 612
Score = 32.7 bits (74), Expect = 0.33
Identities = 21/91 (23%), Positives = 31/91 (34%), Gaps = 8/91 (8%)
Query: 280 QSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSA---IPTLGAQYPQRV 336
+ + + P+T P S LP +V+ +P L P V
Sbjct: 313 MAAEAVAGAAVLGPRAQSPAT---PSSSLPTDIGNKAVVSSAKKEAEEVPPLPQAAPAVV 369
Query: 337 APPPSGHYPNPYPAHPVPIPN-IHPVHAVVP 366
P P P P P + IP+ + P V P
Sbjct: 370 KPGPM-EIPTPVPPPGLAIPSLVAPPGLVAP 399
>gnl|CDD|217453 pfam03251, Tymo_45kd_70kd, Tymovirus 45/70Kd protein. Tymoviruses
are single stranded RNA viruses. This family includes a
protein of unknown function that has been named based on
its molecular weight. Tymoviruses such as the ononis
yellow mosaic tymovirus encode only three proteins. Of
these two are overlapping this protein overlaps a larger
ORF that is thought to be the polymerase.
Length = 458
Score = 32.8 bits (75), Expect = 0.37
Identities = 24/129 (18%), Positives = 35/129 (27%), Gaps = 2/129 (1%)
Query: 277 LYSQSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRV 336
+ S QQ + ++ + Q P + L PP V
Sbjct: 330 IRKNSLQQTRARLGHSKSLGQSPNLRSSQLPPPTKRRLRLLPVPPPKVQALPLTALAPLV 389
Query: 337 APPPSGHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQTRSCSTTP 396
PS P+P A P + H +PP R + S R + P
Sbjct: 390 RHSPSIPLPHPPSALPSHVGASSSKHHRLPP--SVLPGPRLSSPSPSPSLPTRRPGTPPP 447
Query: 397 LCHSYPGSP 405
SP
Sbjct: 448 PASPPTPSP 456
>gnl|CDD|218108 pfam04487, CITED, CITED. CITED, CBP/p300-interacting
transactivator with ED-rich tail, are characterized by a
conserved 32-amino acid sequence at the C-terminus.
CITED proteins do not bind DNA directly and are thought
to function as transcriptional co-activators.
Length = 206
Score = 32.2 bits (73), Expect = 0.39
Identities = 22/92 (23%), Positives = 26/92 (28%), Gaps = 11/92 (11%)
Query: 290 SPPQTQPEPSTKPPPQSHLP-PSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSGHYPNPY 348
P Q+ P P P P L+A + L QY P GH
Sbjct: 71 HPHQSMPAYMMFNPSSKPQPFMLVPGPQLMASMQ--LQKLNTQYQGHAGAPA-GHPGGGG 127
Query: 349 PAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPA 380
P P PPG Q+ PP
Sbjct: 128 PQQFRPGAG-------QPPGMQHMPAPALPPN 152
>gnl|CDD|223033 PHA03291, PHA03291, envelope glycoprotein I; Provisional.
Length = 401
Score = 32.2 bits (73), Expect = 0.43
Identities = 22/96 (22%), Positives = 28/96 (29%), Gaps = 16/96 (16%)
Query: 308 LPPS--KLKPDLVAPPHSAIPTLGAQYPQRVAPPPSGHYPNPYPAHPVPIPNIHPVHAVV 365
LP S +L P V P + PT P PS P P P I A
Sbjct: 190 LPLSAPRLGPADVFVPATPRPTPRTTASPETTPTPSTTTSPPSTTIPAPSTTIAAPQAGT 249
Query: 366 -----------PPGSQYYGHHRPPPAAGSTQQTQTR 390
PG G P A + + ++
Sbjct: 250 TPEAEGTPAPPTPGG---GEAPPANATPAPEASRYE 282
Score = 29.2 bits (65), Expect = 4.3
Identities = 16/72 (22%), Positives = 21/72 (29%), Gaps = 8/72 (11%)
Query: 291 PPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRV--------APPPSG 342
P +P P T P++ PS + T+ A P P G
Sbjct: 205 PATPRPTPRTTASPETTPTPSTTTSPPSTTIPAPSTTIAAPQAGTTPEAEGTPAPPTPGG 264
Query: 343 HYPNPYPAHPVP 354
P A P P
Sbjct: 265 GEAPPANATPAP 276
>gnl|CDD|217307 pfam02984, Cyclin_C, Cyclin, C-terminal domain. Cyclins regulate
cyclin dependent kinases (CDKs). Human CCNO is a
Uracil-DNA glycosylase that is related to other cyclins.
Cyclins contain two domains of similar all-alpha fold,
of which this family corresponds with the C-terminal
domain.
Length = 117
Score = 30.7 bits (70), Expect = 0.44
Identities = 17/55 (30%), Positives = 28/55 (50%), Gaps = 4/55 (7%)
Query: 179 PYSFLLKYAKSLKGDKTKLEKMLQMAWTFVNDSLCTTLCLQWEPEIIAVALMYLA 233
P SFL +++K+ D + +A + SL L++ P +IA A +YLA
Sbjct: 3 PLSFLRRFSKAADYDL----ETRTLAKYLLELSLLDYDFLKYPPSLIAAAAVYLA 53
>gnl|CDD|223880 COG0810, TonB, Periplasmic protein TonB, links inner and outer
membranes [Cell envelope biogenesis, outer membrane].
Length = 244
Score = 31.7 bits (72), Expect = 0.49
Identities = 18/119 (15%), Positives = 29/119 (24%), Gaps = 6/119 (5%)
Query: 290 SPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSGHYPNPYP 349
EP +P P P +P+ P P + ++ P
Sbjct: 48 VLEAPTEEPQPEPEPPEEQPKPPTEPETPPEPTPPKPKEKPKPEKKPKKPKPKP-----K 102
Query: 350 AHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQTRSCSTTPLC-HSYPGSPIS 407
P P P + P P S+ P + + + SY
Sbjct: 103 PKPKPKPKVKPQPKPKKPPSKTAAKAPAAPNQPARPPSAASASGAATGPSASYLSGLRR 161
>gnl|CDD|235899 PRK06975, PRK06975, bifunctional uroporphyrinogen-III
synthetase/uroporphyrin-III C-methyltransferase;
Reviewed.
Length = 656
Score = 32.4 bits (74), Expect = 0.52
Identities = 10/60 (16%), Positives = 13/60 (21%)
Query: 271 CHQVLDLYSQSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGA 330
D +Q A T S P + P+ A P G
Sbjct: 262 FLTWADAAAQPATAAPAPSRMTDTNDSKSVTSQPAAAAAAPAPPPNPPATPPEPPARRGR 321
>gnl|CDD|217469 pfam03276, Gag_spuma, Spumavirus gag protein.
Length = 582
Score = 32.2 bits (73), Expect = 0.53
Identities = 26/87 (29%), Positives = 30/87 (34%), Gaps = 8/87 (9%)
Query: 296 PEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSGHYPNPYPAHPVPI 355
P PPP L P + SA T G + P RV+ P P P PA P
Sbjct: 183 PGAIQPPPPS---SLPGLPPGSSSLAPSASSTPGNRLP-RVSFNP--FLPGPSPAQP-SA 235
Query: 356 PNIHPVHAVVPPGSQYYGHHRPPPAAG 382
P +PP QY P P
Sbjct: 236 PPASIPAPPIPPVIQYV-APPPVPPPQ 261
Score = 30.7 bits (69), Expect = 1.6
Identities = 24/93 (25%), Positives = 31/93 (33%), Gaps = 4/93 (4%)
Query: 279 SQSQQQASQQDSPPQTQP-EPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVA 337
Q +S PP + PS P + LP P L P + A P
Sbjct: 186 IQPPPPSSLPGLPPGSSSLAPSASSTPGNRLPRVSFNPFLPGPSPAQPSAPPASIPAPPI 245
Query: 338 PPPSGHY---PNPYPAHPVPIPNIHPVHAVVPP 367
PP + P P P +PI +I V P
Sbjct: 246 PPVIQYVAPPPVPPPQPIIPIQHIRAVTGETPA 278
>gnl|CDD|237868 PRK14960, PRK14960, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 702
Score = 32.3 bits (73), Expect = 0.55
Identities = 22/83 (26%), Positives = 34/83 (40%), Gaps = 6/83 (7%)
Query: 277 LYSQSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKL--KPDLVAPPHSAIPTLGAQYPQ 334
L S+ QQ Q + +Q + + + P S + P ++ +P +V P P +
Sbjct: 367 LVSEPVQQNGQAEVGLNSQAQTAQEITPVSAVQPVEVISQPAMVEPEPEPEPEPEPEPEP 426
Query: 335 RVAPPPSGHYPNPYPAHPVPIPN 357
P P P P P P P PN
Sbjct: 427 EPEPEPE---PEPEPE-PEPQPN 445
>gnl|CDD|197891 smart00818, Amelogenin, Amelogenins, cell adhesion proteins, play a
role in the biomineralisation of teeth. They seem to
regulate formation of crystallites during the secretory
stage of tooth enamel development and are thought to
play a major role in the structural organisation and
mineralisation of developing enamel. The extracellular
matrix of the developing enamel comprises two major
classes of protein: the hydrophobic amelogenins and the
acidic enamelins. Circular dichroism studies of porcine
amelogenin have shown that the protein consists of 3
discrete folding units: the N-terminal region appears to
contain beta-strand structures, while the C-terminal
region displays characteristics of a random coil
conformation. Subsequent studies on the bovine protein
have indicated the amelogenin structure to contain a
repetitive beta-turn segment and a "beta-spiral" between
Gln112 and Leu138, which sequester a (Pro, Leu, Gln)
rich region. The beta-spiral offers a probable site for
interactions with Ca2+ ions. Muatations in the human
amelogenin gene (AMGX) cause X-linked hypoplastic
amelogenesis imperfecta, a disease characterised by
defective enamel. A 9bp deletion in exon 2 of AMGX
results in the loss of codons for Ile5, Leu6, Phe7 and
Ala8, and replacement by a new threonine codon,
disrupting the 16-residue (Met1-Ala16) amelogenin signal
peptide.
Length = 165
Score = 30.9 bits (70), Expect = 0.62
Identities = 22/101 (21%), Positives = 29/101 (28%), Gaps = 7/101 (6%)
Query: 286 SQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAI-------PTLGAQYPQRVAP 338
SQQ P T P + P +P + P ++ P L Q P
Sbjct: 43 SQQHPPTHTLQPHHHIPVLPAQQPVVPQQPLMPVPGQHSMTPTQHHQPNLPQPAQQPFQP 102
Query: 339 PPSGHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPP 379
P P P P + P PP + PP
Sbjct: 103 QPLQPPQPQQPMQPQPPVHPIPPLPPQPPLPPMFPMQPLPP 143
>gnl|CDD|172376 PRK13855, PRK13855, type IV secretion system protein VirB10;
Provisional.
Length = 376
Score = 31.8 bits (72), Expect = 0.63
Identities = 19/75 (25%), Positives = 27/75 (36%), Gaps = 12/75 (16%)
Query: 282 QQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPS 341
+ + + +PP T +TKP H P + PD A + PT APP +
Sbjct: 52 RSKKENEPAPPSTMIATNTKP---FHPAPIDVPPDPPAAQEAVQPT---------APPSA 99
Query: 342 GHYPNPYPAHPVPIP 356
P P P
Sbjct: 100 QSEPERNEPRPEETP 114
>gnl|CDD|237866 PRK14952, PRK14952, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 584
Score = 31.8 bits (72), Expect = 0.70
Identities = 15/46 (32%), Positives = 19/46 (41%)
Query: 285 ASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGA 330
A+ + PQ P PS P H P + +P L P S P A
Sbjct: 389 ANLLHNAPQAAPAPSAAAPEPKHQPAPEPRPVLAPTPASGEPNAAA 434
>gnl|CDD|222912 PHA02624, PHA02624, large T antigen; Provisional.
Length = 647
Score = 31.9 bits (73), Expect = 0.75
Identities = 20/69 (28%), Positives = 25/69 (36%), Gaps = 8/69 (11%)
Query: 253 RWWEMFVEDMTMDLLEDICHQVLDLYSQSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSK 312
+WWE F E DL C + +L S + SPP P + S P K
Sbjct: 91 QWWEEFNEKWDEDLF---CDE--ELSSSDDEDEPPPPSPP---PSQEEESQSSSQATPPK 142
Query: 313 LKPDLVAPP 321
K AP
Sbjct: 143 KKKKNPAPN 151
>gnl|CDD|236362 PRK09010, PRK09010, single-stranded DNA-binding protein;
Provisional.
Length = 177
Score = 31.0 bits (70), Expect = 0.76
Identities = 10/34 (29%), Positives = 13/34 (38%), Gaps = 1/34 (2%)
Query: 278 YSQSQQ-QASQQDSPPQTQPEPSTKPPPQSHLPP 310
+ Q QQ Q Q S + P S+ PP
Sbjct: 135 WGQPQQPQGGNQFSGGAQSRPQQSAPAAPSNEPP 168
>gnl|CDD|200219 TIGR02927, SucB_Actino, 2-oxoglutarate dehydrogenase, E2 component,
dihydrolipoamide succinyltransferase. This model
represents an Actinobacterial clade of E2 enzyme, a
component of the 2-oxoglutarate dehydrogenase complex
involved in the TCA cycle. These proteins have multiple
domains including the catalytic domain (pfam00198), one
or two biotin domains (pfam00364) and an E3-component
binding domain (pfam02817).
Length = 579
Score = 31.9 bits (72), Expect = 0.78
Identities = 19/79 (24%), Positives = 26/79 (32%), Gaps = 11/79 (13%)
Query: 289 DSPPQTQPEPSTKPPPQSHLPPSKLKPDLVA-PPHSAIPTLGAQYPQRVAPPPSGHYPNP 347
D+ + P S S+ PD A PH+A P AP P+
Sbjct: 202 DANAAPAEPAEEEAPAPSE-AGSEPAPDPAARAPHAA--------PDPPAPAPAPAKTAA 252
Query: 348 YPAHPVPIPNIHPVHAVVP 366
PA P+ + V P
Sbjct: 253 -PAAAAPVSSGDSGPYVTP 270
Score = 30.8 bits (69), Expect = 1.5
Identities = 14/59 (23%), Positives = 23/59 (38%), Gaps = 4/59 (6%)
Query: 284 QASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSG 342
+A + +P +PE + +P + P P AP +A G+ V P G
Sbjct: 81 EAGSEPAPAAPEPEAAPEPEAPAPAP----TPAAEAPAPAAPQAGGSGEATEVKMPELG 135
>gnl|CDD|216078 pfam00716, Peptidase_S21, Assemblin (Peptidase family S21).
Length = 326
Score = 31.2 bits (71), Expect = 0.87
Identities = 17/85 (20%), Positives = 20/85 (23%), Gaps = 12/85 (14%)
Query: 281 SQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQY-----PQR 335
S Q P PP S + + L A + Y Q
Sbjct: 245 PSVSPSGQPLSPAA-------PPGTSSVAGTALSASPAALFGDMVYVPLDAYNQLLAGQA 297
Query: 336 VAPPPSGHYPNPYPAHPVPIPNIHP 360
PP P P P P P
Sbjct: 298 FNQPPDPQGPAPPAELAPPAPAPPP 322
>gnl|CDD|130689 TIGR01628, PABP-1234, polyadenylate binding protein, human types 1,
2, 3, 4 family. These eukaryotic proteins recognize the
poly-A of mRNA and consists of four tandem RNA
recognition domains at the N-terminus (rrm: pfam00076)
followed by a PABP-specific domain (pfam00658) at the
C-terminus. The protein is involved in the transport of
mRNA's from the nucleus to the cytoplasm. There are four
paralogs in Homo sapiens which are expressed in testis
(GP:11610605_PABP3 ), platelets (SP:Q13310_PABP4 ),
broadly expressed (SP:P11940_PABP1) and of unknown
tissue range (SP:Q15097_PABP2).
Length = 562
Score = 31.7 bits (72), Expect = 0.89
Identities = 21/87 (24%), Positives = 29/87 (33%), Gaps = 9/87 (10%)
Query: 282 QQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPS 341
Q Q + P P S P P P L+P+ +AP +A+ PP
Sbjct: 403 QGPQQQFNGQPLGWPRMSMMPTPMG--PGGPLRPNGLAPM-NAVRAPSRNAQNAAQKPPM 459
Query: 342 G--HYPNPYPAHPV----PIPNIHPVH 362
YP Y + P+ P P
Sbjct: 460 QPVMYPPNYQSLPLSQDLPQPQSTASQ 486
>gnl|CDD|226348 COG3827, COG3827, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 231
Score = 31.0 bits (70), Expect = 0.97
Identities = 12/78 (15%), Positives = 16/78 (20%), Gaps = 5/78 (6%)
Query: 280 QSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPP 339
++ A + Q Q K P S +K A P A A
Sbjct: 80 IARDAAPGPAAVAQAQNPDEKKNEPASI--EDIVKEISGVIAPKARPPKNAAGENAPASE 137
Query: 340 P---SGHYPNPYPAHPVP 354
S
Sbjct: 138 DRPESTEAVTQSEEATAI 155
>gnl|CDD|237081 PRK12372, PRK12372, ribonuclease III; Reviewed.
Length = 413
Score = 31.4 bits (71), Expect = 1.00
Identities = 14/68 (20%), Positives = 21/68 (30%), Gaps = 6/68 (8%)
Query: 289 DSPPQTQPEPSTKPPPQSHLPPSKL--KPDLVAPPHSAIPTLGAQYPQRVAPPPSGHYPN 346
+ P + P + KP Q+ P S KP A + P + A P + P
Sbjct: 342 EKPAEAAPRAADKPAGQAADPASSSADKPGASADAAARTPA----RARDAAAPDADTPPG 397
Query: 347 PYPAHPVP 354
Sbjct: 398 GASLAAAQ 405
>gnl|CDD|218181 pfam04621, ETS_PEA3_N, PEA3 subfamily ETS-domain transcription
factor N terminal domain. The N terminus of the PEA3
transcription factors is implicated in transactivation
and in inhibition of DNA binding. Transactivation is
potentiated by activation of the Ras/MAP kinase and
protein kinase A signalling cascades. The N terminal
region contains conserved MAP kinase phosphorylation
sites.
Length = 336
Score = 31.0 bits (70), Expect = 1.0
Identities = 16/101 (15%), Positives = 22/101 (21%), Gaps = 3/101 (2%)
Query: 278 YSQSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVA 337
+ + + P + P P +PP P
Sbjct: 121 AYDRKPASGFKPPTPPSTPCSPVNPQETVRQLQPSGPLSNSSPPSPHTPLPNQSPLPPPM 180
Query: 338 PPPSGHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPP 378
P YP H P PP + RPP
Sbjct: 181 SSPDSSYP---SEHRFQRQLSEPCLPFPPPPGRGSRDGRPP 218
>gnl|CDD|114270 pfam05539, Pneumo_att_G, Pneumovirinae attachment membrane
glycoprotein G.
Length = 408
Score = 31.2 bits (70), Expect = 1.0
Identities = 29/117 (24%), Positives = 45/117 (38%), Gaps = 18/117 (15%)
Query: 281 SQQQASQQDSPPQTQPEPSTKPPPQS-HLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPP 339
+Q + + PQT+P PS + P S PPS D + G ++ QR P
Sbjct: 221 TQGTTTSSNPEPQTEPPPSQRGPSGSPQHPPSTTSQD------QSTTGDGQEHTQRRKTP 274
Query: 340 PSGHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQTRSCSTTP 396
P+ N H A PP ++ RP P +T Q+ + ++P
Sbjct: 275 PA--TSNRRSPHS---------TATPPPTTKRQETGRPTPRPTATTQSGSSPPHSSP 320
Score = 30.4 bits (68), Expect = 2.1
Identities = 25/135 (18%), Positives = 47/135 (34%), Gaps = 6/135 (4%)
Query: 279 SQSQQQASQQDSPP-QTQPEPSTKPPPQSHLPPS-KLKPDLVAPPHSAIPTLGAQYPQRV 336
S++ ++ P +Q P ++P Q H + + P + T + +
Sbjct: 175 SKTTSWPTEVSHPTYPSQVTPQSQPATQGHQTATANQRLSSTEPVGTQGTTTSSNPEPQT 234
Query: 337 APPPSGHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQT--QTRSCST 394
PPPS P+ P HP + + G ++ + PPA + + T
Sbjct: 235 EPPPSQRGPSGSPQHPPSTTS--QDQSTTGDGQEHTQRRKTPPATSNRRSPHSTATPPPT 292
Query: 395 TPLCHSYPGSPISTT 409
T + +P T
Sbjct: 293 TKRQETGRPTPRPTA 307
>gnl|CDD|218397 pfam05044, Prox1, Homeobox prospero-like protein (PROX1). The
homeobox gene Prox1 is expressed in a subpopulation of
endothelial cells that, after budding from veins, gives
rise to the mammalian lymphatic system. Prox1 has been
found to be an early specific marker for the developing
liver and pancreas in the mammalian foregut endoderm.
This family contains an atypical homeobox domain.
Length = 908
Score = 31.2 bits (70), Expect = 1.2
Identities = 29/156 (18%), Positives = 46/156 (29%), Gaps = 24/156 (15%)
Query: 280 QSQQQASQQDSPPQTQPEP--STKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVA 337
Q QD+P QT+ T + + +++ P V+ + +G
Sbjct: 558 CMPQTPQPQDAPEQTEALSLVVTPKKKRHKVTDTRITPRTVSRILALRDAVGPAAGTHHQ 617
Query: 338 P-PPSGHYPN-----PYPAHPVP------IPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQ 385
P PS + P HP P IPN + V G + H P+
Sbjct: 618 PLHPSSLSASMGFHPPPFRHPFPLPLTVAIPNPSLHQSEVFMGYPFQSPHLGAPSGSPPG 677
Query: 386 QTQTRSCSTTPLC----------HSYPGSPISTTCS 411
+ + P H SP + S
Sbjct: 678 KDRDSPDLPRPTTSLHPKLLSAHHHPGSSPDGLSLS 713
>gnl|CDD|220708 pfam10349, WWbp, WW-domain ligand protein. The WWbp domain is
characterized by several short PY and PT-like motifs of
the PPPPY form. These appear to bind directly to the WW
domains of WWP1 and WWP2 and other such diverse proteins
as dystrophin and YAP (Yes-associated protein). This is
the WW-domain binding protein WWbp via PY and PY_like
motifs. The presence of a phosphotyrosine residue in the
pWBP-1 peptide abolishes WW domain binding which
suggests a potential regulatory role for tyrosine
phosphorylation in modulating WW domain-ligand
interactions. Given the likelihood that WWP1 and WWP2
function as E3 ubiquitin-protein ligases, it is possible
that initial substrate-specific recognition occurs via
WW domain-substrate protein interaction followed by
ubiquitin transfer and subsequent proteolysis. This
domain lies just downstream of the GRAM (pfam02893) in
many members.
Length = 111
Score = 29.3 bits (66), Expect = 1.4
Identities = 19/55 (34%), Positives = 21/55 (38%), Gaps = 5/55 (9%)
Query: 332 YPQRVAPPPSGHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQ 386
Y Q APPP G P YPA P P P ++ Q PPP Q
Sbjct: 62 YGQYAAPPPYGPPPPYYPAPPGVYPTPPPPNSGYMADPQ-----EPPPPYPGPPQ 111
Score = 27.4 bits (61), Expect = 6.6
Identities = 15/76 (19%), Positives = 20/76 (26%), Gaps = 7/76 (9%)
Query: 330 AQYPQRVAPPPSGHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQT 389
AQ Q V+ + P H P+P A PP + P P
Sbjct: 34 AQRAQPVSRESGYYPPPGAYVHLEPLPAYGQYAA--PPPYGPPPPYYPAPPGVYPTPPP- 90
Query: 390 RSCSTTPLCHSYPGSP 405
+ P P
Sbjct: 91 ----PNSGYMADPQEP 102
>gnl|CDD|218621 pfam05518, Totivirus_coat, Totivirus coat protein.
Length = 753
Score = 30.9 bits (70), Expect = 1.4
Identities = 19/99 (19%), Positives = 26/99 (26%), Gaps = 13/99 (13%)
Query: 291 PPQTQPEPSTKP----PPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSGHYPN 346
P P P P P ++ P H+A+ A R PP G
Sbjct: 657 PGVPGPVPVGMPAHTARPSR---VARGDPVRPTAHHAALR---APQAPRPGGPPGGGGGL 710
Query: 347 PYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQ 385
P P P + + P G+ Q
Sbjct: 711 PPPPDLPAAAGPAPCGSSLIASPT---APPEPEPPGAEQ 746
>gnl|CDD|216412 pfam01285, TEA, TEA/ATTS domain family.
Length = 424
Score = 30.9 bits (70), Expect = 1.4
Identities = 11/50 (22%), Positives = 18/50 (36%), Gaps = 1/50 (2%)
Query: 319 APPHSAIPTLGAQYPQRVAPPPSGHYP-NPYPAHPVPIPNIHPVHAVVPP 367
A I G + + P S HY + A+ + + P +PP
Sbjct: 143 AQHLFYIIPGGPSWRTSIKPFSSSHYGSHNSSAYSDHLRPLQPYSGELPP 192
>gnl|CDD|221188 pfam11725, AvrE, Pathogenicity factor. This family is secreted by
gram-negative Gammaproteobacteria such as Pseudomonas
syringae of tomato and the fire blight plant pathogen
Erwinia amylovora, amongst others. It is an essential
pathogenicity factor of approximately 198 kDa. Its
injection into the host-plant is dependent upon the
bacterial type III or Hrp secretion system. The family
is long and carries a number of predicted functional
regions, including an ERMS or endoplasmic reticulum
membrane retention signal at both the C- and the
N-termini, a leucine-zipper motif from residues 539-560,
and a nuclear localisation signal at 1358-1361. this
conserved AvrE-family of effectors is among the few that
are required for full virulence of many phytopathogenic
pseudomonads, erwinias and pantoeas.
Length = 1771
Score = 30.9 bits (70), Expect = 1.7
Identities = 28/150 (18%), Positives = 38/150 (25%), Gaps = 22/150 (14%)
Query: 279 SQSQQQASQQDS--------------PPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSA 324
QS + +Q+ S P Q S+ P PP L P AP S
Sbjct: 29 QQSSESPTQRASHSLASEGKKNRKKMPKVFQK--SSAPRQIQAAPPQALNPTAAAPQSSR 86
Query: 325 IPTLGAQYPQRVAPPPSGHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGST 384
PTL + P V RP +
Sbjct: 87 GPTLRELLALPEDDGETQA------PESSPSARRLTRSEGVARHEMEDLAGRPVVKPDAD 140
Query: 385 QQTQTRSCSTTPLCHSYPGSPISTTCSTTP 414
+Q + + + P S T S P
Sbjct: 141 RQLRQDILNKSSSSRRPPVSKEEGTSSKMP 170
>gnl|CDD|148303 pfam06619, DUF1149, Protein of unknown function (DUF1149). This
family consists of several hypothetical bacterial
proteins of unknown function.
Length = 126
Score = 29.2 bits (66), Expect = 1.7
Identities = 10/29 (34%), Positives = 15/29 (51%)
Query: 143 VDDKKFKTFGDDPREEVMTLERILLQTIK 171
+ D+ F + +EEV L R LL +K
Sbjct: 78 ILDRLFGEPSEFEQEEVEKLARPLLDILK 106
>gnl|CDD|221562 pfam12406, DUF3664, Surface protein. This family of proteins is
found in eukaryotes. Proteins in this family are
typically between 131 and 312 amino acids in length.
Length = 100
Score = 28.9 bits (64), Expect = 1.7
Identities = 20/77 (25%), Positives = 30/77 (38%), Gaps = 3/77 (3%)
Query: 273 QVLDLYSQSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQY 332
Q++D QSQ+ Q+ PQ +P T+P L P + ++ P S P Q
Sbjct: 22 QLIDQAEQSQEPTQQEPIEPQQPTQPETEP---EELEPETVTVEVPEPVTSEEPKESDQT 78
Query: 333 PQRVAPPPSGHYPNPYP 349
+ S P P
Sbjct: 79 EEPTETQDSKQEPTQQP 95
>gnl|CDD|219805 pfam08347, CTNNB1_binding, N-terminal CTNNB1 binding. This region
tends to appear at the N-terminus of proteins also
containing DNA-binding HMG (high mobility group) boxes
(pfam00505) and appears to bind the armadillo repeat of
CTNNB1 (beta-catenin), forming a stable complex.
Signaling by Wnt through TCF/LCF is involved in
developmental patterning, induction of neural tissues,
cell fate decisions and stem cell differentiation.
Isoforms of HMG T-cell factors lacking the N-terminal
CTNNB1-binding domain cannot fulfill their role as
transcriptional activators in T-cell differentiation.
Length = 200
Score = 29.9 bits (67), Expect = 1.8
Identities = 23/121 (19%), Positives = 29/121 (23%), Gaps = 15/121 (12%)
Query: 267 LEDICHQVLDLYSQSQQQASQQDSPPQTQPEPSTKPP-PQSHLPPSKLKPDLVAPPHSAI 325
LED+ H+ D+ S P P H
Sbjct: 83 LEDVSHKHQDMGYYKGPPYSGYPFLMLPNPYLPNGSLSPLPPSSNKVPVVQPPHHVHPLT 142
Query: 326 PTLGAQYPQRVAPPPSGHYPN--------PYPAHPVPIPNIHPVHAVVPPGSQYYGHHRP 377
P + P H P P P HP I +P + PG G
Sbjct: 143 PLITYSNEHFSPGTPPPHLPYDVDPKTGIPRPPHPPDISPFYP----LSPGGV--GQIPH 196
Query: 378 P 378
P
Sbjct: 197 P 197
>gnl|CDD|219897 pfam08549, SWI-SNF_Ssr4, Fungal domain of unknown function
(DUF1750). This is a fungal domain of unknown function.
Length = 669
Score = 30.8 bits (69), Expect = 1.8
Identities = 20/69 (28%), Positives = 25/69 (36%), Gaps = 8/69 (11%)
Query: 326 PTLGAQYPQRVAPPPSGHYPNPYPAHPVPIPNIHP-VHAVVPPGSQYYGH-----HRPPP 379
PT+ P ++A P PYP V +P AV P + GH HR
Sbjct: 200 PTIP--LPPQMAGQSMYQPPGPYPNAMVGRQPFYPQPGAVAGPPKRRGGHKAPRGHRASQ 257
Query: 380 AAGSTQQTQ 388
AAG
Sbjct: 258 AAGQAASLD 266
>gnl|CDD|225987 COG3456, COG3456, Predicted component of the type VI protein
secretion system, contains a FHA domain [Intracellular
trafficking, secretion, and vesicular transport;
Signal transduction mechanisms].
Length = 430
Score = 30.5 bits (69), Expect = 1.9
Identities = 16/103 (15%), Positives = 26/103 (25%), Gaps = 7/103 (6%)
Query: 289 DSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSGHYPNPY 348
D +PEP P P +L + P + L A + + P P
Sbjct: 131 DFLADIEPEPR---LPPGLTDPLQLADPV--PNGFSADPLSALQSESLIAQPDPAGGAPS 185
Query: 349 PAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQTRS 391
+ P P A + P ++
Sbjct: 186 ISRNSEAPLSDPASA--GGIDTPFPDDLDVPPIAASPPGPQEG 226
>gnl|CDD|233787 TIGR02223, ftsN, cell division protein FtsN. FtsN is a poorly
conserved protein active in cell division in a number of
Proteobacteria. The N-terminal 30 residue region tends
to by Lys/Arg-rich, and is followed by a
membrane-spanning region. This is followed by an acidic
low-complexity region of variable length and a
well-conserved C-terminal domain of two tandem regions
matched by pfam05036 (Sporulation related repeat), found
in several cell division and sporulation proteins. The
role of FtsN as a suppressor for other cell division
mutations is poorly understood; it may involve cell wall
hydrolysis [Cellular processes, Cell division].
Length = 298
Score = 30.0 bits (67), Expect = 1.9
Identities = 13/69 (18%), Positives = 21/69 (30%), Gaps = 3/69 (4%)
Query: 273 QVLDLYSQSQQQASQQDSPPQTQPEPST---KPPPQSHLPPSKLKPDLVAPPHSAIPTLG 329
Q + Q + A Q +P +T+ S Q LP + + P P
Sbjct: 152 QTAEKKPQKARTAEAQKTPVETEKIASKVKEAKQKQKALPKQTAETQSNSKPIETAPKAD 211
Query: 330 AQYPQRVAP 338
+ P
Sbjct: 212 KADKTKPKP 220
>gnl|CDD|235906 PRK07003, PRK07003, DNA polymerase III subunits gamma and tau;
Validated.
Length = 830
Score = 30.6 bits (69), Expect = 1.9
Identities = 16/83 (19%), Positives = 19/83 (22%), Gaps = 4/83 (4%)
Query: 276 DLYSQSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHS--AIPTLGAQYP 333
D + DS + P P P P P A A
Sbjct: 460 DSRCDERDAQPPADSGSASAPASDAPPDAAFEPAPRAAAPSAATPAAVPDARAPAAASRE 519
Query: 334 QRVAPP--PSGHYPNPYPAHPVP 354
A P+ P PA P
Sbjct: 520 DAPAAAAPPAPEARPPTPAAAAP 542
Score = 29.4 bits (66), Expect = 4.5
Identities = 16/101 (15%), Positives = 23/101 (22%), Gaps = 5/101 (4%)
Query: 285 ASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSGHY 344
A + + + L P A A T P APP +
Sbjct: 382 APGARAAAAVGASAVPAVTAVTGAAGAALAPKAAAA---AAATRAEAPPAAPAPPATADR 438
Query: 345 PNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQ 385
+ P+P A PPA +
Sbjct: 439 GDDAADGDAPVPAKANARASADSRCDERDAQ--PPADSGSA 477
Score = 28.7 bits (64), Expect = 7.4
Identities = 22/139 (15%), Positives = 31/139 (22%), Gaps = 30/139 (21%)
Query: 285 ASQQDSPPQTQPEPSTKPPPQSHLPPSKLK-PDLVAPPHSAIPTLGAQYPQRVAPP---- 339
A+ + +P P +T P+ D A P P AP
Sbjct: 488 AAFEPAPRAAAPSAATPAAVPDARAPAAASREDAPAAAAPPAPEARPPTPAAAAPAARAG 547
Query: 340 -------------------------PSGHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGH 374
+ A P P V P G
Sbjct: 548 GAAAALDVLRNAGMRVSSDRGARAAAAAKPAAAPAAAPKPAAPRVAVQVPTPRARAATGD 607
Query: 375 HRPPPAAGSTQQTQTRSCS 393
P AA + Q ++R
Sbjct: 608 APPNGAARAEQAAESRGAP 626
>gnl|CDD|220633 pfam10214, Rrn6, RNA polymerase I-specific transcription-initiation
factor. RNA polymerase I-specific
transcription-initiation factor Rrn6 and Rrn7 represent
components of a multisubunit transcription factor
essential for the initiation of rDNA transcription by
Pol I. These proteins are found in fungi.
Length = 753
Score = 30.5 bits (69), Expect = 2.0
Identities = 10/25 (40%), Positives = 11/25 (44%)
Query: 276 DLYSQSQQQASQQDSPPQTQPEPST 300
DL S Q S D P Q+Q E
Sbjct: 710 DLPPSSSQAFSLSDLPMQSQSESGL 734
>gnl|CDD|218035 pfam04344, CheZ, Chemotaxis phosphatase, CheZ. This family
represents the bacterial chemotaxis phosphatase, CheZ.
This protein forms a dimer characterized by a long
four-helix bundle, composed of two helices from each
monomer. CheZ dephosphorylates CheY in a reaction that
is essential to maintain a continuous chemotactic
response to environmental changes. It is thought that
CheZ's conserved residue Gln 147 orientates a water
molecule for nucleophilic attack at the CheY active
site.
Length = 205
Score = 29.8 bits (68), Expect = 2.0
Identities = 7/35 (20%), Positives = 17/35 (48%)
Query: 253 RWWEMFVEDMTMDLLEDICHQVLDLYSQSQQQASQ 287
RW + ++ ++ +++CH S ++ SQ
Sbjct: 80 RWDRLMAREIELEEFKELCHDTDQFLSDVEEDTSQ 114
>gnl|CDD|215448 PLN02834, PLN02834, 3-dehydroquinate synthase.
Length = 433
Score = 30.1 bits (68), Expect = 2.1
Identities = 13/57 (22%), Positives = 19/57 (33%), Gaps = 2/57 (3%)
Query: 286 SQQDSPPQTQPE-PSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPS 341
+ + + P PS Q+ S + DL H P GA + V S
Sbjct: 8 NSESNTPTVLSRSPSDAFFDQNSSIESSKEGDLTEVIHEKCPVSGANKSE-VTKTAS 63
>gnl|CDD|204405 pfam10146, zf-C4H2, Zinc finger-containing protein. This is a
family of proteins which appears to have a highly
conserved zinc finger domain at the C terminal end,
described as -C-X2-CH-X3-H-X5-C-X2-C-. The structure is
predicted to contain a coiled coil. Members are
annotated as being tumour-associated antigen HCA127 in
humans but this could not confirmed.
Length = 215
Score = 29.8 bits (67), Expect = 2.2
Identities = 22/111 (19%), Positives = 35/111 (31%), Gaps = 18/111 (16%)
Query: 258 FVEDMTMDLLEDICHQVLDLYSQSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDL 317
+ +M L ++ L+ + +Q+ S SP P P P +
Sbjct: 96 SINEMRKLEL--GLEELPQLHEEERQKISPATSPVPPVPLPDPPAFPST----------- 142
Query: 318 VAPPHSAIPTLGAQYPQRVAPPPSG--HYPNPYPAHPVPIPNIHPVHAVVP 366
P ++A Q QR P H P P P+ N H +
Sbjct: 143 -LPANAAAAAAAQQ--QRDVPQNLINRHRMPPSFQQPPPMKNCLSCHQQIH 190
>gnl|CDD|111097 pfam02166, Androgen_recep, Androgen receptor.
Length = 423
Score = 30.1 bits (67), Expect = 2.3
Identities = 22/62 (35%), Positives = 27/62 (43%), Gaps = 8/62 (12%)
Query: 280 QSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSK-LKPDLVAPP---HSAIPT----LGAQ 331
+QQ SQQ S + PE P P + SK L APP SA P+ LG
Sbjct: 85 DEEQQPSQQQSALECHPESGCVPEPGAATAASKGLPQQPPAPPDEDDSAAPSTLSLLGPT 144
Query: 332 YP 333
+P
Sbjct: 145 FP 146
>gnl|CDD|215194 PLN02340, PLN02340, endoglucanase.
Length = 614
Score = 30.2 bits (68), Expect = 2.4
Identities = 10/39 (25%), Positives = 19/39 (48%)
Query: 271 CHQVLDLYSQSQQQASQQDSPPQTQPEPSTKPPPQSHLP 309
++ L+++ Q + D+ +P P+T P QS P
Sbjct: 483 NAPLVGLFAKLQSASETADAYGSYKPTPNTSSPKQSGAP 521
>gnl|CDD|236138 PRK07994, PRK07994, DNA polymerase III subunits gamma and tau;
Validated.
Length = 647
Score = 30.2 bits (69), Expect = 2.4
Identities = 15/111 (13%), Positives = 24/111 (21%), Gaps = 22/111 (19%)
Query: 279 SQSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKL-----------------KPDLVAPP 321
+ + PP P P S+L K + A
Sbjct: 388 TAAVAPPQAPAVPPPPASAPQQAPAVPLPETTSQLLAARQQLQRAQGATKAKKSEPAAAS 447
Query: 322 -----HSAIPTLGAQYPQRVAPPPSGHYPNPYPAHPVPIPNIHPVHAVVPP 367
+SA+ L + P A + Y + P
Sbjct: 448 RARPVNSALERLASVRPAPSALEKAPAKKEAYRWKATNPVEVKKEPVATPK 498
Score = 28.7 bits (65), Expect = 7.0
Identities = 15/89 (16%), Positives = 28/89 (31%), Gaps = 7/89 (7%)
Query: 279 SQSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAP 338
+ QA+ + P+ PPP + P V P + L A+ + A
Sbjct: 377 PAASAQATAAPTAAVAPPQAPAVPPPPASAPQQ---APAVPLPETTSQLLAARQQLQRAQ 433
Query: 339 PPSGHYPNPYPAHPVPIPNIHPVHAVVPP 367
+ + P PV++ +
Sbjct: 434 GATK----AKKSEPAAASRARPVNSALER 458
>gnl|CDD|236940 PRK11633, PRK11633, cell division protein DedD; Provisional.
Length = 226
Score = 29.6 bits (67), Expect = 2.6
Identities = 24/86 (27%), Positives = 26/86 (30%), Gaps = 19/86 (22%)
Query: 291 PPQTQPEPSTKPPPQSHL------PPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSGHY 344
P TQ P+ P + L P VAPP T P V PP
Sbjct: 57 PAATQALPTQPPEGAAEAVRAGDAAAPSLDPATVAPP----NTPVEPEPAPVEPPKPKPV 112
Query: 345 PNPYP---------AHPVPIPNIHPV 361
P P A P P P PV
Sbjct: 113 EKPKPKPKPQQKVEAPPAPKPEPKPV 138
>gnl|CDD|236333 PRK08691, PRK08691, DNA polymerase III subunits gamma and tau;
Validated.
Length = 709
Score = 30.1 bits (67), Expect = 2.6
Identities = 17/96 (17%), Positives = 30/96 (31%), Gaps = 2/96 (2%)
Query: 284 QASQQDSPPQTQPEPSTKPPPQSHLPPSKLK--PDLVAPPHSAIPTLGAQYPQRVAPPPS 341
Q + P + P+ + + ++ + PP + + A + AP
Sbjct: 428 QENNDVPPWEDAPDEAQTAAGTAQTSAKSIQTASEAETPPENQVSKNKAADNETDAPLSE 487
Query: 342 GHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRP 377
NP A P A P +YG+ P
Sbjct: 488 VPSENPIQATPNDEAVETETFAHEAPAEPFYGYGFP 523
>gnl|CDD|139494 PRK13335, PRK13335, superantigen-like protein; Reviewed.
Length = 356
Score = 29.7 bits (66), Expect = 2.6
Identities = 15/55 (27%), Positives = 21/55 (38%), Gaps = 8/55 (14%)
Query: 280 QSQQQASQQDSPP-------QTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPT 327
Q +Q++ + P QT E ST P + PPS P + S P
Sbjct: 100 QEEQKSLNISATPAPKQEQSQTTTE-STTPKTKVTTPPSTNTPQPMQSTKSDTPQ 153
>gnl|CDD|222579 pfam14179, YppG, YppG-like protein. The YppG-like protein family
includes the B. subtilis YppG protein, which is
functionally uncharacterized. This family of proteins is
found in bacteria. Proteins in this family are typically
between 115 and 181 amino acids in length. There are two
completely conserved residues (F and G) that may be
functionally important.
Length = 110
Score = 28.5 bits (64), Expect = 2.7
Identities = 13/54 (24%), Positives = 14/54 (25%), Gaps = 3/54 (5%)
Query: 332 YPQRVAPPPSGHYPNPY--PAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGS 383
PQ P P P P P H +PP Q Y P
Sbjct: 11 PPQNQQQQPYQQQPYHQQMPPPPYSPPQQQQGHF-MPPQPQPYPKQSPQQQQPP 63
>gnl|CDD|223065 PHA03378, PHA03378, EBNA-3B; Provisional.
Length = 991
Score = 30.0 bits (67), Expect = 2.8
Identities = 32/117 (27%), Positives = 37/117 (31%), Gaps = 14/117 (11%)
Query: 283 QQASQQDSPPQTQPEPSTKP--PPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRV---A 337
Q A PP P P P PP P+ A P +A P A+ P A
Sbjct: 688 QWAPGTMQPPPRAPTPMRPPAAPPGRAQRPAAATGR--ARPPAAAPG-RARPPAAAPGRA 744
Query: 338 PPPSGHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQTRSCST 394
PP+ P P P A P Q PPP A Q + R T
Sbjct: 745 RPPAAAPGRARPPAAAPGRARPPAAAPGAPTPQ------PPPQAPPAPQQRPRGAPT 795
>gnl|CDD|219358 pfam07271, Cytadhesin_P30, Cytadhesin P30/P32. This family
consists of several Mycoplasma species specific
Cytadhesin P32 and P30 proteins. P30 has been found to
be membrane associated and localised on the tip
organelle. It is thought that it is important in
cytadherence and virulence.
Length = 279
Score = 29.6 bits (66), Expect = 3.0
Identities = 22/111 (19%), Positives = 35/111 (31%), Gaps = 9/111 (8%)
Query: 268 EDICHQVLDLYSQSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPT 327
E + Q+ + Q++QQA + D + + T+P P P
Sbjct: 110 EQMAEQLQRISEQNEQQAIEIDPTEEVNTQEPTQPAGV---------NVANNPQPQVQPQ 160
Query: 328 LGAQYPQRVAPPPSGHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPP 378
G QR+ P G P + P +PP G + P
Sbjct: 161 FGPNPQQRINPQRFGFPMQPNMGMRPGFNQMPPHMPGMPPNQMRPGFNPMP 211
>gnl|CDD|222010 pfam13254, DUF4045, Domain of unknown function (DUF4045). This
presumed domain is functionally uncharacterized. This
domain family is found in bacteria and eukaryotes, and
is typically between 384 and 430 amino acids in length.
Length = 414
Score = 29.8 bits (67), Expect = 3.2
Identities = 24/102 (23%), Positives = 34/102 (33%), Gaps = 9/102 (8%)
Query: 279 SQSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPT-LGAQYPQRVA 337
+ QQ+ S D+ + P+P P+S P K P P + + A
Sbjct: 239 EKPQQETSSMDTEKSSAPKPRETLDPKS---PEKAPPIDTTEEELKSPEASPKESEEASA 295
Query: 338 PPPSGHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPP 379
S +P P P P P + P S RP P
Sbjct: 296 RKRSPSLLSPSPKAESPKPLASPGKSPRDPLSP-----RPKP 332
>gnl|CDD|177614 PHA03377, PHA03377, EBNA-3C; Provisional.
Length = 1000
Score = 30.0 bits (67), Expect = 3.3
Identities = 39/163 (23%), Positives = 47/163 (28%), Gaps = 38/163 (23%)
Query: 279 SQSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAP 338
S+ +S + P + E P L S L PD PP P G + PQ
Sbjct: 701 SEESHLSSMSPTQPISHEEQPRYEDPDDPLDLS-LHPDQAPPPSHQAPYSGHEEPQAQQA 759
Query: 339 PPSGHY---PNPYPAHPVPIPNIHPVHAVVPPG---------------------SQYYGH 374
P G++ P P P V PG SQY GH
Sbjct: 760 PYPGYWEPRPPQAPYLGYQEPQAQGVQVSSYPGYAGPWGLRAQHPRYRHSWAYWSQYPGH 819
Query: 375 ------------HRPPPAAGSTQQTQTRSCSTTPLCHSYPGSP 405
H PP GS Q + L S G P
Sbjct: 820 GHPQGPWAPRPPHLPPQWDGSAGHGQDQVSQFPHL-QSETGPP 861
>gnl|CDD|237871 PRK14965, PRK14965, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 576
Score = 29.7 bits (67), Expect = 3.3
Identities = 19/71 (26%), Positives = 20/71 (28%), Gaps = 7/71 (9%)
Query: 321 PHSAIPTLGAQYPQRVAPPPSGHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPA 380
P GA P A PP P PA P P A PP + P
Sbjct: 382 PAPPSAAWGAPTPAAPAAPPPAAAPPVPPAAPARPAAARPAPAPAPPAA-------AAPP 434
Query: 381 AGSTQQTQTRS 391
A S S
Sbjct: 435 ARSADPAAAAS 445
>gnl|CDD|237030 PRK12270, kgd, alpha-ketoglutarate decarboxylase; Reviewed.
Length = 1228
Score = 29.9 bits (68), Expect = 3.3
Identities = 18/109 (16%), Positives = 24/109 (22%), Gaps = 13/109 (11%)
Query: 274 VLDLYSQSQQQASQQD----------SPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHS 323
V ++Y Q + D P T + + P AP
Sbjct: 11 VEEMYQQYLADPNSVDPSWREFFADYGPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAP 70
Query: 324 AIPTLGAQYPQRVAPPPSGHYPNPYP--AHPVPIPNIHPVHAVVPPGSQ 370
A A P P+ A P P A V
Sbjct: 71 APAPPAAA-APAAPPKPAAAAAAAAAPAAPPAAAAAAAPAAAAVEDEVT 118
Score = 29.1 bits (66), Expect = 5.3
Identities = 14/102 (13%), Positives = 23/102 (22%), Gaps = 10/102 (9%)
Query: 254 WWEMFVEDMTMDLLEDICHQVLDLYSQSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKL 313
W E F D + A+ + + P + PP+
Sbjct: 29 WREFF-ADYGPGSTAA---------PTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAA 78
Query: 314 KPDLVAPPHSAIPTLGAQYPQRVAPPPSGHYPNPYPAHPVPI 355
P P +A A A + P+
Sbjct: 79 APAAPPKPAAAAAAAAAPAAPPAAAAAAAPAAAAVEDEVTPL 120
>gnl|CDD|233154 TIGR00859, ENaC, sodium channel transporter. The Epithelial Na+
Channel (ENaC) Family (TC 1.A.06)The ENaC family
consists of sodium channels from animals and has no
recognizable homologues in other eukaryotes or bacteria.
The vertebrate ENaC proteins from epithelial cells
cluster tightly together on the phylogenetic tree:
voltage-insensitive ENaC homologues are also found in
the brain. Eleven sequenced C. elegans proteins,
including the degenerins, are distantly related to the
vertebrate proteins as well as to each other. At least
some ofthese proteins form part of a mechano-transducing
complex for touch sensitivity. Other members of the ENaC
family, the acid-sensing ion channels, ASIC1-3,are homo-
or hetero-oligomeric neuronal H+-gated channels that
mediate pain sensation in response to tissue acidosis.
The homologous Helix aspersa(FMRF-amide)-activated Na+
channel is the first peptide neurotransmitter-gated
ionotropic receptor to be sequenced.Mammalian ENaC is
important for the maintenance of Na+ balance and the
regulation of blood pressure. Three homologous ENaC
subunits, a, b and g, havebeen shown to assemble to form
the highly Na+-selective channel.This model is designed
from the vertebrate members of the ENaC family
[Transport and binding proteins, Cations and iron
carrying compounds].
Length = 595
Score = 29.7 bits (67), Expect = 3.5
Identities = 19/77 (24%), Positives = 32/77 (41%), Gaps = 6/77 (7%)
Query: 266 LLEDICHQVLDLYSQSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAI 325
+++ I +L L + ++ ++ PP +P P P S P L+ D SA+
Sbjct: 520 IIDLIFITLLRLLWRFRKWWQRRRGPPYAEP-----PEPVSADTPPSLQLDDPPTFPSAL 574
Query: 326 PTLGAQYPQ-RVAPPPS 341
P A PPP+
Sbjct: 575 PLPHASGLSLPGTPPPN 591
>gnl|CDD|235161 PRK03814, PRK03814, oxaloacetate decarboxylase subunit gamma;
Provisional.
Length = 85
Score = 27.4 bits (61), Expect = 3.6
Identities = 15/48 (31%), Positives = 18/48 (37%), Gaps = 6/48 (12%)
Query: 292 PQTQPEPSTKPPPQSHLPP--SKLKPDLVAPPHSAIPTLGAQYPQRVA 337
PQ PEP P P S + P +VA AI Q+ A
Sbjct: 41 PQEVPEPIAAPKKSKKQQPTSSAVSPQVVA----AISAAVHQHRASKA 84
>gnl|CDD|233508 TIGR01649, hnRNP-L_PTB, hnRNP-L/PTB/hephaestus splicing factor
family. Included in this family of heterogeneous
ribonucleoproteins are PTB (polypyrimidine tract binding
protein ) and hnRNP-L. These proteins contain four RNA
recognition motifs (rrm: pfam00067).
Length = 481
Score = 29.4 bits (66), Expect = 3.7
Identities = 18/92 (19%), Positives = 26/92 (28%)
Query: 288 QDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSGHYPNP 347
DS T P+ + P + +P L+ S+ G P +G
Sbjct: 183 DDSRDYTNPDLPGRRDPGLDQTHRQRQPALLGQHPSSYGHDGYSSHGGPLAPLAGGDRMG 242
Query: 348 YPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPP 379
P P A + P YG P
Sbjct: 243 PPHGPPSRYRPAYEAAPLAPAISSYGPAGGGP 274
>gnl|CDD|222095 pfam13388, DUF4106, Protein of unknown function (DUF4106). This
family of proteins are found in large numbers in the
Trichomonas vaginalis proteome. The function of this
protein is unknown.
Length = 422
Score = 29.3 bits (65), Expect = 4.0
Identities = 9/62 (14%), Positives = 17/62 (27%), Gaps = 4/62 (6%)
Query: 291 PPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSGHYPNPYPA 350
P+ +P+ + P Q + + P + Q P+ P
Sbjct: 200 APKPTQQPTVQNPAQQPTVQNPAQQPQQQPQQQPVQ----PAQQPTPQNPAQQPPQTEQG 255
Query: 351 HP 352
H
Sbjct: 256 HK 257
Score = 28.9 bits (64), Expect = 6.2
Identities = 9/31 (29%), Positives = 10/31 (32%)
Query: 280 QSQQQASQQDSPPQTQPEPSTKPPPQSHLPP 310
QQ QQ QP P + PP
Sbjct: 220 NPAQQPQQQPQQQPVQPAQQPTPQNPAQQPP 250
Score = 28.5 bits (63), Expect = 7.7
Identities = 8/28 (28%), Positives = 13/28 (46%)
Query: 283 QQASQQDSPPQTQPEPSTKPPPQSHLPP 310
QQ + Q+ Q Q +P +P + P
Sbjct: 214 QQPTVQNPAQQPQQQPQQQPVQPAQQPT 241
>gnl|CDD|235585 PRK05733, PRK05733, single-stranded DNA-binding protein;
Provisional.
Length = 172
Score = 28.4 bits (63), Expect = 4.4
Identities = 24/86 (27%), Positives = 32/86 (37%), Gaps = 11/86 (12%)
Query: 231 YLAGKLSKFEVVDWIGRNSKHLRWWEMFVEDM--TMDLL----EDICHQVLDLYSQSQQQ 284
Y+ GKL E W K R+ V DM TM LL + + +Q
Sbjct: 78 YIEGKLQTRE---WEKDGIK--RYTTEIVVDMQGTMQLLGGRPQGDDQGGQGGGNYNQSA 132
Query: 285 ASQQDSPPQTQPEPSTKPPPQSHLPP 310
QQ PQ + ++P PQ P
Sbjct: 133 PRQQAQRPQQAAQQQSRPAPQQPAPQ 158
Score = 28.0 bits (62), Expect = 6.1
Identities = 14/53 (26%), Positives = 21/53 (39%), Gaps = 1/53 (1%)
Query: 273 QVLDLYSQSQQQASQ-QDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSA 324
Q+L Q Q Q + Q+ P + P Q+ S+ P AP +A
Sbjct: 109 QLLGGRPQGDDQGGQGGGNYNQSAPRQQAQRPQQAAQQQSRPAPQQPAPQPAA 161
>gnl|CDD|218735 pfam05760, IER, Immediate early response protein (IER). This
family consists of several eukaryotic immediate early
response (IER) 2 and 5 proteins. The role of IER5 is
unclear although it play an important role in mediating
the cellular response to mitogenic signals. Again,
little is known about the function of IER2 although it
is thought to play a role in mediating the cellular
responses to a variety of extracellular signals.
Length = 272
Score = 28.7 bits (64), Expect = 4.7
Identities = 21/121 (17%), Positives = 26/121 (21%), Gaps = 10/121 (8%)
Query: 292 PQTQPEPSTKPPPQSHL-PPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSGHYPNPYPA 350
Q P + P L PPS +P L A S P+ P P + P +
Sbjct: 62 EQPMVPPPEQQPGAPRLHPPSGAEPPLPARAAS---------PEEEQPAPEPAFYRPSVS 112
Query: 351 HPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQTRSCSTTPLCHSYPGSPISTTC 410
P P G T P C
Sbjct: 113 EGFPASGERPDATSTGSGQCSQTTVLDLDTHVVTTVESPYLHQDCPCQAEDGQPAPPAVC 172
Query: 411 S 411
Sbjct: 173 K 173
>gnl|CDD|183854 PRK13042, PRK13042, superantigen-like protein; Reviewed.
Length = 291
Score = 28.8 bits (64), Expect = 4.8
Identities = 14/58 (24%), Positives = 24/58 (41%), Gaps = 1/58 (1%)
Query: 284 QASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPS 341
QA+ +P T+ E PP + + + KP+ PP + + Q P P +
Sbjct: 26 QAANATTPSSTKVEAPQSTPPSTKVEAPQSKPNATTPPSTKVEA-PQQTPNATTPSST 82
Score = 28.1 bits (62), Expect = 7.9
Identities = 11/47 (23%), Positives = 21/47 (44%), Gaps = 5/47 (10%)
Query: 286 SQQDSPPQTQPE-----PSTKPPPQSHLPPSKLKPDLVAPPHSAIPT 327
+ Q +PP T+ E P+ PP + + + P+ P + + T
Sbjct: 40 APQSTPPSTKVEAPQSKPNATTPPSTKVEAPQQTPNATTPSSTKVET 86
>gnl|CDD|184918 PRK14954, PRK14954, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 620
Score = 29.1 bits (65), Expect = 4.9
Identities = 18/65 (27%), Positives = 22/65 (33%), Gaps = 5/65 (7%)
Query: 295 QPEPSTKPPPQSHLPPSKLK-PDLVAPPHSAIPTLGAQYPQRVAPPPSGHYPNPYP--AH 351
P P+ P + P L PD P A P P + P S P P A
Sbjct: 381 APSPAGSPDVKKKAPEPDLPQPDRHPGP--AKPEAPGARPAELPSPASAPTPEQQPPVAR 438
Query: 352 PVPIP 356
P+P
Sbjct: 439 SAPLP 443
>gnl|CDD|222456 pfam13925, Katanin_con80, con80 domain of Katanin. The con80
domain of katanin is the C-terminal region of the
protein that binds to the N-terminal domain of
katanin-p60, the catalytic ATPase. The complex
associates with a specific subregion of the mitotic
spindle leading to increased microtubule disassembly and
targeting of p60 to the spindle poles. The assembly and
function of the mitotic spindle requires the activity of
a number of microtubule-binding proteins. Katanin, a
heterodimeric microtubule-severing ATPase, is found
localized at mitotic spindle poles. A proposed model is
that katanin is targeted to spindle poles through a
combination of direct microtubule binding by the p60
subunit and through interactions between the WD40 domain
and an unknown protein.
Length = 164
Score = 28.4 bits (64), Expect = 5.1
Identities = 4/23 (17%), Positives = 11/23 (47%)
Query: 48 KDIRNSPSFREGIDYETEQRYRR 70
+ ++ G+D E+R ++
Sbjct: 105 RSALSAAPSSVGVDLSREERLQK 127
>gnl|CDD|237015 PRK11901, PRK11901, hypothetical protein; Reviewed.
Length = 327
Score = 28.9 bits (65), Expect = 5.5
Identities = 21/98 (21%), Positives = 26/98 (26%), Gaps = 12/98 (12%)
Query: 281 SQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPP 340
SQQQ + Q ST P + + PSK P Q P P
Sbjct: 154 SQQQGQVNAASQNAQGNTSTLPTAPATVAPSKGAKVPATAETHPTP---PQKPATKKPAV 210
Query: 341 SGHYPNPYPAHPVPIPNIHPVHAVVP-------PGSQY 371
+ H P + P P S Y
Sbjct: 211 NHH-KTA-TVAVPPATSGKPKSGAASARALSSAPASHY 246
Score = 28.1 bits (63), Expect = 8.0
Identities = 17/67 (25%), Positives = 22/67 (32%), Gaps = 7/67 (10%)
Query: 276 DLYSQSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQR 335
DL S + Q SP S + + D+ APP S PT A
Sbjct: 80 DLSGSSSLSSGNQSSPSAANN-TSDGHDASGVKNTAPPQ-DISAPPISPTPTQAAP---- 133
Query: 336 VAPPPSG 342
P+G
Sbjct: 134 -PQTPNG 139
>gnl|CDD|135173 PRK04654, PRK04654, sec-independent translocase; Provisional.
Length = 214
Score = 28.6 bits (63), Expect = 5.7
Identities = 20/71 (28%), Positives = 29/71 (40%), Gaps = 2/71 (2%)
Query: 290 SPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSGHYPNPYP 349
+P Q +P P Q+ +P +V PH+A P VAP +G +P P
Sbjct: 146 APVIAQAQPIAPAPHQTLVPAPH--DTIVPAPHAAHLPSAPATPVSVAPVDAGTSASPTP 203
Query: 350 AHPVPIPNIHP 360
+ P I P
Sbjct: 204 SEPTKIQEKQP 214
>gnl|CDD|215182 PLN02321, PLN02321, 2-isopropylmalate synthase.
Length = 632
Score = 28.8 bits (64), Expect = 5.9
Identities = 11/69 (15%), Positives = 16/69 (23%), Gaps = 4/69 (5%)
Query: 290 SPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPP-SGHYPNPY 348
SP ++ + P S + A P P
Sbjct: 14 SPAKSLSAFTPAPTRSSASSARFPAFLARPAAARSPSLASRASSALAASPSRPQVARRPR 73
Query: 349 PAHPVPIPN 357
P + IPN
Sbjct: 74 PEY---IPN 79
>gnl|CDD|165527 PHA03269, PHA03269, envelope glycoprotein C; Provisional.
Length = 566
Score = 28.9 bits (64), Expect = 6.0
Identities = 18/97 (18%), Positives = 24/97 (24%), Gaps = 20/97 (20%)
Query: 294 TQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSGHYPNPYPAHPV 353
+ P P+ H + KPD PH A ++ P P S P A
Sbjct: 19 IANLNTNIPIPELHTSAATQKPDPAPAPHQA----ASRAPDPAVAPTSAASRKPDLA--- 71
Query: 354 PIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQTR 390
P + PA Q
Sbjct: 72 -----------QAPTPAAS--EKFDPAPAPHQAASRA 95
>gnl|CDD|220840 pfam10667, DUF2486, Protein of unknown function (DUF2486). This
family is made up of members from various Burkholderia
spp. The function is unknown.
Length = 245
Score = 28.3 bits (63), Expect = 6.0
Identities = 21/116 (18%), Positives = 28/116 (24%), Gaps = 23/116 (19%)
Query: 292 PQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPP----------S 341
Q P P + P A+ + + VA P +
Sbjct: 45 DQIVPGAEQAASAA---PVHAAREATADPEFVAVEPVPTPHVPAVALPGDTDAPAEPGAA 101
Query: 342 GHYP-NPYPAHPVPIPNIHPVH--AVVPPGSQYY---GHHRP---PPAAGSTQQTQ 388
H A P+P+ PP G P PPAAG
Sbjct: 102 PHVVAERAAAMQAPLPS-ALAADDPQAPPAGATAADAGDAAPDATPPAAGDASPPA 156
>gnl|CDD|173815 cd07490, Peptidases_S8_6, Peptidase S8 family domain,
uncharacterized subfamily 6. This family is a member of
the Peptidases S8 or Subtilases serine endo- and
exo-peptidase clan. They have an Asp/His/Ser catalytic
triad similar to that found in trypsin-like proteases,
but do not share their three-dimensional structure and
are not homologous to trypsin. The stability of
subtilases may be enhanced by calcium, some members have
been shown to bind up to 4 ions via binding sites with
different affinity. Some members of this clan contain
disulfide bonds. These enzymes can be intra- and
extracellular, some function at extreme temperatures and
pH values.
Length = 254
Score = 28.3 bits (63), Expect = 6.2
Identities = 11/42 (26%), Positives = 12/42 (28%), Gaps = 9/42 (21%)
Query: 304 PQSHLPPSKLKPDLVAPPHSAIPTLGAQ-YPQRVAPPPSGHY 344
P KPD+ AP G Y R G Y
Sbjct: 180 APDSPPDEYTKPDVAAP--------GVDVYSARQGANGDGQY 213
>gnl|CDD|221179 pfam11711, Tim54, Inner membrane protein import complex subunit
Tim54. Mitochondrial function depends on the import of
hundreds of different proteins synthesised in the
cytosol. Protein import is a multi-step pathway which
includes the binding of precursor proteins to surface
receptors, translocation of the precursor across one or
both mitochondrial membranes, and folding and assembly
of the imported protein inside the mitochondrion. Most
precursor proteins carry amino-terminal targeting
signals, called pre-sequences, and are imported into
mitochondria via import complexes located in both the
outer and the inner membrane (IM). The IM complex, TIM,
is made up of at least two proteins which mediate
translocation of proteins into the matrix by removing
their signal peptide and another pair of proteins, Tim54
and Tim22, that insert the polytopic proteins, that
carry internal targetting information, into the inner
membrane.
Length = 377
Score = 28.6 bits (64), Expect = 6.5
Identities = 18/81 (22%), Positives = 31/81 (38%), Gaps = 11/81 (13%)
Query: 279 SQSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAP 338
++ A + +T PE + P + + P K V P+ + +YP P
Sbjct: 218 PAAESPAEPAEETAETTPEETEDAPEEENNKPVKPP---VPKPYIS----PDEYPSAPLP 270
Query: 339 PPSGHYPNPY-PAHPVPIPNI 358
P P P+ +P PN+
Sbjct: 271 P---ELPQLLQPSLVIPFPNL 288
>gnl|CDD|114709 pfam06003, SMN, Survival motor neuron protein (SMN). This family
consists of several eukaryotic survival motor neuron
(SMN) proteins. The Survival of Motor Neurons (SMN)
protein, the product of the spinal muscular
atrophy-determining gene, is part of a large
macromolecular complex (SMN complex) that functions in
the assembly of spliceosomal small nuclear
ribonucleoproteins (snRNPs). The SMN complex functions
as a specificity factor essential for the efficient
assembly of Sm proteins on U snRNAs and likely protects
cells from illicit, and potentially deleterious,
non-specific binding of Sm proteins to RNAs.
Length = 264
Score = 28.4 bits (63), Expect = 6.5
Identities = 19/89 (21%), Positives = 30/89 (33%), Gaps = 1/89 (1%)
Query: 269 DICHQVLDLYSQSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTL 328
D+ L + ++ ++S +S + PPS P P P
Sbjct: 124 DVDEDALKEANVNETESSTDESDRSSHSHEVRSKSNFPMGPPSPWNPRFPPGPPPPPPGF 183
Query: 329 GAQYPQ-RVAPPPSGHYPNPYPAHPVPIP 356
G + PP +P P+P P IP
Sbjct: 184 GRHGEKPSGWPPFLSGWPPPFPLGPPMIP 212
>gnl|CDD|144451 pfam00859, CTF_NFI, CTF/NF-I family transcription modulation
region.
Length = 295
Score = 28.5 bits (63), Expect = 6.7
Identities = 31/136 (22%), Positives = 43/136 (31%), Gaps = 14/136 (10%)
Query: 281 SQQQASQQDSPPQTQPEPSTKPP-----PQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQR 335
S + D + P P P Q H P + A PH L
Sbjct: 105 SSPTMKKPDKSLFSSPSPQDSSPRLSAFTQHHRPVITGHSGISASPHPTPSPLHFPTSPI 164
Query: 336 VAPPPSGHYPNP---YPAHPVPIPNIHP-VHAVVPPGSQYYGHHRPPPAAGSTQQTQTRS 391
+ PS ++P+ YP H P + V V P SQ + GS Q
Sbjct: 165 LPQQPSSYFPHTAIRYPPHLHPQDPLKEFVQLVCDPSSQ-----QAGQPNGSGQGKVPNH 219
Query: 392 CSTTPLCHSYPGSPIS 407
TP+ P P++
Sbjct: 220 FLPTPMLAPPPPPPMA 235
>gnl|CDD|185628 PTZ00449, PTZ00449, 104 kDa microneme/rhoptry antigen; Provisional.
Length = 943
Score = 28.9 bits (64), Expect = 6.7
Identities = 17/65 (26%), Positives = 22/65 (33%), Gaps = 1/65 (1%)
Query: 290 SPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSGHYPNPYP 349
S + S K P +P S +P+ P P P+R P P P P
Sbjct: 602 SAQRPTRPKSPKLPELLDIPKSPKRPESPKSPKRPPPPQRPSSPERPEGPKIIKSPKP-P 660
Query: 350 AHPVP 354
P P
Sbjct: 661 KSPKP 665
>gnl|CDD|221501 pfam12273, RCR, Chitin synthesis regulation, resistance to Congo
red. RCR proteins are ER membrane proteins that
regulate chitin deposition in fungal cell walls.
Although chitin, a linear polymer of beta-1,4-linked
N-acetylglucosamine, constitutes only 2% of the cell
wall it plays a vital role in the overall protection of
the cell wall against stress, noxious chemicals and
osmotic pressure changes. Congo red is a cell
wall-disrupting benzidine-type dye extensively used in
many cell wall mutant studies that specifically targets
chitin in yeast cells and inhibits growth. RCR proteins
render the yeasts resistant to Congo red by diminishing
the content of chitin in the cell wall. RCR proteins are
probably regulating chitin synthase III interact
directly with ubiquitin ligase Rsp5, and the VPEY motif
is necessary for this, via interaction with the WW
domains of Rsp5.
Length = 124
Score = 27.4 bits (61), Expect = 7.0
Identities = 19/86 (22%), Positives = 27/86 (31%), Gaps = 22/86 (25%)
Query: 278 YSQSQQQASQQDSPPQTQPEP----------------STKPPPQSHLPPSKLKPDLVAPP 321
Y QSQQQ + Q P + P P +L+P PP
Sbjct: 45 YRQSQQQYNNTPQGRPPQYVPPYTETANENDLGYYDGQGEFHPNPKAPAIELQP----PP 100
Query: 322 HSAIPTLGAQYPQRVAPPPSGHYPNP 347
++ + PP+G P P
Sbjct: 101 NAYERGTRSPTTDDEYQPPAG--PPP 124
>gnl|CDD|236394 PRK09169, PRK09169, hypothetical protein; Validated.
Length = 2316
Score = 28.9 bits (65), Expect = 7.1
Identities = 22/86 (25%), Positives = 33/86 (38%), Gaps = 16/86 (18%)
Query: 277 LYSQSQQQASQQDSPPQTQPEPSTKPPPQSHLPP--SKLKPDL-----VAP-PHSAIPTL 328
L++Q + A+ P P P+ PP L ++ P L A P + + L
Sbjct: 1685 LHAQLEALAAA----PANTPSPAAPPPA---LSAALAQRFPPLAEAYAQAGTPRARLAAL 1737
Query: 329 GAQYPQRVAPPPSGHYPNPYPAHPVP 354
YP P P +P P + VP
Sbjct: 1738 VDHYPVARLPHPDEQFPEGQPGY-VP 1762
>gnl|CDD|218610 pfam05498, RALF, Rapid ALkalinization Factor (RALF). RALF, a 5-kDa
ubiquitous polypeptide in plants, arrests root growth
and development.
Length = 66
Score = 26.2 bits (58), Expect = 7.8
Identities = 11/44 (25%), Positives = 13/44 (29%), Gaps = 8/44 (18%)
Query: 356 PNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQTRSCSTTPLCH 399
N P G+ YY PA +R CS C
Sbjct: 30 RNSVPCS---RRGASYYNCKPGAPA-----NPYSRGCSAITRCR 65
>gnl|CDD|225657 COG3115, ZipA, Cell division protein [Cell division and chromosome
partitioning].
Length = 324
Score = 28.3 bits (63), Expect = 7.9
Identities = 23/96 (23%), Positives = 28/96 (29%), Gaps = 10/96 (10%)
Query: 282 QQQASQQDSPPQTQPEPSTKPP--PQSHLPPSKLKPDLVAPPHSA-IPTLG--AQYPQRV 336
Q + S P P P +H P++ D PP A P L A PQ V
Sbjct: 88 HQYQPEYASAQIKIPVPQPPQISDPPAHPQPTQPALDQEQPPEEARQPVLPQEAPAPQPV 147
Query: 337 A--PPPSG---HYPNPYPAHPVPIPNIHPVHAVVPP 367
P P P + P V P
Sbjct: 148 HSAAPQPAVQTVQPAVPEQQVQPEEVVEPAPEVKRP 183
>gnl|CDD|234683 PRK00191, tatA, twin arginine translocase protein A; Provisional.
Length = 84
Score = 26.5 bits (58), Expect = 8.1
Identities = 24/87 (27%), Positives = 39/87 (44%), Gaps = 13/87 (14%)
Query: 220 WEPEIIAVALMYLAGKLSKFEVVDWIGRNSKHLRWWEMFVEDMTMDLLEDICHQVLDLYS 279
WE II + ++ L G + IGR+ +R ++ V++M+ D
Sbjct: 6 WEIGIIVLLIIVLFGAKKLPDAARSIGRS---MRIFKSEVKEMSKDD---------QTPE 53
Query: 280 QSQQQASQQDSPPQT-QPEPSTKPPPQ 305
Q QQQ QQ +P Q P+P+ + Q
Sbjct: 54 QQQQQPQQQIAPNQIEAPQPNFQQHYQ 80
>gnl|CDD|218116 pfam04503, SSDP, Single-stranded DNA binding protein, SSDP. This
is a family of eukaryotic single-stranded DNA binding
proteins with specificity to a pyrimidine-rich element
found in the promoter region of the alpha2(I) collagen
gene.
Length = 293
Score = 28.1 bits (62), Expect = 8.1
Identities = 18/76 (23%), Positives = 23/76 (30%), Gaps = 3/76 (3%)
Query: 330 AQYPQRVAPPPSGHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQT 389
A + A PS N P +P P G Q+ P + Q T
Sbjct: 8 AFHDYSAAAAPSPVLGNMPPGDGMPQGPDPPGFFQGAGGKQHQQKKTPQSGSTPQMQNTT 67
Query: 390 RSCSTTPLCHSYPGSP 405
+P YPG P
Sbjct: 68 SQPFMSP---RYPGGP 80
>gnl|CDD|215915 pfam00429, TLV_coat, ENV polyprotein (coat polyprotein).
Length = 560
Score = 28.2 bits (63), Expect = 8.4
Identities = 27/133 (20%), Positives = 41/133 (30%), Gaps = 28/133 (21%)
Query: 288 QDSPPQTQPEPSTKPPPQSHLPPSKL-----KPDLVAPPHSAI----PTL---------- 328
+D P +QP P PP P DL+ + + P+L
Sbjct: 208 RDQKPPSQPHP-LAPPLPRPTSPPGPGTGDRLLDLLQGAYLTLNATNPSLTQDCWLCLVS 266
Query: 329 GAQYPQRVAPPPSGHYPNPYPAHPVPIPNIHPVHAVVPPGSQYYGHHR---PPPAAGSTQ 385
G Y + + P+G N A P ++ P A+ G P
Sbjct: 267 GPPYYEGI--APNGSVSNHTSA---PACSLGPQLALTLSEVTGQGCCIGGIPVGHQALCN 321
Query: 386 QTQTRSCSTTPLC 398
T + S + LC
Sbjct: 322 TTVSTSSGSHYLC 334
>gnl|CDD|215544 PLN03029, PLN03029, type-a response regulator protein; Provisional.
Length = 222
Score = 28.1 bits (62), Expect = 8.7
Identities = 16/62 (25%), Positives = 27/62 (43%), Gaps = 6/62 (9%)
Query: 272 HQVLDLYSQSQQQASQQDSPPQTQPEPSTKPPPQSHLPPSK-----LKPDLVAPPHSAIP 326
+ L+ ++ Q PQ+QP+P + P Q + K L PD P ++ I
Sbjct: 161 QEKLEESEIQSEKQEQPSQQPQSQPQPQQQ-PQQPNNNKRKAMEEGLSPDRTRPRYNGIT 219
Query: 327 TL 328
T+
Sbjct: 220 TV 221
>gnl|CDD|234630 PRK00095, mutL, DNA mismatch repair protein; Reviewed.
Length = 617
Score = 28.3 bits (64), Expect = 9.0
Identities = 15/118 (12%), Positives = 23/118 (19%), Gaps = 13/118 (11%)
Query: 264 MDLLEDICHQVL-------------DLYSQSQQQASQQDSPPQTQPEPSTKPPPQSHLPP 310
DL+ + L + ++ + P S S PP
Sbjct: 314 HDLIVQAIQEALAQSGLIPAAAGANQVLEPAEPEPLPLQQTPLYASGSSPPASSPSSAPP 373
Query: 311 SKLKPDLVAPPHSAIPTLGAQYPQRVAPPPSGHYPNPYPAHPVPIPNIHPVHAVVPPG 368
+ + P A S PA P G
Sbjct: 374 EQSEESQEESSAEKNPLQPNASQSEAAAAASAEAAAAAPAAAPEPAEAAEEADSFPLG 431
>gnl|CDD|217197 pfam02711, Pap_E4, E4 protein. This is is a family of
Papillomavirus proteins, E4, coded for by ORF4. A splice
variant, E1--E4, exists but neither the function of E4
or E1--E4 is known.
Length = 95
Score = 26.5 bits (59), Expect = 9.0
Identities = 18/80 (22%), Positives = 25/80 (31%), Gaps = 11/80 (13%)
Query: 339 PPSGHYP-----NPYPAHPVPIPNIHPVHAVVPPGSQYYGHHRPPPAAGSTQQTQTRSCS 393
P + YP + Y P P P P P + R + S+ T S
Sbjct: 5 PVTPKYPLLDLLHTYTTPPRPPP--PPP----PWAPKKRCRRRLESDSDSSSTTSPSSPP 58
Query: 394 TTPLCHSYPGSPISTTCSTT 413
T P + +T S T
Sbjct: 59 TPQPTTECPWTVTTTGSSVT 78
>gnl|CDD|223066 PHA03379, PHA03379, EBNA-3A; Provisional.
Length = 935
Score = 28.5 bits (63), Expect = 9.6
Identities = 19/82 (23%), Positives = 25/82 (30%), Gaps = 5/82 (6%)
Query: 288 QDSPPQTQPEPSTKPPPQSHLPPSKLKPDLVAPPHSAIPTLGAQYPQRVAPP---PSGHY 344
PP P +P Q +P P A P + P +
Sbjct: 468 AQLPPG--PLQDLEPGDQLPGVVQDGRPACAPVPAPAGPIVRPWEASLSQVPGVAFAPVM 525
Query: 345 PNPYPAHPVPIPNIHPVHAVVP 366
P P P PVP+P + V P
Sbjct: 526 PQPMPVEPVPVPTVALERPVCP 547
>gnl|CDD|221818 pfam12868, DUF3824, Domain of unknwon function (DUF3824). This is
a repeating domain found in fungal proteins. It is
proline-rich, and the function is not known.
Length = 135
Score = 27.2 bits (60), Expect = 9.9
Identities = 14/42 (33%), Positives = 16/42 (38%), Gaps = 1/42 (2%)
Query: 338 PPPSGHYPNPYPAH-PVPIPNIHPVHAVVPPGSQYYGHHRPP 378
PPP G P P P P P +P P Q Y + P
Sbjct: 78 PPPPGSTPVPPPGPQPGYNPADYPPPPGAVPPPQNYPYPPGP 119
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.320 0.136 0.432
Gapped
Lambda K H
0.267 0.0647 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 21,395,639
Number of extensions: 2058749
Number of successful extensions: 3904
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3020
Number of HSP's successfully gapped: 349
Length of query: 414
Length of database: 10,937,602
Length adjustment: 99
Effective length of query: 315
Effective length of database: 6,546,556
Effective search space: 2062165140
Effective search space used: 2062165140
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 60 (27.1 bits)