RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy14280
(1891 letters)
>gnl|CDD|227382 COG5049, XRN1, 5'-3' exonuclease [DNA replication, recombination,
and repair / Cell division and chromosome partitioning /
Translation].
Length = 953
Score = 640 bits (1651), Expect = 0.0
Identities = 319/889 (35%), Positives = 451/889 (50%), Gaps = 166/889 (18%)
Query: 1 MGVPKFFRYISERYPCLSEVVREYQIPEFDNLYLDMNGIIHNCSHPNDNDPHFRITEKQI 60
MGVP FFR++SERYP + +++ E QIPEFDNLYLDMNGI+HNC+HPND P TE+++
Sbjct: 1 MGVPSFFRWLSERYPKIIQLIEEKQIPEFDNLYLDMNGILHNCTHPNDGSP--PETEEEM 58
Query: 61 FQDIFHYIEILFRMIQPKKLFFMAIDGVAPRAKMNQQRGRRFRSAKEAEVLEQKAKEKGE 120
++ +F YI+ + I+P+KL +MA+DGVAPRAKMNQQR RRFRSAK+A KA+ GE
Sbjct: 59 YKAVFEYIDHILLKIRPRKLLYMAVDGVAPRAKMNQQRARRFRSAKDASAAALKAEPNGE 118
Query: 121 RLPTEKRFDFITLKFCFPKKLFFMAIDGVAPRAKMNQQRGRRFRSAKEAEVLEQKAKEKG 180
+P EK ++ E
Sbjct: 119 EIPEEK----------------------------------------------DEIGNEID 132
Query: 181 ERLPTEKRFDSNCITPGTPFMCRLHQQLRYFVKLKITTDKLWKDTLSLWVWKDVCVILSG 240
+K+FDSNCITPGTPFM RL + LRY++ K+++D W+++ +I SG
Sbjct: 133 TIDVEKKKFDSNCITPGTPFMERLAKVLRYYIHCKLSSDP---------EWRNLRIIFSG 183
Query: 241 HETPGEGEHKIMDYIRYMKSKPDYNPNMRHCLYGLDADLIMLGLCTHEPYFSLLREEVKF 300
H PGEGEHKIM++IR K++P YNPN RHC+YGLDADLIMLGL THEP+F +LRE+V F
Sbjct: 184 HLVPGEGEHKIMNFIRSQKAQPSYNPNTRHCIYGLDADLIMLGLSTHEPHFLILREDVFF 243
Query: 301 ---NTKSNKGGSQIVT---------PEETKFFLLHLSLMRDYLELEFGKLKEPGVLSFPF 348
+ + K T F+LLH+SL+R+YLE EF +EP L F F
Sbjct: 244 GSKSRRKRKCTKCGRTGHSDEECKVLTHQPFYLLHISLLREYLEREF---REPT-LPFTF 299
Query: 349 SIENIIDDWVLMGFLIGNDFIPNLPDLHIINGALPLLYKVYQEVLP------TLDGYIN- 401
+E I+DDW+ + F +GNDF+P+LP L I GA+ L +++++ LP T DG IN
Sbjct: 300 DLERILDDWIFLCFFVGNDFLPHLPCLDIREGAIETLTEIWKKSLPHMKGYITCDGVINL 359
Query: 402 ---EGGYLNLKRFEKFLTK------------LASFDFEQFREKYAD--------IQYLEG 438
E L FE + K L F + R++ + L G
Sbjct: 360 ARLEVILAILGSFEDDIFKKDHIQEERKNESLERFSLRKERKEGLKGMPRVVYEQKKLIG 419
Query: 439 KMGKKFSDVFHKEKS----KEMDPDLAALIKATDD------------------------D 470
+ D ++KS E D AL K D
Sbjct: 420 SIKPTLMDQLQEKKSPDLPDEEFIDTLALPKDLDMKNHELFLKRFANDLGLSISKAIKSK 479
Query: 471 YDDGTEEVLNYANYEEYYYSDEDE----------------------EWRTNFFSEFGQHK 508
+ E + + +E E E E F K
Sbjct: 480 GNYSLEMDIASDSPDEDEEEFESEVDSIRKIPDKYVNIIVEEEEENETEKTVNLRFPGWK 539
Query: 509 TDYYHYKLDYAKVTNEVLRDQAECYIRAIQWNLHYYYNGCVSWSWYYPHHYAPYISDIKD 568
YY KL + + E +RD A+ Y+ +QW L YYY GC SW WYYP+HYAP +D+
Sbjct: 540 ERYYTSKLHFTTDSEEKIRDMAKEYVEGLQWVLSYYYRGCPSWDWYYPYHYAPLAADLSK 599
Query: 569 FSHLNLKYEMGEPFLPFQQLLAVLPAASKELLPEPYQNLMTLENSPILEYYPEEFATDLN 628
S ++K+E+G PF PF+QL+AVLPA SK L+PE ++ LM E SPI+++YPEEF D+N
Sbjct: 600 LSDNDIKFELGTPFRPFEQLMAVLPARSKNLVPEGFRPLMDDEKSPIIDFYPEEFKLDMN 659
Query: 629 GKKQDWEAVVLIPFIDEVKLIKAMDAVEHLLTDEERSRNRHGPMLVYTYS-NVNLGTALA 687
GK W+AVVL+PFIDE +L+ A+ L++EER RN G L+++ + +L
Sbjct: 660 GKTASWQAVVLLPFIDERRLLSAVAVKYPTLSEEERKRNLRGLDLLFSSNKKSDL----- 714
Query: 688 PDYFPAINENHAVEKPLYRDDIYVHPNKLRKGLCQDVNLKLYVTHFPTLKHVQFTHKIEN 747
+ F + + +K I + + GL V L P L + +
Sbjct: 715 SELFKDL-YSKCKQKE----YITMCSKESPYGLFGTVKLGAEGL-APNLLSLCPISFLSY 768
Query: 748 AKVKVFQQPSRNESII-IVQDKVQVPSLKQVADQILGKTIFINYPHLIE 795
+ VF + S+N+S +++D + K + + K I + +P+L E
Sbjct: 769 PGLMVFLEYSKNQSARLVIEDPKSTVTNKSIVLRGFIKPINVLWPYLRE 817
>gnl|CDD|202558 pfam03159, XRN_N, XRN 5'-3' exonuclease N-terminus. This family
aligns residues towards the N-terminus of several
proteins with multiple functions. The members of this
family all appear to possess 5'-3' exonuclease activity
EC:3.1.11.-. Thus, the aligned region may be necessary
for 5' to 3' exonuclease function. The family also
contains several Xrn1 and Xrn2 proteins. The 5'-3'
exoribonucleases Xrn1p and Xrn2p/Rat1p function in the
degradation and processing of several classes of RNA in
Saccharomyces cerevisiae. Xrn1p is the main enzyme
catalyzing cytoplasmic mRNA degradation in multiple
decay pathways, whereas Xrn2p/Rat1p functions in the
processing of rRNAs and small nucleolar RNAs (snoRNAs)
in the nucleus.
Length = 237
Score = 382 bits (983), Expect = e-122
Identities = 153/299 (51%), Positives = 188/299 (62%), Gaps = 63/299 (21%)
Query: 1 MGVPKFFRYISERYPCLSEVVREYQIP-EFDNLYLDMNGIIHNCSHPNDNDPHFRITEKQ 59
MGVPKFFR++SERYP + + + E+QIP EFDNLYLDMNGIIH CSHP D F TE +
Sbjct: 1 MGVPKFFRWLSERYPKIIQPIIEHQIPDEFDNLYLDMNGIIHPCSHPEDK--PFPETEDE 58
Query: 60 IFQDIFHYIEILFRMIQPKKLFFMAIDGVAPRAKMNQQRGRRFRSAKEAEVLEQKAKEKG 119
+F IF YI+ LF +++P+KL +MAIDGVAPRAKMNQQR RRFR+AK+A+ E +A+E
Sbjct: 59 MFVAIFEYIDRLFNIVRPRKLLYMAIDGVAPRAKMNQQRSRRFRAAKDAKEKEAEAEENR 118
Query: 120 ERLPTEKRFDFITLKFCFPKKLFFMAIDGVAPRAKMNQQRGRRFRSAKEAEVLEQKAKEK 179
E L TE K +
Sbjct: 119 EELETEGIK---------------------------------------------LPEKVE 133
Query: 180 GERLPTEKRFDSNCITPGTPFMCRLHQQLRYFVKLKITTDKLWKDTLSLWVWKDVCVILS 239
E +FDSNCITPGTPFM RL + LRY++K K+ TD W+++ VILS
Sbjct: 134 KE------KFDSNCITPGTPFMARLAKALRYYIKKKLNTDPE---------WQNLTVILS 178
Query: 240 GHETPGEGEHKIMDYIRYMKSKPDYNPNMRHCLYGLDADLIMLGLCTHEPYFSLLREEV 298
E PGEGEHKIMD+IR +++PDY+PN RHCLYGLDADLIMLGL THEP+FS+LREEV
Sbjct: 179 DAEVPGEGEHKIMDFIRSQRAQPDYDPNTRHCLYGLDADLIMLGLATHEPHFSVLREEV 237
>gnl|CDD|219339 pfam07223, DUF1421, Protein of unknown function (DUF1421). This
family represents a conserved region approximately 350
residues long within a number of plant proteins of
unknown function.
Length = 357
Score = 47.2 bits (112), Expect = 5e-05
Identities = 30/134 (22%), Positives = 43/134 (32%), Gaps = 6/134 (4%)
Query: 1610 KEDLQQQQTVEPEQNKQAFPPHPDQLAQPGDDVTSLIRPQPGPVRGHTNSSDLSRSTNAQ 1669
E+ Q + + PP P+Q+A+ ++ Q + + Q
Sbjct: 31 HEEAQSSEAHSFHVDSTKQPPAPEQVAK--HELADAPLQQVNAALPPAPAPQSPQPDQQQ 88
Query: 1670 PRGIPILGSQAAMYNPSVPYQQQQQYQYYYNQQQPMYYPQQVAYPPPPYQQARHPQYSKP 1729
P + Y +P QQ Q QQ YYP PPP QQ + Q P
Sbjct: 89 QSQAP----PSHQYPSQLPPQQVQSVPQQPTPQQEPYYPPPSQPQPPPAQQPQAQQPQPP 144
Query: 1730 PHPILNASSQPVPP 1743
P Q P
Sbjct: 145 PQVPQQQQYQSPPQ 158
Score = 40.7 bits (95), Expect = 0.006
Identities = 28/143 (19%), Positives = 34/143 (23%), Gaps = 19/143 (13%)
Query: 1616 QQTVEPEQ----NKQAFPPHPDQLAQPGDDVTSLIRPQPGPVRGHTNSSDLSRS--TNAQ 1669
+Q PEQ P A P +P S
Sbjct: 48 KQPPAPEQVAKHELADAPLQQVNAALPPAPAPQSPQPDQQQQSQAPPSHQYPSQLPPQQV 107
Query: 1670 PRGIPILGSQAAMYNPSVPYQQQQQYQYYYNQQQPMY-------YPQQVAYPPPPYQQAR 1722
Q Y Y Q Q QQ PQQ Y PP QQ +
Sbjct: 108 QSVPQQPTPQQEPY-----YPPPSQPQPPPAQQPQAQQPQPPPQVPQQQQYQSPP-QQPQ 161
Query: 1723 HPQYSKPPHPILNASSQPVPPDT 1745
+ Q P S P ++
Sbjct: 162 YQQNPPPQAQSAPQVSGLYPEES 184
Score = 40.3 bits (94), Expect = 0.008
Identities = 37/148 (25%), Positives = 46/148 (31%), Gaps = 24/148 (16%)
Query: 1613 LQQQQTVEPEQ-NKQAFPPHPDQLAQPGDDVTSLIRPQPGPVRGHTNSSDLSRSTNAQPR 1671
LQQ P Q+ P Q +Q + P V+ + P
Sbjct: 66 LQQVNAALPPAPAPQSPQPDQQQQSQAPPSHQYPSQLPPQQVQSVPQQPTPQQEPYYPPP 125
Query: 1672 G--IPILGSQAAMYNPSVPYQQQQQYQYYYNQQQPMYYPQQVAYP--PPPYQQAR----- 1722
P Q P P Q QQ QY Q P PQQ Y PPP Q+
Sbjct: 126 SQPQPPPAQQPQAQQPQPPPQVPQQQQY----QSP---PQQPQYQQNPPPQAQSAPQVSG 178
Query: 1723 -------HPQYSKPPHPILNASSQPVPP 1743
+ S PP+ L +S PP
Sbjct: 179 LYPEESPYQPQSYPPNEPLPSSMAMQPP 206
Score = 33.8 bits (77), Expect = 0.98
Identities = 28/138 (20%), Positives = 39/138 (28%), Gaps = 15/138 (10%)
Query: 1615 QQQTVEP-EQNKQAFPPHPDQLAQPGDDVTSLIRPQPGPVRGHTNSSDLSRSTNAQPRGI 1673
Q P +Q Q+ P P P + Q P + + P
Sbjct: 142 QPPPQVPQQQQYQSPPQQPQYQQNPP------PQAQSAPQVSGLYPEESPYQPQSYPPNE 195
Query: 1674 PILGSQAAM--YNPSVPYQQ------QQQYQYYYNQQQPMYYPQQVAYPPPPYQQARHPQ 1725
P+ S A Y+ + P QQ Y Y +P PPP Q +
Sbjct: 196 PLPSSMAMQPPYSGAPPSQQFYGPPQPSPYMYGGPGGRPNSGFPSGQQPPPSQGQEGYGY 255
Query: 1726 YSKPPHPILNASSQPVPP 1743
PP + S P
Sbjct: 256 SGPPPSKGNHGSVASYAP 273
>gnl|CDD|220392 pfam09770, PAT1, Topoisomerase II-associated protein PAT1. Members
of this family are necessary for accurate chromosome
transmission during cell division.
Length = 804
Score = 45.5 bits (108), Expect = 3e-04
Identities = 25/128 (19%), Positives = 32/128 (25%), Gaps = 16/128 (12%)
Query: 1613 LQQQQTVEPEQNKQAFPPHPDQLAQPGDDVTSLIRPQPGPVRGHTNSSDLSRSTNAQPRG 1672
QQ E +P P + L P + + P
Sbjct: 191 FPQQGPPEQPPG---YPQPPQGHPEQVQPQQFLPAPSQ------------APAQPPLPPQ 235
Query: 1673 IPILGSQAAMYNPSVPYQQQQQYQYYYNQQQPMYYPQQVAYPPPPYQQARHPQYSKPPHP 1732
+P QQ QQ PQ A PPP Q HP + +
Sbjct: 236 LP-QQPPPLQQPQFPGLSQQMPPPPPQPPQQQQQPPQPQAQPPPQNQPTPHPGLPQGQNA 294
Query: 1733 ILNASSQP 1740
L QP
Sbjct: 295 PLPPPQQP 302
Score = 44.0 bits (104), Expect = 8e-04
Identities = 25/142 (17%), Positives = 38/142 (26%), Gaps = 5/142 (3%)
Query: 1603 EVMIIEIKEDLQQQQTVEPEQNKQAFPPHPDQLAQPGDDVTSLIRPQPGP-VRGHTNSSD 1661
+++ +E E QQ+ P+ + P + P+ P
Sbjct: 152 KMLSLEEVEAQLQQRQQAPQLPQPPQQVLPQGMPPRQAAFPQQGPPEQPPGYPQPPQGHP 211
Query: 1662 LSRSTNAQPRGIPILGSQAAMYNPSVPYQQQQQYQYYYNQQQPMYYPQQVAYPPPPYQQA 1721
+Q + P QQ Q P P PP QQ
Sbjct: 212 EQVQPQQFLPAPSQAPAQPPL--PPQLPQQPPPLQQPQFPGLSQQMPPPP--PQPPQQQQ 267
Query: 1722 RHPQYSKPPHPILNASSQPVPP 1743
+ PQ P P + P P
Sbjct: 268 QPPQPQAQPPPQNQPTPHPGLP 289
Score = 41.3 bits (97), Expect = 0.006
Identities = 17/85 (20%), Positives = 24/85 (28%), Gaps = 5/85 (5%)
Query: 1668 AQPRGIPILGSQAAMYNPSVPYQQQQQYQYYYNQQQPMYYPQQVAYPPPPYQQARHPQYS 1727
AQ + +P + Q + QQ P P PP + + PQ
Sbjct: 161 AQLQQRQQAPQLPQPPQQVLPQGMPPR-QAAFPQQGPPEQPPGYPQPPQGHPEQVQPQQF 219
Query: 1728 KPPHPILNASS---QPVPPDTTEDL 1749
P P + P P L
Sbjct: 220 -LPAPSQAPAQPPLPPQLPQQPPPL 243
Score = 40.9 bits (96), Expect = 0.008
Identities = 22/137 (16%), Positives = 32/137 (23%), Gaps = 19/137 (13%)
Query: 1614 QQQQTVEPEQNKQAFPPHPDQLAQPGDDVTSLIRPQPGPVRGHTNSSDLSRSTNAQPRGI 1673
Q ++ Q +++ + P + PR
Sbjct: 138 QAPESQPQPQTPAQKMLSLEEVEAQLQQRQQAPQLPQPPQQVLPQ--------GMPPRQA 189
Query: 1674 PILGSQAAMYNPSVPYQQQQQYQYYYNQQQPMYY-------PQQVAYPPPPYQQARHPQY 1726
P P Q Q + Q QP + P Q PP QQ Q
Sbjct: 190 AF----PQQGPPEQPPGYPQPPQGHPEQVQPQQFLPAPSQAPAQPPLPPQLPQQPPPLQQ 245
Query: 1727 SKPPHPILNASSQPVPP 1743
+ P P P
Sbjct: 246 PQFPGLSQQMPPPPPQP 262
Score = 36.3 bits (84), Expect = 0.19
Identities = 25/118 (21%), Positives = 32/118 (27%), Gaps = 23/118 (19%)
Query: 1611 EDLQQQQTVEPEQNKQAFPPHPDQLAQPGDDVTSLIRPQPGPVRGHTNSSDLSRSTNAQP 1670
LQQ Q P ++Q PP P Q P + QP
Sbjct: 241 PPLQQPQF--PGLSQQMPPPPPQPPQQQQQPP---QPQAQPPPQN-------------QP 282
Query: 1671 RGIPILG-SQAAMYNPSVPYQQQQQYQYYYNQQQPMYYPQQVAYPPPPYQQARHPQYS 1727
P L Q A P Q Q QQ+ + +Q Q + S
Sbjct: 283 TPHPGLPQGQNAPLPPPQQPQLLPLVQQPQGQQRGPQFREQ----LVQLSQQQREALS 336
Score = 32.8 bits (75), Expect = 2.2
Identities = 21/138 (15%), Positives = 32/138 (23%), Gaps = 8/138 (5%)
Query: 1611 EDLQQQQTVEPEQNKQAFPPHPDQLAQPGDDVTSLIRPQPGPVRGHTNSSDLSRSTNAQP 1670
+ + N+ A TS P G+ S+D + QP
Sbjct: 65 GGETAKLSAAVRYNQNAPGAPSVGPDSDLSQKTSTFSPCQ---SGYEASTDPEYIPDLQP 121
Query: 1671 R-----GIPILGSQAAMYNPSVPYQQQQQYQYYYNQQQPMYYPQQVAYPPPPYQQARHPQ 1725
P Q S P Q + ++ Q+ P P +
Sbjct: 122 DPSLWGTAPKPEPQPPQAPESQPQPQTPAQKMLSLEEVEAQLQQRQQAPQLPQPPQQVLP 181
Query: 1726 YSKPPHPILNASSQPVPP 1743
PP P
Sbjct: 182 QGMPPRQAAFPQQGPPEQ 199
>gnl|CDD|236669 PRK10263, PRK10263, DNA translocase FtsK; Provisional.
Length = 1355
Score = 45.1 bits (106), Expect = 4e-04
Identities = 37/135 (27%), Positives = 45/135 (33%), Gaps = 26/135 (19%)
Query: 1619 VEPEQNKQAFPPHPDQLAQPGDDVTSLIRPQPGPVRGHTNSSDLSRSTNAQPR-GIPILG 1677
VEP Q Q Q QP QP + QP+ +
Sbjct: 749 VEPVQQPQQPVAPQQQYQQP---------QQPVA----------PQPQYQQPQQPVAPQP 789
Query: 1678 SQAAMYNPSVPYQQQQQYQYYYNQQQPMYYPQQVAYPPPPYQQARHPQYSKPP----HPI 1733
P P Q QQ Q Q PQQ P P YQQ + P +P HP+
Sbjct: 790 QYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQDTLLHPL 849
Query: 1734 L--NASSQPVPPDTT 1746
L N S+P+ TT
Sbjct: 850 LMRNGDSRPLHKPTT 864
Score = 43.5 bits (102), Expect = 0.001
Identities = 31/132 (23%), Positives = 42/132 (31%), Gaps = 10/132 (7%)
Query: 1617 QTVEP--EQNKQAFPPHPDQLAQPGDDVTSLIRPQPGPVRGHTNSSDLSRSTNAQPRGIP 1674
QT EP + +P Q AQP P PV+ + AQ
Sbjct: 367 QTGEPVIAPAPEGYPQQS-QYAQPAVQYNE---PLQQPVQPQQPYYAPAAEQPAQQPYYA 422
Query: 1675 ILGSQAAMYNPSVPYQQQ--QQYQYYYNQQQPMYYPQQVAYPPPPYQQ--ARHPQYSKPP 1730
Q A P +Q + +QQ + PQ YQQ A+ P Y +P
Sbjct: 423 PAPEQPAQQPYYAPAPEQPVAGNAWQAEEQQSTFAPQSTYQTEQTYQQPAAQEPLYQQPQ 482
Query: 1731 HPILNASSQPVP 1742
+P P
Sbjct: 483 PVEQQPVVEPEP 494
Score = 43.1 bits (101), Expect = 0.002
Identities = 22/71 (30%), Positives = 24/71 (33%)
Query: 1679 QAAMYNPSVPYQQQQQYQYYYNQQQPMYYPQQVAYPPPPYQQARHPQYSKPPHPILNASS 1738
Q P P Q QYQ P QQ P P Q + PQ P P
Sbjct: 775 QPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQ 834
Query: 1739 QPVPPDTTEDL 1749
QPV P + L
Sbjct: 835 QPVAPQPQDTL 845
Score = 41.2 bits (96), Expect = 0.007
Identities = 29/140 (20%), Positives = 42/140 (30%), Gaps = 7/140 (5%)
Query: 1614 QQQQTVEPEQNKQAFPPHPDQLAQPGDDVTSLIRPQPGPVRGHTNSSDLSRSTNAQPRGI 1673
QQ Q +P P Q QP + Q P
Sbjct: 382 QQSQYAQPAVQYNEPLQQPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQ- 440
Query: 1674 PILGSQAAMYNPSVPYQQQQQYQYYYNQQQPMYYPQQVAYPPPPYQQARHPQYSKPPHPI 1733
P+ G+ + Q YQ QQP Q+ Y P + Q P P+
Sbjct: 441 PVAGNAWQAEEQQSTFAPQSTYQTEQTYQQPA--AQEPLYQQPQPVE---QQPVVEPEPV 495
Query: 1734 LNASSQPVPPDTT-EDLEER 1752
+ + PP E++EE+
Sbjct: 496 VEETKPARPPLYYFEEVEEK 515
Score = 40.1 bits (93), Expect = 0.013
Identities = 34/138 (24%), Positives = 42/138 (30%), Gaps = 7/138 (5%)
Query: 1615 QQQTVEPEQNKQAFPPHPDQLA-QPGDDVTS------LIRPQPGPVRGHTNSSDLSRSTN 1667
QQQ EQ A P D P + L P PV+ +
Sbjct: 707 QQQRYSGEQPAGANPFSLDDFEFSPMKALLDDGPHEPLFTPIVEPVQQPQQPVAPQQQYQ 766
Query: 1668 AQPRGIPILGSQAAMYNPSVPYQQQQQYQYYYNQQQPMYYPQQVAYPPPPYQQARHPQYS 1727
+ + P P Q QQ Q Q PQQ P P YQQ + P
Sbjct: 767 QPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAPQPQYQQPQQPVAP 826
Query: 1728 KPPHPILNASSQPVPPDT 1745
+P + P P DT
Sbjct: 827 QPQYQQPQQPVAPQPQDT 844
Score = 32.7 bits (74), Expect = 2.2
Identities = 32/139 (23%), Positives = 42/139 (30%), Gaps = 11/139 (7%)
Query: 1619 VEPEQNKQAFPPHPDQLAQPGDDVTSLIRPQPGPVRGHTNSSDLSRSTNAQPRGIPILGS 1678
VEP AQP T +P PGP G + Q +
Sbjct: 337 VEPVTQTPPVASVDVPPAQP----TVAWQPVPGPQTGEPVIAPAPEGYPQQSQYAQPAVQ 392
Query: 1679 QAAMYNPSVPYQQQQQYQYYYNQQQPMYYPQQVAYPPPPYQQARHPQYS-KPPHPILNAS 1737
P QQ Q Q Y QQ Y P P Q A+ P Y+ P P+ +
Sbjct: 393 YNE------PLQQPVQPQQPYYAPAAEQPAQQPYYAPAPEQPAQQPYYAPAPEQPVAGNA 446
Query: 1738 SQPVPPDTTEDLEERADSY 1756
Q +T + +
Sbjct: 447 WQAEEQQSTFAPQSTYQTE 465
>gnl|CDD|222579 pfam14179, YppG, YppG-like protein. The YppG-like protein family
includes the B. subtilis YppG protein, which is
functionally uncharacterized. This family of proteins is
found in bacteria. Proteins in this family are typically
between 115 and 181 amino acids in length. There are two
completely conserved residues (F and G) that may be
functionally important.
Length = 110
Score = 40.9 bits (96), Expect = 6e-04
Identities = 17/62 (27%), Positives = 19/62 (30%), Gaps = 3/62 (4%)
Query: 1682 MYNPSVPYQQQQQYQYYYNQQQPMYYPQQVAYPPPPYQQARHPQYSKPPHPILNASSQPV 1741
N + Q QQ Q Y Q P P P QQ + P P S Q
Sbjct: 4 QQNTNQYPPQNQQQQPYQQQPYHQQMPPP---PYSPPQQQQGHFMPPQPQPYPKQSPQQQ 60
Query: 1742 PP 1743
P
Sbjct: 61 QP 62
Score = 34.7 bits (80), Expect = 0.080
Identities = 15/49 (30%), Positives = 17/49 (34%), Gaps = 4/49 (8%)
Query: 1697 YYYNQQQPMYYPQQVAYPPPPYQQARHPQYSKPP--HPILNASSQPVPP 1743
Y Q Y PQ PYQQ + Q PP P +PP
Sbjct: 1 NPYQQNTNQYPPQN--QQQQPYQQQPYHQQMPPPPYSPPQQQQGHFMPP 47
Score = 32.4 bits (74), Expect = 0.55
Identities = 13/46 (28%), Positives = 15/46 (32%)
Query: 1682 MYNPSVPYQQQQQYQYYYNQQQPMYYPQQVAYPPPPYQQARHPQYS 1727
P Y QQ PQ YP QQ + PQ+S
Sbjct: 21 QQQPYHQQMPPPPYSPPQQQQGHFMPPQPQPYPKQSPQQQQPPQFS 66
Score = 32.4 bits (74), Expect = 0.63
Identities = 20/65 (30%), Positives = 24/65 (36%), Gaps = 5/65 (7%)
Query: 1678 SQAAMYNPSVPYQQQ--QQYQYYYNQQQPMYYP--QQVAYPPPPYQQARHPQYSKPPHPI 1733
Q P QQQ QQ Y+ P Y P QQ + PP Q +P+ S
Sbjct: 4 QQNTNQYPPQNQQQQPYQQQPYHQQMPPPPYSPPQQQQGHFMPPQPQP-YPKQSPQQQQP 62
Query: 1734 LNASS 1738
SS
Sbjct: 63 PQFSS 67
>gnl|CDD|220309 pfam09606, Med15, ARC105 or Med15 subunit of Mediator complex
non-fungal. The approx. 70 residue Med15 domain of the
ARC-Mediator co-activator is a three-helix bundle with
marked similarity to the KIX domain. The sterol
regulatory element binding protein (SREBP) family of
transcription activators use the ARC105 subunit to
activate target genes in the regulation of cholesterol
and fatty acid homeostasis. In addition, Med15 is a
critical transducer of gene activation signals that
control early metazoan development.
Length = 768
Score = 42.7 bits (100), Expect = 0.002
Identities = 29/130 (22%), Positives = 37/130 (28%), Gaps = 19/130 (14%)
Query: 1614 QQQQTVEPEQNKQAFPPHPDQLAQPGDDVTSLIRPQPGPVRGHTNSSDLSRSTNAQPRGI 1673
QQQQ + A +Q G + +L +G N +G
Sbjct: 356 QQQQQQGG-NHPAAHQQQMNQQVGQGGQMVALGYLNIQGNQGGLG-------ANPMQQGQ 407
Query: 1674 PILGSQAAMYNPSVPYQQQQQYQYYYNQQQPMY-YPQQVAYPPPPYQQARHPQYSKPPHP 1732
P + M +PS P Q Q Q QP P PP P P
Sbjct: 408 PGM-----MSSPS-PVPQVQTNQSMPQPPQPSVPSPGGPGSQPPQS----VSGGMIPSPP 457
Query: 1733 ILNASSQPVP 1742
L S P
Sbjct: 458 ALMPSPSPQM 467
Score = 33.8 bits (77), Expect = 1.1
Identities = 24/122 (19%), Positives = 29/122 (23%), Gaps = 8/122 (6%)
Query: 1604 VMIIEIKEDLQQQQTVEPEQNKQAFPPHPDQLAQPGDDVTSLIRPQ---PGPVRGHTNSS 1660
+M + QQ Q QA V PQ PG G
Sbjct: 150 MMQQSSGQPQSQQPNQMGPQQGQAQGQAGGMNQGQQGPVGQQQPPQMGQPGMPGGGGQG- 208
Query: 1661 DLSRSTNAQPRGIPILGSQAAMYNPSVPYQQQ-QQYQYYYNQQQPMYYPQQVAYPPPPYQ 1719
+ G + QQQ Q Q + Q M QQ P Q
Sbjct: 209 ---QMQQQGQPGGQQQQNPQMQQQLQNQQQQQMDQQQGPADAQAQMGQQQQGQGGMQPQQ 265
Query: 1720 QA 1721
Sbjct: 266 MQ 267
Score = 32.7 bits (74), Expect = 2.6
Identities = 22/141 (15%), Positives = 36/141 (25%), Gaps = 14/141 (9%)
Query: 1613 LQQQQTVEPEQNKQAFPPHPDQLAQPGDDVTSLIRPQPGPVRGHTNSSDLSRSTNAQPRG 1672
+ +Q Q Q Q + + + Q +G ++ Q +G
Sbjct: 202 PGGGGQGQMQQQGQPGGQ-QQQNPQMQQQLQNQQQQQMDQQQGPADAQ-AQMGQQQQGQG 259
Query: 1673 IPILGSQAAMYNPSVPYQQQQQYQYYYNQ-----QQPMYY-------PQQVAYPPPPYQQ 1720
QQ Q Q +Q Q P Q PPP
Sbjct: 260 GMQPQQMQGGQMQVPMQQQPPQQQPQQSQLGMLPNQMQQMPGGGQGGPGQPMGPPPQRPG 319
Query: 1721 ARHPQYSKPPHPILNASSQPV 1741
A +++A Q +
Sbjct: 320 AVPQGGQAVQQGVMSAGQQQL 340
Score = 31.9 bits (72), Expect = 4.8
Identities = 21/117 (17%), Positives = 30/117 (25%), Gaps = 10/117 (8%)
Query: 1614 QQQQTVEPEQNKQAFPPHPD-QLAQPGDDVTSLIRPQPGPVRGHTNSSDLSRSTNAQPRG 1672
QQ Q + + Q PP Q +Q G + + G G P+
Sbjct: 264 QQMQGGQMQVPMQQQPPQQQPQQSQLGMLPNQMQQMPGGGQGGPGQP------MGPPPQR 317
Query: 1673 IPILGSQAAMYNPSVPYQQQQQYQYYYNQQQPMYYPQQVAYPPPPYQQARHPQYSKP 1729
+ V QQQ + QQ Q HP +
Sbjct: 318 PGAVPQGGQAVQQGVMSAGQQQL---KQMKLRNMRGQQQTQQQQQQQGGNHPAAHQQ 371
Score = 30.7 bits (69), Expect = 9.8
Identities = 42/204 (20%), Positives = 64/204 (31%), Gaps = 23/204 (11%)
Query: 1538 MVQQSPHLLEKNKELNTPSGGELSFEEKHSMA-AKSKNETTPTIK---TEMRKIISSSDY 1593
+V + L KN + +T + E+ E H A A++K+E + R + +
Sbjct: 3 VVSKIEEALRKNGQTHTKNAREM---ENHVFAKARTKDEYLGLVARLIIHFRDMSKKAAQ 59
Query: 1594 QGLEEEEEEEVMIIEIKEDLQQQQTVEPEQNKQAFPPHPDQLAQPGDDVTS-------LI 1646
Q + + + I ++L Q T P+ P Q G T+ +
Sbjct: 60 QQVLQGGQGMPDPINALQNLTGQGTRGPQMGPMGPGPGRPMGQQMGGPGTASNLLQSLNV 119
Query: 1647 RPQPGPVRGHTNSSDLSRSTNAQPRGIPILGSQAAMYNPSVPYQQQQQYQYYYNQQQPMY 1706
R Q +SR QP G QA Q Q Q NQ P
Sbjct: 120 RGQMPMGAAGMGPHQMSRVGTMQP------GGQAGGMMQQSSGQPQSQQP---NQMGPQQ 170
Query: 1707 YPQQVAYPPPPYQQARHPQYSKPP 1730
Q Q +PP
Sbjct: 171 GQAQGQAGGMNQGQQGPVGQQQPP 194
>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family. Atrophin-1 is the protein
product of the dentatorubral-pallidoluysian atrophy
(DRPLA) gene. DRPLA OMIM:125370 is a progressive
neurodegenerative disorder. It is caused by the expansion
of a CAG repeat in the DRPLA gene on chromosome 12p. This
results in an extended polyglutamine region in
atrophin-1, that is thought to confer toxicity to the
protein, possibly through altering its interactions with
other proteins. The expansion of a CAG repeat is also the
underlying defect in six other neurodegenerative
disorders, including Huntington's disease. One
interaction of expanded polyglutamine repeats that is
thought to be pathogenic is that with the short glutamine
repeat in the transcriptional coactivator CREB binding
protein, CBP. This interaction draws CBP away from its
usual nuclear location to the expanded polyglutamine
repeat protein aggregates that are characteristic of the
polyglutamine neurodegenerative disorders. This
interferes with CBP-mediated transcription and causes
cytotoxicity.
Length = 979
Score = 41.2 bits (96), Expect = 0.006
Identities = 53/233 (22%), Positives = 89/233 (38%), Gaps = 19/233 (8%)
Query: 1520 SSSNDTEAENTFKRAISAMVQQSPHLLEKNKELNTPSGGELSFEEKHSMAAKSKNETTPT 1579
+SSND++AE+T K+ + +++ L+ K + E+ + A KSK +
Sbjct: 52 TSSNDSKAEST-KKPNKKIKEEATSPLKSTKRQREKPASDTEEPERVT-AKKSKTQ---- 105
Query: 1580 IKTEMRKIISSSDYQGLEEEEEEEVMIIEIKEDLQQQQTVEPEQNKQAFPPHPDQLAQPG 1639
E+ + S S+ +G E E E + E+ + N+ + P P
Sbjct: 106 ---ELSRPNSPSEGEGEGEGEGESSDSRSVNEEGSSDPKDIDQDNRSSSPSIPSPQDNES 162
Query: 1640 DDVTS----LIRPQPGPVRGHTNSSDLSRSTNAQPRGIPILGSQAAMYNPSVPYQQQQQY 1695
D +S L++PQ P + L+ S + Q + + P Q QQ
Sbjct: 163 DSDSSAQQQLLQPQGPPSIQVPPGAALAPSAPPPTPSAQAVPPQGSP-IAAQPAPQPQQP 221
Query: 1696 QYYYNQQQPMYYPQQVAYPPPPYQ----QARHPQYSKPPHPILNAS-SQPVPP 1743
P +PQ++ P PP Q + PQ P +S P PP
Sbjct: 222 SPLSLISAPSLHPQRLPSPHPPLQPQTASQQSPQPPAPSSRHPQSSHHGPGPP 274
>gnl|CDD|197891 smart00818, Amelogenin, Amelogenins, cell adhesion proteins, play a
role in the biomineralisation of teeth. They seem to
regulate formation of crystallites during the secretory
stage of tooth enamel development and are thought to play
a major role in the structural organisation and
mineralisation of developing enamel. The extracellular
matrix of the developing enamel comprises two major
classes of protein: the hydrophobic amelogenins and the
acidic enamelins. Circular dichroism studies of porcine
amelogenin have shown that the protein consists of 3
discrete folding units: the N-terminal region appears to
contain beta-strand structures, while the C-terminal
region displays characteristics of a random coil
conformation. Subsequent studies on the bovine protein
have indicated the amelogenin structure to contain a
repetitive beta-turn segment and a "beta-spiral" between
Gln112 and Leu138, which sequester a (Pro, Leu, Gln) rich
region. The beta-spiral offers a probable site for
interactions with Ca2+ ions. Muatations in the human
amelogenin gene (AMGX) cause X-linked hypoplastic
amelogenesis imperfecta, a disease characterised by
defective enamel. A 9bp deletion in exon 2 of AMGX
results in the loss of codons for Ile5, Leu6, Phe7 and
Ala8, and replacement by a new threonine codon,
disrupting the 16-residue (Met1-Ala16) amelogenin signal
peptide.
Length = 165
Score = 38.2 bits (89), Expect = 0.015
Identities = 18/61 (29%), Positives = 25/61 (40%), Gaps = 5/61 (8%)
Query: 1683 YNPSVPYQQQQQYQYYYNQQQPMYYPQQVAYPPPPYQQARHPQYSKPPHPILNASSQPVP 1742
+ P++P QQ +Q Q P Q P P P +PP P + QP+P
Sbjct: 88 HQPNLPQPAQQPFQPQPLQPPQPQQPMQ----PQPPVHPIPPLPPQPPLPPM-FPMQPLP 142
Query: 1743 P 1743
P
Sbjct: 143 P 143
Score = 34.8 bits (80), Expect = 0.17
Identities = 21/75 (28%), Positives = 25/75 (33%), Gaps = 10/75 (13%)
Query: 1674 PILGSQAAMYNPSVPYQQQQQYQYYYNQQ-QPMYYPQQV--AYPPPPYQQARHPQYSKPP 1730
+ +Q N P QQ Q Q Q Q PQ PP P Q P + P
Sbjct: 81 SMTPTQHHQPNLPQPAQQPFQPQPLQPPQPQQPMQPQPPVHPIPPLPPQPPLPPMFPMQP 140
Query: 1731 HPILNASSQPVPPDT 1745
P P+ PD
Sbjct: 141 LP-------PLLPDL 148
Score = 30.9 bits (70), Expect = 3.7
Identities = 20/74 (27%), Positives = 23/74 (31%), Gaps = 4/74 (5%)
Query: 1674 PILGSQAAMYNPSVPYQQQQQYQYYYNQ---QQPMY-YPQQVAYPPPPYQQARHPQYSKP 1729
P Q + P +P QQ Q Q M P P QQ PQ +P
Sbjct: 48 PTHTLQPHHHIPVLPAQQPVVPQQPLMPVPGQHSMTPTQHHQPNLPQPAQQPFQPQPLQP 107
Query: 1730 PHPILNASSQPVPP 1743
P P QP
Sbjct: 108 PQPQQPMQPQPPVH 121
>gnl|CDD|221501 pfam12273, RCR, Chitin synthesis regulation, resistance to Congo red.
RCR proteins are ER membrane proteins that regulate
chitin deposition in fungal cell walls. Although chitin,
a linear polymer of beta-1,4-linked N-acetylglucosamine,
constitutes only 2% of the cell wall it plays a vital
role in the overall protection of the cell wall against
stress, noxious chemicals and osmotic pressure changes.
Congo red is a cell wall-disrupting benzidine-type dye
extensively used in many cell wall mutant studies that
specifically targets chitin in yeast cells and inhibits
growth. RCR proteins render the yeasts resistant to Congo
red by diminishing the content of chitin in the cell
wall. RCR proteins are probably regulating chitin
synthase III interact directly with ubiquitin ligase
Rsp5, and the VPEY motif is necessary for this, via
interaction with the WW domains of Rsp5.
Length = 124
Score = 37.1 bits (86), Expect = 0.015
Identities = 15/48 (31%), Positives = 19/48 (39%), Gaps = 6/48 (12%)
Query: 1700 NQQQPMYYPQQVAYPPPPYQQARHPQYSKPPHPILNASSQPVPPDTTE 1747
QP+Y +A PP Q + QY+ P Q VPP T
Sbjct: 29 RGLQPIYGTAWLA-PPSYRQSQQ--QYNNTPQG---RPPQYVPPYTET 70
Score = 36.3 bits (84), Expect = 0.030
Identities = 26/88 (29%), Positives = 31/88 (35%), Gaps = 11/88 (12%)
Query: 1674 PILGSQAAMYNPSVPYQQQQQY----QYYYNQQQPMYYPQQVAYPPPPY--QQARHPQYS 1727
PI G+ A + PS Q QQQY Q Q P Y Y Q HP
Sbjct: 33 PIYGT-AWLAPPS-YRQSQQQYNNTPQGRPPQYVPPYTETANENDLGYYDGQGEFHPNPK 90
Query: 1728 KPPH---PILNASSQPVPPDTTEDLEER 1752
P P NA + TT+D +
Sbjct: 91 APAIELQPPPNAYERGTRSPTTDDEYQP 118
>gnl|CDD|130689 TIGR01628, PABP-1234, polyadenylate binding protein, human types 1,
2, 3, 4 family. These eukaryotic proteins recognize the
poly-A of mRNA and consists of four tandem RNA
recognition domains at the N-terminus (rrm: pfam00076)
followed by a PABP-specific domain (pfam00658) at the
C-terminus. The protein is involved in the transport of
mRNA's from the nucleus to the cytoplasm. There are four
paralogs in Homo sapiens which are expressed in testis
(GP:11610605_PABP3 ), platelets (SP:Q13310_PABP4 ),
broadly expressed (SP:P11940_PABP1) and of unknown tissue
range (SP:Q15097_PABP2).
Length = 562
Score = 37.5 bits (87), Expect = 0.074
Identities = 17/91 (18%), Positives = 23/91 (25%), Gaps = 16/91 (17%)
Query: 1670 PRGIPILGSQAA-MYNPSVPYQQQQQYQYYYNQQQPMYYPQQ-------VAYPPPP---- 1717
P G P+ G+ Y P QQ + + M P P
Sbjct: 386 PMGSPMGGAMGQPPYYGQGPQQQFNGQPLGWPRMSMMPTPMGPGGPLRPNGLAPMNAVRA 445
Query: 1718 ---YQQARHPQYSKPPHPIL-NASSQPVPPD 1744
Q + P N S P+ D
Sbjct: 446 PSRNAQNAAQKPPMQPVMYPPNYQSLPLSQD 476
>gnl|CDD|235124 PRK03427, PRK03427, cell division protein ZipA; Provisional.
Length = 333
Score = 36.9 bits (86), Expect = 0.088
Identities = 22/100 (22%), Positives = 28/100 (28%), Gaps = 4/100 (4%)
Query: 1647 RPQPGPVRGHTNSSDLSRSTNAQPRGIPILGSQAAMYNPSVPYQQQQQYQYYYNQQQPMY 1706
P + H ++ S QP Q P Q + QP+
Sbjct: 72 NHAPANAQEH-EAARPSPQHQYQPPYASAQPRQPVQQPPEAQVPPQHAPRPAQPAPQPVQ 130
Query: 1707 YPQQVAYPPPPYQQARHPQYSKPPHPILNASSQPVPPDTT 1746
P P P QQ PQ + P P S P P
Sbjct: 131 QPAYQPQPEQPLQQPVSPQVAPAPQP---VHSAPQPAQQA 167
Score = 33.9 bits (78), Expect = 0.75
Identities = 24/128 (18%), Positives = 36/128 (28%), Gaps = 23/128 (17%)
Query: 1620 EPEQNKQAFPPHPDQLAQPGDDVTSLIRPQPGPVRGHTNSSDLSRSTNAQPRGIPILGSQ 1679
Q +A P P QP QP + + +P +
Sbjct: 76 ANAQEHEAARPSPQHQYQP-----PYASAQPRQPVQQPPEAQVPPQHAPRP----AQPAP 126
Query: 1680 AAMYNPSVPYQQQQQYQYYYNQQQPMYYPQQVAYPPPPYQQARHPQYS----KPPHPILN 1735
+ P YQ Q + QP+ P P P PQ + +P P+
Sbjct: 127 QPVQQP--AYQPQPE--------QPLQQPVSPQVAPAPQPVHSAPQPAQQAFQPAEPVAA 176
Query: 1736 ASSQPVPP 1743
+PV
Sbjct: 177 PQPEPVAE 184
>gnl|CDD|237874 PRK14971, PRK14971, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 614
Score = 36.7 bits (85), Expect = 0.14
Identities = 18/88 (20%), Positives = 26/88 (29%), Gaps = 15/88 (17%)
Query: 1661 DLSRSTNAQPRGIPILGSQAAMYNPSVPYQQQQQYQYYYNQQQPMYYPQQVAYPPPPY-- 1718
D S + P+ AA PS P Q + P
Sbjct: 368 DASGGRGPKQHIKPVFTQPAAAPQPSAAAAASPS-------------PSQSSAAAQPSAP 414
Query: 1719 QQARHPQYSKPPHPILNASSQPVPPDTT 1746
Q A P + P + ++ PV P +T
Sbjct: 415 QSATQPAGTPPTVSVDPPAAVPVNPPST 442
>gnl|CDD|237082 PRK12373, PRK12373, NADH dehydrogenase subunit E; Provisional.
Length = 400
Score = 36.3 bits (84), Expect = 0.14
Identities = 23/84 (27%), Positives = 33/84 (39%), Gaps = 7/84 (8%)
Query: 1805 AKKADSVEICARRAFNELTQIYPTAPNTKSASDVTRAETKTAVAAQPAPQNTEVDQTDKN 1864
A KA + +I + T++ AP A+ V +E AA+P + E + K
Sbjct: 208 ASKALAEDIGDTVKRIDGTEVPLLAPWQGDAAPVPPSE-----AARPKSADAETNAALKT 262
Query: 1865 --TAAAAEKKEKKAPRIAAKFKTP 1886
TA A K KAP T
Sbjct: 263 PATAPKAAAKNAKAPEAQPVSGTA 286
>gnl|CDD|237191 PRK12757, PRK12757, cell division protein FtsN; Provisional.
Length = 256
Score = 35.0 bits (81), Expect = 0.28
Identities = 13/67 (19%), Positives = 16/67 (23%), Gaps = 9/67 (13%)
Query: 1685 PSVPY----QQQQQYQYYYNQQQPMYYPQQVAYPPPPYQQARHPQYSKPPHPILNASSQP 1740
VPY Q + QQ P P P R P+ + P
Sbjct: 103 SEVPYNEQTPQVPRSTVQIQQQAQQQQPPATTAQPQPVTPPRQTT-----APVQPQTPAP 157
Query: 1741 VPPDTTE 1747
V
Sbjct: 158 VRTQPAA 164
Score = 30.8 bits (70), Expect = 6.5
Identities = 17/76 (22%), Positives = 22/76 (28%), Gaps = 3/76 (3%)
Query: 1683 YN---PSVPYQQQQQYQYYYNQQQPMYYPQQVAYPPPPYQQARHPQYSKPPHPILNASSQ 1739
YN P VP Q Q QQ P Q PP A + P A+
Sbjct: 107 YNEQTPQVPRSTVQIQQQAQQQQPPATTAQPQPVTPPRQTTAPVQPQTPAPVRTQPAAPV 166
Query: 1740 PVPPDTTEDLEERADS 1755
+ + E+
Sbjct: 167 TQAVEAPKVEAEKEKE 182
>gnl|CDD|236362 PRK09010, PRK09010, single-stranded DNA-binding protein; Provisional.
Length = 177
Score = 34.1 bits (78), Expect = 0.36
Identities = 14/57 (24%), Positives = 20/57 (35%), Gaps = 1/57 (1%)
Query: 1675 ILGSQAAMYNPSVPYQQQQQYQYYYNQQQPMYYPQ-QVAYPPPPYQQARHPQYSKPP 1730
+LG + P+ QQ + QQP Q P Q A ++PP
Sbjct: 112 MLGGRQGGGAPAGGNIGGQQQGGWGQPQQPQGGNQFSGGAQSRPQQSAPAAPSNEPP 168
>gnl|CDD|237605 PRK14086, dnaA, chromosomal replication initiation protein;
Provisional.
Length = 617
Score = 34.8 bits (80), Expect = 0.45
Identities = 26/121 (21%), Positives = 35/121 (28%), Gaps = 19/121 (15%)
Query: 1618 TVEPEQNKQAFPPHPDQLAQPGDDVTSLIRPQPGPVRGHTNSSDLSRSTNAQPRGIPILG 1677
TV+P A P P L RP P G+ R
Sbjct: 87 TVDP----SAGEPAPPPPHARRTSEPELPRPGRRPYEGYGGPRADDRPPGLPR------Q 136
Query: 1678 SQAAMYNPSVPYQQQQQY--------QYYYNQQQPMYYPQQVAYPPPPYQQARHPQYSKP 1729
Q P+ P QQ+ Y QQQ + +P + Y P A + +
Sbjct: 137 DQLPTARPAYPAYQQRPEPGAWPRAADDYGWQQQRLGFPPRAPYASPA-SYAPEQERDRE 195
Query: 1730 P 1730
P
Sbjct: 196 P 196
>gnl|CDD|222095 pfam13388, DUF4106, Protein of unknown function (DUF4106). This
family of proteins are found in large numbers in the
Trichomonas vaginalis proteome. The function of this
protein is unknown.
Length = 422
Score = 34.6 bits (79), Expect = 0.46
Identities = 24/102 (23%), Positives = 36/102 (35%), Gaps = 10/102 (9%)
Query: 1688 PYQQQQQYQYYYNQQQPMYYPQQVAYPPPPYQQARHPQYSKPPHPILNASSQPVPPDTTE 1747
P QQ QQ + P Q P Q + Q P +P A P +
Sbjct: 201 PKPTQQPTVQNPAQQPTVQNPAQQPQQQPQQQPVQPAQQPTPQNP---AQQPPQTEQGHK 257
Query: 1748 DLEERADSYLAKLLRLTRAHNLGAPDYTFSMDASKLYVATVR 1789
E+ + K+L+ + G PD ++D S Y +R
Sbjct: 258 RSREQGNQEFLKMLK----EDYGYPD---TIDFSDRYKEAIR 292
Score = 33.5 bits (76), Expect = 1.2
Identities = 22/97 (22%), Positives = 31/97 (31%), Gaps = 11/97 (11%)
Query: 1629 PPHPDQLAQPGDDVTSLIRPQPGPVRGHTNSSDLSRSTNAQPRGIPILGSQAAMYNPSVP 1688
PP+P + A P PG + T+S +P P + + A P
Sbjct: 173 PPNPPREA-----------PAPGLPKTFTSSHGHRHRHAPKPTQQPTVQNPAQQPTVQNP 221
Query: 1689 YQQQQQYQYYYNQQQPMYYPQQVAYPPPPYQQARHPQ 1725
QQ QQ Q Q PP + H +
Sbjct: 222 AQQPQQQPQQQPVQPAQQPTPQNPAQQPPQTEQGHKR 258
>gnl|CDD|220708 pfam10349, WWbp, WW-domain ligand protein. The WWbp domain is
characterized by several short PY and PT-like motifs of
the PPPPY form. These appear to bind directly to the WW
domains of WWP1 and WWP2 and other such diverse proteins
as dystrophin and YAP (Yes-associated protein). This is
the WW-domain binding protein WWbp via PY and PY_like
motifs. The presence of a phosphotyrosine residue in the
pWBP-1 peptide abolishes WW domain binding which suggests
a potential regulatory role for tyrosine phosphorylation
in modulating WW domain-ligand interactions. Given the
likelihood that WWP1 and WWP2 function as E3
ubiquitin-protein ligases, it is possible that initial
substrate-specific recognition occurs via WW
domain-substrate protein interaction followed by
ubiquitin transfer and subsequent proteolysis. This
domain lies just downstream of the GRAM (pfam02893) in
many members.
Length = 111
Score = 32.8 bits (75), Expect = 0.48
Identities = 15/46 (32%), Positives = 15/46 (32%), Gaps = 2/46 (4%)
Query: 1704 PMYYPQQVAYPPPPYQQARHPQYSKPPHP--ILNASSQPVPPDTTE 1747
Y PPPPY A Y PP P A Q PP
Sbjct: 63 GQYAAPPPYGPPPPYYPAPPGVYPTPPPPNSGYMADPQEPPPPYPG 108
Score = 31.2 bits (71), Expect = 1.6
Identities = 11/68 (16%), Positives = 17/68 (25%)
Query: 1259 QPQRTTFYKENAPLVNSNLQVTSPYHVKVSRCNSQQSSVSQSPSSTMISQQSSPFNVKPT 1318
QP P +L+ Y + + P +
Sbjct: 38 QPVSRESGYYPPPGAYVHLEPLPAYGQYAAPPPYGPPPPYYPAPPGVYPTPPPPNSGYMA 97
Query: 1319 EKTEPPPP 1326
+ EPPPP
Sbjct: 98 DPQEPPPP 105
>gnl|CDD|219358 pfam07271, Cytadhesin_P30, Cytadhesin P30/P32. This family consists
of several Mycoplasma species specific Cytadhesin P32 and
P30 proteins. P30 has been found to be membrane
associated and localised on the tip organelle. It is
thought that it is important in cytadherence and
virulence.
Length = 279
Score = 33.9 bits (77), Expect = 0.63
Identities = 29/151 (19%), Positives = 50/151 (33%), Gaps = 21/151 (13%)
Query: 1594 QGLEEEEEEEVMIIEIKEDLQQQQTVEPEQNKQAFPPHPDQLAQPGDDVTSLIRPQPGPV 1653
Q + E+ E++ + I+ E++ Q+ +P A P P Q+ P P
Sbjct: 117 QRISEQNEQQAIEIDPTEEVNTQEPTQPAGVNVANNPQP-QVQPQ-------FGPNPQQ- 167
Query: 1654 RGHTNSSDLSRSTNAQPR-GIPILGSQAAMYNPSVPYQQQQQYQYYYNQQQPMYYPQQVA 1712
R + N R G +Q + P +P Q + +N M P +
Sbjct: 168 RINPQRFGFPMQPNMGMRPGF----NQMPPHMPGMPPNQMRP---GFNPMPGM--PPRPG 218
Query: 1713 YPPPPYQQARHPQYSKPPHPILNASSQPVPP 1743
+ P + P P + P P
Sbjct: 219 FNQNPNMMPNMNRPGFRPQP--GGFNHPGTP 247
>gnl|CDD|223066 PHA03379, PHA03379, EBNA-3A; Provisional.
Length = 935
Score = 34.3 bits (78), Expect = 0.74
Identities = 25/97 (25%), Positives = 29/97 (29%), Gaps = 7/97 (7%)
Query: 1651 GPVRGHTNSSDLSRSTNAQPRGIPILGSQAAMYNPSVPYQQQQQYQYYYNQQQPMYYPQQ 1710
R + S QP + + Q M P P QQ + M
Sbjct: 591 RLARLRAEAQPYQASVEVQPPQLTQVSPQQPMEYPLEPEQQMFPGSPFSQVADVMRAGGV 650
Query: 1711 VAYPPP----PYQQARHPQYSKPPHPILNASSQPVPP 1743
A P P QQ P P L AS PVPP
Sbjct: 651 PAMQPQYFDLPLQQ---PISQGAPLAPLRASMGPVPP 684
Score = 32.0 bits (72), Expect = 3.9
Identities = 22/109 (20%), Positives = 31/109 (28%), Gaps = 18/109 (16%)
Query: 1645 LIRPQPGPVRGHTNSSDLSRSTNAQPRGIPILGSQAAMYNPSVPYQQQQQYQYYYNQQQP 1704
L+R R S T PR P V + + Q +
Sbjct: 391 LMRAGKLTERAREALEKASEPTYGTPR-------------PPVEKPRPEVPQSL--ETAT 435
Query: 1705 MYYPQQVAYPPPPYQQARHP---QYSKPPHPILNASSQPVPPDTTEDLE 1750
+ QV PPP + P Q+S P P+ P+ D
Sbjct: 436 SHGSAQVPEPPPVHDLEPGPLHDQHSMAPCPVAQLPPGPLQDLEPGDQL 484
>gnl|CDD|218440 pfam05110, AF-4, AF-4 proto-oncoprotein. This family consists of AF4
(Proto-oncogene AF4) and FMR2 (Fragile X E mental
retardation syndrome) nuclear proteins. These proteins
have been linked to human diseases such as acute
lymphoblastic leukaemia and mental retardation. The
family also contains a Drosophila AF4 protein homologue
Lilliputian which contains an AT-hook domain. Lilliputian
represents a novel pair-rule gene that acts in
cytoskeleton regulation, segmentation and morphogenesis
in Drosophila.
Length = 1154
Score = 33.7 bits (77), Expect = 1.2
Identities = 21/99 (21%), Positives = 39/99 (39%), Gaps = 14/99 (14%)
Query: 1288 SRCNSQQSSVSQSPSSTMISQQSSPFNVKPTEKTEPPPPHK------LNQVNTSSAPKEK 1341
S S S S+S SS+ S+++ P + + EPP +K LN+VN +
Sbjct: 405 SGSESSSGSDSESESSSSDSEENEPPRT-ASPEPEPPSTNKWQLDNWLNKVNPHKVSPAE 463
Query: 1342 PRTSE-------QNQIKSNGNEQKVIINKEERCIFSNAT 1373
+S + + K + + +E S++
Sbjct: 464 SVSSNPPIKQPMEKEGKVKSSGSQYHPESKEPPPKSSSK 502
>gnl|CDD|220950 pfam11029, DAZAP2, DAZ associated protein 2 (DAZAP2). DAZ associated
protein 2 has a highly conserved sequence throughout
evolution including a conserved polyproline region and
several SH2/SH3 binding sites. It occurs as a single copy
gene with a four-exon organisation and is located on
chromosome 12. It encodes a ubiquitously expressed
protein and binds to DAZ and DAZL1 through DAZ repeats.
Length = 136
Score = 32.1 bits (73), Expect = 1.2
Identities = 12/33 (36%), Positives = 15/33 (45%), Gaps = 2/33 (6%)
Query: 1713 YP--PPPYQQARHPQYSKPPHPILNASSQPVPP 1743
YP PP Y + P Y PP + +S P P
Sbjct: 1 YPDAPPAYSELYQPSYVVPPQAQMPQASAPYPG 33
Score = 29.4 bits (66), Expect = 7.8
Identities = 18/70 (25%), Positives = 22/70 (31%), Gaps = 7/70 (10%)
Query: 1679 QAAMYNPS--VPYQQQQQYQYYYNQQQPMYYPQQVAYPPPPYQQARHPQYSKPPHPILNA 1736
+ +Y PS VP Q Q MY P P Y +PI A
Sbjct: 8 YSELYQPSYVVPPQAQMPQASAPYPGPSMYLPMAQVMAVGPQSSHPPMAY----YPI-GA 62
Query: 1737 SSQPVPPDTT 1746
PP +T
Sbjct: 63 PPPVYPPGST 72
>gnl|CDD|225657 COG3115, ZipA, Cell division protein [Cell division and chromosome
partitioning].
Length = 324
Score = 33.3 bits (76), Expect = 1.2
Identities = 20/106 (18%), Positives = 32/106 (30%), Gaps = 4/106 (3%)
Query: 1651 GPVRGHTNSSDLSRSTNAQP-RGIPILGSQA--AMYNPSVPYQQQQQYQYYYNQQQPMYY 1707
G VR + + + R P Q A +P Q Q QP
Sbjct: 62 GEVRVVRKNEAPQFTQEHEAARQSPQHQYQPEYASAQIKIPVPQPPQISDPPAHPQPTQP 121
Query: 1708 PQQVAYPPPPYQQARHPQYSKPPHPILNASSQPVPPDTTEDLEERA 1753
PP +Q PQ + P P+ ++++ T +
Sbjct: 122 ALDQEQPPEEARQPVLPQEAPAPQPV-HSAAPQPAVQTVQPAVPEQ 166
Score = 32.5 bits (74), Expect = 1.9
Identities = 32/226 (14%), Positives = 53/226 (23%), Gaps = 32/226 (14%)
Query: 1581 KTEMRKIISSSDYQGLEEEEE-EEVMIIEIKEDLQQQQTVEPEQNKQAFPPHPDQLAQPG 1639
M+ Y + ++E EV ++ E Q Q E + P H Q
Sbjct: 40 LKRMKSKRDDDPYDEVADDEGVGEVRVVRKNEAPQFTQEHEAAR---QSPQHQYQPEYAS 96
Query: 1640 DDVTSLIRPQPGPVRGHTNSSDLSRSTNAQPRGIPILGSQAAMYNPSVPYQQQQQYQYYY 1699
+ + P S P L + P Q+
Sbjct: 97 AQIKIPVPQPPQI----------SDPPAHPQPTQPALDQEQPPEEARQPVLPQEA-PAPQ 145
Query: 1700 NQQQPMYYPQQVAYPPPPYQQARHPQYSKPPHPILNASSQPVPPDTTEDLEERADSYLAK 1759
P P +Q Q + P + P + +
Sbjct: 146 PVHSAAPQPAVQTVQPAVPEQQ--VQPEEVVEPA----PEVKRPPRKDTV---------I 190
Query: 1760 LLRLTRAHN--LGAPDYTFSMDASKLYVATVRICHPHSDIVVRGKP 1803
+L + H S+ S + I H H + G
Sbjct: 191 ILNVAAHHESEFNGEKLLQSIQQSGFIFGDMNIFHRHLSLSGSGPV 236
>gnl|CDD|240419 PTZ00440, PTZ00440, reticulocyte binding protein 2-like protein;
Provisional.
Length = 2722
Score = 33.7 bits (77), Expect = 1.3
Identities = 36/238 (15%), Positives = 68/238 (28%), Gaps = 23/238 (9%)
Query: 1431 KEVEEITNGTQTITMNDLYQSANKRSENGFNIPSISDLIKQNNITAKTDYPTTSTFPEVK 1490
E + I+N K EN TS ++K
Sbjct: 2438 NETKRISNVDAFTNKILQDIDNEKNKENN---------NMNAEKIDDLIENVTSHNEKIK 2488
Query: 1491 YTNSPSDNGGRTATPDTSFTTKLTNLFNRSSSNDTEAENTFKRAISAMVQQSPHLLEKNK 1550
++ R KL N +++N+ + + ++ + + K
Sbjct: 2489 SELLIINDALRRVKEKKDEMNKLFNSLTENNNNNNNSAKNIVDNSTYIINELESHVSKLN 2548
Query: 1551 ELNTPSGGELSFEEKHSMAAKSKNETTPTIKTEMRKIISSSDYQGLEEEEEEEVMIIEIK 1610
EL + E+ E + K KI S + E E +E E +
Sbjct: 2549 ELLSYIDNEIKELENEKLKLLEKA-----------KIEESRKERERIESETQEDNTDEEQ 2597
Query: 1611 EDLQQQQTVEPEQNKQAFPPHPDQLAQPGDDVTSLIRPQPGPVRGHTNSSDLSRSTNA 1668
+ QQQ+ ++ E+ ++A + + T +N S N
Sbjct: 2598 INRQQQERLQKEEEQKA---YSQERLNREVSGTDDTNKNHNTGHDESNYGRYSNKRND 2652
>gnl|CDD|216209 pfam00946, Mononeg_RNA_pol, Mononegavirales RNA dependent RNA
polymerase. Members of the Mononegavirales including the
Paramyxoviridae, like other non-segmented negative strand
RNA viruses, have an RNA-dependent RNA polymerase
composed of two subunits, a large protein L and a
phosphoprotein P. This is a protein family of the L
protein. The L protein confers the RNA polymerase
activity on the complex. The P protein acts as a
transcription factor.
Length = 1065
Score = 32.6 bits (75), Expect = 2.4
Identities = 33/152 (21%), Positives = 48/152 (31%), Gaps = 13/152 (8%)
Query: 1443 ITMNDLYQSANKR--SENGFNIPSISDLIKQNNITAKTDYPTTSTFPEVKYTNSPSDNGG 1500
+ND N R G +P I L N K + E K PS N
Sbjct: 26 YLLNDKTLLRNIRWDRIRGEELPLIEHLRTLQNWLNKW----LLSLGEYKVVPYPSINPE 81
Query: 1501 RTATPDTSFTTKLTNLFNRSSSNDTEAENTFKRAISAMVQQSPHLLEKNKELNTPSGGEL 1560
D +++L L + E + + L K+K T
Sbjct: 82 LLRIVDPHDSSQLEELLKLV---NRELSIISSKVEDFLDDLGYKLKGKDKLFLTSLAPLC 138
Query: 1561 -SFEEKHSMAAKSKN---ETTPTIKTEMRKII 1588
S + H++ T TIKTE+R +I
Sbjct: 139 NSILDLHAVMLILNWYDFSTWFTIKTELRSLI 170
>gnl|CDD|172376 PRK13855, PRK13855, type IV secretion system protein VirB10;
Provisional.
Length = 376
Score = 32.2 bits (73), Expect = 2.5
Identities = 13/56 (23%), Positives = 25/56 (44%), Gaps = 1/56 (1%)
Query: 1292 SQQSSVSQSPSSTMISQQSSPFNVKPTE-KTEPPPPHKLNQVNTSSAPKEKPRTSE 1346
+ +P STMI+ + PF+ P + +PP + Q + + +P +E
Sbjct: 52 RSKKENEPAPPSTMIATNTKPFHPAPIDVPPDPPAAQEAVQPTAPPSAQSEPERNE 107
>gnl|CDD|235903 PRK06991, PRK06991, ferredoxin; Provisional.
Length = 270
Score = 32.1 bits (73), Expect = 2.5
Identities = 19/81 (23%), Positives = 26/81 (32%), Gaps = 4/81 (4%)
Query: 1815 ARRAFNELTQIYPTAPNTKSASDVTRAETKTAVAAQPAPQNTEVD--QTDKNT--AAAAE 1870
AR + A+ A A A A D + K AAA E
Sbjct: 157 ARHDARQARLRREREAAEARAAARAAASAAAAAAEASAAAAPAADDAEAKKRAIIAAALE 216
Query: 1871 KKEKKAPRIAAKFKTPLNLKN 1891
+ KK +AA+ P N +
Sbjct: 217 RARKKKEELAAQGAGPKNTEG 237
>gnl|CDD|185514 PTZ00212, PTZ00212, T-complex protein 1 subunit beta; Provisional.
Length = 533
Score = 32.3 bits (74), Expect = 2.9
Identities = 20/63 (31%), Positives = 30/63 (47%), Gaps = 16/63 (25%)
Query: 743 HKIENAKVKVFQQPSRNESIIIVQDKVQVPSLKQVA--------------DQIL--GKTI 786
++EN K+ V P + I I KV+V S+++VA D+IL G +
Sbjct: 232 KRLENCKILVANTPMDTDKIKIYGAKVKVDSMEKVAEIEAAEKEKMKNKVDKILAHGCNV 291
Query: 787 FIN 789
FIN
Sbjct: 292 FIN 294
>gnl|CDD|233569 TIGR01776, TonB-tbp-lbp, TonB-dependent lactoferrin and transferrin
receptors. This family of TonB-dependent receptors are
responsible for import of iron from the mammalian iron
carriers lactoferrin and transferrin across the outer
membrane. These receptors are found only in bacteria
which can infect mammals such as Moraxella, Mannheimia,
Neisseria, Actinobacillus, Pasteurella, Haemophilus and
Histophilus species [Transport and binding proteins,
Cations and iron carrying compounds, Transport and
binding proteins, Porins].
Length = 932
Score = 32.5 bits (74), Expect = 2.9
Identities = 24/88 (27%), Positives = 35/88 (39%), Gaps = 13/88 (14%)
Query: 1685 PSVPYQQQQQYQYYYNQQQPMYYPQQVAYPPPPYQQARHPQYSKPPHPILNASSQPVPPD 1744
S+ QY Y Y+ Q QV ++Q Y+ + +L P P
Sbjct: 272 ASLDCAPCAQYGYVYDNQLSDEEKAQVLASRYRHEQVSAKDYTG-ENRLL-----PNP-- 323
Query: 1745 TTEDLEERADSYLAKL-LRLTRAHNLGA 1771
L+ R+DS+LAKL L H +G
Sbjct: 324 ----LDYRSDSWLAKLGYHLNNRHYVGG 347
>gnl|CDD|189016 cd09609, M3B_PepF_4, Peptidase family M3B Oligopeptidase F (PepF).
Peptidase family M3B Oligopeptidase F (PepF;
Pz-peptidase B; EC 3.4.24.-) is mostly bacterial and
includes oligoendopeptidase F from Lactococcus lactis.
This enzyme hydrolyzes peptides containing between 7 and
17 amino acids with fairly broad specificity. The PepF
gene is duplicated in L. lactis on the plasmid that
bears it, while a shortened second copy is found in
Bacillus subtilis. Most bacterial PepFs are cytoplasmic
endopeptidases; however, the PepF Bacillus
amyloliquefaciens oligopeptidase is a secreted protein
and may facilitate the process of sporulation.
Specifically, the yjbG gene encoding the homolog of the
PepF1 and PepF2 oligoendopeptidases of Lactococcus
lactis has been identified in Bacillus subtilis as an
inhibitor of sporulation initiation when over expressed
from a multicopy plasmid.
Length = 586
Score = 32.2 bits (74), Expect = 2.9
Identities = 37/147 (25%), Positives = 51/147 (34%), Gaps = 41/147 (27%)
Query: 407 NLKRFEKFLTKLASFDFEQFREKYADIQ-YLEGKMGKKFSDVFHKEK---SKEMDPDLAA 462
L FE L L E+ E+ + YLE D+ K+ S E++ LAA
Sbjct: 101 KLSFFESELLALDEKILEEVIEELPEYAAYLE--------DLLRKKPHTLSPEVEEALAA 152
Query: 463 L-------------IKATDDDYD----DGTEEVLNYANYEEYYYSDEDEEWRTNFFSEFG 505
L K D ++ DG L++ +E Y D D E R F F
Sbjct: 153 LSPVLDAPYNIYETTKLADMTFEDFEVDGKTYPLSFVLFENEYEYDPDTEVRRAAFKSF- 211
Query: 506 QHKT--DYYH-----YKLDYAKVTNEV 525
KT Y + Y +V E
Sbjct: 212 -SKTLRKYQNTTAATY---ATQVQKEK 234
>gnl|CDD|222449 pfam13908, Shisa, Wnt and FGF inhibitory regulator. Shisa is a
transcription factor-type molecule that physically
interacts with immature forms of the Wnt receptor
Frizzled and the FGF receptor within the endoplasmic
reticulum to inhibit their post-translational maturation
and trafficking to the cell surface.
Length = 177
Score = 31.3 bits (71), Expect = 3.0
Identities = 20/73 (27%), Positives = 24/73 (32%), Gaps = 4/73 (5%)
Query: 1653 VRGHTNSSDLSRSTNAQPRGIPILGSQAAMYNPSVPYQQQQQYQYYYNQQQPMYYPQQVA 1712
R + ST Q +P S PS P Q Q Y Q P +
Sbjct: 107 CRPQRPVMTRATSTTVQTTPLPQPPS----TAPSYPGPQYQGYHPMPPQPGMPAPPYSLQ 162
Query: 1713 YPPPPYQQARHPQ 1725
YPPP Q + P
Sbjct: 163 YPPPGLLQPQGPP 175
>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
Length = 3151
Score = 32.6 bits (74), Expect = 3.1
Identities = 24/127 (18%), Positives = 39/127 (30%), Gaps = 12/127 (9%)
Query: 1621 PEQNKQAFPPHPDQLAQPGDDVTSLIRPQPGPV--RGHTNSSDLSRSTNAQPRGIPILGS 1678
P + PP P P + + P G V R + S + A+P + +
Sbjct: 2831 PTSAQPTAPPPPPGPPPPSLPLGGSVAPG-GDVRRRPPSRSPAAKPAAPARPPVRRL--A 2887
Query: 1679 QAAMYNPSVPYQQQQQYQYYYNQQQPMYYPQQVAYPPPPYQQARHPQYSKPPHPILNASS 1738
+ A+ + + ++ P P P PQ PP P
Sbjct: 2888 RPAVSRSTESFALPPDQP----ERPPQPQAPPPPQPQPQPPPPPQPQPPPPPPPRPQP-- 2941
Query: 1739 QPVPPDT 1745
P+ P T
Sbjct: 2942 -PLAPTT 2947
>gnl|CDD|202096 pfam02029, Caldesmon, Caldesmon.
Length = 431
Score = 31.9 bits (72), Expect = 3.5
Identities = 21/104 (20%), Positives = 38/104 (36%), Gaps = 11/104 (10%)
Query: 1546 LEKNKELNTPSGGELSFEEKHSMAAKSKNETTPTIKTEMRKIISSSDYQGLEEEEEEEVM 1605
LE+ KE P+ + S E + T++ E + EE EE E +
Sbjct: 78 LERQKEFK-PTSTDQSLSEPSRRMQEDSGAENETVEEEE----KEESREEREEVEETEGV 132
Query: 1606 IIEIKEDL------QQQQTVEPEQNKQAFPPHPDQLAQPGDDVT 1643
+++ Q++ EPE ++ P G+ +T
Sbjct: 133 TKSEQKNDWRDAEECQKEEKEPEPEEEEKPKRGSLEENNGEFMT 176
>gnl|CDD|237030 PRK12270, kgd, alpha-ketoglutarate decarboxylase; Reviewed.
Length = 1228
Score = 32.2 bits (74), Expect = 3.5
Identities = 15/60 (25%), Positives = 17/60 (28%)
Query: 1827 PTAPNTKSASDVTRAETKTAVAAQPAPQNTEVDQTDKNTAAAAEKKEKKAPRIAAKFKTP 1886
AP +A+ A A A AP AA K A AA P
Sbjct: 41 TAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPAAPP 100
Score = 31.4 bits (72), Expect = 6.0
Identities = 16/64 (25%), Positives = 19/64 (29%), Gaps = 4/64 (6%)
Query: 1827 PTAPNTKSASDVTRAE----TKTAVAAQPAPQNTEVDQTDKNTAAAAEKKEKKAPRIAAK 1882
PTA +A+ + K A PAP AAAA A AA
Sbjct: 44 PTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAPAAPPAAA 103
Query: 1883 FKTP 1886
Sbjct: 104 AAAA 107
>gnl|CDD|165164 PHA02813, PHA02813, hypothetical protein; Provisional.
Length = 354
Score = 31.8 bits (72), Expect = 3.7
Identities = 26/96 (27%), Positives = 41/96 (42%), Gaps = 21/96 (21%)
Query: 1445 MNDLYQSANKRSENGFNIPS-ISDLIKQNNIT----AKTDYPTTST------------FP 1487
++D+++ K+ F P ISD+I N IT K D+ +
Sbjct: 66 LDDIFKVIRKKLLLSFEFPQKISDIILDNTITLIKYEKGDFFNNHRDFIHFKSKNCYCYH 125
Query: 1488 EVKYTNSPSDNGGRTA---TPDTSFTTKLTNLFNRS 1520
V Y N+ S GG T +T F+TK LF+++
Sbjct: 126 LVLYLNNTS-KGGNTNIHIKDNTIFSTKNDVLFDKT 160
>gnl|CDD|237862 PRK14948, PRK14948, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 620
Score = 31.9 bits (73), Expect = 3.9
Identities = 29/175 (16%), Positives = 51/175 (29%), Gaps = 20/175 (11%)
Query: 1714 PPPPYQQARHPQYSKPPHPILNASSQPVPPDTTEDLEERADSYLAKL-LRLTRAHNLGAP 1772
PPP P ++P P + + +LEE LAKL L TR L
Sbjct: 406 PPPAKASPPIPVPAEPTEPSPTPPANAANAPPSLNLEELWQQILAKLELPSTRML-LSQQ 464
Query: 1773 DYTFSMDASKLYVATVRICHPHSDIVVRGKPCAKKADSVEICA--RRAFNELTQIY---- 1826
S+D+++ A + + +V ++ + + ++
Sbjct: 465 AELVSLDSNR---AVIAVSPNWLGMV--------QSRKPLLEQAFAKVLGRSIKLNLESQ 513
Query: 1827 -PTAPNTKSASDVTRAETKTAVAAQPAPQNTEVDQTDKNTAAAAEKKEKKAPRIA 1880
+A NT + P PQ T + + A
Sbjct: 514 SGSASNTAKTPPPPQKSPPPPAPTPPLPQPTATAPPPTPPPPPPTATQASSNAPA 568
>gnl|CDD|235585 PRK05733, PRK05733, single-stranded DNA-binding protein; Provisional.
Length = 172
Score = 31.1 bits (70), Expect = 3.9
Identities = 19/49 (38%), Positives = 22/49 (44%), Gaps = 1/49 (2%)
Query: 1669 QPRGIPILGSQAAMYNPSVPYQQQQQYQYYYNQQQPMYYPQQVAYPPPP 1717
+P+G G YN S P QQ Q+ Q QQQ PQQ A P
Sbjct: 114 RPQGDDQGGQGGGNYNQSAPRQQAQRPQ-QAAQQQSRPAPQQPAPQPAA 161
>gnl|CDD|221581 pfam12446, DUF3682, Protein of unknown function (DUF3682). This
domain family is found in eukaryotes, and is typically
between 125 and 136 amino acids in length.
Length = 133
Score = 30.2 bits (68), Expect = 4.4
Identities = 21/75 (28%), Positives = 33/75 (44%), Gaps = 10/75 (13%)
Query: 1558 GELSFEEKHSMAAKSKNETTPTIKTEMRKIISSSDYQGLEEEEEEEVMIIEIKEDLQQQQ 1617
G+ S ++ AA + N + P S + + EEEEEEE E+ +QQQ
Sbjct: 61 GDSSTGDQSPAAAAAHNSSPPEGPAGT---TSGTGHTRQEEEEEEE-------ENEKQQQ 110
Query: 1618 TVEPEQNKQAFPPHP 1632
+ E + + HP
Sbjct: 111 SDEAQVQQHQQHEHP 125
>gnl|CDD|213343 cd05395, RasGAP_RASA4, Ras-GTPase Activating Domain of RASA4. Ras
GTPase activating-like 4 protein (RASAL4), also known as
Ca2+ -promoted Ras inactivator (CAPRI), is a member of
the GAP1 family. Members of the GAP1 family are
characterized by a conserved domain structure comprising
N-terminal tandem C2 domains, a highly conserved central
RasGAP domain, and a C-terminal pleckstrin-homology
domain that is associated with a Bruton's tyrosine
kinase motif. RASAL4, like RASAL, is a cytosolic protein
that undergoes a rapid translocation to the plasma
membrane in response to a receptor-mediated elevation in
the concentration of intracellular free Ca2+ ([Ca2+]i).
However, unlike RASAL, RASAL4 does not sense
oscillations in [Ca2+]i.
Length = 287
Score = 31.4 bits (71), Expect = 4.7
Identities = 17/41 (41%), Positives = 23/41 (56%), Gaps = 1/41 (2%)
Query: 418 LASFDFEQFREKYADIQYLEGKMGKKFSDVFHKEKSKEMDP 458
LAS E F K A +QYL +G + VF ++K E+DP
Sbjct: 84 LASKSMESFL-KVAGMQYLHSVLGPTINRVFEEKKYVELDP 123
>gnl|CDD|172341 PRK13808, PRK13808, adenylate kinase; Provisional.
Length = 333
Score = 31.0 bits (70), Expect = 6.0
Identities = 19/87 (21%), Positives = 31/87 (35%), Gaps = 14/87 (16%)
Query: 1801 GKPCAKKADSVEICARRAFNELTQIYPTAPNTKSASDVTRAETKTAVAAQPAPQN----- 1855
K AK + + + + + + KTAV+A+ A +
Sbjct: 199 AKKAAKTPAAKSGAKKAS---------AKAKSAAKKVSKKKAAKTAVSAKKAAKTAAKAA 249
Query: 1856 TEVDQTDKNTAAAAEKKEKKAPRIAAK 1882
+ +T K A K KKA + AAK
Sbjct: 250 KKAKKTAKKALKKAAKAVKKAAKKAAK 276
>gnl|CDD|177328 PHA01929, PHA01929, putative scaffolding protein.
Length = 306
Score = 30.8 bits (69), Expect = 6.8
Identities = 22/76 (28%), Positives = 28/76 (36%), Gaps = 8/76 (10%)
Query: 1670 PRGIPILGSQ---AAMYNPSVPYQQQQQYQYYYNQQQPMYYPQQVAYPPPPYQQARHPQY 1726
P G+ L + AA P Q Q Q PQQ+A P Q P
Sbjct: 9 PPGLAGLVANVPPAAAPTPQPNPVIQPQAPVQPGQPGA---PQQLAIPTQ--QPQPVPTS 63
Query: 1727 SKPPHPILNASSQPVP 1742
+ PH + A +QP P
Sbjct: 64 AMTPHVVQQAPAQPAP 79
>gnl|CDD|185594 PTZ00395, PTZ00395, Sec24-related protein; Provisional.
Length = 1560
Score = 31.2 bits (70), Expect = 6.8
Identities = 37/225 (16%), Positives = 78/225 (34%), Gaps = 31/225 (13%)
Query: 1274 NSNLQVTSPYHVKVSRCNSQQSSVSQSPSSTMISQQSSPFNV-KPTEKTEPPPPHKLNQV 1332
++ V Y ++QQS++ P ++ N +P E+ E PH +
Sbjct: 103 DNRAHVGRAYGAGACEQSNQQSNI---PIGDPVNHLRGHPNFGEPRERAEDAAPHAQHNH 159
Query: 1333 NTSSAPKEKPRTSEQNQIKSNGNEQKVIINKEERCIFSNATYSIPTTQNSELFSTGGMSF 1392
+ + P + +Q +++ + + + + P+ ++ G
Sbjct: 160 SGKTNGDNPPTGGQYHQSGGTSRNHQMMDSNKNCPADALFNETNPSGEHKRNSIDG---- 215
Query: 1393 FGYPNLLDIPENKNKLLTGEITANKMETKPEKATCPQDKEVEEIT----NGTQTITM--- 1445
DIP + I + E +K Q K+ T N +T+
Sbjct: 216 -------DIPSDI------YIDSQPNEGDVQKTNPWQGKQGNSATSPPANENNAVTLSCS 262
Query: 1446 NDLYQSANKRSENGFNIPSISDL---IKQNNITAKTDYPTTSTFP 1487
ND + A+ +E+G+ S++ +NI + P +T
Sbjct: 263 NDQQRGASSAAESGYAHHRGSNIASHTPNDNIMHAANNPLNNTND 307
>gnl|CDD|172947 PRK14472, PRK14472, F0F1 ATP synthase subunit B; Provisional.
Length = 175
Score = 30.2 bits (68), Expect = 6.9
Identities = 24/78 (30%), Positives = 39/78 (50%), Gaps = 8/78 (10%)
Query: 1528 ENTFKRAISAMVQQSPHLLEKNKELNTPSGGELS--FEEKHSMAAKSKNETTPTIKTEMR 1585
+++ RA SA ++ +L KN+EL + E E A K + E T TE +
Sbjct: 55 QSSIDRAHSAK-DEAEAILRKNRELLAKADAEADKIIREGKEYAEKLRAEITEKAHTEAK 113
Query: 1586 KIISSSDYQGLEEEEEEE 1603
K+I+S+ +EE E+E
Sbjct: 114 KMIASA-----KEEIEQE 126
>gnl|CDD|219927 pfam08601, PAP1, Transcription factor PAP1. The transcription factor
Pap1 regulates antioxidant-gene transcription in response
to H2O2. This region is cysteine rich. Alkylation of
cysteine residues following treatment with a cysteine
alkylating agent can mask the accessibility of the
nuclear exporter Crm1, triggering nuclear accumulation
and Pap1 dependent transcriptional expression.
Length = 344
Score = 30.7 bits (69), Expect = 7.2
Identities = 18/114 (15%), Positives = 37/114 (32%), Gaps = 5/114 (4%)
Query: 1281 SPYHVKVSR-CNSQQSSVSQSPSSTMISQQSSPF-NVKPTEKTEPPPPHKLNQVNTSSAP 1338
+ K+ C ++ + + + S SSP + + ++ P+ TS +
Sbjct: 27 PKFCGKLPGACGTKNCPIPKLAKN---SSVSSPVPGLLNSTESNVSSPNNNPNGYTSPSS 83
Query: 1339 KEKPRTSEQNQIKSNGNEQKVIINKEERCIFSNATYSIPTTQNSELFSTGGMSF 1392
S + + N N S Y+ +S+ S+G F
Sbjct: 84 AAMNNKSNNRAVDPSANASAASTNSPNGLQSSATQYNSNDNSSSDSPSSGSDGF 137
>gnl|CDD|169051 PRK07667, PRK07667, uridine kinase; Provisional.
Length = 193
Score = 30.1 bits (68), Expect = 7.5
Identities = 20/59 (33%), Positives = 27/59 (45%), Gaps = 12/59 (20%)
Query: 464 IKATDDDYDDGTEEVLNYANYEEYYYSDEDEEW-RTNFFSEFGQHKTD----YYHYKLD 517
I + Y G EE + EYYY D EW R FF + Q++T +YH + D
Sbjct: 57 IVERNKRYHTGFEE------WYEYYYLQWDIEWLRQKFFRKL-QNETKLTLPFYHDETD 108
>gnl|CDD|217298 pfam02948, Amelogenin, Amelogenin. Amelogenins play a role in
biomineralisation. They seem to regulate the formation of
crystallites during the secretory stage of tooth enamel
development. thought to play a major role in the
structural organisation and mineralisation of developing
enamel. They are found in the extracellular matrix.
Mutations in X-chromosomal amelogenin can cause
Amelogenesis imperfecta.
Length = 174
Score = 29.9 bits (67), Expect = 7.5
Identities = 23/76 (30%), Positives = 27/76 (35%), Gaps = 11/76 (14%)
Query: 1673 IPILGSQAAM-----YNPSVPYQQQQQYQYYYNQQQPMYYPQQVAYPPPPYQQARHPQYS 1727
+ IL Q M ++P VP QQ + Q P P P QQ H Q
Sbjct: 74 LLILPPQQPMMPVPGHHPMVPMTGQQ--PHLQPPAQHPLQPTYGQNPQP--QQPTHTQ-- 127
Query: 1728 KPPHPILNASSQPVPP 1743
P P A QP P
Sbjct: 128 PPVQPQQPADPQPGQP 143
>gnl|CDD|222010 pfam13254, DUF4045, Domain of unknown function (DUF4045). This
presumed domain is functionally uncharacterized. This
domain family is found in bacteria and eukaryotes, and is
typically between 384 and 430 amino acids in length.
Length = 414
Score = 31.0 bits (70), Expect = 7.8
Identities = 15/68 (22%), Positives = 27/68 (39%), Gaps = 2/68 (2%)
Query: 1281 SPYHVKVSRCNSQQSSVSQSPSSTMISQQSSPFNVKPTEKTEPPPPHKLNQVNTSSAPKE 1340
P + Q +S+M +++SS KP E +P P K ++T+ +
Sbjct: 224 IPDLPSSRDSEKTKPEKPQQETSSMDTEKSS--APKPRETLDPKSPEKAPPIDTTEEELK 281
Query: 1341 KPRTSEQN 1348
P S +
Sbjct: 282 SPEASPKE 289
>gnl|CDD|235322 PRK04950, PRK04950, ProP expression regulator; Provisional.
Length = 213
Score = 30.3 bits (69), Expect = 8.1
Identities = 13/50 (26%), Positives = 19/50 (38%)
Query: 1841 AETKTAVAAQPAPQNTEVDQTDKNTAAAAEKKEKKAPRIAAKFKTPLNLK 1890
E K V AQ A Q + + A ++ K P+ K + P K
Sbjct: 104 EEAKAKVQAQRAEQQAKKREAAGEKEKAPRRERKPKPKAPRKKRKPRAQK 153
>gnl|CDD|217461 pfam03261, CDK5_activator, Cyclin-dependent kinase 5 activator
protein.
Length = 314
Score = 30.6 bits (69), Expect = 8.2
Identities = 19/96 (19%), Positives = 39/96 (40%), Gaps = 2/96 (2%)
Query: 1267 KENAPLVNSNLQVTSPYHVKVSR-CNSQQSSVSQSPSSTMISQQSSPFNVKPTEKTEPPP 1325
K+ + +NL +S + + + + SV + +T+ S + PP
Sbjct: 68 KKTSKKSTANLASSSYQKIVIQLNRENLKKSVPRPSDTTLSCANLSSLREPSPGQPPAPP 127
Query: 1326 PHKLNQVNTSSAPKEKPRTSEQNQIKSNGNEQKVII 1361
P L+ N + P +K S + G+ ++VI+
Sbjct: 128 PSVLSGKNANCIPSQKNAPSIAIT-STGGSPRRVIV 162
>gnl|CDD|236275 PRK08486, PRK08486, single-stranded DNA-binding protein; Provisional.
Length = 182
Score = 29.8 bits (67), Expect = 8.3
Identities = 10/46 (21%), Positives = 13/46 (28%), Gaps = 3/46 (6%)
Query: 1678 SQAAMYNPSVPYQQQQQYQYYYNQQ---QPMYYPQQVAYPPPPYQQ 1720
+ N Y Q Y Q Q Y + PP Y+
Sbjct: 123 HENFNNNYPGNYNNPSQDPYMNQAQSYNQNAYAKENQQAQPPKYEN 168
>gnl|CDD|223061 PHA03369, PHA03369, capsid maturational protease; Provisional.
Length = 663
Score = 30.7 bits (69), Expect = 8.3
Identities = 16/111 (14%), Positives = 27/111 (24%), Gaps = 12/111 (10%)
Query: 1665 STNAQPRGIPILGSQAAMYNPS--VPYQQQQQYQYYYNQQQPMYYPQQV------AYPPP 1716
A + ++ + P+ Q+ Y + PM V
Sbjct: 358 RVLAAAAKVAVIAAPQTHTGPADRQRPQRPDGIPYSVPARSPMTAYPPVPQFCGDPGLVS 417
Query: 1717 PYQQARHPQYSKPPHPILNASSQPVPPDTTEDLEERADSYLAKLLRLTRAH 1767
PY P S P P+ QP + + + H
Sbjct: 418 PYNPQS-PGTSYGPEPVGPVPPQPT---NPYVMPISMANMVYPGHPQEHGH 464
>gnl|CDD|219392 pfam07382, HC2, Histone H1-like nucleoprotein HC2. This family
contains the bacterial histone H1-like nucleoprotein HC2
(approximately 200 residues long), which seems to be
found mostly in Chlamydia. HC2 functions in DNA
condensation, although it has been suggested that it also
has other roles.
Length = 187
Score = 30.1 bits (67), Expect = 8.7
Identities = 19/85 (22%), Positives = 27/85 (31%), Gaps = 2/85 (2%)
Query: 1802 KPCAKKADSVEICARRAFNELTQIYPTAPNTKSASDVTRAETKTAVAAQPAPQNTEVDQT 1861
KP AKKA + + R+ + T A+ A K A A +
Sbjct: 22 KPAAKKAAAKKTVVRKVAAKKPAARKTVAKKTVAAKKPAA--KKAAKKAVAKKVVAKKPV 79
Query: 1862 DKNTAAAAEKKEKKAPRIAAKFKTP 1886
K A +K A + KT
Sbjct: 80 AKKAVAKKATAKKVAAKKVVAKKTV 104
>gnl|CDD|234683 PRK00191, tatA, twin arginine translocase protein A; Provisional.
Length = 84
Score = 28.4 bits (63), Expect = 9.1
Identities = 14/44 (31%), Positives = 17/44 (38%), Gaps = 6/44 (13%)
Query: 1682 MYNPSVPYQQQQQYQYYYNQQQPMYYPQQVAYPPPPYQQARHPQ 1725
M +QQQQ Q Q P Q+ P P +QQ Q
Sbjct: 45 MSKDDQTPEQQQQ------QPQQQIAPNQIEAPQPNFQQHYQGQ 82
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.316 0.133 0.392
Gapped
Lambda K H
0.267 0.0647 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 96,526,810
Number of extensions: 9655777
Number of successful extensions: 10046
Number of sequences better than 10.0: 1
Number of HSP's gapped: 9597
Number of HSP's successfully gapped: 142
Length of query: 1891
Length of database: 10,937,602
Length adjustment: 111
Effective length of query: 1780
Effective length of database: 6,014,308
Effective search space: 10705468240
Effective search space used: 10705468240
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 66 (29.4 bits)