RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy12777
(697 letters)
>gnl|CDD|214473 smart00020, Tryp_SPc, Trypsin-like serine protease. Many of these
are synthesised as inactive precursor zymogens that are
cleaved during limited proteolysis to generate their
active forms. A few, however, are active as single chain
molecules, and others are inactive due to substitutions
of the catalytic triad residues.
Length = 229
Score = 158 bits (401), Expect = 3e-44
Identities = 62/146 (42%), Positives = 85/146 (58%), Gaps = 7/146 (4%)
Query: 390 RIVGGEKATFGKWPWQISLRQWIRSTYLHKCGAALFNENWAVTAAHCVEDVPPSDLLLRL 449
RIVGG +A G +PWQ+SL+ H CG +L + W +TAAHCV PS++ +RL
Sbjct: 1 RIVGGSEANIGSFPWQVSLQ---YGGGRHFCGGSLISPRWVLTAAHCVRGSDPSNIRVRL 57
Query: 450 GEHDLSTEEEPYGYQERRVQIVASHPQFDPRTFEYDLALLRFYEPVKFQPNIIPICVPE- 508
G HDLS+ EE Q +V V HP ++P T++ D+ALL+ EPV N+ PIC+P
Sbjct: 58 GSHDLSSGEE---GQVIKVSKVIIHPNYNPSTYDNDIALLKLKEPVTLSDNVRPICLPSS 114
Query: 509 DDTNFVGTSAHVTGWGRLYEGRFRRS 534
+ GT+ V+GWGR EG
Sbjct: 115 NYNVPAGTTCTVSGWGRTSEGAGSLP 140
Score = 73.5 bits (181), Expect = 1e-14
Identities = 23/50 (46%), Positives = 30/50 (60%), Gaps = 3/50 (6%)
Query: 562 RIVGGEKATFGKWPWQISLRQWIRSTYLHKCGAALFNENWAVTAAHCVED 611
RIVGG +A G +PWQ+SL+ H CG +L + W +TAAHCV
Sbjct: 1 RIVGGSEANIGSFPWQVSLQ---YGGGRHFCGGSLISPRWVLTAAHCVRG 47
Score = 53.1 bits (128), Expect = 1e-07
Identities = 21/59 (35%), Positives = 33/59 (55%), Gaps = 4/59 (6%)
Query: 631 W-KMALADGPLPSVLQEVSVPVINNSLCETMYRAAGFIEHIPEIFICAGWRKGSFDSCE 688
W + + G LP LQEV+VP+++N+ C RA I + +CAG +G D+C+
Sbjct: 129 WGRTSEGAGSLPDTLQEVNVPIVSNATCR---RAYSGGGAITDNMLCAGGLEGGKDACQ 184
>gnl|CDD|238113 cd00190, Tryp_SPc, Trypsin-like serine protease; Many of these are
synthesized as inactive precursor zymogens that are
cleaved during limited proteolysis to generate their
active forms. Alignment contains also inactive enzymes
that have substitutions of the catalytic triad residues.
Length = 232
Score = 154 bits (392), Expect = 5e-43
Identities = 61/140 (43%), Positives = 84/140 (60%), Gaps = 6/140 (4%)
Query: 391 IVGGEKATFGKWPWQISLRQWIRSTYLHKCGAALFNENWAVTAAHCVEDVPPSDLLLRLG 450
IVGG +A G +PWQ+SL+ + H CG +L + W +TAAHCV PS+ +RLG
Sbjct: 1 IVGGSEAKIGSFPWQVSLQ---YTGGRHFCGGSLISPRWVLTAAHCVYSSAPSNYTVRLG 57
Query: 451 EHDLSTEEEPYGYQERRVQIVASHPQFDPRTFEYDLALLRFYEPVKFQPNIIPICVPEDD 510
HDLS+ E G Q +V+ V HP ++P T++ D+ALL+ PV N+ PIC+P
Sbjct: 58 SHDLSSNEG--GGQVIKVKKVIVHPNYNPSTYDNDIALLKLKRPVTLSDNVRPICLPSSG 115
Query: 511 TNF-VGTSAHVTGWGRLYEG 529
N GT+ V+GWGR EG
Sbjct: 116 YNLPAGTTCTVSGWGRTSEG 135
Score = 71.2 bits (175), Expect = 9e-14
Identities = 22/49 (44%), Positives = 30/49 (61%), Gaps = 3/49 (6%)
Query: 563 IVGGEKATFGKWPWQISLRQWIRSTYLHKCGAALFNENWAVTAAHCVED 611
IVGG +A G +PWQ+SL+ + H CG +L + W +TAAHCV
Sbjct: 1 IVGGSEAKIGSFPWQVSLQ---YTGGRHFCGGSLISPRWVLTAAHCVYS 46
Score = 50.4 bits (121), Expect = 7e-07
Identities = 22/52 (42%), Positives = 33/52 (63%), Gaps = 3/52 (5%)
Query: 637 DGPLPSVLQEVSVPVINNSLCETMYRAAGFIEHIPEIFICAGWRKGSFDSCE 688
GPLP VLQEV+VP+++N+ C+ Y G I + +CAG +G D+C+
Sbjct: 135 GGPLPDVLQEVNVPIVSNAECKRAYSYGGT---ITDNMLCAGGLEGGKDACQ 183
>gnl|CDD|215708 pfam00089, Trypsin, Trypsin.
Length = 218
Score = 118 bits (297), Expect = 3e-30
Identities = 59/154 (38%), Positives = 85/154 (55%), Gaps = 15/154 (9%)
Query: 391 IVGGEKATFGKWPWQISLRQWIRSTYLHKCGAALFNENWAVTAAHCVEDVPPSDLLLRLG 450
IVGG++A G +PWQ+SL+ S+ H CG +L +ENW +TAAHCV + + + LG
Sbjct: 1 IVGGDEAQPGSFPWQVSLQ---VSSGKHFCGGSLISENWVLTAAHCVSN--AKSVRVVLG 55
Query: 451 EHDLSTEEEPYGYQERRVQIVASHPQFDPRTFEYDLALLRFYEPVKFQPNIIPICVPEDD 510
H++ E G Q+ V+ V HP ++P T D+ALL+ PV + PIC+P
Sbjct: 56 AHNIVLREG--GEQKFDVKKVIVHPNYNPDTD-NDIALLKLKSPVTLGDTVRPICLPTAS 112
Query: 511 TNF-VGTSAHVTGWGRLYEGRFRRSYGHPATRQE 543
++ VGT+ V+GWG G P T QE
Sbjct: 113 SDLPVGTTCTVSGWGNTKTL------GLPDTLQE 140
Score = 59.8 bits (145), Expect = 5e-10
Identities = 25/49 (51%), Positives = 35/49 (71%), Gaps = 3/49 (6%)
Query: 563 IVGGEKATFGKWPWQISLRQWIRSTYLHKCGAALFNENWAVTAAHCVED 611
IVGG++A G +PWQ+SL+ S+ H CG +L +ENW +TAAHCV +
Sbjct: 1 IVGGDEAQPGSFPWQVSLQ---VSSGKHFCGGSLISENWVLTAAHCVSN 46
Score = 34.3 bits (79), Expect = 0.12
Identities = 18/48 (37%), Positives = 25/48 (52%), Gaps = 7/48 (14%)
Query: 640 LPSVLQEVSVPVINNSLCETMYRAAGFIEHIPEIFICAGWRKGSFDSC 687
LP LQEV+VPV++ C + Y + + ICAG G D+C
Sbjct: 134 LPDTLQEVTVPVVSRETCRSAYGGT-----VTDNMICAGA--GGKDAC 174
>gnl|CDD|227927 COG5640, COG5640, Secreted trypsin-like serine protease
[Posttranslational modification, protein turnover,
chaperones].
Length = 413
Score = 57.2 bits (138), Expect = 1e-08
Identities = 32/145 (22%), Positives = 53/145 (36%), Gaps = 9/145 (6%)
Query: 388 SSRIVGGEKATFGKWPWQISLRQWIRSTYLHK-CGAALFNENWAVTAAHCVEDVPPSDLL 446
SSRI+GG A G++P ++L I CG + + +TAAHC + P
Sbjct: 30 SSRIIGGSNANAGEYPSLVALVDRISDYVSGTFCGGSKLGGRYVLTAAHCADASSPISSD 89
Query: 447 LRLGEHDLSTEEEPYGYQERRVQIVASHPQFDPRTFEYDLALLRFYEPVKFQPNIIPICV 506
+ DL+ + + V+ + H + P D+A+L + +P
Sbjct: 90 VNRVVVDLNDSSQ---AERGHVRTIYVHEFYSPGNLGNDIAVLELA-----RAASLPRVK 141
Query: 507 PEDDTNFVGTSAHVTGWGRLYEGRF 531
VT + G F
Sbjct: 142 ITSFDASDTFLNSVTTVSPMTNGTF 166
Score = 49.1 bits (117), Expect = 6e-06
Identities = 18/53 (33%), Positives = 27/53 (50%), Gaps = 1/53 (1%)
Query: 560 SSRIVGGEKATFGKWPWQISLRQWIRSTYLHK-CGAALFNENWAVTAAHCVED 611
SSRI+GG A G++P ++L I CG + + +TAAHC +
Sbjct: 30 SSRIIGGSNANAGEYPSLVALVDRISDYVSGTFCGGSKLGGRYVLTAAHCADA 82
>gnl|CDD|226406 COG3889, COG3889, Predicted solute binding protein [General
function prediction only].
Length = 872
Score = 47.5 bits (113), Expect = 2e-05
Identities = 34/144 (23%), Positives = 56/144 (38%), Gaps = 16/144 (11%)
Query: 222 SSSTIEEETNMIG-GGTAYGT-STTPSLVTW-----TTVDEIPVIPPDRTRPPPLVTTIK 274
SS +T IG T YG+ P+ W T ++ ++ P + T+
Sbjct: 712 SSEAKNPDTVKIGQALTVYGSLEVFPAGENWGFIPTTKRVKVRIMDPASGTGTSITTSG- 770
Query: 275 APPSTVASPVTSSTESWVQISLTTLSFPPPQQNVTTTVVTTKPYPETTTTTEKTEPSPST 334
T P S T++ +S + S T+ V+ +T T T+ T SPS
Sbjct: 771 --TFTAEVP-QSPTKTETTLSYSAYSNTSILIETTSVVI-----TKTVTQTQTTTSSPSP 822
Query: 335 VYETSSMSPSSPKPSPTTSTVSTT 358
TS S+ + T+ + +TT
Sbjct: 823 TQTTSPTQTSTSTTTTTSPSQTTT 846
Score = 41.4 bits (97), Expect = 0.002
Identities = 30/106 (28%), Positives = 44/106 (41%), Gaps = 10/106 (9%)
Query: 42 TTTVVTTKPYPETTTTTEKTEPSPSTVYETSSMSPSSPKPSPTTSTVSTTAFIDESNEIE 101
TT V + + T S + E P SP + T +T+S +A+ + S IE
Sbjct: 747 TTKRVKVRIMDPASGTGTSITTSGTFTAEV----PQSP--TKTETTLSYSAYSNTSILIE 800
Query: 102 SQGINMSNYKEDVTTTVVTTKPYPETTITTEKTEPSPSTVYETSSS 147
+ + ++ T TT P T TT T+ S ST TS S
Sbjct: 801 TTSVVITK----TVTQTQTTTSSPSPTQTTSPTQTSTSTTTTTSPS 842
Score = 38.7 bits (90), Expect = 0.013
Identities = 17/56 (30%), Positives = 27/56 (48%), Gaps = 1/56 (1%)
Query: 37 PIRNVTTTVVTTKPYPETTTTTEKTEPSPSTVYETSSMSPSSPKPSPTTSTVSTTA 92
I TT+VV TK +T TTT P+ +T T + + ++ SP+ +T
Sbjct: 796 SILIETTSVVITKTVTQTQTTTSSPSPTQTTS-PTQTSTSTTTTTSPSQTTTGGGI 850
Score = 30.2 bits (68), Expect = 5.0
Identities = 21/90 (23%), Positives = 30/90 (33%), Gaps = 23/90 (25%)
Query: 42 TTTVVTTKPYPETTTTTEKTEPSPSTVYETSSMSPSSPKPSPTTSTVSTTAFIDESNEIE 101
T T ++ Y T+ E T + + + SSP P+ TTS T+
Sbjct: 783 TETTLSYSAYSNTSILIETTSVVITKTVTQTQTTTSSPSPTQTTSPTQTS---------- 832
Query: 102 SQGINMSNYKEDVTTTVVTTKPYPETTITT 131
T+T TT P TT
Sbjct: 833 -------------TSTTTTTSPSQTTTGGG 849
Score = 29.8 bits (67), Expect = 5.6
Identities = 22/103 (21%), Positives = 32/103 (31%), Gaps = 18/103 (17%)
Query: 238 AYGTSTTPSLVTWTTVDEIP--------VIPPDRTRPPPLVTTIKAPPSTVASPVTSSTE 289
GTS T S V + P T T++ + + T+S+
Sbjct: 761 GTGTSITTSGTFTAEVPQSPTKTETTLSYSAYSNTSILIETTSVVITKTVTQTQTTTSSP 820
Query: 290 SWVQISLTTLSFPPPQQNVTTTVVTTKPYPETTTTTEKTEPSP 332
+ P Q T+T TT P + TTT P
Sbjct: 821 -------SPTQTTSPTQTSTSTTTTTSP---SQTTTGGGICGP 853
Score = 29.4 bits (66), Expect = 8.0
Identities = 22/101 (21%), Positives = 35/101 (34%), Gaps = 20/101 (19%)
Query: 41 VTTTVVTTKPYPETTTTTEKTEPSPSTVYETSSMSPSSPKPSPTTSTVSTTAFIDESNEI 100
+TT+ T P++ T TE T + + + +S + T + TT
Sbjct: 766 ITTSGTFTAEVPQSPTKTETTLSYSAYSNTSILIETTSVVITKTVTQTQTTT-------- 817
Query: 101 ESQGINMSNYKEDVTTTVVTTKPYPETTITTEKTEPSPSTV 141
T T T+ +T TT T PS +T
Sbjct: 818 ---------SSPSPTQTTSPTQT---STSTTTTTSPSQTTT 846
Score = 29.4 bits (66), Expect = 9.1
Identities = 27/138 (19%), Positives = 42/138 (30%), Gaps = 29/138 (21%)
Query: 247 LVTWTTVDEIPVIPPDRTRPPPLVTTIKAPPSTVASPVTSSTESWVQISLTTLSFPPPQQ 306
+ +T + P V +A + V + E+W I T
Sbjct: 698 WLPYTNSLYKATTLSSEAKNPDTVKIGQALTVYGSLEVFPAGENWGFIPTTKRVKVRIMD 757
Query: 307 NV--TTTVVTTKPYPETTTTTEKTEPSPST-------------VYETSSMSP-------- 343
T T +TT T T P+ + + ET+S+
Sbjct: 758 PASGTGTSITTS---GTFTAEVPQSPTKTETTLSYSAYSNTSILIETTSVVITKTVTQTQ 814
Query: 344 ---SSPKPSPTTSTVSTT 358
SSP P+ TTS T+
Sbjct: 815 TTTSSPSPTQTTSPTQTS 832
>gnl|CDD|215130 PLN02217, PLN02217, probable pectinesterase/pectinesterase
inhibitor.
Length = 670
Score = 40.1 bits (93), Expect = 0.004
Identities = 27/94 (28%), Positives = 40/94 (42%), Gaps = 3/94 (3%)
Query: 242 STTPSLVTWTTVDEIPVIPPDRTRPPPLVTTIKAPPSTVASPVTSSTESWVQISLTTLSF 301
S S T+++ V+ P + P + + A PS + SP TS S + TT S
Sbjct: 576 SAASSNTTFSSDSPSTVVAPSTSPPAGHLGSPPATPSKIVSPSTSPPASHLGSPSTTPSS 635
Query: 302 PPPQQNVTTTVVTTKPYPETTTTTEKTEPSPSTV 335
P V +T + PE++ TE S S V
Sbjct: 636 PESSIKVASTETAS---PESSIKVASTESSVSMV 666
Score = 30.8 bits (69), Expect = 3.2
Identities = 27/102 (26%), Positives = 40/102 (39%), Gaps = 6/102 (5%)
Query: 42 TTTVVTTKPYPETTTTTEKTEPSPSTVYETSSMSPSSPKPSP--TTSTVSTTAFIDESNE 99
T + T TT + SPSTV S+ P+ SP T S + + + ++
Sbjct: 569 TNSTPTGSAASSNTTFSSD---SPSTVVAPSTSPPAGHLGSPPATPSKIVSPSTSPPASH 625
Query: 100 IESQGINMSNYKEDVTTTVVTTKPYPETTITTEKTEPSPSTV 141
+ S S+ E T PE++I TE S S V
Sbjct: 626 LGSPSTTPSS-PESSIKVASTETASPESSIKVASTESSVSMV 666
>gnl|CDD|114270 pfam05539, Pneumo_att_G, Pneumovirinae attachment membrane
glycoprotein G.
Length = 408
Score = 39.3 bits (91), Expect = 0.005
Identities = 45/168 (26%), Positives = 54/168 (32%), Gaps = 17/168 (10%)
Query: 192 SCCK---IAGETDVDISEPINNFIHNTNHIDEHSSSTIEEETNMIGGGTAYGTSTTPSLV 248
SCCK A T S P +H S T + + G TA S
Sbjct: 163 SCCKEPKTAVTTSKTTSWP-----TEVSHPTYPSQVTPQSQPATQGHQTATANQRLSSTE 217
Query: 249 TWTTVDE-IPVIPPDRTRPPP----LVTTIKAPPSTVASPVTSSTESWVQISLTTLSFPP 303
T P +T PPP + + PPST + ST Q PP
Sbjct: 218 PVGTQGTTTSSNPEPQTEPPPSQRGPSGSPQHPPSTTSQ--DQSTTGDGQEHTQRRKTPP 275
Query: 304 PQQNVTTTVVTTKPYPETTTTTEKTEPSPSTVYETSSMSPSSPKPSPT 351
N + T P P TT E P+P T S S P SP
Sbjct: 276 ATSNRRSPHSTATP-PPTTKRQETGRPTPRPTATTQS-GSSPPHSSPP 321
Score = 30.0 bits (67), Expect = 4.6
Identities = 29/105 (27%), Positives = 37/105 (35%), Gaps = 20/105 (19%)
Query: 270 VTTIKAPPSTVASPVTSSTESWVQISL--------TTLSFPPPQQNVTTT----VVTTKP 317
V+ K P + V TS T SW T S P Q + T T + +T+P
Sbjct: 162 VSCCKEPKTAVT---TSKTTSWPTEVSHPTYPSQVTPQSQPATQGHQTATANQRLSSTEP 218
Query: 318 YPETTTTT-----EKTEPSPSTVYETSSMSPSSPKPSPTTSTVST 357
TTT +TEP PS + S S ST
Sbjct: 219 VGTQGTTTSSNPEPQTEPPPSQRGPSGSPQHPPSTTSQDQSTTGD 263
>gnl|CDD|222077 pfam13365, Trypsin_2, Trypsin-like peptidase domain. This family
includes trypsin like peptidase domains.
Length = 138
Score = 37.2 bits (86), Expect = 0.007
Identities = 28/179 (15%), Positives = 49/179 (27%), Gaps = 52/179 (29%)
Query: 426 NENWAVTAAHCVEDVPPSDLLLRLGEHDLSTEEEPYGYQERRVQIVASHPQFDPRTFEYD 485
++ +T AH VED S++ + L ++VA+ P + D
Sbjct: 8 SDGLILTNAHVVEDADASEIEVVL-----------PDGGRVPAEVVAADP-------DLD 49
Query: 486 LALLRFYEPVKFQPNIIPI-CVPEDDTNFVGTSAHVTGWGRLYEGRFRRSYGHPATRQEM 544
LALL+ P+ ++ P + V G G G
Sbjct: 50 LALLKVDGPLLPAAPLLASSAAPLGGSVVVVGGPGGIGLGASGGGGGVGGLVSG------ 103
Query: 545 ATCWNHFLGNRILFPSSRIVGGEKATFGKWPWQISLRQWIRSTYLHKCGAALFNENWAV 603
+ R + + G S G +F+ + V
Sbjct: 104 ---SLGGVDGRYILTDADTSPG------------------SS------GGPVFDADGEV 135
>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
Length = 3151
Score = 39.2 bits (91), Expect = 0.009
Identities = 18/102 (17%), Positives = 29/102 (28%)
Query: 251 TTVDEIPVIPPDRTRPPPLVTTIKAPPSTVASPVTSSTESWVQISLTTLSFPPPQQNVTT 310
T P T PP AP + +T + + S +L P +
Sbjct: 2750 TPGGPARPARPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPA 2809
Query: 311 TVVTTKPYPETTTTTEKTEPSPSTVYETSSMSPSSPKPSPTT 352
V+ + P P++ T+ P P P
Sbjct: 2810 AVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLP 2851
Score = 34.1 bits (78), Expect = 0.33
Identities = 21/111 (18%), Positives = 30/111 (27%), Gaps = 6/111 (5%)
Query: 242 STTPSLVTWTTVDEIPVIPPDRTRPPPLVTTIKAPPSTVA-SPVTSSTESWVQISLTTLS 300
+ P+ V P P PPP APP P + V
Sbjct: 2805 ADPPAAVLAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRR 2864
Query: 301 FPPPQQNVTTTVVTTKPYPETTTTTEKTEPSPSTVYETSSMSPSSPKPSPT 351
PP + P P+ S E+ ++ P P+ P
Sbjct: 2865 RPPSRS--PAAKPAAPARPPVRRL---ARPAVSRSTESFALPPDQPERPPQ 2910
Score = 34.1 bits (78), Expect = 0.35
Identities = 19/98 (19%), Positives = 26/98 (26%), Gaps = 11/98 (11%)
Query: 260 PPDRTRPPPLVTTIKAPPSTVASP-VTSSTESWVQISLTTLSFPPPQQNVTTTVVTTKPY 318
PP R+ + P +A P V+ STES+ P PQ P
Sbjct: 2866 PPSRSPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERPPQPQ----------APP 2915
Query: 319 PETTTTTEKTEPSPSTVYETSSMSPSSPKPSPTTSTVS 356
P P P P+ +
Sbjct: 2916 PPQPQPQPPPPPQPQPPPPPPPRPQPPLAPTTDPAGAG 2953
Score = 33.4 bits (76), Expect = 0.48
Identities = 26/126 (20%), Positives = 43/126 (34%), Gaps = 16/126 (12%)
Query: 245 PSLVTWTTVDEIPVIPPDRTRPPPLVTTIKAPPSTVASPVTSSTESWVQISLTTLSFP-- 302
P+ + +P P + P + APP+ A P T + TT P
Sbjct: 2710 PAPHALVSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAP 2769
Query: 303 -PPQQNVTTTV-VTTKPYPETTTTTEKTEPSPS------------TVYETSSMSPSSPKP 348
PP T+P + + + ++ PSP + SP+ P P
Sbjct: 2770 APPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLP 2829
Query: 349 SPTTST 354
PT++
Sbjct: 2830 PPTSAQ 2835
>gnl|CDD|218439 pfam05109, Herpes_BLLF1, Herpes virus major outer envelope
glycoprotein (BLLF1). This family consists of the BLLF1
viral late glycoprotein, also termed gp350/220. It is
the most abundantly expressed glycoprotein in the viral
envelope of the Herpesviruses and is the major antigen
responsible for stimulating the production of
neutralising antibodies in vivo.
Length = 830
Score = 39.0 bits (90), Expect = 0.010
Identities = 51/285 (17%), Positives = 95/285 (33%), Gaps = 14/285 (4%)
Query: 35 TSPIRNVTTTVVTTKPYPETTTTTEKTEPSPSTVYETSSMS-PSSPKPSPTTSTVSTTAF 93
TS N T+ T+ TT+KT +P+ T + ++ SP T T S
Sbjct: 493 TSATPNATSPTPAVTTPNATSPTTQKTSDTPNATSPTPIVIGVTTTATSPPTGTTSVPNA 552
Query: 94 IDESNEIESQGINMSNYKEDVTTTVVTTKPYPE---TTITTEKTEPSPSTVYETSSSGRN 150
ES N + +V+T+ T + +P + ++ +
Sbjct: 553 TSPQVTEESPVNNTNTPVVTSAPSVLTSAVTTGQHGTGSSPTSQQPGIPSSSHSTPRSNS 612
Query: 151 IRHLPCVSRRTGETGLCMFAFSCAKANGTHLGTCIDRFYFGSCCKIAGETDVDISEPINN 210
P ++ G + + + + TH+ T G+ +++G + S
Sbjct: 613 TSTTPLLTSAHPTGGENITEETPSVPSTTHVSTLSPGPGPGTTSQVSGPGNSSTS-RYPG 671
Query: 211 FIHNT----NHIDEHSSSTIEEETNMIGGGTAYGTSTTPSLVTWTTVDEIPVIPP----- 261
+H T N S+ ++T + + G + + + T + P
Sbjct: 672 EVHVTEGMPNPNATSPSAPSGQKTAVPTVTSTGGKANSTTKETSGSTLMASTSPHTNEGA 731
Query: 262 DRTRPPPLVTTIKAPPSTVASPVTSSTESWVQISLTTLSFPPPQQ 306
RT P T + S+ P + T V T+ PP Q
Sbjct: 732 FRTTPYNATTYLPPSTSSKLRPRWTFTSPPVTTKQATVPVPPTQH 776
Score = 36.3 bits (83), Expect = 0.069
Identities = 28/161 (17%), Positives = 49/161 (30%), Gaps = 15/161 (9%)
Query: 211 FIHNTNHIDEHSSSTIEEETNMIGGGTAYG------TSTTPSLVTWTTVDEIP-----VI 259
F++ H++ H ++ + ++ T TS TP+ T +T+ E
Sbjct: 434 FVYTLVHVEPHKTTAVPTTPSLPPASTGPTVSTADPTSGTPTGTTSSTLPEDTSPTSRTT 493
Query: 260 PPDRTRPPPLVTTIKAPPSTVASPVTSSTESWVQISLTTLSFPPPQQNVTT-TVVTTKPY 318
P SP T T + T T+ TT
Sbjct: 494 SATPNATSPTPAV---TTPNATSPTTQKTSDTPNATSPTPIVIGVTTTATSPPTGTTSVP 550
Query: 319 PETTTTTEKTEPSPSTVYETSSMSPSSPKPSPTTSTVSTTA 359
T+ + P +T + +PS + TT T +
Sbjct: 551 NATSPQVTEESPVNNTNTPVVTSAPSVLTSAVTTGQHGTGS 591
Score = 35.9 bits (82), Expect = 0.086
Identities = 34/132 (25%), Positives = 52/132 (39%), Gaps = 6/132 (4%)
Query: 241 TSTTPSLVTWTTVDEIPVIPPDRTRPPPLVTTIKAPPSTVASPVTSSTESWVQISLTTLS 300
TTP+ + TT P+ T P P+V +T SP T +T S
Sbjct: 505 AVTTPNATSPTTQKTSD--TPNATSPTPIVIG---VTTTATSPPTGTTSVPNATSPQVTE 559
Query: 301 FPPPQQNVTTTVVTTKPYPETTTTTEKTEPSPSTVYETSSMSPSSPKPSPTTSTVSTTAF 360
P N T VVT+ P T+ T + S+ PSS +P +++ STT
Sbjct: 560 -ESPVNNTNTPVVTSAPSVLTSAVTTGQHGTGSSPTSQQPGIPSSSHSTPRSNSTSTTPL 618
Query: 361 IDESNEIESQGI 372
+ ++ + I
Sbjct: 619 LTSAHPTGGENI 630
Score = 35.5 bits (81), Expect = 0.11
Identities = 28/107 (26%), Positives = 46/107 (42%)
Query: 34 DTSPIRNVTTTVVTTKPYPETTTTTEKTEPSPSTVYETSSMSPSSPKPSPTTSTVSTTAF 93
+ SP+ N T VVT+ P T+ T + S+ PSS +P +++ STT
Sbjct: 559 EESPVNNTNTPVVTSAPSVLTSAVTTGQHGTGSSPTSQQPGIPSSSHSTPRSNSTSTTPL 618
Query: 94 IDESNEIESQGINMSNYKEDVTTTVVTTKPYPETTITTEKTEPSPST 140
+ ++ + I TT V T P P T++ + P S+
Sbjct: 619 LTSAHPTGGENITEETPSVPSTTHVSTLSPGPGPGTTSQVSGPGNSS 665
Score = 32.8 bits (74), Expect = 0.71
Identities = 30/144 (20%), Positives = 46/144 (31%), Gaps = 7/144 (4%)
Query: 219 DEHSSSTIEEETNMIGGGTAYGTSTTPSLVTWTTVDEIPVIPPDRTRPPPLVTTIKAPPS 278
D S + TTPSL +T V D T P TT P
Sbjct: 427 DTTKSVIFVYTLVHVEPHKTTAVPTTPSLPPASTGP--TVSTADPTSGTPTGTTSSTLPE 484
Query: 279 TVASPVTSSTESWVQISLTTLSFPPPQQNVTTTVVTTKPYPETTTTTEKTEPSP----ST 334
SP + +T + + T + P TT T+ T+ T +
Sbjct: 485 -DTSPTSRTTSATPNATSPTPAVTTPNATSPTTQKTSDTPNATSPTPIVIGVTTTATSPP 543
Query: 335 VYETSSMSPSSPKPSPTTSTVSTT 358
TS + +SP+ + + +T
Sbjct: 544 TGTTSVPNATSPQVTEESPVNNTN 567
>gnl|CDD|217310 pfam02993, MCPVI, Minor capsid protein VI. This minor capsid
protein may act as a link between the external capsid
and the internal DNA-protein core. The C-terminal 11
residues may function as a protease cofactor leading to
enzyme activation.
Length = 238
Score = 35.9 bits (83), Expect = 0.040
Identities = 20/80 (25%), Positives = 27/80 (33%), Gaps = 6/80 (7%)
Query: 254 DEIPVIPPDRTRPPPLVTTIKAPPSTVASPVTSS-TESWVQISLTTLSFPP-PQQNVTTT 311
+P P PP TIK P+ V PV S + TL PP PQ
Sbjct: 140 VLVPAAPE----PPSYEETIKPGPAPVEEPVDSMAIAVPAIDTPVTLELPPAPQPPPPVV 195
Query: 312 VVTTKPYPETTTTTEKTEPS 331
+ + ++T S
Sbjct: 196 PQPSTMVVHRRSRIKRTRSS 215
Score = 32.9 bits (75), Expect = 0.43
Identities = 17/104 (16%), Positives = 27/104 (25%), Gaps = 7/104 (6%)
Query: 261 PDRTRPPPLVTTIKAPPSTVASPVTSSTESWVQISLTTLSFPPPQQNVTTTVVTTKPYPE 320
P P T+ P + E L + PP T + P P
Sbjct: 110 LGEEEPAPQEETVADPIQALQPRPRPDVEE----VLVPAAPEPPSYEET---IKPGPAPV 162
Query: 321 TTTTTEKTEPSPSTVYETSSMSPSSPKPSPTTSTVSTTAFIDES 364
P+ + P +P+P P +T +
Sbjct: 163 EEPVDSMAIAVPAIDTPVTLELPPAPQPPPPVVPQPSTMVVHRR 206
>gnl|CDD|178806 PRK00030, minC, septum formation inhibitor; Provisional.
Length = 292
Score = 35.8 bits (82), Expect = 0.053
Identities = 21/93 (22%), Positives = 40/93 (43%), Gaps = 10/93 (10%)
Query: 265 RPPPLVTTIKAPPSTVASPVTSSTESWVQISLTTLSFPPPQQNVTTTVVTTKPYPETTTT 324
R P ++ T APP+ VA+PV S E+ + + PQ + + E
Sbjct: 106 RAPQVIDT--APPNDVATPVPSVPEA----TAEAAAKAGPQDDEADG----EQADEAPAH 155
Query: 325 TEKTEPSPSTVYETSSMSPSSPKPSPTTSTVST 357
++ P+ + T + P++ P +++ V T
Sbjct: 156 NPESVPTRAARETTEANRPTATPPQSSSALVIT 188
>gnl|CDD|237019 PRK11907, PRK11907, bifunctional 2',3'-cyclic nucleotide
2'-phosphodiesterase/3'-nucleotidase precursor protein;
Reviewed.
Length = 814
Score = 36.0 bits (83), Expect = 0.072
Identities = 18/95 (18%), Positives = 31/95 (32%), Gaps = 9/95 (9%)
Query: 41 VTTTVVTTKPYPETTTTTEKTEPSPSTVYETSSMSPSSPKPSPTTSTVSTTAFIDESNEI 100
+ P T+T E+T P E+ + + +P +T + A
Sbjct: 25 LAQAEEIVTTTPATSTEAEQTTPV-----ESDATEEADNTETPVAATTAAEAPSSSETAE 79
Query: 101 ESQGINMSNYKEDVTTTVVTTKPYPETTITTEKTE 135
S + + TTT P T T++ E
Sbjct: 80 TSDPTSEAT----DTTTSEARTVTPAATETSKPVE 110
Score = 35.2 bits (81), Expect = 0.15
Identities = 15/83 (18%), Positives = 29/83 (34%), Gaps = 7/83 (8%)
Query: 288 TESWVQISLTTLSFPPPQQNVTTTVVTTKPYPETTTTTEKTEPSPSTVYETSSMSPSSPK 347
++S V ++L L+ + P T+T E+T P E+ + +
Sbjct: 7 SKSAVALTLALLTASN--PKLAQAEEIVTTTPATSTEAEQTTPV-----ESDATEEADNT 59
Query: 348 PSPTTSTVSTTAFIDESNEIESQ 370
+P +T + A S
Sbjct: 60 ETPVAATTAAEAPSSSETAETSD 82
Score = 32.5 bits (74), Expect = 0.94
Identities = 19/100 (19%), Positives = 34/100 (34%), Gaps = 11/100 (11%)
Query: 275 APPSTVASPVTSSTESWVQISLTTLSFPPPQQNVTTTVVTTKPYPETTTTTEKTEPSPST 334
A +P TS+ T + ++ T TT PS S
Sbjct: 28 AEEIVTTTPATSTEAE----QTTPVESDATEEADNTET-------PVAATTAAEAPSSSE 76
Query: 335 VYETSSMSPSSPKPSPTTSTVSTTAFIDESNEIESQGINM 374
ETS + + + + + T A + S +E Q +++
Sbjct: 77 TAETSDPTSEATDTTTSEARTVTPAATETSKPVEGQTVDV 116
Score = 32.5 bits (74), Expect = 0.96
Identities = 18/87 (20%), Positives = 30/87 (34%), Gaps = 9/87 (10%)
Query: 268 PLVTTIKAPPSTVASPVTSSTESWVQISLTTLSFPPPQQNVTTTVVTTKPYPETTTTTEK 327
+VTT A + T+ ES P TT P ++ T E
Sbjct: 30 EIVTTTPATSTEAEQ--TTPVESDATEEADNTETPVA----ATTAAEA---PSSSETAET 80
Query: 328 TEPSPSTVYETSSMSPSSPKPSPTTST 354
++P+ T+S + + + TS
Sbjct: 81 SDPTSEATDTTTSEARTVTPAATETSK 107
Score = 31.0 bits (70), Expect = 2.3
Identities = 17/73 (23%), Positives = 30/73 (41%), Gaps = 1/73 (1%)
Query: 35 TSPIRNVTTTVVTTKPYPETTTTTEKTEPSPSTVYETSSMSPSSPKPSPTTSTVSTTAFI 94
T+P+ + T P TT PS S ETS + + + + + T A
Sbjct: 45 TTPVESDATEEADNTETP-VAATTAAEAPSSSETAETSDPTSEATDTTTSEARTVTPAAT 103
Query: 95 DESNEIESQGINM 107
+ S +E Q +++
Sbjct: 104 ETSKPVEGQTVDV 116
>gnl|CDD|216257 pfam01034, Syndecan, Syndecan domain. Syndecans are transmembrane
heparin sulfate proteoglycans which are implicated in
the binding of extracellular matrix components and
growth factors.
Length = 207
Score = 34.7 bits (80), Expect = 0.088
Identities = 24/83 (28%), Positives = 32/83 (38%), Gaps = 7/83 (8%)
Query: 267 PPLVTTIKAPPSTVASPVTSSTESWVQISLTTLSFPPPQQNVTTTVVTTKPYPETTTTTE 326
VTT PP + + S ++ TT S +T VTT P T T E
Sbjct: 68 SEPVTTSATPPKLTTTSSSPSNDT------TTASTSTKTSPTVSTTVTTTTSPSETDTEE 121
Query: 327 KTEPSPSTVYETSSMSPSSPKPS 349
T + ST T S ++ PS
Sbjct: 122 AT-TTVSTETPTEGGSSAATDPS 143
Score = 30.1 bits (68), Expect = 3.1
Identities = 17/64 (26%), Positives = 25/64 (39%), Gaps = 7/64 (10%)
Query: 296 LTTLSFPPPQQNVTTTVVTTKPYPETTTTTEKTEPSPSTVYETSSMSPSSPKPSPTTSTV 355
+TT + PP TT P TTT T S T+ + +SP + T
Sbjct: 71 VTTSATPPKLT-------TTSSSPSNDTTTASTSTKTSPTVSTTVTTTTSPSETDTEEAT 123
Query: 356 STTA 359
+T +
Sbjct: 124 TTVS 127
Score = 29.7 bits (67), Expect = 4.4
Identities = 18/71 (25%), Positives = 27/71 (38%), Gaps = 8/71 (11%)
Query: 35 TSPIRNVTTTVVTTKPYPETTTTTEKTEPSPSTVYETSSMSPSSPKPSPTTSTVSTTAFI 94
+S N TTT T+ T +TT T SPS + + T ST + T
Sbjct: 84 SSSPSNDTTTASTSTKTSPTVSTTVTTTTSPSE-------TDTEEAT-TTVSTETPTEGG 135
Query: 95 DESNEIESQGI 105
+ S+ +
Sbjct: 136 SSAATDPSKNL 146
>gnl|CDD|178748 PLN03209, PLN03209, translocon at the inner envelope of chloroplast
subunit 62; Provisional.
Length = 576
Score = 35.3 bits (81), Expect = 0.10
Identities = 26/118 (22%), Positives = 42/118 (35%), Gaps = 8/118 (6%)
Query: 260 PPDRTRPPPLVTTIKAPPSTVASPVTSSTESWVQISLTTLSFPPPQQNVTTTVVTTKPYP 319
PP T P P T +P + S V + ++ + T + PPP + PY
Sbjct: 449 PP--TSPSPTAPTGVSPSVSSTSSVPAVPDTAPATAATDAAAPPPANMRPLS-----PYA 501
Query: 320 ETTTTTEKTEPSPSTVYETSSMSPSSPKPSPTTSTVSTTAFIDESNEIESQGINMSNY 377
T PSP S+ + ++ TA DE + + + +S Y
Sbjct: 502 VYDDLKPPTSPSP-AAPVGKVAPSSTNEVVKVGNSAPPTALADEQHHAQPKPRPLSPY 558
Score = 33.0 bits (75), Expect = 0.57
Identities = 25/111 (22%), Positives = 37/111 (33%), Gaps = 9/111 (8%)
Query: 34 DTSPIRNVTTTVVTTKPYPETTTTTEKTEP-SPSTVYETSSMSPSSPKPSPTT---STVS 89
D P + V +P +KT P SP YE P+SP P+ T +VS
Sbjct: 411 DVVPSPGSASNVPEVEPAQVEA---KKTRPLSPYARYEDLK-PPTSPSPTAPTGVSPSVS 466
Query: 90 TTAFIDESNEIESQGINMSNYKEDVTTTVVTTKPYPETTITTEKTEPSPST 140
+T+ + + + PY T PSP+
Sbjct: 467 STSSVPAVPDTAPATAATDA-AAPPPANMRPLSPYAVYDDLKPPTSPSPAA 516
Score = 33.0 bits (75), Expect = 0.62
Identities = 28/111 (25%), Positives = 43/111 (38%), Gaps = 19/111 (17%)
Query: 45 VVTTKPYPETTTTTEKTEP-----------SPSTVYETSSMSPSSPKPSPTTSTVSTTAF 93
V T PE + + EP SP T YE P+SP P+P +S+ +++
Sbjct: 342 VPTKPVTPEAPSPPIEEEPPQPKAVVPRPLSPYTAYEDLK-PPTSPIPTPPSSSPASSKS 400
Query: 94 IDESNEIESQGINMSNYKEDVTTTVVTTKPYPETTITTEKTEP-SPSTVYE 143
+D + + S + V +P +KT P SP YE
Sbjct: 401 VDAVAKPAEPDVVPS---PGSASNVPEVEPAQV---EAKKTRPLSPYARYE 445
Score = 32.6 bits (74), Expect = 0.76
Identities = 27/124 (21%), Positives = 35/124 (28%), Gaps = 14/124 (11%)
Query: 242 STTPSLVTWTTVDEIPVIPPDRTRPPPLVTTIKAPPSTVASPVTSSTESWVQISLTTLSF 301
S T +V +P P T A P S + + T
Sbjct: 454 SPTAPTGVSPSVSSTSSVPAVPDTAPATAATDAAAPPPANMRPLSPYAVYDDLKPPTSPS 513
Query: 302 P--------PPQQNVTTTVVTTKPYPETTTTTEKTEP-----SPSTVYETSSMSPSSPKP 348
P P N V + P +P SP T+YE P+SP P
Sbjct: 514 PAAPVGKVAPSSTNEVVKVGNSAPPTALADEQHHAQPKPRPLSPYTMYEDLK-PPTSPTP 572
Query: 349 SPTT 352
SP
Sbjct: 573 SPVL 576
Score = 29.5 bits (66), Expect = 7.0
Identities = 23/91 (25%), Positives = 34/91 (37%), Gaps = 22/91 (24%)
Query: 302 PPPQQNVTTTVV----TTKPYPETTTTTEKTEP-----------SPSTVYETSSMSPSSP 346
PP+++ T PE + + EP SP T YE P+SP
Sbjct: 328 VPPKESDAADGPKPVPTKPVTPEAPSPPIEEEPPQPKAVVPRPLSPYTAYEDLK-PPTSP 386
Query: 347 KPSPTTS------TVSTTAFIDESNEIESQG 371
P+P +S +V A E + + S G
Sbjct: 387 IPTPPSSSPASSKSVDAVAKPAEPDVVPSPG 417
>gnl|CDD|234504 TIGR04216, halo_surf_glyco, major cell surface glycoprotein.
Members of this family are the S-layer-forming
halobacterial major cell surface glycoprotein. The
highest scores below model cutoffs are fragmentary
paralogs to actual members of the family. Modifications
include at N-linked and O-linked glycosylation, a
C-terminal diphytanylglyceryl modification, and probable
cleavage of the PGF-CTERM tail.
Length = 782
Score = 34.9 bits (80), Expect = 0.19
Identities = 17/65 (26%), Positives = 29/65 (44%), Gaps = 3/65 (4%)
Query: 27 NFAMYKNDTSPIRNVTTTVVTTKPYPETTTTTEKTEPSPSTVYETSSMSPSSPKPSPTTS 86
N+ + +D V VV P+TTT+ + P+ +T T+ ++ PTT+
Sbjct: 689 NYTVEADDGDNTDRVNVEVVEETERPDTTTSED---PTTTTTPTTTGPEETTETAEPTTT 745
Query: 87 TVSTT 91
T T
Sbjct: 746 TEEPT 750
Score = 32.9 bits (75), Expect = 0.66
Identities = 15/51 (29%), Positives = 24/51 (47%), Gaps = 3/51 (5%)
Query: 308 VTTTVVTTKPYPETTTTTEKTEPSPSTVYETSSMSPSSPKPSPTTSTVSTT 358
V VV P+TTT+ + P+ +T T+ ++ PTT+T T
Sbjct: 703 VNVEVVEETERPDTTTSED---PTTTTTPTTTGPEETTETAEPTTTTEEPT 750
Score = 29.5 bits (66), Expect = 7.4
Identities = 17/45 (37%), Positives = 26/45 (57%), Gaps = 2/45 (4%)
Query: 314 TTKPYPETTTTTEKTEPSPSTVYETSSMSPSSPKPSPTTSTVSTT 358
TT P TTTT T P +T ET+ + ++ +P+ T+T S+T
Sbjct: 717 TTSEDPTTTTTPTTTGPEETT--ETAEPTTTTEEPTEETTTGSST 759
>gnl|CDD|139494 PRK13335, PRK13335, superantigen-like protein; Reviewed.
Length = 356
Score = 34.3 bits (78), Expect = 0.22
Identities = 20/75 (26%), Positives = 29/75 (38%), Gaps = 8/75 (10%)
Query: 287 STESWVQISLTTLSFPPPQQNVTTTVVTTKPYP---ETTTTTEKTEPSPSTVYETSSMSP 343
+ E S P Q+ + ++ P P ++ TTTE T P + P
Sbjct: 82 TNEEKTSASKIEKISQPKQEEQKSLNISATPAPKQEQSQTTTESTTPKTKV-----TTPP 136
Query: 344 SSPKPSPTTSTVSTT 358
S+ P P ST S T
Sbjct: 137 STNTPQPMQSTKSDT 151
>gnl|CDD|114205 pfam05467, Herpes_U47, Herpesvirus glycoprotein U47.
Length = 627
Score = 34.1 bits (77), Expect = 0.32
Identities = 37/133 (27%), Positives = 50/133 (37%), Gaps = 12/133 (9%)
Query: 241 TSTTPSLVTWTTVDEIPVIPPDRTRPPPLVTTIKAPPSTVASPVTSSTESWVQISLTTLS 300
T TP++ TT + I I P P+ T P T+ T TE +
Sbjct: 285 TPNTPTITAQTT-ESIKKIVKRSDFPRPMYTPTDIPTLTIRLNATIKTEQNTENPTENPK 343
Query: 301 FPPPQQNV-TTTVVTTKPYPETTTTTEKTEPSPSTVYET-------SSMSPSSPKPS--- 349
PP N TT+ + + T T T+ ST + T + SSPK S
Sbjct: 344 SPPKPTNFENTTIRIPETFESATVATNATQKIESTTFATTIGIEEINDNIYSSPKNSIYL 403
Query: 350 PTTSTVSTTAFID 362
+ S STT F D
Sbjct: 404 KSKSQQSTTKFTD 416
>gnl|CDD|233987 TIGR02733, desat_CrtD, C-3',4' desaturase CrtD. Members of this
family are slr1293, a carotenoid biosynthesis protein
which was shown to be the C-3',4' desaturase (CrtD) of
myxoxanthophyll biosynthesis in Synechocystis sp. strain
PCC 6803, and close homologs (presumed to be
functionally equivalent) from other cyanobacteria, where
myxoxanthophyll biosynthesis is either known or
expected. This enzyme can act on neurosporene and so
presumably catalyzes the first step that is committed to
myxoxanthophyll [Biosynthesis of cofactors, prosthetic
groups, and carriers, Other].
Length = 492
Score = 33.5 bits (77), Expect = 0.37
Identities = 15/40 (37%), Positives = 20/40 (50%), Gaps = 5/40 (12%)
Query: 427 ENWAVTAAHCVEDVPPSDLLLRLGEHDLSTEEEPYGYQER 466
E+ V A V ++PP LL LG L P GY++R
Sbjct: 277 EDLNVKADDVVANLPPQSLLELLGPLGL-----PPGYRKR 311
>gnl|CDD|216421 pfam01299, Lamp, Lysosome-associated membrane glycoprotein (Lamp).
Length = 305
Score = 32.8 bits (75), Expect = 0.56
Identities = 34/124 (27%), Positives = 48/124 (38%), Gaps = 17/124 (13%)
Query: 241 TSTTPSLVTWTTVDEIPVIPPDRTRPPPLVTTIKAPPSTVASPVTSSTESWVQISLTTLS 300
S T ++ D + P+ T V T+ + T A P ++ +S TT+
Sbjct: 1 YSVTELTFSYNLSDTT--LFPNATSKG--VKTVTSSTDTKA-PTNTTYRC---VSSTTV- 51
Query: 301 FPPPQQNVTTTVVTTKPY------PETTTTTEKTEPSPSTVYETSSMSPSSPKPSPTTST 354
P VT VT + Y +T T E PSP+TV S SP+ SP
Sbjct: 52 -PMTNVTVTLHDVTLQAYLSNGTFSKTETRCEADTPSPTTVATPSP-SPTPVPSSPAVGN 109
Query: 355 VSTT 358
S T
Sbjct: 110 YSVT 113
>gnl|CDD|233045 TIGR00601, rad23, UV excision repair protein Rad23. All proteins
in this family for which functions are known are
components of a multiprotein complex used for targeting
nucleotide excision repair to specific parts of the
genome. In humans, Rad23 complexes with the XPC protein.
This family is based on the phylogenomic analysis of JA
Eisen (1999, Ph.D. Thesis, Stanford University) [DNA
metabolism, DNA replication, recombination, and repair].
Length = 378
Score = 32.9 bits (75), Expect = 0.58
Identities = 14/65 (21%), Positives = 24/65 (36%), Gaps = 2/65 (3%)
Query: 295 SLTTLSFPPPQQNVTTTVVTTKPYPETTTTTEKTEPSPSTVYETSSMSPSSPKPSPTTST 354
+ T PP T+ T P P + + + S V E S S+ +P + +
Sbjct: 79 TGTGKVAPPAAT--PTSAPTPTPSPPASPASGMSAAPASAVEEKSPSEESATATAPESPS 136
Query: 355 VSTTA 359
S +
Sbjct: 137 TSVPS 141
Score = 32.9 bits (75), Expect = 0.62
Identities = 23/91 (25%), Positives = 34/91 (37%), Gaps = 10/91 (10%)
Query: 42 TTTVVTTKPYPETTTTTEKTEPSPSTVYETSSMSPSSPKP--SPTTSTVSTTAFIDESNE 99
T T P T+ T P++ S +P+S SP+ + + TA S
Sbjct: 79 TGTGKVAPPAATPTSAPTPTPSPPASPASGMSAAPASAVEEKSPSEESATATAPESPSTS 138
Query: 100 IESQGINMSNYKEDVTTTVVTTKPYPETTIT 130
+ S G D +T+V ETTI
Sbjct: 139 VPSSG-------SDAASTLVVGSER-ETTIE 161
Score = 30.2 bits (68), Expect = 4.2
Identities = 16/70 (22%), Positives = 28/70 (40%), Gaps = 2/70 (2%)
Query: 309 TTTVVTTKPYPETTTTTEKTEPSPSTVYETSSMSPSSPKP--SPTTSTVSTTAFIDESNE 366
T T P T+ T P++ S +P+S SP+ + + TA S
Sbjct: 79 TGTGKVAPPAATPTSAPTPTPSPPASPASGMSAAPASAVEEKSPSEESATATAPESPSTS 138
Query: 367 IESQGINMSN 376
+ S G + ++
Sbjct: 139 VPSSGSDAAS 148
Score = 29.1 bits (65), Expect = 8.1
Identities = 19/103 (18%), Positives = 31/103 (30%), Gaps = 19/103 (18%)
Query: 264 TRPPPLVTTIKAPPSTVASPVTSSTESWVQISLTTLSFPPPQQNVTTTVVTTKPYPETTT 323
++P + P +T S T + S P + +
Sbjct: 75 SKPKTGTGKVAPPAATPTSAPTPTP-----------SPPAS--------PASGMSAAPAS 115
Query: 324 TTEKTEPSPSTVYETSSMSPSSPKPSPTTSTVSTTAFIDESNE 366
E+ PS + T+ SPS+ PS + ST E
Sbjct: 116 AVEEKSPSEESATATAPESPSTSVPSSGSDAASTLVVGSERET 158
>gnl|CDD|220749 pfam10428, SOG2, RAM signalling pathway protein. SOG2 proteins in
Saccharomyces cerevisiae are involved in cell separation
and cytokinesis.
Length = 419
Score = 32.8 bits (75), Expect = 0.61
Identities = 28/144 (19%), Positives = 45/144 (31%), Gaps = 14/144 (9%)
Query: 222 SSSTIEEETNMIGGGTAYGTSTTPSLVTWTTVDEIPVIPPDRTRPPPLVTTIKAPPSTVA 281
SS + TA +S + + P R R P + P+T+
Sbjct: 146 SSLGPPLQHRKRDAVTASPSSMIARNTPISDRLRPRSVTPTRGRRPSSSPRSLSNPTTLE 205
Query: 282 SPVTSSTESWVQISLTTLSFPPPQQNVTTTVVTTKPYPETTTTTEKTEPSPSTVYETSSM 341
SP S +Q++ PPP N T+ T + S +T S
Sbjct: 206 SP------SNLQVTTDV---PPPYSNGTSRSSTMSSSANLSI-----ISSLATPRSGESF 251
Query: 342 SPSSPKPSPTTSTVSTTAFIDESN 365
+ S + + VS +E
Sbjct: 252 RSTPTSGSSSINPVSGLDEAEEDR 275
>gnl|CDD|225828 COG3291, COG3291, FOG: PKD repeat [General function prediction
only].
Length = 297
Score = 32.2 bits (73), Expect = 0.85
Identities = 20/126 (15%), Positives = 29/126 (23%), Gaps = 6/126 (4%)
Query: 27 NFAMYKNDTSPIRNVTTTVVTTKPYPETTTTTEKTEPSPSTVYETSSMSPSSPKPSPTTS 86
+F TS +N T Y T T + T S
Sbjct: 114 DFGDGGVTTSTEQNPVHTYTDAGTYTVTLTVSNSTGSD-SKTKTDYVTVSEEGIEEAVPE 172
Query: 87 TVSTTAFIDESNEIESQG-----INMSNYKEDVTTTVVTTKPYPETTITTEKTEPSPSTV 141
ST + + +++D T P I P P TV
Sbjct: 173 AASTVVTKPLTVSGTESSSGNLSSWVYVFEDDKGTNSTVKTPLLGGVIKVTLGSPLPDTV 232
Query: 142 YETSSS 147
+
Sbjct: 233 VYPTDK 238
>gnl|CDD|237862 PRK14948, PRK14948, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 620
Score = 31.9 bits (73), Expect = 1.2
Identities = 18/93 (19%), Positives = 31/93 (33%), Gaps = 7/93 (7%)
Query: 274 KAPPSTVASPVTSSTESWVQISLTTLSFPPPQQNVTTTVVTTKPYPETTTTTEKTEPSPS 333
+ + P S PP Q T T P P T T +
Sbjct: 517 ASNTAKTPPPPQKSPPPPAPT-------PPLPQPTATAPPPTPPPPPPTATQASSNAPAQ 569
Query: 334 TVYETSSMSPSSPKPSPTTSTVSTTAFIDESNE 366
++S P +P+P+ + S+ ID++ +
Sbjct: 570 IPADSSPPPPIPEEPTPSPTKDSSPEEIDKAAK 602
Score = 29.2 bits (66), Expect = 9.6
Identities = 11/85 (12%), Positives = 25/85 (29%), Gaps = 4/85 (4%)
Query: 281 ASPVTSSTESWVQISLTTLSFPPPQQNVTTTVVTTKPYPE--TTTTTEKTEPSPSTVYET 338
S S + + T + P + P + TT + + +
Sbjct: 361 PSAFISEIANASAPANPTPAPNPSPPPAPIQ--PSAPKTKQAATTPSPPPAKASPPIPVP 418
Query: 339 SSMSPSSPKPSPTTSTVSTTAFIDE 363
+ + SP P + + ++E
Sbjct: 419 AEPTEPSPTPPANAANAPPSLNLEE 443
>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger. [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 1096
Score = 31.5 bits (71), Expect = 2.1
Identities = 19/85 (22%), Positives = 32/85 (37%), Gaps = 3/85 (3%)
Query: 271 TTIKAPPSTVASPVTSSTESWVQISLTTLSFPPPQQNVTTTVVTTKPYPETTTTTEKTEP 330
+ P TV +++E V IS+ T S P + T P T+ +
Sbjct: 304 NNLTTPQGTVLEHTPATSEGQVTISIMTGSSPAETKASTAAWKIRNPLSRTSAPAVRI-- 361
Query: 331 SPSTVYETSSMSPSSPKPSPTTSTV 355
+ +T +PS+ +P T V
Sbjct: 362 ASATFRGLEK-NPSTAPSTPATPRV 385
>gnl|CDD|225689 COG3147, DedD, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 226
Score = 30.6 bits (69), Expect = 2.4
Identities = 19/91 (20%), Positives = 27/91 (29%), Gaps = 12/91 (13%)
Query: 255 EIPVIPPDRTRPPPLVTTIKA---------PPSTVASPVTSSTESWVQISLTT--LSFPP 303
+ V+ P + I+ P VA P ST + V ++ T PP
Sbjct: 60 VVQVVALPTQPPEGVAQEIQDAGDAAAASVDPQPVAQPPVESTPAGVPVAAQTPKPVKPP 119
Query: 304 PQQNVTTTVVTTKPYPETTTTTE-KTEPSPS 333
Q P PE E P+
Sbjct: 120 KQPPAGAVPAKPTPKPEPKPVAEPAAAPTGQ 150
>gnl|CDD|237864 PRK14950, PRK14950, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 585
Score = 30.9 bits (70), Expect = 2.8
Identities = 12/86 (13%), Positives = 25/86 (29%), Gaps = 7/86 (8%)
Query: 271 TTIKAPPSTVASPVTSSTESWVQISLTTLSFPPPQQNVTTTVVTTKPYPETTTTTEKTEP 330
+ P+ + T++ S V+ + + P+ + +P ET T
Sbjct: 358 ALLVPVPAPQPAKPTAAAPSPVRPTPAPST--RPKAAAAANIPPKEPVRETAT---PPPV 412
Query: 331 SPSTVYETSSMSPSSPKPSPTTSTVS 356
P V + + T
Sbjct: 413 PPRPV--APPVPHTPESAPKLTRAAI 436
Score = 29.4 bits (66), Expect = 7.4
Identities = 13/89 (14%), Positives = 17/89 (19%), Gaps = 13/89 (14%)
Query: 242 STTPSLVTWTTVDEIPVIPPDRTRPPPLVTTIKAPPSTVASPVTSSTESWVQISLTTLSF 301
+ +P T P V PP PV
Sbjct: 375 APSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRPVAPPV------------- 421
Query: 302 PPPQQNVTTTVVTTKPYPETTTTTEKTEP 330
P ++ P E T P
Sbjct: 422 PHTPESAPKLTRAAIPVDEKPKYTPPAPP 450
>gnl|CDD|183756 PRK12799, motB, flagellar motor protein MotB; Reviewed.
Length = 421
Score = 30.8 bits (69), Expect = 2.8
Identities = 39/220 (17%), Positives = 73/220 (33%), Gaps = 22/220 (10%)
Query: 161 TGETGLCMFAFSCAKANGTHLGTCIDRFYFGSCCKIAGETDV------DISEPINNFI-- 212
GE G + S +AN + G ++ G +P+N I
Sbjct: 202 RGERGYSNWELSADRANASRRELLAGGLDEGKILRVVGMASTMRLKEQASDDPVNRRISI 261
Query: 213 --------HNTNHID-EHSSSTIEEETNMIGGGTAYGTSTTPSLVTWTTVDEIPVIPPDR 263
H+ H + ++ + IE+ T + T + + V + I P
Sbjct: 262 LVLNKQSQHDIEHENLDNRALDIEKATGLKQIDTHGTVPVAAVTPS-SAVTQSSAITPSS 320
Query: 264 TRPPPLVTTIKAPPSTVASPVTSSTE--SWVQISLTTLSFPPPQQNVTTTVVTTKPYPET 321
P + + A+ +S S + + ++ P V +P P +
Sbjct: 321 AAIPSPAVIPSSVTTQSATTTQASAVALSSAGVLPSDVTLPGTVALPAAEPVNMQPQPMS 380
Query: 322 TTTTEK--TEPSPSTVYETSSMSPSSPKPSPTTSTVSTTA 359
TT T++ T ST ++ P++P + S S A
Sbjct: 381 TTETQQSSTGNITSTANGPTTSLPAAPASNIPVSPTSRDA 420
>gnl|CDD|165021 PHA02638, PHA02638, CC chemokine receptor-like protein;
Provisional.
Length = 417
Score = 30.4 bits (68), Expect = 3.2
Identities = 17/68 (25%), Positives = 32/68 (47%), Gaps = 4/68 (5%)
Query: 278 STVASPVTSSTESWVQISLTTLSFPPPQQNVTTTVVTTKPYPETTTTTEKTEPSPSTVYE 337
+T++S + SS+ +S TT + + ++++T T PSP++
Sbjct: 6 TTLSSIILSSS----TLSPTTFFTIETSMDESKSIISTFTEIIPTEIPTSESPSPNSNSS 61
Query: 338 TSSMSPSS 345
+SS S SS
Sbjct: 62 SSSSSSSS 69
Score = 29.2 bits (65), Expect = 9.1
Identities = 24/99 (24%), Positives = 42/99 (42%), Gaps = 13/99 (13%)
Query: 278 STVASPVTSSTESWVQISLTTLSFPPPQQNVTTTVVTTKPYPETTTTTEKTEPSPSTVYE 337
ST S + S+ + +S TT + + ++++T + E T T SPS
Sbjct: 5 STTLSSIILSSST---LSPTTFFTIETSMDESKSIIST--FTEIIPTEIPTSESPS---- 55
Query: 338 TSSMSPSSPKPSPTTSTVSTTAFIDESNEIESQGINMSN 376
S+ S ++S+ S T + N I + IN+ N
Sbjct: 56 ----PNSNSSSSSSSSSSSITYDYEYENNITYELINIKN 90
>gnl|CDD|218191 pfam04652, DUF605, Vta1 like. Vta1 (VPS20-associated protein 1) is
a positive regulator of Vps4. Vps4 is an ATPase that is
required in the multivesicular body (MVB) sorting
pathway to dissociate the endosomal sorting complex
required for transport (ESCRT). Vta1 promotes correct
assembly of Vps4 and stimulates its ATPase activity
through its conserved Vta1/SBP1/LIP5 region.
Length = 315
Score = 30.4 bits (69), Expect = 3.3
Identities = 15/90 (16%), Positives = 30/90 (33%)
Query: 268 PLVTTIKAPPSTVASPVTSSTESWVQISLTTLSFPPPQQNVTTTVVTTKPYPETTTTTEK 327
P + PPS+ + + S + S PP + + P T +
Sbjct: 179 PASASPSDPPSSSPGVPSFPSPPEDPSSPSDSSLPPAPSSFQSDTPPPSPESPTNPSPPP 238
Query: 328 TEPSPSTVYETSSMSPSSPKPSPTTSTVST 357
+P S+ KP+P +++ +
Sbjct: 239 GPAAPPPPPVQQVPPLSTAKPTPPSASATP 268
>gnl|CDD|165513 PHA03255, PHA03255, BDLF3; Provisional.
Length = 234
Score = 29.9 bits (66), Expect = 3.5
Identities = 30/130 (23%), Positives = 50/130 (38%), Gaps = 5/130 (3%)
Query: 241 TSTTPSLVTWTTVDEIPVIPPDRTRPPPLVTTIKAPPSTVASPVT-----SSTESWVQIS 295
TS+ S + V + T +T ++P+T S+ + V +
Sbjct: 25 TSSGSSTASAGNVTGTTAVTTPSPSASGPSTNQSTTLTTTSAPITTTAILSTNTTTVTST 84
Query: 296 LTTLSFPPPQQNVTTTVVTTKPYPETTTTTEKTEPSPSTVYETSSMSPSSPKPSPTTSTV 355
TT++ P N +T VTTK + T TE + + V + SS + T T
Sbjct: 85 GTTVTPVPTTSNASTINVTTKVTAQNITATEAGTGTSTGVTSNVTTRSSSTTSATTRITN 144
Query: 356 STTAFIDESN 365
+TT S+
Sbjct: 145 ATTLAPTLSS 154
>gnl|CDD|237874 PRK14971, PRK14971, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 614
Score = 30.1 bits (68), Expect = 4.4
Identities = 22/113 (19%), Positives = 32/113 (28%), Gaps = 5/113 (4%)
Query: 268 PLVTTIKAPPSTVASPVTSSTESWVQISLTTLSFPPPQQNVTTTVVTTKPYPETTTTTEK 327
P+ T A P A+ S + S + + P Q+ T T
Sbjct: 381 PVFTQPAAAPQPSAAAAASPSPS----QSSAAAQPSAPQSATQPAGTPPTVSVDPPAAVP 436
Query: 328 TEPSPSTVYETSSMSPSSPKPSPTTSTVSTTAFIDESNEIESQGINMSNYKEV 380
P PST + + + S VS+ E N KE
Sbjct: 437 VNP-PSTAPQAVRPAQFKEEKKIPVSKVSSLGPSTLRPIQEKAEQATGNIKEA 488
>gnl|CDD|236766 PRK10811, rne, ribonuclease E; Reviewed.
Length = 1068
Score = 30.0 bits (68), Expect = 4.9
Identities = 11/126 (8%), Positives = 32/126 (25%), Gaps = 10/126 (7%)
Query: 241 TSTTPSLVTWTTVDEIPVI----------PPDRTRPPPLVTTIKAPPSTVASPVTSSTES 290
+ P + V E PV+ + T P + + P + + +
Sbjct: 882 VVSAPVVEAVAEVVEEPVVVAEPQPEEVVVVETTHPEVIAAPVTEQPQVITESDVAVAQE 941
Query: 291 WVQISLTTLSFPPPQQNVTTTVVTTKPYPETTTTTEKTEPSPSTVYETSSMSPSSPKPSP 350
+ + + ++ T + + + ++ +P
Sbjct: 942 VAEHAEPVVEPQDETADIEEAAETAEVVVAEPEVVAQPAAPVVAEVAAEVETVTAVEPEV 1001
Query: 351 TTSTVS 356
+ V
Sbjct: 1002 APAQVP 1007
>gnl|CDD|237865 PRK14951, PRK14951, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 618
Score = 30.1 bits (68), Expect = 4.9
Identities = 18/106 (16%), Positives = 32/106 (30%), Gaps = 11/106 (10%)
Query: 251 TTVDEIPVIPPDRTRPPPLVTTIKAPPSTVASPVTSSTESWVQISLTTLSFPPPQQNVTT 310
P P P APP+ VA+P ++ + + ++ P
Sbjct: 397 AAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAAPAAVALAPAPPAQ-- 454
Query: 311 TVVTTKPYPETTTTTEKTEPSPSTVYETSSMSPSSPKPSPTTSTVS 356
PET + P P+ S+ + P+ T +
Sbjct: 455 ------AAPETVAIPVRVAPEPAV---ASAAPAPAAAPAAARLTPT 491
>gnl|CDD|235055 PRK02625, rpoC1, DNA-directed RNA polymerase subunit gamma;
Provisional.
Length = 627
Score = 30.1 bits (68), Expect = 5.4
Identities = 13/23 (56%), Positives = 14/23 (60%), Gaps = 1/23 (4%)
Query: 244 TPSLVTWTTVDEIPVIPPDRTRP 266
T S W +D IPVIPPD RP
Sbjct: 238 TGSRPEWMVLDVIPVIPPD-LRP 259
>gnl|CDD|236733 PRK10672, PRK10672, rare lipoprotein A; Provisional.
Length = 361
Score = 29.6 bits (67), Expect = 6.2
Identities = 17/90 (18%), Positives = 35/90 (38%), Gaps = 2/90 (2%)
Query: 272 TIKAPPSTVASPVTSSTESWVQ-ISLTTL-SFPPPQQNVTTTVVTTKPYPETTTTTEKTE 329
+ T + + + V +S +TL S P VT++ P E +E
Sbjct: 199 DLSGGMGTPSVQPAPAPQGDVLPVSNSTLKSEDPTGAPVTSSGFLGAPTTLAPGVLEGSE 258
Query: 330 PSPSTVYETSSMSPSSPKPSPTTSTVSTTA 359
P+P+ + +P++ P ++ S +
Sbjct: 259 PTPTAPSSAPATAPAAAAPQAAATSSSASG 288
>gnl|CDD|131440 TIGR02387, rpoC1_cyan, DNA-directed RNA polymerase, gamma subunit.
The RNA polymerase gamma subunit, encoded by the rpoC1
gene, is found in cyanobacteria and corresponds to the
N-terminal region the beta' subunit, encoded by rpoC, in
other bacteria. The equivalent subunit in plastids and
chloroplasts is designated beta', while the product of
the rpoC2 gene is designated beta''.
Length = 619
Score = 29.6 bits (66), Expect = 6.2
Identities = 13/23 (56%), Positives = 14/23 (60%), Gaps = 1/23 (4%)
Query: 244 TPSLVTWTTVDEIPVIPPDRTRP 266
T S W +D IPVIPPD RP
Sbjct: 231 TSSRPEWMVLDVIPVIPPD-LRP 252
>gnl|CDD|218421 pfam05086, Dicty_REP, Dictyostelium (Slime Mold) REP protein. This
family consists of REP proteins from Dictyostelium
(Slime molds). REP protein is likely involved in
transcription regulation and control of DNA replication,
specifically amplification of plasmid at low copy
numbers. The formation of homomultimers may be required
for their regulatory activity.
Length = 910
Score = 29.8 bits (67), Expect = 6.8
Identities = 20/110 (18%), Positives = 38/110 (34%), Gaps = 6/110 (5%)
Query: 32 KNDTSPIRNVTTTVVTTKPYPETTTTTEKTEPSPSTVYETSSMSPSSPKPSPTTSTVSTT 91
+ P N TTT TT TTT +T S S+ +P+ +
Sbjct: 247 QPSKRP-NNTTTTTTTTT----TTTFQPRTRKRKSIDDHKLSL-NQAPEKFKNNTKPDDD 300
Query: 92 AFIDESNEIESQGINMSNYKEDVTTTVVTTKPYPETTITTEKTEPSPSTV 141
D S++ + ++ + + D + V+ + TE + +
Sbjct: 301 PQSDFSDKGSRKSGSLKDVRIDNISCSVSHNGVNISISPTETIISVSNKI 350
>gnl|CDD|236652 PRK10118, PRK10118, flagellar hook-length control protein;
Provisional.
Length = 408
Score = 29.4 bits (66), Expect = 6.9
Identities = 17/106 (16%), Positives = 28/106 (26%), Gaps = 11/106 (10%)
Query: 260 PPDRTRPPPLVTTIKAPPS----TVASPVTSSTESWVQISLTTLSFPPPQQNVTTTVVTT 315
P V + P ++P + L+ T
Sbjct: 156 TTPVADAPSTVLPAEKPTLLTKDMPSAPQDETHTLSSDEHEKGLTSAQLTTAQPDDAPGT 215
Query: 316 KPYPETTTTTEKTE-------PSPSTVYETSSMSPSSPKPSPTTST 354
P T E PSP T + +++P +P PT +
Sbjct: 216 PAQPLTPLAAEAQAKAEVISTPSPVTAAASPTITPHQTQPLPTAAA 261
>gnl|CDD|217330 pfam03035, RNA_capsid, Calicivirus putative RNA polymerase/capsid
protein.
Length = 226
Score = 28.9 bits (65), Expect = 7.6
Identities = 22/106 (20%), Positives = 37/106 (34%), Gaps = 9/106 (8%)
Query: 52 PETTTTTEKTEPSPSTVY--ETSSMSPSSPKPSPTTSTVSTTAFIDESNEIESQGINMSN 109
+ S S+V T S SS S ++++ T S+ + SQ N+
Sbjct: 122 SSPVPVPPSSSSSASSVSSQSTQSTGLSSSSYSSSSASSRT------SSWVRSQNSNLEP 175
Query: 110 YKEDVTTTVVTTKPYPETTITTEKTEPSPSTVYETSSSGRNIRHLP 155
+ T T P T ++ P V ++ + N R P
Sbjct: 176 FMPGALQTAWVTPP-SSTASSSGTVSTVPKGVLDSWTPAFNTRRQP 220
>gnl|CDD|216513 pfam01456, Mucin, Mucin-like glycoprotein. This family of
trypanosomal proteins resemble vertebrate mucins. The
protein consists of three regions. The N and C terminii
are conserved between all members of the family, whereas
the central region is not well conserved and contains a
large number of threonine residues which can be
glycosylated. Indirect evidence suggested that these
genes might encode the core protein of parasite mucins,
glycoproteins that were proposed to be involved in the
interaction with, and invasion of, mammalian host cells.
This family contains an N-terminal signal peptide.
Length = 143
Score = 28.3 bits (62), Expect = 7.7
Identities = 16/74 (21%), Positives = 27/74 (36%)
Query: 276 PPSTVASPVTSSTESWVQISLTTLSFPPPQQNVTTTVVTTKPYPETTTTTEKTEPSPSTV 335
+T P T++T + ++TT + TTT TT + TT P+ +
Sbjct: 48 TTTTTTPPTTTTTTTTTTTTITTTTTKTTTTTTTTTTTTTTTEAPSKNTTTSEAPTTTDT 107
Query: 336 YETSSMSPSSPKPS 349
SS+
Sbjct: 108 RAPSSIREIDGSLG 121
>gnl|CDD|114140 pfam05399, EVI2A, Ectropic viral integration site 2A protein
(EVI2A). This family contains several mammalian
ectropic viral integration site 2A (EVI2A) proteins. The
function of this protein is unknown although it is
thought to be a membrane protein and may function as an
oncogene in retrovirus induced myeloid tumours.
Length = 227
Score = 28.9 bits (64), Expect = 7.8
Identities = 28/99 (28%), Positives = 35/99 (35%), Gaps = 14/99 (14%)
Query: 286 SSTESWVQISLTTLSFPPPQQNVTTTVVTTKPYPETTTTTEKTEPSPSTVYETS---SMS 342
S E W I S P+ N T T T +T P +Y TS +
Sbjct: 34 VSNEIWYSICQNLTSRNIPETNNTNPPTPEVNGKSTPTAEPQTST-PVPLYSTSGSNFFT 92
Query: 343 PSSPKPSPTTSTVSTTAFIDESNEIESQGINMSNYKEVC 381
PSS + SP T N +S+G KEVC
Sbjct: 93 PSSAQNSPDTGGP--------GNTSKSKGETFK--KEVC 121
>gnl|CDD|115650 pfam07010, Endomucin, Endomucin. This family consists of several
mammalian endomucin proteins. Endomucin is an early
endothelial-specific antigen that is also expressed on
putative hematopoietic progenitor cells.
Length = 259
Score = 28.9 bits (64), Expect = 8.2
Identities = 34/132 (25%), Positives = 55/132 (41%), Gaps = 17/132 (12%)
Query: 241 TSTTPSLVTWTTVDEIPVIPPDRTRPPPLVTTIKAPPSTVASPVTSSTESWVQISLTTLS 300
++T S+ T TV V P PP TT + +++ S TS T ++ TT
Sbjct: 35 STTKASITTPNTVSLKNVNKPTTGTPPKGTTTSELLKTSLMSTATSLTTPKHELKTTTTG 94
Query: 301 F-------------PPPQQNVTTTVVTTKPYPETTTTTEKTEPSPSTVYETSSMSPSSPK 347
N +T+ +++ E ++ TE SP++V + +SPK
Sbjct: 95 VRKNESSTSKVTVTNVTLSNAVSTLQSSQNKTENQSSIRTTEISPTSVLQPD----ASPK 150
Query: 348 PSPTTSTVSTTA 359
+ TTS TTA
Sbjct: 151 KTGTTSASLTTA 162
>gnl|CDD|236940 PRK11633, PRK11633, cell division protein DedD; Provisional.
Length = 226
Score = 28.8 bits (65), Expect = 8.4
Identities = 21/110 (19%), Positives = 32/110 (29%), Gaps = 12/110 (10%)
Query: 256 IPVIPP--DRTRP---PPLVTTIKAPPSTVASPVTSSTESWVQISLTTLSFPPPQQNVTT 310
IP++P DR P P + P A+ + ++ PP
Sbjct: 41 IPLVPKPGDRDEPDMMPAATQALPTQPPEGAAEAVRAGDAAAPSLDPATVAPPNTPVEPE 100
Query: 311 TVVTTKPYPE-TTTTTEKTEPSPSTVYETSSMSPSSPKPSPTTSTVSTTA 359
P P+ K +P +P +PKP P A
Sbjct: 101 PAPVEPPKPKPVEKPKPKPKPQQKV------EAPPAPKPEPKPVVEEKAA 144
>gnl|CDD|185628 PTZ00449, PTZ00449, 104 kDa microneme/rhoptry antigen; Provisional.
Length = 943
Score = 29.3 bits (65), Expect = 8.8
Identities = 21/127 (16%), Positives = 37/127 (29%), Gaps = 5/127 (3%)
Query: 15 PKHAFCDEERERNFAMYKNDTSPIRNVTTTVVTTKPYPETTTTTEKTEP-----SPSTVY 69
PK F + +E+ + Y + + + TTVV + + T P +P +
Sbjct: 663 PKPPFDPKFKEKFYDDYLDAAAKSKETKTTVVLDESFESILKETLPETPGTPFTTPRPLP 722
Query: 70 ETSSMSPSSPKPSPTTSTVSTTAFIDESNEIESQGINMSNYKEDVTTTVVTTKPYPETTI 129
P I+ E + D + + + E I
Sbjct: 723 PKLPRDEEFPFEPIGDPDAEQPDDIEFFTPPEEERTFFHETPADTPLPDILAEEFKEEDI 782
Query: 130 TTEKTEP 136
E EP
Sbjct: 783 HAETGEP 789
>gnl|CDD|237191 PRK12757, PRK12757, cell division protein FtsN; Provisional.
Length = 256
Score = 28.9 bits (65), Expect = 8.9
Identities = 11/60 (18%), Positives = 18/60 (30%), Gaps = 2/60 (3%)
Query: 292 VQISLTTLSFPPPQQNVTTTVVTTKPYPETTTTTEKTEPSPSTVYETSSMSPSSPKPSPT 351
VQI PP P +TT + P+P + ++ + P
Sbjct: 119 VQIQQQAQQQQPPATTAQP--QPVTPPRQTTAPVQPQTPAPVRTQPAAPVTQAVEAPKVE 176
>gnl|CDD|189554 pfam00454, PI3_PI4_kinase, Phosphatidylinositol 3- and 4-kinase.
Some members of this family probably do not have lipid
kinase activity and are protein kinases, .
Length = 233
Score = 28.5 bits (64), Expect = 9.4
Identities = 28/99 (28%), Positives = 38/99 (38%), Gaps = 7/99 (7%)
Query: 378 KEVCGRRLFPSSRIVGGEKATFGKW-PWQISLRQWIRSTYLHKCGAALFNENWAVTAAHC 436
E RRL I G + +W P +L + I TY+ K G LFN + V
Sbjct: 31 GEGLDRRLAAYLVIPLGPGSGLIEWVPNSTTLAE-IPRTYMVKKGIPLFNYSRKVLVFES 89
Query: 437 VEDVPPSDLLLR-LGEHDLSTEEEPYGYQERRVQIVASH 474
+ P LL+ +H EE + E R V S
Sbjct: 90 RTALFPKVGLLQWFVKHFPDAEE----WGEARKNFVRSC 124
>gnl|CDD|222812 PHA00727, PHA00727, hypothetical protein.
Length = 278
Score = 28.7 bits (64), Expect = 9.8
Identities = 13/30 (43%), Positives = 16/30 (53%), Gaps = 1/30 (3%)
Query: 306 QNVTTTVVTTKPYPETTTTTEK-TEPSPST 334
Q+VT T T TTTT+ T PS S+
Sbjct: 247 QHVTNTTTTHLATTTTTTTSPSFTIPSNSS 276
>gnl|CDD|140276 PTZ00249, PTZ00249, variable surface protein Vir28; Provisional.
Length = 516
Score = 28.8 bits (64), Expect = 10.0
Identities = 22/106 (20%), Positives = 31/106 (29%), Gaps = 24/106 (22%)
Query: 260 PPDRTRPPPLVTTIKAPPSTV--ASPVTSSTESWVQISLT---------------TLSFP 302
P T PPLVT P S +PV + T + V + T S P
Sbjct: 275 HPGETHTPPLVT---VPSSKAHDRNPVQTPTPTSVSGYSSQAKGLEKQAGGESERTSSVP 331
Query: 303 PPQQNVTTTVVTTKPYPETTTTTEKTEPSPSTVYETSSMSPSSPKP 348
Q + V+ ++E E T P+
Sbjct: 332 SEQFPLPLPVLLPLGQSGPLESSESEE----TDEYAGPKGLPEPEL 373
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.315 0.130 0.404
Gapped
Lambda K H
0.267 0.0738 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 34,968,328
Number of extensions: 3293296
Number of successful extensions: 4356
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3848
Number of HSP's successfully gapped: 219
Length of query: 697
Length of database: 10,937,602
Length adjustment: 104
Effective length of query: 593
Effective length of database: 6,324,786
Effective search space: 3750598098
Effective search space used: 3750598098
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 62 (27.6 bits)