RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy4257
(419 letters)
>gnl|CDD|227223 COG4886, COG4886, Leucine-rich repeat (LRR) protein [Function
unknown].
Length = 394
Score = 62.7 bits (152), Expect = 1e-10
Identities = 82/264 (31%), Positives = 119/264 (45%), Gaps = 10/264 (3%)
Query: 6 NWLFLNNNRLKSLEGQLGTLSKLQLLVIEQNQLEALPSDIQLFSQLGSLYANNNRITSLD 65
L L++N L L L L L LL ++ L SL N NR+ S
Sbjct: 50 LALNLSSNTLLLLPSSLSRLLSLDLLSPSGISSLDGSENLLNLLPLPSLDLNLNRLRSNI 109
Query: 66 GLLRGLTKLQVFNMDFNQITMVRRDEFQNLHNLDSISLQNNQITSMNSSLSGLTKLAYLY 125
L LT L ++D N IT + NL + L +N+I S+ S L L L L
Sbjct: 110 SELLELTNLTSLDLDNNNITDIPPLIGLLKSNLKELDLSDNKIESLPSPLRNLPNLKNLD 169
Query: 126 LSHNQLTEFLLDDIRGLKRLRTVDLSYNKINKFGTRNEGKNQVQGVTNIFELKLQHNEIE 185
LS N L++ L + L L +DLS NKI+ E + + EL L +N I
Sbjct: 170 LSFNDLSD-LPKLLSNLSNLNNLDLSGNKISDLPPEIELLS------ALEELDLSNNSII 222
Query: 186 NLDGALMGIHGLSRLDLSHNKLRTISPDDFIGLDSLKMLDISHNLLTTLEETSKTFLPAL 245
L +L + LS L+LS+NKL + P+ L +L+ LD+S+N ++++ L L
Sbjct: 223 ELLSSLSNLKNLSGLELSNNKLEDL-PESIGNLSNLETLDLSNNQISSISSLGS--LTNL 279
Query: 246 EELFVSHNSLTRLDKDFHGLPVLC 269
EL +S NSL+ L +L
Sbjct: 280 RELDLSGNSLSNALPLIALLLLLL 303
Score = 62.7 bits (152), Expect = 1e-10
Identities = 66/235 (28%), Positives = 116/235 (49%), Gaps = 11/235 (4%)
Query: 1 GLTDLNWLFLNNNRLKSLEGQLGTLSKLQLLVIEQNQLEALPSDIQLFSQLGSLYANNNR 60
L L+ L + L L L L + N+L + S++ + L SL +NN
Sbjct: 68 RLLSLDLLSPSGISSLDGSENLLNLLPLPSLDLNLNRLRSNISELLELTNLTSLDLDNNN 127
Query: 61 ITSLDGLLRGLT-KLQVFNMDFNQITMVRRDEFQNLHNLDSISLQNNQITSMNSSLSGLT 119
IT + L+ L L+ ++ N+I + +NL NL ++ L N ++ + LS L+
Sbjct: 128 ITDIPPLIGLLKSNLKELDLSDNKIESLPS-PLRNLPNLKNLDLSFNDLSDLPKLLSNLS 186
Query: 120 KLAYLYLSHNQLTEFLLDDIRGLKRLRTVDLSYNKINKFGTRNEGKNQVQGVTNIFELKL 179
L L LS N++++ L +I L L +DLS N I + + + + N+ L+L
Sbjct: 187 NLNNLDLSGNKISD-LPPEIELLSALEELDLSNNSIIEL------LSSLSNLKNLSGLEL 239
Query: 180 QHNEIENLDGALMGIHGLSRLDLSHNKLRTISPDDFIGLDSLKMLDISHNLLTTL 234
+N++E+L ++ + L LDLS+N++ +IS L +L+ LD+S N L+
Sbjct: 240 SNNKLEDLPESIGNLSNLETLDLSNNQISSISS--LGSLTNLRELDLSGNSLSNA 292
Score = 57.3 bits (138), Expect = 6e-09
Identities = 55/186 (29%), Positives = 77/186 (41%), Gaps = 3/186 (1%)
Query: 2 LTDLNWLFLNNNRLKSLEGQLGTLSKLQLLVIEQNQLEALPSDIQLFSQLGSLYANNNRI 61
L +L L L+ N L L L LS L L + N++ LP +I+L S L L +NN I
Sbjct: 162 LPNLKNLDLSFNDLSDLPKLLSNLSNLNNLDLSGNKISDLPPEIELLSALEELDLSNNSI 221
Query: 62 TSLDGLLRGLTKLQVFNMDFNQITMVRRDEFQNLHNLDSISLQNNQITSMNSSLSGLTKL 121
L L L L + N++ + + NL NL+++ L NNQI+S+ SSL LT L
Sbjct: 222 IELLSSLSNLKNLSGLELSNNKLEDL-PESIGNLSNLETLDLSNNQISSI-SSLGSLTNL 279
Query: 122 AYLYLSHNQLTEFLLDDIRGLKRLRTVDLSYNKINKFGTRNEGKNQVQGVTNIFELKLQH 181
L LS N L+ L L L L+ K +
Sbjct: 280 RELDLSGNSLSNALPLIALLLLLLEL-LLNLLLTLKALELKLNSILLNNNILSNGETSSP 338
Query: 182 NEIENL 187
+ L
Sbjct: 339 EALSIL 344
>gnl|CDD|206026 pfam13855, LRR_8, Leucine rich repeat.
Length = 60
Score = 55.6 bits (135), Expect = 2e-10
Identities = 26/60 (43%), Positives = 38/60 (63%)
Query: 196 GLSRLDLSHNKLRTISPDDFIGLDSLKMLDISHNLLTTLEETSKTFLPALEELFVSHNSL 255
L LDLS+N+L I F GL +LK+LD+S N LT++ + + LP+L L +S N+L
Sbjct: 1 NLKSLDLSNNRLTVIPDGAFKGLPNLKVLDLSGNNLTSISPEAFSGLPSLRSLDLSGNNL 60
Score = 52.9 bits (128), Expect = 2e-09
Identities = 27/57 (47%), Positives = 35/57 (61%), Gaps = 1/57 (1%)
Query: 176 ELKLQHNEIENL-DGALMGIHGLSRLDLSHNKLRTISPDDFIGLDSLKMLDISHNLL 231
L L +N + + DGA G+ L LDLS N L +ISP+ F GL SL+ LD+S N L
Sbjct: 4 SLDLSNNRLTVIPDGAFKGLPNLKVLDLSGNNLTSISPEAFSGLPSLRSLDLSGNNL 60
Score = 48.7 bits (117), Expect = 7e-08
Identities = 20/60 (33%), Positives = 33/60 (55%), Gaps = 1/60 (1%)
Query: 73 KLQVFNMDFNQITMVRRDEFQNLHNLDSISLQNNQITSMNS-SLSGLTKLAYLYLSHNQL 131
L+ ++ N++T++ F+ L NL + L N +TS++ + SGL L L LS N L
Sbjct: 1 NLKSLDLSNNRLTVIPDGAFKGLPNLKVLDLSGNNLTSISPEAFSGLPSLRSLDLSGNNL 60
Score = 47.2 bits (113), Expect = 3e-07
Identities = 24/60 (40%), Positives = 33/60 (55%), Gaps = 1/60 (1%)
Query: 97 NLDSISLQNNQITSMN-SSLSGLTKLAYLYLSHNQLTEFLLDDIRGLKRLRTVDLSYNKI 155
NL S+ L NN++T + + GL L L LS N LT + GL LR++DLS N +
Sbjct: 1 NLKSLDLSNNRLTVIPDGAFKGLPNLKVLDLSGNNLTSISPEAFSGLPSLRSLDLSGNNL 60
Score = 46.8 bits (112), Expect = 3e-07
Identities = 25/60 (41%), Positives = 38/60 (63%), Gaps = 1/60 (1%)
Query: 220 SLKMLDISHNLLTTLEETSKTFLPALEELFVSHNSLTRLDKD-FHGLPVLCKADLAHNNI 278
+LK LD+S+N LT + + + LP L+ L +S N+LT + + F GLP L DL+ NN+
Sbjct: 1 NLKSLDLSNNRLTVIPDGAFKGLPNLKVLDLSGNNLTSISPEAFSGLPSLRSLDLSGNNL 60
Score = 43.7 bits (104), Expect = 4e-06
Identities = 21/60 (35%), Positives = 34/60 (56%), Gaps = 1/60 (1%)
Query: 50 QLGSLYANNNRITSL-DGLLRGLTKLQVFNMDFNQITMVRRDEFQNLHNLDSISLQNNQI 108
L SL +NNR+T + DG +GL L+V ++ N +T + + F L +L S+ L N +
Sbjct: 1 NLKSLDLSNNRLTVIPDGAFKGLPNLKVLDLSGNNLTSISPEAFSGLPSLRSLDLSGNNL 60
Score = 34.1 bits (79), Expect = 0.009
Identities = 17/59 (28%), Positives = 27/59 (45%), Gaps = 2/59 (3%)
Query: 5 LNWLFLNNNRLKSLE-GQLGTLSKLQLLVIEQNQLEALPSD-IQLFSQLGSLYANNNRI 61
L L L+NNRL + G L L++L + N L ++ + L SL + N +
Sbjct: 2 LKSLDLSNNRLTVIPDGAFKGLPNLKVLDLSGNNLTSISPEAFSGLPSLRSLDLSGNNL 60
Score = 33.7 bits (78), Expect = 0.013
Identities = 22/65 (33%), Positives = 32/65 (49%), Gaps = 8/65 (12%)
Query: 245 LEELFVSHNSLTRLDKD-FHGLPVLCKADLAHNNIKAINIQLALKTQCQIFGLNSTLRIY 303
L+ L +S+N LT + F GLP L DL+ NN+ +I+ + GL S +
Sbjct: 2 LKSLDLSNNRLTVIPDGAFKGLPNLKVLDLSGNNLTSISPEA-------FSGLPSLRSLD 54
Query: 304 LEGNP 308
L GN
Sbjct: 55 LSGNN 59
Score = 31.7 bits (73), Expect = 0.057
Identities = 18/65 (27%), Positives = 28/65 (43%), Gaps = 5/65 (7%)
Query: 120 KLAYLYLSHNQLTEFLLDDIRGLKRLRTVDLSYNKINKFGTRNEGKNQVQGVTNIFELKL 179
L L LS+N+LT +GL L+ +DLS N + G+ ++ L L
Sbjct: 1 NLKSLDLSNNRLTVIPDGAFKGLPNLKVLDLSGNNLTSI-----SPEAFSGLPSLRSLDL 55
Query: 180 QHNEI 184
N +
Sbjct: 56 SGNNL 60
Score = 26.4 bits (59), Expect = 5.0
Identities = 13/39 (33%), Positives = 19/39 (48%), Gaps = 1/39 (2%)
Query: 1 GLTDLNWLFLNNNRLKSLEGQ-LGTLSKLQLLVIEQNQL 38
GL +L L L+ N L S+ + L L+ L + N L
Sbjct: 22 GLPNLKVLDLSGNNLTSISPEAFSGLPSLRSLDLSGNNL 60
>gnl|CDD|215061 PLN00113, PLN00113, leucine-rich repeat receptor-like protein
kinase; Provisional.
Length = 968
Score = 58.3 bits (141), Expect = 4e-09
Identities = 77/310 (24%), Positives = 133/310 (42%), Gaps = 29/310 (9%)
Query: 12 NNRLKSLEGQLGTLSKLQLLVIEQNQLEA-LPSDIQLFSQLGSLYANNNRITSL--DGLL 68
NN + L +L +LQ+L + N+ +P ++ + L L + N +T +GL
Sbjct: 318 NNFTGKIPVALTSLPRLQVLQLWSNKFSGEIPKNLGKHNNLTVLDLSTNNLTGEIPEGLC 377
Query: 69 RG--LTKLQVFNMDFNQITMVRRDEFQNLHNLDSISLQNNQITSMNSSLSGLTKLAYLY- 125
L KL +F+ N + +L + LQ+N + S TKL +Y
Sbjct: 378 SSGNLFKLILFS---NSLEGEIPKSLGACRSLRRVRLQDNSFSGELPS--EFTKLPLVYF 432
Query: 126 --LSHNQLTEFLLDDIRGLKRLRTVDLSYNKINKFGTRNEGKNQVQGVTNIFELKLQHNE 183
+S+N L + + L+ + L+ NK FG G G + L L N+
Sbjct: 433 LDISNNNLQGRINSRKWDMPSLQMLSLARNKF--FG----GLPDSFGSKRLENLDLSRNQ 486
Query: 184 I-ENLDGALMGIHGLSRLDLSHNKLRTISPDDFIGLDSLKMLDISHNLLTTLEETSKTFL 242
+ L + L +L LS NKL PD+ L LD+SHN L+ S + +
Sbjct: 487 FSGAVPRKLGSLSELMQLKLSENKLSGEIPDELSSCKKLVSLDLSHNQLSGQIPASFSEM 546
Query: 243 PALEELFVSHNSLT-RLDKDFHGLPVLCKADLAHNNIKAINIQLALKTQCQIFGLNSTLR 301
P L +L +S N L+ + K+ + L + +++HN++ +L + +N++
Sbjct: 547 PVLSQLDLSQNQLSGEIPKNLGNVESLVQVNISHNHLHG-----SLPSTGAFLAINASA- 600
Query: 302 IYLEGNPVLC 311
+ GN LC
Sbjct: 601 --VAGNIDLC 608
Score = 56.0 bits (135), Expect = 2e-08
Identities = 71/250 (28%), Positives = 106/250 (42%), Gaps = 35/250 (14%)
Query: 11 NNNRLKSLEGQLGTLSKLQLLVIEQNQLEA-LPSDIQLFSQLGSLYANNNRITS-LDGLL 68
NNN S+ G++ L+ L + N L +P+DI FS L L N + + L
Sbjct: 127 NNNFTGSIPR--GSIPNLETLDLSNNMLSGEIPNDIGSFSSLKVLDLGGNVLVGKIPNSL 184
Query: 69 RGLTKLQVFNMDFNQITMVRRDEFQNLHNLDSISLQNN-----------QITSMN----- 112
LT L+ + NQ+ E + +L I L N +TS+N
Sbjct: 185 TNLTSLEFLTLASNQLVGQIPRELGQMKSLKWIYLGYNNLSGEIPYEIGGLTSLNHLDLV 244
Query: 113 ---------SSLSGLTKLAYLYLSHNQLTEFLLDDIRGLKRLRTVDLSYNKINKFGTRNE 163
SSL L L YL+L N+L+ + I L++L ++DLS N ++ G E
Sbjct: 245 YNNLTGPIPSSLGNLKNLQYLFLYQNKLSGPIPPSIFSLQKLISLDLSDNSLS--GEIPE 302
Query: 164 GKNQVQGVTNIFELKLQHNEIE-NLDGALMGIHGLSRLDLSHNKLRTISPDDFIGLDSLK 222
Q+Q N+ L L N + AL + L L L NK P + ++L
Sbjct: 303 LVIQLQ---NLEILHLFSNNFTGKIPVALTSLPRLQVLQLWSNKFSGEIPKNLGKHNNLT 359
Query: 223 MLDISHNLLT 232
+LD+S N LT
Sbjct: 360 VLDLSTNNLT 369
Score = 49.1 bits (117), Expect = 4e-06
Identities = 75/303 (24%), Positives = 114/303 (37%), Gaps = 28/303 (9%)
Query: 2 LTDLNWLFLNNNRLK-SLEGQLGTLSKLQLLVIEQNQLEA-LPSDIQLFSQLGSLYANNN 59
L +L +LFL N+L + + +L KL L + N L +P + L L+ +N
Sbjct: 259 LKNLQYLFLYQNKLSGPIPPSIFSLQKLISLDLSDNSLSGEIPELVIQLQNLEILHLFSN 318
Query: 60 RIT-SLDGLLRGLTKLQVFNMDFNQITMVRRDEFQNLHNLDSISLQNNQITS-MNSSLSG 117
T + L L +LQV + N+ + +NL + L N +T + L
Sbjct: 319 NFTGKIPVALTSLPRLQVLQLWSNKFSGEIPKNLGKHNNLTVLDLSTNNLTGEIPEGLCS 378
Query: 118 LTKLAYLYLSHNQLTEFLLDDIRGLKRLRTVDLSYNKINKFGTRNEGK-----------N 166
L L L N L + + + LR V L N + K N
Sbjct: 379 SGNLFKLILFSNSLEGEIPKSLGACRSLRRVRLQDNSFSGELPSEFTKLPLVYFLDISNN 438
Query: 167 QVQGVTNIFELKLQHNEIENLDGA----------LMGIHGLSRLDLSHNKLRTISPDDFI 216
+QG I K ++ L A G L LDLS N+ P
Sbjct: 439 NLQG--RINSRKWDMPSLQMLSLARNKFFGGLPDSFGSKRLENLDLSRNQFSGAVPRKLG 496
Query: 217 GLDSLKMLDISHNLLTTLEETSKTFLPALEELFVSHNSLT-RLDKDFHGLPVLCKADLAH 275
L L L +S N L+ + L L +SHN L+ ++ F +PVL + DL+
Sbjct: 497 SLSELMQLKLSENKLSGEIPDELSSCKKLVSLDLSHNQLSGQIPASFSEMPVLSQLDLSQ 556
Query: 276 NNI 278
N +
Sbjct: 557 NQL 559
Score = 40.6 bits (95), Expect = 0.002
Identities = 66/258 (25%), Positives = 111/258 (43%), Gaps = 28/258 (10%)
Query: 51 LGSLYANNNRITSLD-----------GLLRGLTKLQVFNMDFNQIT-MVRRDEFQNLHNL 98
G N++R+ S+D + L +Q N+ NQ++ + D F +L
Sbjct: 61 QGITCNNSSRVVSIDLSGKNISGKISSAIFRLPYIQTINLSNNQLSGPIPDDIFTTSSSL 120
Query: 99 DSISLQNNQITSM--NSSLSGLTKLAYLYLSHNQLTEFLLDDIRGLKRLRTVDLSYNKIN 156
++L NN T S+ L L LS+N L+ + +DI L+ +DL N +
Sbjct: 121 RYLNLSNNNFTGSIPRGSIPNLETLD---LSNNMLSGEIPNDIGSFSSLKVLDLGGNVLV 177
Query: 157 KFGTRNEGK--NQVQGVTNIFELKLQHNEIENLDGALMG-IHGLSRLDLSHNKLRTISPD 213
GK N + +T++ L L N++ +G + L + L +N L P
Sbjct: 178 -------GKIPNSLTNLTSLEFLTLASNQLVGQIPRELGQMKSLKWIYLGYNNLSGEIPY 230
Query: 214 DFIGLDSLKMLDISHNLLTTLEETSKTFLPALEELFVSHNSLT-RLDKDFHGLPVLCKAD 272
+ GL SL LD+ +N LT +S L L+ LF+ N L+ + L L D
Sbjct: 231 EIGGLTSLNHLDLVYNNLTGPIPSSLGNLKNLQYLFLYQNKLSGPIPPSIFSLQKLISLD 290
Query: 273 LAHNNIKAINIQLALKTQ 290
L+ N++ +L ++ Q
Sbjct: 291 LSDNSLSGEIPELVIQLQ 308
>gnl|CDD|226406 COG3889, COG3889, Predicted solute binding protein [General
function prediction only].
Length = 872
Score = 50.2 bits (120), Expect = 1e-06
Identities = 20/58 (34%), Positives = 30/58 (51%), Gaps = 5/58 (8%)
Query: 329 TKIHGETICQPDSNETST-----TTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTT 381
T + ET++ T T T TTT P+P T++ TQ ST++TTT +P+
Sbjct: 786 TLSYSAYSNTSILIETTSVVITKTVTQTQTTTSSPSPTQTTSPTQTSTSTTTTTSPSQ 843
Score = 47.5 bits (113), Expect = 1e-05
Identities = 15/37 (40%), Positives = 25/37 (67%)
Query: 344 TSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPT 380
+ T TTT++ +P T +PT T+T +TT++ +QT T
Sbjct: 810 VTQTQTTTSSPSPTQTTSPTQTSTSTTTTTSPSQTTT 846
Score = 46.8 bits (111), Expect = 2e-05
Identities = 19/39 (48%), Positives = 25/39 (64%), Gaps = 1/39 (2%)
Query: 344 TSTTTTT-TTTTTPEPTPAPTSTTTQRSTTSTTTQTPTT 381
T T T T TTT++P PT + T T STT+TT+ + TT
Sbjct: 807 TKTVTQTQTTTSSPSPTQTTSPTQTSTSTTTTTSPSQTT 845
Score = 44.9 bits (106), Expect = 7e-05
Identities = 20/90 (22%), Positives = 29/90 (32%), Gaps = 14/90 (15%)
Query: 318 VIDAMETINNNTKIHGETICQPDSNETSTTTT-------------TTTTTTPEPTPAPTS 364
++D + G + + T T TT TT+ T T
Sbjct: 755 IMDPASGTGTSITTSGTFTAEVPQSPTKTETTLSYSAYSNTSILIETTSVVITKTVTQTQ 814
Query: 365 TTTQRSTTSTTTQTPTTPIQEEYTETITLE 394
TTT S + T T +PT T T +
Sbjct: 815 TTTS-SPSPTQTTSPTQTSTSTTTTTSPSQ 843
Score = 41.4 bits (97), Expect = 8e-04
Identities = 14/35 (40%), Positives = 21/35 (60%), Gaps = 2/35 (5%)
Query: 344 TSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQT 378
T+++ + T TT+P T T+TTT S + TTT
Sbjct: 816 TTSSPSPTQTTSPTQTSTSTTTTT--SPSQTTTGG 848
Score = 39.8 bits (93), Expect = 0.002
Identities = 23/83 (27%), Positives = 27/83 (32%), Gaps = 15/83 (18%)
Query: 341 SNETSTTTTTTTTTTPEPTPAPTSTT---------------TQRSTTSTTTQTPTTPIQE 385
TS TT+ T T +P T TT T T T TQT TT
Sbjct: 761 GTGTSITTSGTFTAEVPQSPTKTETTLSYSAYSNTSILIETTSVVITKTVTQTQTTTSSP 820
Query: 386 EYTETITLELPQPVETNNQIPVQ 408
T+T + T P Q
Sbjct: 821 SPTQTTSPTQTSTSTTTTTSPSQ 843
Score = 39.1 bits (91), Expect = 0.005
Identities = 17/53 (32%), Positives = 25/53 (47%), Gaps = 6/53 (11%)
Query: 344 TSTTTTTTTTTTPEPTPAPTSTTTQRST---TSTTTQTPTTPIQEEYTETITL 393
+T+ T T T T + + TQ ++ TST+T T T+P Q T T
Sbjct: 800 ETTSVVITKTVTQTQTTTSSPSPTQTTSPTQTSTSTTTTTSPSQ---TTTGGG 849
Score = 28.7 bits (64), Expect = 7.3
Identities = 14/75 (18%), Positives = 21/75 (28%), Gaps = 10/75 (13%)
Query: 343 ETSTTTTTTTTTTPEPTPA--------PTSTTTQRSTTS--TTTQTPTTPIQEEYTETIT 392
+ PT P S T TTS T + P +P + E T + +
Sbjct: 731 GSLEVFPAGENWGFIPTTKRVKVRIMDPASGTGTSITTSGTFTAEVPQSPTKTETTLSYS 790
Query: 393 LELPQPVETNNQIPV 407
+ V
Sbjct: 791 AYSNTSILIETTSVV 805
>gnl|CDD|216513 pfam01456, Mucin, Mucin-like glycoprotein. This family of
trypanosomal proteins resemble vertebrate mucins. The
protein consists of three regions. The N and C terminii
are conserved between all members of the family, whereas
the central region is not well conserved and contains a
large number of threonine residues which can be
glycosylated. Indirect evidence suggested that these
genes might encode the core protein of parasite mucins,
glycoproteins that were proposed to be involved in the
interaction with, and invasion of, mammalian host cells.
This family contains an N-terminal signal peptide.
Length = 143
Score = 46.0 bits (108), Expect = 4e-06
Identities = 29/76 (38%), Positives = 38/76 (50%)
Query: 326 NNNTKIHGETICQPDSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQE 385
+ E Q + T+TT TTTTTT T T+TTT+ +TT+TTT T TT +
Sbjct: 32 DAAVVEAAEGQSQTTTTTTTTTPPTTTTTTTTTTTTITTTTTKTTTTTTTTTTTTTTTEA 91
Query: 386 EYTETITLELPQPVET 401
T T E P +T
Sbjct: 92 PSKNTTTSEAPTTTDT 107
Score = 41.0 bits (95), Expect = 2e-04
Identities = 25/58 (43%), Positives = 30/58 (51%)
Query: 349 TTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQEEYTETITLELPQPVETNNQIP 406
TTTTTTT P T+TTT +T +TTT TT T T T E P T ++ P
Sbjct: 45 TTTTTTTTTPPTTTTTTTTTTTTITTTTTKTTTTTTTTTTTTTTTEAPSKNTTTSEAP 102
Score = 36.0 bits (82), Expect = 0.012
Identities = 24/63 (38%), Positives = 30/63 (47%), Gaps = 5/63 (7%)
Query: 341 SNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQEEYTETITLELPQPVE 400
++ TTTTTTTTT P T+TTT +TT+ TT T T T T T P +
Sbjct: 40 EGQSQTTTTTTTTTPPT-----TTTTTTTTTTTITTTTTKTTTTTTTTTTTTTTTEAPSK 94
Query: 401 TNN 403
Sbjct: 95 NTT 97
Score = 34.8 bits (79), Expect = 0.026
Identities = 21/52 (40%), Positives = 28/52 (53%)
Query: 324 TINNNTKIHGETICQPDSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTT 375
T T TI + T+TTTTTTTTTT P+ +TT++ TT+ T
Sbjct: 56 TTTTTTTTTTTTITTTTTKTTTTTTTTTTTTTTTEAPSKNTTTSEAPTTTDT 107
Score = 34.5 bits (78), Expect = 0.034
Identities = 23/62 (37%), Positives = 28/62 (45%)
Query: 320 DAMETINNNTKIHGETICQPDSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTP 379
T T T + T TTTTT TTTT T T+TT S +TT++ P
Sbjct: 43 SQTTTTTTTTTPPTTTTTTTTTTTTITTTTTKTTTTTTTTTTTTTTTEAPSKNTTTSEAP 102
Query: 380 TT 381
TT
Sbjct: 103 TT 104
Score = 33.7 bits (76), Expect = 0.067
Identities = 18/57 (31%), Positives = 23/57 (40%)
Query: 336 ICQPDSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQEEYTETIT 392
+C E + T+TTT TT+TTT T TT I T+T T
Sbjct: 21 VCATAQGEGQYDAAVVEAAEGQSQTTTTTTTTTPPTTTTTTTTTTTTITTTTTKTTT 77
Score = 32.5 bits (73), Expect = 0.17
Identities = 20/39 (51%), Positives = 28/39 (71%), Gaps = 1/39 (2%)
Query: 344 TSTTTTTTTTTTPEPTPAPT-STTTQRSTTSTTTQTPTT 381
+TTTTTTTTTT T AP+ +TTT + T+T T+ P++
Sbjct: 74 KTTTTTTTTTTTTTTTEAPSKNTTTSEAPTTTDTRAPSS 112
Score = 32.1 bits (72), Expect = 0.25
Identities = 17/59 (28%), Positives = 21/59 (35%)
Query: 348 TTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQEEYTETITLELPQPVETNNQIP 406
T T+TTT +TT+TTT T TT T T T + P
Sbjct: 34 AVVEAAEGQSQTTTTTTTTTPPTTTTTTTTTTTTITTTTTKTTTTTTTTTTTTTTTEAP 92
Score = 30.2 bits (67), Expect = 1.0
Identities = 14/61 (22%), Positives = 18/61 (29%)
Query: 341 SNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQEEYTETITLELPQPVE 400
+ + T+TTT T+TTT T TT T T
Sbjct: 25 AQGEGQYDAAVVEAAEGQSQTTTTTTTTTPPTTTTTTTTTTTTITTTTTKTTTTTTTTTT 84
Query: 401 T 401
T
Sbjct: 85 T 85
Score = 27.9 bits (61), Expect = 6.9
Identities = 19/52 (36%), Positives = 23/52 (44%), Gaps = 7/52 (13%)
Query: 324 TINNNTKIHGETICQPDSNETSTTTTTTTTTTPEP-------TPAPTSTTTQ 368
T T T+TTTTTTTTTT E + APT+T T+
Sbjct: 57 TTTTTTTTTTTITTTTTKTTTTTTTTTTTTTTTEAPSKNTTTSEAPTTTDTR 108
>gnl|CDD|185285 PRK15387, PRK15387, E3 ubiquitin-protein ligase SspH2; Provisional.
Length = 788
Score = 48.2 bits (114), Expect = 5e-06
Identities = 53/153 (34%), Positives = 76/153 (49%), Gaps = 23/153 (15%)
Query: 3 TDLNWLFLNNNRLKSLEGQLGTL-SKLQLLVIEQNQLEALPSDIQLFSQLGSLYANNNRI 61
++L L+ NN+L SL TL S LQ L + NQL +LP+ L S+L L+A NNR+
Sbjct: 322 SELCKLWAYNNQLTSLP----TLPSGLQELSVSDNQLASLPT---LPSELYKLWAYNNRL 374
Query: 62 TSLDGLLRG----------LTKLQVFNMDFNQITMVRRDEFQNL----HNLDSISLQNNQ 107
TSL L G LT L V + ++ MV + +L L S+S+ NQ
Sbjct: 375 TSLPALPSGLKELIVSGNRLTSLPVLPSELKEL-MVSGNRLTSLPMLPSGLLSLSVYRNQ 433
Query: 108 ITSMNSSLSGLTKLAYLYLSHNQLTEFLLDDIR 140
+T + SL L+ + L N L+E L +R
Sbjct: 434 LTRLPESLIHLSSETTVNLEGNPLSERTLQALR 466
Score = 40.9 bits (95), Expect = 0.001
Identities = 35/107 (32%), Positives = 57/107 (53%), Gaps = 13/107 (12%)
Query: 28 LQLLVIEQNQLEALPSDIQLFSQLGSLYANNNRITSLDGLLRGLTKLQVFNMDFNQITMV 87
LQ L + NQL +LP+ L S+L L+A NN++TSL L GL +L V + + +
Sbjct: 304 LQELSVSDNQLASLPA---LPSELCKLWAYNNQLTSLPTLPSGLQELSVSDNQLASLPTL 360
Query: 88 RRDEFQNLHNLDSISLQNNQITSMNSSLSGLTKLAYLYLSHNQLTEF 134
+ ++ + NN++TS+ + SGL +L +S N+LT
Sbjct: 361 PSELYK-------LWAYNNRLTSLPALPSGLKELI---VSGNRLTSL 397
>gnl|CDD|185268 PRK15370, PRK15370, E3 ubiquitin-protein ligase SlrP; Provisional.
Length = 754
Score = 44.3 bits (104), Expect = 9e-05
Identities = 59/232 (25%), Positives = 98/232 (42%), Gaps = 49/232 (21%)
Query: 31 LVIEQNQLEALPSDIQLFSQLGSLYANNNRITSLDGLLRGLTKLQVFNMDFNQITMVRRD 90
L+++ N+L++LP ++Q + +LYAN+N++TS+ L +Q + N+IT +
Sbjct: 204 LILDNNELKSLPENLQ--GNIKTLYANSNQLTSIPATLP--DTIQEMELSINRITELPER 259
Query: 91 EFQNLHNLDSISLQNNQITSMNSSLSGLTKLAYLYLSHNQLTEFLLDDIRGLKRLRTVDL 150
L +LD L +N+I+ + +L + LR + +
Sbjct: 260 LPSALQSLD---LFHNKISCLPENLP--------------------------EELRYLSV 290
Query: 151 SYNKINKFGTRNEGKNQVQGVTNIFELKLQHNEIENLDGALMGIHGLSRLDLSHNKLRTI 210
N I + I L +Q N + L L GL L+ N L T
Sbjct: 291 YDNSIRTLPAHLP--------SGITHLNVQSNSLTALPETLP--PGLKTLEAGENAL-TS 339
Query: 211 SPDDFIGLDSLKMLDISHNLLTTLEETSKTFLPALEELFVSHNSLTRLDKDF 262
P L++LD+S N +T L ET P + L VS N+LT L ++
Sbjct: 340 LPASLP--PELQVLDVSKNQITVLPET---LPPTITTLDVSRNALTNLPENL 386
Score = 33.5 bits (76), Expect = 0.24
Identities = 36/142 (25%), Positives = 60/142 (42%), Gaps = 28/142 (19%)
Query: 10 LNNNRLKSLEGQLGTLSKLQLLVIEQNQLEALPSDIQLFSQLGSLYANNNRITSL-DGLL 68
L+ NR+ L +L S LQ L + N++ LP ++ + S+Y N+ I +L L
Sbjct: 248 LSINRITELPERLP--SALQSLDLFHNKISCLPENLPEELRYLSVYDNS--IRTLPAHLP 303
Query: 69 RGLTKLQVFNMDFNQITMVRRDEFQNLHNLDS------------------ISLQNNQITS 110
G+T L V + N +T + L L++ + + NQIT
Sbjct: 304 SGITHLNVQS---NSLTALPETLPPGLKTLEAGENALTSLPASLPPELQVLDVSKNQITV 360
Query: 111 MNSSLSGLTKLAYLYLSHNQLT 132
+ +L + L +S N LT
Sbjct: 361 LPETLP--PTITTLDVSRNALT 380
>gnl|CDD|205079 pfam12799, LRR_4, Leucine Rich repeats (2 copies). Leucine rich
repeats are short sequence motifs present in a number of
proteins with diverse functions and cellular locations.
These repeats are usually involved in protein-protein
interactions. Each Leucine Rich Repeat is composed of a
beta-alpha unit. These units form elongated non-globular
structures. Leucine Rich Repeats are often flanked by
cysteine rich domains.
Length = 43
Score = 36.7 bits (86), Expect = 8e-04
Identities = 18/40 (45%), Positives = 23/40 (57%), Gaps = 2/40 (5%)
Query: 119 TKLAYLYLSHNQLTEFLLDDIRGLKRLRTVDLSYNKINKF 158
T L L LS+NQ+T+ L + L L T+DLS NKI
Sbjct: 1 TNLETLDLSNNQITD--LPPLSNLPNLETLDLSGNKITDL 38
Score = 34.4 bits (80), Expect = 0.006
Identities = 17/39 (43%), Positives = 24/39 (61%), Gaps = 1/39 (2%)
Query: 96 HNLDSISLQNNQITSMNSSLSGLTKLAYLYLSHNQLTEF 134
NL+++ L NNQIT + LS L L L LS N++T+
Sbjct: 1 TNLETLDLSNNQITDL-PPLSNLPNLETLDLSGNKITDL 38
Score = 31.3 bits (72), Expect = 0.070
Identities = 16/43 (37%), Positives = 24/43 (55%), Gaps = 2/43 (4%)
Query: 197 LSRLDLSHNKLRTISPDDFIGLDSLKMLDISHNLLTTLEETSK 239
L LDLS+N++ + P L +L+ LD+S N +T L S
Sbjct: 3 LETLDLSNNQITDLPP--LSNLPNLETLDLSGNKITDLSPLSN 43
Score = 30.5 bits (70), Expect = 0.100
Identities = 14/42 (33%), Positives = 24/42 (57%), Gaps = 2/42 (4%)
Query: 72 TKLQVFNMDFNQITMVRRDEFQNLHNLDSISLQNNQITSMNS 113
T L+ ++ NQIT + NL NL+++ L N+IT ++
Sbjct: 1 TNLETLDLSNNQITDL--PPLSNLPNLETLDLSGNKITDLSP 40
Score = 30.5 bits (70), Expect = 0.11
Identities = 13/36 (36%), Positives = 20/36 (55%), Gaps = 1/36 (2%)
Query: 50 QLGSLYANNNRITSLDGLLRGLTKLQVFNMDFNQIT 85
L +L +NN+IT L L L L+ ++ N+IT
Sbjct: 2 NLETLDLSNNQITDLPPL-SNLPNLETLDLSGNKIT 36
Score = 29.0 bits (66), Expect = 0.35
Identities = 17/40 (42%), Positives = 24/40 (60%), Gaps = 2/40 (5%)
Query: 220 SLKMLDISHNLLTTLEETSKTFLPALEELFVSHNSLTRLD 259
+L+ LD+S+N +T L S LP LE L +S N +T L
Sbjct: 2 NLETLDLSNNQITDLPPLSN--LPNLETLDLSGNKITDLS 39
Score = 29.0 bits (66), Expect = 0.42
Identities = 17/41 (41%), Positives = 24/41 (58%), Gaps = 1/41 (2%)
Query: 172 TNIFELKLQHNEIENLDGALMGIHGLSRLDLSHNKLRTISP 212
TN+ L L +N+I +L L + L LDLS NK+ +SP
Sbjct: 1 TNLETLDLSNNQITDLP-PLSNLPNLETLDLSGNKITDLSP 40
Score = 28.2 bits (64), Expect = 0.81
Identities = 12/39 (30%), Positives = 19/39 (48%), Gaps = 1/39 (2%)
Query: 5 LNWLFLNNNRLKSLEGQLGTLSKLQLLVIEQNQLEALPS 43
L L L+NN++ L L L L+ L + N++ L
Sbjct: 3 LETLDLSNNQITDLPP-LSNLPNLETLDLSGNKITDLSP 40
Score = 27.5 bits (62), Expect = 1.4
Identities = 13/41 (31%), Positives = 20/41 (48%), Gaps = 1/41 (2%)
Query: 27 KLQLLVIEQNQLEALPSDIQLFSQLGSLYANNNRITSLDGL 67
L+ L + NQ+ LP + L +L + N+IT L L
Sbjct: 2 NLETLDLSNNQITDLPP-LSNLPNLETLDLSGNKITDLSPL 41
Score = 25.9 bits (58), Expect = 5.4
Identities = 14/40 (35%), Positives = 20/40 (50%), Gaps = 1/40 (2%)
Query: 243 PALEELFVSHNSLTRLDKDFHGLPVLCKADLAHNNIKAIN 282
LE L +S+N +T L LP L DL+ N I ++
Sbjct: 1 TNLETLDLSNNQITDLP-PLSNLPNLETLDLSGNKITDLS 39
>gnl|CDD|237019 PRK11907, PRK11907, bifunctional 2',3'-cyclic nucleotide
2'-phosphodiesterase/3'-nucleotidase precursor protein;
Reviewed.
Length = 814
Score = 41.4 bits (97), Expect = 9e-04
Identities = 16/84 (19%), Positives = 26/84 (30%), Gaps = 7/84 (8%)
Query: 332 HGETICQPDSNETSTTTTTTTTTTPE-------PTPAPTSTTTQRSTTSTTTQTPTTPIQ 384
E I ++ TT + TP +T + ++S T +T +
Sbjct: 27 QAEEIVTTTPATSTEAEQTTPVESDATEEADNTETPVAATTAAEAPSSSETAETSDPTSE 86
Query: 385 EEYTETITLELPQPVETNNQIPVQ 408
T T P T PV+
Sbjct: 87 ATDTTTSEARTVTPAATETSKPVE 110
Score = 33.7 bits (77), Expect = 0.21
Identities = 19/71 (26%), Positives = 28/71 (39%), Gaps = 10/71 (14%)
Query: 317 AVIDAMETINNNTKIHGETICQPDSNETSTTTTTTTTT--TPEPTPAPTSTTTQRSTTST 374
DA E +N P + T+ +++ T T +PT T TTT + T T
Sbjct: 48 VESDATEEADNTE--------TPVAATTAAEAPSSSETAETSDPTSEATDTTTSEARTVT 99
Query: 375 TTQTPTTPIQE 385
T T+ E
Sbjct: 100 PAATETSKPVE 110
Score = 32.5 bits (74), Expect = 0.54
Identities = 17/75 (22%), Positives = 26/75 (34%), Gaps = 1/75 (1%)
Query: 326 NNNTKIHGETICQPDSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQE 385
+T+ T + D+ E + T T T +S T + S ++ TT
Sbjct: 37 ATSTEAEQTTPVESDATEEADNTETPVAATTAAEAPSSSETAETSDPTSEATDTTTSEAR 96
Query: 386 EYTETITLELPQPVE 400
T T PVE
Sbjct: 97 TVTPAATETSK-PVE 110
Score = 29.1 bits (65), Expect = 5.6
Identities = 17/69 (24%), Positives = 19/69 (27%), Gaps = 6/69 (8%)
Query: 341 SNETSTTTTTTTTTTPE-PTPAPTSTTTQRSTTSTTTQTPTTPIQEEYTETITLELPQPV 399
SN TTTP T A +T + T T T T E P
Sbjct: 21 SNPKLAQAEEIVTTTPATSTEAEQTTPVESDATEEADNTETPV-----AATTAAEAPSSS 75
Query: 400 ETNNQIPVQ 408
ET
Sbjct: 76 ETAETSDPT 84
>gnl|CDD|216257 pfam01034, Syndecan, Syndecan domain. Syndecans are transmembrane
heparin sulfate proteoglycans which are implicated in
the binding of extracellular matrix components and
growth factors.
Length = 207
Score = 40.1 bits (94), Expect = 9e-04
Identities = 15/44 (34%), Positives = 18/44 (40%)
Query: 339 PDSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTP 382
S TTT +T+T T +TTT S T T T T
Sbjct: 84 SSSPSNDTTTASTSTKTSPTVSTTVTTTTSPSETDTEEATTTVS 127
Score = 37.0 bits (86), Expect = 0.009
Identities = 18/56 (32%), Positives = 30/56 (53%), Gaps = 5/56 (8%)
Query: 332 HGETICQPDSNETSTTTTTTTTTTPEP-----TPAPTSTTTQRSTTSTTTQTPTTP 382
T SN+T+T +T+T T+ T +P+ T T+ +TT+ +T+TPT
Sbjct: 79 KLTTTSSSPSNDTTTASTSTKTSPTVSTTVTTTTSPSETDTEEATTTVSTETPTEG 134
Score = 36.6 bits (85), Expect = 0.011
Identities = 21/69 (30%), Positives = 31/69 (44%), Gaps = 6/69 (8%)
Query: 331 IHGETICQPDSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQEEYTET 390
T + + +S + TTT +T T T+ STT TTT +P+ EE T T
Sbjct: 72 TTSATPPKLTTTSSSPSNDTTTAST------STKTSPTVSTTVTTTTSPSETDTEEATTT 125
Query: 391 ITLELPQPV 399
++ E P
Sbjct: 126 VSTETPTEG 134
Score = 33.9 bits (78), Expect = 0.10
Identities = 16/57 (28%), Positives = 22/57 (38%)
Query: 329 TKIHGETICQPDSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQE 385
T T P + T TTTT+ + T E ST T S+ P+ + E
Sbjct: 92 TTASTSTKTSPTVSTTVTTTTSPSETDTEEATTTVSTETPTEGGSSAATDPSKNLLE 148
Score = 32.8 bits (75), Expect = 0.23
Identities = 18/65 (27%), Positives = 25/65 (38%), Gaps = 3/65 (4%)
Query: 340 DSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQEEYT---ETITLELP 396
D ++ TT+ T T + S T ++TST T + T ET T E
Sbjct: 64 DDEDSEPVTTSATPPKLTTTSSSPSNDTTTASTSTKTSPTVSTTVTTTTSPSETDTEEAT 123
Query: 397 QPVET 401
V T
Sbjct: 124 TTVST 128
Score = 30.9 bits (70), Expect = 0.83
Identities = 17/57 (29%), Positives = 24/57 (42%), Gaps = 4/57 (7%)
Query: 323 ETINNNTKIHGETICQPDSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTP 379
T +++ T + +TT TTTT+P T +TT T ST T T
Sbjct: 81 TTTSSSPSNDTTTASTSTKTSPTVSTTVTTTTSPSETDTEEATT----TVSTETPTE 133
>gnl|CDD|223031 PHA03273, PHA03273, envelope glycoprotein C; Provisional.
Length = 486
Score = 40.0 bits (93), Expect = 0.002
Identities = 24/86 (27%), Positives = 36/86 (41%), Gaps = 11/86 (12%)
Query: 335 TICQPDSN--ETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQEEYTETI- 391
+I D++ E +T T T TT T +P T T ST + T++ T Q ET
Sbjct: 33 SIENSDNSTAEMQSTPATPTHTTSNLT-SPFGTGTDNSTNANGTESTTQASQPHSHETTI 91
Query: 392 -----TLELP--QPVETNNQIPVQDN 410
+ +P + V+ N V N
Sbjct: 92 TCTKSLISVPYYKSVDMNCTTSVGVN 117
Score = 29.6 bits (66), Expect = 3.8
Identities = 13/65 (20%), Positives = 21/65 (32%)
Query: 341 SNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQEEYTETITLELPQPVE 400
TS++ + +T E P + T S ++ T T T + QP
Sbjct: 27 GASTSSSIENSDNSTAEMQSTPATPTHTTSNLTSPFGTGTDNSTNANGTESTTQASQPHS 86
Query: 401 TNNQI 405
I
Sbjct: 87 HETTI 91
>gnl|CDD|218421 pfam05086, Dicty_REP, Dictyostelium (Slime Mold) REP protein. This
family consists of REP proteins from Dictyostelium
(Slime molds). REP protein is likely involved in
transcription regulation and control of DNA replication,
specifically amplification of plasmid at low copy
numbers. The formation of homomultimers may be required
for their regulatory activity.
Length = 910
Score = 39.8 bits (93), Expect = 0.002
Identities = 19/76 (25%), Positives = 31/76 (40%), Gaps = 1/76 (1%)
Query: 343 ETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQEEYTETITLELPQPVETN 402
+ S + +P+ P +TTT +TT+TTT P T + + + L L Q E
Sbjct: 233 QISINSENIQRINSQPSKRPNNTTTTTTTTTTTTFQPRT-RKRKSIDDHKLSLNQAPEKF 291
Query: 403 NQIPVQDNLEKEKQPD 418
D+ + D
Sbjct: 292 KNNTKPDDDPQSDFSD 307
Score = 34.4 bits (79), Expect = 0.11
Identities = 16/51 (31%), Positives = 21/51 (41%), Gaps = 3/51 (5%)
Query: 338 QPDSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQEEYT 388
QP +TTTTTTTTTT P T ++S P + +
Sbjct: 247 QPSKRPNNTTTTTTTTTTTTFQPR---TRKRKSIDDHKLSLNQAPEKFKNN 294
>gnl|CDD|234504 TIGR04216, halo_surf_glyco, major cell surface glycoprotein.
Members of this family are the S-layer-forming
halobacterial major cell surface glycoprotein. The
highest scores below model cutoffs are fragmentary
paralogs to actual members of the family. Modifications
include at N-linked and O-linked glycosylation, a
C-terminal diphytanylglyceryl modification, and probable
cleavage of the PGF-CTERM tail.
Length = 782
Score = 39.9 bits (93), Expect = 0.003
Identities = 20/48 (41%), Positives = 24/48 (50%), Gaps = 3/48 (6%)
Query: 338 QPD---SNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTP 382
+PD S + +TTTT TTT E T TTT T TT +TP
Sbjct: 713 RPDTTTSEDPTTTTTPTTTGPEETTETAEPTTTTEEPTEETTTGSSTP 760
Score = 37.6 bits (87), Expect = 0.013
Identities = 14/45 (31%), Positives = 19/45 (42%), Gaps = 1/45 (2%)
Query: 335 TICQPDSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTP 379
T + + T+ TTT TT P T T T+T + TP
Sbjct: 717 TTSEDPTTTTTPTTTGPEETTETAEPTTT-TEEPTEETTTGSSTP 760
Score = 35.6 bits (82), Expect = 0.051
Identities = 13/51 (25%), Positives = 17/51 (33%)
Query: 340 DSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQEEYTET 390
D + T T P + T +T +TT TT E T T
Sbjct: 696 DGDNTDRVNVEVVEETERPDTTTSEDPTTTTTPTTTGPEETTETAEPTTTT 746
Score = 34.9 bits (80), Expect = 0.077
Identities = 19/47 (40%), Positives = 22/47 (46%), Gaps = 4/47 (8%)
Query: 334 ETICQPDSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPT 380
T P + T TTT TT T PT TTT+ T TTT + T
Sbjct: 717 TTSEDPTTTTTPTTTGPEETT---ETAEPT-TTTEEPTEETTTGSST 759
Score = 33.7 bits (77), Expect = 0.19
Identities = 21/65 (32%), Positives = 31/65 (47%), Gaps = 7/65 (10%)
Query: 326 NNNTKIHGETICQPDSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQE 385
+N +++ E + + + +T+T+ TTTTTP T P TT T T TT
Sbjct: 698 DNTDRVNVEVVEETERPDTTTSEDPTTTTTPTTT-GPEETTE------TAEPTTTTEEPT 750
Query: 386 EYTET 390
E T T
Sbjct: 751 EETTT 755
Score = 30.2 bits (68), Expect = 2.1
Identities = 15/51 (29%), Positives = 21/51 (41%), Gaps = 2/51 (3%)
Query: 318 VIDAMETINNNTKIHGETICQPDSNETSTTTTTTT--TTTPEPTPAPTSTT 366
V++ E + T T P + TT T TTT EPT T+ +
Sbjct: 707 VVEETERPDTTTSEDPTTTTTPTTTGPEETTETAEPTTTTEEPTEETTTGS 757
>gnl|CDD|191251 pfam05283, MGC-24, Multi-glycosylated core protein 24 (MGC-24).
This family consists of several MGC-24 (or Cd164
antigen) proteins from eukaryotic organisms.
MGC-24/CD164 is a sialomucin expressed in many normal
and cancerous tissues. In humans, soluble and
transmembrane forms of MGC-24 are produced by
alternative splicing.
Length = 187
Score = 38.1 bits (88), Expect = 0.003
Identities = 15/50 (30%), Positives = 23/50 (46%)
Query: 341 SNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQEEYTET 390
S T+T T +T P TP+PT++ ++ T T T Q + T
Sbjct: 109 SVATTTPVPTNSTAKPTITPSPTTSHHHVTSEPKTNTTVTPTSQPDRKST 158
Score = 37.7 bits (87), Expect = 0.005
Identities = 14/62 (22%), Positives = 24/62 (38%)
Query: 321 AMETINNNTKIHGETICQPDSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPT 380
E + + +I+ T + T T +T T P+P + + TT TPT
Sbjct: 91 QAEVVKSGCQIYNTTDSCSVATTTPVPTNSTAKPTITPSPTTSHHHVTSEPKTNTTVTPT 150
Query: 381 TP 382
+
Sbjct: 151 SQ 152
Score = 29.2 bits (65), Expect = 3.4
Identities = 14/56 (25%), Positives = 22/56 (39%)
Query: 344 TSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQEEYTETITLELPQPV 399
TT + + TP PT++T + + T + T + E T T QP
Sbjct: 99 CQIYNTTDSCSVATTTPVPTNSTAKPTITPSPTTSHHHVTSEPKTNTTVTPTSQPD 154
>gnl|CDD|215964 pfam00513, Late_protein_L2, Late Protein L2.
Length = 466
Score = 38.8 bits (91), Expect = 0.004
Identities = 17/70 (24%), Positives = 25/70 (35%), Gaps = 9/70 (12%)
Query: 339 PDSNETSTTTTTTTTTTPEPTPA---PTSTTTQRSTTSTTTQTPTTPIQEEYTETITLEL 395
P + TT+ TPA T TT + T P +T+ L+
Sbjct: 116 PIPPIPGDGSGFPITTSSTTTPAILDVTPTTRTVHVSRTQYNNPL------FTDPSVLQP 169
Query: 396 PQPVETNNQI 405
PQP E + +
Sbjct: 170 PQPAEVSGHV 179
Score = 28.4 bits (64), Expect = 7.9
Identities = 17/77 (22%), Positives = 27/77 (35%), Gaps = 5/77 (6%)
Query: 339 PDSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTT---TQTPTTPIQEEYTETITLEL 395
+ + S TT T S++++ STT+TT + TP P+ Y I L
Sbjct: 366 VELDTYSDDLLLDETTEDFSTSQLVSSSSRTSTTNTTIPLSSTPDVPVY--YGPDIVLPE 423
Query: 396 PQPVETNNQIPVQDNLE 412
+P
Sbjct: 424 SPGTTPIVPVPPDLPTV 440
>gnl|CDD|114270 pfam05539, Pneumo_att_G, Pneumovirinae attachment membrane
glycoprotein G.
Length = 408
Score = 38.9 bits (90), Expect = 0.004
Identities = 16/46 (34%), Positives = 20/46 (43%), Gaps = 1/46 (2%)
Query: 345 STTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQEEYTET 390
+TT T P +T+ +RS ST T PTT QE T
Sbjct: 258 QSTTGDGQEHTQRRK-TPPATSNRRSPHSTATPPPTTKRQETGRPT 302
Score = 32.7 bits (74), Expect = 0.32
Identities = 11/60 (18%), Positives = 16/60 (26%), Gaps = 2/60 (3%)
Query: 345 STTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQEEYTETITLELPQPVETNNQ 404
S + + P T + +T TQ TP + P T Q
Sbjct: 239 SQRGPSGSPQHPPSTTSQDQST--TGDGQEHTQRRKTPPATSNRRSPHSTATPPPTTKRQ 296
Score = 28.5 bits (63), Expect = 8.0
Identities = 17/64 (26%), Positives = 26/64 (40%), Gaps = 13/64 (20%)
Query: 347 TTTTTTTTTPEPT-----PAPTSTTTQ--------RSTTSTTTQTPTTPIQEEYTETITL 393
T TT+ TT PT P+ T Q ++ T+ + T P+ + T T +
Sbjct: 170 TAVTTSKTTSWPTEVSHPTYPSQVTPQSQPATQGHQTATANQRLSSTEPVGTQGTTTSSN 229
Query: 394 ELPQ 397
PQ
Sbjct: 230 PEPQ 233
Score = 28.1 bits (62), Expect = 9.1
Identities = 21/85 (24%), Positives = 32/85 (37%), Gaps = 9/85 (10%)
Query: 341 SNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQEEYT------ETITLE 394
S E T TTT++ PEP P + QR + + P+T Q++ T T +
Sbjct: 215 STEPVGTQGTTTSSNPEPQTEPPPS--QRGPSGSPQHPPSTTSQDQSTTGDGQEHTQRRK 272
Query: 395 LPQPVETNNQIPVQDNLEKEKQPDH 419
P P +N + P
Sbjct: 273 TP-PATSNRRSPHSTATPPPTTKRQ 296
>gnl|CDD|115579 pfam06933, SSP160, Special lobe-specific silk protein SSP160. This
family consists of several special lobe-specific silk
protein SSP160 sequences which appear to be specific to
Chironomus (Midge) species.
Length = 758
Score = 38.2 bits (88), Expect = 0.008
Identities = 22/59 (37%), Positives = 31/59 (52%), Gaps = 2/59 (3%)
Query: 317 AVIDAMETINNNTKIHGETICQPDSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTT 375
A++ E N G +N T+++TTTTTTTT T APT+TTT+ + T
Sbjct: 655 AILAMFEAAYQNCTAPGSVTVPAAANTTTSSTTTTTTTT--TTAAPTTTTTKAANAPFT 711
Score = 36.7 bits (84), Expect = 0.024
Identities = 20/78 (25%), Positives = 35/78 (44%), Gaps = 8/78 (10%)
Query: 317 AVIDAMETINNNTKIHGETICQPDSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTT 376
A++ A+E N + T +++ +T+ +TT T STT+ ST+S+ +
Sbjct: 268 AILAALEAFANGSASANST--------SNSNSTSNSTTNSNSTTTTNSTTSTNSTSSSNS 319
Query: 377 QTPTTPIQEEYTETITLE 394
T I TI L+
Sbjct: 320 STIAGCIDIAANFTIALQ 337
Score = 33.2 bits (75), Expect = 0.27
Identities = 17/66 (25%), Positives = 33/66 (50%)
Query: 315 MRAVIDAMETINNNTKIHGETICQPDSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTST 374
+ A I A+++ + + + + SN STT+ +TT++ T ST++ ST+S
Sbjct: 91 ISANIQAIQSGSGSASGNSSSSANSTSNSNSTTSNNSTTSSNSTTTTSNSTSSSNSTSSG 150
Query: 375 TTQTPT 380
T +
Sbjct: 151 LTSGAS 156
Score = 31.3 bits (70), Expect = 1.2
Identities = 11/41 (26%), Positives = 24/41 (58%)
Query: 341 SNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTT 381
S +++++ +T+ T + STT+ STT+T+ T ++
Sbjct: 104 SASGNSSSSANSTSNSNSTTSNNSTTSSNSTTTTSNSTSSS 144
Score = 30.1 bits (67), Expect = 2.7
Identities = 15/44 (34%), Positives = 21/44 (47%), Gaps = 4/44 (9%)
Query: 339 PDSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTP 382
P S TTT++ T T+TTT + T+TTT+ P
Sbjct: 670 PGSVTVPAAANTTTSS----TTTTTTTTTTAAPTTTTTKAANAP 709
Score = 28.6 bits (63), Expect = 6.9
Identities = 12/28 (42%), Positives = 15/28 (53%)
Query: 354 TTPEPTPAPTSTTTQRSTTSTTTQTPTT 381
T P P + T S+T+TTT T TT
Sbjct: 668 TAPGSVTVPAAANTTTSSTTTTTTTTTT 695
>gnl|CDD|221315 pfam11921, DUF3439, Domain of unknown function (DUF3439). This
presumed domain is functionally uncharacterized. This
domain is found in eukaryotes. This domain is typically
between 46 to 94 amino acids in length. This domain is
found associated with pfam01462, pfam00560.
Length = 122
Score = 36.1 bits (83), Expect = 0.008
Identities = 15/22 (68%), Positives = 17/22 (77%)
Query: 344 TSTTTTTTTTTTPEPTPAPTST 365
T+TTTTTTTTT P T PT+T
Sbjct: 48 TTTTTTTTTTTMPTTTTLPTTT 69
Score = 34.9 bits (80), Expect = 0.016
Identities = 16/26 (61%), Positives = 18/26 (69%)
Query: 345 STTTTTTTTTTPEPTPAPTSTTTQRS 370
+TTTTTTTTTT PT TTT+ S
Sbjct: 47 TTTTTTTTTTTTMPTTTTLPTTTKMS 72
Score = 32.6 bits (74), Expect = 0.12
Identities = 22/69 (31%), Positives = 29/69 (42%), Gaps = 11/69 (15%)
Query: 325 INNNTKI--HGETICQPDSNETSTT---------TTTTTTTTPEPTPAPTSTTTQRSTTS 373
I N+ I PDS + S + TTTTTTT T PT+TT +T
Sbjct: 12 IQKNSGIVKDSSGYVDPDSAKCSGSGKPVRSIICPTTTTTTTTTTTTMPTTTTLPTTTKM 71
Query: 374 TTTQTPTTP 382
+ + P P
Sbjct: 72 SMVKVPLVP 80
>gnl|CDD|236792 PRK10905, PRK10905, cell division protein DamX; Validated.
Length = 328
Score = 36.5 bits (84), Expect = 0.019
Identities = 15/46 (32%), Positives = 19/46 (41%), Gaps = 1/46 (2%)
Query: 338 QPDSNETSTTTTTTTTTTPEPTPAPTST-TTQRSTTSTTTQTPTTP 382
P E + +T TPAP T TT T++ QT TP
Sbjct: 182 TPKRTEPAAPVASTKAPAATSTPAPKETATTAPVQTASPAQTTATP 227
Score = 32.2 bits (73), Expect = 0.43
Identities = 11/38 (28%), Positives = 18/38 (47%)
Query: 344 TSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTT 381
++T T+TP P T+ Q ++ + TT TP
Sbjct: 192 VASTKAPAATSTPAPKETATTAPVQTASPAQTTATPAA 229
Score = 31.4 bits (71), Expect = 0.84
Identities = 11/41 (26%), Positives = 14/41 (34%)
Query: 344 TSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQ 384
T T PA TST + T +T +P Q
Sbjct: 182 TPKRTEPAAPVASTKAPAATSTPAPKETATTAPVQTASPAQ 222
Score = 29.9 bits (67), Expect = 2.8
Identities = 10/39 (25%), Positives = 13/39 (33%)
Query: 344 TSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTP 382
T T T AP +T+T + TT T
Sbjct: 180 AQTPKRTEPAAPVASTKAPAATSTPAPKETATTAPVQTA 218
Score = 28.4 bits (63), Expect = 6.9
Identities = 13/55 (23%), Positives = 19/55 (34%), Gaps = 8/55 (14%)
Query: 354 TTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQEEYTETITLELPQPVETNNQIPVQ 408
T + P P + T +R+ + + P P P ET PVQ
Sbjct: 170 ATAKTEPKPVAQTPKRTEPAAPVASTKAP--------AATSTPAPKETATTAPVQ 216
Score = 28.0 bits (62), Expect = 8.8
Identities = 14/53 (26%), Positives = 18/53 (33%), Gaps = 5/53 (9%)
Query: 329 TKIHGETICQPDSNETSTTTTT-----TTTTTPEPTPAPTSTTTQRSTTSTTT 376
T E S + T+T T TT P T +P TT + T
Sbjct: 182 TPKRTEPAAPVASTKAPAATSTPAPKETATTAPVQTASPAQTTATPAAGGKTA 234
>gnl|CDD|238064 cd00116, LRR_RI, Leucine-rich repeats (LRRs), ribonuclease
inhibitor (RI)-like subfamily. LRRs are 20-29 residue
sequence motifs present in many proteins that
participate in protein-protein interactions and have
different functions and cellular locations. LRRs
correspond to structural units consisting of a beta
strand (LxxLxLxxN/CxL conserved pattern) and an alpha
helix. This alignment contains 12 strands corresponding
to 11 full repeats, consistent with the extent observed
in the subfamily acting as Ran GTPase Activating
Proteins (RanGAP1).
Length = 319
Score = 36.6 bits (85), Expect = 0.020
Identities = 36/168 (21%), Positives = 55/168 (32%), Gaps = 17/168 (10%)
Query: 120 KLAYLYLSHNQLT----EFLLDDIRGLKRLRTVDLSYNKINKFGTRNEGK----NQVQGV 171
L L L N+L E L +R + L+ ++L+ N I G R + N V
Sbjct: 138 ALEKLVLGRNRLEGASCEALAKALRANRDLKELNLANNGIGDAGIRALAEGLKANCNLEV 197
Query: 172 TNIFELKLQHNEIENLDGALMGIHGLSRLDLSHNKLR-----TISPDDFIGLDSLKMLDI 226
++ L L L + L L+L N L ++ SL L +
Sbjct: 198 LDLNNNGLTDEGASALAETLASLKSLEVLNLGDNNLTDAGAAALASALLSPNISLLTLSL 257
Query: 227 SHNLLTTLE-ETSKTFLPA---LEELFVSHNSLTRLDKDFHGLPVLCK 270
S N +T + L L EL + N +L
Sbjct: 258 SCNDITDDGAKDLAEVLAEKESLLELDLRGNKFGEEGAQLLAESLLEP 305
Score = 33.5 bits (77), Expect = 0.17
Identities = 44/212 (20%), Positives = 69/212 (32%), Gaps = 53/212 (25%)
Query: 26 SKLQLLVIEQNQLEALPSDI------QLFSQLGSLYANNNRITS-----LDGLLRGLTKL 74
S LQ L + N L + L L L NR+ L LR L
Sbjct: 108 SSLQELKLNNNGLGDRGLRLLAKGLKDLPPALEKLVLGRNRLEGASCEALAKALRANRDL 167
Query: 75 QVFNMDFNQITM--VRR--DEFQNLHNLDSISLQNNQITSMNSS-----LSGLTKLAYLY 125
+ N+ N I +R + + NL+ + L NN +T +S L+ L L L
Sbjct: 168 KELNLANNGIGDAGIRALAEGLKANCNLEVLDLNNNGLTDEGASALAETLASLKSLEVLN 227
Query: 126 LSHNQLTEF--------LLDDIRGLKRL-------------------------RTVDLSY 152
L N LT+ LL L L +DL
Sbjct: 228 LGDNNLTDAGAAALASALLSPNISLLTLSLSCNDITDDGAKDLAEVLAEKESLLELDLRG 287
Query: 153 NKINKFGTRNEGKNQVQGVTNIFELKLQHNEI 184
NK + G + ++ ++ + L ++ +
Sbjct: 288 NKFGEEGAQLLAESLLEPGNELESLWVKDDSF 319
>gnl|CDD|218161 pfam04589, RFX1_trans_act, RFX1 transcription activation region.
The RFX family is a family of winged-helix DNA binding
proteins. RFX1 is a regulatory factor essential for
expression of MHC class II genes. This region is to
found N terminal to the RFX DNA binding region
(pfam02257) in some mammalian RFX proteins, and is
thought to activate transcription when associated with
DNA. Deletion analysis has identified the region 233-351
in human RFX1 as being required for maximal activation.
Length = 150
Score = 35.0 bits (80), Expect = 0.025
Identities = 18/70 (25%), Positives = 28/70 (40%), Gaps = 8/70 (11%)
Query: 340 DSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQEEYTETITLELPQPV 399
S S + + T P AP + QRS T+QT P+Q+ L + +
Sbjct: 3 TSEGGSDSPASVALQTSVPAQAPVPASQQRSVVQATSQTKGGPVQQ-------LPVHRVQ 55
Query: 400 ETNNQIP-VQ 408
+ Q+ VQ
Sbjct: 56 QVPQQVQQVQ 65
>gnl|CDD|237555 PRK13914, PRK13914, invasion associated secreted endopeptidase;
Provisional.
Length = 481
Score = 35.9 bits (82), Expect = 0.032
Identities = 19/89 (21%), Positives = 32/89 (35%), Gaps = 5/89 (5%)
Query: 321 AMETINNNTKIHGETICQPDSNETSTTTTTTTTTTPEPTPAPT-----STTTQRSTTSTT 375
A + NT + T + ++ T T +P PAP+ + T + T+T
Sbjct: 267 AAPVVKENTNTNTATTEKKETTTQQQTAPKAPTEAAKPAPAPSTNTNANKTNTNTNTNTN 326
Query: 376 TQTPTTPIQEEYTETITLELPQPVETNNQ 404
+TP + T T + NQ
Sbjct: 327 NTNTSTPSKNTNTNTNSNTNTNSNTNANQ 355
>gnl|CDD|178695 PLN03150, PLN03150, hypothetical protein; Provisional.
Length = 623
Score = 35.6 bits (82), Expect = 0.056
Identities = 29/109 (26%), Positives = 47/109 (43%), Gaps = 28/109 (25%)
Query: 124 LYLSHNQLTEFLLDDIRGLKRLRTVDLSYNKINKFGTRNEGKNQVQGVTNIFELKLQHNE 183
L L + L F+ +DI L+ L++++LS N I G
Sbjct: 423 LGLDNQGLRGFIPNDISKLRHLQSINLSGNSI-------RG------------------- 456
Query: 184 IENLDGALMGIHGLSRLDLSHNKLRTISPDDFIGLDSLKMLDISHNLLT 232
N+ +L I L LDLS+N P+ L SL++L+++ N L+
Sbjct: 457 --NIPPSLGSITSLEVLDLSYNSFNGSIPESLGQLTSLRILNLNGNSLS 503
Score = 33.3 bits (76), Expect = 0.26
Identities = 21/75 (28%), Positives = 36/75 (48%), Gaps = 1/75 (1%)
Query: 194 IHGLSRLDLSHNKLRTISPDDFIGLDSLKMLDISHNLLTTLEETSKTFLPALEELFVSHN 253
+ L ++LS N +R P + SL++LD+S+N S L +L L ++ N
Sbjct: 441 LRHLQSINLSGNSIRGNIPPSLGSITSLEVLDLSYNSFNGSIPESLGQLTSLRILNLNGN 500
Query: 254 SLT-RLDKDFHGLPV 267
SL+ R+ G +
Sbjct: 501 SLSGRVPAALGGRLL 515
Score = 32.9 bits (75), Expect = 0.32
Identities = 20/65 (30%), Positives = 34/65 (52%), Gaps = 1/65 (1%)
Query: 90 DEFQNLHNLDSISLQNNQIT-SMNSSLSGLTKLAYLYLSHNQLTEFLLDDIRGLKRLRTV 148
++ L +L SI+L N I ++ SL +T L L LS+N + + + L LR +
Sbjct: 436 NDISKLRHLQSINLSGNSIRGNIPPSLGSITSLEVLDLSYNSFNGSIPESLGQLTSLRIL 495
Query: 149 DLSYN 153
+L+ N
Sbjct: 496 NLNGN 500
>gnl|CDD|223033 PHA03291, PHA03291, envelope glycoprotein I; Provisional.
Length = 401
Score = 35.3 bits (81), Expect = 0.057
Identities = 17/46 (36%), Positives = 21/46 (45%), Gaps = 2/46 (4%)
Query: 339 PDSNETSTT--TTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTP 382
S ET+ T TTT+ +T P P+ T Q TT TP P
Sbjct: 215 TASPETTPTPSTTTSPPSTTIPAPSTTIAAPQAGTTPEAEGTPAPP 260
Score = 32.2 bits (73), Expect = 0.54
Identities = 15/50 (30%), Positives = 23/50 (46%), Gaps = 1/50 (2%)
Query: 334 ETICQPDSNET-STTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTP 382
T P++ T STTT+ +TT P P+ + + + T P TP
Sbjct: 213 RTTASPETTPTPSTTTSPPSTTIPAPSTTIAAPQAGTTPEAEGTPAPPTP 262
Score = 30.7 bits (69), Expect = 1.7
Identities = 14/53 (26%), Positives = 20/53 (37%)
Query: 338 QPDSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQEEYTET 390
P + + TT + TTP P+ + +T STT P E T
Sbjct: 204 VPATPRPTPRTTASPETTPTPSTTTSPPSTTIPAPSTTIAAPQAGTTPEAEGT 256
>gnl|CDD|217197 pfam02711, Pap_E4, E4 protein. This is is a family of
Papillomavirus proteins, E4, coded for by ORF4. A splice
variant, E1--E4, exists but neither the function of E4
or E1--E4 is known.
Length = 95
Score = 32.6 bits (75), Expect = 0.060
Identities = 14/41 (34%), Positives = 22/41 (53%)
Query: 340 DSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPT 380
DS+ +STT+ ++ T T P + TT S+ + T QT
Sbjct: 45 DSDSSSTTSPSSPPTPQPTTECPWTVTTTGSSVTLTAQTKD 85
>gnl|CDD|180536 PRK06347, PRK06347, autolysin; Reviewed.
Length = 592
Score = 35.4 bits (81), Expect = 0.060
Identities = 17/73 (23%), Positives = 28/73 (38%), Gaps = 1/73 (1%)
Query: 341 SNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQEEYTETITLELPQPVE 400
+ E T T TT P P T T Q + T Q +++ E T+ P
Sbjct: 70 TKEAPATATPENTTEPTVEPKQTETKEQ-TKTPEEKQPAAKQVEKAPAEPATVSNPDNAT 128
Query: 401 TNNQIPVQDNLEK 413
+++ + L+K
Sbjct: 129 SSSTPATYNLLQK 141
>gnl|CDD|132198 TIGR03154, sulfolob_CbsA, cytochrome b558/566, subunit A. Members
of this protein family are CbsA, one subunit of a highly
glycosylated, heterodimeric, mono-heme cytochrome
b558/566, found in Sulfolobus acidocaldarius and several
other members of the Sulfolobales, a branch of the
Crenarchaeota.
Length = 465
Score = 35.0 bits (80), Expect = 0.062
Identities = 19/49 (38%), Positives = 27/49 (55%), Gaps = 2/49 (4%)
Query: 333 GETICQPDSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTT 381
GET+ D + TS+ T TTP P T+T+T STT+++ TT
Sbjct: 395 GETLF--DKSITSSFLTLELVTTPPTPPTSTTTSTSPSTTTSSAIPSTT 441
Score = 28.8 bits (64), Expect = 5.4
Identities = 11/33 (33%), Positives = 18/33 (54%)
Query: 344 TSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTT 376
+T T T+TT +P+ T+++ STT T
Sbjct: 413 VTTPPTPPTSTTTSTSPSTTTSSAIPSTTLYVT 445
>gnl|CDD|216421 pfam01299, Lamp, Lysosome-associated membrane glycoprotein (Lamp).
Length = 305
Score = 34.7 bits (80), Expect = 0.067
Identities = 15/53 (28%), Positives = 22/53 (41%), Gaps = 4/53 (7%)
Query: 326 NNNTKIHGETICQPDSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQT 378
+N T ET C + + TT T +P PTP P+S + + T
Sbjct: 70 SNGTFSKTETRC----EADTPSPTTVATPSPSPTPVPSSPAVGNYSVTNGNGT 118
Score = 28.9 bits (65), Expect = 4.4
Identities = 10/40 (25%), Positives = 14/40 (35%)
Query: 344 TSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPI 383
T + + TT P T T T + T T T +
Sbjct: 7 TFSYNLSDTTLFPNATSKGVKTVTSSTDTKAPTNTTYRCV 46
>gnl|CDD|227281 COG4945, COG4945, Membrane-anchored protein predicted to be
involved in regulation of amylopullulanase [Carbohydrate
transport and metabolism].
Length = 570
Score = 34.9 bits (80), Expect = 0.081
Identities = 14/63 (22%), Positives = 23/63 (36%), Gaps = 1/63 (1%)
Query: 343 ETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQEEYTETITLELPQPVETN 402
+ T + T T T T+T T+ TT+ T T T + I P+ +
Sbjct: 489 YIAYTQGGSITLTWTQTYTLTNTVTEMRTTTGTMLTSETF-SSPILQPIPYLYPEADPST 547
Query: 403 NQI 405
+
Sbjct: 548 LSV 550
Score = 28.4 bits (63), Expect = 8.5
Identities = 14/58 (24%), Positives = 19/58 (32%), Gaps = 4/58 (6%)
Query: 341 SNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTS---TTTQTPTTPIQEEYTETITLEL 395
+ T T T T T T TTT TS ++ P + TL +
Sbjct: 494 QGGSITLTW-TQTYTLTNTVTEMRTTTGTMLTSETFSSPILQPIPYLYPEADPSTLSV 550
>gnl|CDD|236776 PRK10856, PRK10856, cytoskeletal protein RodZ; Provisional.
Length = 331
Score = 34.6 bits (80), Expect = 0.087
Identities = 15/49 (30%), Positives = 19/49 (38%), Gaps = 1/49 (2%)
Query: 338 QPDSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTT-QTPTTPIQE 385
S T+TTT TTP P +T T T + T P Q+
Sbjct: 158 SGQSVPLDTSTTTDPATTPAPAAPVDTTPTNSQTPAVATAPAPAVDPQQ 206
Score = 33.1 bits (76), Expect = 0.24
Identities = 12/45 (26%), Positives = 16/45 (35%)
Query: 338 QPDSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTP 382
Q T+TTT P PA TT ++ + T P
Sbjct: 156 QNSGQSVPLDTSTTTDPATTPAPAAPVDTTPTNSQTPAVATAPAP 200
Score = 28.8 bits (65), Expect = 5.6
Identities = 8/45 (17%), Positives = 10/45 (22%)
Query: 340 DSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQ 384
T+ TTP + P T P Q
Sbjct: 170 TDPATTPAPAAPVDTTPTNSQTPAVATAPAPAVDPQQNAVVAPSQ 214
>gnl|CDD|236782 PRK10871, nlpD, lipoprotein NlpD; Provisional.
Length = 319
Score = 34.0 bits (78), Expect = 0.12
Identities = 15/43 (34%), Positives = 25/43 (58%), Gaps = 7/43 (16%)
Query: 339 PDSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTT 381
P++ +TT T T APT++TT+ + +ST+T TP +
Sbjct: 163 PNNKPAATTVTAPVT-------APTASTTEPTASSTSTSTPIS 198
>gnl|CDD|183854 PRK13042, PRK13042, superantigen-like protein; Reviewed.
Length = 291
Score = 33.8 bits (77), Expect = 0.14
Identities = 20/63 (31%), Positives = 30/63 (47%), Gaps = 8/63 (12%)
Query: 319 IDAMETINNNTKIHGETICQPDSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQT 378
++A ++ +TK+ P S +TT +T P+ TP TT ST T Q+
Sbjct: 38 VEAPQSTPPSTKVEA-----PQSKPNATTPPSTKVEAPQQTPNA---TTPSSTKVETPQS 89
Query: 379 PTT 381
PTT
Sbjct: 90 PTT 92
Score = 28.8 bits (64), Expect = 4.7
Identities = 22/75 (29%), Positives = 31/75 (41%), Gaps = 10/75 (13%)
Query: 339 PDSNETSTTTTTTTTTTPE-PTPAPTSTTTQRSTTSTTTQTP--TTPIQEEYTETITLEL 395
P S + +T +T E P P +TT + QTP TTP + +E
Sbjct: 33 PSSTKVEAPQSTPPSTKVEAPQSKPNATTPPSTKVEAPQQTPNATTP------SSTKVET 86
Query: 396 PQPVETNNQIPVQDN 410
PQ T Q+P + N
Sbjct: 87 PQS-PTTKQVPTEIN 100
>gnl|CDD|165099 PHA02732, PHA02732, hypothetical protein; Provisional.
Length = 1467
Score = 34.3 bits (78), Expect = 0.16
Identities = 11/66 (16%), Positives = 28/66 (42%), Gaps = 3/66 (4%)
Query: 331 IHGETICQPDSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQEEYTET 390
+ G + + + ++ PTP P+S+++ S+ + + +P IQ +
Sbjct: 1213 MSGTNTFTFKAAQLQLSAASSPPAATTPTPPPSSSSS--SSAQSISTSPGQ-IQIVLNGS 1269
Query: 391 ITLELP 396
T+ +
Sbjct: 1270 TTIHIN 1275
>gnl|CDD|114205 pfam05467, Herpes_U47, Herpesvirus glycoprotein U47.
Length = 627
Score = 34.1 bits (77), Expect = 0.16
Identities = 25/79 (31%), Positives = 35/79 (44%), Gaps = 11/79 (13%)
Query: 339 PDSNETSTTTTTTT-----TTTPEPTPAPTSTTTQRSTTSTTTQTPTTP-IQEEYTETIT 392
P S +ST+ + T+ T P P P P + T T TP TP I + TE+I
Sbjct: 241 PSSTPSSTSASITSPHIPSTNIPTPEPPPVTKNFTELHTDTIKVTPNTPTITAQTTESIK 300
Query: 393 -----LELPQPVETNNQIP 406
+ P+P+ T IP
Sbjct: 301 KIVKRSDFPRPMYTPTDIP 319
>gnl|CDD|227655 COG5352, COG5352, Uncharacterized protein conserved in bacteria
[Function unknown].
Length = 169
Score = 32.6 bits (74), Expect = 0.22
Identities = 15/68 (22%), Positives = 26/68 (38%), Gaps = 8/68 (11%)
Query: 352 TTTTPEPTPAPTSTTTQ------RSTTSTTTQTPTTPIQEEY-TETITLELPQPVETNNQ 404
T T++ + +S T T+T T +Q E T + +P E N
Sbjct: 50 KPTAAPARQKKTTSAPRAPVRAAQSVRHTPTRTGATALQAEVATPVVAPRYIRPAE-NVV 108
Query: 405 IPVQDNLE 412
+P+ +L
Sbjct: 109 LPISRHLT 116
>gnl|CDD|183558 PRK12495, PRK12495, hypothetical protein; Provisional.
Length = 226
Score = 32.5 bits (74), Expect = 0.28
Identities = 12/73 (16%), Positives = 21/73 (28%), Gaps = 1/73 (1%)
Query: 338 QPDSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTS-TTTQTPTTPIQEEYTETITLELP 396
Q E S+T+ T T P A T+ T ++ + P
Sbjct: 111 QSAPPEASSTSATDEAATDPPATAAARDGPTPDPTAQPATPDERRSPRQRPPVSGEPPTP 170
Query: 397 QPVETNNQIPVQD 409
+ + +Q
Sbjct: 171 STPDAHVAGTLQA 183
Score = 27.9 bits (62), Expect = 8.7
Identities = 11/52 (21%), Positives = 16/52 (30%), Gaps = 1/52 (1%)
Query: 340 DSNETSTTTTTT-TTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQEEYTET 390
S+ S + E A S + S+TS T + T P
Sbjct: 87 PSDAGSQASPDDDAQPAAEAEAADQSAPPEASSTSATDEAATDPPATAAARD 138
>gnl|CDD|237191 PRK12757, PRK12757, cell division protein FtsN; Provisional.
Length = 256
Score = 32.7 bits (75), Expect = 0.28
Identities = 16/78 (20%), Positives = 22/78 (28%), Gaps = 11/78 (14%)
Query: 340 DSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQEEYTETITLELPQPV 399
+ + +T + P T+ Q T T P P P PV
Sbjct: 110 QTPQVPRSTVQIQQQAQQQQPPATTAQPQPVTPPRQTTAPVQPQT-----------PAPV 158
Query: 400 ETNNQIPVQDNLEKEKQP 417
T PV +E K
Sbjct: 159 RTQPAAPVTQAVEAPKVE 176
Score = 30.0 bits (68), Expect = 1.8
Identities = 10/50 (20%), Positives = 15/50 (30%)
Query: 335 TICQPDSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQ 384
T+ + TT P P T+ Q T + P P+
Sbjct: 118 TVQIQQQAQQQQPPATTAQPQPVTPPRQTTAPVQPQTPAPVRTQPAAPVT 167
Score = 30.0 bits (68), Expect = 2.2
Identities = 16/77 (20%), Positives = 28/77 (36%), Gaps = 1/77 (1%)
Query: 339 PDSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQEEYTETITLELPQP 398
P + +T +T + TT + T + T P+Q + + + P
Sbjct: 106 PYNEQTPQVPRSTVQIQQQAQQQQPPATTAQPQPVTPPRQTTAPVQPQTPAPVRTQPAAP 165
Query: 399 VETNNQIPVQDNLEKEK 415
V + P + EKEK
Sbjct: 166 VTQAVEAPKVEA-EKEK 181
>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
Length = 3151
Score = 33.4 bits (76), Expect = 0.32
Identities = 17/91 (18%), Positives = 26/91 (28%), Gaps = 13/91 (14%)
Query: 339 PDSNETSTTTTTTTTTTPEPTPAPTSTTT-----------QRSTTSTTTQTPTTPIQEEY 387
P T+ T P P P P S +R + + P P +
Sbjct: 2824 PAGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPGGDVRRRPPSRSPAAKPAAPARPPV 2883
Query: 388 TETITLELPQPVETNNQIPVQDNLEKEKQPD 418
+ + E+ P D E+ QP
Sbjct: 2884 RRLARPAVSRSTESFALPP--DQPERPPQPQ 2912
>gnl|CDD|218439 pfam05109, Herpes_BLLF1, Herpes virus major outer envelope
glycoprotein (BLLF1). This family consists of the BLLF1
viral late glycoprotein, also termed gp350/220. It is
the most abundantly expressed glycoprotein in the viral
envelope of the Herpesviruses and is the major antigen
responsible for stimulating the production of
neutralising antibodies in vivo.
Length = 830
Score = 33.2 bits (75), Expect = 0.33
Identities = 16/45 (35%), Positives = 23/45 (51%)
Query: 339 PDSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPI 383
P S TS T T+ T TP TS TTQ+++ + +PT +
Sbjct: 488 PTSRTTSATPNATSPTPAVTTPNATSPTTQKTSDTPNATSPTPIV 532
Score = 32.1 bits (72), Expect = 0.58
Identities = 16/85 (18%), Positives = 27/85 (31%), Gaps = 4/85 (4%)
Query: 321 AMETINNNTKIHGETICQPDSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPT 380
+ N + P++ +T T+ T PTP TT ++ T T +
Sbjct: 491 RTTSATPNATSPTPAVTTPNATSPTTQKTSDTPNATSPTPIVIGVTTTATSPPTGTTSVP 550
Query: 381 TPIQEEYTETITLELPQPVETNNQI 405
+ TE E P +
Sbjct: 551 NATSPQVTE----ESPVNNTNTPVV 571
Score = 30.9 bits (69), Expect = 1.5
Identities = 18/70 (25%), Positives = 26/70 (37%), Gaps = 2/70 (2%)
Query: 333 GETICQPDSNETSTTTTTTTTTTPEPT-PAPTSTTTQRSTTSTTTQTPTTPIQEEYTETI 391
G S T+ T+ TT+ P +PT T + TS TTQ T+ + T
Sbjct: 472 GTPTGTTSSTLPEDTSPTSRTTSATPNATSPTPAVTTPNATSPTTQK-TSDTPNATSPTP 530
Query: 392 TLELPQPVET 401
+ T
Sbjct: 531 IVIGVTTTAT 540
Score = 30.9 bits (69), Expect = 1.6
Identities = 24/83 (28%), Positives = 32/83 (38%), Gaps = 10/83 (12%)
Query: 327 NNTKIHGETICQPDSNETSTTTTTTTTTTPEPT------PAPTSTTTQRSTTSTTTQTPT 380
NNT + S TS TT T PT P+ + +T + ++TSTT
Sbjct: 564 NNTNTP--VVTSAPSVLTSAVTTGQHGTGSSPTSQQPGIPSSSHSTPRSNSTSTT--PLL 619
Query: 381 TPIQEEYTETITLELPQPVETNN 403
T E IT E P T +
Sbjct: 620 TSAHPTGGENITEETPSVPSTTH 642
Score = 30.5 bits (68), Expect = 2.2
Identities = 17/77 (22%), Positives = 23/77 (29%), Gaps = 1/77 (1%)
Query: 326 NNNTKIHGETICQPDSNETSTTTTTTTTTTPEPT-PAPTSTTTQRSTTSTTTQTPTTPIQ 384
N I + +N T+TT P+ T T TT PTTP
Sbjct: 396 NPVADAKTLIITRTATNATTTTHKVVFHKAPDTTKSVIFVYTLVHVEPHKTTAVPTTPSL 455
Query: 385 EEYTETITLELPQPVET 401
+ T+ P
Sbjct: 456 PPASTGPTVSTADPTSG 472
>gnl|CDD|212004 TIGR04281, peripla_PGF_1, putative ABC transporter
PGF-CTERM-modified substrate-binding protein. Members
of this archaeal protein family resemble periplasmic
substrate-binding proteins of ABC transporters and
appear in gene neighborhoods with permease and
ATP-binding cassette proteins. Notably, essentially all
members also have the PGF-CTERM putative protein-sorting
domain at the C-terminus, while more distant homologs
(excluded by the trusted cutoff) instead have what
appear to be lipoprotein signal peptides at the
N-terminus.
Length = 330
Score = 32.3 bits (74), Expect = 0.39
Identities = 17/59 (28%), Positives = 25/59 (42%), Gaps = 9/59 (15%)
Query: 318 VIDAMETINNNTKIHGETICQPDSNET-STTTTTTTTTTPEPTPAPTSTTTQRSTTSTT 375
V++A+ET+ E P++ E T T TT + T TT ST + T
Sbjct: 253 VVEAVETL-------AEAF-HPEAYEEAETADTEADTTYDDSTTDAEDGTTTTSTGTAT 303
>gnl|CDD|165021 PHA02638, PHA02638, CC chemokine receptor-like protein;
Provisional.
Length = 417
Score = 32.3 bits (73), Expect = 0.47
Identities = 22/83 (26%), Positives = 39/83 (46%), Gaps = 5/83 (6%)
Query: 314 SMRAVIDAMETINNNTKIHGETICQPDSNETSTTTTTTTTTTP-EPTPAPTSTTTQRSTT 372
++ ++I + T++ T ET + ST T T P +P+P S ++ S++
Sbjct: 7 TLSSIILSSSTLSPTTFFTIETSMDESKSIISTFTEIIPTEIPTSESPSPNSNSSSSSSS 66
Query: 373 STTTQTPTTPIQEEYTETITLEL 395
S+++ T EY IT EL
Sbjct: 67 SSSSIT----YDYEYENNITYEL 85
Score = 28.1 bits (62), Expect = 10.0
Identities = 15/58 (25%), Positives = 23/58 (39%), Gaps = 4/58 (6%)
Query: 342 NETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTT-TQTPTTPIQEEYTETITLELPQP 398
+ +STT ++ ++ +P T S + T T I TE T E P P
Sbjct: 2 DNSSTTLSSIILSSSTLSPTTFFTIETSMDESKSIISTFTEIIP---TEIPTSESPSP 56
>gnl|CDD|221093 pfam11359, gpUL132, Glycoprotein UL132. Glycoprotein UL132 is a
low-abundance structural component of Human
cytomegalovirus (HCMV). The function of this protein is
not fully understood.
Length = 235
Score = 31.9 bits (72), Expect = 0.48
Identities = 17/38 (44%), Positives = 19/38 (50%)
Query: 344 TSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTT 381
TS+TTT TT TS TT ST TT T +T
Sbjct: 2 TSSTTTPANTTATVTVTVATSNTTSVSTNVTTALTAST 39
Score = 28.1 bits (62), Expect = 8.6
Identities = 12/45 (26%), Positives = 19/45 (42%)
Query: 341 SNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQE 385
+ + TT T T T ST + T++TT P + + E
Sbjct: 5 TTTPANTTATVTVTVATSNTTSVSTNVTTALTASTTAEPGSVLTE 49
>gnl|CDD|184900 PRK14907, rplD, 50S ribosomal protein L4; Provisional.
Length = 295
Score = 31.8 bits (72), Expect = 0.52
Identities = 12/41 (29%), Positives = 16/41 (39%), Gaps = 2/41 (4%)
Query: 343 ETSTTTTTTTTT--TPEPTPAPTSTTTQRSTTSTTTQTPTT 381
ET TT TT P A TS T ++ + T +
Sbjct: 3 ETKKTTKKKTTEEKKPAAKKATTSKETAKTKKTAKTTSTKA 43
>gnl|CDD|237862 PRK14948, PRK14948, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 620
Score = 32.2 bits (74), Expect = 0.54
Identities = 12/75 (16%), Positives = 24/75 (32%)
Query: 338 QPDSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQEEYTETITLELPQ 397
+++ + T + P P++ T+++ T+ + T P
Sbjct: 368 IANASAPANPTPAPNPSPPPAPIQPSAPKTKQAATTPSPPPAKASPPIPVPAEPTEPSPT 427
Query: 398 PVETNNQIPVQDNLE 412
P P NLE
Sbjct: 428 PPANAANAPPSLNLE 442
>gnl|CDD|165513 PHA03255, PHA03255, BDLF3; Provisional.
Length = 234
Score = 31.8 bits (71), Expect = 0.56
Identities = 24/77 (31%), Positives = 30/77 (38%), Gaps = 7/77 (9%)
Query: 324 TINNNTKIHGETICQPDSNETSTTTTTTTTTTPEPTPAPTSTT--TQRSTTSTTTQTPTT 381
TIN TK+ + I T T T+T T T +STT T R T +TT +
Sbjct: 99 TINVTTKVTAQNITA-----TEAGTGTSTGVTSNVTTRSSSTTSATTRITNATTLAPTLS 153
Query: 382 PIQEEYTETITLELPQP 398
T ELP
Sbjct: 154 SKGTSNATKTTAELPTV 170
Score = 30.3 bits (67), Expect = 1.4
Identities = 21/54 (38%), Positives = 28/54 (51%), Gaps = 2/54 (3%)
Query: 341 SNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTP--TTPIQEEYTETIT 392
S+ S T TT P+P+ + +T +STT TTT P TT I T T+T
Sbjct: 29 SSTASAGNVTGTTAVTTPSPSASGPSTNQSTTLTTTSAPITTTAILSTNTTTVT 82
Score = 29.9 bits (66), Expect = 2.0
Identities = 19/49 (38%), Positives = 25/49 (51%)
Query: 344 TSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQEEYTETIT 392
T+ +TT TTT+ P T A ST T T++ TT TP T +T
Sbjct: 55 TNQSTTLTTTSAPITTTAILSTNTTTVTSTGTTVTPVPTTSNASTINVT 103
Score = 29.1 bits (64), Expect = 3.8
Identities = 16/37 (43%), Positives = 21/37 (56%)
Query: 344 TSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPT 380
T+TTT T+T TT P P ++ +T TT T Q T
Sbjct: 76 TNTTTVTSTGTTVTPVPTTSNASTINVTTKVTAQNIT 112
>gnl|CDD|215299 PLN02543, PLN02543, pfkB-type carbohydrate kinase family protein.
Length = 496
Score = 32.2 bits (73), Expect = 0.59
Identities = 11/61 (18%), Positives = 16/61 (26%), Gaps = 9/61 (14%)
Query: 328 NTKIHGETICQPDSNETSTTTTTTTTTTPEP---------TPAPTSTTTQRSTTSTTTQT 378
K + +T +P+P T +P T R T T Q
Sbjct: 36 KPKASLHPSIKRSRPGRCSTNGAAVPESPKPSRRGRKKKPTSSPPKAKTTRRRTKKTDQE 95
Query: 379 P 379
Sbjct: 96 L 96
>gnl|CDD|218673 pfam05642, Sporozoite_P67, Sporozoite P67 surface antigen. This
family consists of several Theileria P67 surface
antigens. A stage specific surface antigen of Theileria
parva, p67, is the basis for the development of an
anti-sporozoite vaccine for the control of East Coast
fever (ECF) in cattle. The antigen has been shown to
contain five distinct linear peptide sequences
recognised by sporozoite-neutralising murine monoclonal
antibodies.
Length = 727
Score = 32.0 bits (72), Expect = 0.65
Identities = 13/51 (25%), Positives = 26/51 (50%)
Query: 333 GETICQPDSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPI 383
G++ P + S +TTTTT + P+ ++ R+ + +T + + PI
Sbjct: 271 GDSTSSPSTTRPSGSTTTTTPASSGPSAPGGPGSSSRNAVTRSTDSISGPI 321
Score = 28.1 bits (62), Expect = 9.6
Identities = 13/44 (29%), Positives = 19/44 (43%)
Query: 339 PDSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTP 382
STTTTT ++ P P S++ T ST + + P
Sbjct: 279 TTRPSGSTTTTTPASSGPSAPGGPGSSSRNAVTRSTDSISGPIP 322
>gnl|CDD|139494 PRK13335, PRK13335, superantigen-like protein; Reviewed.
Length = 356
Score = 31.6 bits (71), Expect = 0.66
Identities = 17/59 (28%), Positives = 28/59 (47%), Gaps = 1/59 (1%)
Query: 335 TICQPDSNETSTTTTTTTTTTPEPT-PAPTSTTTQRSTTSTTTQTPTTPIQEEYTETIT 392
I + + + TTT +TTP+ P ST T + ST + TP +P ++ +T
Sbjct: 107 NISATPAPKQEQSQTTTESTTPKTKVTTPPSTNTPQPMQSTKSDTPQSPTIKQAQTDMT 165
Score = 31.6 bits (71), Expect = 0.70
Identities = 19/88 (21%), Positives = 35/88 (39%), Gaps = 12/88 (13%)
Query: 339 PDSNETSTTTTTTTTTTPEPT--------PAPTSTTTQRSTTSTTTQTPTTPIQEEYTET 390
P++NE T+ + + A + ++S T+T + TP T + T
Sbjct: 80 PNTNEEKTSASKIEKISQPKQEEQKSLNISATPAPKQEQSQTTTESTTPKTKV----TTP 135
Query: 391 ITLELPQPVETNNQIPVQDNLEKEKQPD 418
+ PQP+++ Q K+ Q D
Sbjct: 136 PSTNTPQPMQSTKSDTPQSPTIKQAQTD 163
>gnl|CDD|151471 pfam11024, DGF-1_4, Dispersed gene family protein 1 of Trypanosoma
cruzi region 4. This protein is likely to be highly
expressed, and is expressed from the sub-telomeric
region. However, the function is not known. Other
domains on this protein include DGF-1_N, DGF-1_2, and
DGF-1_5. This domain is just downstream from the
C-terminus, but not the C-terminus of proteins, also
annotated as being DGF-1, that constitute family
DGF-1_C.
Length = 75
Score = 29.5 bits (66), Expect = 0.67
Identities = 19/49 (38%), Positives = 24/49 (48%), Gaps = 5/49 (10%)
Query: 345 STTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQEEYTETITL 393
S T T T T + PT P+ +TT S T PT +Q TET+ L
Sbjct: 14 SHTATLTETRSLTPTWTPSMSTTHYSPTQ---YGPTETLQ--VTETVAL 57
>gnl|CDD|215633 PLN03210, PLN03210, Resistant to P. syringae 6; Provisional.
Length = 1153
Score = 31.8 bits (72), Expect = 0.85
Identities = 66/261 (25%), Positives = 107/261 (40%), Gaps = 52/261 (19%)
Query: 3 TDLNWLFLNN-NRLKSLEGQLGTLSKLQLLVIEQ-NQLEALPSDIQLFSQLGSLYANNNR 60
T+L L L++ + L L + L+KL+ L + + LE LP+ I L SLY N
Sbjct: 657 TNLETLKLSDCSSLVELPSSIQYLNKLEDLDMSRCENLEILPTGINL----KSLYRLN-- 710
Query: 61 ITSLDGLLRGLTKLQVF-----NMDFNQITMVRRDEF-QNLH--NLDSISLQNNQITSMN 112
L G ++L+ F N+ + + +EF NL NLD + L + +
Sbjct: 711 -------LSGCSRLKSFPDISTNISWLDLDETAIEEFPSNLRLENLDELILCEMKSEKLW 763
Query: 113 SSLSGLTKLAYLYLSHNQLTEFLLDDIRGLKRLRTVDLSYNKINKFGTRN---------- 162
+ LT L + LS + LT L DI L L + + +K+ N
Sbjct: 764 ERVQPLTPLMTM-LSPS-LTRLFLSDIPSLVELPSSIQNLHKLEHLEIENCINLETLPTG 821
Query: 163 -----------EGKNQVQGV----TNIFELKLQHNEIENLDGALMGIHGLSRLDLSH-NK 206
G ++++ TNI +L L IE + + LS LD++ N
Sbjct: 822 INLESLESLDLSGCSRLRTFPDISTNISDLNLSRTGIEEVPWWIEKFSNLSFLDMNGCNN 881
Query: 207 LRTISPDDFIGLDSLKMLDIS 227
L+ +S + L L+ +D S
Sbjct: 882 LQRVSL-NISKLKHLETVDFS 901
>gnl|CDD|233045 TIGR00601, rad23, UV excision repair protein Rad23. All proteins
in this family for which functions are known are
components of a multiprotein complex used for targeting
nucleotide excision repair to specific parts of the
genome. In humans, Rad23 complexes with the XPC protein.
This family is based on the phylogenomic analysis of JA
Eisen (1999, Ph.D. Thesis, Stanford University) [DNA
metabolism, DNA replication, recombination, and repair].
Length = 378
Score = 31.4 bits (71), Expect = 0.85
Identities = 13/56 (23%), Positives = 22/56 (39%), Gaps = 9/56 (16%)
Query: 344 TSTTTTTTTTTTPEPTP--APTSTTTQRSTTST-------TTQTPTTPIQEEYTET 390
+ T T P TP APT T + ++ ++ + +P +E T T
Sbjct: 75 SKPKTGTGKVAPPAATPTSAPTPTPSPPASPASGMSAAPASAVEEKSPSEESATAT 130
Score = 30.2 bits (68), Expect = 1.9
Identities = 10/65 (15%), Positives = 19/65 (29%), Gaps = 3/65 (4%)
Query: 329 TKIHGETICQPDSNETSTTTTTTTT---TTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQE 385
K + P + TS T T + + AP S ++S + +
Sbjct: 77 PKTGTGKVAPPAATPTSAPTPTPSPPASPASGMSAAPASAVEEKSPSEESATATAPESPS 136
Query: 386 EYTET 390
+
Sbjct: 137 TSVPS 141
>gnl|CDD|237864 PRK14950, PRK14950, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 585
Score = 31.3 bits (71), Expect = 1.0
Identities = 14/87 (16%), Positives = 19/87 (21%), Gaps = 4/87 (4%)
Query: 317 AVIDAMETINNNTKIHGETICQPDSNE----TSTTTTTTTTTTPEPTPAPTSTTTQRSTT 372
AVI+A+ + T P ST P T T
Sbjct: 354 AVIEALLVPVPAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVP 413
Query: 373 STTTQTPTTPIQEEYTETITLELPQPV 399
P E + +P
Sbjct: 414 PRPVAPPVPHTPESAPKLTRAAIPVDE 440
>gnl|CDD|177958 PLN02324, PLN02324, triacylglycerol lipase.
Length = 415
Score = 30.8 bits (69), Expect = 1.4
Identities = 15/62 (24%), Positives = 30/62 (48%), Gaps = 8/62 (12%)
Query: 265 LPVLCKADLAHNNIKAINIQLALKTQCQIFGLNSTLRIYLEGNPVLCDDSMRAVIDAMET 324
+ VL ADL + INI L K + ++ G+P + D + + ++D+++
Sbjct: 228 MSVLSAADLVYGKKNKINISLQKKQVP--------ITVFAFGSPRIGDHNFKNLVDSLQP 279
Query: 325 IN 326
+N
Sbjct: 280 LN 281
>gnl|CDD|184285 PRK13733, PRK13733, conjugal transfer protein TraV; Provisional.
Length = 171
Score = 30.1 bits (68), Expect = 1.4
Identities = 8/44 (18%), Positives = 14/44 (31%), Gaps = 1/44 (2%)
Query: 341 SNETSTTTTTTTT-TTPEPTPAPTSTTTQRSTTSTTTQTPTTPI 383
+T T TT + + TP + T+ P+
Sbjct: 74 PVQTVTATTPSGSRPAVTATPEQKLLAPRPLFTAAREVKTVVPV 117
>gnl|CDD|224664 COG1750, COG1750, Archaeal serine proteases [General function
prediction only].
Length = 579
Score = 30.9 bits (70), Expect = 1.4
Identities = 27/106 (25%), Positives = 36/106 (33%), Gaps = 15/106 (14%)
Query: 280 AINIQLALKTQCQIFGLNSTLRIYLEGNPVLCDDSMRAV--IDAMETINNNTKIHGETIC 337
I I LA + Q + N L D A DA E + GE +
Sbjct: 468 FIKIGLAENSGDQPVLALAY--YEYAENGQLQDTINAAAYYKDASEL----ALLMGEVVS 521
Query: 338 QPDSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPI 383
S ET ++ + TT ST T +T T Q +T I
Sbjct: 522 STGSKETISSPIYSATTQT-------STATVTTTIENTVQLGSTAI 560
>gnl|CDD|183756 PRK12799, motB, flagellar motor protein MotB; Reviewed.
Length = 421
Score = 30.5 bits (68), Expect = 1.6
Identities = 18/101 (17%), Positives = 33/101 (32%), Gaps = 2/101 (1%)
Query: 319 IDAMETINNNTKIHGETICQPDSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTS-TTTQ 377
I+ + HG + ++ T ++ T + P+P + +T S TTTQ
Sbjct: 284 IEKATGLKQ-IDTHGTVPVAAVTPSSAVTQSSAITPSSAAIPSPAVIPSSVTTQSATTTQ 342
Query: 378 TPTTPIQEEYTETITLELPQPVETNNQIPVQDNLEKEKQPD 418
+ + LP V PV + +
Sbjct: 343 ASAVALSSAGVLPSDVTLPGTVALPAAEPVNMQPQPMSTTE 383
>gnl|CDD|234797 PRK00575, tatA, twin arginine translocase protein A; Provisional.
Length = 92
Score = 28.6 bits (64), Expect = 1.7
Identities = 6/40 (15%), Positives = 13/40 (32%)
Query: 342 NETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTT 381
++ + P PT +QR + + +T
Sbjct: 48 SDNKAEASAAAAQAPYQVATPTPVQSQRVDPAAASGQDST 87
>gnl|CDD|237874 PRK14971, PRK14971, DNA polymerase III subunits gamma and tau;
Provisional.
Length = 614
Score = 30.5 bits (69), Expect = 1.7
Identities = 14/88 (15%), Positives = 28/88 (31%)
Query: 330 KIHGETICQPDSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQEEYTE 389
+ +P + + + P+P+ +S Q S + TQ TP
Sbjct: 372 GRGPKQHIKPVFTQPAAAPQPSAAAAASPSPSQSSAAAQPSAPQSATQPAGTPPTVSVDP 431
Query: 390 TITLELPQPVETNNQIPVQDNLEKEKQP 417
+ + P + E++K P
Sbjct: 432 PAAVPVNPPSTAPQAVRPAQFKEEKKIP 459
>gnl|CDD|197688 smart00370, LRR, Leucine-rich repeats, outliers.
Length = 24
Score = 26.5 bits (60), Expect = 2.2
Identities = 9/19 (47%), Positives = 13/19 (68%)
Query: 197 LSRLDLSHNKLRTISPDDF 215
L LDLS+N+L ++ P F
Sbjct: 4 LRELDLSNNQLSSLPPGAF 22
>gnl|CDD|197687 smart00369, LRR_TYP, Leucine-rich repeats, typical (most populated)
subfamily.
Length = 24
Score = 26.5 bits (60), Expect = 2.2
Identities = 9/19 (47%), Positives = 13/19 (68%)
Query: 197 LSRLDLSHNKLRTISPDDF 215
L LDLS+N+L ++ P F
Sbjct: 4 LRELDLSNNQLSSLPPGAF 22
>gnl|CDD|216233 pfam00997, Casein_kappa, Kappa casein. Kappa-casein is a mammalian
milk protein involved in a number of important
physiological processes. In the gut, the ingested
protein is split into an insoluble peptide (para
kappa-casein) and a soluble hydrophilic glycopeptide
(caseinomacropeptide). Caseinomacropeptide is
responsible for increased efficiency of digestion,
prevention of neonate hypersensitivity to ingested
proteins, and inhibition of gastric pathogens.
Length = 161
Score = 29.1 bits (65), Expect = 3.2
Identities = 14/54 (25%), Positives = 21/54 (38%), Gaps = 7/54 (12%)
Query: 343 ETSTTTTTTTTTTPEPTPAPTS-------TTTQRSTTSTTTQTPTTPIQEEYTE 389
+ + T T T EPTP PT+ T + S+ + TT + E
Sbjct: 107 DKTAIPTINTIATVEPTPVPTTEPTVNTVVTPEASSEFIISTPETTTVAVTSPE 160
>gnl|CDD|178748 PLN03209, PLN03209, translocon at the inner envelope of chloroplast
subunit 62; Provisional.
Length = 576
Score = 29.5 bits (66), Expect = 3.4
Identities = 10/53 (18%), Positives = 15/53 (28%), Gaps = 3/53 (5%)
Query: 358 PTPAPTSTTTQRSTTSTTTQ---TPTTPIQEEYTETITLELPQPVETNNQIPV 407
P P P S++ S + P + P VE P+
Sbjct: 386 PIPTPPSSSPASSKSVDAVAKPAEPDVVPSPGSASNVPEVEPAQVEAKKTRPL 438
>gnl|CDD|132431 TIGR03390, ascorbOXfungal, L-ascorbate oxidase, fungal type. This
model describes a family of fungal ascorbate oxidases,
within a larger family of multicopper oxidases that also
includes plant ascorbate oxidases (TIGR03388), plant
laccases and laccase-like proteins (TIGR03389), and
related proteins. The member from Acremonium sp. HI-25
is characterized.
Length = 538
Score = 29.8 bits (67), Expect = 3.5
Identities = 19/105 (18%), Positives = 32/105 (30%), Gaps = 10/105 (9%)
Query: 305 EGNPVLCDDSMRAVIDAMETINNNTKIHGETICQPDSNETSTTTTTTTTTTPEPTPAPTS 364
+ LC R ET + G + + S++ S + PE P P
Sbjct: 264 KTEDELCGGDKRQYFIQFETRDRPKVYRGYAVLRYRSDKASKLPSV-----PETPPLPLP 318
Query: 365 TTTQRSTTST-----TTQTPTTPIQEEYTETITLELPQPVETNNQ 404
+T P +E T + ++ Q V+ N
Sbjct: 319 NSTYDWLEYELEPLSEENNQDFPTLDEVTRRVVIDAHQNVDPLNG 363
>gnl|CDD|118131 pfam09595, Metaviral_G, Metaviral_G glycoprotein. This is a viral
attachment glycoprotein from region G of metaviruses. It
is high in serine and threonine suggesting it is highly
glycosylated.
Length = 183
Score = 28.8 bits (64), Expect = 3.5
Identities = 15/54 (27%), Positives = 21/54 (38%)
Query: 345 STTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQEEYTETITLELPQP 398
S T TT + T + T T +T++ QT TT + T QP
Sbjct: 106 SRTQTTQAVHEKKNTRTTSRTQTPPTTSTAAVQTTTTTNTSSTGKEPTTTSVQP 159
Score = 27.7 bits (61), Expect = 9.3
Identities = 17/48 (35%), Positives = 26/48 (54%)
Query: 343 ETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQEEYTET 390
+ +T TT+ T T P + A TTT +T+ST + TT +Q + T
Sbjct: 117 KKNTRTTSRTQTPPTTSTAAVQTTTTTNTSSTGKEPTTTSVQPRSSAT 164
>gnl|CDD|177577 PHA03292, PHA03292, envelope glycoprotein I; Provisional.
Length = 413
Score = 29.5 bits (66), Expect = 3.6
Identities = 15/60 (25%), Positives = 22/60 (36%)
Query: 339 PDSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQEEYTETITLELPQP 398
P TT+T + + + +P A T+T T T TT P + QP
Sbjct: 194 PRYLNAVTTSTYSRSMSSQPAGAATATPTPTLDTGLTTVAPPNETVVTGETALLCHWFQP 253
>gnl|CDD|215541 PLN03020, PLN03020, low-temperature-induced protein; Provisional.
Length = 556
Score = 29.3 bits (65), Expect = 4.8
Identities = 12/38 (31%), Positives = 17/38 (44%), Gaps = 2/38 (5%)
Query: 355 TPEPTPAPTSTTTQRSTTSTTTQTPTTPIQEEYTETIT 392
+PEP P T + ST+ Q+ YTE I+
Sbjct: 285 SPEPLS-PAETKSTEDPQSTS-NAEKPSNQKTYTEKIS 320
>gnl|CDD|113196 pfam04415, DUF515, Protein of unknown function (DUF515). Family of
hypothetical Archaeal proteins.
Length = 416
Score = 29.1 bits (65), Expect = 4.8
Identities = 11/54 (20%), Positives = 29/54 (53%)
Query: 341 SNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQEEYTETITLE 394
S+ +ST+++ +++T+ P A + + S+T+Q+ + Y+ ++ L
Sbjct: 290 SSSSSTSSSESSSTSYSPGDASIQNSQRSQLQSSTSQSESESASSSYSYSVNLP 343
>gnl|CDD|219355 pfam07267, Nucleo_P87, Nucleopolyhedrovirus capsid protein P87.
This family consists of several Nucleopolyhedrovirus
capsid protein P87 sequences. P87 is expressed late in
infection and concentrated in infected cell nuclei.
Length = 606
Score = 29.1 bits (65), Expect = 5.1
Identities = 18/99 (18%), Positives = 34/99 (34%), Gaps = 2/99 (2%)
Query: 313 DSMRAVIDAMETINNNTKIHGETICQPDSNETSTTTTTTTTTTPEPTPAPTSTTTQRS-- 370
DS+ V + +T+ +T S +T T +TP + T ++
Sbjct: 248 DSVDDVRNFAKTVWRQKNTPAQTPMPERSWQTPAQTPARRISTPMTEEIKSWQTPLQTPA 307
Query: 371 TTSTTTQTPTTPIQEEYTETITLELPQPVETNNQIPVQD 409
S+ Q P + E + P + + +P D
Sbjct: 308 MYSSDYQAPKPEPIYTWEELLRERFPSDLFAISSLPDSD 346
>gnl|CDD|219426 pfam07489, Tir_receptor_C, Translocated intimin receptor (Tir)
C-terminus. Intimin and its translocated intimin
receptor (Tir) are bacterial proteins that mediate
adhesion between mammalian cells and attaching and
effacing (A/E) pathogens. A unique and essential feature
of A/E bacterial pathogens is the formation of
actin-rich pedestals beneath the intimately adherent
bacteria and localised destruction of the intestinal
brush border. The bacterial outer membrane adhesin,
intimin, is necessary for the production of the A/E
lesion and diarrhoea. The A/E bacteria translocate their
own receptor for intimin, Tir, into the membrane of
mammalian cells using the type III secretion system. The
translocated Tir triggers additional host signalling
events and actin nucleation, which are essential for
lesion formation. This family represents the Tir
C-terminal domain which has been reported to bind
uninfected host cells and beta-1 integrins although the
role of intimin binding to integrins is unclear. This
intimin C-terminal domain has also been shown to be
sufficient for Tir recognition.
Length = 222
Score = 28.7 bits (64), Expect = 5.3
Identities = 16/38 (42%), Positives = 21/38 (55%), Gaps = 1/38 (2%)
Query: 338 QPDSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTT 375
QP +T+TTTTTTTTT P + T + T T+
Sbjct: 55 QP-VEQTTTTTTTTTTTHTTVENKPANNTPAQGNTDTS 91
>gnl|CDD|223714 COG0641, AslB, Arylsulfatase regulator (Fe-S oxidoreductase)
[General function prediction only].
Length = 378
Score = 28.9 bits (65), Expect = 5.8
Identities = 11/32 (34%), Positives = 19/32 (59%)
Query: 63 SLDGLLRGLTKLQVFNMDFNQITMVRRDEFQN 94
+ D +++GL LQ +DFN +T+V R +
Sbjct: 140 TFDRVMKGLELLQAHGVDFNTLTVVNRQNVLH 171
>gnl|CDD|223259 COG0181, HemC, Porphobilinogen deaminase [Coenzyme metabolism].
Length = 307
Score = 28.7 bits (65), Expect = 6.1
Identities = 8/32 (25%), Positives = 15/32 (46%)
Query: 185 ENLDGALMGIHGLSRLDLSHNKLRTISPDDFI 216
D ++ GL RL L + + P++F+
Sbjct: 160 GEYDAIILAAAGLKRLGLENRITEILDPEEFL 191
>gnl|CDD|113398 pfam04625, DEC-1_N, DEC-1 protein, N-terminal region. The
defective chorion-1 gene (dec-1) in Drosophila encodes
follicle cell proteins necessary for proper eggshell
assembly. Multiple products of the dec-1 gene are formed
by alternative RNA splicing and proteolytic processing.
Cleavage products include S80 (80 kDa) which is
incorporated into the eggshell, and further proteolysis
of S80 gives S60 (60 kDa).
Length = 407
Score = 28.6 bits (63), Expect = 6.1
Identities = 16/50 (32%), Positives = 19/50 (38%)
Query: 360 PAPTSTTTQRSTTSTTTQTPTTPIQEEYTETITLELPQPVETNNQIPVQD 409
PAP T Q S S P P +E I L E Q+ V+D
Sbjct: 198 PAPAGTDAQASDISEVRVRPEDPYSQEAQMKIKSALEMEQERQQQVQVKD 247
>gnl|CDD|185693 cd08023, GH16_laminarinase_like, Laminarinase, member of the
glycosyl hydrolase family 16. Laminarinase, also known
as glucan endo-1,3-beta-D-glucosidase, is a glycosyl
hydrolase family 16 member that hydrolyzes
1,3-beta-D-glucosidic linkages in 1,3-beta-D-glucans
such as laminarins, curdlans, paramylons, and pachymans,
with very limited action on mixed-link
(1,3-1,4-)-beta-D-glucans.
Length = 235
Score = 28.4 bits (64), Expect = 6.6
Identities = 9/39 (23%), Positives = 18/39 (46%), Gaps = 2/39 (5%)
Query: 319 IDAMETINNNT-KIHGETICQPDSNETSTTTTTTTTTTP 356
ID ME + N ++G T+ +N+ + + + T
Sbjct: 116 IDIMEYVGNEPNTVYG-TLHGGATNDGNNGSGGSYTLPT 153
>gnl|CDD|221145 pfam11596, DUF3246, Protein of unknown function (DUF3246). This is
a small family of fungal proteins one of whose members
from Pichia stipitis is described as being an extremely
serine rich protein-mucin-like protein.
Length = 208
Score = 28.2 bits (62), Expect = 6.7
Identities = 15/48 (31%), Positives = 20/48 (41%)
Query: 340 DSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQEEY 387
D +ET+T TTT T T T + +T T PT + Y
Sbjct: 161 DCDETTTYTTTYYTPYTTVIHGGTVYTNGVTVIATHTVYPTDVEDDGY 208
>gnl|CDD|161659 TIGR00005, rluA_subfam, pseudouridine synthase, RluA family. In E.
coli, RluD (SfhB) modifies uridine to pseudouridine at
23S RNA U1911, 1915, and 1917, RluC modifies 955, 2504
and 2580, and RluA modifies U746 and tRNA U32. An
additional homolog from E. coli outside this family,
TruC (SP|Q46918), modifies uracil-65 in transfer RNAs to
pseudouridine [Protein synthesis, tRNA and rRNA base
modification].
Length = 299
Score = 28.4 bits (64), Expect = 6.7
Identities = 13/33 (39%), Positives = 18/33 (54%), Gaps = 3/33 (9%)
Query: 257 RLDKDFHGLPVLCKADLAHNNIKAINIQLALKT 289
RLD+D GL V+ K LA ++ QL +T
Sbjct: 125 RLDRDTSGLMVVAKTPLALRELQR---QLKNRT 154
>gnl|CDD|114172 pfam05432, BSP_II, Bone sialoprotein II (BSP-II). Bone
sialoprotein (BSP) is a major structural protein of the
bone matrix that is specifically expressed by
fully-differentiated osteoblasts. The expression of bone
sialoprotein (BSP) is normally restricted to mineralised
connective tissues of bones and teeth where it has been
associated with mineral crystal formation. However, it
has been found that ectopic expression of BSP occurs in
various lesions, including oral and extraoral
carcinomas, in which it has been associated with the
formation of microcrystalline deposits and the
metastasis of cancer cells to bone.
Length = 291
Score = 28.5 bits (63), Expect = 7.1
Identities = 24/86 (27%), Positives = 32/86 (37%), Gaps = 11/86 (12%)
Query: 333 GETICQPDSNETSTTTTTTTTTTP----EPTPAPTSTTTQRSTTSTTTQTPTTPIQE-EY 387
GE ++ TT TTT+P +PT P TT TTP + EY
Sbjct: 184 GEEESVTEAEAEGTTVAGPTTTSPNGGFQPTTPPQEVYG---TTDPPFGKVTTPEYQGEY 240
Query: 388 TETITLELPQPVE---TNNQIPVQDN 410
+T E E + N P D+
Sbjct: 241 EQTGANEYDGGYEIYESENGEPRGDS 266
>gnl|CDD|234428 TIGR03979, His_Ser_Rich, His-Xaa-Ser repeat protein HxsA. Members
of this protein share two defining regions. One is a
histidine/serine-rich cluster, typically
H-R-S-H-S-S-H-R-S-H-S-S-H. Members are found always in
the context of a pair of radical SAM proteins, HxsB and
HxsC, and a fourth protein HxsD. The system is predicted
to perform peptide modifications, likely in the
His-Xaa-Ser region, to produce some uncharacterized
natural product.
Length = 186
Score = 27.9 bits (62), Expect = 8.0
Identities = 11/37 (29%), Positives = 20/37 (54%)
Query: 345 STTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTT 381
S T+T + P P+ +P+ ++ +S STT P +
Sbjct: 80 SGDTSTYSYPVPSPSYSPSPGSSIQSLPSTTGVRPQS 116
>gnl|CDD|233534 TIGR01693, UTase_glnD, [Protein-PII] uridylyltransferase. This
model describes GlnD, the
uridylyltransferase/uridylyl-removing enzyme for the
nitrogen regulatory protein PII. Not all homologs of PII
share the property of uridylyltransferase modification
on the characteristic Tyr residue (see Prosite pattern
PS00496 and document PDOC00439), but the modification
site is preserved in the PII homolog of all species with
a member of this family [Central intermediary
metabolism, Nitrogen metabolism, Regulatory functions,
Protein interactions].
Length = 850
Score = 28.5 bits (64), Expect = 8.5
Identities = 15/44 (34%), Positives = 21/44 (47%), Gaps = 3/44 (6%)
Query: 148 VDLSYNKINKFGTRNEGKNQVQGVTNIFELKLQHNEIENLDGAL 191
+ + KI FG + E V VT++F LKL E + L L
Sbjct: 805 LSIQSAKITTFGEKAE---DVFYVTDLFGLKLTDEEEQRLLEVL 845
>gnl|CDD|132138 TIGR03094, sulfo_cyanin, sulfocyanin. Members of this family are
blue-copper redox proteins designated sulfocyanin, from
the archaeal genera Sulfolobus, Ferroplasma, and
Picrophilus. The most closely related proteins
characterized as functionally different are the
rustacyanins [Energy metabolism, Electron transport].
Length = 195
Score = 27.9 bits (62), Expect = 8.5
Identities = 12/42 (28%), Positives = 18/42 (42%)
Query: 341 SNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTP 382
S++TS +T T+TT+ P +S T T P
Sbjct: 33 SSDTSAVSTGTSTTSKIYIPYNSSNKTVFLTLVVWGSGNAYP 74
>gnl|CDD|217495 pfam03326, Herpes_TAF50, Herpesvirus transcription activation
factor (transactivator). This family includes EBV BRLF1
and similar ORF 50 proteins from other herpesviruses.
Length = 500
Score = 28.2 bits (63), Expect = 8.8
Identities = 14/66 (21%), Positives = 27/66 (40%), Gaps = 2/66 (3%)
Query: 333 GETICQPDSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTTPIQEEYTETIT 392
++ D ST + T+ PA S +++R +T+ Q + Q E + +
Sbjct: 239 QQSESFSDVWPASTQSPREETSAEPLAPA--SPSSRRPSTAQEEQIACSSPQAEPEQGVQ 296
Query: 393 LELPQP 398
+PQ
Sbjct: 297 SYVPQS 302
>gnl|CDD|165473 PHA03207, PHA03207, serine/threonine kinase US3; Provisional.
Length = 392
Score = 28.3 bits (63), Expect = 9.3
Identities = 16/80 (20%), Positives = 26/80 (32%)
Query: 322 METINNNTKIHGETICQPDSNETSTTTTTTTTTTPEPTPAPTSTTTQRSTTSTTTQTPTT 381
ME +++K+ I S T T T+ + + T + + T
Sbjct: 1 MECHTDSSKVSDSNINLGYEAIFSLTGGTDTSDSKDTTGDKFDDCDELGDSDDVTHATDY 60
Query: 382 PIQEEYTETITLELPQPVET 401
EE T +P ET
Sbjct: 61 DADEESLSPQTDVCQEPCET 80
>gnl|CDD|184042 PRK13415, PRK13415, flagella biosynthesis protein FliZ;
Provisional.
Length = 219
Score = 27.8 bits (62), Expect = 9.9
Identities = 8/27 (29%), Positives = 9/27 (33%)
Query: 338 QPDSNETSTTTTTTTTTTPEPTPAPTS 364
PD T T T T E + S
Sbjct: 38 HPDKCGKETPTGQQTQETAEAAASSVS 64
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.314 0.131 0.365
Gapped
Lambda K H
0.267 0.0925 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 20,312,326
Number of extensions: 1919894
Number of successful extensions: 3531
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2571
Number of HSP's successfully gapped: 300
Length of query: 419
Length of database: 10,937,602
Length adjustment: 99
Effective length of query: 320
Effective length of database: 6,546,556
Effective search space: 2094897920
Effective search space used: 2094897920
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 60 (26.5 bits)