RPS-BLAST 2.2.26 [Sep-21-2011]
Database: CDD.v3.10
44,354 sequences; 10,937,602 total letters
Searching..................................................done
Query= psy17507
(1121 letters)
>gnl|CDD|202886 pfam04096, Nucleoporin2, Nucleoporin autopeptidase.
Length = 137
Score = 173 bits (441), Expect = 2e-50
Identities = 66/144 (45%), Positives = 92/144 (63%), Gaps = 7/144 (4%)
Query: 550 GYFVSPTLNELKQLKTKNGPCIVEGFTIGHEEFGSICFLESIDISELNNVDLDSIVEFRF 609
GY+ SP++ EL ++ + VE FT+G + +GSI FL +D+S +DLD IV+F
Sbjct: 1 GYWTSPSIEELSKM-SLEELSSVENFTVGRKGYGSIEFLGPVDLS---GIDLDKIVDFGP 56
Query: 610 KEVVLYPDDANKPPVGTALNRPAKVTLLKVWPKDKSTGELIKDVQKIKNMNYEAIVIRAC 669
+ V +YPDD NKPPVG LN PA +TL +P+DKST + IKD K + +E + R
Sbjct: 57 RSVEVYPDDENKPPVGEGLNVPATITLENCYPRDKSTKKPIKDPSKPR---FEKHLKRLK 113
Query: 670 TRMNVEFVSYQPETGSWCFKVQHF 693
+ EF+SY PETG+W FKV+HF
Sbjct: 114 EKKGAEFISYDPETGTWVFKVKHF 137
>gnl|CDD|221425 pfam12110, Nup96, Nuclear protein 96. Nup96 (often known by the name
of its yeast homolog Nup145C) is part of the Nup84
heptameric complex in the nuclear pore complex. Nup96
complexes with Sec13 in the middle of the heptamer. The
function of the heptamer is to coat the curvature of the
nuclear pore complex between the inner and outer nuclear
membranes. Nup96 is predicted to be an alpha helical
solenoid. The interaction between Nup96 and Sec13 is the
point of curvature in the heptameric complex.
Length = 290
Score = 75.1 bits (185), Expect = 2e-14
Identities = 29/96 (30%), Positives = 48/96 (50%), Gaps = 1/96 (1%)
Query: 1026 DPEQNIFQLVTQHKLHEAIVSAQQSDNFYLSGALSMLGCSEIGQAFMQAQLSHYVNATSD 1085
P + IF L++ +++ EA A +S+NF+L+ LS +G ++ Q QLS +
Sbjct: 1 SPLEKIFLLLSGNRIVEACELAIESNNFHLATLLSQIGNDPSVRSLAQQQLSKWRQNGVL 60
Query: 1086 SFINKDILKLMLLTSGQPLYDGSNGIINVCESLDWK 1121
S I++ I K+ L +G + N E LDW
Sbjct: 61 SEIDEPIRKIYELLAGNVFRSLGDSGENEFE-LDWL 95
>gnl|CDD|227938 COG5651, COG5651, PPE-repeat proteins [Cell motility and
secretion].
Length = 490
Score = 44.9 bits (106), Expect = 2e-04
Identities = 47/301 (15%), Positives = 76/301 (25%), Gaps = 16/301 (5%)
Query: 52 GTTPQPSMFGTNTSTAQPATSLFGATENKPLFGQSTTTFGQPTNTFGTTNSLFGKPNTGF 111
P P+ + + A G + T + L
Sbjct: 168 FNEPPPTTNSSGLAAQASAVQALGDLASGITLASQVNLSLLELINPATLSGLANGGTGNL 227
Query: 112 GAAVSTQSNTFSFNPTTFGQ-PNTTAIKPFGAAAPQTSNLFATNTSQATTAFGAPQTS-G 169
G Q+ F FG + P A+ +NL S T+ G
Sbjct: 228 GIGALQQAQNLGFGNVGFGNLGSGNPGAPGLASQFSATNLGTLLGSLNPYLGNIGATNIG 287
Query: 170 FGSFGTNQNQSLNFNQNKP---TFNLGGSTTSTGFGFGTNTATNNTSLFGQKPATNT-FG 225
+ GT S N + +G ++ +T N + G N+
Sbjct: 288 LAAAGTGNIGSGNAVDSGGSALVGAIGQTSQATANAGSVNATGGAAAGSGNLGVANSGSA 347
Query: 226 LPTNTASPFGTSTSFGAQPAQTST-LFNSFNKPTPG--------FQFNTNTTPNTSLANN 276
+ + GA + L N+ G + NT N+
Sbjct: 348 AAPFGIAGANQAALGGANSGAGNFGLGNNPGGGLGGKPLGGTGNGGIGASGIGNTGYGNS 407
Query: 277 AFSGTLFGNTAKPAGLFGNSTNTGGLFGNSTFGSNTGNANTFGQGLNSSFNLGGTSNTFN 336
+ N N+ N G + N G+ N G N G + F
Sbjct: 408 GIANAGLSNAGSNNAGGENAGNANNTGGGNVGLWNAGDFNAGAAGTG-FTNNGSYNTGFL 466
Query: 337 N 337
N
Sbjct: 467 N 467
Score = 44.5 bits (105), Expect = 3e-04
Identities = 46/253 (18%), Positives = 64/253 (25%), Gaps = 18/253 (7%)
Query: 60 FGTNTSTAQPATSLFGATENKPLFGQSTTTFGQPTNTFGTTNSLFGKPNTGFGAAVSTQS 119
G+ A S F AT L G G T + N G G AV +
Sbjct: 248 LGSGNPGAPGLASQFSATNLGTLLGSLNPYLGNIGATNIGLAAAGT-GNIGSGNAVDSGG 306
Query: 120 NTFSFNPTTFGQP--NTTAIKPFGAAAPQTSNLFATNTSQATTAFG--APQTSGFGSFGT 175
+ Q N ++ G AA + NL N+ A FG + G +
Sbjct: 307 SALVGAIGQTSQATANAGSVNATGGAAAGSGNLGVANSGSAAAPFGIAGANQAALGGANS 366
Query: 176 NQNQSLNFNQNKPTFNLGGSTTSTGFGFGTNTATNNTSLFGQKPATNTFGLPTNTASPFG 235
N + G G + N N
Sbjct: 367 GAGNFGLGNNPGGGLGGKPLGGTGNGGIGASGIGNTGY---------GNSGIANAGLSNA 417
Query: 236 TSTSFGAQPAQTSTLFNSFNKPTP--GFQFNTNTTPNTSLANNAFSGTLFGNTAKPAGLF 293
S + G + A + N G FN N +++ F N
Sbjct: 418 GSNNAGGENAGNANNTGGGNVGLWNAGD-FNAGAAGTGFTNNGSYNTG-FLNFGNDNTGI 475
Query: 294 GNSTNTGGLFGNS 306
N NT N
Sbjct: 476 FNGGNTSNGTFNQ 488
Score = 43.4 bits (102), Expect = 6e-04
Identities = 48/296 (16%), Positives = 72/296 (24%), Gaps = 41/296 (13%)
Query: 59 MFGTNTSTAQPATSLFGATENKPLFGQSTTTFGQPTNTFGTTNSLFGKPNTGFGAAVSTQ 118
M+G + + A++L N+P +++ + L + A+
Sbjct: 151 MYGYYAAASAAASAL--TPFNEPPPTTNSSGLAAQASAVQALGDL---ASGITLASQVNL 205
Query: 119 SNTFSFNPTTFGQPNTTAIKPFGAAAPQTSNLFATNTSQATTAFGAPQTSGFGSFGTNQN 178
S NP T G A Q A N FG + G+ G
Sbjct: 206 SLLELINPATLSGLANGGTGNLGIGALQQ----AQNLGFGNVGFGNLGSGNPGAPGLASQ 261
Query: 179 QSLNFNQNKPTFNLGGSTTSTGFGFGTNTATNNTSLFGQKPATNTFGLPTNTASPFGTST 238
S G A N +
Sbjct: 262 FSATNLGTLLGSLNPYLGNIGATNIGLAAAGTG----------NIGSGNAVDSGGSALVG 311
Query: 239 SFGAQPAQTSTLFNSFNKPTPGFQFNTNTTPNTSLANNAFSGTL-FGNTAKPAGLFG-NS 296
+ G T+ + N + A SG L N+ A FG
Sbjct: 312 AIGQTSQATANAGSV----------------NATGGAAAGSGNLGVANSGSAAAPFGIAG 355
Query: 297 TNTGGLFGNSTFGSNTGNANTFGQGLNSSFNLG----GTSNTFNNPLAPQAPPASN 348
N L G ++ N G N G GL G G + +N
Sbjct: 356 ANQAALGGANSGAGNFGLGNNPGGGLGGKPLGGTGNGGIGASGIGNTGYGNSGIAN 411
Score = 33.7 bits (77), Expect = 0.61
Identities = 25/185 (13%), Positives = 38/185 (20%), Gaps = 14/185 (7%)
Query: 43 LVSATGSFFGTTPQPSMFGTNTSTAQPATSLFGATENKPLFGQSTTTFGQPTNTFGTTNS 102
SA G T Q + + + A + G + N +
Sbjct: 305 GGSALVGAIGQTSQATANAGSVNATGGAAAGSGNLGVANSGSAAAPFGIAGANQAALGGA 364
Query: 103 LFGKPNTG--------------FGAAVSTQSNTFSFNPTTFGQPNTTAIKPFGAAAPQTS 148
G N G G + N A +
Sbjct: 365 NSGAGNFGLGNNPGGGLGGKPLGGTGNGGIGASGIGNTGYGNSGIANAGLSNAGSNNAGG 424
Query: 149 NLFATNTSQATTAFGAPQTSGFGSFGTNQNQSLNFNQNKPTFNLGGSTTSTGFGFGTNTA 208
+ G F + + N + N N G T G T+
Sbjct: 425 ENAGNANNTGGGNVGLWNAGDFNAGAAGTGFTNNGSYNTGFLNFGNDNTGIFNGGNTSNG 484
Query: 209 TNNTS 213
T N
Sbjct: 485 TFNQG 489
Score = 30.7 bits (69), Expect = 5.3
Identities = 35/214 (16%), Positives = 59/214 (27%), Gaps = 13/214 (6%)
Query: 220 ATNTFGLPTNTASPFGTSTSFGAQPAQTSTLFNSFNKPTPGFQFNTNTTPNTSLANNAFS 279
A + P N P T+ AQ S + + + + L N A
Sbjct: 160 AAASALTPFNEPPPT---TNSSGLAAQASAVQALGDLASGITLASQVNLSLLELINPATL 216
Query: 280 GTLFGNTAKPAGLFGNSTNTGGLFGNSTFGSNTGNANTFGQGLNSSFNLGGTSNTFNNPL 339
L G+ FGN FG N G+ N GL S F +N
Sbjct: 217 SGLANGGTGNLGIGALQQAQNLGFGNVGFG-NLGSGNPGAPGLASQF---SATNLGTLLG 272
Query: 340 APQAPPASNSVNTQQILNLASLPYFSAPTLYKGVLPATSSSSKLDELILPAFQKNRINNK 399
+ + + + S + G + + + A +
Sbjct: 273 SLNPYLGNIGATNIGLAAAGTGNIGSGNAVDSGGSALVGAIGQTSQATANAG------SV 326
Query: 400 NVTVNSNASYNKIKIKPVGSTSASTKSLFDGLSS 433
N T + A + + GS +A ++
Sbjct: 327 NATGGAAAGSGNLGVANSGSAAAPFGIAGANQAA 360
>gnl|CDD|222274 pfam13634, Nucleoporin_FG, Nucleoporin FG repeat region. This
family includes a number of FG repeats that are found in
nucleoporin proteins. This family includes the yeast
nucleoporins Nup116, Nup100, Nup49, Nup57 and Nup 145.
Length = 106
Score = 41.3 bits (97), Expect = 2e-04
Identities = 18/98 (18%), Positives = 27/98 (27%), Gaps = 3/98 (3%)
Query: 152 ATNTSQATTAFGAPQTSGFGSFGTNQNQSLNFNQNKPTFNLGGSTTSTGFGFGTNTATNN 211
T + + FG+ +G G FG N + G S++ G +
Sbjct: 4 TTAGASSGGLFGSAPATGGGLFGQNAANT---TPTSGGGLFGSSSSQATQPSGGGLFGSA 60
Query: 212 TSLFGQKPATNTFGLPTNTASPFGTSTSFGAQPAQTST 249
FG T T + FG A
Sbjct: 61 AQTSATTTGGGLFGSTTATTTTATGGGLFGNATAAQPA 98
Score = 39.4 bits (92), Expect = 0.001
Identities = 29/124 (23%), Positives = 43/124 (34%), Gaps = 22/124 (17%)
Query: 116 STQSNTFSFNPTTFGQPNTTAIKPFGAAAPQTSNLFATNTSQATTAFGAPQTSGFGSFGT 175
S+ + + + FG T FG A T+ ++++ + G G FG+
Sbjct: 1 SSSTTAGASSGGLFGSAPATGGGLFGQNAANTTPTSGGGLFGSSSSQATQPSGG-GLFGS 59
Query: 176 NQNQSLNFNQNKPTFNLGGSTTSTGFGFGTNTATNNT----SLFGQKPATNTFGLPTNTA 231
S +TT+ G FG+ TAT T LFG A T
Sbjct: 60 AAQTS--------------ATTTGGGLFGSTTATTTTATGGGLFGNATAAQP---ATTGG 102
Query: 232 SPFG 235
FG
Sbjct: 103 GLFG 106
Score = 39.4 bits (92), Expect = 0.001
Identities = 36/110 (32%), Positives = 47/110 (42%), Gaps = 9/110 (8%)
Query: 195 STTSTGFGFGTNTATNNTSLFGQKPATNTFGLPTNTASPFGTSTSFGAQPAQTSTLFNSF 254
+ S+G FG+ AT LFGQ A T PT+ FG+S+S QP+ LF S
Sbjct: 6 AGASSGGLFGSAPATG-GGLFGQNAANTT---PTSGGGLFGSSSSQATQPS-GGGLFGSA 60
Query: 255 NKPTPGFQFNTNTTPNTSLANNAFSGTLFGNTAKPAGLFGNSTNTGGLFG 304
+ + T+ A G LFGN +T GGLFG
Sbjct: 61 AQTSATTTGGGLFGSTTATTTTATGGGLFGNATAAQ----PATTGGGLFG 106
Score = 39.4 bits (92), Expect = 0.001
Identities = 24/106 (22%), Positives = 39/106 (36%), Gaps = 5/106 (4%)
Query: 228 TNTASPFGTSTSFGAQPAQTSTLFNSFNKPTPGFQFNTNTTPNTSLANNAFSGTLFGNTA 287
++T + + FG+ PA LF T ++S A G LFG+ A
Sbjct: 2 SSTTAGASSGGLFGSAPATGGGLFGQNAANTTPTSGGGLFGSSSSQATQPSGGGLFGSAA 61
Query: 288 KPAGLFGNSTNTGGLFGNS-TFGSNTGNANTFGQGLNSSFNLGGTS 332
+ + +T GGLFG++ + FG + G
Sbjct: 62 QTS----ATTTGGGLFGSTTATTTTATGGGLFGNATAAQPATTGGG 103
Score = 36.3 bits (84), Expect = 0.013
Identities = 19/105 (18%), Positives = 27/105 (25%), Gaps = 9/105 (8%)
Query: 52 GTTPQPSMFGTNTSTAQPATSLFGATENKPLFGQSTTTFGQPTNTF--GTTNSLFGKPNT 109
+FG+ +T LFG FG ++ + LFG
Sbjct: 6 AGASSGGLFGSAPATG---GGLFGQNAANTTPTSGGGLFGSSSSQATQPSGGGLFGSAAQ 62
Query: 110 GFGAAVSTQSNTFSFNPTTFGQPNTTAIKPFGAAAPQTSNLFATN 154
+T + F TT T FG A
Sbjct: 63 TS----ATTTGGGLFGSTTATTTTATGGGLFGNATAAQPATTGGG 103
Score = 34.4 bits (79), Expect = 0.056
Identities = 27/116 (23%), Positives = 36/116 (31%), Gaps = 14/116 (12%)
Query: 205 TNTATNNTSLFGQKPATNTFGLPTNTASPFGTSTSFGAQPAQTSTLFNSFNKPTPGFQFN 264
T ++ LFG PAT FG + + P LF S +
Sbjct: 4 TTAGASSGGLFGSAPATGG--------GLFGQNAA-NTTPTSGGGLFGSSSSQATQPSGG 54
Query: 265 TNTTPNTSLANNAFSGTLFGNTAKPAGLFGNSTNTGGLFGNS-TFGSNTGNANTFG 319
+ G LFG+T + GGLFGN+ T FG
Sbjct: 55 GLFGSAAQTSATTTGGGLFGSTTATT----TTATGGGLFGNATAAQPATTGGGLFG 106
Score = 33.6 bits (77), Expect = 0.098
Identities = 21/97 (21%), Positives = 31/97 (31%), Gaps = 1/97 (1%)
Query: 84 GQSTTTFGQPTNTFGTTNSLFGKPNTGFGAAVSTQSNTFSFNPTTFGQPNTTAIKPFGAA 143
STT FG+ + G A + S F ++ + FG+A
Sbjct: 1 SSSTTAGASSGGLFGSAPATGGGLFGQNAANTTPTSGGGLFGSSSSQATQPSGGGLFGSA 60
Query: 144 APQTSNLFATNTSQATTAFGAPQTSGFGSFGTNQNQS 180
A ++ +TTA T G G FG
Sbjct: 61 AQTSATTTGGGLFGSTTATTTTATGG-GLFGNATAAQ 96
Score = 32.8 bits (75), Expect = 0.19
Identities = 26/104 (25%), Positives = 38/104 (36%), Gaps = 12/104 (11%)
Query: 45 SATGSFFGTTPQP--SMFGTNTSTAQPAT---SLFGATENKPLFGQSTTTFGQPTNTFG- 98
+++G FG+ P +FG + T LFG++ ++ FG T
Sbjct: 8 ASSGGLFGSAPATGGGLFG-QNAANTTPTSGGGLFGSSSSQATQPSGGGLFGSAAQTSAT 66
Query: 99 -TTNSLFGKPNTGFGAAVSTQSNTFSFNPTTFGQPNTTAIKPFG 141
T LFG +T + F T QP TT FG
Sbjct: 67 TTGGGLFGSTTA----TTTTATGGGLFGNATAAQPATTGGGLFG 106
Score = 30.1 bits (68), Expect = 2.1
Identities = 19/89 (21%), Positives = 30/89 (33%), Gaps = 11/89 (12%)
Query: 34 GTLQGSKEFLVSATGSFFGTTPQPSMFGTNTSTAQPAT--SLFGATENKP-------LFG 84
G+ + L + T +FG+++S A + LFG+ LFG
Sbjct: 15 GSAPATGGGLFGQNAANTTPTSGGGLFGSSSSQATQPSGGGLFGSAAQTSATTTGGGLFG 74
Query: 85 QSTTTFGQPTNT--FGTTNSLFGKPNTGF 111
+T T T FG + G
Sbjct: 75 STTATTTTATGGGLFGNATAAQPATTGGG 103
Score = 29.4 bits (66), Expect = 3.4
Identities = 24/100 (24%), Positives = 33/100 (33%), Gaps = 14/100 (14%)
Query: 264 NTNTTPNTSL--ANNAFSGTLFGNTA------KPAGLFGNSTN------TGGLFGNSTFG 309
T + L + A G LFG A GLFG+S++ GGLFG++
Sbjct: 4 TTAGASSGGLFGSAPATGGGLFGQNAANTTPTSGGGLFGSSSSQATQPSGGGLFGSAAQT 63
Query: 310 SNTGNANTFGQGLNSSFNLGGTSNTFNNPLAPQAPPASNS 349
S T ++ F N A Q
Sbjct: 64 SATTTGGGLFGSTTATTTTATGGGLFGNATAAQPATTGGG 103
>gnl|CDD|222890 PHA02584, 34, long tail fiber, proximal subunit; Provisional.
Length = 1229
Score = 43.2 bits (102), Expect = 9e-04
Identities = 71/316 (22%), Positives = 108/316 (34%), Gaps = 51/316 (16%)
Query: 115 VSTQSNTFSFNPTTFG-QPNTTAIKPFGAAAPQTSNLFATN------TSQATTAFGAPQT 167
++T SNTF N T QP T K A +P N N T+Q + +
Sbjct: 836 LATGSNTFVGNDTVGSTQPLTNYEKDGYAVSPYGLNYALQNYLPLKATAQNSLKLDGLDS 895
Query: 168 SGFGSFGTNQ--NQSLNFNQNKPTFNLGGSTTSTGFG-FGTNTATNNTSLFGQKPATNTF 224
+ F +Q N SL F +N NL S+ FG + N+T
Sbjct: 896 AQFIRRDIDQTVNGSLTFTKNT---NLSAPLVSSSTATFGGSVTANST------------ 940
Query: 225 GLPTNTASPFGTSTSFGAQPAQTSTLFNSFNKPTPGFQFNTNTTPNTSLANNAFSGTLFG 284
L T + GT +TS F S N T FN + T + N +GTL
Sbjct: 941 -LTTQN-TSNGTVVVVD----ETSIAFYSQNNTTGNIVFNIDGTVDPINVNA--NGTLNA 992
Query: 285 NTAKPAGLFGNSTNTGGLFGNSTFGSNTGNAN-------TFGQGLNSSFNLGGTSNTFNN 337
G +TN ++ + T NA T +++ L +
Sbjct: 993 T--------GVATNGRAVYAEGGGIARTNNAARAITGGFTIRNDGSTTVFLLTAAGDQTG 1044
Query: 338 PLAPQAPPASNSVNTQQILNLASLPYFSAPTLYKGVLPATSS--SSKLDELILPAFQKNR 395
N+ N Q +N + + T+ G L S S A +
Sbjct: 1045 GFNGLKSLIINNANGQVTINDNYIIN-AGGTIMSGGLTVNSRIRSQGTKASYTRAPTADT 1103
Query: 396 INNKNVTVNSNASYNK 411
+ +V +N +A+YN+
Sbjct: 1104 VGFWSVDINDSATYNQ 1119
Score = 33.6 bits (77), Expect = 0.78
Identities = 62/317 (19%), Positives = 87/317 (27%), Gaps = 53/317 (16%)
Query: 61 GTNTSTAQPATSLFGATENKPLFGQSTTTFGQPTNTFGTTNSLFGKPNTGFGAAVSTQSN 120
G+ T T + PL ST TFG T NS NT G V
Sbjct: 909 GSLTFTKN-------TNLSAPLVSSSTATFGGSV----TANSTLTTQNTSNGTVVVVDET 957
Query: 121 TFSFNPTTFGQPNTTAIKPFGAAAPQTSNLFATNTSQATTAF---GAPQTSGFGSFGTNQ 177
+ +F N N+ A T AT A G G TN
Sbjct: 958 SIAFYSQNNTTGNIVFN---IDGTVDPINVNANGTLNATGVATNGRAVYAEGGGIARTNN 1014
Query: 178 -NQSLN-----FNQNKPTFNLGGSTTSTGFGFG-----------TNTATNNTSLFGQKPA 220
+++ N T L + GF N+ +
Sbjct: 1015 AARAITGGFTIRNDGSTTVFLLTAAGDQTGGFNGLKSLIINNANGQVTINDNYIINAGGT 1074
Query: 221 TNTFGLPTNTAS-PFGTSTSFGAQPAQTSTLFNSFNKPTPGFQFNTNTTPNTSLANNAFS 279
+ GL N+ GT S+ P + F S + +N +
Sbjct: 1075 IMSGGLTVNSRIRSQGTKASYTRAPTADTVGFWSVDI-NDSATYNQFPGYFQMVTKTKSP 1133
Query: 280 GTL--FGNTAKPAGLFGNSTNTGGLFGNSTFGSNTGNANTFGQGLNSSFNLGGTSNTFNN 337
GTL FGNT + G +T + T N +++ G T N
Sbjct: 1134 GTLTQFGNTLD------SLYQDWSPDGRTTRYTRTWQKN------KNAWTSFGEVYTKGN 1181
Query: 338 PLAPQ---APPASNSVN 351
+ A PA
Sbjct: 1182 KPTAEDIGAVPADGGTM 1198
>gnl|CDD|236952 PRK11669, pbpG, D-alanyl-D-alanine endopeptidase; Provisional.
Length = 306
Score = 38.1 bits (89), Expect = 0.020
Identities = 26/64 (40%), Positives = 31/64 (48%), Gaps = 15/64 (23%)
Query: 63 NTSTAQPATSLFGATENKPLFGQ-STT-----TFGQPTNT--FGTTNSLFGKPN------ 108
N STA+ T L A++ PL GQ STT TF +P T F TN L + N
Sbjct: 172 NVSTARDLTKLLIASKQYPLIGQLSTTREKTATFRKPNYTLPFRNTNHLVYRDNWNIQLT 231
Query: 109 -TGF 111
TGF
Sbjct: 232 KTGF 235
>gnl|CDD|221143 pfam11593, Med3, Mediator complex subunit 3 fungal. Mediator is a
large complex of up to 33 proteins that is conserved
from plants to fungi to humans - the number and
representation of individual subunits varying with
species. It is arranged into four different sections, a
core, a head, a tail and a kinase-activity part, and the
number of subunits within each of these is what varies
with species. Overall, Mediator regulates the
transcriptional activity of RNA polymerase II but it
would appear that each of the four different sections
has a slightly different function. Mediator subunit
Hrs1/Med3 is a physical target for Cyc8-Tup1, a yeast
transcriptional co-repressor.
Length = 381
Score = 38.1 bits (88), Expect = 0.026
Identities = 48/249 (19%), Positives = 72/249 (28%), Gaps = 32/249 (12%)
Query: 52 GTTPQPSMFGTNTSTAQPATSLFGATENKPLFGQSTTTFGQPTNTFGTTNSLFGKPNTGF 111
GT Q G + S + + AT S+T N+ PNT
Sbjct: 117 GTYNQLGNAGASASITKTSNGSDAAT-------TSSTANTPAAAKVLKANAA-SAPNTT- 167
Query: 112 GAAVSTQSNTFSFNPTTFGQPNTTAIKPFGAAAPQTSNLFATNTSQATTAFGAPQTSGFG 171
V + + T + + TT P TT KP + + A +QA+
Sbjct: 168 -TGVGSAATTAAISATTATTPTTTQKKPRKPRQTKKTGPAAAAKAQASAQ---------- 216
Query: 172 SFGTNQNQSLNFNQNKPTFNLGGSTTSTGFGFGTNTATNNTSLFGQKPATNTFGLPTNTA 231
Q Q+ +NQ G +T T L N P N
Sbjct: 217 ----AQAQASAYNQM----GSLGVPQNTSMLAQIPNPTPLMQLLNGVSPNNAMASPLNNM 268
Query: 232 SPFGTSTSFGAQPAQTSTLFNSFNKPTPGFQFNTNTTPNTSLANNAFSGTLFGNTAKPAG 291
SP G Q ++ N G N + + + + + N PA
Sbjct: 269 SPMRNLNQMGNQNNGGQMTPSANN----GNMNNQSRENSMNQGMTPSASMINLNNITPAN 324
Query: 292 LFGNSTNTG 300
+ S N
Sbjct: 325 ILNMSLNLA 333
>gnl|CDD|237846 PRK14876, PRK14876, conjugal transfer mating pair stabilization
protein TraN; Provisional.
Length = 928
Score = 36.0 bits (83), Expect = 0.12
Identities = 35/195 (17%), Positives = 60/195 (30%), Gaps = 18/195 (9%)
Query: 123 SFNPTTFGQPNTTAIKPFGAAAPQTSNLFATNTSQATTAFGAPQTSGFGSFGTNQNQSLN 182
S+ Q T K + SN F +++ + T G F S+N
Sbjct: 26 SYANGAQAQDITAVGKEAQSFGQNLSNSFKSSSGTVQDGTISMPTLKDGQFQMGGGSSVN 85
Query: 183 FNQNKPTFNLGGSTTSTGFGFGTNTATNN-TSLFGQKPATNTFGLPTNTA-SPFGTSTSF 240
N P + G + F + ++ + G + S +
Sbjct: 86 VNDLFPGTS-GTNNKPDYFPDANKPDVDGLQGIYDSGDDMDDVGNSAKGSLWSDANSANP 144
Query: 241 GAQPAQTSTLFNSFNKPTPGFQFNTNTTPNTSLANNAFSGTLFGNTAKPAGLFGNSTNTG 300
A L +S N+ P F + P +L+ + + AG FG+ +
Sbjct: 145 SISGAAYKVLLDSSNRSRPDF----SNDPVLNLSKKTYE-----DMDLIAGGFGDCSAE- 194
Query: 301 GLFGNSTFGSNTGNA 315
+T NT NA
Sbjct: 195 -----TTINQNTINA 204
>gnl|CDD|177303 PHA00735, PHA00735, hypothetical protein.
Length = 808
Score = 33.3 bits (76), Expect = 0.78
Identities = 70/290 (24%), Positives = 104/290 (35%), Gaps = 58/290 (20%)
Query: 32 YVGTLQGSKEFLVSATGSF--FGTTPQPSMFGTNTSTA----QPATSLFGATENKPLFGQ 85
Y +Q + L SATG+ + +P P +F + + T F + N P+
Sbjct: 453 YTINIQLGQFSLGSATGTITIYPVSPLPVIFVSQYPHSVQLGTKVTITFQFSYNTPVANV 512
Query: 86 STTTFGQPTNTFG-------TTNSLFGKP----NTGFGAAVSTQSNT----FSFNPTTF- 129
+ + F Q T TF T++S+ + G + TQS F+F TF
Sbjct: 513 TMSAFTQTTTTFAWAYASIITSSSVVQFKAYWLSANDGILIITQSQNYLIPFNFTGLTFY 572
Query: 130 -GQPNTTAIKPFGAAAPQTSNLFATNTSQATTAFGAPQTSGFGSFGTN------------ 176
NT I A T N N + ++ G G G N
Sbjct: 573 NNSVNTIQINIVNNALQLTINGGTLNIANSSKVIGLGFYYGAGKLVLNWFFVSGIILQSA 632
Query: 177 -QNQ----SLNFNQNKPTFNLGGSTTSTGFGFGTNTATN------NTSLFGQKPATNTFG 225
NQ N + T G T S+GFG T T T + G T
Sbjct: 633 TANQAYVILTGTNPSTLTQYTTGYTNSSGFGQVTITLTYTPYELIDIYWAG---VTKYVL 689
Query: 226 LPTNTASPFGTSTSFGAQPAQTSTLFNSFNKPTPGFQFNTNTTPNTSLAN 275
L + P ++T+ TSTL ++ +P F+ N +PN+SL N
Sbjct: 690 LNISVTQPTTSTTT----TVNTSTLNYNYTQP-----FSNNISPNSSLYN 730
>gnl|CDD|218696 pfam05692, Myco_haema, Mycoplasma haemagglutinin. This family
consists of several haemagglutinin sequences from
Mycoplasma synoviae and Mycoplasma gallisepticum. The
major plasma membrane proteins, pMGAs, of Mycoplasma
gallisepticum are cell adhesin (hemagglutinin)
molecules. It has been shown that the genetic
determinants that code for the haemagglutinins are
organised into a large family of genes and that only one
of these genes is predominately expressed in any given
strain.
Length = 437
Score = 32.7 bits (75), Expect = 1.2
Identities = 33/149 (22%), Positives = 48/149 (32%), Gaps = 24/149 (16%)
Query: 146 QTSNLFATNTSQATTAF------GAPQTSGFGSFGTNQNQSLNFNQNKPTF--------- 190
Q + L N SQ + + G + N S N N + TF
Sbjct: 279 QFTGLTTDNGSQTSDPYYLIGFIGGSGNRTVANSVMNVQNSPNVNGDNRTFTIYVNAPKA 338
Query: 191 ---NLGGSTTST-----GFGFGTNTATNNTSLFGQKPATNTFGLPT-NTASPFGTSTSFG 241
+ GS S+ F T ++NN+ N L T +TA+ T++S
Sbjct: 339 GDYYISGSYLSSSNRNLKFSTSTTDSSNNSVTITVLSKNNWTTLGTFDTATTTTTNSSTV 398
Query: 242 AQPAQTSTLFNSFNKPTPGFQFNTNTTPN 270
+T TL NK N P
Sbjct: 399 TNNKKTLTLKKGLNKIVISGGTNNGNAPY 427
>gnl|CDD|222260 pfam13609, Porin_4, Gram-negative porin.
Length = 310
Score = 32.4 bits (74), Expect = 1.4
Identities = 31/212 (14%), Positives = 51/212 (24%), Gaps = 20/212 (9%)
Query: 148 SNLFATNTSQATTAFGAPQTSGFGSFGTNQN--------QSLNFNQNKPTFNLGGSTTST 199
+ + T GF G N + + +
Sbjct: 32 DGGATGSGATGETGLDNDSRVGFTGSGELDNGLTLGASFELEAGDNGGTGSFNRDAYVGL 91
Query: 200 GFGFGTNTATNNTSLFGQKPATNTFGLPTNTASPFGTSTSFGAQPAQTSTLFNSFNKPTP 259
FGT T + + S S G NS +P
Sbjct: 92 SGDFGTVTLGRQDTAADDAGVDYDYEGGGLGDSGSDGSGFDGDAGFTGRD-SNSVIYYSP 150
Query: 260 ---GFQFNTNTTPNTSLANNAFSGTLFGNTAK--PAGLFGNSTNTGGL---FGNSTFGSN 311
GF + N +G + G + GL +N G +
Sbjct: 151 KFGGFTAGASYALGEDGNTNGNNGGVAGGSDDTDAYGLGAGYSNGVGFSVAAAYQQTDNA 210
Query: 312 TGNANTFGQGLN---SSFNLGGTSNTFNNPLA 340
G+ + +G G + F LG + + A
Sbjct: 211 GGDQDAWGLGASYDFGGFTLGASYADIDVDAA 242
>gnl|CDD|221296 pfam11894, DUF3414, Protein of unknown function (DUF3414). This
family of proteins are functionally uncharacterized. This
protein is found in eukaryotes. Proteins in this family
are typically between 764 to 2011 amino acids in length.
This protein has a conserved LLG sequence motif.
Length = 1612
Score = 32.7 bits (75), Expect = 1.4
Identities = 14/78 (17%), Positives = 31/78 (39%), Gaps = 15/78 (19%)
Query: 751 LVKEMNRNYVSSDISKNILNIDETMDYIESSTEALARHMGTDPYKVQLMKQCFEDTFDNY 810
L+ ++ N ++S T+ Y+ SS + L RH+ + P +D
Sbjct: 1015 LLYKLCSNPLTSGP---------TLRYLRSSNDFLFRHLASLP------FIDPNTLWDGL 1059
Query: 811 GFPMSVSREFYIENKAHY 828
F ++ +F++ +
Sbjct: 1060 SFALANDNKFFLSSSIGA 1077
>gnl|CDD|131542 TIGR02489, flgE_epsilon, flagellar hook protein FlgE, epsilon
proteobacterial. Members of this family are flagellar
hook proteins, designated FlgE, as found in the epsilon
subdivision of the Proteobacteria (Helicobacter,
Wolinella, and Campylobacter). These proteins differ
significantly in architecture from proteins designated
FlgE in other lineages; the N-terminal and C-terminal
domains are homologous, but members of this family only
contain a large central domain that is surface-exposed
and variable between strains.
Length = 719
Score = 32.2 bits (73), Expect = 1.9
Identities = 36/152 (23%), Positives = 47/152 (30%), Gaps = 27/152 (17%)
Query: 116 STQSNTFSFNPTTFGQPNTTAIKPFGAA------APQTSNLFATNTSQATTAFGAPQTSG 169
T+ S N T++ N T F A P T + Q A +
Sbjct: 430 DTEPKNLSLNVTSYSNKNVTTNVLFKAIMKALNTGPLTEGGSIKESEQLKLAAHSAGLDI 489
Query: 170 FGSFGTNQNQSLNFNQNKP-------------TFNLGGSTTSTGFGFGTNTATNNTSLFG 216
S G+ + F + FN G F G N+ SL G
Sbjct: 490 VDSLGSKHTLEVQFVKQGGNEWSFIIRVPEPAEFNGGSGEGPNIFEGGRARFNNDGSLAG 549
Query: 217 QKPATNTFGLPTNTASP-------FGTSTSFG 241
P T F P N A+P FGTS +F
Sbjct: 550 MNPPTIQFS-PKNGAAPPQRIKLSFGTSGTFD 580
>gnl|CDD|218439 pfam05109, Herpes_BLLF1, Herpes virus major outer envelope
glycoprotein (BLLF1). This family consists of the BLLF1
viral late glycoprotein, also termed gp350/220. It is
the most abundantly expressed glycoprotein in the viral
envelope of the Herpesviruses and is the major antigen
responsible for stimulating the production of
neutralising antibodies in vivo.
Length = 830
Score = 32.1 bits (72), Expect = 2.1
Identities = 62/361 (17%), Positives = 106/361 (29%), Gaps = 38/361 (10%)
Query: 63 NTSTAQPATSLFGATENKPLFGQSTTTFGQPTNTFGTTNSLFGKPNTGFGAAVSTQSNTF 122
+ +TA P T P + T G PT T +T P + ++ +
Sbjct: 444 HKTTAVPTTPSLPPASTGPTVSTADPTSGTPTGTTSSTLPEDTSPTS----RTTSATPNA 499
Query: 123 SFNPTTFGQPNTTAIKPFGAAAPQTSNLFATNTSQATTAFGAPQTSGFGSFGTNQNQSLN 182
+ PN T+ + + TT +P T G+ S
Sbjct: 500 TSPTPAVTTPNATSPTTQKTSDTPNATSPTPIVIGVTTTATSPPT---GTTSVPNATSPQ 556
Query: 183 FNQNKPTFNLGGSTTSTGFGFGTNTATNNTSLFGQKPATNTFGLPTNTAS---------- 232
+ P N ++ T+ T G P + G+P+++ S
Sbjct: 557 VTEESPVNNTNTPVVTSAPSVLTSAVTTGQHGTGSSPTSQQPGIPSSSHSTPRSNSTSTT 616
Query: 233 -------PFGTSTSFGAQPAQTSTLFNSFNKPTPGFQFNTNTTPNTSLANNAFSGTLFGN 285
P G P+ ST S P PG TT S N+ + G
Sbjct: 617 PLLTSAHPTGGENITEETPSVPSTTHVSTLSPGPG----PGTTSQVSGPGNSSTSRYPGE 672
Query: 286 TAKPAGLFG-NSTNTGGLFGNSTFGSNTGNANTFGQGLNSSFNLGGTSNTFNNPLAPQAP 344
G+ N+T+ G T A N + + +A +P
Sbjct: 673 VHVTEGMPNPNATSPSAPSGQKT-------AVPTVTSTGGKANSTTKETSGSTLMASTSP 725
Query: 345 PASNSVNTQQILNLASLPYFSAPTLYKGVLPATS--SSSKLDELILPAFQKNRINNKNVT 402
+ N + S + + TS ++K + +P Q +N ++
Sbjct: 726 HTNEGAFRTTPYNATTYLPPSTSSKLRPRWTFTSPPVTTKQATVPVPPTQHPDHSNLSML 785
Query: 403 V 403
V
Sbjct: 786 V 786
>gnl|CDD|162022 TIGR00754, bfr, bacterioferritin. Bacterioferritin, predominantly
an iron-storage protein restricted to Bacteria, has also
been designated cytochrome b1 and cytochrome
b-557.Bacterioferritin is a homomultimer most species.
In Neisseria gonorrhoeae, Synechocystis PCC6803,
Magnetospirillum magnetotacticum, and Pseudomonas
aeruginosa, two types of subunit are found in a
heteromultimeric complex, with each species having one
member of each type. At present, both types of subunit
are including in this single model [Transport and
binding proteins, Cations and iron carrying compounds].
Length = 157
Score = 30.9 bits (70), Expect = 2.1
Identities = 17/57 (29%), Positives = 31/57 (54%), Gaps = 2/57 (3%)
Query: 746 LRNEILVKEMNRNYVSSDISKNIL-NIDETMDYIESSTEALARHMGTDPYKVQLMKQ 801
L+ I E R+YVS D+ + IL + +E +D++E+ E L +G + Y + +
Sbjct: 101 LKEAIAYAEEVRDYVSRDLLEEILEDEEEHIDWLETQLE-LIDKLGLENYLQAQVSE 156
>gnl|CDD|220308 pfam09603, Fib_succ_major, Fibrobacter succinogenes major domain
(Fib_succ_major). This domain of about 175 to 200 amino
acids is found, in from one to five copies, in over 50
proteins in Fibrobacter succinogenes S85, an obligate
anaerobe of the rumen. Many members of this family have
an apparent lipoprotein signal sequence. Conserved
cysteine residues, suggestive of disulfide bond
formation, are also consistent with an extracytoplasmic
location for this domain. This domain can also be found
in small numbers of proteins in Chlorobium tepidum and
Bacteroides thetaiotaomicron.
Length = 181
Score = 30.8 bits (70), Expect = 2.3
Identities = 19/67 (28%), Positives = 24/67 (35%), Gaps = 6/67 (8%)
Query: 192 LGGSTTSTGFGFGTNTATNNTSLFGQKPA-----TNTFGLPTNTASPFGTSTSFGAQPAQ 246
GG +TG G N N++ F PA +F A F +ST G A
Sbjct: 95 AGGKLKATGTWKGPNGGATNSTGFNALPAGYRDCDGSFFAEGKYAY-FWSSTESGTASAW 153
Query: 247 TSTLFNS 253
L S
Sbjct: 154 ARFLGYS 160
>gnl|CDD|204032 pfam08703, PLC-beta_C, PLC-beta C terminal. This domain
corresponds to the alpha helical C terminal domain of
phospholipase C beta.
Length = 181
Score = 30.5 bits (69), Expect = 2.8
Identities = 14/54 (25%), Positives = 21/54 (38%), Gaps = 12/54 (22%)
Query: 736 EKIS-PSPKQKLRNEILVKEMNRNYVSSDISKNILNIDETMDYIESSTEALARH 788
E+I K+K E E+NR++ I E + I+ EA R
Sbjct: 78 ERIQEAKTKEKHAQEREKTEINRSH-----------IQEVVQSIKRLEEAQKRR 120
>gnl|CDD|225142 COG2233, UraA, Xanthine/uracil permeases [Nucleotide transport and
metabolism].
Length = 451
Score = 31.4 bits (72), Expect = 2.9
Identities = 16/47 (34%), Positives = 19/47 (40%), Gaps = 6/47 (12%)
Query: 273 LANNAFSG-TLFGNTAKPAGLFGNSTNT--GGLFG---NSTFGSNTG 313
A +G L G GL + T GLFG N+TF N G
Sbjct: 266 TATGEITGRDLDGKPRLRRGLLADGLATLIAGLFGGFPNTTFAQNIG 312
>gnl|CDD|234371 TIGR03839, termin_org_P1, adhesin P1. Members of this protein family
are the major adhesin of the Mycoplasma terminal
organelle. The protein is called adhesin P1, cytadhesin
P1, P140, attachment protein, and MgPa, with locus names
MG191 in Mycoplasma genitalium and MPN141 in M.
pneumoniae. A conserved C-terminal region is shared by
additional paralogs in M. pneumoniae and M.
gallisepticum, as well as by the member of this family
[Cell envelope, Surface structures, Cellular processes,
Pathogenesis].
Length = 1425
Score = 31.7 bits (71), Expect = 3.2
Identities = 40/185 (21%), Positives = 64/185 (34%), Gaps = 38/185 (20%)
Query: 273 LANNAFSGTLFGNTAKPAGLFGNSTNTGGLFGNSTFGSNTGNANTFGQGLNS-------- 324
L N A + T + T G FG++T + G +N +A GLN
Sbjct: 956 LYNTALNITYYNLTR---GDFGSNTPSTPGIGFKPEQNNDSDATLVSSGLNWTPQDVGNL 1012
Query: 325 --------SFNLGGTSNTFNNPLAPQA--PPASNSVNTQQILNLASLPYFSAPTLY---- 370
F LGG TF + P+ + Q+ L A P+
Sbjct: 1013 VVSTGTQQGFQLGGWYVTFTDFTPPRQGYLGLQLTGLDQRALIWAPRPWQDFKGSTTTAS 1072
Query: 371 ---------KGV--LPATSSSSKLDELILPAFQKNRINNKNVTVNSNASYNKIKIKPVGS 419
G P S++ +++ FQK + N + + NS+ Y ++ PV
Sbjct: 1073 GAVISNWDKAGYSSRPDDSTTFNTSKILPEPFQKYKPNTSSGSTNSS-PYLEL-TAPVNV 1130
Query: 420 TSAST 424
TS +
Sbjct: 1131 TSLDS 1135
>gnl|CDD|220819 pfam10599, Nup_retrotrp_bd, Retro-transposon transporting motif.
This is the highly conserved C-terminal motif
GRKIxxxxxRRKx of nucleoporins that plays a critical and
unique role in the nuclear import of retro-transposons
in both yeasts and higher organisms. It would appear
that the arginine residues at positions 2 and 9-10
constitute a bipartite nuclear localisation signal, with
two basic peptide motifs separated by an interchangeable
spacer sequence, that is crucial for the
retro-transposon activity.
Length = 101
Score = 29.3 bits (65), Expect = 3.4
Identities = 17/74 (22%), Positives = 25/74 (33%), Gaps = 10/74 (13%)
Query: 174 GTNQNQSLNFNQNKPTFNLGGSTTSTGFGFGTNTATN---------NTSLFGQKPATNTF 224
N N + NF + ST ST F FG + +T N F PA+
Sbjct: 13 MGNSNTNFNFAGSNNQ-PHTSSTPSTPFTFGASPSTPANAASPSGSNPPAFSGSPASAPN 71
Query: 225 GLPTNTASPFGTST 238
+ + S+
Sbjct: 72 QFNVGSNNSNMFSS 85
>gnl|CDD|218761 pfam05815, DUF844, Baculovirus protein of unknown function (DUF844).
This family consists of several Baculovirus sequences of
between 350 and 380 residues long. The family has no
known function.
Length = 359
Score = 30.7 bits (70), Expect = 4.0
Identities = 20/91 (21%), Positives = 32/91 (35%), Gaps = 14/91 (15%)
Query: 1017 ENPKSIASKDPEQNIFQLVTQHKLHEAIVSAQQSDNFYLSGALSMLGCSEIGQAFMQAQL 1076
N + P + + L + + VSA S + + + Q L
Sbjct: 80 ANAAPATASSPRRPVLNLFAPARAARSAVSATNSID------MKKY------RKACQKLL 127
Query: 1077 SHYV--NATSDSFINKDILKLMLLTSGQPLY 1105
+Y + TS F KDI+ M+ S P Y
Sbjct: 128 QYYTSASTTSTEFKVKDIVMCMIYLSHSPKY 158
>gnl|CDD|227270 COG4934, COG4934, Predicted protease [Posttranslational modification,
protein turnover, chaperones].
Length = 1174
Score = 31.3 bits (71), Expect = 4.0
Identities = 77/490 (15%), Positives = 130/490 (26%), Gaps = 72/490 (14%)
Query: 82 LFGQSTTTFGQPTNTFGTTNSLFG---------KPNTGFGAAVSTQSNTFSFNPTTFG-- 130
L + +FG T G S G T + F ++P+
Sbjct: 601 LLAFGSLSFGG-TYWGGNFVSTGGNYNIVASNLSGMTYITFNYTIS---FIYSPSYLAPT 656
Query: 131 -QPNTTAIKPFGAAAPQTSNLFATNTSQATTAFGAP-------QTSGFGSFGTNQNQSLN 182
N+ + F N + GF G ++ Q
Sbjct: 657 FLGNSYSYDYFLPNIGIMRPAVGVNMIGVYDNLVSLSVPILNVNPEGFAYPGLSEGQYR- 715
Query: 183 FNQNKPTFNLGGSTTSTGFGFGTN--TATNNTSLFGQKPATNTFGLPTNTAS-PFGTSTS 239
+G G G +N + + S +T
Sbjct: 716 ------YSGIGPQYDYLGIGVMSNGSPVAQAVVFLISVNVSQLTPIIQYVISYAPPFTTG 769
Query: 240 FGAQPAQTSTLFNSFNKPTPGFQFNTNTTPNTSLANNAFSG---TLFGNTAKPAGLFGNS 296
T++ + N + N TP T N +SG + L
Sbjct: 770 LFLSNGTAYTVYWNGNLIA---ESNGTLTPQTIQFNTTYSGSNTVTNQTIPQVGLLIPLF 826
Query: 297 TNTGGLFGNST--FGSNTGNANTFGQGLNSSFNLGGTSNTFNNPLAPQAPPASNSVNTQQ 354
G + +S N G G + +G T F + P S S N
Sbjct: 827 KFVYGYYYSSAIATIDAKYVFNE-GNGPGAYIYVGSTPLYFF---SAIIYPNSLSYNIYV 882
Query: 355 ILNLASLPYFSAPTL----YKGVLPATSSSSKLDELILPAFQKNRINNKNVTVNSNASYN 410
I ++A +P TL ++P TSS S + + +
Sbjct: 883 IGSIAIIPLPYNATLLEWVGPAIIPLTSSGSNFTFSFGYYVIQFPPGIYTINTSIPGLDP 942
Query: 411 KIKIKPVGSTSASTKSLFDGLSSDTDDSEKYFNMIKSAPKRLIILPK----KTKPMTLKV 466
+ S + S LSS K K T +
Sbjct: 943 YSSLINSKSGTVSNLQ-IYFLSS--------VPTSGLTGKSSDGGIKNFVIDVLVNTNGI 993
Query: 467 NNVTTAETDVTVVSNNSSTPVSSSTQLQTPVSSKLTLNVAELKENISPSVKVSDLKVIKS 526
+ + + V+++ S+ +S S+Q + K N+ + NI + + S
Sbjct: 994 SAINNGTGNYYVIASVSNGTISFSSQ----IYGKDVYNITVAEGNI------TSVNSALS 1043
Query: 527 PLNTSIGSET 536
L S+ S T
Sbjct: 1044 NLIVSLSSTT 1053
>gnl|CDD|179543 PRK03137, PRK03137, 1-pyrroline-5-carboxylate dehydrogenase;
Provisional.
Length = 514
Score = 31.1 bits (71), Expect = 4.1
Identities = 11/24 (45%), Positives = 13/24 (54%), Gaps = 2/24 (8%)
Query: 830 GGSLFKTGNFIQPTV--DLSPSAR 851
G G FIQPT+ D+ P AR
Sbjct: 387 GEGDDSKGYFIQPTIFADVDPKAR 410
>gnl|CDD|131633 TIGR02584, cas_NE0113, CRISPR-associated protein, NE0113 family.
Members of this minor CRISPR-associated (Cas) protein
family are found in cas gene clusters in Vibrio
vulnificus YJ016, Nitrosomonas europaea ATCC 19718,
Mannheimia succiniciproducens MBEL55E, and
Verrucomicrobium spinosum [Mobile and extrachromosomal
element functions, Other].
Length = 209
Score = 30.2 bits (68), Expect = 4.5
Identities = 20/73 (27%), Positives = 30/73 (41%), Gaps = 13/73 (17%)
Query: 1003 RKDNLTEWLESVMEENPKSIASKDPEQN------IFQLVTQ------HKLHEAIVSAQQS 1050
D L +L + P + + P N I Q V H+LH +I +++
Sbjct: 73 PFDELRIYLIPTGQRKPLADI-RTPADNEAAANFIVQTVAPLCAAQDHQLHASIAGGRKT 131
Query: 1051 DNFYLSGALSMLG 1063
FYL ALS+ G
Sbjct: 132 MGFYLGYALSLFG 144
>gnl|CDD|187817 cd09686, Csx1_III-U, CRISPR/Cas system-associated protein Csx1.
CRISPR (Clustered Regularly Interspaced Short Palindromic
Repeats) and associated Cas proteins comprise a system
for heritable host defense by prokaryotic cells against
phage and other foreign DNA; Protein of this family often
fused to HTH domain; Some proteins could have an
additional fusion with RecB-family nuclease domain; Core
domain appears to have a Rossmann-like fold; loosely
associated with CRISPR/Cas systems; also known as NE0113
family.
Length = 209
Score = 30.2 bits (68), Expect = 4.8
Identities = 20/73 (27%), Positives = 30/73 (41%), Gaps = 13/73 (17%)
Query: 1003 RKDNLTEWLESVMEENPKSIASKDPEQN------IFQLVTQ------HKLHEAIVSAQQS 1050
D L +L + P + + P N I Q V H+LH +I +++
Sbjct: 73 PFDELRIYLIPTGQRKPLADI-RTPADNEAAANFIVQTVAPLCAAQDHQLHASIAGGRKT 131
Query: 1051 DNFYLSGALSMLG 1063
FYL ALS+ G
Sbjct: 132 MGFYLGYALSLFG 144
>gnl|CDD|224544 COG1629, CirA, Outer membrane receptor proteins, mostly Fe
transport [Inorganic ion transport and metabolism].
Length = 768
Score = 30.6 bits (68), Expect = 5.2
Identities = 40/228 (17%), Positives = 62/228 (27%), Gaps = 19/228 (8%)
Query: 86 STTTFGQPTNTFGTTNSLFGKP--NTGFGAAVSTQSNTFSFNPTTFGQPNTTAIKPFGAA 143
N SL KP +T S + + F+ +G P A
Sbjct: 237 GGRDGDNLGNRTAAAPSLTWKPDDDTRLTFQGSYNKDNYGFDRGGYGLPEGGT----VFA 292
Query: 144 APQTSNLFATNTSQATTAFGAPQTSGFGSFGTNQN--QSLNFNQNKPTFNLGGSTTSTGF 201
P L + ++ A G N N + + +
Sbjct: 293 PPAGGYLGDPDDNETNPATARLGLDFEHDLGDNDNWTLRAGLGYAQLAKSANSDYATGLT 352
Query: 202 GFGTNTATNNTSLFGQKPATNTFGLPTNTASPFGT-----STSFGAQPAQTSTLFNSFNK 256
T Q +TFGL T+ + F T FGA ++T
Sbjct: 353 SLDTGRTATRRYGADQDLDLDTFGLDTDLEAEFATGAIGHELVFGADYSRTDNSAGVDR- 411
Query: 257 PTPGFQFNTNTTPNTSLANNAFSGTLFGNTAKPAGLFGNSTNTGGLFG 304
N +P + N + F A P N+ +T GL+
Sbjct: 412 ----GYGTGNGSPPLDIFNPVYGNPAFVLAALPT-TSTNTQDTLGLYA 454
>gnl|CDD|221318 pfam11925, DUF3443, Protein of unknown function (DUF3443). This
family of proteins are functionally uncharacterized.
This protein is found in bacteria. Proteins in this
family are typically between 400 to 434 amino acids in
length. This protein has two conserved sequence motifs:
NPV and DNNG.
Length = 369
Score = 30.4 bits (69), Expect = 5.4
Identities = 34/145 (23%), Positives = 54/145 (37%), Gaps = 18/145 (12%)
Query: 188 PTFNLGGSTTSTG-FGFGTNTATNNTSLFGQKPATNTFGLPTNTASPFGTSTSFGAQPAQ 246
P + G+ ++TG FG T +NN T++ G T G S + +
Sbjct: 206 PAVSATGAASATGSLIFGIGTQSNNALPGATVLTTDSNG--DFTTVFNGRSFTASFIDSG 263
Query: 247 TSTLF---NSFNKPTPGFQFNTNTTPNTSLANNAFSGTLFG-NTAKPAGLFGNSTNTGGL 302
++ LF +S + + T+ T S TL G N A A + + N L
Sbjct: 264 SNGLFFPDSSIPTCSSNTGWYCPTSTLT------LSATLTGTNGAASATVSFSIANADTL 317
Query: 303 FGNSTF-----GSNTGNANTFGQGL 322
F G G+++ F GL
Sbjct: 318 FSTGNAAFNDLGGPFGSSSMFDWGL 342
>gnl|CDD|225483 COG2931, COG2931, RTX toxins and related Ca2+-binding proteins
[Secondary metabolites biosynthesis, transport, and
catabolism].
Length = 510
Score = 30.3 bits (68), Expect = 6.0
Identities = 53/315 (16%), Positives = 74/315 (23%), Gaps = 48/315 (15%)
Query: 60 FGTNTSTAQ----PATSLFGATENKPLFGQSTTTF------GQPTNTFGTTNSLFGKPNT 109
GTN S +L G N L G + ++L G
Sbjct: 81 NGTNGSDTLRGGAGNDALTGGAGNDYLDGGAGNDTLFGGDGNDALVGGAGNDTLDGGAGA 140
Query: 110 GFGAAVSTQSNTFSFNPTTF--GQPNTTAIKPFGAAAPQTSNLFATNTSQATTAFGAPQT 167
T ++T G N T + + T G
Sbjct: 141 NDTLDGGTGNDTLQNGNDILIGGSGNDT----LTGNSGADDRVINGGAGDDTILGGGNGD 196
Query: 168 SGFGSFGTNQNQSLNF-NQNKPTFNLGGSTTSTGFGFGTNTATNNTSLFGQK-------- 218
G+ N +L+ N G G G G
Sbjct: 197 DTVFGGGSGGNATLDGGNGGDSLITGGSGADVAAGGNGAVLIDGVVFADGTTWANLFLQA 256
Query: 219 -------PATNTFGLPTNTASPFGTS-TSFGAQPAQTSTLFNSFNKPTPGFQFNTNTTPN 270
T G+ + G + A TL G N
Sbjct: 257 LAIVGGAGNDFTVGIGNLNDTLKGGAGNDTLLGGAGNDTLTIG----GGGNDTLDGGAGN 312
Query: 271 TSLANNAFSGTLFGNT--AKPAGLFGNSTNTGGLF---------GNSTFGSNTGNANTFG 319
+L + T++G G GN T G GN T N GN G
Sbjct: 313 DTLDFSGGDDTIYGGAGNDTLDGGAGNDTLAGNAGALALLNGGDGNDTISGNDGNDTLIG 372
Query: 320 QGLNSSFNLGGTSNT 334
N + + G S+T
Sbjct: 373 GAGNDTLSGGAGSDT 387
>gnl|CDD|227614 COG5295, Hia, Autotransporter adhesin [Intracellular trafficking
and secretion / Extracellular structures].
Length = 715
Score = 30.2 bits (68), Expect = 8.0
Identities = 33/183 (18%), Positives = 53/183 (28%), Gaps = 15/183 (8%)
Query: 141 GAAAPQTSNLFATNTSQATTAFG--APQTSGFGSFGTNQNQSLNFNQ-NKPTFNLGGSTT 197
G A S+ A T+ A G A ++ S + N G+
Sbjct: 130 GTGAASASSTLAAGTTIRAVATGKTAVLSTASASGVQAKATDSALGFENTAVDQQAGAAG 189
Query: 198 STGFGFGTNTATNNTSLFGQKPATNTFGLPTNTASPFGTSTSFGAQPAQTSTLFNSFNKP 257
+T T+ G T+ + + A T+++ N+
Sbjct: 190 LGNAAENADTNEARTNDLGDS--------LTDDGNQSTIAVGSVGLKAGTNSVANAAGNS 241
Query: 258 TP---GFQFNTNTTPNTSLANNAFSGTLFGNTAKPAGLFGNSTNTGGLFGNSTFGSNTGN 314
T G + T T+ + A SG TA S +G + GSN
Sbjct: 242 TDAAIGSGLQSVLTAATAQGSAALSGGN-NGTAVIGKNNTLSGTSGANSIAAAAGSNGDT 300
Query: 315 ANT 317
A
Sbjct: 301 AIV 303
>gnl|CDD|133139 cd05472, cnd41_like, Chloroplast Nucleoids DNA-binding Protease,
catalyzes the degradation of ribulose-1,5-bisphosphate
carboxylase/oxygenase. Chloroplast Nucleoids
DNA-binding Protease catalyzes the degradation of
ribulose-1,5-bisphosphate carboxylase/oxygenase
(Rubisco) in senescent leaves of tobacco. Antisense
tobacco with reduced amount of CND41 maintained green
leaves and constant protein levels, especially Rubisco.
CND41 has DNA-binding as well as aspartic protease
activities. The pepsin-like aspartic protease domain is
located at the C-terminus of the protein. The enzyme is
characterized by having two aspartic protease catalytic
site motifs, the Asp-Thr-Gly-Ser in the N-terminal and
Asp-Ser-Gly-Ser in the C-terminal region. Aspartic
proteases are bilobal enzymes, each lobe contributing a
catalytic Asp residue, with an extended active site
cleft localized between the two lobes of the molecule.
One lobe may be evolved from the other through ancient
gene-duplication event. This family of aspartate
proteases is classified by MEROPS as the peptidase
family A1 (pepsin A, clan AA).
Length = 299
Score = 29.9 bits (68), Expect = 8.2
Identities = 25/86 (29%), Positives = 32/86 (37%), Gaps = 15/86 (17%)
Query: 189 TFNLGGSTTSTGFGFGTNTATNNTSLFGQKPATNTFGL-------PTNTASPFGTSTSF- 240
T LG S GF FG +N LFG A GL P+ TAS +G S+
Sbjct: 54 TLTLGSSDVVPGFAFG--CGHDNEGLFGG--AAGLLGLGRGKLSLPSQTASSYGGVFSYC 109
Query: 241 --GAQPAQTSTL-FNSFNKPTPGFQF 263
+ + L F + G F
Sbjct: 110 LPDRSSSSSGYLSFGAAASVPAGASF 135
>gnl|CDD|237019 PRK11907, PRK11907, bifunctional 2',3'-cyclic nucleotide
2'-phosphodiesterase/3'-nucleotidase precursor protein;
Reviewed.
Length = 814
Score = 29.8 bits (67), Expect = 9.1
Identities = 23/89 (25%), Positives = 40/89 (44%), Gaps = 5/89 (5%)
Query: 459 TKPMT-LKVNNVTTAETDVTVVSNNSSTPVSSSTQLQTPVSSKLTLNVAELKENISPSVK 517
T P T + T E+D T ++N+ TPV+++T + P SS AE + S +
Sbjct: 34 TTPATSTEAEQTTPVESDATEEADNTETPVAATTAAEAPSSS----ETAETSDPTSEATD 89
Query: 518 VSDLKVIKSPLNTSIGSETSEGDTFDFKL 546
+ + + S+ EG T D ++
Sbjct: 90 TTTSEARTVTPAATETSKPVEGQTVDVRI 118
>gnl|CDD|226406 COG3889, COG3889, Predicted solute binding protein [General
function prediction only].
Length = 872
Score = 29.8 bits (67), Expect = 9.2
Identities = 24/109 (22%), Positives = 36/109 (33%), Gaps = 4/109 (3%)
Query: 177 QNQSLNFNQNKPTFNLGGSTTSTG-FGFGTNTATNNTSLFGQKPATNTFGLPTNTASPFG 235
+ + P G S T++G F + T A + + T S
Sbjct: 747 TTKRVKVRIMDPASGTGTSITTSGTFTAEVPQSPTKTETTLSYSAYSNTSILIETTSVVI 806
Query: 236 TSTSFGAQPAQTSTLFNSFNKPTPGFQFNTNTTPNTSLANNAFSGTLFG 284
T T Q +S PT Q +T+TT TS + G + G
Sbjct: 807 TKTVTQTQTTTSSPSPTQTTSPT---QTSTSTTTTTSPSQTTTGGGICG 852
Database: CDD.v3.10
Posted date: Mar 20, 2013 7:55 AM
Number of letters in database: 10,937,602
Number of sequences in database: 44,354
Lambda K H
0.312 0.128 0.367
Gapped
Lambda K H
0.267 0.0707 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 54,076,098
Number of extensions: 5142279
Number of successful extensions: 3659
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3546
Number of HSP's successfully gapped: 118
Length of query: 1121
Length of database: 10,937,602
Length adjustment: 107
Effective length of query: 1014
Effective length of database: 6,191,724
Effective search space: 6278408136
Effective search space used: 6278408136
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 64 (28.5 bits)