RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy17507
         (1121 letters)



>gnl|CDD|202886 pfam04096, Nucleoporin2, Nucleoporin autopeptidase. 
          Length = 137

 Score =  173 bits (441), Expect = 2e-50
 Identities = 66/144 (45%), Positives = 92/144 (63%), Gaps = 7/144 (4%)

Query: 550 GYFVSPTLNELKQLKTKNGPCIVEGFTIGHEEFGSICFLESIDISELNNVDLDSIVEFRF 609
           GY+ SP++ EL ++ +      VE FT+G + +GSI FL  +D+S    +DLD IV+F  
Sbjct: 1   GYWTSPSIEELSKM-SLEELSSVENFTVGRKGYGSIEFLGPVDLS---GIDLDKIVDFGP 56

Query: 610 KEVVLYPDDANKPPVGTALNRPAKVTLLKVWPKDKSTGELIKDVQKIKNMNYEAIVIRAC 669
           + V +YPDD NKPPVG  LN PA +TL   +P+DKST + IKD  K +   +E  + R  
Sbjct: 57  RSVEVYPDDENKPPVGEGLNVPATITLENCYPRDKSTKKPIKDPSKPR---FEKHLKRLK 113

Query: 670 TRMNVEFVSYQPETGSWCFKVQHF 693
            +   EF+SY PETG+W FKV+HF
Sbjct: 114 EKKGAEFISYDPETGTWVFKVKHF 137


>gnl|CDD|221425 pfam12110, Nup96, Nuclear protein 96.  Nup96 (often known by the name
            of its yeast homolog Nup145C) is part of the Nup84
            heptameric complex in the nuclear pore complex. Nup96
            complexes with Sec13 in the middle of the heptamer. The
            function of the heptamer is to coat the curvature of the
            nuclear pore complex between the inner and outer nuclear
            membranes. Nup96 is predicted to be an alpha helical
            solenoid. The interaction between Nup96 and Sec13 is the
            point of curvature in the heptameric complex.
          Length = 290

 Score = 75.1 bits (185), Expect = 2e-14
 Identities = 29/96 (30%), Positives = 48/96 (50%), Gaps = 1/96 (1%)

Query: 1026 DPEQNIFQLVTQHKLHEAIVSAQQSDNFYLSGALSMLGCSEIGQAFMQAQLSHYVNATSD 1085
             P + IF L++ +++ EA   A +S+NF+L+  LS +G     ++  Q QLS +      
Sbjct: 1    SPLEKIFLLLSGNRIVEACELAIESNNFHLATLLSQIGNDPSVRSLAQQQLSKWRQNGVL 60

Query: 1086 SFINKDILKLMLLTSGQPLYDGSNGIINVCESLDWK 1121
            S I++ I K+  L +G       +   N  E LDW 
Sbjct: 61   SEIDEPIRKIYELLAGNVFRSLGDSGENEFE-LDWL 95


>gnl|CDD|227938 COG5651, COG5651, PPE-repeat proteins [Cell motility and
           secretion].
          Length = 490

 Score = 44.9 bits (106), Expect = 2e-04
 Identities = 47/301 (15%), Positives = 76/301 (25%), Gaps = 16/301 (5%)

Query: 52  GTTPQPSMFGTNTSTAQPATSLFGATENKPLFGQSTTTFGQPTNTFGTTNSLFGKPNTGF 111
              P P+   +  +    A    G   +                   T + L        
Sbjct: 168 FNEPPPTTNSSGLAAQASAVQALGDLASGITLASQVNLSLLELINPATLSGLANGGTGNL 227

Query: 112 GAAVSTQSNTFSFNPTTFGQ-PNTTAIKPFGAAAPQTSNLFATNTSQATTAFGAPQTS-G 169
           G     Q+    F    FG   +     P  A+    +NL     S          T+ G
Sbjct: 228 GIGALQQAQNLGFGNVGFGNLGSGNPGAPGLASQFSATNLGTLLGSLNPYLGNIGATNIG 287

Query: 170 FGSFGTNQNQSLNFNQNKP---TFNLGGSTTSTGFGFGTNTATNNTSLFGQKPATNT-FG 225
             + GT    S N   +        +G ++ +T      N      +  G     N+   
Sbjct: 288 LAAAGTGNIGSGNAVDSGGSALVGAIGQTSQATANAGSVNATGGAAAGSGNLGVANSGSA 347

Query: 226 LPTNTASPFGTSTSFGAQPAQTST-LFNSFNKPTPG--------FQFNTNTTPNTSLANN 276
                 +    +   GA     +  L N+      G             +   NT   N+
Sbjct: 348 AAPFGIAGANQAALGGANSGAGNFGLGNNPGGGLGGKPLGGTGNGGIGASGIGNTGYGNS 407

Query: 277 AFSGTLFGNTAKPAGLFGNSTNTGGLFGNSTFGSNTGNANTFGQGLNSSFNLGGTSNTFN 336
             +     N         N+ N     G +    N G+ N    G     N G  +  F 
Sbjct: 408 GIANAGLSNAGSNNAGGENAGNANNTGGGNVGLWNAGDFNAGAAGTG-FTNNGSYNTGFL 466

Query: 337 N 337
           N
Sbjct: 467 N 467



 Score = 44.5 bits (105), Expect = 3e-04
 Identities = 46/253 (18%), Positives = 64/253 (25%), Gaps = 18/253 (7%)

Query: 60  FGTNTSTAQPATSLFGATENKPLFGQSTTTFGQPTNTFGTTNSLFGKPNTGFGAAVSTQS 119
            G+    A    S F AT    L G      G    T     +     N G G AV +  
Sbjct: 248 LGSGNPGAPGLASQFSATNLGTLLGSLNPYLGNIGATNIGLAAAGT-GNIGSGNAVDSGG 306

Query: 120 NTFSFNPTTFGQP--NTTAIKPFGAAAPQTSNLFATNTSQATTAFG--APQTSGFGSFGT 175
           +          Q   N  ++   G AA  + NL   N+  A   FG      +  G   +
Sbjct: 307 SALVGAIGQTSQATANAGSVNATGGAAAGSGNLGVANSGSAAAPFGIAGANQAALGGANS 366

Query: 176 NQNQSLNFNQNKPTFNLGGSTTSTGFGFGTNTATNNTSLFGQKPATNTFGLPTNTASPFG 235
                   N             +   G G +   N                  N      
Sbjct: 367 GAGNFGLGNNPGGGLGGKPLGGTGNGGIGASGIGNTGY---------GNSGIANAGLSNA 417

Query: 236 TSTSFGAQPAQTSTLFNSFNKPTP--GFQFNTNTTPNTSLANNAFSGTLFGNTAKPAGLF 293
            S + G + A  +      N      G  FN          N +++   F N        
Sbjct: 418 GSNNAGGENAGNANNTGGGNVGLWNAGD-FNAGAAGTGFTNNGSYNTG-FLNFGNDNTGI 475

Query: 294 GNSTNTGGLFGNS 306
            N  NT     N 
Sbjct: 476 FNGGNTSNGTFNQ 488



 Score = 43.4 bits (102), Expect = 6e-04
 Identities = 48/296 (16%), Positives = 72/296 (24%), Gaps = 41/296 (13%)

Query: 59  MFGTNTSTAQPATSLFGATENKPLFGQSTTTFGQPTNTFGTTNSLFGKPNTGFGAAVSTQ 118
           M+G   + +  A++L     N+P    +++      +       L    +    A+    
Sbjct: 151 MYGYYAAASAAASAL--TPFNEPPPTTNSSGLAAQASAVQALGDL---ASGITLASQVNL 205

Query: 119 SNTFSFNPTTFGQPNTTAIKPFGAAAPQTSNLFATNTSQATTAFGAPQTSGFGSFGTNQN 178
           S     NP T            G  A Q     A N       FG   +   G+ G    
Sbjct: 206 SLLELINPATLSGLANGGTGNLGIGALQQ----AQNLGFGNVGFGNLGSGNPGAPGLASQ 261

Query: 179 QSLNFNQNKPTFNLGGSTTSTGFGFGTNTATNNTSLFGQKPATNTFGLPTNTASPFGTST 238
            S                       G   A             N        +       
Sbjct: 262 FSATNLGTLLGSLNPYLGNIGATNIGLAAAGTG----------NIGSGNAVDSGGSALVG 311

Query: 239 SFGAQPAQTSTLFNSFNKPTPGFQFNTNTTPNTSLANNAFSGTL-FGNTAKPAGLFG-NS 296
           + G     T+   +                 N +    A SG L   N+   A  FG   
Sbjct: 312 AIGQTSQATANAGSV----------------NATGGAAAGSGNLGVANSGSAAAPFGIAG 355

Query: 297 TNTGGLFGNSTFGSNTGNANTFGQGLNSSFNLG----GTSNTFNNPLAPQAPPASN 348
            N   L G ++   N G  N  G GL      G    G   +            +N
Sbjct: 356 ANQAALGGANSGAGNFGLGNNPGGGLGGKPLGGTGNGGIGASGIGNTGYGNSGIAN 411



 Score = 33.7 bits (77), Expect = 0.61
 Identities = 25/185 (13%), Positives = 38/185 (20%), Gaps = 14/185 (7%)

Query: 43  LVSATGSFFGTTPQPSMFGTNTSTAQPATSLFGATENKPLFGQSTTTFGQPTNTFGTTNS 102
             SA     G T Q +    + +    A +  G          +        N      +
Sbjct: 305 GGSALVGAIGQTSQATANAGSVNATGGAAAGSGNLGVANSGSAAAPFGIAGANQAALGGA 364

Query: 103 LFGKPNTG--------------FGAAVSTQSNTFSFNPTTFGQPNTTAIKPFGAAAPQTS 148
             G  N G               G        +   N          A      +     
Sbjct: 365 NSGAGNFGLGNNPGGGLGGKPLGGTGNGGIGASGIGNTGYGNSGIANAGLSNAGSNNAGG 424

Query: 149 NLFATNTSQATTAFGAPQTSGFGSFGTNQNQSLNFNQNKPTFNLGGSTTSTGFGFGTNTA 208
                  +      G      F +       + N + N    N G   T    G  T+  
Sbjct: 425 ENAGNANNTGGGNVGLWNAGDFNAGAAGTGFTNNGSYNTGFLNFGNDNTGIFNGGNTSNG 484

Query: 209 TNNTS 213
           T N  
Sbjct: 485 TFNQG 489



 Score = 30.7 bits (69), Expect = 5.3
 Identities = 35/214 (16%), Positives = 59/214 (27%), Gaps = 13/214 (6%)

Query: 220 ATNTFGLPTNTASPFGTSTSFGAQPAQTSTLFNSFNKPTPGFQFNTNTTPNTSLANNAFS 279
           A  +   P N   P    T+     AQ S +    +  +     +        L N A  
Sbjct: 160 AAASALTPFNEPPPT---TNSSGLAAQASAVQALGDLASGITLASQVNLSLLELINPATL 216

Query: 280 GTLFGNTAKPAGLFGNSTNTGGLFGNSTFGSNTGNANTFGQGLNSSFNLGGTSNTFNNPL 339
             L        G+          FGN  FG N G+ N    GL S F     +N      
Sbjct: 217 SGLANGGTGNLGIGALQQAQNLGFGNVGFG-NLGSGNPGAPGLASQF---SATNLGTLLG 272

Query: 340 APQAPPASNSVNTQQILNLASLPYFSAPTLYKGVLPATSSSSKLDELILPAFQKNRINNK 399
           +      +       +    +    S   +  G      +  +  +    A       + 
Sbjct: 273 SLNPYLGNIGATNIGLAAAGTGNIGSGNAVDSGGSALVGAIGQTSQATANAG------SV 326

Query: 400 NVTVNSNASYNKIKIKPVGSTSASTKSLFDGLSS 433
           N T  + A    + +   GS +A         ++
Sbjct: 327 NATGGAAAGSGNLGVANSGSAAAPFGIAGANQAA 360


>gnl|CDD|222274 pfam13634, Nucleoporin_FG, Nucleoporin FG repeat region.  This
           family includes a number of FG repeats that are found in
           nucleoporin proteins. This family includes the yeast
           nucleoporins Nup116, Nup100, Nup49, Nup57 and Nup 145.
          Length = 106

 Score = 41.3 bits (97), Expect = 2e-04
 Identities = 18/98 (18%), Positives = 27/98 (27%), Gaps = 3/98 (3%)

Query: 152 ATNTSQATTAFGAPQTSGFGSFGTNQNQSLNFNQNKPTFNLGGSTTSTGFGFGTNTATNN 211
            T  + +   FG+   +G G FG N   +            G S++      G     + 
Sbjct: 4   TTAGASSGGLFGSAPATGGGLFGQNAANT---TPTSGGGLFGSSSSQATQPSGGGLFGSA 60

Query: 212 TSLFGQKPATNTFGLPTNTASPFGTSTSFGAQPAQTST 249
                       FG  T T +       FG   A    
Sbjct: 61  AQTSATTTGGGLFGSTTATTTTATGGGLFGNATAAQPA 98



 Score = 39.4 bits (92), Expect = 0.001
 Identities = 29/124 (23%), Positives = 43/124 (34%), Gaps = 22/124 (17%)

Query: 116 STQSNTFSFNPTTFGQPNTTAIKPFGAAAPQTSNLFATNTSQATTAFGAPQTSGFGSFGT 175
           S+ +   + +   FG    T    FG  A  T+         ++++     + G G FG+
Sbjct: 1   SSSTTAGASSGGLFGSAPATGGGLFGQNAANTTPTSGGGLFGSSSSQATQPSGG-GLFGS 59

Query: 176 NQNQSLNFNQNKPTFNLGGSTTSTGFGFGTNTATNNT----SLFGQKPATNTFGLPTNTA 231
               S              +TT+ G  FG+ TAT  T     LFG   A       T   
Sbjct: 60  AAQTS--------------ATTTGGGLFGSTTATTTTATGGGLFGNATAAQP---ATTGG 102

Query: 232 SPFG 235
             FG
Sbjct: 103 GLFG 106



 Score = 39.4 bits (92), Expect = 0.001
 Identities = 36/110 (32%), Positives = 47/110 (42%), Gaps = 9/110 (8%)

Query: 195 STTSTGFGFGTNTATNNTSLFGQKPATNTFGLPTNTASPFGTSTSFGAQPAQTSTLFNSF 254
           +  S+G  FG+  AT    LFGQ  A  T   PT+    FG+S+S   QP+    LF S 
Sbjct: 6   AGASSGGLFGSAPATG-GGLFGQNAANTT---PTSGGGLFGSSSSQATQPS-GGGLFGSA 60

Query: 255 NKPTPGFQFNTNTTPNTSLANNAFSGTLFGNTAKPAGLFGNSTNTGGLFG 304
            + +            T+    A  G LFGN          +T  GGLFG
Sbjct: 61  AQTSATTTGGGLFGSTTATTTTATGGGLFGNATAAQ----PATTGGGLFG 106



 Score = 39.4 bits (92), Expect = 0.001
 Identities = 24/106 (22%), Positives = 39/106 (36%), Gaps = 5/106 (4%)

Query: 228 TNTASPFGTSTSFGAQPAQTSTLFNSFNKPTPGFQFNTNTTPNTSLANNAFSGTLFGNTA 287
           ++T +   +   FG+ PA    LF      T           ++S A     G LFG+ A
Sbjct: 2   SSTTAGASSGGLFGSAPATGGGLFGQNAANTTPTSGGGLFGSSSSQATQPSGGGLFGSAA 61

Query: 288 KPAGLFGNSTNTGGLFGNS-TFGSNTGNANTFGQGLNSSFNLGGTS 332
           + +     +T  GGLFG++    +       FG    +     G  
Sbjct: 62  QTS----ATTTGGGLFGSTTATTTTATGGGLFGNATAAQPATTGGG 103



 Score = 36.3 bits (84), Expect = 0.013
 Identities = 19/105 (18%), Positives = 27/105 (25%), Gaps = 9/105 (8%)

Query: 52  GTTPQPSMFGTNTSTAQPATSLFGATENKPLFGQSTTTFGQPTNTF--GTTNSLFGKPNT 109
                  +FG+  +T      LFG              FG  ++     +   LFG    
Sbjct: 6   AGASSGGLFGSAPATG---GGLFGQNAANTTPTSGGGLFGSSSSQATQPSGGGLFGSAAQ 62

Query: 110 GFGAAVSTQSNTFSFNPTTFGQPNTTAIKPFGAAAPQTSNLFATN 154
                 +T +    F  TT      T    FG A           
Sbjct: 63  TS----ATTTGGGLFGSTTATTTTATGGGLFGNATAAQPATTGGG 103



 Score = 34.4 bits (79), Expect = 0.056
 Identities = 27/116 (23%), Positives = 36/116 (31%), Gaps = 14/116 (12%)

Query: 205 TNTATNNTSLFGQKPATNTFGLPTNTASPFGTSTSFGAQPAQTSTLFNSFNKPTPGFQFN 264
           T    ++  LFG  PAT            FG + +    P     LF S +         
Sbjct: 4   TTAGASSGGLFGSAPATGG--------GLFGQNAA-NTTPTSGGGLFGSSSSQATQPSGG 54

Query: 265 TNTTPNTSLANNAFSGTLFGNTAKPAGLFGNSTNTGGLFGNS-TFGSNTGNANTFG 319
                    +     G LFG+T         +   GGLFGN+      T     FG
Sbjct: 55  GLFGSAAQTSATTTGGGLFGSTTATT----TTATGGGLFGNATAAQPATTGGGLFG 106



 Score = 33.6 bits (77), Expect = 0.098
 Identities = 21/97 (21%), Positives = 31/97 (31%), Gaps = 1/97 (1%)

Query: 84  GQSTTTFGQPTNTFGTTNSLFGKPNTGFGAAVSTQSNTFSFNPTTFGQPNTTAIKPFGAA 143
             STT        FG+  +  G       A  +  S    F  ++      +    FG+A
Sbjct: 1   SSSTTAGASSGGLFGSAPATGGGLFGQNAANTTPTSGGGLFGSSSSQATQPSGGGLFGSA 60

Query: 144 APQTSNLFATNTSQATTAFGAPQTSGFGSFGTNQNQS 180
           A  ++         +TTA     T G G FG      
Sbjct: 61  AQTSATTTGGGLFGSTTATTTTATGG-GLFGNATAAQ 96



 Score = 32.8 bits (75), Expect = 0.19
 Identities = 26/104 (25%), Positives = 38/104 (36%), Gaps = 12/104 (11%)

Query: 45  SATGSFFGTTPQP--SMFGTNTSTAQPAT---SLFGATENKPLFGQSTTTFGQPTNTFG- 98
           +++G  FG+ P     +FG   +     T    LFG++ ++         FG    T   
Sbjct: 8   ASSGGLFGSAPATGGGLFG-QNAANTTPTSGGGLFGSSSSQATQPSGGGLFGSAAQTSAT 66

Query: 99  -TTNSLFGKPNTGFGAAVSTQSNTFSFNPTTFGQPNTTAIKPFG 141
            T   LFG          +T +    F   T  QP TT    FG
Sbjct: 67  TTGGGLFGSTTA----TTTTATGGGLFGNATAAQPATTGGGLFG 106



 Score = 30.1 bits (68), Expect = 2.1
 Identities = 19/89 (21%), Positives = 30/89 (33%), Gaps = 11/89 (12%)

Query: 34  GTLQGSKEFLVSATGSFFGTTPQPSMFGTNTSTAQPAT--SLFGATENKP-------LFG 84
           G+   +   L     +    T    +FG+++S A   +   LFG+            LFG
Sbjct: 15  GSAPATGGGLFGQNAANTTPTSGGGLFGSSSSQATQPSGGGLFGSAAQTSATTTGGGLFG 74

Query: 85  QSTTTFGQPTNT--FGTTNSLFGKPNTGF 111
            +T T    T    FG   +       G 
Sbjct: 75  STTATTTTATGGGLFGNATAAQPATTGGG 103



 Score = 29.4 bits (66), Expect = 3.4
 Identities = 24/100 (24%), Positives = 33/100 (33%), Gaps = 14/100 (14%)

Query: 264 NTNTTPNTSL--ANNAFSGTLFGNTA------KPAGLFGNSTN------TGGLFGNSTFG 309
            T    +  L  +  A  G LFG  A         GLFG+S++       GGLFG++   
Sbjct: 4   TTAGASSGGLFGSAPATGGGLFGQNAANTTPTSGGGLFGSSSSQATQPSGGGLFGSAAQT 63

Query: 310 SNTGNANTFGQGLNSSFNLGGTSNTFNNPLAPQAPPASNS 349
           S T           ++         F N  A Q       
Sbjct: 64  SATTTGGGLFGSTTATTTTATGGGLFGNATAAQPATTGGG 103


>gnl|CDD|222890 PHA02584, 34, long tail fiber, proximal subunit; Provisional.
          Length = 1229

 Score = 43.2 bits (102), Expect = 9e-04
 Identities = 71/316 (22%), Positives = 108/316 (34%), Gaps = 51/316 (16%)

Query: 115  VSTQSNTFSFNPTTFG-QPNTTAIKPFGAAAPQTSNLFATN------TSQATTAFGAPQT 167
            ++T SNTF  N T    QP T   K   A +P   N    N      T+Q +       +
Sbjct: 836  LATGSNTFVGNDTVGSTQPLTNYEKDGYAVSPYGLNYALQNYLPLKATAQNSLKLDGLDS 895

Query: 168  SGFGSFGTNQ--NQSLNFNQNKPTFNLGGSTTSTGFG-FGTNTATNNTSLFGQKPATNTF 224
            + F     +Q  N SL F +N    NL     S+    FG +   N+T            
Sbjct: 896  AQFIRRDIDQTVNGSLTFTKNT---NLSAPLVSSSTATFGGSVTANST------------ 940

Query: 225  GLPTNTASPFGTSTSFGAQPAQTSTLFNSFNKPTPGFQFNTNTTPNTSLANNAFSGTLFG 284
             L T   +  GT         +TS  F S N  T    FN + T +    N   +GTL  
Sbjct: 941  -LTTQN-TSNGTVVVVD----ETSIAFYSQNNTTGNIVFNIDGTVDPINVNA--NGTLNA 992

Query: 285  NTAKPAGLFGNSTNTGGLFGNSTFGSNTGNAN-------TFGQGLNSSFNLGGTSNTFNN 337
                     G +TN   ++      + T NA        T     +++  L   +     
Sbjct: 993  T--------GVATNGRAVYAEGGGIARTNNAARAITGGFTIRNDGSTTVFLLTAAGDQTG 1044

Query: 338  PLAPQAPPASNSVNTQQILNLASLPYFSAPTLYKGVLPATSS--SSKLDELILPAFQKNR 395
                      N+ N Q  +N   +   +  T+  G L   S   S         A   + 
Sbjct: 1045 GFNGLKSLIINNANGQVTINDNYIIN-AGGTIMSGGLTVNSRIRSQGTKASYTRAPTADT 1103

Query: 396  INNKNVTVNSNASYNK 411
            +   +V +N +A+YN+
Sbjct: 1104 VGFWSVDINDSATYNQ 1119



 Score = 33.6 bits (77), Expect = 0.78
 Identities = 62/317 (19%), Positives = 87/317 (27%), Gaps = 53/317 (16%)

Query: 61   GTNTSTAQPATSLFGATENKPLFGQSTTTFGQPTNTFGTTNSLFGKPNTGFGAAVSTQSN 120
            G+ T T            + PL   ST TFG       T NS     NT  G  V     
Sbjct: 909  GSLTFTKN-------TNLSAPLVSSSTATFGGSV----TANSTLTTQNTSNGTVVVVDET 957

Query: 121  TFSFNPTTFGQPNTTAIKPFGAAAPQTSNLFATNTSQATTAF---GAPQTSGFGSFGTNQ 177
            + +F        N               N+ A  T  AT       A    G G   TN 
Sbjct: 958  SIAFYSQNNTTGNIVFN---IDGTVDPINVNANGTLNATGVATNGRAVYAEGGGIARTNN 1014

Query: 178  -NQSLN-----FNQNKPTFNLGGSTTSTGFGFG-----------TNTATNNTSLFGQKPA 220
              +++       N    T  L  +      GF                 N+  +      
Sbjct: 1015 AARAITGGFTIRNDGSTTVFLLTAAGDQTGGFNGLKSLIINNANGQVTINDNYIINAGGT 1074

Query: 221  TNTFGLPTNTAS-PFGTSTSFGAQPAQTSTLFNSFNKPTPGFQFNTNTTPNTSLANNAFS 279
              + GL  N+     GT  S+   P   +  F S +       +N        +      
Sbjct: 1075 IMSGGLTVNSRIRSQGTKASYTRAPTADTVGFWSVDI-NDSATYNQFPGYFQMVTKTKSP 1133

Query: 280  GTL--FGNTAKPAGLFGNSTNTGGLFGNSTFGSNTGNANTFGQGLNSSFNLGGTSNTFNN 337
            GTL  FGNT        +        G +T  + T   N       +++   G   T  N
Sbjct: 1134 GTLTQFGNTLD------SLYQDWSPDGRTTRYTRTWQKN------KNAWTSFGEVYTKGN 1181

Query: 338  PLAPQ---APPASNSVN 351
                +   A PA     
Sbjct: 1182 KPTAEDIGAVPADGGTM 1198


>gnl|CDD|236952 PRK11669, pbpG, D-alanyl-D-alanine endopeptidase; Provisional.
          Length = 306

 Score = 38.1 bits (89), Expect = 0.020
 Identities = 26/64 (40%), Positives = 31/64 (48%), Gaps = 15/64 (23%)

Query: 63  NTSTAQPATSLFGATENKPLFGQ-STT-----TFGQPTNT--FGTTNSLFGKPN------ 108
           N STA+  T L  A++  PL GQ STT     TF +P  T  F  TN L  + N      
Sbjct: 172 NVSTARDLTKLLIASKQYPLIGQLSTTREKTATFRKPNYTLPFRNTNHLVYRDNWNIQLT 231

Query: 109 -TGF 111
            TGF
Sbjct: 232 KTGF 235


>gnl|CDD|221143 pfam11593, Med3, Mediator complex subunit 3 fungal.  Mediator is a
           large complex of up to 33 proteins that is conserved
           from plants to fungi to humans - the number and
           representation of individual subunits varying with
           species. It is arranged into four different sections, a
           core, a head, a tail and a kinase-activity part, and the
           number of subunits within each of these is what varies
           with species. Overall, Mediator regulates the
           transcriptional activity of RNA polymerase II but it
           would appear that each of the four different sections
           has a slightly different function. Mediator subunit
           Hrs1/Med3 is a physical target for Cyc8-Tup1, a yeast
           transcriptional co-repressor.
          Length = 381

 Score = 38.1 bits (88), Expect = 0.026
 Identities = 48/249 (19%), Positives = 72/249 (28%), Gaps = 32/249 (12%)

Query: 52  GTTPQPSMFGTNTSTAQPATSLFGATENKPLFGQSTTTFGQPTNTFGTTNSLFGKPNTGF 111
           GT  Q    G + S  + +     AT        S+T            N+    PNT  
Sbjct: 117 GTYNQLGNAGASASITKTSNGSDAAT-------TSSTANTPAAAKVLKANAA-SAPNTT- 167

Query: 112 GAAVSTQSNTFSFNPTTFGQPNTTAIKPFGAAAPQTSNLFATNTSQATTAFGAPQTSGFG 171
              V + + T + + TT   P TT  KP      + +   A   +QA+            
Sbjct: 168 -TGVGSAATTAAISATTATTPTTTQKKPRKPRQTKKTGPAAAAKAQASAQ---------- 216

Query: 172 SFGTNQNQSLNFNQNKPTFNLGGSTTSTGFGFGTNTATNNTSLFGQKPATNTFGLPTNTA 231
                Q Q+  +NQ        G   +T         T    L       N    P N  
Sbjct: 217 ----AQAQASAYNQM----GSLGVPQNTSMLAQIPNPTPLMQLLNGVSPNNAMASPLNNM 268

Query: 232 SPFGTSTSFGAQPAQTSTLFNSFNKPTPGFQFNTNTTPNTSLANNAFSGTLFGNTAKPAG 291
           SP       G Q        ++ N    G   N +   + +      +  +  N   PA 
Sbjct: 269 SPMRNLNQMGNQNNGGQMTPSANN----GNMNNQSRENSMNQGMTPSASMINLNNITPAN 324

Query: 292 LFGNSTNTG 300
           +   S N  
Sbjct: 325 ILNMSLNLA 333


>gnl|CDD|237846 PRK14876, PRK14876, conjugal transfer mating pair stabilization
           protein TraN; Provisional.
          Length = 928

 Score = 36.0 bits (83), Expect = 0.12
 Identities = 35/195 (17%), Positives = 60/195 (30%), Gaps = 18/195 (9%)

Query: 123 SFNPTTFGQPNTTAIKPFGAAAPQTSNLFATNTSQATTAFGAPQTSGFGSFGTNQNQSLN 182
           S+      Q  T   K   +     SN F +++        +  T   G F      S+N
Sbjct: 26  SYANGAQAQDITAVGKEAQSFGQNLSNSFKSSSGTVQDGTISMPTLKDGQFQMGGGSSVN 85

Query: 183 FNQNKPTFNLGGSTTSTGFGFGTNTATNN-TSLFGQKPATNTFGLPTNTA-SPFGTSTSF 240
            N   P  + G +     F        +    ++      +  G     +      S + 
Sbjct: 86  VNDLFPGTS-GTNNKPDYFPDANKPDVDGLQGIYDSGDDMDDVGNSAKGSLWSDANSANP 144

Query: 241 GAQPAQTSTLFNSFNKPTPGFQFNTNTTPNTSLANNAFSGTLFGNTAKPAGLFGNSTNTG 300
               A    L +S N+  P F    +  P  +L+   +      +    AG FG+ +   
Sbjct: 145 SISGAAYKVLLDSSNRSRPDF----SNDPVLNLSKKTYE-----DMDLIAGGFGDCSAE- 194

Query: 301 GLFGNSTFGSNTGNA 315
                +T   NT NA
Sbjct: 195 -----TTINQNTINA 204


>gnl|CDD|177303 PHA00735, PHA00735, hypothetical protein.
          Length = 808

 Score = 33.3 bits (76), Expect = 0.78
 Identities = 70/290 (24%), Positives = 104/290 (35%), Gaps = 58/290 (20%)

Query: 32  YVGTLQGSKEFLVSATGSF--FGTTPQPSMFGTNTSTA----QPATSLFGATENKPLFGQ 85
           Y   +Q  +  L SATG+   +  +P P +F +    +       T  F  + N P+   
Sbjct: 453 YTINIQLGQFSLGSATGTITIYPVSPLPVIFVSQYPHSVQLGTKVTITFQFSYNTPVANV 512

Query: 86  STTTFGQPTNTFG-------TTNSLFGKP----NTGFGAAVSTQSNT----FSFNPTTF- 129
           + + F Q T TF        T++S+        +   G  + TQS      F+F   TF 
Sbjct: 513 TMSAFTQTTTTFAWAYASIITSSSVVQFKAYWLSANDGILIITQSQNYLIPFNFTGLTFY 572

Query: 130 -GQPNTTAIKPFGAAAPQTSNLFATNTSQATTAFGAPQTSGFGSFGTN------------ 176
               NT  I     A   T N    N + ++   G     G G    N            
Sbjct: 573 NNSVNTIQINIVNNALQLTINGGTLNIANSSKVIGLGFYYGAGKLVLNWFFVSGIILQSA 632

Query: 177 -QNQ----SLNFNQNKPTFNLGGSTTSTGFGFGTNTATN------NTSLFGQKPATNTFG 225
             NQ        N +  T    G T S+GFG  T T T       +    G    T    
Sbjct: 633 TANQAYVILTGTNPSTLTQYTTGYTNSSGFGQVTITLTYTPYELIDIYWAG---VTKYVL 689

Query: 226 LPTNTASPFGTSTSFGAQPAQTSTLFNSFNKPTPGFQFNTNTTPNTSLAN 275
           L  +   P  ++T+       TSTL  ++ +P     F+ N +PN+SL N
Sbjct: 690 LNISVTQPTTSTTT----TVNTSTLNYNYTQP-----FSNNISPNSSLYN 730


>gnl|CDD|218696 pfam05692, Myco_haema, Mycoplasma haemagglutinin.  This family
           consists of several haemagglutinin sequences from
           Mycoplasma synoviae and Mycoplasma gallisepticum. The
           major plasma membrane proteins, pMGAs, of Mycoplasma
           gallisepticum are cell adhesin (hemagglutinin)
           molecules. It has been shown that the genetic
           determinants that code for the haemagglutinins are
           organised into a large family of genes and that only one
           of these genes is predominately expressed in any given
           strain.
          Length = 437

 Score = 32.7 bits (75), Expect = 1.2
 Identities = 33/149 (22%), Positives = 48/149 (32%), Gaps = 24/149 (16%)

Query: 146 QTSNLFATNTSQATTAF------GAPQTSGFGSFGTNQNQSLNFNQNKPTF--------- 190
           Q + L   N SQ +  +      G        +   N   S N N +  TF         
Sbjct: 279 QFTGLTTDNGSQTSDPYYLIGFIGGSGNRTVANSVMNVQNSPNVNGDNRTFTIYVNAPKA 338

Query: 191 ---NLGGSTTST-----GFGFGTNTATNNTSLFGQKPATNTFGLPT-NTASPFGTSTSFG 241
               + GS  S+      F   T  ++NN+         N   L T +TA+   T++S  
Sbjct: 339 GDYYISGSYLSSSNRNLKFSTSTTDSSNNSVTITVLSKNNWTTLGTFDTATTTTTNSSTV 398

Query: 242 AQPAQTSTLFNSFNKPTPGFQFNTNTTPN 270
               +T TL    NK       N    P 
Sbjct: 399 TNNKKTLTLKKGLNKIVISGGTNNGNAPY 427


>gnl|CDD|222260 pfam13609, Porin_4, Gram-negative porin. 
          Length = 310

 Score = 32.4 bits (74), Expect = 1.4
 Identities = 31/212 (14%), Positives = 51/212 (24%), Gaps = 20/212 (9%)

Query: 148 SNLFATNTSQATTAFGAPQTSGFGSFGTNQN--------QSLNFNQNKPTFNLGGSTTST 199
                 + +   T        GF   G   N        +    +          +    
Sbjct: 32  DGGATGSGATGETGLDNDSRVGFTGSGELDNGLTLGASFELEAGDNGGTGSFNRDAYVGL 91

Query: 200 GFGFGTNTATNNTSLFGQKPATNTFGLPTNTASPFGTSTSFGAQPAQTSTLFNSFNKPTP 259
              FGT T     +          +       S    S   G          NS    +P
Sbjct: 92  SGDFGTVTLGRQDTAADDAGVDYDYEGGGLGDSGSDGSGFDGDAGFTGRD-SNSVIYYSP 150

Query: 260 ---GFQFNTNTTPNTSLANNAFSGTLFGNTAK--PAGLFGNSTNTGGL---FGNSTFGSN 311
              GF    +         N  +G + G +      GL    +N  G           + 
Sbjct: 151 KFGGFTAGASYALGEDGNTNGNNGGVAGGSDDTDAYGLGAGYSNGVGFSVAAAYQQTDNA 210

Query: 312 TGNANTFGQGLN---SSFNLGGTSNTFNNPLA 340
            G+ + +G G +     F LG +    +   A
Sbjct: 211 GGDQDAWGLGASYDFGGFTLGASYADIDVDAA 242


>gnl|CDD|221296 pfam11894, DUF3414, Protein of unknown function (DUF3414).  This
            family of proteins are functionally uncharacterized. This
            protein is found in eukaryotes. Proteins in this family
            are typically between 764 to 2011 amino acids in length.
            This protein has a conserved LLG sequence motif.
          Length = 1612

 Score = 32.7 bits (75), Expect = 1.4
 Identities = 14/78 (17%), Positives = 31/78 (39%), Gaps = 15/78 (19%)

Query: 751  LVKEMNRNYVSSDISKNILNIDETMDYIESSTEALARHMGTDPYKVQLMKQCFEDTFDNY 810
            L+ ++  N ++S           T+ Y+ SS + L RH+ + P             +D  
Sbjct: 1015 LLYKLCSNPLTSGP---------TLRYLRSSNDFLFRHLASLP------FIDPNTLWDGL 1059

Query: 811  GFPMSVSREFYIENKAHY 828
             F ++   +F++ +    
Sbjct: 1060 SFALANDNKFFLSSSIGA 1077


>gnl|CDD|131542 TIGR02489, flgE_epsilon, flagellar hook protein FlgE, epsilon
           proteobacterial.  Members of this family are flagellar
           hook proteins, designated FlgE, as found in the epsilon
           subdivision of the Proteobacteria (Helicobacter,
           Wolinella, and Campylobacter). These proteins differ
           significantly in architecture from proteins designated
           FlgE in other lineages; the N-terminal and C-terminal
           domains are homologous, but members of this family only
           contain a large central domain that is surface-exposed
           and variable between strains.
          Length = 719

 Score = 32.2 bits (73), Expect = 1.9
 Identities = 36/152 (23%), Positives = 47/152 (30%), Gaps = 27/152 (17%)

Query: 116 STQSNTFSFNPTTFGQPNTTAIKPFGAA------APQTSNLFATNTSQATTAFGAPQTSG 169
            T+    S N T++   N T    F A        P T       + Q   A  +     
Sbjct: 430 DTEPKNLSLNVTSYSNKNVTTNVLFKAIMKALNTGPLTEGGSIKESEQLKLAAHSAGLDI 489

Query: 170 FGSFGTNQNQSLNFNQNKP-------------TFNLGGSTTSTGFGFGTNTATNNTSLFG 216
             S G+     + F +                 FN G       F  G     N+ SL G
Sbjct: 490 VDSLGSKHTLEVQFVKQGGNEWSFIIRVPEPAEFNGGSGEGPNIFEGGRARFNNDGSLAG 549

Query: 217 QKPATNTFGLPTNTASP-------FGTSTSFG 241
             P T  F  P N A+P       FGTS +F 
Sbjct: 550 MNPPTIQFS-PKNGAAPPQRIKLSFGTSGTFD 580


>gnl|CDD|218439 pfam05109, Herpes_BLLF1, Herpes virus major outer envelope
           glycoprotein (BLLF1).  This family consists of the BLLF1
           viral late glycoprotein, also termed gp350/220. It is
           the most abundantly expressed glycoprotein in the viral
           envelope of the Herpesviruses and is the major antigen
           responsible for stimulating the production of
           neutralising antibodies in vivo.
          Length = 830

 Score = 32.1 bits (72), Expect = 2.1
 Identities = 62/361 (17%), Positives = 106/361 (29%), Gaps = 38/361 (10%)

Query: 63  NTSTAQPATSLFGATENKPLFGQSTTTFGQPTNTFGTTNSLFGKPNTGFGAAVSTQSNTF 122
           + +TA P T         P    +  T G PT T  +T      P +      ++ +   
Sbjct: 444 HKTTAVPTTPSLPPASTGPTVSTADPTSGTPTGTTSSTLPEDTSPTS----RTTSATPNA 499

Query: 123 SFNPTTFGQPNTTAIKPFGAAAPQTSNLFATNTSQATTAFGAPQTSGFGSFGTNQNQSLN 182
           +        PN T+      +    +          TT   +P T   G+       S  
Sbjct: 500 TSPTPAVTTPNATSPTTQKTSDTPNATSPTPIVIGVTTTATSPPT---GTTSVPNATSPQ 556

Query: 183 FNQNKPTFNLGGSTTSTGFGFGTNTATNNTSLFGQKPATNTFGLPTNTAS---------- 232
             +  P  N      ++     T+  T      G  P +   G+P+++ S          
Sbjct: 557 VTEESPVNNTNTPVVTSAPSVLTSAVTTGQHGTGSSPTSQQPGIPSSSHSTPRSNSTSTT 616

Query: 233 -------PFGTSTSFGAQPAQTSTLFNSFNKPTPGFQFNTNTTPNTSLANNAFSGTLFGN 285
                  P G        P+  ST   S   P PG      TT   S   N+ +    G 
Sbjct: 617 PLLTSAHPTGGENITEETPSVPSTTHVSTLSPGPG----PGTTSQVSGPGNSSTSRYPGE 672

Query: 286 TAKPAGLFG-NSTNTGGLFGNSTFGSNTGNANTFGQGLNSSFNLGGTSNTFNNPLAPQAP 344
                G+   N+T+     G  T       A           N      + +  +A  +P
Sbjct: 673 VHVTEGMPNPNATSPSAPSGQKT-------AVPTVTSTGGKANSTTKETSGSTLMASTSP 725

Query: 345 PASNSVNTQQILNLASLPYFSAPTLYKGVLPATS--SSSKLDELILPAFQKNRINNKNVT 402
             +         N  +    S  +  +     TS   ++K   + +P  Q    +N ++ 
Sbjct: 726 HTNEGAFRTTPYNATTYLPPSTSSKLRPRWTFTSPPVTTKQATVPVPPTQHPDHSNLSML 785

Query: 403 V 403
           V
Sbjct: 786 V 786


>gnl|CDD|162022 TIGR00754, bfr, bacterioferritin.  Bacterioferritin, predominantly
           an iron-storage protein restricted to Bacteria, has also
           been designated cytochrome b1 and cytochrome
           b-557.Bacterioferritin is a homomultimer most species.
           In Neisseria gonorrhoeae, Synechocystis PCC6803,
           Magnetospirillum magnetotacticum, and Pseudomonas
           aeruginosa, two types of subunit are found in a
           heteromultimeric complex, with each species having one
           member of each type. At present, both types of subunit
           are including in this single model [Transport and
           binding proteins, Cations and iron carrying compounds].
          Length = 157

 Score = 30.9 bits (70), Expect = 2.1
 Identities = 17/57 (29%), Positives = 31/57 (54%), Gaps = 2/57 (3%)

Query: 746 LRNEILVKEMNRNYVSSDISKNIL-NIDETMDYIESSTEALARHMGTDPYKVQLMKQ 801
           L+  I   E  R+YVS D+ + IL + +E +D++E+  E L   +G + Y    + +
Sbjct: 101 LKEAIAYAEEVRDYVSRDLLEEILEDEEEHIDWLETQLE-LIDKLGLENYLQAQVSE 156


>gnl|CDD|220308 pfam09603, Fib_succ_major, Fibrobacter succinogenes major domain
           (Fib_succ_major).  This domain of about 175 to 200 amino
           acids is found, in from one to five copies, in over 50
           proteins in Fibrobacter succinogenes S85, an obligate
           anaerobe of the rumen. Many members of this family have
           an apparent lipoprotein signal sequence. Conserved
           cysteine residues, suggestive of disulfide bond
           formation, are also consistent with an extracytoplasmic
           location for this domain. This domain can also be found
           in small numbers of proteins in Chlorobium tepidum and
           Bacteroides thetaiotaomicron.
          Length = 181

 Score = 30.8 bits (70), Expect = 2.3
 Identities = 19/67 (28%), Positives = 24/67 (35%), Gaps = 6/67 (8%)

Query: 192 LGGSTTSTGFGFGTNTATNNTSLFGQKPA-----TNTFGLPTNTASPFGTSTSFGAQPAQ 246
            GG   +TG   G N    N++ F   PA       +F      A  F +ST  G   A 
Sbjct: 95  AGGKLKATGTWKGPNGGATNSTGFNALPAGYRDCDGSFFAEGKYAY-FWSSTESGTASAW 153

Query: 247 TSTLFNS 253
              L  S
Sbjct: 154 ARFLGYS 160


>gnl|CDD|204032 pfam08703, PLC-beta_C, PLC-beta C terminal.  This domain
           corresponds to the alpha helical C terminal domain of
           phospholipase C beta.
          Length = 181

 Score = 30.5 bits (69), Expect = 2.8
 Identities = 14/54 (25%), Positives = 21/54 (38%), Gaps = 12/54 (22%)

Query: 736 EKIS-PSPKQKLRNEILVKEMNRNYVSSDISKNILNIDETMDYIESSTEALARH 788
           E+I     K+K   E    E+NR++           I E +  I+   EA  R 
Sbjct: 78  ERIQEAKTKEKHAQEREKTEINRSH-----------IQEVVQSIKRLEEAQKRR 120


>gnl|CDD|225142 COG2233, UraA, Xanthine/uracil permeases [Nucleotide transport and
           metabolism].
          Length = 451

 Score = 31.4 bits (72), Expect = 2.9
 Identities = 16/47 (34%), Positives = 19/47 (40%), Gaps = 6/47 (12%)

Query: 273 LANNAFSG-TLFGNTAKPAGLFGNSTNT--GGLFG---NSTFGSNTG 313
            A    +G  L G      GL  +   T   GLFG   N+TF  N G
Sbjct: 266 TATGEITGRDLDGKPRLRRGLLADGLATLIAGLFGGFPNTTFAQNIG 312


>gnl|CDD|234371 TIGR03839, termin_org_P1, adhesin P1.  Members of this protein family
            are the major adhesin of the Mycoplasma terminal
            organelle. The protein is called adhesin P1, cytadhesin
            P1, P140, attachment protein, and MgPa, with locus names
            MG191 in Mycoplasma genitalium and MPN141 in M.
            pneumoniae. A conserved C-terminal region is shared by
            additional paralogs in M. pneumoniae and M.
            gallisepticum, as well as by the member of this family
            [Cell envelope, Surface structures, Cellular processes,
            Pathogenesis].
          Length = 1425

 Score = 31.7 bits (71), Expect = 3.2
 Identities = 40/185 (21%), Positives = 64/185 (34%), Gaps = 38/185 (20%)

Query: 273  LANNAFSGTLFGNTAKPAGLFGNSTNTGGLFGNSTFGSNTGNANTFGQGLNS-------- 324
            L N A + T +  T    G FG++T +    G     +N  +A     GLN         
Sbjct: 956  LYNTALNITYYNLTR---GDFGSNTPSTPGIGFKPEQNNDSDATLVSSGLNWTPQDVGNL 1012

Query: 325  --------SFNLGGTSNTFNNPLAPQA--PPASNSVNTQQILNLASLPYFSAPTLY---- 370
                     F LGG   TF +   P+        +   Q+ L  A  P+           
Sbjct: 1013 VVSTGTQQGFQLGGWYVTFTDFTPPRQGYLGLQLTGLDQRALIWAPRPWQDFKGSTTTAS 1072

Query: 371  ---------KGV--LPATSSSSKLDELILPAFQKNRINNKNVTVNSNASYNKIKIKPVGS 419
                      G    P  S++    +++   FQK + N  + + NS+  Y ++   PV  
Sbjct: 1073 GAVISNWDKAGYSSRPDDSTTFNTSKILPEPFQKYKPNTSSGSTNSS-PYLEL-TAPVNV 1130

Query: 420  TSAST 424
            TS  +
Sbjct: 1131 TSLDS 1135


>gnl|CDD|220819 pfam10599, Nup_retrotrp_bd, Retro-transposon transporting motif.
           This is the highly conserved C-terminal motif
           GRKIxxxxxRRKx of nucleoporins that plays a critical and
           unique role in the nuclear import of retro-transposons
           in both yeasts and higher organisms. It would appear
           that the arginine residues at positions 2 and 9-10
           constitute a bipartite nuclear localisation signal, with
           two basic peptide motifs separated by an interchangeable
           spacer sequence, that is crucial for the
           retro-transposon activity.
          Length = 101

 Score = 29.3 bits (65), Expect = 3.4
 Identities = 17/74 (22%), Positives = 25/74 (33%), Gaps = 10/74 (13%)

Query: 174 GTNQNQSLNFNQNKPTFNLGGSTTSTGFGFGTNTATN---------NTSLFGQKPATNTF 224
             N N + NF  +        ST ST F FG + +T          N   F   PA+   
Sbjct: 13  MGNSNTNFNFAGSNNQ-PHTSSTPSTPFTFGASPSTPANAASPSGSNPPAFSGSPASAPN 71

Query: 225 GLPTNTASPFGTST 238
                + +    S+
Sbjct: 72  QFNVGSNNSNMFSS 85


>gnl|CDD|218761 pfam05815, DUF844, Baculovirus protein of unknown function (DUF844). 
            This family consists of several Baculovirus sequences of
            between 350 and 380 residues long. The family has no
            known function.
          Length = 359

 Score = 30.7 bits (70), Expect = 4.0
 Identities = 20/91 (21%), Positives = 32/91 (35%), Gaps = 14/91 (15%)

Query: 1017 ENPKSIASKDPEQNIFQLVTQHKLHEAIVSAQQSDNFYLSGALSMLGCSEIGQAFMQAQL 1076
             N     +  P + +  L    +   + VSA  S +      +         +   Q  L
Sbjct: 80   ANAAPATASSPRRPVLNLFAPARAARSAVSATNSID------MKKY------RKACQKLL 127

Query: 1077 SHYV--NATSDSFINKDILKLMLLTSGQPLY 1105
             +Y   + TS  F  KDI+  M+  S  P Y
Sbjct: 128  QYYTSASTTSTEFKVKDIVMCMIYLSHSPKY 158


>gnl|CDD|227270 COG4934, COG4934, Predicted protease [Posttranslational modification,
            protein turnover, chaperones].
          Length = 1174

 Score = 31.3 bits (71), Expect = 4.0
 Identities = 77/490 (15%), Positives = 130/490 (26%), Gaps = 72/490 (14%)

Query: 82   LFGQSTTTFGQPTNTFGTTNSLFG---------KPNTGFGAAVSTQSNTFSFNPTTFG-- 130
            L    + +FG  T   G   S  G            T      +     F ++P+     
Sbjct: 601  LLAFGSLSFGG-TYWGGNFVSTGGNYNIVASNLSGMTYITFNYTIS---FIYSPSYLAPT 656

Query: 131  -QPNTTAIKPFGAAAPQTSNLFATNTSQATTAFGAP-------QTSGFGSFGTNQNQSLN 182
               N+ +   F             N         +           GF   G ++ Q   
Sbjct: 657  FLGNSYSYDYFLPNIGIMRPAVGVNMIGVYDNLVSLSVPILNVNPEGFAYPGLSEGQYR- 715

Query: 183  FNQNKPTFNLGGSTTSTGFGFGTN--TATNNTSLFGQKPATNTFGLPTNTAS-PFGTSTS 239
                     +G      G G  +N                +    +     S     +T 
Sbjct: 716  ------YSGIGPQYDYLGIGVMSNGSPVAQAVVFLISVNVSQLTPIIQYVISYAPPFTTG 769

Query: 240  FGAQPAQTSTLFNSFNKPTPGFQFNTNTTPNTSLANNAFSG---TLFGNTAKPAGLFGNS 296
                     T++ + N      + N   TP T   N  +SG          +   L    
Sbjct: 770  LFLSNGTAYTVYWNGNLIA---ESNGTLTPQTIQFNTTYSGSNTVTNQTIPQVGLLIPLF 826

Query: 297  TNTGGLFGNST--FGSNTGNANTFGQGLNSSFNLGGTSNTFNNPLAPQAPPASNSVNTQQ 354
                G + +S           N  G G  +   +G T   F    +    P S S N   
Sbjct: 827  KFVYGYYYSSAIATIDAKYVFNE-GNGPGAYIYVGSTPLYFF---SAIIYPNSLSYNIYV 882

Query: 355  ILNLASLPYFSAPTL----YKGVLPATSSSSKLDELILPAFQKNRINNKNVTVNSNASYN 410
            I ++A +P     TL       ++P TSS S           +       +  +      
Sbjct: 883  IGSIAIIPLPYNATLLEWVGPAIIPLTSSGSNFTFSFGYYVIQFPPGIYTINTSIPGLDP 942

Query: 411  KIKIKPVGSTSASTKSLFDGLSSDTDDSEKYFNMIKSAPKRLIILPK----KTKPMTLKV 466
               +    S + S       LSS                K      K         T  +
Sbjct: 943  YSSLINSKSGTVSNLQ-IYFLSS--------VPTSGLTGKSSDGGIKNFVIDVLVNTNGI 993

Query: 467  NNVTTAETDVTVVSNNSSTPVSSSTQLQTPVSSKLTLNVAELKENISPSVKVSDLKVIKS 526
            + +     +  V+++ S+  +S S+Q    +  K   N+   + NI      + +    S
Sbjct: 994  SAINNGTGNYYVIASVSNGTISFSSQ----IYGKDVYNITVAEGNI------TSVNSALS 1043

Query: 527  PLNTSIGSET 536
             L  S+ S T
Sbjct: 1044 NLIVSLSSTT 1053


>gnl|CDD|179543 PRK03137, PRK03137, 1-pyrroline-5-carboxylate dehydrogenase;
           Provisional.
          Length = 514

 Score = 31.1 bits (71), Expect = 4.1
 Identities = 11/24 (45%), Positives = 13/24 (54%), Gaps = 2/24 (8%)

Query: 830 GGSLFKTGNFIQPTV--DLSPSAR 851
           G      G FIQPT+  D+ P AR
Sbjct: 387 GEGDDSKGYFIQPTIFADVDPKAR 410


>gnl|CDD|131633 TIGR02584, cas_NE0113, CRISPR-associated protein, NE0113 family.
            Members of this minor CRISPR-associated (Cas) protein
            family are found in cas gene clusters in Vibrio
            vulnificus YJ016, Nitrosomonas europaea ATCC 19718,
            Mannheimia succiniciproducens MBEL55E, and
            Verrucomicrobium spinosum [Mobile and extrachromosomal
            element functions, Other].
          Length = 209

 Score = 30.2 bits (68), Expect = 4.5
 Identities = 20/73 (27%), Positives = 30/73 (41%), Gaps = 13/73 (17%)

Query: 1003 RKDNLTEWLESVMEENPKSIASKDPEQN------IFQLVTQ------HKLHEAIVSAQQS 1050
              D L  +L    +  P +   + P  N      I Q V        H+LH +I   +++
Sbjct: 73   PFDELRIYLIPTGQRKPLADI-RTPADNEAAANFIVQTVAPLCAAQDHQLHASIAGGRKT 131

Query: 1051 DNFYLSGALSMLG 1063
              FYL  ALS+ G
Sbjct: 132  MGFYLGYALSLFG 144


>gnl|CDD|187817 cd09686, Csx1_III-U, CRISPR/Cas system-associated protein Csx1.
            CRISPR (Clustered Regularly Interspaced Short Palindromic
            Repeats) and associated Cas proteins comprise a system
            for heritable host defense by prokaryotic cells against
            phage and other foreign DNA; Protein of this family often
            fused to HTH domain; Some proteins could have an
            additional fusion with RecB-family nuclease domain; Core
            domain appears to have a Rossmann-like fold; loosely
            associated with CRISPR/Cas systems; also known as NE0113
            family.
          Length = 209

 Score = 30.2 bits (68), Expect = 4.8
 Identities = 20/73 (27%), Positives = 30/73 (41%), Gaps = 13/73 (17%)

Query: 1003 RKDNLTEWLESVMEENPKSIASKDPEQN------IFQLVTQ------HKLHEAIVSAQQS 1050
              D L  +L    +  P +   + P  N      I Q V        H+LH +I   +++
Sbjct: 73   PFDELRIYLIPTGQRKPLADI-RTPADNEAAANFIVQTVAPLCAAQDHQLHASIAGGRKT 131

Query: 1051 DNFYLSGALSMLG 1063
              FYL  ALS+ G
Sbjct: 132  MGFYLGYALSLFG 144


>gnl|CDD|224544 COG1629, CirA, Outer membrane receptor proteins, mostly Fe
           transport [Inorganic ion transport and metabolism].
          Length = 768

 Score = 30.6 bits (68), Expect = 5.2
 Identities = 40/228 (17%), Positives = 62/228 (27%), Gaps = 19/228 (8%)

Query: 86  STTTFGQPTNTFGTTNSLFGKP--NTGFGAAVSTQSNTFSFNPTTFGQPNTTAIKPFGAA 143
                    N      SL  KP  +T      S   + + F+   +G P          A
Sbjct: 237 GGRDGDNLGNRTAAAPSLTWKPDDDTRLTFQGSYNKDNYGFDRGGYGLPEGGT----VFA 292

Query: 144 APQTSNLFATNTSQATTAFGAPQTSGFGSFGTNQN--QSLNFNQNKPTFNLGGSTTSTGF 201
            P    L   + ++   A            G N N          +   +      +   
Sbjct: 293 PPAGGYLGDPDDNETNPATARLGLDFEHDLGDNDNWTLRAGLGYAQLAKSANSDYATGLT 352

Query: 202 GFGTNTATNNTSLFGQKPATNTFGLPTNTASPFGT-----STSFGAQPAQTSTLFNSFNK 256
              T           Q    +TFGL T+  + F T        FGA  ++T         
Sbjct: 353 SLDTGRTATRRYGADQDLDLDTFGLDTDLEAEFATGAIGHELVFGADYSRTDNSAGVDR- 411

Query: 257 PTPGFQFNTNTTPNTSLANNAFSGTLFGNTAKPAGLFGNSTNTGGLFG 304
                    N +P   + N  +    F   A P     N+ +T GL+ 
Sbjct: 412 ----GYGTGNGSPPLDIFNPVYGNPAFVLAALPT-TSTNTQDTLGLYA 454


>gnl|CDD|221318 pfam11925, DUF3443, Protein of unknown function (DUF3443).  This
           family of proteins are functionally uncharacterized.
           This protein is found in bacteria. Proteins in this
           family are typically between 400 to 434 amino acids in
           length. This protein has two conserved sequence motifs:
           NPV and DNNG.
          Length = 369

 Score = 30.4 bits (69), Expect = 5.4
 Identities = 34/145 (23%), Positives = 54/145 (37%), Gaps = 18/145 (12%)

Query: 188 PTFNLGGSTTSTG-FGFGTNTATNNTSLFGQKPATNTFGLPTNTASPFGTSTSFGAQPAQ 246
           P  +  G+ ++TG   FG  T +NN         T++ G    T    G S +     + 
Sbjct: 206 PAVSATGAASATGSLIFGIGTQSNNALPGATVLTTDSNG--DFTTVFNGRSFTASFIDSG 263

Query: 247 TSTLF---NSFNKPTPGFQFNTNTTPNTSLANNAFSGTLFG-NTAKPAGLFGNSTNTGGL 302
           ++ LF   +S    +    +   T+  T       S TL G N A  A +  +  N   L
Sbjct: 264 SNGLFFPDSSIPTCSSNTGWYCPTSTLT------LSATLTGTNGAASATVSFSIANADTL 317

Query: 303 FGNSTF-----GSNTGNANTFGQGL 322
           F          G   G+++ F  GL
Sbjct: 318 FSTGNAAFNDLGGPFGSSSMFDWGL 342


>gnl|CDD|225483 COG2931, COG2931, RTX toxins and related Ca2+-binding proteins
           [Secondary metabolites biosynthesis, transport, and
           catabolism].
          Length = 510

 Score = 30.3 bits (68), Expect = 6.0
 Identities = 53/315 (16%), Positives = 74/315 (23%), Gaps = 48/315 (15%)

Query: 60  FGTNTSTAQ----PATSLFGATENKPLFGQSTTTF------GQPTNTFGTTNSLFGKPNT 109
            GTN S          +L G   N  L G +                    ++L G    
Sbjct: 81  NGTNGSDTLRGGAGNDALTGGAGNDYLDGGAGNDTLFGGDGNDALVGGAGNDTLDGGAGA 140

Query: 110 GFGAAVSTQSNTFSFNPTTF--GQPNTTAIKPFGAAAPQTSNLFATNTSQATTAFGAPQT 167
                  T ++T          G  N T        +     +        T   G    
Sbjct: 141 NDTLDGGTGNDTLQNGNDILIGGSGNDT----LTGNSGADDRVINGGAGDDTILGGGNGD 196

Query: 168 SGFGSFGTNQNQSLNF-NQNKPTFNLGGSTTSTGFGFGTNTATNNTSLFGQK-------- 218
                 G+  N +L+  N        G        G G           G          
Sbjct: 197 DTVFGGGSGGNATLDGGNGGDSLITGGSGADVAAGGNGAVLIDGVVFADGTTWANLFLQA 256

Query: 219 -------PATNTFGLPTNTASPFGTS-TSFGAQPAQTSTLFNSFNKPTPGFQFNTNTTPN 270
                      T G+     +  G +        A   TL         G         N
Sbjct: 257 LAIVGGAGNDFTVGIGNLNDTLKGGAGNDTLLGGAGNDTLTIG----GGGNDTLDGGAGN 312

Query: 271 TSLANNAFSGTLFGNT--AKPAGLFGNSTNTGGLF---------GNSTFGSNTGNANTFG 319
            +L  +    T++G        G  GN T  G            GN T   N GN    G
Sbjct: 313 DTLDFSGGDDTIYGGAGNDTLDGGAGNDTLAGNAGALALLNGGDGNDTISGNDGNDTLIG 372

Query: 320 QGLNSSFNLGGTSNT 334
              N + + G  S+T
Sbjct: 373 GAGNDTLSGGAGSDT 387


>gnl|CDD|227614 COG5295, Hia, Autotransporter adhesin [Intracellular trafficking
           and secretion / Extracellular structures].
          Length = 715

 Score = 30.2 bits (68), Expect = 8.0
 Identities = 33/183 (18%), Positives = 53/183 (28%), Gaps = 15/183 (8%)

Query: 141 GAAAPQTSNLFATNTSQATTAFG--APQTSGFGSFGTNQNQSLNFNQ-NKPTFNLGGSTT 197
           G  A   S+  A  T+    A G  A  ++   S    +         N       G+  
Sbjct: 130 GTGAASASSTLAAGTTIRAVATGKTAVLSTASASGVQAKATDSALGFENTAVDQQAGAAG 189

Query: 198 STGFGFGTNTATNNTSLFGQKPATNTFGLPTNTASPFGTSTSFGAQPAQTSTLFNSFNKP 257
                   +T    T+  G           T+  +    +       A T+++ N+    
Sbjct: 190 LGNAAENADTNEARTNDLGDS--------LTDDGNQSTIAVGSVGLKAGTNSVANAAGNS 241

Query: 258 TP---GFQFNTNTTPNTSLANNAFSGTLFGNTAKPAGLFGNSTNTGGLFGNSTFGSNTGN 314
           T    G    +  T  T+  + A SG     TA        S  +G     +  GSN   
Sbjct: 242 TDAAIGSGLQSVLTAATAQGSAALSGGN-NGTAVIGKNNTLSGTSGANSIAAAAGSNGDT 300

Query: 315 ANT 317
           A  
Sbjct: 301 AIV 303


>gnl|CDD|133139 cd05472, cnd41_like, Chloroplast Nucleoids DNA-binding Protease,
           catalyzes the degradation of ribulose-1,5-bisphosphate
           carboxylase/oxygenase.  Chloroplast Nucleoids
           DNA-binding Protease catalyzes the degradation of
           ribulose-1,5-bisphosphate carboxylase/oxygenase
           (Rubisco) in senescent leaves of tobacco. Antisense
           tobacco with reduced amount of CND41 maintained green
           leaves and constant protein levels, especially Rubisco. 
           CND41 has DNA-binding as well as aspartic protease
           activities. The pepsin-like aspartic protease domain is
           located at the C-terminus of the protein. The enzyme is
           characterized by having two aspartic protease catalytic
           site motifs, the Asp-Thr-Gly-Ser in the N-terminal and
           Asp-Ser-Gly-Ser in the C-terminal region. Aspartic
           proteases are bilobal enzymes, each lobe contributing a
           catalytic Asp residue, with an extended active site
           cleft localized between the two lobes of the molecule.
           One lobe may be evolved from the other through ancient
           gene-duplication event. This family of aspartate
           proteases is classified by MEROPS as the peptidase
           family A1 (pepsin A, clan AA).
          Length = 299

 Score = 29.9 bits (68), Expect = 8.2
 Identities = 25/86 (29%), Positives = 32/86 (37%), Gaps = 15/86 (17%)

Query: 189 TFNLGGSTTSTGFGFGTNTATNNTSLFGQKPATNTFGL-------PTNTASPFGTSTSF- 240
           T  LG S    GF FG     +N  LFG   A    GL       P+ TAS +G   S+ 
Sbjct: 54  TLTLGSSDVVPGFAFG--CGHDNEGLFGG--AAGLLGLGRGKLSLPSQTASSYGGVFSYC 109

Query: 241 --GAQPAQTSTL-FNSFNKPTPGFQF 263
                 + +  L F +      G  F
Sbjct: 110 LPDRSSSSSGYLSFGAAASVPAGASF 135


>gnl|CDD|237019 PRK11907, PRK11907, bifunctional 2',3'-cyclic nucleotide
           2'-phosphodiesterase/3'-nucleotidase precursor protein;
           Reviewed.
          Length = 814

 Score = 29.8 bits (67), Expect = 9.1
 Identities = 23/89 (25%), Positives = 40/89 (44%), Gaps = 5/89 (5%)

Query: 459 TKPMT-LKVNNVTTAETDVTVVSNNSSTPVSSSTQLQTPVSSKLTLNVAELKENISPSVK 517
           T P T  +    T  E+D T  ++N+ TPV+++T  + P SS      AE  +  S +  
Sbjct: 34  TTPATSTEAEQTTPVESDATEEADNTETPVAATTAAEAPSSS----ETAETSDPTSEATD 89

Query: 518 VSDLKVIKSPLNTSIGSETSEGDTFDFKL 546
            +  +        +  S+  EG T D ++
Sbjct: 90  TTTSEARTVTPAATETSKPVEGQTVDVRI 118


>gnl|CDD|226406 COG3889, COG3889, Predicted solute binding protein [General
           function prediction only].
          Length = 872

 Score = 29.8 bits (67), Expect = 9.2
 Identities = 24/109 (22%), Positives = 36/109 (33%), Gaps = 4/109 (3%)

Query: 177 QNQSLNFNQNKPTFNLGGSTTSTG-FGFGTNTATNNTSLFGQKPATNTFGLPTNTASPFG 235
             + +      P    G S T++G F      +   T       A +   +   T S   
Sbjct: 747 TTKRVKVRIMDPASGTGTSITTSGTFTAEVPQSPTKTETTLSYSAYSNTSILIETTSVVI 806

Query: 236 TSTSFGAQPAQTSTLFNSFNKPTPGFQFNTNTTPNTSLANNAFSGTLFG 284
           T T    Q   +S        PT   Q +T+TT  TS +     G + G
Sbjct: 807 TKTVTQTQTTTSSPSPTQTTSPT---QTSTSTTTTTSPSQTTTGGGICG 852


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.312    0.128    0.367 

Gapped
Lambda     K      H
   0.267   0.0707    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 54,076,098
Number of extensions: 5142279
Number of successful extensions: 3659
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3546
Number of HSP's successfully gapped: 118
Length of query: 1121
Length of database: 10,937,602
Length adjustment: 107
Effective length of query: 1014
Effective length of database: 6,191,724
Effective search space: 6278408136
Effective search space used: 6278408136
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 64 (28.5 bits)