RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy664
         (379 letters)



>gnl|CDD|227396 COG5064, SRP1, Karyopherin (importin) alpha [Intracellular
           trafficking and secretion].
          Length = 526

 Score =  418 bits (1077), Expect = e-144
 Identities = 197/378 (52%), Positives = 253/378 (66%), Gaps = 25/378 (6%)

Query: 2   QTRMVIDAGAVPVFIQLLLSPHEDQVTHPSVETMS-------------LDNNILYP---L 45
           QT++V+DAGAVP+FIQLL S  ED V   +V  +              L    L P   L
Sbjct: 149 QTKVVVDAGAVPLFIQLLSST-EDDVREQAVWALGNIAGDSEGCRDYVLQCGALEPLLGL 207

Query: 46  IDKPKNRLSMVRNSVWVLSNLCRGKTPPPDFAKVAPALACLSRLLFHADPDVLADACWAI 105
           +      +SM+RN+ W LSNLCRGK PPPD++ ++ AL  L++L++  DP+VL DACWAI
Sbjct: 208 LLSSAIHISMLRNATWTLSNLCRGKNPPPDWSNISQALPILAKLIYSRDPEVLVDACWAI 267

Query: 106 SYLSDGPNEKIQAVIDAGVCRRLVELLMHDQHKVVSAALRAVGNIVTGDDQQTQVILNCS 165
           SYLSDGPNEKIQAV+D G+  RLVELL H+  K+ + ALR+VGNIVTG D QTQVI+NC 
Sbjct: 268 SYLSDGPNEKIQAVLDVGIPGRLVELLSHESAKIQTPALRSVGNIVTGSDDQTQVIINCG 327

Query: 166 ALMCLLHLIQSPKESIRKEACWAVSNITAGNRQQIQAVIDANIFPSLIEILQKAEFKTRK 225
           AL     L+ SPKE+IRKEACW +SNITAGN +QIQAVIDAN+ P LI +L  AE+K +K
Sbjct: 328 ALKAFRSLLSSPKENIRKEACWTISNITAGNTEQIQAVIDANLIPPLIHLLSSAEYKIKK 387

Query: 226 EAAWAITNATSGGT--PDQIRYLIQQGCIEPFCELLTLLDAKIIQVALNGLENILKLGE- 282
           EA WAI+NATSGG   PD IRYL+ QG I+P C+LL ++D KII+VAL+ +ENILK+GE 
Sbjct: 388 EACWAISNATSGGLNRPDIIRYLVSQGFIKPLCDLLDVVDNKIIEVALDAIENILKVGEQ 447

Query: 283 EEAKQTGSVNPYVVLIEECYGLDKIEFLQSHENIEIYQKAFDIIEHYFGSEEEDTRVAPC 342
           +  +   ++N Y V +E+  G+D I  LQ   N  IY KA+ IIE +FG E+    +AP 
Sbjct: 448 DRLRYGKNINIYAVYVEKAGGMDAIHGLQDSVNRTIYDKAYSIIEKFFGEEDAVDELAPE 507

Query: 343 VTHDASGAQEFTFAGATQ 360
              +      FTF     
Sbjct: 508 TAGNT-----FTFGSNVN 520



 Score = 78.0 bits (192), Expect = 1e-15
 Identities = 56/216 (25%), Positives = 98/216 (45%), Gaps = 4/216 (1%)

Query: 66  LCRGKTPPPDFAKVAPALACLSRLLFHADPDVLA-DACWAISYLSDGPNEKIQAVIDAGV 124
           L +  +PP      A  +      +     D+L  +A WA++ ++ G  ++ + V+DAG 
Sbjct: 99  LSKETSPPIQPVIDAGVVPRFVEFMDEIQRDMLQFEAAWALTNIASGTTQQTKVVVDAGA 158

Query: 125 CRRLVELLMHDQHKVVSAALRAVGNIVTGDDQQTQVILNCSALMCLLHLIQS--PKESIR 182
               ++LL   +  V   A+ A+GNI    +     +L C AL  LL L+ S     S+ 
Sbjct: 159 VPLFIQLLSSTEDDVREQAVWALGNIAGDSEGCRDYVLQCGALEPLLGLLLSSAIHISML 218

Query: 183 KEACWAVSNITAGNRQQIQAVIDANIFPSLIEILQKAEFKTRKEAAWAITNATSGGTPDQ 242
           + A W +SN+  G          +   P L +++   + +   +A WAI+   S G  ++
Sbjct: 219 RNATWTLSNLCRGKNPPPDWSNISQALPILAKLIYSRDPEVLVDACWAIS-YLSDGPNEK 277

Query: 243 IRYLIQQGCIEPFCELLTLLDAKIIQVALNGLENIL 278
           I+ ++  G      ELL+   AKI   AL  + NI+
Sbjct: 278 IQAVLDVGIPGRLVELLSHESAKIQTPALRSVGNIV 313



 Score = 75.3 bits (185), Expect = 7e-15
 Identities = 45/169 (26%), Positives = 77/169 (45%), Gaps = 4/169 (2%)

Query: 106 SYLSDGPNEKIQAVIDAGVCRRLVELLMHDQHKVVS-AALRAVGNIVTGDDQQTQVILNC 164
             LS   +  IQ VIDAGV  R VE +   Q  ++   A  A+ NI +G  QQT+V+++ 
Sbjct: 97  KLLSKETSPPIQPVIDAGVVPRFVEFMDEIQRDMLQFEAAWALTNIASGTTQQTKVVVDA 156

Query: 165 SALMCLLHLIQSPKESIRKEACWAVSNITAGNRQQIQAVIDANIFPSLIEILQ--KAEFK 222
            A+   + L+ S ++ +R++A WA+ NI   +      V+       L+ +L        
Sbjct: 157 GAVPLFIQLLSSTEDDVREQAVWALGNIAGDSEGCRDYVLQCGALEPLLGLLLSSAIHIS 216

Query: 223 TRKEAAWAITNATSGGTPDQIRYLIQQGCIEPFCELLTLLDAKIIQVAL 271
             + A W ++N   G  P      I Q  +    +L+   D +++  A 
Sbjct: 217 MLRNATWTLSNLCRGKNPPPDWSNISQA-LPILAKLIYSRDPEVLVDAC 264


>gnl|CDD|237987 cd00020, ARM, Armadillo/beta-catenin-like repeats. An approximately
           40 amino acid long tandemly repeated sequence motif
           first identified in the Drosophila segment polarity gene
           armadillo; these repeats were also found in the
           mammalian armadillo homolog beta-catenin, the junctional
           plaque protein plakoglobin, the adenomatous polyposis
           coli (APC) tumor suppressor protein, and a number of
           other proteins. ARM has been implicated in mediating
           protein-protein interactions, but no common features
           among the target proteins recognized by the ARM repeats
           have been identified; related to the HEAT domain; three
           consecutive copies of the repeat are represented by this
           alignment model.
          Length = 120

 Score =  109 bits (274), Expect = 3e-29
 Identities = 42/109 (38%), Positives = 65/109 (59%)

Query: 86  LSRLLFHADPDVLADACWAISYLSDGPNEKIQAVIDAGVCRRLVELLMHDQHKVVSAALR 145
           L  LL  +D +V  +A WA+S LS G N+ IQAV++AG    LV+LL  +  +VV AAL 
Sbjct: 12  LVSLLSSSDENVQREAAWALSNLSAGNNDNIQAVVEAGGLPALVQLLKSEDEEVVKAALW 71

Query: 146 AVGNIVTGDDQQTQVILNCSALMCLLHLIQSPKESIRKEACWAVSNITA 194
           A+ N+  G +    ++L    +  L++L+ S  E I+K A  A+SN+ +
Sbjct: 72  ALRNLAAGPEDNKLIVLEAGGVPKLVNLLDSSNEDIQKNATGALSNLAS 120



 Score =  103 bits (259), Expect = 4e-27
 Identities = 37/120 (30%), Positives = 58/120 (48%)

Query: 117 QAVIDAGVCRRLVELLMHDQHKVVSAALRAVGNIVTGDDQQTQVILNCSALMCLLHLIQS 176
           +AVI AG    LV LL      V   A  A+ N+  G++   Q ++    L  L+ L++S
Sbjct: 1   EAVIQAGGLPALVSLLSSSDENVQREAAWALSNLSAGNNDNIQAVVEAGGLPALVQLLKS 60

Query: 177 PKESIRKEACWAVSNITAGNRQQIQAVIDANIFPSLIEILQKAEFKTRKEAAWAITNATS 236
             E + K A WA+ N+ AG       V++A   P L+ +L  +    +K A  A++N  S
Sbjct: 61  EDEEVVKAALWALRNLAAGPEDNKLIVLEAGGVPKLVNLLDSSNEDIQKNATGALSNLAS 120



 Score = 92.0 bits (229), Expect = 9e-23
 Identities = 36/119 (30%), Positives = 66/119 (55%), Gaps = 1/119 (0%)

Query: 159 QVILNCSALMCLLHLIQSPKESIRKEACWAVSNITAGNRQQIQAVIDANIFPSLIEILQK 218
           + ++    L  L+ L+ S  E++++EA WA+SN++AGN   IQAV++A   P+L+++L+ 
Sbjct: 1   EAVIQAGGLPALVSLLSSSDENVQREAAWALSNLSAGNNDNIQAVVEAGGLPALVQLLKS 60

Query: 219 AEFKTRKEAAWAITNATSGGTPDQIRYLIQQGCIEPFCELLTLLDAKIIQVALNGLENI 277
            + +  K A WA+ N  +G   D    +++ G +     LL   +  I + A   L N+
Sbjct: 61  EDEEVVKAALWALRNLAAGP-EDNKLIVLEAGGVPKLVNLLDSSNEDIQKNATGALSNL 118



 Score = 82.0 bits (203), Expect = 4e-19
 Identities = 32/127 (25%), Positives = 63/127 (49%), Gaps = 11/127 (8%)

Query: 201 QAVIDANIFPSLIEILQKAEFKTRKEAAWAITNATSGGTPDQIRYLIQQGCIEPFCELLT 260
           +AVI A   P+L+ +L  ++   ++EAAWA++N ++G   D I+ +++ G +    +LL 
Sbjct: 1   EAVIQAGGLPALVSLLSSSDENVQREAAWALSNLSAG-NNDNIQAVVEAGGLPALVQLLK 59

Query: 261 LLDAKIIQVALNGLENILKLGEEEAKQTGSVNPYVVLIEECYGLDKIEFLQSHENIEIYQ 320
             D ++++ AL  L N+    E+             ++ E  G+ K+  L    N +I +
Sbjct: 60  SEDEEVVKAALWALRNLAAGPEDNKL----------IVLEAGGVPKLVNLLDSSNEDIQK 109

Query: 321 KAFDIIE 327
            A   + 
Sbjct: 110 NATGALS 116



 Score = 45.8 bits (109), Expect = 2e-06
 Identities = 24/108 (22%), Positives = 41/108 (37%), Gaps = 29/108 (26%)

Query: 2   QTRMVIDAGAVPVFIQLLLSPHEDQVTHPSVETMSLDNNILYPLIDKPKNRLSMVRNSVW 61
             + V++AG +P  +QLL S  E+                             +V+ ++W
Sbjct: 41  NIQAVVEAGGLPALVQLLKSEDEE-----------------------------VVKAALW 71

Query: 62  VLSNLCRGKTPPPDFAKVAPALACLSRLLFHADPDVLADACWAISYLS 109
            L NL  G          A  +  L  LL  ++ D+  +A  A+S L+
Sbjct: 72  ALRNLAAGPEDNKLIVLEAGGVPKLVNLLDSSNEDIQKNATGALSNLA 119



 Score = 27.7 bits (62), Expect = 5.3
 Identities = 15/68 (22%), Positives = 23/68 (33%), Gaps = 29/68 (42%)

Query: 1   MQTRMVIDAGAVPVFIQLLLSPHEDQVTHPSVETMSLDNNILYPLIDKPKNRLSMVRNSV 60
               +V++AG VP  + LL S +ED                             + +N+ 
Sbjct: 82  DNKLIVLEAGGVPKLVNLLDSSNED-----------------------------IQKNAT 112

Query: 61  WVLSNLCR 68
             LSNL  
Sbjct: 113 GALSNLAS 120


>gnl|CDD|201276 pfam00514, Arm, Armadillo/beta-catenin-like repeat.  Approx. 40
           amino acid repeat. Tandem repeats form super-helix of
           helices that is proposed to mediate interaction of
           beta-catenin with its ligands. CAUTION: This family does
           not contain all known armadillo repeats.
          Length = 41

 Score = 47.4 bits (114), Expect = 1e-07
 Identities = 14/41 (34%), Positives = 25/41 (60%)

Query: 154 DDQQTQVILNCSALMCLLHLIQSPKESIRKEACWAVSNITA 194
             +  Q ++   A+  L+ L+ SP E +++EA WA+SN+ A
Sbjct: 1   SPENKQAVIEAGAVPPLVQLLSSPDEEVQEEAAWALSNLAA 41



 Score = 44.8 bits (107), Expect = 9e-07
 Identities = 14/41 (34%), Positives = 27/41 (65%)

Query: 196 NRQQIQAVIDANIFPSLIEILQKAEFKTRKEAAWAITNATS 236
           + +  QAVI+A   P L+++L   + + ++EAAWA++N  +
Sbjct: 1   SPENKQAVIEAGAVPPLVQLLSSPDEEVQEEAAWALSNLAA 41



 Score = 42.8 bits (102), Expect = 5e-06
 Identities = 15/41 (36%), Positives = 20/41 (48%)

Query: 112 PNEKIQAVIDAGVCRRLVELLMHDQHKVVSAALRAVGNIVT 152
             E  QAVI+AG    LV+LL     +V   A  A+ N+  
Sbjct: 1   SPENKQAVIEAGAVPPLVQLLSSPDEEVQEEAAWALSNLAA 41



 Score = 39.4 bits (93), Expect = 8e-05
 Identities = 18/67 (26%), Positives = 24/67 (35%), Gaps = 29/67 (43%)

Query: 2  QTRMVIDAGAVPVFIQLLLSPHEDQVTHPSVETMSLDNNILYPLIDKPKNRLSMVRNSVW 61
            + VI+AGAVP  +QLL SP E+                             +   + W
Sbjct: 4  NKQAVIEAGAVPPLVQLLSSPDEE-----------------------------VQEEAAW 34

Query: 62 VLSNLCR 68
           LSNL  
Sbjct: 35 ALSNLAA 41



 Score = 37.4 bits (88), Expect = 4e-04
 Identities = 10/39 (25%), Positives = 22/39 (56%)

Query: 239 TPDQIRYLIQQGCIEPFCELLTLLDAKIIQVALNGLENI 277
           +P+  + +I+ G + P  +LL+  D ++ + A   L N+
Sbjct: 1   SPENKQAVIEAGAVPPLVQLLSSPDEEVQEEAAWALSNL 39



 Score = 37.0 bits (87), Expect = 5e-04
 Identities = 12/31 (38%), Positives = 18/31 (58%)

Query: 80  APALACLSRLLFHADPDVLADACWAISYLSD 110
           A A+  L +LL   D +V  +A WA+S L+ 
Sbjct: 11  AGAVPPLVQLLSSPDEEVQEEAAWALSNLAA 41


>gnl|CDD|214547 smart00185, ARM, Armadillo/beta-catenin-like repeats.  Approx. 40
           amino acid repeat. Tandem repeats form superhelix of
           helices that is proposed to mediate interaction of
           beta-catenin with its ligands. Involved in transducing
           the Wingless/Wnt signal. In plakoglobin arm repeats bind
           alpha-catenin and N-cadherin.
          Length = 41

 Score = 45.1 bits (108), Expect = 8e-07
 Identities = 15/41 (36%), Positives = 27/41 (65%)

Query: 154 DDQQTQVILNCSALMCLLHLIQSPKESIRKEACWAVSNITA 194
           DD+  Q +++   L  L+ L++S  E + KEA WA+SN+++
Sbjct: 1   DDENKQAVVDAGGLPALVELLKSEDEEVVKEAAWALSNLSS 41



 Score = 43.6 bits (104), Expect = 3e-06
 Identities = 17/41 (41%), Positives = 29/41 (70%)

Query: 196 NRQQIQAVIDANIFPSLIEILQKAEFKTRKEAAWAITNATS 236
           + +  QAV+DA   P+L+E+L+  + +  KEAAWA++N +S
Sbjct: 1   DDENKQAVVDAGGLPALVELLKSEDEEVVKEAAWALSNLSS 41



 Score = 42.0 bits (100), Expect = 9e-06
 Identities = 17/40 (42%), Positives = 23/40 (57%)

Query: 112 PNEKIQAVIDAGVCRRLVELLMHDQHKVVSAALRAVGNIV 151
            +E  QAV+DAG    LVELL  +  +VV  A  A+ N+ 
Sbjct: 1   DDENKQAVVDAGGLPALVELLKSEDEEVVKEAAWALSNLS 40



 Score = 36.6 bits (86), Expect = 7e-04
 Identities = 8/39 (20%), Positives = 18/39 (46%)

Query: 239 TPDQIRYLIQQGCIEPFCELLTLLDAKIIQVALNGLENI 277
             +  + ++  G +    ELL   D ++++ A   L N+
Sbjct: 1   DDENKQAVVDAGGLPALVELLKSEDEEVVKEAAWALSNL 39



 Score = 32.0 bits (74), Expect = 0.026
 Identities = 15/67 (22%), Positives = 24/67 (35%), Gaps = 29/67 (43%)

Query: 2  QTRMVIDAGAVPVFIQLLLSPHEDQVTHPSVETMSLDNNILYPLIDKPKNRLSMVRNSVW 61
            + V+DAG +P  ++LL S  E+                             +V+ + W
Sbjct: 4  NKQAVVDAGGLPALVELLKSEDEE-----------------------------VVKEAAW 34

Query: 62 VLSNLCR 68
           LSNL  
Sbjct: 35 ALSNLSS 41



 Score = 30.1 bits (69), Expect = 0.16
 Identities = 12/29 (41%), Positives = 16/29 (55%)

Query: 82  ALACLSRLLFHADPDVLADACWAISYLSD 110
            L  L  LL   D +V+ +A WA+S LS 
Sbjct: 13  GLPALVELLKSEDEEVVKEAAWALSNLSS 41


>gnl|CDD|205691 pfam13513, HEAT_EZ, HEAT-like repeat.  The HEAT repeat family is
           related to armadillo/beta-catenin-like repeats (see
           pfam00514). These EZ repeats are found in subunits of
           cyanobacterial phycocyanin lyase and other proteins and
           probably carry out a scaffolding role.
          Length = 55

 Score = 35.9 bits (83), Expect = 0.002
 Identities = 12/55 (21%), Positives = 29/55 (52%), Gaps = 1/55 (1%)

Query: 179 ESIRKEACWAVSNITAGNRQQIQAVIDANIFPSLIEILQKAEFKTRKEAAWAITN 233
             +R+ A  A+  +  G  + ++  +   + P+L+ +L+  + + R+ AAWA+  
Sbjct: 1   WEVREAAALALGALAGGGPELLRPAVP-ELLPALLPLLKDDDDEVREAAAWALGR 54



 Score = 35.1 bits (81), Expect = 0.003
 Identities = 15/56 (26%), Positives = 27/56 (48%), Gaps = 1/56 (1%)

Query: 137 HKVVSAALRAVGNIVTGDDQQTQVILNCSALMCLLHLIQSPKESIRKEACWAVSNI 192
            +V  AA  A+G +  G  +  +  +    L  LL L++   + +R+ A WA+  I
Sbjct: 1   WEVREAAALALGALAGGGPELLRPAVP-ELLPALLPLLKDDDDEVREAAAWALGRI 55



 Score = 29.7 bits (67), Expect = 0.28
 Identities = 14/52 (26%), Positives = 18/52 (34%), Gaps = 1/52 (1%)

Query: 57  RNSVWVLSNLCRGKTPPPDFAKVAPALACLSRLLFHADPDVLADACWAISYL 108
             +   L  L  G  P      V   L  L  LL   D +V   A WA+  +
Sbjct: 5   EAAALALGALA-GGGPELLRPAVPELLPALLPLLKDDDDEVREAAAWALGRI 55


>gnl|CDD|222285 pfam13646, HEAT_2, HEAT repeats.  This family includes multiple
           HEAT repeats.
          Length = 88

 Score = 36.2 bits (84), Expect = 0.003
 Identities = 30/101 (29%), Positives = 43/101 (42%), Gaps = 22/101 (21%)

Query: 86  LSRLLFHADPDVLADACWAISYLSDGPNEKIQAVIDAGVCRRLVELLMHDQHKVVSAALR 145
           L  LL   DP+V A A  A+  L D         + A     L+ELL     +V  AA  
Sbjct: 5   LEALLSDPDPEVRAAAARALGELGD------PEALPA-----LLELLKDPDPEVRRAAAE 53

Query: 146 AVGNIVTGDDQQTQVILNCSALMCLLHLIQSPKESIRKEAC 186
           A+G +  GD +         AL  LL L+Q   +++ + A 
Sbjct: 54  ALGKL--GDPE---------ALPALLELLQDDDDAVVRAAA 83



 Score = 35.0 bits (81), Expect = 0.008
 Identities = 29/102 (28%), Positives = 49/102 (48%), Gaps = 23/102 (22%)

Query: 128 LVELLMHDQH-KVVSAALRAVGNIVTGDDQQTQVILNCSALMCLLHLIQSPKESIRKEAC 186
           L+E L+ D   +V +AA RA+G +  GD +         AL  LL L++ P   +R+ A 
Sbjct: 4   LLEALLSDPDPEVRAAAARALGEL--GDPE---------ALPALLELLKDPDPEVRRAAA 52

Query: 187 WAVSNITAGNRQQIQAVIDANIFPSLIEILQKAEFKTRKEAA 228
            A+  +  G+ + + A         L+E+LQ  +    + AA
Sbjct: 53  EALGKL--GDPEALPA---------LLELLQDDDDAVVRAAA 83



 Score = 30.4 bits (69), Expect = 0.29
 Identities = 23/68 (33%), Positives = 29/68 (42%), Gaps = 12/68 (17%)

Query: 82  ALACLSRLLFHADPDVLADACWAISYLSDGPNEKIQAVIDAGVCRRLVELLMHDQHKVV- 140
           AL  L  LL   DP+V   A  A+  L D         + A     L+ELL  D   VV 
Sbjct: 32  ALPALLELLKDPDPEVRRAAAEALGKLGD------PEALPA-----LLELLQDDDDAVVR 80

Query: 141 SAALRAVG 148
           +AA  A+ 
Sbjct: 81  AAAASALA 88



 Score = 30.0 bits (68), Expect = 0.40
 Identities = 17/76 (22%), Positives = 33/76 (43%), Gaps = 14/76 (18%)

Query: 173 LIQSPKESIRKEACWAVSNITAGNRQQIQAVIDANIFPSLIEILQKAEFKTRKEAAWAIT 232
           L+  P   +R  A  A+  +            D    P+L+E+L+  + + R+ AA A+ 
Sbjct: 8   LLSDPDPEVRAAAARALGEL-----------GDPEALPALLELLKDPDPEVRRAAAEALG 56

Query: 233 NATSGGTPDQIRYLIQ 248
                G P+ +  L++
Sbjct: 57  KL---GDPEALPALLE 69


>gnl|CDD|215629 PLN03200, PLN03200, cellulose synthase-interactive protein;
           Provisional.
          Length = 2102

 Score = 38.5 bits (90), Expect = 0.005
 Identities = 36/140 (25%), Positives = 61/140 (43%), Gaps = 9/140 (6%)

Query: 118 AVIDAGVCRRLVELLMHDQHKVVSAALRAVGNIVTGDDQQTQVILNCSALMCLLHLIQSP 177
           A ++AG    LV+LL        + A   +  ++   +     +L+  A+  LL L+   
Sbjct: 184 ATLEAGGVDILVKLLSSGNSDAQANAASLLARLMMAFESSISKVLDAGAVKQLLKLLGQG 243

Query: 178 KE-SIRKEACWAVSNITAGNRQQIQAVIDANIFPSLIE--ILQKAEFKTRKEAAWAITNA 234
            E S+R EA  A+  +++ +++  QA+ DA   P+LI   +    EF   + A     NA
Sbjct: 244 NEVSVRAEAAGALEALSSQSKEAKQAIADAGGIPALINATVAPSKEFMQGEFAQALQENA 303

Query: 235 T------SGGTPDQIRYLIQ 248
                   GG    I YL +
Sbjct: 304 MGALANICGGMSALILYLGE 323



 Score = 28.9 bits (65), Expect = 6.5
 Identities = 36/146 (24%), Positives = 59/146 (40%), Gaps = 15/146 (10%)

Query: 162  LNCSALMCLLHLIQSPKESIRKEACWAVSNITAGNRQQIQAVID--ANIFPSLIEILQKA 219
            L   A+  L+ ++ +  ES +  A  A+  +++GN  +  A+ D   N   +L +IL   
Sbjct: 1270 LARQAVQPLVEMLNTGSESEQHAAIGALIKLSSGNPSKALAIADVEGNALENLCKILSSD 1329

Query: 220  EFKTRKEAAWAI-----TNATSGGTPDQIRYLIQQGCIEPFCELLTLLDAKIIQVALNGL 274
                 KE A  +     TN     TP          CIEP   LL    +   +  +  L
Sbjct: 1330 SSLELKEDAAELCRVLFTNTRIRSTP------AAARCIEPLISLLVSESSTAQEAGVCAL 1383

Query: 275  ENIL--KLGEEEAKQTGSVNPYVVLI 298
            + +L  +   E     G+V P V L+
Sbjct: 1384 DRLLDDEQLAELVAAHGAVVPLVGLV 1409


>gnl|CDD|219645 pfam07923, N1221, N1221-like protein.  The sequences featured in
           this family are similar to a hypothetical protein
           product of ORF N1221 in the CPT1-SPC98 intergenic region
           of the yeast genome. This encodes an acidic polypeptide
           with several possible transmembrane regions.
          Length = 293

 Score = 32.7 bits (75), Expect = 0.27
 Identities = 26/113 (23%), Positives = 42/113 (37%), Gaps = 28/113 (24%)

Query: 156 QQTQVILNCSALMCLLHLIQSPKESIRKEACWAVSNITAGNRQQIQA-------VIDANI 208
           + T +     AL  LL++ Q           WA    TA   +Q+ +       +++   
Sbjct: 70  EVTDLSRRLKALEALLYIAQ---------GTWA---ETASKSEQLHSIRRNVLLLVECGG 117

Query: 209 FPSLIEILQKAEFKTRKEAAWAITNATSGGTPDQ--------IRYLIQQGCIE 253
           F  L+E+L    F+       A TN  +    D+        I YLI + C E
Sbjct: 118 FEPLVELLNM-FFEHAIACDDAATNLPAYSAADETELFLLLTILYLIVEVCRE 169


>gnl|CDD|146914 pfam04510, DUF577, Family of unknown function (DUF577).  Family of
           Arabidopsis thaliana proteins. Many of these members
           contain a repeated region.
          Length = 174

 Score = 30.0 bits (68), Expect = 1.5
 Identities = 15/60 (25%), Positives = 18/60 (30%), Gaps = 22/60 (36%)

Query: 11  AVPVFIQLLLSPHEDQVTHPSVETMSLDNNILYPLIDK--PK--NRLSM---VRNSVWVL 63
           A  VFI L                M L    + P +D   P+    L     V    WVL
Sbjct: 62  AFHVFIDL---------------PMPLYEEFIIPFLDNLLPEISKVLLPPEEVDVEDWVL 106


>gnl|CDD|152134 pfam11698, V-ATPase_H_C, V-ATPase subunit H.  The yeast
           Saccharomyces cerevisiae vacuolar H+-ATPase (V-ATPase)
           is a multisubunit complex responsible for acidifying
           organelles. It functions as an ATP dependent proton pump
           that transports protons across a lipid bilayer. This
           domain corresponds to the C terminal domain of the H
           subunit of V-ATPase. The N-terminal domain is required
           for the activation of the complex whereas the C-terminal
           domain is required for coupling ATP hydrolysis to proton
           translocation.
          Length = 119

 Score = 29.1 bits (66), Expect = 1.6
 Identities = 19/58 (32%), Positives = 27/58 (46%)

Query: 94  DPDVLADACWAISYLSDGPNEKIQAVIDAGVCRRLVELLMHDQHKVVSAALRAVGNIV 151
           DP  LA AC  I        E    +   GV  R++EL+ H+  +V   AL AV  ++
Sbjct: 57  DPLTLAVACNDIGEFVRLYPEGKNILEKLGVKARVMELMSHEDPEVRYEALLAVQKLM 114


>gnl|CDD|236973 PRK11767, PRK11767, SpoVR family protein; Provisional.
          Length = 498

 Score = 30.2 bits (69), Expect = 2.2
 Identities = 7/18 (38%), Positives = 13/18 (72%), Gaps = 1/18 (5%)

Query: 307 IEFLQSHENIEIYQKAFD 324
           +EFL SH N+ ++Q  ++
Sbjct: 301 LEFLHSHTNV-VFQPPYN 317


>gnl|CDD|147785 pfam05822, UMPH-1, Pyrimidine 5'-nucleotidase (UMPH-1).  This
           family consists of several eukaryotic pyrimidine
           5'-nucleotidase proteins. P5'N-1, also known as uridine
           monophosphate hydrolase-1 (UMPH-1), is a member of a
           large functional group of enzymes, characterized by the
           ability to dephosphorylate nucleic acids. P5'N-1
           catalyzes the dephosphorylation of pyrimidine nucleoside
           monophosphates to the corresponding nucleosides.
           Deficiencies in this proteins function can lead to
           several different disorders in humans.
          Length = 246

 Score = 28.8 bits (65), Expect = 3.8
 Identities = 9/23 (39%), Positives = 14/23 (60%), Gaps = 2/23 (8%)

Query: 306 KIEFLQSH--ENIEIYQKAFDII 326
           KI FL     EN++ Y  ++DI+
Sbjct: 206 KIGFLNDKVEENLDKYMDSYDIV 228


>gnl|CDD|224460 COG1543, COG1543, Uncharacterized conserved protein [Function
           unknown].
          Length = 504

 Score = 29.3 bits (66), Expect = 3.9
 Identities = 21/93 (22%), Positives = 33/93 (35%), Gaps = 17/93 (18%)

Query: 241 DQIRYLIQQGCIEPFCELLTLLDAKIIQVALNGLENILKLGEEEAKQTGS---------V 291
           + +++LI         E+L   D  +    L  LE  ++L E+E K+             
Sbjct: 50  EGVKFLITLSMTPVLMEMLA--DPYLQDRYLRYLEWKIELSEKEEKRYADQSLRELAEFY 107

Query: 292 NPY---VVLIEECYG---LDKIEFLQSHENIEI 318
            P         E Y    L+  + LQ    IEI
Sbjct: 108 IPEFLNARGYWEQYDGNLLEAFKELQRSGGIEI 140


>gnl|CDD|110227 pfam01209, Ubie_methyltran, ubiE/COQ5 methyltransferase family. 
          Length = 233

 Score = 29.0 bits (65), Expect = 3.9
 Identities = 23/68 (33%), Positives = 35/68 (51%), Gaps = 17/68 (25%)

Query: 265 KIIQVALNGLENILKLGEEEAKQTGSVNPYVVLIEECYGLDKIEFLQSH-ENIEIYQKAF 323
           K++ + +N  EN+LK GE++AK+ G  N              IEFLQ + E +     +F
Sbjct: 74  KVVGLDIN--ENMLKEGEKKAKEEGKYN--------------IEFLQGNAEELPFEDDSF 117

Query: 324 DIIEHYFG 331
           DI+   FG
Sbjct: 118 DIVTISFG 125


>gnl|CDD|131422 TIGR02369, trimeth_pyl, trimethylamine:corrinoid methyltransferase.
            This model represents a distinct subfamily of
           pfam06253. All members here are trimethylamine:corrinoid
           methyltransferases that contain a critical pyrrolysine
           residue incorporated during translation via a special
           tRNA for a TAG (amber) codon. Known members so far are
           from the genus Methanosarcina. It is one of a suite of
           three non-homologous enzymes with a critical UAG-encoded
           pyrrolysine residue in these species (along with
           dimethylamine methyltransferase and monomethylamine
           methyltransferase). It demethylates trimethylamine,
           leaving dimethylamine, and methylates the prosthetic
           group of its small cognate corrinoid protein, MttC. The
           methyl group is then transferred by
           methylcorrinoid:coenzyme M methyltransferase to coenzyme
           M. Note that the pyrrolysine residue is variously
           translated as K or X, or as a stop codon that truncates
           the sequence.
          Length = 489

 Score = 29.3 bits (65), Expect = 4.3
 Identities = 12/28 (42%), Positives = 19/28 (67%), Gaps = 3/28 (10%)

Query: 314 ENIEIYQKAFDIIEHYFGSEEEDTRVAP 341
           EN+E Y+   DI++ Y+G +EE+ R  P
Sbjct: 186 ENVEYYR---DIVKAYYGGDEEEARKKP 210


>gnl|CDD|131323 TIGR02270, TIGR02270, conserved hypothetical protein.  Members are
           found in Myxococcus xanthus (six members), Geobacter
           sulfurreducens, and Pseudomonas aeruginosa; a short
           protein homologous to the N-terminal region is found in
           Mesorhizobium loti. All sequence are from
           Proteobacteria. The function is unknown [Hypothetical
           proteins, Conserved].
          Length = 410

 Score = 29.1 bits (65), Expect = 4.3
 Identities = 31/152 (20%), Positives = 51/152 (33%), Gaps = 36/152 (23%)

Query: 127 RLVELLMHDQHKVVSAALRAVGNIVTGDDQQTQVILNCSALMCLLHLIQSPKESIRKEAC 186
            L   L H+   V +AALRA+G +     + T           L   ++     +R  A 
Sbjct: 151 ALEAALTHEDALVRAAALRALGELPRRLSEST-----------LRLYLRDSDPEVRFAAL 199

Query: 187 WAVSNITAGN----------------RQQIQAVIDANIFPS----LIEILQKAEFKTRKE 226
            A     +                  RQ++  ++     P     L E+LQ A   TR+E
Sbjct: 200 EAGLLAGSRLAWGVCRRFQVLEGGPHRQRLLVLLAVAGGPDAQAWLRELLQAAA--TRRE 257

Query: 227 AAWAITNATSGGTPDQIRYLIQQGCIEPFCEL 258
           A  A       G  +   + ++     P+  L
Sbjct: 258 ALRA-VGLV--GDVEAAPWCLEAMREPPWARL 286


>gnl|CDD|233811 TIGR02289, M3_not_pepF, oligoendopeptidase, M3 family.  This family
           consists of probable oligoendopeptidases in the M3
           family, related to lactococcal PepF and group B
           streptococcal PepB (TIGR00181) but in a distinct clade
           with considerable sequence differences. The likely
           substrate is small peptides and not whole proteins, as
           with PepF, but members are not characterized and the
           activity profile may differ. Several bacteria have both
           a member of this family and a member of the PepF family.
          Length = 549

 Score = 29.0 bits (65), Expect = 5.2
 Identities = 10/43 (23%), Positives = 19/43 (44%), Gaps = 5/43 (11%)

Query: 173 LIQSPKESIRKEACWAVSNITAGNRQQIQAVIDANIFPSLIEI 215
            +Q P  S RK+A  A     A   +++       I+  L+++
Sbjct: 152 FLQDPNRSTRKKAWEARYEFFAEVEEELD-----RIYDELVKV 189


>gnl|CDD|202500 pfam02985, HEAT, HEAT repeat.  The HEAT repeat family is related to
           armadillo/beta-catenin-like repeats (see pfam00514).
          Length = 31

 Score = 25.6 bits (57), Expect = 5.6
 Identities = 7/25 (28%), Positives = 12/25 (48%)

Query: 169 CLLHLIQSPKESIRKEACWAVSNIT 193
            LL L+  P   +R+ A  A+  + 
Sbjct: 4   LLLELLNDPDPEVREAAAEALGALA 28


>gnl|CDD|223179 COG0101, TruA, Pseudouridylate synthase [Translation, ribosomal
           structure and biogenesis].
          Length = 266

 Score = 28.4 bits (64), Expect = 6.5
 Identities = 10/27 (37%), Positives = 13/27 (48%)

Query: 53  LSMVRNSVWVLSNLCRGKTPPPDFAKV 79
             MVRN V  L  + RGK P     ++
Sbjct: 193 WHMVRNIVGALLLVGRGKRPVEWIKEL 219


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.320    0.135    0.403 

Gapped
Lambda     K      H
   0.267   0.0877    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 19,340,946
Number of extensions: 1882263
Number of successful extensions: 1706
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1684
Number of HSP's successfully gapped: 57
Length of query: 379
Length of database: 10,937,602
Length adjustment: 99
Effective length of query: 280
Effective length of database: 6,546,556
Effective search space: 1833035680
Effective search space used: 1833035680
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 60 (26.6 bits)