RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy14923
         (180 letters)



>gnl|CDD|238686 cd01390, HMGB-UBF_HMG-box, HMGB-UBF_HMG-box, class II and III
           members of the HMG-box superfamily of DNA-binding
           proteins. These proteins bind the minor groove of DNA in
           a non-sequence specific fashion and contain two or more
           tandem HMG boxes. Class II members include non-histone
           chromosomal proteins, HMG1 and HMG2, which bind to bent
           or distorted DNA such as four-way DNA junctions,
           synthetic DNA cruciforms, kinked cisplatin-modified DNA,
           DNA bulges, cross-overs in supercoiled DNA, and can
           cause looping of linear DNA. Class III members include
           nucleolar and mitochondrial transcription factors, UBF
           and mtTF1, which bind four-way DNA junctions.
          Length = 66

 Score = 58.0 bits (141), Expect = 4e-12
 Identities = 27/61 (44%), Positives = 42/61 (68%)

Query: 49  PKKPLTPYFKFQGQIRNKIMERNPNIKIIEVTKLVSEEWKKIGEAEKKRLENEYAKEKEI 108
           PK+PL+ YF F  + R K+ + NP+  + EVTK++ E+WK++ E EKK+ E +  K+KE 
Sbjct: 1   PKRPLSAYFLFSQEQRPKLKKENPDASVTEVTKILGEKWKELSEEEKKKYEEKAEKDKER 60

Query: 109 Y 109
           Y
Sbjct: 61  Y 61


>gnl|CDD|238037 cd00084, HMG-box, High Mobility Group (HMG)-box is found in a
           variety of eukaryotic chromosomal proteins and
           transcription factors. HMGs bind to the minor groove of
           DNA and have been classified by DNA binding preferences.
           Two phylogenically distinct groups of Class I proteins
           bind DNA in a sequence specific fashion and contain a
           single HMG box. One group (SOX-TCF) includes
           transcription factors, TCF-1, -3, -4; and also SRY and
           LEF-1, which bind four-way DNA junctions and duplex DNA
           targets. The second group (MATA) includes fungal mating
           type gene products MC, MATA1 and Ste11. Class II and III
           proteins (HMGB-UBF) bind DNA in a non-sequence specific
           fashion and contain two or more tandem HMG boxes. Class
           II members include non-histone chromosomal proteins,
           HMG1 and HMG2, which bind to bent or distorted DNA such
           as four-way DNA junctions, synthetic DNA cruciforms,
           kinked cisplatin-modified DNA, DNA bulges, cross-overs
           in supercoiled DNA, and can cause looping of linear DNA.
           Class III members include nucleolar and mitochondrial
           transcription factors, UBF and mtTF1, which bind
           four-way DNA junctions.
          Length = 66

 Score = 56.1 bits (136), Expect = 2e-11
 Identities = 24/66 (36%), Positives = 41/66 (62%)

Query: 49  PKKPLTPYFKFQGQIRNKIMERNPNIKIIEVTKLVSEEWKKIGEAEKKRLENEYAKEKEI 108
           PK+PL+ YF F  + R ++   NP + + E++K++ E WK + E EKK+ E +  K+KE 
Sbjct: 1   PKRPLSAYFLFSQEHRAEVKAENPGLSVGEISKILGEMWKSLSEEEKKKYEEKAEKDKER 60

Query: 109 YAQNVK 114
           Y + + 
Sbjct: 61  YEKEMP 66


>gnl|CDD|197700 smart00398, HMG, high mobility group. 
          Length = 70

 Score = 51.5 bits (124), Expect = 1e-09
 Identities = 24/69 (34%), Positives = 40/69 (57%)

Query: 48  KPKKPLTPYFKFQGQIRNKIMERNPNIKIIEVTKLVSEEWKKIGEAEKKRLENEYAKEKE 107
           KPK+P++ +  F  + R KI   NP++   E++K + E WK + E EK   E +  K+KE
Sbjct: 1   KPKRPMSAFMLFSQENRAKIKAENPDLSNAEISKKLGERWKLLSEEEKAPYEEKAKKDKE 60

Query: 108 IYAQNVKKY 116
            Y + + +Y
Sbjct: 61  RYEEEMPEY 69


>gnl|CDD|189580 pfam00505, HMG_box, HMG (high mobility group) box. 
          Length = 69

 Score = 51.1 bits (123), Expect = 2e-09
 Identities = 26/68 (38%), Positives = 40/68 (58%)

Query: 49  PKKPLTPYFKFQGQIRNKIMERNPNIKIIEVTKLVSEEWKKIGEAEKKRLENEYAKEKEI 108
           PK+PL+ +F F  + R K+   NP +K  E++K++ E+WK + E EKK  E +  KEK  
Sbjct: 1   PKRPLSAFFLFSQEQRAKLKAENPGLKNAEISKILGEKWKNLSEEEKKPYEEKAEKEKAR 60

Query: 109 YAQNVKKY 116
           Y +    Y
Sbjct: 61  YEKAYPAY 68


>gnl|CDD|227935 COG5648, NHP6B, Chromatin-associated proteins containing the HMG
           domain [Chromatin structure and dynamics].
          Length = 211

 Score = 47.2 bits (112), Expect = 6e-07
 Identities = 32/110 (29%), Positives = 56/110 (50%), Gaps = 1/110 (0%)

Query: 45  IPEKPKKPLTPYFKFQGQIRNKIMERNPNIKIIEVTKLVSEEWKKIGEAEKKRLENEYAK 104
            P  PK+PL+ YF +  + R++I + NP +   EV KL+SE+WK++ + EK+    E   
Sbjct: 67  DPNGPKRPLSAYFLYSAENRDEIRKENPKLTFGEVGKLLSEKWKELTDEEKEPYYKEANS 126

Query: 105 EKEIYAQNVKKYHDELTP-EAIEFLRKEKEVKKQKKEKREMKKLFRETGK 153
           ++E Y +  ++Y+ +L     I    + +   + K E     K   E  K
Sbjct: 127 DRERYQREKEEYNKKLPNKAPIGPFIENEPKIRPKVEGPSPDKALVEETK 176



 Score = 26.7 bits (59), Expect = 6.8
 Identities = 20/62 (32%), Positives = 40/62 (64%)

Query: 48  KPKKPLTPYFKFQGQIRNKIMERNPNIKIIEVTKLVSEEWKKIGEAEKKRLENEYAKEKE 107
             K P+ P+ + + +IR K+   +P+  ++E TK++S+ W ++ E++KK+  ++Y K KE
Sbjct: 143 PNKAPIGPFIENEPKIRPKVEGPSPDKALVEETKIISKAWSELDESKKKKYIDKYKKLKE 202

Query: 108 IY 109
            Y
Sbjct: 203 EY 204


>gnl|CDD|185511 PTZ00199, PTZ00199, high mobility group protein; Provisional.
          Length = 94

 Score = 35.2 bits (81), Expect = 0.003
 Identities = 26/75 (34%), Positives = 37/75 (49%), Gaps = 9/75 (12%)

Query: 46  PEKPKKPLTPYFKFQGQIRNKIMERNPNIK--IIEVTKLVSEEWKKIGEAEKKRLENE-- 101
           P  PK+ L+ Y  F  + R +I+  NP +   +  V K+V E W K+ E EK   E +  
Sbjct: 20  PNAPKRALSAYMFFAKEKRAEIIAENPELAKDVAAVGKMVGEAWNKLSEEEKAPYEKKAQ 79

Query: 102 -----YAKEKEIYAQ 111
                Y KEK  YA+
Sbjct: 80  EDKVRYEKEKAEYAK 94


>gnl|CDD|204115 pfam09011, DUF1898, Domain of unknown function (DUF1898).  This
           domain is predominantly found in Maelstrom homolog
           proteins. It has no known function.
          Length = 69

 Score = 34.3 bits (79), Expect = 0.004
 Identities = 22/67 (32%), Positives = 37/67 (55%), Gaps = 1/67 (1%)

Query: 46  PEKPKKPLTPYFKFQGQIRNKIMERNPNI-KIIEVTKLVSEEWKKIGEAEKKRLENEYAK 104
           P KPK     YF F   +R ++    P +  + E +KL SE+WK + E EK++ E +  +
Sbjct: 1   PNKPKAKRNAYFFFVQTMRPELKREGPQVPGVAEFSKLCSEKWKAMSEEEKEKYEEKARE 60

Query: 105 EKEIYAQ 111
           +K+ Y +
Sbjct: 61  DKKRYDR 67


>gnl|CDD|238684 cd01388, SOX-TCF_HMG-box, SOX-TCF_HMG-box, class I member of the
           HMG-box superfamily of DNA-binding proteins. These
           proteins contain a single HMG box, and bind the minor
           groove of DNA in a highly sequence-specific manner.
           Members include SRY and its homologs in insects and
           vertebrates, and transcription factor-like proteins,
           TCF-1, -3, -4, and LEF-1. They appear to bind the minor
           groove of the A/T C A A A G/C-motif.
          Length = 72

 Score = 31.1 bits (71), Expect = 0.051
 Identities = 15/60 (25%), Positives = 32/60 (53%)

Query: 50  KKPLTPYFKFQGQIRNKIMERNPNIKIIEVTKLVSEEWKKIGEAEKKRLENEYAKEKEIY 109
           K+P+  +  F  + R K+++  P  +   ++K++ + WK +   EK+    E  K KE++
Sbjct: 3   KRPMNAFMLFSKRHRRKVLQEYPLKENRAISKILGDRWKALSNEEKQPYYEEAKKLKELH 62


>gnl|CDD|213349 cd12789, RasGAP_plexin_C1, Ras-GTPase Activating Domain of
           plexin-C1.  Plexins form a conserved family of
           transmembrane receptors for semaphorins and may be the
           ancestors of semaphorins. Plexins are divided into four
           types (A-D) according to sequence similarity. Plexin-C1
           has been identified as the receptor of semaphorin 7A,
           which plays regulatory roles in both the immune and
           nervous systems. Unlike other semaphorins which act as
           repulsive guidance cues, Sema7A enhances central and
           peripheral axon growth and is required for proper axon
           tract formation during embryonic development. Plexin-C1
           is a potential tumor suppressor for melanoma
           progression. The expression of Plexin-C1 is diminished
           or absent in human melanoma cell lines. Cofilin, an
           actin-binding protein involved in cell migration, is a
           downstream target of Sema7A and Plexin-C1 signaling.
           Melanoma invasion and metastasis may be promoted through
           the loss of Plexin-C1 inhibitory signaling on cofilin
           activation. Plexins contain a C-terminal RasGAP domain,
           which functions as an enhancer of the hydrolysis of GTP
           that is bound to Ras-GTPases. Plexins display GAP
           activity towards the Ras homolog Rap. Although the Rho
           (Ras homolog) GTPases are most closely related to
           members of the Ras family, RhoGAP and RasGAP show no
           sequence homology at their amino acid level. RasGTPases
           function as molecular switches in a large number of of
           signaling pathways. When bound to GTP they are in the on
           state and when bound to GDP they are in the off state.
           The RasGAP domain speeds up the hydrolysis of GTP in
           Ras-like proteins acting as a negative regulator.
          Length = 393

 Score = 33.3 bits (76), Expect = 0.051
 Identities = 26/83 (31%), Positives = 45/83 (54%), Gaps = 8/83 (9%)

Query: 76  IIEVTKLVSEEWKKIGEAEKKRLENEYAKE---KEIYAQNVKKYHDELTPEAIEFLRKEK 132
           I ++  L S E ++    E K+ ENE+ +E    EIY + + KY DE+    +  L +E+
Sbjct: 315 IRDLPPLSSSELEEFLTQESKKHENEFNEEVALMEIY-KYIVKYFDEI----LNKLERER 369

Query: 133 EVKKQKKEKREMKKLFRETGKPK 155
            +++ +K+   +K LF E  K K
Sbjct: 370 GLEEVQKQLLHVKALFDEKKKCK 392


>gnl|CDD|240271 PTZ00108, PTZ00108, DNA topoisomerase 2-like protein; Provisional.
          Length = 1388

 Score = 33.1 bits (76), Expect = 0.077
 Identities = 26/90 (28%), Positives = 40/90 (44%), Gaps = 18/90 (20%)

Query: 48  KPKKPLTP-YFKFQGQIRNK---------IMERNPNIKIIEVTKLVSEEWKKIGEAEKKR 97
           +  +PL P Y  F+G I            I E+ P+   +E+T+L      K  E  K+ 
Sbjct: 850 EEVRPLVPWYRGFKGTIERNPKGKFISKGIYEKVPD-GTVEITEL---PIGKWTEDYKEF 905

Query: 98  LENEYAKEKEIYAQNVKKYHDELTPEAIEF 127
           LE+E  KEK++    +  Y D  T   + F
Sbjct: 906 LESETLKEKDV----IVDYRDYSTANTVHF 931


>gnl|CDD|237005 PRK11864, PRK11864, 2-ketoisovalerate ferredoxin oxidoreductase
           subunit beta; Provisional.
          Length = 300

 Score = 31.2 bits (71), Expect = 0.25
 Identities = 17/57 (29%), Positives = 30/57 (52%), Gaps = 6/57 (10%)

Query: 40  SQPLSIPEKPKKPLTPYFKFQGQIRNKIMERNPNIKIIEVTKLVSEEWKKIGEAEKK 96
           S+ L   +K K P+  Y K QG+ ++   E    IK ++  + + E W++I +  KK
Sbjct: 248 SKTLLDKKKRK-PVEEYLKLQGRFKHLTEE---EIKGLQ--EEIDEMWEEIKKLAKK 298


>gnl|CDD|222649 pfam14284, PcfJ, PcfJ-like protein.  The PcfJ-like protein family
           includes the E. faecalis PcfJ protein, which is
           functionally uncharacterized. It is found in bacteria
           and viruses, and is typically between 159 and 170 amino
           acids in length. There is a conserved HCV sequence
           motif.
          Length = 169

 Score = 30.0 bits (68), Expect = 0.45
 Identities = 13/55 (23%), Positives = 29/55 (52%), Gaps = 1/55 (1%)

Query: 96  KRLENEYAKEKEIYAQNVKKYHDELTPEAIEFLRKEKEVKKQKKEKREMKKLFRE 150
           +RL  +      +  +N+K  HD       +  +++++ +K +K   E++K+FRE
Sbjct: 10  RRLGKDLHDAHYVCPKNLKAAHDRAV-AKYQAKKEKEKAEKMRKRYPELEKIFRE 63


>gnl|CDD|183346 PRK11865, PRK11865, pyruvate ferredoxin oxidoreductase subunit
           beta; Provisional.
          Length = 299

 Score = 30.5 bits (69), Expect = 0.47
 Identities = 16/56 (28%), Positives = 32/56 (57%), Gaps = 5/56 (8%)

Query: 39  TSQPLSIPEKPKKPLTPYFKFQGQIRNKIMERNPNIKIIEVTKLVSEEWKKIGEAE 94
           T +PL +  + +KP+  Y K QG+ ++       +I+I++  K + E+WK++G   
Sbjct: 249 TYEPLHLDRRTRKPIEEYLKVQGRFKHL---TEEDIEILQ--KYIDEKWKELGLML 299


>gnl|CDD|173412 PTZ00121, PTZ00121, MAEBL; Provisional.
          Length = 2084

 Score = 30.5 bits (68), Expect = 0.48
 Identities = 19/73 (26%), Positives = 33/73 (45%)

Query: 84   SEEWKKIGEAEKKRLENEYAKEKEIYAQNVKKYHDELTPEAIEFLRKEKEVKKQKKEKRE 143
            +EE KK  EA+KK  E + A E +  A+  KK  D    +A E  +  +  K + +   +
Sbjct: 1298 AEEKKKADEAKKKAEEAKKADEAKKKAEEAKKKADAAKKKAEEAKKAAEAAKAEAEAAAD 1357

Query: 144  MKKLFRETGKPKK 156
              +   E  +  +
Sbjct: 1358 EAEAAEEKAEAAE 1370



 Score = 30.5 bits (68), Expect = 0.57
 Identities = 25/85 (29%), Positives = 42/85 (49%), Gaps = 1/85 (1%)

Query: 75   KIIEVTKLVSEEWKKIGEAEKKRLENEYAKEKEIYAQNVKKYHDELTPEAIEFLRKEKEV 134
            K  +  K  +EE KK  EA+KK  E++   ++   A   KK  DE   +A E  +K  E 
Sbjct: 1378 KKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEE-KKKADEA 1436

Query: 135  KKQKKEKREMKKLFRETGKPKKPGN 159
            KK+ +E ++  +  ++  + KK   
Sbjct: 1437 KKKAEEAKKADEAKKKAEEAKKAEE 1461



 Score = 30.1 bits (67), Expect = 0.75
 Identities = 24/73 (32%), Positives = 37/73 (50%), Gaps = 1/73 (1%)

Query: 84   SEEWKKIGEAEKKRLENEYAKEKEIYAQNVKKYHDELTPEAIEFLRKEKEVKKQKKEKRE 143
             E  KK   A+KK  E + A E +  A+  KK  DEL   A    +K  E KK+ +EK++
Sbjct: 1374 EEAKKKADAAKKKAEEKKKADEAKKKAEEDKKKADELKKAAAA-KKKADEAKKKAEEKKK 1432

Query: 144  MKKLFRETGKPKK 156
              +  ++  + KK
Sbjct: 1433 ADEAKKKAEEAKK 1445



 Score = 29.0 bits (64), Expect = 1.9
 Identities = 24/82 (29%), Positives = 40/82 (48%), Gaps = 2/82 (2%)

Query: 75   KIIEVTKLVSEEWKKIGEAEKKRLENEYAKEKEIYAQNVKKYHDELTPEAIEFLRKEKEV 134
            K  E  K   E+ KK  E +K     + A E +  A+  KK  DE   +A E  +K  E 
Sbjct: 1392 KADEAKKKAEEDKKKADELKKAAAAKKKADEAKKKAEEKKK-ADEAKKKAEE-AKKADEA 1449

Query: 135  KKQKKEKREMKKLFRETGKPKK 156
            KK+ +E ++ ++  ++  + KK
Sbjct: 1450 KKKAEEAKKAEEAKKKAEEAKK 1471



 Score = 28.6 bits (63), Expect = 2.1
 Identities = 23/79 (29%), Positives = 40/79 (50%)

Query: 64   RNKIMERNPNIKIIEVTKLVSEEWKKIGEAEKKRLENEYAKEKEIYAQNVKKYHDELTPE 123
            + +  ++    +I EV KL  EE K   E  KK  E +   E+   A+  KK  ++L  +
Sbjct: 1582 KAEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKK 1641

Query: 124  AIEFLRKEKEVKKQKKEKR 142
              E  +K +E+KK ++E +
Sbjct: 1642 EAEEKKKAEELKKAEEENK 1660



 Score = 28.2 bits (62), Expect = 3.4
 Identities = 25/78 (32%), Positives = 38/78 (48%), Gaps = 1/78 (1%)

Query: 78   EVTKLVSEEWKKIGEAEKKRLENEYAKEKEIYAQNVKKYHDELTPEAIEFLRKEKEVKKQ 137
            +  K  +EE KK  EA+KK  E + A E +  A+  KK  DE   +A E  +K  E KK 
Sbjct: 1447 DEAKKKAEEAKKAEEAKKKAEEAKKADEAKKKAEEAKK-ADEAKKKAEEAKKKADEAKKA 1505

Query: 138  KKEKREMKKLFRETGKPK 155
             + K++  +  +     K
Sbjct: 1506 AEAKKKADEAKKAEEAKK 1523



 Score = 27.8 bits (61), Expect = 4.3
 Identities = 25/82 (30%), Positives = 41/82 (50%), Gaps = 2/82 (2%)

Query: 75   KIIEVTKLVSEEWKKIGEAEKKRLENEYAKEKEIYAQNVKKYHDELTPEAIEFLRKEKEV 134
            K  +  K  +EE KK  EA+KK  E + A E +  A+  KK   E   +  E  +K  E 
Sbjct: 1418 KKADEAKKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAKK--AEEAKKKAEEAKKADEA 1475

Query: 135  KKQKKEKREMKKLFRETGKPKK 156
            KK+ +E ++  +  ++  + KK
Sbjct: 1476 KKKAEEAKKADEAKKKAEEAKK 1497



 Score = 26.6 bits (58), Expect = 8.7
 Identities = 22/70 (31%), Positives = 34/70 (48%)

Query: 84   SEEWKKIGEAEKKRLENEYAKEKEIYAQNVKKYHDELTPEAIEFLRKEKEVKKQKKEKRE 143
            +EE KK  EA+KK  E + A E +  A+  KK  DE    A    + ++  K ++ +K +
Sbjct: 1466 AEEAKKADEAKKKAEEAKKADEAKKKAEEAKKKADEAKKAAEAKKKADEAKKAEEAKKAD 1525

Query: 144  MKKLFRETGK 153
              K   E  K
Sbjct: 1526 EAKKAEEAKK 1535


>gnl|CDD|223087 COG0008, GlnS, Glutamyl- and glutaminyl-tRNA synthetases
           [Translation, ribosomal structure and biogenesis].
          Length = 472

 Score = 30.0 bits (68), Expect = 0.65
 Identities = 17/85 (20%), Positives = 29/85 (34%), Gaps = 2/85 (2%)

Query: 75  KIIEVTKLVSEEWKKIGEAEKKRLENEYAKEKEIYAQNVKKYHDELTPEAIE-FLRKEKE 133
            +IE+ +L    ++   E   KRL N    E  +          + T E I  +++    
Sbjct: 366 TLIEIERLDFFFFEDKEEVRLKRLANVIVAE-VLEKDAEGLITSDWTKENIIHWVKAVAR 424

Query: 134 VKKQKKEKREMKKLFRETGKPKKPG 158
           +   K  K         TG   +P 
Sbjct: 425 LFGVKGPKLFPPLRVALTGGYVEPE 449


>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
          Length = 880

 Score = 30.0 bits (68), Expect = 0.82
 Identities = 26/87 (29%), Positives = 48/87 (55%), Gaps = 3/87 (3%)

Query: 75  KIIEVTKLVSEEWKKIGEAEKKRLENEYAKEKEIYAQNVKKYHDELTPEAIEFLRKEKEV 134
           ++ E+ K + E  K++ E E++    E AK K+   + +KK    LTPE +E  ++ +E+
Sbjct: 339 RLEELKKKLKELEKRLEELEERHELYEEAKAKKEELERLKKRLTGLTPEKLE--KELEEL 396

Query: 135 KKQKKE-KREMKKLFRETGKPKKPGNS 160
           +K K+E + E+ K+    G+ KK    
Sbjct: 397 EKAKEEIEEEISKITARIGELKKEIKE 423


>gnl|CDD|235250 PRK04195, PRK04195, replication factor C large subunit;
           Provisional.
          Length = 482

 Score = 29.5 bits (67), Expect = 1.1
 Identities = 21/60 (35%), Positives = 31/60 (51%)

Query: 119 ELTPEAIEFLRKEKEVKKQKKEKREMKKLFRETGKPKKPGNSFYLFIKDNIDKKEYEGKS 178
           ELT E IEFL   K+  K+ K+  E  +  RE  K +K   +F    K+  +++E E K 
Sbjct: 393 ELTEEEIEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEEEEKEKKE 452



 Score = 28.0 bits (63), Expect = 3.1
 Identities = 26/92 (28%), Positives = 48/92 (52%), Gaps = 11/92 (11%)

Query: 74  IKIIEVTKLVSEEWKKI-GEAEKKRLENEYAKEKEIYAQNVKKYHDELTPEAIEFLRKEK 132
           I+ +  +K  +++ KKI  +AEKKR E +  K+K+ +A   K+  +E          +EK
Sbjct: 399 IEFLTGSKKATKKIKKIVEKAEKKREEEKKEKKKKAFAGKKKEEEEE----------EEK 448

Query: 133 EVKKQKKEKREMKKLFRETGKPKKPGNSFYLF 164
           E K+++KE+ E +    +  + +K      LF
Sbjct: 449 EKKEEEKEEEEEEAEEEKEEEEEKKKKQATLF 480


>gnl|CDD|235623 PRK05849, PRK05849, hypothetical protein; Provisional.
          Length = 783

 Score = 29.6 bits (67), Expect = 1.1
 Identities = 16/66 (24%), Positives = 29/66 (43%), Gaps = 3/66 (4%)

Query: 109 YAQNVKKYHDELTPEAIEFLRKEKEVKKQKKEKREMKKLFRETGKPKKPGNSFYLFIKDN 168
           Y ++ + Y D    E+      +KE +   +E + + KL  E G        F  F+K+ 
Sbjct: 528 YDEDPELYFDWTKDESTV--DSDKEFELSPEELKALNKLLHEHGLNIDA-EEFLDFLKEA 584

Query: 169 IDKKEY 174
           I+ +E 
Sbjct: 585 IEGREL 590


>gnl|CDD|227217 COG4880, COG4880, Secreted protein containing C-terminal
           beta-propeller domain distantly related to WD-40 repeats
           [General function prediction only].
          Length = 603

 Score = 29.1 bits (65), Expect = 1.3
 Identities = 15/59 (25%), Positives = 23/59 (38%), Gaps = 6/59 (10%)

Query: 59  FQGQIRNKIMERNP------NIKIIEVTKLVSEEWKKIGEAEKKRLENEYAKEKEIYAQ 111
             G++R KI   NP      N K++E+       +   GE  + RL+       E   Q
Sbjct: 288 LPGEVREKIDLTNPLEDLKNNAKVVEILTTAERVYISEGEGPRLRLKAAAENPGENLDQ 346


>gnl|CDD|238685 cd01389, MATA_HMG-box, MATA_HMG-box, class I member of the HMG-box
           superfamily of DNA-binding proteins. These proteins
           contain a single HMG box, and bind the minor groove of
           DNA in a highly sequence-specific manner. Members
           include the fungal mating type gene products MC, MATA1
           and Ste11.
          Length = 77

 Score = 27.2 bits (61), Expect = 1.4
 Identities = 10/60 (16%), Positives = 25/60 (41%)

Query: 48  KPKKPLTPYFKFQGQIRNKIMERNPNIKIIEVTKLVSEEWKKIGEAEKKRLENEYAKEKE 107
           K  +P   +  ++     ++   NP +   E+++++   W+      K   +    +EKE
Sbjct: 1   KIPRPRNAFILYRQDKHAQLKTENPGLTNNEISRIIGRMWRSESPEVKAYYKELAEEEKE 60


>gnl|CDD|224309 COG1391, GlnE, Glutamine synthetase adenylyltransferase
           [Posttranslational modification, protein turnover,
           chaperones / Signal transduction mechanisms].
          Length = 963

 Score = 28.9 bits (65), Expect = 1.8
 Identities = 14/37 (37%), Positives = 20/37 (54%), Gaps = 3/37 (8%)

Query: 123 EAI--EFLRKEKEVKKQKKEKREMK-KLFRETGKPKK 156
           EAI  E L   ++  K K E  EM+ K+  E G+P +
Sbjct: 813 EAIRAEILTYPRDGAKLKTEVEEMREKMRAELGRPPR 849


>gnl|CDD|192760 pfam11444, DUF2895, Protein of unknown function (DUF2895).  This is
           a bacterial family of uncharacterized proteins.
          Length = 199

 Score = 28.0 bits (63), Expect = 2.1
 Identities = 12/38 (31%), Positives = 21/38 (55%)

Query: 106 KEIYAQNVKKYHDELTPEAIEFLRKEKEVKKQKKEKRE 143
           +E Y  N+++    LTP    FL+K+ E++    E R+
Sbjct: 82  EEDYGANIERLSPYLTPRCQAFLQKDYELRLLNGELRD 119


>gnl|CDD|235582 PRK05729, valS, valyl-tRNA synthetase; Reviewed.
          Length = 874

 Score = 28.5 bits (65), Expect = 2.2
 Identities = 24/63 (38%), Positives = 32/63 (50%), Gaps = 14/63 (22%)

Query: 92  EAEKKRLENEYAK-EKEIYAQNVKKYHDELTPEAIEFLRK------EKEVKKQKKEKREM 144
           EAE  RLE E AK EKEI  +  KK  +E       F+ K      EKE +K  + + ++
Sbjct: 810 EAELARLEKELAKLEKEI-ERVEKKLSNE------GFVAKAPEEVVEKEREKLAEYEEKL 862

Query: 145 KKL 147
            KL
Sbjct: 863 AKL 865


>gnl|CDD|184287 PRK13735, PRK13735, conjugal transfer mating pair stabilization
           protein TraG; Provisional.
          Length = 942

 Score = 28.6 bits (64), Expect = 2.2
 Identities = 21/88 (23%), Positives = 41/88 (46%), Gaps = 7/88 (7%)

Query: 60  QGQIRNKIMERNPNIKIIEVTKLVSEEWKKIGEAEKKRLENEYAKEKEIYAQNVKKYHDE 119
           + Q+ N + E   NI   +   +  EE     + +   L+N +  + E  +QN  KY++E
Sbjct: 860 KHQVDNMVTEYEGNIGDTQ-NSIRGEE--NTVKGQYSELQNHH--KTEALSQN-NKYNEE 913

Query: 120 LT-PEAIEFLRKEKEVKKQKKEKREMKK 146
            +  E +      +E+ K+ KE ++  K
Sbjct: 914 KSAQERMPGADSPEELMKRAKEYQDKHK 941


>gnl|CDD|227061 COG4717, COG4717, Uncharacterized conserved protein [Function
           unknown].
          Length = 984

 Score = 28.3 bits (63), Expect = 2.6
 Identities = 18/105 (17%), Positives = 35/105 (33%), Gaps = 13/105 (12%)

Query: 60  QGQIRNKIMERNPNIKIIEVTKL---------VSEEW--KKIGEAEKKRLENEYAKEKEI 108
           +  +R+K        +   V  L          S +   K+   A  KRL+ +   E E 
Sbjct: 343 EADLRDKEEAAGNGFEAERVHDLRSLECMLRYQSSQRELKQTEAAYCKRLDEKRLFEDEA 402

Query: 109 YAQNVKKYHDELTPE--AIEFLRKEKEVKKQKKEKREMKKLFRET 151
             +  ++  D+        E   ++     Q  +K E+  L+   
Sbjct: 403 EEEARQRLADDEEEVRAGDEAREEKIAANSQVIDKEEVCNLYDRR 447


>gnl|CDD|234750 PRK00409, PRK00409, recombination and DNA strand exchange inhibitor
           protein; Reviewed.
          Length = 782

 Score = 28.3 bits (64), Expect = 2.7
 Identities = 12/53 (22%), Positives = 29/53 (54%), Gaps = 2/53 (3%)

Query: 92  EAEKKRLENEYAKEKEIYAQNVKKYHDELTPEAIEFL-RKEKEVKKQKKEKRE 143
           E  ++ LE + A+E E   +  +K  +EL  +  +    ++K +++ +KE ++
Sbjct: 526 EELERELE-QKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEEAEKEAQQ 577


>gnl|CDD|179174 PRK00941, PRK00941, acetyl-CoA decarbonylase/synthase complex
           subunit alpha; Validated.
          Length = 781

 Score = 28.0 bits (63), Expect = 2.9
 Identities = 16/43 (37%), Positives = 23/43 (53%), Gaps = 1/43 (2%)

Query: 114 KKYHDELTPEAIEFLRKEKEVKKQKKEKREMKKLFRETGKPKK 156
           KKY  EL  +   F+R E ++    KE+  +K L  +  KPKK
Sbjct: 724 KKYLGELPDDLELFVRTEADLPLAMKEEL-LKILEEKDWKPKK 765


>gnl|CDD|223363 COG0286, HsdM, Type I restriction-modification system
           methyltransferase subunit [Defense mechanisms].
          Length = 489

 Score = 28.2 bits (63), Expect = 2.9
 Identities = 17/79 (21%), Positives = 31/79 (39%), Gaps = 16/79 (20%)

Query: 88  KKIGEAEKKRLENEYAKEKEI----------------YAQNVKKYHDELTPEAIEFLRKE 131
           K++ E   +++ + Y + KEI                Y  N  +Y  E   E I+     
Sbjct: 401 KRLTEENIEKIVDTYREFKEIEGFSKSVSLEEIRENDYNLNPGRYVAEEEEEDIDLDELL 460

Query: 132 KEVKKQKKEKREMKKLFRE 150
           +E+ +   E  E++K   E
Sbjct: 461 QELSEIDAELAELEKELEE 479


>gnl|CDD|219279 pfam07044, DUF1329, Protein of unknown function (DUF1329).  This
           family consists of several hypothetical bacterial
           proteins of around 475 residues in length. The majority
           of family members are from Pseudomonas species but the
           family also contains sequences from Shewanella
           oneidensis and Thauera aromatica.
          Length = 368

 Score = 27.9 bits (63), Expect = 3.2
 Identities = 10/23 (43%), Positives = 14/23 (60%)

Query: 108 IYAQNVKKYHDELTPEAIEFLRK 130
           I A NV +Y D+LTP     L++
Sbjct: 2   ITAANVDQYKDKLTPGQKALLKR 24


>gnl|CDD|223529 COG0452, Dfp, Phosphopantothenoylcysteine synthetase/decarboxylase
           [Coenzyme metabolism].
          Length = 392

 Score = 27.7 bits (62), Expect = 3.6
 Identities = 13/73 (17%), Positives = 24/73 (32%), Gaps = 6/73 (8%)

Query: 107 EIYAQNVKKYHDELTPEAIEFLRKEKEVKKQKKEKREMKKLFRETGKPKKPG-NSFYLFI 165
              A NV  Y    T E ++ L+ E  V   +  + E+     + G  +           
Sbjct: 118 LAPAMNVIMYTHPATQENLQRLKSEG-VLFIEPIEGELA----DVGDGRLAEPEEIVEAA 172

Query: 166 KDNIDKKEYEGKS 178
              +   + +GK 
Sbjct: 173 LALLKTPDLKGKK 185


>gnl|CDD|227519 COG5192, BMS1, GTP-binding protein required for 40S ribosome
           biogenesis [Translation, ribosomal structure and
           biogenesis].
          Length = 1077

 Score = 27.8 bits (61), Expect = 3.7
 Identities = 20/96 (20%), Positives = 42/96 (43%), Gaps = 1/96 (1%)

Query: 77  IEVTKLVSEEWKKIGEAEKKRLENEYAKEKEIYAQNVKKYHDELTPEAIEFLRKEKEVKK 136
           IE  + + ++ +K    + +  EN    E E    +     +E + + +++   E+E   
Sbjct: 589 IEGEEELIQDDEKGNFEDLEDEENSSDNEMEESRGSSVTAENEESADEVDY-ETEREENA 647

Query: 137 QKKEKREMKKLFRETGKPKKPGNSFYLFIKDNIDKK 172
           +KKE+        E G P+K    +Y   K  I+++
Sbjct: 648 RKKEELRGNFELEERGDPEKKDVDWYTEEKRKIEEQ 683


>gnl|CDD|237177 PRK12704, PRK12704, phosphodiesterase; Provisional.
          Length = 520

 Score = 27.8 bits (63), Expect = 3.7
 Identities = 16/69 (23%), Positives = 41/69 (59%), Gaps = 2/69 (2%)

Query: 78  EVTKLVSEEWKKIGEAEKKRLENEYAKEKEIYAQNVKKYHDELTPEAIEFLRKEKEVKKQ 137
           E  K + E   ++ + EK+ L+ E   ++++  + ++K  +EL  +  E  +K++E++K+
Sbjct: 72  EFEKELRERRNELQKLEKRLLQKEENLDRKL--ELLEKREEELEKKEKELEQKQQELEKK 129

Query: 138 KKEKREMKK 146
           ++E  E+ +
Sbjct: 130 EEELEELIE 138



 Score = 26.7 bits (60), Expect = 9.4
 Identities = 23/68 (33%), Positives = 36/68 (52%), Gaps = 8/68 (11%)

Query: 87  WKKIGEAEKKRLENEYA---KEKEIYAQNVKKYHDELTPEA-IEFLRKEKEVKKQKKEKR 142
            KKI EA+ K  E E     +E +  A+ +KK   E   EA  E  +   E +K+ +E+R
Sbjct: 25  RKKIAEAKIKEAEEEAKRILEEAKKEAEAIKK---EALLEAKEEIHKLRNEFEKELRERR 81

Query: 143 -EMKKLFR 149
            E++KL +
Sbjct: 82  NELQKLEK 89


>gnl|CDD|221177 pfam11708, Slu7, Pre-mRNA splicing Prp18-interacting factor.  The
           spliceosome, an assembly of snRNAs (U1, U2, U4/U6, and
           U5) and proteins, catalyzes the excision of introns from
           pre-mRNAs in two successive trans-esterification
           reactions. Step 2 depends upon integral spliceosome
           constituents such as U5 snRNA and Prp8 and
           non-spliceosomal proteins Prp16, Slu7, Prp18, and Prp22.
           ATP hydrolysis by the DEAH-box enzyme Prp16 promotes a
           conformational change in the spliceosome that leads to
           protection of the 3'ss from targeted RNase H cleavage.
           This change, which probably reflects binding of the 3'ss
           PyAG in the catalytic centre of the spliceosome,
           requires the ordered recruitment of Slu7, Prp18, and
           Prp22 to the spliceosome. There is a close functional
           relationship between Prp8, Prp18, and Slu7, and Prp18
           interacts with Slu7, so that together they recruit Prp22
           to the spliceosome. Most members of the family carry a
           zinc-finger of the CCHC-type upstream of this domain.
          Length = 236

 Score = 27.4 bits (61), Expect = 3.9
 Identities = 15/57 (26%), Positives = 30/57 (52%), Gaps = 3/57 (5%)

Query: 91  GEAEKKRLENEYAKEKEIYAQNVKKYHDELTPEAIEFLRKEKEVKKQKKEKREMKKL 147
           GEA +     ++A E      +V   H +  P  +E LRK+++ KK++ + ++ + L
Sbjct: 127 GEALEFEELQKFAWEAAEKGGDV---HLQANPTKLELLRKKEKEKKEQLKIQKKQSL 180


>gnl|CDD|218561 pfam05340, DUF740, Protein of unknown function (DUF740).  This
           family consists of several uncharacterized plant
           proteins of unknown function.
          Length = 565

 Score = 27.7 bits (61), Expect = 4.1
 Identities = 23/107 (21%), Positives = 48/107 (44%), Gaps = 1/107 (0%)

Query: 37  QQTSQPLSIPEKPKKPLTPYFKFQGQIRNKIMERNPNIKIIEVTKLVSEEWKKIGEAEKK 96
             +  P  I  +P+KP+ P    + +   + ME +      E  K+V E+ ++  E E K
Sbjct: 118 PSSIAPPEIDPEPRKPIVPDLVLEEEEEVE-MEEDEEYYEKEPGKVVDEKSEEEEEEELK 176

Query: 97  RLENEYAKEKEIYAQNVKKYHDELTPEAIEFLRKEKEVKKQKKEKRE 143
            +++    E +    +VK         A  F +K ++ ++++K K+ 
Sbjct: 177 TMKDFIDLESQTKKPSVKDNGKSFWSAASVFSKKLQKWRQKQKLKKP 223


>gnl|CDD|106979 PHA00671, PHA00671, hypothetical protein.
          Length = 135

 Score = 26.9 bits (58), Expect = 4.2
 Identities = 21/70 (30%), Positives = 28/70 (40%), Gaps = 7/70 (10%)

Query: 91  GEAEKKRLENEYAKEKEIYAQNVKKYHDELTPEAIEFLRKEKEVKKQKKEKREMKKLFRE 150
            +AE  RL  EY K+ E Y Q  K    +L      F R     +KQ   +R +  +   
Sbjct: 48  AKAESDRLTQEYQKQTEAYQQQAKIMQQQLLDSQQSFNRAN---QKQPNSQRALSAV--- 101

Query: 151 TGKPKKPGNS 160
            G   K G S
Sbjct: 102 -GSAAKSGQS 110


>gnl|CDD|234017 TIGR02794, tolA_full, TolA protein.  TolA couples the inner
           membrane complex of itself with TolQ and TolR to the
           outer membrane complex of TolB and OprL (also called
           Pal). Most of the length of the protein consists of
           low-complexity sequence that may differ in both length
           and composition from one species to another,
           complicating efforts to discriminate TolA (the most
           divergent gene in the tol-pal system) from paralogs such
           as TonB. Selection of members of the seed alignment and
           criteria for setting scoring cutoffs are based largely
           conserved operon struction. //The Tol-Pal complex is
           required for maintaining outer membrane integrity. Also
           involved in transport (uptake) of colicins and
           filamentous DNA, and implicated in pathogenesis.
           Transport is energized by the proton motive force. TolA
           is an inner membrane protein that interacts with
           periplasmic TolB and with outer membrane porins ompC,
           phoE and lamB [Transport and binding proteins, Other,
           Cellular processes, Pathogenesis].
          Length = 346

 Score = 27.5 bits (61), Expect = 4.3
 Identities = 17/76 (22%), Positives = 29/76 (38%)

Query: 75  KIIEVTKLVSEEWKKIGEAEKKRLENEYAKEKEIYAQNVKKYHDELTPEAIEFLRKEKEV 134
           K  E  +  +EE K    AE K      A++K       +   +     A E  +K  E 
Sbjct: 112 KQAEEKQKQAEEAKAKQAAEAKAKAEAEAEKKAKEEAKKQAEEEAKAKAAAEAKKKAAEA 171

Query: 135 KKQKKEKREMKKLFRE 150
           KK+ + + + K   + 
Sbjct: 172 KKKAEAEAKAKAEAKA 187


>gnl|CDD|235772 PRK06290, PRK06290, aspartate aminotransferase; Provisional.
          Length = 410

 Score = 27.3 bits (61), Expect = 5.1
 Identities = 16/51 (31%), Positives = 26/51 (50%), Gaps = 10/51 (19%)

Query: 117 HDELTPEAIEFLRKEKEVKKQKKEKREMKKLFRETG-KPKKPGNSFYLFIK 166
           H E+T         EK  +K  +   ++ K+  E G K + PG +FYL++K
Sbjct: 298 HPEIT---------EKIREKYSRRLDKLVKILNEVGFKAEMPGGTFYLYVK 339


>gnl|CDD|223599 COG0525, ValS, Valyl-tRNA synthetase [Translation, ribosomal
           structure and biogenesis].
          Length = 877

 Score = 27.2 bits (61), Expect = 5.4
 Identities = 22/86 (25%), Positives = 34/86 (39%), Gaps = 14/86 (16%)

Query: 69  ERNPNIKIIEVTKLVSEEWKKIGEAEKKRLENEYAK-EKEIYAQNVKKYHDELTPEAIEF 127
                  +      +         AE  RLE E  K EKEI    ++K   +L+ E   F
Sbjct: 790 PVEVTAVVGGAELFLPLAGLIDLAAELARLEKELEKLEKEI--DRIEK---KLSNEG--F 842

Query: 128 LRK------EKEVKKQKKEKREMKKL 147
           + K      EKE +K  + + ++ KL
Sbjct: 843 VAKAPEEVVEKEKEKLAEYQVKLAKL 868


>gnl|CDD|220232 pfam09421, FRQ, Frequency clock protein.  The frequency clock
           protein, is the central component of the frq-based
           circadian negative feedback loop, regulates various
           aspects of the circadian clock in Neurospora crassa.
           This protein has been shown to interact with itself via
           a coiled-coil.
          Length = 989

 Score = 27.3 bits (60), Expect = 5.6
 Identities = 13/53 (24%), Positives = 21/53 (39%)

Query: 3   LFKLTTGVFTGGNVFNLRSCLMPHLNSVTTAGLKQQTSQPLSIPEKPKKPLTP 55
           L ++ TG  +G NV   +S L P +++       QQ       P +P      
Sbjct: 303 LEQIFTGKISGRNVRRTQSTLTPSVDAALPPVQAQQQEGTQMAPPQPPSNFIT 355


>gnl|CDD|216319 pfam01134, GIDA, Glucose inhibited division protein A. 
          Length = 391

 Score = 27.1 bits (61), Expect = 5.8
 Identities = 11/36 (30%), Positives = 19/36 (52%), Gaps = 3/36 (8%)

Query: 58  KFQGQIRNKIMERNPNIKII--EVTKLVSEEWKKIG 91
            +  ++   + E +PN+ +I  EVT L+ E  K  G
Sbjct: 96  LYSKEMTETL-ENHPNLTLIQGEVTDLIPENGKVKG 130


>gnl|CDD|238410 cd00795, NOS_oxygenase_euk, Nitric oxide synthase (NOS) eukaryotic
           oxygenase domain. NOS produces nitric oxide (NO) by
           catalyzing a five-electron heme-based oxidation of a
           guanidine nitrogen of L-arginine to L-citrulline via two
           successive monooxygenation reactions producing
           N(omega)-hydroxy-L-arginine (NHA) as an intermediate. In
           mammals, there are three distinct NOS isozymes: neuronal
           (nNOS or NOS-1), cytokine-inducible (iNOS or NOS-2) and
           endothelial (eNOS or NOS-3) . Nitric oxide synthases are
           homodimers. In eukaryotes, each monomer has an
           N-terminal oxygenase domain, which binds to the
           substrate L-Arg,  zinc, and to the cofactors heme and
           5.6.7.8-(6R)-tetrahydrobiopterin (BH4) . Eukaryotic
           NOS's also have a C-terminal electron supplying
           reductase region, which is homologous to cytochrome P450
           reductase and binds NADH, FAD and FMN.
          Length = 412

 Score = 26.9 bits (60), Expect = 6.4
 Identities = 11/49 (22%), Positives = 18/49 (36%), Gaps = 6/49 (12%)

Query: 110 AQNVKKYHDELTPEAIEFLR----KEKEVKKQKKEKR--EMKKLFRETG 152
                +  +EL P+A +F+       K    +    R  E+ K    TG
Sbjct: 45  RPRDGRPKEELLPQAKDFINQYYSSIKRSGSEAHLARLEEVTKEIEATG 93


>gnl|CDD|219655 pfam07946, DUF1682, Protein of unknown function (DUF1682).  The
           members of this family are all hypothetical eukaryotic
           proteins of unknown function. One member is described as
           being an adipocyte-specific protein, but no evidence of
           this was found.
          Length = 322

 Score = 26.8 bits (60), Expect = 6.6
 Identities = 16/81 (19%), Positives = 40/81 (49%), Gaps = 10/81 (12%)

Query: 62  QIRNKIMERNPNIKIIEVTKLVSEEWKKIGEAEKKRLENEYAKEKEIYAQNVKKYHDELT 121
           ++  +++      K+ +  +   E+  K   AE++R E    K++E   +  +    +L+
Sbjct: 252 KLSPEVLR-----KVDKTREEEEEKILK--AAEEERQEEAQEKKEEKKKEEREAKLAKLS 304

Query: 122 PEAIEFLRKEKEVKKQKKEKR 142
           PE     RK +E +++K+ ++
Sbjct: 305 PEEQ---RKLEEKERKKQARK 322


>gnl|CDD|114524 pfam05802, EspB, Enterobacterial EspB protein.  EspB is a
           type-III-secreted pore-forming protein of
           enteropathogenic Escherichia coli (EPEC) which is
           essential for EPEC pathogenesis. EspB is also found in
           Citrobacter rodentium.
          Length = 317

 Score = 27.0 bits (59), Expect = 6.8
 Identities = 13/42 (30%), Positives = 22/42 (52%), Gaps = 3/42 (7%)

Query: 83  VSEEWK---KIGEAEKKRLENEYAKEKEIYAQNVKKYHDELT 121
           +SE  K   +I E   K +EN      E+Y Q+V++  D++ 
Sbjct: 248 ISESVKSNHEINEQRAKSVENFQQGNLELYKQDVRRSQDDIA 289


>gnl|CDD|234084 TIGR03007, pepcterm_ChnLen, polysaccharide chain length determinant
           protein, PEP-CTERM locus subfamily.  Members of this
           protein family belong to the family of polysaccharide
           chain length determinant proteins (pfam02706). All are
           found in species that encode the PEP-CTERM/exosortase
           system predicted to act in protein sorting in a number
           of Gram-negative bacteria, and are found near the epsH
           homolog that is the putative exosortase gene [Cell
           envelope, Biosynthesis and degradation of surface
           polysaccharides and lipopolysaccharides].
          Length = 498

 Score = 26.9 bits (60), Expect = 7.0
 Identities = 11/43 (25%), Positives = 18/43 (41%), Gaps = 7/43 (16%)

Query: 92  EAEKKRLENEYAKEKEIYAQNVKKYHD-------ELTPEAIEF 127
           EAE  +L  +Y   K  Y Q + +          E+  +A+ F
Sbjct: 354 EAELTQLNRDYEVNKSNYEQLLTRRESAEVSKQMEVQDKAVSF 396


>gnl|CDD|188995 cd06456, M3A_DCP, Peptidase family M3 dipeptidyl carboxypeptidase
           (DCP).  Peptidase family M3 dipeptidyl carboxypeptidase
           (DCP; Dcp II; peptidyl dipeptidase; EC 3.4.15.5). This
           metal-binding M3A family also includes oligopeptidase A
           (OpdA; EC 3.4.24.70) enzyme. DCP cleaves dipeptides off
           the C-termini of various peptides and proteins, the
           smallest substrate being N-blocked tripeptides and
           unblocked tetrapeptides. DCP from E. coli is inhibited
           by the anti-hypertensive drug captopril, an inhibitor of
           the mammalian angiotensin converting enzyme (ACE, also
           called  peptidyl dipeptidase A). Oligopeptidase A (OpdA)
           may play a specific role in the degradation of signal
           peptides after they are released from precursor forms of
           secreted proteins. It can also cleave N-acetyl-L-Ala.
          Length = 654

 Score = 27.0 bits (61), Expect = 7.1
 Identities = 14/39 (35%), Positives = 20/39 (51%), Gaps = 3/39 (7%)

Query: 121 TPEA-IEFLRK--EKEVKKQKKEKREMKKLFRETGKPKK 156
           TPEA + FL    EK   + KKE  E++   +E G   +
Sbjct: 268 TPEAVLAFLEDLAEKAKPQAKKELAELQAFAKEEGGDDE 306


>gnl|CDD|237753 PRK14552, PRK14552, C/D box methylation guide ribonucleoprotein
           complex aNOP56 subunit; Provisional.
          Length = 414

 Score = 26.9 bits (60), Expect = 7.8
 Identities = 17/67 (25%), Positives = 29/67 (43%), Gaps = 14/67 (20%)

Query: 90  IGEAEKKRLENEYAKEKEIYAQNVKKYHDELTPEAIEFLRKEKEVKKQKKEKREMKKLFR 149
           IG+  K+ L     + KE Y +  K              +K +E K QK++K++ +K   
Sbjct: 360 IGDELKEELNKRIEEIKEKYPKPPK--------------KKREEKKPQKRKKKKKRKKKG 405

Query: 150 ETGKPKK 156
           +  K K 
Sbjct: 406 KKRKKKG 412


>gnl|CDD|114011 pfam05262, Borrelia_P83, Borrelia P83/100 protein.  This family
           consists of several Borrelia P83/P100 antigen proteins.
          Length = 489

 Score = 26.9 bits (59), Expect = 7.9
 Identities = 12/60 (20%), Positives = 24/60 (40%)

Query: 95  KKRLENEYAKEKEIYAQNVKKYHDELTPEAIEFLRKEKEVKKQKKEKREMKKLFRETGKP 154
           K+R   E AK  +   + + K   +      +    +    KQ+ E R+ ++  +   KP
Sbjct: 204 KERESQEDAKRAQQLKEELDKKQIDADKAQQKADFAQDNADKQRDEVRQKQQEAKNLPKP 263


>gnl|CDD|227038 COG4694, COG4694, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 758

 Score = 26.8 bits (59), Expect = 8.1
 Identities = 12/78 (15%), Positives = 32/78 (41%), Gaps = 2/78 (2%)

Query: 102 YAKEKEIYAQNVKKYHDELTPEAIEFLRKEK--EVKKQKKEKREMKKLFRETGKPKKPGN 159
             K+     + ++   + +  +  + ++ E   +VK Q++EK E   +        K   
Sbjct: 109 LGKKTNDDLEKIESKKESINKKNEKKIKNEASLQVKTQREEKEEKDFIADCWRNLYKKNE 168

Query: 160 SFYLFIKDNIDKKEYEGK 177
             +    +N  +K+++ K
Sbjct: 169 ERFKEYLENFKRKKFKRK 186


>gnl|CDD|191022 pfam04538, BEX, Brain expressed X-linked like family.  This is a
           family of transcription elongation factors which
           includes those referred to as Bex proteins as well as
           those named TCEAL7. Bex1 was shown to be a novel link
           between neurotrophin signalling, the cell cycle, and
           neuronal differentiation, suggesting it might function
           by coordinating internal cellular states with the
           ability of cells to respond to external signals. TCEAL7
           has been shown negatively to regulate the NF-kappaB
           pathway, hence being important in ovarian cancer as it
           one of the genes frequently downregulated in this
           cancer. A closely related protein, TFIIS/TCEA, found in
           pfam07500 is involved in transcription elongation and
           transcript fidelity. TFIIS/TCEA promotes 3'
           endoribonuclease activity of RNA polymerase II (pol II)
           and allows pol II to bypass transcript pause or 'arrest'
           during elongation process. It is thus possible that BEX
           is also acting in this way.
          Length = 97

 Score = 25.4 bits (56), Expect = 8.7
 Identities = 14/37 (37%), Positives = 20/37 (54%)

Query: 125 IEFLRKEKEVKKQKKEKREMKKLFRETGKPKKPGNSF 161
           +E   KE E K + + K E +K   E G+ KKP  +F
Sbjct: 1   MEKPCKENEGKPESEPKEEEEKRPLEEGEGKKPEGNF 37


>gnl|CDD|217318 pfam03011, PFEMP, PFEMP DBL domain.  PfEMP1 (Plasmodium falciparum
           erythrocyte membrane protein) has been identified as the
           rosetting ligand of the malaria parasite P. falciparum.
           Rosetting is the adhesion of infected erythrocytes with
           uninfected erythrocytes in the vasculature of the
           infected organ, and is associated with severe malaria.
           PfEMP1 interacts with Complement Receptor One on
           uninfected erythrocytes to form rosettes.
          Length = 150

 Score = 26.1 bits (58), Expect = 8.8
 Identities = 20/87 (22%), Positives = 35/87 (40%), Gaps = 8/87 (9%)

Query: 85  EEWKKIGEAEKKRLENEYAKEKEIYAQNVKKYHDELTPEAIEFLRKEKEVKKQKKEKREM 144
           +EW+KI    K+    +Y  +      NV +   E     I+     K+     KE +++
Sbjct: 52  KEWEKI----KEHFNKQYKIDDGRDEYNVLESFLEDLLFQIDI----KKAYGDAKELKKI 103

Query: 145 KKLFRETGKPKKPGNSFYLFIKDNIDK 171
           K+L +  G      ++     KD ID 
Sbjct: 104 KELLKCNGCNNSANSAKNGENKDAIDC 130


>gnl|CDD|214371 CHL00122, secA, preprotein translocase subunit SecA; Validated.
          Length = 870

 Score = 26.5 bits (59), Expect = 9.1
 Identities = 13/48 (27%), Positives = 21/48 (43%)

Query: 100 NEYAKEKEIYAQNVKKYHDELTPEAIEFLRKEKEVKKQKKEKREMKKL 147
                  E    ++ K   +L+   +     EK  K Q+KEK+ +KKL
Sbjct: 544 ENLKILNEASEISIPKNSYQLSLRFLYNELLEKYKKLQEKEKKIVKKL 591


>gnl|CDD|234941 PRK01315, PRK01315, putative inner membrane protein translocase
           component YidC; Provisional.
          Length = 329

 Score = 26.2 bits (58), Expect = 9.6
 Identities = 13/36 (36%), Positives = 23/36 (63%), Gaps = 6/36 (16%)

Query: 119 ELTPEAIEFLRKEKEVKKQKKEK--REMKKLFRETG 152
           E+ P+    ++K +E  K  +E+  +EM KL++ETG
Sbjct: 73  EIQPK----MKKIQEKYKGDRERMSQEMMKLYKETG 104


>gnl|CDD|179310 PRK01622, PRK01622, OxaA-like protein precursor; Validated.
          Length = 256

 Score = 26.3 bits (58), Expect = 9.6
 Identities = 8/25 (32%), Positives = 18/25 (72%)

Query: 128 LRKEKEVKKQKKEKREMKKLFRETG 152
           L+  K+++KQK+ ++EM +L++   
Sbjct: 107 LKVTKDLEKQKEYQKEMMELYKSGN 131


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.313    0.132    0.370 

Gapped
Lambda     K      H
   0.267   0.0677    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 9,246,462
Number of extensions: 871060
Number of successful extensions: 2146
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1960
Number of HSP's successfully gapped: 357
Length of query: 180
Length of database: 10,937,602
Length adjustment: 91
Effective length of query: 89
Effective length of database: 6,901,388
Effective search space: 614223532
Effective search space used: 614223532
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.9 bits)
S2: 56 (25.5 bits)