RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy11980
         (831 letters)



>gnl|CDD|214780 smart00702, P4Hc, Prolyl 4-hydroxylase alpha subunit homologues.
           Mammalian enzymes catalyse hydroxylation of collagen,
           for example. Prokaryotic enzymes might catalyse
           hydroxylation of antibiotic peptides. These are
           2-oxoglutarate-dependent dioxygenases, requiring
           2-oxoglutarate and dioxygen as cosubstrates and ferrous
           iron as a cofactor.
          Length = 165

 Score = 58.6 bits (142), Expect = 6e-10
 Identities = 27/111 (24%), Positives = 39/111 (35%), Gaps = 8/111 (7%)

Query: 715 SNIGRLISEVDAIIMRANRMVNNGRMGDFVINGRTKQNGGLLRIFPEGGGDKVAD-IEPM 773
           +   +L+ E + +  R   +       +       +Q+ G          D V + I   
Sbjct: 3   AECQKLLEEAEPLGWRG-EVTRGIGNPNETSQ--YRQSNGTW--LELLERDLVIERIRQR 57

Query: 774 FDRIL-FFWSDRRNPHEAMVACYPGHGSHYVKHVDNPNRDGRCITAIYYLN 823
               L        +  +A VA Y G G HY  HVDN     R  T I YLN
Sbjct: 58  LADFLGLLAGLPLSAEDAQVARY-GPGGHYGPHVDNFLYGDRIATFILYLN 107



 Score = 38.9 bits (91), Expect = 0.003
 Identities = 12/40 (30%), Positives = 16/40 (40%), Gaps = 2/40 (5%)

Query: 753 GGLLRIFPEGGGDKVADIEPMFDRILFFWS-DRRNPHEAM 791
           GG L          VA ++P    +LFF S   R+ H   
Sbjct: 112 GGELVFPGLRL-MVVATVKPKKGDLLFFPSGHGRSLHGVC 150


>gnl|CDD|201954 pfam01753, zf-MYND, MYND finger. 
          Length = 39

 Score = 52.0 bits (125), Expect = 6e-09
 Identities = 17/33 (51%), Positives = 20/33 (60%)

Query: 423 GEENLLRCSKCKVTFYCSKQHQTLDWKAHKINC 455
               LLRCS+CK  +YCSK+ Q  DW  HK  C
Sbjct: 7   EALKLLRCSRCKSVYYCSKECQKADWPYHKKEC 39


>gnl|CDD|177301 PHA00733, PHA00733, hypothetical protein.
          Length = 128

 Score = 46.0 bits (109), Expect = 7e-06
 Identities = 21/55 (38%), Positives = 29/55 (52%), Gaps = 2/55 (3%)

Query: 327 VKPHKCPICTKGFTMKSSLKIHLLTHTKEPPKSCEVCGRAFIRQDCLIRHLRQKH 381
           V P+ CP+C   F+   SLK H+     E  K C VCG+ F   D  + H+ +KH
Sbjct: 71  VSPYVCPLCLMPFSSSVSLKQHI--RYTEHSKVCPVCGKEFRNTDSTLDHVCKKH 123



 Score = 30.6 bits (69), Expect = 1.2
 Identities = 17/51 (33%), Positives = 27/51 (52%), Gaps = 7/51 (13%)

Query: 132 YECDICKNRFPSSGAMKKHRR--KHTGERPYECKEVCGRVFNRKDNLREHL 180
           Y C +C   F SS ++K+H R  +H+   P     VCG+ F   D+  +H+
Sbjct: 74  YVCPLCLMPFSSSVSLKQHIRYTEHSKVCP-----VCGKEFRNTDSTLDHV 119


>gnl|CDD|227381 COG5048, COG5048, FOG: Zn-finger [General function prediction
           only].
          Length = 467

 Score = 48.2 bits (114), Expect = 1e-05
 Identities = 41/168 (24%), Positives = 63/168 (37%), Gaps = 16/168 (9%)

Query: 221 TCYARFAAKETLNRHMKT--HSG--VKPHVC--EYCEKAFIQASQLKAHMFHHTGENGHT 274
            C   F+    L RH+++  HSG  +KP  C    C K F +   LK H+  HT  +   
Sbjct: 294 QCNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHILLHTSISPAK 353

Query: 275 CEL--CNKTFNRKA----RLELHMKYIHEGADPYQCD--ICRKTFIRKEDLTRHCLLHTG 326
            +L   +  F+          L      +     +     C + F R  +L+ H + H  
Sbjct: 354 EKLLNSSSKFSPLLNNEPPQSLQQYKDLKNDKKSETLSNSCIRNFKRDSNLSLHIITHLS 413

Query: 327 VKPH--KCPICTKGFTMKSSLKIHLLTHTKEPPKSCEVCGRAFIRQDC 372
            +P+  K P C+K F    +L  H   HT   P  C +        D 
Sbjct: 414 FRPYNCKNPPCSKSFNRHYNLIPHKKIHTNHAPLLCSILKSFRRDLDL 461



 Score = 43.1 bits (101), Expect = 4e-04
 Identities = 40/257 (15%), Positives = 73/257 (28%), Gaps = 20/257 (7%)

Query: 328 KPHKCPICTKGFTMKSSLKIHLLTHTKEPPKSCEVCGRA--FIRQDCLIRHLRQKHRSHL 385
           +P  CP CT  F+    L  H+ +HT E P  C   G    F R   L RHLR  H +  
Sbjct: 32  RPDSCPNCTDSFSRLEHLTRHIRSHTGEKPSQCSYSGCDKSFSRPLELSRHLRTHHNNPS 91

Query: 386 DEVLAQGDKKKLEAQLLAAAATELTSVLEDGEEEGDEGEENLLRCSKCKVTFYCSKQHQT 445
           D           +A   + +++   S   +          +               Q   
Sbjct: 92  DLNSKSLPLSNSKASSSSLSSSSSNSNDNNLLSSHSLPPSSRDP------------QLPD 139

Query: 446 LDWKAHKINCNILSNQSATNNTSLANNTPQVAQIQNQFPLSNQYFHEGSSENEIANSTVQ 505
           L   ++  N  +  N S++ NT  +N+        +     +       S N   +    
Sbjct: 140 LLSISNLRNNPLPGNNSSSVNTPQSNSLHPPLPANSLSKDPSSNLSLLISSNVSTSIPSS 199

Query: 506 YLNPSLFQNNTPNQLVDTAGTSRAPGPSNMNTQYGLYSDIPILNENNGQNNLTNMHLDTN 565
             N  L  + +            +     + T   L     +    +  ++        +
Sbjct: 200 SENSPLSSSYSIPSSSSDQNLENSSSSLPLTTNSQLSPKSLLSQSPSSLSS------SDS 253

Query: 566 VVVVPPFHHTNNDRVHS 582
                    ++     S
Sbjct: 254 SSSASESPRSSLPTASS 270



 Score = 41.6 bits (97), Expect = 0.001
 Identities = 27/92 (29%), Positives = 40/92 (43%), Gaps = 6/92 (6%)

Query: 301 PYQCDICRKTFIRKEDLTRH--CLLHTG--VKPHKCPI--CTKGFTMKSSLKIHLLTHTK 354
           P +   C  +F R   LTRH   + H+G  +KP  CP   C K F+   +LK H+L HT 
Sbjct: 289 PIKSKQCNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHILLHTS 348

Query: 355 EPPKSCEVCGRAFIRQDCLIRHLRQKHRSHLD 386
             P   ++   +      L     Q  + + D
Sbjct: 349 ISPAKEKLLNSSSKFSPLLNNEPPQSLQQYKD 380



 Score = 35.1 bits (80), Expect = 0.14
 Identities = 20/57 (35%), Positives = 28/57 (49%), Gaps = 1/57 (1%)

Query: 128 AEKLYECDICKNRFPSSGAMKKHRRKHTGERPYECKEV-CGRVFNRKDNLREHLRAH 183
           A +   C  C + F     + +H R HTGE+P +C    C + F+R   L  HLR H
Sbjct: 30  APRPDSCPNCTDSFSRLEHLTRHIRSHTGEKPSQCSYSGCDKSFSRPLELSRHLRTH 86



 Score = 34.7 bits (79), Expect = 0.21
 Identities = 38/155 (24%), Positives = 59/155 (38%), Gaps = 17/155 (10%)

Query: 43  HLCKGCYRIFTSQG-----YMLLSYKSKEEEKQICTYIKVCGRVFNRKDNLREHLRAHAG 97
              K C   F+           +++  +  +   C Y   CG++F+R D L+ H+  H  
Sbjct: 290 IKSKQCNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSL-CGKLFSRNDALKRHILLHTS 348

Query: 98  ETKRKKKFKCDKCEKQFYGL------TLLRIHERLHAEKLYECD--ICKNRFPSSGAMKK 149
            +  K+K        +F  L        L+ ++ L  +K  E     C   F     +  
Sbjct: 349 ISPAKEKL--LNSSSKFSPLLNNEPPQSLQQYKDLKNDKKSETLSNSCIRNFKRDSNLSL 406

Query: 150 HRRKHTGERPYECKE-VCGRVFNRKDNLREHLRAH 183
           H   H   RPY CK   C + FNR  NL  H + H
Sbjct: 407 HIITHLSFRPYNCKNPPCSKSFNRHYNLIPHKKIH 441



 Score = 34.7 bits (79), Expect = 0.24
 Identities = 21/132 (15%), Positives = 38/132 (28%), Gaps = 10/132 (7%)

Query: 243 KPHVCEYCEKAFIQASQLKAHMFHHTGENGHTCELCNKTFNRKARLEL--HMKYIHEGAD 300
           +P  C  C  +F +   L  H+  HTGE    C       +    LEL  H++  H    
Sbjct: 32  RPDSCPNCTDSFSRLEHLTRHIRSHTGEKPSQCSYSGCDKSFSRPLELSRHLRTHHNNPS 91

Query: 301 PYQCDICRKTFIRKEDLTRHCLL----HTGVKPHKCPICTKGFTMKSSLKIHLLTHTKEP 356
               D+  K+            L          +     +   + +      LL+ +   
Sbjct: 92  ----DLNSKSLPLSNSKASSSSLSSSSSNSNDNNLLSSHSLPPSSRDPQLPDLLSISNLR 147

Query: 357 PKSCEVCGRAFI 368
                    + +
Sbjct: 148 NNPLPGNNSSSV 159



 Score = 32.7 bits (74), Expect = 0.90
 Identities = 44/200 (22%), Positives = 69/200 (34%), Gaps = 31/200 (15%)

Query: 130 KLYECDICKNRFPSSGAMKKHRRK--HTGE--RPYEC-KEVCGRVFNRKDNLREHLRAHA 184
              +   C   F  S  + +H R   H+GE  +P+ C   +CG++F+R D L+ H+  H 
Sbjct: 288 LPIKSKQCNISFSRSSPLTRHLRSVNHSGESLKPFSCPYSLCGKLFSRNDALKRHILLHT 347

Query: 185 GETKRKKKFKCDKCEKQFYGLTLLRIHERLHGLIERTCYARFAAKETLNRHMKTHSGVKP 244
             +  K+K        +F  L      + L                      K     K 
Sbjct: 348 SISPAKEKL--LNSSSKFSPLLNNEPPQSLQ-------------------QYKDLKNDKK 386

Query: 245 HVCEY--CEKAFIQASQLKAHMFHHTGENGHTCE--LCNKTFNRKARLELHMKYIHEGAD 300
                  C + F + S L  H+  H     + C+   C+K+FNR   L  H K     A 
Sbjct: 387 SETLSNSCIRNFKRDSNLSLHIITHLSFRPYNCKNPPCSKSFNRHYNLIPHKKIHTNHA- 445

Query: 301 PYQCDICRKTFIRKEDLTRH 320
           P  C I +      +     
Sbjct: 446 PLLCSILKSFRRDLDLSNHG 465



 Score = 30.0 bits (67), Expect = 6.5
 Identities = 28/151 (18%), Positives = 45/151 (29%), Gaps = 3/151 (1%)

Query: 221 TCYARFAAKETLNRHMKTHSGVKPHVCEY--CEKAFIQASQLKAHMFHHTGENGHTCELC 278
            C   F+  E L RH+++H+G KP  C Y  C+K+F +  +L  H+  H           
Sbjct: 38  NCTDSFSRLEHLTRHIRSHTGEKPSQCSYSGCDKSFSRPLELSRHLRTHHNNPSDLNSKS 97

Query: 279 NKTFNRKARLELHMKYIHEGADPYQCDICRKTFIRKEDLTRHCLLHTGVKPHKCPICTKG 338
               N KA             D             ++      L  + ++ +  P     
Sbjct: 98  LPLSNSKASSSSLSSSSSNSNDNNLLSSHSLPPSSRDPQLPDLLSISNLRNNPLPGNNSS 157

Query: 339 FT-MKSSLKIHLLTHTKEPPKSCEVCGRAFI 368
                 S  +H         K         I
Sbjct: 158 SVNTPQSNSLHPPLPANSLSKDPSSNLSLLI 188


>gnl|CDD|222150 pfam13465, zf-H2C2_2, Zinc-finger double domain. 
          Length = 26

 Score = 38.9 bits (91), Expect = 2e-04
 Identities = 13/25 (52%), Positives = 17/25 (68%)

Query: 232 LNRHMKTHSGVKPHVCEYCEKAFIQ 256
           L RHM+TH+G KP+ C  C K+F  
Sbjct: 2   LRRHMRTHTGEKPYKCPVCGKSFSS 26



 Score = 36.6 bits (85), Expect = 0.001
 Identities = 13/27 (48%), Positives = 20/27 (74%), Gaps = 1/27 (3%)

Query: 146 AMKKHRRKHTGERPYECKEVCGRVFNR 172
            +++H R HTGE+PY+C  VCG+ F+ 
Sbjct: 1   NLRRHMRTHTGEKPYKCP-VCGKSFSS 26



 Score = 35.8 bits (83), Expect = 0.002
 Identities = 14/25 (56%), Positives = 18/25 (72%)

Query: 316 DLTRHCLLHTGVKPHKCPICTKGFT 340
           +L RH   HTG KP+KCP+C K F+
Sbjct: 1   NLRRHMRTHTGEKPYKCPVCGKSFS 25



 Score = 30.8 bits (70), Expect = 0.14
 Identities = 12/26 (46%), Positives = 17/26 (65%)

Query: 344 SLKIHLLTHTKEPPKSCEVCGRAFIR 369
           +L+ H+ THT E P  C VCG++F  
Sbjct: 1   NLRRHMRTHTGEKPYKCPVCGKSFSS 26



 Score = 30.4 bits (69), Expect = 0.18
 Identities = 11/25 (44%), Positives = 16/25 (64%)

Query: 260 LKAHMFHHTGENGHTCELCNKTFNR 284
           L+ HM  HTGE  + C +C K+F+ 
Sbjct: 2   LRRHMRTHTGEKPYKCPVCGKSFSS 26



 Score = 30.4 bits (69), Expect = 0.18
 Identities = 14/28 (50%), Positives = 16/28 (57%), Gaps = 4/28 (14%)

Query: 87  NLREHLRAHAGETKRKKKFKCDKCEKQF 114
           NLR H+R H GE    K +KC  C K F
Sbjct: 1   NLRRHMRTHTGE----KPYKCPVCGKSF 24



 Score = 30.4 bits (69), Expect = 0.18
 Identities = 14/28 (50%), Positives = 16/28 (57%), Gaps = 4/28 (14%)

Query: 175 NLREHLRAHAGETKRKKKFKCDKCEKQF 202
           NLR H+R H GE    K +KC  C K F
Sbjct: 1   NLRRHMRTHTGE----KPYKCPVCGKSF 24



 Score = 28.5 bits (64), Expect = 0.81
 Identities = 11/26 (42%), Positives = 15/26 (57%), Gaps = 1/26 (3%)

Query: 288 LELHMKYIHEGADPYQCDICRKTFIR 313
           L  HM+  H G  PY+C +C K+F  
Sbjct: 2   LRRHMR-THTGEKPYKCPVCGKSFSS 26



 Score = 26.6 bits (59), Expect = 4.1
 Identities = 11/23 (47%), Positives = 13/23 (56%), Gaps = 1/23 (4%)

Query: 120 LRIHERLH-AEKLYECDICKNRF 141
           LR H R H  EK Y+C +C   F
Sbjct: 2   LRRHMRTHTGEKPYKCPVCGKSF 24


>gnl|CDD|222280 pfam13640, 2OG-FeII_Oxy_3, 2OG-Fe(II) oxygenase superfamily.  This
           family contains members of the 2-oxoglutarate (2OG) and
           Fe(II)-dependent oxygenase superfamily.
          Length = 93

 Score = 38.2 bits (89), Expect = 0.001
 Identities = 13/38 (34%), Positives = 19/38 (50%), Gaps = 2/38 (5%)

Query: 790 AMVACYPGHGSHYVKHVDNPNRDGRCITAIYYLNRDWD 827
           A ++ Y   G     H DN ++  R +T + YLN DW 
Sbjct: 1   AGLSRYR-PGGFLGPHTDN-SKGERRVTLLLYLNDDWW 36



 Score = 36.2 bits (84), Expect = 0.007
 Identities = 12/39 (30%), Positives = 17/39 (43%), Gaps = 1/39 (2%)

Query: 753 GGLLRIFPEGGGDKVADIEPMFDRILFFWSDRRNPHEAM 791
           GG L ++P   G  V  I P   R++ F S   + H   
Sbjct: 40  GGELELYPSDDGGVVRVIAPKPGRLVLFRSG-NSLHGVT 77


>gnl|CDD|226274 COG3751, EGL-9, Predicted proline hydroxylase [Posttranslational
           modification, protein turnover, chaperones].
          Length = 252

 Score = 40.1 bits (94), Expect = 0.002
 Identities = 15/39 (38%), Positives = 21/39 (53%), Gaps = 2/39 (5%)

Query: 789 EAMVACYPGHGSHYVKHVDN-PNRDGRCITAIYYLNRDW 826
           E  +  Y   G   +KH DN  ++D R  T +YYL R+W
Sbjct: 137 EGQITVYN-PGCFLLKHDDNGRDKDIRLATYVYYLTREW 174



 Score = 28.9 bits (65), Expect = 8.9
 Identities = 14/48 (29%), Positives = 19/48 (39%), Gaps = 8/48 (16%)

Query: 753 GGLLRIFPEG-----GGDKVADIEPMFDRILFFWSDRRNPHEAM-VAC 794
           GG LR+F          D    I P+F+ ++FF S     H    V  
Sbjct: 179 GGELRLFHSLQKNNTAADSFKTIAPVFNSLVFFKSRPS--HSVHSVEE 224


>gnl|CDD|227561 COG5236, COG5236, Uncharacterized conserved protein, contains RING
           Zn-finger [General function prediction only].
          Length = 493

 Score = 35.8 bits (82), Expect = 0.10
 Identities = 34/121 (28%), Positives = 44/121 (36%), Gaps = 6/121 (4%)

Query: 84  RKDNLREHLRAHAGETKRKKKFKCDKCEKQFYGLTLLRIHERLHAEKLYECD----ICKN 139
           R   LR+H      E   K    C  C+  FY    LR H RL  E  + CD    I   
Sbjct: 200 RSSTLRDHKNGGLEEEGFKGHPLCIFCKIYFYDDDELRRHCRLRHEACHICDMVGPIRYQ 259

Query: 140 RFPSSGAMKKHRRK-HTGERPYECKEVCGRVFNRKDNLREHL-RAHAGETKRKKKFKCDK 197
            F S   ++ H R  H       C+     VF     L EHL R H    +  +  +  +
Sbjct: 260 YFKSYEDLEAHFRNAHYCCTFQTCRVGKCYVFPYHTELLEHLTRFHKVNARLSEIPRPGR 319

Query: 198 C 198
           C
Sbjct: 320 C 320



 Score = 32.7 bits (74), Expect = 0.82
 Identities = 33/143 (23%), Positives = 48/143 (33%), Gaps = 30/143 (20%)

Query: 104 KFKC--DKCEKQFYGLTLLRIHERLHAEKLYECDICKNRFPSSGAMKKHRRKHTGERPYE 161
            FKC   KC ++   L  L+ H +     +  C  C            +++    E    
Sbjct: 151 SFKCPKSKCHRRCGSLKELKKHYKAQHGFVL-CSECI----------GNKKDFWNEI--- 196

Query: 162 CKEVCGRVFNRKDNLREHLRAHAGETKRKKKFKCDKCEKQFYGLTLLRI-----HERLH- 215
                 R+F R   LR+H      E   K    C  C+  FY    LR      HE  H 
Sbjct: 197 ------RLF-RSSTLRDHKNGGLEEEGFKGHPLCIFCKIYFYDDDELRRHCRLRHEACHI 249

Query: 216 -GLIERTCYARFAAKETLNRHMK 237
             ++    Y  F + E L  H +
Sbjct: 250 CDMVGPIRYQYFKSYEDLEAHFR 272


>gnl|CDD|200998 pfam00096, zf-C2H2, Zinc finger, C2H2 type.  The C2H2 zinc finger
           is the classical zinc finger domain. The two conserved
           cysteines and histidines co-ordinate a zinc ion. The
           following pattern describes the zinc finger.
           #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C] Where X can
           be any amino acid, and numbers in brackets indicate the
           number of residues. The positions marked # are those
           that are important for the stable fold of the zinc
           finger. The final position can be either his or cys. The
           C2H2 zinc finger is composed of two short beta strands
           followed by an alpha helix. The amino terminal part of
           the helix binds the major groove in DNA binding zinc
           fingers. The accepted consensus binding sequence for Sp1
           is usually defined by the asymmetric hexanucleotide core
           GGGCGG but this sequence does not include, among others,
           the GAG (=CTC) repeat that constitutes a high-affinity
           site for Sp1 binding to the wt1 promoter.
          Length = 22

 Score = 31.2 bits (71), Expect = 0.10
 Identities = 10/22 (45%), Positives = 13/22 (59%)

Query: 303 QCDICRKTFIRKEDLTRHCLLH 324
           +C  C K+F RK +L RH   H
Sbjct: 1   KCPDCGKSFSRKSNLKRHLRTH 22



 Score = 31.2 bits (71), Expect = 0.11
 Identities = 14/22 (63%), Positives = 16/22 (72%)

Query: 331 KCPICTKGFTMKSSLKIHLLTH 352
           KCP C K F+ KS+LK HL TH
Sbjct: 1   KCPDCGKSFSRKSNLKRHLRTH 22



 Score = 29.2 bits (66), Expect = 0.48
 Identities = 9/22 (40%), Positives = 12/22 (54%)

Query: 246 VCEYCEKAFIQASQLKAHMFHH 267
            C  C K+F + S LK H+  H
Sbjct: 1   KCPDCGKSFSRKSNLKRHLRTH 22



 Score = 29.2 bits (66), Expect = 0.52
 Identities = 11/20 (55%), Positives = 14/20 (70%)

Query: 164 EVCGRVFNRKDNLREHLRAH 183
             CG+ F+RK NL+ HLR H
Sbjct: 3   PDCGKSFSRKSNLKRHLRTH 22



 Score = 29.2 bits (66), Expect = 0.53
 Identities = 11/20 (55%), Positives = 14/20 (70%)

Query: 76 KVCGRVFNRKDNLREHLRAH 95
            CG+ F+RK NL+ HLR H
Sbjct: 3  PDCGKSFSRKSNLKRHLRTH 22



 Score = 27.3 bits (61), Expect = 2.6
 Identities = 10/20 (50%), Positives = 13/20 (65%)

Query: 359 SCEVCGRAFIRQDCLIRHLR 378
            C  CG++F R+  L RHLR
Sbjct: 1   KCPDCGKSFSRKSNLKRHLR 20



 Score = 26.9 bits (60), Expect = 3.0
 Identities = 8/20 (40%), Positives = 14/20 (70%)

Query: 274 TCELCNKTFNRKARLELHMK 293
            C  C K+F+RK+ L+ H++
Sbjct: 1   KCPDCGKSFSRKSNLKRHLR 20



 Score = 26.5 bits (59), Expect = 4.6
 Identities = 7/22 (31%), Positives = 10/22 (45%)

Query: 133 ECDICKNRFPSSGAMKKHRRKH 154
           +C  C   F     +K+H R H
Sbjct: 1   KCPDCGKSFSRKSNLKRHLRTH 22


>gnl|CDD|197676 smart00355, ZnF_C2H2, zinc finger. 
          Length = 23

 Score = 30.5 bits (69), Expect = 0.19
 Identities = 11/23 (47%), Positives = 16/23 (69%)

Query: 330 HKCPICTKGFTMKSSLKIHLLTH 352
           ++CP C K F  KS+L+ H+ TH
Sbjct: 1   YRCPECGKVFKSKSALREHMRTH 23



 Score = 30.1 bits (68), Expect = 0.20
 Identities = 9/23 (39%), Positives = 10/23 (43%)

Query: 302 YQCDICRKTFIRKEDLTRHCLLH 324
           Y+C  C K F  K  L  H   H
Sbjct: 1   YRCPECGKVFKSKSALREHMRTH 23



 Score = 29.7 bits (67), Expect = 0.32
 Identities = 9/23 (39%), Positives = 12/23 (52%)

Query: 132 YECDICKNRFPSSGAMKKHRRKH 154
           Y C  C   F S  A+++H R H
Sbjct: 1   YRCPECGKVFKSKSALREHMRTH 23



 Score = 29.0 bits (65), Expect = 0.67
 Identities = 13/24 (54%), Positives = 15/24 (62%), Gaps = 1/24 (4%)

Query: 160 YECKEVCGRVFNRKDNLREHLRAH 183
           Y C   CG+VF  K  LREH+R H
Sbjct: 1   YRC-PECGKVFKSKSALREHMRTH 23



 Score = 28.6 bits (64), Expect = 0.80
 Identities = 8/21 (38%), Positives = 11/21 (52%)

Query: 273 HTCELCNKTFNRKARLELHMK 293
           + C  C K F  K+ L  HM+
Sbjct: 1   YRCPECGKVFKSKSALREHMR 21



 Score = 28.6 bits (64), Expect = 0.88
 Identities = 9/23 (39%), Positives = 11/23 (47%)

Query: 245 HVCEYCEKAFIQASQLKAHMFHH 267
           + C  C K F   S L+ HM  H
Sbjct: 1   YRCPECGKVFKSKSALREHMRTH 23



 Score = 27.4 bits (61), Expect = 2.3
 Identities = 11/20 (55%), Positives = 13/20 (65%)

Query: 76 KVCGRVFNRKDNLREHLRAH 95
            CG+VF  K  LREH+R H
Sbjct: 4  PECGKVFKSKSALREHMRTH 23



 Score = 25.9 bits (57), Expect = 7.6
 Identities = 9/23 (39%), Positives = 13/23 (56%)

Query: 105 FKCDKCEKQFYGLTLLRIHERLH 127
           ++C +C K F   + LR H R H
Sbjct: 1   YRCPECGKVFKSKSALREHMRTH 23



 Score = 25.9 bits (57), Expect = 7.6
 Identities = 9/23 (39%), Positives = 13/23 (56%)

Query: 193 FKCDKCEKQFYGLTLLRIHERLH 215
           ++C +C K F   + LR H R H
Sbjct: 1   YRCPECGKVFKSKSALREHMRTH 23


>gnl|CDD|240014 cd04643, CBS_pair_30, The CBS domain, named after human CBS, is a
           small domain originally identified in cystathionine
           beta-synthase and is subsequently found in a wide range
           of different proteins. CBS domains usually occur in
           tandem repeats. They associate to form a so-called
           Bateman domain or a CBS pair based on crystallographic
           studies in bacteria.  The CBS pair was used as a basis
           for this cd hierarchy since the human CBS proteins can
           adopt the typical core structure and form an
           intramolecular CBS pair.  The interface between the two
           CBS domains forms a cleft that is a potential ligand
           binding site. The CBS pair coexists with a variety of
           other functional domains and this has been used to help
           in its classification here.  It has been proposed that
           the CBS domain may play a regulatory role, although its
           exact function is unknown. Mutations of conserved
           residues within this domain are associated with a
           variety of human hereditary diseases, including
           congenital myotonia, idiopathic generalized epilepsy,
           hypercalciuric nephrolithiasis, and classic Bartter
           syndrome (CLC chloride channel family members),
           Wolff-Parkinson-White syndrome (gamma 2 subunit of
           AMP-activated protein kinase), retinitis pigmentosa (IMP
           dehydrogenase-1), and homocystinuria (cystathionine
           beta-synthase).
          Length = 116

 Score = 31.8 bits (73), Expect = 0.38
 Identities = 25/91 (27%), Positives = 40/91 (43%), Gaps = 21/91 (23%)

Query: 537 TQYGLYSDIPILNEN-------------NGQNNLTNMHL-DTNVVVVPPFHHTNNDRVHS 582
           T++G YS IP+L++                   L N+ L     + V    +T+   +  
Sbjct: 20  TKHG-YSAIPVLDKEGKYVGTISLTDILWKLKGLENLDLERLVDLKVIDVMNTDVPVIID 78

Query: 583 DIMIEKMCHNVIRDMDQYGLCVLDD---FLG 610
           D  IE++ H +I   DQ  L V+DD   F+G
Sbjct: 79  DADIEEILHLLI---DQPFLPVVDDDGIFIG 106


>gnl|CDD|222297 pfam13661, 2OG-FeII_Oxy_4, 2OG-Fe(II) oxygenase superfamily.  This
           family contains members of the 2-oxoglutarate (2OG) and
           Fe(II)-dependent oxygenase superfamily.
          Length = 65

 Score = 30.5 bits (69), Expect = 0.42
 Identities = 11/31 (35%), Positives = 17/31 (54%), Gaps = 1/31 (3%)

Query: 797 GHGSHYVKHVDNPNRDGRCITAIYYLNRDWD 827
             G  +  H+D    + R ++A+ YLN DWD
Sbjct: 18  RRGDFFGPHIDADPEELRDLSALLYLN-DWD 47


>gnl|CDD|227516 COG5189, SFP1, Putative transcriptional repressor regulating G2/M
           transition [Transcription / Cell division and chromosome
           partitioning].
          Length = 423

 Score = 33.5 bits (76), Expect = 0.43
 Identities = 21/84 (25%), Positives = 33/84 (39%), Gaps = 7/84 (8%)

Query: 187 TKRKKKFKCD--KCEKQFYGLTLLRIHERLHGLIERTCYARFAAKETLNRHMKTHSGVKP 244
            K  K +KC    C K++     L+ H  LHG        +     +  +     +  KP
Sbjct: 344 VKDGKPYKCPVEGCNKKYKNQNGLKYH-MLHG----HQNQKLHENPSPEKMNIFSAKDKP 398

Query: 245 HVCEYCEKAFIQASQLKAHMFHHT 268
           + CE C+K +   + LK H  H  
Sbjct: 399 YRCEVCDKRYKNLNGLKYHRKHSH 422



 Score = 33.2 bits (75), Expect = 0.59
 Identities = 17/89 (19%), Positives = 32/89 (35%), Gaps = 12/89 (13%)

Query: 262 AHMFHHTGENGHTCEL--CNKTFNRKARLELHMKYIHEGADPYQCDICRKTFIRKEDLTR 319
           + M        + C +  CNK +  +  L+ HM + H            +         +
Sbjct: 339 SRMLKVKDGKPYKCPVEGCNKKYKNQNGLKYHMLHGH----------QNQKLHENPSPEK 388

Query: 320 HCLLHTGVKPHKCPICTKGFTMKSSLKIH 348
             +     KP++C +C K +   + LK H
Sbjct: 389 MNIFSAKDKPYRCEVCDKRYKNLNGLKYH 417



 Score = 31.2 bits (70), Expect = 2.1
 Identities = 20/77 (25%), Positives = 30/77 (38%), Gaps = 21/77 (27%)

Query: 326 GVKPHKCPI--CTKGFTMKSSLKIHLL--------THTKEP-----------PKSCEVCG 364
             KP+KCP+  C K +  ++ LK H+L             P           P  CEVC 
Sbjct: 346 DGKPYKCPVEGCNKKYKNQNGLKYHMLHGHQNQKLHENPSPEKMNIFSAKDKPYRCEVCD 405

Query: 365 RAFIRQDCLIRHLRQKH 381
           + +   + L  H +  H
Sbjct: 406 KRYKNLNGLKYHRKHSH 422



 Score = 29.3 bits (65), Expect = 9.6
 Identities = 20/77 (25%), Positives = 29/77 (37%), Gaps = 21/77 (27%)

Query: 241 GVKPHVCEY--CEKAFIQASQLKAHMFH-HTGENGHT------------------CELCN 279
             KP+ C    C K +   + LK HM H H  +  H                   CE+C+
Sbjct: 346 DGKPYKCPVEGCNKKYKNQNGLKYHMLHGHQNQKLHENPSPEKMNIFSAKDKPYRCEVCD 405

Query: 280 KTFNRKARLELHMKYIH 296
           K +     L+ H K+ H
Sbjct: 406 KRYKNLNGLKYHRKHSH 422


>gnl|CDD|206065 pfam13894, zf-C2H2_4, C2H2-type zinc finger.  This family contains
           a number of divergent C2H2 type zinc fingers.
          Length = 24

 Score = 29.1 bits (65), Expect = 0.60
 Identities = 9/23 (39%), Positives = 14/23 (60%)

Query: 302 YQCDICRKTFIRKEDLTRHCLLH 324
           ++C +C K+F  K+ L RH   H
Sbjct: 1   FKCPLCGKSFSSKDALKRHLRKH 23



 Score = 28.0 bits (62), Expect = 1.2
 Identities = 10/23 (43%), Positives = 15/23 (65%)

Query: 132 YECDICKNRFPSSGAMKKHRRKH 154
           ++C +C   F S  A+K+H RKH
Sbjct: 1   FKCPLCGKSFSSKDALKRHLRKH 23



 Score = 28.0 bits (62), Expect = 1.3
 Identities = 9/24 (37%), Positives = 14/24 (58%)

Query: 273 HTCELCNKTFNRKARLELHMKYIH 296
             C LC K+F+ K  L+ H++  H
Sbjct: 1   FKCPLCGKSFSSKDALKRHLRKHH 24



 Score = 27.6 bits (61), Expect = 2.1
 Identities = 12/23 (52%), Positives = 15/23 (65%)

Query: 330 HKCPICTKGFTMKSSLKIHLLTH 352
            KCP+C K F+ K +LK HL  H
Sbjct: 1   FKCPLCGKSFSSKDALKRHLRKH 23



 Score = 26.8 bits (59), Expect = 3.8
 Identities = 11/24 (45%), Positives = 17/24 (70%), Gaps = 1/24 (4%)

Query: 160 YECKEVCGRVFNRKDNLREHLRAH 183
           ++C  +CG+ F+ KD L+ HLR H
Sbjct: 1   FKCP-LCGKSFSSKDALKRHLRKH 23



 Score = 26.4 bits (58), Expect = 5.6
 Identities = 11/22 (50%), Positives = 16/22 (72%)

Query: 360 CEVCGRAFIRQDCLIRHLRQKH 381
           C +CG++F  +D L RHLR+ H
Sbjct: 3   CPLCGKSFSSKDALKRHLRKHH 24



 Score = 26.1 bits (57), Expect = 7.6
 Identities = 10/20 (50%), Positives = 14/20 (70%)

Query: 76 KVCGRVFNRKDNLREHLRAH 95
           +CG+ F+ KD L+ HLR H
Sbjct: 4  PLCGKSFSSKDALKRHLRKH 23


>gnl|CDD|178473 PLN02885, PLN02885, nicotinate phosphoribosyltransferase.
          Length = 545

 Score = 32.3 bits (74), Expect = 1.1
 Identities = 16/46 (34%), Positives = 25/46 (54%), Gaps = 4/46 (8%)

Query: 23  NRTRGLLNEDDAVGEKTVPGHLCKGCYRIFTSQGYMLLSYKSKEEE 68
            + R  L+ED  V + T+P    K CYR++  +GY L+   + E E
Sbjct: 401 GQPRIKLSED--VEKVTIPC--KKRCYRLYGKEGYPLVDLMTGENE 442


>gnl|CDD|223571 COG0497, RecN, ATPase involved in DNA repair [DNA replication,
           recombination, and repair].
          Length = 557

 Score = 32.6 bits (75), Expect = 1.1
 Identities = 20/116 (17%), Positives = 45/116 (38%), Gaps = 5/116 (4%)

Query: 375 RHLRQKHRSHLDEVLAQGDKKKLEAQLLAAAATELTSV-LEDGEEEGDEGEENLLRCSKC 433
           +  +Q  R  L+++  +  ++   A LL     EL  + L+ GE+E  E E   L  +  
Sbjct: 164 QAWKQA-RRELEDLQEKERERAQRADLLQFQLEELEELNLQPGEDEELEEERKRL-SNSE 221

Query: 434 KVTFYCSKQHQTLDWKAHKINCNILSNQSATNNTSLANNTPQVAQIQNQFPLSNQY 489
           K+        + L  +   ++   L  ++      L+    +++++     L    
Sbjct: 222 KLAEAIQNALELLSGEDDTVSALSLLGRALEALEDLSEYDGKLSELAEL--LEEAL 275


>gnl|CDD|226202 COG3677, COG3677, Transposase and inactivated derivatives [DNA
           replication, recombination, and repair].
          Length = 129

 Score = 30.8 bits (70), Expect = 1.2
 Identities = 25/115 (21%), Positives = 34/115 (29%), Gaps = 23/115 (20%)

Query: 111 EKQFYGLTLLRIHERLHAEKLYECDICK--NRFPSSGAMKKHRRKHTGERPYECKEVCGR 168
           + +   L       R+   K+  C  CK  N     G  + H+R       Y+CK  CG 
Sbjct: 11  KLKSIALADAAYAIRMQITKVN-CPRCKSSNVVKIGGIRRGHQR-------YKCKS-CGS 61

Query: 169 VFNRKDNLREHLRAHAGETKRKKKFKCDKCEKQFYGLTLLR--IHERLHGLIERT 221
            F  +           G    K  +K        Y L L    I   L   I   
Sbjct: 62  TFTVET----------GSPLSKALYKIKLQAVTLYMLGLGIRDIARTLGISINTV 106


>gnl|CDD|180567 PRK06452, sdhA, succinate dehydrogenase flavoprotein subunit;
           Reviewed.
          Length = 566

 Score = 31.8 bits (72), Expect = 1.6
 Identities = 23/68 (33%), Positives = 34/68 (50%), Gaps = 10/68 (14%)

Query: 59  LLSYKSKEEEKQICTYIKVCGRVFNRKDNLREHLRAHAGETKRKKKFKCDKCEKQFYGLT 118
           LLS KS E    I   ++  G +FNR+ + R  +R   G+T  + +F  DK      G+ 
Sbjct: 87  LLSNKSGE----IVMLLERWGALFNRQPDGRVAVRYFGGQTYPRTRFVGDKT-----GMA 137

Query: 119 LLR-IHER 125
           LL  + ER
Sbjct: 138 LLHTLFER 145


>gnl|CDD|238371 cd00729, rubredoxin_SM, Rubredoxin, Small Modular nonheme iron
           binding domain containing a [Fe(SCys)4] center, present
           in rubrerythrin and nigerythrin and detected either N-
           or C-terminal to such proteins as flavin reductase,
           NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin
           reductase. In rubredoxin, the iron atom is coordinated
           by four cysteine residues (Fe(S-Cys)4), and  believed to
           be involved in electron transfer. Rubrerythrins and
           nigerythrins are small homodimeric proteins, generally
           consisting of 2 domains: a rubredoxin domain C-terminal
           to a non-sulfur, oxo-bridged diiron site in the
           N-terminal rubrerythrin domain. Rubrerythrins and
           nigerythrins have putative peroxide activity.
          Length = 34

 Score = 27.9 bits (63), Expect = 1.9
 Identities = 9/35 (25%), Positives = 15/35 (42%), Gaps = 12/35 (34%)

Query: 330 HKCPICTKGFTMKSSLKIHLLTHTKEPPKSCEVCG 364
             CP+C  G+  +           +E P+ C +CG
Sbjct: 3   WVCPVC--GYIHEG----------EEAPEKCPICG 25


>gnl|CDD|204841 pfam12171, zf-C2H2_jaz, Zinc-finger double-stranded RNA-binding.
           This domain family is found in archaea and eukaryotes,
           and is approximately 30 amino acids in length. The
           mammalian members of this group occur multiple times
           along the protein, joined by flexible linkers, and are
           referred to as JAZ - dsRNA-binding ZF protein -
           zinc-fingers. The JAZ proteins are expressed in all
           tissues tested and localise in the nucleus, particularly
           the nucleolus. JAZ preferentially binds to
           double-stranded (ds) RNA or RNA/DNA hybrids rather than
           DNA. In addition to binding double-stranded RNA, these
           zinc-fingers are required for nucleolar localisation.
          Length = 27

 Score = 27.5 bits (62), Expect = 1.9
 Identities = 8/21 (38%), Positives = 11/21 (52%)

Query: 273 HTCELCNKTFNRKARLELHMK 293
             C  C+K F  +  LE H+K
Sbjct: 2   FYCVACDKYFKSENALENHLK 22



 Score = 26.4 bits (59), Expect = 5.5
 Identities = 6/21 (28%), Positives = 10/21 (47%)

Query: 244 PHVCEYCEKAFIQASQLKAHM 264
              C  C+K F   + L+ H+
Sbjct: 1   QFYCVACDKYFKSENALENHL 21


>gnl|CDD|181996 PRK09619, flgD, flagellar basal body rod modification protein;
           Reviewed.
          Length = 218

 Score = 30.9 bits (70), Expect = 2.0
 Identities = 12/29 (41%), Positives = 12/29 (41%), Gaps = 2/29 (6%)

Query: 459 SNQSATNNTSLANN--TPQVAQIQNQFPL 485
                     L N   T  VAQIQNQ PL
Sbjct: 19  PEGGLPGAEQLNNQFMTLLVAQIQNQDPL 47


>gnl|CDD|205121 pfam12874, zf-met, Zinc-finger of C2H2 type.  This is a zinc-finger
           domain with the CxxCx(12)Hx(6)H motif, found in multiple
           copies in a wide range of proteins from plants to
           metazoans. Some member proteins, particularly those from
           plants, are annotated as being RNA-binding.
          Length = 25

 Score = 27.5 bits (62), Expect = 2.0
 Identities = 9/21 (42%), Positives = 15/21 (71%)

Query: 273 HTCELCNKTFNRKARLELHMK 293
             CELCN TF  +++L+ H++
Sbjct: 1   FYCELCNVTFTSESQLKSHLR 21


>gnl|CDD|206083 pfam13912, zf-C2H2_6, C2H2-type zinc finger. 
          Length = 27

 Score = 27.6 bits (62), Expect = 2.1
 Identities = 7/24 (29%), Positives = 13/24 (54%)

Query: 131 LYECDICKNRFPSSGAMKKHRRKH 154
           ++ C +C   F S  A+  H++ H
Sbjct: 1   VHTCGVCGKTFSSLQALGGHKKSH 24



 Score = 26.4 bits (59), Expect = 4.2
 Identities = 8/24 (33%), Positives = 13/24 (54%), Gaps = 1/24 (4%)

Query: 160 YECKEVCGRVFNRKDNLREHLRAH 183
           + C  VCG+ F+    L  H ++H
Sbjct: 2   HTCG-VCGKTFSSLQALGGHKKSH 24


>gnl|CDD|131199 TIGR02144, LysX_arch, Lysine biosynthesis enzyme LysX.  The family
           of proteins found in this equivalog include the
           characterized LysX from Thermus thermophilus which is
           part of a well-organized lysine biosynthesis gene
           cluster. LysX is believed to carry out an ATP-dependent
           acylation of the amino group of alpha-aminoadipate in
           the prokaryotic version of the fungal AAA lysine
           biosynthesis pathway. No species having a sequence in
           this equivalog contains the elements of the more common
           diaminopimelate lysine biosythesis pathway, and none has
           been shown to be a lysine auxotroph. These sequences
           have mainly recieved the name of the related enzyme,
           "ribosomal protein S6 modification protein RimK". RimK
           has been characterized in E. coli, and acts by
           ATP-dependent condensation of S6 with glutamate
           residues.
          Length = 280

 Score = 30.8 bits (70), Expect = 2.3
 Identities = 11/39 (28%), Positives = 17/39 (43%), Gaps = 8/39 (20%)

Query: 796 PGHGSHYVK-HVDNPNRDGRC-------ITAIYYLNRDW 826
             H   Y++ +++ P RD R        I AIY  +  W
Sbjct: 160 SQHKLFYIQEYINKPGRDIRVFVIGDEAIAAIYRYSNHW 198


>gnl|CDD|197394 cd10159, CsoR-like_DUF156_2, Uncharacterized family 2; belongs to a
           superfamily containing transcriptional regulators CsoR
           (copper-sensitive operon repressor), RcnR, and FrmR, and
           related domains; this family was previously known as
           part of DUF156.  Uncharacterized family 2, belonging to
           a larger superfamily that contains various
           transcriptional regulators that respond to different
           stressors such as Cu(I), Ni(I), sulfite, and
           formaldehyde, and includes CsoRs (copper-sensitive
           operon repressors). CsoRs form homotetramers (dimer of
           dimers). In Mycobacterium tuberculosis CsoR, within each
           dimer, two Cys residues on opposite subunits, along with
           a His residue, bind the Cu(I) ion (forming a triagonal
           S2N coordination complex, C-H-C). These residues are
           conserved in the majority of members of this
           superfamily, including this family, and a conserved Tyr
           and a Glu residue that facilitate allosteric regulation
           of DNA binding for CsoRs are also conserved.
          Length = 82

 Score = 28.7 bits (65), Expect = 2.9
 Identities = 18/57 (31%), Positives = 27/57 (47%), Gaps = 8/57 (14%)

Query: 689 KSTNKQDLKTIRGDQITWI-----DGRETYCSNIGRLISEVDAIIMRANRMVNNGRM 740
           K    + LKT RG QI  I     + R  YC +I   I    A++ +AN+ + +  M
Sbjct: 2   KKKALRLLKTARG-QIDGIIKMIEEDR--YCIDISNQILATQALLKKANKEILSAHM 55


>gnl|CDD|219584 pfam07800, DUF1644, Protein of unknown function (DUF1644).  This
           family consists of sequences found in a number of
           hypothetical plant proteins of unknown function. The
           region of interest contains nine highly conserved
           cysteine residues and is approximately 160 amino acids
           in length, and is probably a zinc-binding domain.
          Length = 148

 Score = 29.6 bits (67), Expect = 3.0
 Identities = 24/104 (23%), Positives = 40/104 (38%), Gaps = 23/104 (22%)

Query: 296 HEGADPYQCDI------C----RKTFIRKEDLTRHCLLHTGVKPHKCPIC---TKGFTMK 342
            +G  PY CD       C    +K + +K +  +        +   CP+C    KG+T+ 
Sbjct: 24  DKGCRPYMCDTSHRHSNCLDQFKKAYSKKRNSGKDSEKEDSTELT-CPLCRGEVKGWTVV 82

Query: 343 SSLKIHLLTHTKEPPKSCEVCGRAFI---RQDCLIRHLRQKHRS 383
              +  L        +SC   G +F    R+  L +H R +H  
Sbjct: 83  EPARRFL----NSKKRSCMQEGCSFSGTYRE--LRKHARSEHPG 120


>gnl|CDD|217403 pfam03171, 2OG-FeII_Oxy, 2OG-Fe(II) oxygenase superfamily.  This
           family contains members of the 2-oxoglutarate (2OG) and
           Fe(II)-dependent oxygenase superfamily. This family
           includes the C-terminal of prolyl 4-hydroxylase alpha
           subunit. The holoenzyme has the activity EC:1.14.11.2
           catalyzing the reaction: Procollagen L-proline +
           2-oxoglutarate + O2 <=> procollagen trans-
           4-hydroxy-L-proline + succinate + CO2. The full enzyme
           consists of a alpha2 beta2 complex with the alpha
           subunit contributing most of the parts of the active
           site. The family also includes lysyl hydrolases,
           isopenicillin synthases and AlkB.
          Length = 96

 Score = 28.9 bits (65), Expect = 3.2
 Identities = 9/39 (23%), Positives = 12/39 (30%), Gaps = 6/39 (15%)

Query: 786 NPHEAMVACYP-GHGSHYVKHVDNPNRDGRCITAIYYLN 823
            P   +V  YP   G     H D+       IT +    
Sbjct: 1   TPEGLLVNYYPPDLGLGLGPHTDD-----SIITILLQDG 34


>gnl|CDD|221755 pfam12756, zf-C2H2_2, C2H2 type zinc-finger (2 copies).  This
           family contains two copies of a C2H2-like zinc finger
           domain.
          Length = 100

 Score = 28.4 bits (64), Expect = 5.1
 Identities = 14/70 (20%), Positives = 18/70 (25%), Gaps = 23/70 (32%)

Query: 247 CEYCEKAFIQASQLKAHMFHHTG-----------------------ENGHTCELCNKTFN 283
           C +C        +   HMF   G                         G+ C  C K F 
Sbjct: 2   CLFCNHTSDTVEENLEHMFKSHGFFIPEREYLVDLEGLLNYLREKIHEGNECLYCGKQFK 61

Query: 284 RKARLELHMK 293
               L  HM+
Sbjct: 62  SLEALRQHMR 71


>gnl|CDD|165133 PHA02768, PHA02768, hypothetical protein; Provisional.
          Length = 55

 Score = 26.5 bits (58), Expect = 7.8
 Identities = 11/26 (42%), Positives = 18/26 (69%), Gaps = 1/26 (3%)

Query: 360 CEVCGRAFIRQDCLIRHLRQKHRSHL 385
           C +CG  +I++  +I HLR KH ++L
Sbjct: 8   CPICGEIYIKRKSMITHLR-KHNTNL 32


>gnl|CDD|222400 pfam13824, zf-Mss51, Zinc-finger of mitochondrial splicing
           suppressor 51.  Mss51 regulates the expression of
           cytochrome oxidase, so this domain is probably
           DNA-binding.
          Length = 55

 Score = 26.6 bits (59), Expect = 7.9
 Identities = 9/29 (31%), Positives = 16/29 (55%)

Query: 430 CSKCKVTFYCSKQHQTLDWKAHKINCNIL 458
           C  C +  +CS++H   D++ H+  C  L
Sbjct: 17  CPDCGIPTHCSEEHWEDDYEEHRELCETL 45


>gnl|CDD|221190 pfam11727, ISG65-75, Invariant surface glycoprotein.  This family
           is found in Trypanosome species, and appears to be one
           of two invariant surface glycoproteins, ISG65 and ISG75.
           that are found in the mammalian stage of the parasitic
           protozoan. the sequence suggests the two families are
           polypeptides with N-terminal signal sequences,
           hydrophilic extracellular domains, single trans-membrane
           alpha-helices and short cytoplasmic domains. they are
           both expressed in the bloodstream form but not in the
           midgut stage. Both polypeptides are distributed over the
           entire surface of the parasite.
          Length = 286

 Score = 29.4 bits (66), Expect = 8.1
 Identities = 13/53 (24%), Positives = 27/53 (50%), Gaps = 1/53 (1%)

Query: 373 LIRHLRQKHRSHLDEVLAQGDKKKLEAQLLAAAATELTSVLEDGEEEGDEGEE 425
            +++  +K  S  D  L+  DKKK+  ++   A  ++   L + ++ G E E+
Sbjct: 77  RVKYYLEKLESRSDSKLSDSDKKKI-KEICEKAKDKIDEALPEAKKAGREAEK 128


>gnl|CDD|235514 PRK05580, PRK05580, primosome assembly protein PriA; Validated.
          Length = 679

 Score = 29.4 bits (67), Expect = 9.0
 Identities = 11/37 (29%), Positives = 18/37 (48%), Gaps = 3/37 (8%)

Query: 331 KCPICTKGFTM---KSSLKIHLLTHTKEPPKSCEVCG 364
           +CP C    T+   +  L+ H   + +  PK+C  CG
Sbjct: 392 ECPHCDASLTLHRFQRRLRCHHCGYQEPIPKACPECG 428


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.321    0.136    0.425 

Gapped
Lambda     K      H
   0.267   0.0691    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 41,511,200
Number of extensions: 4009251
Number of successful extensions: 3548
Number of sequences better than 10.0: 1
Number of HSP's gapped: 3524
Number of HSP's successfully gapped: 107
Length of query: 831
Length of database: 10,937,602
Length adjustment: 105
Effective length of query: 726
Effective length of database: 6,280,432
Effective search space: 4559593632
Effective search space used: 4559593632
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 63 (28.1 bits)