RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy7068
         (487 letters)



>gnl|CDD|197477 smart00027, EH, Eps15 homology domain.  Pair of EF hand motifs that
           recognise proteins containing Asn-Pro-Phe (NPF)
           sequences.
          Length = 96

 Score =  124 bits (314), Expect = 1e-34
 Identities = 45/96 (46%), Positives = 63/96 (65%)

Query: 273 EWAVPHASKLKYTQLFNTTDRTRSGFLSGPQARNIMVATGLSQGILAQIWNLADMDSDGQ 332
            WA+    K KY Q+F + D+ + G ++G QA+ I++ +GL Q +LA+IWNLAD+D+DG+
Sbjct: 1   PWAISPEDKAKYEQIFRSLDKNQDGTVTGAQAKPILLKSGLPQTLLAKIWNLADIDNDGE 60

Query: 333 LSCDEFVLAMHLCDLAKGGEKIPVPLPIDMIPPAFR 368
           L  DEF LAMHL      G  IP  LP  +IPP+ R
Sbjct: 61  LDKDEFALAMHLIYRKLNGYPIPASLPPSLIPPSKR 96



 Score =  103 bits (260), Expect = 6e-27
 Identities = 39/93 (41%), Positives = 60/93 (64%), Gaps = 1/93 (1%)

Query: 385 PWNVSRHERTRFETHFQAMKP-VNGIVTGDQVKGFLMQSGLPVATLGQIWSLADTDQDGK 443
           PW +S  ++ ++E  F+++    +G VTG Q K  L++SGLP   L +IW+LAD D DG+
Sbjct: 1   PWAISPEDKAKYEQIFRSLDKNQDGTVTGAQAKPILLKSGLPQTLLAKIWNLADIDNDGE 60

Query: 444 MNINEFVIACKLISNKLRGFDIPPTLPVSLMQS 476
           ++ +EF +A  LI  KL G+ IP +LP SL+  
Sbjct: 61  LDKDEFALAMHLIYRKLNGYPIPASLPPSLIPP 93


>gnl|CDD|238009 cd00052, EH, Eps15 homology domain; found in proteins implicated in
           endocytosis, vesicle transport, and signal transduction.
           The alignment contains a pair of EF-hand motifs,
           typically one of them is canonical and binds to Ca2+,
           while the other may not bind to Ca2+. A hydrophobic
           binding pocket is formed by residues from both EF-hand
           motifs. The EH domain binds to proteins containing NPF
           (class I), [WF]W or SWG (class II), or H[TS]F (class
           III) sequence motifs.
          Length = 67

 Score =   99 bits (250), Expect = 6e-26
 Identities = 31/66 (46%), Positives = 43/66 (65%)

Query: 284 YTQLFNTTDRTRSGFLSGPQARNIMVATGLSQGILAQIWNLADMDSDGQLSCDEFVLAMH 343
           Y Q+F + D    G +SG +AR  +  +GL + +LAQIW+LAD D DG+L  +EF +AMH
Sbjct: 1   YDQIFRSLDPDGDGLISGDEARPFLGKSGLPRSVLAQIWDLADTDKDGKLDKEEFAIAMH 60

Query: 344 LCDLAK 349
           L  LA 
Sbjct: 61  LIALAL 66



 Score = 88.0 bits (219), Expect = 1e-21
 Identities = 30/67 (44%), Positives = 46/67 (68%), Gaps = 1/67 (1%)

Query: 396 FETHFQAMKPVN-GIVTGDQVKGFLMQSGLPVATLGQIWSLADTDQDGKMNINEFVIACK 454
           ++  F+++ P   G+++GD+ + FL +SGLP + L QIW LADTD+DGK++  EF IA  
Sbjct: 1   YDQIFRSLDPDGDGLISGDEARPFLGKSGLPRSVLAQIWDLADTDKDGKLDKEEFAIAMH 60

Query: 455 LISNKLR 461
           LI+  L 
Sbjct: 61  LIALALN 67


>gnl|CDD|193239 pfam12763, efhand_3, Cytoskeletal-regulatory complex EF hand.  This
           is an efhand family from the N-terminal of actin
           cytoskeleton-regulatory complex END3 and similar
           proteins from fungi and closely related species.
          Length = 104

 Score = 76.3 bits (188), Expect = 4e-17
 Identities = 34/73 (46%), Positives = 43/73 (58%), Gaps = 2/73 (2%)

Query: 400 FQAMKPVNGIVTGDQVKGFLMQSGLPVATLGQIWSLADTDQDGKMNINEFVIACKLISNK 459
           F  +KP N  +TGDQV   L  S LP   L +IW LAD D DGK++  EF IA +LI + 
Sbjct: 16  FSGLKPENNKLTGDQVSPVLKNSRLPDDQLAKIWDLADIDSDGKLDFEEFCIAMRLIFDL 75

Query: 460 LRGF--DIPPTLP 470
           + G   D+P  LP
Sbjct: 76  VNGNIADVPDELP 88



 Score = 60.5 bits (147), Expect = 2e-11
 Identities = 31/84 (36%), Positives = 48/84 (57%), Gaps = 3/84 (3%)

Query: 283 KYTQLFNTTDRTRSGFLSGPQARNIMVATGLSQGILAQIWNLADMDSDGQLSCDEFVLAM 342
           KY ++F+      +  L+G Q   ++  + L    LA+IW+LAD+DSDG+L  +EF +AM
Sbjct: 11  KYWEIFSGLK-PENNKLTGDQVSPVLKNSRLPDDQLAKIWDLADIDSDGKLDFEEFCIAM 69

Query: 343 HLC-DLAKGGEK-IPVPLPIDMIP 364
            L  DL  G    +P  LP  ++P
Sbjct: 70  RLIFDLVNGNIADVPDELPDWLVP 93


>gnl|CDD|223021 PHA03247, PHA03247, large tegument protein UL36; Provisional.
          Length = 3151

 Score = 41.1 bits (96), Expect = 0.001
 Identities = 25/78 (32%), Positives = 31/78 (39%), Gaps = 4/78 (5%)

Query: 146  PLSTSIVSPPNTVAPPSSTTPLIPGITSPPGTLQGGITSPVSIIPGATSPPNVPAAITKL 205
             L++    PP    P  +   L+     PPG       SP    P A +PP VPA     
Sbjct: 2694 SLTSLADPPPPPPTPEPAPHALVSATPLPPGPAAARQASPAL--PAAPAPPAVPAG--PA 2749

Query: 206  TPPGPLTPDKPPAPVESP 223
            TP GP  P +PP     P
Sbjct: 2750 TPGGPARPARPPTTAGPP 2767



 Score = 38.0 bits (88), Expect = 0.012
 Identities = 17/72 (23%), Positives = 22/72 (30%), Gaps = 1/72 (1%)

Query: 153  SPPNTVAPPSSTTPLIPGITSPPGTLQGGITSPVSIIPGATSPPNVPAAITKLTPPGP-L 211
             PP T  PP+   P  P    P    +  + S         SP +       +  P   L
Sbjct: 2759 RPPTTAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAAL 2818

Query: 212  TPDKPPAPVESP 223
             P   PA    P
Sbjct: 2819 PPAASPAGPLPP 2830



 Score = 35.3 bits (81), Expect = 0.099
 Identities = 23/99 (23%), Positives = 33/99 (33%), Gaps = 23/99 (23%)

Query: 142  SGPSPLSTSIVSPPNTVAPPSSTTPLIPGITSPPGTLQGGITSPVSIIPG--------AT 193
              P+       SP   + PP+S  P  P     P      +    S+ PG        + 
Sbjct: 2812 LAPAAALPPAASPAGPLPPPTSAQPTAPPPPPGP--PPPSLPLGGSVAPGGDVRRRPPSR 2869

Query: 194  SPPNVPAA-------------ITKLTPPGPLTPDKPPAP 219
            SP   PAA             +++ T    L PD+P  P
Sbjct: 2870 SPAAKPAAPARPPVRRLARPAVSRSTESFALPPDQPERP 2908



 Score = 35.3 bits (81), Expect = 0.100
 Identities = 24/95 (25%), Positives = 35/95 (36%), Gaps = 4/95 (4%)

Query: 140  GGSGPSPLSTSIVSPPNTVAPP--SSTTPLIPGITSP--PGTLQGGITSPVSIIPGATSP 195
            G   P+P +     PP  +  P  +S +     + SP  P      + +P + +P A SP
Sbjct: 2765 GPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASP 2824

Query: 196  PNVPAAITKLTPPGPLTPDKPPAPVESPWGAFING 230
                   T   P  P  P  PP P     G+   G
Sbjct: 2825 AGPLPPPTSAQPTAPPPPPGPPPPSLPLGGSVAPG 2859



 Score = 34.9 bits (80), Expect = 0.13
 Identities = 24/102 (23%), Positives = 32/102 (31%), Gaps = 12/102 (11%)

Query: 137  VSGGGSGPSPLSTSIVSPPNT---VAPPSSTTPLIPGITSPPGTLQGGITSPVSI---IP 190
            VS     P P +    SP        P     P  PG  + P         P       P
Sbjct: 2716 VSATPLPPGPAAARQASPALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPAPAPPAAP 2775

Query: 191  GATSPPNVP------AAITKLTPPGPLTPDKPPAPVESPWGA 226
             A  P  +        + ++ + P P  P  PPA V +P  A
Sbjct: 2776 AAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAA 2817



 Score = 32.6 bits (74), Expect = 0.56
 Identities = 21/89 (23%), Positives = 25/89 (28%), Gaps = 3/89 (3%)

Query: 144  PSPLSTSIVSPPNTVAPPSSTTPLIPGITSPPGTLQGGITSPVSIIPGATSPPNVPAAIT 203
              P    +  P   V+  + +  L P     P   Q     P         P   P    
Sbjct: 2879 ARPPVRRLARPA--VSRSTESFALPPDQPERPPQPQAP-PPPQPQPQPPPPPQPQPPPPP 2935

Query: 204  KLTPPGPLTPDKPPAPVESPWGAFINGWL 232
               P  PL P   PA    P GA    WL
Sbjct: 2936 PPRPQPPLAPTTDPAGAGEPSGAVPQPWL 2964



 Score = 31.4 bits (71), Expect = 1.3
 Identities = 25/85 (29%), Positives = 32/85 (37%), Gaps = 11/85 (12%)

Query: 144  PSPLSTSIVSPPNTVAPPSSTTPLIPGITSPPGTLQGGITSPVSIIPGATSPPNVPA--- 200
              P  TS    P+  APP S  P  P      G  +G   +P S +P  T  P+ P    
Sbjct: 2578 SEPAVTSRARRPD--APPQSARPRAPV--DDRGDPRG--PAPPSPLPPDTHAPDPPPPSP 2631

Query: 201  --AITKLTPPGPLTPDKPPAPVESP 223
              A  +  P  P T   P  P + P
Sbjct: 2632 SPAANEPDPHPPPTVPPPERPRDDP 2656



 Score = 31.1 bits (70), Expect = 1.8
 Identities = 23/82 (28%), Positives = 30/82 (36%), Gaps = 12/82 (14%)

Query: 142  SGPSPLSTSIVSPPNTVAPPSSTTPLIPGITSPPGTLQGGITSPVSIIPGATSPPNVPAA 201
              P  L ++   PP   A   ++ P +P   +PP    G  T      PG  + P  P  
Sbjct: 2710 PAPHALVSATPLPPGPAAARQAS-PALPAAPAPPAVPAGPAT------PGGPARPARPP- 2761

Query: 202  ITKLTPPGPLTPDKPPAPVESP 223
                T  GP  P  P AP   P
Sbjct: 2762 ----TTAGPPAPAPPAAPAAGP 2779



 Score = 29.1 bits (65), Expect = 7.2
 Identities = 15/70 (21%), Positives = 19/70 (27%), Gaps = 1/70 (1%)

Query: 154 PPNTVAPPSSTTPLIPGITSPPGTLQGGITSPVSIIPGATSPPN-VPAAITKLTPPGPLT 212
            P   + P+          +P     GG        P   S P   P  +    PP P T
Sbjct: 374 HPKRASLPTRKRRSARHAATPFARGPGGDDQTRPAAPVPASVPTPAPTPVPASAPPPPAT 433

Query: 213 PDKPPAPVES 222
           P     P   
Sbjct: 434 PLPSAEPGSD 443


>gnl|CDD|200946 pfam00036, efhand, EF hand.  The EF-hands can be divided into two
           classes: signaling proteins and buffering/transport
           proteins. The first group is the largest and includes
           the most well-known members of the family such as
           calmodulin, troponin C and S100B. These proteins
           typically undergo a calcium-dependent conformational
           change which opens a target binding site. The latter
           group is represented by calbindin D9k and do not undergo
           calcium dependent conformational changes.
          Length = 29

 Score = 35.5 bits (83), Expect = 0.002
 Identities = 8/26 (30%), Positives = 13/26 (50%)

Query: 431 QIWSLADTDQDGKMNINEFVIACKLI 456
           + +   D D DGK++  EF    K +
Sbjct: 4   EAFKEFDKDGDGKISFEEFKELLKKL 29



 Score = 35.1 bits (82), Expect = 0.002
 Identities = 8/26 (30%), Positives = 14/26 (53%)

Query: 318 LAQIWNLADMDSDGQLSCDEFVLAMH 343
           L + +   D D DG++S +EF   + 
Sbjct: 2   LKEAFKEFDKDGDGKISFEEFKELLK 27


>gnl|CDD|238008 cd00051, EFh, EF-hand, calcium binding motif; A diverse superfamily
           of calcium sensors and calcium signal modulators; most
           examples in this alignment model have 2 active canonical
           EF hands. Ca2+ binding induces a conformational change
           in the EF-hand motif, leading to the activation or
           inactivation of target proteins. EF-hands tend to occur
           in pairs or higher copy numbers.
          Length = 63

 Score = 35.6 bits (83), Expect = 0.004
 Identities = 14/63 (22%), Positives = 30/63 (47%), Gaps = 2/63 (3%)

Query: 283 KYTQLFNTTDRTRSGFLSGPQARNIMVATG--LSQGILAQIWNLADMDSDGQLSCDEFVL 340
           +  + F   D+   G +S  + +  + + G  LS+  + ++    D D DG++  +EF+ 
Sbjct: 1   ELREAFRLFDKDGDGTISADELKAALKSLGEGLSEEEIDEMIREVDKDGDGKIDFEEFLE 60

Query: 341 AMH 343
            M 
Sbjct: 61  LMA 63



 Score = 35.2 bits (82), Expect = 0.005
 Identities = 12/46 (26%), Positives = 24/46 (52%), Gaps = 2/46 (4%)

Query: 407 NGIVTGDQVKGFLMQSGLPV--ATLGQIWSLADTDQDGKMNINEFV 450
           +G ++ D++K  L   G  +    + ++    D D DGK++  EF+
Sbjct: 14  DGTISADELKAALKSLGEGLSEEEIDEMIREVDKDGDGKIDFEEFL 59


>gnl|CDD|185504 PTZ00184, PTZ00184, calmodulin; Provisional.
          Length = 149

 Score = 37.0 bits (86), Expect = 0.005
 Identities = 20/62 (32%), Positives = 35/62 (56%), Gaps = 2/62 (3%)

Query: 283 KYTQLFNTTDRTRSGFLSGPQARNIMVATG--LSQGILAQIWNLADMDSDGQLSCDEFVL 340
           +  + F   DR  +GF+S  + R++M   G  L+   + ++   AD+D DGQ++ +EFV 
Sbjct: 85  EIKEAFKVFDRDGNGFISAAELRHVMTNLGEKLTDEEVDEMIREADVDGDGQINYEEFVK 144

Query: 341 AM 342
            M
Sbjct: 145 MM 146


>gnl|CDD|197492 smart00054, EFh, EF-hand, calcium binding motif.  EF-hands are
           calcium-binding motifs that occur at least in pairs.
           Links between disease states and genes encoding
           EF-hands, particularly the S100 subclass, are emerging.
           Each motif consists of a 12 residue loop flanked on
           either side by a 12 residue alpha-helix. EF-hands
           undergo a conformational change unpon binding calcium
           ions.
          Length = 29

 Score = 32.7 bits (76), Expect = 0.018
 Identities = 9/26 (34%), Positives = 14/26 (53%)

Query: 431 QIWSLADTDQDGKMNINEFVIACKLI 456
           + + L D D DGK++  EF    K +
Sbjct: 4   EAFRLFDKDGDGKIDFEEFKDLLKAL 29



 Score = 29.7 bits (68), Expect = 0.22
 Identities = 8/25 (32%), Positives = 14/25 (56%)

Query: 318 LAQIWNLADMDSDGQLSCDEFVLAM 342
           L + + L D D DG++  +EF   +
Sbjct: 2   LKEAFRLFDKDGDGKIDFEEFKDLL 26


>gnl|CDD|237862 PRK14948, PRK14948, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 620

 Score = 36.9 bits (86), Expect = 0.024
 Identities = 15/77 (19%), Positives = 21/77 (27%)

Query: 147 LSTSIVSPPNTVAPPSSTTPLIPGITSPPGTLQGGITSPVSIIPGATSPPNVPAAITKLT 206
           L +   S  NT   P       P     P   Q   T+P    P         ++     
Sbjct: 510 LESQSGSASNTAKTPPPPQKSPPPPAPTPPLPQPTATAPPPTPPPPPPTATQASSNAPAQ 569

Query: 207 PPGPLTPDKPPAPVESP 223
            P   +P  P     +P
Sbjct: 570 IPADSSPPPPIPEEPTP 586



 Score = 31.5 bits (72), Expect = 1.0
 Identities = 16/81 (19%), Positives = 22/81 (27%)

Query: 144 PSPLSTSIVSPPNTVAPPSSTTPLIPGITSPPGTLQGGITSPVSIIPGATSPPNVPAAIT 203
            S ++ +      T AP  S  P     ++P         SP         P        
Sbjct: 365 ISEIANASAPANPTPAPNPSPPPAPIQPSAPKTKQAATTPSPPPAKASPPIPVPAEPTEP 424

Query: 204 KLTPPGPLTPDKPPAPVESPW 224
             TPP       P   +E  W
Sbjct: 425 SPTPPANAANAPPSLNLEELW 445


>gnl|CDD|237057 PRK12323, PRK12323, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 700

 Score = 36.8 bits (85), Expect = 0.024
 Identities = 23/94 (24%), Positives = 33/94 (35%), Gaps = 5/94 (5%)

Query: 138 SGGGSGPSPLSTSIVS--PPNTVAPPSST-TPLIPGITSPPGTLQGGITSPVSIIPGATS 194
           SGGG+GP+  + + V+   P   AP ++   P  P                V+  P   S
Sbjct: 368 SGGGAGPATAAAAPVAQPAPAAAAPAAAAPAPAAPPAAPAAAPAAAAAARAVAAAPARRS 427

Query: 195 PPNVPAAITKLTPP--GPLTPDKPPAPVESPWGA 226
           P     A  +          P   PAP  +P  A
Sbjct: 428 PAPEALAAARQASARGPGGAPAPAPAPAAAPAAA 461



 Score = 31.8 bits (72), Expect = 1.0
 Identities = 17/93 (18%), Positives = 27/93 (29%), Gaps = 14/93 (15%)

Query: 139 GGGSGPSPLSTSIVSPPNTVAPPSSTTPLIPGITSPPGTLQGGITSPVSIIPGATSPPNV 198
           G G  P+P      +P     P ++    +    +                P   +P   
Sbjct: 443 GPGGAPAPAPAPAAAPAAAARPAAAGPRPVAAAAAAA--------------PARAAPAAA 488

Query: 199 PAAITKLTPPGPLTPDKPPAPVESPWGAFINGW 231
           PA      PP    P +  +P  +   A   GW
Sbjct: 489 PAPADDDPPPWEELPPEFASPAPAQPDAAPAGW 521



 Score = 30.6 bits (69), Expect = 2.1
 Identities = 14/86 (16%), Positives = 24/86 (27%), Gaps = 3/86 (3%)

Query: 144 PSPLSTSIVSPPNTVAPPSSTTPLIPGITSPPGTLQGGITSPVSIIPGATSPPNVPAAIT 203
           P+  + +  +     A  ++     P   +     Q     P      A +P   PAA  
Sbjct: 403 PAAPAAAPAAAAAARAVAAAPARRSPAPEALAAARQASARGPGGAPAPAPAPAAAPAAAA 462

Query: 204 KLTPPGPLTP---DKPPAPVESPWGA 226
           +    GP              +P  A
Sbjct: 463 RPAAAGPRPVAAAAAAAPARAAPAAA 488


>gnl|CDD|215130 PLN02217, PLN02217, probable pectinesterase/pectinesterase
           inhibitor.
          Length = 670

 Score = 36.2 bits (83), Expect = 0.040
 Identities = 25/90 (27%), Positives = 35/90 (38%), Gaps = 1/90 (1%)

Query: 135 LIVSGGGSGPSPLSTSIVSPPNTVAPPSSTTPLIPGITSPPGTLQGGITSPVSIIPGATS 194
           L     GS  S  + S  S   T +  S +T + P  + P G L     +P  I+  +TS
Sbjct: 561 LFAGNPGSTNSTPTGSAASSNTTFSSDSPSTVVAPSTSPPAGHLGSPPATPSKIVSPSTS 620

Query: 195 PPNVPAAITKLTPPGPLTPDKPPA-PVESP 223
           PP         TP  P +  K  +    SP
Sbjct: 621 PPASHLGSPSTTPSSPESSIKVASTETASP 650


>gnl|CDD|237874 PRK14971, PRK14971, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 614

 Score = 36.3 bits (84), Expect = 0.040
 Identities = 18/84 (21%), Positives = 29/84 (34%), Gaps = 5/84 (5%)

Query: 136 IVSGGGSGPSPLSTSIVSPPNTVAPPSSTTPLIPGITSPPGTLQGGITSPVSIIPGATSP 195
           + +   + P P + +  SP    +P  S+    P       T   G    VS+ P A  P
Sbjct: 382 VFTQPAAAPQPSAAAAASP----SPSQSSAAAQPSAP-QSATQPAGTPPTVSVDPPAAVP 436

Query: 196 PNVPAAITKLTPPGPLTPDKPPAP 219
            N P+   +   P     +K    
Sbjct: 437 VNPPSTAPQAVRPAQFKEEKKIPV 460



 Score = 33.6 bits (77), Expect = 0.26
 Identities = 22/85 (25%), Positives = 33/85 (38%), Gaps = 7/85 (8%)

Query: 127 TIQAKGGYLIVSGGGSGPSPLSTSIVSPPNTVAPPSSTTPLIPGITSPPGTLQGGITSPV 186
             Q        +   + PSP  +S  + P   AP S+T    P  T P  ++      PV
Sbjct: 383 FTQPAAAPQPSAAAAASPSPSQSSAAAQP--SAPQSATQ---PAGTPPTVSVDPPAAVPV 437

Query: 187 SIIPGATSPPNVPAAITKLTPPGPL 211
           +  P +T+P  V  A  K     P+
Sbjct: 438 N--PPSTAPQAVRPAQFKEEKKIPV 460


>gnl|CDD|184923 PRK14959, PRK14959, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 624

 Score = 36.2 bits (83), Expect = 0.042
 Identities = 21/79 (26%), Positives = 34/79 (43%), Gaps = 3/79 (3%)

Query: 141 GSGPSPLSTSIVSPPNTVAPPSSTTPLIPGITSPPGTLQGGITS-PVSIIPGATSPPNVP 199
           G+ P+   T   + P T AP ++ +P +P   +PP   + GI   P   +P A+  P  P
Sbjct: 407 GTAPAAGMTPSSAAPATPAPSAAPSPRVPWDDAPPAPPRSGIPPRPAPRMPEASPVPGAP 466

Query: 200 AAITKL--TPPGPLTPDKP 216
            ++      PP    P   
Sbjct: 467 DSVASASDAPPTLGDPSDT 485



 Score = 32.7 bits (74), Expect = 0.46
 Identities = 25/92 (27%), Positives = 33/92 (35%), Gaps = 16/92 (17%)

Query: 138 SGGGSGPSPLSTSIVSPPNTVAPPSSTTPLIPGITSPPGTLQGGITSPVSIIPGATSPPN 197
           SGGG+     S +        +  ++T P  PG   P GT      +P S  P   +P  
Sbjct: 374 SGGGASAPSGSAA----EGPASGGAATIP-TPGTQGPQGTAPAAGMTPSSAAPATPAPSA 428

Query: 198 VPAAITKLTPPGPLTP--DKPPAPVESPWGAF 227
            P+         P  P  D PPAP  S     
Sbjct: 429 APS---------PRVPWDDAPPAPPRSGIPPR 451


>gnl|CDD|235906 PRK07003, PRK07003, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 830

 Score = 36.0 bits (83), Expect = 0.046
 Identities = 9/85 (10%), Positives = 15/85 (17%)

Query: 142 SGPSPLSTSIVSPPNTVAPPSSTTPLIPGITSPPGTLQGGITSPVSIIPGATSPPNVPAA 201
                 +      P   A  ++T    P     P                  +  N  A+
Sbjct: 399 VTAVTGAAGAALAPKAAAAAAATRAEAPPAAPAPPATADRGDDAADGDAPVPAKANARAS 458

Query: 202 ITKLTPPGPLTPDKPPAPVESPWGA 226
                      P        +P   
Sbjct: 459 ADSRCDERDAQPPADSGSASAPASD 483


>gnl|CDD|222177 pfam13499, EF_hand_5, EF-hand domain pair. 
          Length = 60

 Score = 32.3 bits (74), Expect = 0.046
 Identities = 16/57 (28%), Positives = 29/57 (50%), Gaps = 6/57 (10%)

Query: 292 DRTRSGFLSGPQARNIMVATGLSQG------ILAQIWNLADMDSDGQLSCDEFVLAM 342
           D+   G++   + R ++ A GL         ++   +N  D D DG++S +EF+ AM
Sbjct: 4   DKDGDGYIDVEELRKLLKALGLKLTDEEVEELIEADFNEIDKDGDGRISFEEFLEAM 60


>gnl|CDD|217469 pfam03276, Gag_spuma, Spumavirus gag protein. 
          Length = 582

 Score = 35.7 bits (82), Expect = 0.052
 Identities = 20/86 (23%), Positives = 32/86 (37%)

Query: 138 SGGGSGPSPLSTSIVSPPNTVAPPSSTTPLIPGITSPPGTLQGGITSPVSIIPGATSPPN 197
            GG  G  P +     P +    P  ++ L P  +S PG     ++    +   + + P+
Sbjct: 175 IGGRGGNIPGAIQPPPPSSLPGLPPGSSSLAPSASSTPGNRLPRVSFNPFLPGPSPAQPS 234

Query: 198 VPAAITKLTPPGPLTPDKPPAPVESP 223
            P A     P  P+     P PV  P
Sbjct: 235 APPASIPAPPIPPVIQYVAPPPVPPP 260


>gnl|CDD|222648 pfam14283, DUF4366, Domain of unknown function (DUF4366).  This
           family of proteins is found in bacteria and eukaryotes.
           Proteins in this family are typically between 227 and
           387 amino acids in length.
          Length = 213

 Score = 34.2 bits (79), Expect = 0.082
 Identities = 15/45 (33%), Positives = 16/45 (35%)

Query: 170 GITSPPGTLQGGITSPVSIIPGATSPPNVPAAITKLTPPGPLTPD 214
           G  S P    GG   P      AT P  V          GPLTP+
Sbjct: 14  GGFSVPAFAYGGEPEPEVPQTEATDPSEVVVETVTEEEGGPLTPE 58


>gnl|CDD|217310 pfam02993, MCPVI, Minor capsid protein VI.  This minor capsid
           protein may act as a link between the external capsid
           and the internal DNA-protein core. The C-terminal 11
           residues may function as a protease cofactor leading to
           enzyme activation.
          Length = 238

 Score = 34.0 bits (78), Expect = 0.12
 Identities = 21/87 (24%), Positives = 29/87 (33%), Gaps = 8/87 (9%)

Query: 140 GGSGPSPLSTSIVSPPNTVAP---PSSTTPLIPGITSPPGTLQGGITSPVSIIPGATSPP 196
           G   P+P   ++  P   + P   P     L+P    PP          +   P     P
Sbjct: 111 GEEEPAPQEETVADPIQALQPRPRPDVEEVLVPAAPEPPS-----YEETIKPGPAPVEEP 165

Query: 197 NVPAAITKLTPPGPLTPDKPPAPVESP 223
               AI       P+T + PPAP   P
Sbjct: 166 VDSMAIAVPAIDTPVTLELPPAPQPPP 192


>gnl|CDD|237865 PRK14951, PRK14951, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 618

 Score = 34.3 bits (79), Expect = 0.16
 Identities = 15/84 (17%), Positives = 25/84 (29%), Gaps = 1/84 (1%)

Query: 141 GSGPSPLSTSIVSPPNTVAPPSSTTPLIPGITSPPGTLQGGITSPVSIIPGATSPP-NVP 199
              P+    +  S P      +   P+     + P        + V++ P   +      
Sbjct: 400 APAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAAAPAAVALAPAPPAQAAPET 459

Query: 200 AAITKLTPPGPLTPDKPPAPVESP 223
            AI     P P      PAP  +P
Sbjct: 460 VAIPVRVAPEPAVASAAPAPAAAP 483



 Score = 31.2 bits (71), Expect = 1.5
 Identities = 14/85 (16%), Positives = 20/85 (23%), Gaps = 5/85 (5%)

Query: 144 PSPLSTSIVSPPNTVAPPSSTTPLIPGITSPPGTLQGGITSPVSIIPGATSPPNVPAAIT 203
               +    +P    A   +         S P              P A +P   PAA  
Sbjct: 383 RPEAAAPAAAPVAQAAAAPAPAAAPAAAASAPAAPPAAAPPAPVAAPAAAAPAAAPAA-- 440

Query: 204 KLTPPGPLTPDKPPAPVESPWGAFI 228
               P  +     P    +P    I
Sbjct: 441 ---APAAVALAPAPPAQAAPETVAI 462


>gnl|CDD|227455 COG5126, FRQ1, Ca2+-binding protein (EF-Hand superfamily) [Signal
           transduction mechanisms / Cytoskeleton / Cell division
           and chromosome partitioning / General function
           prediction only].
          Length = 160

 Score = 32.7 bits (75), Expect = 0.20
 Identities = 14/59 (23%), Positives = 29/59 (49%), Gaps = 2/59 (3%)

Query: 283 KYTQLFNTTDRTRSGFLSGPQARNIMVATG--LSQGILAQIWNLADMDSDGQLSCDEFV 339
           +  + F   D+   G++S  + R ++ + G  LS   + ++    D D DG++  +EF 
Sbjct: 93  ELREAFKLFDKDHDGYISIGELRRVLKSLGERLSDEEVEKLLKEYDEDGDGEIDYEEFK 151


>gnl|CDD|152960 pfam12526, DUF3729, Protein of unknown function (DUF3729).  This
           family of proteins is found in viruses. Proteins in this
           family are typically between 145 and 1707 amino acids in
           length. The family is found in association with
           pfam01443, pfam01661, pfam05417, pfam01660, pfam00978.
           There is a single completely conserved residue L that
           may be functionally important.
          Length = 115

 Score = 31.6 bits (72), Expect = 0.27
 Identities = 23/90 (25%), Positives = 28/90 (31%), Gaps = 15/90 (16%)

Query: 137 VSGGGSGPSPLSTSIVSPPNTVAPPSSTTPLIPGITSPPGTLQGGITSPVSIIPGATSPP 196
            SG  S  SP  ++    P  V  P       P               PVS +     P 
Sbjct: 22  TSGFSSCFSPPESAHPDDPPPVGDPRPPVVDTP--------------PPVSAVWVLPPPS 67

Query: 197 NVPAAITKLTPPGPLTPDKPPAPVESPWGA 226
              A      PP P  P  PP+P+  P  A
Sbjct: 68  EPAAPPPDPEPPVP-GPAGPPSPLAPPAPA 96


>gnl|CDD|237871 PRK14965, PRK14965, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 576

 Score = 33.6 bits (77), Expect = 0.28
 Identities = 19/90 (21%), Positives = 25/90 (27%), Gaps = 14/90 (15%)

Query: 139 GGGSGPSPLSTSIVSPPNTVAPPSSTTPLIPGITSPPGTLQGGITSPVSIIPGATSPPNV 198
            G   P   +    +P    APP +  P +P               P +  P     P  
Sbjct: 379 RGAPAPPSAAWGAPTPAAPAAPPPAAAPPVPPAAPAR---------PAAARPAPAPAPPA 429

Query: 199 PAAITKLTPPGPLTPDKPPAPVESPWGAFI 228
            AA     PP         A     W AF+
Sbjct: 430 AAA-----PPARSADPAAAASAGDRWRAFV 454


>gnl|CDD|236090 PRK07764, PRK07764, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 824

 Score = 33.4 bits (77), Expect = 0.29
 Identities = 23/148 (15%), Positives = 35/148 (23%), Gaps = 5/148 (3%)

Query: 79  STPGSSCSASHTSATNGPCHPSLTSPTHWYLQGKKYHTHVFTAGQFCMTIQAKGGYLIVS 138
           + P +  + +  +       P+  S           H     A                 
Sbjct: 616 AAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGWPAKAGG 675

Query: 139 GGGSGPSPLSTSIVSPPNTVAPPSSTTPLIPGITSPPGTLQGGITSPVSIIPGATSPPNV 198
              + P P            A P+   P             G    P +  P A    + 
Sbjct: 676 AAPAAPPPAPAPAAPAAPAGAAPAQPAP-----APAATPPAGQADDPAAQPPQAAQGASA 730

Query: 199 PAAITKLTPPGPLTPDKPPAPVESPWGA 226
           P+       P P  PD PP P  +P   
Sbjct: 731 PSPAADDPVPLPPEPDDPPDPAGAPAQP 758



 Score = 31.1 bits (71), Expect = 1.5
 Identities = 20/102 (19%), Positives = 27/102 (26%), Gaps = 13/102 (12%)

Query: 138 SGGGSGPSPLSTSIVSPPNTVAPPSSTTPLIPGITSPPGTLQGGITSPVSIIPGATSPPN 197
              G    P   S   P     P +   P  P   +P G       +  +  PG  +P +
Sbjct: 593 GAAGGEGPPAPASSGPPEEAARPAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEH 652

Query: 198 VPAAI-------------TKLTPPGPLTPDKPPAPVESPWGA 226
            P  +              K     P  P   PAP      A
Sbjct: 653 HPKHVAVPDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPA 694


>gnl|CDD|237864 PRK14950, PRK14950, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 585

 Score = 31.7 bits (72), Expect = 0.95
 Identities = 12/77 (15%), Positives = 21/77 (27%), Gaps = 5/77 (6%)

Query: 152 VSPPNTVAPPSSTTPLI---PGITSPPGTLQGGITSPVSIIPGATSPPNVPA--AITKLT 206
           V  P    P ++    +   P  ++ P         P   +    +PP VP       + 
Sbjct: 363 VPAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRPVAPPVP 422

Query: 207 PPGPLTPDKPPAPVESP 223
                 P    A +   
Sbjct: 423 HTPESAPKLTRAAIPVD 439



 Score = 30.9 bits (70), Expect = 1.9
 Identities = 17/68 (25%), Positives = 22/68 (32%), Gaps = 1/68 (1%)

Query: 157 TVAPPSSTTPLIPGITSP-PGTLQGGITSPVSIIPGATSPPNVPAAITKLTPPGPLTPDK 215
            + P  +  P  P   +P P       ++       A  PP  P   T   PP P  P  
Sbjct: 359 LLVPVPAPQPAKPTAAAPSPVRPTPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRPVA 418

Query: 216 PPAPVESP 223
           PP P    
Sbjct: 419 PPVPHTPE 426



 Score = 30.2 bits (68), Expect = 3.3
 Identities = 25/91 (27%), Positives = 32/91 (35%), Gaps = 7/91 (7%)

Query: 143 GPSPLSTSIVSPPNTVAPPSSTTPLIPGITSP-PGTLQGGITSPVSIIPGATSPPNVPAA 201
            P+  + S V P  T AP +           P     +     PV   P A   P+ P +
Sbjct: 370 KPTAAAPSPVRP--TPAPSTRPKAAAAANIPPKEPVRETATPPPVPPRPVAPPVPHTPES 427

Query: 202 ITKLTP---PGPLTPD-KPPAPVESPWGAFI 228
             KLT    P    P   PPAP +    A I
Sbjct: 428 APKLTRAAIPVDEKPKYTPPAPPKEEEKALI 458


>gnl|CDD|237030 PRK12270, kgd, alpha-ketoglutarate decarboxylase; Reviewed.
          Length = 1228

 Score = 31.8 bits (73), Expect = 1.1
 Identities = 12/72 (16%), Positives = 20/72 (27%), Gaps = 1/72 (1%)

Query: 153 SPPNTVAPPSSTTPLIPGITSPPGTL-QGGITSPVSIIPGATSPPNVPAAITKLTPPGPL 211
            P +T AP ++        ++P          +P    P A +P   P            
Sbjct: 37  GPGSTAAPTAAAAAAAAAASAPAAAPAAKAPAAPAPAPPAAAAPAAPPKPAAAAAAAAAP 96

Query: 212 TPDKPPAPVESP 223
                 A   +P
Sbjct: 97  AAPPAAAAAAAP 108


>gnl|CDD|183756 PRK12799, motB, flagellar motor protein MotB; Reviewed.
          Length = 421

 Score = 31.2 bits (70), Expect = 1.4
 Identities = 18/85 (21%), Positives = 30/85 (35%), Gaps = 2/85 (2%)

Query: 137 VSGGGSGPSPLSTSIVSPPNTVAPPSSTTPLIPGITSPPGTLQGGITSPVSIIPGATSPP 196
           +      PS ++T   +     A   S+  ++P   + PGT+      PV++ P   S  
Sbjct: 323 IPSPAVIPSSVTTQSATTTQASAVALSSAGVLPSDVTLPGTVALPAAEPVNMQPQPMSTT 382

Query: 197 NVPAAITKLTPPGPLTPDK--PPAP 219
               + T         P    P AP
Sbjct: 383 ETQQSSTGNITSTANGPTTSLPAAP 407


>gnl|CDD|236138 PRK07994, PRK07994, DNA polymerase III subunits gamma and tau;
           Validated.
          Length = 647

 Score = 31.0 bits (71), Expect = 1.5
 Identities = 18/80 (22%), Positives = 30/80 (37%), Gaps = 9/80 (11%)

Query: 144 PSPLSTSIVSPPNTVAPPSSTTPLIPGITSPPGTLQGGITSPVSIIPGATSPPNVPAAIT 203
           P+  + +  +P   VAPP +     P  ++P         +P   +P  TS   + AA  
Sbjct: 377 PAASAQATAAPTAAVAPPQAPAVPPPPASAPQ-------QAPAVPLPETTSQ--LLAARQ 427

Query: 204 KLTPPGPLTPDKPPAPVESP 223
           +L      T  K   P  + 
Sbjct: 428 QLQRAQGATKAKKSEPAAAS 447


>gnl|CDD|223033 PHA03291, PHA03291, envelope glycoprotein I; Provisional.
          Length = 401

 Score = 30.7 bits (69), Expect = 1.7
 Identities = 17/71 (23%), Positives = 21/71 (29%), Gaps = 2/71 (2%)

Query: 143 GPSPLSTSIVSPPNTVAPPSSTTPLIPGITSPPGTLQGGITSPVSIIPGATSPPNVPAAI 202
                +    + P T   PS+TT   P  T+ P              P A   P  P   
Sbjct: 206 ATPRPTPRTTASPETTPTPSTTTS--PPSTTIPAPSTTIAAPQAGTTPEAEGTPAPPTPG 263

Query: 203 TKLTPPGPLTP 213
               PP   TP
Sbjct: 264 GGEAPPANATP 274



 Score = 29.9 bits (67), Expect = 3.4
 Identities = 15/71 (21%), Positives = 22/71 (30%)

Query: 144 PSPLSTSIVSPPNTVAPPSSTTPLIPGITSPPGTLQGGITSPVSIIPGATSPPNVPAAIT 203
           P P   +  SP  T  P ++T+P    I +P  T+            G  +PP       
Sbjct: 208 PRPTPRTTASPETTPTPSTTTSPPSTTIPAPSTTIAAPQAGTTPEAEGTPAPPTPGGGEA 267

Query: 204 KLTPPGPLTPD 214
                 P    
Sbjct: 268 PPANATPAPEA 278



 Score = 28.4 bits (63), Expect = 9.4
 Identities = 18/60 (30%), Positives = 23/60 (38%), Gaps = 6/60 (10%)

Query: 160 PPSSTTPLIPGITSPPGTLQGGITSPVSIIPGATSPPNVPAAITKLTPPGPLTPDKPPAP 219
            P+  T   P  T  P T     + P + IP  ++    P A T     G  TP  PP P
Sbjct: 209 RPTPRTTASPETTPTPST---TTSPPSTTIPAPSTTIAAPQAGTTPEAEG--TPA-PPTP 262


>gnl|CDD|178748 PLN03209, PLN03209, translocon at the inner envelope of chloroplast
           subunit 62; Provisional.
          Length = 576

 Score = 30.7 bits (69), Expect = 1.8
 Identities = 21/83 (25%), Positives = 31/83 (37%), Gaps = 7/83 (8%)

Query: 142 SGPSPLSTSIVSPPNTVAPPSSTTPLIPGI-TSPPGTLQGGITSPVSIIPGATSPPNVPA 200
           + PSP      + P  V+P  S+T  +P +  + P T      +P        SP  V  
Sbjct: 451 TSPSP------TAPTGVSPSVSSTSSVPAVPDTAPATAATDAAAPPPANMRPLSPYAVYD 504

Query: 201 AITKLTPPGPLTPDKPPAPVESP 223
            +   T P P  P    AP  + 
Sbjct: 505 DLKPPTSPSPAAPVGKVAPSSTN 527


>gnl|CDD|225207 COG2326, COG2326, Uncharacterized conserved protein [Function
           unknown].
          Length = 270

 Score = 30.4 bits (69), Expect = 2.0
 Identities = 8/26 (30%), Positives = 12/26 (46%), Gaps = 2/26 (7%)

Query: 99  PSLTSPTHWYLQGKKYHTHVFTAGQF 124
           P+      WY Q  +Y  H+  AG+ 
Sbjct: 111 PTDRERGQWYFQ--RYVAHLPAAGEI 134


>gnl|CDD|218471 pfam05157, T2SE_Nter, Type II secretion system (T2SS), protein E,
           N-terminal domain.  This domain is found at the
           N-terminus of members of the Type II secretion system
           protein E. Proteins in this subfamily are typically
           involved in Type 4 pilus biogenesis , though some are
           involved in other processes; for instance aggregation in
           Myxococcus xanthus. The structure of this domain is now
           known.
          Length = 110

 Score = 28.8 bits (65), Expect = 2.3
 Identities = 14/71 (19%), Positives = 30/71 (42%), Gaps = 13/71 (18%)

Query: 357 PLPIDMIPPAFRRQRQ--------NSVTLAANVAMDPWNVSRHERTRFETHFQAMKPVNG 408
           P  + ++P  F R+           ++ +A     DP ++   +  RF T  + ++PV  
Sbjct: 35  PELLRLLPREFARRHLVLPLREDDGTLVVAVA---DPLDLELLDELRFLTGRKRVEPV-- 89

Query: 409 IVTGDQVKGFL 419
           + T   ++  L
Sbjct: 90  LATPSDIRRAL 100


>gnl|CDD|236776 PRK10856, PRK10856, cytoskeletal protein RodZ; Provisional.
          Length = 331

 Score = 30.4 bits (69), Expect = 2.4
 Identities = 15/89 (16%), Positives = 22/89 (24%), Gaps = 13/89 (14%)

Query: 138 SGGGSGPSPLSTSIVSPPNTVAPPSSTTPLIPGITSPPGTLQGGITSPVSIIPGATSPPN 197
           S       PL TS  + P T   P++         + P   Q    +           P 
Sbjct: 155 SQNSGQSVPLDTSTTTDPATTPAPAAPVD------TTPTNSQTPAVAT-------APAPA 201

Query: 198 VPAAITKLTPPGPLTPDKPPAPVESPWGA 226
           V      +  P     D    P  +    
Sbjct: 202 VDPQQNAVVAPSQANVDTAATPAPAAPAT 230



 Score = 30.0 bits (68), Expect = 2.5
 Identities = 19/85 (22%), Positives = 27/85 (31%), Gaps = 2/85 (2%)

Query: 141 GSGPSPLSTSIVSPPNTVAPPSSTTPLIPGITSPPGTLQGGITSPVSIIPGATSPPNVPA 200
            +   P +T   + P    P +S TP +    +P    Q       S     T+    PA
Sbjct: 167 STTTDPATTPAPAAPVDTTPTNSQTPAVATAPAPAVDPQQNAVVAPSQANVDTAATPAPA 226

Query: 201 AITKLTPPGPLTPDK--PPAPVESP 223
           A        PL  D+     P   P
Sbjct: 227 APATPDGAAPLPTDQAGVSTPAADP 251


>gnl|CDD|227785 COG5498, ACF2, Predicted glycosyl hydrolase [Cell envelope
           biogenesis, outer membrane].
          Length = 760

 Score = 30.6 bits (69), Expect = 2.4
 Identities = 19/96 (19%), Positives = 29/96 (30%), Gaps = 8/96 (8%)

Query: 125 CMTIQAKGGYLIVSGGGSGPSPLSTSIVSPPN-----TVAPPSSTTPLIPGITSPPGTLQ 179
             +     G    +  G+   PL    V         + +P S + P        P  L 
Sbjct: 2   SYSRGTIDGAATATMKGANVQPLFALTVIILLFPTGASASPHSVSVPKGDSNVEFPPILS 61

Query: 180 GGITSPVSIIPGATSPPNVPAAITKLTPPGPLTPDK 215
           G I +P S       P   P    +    GP+  +K
Sbjct: 62  GTI-NPPSTFRSVQHPV--PPPGVRSDSTGPIQTNK 94


>gnl|CDD|218191 pfam04652, DUF605, Vta1 like.  Vta1 (VPS20-associated protein 1) is
           a positive regulator of Vps4. Vps4 is an ATPase that is
           required in the multivesicular body (MVB) sorting
           pathway to dissociate the endosomal sorting complex
           required for transport (ESCRT). Vta1 promotes correct
           assembly of Vps4 and stimulates its ATPase activity
           through its conserved Vta1/SBP1/LIP5 region.
          Length = 315

 Score = 30.0 bits (68), Expect = 2.4
 Identities = 20/73 (27%), Positives = 28/73 (38%), Gaps = 5/73 (6%)

Query: 142 SGPSPLSTSIVSPPNTVAPPSSTTPLIPGITSPPGTLQGGITSPVSIIPGATSPPNVPAA 201
           S   P  +S  S     +P S T P     + PPG           + P +T+ P  P+A
Sbjct: 210 SSLPPAPSSFQSDTPPPSPESPTNP-----SPPPGPAAPPPPPVQQVPPLSTAKPTPPSA 264

Query: 202 ITKLTPPGPLTPD 214
                P G +T D
Sbjct: 265 SATPAPIGGITLD 277


>gnl|CDD|177641 PHA03413, PHA03413, putative internal core protein; Provisional.
          Length = 1304

 Score = 30.4 bits (68), Expect = 2.6
 Identities = 25/103 (24%), Positives = 47/103 (45%), Gaps = 10/103 (9%)

Query: 313  LSQGILA-QIWNLADMDSDGQLSCDEFVLAMHLCDLAKGGEKIPVPLPIDMIPPAFRRQR 371
            L+ GI+A ++ N+   D++   + +EF L   + D A          P  + PP  + + 
Sbjct: 1077 LNAGIIADELENMFKGDAEAFKALEEFGLTPEIADQAIAANAAN---PGALFPPDLQME- 1132

Query: 372  QNSVTLAANVAMDPWNVSRHERTRFETHFQAMKPVNGIVTGDQ 414
               + +    AMD  ++ ++ RT   +HF    P+  I+ G Q
Sbjct: 1133 ---IEIVGTRAMD--SLMQNIRTGETSHFAQFNPIGKIIVGYQ 1170


>gnl|CDD|218673 pfam05642, Sporozoite_P67, Sporozoite P67 surface antigen.  This
           family consists of several Theileria P67 surface
           antigens. A stage specific surface antigen of Theileria
           parva, p67, is the basis for the development of an
           anti-sporozoite vaccine for the control of East Coast
           fever (ECF) in cattle. The antigen has been shown to
           contain five distinct linear peptide sequences
           recognised by sporozoite-neutralising murine monoclonal
           antibodies.
          Length = 727

 Score = 30.4 bits (68), Expect = 2.8
 Identities = 21/70 (30%), Positives = 27/70 (38%), Gaps = 4/70 (5%)

Query: 137 VSGGGSGPSPLSTSIVSPP--NTVAPPSSTTPLIPGITSPPGTLQGGITSPVSIIPGATS 194
           + G   G S  S S   P    T   P+S+ P  PG   P  + +  +T     I G   
Sbjct: 265 LPGVRVGDSTSSPSTTRPSGSTTTTTPASSGPSAPG--GPGSSSRNAVTRSTDSISGPIP 322

Query: 195 PPNVPAAITK 204
            P  P AIT 
Sbjct: 323 SPGAPRAITG 332


>gnl|CDD|223039 PHA03307, PHA03307, transcriptional regulator ICP4; Provisional.
          Length = 1352

 Score = 30.1 bits (68), Expect = 2.9
 Identities = 10/87 (11%), Positives = 18/87 (20%), Gaps = 3/87 (3%)

Query: 140 GGSGPSPLSTSIVSPPNTVAPPSSTTPLIPGITSPPGTLQGGITSPVSIIPGATSPPNVP 199
             S  +    S     +   P  S+    P    P        +    +        +  
Sbjct: 89  TWSLSTLAPASPAREGSPTPPGPSSPDPPPPTPPPASP---PPSPAPDLSEMLRPVGSPG 145

Query: 200 AAITKLTPPGPLTPDKPPAPVESPWGA 226
                  P    +P    +   S   A
Sbjct: 146 PPPAASPPAAGASPAAVASDAASSRQA 172


>gnl|CDD|220472 pfam09924, DUF2156, Uncharacterized conserved protein (DUF2156).
           This domain, found in various hypothetical prokaryotic
           proteins, has no known function.
          Length = 246

 Score = 29.5 bits (67), Expect = 3.3
 Identities = 12/47 (25%), Positives = 18/47 (38%)

Query: 98  HPSLTSPTHWYLQGKKYHTHVFTAGQFCMTIQAKGGYLIVSGGGSGP 144
           + S  S  +  L   K +      G   +  + KGG L+V G   G 
Sbjct: 3   YGSENSFANLALWRDKSYYFYAPDGDAFIAYRVKGGSLVVLGDPVGD 49


>gnl|CDD|219865 pfam08493, AflR, Aflatoxin regulatory protein.  This domain is
           found in the aflatoxin regulatory protein (AflR) which
           is involved in the regulation of the biosynthesis of
           aflatoxin in the fungal genus Aspergillus. It occurs
           together with the fungal Zn(2)-Cys(6) binuclear cluster
           domain (pfam00172).
          Length = 275

 Score = 29.5 bits (66), Expect = 3.5
 Identities = 31/121 (25%), Positives = 50/121 (41%), Gaps = 21/121 (17%)

Query: 159 APPSSTTPLIPGITSPPGTLQGGITSPVSIIPGATSPPNVPAAITKLTPPGPLTPD---- 214
            P ++++P IP  T+   T       PV   P +  PP      T  TP G  +P     
Sbjct: 4   TPNTASSPTIPANTTANTTSSSHPQPPVQSGPSSIQPPVA----TPHTPNGTSSPSPKFS 59

Query: 215 --KPPAPVESPWGAFINGWLSTKEKDLEVNNSNLVDLKKTLANMMSDSEKLYNVLGGPPL 272
              PPA  E  WG+ ++   S +++          DL  +L ++ +D  +L+  L   PL
Sbjct: 60  HQSPPAEPEL-WGSILSPNASNQDQG---------DL-SSLLSVNTDFGQLFASLSPSPL 108

Query: 273 E 273
            
Sbjct: 109 F 109


>gnl|CDD|177464 PHA02682, PHA02682, ORF080 virion core protein; Provisional.
          Length = 280

 Score = 29.4 bits (65), Expect = 3.7
 Identities = 24/90 (26%), Positives = 29/90 (32%), Gaps = 10/90 (11%)

Query: 142 SGPSPLSTSIVSPPNTVAPPSSTTPLI--------PGITSPPGTLQGGITSPVSIIPGAT 193
           SG SPL+ S        A P+              P    PP T       P ++ P   
Sbjct: 77  SGQSPLAPSPACAAPAPACPACAPAAPAPAVTCPAPAPACPPAT--APTCPPPAVCPAPA 134

Query: 194 SPPNVPAAITKLTPPGPLTPDKPPAPVESP 223
            P       T+  PP P  P   PAP   P
Sbjct: 135 RPAPACPPSTRQCPPAPPLPTPKPAPAAKP 164


>gnl|CDD|216257 pfam01034, Syndecan, Syndecan domain.  Syndecans are transmembrane
           heparin sulfate proteoglycans which are implicated in
           the binding of extracellular matrix components and
           growth factors.
          Length = 207

 Score = 29.3 bits (66), Expect = 3.8
 Identities = 20/90 (22%), Positives = 27/90 (30%), Gaps = 10/90 (11%)

Query: 137 VSGGGSGPSPLSTSIVSPPNTVA--PPSSTTPLIPGITSPPGTLQGGITSPVSIIPGATS 194
            SG GSG +P       P  T A  P  +TT   P   +   +     +  VS     T+
Sbjct: 53  YSGSGSGATPSDDEDSEPVTTSATPPKLTTTSSSPSNDTTTASTSTKTSPTVSTTVTTTT 112

Query: 195 PP------NVPAAI--TKLTPPGPLTPDKP 216
            P           +     T  G      P
Sbjct: 113 SPSETDTEEATTTVSTETPTEGGSSAATDP 142


>gnl|CDD|222010 pfam13254, DUF4045, Domain of unknown function (DUF4045).  This
           presumed domain is functionally uncharacterized. This
           domain family is found in bacteria and eukaryotes, and
           is typically between 384 and 430 amino acids in length.
          Length = 414

 Score = 29.4 bits (66), Expect = 4.1
 Identities = 16/83 (19%), Positives = 26/83 (31%), Gaps = 1/83 (1%)

Query: 141 GSGPSPLSTSIVSPPNTVAPPS-STTPLIPGITSPPGTLQGGITSPVSIIPGATSPPNVP 199
           G   S    + V    T  P S S +P   GI   P +     T P       +S     
Sbjct: 193 GRTNSFKEVTPVGLMRTPPPGSHSKSPSKSGIPDLPSSRDSEKTKPEKPQQETSSMDTEK 252

Query: 200 AAITKLTPPGPLTPDKPPAPVES 222
           ++  K          +   P+++
Sbjct: 253 SSAPKPRETLDPKSPEKAPPIDT 275


>gnl|CDD|223065 PHA03378, PHA03378, EBNA-3B; Provisional.
          Length = 991

 Score = 29.6 bits (66), Expect = 4.2
 Identities = 21/82 (25%), Positives = 26/82 (31%), Gaps = 7/82 (8%)

Query: 143 GPSPLSTSIVSPPNTVAPPSSTTPLIPGITSPPGTLQGGITSPVSIIPGATSPPNVPAAI 202
            P+P       P   + P   T   I  I   P         P+   PG   PP  P A 
Sbjct: 648 FPTPHQP----PQVEITPYKPTWTQIGHIPYQPSPTGANTMLPIQWAPGTMQPP--PRAP 701

Query: 203 TKLTPP-GPLTPDKPPAPVESP 223
           T + PP  P    + PA     
Sbjct: 702 TPMRPPAAPPGRAQRPAAATGR 723



 Score = 29.6 bits (66), Expect = 4.7
 Identities = 22/81 (27%), Positives = 29/81 (35%), Gaps = 4/81 (4%)

Query: 146 PLSTSIVSPPNTVAPPSSTTPLIPGITSPPGTLQGGITSPVSIIPGATSPPNVPAAITKL 205
           P + +  + P   AP  +  P      + P     G   P +  PG   PP         
Sbjct: 717 PAAATGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGRARPPAAAPGAPTP 776

Query: 206 TPPGPLTPDKPPAPVESPWGA 226
            PP    P  PPAP + P GA
Sbjct: 777 QPP----PQAPPAPQQRPRGA 793


>gnl|CDD|178165 PLN02550, PLN02550, threonine dehydratase.
          Length = 591

 Score = 29.5 bits (66), Expect = 4.4
 Identities = 37/139 (26%), Positives = 59/139 (42%), Gaps = 10/139 (7%)

Query: 156 NTVAPPSSTTPLIPGITSPPGTLQGGITSPVSIIPGATSPPNVPAAITKLTPPGPLTPDK 215
           ++V  P++ +PL   I SP   + G      S IP A       A  T + P  P  P  
Sbjct: 2   SSVGLPTAGSPLRSHIGSPSKPVVGSTPFSRSRIPAAVD----SADETSMAP--PPPPSP 55

Query: 216 PPAPVESPWG-AFINGWL-STKEKDLEVNNSNLVDLKKTLANMMSDSEKLYNVLGGPPLE 273
            P    SP    +  G+L +  E+  E  N ++ +  + L N++  S K+Y+V    PL+
Sbjct: 56  LPLLKVSPNSLQYPAGYLGAVPERTNEAENGSIPEAMEYLTNIL--SAKVYDVAIESPLQ 113

Query: 274 WAVPHASKLKYTQLFNTTD 292
            A   + +L    L    D
Sbjct: 114 LAKKLSERLGVKVLLKRED 132


>gnl|CDD|163669 cd08162, MPP_PhoA_N, Synechococcus sp. strain PCC 7942  PhoA and
           related proteins, N-terminal metallophosphatase domain. 
           Synechococcus sp. strain PCC 7942 PhoA is a large
           atypical alkaline phosphatase.  It is known to be
           transported across the inner cytoplasmic membrane and
           into the periplasmic space.  In vivo inactivation of the
           gene encoding PhoA leads to a loss of extracellular,
           phosphate-regulated phosphatase activity, but does not
           appear to affect the cells capacity for phosphate
           uptake.  PhoA may play a role in scavenging phosphate
           during growth of Synechococcus sp. strain PCC 7942 in
           its natural environment.  PhoA  belongs to a domain
           family which includes the bacterial enzyme UshA and
           several other related enzymes including SoxB, CpdB,
           YhcR, and CD73.  All members have a similar domain
           architecture which includes an N-terminal
           metallophosphatase domain and a C-terminal nucleotidase
           domain.  The N-terminal metallophosphatase domain
           belongs to a large superfamily of distantly related
           metallophosphatases (MPPs) that includes:
           Mre11/SbcD-like exonucleases, Dbr1-like RNA lariat
           debranching enzymes, YfcE-like phosphodiesterases,
           purple acid phosphatases (PAPs), YbbF-like
           UDP-2,3-diacylglucosamine hydrolases, and acid
           sphingomyelinases (ASMases).  MPPs are functionally
           diverse, but all share a conserved domain with an active
           site consisting of two metal ions (usually manganese,
           iron, or zinc) coordinated with octahedral geometry by a
           cage of histidine, aspartate, and asparagine residues.
           The conserved domain is a double beta-sheet sandwich
           with a di-metal active site made up of residues located
           at the C-terminal side of the sheets. This domain is
           thought to allow for productive metal coordination.
          Length = 313

 Score = 28.8 bits (65), Expect = 5.5
 Identities = 12/46 (26%), Positives = 19/46 (41%), Gaps = 2/46 (4%)

Query: 164 TTPLIPGITSPPGTLQGGITSPVSIIPGATSPPNVPAAITKLTPPG 209
           TTP +P I+SP G          + +    +   +  +I  LT  G
Sbjct: 164 TTPTLPSISSPGGVT--VNPDDTADMQALAAVEQIQPSIDALTAQG 207


>gnl|CDD|218439 pfam05109, Herpes_BLLF1, Herpes virus major outer envelope
           glycoprotein (BLLF1).  This family consists of the BLLF1
           viral late glycoprotein, also termed gp350/220. It is
           the most abundantly expressed glycoprotein in the viral
           envelope of the Herpesviruses and is the major antigen
           responsible for stimulating the production of
           neutralising antibodies in vivo.
          Length = 830

 Score = 29.4 bits (65), Expect = 6.2
 Identities = 19/69 (27%), Positives = 25/69 (36%), Gaps = 7/69 (10%)

Query: 149 TSIVSPPNTVAPPSSTTPLIPGITSP---PGTLQGGITSPVSIIPG----ATSPPNVPAA 201
           TS  +     A   +     P  TSP     +     TSP  I+ G    ATSPP    +
Sbjct: 489 TSRTTSATPNATSPTPAVTTPNATSPTTQKTSDTPNATSPTPIVIGVTTTATSPPTGTTS 548

Query: 202 ITKLTPPGP 210
           +   T P  
Sbjct: 549 VPNATSPQV 557


>gnl|CDD|165468 PHA03201, PHA03201, uracil DNA glycosylase; Provisional.
          Length = 318

 Score = 28.7 bits (64), Expect = 6.5
 Identities = 26/91 (28%), Positives = 31/91 (34%), Gaps = 20/91 (21%)

Query: 153 SPPNTVAPPSSTTPLIPGIT---SPPGTLQGGITSPVSIIPGATSPPNVPAAITKLTPPG 209
           SPP   +PP  T P  P  +   +PP     G   P    PG  + P  P    +  P G
Sbjct: 10  SPPRRPSPPRPTPPRSPDASPEETPPSPPGPGAEPP----PGRAAGPAAPRRRPRGCPAG 65

Query: 210 -----------PLTPDKPPAPVESP--WGAF 227
                      PL  D  PA    P  W  F
Sbjct: 66  VTFSSSAPPRPPLGLDDAPAATPPPLDWTEF 96


>gnl|CDD|184927 PRK14963, PRK14963, DNA polymerase III subunits gamma and tau;
           Provisional.
          Length = 504

 Score = 29.0 bits (65), Expect = 6.6
 Identities = 14/99 (14%), Positives = 23/99 (23%), Gaps = 6/99 (6%)

Query: 130 AKGGYLIVSGGGSGPSPLSTSIVSPPNTVAPPSSTTPLIPGITSPPGTLQGGITSPVSII 189
             G         + P+P    +    N +     +    P   +                
Sbjct: 342 LGGAPSEGVAAVAPPAPAPADLTQRLNRLEKEVRSLRSAPTAAATAAG-----APLPDFD 396

Query: 190 PGATSPPNVPAAITKLTPPGPLTPDKPPAPVESPWGAFI 228
           P    PP  P        P  + P   PA +   W   +
Sbjct: 397 PRPRGPPA-PEPARSAEAPPLVAPAAAPAGLALRWRDVL 434


>gnl|CDD|215079 PLN00151, PLN00151, potassium transporter; Provisional.
          Length = 852

 Score = 28.9 bits (65), Expect = 7.7
 Identities = 20/107 (18%), Positives = 32/107 (29%), Gaps = 22/107 (20%)

Query: 20  IGQTSFEDKRKEN---YEKGQAE-LER--RRKALELPHQSSFNRALCATTPHHETSSSA- 72
           I +  ++D RKEN   +E+   E LE+  RR+A E   +S  N           +     
Sbjct: 672 IARYGYKDVRKENHQAFEQLLIESLEKFIRREAQERALESDGNDDTDDEDSVTSSRVLIA 731

Query: 73  ---------------HNPGWDSTPGSSCSASHTSATNGPCHPSLTSP 104
                          +       P +S S   +         S    
Sbjct: 732 PNGSVYSLGVPLLADYRLTSKPIPEASTSEEVSPVLPSSSMSSDEDQ 778


>gnl|CDD|221827 pfam12881, NUT_N, NUT protein N terminus.  This family includes the
           NUT protein. The gene encoding for NUT protein (Nuclear
           Testis protein) is found fused to BRD3 or BRD4 genes, in
           some aggressive types of carcinoma, due to chromosomal
           translocations. Proteins of the BRD family contain two
           bromodomains that bind transcriptionally active
           chromatin through associations with acetylated histones
           H3 and H4. Such proteins are crucial for the regulation
           of cell cycle progression. On the other hand, little is
           known about NUT protein. NUT is known to have a Nuclear
           Export Sequence (NES) as well as a Nuclear Localization
           Signal (NLS), both located towards the C-terminal end of
           the protein. A fused NUT-GFP protein showed either
           cytoplasmic or nuclear localization, suggesting that it
           is subject to nuclear/cytoplasmic shuttling. Consistent
           with this possibility, treatment with leptomycin B an
           inhibitor of CRM1-dependent nuclear export resulted in
           re-distribution of NUT-GFP to the nucleus. Inspection of
           NUT revealed a C-terminal sequence similar to known
           nuclear export sequences (NES) which are often regulated
           by phosphorylation.
          Length = 328

 Score = 28.4 bits (63), Expect = 8.8
 Identities = 27/99 (27%), Positives = 36/99 (36%), Gaps = 10/99 (10%)

Query: 135 LIVSGGGSGPSPLSTS--IVSPPNTVAP--PSSTTPLI----PGITSPPGTLQGGITSPV 186
           L+  GGG G S       IV       P  PS T  L+    P   + PGT  GG+  P 
Sbjct: 63  LVTEGGGPGLSGAGAGNVIVQVKTEGGPVEPSQTQNLVLTQTPLNWNAPGTPCGGVEGPP 122

Query: 187 SIIPGATSPPNVPAAITKLTPP--GPLTPDKPPAPVESP 223
             +  +     +PA    ++ P  G      P  P   P
Sbjct: 123 PFVTASNVKTILPARAVGVSQPQEGTWILGLPLPPPPPP 161


>gnl|CDD|235577 PRK05718, PRK05718,
           keto-hydroxyglutarate-aldolase/keto-deoxy-
           phosphogluconate aldolase; Provisional.
          Length = 212

 Score = 27.9 bits (63), Expect = 9.7
 Identities = 10/33 (30%), Positives = 17/33 (51%), Gaps = 2/33 (6%)

Query: 169 PGITSPPGTLQGGITSPVSIIPGATSPPNVPAA 201
           PG+T  P  L+     P+ +IPG ++P  +   
Sbjct: 94  PGLT--PPLLKAAQEGPIPLIPGVSTPSELMLG 124


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.314    0.132    0.403 

Gapped
Lambda     K      H
   0.267   0.0908    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 24,745,555
Number of extensions: 2359214
Number of successful extensions: 2603
Number of sequences better than 10.0: 1
Number of HSP's gapped: 2350
Number of HSP's successfully gapped: 189
Length of query: 487
Length of database: 10,937,602
Length adjustment: 101
Effective length of query: 386
Effective length of database: 6,457,848
Effective search space: 2492729328
Effective search space used: 2492729328
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 61 (27.0 bits)