RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy14534
         (423 letters)



>gnl|CDD|238040 cd00087, FReD, Fibrinogen-related domains (FReDs); C terminal
           globular domain of fibrinogen. Fibrinogen is involved in
           blood clotting, being activated by thrombin to assemble
           into fibrin clots. The N-termini of 2 times 3 chains
           come together to form a globular arrangement called the
           disulfide knot. The C termini of fibrinogen chains end
           in globular domains, which are not completely
           equivalent. C terminal globular domains of the gamma
           chains (C-gamma) dimerize and bind to the GPR motif of
           the N-terminal domain of the alpha chain, while the GHR
           motif of N-terminal domain of the beta chain binds to
           the C terminal globular domains of another beta chain
           (C-beta), which leads to lattice formation.
          Length = 215

 Score =  239 bits (613), Expect = 9e-78
 Identities = 89/216 (41%), Positives = 117/216 (54%), Gaps = 26/216 (12%)

Query: 230 KLPNDC-------EGHSGLQLISL-----GRSVLCDEHH----WMTIQRRYNGMQEFNLK 273
            LP DC          SG+  I          V CD       W  IQRR +G  +F   
Sbjct: 1   PLPRDCSEVLQRGGRTSGVYTIQPPGSNEPFQVYCDMDTDGGGWTVIQRRGDGSVDFYRS 60

Query: 274 WTDYAQGFGSPESEFWIGNDALHRLTSQDNMSLRIEFVDIYGKAWYAEYDTFSVASESEG 333
           W +Y  GFG+ + EFW+G + +H LTSQ    LRI+  D  G   YAEYD+F V SESEG
Sbjct: 61  WKEYKDGFGNLDGEFWLGLEKIHLLTSQGPYELRIDLEDWEGNTAYAEYDSFKVGSESEG 120

Query: 334 YRLNLSGYHGNASDAMTYQNNMKFSTIDRDNDLSNTHCASNYEGGWWFSHCLHANLNGKF 393
           YRL L GY G A DA++Y N MKFST DRDND ++ +CA +Y GGWW++ C  +NLNG++
Sbjct: 121 YRLTLGGYSGTAGDALSYHNGMKFSTFDRDNDGASGNCAESYSGGWWYNSCHASNLNGRY 180

Query: 394 N---------LGLTWFHSEKNEWMAVARSHMKIRRR 420
                      G+ W  + K    ++  + MKIR +
Sbjct: 181 YSGGHRNEYDNGINW-ATWKGSTYSLKFTEMKIRPK 215


>gnl|CDD|214548 smart00186, FBG, Fibrinogen-related domains (FReDs).  Domain
           present at the C-termini of fibrinogen beta and gamma
           chains, and a variety of fibrinogen-related proteins,
           including tenascin and Drosophila scabrous.
          Length = 212

 Score =  226 bits (579), Expect = 1e-72
 Identities = 92/213 (43%), Positives = 119/213 (55%), Gaps = 24/213 (11%)

Query: 231 LPNDCEGH-------SGLQLISLGRS-----VLCD-EHH---WMTIQRRYNGMQEFNLKW 274
           LP DC          SGL  I    S     V CD E     W  IQRR +G  +F   W
Sbjct: 1   LPRDCSDVLQNGGKTSGLYTIYPDGSSRPLKVYCDMETDGGGWTVIQRRMDGSVDFYRDW 60

Query: 275 TDYAQGFGSPESEFWIGNDALHRLTSQDNMSLRIEFVDIYGKAWYAEYDTFSVASESEGY 334
            DY +GFG+   EFW+GN+ +H LTSQ    LRI+  D  G   YA YD+F VA E++GY
Sbjct: 61  KDYKEGFGNLAGEFWLGNENIHLLTSQGKYELRIDLEDWEGNTAYALYDSFKVADEADGY 120

Query: 335 RLNLSGYHGNASDA-MTYQNNMKFSTIDRDNDLSNTHCASNYEGGWWFSHCLHANLNGKF 393
           RL++ GY G A DA +TY N M+FST DRDND  + +CA  Y GGWW+++C  ANLNG++
Sbjct: 121 RLHIGGYSGTAGDASLTYHNGMQFSTYDRDNDKYSGNCAEEYGGGWWYNNCHAANLNGRY 180

Query: 394 ------NLGLTWFHSEKNEWMAVARSHMKIRRR 420
                 + G+ W    K  W ++  + MKIR  
Sbjct: 181 YPNNNYDNGINWATW-KGSWYSLKFTEMKIRPL 212


>gnl|CDD|215748 pfam00147, Fibrinogen_C, Fibrinogen beta and gamma chains,
           C-terminal globular domain. 
          Length = 221

 Score =  187 bits (476), Expect = 3e-57
 Identities = 89/221 (40%), Positives = 113/221 (51%), Gaps = 33/221 (14%)

Query: 232 PNDCEGH-------SGLQLISLGRS-----VLCD----EHHWMTIQRRYNGMQEFNLKWT 275
           P DC          SGL  I    +     V CD       W   QRR +G   F   W 
Sbjct: 2   PKDCSDILQNGGETSGLYTIQPDGANKPLRVYCDMETDGGGWTVFQRRQDGSVNFYRNWK 61

Query: 276 DYAQGFGSPES-EFWIGNDALHRLTSQDNMSLRIEFVDIYGKAWYAEYDTFSVASESEGY 334
           DY  GFG+  + EFW+GND +H LTSQ    LR+E  D  G+  YA YD F+V  E+  Y
Sbjct: 62  DYKAGFGNLATGEFWLGNDKIHLLTSQGPYELRVELEDWNGETVYALYDKFTVGDEANKY 121

Query: 335 RLNLSGYHGNASDA-------MTYQNNMKFSTIDRDNDLSNTHCASNYEGGWWFSHCLHA 387
           RL++SGY G A +A       MTY N M FST DRDND    +CA +Y GGWW+++C  A
Sbjct: 122 RLSVSGYSGTAGNALMTAGDSMTYHNGMFFSTYDRDNDSPRGNCALSYGGGWWYNNCHAA 181

Query: 388 NLNGKF--------NLGLTWFHSEKNEWMAVARSHMKIRRR 420
           NLNG++          G+ W    K  W ++  + MKIR R
Sbjct: 182 NLNGRYYYGGTYSWQNGVVWMTW-KGSWYSMKFAEMKIRPR 221


>gnl|CDD|113356 pfam04582, Reo_sigmaC, Reovirus sigma C capsid protein. 
          Length = 326

 Score = 39.1 bits (91), Expect = 0.003
 Identities = 42/170 (24%), Positives = 76/170 (44%), Gaps = 26/170 (15%)

Query: 45  EILQRISHLETSSQTNVKTLFNITKQISGVEHLHSSMMELLESIETIENKVDYNLPTIQR 104
           E++  I  L +S   N   L  I ++++ +E   S+   L ES+ TI +++      +  
Sbjct: 11  EVVGLILSLTSSVTINPGDLTPIYERLTNLE---SATASLSESVSTISSRLS----DLSA 63

Query: 105 EISKLEFGVAQINSTLQVVNETRESDANYLKSTVSTLATLQEKVEANNII---MQALSDA 161
           ++  +   +    +TL  +  T ++    + S  S LA L   V A++     +Q    A
Sbjct: 64  DLQDMTTSLDDAKATLNGLITTVQALQTSVDSLSSELADLSLTVSAHDAAISDLQTTVHA 123

Query: 162 NY-----LKSTVS----TLATLQEKVEANNIIMQALRNGRSEGLSLSSQL 202
           N      LKS+VS     +  L+++V       ++L +G S GLS S  L
Sbjct: 124 NSTDISNLKSSVSANGLNITDLEQRV-------KSLESGSSHGLSFSPPL 166


>gnl|CDD|224117 COG1196, Smc, Chromosome segregation ATPases [Cell division and
           chromosome partitioning].
          Length = 1163

 Score = 38.5 bits (90), Expect = 0.007
 Identities = 36/205 (17%), Positives = 83/205 (40%), Gaps = 9/205 (4%)

Query: 25  LEKEQHVINHKMKTLQAINEEILQRISHLETSSQTNVKTLFNITKQISGVEHLHSSMMEL 84
           LE++   +  ++  L+   E++  R+  LE   +   + L  + +++  +E    S+ E 
Sbjct: 714 LERQLEELKRELAALEEELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEA 773

Query: 85  L----ESIETIENKVDYN---LPTIQREISKLEFGVAQINSTLQVVNETRESDANYLKST 137
           L    E IE +E K       L  ++ E+ + E  +  +   L+ + + RE     ++  
Sbjct: 774 LAKLKEEIEELEEKRQALQEELEELEEELEEAERRLDALERELESLEQRRERLEQEIEEL 833

Query: 138 VSTLATLQEKVEANNIIMQALSDANYLKSTVSTLATLQEKVEANNIIMQALRNGRSEGLS 197
              +  L+EK++     ++ L     L+     L  L+ + E     ++ L   + E   
Sbjct: 834 EEEIEELEEKLDELEEELEELEKE--LEELKEELEELEAEKEELEDELKELEEEKEELEE 891

Query: 198 LSSQLDDSKNLSPETIMQQMNKIHD 222
              +L+       E I +   ++ +
Sbjct: 892 ELRELESELAELKEEIEKLRERLEE 916



 Score = 34.7 bits (80), Expect = 0.11
 Identities = 32/157 (20%), Positives = 67/157 (42%), Gaps = 9/157 (5%)

Query: 23  HHLEKEQHVINHKMKTLQAINEEILQRISHLETSSQTNVKTLFNITKQISGVEHLHSSMM 82
             LE+E   +  +++ LQ   EE+ + +  LE       + L  + ++I  +E    ++ 
Sbjct: 740 EELEEELEELEEELEELQERLEELEEELESLE-------EALAKLKEEIEELEEKRQALQ 792

Query: 83  ELLESIETIENKVDYNLPTIQREISKLEFGVAQINSTLQVVNETRESDANYLKSTVSTLA 142
           E LE +E    + +  L  ++RE+  LE    ++   ++ + E  E     L      L 
Sbjct: 793 EELEELEEELEEAERRLDALERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELE 852

Query: 143 TLQEKVEANNIIMQALSDANYLKSTVSTLATLQEKVE 179
            L++++E     ++ L      +     L  L+E+ E
Sbjct: 853 ELEKELEELKEELEELEAE--KEELEDELKELEEEKE 887



 Score = 32.4 bits (74), Expect = 0.54
 Identities = 25/138 (18%), Positives = 63/138 (45%), Gaps = 7/138 (5%)

Query: 18  FLKTKHHLEKEQHVINHKMKTLQAINEEILQRISHLETSSQTNVKTLFNITKQISGVEHL 77
             +    LE+ +  +  +++ L+   EE+ +++  LE   +   K L  + +++   E L
Sbjct: 812 LERELESLEQRRERLEQEIEELEEEIEELEEKLDELEEELEELEKELEELKEEL---EEL 868

Query: 78  HSSMMELLESIETIENKVDYNLPTIQREISKLEFGVAQINSTLQVVNETRESDANYLKST 137
            +   EL + ++ +E + +     ++ E+ +LE  +A++   ++ + E  E     L+  
Sbjct: 869 EAEKEELEDELKELEEEKE----ELEEELRELESELAELKEEIEKLRERLEELEAKLERL 924

Query: 138 VSTLATLQEKVEANNIIM 155
              L  L+E++E      
Sbjct: 925 EVELPELEEELEEEYEDT 942



 Score = 30.5 bits (69), Expect = 1.9
 Identities = 37/195 (18%), Positives = 76/195 (38%), Gaps = 21/195 (10%)

Query: 19  LKTKHH-LEKEQHVINHKMKTLQAINEEILQRISHLETSSQTNVKTLFNITKQISGVEHL 77
           LK++   L +E   +  ++  L+   EE+   IS L    +     L  + +++  ++  
Sbjct: 272 LKSELEELREELEELQEELLELKEEIEELEGEISLLRERLEELENELEELEERLEELKEK 331

Query: 78  HSSMMELLESIETI------------------ENKVDYNLPTIQREISKLEFGVAQINST 119
             ++ E LE  ET+                  E K+   L  ++     L   +A++ + 
Sbjct: 332 IEALKEELEERETLLEELEQLLAELEEAKEELEEKLSALLEELEELFEALREELAELEAE 391

Query: 120 LQVVNETRESDANYLKSTVSTLATLQEKVEANNIIMQALSDANYLKSTVSTLATLQEKVE 179
           L  +    E     ++S    L  L E++E     ++ L     L+   + L  L E++E
Sbjct: 392 LAEIRNELEELKREIESLEERLERLSERLEDLKEELKELEAE--LEELQTELEELNEELE 449

Query: 180 ANNIIMQALRNGRSE 194
                ++ LR+   E
Sbjct: 450 ELEEQLEELRDRLKE 464



 Score = 28.5 bits (64), Expect = 7.8
 Identities = 32/190 (16%), Positives = 78/190 (41%), Gaps = 12/190 (6%)

Query: 26  EKEQHVINHKMKTLQAINEEILQRISHLETSSQTNVKTLFNITKQISGVEHLHSSMMELL 85
           E E  ++  K+K L+   EE+ + +S LE   +   + L    K+I  ++     + E L
Sbjct: 224 ELELALLLAKLKELRKELEELEEELSRLEEELEELQEELEEAEKEIEELKSELEELREEL 283

Query: 86  ESIET-------IENKVDYNLPTIQREISKLEFGVAQINSTLQVVNETRESDANYLKSTV 138
           E ++           +++  +  ++  + +LE  + ++   L+ + E  E+    L+   
Sbjct: 284 EELQEELLELKEEIEELEGEISLLRERLEELENELEELEERLEELKEKIEALKEELEERE 343

Query: 139 STLATLQEKVEANNIIMQALSDAN--YLKSTVSTLATLQEKVEANNIIMQALRNGRSEGL 196
           + L  L++ +       + L +     L+        L+E++     +   L   R+E  
Sbjct: 344 TLLEELEQLLAELEEAKEELEEKLSALLEELEELFEALREELAE---LEAELAEIRNELE 400

Query: 197 SLSSQLDDSK 206
            L  +++  +
Sbjct: 401 ELKREIESLE 410


>gnl|CDD|179329 PRK01742, tolB, translocation protein TolB; Provisional.
          Length = 429

 Score = 34.5 bits (79), Expect = 0.10
 Identities = 24/68 (35%), Positives = 37/68 (54%), Gaps = 15/68 (22%)

Query: 178 VEANNIIMQALRNGRSEGLSLSSQLDDSKNLSPETIMQQMNKIHDSYLQVVGKLPNDCEG 237
           +  +N++ Q L +G +E LS S+ LD+S ++SP  IM     I  S  Q +GK+      
Sbjct: 350 INGDNVVKQDLTSGSTEVLS-STFLDESPSISPNGIM-----IIYSSTQGLGKV------ 397

Query: 238 HSGLQLIS 245
              LQL+S
Sbjct: 398 ---LQLVS 402


>gnl|CDD|233757 TIGR02168, SMC_prok_B, chromosome segregation protein SMC, common
           bacterial type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. This family
           represents the SMC protein of most bacteria. The smc
           gene is often associated with scpB (TIGR00281) and scpA
           genes, where scp stands for segregation and condensation
           protein. SMC was shown (in Caulobacter crescentus) to be
           induced early in S phase but present and bound to DNA
           throughout the cell cycle [Cellular processes, Cell
           division, DNA metabolism, Chromosome-associated
           proteins].
          Length = 1179

 Score = 34.6 bits (80), Expect = 0.10
 Identities = 37/209 (17%), Positives = 84/209 (40%), Gaps = 10/209 (4%)

Query: 25  LEKEQHVINHKMKTLQAINEEILQRISHLETSSQTNVKTLFNITKQISGVEHLHSSMMEL 84
           L KE   +  +++ L+   EE+ ++IS L          +  + ++I+ +    + +   
Sbjct: 703 LRKELEELEEELEQLRKELEELSRQISALRKDLARLEAEVEQLEERIAQLSKELTELEAE 762

Query: 85  LESIETIENKVDYNLPTIQREISKLEFGVAQINSTLQVVNETRESDANYLKSTVSTLATL 144
           +E +E    + +  L   + EI +LE  + Q+   L+ + E  +     L       A L
Sbjct: 763 IEELEERLEEAEEELAEAEAEIEELEAQIEQLKEELKALREALDELRAELTLLNEEAANL 822

Query: 145 QEKVEA--NNIIMQALSDANYLKSTVSTLATLQEKVEANNIIMQALRNGRSEGLSLSSQL 202
           +E++E+    I          L+     +  L E +E+    ++ L     E   L S+L
Sbjct: 823 RERLESLERRIAATERR----LEDLEEQIEELSEDIESLAAEIEELEELIEE---LESEL 875

Query: 203 DDSKNLSPETIMQQMNKIHDSYLQVVGKL 231
           +   N    ++ + +  +     ++  +L
Sbjct: 876 EALLNER-ASLEEALALLRSELEELSEEL 903



 Score = 31.6 bits (72), Expect = 0.92
 Identities = 35/186 (18%), Positives = 84/186 (45%), Gaps = 6/186 (3%)

Query: 25  LEKEQHVINHKMKTLQAINEEILQRISHLETSSQTNVKTLFNITKQISGVEHLHSSMMEL 84
            E+E   +  +++ L+   EE+   +S LE   +   K L+ +  +IS +E     + E 
Sbjct: 251 AEEELEELTAELQELEEKLEELRLEVSELEEEIEELQKELYALANEISRLEQQKQILRER 310

Query: 85  LESIETIENKVDYNLPTIQREISKLEFGVAQINSTLQVVNETRESDANYLKSTVSTLATL 144
           L ++E    +++  L  ++ ++ +L   +A++   L+ + E  ES    L+   + L  L
Sbjct: 311 LANLERQLEELEAQLEELESKLDELAEELAELEEKLEELKEELESLEAELEELEAELEEL 370

Query: 145 QEKVEANNIIMQALSDANYLKSTVSTLATLQEKVEANNIIMQAL----RNGRSEGLSLSS 200
           + ++E     ++ L     +      +A+L  ++E     ++ L       + E   L  
Sbjct: 371 ESRLEELEEQLETLRSK--VAQLELQIASLNNEIERLEARLERLEDRRERLQQEIEELLK 428

Query: 201 QLDDSK 206
           +L++++
Sbjct: 429 KLEEAE 434


>gnl|CDD|112109 pfam03279, Lip_A_acyltrans, Bacterial lipid A biosynthesis
           acyltransferase. 
          Length = 295

 Score = 33.4 bits (77), Expect = 0.19
 Identities = 20/58 (34%), Positives = 26/58 (44%), Gaps = 2/58 (3%)

Query: 165 KSTVSTLATLQEKVEANNIIMQALRNGRSEGLSLS--SQLDDSKNLSPETIMQQMNKI 220
           K+T ++ A L  K  A  I +   RNG   G +      LD +     E I Q MNKI
Sbjct: 217 KATTTSPAKLARKTGAAVIPVFPWRNGDGSGYTAIVHPALDLTPTEDVEQIAQAMNKI 274


>gnl|CDD|233758 TIGR02169, SMC_prok_A, chromosome segregation protein SMC,
           primarily archaeal type.  SMC (structural maintenance of
           chromosomes) proteins bind DNA and act in organizing and
           segregating chromosomes for partition. SMC proteins are
           found in bacteria, archaea, and eukaryotes. It is found
           in a single copy and is homodimeric in prokaryotes, but
           six paralogs (excluded from this family) are found in
           eukarotes, where SMC proteins are heterodimeric. This
           family represents the SMC protein of archaea and a few
           bacteria (Aquifex, Synechocystis, etc); the SMC of other
           bacteria is described by TIGR02168. The N- and
           C-terminal domains of this protein are well conserved,
           but the central hinge region is skewed in composition
           and highly divergent [Cellular processes, Cell division,
           DNA metabolism, Chromosome-associated proteins].
          Length = 1164

 Score = 33.5 bits (77), Expect = 0.22
 Identities = 18/96 (18%), Positives = 40/96 (41%), Gaps = 7/96 (7%)

Query: 35  KMKTLQAINEEILQRISHLETSSQTNVK--TLFNITKQISGVEHLHSSMMELLESIETIE 92
            ++ L  I +E  Q++  L    +   +   L    ++  G E L      L    E IE
Sbjct: 185 NIERLDLIIDEKRQQLERLRREREKAERYQALLKEKREYEGYELLKE-KEALERQKEAIE 243

Query: 93  NKVDYNLPTIQREISKLEFGVAQINSTLQVVNETRE 128
            ++     +++ E+ KL   ++++   L+ + +  E
Sbjct: 244 RQLA----SLEEELEKLTEEISELEKRLEEIEQLLE 275



 Score = 32.3 bits (74), Expect = 0.52
 Identities = 31/131 (23%), Positives = 66/131 (50%), Gaps = 7/131 (5%)

Query: 25  LEKEQHVINHKMKTLQAINEEILQRISHLETSSQTNVKTLFNITKQISGVEHLHSSMMEL 84
           +EKE   +  + + L+   EE+ + +S LE     NVK+   + +  + +E L   + +L
Sbjct: 721 IEKEIEQLEQEEEKLKERLEELEEDLSSLE-QEIENVKS--ELKELEARIEELEEDLHKL 777

Query: 85  LESIETIENKVDYN-LPTIQREISKLEFGVAQINSTLQVVN---ETRESDANYLKSTVST 140
            E++  +E ++ ++ +P IQ E+SKLE  V++I + L+ +         +  YL+  +  
Sbjct: 778 EEALNDLEARLSHSRIPEIQAELSKLEEEVSRIEARLREIEQKLNRLTLEKEYLEKEIQE 837

Query: 141 LATLQEKVEAN 151
           L   +  ++  
Sbjct: 838 LQEQRIDLKEQ 848


>gnl|CDD|235175 PRK03918, PRK03918, chromosome segregation protein; Provisional.
          Length = 880

 Score = 31.6 bits (72), Expect = 0.84
 Identities = 32/159 (20%), Positives = 63/159 (39%), Gaps = 13/159 (8%)

Query: 25  LEKEQHVINHKMKTLQAINEEILQRISHLETSS---QTNVKTLFNITKQISGVEHLHSSM 81
           LEKE   +    + L+    E+ +RI  L+      +  VK L    K++      +  +
Sbjct: 243 LEKELESLEGSKRKLEEKIRELEERIEELKKEIEELEEKVKEL----KELKEKAEEYIKL 298

Query: 82  MELLESIETIENKVDYNLPTIQREISKLEFGVAQINSTLQVVNETRESDANYLKSTVSTL 141
            E  E       +++  L  ++ EI+ +E  + ++    + + E ++     LK     L
Sbjct: 299 SEFYEEYLDELREIEKRLSRLEEEINGIEERIKELEEKEERLEELKKK----LKELEKRL 354

Query: 142 ATLQEKVEANNIIMQALSDANYLKSTVSTLATLQEKVEA 180
             L+E+ E          +   LK  ++ L    EK+E 
Sbjct: 355 EELEERHELYEEAKAKKEELERLKKRLTGLTP--EKLEK 391



 Score = 30.0 bits (68), Expect = 2.5
 Identities = 27/173 (15%), Positives = 64/173 (36%), Gaps = 26/173 (15%)

Query: 19  LKTKHHLEKEQHVINHKMKTLQAINEEILQRISHLETSSQTNVKTLFNITKQISGVEHLH 78
           L+    L+K+   +  K+  L+    E+L+ +  L   S   ++      +++  +E  +
Sbjct: 548 LEKLEELKKKLAELEKKLDELEEELAELLKELEELGFESVEELE------ERLKELEPFY 601

Query: 79  SSMMEL----------LESIETIENKVDYNLPTIQREISKLEFGVAQINSTLQVVN-ETR 127
           +  +EL           + ++ +E ++D     +     +LE    ++    +  + E  
Sbjct: 602 NEYLELKDAEKELEREEKELKKLEEELDKAFEELAETEKRLEELRKELEELEKKYSEEEY 661

Query: 128 ESDANYLKSTVSTLATLQEKVEANNIIMQALSDANYLKSTVSTLATLQEKVEA 180
           E            LA L+ ++E               +    TL  L+E++E 
Sbjct: 662 EELREEYLELSRELAGLRAELEELE---------KRREEIKKTLEKLKEELEE 705


>gnl|CDD|224428 COG1511, COG1511, Predicted membrane protein [Function unknown].
          Length = 780

 Score = 30.6 bits (69), Expect = 1.8
 Identities = 24/116 (20%), Positives = 46/116 (39%), Gaps = 5/116 (4%)

Query: 93  NKVDYNLPTIQREISKLEFGVAQINSTLQVVN---ETRESDANYLKSTVSTLATLQEKVE 149
           N++D  L T+   I +L+ G  Q+N  +   +       S    L + V  L      + 
Sbjct: 237 NQLDSGLGTLAAGIGELKQGAEQLNEGIGEFSSGLSELNSGVQDLAAGVPQLNQGISALA 296

Query: 150 ANNIIMQALSD--ANYLKSTVSTLATLQEKVEANNIIMQALRNGRSEGLSLSSQLD 203
           A   +  +L D  ++  ++       L++K  ++    Q   +     L+LS  LD
Sbjct: 297 AGLSLPDSLGDQFSSLQEALTQIAQGLKQKTSSSLEAAQGSLSSLQSMLALSKSLD 352


>gnl|CDD|226823 COG4388, COG4388, Mu-like prophage I protein [General function
           prediction only].
          Length = 357

 Score = 30.2 bits (68), Expect = 1.9
 Identities = 30/124 (24%), Positives = 47/124 (37%), Gaps = 10/124 (8%)

Query: 85  LESIETIENKVDYNLPTIQREISKLEFGVAQINSTLQVVNETRESDANYLKSTVSTLATL 144
           L   ET+    D   P +  E+SK +          +   +        LKS V+ L+  
Sbjct: 207 LTLAETLAAPKDEKDPQVA-ELSKGDAAKPTDKDLTKYAPD---GTVAELKSQVAALSAK 262

Query: 145 QEKVEANNIIMQALSDANYLKSTVSTLATL--QEKVEANNIIMQALRNGRSEGLSLSSQL 202
              +E + +I  ALS+   L +       L  Q+ V A    + AL +GR    +LS   
Sbjct: 263 VIAMEVDGLIRAALSEGRLLPAQEEYAKQLGKQDGVAA----LSALIDGRQPVAALSKTQ 318

Query: 203 DDSK 206
              K
Sbjct: 319 TGGK 322


>gnl|CDD|177949 PLN02315, PLN02315, aldehyde dehydrogenase family 7 member.
          Length = 508

 Score = 30.2 bits (68), Expect = 2.3
 Identities = 23/94 (24%), Positives = 41/94 (43%), Gaps = 7/94 (7%)

Query: 76  HLHSSMMELLESIETIEN---KVDYNLPTIQREISKLEFGVAQINSTLQVVNETRESDAN 132
           H   S     + IE I++   K+      I+ E + ++  + +I+    VV E       
Sbjct: 343 HTPESKKNFEKGIEIIKSQGGKILTGGSAIESEGNFVQPTIVEISPDADVVKEELFGPVL 402

Query: 133 YLKSTVSTLATLQEKVEANNIIMQALSDANYLKS 166
           Y    V    TL+E +E NN + Q LS + + ++
Sbjct: 403 Y----VMKFKTLEEAIEINNSVPQGLSSSIFTRN 432


>gnl|CDD|218408 pfam05062, RICH, RICH domain.  This presumed domain is about 85
          residues in length and very rich in charged residues,
          hence the name RICH (Rich In CHarged residues). It is
          found in secreted proteins such as PspC, SpsA, and IgA
          FC receptor from Streptococcus agalactiae. This domain
          could be involved in bacterial adherence or cell wall
          binding.
          Length = 81

 Score = 27.7 bits (62), Expect = 2.7
 Identities = 13/45 (28%), Positives = 21/45 (46%), Gaps = 1/45 (2%)

Query: 16 KLFLKTKHHLEKEQHVINHKM-KTLQAINEEILQRISHLETSSQT 59
          K+  +    L+K +H  N  + K L AI  E L  +  L+T  + 
Sbjct: 17 KILSEIYKQLDKRKHTQNVALIKKLSAIKTEYLYELDVLKTKVEL 61


>gnl|CDD|224495 COG1579, COG1579, Zn-ribbon protein, possibly nucleic acid-binding
           [General function prediction only].
          Length = 239

 Score = 29.6 bits (67), Expect = 2.8
 Identities = 28/127 (22%), Positives = 54/127 (42%), Gaps = 9/127 (7%)

Query: 59  TNVKTLF---NITKQISGVEHLHSSMMELLESIETIENKVDYNLPTIQREISKLEFGVAQ 115
            N+K+L     +  +   +E     + + L+  +     ++  L  ++ E+  LE  V+Q
Sbjct: 4   NNLKSLLAIQKLDLEKDRLEPRIKEIRKALKKAKAELEALNKALEALEIELEDLENQVSQ 63

Query: 116 INSTLQVVNETRESDANYLKSTVSTLATLQEKVEANNIIMQAL-SDANYLKSTVSTLATL 174
           + S +Q + E R   A    S V      + ++ A NI +Q      N L+  ++ L   
Sbjct: 64  LESEIQEIRE-RIKRAEEKLSAVKD----ERELRALNIEIQIAKERINSLEDELAELMEE 118

Query: 175 QEKVEAN 181
            EK+E  
Sbjct: 119 IEKLEKE 125


>gnl|CDD|224241 COG1322, COG1322, Predicted nuclease of restriction
           endonuclease-like fold, RmuC family [General function
           prediction only].
          Length = 448

 Score = 29.3 bits (66), Expect = 4.3
 Identities = 27/137 (19%), Positives = 55/137 (40%), Gaps = 9/137 (6%)

Query: 24  HLEKEQHVINHKMKTLQAINEEILQRISHLETSSQTNVKTLFNITKQISGVEHLHSSMME 83
               E   +    ++LQA+N E++Q ++ L+   Q  +       +    +E L     E
Sbjct: 49  AFRAEAEQLRTFARSLQALNLELIQELNELKARLQQQLLQS--REQLQLLIESLAQLSSE 106

Query: 84  LLESIETIENKVDYNLPTIQREISKLEFGVAQINSTLQVVNET-RESDANYLKSTVSTLA 142
             E    I  +++  L  + ++  K      Q+   L+ V E  RE     +  +    +
Sbjct: 107 FQELANEIFEELNRRLAELNQQNLK------QLLKPLREVLEKFREQLEQRIHESAEERS 160

Query: 143 TLQEKVEANNIIMQALS 159
           TL E+++     +Q L+
Sbjct: 161 TLLEEIDRLLGEIQQLA 177


>gnl|CDD|221893 pfam13010, pRN1_helical, Primase helical domain.  This alpha
          helical domain is found in a set of bacterial plasmid
          replication proteins. The domain is found to the
          C-terminus of the primase/polymerase domain. Mutants of
          this domain are defective in template binding,
          dinucleotide formation and conformation change prior to
          DNA extension.
          Length = 135

 Score = 28.2 bits (63), Expect = 4.4
 Identities = 11/37 (29%), Positives = 19/37 (51%)

Query: 35 KMKTLQAINEEILQRISHLETSSQTNVKTLFNITKQI 71
          K KT++AI EE+ ++I       +   K + N  K +
Sbjct: 27 KGKTVEAIREEVCKKIKKSLEELKEKKKAILNTAKGV 63


>gnl|CDD|212500 cd11641, Precorrin-4_C11-MT, Precorrin-4 C11-methyltransferase
           (CbiF/CobM).  Precorrin-4 C11-methyltransferase
           participates in the pathway toward the biosynthesis of
           cobalamin (vitamin B12). There are two distinct
           cobalamin biosynthetic pathways in bacteria. The aerobic
           pathway requires oxygen, and cobalt is inserted late in
           the pathway; the anaerobic pathway does not require
           oxygen, and cobalt insertion is the first committed step
           towards cobalamin synthesis. In the aerobic pathway,
           CobM catalyzes the methylation of precorrin-4 at C-11 to
           yield precorrin-5. In the anaerobic pathway, CibF
           catalyzes the methylation of cobalt-precorrin-4 to
           cobalt-precorrin-5. Both CibF and CobM, which are
           homologous, are included in this model. There are about
           30 enzymes involved in vitamin B12 synthetic pathway.
           The enzymes involved in the aerobic pathway are prefixed
           Cob and those of the anaerobic pathway Cbi. Most of the
           enzymes are shared in both pathways and several of these
           enzymes are pathway-specific.
          Length = 228

 Score = 28.6 bits (65), Expect = 5.7
 Identities = 9/19 (47%), Positives = 11/19 (57%)

Query: 140 TLATLQEKVEANNIIMQAL 158
           TLA + EKV+   I   AL
Sbjct: 202 TLADIAEKVKEAGITRTAL 220



 Score = 28.6 bits (65), Expect = 5.7
 Identities = 9/19 (47%), Positives = 11/19 (57%)

Query: 170 TLATLQEKVEANNIIMQAL 188
           TLA + EKV+   I   AL
Sbjct: 202 TLADIAEKVKEAGITRTAL 220


>gnl|CDD|218403 pfam05055, DUF677, Protein of unknown function (DUF677).  This
           family consists of AT14A like proteins from Arabidopsis
           thaliana. At14a has a small domain that has sequence
           similarities to integrins from fungi, insects and
           humans. Transcripts of At14a are found in all
           Arabidopsis tissues and localises partly to the plasma
           membrane.
          Length = 336

 Score = 28.6 bits (64), Expect = 6.2
 Identities = 24/70 (34%), Positives = 38/70 (54%), Gaps = 7/70 (10%)

Query: 44  EEILQR----ISHLETSSQTNVKTLFNITKQISGVEHLHSSMMELLE-SIETIENKVDYN 98
           EE L+R    IS +E   Q NVK + NI+  +  +E   +SM++ +E ++E  EN+V   
Sbjct: 235 EEALKRQKEIISSMEKGIQVNVKDMENISILVDRLESEITSMLKTVEFAVEHEENEVAVR 294

Query: 99  LPTIQREISK 108
           +     EI K
Sbjct: 295 I--AMDEIKK 302


>gnl|CDD|224476 COG1560, HtrB, Lauroyl/myristoyl acyltransferase [Cell envelope
           biogenesis, outer membrane].
          Length = 308

 Score = 28.4 bits (64), Expect = 6.6
 Identities = 13/58 (22%), Positives = 22/58 (37%), Gaps = 2/58 (3%)

Query: 165 KSTVSTLATLQEKVEANNIIMQALRNGRSEG--LSLSSQLDDSKNLSPETIMQQMNKI 220
            +T +  A L     A  + +  +RN    G  L +   + D  +   E   Q+MN  
Sbjct: 220 AATTTGPAKLARLTGAAVVPVFPVRNPDGSGYTLHIHPPMTDDPSEDVEADAQRMNDF 277


>gnl|CDD|237582 PRK14011, PRK14011, prefoldin subunit alpha; Provisional.
          Length = 144

 Score = 27.8 bits (62), Expect = 6.8
 Identities = 16/52 (30%), Positives = 30/52 (57%), Gaps = 7/52 (13%)

Query: 42 INEEILQRISHLETSSQTNVKTLFNITKQISGVEHLHSSMMELLESIETIEN 93
          +NEE+  +   LE  +Q   K    + +++S ++ +    MELL+SIE++E 
Sbjct: 1  MNEELQNQFMALEVYNQQVQK----LQEELSSIDMM---KMELLKSIESMEG 45


>gnl|CDD|224414 COG1497, COG1497, Predicted transcriptional regulator
          [Transcription].
          Length = 260

 Score = 28.1 bits (63), Expect = 7.8
 Identities = 22/78 (28%), Positives = 35/78 (44%), Gaps = 9/78 (11%)

Query: 23 HHLEKEQHVINHKMK-TLQAINEEI--LQRISHLETSSQTNVKTLFNITKQISGVEHLHS 79
               +Q  I  K+  TLQA++E I  L +   +E   +      + ITK+  G E L  
Sbjct: 22 RQPRVKQKEIAKKLGITLQAVSEHIKELVKEGLIEKEGRGE----YEITKK--GAEWLLE 75

Query: 80 SMMELLESIETIENKVDY 97
           + +L    E +E  +DY
Sbjct: 76 QLSDLRRFSEEVELVLDY 93


>gnl|CDD|112953 pfam04163, Tht1, Tht1-like nuclear fusion protein. 
          Length = 554

 Score = 28.5 bits (63), Expect = 8.2
 Identities = 41/147 (27%), Positives = 55/147 (37%), Gaps = 24/147 (16%)

Query: 98  NLPTIQREISKLEFGVAQINSTLQVVNETRESDANYLKSTVSTLATLQEKVEANNIIMQA 157
           N   I   I +LE  +  INS     N   ES        +  LA   E    N I + A
Sbjct: 244 NQDEILETIEQLESEIFDINS--FFANFIEESAG-VHNHIIEDLA--DEFASENAIALSA 298

Query: 158 LSD-----ANYLKSTVSTLAT-----LQEKVEAN----NIIMQALRN---GRSEGL-SLS 199
           + D       +L+  +  L       L+E V+ N    N   Q L +   G  E L  L 
Sbjct: 299 IGDLKDDFLAFLEDRIKNLIEMSEDSLKESVQRNIDFVNSGFQELEDFSIGLKEELGGLK 358

Query: 200 SQLDDSKNLSPETIMQQMNKI-HDSYL 225
             L + +NL  E I+Q  N   HDS  
Sbjct: 359 KDLSEQQNLEAEEILQGFNDFLHDSKD 385


>gnl|CDD|116500 pfam07888, CALCOCO1, Calcium binding and coiled-coil domain
           (CALCOCO1) like.  Proteins found in this family are
           similar to the coiled-coil transcriptional coactivator
           protein coexpressed by Mus musculus (CoCoA/CALCOCO1).
           This protein binds to a highly conserved N-terminal
           domain of p160 coactivators, such as GRIP1, and thus
           enhances transcriptional activation by a number of
           nuclear receptors. CALCOCO1 has a central coiled-coil
           region with three leucine zipper motifs, which is
           required for its interaction with GRIP1 and may regulate
           the autonomous transcriptional activation activity of
           the C-terminal region.
          Length = 546

 Score = 28.0 bits (62), Expect = 9.9
 Identities = 23/132 (17%), Positives = 52/132 (39%), Gaps = 18/132 (13%)

Query: 22  KHHLEKEQHVINHKMKTLQAINEEILQ---RISHLETSSQTNVKTLFNITKQISGVEHLH 78
           ++ LE  Q   N  MK   A+  ++     RI  LET+ + + +    + +Q    +   
Sbjct: 142 QNQLEGCQRERNDLMKKFLALEGDVSDMRSRIEQLETALRHSTEKTEELEEQHKEAQSSS 201

Query: 79  SSMMELLESIETIENKVDYNLPTIQREISKLEFGVAQINSTLQVVNETRESDANYLKSTV 138
            SM              + N   +  + ++ +  + ++   +Q + + ++ +   L+ T 
Sbjct: 202 ESMSA------------ERNA--LLAQRAENQQRILELEQDIQTLTQKKQENDRVLEGTQ 247

Query: 139 STLATL-QEKVE 149
              A L + K E
Sbjct: 248 DIEAELERMKGE 259


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.315    0.128    0.373 

Gapped
Lambda     K      H
   0.267   0.0746    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 20,465,612
Number of extensions: 1881489
Number of successful extensions: 1907
Number of sequences better than 10.0: 1
Number of HSP's gapped: 1884
Number of HSP's successfully gapped: 86
Length of query: 423
Length of database: 10,937,602
Length adjustment: 100
Effective length of query: 323
Effective length of database: 6,502,202
Effective search space: 2100211246
Effective search space used: 2100211246
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 60 (26.9 bits)