RPS-BLAST 2.2.26 [Sep-21-2011]

Database: CDD.v3.10 
           44,354 sequences; 10,937,602 total letters

Searching..................................................done

Query= psy15988
         (718 letters)



>gnl|CDD|219609 pfam07856, Orai-1, Mediator of CRAC channel activity.  ORAI-1 is a
           protein homologue of Drosophila Orai and human Orai1,
           Orai2 and Orai3. ORAI-1 GFP reporters are co- expressed
           with STIM-1 (ER CA(2+) sensors) in the gonad and
           intestine. The protein has four predicted transmembrane
           domains with a highly conserved region between TM2 ad
           TM3. This conserved domain is thought to function in
           channel regulation. ORAI1- related proteins are required
           for the production of the calcium channel, CRAC, along
           with STIM1-related proteins.
          Length = 175

 Score =  231 bits (591), Expect = 5e-72
 Identities = 89/167 (53%), Positives = 117/167 (70%)

Query: 271 DSEGLVWRRLHMSRAKLKATATTSELLSGFAMVAMVELQINEPTNVPEWLFIMFAVCTTV 330
             E L  R L +SRA+LKA++ TS LL+GFAMVA+VELQ ++P   PE L I FAV T +
Sbjct: 4   RLEDLEQRHLELSRAQLKASSRTSALLAGFAMVALVELQFSDPEVYPEGLLIAFAVVTAL 63

Query: 331 LVAVHIFALMISTYLLPNIEAISKLQSTKLVSESPHERMRGFVELAWAFSTVLGLFLFLV 390
           LVAVH+ ALMIST +LPNIE ++ + +  L  ESPH R   + EL W FST +G+ LFLV
Sbjct: 64  LVAVHLLALMISTCILPNIECVTFVHNYNLTPESPHRRFERYCELDWRFSTRMGIPLFLV 123

Query: 391 EVAILCWVKFWDYSLRAAIAATIIVIPVLIIFVVFVVHFYHNVVKYK 437
           E+A+L WVKF+D    AAI  T I++PVLI+FV F  HF+  ++ +K
Sbjct: 124 ELALLAWVKFYDSGPVAAIIVTAILVPVLIVFVAFTQHFWGELLSHK 170



 Score =  208 bits (532), Expect = 2e-63
 Identities = 89/215 (41%), Positives = 117/215 (54%), Gaps = 48/215 (22%)

Query: 480 DSEGLVWRRLHMSRAKLKATATTSELLSGFAMVAMVELQINEPTNVPEWLFIMFAVCTTV 539
             E L  R L +SRA+LKA++ TS LL+GFAMVA+VELQ ++P   PE L I FAV T +
Sbjct: 4   RLEDLEQRHLELSRAQLKASSRTSALLAGFAMVALVELQFSDPEVYPEGLLIAFAVVTAL 63

Query: 540 LVAVHIFALMISTYLLPGNDVAMVELQINEPTNVPEWLFIMFAVCTTVLVAVHIFALMIS 599
           LVAVH+ ALMIST +LP                                           
Sbjct: 64  LVAVHLLALMISTCILP------------------------------------------- 80

Query: 600 TYLLPNIEAISKLQSTKLVSESPHERMRGFVELAWAFSTVLGLFLFLVEVAILCWVKFWD 659
                NIE ++ + +  L  ESPH R   + EL W FST +G+ LFLVE+A+L WVKF+D
Sbjct: 81  -----NIECVTFVHNYNLTPESPHRRFERYCELDWRFSTRMGIPLFLVELALLAWVKFYD 135

Query: 660 YSLRAAIAATIIVIPVLIIFVVFVVHFYHNVVKYK 694
               AAI  T I++PVLI+FV F  HF+  ++ +K
Sbjct: 136 SGPVAAIIVTAILVPVLIVFVAFTQHFWGELLSHK 170


>gnl|CDD|217835 pfam03999, MAP65_ASE1, Microtubule associated protein (MAP65/ASE1
           family). 
          Length = 619

 Score = 44.8 bits (106), Expect = 2e-04
 Identities = 26/144 (18%), Positives = 45/144 (31%), Gaps = 18/144 (12%)

Query: 138 SRTQTASKRGALSHPRTPTTLLSKQGSHPLTPSSR--------------YPTTPTSGGHN 183
           +  +        S P TP+T  + +     TPS +                 + ++G  +
Sbjct: 455 TVMEPPYGSTESSVPSTPSTRRNDRNITSNTPSLKRTPNLTKSSLSQEASLISKSTGNTH 514

Query: 184 PSYSSSGSEASIEPRTPTVSKSPLGRTFSAGSHMSLPSLSPLPCHQCQHSPPTWSPYILP 243
               S+    +  P+ P  S+S  G    +G++ +  S    P      SP       L 
Sbjct: 515 --KHSTPRRLTTLPKLPAASRSSKGNLIRSGANGNASSDLSSPGSINSKSPEH--SVPLV 570

Query: 244 MNTDHHPRLYKNHSRRDLGHHYEE 267
              D H R      R       E+
Sbjct: 571 RVFDIHLRASTTKGRHSTPSTNEK 594


>gnl|CDD|224213 COG1294, AppB, Cytochrome bd-type quinol oxidase, subunit 2 [Energy
           production and conversion].
          Length = 346

 Score = 35.7 bits (83), Expect = 0.079
 Identities = 27/180 (15%), Positives = 64/180 (35%), Gaps = 33/180 (18%)

Query: 527 EWLFIMFAVCTTVLVAVHIFALMISTYLL-----PGNDVAMVELQINEPTNVPEWLFIMF 581
           + L   FA+   + + + ++ L  + +LL        + A    +I     +    F++F
Sbjct: 161 DQLLNPFALLCGLGLVL-MYVLHGAAWLLLKTEGALQERARKLARIAALLTLV--GFLLF 217

Query: 582 AVCTTVLVAVHIFALMIST---YLLPNIEAISKLQSTKLVSESPHERMRGFVELAWAFST 638
            V  T  +     +   S    +L   +  +  L +  L         +G  +   AF  
Sbjct: 218 GVWVTPGLDGFAASRWFSLPWLFLFSLLPVVGALLAVLLKR-------KGRGKEGLAFVL 270

Query: 639 VLGLFLFLVE-VAILCW---------VKFWD-----YSLRAAIAATIIVIPVLIIFVVFV 683
              LFL  +  + I  +         +  WD      +L   +   ++ +P+++ + ++ 
Sbjct: 271 TSLLFLLAILGLGISLFPYVLPSSVSLTIWDAASSPLTLSVMLVVALVFLPIVLAYTIWS 330



 Score = 31.4 bits (72), Expect = 1.5
 Identities = 20/111 (18%), Positives = 38/111 (34%), Gaps = 25/111 (22%)

Query: 318 EWLFIMFAVCTTVLVAVHIFALMISTYLLPNIEAISKLQSTKLVSESP-HERMRGFVELA 376
           + L   FA+   + + + ++ L  + +LL             L +E    ER R      
Sbjct: 161 DQLLNPFALLCGLGLVL-MYVLHGAAWLL-------------LKTEGALQERAR------ 200

Query: 377 WAFSTVLGLFLFLVEVAILCWVKFWDYSLRAAIAATIIVIPVLIIFVVFVV 427
              + +  L   +  +    WV      L    A+    +P L +F +  V
Sbjct: 201 -KLARIAALLTLVGFLLFGVWVTPG---LDGFAASRWFSLPWLFLFSLLPV 247



 Score = 31.4 bits (72), Expect = 1.5
 Identities = 20/111 (18%), Positives = 38/111 (34%), Gaps = 25/111 (22%)

Query: 575 EWLFIMFAVCTTVLVAVHIFALMISTYLLPNIEAISKLQSTKLVSESP-HERMRGFVELA 633
           + L   FA+   + + + ++ L  + +LL             L +E    ER R      
Sbjct: 161 DQLLNPFALLCGLGLVL-MYVLHGAAWLL-------------LKTEGALQERAR------ 200

Query: 634 WAFSTVLGLFLFLVEVAILCWVKFWDYSLRAAIAATIIVIPVLIIFVVFVV 684
              + +  L   +  +    WV      L    A+    +P L +F +  V
Sbjct: 201 -KLARIAALLTLVGFLLFGVWVTPG---LDGFAASRWFSLPWLFLFSLLPV 247


>gnl|CDD|227709 COG5422, ROM1, RhoGEF, Guanine nucleotide exchange factor for
           Rho/Rac/Cdc42-like GTPases [Signal transduction
           mechanisms].
          Length = 1175

 Score = 36.0 bits (83), Expect = 0.083
 Identities = 47/237 (19%), Positives = 77/237 (32%), Gaps = 32/237 (13%)

Query: 25  DNIKKEQLDTLTKLELEHNLNPI--------RLRTSSHSSVKRRLLTRTSSQASLTSRTG 76
           D I  E  ++  K  L H +           + R S+         + TSS +SL S  G
Sbjct: 53  DIINSESKESFGKYALGHQIFSSFSSSPKLFQRRNSAGPITHSP--SATSSTSSLNSNDG 110

Query: 77  RSDLRPRS-----GLERPILRVNTSPYPATKGTPHFTKKSFTSADLSRTVTHPYSRTQ-- 129
                P S            R ++ P     G+P   +K+      S   THP       
Sbjct: 111 DQ-FSPASDSLSFNPSSTQSRKDSGP---GDGSPVQKRKNPLLPSSSTHGTHPPIVFTDN 166

Query: 130 ---SVQSLRDVSRTQTASKRGALSHPRTPTTLLSKQGSHPLTPSSRYPTTPTSGGHNPSY 186
                 +    SR +  S         +P        S      S YP+   +  H+ + 
Sbjct: 167 NGSHAGAPNARSRKEIPSLGSQSMQLPSPHFRQKFSSSDTSNGFS-YPSIRKNSRHSSNS 225

Query: 187 SSSGSEASIEPRTPTVSKSPLGRTFSAGSHMSLPSLSPLPCHQCQHSPPTWSPYILP 243
             S   +S    T  + K     + S+G+ +   +++P   +    S  +  PYI P
Sbjct: 226 MPSFPHSS----TAVLLKR---HSGSSGASLISSNITPSSSNSEAMSTSSKRPYIYP 275


>gnl|CDD|223031 PHA03273, PHA03273, envelope glycoprotein C; Provisional.
          Length = 486

 Score = 35.7 bits (82), Expect = 0.094
 Identities = 18/87 (20%), Positives = 37/87 (42%), Gaps = 5/87 (5%)

Query: 141 QTASKRGALSHPRTPTTLLSKQGSHPLTPS--SRYPTTPTSGGHNPSYSSSGSEASIEPR 198
             AS     S         ++  S P TP+  +   T+P   G + S +++G+E++ +  
Sbjct: 23  TYASGASTSSSIENSDNSTAEMQSTPATPTHTTSNLTSPFGTGTDNSTNANGTESTTQAS 82

Query: 199 TPTVSKSPLGRTFSAGSHMSLPSLSPL 225
            P   ++ +  T    S +S+P    +
Sbjct: 83  QPHSHETTITCT---KSLISVPYYKSV 106


>gnl|CDD|219805 pfam08347, CTNNB1_binding, N-terminal CTNNB1 binding.  This region
           tends to appear at the N-terminus of proteins also
           containing DNA-binding HMG (high mobility group) boxes
           (pfam00505) and appears to bind the armadillo repeat of
           CTNNB1 (beta-catenin), forming a stable complex.
           Signaling by Wnt through TCF/LCF is involved in
           developmental patterning, induction of neural tissues,
           cell fate decisions and stem cell differentiation.
           Isoforms of HMG T-cell factors lacking the N-terminal
           CTNNB1-binding domain cannot fulfill their role as
           transcriptional activators in T-cell differentiation.
          Length = 200

 Score = 33.7 bits (77), Expect = 0.18
 Identities = 23/120 (19%), Positives = 37/120 (30%), Gaps = 13/120 (10%)

Query: 128 TQSVQSLR--DVSRTQTASKRGAL--SHPRTPTTLLSKQGSHPLTPSSRYPTTPTSGGHN 183
           ++S +S     +SR     +  +           +  K           Y   P     N
Sbjct: 52  SESSESSSSEAISRRDHPPRYESYRDKSREKLEDVSHKHQDMGYYKGPPYSGYPFLMLPN 111

Query: 184 PSYSSSGSEASIEPRTPTVSKSPLGRTFSAGSHMSLPSLSPLPCHQCQHSPPTWSPYILP 243
           P Y  +GS + + P +  V             H     L+PL  +  +H  P   P  LP
Sbjct: 112 P-YLPNGSLSPLPPSSNKV----PVVQPPHHVH----PLTPLITYSNEHFSPGTPPPHLP 162


>gnl|CDD|118131 pfam09595, Metaviral_G, Metaviral_G glycoprotein.  This is a viral
           attachment glycoprotein from region G of metaviruses. It
           is high in serine and threonine suggesting it is highly
           glycosylated.
          Length = 183

 Score = 33.5 bits (76), Expect = 0.21
 Identities = 24/99 (24%), Positives = 37/99 (37%), Gaps = 2/99 (2%)

Query: 91  LRVNTSPYPATKGTPHFTKKSFTSADLSRTVTHPYSRTQSVQSLRDVSRTQTASKRGALS 150
           L    +  P T        +  T    SRT T      +  ++ R  SRTQT       +
Sbjct: 79  LETEPTSTPDTTNRQQTVDRHTTPPSSSRTQTTQAVHEK--KNTRTTSRTQTPPTTSTAA 136

Query: 151 HPRTPTTLLSKQGSHPLTPSSRYPTTPTSGGHNPSYSSS 189
              T TT  S  G  P T S +  ++ T+  H  +  ++
Sbjct: 137 VQTTTTTNTSSTGKEPTTTSVQPRSSATTQSHEETSQAN 175


>gnl|CDD|218439 pfam05109, Herpes_BLLF1, Herpes virus major outer envelope
           glycoprotein (BLLF1).  This family consists of the BLLF1
           viral late glycoprotein, also termed gp350/220. It is
           the most abundantly expressed glycoprotein in the viral
           envelope of the Herpesviruses and is the major antigen
           responsible for stimulating the production of
           neutralising antibodies in vivo.
          Length = 830

 Score = 34.4 bits (78), Expect = 0.22
 Identities = 38/206 (18%), Positives = 64/206 (31%), Gaps = 32/206 (15%)

Query: 49  LRTSSHSSVKRRLLTRTSSQASLTSRTGRS------DLRPRSGLERPILRVNTSPYPA-T 101
           + T+          T +++  +  + TG +      D  P S           +  P   
Sbjct: 449 VPTTPSLPPASTGPTVSTADPTSGTPTGTTSSTLPEDTSPTSRTTSA---TPNATSPTPA 505

Query: 102 KGTPHFTKKSFTSADLSRTVTHPYSRTQSVQSLRDVSRTQTASKRGALSHPRTPTTLLSK 161
             TP+ T  +      +   T P      V +      T T S   A S   T  + ++ 
Sbjct: 506 VTTPNATSPTTQKTSDTPNATSPTPIVIGVTTTATSPPTGTTSVPNATSPQVTEESPVNN 565

Query: 162 QGSHPLT-PSSRYPTTPTSGGHNPSYSSSGSEASIEPRT--------------------P 200
             +  +T   S   +  T+G H    SS  S+    P +                    P
Sbjct: 566 TNTPVVTSAPSVLTSAVTTGQHGTG-SSPTSQQPGIPSSSHSTPRSNSTSTTPLLTSAHP 624

Query: 201 TVSKSPLGRTFSAGSHMSLPSLSPLP 226
           T  ++    T S  S   + +LSP P
Sbjct: 625 TGGENITEETPSVPSTTHVSTLSPGP 650



 Score = 31.7 bits (71), Expect = 1.5
 Identities = 39/231 (16%), Positives = 72/231 (31%), Gaps = 27/231 (11%)

Query: 34  TLTKLELEHNLNPIRLRTSSHSSVKRRLLTRTSSQASLTSRTGRSDLRPRSGLERPILRV 93
                 + +  +P     S  ++    ++T   S  +    TG+         ++P +  
Sbjct: 543 PTGTTSVPNATSPQVTEESPVNNTNTPVVTSAPSVLTSAVTTGQHGTGSSPTSQQPGIPS 602

Query: 94  NTSPYPATKGTPHFTKKSFTSADLSRTVTHPYSRTQSVQSLRDVSRTQTASKRGALSHPR 153
           ++   P +  T      +         +T     T SV S   VS        G  S   
Sbjct: 603 SSHSTPRSNSTSTTPLLTSAHPTGGENITE---ETPSVPSTTHVSTLSPGPGPGTTSQVS 659

Query: 154 TPTTLLSKQGSHPLTPSSRYP--TTPTSGGHNPSYSSSGSEASIEPRTPTVSKSPLGRTF 211
            P            + +SRYP     T G  NP+ +S  + +  +   PTV+ +      
Sbjct: 660 GPGN----------SSTSRYPGEVHVTEGMPNPNATSPSAPSGQKTAVPTVTSTGGKANS 709

Query: 212 --------SAGSHMSLPS----LSPLPCHQCQHSPPTWSPYILPMNTDHHP 250
                   +  +  S  +        P +   + PP+ S  + P  T   P
Sbjct: 710 TTKETSGSTLMASTSPHTNEGAFRTTPYNATTYLPPSTSSKLRPRWTFTSP 760


>gnl|CDD|219833 pfam08418, Pol_alpha_B_N, DNA polymerase alpha subunit B
           N-terminal.  This is the eukaryotic DNA polymerase alpha
           subunit B N-terminal domain which is involved in complex
           formation. Also see pfam04058.
          Length = 239

 Score = 33.5 bits (77), Expect = 0.26
 Identities = 21/96 (21%), Positives = 31/96 (32%), Gaps = 6/96 (6%)

Query: 117 LSRTVTHPYSRTQSVQSLRDVSRTQTASKRGALSHPRTPTTLLSKQGSHPLTPSSRYPTT 176
           L + V  P +  ++ + L +V      ++   L    TP+           TP       
Sbjct: 62  LEKRVRTP-ASIKTSKRLIEVPE----AEESLLDSYTTPSDK-GGMLRILSTPELPKRKR 115

Query: 177 PTSGGHNPSYSSSGSEASIEPRTPTVSKSPLGRTFS 212
             S     S S   S AS  P     + SP    FS
Sbjct: 116 SFSASSLESPSLFFSPASFSPSAAPSTPSPNSAKFS 151


>gnl|CDD|217393 pfam03154, Atrophin-1, Atrophin-1 family.  Atrophin-1 is the
           protein product of the dentatorubral-pallidoluysian
           atrophy (DRPLA) gene. DRPLA OMIM:125370 is a progressive
           neurodegenerative disorder. It is caused by the
           expansion of a CAG repeat in the DRPLA gene on
           chromosome 12p. This results in an extended
           polyglutamine region in atrophin-1, that is thought to
           confer toxicity to the protein, possibly through
           altering its interactions with other proteins. The
           expansion of a CAG repeat is also the underlying defect
           in six other neurodegenerative disorders, including
           Huntington's disease. One interaction of expanded
           polyglutamine repeats that is thought to be pathogenic
           is that with the short glutamine repeat in the
           transcriptional coactivator CREB binding protein, CBP.
           This interaction draws CBP away from its usual nuclear
           location to the expanded polyglutamine repeat protein
           aggregates that are characteristic of the polyglutamine
           neurodegenerative disorders. This interferes with
           CBP-mediated transcription and causes cytotoxicity.
          Length = 979

 Score = 33.9 bits (77), Expect = 0.34
 Identities = 31/151 (20%), Positives = 41/151 (27%), Gaps = 6/151 (3%)

Query: 89  PILRVNTSPYPATKGTPHFTKKSFTSADLSRTVTHPYSRTQSVQSLRDVSRTQTASKRGA 148
           P ++V      A    P            S     P  + Q    L  +S      +R  
Sbjct: 179 PSIQVPPGAALAPSAPPPTPSAQAVPPQGSPIAAQPAPQPQQPSPLSLISAPSLHPQRLP 238

Query: 149 LSHPRTPTTLLSKQGSHPLTPSSRYPTTPTSGGHNPSYSSSGSEASIEPRTPTVSKSPLG 208
             HP       S+Q   P  PSSR+P +   G   P   +            +    P G
Sbjct: 239 SPHPPLQPQTASQQSPQPPAPSSRHPQSSHHGPGPPMPHALQQGPVFLQHPSSNPPQPFG 298

Query: 209 RTFSAGSHMSLPSLSPLPCHQCQHSPPTWSP 239
              S           PLP     HS    S 
Sbjct: 299 LAQSQV------PPLPLPSQAQPHSHTPPSQ 323


>gnl|CDD|173502 PTZ00266, PTZ00266, NIMA-related protein kinase; Provisional.
          Length = 1021

 Score = 33.9 bits (77), Expect = 0.35
 Identities = 27/147 (18%), Positives = 65/147 (44%), Gaps = 6/147 (4%)

Query: 49  LRTSSHSSVKRRLLTRTSSQASLTSRTGRSD-LRPRSGLERPILRVNTSPYPATKGT--P 105
           +R+  H S++  +       A    R+G  D +R  SG++   +    +P P  +GT   
Sbjct: 604 MRSGVHDSLRGGVHDSIRGGAHDGVRSGAHDGVRKSSGVKVEGMGNARAPPP-NQGTHDG 662

Query: 106 HFTKKSFTSADLSRTVTHPYS--RTQSVQSLRDVSRTQTASKRGALSHPRTPTTLLSKQG 163
           +F++   +S  +S    + Y   + ++ Q   D +      K   +S     + ++  +G
Sbjct: 663 NFSRGKDSSDHVSSYKPYAYDSRKAKNYQDTFDRNDMHGYMKNSPISTIVPSSHMMHTKG 722

Query: 164 SHPLTPSSRYPTTPTSGGHNPSYSSSG 190
           +  +  + +Y      GG++ +++++G
Sbjct: 723 NTGMNVNCKYLGGAAYGGYSNNHAATG 749


>gnl|CDD|119392 cd06174, MFS, The Major Facilitator Superfamily (MFS) is a large
           and diverse group of secondary transporters that
           includes uniporters, symporters, and antiporters. MFS
           proteins facilitate the transport across cytoplasmic or
           internal membranes of a variety of substrates including
           ions, sugar phosphates, drugs, neurotransmitters,
           nucleosides, amino acids, and peptides. They do so using
           the electrochemical potential of the transported
           substrates. Uniporters transport a single substrate,
           while symporters and antiporters transport two
           substrates in the same or in opposite directions,
           respectively, across membranes. MFS proteins are
           typically 400 to 600 amino acids in length, and the
           majority contain 12 transmembrane alpha helices (TMs)
           connected by hydrophilic loops. The N- and C-terminal
           halves of these proteins display weak similarity and may
           be the result of a gene duplication/fusion event. Based
           on kinetic studies and the structures of a few bacterial
           superfamily members, GlpT (glycerol-3-phosphate
           transporter), LacY (lactose permease), and EmrD
           (multidrug transporter), MFS proteins are thought to
           function through a single substrate binding site,
           alternating-access mechanism involving a rocker-switch
           type of movement. Bacterial members function primarily
           for nutrient uptake, and as drug-efflux pumps to confer
           antibiotic resistance. Some MFS proteins have medical
           significance in humans such as the glucose transporter
           Glut4, which is impaired in type II diabetes, and
           glucose-6-phosphate transporter (G6PT), which causes
           glycogen storage disease when mutated.
          Length = 352

 Score = 32.7 bits (75), Expect = 0.62
 Identities = 15/108 (13%), Positives = 40/108 (37%), Gaps = 1/108 (0%)

Query: 321 FIMFAVCTTVLVAVHIFALMISTYLLPNI-EAISKLQSTKLVSESPHERMRGFVELAWAF 379
            ++FA+ + +L       L++    L  +        +  L++E    + RG     ++ 
Sbjct: 71  LLLFALGSLLLAFASSLWLLLVGRFLLGLGGGALYPAAAALIAEWFPPKERGRALGLFSA 130

Query: 380 STVLGLFLFLVEVAILCWVKFWDYSLRAAIAATIIVIPVLIIFVVFVV 427
              LG  L  +   +L     W +         +++  +L+  +  ++
Sbjct: 131 GFGLGALLGPLLGGLLAESLGWRWLFLILAILGLLLALLLLFLLRLLL 178



 Score = 32.7 bits (75), Expect = 0.62
 Identities = 15/108 (13%), Positives = 40/108 (37%), Gaps = 1/108 (0%)

Query: 578 FIMFAVCTTVLVAVHIFALMISTYLLPNI-EAISKLQSTKLVSESPHERMRGFVELAWAF 636
            ++FA+ + +L       L++    L  +        +  L++E    + RG     ++ 
Sbjct: 71  LLLFALGSLLLAFASSLWLLLVGRFLLGLGGGALYPAAAALIAEWFPPKERGRALGLFSA 130

Query: 637 STVLGLFLFLVEVAILCWVKFWDYSLRAAIAATIIVIPVLIIFVVFVV 684
              LG  L  +   +L     W +         +++  +L+  +  ++
Sbjct: 131 GFGLGALLGPLLGGLLAESLGWRWLFLILAILGLLLALLLLFLLRLLL 178


>gnl|CDD|216908 pfam02159, Oest_recep, Oestrogen receptor. 
          Length = 139

 Score = 31.4 bits (71), Expect = 0.71
 Identities = 21/75 (28%), Positives = 26/75 (34%), Gaps = 8/75 (10%)

Query: 169 PSSRYPTTPTSGGHNPSYSSSGSEASIEPRTPTVSKSPLGRTFSAGSHMSLPSLSPLPCH 228
           P   Y     +    P YSS+       P    V   P     S G   SL S+ P P  
Sbjct: 15  PEGAYDFGAPTETSAPVYSSAS--LGYYPAPEDVHGPP-----SLGGLQSLGSVPPSPLM 67

Query: 229 QCQHSPPTWSPYILP 243
               S P  SP++ P
Sbjct: 68  -FLPSSPQLSPFLHP 81


>gnl|CDD|223333 COG0255, RpmC, Ribosomal protein L29 [Translation, ribosomal
          structure and biogenesis].
          Length = 69

 Score = 29.6 bits (67), Expect = 0.74
 Identities = 18/57 (31%), Positives = 27/57 (47%), Gaps = 9/57 (15%)

Query: 11 SDLRHKCDERLQT-IDNIKKEQLDTLTKLELEHNLNPIRLRTSSHSSVKR---RLLT 63
           +LR K  E L+  +  +KKE  +   +L      NP R+R      V+R   R+LT
Sbjct: 6  KELREKSVEELEEELRELKKELFNLRFQLATGQLENPHRIRE-----VRRDIARILT 57


>gnl|CDD|216998 pfam02366, PMT, Dolichyl-phosphate-mannose-protein
           mannosyltransferase.  This is a family of
           Dolichyl-phosphate-mannose-protein mannosyltransferase
           proteins EC:2.4.1.109. These proteins are responsible
           for O-linked glycosylation of proteins, they catalyze
           the reaction:- Dolichyl phosphate D-mannose + protein
           <=> dolichyl phosphate + O-D-mannosyl-protein. Also in
           this family is the Drosophila rotated abdomen protein
           which is a putative mannosyltransferase. This family
           appears to be distantly related to pfam02516 (A Bateman
           pers. obs.).
          Length = 244

 Score = 31.9 bits (73), Expect = 0.82
 Identities = 20/73 (27%), Positives = 27/73 (36%), Gaps = 12/73 (16%)

Query: 371 GFVELAWAFSTVL-GLFLFLVEVAILCWV---KFWDYSLR--------AAIAATIIVIPV 418
             + L  A ST   GLF  L    +  W+      D SL          A    +I+IP 
Sbjct: 170 TGIALGLAVSTKGVGLFTVLPVGLLTIWLLWQLLGDLSLLYKSIWKHLFARLLFLILIPW 229

Query: 419 LIIFVVFVVHFYH 431
            +    F VHF+ 
Sbjct: 230 ALYLAQFYVHFWL 242



 Score = 31.9 bits (73), Expect = 0.82
 Identities = 20/73 (27%), Positives = 27/73 (36%), Gaps = 12/73 (16%)

Query: 628 GFVELAWAFSTVL-GLFLFLVEVAILCWV---KFWDYSLR--------AAIAATIIVIPV 675
             + L  A ST   GLF  L    +  W+      D SL          A    +I+IP 
Sbjct: 170 TGIALGLAVSTKGVGLFTVLPVGLLTIWLLWQLLGDLSLLYKSIWKHLFARLLFLILIPW 229

Query: 676 LIIFVVFVVHFYH 688
            +    F VHF+ 
Sbjct: 230 ALYLAQFYVHFWL 242


>gnl|CDD|144451 pfam00859, CTF_NFI, CTF/NF-I family transcription modulation
           region. 
          Length = 295

 Score = 32.0 bits (72), Expect = 1.0
 Identities = 39/173 (22%), Positives = 54/173 (31%), Gaps = 29/173 (16%)

Query: 63  TRTSSQASLTSRTGRSDLRPRSGL----ERPILRVNTSPY---PATKGTPHFTKKSFTSA 115
           T      SL S     D  PR        RP++  ++        T    HF        
Sbjct: 108 TMKKPDKSLFSSPSPQDSSPRLSAFTQHHRPVITGHSGISASPHPTPSPLHFPTSPILPQ 167

Query: 116 DLSRTVTHPYSR-----------TQSVQSLRDVSRTQTA----SKRGALSHPRTPTTLLS 160
             S    H   R            + VQ + D S  Q      S +G + +   PT +L+
Sbjct: 168 QPSSYFPHTAIRYPPHLHPQDPLKEFVQLVCDPSSQQAGQPNGSGQGKVPNHFLPTPMLA 227

Query: 161 KQGSHPLT-------PSSRYPTTPTSGGHNPSYSSSGSEASIEPRTPTVSKSP 206
                P+        P ++ PTT T GG     S + S  S  P    V   P
Sbjct: 228 PPPPPPMARPVPLPMPDTKPPTTSTEGGATSPTSPTYSTPSTSPANRFVGLGP 280


>gnl|CDD|219890 pfam08537, NBP1, Fungal Nap binding protein NBP1.  NBP1 is a
           nuclear protein which has been shown in Saccharomyces
           cerevisiae to be essential for the G2/M transition of
           the cell cycle.
          Length = 313

 Score = 31.8 bits (72), Expect = 1.1
 Identities = 20/60 (33%), Positives = 33/60 (55%), Gaps = 6/60 (10%)

Query: 3   TAGPVKSSS-DLRHKCDERLQTIDNI-----KKEQLDTLTKLELEHNLNPIRLRTSSHSS 56
           T+ P++ S+   R +  + LQ  DN      K  + + LT+ E E +L+PIR+  S +SS
Sbjct: 254 TSSPIRKSNRQGRERPLQPLQKGDNFYAKYPKIPKTERLTEKESEESLSPIRIDYSKYSS 313


>gnl|CDD|222193 pfam13520, AA_permease_2, Amino acid permease. 
          Length = 425

 Score = 31.9 bits (73), Expect = 1.3
 Identities = 20/172 (11%), Positives = 55/172 (31%), Gaps = 10/172 (5%)

Query: 524 NVPEWLFIMFAVCTTVLVAVHIFALMIST-YLLPGNDVAMVELQINEPTNVPEWLFIMFA 582
           +VP+ LFI   +   + + V+I  L +     +           +        W  I+  
Sbjct: 218 DVPKALFIGLLIVGVLYLLVNILFLGVVPDDEIAKLSNLPSVAALLFEAVGGPWGAIIVV 277

Query: 583 VCTTVLVAVHIFALMISTYLLPNIEAISKLQSTKLVSESPHERMRGFVELAWAFSTVLGL 642
           +   + +   +   ++++  +    A   +        +        + L    S +L L
Sbjct: 278 ILLALSLLGAVNTAIVASSRVLEALARDGVLPKFFAKVNKFGSPVRALILTAILSLILLL 337

Query: 643 FLFLVEVAILCWVKFWDYSLRAAIAATIIVIPVLIIFVVFVVHFYHNVVKYK 694
              L   A         Y+   +++A   ++  L++ +  ++         +
Sbjct: 338 LFLLSGAA---------YNALLSLSAVGYLLVYLLLIIGLLILRKKRPDLPR 380


>gnl|CDD|221787 pfam12815, CTD, Spt5 C-terminal nonapeptide repeat binding Spt4.
           The C-terminal domain of the transcription elongation
           factor protein Spt5 is necessary for binding to Spt4 to
           form the functional complex that regulates early
           transcription elongation by RNA polymerase II. The
           complex may be involved in pre-mRNA processing through
           its association with mRNA capping enzymes. This CTD
           domain carries a regular nonapeptide repeat that can be
           present in up to 18 copies, as in S. pombe. The repeat
           has a characteristic TPA motif.
          Length = 112

 Score = 30.1 bits (68), Expect = 1.3
 Identities = 20/86 (23%), Positives = 25/86 (29%), Gaps = 9/86 (10%)

Query: 153 RTPT-TLLSKQGSHPLTPSSRYPTTPTSGGHNPSYSSSGSEASIEPRTPTVSKSPLGRTF 211
           RTP       +         R P    +G   P++           RTP       GRT 
Sbjct: 14  RTPAWGQGGSRTPAYGGAGGRTPAWGGAGNKTPAFGDGS-------RTPAYGGESNGRTP 66

Query: 212 SAGSHMSLPSLS-PLPCHQCQHSPPT 236
           + G   S    S P        S PT
Sbjct: 67  AYGGSGSYLGASTPGAAPTPAKSAPT 92


>gnl|CDD|146273 pfam03546, Treacle, Treacher Collins syndrome protein Treacle. 
          Length = 519

 Score = 31.8 bits (71), Expect = 1.4
 Identities = 20/87 (22%), Positives = 29/87 (33%), Gaps = 11/87 (12%)

Query: 154 TPTTLLSKQGSHPLTPSSRYPTTPTSGGHNPSYSSSGSEASIE------PRTPTVSKSPL 207
            P     ++G+ P+TP    P    +G      S   S++  E        T      PL
Sbjct: 60  APAKESPQKGAPPVTPGKAGPAAAQAGEEEAKSSEEESDSEGETPTAATLTTSPAQAKPL 119

Query: 208 G-----RTFSAGSHMSLPSLSPLPCHQ 229
           G     R  S  +       + LPC Q
Sbjct: 120 GKNSQVRPASTVTPGPSGKGANLPCPQ 146


>gnl|CDD|225579 COG3037, SgaT, Uncharacterized protein conserved in bacteria
           [Function unknown].
          Length = 481

 Score = 31.5 bits (72), Expect = 1.6
 Identities = 33/187 (17%), Positives = 63/187 (33%), Gaps = 48/187 (25%)

Query: 521 EPTNVPEWLFIMF--AVCTTVLVAVHIFALMISTYLLPGNDVAMVELQINEPTNVPEWLF 578
           E   +P+WL I     V  T  + +  F  ++ +    G D   +        N     +
Sbjct: 219 EDLKLPKWLSIFRDSIVAITATMMILFFGAILLSLGAAGIDNVQLMAS-GGQAN-----W 272

Query: 579 IMFAVCTTVLVAVHIFALM----------------ISTYLLPNIEAISKLQSTKLVSESP 622
            ++ +   +  A  +F ++                IS  L+PN  A+  L    +   +P
Sbjct: 273 FVYIIQQGLTFAAGVFIILQGVRMFVAELVPAFKGISEKLVPN--AVPALDCAAVYPFAP 330

Query: 623 HERMRGFVELAWAFSTVLGLFLFLVEVAILCWVKFWDYSLRAAIAATIIVIPVLIIFVVF 682
           +  + GF+        +  L    + + IL           +     I++IP       F
Sbjct: 331 NAVLIGFL-----SGFIGQL----IGMGIL-------GGFSSPNTGGILIIPG------F 368

Query: 683 VVHFYHN 689
           V HF+  
Sbjct: 369 VPHFFSG 375


>gnl|CDD|147982 pfam06112, Herpes_capsid, Gammaherpesvirus capsid protein.  This
           family consists of several Gammaherpesvirus capsid
           proteins. The exact function of this family is unknown.
          Length = 148

 Score = 30.2 bits (68), Expect = 1.7
 Identities = 21/87 (24%), Positives = 33/87 (37%), Gaps = 7/87 (8%)

Query: 122 THPYSRTQSVQSLRDVSRTQTASKRGALS-HPRTPTTLLSKQGSHPLTPSSRYPTTPTSG 180
            H   R + +Q+LR      ++S   ALS    + + +           S    +  +SG
Sbjct: 65  MHGIRRKKHLQALRGAGPQTSSSIGSALSASSSSASGV----PGGANQLSGSSGSALSSG 120

Query: 181 GHN--PSYSSSGSEASIEPRTPTVSKS 205
             +   S S SGS A      P+ SK 
Sbjct: 121 PGSLSSSSSLSGSGAGAGDTAPSSSKK 147


>gnl|CDD|219339 pfam07223, DUF1421, Protein of unknown function (DUF1421).  This
           family represents a conserved region approximately 350
           residues long within a number of plant proteins of
           unknown function.
          Length = 357

 Score = 31.4 bits (71), Expect = 1.8
 Identities = 16/75 (21%), Positives = 21/75 (28%), Gaps = 12/75 (16%)

Query: 166 PLTPSSRYPTTPTSGGHNPSYSSSGSEASIEPRTPTVSKSPLGRTFSAGSHMSLPSLSPL 225
              P  +  + P   G  P  S    ++      P              S    P  S  
Sbjct: 163 QQNPPPQAQSAPQVSGLYPEESPYQPQSY----PPNEPLPS--------SMAMQPPYSGA 210

Query: 226 PCHQCQHSPPTWSPY 240
           P  Q  + PP  SPY
Sbjct: 211 PPSQQFYGPPQPSPY 225


>gnl|CDD|215145 PLN02258, PLN02258, 9-cis-epoxycarotenoid dioxygenase NCED.
          Length = 590

 Score = 31.2 bits (71), Expect = 2.0
 Identities = 28/83 (33%), Positives = 34/83 (40%), Gaps = 8/83 (9%)

Query: 169 PSSRYPTTPTSGGHNPSYSSSGSEASIEPRTPTVSKSPLGRTFSAGSHMSLPSLSPLPCH 228
            SS  PT   S  H  S SSS S++S    T   S  P  R  SA S +  PS+ PLP  
Sbjct: 1   SSSSNPT-SRSQSHASSSSSSSSQSSPPSST---SPRPRRRKPSASSLLHTPSILPLP-- 54

Query: 229 QCQHSPPTWSPYILPMNTDHHPR 251
             + S P+     LP        
Sbjct: 55  --KLSSPSPPSVTLPPAATTQTP 75


>gnl|CDD|219916 pfam08580, KAR9, Yeast cortical protein KAR9.  The KAR9 protein in
           Saccharomyces cerevisiae is a cytoskeletal protein
           required for karyogamy, correct positioning of the
           mitotic spindle and for orientation of cytoplasmic
           microtubules. KAR9 localises at the shmoo tip in mating
           cells and at the tip of the growing bud in anaphase.
          Length = 626

 Score = 31.4 bits (71), Expect = 2.1
 Identities = 24/125 (19%), Positives = 34/125 (27%), Gaps = 9/125 (7%)

Query: 65  TSSQASLTSRTGRSDLRPRSGLERPILRVNTSPYPATKGTPHFTKKSFTSADLSRTVTHP 124
            SS   L+ R     L       RP  R+++        +            +SR     
Sbjct: 502 LSSIPPLSPRQSIITLPT---PSRPASRISSLSLRLGSYSGSIVSPPPYPTLVSRKGAAG 558

Query: 125 YSRTQSVQSLRDVSRTQTASKRGALSHPRTPTTLLSKQGSHPLTPSSRYPTTPTSGGHNP 184
            S  +SV  +           R  L   R P  L  K  S   +  S    +PT     P
Sbjct: 559 LSFNRSVSDIE-----GERIGRYNLLPTRIPA-LPFKAESTTSSRRSSSLPSPTGVIGFP 612

Query: 185 SYSSS 189
                
Sbjct: 613 GSVPR 617


>gnl|CDD|233191 TIGR00927, 2A1904, K+-dependent Na+/Ca+ exchanger.  [Transport and
           binding proteins, Cations and iron carrying compounds].
          Length = 1096

 Score = 31.5 bits (71), Expect = 2.1
 Identities = 34/143 (23%), Positives = 57/143 (39%), Gaps = 16/143 (11%)

Query: 69  ASLTSRTGRSDLRPR-SGLERPILRVNTSP------YPATKGTPHFTKKSFTSADLSRTV 121
           A +T  T +++  P  +G ER       +P      Y +T G      KS+T     R  
Sbjct: 119 AKITPTTPKNNYSPTAAGTERVKEDTPATPSRALNHYISTSGRQRV--KSYTPK--PRGE 174

Query: 122 THPYSRTQSVQSLRDVSRTQTASKRGALSHPRTPTTLLSKQGSHPLTPSSRYPTTPTSGG 181
               S TQ+    R+  R  T S  G + +   P+T ++   SH +TP +    +  +  
Sbjct: 175 VKSSSPTQT----REKVRKYTPSPLGRMVNSYAPSTFMTMPRSHGITPRTTVKDSEITAT 230

Query: 182 HNPSYSSSGSEASIEPRTPTVSK 204
                 ++ S+ +    TPT  K
Sbjct: 231 -YKMLETNPSKRTAGKTTPTPLK 252


>gnl|CDD|218244 pfam04754, Transposase_31, Putative transposase, YhgA-like.  This
           family of putative transposases includes the YhgA
           sequence from Escherichia coli and several prokaryotic
           homologues.
          Length = 203

 Score = 30.6 bits (70), Expect = 2.3
 Identities = 10/42 (23%), Positives = 19/42 (45%), Gaps = 9/42 (21%)

Query: 414 IVIPVLIIFVVFVVHFYHNVVKYKCKTSISDMKELEDIKKKL 455
           +VIP++         FYH    +    ++ D+ E  ++ K L
Sbjct: 110 LVIPIV---------FYHGKESWPAPRNLLDLFEDPELAKYL 142



 Score = 30.6 bits (70), Expect = 2.3
 Identities = 10/42 (23%), Positives = 19/42 (45%), Gaps = 9/42 (21%)

Query: 671 IVIPVLIIFVVFVVHFYHNVVKYKCKTSISDMKELEDIKKKL 712
           +VIP++         FYH    +    ++ D+ E  ++ K L
Sbjct: 110 LVIPIV---------FYHGKESWPAPRNLLDLFEDPELAKYL 142


>gnl|CDD|217379 pfam03125, Sre, C. elegans Sre G protein-coupled chemoreceptor.
           Caenorhabditis elegans Sre proteins are candidate
           chemosensory receptors. There are four main recognised
           groups of such receptors: Odr-10, Sra, Sro, and Srg. Sre
           (this family), Sra pfam02117 and Srb pfam02175 comprise
           the Sra group. All of the above receptors are thought to
           be G protein-coupled seven transmembrane domain
           proteins. The existence of several different
           chemosensory receptors underlies the fact that in spite
           of having only 20-30 chemosensory neurones, C. elegans
           detects hundreds of different chemicals, with the
           ability to discern individual chemicals among
           combinations.
          Length = 363

 Score = 30.8 bits (70), Expect = 2.4
 Identities = 17/67 (25%), Positives = 29/67 (43%), Gaps = 3/67 (4%)

Query: 371 GFVELAWAFSTVLGLFLFLVEVAILCW-VKFWDYSLRAAIAATIIVI--PVLIIFVVFVV 427
           GF++  + FS V GL    VE       +K ++   R  I   +I+I   + I   +F +
Sbjct: 126 GFLQWHYMFSVVFGLLAIAVERCFASVFIKDYEKKSRIWIPVILIIISQFLAISSALFFL 185

Query: 428 HFYHNVV 434
                V+
Sbjct: 186 FNKLPVL 192



 Score = 30.8 bits (70), Expect = 2.4
 Identities = 17/67 (25%), Positives = 29/67 (43%), Gaps = 3/67 (4%)

Query: 628 GFVELAWAFSTVLGLFLFLVEVAILCW-VKFWDYSLRAAIAATIIVI--PVLIIFVVFVV 684
           GF++  + FS V GL    VE       +K ++   R  I   +I+I   + I   +F +
Sbjct: 126 GFLQWHYMFSVVFGLLAIAVERCFASVFIKDYEKKSRIWIPVILIIISQFLAISSALFFL 185

Query: 685 HFYHNVV 691
                V+
Sbjct: 186 FNKLPVL 192


>gnl|CDD|218397 pfam05044, Prox1, Homeobox prospero-like protein (PROX1).  The
           homeobox gene Prox1 is expressed in a subpopulation of
           endothelial cells that, after budding from veins, gives
           rise to the mammalian lymphatic system. Prox1 has been
           found to be an early specific marker for the developing
           liver and pancreas in the mammalian foregut endoderm.
           This family contains an atypical homeobox domain.
          Length = 908

 Score = 31.2 bits (70), Expect = 2.5
 Identities = 47/220 (21%), Positives = 65/220 (29%), Gaps = 55/220 (25%)

Query: 88  RPILRVNTSPYPATKGTPHFTKKSFTSAD-------LSRTVTHPYSRTQSVQSLRDVSRT 140
           +P          A   +P    ++    D       LS  VT P  +   V   R   RT
Sbjct: 539 KPPDSFANVAAAALYNSPFCMPQTPQPQDAPEQTEALSLVVT-PKKKRHKVTDTRITPRT 597

Query: 141 ---------QTASKRGALSHPRTPTTLLSKQGSHPLTPSSRYPTTPTSGGHNPSYSSSGS 191
                          G    P  P++L +  G HP      +P   T    NPS   S  
Sbjct: 598 VSRILALRDAVGPAAGTHHQPLHPSSLSASMGFHPPPFRHPFPLPLTVAIPNPSLHQSEV 657

Query: 192 EASIEPRTPTVSKSPLGRTFSAGSHMSLPSLSPLPCHQCQHSPPTWSPYILPMN-TDHHP 250
                 ++P               H+  PS SP    + + SP       LP   T  HP
Sbjct: 658 FMGYPFQSP---------------HLGAPSGSPPG--KDRDSPD------LPRPTTSLHP 694

Query: 251 RLYKNHSRRDLGHHYEELPEDSEGLVWRRLHMSRAKLKAT 290
           +L   H      HH    P+         L +S  K +  
Sbjct: 695 KLLSAH------HHPGSSPDG--------LSLSLIKSECG 720


>gnl|CDD|165093 PHA02724, PHA02724, hydrophobic IMV membrane protein; Provisional.
          Length = 53

 Score = 27.6 bits (61), Expect = 2.8
 Identities = 12/46 (26%), Positives = 25/46 (54%)

Query: 389 LVEVAILCWVKFWDYSLRAAIAATIIVIPVLIIFVVFVVHFYHNVV 434
           L+ + I C V  +++++ +     II     I+F+ F+ HF H+ +
Sbjct: 8   LLLLVITCCVLLFNFTISSKTKIDIIFAVQTIVFIWFIFHFVHSAI 53



 Score = 27.6 bits (61), Expect = 2.8
 Identities = 12/46 (26%), Positives = 25/46 (54%)

Query: 646 LVEVAILCWVKFWDYSLRAAIAATIIVIPVLIIFVVFVVHFYHNVV 691
           L+ + I C V  +++++ +     II     I+F+ F+ HF H+ +
Sbjct: 8   LLLLVITCCVLLFNFTISSKTKIDIIFAVQTIVFIWFIFHFVHSAI 53


>gnl|CDD|240370 PTZ00342, PTZ00342, acyl-CoA synthetase; Provisional.
          Length = 746

 Score = 30.8 bits (70), Expect = 3.4
 Identities = 12/29 (41%), Positives = 16/29 (55%)

Query: 544 HIFALMISTYLLPGNDVAMVELQINEPTN 572
           HI  L++  Y L  N +A+VE    EP N
Sbjct: 62  HIMKLLLEKYKLNNNKIAIVEHSCGEPQN 90


>gnl|CDD|212032 cd10322, SLC5sbd, Solute carrier 5 family, sodium/glucose
           transporters and related proteins; solute-binding
           domain.  This family represents the solute-binding
           domain of SLC5 proteins (also called the sodium/glucose
           cotransporter family or solute sodium symporter family)
           that co-transport Na+ with sugars, amino acids,
           inorganic ions or vitamins. Family members include: the
           human glucose (SGLT1, 2, 4, 5), chiro-inositol (SGLT5),
           myo-inositol (SMIT), choline (CHT), iodide (NIS),
           multivitamin (SMVT), and monocarboxylate (SMCT)
           cotransporters, as well as Vibrio parahaemolyticus
           glucose/galactose (vSGLT), and Escherichia coli proline
           (PutP) and pantothenate (PutF) cotransporters. Vibrio
           parahaemolyticus Na(+)/galactose cotransporter (vSGLT)
           has 13 transmembrane helices (TMs): TM-1, an inverted
           topology repeat: TMs1-5 and TMs6-10, and TMs 11-12 (TMs
           numbered to conform to the solute carrier 6 family
           Aquifex aeolicus LeuT). One member of this family, human
           SGLT3, has been characterized as a glucose sensor and
           not a transporter. Members of this family are important
           in human physiology and disease.
          Length = 455

 Score = 30.6 bits (70), Expect = 3.6
 Identities = 15/62 (24%), Positives = 20/62 (32%), Gaps = 7/62 (11%)

Query: 274 GLVWRRLHMSRAKLKATATTSELLSGFAMVAMVELQINEPTNVPEWLFIMFAVCTTVLVA 333
           GL W+R        KA A  S L+ G  +  +  L         +       V   V V 
Sbjct: 401 GLFWKRAT------KAGAIAS-LIVGLIVTLVWLLLGLASPFGIDPAIPGLLVSLIVFVV 453

Query: 334 VH 335
           V 
Sbjct: 454 VS 455



 Score = 30.6 bits (70), Expect = 3.6
 Identities = 15/62 (24%), Positives = 20/62 (32%), Gaps = 7/62 (11%)

Query: 483 GLVWRRLHMSRAKLKATATTSELLSGFAMVAMVELQINEPTNVPEWLFIMFAVCTTVLVA 542
           GL W+R        KA A  S L+ G  +  +  L         +       V   V V 
Sbjct: 401 GLFWKRAT------KAGAIAS-LIVGLIVTLVWLLLGLASPFGIDPAIPGLLVSLIVFVV 453

Query: 543 VH 544
           V 
Sbjct: 454 VS 455


>gnl|CDD|219051 pfam06479, Ribonuc_2-5A, Ribonuclease 2-5A.  This domain is a
           endoribonuclease. Specifically it cleaves an intron from
           Hac1 mRNA in humans, which causes it to be much more
           efficiently translated.
          Length = 127

 Score = 29.1 bits (66), Expect = 3.9
 Identities = 14/28 (50%), Positives = 15/28 (53%), Gaps = 7/28 (25%)

Query: 251 RLYKNHSRRDL-------GHHYEELPED 271
           R YK  S RDL        HHY ELPE+
Sbjct: 56  RKYKGDSVRDLLRAIRNKKHHYRELPEE 83


>gnl|CDD|227367 COG5034, TNG2, Chromatin remodeling protein, contains PhD zinc
           finger [Chromatin structure and dynamics].
          Length = 271

 Score = 29.9 bits (67), Expect = 4.1
 Identities = 29/190 (15%), Positives = 51/190 (26%), Gaps = 27/190 (14%)

Query: 11  SDLRHKCDE---RLQTIDNIKK--EQLDTLTKLELEHNLNPIRLRTSSHSSVKRRLLTRT 65
           S++  K  +    L+ + +I K    LD+ T  E+E  L            ++  LL   
Sbjct: 29  SEIDAKVCDIIKNLRQMISILKKIIDLDSQTYEEVEDGLL---------KEIRELLLKAI 79

Query: 66  SSQASLTSRTGRSDLRPRSGLERPILRVNTSPYPATKGTPHFTKKSFTSADLSRTVTHPY 125
             Q            R    L R    ++     A +       +               
Sbjct: 80  YIQKE----KSDLADRAEKLLRRHRKLLDDRI--AKRPHEKVAAR--IENCHDAVSRLER 131

Query: 126 SRTQSVQSLRDVSRTQTASKRGALSHPRTPTTLLSKQGSHPLTPSSRYPTTPTSGGHNPS 185
           +   S        R+    +  A S     T L  ++  H L   S   ++         
Sbjct: 132 NSYSSA-----ARRSSGEHRSAASSQGSRHTKLKKRKNIHNLKRRSPELSSKREVSFTLE 186

Query: 186 YSSSGSEASI 195
             S    A+ 
Sbjct: 187 SPSVPDTATR 196


>gnl|CDD|165564 PHA03309, PHA03309, transcriptional regulator ICP4; Provisional.
          Length = 2033

 Score = 30.6 bits (68), Expect = 4.4
 Identities = 40/163 (24%), Positives = 65/163 (39%), Gaps = 10/163 (6%)

Query: 46   PIRLRTSSHSSVKRRLLTRTSSQASLTSRTGRSDLRPRSGLERPIL-RVNTSPYPATKGT 104
            P+R R+ + +   R +++   S +S +S +  S   P S   R     ++ SP P  +  
Sbjct: 1795 PVRRRSGAANCGGRWMISAGRSSSSSSSSSSSSSSSPSSRPSRSATPSLSPSPSPPRRA- 1853

Query: 105  PHFTKKSFTSADLSRTVTHPYSRTQSVQSLRDVSRTQTASKRGALSHPRTPTTLLSKQGS 164
            P    +S    +  R   +P+      +S  D S   TA     L+    P      +G 
Sbjct: 1854 PVDRSRSGRRRERDRPSANPFRWAPRQRSRADHSPDGTAPGDAPLNLEDGP-----GRGR 1908

Query: 165  HPLTPSSRYPTTPTSGGHNPSYSSSGSEASIEPRTPTVSKSPL 207
               TPSS   T P+  G   S   + +E S  P    ++ SPL
Sbjct: 1909 PIWTPSSA-TTLPSRSGPEDSVDETETEDSAPP--ARLAPSPL 1948


>gnl|CDD|224945 COG2034, COG2034, Predicted membrane protein [Function unknown].
          Length = 85

 Score = 27.6 bits (62), Expect = 6.0
 Identities = 7/55 (12%), Positives = 20/55 (36%)

Query: 368 RMRGFVELAWAFSTVLGLFLFLVEVAILCWVKFWDYSLRAAIAATIIVIPVLIIF 422
                + +A      +G  L  + + +  +  F +          +++ P+ I+F
Sbjct: 2   IRSEDLIIAGLILIFIGFLLIFLGIVLPAFSPFAESGSSVEYGGVVMIGPIPIVF 56



 Score = 27.6 bits (62), Expect = 6.0
 Identities = 7/55 (12%), Positives = 20/55 (36%)

Query: 625 RMRGFVELAWAFSTVLGLFLFLVEVAILCWVKFWDYSLRAAIAATIIVIPVLIIF 679
                + +A      +G  L  + + +  +  F +          +++ P+ I+F
Sbjct: 2   IRSEDLIIAGLILIFIGFLLIFLGIVLPAFSPFAESGSSVEYGGVVMIGPIPIVF 56


>gnl|CDD|227005 COG4658, RnfD, Predicted NADH:ubiquinone oxidoreductase, subunit
           RnfD [Energy production and conversion].
          Length = 338

 Score = 29.7 bits (67), Expect = 6.1
 Identities = 43/186 (23%), Positives = 74/186 (39%), Gaps = 13/186 (6%)

Query: 494 AKLKATATTSELLSGFAMVAMVELQINEPTNVPEWLFIMFAVCTTVLVAVHIFALMISTY 553
            KL+  + T+ L  G A++  + L +  P   P W+ ++      V++  H++  +    
Sbjct: 61  LKLRKKSVTAFLPDGSALLTGLLLAVVIPPLAPWWMTVIGTFF-AVVIGKHLYGGLGQNP 119

Query: 554 LLPGNDVAMVELQINEPTNVPEWLFIMFAVCTTVLVAVHIFALMISTYLLPNIEAISKLQ 613
             P   V  V L I+ P  +  WL  + A+         IF+ +    L   I+AI+  Q
Sbjct: 120 FNPAM-VGYVVLLISFPVQMTTWLPPISALQEPADSYSLIFSSL--VQLRTGIDAIT--Q 174

Query: 614 STKLVSESPHERMRGFVELAWAFSTVLGLF-------LFLVEVAILCWVKFWDYSLRAAI 666
           +T L       R   F+  A  FS  L           FL+    L WVK   + +  + 
Sbjct: 175 ATPLDDFKLSLREGDFILQAPIFSGKLAGAGWEWVNLAFLLGGLFLLWVKAIRWHIPVSF 234

Query: 667 AATIIV 672
             T+ +
Sbjct: 235 LVTLFL 240


>gnl|CDD|223701 COG0628, yhhT, Predicted permease, member of the PurR regulon
           [General function prediction only].
          Length = 355

 Score = 29.5 bits (67), Expect = 6.6
 Identities = 28/158 (17%), Positives = 64/158 (40%), Gaps = 19/158 (12%)

Query: 285 AKLKATATTSELLSGFAMVAMVELQINEPTNVPEWLFIMFAVCTTVLVAVHIFALMISTY 344
           AKL  +          ++ A++   ++   ++   L  + +   +++V++ +  L++  +
Sbjct: 112 AKLLESLPLLGERLPVSVDALLSALLSILQSILGSLLSLLSSLLSLIVSL-LLVLVLLFF 170

Query: 345 LLPNIEAISKLQSTKLVSESPHERMRGFVELAWA--FSTVLGLFLFLVEVAILCWVKFW- 401
           LL + E + + +  KL+     +R R  +    A     + G  L  + V IL  +    
Sbjct: 171 LLLDGERLRR-KLIKLLPRKLRKRARRILSEVNATLSGYLRGQVLVALIVGILTGIGLLI 229

Query: 402 ---DYSLRAAIAA-----------TIIVIPVLIIFVVF 425
               Y+L   + A            I +IP +II ++ 
Sbjct: 230 LGVPYALLLGLLAGLLSLIPYIGPVIGLIPAVIIALLQ 267


>gnl|CDD|223654 COG0581, PstA, ABC-type phosphate transport system, permease
           component [Inorganic ion transport and metabolism].
          Length = 292

 Score = 29.1 bits (66), Expect = 7.5
 Identities = 33/162 (20%), Positives = 62/162 (38%), Gaps = 33/162 (20%)

Query: 531 IMFAVCTTVLVAVHIFALMISTYLLPGNDVAMVELQ--INEPTNVPEW----------LF 578
               V     VA+     ++ T L+ G     + L+     P+   E           L+
Sbjct: 19  ATGLVFLAAAVALLPLLWILWTILIRGIPALSLNLEFFTGSPSPPGEGGGIGPAIVGTLY 78

Query: 579 IMFAVCTTVLVAVHIFALMISTYLLPNIEAISKLQSTKLVSESPHERMRGFVELAWAF-S 637
           ++      +L+ V +  +    YL    E   K + TK++        R  +++  +  S
Sbjct: 79  LILL---AILIGVPL-GIGAGIYL---AEYAKKSRLTKVI--------RFAIDILASVPS 123

Query: 638 TVLGLFLFLVEVAILCWVKFWDYSLRA-AIAATIIVIPVLII 678
            V GLF     V  L     + +S  A A+A  ++++PV+I 
Sbjct: 124 IVYGLFGLGFFVVTL----GFGFSALAGALALALLMLPVVIR 161


>gnl|CDD|119394 cd06261, TM_PBP2, Transmembrane subunit (TM) found in Periplasmic
           Binding Protein (PBP)-dependent ATP-Binding Cassette
           (ABC) transporters which generally bind type 2 PBPs.
           These types of transporters consist of a PBP, two TMs,
           and two cytoplasmic ABC ATPase subunits, and are mainly
           involved in importing solutes from the environment. The
           solute is captured by the PBP which delivers it to a
           gated translocation pathway formed by the two TMs. The
           two ABCs bind and hydrolyze ATP and drive the transport
           reaction. For these transporters the ABCs and TMs are on
           independent polypeptide chains. These systems transport
           a diverse range of substrates. Most are specific for a
           single substrate or a group of related substrates;
           however some transporters are more promiscuous,
           transporting structurally diverse substrates such as the
           histidine/lysine and arginine transporter in
           Enterobacteriaceae. In the latter case, this is achieved
           through binding different PBPs with different
           specificities to the TMs. For other promiscuous
           transporters such as the multiple-sugar transporter Msm
           of Streptococcus mutans, the PBP has a wide substrate
           specificity. These transporters include the
           maltose-maltodextrin, phosphate and sulfate
           transporters, among others.
          Length = 190

 Score = 28.8 bits (65), Expect = 7.5
 Identities = 16/56 (28%), Positives = 26/56 (46%), Gaps = 1/56 (1%)

Query: 369 MRGFVELAWAF-STVLGLFLFLVEVAILCWVKFWDYSLRAAIAATIIVIPVLIIFV 423
           +R  ++L  +  S VLGL L L+   +L W       L A I A +++ P   +  
Sbjct: 37  LRRIIDLLLSLPSLVLGLLLVLLFGVLLGWGILPGLGLPALILALLLIAPFARLIR 92



 Score = 28.8 bits (65), Expect = 7.5
 Identities = 16/56 (28%), Positives = 26/56 (46%), Gaps = 1/56 (1%)

Query: 626 MRGFVELAWAF-STVLGLFLFLVEVAILCWVKFWDYSLRAAIAATIIVIPVLIIFV 680
           +R  ++L  +  S VLGL L L+   +L W       L A I A +++ P   +  
Sbjct: 37  LRRIIDLLLSLPSLVLGLLLVLLFGVLLGWGILPGLGLPALILALLLIAPFARLIR 92


>gnl|CDD|219702 pfam08022, FAD_binding_8, FAD-binding domain. 
          Length = 103

 Score = 27.7 bits (62), Expect = 7.8
 Identities = 8/30 (26%), Positives = 13/30 (43%), Gaps = 2/30 (6%)

Query: 201 TVSKSPLGRTFSAGSHM--SLPSLSPLPCH 228
            VSK      +  G ++  + P +S L  H
Sbjct: 20  IVSKPKKPFKYKPGQYIFINFPPISFLQSH 49


>gnl|CDD|224205 COG1286, CvpA, Uncharacterized membrane protein, required for
           colicin V production [General function prediction only].
          Length = 182

 Score = 28.8 bits (65), Expect = 7.9
 Identities = 19/65 (29%), Positives = 27/65 (41%), Gaps = 5/65 (7%)

Query: 369 MRGFV-ELAWAFSTVLGLF---LFLVEVAILCWVKFWDYSLRAAIAATIIVIPVLIIFVV 424
            RGF+ E+    S +L  F   LF   +A L   ++  Y   A   A  I   +L+I   
Sbjct: 21  RRGFIREVLSLLSWILAAFVASLFYKPLAPL-LREYIPYPNIAIGIAIAIFFVILLIVGA 79

Query: 425 FVVHF 429
           FV   
Sbjct: 80  FVNSL 84



 Score = 28.8 bits (65), Expect = 7.9
 Identities = 19/65 (29%), Positives = 27/65 (41%), Gaps = 5/65 (7%)

Query: 626 MRGFV-ELAWAFSTVLGLF---LFLVEVAILCWVKFWDYSLRAAIAATIIVIPVLIIFVV 681
            RGF+ E+    S +L  F   LF   +A L   ++  Y   A   A  I   +L+I   
Sbjct: 21  RRGFIREVLSLLSWILAAFVASLFYKPLAPL-LREYIPYPNIAIGIAIAIFFVILLIVGA 79

Query: 682 FVVHF 686
           FV   
Sbjct: 80  FVNSL 84


>gnl|CDD|199217 cd10422, RNase_Ire1, RNase domain (also known as the kinase
           extension nuclease domain) of Ire1.  The model
           represents the C-terminal endoribonuclease domain of the
           multi-functional protein Ire1; Ire1 in addition contains
           a type I transmembrane serine/threonine protein kinase
           (STK) domain, and a Luminal dimerization domain. Ire1 is
           essential for the endoplasmic reticulum (ER) unfolded
           protein response (UPR), which acts as an ER stress
           sensor and is the oldest and most conserved component of
           the UPR in eukaryotes. During ER stress, IRE1 dimerizes
           through its N-terminal luminal domain and forms
           oligomers, promoting trans-autophosphorylation by its
           cytosolic kinase domain. This leads to a conformational
           change that stimulates its endoribonuclease (RNase)
           activity and results in the cleavage of its mRNA
           substrate, Hac1 in yeast and Xbp1 in metazoans, thus
           promoting a splicing event that enables translation into
           a transcription factor which activates the UPR. This
           RNase domain is homologous to the RNase domain of RNase
           L, and possesses a novel fold for a nuclease and appears
           to be rigid irrespective of the activation state of
           IRE1. Structural analysis and mutational studies have
           revealed that an early stage 'phosphoryl-transfer'
           competent conformation of IRE1 favors face-to-face
           dimerization of the kinase domains which precedes and is
           distinct from the RNase 'active' back-to-back
           conformation. Furthermore, in yeast IRE1, the flavonol
           quercetin activates the RNase and potentiates activation
           of the protein kinase by ADP, hinting at the possible
           existence of endogenous cytoplasmic ligands that may
           function along with stress signals from ER lumen in
           order to modulate IRE1 activity, thus identifying IRE1
           as a target for development of ATP-competitive
           inhibitors to modulate the UPR with specific relevance
           for multiple myeloma.
          Length = 129

 Score = 27.9 bits (63), Expect = 8.0
 Identities = 14/28 (50%), Positives = 14/28 (50%), Gaps = 7/28 (25%)

Query: 251 RLYKNHSRRDL-------GHHYEELPED 271
           R YK  S RDL        HHY ELP D
Sbjct: 57  RKYKGSSVRDLLRALRNKKHHYRELPPD 84


>gnl|CDD|221371 pfam12004, DUF3498, Domain of unknown function (DUF3498).  This
           presumed domain is functionally uncharacterized. This
           domain is found in eukaryotes. This domain is typically
           between 433 to 538 amino acids in length. This domain is
           found associated with pfam00616, pfam00168. This domain
           has two conserved sequence motifs: DLQ and PLSFQNP.
          Length = 489

 Score = 29.3 bits (65), Expect = 8.1
 Identities = 19/72 (26%), Positives = 21/72 (29%), Gaps = 8/72 (11%)

Query: 147 GALSHPRTPTTLLSKQGSHPLTPSSRYPTTPTSGGHNPSYSSSGSEASIEPRT------- 199
                        S   S P   S      P SG      SSS    S E R        
Sbjct: 267 PPGGGSHRGRIPPSLLSSLPSEGSMLSSEWPQSGARPRQQSSSSKGDSPELRPAAGHLQQ 326

Query: 200 PT-VSKSPLGRT 210
           P+ V+ S L RT
Sbjct: 327 PSPVNMSALERT 338


>gnl|CDD|220365 pfam09726, Macoilin, Transmembrane protein.  This entry is a highly
           conserved protein present in eukaryotes.
          Length = 680

 Score = 29.5 bits (66), Expect = 8.2
 Identities = 30/196 (15%), Positives = 50/196 (25%), Gaps = 19/196 (9%)

Query: 39  ELEHNLNPIRLRTSSHSSVKRRLLTRTSSQASLTSRT--GRSDLRPRSGLERPILRVNTS 96
           E  H L+      S  SS        TS++  +  +   G   L   S  + P L  +  
Sbjct: 204 EENHTLSVTDKEKSEASSKGL-----TSTKELVPVQNSGGNHSLSKSSNSQTPELEYSEK 258

Query: 97  PYPATKGTPHFTKK------SFTSADLSRTVTHPYSRTQSVQSLRDVSRTQTASKRGALS 150
                    H              AD             +       S   +       S
Sbjct: 259 GKDHHHSHNHQHHSIGINNHHSKHADSKLQTIEVIENHSNKSRPSSSSTNGSKETTSNSS 318

Query: 151 HPRTPTTLLSKQGSHPLTPSSRYPTTPTSGGHNPSYSSSGSEASIEPRTPTVSKSPLGRT 210
                +       S   +  ++  ++P S         S S +  E +    SKS     
Sbjct: 319 SAAAGSIGSKSSKSAKHSNRNKSNSSPKSHSSANGSVPSSSVSDNESKQKRASKSS---- 374

Query: 211 FSAGSHMSLPSLSPLP 226
             +G+  S    S + 
Sbjct: 375 --SGARDSKKDASGMS 388


>gnl|CDD|110602 pfam01611, Filo_glycop, Filovirus glycoprotein.  This family
           includes an extracellular region from the envelope
           glycoprotein of Ebola and Marburg viruses. This region
           is also produced as a separate transcript that gives
           rise to a non-structural, secreted glycoprotein, which
           is produced in large amounts and has an unknown
           function. Processing of this protein may be involved in
           viral pathogenicity.
          Length = 364

 Score = 29.0 bits (65), Expect = 8.2
 Identities = 15/79 (18%), Positives = 26/79 (32%), Gaps = 2/79 (2%)

Query: 105 PHFTKKSFTSADLSRTVTHPYSRTQSVQSLRDVSRTQTASKRGALSHPRTPTTLLSKQGS 164
              TKK+ T    S T     S +    +    + T+    R +L      TT +S   +
Sbjct: 276 FWETKKNVTKQGQSSTCLSTPSLSPRTTNHSRQAVTELDKNRTSLQPSTNNTTTIST--N 333

Query: 165 HPLTPSSRYPTTPTSGGHN 183
           +    +    + P     N
Sbjct: 334 NTSKHNFSTQSIPLQNFTN 352


>gnl|CDD|221931 pfam13136, DUF3984, Protein of unknown function (DUF3984).  This
           family of proteins is functionally uncharacterized. This
           family of proteins is found in eukaryotes. Proteins in
           this family are typically between 393 and 442 amino
           acids in length.
          Length = 301

 Score = 28.9 bits (65), Expect = 8.6
 Identities = 23/113 (20%), Positives = 29/113 (25%), Gaps = 16/113 (14%)

Query: 168 TPSSRYPTTPTSGGHNPSYSSSGSEASIEP---RTPTVSKSPLGR--TFSAGSHMSLPSL 222
                Y   P       SY SS S  +       + + S+S L +    S  S MS   L
Sbjct: 27  DEERDYSYAPHP--PTTSYLSSKSVPTTPGILSHSRSPSRSRLHKRKKSSRRSPMSDTLL 84

Query: 223 SPLPCHQCQHSPPTWSPYILPMNTDHHPRLYKNHSRRDLGHHYEELPEDSEGL 275
                     S  +           H        S R           DSE L
Sbjct: 85  ---------KSKSSAHLLHHQSTRSHRRSKSGTTSPRKPSSSAHRRRNDSEWL 128


>gnl|CDD|233213 TIGR00966, 3a0501s07, protein-export membrane protein SecF.  This
           bacterial protein is always found with the homologous
           protein-export membrane protein SecD. In numerous
           lineages, this protein occurs as a SecDF fusion protein
           [Protein fate, Protein and peptide secretion and
           trafficking].
          Length = 246

 Score = 28.8 bits (65), Expect = 8.8
 Identities = 18/53 (33%), Positives = 27/53 (50%), Gaps = 3/53 (5%)

Query: 374 ELAWAFSTVLGLFLFLVEVAILCWVKF-WDYSLRAAIAATIIVIPVLIIFVVF 425
           ELA     VL L   LV + I   V+F W ++L A +A    VI  + ++ +F
Sbjct: 95  ELAR--KAVLALLAALVLILIYITVRFEWRFALGAIVALVHDVIITVGVYSLF 145



 Score = 28.8 bits (65), Expect = 8.8
 Identities = 18/53 (33%), Positives = 27/53 (50%), Gaps = 3/53 (5%)

Query: 631 ELAWAFSTVLGLFLFLVEVAILCWVKF-WDYSLRAAIAATIIVIPVLIIFVVF 682
           ELA     VL L   LV + I   V+F W ++L A +A    VI  + ++ +F
Sbjct: 95  ELAR--KAVLALLAALVLILIYITVRFEWRFALGAIVALVHDVIITVGVYSLF 145


>gnl|CDD|226007 COG3476, COG3476, Tryptophan-rich sensory protein (mitochondrial
           benzodiazepine receptor homolog) [Signal transduction
           mechanisms].
          Length = 161

 Score = 28.1 bits (63), Expect = 9.4
 Identities = 13/61 (21%), Positives = 30/61 (49%), Gaps = 2/61 (3%)

Query: 372 FVELAWAFSTVLGLFLFLV-EVAILCWVKFWDYSLRAAIAATIIVIPVLIIFVVFVVHFY 430
           + +     ++ L +FL+L+  +    W   + + LR+  AA II++ + I  +V  +  +
Sbjct: 68  WEKGPGQGTSWLLMFLYLLQLILNFAWSPIF-FGLRSLGAALIIILLLWIAVLVLTIIVW 126

Query: 431 H 431
            
Sbjct: 127 P 127



 Score = 28.1 bits (63), Expect = 9.4
 Identities = 13/61 (21%), Positives = 30/61 (49%), Gaps = 2/61 (3%)

Query: 629 FVELAWAFSTVLGLFLFLV-EVAILCWVKFWDYSLRAAIAATIIVIPVLIIFVVFVVHFY 687
           + +     ++ L +FL+L+  +    W   + + LR+  AA II++ + I  +V  +  +
Sbjct: 68  WEKGPGQGTSWLLMFLYLLQLILNFAWSPIF-FGLRSLGAALIIILLLWIAVLVLTIIVW 126

Query: 688 H 688
            
Sbjct: 127 P 127


  Database: CDD.v3.10
    Posted date:  Mar 20, 2013  7:55 AM
  Number of letters in database: 10,937,602
  Number of sequences in database:  44,354
  
Lambda     K      H
   0.321    0.133    0.398 

Gapped
Lambda     K      H
   0.267   0.0778    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 44354
Number of Hits to DB: 36,040,269
Number of extensions: 3543308
Number of successful extensions: 5344
Number of sequences better than 10.0: 1
Number of HSP's gapped: 5280
Number of HSP's successfully gapped: 215
Length of query: 718
Length of database: 10,937,602
Length adjustment: 104
Effective length of query: 614
Effective length of database: 6,324,786
Effective search space: 3883418604
Effective search space used: 3883418604
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 63 (28.0 bits)