BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 030938
         (169 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|21554083|gb|AAM63164.1| unknown [Arabidopsis thaliana]
          Length = 167

 Score =  130 bits (328), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 78/135 (57%), Positives = 92/135 (68%), Gaps = 16/135 (11%)

Query: 50  AHAPKSAQA----LHASRRNLLFFSLTALPVLTARESASSAEDIPLFGLRRKLKQAEEEA 105
           AH+ K A A      ASRRNLLF SLTA  V+T  +SAS AE+IPLFG+R+KLK+AEEEA
Sbjct: 32  AHSNKPALASFSLTGASRRNLLF-SLTAATVVTGLQSASMAENIPLFGIRKKLKKAEEEA 90

Query: 106 VDIVRE-----------VEKKIETVEERIEASEKEVETTAGFGGLAQAGAVAGAELVGLL 154
           V+IV+E            EK +E  E  +E +EKE+ T   F GL QAGAV  AE VG+L
Sbjct: 91  VEIVKEGFETAEKGVDAAEKGLEAAERGVETAEKEIVTAVSFNGLTQAGAVVAAEFVGVL 150

Query: 155 VATSVVNGILGPEPQ 169
           VATSVVNGILGPE Q
Sbjct: 151 VATSVVNGILGPEAQ 165


>gi|224094564|ref|XP_002310181.1| predicted protein [Populus trichocarpa]
 gi|222853084|gb|EEE90631.1| predicted protein [Populus trichocarpa]
          Length = 167

 Score =  129 bits (324), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 88/169 (52%), Positives = 110/169 (65%), Gaps = 28/169 (16%)

Query: 21  MAST---IPWSSLSRAPTLLRVRNNARPVTCKAH-APKSAQALHASRRNLLFFSLTALPV 76
           MAST   +P  +LS    L R+RN A P  C+AH +PK A      RR+LLF  L + P 
Sbjct: 1   MASTFSPLPLRTLS----LHRMRN-AHPTICEAHTSPKPAPISFPGRRHLLFM-LASAPA 54

Query: 77  LTARESASSAEDIPLFGLRRKLKQAEEEAVDIVRE------------------VEKKIET 118
           L  RE +S A+DIPLFGLR+KLK+AEEEA ++V+E                  VE+ IET
Sbjct: 55  LALREPSSLAQDIPLFGLRKKLKRAEEEAEELVKEGFETAEKGLETAGKGIVTVERGIET 114

Query: 119 VEERIEASEKEVETTAGFGGLAQAGAVAGAELVGLLVATSVVNGILGPE 167
            E+ I  +EKE+E    FGGLAQAGAVAGAE+VG+LVAT++VNGILGPE
Sbjct: 115 AEKEIVTAEKEIEEAVSFGGLAQAGAVAGAEVVGVLVATAIVNGILGPE 163


>gi|297825275|ref|XP_002880520.1| hypothetical protein ARALYDRAFT_481233 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297326359|gb|EFH56779.1| hypothetical protein ARALYDRAFT_481233 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 167

 Score =  128 bits (322), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 73/129 (56%), Positives = 88/129 (68%), Gaps = 12/129 (9%)

Query: 52  APKSAQALHASRRNLLFFSLTALPVLTARESASSAEDIPLFGLRRKLKQAEEEAVDIVRE 111
           AP S     ASRRNLLF SLTA  V+T  +SAS AE+IP FG+R+KL +AEEEAV+IV+E
Sbjct: 38  APPSFSLTGASRRNLLF-SLTAATVMTGLQSASMAENIPFFGIRKKLNKAEEEAVEIVKE 96

Query: 112 -----------VEKKIETVEERIEASEKEVETTAGFGGLAQAGAVAGAELVGLLVATSVV 160
                       E+ +E  E  +E++EKE+ T   F GL QAGAV  AE VG+LVATSVV
Sbjct: 97  GFETAEKGVDAAERGLEAAERGVESAEKEIVTAVSFNGLTQAGAVVAAEFVGVLVATSVV 156

Query: 161 NGILGPEPQ 169
           NGILGPE Q
Sbjct: 157 NGILGPEAQ 165


>gi|18400342|ref|NP_565556.1| synechocystis YCF37-like protein [Arabidopsis thaliana]
 gi|3152615|gb|AAC17094.1| expressed protein [Arabidopsis thaliana]
 gi|16648702|gb|AAL25543.1| At2g23670/F26B6.32 [Arabidopsis thaliana]
 gi|20197003|gb|AAM14867.1| expressed protein [Arabidopsis thaliana]
 gi|20334912|gb|AAM16212.1| At2g23670/F26B6.32 [Arabidopsis thaliana]
 gi|206748360|gb|ACI22363.1| YCF37-like protein [Arabidopsis thaliana]
 gi|330252384|gb|AEC07478.1| synechocystis YCF37-like protein [Arabidopsis thaliana]
          Length = 167

 Score =  127 bits (320), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 72/120 (60%), Positives = 85/120 (70%), Gaps = 12/120 (10%)

Query: 61  ASRRNLLFFSLTALPVLTARESASSAEDIPLFGLRRKLKQAEEEAVDIVRE--------- 111
           ASRRNLLF SLTA  V+T  + AS AE+IPLFG+R+KLK+AEEEAV+IV+E         
Sbjct: 47  ASRRNLLF-SLTAATVVTGLQPASMAENIPLFGIRKKLKKAEEEAVEIVKEGFETAEKGV 105

Query: 112 --VEKKIETVEERIEASEKEVETTAGFGGLAQAGAVAGAELVGLLVATSVVNGILGPEPQ 169
              EK +E  E  +E +EKE+ T   F GL QAGAV  AE VG+LVATSVVNGILGPE Q
Sbjct: 106 DAAEKGLEAAERGVETAEKEIVTAVSFNGLTQAGAVVAAEFVGVLVATSVVNGILGPEAQ 165


>gi|356496840|ref|XP_003517273.1| PREDICTED: uncharacterized protein LOC100780194 [Glycine max]
          Length = 169

 Score =  127 bits (318), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 90/166 (54%), Positives = 109/166 (65%), Gaps = 23/166 (13%)

Query: 23  STIPWSSLSRAPTLLRVRNNARPVTCKAHAPKSAQAL-HASRRNLLFFSLTALPVLTARE 81
           S+ P+S+LS    ++R   NA P  CK  A K   AL  A RR LLFF LTA   LTARE
Sbjct: 6   SSSPFSTLSFRRLVVR---NATPSPCKPRAVKPLTALPSAGRRQLLFF-LTATTALTARE 61

Query: 82  SASSAEDIPLFGLRRKLKQAEEEAVDIVRE-----------VEKKIETVEERIEASEK-- 128
           +AS A+DIPLFG+R+ LK+ EEEA +IVRE            EK +ET E+ + A+EK  
Sbjct: 62  AASVAQDIPLFGIRKSLKKVEEEAEEIVREGFEAADKGLVAAEKGLETAEKGLVAAEKGI 121

Query: 129 -----EVETTAGFGGLAQAGAVAGAELVGLLVATSVVNGILGPEPQ 169
                E+E T  FGGLAQAGAVAGAE+ G+LVAT+VVNGILGPE Q
Sbjct: 122 EEAEREIEETVSFGGLAQAGAVAGAEVFGILVATAVVNGILGPEAQ 167


>gi|357483133|ref|XP_003611853.1| hypothetical protein MTR_5g018590 [Medicago truncatula]
 gi|355513188|gb|AES94811.1| hypothetical protein MTR_5g018590 [Medicago truncatula]
          Length = 161

 Score =  127 bits (318), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 77/153 (50%), Positives = 97/153 (63%), Gaps = 14/153 (9%)

Query: 29  SLSRAPTLLRVR---NNARPVTCKAHAPKSAQALHASRRNLLFFSLTALPVLTARESASS 85
           ++S  PTL   R   +NA   +CKA A  S  A  +S R  + F LTA   LTARES S 
Sbjct: 5   AISSFPTLSFQRVIVSNATTFSCKAKATNSVTASPSSGRREMLFLLTASTALTARESVSM 64

Query: 86  AEDIPLFGLRRKLKQAEEEAVDIVRE-----------VEKKIETVEERIEASEKEVETTA 134
           A+DIPLFG+R+ LK+AEE+A +IV+E            EK +E  E  +E +EKE+ET  
Sbjct: 65  AQDIPLFGIRKSLKKAEEKAEEIVKEGFESAEKGLETAEKGLEKAELGLEEAEKEIETGV 124

Query: 135 GFGGLAQAGAVAGAELVGLLVATSVVNGILGPE 167
           G G LAQA  V GAE+ GLLVA++VVNGILGPE
Sbjct: 125 GLGNLAQAIVVGGAEVFGLLVASAVVNGILGPE 157


>gi|356537796|ref|XP_003537411.1| PREDICTED: uncharacterized protein LOC100788441 [Glycine max]
          Length = 162

 Score =  120 bits (300), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 84/161 (52%), Positives = 103/161 (63%), Gaps = 20/161 (12%)

Query: 23  STIPWSSLSRAPTLLR--VRNNARPVTCKAHAPKSAQAL-HASRRNLLFFSLTALPVLTA 79
           S+ P+S+LS      R  V +NA    CK  A K   AL  A RR LLFF LTA    TA
Sbjct: 6   SSSPFSTLS-----FRRLVVSNATVSPCKPRAVKLLTALPSAGRRQLLFF-LTATTAFTA 59

Query: 80  RESASSAEDIPLFGLRRKLKQAEE----------EAVDI-VREVEKKIETVEERIEASEK 128
           RE+AS A+DIPLFG+R+ LK+ EE          EA D  +   EK +ET E+ + A+E+
Sbjct: 60  REAASVAQDIPLFGIRKSLKKVEEEAEEIVKEGFEAADKGLVAAEKGLETAEKGLVAAER 119

Query: 129 EVETTAGFGGLAQAGAVAGAELVGLLVATSVVNGILGPEPQ 169
           E+E    FGGLAQAGAVAGAE+ G+LVAT+VVNGILGPE Q
Sbjct: 120 EIEEAVSFGGLAQAGAVAGAEVFGILVATAVVNGILGPEAQ 160


>gi|255562751|ref|XP_002522381.1| conserved hypothetical protein [Ricinus communis]
 gi|223538459|gb|EEF40065.1| conserved hypothetical protein [Ricinus communis]
          Length = 157

 Score =  113 bits (282), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 85/147 (57%), Positives = 99/147 (67%), Gaps = 18/147 (12%)

Query: 37  LRVRNNARPV---TCKAHAPKSAQALHASRRNLLFFSLTALPVLTARESASSAEDIPLFG 93
           L + N AR     +C A  P+SA      RR LLF  LTA P LT RE+AS AEDIPLFG
Sbjct: 13  LSLNNRARRAHAPSCNAEIPQSA---FTGRRQLLFI-LTAAPALTVRETASKAEDIPLFG 68

Query: 94  LRRKLKQAEEEAVDIVRE----VEKKIETVEERIEASEK-------EVETTAGFGGLAQA 142
           LR+KLK+AEEEA ++VRE     EK +ET E+ IE +EK       E+E    FG LAQA
Sbjct: 69  LRKKLKKAEEEAEELVREGFETAEKGLETAEKGIETAEKGIEAAEKEIEEAVSFGALAQA 128

Query: 143 GAVAGAELVGLLVATSVVNGILGPEPQ 169
           GAVAGAE+ G+LVATSVVNGILGPE Q
Sbjct: 129 GAVAGAEIFGVLVATSVVNGILGPEAQ 155


>gi|225467027|ref|XP_002265860.1| PREDICTED: uncharacterized protein LOC100262366 [Vitis vinifera]
 gi|225468769|ref|XP_002275177.1| PREDICTED: uncharacterized protein LOC100262616 [Vitis vinifera]
 gi|147791855|emb|CAN61831.1| hypothetical protein VITISV_027633 [Vitis vinifera]
          Length = 152

 Score =  113 bits (282), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 74/139 (53%), Positives = 86/139 (61%), Gaps = 18/139 (12%)

Query: 42  NARPVTCKAHAPKSAQALHASRRNLLFFSLTALPVLTARESASSAEDIPLFGLRRK---- 97
           NA P++CKAH   S   L  SRR  L    TAL   TARE  S A+DIPLFGLR+K    
Sbjct: 19  NAHPMSCKAHV--STPVL--SRRRCLVLLTTAL---TARELPSRAQDIPLFGLRKKLEKV 71

Query: 98  -------LKQAEEEAVDIVREVEKKIETVEERIEASEKEVETTAGFGGLAQAGAVAGAEL 150
                  LK+  E A   +   EK IE  E+ IE +EKE+ET   + GL QAGAVAGAE+
Sbjct: 72  EEEAEEILKEGIEAAEKGIVTAEKGIEKAEKGIETAEKEIETAVSYNGLTQAGAVAGAEV 131

Query: 151 VGLLVATSVVNGILGPEPQ 169
           VG+LVATSVVN ILGPE Q
Sbjct: 132 VGVLVATSVVNAILGPEAQ 150


>gi|449460624|ref|XP_004148045.1| PREDICTED: uncharacterized protein LOC101208092 [Cucumis sativus]
          Length = 167

 Score =  109 bits (272), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 79/145 (54%), Positives = 98/145 (67%), Gaps = 22/145 (15%)

Query: 47  TCKAHAPKS-AQALHASR---RNLLFFSLTALPVLTARESASSAEDIPLFGLRRKLKQAE 102
           T +A A KS +++L ASR   + LLF + TA   +  RE+ S AEDIPLFGLR+KLK+ E
Sbjct: 22  TVRAAAVKSPSESLSASRTRRQTLLFLTATATAAVVGRENPSMAEDIPLFGLRKKLKKVE 81

Query: 103 EEAVDIVRE----VEKKIETVE-------------ER-IEASEKEVETTAGFGGLAQAGA 144
           EEA +IVRE     EK +ET E             ER IE +EKE+ET   FG L+QAGA
Sbjct: 82  EEAEEIVREGFEAAEKGLETAERGIVTAEKGIIAAEREIETAEKEIETAVNFGALSQAGA 141

Query: 145 VAGAELVGLLVATSVVNGILGPEPQ 169
           VAGAE+VG+L+ATS+VNGILGPE Q
Sbjct: 142 VAGAEVVGVLIATSIVNGILGPEGQ 166


>gi|118489097|gb|ABK96355.1| unknown [Populus trichocarpa x Populus deltoides]
          Length = 167

 Score = 89.4 bits (220), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 86/166 (51%), Positives = 106/166 (63%), Gaps = 22/166 (13%)

Query: 21  MASTIPWSSLSRAPTLLRVRNNARPVTCKA-HAPKSAQALHASRRNLLFFSLTALPVLTA 79
           MAST     L R  +L R+R NA P  C+A  +PK A      RR+LLF  L + P L  
Sbjct: 1   MASTFSPPPL-RTLSLHRMR-NAHPTICEALTSPKPAPISFPGRRHLLFM-LASAPALAL 57

Query: 80  RESASSAEDIPLFGLRRKLKQAEEEAVDIVRE------------------VEKKIETVEE 121
           RE +S A+DIPLFGLR+KLK+AEEEA ++V+E                  VE+ IET E+
Sbjct: 58  REPSSLAQDIPLFGLRKKLKRAEEEAEELVKEGFETAEKGLETAGKGIVTVERGIETAEK 117

Query: 122 RIEASEKEVETTAGFGGLAQAGAVAGAELVGLLVATSVVNGILGPE 167
            I  +EKE+E    FGGLAQAGAVAGAE VG+LVAT++VNGILGPE
Sbjct: 118 EIVTAEKEIEEAVSFGGLAQAGAVAGAEAVGVLVATAIVNGILGPE 163


>gi|449510297|ref|XP_004163625.1| PREDICTED: uncharacterized LOC101208092 [Cucumis sativus]
          Length = 167

 Score = 85.1 bits (209), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 78/145 (53%), Positives = 97/145 (66%), Gaps = 22/145 (15%)

Query: 47  TCKAHAPKS-AQALHASR---RNLLFFSLTALPVLTARESASSAEDIPLFGLRRKLKQAE 102
           T +A A KS +++L ASR   + LLF + TA   +  RE+ S AEDIPLFGLR+KLK+ E
Sbjct: 22  TVRAAAVKSPSESLSASRTRRQTLLFLTATATAAVVGRENPSMAEDIPLFGLRKKLKKVE 81

Query: 103 EEAVDIVRE----VEKKIETVE-------------ER-IEASEKEVETTAGFGGLAQAGA 144
           EEA +IVRE     EK +ET E             ER IE +EKE+ET   FG L+QAG 
Sbjct: 82  EEAEEIVREGFEAAEKGLETAERGIVTAEKGIIAAEREIETAEKEIETAVNFGALSQAGV 141

Query: 145 VAGAELVGLLVATSVVNGILGPEPQ 169
           VAGAE+VG+L+ATS+VNGILGPE Q
Sbjct: 142 VAGAEVVGVLIATSIVNGILGPEGQ 166


>gi|148908213|gb|ABR17221.1| unknown [Picea sitchensis]
          Length = 160

 Score = 71.2 bits (173), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 38/85 (44%), Positives = 65/85 (76%), Gaps = 4/85 (4%)

Query: 84  SSAEDIPLFGLRR-KLKQAEEEAVDIVREVEKKIETVEERIEASEKEVETTA---GFGGL 139
           + A+DIP+FG+++ K+++A E+A ++V+E E+++E V E ++ + + +E  A   G G +
Sbjct: 74  AQAKDIPIFGIKKMKVEEAVEKAKELVKEGEQEVEIVAETVKETAENLEKIAALEGPGPV 133

Query: 140 AQAGAVAGAELVGLLVATSVVNGIL 164
            QAG VAGAE++G+LVA+SVVNGI+
Sbjct: 134 VQAGVVAGAEVIGVLVASSVVNGIV 158


>gi|116781151|gb|ABK21984.1| unknown [Picea sitchensis]
          Length = 160

 Score = 71.2 bits (173), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 38/85 (44%), Positives = 65/85 (76%), Gaps = 4/85 (4%)

Query: 84  SSAEDIPLFGLRR-KLKQAEEEAVDIVREVEKKIETVEERIEASEKEVETTA---GFGGL 139
           + A+DIP+FG+++ K+++A E+A ++V+E E+++E V E ++ + + +E  A   G G +
Sbjct: 74  AQAKDIPIFGIKKLKVEEAVEKAKELVKEGEQEVEIVAETVKETAENLEKIAALEGPGPV 133

Query: 140 AQAGAVAGAELVGLLVATSVVNGIL 164
            QAG VAGAE++G+LVA+SVVNGI+
Sbjct: 134 VQAGVVAGAEVIGVLVASSVVNGIV 158


>gi|357137590|ref|XP_003570383.1| PREDICTED: uncharacterized protein LOC100827989 [Brachypodium
           distachyon]
          Length = 154

 Score = 48.5 bits (114), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 47/96 (48%), Positives = 59/96 (61%), Gaps = 9/96 (9%)

Query: 81  ESASSAEDIPLFGLRRKLKQAEEEAVDIVREVEKKIETVEERIEASEKEV-------ETT 133
            S ++A  I LFG+R+KL++AEE A + VREVE+      E +  + KEV        T 
Sbjct: 58  SSPANAAGIGLFGIRKKLERAEEAAAEAVREVEEAAVAGGEAVAEAGKEVAGEGLQIATE 117

Query: 134 AGFGG--LAQAGAVAGAELVGLLVATSVVNGILGPE 167
            G  G  L QAGAVAGAE +G+LV  SVVNGIL PE
Sbjct: 118 VGLAGDALVQAGAVAGAEALGVLVGLSVVNGILKPE 153


>gi|116780415|gb|ABK21672.1| unknown [Picea sitchensis]
          Length = 129

 Score = 43.9 bits (102), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 27/82 (32%), Positives = 48/82 (58%), Gaps = 29/82 (35%)

Query: 84  SSAEDIPLFGLRR-KLKQAEEEAVDIVREVEKKIETVEERIEASEKEVETTAGFGGLAQA 142
           + A+DIP+FG+++ K+++A E+A ++V+E E+++E V                       
Sbjct: 74  AQAKDIPIFGIKKMKVEEAVEKAKELVKEGEQEVEIV----------------------- 110

Query: 143 GAVAGAELVGLLVATSVVNGIL 164
                AE++G+LVA+SVVNGI+
Sbjct: 111 -----AEVIGVLVASSVVNGIV 127


>gi|322830926|ref|YP_004210953.1| thermoresistant glucokinase family carbohydrate kinase [Rahnella
           sp. Y9602]
 gi|383188181|ref|YP_005198309.1| thermoresistant glucokinase family carbohydrate kinase [Rahnella
           aquatilis CIP 78.65 = ATCC 33071]
 gi|384256097|ref|YP_005400031.1| gluconokinase [Rahnella aquatilis HX2]
 gi|321166127|gb|ADW71826.1| carbohydrate kinase, thermoresistant glucokinase family [Rahnella
           sp. Y9602]
 gi|371586439|gb|AEX50169.1| carbohydrate kinase, thermoresistant glucokinase family [Rahnella
           aquatilis CIP 78.65 = ATCC 33071]
 gi|380752073|gb|AFE56464.1| gluconokinase [Rahnella aquatilis HX2]
          Length = 171

 Score = 40.4 bits (93), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 36/141 (25%), Positives = 63/141 (44%), Gaps = 15/141 (10%)

Query: 6   KFVT-THLTQKSSLRAMASTIPWSSLSRAPTLLRV---------RNNARPVTCKAHAPKS 55
           KF+    L  +++++ MAS  P +   RAP LLR+         +N    + C A   + 
Sbjct: 30  KFIDGDDLHPRANIQKMASGTPLNDDDRAPWLLRLNDAAYSLRHKNETGIIVCSALKRRY 89

Query: 56  AQALHASRRNLLFFSLT-ALPVLTARESASSAEDIPLFGLRRKLKQAE---EEAVDIVR- 110
             AL      ++F  +  +  V+  R  A +   +P   LR +    E   E+  D++R 
Sbjct: 90  RDALRKDNEGMVFIYMKGSFEVIAERLKARAGHFMPTDLLRSQFDALEEPGEDEPDVLRV 149

Query: 111 EVEKKIETVEERIEASEKEVE 131
            ++ K E V +R  A+ K V+
Sbjct: 150 NIDHKFEGVVDRCIAALKTVQ 170


>gi|326493676|dbj|BAJ85299.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326508947|dbj|BAJ86866.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 153

 Score = 39.7 bits (91), Expect = 0.40,   Method: Compositional matrix adjust.
 Identities = 20/27 (74%), Positives = 22/27 (81%)

Query: 141 QAGAVAGAELVGLLVATSVVNGILGPE 167
           QAG VAGAE +G+LV  SVVNGIL PE
Sbjct: 126 QAGVVAGAEALGVLVGLSVVNGILKPE 152


>gi|336467103|gb|EGO55267.1| hypothetical protein NEUTE1DRAFT_85424 [Neurospora tetrasperma FGSC
           2508]
 gi|350288278|gb|EGZ69514.1| phenylalanine ammonia-lyase [Neurospora tetrasperma FGSC 2509]
          Length = 761

 Score = 38.5 bits (88), Expect = 0.87,   Method: Compositional matrix adjust.
 Identities = 31/99 (31%), Positives = 48/99 (48%), Gaps = 6/99 (6%)

Query: 52  APKS-AQALHASRRNLLFFSLTALPVLTARESASSAEDIPLFGLRRKLKQAEEEAVDIVR 110
           AP S AQA++AS   L     +  P+  +  S + A+ + +      L  A+    D  R
Sbjct: 4   APNSHAQAVYASWARLRHLRESKQPIEVSGSSLTIADVVAV-----SLHGAKAHLSDDTR 58

Query: 111 EVEKKIETVEERIEASEKEVETTAGFGGLAQAGAVAGAE 149
           +V++ I  +EERI+A +       GFGG A     AG+E
Sbjct: 59  QVDRSIALLEERIQAGDVIYGVNTGFGGSADTRTDAGSE 97


>gi|292656403|ref|YP_003536300.1| hypothetical protein HVO_2276 [Haloferax volcanii DS2]
 gi|291370857|gb|ADE03084.1| conserved hypothetical protein [Haloferax volcanii DS2]
          Length = 1397

 Score = 38.5 bits (88), Expect = 0.88,   Method: Composition-based stats.
 Identities = 20/46 (43%), Positives = 30/46 (65%), Gaps = 4/46 (8%)

Query: 87  EDIPLFGLRRKLKQAEEEAVDIVREVEKKIETVEERIEASEKEVET 132
           EDIPL  LR   ++AEE     +RE+E ++ T++E  +A + EVET
Sbjct: 271 EDIPLSELRDTCEKAEEA----IRELEPQLRTLKEEADAEKDEVET 312


>gi|317473372|ref|ZP_07932667.1| excinuclease ABC [Anaerostipes sp. 3_2_56FAA]
 gi|316899208|gb|EFV21227.1| excinuclease ABC [Anaerostipes sp. 3_2_56FAA]
          Length = 627

 Score = 36.2 bits (82), Expect = 4.7,   Method: Composition-based stats.
 Identities = 22/73 (30%), Positives = 41/73 (56%), Gaps = 1/73 (1%)

Query: 89  IPLFGLRRKLKQ-AEEEAVDIVREVEKKIETVEERIEASEKEVETTAGFGGLAQAGAVAG 147
           IPL G + +L + A+E A +++R+ ++K++  E+R   + KE+E   G  GL +  A   
Sbjct: 343 IPLRGEKHRLVELAKENASNVLRQDKEKLKREEKRTIGAVKEIEQLLGIKGLHRMEAFDI 402

Query: 148 AELVGLLVATSVV 160
           + + G     S+V
Sbjct: 403 SNISGFQTVASMV 415


>gi|167745943|ref|ZP_02418070.1| hypothetical protein ANACAC_00638 [Anaerostipes caccae DSM 14662]
 gi|167654458|gb|EDR98587.1| excinuclease ABC, C subunit [Anaerostipes caccae DSM 14662]
          Length = 627

 Score = 36.2 bits (82), Expect = 4.8,   Method: Composition-based stats.
 Identities = 22/73 (30%), Positives = 41/73 (56%), Gaps = 1/73 (1%)

Query: 89  IPLFGLRRKLKQ-AEEEAVDIVREVEKKIETVEERIEASEKEVETTAGFGGLAQAGAVAG 147
           IPL G + +L + A+E A +++R+ ++K++  E+R   + KE+E   G  GL +  A   
Sbjct: 343 IPLRGEKHRLVELAKENASNVLRQDKEKLKREEKRTIGAVKEIEQLLGIKGLHRMEAFDI 402

Query: 148 AELVGLLVATSVV 160
           + + G     S+V
Sbjct: 403 SNISGFQTVASMV 415


>gi|85091095|ref|XP_958734.1| hypothetical protein NCU09391 [Neurospora crassa OR74A]
 gi|28920117|gb|EAA29498.1| conserved hypothetical protein [Neurospora crassa OR74A]
          Length = 762

 Score = 35.8 bits (81), Expect = 5.9,   Method: Compositional matrix adjust.
 Identities = 29/99 (29%), Positives = 48/99 (48%), Gaps = 6/99 (6%)

Query: 52  APKS-AQALHASRRNLLFFSLTALPVLTARESASSAEDIPLFGLRRKLKQAEEEAVDIVR 110
           AP S AQA++AS   L     +  P+  +  + + A+ + +      L  A+    D  +
Sbjct: 4   APNSHAQAVYASWARLRHLKESKQPIEISGSNLTIADVVAV-----SLHGAKAHLSDDTQ 58

Query: 111 EVEKKIETVEERIEASEKEVETTAGFGGLAQAGAVAGAE 149
           +V++ I  +EERI+A +       GFGG A     AG+E
Sbjct: 59  QVDRSIALLEERIQAGDVIYGVNTGFGGSADTRTDAGSE 97


>gi|145350929|ref|XP_001419845.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144580077|gb|ABO98138.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 1321

 Score = 35.8 bits (81), Expect = 6.1,   Method: Compositional matrix adjust.
 Identities = 20/49 (40%), Positives = 31/49 (63%), Gaps = 2/49 (4%)

Query: 79  ARESASSAEDIPLFGLRRKLKQAEEEAVD--IVREVEKKIETVEERIEA 125
           AR+   +AED+    LR ++ QA++  VD  IVR  E K+E +E +I+A
Sbjct: 709 ARDLLHAAEDVDPVKLRERIAQAKDMGVDEDIVRSAETKLENLERQIQA 757


>gi|365877465|ref|ZP_09416969.1| preprotein translocase subunit SecA [Elizabethkingia anophelis Ag1]
 gi|442587740|ref|ZP_21006555.1| preprotein translocase subunit SecA [Elizabethkingia anophelis R26]
 gi|365754898|gb|EHM96833.1| preprotein translocase subunit SecA [Elizabethkingia anophelis Ag1]
 gi|442562594|gb|ELR79814.1| preprotein translocase subunit SecA [Elizabethkingia anophelis R26]
          Length = 1023

 Score = 35.8 bits (81), Expect = 6.8,   Method: Composition-based stats.
 Identities = 18/54 (33%), Positives = 30/54 (55%)

Query: 93  GLRRKLKQAEEEAVDIVREVEKKIETVEERIEASEKEVETTAGFGGLAQAGAVA 146
           GLR K ++ +++  D    V K+IE ++ +IE +E   E  A +G +A    VA
Sbjct: 44  GLREKTREFQQKIQDATANVRKQIEDIQSKIETTENVDEKEALYGQVADLNKVA 97


>gi|255568879|ref|XP_002525410.1| conserved hypothetical protein [Ricinus communis]
 gi|223535301|gb|EEF36977.1| conserved hypothetical protein [Ricinus communis]
          Length = 151

 Score = 35.0 bits (79), Expect = 9.6,   Method: Compositional matrix adjust.
 Identities = 28/85 (32%), Positives = 43/85 (50%), Gaps = 10/85 (11%)

Query: 51  HAPKSAQALHASRRNLLFFSLTALPVLTARESASSAEDIPLFGLRRKLKQA-EEEAVDI- 108
           H+P S  +++     LL  ++++ P        S+A   P+  L   +KQ   EEAVD+ 
Sbjct: 71  HSPSSPTSVYVPNLPLLIPTVSSTP--------STARSAPMLILGWHIKQCLWEEAVDVL 122

Query: 109 VREVEKKIETVEERIEASEKEVETT 133
           VRE+EK  +    R   SE E+ TT
Sbjct: 123 VREIEKGKQEARVRGSKSEGEMGTT 147


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.314    0.127    0.342 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,236,616,156
Number of Sequences: 23463169
Number of extensions: 75938928
Number of successful extensions: 453357
Number of sequences better than 100.0: 131
Number of HSP's better than 100.0 without gapping: 39
Number of HSP's successfully gapped in prelim test: 92
Number of HSP's that attempted gapping in prelim test: 453209
Number of HSP's gapped (non-prelim): 164
length of query: 169
length of database: 8,064,228,071
effective HSP length: 129
effective length of query: 40
effective length of database: 9,332,446,566
effective search space: 373297862640
effective search space used: 373297862640
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 71 (32.0 bits)