BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 044715
         (202 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|255554469|ref|XP_002518273.1| conserved hypothetical protein [Ricinus communis]
 gi|223542493|gb|EEF44033.1| conserved hypothetical protein [Ricinus communis]
          Length = 187

 Score =  208 bits (529), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 118/197 (59%), Positives = 137/197 (69%), Gaps = 19/197 (9%)

Query: 1   MKVKALVAVKTKLFRPCKKLNQIFKFKLRKSFFIRSLRSRSFPTTKRSKPQKSPSRTRIS 60
           MK+KALVA K+KL +PC+KL   F+FK+RK  FIRSLR R+     RSK    P R    
Sbjct: 1   MKMKALVAFKSKLVKPCRKLLLTFRFKIRKPVFIRSLRFRA----SRSKTGTRPVRNWFL 56

Query: 61  ALLPS-FRPPKQWKKMEKLAEEHRSVSENPECDQALFSSPITPVLFPSPITPDYVKACRL 119
            +L S F P KQ ++M +LAE  +SVS+     +          LFPSP+TP Y+ A R 
Sbjct: 57  TVLHSVFCPAKQSREMNRLAE-LKSVSKAEHEKK----------LFPSPLTPAYITASRT 105

Query: 120 SKRDEV---EDVEDACRSFENYLVEMVVEEGKVGDLTDVEELLYCWKNLKCPVFIDLVCR 176
           + R+     EDVEDACRSFENYLVEM+VEEGKV DL DVEELLYCWKNLKCPVFIDLVCR
Sbjct: 106 AMRESFMGSEDVEDACRSFENYLVEMIVEEGKVRDLMDVEELLYCWKNLKCPVFIDLVCR 165

Query: 177 FYGELCKDLFFPEDDED 193
           FYGELCKDLF PEDD D
Sbjct: 166 FYGELCKDLFLPEDDND 182


>gi|297745775|emb|CBI15831.3| unnamed protein product [Vitis vinifera]
          Length = 187

 Score =  184 bits (468), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 107/200 (53%), Positives = 143/200 (71%), Gaps = 15/200 (7%)

Query: 1   MKVKALVAVKTKLFRPCKKLNQIFKFKLRKSFFIRSLRSRSFPTTKRSKPQKSPSRTRIS 60
           MKV+ L+A K+KL  PC+KL  IFKF+ ++ FF+R+L+ R  P  ++S+ +K+P + R +
Sbjct: 1   MKVRTLIAFKSKLLNPCRKLQPIFKFRFKRPFFLRALQVRRRP--RQSETRKTPPKNRFT 58

Query: 61  ALLPSFRPPKQWKKMEKLAEEHRSVSENPECDQALFSSPITPVLFPSPITPDYVKACRLS 120
           ALL  FR  +Q K+M+++  E RS+S+  +  +          LFPSP+TP  +KA    
Sbjct: 59  ALLSLFRFQRQEKEMDRV-RELRSISDAGQERR----------LFPSPLTPASMKAGAAG 107

Query: 121 KR--DEVEDVEDACRSFENYLVEMVVEEGKVGDLTDVEELLYCWKNLKCPVFIDLVCRFY 178
           KR     ED+EDACRSFENYLVEM+VEEGKV DL DVEELLYCWKNLKCPVFIDLVCRFY
Sbjct: 108 KRVASGHEDLEDACRSFENYLVEMIVEEGKVRDLMDVEELLYCWKNLKCPVFIDLVCRFY 167

Query: 179 GELCKDLFFPEDDEDHANLQ 198
           GELCKDLF  +++++  + Q
Sbjct: 168 GELCKDLFSTDEEDNEESTQ 187


>gi|42570337|ref|NP_850144.2| ovate family protein 17 [Arabidopsis thaliana]
 gi|29725750|gb|AAO89198.1| hypothetical protein [Arabidopsis thaliana]
 gi|50058905|gb|AAT69197.1| hypothetical protein At2g30395 [Arabidopsis thaliana]
 gi|330253287|gb|AEC08381.1| ovate family protein 17 [Arabidopsis thaliana]
          Length = 195

 Score =  154 bits (389), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 92/198 (46%), Positives = 118/198 (59%), Gaps = 12/198 (6%)

Query: 1   MKVKA-LVAVKTKLFRPCKKLNQIFKFKLRKSFFIRSLRSRSFPTTKRSKPQKSPSRTRI 59
           M+VKA L+  K+KL + C +   +F+F++++  FIR LR+R      R   Q   S+  I
Sbjct: 1   MRVKATLINFKSKLSKSCNRFVSLFRFRVKRPVFIRPLRARHGNVKPRH--QHHHSKKPI 58

Query: 60  SALLPSFRPPKQWKKMEKLAEEHRSVSENPECDQALFSSPITPV----LFPSPITPDYVK 115
            + L  F    +  KM        S S N +       SP+TP     LF SPIT  Y  
Sbjct: 59  CSCL-CFLNSSKNHKMSNAKHRSSSFSVNDDDYSKFMQSPLTPATAKKLFTSPITTPYSS 117

Query: 116 ACR--LSKRDEVED--VEDACRSFENYLVEMVVEEGKVGDLTDVEELLYCWKNLKCPVFI 171
             R  L+ RD  ED  VEDACRSFENYL+ ++VEEGK+ DL D+EELL+CWKNLK PVFI
Sbjct: 118 RTRKSLNARDTFEDNAVEDACRSFENYLIHLIVEEGKIDDLMDIEELLFCWKNLKSPVFI 177

Query: 172 DLVCRFYGELCKDLFFPE 189
           +LV RFYGELC+DLF  E
Sbjct: 178 ELVSRFYGELCRDLFSGE 195


>gi|449463304|ref|XP_004149374.1| PREDICTED: uncharacterized protein LOC101206169 [Cucumis sativus]
          Length = 184

 Score =  153 bits (387), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 95/203 (46%), Positives = 122/203 (60%), Gaps = 22/203 (10%)

Query: 1   MKVK-ALVAVKTKLFRPCKKLNQIFKFKLRKSFFIRSLRSRSFPTTKRSKPQKSPSRTRI 59
           MKV+  LV+ K+KL +PC KL  +FKF ++K F I+S  +R     + S+    P R   
Sbjct: 1   MKVEVGLVSFKSKLSKPCSKLLHLFKFPMKKPFSIKSFWAR-----RSSRAISKPRRKTW 55

Query: 60  SALLPSFRPPKQWKKMEKLAE--EHRSVSENPECDQALFSSPITPVLFPSPITPDYVKAC 117
           S L    R  ++  KME++ E     SV  + E  + L        LFPSP+     K  
Sbjct: 56  SWL----RWLRRVGKMERVRELKSSESVGSDNESREKL--------LFPSPLIRRRTKVA 103

Query: 118 RLSKRDEVEDVEDACRSFENYLVEMVVEEGKVGDLTDVEELLYCWKNLKCPVFIDLVCRF 177
             +  +E E+VEDACRSFENYLVEM++EEGKV DL DVEELLYCW+ LKCPVF+DLV RF
Sbjct: 104 --ASWEEREEVEDACRSFENYLVEMIIEEGKVRDLMDVEELLYCWRKLKCPVFVDLVSRF 161

Query: 178 YGELCKDLFFPEDDEDHANLQTQ 200
           YGELCKDLF   +     N Q +
Sbjct: 162 YGELCKDLFSSHNQAFTPNFQHK 184


>gi|357491457|ref|XP_003616016.1| hypothetical protein MTR_5g075190 [Medicago truncatula]
 gi|355517351|gb|AES98974.1| hypothetical protein MTR_5g075190 [Medicago truncatula]
          Length = 182

 Score =  152 bits (383), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 93/203 (45%), Positives = 114/203 (56%), Gaps = 38/203 (18%)

Query: 1   MKVKALVAVKTKLFRPCKKLNQIFKFKLRKSFFIRSLRSRSFPTTKRSKPQK-------- 52
           MKVKAL   K+K  +PCKKL   FK K +K  FIR+ R    P                 
Sbjct: 1   MKVKALSIFKSKFLKPCKKLIFFFKLKPKKLIFIRAFRKAKSPKPSSPSKPMSSLLSVFC 60

Query: 53  SPSRTRISALLPSFRPPKQWKKMEKLAEEHRSVSENPECDQALFSSPITPVLFPSPITPD 112
           SP ++R   +L SF+ P   ++                          TP+ FPSP+TP 
Sbjct: 61  SPKKSRDYNMLQSFKSPSNMQE--------------------------TPI-FPSPLTPA 93

Query: 113 YVKACRL-SKRDEV--EDVEDACRSFENYLVEMVVEEGKVGDLTDVEELLYCWKNLKCPV 169
            V+  +    + E   E+VEDACRSFENYL EM+V EGK  DL DVEELLYCWKNLKCPV
Sbjct: 94  CVRKNQTEGSKSEASSEEVEDACRSFENYLAEMIVAEGKTKDLMDVEELLYCWKNLKCPV 153

Query: 170 FIDLVCRFYGELCKDLFFPEDDE 192
           F+DLV RFYGELCKDLF P+++E
Sbjct: 154 FVDLVSRFYGELCKDLFSPDNEE 176


>gi|297826449|ref|XP_002881107.1| ATOFP17/OFP17 [Arabidopsis lyrata subsp. lyrata]
 gi|297326946|gb|EFH57366.1| ATOFP17/OFP17 [Arabidopsis lyrata subsp. lyrata]
          Length = 195

 Score =  151 bits (381), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 91/198 (45%), Positives = 118/198 (59%), Gaps = 12/198 (6%)

Query: 1   MKVKA-LVAVKTKLFRPCKKLNQIFKFKLRKSFFIRSLRSRSFPTTKRSKPQKSPSRTRI 59
           M+VKA L+  K+KL + C +   +F+F++++  FIR LR+R      R +    P +   
Sbjct: 1   MRVKATLINFKSKLSKSCNRFVSLFRFRVKRPVFIRPLRARHGNVKPRHQ-HHHPKKPIC 59

Query: 60  SALLPSFRPPKQWKKMEKLAEEHRSVSENPECDQALFSSPITPV----LFPSPITPDYVK 115
           S L   F    +  KM K      S S N +       SP+TP     LF SPIT  Y  
Sbjct: 60  SCLC--FLSTSKNHKMSKTKPRSSSFSVNDDDYSKFMHSPLTPATAKKLFTSPITTPYSI 117

Query: 116 ACR--LSKRDEVED--VEDACRSFENYLVEMVVEEGKVGDLTDVEELLYCWKNLKCPVFI 171
             R  L+ R+  ED  VEDACRSFENYL+ ++VEEGK+ DL D+EELL+CWKNLK PVFI
Sbjct: 118 RTRKSLNSREAFEDNAVEDACRSFENYLIHLIVEEGKIDDLMDIEELLFCWKNLKSPVFI 177

Query: 172 DLVCRFYGELCKDLFFPE 189
           +LV RFYGELC+DLF  E
Sbjct: 178 ELVSRFYGELCRDLFSGE 195


>gi|449531350|ref|XP_004172649.1| PREDICTED: uncharacterized LOC101206169 [Cucumis sativus]
          Length = 184

 Score =  150 bits (379), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 94/203 (46%), Positives = 121/203 (59%), Gaps = 22/203 (10%)

Query: 1   MKVK-ALVAVKTKLFRPCKKLNQIFKFKLRKSFFIRSLRSRSFPTTKRSKPQKSPSRTRI 59
           MKV+  LV+ K+KL +PC KL  +FKF ++K   I+S  +R     + S+    P R   
Sbjct: 1   MKVEVGLVSFKSKLSKPCSKLLHLFKFPMKKPLSIKSFWAR-----RSSRAISKPRRKTW 55

Query: 60  SALLPSFRPPKQWKKMEKLAE--EHRSVSENPECDQALFSSPITPVLFPSPITPDYVKAC 117
           S L    R  ++  KME++ E     SV  + E  + L        LFPSP+     K  
Sbjct: 56  SWL----RWLRRVGKMERVRELKSSESVDSDNESREKL--------LFPSPLIRRRTKVA 103

Query: 118 RLSKRDEVEDVEDACRSFENYLVEMVVEEGKVGDLTDVEELLYCWKNLKCPVFIDLVCRF 177
             +  +E E+VEDACRSFENYLVEM++EEGKV DL DVEELLYCW+ LKCPVF+DLV RF
Sbjct: 104 --ASWEEREEVEDACRSFENYLVEMIIEEGKVRDLMDVEELLYCWRKLKCPVFVDLVSRF 161

Query: 178 YGELCKDLFFPEDDEDHANLQTQ 200
           YGELCKDLF   +     N Q +
Sbjct: 162 YGELCKDLFSSHNQAFTPNFQHK 184


>gi|224124752|ref|XP_002319413.1| predicted protein [Populus trichocarpa]
 gi|222857789|gb|EEE95336.1| predicted protein [Populus trichocarpa]
          Length = 119

 Score =  145 bits (367), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 72/102 (70%), Positives = 80/102 (78%), Gaps = 2/102 (1%)

Query: 103 VLFPSPITPDYVKACRLSKRDEV--EDVEDACRSFENYLVEMVVEEGKVGDLTDVEELLY 160
           +L+PSP+TP Y+KA   +KR     EDVEDACRSFENYLVEM+VEEGKV DL DVEELLY
Sbjct: 18  MLYPSPLTPAYIKASLATKRQTFGDEDVEDACRSFENYLVEMMVEEGKVRDLADVEELLY 77

Query: 161 CWKNLKCPVFIDLVCRFYGELCKDLFFPEDDEDHANLQTQPK 202
           CWKNLKCPVFI LVCRFYGELCKDLF P+ D    +    PK
Sbjct: 78  CWKNLKCPVFIGLVCRFYGELCKDLFSPDVDNTDVDSPKSPK 119


>gi|224145460|ref|XP_002325650.1| predicted protein [Populus trichocarpa]
 gi|222862525|gb|EEF00032.1| predicted protein [Populus trichocarpa]
          Length = 160

 Score =  135 bits (340), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 89/167 (53%), Positives = 118/167 (70%), Gaps = 18/167 (10%)

Query: 3   VKALVAVKTKLFRPCKKLNQIFKFKLRKSFFIRSLRSRSFPTTKRSKPQKSPSRTRI-SA 61
           +KAL   ++KLF PCKKL  +F+FKL++  FIR L+ R     +  KP+K+P + R+ ++
Sbjct: 1   MKALAVFRSKLFSPCKKLLLLFRFKLKRPVFIRGLQLRR----RSKKPRKAPQKNRVFNS 56

Query: 62  LLPSFRPPKQWKKMEKLAEEHRSVSENPECDQALFSSPITPVLFPSPITPDYVKACRLSK 121
           LL  F P ++ +KM++++E  RSVSE PEC++         +LFPSP+TP Y+KA    K
Sbjct: 57  LLSVFHPLRKSRKMDRVSE-LRSVSE-PECER---------MLFPSPLTPAYIKASLEKK 105

Query: 122 RDEV--EDVEDACRSFENYLVEMVVEEGKVGDLTDVEELLYCWKNLK 166
           R     EDVEDACRSFENYLVEM+VEEG+V DL DVEELLYCWKNLK
Sbjct: 106 RQTFGDEDVEDACRSFENYLVEMMVEEGQVRDLMDVEELLYCWKNLK 152


>gi|125544696|gb|EAY90835.1| hypothetical protein OsI_12440 [Oryza sativa Indica Group]
          Length = 237

 Score =  116 bits (290), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 63/104 (60%), Positives = 72/104 (69%), Gaps = 6/104 (5%)

Query: 100 ITPVLFPSPITPDYVK-ACRL-SKRDEVEDVEDACRSFENYLVEMVVEEGKVGDLTDVEE 157
           + PV  PSP TP YVK   RL S+R   E+ E  CRSFE  L+EM++EEGKV DL DVEE
Sbjct: 126 LAPV--PSPETPAYVKMVARLRSRRSGGEEEEGPCRSFEERLMEMLLEEGKVRDLQDVEE 183

Query: 158 LLYCWKNLKCPVFIDLVCRFYGELCKDLFFP--EDDEDHANLQT 199
           LL CW+ LK PVF++LVCRFYGELCKDLF P  ED  D     T
Sbjct: 184 LLRCWERLKSPVFVELVCRFYGELCKDLFSPGEEDGGDEGASST 227


>gi|79317098|ref|NP_001030983.1| uncharacterized protein [Arabidopsis thaliana]
 gi|98961723|gb|ABF59191.1| unknown protein [Arabidopsis thaliana]
 gi|332189933|gb|AEE28054.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 170

 Score =  106 bits (265), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 52/92 (56%), Positives = 67/92 (72%), Gaps = 4/92 (4%)

Query: 104 LFPSPITPDYVKACRLSKRDEVED--VEDACRSFENYLVEMVVEEGKVGDLTDVEELLYC 161
           +FP  +T    KA R S+R  +++  V++ACR FENY++ M VE+GK+ DL D+EELLY 
Sbjct: 81  MFPLAMTQRNKKA-RTSQRHMLKEDGVKEACRRFENYMIHMFVEDGKIDDLMDMEELLYY 139

Query: 162 WKNLKCPVFIDLVCRFYGELCKDLFFPEDDED 193
           WKNL  P+FIDLV RFYGELC DL FP D +D
Sbjct: 140 WKNLNSPIFIDLVTRFYGELCTDL-FPSDHDD 170


>gi|297843486|ref|XP_002889624.1| hypothetical protein ARALYDRAFT_333988 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297335466|gb|EFH65883.1| hypothetical protein ARALYDRAFT_333988 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 171

 Score =  104 bits (260), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 50/91 (54%), Positives = 68/91 (74%), Gaps = 4/91 (4%)

Query: 104 LFPSPITPDYVKACRLSKRDEVED--VEDACRSFENYLVEMVVEEGKVGDLTDVEELLYC 161
           +FP  +T    KA R S+R  +++  V++AC +FENY++ M+VE+GK+ DL D+EELLY 
Sbjct: 82  MFPLAVTQRNKKA-RTSQRHMLKEDGVKEACTNFENYMIHMIVEDGKLDDLMDMEELLYY 140

Query: 162 WKNLKCPVFIDLVCRFYGELCKDLFFPEDDE 192
           WKNL  P+FIDLV RFYGELC DL FP D++
Sbjct: 141 WKNLNSPIFIDLVTRFYGELCTDL-FPSDND 170


>gi|55168120|gb|AAV43987.1| hypothetical protein [Oryza sativa Japonica Group]
 gi|55168196|gb|AAV44062.1| hypothetical protein [Oryza sativa Japonica Group]
 gi|222631149|gb|EEE63281.1| hypothetical protein OsJ_18091 [Oryza sativa Japonica Group]
          Length = 237

 Score =  104 bits (260), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 60/98 (61%), Positives = 68/98 (69%), Gaps = 4/98 (4%)

Query: 106 PSPITPDYVK-ACRL-SKRDEVEDVEDACRSFENYLVEMVVEEGKVGDLTDVEELLYCWK 163
           PSP TP YVK   RL S+R   E+ E  CRSFE  L+EM++EEGKV D  DVEELL CW+
Sbjct: 130 PSPETPAYVKMVARLRSRRRRGEEEEGPCRSFEERLMEMLLEEGKVRDQQDVEELLRCWE 189

Query: 164 NLKCPVFIDLVCRFYGELCKDLFFP--EDDEDHANLQT 199
            LK PVF++LVCRFYGELCKDLF P  ED  D     T
Sbjct: 190 RLKSPVFVELVCRFYGELCKDLFSPGEEDGGDEGASST 227


>gi|242051288|ref|XP_002463388.1| hypothetical protein SORBIDRAFT_02g042940 [Sorghum bicolor]
 gi|241926765|gb|EER99909.1| hypothetical protein SORBIDRAFT_02g042940 [Sorghum bicolor]
          Length = 222

 Score =  100 bits (249), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 47/67 (70%), Positives = 52/67 (77%)

Query: 127 DVEDACRSFENYLVEMVVEEGKVGDLTDVEELLYCWKNLKCPVFIDLVCRFYGELCKDLF 186
           DVE ACRSFE +LVEM+VEE KV DLTDVEELL CW+ L+CP F+ LV RFYGELC DLF
Sbjct: 142 DVEAACRSFERHLVEMLVEERKVMDLTDVEELLCCWEQLRCPAFVRLVGRFYGELCMDLF 201

Query: 187 FPEDDED 193
               D D
Sbjct: 202 SSAQDAD 208


>gi|242087471|ref|XP_002439568.1| hypothetical protein SORBIDRAFT_09g013380 [Sorghum bicolor]
 gi|241944853|gb|EES17998.1| hypothetical protein SORBIDRAFT_09g013380 [Sorghum bicolor]
          Length = 230

 Score = 97.4 bits (241), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 54/105 (51%), Positives = 68/105 (64%), Gaps = 13/105 (12%)

Query: 101 TPVLFP--SPITPDYVK----------ACRLSKRDEVEDVEDACRSFENYLVEMVVEEGK 148
           TPV+ P  SP TP Y+K          A    ++++ +  E+ACRSFE  L+EM+VEEGK
Sbjct: 109 TPVVAPVSSPETPAYMKVVARLRLERSAASAGRQEDGDKEEEACRSFETCLMEMLVEEGK 168

Query: 149 VGDLTDVEELLYCWKNLKCPVFIDLVCRFYGELCKDLF-FPEDDE 192
             DL DVEELL CW+ LK PVF++LVCRFY  LC  +F  P DDE
Sbjct: 169 TRDLQDVEELLRCWERLKSPVFVELVCRFYDGLCNGMFPAPGDDE 213


>gi|414888108|tpg|DAA64122.1| TPA: hypothetical protein ZEAMMB73_869574 [Zea mays]
          Length = 217

 Score = 95.5 bits (236), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 45/76 (59%), Positives = 54/76 (71%)

Query: 118 RLSKRDEVEDVEDACRSFENYLVEMVVEEGKVGDLTDVEELLYCWKNLKCPVFIDLVCRF 177
           R+      +DVE ACRSFE +L+E +VEE +V DLTDVEELL CW+ L+CP  + LV RF
Sbjct: 131 RVGHGHGHDDVEAACRSFERHLMETLVEEREVVDLTDVEELLCCWEKLRCPALVRLVGRF 190

Query: 178 YGELCKDLFFPEDDED 193
           YGELC DLF P  D D
Sbjct: 191 YGELCMDLFSPAQDAD 206


>gi|413949384|gb|AFW82033.1| hypothetical protein ZEAMMB73_537791 [Zea mays]
          Length = 216

 Score = 94.0 bits (232), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 53/104 (50%), Positives = 67/104 (64%), Gaps = 18/104 (17%)

Query: 101 TPVLFPSPI----TPDYVKAC------RLSKRDE--------VEDVEDACRSFENYLVEM 142
           TPV+  +P+    TP Y+K        R++ R           E+ E+ACRSFE+YL+EM
Sbjct: 109 TPVIAVAPVSSTETPAYMKVVARLRSERIAARGGGEDEDKEKEEEEEEACRSFESYLMEM 168

Query: 143 VVEEGKVGDLTDVEELLYCWKNLKCPVFIDLVCRFYGELCKDLF 186
           +VEEGK  DL DVEELL CW+ L+ PVFI+LV  FYGELC DLF
Sbjct: 169 LVEEGKARDLKDVEELLQCWERLRSPVFIELVWSFYGELCNDLF 212


>gi|33146438|dbj|BAC79546.1| hypothetical protein [Oryza sativa Japonica Group]
 gi|50509520|dbj|BAD31213.1| hypothetical protein [Oryza sativa Japonica Group]
 gi|125559611|gb|EAZ05147.1| hypothetical protein OsI_27341 [Oryza sativa Indica Group]
          Length = 209

 Score = 92.4 bits (228), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 43/65 (66%), Positives = 51/65 (78%)

Query: 126 EDVEDACRSFENYLVEMVVEEGKVGDLTDVEELLYCWKNLKCPVFIDLVCRFYGELCKDL 185
           + VE+ACRSFE +L+EM+VEE KV DL DVEELL CW+ L+ PVF+ LV RFYGELC DL
Sbjct: 133 DGVEEACRSFEKHLMEMLVEERKVRDLMDVEELLCCWEKLRSPVFVQLVGRFYGELCMDL 192

Query: 186 FFPED 190
           F   D
Sbjct: 193 FSGRD 197


>gi|125601517|gb|EAZ41093.1| hypothetical protein OsJ_25585 [Oryza sativa Japonica Group]
          Length = 209

 Score = 92.0 bits (227), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 43/65 (66%), Positives = 51/65 (78%)

Query: 126 EDVEDACRSFENYLVEMVVEEGKVGDLTDVEELLYCWKNLKCPVFIDLVCRFYGELCKDL 185
           + VE+ACRSFE +L+EM+VEE KV DL DVEELL CW+ L+ PVF+ LV RFYGELC DL
Sbjct: 133 DGVEEACRSFEKHLMEMLVEERKVRDLMDVEELLCCWEKLRSPVFVQLVGRFYGELCMDL 192

Query: 186 FFPED 190
           F   D
Sbjct: 193 FSGRD 197


>gi|294464704|gb|ADE77859.1| unknown [Picea sitchensis]
          Length = 260

 Score = 87.8 bits (216), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 39/73 (53%), Positives = 51/73 (69%), Gaps = 5/73 (6%)

Query: 127 DVEDACRSFENYLVEMVVEEGKVGDLTDVEELLYCWKNLKCPVFIDLVCRFYGELCKDLF 186
           DV++AC+ F   L+EMV E+  V DL DVEELLYC++NL CPV+ DLV  FY E+C D+F
Sbjct: 181 DVDNACKQFRKCLIEMVAEDKNVRDLMDVEELLYCYQNLTCPVYRDLVSHFYAEICSDIF 240

Query: 187 FPE-----DDEDH 194
            P      DDE++
Sbjct: 241 LPSASSSFDDEEN 253


>gi|118482746|gb|ABK93291.1| unknown [Populus trichocarpa]
          Length = 154

 Score = 77.0 bits (188), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 39/64 (60%), Positives = 46/64 (71%), Gaps = 2/64 (3%)

Query: 103 VLFPSPITPDYVKACRLSKRDEV--EDVEDACRSFENYLVEMVVEEGKVGDLTDVEELLY 160
           +L+PSP+TP Y+KA   +KR     EDVEDACRSFENYLVEM+VEEGKV DL DV+    
Sbjct: 18  MLYPSPLTPAYIKASLATKRQTFGDEDVEDACRSFENYLVEMMVEEGKVRDLADVKSFCI 77

Query: 161 CWKN 164
             K 
Sbjct: 78  AGKT 81


>gi|357468391|ref|XP_003604480.1| ATP-dependent RNA helicase chl1 [Medicago truncatula]
 gi|355505535|gb|AES86677.1| ATP-dependent RNA helicase chl1 [Medicago truncatula]
          Length = 1129

 Score = 43.5 bits (101), Expect = 0.046,   Method: Compositional matrix adjust.
 Identities = 17/47 (36%), Positives = 31/47 (65%)

Query: 143  VVEEGKVGDLTDVEELLYCWKNLKCPVFIDLVCRFYGELCKDLFFPE 189
            ++E G +  + D+EE L+ +  LK PV++D+V RF+ ++  D  FP+
Sbjct: 1063 MMESGDLEHVLDIEEALHYYSRLKSPVYLDIVDRFFIDINSDFTFPQ 1109


>gi|168041447|ref|XP_001773203.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162675562|gb|EDQ62056.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 958

 Score = 43.1 bits (100), Expect = 0.058,   Method: Compositional matrix adjust.
 Identities = 20/58 (34%), Positives = 33/58 (56%), Gaps = 1/58 (1%)

Query: 130 DACRSFENYLVEMVVEEGKVGDLT-DVEELLYCWKNLKCPVFIDLVCRFYGELCKDLF 186
           DA  +F+N +  M++E   V +L  DVE+ +  +  L  P +IDL+  F+  +C D F
Sbjct: 865 DAVSAFQNEVARMMLEGADVPELNMDVEQFIAGYMKLNSPFYIDLIEDFFKSVCMDCF 922


>gi|302759769|ref|XP_002963307.1| hypothetical protein SELMODRAFT_405091 [Selaginella moellendorffii]
 gi|300168575|gb|EFJ35178.1| hypothetical protein SELMODRAFT_405091 [Selaginella moellendorffii]
          Length = 622

 Score = 42.0 bits (97), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 25/54 (46%), Positives = 33/54 (61%), Gaps = 3/54 (5%)

Query: 139 LVEMVVEEGKVGDLTDVEELLYCWKNLKCPVFIDLVCRFYGELCKDLFFPEDDE 192
           +VEMVV +    DL D+EE L  +  L CPVF  +V RF+ E C +  F ED+E
Sbjct: 539 VVEMVVAD-SFDDLVDLEEFLGYYMLLNCPVFRAMVQRFFSEFCVE--FCEDEE 589


>gi|168008445|ref|XP_001756917.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162691788|gb|EDQ78148.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 936

 Score = 42.0 bits (97), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 21/59 (35%), Positives = 33/59 (55%), Gaps = 1/59 (1%)

Query: 129 EDACRSFENYLVEMVVEEGKVGDLT-DVEELLYCWKNLKCPVFIDLVCRFYGELCKDLF 186
            DA  SF+N +  M++E   V +L  DVE+ +  +  L  P +IDL+  F+  +C D F
Sbjct: 841 NDAVTSFQNEVARMMLEGVDVPNLNMDVEQFIAGYMKLNSPFYIDLIEDFFKSVCIDCF 899


>gi|302759755|ref|XP_002963300.1| hypothetical protein SELMODRAFT_405078 [Selaginella moellendorffii]
 gi|300168568|gb|EFJ35171.1| hypothetical protein SELMODRAFT_405078 [Selaginella moellendorffii]
          Length = 509

 Score = 42.0 bits (97), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 25/54 (46%), Positives = 33/54 (61%), Gaps = 3/54 (5%)

Query: 139 LVEMVVEEGKVGDLTDVEELLYCWKNLKCPVFIDLVCRFYGELCKDLFFPEDDE 192
           +VEMVV +    DL D+EE L  +  L CPVF  +V RF+ E C +  F ED+E
Sbjct: 426 VVEMVVAD-SFDDLVDLEEFLGYYMLLNCPVFRAMVQRFFSEFCVE--FCEDEE 476


>gi|356527780|ref|XP_003532485.1| PREDICTED: uncharacterized protein LOC100811016 [Glycine max]
          Length = 222

 Score = 40.8 bits (94), Expect = 0.32,   Method: Compositional matrix adjust.
 Identities = 19/56 (33%), Positives = 35/56 (62%), Gaps = 6/56 (10%)

Query: 137 NYLVEMVVEEGKVGDLTDV------EELLYCWKNLKCPVFIDLVCRFYGELCKDLF 186
           N +VE  + E ++ D++DV      EE+L+ +  L CPV++ +V +F+ E+C + F
Sbjct: 148 NRMVEKKLRELEMLDMSDVDYVLDIEEVLHYYSRLTCPVYLQIVDKFFMEMCSEFF 203


>gi|168038473|ref|XP_001771725.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162677032|gb|EDQ63508.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 909

 Score = 40.4 bits (93), Expect = 0.44,   Method: Compositional matrix adjust.
 Identities = 20/58 (34%), Positives = 32/58 (55%), Gaps = 1/58 (1%)

Query: 130 DACRSFENYLVEMVVEEGKVGDLT-DVEELLYCWKNLKCPVFIDLVCRFYGELCKDLF 186
           DA  SF++ +  M++E     DL  DVE+ +  +  L  P +IDL+  F+  +C D F
Sbjct: 815 DAVTSFQSEVARMMLEGVDAPDLNMDVEQFIAGYMKLNSPFYIDLIEDFFKSVCIDCF 872


>gi|388513621|gb|AFK44872.1| unknown [Lotus japonicus]
          Length = 209

 Score = 40.0 bits (92), Expect = 0.51,   Method: Compositional matrix adjust.
 Identities = 20/70 (28%), Positives = 39/70 (55%), Gaps = 3/70 (4%)

Query: 120 SKRDEVEDVEDACRSFENYLVEMVVEEGKVGDLTDVEELLYCWKNLKCPVFIDLVCRFYG 179
           SK    E V D  +  +   +EM  + G V  + D+EE+L+ +  L  PV++D+V +F+ 
Sbjct: 124 SKNKNEEGVHDLAQKMKE--LEMA-DSGDVEHVLDIEEVLHYYSRLTSPVYLDIVDKFFT 180

Query: 180 ELCKDLFFPE 189
           ++  ++  P+
Sbjct: 181 DMHSEINLPQ 190


>gi|357468387|ref|XP_003604478.1| hypothetical protein MTR_4g012880 [Medicago truncatula]
 gi|355505533|gb|AES86675.1| hypothetical protein MTR_4g012880 [Medicago truncatula]
          Length = 147

 Score = 39.7 bits (91), Expect = 0.76,   Method: Compositional matrix adjust.
 Identities = 15/47 (31%), Positives = 30/47 (63%)

Query: 143 VVEEGKVGDLTDVEELLYCWKNLKCPVFIDLVCRFYGELCKDLFFPE 189
           +++ G +  + D+EE L+ +  LK PV++D+V RF+ ++  D   P+
Sbjct: 17  MMDSGDLEHVLDIEEALHYYSRLKSPVYLDIVNRFFIDINSDFTLPQ 63


>gi|224146631|ref|XP_002326076.1| predicted protein [Populus trichocarpa]
 gi|222862951|gb|EEF00458.1| predicted protein [Populus trichocarpa]
          Length = 386

 Score = 39.3 bits (90), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 18/56 (32%), Positives = 33/56 (58%), Gaps = 1/56 (1%)

Query: 130 DACRSFENYLVEMVVEEGKVGDLTDVEELLYCWKNLKCPVFIDLVCRFYGELCKDL 185
           D  R F++ +VEM+VE   + D  D+EELL C+ +L    + D + + + ++  D+
Sbjct: 327 DPQRDFKDSMVEMIVENN-IQDSKDLEELLACYLSLNSKKYHDFIIKAFEQIWFDM 381


>gi|356520569|ref|XP_003528934.1| PREDICTED: uncharacterized protein LOC100795198 [Glycine max]
          Length = 239

 Score = 38.9 bits (89), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 18/61 (29%), Positives = 34/61 (55%), Gaps = 6/61 (9%)

Query: 128 VEDACRSFENYLVEMVVEEGKVGDLTDVEELLYCWKNLKCPVFIDLVCRFYGELCKDLFF 187
           VE   R  E      +++   V  + D+EE+L+ +  L CP ++++V +F+ E+  +LF 
Sbjct: 166 VEQKLRELE------MLDMNNVEYVLDIEEVLHYYSRLTCPAYLEIVDKFFLEMYSELFG 219

Query: 188 P 188
           P
Sbjct: 220 P 220


>gi|449530203|ref|XP_004172085.1| PREDICTED: uncharacterized protein LOC101229726 [Cucumis sativus]
          Length = 325

 Score = 38.5 bits (88), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 17/61 (27%), Positives = 35/61 (57%), Gaps = 1/61 (1%)

Query: 125 VEDVEDACRSFENYLVEMVVEEGKVGDLTDVEELLYCWKNLKCPVFIDLVCRFYGELCKD 184
           V+  +D  R F   +VEM+VE  K+    ++E+LL C+ +L    + D++ + + ++  D
Sbjct: 258 VKSTKDPQRDFRESMVEMIVE-NKISGSNELEDLLACYLSLNTDEYHDIIVKVFKQIWFD 316

Query: 185 L 185
           +
Sbjct: 317 M 317


>gi|449439817|ref|XP_004137682.1| PREDICTED: uncharacterized protein LOC101203459 [Cucumis sativus]
          Length = 325

 Score = 38.5 bits (88), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 17/61 (27%), Positives = 35/61 (57%), Gaps = 1/61 (1%)

Query: 125 VEDVEDACRSFENYLVEMVVEEGKVGDLTDVEELLYCWKNLKCPVFIDLVCRFYGELCKD 184
           V+  +D  R F   +VEM+VE  K+    ++E+LL C+ +L    + D++ + + ++  D
Sbjct: 258 VKSTKDPQRDFRESMVEMIVE-NKISGSNELEDLLACYLSLNTDEYHDIIVKVFKQIWFD 316

Query: 185 L 185
           +
Sbjct: 317 M 317


>gi|168007386|ref|XP_001756389.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162692428|gb|EDQ78785.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 960

 Score = 38.5 bits (88), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 17/59 (28%), Positives = 36/59 (61%), Gaps = 1/59 (1%)

Query: 129 EDACRSFENYLVEMVVEEGKV-GDLTDVEELLYCWKNLKCPVFIDLVCRFYGELCKDLF 186
           ++A   F + + +M++EEGK   ++ DV+E L  + +L+ P ++ +V  F+  +C D +
Sbjct: 826 QEAVAVFHDDVAQMMLEEGKYCSEVNDVDEFLEGYMSLRSPFYLAMVDDFFRAVCVDSY 884


>gi|168028431|ref|XP_001766731.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162681940|gb|EDQ68362.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 995

 Score = 38.5 bits (88), Expect = 1.7,   Method: Composition-based stats.
 Identities = 18/59 (30%), Positives = 35/59 (59%), Gaps = 1/59 (1%)

Query: 129 EDACRSFENYLVEMVVEEGK-VGDLTDVEELLYCWKNLKCPVFIDLVCRFYGELCKDLF 186
           ++A   F + +  M++EEG+  G+  DVEE L  + +L+ P ++ +V  F+  +C D +
Sbjct: 899 QEATAVFHDEVARMMLEEGQDCGEAHDVEEFLQGYMSLRSPFYLSMVDNFFRAVCVDCY 957


>gi|255546491|ref|XP_002514305.1| conserved hypothetical protein [Ricinus communis]
 gi|223546761|gb|EEF48259.1| conserved hypothetical protein [Ricinus communis]
          Length = 230

 Score = 38.5 bits (88), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 20/69 (28%), Positives = 38/69 (55%), Gaps = 4/69 (5%)

Query: 120 SKRDEVEDVEDACRSFENYLVEMVVEEGKVGDLTDVEELLYCWKNLKCPVFIDLVCRFYG 179
           SKRDE      A +  E  +++M      V  + D+EE+L+ +  L CP ++++V +F+ 
Sbjct: 150 SKRDEERSFLVAQKLKELEMMDM----SNVDHVLDIEEVLHYYSRLTCPAYLEIVDKFFM 205

Query: 180 ELCKDLFFP 188
           ++  + F P
Sbjct: 206 DMYAEFFGP 214


>gi|168001982|ref|XP_001753693.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162695100|gb|EDQ81445.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 796

 Score = 37.4 bits (85), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 15/59 (25%), Positives = 35/59 (59%)

Query: 129 EDACRSFENYLVEMVVEEGKVGDLTDVEELLYCWKNLKCPVFIDLVCRFYGELCKDLFF 187
           ++A  +F++ +  M++EE    ++ D+EE L  + +LK P ++++V  F+  +  D + 
Sbjct: 736 KEAVAAFQDDIAHMMLEEQDSDEVADIEEFLDGYMHLKSPFYLEVVEEFFRAISYDCYH 794


>gi|336408299|ref|ZP_08588793.1| hypothetical protein HMPREF1018_00808 [Bacteroides sp. 2_1_56FAA]
 gi|335939599|gb|EGN01473.1| hypothetical protein HMPREF1018_00808 [Bacteroides sp. 2_1_56FAA]
          Length = 1512

 Score = 37.4 bits (85), Expect = 4.0,   Method: Composition-based stats.
 Identities = 25/82 (30%), Positives = 38/82 (46%), Gaps = 4/82 (4%)

Query: 96   FSSPITPVLFPSPITPDYVKACRLSKRDEVEDVEDACRSFENYLVEMVVEEGKVGDLTDV 155
            F S + P L    +T     A  L K  +V++ ++   S + YL +   E+G   D TD 
Sbjct: 1172 FLSKVAPSLSQQSVTEKAWSAIVLQKAGKVKEAQEFMASLKEYLTQ-TDEQGMFFDRTDS 1230

Query: 156  EELLYCWKNLKCPVFIDLVCRF 177
                Y W NLK P  +D++  F
Sbjct: 1231 P---YAWNNLKVPAHVDVMEAF 1249


>gi|356556188|ref|XP_003546408.1| PREDICTED: uncharacterized protein LOC100802301 [Glycine max]
          Length = 217

 Score = 37.0 bits (84), Expect = 4.5,   Method: Compositional matrix adjust.
 Identities = 17/70 (24%), Positives = 39/70 (55%), Gaps = 4/70 (5%)

Query: 116 ACRLSKR---DEVEDVEDACRSFENYLVEM-VVEEGKVGDLTDVEELLYCWKNLKCPVFI 171
           +C L K+   +  + + +        + E+ +++ G V  + D+EE L+ +  LK PV++
Sbjct: 121 SCELGKKECLNNSKSINEGVHDLTQKMKELEMMDTGDVEHVLDIEEALHYYSRLKSPVYL 180

Query: 172 DLVCRFYGEL 181
           D+V +F+ ++
Sbjct: 181 DIVDKFFMDM 190


>gi|423248752|ref|ZP_17229768.1| hypothetical protein HMPREF1066_00778 [Bacteroides fragilis
            CL03T00C08]
 gi|423253701|ref|ZP_17234632.1| hypothetical protein HMPREF1067_01276 [Bacteroides fragilis
            CL03T12C07]
 gi|392655330|gb|EIY48973.1| hypothetical protein HMPREF1067_01276 [Bacteroides fragilis
            CL03T12C07]
 gi|392657693|gb|EIY51324.1| hypothetical protein HMPREF1066_00778 [Bacteroides fragilis
            CL03T00C08]
          Length = 1926

 Score = 37.0 bits (84), Expect = 4.5,   Method: Composition-based stats.
 Identities = 25/82 (30%), Positives = 38/82 (46%), Gaps = 4/82 (4%)

Query: 96   FSSPITPVLFPSPITPDYVKACRLSKRDEVEDVEDACRSFENYLVEMVVEEGKVGDLTDV 155
            F S + P L    +T     A  L K  +V++ ++   S + YL +   E+G   D TD 
Sbjct: 1586 FLSKVAPSLSQQSVTEKAWSAIVLQKAGKVKEAQEFMASLKEYLTQ-TDEQGMFFDRTDS 1644

Query: 156  EELLYCWKNLKCPVFIDLVCRF 177
                Y W NLK P  +D++  F
Sbjct: 1645 P---YAWNNLKVPAHVDVMEAF 1663


>gi|60680272|ref|YP_210416.1| hypothetical protein BF0713 [Bacteroides fragilis NCTC 9343]
 gi|60491706|emb|CAH06460.1| conserved hypothetical protein [Bacteroides fragilis NCTC 9343]
          Length = 1926

 Score = 37.0 bits (84), Expect = 4.6,   Method: Composition-based stats.
 Identities = 25/82 (30%), Positives = 38/82 (46%), Gaps = 4/82 (4%)

Query: 96   FSSPITPVLFPSPITPDYVKACRLSKRDEVEDVEDACRSFENYLVEMVVEEGKVGDLTDV 155
            F S + P L    +T     A  L K  +V++ ++   S + YL +   E+G   D TD 
Sbjct: 1586 FLSKVAPSLSQQSVTEKAWSAIVLQKAGKVKEAQEFMASLKEYLTQ-TDEQGMFFDRTDS 1644

Query: 156  EELLYCWKNLKCPVFIDLVCRF 177
                Y W NLK P  +D++  F
Sbjct: 1645 P---YAWNNLKVPAHVDVMEAF 1663


>gi|53712079|ref|YP_098071.1| hypothetical protein BF0786 [Bacteroides fragilis YCH46]
 gi|52214944|dbj|BAD47537.1| conserved hypothetical protein [Bacteroides fragilis YCH46]
          Length = 1926

 Score = 37.0 bits (84), Expect = 4.6,   Method: Composition-based stats.
 Identities = 25/82 (30%), Positives = 38/82 (46%), Gaps = 4/82 (4%)

Query: 96   FSSPITPVLFPSPITPDYVKACRLSKRDEVEDVEDACRSFENYLVEMVVEEGKVGDLTDV 155
            F S + P L    +T     A  L K  +V++ ++   S + YL +   E+G   D TD 
Sbjct: 1586 FLSKVAPSLSQQSVTEKAWSAIVLQKAGKVKEAQEFMASLKEYLTQ-TDEQGMFFDRTDS 1644

Query: 156  EELLYCWKNLKCPVFIDLVCRF 177
                Y W NLK P  +D++  F
Sbjct: 1645 P---YAWNNLKVPAHVDVMEAF 1663


>gi|423269315|ref|ZP_17248287.1| hypothetical protein HMPREF1079_01369 [Bacteroides fragilis
            CL05T00C42]
 gi|423273122|ref|ZP_17252069.1| hypothetical protein HMPREF1080_00722 [Bacteroides fragilis
            CL05T12C13]
 gi|392701109|gb|EIY94269.1| hypothetical protein HMPREF1079_01369 [Bacteroides fragilis
            CL05T00C42]
 gi|392708154|gb|EIZ01262.1| hypothetical protein HMPREF1080_00722 [Bacteroides fragilis
            CL05T12C13]
          Length = 1926

 Score = 37.0 bits (84), Expect = 4.6,   Method: Composition-based stats.
 Identities = 25/82 (30%), Positives = 38/82 (46%), Gaps = 4/82 (4%)

Query: 96   FSSPITPVLFPSPITPDYVKACRLSKRDEVEDVEDACRSFENYLVEMVVEEGKVGDLTDV 155
            F S + P L    +T     A  L K  +V++ ++   S + YL +   E+G   D TD 
Sbjct: 1586 FLSKVAPSLSQQSVTEKAWSAIVLQKAGKVKEAQEFMASLKEYLTQ-TDEQGMFFDRTDS 1644

Query: 156  EELLYCWKNLKCPVFIDLVCRF 177
                Y W NLK P  +D++  F
Sbjct: 1645 P---YAWNNLKVPAHVDVMEAF 1663


>gi|423258926|ref|ZP_17239849.1| hypothetical protein HMPREF1055_02126 [Bacteroides fragilis
            CL07T00C01]
 gi|423264103|ref|ZP_17243106.1| hypothetical protein HMPREF1056_00793 [Bacteroides fragilis
            CL07T12C05]
 gi|387776506|gb|EIK38606.1| hypothetical protein HMPREF1055_02126 [Bacteroides fragilis
            CL07T00C01]
 gi|392706369|gb|EIY99492.1| hypothetical protein HMPREF1056_00793 [Bacteroides fragilis
            CL07T12C05]
          Length = 1926

 Score = 37.0 bits (84), Expect = 4.6,   Method: Composition-based stats.
 Identities = 25/82 (30%), Positives = 38/82 (46%), Gaps = 4/82 (4%)

Query: 96   FSSPITPVLFPSPITPDYVKACRLSKRDEVEDVEDACRSFENYLVEMVVEEGKVGDLTDV 155
            F S + P L    +T     A  L K  +V++ ++   S + YL +   E+G   D TD 
Sbjct: 1586 FLSKVAPSLSQQSVTEKAWSAIVLQKAGKVKEAQEFMASLKEYLTQ-TDEQGMFFDRTDS 1644

Query: 156  EELLYCWKNLKCPVFIDLVCRF 177
                Y W NLK P  +D++  F
Sbjct: 1645 P---YAWNNLKVPAHVDVMEAF 1663


>gi|375357118|ref|YP_005109890.1| hypothetical protein BF638R_0756 [Bacteroides fragilis 638R]
 gi|301161799|emb|CBW21339.1| conserved hypothetical protein [Bacteroides fragilis 638R]
          Length = 1926

 Score = 37.0 bits (84), Expect = 4.6,   Method: Composition-based stats.
 Identities = 25/82 (30%), Positives = 38/82 (46%), Gaps = 4/82 (4%)

Query: 96   FSSPITPVLFPSPITPDYVKACRLSKRDEVEDVEDACRSFENYLVEMVVEEGKVGDLTDV 155
            F S + P L    +T     A  L K  +V++ ++   S + YL +   E+G   D TD 
Sbjct: 1586 FLSKVAPSLSQQSVTEKAWSAIVLQKAGKVKEAQEFMASLKEYLTQ-TDEQGMFFDRTDS 1644

Query: 156  EELLYCWKNLKCPVFIDLVCRF 177
                Y W NLK P  +D++  F
Sbjct: 1645 P---YAWNNLKVPAHVDVMEAF 1663


>gi|383117040|ref|ZP_09937787.1| hypothetical protein BSHG_0859 [Bacteroides sp. 3_2_5]
 gi|251947658|gb|EES87940.1| hypothetical protein BSHG_0859 [Bacteroides sp. 3_2_5]
          Length = 1926

 Score = 37.0 bits (84), Expect = 4.6,   Method: Composition-based stats.
 Identities = 25/82 (30%), Positives = 38/82 (46%), Gaps = 4/82 (4%)

Query: 96   FSSPITPVLFPSPITPDYVKACRLSKRDEVEDVEDACRSFENYLVEMVVEEGKVGDLTDV 155
            F S + P L    +T     A  L K  +V++ ++   S + YL +   E+G   D TD 
Sbjct: 1586 FLSKVAPSLSQQSVTEKAWSAIVLQKAGKVKEAQEFMASLKEYLTQ-TDEQGMFFDRTDS 1644

Query: 156  EELLYCWKNLKCPVFIDLVCRF 177
                Y W NLK P  +D++  F
Sbjct: 1645 P---YAWNNLKVPAHVDVMEAF 1663


>gi|423282004|ref|ZP_17260889.1| hypothetical protein HMPREF1204_00427 [Bacteroides fragilis HMW 615]
 gi|404582491|gb|EKA87185.1| hypothetical protein HMPREF1204_00427 [Bacteroides fragilis HMW 615]
          Length = 1926

 Score = 37.0 bits (84), Expect = 4.7,   Method: Composition-based stats.
 Identities = 25/82 (30%), Positives = 38/82 (46%), Gaps = 4/82 (4%)

Query: 96   FSSPITPVLFPSPITPDYVKACRLSKRDEVEDVEDACRSFENYLVEMVVEEGKVGDLTDV 155
            F S + P L    +T     A  L K  +V++ ++   S + YL +   E+G   D TD 
Sbjct: 1586 FLSKVAPSLSQQSVTEKAWSAIVLQKAGKVKEAQEFMASLKEYLTQ-TDEQGMFFDRTDS 1644

Query: 156  EELLYCWKNLKCPVFIDLVCRF 177
                Y W NLK P  +D++  F
Sbjct: 1645 P---YAWNNLKVPAHVDVMEAF 1663


>gi|357521019|ref|XP_003630798.1| hypothetical protein MTR_8g103530 [Medicago truncatula]
 gi|355524820|gb|AET05274.1| hypothetical protein MTR_8g103530 [Medicago truncatula]
          Length = 203

 Score = 37.0 bits (84), Expect = 4.7,   Method: Compositional matrix adjust.
 Identities = 38/172 (22%), Positives = 72/172 (41%), Gaps = 28/172 (16%)

Query: 34  IRSLRSRSFPTTKRSKPQKSPSRTRISALLPSFRPPKQWKKMEKLAEEHRSVSENPECDQ 93
            RSL SRS+    ++ P    S   +      F   ++W       +  +  S+    ++
Sbjct: 21  FRSLFSRSYQKIPKTPPHNHSSYNDLEKFYSDF--TEKW-------DSEKGKSKKRRKNK 71

Query: 94  ALFSSPITPV---LFPSPITPDYVKACRLSKRDEVEDVE---------DACRSF------ 135
           A+ SS  T     L       + V      +++E+E  E         D+   F      
Sbjct: 72  AMLSSSTTKQEEELHVHNHNENMVALNNAKEKNEMEKSEELDGEQCKQDSSSMFNSKSMV 131

Query: 136 ENYLVEM-VVEEGKVGDLTDVEELLYCWKNLKCPVFIDLVCRFYGELCKDLF 186
           E  L E+ ++E   V  + D+EE+L+ +  L CP ++++V +F+ E+  + F
Sbjct: 132 EKKLKELEMLEMSNVDYILDIEEVLHYYSRLTCPSYLEIVDKFFVEMYSEFF 183


>gi|404494779|ref|YP_006718885.1| hypothetical protein Pcar_3089 [Pelobacter carbinolicus DSM 2380]
 gi|77546762|gb|ABA90324.1| hypothetical protein Pcar_3089 [Pelobacter carbinolicus DSM 2380]
          Length = 204

 Score = 37.0 bits (84), Expect = 5.2,   Method: Compositional matrix adjust.
 Identities = 22/71 (30%), Positives = 34/71 (47%), Gaps = 1/71 (1%)

Query: 78  LAEEHRSVSENPECDQALFSSPITPVLFPSPITPDYV-KACRLSKRDEVEDVEDACRSFE 136
           L  E   +     CDQ LF   +   LF  P++ D   K  R  ++ ++ D E   R+  
Sbjct: 11  LFSEELGIDLQSSCDQELFKWLLAATLFGGPVSRDIARKTYRTFQKYQLLDPESISRAGW 70

Query: 137 NYLVEMVVEEG 147
           +YLV  V++EG
Sbjct: 71  SYLVHPVMQEG 81


>gi|440913504|gb|ELR62953.1| ATP-binding cassette sub-family A member 3, partial [Bos grunniens
           mutus]
          Length = 1715

 Score = 36.6 bits (83), Expect = 5.9,   Method: Composition-based stats.
 Identities = 15/43 (34%), Positives = 24/43 (55%)

Query: 144 VEEGKVGDLTDVEELLYCWKNLKCPVFIDLVCRFYGELCKDLF 186
           +    VG L +VE LL+ W ++K  +FI L+C     LC+  +
Sbjct: 865 IRRASVGSLLEVESLLFDWVDVKLNLFIQLICFLKNLLCQQFY 907


>gi|164428824|ref|XP_956393.2| hypothetical protein NCU00083 [Neurospora crassa OR74A]
 gi|157072296|gb|EAA27157.2| conserved hypothetical protein [Neurospora crassa OR74A]
          Length = 441

 Score = 36.6 bits (83), Expect = 6.5,   Method: Compositional matrix adjust.
 Identities = 17/48 (35%), Positives = 30/48 (62%), Gaps = 3/48 (6%)

Query: 53  SPSRTRISALLPSFRPPKQWKKMEKLAEEHRSVSENPECDQALFSSPI 100
           SP+R R+SALLP +R    W+   +LA+  RS   + + ++A+ +S +
Sbjct: 42  SPTRRRVSALLPPYRADTHWR---RLADTSRSTPTSTKVNKAIMASAV 86


>gi|336469615|gb|EGO57777.1| hypothetical protein NEUTE1DRAFT_109968 [Neurospora tetrasperma
           FGSC 2508]
          Length = 544

 Score = 36.2 bits (82), Expect = 7.4,   Method: Compositional matrix adjust.
 Identities = 17/48 (35%), Positives = 30/48 (62%), Gaps = 3/48 (6%)

Query: 53  SPSRTRISALLPSFRPPKQWKKMEKLAEEHRSVSENPECDQALFSSPI 100
           SP+R R+SALLP +R    W+   +LA+  RS   + + ++A+ +S +
Sbjct: 145 SPTRRRVSALLPPYRANTHWR---RLADTSRSTPTSTKVNKAIMASAV 189


>gi|350290738|gb|EGZ71952.1| hypothetical protein NEUTE2DRAFT_89679 [Neurospora tetrasperma FGSC
           2509]
          Length = 442

 Score = 36.2 bits (82), Expect = 7.5,   Method: Compositional matrix adjust.
 Identities = 17/48 (35%), Positives = 30/48 (62%), Gaps = 3/48 (6%)

Query: 53  SPSRTRISALLPSFRPPKQWKKMEKLAEEHRSVSENPECDQALFSSPI 100
           SP+R R+SALLP +R    W+   +LA+  RS   + + ++A+ +S +
Sbjct: 43  SPTRRRVSALLPPYRANTHWR---RLADTSRSTPTSTKVNKAIMASAV 87


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.321    0.136    0.415 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,123,682,630
Number of Sequences: 23463169
Number of extensions: 125753752
Number of successful extensions: 324286
Number of sequences better than 100.0: 72
Number of HSP's better than 100.0 without gapping: 29
Number of HSP's successfully gapped in prelim test: 43
Number of HSP's that attempted gapping in prelim test: 324225
Number of HSP's gapped (non-prelim): 76
length of query: 202
length of database: 8,064,228,071
effective HSP length: 135
effective length of query: 67
effective length of database: 9,191,667,552
effective search space: 615841725984
effective search space used: 615841725984
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 73 (32.7 bits)