BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 000864
         (1247 letters)

Database: swissprot 
           539,616 sequences; 191,569,459 total letters

Searching..................................................done



>sp|Q8K368|FANCI_MOUSE Fanconi anemia group I protein homolog OS=Mus musculus GN=Fanci
           PE=1 SV=2
          Length = 1330

 Score =  194 bits (492), Expect = 5e-48,   Method: Compositional matrix adjust.
 Identities = 153/537 (28%), Positives = 264/537 (49%), Gaps = 63/537 (11%)

Query: 29  GGDYVNLVFDRLLDCDWSKGLLLKLVSIVRELGFFDKRRCSEFLMKVFEGMKIVDLQDLP 88
           G +Y   + D L    W +  +++L S+ +++        +  + KV      ++LQ++P
Sbjct: 153 GEEYKRQLIDTLCSVRWPQRYMIQLTSVFKDVCL-TPEEMNLVVAKVLTMFSKLNLQEIP 211

Query: 89  TLVYQLLVLASRGFNKREVIEGIVVFFG---------------SKMGSKVTSIVRQVEGT 133
            LVYQLLVL+S+G ++R V++GI+ FF                S++ +     +  VEGT
Sbjct: 212 PLVYQLLVLSSKG-SRRSVLDGIIAFFRELDKQHREEQSSDELSELITAPADELYHVEGT 270

Query: 134 VLLHVNFAVKQDPSLGQEVMGLVKLDL-----RAFNHFAVAVLLSVARIRKFSESAMGIL 188
           V+LH+ FA+K D  LG+E++  +K        +    F++A+LLS+ RI++F E    +L
Sbjct: 271 VILHIVFAIKLDCELGRELLKHLKAGQQGDPSKCLCPFSIALLLSLTRIQRFEEQVFDLL 330

Query: 189 KTALLTAYRDYKFSKDCMWLPNDSKDEFLQSV----KVVEKAVLRSVNESNYGREHVVPS 244
           KT+++ +++D       + L   SK  FLQ++      V   +L  V  S +  +HV   
Sbjct: 331 KTSVVKSFKD-------LQLLQGSK--FLQTLVPQRTCVSTMILEVVRNSVHSWDHVTQG 381

Query: 245 IVQFGFILLES-----------VEEG-SLKEHCQSNGLLGIEKLGIQILKTLFEVHDMAR 292
           +++FGFIL++S           VE G SL +    +      KLG  IL   F++H+M R
Sbjct: 382 LIEFGFILMDSYGPKKILDGKAVEIGTSLSKMTNQHAC----KLGANILLETFKIHEMIR 437

Query: 293 SEIVEQCKFRVLSLKPEQSMSVIRLIGNLVQNYPYPMLEHVSRLKELLDYFTFMHGKVAT 352
            EI+EQ   RV++         + L  +++   P  +L++ S++ E  DY TF+  +   
Sbjct: 438 QEILEQVLNRVVTRTSSPINHFLDLFSDIIMYAPL-ILQNCSKVTETFDYLTFLPLQTVQ 496

Query: 353 YLVTALLPLINISRDLQDYTILVVRKAMFRREDAVRHAATNAIIDLIMAEKQSKTDGSLS 412
            L+ A+ PL+ IS  ++D  ILV+RKAMF  +   R +A    + L+   K  K  GSL 
Sbjct: 497 GLLKAVQPLLKISMSMRDSLILVLRKAMFASQLDARKSAVAGFLLLL---KNFKVLGSLP 553

Query: 413 LQDSSSQASCSQQEIP-RSTHGGLFQELSAL-----LQRCLYQQAKVKEVMYQGLVKLVL 466
               +     +Q  +   S +  +  E   L     L+R L QQA ++ ++Y G   ++ 
Sbjct: 554 SSQCTQSIGVTQVRVDVHSRYSAVANETFCLEIIDSLKRSLGQQADIRLMLYDGFYDVLR 613

Query: 467 VDPSAGGPIFDFLLPHFLRFFREDSDV--QLGVNLCIKVENGRAFIDEPLDCLLYCV 521
            +      I   L     +F+  + D+   L +  C+  +  + F+ EPLD LL C+
Sbjct: 614 RNSQLASSIMQTLFSQLKQFYEPEPDLLPPLKLGACVLTQGSQIFLQEPLDHLLSCI 670



 Score = 97.4 bits (241), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 94/325 (28%), Positives = 154/325 (47%), Gaps = 40/325 (12%)

Query: 908  TWAEGFCKSNSITNANVVKSVVKLAICL----SSPPNDLFVAQSLAQELLKVTGSGSSD- 962
            +W    CK  S  +A+  KS++ L   L     SP   L   + L+Q++    G    D 
Sbjct: 1004 SWTSKICKEYSQEDASFCKSLMNLFFSLHVLYKSPVTLL---RDLSQDIHGQLGDIDQDV 1060

Query: 963  SLEVSELYPLINHSTST-TILSCILQLTEEVVSDMDWAIKKVKTFSLVTQKGIHVNHDED 1021
             +E ++ + ++N  T+  T+   +L   E+V+ ++DW I K+K  +   Q+ +      +
Sbjct: 1061 EIEKTDHFAVVNLRTAAPTVCLLVLSQAEKVLEEVDWLIAKIKGSA--NQETLSDKVTPE 1118

Query: 1022 HVSG------LGFEENLYSRTEALVKVLSSFVLMSLKDPQ-AEHFLRLATRLYKHLAQMS 1074
              S       L  E+ +  +   LV      V  +L      +  L+  +++Y  L    
Sbjct: 1119 DASSQAVPPTLLIEKAIVMQLGTLVTFFHELVQTALPSGSCVDTLLKGLSKIYSTLTAFV 1178

Query: 1075 KLRIAPKGCKQLLPSLEFQRLVELTCKQLTVPLYNFVAEMQR--------GQLENASSKG 1126
            K  +      + +P+   ++LV+L+   LT   Y+F++ +Q          + E A+   
Sbjct: 1179 KYYLQVCQSSRGIPN-TVEKLVKLSGSHLTPVCYSFISYVQNKSSDAPKCSEKEKAAVST 1237

Query: 1127 ILNKIKRENKCIPDLIFQIEDYEKYLIQLSKVSKINLLRHAKRSTSRDFKI----LDTV- 1181
             + K+ RE K IP+L+F IE YEK+LIQLSK SK+NL++H K STSRDFKI    LD V 
Sbjct: 1238 TMAKVLRETKPIPNLVFAIEQYEKFLIQLSKKSKVNLMQHMKLSTSRDFKIKGSVLDMVL 1297

Query: 1182 --------GEGDASNNEANDNEAAE 1198
                     EG AS +   D E A+
Sbjct: 1298 REDEEDENEEGTASAHTQQDREPAK 1322


>sp|Q9NVI1|FANCI_HUMAN Fanconi anemia group I protein OS=Homo sapiens GN=FANCI PE=1 SV=4
          Length = 1328

 Score =  189 bits (479), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 150/514 (29%), Positives = 262/514 (50%), Gaps = 48/514 (9%)

Query: 45  WSKGLLLKLVSIVRELGFFDKRRCSEFLM-KVFEGMKIVDLQDLPTLVYQLLVLASRGFN 103
           W +  +++L S+ +++    +    EF++ K       ++LQ++P LVYQLLVL+S+G +
Sbjct: 169 WDQQYVIQLTSMFKDVPLTAEE--VEFVVEKALSMFSKMNLQEIPPLVYQLLVLSSKG-S 225

Query: 104 KREVIEGIVVFFGS--------KMGSKVTSIV-------RQVEGTVLLHVNFAVKQDPSL 148
           ++ V+EGI+ FF +        + G ++  +V       R VEGT++LH+ FA+K D  L
Sbjct: 226 RKSVLEGIIAFFSALDKQHNEEQSGDELLDVVTVPSGELRHVEGTIILHIVFAIKLDYEL 285

Query: 149 GQEVMGLVKLDLRA-----FNHFAVAVLLSVARIRKFSESAMGILKTALLTAYRDYKFSK 203
           G+E++  +K+  +       + F++A+LLSV RI++F +  + +LKT+++ +++D +  +
Sbjct: 286 GRELVKHLKVGQQGDSNNNLSPFSIALLLSVTRIQRFQDQVLDLLKTSVVKSFKDLQLLQ 345

Query: 204 DCMWLPNDSKDEFLQSVKVVEKAVLRSVNESNYGREHVVPSIVQFGFILLES-----VEE 258
              +L N      +     V   +L  V  S +  +HV   +V+ GFIL++S     V +
Sbjct: 346 GSKFLQN-----LVPHRSYVSTMILEVVKNSVHSWDHVTQGLVELGFILMDSYGPKKVLD 400

Query: 259 GSLKEHCQSNGLLGIE---KLGIQILKTLFEVHDMARSEIVEQCKFRVLSLKPEQSMSVI 315
           G   E   S   +  +   KLG  IL   F++H+M R EI+EQ   RV++         +
Sbjct: 401 GKTIETSPSLSRMPNQHACKLGANILLETFKIHEMIRQEILEQVLNRVVTRASSPISHFL 460

Query: 316 RLIGNLVQNYPYPMLEHVSRLKELLDYFTFMHGKVATYLVTALLPLINISRDLQDYTILV 375
            L+ N+V   P  +    S++ E  DY +F+  +    L+ A+ PL+ +S  ++D  ILV
Sbjct: 461 DLLSNIVMYAPLVLQSCSSKVTEAFDYLSFLPLQTVQRLLKAVQPLLKVSMSMRDCLILV 520

Query: 376 VRKAMFRREDAVRHAATNAIIDLIMAEKQSKTDGSLSLQDSSSQASCSQQEIPRSTHGG- 434
           +RKAMF  +   R +A    + L+   K  K  GSLS    S   S SQ  +   +H   
Sbjct: 521 LRKAMFANQLDARKSAVAGFLLLL---KNFKVLGSLSSSQCSQSLSVSQVHVDVHSHYNS 577

Query: 435 -----LFQELSALLQRCLYQQAKVKEVMYQGLVKLVLVDPSAGGPIFDFLLPHFLRFFRE 489
                   E+   L+RCL QQA V+ ++Y+G   ++  +      +   LL    +F+  
Sbjct: 578 VANETFCLEIMDSLRRCLSQQADVRLMLYEGFYDVLRRNSQLANSVMQTLLSQLKQFYEP 637

Query: 490 DSDV--QLGVNLCIKVENGRAFIDEPLDCLLYCV 521
             D+   L +  CI  +  +  + EPLD LL C+
Sbjct: 638 KPDLLPPLKLEACILTQGDKISLQEPLDYLLCCI 671



 Score = 88.6 bits (218), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 78/285 (27%), Positives = 139/285 (48%), Gaps = 18/285 (6%)

Query: 908  TWAEGFCKSNSITNANVVKSVVKLAICLS-SPPNDLFVAQSLAQELLKVTGSGSSD-SLE 965
            +W    CK NS  +A   KS++ L   L  S  + + + + L+Q++    G    D  +E
Sbjct: 1008 SWTSKICKENSREDALFCKSLMNLLFSLHVSYKSPVILLRDLSQDIHGHLGDIDQDVEVE 1067

Query: 966  VSELYPLINHSTST-TILSCILQLTEEVVSDMDWAIKKVKTFSLVTQKGIHVN-HDEDHV 1023
             +  + ++N  T+  T+   +L   E+V+ ++DW I K+K    V+Q+ +      +  +
Sbjct: 1068 KTNHFAIVNLRTAAPTVCLLVLSQAEKVLEEVDWLITKLK--GQVSQETLSEEASSQATL 1125

Query: 1024 SGLGFEENLYSRTEALVKVLSSFVLMSLKDPQ-AEHFLRLATRLYKHLAQMSKLRIAPKG 1082
                 E+ +  +   L+      V  +L      +  L+   ++Y  L  + +  +    
Sbjct: 1126 PNQPVEKAIIMQLGTLLTFFHELVQTALPSGSCVDTLLKDLCKMYTTLTALVRYYLQVCQ 1185

Query: 1083 CKQLLPSLEFQRLVELTCKQLTVPLYNFVAEMQ----------RGQLENASSKGILNKIK 1132
                +P    ++LV+L+   LT   Y+F++ +Q            + + A+    + ++ 
Sbjct: 1186 SSGGIPK-NMEKLVKLSGSHLTPLCYSFISYVQNKSKSLNYTGEKKEKPAAVATAMARVL 1244

Query: 1133 RENKCIPDLIFQIEDYEKYLIQLSKVSKINLLRHAKRSTSRDFKI 1177
            RE K IP+LIF IE YEK+LI LSK SK+NL++H K STSRDFKI
Sbjct: 1245 RETKPIPNLIFAIEQYEKFLIHLSKKSKVNLMQHMKLSTSRDFKI 1289


>sp|P30002|TEGU_HHV6G Large tegument protein OS=Human herpesvirus 6A (strain GS) GN=U31
           PE=3 SV=1
          Length = 2077

 Score = 38.1 bits (87), Expect = 0.52,   Method: Compositional matrix adjust.
 Identities = 33/98 (33%), Positives = 46/98 (46%), Gaps = 7/98 (7%)

Query: 385 DAVRHAATNAIIDLIMAEKQS-KTDGSLSL--QDSSSQASCSQQEIPRSTHG--GLFQ-E 438
           + + ++     ID IM E     T G+L+L   +S SQ      EIP+  H   G+ Q E
Sbjct: 37  NGINNSLNAGTIDAIMEEGYHLDTAGTLALMLNNSDSQDYRLPTEIPKRIHSRYGVTQHE 96

Query: 439 LSALLQRCLYQQAKVKEVMYQGLVKLVLVDPSAGGPIF 476
           LS      L  Q K+   +Y GL+ L+L   S  GP F
Sbjct: 97  LSRPFNGTLDTQ-KIDNEVYLGLIDLILYGKSKNGPTF 133


  Database: swissprot
    Posted date:  Mar 23, 2013  2:32 AM
  Number of letters in database: 191,569,459
  Number of sequences in database:  539,616
  
Lambda     K      H
   0.319    0.135    0.379 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 419,024,180
Number of Sequences: 539616
Number of extensions: 16929091
Number of successful extensions: 53312
Number of sequences better than 100.0: 23
Number of HSP's better than 100.0 without gapping: 5
Number of HSP's successfully gapped in prelim test: 18
Number of HSP's that attempted gapping in prelim test: 53232
Number of HSP's gapped (non-prelim): 85
length of query: 1247
length of database: 191,569,459
effective HSP length: 129
effective length of query: 1118
effective length of database: 121,958,995
effective search space: 136350156410
effective search space used: 136350156410
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 67 (30.4 bits)