Query         FBpp0290329 type=protein; loc=3L:join(7019749..7019898,7034972..7035038,7035170..7035315,7035744..7035881,7038005..7038151,7041018..7041131,7043466..7043720,7044926..7045003,7045109..7045355,7046334..7046488,7046586..7046678,7048842..7048964,7049550..7049576,7050565..7050695,7051242..7051421,7051486..7051722,7051977..7052052); ID=FBpp0290329; name=Mp-PF; parent=FBgn0260660,FBtr0301107; dbxref=REFSEQ:NP_001163357,GB_protein:ACZ94628,FlyBase:FBpp0290329,FlyBase_Annotation_IDs:CG42543-PF,FlyMine:FBpp0290329,modMine:FBpp0290329; MD5=0f04423f6d49df9e517757cc4a7d0be7; length=787; release=r6.06; species=Dmel;
Match_columns 787
No_of_seqs    699 out of 2636
Neff          6.1 
Searched_HMMs 16187

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
   1 PF06482 Endostatin:  Collagena 100.0   4E-75 2.4E-79  608.0  13.3  262  487-748     2-285 (286)
   2 PF07588 DUF1554:  Protein of u  99.8 3.3E-19 2.1E-23  167.5   8.2  126  588-723     2-133 (135)
   3 PF01484 Col_cuticle_N:  Nemato  93.8  0.0084 5.2E-07   44.9   0.8   42   14-55      1-42  (50)
   4 PF11267 DUF3067:  Domain of un  36.0     7.8 0.00048   32.8   1.7   14  773-786    42-55  (98)
   5 PF14880 COX14:  Cytochrome oxi  23.6      22  0.0013   26.9   2.2   31    8-38     13-43  (59)
   6 PF00386 C1q:  C1q domain;  Int  23.4      11 0.00066   32.2   0.3   57  469-525    45-108 (127)
   7 PF06609 TRI12:  Fungal trichot  20.6      45  0.0027   36.6   4.9   46    7-56    530-575 (599)
   8 PF07172 GRP:  Glycine rich pro  19.1      40  0.0025   27.9   3.1    7    6-12      3-9   (95)
   9 PF11533 DUF3225:  Protein of u  15.8      31  0.0019   30.3   1.6   16  515-530   102-117 (126)
  10 PF02676 TYW3:  Methyltransfera  15.4      36  0.0022   32.3   2.2   54  731-785    42-98  (214)

No 1
>PF06482 Endostatin: Collagenase NC10 and Endostatin; InterPro: IPR010515 NC10 stands for Non-helical region 10 and is taken from P39059 from SWISSPROT. A mutation in this region in P39060 from SWISSPROT is associated with an increased risk of prostrate cancer. This domain is cleaved from the precursor and forms endostatin. Endostatin is a key tumour suppressor and has been used highly successfully to treat cancer. It is a potent angiogenesis inhibitor []. Endostatin also binds a zinc ion near the N terminus; this is likely to be of structural rather than functional importance according to [].; GO: 0005198 structural molecule activity, 0007155 cell adhesion, 0031012 extracellular matrix; PDB: 1DY2_A 1DY1_A 1DY0_A 1KOE_A 3N3F_B 1BNL_D 3HSH_E 3HON_A. Probab=100.00 E-value=4e-75 Score=608.03 Aligned_cols=262 Identities=48% Similarity=0.868 Sum_probs=181.1 Q ss_pred CceeeehHHHHhhccCCCCCCceEEecCCceEEEEEcCCceeeecCccccCCCCCCCCCC----CCCCcc-cccc----- Q FBpp0290329 487 GAVTFQNIDEMTKKSALNPPGTLAYITEEEALLVRVNKGWQYIALGTLVPIATPAPPTTV----APSMRF-DLQS----- 556 (787) Q Consensus 487 ~~~~~~t~d~m~~~s~~sseGtl~yl~~~~el~vrv~~Gw~~I~lg~~~p~~~~~pp~~~----~p~~~~-~~~~----- 556 (787) ++++|+|+++|++.++...||+|+||++++||||||++|||+|+||+++|+....++.++ .++... .... T Consensus 2 GV~vf~T~~~Ml~~a~~~pEGTLayV~e~~eLYVRVrnGWRkV~LG~~ip~~~~~~~~~va~~~p~P~v~~~~~~~~~~~ 81 (286) T PF06482_consen 2 GVTVFRTYETMLATAHRVPEGTLAYVIEREELYVRVRNGWRKVQLGELIPIPSDTPDNEVASTQPPPVVSSPPQSSPPSS 81 (286) T ss_dssp -EEEESSHHHHHCHGGGS-TTEEEEETTTTEEEEEETTEEEEE-EEEEEE------------------------------ T ss_pred CcEEecCHHHHHhhcccCCCeEEEEEEecceEEEEecCCeeeeccCCcccCCCCcccccccccCCCCccccCcccccccc Confidence 578999999999999999999999999999999999999999999999998775532211 111110 0000 Q ss_pred cCccCC----------C-CCCCCCCCccCCccccceEEEEeCCCCCCCCCccchhhHHHHHHHhhcCCCCceeEEeeccc Q FBpp0290329 557 KNLLNS----------P-PPLLNTPTFTTAPEYETLRVAALNEPSTGDLQGIRGADFACYRQGRRAGLLGTFKAFLSSRV 625 (787) Q Consensus 557 ~~~~~~----------~-~~~~~~p~~~~~~~~~~l~~~a~~~~~~G~lgGi~GAD~~C~~~A~~~g~~gt~rA~Ls~~~ 625 (787) .....+ + .+....+...+...+..|||||||+|++|||+||+|||++||+|||++|+.||||||||+++ T Consensus 82 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~liAlN~P~~G~m~Gi~gAD~~C~~qAr~~gl~gtfRAfLSs~~ 161 (286) T PF06482_consen 82 HPRPPSTAPDPHYPPQPRRPPPPSPSAHTHHDDGPLHLIALNEPLSGNMRGIRGADFQCFRQARAAGLTGTFRAFLSSRL 161 (286) T ss_dssp ----------------------------S--TTS-EEEEE-SS-B-SBSSHHHHHHHHHHHHHHHTT--S-EEESS-BTT T ss_pred cCCcccCCCCccCCCCccccCCCCCccccccCCCceEEEEcCCCCCCCccccccccHHHHHHHHHcCCCCceEEeeeccc Confidence 000000 0 00011123344455667999999999999999999999999999999999999999999999 Q ss_pred cCcccccCCCCC-CCccccCCCcEEeCCccccccCCCCcccCCCceeecCCCccCCCCCCCcceEEeeCCCCcccccCcc Q FBpp0290329 626 QNLDTIVRPADR-DLPVVNTRGDVLFNSWKGIFNGQGGFFSQAPRIYSFSGKNVMTDSTWPMKMVWHGSLPNGERSMDTY 704 (787) Q Consensus 626 ~~~~~~V~~~dr-~~p~vn~~g~vl~~~~~~l~~~~~~~~~~~~~i~~f~~~~~l~d~~~p~~~vW~Gs~~~g~~~~~~~ 704 (787) |||++||++.|| ++||||+||||||+||++||+++++.|..+++||||||+|||+|++||+|+|||||+++|++..++| T Consensus 162 qdL~~iV~~~dr~~~PivNlkgevLf~sw~~lf~g~~~~~~~~~~iySFdGr~v~~d~~wP~K~vWhGs~~~G~r~~~~~ 241 (286) T PF06482_consen 162 QDLYSIVRRADRDNVPIVNLKGEVLFNSWESLFSGSGGPFNPNAPIYSFDGRDVLTDPAWPQKMVWHGSDPRGRRLTDSY 241 (286) T ss_dssp B-GGGGS-GGGTSS--EE-TTS-EEES-HHHHTSSS-SB--TTS--BBTTS-BTTTSTTSSS-EEE--B-TTS-B-TTSB T ss_pred ccHhhhccHhhCCCCCeEeCcCCEeecCHHHHhCCCCCCCCCCCcEEeECCccccCCCCcceEEEEeCCCCCCccCCcCc Confidence 999999999999 8999999999999999999999888999999999999999999999999999999999999999999 Q ss_pred CCccccCCCCceeecccCCccccccccccccccCceEEEEeccc Q FBpp0290329 705 CDAWHSGDHLKGSFASNLDGHKLLEQKRQSCDSKLIILCVEALS 748 (787) Q Consensus 705 C~~W~s~~~~~~G~as~~~~~~l~~~~~~~C~~~~~vlCvE~~~ 748 (787) |++|+|++.+++|+||+|++++||.|+.+||+++||||||||+. T Consensus 242 C~~Wrs~~~~~~G~As~l~~g~ll~q~~~sC~~~~ivLCiE~~~ 285 (286) T PF06482_consen 242 CEAWRSSDPAVTGQASSLQSGKLLDQQPYSCSNSFIVLCIENSF 285 (286) T ss_dssp HHHHB---TTSEEEEEEGGGTBSS--EEEETTS-BB-EEEESS- T ss_pred ccccccCCCCceEeeeecCCCCcccCCcccCCCceEEEEEeccc Confidence 99999999999999999999999999999999999999999974
No 2
>PF07588 DUF1554: Protein of unknown function (DUF1554); InterPro: IPR011448 This is a domain that occurs in 1-2 copies in a family of proteins identified in Leptospira interrogans and other bacteria. The function of the proteins is not known. Probab=99.76 E-value=3.3e-19 Score=167.48 Aligned_cols=126 Identities=20% Similarity=0.310 Sum_probs=98.1 Q ss_pred CCCCCCCCccchhhHHHHHHHhhcC--CCCceeEEeeccccCcccc-cCCCCCCCccccCCCcEEeCCccccccCCCC-c Q FBpp0290329 588 EPSTGDLQGIRGADFACYRQGRRAG--LLGTFKAFLSSRVQNLDTI-VRPADRDLPVVNTRGDVLFNSWKGIFNGQGG-F 663 (787) Q Consensus 588 ~~~~G~lgGi~GAD~~C~~~A~~~g--~~gt~rA~Ls~~~~~~~~~-V~~~dr~~p~vn~~g~vl~~~~~~l~~~~~~-~ 663 (787) ..|+|||+||+|||++|++.+.+.. ..++|||||++.+...+.+ +.++-. ....||||.+|.+|++. ++. + T Consensus 2 ~~~~GnlGGi~GADa~C~~d~~~p~~~~~~~yKAml~~~~~~~R~a~~t~n~~----~g~~DWVl~pnt~Y~r~-dgt~i 76 (135) T PF07588_consen 2 NTYNGNLGGISGADAKCNADANKPSPGGGGTYKAMLVDGSNSTRRACVTANCG----DGQIDWVLKPNTTYYRS-DGTTI 76 (135) T ss_pred ccccCcccchhhHhHHHHcCCCCCCCCCCcCeEEEEEcCccccceeecCCCCC----CCcccceecCCceEEec-CCCEE Confidence 4689999999999999999887764 5679999999976522222 222222 22789999999999998 565 6 Q ss_pred ccCCCc-eeecCCCccCCCCCCC-cceEEeeCCCCcccccCccCCccccCCCCceeecccCC Q FBpp0290329 664 FSQAPR-IYSFSGKNVMTDSTWP-MKMVWHGSLPNGERSMDTYCDAWHSGDHLKGSFASNLD 723 (787) Q Consensus 664 ~~~~~~-i~~f~~~~~l~d~~~p-~~~vW~Gs~~~g~~~~~~~C~~W~s~~~~~~G~as~~~ 723 (787) |+++.. ||+|+ |++++-. .+.||||++.|++... .+|++|+++....+|..+..+ T Consensus 77 ~tTn~~glf~f~----l~~~i~~~~~~~WTGl~~~Wt~~~-~~C~~Wt~~s~~~~G~~G~~n 133 (135) T PF07588_consen 77 FTTNSNGLFDFP----LSNPISGTSGTIWTGLNSDWTTAT-NNCNNWTSGSSGVTGAYGSSN 133 (135) T ss_pred EecCCCceEccc----ccceecCCCccEEEeECCCCeeCC-CcccCCcCCCCcccccccccc Confidence 766766 99997 5555544 7999999999988774 899999999988888776654
No 3
>PF01484 Col_cuticle_N: Nematode cuticle collagen N-terminal domain; InterPro: IPR002486 The function of this domain is unknown. It is found in the N-terminal region of nematode cuticle collagens (see IPR008160 from INTERPRO). Cuticle is a tough elastic structure secreted by hypodermal cells and is primarily composed of collagen proteins [, ].; GO: 0042302 structural constituent of cuticle Probab=93.83 E-value=0.0084 Score=44.95 Aligned_cols=42 Identities=14% Similarity=0.025 Sum_probs=37.9 Q ss_pred HHHHHHHHHHHHHhhhccccccccccchhhhHHHHHhhcccc Q FBpp0290329 14 TALILFFLLGIVLVTGSTKGWFNPNRYNGERVAARIQATDIF 55 (787) Q Consensus 14 ~~~~~~sivail~~~~~~p~~~~~~~~~~e~~~a~i~~~di~ 55 (787) |.++++|++++++|++++|.+++..+..++.+..+.+.|+.. T Consensus 1 y~a~~~s~~~i~~~l~~~~~i~~~i~~~~~e~~~e~~~fk~~ 42 (50) T PF01484_consen 1 YVAIAFSTLSIISCLFTIPMIYNDIQEFQEELEDEMDEFKES 42 (50) T ss_pred ChhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 567889999999999999999999999999999999999653
No 4
>PF11267 DUF3067: Domain of unknown function (DUF3067); InterPro: IPR021420 This family of proteins has no known function. ; PDB: 2LJW_A. Probab=35.97 E-value=7.8 Score=32.80 Aligned_cols=14 Identities=36% Similarity=0.650 Sum_probs=11.4 Q ss_pred cChHHHHHHHHhhh Q FBpp0290329 773 KTADEYAAHLENLL 786 (787) Q Consensus 773 ~~~~~~~~~~~~~~ 786 (787) +||+||.+||+.+. T Consensus 42 ltE~eY~~hL~~ia 55 (98) T PF11267_consen 42 LTEEEYLEHLDAIA 55 (98) T ss_dssp S-HHHHHHHHHHHH T ss_pred CCHHHHHHHHHHHH Confidence 69999999999863
No 5
>PF14880 COX14: Cytochrome oxidase c assembly; InterPro: IPR029208 COX14 plays an essential role in cytochrome oxidase assembly. The COX14 product is a low-molecular weight membrane protein of mitochondria, but it is not a subunit of cytochrome oxidase []. Orthology-prediction methods have identified the vertebrate C12orf62 orthologues to be orthologues of the yeast COX14 []. Probab=23.61 E-value=22 Score=26.88 Aligned_cols=31 Identities=19% Similarity=0.157 Sum_probs=23.5 Q ss_pred hhHHHHHHHHHHHHHHHHHhhhccccccccc Q FBpp0290329 8 RAKLVITALILFFLLGIVLVTGSTKGWFNPN 38 (787) Q Consensus 8 ~~~lv~~~~~~~sivail~~~~~~p~~~~~~ 38 (787) .-+++++++++|++++.+++...+=.++..| T Consensus 13 ~HR~tv~~Lig~T~~~g~~~~~~~~~~~~~~ 43 (59) T PF14880_consen 13 AHRGTVLGLIGFTVYGGGLTVYTVYDYMRYN 43 (59) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 4578899999999999999888864344433
No 6
>PF00386 C1q: C1q domain; InterPro: IPR001073 This entry represents the C-terminal domain of C1q. C1q is a subunit of the C1 enzyme complex that activates the serum complement system. C1q comprises 6 A, 6 B and 6 C chains. These share the same topology, each possessing a small, globular N-terminal domain, a collagen-like Gly/Pro-rich central region, and a conserved C-terminal region, the C1q domain []. The C1q protein is produced in collagen-producing cells and shows sequence and structural similarity to collagens VIII and X [, ]. This domain is also found in multimerin and EMILIN proteins.; PDB: 1O91_C 2JG8_D 2JG9_A 2WNV_A 2WNU_A 1PK6_A 4DOU_A 4OUS_A 1C3H_C 1C28_C .... Probab=23.38 E-value=11 Score=32.25 Aligned_cols=57 Identities=18% Similarity=0.271 Sum_probs=35.7 Q ss_pred CCCccccC-----CCCCceeeecC-ceeeehHHHHhhcc-CCCCCCceEEecCCceEEEEEcCC Q FBpp0290329 469 EEPYFSAS-----SSNMNMKIVPG-AVTFQNIDEMTKKS-ALNPPGTLAYITEEEALLVRVNKG 525 (787) Q Consensus 469 ~~P~~s~s-----~~~~~v~iv~~-~~~~~t~d~m~~~s-~~sseGtl~yl~~~~el~vrv~~G 525 (787) +.|+|+.. .....+.|+++ ..+........... +..+..++++|..+|+||||+..+ T Consensus 45 G~Y~f~~~~~~~~~~~~~v~l~~n~~~~~~~~~~~~~~~~~~~s~~~~l~L~~GD~v~v~~~~~ 108 (127) T PF00386_consen 45 GVYFFSFTVCTSGSGNLRVELRKNGNSVLSVYSSSTSNSSDTLSRSAVLELNAGDTVWVRLNQG 108 (127) T ss_dssp EEEEEEEEEESEECCEEEEEEEETTEEEEEEEECSBTTBEEEEEEEEEEEE-TT-EEEEEEEEC T ss_pred CeEEEEEEEEecCCCceEEEEEecCcceeeeeecCCCCceeEEeeEEEEEeCCCCEEEEEEEeC Confidence 56776642 34466777777 33333333333333 466778899999999999999954
No 7
>PF06609 TRI12: Fungal trichothecene efflux pump (TRI12); InterPro: IPR010573 This family consists of several fungal specific trichothecene efflux pump proteins. Many of the genes involved in trichothecene toxin biosynthesis in Fusarium sporotrichioides are present within a gene cluster. It has been suggested that TRI12 may play a role in F. sporotrichioides self-protection against trichothecenes []. Probab=20.58 E-value=45 Score=36.58 Aligned_cols=46 Identities=22% Similarity=0.234 Sum_probs=36.8 Q ss_pred hhhHHHHHHHHHHHHHHHHHhhhccccccccccchhhhHHHHHhhccccc Q FBpp0290329 7 TRAKLVITALILFFLLGIVLVTGSTKGWFNPNRYNGERVAARIQATDIFD 56 (787) Q Consensus 7 ~~~~lv~~~~~~~sivail~~~~~~p~~~~~~~~~~e~~~a~i~~~di~~ 56 (787) .+++.|-|++++|.+++++++|++- +..++..++++.+++..+.-+ T Consensus 530 ~afr~V~~~siaFg~va~i~a~f~~----d~~~~mt~~Va~~l~~~~~~~ 575 (599) T PF06609_consen 530 DAFRYVYLASIAFGVVAIIAAFFLK----DIDKYMTNHVAVRLEDRKEAD 575 (599) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHcC----CcHHhhhhhhhHhHhcccccc Confidence 4568888999999999999999864 677788888888887765443
No 8
>PF07172 GRP: Glycine rich protein family; InterPro: IPR010800 This family consists of glycine rich proteins. Some of them may be involved in resistance to environmental stress []. Probab=19.08 E-value=40 Score=27.89 Aligned_cols=7 Identities=29% Similarity=0.150 Sum_probs=3.0 Q ss_pred ehhhHHH Q FBpp0290329 6 STRAKLV 12 (787) Q Consensus 6 s~~~~lv 12 (787) ||.+.|+ T Consensus 3 SK~~llL 9 (95) T PF07172_consen 3 SKAFLLL 9 (95) T ss_pred hhHHHHH Confidence 4443333
No 9
>PF11533 DUF3225: Protein of unknown function (DUF3225); InterPro: IPR024507 This family of proteins has no known function.; PDB: 2RCD_B 2OWP_A. Probab=15.82 E-value=31 Score=30.27 Aligned_cols=16 Identities=31% Similarity=0.713 Sum_probs=0.0 Q ss_pred CceEEEEEcCCceeee Q FBpp0290329 515 EEALLVRVNKGWQYIA 530 (787) Q Consensus 515 ~~el~vrv~~Gw~~I~ 530 (787) +.|+|+|+..|||=|+ T Consensus 102 QsQtW~R~~~GWkVva 117 (126) T PF11533_consen 102 QSQTWVRFPDGWKVVA 117 (126) T ss_dssp EEEEEEEETTEEEEEE T ss_pred EEEEEEecCCCEEEEE
No 10
>PF02676 TYW3: Methyltransferase TYW3; InterPro: IPR003827 The methyltransferase TYW3 (tRNA-yW- synthesising protein 3) has been identified in yeast to be involved in wybutosine (yW) biosynthesis []. yW is a complexly modified guanosine residue that contains a tricyclic base and is found at the 3'-position adjacent the anticodon of phenylalanine tRNA. TYW3 is an N-4 methylase that methylates yW-86 to yield yW-72 in an Ado-Met-dependent manner [].; PDB: 1TLJ_A 2IT3_B 2IT2_A 2DRV_A 2DVK_A 2QG3_B. Probab=15.37 E-value=36 Score=32.33 Aligned_cols=54 Identities=20% Similarity=0.314 Sum_probs=0.0 Q ss_pred cccccccCceEEEEecccHHHHhhhcccCCCCCCCCC---chhhccChHHHHHHHHhh Q FBpp0290329 731 KRQSCDSKLIILCVEALSQDRKRKKREIGDGSSHGES---ESREFKTADEYAAHLENL 785 (787) Q Consensus 731 ~~~~C~~~~~vlCvE~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~ 785 (787) +++|||.+.+|+| |......+........+...+-. .+|+-++++|..+-|.+. T Consensus 42 TTSSCSGRI~vf~-eg~~~~~~~~~~~~~~~~k~gg~wL~vsH~~v~~~e~~~~l~~~ 98 (214) T PF02676_consen 42 TTSSCSGRISVFL-EGSKSDSASESDAKSGGKKKGGKWLFVSHDPVTEDEVLEALRKA 98 (214) T ss_dssp EEEEES-EEEEE-------------EESSTTSCCC-EEEEEESS---HHHHHHHHTT- T ss_pred EecccccceEEEe-ecccccccccccCCCCCCCCCCEEEEEECCCCCHHHHHHHHhcc