Query psy4958
Match_columns 65
No_of_seqs 69 out of 71
Neff 4.3
Searched_HMMs 46136
Date Fri Aug 16 22:14:24 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy4958.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/4958hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF00041 fn3: Fibronectin type 95.9 0.0065 1.4E-07 33.8 2.0 24 2-25 61-84 (85)
2 KOG4221|consensus 95.1 0.064 1.4E-06 47.0 6.1 58 2-63 579-643 (1381)
3 PF00041 fn3: Fibronectin type 94.8 0.026 5.7E-07 31.3 2.2 23 36-60 2-24 (85)
4 smart00060 FN3 Fibronectin typ 94.3 0.062 1.3E-06 27.7 2.8 27 35-63 2-28 (83)
5 KOG3513|consensus 94.2 0.12 2.5E-06 44.5 5.5 55 4-61 586-640 (1051)
6 KOG1225|consensus 93.6 0.045 9.8E-07 43.7 2.0 60 2-63 423-482 (525)
7 cd00063 FN3 Fibronectin type 3 93.2 0.1 2.3E-06 27.7 2.5 26 34-61 1-26 (93)
8 cd00063 FN3 Fibronectin type 3 91.7 0.25 5.3E-06 26.2 2.8 22 3-24 63-84 (93)
9 KOG0196|consensus 90.8 0.21 4.6E-06 42.6 2.7 36 23-61 321-356 (996)
10 KOG4221|consensus 90.7 0.57 1.2E-05 41.4 5.3 57 2-61 679-744 (1381)
11 PF14054 DUF4249: Domain of un 90.4 0.77 1.7E-05 31.9 4.9 58 3-65 95-160 (298)
12 KOG0196|consensus 86.6 1.5 3.2E-05 37.7 5.0 58 2-62 403-469 (996)
13 PF02014 Reeler: Reeler domain 86.2 5.6 0.00012 25.2 6.5 57 4-62 30-112 (132)
14 cd08544 Reeler Reeler, the N-t 85.3 6.1 0.00013 25.0 6.4 58 4-62 30-108 (135)
15 PF10179 DUF2369: Uncharacteri 84.0 2.9 6.4E-05 31.4 5.1 60 3-62 134-202 (300)
16 KOG4802|consensus 83.6 2.8 6E-05 33.8 5.0 59 3-63 215-283 (516)
17 PF09294 Interfer-bind: Interf 78.4 2.4 5.1E-05 25.2 2.4 15 2-16 72-86 (106)
18 KOG3513|consensus 75.3 6.3 0.00014 34.3 4.9 58 1-60 678-741 (1051)
19 PF13205 Big_5: Bacterial Ig-l 66.0 7.3 0.00016 22.9 2.5 12 3-14 71-82 (107)
20 PF09423 PhoD: PhoD-like phosp 64.6 8.6 0.00019 28.9 3.2 39 2-42 69-108 (453)
21 PF01108 Tissue_fac: Tissue fa 62.6 12 0.00027 22.7 3.2 28 33-63 21-48 (107)
22 PF14250 AbrB-like: AbrB-like 59.8 5.5 0.00012 24.6 1.2 14 1-14 50-63 (71)
23 PF02018 CBM_4_9: Carbohydrate 55.6 15 0.00033 21.6 2.6 20 2-21 56-75 (131)
24 KOG3632|consensus 53.5 19 0.00041 32.1 3.7 61 2-63 723-797 (1335)
25 KOG4258|consensus 45.3 33 0.00071 30.1 3.9 57 2-60 571-635 (1025)
26 KOG4222|consensus 43.2 33 0.00072 30.7 3.7 58 3-62 707-775 (1281)
27 PF07691 PA14: PA14 domain; I 40.9 78 0.0017 19.2 4.2 30 30-62 109-138 (145)
28 PF10179 DUF2369: Uncharacteri 40.9 28 0.0006 26.3 2.6 18 1-18 264-281 (300)
29 smart00758 PA14 domain in bact 38.3 94 0.002 19.0 4.5 32 29-64 100-131 (136)
30 KOG4152|consensus 37.8 44 0.00095 28.3 3.4 52 4-59 654-711 (830)
31 PF03684 UPF0179: Uncharacteri 36.6 21 0.00046 24.3 1.3 17 3-21 37-53 (142)
32 PF07532 Big_4: Bacterial Ig-l 36.3 77 0.0017 17.4 3.4 14 45-58 25-38 (59)
33 cd03869 M14_CPX_like Peptidase 36.1 24 0.00052 27.4 1.6 13 3-16 368-380 (405)
34 COG1860 Uncharacterized protei 35.4 23 0.0005 24.5 1.3 12 2-13 38-49 (147)
35 PF03442 CBM_X2: Carbohydrate 35.3 20 0.00044 21.8 0.9 11 3-13 39-49 (84)
36 PRK01177 hypothetical protein; 30.1 31 0.00067 23.5 1.2 17 3-21 39-55 (140)
37 PF07495 Y_Y_Y: Y_Y_Y domain; 29.4 52 0.0011 17.7 1.9 17 4-21 36-52 (66)
38 TIGR02148 Fibro_Slime fibro-sl 28.6 61 0.0013 20.6 2.3 17 2-18 58-75 (90)
39 COG4026 Uncharacterized protei 27.2 40 0.00087 25.4 1.5 16 3-18 28-43 (290)
40 PF05935 Arylsulfotrans: Aryls 27.2 71 0.0015 24.6 2.9 35 1-36 56-91 (477)
41 KOG4258|consensus 27.1 29 0.00063 30.4 0.8 58 3-61 788-854 (1025)
42 PF15404 PH_4: Pleckstrin homo 24.9 89 0.0019 22.0 2.8 36 2-37 20-57 (185)
43 cd03863 M14_CPD_II The second 24.6 54 0.0012 25.0 1.8 15 3-18 337-351 (375)
44 cd00604 IPT_CGTD IPT domain (d 24.4 1.2E+02 0.0027 18.1 3.1 22 4-27 55-76 (81)
45 PF07198 DUF1410: Protein of u 21.6 61 0.0013 18.2 1.3 17 3-19 44-60 (66)
46 PF15206 FAM209: FAM209 family 21.3 81 0.0018 21.9 2.0 19 5-23 131-150 (150)
47 PF07867 DUF1654: Protein of u 21.2 1.4E+02 0.003 18.3 2.8 14 49-62 60-73 (73)
48 COG5294 Uncharacterized protei 21.0 84 0.0018 20.9 2.0 41 4-44 50-90 (113)
49 KOG4806|consensus 20.4 2.6E+02 0.0057 22.5 4.9 59 3-61 290-354 (454)
No 1
>PF00041 fn3: Fibronectin type III domain; InterPro: IPR003961 Fibronectins are multi-domain glycoproteins found in a soluble form in plasma, and in an insoluble form in loose connective tissue and basement membranes []. They contain multiple copies of 3 repeat regions (types I, II and III), which bind to a variety of substances including heparin, collagen, DNA, actin, fibrin and fibronectin receptors on cell surfaces. The wide variety of these substances means that fibronectins are involved in a number of important functions: e.g., wound healing; cell adhesion; blood coagulation; cell differentiation and migration; maintenance of the cellular cytoskeleton; and tumour metastasis []. The role of fibronectin in cell differentiation is demonstrated by the marked reduction in the expression of its gene when neoplastic transformation occurs. Cell attachment has been found to be mediated by the binding of the tetrapeptide RGDS to integrins on the cell surface [], although related sequences can also display cell adhesion activity. Plasma fibronectin occurs as a dimer of 2 different subunits, linked together by 2 disulphide bonds near the C terminus. The difference in the 2 chains occurs in the type III repeat region and is caused by alternative splicing of the mRNA from one gene []. The observation that, in a given protein, an individual repeat of one of the 3 types (e.g., the first FnIII repeat) shows much less similarity to its subsequent tandem repeats within that protein than to its equivalent repeat between fibronectins from other species, has suggested that the repeating structure of fibronectin arose at an early stage of evolution. It also seems to suggest that the structure is subject to high selective pressure []. The fibronectin type III repeat region is an approximately 100 amino acid domain, different tandem repeats of which contain binding sites for DNA, heparin and the cell surface []. The superfamily of sequences believed to contain FnIII repeats represents 45 different families, the majority of which are involved in cell surface binding in some manner, or are receptor protein tyrosine kinases, or cytokine receptors.; GO: 0005515 protein binding; PDB: 1UEM_A 1TDQ_A 1X5I_A 2IC2_B 2IBG_C 2IBB_A 3R8Q_A 2FNB_A 1FNH_A 2EDB_A ....
Probab=95.88 E-value=0.0065 Score=33.81 Aligned_cols=24 Identities=25% Similarity=0.444 Sum_probs=20.0
Q ss_pred eeccCCCceEEEEEEecCCccCcc
Q psy4958 2 LIIGTGRAYNISVQTVSEDEISTP 25 (65)
Q Consensus 2 ~~L~PGR~Y~ItV~tvs~~e~S~p 25 (65)
.+|.||+.|.|.|+++.+...|.+
T Consensus 61 ~~L~p~t~Y~~~v~a~~~~g~g~~ 84 (85)
T PF00041_consen 61 TGLQPGTTYEFRVRAVNSDGEGPP 84 (85)
T ss_dssp ESCCTTSEEEEEEEEEETTEEEEE
T ss_pred ccCCCCCEEEEEEEEEeCCcCcCC
Confidence 479999999999999997765543
No 2
>KOG4221|consensus
Probab=95.09 E-value=0.064 Score=47.00 Aligned_cols=58 Identities=26% Similarity=0.395 Sum_probs=41.1
Q ss_pred eeccCCCceEEEEEEecCC---ccCcceeeeeec---CCCC-ceeeEEecCccccceEEEEeeCCCCCC
Q psy4958 2 LIIGTGRAYNISVQTVSED---EISTPTTAQYRT---IPLR-PLSFTYDKASITSNSLRVVWEPPKGFS 63 (65)
Q Consensus 2 ~~L~PGR~Y~ItV~tvs~~---e~S~p~~~~~RT---~Pl~-p~~L~f~~~~vt~~S~rV~W~pp~G~~ 63 (65)
-+|.|-++|.|.|-+-... ..|+. +.-|| +|.+ |.||+. .-++.+++||+|++|....
T Consensus 579 ~gL~k~TeY~~~vvA~N~~G~g~sS~~--i~V~Tlsd~PsaPP~Nl~l--ev~sStsVrVsW~pP~~~t 643 (1381)
T KOG4221|consen 579 NGLEKYTEYSIRVVAYNSAGSGVSSAD--ITVRTLSDVPSAPPQNLSL--EVVSSTSVRVSWLPPPSET 643 (1381)
T ss_pred ecCCCccceEEEEEEecCCCCCCCCCc--eEEEeccCCCCCCCcceEE--EecCCCeEEEEccCCCccc
Confidence 3789999999998765432 33333 23344 5666 677987 5577999999999998654
No 3
>PF00041 fn3: Fibronectin type III domain; InterPro: IPR003961 Fibronectins are multi-domain glycoproteins found in a soluble form in plasma, and in an insoluble form in loose connective tissue and basement membranes []. They contain multiple copies of 3 repeat regions (types I, II and III), which bind to a variety of substances including heparin, collagen, DNA, actin, fibrin and fibronectin receptors on cell surfaces. The wide variety of these substances means that fibronectins are involved in a number of important functions: e.g., wound healing; cell adhesion; blood coagulation; cell differentiation and migration; maintenance of the cellular cytoskeleton; and tumour metastasis []. The role of fibronectin in cell differentiation is demonstrated by the marked reduction in the expression of its gene when neoplastic transformation occurs. Cell attachment has been found to be mediated by the binding of the tetrapeptide RGDS to integrins on the cell surface [], although related sequences can also display cell adhesion activity. Plasma fibronectin occurs as a dimer of 2 different subunits, linked together by 2 disulphide bonds near the C terminus. The difference in the 2 chains occurs in the type III repeat region and is caused by alternative splicing of the mRNA from one gene []. The observation that, in a given protein, an individual repeat of one of the 3 types (e.g., the first FnIII repeat) shows much less similarity to its subsequent tandem repeats within that protein than to its equivalent repeat between fibronectins from other species, has suggested that the repeating structure of fibronectin arose at an early stage of evolution. It also seems to suggest that the structure is subject to high selective pressure []. The fibronectin type III repeat region is an approximately 100 amino acid domain, different tandem repeats of which contain binding sites for DNA, heparin and the cell surface []. The superfamily of sequences believed to contain FnIII repeats represents 45 different families, the majority of which are involved in cell surface binding in some manner, or are receptor protein tyrosine kinases, or cytokine receptors.; GO: 0005515 protein binding; PDB: 1UEM_A 1TDQ_A 1X5I_A 2IC2_B 2IBG_C 2IBB_A 3R8Q_A 2FNB_A 1FNH_A 2EDB_A ....
Probab=94.83 E-value=0.026 Score=31.29 Aligned_cols=23 Identities=30% Similarity=0.765 Sum_probs=20.4
Q ss_pred CceeeEEecCccccceEEEEeeCCC
Q psy4958 36 RPLSFTYDKASITSNSLRVVWEPPK 60 (65)
Q Consensus 36 ~p~~L~f~~~~vt~~S~rV~W~pp~ 60 (65)
+|.+|++ .+++.+++.|+|.+|.
T Consensus 2 ~P~~l~v--~~~~~~sv~v~W~~~~ 24 (85)
T PF00041_consen 2 APENLSV--SNISPTSVTVSWKPPS 24 (85)
T ss_dssp SSEEEEE--EEECSSEEEEEEEESS
T ss_pred cCcCeEE--EECCCCEEEEEEECCC
Confidence 5899988 7788999999999995
No 4
>smart00060 FN3 Fibronectin type 3 domain. One of three types of internal repeat within the plasma protein, fibronectin. The tenth fibronectin type III repeat contains a RGD cell recognition sequence in a flexible loop between 2 strands. Type III modules are present in both extracellular and intracellular proteins.
Probab=94.28 E-value=0.062 Score=27.68 Aligned_cols=27 Identities=30% Similarity=0.575 Sum_probs=17.9
Q ss_pred CCceeeEEecCccccceEEEEeeCCCCCC
Q psy4958 35 LRPLSFTYDKASITSNSLRVVWEPPKGFS 63 (65)
Q Consensus 35 l~p~~L~f~~~~vt~~S~rV~W~pp~G~~ 63 (65)
.+|.++.. .....+++.|+|++|.+..
T Consensus 2 ~~p~~~~~--~~~~~~~~~v~W~~~~~~~ 28 (83)
T smart00060 2 SPPSNLRV--TDVTSTSVTLSWEPPPDDG 28 (83)
T ss_pred CCCCcEEE--EEEeCCEEEEEECCCCCCC
Confidence 34555554 3345669999999887653
No 5
>KOG3513|consensus
Probab=94.16 E-value=0.12 Score=44.48 Aligned_cols=55 Identities=22% Similarity=0.329 Sum_probs=48.6
Q ss_pred ccCCCceEEEEEEecCCccCcceeeeeecCCCCceeeEEecCccccceEEEEeeCCCC
Q psy4958 4 IGTGRAYNISVQTVSEDEISTPTTAQYRTIPLRPLSFTYDKASITSNSLRVVWEPPKG 61 (65)
Q Consensus 4 L~PGR~Y~ItV~tvs~~e~S~p~~~~~RT~Pl~p~~L~f~~~~vt~~S~rV~W~pp~G 61 (65)
|.+.-.|.-.|||.- +++|+-..-+-|..|.+|-+|.. .+++.++++|+|.+..-
T Consensus 586 l~~~G~Y~C~aqT~~-Ds~s~~A~l~V~gpPgpP~~v~~--~~i~~t~~~lsW~~g~d 640 (1051)
T KOG3513|consen 586 LEDSGKYTCVAQTAL-DSASARADLLVRGPPGPPPDVHV--DDISDTTARLSWSPGSD 640 (1051)
T ss_pred cccCceEEEEEEEee-cchhcccceEEecCCCCCCceeE--eeeccceEEEEeecCCC
Confidence 566778999999966 99999999999999999999977 88999999999997653
No 6
>KOG1225|consensus
Probab=93.57 E-value=0.045 Score=43.71 Aligned_cols=60 Identities=25% Similarity=0.151 Sum_probs=54.1
Q ss_pred eeccCCCceEEEEEEecCCccCcceeeeeecCCCCceeeEEecCccccceEEEEeeCCCCCC
Q psy4958 2 LIIGTGRAYNISVQTVSEDEISTPTTAQYRTIPLRPLSFTYDKASITSNSLRVVWEPPKGFS 63 (65)
Q Consensus 2 ~~L~PGR~Y~ItV~tvs~~e~S~p~~~~~RT~Pl~p~~L~f~~~~vt~~S~rV~W~pp~G~~ 63 (65)
.+|.||..|+++|.++.+..-|.|..+...|+---+..+.| ...+.+|+.+.|..|.-..
T Consensus 423 ~~~~~g~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~g~~~v--~~~~~~s~e~~g~~~s~~~ 482 (525)
T KOG1225|consen 423 QGLEPGDEYNCSVNTVAANIGSLPKDKSETTVLCWNGGLCV--DGETESSLEVGGPCPSSGT 482 (525)
T ss_pred eeecCCcceeeehhhhhhhhccCCcccccceEeecCCceee--eeeeeccccccCCCCCccc
Confidence 46899999999999999999999999999999888999999 6688999999999987553
No 7
>cd00063 FN3 Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Probab=93.25 E-value=0.1 Score=27.71 Aligned_cols=26 Identities=31% Similarity=0.732 Sum_probs=19.0
Q ss_pred CCCceeeEEecCccccceEEEEeeCCCC
Q psy4958 34 PLRPLSFTYDKASITSNSLRVVWEPPKG 61 (65)
Q Consensus 34 Pl~p~~L~f~~~~vt~~S~rV~W~pp~G 61 (65)
|.+|.++++ .....++++|+|++|..
T Consensus 1 p~~p~~~~~--~~~~~~~~~v~W~~~~~ 26 (93)
T cd00063 1 PSPPTNLRV--TDVTSTSVTLSWTPPED 26 (93)
T ss_pred CcCCCCcEE--EEecCCEEEEEECCCCC
Confidence 456777766 33447999999999865
No 8
>cd00063 FN3 Fibronectin type 3 domain; One of three types of internal repeats found in the plasma protein fibronectin. Its tenth fibronectin type III repeat contains an RGD cell recognition sequence in a flexible loop between 2 strands. Approximately 2% of all animal proteins contain the FN3 repeat; including extracellular and intracellular proteins, membrane spanning cytokine receptors, growth hormone receptors, tyrosine phosphatase receptors, and adhesion molecules. FN3-like domains are also found in bacterial glycosyl hydrolases.
Probab=91.73 E-value=0.25 Score=26.22 Aligned_cols=22 Identities=23% Similarity=0.321 Sum_probs=18.0
Q ss_pred eccCCCceEEEEEEecCCccCc
Q psy4958 3 IIGTGRAYNISVQTVSEDEIST 24 (65)
Q Consensus 3 ~L~PGR~Y~ItV~tvs~~e~S~ 24 (65)
+|.||..|.+.|.++..+..|.
T Consensus 63 ~l~p~~~Y~~~v~a~~~~~~~~ 84 (93)
T cd00063 63 GLKPGTEYEFRVRAVNGGGESP 84 (93)
T ss_pred cccCCCEEEEEEEEECCCccCC
Confidence 5899999999999998655553
No 9
>KOG0196|consensus
Probab=90.76 E-value=0.21 Score=42.63 Aligned_cols=36 Identities=25% Similarity=0.509 Sum_probs=30.8
Q ss_pred CcceeeeeecCCCCceeeEEecCccccceEEEEeeCCCC
Q psy4958 23 STPTTAQYRTIPLRPLSFTYDKASITSNSLRVVWEPPKG 61 (65)
Q Consensus 23 S~p~~~~~RT~Pl~p~~L~f~~~~vt~~S~rV~W~pp~G 61 (65)
++|...-=..+|.+|+||.+ + |+.+|+.+.|.+|+.
T Consensus 321 ~Dp~~mpCT~PPSaP~nlis--~-vn~Ts~~L~W~~P~d 356 (996)
T KOG0196|consen 321 SDPPSMPCTRPPSAPRNLIS--N-VNGTSLILEWSPPAD 356 (996)
T ss_pred CCCCCCCCCCCCCccceeee--e-cccceEEEEecCCcc
Confidence 56667777889999999988 5 889999999999974
No 10
>KOG4221|consensus
Probab=90.75 E-value=0.57 Score=41.40 Aligned_cols=57 Identities=19% Similarity=0.319 Sum_probs=38.8
Q ss_pred eeccCCCceEEEEEEecC---CccCcce------eeeeecCCCCceeeEEecCccccceEEEEeeCCCC
Q psy4958 2 LIIGTGRAYNISVQTVSE---DEISTPT------TAQYRTIPLRPLSFTYDKASITSNSLRVVWEPPKG 61 (65)
Q Consensus 2 ~~L~PGR~Y~ItV~tvs~---~e~S~p~------~~~~RT~Pl~p~~L~f~~~~vt~~S~rV~W~pp~G 61 (65)
-+|.|++.|+|-|.++-- |-+|+.. +..+--+|.+|..|... --+++|.|+|+||.-
T Consensus 679 ~~Lep~T~Y~vrIsa~t~nGtGpaS~w~~aeT~~~d~~e~vp~~ps~l~~~---~g~~si~vsW~Pp~~ 744 (1381)
T KOG4221|consen 679 NGLEPNTQYRVRISAMTVNGTGPASEWVSAETPESDLDERVPGKPSELHVH---PGSNSIVVSWTPPPH 744 (1381)
T ss_pred hcCCCCceEEEEEEEeccCCCCCcccceeccCccccccccCCCCCceeeec---cCceeEEEEeCCCCC
Confidence 369999999988776632 2233333 22333467788888764 338999999999863
No 11
>PF14054 DUF4249: Domain of unknown function (DUF4249)
Probab=90.40 E-value=0.77 Score=31.86 Aligned_cols=58 Identities=22% Similarity=0.355 Sum_probs=35.2
Q ss_pred eccCCCceEEEEEEecCCccCcceeeeeecCCCC--ceeeEEec---Cccccce---EEEEeeCCCCCCCC
Q psy4958 3 IIGTGRAYNISVQTVSEDEISTPTTAQYRTIPLR--PLSFTYDK---ASITSNS---LRVVWEPPKGFSPF 65 (65)
Q Consensus 3 ~L~PGR~Y~ItV~tvs~~e~S~p~~~~~RT~Pl~--p~~L~f~~---~~vt~~S---~rV~W~pp~G~~~~ 65 (65)
.+.||+.|.+.|.+=-+..+|+.. ++|.+ +.++.+.. ....... ++++|.-|.+...|
T Consensus 95 ~~~~G~~Y~L~V~~~~~~~~sa~~-----~vp~~~~i~~v~~~~~~~~~~~~~~~~~i~~~~~D~~~~~ny 160 (298)
T PF14054_consen 95 RGRPGRTYRLEVETPGGKTYSAET-----TVPPPPPIDSVSYEFKKIGDGDEGEYYRISITFQDPPGEDNY 160 (298)
T ss_pred cccCCCEEEEEEEECCCCEEEEEE-----EECCCCceeEEEEEEecCCCCcccceeEEEEEEeCCCCCceE
Confidence 578999999999976444444432 34444 34554431 1112222 89999998877643
No 12
>KOG0196|consensus
Probab=86.55 E-value=1.5 Score=37.73 Aligned_cols=58 Identities=19% Similarity=0.247 Sum_probs=42.5
Q ss_pred eeccCCCceEEEEEEecCCccC---ccee------eeeecCCCCceeeEEecCccccceEEEEeeCCCCC
Q psy4958 2 LIIGTGRAYNISVQTVSEDEIS---TPTT------AQYRTIPLRPLSFTYDKASITSNSLRVVWEPPKGF 62 (65)
Q Consensus 2 ~~L~PGR~Y~ItV~tvs~~e~S---~p~~------~~~RT~Pl~p~~L~f~~~~vt~~S~rV~W~pp~G~ 62 (65)
-+|.|-..|...|+++- |+-. .|.. ......|.++.+|+. ...+.+|+.|+|..|.-.
T Consensus 403 ~~L~ah~~YTFeV~AvN-gVS~lsp~~~~~a~vnItt~qa~ps~V~~~r~--~~~~~~sitlsW~~p~~p 469 (996)
T KOG0196|consen 403 SDLLAHTNYTFEVEAVN-GVSDLSPFPRQFASVNITTNQAAPSPVSVLRQ--VSRTSDSITLSWSEPDQP 469 (996)
T ss_pred eccccccccEEEEEEee-cccccCCCCCcceeEEeeccccCCCccceEEE--eeeccCceEEecCCCCCC
Confidence 47999999999999987 3322 1111 124567888999987 446789999999988654
No 13
>PF02014 Reeler: Reeler domain Schematic picture including Reeler domain; InterPro: IPR002861 Extracellular matrix (ECM) proteins play an important role in early cortical development, specifically in the formation of neural connections and in controlling the cyto-architecture of the central nervous system. The product of the reeler gene in mouse is reelin,a large extracellular protein secreted by pioneer neurons that coordinates cell positioning during neurodevelopment []. F-spondin and mindin are a family of matrix-attached adhesion molecules that share structural similarities and overlapping domains of expression. Both F-spondin and mindin promote adhesion and outgrowth of hippocampal embryonic neurons and bind to a putative receptor(s) expressed on both hippocampal and sensory neurons []. This domain of unknown function is found at the N terminus of reelin and F-spondin.; PDB: 2ZOT_B 2ZOU_B 3COO_A.
Probab=86.16 E-value=5.6 Score=25.22 Aligned_cols=57 Identities=18% Similarity=0.267 Sum_probs=30.7
Q ss_pred ccCCCceEEEEEEecCCccCcceeeeeecCCCC----c----------------------eeeEEecCccccceEEEEee
Q psy4958 4 IGTGRAYNISVQTVSEDEISTPTTAQYRTIPLR----P----------------------LSFTYDKASITSNSLRVVWE 57 (65)
Q Consensus 4 L~PGR~Y~ItV~tvs~~e~S~p~~~~~RT~Pl~----p----------------------~~L~f~~~~vt~~S~rV~W~ 57 (65)
..||..|+|++ ..+++..-.--..|+|-.-.. + ..++ ..+...-+++.+.|.
T Consensus 30 y~pg~~~~Vtl-~~~~~~~F~GFllqAr~~~~~~~~~vG~f~~~~~~~~~~~~~C~~~~~~avT-H~~~~~K~~v~~~W~ 107 (132)
T PF02014_consen 30 YEPGQTYTVTL-SSSGSSSFRGFLLQARDANNSSPAPVGTFQLPPDSDTTQLLDCSGGDNNAVT-HSNNSPKTSVTFTWT 107 (132)
T ss_dssp B-TTBEEEEEE-EETTTEEBSEEEEEEEETT--SSSBSSEEEES-TTTEEEETTE----EEEEE-ESS-S-BSEEEEEEE
T ss_pred EcCCCEEEEEE-ECCCCCceeEEEEEEEeCCCCCcccccccCcCCcccceEeecccccccceEE-ecCCCCccEEEEEEc
Confidence 57999999999 455565555555555532222 1 1111 123333478999999
Q ss_pred CCCCC
Q psy4958 58 PPKGF 62 (65)
Q Consensus 58 pp~G~ 62 (65)
+|...
T Consensus 108 AP~~~ 112 (132)
T PF02014_consen 108 APADF 112 (132)
T ss_dssp --STT
T ss_pred CCCCC
Confidence 99863
No 14
>cd08544 Reeler Reeler, the N-terminal domain of reelin, F-spondin, and a variety of other proteins. This domain is found at the N-terminus of F-spondin, a protein attached to the extracellular matrix, which plays roles in neuronal development and vascular remodelling. The F-spondin reeler domain has been reported to bind heparin. The reeler domain is also found at the N-terminus of reelin, an extracellular glycoprotein involved in the development of the brain cortex, and in a variety of other eukaryotic proteins with different domain architectures, including the animal ferric-chelate reductase 1 or stromal cell-derived receptor 2, a member of the cytochrome B561 family, which reduces ferric iron before its transport from the endosome to the cytoplasm. Also included is the insect putative defense protein 1, which is expressed upon bacterial infection and appears to contain a single reeler domain.
Probab=85.30 E-value=6.1 Score=24.99 Aligned_cols=58 Identities=17% Similarity=0.213 Sum_probs=33.0
Q ss_pred ccCCCceEEEEEEecCCccCcceeeeeecCC--CCceeeEEec-------------Cccc------cceEEEEeeCCCCC
Q psy4958 4 IGTGRAYNISVQTVSEDEISTPTTAQYRTIP--LRPLSFTYDK-------------ASIT------SNSLRVVWEPPKGF 62 (65)
Q Consensus 4 L~PGR~Y~ItV~tvs~~e~S~p~~~~~RT~P--l~p~~L~f~~-------------~~vt------~~S~rV~W~pp~G~ 62 (65)
-.||..|+|+++.-.. ....--..|+|-.- .++..+.... +.++ -+++.+.|.+|...
T Consensus 30 y~pG~~~~Vtl~~~~~-~~F~GF~lqAr~~~~~~~vG~f~~~~~~~tk~~~c~~~~~avtH~~~~~k~~v~~~W~AP~~~ 108 (135)
T cd08544 30 YVPGETYTVTLSGSSP-SPFRGFLLQARDASSNEPVGTFQLPPDSGTKLLTCCGSDNAVTHSNNSKKTSVTFTWTAPSNG 108 (135)
T ss_pred ECCCCEEEEEEECCCC-CceeEEEEEEEcCCCCcceEEEEeCCCcccEEecCCCCCCcceeCCCCCCceeEEEEECCCCC
Confidence 4799999999986542 23344455555531 1122211110 2233 45789999999864
No 15
>PF10179 DUF2369: Uncharacterised conserved protein (DUF2369); InterPro: IPR019326 This is a proline-rich region of a group of proteins found from plants to fungi. The function is largely unknown, although the entry contains Fibronectin type-III domain-containing protein C4orf31, which promotes matrix assembly and cell adhesiveness.
Probab=84.01 E-value=2.9 Score=31.40 Aligned_cols=60 Identities=22% Similarity=0.265 Sum_probs=42.1
Q ss_pred eccCCCceEEEEEEecCCccCcceeee-------eecCCCCceeeEEecCc--cccceEEEEeeCCCCC
Q psy4958 3 IIGTGRAYNISVQTVSEDEISTPTTAQ-------YRTIPLRPLSFTYDKAS--ITSNSLRVVWEPPKGF 62 (65)
Q Consensus 3 ~L~PGR~Y~ItV~tvs~~e~S~p~~~~-------~RT~Pl~p~~L~f~~~~--vt~~S~rV~W~pp~G~ 62 (65)
++.||-.|-|.++..+.+|...-..++ .+-+|+-|.++++..-+ -+=+|+.|.|.+....
T Consensus 134 ~~~~g~~Yliri~~~~~~e~~~~~kV~aast~~~~~~~P~LP~d~~Ik~f~~lrtC~SvTIAW~~s~d~ 202 (300)
T PF10179_consen 134 GVKPGERYLIRIQISNSDEGPSTFKVQAASTNPSKQPYPQLPDDTSIKEFNKLRTCNSVTIAWLGSPDR 202 (300)
T ss_pred CCCCCCeEEEEEEccCCCCCceEEEEEEecCCcccCCCCCCCCCCceeEEcCCcccceEEEEEecCCCC
Confidence 568999999999877777766555555 34466668777763222 4558999999975543
No 16
>KOG4802|consensus
Probab=83.61 E-value=2.8 Score=33.80 Aligned_cols=59 Identities=27% Similarity=0.351 Sum_probs=33.6
Q ss_pred eccCCCceEEEEEEecC------CccCcceeeeee-cCCCCceeeEEec---CccccceEEEEeeCCCCCC
Q psy4958 3 IIGTGRAYNISVQTVSE------DEISTPTTAQYR-TIPLRPLSFTYDK---ASITSNSLRVVWEPPKGFS 63 (65)
Q Consensus 3 ~L~PGR~Y~ItV~tvs~------~e~S~p~~~~~R-T~Pl~p~~L~f~~---~~vt~~S~rV~W~pp~G~~ 63 (65)
-+.|||-|++-|.+|+- .+.|.|-....- -.|-+|.+|+.-+ ++. -+ +.|.|-|+..|-
T Consensus 215 ~~rPgRwyefrvaavn~~G~rGFs~PSkpf~ssk~pkaPp~P~dl~l~~v~~dG~-~~-~~v~w~P~~sdl 283 (516)
T KOG4802|consen 215 DMRPGRWYEFRVAAVNAYGFRGFSEPSKPFPSSKNPKAPPSPNDLKLIGVQFDGR-YM-LKVVWCPSKSDL 283 (516)
T ss_pred ecCcceeEEEEEeeeecccccccCCCCCCCCCCCCCCCCcCcccceeeeeeecce-EE-EEEEeCCCCCCC
Confidence 36899999999999931 122322222221 1233355554321 222 23 889999998763
No 17
>PF09294 Interfer-bind: Interferon-alpha/beta receptor, fibronectin type III; InterPro: IPR015373 Members of this family adopt a secondary structure consisting of seven beta-strands arranged in an immunoglobulin-like beta-sandwich, in a Greek-key topology. They are required for binding to interferon-alpha []. ; PDB: 1A21_A 3LQM_B 3ELA_T 1AHW_C 2A2Q_T 1TFH_B 1FAK_T 1WSS_T 1W2K_T 2FIR_T ....
Probab=78.43 E-value=2.4 Score=25.20 Aligned_cols=15 Identities=33% Similarity=0.446 Sum_probs=11.8
Q ss_pred eeccCCCceEEEEEE
Q psy4958 2 LIIGTGRAYNISVQT 16 (65)
Q Consensus 2 ~~L~PGR~Y~ItV~t 16 (65)
-.|.||+.|=|+|++
T Consensus 72 ~~L~p~t~YCv~V~~ 86 (106)
T PF09294_consen 72 SDLKPGTNYCVSVQA 86 (106)
T ss_dssp ES--TTSEEEEEEEE
T ss_pred eCCCCCCCEEEEEEE
Confidence 368999999999998
No 18
>KOG3513|consensus
Probab=75.32 E-value=6.3 Score=34.34 Aligned_cols=58 Identities=21% Similarity=0.319 Sum_probs=43.1
Q ss_pred CeeccCCCceEEEEEEecCCccCcce--eeeeecCCC---C-ceeeEEecCccccceEEEEeeCCC
Q psy4958 1 MLIIGTGRAYNISVQTVSEDEISTPT--TAQYRTIPL---R-PLSFTYDKASITSNSLRVVWEPPK 60 (65)
Q Consensus 1 ~~~L~PGR~Y~ItV~tvs~~e~S~p~--~~~~RT~Pl---~-p~~L~f~~~~vt~~S~rV~W~pp~ 60 (65)
++.|.|=-.|+..|.++-.-..++|. +-..||-+. . |+||+- .+..-+.|.|+|+|=.
T Consensus 678 vv~L~Pwv~YeFRV~AvN~iG~gePS~pS~~~rT~ea~P~~~P~nv~g--~g~~~~eLvItW~Pl~ 741 (1051)
T KOG3513|consen 678 VVNLSPWVEYEFRVVAVNSIGIGEPSPPSEKVRTPEAAPSVNPSNVKG--GGGSPTELVITWEPLP 741 (1051)
T ss_pred EEccCCCcceEEEEEEEcccccCCCCCCccceecCCCCCccCCccccc--cCCCCceEEEEeccCC
Confidence 36799999999999998654444432 456777654 4 799965 6667899999999854
No 19
>PF13205 Big_5: Bacterial Ig-like domain
Probab=65.99 E-value=7.3 Score=22.86 Aligned_cols=12 Identities=17% Similarity=0.617 Sum_probs=10.8
Q ss_pred eccCCCceEEEE
Q psy4958 3 IIGTGRAYNISV 14 (65)
Q Consensus 3 ~L~PGR~Y~ItV 14 (65)
.|.||+.|.|+|
T Consensus 71 ~L~~~t~Y~v~i 82 (107)
T PF13205_consen 71 PLKPGTTYTVTI 82 (107)
T ss_pred cCCCCCEEEEEE
Confidence 488999999998
No 20
>PF09423 PhoD: PhoD-like phosphatase; InterPro: IPR018946 This entry contains a number of putative proteins as well as Alkaline phosphatase D which catalyses the reaction: A phosphate monoester + H(2)O = an alcohol + phosphate ; PDB: 2YEQ_B.
Probab=64.60 E-value=8.6 Score=28.91 Aligned_cols=39 Identities=23% Similarity=0.392 Sum_probs=17.4
Q ss_pred eeccCCCceEEEEEEecCCccCcceeeeeecCCCC-ceeeEE
Q psy4958 2 LIIGTGRAYNISVQTVSEDEISTPTTAQYRTIPLR-PLSFTY 42 (65)
Q Consensus 2 ~~L~PGR~Y~ItV~tvs~~e~S~p~~~~~RT~Pl~-p~~L~f 42 (65)
-+|.||+.|...|..-.++..| + .-+-||.|.. +..++|
T Consensus 69 ~gL~p~t~Y~Y~~~~~~~~~~s-~-~g~~rT~p~~~~~~~r~ 108 (453)
T PF09423_consen 69 TGLQPGTRYYYRFVVDGGGQTS-P-VGRFRTAPDGDPDPFRF 108 (453)
T ss_dssp -S--TT-EEEEEEEE--TTEE-----EEEE--TT-----EEE
T ss_pred CCCCCCceEEEEEEEecCCCCC-C-ceEEEcCCCCCCCceEE
Confidence 4799999998888862222333 3 3567888877 666776
No 21
>PF01108 Tissue_fac: Tissue factor; PDB: 3OG4_B 3OG6_B 1FYH_E 1FG9_D 1JRH_I 3DGC_R 3DLQ_R 1LQS_R 1Y6M_R 1J7V_R ....
Probab=62.62 E-value=12 Score=22.70 Aligned_cols=28 Identities=21% Similarity=0.313 Sum_probs=19.5
Q ss_pred CCCCceeeEEecCccccceEEEEeeCCCCCC
Q psy4958 33 IPLRPLSFTYDKASITSNSLRVVWEPPKGFS 63 (65)
Q Consensus 33 ~Pl~p~~L~f~~~~vt~~S~rV~W~pp~G~~ 63 (65)
...+|.||+|+.. .-.-.++|+++.+..
T Consensus 21 ~lp~P~nv~~~s~---nf~~iL~W~~~~~~~ 48 (107)
T PF01108_consen 21 SLPAPQNVTVDSV---NFKHILRWDPGPGSP 48 (107)
T ss_dssp SGSSCEEEEEEEE---TTEEEEEEEESTTSS
T ss_pred cCCCCCeeEEEEE---CCceEEEeCCCCCCC
Confidence 3344999999543 345789999966643
No 22
>PF14250 AbrB-like: AbrB-like transcriptional regulator
Probab=59.81 E-value=5.5 Score=24.58 Aligned_cols=14 Identities=14% Similarity=0.413 Sum_probs=12.2
Q ss_pred CeeccCCCceEEEE
Q psy4958 1 MLIIGTGRAYNISV 14 (65)
Q Consensus 1 ~~~L~PGR~Y~ItV 14 (65)
+++|.||.++.|.+
T Consensus 50 ~m~L~PGdEFeI~L 63 (71)
T PF14250_consen 50 QMGLKPGDEFEIKL 63 (71)
T ss_pred HhCCCCCCEEEEEe
Confidence 47899999999986
No 23
>PF02018 CBM_4_9: Carbohydrate binding domain; InterPro: IPR003305 The 1,4-beta-glucanase CenC from Cellulomonas fimi contains two cellulose-binding domains, CBD(N1) and CBD(N2), arranged in tandem at its N terminus. These homologous CBDs are distinct in their selectivity for binding amorphous and not crystalline cellulose []. Multidimensional heteronuclear nuclear magnetic resonance (NMR) spectroscopy was used to determine the tertiary structure of the 152 amino acid N-terminal cellulose-binding domain from C. fimi 1,4-beta-glucanase CenC (CBDN1) []. The tertiary structure of CBDN1 is strikingly similar to that of the bacterial 1,3-1,4-beta-glucanases, as well as other sugar-binding proteins with jelly-roll folds.; GO: 0016798 hydrolase activity, acting on glycosyl bonds; PDB: 3OEA_B 2ZEX_B 3OEB_A 2ZEY_A 2ZEW_A 1GUI_A 2W5F_A 2WZE_A 2WYS_A 2ZEZ_B ....
Probab=55.60 E-value=15 Score=21.62 Aligned_cols=20 Identities=15% Similarity=0.426 Sum_probs=15.5
Q ss_pred eeccCCCceEEEEEEecCCc
Q psy4958 2 LIIGTGRAYNISVQTVSEDE 21 (65)
Q Consensus 2 ~~L~PGR~Y~ItV~tvs~~e 21 (65)
..|.||..|.|++|.-....
T Consensus 56 ~~l~~G~~Y~~s~~vk~~~~ 75 (131)
T PF02018_consen 56 ISLKPGKTYTVSFWVKADSG 75 (131)
T ss_dssp EEE-TTSEEEEEEEEEESSS
T ss_pred eEecCCCEEEEEEEEEeCCC
Confidence 46899999999999777444
No 24
>KOG3632|consensus
Probab=53.54 E-value=19 Score=32.08 Aligned_cols=61 Identities=26% Similarity=0.439 Sum_probs=41.4
Q ss_pred eeccCCCceEEEEEEecCCccC----------cceeeeeecCC----CCceeeEEecCccccceEEEEeeCCCCCC
Q psy4958 2 LIIGTGRAYNISVQTVSEDEIS----------TPTTAQYRTIP----LRPLSFTYDKASITSNSLRVVWEPPKGFS 63 (65)
Q Consensus 2 ~~L~PGR~Y~ItV~tvs~~e~S----------~p~~~~~RT~P----l~p~~L~f~~~~vt~~S~rV~W~pp~G~~ 63 (65)
..|.||..|+++|-.----..+ -..+.+-||+| .+|..++... +++-..++|+|.||.=||
T Consensus 723 ~~lrpgt~y~a~vea~~p~q~pwdl~~v~~etr~atv~f~tLpAGppappldV~vE~-g~spg~l~vswrPptlds 797 (1335)
T KOG3632|consen 723 MRLRPGTDYWASVEAALPRQEPWDLRMVPMETRQATVLFRTLPAGPPAPPLDVKVET-GGSPGRLEVSWRPPTLDS 797 (1335)
T ss_pred hccCCCCccceecccccCcCCCcccccchhhhhccceeeecccCCCCCCchheeeec-CCCCceeeeeccCceecc
Confidence 3689999999999643310000 12344567765 4578888764 488899999999997665
No 25
>KOG4258|consensus
Probab=45.27 E-value=33 Score=30.07 Aligned_cols=57 Identities=25% Similarity=0.374 Sum_probs=35.7
Q ss_pred eeccCCCceEEEEEEecCCccCcceeee-----eecCCCC---ceeeEEecCccccceEEEEeeCCC
Q psy4958 2 LIIGTGRAYNISVQTVSEDEISTPTTAQ-----YRTIPLR---PLSFTYDKASITSNSLRVVWEPPK 60 (65)
Q Consensus 2 ~~L~PGR~Y~ItV~tvs~~e~S~p~~~~-----~RT~Pl~---p~~L~f~~~~vt~~S~rV~W~pp~ 60 (65)
-+|-|-+.|-|=|.|+--.++-+...|+ -||.|.. |...-. .+ -+++-|-|+|.||.
T Consensus 571 ~~LkP~TqYAvfVkT~t~t~~~~~~~A~S~I~YvqT~~~~PspPl~~ls-~s-nsSsqi~l~W~pP~ 635 (1025)
T KOG4258|consen 571 DGLKPWTQYAVFVKTLTVTEAHEAYEAKSKIGYVQTLPDIPSPPLDVLS-KS-NSSSQILLKWKPPS 635 (1025)
T ss_pred hcCCccceeEEEEeeeehhhhccccccccceEEEEecCCCCCCcchhhh-cc-CcchheeEEecCCC
Confidence 5799999999999999434443333222 3666555 443321 12 22457999999985
No 26
>KOG4222|consensus
Probab=43.20 E-value=33 Score=30.74 Aligned_cols=58 Identities=22% Similarity=0.279 Sum_probs=37.1
Q ss_pred eccCCCceEEEEEEecC------CccCcceeeeee-cCCCC-ce---eeEEecCccccceEEEEeeCCCCC
Q psy4958 3 IIGTGRAYNISVQTVSE------DEISTPTTAQYR-TIPLR-PL---SFTYDKASITSNSLRVVWEPPKGF 62 (65)
Q Consensus 3 ~L~PGR~Y~ItV~tvs~------~e~S~p~~~~~R-T~Pl~-p~---~L~f~~~~vt~~S~rV~W~pp~G~ 62 (65)
-|.||+.|++.|.-... +--|+++.+.-- -.|.. |. +++. ..-+.+++.|+|.+|..|
T Consensus 707 nl~p~t~ye~f~~Pf~~~~~s~~g~pS~sk~alt~e~~PSapp~~~~~~s~--~~~n~Ta~~Vsw~~pp~d 775 (1281)
T KOG4222|consen 707 NLKPGTNYEFFVRPFFPHGYSIQGAPSNSKTALTLEEPPSAPPQGVQHVSK--GSYNGTAGSVSWAPPPAD 775 (1281)
T ss_pred ccCCCccceeeccCccCCCcceecCCcccccccccccCCCCCCCCcccccc--ccCCCceeeEEecCCccc
Confidence 48899999999876654 333444443211 12222 34 4644 557789999999999655
No 27
>PF07691 PA14: PA14 domain; InterPro: IPR011658 The PA14 domain forms an insert in bacterial beta-glucosidases, other glycosidases, glycosyltransferases, proteases, amidases, yeast adhesins and bacterial toxins, including anthrax protective antigen (PA). The domain also occurs in a Dictyostelium pre-spore cell-inducing factor Psi and in fibrocystin, the mammalian protein whose mutation leads to polycystic kidney and hepatic disease. The crystal structure of PA shows that this domain (named PA14 after its location in the PA20 pro-peptide) has a beta-barrel structure. The PA14 domain sequence suggests a binding function, rather than a catalytic role. The PA14 domain distribution is compatible with carbohydrate binding [].; PDB: 2XVG_A 2XVK_A 2XVL_A 2XJU_A 2XJT_A 2XJQ_A 2XJS_A 2XJV_A 2XJP_A 2XJR_A ....
Probab=40.88 E-value=78 Score=19.22 Aligned_cols=30 Identities=23% Similarity=0.461 Sum_probs=15.1
Q ss_pred eecCCCCceeeEEecCccccceEEEEeeCCCCC
Q psy4958 30 YRTIPLRPLSFTYDKASITSNSLRVVWEPPKGF 62 (65)
Q Consensus 30 ~RT~Pl~p~~L~f~~~~vt~~S~rV~W~pp~G~ 62 (65)
++.++.++..... . .....|+|.|.+|.+.
T Consensus 109 g~~y~i~i~y~~~--~-~~~~~~~l~w~~p~~~ 138 (145)
T PF07691_consen 109 GGKYPIRIEYFNR--G-GDASLIQLSWTPPGGA 138 (145)
T ss_dssp T-EEEEEEEEEEC--S-CSSCCEEEEEE-TTS-
T ss_pred CeeEEEEEEEEEC--C-CCCEEEEEEEECCCCc
Confidence 3455555544432 2 2223369999998764
No 28
>PF10179 DUF2369: Uncharacterised conserved protein (DUF2369); InterPro: IPR019326 This is a proline-rich region of a group of proteins found from plants to fungi. The function is largely unknown, although the entry contains Fibronectin type-III domain-containing protein C4orf31, which promotes matrix assembly and cell adhesiveness.
Probab=40.88 E-value=28 Score=26.26 Aligned_cols=18 Identities=17% Similarity=0.204 Sum_probs=15.2
Q ss_pred CeeccCCCceEEEEEEec
Q psy4958 1 MLIIGTGRAYNISVQTVS 18 (65)
Q Consensus 1 ~~~L~PGR~Y~ItV~tvs 18 (65)
+.+|.||+.|=|-|+..-
T Consensus 264 I~~L~PG~~Yl~dV~~~~ 281 (300)
T PF10179_consen 264 IKGLKPGTTYLFDVYVNG 281 (300)
T ss_pred cccCCCCcEEEEEEEEec
Confidence 368999999999998764
No 29
>smart00758 PA14 domain in bacterial beta-glucosidases other glycosidases, glycosyltransferases, proteases, amidases, yeast adhesins, and bacterial toxins.
Probab=38.33 E-value=94 Score=18.98 Aligned_cols=32 Identities=25% Similarity=0.325 Sum_probs=19.3
Q ss_pred eeecCCCCceeeEEecCccccceEEEEeeCCCCCCC
Q psy4958 29 QYRTIPLRPLSFTYDKASITSNSLRVVWEPPKGFSP 64 (65)
Q Consensus 29 ~~RT~Pl~p~~L~f~~~~vt~~S~rV~W~pp~G~~~ 64 (65)
.++.++.++..... ....++++.|..|.+..|
T Consensus 100 ~g~~~~i~v~y~~~----~~~~~~~l~w~~~~~~~~ 131 (136)
T smart00758 100 AGGTYPIRIEYFEA----GTGGLLKLGWTTPDAAKE 131 (136)
T ss_pred CCcEEEEEEEEEeC----CCCEEEEEEEECCCCCcE
Confidence 34455555544422 234588999999987653
No 30
>KOG4152|consensus
Probab=37.78 E-value=44 Score=28.26 Aligned_cols=52 Identities=27% Similarity=0.553 Sum_probs=0.0
Q ss_pred ccCCCceEEEEEEecCCccCcceeeeeec------CCCCceeeEEecCccccceEEEEeeCC
Q psy4958 4 IGTGRAYNISVQTVSEDEISTPTTAQYRT------IPLRPLSFTYDKASITSNSLRVVWEPP 59 (65)
Q Consensus 4 L~PGR~Y~ItV~tvs~~e~S~p~~~~~RT------~Pl~p~~L~f~~~~vt~~S~rV~W~pp 59 (65)
|+||..|..-|.++- |.---|-..-... +|-.|+.++..++ +..+.+.||||
T Consensus 654 lv~Gq~yrfrV~aIn-g~G~gp~s~i~~~kTc~pG~P~apS~~ri~k~---~eGi~l~wepp 711 (830)
T KOG4152|consen 654 LVTGQAYRFRVTAIN-GKGPGPASTILKLKTCAPGKPTAPSGARIKKT---IEGISLVWEPP 711 (830)
T ss_pred cccccceeeeeeeee-ccCCCchhhheeeeeccCCCCCCccccccccc---ccceeecccCC
No 31
>PF03684 UPF0179: Uncharacterised protein family (UPF0179); InterPro: IPR005369 The function of this family is unknown, however the proteins contain two cysteine clusters that may be iron sulphur redox centres.
Probab=36.60 E-value=21 Score=24.31 Aligned_cols=17 Identities=35% Similarity=0.606 Sum_probs=13.7
Q ss_pred eccCCCceEEEEEEecCCc
Q psy4958 3 IIGTGRAYNISVQTVSEDE 21 (65)
Q Consensus 3 ~L~PGR~Y~ItV~tvs~~e 21 (65)
.|.|||-|.|+ .|....
T Consensus 37 nLe~Gr~YrI~--~VR~~~ 53 (142)
T PF03684_consen 37 NLEPGRRYRIV--EVRNNE 53 (142)
T ss_pred cCCCCceEEEE--EEcCCC
Confidence 69999999997 566555
No 32
>PF07532 Big_4: Bacterial Ig-like domain (group 4); InterPro: IPR011081 This entry represents bacterial domains with an Ig-like fold. These domains are found in a variety of bacterial surface proteins.
Probab=36.28 E-value=77 Score=17.39 Aligned_cols=14 Identities=21% Similarity=0.715 Sum_probs=11.6
Q ss_pred CccccceEEEEeeC
Q psy4958 45 ASITSNSLRVVWEP 58 (65)
Q Consensus 45 ~~vt~~S~rV~W~p 58 (65)
++.+...+.|.|+.
T Consensus 25 ~dGs~~~~~V~W~~ 38 (59)
T PF07532_consen 25 SDGSTEEVPVTWDP 38 (59)
T ss_pred CCCCEEEEEeEeCC
Confidence 66777889999995
No 33
>cd03869 M14_CPX_like Peptidase M14-like domain of carboxypeptidase (CP)-like protein X (CPX), CPX forms a distinct subgroup of the N/E subfamily of the M14 family of metallocarboxypeptidases (MCPs). The M14 family are zinc-binding CPs which hydrolyze single, C-terminal amino acids from polypeptide chains, and have a recognition site for the free C-terminal carboxyl group, which is a key determinant of specificity. Proteins belonging to this subgroup include CP-like protein X1 (CPX1), CP-like protein X2 (CPX2), and aortic CP-like protein (ACLP) and its isoform adipocyte enhancer binding protein-1 (AEBP1). AEBP1 is a truncated form of ACLP, which may arise from alternative splicing of the gene. These proteins are inactive towards standard CP substrates because they lack one or more critical active site and substrate-binding residues that are necessary for activity. They may function as binding proteins rather than as active CPs or display catalytic activity toward other substrates. Pro
Probab=36.11 E-value=24 Score=27.42 Aligned_cols=13 Identities=15% Similarity=0.710 Sum_probs=10.9
Q ss_pred eccCCCceEEEEEE
Q psy4958 3 IIGTGRAYNISVQT 16 (65)
Q Consensus 3 ~L~PGR~Y~ItV~t 16 (65)
.|.||+ |+|+|++
T Consensus 368 ll~pG~-y~v~~~a 380 (405)
T cd03869 368 LLNPGE-YRVTAHA 380 (405)
T ss_pred ecCCce-EEEEEEe
Confidence 489995 9999985
No 34
>COG1860 Uncharacterized protein conserved in archaea [Function unknown]
Probab=35.42 E-value=23 Score=24.53 Aligned_cols=12 Identities=33% Similarity=0.681 Sum_probs=10.3
Q ss_pred eeccCCCceEEE
Q psy4958 2 LIIGTGRAYNIS 13 (65)
Q Consensus 2 ~~L~PGR~Y~It 13 (65)
+-|+|||-|.|+
T Consensus 38 ~nLe~GrrYkVv 49 (147)
T COG1860 38 LNLEEGRRYKVV 49 (147)
T ss_pred eccCCCcEEEEE
Confidence 369999999986
No 35
>PF03442 CBM_X2: Carbohydrate binding domain X2; InterPro: IPR005102 The structure of this module is known [] and consists of an Ig-like fold. The function of this domain is unknown, but might be involved in mediating interaction with carbohydrates.; PDB: 1EHX_A.
Probab=35.34 E-value=20 Score=21.76 Aligned_cols=11 Identities=27% Similarity=0.561 Sum_probs=6.9
Q ss_pred eccCCCceEEE
Q psy4958 3 IIGTGRAYNIS 13 (65)
Q Consensus 3 ~L~PGR~Y~It 13 (65)
.|++|++|.|+
T Consensus 39 ~L~~g~DYtvs 49 (84)
T PF03442_consen 39 TLVEGTDYTVS 49 (84)
T ss_dssp --G-GGGEEEE
T ss_pred cccCCCCEEEe
Confidence 48999999883
No 36
>PRK01177 hypothetical protein; Provisional
Probab=30.12 E-value=31 Score=23.54 Aligned_cols=17 Identities=41% Similarity=0.534 Sum_probs=13.2
Q ss_pred eccCCCceEEEEEEecCCc
Q psy4958 3 IIGTGRAYNISVQTVSEDE 21 (65)
Q Consensus 3 ~L~PGR~Y~ItV~tvs~~e 21 (65)
.|.|||-|.|+ .|...+
T Consensus 39 nLe~GrrYrI~--~VR~~~ 55 (140)
T PRK01177 39 NLEPGRRYKIL--KVREIE 55 (140)
T ss_pred cCCCCceEEEE--EEecCC
Confidence 69999999997 566433
No 37
>PF07495 Y_Y_Y: Y_Y_Y domain; InterPro: IPR011123 This region is mostly found at the end of the beta propellers (IPR011110 from INTERPRO) in a family of two component regulators. However they are also found tandemly repeated in Q891H4 from SWISSPROT without other signal conduction domains being present. It is named after the conserved tyrosines found in the alignment. The exact function is not known.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=29.43 E-value=52 Score=17.74 Aligned_cols=17 Identities=18% Similarity=0.499 Sum_probs=11.2
Q ss_pred ccCCCceEEEEEEecCCc
Q psy4958 4 IGTGRAYNISVQTVSEDE 21 (65)
Q Consensus 4 L~PGR~Y~ItV~tvs~~e 21 (65)
|.|| .|.+.|++.....
T Consensus 36 L~~G-~Y~l~V~a~~~~~ 52 (66)
T PF07495_consen 36 LPPG-KYTLEVRAKDNNG 52 (66)
T ss_dssp --SE-EEEEEEEEEETTS
T ss_pred CCCE-EEEEEEEEECCCC
Confidence 5565 5999999887443
No 38
>TIGR02148 Fibro_Slime fibro-slime domain. This model represents a conserved region of about 90 amino acids, shared in at least 4 distinct large putative proteins from the slime mold Dictyostelium discoideum and 10 proteins from the rumen bacterium Fibrobacter succinogenes, and in no other species so far. We propose here the name fibro-slime domain
Probab=28.58 E-value=61 Score=20.61 Aligned_cols=17 Identities=18% Similarity=0.358 Sum_probs=14.4
Q ss_pred eeccCCCceEEEE-EEec
Q psy4958 2 LIIGTGRAYNISV-QTVS 18 (65)
Q Consensus 2 ~~L~PGR~Y~ItV-~tvs 18 (65)
|+|++|+.|.+.+ +|-.
T Consensus 58 lgl~~g~~Y~~d~F~~ER 75 (90)
T TIGR02148 58 LGLKEGKTYPFDIFYCER 75 (90)
T ss_pred cCCccCcEeeEEEEEEee
Confidence 6899999999999 6555
No 39
>COG4026 Uncharacterized protein containing TOPRIM domain, potential nuclease [General function prediction only]
Probab=27.24 E-value=40 Score=25.44 Aligned_cols=16 Identities=25% Similarity=0.609 Sum_probs=13.2
Q ss_pred eccCCCceEEEEEEec
Q psy4958 3 IIGTGRAYNISVQTVS 18 (65)
Q Consensus 3 ~L~PGR~Y~ItV~tvs 18 (65)
.|.=|-.|||||+++-
T Consensus 28 r~aLG~eYnITisSIi 43 (290)
T COG4026 28 RLALGSEYNITISSII 43 (290)
T ss_pred HhhhcccceeEEEeec
Confidence 3556899999999887
No 40
>PF05935 Arylsulfotrans: Arylsulfotransferase (ASST); InterPro: IPR010262 This family consists of several bacterial arylsulphotransferase proteins. Arylsulphotransferase (ASST) transfers a sulphate group from phenolic sulphate esters to a phenolic acceptor substrate [].; PDB: 3ETT_B 3ELQ_A 3ETS_A.
Probab=27.20 E-value=71 Score=24.63 Aligned_cols=35 Identities=14% Similarity=0.157 Sum_probs=18.9
Q ss_pred CeeccCCCceEEEEEEec-CCccCcceeeeeecCCCC
Q psy4958 1 MLIIGTGRAYNISVQTVS-EDEISTPTTAQYRTIPLR 36 (65)
Q Consensus 1 ~~~L~PGR~Y~ItV~tvs-~~e~S~p~~~~~RT~Pl~ 36 (65)
++||.|+..+.|+|..-. .|+. ...+..-.|-+++
T Consensus 56 V~GLyp~~~N~V~v~~~~~~G~~-~~~~~~i~t~~~~ 91 (477)
T PF05935_consen 56 VIGLYPDYENTVTVTYTDEDGKT-ETKTYTITTPPLP 91 (477)
T ss_dssp EE---TTSEEEEEEEEEE-TTEE-EEEEEEEE-----
T ss_pred EEEEeCCCeEEEEEEEEccCCeE-eeeEEEEECCCcC
Confidence 479999999999999887 3333 4444555555554
No 41
>KOG4258|consensus
Probab=27.10 E-value=29 Score=30.37 Aligned_cols=58 Identities=26% Similarity=0.324 Sum_probs=41.3
Q ss_pred eccCCCceEEEEEEecCCc----cCcceeeeeecCCCC-----ceeeEEecCccccceEEEEeeCCCC
Q psy4958 3 IIGTGRAYNISVQTVSEDE----ISTPTTAQYRTIPLR-----PLSFTYDKASITSNSLRVVWEPPKG 61 (65)
Q Consensus 3 ~L~PGR~Y~ItV~tvs~~e----~S~p~~~~~RT~Pl~-----p~~L~f~~~~vt~~S~rV~W~pp~G 61 (65)
+|.+=++|.|..|+..-.- .|.....++||-|-. +..+..... +..+++-++|++|+.
T Consensus 788 ~Lrh~tlY~i~l~aCnh~~~~~~cS~a~~v~~RT~~~~~aD~i~g~v~we~~-~~~~~v~l~w~EP~~ 854 (1025)
T KOG4258|consen 788 GLRHFTLYRIDLQACNHATPKCGCSHAAFVFARTMPTMGADDIPGPVTWECH-IEMNSVILRWLEPKE 854 (1025)
T ss_pred ccccchhhhhhHhhhcccccccccchhhhhhhccccccccccCCCceeEecc-cCcceEEEecCCCCC
Confidence 4566678888888765433 489999999998854 232333323 578999999999863
No 42
>PF15404 PH_4: Pleckstrin homology domain
Probab=24.91 E-value=89 Score=21.95 Aligned_cols=36 Identities=22% Similarity=0.302 Sum_probs=27.5
Q ss_pred eeccCCCc--eEEEEEEecCCccCcceeeeeecCCCCc
Q psy4958 2 LIIGTGRA--YNISVQTVSEDEISTPTTAQYRTIPLRP 37 (65)
Q Consensus 2 ~~L~PGR~--Y~ItV~tvs~~e~S~p~~~~~RT~Pl~p 37 (65)
..|.||.+ |++...+..|-.....-..++.|+|+.=
T Consensus 20 vvL~~G~Li~f~~~~Rs~tG~~~~~~~~~~~~tI~L~d 57 (185)
T PF15404_consen 20 VVLIPGFLILFQLFKRSRTGFAKPTVHYEHYLTIPLRD 57 (185)
T ss_pred EEEeCCEEEEEEEEeeccCCcccceEEeEeeeeeccCC
Confidence 47899985 7887778776666667777888888763
No 43
>cd03863 M14_CPD_II The second carboxypeptidase (CP)-like domain of Carboxypeptidase D (CPD; EC 3.4.17.22), domain II. CPD differs from all other metallocarboxypeptidases in that it contains multiple CP-like domains. CPD belongs to the N/E-like subfamily of the M14 family of metallocarboxypeptidases (MCPs).The M14 family are zinc-binding CPs which hydrolyze single, C-terminal amino acids from polypeptide chains, and have a recognition site for the free C-terminal carboxyl group, which is a key determinant of specificity. CPD is a single-chain protein containing a signal peptide, three tandem repeats of CP-like domains separated by short bridge regions, followed by a transmembrane domain, and a C-terminal cytosolic tail. The first two CP-like domains of CPD contain all of the essential active site and substrate-binding residues, while the third CP-like domain lacks critical residues necessary for enzymatic activity and is inactive towards standard CP substrates. Domain I is optimally ac
Probab=24.63 E-value=54 Score=24.99 Aligned_cols=15 Identities=13% Similarity=0.474 Sum_probs=12.6
Q ss_pred eccCCCceEEEEEEec
Q psy4958 3 IIGTGRAYNISVQTVS 18 (65)
Q Consensus 3 ~L~PGR~Y~ItV~tvs 18 (65)
.|.||+ |+|+|+.+.
T Consensus 337 ~l~pG~-ytl~vs~~G 351 (375)
T cd03863 337 LLVPGT-YKVTASARG 351 (375)
T ss_pred ccCCee-EEEEEEEcC
Confidence 478886 999999876
No 44
>cd00604 IPT_CGTD IPT domain (domain D) of cyclodextrin glycosyltransferase (CGTase) and similar enzymes. These enzymes are involved in the enzymatic hydrolysis of alpha-1,4 linkages of starch polymers and belong to the glycosyl hydrolase family 13. Most consist of three domains (A,B,C) but CGTase is more complex and has two additional domains (D,E). The function of the IPT/D domain is unknown.
Probab=24.43 E-value=1.2e+02 Score=18.09 Aligned_cols=22 Identities=41% Similarity=0.530 Sum_probs=13.7
Q ss_pred ccCCCceEEEEEEecCCccCccee
Q psy4958 4 IGTGRAYNISVQTVSEDEISTPTT 27 (65)
Q Consensus 4 L~PGR~Y~ItV~tvs~~e~S~p~~ 27 (65)
+.+| .|+|+|.+-. |..|....
T Consensus 55 ~~~g-~~~i~V~~~~-G~~Sn~~~ 76 (81)
T cd00604 55 VAPG-NYNISVTTVD-GVTSNGYN 76 (81)
T ss_pred CCCC-ceEEEEEECC-CcccCcEe
Confidence 3444 7888888554 66665543
No 45
>PF07198 DUF1410: Protein of unknown function (DUF1410); InterPro: IPR009849 This entry represents a conserved region approximately 100 residues long, multiple copies of which are sometimes found within hypothetical Ureaplasma parvum proteins of unknown function.
Probab=21.63 E-value=61 Score=18.18 Aligned_cols=17 Identities=6% Similarity=0.210 Sum_probs=13.3
Q ss_pred eccCCCceEEEEEEecC
Q psy4958 3 IIGTGRAYNISVQTVSE 19 (65)
Q Consensus 3 ~L~PGR~Y~ItV~tvs~ 19 (65)
.|.||+.|+|.=-+..+
T Consensus 44 ~L~~n~~Y~l~~I~~~n 60 (66)
T PF07198_consen 44 NLPPNHKYKLKKIVLNN 60 (66)
T ss_pred CCCCCCEEEEEEEEECC
Confidence 58999999997665553
No 46
>PF15206 FAM209: FAM209 family
Probab=21.27 E-value=81 Score=21.94 Aligned_cols=19 Identities=21% Similarity=0.176 Sum_probs=14.6
Q ss_pred cCCCceE-EEEEEecCCccC
Q psy4958 5 GTGRAYN-ISVQTVSEDEIS 23 (65)
Q Consensus 5 ~PGR~Y~-ItV~tvs~~e~S 23 (65)
.|.--|| ||+|-|.|+|-|
T Consensus 131 ~PaDp~nniTIYeiWgeeds 150 (150)
T PF15206_consen 131 VPADPQNNITIYEIWGEEDS 150 (150)
T ss_pred CCCCCCcceEEEEeccccCC
Confidence 4666777 999999977654
No 47
>PF07867 DUF1654: Protein of unknown function (DUF1654); InterPro: IPR012449 This entry is represented by Bacteriophage F116 (Pseudomonas phage F116), Orf28. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This family consists of proteins from the Pseudomonadaceae.
Probab=21.24 E-value=1.4e+02 Score=18.31 Aligned_cols=14 Identities=43% Similarity=0.788 Sum_probs=11.1
Q ss_pred cceEEEEeeCCCCC
Q psy4958 49 SNSLRVVWEPPKGF 62 (65)
Q Consensus 49 ~~S~rV~W~pp~G~ 62 (65)
+-+++|+|+.+.-|
T Consensus 60 DGsv~i~W~~~~~d 73 (73)
T PF07867_consen 60 DGSVRIRWERPKED 73 (73)
T ss_pred CCeEEEEEEcccCC
Confidence 46899999988654
No 48
>COG5294 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=21.02 E-value=84 Score=20.90 Aligned_cols=41 Identities=17% Similarity=0.058 Sum_probs=30.7
Q ss_pred ccCCCceEEEEEEecCCccCcceeeeeecCCCCceeeEEec
Q psy4958 4 IGTGRAYNISVQTVSEDEISTPTTAQYRTIPLRPLSFTYDK 44 (65)
Q Consensus 4 L~PGR~Y~ItV~tvs~~e~S~p~~~~~RT~Pl~p~~L~f~~ 44 (65)
+.||+.|+|+.+.=.|.+.+.--++..-.-+..=..|+.+.
T Consensus 50 ~~~~y~y~i~ayn~~Gkkk~v~f~a~~~lr~~aYLKlt~kg 90 (113)
T COG5294 50 GSPGYEYTITAYNKNGKKKEVKFTATHNLRKEAYLKLTHKG 90 (113)
T ss_pred CCccceeeehhhccCCcEEEEEEEecCcCCCccEEEEEecC
Confidence 46899999999999988888777776666665555555533
No 49
>KOG4806|consensus
Probab=20.44 E-value=2.6e+02 Score=22.50 Aligned_cols=59 Identities=17% Similarity=0.310 Sum_probs=42.2
Q ss_pred eccCCCceEEEEEEecCCccCcceeeee------ecCCCCceeeEEecCccccceEEEEeeCCCC
Q psy4958 3 IIGTGRAYNISVQTVSEDEISTPTTAQY------RTIPLRPLSFTYDKASITSNSLRVVWEPPKG 61 (65)
Q Consensus 3 ~L~PGR~Y~ItV~tvs~~e~S~p~~~~~------RT~Pl~p~~L~f~~~~vt~~S~rV~W~pp~G 61 (65)
++.||+-|-.-++-+-++|+-.-++.-+ |-+|.-|.+-+..+-+-+-+|+.|.|.....
T Consensus 290 vv~~~~~~~~rf~~~ndde~~~~v~V~aSt~~t~~~~P~LP~dTtVk~v~r~CSsAtIaW~gs~d 354 (454)
T KOG4806|consen 290 VVLPGERYLMRFEPSNDDEALQKVMVAASTEATFRDLPELPQDTTVKNVRRRCSSATIAWNGSPD 354 (454)
T ss_pred EecCCCceEEEEEecCCchhhheeEeeeecccccCcCCCCCCCceEeeeccccchheeeeccCcc
Confidence 5789999999999877777654444333 4456667776665555667899999986543
Done!