Query 013408
Match_columns 443
No_of_seqs 360 out of 2191
Neff 7.8
Searched_HMMs 46136
Date Fri Mar 29 03:33:21 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/013408.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/013408hhsearch_cdd -cpu 12 -v 0
 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 COG2335 Secreted and surface p 100.0 8.1E-29 1.8E-33  220.5  11.1  139    55-200    45-183 (187)
  2 COG2335 Secreted and surface p  99.9 5.9E-24 1.3E-28  189.4   6.7  132   281-429    48-185 (187)
  3 PF02469 Fasciclin: Fasciclin    99.9   5E-23 1.1E-27  177.6   9.4  124    68-198     1-128 (128)
  4 KOG1437 Fasciclin and related   99.9 1.6E-22 3.5E-27  212.5  12.4  335    91-440   129-521 (682)
  5 KOG1437 Fasciclin and related   99.9   2E-22 4.3E-27  211.9  12.2  259    58-431   374-647 (682)
  6 smart00554 FAS1 Four repeated   99.8   9E-21 1.9E-25  156.3   9.3   98    96-199      1-99 (99)
  7 smart00554 FAS1 Four repeated   99.8 6.6E-19 1.4E-23  145.2   7.1   93   321-426      1-99 (99)
  8 PF02469 Fasciclin: Fasciclin    99.7 3.4E-18 7.4E-23  147.3   5.0  107   308-425    14-128 (128)
  9 COG5443 FlbT Flagellar biosynt  19.2 1.6E+02  0.0036   25.3   3.8   33   157-194      3-35 (148)
 10 PF02680 DUF211: Uncharacteriz   14.3      81  0.0018   25.6   0.8   16   408-424     67-82 (95)
No 1
>COG2335 Secreted and surface protein containing fasciclin-like repeats [Cell envelope biogenesis, outer membrane]
Probab=99.96 E-value=8.1e-29 Score=220.46 Aligned_cols=139 Identities=25% Similarity=0.407 Sum_probs=129.7
Q ss_pred CCCccHHHHHhcCCChHHHHHHHHHcCChhHhhhccCCCCeEEEecCcHHHhcCCChHHHHhccCCCCHHHHHHHHhcce
Q 013408 55 QINSNSVLVALLDSHYTELSELVEKALLLQPLEDAVGKHSITIFAPKNEALERDLDPEFKRFLLEPGNLKSLQTLLLHHI 134 (443)
Q Consensus 55 ~~~~ni~~~~~~~~~~s~f~~~l~~agl~~~L~~~~~~~~~TvFAPtd~Af~~~l~~~~~~~l~~~~~~~~L~~lL~yHi 134 (443)
....+|++.+.++++|++|..+++.++|.++|+ +.|+||||||+|+||.+ ++......|..|+++.+|..+|.|||
T Consensus 45 ~~~~~iV~~a~~~~~f~tl~~a~~aa~Lv~~L~---~~gp~TVFaPtn~AFa~-lp~~T~~~Ll~pen~~~L~~iLtYHV 120 (187)
T COG2335 45 GNRADIVESAANNPSFTTLVAALKAAGLVDTLN---ETGPFTVFAPTNEAFAK-LPAGTLDALLKPENKPLLTKILTYHV 120 (187)
T ss_pred cchhHHHHHHccCcchHHHHHHHHhhhhHHHhc---CCCCeEEecCCHHHHHh-CChhHHHHHhCccchhhhheeeEEEE
Confidence 455799999999999999999999999999999 88999999999999999 89988888999999999999999999
Q ss_pred ecceeecccccCCCccccccCCCeEEEEEecCCcEEEcceEEEecCcEEcCCceEEEeCccccCCC
Q 013408 135 VSTRIELNRTATESTQHHTLSSDSVELTSHDSGDKFISQSKVIHPNAVDRPDGVIHGIERLLIPRS 200 (443)
Q Consensus 135 v~~~~~~~~l~~g~~~~~Tl~g~~l~v~~~~~g~v~vn~a~V~~~d~i~a~NGvIHvID~VL~P~~ 200 (443)
++|.+..+++..... .+|+.|..++|... +++++||.++++.+| |.++||+||+||+||+||.
T Consensus 121 v~Gk~~~~~l~~~~~-v~t~~G~~~~i~~~-~~~~~Vn~a~v~~~d-i~a~NgvIhvID~Vl~Pp~ 183 (187)
T COG2335 121 VEGKITAADLKSSGS-VKTVQGADLKIKVT-GGGVYVNDATVTIAD-INASNGVIHVIDKVLIPPM 183 (187)
T ss_pred EcCcccHHHhhcccc-ceeecCceEEEEEc-CCcEEEeeeEEEecc-EeccCcEEEEEeeeccCCC
Confidence 999999999986554 68999999999995 566999999999999 9999999999999999985
No 2
>COG2335 Secreted and surface protein containing fasciclin-like repeats [Cell envelope biogenesis, outer membrane]
Probab=99.89 E-value=5.9e-24 Score=189.42 Aligned_cols=132 Identities=25% Similarity=0.349 Sum_probs=106.1
Q ss_pred hhHHHHHHHcCCCchHHHHHHHhhhhhhhccccccCCceeEEEecChHHHhcCCccCC------CCCchHHHHhhhcccC
Q 013408 281 KDFIQTLLHYGGYNEMADILVNLTSLATEMGRLVSEGYVLTVLAPNDEAMAKLTTDQL------SEPGAPEQIIYYHVIP 354 (443)
Q Consensus 281 s~f~~~l~~~~~~~~~~~~l~~~~~~a~~~~~L~~~~~~~TVFAPtN~AF~~l~~~~l------~~~~~L~~iL~yHVv~ 354 (443)
..+++.....+.|+++.. +.+.+++.+.|.+.| +||||||||+||.+++...+ .|+.+|.++|.||||+
T Consensus 48 ~~iV~~a~~~~~f~tl~~----a~~aa~Lv~~L~~~g-p~TVFaPtn~AFa~lp~~T~~~Ll~pen~~~L~~iLtYHVv~ 122 (187)
T COG2335 48 ADIVESAANNPSFTTLVA----ALKAAGLVDTLNETG-PFTVFAPTNEAFAKLPAGTLDALLKPENKPLLTKILTYHVVE 122 (187)
T ss_pred hHHHHHHccCcchHHHHH----HHHhhhhHHHhcCCC-CeEEecCCHHHHHhCChhHHHHHhCccchhhhheeeEEEEEc
Confidence 455555555555555433 344567888898889 99999999999999998743 3889999999999999
Q ss_pred CccchHHhhHHhhhcCceeccccCCCceEEEEecCCeEEEccCCcceEEeCCceeecCCeEEEEeCccccCCCCC
Q 013408 355 EYQTEESMYNAVRRFGKISYDTLRLPHKVLAQEADGSVKFGHGDGSAYLFDPDIYTDGRISVQGIDGVLFPPEET 429 (443)
Q Consensus 355 ~~~~~~~l~~~~~~~g~~~~~Tl~~g~~l~~~~~~~~v~v~~~~~~a~V~~~di~~~nG~vIH~ID~VL~P~~~~ 429 (443)
|+...+.+.. .+ ...|++ |..+.+...++.+.||+ +.|+.+||.++|| |||+||+||+||...
T Consensus 123 Gk~~~~~l~~----~~--~v~t~~-G~~~~i~~~~~~~~Vn~----a~v~~~di~a~Ng-vIhvID~Vl~Pp~~~ 185 (187)
T COG2335 123 GKITAADLKS----SG--SVKTVQ-GADLKIKVTGGGVYVND----ATVTIADINASNG-VIHVIDKVLIPPMDL 185 (187)
T ss_pred CcccHHHhhc----cc--cceeec-CceEEEEEcCCcEEEee----eEEEeccEeccCc-EEEEEeeeccCCCcc
Confidence 9866655432 13 567888 99999988888899997 9999999999999 999999999999754
No 3
>PF02469 Fasciclin: Fasciclin domain; InterPro: IPR000782 The FAS1 (fasciclin-like) domain is an extracellular module of about 140 amino acid residues. It has been suggested that the FAS1 domain represents an ancient cell adhesion domain common to plants and animals; related FAS1 domains are also found in bacteria. The crystal structure of FAS1 domains 3 and 4 of fasciclin I from Drosophila melanogaster (Fruit fly) has been determined, revealing a novel domain fold consisting of a seven-stranded beta wedge and at least five alpha helices; two well-ordered N-acetylglucosamine groups attached to a conserved asparagine are located in the interface region between the two FAS1 domains. Fasciclin I is an insect neural cell adhesion molecule involved in axonal guidance that is attached to the membrane by a GPI anchor. FAS1 domains are present in many secreted and membrane-anchored proteins. These proteins are usually GPI anchored and consist of: (i) a single FAS1 domain, (ii) a tandem array of FAS1 domains, or (iii) FAS1 domain(s) interspersed with other domains. Proteins known to contain a FAS1 domain include: Fasciclin I (4 FAS1 domains). Human TGF-beta induced Ig-H3 (BIgH3) protein (4 FAS1 domains), where the FAS1 domains mediate cell adhesion through an interaction with alpha3/beta1 integrin; mutations in the FAS1 domains result in corneal dystrophy. Volvox major cell adhesion protein (2 FAS1 domains). Arabidopsis fasciclin-like arabinogalactan proteins (2 FAS1 domains). Mammalian stabilin protein, a family of fasciclin-like hyaluronan receptor homologues (7 FAS1 domains). Human extracellular matrix protein periostin (4 FAS1 domains). Bacterial immunogenic protein MPT70 (1 FAS1 domain). The FAS1 domains of both human periostin (Q15063 from SWISSPROT) and BIgH3 (Q15582 from SWISSPROT) proteins were found to contain vitamin K-dependent gamma-carboxyglutamate residues. Gamma-carboxyglutamate residues are more commonly associated with GLA domains (IPR000294 from INTERPRO), where they occur through post-translational modification catalysed by the vitamin K-dependent enzyme gamma-glutamylcarboxylase.; PDB: 1O70_A 1W7D_A 1W7E_A 1NYO_A 1X3B_A 2VXP_A.
Probab=99.88 E-value=5e-23 Score=177.58 Aligned_cols=124 Identities=28% Similarity=0.505 Sum_probs=102.0
Q ss_pred CChHHHHHHHHHcCChhHhhhccCCCCeEEEecCcHHHhcCCChHHHHhccCCCCHHHHHHHHhcceecceeecccccCC
Q 013408 68 SHYTELSELVEKALLLQPLEDAVGKHSITIFAPKNEALERDLDPEFKRFLLEPGNLKSLQTLLLHHIVSTRIELNRTATE 147 (443)
Q Consensus 68 ~~~s~f~~~l~~agl~~~L~~~~~~~~~TvFAPtd~Af~~~l~~~~~~~l~~~~~~~~L~~lL~yHiv~~~~~~~~l~~g 147 (443)
|+||.|.++|+++|+.+.|++ ..+.||||||+|+||++ ++......+.+ ..+.++++|+||++++.++.+++..+
T Consensus 1 ~~~s~f~~~l~~~~l~~~l~~--~~~~~TvfaP~d~a~~~-~~~~~~~~~~~--~~~~l~~~l~~hiv~~~~~~~~l~~~ 75 (128)
T PF02469_consen 1 PDLSTFSRLLEQAGLADLLND--SDGNYTVFAPTDDAFQK-LSQETNSSLAD--SKEQLKSLLKYHIVPGSITSSDLRNG 75 (128)
T ss_dssp -TTHHHHHHHHHTTCHHHHGC--SSSSEEEEEE-HHHHHH-SHHHHHHHHHT--HHHHHHHHHHHTEEES---HCHHHCH
T ss_pred CCHHHHHHHHHHcCCHHHHhc--CCCCEEEEEECHHHHHh-ccccccchhhh--hhhhHhhhhhhEEEcCceehhhhccc
Confidence 689999999999999999941 56899999999999999 44444444433 45679999999999999999999988
Q ss_pred -Cccccc-cCCCeEEEEEe-cCCcEEEcc-eEEEecCcEEcCCceEEEeCccccC
Q 013408 148 -STQHHT-LSSDSVELTSH-DSGDKFISQ-SKVIHPNAVDRPDGVIHGIERLLIP 198 (443)
Q Consensus 148 -~~~~~T-l~g~~l~v~~~-~~g~v~vn~-a~V~~~d~i~a~NGvIHvID~VL~P 198 (443)
.. ++| +.|+.+.++.. +++.++||+ ++|+..| +.++||+||+||+||+|
T Consensus 76 ~~~-~~t~~~g~~~~v~~~~~~~~~~v~~~a~i~~~~-~~~~nG~ih~id~vL~P 128 (128)
T PF02469_consen 76 KQT-LETLLNGQPLRVSSSPSNGTIYVNGKARIVKSD-IEASNGVIHIIDDVLIP 128 (128)
T ss_dssp HEE-EEBSSTTCEEEEEEEGGTTEEEECCEEEESEEE-EEESSEEEEEESS-TSS
T ss_pred ccc-ceeccCCCEEEEEEEecCCceEecCceEEEeCC-EEeCCEEEEEECceECc
Confidence 44 577 99999999986 478999999 9999999 99999999999999998
No 4
>KOG1437 consensus Fasciclin and related adhesion glycoproteins [Cell wall/membrane/envelope biogenesis; Extracellular structures]
Probab=99.88 E-value=1.6e-22 Score=212.54 Aligned_cols=335 Identities=36% Similarity=0.503 Sum_probs=218.7
Q ss_pred CCCCeEEEecCcHHHhcCCChHHHHhccCCCCHHHHHHHHhcceecce-eecccccCCCc---cccccCCCeEEEEEecC
Q 013408 91 GKHSITIFAPKNEALERDLDPEFKRFLLEPGNLKSLQTLLLHHIVSTR-IELNRTATEST---QHHTLSSDSVELTSHDS 166 (443)
Q Consensus 91 ~~~~~TvFAPtd~Af~~~l~~~~~~~l~~~~~~~~L~~lL~yHiv~~~-~~~~~l~~g~~---~~~Tl~g~~l~v~~~~~ 166 (443)
++++||+|+|.|+||.+-+++...+.+..+.+.+.+.+++.+|+++.+ ....++.++.. .+.++.+..+....
T Consensus 129 g~~~ftIFa~~neaw~~~~d~~v~~~le~~~n~d~~~~~l~~h~i~q~~v~~~~~~~~~~~p~~~~~l~~~~~~~~~--- 205 (682)
T KOG1437|consen 129 GNKDFTIFAPSNEAWTNNLDSRVKSFLESPYNLDSLNQLLRSHIINQRLVSSAQLPNKMIIPKMHRTLGNKELHYLN--- 205 (682)
T ss_pred cCCceEEecccccchhcCCChhhhhhccccchHHHHHHHHHhcccchhhccchhcccccccccccccCCCceEEeec---
Confidence 789999999999999986688888888889999899999999999999 77777776542 24556666554433
Q ss_pred CcEEEcceEEEecC------cEEcCCceEEEeCccccCCCcccccccccccccCcCCCCCCCCCCCccccccCCCCCC--
Q 013408 167 GDKFISQSKVIHPN------AVDRPDGVIHGIERLLIPRSVQQDFNNRRNLRSISAVRPEGAPEVDPRTNRLKKPTPA-- 238 (443)
Q Consensus 167 g~v~vn~a~V~~~d------~i~a~NGvIHvID~VL~P~~~~~~~~~~~sl~~~~~~~~~~~~~~~~~~~~l~~~~~~-- 238 (443)
+...+++++.+..| ++.-..|+||.|+.-+.|....+.+......++.....|..++..++++....+....
T Consensus 206 ~~~~~~~~r~~~~~~~t~~~Di~~~d~~I~~I~~~~~~~~~~~d~~~~~~~~~~t~~~~e~a~~~d~rt~~a~tn~a~~~ 285 (682)
T KOG1437|consen 206 GIVTVNYKRLVNNDVITTNYDLLRIDGVIHTIGRLIIPRIEQEDFLKYLSGASATVFLPELAPFVDPRTHLAPTNEAFFT 285 (682)
T ss_pred cccceeccccccccccccccccccCCCceEeeeeccchhhhhccchhcccccceeeeccccccccccccccccCcchhhc
Confidence 22244444443333 3555566777777766777666555554445555556677777777765444443221
Q ss_pred CCCCCCCccc-------------cccccCCCCCCCC----------CCCCCCCCCCccccCCcch-hhHHHH--HHHcCC
Q 013408 239 SKPGSSPALP-------------VYYAMAPGPSLAP----------APAPGPGGPHHHFNGEKQV-KDFIQT--LLHYGG 292 (443)
Q Consensus 239 ~~~~~~~~~~-------------~~~~~~~g~~~a~----------a~~P~~~~~~~~~~~~~~~-s~f~~~--l~~~~~ 292 (443)
...+.++.++ +.+.+..|++..+ +++|+.++.+.+..+.... ++++.. ++...+
T Consensus 286 ip~~~~~~~~~~~~v~~~~~~~~i~~~~~~~~s~~~~~~r~~~~~~~~a~g~~g~~~~~ng~~~I~kd~i~~~~~lh~id 365 (682)
T KOG1437|consen 286 IPRGYPPRILGYHLVLGNLKYNHILDNMKLGPSLAPGTVRLTGEGVAIAPGSSGERYHINGRAIIQKDFIHTNGLLHYID 365 (682)
T ss_pred ccccCCCcccccccchhhhhhhhhcccccccccccccceeeccccccccccCCCceEEeecceeEEEeeeccceEEEEcc
Confidence 2233444444 5666777877776 7888888877766665544 555554 555555
Q ss_pred CchHHHHHHHhhh---------------hhhhccccccCCceeEEEecChHHHhcCCccCCCCCchHHHHhhhcccCCcc
Q 013408 293 YNEMADILVNLTS---------------LATEMGRLVSEGYVLTVLAPNDEAMAKLTTDQLSEPGAPEQIIYYHVIPEYQ 357 (443)
Q Consensus 293 ~~~~~~~l~~~~~---------------~a~~~~~L~~~~~~~TVFAPtN~AF~~l~~~~l~~~~~L~~iL~yHVv~~~~ 357 (443)
+....+...+..+ ..++...|...+ .+|+|+|+|+||+.+..... +..+++||+||||+.+.
T Consensus 366 ~~l~p~~~~~l~~La~e~~~st~~rlv~elgll~~L~~n~-e~t~~lp~n~~fd~~~~~~~--r~l~~qIL~~HII~~~~ 442 (682)
T KOG1437|consen 366 YVLEPDSLKNLMSLAREDEISTSMRLVAELGLLTALAPND-EATLLLPTNNLFDDLTPLES--RRLAEQILYNHIIPEYL 442 (682)
T ss_pred cccCCchHHHHHHHHhcccccHHHHHHHhccceEEEcCCC-ceEEeeehhhhccCCChhhh--HHHHHHHHHHhCcchhh
Confidence 5544443222222 223444455566 59999999999998765432 13379999999999998
Q ss_pred chHHhhHHhhhcCceeccccCCCceEEE-E-e---cCCeEEEccCCcceEEeCCceeecCCeEEEEeCccccCCCCCCcC
Q 013408 358 TEESMYNAVRRFGKISYDTLRLPHKVLA-Q-E---ADGSVKFGHGDGSAYLFDPDIYTDGRISVQGIDGVLFPPEETSTN 432 (443)
Q Consensus 358 ~~~~l~~~~~~~g~~~~~Tl~~g~~l~~-~-~---~~~~v~v~~~~~~a~V~~~di~~~nG~vIH~ID~VL~P~~~~~~~ 432 (443)
+++.+|+ |...++|++ +..+.. . . +.+...+-.|+. +.|..+|+...|| +||.||+||.| ....+.
T Consensus 443 ~~~~~y~-----~~~~v~t~g-~~~l~~fv~r~~~s~~~t~i~~~~~-~~Ii~aDi~~~nG-vvH~id~vl~p-~~l~~~ 513 (682)
T KOG1437|consen 443 TSSSMYN-----GQTTVRTLG-KNKLLYFVYRHSVSANVTDILIGNE-ACIIEADISVKNG-VVHIIDRVLDP-VSLMED 513 (682)
T ss_pred hhhhhhc-----ccceeeccC-CeEEEEEEecccccccceeeeccce-eeEEecccceecC-ceEEeeEEcCc-ccHHHH
Confidence 8887776 443567776 544432 1 1 112111222234 8899999999999 99999999999 333344
Q ss_pred HHHHHHhh
Q 013408 433 YQKVKKMS 440 (443)
Q Consensus 433 ~~~~~~~~ 440 (443)
-.+.+.||
T Consensus 514 l~~d~r~s 521 (682)
T KOG1437|consen 514 LKTDGRIS 521 (682)
T ss_pred Hhhccchh
Confidence 44445555
No 5
>KOG1437 consensus Fasciclin and related adhesion glycoproteins [Cell wall/membrane/envelope biogenesis; Extracellular structures]
Probab=99.88 E-value=2e-22 Score=211.88 Aligned_cols=259 Identities=23% Similarity=0.296 Sum_probs=179.4
Q ss_pred ccHHHHHhcCCChHHHHHHHHHcCChhHhhhccCCCCeEEEecCcHHHhcCCChHHHHhccCCCCHHHHHHHHhcceecc
Q 013408 58 SNSVLVALLDSHYTELSELVEKALLLQPLEDAVGKHSITIFAPKNEALERDLDPEFKRFLLEPGNLKSLQTLLLHHIVST 137 (443)
Q Consensus 58 ~ni~~~~~~~~~~s~f~~~l~~agl~~~L~~~~~~~~~TvFAPtd~Af~~~l~~~~~~~l~~~~~~~~L~~lL~yHiv~~ 137 (443)
.++++++ ...+-+++.+++..-|+...|. .++.+|+|+|+|.||++ +.+...+. .++++|.||+++.
T Consensus 374 ~~l~~La-~e~~~st~~rlv~elgll~~L~---~n~e~t~~lp~n~~fd~-~~~~~~r~--------l~~qIL~~HII~~ 440 (682)
T KOG1437|consen 374 KNLMSLA-REDEISTSMRLVAELGLLTALA---PNDEATLLLPTNNLFDD-LTPLESRR--------LAEQILYNHIIPE 440 (682)
T ss_pred HHHHHHH-hcccccHHHHHHHhccceEEEc---CCCceEEeeehhhhccC-CChhhhHH--------HHHHHHHHhCcch
Confidence 4555544 4456799999999999999888 66779999999999999 55543332 3689999999999
Q ss_pred eeecccccCCCccccccCCCeEEEEEecC----C--cEEEc-ceEEEecCcEEcCCceEEEeCccccCCCcccccccccc
Q 013408 138 RIELNRTATESTQHHTLSSDSVELTSHDS----G--DKFIS-QSKVIHPNAVDRPDGVIHGIERLLIPRSVQQDFNNRRN 210 (443)
Q Consensus 138 ~~~~~~l~~g~~~~~Tl~g~~l~v~~~~~----g--~v~vn-~a~V~~~d~i~a~NGvIHvID~VL~P~~~~~~~~~~~s 210 (443)
+.+++++.++...++|++|..+..-+.++ + .+.++ .+.|+.+| +.+.||++|+||+|+.|.+ .+
T Consensus 441 ~~~~~~~y~~~~~v~t~g~~~l~~fv~r~~~s~~~t~i~~~~~~~Ii~aD-i~~~nGvvH~id~vl~p~~---l~----- 511 (682)
T KOG1437|consen 441 YLTSSSMYNGQTTVRTLGKNKLLYFVYRHSVSANVTDILIGNEACIIEAD-ISVKNGVVHIIDRVLDPVS---LM----- 511 (682)
T ss_pred hhhhhhhhcccceeeccCCeEEEEEEecccccccceeeeccceeeEEecc-cceecCceEEeeEEcCccc---HH-----
Confidence 99999999888667999998876654321 1 23343 36788898 9999999999999998831 11
Q ss_pred cccCcCCCCCCCCCCCccccccCCCCCCCCCCCCCccccccccCCCCCCCCCCCCCCCCCCccccCCcchhhHHHHHHHc
Q 013408 211 LRSISAVRPEGAPEVDPRTNRLKKPTPASKPGSSPALPVYYAMAPGPSLAPAPAPGPGGPHHHFNGEKQVKDFIQTLLHY 290 (443)
Q Consensus 211 l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~g~~~a~a~~P~~~~~~~~~~~~~~~s~f~~~l~~~ 290 (443)
+.+..+.+++.+++.
T Consensus 512 -------------------------------------------------------------~~l~~d~r~s~~~~~---- 526 (682)
T KOG1437|consen 512 -------------------------------------------------------------EDLKTDGRISGTVQG---- 526 (682)
T ss_pred -------------------------------------------------------------HHHhhccchhhhHHh----
Confidence 122235556666552
Q ss_pred CCCchHHHHHHHhhhhhhhccccccCCceeEEEecChHHHhcCCccCC--CCCchHHHHhhhcccCCccc-hHHhhHHhh
Q 013408 291 GGYNEMADILVNLTSLATEMGRLVSEGYVLTVLAPNDEAMAKLTTDQL--SEPGAPEQIIYYHVIPEYQT-EESMYNAVR 367 (443)
Q Consensus 291 ~~~~~~~~~l~~~~~~a~~~~~L~~~~~~~TVFAPtN~AF~~l~~~~l--~~~~~L~~iL~yHVv~~~~~-~~~l~~~~~ 367 (443)
++..++.++|..++ .||+|||||+||.+.+.+.. .+...|+++++||++++... +..
T Consensus 527 -------------le~~~l~e~l~~~~-~~t~fvPt~ka~~~~~~~~~~~~~~~~l~~~l~yH~v~~~~~ls~~------ 586 (682)
T KOG1437|consen 527 -------------LEGVLLPEELTPEG-NYTLFVPTNKAWQKSTKDEKSLFHKKALQDFLKYHLVPGQSRLSLG------ 586 (682)
T ss_pred -------------hhhcCChhhhccCC-ceEEEeecccccccCCcchhhcchHHHHHHHHHhccccceeeeecc------
Confidence 12234445565566 89999999999999877753 57789999999999997531 110
Q ss_pred hcCceeccccCCCceEEEEecCCeEEEc----cCC-cceEEeCCceeecCCeEEEEeCccccCCCCCCc
Q 013408 368 RFGKISYDTLRLPHKVLAQEADGSVKFG----HGD-GSAYLFDPDIYTDGRISVQGIDGVLFPPEETST 431 (443)
Q Consensus 368 ~~g~~~~~Tl~~g~~l~~~~~~~~v~v~----~~~-~~a~V~~~di~~~nG~vIH~ID~VL~P~~~~~~ 431 (443)
++ .+.. .+ +.....++.+.+. .+. +..+++..||...|| ++|+||.||.|+.....
T Consensus 587 --~~-~~v~---~~-~k~s~~~~~~~~~~~~~~~~vn~e~~~~~~i~~~n~-~~h~i~~vl~p~~l~~~ 647 (682)
T KOG1437|consen 587 --SS-PYVM---IQ-VKLSLRGDHLFFSLVNPRGDVNKERLVGIDIMGTNG-VVHVIDLVLKPPDLPFL 647 (682)
T ss_pred --cc-ccee---ee-eeEEEecccEEeeeeccccceeeeeeeccceeeecc-eeEEEEEEcccCcchhh
Confidence 11 0111 11 2222222222221 111 336778889999999 99999999999865433
No 6
>smart00554 FAS1 Four repeated domains in the Fasciclin I family of proteins, present in many other contexts.
Probab=99.84 E-value=9e-21 Score=156.35 Aligned_cols=98 Identities=30% Similarity=0.532 Sum_probs=83.8
Q ss_pred EEEecCcHHHhcCCChHHHHhccCCCCHHHHHHHHhcceecceeecccccCCCccccccCCCeEEEEEecC-CcEEEcce
Q 013408 96 TIFAPKNEALERDLDPEFKRFLLEPGNLKSLQTLLLHHIVSTRIELNRTATESTQHHTLSSDSVELTSHDS-GDKFISQS 174 (443)
Q Consensus 96 TvFAPtd~Af~~~l~~~~~~~l~~~~~~~~L~~lL~yHiv~~~~~~~~l~~g~~~~~Tl~g~~l~v~~~~~-g~v~vn~a 174 (443)
|+|||+|+||++ +.....+.+..+ . .|+++|+||++++.++.++|..+.. ++|+.|..++++...+ +.+++|++
T Consensus 1 TvfaP~d~Af~~-~~~~~~~~l~~~--~-~l~~ll~~Hiv~~~~~~~~l~~~~~-~~Tl~g~~l~v~~~~~~~~i~in~~ 75 (99)
T smart00554 1 TVFAPTDEAFQK-LPPGTLNSLLAD--P-KLKNLLLYHVVPGRLSSADLLNGGT-LPTLAGSKLRVTRSGDSGTVTVNGA 75 (99)
T ss_pred CEeCcCHHHHHh-cCHHHHHHHhCC--H-HHHHHHHhcEeCceEcHHHhccCCc-cccCCCCEEEEEEeCCCCeEEEcce
Confidence 899999999999 555443334332 1 7899999999999999999988655 6999999999998644 78999999
Q ss_pred EEEecCcEEcCCceEEEeCccccCC
Q 013408 175 KVIHPNAVDRPDGVIHGIERLLIPR 199 (443)
Q Consensus 175 ~V~~~d~i~a~NGvIHvID~VL~P~ 199 (443)
+|+.+| +.++||+||+||+||+|+
T Consensus 76 ~v~~~d-i~~~nGvih~Id~vL~P~ 99 (99)
T smart00554 76 RIVEAD-IAATNGVVHVIDRVLLPP 99 (99)
T ss_pred EEEECC-EecCCeEEEEECceeCCC
Confidence 999999 999999999999999986
No 7
>smart00554 FAS1 Four repeated domains in the Fasciclin I family of proteins, present in many other contexts.
Probab=99.77 E-value=6.6e-19 Score=145.15 Aligned_cols=93 Identities=34% Similarity=0.569 Sum_probs=78.0
Q ss_pred EEEecChHHHhcCCcc---CC-CCCchHHHHhhhcccCCccchHHhhHHhhhcCceeccccCCCceEEEEecC--CeEEE
Q 013408 321 TVLAPNDEAMAKLTTD---QL-SEPGAPEQIIYYHVIPEYQTEESMYNAVRRFGKISYDTLRLPHKVLAQEAD--GSVKF 394 (443)
Q Consensus 321 TVFAPtN~AF~~l~~~---~l-~~~~~L~~iL~yHVv~~~~~~~~l~~~~~~~g~~~~~Tl~~g~~l~~~~~~--~~v~v 394 (443)
|||||+|+||++++.+ .+ .++ .++++|+|||++++.+...+.+ + ..++|+. |..+.+...+ +.+.+
T Consensus 1 TvfaP~d~Af~~~~~~~~~~l~~~~-~l~~ll~~Hiv~~~~~~~~l~~-----~-~~~~Tl~-g~~l~v~~~~~~~~i~i 72 (99)
T smart00554 1 TVFAPTDEAFQKLPPGTLNSLLADP-KLKNLLLYHVVPGRLSSADLLN-----G-GTLPTLA-GSKLRVTRSGDSGTVTV 72 (99)
T ss_pred CEeCcCHHHHHhcCHHHHHHHhCCH-HHHHHHHhcEeCceEcHHHhcc-----C-CccccCC-CCEEEEEEeCCCCeEEE
Confidence 8999999999999765 23 333 8999999999999876665533 3 2688998 9999987766 78889
Q ss_pred ccCCcceEEeCCceeecCCeEEEEeCccccCC
Q 013408 395 GHGDGSAYLFDPDIYTDGRISVQGIDGVLFPP 426 (443)
Q Consensus 395 ~~~~~~a~V~~~di~~~nG~vIH~ID~VL~P~ 426 (443)
++ ++|+.+|+.++|| +||+||+||+|+
T Consensus 73 n~----~~v~~~di~~~nG-vih~Id~vL~P~ 99 (99)
T smart00554 73 NG----ARIVEADIAATNG-VVHVIDRVLLPP 99 (99)
T ss_pred cc----eEEEECCEecCCe-EEEEECceeCCC
Confidence 87 8999999999999 999999999986
No 8
>PF02469 Fasciclin: Fasciclin domain; InterPro: IPR000782 The FAS1 (fasciclin-like) domain is an extracellular module of about 140 amino acid residues. It has been suggested that the FAS1 domain represents an ancient cell adhesion domain common to plants and animals; related FAS1 domains are also found in bacteria. The crystal structure of FAS1 domains 3 and 4 of fasciclin I from Drosophila melanogaster (Fruit fly) has been determined, revealing a novel domain fold consisting of a seven-stranded beta wedge and at least five alpha helices; two well-ordered N-acetylglucosamine groups attached to a conserved asparagine are located in the interface region between the two FAS1 domains. Fasciclin I is an insect neural cell adhesion molecule involved in axonal guidance that is attached to the membrane by a GPI anchor. FAS1 domains are present in many secreted and membrane-anchored proteins. These proteins are usually GPI anchored and consist of: (i) a single FAS1 domain, (ii) a tandem array of FAS1 domains, or (iii) FAS1 domain(s) interspersed with other domains. Proteins known to contain a FAS1 domain include: Fasciclin I (4 FAS1 domains). Human TGF-beta induced Ig-H3 (BIgH3) protein (4 FAS1 domains), where the FAS1 domains mediate cell adhesion through an interaction with alpha3/beta1 integrin; mutations in the FAS1 domains result in corneal dystrophy. Volvox major cell adhesion protein (2 FAS1 domains). Arabidopsis fasciclin-like arabinogalactan proteins (2 FAS1 domains). Mammalian stabilin protein, a family of fasciclin-like hyaluronan receptor homologues (7 FAS1 domains). Human extracellular matrix protein periostin (4 FAS1 domains). Bacterial immunogenic protein MPT70 (1 FAS1 domain). The FAS1 domains of both human periostin (Q15063 from SWISSPROT) and BIgH3 (Q15582 from SWISSPROT) proteins were found to contain vitamin K-dependent gamma-carboxyglutamate residues. Gamma-carboxyglutamate residues are more commonly associated with GLA domains (IPR000294 from INTERPRO), where they occur through post-translational modification catalysed by the vitamin K-dependent enzyme gamma-glutamylcarboxylase.; PDB: 1O70_A 1W7D_A 1W7E_A 1NYO_A 1X3B_A 2VXP_A.
Probab=99.72 E-value=3.4e-18 Score=147.32 Aligned_cols=107 Identities=28% Similarity=0.455 Sum_probs=79.3
Q ss_pred hhcccc-ccCCceeEEEecChHHHhcCCccC---C-CCCchHHHHhhhcccCCccchHHhhHHhhhcCceeccc-cCCCc
Q 013408 308 TEMGRL-VSEGYVLTVLAPNDEAMAKLTTDQ---L-SEPGAPEQIIYYHVIPEYQTEESMYNAVRRFGKISYDT-LRLPH 381 (443)
Q Consensus 308 ~~~~~L-~~~~~~~TVFAPtN~AF~~l~~~~---l-~~~~~L~~iL~yHVv~~~~~~~~l~~~~~~~g~~~~~T-l~~g~ 381 (443)
++.+.| ...+ .||||||+|+||++++.+. + ++++.++++|+|||+++......+.. +...++| +. |.
T Consensus 14 ~l~~~l~~~~~-~~TvfaP~d~a~~~~~~~~~~~~~~~~~~l~~~l~~hiv~~~~~~~~l~~-----~~~~~~t~~~-g~ 86 (128)
T PF02469_consen 14 GLADLLNDSDG-NYTVFAPTDDAFQKLSQETNSSLADSKEQLKSLLKYHIVPGSITSSDLRN-----GKQTLETLLN-GQ 86 (128)
T ss_dssp TCHHHHGCSSS-SEEEEEE-HHHHHHSHHHHHHHHHTHHHHHHHHHHHTEEES---HCHHHC-----HHEEEEBSST-TC
T ss_pred CCHHHHhcCCC-CEEEEEECHHHHHhccccccchhhhhhhhHhhhhhhEEEcCceehhhhcc-----ccccceeccC-CC
Confidence 444556 3345 8999999999999885442 2 35688999999999999765554433 2135777 55 88
Q ss_pred eEEEEec--CCeEEEccCCcceEEeCCceeecCCeEEEEeCccccC
Q 013408 382 KVLAQEA--DGSVKFGHGDGSAYLFDPDIYTDGRISVQGIDGVLFP 425 (443)
Q Consensus 382 ~l~~~~~--~~~v~v~~~~~~a~V~~~di~~~nG~vIH~ID~VL~P 425 (443)
.+.+... ++.+.|++ .++|+..|+.++|| +||+||+||+|
T Consensus 87 ~~~v~~~~~~~~~~v~~---~a~i~~~~~~~~nG-~ih~id~vL~P 128 (128)
T PF02469_consen 87 PLRVSSSPSNGTIYVNG---KARIVKSDIEASNG-VIHIIDDVLIP 128 (128)
T ss_dssp EEEEEEEGGTTEEEECC---EEEESEEEEEESSE-EEEEESS-TSS
T ss_pred EEEEEEEecCCceEecC---ceEEEeCCEEeCCE-EEEEECceECc
Confidence 8888665 78899975 58999999999999 99999999998
No 9
>COG5443 FlbT Flagellar biosynthesis regulator FlbT [Cell motility and secretion]
Probab=19.15 E-value=1.6e+02 Score=25.27 Aligned_cols=33 Identities=6% Similarity=0.164 Sum_probs=22.2
Q ss_pred CeEEEEEecCCcEEEcceEEEecCcEEcCCceEEEeCc
Q 013408 157 DSVELTSHDSGDKFISQSKVIHPNAVDRPDGVIHGIER 194 (443)
Q Consensus 157 ~~l~v~~~~~g~v~vn~a~V~~~d~i~a~NGvIHvID~ 194 (443)
.+++++...+..+++|||.+ +.| .-+++-..++
T Consensus 3 ~tlriSLk~gEki~iNGAVl-r~D----Rkv~lellNd 35 (148)
T COG5443 3 STLRISLKPGEKIFINGAVL-RVD----RKVALELLND 35 (148)
T ss_pred CceEEeecCCCEEEEeccEE-EEe----ceeEEEeecc
Confidence 46788887788999999844 455 3455555444
No 10
>PF02680 DUF211: Uncharacterized ArCR, COG1888; InterPro: IPR003831 This entry describes proteins of unknown function.; PDB: 3BPD_I 2RAQ_F 2X3D_E.
Probab=14.30 E-value=81 Score=25.60 Aligned_cols=16 Identities=31% Similarity=0.362 Sum_probs=11.7
Q ss_pred eeecCCeEEEEeCcccc
Q 013408 408 IYTDGRISVQGIDGVLF 424 (443)
Q Consensus 408 i~~~nG~vIH~ID~VL~ 424 (443)
|.--|| +||-||.|-.
T Consensus 67 Ie~~Gg-~IHSIDeVva 82 (95)
T PF02680_consen 67 IEELGG-VIHSIDEVVA 82 (95)
T ss_dssp HHHTT--EEEEEEEEEE
T ss_pred HHHcCC-eEEeeeeeee
Confidence 445678 9999999863
Done!