current user: public




Query: gi|60682477|ref|YP_212621.1|(removed signalp:1-23) hypothetical protein (BF3007) [Bacteroides fragilis NCTC 9343] (Range: 24-588), from B.fragilis

Results of FFAS03 search in PfamA30U
Master-slave alignment(slide right to see more) does not show gaps in the query sequence, use ali links to display alignment between query and templates.
    .   10    .   20    .   30    .   40    .   50    .   60    .   70    .   80    .   90    .  100    .  110    .  120    .  130    .  140    .  150    .  160    .  170    .  180    .  190    .  200    .  210    .  220    .  230    .  240    .  250    .  260    .  270    .  280    .  290    .  300    .  310    .  320    .  330    .  340    .  350    .  360    .  370    .  380    .  390    .  400    .  410    .  420    .  430    .  440    .  450    .  460    .  470    .  480    .  490    .  500    .  510    .  520    .  530    .  540    .  550    .  560    .
# Score Template Links and tools%idFirst CYSTPDGVMSSLSQAEKIMESRPDSAMAILQHIPTPETLHGKAQADYSLLMTQAMDKNYINFTSDSLIKFAVGYYGGHTEDLVAKGKSFYYYGRVMESLDKVEDAMTFYLKAKDVLQSSDQFKLLGLVTEKIGDLNRRQKLLDAALNDYKESFDFYASIPDSLCMLYAYRNLGRGFLYKNQIDSAYYYYDKALYILNLKKYSAVGSILLELGVIHRSEKDYVGAEQYFLSFIEKEKDPEKLFSGYLALGNLYLYMDRLKDAERYLLLCLGSSNLVIKRDACECLYDLEKELNNFKGAIGYKDIADSLRIITQDIDIQNSIATLQSRYNSEKWQRESLQSSIEKKNILLISSFVSFIAIMVIIYIYYKYRTNQKLVKDINERIRKNDADIKMYQRQILNYQDLQRETLQDYRNQIGELHGKMSVLEDQNKALSLRLTEKKHDIPESEADDLYAIYMQALHILIMLRGKNIENTSGKKLLLDADWDKLFHLSNAIHGDFITRIKNDFPTLTKHDIEICCLLRFGIEHEVLGSIFLTETDSVTKAKRRMKKRLNLSASDDLDVFLLKYLast
1 -31.900PF14938.4; B8ML21_TALSN/2-284; Soluble NSF attachment protein, SNAP  ali follow..  14  10...............................................................ADKALSSAFSFFGGRTEKLENAADLYVQAANAFRVQRLNKEAGMAFEKAAAVQKNNLNEPDDAANSLSEAFKVYRKTDPEDAVRVLAGAIQHYISKGNFRRAATHQQNLAECYEVIGNPKKALEEYEKAAEWYESDNAALANKYFLKVADLAAIEADYYKAIAHYEKVGKASIDKWSVKDYFFKAGICHLASGDLVATNRALESYRDIDPSFASTREHTLLVDLQGEPDTFADKLFQFDQLSKLDK------KTTLFLRIKENIE............................................................................................................................................................................................................................................. 283
2 -30.800PF17139.2; R7EHD4_9BACE/12-276; Domain of unknown function (DUF5112 topsan)  ali follow..  2................................................................DSLNQVAYAFRYK-----------------------NLDSSCHAASRAYREVSLYKQ---KAEASNNLGFCAFMRMDFEQAEKFHMDVYNL---TKNELELLIADIGLMKIYQRTALNKEFYDYRNSALHRMKREDDNLFVDQHEQMRLNYARSEFYIVSAVYYYYLQQR-PEAVASINEVTKKQELLADTNQLLYYHYIKGSAARKGYLYFEGNGVQGLANLMASPDNYAFFQDRLTRFGVPVDSLLPMRLGQLALQK.................................................................................................................................................................................................................................................. 263
3 -24.200PF08631.8; H2L969_ORYLA/170-426; Meiosis protein SPO22/ZIP4 like  ali follow..  12  1.....................................................SAIAQGNNQEAIVFIERCKDILLRLPNETAYLSVVCYNFGIEAYNLTKFEDSAFWLSQSYDIGKAGPGSEALAKVLRLLATCYLEWDCEKF----QEKALNTVSLANKECVTPFGLYLKIRILLRCRAPDSPVRAGLNEMLESKVS-----LDMCLNTVRLLMSENREVLAFEYLKTVCQHFESSPSLGTALVLHIELLLQRGKVLLSRQKIEDVLAPEALTSLHIILWDQASKHYEDDNYSEALQWYDYSLS................................................................................................................................................................................................................................................................... 257
4 -22.800PF13429.4; Q39U81_GEOMG/198-480; Tetratricopeptide repeat  ali follow..  11  8......TEEGILRASALTFGTDVKGAYAILDRG---QIIPASATAYWQSFDDLAWSLQ----------QAVELASRKLLESGKAREEDYQRLILLGRD-NDPKAAYRLSMDGWQRFRRSDFLKSALELGIAMKDY----RTTADMLTRQEREGNLKPQEQD----AYFWSLVSQVHRGTGATGESIRCYREALRRAPTD-----SELAAEYVWLLLDLDRREELRELLIAWKGRENTMPSL---YEPFGAALAYLGEYHKALPYFQARYSQKR--SDPAWLAAYADTMEQAGWVEAAFLERLNALALSRR............................................................................................................................................................................................................................................................... 283
5 -22.500PF12569.6; A9SDC6_PHYPA/187-705; NMDA receptor-regulated protein 1  ali follow..  18.....GHPEKALEELAKKEGKIVDKAEEIYRKL----VVNPDNYHYYEGLQKCLGLISDGKAYTSEQVEKLVKLYDD---EKYPRSAAAKRIPLDFLEEDAFKAAVSLYVRPFSGSSDIEAPSTYLWTLYLLAQHFDKRRQYEKAHSYINLAIEHTPTVID------LYLVKGRILKHAGDPVAAAALADEARSMDLAD-----RFLNSECVKRMLQADQVELAEKTAVLFTKDGDQHNNLFDMELASGDSHFRQGNLGKALKKYLSVEKHYN--DMVEDQFDFHTYCLRKMTLRAYIRMLRFQDHLHSHRFFFRAATSAIRCYIKLHDSPPKAAAEQHEAAVAGL----------------------ATERKKMRQKLRKAEAKAKKEAEEKAKEEEIAAAAAASKGGKKGTQTSRPVDTDPDGDKLMNVEDPLSEALKYLRLLQE...................................................................................................................... 487
6 -21.000PF04733.12; COPE2_ARATH/7-293; Coatomer epsilon subunit  ali follow..  15.....GAYQTAINNSEIANLSPENAVERDCLVFRSYIALGSYQLV-----------------------------ISEIDESAATPLQAVKLLAMYLSTPQNKESTISSLKEWLADSTIGNNDT----LRLIAGIIFMHEEDYNETLKHTHAGGT-----------MDLYALNVQIFIKMHRAEYAEKQLRVMQQI---DEDHTLTQLASAWLNLAVGGSKIQEAYLIFEDFSEKYPMT---CLILNGKAVCCMQMGNFDEAETLLLEALNKDA--KDPETLANLVVCSLHVGKSSSRH-----LSQLKLSHPEHILVKRVSSAEDNFE............................................................................................................................................................................................................................................. 280
7 -20.300PF13424.4; Q4DK03_TRYCC/687-760; Tetratricopeptide repeat  ali follow..  12  2.............................................................................................................................QRPLQNYAQLKCQLGDMLSALSLQETELEIAKEFGDTRATSRCMGAIALTKRFLGRYDEAAENYMEELSVYNA............................................................................................................................................................................................................................................................................................................................................................................... 74
8 -19.700PF12862.5; APC5_SCHPO/263-353; Anaphase-promoting complex subunit 5  ali follow..  11  1........................................................................................YYCKFLNSWFSGDYQQSVENLCRYFDHIMHSDEKVSYQYALLNLAMLQADFGCNEEALHAIEDTINTARESGDTACLNFALAWMFEFK..................................................................................................................................................................................................................................................................................................................................................................................................... 88
9 -19.500PF12968.5; Q8KAL8_CHLTE/2-144; Domain of Unknown Function (DUF3856 topsan)  ali follow..  9.....................................................................................AYLALSDAQRQLVAGEYDEAAANCRRAMEISHTMPPEEAFAFCHAGLAEALAGLRSFDEALHSADKALHYFNRRDEGKLWISAVYSRALALDGLGRGAEAMPEFKKVVEM.................................................................................................................................................................................................................................................................................................................................................................................. 129
10 -16.400PF12688.5; Q9RUW3_DEIRA/41-159; Tetratrico peptide repeat  ali follow..  10  1.....................................................................................SVALFERAGARDSVGREAEAVQLYEQALAAGLSGVRRR---RAVIQLASSLRNVGQVERGLALLEAEQQRGPDELSDALSAF----LALMLVDTGREREAVATVLTALAP---------------------HLPRYQRSVTNYARLL............................................................................................................................................................................................................................................................................................................................................. 119
11 -16.400PF10952.6; F0LVC6_VIBFN/6-145; Protein of unknown function (DUF2753 topsan)  ali follow..  4................................................................................................................................HTLLADIAMQQADHLRSILHYQQALTLSDRIGVAEESVISCHNMAGFWRTIGDDQYELKYLELASELIPQCGNPSCEAFVDSLGCCKKALVEFMKR..................................................................................................................................................................................................................................................................................................................................................... 112
12 -16.300PF09295.8; Q6CDA2_YARLI/40-455; ChAPs (Chs5p-Arf1p-binding proteins)  ali follow..  10  202.............................................................................................AFMRITRLYDRGLEIVRRLQTEYPE---------VVSLIARIQLLMDEEVEAVKTMVKGLEENGRDAD------LLSLQAEFLLSKNREDMALGCATRAVNSAPSE-----SKLWTELVQVYIRQKDWENALLTLNSCPMFTYHEHD---QHRMPAPAKSHYPLPLDGIL--ASQVDQTEEHAQIDPKLLKLPAPNLRATFAKVYEHLTAIVHEIGWDQLLEVRSRVFVMDEEYK............................................................................................................................................................................................................................................. 416
13 -16.300PF10579.7; Q8QGW4_DANRE/1-80; Rapsyn N-terminal myristoylation and linker region  ali follow..  10  2.................................................................................GQDQTKQQIEKGLKLYQSNDTEKALYVWMKVLR---KTSDPGGKFRVLGCLITAHSEMGKYKEMLQFSLAQINTAREMEDP................................................................................................................................................................................................................................................................................................................................................................................................................... 79
14 -15.200PF06552.10; TO203_ARATH/7-193; Plant specific mitochondrial import receptor subunit TOM20  ali follow..  11  1.............................................................FDRILLFEQIRQDAENTYKSNPLDADNLTRWGGVLLELSQIQEAITKFEEALLIDPKKDEVWCIGNAYTSFAFL-EAKHNFDLATQFFQQAVDELEMTAKAPQLHAEAYKQGLGSQPMGRVEAPAPPSSKAVKNKKSS-----DAKYDAMGWVILAIGVVA....................................................................................................................................................................................................................................................................................................................................................... 181
15 -14.600PF15015.4; SPT16_HUMAN/5-567; Spermatogenesis-associated, N-terminal  ali follow..  11  166...............................................................................PQIDKWLQVALKDASSCYRQKKYALAAGQFRTALELCSKEDIASVASFIETKLVTCYLRMRKPDLALNHAHRSIVL------NPAYFRNHLRQATVFRCLERYSEAARSAMIADYMIFTEHKTPFGLTREDTVRQMETMGKRILPILDFIRSTQLNGSASSGVMEKLQYASLLSQLQRVKEQSQVINQAMAELAQDISQQEAELLQSLMADAMDTLEGRRNNNER--VWNMIQKVQIEDFLYQLEDSFLKTKKLRTARRQKTKMKRL........................................................................................................................................................................................................................... 559
16 -14.500PF11207.6; F7YLW7_VIBA7/16-218; Protein of unknown function (DUF2989 topsan)  ali follow..  27....................................................................................................NINDGQCRVPRTDLIWHRFLKNPSDSNKIKEYGLVQA----YRKCLELASQIQAIDQTELKQRALVNTGKDLEQIEELQSSSSAETLYFKSAQRKFLQKPELDTAEMQYALATFYVQ-RDREKAIYLLHRTLELSPKGSINLDAIKSLASTNQILKQKEKAYIW............................................................................................................................................................................................................................................................................................................. 201
17 -14.400PF09986.7; Q898V4_CLOTE/16-227; Uncharacterized protein conserved in bacteria (DUF2225 topsan)  ali follow..  10  8CPVCDSKFHVKAVKSSSYRLNSKDSDFF--------LRYALINPYFYDVWLCNE-----------SALKVDFPNIRRHQKQLVLDKISPKWKRRYYDEVFNVDIAIERYKLALLNYTLAEKASQKAMTCLKLAWMYRLKNNNKSEETYLTNALKGFIEAIFGMDKYTIMYLIGELNRRLGDKDKALLWFSYVITTPNVSYK............................................................................................................................................................................................................................................................................................................................................................................ 200
18 -12.800PF13281.4; W4Y576_STRPU/79-449; Domain of unknown function (DUF4071 topsan)  ali follow..  10  68...................LCEEDSALPLLTDRLREEEVHISNRSNATEVLLNDIRKARIKLKGETLAKELGNLKARMDEQLLLSGDIVHQFLLSYREIQDYNSMVSLVEAIKQLKND--HVTDKPAILHLYAFALNRRKKPGDREKAVTVITKALESDDNQVPDMFCL--CGRIYKDIESRDNAIYWYRKGFEVQPNEAGINLATLLVLAGKEFTKCAELQKIGMTLNNLLGKKGSLVSLEDYWATFFEISVLAGDFSKASQAAECMFKKPPIWYLKSTLGNIQMICRVRPTSDEEPTIDQQLFNF.................................................................................................................................................................................................................................................................. 367
19 -12.700PF16918.3; V9XE69_9NOCA/412-767; Protein kinase G tetratricopeptide repeat  ali follow..  14  51.....DPNDPSAALLAAAVHSEPQQTLDSLRHARENGVGRVVFSKEITLAEVRAHLDLGQVDSAVEILTRLERESGDD-------WRMDWYAGIAELLQDDYEAAFTRFDKVLQALPGEIAPKIAAELTLQHWESDDPDAWRRFCEQSYRVVWR------TDHAVVSAAFGLARQLTARDEIRAAVDVLDEVPTTSRHHSAATMTAALILLRG--ISEADLREAAHRIASLPPDEGRALQMRALVLGTALEWVRSGRASSREYDRILDVPFTEKGLRLGTEAALRQLARNAPSRTHRYTLVDLANAIR................................................................................................................................................................................................................................................................. 353
20 -12.200PF13432.4; A1AQL4_PELPD/29-93; Tetratricopeptide repeat  ali follow..  11  1................................................................................................................................................................................................................SRLSDAYLKLNLVDDALHTARRGVEKFPA----VAGQRSLALACYAKGLMDESRQALEAVVAAV..................................................................................................................................................................................................................................................................................................... 61
21 -12.100PF09613.8; F5XVY8_RAMTT/1-126; Bacterial type III secretion protein (HrpB1_HrpK)  ali follow..  9...................................................................................................................................................................PLVDALAAVFYVGSDLEEDAQTLVVLRAIRRLRPHA-----PTLAVVEAQQLVENGDLQGARLLLEEADACSPGT---PVVKAMFALVLQQQRN-GLWQAYAQEARNLPPDPKALSILDYLDRIAR.................................................................................................................................................................................................................................................................................... 125
22 -11.600PF16811.3; D5CPB6_SIDLE/41-285; TRAP transporter T-component  ali follow..  9....EAGMPSYLLLADSMIEGDPQNEHMLLSGSKLYGAYAAREPERARRLARKARDCAHDVQLCKLYEQPYDKFAAAVANLKADDVPLLYGSATAWNAIATLPKVKAMMERVVTLDETYSH----GEAHLYLGVFATL-GKPEEGRVHFERAIQL-----SAGRDLMAKVEYARRYARTYDRELHDRLLHEVLDADATEPGLTLSNVLAQ-----------RQAKALLASADSY........................................................................................................................................................................................................................................................................................................................................... 244
23 -11.300PF11180.6; B4EB83_BURCJ/50-226; Protein of unknown function (DUF2968 topsan)  ali follow..  41....................................................................................................................................................................................................................................................................................................SYGASLMFDPREMTYYVALFQDKNLWRVIRSQEKNRAETVYANFVQQTAQLADI----------------------EIRRTELQAQKAFLERVIALQANRAQQLQADLSVARSQQAEVAQRQKSAQEQAQVLQVEKRAAQLQLRDLQEQVRQLERQT.................................................................................................................... 175
24 -10.400PF09976.7; Q12PT2_SHEDO/15-205; Tetratricopeptide repeat-like domain  ali follow..  13  32...................................KVASAELASESFQSIALDAENSPNVMMAAVEFE--------KNHDQQGYQALLQFMLAKSAVEAGDLEKAEQAYTKVIA---AKPAAGLDMIAALRLSRVQAEQGNLGMALATLEHV-------TDSAYAAQRDELKGDFYARQGEFDKAKNAYQAAVTLGGTLASPALQMKLDSLN................................................................................................................................................................................................................................................................................................................................................................. 190
25 -9.760PF10345.7; Q6CDL4_YARLI/63-642; Cohesin loading factor  ali follow..  104..........ECLNVRALCKTSRNAALLHCNKCLDKYQGMYDNYAYHTLQLLRF--DLTLDNSLPQAFDIFKSWETLPDSIPKSFLQLYGISKALDYNVGNYEETLLTLDHMRAFEEASECTETIPQVAMLRCMMTVLVALRRDCDIPVKEASKALETLSQCTQYCYYLLGLAALRHHTTL-KYGRELLKDCLKLLDRDQEDLRCNVHLYIALVNFSGLAYKKGHRSFERFVTLSKGLPEEASSCLVAALSSQTSGKWKKALKYYKRIESDDPLYPAALA--NICCINPSTANMDTLKQLINEMSPEHHGLKHA---FEVIRLVHDTSNMSSLDRQESMKKAQKALHL------QLSYATTLACVDMFGSTAEKNLELGNAFRTALSESDCVWAYVIGEKKKGYLHQLGQEEKLEQLNTSVANIGK........................................................................................................................................... 580
26 -9.540PF00196.17; NARP_HAEIN/147-203; Bacterial regulatory proteins, luxR family  ali follow..  14  2..........................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................SLTDREMGVLRQIATGLSNKQIAAQLFISEETVKVHIRNLLRKLNVHSRVAATVLF... 57
27 -9.500PF15254.4; G1T8F1_RABIT/39-892; Coiled-coil domain-containing protein 14  ali follow..  13  401.............................................................................................................................................................................................................................................................................................QLRILNQRLREQEKTKKTTGSVECNLELFSLQSLNMSLQSQLQESLKNQELLQSKNEELLKVI--------------------ENQKDENKKYTSIFKEKDQTILENKQQFDIEITKIKIELEEALVDVKSSKFKLDTAEKENQILGITLRQRDAEVARLRELTRT--LQNSMAKLLSDLSVDSARCKSGNNLTKSLLNIYDRQLQHDPAPAHTSIMSYLNKLETNH..................................................... 605
28 -9.480PF13371.4; Q1LJ97_CUPMC/200-272; Tetratricopeptide repeat  ali follow..  1...........................................................................................................................................................................LKAIYLQESRWQRLLAVQNRLVILLPDS-----IEEVRDRGLAYANLECFRPALEDLEAYVQARPDAADS.................................................................................................................................................................................................................................................................................................................................... 65
29 -9.410PF11817.6; A6QX66_AJECN/229-501; Foie gras liver health family 1  ali follow..  1............RMATNL----DMKSIFVVQPEPSQLDI-----------------SEFIKALFANLRGPSVEYYRDLSKHARRNVRYDFKLGVFAEFRQEMDAACRSYEAAYEGLFGEEVFEMIAGWNPRRQLAENIPAHSKLILGFLESSTEEFSKRQQNRIVEQLRLKMAKEHIRIGQWKDGLALIQPLWLQLSWRREGWWGLMEDFAWTLRECAIQAEDRETVLRVDWELMNH........................................................................................................................................................................................................................................................................................................................................ 273
30 -9.370PF03904.11; YP2A_STAAU/1-230; Domain of unknown function (DUF334 topsan)  ali follow..  10  14................................................................................................................................................................................................................................................................................................................................................................................KNNKENPNTQMNSKSTGTPSSSTQNSLNNEELSELKRQNKLIVKYLAEIQENQKIREKEQKEITSELKEATKDFRDKSLKIRNDFVDVLQEKLKHVDTEELKDILGRGIYKREENDRMLQEVKRSHEHYQTRQKYLFTGIGAMLLVFMLFALIMTIGSDFMSFLH-----VDTLQNAIAGKLKASEG.......... 196
31 -9.330PF04184.10; F7I5Z8_CALJA/4-540; ST7 protein  ali follow..  94........NNSSNNSNSSNGDSDSNRQSVSECKVWRNPLNLFRGAEYNRYTWVTGREPLTYYDMNLSAQDHQTFFTCDSDHLRPADAIMQKA----WRERNPQARISAAHEALEINECATAYILLA--EEEATTIAEAEKLFKQALKAGDGCYRRSQQLQDTNVLVYIKRRLAMCARRLGRTREAVKMMRDLMKEFPLLS---MFNIHENLLEALLELQAYADVQAVLAKYDD--ISLPKSATICYTAALLKARAVSDKFSPEAASRRGLSTAEMNAVEAIHRAVEFNPLKRGDSEAIAYAFFHLAHWKRVTFRMIPYPLEKGHLFYPYPICTEHEVSVYPKKELPFFILFTAGLCSFTAMLALL........................................................................................................................................................................................................ 493
32 -9.310PF06936.9; H0XGC8_OTOGA/1-189; Selenoprotein S (SelS)  ali follow..  37..................................................................................................................................................................................................................................................................................................................................................................IVFSCVLLYVVFQKFSTRLRALRQRQLDRAAAAVEILEPDVVVKRQEALAAARLKMQEELNAQVEKHKEKLKQLEEEKRRQKIEMWDSMQEGKSYKGNTKKPQEEDNSGPSSVIPKRKSDRKPLRGGGYNPL............................................................................... 168
33 -9.160PF15771.3; F7CHR8_HORSE/9-582; Domain of unknown function (DUF4700 topsan)  ali follow..  100..........................................................................................................................................................................................................................................................................................................QFEEKKKRAKDRCDSEALYNFISHTRESIYRLQTSVEKSEEHLSSRSQSI-------------------LETVAKTLQETAQAQSDLVLETAQDKGNMEQAILEMQKRFEARQAEFSEMKSNLKHLEVLVAQQNKDFQQLCEQLGQLNVPSILAELKRLISVPQVPRHVKDSTSQTSPPLAQS.................................................................................... 266
34 -9.100PF04156.12; Q50FQ1_CHLTH/23-216; IncA protein  ali follow..  44...............................................................................................................................................................................................................................................................................................................................................................TIVLLALFIISLAGNALYLQKTANLHLYQDLQREVGSLKEINFMLSVLQKEFLHLSKEFATTSKDLSAVSQDFYSCLQGFRDNYKGFESLLDEYKNSTEEM----------------------RKLFSQEIIADLKGSVASLREE-IRFLTPLAEEVRRL...................................................... 180

FFAS is supported by the NIH grant R01-GM087218-01
8 2 8 4 2   jobs submitted since Jan 1, 2011
Comments and questions to: webmaster

Selected papers from Godzik Lab
Ying Zhang, Ines Thiele, Dana Weekes, Zhanwen Li, Lukasz Jaroszewski, Krzysztof Ginalski, Ashley Deacon, John Wooley, Scott Lesley, Ian Wilson, Bernhard Palsson, Andrei Osterman, Adam Godzik. Three-Dimensional Structural View of the Central Metabolic Network of Thermotoga maritima. Science. 2009 Sep 18;325(5947):1544-9.

Alexey M. Eroshkin, Andrew LeBlanc, Dana Weekes, Kai Post, Zhanwen Li, Akhil Rajput, Sal T. Butera, Dennis R. Burton, Adam Godzik. bNAber: database of broadly neutralizing HIV antibodies. Nucl. Acids Res. 2013; published on November 7, 2013.

Li W, Jaroszewski L, Godzik A. Clustering of highly homologous sequences to reduce the size of large protein databases. Bioinformatics. 2001 Mar;17(3):282-3.