current user: public

anford Burnham Prebys Medical Discovery Institute
We are hiring bioinformatics experts and software developers. Join us!

Query: IDP00010 uroporphyrin-III C-methyltransferase, putative BA_2144 [Bacillus anthracis str. Ames], from CSGID

Results of FFAS03 search in PDB0516
Master-slave alignment(slide right to see more) does not show gaps in the query sequence, use ali links to display alignment between query and templates.
    .   10    .   20    .   30    .   40    .   50    .   60    .   70    .   80    .   90    .  100    .  110    .  120    .  130    .  140    .  150    .  160    .  170    .  180    .  190    .  200    .  210    .  220    .  230    .  240    .  250    .  260    .  270    .  280    .  290    .  300    .  310    .  320    .  330    .  340    .  350    .  360    .  370    .  380    .  390    .  400    .  410    .  420    .  430    .  440    .  450    .  460    .  470
# Score Template Links and tools%idFirst MNGYVYLVGAGPGDEGLITKKAIECLKRADIVLYDRLLNPFFLSYTKQTCELMYCGKMPKNHIMRQEMINAHLLQFAQEGKIVVRLKGGDPSIFGRVGEEAETLAAANIPYEIVPGITSSTAASSYAGIPLTHRNYSNSVTLLTGHAKGSLTDHGKYNSSHNSDTIAYYMGIKNLPTICKNLRQAGKKEDTPVAVIEWGTTGKQRVVTGTLSTIVSIVKNENISNPSMTIVGDVVSLRDQIAWKEHKPLHGKKVLFTSATNKTSVMKQKLQEAGAEIYQIPTFKKEEYTLTLEQIIEIFNVNRLVFCSAESVEILMQSCSKYKKDIRSLQAELQHMNVATQEKLMQYGLLSKEAKFSSDTTVYLGRNINRIAFIQEKIGAGSYMMTHKYTIDHRFDEVHSRMLSEFLWDSIVFEGRASIDTFLAEVKRLGFLDIVTLPFSYTDVPTLRYANKVGFHNIDHQLQETLMQKDMVRQLast
1 -92.7001s4d_A mol:protein length:280 Uroporphyrin-III C-methyltransferase  ali model follow..  37  14.KGSVWLVGAGPGDPGLLTLHAANALRQADVIVHDALVNEDCLKLARPGAVLEFAGKRGGKPSPKQRDISLRLVELARAGNRVLRLKGGDPFVFGRGGEEALTLVEHQVPFRIVPGITAGIGGLAYAGIPVTHREVNHAVTFLTGHDSSGLDRINWQGIASGSPVIVMYMAMKHIGAITANLIAGGRSPDEPVAFVCNAATPQQAVLETTLARAEADVAAAGLEPPAIVVVGEVVRLRAALDWIGALDPFANRILRNPA....................................................................................................................................................................................................................... 280
2 -90.9002ybo_A mol:protein length:294 METHYLTRANSFERASE  ali model follow..  36  24.AGSVALVGAGPGDPGLLTLRAWALLQQAEVVVYDRLVARELIALLPESCQRIYVGKRCGHHSLPQEEINELLVRLARQQRRVVRLKGGDPFIFGRGAEELERLLEAGVDCQVVPGVTAASGCSTYAGIPLTHRDLAQSCTFVTGHLQNDGRDLDWAGLARGKQTLVFYMGLGNLAEIAARLVEHGLASDTPAALVSQGTQAGQQVTRGALAELPALARRYQLKPPTLIVVGQVVALFAERAMAHP-SYLGAGSPVSREAVACAL................................................................................................................................................................................................................. 287
3 -90.0002ybq_A mol:protein length:292 METHYLTRANSFERASE  ali model follow..  36  22.AGSVALVGAGPGDPGLLTLRAWALLQQAEVVVYDRLVARELIALLPESCQRIYVGKRCGHHSLPQEEINELLVRLARQQRRVVRLKGGDPFIFGRGAEELERLLEAGVDCQVVPGVTAASGCSTYAGIPLTHRDLAQSCTFVTGHLQNDGRDLDWAGLARGKQTLVFYMGLGNLAEIAARLVEHGLASDTPAALVSQGTQAGQQVTRGALAELPALARRYQLKPPTLIVVGQVVALFAERAMAHP-SYLGAGSPVSREAVACAL................................................................................................................................................................................................................. 285
4 -87.2001ve2_A mol:protein length:235 Uroporphyrin-III C-methyltransferase  ali model follow..  38  1MRGKVYLVGAGFGGPEHLTLKALRVLEVAEVVLHDRLVHPGVLALAK--GELVPVGKEGYGGKTPQEAITARLIALAREGRVVARLKGGDPMVFGRGGEEALALRRAGIPFEVVPGVTSAVGALSALGLPLTHRGLARSFAVATGH--------DPALPLPRADTLVLLMPLHTLGGLKERLLER-FPPETPLALLARVGWPGEAVRLGRVEDLPGLGE--GLPSPALLVVGKVVGLYGELLPKDHG................................................................................................................................................................................................................................... 234
5 -80.2001v9a_A mol:protein length:239 Uroporphyrin-III C-methyltransferase  ali model follow..  36  1..GRVYLVGAGPGDPELLTLKAYRLLKEAPVVLYDRLVDERVLALAP--GEKVYVGKEEG-ESEKQEEIHRLLLRHARAHPFVVRLKGGDPMVFGRGGEEVLFLLRHGVPVEVVPGVTSLL----ASGLPLTHRGLAHGFAAVSGVLEGGG-YPDLRPFARV-PTLVVLMGVGRRVWIAKELLRLGRDPREPTLFVERASTPKERRVHARLEEVAEGKVE--VRPPALWILGEVVRVFAEKEPVDALALGG............................................................................................................................................................................................................................... 239
6 -61.7001pjq_A mol:protein length:457 Siroheme synthase  ali model follow..  46  215.RGEVVLVGAGPGDAGLLTLKGLQQIQQADIVVYDRLVSDDIMNLVRRDADRVFVGKRAGYHCVPQEEINQILLREAQKGKRVVRLKGGDPFIFGRGGEELETLCHAGIPFSVVPGITAASGCSAYSGIPLTHRDYAQSVRLVTGHLKTGG-ELDWENLAAEKQTLVFYMGLNQAATIQEKLIAFGMQADMPVALVENGTSVKQRVVHGVLTQLGELAQQ--VESPALIIVGRVVALRDKLNWFSNH................................................................................................................................................................................................................................... 457
7 -52.8001vhv_A mol:protein length:268 diphthine synthase  ali model follow..  16  6.MSLLTFVGLGLWDVKDISVKGLEAVREADEVYVEYYTSKLLSSI---EEMEEFFGKRVVELERSDLEENSFRLIERAKSKSVVLLVPGDPMVATTHSAIKLEAERKGVKTRIIHGASISTAVCGLTGLHNYRFGKSATVSWHRSQTPVNVI---KANRSIDAHTLLFLEPMTIGHAVENLIAEDAQMKDLYAVGIARAGSGEEVVKCDRLENLKKI--DFGKPLHVMVVLAKTLHFMEFLREFADAPAELERLVAEGGS...................................................................................................................................................................................................................... 262
8 -51.0001vce_A mol:protein length:265 diphthine synthase  ali model follow..  15  2...VLYFIGLGLYDERDITVKGLEIAKKCDYVFAEFYTSL-MAGTTLGRIQKLIGKEIRVLSREDVELNFENIVLPLAKENDVAFLTPGDPLVATTHAELRIRAKRAGVESYVIHAP-SIYSAVGITGLHIYKFGKSATVAYPEGNWFPTSYDVIKENAERGLHTLLFLEAMELLLKVEDMKKGGVFTDDTLVVVLARAGSLNPTIRAGYVKDLIR--EDFGDPPHILIVPGKLHIVAEYLVEIAGAPREILRVNV.......................................................................................................................................................................................................................... 265
9 -49.0003i4t_A mol:protein length:292 diphthine synthase  ali model follow..  17  14.GSMLYIIGLGLYDEKDITVRGLEAVKSCDLVFLEHYTAILQCDVAKLEE---FYGKKVIIGDRDLVETEADQILEPAKTKNVALLVVGDVYGATTHSDIFVRCQKMGIEVKVIHNASIMNAI-GCSGLQLYRFGQTVSVCFWSEHERSEESIIKGRDIFEPPRYMTINQCIEQLLEVEKEQHLGVYDEDTMVVGMARVACADQKIVYGKMKDLLH--YDFGAPMHCLLIPAPQVDDLDQLEYFKYKP.................................................................................................................................................................................................................................. 285
10 -47.4004e16_A mol:protein length:253 precorrin-4 C(11)-methyltransferase  ali model follow..  31  3AMNKVHFVGAGPGDKELITLKGYKLLSNADVVIYGSLVNPELLEYCKEDCQIH------NSAHMDLQEIIDVMREGIENNKSVVRLQTGDFSIYGSIREQVEDLNKLNIDYDCTPGVSSFLGAASSLGVEYTVPEISQSVIITRMEGRTPVPEKE-QSYAKHQTSMVIFLSVQEIEKVVSKLLEGGYPKDTPIAVIYKATWADEKIVKGTLSDIAVKVKENNINKTALIMVGRFLG.............................................................................................................................................................................................................................................. 234
11 -46.1001cbf_A mol:protein length:285 COBALT-PRECORRIN-4 TRANSMETHYLASE  ali model follow..  29  14.HMKLYIIGAGPGDPDLITVKGLKLLQQADVVLYDSLVSQDLIAKSKPGAEVL------KTAGMHLEEMVGTMLDRMREGKMVVRVHTGDPAMYGAIMEQMVLLKREGVDIEIVPGVTSVFAAAAAAEAELTIPDLTQTVILTRAEGRTPVPEFE-TDLAKHKCTIALFLSSTLTKKVMKEFINAGWSEDTPVVVVYKATWPDEKIVRTTVKDLDDAMRTNGIRKQAMILAGWALDPATPWL........................................................................................................................................................................................................................................ 250
12 -45.8001wde_A mol:protein length:294 Probable diphthine synthase  ali model follow..  15  7.AVTLLLVGWGYAP-GMQTLEALDAVRRADVVYVESYTMPGSSWLYKSVVEAAGEARVVEASRRDLEERSREIVSRALDAV-VAVVTAGDPMVATTHSSLAAEALEAGVAVRYIPGVSGVQAARGATMLSFYRFGGTVTLPGPWRGVTPISVRRIYLNLCAGLHTTALLQGVSLLLEADREYARELLARLPSVLVEAGAGGGHRVLYWSSLERLST--ADVEGGVYSIVIPARLSGVEWLLAAAQRRPLEYDRSVYETVEENCKKGV............................................................................................................................................................................................................... 289
13 -42.9003nei_A mol:protein length:281 Precorrin-4 C(11)-methyltransferase  ali model follow..  27  16...TVHFIGAGPGAADLITIRGRDLIASCPVCLYGSLVPEALLAHCPPGAKIVNTAP------MSLDAIIDTIAEAHAAGQDVARLHSGDLSIWSAMGEQLRRLRALNIPYDVTPGVPSFAAAAATLGAELTLPGVAQSVILTSGRASAMPAGETLENFARTGAVLAIHLSVHVLDEVVQKLVP-HYGEDCPVAIVWRASWPDQRVVRATLATLQTSL-GAELERTALILVGRSLATED........................................................................................................................................................................................................................................... 246
14 -38.1003d8n_A mol:protein length:286 Uroporphyrinogen-III synthase  ali model follow..  2.................................................................................................................................................................................................................................GKPIPNPLLGLDSTENLYFQGIDPFTMRIAYAGLR-RKEEFKALAEKLGFTPLLFPVQATEKVPVRDQVRELAQGVDLFLATTGVGVRDLLEAGKALGLDLEGPKAFRLARGAKAARALKEAGLPPHAVGDQGRGVAALQLYGKPLPLLENALAERGPLMPYRHLPDPEGILRLEEAVLRGEVDALAFVAAIQVEFLFEGAKDPKAALNTRVKALAVGRVTADALREWGVKPFYVDETERLGSL..... 267
15 -36.4003mw8_A mol:protein length:240 Uroporphyrinogen-III synthase  ali model follow..  14  1..........................................................................................................................................................................................................................................................GMKLLLTRPEGKNAAMASALDALAIPYLVEPLLSVEAAAVTQAQLDELSRADILIFISTSAVSFATPWLKDQWPKAT-----YYAVGDATADALALQGITAERSPADSGKQIVIVRGKGGREAMADGLRLRGYLEVYQRACPPLDAPASVSRWQSFGIDTIVVTSGEVLENLINLVPKDSFAWLRDCHIIVPSARVETQARKKGLRRVTNAGAANQAAV..... 234
16 -35.1001wcw_A mol:protein length:261 Uroporphyrinogen III synthase  ali model follow..  10  1...................................................................................................................................................................................................................................................MRRLEEDAVRVAYAGLR-RKEAFKALAEKLGFTPLLFPVQATEKVPVRDQVRALAQGVDLFLATTGVGVRDLLEAGKALGLDLEGPKAFRLARGAKAARALKEAGLPPHAVGDQGRGVAALQLYGKPLPLLENALAERGPLMPYRHLPDPEGILRLEEALLRGEVDALAFVAAIQVEFLFEGAKDPKAALNTRVKALAVGRVTADALREWGVKPFYVDETERLGSL..... 248
17 -34.8001wcx_A mol:protein length:261 Uroporphyrinogen III Synthase  ali model follow..  11  1...................................................................................................................................................................................................................................................MRRLEEDAVRVAYAGLR-RKEAFKALAEKLGFTPLLFPVQATEKVPVRDQVRALAQGVDLFLATTGVGVRDLMEAGKALGLDLEGPKAFRLARGAKAARALKEAGLPPHAVGDQGRGVAALQLYGKPLPLLENALAERGPLMPYRHLPDPEGILRLEEALLRGEVDALAFVAAMQVEFLFEGAKDPKAALNTRVKALAVGRVTADALREWGVKPFYVDETERLGSM..... 248
18 -33.6004es6_A mol:protein length:254 Uroporphyrinogen-III synthase  ali model follow..  12  1.....................................................................................................................................................................................................................................................GSHMSGWRLLLTRPDEECAALAASLGEAGVHSSSLPLLAIDPLEEQRTLMLDLDRYCAVVVVSKPAARLGLERLDRYWPQPPQQ--TWCSVGAATAAILEAYGLDVTYPEQGDDSKVLIMRGEGGREFLAERLRGQGYLPLYRRRAPDYPAGELLARVRAERLNGLVVSSGQGLQNLYQLAAAD-WPEIGRLPLFVPSPRVAEMARELGAQRVIDCRGASAPAL..... 246
19 -32.7003re1_A mol:protein length:269 Uroporphyrinogen-III synthetase  ali model follow..  13  1...................................................................................................................................................................................................................................................MSSMDMSAWRLLLTRPAEESAALARVLADAGIFSSSLPLLETEPLPLQRSIIFELLNYSAVIVVSKPAARLAIELIDEVWPQPPMQ--PWFSVGSATGQILLDYGLDASWPEQGDDSRVLIMRGNEGRELLAEQLRERGYLPLYRRYLPQHAPGTLLQRVEVERLNGLVVSSGQGFEHLLQLAGD-SWPDLAGLPLFVPSPRVASLAQAAGARNVIDCRGASAAAL..... 248
20 -31.4001jr2_A mol:protein length:286 UROPORPHYRINOGEN-III SYNTHASE  ali model follow..  15  3....................................................................................................................................................................................................................................................SSGHIEGRHLLLKDAKEDDDPYIRELGLYGLEATLIPVLSFESLPSFSEKLSHPEDYGGLIFTSPRAVEAAELCLEQNNKTEVW-AKSVYVVGNATASLVSKIGLDTEGETCSSALPLLFPCGNLKREILPKALKDKGSITVYQTVAHPGIQGNLNSYYSQGVPASITFFSPSGLTYSLKHIQELSGDNIDQIKFAAIGPTTARALAAQGLPVSCTAESPTPQALTGIRK 268
21 -30.4003p9z_A mol:protein length:229 Uroporphyrinogen III cosynthase (HemD)  ali model follow..  10  2..................................................................................................................................................................................................................................................................NAMREIVWVHSQRIAPYKTLILNEFCYYPLE------LDPTPFNALIFTSKNAVFSLLETLKNSPKLKMLQNIPAYALSEPTAKTLQDHHFKVAFMGEKAKKSVLYLRAKEIVSSLDTILLEHGQAVVYENKLKHLTLSEQ-NALKPKEKSILIFTAISHAKAFLHYFE-----FLENYTAISIGNTTALYLQEQGIPSYI-AKKPSLEAC..... 217
22 -28.7002qbu_A mol:protein length:232 Precorrin-2 methyltransferase  ali model follow..  17  1MHGKLIGVGVGPGDSELLTLRAVNVLRSVPVICAPR-IVEDILTERRDGCRILDP-DDRDELESHWDSAARMVAAELEDGRDVAFITLGDPSIYSTFSYLQQRIEDMGFKTEMVPGVTSFTACAATAGRTLVEGD--EILLVVPRVDDR-------ERVLRDVDACVIMKTSRHGRRAMEVVESDPRGKD--VVSVANCSMDDEVVE........................................................................................................................................................................................................................................................................... 211
23 -27.9002e0k_A mol:protein length:259 Precorrin-2 C20-methyltransferase  ali model follow..  20  3NQGSIISVSLGPGDPGLITVKALSQLREADVIYYPGILKEFDLDPSKLRGMLVPMSRSRGAAEASYAANYASMAEEVQAGRRVAVVSVGDGGFYSTASAIIERARRDGLDCSMTPGIPAFIAAGSAAGMPLALQS--DSVLVLAQIDEIGE-----ERALVTHSTVVVMKLSTVRDELVSFLERYAKP----FLYAEKVGMAGEFITME-----VDALRSRAIPYFSLLVC................................................................................................................................................................................................................................................... 232
24 -25.0002zvb_A mol:protein length:295 Precorrin-3 C17-methyltransferase  ali model follow..  20  2..GELFLVGMGPGDLPGLTQRAREALEGAEVVIYVKLLEEMGL----------LAGKEVVRKGMTEELDREEALERALSGQRVALVSGGDPGIYGMAAPVLELMEERGVFLAVIPGVTAANAVASLLGSPLAHDTCLISLSDLLTPWPLIERRLHAAGQGDFVVVLYNPQSKRRDWQLSAEILLEYRPKETPAALVKSAYRKRQEVALTTLEGL----REAEAGMLTTVVIGN-----------RQSRFYEGTFLTPRGYALKYDLDTK............................................................................................................................................................................................................. 270
25 -24.6003nut_A mol:protein length:251 Precorrin-3 methylase  ali model follow..  22  1MSGWVTVAGLGPGREDLVTPEVTAALAEATDIVYVARIAPR--------------GLTLHPTDNRVEDRATHALEMAAEGRRVVVVSSGDPGVFAMASALFEALEAAGTEIRILPGITAMLAAAAAAGAPLGHDFCAINLSDNLKPFEILEKRLRHAARGDFAMAFYNPRSKSRPHQFTREILREECEPGRLILFARAVTTPEQAISVVELRDA----TPEMADMRTVVLVGN................................................................................................................................................................................................................................................. 227
26 -24.4002npn_A mol:protein length:251 Putative cobalamin synthesis related protein  ali model follow..  16  2.MRTIYVIGIGTGSPEFLTLQAISGLRHAQAIVA-LALRQKIVDTHAPGTPIYYEEEVRRWHAERAHLLASTIRERTPDDGAVAFLVWGDPSLYDSTLRIIEHMRNLEADVKVIPGITAVQVLTAEHGILINR--IGEAIHITTGRNLPE------TSAKDRRNCVVMLDG--------KTAWQDVATEHTYMWWGAFLGTEQQVLRKGYVHEIGAQVAELK------------QQLRTEHGWIMDTYLLRE.............................................................................................................................................................................................................................. 249
27 -21.9003nd1_A mol:protein length:275 Precorrin-6A synthase/CobF protein  ali model follow..  16  15.MIELSLIGIGTGNPRHITGQAVDAMNAADLILI---DKSDLAGLRRQICAAHYRKGVDDWHDAIAETWLSEIAHVPGLEGRVALLVWGDPSLYDSTLRIAERLKSRPLTTKVIPGITAIQALCAAHAIPLND--IGAPVVITTGRQLRDHG-----WPAGTETVVAMLDGECSFQSL--------PPDGLTIFWGACVAMPEEVLIRGPVAEVTDEILQAR------------ADLRARHGWVMDIYLLRRN............................................................................................................................................................................................................................. 265
28 -18.3003kwp_A mol:protein length:296 Predicted methyltransferase  ali model follow..  16  14TGGHLYLVPTPIGNLDDMTFRAVKTLTAVDLIA-DTRNTQKLLNHFEITTKQI------SFHEHNTQERIPQLIAKLKQGMQIAQVSAGMPSISDPGHELVNACIDAHIPVVPLPGANAGLTALIASGLAPQPFYFYGFLD-----RKPKDRKAEIAGLAQRPETLIFYEAPHRLKKTLQNLAAG--GDERPAVLCRELTKRYEEFLRGSLAELANWAATDTVRGEFVVLVGAPTTAATTAVDLSEPIDVQVDRLIAAGEKPNDAIKEVAKLRGAKKQEI.................................................................................................................................................................................................. 286
29 -16.0002bb3_A mol:protein length:221 cobalamin biosynthesis precorrin-6Y methylase  ali model follow..  17  14SGHMIWIVGSGTCRGQT-TERAKEIIERAEVIY-----SRRALELAGVVDD----SRARILRSFKGDEIRRIM--EEGREREVAVISTGDPMVAGLGRVLREIAEDVEI--KIEPAISSVQVALARLKVDL------SEVAVVDCHAKDFDA------------ELTELLKYRHLLILADSHFPLERLGKRRVVLLENLCMEGERIREGNADSIELES-----DYTIIFVEREVME.............................................................................................................................................................................................................................................. 213
30 -15.3003hh1_A mol:protein length:117 Tetrapyrrole methylase family protein  ali model follow..  22  5.KGTLYVVATPLGNLDDMTFRAVNTLRNAGAIA-DTRRTSILLKHFG-----IEGKRLVSYHSFNEERAVRQVIELLEEGSDVALVTAGTPAISDPGYTMASAAHAAGLPVVPVPGA..................................................................................................................................................................................................................................................................................................................................................................... 117
31 -14.9001wyz_A mol:protein length:242 putative S-adenosylmethionine-dependent methy  ali model follow..  12  1METALYLLPVTLGDTPLLPSYNTEIIRGIRHFIVEDVRSARFLKKVDREIDIDSLTFYPLNKHTSPEDISGYL-KPLAGGASMGVIEAGCPAVADPGADVVAIAQRQKLKVIPLVGPSSIILSVMASGFNGQSFAFHGYLPIEPGERAKKLKTLE-QRVYAESQTQLFIETPYRNHKMIEDILQN--RPQTKLCIAANITCEGEFIQTRTVKDWKGHIPELS-KIPCIFLLYK................................................................................................................................................................................................................................................. 234
32 -10.0003fq6_A mol:protein length:115 Methyltransferase  ali model follow..  10  2...................................................................................................................NATAFVPALVASGLPNEKFCFEGFLPQKKGRQTR------LKALAEEHRTMVFYESPHRLLKTLTQFAEY--GTERQATVSREISKLHEETVRGSLAELIEHFTATEPRGEIVIVLA.................................................................................................................................................................................................................................................. 111
33 -9.9003ffy_A mol:protein length:115 Putative tetrapyrrole (Corrin/porphyrin) meth  ali model follow..  13  2...................................................................................................................NATAFVPALVASGLPNEKFCFEGFLPQKKGRMTK------LKSLVDEHRTMVFYESPHRLLKTLTQFAEY--GPERQVSVSREISKIHEETVRGTLSELIEHFTATDPRGEIVIVLA.................................................................................................................................................................................................................................................. 111

FFAS is supported by the NIH grant R01-GM087218-01
1 2 0 2 1 2   jobs submitted since Jan 1, 2011
Comments and questions to: webmaster

Selected papers from Godzik Lab
Ying Zhang, Ines Thiele, Dana Weekes, Zhanwen Li, Lukasz Jaroszewski, Krzysztof Ginalski, Ashley Deacon, John Wooley, Scott Lesley, Ian Wilson, Bernhard Palsson, Andrei Osterman, Adam Godzik. Three-Dimensional Structural View of the Central Metabolic Network of Thermotoga maritima. Science. 2009 Sep 18;325(5947):1544-9.

Alexey M. Eroshkin, Andrew LeBlanc, Dana Weekes, Kai Post, Zhanwen Li, Akhil Rajput, Sal T. Butera, Dennis R. Burton, Adam Godzik. bNAber: database of broadly neutralizing HIV antibodies. Nucl. Acids Res. 2013; published on November 7, 2013.

Li W, Godzik A. cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences. Bioinformatics. 2006 May 26;