cazy_webscraper: For creating a local CAZy database

Emma Hobbs, Tracey Gloster, Sean Chapman, Leighton Pritchard

Research output: Contribution to conferencePoster

37 Downloads (Pure)


Carbohydrate Active enZymes (CAZymes) are pivotal in pathogen recognition, signalling, structure and energy metabolism. CAZy ( is the most comprehensive CAZyme database, but it does not provide methods for automating data retrieval or submitting sequences for annotation.

cazy_webscraper retrieves user-specified datasets from CAZy, producing a local SQL database enabling thorough interrogation of the data. cazy_webscraper can also retrieve protein sequences from GenBank and download structure files from RCSB PDB.
Original languageEnglish
Publication statusPublished - 4 Apr 2021
EventMicrobiology Society Annual Conference 2021 - Online
Duration: 26 Apr 202130 Apr 2021


ConferenceMicrobiology Society Annual Conference 2021
Internet address


  • bioinformatics
  • CAZyme
  • CAZymes
  • CAZy
  • CAZy proteins
  • CAZy protein families
  • webscraper
  • CAZy family


Dive into the research topics of 'cazy_webscraper: For creating a local CAZy database'. Together they form a unique fingerprint.
  • PhD Supervision

    Tracey Gloster (Advisor), Leighton Pritchard (Advisor), Sean Chapman (Advisor) & Emma Hobbs (Recipient)

    1 Sept 20191 Sept 2023

    Activity: Public Engagement and Other ActivitiesOther

Cite this