parlCymru: a dataset of spoken contributions from the Welsh Parliament

  • Daniel Braby (Creator)
  • Jac Larner (Contributor)

Dataset

Description

parlCymru intends to provide full-text vectors for all spoken contributions in the Senedd Cymru/Welsh Parliament (formerly National Assembly). This present version provides coverage of all recorded speeches of the Fifth Senedd from 2016-05-05 to 2021-05-05. Metadata includes the speaker, their party, gender, electoral district, the title of the debate and the date. Debates are identified by a unique id, as are members. A crosswalk to Wikidata for members is included for future integration with aggregate datasets of legislators and for easily pulling in additional variables. A dataset of Members of the Fifth Senedd is included, also providing Twitter handles. Text is available in both English and Cymraeg, with an additional variable for the language spoken in Parliament. Two files "Corp_Senedd_en_V2.rds" and Corp_Senedd_cy_V2.rds" provides versions compatible with Rauh & Schwalbach's "ParlSpeech V2" dataset for comparative analyses. Full replication materials are available as a single R script.

CC0 1.0 Universal (CC0 1.0) Public Domain Dedication

This site includes records provided by Elsevier's Data Monitor product. University of Strathclyde does not control or guarantee the accuracy, relevance, or completeness of the information contained in such records and accepts no responsibility or liability for such information.
Date made available15 Jun 2023
PublisherHarvard Dataverse
Date of data production2021

Cite this