Indicators on firm level innovation activities from web scraped data

Sajad Ashouri, Arho Suominen, Arash Hajikhani, Lukas Pukelis, Torben Schubert, Serdar Türkeli, Cees Van Beers, Scott Cunningham

Research output: Contribution to journal › Article › peer-review



This article presents data on companies' innovative behavior, measured at the firm level from web scraped data on medium-high and high-technology companies in the European Union and the United Kingdom. The data are retrieved from individual company websites and cover 96,921 companies in total. The data provide information on various aspects of innovation, most significantly the research and development orientation at the company and product level, the company's collaborative activities, the company's products, and its use of standards. In addition to the web scraped data, the dataset aggregates a variety of firm-level indicators, including patenting activities. In total, the dataset includes 21 variables with unique identifiers, which enable linking to other databases such as financial data.
Original language: English
Article number: 108246
Journal: Data in Brief
Early online date: 6 May 2022
Publication status: Published - 30 Jun 2022


Keywords:

  • big data
  • web scraped data
  • text data
  • firm-level data
  • firm innovation

