Toolbox

To streamline the analysis of data downloaded from the DSA Transparency Database, you can use the open-source dsa-tdb python package. The package allows to efficiently carry out a number of data pre- and post-processing tasks at scale thanks to its high-performance data processing backend. Specifically, the package allows you to:

  • Easily download the daily download files, perform their checksum verification and convert them into data processing-ready csv or parquet files.
  • Filter and/or aggregate the statements of reasons across user-selected variables from the database schema to create bespoke datasets for advanced visualisations or to answer advanced research questions.
  • Develop ad-hoc dashboards and visualisations based on the aggregated data using the Apache Superset framework.

Depending on your technical level and preferences, you can access these functionalities

  • via the high-level command line interface;
  • through a jupyter notebook, directly using the module’s python bindings;
  • through fully functional APIs, either programmatically or using an interactive web-based interface.

To access the package as well as its full technical documentation, you can visit the dsa-tdb page on code.europa.eu.

If you use the data from the DSA Transparency Database for your research work, please cite it using the following information:

European Commission-DG CONNECT, Digital Services Act Transparency Database, Directorate-General for Communications Networks, Content and Technology, 2023
https://doi.org/10.2906/134353607485211

Digital Services Act Logo