euwebarchive - Web tools

euwebarchive

EU web archive

More and more EU information is only made available on the web. However, web content often has a short lifespan and web technologies evolve quickly. So this information is at risk of getting lost, e.g. when websites are substantially changed or taken offline. Web archiving is the process of collecting websites to ensure that the information is preserved in an archive for future researchers, historians, and the public.

The Publications Office of the European Union carries out web archiving to preserve the websites of EU institutions, agencies and bodies. The archived materials are kept in the EU web archive, open and available online which the members of the public can access online

This archive reflects the content and design of websites as it was at a given point in time. Thanks to it, the information institutions provided on their websites stays available, even if the original site or page has fully or partially disappeared

Web archiving at the European Union started in 2013. Since then, we archive EU websites and we continuously add new websites to the collection.

What do we archive?

We archive on regular basis the websites of the EU institutions, most of which are hosted on the europa.eu domain and subdomains and are archived on a regular basis.

Exceptionally we create collections of archives of websites related to the activities of the European Union (e.g. the sites of EU Council presidencies, EU publications, etc.).

Ad hoc crawls of websites that will be taken offline or will change substantially can be done on request of the respective EU institution. For example, we can archive pages created for certain events.

The following contents are excluded from the archive:

  • External websites.
  • Databases.
  • Dynamic content.
  • Social media.
  • Password-protected sites.
  • Streaming & downloadable media.

We only capture web content that is freely available in the public domain.

Guidance for website managers

In order to optimise the quality of the archived versions of your website it is important to create preservable websites. You will find more information in these guidelines.

For more information concerning the European Commission websites, see the Europa Web Guide.

You must remove content that should not be preserved (and be accessible) in the long term. This can be for reasons such as intellectual property rights (e.g. copyright), confidentiality, privacy, data protection, etc. If this content cannot be removed before archiving, prevent it from being crawled by using robot.txt files.

Takedown policy

There are legitimate circumstances when it may be required to hide pages in the web archive from public view.

Anyone can submit a motivated takedown request. Please use this email link to initiate it: op-web-preservation@publications.europa.eu.

Takedown will only be considered in one of the following cases:

  • if the page includes one of the following types of content:
    • personal or sensitive personal information, as defined by Regulation (EU) 2018/1725 on the protection of natural persons with regard to the processing of personal data by the Union institutions, bodies, offices and agencies;
    • copyright protected material for which the necessary rights are not held;
    • defamatory or obscene material or messages;
  • if the content of the page may cause serious and real administrative difficulties to the website owner;
  • if the page was published in good faith, but circumstances for this have changed and its takedown is now considered appropriate;
  • if the page was published in error and its takedown is deemed necessary to correct this mistake.

Legal information

Copyright

© European Union, 2019

The Publications Office carries out web archiving to preserve the websites of the European Union. Most of the archived content of the websites that are accessible in the EU web archive (EUWA), is under EU (or EU institutions, agencies or bodies) copyright. Ownership and copyright of websites in the EUWA remain the responsibility of the website owners.

Unless otherwise stated, the material obtained from the EUWA may be freely reproduced. This general principle can be subject to conditions, which may be specified in individual copyright notices. It does not apply to photographs, videos, pieces of music or other material subject to intellectual property rights of third parties (non-EU). In such cases, permission to use the material must be sought directly from the copyright holders. The Publications Office does not warrant that all third-party content is appropriately marked.

All logos and trademarks are excluded from the abovementioned permission.

Any queries regarding the above should be addressed to the following email OP-COPYRIGHT@publications.europa.eu

Privacy statement

Find out more

Contact the web archiving team: op-web-preservation@publications.europa.eu