For publications where some content should not be preserved, consider tagging what can be preserved in a consistent way that can be used by preservation export or harvesting processes to exclude items that should not be preserved. Platforms may want to facilitate this tagging.
These guidelines also concern the inclusion and exclusion of content in the preservation process:
- 10 - Define and document core intellectual components that need to be preserved
- 20 - Represent all core intellectual components of the work in the export package
- 40 - Identify the rights for external web content
- 55 - Consider whether it is ethical/appropriate to preserve social media
- 65 - Ensure irrelevant or private administrative data is removed from data exports