HomeKnowledge base › Cleaning up the email archive
IT

Cleaning up the email archive: from mailbox quota to GDPR-compliant

An average business mailbox in the Netherlands contains, after five years, between 50,000 and 250,000 emails, with storage of 5-25 GB. For the company it is a treasure trove of correspondence and contracts; for IT it is a quota problem; for compliance it is a GDPR time bomb. This article describes how to set up a clean-up of the business email archive without losing business knowledge or exceeding retention periods.

Three forces demanding a clean-up

  1. Quota. Mail servers with capacity licences; cloud platforms with 50/100 GB per user; storage costs in cloud environments that scale with email count.
  2. GDPR storage limitation. Personal data of customers and applicants in email body and attachments. Under GDPR art. 5 this may not be retained longer than necessary.
  3. Speed and searchability. A mail archive of 250,000 messages is practically unusable for searches.

Retention periods for email

Email is not a separate category under the GDPR; the retention period follows the content:

For the full overview, see the GDPR retention cheat sheet.

A clean-up strategy in four steps

Step 1: Inventory

Step 2: Set policy

Step 3: Implementation

Step 4: Communication

The biggest mistake: clearing without policy. A spontaneous ‘let's throw out emails older than 5 years’ is not a GDPR-compliant plan, it is a data breach waiting to happen.

What about ex-employees?

Mailboxes of people who have left are often a gap in policy. GDPR rules of thumb:

Local versus cloud deletion

A cloud delete is not the same as physical destruction. Read our article really deleting cloud data for the details. For most organisations: cryptographic erasure in the cloud suffices for GDPR purposes, provided the provider has demonstrable delete processes.

What does require physical destruction: local PST files on USB sticks or laptops, on-premise backups on tape or HDD. For these:

Auto-archive versus active clean-up

Two models:

Auto-archive

Emails older than X months go automatically to an ‘archive’ folder, and eventually to a cold-storage system. Good for mailbox quota, but extends the effective retention period unless a deletion policy is also active.

Active clean-up

Per quarter or year the oldest segment is reviewed and (per policy rule) deleted or retained. Requires human involvement, delivers better GDPR compliance.

Most organisations combine: auto-archive for capacity, annual review for compliance.

Specific points of attention

  1. Attachments with personal data. Excel files with customer lists, PDF contracts with BSN, CV attachments from applicants. Inventory this stream separately.
  2. Personal folders and rules. Staff create their own mailbox structures that are sometimes 10 years old.
  3. Sent items. Personal data here too. Not only the inbox.
  4. Shared mailboxes. info@, support@, hr@. Their own retention policy.
  5. OneDrive/SharePoint links. Attachments in emails often refer to shared files; for cloud deletion see really deleting cloud data.

Documentation

For GDPR evidence you need:

Cloud mail cleaned up, on-premise backups destroyed.

We destroy the physical copies that do not travel with cloud deletion: USBs, tapes, old HDDs with mail archive.

Read more for IT MSPs

Is your organisation working on email retention policy? Email us via desnipperaar.nl about the physical side of the clean-up.