• Blog
  • Contact
  • Sign in
CloverDX
Product
  • Overview
  • CloverDX Data Integration Platform
  • What's new in CloverDX 6
  • Pricing
  • CloverDX plans
  • Deployment
  • CloverDX on AWS
  • CloverDX on Azure
  • CloverDX on Google Cloud
  • CloverDX on-premise
  • Resources
  • Customer Portal
  • Documentation
  • Downloads & Licenses
  • Webinars
  • Academy & Training
  • Release Notes
  • CloverDX Forum
  • CloverDX Blog
  • Tech Blog
  • Other resources
isometric-illustration--product@2x 1

Get under the hood of CloverDX

See how CloverDX can benefit your business with a live demo. Simply get in touch with our team and we’ll handle the rest.

Book a demo
Solutions
  • By Industry
  • Banking
  • Capital Markets
  • Consultancy & Advisory
  • FinTech
  • Government Agencies
  • Healthcare
  • By Use Case
  • Data Quality
  • Data Ingest
  • Data Warehousing
  • Data Migration
  • Digital Transformation
  • Enterprise Data Management
  • Risk & Compliance
  • Anonymization
How F3 Group use CloverDX to ingest more client data - webinar
Customer interview

Formula 3: Staying Small And Agile While Working With Large Enterprise Ecosystems

Browse webinars
Services
  • Services
  • Onboarding & Training
  • Professional Services
  • Customer Support

More efficient, streamlined data feeds

Discover how Gain Theory automated their data ingestion and improved collaboration, productivity and time-to-delivery thanks to CloverDX.

 

Read case study
Customers
  • By Use Case
  • Analytics and BI
  • Data Ingest
  • Data Integration
  • Data Migration
  • Data Quality
  • Data Warehousing
  • Digital Transformation
  • By Industry
  • App & Platform Providers
  • Banking
  • Capital Markets
  • Consultancy & Advisory
  • E-Commerce
  • FinTech
  • Government
  • Healthcare
  • Logistics
  • Manufacturing
  • Retail
Migrating data to Workday - case study
Case study

Effectively Migrating Legacy Data Into Workday

Read customer story
Company
  • About CloverDX
  • Our story & leadership
  • Contact us
  • Partners
  • CloverDX Partners
  • Become a partner
Pricing
Demo
Trial

Build A Painless Anonymization And Pseudonymization Strategy

Data Architecture Data Anonymization
Posted January 27, 2020
3 min read
Build A Painless Anonymization And Pseudonymization Strategy

Data anonymization and pseudonymization are necessary data privacy techniques for extracting value from your data whilst remaining GDPR compliant.

Both involve using de-identification methods, such as scrambling, masking, and blurring to help conceal identifiable data sets. But there’s one key difference:

  • Anonymization scrubs your data of all identifiable information that could expose your data subject. For example, a simple but effective anonymization might involve replacing all names with a ‘*’ or replacing real credit card numbers with just 16 random digits.
  • Pseudonymization, on the other hand, does not remove all identifiable information, but does make it extremely difficult to link data back to its subject. Without the hidden ‘key’ (i.e. information of substituted fields), outside parties will never know the true identities behind your data.

Pseudonymization makes it easier to retain the usefulness of the data set with a decent level of protection, whereas simple anonymization (masking, deleting or randomly generating replacement data) provides stronger protection at the expense of losing much of the data’s value. As an example, randomly generated credit card numbers are useless for properly testing a web payment form or analysis of “which card issuer has most problems clearing payments”.

On the other hand, careful anonymization of the original card numbers would preserve critical information – card issuer, type, valid check digit – in a way that does not reveal any identifiable information about the holder.

Implementing either practice is easier said than done. That’s because applying anonymization and pseudonymization on a larger scale requires thorough planning and careful execution.

We’ve boiled this down to the three Ds of de-identification.

Removing danger from data - webinar - watch now

1. Define your use case and the level of anonymization needed

Ultimately, the more data fields you anonymize, the less ‘realistic’ and usable your data becomes. On the other hand, when you anonymize fewer fields, the data becomes less secure, and the easier it is to re-identify the data. How you process, share and use your data should define the anonymization technique used.

2. Discover your data

Taking the time to discover your own data might seem like an obvious next step. But for large organizations, this process is more akin to finding needles in not one, but multiple haystacks.

With numerous IT systems and hundreds of thousands of database tables, often containing similar data records, it’s difficult to work out what data you have and where it is. But, for compilatory and business purposes, you need to understand where your data resides.

This is a huge project for any large organization to undertake. As a result, you’ll need help from a consulting company or a data expert with the right tools for discovering and anonymizing your data simply and effectively.

3. Data anonymization and pseudonymization at scale

When anonymizing a single data set to send to a contractor, you can easily make do with Excel or other readily available anonymization tools.

However, if you have multiple use cases and, therefore, require various levels of anonymization, things become much more complex. This problem is only doubled when you consider the amount of your data that’s dotted all over your systems.

So, if you require large, enterprise scale anonymizations, the job will require anonymizing entire databases at once, alongside any other accompanying data (referential integrity, IDs etc). Without anonymizing or pseudonymizing this data, your anonymization process will fail.

In order to get your specified data treated correctly, you can either ask your developers to build a customized anonymization process internally or contact an expert who already has templates and tools built for the task. However, bear in mind that tackling this on an ad-hoc base internally may take months to years to complete.

The CloverDX approach

If you don’t have the time to wait for your developers to build an anonymization process, it’s better to enlist the help of an automated tool that can do most of the work for you.

CloverDX’s anonymization framework simplifies setting up and operating your complex anonymization and pseudonymization process. We’ve developed a ‘Data Harvester’ that crawls your thousands of databases and finds the specific sensitive datasets you’re looking for at large scale, cutting the time you’d be spending doing it manually from potentially months to just a couple of weeks...

From here, CloverDX’s anonymization engine uses rules that define multiple targets for the different levels of data anonymization required. As your data grows and changes (as it inevitably will), you can easily re-configure the platform and continue to anonymize your sensitive information automatically and with minimal hassle.

So, are you ready to de-stress your data anonymization processes? Watch our data anonymization webinar for more information.

New call-to-action

Share

Facebook icon Twitter icon LinkedIn icon Email icon
Try CloverDX for 45 days  Full access to Tech Support as if you were a customer

Newsletter

Subscribe

Join 54,000+ data-minded IT professionals. Get regular updates from the CloverDX blog. No spam. Unsubscribe anytime.

Related articles

Back to all articles
buying data integration software
Data Architecture
7 min read

Dos and don'ts when buying a data integration platform

Continue reading
Data architecture health check - do you have these symptoms?
Data Architecture
7 min read

Data architecture health check: Do you have these symptoms?

Continue reading
What is modern enterprise data architecture?
Data Architecture
5 min read

What is modern enterprise data architecture?

Continue reading
CloverDX logo
Book a demo
Get the free trial
  • Company
  • Our story
  • Contact
  • Partners
  • Our partners
  • Become a partner
  • Product
  • Platform overview
  • Plans & Pricing
  • Customers
  • By Use Case
  • By Industry
  • Deployment
  • On-premise
  • AWS
  • Azure
  • Google Cloud
  • Services
  • Onboarding & Training
  • Professional Services
  • CloverCARE Support
  • Resources
  • Customer Portal
  • Documentation
  • Downloads & Licenses
  • Webinars
  • Academy & Training
  • Release Notes
  • CloverDX Forum
  • CloverDX Blog
  • Tech Blog
  • Other resources
Blog
4 steps to providing a data-driven customer experience
Data Integration
Implementing data democratization: 3 ways to make your data more accessible
Data Innovation
Data dictionary vs data catalog: what’s the difference?
Data Innovation
What is a ‘live’ data catalog and how can you use one in your organization?
Data Innovation
© 2023 CloverDX. All rights reserved.
  • info@cloverdx.com
  • sales@cloverdx.com
  • ●
  • Legal
  • Privacy Policy
  • Cookie Policy
  • EULA
  • Support Policy