• Blog
  • Contact
  • Sign in
CloverDX
Product
  • Overview
  • CloverDX Data Integration Platform
  • What's new in CloverDX 6
  • Pricing
  • CloverDX plans
  • Deployment
  • CloverDX on AWS
  • CloverDX on Azure
  • CloverDX on Google Cloud
  • CloverDX on-premise
  • Resources
  • Customer Portal
  • Documentation
  • Downloads & Licenses
  • Webinars
  • Academy & Training
  • Release Notes
  • CloverDX Forum
  • CloverDX Blog
  • Tech Blog
  • Other resources
isometric-illustration--product@2x 1

Get under the hood of CloverDX

See how CloverDX can benefit your business with a live demo. Simply get in touch with our team and we’ll handle the rest.

Book a demo
Solutions
  • By Industry
  • Banking
  • Capital Markets
  • Consultancy & Advisory
  • FinTech
  • Government Agencies
  • Healthcare
  • By Use Case
  • Data Quality
  • Data Ingest
  • Data Warehousing
  • Data Migration
  • Digital Transformation
  • Enterprise Data Management
  • Risk & Compliance
  • Anonymization
How F3 Group use CloverDX to ingest more client data - webinar
Customer interview

Formula 3: Staying Small And Agile While Working With Large Enterprise Ecosystems

Browse webinars
Services
  • Services
  • Onboarding & Training
  • Professional Services
  • Customer Support

More efficient, streamlined data feeds

Discover how Gain Theory automated their data ingestion and improved collaboration, productivity and time-to-delivery thanks to CloverDX.

 

Read case study
Customers
  • By Use Case
  • Analytics and BI
  • Data Ingest
  • Data Integration
  • Data Migration
  • Data Quality
  • Data Warehousing
  • Digital Transformation
  • By Industry
  • App & Platform Providers
  • Banking
  • Capital Markets
  • Consultancy & Advisory
  • E-Commerce
  • FinTech
  • Government
  • Healthcare
  • Logistics
  • Manufacturing
  • Retail
Migrating data to Workday - case study
Case study

Effectively Migrating Legacy Data Into Workday

Read customer story
Company
  • About CloverDX
  • Our story & leadership
  • Contact us
  • Partners
  • CloverDX Partners
  • Become a partner
Pricing
Demo
Trial

What is a data vault?

Data Architecture
Posted April 19, 2022
4 min read
What is a data vault?

A data vault is a database modelling methodology and architecture, created by Dan Lindstedt in the 1990s. In his own words:

‘The data vault is a detail-oriented, historical tracking and uniquely linked set of normalized tables that support one or more functional areas of business.’

Lindstedt created the model in an attempt to ease issues surrounding scalability, flexibility, and data mining in data warehouses.

It’s considered a hybrid of the ‘third normal form’ and ‘dimensional modeling’ (or star schema) techniques, and stores all of your data. It’s often the model of choice for source systems that are subject to change over time.

But this is only skimming the surface of data vaults. Let’s explore how this technique works and how it can benefit your business.

How does it work?

The data vault technique separates each business function or entity into ‘business keys’. These keys form the overarching structure of the data vault, and the data warehouse takes shape around them. As these keys are largely unchanging functions, it offers some stability against organizational transformations.

In a typical data vault structure, the framework consists of three key aspects:

  • Hubs. These hubs contain a list of stable business keys. For instance, employees, or license type.
  • Links. These represent relationships and transactions between hubs. As an example, you might create a link between EMPLOYEE_HUB and SOFTWARE_LICENSE_HUB when an employee has a license to a certain platform.
  • Satellites. Finally, satellites contain temporary and descriptive metadata around hubs and links. This extra information aids businesses with historical tracking and auditing. For instance, a satellite might contain contact details for a customer.

Every dataset within a data vault structure is essentially ‘time stamped’. This means you can determine load date and origin information.

4 data vault benefits

‘The design is flexible, scalable, consistent and adaptable to the needs of the enterprise. It is a data model that is architected specifically to meet the needs of enterprise data warehouses.’ - Dan Linstedt

The data vault methodology solves complexities around data warehousing and is resistant to change.

The top four benefits of the model include:

1. Scalability

As the volume of data increases, data warehouses can become increasingly inflexible. It then becomes difficult to add functionalities to your current datasets without breaking (and subsequently remodeling) anything. This presents a tricky blocker to businesses that anticipate change over time.

The data vault technique allows you to build upon your structure over time without putting your historical data at risk. You can continually add or delete links and satellites as you go.

2. Auditability and historical tracking

As a data vault hinges on historical metadata, you can also benefit from easy auditability. You load the data into the structure without any cleansing. This means you can trace the source and lineage of each data set, including when it was loaded and from where.

This is a crucial benefit for heavily regulated businesses, such as those within the financial sector.

3. Fast loading

Because you don’t need to cleanse data before placing it in your data vault, you can load large volumes of data at speed.

What’s more, you can load this data in parallel, as there are very few dependencies between your hubs, links, and satellites during the loading process.

4. Enterprise-wide oversights

It’s straightforward to get a bird’s eye view of your organization’s data. Particularly as it’s organized by business functions.

This makes it simple for various departments to dip into the data vault and access any relevant information.

When to use the data vault model

More often than not, you’ll want to use a data vault when you need to audit your data. As each row in every table requires load metadata, you can easily access current and historical data. This allows you to trace data origins, loading dates, and changes.

But the model’s use cases go beyond this. Data vaults can also be useful when:

  • You need to get the hard facts. Data experts often describe vaults as ‘one single source of the facts’.
  • You have multiple source systems. As the data vault structure is very flexible, it works particularly well for organizations with multiple data sources or changing data sources and relationships. You can add hubs, satellites, and links as you go. Similarly, you can delete any links if relationships are no longer relevant.
  • You want to load data quickly. As we already stated, data vault models enable fast and simultaneous data loading at scale. This is particularly beneficial if your business handles a large amount of data, such as transactional data.

On the other hand, a data vault may not be the right choice if you:

  • Only have one source system or unchanging data relationships. This is because making changes should be relatively easy and wouldn’t warrant a vault structure.
  • Need to load data directly into a data reporting tool. This would require an added step of data manipulation in order to join up your various hub, satellite, and link tables. As well as taking time, this process is prone to errors.
  • Are on a tight budget. Of all the data warehousing methodologies available, building a data vault is the most expensive.

Is the data vault right for you?

For organizations that handle large amounts of data and regularly experience change, the data vault is a compelling framework.

Its flexibility, scalability and speed of loading allow companies to build an incremental, evolving, and auditable data structure as they grow. As you organize the vault by business entities, it’s simple to dive in and use the data for analysis. That said, this methodology might not be for everyone. Particularly if your purse strings are tight.

If you’d like to learn more about your data architecture options, we recommend reading our Guide to Enterprise Data Architecture

New call-to-action

Share

Facebook icon Twitter icon LinkedIn icon Email icon
Try CloverDX for 45 days  Full access to Tech Support as if you were a customer

Newsletter

Subscribe

Join 54,000+ data-minded IT professionals. Get regular updates from the CloverDX blog. No spam. Unsubscribe anytime.

Related articles

Back to all articles
buying data integration software
Data Architecture
7 min read

Dos and don'ts when buying a data integration platform

Continue reading
Data architecture health check - do you have these symptoms?
Data Architecture
7 min read

Data architecture health check: Do you have these symptoms?

Continue reading
What is modern enterprise data architecture?
Data Architecture
5 min read

What is modern enterprise data architecture?

Continue reading
CloverDX logo
Book a demo
Get the free trial
  • Company
  • Our story
  • Contact
  • Partners
  • Our partners
  • Become a partner
  • Product
  • Platform overview
  • Plans & Pricing
  • Customers
  • By Use Case
  • By Industry
  • Deployment
  • On-premise
  • AWS
  • Azure
  • Google Cloud
  • Services
  • Onboarding & Training
  • Professional Services
  • CloverCARE Support
  • Resources
  • Customer Portal
  • Documentation
  • Downloads & Licenses
  • Webinars
  • Academy & Training
  • Release Notes
  • CloverDX Forum
  • CloverDX Blog
  • Tech Blog
  • Other resources
Blog
Why data accessibility is essential for your digital transformation
Data Integration
4 barriers to making data-driven decisions
Data Strategy
4 steps to providing a data-driven customer experience
Data Integration
Implementing data democratization: 3 ways to make your data more accessible
Data Innovation
© 2023 CloverDX. All rights reserved.
  • info@cloverdx.com
  • sales@cloverdx.com
  • ●
  • Legal
  • Privacy Policy
  • Cookie Policy
  • EULA
  • Support Policy