• Blog
  • Podcast
  • Contact
  • Sign in
CloverDX Logo
Product
  • OVERVIEW
  • Discover CloverDX Data Integration Platform###Automate data pipelines, empower business users.
  • Deploy in Cloud
  • Deploy on Premise
  • Deploy on Docker
  • Plans & Pricing
  • Release Notes
  • Documentation
  • Customer Portal
  • More Resources
  • CAPABILITIES
  • Sources and Targets###Cloud and On-premise storage, Files, APIs, messages, legacy sources…
  • AI-enabled Transformations###Full code or no code, debugging, mapping
  • Automation & Orchestration###Full workflow management and robust operations
  • MDM & Data Stewardship###Reference data management
  • Manual Intervention###Manually review, edit and approve data
  • ROLES
  • Data Engineers###Automated Data Pipelines
  • Business Experts###Self-service & Collaboration
  • Data Stewards###MDM & Data Quality
clip-mini-card

 

Ask us anything!

We're here to walk you through how CloverDX can help you solve your data challenges.

 

Request a demo
Solutions
  • Solutions
  • On-Premise & Hybrid ETL###Flexible deployment & full control
  • Data Onboarding###Accelerate setup time for new data
  • Application Integration###Integrate operational data & systems
  • Replace Legacy Tooling###Modernize slow, unreliable or ad-hoc data processes
  • Self-Service Data Prep###Empower business users to do more
  • MDM & Data Stewardship###Give domain experts more power over data quality
  • Data Migration###Flexible, repeatable migrations - cloud, on-prem or hybrid
  • By Industry
  • SaaS
  • Healthcare & Insurance
  • FinTech
  • Government
  • Consultancy
zywave-3

How Zywave freed up engineer time by a third with automated data onboarding

Read case study
Services
  • Services
  • Onboarding & Training
  • Professional Services
  • Customer Support

More efficient, streamlined data feeds

Discover how Gain Theory automated their data ingestion and improved collaboration, productivity and time-to-delivery thanks to CloverDX.

 

Read case study
Customers
  • By Use Case
  • Analytics and BI
  • Data Ingest
  • Data Integration
  • Data Migration
  • Data Quality
  • Data Warehousing
  • Digital Transformation
  • By Industry
  • App & Platform Providers
  • Banking
  • Capital Markets
  • Consultancy & Advisory
  • E-Commerce
  • FinTech
  • Government
  • Healthcare
  • Logistics
  • Manufacturing
  • Retail
Migrating data to Workday - case study
Case study

Effectively Migrating Legacy Data Into Workday

Read customer story
Company
  • About CloverDX
  • Our Story & Leadership
  • Contact Us
  • Partners
  • CloverDX Partners
  • Become a Partner
Pricing
Demo
Trial

What is a data vault?

Data Architecture
Posted April 19, 2022
4 min read
What is a data vault?

A data vault is a database modelling methodology and architecture, created by Dan Lindstedt in the 1990s. In his own words:

‘The data vault is a detail-oriented, historical tracking and uniquely linked set of normalized tables that support one or more functional areas of business.’

Lindstedt created the model in an attempt to ease issues surrounding scalability, flexibility, and data mining in data warehouses.

It’s considered a hybrid of the ‘third normal form’ and ‘dimensional modeling’ (or star schema) techniques, and stores all of your data. It’s often the model of choice for source systems that are subject to change over time.

But this is only skimming the surface of data vaults. Let’s explore how this technique works and how it can benefit your business.

How does it work?

The data vault technique separates each business function or entity into ‘business keys’. These keys form the overarching structure of the data vault, and the data warehouse takes shape around them. As these keys are largely unchanging functions, it offers some stability against organizational transformations.

In a typical data vault structure, the framework consists of three key aspects:

  • Hubs. These hubs contain a list of stable business keys. For instance, employees, or license type.
  • Links. These represent relationships and transactions between hubs. As an example, you might create a link between EMPLOYEE_HUB and SOFTWARE_LICENSE_HUB when an employee has a license to a certain platform.
  • Satellites. Finally, satellites contain temporary and descriptive metadata around hubs and links. This extra information aids businesses with historical tracking and auditing. For instance, a satellite might contain contact details for a customer.

Every dataset within a data vault structure is essentially ‘time stamped’. This means you can determine load date and origin information.

4 data vault benefits

‘The design is flexible, scalable, consistent and adaptable to the needs of the enterprise. It is a data model that is architected specifically to meet the needs of enterprise data warehouses.’ - Dan Linstedt

The data vault methodology solves complexities around data warehousing and is resistant to change.

The top four benefits of the model include:

1. Scalability

As the volume of data increases, data warehouses can become increasingly inflexible. It then becomes difficult to add functionalities to your current datasets without breaking (and subsequently remodeling) anything. This presents a tricky blocker to businesses that anticipate change over time.

The data vault technique allows you to build upon your structure over time without putting your historical data at risk. You can continually add or delete links and satellites as you go.

2. Auditability and historical tracking

As a data vault hinges on historical metadata, you can also benefit from easy auditability. You load the data into the structure without any cleansing. This means you can trace the source and lineage of each data set, including when it was loaded and from where.

This is a crucial benefit for heavily regulated businesses, such as those within the financial sector.

3. Fast loading

Because you don’t need to cleanse data before placing it in your data vault, you can load large volumes of data at speed.

What’s more, you can load this data in parallel, as there are very few dependencies between your hubs, links, and satellites during the loading process.

4. Enterprise-wide oversights

It’s straightforward to get a bird’s eye view of your organization’s data. Particularly as it’s organized by business functions.

This makes it simple for various departments to dip into the data vault and access any relevant information.

When to use the data vault model

More often than not, you’ll want to use a data vault when you need to audit your data. As each row in every table requires load metadata, you can easily access current and historical data. This allows you to trace data origins, loading dates, and changes.

But the model’s use cases go beyond this. Data vaults can also be useful when:

  • You need to get the hard facts. Data experts often describe vaults as ‘one single source of the facts’.
  • You have multiple source systems. As the data vault structure is very flexible, it works particularly well for organizations with multiple data sources or changing data sources and relationships. You can add hubs, satellites, and links as you go. Similarly, you can delete any links if relationships are no longer relevant.
  • You want to load data quickly. As we already stated, data vault models enable fast and simultaneous data loading at scale. This is particularly beneficial if your business handles a large amount of data, such as transactional data.

On the other hand, a data vault may not be the right choice if you:

  • Only have one source system or unchanging data relationships. This is because making changes should be relatively easy and wouldn’t warrant a vault structure.
  • Need to load data directly into a data reporting tool. This would require an added step of data manipulation in order to join up your various hub, satellite, and link tables. As well as taking time, this process is prone to errors.
  • Are on a tight budget. Of all the data warehousing methodologies available, building a data vault is the most expensive.

Is the data vault right for you?

For organizations that handle large amounts of data and regularly experience change, the data vault is a compelling framework.

Its flexibility, scalability and speed of loading allow companies to build an incremental, evolving, and auditable data structure as they grow. As you organize the vault by business entities, it’s simple to dive in and use the data for analysis. That said, this methodology might not be for everyone. Particularly if your purse strings are tight.

If you’d like to learn more about your data architecture options, we recommend reading our Guide to Enterprise Data Architecture

New call-to-action

Share

Facebook icon Twitter icon LinkedIn icon Email icon
Behind the Data  Learn how data leaders solve complex problems every day

Newsletter

Subscribe

Join 54,000+ data-minded IT professionals. Get regular updates from the CloverDX blog. No spam. Unsubscribe anytime.

Related articles

Back to all articles
buying data integration software
Data Architecture
7 min read

Dos and don'ts when buying a data integration platform

Continue reading
Data architecture health check - do you have these symptoms?
Data Architecture
7 min read

Data architecture health check: Do you have these symptoms?

Continue reading
What is modern enterprise data architecture?
Data Architecture
5 min read

What is modern enterprise data architecture?

Continue reading
CloverDX logo
Book a demo
Get the free trial
  • Company
  • Our Story
  • Contact
  • Partners
  • Our Partners
  • Become a Partner
  • Product
  • Platform Overview
  • Plans & Pricing
  • Customers
  • By Use Case
  • By Industry
  • Deployment
  • AWS
  • Azure
  • Google Cloud
  • Services
  • Onboarding & Training
  • Professional Services
  • Customer Support
  • Resources
  • Customer Portal
  • Documentation
  • Downloads & Licenses
  • Webinars
  • Academy & Training
  • Release Notes
  • CloverDX Forum
  • CloverDX Blog
  • Behind the Data Podcast
  • Tech Blog
  • CloverDX Marketplace
  • Other resources
Blog
The vital importance of data governance in the age of AI
Data Governance
Bringing a human perspective to data integration, mapping and AI
Data Integration
How AI is shaping the future of data integration
Data Integration
How to say ‘yes’ to all types of data and embark on a data-driven transformation journey
Data Ingest
© 2025 CloverDX. All rights reserved.
  • info@cloverdx.com
  • sales@cloverdx.com
  • ●
  • Legal
  • Privacy Policy
  • Cookie Policy
  • EULA
  • Support Policy