Hello š
Mapping your data domains is probably the most underestimated task in terms of complexity. Because what could be simpler than drawing boxes on a slide representing your big groups of data?
Play the Chief Data Officer simulation game
Get my Data Governance templates
Discover my consulting services
Join +270 readers of my ebook āData Governance : where to start?ā
Get a boost with a 4 weeks training on Generative AI
The Data Governance Training Day is back ! You can already apply for the next edition that will be on July 17th.
Full remote. Paris time. Fun and engaging discussions on Data Governance from strategy, buy-in to operationalisation and culture change.
Only 6 people by edition.
Letās see how to handle it š
Agenda
The beautiful idea
Cracking the code
The cheatsheet
The beautiful idea
When you start your Data Governance program, there's always a moment of paralysis at the beginning, a mountain to climb. Because you donāt know where to start among the tons of data stored in the warehouse, in various systems, or even at third-parties.
Thatās where the idea of data domain is great.
A data domain is a logical grouping of data, owned and managed by a specific part of the business. Each domain has its own data, processes and experts.
It gives a clear structure in your big data library, like having large sections.
And this means that you can prioritize some data domains to start your governance initiatives with a smaller scope.
Itās elegant, manageable, reassuring for the mind.
But let me tell you itās far from being an easy exercise⦠usually takes months.
š Letās take an example to understand the complexity.
Taking āCustomers,ā many questions will pop up :
Is it part of Sales, or is it a separate domain?
Who owns the data about customer behaviors? Marketing or Sales?
Should it be a āmaster dataā as every team claims to own it?
Suddenly, you find overlaps, data that fits in multiple places, or worse, data that doesnāt clearly belong anywhere. Teams have different definitions for the same thing. Ownership debates begin.
And thatās how a beautiful idea on paper can get ugly in reality š
Cracking the code
Iāve refined some kind of method, but needless to say that I donāt have the magic recipe. And if I did, I would never give it publiclyā¦
So this is it, there are 3 ways to do a data domain mapping :
1ļøā£ Functions-based data domains : organizes data domains based on business functions or departments.
Why? It aligns with existing team structures, making it easier to assign data owners and stewards.
š Use this method to start your mapping. Your Comex members should find easily their domain. You have a VP of Sales? Create a Sales domain. A CFO? Create a Finance domain. Keep in mind that these can be your macro domains, you might split them in domains and sub domains after.
2ļøā£ Data-value based domains : organizes data based on its strategic value, focusing on critical assets that drive business decisions.
Why? It helps prioritize governance efforts on high-value data and enables better cross-functional decision-making.
š If there is already a data team, you can start by checking how theyāve organized data during storage and transformations : is it datamarts? domain repos? Bronze/Silver/Gold zones? But like mentioned above, you will find it hard to assign a data owner to these domains at first.
3ļøā£ Process-based data domains : organizes data based on business processes and workflows rather than departments.
Why? It supports cross-functional integration, enabling better efficiency and workflow automation.
š This works well if youāve gone through major transformations like ERP integration and already have Business Process Owners. Or at least some documentation about your business processesā¦
Guess what, you can do hybrid too
A little bit of functions-based and value-based :
Define core valueābased subdomains
Start by identifying the principal business ānounsā or data centers of gravity : Customer, Order, Product, Invoice, etc. For each, agree on a data model (the attributes, types, lifecycles) that every team references. This ensures everyoneās speaking the same language about, say, what a Customerās āstatusā or āsegmentā means.Overlay functionābased domains
Next, carve these value domains into functional responsibilities :Order Processing owns the lifecycle transitions (create, validate, ship) of Order. And it is part of Sales domain.
Billing owns invoice generation and payment reconciliation. And Billing is part of Finance domain.
š Or you could go the other way around : start with big function-based domains and then add value-based subdomains inside.
As you can see, with these 3 methods in mind, youāll still have to crack the code according to your organization and constraints.
For data that belong anywhere (list of countries, currencies, calendar, etc.), you can have a āTransversalā domain. Be creative for the owner of this domain š
The cheatsheet
As this method is not magic nor easy to implement, letās review some tips to make it less painful.
ā DOāS
Write the why and vision
Explain the need for your domain framework, take feedback from data team members first - you can use my templates to start your domain framework ;)
Build a first draft of the mapping based on your organization chart (the functions-based method!)
Use available documentation on your business process and data sources to improve your first draft
Do 1-1 sessions with each data owner identified so they can help you with the validation of domains and the mapping of sub-domains
Adjust your draft after each session
Challenge the mapping globally to add value-based domains like Customers or Products, especially if these are master data for your company
Have each data owner assign their domainās data stewards
ā DONāTS
Donāt do it for the sake of doing it, it should serve a purpose
Donāt do the mapping at data element level thatās a waste of time
Donāt listen to only one executive, you need a holistic vision
Donāt take 6 months before showing your mapping, do it iteratively
Donāt start with the systems layer, itās not the point - applications owners already exist in IT department, what you want is business/data owner
Donāt let domains overlap or compete : clarify meaning, assign attributes to different domains if necessary
Donāt think that there is an ideal number of domains that you should not exceed, there is absolutely no rules !
On the last one, fine, Iāll give a rough number : you should have around 15-25 domains depending on the size and complexity of your company.
See you soon,
Charlotte
I'm Charlotte Ledoux, freelance in Data & AI Governance.
You can follow me on Linkedin !
Charlotte, this is a really good breakdown to understanding how to categorize data into specific data domains. The aforementioned is a topic I find myself getting stuck on and not understanding how to identify within my non-profit organization. This provides clarity and the why behind each domain. Again, you've provided insight in connecting the dots that makes the complexities of the Data Governance Framework and bit more digestible. Thank you!