OPENLINEAGE: UNVEILING DATA LINEAGE FOR MODERN DATA ECOSYSTEMS

29/08/2024/by Datalumen

Understanding the journey of data from its source to its final destination is crucial for businesses and organizations. This journey, known as data lineage, has become increasingly complex with the proliferation of data sources, transformation processes, and analytical tools. Enter OpenLineage, an open-source standard that aims to simplify and standardize data lineage tracking across diverse data ecosystems.

What is data lineage?

Data lineage is the process of tracing the journey of data from its origin to its destination, tracking every transformation, processing step, and the tools or systems it interacts with along the way. With data flowing through increasingly complex architectures, the ability to accurately map and understand these movements is vital for ensuring data quality, compliance, and operational efficiency.

However, tracking data lineage is no small feat, especially with the explosion of data sources, analytics platforms, and transformation tools that make up modern data stacks.

What is OpenLineage?

OpenLineage is an open standard for data lineage collection and analysis. Initiated by Datakin and now part of the Linux Foundation, OpenLineage provides a set of standardized definitions and APIs that allow different tools and platforms in the data ecosystem to share lineage metadata in a consistent format.

The primary goal of OpenLineage is to create a unified approach to collecting and utilizing data lineage information. By establishing a common language for data lineage, OpenLineage enables better interoperability between various data tools, platforms, and processes.

Key Components of OpenLineage

OpenLineage Specification: This defines the core concepts and data model for representing lineage metadata. It includes definitions for jobs, datasets, runs, and the relationships between them.
Integration Libraries: OpenLineage provides libraries and SDKs for popular data processing frameworks like Apache Spark, Apache Airflow, and dbt. These integrations allow developers to easily instrument their data pipelines to emit lineage events.
API: The OpenLineage API defines how lineage events should be structured and transmitted. This standardization ensures that all tools speaking the OpenLineage language can understand and process lineage data consistently.
Facets: These are extensible metadata attributes that can be attached to core OpenLineage entities, allowing for custom metadata to be included in lineage information.

Why should I care about this?

Standardization and Interoperability	One of the most significant advantages of OpenLineage is its ability to standardize lineage data across different tools and platforms. This standardization enables seamless integration between various components of a data stack, from data ingestion tools to transformation engines and analytics platforms. As a result, organizations can build a comprehensive view of their data lineage without being locked into a single vendor or tool.
Enhanced Data Governance and Compliance	With the increasing importance of data privacy regulations like GDPR and the AI Act, understanding data lineage is crucial for compliance. OpenLineage makes it easier to track the flow of sensitive data across systems, helping organizations ensure that data is handled in accordance with regulatory requirements. This comprehensive lineage information also aids in auditing processes and demonstrating compliance to regulatory bodies.
Improved Trust	By providing visibility into the entire data pipeline, OpenLineage helps data teams identify and resolve data quality issues more efficiently. When inconsistencies or errors are discovered, teams can quickly trace the problem back to its source, understanding all the transformations and processes the data has undergone. This transparency builds trust in the data and the insights derived from it.
Efficient Troubleshooting and Debugging	When issues arise in data pipelines or analytics, OpenLineage’s detailed lineage information becomes invaluable. Data engineers and analysts can trace the path of data through various systems, identifying where problems may have occurred. This capability significantly reduces the time and effort required for troubleshooting, leading to faster resolution of data-related issues.
Support for Data Cataloging and Metadata Management	OpenLineage integrates seamlessly with data catalogs and metadata management tools. By providing rich lineage information, it enhances the capabilities of these tools, allowing for more comprehensive documentation of data assets. This integration supports better data discovery, understanding, and utilization across the organization.

Conclusion

OpenLineage represents a significant step forward in the field of data lineage and metadata management. By providing a standardized, open-source approach to tracking data lineage, it addresses many of the challenges faced by modern data-driven organizations. From improving data governance and quality to enhancing troubleshooting capabilities and fostering collaboration, OpenLineage offers a wide range of benefits.

As data ecosystems continue to grow in complexity, tools like OpenLineage will become increasingly crucial. Organizations that adopt OpenLineage can expect to gain a competitive edge through better data management, increased efficiency, and improved data-driven decision-making capabilities.

The open nature of the project ensures that it will continue to evolve and improve, driven by the needs of the data community. As more tools and platforms adopt the OpenLineage standard, we can expect to see even greater interoperability and capabilities in the future of data lineage tracking.

CONTACT US

Need expert support with your data agenda? Discover how Datalumen can help you.

Contact us and start our data conversation

ESTABLISHING ROBUST DATA LITERACY – FROM AWARENESS TO ACTION

21/08/2024/by Datalumen

Data literacy is no longer a niche skill reserved for data professionals. It’s becoming a core competency required for all employees in forward-looking organizations. Data literacy — the ability to read, write, and communicate data in context — is essential for making informed decisions, driving innovation, and fostering a data-driven culture across the enterprise. It is crucial not only to equip employees with the necessary skills but also to foster a shared mindset and language around data.

The Imperative of a Data Literacy Program

Launching a data literacy program isn’t just about offering a few training sessions. It requires a comprehensive approach that touches every level of the organization. This is an opportunity to grow and amplify an understanding of data management and with extension also artificial intelligence (AI) (and other emerging technologies) within the organization. As these capabilities become increasingly integrated into business processes, the need for an organization that can interpret and leverage these technologies, in an ethical and compliant way, becomes even more critical.

To help organizations successfully launch and sustain a data literacy program, here are some key steps:

Craft a Strong Argument for Transformation
Before embarking on a data literacy initiative, it’s vital to establish a compelling reason for change. This involves articulating the strategic importance of data literacy to the organization’s future, aligning the program’s goals with business objectives, and gaining buy-in from leadership and stakeholders. A well-defined case for change will serve as the foundation for all subsequent efforts.
Build a Solid Program Foundation with Targeted Pilots
Starting small with targeted pilots can help demonstrate the value of data literacy initiatives. These pilots should be designed to address specific business challenges and provide measurable outcomes. By focusing on practical applications, organizations can build momentum and create a sustainable foundation for the program.
Showcase and Celebrate Successes
Highlighting success stories is crucial for building credibility and inspiring broader participation. By showcasing examples of how data literacy has led to positive business outcomes, organizations can encourage more employees to engage with the program. This also helps reinforce the importance of data literacy across the organization.
Foster Connections and Support Isolated Teams
In any organization, there are often key individuals or teams who may feel disconnected from the broader data culture. Connecting these communities and providing them with the support they need is essential for fostering a sense of belonging and encouraging active participation in the data literacy program. This can be achieved through internal networks, forums, or mentoring programs.
Integrate Across the Organization to Achieve Sustainable Transformation
An effective data literacy program should be integrated with other data culture and training initiatives within the organization. By connecting these efforts, organizations can ensure that employees have access to a cohesive set of resources and training opportunities, enabling them to continuously build their skills and knowledge. Ultimately, the goal is to deliver lasting benefits to the organization, including not only improving individual skills but also embedding a data-driven mindset into the company’s culture. Over time, a strong data culture will lead to better decision-making, increased innovation, and a competitive advantage in the marketplace.

The Path Forward

As organizations continue to navigate the complexities of the digital age, the importance of data literacy cannot be overstated. By following these six steps, companies can build a data literacy program that empowers their employees, drives cultural transformation, and ensures long-term success in an increasingly data-driven world.

Investing in data literacy is not just about upskilling employees; it’s about preparing the entire organization for the future. Whether you’re just starting on this journey or looking to enhance existing efforts, it is fundamental to approach data literacy with intention, commitment, and a clear vision for the future.

CONTACT US

Need expert support with your data agenda? Discover how Datalumen can help you.

Contact us and start our data conversation

AUGMENTED DATA QUALITY: AN AI-FUELED APPROACH FOR YOUR DATA ZEN MOMENT

14/03/2024/by Datalumen

Data’s effectiveness hinges on its quality and here’s where Augmented Data Quality (ADQ) steps in, revolutionizing how we ensure our information assets are accurate, reliable, and ready to use.

Traditional Data Quality: A Manual Marathon

For years, data quality relied on automated but nevertheless manual processes. Data stewards meticulously combed through datasets, identifying and correcting errors like inconsistencies, missing values, and formatting issues. This painstaking approach, while crucial, becomes increasingly inefficient as data volumes explode.

Augmented Data Quality: AI-Powered Efficiency

Augmented Data Quality tackles this challenge head-on by leveraging artificial intelligence (AI) and machine learning (ML). These powerful tools automate data quality tasks, freeing up human experts for more strategic endeavors.

Here’s how ADQ makes a difference:

Automated anomaly detection: AI algorithms can scan huge datasets, pinpointing anomalies and potential errors that might escape manual analysis.
Intelligent data cleansing: ADQ can suggest corrections for identified issues, streamlining the cleaning process. Machine learning even allows the system to “learn” from past corrections, continuously improving its accuracy.
Proactive monitoring: ADQ can be configured for real-time monitoring, enabling early detection and rectification of data quality issues before they impact downstream processes.

Benefits Beyond Efficiency

The advantages of ADQ extend far beyond simply saving time and resources. Here’s what organizations can expect:

Enhanced data trust: ADQ fosters a culture of data trust within an organization. With a high degree of confidence in data quality, employees across departments can make informed decisions based on reliable information.
Improved decision-making: Clean, accurate data leads to better insights. ADQ empowers businesses to leverage data for strategic planning, risk management, and optimized operations.
Reduced costs: Data quality issues can lead to costly rework and missed opportunities. ADQ proactively addresses these challenges, minimizing associated costs.

Conclusion

ADQ represents a significant step forward in data management. By harnessing the power of AI and automation, organizations can unlock the full potential of their data assets. As data continues to be the cornerstone of success, ADQ will be a critical differentiator for businesses that prioritize reliable information and data-driven decision making.

CONTACT US

In need for support with your Data Quality initiatives? Discover how Datalumen can help you getting there.

Contact us and start our data conversation

AI & DATA GOVERNANCE: THE INTERSECTION YOU CAN’T MISS TO MAKE AI RESPONSIBLE & TRUSTWORTHY

05/03/2024/by Dimitri Maesfranckx

Artificial Intelligence (AI) has become a transformative force across industries, offering significant benefits such as increased efficiency, personalized services, and better decision-making. However, the adoption of AI also raises ethical, legal, and social concerns, necessitating effective governance mechanisms. AI governance involves establishing policies, regulations, and best practices to ensure the responsible development, deployment, and use of AI. A crucial aspect of AI governance is data governance, which focuses on managing and ensuring the quality, security, and ethical use of data.

The Importance of Data Governance for AI

Data governance is the foundation of any AI system, as AI models rely on data to learn, make predictions, and provide insights. The quality, diversity, and fairness of the data used in AI models significantly impact the accuracy, reliability, and fairness of AI outcomes. Therefore, robust data governance is essential for building trustworthy AI systems that deliver value while respecting ethical considerations and legal requirements.

Effective Data Governance for Trustworthy AI

Effective data governance includes several key elements:

Data quality:
Ensuring the accuracy, completeness, consistency, and timeliness of data used in AI models is crucial for generating reliable outcomes. Data cleansing, validation, and normalization techniques can help improve data quality.
Data security:
Protecting data from unauthorized access, theft, and misuse is essential for maintaining trust and complying with data protection regulations. Encryption, access controls, and monitoring can help ensure data security.
Data privacy:
Respecting individuals’ privacy rights and complying with data protection regulations, such as GDPR, is essential for ethical AI development. Techniques such as differential privacy, data anonymization, and user consent management can help protect individual privacy.
Data bias and fairness:
Ensuring that data used in AI models is representative, unbiased, and free from discrimination is critical for building fair and equitable AI systems. Techniques such as bias detection, mitigation, and fairness-aware machine learning can help address data bias and promote fairness.
Data provenance and transparency:
Providing clear documentation and explanations of data sources, processing, and usage is essential for building trust and accountability in AI systems. Techniques such as data lineage, model cards, and interpretability methods can help improve data and model transparency.

AI Governance: Building on Data Governance Foundations

Effective AI governance builds on these data governance principles and includes additional considerations:

AI model transparency and explainability:
Providing clear explanations and justifications for AI model outcomes is essential for building trust, ensuring accountability, and facilitating auditability. Techniques such as SHAP, LIME, and decision trees can help improve model explainability.
AI model validation and testing:
Ensuring the accuracy, reliability, and robustness of AI models through rigorous testing, validation, and monitoring is crucial for building trust and ensuring safe and effective AI systems. Techniques such as cross-validation, stress testing, and model monitoring can help ensure model performance and reliability.
AI model risk management:
Identifying, assessing, and mitigating risks associated with AI models, such as safety, security, and reputational risks, is essential for responsible AI development. Techniques such as risk assessment frameworks, risk mitigation plans, and incident response plans can help manage AI risks.
AI ethics and social responsibility:
Ensuring that AI systems align with ethical principles, such as fairness, accountability, transparency, and social responsibility, is crucial for building trust and ensuring societal acceptance. Techniques such as ethical frameworks, social impact assessments, and multi-stakeholder engagement can help promote AI ethics and social responsibility.

Conclusion

AI governance and data governance are interconnected and interdependent, as effective data governance is essential for building trustworthy AI systems. By adopting robust data and AI governance practices, organizations can ensure the responsible development, deployment, and use of AI systems, while delivering value, building trust, and maintaining compliance with legal and ethical requirements. As AI continues to evolve and transform industries, effective governance will be crucial for achieving responsible and trustworthy AI that delivers long-term value and benefits for all stakeholders.

CONTACT US

In need for responsible & trustworthy AI? Discover how Datalumen can help you getting there.

Contact us and start our data conversation

AGILE DATA GOVERNANCE – THE SMART WAY TO UPGRADE YOUR DATA DYNAMICS?

28/02/2024/by Datalumen

In the dynamics of today’s business, data is key for organizational vitality. While the imperative of data-driven decision-making is paramount, traditional old school data governance methodologies can prove ponderous, impeding progress. Enter agile data governance, a transformative paradigm inspired by principles from agile software development.

Understanding Agile Data Governance

Agile data governance represents a contemporary and adaptable approach to data management, drawing inspiration from the agility of software development methodologies. It prioritizes collaboration, adaptability, and continual improvement, aiming to streamline decision-making and enhance communication across diverse departments and stakeholders.

Traditional Data Governance – The challenges & the case for the agile approach

Conventional data governance potentially encounters several challenges:

Sluggish Processes: Extensive documentation and prolonged approval cycles can substantially delay data initiatives.
Inflexibility: Rigid frameworks struggle to keep pace with the ever-evolving demands of the business.
Top-Down Structure: Lack of collaboration leads to isolated information, hindering effective data utilization.
Low Engagement: Complex procedures create disconnection and discouragement among data users.

Agile Data Governance – Distinct Advantages

Accelerated Value Realization: Break down extensive governance projects into manageable sprints for swift implementation and feedback loops, ensuring alignment with evolving needs. Prioritize business value at each stage, concentrating on crucial data elements and processes for rapid wins and showcasing the value of data governance to stakeholders.
Collaboration as a Cornerstone: Cultivate an environment where data producers and consumers collaborate, fostering a shared understanding of data definitions, usage guidelines, and ownership for improved data quality and accuracy. Leverage open communication channels and collaborative tools to encourage discussions, feedback, and shared ownership, dismantling silos and nurturing a data-driven culture.
Embracing Continuous Enhancement: Adopt an agile mindset, emphasizing learning and adaptation based on feedback to keep the data governance framework relevant, efficient, and aligned with changing business landscapes and technological advancements. Regularly review and refine policies and procedures based on real-world experiences and user feedback, ensuring ongoing effectiveness and support for organizational evolution.
Empowering Teams: Move away from a top-down, bureaucratic approach by equipping team members with the knowledge and tools needed to make data-informed decisions within defined boundaries. Promote ownership and accountability among data users, instilling a sense of responsibility for data quality and compliance, thereby fostering an engaged and data-driven workforce.

Implementing Agile Data Governance – Key Steps

While there is no one-size-fits-all approach, consider these key steps:

Define business goals and objectives, clearly understanding desired outcomes from adopting an agile data governance framework.
Identify key stakeholders and roles, involving data owners, stewards, consumers, and Business & IT representatives in the process.
Prioritize data assets and processes, focusing on critical data elements aligned with business goals.
Develop an iterative framework with clear principles, roles, responsibilities, and communication channels.
Establish a continuous improvement process, regularly reviewing framework effectiveness and adapting based on feedback and emerging needs.
Make optimal usage of fit-for-purpose tooling. While success isn’t solely dictated by technology, its impact on the degree to which agile data governance can be implemented is undeniable. It’s crucial to have a business-centric platform rather than one solely focused on IT to ensure a flexible and collaborative approach.

Conclusion

By embracing an agile approach to data governance, organizations can unlock the full potential of their data assets. Increased collaboration, faster time to value, and a culture of continuous improvement empower teams to make data-driven decisions and drive innovation in today’s dynamic business environment. Embark on your journey toward an agile data governance mindset and harness the power of data to propel your organization to success.

CONTACT US

Interested in elevating your data governance initiative to the next level? Discover how Datalumen can assist you getting there.

Contact us and start our data conversation

CHANGE & DATA GOVERNANCE – TAKE A LEAP FORWARD

19/05/2022/by Datalumen

A successful data governance initiative is based on properly managing the People, Process, Data & Technology square. The most important element of these four is undoubtedly People. The reason for that is that at the end it boils down to people in your organization to act in a new business environment. This always implies change so make sure that you have an enabling framework for managing also the people side of change. Prepare, support and equip individuals at different levels in your organization to drive change and data governance success.

Change & the critical ingredient for data governance success.

Change is crucial in the success or failure of a data governance initiative for two reasons:

1First of all you should realize that with data governance you are going to tilt an organization. What we mean by this is that the situation before data governance is usually a silo-oriented organization. Individual employees, teams, departments, etc are the exclusive owner of their systems and associated data. With the implementation of data governance you will tilt that typical vertical data approach and align data flows with business processes that also run horizontally through an entire organization. This means that you need to help the organization to arrive at an environment where the data sharing & collaboration concept is the new normal.

2The second important reason is the so-called data governance heartbeat. What we see in many organizations is that there is a lot of enthusiasm at the start of a program. However, without the necessary framework, read also a change management plan, you run the fundamental risk that such an initiative will eventually die a silent death. People lose interest, no longer feel involved, no longer see the point of it. From that perspective, it is necessary to create a framework that keeps data governance’s heart beating.

How to approach change?

Change goes beyond training & communication. To facilitate the necessary changes, ChangeLab and Datalumen designed the ADKAR-based LEAP approach. LEAP is an acronym that stands for Learn, Envision, Apply & Poll. Each of these important steps help realize successful and lasting change.

LEARN

• Organizational culture and risk mitigation culture; “Readiness” for change

• Scope and type of the (sub)projects

• Timing and path for the change

• Set up the change team

• Stakeholder assessment & impact analysis (personnel analysis)

• Sponsorship analysis

• Risk assessment

ENVISION

APPLY

PERSIST

Need help covering change in the context of your data initiatives?

Would you like to find out how Datalumen can also help you with your Data Governance initiative? Contact us and start our data conversation.

CALCULATING DATA GOVERNANCE ROI

08/04/2022/by Dimitri Maesfranckx

THE GDPR BUSINESS VALUE ROADMAP

08/12/2021/by Datalumen

Getting a good understanding of the requirements but also the opportunities and business value is not easy. We designed a GDPR business value roadmap to help you with this and also make you understand what capabilities you need to get the job done.

1

2

3

4

Complete the form and download this Datalumen infogram (A3 PDF).

The Datalumen privacy policy can be consulted here.

More info on our Advisory Services?
Would you like to know what Datalumen can also mean to your GDPR or other data governance initiatives?

Have a look at our GDPR or Data Governance,
contact us and start our Data Conversation.

https://www.datalumen.eu/wp-content/uploads/2018/03/Datalumen-GDPR-Business-Value-Roadmap.jpeg 1709 2560 Datalumen https://www.datalumen.eu/wp-content/uploads/2019/06/datalumen_color_V1.1-340-156B.png Datalumen2021-12-08 11:09:112022-05-25 16:19:47THE GDPR BUSINESS VALUE ROADMAP

SUMMER READING TIP

21/08/2018/by Datalumen

Summer is here and the longer days it brings means more time available to spend with a ripping read. That’s how it ideally works at least. We selected 3 valuable books worth your extra time.

The Chief Data Officer’s Playbook

The issues and profession of the Chief Data Officer (CDO) are of significant interest and relevance to organisations and data professionals internationally. Written by two practicing CDOs, this new book offers a practical, direct and engaging discussion of the role, its place and importance within organisations. Chief Data Officer is a new and rapidly expanding role and many organisations are finding that it is an uncomfortable fit into the existing C-suite. Bringing together views, opinions and practitioners experience for the first time, The Chief Data Officer’s Playbook offers a compelling guide to anyone looking to understand the current (and possible future) CDO landscape.

Search on Google

Data Virtualization: Going Beyond Traditional Data Integration to Achieve Business Agility

Data Virtualization: Going Beyond Traditional Data Integration to Achieve Business Agility, the first book ever written on the topic of data virtualization, introduces the technology that enables data virtualization and presents ten real-world case studies that demonstrate the significant value and tangible business agility benefits that can be achieved through the implementation of data virtualization solutions. The book introduces the relationship between data virtualization and business agility but also gives you a more thorough exploration of data virtualization technology. Topics include what is data virtualization, why use it, how it works and how enterprises typically adopt it.

Search on Google

Start With Why

Simon Sinek started a movement to help people become more inspired at work, and in turn inspire their colleagues and customers. Since then, millions have been touched by the power of his ideas, including more than 28 million who’ve watched his TED Talk based on ‘Start With Why’ — the third most popular TED video of all time. Sinek starts with a fundamental question: Why are some people and organizations more innovative, more influential, and more profitable than others? Why do some command greater loyalty from customers and employees alike? Even among the successful, why are so few able to repeat their success over and over?

People like Martin Luther King, Steve Jobs, and the Wright Brothers had little in common, but they all started with Why. They realized that people won’t truly buy into a product, service, movement, or idea until they understand the Why behind it. ‘Start With Why’ shows that the leaders who’ve had the greatest influence in the world all think, act, and communicate the same way — and it’s the opposite of what everyone else does. Sinek calls this powerful idea The Golden Circle, and it provides a framework upon which organizations can be built, movements can be led, and people can be inspired. And it all starts with Why.

Search on Google

Summer Giveaways
We’re giving away 50 copies of ‘Data Virtualization: Going Beyond Traditional Data Integration to Achieve Business Agility’. Want to win? Just complete the form and cross your fingers. Good luck!

Winners are picked randomly at the end of the giveaway. Our privacy policy is available here.

https://www.datalumen.eu/wp-content/uploads/2018/08/Datalumen-Summer-Reading-2018-ORIG-Covers.png 1114 2560 Datalumen https://www.datalumen.eu/wp-content/uploads/2019/06/datalumen_color_V1.1-340-156B.png Datalumen2018-08-21 14:29:422020-09-22 07:52:32SUMMER READING TIP

GARTNER SURVEY FINDS CHIEF DATA OFFICERS ARE DELIVERING BUSINESS IMPACT AND ENABLING DIGITAL TRANSFORMATION

06/12/2017/by Datalumen

By 2021, the CDO Role Will Be the Most Gender Diverse of All Technology-Affiliated C-level Positions.

As the role of chief data officer (CDO) continues to gain traction within organizations, a recent survey by Gartner, Inc. found that these data and analytics leaders are proving to be a linchpin of digital business transformation.

The third annual Gartner Chief Data Officer survey was conducted July through September 2017 with 287 CDOs, chief analytics officers and other high-level data and analytics leaders from across the world. Respondents were required to have the title of CDO, chief analytics officer or be a senior leader with responsibility for leading data and/or analytics in their organization.

“While the early crop of CDOs was focused on data governance, data quality and regulatory drivers, today’s CDOs are now also delivering tangible business value, and enabling a data-driven culture,” said Valerie Logan, research director at Gartner. “Aligned with this shift in focus, the survey also showed that for the first time, more than half of CDOs now report directly to a top business leader such as the CEO, COO, CFO, president/owner or board/shareholders. By 2021, the office of the CDO will be seen as a mission-critical function comparable to IT, business operations, HR and finance in 75 percent of large enterprises.”

The survey found that support for the CDO role and business function is rising globally. A majority of survey respondents reported holding the formal title of CDO, revealing a steady increase over 2016 (57 percent in 2017 compared with 50 percent in 2016). Those organizations implementing an Office of the CDO also rose since last year, with 47 percent reporting an Office of the CDO implemented (either formally or informally) in 2017, compared with 23 percent fully implemented in 2016.

“The steady maturation of the office of the CDO underlines the acceptance and broader understanding of the role and recognizes the impact and value CDOs worldwide are providing,” said Michael Moran, research director at Gartner. “The addition of new talent for increasing responsibilities, growing budgets and increasing positive engagement across the C-suite illustrate how central the role of CDO is becoming to more and more organizations.”

Budgets are also on the rise. Respondents to the 2017 survey report an average CDO office budget of $8 million, representing a 23 percent increase from the average of $6.5 million reported in 2016. Fifteen percent of respondents report budgets more than $20 million, contrasting with 7 percent last year. A further indicator of maturity is the size of the office of the CDO organization. Last year’s study reported total full time employees at an average of 38 (not distinguishing between direct and indirect reporting), while this year reports an average of 54 direct and indirect employees, representing the federated nature of the office of the CDO design.

Key Findings

CDO shift from defense to offense to drive digital transformation

With more than one-third of respondents saying “increase revenue” is a top three measure of success, the survey findings show a clear bias developing in favor of value creation over risk mitigation as the key measure of success for a CDO. The survey also looked at how CDOs allocate their time. On a mean basis, 45 percent of the CDO’s time is allocated to value creation and/or revenue generation, 28 percent to cost savings and efficiency, and 27 percent to risk mitigation.
“CDOs and any data and analytics leader must take responsibility to put data governance and analytics principles on the digital agenda. They have the right and obligation to do it,” said Mario Faria, managing vice president at Gartner.
CDO are responsible for more than just data governance
According to the survey, in 2017, CDOs are not just focused on data as the title may imply. Their responsibilities span data management, analytics, data science, ethics and digital transformation. A larger than expected percentage of respondents (36 percent) also report responsibility for profit and loss (P&L) ownership. “This increased level of reported responsibility by CDOs reflects the growing importance and pervasive nature of data and analytics across organizations, and the maturity of the CDO role and function,” said Ms. Logan.
In the 2017 survey, 86 percent of respondents ranked “defining data and analytics strategy for the organization” as their top responsibility, up from 64 percent in 2016. This reflects a need for creating or modernizing data and analytics strategies within an increasing dependence on data and insights within a digital business context.
CDO are becoming impactful change agents leading the data-driven transformation
The survey results provided insight into the kind of activities CDOs are taking on in order to drive change in their organizations. Several areas seem to have a notable increase in CDO responsibilities compared with last year:
Serving as a digital advisor: 71 percent of respondents are acting as a thought leader on emerging digital models, and helping to create the digital business vision for the enterprise.
Providing an external pulse and liaison: 60 percent of respondents are assessing external opportunities and threats as input to business strategy, and 75 percent of respondents are building and maintaining external relationships across the organization’s ecosystem.
Exploiting data for competitive edge: 77 percent of respondents are developing new data and analytics solutions to compete in new ways.
CDO are diverse and tackling a wide array of internal challenges
Gartner predicts that by 2021, the CDO role will be the most gender diverse of all technology-affiliated C-level positions and the survey results reflect that position. Of the respondents to Gartner’s 2017 CDO survey who provided their gender, 19 percent were female and this proportion is even higher within large organizations — 25 percent in organizations with worldwide revenue of more than $1 billion. This contrasts with 13 percent of CIOs who are women, per the 2018 Gartner CIO Agenda Survey. When it comes to average age of CDOs, 29 percent of respondents said they were 40 or younger.
The survey respondents reported that there is no shortage of internal roadblocks challenging CDOs. The top internal roadblock to the success of the Office of the CDO is “culture challenges to accept change” — a top three challenge for 40 percent of respondents in 2017. A new roadblock, “poor data literacy,” debuted as the second biggest challenge (35 percent), suggesting that a top CDO priority is ensuring commonality of shared language and fluency with data, analytics and business outcomes across a wide range of organizational roles. When asked about engagement with other C-level executives, respondents ranked the relationship with the CIO and CTO as the strongest, followed by a broad, healthy degree of positive engagement across the C-Suite.

More info on our Advisory Services?
Would you like to know what Datalumen can mean to your CDO Office?

Have a look at our Services Offering,
contact us and start our Data Conversation.

https://www.datalumen.eu/wp-content/uploads/2017/12/Datalumen-CDO.jpeg 1812 2560 Datalumen https://www.datalumen.eu/wp-content/uploads/2019/06/datalumen_color_V1.1-340-156B.png Datalumen2017-12-06 20:14:212018-04-24 10:13:48GARTNER SURVEY FINDS CHIEF DATA OFFICERS ARE DELIVERING BUSINESS IMPACT AND ENABLING DIGITAL TRANSFORMATION

OPENLINEAGE: UNVEILING DATA LINEAGE FOR MODERN DATA ECOSYSTEMS

What is data lineage?

What is OpenLineage?

Key Components of OpenLineage

Conclusion

ESTABLISHING ROBUST DATA LITERACY – FROM AWARENESS TO ACTION

The Imperative of a Data Literacy Program

The Path Forward

AUGMENTED DATA QUALITY: AN AI-FUELED APPROACH FOR YOUR DATA ZEN MOMENT

AI & DATA GOVERNANCE: THE INTERSECTION YOU CAN’T MISS TO MAKE AI RESPONSIBLE & TRUSTWORTHY

The Importance of Data Governance for AI

Effective Data Governance for Trustworthy AI

AI Governance: Building on Data Governance Foundations

Conclusion

AGILE DATA GOVERNANCE – THE SMART WAY TO UPGRADE YOUR DATA DYNAMICS?

Understanding Agile Data Governance

Traditional Data Governance – The challenges & the case for the agile approach

Agile Data Governance – Distinct Advantages

Implementing Agile Data Governance – Key Steps

Conclusion

CHANGE & DATA GOVERNANCE – TAKE A LEAP FORWARD

Change & the critical ingredient for data governance success.

How to approach change?

CALCULATING DATA GOVERNANCE ROI

SUMMER READING TIP

Summer is here and the longer days it brings means more time available to spend with a ripping read. That’s how it ideally works at least. We selected 3 valuable books worth your extra time.

The Chief Data Officer’s Playbook

Data Virtualization: Going Beyond Traditional Data Integration to Achieve Business Agility

Start With Why

Summer Giveaways

GARTNER SURVEY FINDS CHIEF DATA OFFICERS ARE DELIVERING BUSINESS IMPACT AND ENABLING DIGITAL TRANSFORMATION

By 2021, the CDO Role Will Be the Most Gender Diverse of All Technology-Affiliated C-level Positions.

Key Findings

CDO shift from defense to offense to drive digital transformation

CDO are responsible for more than just data governance

CDO are becoming impactful change agents leading the data-driven transformation

CDO are diverse and tackling a wide array of internal challenges

Services

Solutions

About Us

Posts

What is data lineage?

What is OpenLineage?

Key Components of OpenLineage

Conclusion

The Imperative of a Data Literacy Program

The Path Forward

The Importance of Data Governance for AI

Effective Data Governance for Trustworthy AI

AI Governance: Building on Data Governance Foundations

Conclusion

Understanding Agile Data Governance

Traditional Data Governance – The challenges & the case for the agile approach

Agile Data Governance – Distinct Advantages

Implementing Agile Data Governance – Key Steps

Conclusion

Change & the critical ingredient for data governance success.

How to approach change?

Summer is here and the longer days it brings means more time available to spend with a ripping read. That’s how it ideally works at least. We selected 3 valuable books worth your extra time.

The Chief Data Officer’s Playbook

Data Virtualization: Going Beyond Traditional Data Integration to Achieve Business Agility

Start With Why

Summer Giveaways

By 2021, the CDO Role Will Be the Most Gender Diverse of All Technology-Affiliated C-level Positions.

Key Findings

CDO shift from defense to offense to drive digital transformation

CDO are responsible for more than just data governance

CDO are becoming impactful change agents leading the data-driven transformation

CDO are diverse and tackling a wide array of internal challenges

Services

Solutions

About Us