Gregory Shoup is a Customer Success Analyst at Zengines, where he leads client onboarding and delivers ongoing training and professional services to help customers realize the full value of the platform. With a unique blend of technical and consulting experience, he enhances the customer experience.
Previously, Gregory was a Principal Consultant at ACA Group, managing GIPS compliance projects for leading asset managers. He also completed a 500-hour full-stack engineering fellowship at General Assembly, building applications with modern frameworks and APIs.
Gregory’s background in data analysis, compliance, and software development makes him a vital partner to Zengines clients navigating complex system migrations and data conversions.
Data lineage is the process of tracking data usage within your organization. This includes how data originates, how it is transformed, how it is calculated, its movement between different systems, and ultimately how it is utilized in applications, reporting, analysis, and decision-making. This is a crucial capability for any modern ecosystem, as the amount of data businesses generate and store increases every year.
As of 2024, 64% of organizations manage at least one petabyte of data — and 41% have at least 500 petabytes of information within their systems. In many industries, like banking and insurance, this includes legacy data that spans not just systems but eras of technology.
As the data volume grows, so does the need to aid the business with trust in access to that data. Thus, it is important for companies to invest in data lineage initiatives to improve data governance, quality, and transparency. If you’re shopping for a data lineage tool, there are many cutting-edge options. The cloud-based Zengines platform uses an innovative artificial intelligence-powered model that includes data lineage capabilities to support clean, consistent, and well-organized data.
Whether you go with Zengines or something else, though, it’s important to be strategic in your decision-making. Here is a step-by-step process to help you choose the best data lineage tools for your organization’s needs.
Start by ensuring your selection team has a thorough understanding of not just data lineage as a concept but also the requirements that your particular data lineage tools must have.
First, consider core data lineage tool functionalities that every company needs. For example, you want to be able to access a clear visualization of the relationship between complex data across programs and systems at a glance. Impact analysis also provides a clear picture of how change will influence your current data system.
In addition, review technology-specific data-lineage needs, such as the need to ingest legacy codebases like COBOL. Compliance and regulatory requirements vary from one industry to the next, too. They also change often. Make sure you’re aware of both business operations needs and what is expected of the business from a compliance and legal perspective.
Also, consider future growth. Can the tool you select support the data as you scale? Don’t hamstring momentum down the road by short-changing your data lineage capabilities in the present.
When you begin to review specific data lineage tools, you want to know what features to prioritize. Here are six key areas to focus on:
Keep these factors in mind and make sure whatever tool you choose satisfies these basic requirements.
Along with specific features, you want to assess how easy it is to implement the tool and how easy it is to use the tool.
Start with setup. Consider how well each data lineage software solution is designed to implement within and configure to your system. For businesses that built technology solutions before the 1980s, you may have critical business operations that run on mainframes. Make sure a data lineage tool will be able to easily integrate into a complex system before signing off on it.
Consider the learning curve and usability too. Does the tool have an intuitive interface? Are there complex training requirements? Is the information and operation accessible?
When considering the cost of a data lineage software solution, there are a few factors to keep in mind. Here are the top elements that can influence expenses when implementing and using a tool like this over time:
Make sure to consider costs, benefits, TCO and ROI when assessing your options.
If you’re looking for a comprehensive assessment of what makes the Zengines platform stand out from other data lineage solutions, here it is in a nutshell:
Our automated solutions create frictionless, sped-up solutions that reduce risk, lower costs, and create more accessible data lineage solutions.
As you assess your data lineage tool choices, keep the above factors in mind. What are your industry and organizational requirements? Focus on key features like automation and integration capabilities. Consider implementation, training, user experience, ROI, and comprehensive cost analyses.
Use this framework to help create stakeholder buy-in for your strategy. Then, select your tool with confidence, knowing you are organizing your data’s past to improve your present and lay the groundwork for a more successful future.
If you have any follow-up questions about data lineage and what makes a software solution particularly effective and relevant in this field, our team at Zengines can help. Reach out for a consultation, and together, we can explore how to create a clean, transparent, and effective future for your data.
In today's increasingly regulated financial landscape, banks and financial institutions face mounting pressure to ensure complete visibility and traceability of their Critical Data Elements (CDEs). While regulatory frameworks like BCBS 239, CDD, and CIP establish clear requirements for data governance, many organizations struggle with implementation, particularly when critical information resides within decades-old mainframe systems.
These legacy environments have become the Achilles' heel of compliance efforts, with opaque data flows and hard-to-decipher COBOL code creating significant blind spots. Zengines Mainframe Data Lineage product offers a revolutionary solution to this challenge, providing unparalleled visibility into "black box" systems and transforming regulatory compliance from a time-consuming burden into an efficient, streamlined process.
For banks and financial services firms, managing Critical Data Elements (CDEs) is no longer optional - it's a fundamental regulatory requirement with significant implications for compliance, risk management, and operational integrity. Regulations like BCBS 239, the Customer Due Diligence (CDD) Rule, and the Customer Identification Program (CIP) mandate that financial institutions not only identify their critical data but also understand its origins, transformations, and dependencies across all systems.
However, for institutions with legacy mainframe systems, this presents a unique challenge. These "black box" environments, often powered by decades-old COBOL code spread across thousands of modules, make tracing data lineage a time-consuming and error-prone process. Without the right tools, financial institutions face substantial risks, including regulatory penalties, audit failures, and compromised decision-making.
"Financial institutions today are trapped between regulatory demands for data transparency and legacy systems that were never designed with this level of visibility in mind. At Zengines, we've created Mainframe Data Lineage to bridge this gap, turning black box mainframes into transparent, auditable systems that satisfy even the most stringent CDE requirements." - Caitlyn Truong, CEO, Zengines
Many financial institutions operate with legacy mainframe technology that can contain up to 80,000 different COBOL modules, each potentially containing thousands of lines of code. This complexity creates several critical challenges for CDE compliance:
"The challenge with mainframe environments isn't that the data isn't there—it's that it's buried in thousands of COBOL modules and complex code paths that would take months to manually trace. Zengines automates this process, reducing what would be weeks of research into minutes of interactive exploration." - Caitlyn Truong, CEO, Zengines
Zengines Mainframe Data Lineage product is purpose-built to solve compliance challenges like these by bringing transparency to legacy systems. By automatically analyzing and visualizing mainframe data flows, it enables financial institutions to meet regulatory requirements without the traditional manual effort.
Zengines ingests COBOL modules, JCL code, SQL, and other mainframe components to automatically map relationships between data elements across your entire mainframe environment. This comprehensive approach ensures that no critical data element remains untraced.
Instead of manually tracing through thousands of lines of code, Zengines provides interactive visualizations that instantly show:
This visualization capability is particularly valuable during regulatory examinations, allowing institutions to demonstrate compliance with confidence and clarity.
For BCBS 239 compliance, institutions must understand and validate calculation methodologies for risk data aggregation. Zengines automatically extracts and presents calculation logic in human-readable format, making it simple to verify that risk metrics are computed correctly.
When regulators question why certain customer records received specific treatment (critical for CDD and CIP compliance), Zengines can immediately identify the conditional logic that determined the data path, showing exactly which business rules were applied and why.
Zengines provides detailed metrics about your mainframe environment, helping compliance teams understand the scope and complexity of systems containing critical data elements.
"When regulators ask where a critical value came from or how it was calculated, financial institutions shouldn't have to launch a massive investigation. With Zengines Mainframe Data Lineage, they can answer these questions confidently and immediately, transforming their compliance posture from reactive to proactive." - Caitlyn Truong, CEO, Zengines
Financial institutions using Zengines Mainframe Data Lineage have experienced transformative results in their regulatory compliance activities:
While regulatory compliance drives initial adoption, financial institutions discover additional strategic benefits from implementing Zengines Mainframe Data Lineage:
"What we've discovered working with financial services firms is that CDE compliance isn't just about satisfying regulators—it's about fundamentally understanding your own critical data. Our Mainframe Data Lineage solution doesn't just help banks pass audits; it gives them unprecedented insight into their own operations." - Caitlyn Truong, CEO, Zengines
For financial institutions struggling with CDE compliance across legacy systems, Zengines offers a proven path forward. The implementation process is designed to be non-disruptive, with no modifications required to your existing mainframe environment.
The journey to compliance begins with a simple assessment of your current mainframe landscape, followed by automated ingestion of your code base. Within days, you'll have unprecedented visibility into your critical data elements – transforming your compliance posture from reactive to proactive.
In today's regulatory environment, financial institutions can no longer afford the uncertainty and risk associated with "black box" mainframe systems. Zengines Mainframe Data Lineage brings the transparency and traceability required not just to satisfy regulators, but to operate with confidence in an increasingly data-driven industry.
In today's rapidly evolving technology landscape, organizations with legacy mainframe systems face increasing pressure to modernize. Whether driven by cost concerns, skills shortages, or the need for greater agility, mainframe modernization has become a strategic imperative.
However, there's no one-size-fits-all approach. Let's explore the various paths to modernization and how platforms like Zengines can help you navigate this complex journey.
The various ways organizations are approaching mainframe modernization include:
What it is: Moving mainframe applications to new hardware with minimal code changes, often to cloud infrastructure.
What it is: Migrating applications to a new platform while making moderate modifications to the code.
What it is: Automatically converting legacy code (like COBOL) to modern languages like Java or C#.
What it is: Restructuring existing code without changing external behavior.
What it is: Redeveloping applications from scratch using modern languages and architectures.
What it is: Abandoning legacy applications for newer commercial off-the-shelf solutions.
Data migration remains "the highest risk during any systems change" according to industry experts. Organizations face numerous challenges including:
Before embarking on any modernization journey, organizations need to understand their current systems deeply. This process becomes particularly challenging with legacy mainframes that have been operating for decades with limited documentation and dwindling expertise.
Zengines tackles the two most critical aspects of mainframe modernization:
Zengines' Mainframe Data Lineage technology illuminates the "black box" of legacy systems by:
This deep visibility allows organizations to understand how their current systems work before they attempt migration, preventing costly errors and unexpected outcomes.
Zengines accelerates data migration through:
Organizations using Zengines can complete data migration tasks in minutes rather than months, dramatically reducing the time, cost, and risk associated with modernization projects.
Mainframe modernization is a complex journey with multiple potential paths. The right approach depends on your organization's specific goals, timeline, budget, and risk tolerance.
What's universal, however, is the need to understand your legacy systems and data thoroughly before making changes. With Zengines, organizations gain both the deep visibility into their current mainframe operations and the powerful tools to migrate data efficiently and accurately.
By reducing the highest-risk aspects of modernization, Zengines helps organizations avoid becoming another cautionary tale of failed transformations and instead realize the full benefits of their technology investments.
Connect with our team to learn more about how we’re supporting some of the largest and most complex mainframe modernizations today.