Data first: The critical foundation for your GenAI Launchpad

Blogs and Articles

Generative AI has opened up new frontiers in how the government executes its mission, but GenAI is only as good as the data that powers it. How strong is your data foundation?

Mary Ellen Buzzelli
Mary Ellen Buzzelli
28 May 20257  mins
Data First: The Critical Foundation for Your GenAI Launchpad

In the past year, Generative AI (GenAI) has dominated media headlines for its potential to drastically transform the way organisations design and deliver digital services. While government agencies have already embraced AI in the form of chat bots, social media monitoring, predictive policy and more, the capacity of GenAI to analyse vast amounts of data, simulate human-like reasoning, and adapt to dynamic situations has opened up new frontiers in how the government executes its mission.

In fact, in a recent survey from NASCIO, 53 percent of state CIO respondents stated GenAI will be the most impactful emerging IT area in the next 3-5 years. NASCIO’s annual list of State CIO Top Ten Priorities for 2024 even includes AI for the first time ever.

It’s clear that state and local governments are excited about GenAI, but are they ready to harness it for real value?

Building your GenAI Launchpad

According to NASCIO, only 27 percent of state CIOs stated they have mature governance over enterprise information management - a critical and foundational element for GenAI implementation. Like many emerging tech applications, GenAI is only as good as the information it’s reasoning over, and that information has to be packaged and fed to GenAI in formats it can understand. That is, structured, digitised and accessible data able to be used for analysing, sharing and informing decision-making.

Data is the lifeblood that fuels innovation. A strong data foundation acts as the cornerstone for developing powerful and reliable GenAI models. Here are four keys to establishing a solid data foundation:

  1. Digitise and Standardise Data:
    1. Digitise Paper Records: Convert paper records into digital formats using optical character recognition (OCR) to automatically generate meaningful metadata.
    2. Standardise Digital Data: Digitised records and existing digital data should be ingested into a centralised repository. Standardisation requires organising data in a uniform manner, such as using a common file format, naming conventions, and data structure.
  2. Implement Data Quality Control:
    1. Cleanse and Validate Data: Identify and rectify errors, inconsistencies, and inaccuracies in your data. Proper digitisation will provide this automatically, with a human in the loop to ensure accuracy.
    2. Enforce Data Entry Standards: Establish guidelines for entering new data, including validation rules, review processes, and day-forward paper digitisation to prevent errors at the source.
  3. Create Metadata and Taxonomies:
    1. Generate Metadata: Go beyond record-level metadata to develop metadata for each dataset, including information about data origin, creation date, and any relevant contextual details. This metadata is crucial for understanding the data's meaning and lineage.
    2. Build Taxonomies: Create structured classifications or taxonomies that define relationships and categorisations within your data. This helps organise information and facilitates more efficient searching and analysis.
  4. Implement Data Governance:
    1. Define Data Ownership and Responsibility: Clearly assign ownership and responsibility for different datasets within your organisation. This ensures accountability and helps maintain data quality.
    2. Establish Data Governance Policies: Develop and enforce policies regarding data access, security, and privacy. This includes defining who has permission to modify or delete data and setting guidelines for data sharing.

While each state’s AI roadmap must be tailored to its specific needs, strategic plans and priorities, including these important considerations ensures the establishment of a solid data foundation for the seamless integration of AI into state IT initiatives. As we see more states implement GenAI, we gain greater insights into the best practices in governing the technology. Jurisdictions around the world have already published GenAI policies, acting as leading examples for those that will soon follow.

The future of GenAI’s use in government holds enormous promise, but it is the responsibility of state and local leaders to prioritise the foundational data elements to make GenAI a safe and strategic asset.

To learn more about how Iron Mountain can help catalyse your GenAI journey, get in touch!