Frequently Asked Questions

Product Overview & Use Cases

What is Data Society's solution for risk mitigation at the Inter-American Development Bank (IDB)?

Data Society partnered with the Inter-American Development Bank (IDB) to develop a machine-learning-based model that proactively identifies infrastructure projects with significant risk factors. This solution enables decision-makers to take preemptive actions to save time, reduce budget overruns, and secure project success. The model leverages open-source contracting data standards (OCDS) and is deployed across 26 countries, helping IDB standardize risk mitigation processes at scale. Source

How does Data Society's machine learning model work for infrastructure risk mitigation?

The model uses advanced machine learning techniques, including gradient boosting and cross-validation, to analyze contract data and predict which projects are likely to exceed original contracting limits. It addresses data imbalance by applying ROSE and SMOTE oversampling techniques, improving accuracy for modified contracts. The solution includes a user-friendly application for non-technical users to browse and investigate high-risk contracts. Source

What is the Open Contracting Data Standard (OCDS) and how is it used in Data Society's solution?

The Open Contracting Data Standard (OCDS) is an internationally accepted set of guidelines for publishing data about government contracts, developed by the Open Contracting Partnership. Data Society leverages OCDS to efficiently scale its modeling framework, standardize contract risk analysis, and ensure consistent data governance across IDB's 26-country deployment. Source

What industries does Data Society serve?

Data Society serves a wide range of industries, including government, financial services, energy & utilities, media, healthcare, education, retail, aerospace & defense, professional services & consulting, and telecommunications. The IDB case study specifically highlights applications in government and finance. For more details, see Data Society's Case Studies Page.

Features & Capabilities

What are the key features of Data Society's risk mitigation solution for IDB?

Key features include:

Source

Does Data Society's solution integrate with other platforms or tools?

Yes, Data Society offers seamless integrations with platforms such as Power BI, Tableau, ChatGPT, and Copilot. These integrations enable organizations to streamline data access, improve collaboration, and reduce manual work, ensuring efficient and scalable workflows. Source

Pain Points & Problem Solving

What problems does Data Society's risk mitigation solution solve for organizations like IDB?

Data Society's solution addresses several core challenges:

Source

How does Data Society's approach to solving these pain points differ from other providers?

Data Society differentiates itself by offering tailored, scalable solutions that leverage open data standards and advanced machine learning. Its approach includes hands-on, instructor-led training, seamless integration with existing systems, and a focus on empowering both technical and non-technical users. Data Society also provides frameworks for robust governance and measurable ROI, setting it apart from generic AI and data solution providers. Source

Implementation & Onboarding

How easy is it to implement Data Society's risk mitigation solution?

Data Society's solutions are designed for quick and efficient implementation. Organizations can start with a focused project by equipping a small, cross-functional team with tools and support, ensuring fast adoption and learning. The onboarding process is streamlined, with live, instructor-led training and tailored learning paths. Automated training and assessment systems require minimal maintenance, and regular updates are automated. Training can be delivered online or in-person, with cohorts capped at 30 participants for active engagement. Source

What support and training does Data Society provide during and after implementation?

Data Society offers extensive support, including:

These resources ensure customers can maintain and optimize their systems effectively. Source

Performance, Metrics & Results

What measurable results has Data Society achieved for IDB and similar clients?

For IDB, Data Society's solution enabled standardized risk mitigation across 26 countries, helping decision-makers proactively identify and address high-risk infrastructure projects. Globally, Data Society's products have delivered measurable ROI, such as 0,000 in annual cost savings (see HHS CoLab case study) and improved healthcare access for 125 million people (see Optum Health case study). Source

What key performance indicators (KPIs) are associated with Data Society's risk mitigation solutions?

Relevant KPIs include:

Source

Security & Compliance

Is Data Society's solution secure and compliant with industry standards?

Yes, Data Society is ISO 9001:2015 certified, demonstrating its commitment to quality management and continuous improvement. This certification ensures that solutions meet stringent standards for reliability and quality, providing assurance about security and compliance. For more details, visit Data Society's security and compliance page.

Target Audience & Roles

Who can benefit from Data Society's risk mitigation solutions?

Data Society's solutions are designed for a diverse range of roles and organizations, including:

Source

Using Predictive Modeling To Secure International Development

Scaling Risk Mitigation through Machine Learning at Inter-American Development Bank

  • 26

    Countries – IDB's standardized deployment reach

  • 9/10

    Infrastructure projects with cost overruns

  • $9.5 trillion

    Global annual government contracts

Client Profile

Bio:

The Inter-American Development Bank (IDB) is a cooperative development bank with a mission to accelerate the economic and social development of its Latin American and Caribbean member countries. The bank provides financing in the form of loans and grants.

  • Industry: Government (NGO), Finance
  • Headquarters: Washington, DC
  • Size: 2,000 employees

At a Glance

Decision-makers at the Inter-American Development Bank were looking to create an effective way to mitigate risks in their infrastructure investment portfolio. To achieve this, the bank partnered with Data Society to create an innovative, machine-learning-based model that would enable decision-makers to proactively identify projects with significant risk factors and take preemptive actions to save time and budget and to secure project success.

The Challenge

Historically, conventional statistical models have afforded bankers risk-related insights, but couldn’t predict the success and failure of infrastructure projects overall. Studies of infrastructure projects throughout the world find that 9 out of 10 experienced cost overruns, which vary by sector and average between 20% and 45% of baseline costs (Flyvbjerg, 2007). The Inter-American Development Bank (IDB) wanted to test its hypothesis that subtle factors can impact the delivery and budget of an infrastructure effort. 

The Solution

Data Society utilized machine learning to develop a model that can accurately determine which projects are likely to exceed the original contracting limits, enabling decision-makers to proactively identify projects with significant risk factors and take preemptive actions to mitigate risks. The new solution was used to determine success and potential failure of IDB’s undergoing public construction projects in Paraguay.

User-Friendly Application

In order to make the analysis simple for a non-technical audience to digest, Data Society developed an easy-to-use application that allows users to browse IDB contracts and investigate ones that presented the most risk. 

Leveraging the Benefits of Open Sourcing

Open-source data enabled Data Society to most efficiently scale the modeling framework. The Open Contracting Partnership – a consortium of hundreds of stakeholders across government, business, and civil societies with support from The World Bank – developed the Open Contracting Data Standard (OCDS), an internationally accepted set of guidelines for contracting data, to provide a common format for the publication of data about the ~USD 9.5 trillion in annual government contracts awarded globally.

Data Society leveraged the OCDS’s standard data model that reflects the way in which governments structure and award contracts. The Item Classification Scheme (ICS) within the OCDS includes various fields that describe the specific items included in the procurement. Through such classification, OCDS offers a particular standard that is useful in categorizing items with a unique ID, which helps create a standard system for contract risk analysis. The application allows users to browse IDB contracts and investigate ones that are flagged as high risk, as well as explore the patterns in the raw data.

Modeling Approach

When first examining IDB’s infrastructure project contracts, Data Society recognized that there was a distribution imbalance that had to be rectified in order to produce accurate forecasting results for the solution. Research found that there were 80-85% unmodified contracts in our data set and 15-20% modified contracts. Data Society’s models were achieving high accuracy by mostly predicting that a contract would not be modified. Data Society further trained the model on the few contracts that did have modifications. The team utilized ROSE and SMOTE oversampling techniques to bring our modified contract percentage up to ~30-40%. Data Society then used the re-balanced data to develop the final model.

After testing numerous machine learning techniques, Data Society selected a gradient boosting model optimized via cross-validation. The designed algorithm used an ensemble of weak learners, and built them sequentially to obtain a strong learner. Data Society then applied cross-validation to minimize the effects of randomness. This allowed our analysts and IDB stakeholders access to a more accurate idea of the performance of the model that captured the real-world phenomena driving the outcomes of infrastructure projects, which has been especially useful for the imbalanced datasets on which the model was built.

The Results:

Data Society created a scalable machine-learning-driven modeling framework to specifically identify projects with higher than average risk levels. What made our approach scalable is our use of Open Contracting Data Standards (OCDS) when pulling the data. 

Data Society created a comprehensive database and data schema to gather the data necessary to operate at a larger scale. For the IDB and its client countries, adherence to a data standard is an imperative policy and data governance prerequisite. Machine learning was applied to risk management and contract structuring, primarily useful when starting with a comprehensive data set.

To deploy at scale, a standardized pipeline was designed for the data extract transform load (ETL) process, and this process evolves over four steps:

Web scraped or API data collection 
Transform data into usable format 
Load data into scalable cloud system 
Run model to inform underwriting

  • Web scraped or API data collection 
  • Transform data into usable format 
  • Load data into scalable cloud system 
  • Run model to inform underwriting

Ultimately, applying this consistent data governance framework and reporting format, standardizing processes across the 26 countries it serves, the IDB is able to leverage machine learning capabilities to automatically flag infrastructure projects that are at risk of not meeting expectations.

  • 26

    Countries – IDB's standardized deployment reach

  • 9/10

    Infrastructure projects with cost overruns

  • $9.5 trillion

    Global annual government contracts

Don’t wanna miss any Data Society Resources?

Stay informed with Data Society Resources—get the latest news, blogs, press releases, thought leadership, and case studies delivered straight to your inbox.

Subscription Form