Talk
Beginner

From Data to Trust: Leveraging the Open Ethics Data Passport for Ethical AI

Imagine an AI system making critical decisions—approving loans, screening candidates, or recommending medical treatments. Behind the scenes, this AI learned from vast amounts of data, carefully labeled by humans or gathered from online sources. But what if the data it learned from carried hidden biases? What if those biases reflected the backgrounds, experiences, or unconscious prejudices of the people who labeled it? Without transparency, these ethical blind spots remain invisible, quietly shaping AI outcomes that affect real lives.

This is the problem we face today: AI models often inherit the biases embedded in their training data, yet little is known about how that data was created, who labeled it, or what ethical safeguards were applied. This lack of visibility makes it difficult for developers, auditors, and users to trust AI systems or ensure fairness.


Enter the Open Ethics Data Passport (OEDP), an open framework for documenting the origins of AI datasets. Think of it as a “passport” that travels with every dataset and model, recording where the data came from, how it was collected, cleaned, and annotated, and who the labelers were, including their expertise and potential influences.

In this session, I will walk you through how OEDP provides a standardized, open, and machine-readable documentation system that brings transparency to the dataset layer of AI development. You’ll learn how OEDP captures:

  • The provenance of datasets: their sources and collection methods

  • The annotation process: how data was labeled, including guidelines and quality controls

  • Labeler profiles: who annotated the data, their background, and possible biases

  • The data cleaning and preparation steps before training

  • The scope and ethical considerations shaping the dataset’s intended use
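As a rough illustration of what machine-readable documentation covering these five areas could look like, here is a minimal Python sketch. The class names, field names, and example values are all hypothetical and chosen only to mirror the bullets above; they are not the official OEDP schema, which is defined by the templates in the project's repository.

```python
import json
from dataclasses import dataclass, field, asdict


@dataclass
class LabelerProfile:
    # Hypothetical fields, not the official OEDP schema.
    role: str
    expertise: str
    possible_biases: list[str] = field(default_factory=list)


@dataclass
class DataPassport:
    # Each group of attributes mirrors one bullet from the list above.
    sources: list[str]                 # provenance: where the data came from
    collection_method: str             # provenance: how it was gathered
    annotation_guidelines: str         # annotation process
    quality_controls: list[str]        # annotation process
    labelers: list[LabelerProfile]     # labeler profiles
    cleaning_steps: list[str]          # cleaning and preparation
    intended_use: str                  # scope
    ethical_considerations: list[str]  # ethical considerations

    def to_json(self) -> str:
        # asdict() also converts the nested LabelerProfile objects to dicts.
        return json.dumps(asdict(self), indent=2, sort_keys=True)


# Made-up example values for illustration only.
passport = DataPassport(
    sources=["https://example.org/open-dataset"],
    collection_method="public web form",
    annotation_guidelines="two-pass labeling with written guidelines",
    quality_controls=["inter-annotator agreement check"],
    labelers=[
        LabelerProfile(
            role="volunteer",
            expertise="domain novice",
            possible_biases=["single-region annotator pool"],
        )
    ],
    cleaning_steps=["removed duplicate rows"],
    intended_use="research on text classification",
    ethical_considerations=["no personal data retained"],
)

print(passport.to_json())
```

Serializing to plain JSON keeps the record readable by both people and tools, which is the property the talk is concerned with; the exact structure would follow whatever the OEDP templates specify.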

I’ll explain how OEDP helps reveal hidden biases, promotes accountability, and enables better, more ethical AI decision-making by making dataset ethics visible and auditable.

![Open Ethics Data Passport Structure](https://openethics.ai/wp-content/uploads/2023/09/Open-Ethics-Data-Passport-OEDP-structure.png)

Beyond data documentation, OEDP is part of a broader Open Ethics ecosystem that also covers decision transparency and algorithmic explainability—all open-source and designed to work seamlessly with existing AI governance tools.


Who is this talk for?

  1. AI researchers and data scientists interested in improving transparency and ethical standards in training datasets.

  2. Open data advocates and practitioners working with publicly accessible datasets for responsible AI development.

  3. Policy makers, auditors, and ethics professionals seeking practical tools to assess and govern AI data fairness and accountability.


In the demo, I will showcase how easy it is to create an Open Ethics Data Passport for any open dataset, using the publicly available code and templates on GitHub. You’ll see how dataset publishers and AI developers can adopt OEDP to build trust and comply with emerging ethical standards.
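To give a sense of the kind of check a publisher or auditor might run once a passport exists, here is a small Python sketch that reports which documentation areas are missing or empty. It assumes a passport stored as JSON with one top-level key per area; the key names and the `missing_sections` helper are hypothetical and are not part of the OEDP code on GitHub.

```python
import json

# Hypothetical section names, one per documentation area; not the OEDP schema.
REQUIRED_SECTIONS = ("provenance", "annotation", "labelers", "cleaning", "scope")


def missing_sections(passport_json: str) -> list[str]:
    """Return the required sections that are absent or empty in a passport."""
    passport = json.loads(passport_json)
    return [name for name in REQUIRED_SECTIONS if not passport.get(name)]


# Made-up passport with an empty labeler list and no scope section.
example = json.dumps(
    {
        "provenance": {"sources": ["https://example.org/open-dataset"]},
        "annotation": {"guidelines": "written labeling guide"},
        "labelers": [],
        "cleaning": ["removed duplicate rows"],
    }
)

print(missing_sections(example))  # ['labelers', 'scope']
```

A check like this only confirms that each area has been filled in; judging whether the documented practices are adequate still requires human review.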

Join me to discover how the Open Ethics Data Passport is transforming the way we think about open data for AI—from invisible bias to visible ethics, empowering the community to build fairer, more transparent AI systems.

3 key takeaways:

  1. Understanding how the Open Ethics Data Passport (OEDP) enhances transparency and accountability in AI dataset documentation.

  2. Practical insights into implementing OEDP to uncover and mitigate biases in training data.

  3. Awareness of the broader Open Ethics framework and its role in promoting responsible, open-source AI governance.

Introducing a FOSS project or a new version of a popular project
Story of a FOSS project - from inception to growth
Knowledge Commons (Open Hardware, Open Science, Open Data etc.)
Tutorial about using a FOSS project
Which track are you applying for?
Open Data Devroom

Maybe as a lightning talk? OEDP is quite old now, but it hasn't seen much adoption.

Reviewer #1
Not Sure