<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>privacy | Math to Power Industry</title><link>https://m2pi.ca/keywords/privacy/</link><atom:link href="https://m2pi.ca/keywords/privacy/index.xml" rel="self" type="application/rss+xml"/><description>privacy</description><generator>Wowchemy (https://wowchemy.com)</generator><language>en-us</language><copyright>© 2025 Pacific Institute for the Mathematical Sciences</copyright><lastBuildDate>Tue, 24 Mar 2026 00:00:00 +0000</lastBuildDate><image><url>https://m2pi.ca/media/logo.svg</url><title>privacy</title><link>https://m2pi.ca/keywords/privacy/</link></image><item><title>One Child Every Child</title><link>https://m2pi.ca/project/2026/onechildeverychild/</link><pubDate>Tue, 24 Mar 2026 00:00:00 +0000</pubDate><guid>https://m2pi.ca/project/2026/onechildeverychild/</guid><description>&lt;p>
&lt;figure >
&lt;div class="d-flex justify-content-center">
&lt;div class="w-100" >&lt;img alt="" srcset="
/project/2026/onechildeverychild/OCEC_huce7ae0c3cde9b86bd1a12c5bc6d654e6_172919_bc7504d4a7466406be33bd5f42561708.webp 400w,
/project/2026/onechildeverychild/OCEC_huce7ae0c3cde9b86bd1a12c5bc6d654e6_172919_bce72567446a0e517caf2ba80a609063.webp 760w,
/project/2026/onechildeverychild/OCEC_huce7ae0c3cde9b86bd1a12c5bc6d654e6_172919_1200x1200_fit_q90_h2_lanczos_3.webp 1200w"
src="https://m2pi.ca/project/2026/onechildeverychild/OCEC_huce7ae0c3cde9b86bd1a12c5bc6d654e6_172919_bc7504d4a7466406be33bd5f42561708.webp"
width="760"
height="277"
loading="lazy" data-zoomable />&lt;/div>
&lt;/div>&lt;/figure>
&lt;/p>
&lt;p>This project examines how re-identification risk varies within datasets,
particularly for individuals with unique or complex diagnostic profiles, and
identifies factors beyond average sample-level risk that contribute to
vulnerability. It aims to assess how data type, dataset structure, and existing
safeguards influence re-identification risk, with a focus on informing
responsible data sharing practices for research involving children with complex
diagnoses.&lt;/p>
&lt;h2 id="introduction">Introduction&lt;/h2>
&lt;p>Re-identification risk is disproportionately shouldered by
individuals with unique data profiles (i.e., outliers), including children with
complex diagnostic profiles. There is also limited literature on the factors
that might influence re-identification risk beyond the risk to the whole sample
on average.&lt;/p>
&lt;p>With the increase in requirements for data sharing, it is important to
understand re-identification risk within a sample, but also the
re-identification risk of individuals who are unique. Considerations should be
made that there are some risks that are not within the control of researchers
(e.g., knowing whether someone is in a study or not; a family might willingly
share this information). Thus, extra consideration should be made for those
things that can be controlled by the research team while appreciating the
importance of data sharing and utilization of research data.&lt;/p>
&lt;h2 id="goal">GOAL&lt;/h2>
&lt;p>The goal of this study is to determine factors contributing to re-identification
risk in large datasets and their impact on children with complex diagnostic
profiles.&lt;/p>
&lt;h3 id="aim-1">Aim 1&lt;/h3>
&lt;p>What kind of data is at risk for re-identification and what recommendations have
been made within the literature?&lt;/p>
&lt;p>With the utilization of publicly available data to train AI models, it is
becoming increasingly important for researchers to clearly communicate to
research participants how their data will be used and the risk associated with
data sharing. The first aim of this study will look at the existing literature
and assess the recommendations that have been made to limit re-identification
risk in research data. This aim will also address the existing literature that
has evaluated re-identification risk in specific types of data and the
differences in recommendations based on data type (i.e., MRI, genetic,
behavioural, etc.).&lt;/p>
&lt;h3 id="aim-2">Aim 2&lt;/h3>
&lt;p>How does the shape of the data influence
re-identification risk within a randomly generated sample and is this comparable
to a sample generated based on known variable relationships?&lt;/p>
&lt;h4 id="aim-21">Aim 2.1&lt;/h4>
&lt;p>What factors contribute to greater re-identification risk in a sample of
randomly generated data Assuming that re-identification risk is influenced by
several factors including&lt;/p>
&lt;ol>
&lt;li>the number of possible combinations of variables (# of possible unique
&amp;lsquo;profiles&amp;rsquo;)&lt;/li>
&lt;li>the size of the sample&lt;/li>
&lt;li>the variability of each of the variables (i.e., how likely are there to be
outliers, especially &amp;rsquo;extreme&amp;rsquo; outliers), and&lt;/li>
&lt;li>the number of shared variables which could be considered identifiable.&lt;/li>
&lt;/ol>
&lt;p>The shape of the dataset could influence re-identification risk and by
understanding the relationship between the shape of the dataset and
re-identification risk we can determine the risk level of data sharing.
Particularly for individuals who might have ‘unique’ data profiles which might
be more likely to be re-identified.&lt;/p>
&lt;h4 id="aim-22">Aim 2.2&lt;/h4>
&lt;p>Does a sample that is
based on known variable relationships (co-occurrence between neurodevelopmental
disorders) behave the same way as a fully synthetic dataset?&lt;/p>
&lt;p>One use case where re-identification risk is of particular concern in within
populations of children with unique diagnostic profiles (unique combinations of
diagnoses). Given the high rate of co-occurrence in these disorders, it is
common for a child to present with multiple diagnoses. Determine the
co-occurrence rates for several neurodevelopmental disorders (Autism, ADHD, OCD,
CD, ODD, etc.)&lt;/p></description></item></channel></rss>