<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Data Discovery</title><link>http://developer.docs.protegrity.com/docs/prod_features/data_discovery/</link><description>Recent content in Data Discovery</description><generator>Hugo</generator><language>en</language><lastBuildDate>Thu, 25 Jun 2026 05:41:58 +0000</lastBuildDate><atom:link href="http://developer.docs.protegrity.com/docs/prod_features/data_discovery/index.xml" rel="self" type="application/rss+xml"/><item><title>Data Discovery Architecture</title><link>http://developer.docs.protegrity.com/docs/prod_features/data_discovery/dd_arch/</link><pubDate>Tue, 20 Feb 2024 00:00:00 +0000</pubDate><guid>http://developer.docs.protegrity.com/docs/prod_features/data_discovery/dd_arch/</guid><description>&lt;p>Data Discovery is a powerful, developer-friendly feature. For more information, refer to the complete &lt;a href="https://docs.protegrity.com/data-discovery/2.0.0/docs/">Data Discovery documentation&lt;/a>.&lt;/p>
&lt;h3 id="overview">Overview&lt;/h3>
&lt;p>The Data Discovery Text Classification service detects Personally Identifiable Information (PII), Protected Health Information (PHI), and Payment Card Information (PCI) in plain-text and free-text inputs. Unlike traditional tools built for structured data, it targets dynamic, unstructured sources such as chatbot conversations, call transcripts, and Generative AI (GenAI) outputs.&lt;/p></description></item><item><title>What's New</title><link>http://developer.docs.protegrity.com/docs/prod_features/data_discovery/whats_new/</link><pubDate>Mon, 01 Jan 0001 00:00:00 +0000</pubDate><guid>http://developer.docs.protegrity.com/docs/prod_features/data_discovery/whats_new/</guid><description>&lt;h2 id="data-discovery">Data Discovery&lt;/h2>
&lt;ul>
&lt;li>Standardized the v2 APIs for Classify (text and tabular data) and Transform.&lt;/li>
&lt;li>Added new endpoints for API docs, log level management, and version information.&lt;/li>
&lt;li>Improved Context Provider and Pattern Provider AI models.&lt;/li>
&lt;li>Updated the default threshold of the Classify API to 0.7. The default threshold for v1.1 remains 0.0 for compatibility.&lt;/li>
&lt;li>Added usage metrics and per-language accuracy metrics.&lt;/li>
&lt;li>Extended PII detection to multiple Markdown dialects.&lt;/li>
&lt;/ul>
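&lt;p>The threshold change above can be illustrated with a minimal sketch. It assumes that a detection is kept when its score is at or above the threshold. The field names and sample detections are hypothetical and do not reflect the actual Classify API response schema.&lt;/p>

```python
# Default thresholds taken from the release notes above.
DEFAULT_THRESHOLD_V2 = 0.7
DEFAULT_THRESHOLD_V1_1 = 0.0


def filter_by_threshold(detections, threshold):
    """Keep detections scoring at or above the threshold (assumed semantics)."""
    return [d for d in detections if d["score"] >= threshold]


# Hypothetical detections; "entity" and "score" are illustrative field names,
# not the real response format.
sample = [
    {"entity": "EMAIL", "score": 0.95},
    {"entity": "PERSON", "score": 0.62},
    {"entity": "PHONE", "score": 0.30},
]

kept_v2 = filter_by_threshold(sample, DEFAULT_THRESHOLD_V2)
kept_v1_1 = filter_by_threshold(sample, DEFAULT_THRESHOLD_V1_1)

print([d["entity"] for d in kept_v2])  # only the high-confidence detection
print(len(kept_v1_1))                  # every detection passes at 0.0
```

&lt;p>Under this assumption, clients that relied on the v1.1 default of 0.0 may see fewer detections with the v2 default unless they set the threshold explicitly.&lt;/p>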
&lt;p>For more details, refer to &lt;a href="https://docs.protegrity.com/data-discovery/2.0.0/docs/whatsnew/">What’s New in Data Discovery&lt;/a>.&lt;/p></description></item><item><title>Prerequisites for Data Discovery</title><link>http://developer.docs.protegrity.com/docs/prod_features/data_discovery/dd_prereq/</link><pubDate>Tue, 20 Feb 2024 00:00:00 +0000</pubDate><guid>http://developer.docs.protegrity.com/docs/prod_features/data_discovery/dd_prereq/</guid><description>&lt;p>Ensure that the following prerequisites are met before running the Data Discovery examples:&lt;/p>
&lt;ul>
&lt;li>Docker CLI, Docker Compose, and Python are installed. For more information, refer to &lt;a href="http://developer.docs.protegrity.com/docs/install/">AI Developer Edition, Pre-requisites Guide&lt;/a>.&lt;/li>
&lt;li>For shell samples: Bash 5.1.8 or later and curl 7.76.1 or later.&lt;/li>
&lt;li>For notebook samples: JupyterLab 4.5.6 or later.&lt;/li>
&lt;/ul></description></item><item><title>Setting up Data Discovery</title><link>http://developer.docs.protegrity.com/docs/prod_features/data_discovery/dd_install/</link><pubDate>Tue, 20 Feb 2024 00:00:00 +0000</pubDate><guid>http://developer.docs.protegrity.com/docs/prod_features/data_discovery/dd_install/</guid><description>&lt;p>Use the containers to set up the Data Discovery components required for identifying sensitive data.&lt;/p>
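&lt;p>Before starting the containers, it can help to confirm that the installed tools meet the minimum versions listed in the prerequisites. The following is a minimal sketch, not part of the product. The helper names are hypothetical, and the version strings passed in are examples of what a tool might report for its version.&lt;/p>

```python
import re

# Minimum versions as listed in the Data Discovery prerequisites.
MINIMUMS = {"bash": "5.1.8", "curl": "7.76.1", "jupyterlab": "4.5.6"}


def parse_version(text):
    """Return the first dotted version number found in text as a tuple of ints."""
    match = re.search(r"\d+(?:\.\d+)+", text)
    if match is None:
        raise ValueError("no version number found in: " + repr(text))
    return tuple(int(part) for part in match.group(0).split("."))


def meets_minimum(tool, reported):
    """Compare a reported version string against the minimum for the tool."""
    return parse_version(reported) >= parse_version(MINIMUMS[tool])


# Example version strings (illustrative, not captured from a real system).
print(meets_minimum("bash", "GNU bash, version 5.2.15(1)-release"))
print(meets_minimum("curl", "curl 7.61.1 (x86_64-redhat-linux-gnu)"))
```

&lt;p>Comparing tuples of integers avoids the pitfall of comparing version strings alphabetically, where 7.9 would incorrectly sort after 7.76.&lt;/p>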



&lt;ul class="nav nav-tabs" id="tabs-0" role="tablist">
 &lt;li class="nav-item">
 &lt;button class="nav-link active"
 id="tabs-00-00-tab" data-bs-toggle="tab" data-bs-target="#tabs-00-00" role="tab"
 data-td-tp-persist="fresh setup" aria-controls="tabs-00-00" aria-selected="true">
 Fresh setup
 &lt;/button>
 &lt;/li>&lt;li class="nav-item">
 &lt;button class="nav-link"
 id="tabs-00-01-tab" data-bs-toggle="tab" data-bs-target="#tabs-00-01" role="tab"
 data-td-tp-persist="upgrade" aria-controls="tabs-00-01" aria-selected="false">
 Upgrade
 &lt;/button>
 &lt;/li>
&lt;/ul>

&lt;div class="tab-content" id="tabs-0-content">
 &lt;div class="tab-body tab-pane fade show active"
 id="tabs-00-00" role="tabpanel" aria-labelledby="tabs-00-00-tab" tabindex="0">
 &lt;!--Content-->
&lt;ol>
&lt;li>
&lt;p>Open a command prompt.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Navigate to the cloned repository location for &lt;strong>protegrity-ai-developer-edition&lt;/strong>.&lt;/p></description></item><item><title>Running the Data Discovery samples</title><link>http://developer.docs.protegrity.com/docs/prod_features/data_discovery/dd_using/</link><pubDate>Tue, 20 Feb 2024 00:00:00 +0000</pubDate><guid>http://developer.docs.protegrity.com/docs/prod_features/data_discovery/dd_using/</guid><description>&lt;p>Use the information in this section to run the Data Discovery samples provided in the &lt;code>data-discovery/samples&lt;/code> folder. These samples demonstrate how to use the Data Discovery API for classification and redaction of sensitive information in text and tabular data.&lt;/p>
&lt;h2 id="running-data-discovery">Running Data Discovery&lt;/h2>
&lt;p>The example scripts under the &lt;code>data-discovery/&lt;/code> folder demonstrate classification and redaction using the Data Discovery v2 API. For more information about the Data Discovery APIs, refer to the section &lt;a href="http://developer.docs.protegrity.com/docs/prod_features/data_discovery/dd_rest_api/">Data Discovery APIs&lt;/a>.&lt;/p></description></item><item><title>Uninstalling Data Discovery</title><link>http://developer.docs.protegrity.com/docs/prod_features/data_discovery/dd_uninstall/</link><pubDate>Tue, 20 Feb 2024 00:00:00 +0000</pubDate><guid>http://developer.docs.protegrity.com/docs/prod_features/data_discovery/dd_uninstall/</guid><description>&lt;ol>
&lt;li>
&lt;p>Open a command prompt.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Navigate to the cloned repository location.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Uninstall Semantic Guardrails if it is installed. For complete instructions, refer to &lt;a href="http://developer.docs.protegrity.com/docs/prod_features/sem_guard/sgr_uninstall/">Uninstalling Semantic Guardrails&lt;/a>.&lt;/p>
&lt;/li>
&lt;li>
&lt;p>Navigate to the &lt;code>data-discovery&lt;/code> directory.&lt;/p>
&lt;div class="highlight">&lt;pre tabindex="0" style="background-color:#f8f8f8;-moz-tab-size:4;-o-tab-size:4;tab-size:4;">&lt;code class="language-fallback" data-lang="fallback">&lt;span style="display:flex;">&lt;span>cd data-discovery
&lt;/span>&lt;/span>&lt;/code>&lt;/pre>&lt;/div>&lt;/li>
&lt;li>
&lt;p>Run the following command to remove the containers and images.&lt;/p>
&lt;div class="highlight">&lt;pre tabindex="0" style="background-color:#f8f8f8;-moz-tab-size:4;-o-tab-size:4;tab-size:4;">&lt;code class="language-fallback" data-lang="fallback">&lt;span style="display:flex;">&lt;span>docker compose down --rmi all
&lt;/span>&lt;/span>&lt;/code>&lt;/pre>&lt;/div>&lt;/li>
&lt;/ol></description></item></channel></rss>