Setting up Synthetic Data
Use the containers to set up the Synthetic Data feature for data generation.
Open a command prompt.
Navigate to the cloned repository location for protegrity-ai-developer-edition.
Run the following command to download and start the containers. The dependent containers are large in size. Based on the network connection, the containers might take time to download and deploy.
cd synthetic-data docker compose up -dBased on your configuration use the
docker-compose up -dcommand.Note: By default images are obtained from
ghcr.io. To obtain images frompublic.ecr.aws, navigate to thesynthetic-datadirectory and copy the.env.examplefile to.env. Open the.envfile and uncomment theREGISTRY=public.ecr.aws/protegrity-ai-developer-editionline in the file. Save the file and run thedocker compose up -dcommand to download and start the containers.Verify that the containers started successfully.
docker compose logsSet up the Jupyter notebook for working with the notebooks provided from the cloned repository location for protegrity-ai-developer-edition.
pip install -r shared/requirements.txtInstall the Synthetic Data SDK package.
pip install protegrity-synthetic-data-sdk
Open a command prompt.
Navigate to the cloned repository location for protegrity-ai-developer-edition.
If the step to stop containers was missed earlier, then use the following commands to identify and remove the AI Developer Edition containers.
docker compose down --remove-orphansDelete the docker network resources.
docker network rm -f <network_name_or_id>For example,
docker network rm -f protegrity-networkRun the following command to download and start the containers. The dependent containers are large in size. Based on the network connection, the containers might take time to download and deploy.
cd synthetic-data docker compose up -dBased on your configuration use the
docker-compose up -dcommand.Verify that the containers started successfully.
docker compose logsSet up the Jupyter notebook for working with the notebooks provided from the cloned repository location for protegrity-ai-developer-edition.
pip install -r shared/requirements.txtUpgrade the Synthetic Data SDK package.
pip install --upgrade protegrity-synthetic-data-sdk
Feedback
Was this page helpful?