Daanalytics

Snowflake Data Governance directly from Snowsight

Last year I blogged about how to use Snowflake functionality to; “Know your Data”. Especially in these times where Generative AI becomes more and more mainstream, it’s essential to know what data is input for the LLM’s. Now Snowflake has made this a few clicks easier, offering classifying functionalities directly from Snowsight. This functionality is currently in Public Preview.

In the blogpost I wrote earlier, I stated that the classification of data was a 3 step process with SQL-statements. This now can all be done from within the Snowsight UI.

This blogpost will show how easy it can be to do the tasks from the Snowsight UI instead of via SQL statements.

Setup Environment

I chose a simple setup, just to see for myself how easy things are. Therefore I created two tables; Customers and Transactions and generated some data. For the Customer-table I used the Faker-package I used earlier.

Classify and Tag Sensitive Data

Now that I have the data in place, I can navigate to the schema where the two tables reside.

Snowflake Data Classification from the Snowsight UI

From the dropdown in the upper-right corner I can select; ‘Classify and Tag Sensitive Data’.

Execute Classify and Tag Sensitive Data

In the next screen I can select which tables I would like to classify. Select (or de-select) the tables of your choice. Clicking the blue botten starts the ‘Classify and Tag Sensitive Data’-process

Classify and Tag Sensitive Data in Progress
Finished Classify and Tag Sensitive Data Process

When you see the green check marks the ‘Classify and Tag Sensitive Data’-process is finished and ready to be analyzed.

Verify the results of the the 'Classify and Tag Sensitive Data'-process

Each column can be checked to see whether and how Snowflake has classified each column.

Like I mentioned in the previous blog, automatic Data Classification analyzes columns in structured data for personal information and provides customers with a set of pre-defined Snowflake System Tags to automatically categorize this data according to the supported types.

These System Tags automate the process of classifying data.

The System Tags are:

  • semantic_category – tells you what the cells of the column contain
    • some possible values are: name, gender, age, us_postal_code, phone
  • privacy_category – tells you what kind of personal information it is
    • some possible values are: identifier, quasi_identifier, insensitive

If, for some reason, you do not agree with the outcome of the classification, you can make any adjustment necessary. In my case I had to mark the names as ‘IDENTIFIER’. After that the tags can be applied and the classification completed.

No I can switch to the Governance Summary Dashboard in Snowsight to see the tagged and protected assets with workflows to take action.

You either need ACCOUNTADMIN or a role that has been granted the following Database roles; GOVERNANCE_VIEWER & OBJECT_VIEWER.

Via the Snowflake, Monitoring, Governance menu you can access the Governance Dashboard.

Snowflake, Monitoring, Governance menu
Snowflake Data Governance dashboard - Summary

The Governance dashboard gives an a consolidated view of all the tagged and classified objects. Note: “Information latency can be up to 2 hours”

Drilling down on the ’tagged columns’ gives a detailed view of the tagged objects.

Snowflake Data Governance dashboard - Detail

Summary

With the Data Governance functionality in the Snowsight UI, currently in Public Preview, Snowflake has improved the ease of use when it comes to ‘Getting to Know your Data’. With a few clicks, Data can be tagged and classified, without having to write any SQL.

Next Steps

For more details, please refer to the following information:

  • Blog: www.snowflake.com/blog/empower-data-governance-snowsight/
  • Blog: https://www.snowflake.com/blog/data-classification-snowsight-ui/
  • Documentation: https://docs.snowflake.com/user-guide/governance-classify-using
  • Documentation: https://docs.snowflake.com/en/user-guide/governance-classify-concepts
  • Documentation: https://docs.snowflake.com/en/user-guide/governance-classify-concepts#system-tags-and-categories

Till next time.

Snowflake Data Superhero. Online also known as; DaAnalytics.

Daan Bakboord

DaAnalytics signature picture with Snowflake Data Superhero avatar.

Bekijk ook:

Why didn't we see this coming?

Why didn’t we see this coming?

Early this month I attended the two days International Master Class in Strategic Intelligence executed by Rodenberg Tillman & Associates. If you’re really determined to move beyond simply gathering data and truly understand its strategic impact, this Master Class is designed for you. The Master Class is built around the Six Building Blocks™, ensuring comprehensive coverage of the critical aspects of Strategic Management and Intelligence. It’s perfect for business professionals who aim to excel by integrating Strategic Intelligence into their everyday practices, gaining the insights necessary to not only anticipate, but shape the future.

Lees verder »
Amsterdam User Group Meeting October 2024

Snowflake Dutch User Group – October 2024

Last night I had the privilege to organize a Snowflake ❄️ User Group in Snowflake’s Amsterdam Office.

Johan van der Kooij shared his experiences regarding optimizing Snowflake from a cost & performance perspective. He shared practical hints, as well as example queries, that you can use to optimize your Snowflake environment.

Lees verder »
Snowflake Data Cloud Summit - Wrap Up

Snowflake Snowflake Data Cloud Summit — Wrap Up

Snowflake Data Cloud Summit proved that after all this years the core idea remains the same and is still strong. Technology should serve and Snowflake makes things simple. One Single Unified Platform, one product and one engine. Ease of use and Govenance. Maximum efficiency and maximum simplicity.

Bring the processing of data to the data instead of the other way around. Snowflake as a Platform where you build and share your Data, Apps and AI Products. Your data never has to leave the Platform and Snowflake takes care of this Platform.

Lees verder »