Skip to main content

Google Data Catalog

Recommendation
Updated
Moved
ASSESS
2022-05-13

What is it

Data Catalog is a managed metadata managenement service in Google Cloud's Analytics family of products. It is used for data discovery and allows data stakeholders to easier find and understand the organizations data assets. It does this by providing two services, one is used for tagging assets with metadata, the other one is used for searching among the data assets. In addition, Data Catalog can leverage the results of a Cloud Data Loss Prevention (DLP) scan to identify sensitive data directly within Data Catalog in the form of tag templates.

Data catalog

When to use it

We are evaluating if we should use Google Data Catalog as our main data discovery tool. Use it for tagging data sets in GCP with appropriate metadata, and finding relevant data sets.

How to learn it

Start at the official documentation