Core Concepts
What is Data Discovery?
What is Data Discovery?
What is Data Catalog?
What is Data Catalog?
What is Data Observability?
What is Data Observability?
- Focuses on change and drift — volume dropped, schema changed, freshness delayed, distribution shifted
- Produces alerts, incidents, SLA breaches
- Forward-looking and continuous — “something changed, investigate now”
- Example: “Row count dropped 40% compared to yesterday’s average — anomaly detected”
What is Data Quality?
What is Data Quality?
- Focuses on the data itself — completeness, accuracy, consistency, uniqueness, timeliness, validity
- Produces a score, a pass/fail, a rule result
- Backward-looking — “was this data good when it was loaded?”
- Example: “12.7% of customer_id values are null — that fails our threshold”
What is Semantic?
What is Semantic?
- Semantics tells you: “This is a Customer Identifier — a unique reference to a person or organization that has a business relationship with the company”
- It gets tagged with business terms like: Customer, PII, Primary Key, CRM Entity
- This meaning is stable — cust_id means the same thing whether it appears in a sales table, a support ticket table, or a billing table
What is Context?
What is Context?
- It feeds into the daily revenue dashboard used by the CFO
- It’s joined to a pipeline that triggers customer invoices
- It was flagged with 3% null values last Tuesday
- It’s downstream of a Salesforce sync that ran late
- This particular instance of cust_id is business-critical
- A data issue here affects revenue reporting and invoicing
- This needs to be prioritized over a cust_id sitting in an archive table nobody uses
Compare Semantic vs Context
Compare Semantic vs Context
| Semantics | Context | |
|---|---|---|
| Question | What does this data mean? | Why does this data matter right now? |
| Nature | Static definition | Dynamic and situational |
| Example | cust_id = Customer Identifier | cust_id feeds the CFO dashboard and invoice pipeline |
| Set by | Business glossary, classification | Lineage, usage patterns, downstream dependencies |
| Changes over time? | Rarely | Constantly |
Does Prizm provide Data Discovery or Data Catalog or both?
Does Prizm provide Data Discovery or Data Catalog or both?
| Discovery | Catalog | |
|---|---|---|
| Question it answers | ”Does this data exist?" | "What is this data, who owns it, can I trust it?” |
| Primary user | Anyone looking for data | Data owners, stewards, engineers, and analysts |
| Frequency | One-time or occasional | Ongoing, daily governance work |
| Core feature | Search bar | Asset pages, lineage, DQ rules, glossary |
| Value | Speed to find | Depth of trust |
Does Prizm provide Data Observability or Quality or both?
Does Prizm provide Data Observability or Quality or both?
Does Prizm provide Semantic or Context or both?
Does Prizm provide Semantic or Context or both?
- Semantics tells Prizm: “This is customer data, it’s PII, it’s a key business entity”
- Context tells Prizm: “This specific instance flows into 12 downstream reports, was touched by 3 pipelines today, and is used by the finance team daily”
- Together, Prizm’s can say: “There’s a data quality issue here — and it’s high priority because of what this data means AND how critical it is to the business right now”
What Tag Types are supported?
What Tag Types are supported?
- Status — Current health state, for example
Healthy. - Domain — Business domain, for example
Customer,Sales, orFinance. - System — Source system, for example
CRM. - Data — Data category, for example
Transactions. - Priority — Operational priority, for example
High Priority. - Security — Data sensitivity classification, for example
PII.
Getting started
What is PRIZM?
What is PRIZM?
Which data sources does PRIZM support?
Which data sources does PRIZM support?
- Snowflake (e.g.,
snowflake/production_db) - PostgreSQL (e.g.,
postgres/analytics_db) - MySQL (e.g.,
mysql/ecommerce_db) - Amazon Redshift (e.g.,
redshift/warehouse_db)
How do I create an account?
How do I create an account?
Assets
What is an asset in PRIZM?
What is an asset in PRIZM?
- Tables & Views — relational tables and database views
- Queries — saved or scheduled SQL queries
- Pipelines — data pipeline jobs moving or transforming data
- Reports — reporting objects or materialized outputs
- Semantic models — logical data models and definitions
- Attributes — columns related to tables, queries, views.
What is a fingerprint column?
What is a fingerprint column?
updated_at, transaction_date, or last_login. PRIZM reads this column to determine when the data was last modified, which directly feeds the Freshness metric. Your PRIZM administrator configures the fingerprint column for each asset during setup.What does the Score metric mean?
What does the Score metric mean?
What is Freshness?
What is Freshness?
Alerts & Issues
What triggers an alert?
What triggers an alert?
email must stay below 5%” or “the row count must not drop by more than 10% day over day.” When PRIZM evaluates a measure and finds the threshold has been exceeded, it generates an alert. Alerts appear on the Alerts page, categorized by level: Critical, Warning, Info, or High.What is drift status?
What is drift status?
- Low — minor deviation, within normal variation
- Medium — notable deviation that warrants review
- High — significant deviation that likely requires action
How is an alert different from an issue?
How is an alert different from an issue?
- An alert is an automated notification generated when a measure threshold is breached. Alerts are read-only records of what PRIZM detected.
- An issue is a tracked work item you create and manage through to resolution. Issues have a status (New, In Progress, Resolved), a priority, and are linked to a specific asset and database.
Can I silence or acknowledge an alert?
Can I silence or acknowledge an alert?
Metrics
What is the Data Quality percentage?
What is the Data Quality percentage?
What does 'Active Pipelines' mean?
What does 'Active Pipelines' mean?