AI-Generated Comments and Table Insights
Who this is for:
Architecture / Concept Overview: AI-Generated Comments and Table Insights
AI-generated comments and table insights are produced by Databricks AI analysing table schemas, sample data, column statistics, and usage patterns.
%%{init: {"theme":"base","themeVariables":{"background":"#0B0E14","primaryTextColor":"#E0E6ED","lineColor":"#5D6470","darkMode":true,"primaryColor":"#2E4A4A","secondaryColor":"#374151","secondaryTextColor":"#E0E6ED","tertiaryColor":"#111827","tertiaryTextColor":"#E0E6ED","edgeLabelBackground":"#1f2937"}}}%%
flowchart LR
classDef source fill:#3F4B59,stroke:#9CA3AF,stroke-width:2px,rx:8,ry:8,color:#E0E6ED
classDef ingestion fill:#5A4B36,stroke:#C9A86B,stroke-width:2px,rx:8,ry:8,color:#E0E6ED
classDef processing fill:#535072,stroke:#8E82B4,stroke-width:2px,rx:8,ry:8,color:#E0E6ED
classDef storage fill:#2E4A4A,stroke:#5FAFA8,stroke-width:2px,rx:8,ry:8,color:#E0E6ED
classDef serving fill:#3D5550,stroke:#6BB7AA,stroke-width:2px,rx:8,ry:8,color:#E0E6ED
classDef governance fill:#5A3F52,stroke:#C28BB0,stroke-width:2px,rx:8,ry:8,color:#E0E6ED
TBL[Unity Catalog Table] --> SCHEMA[Schema Metadata<br/>Columns · Types · Constraints]
TBL --> STATS[Column Statistics<br/>Min · Max · Nulls · Distinct]
TBL --> SAMPLE[Sample Data<br/>Representative rows]
SCHEMA --> AI[Databricks AI<br/>Comment Generation]
STATS --> AI
SAMPLE --> AI
AI --> COMMENTS[AI-Generated Comments<br/>Table & column descriptions]
AI --> INSIGHTS[Table Insights<br/>Distribution · Quality · Patterns]
TBL:::storage
SCHEMA:::governance
STATS:::processing
SAMPLE:::processing
AI:::processing
COMMENTS:::serving
INSIGHTS:::serving
*Figure 1 — AI analyses schema, statistics, and sample data to generate comments and insights automatically.*
AI comments integrate into the broader data documentation workflow.
%%{init: {"theme":"base","themeVariables":{"background":"#0B0E14","primaryTextColor":"#E0E6ED","lineColor":"#5D6470","darkMode":true,"primaryColor":"#2E4A4A","secondaryColor":"#374151","secondaryTextColor":"#E0E6ED","tertiaryColor":"#111827","tertiaryTextColor":"#E0E6ED","edgeLabelBackground":"#1f2937"}}}%%
graph TD
classDef source fill:#3F4B59,stroke:#9CA3AF,stroke-width:2px,rx:8,ry:8,color:#E0E6ED
classDef ingestion fill:#5A4B36,stroke:#C9A86B,stroke-width:2px,rx:8,ry:8,color:#E0E6ED
classDef processing fill:#535072,stroke:#8E82B4,stroke-width:2px,rx:8,ry:8,color:#E0E6ED
classDef storage fill:#2E4A4A,stroke:#5FAFA8,stroke-width:2px,rx:8,ry:8,color:#E0E6ED
classDef serving fill:#3D5550,stroke:#6BB7AA,stroke-width:2px,rx:8,ry:8,color:#E0E6ED
classDef governance fill:#5A3F52,stroke:#C28BB0,stroke-width:2px,rx:8,ry:8,color:#E0E6ED
DOC[Documentation Workflow] --> AI_GEN[AI-Generated<br/>Auto-created descriptions]
DOC --> HUMAN[Human-Authored<br/>Manual COMMENT statements]
DOC --> HYBRID[Hybrid<br/>AI draft → human review → accepted]
AI_GEN --> SURFACES[Surfaces In]
HUMAN --> SURFACES
SURFACES --> CE[Catalog Explorer]
SURFACES --> GENIE[Genie / AI/BI]
SURFACES --> SEARCH[Search Results]
DOC:::governance
AI_GEN:::processing
HUMAN:::source
HYBRID:::ingestion
SURFACES:::serving
CE:::serving
GENIE:::serving
SEARCH:::serving
*Figure 2 — AI-generated and human-authored comments flow into the same surfaces: Catalog Explorer, Genie, and search.*
Key Terms
Prerequisites and Setup
- Unity Catalog enabled
- AI-generated documentation feature enabled in workspace settings (Admin Console → Preview → AI-generated documentation)
BROWSEprivilege for viewing AI comments; table ownership orMODIFYfor accepting/editing- Tables with data (AI needs sample rows and statistics)
Step-by-Step Implementation
Configuration Reference
| Setting | Scope | Default | Notes |
|---|---|---|---|
| AI-generated documentation | Workspace | Disabled | Enable in Admin Console |
| Comment storage | Unity Catalog | Metadata store | Stored as table/column properties |
| Statistics refresh | Table | Manual via ANALYZE TABLE | Predictive optimisation can automate |
| Insight retention | Table | Continuously updated | Refreshed as data changes |
| AI model access | Workspace | Databricks-managed | No external API calls needed |