Skip to content

Conversation

@aparna-shyam
Copy link
Contributor

Description

Enhanced the console output for the SemanticModel build pipeline using the rich library. The output now includes detailed summary panels with statistics for profiling, link prediction, and glossary generation stages. This improves visibility into the build process and provides immediate feedback on data quality and relationships.

Changes

  • Imported Panel from rich.panel in src/intugle/semantic_model.py.
  • Added _print_profiling_summary method to display table counts, column counts, and dimension/measure distribution.
  • Added _print_link_prediction_summary method to list predicted relationships and total link counts.
  • Added _print_glossary_summary method to show documentation coverage percentages.
  • Added _print_build_summary method for a polished final success message.
  • Updated profile(), predict_links(), generate_glossary(), and build() methods to trigger these summaries at the appropriate times.

Related Issue

Fixes #134

@raphael-intugle
Copy link
Collaborator

Great attempt @aparna-shyam ! There are a couple of things I want to point out:

  1. line 90 in semantic_model.py
if col.data_type in ['string', 'date', 'datetime', 'boolean', 'text']:
                        dimensions += 1

col here does not have a data_type object

  1. Many of the functions' docstrings were removed. Please restore them.
  2. Make sure the test cases for semantic model have been run. You can run them with uv run pytest path/to/test

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

[GOOD FIRST ISSUE] Enrich Console Output with Summary Statistics

2 participants