Spaces:

sanzgiri
/

schema-translator

Paused

App Files Files Community

schema-translator / UI_GUIDE.md

sanzgiri

Complete Schema Translator implementation - All 8 phases

a584f85 about 1 month ago

preview code

raw

history blame contribute delete

9.52 kB

A newer version of the Gradio SDK is available: 6.1.0

Upgrade

Schema Translator UI Guide

Overview

The Schema Translator UI provides an intuitive chat interface for querying customer databases using natural language. The system automatically translates your queries into SQL and harmonizes results across different customer schemas.

Getting Started

Starting the Application

# Activate your virtual environment
source .venv/bin/activate

# Start the Chainlit server
chainlit run app.py --port 8000

The application will be available at http://localhost:8000

First Steps

Open the application in your browser
Read the welcome message to understand available commands
Try a simple query like "Show me all contracts"

Query Examples

Basic Queries

List all records:

Show me all contracts
Find all contracts
List contracts

Filter by status:

Show active contracts
Find pending contracts
List completed contracts

Filter by value:

Find contracts with value over 10000
Show contracts worth more than 50000
List high-value contracts (over 100000)

Filter by dates:

Show contracts expiring soon
Find contracts expiring in next 30 days
List recent contracts

Advanced Queries

Aggregation queries:

Count contracts by status
Sum contract values by customer
Average contract value

Sorting:

Show contracts ordered by value
List contracts sorted by expiration date

Combining filters:

Find active contracts with value over 10000
Show pending contracts expiring in next 60 days
List high-value active contracts

Available Commands

`/help`

Display help information with examples and available commands.

Usage:

/help

`/customers`

List all available customers with their database information.

Usage:

/customers

Output:

Customer IDs
Number of rows per customer
Number of concepts mapped

`/select <customer_id>`

Query specific customer(s) instead of all customers.

Usage:

# Query single customer
/select customer_a

# Query multiple customers (comma-separated)
/select customer_a, customer_c, customer_e

# Query all customers (default)
/select all

`/debug on|off`

Toggle debug mode to see SQL queries and semantic plans.

Usage:

# Enable debug mode
/debug on

# Disable debug mode
/debug off

# Check current status
/debug

Debug Information Includes:

Semantic query plan (intent, projections, filters)
Generated SQL queries for each customer
Execution details

`/stats`

Display query execution statistics and knowledge graph information.

Usage:

/stats

Shows:

Total queries executed
Success rate
Average execution time
Knowledge graph statistics (concepts, customers, mappings)

`/explain <query>`

Explain how a query will be processed without executing it.

Usage:

/explain Show me all active contracts

Output:

Human-readable explanation
Sample SQL query

Understanding Results

Result Table

Query results are displayed as markdown tables with:

Column headers (concept names)
Harmonized values (normalized across customers)
Row count (limited to first 50 rows)

Execution Statistics

Each result includes:

Success Rate: Percentage of customers successfully queried
Total Rows: Number of results returned
Customers Queried: List of customers included
Customers Succeeded: List of customers that returned results
Execution Time: Query execution time in milliseconds

Action Buttons

After each query, you can:

📖 Explain Query: See how the query was processed
👍 Good Result: Provide positive feedback
👎 Incorrect Result: Report incorrect results
🔍 Show Debug Info: View SQL and semantic plan (if debug mode off)

Features

Multi-Customer Queries

By default, queries run across all available customers. Results are automatically harmonized and combined.

Benefits:

Single query syntax works for all customers
Automatic schema translation
Normalized results

Query Modes

Mock Mode:

Uses predefined query patterns
No LLM API calls required
Fast response times
Good for testing

LLM Mode:

Uses Claude Sonnet 4 for query understanding
Better natural language understanding
Requires ANTHROPIC_API_KEY
More flexible query handling

Debug Mode

When enabled, debug mode shows:

Semantic Plan:
- Intent (e.g., find_contracts)
- Projections (fields to return)
- Filters (conditions)
- Aggregations (if any)
Generated SQL:
- Customer-specific SQL queries
- Shows schema differences
Execution Details:
- Per-customer execution status
- Error messages if any

Tips and Best Practices

Writing Effective Queries

Be specific: ✅ "Show active contracts with value over 10000" ❌ "Show stuff"

Use natural language: ✅ "Find contracts expiring in next 30 days" ❌ "SELECT * WHERE exp_date < NOW() + 30"

Use common terms:

"active", "pending", "completed" for status
"high value", "over X", "more than X" for amounts
"expiring soon", "recent", "next X days" for dates

Performance

For faster queries:

Query specific customers instead of all
Use filters to limit results
Avoid complex aggregations when not needed

For comprehensive analysis:

Query all customers
Use aggregation functions
Enable debug mode to see execution details

Troubleshooting

Query returns no results:

Check if selected customers have data
Try broader filters
Use /customers to see available data

Query fails with error:

Check query syntax (at least 3 characters)
Enable debug mode to see details
Try simpler query first
Check /stats for system status

Slow queries:

Reduce number of customers
Add more specific filters
Check debug info for complex SQL

Configuration

Environment Variables

The application uses .env file for configuration:

# Optional: Anthropic API key for LLM mode
ANTHROPIC_API_KEY=your_api_key_here

# Optional: Model selection
LLM_MODEL=claude-sonnet-4-20250514

# Optional: Database directory
DATABASE_DIR=./data/databases

Chainlit Configuration

Edit .chainlit/config.toml to customize:

UI theme
Session timeout
File upload settings
Feature flags

Keyboard Shortcuts

While in the chat interface:

Enter: Send message
Shift + Enter: New line in message
Ctrl/Cmd + K: Clear chat
Ctrl/Cmd + L: Toggle light/dark mode

Advanced Features

Custom Customer Selection

You can maintain a session-specific customer selection:

# Set selection
/select customer_a, customer_b

# All queries will now use only these customers
Show me all contracts

# Reset to all customers
/select all

Query History

The system tracks all queries in the session:

Success/failure status
Execution time
Timestamp
View with /stats command

Feedback Loop

Use feedback buttons to improve results:

👍 Mark good results
👎 Report incorrect results
Feedback stored for future improvements (Phase 8)

Example Workflows

Exploring Available Data

1. /customers                    # See what's available
2. /select customer_a            # Focus on one customer
3. Show me all contracts         # See sample data
4. /debug on                     # Enable debug mode
5. Find active contracts         # See how queries work
6. /select all                   # Return to all customers

Complex Analysis

1. /debug on                           # Enable debugging
2. Count contracts by status           # Get overview
3. Find high-value contracts           # Identify valuable ones
4. Show active contracts over 50000    # Combine filters
5. /stats                              # Check performance

Troubleshooting Issues

1. /debug on                     # Enable detailed output
2. <your failing query>          # Run the query
3. Check semantic plan           # Verify interpretation
4. Check SQL queries             # Verify generated SQL
5. Simplify query and retry      # Try simpler version

Support and Documentation

Getting Help

Use /help in chat for quick reference
Check this guide for detailed information
Enable debug mode to diagnose issues
Check /stats for system health

Known Limitations

Maximum 50 rows displayed per query (full results processed)
Query text must be at least 3 characters
Mock mode has limited query patterns
Some complex queries require LLM mode

Future Enhancements (Phase 8)

Learning from feedback
Schema drift detection
Query optimization suggestions
Improved error recovery
Enhanced analytics

Quick Reference Card

Command	Purpose	Example
`/help`	Show help	`/help`
`/customers`	List customers	`/customers`
`/select`	Choose customers	`/select customer_a, customer_b`
`/debug`	Toggle debug	`/debug on`
`/stats`	Show statistics	`/stats`
`/explain`	Explain query	`/explain Find contracts`

Query Type	Example
List all	`Show me all contracts`
Filter status	`Find active contracts`
Filter value	`Show contracts over 10000`
Filter date	`Find contracts expiring soon`
Aggregate	`Count contracts by status`
Combined	`Show active contracts over 10000`

Version: Phase 7 - UI Implementation Last Updated: November 7, 2025