Skip to content

Validation Reports

This section contains automated validation reports that measure the accuracy and completeness of General SQL Parser's analysis capabilities against real-world SQL codebases.

Data Lineage Validation

The Data Lineage Validation reports compare lineage results from the GSP DataFlowAnalyzer against an independent LLM-based analysis of the same SQL files. Each report documents:

  • What lineage was correctly detected
  • What lineage was missed (with root cause analysis)
  • Crashes or errors encountered
  • Recommendations for improvement

Reports are organized by database vendor, with multiple test suites per vendor covering different SQL patterns and complexity levels.

Latest Results

Vendor Test Suite Accuracy Issues Report
Oracle DB Sample Schemas 96% 1 Report
SQL Server SQL Data Warehouse Project 100% 0 Report
Snowflake Mattermost Data Warehouse (dbt) 0% 46 Report
DuckDB SOMA B2B SaaS (dbt) 0% 33 Report

!!! info "Generating Reports" Reports are generated using the /dlineage-verify skill, which automatically:

1
2
3
4
5
6
7
8
9
1. Analyzes SQL files with both LLM and DataFlowAnalyzer
2. Compares results and categorizes discrepancies
3. Generates a detailed markdown report in this section
4. Files MantisBT tickets for any issues found
5. Updates the summary table above

```bash
/dlineage-verify oracle /path/to/sql/directory
```