Deep Inspection for Parquet Files & Apache Iceberg Tables
A powerful forensics tool for analyzing file structure, metadata, row groups, column statistics, and table evolution with an intuitive terminal interface.
Why Data Engineers Use TableSleuth
Direct visibility into the file-level details that determine your data lakehouse performance
Key Features
Parquet Analysis
- File structure & metadata inspection
- Row group analysis with statistics
- Column-level profiling
- Data sample preview
- Compression & encoding details
Iceberg Table Analysis
- Snapshot history & evolution
- Manifest file inspection
- Delete file (MOR) forensics
- Snapshot comparison
- Performance benchmarking
Cloud Integration
- S3 file access
- AWS Glue Catalog support
- S3 Tables integration
- PyIceberg catalog support
- Local & remote files
Performance Testing
- Query performance comparison
- Snapshot benchmark testing
- GizmoSQL integration
- DuckDB profiling
- Arrow Flight SQL support
See It In Action
Parquet File Inspection
File Structure & Schema
Row Group Analysis
Data Sample View
Column Profiling
Iceberg Table Analysis
Snapshot Overview
Performance Testing
Delete Files (MOR)
Snapshot Comparison
Quick Start
Basic Usage
Inspect a Parquet File
tablesleuth inspect /path/to/file.parquet
Inspect S3 Files
tablesleuth inspect s3://bucket/path/to/file.parquet
Analyze Iceberg Table
tablesleuth iceberg --catalog glue --namespace db --table my_table
Documentation
Setup Guide
Complete installation and configuration instructions
User Guide
Comprehensive guide to all features and workflows
Architecture
Technical architecture and design documentation
GizmoSQL Setup
Deploy GizmoSQL for advanced profiling features
Development
Contributing and development environment setup
Developer Guide
API reference and extension development
Ready to Start Investigating?
Install TableSleuth and start analyzing your Parquet files and Iceberg tables today.