perf[layout]: stop using dict layout if it offers poor compression#7884
joseph-isaacs wants to merge 1 commit into
Signed-off-by: Joe Isaacs <[email protected]>
Benchmarks: PolarSignals Profiling
Vortex (geomean): 0.819x ✅
datafusion / vortex-file-compressed (0.819x ✅, 9↑ 0↓)
File Sizes: PolarSignals Profiling
No file size changes detected.
Benchmarks: FineWeb NVMe
Verdict: No clear signal (low confidence)
datafusion / vortex-file-compressed (0.972x ➖, 0↑ 0↓)
datafusion / vortex-compact (0.947x ➖, 1↑ 0↓)
datafusion / parquet (0.940x ➖, 1↑ 0↓)
duckdb / vortex-file-compressed (0.949x ➖, 1↑ 0↓)
duckdb / vortex-compact (0.960x ➖, 1↑ 0↓)
duckdb / parquet (0.935x ➖, 1↑ 0↓)
File Sizes: FineWeb NVMe
No file size changes detected.
Benchmarks: TPC-H SF=1 on NVME
Verdict: No clear signal (low confidence)
datafusion / vortex-file-compressed (0.993x ➖, 0↑ 0↓)
datafusion / vortex-compact (1.001x ➖, 0↑ 0↓)
datafusion / parquet (0.986x ➖, 2↑ 1↓)
datafusion / arrow (1.009x ➖, 0↑ 2↓)
duckdb / vortex-file-compressed (0.993x ➖, 0↑ 0↓)
duckdb / vortex-compact (0.994x ➖, 0↑ 0↓)
duckdb / parquet (0.988x ➖, 0↑ 0↓)
duckdb / duckdb (1.002x ➖, 0↑ 0↓)
File Sizes: TPC-H SF=1 on NVME
No file size changes detected.
Benchmarks: TPC-DS SF=1 on NVME
Verdict: No clear signal (low confidence)
datafusion / vortex-file-compressed (0.990x ➖, 1↑ 1↓)
datafusion / vortex-compact (0.988x ➖, 1↑ 0↓)
datafusion / parquet (0.984x ➖, 0↑ 0↓)
duckdb / vortex-file-compressed (0.995x ➖, 0↑ 0↓)
duckdb / vortex-compact (0.996x ➖, 1↑ 2↓)
duckdb / parquet (0.999x ➖, 0↑ 0↓)
duckdb / duckdb (0.986x ➖, 2↑ 1↓)
File Sizes: TPC-DS SF=1 on NVME
No file size changes detected.
Benchmarks: FineWeb S3
Verdict: No clear signal (environment too noisy)
datafusion / vortex-file-compressed (0.894x ➖, 1↑ 0↓)
datafusion / vortex-compact (1.039x ➖, 0↑ 0↓)
datafusion / parquet (1.126x ➖, 0↑ 1↓)
duckdb / vortex-file-compressed (1.128x ➖, 0↑ 1↓)
duckdb / vortex-compact (1.053x ➖, 0↑ 1↓)
duckdb / parquet (1.024x ➖, 0↑ 0↓)
Benchmarks: TPC-H SF=10 on NVME
Verdict: No clear signal (low confidence)
datafusion / vortex-file-compressed (1.046x ➖, 0↑ 0↓)
datafusion / vortex-compact (1.040x ➖, 0↑ 0↓)
datafusion / parquet (1.037x ➖, 0↑ 0↓)
datafusion / arrow (1.073x ➖, 0↑ 3↓)
duckdb / vortex-file-compressed (1.054x ➖, 0↑ 0↓)
duckdb / vortex-compact (1.031x ➖, 0↑ 0↓)
duckdb / parquet (1.026x ➖, 0↑ 0↓)
duckdb / duckdb (1.028x ➖, 0↑ 0↓)
Benchmarks: Statistical and Population Genetics
Verdict: No clear signal (low confidence)
duckdb / vortex-file-compressed (1.071x ➖, 0↑ 4↓)
duckdb / vortex-compact (0.998x ➖, 0↑ 0↓)
duckdb / parquet (1.042x ➖, 0↑ 0↓)
File Sizes: TPC-H SF=10 on NVME
No file size changes detected.
File Sizes: Statistical and Population Genetics
No file size changes detected.
Benchmarks: Clickbench on NVME
Verdict: No clear signal (low confidence)
datafusion / vortex-file-compressed (0.952x ➖, 6↑ 3↓)
datafusion / parquet (0.941x ➖, 3↑ 0↓)
duckdb / vortex-file-compressed (1.037x ➖, 5↑ 10↓)
duckdb / parquet (0.964x ➖, 1↑ 0↓)
duckdb / duckdb (0.954x ➖, 5↑ 0↓)
File Sizes: Clickbench on NVME
File Size Changes (143 files changed, +0.9% overall, 122↑ 21↓)
Benchmarks: TPC-H SF=1 on S3
Verdict: No clear signal (environment too noisy)
datafusion / vortex-file-compressed (1.313x ❌, 0↑ 9↓)
datafusion / vortex-compact (1.210x ➖, 0↑ 8↓)
datafusion / parquet (1.147x ➖, 1↑ 5↓)
duckdb / vortex-file-compressed (1.050x ➖, 0↑ 0↓)
duckdb / vortex-compact (1.115x ➖, 0↑ 0↓)
duckdb / parquet (1.098x ➖, 0↑ 0↓)
Benchmarks: Compression
Vortex (geomean): 0.997x ➖
unknown / unknown (0.966x ➖, 13↑ 2↓)
Benchmarks: Random Access
Vortex (geomean): 0.963x ➖
unknown / unknown (0.974x ➖, 3↑ 0↓)
Benchmarks: TPC-H SF=10 on S3
Verdict: No clear signal (environment too noisy)
datafusion / vortex-file-compressed (1.214x ➖, 0↑ 6↓)
datafusion / vortex-compact (1.078x ➖, 0↑ 3↓)
datafusion / parquet (1.018x ➖, 0↑ 1↓)
duckdb / vortex-file-compressed (1.079x ➖, 0↑ 0↓)
duckdb / vortex-compact (1.057x ➖, 0↑ 0↓)
duckdb / parquet (1.125x ➖, 0↑ 1↓)
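For reading the "geomean" figures in the benchmark summaries, here is a minimal sketch of how a geometric mean over per-query ratios can be computed. It assumes each ratio is the PR's time divided by the baseline's time, so a value below 1.0 means faster; the ✅ on 0.819x and the ❌ on 1.313x suggest this convention, but the page does not state it. The function name `geomean` is hypothetical and is not taken from the benchmark tooling.

```rust
/// Geometric mean of a set of ratios (assumed here to be PR time / baseline time).
/// Returns `None` for an empty slice or when any ratio is non-positive or non-finite,
/// because the logarithm is undefined or meaningless in those cases.
fn geomean(ratios: &[f64]) -> Option<f64> {
    if ratios.is_empty() || ratios.iter().any(|r| !(r.is_finite() && *r > 0.0)) {
        return None;
    }
    // Averaging in log space avoids overflow and underflow of a long running product.
    let log_sum: f64 = ratios.iter().map(|r| r.ln()).sum();
    Some((log_sum / ratios.len() as f64).exp())
}
```

A geometric mean is the usual choice for ratios because a 2x slowdown and a 2x speedup cancel out: `geomean(&[0.5, 2.0])` is 1.0 up to floating-point rounding, whereas the arithmetic mean of the same pair is 1.25.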
onursatici left a comment
I wonder why the ClickBench files increased in size. Do you know whether the fallback layout still compresses them as dict arrays?
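As context for the question above, the idea in the PR title (skip the dict layout when it compresses poorly) can be sketched as a size-ratio check. This is an illustrative sketch only, not the Vortex implementation: `dict_compression_ratio`, `choose_layout`, `LayoutChoice`, the 4-byte code width, and the `min_ratio` threshold are all hypothetical, and the real layout writer may measure sizes quite differently.

```rust
use std::collections::HashSet;

/// Hypothetical estimate of raw size divided by dictionary-encoded size for a
/// string column. Dictionary-encoded size = bytes of the distinct values plus
/// one fixed-width code per row.
fn dict_compression_ratio(values: &[&str], code_width_bytes: usize) -> f64 {
    let raw_bytes: usize = values.iter().map(|v| v.len()).sum();
    let unique: HashSet<&str> = values.iter().copied().collect();
    let dict_bytes: usize = unique.iter().map(|v| v.len()).sum();
    let encoded_bytes = dict_bytes + values.len() * code_width_bytes;
    if encoded_bytes == 0 {
        return 1.0; // empty column: neither layout wins
    }
    raw_bytes as f64 / encoded_bytes as f64
}

/// Hypothetical outcome of the decision; not a Vortex type.
#[derive(Debug, PartialEq)]
enum LayoutChoice {
    Dict,
    Fallback,
}

/// Keep the dict layout only when it shrinks the data by at least `min_ratio`.
fn choose_layout(values: &[&str], min_ratio: f64) -> LayoutChoice {
    if dict_compression_ratio(values, 4) >= min_ratio {
        LayoutChoice::Dict
    } else {
        LayoutChoice::Fallback
    }
}
```

Under these assumptions, a column of 100 copies of one 10-byte string keeps the dict layout at `min_ratio = 1.5`, while a column of 100 distinct short strings falls back. The sketch does not say what the fallback does with the chunk afterwards, and that is exactly what the question asks: if the fallback path can still pick a dict array encoding per chunk, file sizes could move in either direction.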
No description provided.