Parquet
Compaction and Compression
- Improving Parquet Dedupe on Hugging Face Hub
- Encoding and Compression Guide for Parquet String Data Using RAPIDS | NVIDIA Technical Blog
Performance
- Optimizing Access to Parquet Data with fsspec | NVIDIA Technical Blog
- Efficiently Scaling Polars GPU Parquet Reader | NVIDIA Technical Blog
- Accelerating Apache Parquet Scans on Apache Spark with GPUs | NVIDIA Technical Blog
Columnar Formats and Other Big Data Formats