DuckDB pivot tables Data Source Setup
All necessary DuckDB binaries are already included into nreco/pivotdataservice docker image.
PivotDataService zip doesn't include DuckDB native binaries and they should be downloaded for your hosting platform separately.
Please contact us to get step-by-step instructions.
DuckDB is a powerful, in-process (built-in) database management system designed for high-performance analytics. DuckDB is good at handling complex queries on large datasets, it supports various columnar storage formats and parallel execution capabilities and offers a fast, reliable, and user-friendly solution for data analysis and manipulation.
PivotDataService can use DuckDB a data source with SQL-compatible database connector:
{ "Id": "DuckDB_DS1", "Name": "DuckDB DataSource1", "SourceType": "SqlDb", "SourceDb": { "Connector": "duckdb", "ConnectionString": "DataSource=:memory:?cache=shared;", "SelectSql": "select * from read_csv('https://www.seektable.com/demo/sales.csv')" }, "InferSchema": true }
With DuckDB connector you can use SQL to query:
- Large CSV/JSON/Parquet/Iceberg files (including multiple files at once) that are stored either locally or by URL or in the cloud storage (S3).
- Local DuckDB columnar data files that can be used as serverless data warehouse that 'lives' inside PivotDataService.
- MySql/PostgreSql servers (via DuckDB extensions)
- any data source supported by PivotDataService via special
cube_query
(see below).
Note that with DuckDB you can combine data from a wide variety of different sources.