These examples query a parquet file that contains historical taxi data from NYC. Install and Load DuckDB httpfs extensionĭuckDB’s httpfs extension allows parquet and csv files to be queried remotely over http. This delegates memory management to the engine and ensures that intermediate computations do not keep eating up memory, efficiently plotting massive datasets. The plotting module in JupySQL runs computations in the SQL engine. This approach requires loading all data into memory which is highly inefficient. The most common way to plot datasets in Python is to load them using Pandas and then use matplotlib or seaborn for plotting. You should now be able to access your files and launch an interactive notebook instance.įor more information about using this software, see Project Jupyter.% sql output_df << SELECT sum ( i ) as total_i FROM input_df Visualizing DuckDB Data ![]()
0 Comments
Leave a Reply. |