Data Configuration

Configure your data sources and processing options for the ZettaQuant SEAL pipeline.

Data Source Options

ZettaQuant supports multiple data ingestion methods to fit your workflow:

Existing Snowflake Tables

Connect to your current document and sentence tables with minimal setup.

Prerequisites:

  • Document table with document_id column
  • Sentence table with document_id, sentence_id, and sentence_text columns
  • Proper access grants configured
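
The prerequisites above can be sketched as two minimal table definitions. This is illustrative only: table names and column types are assumptions, and the pipeline requires only the listed columns, not this exact schema.

```sql
-- Illustrative sketch; names and types are placeholders.
CREATE TABLE documents (
    document_id VARCHAR NOT NULL
    -- additional metadata columns are permitted
);

CREATE TABLE sentences (
    document_id   VARCHAR NOT NULL,  -- references documents.document_id
    sentence_id   VARCHAR NOT NULL,
    sentence_text VARCHAR
);
```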

PDF Ingestion

Upload and process PDF documents directly through the Streamlit interface.

Features:

  • Drag-and-drop interface for single PDFs or ZIP archives of PDFs
  • Automatic text extraction and table creation
  • Batch processing capabilities

Requirements:

  • CREATE TABLE permissions on your target schema
  • Sufficient data access grants, as outlined in the access grants documentation, if the target tables already exist
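
As a sketch, the CREATE TABLE requirement can be granted to the application's role like this; the database, schema, and role names here are placeholders, not names the pipeline requires.

```sql
-- Placeholders: my_db, my_schema, and my_role are assumptions.
GRANT USAGE ON DATABASE my_db TO ROLE my_role;
GRANT USAGE ON SCHEMA my_db.my_schema TO ROLE my_role;
GRANT CREATE TABLE ON SCHEMA my_db.my_schema TO ROLE my_role;
```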

Large File Processing

Staging from Local Environment

For PDF or ZIP files larger than 200 MB, stage them through SnowSQL instead of uploading through the interface:

  1. Stage the file using SnowSQL:

    PUT file://path/to/your/large-file @my_stage;
  2. Process staged files through the ZettaQuant application interface

Note: For detailed information on staging files from your local environment, refer to the Snowflake documentation on the PUT command.
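
The two steps above can be sketched as one SnowSQL session. The stage name is a placeholder, and creating the stage is only needed if it does not already exist.

```sql
-- my_stage is a placeholder name.
CREATE STAGE IF NOT EXISTS my_stage;

-- Upload the local file to the internal stage.
PUT file://path/to/your/large-file @my_stage;

-- Verify the upload before processing it in the application.
LIST @my_stage;
```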

Best Practices for Large Files:

  • Compress files before staging to reduce transfer time
  • Use internal stages for better performance
  • Consider splitting very large archives into smaller batches
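
The compression advice can be combined with PUT's own options; the values below are illustrative, not required settings.

```sql
-- AUTO_COMPRESS gzips files during upload (little benefit for ZIPs,
-- which are already compressed); PARALLEL raises the upload thread count.
PUT file://path/to/your/large-file @my_stage
    AUTO_COMPRESS = TRUE
    PARALLEL = 8;
```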

Troubleshooting

Common Issues

  • Permission Errors: Verify data access grants
  • Processing Failures: Check compute pool status
  • Format Errors: Validate input document structure
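
The first two checks above can be run directly in a Snowflake session; the schema, role, and compute pool names here are placeholders.

```sql
-- Permission errors: inspect what is granted on the schema and to the role.
SHOW GRANTS ON SCHEMA my_db.my_schema;
SHOW GRANTS TO ROLE my_role;

-- Processing failures: check the compute pool's state.
SHOW COMPUTE POOLS;
DESCRIBE COMPUTE POOL my_pool;
```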