TheDocumentation Index
Fetch the complete documentation index at: https://mintlify.com/embucket/embucket/llms.txt
Use this file to discover all available pages before exploring further.
embucketd binary provides comprehensive command-line configuration for running the Embucket server. All flags can also be set via environment variables.
Usage
Server Configuration
Path to YAML config describing volumes/databases to seed the metastore.Environment variable:
METASTORE_CONFIGHost address to bind the server to.Environment variable:
BUCKET_HOSTPort number to bind the server to.Environment variable:
BUCKET_PORTService idle timeout in seconds (5 hours by default).Environment variable:
IDLE_TIMEOUT_SECONDSQuery Execution
Maximum number of queries that can run simultaneously.Environment variable:
MAX_CONCURRENCY_LEVELMaximum duration in seconds a single query is allowed to run (20 minutes by default).Environment variable:
QUERY_TIMEOUT_SECSThe maximum number of concurrent requests to get table details.Environment variable:
MAX_CONCURRENT_TABLE_FETCHESMemory Management
Memory pool type for query execution.Environment variable:
MEM_POOL_TYPEOptions:greedy- Allocates memory aggressivelyfair- Distributes memory fairly among queries
Maximum memory pool size in megabytes. If not set, uses system defaults.Environment variable:
MEM_POOL_SIZE_MBWrap memory pool with TrackConsumersPool for tracking per-consumer memory usage.Environment variable:
MEM_ENABLE_TRACK_CONSUMERS_POOLMaximum disk pool size in megabytes for spilling operations.Environment variable:
DISK_POOL_SIZE_MBEnable memory tracing functionality for debugging.Environment variable:
ALLOC_TRACINGData Formats and Parsing
Data serialization format in Snowflake v1 API.Environment variable:
DATA_FORMATSQL parser dialect to use.Environment variable:
SQL_PARSER_DIALECTOptions: snowflake, postgres, mysql, genericAuthentication
Username for demo authentication mode.Environment variable:
AUTH_DEMO_USERPassword for demo authentication mode.Environment variable:
AUTH_DEMO_PASSWORDJWT secret for authentication. This value is sensitive and will be removed from environment after loading.Environment variable:
JWT_SECRETAWS SDK Configuration
AWS SDK connection timeout in seconds.Environment variable:
AWS_SDK_CONNECT_TIMEOUT_SECSAWS SDK operation timeout in seconds.Environment variable:
AWS_SDK_OPERATION_TIMEOUT_SECSAWS SDK operation attempt timeout in seconds.Environment variable:
AWS_SDK_OPERATION_ATTEMPT_TIMEOUT_SECSIceberg Configuration
Iceberg table creation timeout in seconds.Environment variable:
ICEBERG_CREATE_TABLE_TIMEOUT_SECSIceberg catalog operation timeout in seconds.Environment variable:
ICEBERG_CATALOG_TIMEOUT_SECSObject Store Configuration
Object store operation timeout in seconds.Environment variable:
OBJECT_STORE_TIMEOUT_SECSObject store connection timeout in seconds.Environment variable:
OBJECT_STORE_CONNECT_TIMEOUT_SECSObservability
Tracing level for logs. Can be overridden by the
RUST_LOG environment variable.Environment variable: TRACING_LEVELOptions: off, info, debug, traceTracing span processor type.Environment variable:
span_processorOptions:batch-span-processorbatch-span-processor-experimental-async-runtime
OpenTelemetry Exporter Protocol.Environment variable:
OTEL_EXPORTER_OTLP_PROTOCOL