Embucket can be configured entirely through environment variables, making it ideal for containerized deployments. All CLI flags have corresponding environment variables.Documentation Index
Fetch the complete documentation index at: https://mintlify.com/embucket/embucket/llms.txt
Use this file to discover all available pages before exploring further.
Server Configuration
Path to YAML config file describing volumes/databases to seed the metastore.
Host address to bind the server to.
Port number to bind the server to.
Service idle timeout in seconds.
Query Execution
Maximum number of queries running simultaneously.
Maximum duration in seconds for a single query.
Maximum concurrent requests to get table details.
Memory and Resources
Memory pool type:
greedy or fair.Maximum memory pool size in megabytes.
Enable per-consumer memory usage tracking.
Maximum disk pool size in megabytes for spilling.
Enable memory tracing functionality.
Data Format
Data serialization format in Snowflake v1 API.
SQL parser dialect:
snowflake, postgres, mysql, or generic.Authentication
Username for demo authentication.
Password for demo authentication.
JWT secret for authentication. Automatically cleared after startup.
AWS SDK Configuration
AWS SDK connection timeout in seconds.
AWS SDK operation timeout in seconds.
AWS SDK operation attempt timeout in seconds.
Iceberg Configuration
Iceberg table creation timeout in seconds.
Iceberg catalog operation timeout in seconds.
Object Store Configuration
Object store operation timeout in seconds.
Object store connection timeout in seconds.
Observability
Tracing level:
off, info, debug, or trace. Can be overridden by RUST_LOG.Tracing span processor type.
OpenTelemetry Exporter Protocol:
grpc or http.Volume Bootstrap Variables
These variables allow you to bootstrap a volume at startup without a YAML configuration file.Type of volume to bootstrap.Options:
s3, s3tables (or s3_tables, s3-tables), memoryIdentifier name for the volume.
Optional database name to auto-create with this volume.
S3 Volume Variables
AWS access key ID for S3 volumes. Required for
s3 and s3tables volume types unless using credential provider chain.AWS secret access key for S3 volumes. Required for
s3 and s3tables volume types unless using credential provider chain.Optional AWS session token for temporary credentials.
S3 Tables Volume Variables
Amazon S3 Tables bucket ARN. Required for
s3tables volume type.Format: arn:aws:s3tables:region:account-id:bucket/bucket-nameDeployment Examples
Docker Compose
Kubernetes ConfigMap
Bootstrap S3 Tables Volume
Use AWS Credential Provider Chain
WhenVOLUME_ACCESS_KEY and VOLUME_SECRET_KEY are not set for S3 Tables volumes, Embucket will use the AWS default credential provider chain:
- Environment variables (
AWS_ACCESS_KEY_ID,AWS_SECRET_ACCESS_KEY) - Shared config files (
~/.aws/config,~/.aws/credentials) - Web Identity Tokens
- ECS (IAM Roles for Tasks) & General HTTP credentials
- EC2 IMDSv2