Configuration Concepts¶

Understand how DBWarden configuration works under the hood.

What is Configuration?¶

Configuration tells DBWarden: - Where your databases are (connection URLs) - What kind of databases they are (PostgreSQL, SQLite, etc.) - Where your SQLAlchemy models live (for migration generation) - Where to store migrations (directories)

Why Python Configuration?¶

Type Safety¶

Your IDE can help you:

primary = database_config(
    database_name="primary",  #  IDE suggests parameter names
    default=True,             #  IDE knows this is boolean
    database_type="sqlite",   #  IDE can validate enum values
    database_url_sync="...",
)

Dynamic Configuration¶

You can use Python logic:

import os

# Different config per environment
environment = os.getenv("ENV", "dev")

if environment == "production":
    database_url = "postgresql://prod-host/myapp"
else:
    database_url = "sqlite:///./dev.db"

primary = database_config(
    database_name="primary",
    default=True,
    database_type="postgresql" if environment == "production" else "sqlite",
    database_url_sync=database_url,
)

Multiple Databases¶

Easy to configure multiple databases:

DATABASES = {
    "primary": "postgresql://localhost/main",
    "analytics": "postgresql://localhost/analytics",
    "logging": "postgresql://localhost/logs",
}

for name, url in DATABASES.items():
    db = database_config(
        database_name=name,
        default=(name == "primary"),
        database_type="postgresql",
        database_url_sync=url,
        model_paths=[f"app.models.{name}"],
    )

Configuration Loading¶

Discovery Order¶

DBWarden searches for configuration in this order:

1. dbwarden.py in current directory
      not found
2. dbwarden.py in parent directories
      not found
3. Full scan for files with database_config()
      not found
4. DBWARDEN_CONFIG_MODULE environment variable
      not found
Error: No configuration found

When Configuration Loads¶

Configuration loads when you run any DBWarden command:

$ dbwarden migrate    #  Config loads here
$ dbwarden status     #  Config loads here
$ dbwarden history    #  Config loads here

Load process: 1. Python imports your config module 2. database_config() calls execute 3. Databases register in internal registry 4. Validation runs 5. Command executes with loaded config

Validation Rules¶

DBWarden validates configuration at load time:

Rule	Why It Matters
Exactly one `default=True`	CLI needs to know which DB to use when `--database` is omitted
Unique `database_name`	Commands target databases by name
Unique `database_url`	Prevents accidental duplicate configurations
Unique physical targets	Prevents two configs pointing to same DB with different credentials
Required `model_paths` in multi-DB	Keeps model discovery boundaries clear
No overlapping `model_paths`	Prevents ambiguous model ownership (unless `overlap_models=True`)

Validation Timing¶

# dbwarden.py
primary = database_config(
    database_name="primary",
    default=True,
    database_type="postgresql",
    database_url_sync="postgresql://localhost/myapp",
)

primary = database_config(
    database_name="primary",  #  Duplicate!
    default=True,
    database_type="postgresql",
    database_url_sync="postgresql://localhost/other",
)

When you run dbwarden migrate:

Error: Duplicate database_name 'primary'

Validation happens before any commands execute.

Config Source Precedence¶

When looking for config, DBWarden uses this precedence:

Top-level dbwarden.py: the conventional standalone config file at your project root. This is the default scaffold created by dbwarden init, not the only valid location. Always sandboxed (only dbwarden imports allowed).
DBWARDEN_CONFIG_MODULE: an explicit environment variable override. Always imported normally as a Python module (no sandbox). This is the escape hatch for projects with ambiguous full-scan results or non-standard layouts.
Full-scan discovery: if neither of the above produces a config source, DBWarden walks your project tree looking for any database_config(...) call. This means database_config(...) can live in any discovered Python file inside your project. Files directly at the project root are sandboxed; files inside subdirectories are imported normally.

Config Loading Security (Sandbox)¶

DBWarden applies import restrictions only to config files that are isolated (sandboxed) vs in-package (normal import):

Mode	Import behavior	Applies to
`isolated`	Sandboxed: only `dbwarden.*` imports allowed	Top-level `dbwarden.py`; any full-scan-discovered file at the project root
`in-package`	Normal Python import	Full-scan-discovered files inside subdirectories; `DBWARDEN_CONFIG_MODULE` modules

An isolated config file runs in a sandbox that prevents accidental escalation of file-read access to arbitrary code execution. Only dbwarden and its submodules can be imported.

An in-package config file is imported as a normal Python module, with full access to app.* and any other project imports. This is the correct path when your database_config(...) call lives in an application package that imports other project modules.

Import root detection. For full-scan-discovered files, DBWarden tries to resolve the dotted module path. It checks two common import roots in order:

src/ (PEP 517/518, setuptools, poetry)
The project root itself

For example, src/myapp/databases.py resolves as myapp.databases with import root src/. If neither root produces an importable path, the file falls back to isolated (sandboxed). Projects with other layouts should set DBWARDEN_CONFIG_MODULE explicitly.

Path validation (path traversal blocking) applies to all file-based sources regardless of mode.

For debugging, set DBWARDEN_DISABLE_SANDBOX=1 to disable the sandbox for isolated files:

DBWARDEN_DISABLE_SANDBOX=1 dbwarden status  # Skip sandbox (debug only)

Disabling the sandbox also removes import restrictions for isolated config files, which can be useful in development. Keep it enabled in production.

The `default` Database¶

Why `default=True` Exists¶

Consider these commands:

# Explicit database
$ dbwarden migrate --database primary

# Implicit database (uses default)
$ dbwarden migrate

Without default=True, DBWarden wouldn't know which database to use for the second command.

Only One Default¶

#  Good
analytics = database_config(
analytics = database_config(database_name="analytics", default=False, ...)  # or omit default

#  Bad - two defaults
analytics = database_config(
analytics = database_config(database_name="analytics", default=True, ...)  # Error!

Default Affects CLI Behavior¶

# These are equivalent when primary is default:
$ dbwarden migrate
$ dbwarden migrate --database primary

# These are NOT equivalent:
$ dbwarden migrate
$ dbwarden migrate --database analytics  # Targets analytics, not primary

Model Discovery¶

What Are `model_paths`?¶

model_paths tells DBWarden where your SQLAlchemy models live:

primary = database_config(
    database_name="primary",
    default=True,
    database_type="postgresql",
    database_url_sync="postgresql://localhost/myapp",
    model_paths=["app.models"],  #  Look here for models
)

How Discovery Works¶

1. Import each module in model_paths

2. Find all classes inheriting from DeclarativeBase

3. Extract table metadata (__tablename__, columns, etc.)

4. Build internal representation for migration generation

Filtering by Table Name¶

When two databases share the same model_paths but should own different subsets of tables, use model_tables:

primary = database_config(
    database_name="primary",
    default=True,
    database_type="postgresql",
    database_url_sync="postgresql://localhost/main",
    model_paths=["app.models"],
    model_tables=["users", "posts", "comments"],
)

analytics = database_config(
    database_name="analytics",
    database_type="clickhouse",
    database_url_sync="http://clickhouse-host:8123/analytics",
    model_paths=["app.models"],
    model_tables=["analytics_events", "analytics_sessions"],
)

This is useful when all models live under one shared package but each database only owns a subset. DBWarden validates every name in model_tables exists among the discovered tables and prevents overlap between databases (unless overlap_models=True).

When Is It Required?¶

Single database: Optional (DBWarden scans entire codebase)

# This works
primary = database_config(
    database_name="primary",
    default=True,
    database_type="sqlite",
    database_url_sync="sqlite:///./app.db",
    # No model_paths needed
)

Multiple databases: Required for each database

# This is required
primary = database_config(
    database_name="primary",
    default=True,
    database_type="postgresql",
    database_url_sync="postgresql://localhost/main",
    model_paths=["app.models.primary"],  #  Required
)

analytics = database_config(
    database_name="analytics",
    database_type="postgresql",
    database_url_sync="postgresql://localhost/analytics",
    model_paths=["app.models.analytics"],  #  Required
)

Why? To prevent ambiguity about which models belong to which database.

Dev Mode¶

What Is Dev Mode?¶

Dev mode lets you use a different database for local development:

primary = database_config(
    database_name="primary",
    default=True,
    database_type="postgresql",              # Production
    database_url_sync="postgresql://prod/myapp",
    dev_database_type="sqlite",              # Development
    dev_database_url="sqlite:///./dev.db",
)

Run commands with --dev:

$ dbwarden --dev migrate  # Uses SQLite
$ dbwarden migrate        # Uses PostgreSQL

How It Works¶

Command: dbwarden --dev migrate

Check for --dev flag

Swap database_type → dev_database_type
Swap database_url → dev_database_url

Connect to dev database

Execute command

Why Use It?¶

Speed: - SQLite is faster than PostgreSQL for local iteration - No network latency - No server setup

Safety: - Can't accidentally affect production - Each developer has their own isolated database - Easy to reset (just delete the file)

Simplicity: - No Docker containers needed - No database server installation - Works on all platforms

Multi-Database Configuration¶

Why Multiple Databases?¶

Common scenarios: - Separation of concerns - Transactions vs analytics - Performance - Offload reporting to separate database - Compliance - Audit logs in separate database - Legacy systems - New and old databases coexist

How It Works¶

Each database_config() call registers an independent database:

primary = database_config(
    database_name="primary",
    default=True,
    database_type="postgresql",
    database_url_sync="postgresql://localhost/main",
    model_paths=["app.models.primary"],
)

analytics = database_config(
    database_name="analytics",
    database_type="postgresql",
    database_url_sync="postgresql://localhost/analytics",
    model_paths=["app.models.analytics"],
)

They're completely independent: - Separate migration histories - Separate migration directories - Separate model sets - Can use different database types

Model Path Boundaries¶

app/
  models/
    primary/
      user.py         #  Goes to primary database
      order.py
    analytics/
      event.py        #  Goes to analytics database
      metric.py

Configuration:

primary = database_config(
    database_name="primary",
    model_paths=["app.models.primary"],  #  Only primary models
    ...
)

analytics = database_config(
    database_name="analytics",
    model_paths=["app.models.analytics"],  #  Only analytics models
    ...
)

Secure Values¶

What Is `secure_values`?¶

Prevents credentials from appearing in terminal output:

import os

DATABASE_URL = os.getenv("DATABASE_URL")

primary = database_config(
    database_name="primary",
    default=True,
    database_type="postgresql",
    database_url_sync=DATABASE_URL,
    secure_values=True,  #  Hide credentials
)

Without `secure_values`:¶

$ dbwarden settings show
Database: primary
  URL: postgresql://user:SECRET_PASSWORD@prod-host/myapp

With `secure_values`:¶

$ dbwarden settings show --all
Database: primary
  URL: DATABASE_URL (expression)

Shows the variable name instead of resolved value.

Configuration vs Runtime¶

Configuration Time¶

When config loads: - database_config() calls execute - Validation runs - Internal registry populates - No database connections made

Runtime¶

When commands run: - DBWarden reads from registry - Connects to database - Executes command logic

Key point: Configuration errors are caught early, before any database operations.

What's Next?¶

Connection URLs - URL format reference
Model Discovery - Deep dive into model paths
Dev Mode - Local development workflows
Multi-Database - Multi-database patterns

Configuration Concepts¶

What is Configuration?¶

Why Python Configuration?¶

Type Safety¶

Dynamic Configuration¶

Multiple Databases¶

Configuration Loading¶

Discovery Order¶

When Configuration Loads¶

Validation Rules¶

Validation Timing¶

Config Source Precedence¶

Config Loading Security (Sandbox)¶

The default Database¶

Why default=True Exists¶

Only One Default¶

Default Affects CLI Behavior¶

Model Discovery¶

What Are model_paths?¶

How Discovery Works¶

Filtering by Table Name¶

When Is It Required?¶

Dev Mode¶

What Is Dev Mode?¶

How It Works¶

Why Use It?¶

Multi-Database Configuration¶

Why Multiple Databases?¶

How It Works¶

Model Path Boundaries¶

Secure Values¶

What Is secure_values?¶

Without secure_values:¶

With secure_values:¶

Configuration vs Runtime¶

Configuration Time¶

Runtime¶

What's Next?¶

The `default` Database¶

Why `default=True` Exists¶

What Are `model_paths`?¶

What Is `secure_values`?¶

Without `secure_values`:¶

With `secure_values`:¶