Platform Mode
In platform mode, vectors are stored directly in your own database alongside your source data.
How It Works
- You create a connection to your database (Snowflake or PostgreSQL)
- You create a vector table — Embedd.to creates a vector table in your database
- On backfill, source data is read, embedded, and written back to your database
- Sync keeps vectors up to date as source data changes
When to Use Platform Mode
- Data residency — Keep vectors in your own infrastructure
- Join with existing data — Vector tables live alongside source tables, enabling SQL joins
- Existing database investment — Leverage your Snowflake or PostgreSQL infrastructure
Supported Providers
Snowflake
- Uses
VECTOR(FLOAT, N)type for vector storage - Supports Snowflake Cortex for native embeddings (no external embedding provider needed)
- Uses
VARIANTtype for metadata storage
PostgreSQL
- Uses pgvector's
vector(N)type for vector storage - Requires the pgvector extension to be installed
- Uses
JSONBtype for metadata storage - HNSW indexes with
vector_cosine_opsfor fast similarity search - Requires an external embedding provider (OpenAI, Gemini)
Atomic Swap Re-Backfill
When you change the embedding model or column configuration, Embedd.to performs an atomic swap:
- Creates a new vector table with the updated configuration
- Backfills the new table with fresh embeddings
- Atomically renames the tables (old → _old, new → live)
- Drops the old table
This ensures zero downtime during re-embedding operations.
Architecture
Source DB → Embedd.to → Embedding Provider
↓
Source DB (vector table created by Embedd.to)
↓
Query API (reads from your DB)