Architecture // Storage Engine
Distributed SQL, shared-nothing, peer-to-peer architecture. All nodes symmetrical; any node can handle reads/writes. Cluster uses distributed consensus: No matter where data lives, every node can access data anywhere in cluster // Built on Pebble, a highly tuned, GO-based, LSM-tree key-value storage engine developed by Cockroach Labs and inspired by RocksDB specifically for distributed SQL
Distributed SQL, shared-nothing. Two key services: YB-Master (metadata, coordination, load balancing) and YB-TServer (data storage, query execution). Data split into tablets; each tablet has a Raft-elected leader that handles writes and replicates to follower replicas // DocDB, a customized LSM-tree document store built on a heavily modified version of RocksDB, written in C++. Raft consensus used for replication at the storage layer; RocksDB WAL replaced by Raft logs
Ideal Workloads
SYSTEM OF RECORD. Optimized for transactional workloads that require strong consistency and global distribution, such as AI innovators, cybersecurity, eCommerce & retail, financial services, fintech/payments, gaming, quant/trading & research, and online travel
HORIZONTAL SCALE BEYOND WHAT POSTGRES CAN SUPPORT. Cloud-native OLTP and scale-out RDBMS workloads; migrations from PostgreSQL or Cassandra. Common use cases include financial services, retail, gaming, and microservices
Auto-Sharding (Dynamic Re-Sharding Online)
NATIVE & AUTOMATIC. Automatically shards data into ranges and dynamically splits, merges, and rebalances online across nodes based on load and size. Zero downtime, fully transparent
NATIVE & AUTOMATIC. Hash- or range-based sharding into tablets; tablets split automatically when reaching a size threshold, and rebalance across nodes. Hash-based sharding is the default; range-based is configurable. Hash default can affect range scan performance
Automatic Geo-Partitioning (Multi-Region Data Affinity)
NATIVE AND AUTOMATIC. Declarative SQL schema adjustments automatically route, partition, and anchor data close to the user's location, automatically moving data to the region where it is most frequently accessed. Supports geo-partitioning with zone configurations for data locality, compliance, and low latency
NATIVE. Supported via row-level geo-partitioning using tablespaces and a designated partition column. Developer must define the partition column and assign tablespaces to regions; placement is then enforced automatically. Adding a new region requires creating a new tablespace and partition
Availability including Multi-Cloud and Hybrid
Available on all public clouds, e.g., AWS-Google Cloud-Azure; can run a single logical cluster spanning multiple clouds. Can run on prem/local, and cloud + prem hybrid deployments
Deployable on AWS, GCP, and Azure; supports on-premises, hybrid, and multi-cloud via YugabyteDB Anywhere. A single logical cluster can span multiple clouds or data centers
Change Data Capture (CDC)
NATIVE. CHANGEFEED command enables scalable, resilient streaming of data changes to Kafka, cloud storage, and webhooks; no third-party CDC tool needed. CDC Queries enable SQL-based filtering and transformation of streams
SUPPORTED via a Debezium-based gRPC connector and PostgreSQL logical replication (available since v2024.1.1). Streams row-level changes to Kafka and compatible consumers. At-least-once delivery guaranteed; cross-tablet ordering is not guaranteed
Cluster Sizes // Scale
SINGLE BINARY, ALL ROLES ON EVERY NODE. No separate compute, storage, or metadata layers. Minimum 3 nodes for HA (one per availability zone), scales linearly to hundreds or thousands by simply adding nodes. Development and testing run on a single node. Minimum production configuration on CockroachDB Advanced: 3 nodes × 4 vCPUs = 12 vCPUs // Virtually unlimited horizontal scale-out simply by adding more nodes
Separate YB-Master (metadata) and YB-TServer (data/query) roles; typically co-located in deployment. Minimum 3 nodes for HA (RF=3). Compute and storage scale together; independent compute-without-storage scaling is not supported // Horizontal scale-out by adding nodes; automatically shards and rebalances tablets across the cluster
Data Anomalies
ZERO under Serializable isolation: all standard SQL anomalies (dirty reads, non-repeatable reads, phantom reads, lost updates, and write skew) are prevented by default with no additional developer configuration. Read Committed is also available for workloads where some consistency relaxation is an acceptable tradeoff for reduced latency
ZERO under Serializable isolation; all standard SQL anomalies are prevented. Default effective isolation is Snapshot (equivalent to Repeatable Read), which prevents dirty reads and non-repeatable reads but does not prevent write skew without SELECT FOR UPDATE. Read Committed requires explicit flag enablement prior to v2025.2
Data Integrity & Foreign Keys Support
Provides strict ACID enforcement at the storage layer and full referential integrity. NATIVELY VALIDATES FOREIGN KEYS, explicit CHECKs, and transactional constraints to ensure absolute global correctness across global nodes
Strict ACID enforcement in YSQL with full referential integrity; foreign keys and CHECK constraints enforced across distributed nodes. YCQL does not enforce referential integrity or foreign keys
Data Model Complexity
LOW. Relational model with strict schemas, normalized tables, joins, and referential integrity. Ideal for managing complex relationships and transactional systems of record; adapts easily to microservices and enterprise legacy systems
LOW. Relational model via YSQL (PostgreSQL-compatible) with joins, foreign keys, and referential integrity. Semi-relational model via YCQL (Cassandra-compatible). Foreign keys and joins are not supported in YCQL
Data Residency
STRONG, INTUITIVE, AND LOCALITY-AWARE. Helps fulfill compliance (e.g., GDPR, CCPA) with Row-Level Control: can pin specific rows to specific geographic regions using REGIONAL BY ROW command, while preserving single logical data platform. Business and compliance teams can use simple SQL commands to ensure customer data never leaves specific geographic borders
LOCALITY-AWARE BUT REQUIRES SCHEMA DESIGN. Row-level geo-partitioning via tablespaces pins specific rows to specific geographic regions, supporting GDPR and other data residency requirements. Requires schema design with a designated partition column to implement and maintain
Developer Tools // Developer Experience // Ease of Use
Rich ecosystem: Local CLI, web console UI, ORMS, BI tools, SQL clients, native DB migration toolkits, language-specific drivers, and compatibility with standard PostgreSQL developer tools like psql // PostgreSQL wire protocol-compatible; feels exactly like developing on standard PostgreSQL. Fits effortlessly into existing ORMs, drivers, and frameworks // Can be spun up instantly in any environment (AWS, GCP, on-prem) with the exact same management interface. The cluster manages its own data balancing, scaling, and hardware survival automatically; DBAs do not need to be distributed systems experts to keep it running smoothly.
yugabyted CLI, web console UI, YugabyteDB Voyager migration tool, Kubernetes Operator, topology-aware smart drivers, and compatibility with psql and standard PostgreSQL ecosystem tools // Reuses PostgreSQL query layer (YSQL); compatible with most PostgreSQL drivers, ORMs, and tools but compatibility gaps exist. Offers a Cassandra-compatible API (YCQL) // Deployable via yugabyted on Kubernetes, VMs, or bare metal. YugabyteDB Anywhere provides management UI for multi-cloud fleet operations. Geo-partitioning and topology tuning require distributed systems knowledge
Distributed ACID Transactions
Fully distributed, multi-row, multi-table ACID transactions out-of-the-box. Fully supported with serializable isolation using distributed consensus (Raft Protocol) across tables, ranges, and regions; strong ACID guarantees
Multi-row, multi-table distributed ACID transactions via two-phase commit and Raft replication. Transaction design inspired by Google Spanner, using hybrid logical clocks for consistency across distributed nodes
Enterprise Support
Dedicated 24/7/365 enterprise support directly from Cockroach Labs with strict SLAs and custom engineering channels. Offers global follow-the-sun support (TSE+SRE) with proven reliability and global partnerships with industry leaders. Single Global Incident Management integrates Engineering + Support + Customer Success in one channel for consistency/immediacy
Enterprise support available through Yugabyte for YugabyteDB Anywhere and Aeon deployments. Community support via Slack, forums, and Yugabyte University online courses. Support SLAs vary by commercial agreement
FinOps Support
HIGH. Straightforward pricing based on predictable node usage or consumption metrics. Avoids hidden, fluctuating network traps when moving data across different infrastructure regions. Supports financial governance/FinOps
MODERATE TO HIGH. Open source (Apache 2.0) for self-managed deployments. YugabyteDB Aeon managed service pricing based on cluster size, instance type, storage, and cloud provider. Egress costs vary by cloud provider and deployment topology
Follower Reads
SUPPORTED. Supports follower/replica reads with Bounded (controlled) Staleness, allowing low-latency local reads from nearby replicas while keeping strong global ordering
SUPPORTED. Reads can be served from follower replicas with configurable bounded staleness, reducing read latency by accessing a geographically closer replica
FREEDOM
ZERO VENDOR LOCK-IN. Runs on any public or private cloud, across multiple clouds, via CockroachDB's Bring Your Own Cloud (BYOC) offering, on-premises, bare metal, Kubernetes, self-hosted, or in a hybrid deployment encompassing some or all of these. Business Source License (BSL) but Source Available. Full commercial-grade support directly from CockroachDB
ZERO VENDOR LOCK-IN. Core database is open source under the Apache 2.0 license. Deployable on any public or private cloud, on-premises, Kubernetes, VMs, or bare metal via YugabyteDB Anywhere or Aeon BYOC. No proprietary lock-in on core database features
Joins
Executes fully distributed hash joins, merge joins, and lookup joins across arbitrary nodes with CockroachDB's advanced Cost-based Optimizer. Full standard SQL support for complex INNER, OUTER, LEFT, RIGHT joins across distributed tables
INNER, LEFT, RIGHT, FULL OUTER, and CROSS joins supported in YSQL. Distributed query execution with nested loop, hash, and merge join algorithms. Adaptive Cost-Based Optimizer accounts for data co-location and distributed tablet placement. YCQL does not support joins
LDAP Support
NATIVE. Direct native support for external authentication systems like LDAP, Active Directory, GSSAPI, and OIDC
NATIVE. Supported in YSQL and YCQL APIs, and in the YugabyteDB Anywhere management platform. Supports simple bind and search-and-bind modes, with optional TLS encryption between the database and LDAP server
Migrations
Uses MOLT (Migration Off Legacy Technology) Toolkit & change data capture (CDC): MOLT handles schema conversion/verification and CDC moves data out. PostgreSQL wire protocol compatibility enables lift-and-shift; shadow mode testing
YugabyteDB Voyager (free, open source) handles schema assessment, conversion, and data migration from PostgreSQL, MySQL, Oracle, and Cassandra. Supports assessment reports, shadow testing, and cutover workflows
Multi-Active
YES: FULLY MULTI-ACTIVE/MULTI-REGION; read/write and handle connection requests from any node in the cluster. All nodes are equal and active; any node can accept read and write traffic simultaneously.
YES: PARTIALLY. Multi-active writes possible via synchronous multi-region clusters or bidirectional async xCluster replication. Writes route to the tablet leader for each shard; not all nodes simultaneously accept writes to all data without topology configuration
Multi-Data-Center Support
FULL. Connects geographically isolated, heterogeneous data centers (AWS, GCP, Azure, on-prem) into a single logical cluster, supported by features such as Physical Cluster Replication (PCR) and Logical Data Replication (LDR)
FULL. Synchronous replication across 3+ regions within a single cluster; asynchronous xCluster replication across 2+ independent clusters for DR or multi-active. YugabyteDB Anywhere manages multi-DC and multi-cloud deployments from a single control plane
Multi-region Functionality // Multi-region Writes
ACTIVE-ACTIVE: Read/Write from any node in any region; built-in low-latency local access patterns and Survival Goals (e.g., ALTER DATABASE...SURVIVE REGION FAILURE) commands configure fault tolerance intent // True multi-region, multi-active writes: any node in any region can serve reads and writes while preserving serializable consistency guarantees
Active-active via synchronous multi-region Raft replication (3+ regions) or asynchronous bidirectional xCluster replication (2+ independent clusters). Supports geo-partitioning and read replicas. Cross-region write latency reflects consensus round-trips // Synchronous multi-region writes via Raft consensus across 3+ regions; asynchronous bidirectional (multi-master) xCluster replication across 2+ independent clusters. Synchronous writes incur cross-region round-trip latency overhead
Replication
Built-in, automatic consensus replication using the Raft protocol; data is divided into ranges and replicated across nodes
Raft consensus-based synchronous replication within a cluster (default RF=3). xCluster supports asynchronous unidirectional or bidirectional replication across independent clusters for disaster recovery or multi-active deployments
Required Downtime
ZERO. Online schema changes, rolling upgrades, and cluster expansion occur without taking the data platform offline
ZERO downtime for rolling upgrades, node failures, and distributed backups. Online schema changes supported; transactional DDL available (GA in 2025). Some complex DDL in multi-region deployments may require careful planning
Resilience
Five 9s availability: Survives node/disk/rack/region failures automatically via Raft consensus, with zero data loss (RPO=0). Naturally resilient to outages with granular row-level control
Survives node, zone, and region failures via Raft consensus. Self-healing: automatically re-replicates data when a node is lost. Zero downtime during rolling software upgrades and distributed backups
Schema Changes
FULLY ONLINE & NON-BLOCKING. Online transactional schema changes (add/alter columns, indexes, constraints) run in the background without locking tables with zero downtime. Designed for always-on services
FULLY ONLINE; MOST NON-BLOCKING. Online schema changes supported; transactional DDL in GA as of 2025. Most operations are non-blocking. Complex schema changes in multi-region deployments may require additional coordination
Security-Privacy-Compliance
RBAC, Encryption at Rest with Customer Managed Encryption Keys (CMEK), TLS encryption in transit, IAM integrations, column-level encryption, and robust data-masking natively. Fine-grained encryption at cluster, database, table, or partition levels. Certified SOC 2 Type II and SOC 3, PCI-DSS, HIPAA, and ISO 27001-27017-27018 compliant, with ISO 42001 (Responsible, Ethical, and Safe AI Governance) pending. CockroachDB CIS Benchmarks to deploy hardened CockroachDB configurations. Comprehensive support for GDPR and DORA compliance
RBAC, encryption at rest with CMK support (AWS KMS, Azure Key Vault, GCP KMS), TLS in transit, LDAP and OIDC authentication, column-level encryption via pgcrypto, audit logging via pgaudit. Certified SOC 2 Type II, ISO 27001, PCI DSS Level 1. Row-level geo-partitioning supports data residency requirements including GDPR
SQL Compatibility
HIGH. PostgreSQL Wire Compatible: Uses PG wire protocol; strong ANSI SQL with complex queries, joins, window functions, triggers, stored procedures, and UDFs. Supports spatial data, extensions, syntax; most apps connect with minimal or no changes
HIGH. PostgreSQL wire protocol compatible (YSQL); reuses the PostgreSQL query layer. Based on PostgreSQL 15 as of 2025. Some extensions and DDL behaviors differ from standard PostgreSQL; compatibility gaps remain for certain advanced features
Stored Procedures
SUPPORTED AND MATURE. PL/pgSQL and other languages such as Python and Perl support deep procedural logic, autonomous transactions, and complex business rule enforcement. Supports user-defined stored procedures
SUPPORTED AND MATURE in YSQL via PL/pgSQL, reusing PostgreSQL's stored procedure and user-defined function framework. Triggers, event triggers, and extensions supported through the reused PostgreSQL layer
Transaction Performance // Isolation Levels
Optimized for OLTP with strong consistency; cross-region transactions maintain data correctness. Optimizations like Parallel Commits drop distributed execution overhead to a single network round-trip for most transactions // Enforces strict Serializable isolation exclusively, the strongest isolation level, to ensure zero data anomalies under heavy parallel traffic, and Read Committed
Supports Serializable, Snapshot (maps to Repeatable Read), and Read Committed. YCQL supports Snapshot isolation only // Single-shard transactions use a fast path that bypasses the transaction status tablet for lower latency. Cross-shard transactions use two-phase commit; cross-region write latency tied to Raft consensus round-trips
Triggers & Deferrable Constraints
FULLY SUPPORTED. Supports triggers and deferrable constraints across all deployment models
FULLY SUPPORTED. Triggers and event triggers supported in YSQL via the PostgreSQL trigger framework. Deferrable constraints supported
Vector Search
BUILT-IN NATIVE VECTOR SEARCH, scalable distributed HNSW/IVF indexing, and pgvector (the industry standard for vector similarity search). CockroachDB's C-SPANN provides distributed vector indexing (ANN) at scale; available across all tiers. Suited for AI/ML inference and RAG applications where vectors and transactional data coexist in one engine without a separate vector database
BUILT-IN NATIVE VECTOR SEARCH. Supports vector similarity search via HNSW indexing for AI and RAG workloads. pgvector-compatible. Vectors and transactional data can coexist in the same cluster
Writes and Query Routing
Every node is a gateway to the entirety of the database for unlimited reads and writes in any region. Any node can accept SQL queries; a Distributed Optimizer routes work to the right ranges/replicas based on locality and cost
Writes route to tablet leader for target row; smart drivers route queries to the appropriate node. YB-Master tracks tablet leadership. Cross-node routing occurs when queried data spans multiple tablets
PRICING
SIMPLE. Commercial Enterprise: Simple, straightforward pricing, plus the ability to tie data to a location to avoid egress costs. Free for single-node/dev. Free Community Tier
SIMPLE. Open source (Apache 2.0) for self-managed deployments. YugabyteDB Aeon managed service pricing based on cluster size, instance type, storage, and cloud provider. YugabyteDB Anywhere for self-managed enterprise. Free tier available for development and testing