OceanBase Database Enterprise Edition V4.3.3|V4.3.5| docs|Distributed Database

OceanBase Database Enterprise Edition V4.3.3

Last Updated：2025-06-25 08:02:52 Updated

Version information

Release date: September 30, 2024
Version: V4.3.3

Overview

OceanBase Database V4.3.3 is the first General Availability (GA) release in the V4.3 series, offering significant advancements in key areas. It introduces support for vector types and indexes for AI-related analysis and processing, enhancing its core relational database capabilities. Additionally, it implements a new form of columnstore replicas, ensuring physical isolation between transaction processing (TP) and analytical processing (AP) resources in hybrid transaction/analytical processing (HTAP) scenarios. This version also boosts the overall performance of AP queries. Furthermore, the update includes support for the complex ARRAY data type, improved computing performance for the RoaringBitmap data type, enhanced query rewriting based on materialized views, extended external table features, better import performance for external tables, and optimized strategies for plan generation and execution of AP SQL statements. These enhancements significantly improve the database's ability to handle online analytical processing (OLAP) workloads. Additionally, the new version offers a quick restore feature, allowing you to restore only logs instead of data to your local server for faster read and write services. It also introduces support for query-level resource groups, enhancing system reliability and usability. Most features from OceanBase Database V4.2.4 and earlier are supported in V4.3.3. An upcoming integrated version will target online transaction processing (OLTP) scenarios.

Key features

Kernel enhancements

Vector types and indexes (Experimental)

The rise of AI applications has led to a surge in unstructured data like images, videos, and texts. Embedding algorithms are now able to represent this data as high-dimensional vectors for analysis and processing. This gave rise to vector databases, which are fully managed solutions for handling unstructured data, including storing, indexing, and retrieving embedding vectors. Vector indexes are a key feature, transforming keyword-based searches into vectorized retrievals. This shift turns deterministic searches into similarity searches, enabling the retrieval of large-scale, high-dimensional vectors.

OceanBase Database V4.3.3 in MySQL mode now supports vector type storage, vector indexes, and embedding vector retrieval. It handles float vectors with up to 16,000 dimensions and supports basic operations like addition, subtraction, multiplication, comparison, and aggregation. It also offers precise search and Approximate Nearest Neighbor Search (ANNS) with Hierarchical Navigable Small World Network (HNSW) indexes for up to 2,000 dimensions. This can be applied to Retrieval-Augmented Generation (RAG) for scenarios such as image and video retrieval, behavior preference recommendations, security and fraud detection, and ChatGPT-like applications.

This feature is in the experimental stage in OceanBase Database V4.3.3, and will be enhanced for production use in later versions.
ARRAY data type

ARRAY is a complex data type commonly used in AP business scenarios to store multiple elements of the same type. It is an ideal choice for managing and querying multi-valued attributes that cannot be effectively represented by relational data. Starting from OceanBase Database V4.3.3, the ARRAY data type is supported in MySQL mode. You can define a column as either a numeric or character array, including nested arrays, when creating a table. It supports expressions for querying or writing to array objects, array_contains expressions, and the ANY operator to check if an element is present in the array. Additionally, it supports operators like +, -, =, and != for calculations and comparisons of array elements.
Read-only columnstore replicas (Experimental)

OceanBase Database supports columnar storage since V4.3.0. You can create a columnstore table, rowstore table, or hybrid rowstore-columnstore table based on the business type. The replica deployment form is consistent among zones of the same tenant, regardless of the table form. For example, suppose the units of Tenant X are distributed across three zones in 1:1:1 mode. If you create a hybrid rowstore-columnstore table named T1 in Tenant X, the T1 table has a rowstore replica and a columnstore replica in each of the zones. To implement physical isolation between TP resources and AP resources in HTAP scenarios, OceanBase Database V4.3.3 introduces a new deployment form that allows you to add an independent zone for storing read-only columnstore replicas based on the original cluster. In the new zone, all user tables are stored in columnar storage mode. In an AP business scenario, you can use an independent OceanBase Database Proxy (ODP) for database connection and set the session-level system variable ob_route_policy, which specifies the routing strategy, to COLUMN_STORE_ONLY so that you can access only columnstore replicas for queries and analysis in weak-consistency read mode, without affecting TP business. Compared with the hybrid rowstore-columnstore architecture, the 3+1 deployment form can help reduce the storage overhead. This feature must be used in combination with ODP V4.3.2 or later.

This feature is in the experimental stage in OceanBase Database V4.3.3, and will be enhanced for production use in later versions.
Enhanced materialized view capabilities

To reduce the manual rewrite workload, OceanBase Database supports materialized view-based rewriting since V4.3.1. When the system variable QUERY_REWRITE_ENABLED is set to True, you can specify the ENABLE QUERY REWRITE clause in the materialized view creation statement to enable automatic rewriting. After automatic rewriting is enabled, the system can rewrite table queries into materialized view-based queries. OceanBase Database V4.3.1 supports rewriting a table query into a query based on a non-aggregate materialized view as long as the FROM clause in the original query fully matches that in the materialized view and the WHERE condition partially matches that in the materialized view. OceanBase Database V4.3.3 also supports query rewriting in the following scenarios when specific conditions are met: (1) The join order or join type in the FROM clause of the original query is different from that of the materialized view. (2) The original query contains tables not included in the materialized view. (3) The original query needs to be rewritten into a query based on an aggregate materialized view. (4) Aggregation rollup is involved.

In addition, the new version extends the SQL types supported for incremental refreshes and real-time materialized views. Earlier versions already support incremental refreshes and real-time queries in single-table aggregate and multi-table join scenarios. OceanBase Database V4.3.3 extends the support for incremental refreshes and real-time queries in join aggregate scenarios.

OceanBase Database supports only rowstore materialized views in earlier versions. OceanBase Database V4.3.3 also supports columnstore materialized views, making it possible to achieve higher query performance in complex analysis scenarios that involve materialized view references.
Enhanced INSERT OVERWRITE statement

OceanBase Database supports table-level overwrite based on the INSERT OVERWRITE statement since V4.3.2. Specifically, the database can empty old data and write new data in a table in an atomic manner. This feature applies to AP business scenarios that involve periodic data update, data conversion, data cleansing, and data correction. However, in OceanBase Database V4.3.2, you can overwrite only a whole table, and cannot overwrite the data of specific partitions or columns. In OceanBase Database V4.3.3, you can specify partitions, subpartitions, or columns of the destination table in the INSERT OVERWRITE statement, allowing for more flexibility in data overwriting and making it applicable to a wider range of business scenarios.
Support for compressed files in LOAD DATA statements and external table queries

The LOAD DATA statement is commonly used to import text files from a server or client, or in direct load mode to a database. In earlier versions, you can import only normal text files. To import a compressed file such as a GZIP file, you must first decompress the file, which makes the operation complex. In the new version, you can import compressed files in GZIP, DEFLATE, and ZSTD formats. The files are loaded, decompressed, and written to the database simultaneously during the import process. Compressed files are also supported for external tables with external files in the CSV format. You can directly query data from a compressed file in any of the preceding formats by accessing the corresponding external table.
Partition-wise data export by using the SELECT INTO OUTFILE statement

OceanBase Database of earlier versions allow you to use the SELECT INTO OUTFILE statement to export multiple files. However, you cannot export data in a partition-wise manner. OceanBase Database V4.3.3 allows you to export data by partition to obtain a clearer directory structure. You can also build a partitioned external table based on the file directories and improve external table query efficiency through partition pruning.
Distributed late materialization

In a single-table query, the database usually filters data based on the index table and then queries row data from the primary table to avoid a full-table scan. Each access to the primary table is a random I/O query. If a large amount of data is involved in table access during the execution of an SQL statement, a significant amount of performance resources will be consumed. Take the statement SELECT * FROM t1 WHERE c1 > 1 ORDER BY c2 LIMIT 10; as an example. If a composite index on c1 and c2 exists, the generated execution plan typically filters data that meets the c1 > 1 condition in the index table, then accesses the t1 table to retrieve data, and finally selects the top 10 data records. If a large amount of data is involved in table access, the performance will be poor. However, if you modify the execution plan to first filter data based on the index table, then sort the data and select the top 10 data records, and finally query the primary table for 10 times, the amount of performance resources consumed by random I/Os can be significantly reduced. This optimization strategy is called late materialization. Simply put, late materialization is a strategy where a single-table query sorts the filtered data and limits the number of data records returned before retrieving data from the primary table, thus reducing performance resource consumption by random I/Os.

OceanBase Database has previously supported the late materialization feature for non-partitioned tables, and this capability has been extended in V4.3.3 to include support for partitioned tables.
Hyperscan support for regular expressions

OceanBase Database of earlier versions use International Components for Unicode (ICU) as the underlying regular expression engine. ICU provides full support for unicode character sets and abundant external interfaces. However, its performance is poorer compared to Hyperscan. OceanBase Database V4.3.3 supports Hyperscan as the regular expression engine, which obviously improves the matching performance of regular expressions. However, Hyperscan supports only the x86 architecture and does not apply to the Arm platform. Therefore, Hyperscan is not enabled by default. If the cluster contains only servers of the x86 architecture and demands higher matching performance, you can set the _regex_engine parameter to Hyperscan.
Asynchronous job scheduling

OceanBase Database of earlier versions support various data import statements, such as INSERT OVERWRITE, INSERT SELECT, CREATE TABLE AS, and LOAD DATA. You can use these statements to write data to your database in real time. However, real-time data import requires the session to wait until the import is complete and cannot be interrupted during this process, which is inconvenient when a large amount of data is to be imported. To address this issue, OceanBase Database V4.3.3 provides the asynchronous job scheduling capability through the DBMS_SCHEDULER system package. You can use the SUBMIT JOB, SHOW JOB STATUS, and CANCEL JOB statements to respectively create an asynchronous import job, query the job status, and cancel a job.
Enhanced row estimation and statistics collection

Cost estimation by the optimizer relies on accurate row estimation for each operator. Accurate row estimation relies on an appropriate estimation strategy and accurate statistics. OceanBase Database of earlier versions have restructured the base table-based and selectivity-based row estimation methods to improve the cost estimation accuracy of the optimizer. OceanBase Database V4.3.3 makes the following improvements for some complex scenarios:
- Calculation of the joint selectivity in the case of multiple base table predicates: The cardinality_estimation_model system variable is provided for the predicate correlation assumption model to control the base table-based row estimation strategy to be used by the optimizer. The default value is PARTIAL, indicating that the predicates are partially correlated. In this case, the joint selectivity is calculated through exponential backoff.
- Adjustment of the method for calculating the number of distinct values (NDV): An environment cardinality is maintained during plan generation. The joint NDV of multiple expressions is calculated based on the environment cardinality, thereby affecting the predicate selectivity and the row estimation of the GROUP BY operator.
- Adaptive statistics correction upon expiration: The number of inserted, deleted, and updated rows is refreshed to the internal table that stores statistics every 15 minutes. By default, a table where the number of rows after data change exceeds 10 times the original number of rows is marked as a table with significantly expired statistics, and its statistics will be adaptively corrected. For example, an asynchronous statistics collection job is initiated every 15 minutes at the background for tables with significantly expired statistics, or dynamic sampling is used to ensure that appropriate execution plans are generated.
- Dynamic sampling supported for more complex predicate scenarios: For common complex predicates of base tables, dynamic sampling can be used to calculate a more accurate selectivity of complex predicates, even in scenarios where statistics are effective.
OceanBase Database V4.3.3 also optimizes the sampling strategy for histogram collection. Specifically, histogram collection is decoupled from basic statistics collection, the hist_est_percent parameter is provided to specify the sampling ratio of histogram collection, and the hist_block_sample parameter is provided to specify whether to use block sampling for histogram collection. To optimize the histogram collection performance, the new version selects a sampling strategy for histogram collection based on the table size. Row sampling is used for a small table, and block sampling is used for a large table.

Compatibility with MySQL

SHOW CREATE USER

OceanBase Database V4.3.3 supports the syntax of the SHOW CREATE USER statement in MySQL, which is used to display the information about database users. The administrator or a user with the SELECT privilege on the oceanbase or mysql database can view the information about all users in the tenant, whereas a normal user can view only its own information.
Value assignment to a system variable through a subquery

In earlier versions, OceanBase Database allowed the use of subqueries to assign values to user variables. With V4.3.3, it now supports using subqueries to assign values to system variables for MySQL compatibility.
Support for more character sets and collations

Apart from character sets supported in earlier versions, OceanBase Database V4.3.3 also supports the ASCII character set (with ascii_bin and ascii_general_ci collations) and TIS620 character set (with tis620_bin and tis620_thai_ci collations). Moreover, the new version also supports the utf8mb4_unicode_520_ci, utf8mb4_croatian_ci, utf8mb4_czech_ci, and utf8mb4_0900_ai_ci collations for the utf8mb4 character set, and supports the utf8mb3 character set as the alias of the utf8mb4 character set.
Expressions as default values

The new version supports using an expression that does not rely on other columns as the default value of a column, for example, DEFAULT(DATE_FORMAT(sysdate(), '%Y%m%d')), DEFAULT(CURRENT_DATE), and DEFAULT(UNIX_TIMESTAMP()).
CHECK TABLE statement

The new version partially supports the syntax of the CHECK TABLE statement. In the new version, you can use the statement to check whether a table or view exists in the database or whether objects referenced in a view are valid. Other features of the statement are not supported.

Compatibility with Oracle

PL syntax details

OceanBase Database V4.3.3 in Oracle mode allows you to use the => symbol to pass values to parameters in a custom type.
Remote call of stored procedures between Oracle tenants through a DBLink

In OceanBase Database of earlier versions, an Oracle tenant can call stored procedures from a native Oracle database through a DBLink. In OceanBase Database V4.3.3, one Oracle tenant can call stored procedures from another through a DBLink. At present, only basic types of parameters are supported.
Control over the maximum number of rows returned by an SQL statement through the JDBC setMaxRows API

The new version allows you to control the maximum number of rows returned by an SQL statement through the JDBC setMaxRows API at the protocol layer.
More accurate error stack information recorded by DBMS_UTILITY

The DBMS_UTILITY system package records error stack information in error_trace when an error occurs during PL execution. This information is very useful during PL debugging. Specifically, it can be used to effectively identify the cause of an error reported by the executor, significantly improving PL usability. In earlier versions, only the information about the first error stack is saved in error_trace. If an exception occurs during the error, the recorded error stack information will be inaccurate and cannot be used to effectively locate the error. In the new version, error_trace is redesigned and restructured to record accurate error stack information whenever an error occurs during PL execution.
Savepoints for DBLink transactions

For a regular transaction, you can create a savepoint for the transaction or roll back the transaction to a savepoint. Similar savepoint operations are also required for DBLink transactions. OceanBase Database V4.3.3 supports savepoints for DBLink transactions. Based on eXtended Architecture (XA) transactions, you can create a savepoint for a local or remote branch transaction, or roll back the transaction to a savepoint.

Performance improvements

Performance improvement for the RoaringBitmap data type

OceanBase Database V4.3.2 has supported the RoaringBitmap data type and related expressions to meet multidimensional analysis requirements in business scenarios such as user profiling, personalized recommendation, and precise marketing. However, the performance is poor in some scenarios. OceanBase Database V4.3.3 focuses on analyzing the performance issues of RoaringBitmap data computations and significantly improves the execution performance in cardinality, AND, OR, XOR, ANDNOT, and aggregate scenarios by optimizing memory application and expression execution logic to reduce unnecessary performance overhead.
Performance improvement for external table import

OceanBase Database V4.3.3 optimizes execution performance in the phase of reading data from external tables during direct load, improving import performance by about 15% compared to the previous version.
Extension of parallel DDL operations

In OceanBase Database of earlier versions, parallel DDL and serial DDL are mutually exclusive. The performance is poor when parallel DDL and serial DDL are executed alternatively. When OceanBase Migration Service (OMS) is used for schema migration, OceanBase Database V4.3 supports parallel execution of the CREATE TABLE statement. However, some other DDL operations that are executed alternatively during the migration are serial DDL operations, which reduces migration efficiency. OceanBase Database supports parallel execution of the following DDL operations since V4.3.3:
- Setting comments on tables or columns in Oracle mode.
- Executing the CREATE INDEX statement.
By default, parallel execution of the preceding two types of DDL operations is disabled for tenants upgraded from earlier versions to the new version, and is enabled for new tenants created in the new version. You can modify the _parallel_ddl_control parameter to change the default setting as needed.

Reliability improvements

Quick restore

In OceanBase Database of earlier versions, physical restore is a process of restoring the full data. A physical restore is completed only after all the data (minor compaction data and baseline data) and logs are restored. Then, you can log in to and use the restored tenant. If a large amount of data is to be restored to a tenant, the restore will take a long time and you need to reserve sufficient disk space for the tenant at the very beginning to ensure a successful restore. In some scenarios, a tenant is restored only for query and verification purposes and will be destroyed later. If only a few tablets are involved during the query, a full restore costs too high and is a waste of storage space, time, and network bandwidth. The new version provides a quick restore feature that allows you to provide read and write services by restoring only logs rather than data to your local server. In addition, the data backup feature allows you to build an intermediate-layer index for a backup SSTable based on the backup address. With this index, OBServer nodes can randomly read data from the backup SSTable like reading local data.
Automatic routing to the primary/standby tenants

OceanBase Database V2.x and V3.x support primary and standby clusters. A primary/standby cluster pair is uniquely identified by the cluster name (CLUSTER_NAME). If you use ODP to connect to an OceanBase cluster, you are automatically routed to the primary cluster based on the cluster name. OceanBase Database V4.x supports primary and standby tenants. The primary and standby tenants do not record the information of each other. The primary/standby relationship is maintained by using external tools such as OCP. In this case, you cannot be automatically routed to the primary tenant if you use ODP for database connection. To address this issue, the new version allows you to manage a pair of primary/standby tenants by service name (SERVICE_NAME). When you use ODP for database connection, you can specify the service name in the command for login, for example, obclient -h $ip -P $port -u$user_name@SERVICE:$service_name. This way, ODP can route your connection to the primary tenant based on the service name. Moreover, the sys tenant provides a set of service name management commands. You can create, enable, disable, or delete a service name.

The service name feature is supported in ODP V4.3.1 and later and OCP V4.3.1 and later.
Verification of primary/standby tenant switchover and failover

OceanBase Database supports primary/standby tenant switchover in scenarios without data loss and primary/standby tenant failover in scenarios with data loss. The switchover and failover can fail. Therefore, to reduce the risks of failures, OceanBase Database V4.3.3 supports the verification feature (SWITCHOVER/FAILOVER VERIFY). You can add the VERIFY keyword after the switchover or failover command to verify in advance whether the operation can be performed. If the operation cannot be performed, an error is returned. In this case, you can take the suggested actions.

Resource optimization

Memory usage optimization for SQL_AUDIT

The SQL_AUDIT view records information about the SQL statements executed on each server of the current tenant. A queue of pointers to 10 million records is constructed for a normal tenant, based on which memory is preallocated to tenant servers upon their startup. The preallocated memory is called static memory. However, dynamic memory is actually used by records. The new version optimizes the memory structure for SQL_AUDIT. Specifically, the size of initial static memory is reduced to adapt to small-scale tenants. You can also dynamically adjust the size of queue memory based on the actual number of records, allowing a tenant with high specifications to store more records.

Security enhancements

Compatibility with the REFERENCES, CREATE ROLE, DROP ROLE, and TRIGGER privileges of MySQL

OceanBase Database V4.3.3 is compatible with the REFERENCES, CREATE ROLE, DROP ROLE, and TRIGGER privileges of MySQL. You can grant the REFERENCES privilege at the global, database, table, or column level. This privilege is required to create a foreign key constraint during table creation or table modification. Additionally, you can configure the REFERENCES privilege to not take effect at the column level, which is compatible with MySQL. The CREATE ROLE and DROP ROLE privileges take effect globally. The CREATE USER or CREATE ROLE privilege is required to create a role. The CREATE USER or DROP ROLE privilege is required to drop a role. You can grant the TRIGGER privilege at the global, database, or table level. This privilege is required to create, drop, execute, or display triggers in a table.

By default, verification is disabled for the preceding privileges in upgrade scenarios to avoid affecting online business after the upgrade. If these privileges are required in the new version, you can use the ob_security_version variable to specify a security version, which cannot be rolled back to V4.3.2 once it is set to V4.3.3. By default, verification is enabled for the preceding privileges in a new cluster.
Audit in MySQL mode

Cloud consoles support database audit based on SQL statistics recorded in the SQL_AUDIT view. However, in the case of high business traffic, some statistics may not be collected if the SQL_AUDIT view is refreshed frequently. OceanBase Database V4.3.3 provides a new database audit feature in MySQL mode. You can use filters to specify the types of requests to be audited, such as login, logout, DML statements, and CMD commands. Corresponding records are persisted to the local disk or audit files of Alibaba Cloud Object Storage Service (OSS), and are rotated, deleted, and compressed based on the configured strategy. You can read audit records in cloud consoles, or directly read audit records in the database by using the external table feature of OceanBase Database.

Usability improvements

Query-level resource groups

OceanBase Database of earlier versions allow you to use the DBMS_RESOURCE_MANAGER system package to configure resource groups at the user level, background task level, or column parameter level, to implement CPU and IOPS resource isolation. OceanBase Database V4.3.3 supports query-level resource group binding. You can specify the /*+ resource_group('group_name')*/ hint in an SQL statement to force this statement to use resources in the specified resource group. If the specified resource group does not exist, resources in the default resource group are used. After the resource group is changed, you need to reconnect the session for the new resource group to take effect.
More parameters and variables supported for the OPT_PARAM hint

You can use the OPT_PARAM hint to specify optimizer-related parameters or system variables to be updated at the query level. Apart from those supported in earlier versions, OceanBase Database V4.3.3 also supports the optimizer_index_cost_adj parameter and the runtime_filter_type system variable.
LS replica management

OceanBase Database of versions earlier than V4.0 provide a series of O&M commands for partition replica management. For example, you can use related commands to add a replica to a partition, drop a replica from a partition, and convert the type of a replica for a partition. OceanBase Database V4.x replaces the concept of partition with log stream (LS). OceanBase Database V4.3.3 redesigns the partition O&M commands in earlier versions for LS replica-level O&M. It provides a series of syntaxes for adding LS replicas, dropping LS replicas, converting the type of LS replicas, migrating LS replicas, modifying the number of Paxos members of an LS replica, and canceling a disaster recovery task, to meet manual LS replica O&M requirements.
Enhanced PS diagnostic capabilities

When a prepared statement (PS) handle leak occurs in earlier versions, you can only query the GV$OB_PS_ITEM_INFO view for global information, and no session-level diagnostic methods are available. OceanBase Database V4.3.3 provides the [G]V$OB_SESSION_PS_INFO view to display the PS reference information of each session to help you accurately locate PS handle leaks.
Support for querying data from the [G]V$PLAN_CACHE_PLAN_EXPLAIN view only by plan_id

When you query data from the [G]V$PLAN_CACHE_PLAN_EXPLAIN view in earlier versions, you must specify ip, port, tenant_id, and plan_id as the filter conditions. If you specify only plan_id, the query result set is empty, which means poor usability of the view. The new version supports scanning the underlying virtual table of this view and allows you to specify only plan_id to accurately query data from this view.
trace_id parsing

OceanBase Database uses trace_id to mark the full process of an SQL request. trace_id can be associated with monitoring metrics or query logs and contains the IP address and port number of the OBServer node that initiates the SQL request. However, no method is available for directly parsing trace_id. The new version provides the decode_trace_id function for parsing trace_id to obtain the IP address and port number.
Display of events of primary and standby tenants

In OceanBase Database of earlier versions, events of primary and standby tenants such as switchover and failover are recorded in RootService events, which are cleared over time. Moreover, it is difficult to find tenant-level events among cluster-level events. OceanBase Database V4.3.3 splits the events of primary and standby tenants at the tenant level and displays the events in the CDB/DBA_OB_TENANT_EVENT_HISTORY view.

Compatibility changes

Product behavioral changes

Change	Description
The `SHOW CREATE TABLE/VIEW` statement in MySQL mode and the `DBMS_METADATA.GET_DDL` function in Oracle mode can display more attributes of materialized views.	In earlier versions, the `SHOW CREATE TABLE/VIEW` statement and `DBMS_METADATA.GET_DDL` function can display only part of the attributes in materialized view definition. The attributes that specify the data refresh type, refresh subtask details, whether to enable query rewriting, whether to use columnar storage, and whether the materialized view is a real-time one, cannot be displayed. In the new version, these attributes of materialized views are displayed by default.
An error is returned when a value is assigned to a const variable in a trigger in Oracle mode.	In earlier versions, no error is returned when a value is assigned to a const variable in a trigger, which is not as expected. In the new version, an error is returned in such cases.
An error is returned when the number of parameters exceeds 65,535 in the PS protocol.	The MySQL protocol supports a maximum of 65,535 parameters. When the number of parameters exceeds 65,535, an exception is thrown. In earlier versions of OceanBase Database, 65,535 parameters, instead of an error, are returned in this case. The new version is adapted for compatibility with MySQL, and an error is returned when the number of parameters exceeds 65,535.
You can query data from the `[G]V$PLAN_CACHE_PLAN_EXPLAIN` view by specifying only `plan_id`.	When you query data from the `[G]V$PLAN_CACHE_PLAN_EXPLAIN` view in earlier versions, you must specify `ip`, `port`, `tenant_id`, and `plan_id` as the filter conditions. If you specify only `plan_id`, the query result set is empty, which means poor usability of the view. The new version supports scanning the underlying virtual table of this view and allows you to specify only `plan_id` to accurately query data from this view.
The meaning of the `tenant_id` column in the `[G]V$OB_SQL_AUDIT` view is modified.	Some internal requests generated in a user tenant are initialized in the sys tenant. When such an internal request needs to read data, it reads data from the user tenant. In this case, the value of `tenant_id` is `1`, and that of `effective_tenant_id` is the ID of the current tenant. For sequential collection requirements based on OceanBase Autonomy Service (OAS) within a tenant, the new version adjusts the `tenant_id` column, which can be used as an index, to be equivalent with the `effective_tenant_id` column.

View changes

View	Change type	Description
CDB/DBA_OB_RESTORE_PROGRESS	Modified	The `RESTORE_TYPE` column is added to indicate whether an ongoing or completed physical restore is a normal restore or a quick restore.
CDB/DBA_OB_RESTORE_HISTORY	Modified	The `RESTORE_TYPE` column is added to indicate whether an ongoing or completed physical restore is a normal restore or a quick restore.
DBA_OB_TENANTS	Modified	The `RESTORE_DATA_MODE` column is added to indicate whether a tenant is normally created (or restored through full data restore) or quickly restored.
CDB/DBA_OB_RESTORE_PROGRESS	Modified	The columns `RECOVER_SCN`, `RECOVER_SCN_DISPLAY`, `RECOVER_PROGRESS`, `TABLET_COUNT`, `FINISH_TABLET_COUNT`, and `RESTORE_PROGRESS` are added to show the physical restore progress of a tenant.
CDB/DBA_OB_LS_REPLICA_TASKS	Modified	The columns `DATA_SOURCE_SVR_IP`, `DATA_SOURCE_SVR_PORT`, and `IS_MANUAL` are added to record the data source referenced during the execution of a disaster recovery task and the source that initiates the disaster recovery task.
CDB/DBA_OB_LS_REPLICA_TASK_HISTORY	New	Displays the execution history of disaster recovery tasks. You can query the `CDB` view only in the sys tenant, and the `DBA` view in all tenants.
CDB/DBA_OB_SERVICES	New	Displays information about all service names of all tenants or the current tenant. You can query the `CDB` view only in the sys tenant.
CDB/DBA_OB_TEMP_FILES	New	Displays information about temporary files. You can query the `CDB` view only in the sys tenant.
CDB/DBA_OB_LS_LOCATIONS	Modified	The `COLUMNSTORE` enumerated value, which indicates the columnstore replica type, is added for the `REPLICA_TYPE` column.
[G]V$OB_SESSION_PS_INFO	New	Displays the information about PS opened in all sessions of a tenant. The `V$` view displays the PS information on the current OBServer node, and the `GV$` view displays the PS information on all OBServer nodes. You can query these views in all tenants.
[G]V$OB_SQL_AUDIT	Modified	The `STMT_TYPE` column is added to indicate the SQL type.
[G]V$OB_PROCESSLIST	Modified	The `SERVICE_NAME` column is added to indicate the service name that creates the session. If this column is empty, the session is not created by a service name.
[G]V$OB_TABLET_COMPACTION_HISTORY	Modified	The columns `IS_FULL_MERGE`, `IO_COST_TIME_PERCENTAGE`, `MERGE_REASON`, `BASE_MAJOR_STATUS`, and `CO_MERGE_TYPE` are added to describe information about major compactions.
information_schema.tables	Modified	The `DATA_LENGTH` column is modified to indicate the size of storage space occupied by tables. The `INDEX_LENGTH` column is modified to indicate the size of storage space occupied by non-primary key indexes. The values of the columns are calculated based on macroblocks, in bytes.
information_schema.partitions	Modified	The `DATA_LENGTH` column is modified to indicate the size of storage space occupied by partitions. The `INDEX_LENGTH` column is modified to indicate the size of storage space occupied by partition indexes. The values of the columns are calculated based on macroblocks, in bytes.
information_schema.role_column_grants	New	Displays the table-level privileges of activated roles granted to the user in the current session, and the table-level privileges of other roles granted to these roles. This view is available only in MySQL mode.
information_schema.role_routine_grants	New	Displays the routine privileges of activated roles granted to the user in the current session, and the routine privileges of other roles granted to these roles. This view is available only in MySQL mode.
information_schema.role_table_grants	New	Displays the column privileges of activated roles granted to the user in the current session, and the column privileges of other roles granted to these roles. This view is available only in MySQL mode.

Parameter changes

Parameter	Change type	Description
ob_vector_memory_limit_percentage	New	The maximum percentage of memory resources available for the vector index module. The parameter value must be smaller than `100 - 15 - _memstore_limit_percentage`. It is a tenant-level parameter. The default value is `0`, which specifies to disable vector indexes.
ob_storage_s3_url_encode_type	New	Specifies whether the URL encoding method used when a request is sent over the S3 protocol complies with the RFC 3986 standard. It is a cluster-level parameter. The default value is `default`, which specifies to use the default behavior of the S3 SDK and not to encode some special characters such as the at sign (`@`). If you set the value to `compliantRfc3986Encoding`, the URL encoding method used when a request is sent over the S3 protocol complies with the RFC 3986 standard.
major_compaction_memory_percentage_limit	New	The maximum percentage of memory in the tenant available for major compactions. It is a tenant-level parameter. The default value is `40`. If allowed by the system load and resources, you can set this parameter to a larger value to increase the concurrency of major compactions of data stored in columnar storage mode.
ob_encoding_granularity	New	The maximum number of rows encoded per microblock. It is a tenant-level parameter. The default value is `65536`. In an AP business scenario with a few data updates, if the skip index feature cannot be used to accelerate filtering due to a large number of microblock rows, or if the major compaction performance is poor because microblocks cannot be reused, you can decrease the value of this parameter to improve performance.
px_task_size	Modified	The minimum amount of data scanned per subtask when a scan task is split for parallel execution. In earlier versions, a scan task can be split only by macroblock and therefore the minimum value of `px_task_size` is `2M`, which is also the size of a macroblock. In the new version, a scan task can be split by microblock and therefore the minimum value of `px_task_size` is adjusted to `1K`.

System variable changes

System variable	Change type	Description
cardinality_estimation_model	New	The correlation model used by the optimizer for row estimation. It can take effect both at the global level and session level. Valid values are `INDEPENDENT` which assumes that predicates are totally independent of each other, `PARTIAL` which assumes that predicates are partially correlated, and `FULL` which assumes that predicates are fully correlated. The default value is `PARTIAL`.
ob_table_access_policy	New	The storage format preferentially selected by the optimizer when it determines the base table access strategy in the execution plan. Valid values are `ROW_STORE`, `COLUMN_STORE`, and `AUTO`. The value `ROW_STORE` means that the rowstore format is preferentially selected. It can take effect both at the global level and session level. The default value is `AUTO`, which means that the optimizer automatically selects the format of data to be accessed.
ob_hnsw_ef_search	New	The number of neighbor nodes considered in an HNSW index-based search. It takes effect at the session level. The default value is `64`. A larger value indicates higher accuracy and longer search time.
ob_route_policy	Modified	The `COLUMN_STORE_ONLY` enumerated value is added to enable the optimizer to select only a columnstore replica to generate a columnar storage plan.

System package changes

System package	Change type	Description
DBMS_VECTOR	New	Provides the `REFRESH_INDEX` and `REBUILD_INDEX` subprograms for refreshing and rebuilding vector indexes respectively.

Function changes

Function	Change type	Description
decode_trace_id	New	Parses `trace_id` to obtain the IP address and port number of the OBServer node that initiates the SQL request.
array_contains	New	Verifies whether a specific element is included in an array.
tokenize	New	Queries the parsing results generated by the corresponding parser for a text.

Syntax changes

Syntax	Change description
The `method` option is added for the `restore_option` parameter in the physical restore statement.	The `method` option is added for the `restore_option` parameter in the `ALTER SYSTEM RESTORE dest_tenant_name FROM 'uri' UNTIL [TIME\\|SCN]=['time'\\|scn] WITH 'restore_option' [WITH KEY FROM 'backup_key_path'] [ENCRYPTED BY 'password']` statement. Here, `method = FULL` indicates full restore and is the default value. `method = QUICK` indicates quick restore.
LS replica management statements and commands are added.	The following statements and commands are added: Statement for adding an LS replica Statement for removing an LS replica Statement for modifying the type of an LS replica Statement for migrating an LS replica Statement for modifying the number of Paxos members of an LS replica Statement for canceling an LS replica task ob_admin O&M commands
The partition and column definition options are added to the `INSERT OVERWRITE` statement.	-
The syntax for creating an `ARRAY` column is added.	-
A `SERVICE_NAME` management statement is added.	The following statement is added: `ALTER SYSTEM {CREATE \\| DELETE \\| START \\| STOP} SERVICE $service_name [tenant = '$tenant_name'];`
A partition export option is added to the `SELECT INTO OUTFILE` statement.	-
Asynchronous job statements are added.	The following statements are added: Statement for creating an asynchronous job Statement for viewing the status of an asynchronous job Statement for canceling an asynchronous job
Syntaxes related to vector indexes are added.	-
The syntax for specifying zones where columnstore replicas are located in the `LOCALITY` option is added.	-

Recommended versions of tools

The following table lists the recommended versions of tools for OceanBase Database V4.3.3.

Tool	Version	Remarks
ODP	ODP V4.3.2	-
OCP	OCP V4.3.2	The columnstore replica feature not yet supported and is expected to be available in OCP V4.3.3.
ODC	ODC V4.3.2	-
OBCDC	OBCDC V4.3.3	-
OMS	Public Cloud Iteration 202405	-
OCCI	OCCI V1.0.4	-
OBCI	OBCI V2.0.9 BP1	-
ECOB	ECOB V1.1.9	-
OBClient	OBClient V2.2.7	-
LibOBClient	LibOBClient V2.2.6	-
OBJDBC	OBJDBC V2.4.12	-
OBODBC	OBODBC V2.0.9.3	-
obloader	obloader V4.3.1.1	The `ARRAY` and `VECTOR` types are not supported in obloader V4.3.1.1, and will be supported in obloader V4.3.3.

Upgrade notes

OceanBase Database V4.3.0 Beta and its later minor versions, V4.3.1 Beta and its later minor versions, as well as V4.3.2 Beta Hotfix1 and its later minor versions can be directly upgraded to V4.3.3.
OceanBase Database V4.2.x or earlier cannot be upgraded to V4.3.3. As the version evolves, corresponding upgrade paths will be supported.
OceanBase Database V4.3.2 reconstructs the persistent format of multi-source data, and therefore format conversion needs to be performed for multi-source data during the upgrade. You must reserve sufficient upgrade time if a large number of partitions are involved.
During an upgrade to V4.3.3, make sure that no incremental direct load task is in progress. You can query the oceanbase.__all_virtual_load_data_stat view for confirmation. We plan to resolve this issue in later versions.

Considerations

OceanBase Database V4.3.x introduces some Advanced Vector Extensions 2 (AVX2) instructions. If the CPU of a server does not support these instructions, the server may crash, generating a core dump file.
If x86 servers that support AVX2 instructions are co-deployed with Arm servers or x86 servers that do not support AVX2 instructions, accuracy issues may occur during backup and restore.
The OceanBase Change Data Capture (CDC) process may crash, generating a core dump file, when the CDC and table-level restore features are used together.