This topic describes how to use OceanBase Migration Service (OMS) Community Edition to migrate data from a TiDB database to OceanBase Database Community Edition.
Background information
OMS Community Edition allows you to create a task to migrate the existing business data and incremental data from a TiDB database to OceanBase Database Community Edition through schema migration, full migration, and incremental synchronization.
TiDB is an integrated distributed database that supports hybrid transactional and analytical processing (HTAP).
If you use TiDB 4.x or later and Bind Kafka Data Source is selected when you create a TiDB data source, you must deploy a TiCDC cluster and a Kafka cluster to synchronize incremental data from the TiDB database to OceanBase Database Community Edition.
TiCDC is an incremental data synchronization tool for TiDB and provides high availability by using a placement driver (PD) cluster, which is the scheduling module of the TiDB cluster and usually consists of three PD nodes. TiKV Server is a TiKV node in the TiDB cluster. TiKV Server sends data changes in change logs to the TiCDC cluster. TiCDC runs multiple TiCDC processes to obtain and process data from TiKV nodes, and then synchronizes the data to the Kafka cluster. The Kafka cluster saves the incremental logs of the TiDB database that are converted by TiCDC. During incremental data synchronization, OMS Community Edition obtains the corresponding data from the Kafka cluster and migrates the data to OceanBase Database Community Edition in real time.
If you use TiDB 4.x or later, you do not need to select Bind Kafka Data Source when you create a TiDB data source. In this case, incremental data can be directly synchronized from the TiDB database by using OMS Community Edition.
Prerequisites
You have created dedicated database users for data migration in the source TiDB database and OceanBase Database Community Edition that serves as the target, and granted required privileges to the users. For more information, see Create a database user.
Limitations
TiDB 2.x, 3.x, 4.x, 5.x, 6.x, and 7.x are supported.
Notice
TiDB 2.x and 3.x support only TiDB Binlog.
If the target is a database, OMS Community Edition does not support triggers in the target database. If triggers exist in the target database, the data migration may fail.
When you use OMS Community Edition to migrate data from a TiDB database to OceanBase Database Community Edition, DDL synchronization is not supported.
OMS Community Edition does not support the migration of tables without a primary key and data with spaces from a TiDB database to OceanBase Database Community Edition.
The data source identifiers and user accounts must be globally unique in OMS Community Edition.
OMS Community Edition supports the migration of only objects whose database name, table name, and column name are ASCII-encoded without special characters. The special characters are line breaks and | " ' ` ( ) = ; / &.
OMS Community Edition supports only TiCDC Open Protocol. If you use an unsupported protocol, a JDBC connector exception occurs, and an error that indicates a null pointer is returned.
When you use OMS Community Edition to synchronize data from a TiCDC instance to a Kafka instance, you must set the `enable-old-value` parameter to `true`. Otherwise, the format of data synchronization messages may be invalid. For more information, see Task configuration file.
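As a rough illustration of the naming restriction above, the following Python sketch screens database, table, and column names before a task is created. The function names `is_migratable_name` and `find_unsupported_names` are hypothetical and are not part of OMS Community Edition. Treating a carriage return as a line break is also an assumption.

```python
# Characters listed in the limitation above as unsupported in object names.
# Assumption: "line breaks" covers both "\n" and "\r".
FORBIDDEN_CHARS = set('\n\r|"\'`()=;/&')


def is_migratable_name(name: str) -> bool:
    """Return True if the name is pure ASCII and has none of the listed characters."""
    return name.isascii() and not (set(name) & FORBIDDEN_CHARS)


def find_unsupported_names(names):
    """Return the names that violate the restriction, in input order."""
    return [n for n in names if not is_migratable_name(n)]
```

For example, `find_unsupported_names(["orders", "b&c"])` reports only `b&c`. A check like this can be run against the object list exported from the source before the objects are selected in the console.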
Considerations
To ensure the performance of a data migration task, we recommend that you migrate no more than 1,000 tables at a time.
If the source database contains foreign keys with the same name, an error occurs during schema migration. In this case, you can rename the foreign keys to resume the task.
If the UTF-8 character set is used in the source, we recommend that you use a compatible character set, such as UTF-8 or UTF-16, in the target to avoid garbled characters.
In reverse incremental synchronization scenarios from a TiDB database to OceanBase Database Community Edition of a version earlier than V3.2.x, if a source table is a multi-partition table with a global unique index and the values of the partitioning key of the table are updated, data can be lost during the migration.
Do not write data to the Kafka topic that TiCDC uses for synchronization. Otherwise, a JDBC connector exception may occur and a null pointer error may be returned.
Check whether the migration precision of OMS Community Edition for columns of data types such as DECIMAL, FLOAT, and DOUBLE is as expected. If the precision of the target column type is lower than that of the source column type, the value with a higher precision may be truncated. This may result in data inconsistency between the source and target columns.
If you change the unique index in the target database, you must restart the Incr-Sync component. Otherwise, the data may be inconsistent.
If the clocks between nodes or between the client and the server are out of synchronization, the latency may be inaccurate during incremental synchronization or reverse incremental synchronization.
For example, if the clock is earlier than the standard time, the latency can be negative. If the clock is later than the standard time, the latency can be positive.
Take note of the following considerations if you want to perform data merge migration:
We recommend that you configure the mappings between the source and target databases by importing objects or specifying matching rules.
We recommend that you manually create schemas in the target database. If you use OMS Community Edition to create schemas, skip failed objects in the schema migration step.
A difference between the source and target table schemas may result in data inconsistency. Some known scenarios are described as follows:
When you manually create a table schema, if the data type of a column in the source is not supported by OMS Community Edition, implicit data type conversion may occur, which causes inconsistent column types between the source and target databases.
If the length of a column in the target database is shorter than that in the source database, the data of this column may be automatically truncated, which causes data inconsistency between the source and target databases.
If you use TiDB 2.x or 3.x, take note of the following considerations. For more information about TiDB Binlog, see TiDB Binlog Cluster Overview.
TiDB Binlog does not send heartbeat packets. If the source is not updated, the timestamp of incremental synchronization is not updated.
Before incremental synchronization starts, the source must be updated. Otherwise, the `data not existed` error is reported.
For a table without a primary key, the information of a unique key cannot be obtained from incremental binlogs. In this case, the program uses all columns as the primary key. When an update operation in the source is synchronized to the target, the corresponding data record is first deleted and then inserted.
If you select only Incremental Synchronization when you create the data migration task, OMS Community Edition requires that the local incremental logs in the source database be retained for more than 48 hours.
If you select Full Migration and Incremental Synchronization when you create the data migration task, OMS Community Edition requires that the local incremental logs of the source database be retained for at least seven days. Otherwise, the data migration task may fail or the data in the source and target databases may be inconsistent because OMS Community Edition cannot obtain incremental logs.
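The two log retention requirements above can be expressed as a small check. This is a minimal Python sketch. The function names are hypothetical, and the code only restates the thresholds given in this topic (more than 48 hours for incremental synchronization only, at least seven days when full migration is also selected).

```python
from datetime import timedelta


def minimum_log_retention(full_migration: bool, incremental_sync: bool):
    """Return the retention threshold stated in this topic, or None if none applies."""
    if not incremental_sync:
        return None
    if full_migration:
        return timedelta(days=7)
    return timedelta(hours=48)


def retention_is_sufficient(configured: timedelta, full_migration: bool,
                            incremental_sync: bool) -> bool:
    """Check a configured retention period against the stated requirement."""
    required = minimum_log_retention(full_migration, incremental_sync)
    if required is None:
        return True
    if full_migration:
        return configured >= required  # "at least seven days"
    return configured > required       # "more than 48 hours"
```

For example, a source that keeps incremental logs for three days passes the check for an incremental-only task but fails it when Full Migration is also selected.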
Data type mappings
| TiDB database | OceanBase Database Community Edition |
|---|---|
| INTEGER | INTEGER |
| TINYINT | TINYINT |
| MEDIUMINT | MEDIUMINT |
| BIGINT | BIGINT |
| SMALLINT | SMALLINT |
| DECIMAL | DECIMAL |
| NUMERIC | NUMERIC |
| FLOAT | FLOAT |
| REAL | REAL |
| DOUBLE PRECISION | DOUBLE PRECISION |
| BIT | BIT |
| CHAR | CHAR |
| VARCHAR | VARCHAR |
| BINARY | BINARY |
| VARBINARY | VARBINARY |
| BLOB | BLOB |
| TEXT | TEXT |
| ENUM | ENUM |
| SET | SET |
| DATE | DATE |
| DATETIME | DATETIME |
| TIMESTAMP | TIMESTAMP |
| TIME | TIME |
| YEAR | YEAR |
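Because every type in the table maps to the type of the same name, a pre-migration check reduces to finding columns whose declared type is not listed. The following Python sketch shows one way to do this. The function name and the input shape (column name mapped to declared type) are assumptions for illustration. Aliases that the table does not list, such as `INT`, are reported as well, so review the result rather than treating it as a definitive list of unsupported types.

```python
# Type names copied from the mapping table above.
LISTED_TYPES = frozenset({
    "INTEGER", "TINYINT", "MEDIUMINT", "BIGINT", "SMALLINT", "DECIMAL",
    "NUMERIC", "FLOAT", "REAL", "DOUBLE PRECISION", "BIT", "CHAR",
    "VARCHAR", "BINARY", "VARBINARY", "BLOB", "TEXT", "ENUM", "SET",
    "DATE", "DATETIME", "TIMESTAMP", "TIME", "YEAR",
})


def types_not_in_mapping(columns):
    """Return {column: declared_type} for columns whose base type is not listed.

    `columns` maps a column name to its declared type, for example "VARCHAR(64)".
    """
    result = {}
    for name, declared in columns.items():
        # Drop length/precision arguments and normalize case and whitespace.
        base = " ".join(declared.split("(", 1)[0].split()).upper()
        if base not in LISTED_TYPES:
            result[name] = declared
    return result
```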
Procedure
Create a data migration task.
Log in to the console of OMS Community Edition.
In the left-side navigation pane, click Data Migration.
On the Data Migration page, click New Task in the upper-right corner.
On the Select Source and Target page, configure the parameters.
| Parameter | Description |
|---|---|
| Data Migration Task Name | We recommend that you set it to a combination of digits and letters. It must not contain any spaces and cannot exceed 64 characters in length. |
| Tag (Optional) | Click the field and select a tag from the drop-down list. You can also click Manage Tags to create, modify, and delete tags. For more information, see Use tags to manage data migration tasks. |
| Source | If you have created a TiDB data source, select it from the drop-down list. If not, click New Data Source in the drop-down list and create one in the dialog box that appears on the right. For more information, see Create a TiDB data source. Notice: If the source TiDB data source is not bound to a valid Kafka data source or topic, incremental synchronization is not supported. If the Kafka server requires consumer authentication, you must add the `properties={"group.id":"user-environment consumer"}` setting to the `source` parameter for incremental synchronization tasks. In OMS Community Edition, the default value of `user-environment consumer` is `oms_jdbc_connector_null`. |
| Target | If you have created an OceanBase-CE data source, select it from the drop-down list. If not, click New Data Source in the drop-down list and create one in the dialog box that appears on the right. For more information about the parameters, see Create an OceanBase-CE data source. |
Click Next. In the dialog box that appears, click OK.
For tables without a primary key and tables with a non-null unique key in the TiDB database, full migration and full verification are supported but incremental synchronization is not supported.
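The `properties` setting described for the Source parameter is a key followed by a compact JSON object. The following Python sketch builds that string for a given consumer group. The function name is hypothetical, and where the resulting string is entered in OMS Community Edition is not covered here.

```python
import json


def kafka_group_properties(group_id: str) -> str:
    """Build the properties setting for a Kafka consumer group ID."""
    if not group_id or not group_id.strip():
        raise ValueError("group_id must not be empty")
    # Compact separators reproduce the no-space form shown in this topic.
    return "properties=" + json.dumps({"group.id": group_id}, separators=(",", ":"))
```

Calling `kafka_group_properties("oms_jdbc_connector_null")` yields `properties={"group.id":"oms_jdbc_connector_null"}`, which uses the default value stated above. Generating the string with a JSON serializer avoids quoting mistakes when the group ID contains characters that need escaping.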
On the Select Migration Type page, configure the parameters.
Options for Migration Type are Schema Migration, Full Migration, Incremental Synchronization, Full Verification, and Reverse Incremental Migration.
| Migration type | Description |
|---|---|
| Schema migration | The definitions of data objects, such as tables, indexes, constraints, comments, and views, are migrated from the source database to the target database. Temporary tables are automatically filtered out. |
| Full migration | After a full migration task is started, OMS Community Edition migrates existing data of tables in the source database to corresponding tables in the target database. If you select Full Migration, we recommend that you use the `ANALYZE` statement to collect the statistics of the TiDB database before data migration. |
| Incremental synchronization | Changed data in the source database is synchronized to the corresponding tables in the target database after an incremental synchronization task starts. Data changes are data addition, modification, and deletion. Options for DML Synchronization in the Incremental Synchronization section include `Insert`, `Delete`, and `Update`. You can select the operations as needed. For more information, see DML filtering. If you create a TiDB data source without binding it to a Kafka data source, you cannot select Incremental Synchronization. |
| Full verification | After the full migration and incremental synchronization are completed, OMS Community Edition automatically initiates a full verification task to verify the data tables in the source and target databases. If you select Full Verification, we recommend that you collect the statistics of both the source and target databases before full verification. For more information about how to collect statistics of OceanBase Database Community Edition, see Manual statistics collection. If you select Incremental Synchronization but do not select all DML operations in the DML Synchronization section, you cannot select Full Verification. |
| Reverse incremental migration | When a reverse incremental migration task starts, OMS Community Edition migrates the data changed in the target database after the business switchover back to the source database in real time. You cannot select Reverse Incremental Migration in the following cases: data merge migration that involves multiple tables is enabled, or multiple source schemas map to the same target schema. |
(Optional) Click Next. If you have selected Reverse Incremental Migration but the ConfigUrl or the username or password of the Data Replication Center (DRC) user is not configured for the target OceanBase-CE data source, the More about Data Sources dialog box appears, prompting you to configure related parameters. For more information about the parameters, see Create an OceanBase-CE data source.
After you configure the parameters, click Test Connectivity. After the test succeeds, click Save.
Click Next. On the Select Migration Objects page, select the migration objects and migration scope.
You can select Specify Objects or Match Rules to specify the migration objects. The following procedure describes how to specify migration objects by using the Specify Objects option. For information about the procedure for specifying migration objects by using the Match Rules option, see Configure matching rules for migration objects.
Notice
The name of a table to be migrated, as well as the names of columns in the table, must not contain Chinese characters.
If a database or table name contains double dollar signs ("$$"), you cannot create the migration task.

In the Select Migration Objects section, select Specify Objects.
In the Specify Migration Scope section, select the objects to be migrated from the Source Object(s) list. You can select tables and views of one or more databases as the migration objects.
Click > to add the selected objects to the Target Object(s) list.
OMS Community Edition allows you to import objects from text, rename objects, and remove a single migration object or all migration objects.
| Operation | Description |
|---|---|
| Import objects | (1) In the list on the right of the Specify Migration Scope section, click Import Objects in the upper-right corner. (2) In the dialog box that appears, click OK. Notice: This operation will overwrite previous selections. Proceed with caution. (3) In the Import Objects dialog box, import the objects to be migrated. You can import CSV files to rename databases/tables and set row filtering conditions. For more information, see Download and import the settings of migration objects. (4) Click Validate. (5) After the validation succeeds, click OK. |
| Rename objects | OMS Community Edition allows you to rename migration objects. For more information, see Rename a database table. |
| Remove one or all objects | OMS Community Edition allows you to remove a single object or all objects to be migrated to the target database during data mapping. To remove a single migration object: in the list on the right of the Specify Migration Scope section, move the pointer over the target object and click Remove. To remove all migration objects: in the list on the right of the Specify Migration Scope section, click Remove All in the upper-right corner. In the dialog box that appears, click OK. |
Click Next. On the Migration Options page, configure the following parameters.
To view or modify parameters of the Full-Import or Incr-Sync component, click Configuration Details in the upper-right corner of the Full Migration or Incremental Synchronization section. For more information about the parameters, see Component parameters.
Full migration
The following parameters are displayed only if you have selected Full Migration on the Select Migration Type page.
| Parameter | Description |
|---|---|
| Concurrency Speed | Valid values: Stable, Normal, Fast, and Custom. The amount of resources to be consumed by a full migration task varies based on the migration performance. If you select Custom, you can set Read Concurrency, Write Concurrency, and JVM Memory as needed. |
| Handle Non-empty Tables in Target Database | Valid values: Ignore and Stop Migration. If you select Ignore, when the data to be inserted conflicts with the existing data of a target table, OMS Community Edition retains the existing data and records the conflict data. Notice: If you select Ignore, data is pulled in IN mode for full verification. In this case, the scenario where the target table contains more data than the source table cannot be verified, and the verification efficiency will be decreased. If you select Stop Migration and a target table contains data, an error is returned during full migration, indicating that the migration is not allowed. In this case, you must clear the data in the target table before you can continue with the migration. Notice: After an error is returned, if you click Resume in the dialog box, OMS Community Edition ignores this error and continues to migrate data. Proceed with caution. |
| Processing Strategy When Target Table Has Records | Valid values: Ignore and Stop Migration. If you select Ignore, when the data to be inserted conflicts with the existing data of a target table, OMS Community Edition retains the existing data and records the conflict data. Notice: If you select Ignore, data is pulled in IN mode for full verification. In this case, the scenario where the target contains more data than the source cannot be verified, and the verification efficiency will be decreased. If you select Stop Migration and a target table contains data, an error is returned during full migration, indicating that the migration is not allowed. In this case, you must clear the data in the target table before you can continue with the migration. Notice: After an error is returned, if you click Resume in the dialog box, OMS Community Edition ignores this error and continues to migrate data. Proceed with caution. |
| Writing Method | Valid values: SQL (specifies to write data to tables by using `INSERT` or `REPLACE`) and Direct Load (specifies to write data through direct load). For more information about the direct load method, see Direct load. |
| Post-Indexing | Specifies whether to create indexes after the full migration is completed. Post-indexing can shorten the time required for full migration. For more information about the considerations on post-indexing, see the description below. |
Notice
This feature is supported only if you have selected both Schema Migration and Full Migration on the Select Migration Type page.
If post-indexing is allowed, we recommend that you adjust the parameters based on the hardware conditions of the target database and the business traffic.
If you use OceanBase Database Community Edition V4.x, adjust the following parameters of the sys tenant and business tenants by using a command-line interface (CLI) client.
Adjust the parameters of the sys tenant

    -- parallel_servers_target specifies the queuing conditions for parallel queries on each server.
    -- To maximize performance, we recommend that you set this parameter to a value greater than the number of physical CPU cores, for example, 1.5 times that number.
    -- In addition, make sure that the value does not exceed 64, to prevent database kernels from contending for locks.
    set global parallel_servers_target = 64;

Adjust the parameters of a business tenant

    -- Specify the limit on the file memory buffer size.
    alter system set _temporary_file_io_area_size = '10' tenant = 'xxx';
    -- Disable throttling in V4.x.
    alter system set sys_bkgd_net_percentage = 100;

If you use OceanBase Database Community Edition V3.x, adjust the following parameters of the sys tenant by using a CLI client.

    -- parallel_servers_target specifies the queuing conditions for parallel queries on each server.
    -- To maximize performance, we recommend that you set this parameter to a value greater than the number of physical CPU cores, for example, 1.5 times that number.
    -- In addition, make sure that the value does not exceed 64, to prevent database kernels from contending for locks.
    set global parallel_servers_target = 64;
    -- data_copy_concurrency specifies the maximum number of concurrent data migration and replication tasks allowed in the system.
    alter system set data_copy_concurrency = 200;
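The sizing guidance for `parallel_servers_target` (about 1.5 times the number of physical CPU cores, capped at 64) can be worked through with a short calculation. The following Python sketch is illustrative only. The function names are hypothetical, and rounding up to the next integer is an assumption, because this topic does not state how fractional values are handled.

```python
import math

# Upper bound stated in this topic, intended to avoid lock contention.
MAX_PARALLEL_SERVERS_TARGET = 64


def suggested_parallel_servers_target(physical_cores: int, factor: float = 1.5) -> int:
    """Return factor x cores, rounded up and capped at the stated maximum."""
    if physical_cores < 1:
        raise ValueError("physical_cores must be at least 1")
    return min(MAX_PARALLEL_SERVERS_TARGET, math.ceil(physical_cores * factor))


def parallel_servers_target_statement(physical_cores: int) -> str:
    """Render the statement to run in the sys tenant for the suggested value."""
    value = suggested_parallel_servers_target(physical_cores)
    return f"set global parallel_servers_target = {value};"
```

On a 16-core server, the sketch suggests 24. On a 64-core server, 1.5 times the core count is 96, so the cap of 64 applies.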
Incremental synchronization
The following parameters are displayed only if you have selected Incremental Synchronization on the Select Migration Type page.
| Parameter | Description |
|---|---|
| Concurrency Speed | Valid values: Stable, Normal, Fast, and Custom. The amount of resources to be consumed by an incremental synchronization task varies based on the synchronization performance. If you select Custom, you can set Read Concurrency, Write Concurrency, and JVM Memory as needed. |
| Incremental Record Retention Time | The duration that incremental parsed files are cached in OMS Community Edition. This parameter is displayed only if you did not select Bind Kafka Data Source when you created the TiDB data source. A longer retention period results in more disk space occupied by the Store component of OMS Community Edition. |
| Incremental Synchronization Start Timestamp | If you have set the migration type to Full Migration, this parameter is not displayed. If you did not select Bind Kafka Data Source when you created the TiDB data source and have selected Incremental Synchronization but not Full Migration when you selected migration types, specify a point in time after which the data is to be synchronized. The default value is the current system time. For more information, see Set an incremental synchronization timestamp. |
| Kafka Consumer group.id (Optional) | The unique identifier of the consumer group in the Kafka cluster. This parameter is displayed only if you have selected Bind Kafka Data Source when you created the TiDB data source. |
Full verification
The following parameters are displayed only if you have selected Full Verification on the Select Migration Type page.
| Parameter | Description |
|---|---|
| Concurrency Speed | Valid values: Stable, Normal, Fast, and Custom. Through resource configuration for the Full-Verification component, you can limit the resource consumption of a task in the full verification phase. If you select Custom, you can set Read Concurrency, Write Concurrency, and JVM Memory as needed. |
Click Precheck to start a precheck on the data migration task.
During the precheck, OMS Community Edition checks the read and write privileges of the database users and the network connectivity of the databases. A data migration task can be started only after it passes all check items. If an error is returned during the precheck, you can perform the following operations:
Identify and troubleshoot the problem and then perform the precheck again.
Click Skip in the Actions column of the failed precheck item. In the dialog box that prompts the consequences of the operation, click OK.
Click Start Task. If you do not need to start the task now, click Save to go to the details page of the data migration task. You can start the task later as needed.
OMS Community Edition allows you to modify the migration objects when the data migration task is running. For more information, see View and modify migration objects. After the data migration task is started, it is executed based on the selected migration types. For more information, see the Migration Details section in the View details of a data migration task topic.