OceanBase logo

OceanBase

A unified distributed database ready for your transactional, analytical, and AI workloads.

Product Overview
DEPLOY YOUR WAY

OceanBase Cloud

The best way to deploy and scale OceanBase

OceanBase Enterprise

Run and manage OceanBase on your infra

TRY OPEN SOURCE

OceanBase Community Edition

The free, open-source distributed database

OceanBase seekdb

Open source AI native search database

Customer Stories

Real-world success stories from enterprises across diverse industries.

View All
BY USE CASES

Mission-Critical Transactions

Global & Multicloud Application

Elastic Scaling for Peak Traffic

Real-time Analytics

Active Geo-redundancy

Database Consolidation

Resources

Comprehensive knowledge hub for OceanBase.

Blog

Live Demos

Training & Certification

Documentation

Official technical guides, tutorials, API references, and manuals for all OceanBase products.

View All
PRODUCTS

OceanBase Cloud

OceanBase Database

Tools

Connectors and Middleware

QUICK START

OceanBase Cloud

OceanBase Database

BEST PRACTICES

Practical guides for utilizing OceanBase more effectively and conveniently

Company

Learn more about OceanBase – our company, partnerships, and trust and security initiatives.

About OceanBase

Partner

Trust Center

Contact Us

International - English
中国站 - 简体中文
日本 - 日本語
Sign In
Start on Cloud

OceanBase

A unified distributed database ready for your transactional, analytical, and AI workloads.

Product Overview
DEPLOY YOUR WAY

OceanBase Cloud

The best way to deploy and scale OceanBase

OceanBase Enterprise

Run and manage OceanBase on your infra

TRY OPEN SOURCE

OceanBase Community Edition

The free, open-source distributed database

OceanBase seekdb

Open source AI native search database

Customer Stories

Real-world success stories from enterprises across diverse industries.

View All
BY USE CASES

Mission-Critical Transactions

Global & Multicloud Application

Elastic Scaling for Peak Traffic

Real-time Analytics

Active Geo-redundancy

Database Consolidation

Comprehensive knowledge hub for OceanBase.

Blog

Live Demos

Training & Certification

Documentation

Official technical guides, tutorials, API references, and manuals for all OceanBase products.

View All
PRODUCTS
OceanBase CloudOceanBase Database
ToolsConnectors and Middleware
QUICK START
OceanBase CloudOceanBase Database
BEST PRACTICES

Practical guides for utilizing OceanBase more effectively and conveniently

Learn more about OceanBase – our company, partnerships, and trust and security initiatives.

About OceanBase

Partner

Trust Center

Contact Us

Start on Cloud
编组
All Products
    • Databases
    • iconOceanBase Database
    • iconOceanBase Cloud
    • iconOceanBase Tugraph
    • iconInteractive Tutorials
    • iconOceanBase Best Practices
    • Tools
    • iconOceanBase Cloud Platform
    • iconOceanBase Migration Service
    • iconOceanBase Developer Center
    • iconOceanBase Migration Assessment
    • iconOceanBase Admin Tool
    • iconOceanBase Loader and Dumper
    • iconOceanBase Deployer
    • iconKubernetes operator for OceanBase
    • iconOceanBase Diagnostic Tool
    • iconOceanBase Binlog Service
    • Connectors and Middleware
    • iconOceanBase Database Proxy
    • iconEmbedded SQL in C for OceanBase
    • iconOceanBase Call Interface
    • iconOceanBase Connector/C
    • iconOceanBase Connector/J
    • iconOceanBase Connector/ODBC
    • iconOceanBase Connector/NET
icon

OceanBase Database

SQL - V3.2.4Enterprise Edition

    Download PDF

    OceanBase logo

    The Unified Distributed Database for the AI Era.

    Follow Us
    Products
    OceanBase CloudOceanBase EnterpriseOceanBase Community EditionOceanBase seekdb
    Resources
    DocsBlogLive DemosTraining & CertificationTicket
    Company
    About OceanBaseTrust CenterLegalPartnerContact Us
    Follow Us

    © OceanBase 2026. All rights reserved

    Cloud Service AgreementPrivacy PolicySecurity
    Contact Us
    Document Feedback
    1. Documentation Center
    2. OceanBase Database
    3. SQL
    4. V3.2.4
    iconOceanBase Database
    SQL - V 3.2.4Enterprise Edition
    SQL
    KV
    • V 4.6.0
    • V 4.4.2
    • V 4.3.5
    • V 4.3.3
    • V 4.3.1
    • V 4.3.0
    • V 4.2.5
    • V 4.2.2
    • V 4.2.1
    • V 4.2.0
    • V 4.1.0
    • V 4.0.0
    • V 3.1.4 and earlier

    GROUP BY

    Last Updated:2023-10-24 09:23:03  Updated
    share
    What is on this page
    SCALAR GROUP BY
    HASH GROUP BY
    MERGE GROUP BY

    folded

    share

    GROUP BY operators group and aggregate data in SQL statements.

    Algorithms for data grouping include HASH and MERGE. GROUP BY operators also fall into the following types: HASH GROUP BY and MERGE GROUP BY. For a newly generated execution plan, the SQL optimizer selects from these two GROUP BY operators based on cost evaluation.

    General aggregate functions (SUM, MAX, MIN, AVG, COUNT, and STDDEV) are also executed by assigning the GROUP BY operator. However, if an SQL statement contains only aggregate functions but no GROUP BY, the SCALAR GROUP BY operator is assigned. In that sense, GROUP BY operators can be divided into three types: SCALAR GROUP BY, HASH GROUP BY, and MERGE GROUP BY.

    SCALAR GROUP BY

    Example 1: an execution plan containing a SCALAR GROUP BY operator

    obclient> CREATE TABLE t1(c1 INT, c2 INT);
    Query OK, 0 rows affected
    
    obclient> INSERT INTO t1 VALUES(1, 1);
    Query OK, 1 rows affected
    
    obclient> INSERT INTO t1 VALUES(2, 2);
    Query OK, 1 rows affected
    
    obclient> INSERT INTO t1 VALUES(3, 3);
    Query OK, 1 rows affected
    
    Q1:
    obclient> EXPLAIN SELECT SUM(c1) FROM t1\G
    *************************** 1. row **************
    Query Plan:
    | ========================================
    |ID|OPERATOR       |NAME|EST. ROWS|COST|
    ----------------------------------------
    |0 |SCALAR GROUP BY|    |1        |37  |
    |1 | TABLE SCAN    |T1  |3        |37  |
    ========================================
    
    Outputs & filters:
    -------------------------------------
      0 - output([T_FUN_SUM(T1.C1)]), filter(nil),
          group(nil), agg_func([T_FUN_SUM(T1.C1)])
      1 - output([T1.C1]), filter(nil),
          access([T1.C1]), partitions(p0)
    

    In the preceding example, the outputs & filters section in the execution plan display of query Q1 shows in detail the output information of the SCALAR GROUP BY operator.

    Parameter
    Description
    output The output expression of the operator.
    filter The filter conditions of the operator. In this example, filter is set to nil because no filter condition is configured for the SCALAR GROUP BY operator.
    group The columns to be grouped. For example, query Q1 contains the SCALAR GROUP BY operator. Therefore, the value of group is nil.
    agg_func The aggregate functions involved. For example, query Q1 is to sum up the data in column c1 of table t1. Therefore, the value is T_FUN_SUM(t1.c1).

    HASH GROUP BY

    Example 2: an execution plan containing a HASH GROUP BY operator

    Q2:
    obclient> EXPLAIN SELECT SUM(c2) FROM t1 GROUP BY c1 HAVING SUM(c2) > 2\G
    *************************** 1. row ***************************
    Query Plan:
    | ======================================
    |ID|OPERATOR     |NAME|EST. ROWS|COST|
    --------------------------------------
    |0 |HASH GROUP BY|    |1        |40  |
    |1 | TABLE SCAN  |T1  |3        |37  |
    ======================================
    
    Outputs & filters:
    -------------------------------------
      0 - output([T_FUN_SUM(T1.C2)]), filter([T_FUN_SUM(T1.C2) > 2]),
          group([T1.C1]), agg_func([T_FUN_SUM(T1.C2)])
      1 - output([T1.C1], [T1.C2]), filter(nil),
          access([T1.C1], [T1.C2]), partitions(p0)
    

    In the preceding example, the outputs & filters section in the execution plan display of query Q2 shows in detail the output information of the HASH GROUP BY operator.

    Parameter
    Description
    output The output expression of the operator.
    filter The filter conditions of the operator. Because the setting requires that the sum of column c2 after grouping is greater than 2, the filter is T_FUN_SUM(t1.c2) > 2.
    group The columns to be grouped. For example, the result of query Q2 is grouped by column c1, so the value is c1.
    agg_func The aggregate functions involved. For example, query Q2 is to sum up the data in column c2 of Table t1, therefore the value is T_FUN_SUM(t1.c2).

    Description

    The HASH GROUP BY operator ensures that the data is grouped by the HASH algorithm in the execution.

    MERGE GROUP BY

    Example 3: an execution plan containing a MERGE GROUP BY operator.

    Q3:
    obclient> EXPLAIN SELECT /*+NO_USE_HASH_AGGREGATION*/SUM(c2) FROM
           t1 GROUP BY c1 HAVING SUM(c2) > 2\G
    *************************** 1. row ***************************
    Query Plan:
    | =======================================
    |ID|OPERATOR      |NAME|EST. ROWS|COST|
    ---------------------------------------
    |0 |MERGE GROUP BY|    |1        |45  |
    |1 | SORT         |    |3        |44  |
    |2 |  TABLE SCAN  |T1  |3        |37  |
    =======================================
    
    Outputs & filters:
    -------------------------------------
      0 - output([T_FUN_SUM(T1.C2)]), filter([T_FUN_SUM(T1.C2) > 2]),
          group([T1.C1]), agg_func([T_FUN_SUM(T1.C2)])
      1 - output([T1.C1], [T1.C2]), filter(nil), sort_keys([T1.C1, ASC])
      2 - output([T1.C1], [T1.C2]), filter(nil),
          access([T1.C1], [T1.C2]), partitions(p0)
    

    In the preceding example, the outputs & filters section in the execution plan display of query Q3 shows in detail the information of the MERGE GROUP BY operator. It shows that the MERGE GROUP BY operator was selected for the execution plan that was generated by the same SQL statement, and the basic information of the operator was the same as that of other GROUP BY operators, except for the algorithm selected for grouping in the execution. Additionally, the No. 2 operator TABLE SCAN in the example returns an unsorted result, while the GROUP BY algorithm uses the MERGE GROUP BY operator, so a SORT operator must be assigned.

    Notice

    NO_USE_HASH_AGGREGATION and USE_HASH_AGGREGATION hints can control the type of algorithm selected by the GROUP BY operator for grouping.

    Previous topic

    COUNT
    Last

    Next topic

    WINDOW FUNCTION
    Next
    What is on this page
    SCALAR GROUP BY
    HASH GROUP BY
    MERGE GROUP BY