std::sort coredump 说起

最新推荐文章于 2022-03-24 19:23:37 发布

orientlu

最新推荐文章于 2022-03-24 19:23:37 发布

阅读量797

点赞数

CC 4.0 BY-SA版权

分类专栏：编程后台

本文链接：https://blog.youkuaiyun.com/qq_18150497/article/details/82079327

编程同时被 2 个专栏收录

39 篇文章

订阅专栏

后台

7 篇文章

订阅专栏

本文探讨了C++ std::sort函数可能导致程序coredump的原因，特别是在自定义比较函数时。当比较函数对相等元素返回true，违反Strict_weak_orderings原则，可能导致coredump。当元素数量大于等于16时，std::sort使用快速排序，这种情况下问题尤为明显。此外，文章还提到一种隐藏更深的坑，即错误的比较函数可能导致数据错误但程序不崩溃，提醒开发者要特别注意。

摘要生成于 C知道，由 DeepSeek-R1 满血版支持，前往体验 >

@(c++)

core 的原因

c++ 标准库 sort() 默认采用 < 这个 operator 来排序的, 另个一个重载函数增加第三个参数，指定一个比较的函数，函数接受两个参数。
对于基础类型（int，float..），直接调用 sort(start,end) 即可，对于非基础类型的结构体，可以通过重载对象的 < 运算符或者提供一个比较函数。
详见

本文从一个 core 说起相信很多人在编写使用 sort 都在这个地方翻车，

#include<iostream>
#include<vector>
#include<algorithm>
using namespace std;

class CompareGreater {
    public:
        bool operator()(const int* left, const int* right)
        {
            return *left >= *right;
        }
};

int main(int argc, char *argv[])
{
    vector<int *> verPInt;
    for (int i = 0; i < 18; i++) { // will coredump
        int *pInt = new int;
        if (pInt == NULL) {
            cout << "new faile" << endl;
            goto final;
        }
        *pInt = 1;
        verPInt.push_back(pInt);
    }
    sort(verPInt.begin(), verPInt.end(), CompareGreater());
    for (vector<int *>::iterator iter = verPInt.begin(); iter != verPInt.end(); ++iter) {
        cout << **iter;
    }
    cout << endl;
final:
    for (vector<int *>::iterator iter = verPInt.begin(); iter != verPInt.end(); ++iter) {
        delete *iter;
        *iter = NULL;
    }
    return 0;
}

gdb 查看堆栈，发现是 core 在比较函数里面。

Program terminated with signal SIGSEGV, Segmentation fault.
#0  0x000055e88121e00c in CompareGreater::operator()(int const*, int const*) ()

上述代码，所有元素值相同，当个数大于等于 16个 的时候就会 coredump，查看说明，core 的原因是：

std::sort()在排序时，比较函数对相等的元素应该返回 false！如果返回 true，会导致程序coredump

因为定义 cmp(x,x) 返回 true 不符合 Strict_weak_orderings，标准库算法里面很多都要求满足 Strict_weak_orderings，使用时需要格外注意，避免翻车。
上述例子代码只需修改比较函数中，将 >= 改为 >即可修复。

为什么是元素个数大于等于 16个 呢，从 STL 源码可以发现，由于 std::sort() 的排序分2种，当元素个数 >16 (_S_threshold = 16) 时选择快速排序，<=16 个则选择插入排序 (对象少时快排性能不理想)。按照快排原理，每次都是遍历所有值和一个中间值比较，小的放左边，大的放右边。从STL源代码可看出，std::sort() 在遍历比较时，是没有加边界保护的。如果比较相等的元素返回真，则在极端情况下 (如所有元素值相等时) __first 会出现访问越界，导致coredump。
STL 源码 : /usr/include/c++/7/bits/stl_algo.h（具体目录）