C++统计单词个数及排序：容器按照value排序

最新推荐文章于 2025-03-23 10:04:35 发布

原创最新推荐文章于 2025-03-23 10:04:35 发布 · 3.8k 阅读

7 ·

CC 4.0 BY-SA版权

文章标签：

#c语言 #算法

C/C++ 专栏收录该内容

7 篇文章

订阅专栏

本文介绍了一种用于统计英文文本中单词出现频率的算法，并通过C++实现该算法。算法首先读取文件中的所有单词，然后使用哈希表记录每个单词出现的次数，最后根据单词出现次数及字母顺序进行排序。

部署运行你感兴趣的模型镜像

例：

给定一段英语文本，要求对其中单词出现的个数按照从小到大进行排序，出现次数相同的按照首字母顺序排列。

算法实现：

#include <iostream>
#include <algorithm>
#include <vector>
#include <unordered_map>
#include <map>
#include <fstream>

using namespace std;
typedef pair<string, int> PAIR;

int cmp(const PAIR &x, const PAIR &y) {
    return x.second < y.second;
}

int main() {
    string file;
    cout << "Enter the text path:";
    cin >> file;
    ifstream inout;
    inout.open(file, ios::in);

    if (inout.fail()) {
        cout << "Your file don't exist!" << endl;
        return 0;
    }
    vector<string> key;
    map<string, int> map;
    string skey;
    int flag = 0;
    inout >> skey;
    key.push_back(skey);
    map[skey] = 1;
    while (!inout.eof()) {
        inout >> skey;
        for (int i = 0; i < key.size(); ++i) {
            if (key[i] == skey) {
                flag = 1;
            }
        }
        if (flag == 1) {
            map[skey]++;
        } else {
            key.push_back(skey);
            map[skey] = 1;
        }
    }
    inout.close();
    vector<PAIR> pair_vec; //用pair来实现按照pair的第二个元素大小也就是value排序
    for (auto it = map.begin(); it != map.end(); it++) {
        pair_vec.push_back(make_pair(it->first, it->second));
    }
    stable_sort(pair_vec.begin(), pair_vec.end(), cmp);
    for (auto curr = pair_vec.begin(); curr != pair_vec.end(); ++curr) {
        cout << curr->first << " " << curr->second << endl;
    }
    return 0;
}

测试用例：

Everybody knows the famous and handsome football player David Beckham, who has a happy family. For a long time, this man’s wife was known to people as David’s beautiful wife, while the fact is that Victoria works so hard for her fashion circle and she makes it. Now everyone treats her as the fashion designer.