leetcode 统计词频

最新推荐文章于 2024-08-11 09:00:00 发布

原创最新推荐文章于 2024-08-11 09:00:00 发布 · 319 阅读

0 ·

CC 4.0 BY-SA版权

leetcode 专栏收录该内容

8 篇文章

订阅专栏

博客围绕编写bash脚本统计文本文件中单词频率展开。先给出题目描述，假设文本只含小写字母和空格，单词由小写字母组成且以空格分隔。接着阐述思路，将空格替换成行，去除多余行，排序后用uniq函数统计词频，最后按词频反向排序。

题目描述

写一个 bash 脚本以统计一个文本文件 words.txt 中每个单词出现的频率。

为了简单起见，你可以假设：

words.txt只包括小写字母和 ' ' 。
每个单词只由小写字母组成。
单词间由一个或多个空格字符分隔。

示例:

假设 words.txt 内容如下：

the day is sunny the the
the sunny is is

你的脚本应当输出（以词频降序排列）：

the 4
is 3
sunny 2
day 1

说明:

不要担心词频相同的单词的排序问题，每个单词出现的频率都是唯一的。
你可以使用一行 Unix pipes 实现吗？

思路

首先要把每个单词放成一行，具体把空格替换成行就可以了然后把行给去掉，排序之后把相同的单词放在一起，用uniq函数统计词频，然后把单词和词频输出之后再根据词频反向排序即可

具体代码如下

cat words.txt | sed 's/ /\n/g' | sed '/^$/d' | sort | uniq -c | awk '{print $2, $1}' | sort -nrk2

确定要放弃本次机会？

福利倒计时

: :

立减 ¥

普通VIP年卡可用

立即使用

netcaoniao

关注关注

0
点赞
踩
0

收藏

觉得还不错? 一键收藏
0
评论
分享

复制链接

分享到 QQ

分享到新浪微博

扫一扫
举报

举报

专栏目录

【中等】力扣算法题解析LeetCode192：统计词频

ylumwd的博客

07-30

350

Bash脚本统计单词频率：通过管道将文本分割为单词行，排序后使用uniq统计词频，再按频率降序排序并调整输出格式。核心命令：tr -s ' ' '\n' | sort | uniq -c | sort -nr | awk '{print $2,$1}'，高效处理流数据，输出单词及其出现次数。

【leetcode】优先级队列的两种妙用：词频统计与动态中位数（附代码模板）

最新发布

GGBond778的博客

05-06

1511

本期小编主要是针对力扣上两道关于堆的题目进行讲解：前K个高频单词，数据流的中位数；

参与评论您还未登录，请先登录后发表或查看评论

LeetCode191.统计词频

番茄都是西红柿

08-23

583

写一个 bash 脚本以统计一个文本文件 words.txt 中每个单词出现的频率。为了简单起见，你可以假设： words.txt只包括小写字母和 ’ ’ 。每个单词只由小写字母组成。单词间由一个或多个空格字符分隔。示例: 假设 words.txt 内容如下： the day is sunny the the the sunny is is 你的脚本应当输出（以词频降序...

leetcode 词频统计

不才的专栏

07-08

939

写一个 bash 脚本以统计一个文本文件 words.txt 中每个单词出现的频率。为了简单起见，你可以假设：words.txt只包括小写字母和 ' ' 。每个单词只由小写字母组成。单词间由一个或多个空格字符分隔。示例:假设 words.txt 内容如下：the day is sunny the the the sunny is is思路：1. tr 把空格全部转换成换行2. sed把空行全部过滤...

LeetCode刷题实战192：统计词频

程序IT圈

02-23

251

算法的重要性，我就不多说了吧，想去大厂，就必须要经过基础知识和业务逻辑面试+算法面试。所以，为了提高大家的算法能力，这个公众号后续每天带大家做一道算法题，题目就从LeetCode上面选！...

【LeetCode 中等题 bash】85-统计词频

weixin_41011942的博客

02-10

363

题目描述：写一个 bash 脚本以统计一个文本文件 words.txt 中每个单词出现的频率。为了简单起见，你可以假设： words.txt只包括小写字母和 ' ' 。每个单词只由小写字母组成。单词间由一个或多个空格字符分隔。示例: 假设 words.txt 内容如下： the day is sunny the the the sunny is is 你的脚本应当输出（...

leetcode 192. 统计词频

天使之翼

07-04

363

写一个 bash 脚本以统计一个文本文件 words.txt 中每个单词出现的频率。为了简单起见，你可以假设： words.txt只包括小写字母和 ' ' 。每个单词只由小写字母组成。单词间由一个或多个空格字符分隔。示例: 假设 words.txt 内容如下： the day is sunny the the the sunny is is 你的脚本应当输出（以词频降序排列）： the 4 is 3 sunny 2 day 1 说明: 不要担心词频相同的单词的排序问题，每个单词出现的频率都是唯

algoboy101#note_blog_leetcode#[192]统计词频1

07-25

示例:the day is sunny the the你的脚本应当输出（以词频降序排列）：说明:不要担心词频相同的单词的排序问题，每个单词出现的频率都是唯一的。

leetcode(24): 单词频率

fang 0 jun的博客

08-28

263

法一：hash表 class WordsFrequency { private: unordered_map<string, int> hash; public: WordsFrequency(vector<string>& book) { for(int i = 0; i < book.size(); i++){ hash[book[i]]++; } } int g.

LeetCode题练习与总结：统计词频--192

一直学习永不止步

08-11

1148

本文详细介绍了如何编写Bash脚本来统计文本文件中单词出现的频率，涵盖了脚本编写、时间复杂度、空间复杂度及关键知识点，为文本处理任务提供了实用的解决方案。

LeetCode192——统计词频

清风阁

01-24

792

我的LeetCode代码仓：https://github.com/617076674/LeetCode 原题链接：https://leetcode-cn.com/problems/word-frequency/description/ 题目描述：知识点：Linux常用指令思路一：cat+tr+sort+uniq+sort+awk cat命令：用于连接文件并打印到标准输出设备上。 ...

LeetCode shell(一)统计词频

xihuanyuye的博客

08-20

703

问题写一个 bash 脚本以统计一个文本文件 words.txt 中每个单词出现的频率。为了简单起见，你可以假设： words.txt只包括小写字母和 ’ ’ 。每个单词只由小写字母组成。单词间由一个或多个空格字符分隔。示例 words.txt the day is sunny the the the sunny is is 输出 the 4 is 3 sun...

LeetCode Shell 192. 统计词频

Alex

07-15

423

192. 统计词频 Ideas xargs分割字符串 -n 1表示每行输出一个 uniq统计词频需要被统计文本相同单词前后在一起，所以先排序 uniq -c表示同时输出单词出现次数 sort -nr表示把数字当做真正的数字处理 Code cat words.txt | xargs -n 1 | sort | uniq -c | sort -nr | awk '{print $2" "$1}' ...

Leetcode#192. 统计词频

CongliYin的博客

08-11

550

统计文件中单词出现的个数。思路： tr 把空格全部转换成换行 sed把空行全部过滤掉 sort排序 uniq统计词频 sort 降序 awk 格式输出 cat words.txt | tr " " "\n" | sed -e '/^$/d' | sort | uniq -c | sort -rn | awk '{print $2,$1}'...

leetcode192 词频统计

小妹的博客

04-24

318

awk '{for(i=1;i<=NF;++i){++m[$i]}}END{for(k in m){print k, m[k}}' words.txt | sort -nr -k 2 方法（1）：cat words.txt | tr -s ' ' '\n' | sort | uniq -c | sort -rn | awk '{print $2, $1}' 1、sort语法复...

力扣（LeetCode）192. 统计词频（2022.07.10）

ChaoYue_miku的博客

07-11

472

写一个 bash 脚本以统计一个文本文件 words.txt 中每个单词出现的频率。为了简单起见，你可以假设：words.txt只包括小写字母和 ’ ’ 。每个单词只由小写字母组成。单词间由一个或多个空格字符分隔。示例:假设 words.txt 内容如下：the day is sunny the the the sunny is is 你的脚本应当输出（以词频降序排列）：the 4 is 3 sunny 2 day 1说明:来源：力扣（LeetCode）链接：https://leetcode.cn/

词频统计LeetCode简单题

轩玉的博客

12-31

532

给定一个string数组article及其大小n及一个待统计单词word，请返回该单词在数组中出现的频数。文章的词数在1000以内。 class Frequency { public: int getFrequency(vector<string> article, int n, string word) { // write code here int count = 0; for(string arti : article){

[leetcode]Word Frequency

lydcsdn

09-10

1069

awk '{for(i=1;i<=NF;i++){a[$i]++;count++}} END{for(j in a){print j,a[j]}}' words.txt | sort -k 2 -nr

leetcode-819-Most Common Word（词频统计）

weixin_34224941的博客

05-19

105

题目描述： Given a paragraphand a list of banned words, return the most frequent word that is not in the list of banned words. It is guaranteed there is at least one word that isn't banned, and that the...