Traversing a multi-level file structure with bash on Linux

Latest recommended article published 2024-10-11 15:07:30

hangguns


Tags: linux, bash, operations

Article link: https://blog.youkuaiyun.com/qiongyaoxinpo/article/details/129546136


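This article shows how to use a Bash script to traverse and process a corpus stored in a multi-level file structure, contrasts this with the convenience of Python's glob module, and gives an example Bash script that processes every file in the text directory and every file in its subdirectories.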


Introduction

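The problem came up while preprocessing a corpus. Corpora such as Wikipedia dumps usually have a multi-level file structure, for example text/GS/wiki_01. Once a program that handles a single file is written, it has to be run over every file. If you would rather not write that driver in Python, you can write it in bash.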

Method

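#!/bin/bash

# make unmatched globs expand to nothing, so empty directories are skipped
shopt -s nullglob

# loop through all entries (files and subdirectories) in the "text" directory
for file in text/*; do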

    # check if the current file is a directory
    if [ -d "$file" ]; then

        # if it is a directory, loop through all files in it
        for subdirfile in "$file"/*; do
            # do something with the file
            echo "Processing file: $subdirfile"
        done

    else

        # if it is a file, do something with it
        echo "Processing file: $file"

    fi

done
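
The script above handles exactly two levels (text/* and text/*/*). If the corpus is nested more deeply, one possible variant is to let find walk the tree instead. The following is only a sketch under assumptions: process_one is a hypothetical placeholder for the single-file program mentioned in the introduction, and walk_tree is a name made up for illustration.

```shell
#!/bin/bash

# process_one is a hypothetical stand-in for the real single-file program
process_one() {
    echo "Processing file: $1"
}

# walk_tree visits every regular file under the given directory, at any depth
walk_tree() {
    local root="$1" path
    # -print0 together with read -d '' keeps names containing spaces intact
    while IFS= read -r -d '' path; do
        process_one "$path"
    done < <(find "$root" -type f -print0)
}

# run on the "text" directory only if it exists in the current directory
if [ -d text ]; then
    walk_tree text
fi
```

Replacing the echo in process_one with the real command is enough to apply it to every file. Note that find does not guarantee any particular visiting order, so pipe the list through sort if order matters.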