如何利用python将xmind转为Excel？

theskylife

已于 2022-11-11 09:07:59 修改

阅读量5.4k

点赞数 8

CC 4.0 BY-SA版权

分类专栏： python学习之旅文章标签： python xmind 开发语言 json

于 2022-11-10 12:04:22 首次发布

本文链接：https://blog.youkuaiyun.com/qq_41780234/article/details/126153444

python学习之旅专栏收录该内容

59 篇文章

订阅专栏

本文详细介绍了如何使用xmindparser库解析xmind文件，并通过递归函数将解析后的数据转换成DataFrame，最后保存为Excel文件。整个过程包括环境准备、源数据构造及流程解析、数据处理和转换，适合于需要从xmind文件获取结构化数据的场景。

摘要生成于 C知道，由 DeepSeek-R1 满血版支持，前往体验 >

1.环境准备

需要先安装xmindparser模块，安装方法较为简单，直接运行pip命令即可。

pip install xmindparser

2.源数据构造及流程解析

2.1 准备xmind文件

构建好xmind文件后，将其命名为test.xmind，其图像如下：
2.1

2.2 解析xmind文件

利用xmindparser对2.1中的xmind文件进行解析，解析后效果如下：
2.2
解析代码如下：

from xmindparser import xmind_to_dict
x_flie = r'Test.xmind'
json_data= xmind_to_dict(x_flie)

2.3 对解析后的数据进行处理

由于解析后的数据不符合写出的要求，故需要对这部分数据进行处理。由于本人喜欢用pandas进行数据处理，故在此构建dataframe进行输出。

2.3.1 处理1——提取数据

对2.2中解析后的数据进行处理，按照层次进行组合，得到效果如下的数据：
2.3.1
实现代码如下：

# 可以设想为一个树结构，利用递归函数，获取由根至各叶子节点的路径。
def xm_parse(dic, pre_data=[]):
    """输入一个由xmindparser，转换而来的字典形式的数据，将之转换成列表"""
    title_list = []
    topic_list = []
    try:
        topics = dic.get("topics")
        title = dic.get("title")
        # 将前缀追加
        title_list.append(title)
        title_list = pre_data + title_list
        # 如果到达末尾，就返回
        if topics is None and title:
            yield title, title_list
#             print(title,title_list)
            return
        # 如果是列表，就暂存起来（若每个对象为标准的列表，即 topics= topic_list,则可以跳过该步骤）
        elif isinstance(topics, list) and title:
            for topic in topics:
                topic_list.append(topic)
    except AttributeError as e:
        print("异常结束")
        return
    # 若列表不为空，则需要递归寻找
    if topic_list:
        for topic in topic_list:
            yield from xm_parse(topic,title_list)

2.3.2 处理2——填充数据

经2.3.1的过程处理后的数据，并不能直接作为参数直接转换为DataFrame，需要将数据处理成以下形式：
2.3.2
实现代码如下：

temp=[]
max_cols=0
#提取数据，并找出最大深度(列数)
for i,j in xm_parse(json_data[0]['topic']):
    temp.append(j)
    max_cols= max_cols if max_cols > len(j) else len(j)
#对缺失数据采用补全
for i in range(len(temp)):
    temp[i] = temp[i] + (max_cols - len(temp[i])) * [None]

2.3.3 处理3——转换成DataFrame

转换后，效果如下：
2.3.3

实现代码如下：

result=pd.DataFrame.from_records(temp,columns=["标题-{}".format(i+1) for i in range(max_cols)])

3.完整代码

from xmindparser import xmind_to_dict
import pandas as pd

# 可以设想为一个树结构，利用递归函数，获取由根至各叶子节点的路径。
def xm_parse(dic, pre_data=[]):
    """输入一个由xmindparser，转换而来的字典形式的数据，将之转换成列表"""
    title_list = []
    topic_list = []
    try:
        topics = dic.get("topics")
        title = dic.get("title")
        # 将前缀追加
        title_list.append(title)
        title_list = pre_data + title_list
        # 如果到达末尾，就返回
        if topics is None and title:
            yield title, title_list
#             print(title,title_list)
            return
        # 如果是列表，就暂存起来（若每个对象为标准的列表，即 topics= topic_list,则可以跳过该步骤）
        elif isinstance(topics, list) and title:
            for topic in topics:
                topic_list.append(topic)
    except AttributeError as e:
        print("异常结束")
        return
    if topic_list:
        for topic in topic_list:
            yield from xm_parse(topic,title_list)


def main():
    x_flie=r"Test.xmind"
    out_file=r"xmind转换后.xlsx"
    temp=[]
    max_cols=0
    json_data= xmind_to_dict(x_flie)
    #提取数据，并找出最大深度(列数)
    for i,j in xm_parse(json_data[0]['topic']):
        temp.append(j)
        max_cols= max_cols if max_cols > len(j) else len(j)
    #对缺失数据采用补全
    for i in range(len(temp)):
        temp[i] = temp[i] + (max_cols - len(temp[i])) * [None]
    result=pd.DataFrame.from_records(temp,columns=["标题-{}".format(i+1) for i in range(max_cols)])
    result.to_excel(out_file,index=False,encoding='utf-8-sig')
    
if __name__ == '__main__':
    main()