HDSA-Dialog 项目使用教程-优快云博客

本文链接：https://blog.youkuaiyun.com/gitblog_00632/article/details/142505198

HDSA-Dialog 项目使用教程

HDSA-Dialog Code and Data for ACL 2019 "Semantically Conditioned Dialog Response Generation via Hierarchical Disentangled Self-Attention" 项目地址: https://gitcode.com/gh_mirrors/hd/HDSA-Dialog

1. 项目介绍

HDSA-Dialog 是一个用于对话响应生成的开源项目，基于 ACL 2019 论文 "Semantically Conditioned Dialog Response Generation via Hierarchical Disentangled Self-Attention"。该项目通过分层解耦自注意力机制来生成语义条件化的对话响应。

项目的主要架构包括两个组件：

对话行为预测器（Fine-tuned BERT 模型）：用于预测下一步的对话行为。
响应生成器（分层解耦自注意力网络）：基于预测的对话行为生成响应。

2. 项目快速启动

环境准备

确保你的环境满足以下要求：

Python 3.5
PyTorch 1.0
Pytorch-pretrained-BERT

安装依赖

pip install torch==1.0
pip install pytorch-pretrained-bert

下载项目

git clone https://github.com/wenhuchen/HDSA-Dialog.git
cd HDSA-Dialog

数据准备

下载预训练模型和数据文件：

sh collect_data.sh

训练对话行为预测器

rm -r checkpoints/predictor/
CUDA_VISIBLE_DEVICES=0 python3.5 train_predictor.py --do_train --do_eval --train_batch_size 6 --eval_batch_size 6

训练响应生成器

CUDA_VISIBLE_DEVICES=0 python3.5 train_generator.py --option train --model BERT_dim128_w_domain_exp --batch_size 512 --max_seq_length 50 --field Delexicalized

测试响应生成器

CUDA_VISIBLE_DEVICES=0 python3.5 train_generator.py --option test --model BERT_dim128_w_domain_exp --batch_size 512 --max_seq_length 50 --field Non-Delexicalized