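Discourse-Based Recognition of Abstract Text Classes and the Use of Metalanguage in Literary Works
Challenges and Solutions in Text Classification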
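In text classification, a simple sentence-by-sentence comparison is often not enough to learn certain semantic features of a text accurately. The same information can be spread across several sentences in many different ways, and the discourse structure of a text can vary as well. Both sources of variation have to be taken into account.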
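
As a rough illustration of this limitation, the sketch below compares two of the business-owner texts quoted later in this section. It pairs sentences by position and averages their word overlap, then compares that score with the overlap of the two texts taken as a whole. The bag-of-words Jaccard measure, the naive sentence splitter, and all function names are assumptions chosen for illustration; they are not the method described in this article.

```python
import re

# Two business-owner texts taken from the examples in this section.
TEXT_A = ("I rent an office space. This office is for my business. "
          "I can deduct office rental expense from my business profit "
          "to calculate net income.")
TEXT_B = ("To run my business, I have to rent an office. "
          "The net business profit is calculated as follows. "
          "Rental expense needs to be subtracted from revenue.")


def split_sentences(text):
    """Naive splitter (assumption): break on sentence-final punctuation."""
    return [s for s in re.split(r"[.!?]+\s*", text) if s]


def tokens(text):
    """Lowercased set of alphabetic word tokens (bag-of-words assumption)."""
    return set(re.findall(r"[a-z]+", text.lower()))


def jaccard(a, b):
    """Jaccard overlap of two token sets; 0.0 when both sets are empty."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def aligned_sentence_similarity(text_a, text_b):
    """Mean Jaccard overlap of sentences paired by their position."""
    pairs = list(zip(split_sentences(text_a), split_sentences(text_b)))
    if not pairs:
        return 0.0
    return sum(jaccard(tokens(x), tokens(y)) for x, y in pairs) / len(pairs)


def whole_text_similarity(text_a, text_b):
    """Jaccard overlap of the two texts, ignoring sentence boundaries."""
    return jaccard(tokens(text_a), tokens(text_b))


if __name__ == "__main__":
    print("sentence-aligned:", round(aligned_sentence_similarity(TEXT_A, TEXT_B), 3))
    print("whole text:      ", round(whole_text_similarity(TEXT_A, TEXT_B), 3))
```

For these two texts the position-aligned score is lower than the whole-text score, even though both texts belong to the same class. The shared facts (renting an office, subtracting the rental expense) sit in different sentences in each text, which is the kind of variation a sentence-by-sentence comparison misses.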
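Consider a text classification problem in which short texts fall into two classes:
1. The tax liability of a landlord who rents out an office to a business;
2. The tax liability of a business owner who rents an office from a landlord.
The following are example texts: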
- Business owner's perspective:
  - “I rent an office space. This office is for my business. I can deduct office rental expense from my business profit to calculate net income.”
  - “To run my business, I have to rent an office. The net business profit is calculated as follows. Rental expense needs to be subtracted from revenue.”
  - “To store goods for my retail business I rent some space. When I calculate the net income, I take revenue and subtract business expenses such as office rent.”
- Landlord's perspective:
  - “I rent out a first floor unit of m