条件随机场(CRF)识别命名实体

本文档记录了一次使用 CRF++ 进行命名实体识别的实验过程,包括实验环境配置、使用的工具介绍、特征模板设定及结果评估方法。

摘要生成于 C知道 ,由 DeepSeek-R1 满血版支持, 前往体验 >

<p class="MsoListParagraph" style=""><a href="http://download.youkuaiyun.com/source/1507312">资实验相关资料下载</a></p>
<p class="MsoListParagraph" style="">CRF++使用见<a href="http://blog.youkuaiyun.com/Felomeng/archive/2009/06/22/4288492.aspx" target="_blank"><span style="color: #4477aa;">《CRF++的简单使用》</span></a></p>
<p class="MsoListParagraph" style="">一、实验环境</p>
<p class="MsoListParagraph" style=""><span style="" lang="EN-US"><span style=""><span style="font-size: small; font-family: Calibri;">a)</span><span style='font: 7pt "Times New Roman";'> </span></span></span><span style="font-size: small;"><span style="">软件:</span><span lang="EN-US"><span style="font-family: Calibri;">windows XP pro sp3</span></span><span style="">,</span><span lang="EN-US"><span style="font-family: Calibri;">visual studio 2008 & Dotnet2.0</span></span><span style="">,</span><span lang="EN-US"><span style="font-family: Calibri;"> CRF++</span></span><span style="">,</span><span lang="EN-US"><span style="font-family: Calibri;"> perl</span></span></span></p>
<p class="MsoListParagraph" style=""><span style="" lang="EN-US"><span style=""><span style="font-size: small; font-family: Calibri;">b)</span><span style='font: 7pt "Times New Roman";'> </span></span></span><span style="font-size: small;"><span style="">硬件:</span><span style="font-family: Calibri;"> <span lang="EN-US">CPU: cm420</span></span><span style="">,内存:</span><span lang="EN-US"><span style="font-family: Calibri;">2G ddr533</span></span><span style="">, </span><span lang="EN-US"><span style="font-family: Calibri;">160G 8M sata </span></span><span style="">富士通</span></span></p>
<p class="MsoListParagraph" style="">二、实验过程</p>
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt 42pt; text-indent: 0cm;"><span style="font-size: small;"><span style="">下面未经特别说明,都是按照作业要求将训练语料分成</span><span lang="EN-US"><span style="font-family: Calibri;">7:3</span></span><span style="">进行训练和评测所得的结果。</span></span></p>
<p class="MsoListParagraph" style=""><a name="_Ref233259982"><span style="" lang="EN-US"><span style=""><span style="font-size: small; font-family: Calibri;">a)</span><span style="font: 7pt 'Times New Roman';"> </span></span></span><span style="font-size: small;"><span style="">直接应用</span><span lang="EN-US"><span style="font-family: Calibri;">CRF</span></span></span></a></p>
<p class="MsoListParagraph" style=""><span style="" lang="EN-US"><span style=""><span style='font: 7pt "Times New Roman";'> </span><span style="font-size: small; font-family: Calibri;">i.</span><span style='font: 7pt "Times New Roman";'> </span></span></span><span style="font-size: small;"><span style="">所给定的语料格式非常符合条件随机场的要求,故直接使用条件随机场进行训练测试。(本次试验的文件在包</span><span lang="EN-US"><span style="font-family: Calibri;">test1.rar</span></span><span style="">中)</span></span></p>
<p class="MsoListParagraph" style=""><span style="" lang="EN-US"><span style=""><span style="font-size: small; font-family: Calibri;">1.</span><span style='font: 7pt "Times New Roman";'> </span></span></span><span style="font-size: small;"><span style="">转换文档编码为</span><span lang="EN-US"><span style="font-family: Calibri;">UTF8</span></span><span style="">(</span><span lang="EN-US"><span style="font-family: Calibri;">CRF++</span></span><span style="">在使用</span><span lang="EN-US"><span style="font-family: Calibri;">UTF16</span></span><span style="">时会报错)</span></span></p>
<p class="MsoListParagraph" style=""><span style="" lang="EN-US"><span style=""><span style="font-size: small; font-family: Calibri;">2.</span><span style='font: 7pt "Times New Roman";'> </span></span></span><span style=""><span style="font-size: small;">制定模板,如下:</span></span></p>
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt 84pt; text-indent: 1.05pt;"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;">#Unigram</span></span></p>
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt 83.9pt; text-indent: 1.05pt;"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;">U00:%x[-2,0]</span></span></p>
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt 83.9pt; text-indent: 1.05pt;"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;">U01:%x[-1,0]</span></span></p>
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt 83.9pt; text-indent: 1.05pt;"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;">U02:%x[0,0]</span></span></p>
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt 83.9pt; text-indent: 1.05pt;"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;">U03:%x[1,0]</span></span></p>
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt 83.9pt; text-indent: 1.05pt;"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;">U04:%x[2,0]</span></span></p>
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt 83.9pt; text-indent: 1.05pt;"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;">U10:%x[-1,0]/%x[0,0]</span></span></p>
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt 84pt; text-indent: 1.05pt;"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;">U11:%x[0,0]/%x[1,0]</span></span></p>
<p class="MsoListParagraph" style=""><span style="" lang="EN-US"><span style=""><span style="font-size: small; font-family: Calibri;">3.</span><span style='font: 7pt "Times New Roman";'> </span></span></span><span style="font-size: small;"><span style="">使用</span><span lang="EN-US"><span style="font-family: Calibri;">CRF++</span></span><span style="">学习特征(相关信息如下)</span></span></p>
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt 84pt; text-indent: 0cm;"><span style="" lang="EN-US"><span style=""><span style="font-size: small; font-family: Calibri;">a)</span><span style='font: 7pt "Times New Roman";'> </span></span></span><span style="font-size: small;"><span style="">命令:</span><span lang="EN-US"><span style="font-family: Calibri;">crf_learn template_file train_file model</span></span></span></p>
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt 84pt; text-indent: 0cm;"><span style="font-size: small;"><span style="">其中</span><span lang="EN-US"><span style="font-family: Calibri;">template_file</span></span><span style="">是模板文件,</span><span lang="EN-US"><span style="font-family: Calibri;">train_file</span></span><span style="">是训练语料,都需要事先准备好;</span><span lang="EN-US"><span style="font-family: Calibri;">model</span></span><span style="">是</span><span lang="EN-US"><span style="font-family: Calibri;">CRF++</span></span><span style="">根据模板和训练语料生成的文件,用于解码。</span></span></p>
<p class="MsoListParagraph" style=""><span style="" lang="EN-US"><span style=""><span style='font: 7pt "Times New Roman";'> </span><span style="font-size: small; font-family: Calibri;">i.</span><span style='font: 7pt "Times New Roman";'> </span></span></span><span style="font-size: small;"><span lang="EN-US"><span style="font-family: Calibri;">template_file</span></span><span style="">文件</span></span></p>
<p class="MsoListParagraph" style=""><span style="" lang="EN-US"><span style=""><span style="font-size: small; font-family: Calibri;">1.</span><span style='font: 7pt "Times New Roman";'> </span></span></span><span style="font-size: small;"><span style="">模板的基本格式为</span><span lang="EN-US"><span style="font-family: Calibri;">%x[row,col]</span></span><span style="">,它用于确定输入数据中的一个</span><span lang="EN-US"><span style="font-family: Calibri;">token</span></span></span></p>
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt 126pt; text-indent: 1.55pt;"><span style="font-size: small;"><span style="">其中,</span><span lang="EN-US"><span style="font-family: Calibri;">row</span></span><span style="">确定与当前的</span><span lang="EN-US"><span style="font-family: Calibri;">token</span></span><span style="">的相对行数。</span><span lang="EN-US"><span style="font-family: Calibri;">col</span></span><span style="">用于确定绝对列数。(如下图:)</span></span></p>
<table class="MsoTableGrid" style="margin: auto auto auto 126pt; border-collapse: collapse;" border="1" cellspacing="0" cellpadding="0"><tbody>
<tr style="">
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 53.1pt; padding-right: 5.4pt; padding-top: 0cm; border: black 1pt solid;" width="71" valign="top">
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt; text-indent: 0cm; text-align: left;" align="left"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;"></span></span></p>
</td>
<td style="border-bottom: black 1pt solid; border-left: #ece9d8; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 59pt; padding-right: 5.4pt; border-top: black 1pt solid; border-right: black 1pt solid; padding-top: 0cm;" width="79" valign="top">
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt; text-indent: 0cm; text-align: left;" align="left"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;">col 0</span></span></p>
</td>
<td style="border-bottom: black 1pt solid; border-left: #ece9d8; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 62.8pt; padding-right: 5.4pt; border-top: black 1pt solid; border-right: black 1pt solid; padding-top: 0cm;" width="84" valign="top">
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt; text-indent: 0cm; text-align: left;" align="left"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;">col 1</span></span></p>
</td>
<td style="border-bottom: black 1pt solid; border-left: #ece9d8; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 66.25pt; padding-right: 5.4pt; border-top: black 1pt solid; border-right: black 1pt solid; padding-top: 0cm;" width="88" valign="top">
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt; text-indent: 0cm; text-align: left;" align="left"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;">col 2</span></span></p>
</td>
<td style="border-bottom: black 1pt solid; border-left: #ece9d8; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 58.95pt; padding-right: 5.4pt; border-top: black 1pt solid; border-right: black 1pt solid; padding-top: 0cm;" width="79" valign="top">
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt; text-indent: 0cm; text-align: left;" align="left"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;"></span></span></p>
</td>
</tr>
<tr style="">
<td style="border-bottom: black 1pt solid; border-left: black 1pt solid; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 53.1pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="71" valign="top">
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt; text-indent: 1.55pt; text-align: left;" align="left"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;">row -2</span></span></p>
</td>
<td style="border-bottom: black 1pt solid; border-left: #ece9d8; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 59pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="79" valign="top">
<p class="MsoListParagraph" style="" align="left"><span style=""><span style="font-size: small;">疆</span></span></p>
</td>
<td style="border-bottom: black 1pt solid; border-left: #ece9d8; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 62.8pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="84" valign="top">
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt; text-indent: 0cm; text-align: left;" align="left"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;">Ens</span></span></p>
</td>
<td style="border-bottom: black 1pt solid; border-left: #ece9d8; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 66.25pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="88" valign="top">
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt; text-indent: 0cm; text-align: left;" align="left"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;">I-LOC</span></span></p>
</td>
<td style="border-bottom: black 1pt solid; border-left: #ece9d8; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 58.95pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="79" valign="top">
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt; text-indent: 1.35pt; text-align: left;" align="left"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;"></span></span></p>
</td>
</tr>
<tr style="">
<td style="border-bottom: black 1pt solid; border-left: black 1pt solid; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 53.1pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="71" valign="top">
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt; text-indent: 1.55pt; text-align: left;" align="left"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;">row -1</span></span></p>
</td>
<td style="border-bottom: black 1pt solid; border-left: #ece9d8; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 59pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="79" valign="top">
<p class="MsoListParagraph" style="" align="left"><span style=""><span style="font-size: small;">总</span></span></p>
</td>
<td style="border-bottom: black 1pt solid; border-left: #ece9d8; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 62.8pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="84" valign="top">
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt; text-indent: 0cm; text-align: left;" align="left"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;">Bn</span></span></p>
</td>
<td style="border-bottom: black 1pt solid; border-left: #ece9d8; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 66.25pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="88" valign="top">
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt; text-indent: 0cm; text-align: left;" align="left"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;">N</span></span></p>
</td>
<td style="border-bottom: black 1pt solid; border-left: #ece9d8; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 58.95pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="79" valign="top">
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt; text-indent: 1.35pt; text-align: left;" align="left"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;"></span></span></p>
</td>
</tr>
<tr style="">
<td style="border-bottom: black 1pt solid; border-left: black 1pt solid; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 53.1pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="71" valign="top">
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt; text-indent: 1.55pt; text-align: left;" align="left"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;">row 0</span></span></p>
</td>
<td style="border-bottom: black 1pt solid; border-left: #ece9d8; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 59pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="79" valign="top">
<p class="MsoListParagraph" style="" align="left"><span style=""><span style="font-size: small;">统</span></span></p>
</td>
<td style="border-bottom: black 1pt solid; border-left: #ece9d8; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 62.8pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="84" valign="top">
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt; text-indent: 0cm; text-align: left;" align="left"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;">En</span></span></p>
</td>
<td style="border-bottom: black 1pt solid; border-left: #ece9d8; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 66.25pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="88" valign="top">
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt; text-indent: 0cm; text-align: left;" align="left"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;">N</span></span></p>
</td>
<td style="border-bottom: black 1pt solid; border-left: #ece9d8; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 58.95pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="79" valign="top">
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt; text-indent: 1.35pt; text-align: left;" align="left"><span style=""><span style="font-size: small;">当前行</span></span></p>
</td>
</tr>
<tr style="">
<td style="border-bottom: black 1pt solid; border-left: black 1pt solid; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 53.1pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="71" valign="top">
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt; text-indent: 1.55pt; text-align: left;" align="left"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;">row 1</span></span></p>
</td>
<td style="border-bottom: black 1pt solid; border-left: #ece9d8; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 59pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="79" valign="top">
<p class="MsoListParagraph" style="" align="left"><span style=""><span style="font-size: small;">阿</span></span></p>
</td>
<td style="border-bottom: black 1pt solid; border-left: #ece9d8; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 62.8pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="84" valign="top">
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt; text-indent: 0cm; text-align: left;" align="left"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;">Bns</span></span></p>
</td>
<td style="border-bottom: black 1pt solid; border-left: #ece9d8; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 66.25pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="88" valign="top">
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt; text-indent: 0cm; text-align: left;" align="left"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;">B-PER</span></span></p>
</td>
<td style="border-bottom: black 1pt solid; border-left: #ece9d8; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 58.95pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="79" valign="top">
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt; text-indent: 1.35pt; text-align: left;" align="left"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;"></span></span></p>
</td>
</tr>
<tr style="">
<td style="border-bottom: black 1pt solid; border-left: black 1pt solid; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 53.1pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="71" valign="top">
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt; text-indent: 1.55pt; text-align: left;" align="left"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;">row 2</span></span></p>
</td>
<td style="border-bottom: black 1pt solid; border-left: #ece9d8; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 59pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="79" valign="top">
<p class="MsoListParagraph" style="" align="left"><span style=""><span style="font-size: small;">利</span></span></p>
</td>
<td style="border-bottom: black 1pt solid; border-left: #ece9d8; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 62.8pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="84" valign="top">
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt; text-indent: 0cm; text-align: left;" align="left"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;">Mns</span></span></p>
</td>
<td style="border-bottom: black 1pt solid; border-left: #ece9d8; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 66.25pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="88" valign="top">
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt; text-indent: 0cm; text-align: left;" align="left"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;">I-PER</span></span></p>
</td>
<td style="border-bottom: black 1pt solid; border-left: #ece9d8; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 58.95pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="79" valign="top">
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt; text-indent: 1.35pt; text-align: left;" align="left"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;"></span></span></p>
</td>
</tr>
</tbody></table>
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt 126pt; text-indent: 1.55pt;"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;"></span></span></p>
<table class="MsoTableGrid" style="margin: auto auto auto 126pt; border-collapse: collapse;" border="1" cellspacing="0" cellpadding="0"><tbody>
<tr style="">
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 150.05pt; padding-right: 5.4pt; padding-top: 0cm; border: black 1pt solid;" width="200" valign="top">
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt; text-indent: 0cm;"><span style=""><span style="font-size: small;">模板</span></span></p>
</td>
<td style="border-bottom: black 1pt solid; border-left: #ece9d8; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 150.05pt; padding-right: 5.4pt; border-top: black 1pt solid; border-right: black 1pt solid; padding-top: 0cm;" width="200" valign="top">
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt; text-indent: 0cm;"><span style=""><span style="font-size: small;">指代的特征</span></span></p>
</td>
</tr>
<tr style="">
<td style="border-bottom: black 1pt solid; border-left: black 1pt solid; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 150.05pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="200" valign="top">
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt; text-indent: 0cm;"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;">U00:%x[-2,0]</span></span></p>
</td>
<td style="border-bottom: black 1pt solid; border-left: #ece9d8; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 150.05pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="200" valign="top">
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt; text-indent: 0cm;"><span style=""><span style="font-size: small;">疆</span></span></p>
</td>
</tr>
<tr style="">
<td style="border-bottom: black 1pt solid; border-left: black 1pt solid; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 150.05pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="200" valign="top">
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt; text-indent: 0cm;"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;">U01:%x[-1,0]</span></span></p>
</td>
<td style="border-bottom: black 1pt solid; border-left: #ece9d8; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 150.05pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="200" valign="top">
<p class="MsoListParagraph" style="" align="left"><span style=""><span style="font-size: small;">总</span></span></p>
</td>
</tr>
<tr style="">
<td style="border-bottom: black 1pt solid; border-left: black 1pt solid; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 150.05pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="200" valign="top">
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt; text-indent: 0cm;"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;">U02:%x[0,0]</span></span></p>
</td>
<td style="border-bottom: black 1pt solid; border-left: #ece9d8; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 150.05pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="200" valign="top">
<p class="MsoListParagraph" style="" align="left"><span style=""><span style="font-size: small;">统</span></span></p>
</td>
</tr>
<tr style="">
<td style="border-bottom: black 1pt solid; border-left: black 1pt solid; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 150.05pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="200" valign="top">
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt; text-indent: 0cm;"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;">U03:%x[1,0]</span></span></p>
</td>
<td style="border-bottom: black 1pt solid; border-left: #ece9d8; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 150.05pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="200" valign="top">
<p class="MsoListParagraph" style="" align="left"><span style=""><span style="font-size: small;">阿</span></span></p>
</td>
</tr>
<tr style="">
<td style="border-bottom: black 1pt solid; border-left: black 1pt solid; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 150.05pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="200" valign="top">
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt; text-indent: 0cm;"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;">U04:%x[2,0]</span></span></p>
</td>
<td style="border-bottom: black 1pt solid; border-left: #ece9d8; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 150.05pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="200" valign="top">
<p class="MsoListParagraph" style="" align="left"><span style=""><span style="font-size: small;">利</span></span></p>
</td>
</tr>
<tr style="">
<td style="border-bottom: black 1pt solid; border-left: black 1pt solid; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 150.05pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="200" valign="top">
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt; text-indent: 0cm;"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;">U10:%x[-1,0]/%x[0,0]</span></span></p>
</td>
<td style="border-bottom: black 1pt solid; border-left: #ece9d8; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 150.05pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="200" valign="top">
<p class="MsoListParagraph" style="" align="left"><span style="font-size: small;"><span style="">总</span><span lang="EN-US"><span style="font-family: Calibri;">/</span></span><span style="">统</span></span></p>
</td>
</tr>
<tr style="">
<td style="border-bottom: black 1pt solid; border-left: black 1pt solid; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 150.05pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="200" valign="top">
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt; text-indent: 0cm;"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;">U11:%x[0,0]/%x[1,0]</span></span></p>
</td>
<td style="border-bottom: black 1pt solid; border-left: #ece9d8; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 150.05pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="200" valign="top">
<p class="MsoListParagraph" style="" align="left"><span style="font-size: small;"><span style="">统</span><span lang="EN-US"><span style="font-family: Calibri;">/</span></span><span style="">阿</span></span></p>
</td>
</tr>
</tbody></table>
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt 126pt; text-indent: 1.55pt;"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;"></span></span></p>
<p class="MsoListParagraph" style=""><span style="" lang="EN-US"><span style=""><span style="font-size: small; font-family: Calibri;">2.</span><span style='font: 7pt "Times New Roman";'> </span></span></span><span style=""><span style="font-size: small;">特征模板的类型</span></span></p>
<p class="MsoListParagraph" style="text-indent: 0cm;"><span style="" lang="EN-US"><span style=""><span style="font-size: small; font-family: Calibri;">a)</span><span style='font: 7pt "Times New Roman";'> </span></span></span><span style="font-size: small;"><span style="">第一种以字母</span><span lang="EN-US"><span style="font-family: Calibri;">U</span></span><span style="">开头,为</span><span lang="EN-US"><span style="font-family: Calibri;">Unigram template</span></span><span style="">。当模板前加上</span><span lang="EN-US"><span style="font-family: Calibri;">U</span></span><span style="">之后,</span><span lang="EN-US"><span style="font-family: Calibri;">CRF</span></span><span style="">会自动生成一个特征函数集合。</span></span></p>
<p class="MsoListParagraph" style="text-indent: 0cm;"><span style="font-size: small;"><span style="">一个模型生成的特征函数的个数总数为</span><span lang="EN-US"><span style="font-family: Calibri;">L*N</span></span><span style="">,其中</span><span lang="EN-US"><span style="font-family: Calibri;">L</span></span><span style="">是输出的类别数,</span><span lang="EN-US"><span style="font-family: Calibri;">N</span></span><span style="">是根据给定的</span><span lang="EN-US"><span style="font-family: Calibri;">template</span></span><span style="">扩展出的独立串</span><span lang="EN-US"><span style="font-family: Calibri;">(unique string )</span></span><span style="">的数目。</span></span></p>
<p class="MsoListParagraph" style=""><span style="" lang="EN-US"><span style=""><span style="font-size: small; font-family: Calibri;">b)</span><span style='font: 7pt "Times New Roman";'> </span></span></span><span style="font-size: small;"><span style="">第二种特征模板以</span><span lang="EN-US"><span style="font-family: Calibri;">B</span></span><span style="">开头,即</span><span lang="EN-US"><span style="font-family: Calibri;">Bigram template</span></span></span></p>
<p class="MsoListParagraph" style="text-indent: 0cm;"><span style="font-size: small;"><span style="">它用于描述</span><span lang="EN-US"><span style="font-family: Calibri;">Bigram</span></span><span style="">特征。系统将自动产生当前输出</span><span lang="EN-US"><span style="font-family: Calibri;">token</span></span><span style="">与前一个输出</span><span lang="EN-US"><span style="font-family: Calibri;">token</span></span><span style="">的组合。产生的可区分的特征的总数是</span><span lang="EN-US"><span style="font-family: Calibri;">L*L*N</span></span><span style="">,其中</span><span lang="EN-US"><span style="font-family: Calibri;">L</span></span><span style="">是输出类别数,</span><span lang="EN-US"><span style="font-family: Calibri;">N</span></span><span style="">是这个模板产生的</span><span lang="EN-US"><span style="font-family: Calibri;">unique features</span></span><span style="">数。</span></span></p>
<p class="MsoListParagraph" style=""><span style="" lang="EN-US"><span style=""><span style="font-size: small; font-family: Calibri;">c)</span><span style='font: 7pt "Times New Roman";'> </span></span></span><span style=""><span style="font-size: small;">两种模板的区别</span></span></p>
<p class="MsoListParagraph" style=""><span style="font-size: small;"><span style="">注意:</span><span lang="EN-US"><span style="font-family: Calibri;">Unigram/Bigram</span></span><span style="">是指输出</span><span lang="EN-US"><span style="font-family: Calibri;">token</span></span><span style="">的</span><span lang="EN-US"><span style="font-family: Calibri;">Unigram/Bigrams</span></span><span style="">,而不是特征!</span></span></p>
<p class="MsoListParagraph" style=""><span style="font-size: small;"><span lang="EN-US"><span style="font-family: Calibri;">unigram</span></span><span style="">:</span><span lang="EN-US"><span style="font-family: Calibri;">|output tag|</span></span><span style="">×</span><span lang="EN-US"><span style="font-family: Calibri;">|</span></span><span style="">从模板中扩展的所有可能串</span><span lang="EN-US"><span style="font-family: Calibri;">|</span></span></span></p>
<p class="MsoListParagraph" style=""><span style="font-size: small;"><span lang="EN-US"><span style="font-family: Calibri;">bigram: |output tag| </span></span><span style="">×</span><span lang="EN-US"><span style="font-family: Calibri;"> |output tag| </span></span><span style="">×</span><span lang="EN-US"><span style="font-family: Calibri;"> |</span></span><span style="">从模板中扩展的所有可能串</span><span lang="EN-US"><span style="font-family: Calibri;">|</span></span></span></p>
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt 84pt; text-indent: 0cm;"><span style="" lang="EN-US"><span style=""><span style="font-size: small; font-family: Calibri;">b)</span><span style='font: 7pt "Times New Roman";'> </span></span></span><span lang="EN-US"><span style="font-size: small; font-family: Calibri;">iter=88 terr=0.01365 serr=0.23876 obj=67066.17413 diff=0.00006</span></span></p>
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt 84pt; text-indent: 0cm;"><span style="font-size: small;"><span style="">其中:</span><span lang="EN-US"><span style="font-family: Calibri;">iter</span></span><span style="">是迭代次数;</span><span lang="EN-US"><span style="font-family: Calibri;">terr</span></span><span style="">是词错误率;</span><span lang="EN-US"><span style="font-family: Calibri;">serr</span></span><span style="">是句错误率;</span><span lang="EN-US"><span style="font-family: Calibri;">obj</span></span><span style="">是当前对象值,当它收敛时,迭代结束;</span><span lang="EN-US"><span style="font-family: Calibri;">diff</span></span><span style="">是与上一对象的差。</span></span></p>
<p class="MsoListParagraph" style=""><span style="" lang="EN-US"><span style=""><span style="font-size: small; font-family: Calibri;">4.</span><span style='font: 7pt "Times New Roman";'> </span></span></span><span style="font-size: small;"><span lang="EN-US"><span style="font-family: Calibri;">Done!2706.41 s</span></span><span style="">,用时间</span><span lang="EN-US"><span style="font-family: Calibri;">2706.41s</span></span><span style="">(在电脑</span><span lang="EN-US"><span style="font-family: Calibri;">1</span></span><span style="">上)。</span></span></p>
<p class="MsoListParagraph" style=""><span style="" lang="EN-US"><span style=""><span style="font-size: small; font-family: Calibri;">5.</span><span style='font: 7pt "Times New Roman";'> </span></span></span><span style=""><span style="font-size: small;">对测试语料进行测试</span></span></p>
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt 84pt; text-indent: 0cm;"><span style="" lang="EN-US"><span style=""><span style="font-size: small; font-family: Calibri;">a)</span><span style='font: 7pt "Times New Roman";'> </span></span></span><span style="font-size: small;"><span style="">命令:</span><span lang="EN-US"><span style="font-family: Calibri;">crf_test -m model_file test_file > result_file</span></span></span></p>
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt 84pt; text-indent: 0cm;"><span style="font-size: small;"><span style="">其中</span><span lang="EN-US"><span style="font-family: Calibri;"> model_file</span></span><span style="">是刚才生成的</span><span lang="EN-US"><span style="font-family: Calibri;">model</span></span><span style="">文件,</span><span lang="EN-US"><span style="font-family: Calibri;">test_file</span></span><span style="">是待测试语料,“</span><span lang="EN-US"><span style="font-family: Calibri;">>result_file</span></span><span style="">”是重定向语句,指将屏幕输出直接输出到文件</span><span lang="EN-US"><span style="font-family: Calibri;">result_file</span></span><span style="">中。</span></span></p>
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt 84pt; text-indent: 0cm;"><span style="" lang="EN-US"><span style=""><span style="font-size: small; font-family: Calibri;">b)</span><span style='font: 7pt "Times New Roman";'> </span></span></span><span style="font-size: small;"><span lang="EN-US"><span style="font-family: Calibri;">CRF++</span></span><span style="">的解码速度是很快的,尤其是直接写入文件时。但是因为特征选取的问题,正确率、召回率都不高。</span></span></p>
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt 84pt; text-indent: 0cm;"><span style="" lang="EN-US"><span style=""><span style="font-size: small; font-family: Calibri;">c)</span><span style='font: 7pt "Times New Roman";'> </span></span></span><span style="font-size: small;"><span style="">结果使用</span><span lang="EN-US"><span style="font-family: Calibri;">conlleval.pl</span></span><span style="">程序测评。(其代码在提交包根目录中)</span></span></p>
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt 84pt; text-indent: 0cm;"><span style="font-size: small;"><span style="">测评的命令为:</span><span lang="EN-US"><span style="font-family: Calibri;">perl conlleval.pl < output.txt</span></span><span style="">,其中</span><span lang="EN-US"><span style="font-family: Calibri;">output.txt</span></span><span style="">为待评测文件,需要</span><span lang="EN-US"><span style="font-family: Calibri;">perl</span></span><span style="">解释器支持。详细结果如下:</span></span></p>
<table class="MsoNormalTable" style="margin: auto auto auto 4.9pt; width: 651px; border-collapse: collapse;" border="0" cellspacing="0" cellpadding="0"><tbody>
<tr style="height: 13.5pt;">
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">LOC:</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 56pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="75">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">precision:</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">63.67%;</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">recall:</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">72.93%;</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">FB1:</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: right;" align="right"><span style="" lang="EN-US">67.98</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: right;" align="right"><span style="" lang="EN-US">5623</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: right;" align="right"><span style="" lang="EN-US">382251.5</span></p>
</td>
</tr>
<tr style="height: 13.5pt;">
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">ORG:</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 56pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="75">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">precision:</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">21.26%;</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">recall:</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">35.90%;</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">FB1:</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: right;" align="right"><span style="" lang="EN-US">26.71</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: right;" align="right"><span style="" lang="EN-US">4491</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: right;" align="right"><span style="" lang="EN-US">119954.6</span></p>
</td>
</tr>
<tr style="height: 13.5pt;">
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">PER:</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 56pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="75">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">precision:</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">65.90%;</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">recall:</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">65.06%;</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">FB1:</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: right;" align="right"><span style="" lang="EN-US">65.47</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: right;" align="right"><span style="" lang="EN-US">2554</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: right;" align="right"><span style="" lang="EN-US">167210.4</span></p>
</td>
</tr>
<tr style="height: 13.5pt;">
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72"></td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 56pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="75"></td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72"></td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72"></td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72"></td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="">宏平均</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: right;" align="right"><span style="" lang="EN-US">53.38667</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="">微平均:</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: right;" align="right"><span style="" lang="EN-US">52.84311</span></p>
</td>
</tr>
</tbody></table>
<p class="MsoNormal" style="margin: 0cm 0cm 0pt;"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;"></span></span></p>
<p class="MsoListParagraph" style=""><span style="" lang="EN-US"><span style=""><span style='font: 7pt "Times New Roman";'> </span><span style="font-size: small; font-family: Calibri;">ii.</span><span style='font: 7pt "Times New Roman";'> </span></span></span><span style="font-size: small;"><span style="">因为刚才特征选取地特别少,故猜想多加入有效特征可以提高结果,于是把模板定义如下:(本次试验的相关数据文件在包</span><span lang="EN-US"><span style="font-family: Calibri;">test2.rar</span></span><span style="">中)</span></span></p>
<p class="MsoListParagraph" style=""><span style="" lang="EN-US"><span style=""><span style="font-size: small; font-family: Calibri;">1.</span><span style='font: 7pt "Times New Roman";'> </span></span></span><span style="font-size: small;"><span style="">模板</span><span lang="EN-US"><span style="font-family: Calibri;">2</span></span><span style="">:</span></span></p>
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt 83.9pt; text-indent: 0cm;"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;">#Unigram</span></span></p>
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt 83.9pt; text-indent: 0cm;"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;">U00:%x[-2,0]</span></span></p>
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt 83.9pt; text-indent: 0cm;"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;">U01:%x[-1,0]</span></span></p>
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt 83.9pt; text-indent: 0cm;"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;">U02:%x[0,0]</span></span></p>
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt 83.9pt; text-indent: 0cm;"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;">U03:%x[1,0]</span></span></p>
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt 83.9pt; text-indent: 0cm;"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;">U04:%x[2,0]</span></span></p>
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt 83.9pt; text-indent: 0cm;"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;">U5:%x[-2,0]/%x[-1,0]</span></span></p>
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt 83.9pt; text-indent: 0cm;"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;">U6:%x[-1,0]/%x[0,0]</span></span></p>
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt 83.9pt; text-indent: 0cm;"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;">U7:%x[0,0]/%x[1,0]</span></span></p>
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt 84pt; text-indent: 0cm;"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;">U8:%x[1,0]/%x[2,0]</span></span></p>
<p class="MsoListParagraph" style=""><span style="" lang="EN-US"><span style=""><span style="font-size: small; font-family: Calibri;">2.</span><span style='font: 7pt "Times New Roman";'> </span></span></span><span style=""><span style="font-size: small;">相关的实验数据如下:</span></span></p>
<p class="MsoListParagraph" style=""><span style="" lang="EN-US"><span style=""><span style="font-size: small; font-family: Calibri;">a)</span><span style='font: 7pt "Times New Roman";'> </span></span></span><span style=""><span style="font-size: small;">训练过程:</span></span></p>
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt 105pt; text-indent: 0cm;"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;">iter=94 terr=0.00571 serr=0.12313 obj=53321.45523 diff=0.00000</span></span></p>
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt 105pt; text-indent: 0cm;"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;">Done!2915.53 s</span></span></p>
<p class="MsoListParagraph" style=""><span style="" lang="EN-US"><span style=""><span style="font-size: small; font-family: Calibri;">b)</span><span style='font: 7pt "Times New Roman";'> </span></span></span><span style=""><span style="font-size: small;">测试结果:</span></span></p>
<table class="MsoNormalTable" style="margin: auto auto auto 4.9pt; width: 651px; border-collapse: collapse;" border="0" cellspacing="0" cellpadding="0"><tbody>
<tr style="height: 13.5pt;">
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">LOC:</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 56pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="75">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">precision:</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">66.86%;</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">recall:</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">74.31%;</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">FB1:</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: right;" align="right"><span style="" lang="EN-US">70.39</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: right;" align="right"><span style="" lang="EN-US">5456</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: right;" align="right"><span style="" lang="EN-US">384047.8</span></p>
</td>
</tr>
<tr style="height: 13.5pt;">
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">ORG:</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 56pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="75">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">precision:</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">26.95%;</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">recall:</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">41.02%;</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">FB1:</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: right;" align="right"><span style="" lang="EN-US">32.53</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: right;" align="right"><span style="" lang="EN-US">4048</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: right;" align="right"><span style="" lang="EN-US">131681.4</span></p>
</td>
</tr>
<tr style="height: 13.5pt;">
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">PER:</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 56pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="75">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">precision:</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">68.29%;</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">recall:</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">65.67%;</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">FB1:</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: right;" align="right"><span style="" lang="EN-US">66.96</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: right;" align="right"><span style="" lang="EN-US">2488</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: right;" align="right"><span style="" lang="EN-US">166596.5</span></p>
</td>
</tr>
<tr style="height: 13.5pt;">
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72"></td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 56pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="75"></td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72"></td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72"></td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72"></td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="">宏平均</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: right;" align="right"><span style="" lang="EN-US">56.62667</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="">微平均:</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: right;" align="right"><span style="" lang="EN-US">56.89841</span></p>
</td>
</tr>
</tbody></table>
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt 105pt; text-indent: 0cm;"><span style=""><span style="font-size: small;">的确有所进步,但是还是明显显低。</span></span></p>
<p class="MsoListParagraph" style=""><a name="_Ref233259991"><span style="" lang="EN-US"><span style=""><span style="font-size: small; font-family: Calibri;">a)</span><span style="font: 7pt 'Times New Roman';"> </span></span></span><span style=""><span style="font-size: small;">制定规则,改进结果</span></span></a></p>
<p class="MsoListParagraph" style=""><span style="" lang="EN-US"><span style=""><span style='font: 7pt "Times New Roman";'> </span><span style="font-size: small; font-family: Calibri;">i.</span><span style='font: 7pt "Times New Roman";'> </span></span></span><span style="font-size: small;"><span style="">对结果进行分析(详见各包中以</span><span lang="EN-US"><span style="font-family: Calibri;">error</span></span><span style="">开头的文件),可以发现错误主要有以下几种:</span></span></p>
<p class="MsoListParagraph" style=""><span style="" lang="EN-US"><span style=""><span style="font-size: small; font-family: Calibri;">1.</span><span style='font: 7pt "Times New Roman";'> </span></span></span><span style=""><span style="font-size: small;">同一实体内不同字间的类型不同,则以字类数较多者为准</span></span></p>
<p class="MsoListParagraph" style=""><span style="" lang="EN-US"><span style=""><span style="font-size: small; font-family: Calibri;">a)</span><span style='font: 7pt "Times New Roman";'> </span></span></span><span style="font-size: small;"><span style="">个数相同时,多数情况下为</span><span lang="EN-US"><span style="font-family: Calibri;">LOC</span></span></span></p>
<p class="MsoListParagraph" style=""><span style="" lang="EN-US"><span style=""><span style="font-size: small; font-family: Calibri;">2.</span><span style='font: 7pt "Times New Roman";'> </span></span></span><span style="font-size: small;"><span style="">实体开头的字必定为</span><span lang="EN-US"><span style="font-family: Calibri;">B-???</span></span><span style="">格式</span></span></p>
<p class="MsoListParagraph" style=""><span style="" lang="EN-US"><span style=""><span style="font-size: small; font-family: Calibri;">3.</span><span style='font: 7pt "Times New Roman";'> </span></span></span><span style=""><span style="font-size: small;">实体的开始和结尾都有特定的特征可以遵循(如停用词、动词等作为分界等)</span></span></p>
<p class="MsoListParagraph" style=""><span style="" lang="EN-US"><span style=""><span style="font-size: small; font-family: Calibri;">4.</span><span style='font: 7pt "Times New Roman";'> </span></span></span><span style="font-size: small;"><span style="">固定实体后跟实体应为</span><span lang="EN-US"><span style="font-family: Calibri;">B-???</span></span><span style="">格式(如省名后)</span></span></p>
<p class="MsoListParagraph" style=""><span style="" lang="EN-US"><span style=""><span style="font-size: small; font-family: Calibri;">5.</span><span style='font: 7pt "Times New Roman";'> </span></span></span><span style=""><span style="font-size: small;">实体间间隔较小时可能合并为同一实体</span></span></p>
<p class="MsoListParagraph" style=""><span style="" lang="EN-US"><span style=""><span style="font-size: small; font-family: Calibri;">6.</span><span style='font: 7pt "Times New Roman";'> </span></span></span><span style=""><span style="font-size: small;">……</span></span></p>
<p class="MsoListParagraph" style=""><span style="" lang="EN-US"><span style=""><span style='font: 7pt "Times New Roman";'> </span><span style="font-size: small; font-family: Calibri;">ii.</span><span style='font: 7pt "Times New Roman";'> </span></span></span><span style="font-size: small;"><span style="">根据以上特点对结果进行优化,计划依次试验各个规则。但因时间因素,只检测了四五种,其中较有效果的是前两种(即规则</span><span lang="EN-US"><span style="font-family: Calibri;">1</span></span><span style="">和</span><span lang="EN-US"><span style="font-family: Calibri;">2</span></span><span style="">),两者结合可以把结果成绩提高</span><span lang="EN-US"><span style="font-family: Calibri;">12%</span></span><span style="">左右。在</span><span lang="EN-US"><span style="font-family: Calibri;">test2</span></span><span style="">的结果上加以更正,得到的结果如下:</span></span></p>
<table class="MsoNormalTable" style="margin: auto auto auto 4.9pt; width: 651px; border-collapse: collapse;" border="0" cellspacing="0" cellpadding="0"><tbody>
<tr style="height: 13.5pt;">
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">LOC:</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 56pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="75">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">precision:</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">79.40%;</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">recall:</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">76.43%;</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">FB1:</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: right;" align="right"><span style="" lang="EN-US">77.89</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: right;" align="right"><span style="" lang="EN-US">4966</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: right;" align="right"><span style="" lang="EN-US">386801.7</span></p>
</td>
</tr>
<tr style="height: 13.5pt;">
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">ORG:</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 56pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="75">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">precision:</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">53.86%;</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">recall:</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">52.63%;</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">FB1:</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: right;" align="right"><span style="" lang="EN-US">53.24</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: right;" align="right"><span style="" lang="EN-US">3457</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: right;" align="right"><span style="" lang="EN-US">184050.7</span></p>
</td>
</tr>
<tr style="height: 13.5pt;">
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">PER:</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 56pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="75">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">precision:</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">80.88%;</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">recall:</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">67.09%;</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">FB1:</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: right;" align="right"><span style="" lang="EN-US">73.34</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: right;" align="right"><span style="" lang="EN-US">2327</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: right;" align="right"><span style="" lang="EN-US">170662.2</span></p>
</td>
</tr>
<tr style="height: 13.5pt;">
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72"></td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 56pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="75"></td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72"></td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72"></td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72"></td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="">宏平均</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: right;" align="right"><span style="" lang="EN-US">68.15667</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="">微平均:</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: right;" align="right"><span style="" lang="EN-US">68.9781</span></p>
</td>
</tr>
</tbody></table>
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt 63pt; text-indent: 0cm;"><span style="font-size: small;"><span style="">虽然</span><span lang="EN-US"><span style="font-family: Calibri;">F</span></span><span style="">值有很大提高,但是还是太不理想</span></span></p>
<p class="MsoListParagraph" style=""><a name="_Ref233259997"><span style="" lang="EN-US"><span style=""><span style="font-size: small; font-family: Calibri;">c)</span><span style='font: 7pt "Times New Roman";'> </span></span></span><span style="font-size: small;"><span style="">先分词并标注词性信息,再用</span><span lang="EN-US"><span style="font-family: Calibri;">CRF</span></span></span></a><span style=""><span style=""><span style="font-size: small;">学习规则</span></span></span></p>
<p class="MsoListParagraph" style=""><span style="" lang="EN-US"><span style=""><span style='font: 7pt "Times New Roman";'> </span><span style="font-size: small; font-family: Calibri;">i.</span><span style='font: 7pt "Times New Roman";'> </span></span></span><span style="font-size: small;"><span style="">看来单从字的角度着眼已然不够,于是试图利用分词和词性标注信息。因为题目未给出相应信息,故用分词标注信息先进行分词标注(分词标注工具见附件包根目录)。</span></span></p>
<p class="MsoListParagraph" style=""><span style="" lang="EN-US"><span style=""><span style='font: 7pt "Times New Roman";'> </span><span style="font-size: small; font-family: Calibri;">ii.</span><span style='font: 7pt "Times New Roman";'> </span></span></span><span style=""><span style="font-size: small;">分词标注后,字的特征如下所示:</span></span></p>
<table class="MsoTableGrid" style="margin: auto auto auto 63pt; border-collapse: collapse;" border="1" cellspacing="0" cellpadding="0"><tbody>
<tr style="">
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 90.75pt; padding-right: 5.4pt; padding-top: 0cm; border: black 1pt solid;" width="121" valign="top">
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt; text-indent: 0cm;"><span style=""><span style="font-size: small;">字</span></span></p>
</td>
<td style="border-bottom: black 1pt solid; border-left: #ece9d8; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 90.8pt; padding-right: 5.4pt; border-top: black 1pt solid; border-right: black 1pt solid; padding-top: 0cm;" width="121" valign="top">
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt; text-indent: 0cm;"><span style=""><span style="font-size: small;">词性及分词标记</span></span></p>
</td>
<td style="border-bottom: black 1pt solid; border-left: #ece9d8; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 90.8pt; padding-right: 5.4pt; border-top: black 1pt solid; border-right: black 1pt solid; padding-top: 0cm;" width="121" valign="top">
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt; text-indent: 0cm;"><span style=""><span style="font-size: small;">实体标记</span></span></p>
</td>
</tr>
<tr style="">
<td style="border-bottom: black 1pt solid; border-left: black 1pt solid; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 90.75pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="121" valign="top">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt;"><span style=""><span style="font-size: small;">:</span></span></p>
</td>
<td style="border-bottom: black 1pt solid; border-left: #ece9d8; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 90.8pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="121" valign="top">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt;"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;">Sw</span></span></p>
</td>
<td style="border-bottom: black 1pt solid; border-left: #ece9d8; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 90.8pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="121" valign="top">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt;"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;">N</span></span></p>
</td>
</tr>
<tr style="">
<td style="border-bottom: black 1pt solid; border-left: black 1pt solid; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 90.75pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="121" valign="top">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt;"><span style=""><span style="font-size: small;">印</span></span></p>
</td>
<td style="border-bottom: black 1pt solid; border-left: #ece9d8; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 90.8pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="121" valign="top">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt;"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;">Bns</span></span></p>
</td>
<td style="border-bottom: black 1pt solid; border-left: #ece9d8; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 90.8pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="121" valign="top">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt;"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;">B-LOC</span></span></p>
</td>
</tr>
<tr style="">
<td style="border-bottom: black 1pt solid; border-left: black 1pt solid; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 90.75pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="121" valign="top">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt;"><span style=""><span style="font-size: small;">度</span></span></p>
</td>
<td style="border-bottom: black 1pt solid; border-left: #ece9d8; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 90.8pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="121" valign="top">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt;"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;">Ens</span></span></p>
</td>
<td style="border-bottom: black 1pt solid; border-left: #ece9d8; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 90.8pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="121" valign="top">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt;"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;">I-LOC</span></span></p>
</td>
</tr>
<tr style="">
<td style="border-bottom: black 1pt solid; border-left: black 1pt solid; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 90.75pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="121" valign="top">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt;"><span style=""><span style="font-size: small;">首</span></span></p>
</td>
<td style="border-bottom: black 1pt solid; border-left: #ece9d8; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 90.8pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="121" valign="top">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt;"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;">Bd</span></span></p>
</td>
<td style="border-bottom: black 1pt solid; border-left: #ece9d8; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 90.8pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="121" valign="top">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt;"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;">N</span></span></p>
</td>
</tr>
<tr style="">
<td style="border-bottom: black 1pt solid; border-left: black 1pt solid; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 90.75pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="121" valign="top">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt;"><span style=""><span style="font-size: small;">先</span></span></p>
</td>
<td style="border-bottom: black 1pt solid; border-left: #ece9d8; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 90.8pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="121" valign="top">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt;"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;">Ed</span></span></p>
</td>
<td style="border-bottom: black 1pt solid; border-left: #ece9d8; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 90.8pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="121" valign="top">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt;"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;">N</span></span></p>
</td>
</tr>
</tbody></table>
<p class="MsoListParagraph" style=""><span style="" lang="EN-US"><span style=""><span style='font: 7pt "Times New Roman";'> </span><span style="font-size: small; font-family: Calibri;">iii.</span><span style='font: 7pt "Times New Roman";'> </span></span></span><span style=""><span style="font-size: small;">于是针对其建立模板:</span></span></p>
<p class="MsoListParagraph" style=""><span style="" lang="EN-US"><span style=""><span style='font: 7pt "Times New Roman";'> </span><span style="font-size: small; font-family: Calibri;">iv.</span><span style='font: 7pt "Times New Roman";'> </span></span></span><span style="font-size: small;"><span style="">以此模板进行训练,得到模型后进行测试,最后用</span><span lang="EN-US"><span style="font-family: Calibri;">conlleval</span></span><span style="">测得结果如下:</span><span style="font-family: Calibri;"> </span></span></p>
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt 63pt; text-indent: 0cm;"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;">iter=226 terr=0.00935 serr=0.17661 act=2913330 obj=42785.69115 diff=0.00009</span></span></p>
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt 63pt; text-indent: 0cm;"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;">Done!4502.97 s</span></span></p>
<table class="MsoNormalTable" style="margin: auto auto auto 4.9pt; width: 651px; border-collapse: collapse;" border="0" cellspacing="0" cellpadding="0"><tbody>
<tr style="height: 13.5pt;">
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">LOC:</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 56pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="75">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">precision:</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">82.05%;</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">recall:</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">89.97%;</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">FB1:</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: right;" align="right"><span style="" lang="EN-US">85.83</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: right;" align="right"><span style="" lang="EN-US">20309</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: right;" align="right"><span style="" lang="EN-US">1743121</span></p>
</td>
</tr>
<tr style="height: 13.5pt;">
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">ORG:</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 56pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="75">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">precision:</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">48.36%;</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">recall:</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">65.12%;</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">FB1:</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: right;" align="right"><span style="" lang="EN-US">55.5</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: right;" align="right"><span style="" lang="EN-US">13818</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: right;" align="right"><span style="" lang="EN-US">766899</span></p>
</td>
</tr>
<tr style="height: 13.5pt;">
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">PER:</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 56pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="75">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">precision:</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">91.52%;</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">recall:</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">93.15%;</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">FB1:</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: right;" align="right"><span style="" lang="EN-US">92.33</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: right;" align="right"><span style="" lang="EN-US">9189</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: right;" align="right"><span style="" lang="EN-US">848420.4</span></p>
</td>
</tr>
<tr style="height: 13.5pt;">
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72"></td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 56pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="75"></td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72"></td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72"></td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72"></td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="">宏平均</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: right;" align="right"><span style="" lang="EN-US">77.88667</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="">微平均:</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: right;" align="right"><span style="" lang="EN-US">77.53349</span></p>
</td>
</tr>
</tbody></table>
<p class="MsoListParagraph" style=""><span style="" lang="EN-US"><span style=""><span style='font: 7pt "Times New Roman";'> </span><span style="font-size: small; font-family: Calibri;">v.</span><span style='font: 7pt "Times New Roman";'> </span></span></span><span style=""><span style="font-size: small;">对此结果再以用前面建立的规则优化,最终得到结果如下:</span></span></p>
<table class="MsoNormalTable" style="margin: auto auto auto 4.9pt; width: 651px; border-collapse: collapse;" border="0" cellspacing="0" cellpadding="0"><tbody>
<tr style="height: 13.5pt;">
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">LOC:</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 56pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="75">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">precision:</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">90.34%;</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">recall:</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">90.37%;</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">FB1:</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: right;" align="right"><span style="" lang="EN-US">90.36</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: right;" align="right"><span style="" lang="EN-US">18878</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: right;" align="right"><span style="" lang="EN-US">1705816</span></p>
</td>
</tr>
<tr style="height: 13.5pt;">
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">ORG:</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 56pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="75">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">precision:</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">70.47%;</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">recall:</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">71.54%;</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">FB1:</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: right;" align="right"><span style="" lang="EN-US">71</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: right;" align="right"><span style="" lang="EN-US">12474</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: right;" align="right"><span style="" lang="EN-US">885654</span></p>
</td>
</tr>
<tr style="height: 13.5pt;">
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">PER:</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 56pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="75">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">precision:</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">94.85%;</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">recall:</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">92.70%;</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="" lang="EN-US">FB1:</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: right;" align="right"><span style="" lang="EN-US">93.76</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: right;" align="right"><span style="" lang="EN-US">8954</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: right;" align="right"><span style="" lang="EN-US">839527</span></p>
</td>
</tr>
<tr style="height: 13.5pt;">
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72"></td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 56pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="75"></td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72"></td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72"></td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72"></td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="">宏平均</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: right;" align="right"><span style="" lang="EN-US">85.04</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: left;" align="left"><span style="">微平均:</span></p>
</td>
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 54pt; padding-right: 5.4pt; height: 13.5pt; padding-top: 0cm; border: #ece9d8;" width="72">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt; text-align: right;" align="right"><span style="" lang="EN-US">85.12373</span></p>
</td>
</tr>
</tbody></table>
<p class="MsoNormal" style="margin: 0cm 0cm 0pt;"><span style="font-size: small;"><span style="">在此基础上对</span><span lang="EN-US"><span style="font-family: Calibri;">Test_utf16.ner</span></span><span style="">进行训练,最终得到</span><span lang="EN-US"><span style="font-family: Calibri;">finalAnswer.txt</span></span></span></p>
<p class="MsoListParagraph" style=""><span style="font-size: small;"><span style="">三<a name="_Ref233260008"><span style="" lang="EN-US"><span style=""><span style="color: #000000; font-family: Calibri;">、</span></span></span><span style="">实验结果对照</span></a></span></span><span style="font-size: small;"><span style="">表</span></span></p>
<table class="MsoTableGrid" style="" border="1" cellspacing="0" cellpadding="0"><tbody>
<tr style="">
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 35.4pt; padding-right: 5.4pt; padding-top: 0cm; border: black 1pt solid;" width="47" valign="top">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt;"><span style=""><span style="font-size: small;">编号</span></span></p>
</td>
<td style="border-bottom: black 1pt solid; border-left: #ece9d8; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 70.95pt; padding-right: 5.4pt; border-top: black 1pt solid; border-right: black 1pt solid; padding-top: 0cm;" width="95" valign="top">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt;"><span style=""><span style="font-size: small;">使用策略</span></span></p>
</td>
<td style="border-bottom: black 1pt solid; border-left: #ece9d8; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 2cm; padding-right: 5.4pt; border-top: black 1pt solid; border-right: black 1pt solid; padding-top: 0cm;" width="76" valign="top">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt;"><span style=""><span style="font-size: small;">结果</span></span></p>
</td>
<td style="border-bottom: black 1pt solid; border-left: #ece9d8; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 99.2pt; padding-right: 5.4pt; border-top: black 1pt solid; border-right: black 1pt solid; padding-top: 0cm;" width="132" valign="top">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt;"><span style=""><span style="font-size: small;">方法改进</span></span></p>
</td>
<td style="border-bottom: black 1pt solid; border-left: #ece9d8; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 2cm; padding-right: 5.4pt; border-top: black 1pt solid; border-right: black 1pt solid; padding-top: 0cm;" width="76" valign="top">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt;"><span style=""><span style="font-size: small;">性能提升</span></span></p>
</td>
<td style="border-bottom: black 1pt solid; border-left: #ece9d8; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 205.55pt; padding-right: 5.4pt; border-top: black 1pt solid; border-right: black 1pt solid; padding-top: 0cm;" width="274" valign="top">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt;"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;"></span></span></p>
</td>
</tr>
<tr style="">
<td style="border-bottom: black 1pt solid; border-left: black 1pt solid; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 35.4pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="47" valign="top">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt;"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;">1</span></span></p>
</td>
<td style="border-bottom: black 1pt solid; border-left: #ece9d8; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 70.95pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="95" valign="top">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt;"><span style="font-size: small;"><span style="">单字</span><span lang="EN-US"><span style="font-family: Calibri;">CRF(1)</span></span></span></p>
</td>
<td style="border-bottom: black 1pt solid; border-left: #ece9d8; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 2cm; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="76" valign="top">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt;"><span style="font-size: small;"><span style="">约</span><span lang="EN-US"><span style="font-family: Calibri;">53%</span></span></span></p>
</td>
<td style="border-bottom: black 1pt solid; border-left: #ece9d8; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 99.2pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="132" valign="top">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt;"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;"></span></span></p>
</td>
<td style="border-bottom: black 1pt solid; border-left: #ece9d8; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 2cm; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="76" valign="top">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt;"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;"></span></span></p>
</td>
<td style="border-bottom: black 1pt solid; border-left: #ece9d8; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 205.55pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="274" valign="top">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt;"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;"></span></span></p>
</td>
</tr>
<tr style="">
<td style="border-bottom: black 1pt solid; border-left: black 1pt solid; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 35.4pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="47" valign="top">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt;"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;">2</span></span></p>
</td>
<td style="border-bottom: black 1pt solid; border-left: #ece9d8; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 70.95pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="95" valign="top">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt;"><span style="font-size: small;"><span style="">单字</span><span lang="EN-US"><span style="font-family: Calibri;">CRF(1)</span></span></span></p>
</td>
<td style="border-bottom: black 1pt solid; border-left: #ece9d8; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 2cm; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="76" valign="top">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt;"><span style="font-size: small;"><span style="">约</span><span lang="EN-US"><span style="font-family: Calibri;">56.7%</span></span></span></p>
</td>
<td style="border-bottom: black 1pt solid; border-left: #ece9d8; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 99.2pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="132" valign="top">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt;"><span style=""><span style="font-size: small;">使用更多的特征信息</span></span></p>
</td>
<td style="border-bottom: black 1pt solid; border-left: #ece9d8; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 2cm; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="76" valign="top">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt;"><span style="font-size: small;"><span style="">约</span><span lang="EN-US"><span style="font-family: Calibri;">3.7%</span></span></span></p>
</td>
<td style="border-bottom: black 1pt solid; border-left: #ece9d8; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 205.55pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="274" valign="top">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt;"><span style=""><span style="font-size: small;">特征对于结果有较大影响,但因硬件条件和时间原因未能引入更多的特征加以佐证。</span></span></p>
</td>
</tr>
<tr style="">
<td style="border-bottom: black 1pt solid; border-left: black 1pt solid; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 35.4pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="47" valign="top">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt;"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;">3</span></span></p>
</td>
<td style="border-bottom: black 1pt solid; border-left: #ece9d8; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 70.95pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="95" valign="top">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt;"><span style="font-size: small;"><span style="">单字</span><span lang="EN-US"><span style="font-family: Calibri;">CRF+</span></span><span style="">规则</span></span></p>
</td>
<td style="border-bottom: black 1pt solid; border-left: #ece9d8; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 2cm; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="76" valign="top">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt;"><span style="font-size: small;"><span style="">约</span><span lang="EN-US"><span style="font-family: Calibri;">68.5%</span></span></span></p>
</td>
<td style="border-bottom: black 1pt solid; border-left: #ece9d8; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 99.2pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="132" valign="top">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt;"><span style=""><span style="font-size: small;">人工添加规则,对结果进行优化</span></span></p>
</td>
<td style="border-bottom: black 1pt solid; border-left: #ece9d8; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 2cm; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="76" valign="top">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt;"><span style="font-size: small;"><span style="">约</span><span lang="EN-US"><span style="font-family: Calibri;">11.8%</span></span></span></p>
</td>
<td style="border-bottom: black 1pt solid; border-left: #ece9d8; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 205.55pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="274" valign="top">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt;"><span style=""><span style="font-size: small;">规则可以弥补机器学习方法的不足,依次(并改变规则的顺序)尝试各种规则。</span></span></p>
</td>
</tr>
<tr style="">
<td style="border-bottom: black 1pt solid; border-left: black 1pt solid; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 35.4pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="47" valign="top">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt;"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;">4</span></span></p>
</td>
<td style="border-bottom: black 1pt solid; border-left: #ece9d8; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 70.95pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="95" valign="top">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt;"><span style="font-size: small;"><span style="">分词</span><span lang="EN-US"><span style="font-family: Calibri;">+</span></span><span style="">词性标注</span><span lang="EN-US"><span style="font-family: Calibri;">+CRF</span></span></span></p>
</td>
<td style="border-bottom: black 1pt solid; border-left: #ece9d8; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 2cm; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="76" valign="top">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt;"><span style="font-size: small;"><span style="">约</span><span lang="EN-US"><span style="font-family: Calibri;">77.7%</span></span></span></p>
</td>
<td style="border-bottom: black 1pt solid; border-left: #ece9d8; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 99.2pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="132" valign="top">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt;"><span style=""><span style="font-size: small;">采用了不同方法</span></span></p>
</td>
<td style="border-bottom: black 1pt solid; border-left: #ece9d8; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 2cm; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="76" valign="top">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt;"><span style="font-size: small;"><span style="">约</span><span lang="EN-US"><span style="font-family: Calibri;">9.2%</span></span></span></p>
</td>
<td style="border-bottom: black 1pt solid; border-left: #ece9d8; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 205.55pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="274" valign="top">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt;"><span style=""><span style="font-size: small;">引入词的概念显然</span></span></p>
</td>
</tr>
<tr style="">
<td style="border-bottom: black 1pt solid; border-left: black 1pt solid; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 35.4pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="47" valign="top">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt;"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;">5</span></span></p>
</td>
<td style="border-bottom: black 1pt solid; border-left: #ece9d8; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 70.95pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="95" valign="top">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt;"><span style="font-size: small;"><span style="">分词</span><span lang="EN-US"><span style="font-family: Calibri;">+</span></span><span style="">词性标注</span><span lang="EN-US"><span style="font-family: Calibri;">+CRF+</span></span><span style="">规则</span></span></p>
</td>
<td style="border-bottom: black 1pt solid; border-left: #ece9d8; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 2cm; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="76" valign="top">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt;"><span style="font-size: small;"><span style="">约</span><span style="color: red;" lang="EN-US"><span style="font-family: Calibri;">85.1%</span></span></span></p>
</td>
<td style="border-bottom: black 1pt solid; border-left: #ece9d8; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 99.2pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="132" valign="top">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt;"><span style="font-size: small;"><span style="">在</span><span lang="EN-US"><span style="font-family: Calibri;">4</span></span><span style="">基础上引入规则</span></span></p>
</td>
<td style="border-bottom: black 1pt solid; border-left: #ece9d8; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 2cm; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="76" valign="top">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt;"><span style="font-size: small;"><span style="">约</span><span lang="EN-US"><span style="font-family: Calibri;">7.4%</span></span></span></p>
</td>
<td style="border-bottom: black 1pt solid; border-left: #ece9d8; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 205.55pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="274" valign="top">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt;"><span style=""><span style="font-size: small;">机器学习方法的某些弊端不随条件的变化而变化</span></span></p>
</td>
</tr>
<tr style="">
<td style="border-bottom: black 1pt solid; border-left: black 1pt solid; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 35.4pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="47" valign="top">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt;"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;">6</span></span></p>
</td>
<td style="border-bottom: black 1pt solid; border-left: #ece9d8; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 70.95pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="95" valign="top">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt;"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;"></span></span></p>
</td>
<td style="border-bottom: black 1pt solid; border-left: #ece9d8; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 2cm; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="76" valign="top">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt;"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;"></span></span></p>
</td>
<td style="border-bottom: black 1pt solid; border-left: #ece9d8; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 99.2pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="132" valign="top">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt;"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;"></span></span></p>
</td>
<td style="border-bottom: black 1pt solid; border-left: #ece9d8; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 2cm; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="76" valign="top">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt;"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;"></span></span></p>
</td>
<td style="border-bottom: black 1pt solid; border-left: #ece9d8; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 205.55pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="274" valign="top">
<p class="MsoNormal" style="margin: 0cm 0cm 0pt;"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;"></span></span></p>
</td>
</tr>
</tbody></table>
<p class="MsoListParagraph" style="">四<a name="_Ref233260012"><span style="font-size: small;"><span style="" lang="EN-US"><span style=""><span style="font-family: Calibri;">、</span></span></span><span style="">未来的工作</span></span></a></p>
<p class="MsoListParagraph" style=""><span style="" lang="EN-US"><span style=""><span style="font-size: small; font-family: Calibri;">a)</span><span style='font: 7pt "Times New Roman";'> </span></span></span><span style=""><span style="font-size: small;">尝试更多的规则,尽量减少机器学习方法的弊端;</span></span></p>
<p class="MsoListParagraph" style=""><span style="" lang="EN-US"><span style=""><span style="font-size: small; font-family: Calibri;">b)</span><span style='font: 7pt "Times New Roman";'> </span></span></span><span style=""><span style="font-size: small;">尝试把分词和词性信息作为不同的属性,看看对结果有什么影响;</span></span></p>
<p class="MsoListParagraph" style=""><span style="" lang="EN-US"><span style=""><span style="font-size: small; font-family: Calibri;">c)</span><span style='font: 7pt "Times New Roman";'> </span></span></span><span style=""><span style="font-size: small;">改进分词及词性标注的正确率,以便收到更好的命名实体识别的效果。</span></span></p>
<p class="MsoListParagraph" style=""><a name="_Ref233260017"><span style="font-size: small;"><span style="" lang="EN-US"><span style=""><span style="font-family: Calibri;">五、</span></span></span><span style="">注意事项</span></span></a></p>
<p class="MsoListParagraph" style=""><span style="" lang="EN-US"><span style=""><span style="font-size: small; font-family: Calibri;">a)</span><span style='font: 7pt "Times New Roman";'> </span></span></span><span style=""><span style="font-size: small;">编码格式可能造成某些文件无法正常处理,当出现格式错误时要留心一下;</span></span></p>
<p class="MsoListParagraph" style=""><span style="" lang="EN-US"><span style=""><span style="font-size: small; font-family: Calibri;">b)</span><span style='font: 7pt "Times New Roman";'> </span></span></span><span style=""><span style="font-size: small;">各个程序所需要的分隔符不尽相同,主要是空格和制表符,在遇到问题时注意看是不是分隔符不符合程序要求;</span></span></p>
<p class="MsoListParagraph" style=""><span style="" lang="EN-US"><span style=""><span style="font-size: small; font-family: Calibri;">c)</span><span style='font: 7pt "Times New Roman";'> </span></span></span><span style=""><span style="font-size: small;">实验过程中开发的一些实用小工具并未提供说明书,但这些小工具界面简洁,使用方便,应该很容易掌握。</span></span></p>
<table class="MsoTableGrid" style="margin: auto auto auto 42pt; border-collapse: collapse;" border="1" cellspacing="0" cellpadding="0"><tbody>
<tr style="">
<td style="padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 198.75pt; padding-right: 5.4pt; padding-top: 0cm; border: black 1pt solid;" width="265" valign="top">
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt; text-indent: 0cm;"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;"></span></span></p>
</td>
<td style="border-bottom: black 1pt solid; border-left: #ece9d8; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 185.35pt; padding-right: 5.4pt; border-top: black 1pt solid; border-right: black 1pt solid; padding-top: 0cm;" width="247" valign="top">
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt; text-indent: 0cm;"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;"></span></span></p>
</td>
</tr>
<tr style="">
<td style="border-bottom: black 1pt solid; border-left: black 1pt solid; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 198.75pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="265" valign="top">
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt; text-indent: 0cm;"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;">Felomeng.BackFormation</span></span></p>
</td>
<td style="border-bottom: black 1pt solid; border-left: #ece9d8; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 185.35pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="247" valign="top">
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt; text-indent: 0cm;"><span style=""><span style="font-size: small;">用于在标准格式和分词标注格式之间转换,还附带将两种标记合并、将分词标注信息删除两个功能</span></span></p>
</td>
</tr>
<tr style="">
<td style="border-bottom: black 1pt solid; border-left: black 1pt solid; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 198.75pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="265" valign="top">
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt; text-indent: 0cm;"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;">Felomeng.ErrorExtractor</span></span></p>
</td>
<td style="border-bottom: black 1pt solid; border-left: #ece9d8; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 185.35pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="247" valign="top">
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt; text-indent: 0cm;"><span style=""><span style="font-size: small;">错误提取工具,可以方便地从结果(带答案)中提取错误,以便于实验分析</span></span></p>
</td>
</tr>
<tr style="">
<td style="border-bottom: black 1pt solid; border-left: black 1pt solid; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 198.75pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="265" valign="top">
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt; text-indent: 0cm;"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;">Felomeng.NERRules</span></span></p>
</td>
<td style="border-bottom: black 1pt solid; border-left: #ece9d8; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 185.35pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="247" valign="top">
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt; text-indent: 0cm;"><span style=""><span style="font-size: small;">本来有四个功能,因为实验中验证了前三个功能效果不佳,固主要功能就是改善结果(对机器学习方法的结果进行规则化改进)。</span></span></p>
</td>
</tr>
<tr style="">
<td style="border-bottom: black 1pt solid; border-left: black 1pt solid; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 198.75pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="265" valign="top">
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt; text-indent: 0cm;"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;"></span></span></p>
</td>
<td style="border-bottom: black 1pt solid; border-left: #ece9d8; padding-bottom: 0cm; background-color: transparent; padding-left: 5.4pt; width: 185.35pt; padding-right: 5.4pt; border-top: #ece9d8; border-right: black 1pt solid; padding-top: 0cm;" width="247" valign="top">
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt; text-indent: 0cm;"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;"></span></span></p>
</td>
</tr>
</tbody></table>
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt 42pt; text-indent: 0cm;"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;"></span></span></p>
<p class="MsoListParagraph" style="margin: 0cm 0cm 0pt 42pt; text-indent: 0cm;"><span lang="EN-US"><span style="font-size: small; font-family: Calibri;">后记:其实结果和使用的训练测试数据的选择很有关系,本人采用的是前70%训练,后30%测试。后经改进选取方法,正确率可以达92%以上,有兴趣的可以改变一下训练语料和测试语料的提取方式。</span></span></p>
评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值