[POJ 2001] Shortest Prefixes

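This article presents an algorithm that finds, for each word, the shortest prefix that uniquely identifies it. By building a trie, the algorithm handles large word lists efficiently and guarantees that each prefix is unambiguous. The sample input and output illustrate its behavior.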


Shortest Prefixes
Time Limit: 1000MS
Memory Limit: 30000K
Total Submissions: 10210
Accepted: 4309

Description

A prefix of a string is a substring starting at the beginning of the given string. The prefixes of "carbon" are: "c", "ca", "car", "carb", "carbo", and "carbon". Note that the empty string is not considered a prefix in this problem, but every non-empty string is considered to be a prefix of itself. In everyday language, we tend to abbreviate words by prefixes. For example, "carbohydrate" is commonly abbreviated by "carb". In this problem, given a set of words, you will find for each word the shortest prefix that uniquely identifies the word it represents.

In the sample input below, "carbohydrate" can be abbreviated to "carboh", but it cannot be abbreviated to "carbo" (or anything shorter) because there are other words in the list that begin with "carbo".

An exact match will override a prefix match. For example, the prefix "car" matches the given word "car" exactly. Therefore, it is understood without ambiguity that "car" is an abbreviation for "car", not for "carriage" or any of the other words in the list that begin with "car".

Input

The input contains at least two, but no more than 1000 lines. Each line contains one word consisting of 1 to 20 lower case letters.

Output

The output contains the same number of lines as the input. Each line of the output contains the word from the corresponding line of the input, followed by one blank space, and the shortest prefix that uniquely (without ambiguity) identifies this word.

Sample Input

carbohydrate
cart
carburetor
caramel
caribou
carbonic
cartilage
carbon
carriage
carton
car
carbonate

Sample Output

carbohydrate carboh
cart cart
carburetor carbu
caramel cara
caribou cari
carbonic carboni
cartilage carti
carbon carbon
carriage carr
carton carto
car car
carbonate carbona
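
The rule from the Description (take the shortest prefix that no other word starts with, and fall back to the whole word when every proper prefix is shared) can be checked by brute force. The following is a minimal sketch for cross-checking small inputs, not the solution used in this post; the helper names `isPrefixOf` and `shortestUniquePrefix` are hypothetical and chosen only for illustration.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical helper: true if `p` is a prefix of `w`.
static bool isPrefixOf(const std::string &p, const std::string &w) {
    return w.compare(0, p.size(), p) == 0;
}

// Hypothetical helper: shortest prefix of words[idx] that no other word
// starts with. If every proper prefix is shared, the whole word is returned,
// which is the "exact match overrides a prefix match" rule.
std::string shortestUniquePrefix(const std::vector<std::string> &words,
                                 std::size_t idx) {
    assert(idx < words.size());
    const std::string &w = words[idx];
    for (std::size_t len = 1; len < w.size(); ++len) {
        const std::string p = w.substr(0, len);
        bool shared = false;
        for (std::size_t j = 0; j < words.size() && !shared; ++j) {
            if (j != idx && isPrefixOf(p, words[j])) shared = true;
        }
        if (!shared) return p;
    }
    return w;
}
```

This compares every candidate prefix against every other word, so it is only meant as a reference for the definition. The trie in the solution below gets the same answers without the pairwise comparisons.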

Source

#include <stdio.h>
#include <stdlib.h>
#include <string.h>
#include <vector>
#include <iostream>
using namespace std;

#define MAX_WORD_LEN 21
#define MAX_LINE 1001



class Node{
public:
char ch;
int count;
vector<Node*> children;

public:
Node(char _ch = '#', int _count = 0) : ch(_ch), count(_count){}
~Node(){}
};



class Trie{
public:
Node *root;

public:
Trie();
~Trie();
void insert(Node *current, char *word, int len);
void search(Node *current, char *word, int len, char *prefix, int preLen);
void clear(Node *current);

private:
Node *findChild(Node *current, char ch);
};


Trie::Trie(){
root = new Node();
}


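Trie::~Trie(){
// release every node below the root, then the root itself
clear(root);
delete root;
root = NULL;
}


void Trie::clear(Node *current){
if(NULL == current)return;

// free each child's subtree, then the child node itself
for(size_t i = 0; i < current->children.size(); ++i){
clear(current->children[i]);
delete current->children[i];
}

// drop the now-dangling child pointers
current->children.clear();
}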


void Trie::insert(Node *current, char *word, int len){
if(NULL == current || len == 0)return;

// find child
Node *child = findChild(current, word[0]);

// not found
if(NULL == child){
Node *tmp = new Node(word[0], 1);
current->children.push_back(tmp);
insert(tmp, word+1, len-1);
}
// found
else{
child->count++;
insert(child, word+1, len-1);
}
}


void Trie::search(Node *current, char *word, int len, char *prefix, int preLen){
if(NULL == current || len == 0)return;

// find child
Node *child = findChild(current, word[0]);

// not found
if(NULL == child){
cerr << "ERROR: not found \'" << word[0] << "\' when search prefix" << endl;
exit(1);
}
// found
else{
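prefix[preLen++] = word[0];
// stop when only one word passes through this node (the prefix is unique),
// or when the word is exhausted (an exact match overrides a prefix match)
if(child->count == 1 || len == 1){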
prefix[preLen] = '\0';
return;
}
else{
search(child, word+1, len-1, prefix, preLen);
}
}
}


Node *Trie::findChild(Node *current, char ch){
if(NULL == current) return NULL;

// find
for(int i = 0; i < current->children.size(); ++i){
if(current->children[i]->ch == ch){
return current->children[i];
}
}
return NULL;
}



int main(int argc, char **argv){

// variable definition
int len = 0, i = 0;
Trie trie;
char word[MAX_LINE][MAX_WORD_LEN];
char prefix[MAX_WORD_LEN];

// read words
while(cin >> word[i]){
len = strlen(word[i]);

// insert
trie.insert(trie.root, word[i], len);
i++;
}

// run
for(int j = 0; j < i; ++j){
len = strlen(word[j]);

// generate prefix
trie.search(trie.root, word[j], len, prefix, 0);

// print out result
cout << word[j] << " " << prefix << endl;
}

return 0;
}
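
As a variation on the listing above, the same counting idea can be sketched with fixed 26-slot child arrays instead of a `vector<Node*>` that is scanned linearly. This is an illustrative alternative, not the author's code: the `FlatTrie` and `Cell` names are hypothetical, and it assumes the input consists only of lower case letters, as the problem statement guarantees.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical flat-array trie: cell 0 is the root, next[c] == 0 means "no child".
struct FlatTrie {
    struct Cell {
        int next[26];
        int count;  // number of inserted words that pass through this cell
        Cell() : count(0) { for (int c = 0; c < 26; ++c) next[c] = 0; }
    };
    std::vector<Cell> cells;

    FlatTrie() : cells(1) {}

    // Walk the word from the root, creating cells as needed and bumping counts.
    void insert(const std::string &word) {
        int cur = 0;
        for (std::size_t i = 0; i < word.size(); ++i) {
            int c = word[i] - 'a';
            assert(c >= 0 && c < 26);  // assumption: lower case letters only
            if (cells[cur].next[c] == 0) {
                cells[cur].next[c] = static_cast<int>(cells.size());
                cells.push_back(Cell());
            }
            cur = cells[cur].next[c];
            ++cells[cur].count;
        }
    }

    // Return the prefix ending at the first cell used by exactly one word,
    // or the whole word if no such cell exists (exact match case).
    std::string shortestPrefix(const std::string &word) const {
        int cur = 0;
        for (std::size_t i = 0; i < word.size(); ++i) {
            cur = cells[cur].next[word[i] - 'a'];
            assert(cur != 0);  // the word must have been inserted first
            if (cells[cur].count == 1) return word.substr(0, i + 1);
        }
        return word;
    }
};
```

Storing cells in one `std::vector` means there is no manual `delete` to get right, and each child lookup is a single array index. The cost is 26 integers per cell, which stays small for at most 1000 words of up to 20 letters.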


Note: All rights reserved. Please credit the source when reposting. Coder007's blog


