<FZU1033>URLs

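This article describes the components of a URL and how to parse them: identifying and extracting the protocol, host, port, and path, with examples showing how to obtain each part from a URL.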


URLs
Time Limit:1000MS     Memory Limit:32768KB     64bit IO Format:%I64d & %I64u

Description

In the early nineties, the World Wide Web (WWW) was invented. Nowadays, most people think that the WWW simply consists of all the pretty (or not so pretty) HTML-pages that you can read with your WWW browser. But back then, one of the main intentions behind the design of the WWW was to unify several existing communication protocols.

Then (and even now), information on the Internet was available via a multitude of channels: FTP, HTTP, E-Mail, News, Gopher, and many more. Thanks to the WWW, all these services can now be uniformly addressed via URLs (Uniform Resource Locators). The syntax of URLs is defined in the Internet standard RFC 1738. For our problem, we consider a simplified version of the syntax, which is as follows:

<protocol> "://" <host> [ ":" <port> ] [ "/" <path> ]

The square brackets [] mean that the enclosed string is optional and may or may not appear. Examples of URLs are the following:

http://www.informatik.uni-ulm.de/acm
ftp://acm.baylor.edu:1234/pub/staff/mr-p
gopher://veryold.edu

More specifically,

<protocol> is always one of http, ftp or gopher.

<host> is a string consisting of alphabetic (a-z, A-Z) or numeric (0-9) characters and points (.).

<port> is a positive integer, smaller than 65536.

<path> is a string that contains no spaces.
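The grammar above can be sketched as a small splitting routine. This is only an illustration under the problem's guarantee that every input is well-formed; the names `UrlParts` and `splitUrl` are hypothetical and do not come from the problem statement, and an empty string is used here to mean "component absent".

```cpp
#include <string>

// Hypothetical container for the four components; an empty port or path
// means the component did not appear in the URL.
struct UrlParts {
    std::string protocol;
    std::string host;
    std::string port;
    std::string path;
};

// Splits a URL of the simplified form <protocol>://<host>[:<port>][/<path>].
// Assumes the input is well-formed (it contains "://"), as the problem guarantees.
UrlParts splitUrl(const std::string& url) {
    UrlParts parts;
    const std::string sep = "://";
    std::size_t schemeEnd = url.find(sep);
    parts.protocol = url.substr(0, schemeEnd);

    std::size_t hostStart = schemeEnd + sep.size();
    // The host ends at the first ':' or '/', or at the end of the string.
    std::size_t hostEnd = url.find_first_of(":/", hostStart);
    parts.host = url.substr(hostStart, hostEnd - hostStart);
    if (hostEnd == std::string::npos) return parts;  // no port, no path

    std::size_t pathSlash = hostEnd;
    if (url[hostEnd] == ':') {
        // The port runs up to the next '/', or to the end of the string.
        pathSlash = url.find('/', hostEnd + 1);
        parts.port = url.substr(hostEnd + 1, pathSlash - hostEnd - 1);
        if (pathSlash == std::string::npos) return parts;  // port but no path
    }
    // Everything after the first '/' following the host (or port) is the path.
    parts.path = url.substr(pathSlash + 1);
    return parts;
}
```

Searching for ':' and '/' only after the "://" separator keeps the colon of the separator from being mistaken for the start of a port.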

You are to write a program that parses a URL into its components.


Input

The input starts with a line containing a single integer n, the number of URLs in the input. The following n lines contain one URL each, in the format described above. The URLs will consist of at most 60 characters each.


Output

For each URL in the input first print the number of the URL, as shown in the sample output. Then print four lines, stating the protocol, host, port and path specified by the URL. If the port and/or path are not given in the URL, print the string <default> instead. Adhere to the format shown in the sample output.

Print a blank line after each test case.
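The output rules can be sketched as a formatting helper that builds the report for one URL. The names `orDefault` and `formatReport` are hypothetical. The sketch assumes an absent port or path is passed in as an empty string, so a URL ending in a bare "/" would also be reported as `<default>`; the problem statement does not say how that case should be handled.

```cpp
#include <string>

// Hypothetical helper: substitutes "<default>" for an absent (empty) component.
std::string orDefault(const std::string& value) {
    return value.empty() ? "<default>" : value;
}

// Builds the report for one URL: a header line, four component lines,
// and the blank line that must follow each test case.
std::string formatReport(int index,
                         const std::string& protocol,
                         const std::string& host,
                         const std::string& port,
                         const std::string& path) {
    std::string out;
    out += "URL #" + std::to_string(index) + "\n";
    out += "Protocol = " + protocol + "\n";
    out += "Host     = " + host + "\n";  // labels are padded so the '=' signs align
    out += "Port     = " + orDefault(port) + "\n";
    out += "Path     = " + orDefault(path) + "\n";
    out += "\n";
    return out;
}
```

Building the whole report as a string first makes the exact spacing easy to compare against the sample output before printing it.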


Sample Input

3
ftp://acm.baylor.edu:1234/pub/staff/mr-p
http://www.informatik.uni-ulm.de/acm
gopher://veryold.edu

Sample Output

URL #1
Protocol = ftp
Host     = acm.baylor.edu
Port     = 1234
Path     = pub/staff/mr-p

URL #2
Protocol = http
Host     = www.informatik.uni-ulm.de
Port     = <default>
Path     = acm

URL #3
Protocol = gopher
Host     = veryold.edu
Port     = <default>
Path     = <default>

//Memory: 228 KB		Time: 0 MS
//Language: GNU C++		Result: Accepted

#include <iostream>
#include <cstdio>
#include <cstring>
using namespace std;

int main()
{
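    char s[61];  // a URL has at most 60 characters, plus the terminating '\0'
    int n, i;    // n: number of URLs; i: current position in s
    scanf("%d", &n);
    for(int ca=1; n--; ca++)
    {
        i = 0;
        scanf("%60s", s);  // the width limit keeps the read inside the buffer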
        printf("URL #%d\n", ca);
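        // The protocol is one of http, ftp or gopher, so its first letter
        // identifies it; i then skips past the protocol name and "://".
        if(s[i] == 'h')
        {
            puts("Protocol = http");
            i+=7;  // length of "http://"
        }
        else if(s[i] == 'f')
        {
            puts("Protocol = ftp");
            i+=6;  // length of "ftp://"
        }
        else
        {
            puts("Protocol = gopher");
            i+=9;  // length of "gopher://"
        }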
        printf("Host     = ");
        // The host runs until ':' (port follows), '/' (path follows) or the end.
        for(; s[i] != '\0' && s[i] != ':' && s[i] != '/'; i++)
        {
            putchar(s[i]);
        }
        puts("");
        if(s[i] != ':')
            puts("Port     = <default>");
        else
        {
            printf("Port     = ");
            for(i++; s[i] != '\0' && s[i] != '/'; i++)
            {
                putchar(s[i]);
            }
            puts("");
        }
        if(s[i] != '/')
            puts("Path     = <default>");
        else
        {
            printf("Path     = ");
            for(i++; s[i] != '\0'; i++)
            {
                putchar(s[i]);
            }
            puts("");
        }
        puts("");
    }
    return 0;
}

