what Larbin can do
Larbin is an open source web crawler programmed by C++. It is intended to fetch a large number of web pages to fill the database of a serch engine. With a network fast enough, Larbin should be able to fetch more than 100 millions pages on a standard PC.
what Larbin can not do
Larbin is just a web crawler, NOT an indexer. He gives us the raw meterial to cook our dinner, NOT the dinner itself! We have to write some code to enhance its functions, endowing him with the ability to interprete the information, generate indexs for our database, and some other customize applications!
OK, Prologue ends. If the introduction above does not satisfy your craving fully, may be this website can provide you some further introductive information. We will talk about something deepen next.
What this article will NOT cover
(but provide link which may answer your questions)
Maybe you have some problems to make Larbin work even you read the website above
OK, let’s walk into the Larbin World!