The Journal of Information Science and Technology Association
(Johono Kagaku to Gijutsu)
Vol. 54 (2004), No. 2
Special feature : Internet Search Engines


Technologies and mechanisms of web search engines
Toshikazu FUKUSHIMA
(Internet Systems Research Laboratories, NEC Corporation (8916-47, Takayama-cho, Ikoma-shi, Nara 630-0101))

 Abstract : This paper surveys the evolution of web search engine technologies. The web is a hypermedia system whose content is very large in scale, heterogeneous, and changing every day. Web search engines have been developed as a means of using the web, with these characteristics, as a powerful information resource. The first-generation technologies were database-based but depended heavily on manual operations. The second-generation technologies expanded the number of searchable web pages by using crawlers to collect pages automatically and by using parallel full-text search methods. The third-generation technologies improved precision through web link analysis. As the next generation, purpose-specialized search and situation-sensitive search technologies are being developed.

 Keywords : Search engine / Web / Crawler / Link analysis / Situation sensitive

Table of Contents


Search engine relation chart
Motoharu SUMI
(Freelance (Corp Yamazaki B-202, 330-31, Sakuragi-cho, Wakaba-ku, Chiba-shi, Chiba 264-0022))

 Abstract : This paper explains the business relationships within the Internet search engine industry.
Four kinds of players exist in the Internet search engine market: search providers, directory providers, advertising providers, and portal sites.
This paper covers topics such as business tie-ups and acquisitions among these four kinds of players, and the development of search technology.

 Keywords : search engine / directory / Yahoo / Google / Overture



Search Engine Algorithms
Susumu KANEMUNE
(Ricoh Co., Ltd. Document Solution Division (Shin-Yokohama 3-2-3, Kohoku-ku, Yokohama-shi, Kanagawa 222-8530))

 Abstract : Search engines are one of the essential technologies for searching the Internet. This article provides an overview of modern search engine algorithms. We first explain the requirements for search engines, then describe three key algorithms: crawling algorithms, which collect Web pages; text matching algorithms, which retrieve text data using indexes; and scoring algorithms, which rank the Web pages to be displayed. Scoring is the most important feature of modern search engines.

 Keywords : algorithms / crawling / indexing / information retrieval / search engine / ranking / scoring



The Architecture of Search Engines
Hayato YAMANA
(Department of Computer Science, School of Science and Engineering, Waseda University (3-4-1 Okubo, Shinjuku-ku, Tokyo 169-8555))

 Abstract : Search engines are indispensable for using the Internet today. However, their architecture is not widely known. This paper describes the architecture of search engines, using Google as an example and focusing on Web crawlers, indexing, and the searching scheme. It also takes up the problem of handling many queries and the cost of running search engines.

 Keywords : Search Engine / Information Retrieval / Google / Crawler / Indexing



How Google works, and how should we use it?
Yuji SEKI
(SHIKENCHO. COM (807-4 Yamaki, Nirayama-cho, Tagata-gun, Shizuoka 410-2141))

 Abstract : How Google treats search keywords is very interesting. Although Google has not released the details of its algorithm, it appears to do various things for the user's convenience. After explaining how Google treats a keyword, I show how to craft useful search strings to get exactly what you want, and then cover some advanced search functions.

 Keywords : search engine / Google / keyword / search strings / search functions
