Intelligent spider for Internet searching

Hsinchun Chen, Yi Ming Chung, Marshall Ramsey, Christopher C. Yang, Pai Chun Ma, Jerome Yen

Research output: Chapter in Book/Report/Conference proceedingConference contribution

12 Citations (Scopus)

Abstract

As the World-Wide Web (WWW) based Internet services become more popular, information overload also becomes a pressing research problem. Difficulties with searching on Internet get worse as the amount of information that available on the Internet increases. A scalable approach to support Internet search is critical to the success of Internet services and other current or future National Information Infrastructure (NII) applications. A new approach to build intelligent personal spider (agent), which is based on automatic textual analysis of Internet documents, is proposed in this paper. Best first search and genetic algorithm have been tested to develop the intelligent spider. These personal spiders are able to dynamically and intelligently analyze the contents of the users selected homepages as the starting point to search for the most relevant homepages based on the links and indexing. An intelligent spider must have the capability to make adjustments according to progress of searching in order to be an intelligent agent. However, the current searching engines do not have the communication between the users and the robots. The spider presented in this paper use Java to develop the user interface such that the users can adjust the control parameters according to the progress and observe the intermediate results. The performances of the genetic algorithm based and best first search based spiders are also reported.

Original languageEnglish (US)
Title of host publicationProceedings of the Hawaii International Conference on System Sciences
Editors Anon
PublisherIEEE Comp Soc
Pages178-188
Number of pages11
Volume4
StatePublished - 1997
EventProceedings of the 1997 30th Annual Hawaii International Conference on System Sciences. Part 1 (of 6) - Wailea, HI, USA
Duration: Jan 7 1997Jan 10 1997

Other

OtherProceedings of the 1997 30th Annual Hawaii International Conference on System Sciences. Part 1 (of 6)
CityWailea, HI, USA
Period1/7/971/10/97

Fingerprint

Internet
Genetic algorithms
Intelligent agents
World Wide Web
User interfaces
Robots
Engines
Communication

ASJC Scopus subject areas

  • Software
  • Industrial and Manufacturing Engineering

Cite this

Chen, H., Chung, Y. M., Ramsey, M., Yang, C. C., Ma, P. C., & Yen, J. (1997). Intelligent spider for Internet searching. In Anon (Ed.), Proceedings of the Hawaii International Conference on System Sciences (Vol. 4, pp. 178-188). IEEE Comp Soc.

Intelligent spider for Internet searching. / Chen, Hsinchun; Chung, Yi Ming; Ramsey, Marshall; Yang, Christopher C.; Ma, Pai Chun; Yen, Jerome.

Proceedings of the Hawaii International Conference on System Sciences. ed. / Anon. Vol. 4 IEEE Comp Soc, 1997. p. 178-188.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Chen, H, Chung, YM, Ramsey, M, Yang, CC, Ma, PC & Yen, J 1997, Intelligent spider for Internet searching. in Anon (ed.), Proceedings of the Hawaii International Conference on System Sciences. vol. 4, IEEE Comp Soc, pp. 178-188, Proceedings of the 1997 30th Annual Hawaii International Conference on System Sciences. Part 1 (of 6), Wailea, HI, USA, 1/7/97.
Chen H, Chung YM, Ramsey M, Yang CC, Ma PC, Yen J. Intelligent spider for Internet searching. In Anon, editor, Proceedings of the Hawaii International Conference on System Sciences. Vol. 4. IEEE Comp Soc. 1997. p. 178-188
Chen, Hsinchun ; Chung, Yi Ming ; Ramsey, Marshall ; Yang, Christopher C. ; Ma, Pai Chun ; Yen, Jerome. / Intelligent spider for Internet searching. Proceedings of the Hawaii International Conference on System Sciences. editor / Anon. Vol. 4 IEEE Comp Soc, 1997. pp. 178-188
@inproceedings{92c46de14c2a4a7497f72de2fb23e8b0,
title = "Intelligent spider for Internet searching",
abstract = "As the World-Wide Web (WWW) based Internet services become more popular, information overload also becomes a pressing research problem. Difficulties with searching on Internet get worse as the amount of information that available on the Internet increases. A scalable approach to support Internet search is critical to the success of Internet services and other current or future National Information Infrastructure (NII) applications. A new approach to build intelligent personal spider (agent), which is based on automatic textual analysis of Internet documents, is proposed in this paper. Best first search and genetic algorithm have been tested to develop the intelligent spider. These personal spiders are able to dynamically and intelligently analyze the contents of the users selected homepages as the starting point to search for the most relevant homepages based on the links and indexing. An intelligent spider must have the capability to make adjustments according to progress of searching in order to be an intelligent agent. However, the current searching engines do not have the communication between the users and the robots. The spider presented in this paper use Java to develop the user interface such that the users can adjust the control parameters according to the progress and observe the intermediate results. The performances of the genetic algorithm based and best first search based spiders are also reported.",
author = "Hsinchun Chen and Chung, {Yi Ming} and Marshall Ramsey and Yang, {Christopher C.} and Ma, {Pai Chun} and Jerome Yen",
year = "1997",
language = "English (US)",
volume = "4",
pages = "178--188",
editor = "Anon",
booktitle = "Proceedings of the Hawaii International Conference on System Sciences",
publisher = "IEEE Comp Soc",

}

TY - GEN

T1 - Intelligent spider for Internet searching

AU - Chen, Hsinchun

AU - Chung, Yi Ming

AU - Ramsey, Marshall

AU - Yang, Christopher C.

AU - Ma, Pai Chun

AU - Yen, Jerome

PY - 1997

Y1 - 1997

N2 - As the World-Wide Web (WWW) based Internet services become more popular, information overload also becomes a pressing research problem. Difficulties with searching on Internet get worse as the amount of information that available on the Internet increases. A scalable approach to support Internet search is critical to the success of Internet services and other current or future National Information Infrastructure (NII) applications. A new approach to build intelligent personal spider (agent), which is based on automatic textual analysis of Internet documents, is proposed in this paper. Best first search and genetic algorithm have been tested to develop the intelligent spider. These personal spiders are able to dynamically and intelligently analyze the contents of the users selected homepages as the starting point to search for the most relevant homepages based on the links and indexing. An intelligent spider must have the capability to make adjustments according to progress of searching in order to be an intelligent agent. However, the current searching engines do not have the communication between the users and the robots. The spider presented in this paper use Java to develop the user interface such that the users can adjust the control parameters according to the progress and observe the intermediate results. The performances of the genetic algorithm based and best first search based spiders are also reported.

AB - As the World-Wide Web (WWW) based Internet services become more popular, information overload also becomes a pressing research problem. Difficulties with searching on Internet get worse as the amount of information that available on the Internet increases. A scalable approach to support Internet search is critical to the success of Internet services and other current or future National Information Infrastructure (NII) applications. A new approach to build intelligent personal spider (agent), which is based on automatic textual analysis of Internet documents, is proposed in this paper. Best first search and genetic algorithm have been tested to develop the intelligent spider. These personal spiders are able to dynamically and intelligently analyze the contents of the users selected homepages as the starting point to search for the most relevant homepages based on the links and indexing. An intelligent spider must have the capability to make adjustments according to progress of searching in order to be an intelligent agent. However, the current searching engines do not have the communication between the users and the robots. The spider presented in this paper use Java to develop the user interface such that the users can adjust the control parameters according to the progress and observe the intermediate results. The performances of the genetic algorithm based and best first search based spiders are also reported.

UR - http://www.scopus.com/inward/record.url?scp=0031377266&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0031377266&partnerID=8YFLogxK

M3 - Conference contribution

AN - SCOPUS:0031377266

VL - 4

SP - 178

EP - 188

BT - Proceedings of the Hawaii International Conference on System Sciences

A2 - Anon, null

PB - IEEE Comp Soc

ER -