Results 1 -
9 of
9
Analysis of the query logs of a web site search engine
- Journal of the American Society for Information Science and Technology
, 2005
"... A large number of studies have investigated the transaction log of general-purpose search engines such as Excite and AltaVista, but few studies have reported on the analysis of search logs for search engines that are limited to particular Web sites, namely, Web site search engines. In this article, ..."
Abstract
-
Cited by 14 (1 self)
- Add to MetaCart
A large number of studies have investigated the transaction log of general-purpose search engines such as Excite and AltaVista, but few studies have reported on the analysis of search logs for search engines that are limited to particular Web sites, namely, Web site search engines. In this article, we report our research on analyzing the search logs of the search engine of the Utah state government Web site. Our results show that some statistics, such as the number of search terms per query, of Web users are the same for general-purpose search engines and Web site search engines, but others, such as the search topics and the terms used, are considerably different. Possible reasons for the differences include the focused domain of Web site search engines and users ’ different information needs. The findings are useful for Web site developers to improve the performance of their services provided on the Web and for researchers to conduct further research in this area. The analysis also can be applied in e-government research by investigating how information should be delivered to users in government Web sites.
SpidersRUs: Automated development of vertical search engines in different domains and languages
- In Proceedings of the ACM/IEEE-CS Joint Conference on Digital Libraries
, 2005
"... In this paper we discuss the architecture of a tool designed to help users develop vertical search engines in different domains and different languages. The design of the tool is presented and an evaluation study was conducted, showing that the system is easier to use than other existing tools. Cate ..."
Abstract
-
Cited by 4 (3 self)
- Add to MetaCart
In this paper we discuss the architecture of a tool designed to help users develop vertical search engines in different domains and different languages. The design of the tool is presented and an evaluation study was conducted, showing that the system is easier to use than other existing tools. Categories and Subject Descriptors
Automated identification of web communities for business intelligence analysis
- Proceedings of the Fourth Workshop on E-Business (WEB), Las Vegas
, 2005
"... Analysts often search the Web for business intelligence using traditional search engines which provide keyword-based search. Recently, it has been suggested that the incoming links, or backlinks, of a company’s Web site can provide useful information about the company’s “Web communities”. Backlinks ..."
Abstract
-
Cited by 4 (2 self)
- Add to MetaCart
Analysts often search the Web for business intelligence using traditional search engines which provide keyword-based search. Recently, it has been suggested that the incoming links, or backlinks, of a company’s Web site can provide useful information about the company’s “Web communities”. Backlinks refer to other Web pages which have a hyperlink pointing to the company of interest and these pages form a cyber community on the Web. Analysis of these communities can provide useful signals for a company or information about its stakeholder groups, but the manual analysis process can be very time-consuming for business analysts and consultants. In this study, we report the design and evaluation of a tool called Redips that integrates automatic backlink meta-searching and text mining techniques to facilitate users in identifying such cyber communities on the Web for business intelligence purposes. The system architecture of the tool is presented and an experimental study was reported. The experiment results showed that Redips performed significantly better than two benchmark methods, namely backlink search engines and manual browsing.
Identifying keywords to improve a web site text content
- 6 th International Conference on Information Integration and Web-based Applications & Services
, 2004
"... The steadily increasing competition of internet web sites makes it both more difficult and more important to attract and retain users. However, it is not always possible to deter-mine beforehand which content is most appropriate to reach this goal, since the behavior and requirements of users can be ..."
Abstract
-
Cited by 1 (1 self)
- Add to MetaCart
The steadily increasing competition of internet web sites makes it both more difficult and more important to attract and retain users. However, it is not always possible to deter-mine beforehand which content is most appropriate to reach this goal, since the behavior and requirements of users can be heterogeneous and changing over time. In order to improve a web site text content, it is necessary to better use the words that have proven to attract the user interest. In this paper we introduce a method to identify such keywords in the text content of an operating web site. The effectiveness of the method was tested in a real web site, showing its benefits. 1
An Intelligent Model and Its Implementation of Search Engine *1,*2 *2,*3
"... Intelligence of humankind mostly includes five parts: the observing ability, the memory ability, the practice ability, the thought ability, the imagining ability, etc.. In this paper, an intelligent behaviour, which people search the book in library, is discussed. By contrasting in references of boo ..."
Abstract
-
Cited by 1 (0 self)
- Add to MetaCart
Intelligence of humankind mostly includes five parts: the observing ability, the memory ability, the practice ability, the thought ability, the imagining ability, etc.. In this paper, an intelligent behaviour, which people search the book in library, is discussed. By contrasting in references of book and linkage of Web pages, we proposed that the process of searching information on Internet is similar as book search. And so, we proposed that Search Engines take on the five intelligence behaviours corresponding five parts intelligence of humankind: the apperceiving behaviour, the memory behaviour, the learning behaviour, the thought behaviour, the comprehension behaviour. We divided the process of information searching of search engine into four stages: classifying Web page, confirming a scope of information searching, crawling Web pages in internet, and filtrating the result Web pages. Finally, we proposed an intelligent model (Outer net�internet � inner net, Three Net Model) of implementing of Search Engine.
, Zan Huang
"... There has been a tremendous growth in the amount of information and resources on the World Wide Web that are useful to researchers and practitioners in science domains. While the Web has made the communication and sharing of research ideas and results among scientists easier and faster than ever, it ..."
Abstract
- Add to MetaCart
There has been a tremendous growth in the amount of information and resources on the World Wide Web that are useful to researchers and practitioners in science domains. While the Web has made the communication and sharing of research ideas and results among scientists easier and faster than ever, its dynamic and unstructured nature also makes the scientists faced with such problems as information overload, vocabulary difference, and lack of analysis tools. To address these problems, it is highly desirable to have an integrated, “one-stop shopping ” Web portal to support effective information searching and analysis as well as to enhance communication and collaboration among researchers in various scientific fields. In this paper, we review existing information retrieval techniques and related literature, and propose a framework for developing integrated Web portals that support information searching and analysis for scientific knowledge. Our framework incorporates collection building, meta-searching, keyword suggestion, and various content analysis techniques such as document summarization, document clustering, and topic map visualization. Patent analysis techniques such as citation analysis and content map analysis are also incorporated. To demonstrate the feasibility of our approach, we developed based on our architecture a knowledge portal, called NanoPort, in the field of nanoscale science and engineering. We report our experience and explore the various issues of relevance to developing a Web portal for scientific domains. The system was compared to other search systems in the field and several design issues were identified. An evaluation study was conducted and the results showed that subjects were more satisfied with the NanoPort system
Evaluating the Use of Search Engine Development Tools in IT Education
, 2008
"... It is important for education in computer science and information systems to keep up to date with the latest development in technology. With the rapid development of the Internet and the Web, many schools have included Internet-related technologies, such asWeb search engines and e-commerce, as part ..."
Abstract
- Add to MetaCart
It is important for education in computer science and information systems to keep up to date with the latest development in technology. With the rapid development of the Internet and the Web, many schools have included Internet-related technologies, such asWeb search engines and e-commerce, as part of their curricula. Previous research has shown that it is effective to use search engine development tools to facilitate students’ learning. However, the effectiveness of these tools in the classroom has not been evaluated. In this article, we review the design of three search engine development tools, SpidersRUs, Greenstone, and Alkaline, followed by an evaluation study that compared the three tools in the classroom. In the study, 33 students were divided into 13 groups and each group used the three tools to develop three independent search engines in a class project. Our evaluation results showed that SpidersRUs performed better than the two other tools in overall satisfaction and the level of knowledge gained in their learning experience when using the tools for a class project on Internet applications development.

