====== Differences in Search engine technology====== Adam Shell

Also edited by Sachin

Search Engine

Is a piece of software that allows you to locate a particular node in a hypermedia database.

There are four types:

Key Word Search:

Example: alta vista.

Uses it’s own database of websites. In background robots/crawlers continually search the web for non-indexed sites. When a new site is found, the search engine must determine key words. These key words are then added to a search engine index with relevant URL’s. When searching it searches the index to locate the URL. Key word allocation: looks for words in the title and first few paragraphs in the body of the text OR are specified in metatags (by designer).

Advantages



• Can be quick

Disadvantages

• May not find site you want (not indexed) • Key words may not relevant if defined from the text • May not be able to search properly.

Directory Search:

Example: yahoo.

Created manually, where all entries are categorised under headings. An appropriate category is determined when a new site is submitted.

Advantages • Search is narrowed once relevant category is identified • Human centred (people generated categories)

Disadvantages • Website may be in wrong category • Frustrating because user has to go through many layers

Meta Search Engine:

Example: metacrawler, mamma. Search engine, which uses other search engines.

Advantages • Larger success rate (multiple indexes)

Disadvantages • Can timeout before finished complete search with individual server

Natural Language Search engines:

Example: ask.com

Do not need to use syntax, follows natural language such as asking questions.

Advantages • Human centred • No syntax required

Disadvantages • It pulls out what it thinks are the keywords in the users search

Using a search engine

If you don’t search correctly you may get irrelevant sites.

Syntax and Description

Logical operators/Boolean

And/or/not/near

Quotes “phrase”

Finds the set of words in the quotes.

Case

Lower: finds upper/lower. Upper: only finds upper.

+

(plus symbol) Site contains all specified keywords

-

(minus symbol) Only finds sites without that word.

*

(wildcard) Any combination after wildcard. E.g. Cat* finds catastrophe, cathode.

%, ?

Search specified length with character. E.g. Sh%%% finds shell

t:

Finds keywords in the title

i: / image:

Finds all sites with named picture.

 
differences_in_search_engine_technology.txt · Last modified: 2006/09/30 18:16 by sachinsuch01
 
Recent changes RSS feed Creative Commons License Donate Powered by PHP Valid XHTML 1.0 Valid CSS Driven by DokuWiki