Finding Stuff on the Internet - Search Engines

by Dewey Williams

 

Search Operators with Examples
Boolean Operators Cheatsheet
Searching the Internet: Using Meta Words
Search Operator Comparison Chart

 

The Internet is HUGE...and getting bigger every minute. Even before the World Wide Web, people realized that some type of cataloging was necessary in order to find all the information. The WAIS database was created to document Internet information in various areas. WAIS was(is) a very powerful text searching tool. It has lost some of its usefulness, however, with the explosion of the World Wide Web.

 

Cataloging the World Wide Web is a daunting task. Take a look at the comparison chart. You can see that each of the search engines catalog MILLIONS of PAGES each day. Even with that, the search engine databases are only updated every few months.

 

The program(s) used to search the Web are called robots or spiders. The both work their magic in similar ways. These programs look at the database and check to see if anything has changed on the existing sites. They also look for new links and check to see if they have been cataloged. As new links are reported back to the search engine, these are added to the list to be cataloged. You can submit your URL to a search engine, or all of them, so the robot will visit your site.

 

Robots and spiders do not look at the whole site, in most cases. Pages can be protected from robots and spiders also, preventing them from indexing certain areas. Entire sites can also be protected.

 

Search engines store their information in a database. This database is available through a query, or question, that runs a search to find the information requested. The basic query could be a word or phrase, but this might find several thousands or millions of 'hits' (results sent back from the query). Each search engine uses a core logic for the queries. This is based on the Boolean logic operators AND, OR and NOT.

 

Many of the search databases have begun cataloging their information by subjects. These are often known as directories. In a directory, you have a list of subjects and you can search or browse the subject list for infromation. This is similar to the way library books are cataloged.

 

Other search engines are specific to certain areas or types of documents. Recently, a few search engines have started including Usenet newsgroups in the database, so documents and messages there can be accessed. Some search engines have begun cataloging online newspapers and magazines.

 

Finding software available on the Internet has never been easier. Several search engines have special sites, which catalogue the directories of the large shareware and freeware sites. Most of these FTP sites also have their own online software searches.

 

You can also find people and businesses on the Internet. These directories, sometimes referred to as white or yellow pages, will give you the name and sometimes address, phone, email and even a map to the person or business. Most of these are only accurate for people or businesses who have signed on to the service. Others have scoured information from other sources and added it to their database.

 

Good luck and happy searching.

Return to top

Search Operators with Examples

 

Be Specific with AND, OR, NOT

 

AND. Joining search terms with the AND operator tells Lycos Pro that you want to find documents containing each term. (For example, Larry AND Curly AND Moe) This is similar to selecting "All the

Words" in the drop-down menu. But, as you'll see below, AND can be used in conjunction with other operators for more powerful searching.

 

OR. Use OR when any of the search terms joined by the operator should appear in the results. This is similar to selecting Any of the Words in the drop-down menu. (For example, Chico OR Zeppo.) Like AND, OR can be used as a building block of more-complex Boolean expressions.

 

NOT. Use the NOT operator to exclude unwanted documents when your search uses a term commonly found in connection with an unrelated topic. For example, if you're not in the mood for laughs, you can improve a search for information on the author of The Communist Manifesto by entering Marx NOT Brothers.

 

Require or Exclude Words with "+" or "-"

 

Use the plus sign (+) in front of a search term if you require it to be in your search results documents. Use the minus sign (-) before terms to weed out webpages that contain those words. The two operators work much like AND and NOT. However, they can make more-complex expressions easier to read -- and certainly easier to type. For example, Larry OR Curly +Moe +Shemp indicates that results should always mention Moe and Shemp, but would not have to name both Larry and Curly. The expression Panama AND Hat -Canal will quickly get you past all the information on the Panamanian waterway and straight to documents mentioning the headgear.

 

Group Thoughts with Parentheses and Quotation Marks

 

As you explore the Web, your queries won't always be based on single words. Lycos Pro allows you to work with groups of words and phrases. And Lycos search operators can be used to evaluate such composite terms just as they can individual words. Here's what you can do:

 

Parenthesis ( ) (or these symbols: [ ] or < > or { }). Parentheses are used to group and combine search terms. For example, you can dig up information on one or two famed Hollywood duos with this query: (Fred AND Ginger) OR (Roy AND Trigger) .

 

Quotation Marks " ". Put quotation marks around a group of words and you've got a phrase. This is like selecting The Exact Phrase in the drop-down menu. However, using quoted phrases in conjunction with other Boolean operators offers additional flexibility. Try "Smashing Pumpkins" AND Simpsons to find references to the band's incarnation as cartoon characters alongside Homer and Bart.

 

Find Relationships with ADJ, NEAR, FAR and BEFORE

 

Frequently, the placement of words within a document reveals their relevancy to one another. Lycos Pro offers four operators to help define those relationships: ADJ (meaning adjacent), NEAR, FAR and BEFORE.

 

ADJ. Use the ADJ operator if you want to find documents in which the terms appear right next to each other, in any order. Sports fans might search for car ADJ race knowing that such a query will match car race and race car.

 

NEAR. When you use NEAR, the terms must appear within 25 words of each other in the results documents. So, searching for "Kevin Bacon" NEAR game will find webpages referencing a popular trivial pastime without having to get so exclusive as "Kevin Bacon game". (For example, that NEAR query would find a match in this sentence: "Here's a neat game based on the movies of Kevin Bacon and his co-stars.")

 

FAR. You can probably guess that FAR is the opposite of NEAR. When you use FAR, Lycos will find documents in which the search terms appear 25 words or more apart in at least one instance. Since it doesn't preclude the possibility that the terms may also appear closer together on the same page, FAR is most useful when used in conjunction with other expressions. For example, you can search for Apple FAR Pie NOT "Apple Pie" and find pages containing both words but not apple pie.

 

BEFORE. BEFORE works like the familiar AND, the only difference being that the terms must appear in the order you specify, but they can be any distance apart in the same document. If you're a little shaky on the spelling of Gettysburg, you can still track down the text of former U.S. President Abraham Lincoln's famous address in that town by searching this way: fourscore BEFORE liberty.

 

Bring Some Order to ADJ, NEAR and FAR

 

Of the operators that reveal the relationship between terms by evaluating their positions in a document, only BEFORE is concerned with the order in which they appear. But you can add BEFORE's ordering capabilities to ADJ, NEAR and FAR by prepending those operators with the letter "O" -- for ordered. So, race OADJ car will match only race car and not car race. (Note that this would not exclude documents containing the phrase car race if they also contained the phrase you seek.) Similarly, ONEAR and OFAR work like their NEAR/FAR cousins but result in matches only when search terms appear in the same order you type them.

 

Crunch Your Own NEAR/ADJ numbers

 

By default, the NEAR and FAR operators use a word count of 25 to ensure that search terms appear the proper distance from one another. You can modify this behavior by appending a slash ("/") to either operator, followed by some number you choose as the maximum or minimum word count between search terms in the results documents.

 

For example, to be more particular with a NEAR query, you might search like this: 1996 NEAR/5 "Oscar nominees".

 

While, by default, ADJ finds matches only when search terms are found side by side in Web documents, you can use the above technique to coax ADJ into specifying an exact word count between terms. For example: bacon ADJ/2 eggs would match bacon and eggs as well as eggs and bacon, or eggs with bacon.

 

Specifying word counts works for OADJ, ONEAR and OFAR just as it does for ADJ, NEAR and FAR.

Return to top

Boolean Operators Cheatsheet

Operator

Symbol

Description

 ""
(quotes)

 

Specifies that documents must contain the phrase within the quote marks. For example, "Grand Canyon" will find documents with this exact phrase.

 AND

 &

This operator indicates a search for documents containing all the terms joined by the operator. So, space AND Nasa or space +Nasa will find documents with both space and Nasa within them.

 OR

This operator indicates a search for documents containing any of the terms joined by the operator.

 NOT

!

This operator excludes documents containing the term that follows the operator. So, Bill NOT Hillary or Bill !Hillary will find documents with Bill that don't mention Hillary.

 NEAR

 N

This operator finds documents in which two words appear within 25 words of each other.

 NEAR/n

 

This operator finds documents in which two words appear within n (a number) words of each other.

 ONEAR

 

This operator works the same as NEAR, except that words must appear in the order specified.

 ONEAR/n

 

This operator also works the same as NEAR/n, except that words must appear in order specified within n (a number) words of each other.

 FAR

 

This operator finds documents in which two words appear at least 25 words apart.

 FAR/n

 

This operator finds documents in which two words appear at least n (a number) words apart.

 OFAR

 

This works the same as FAR, except that words must appear in the order specified.

 OFAR/n

 

This works also the same as FAR/n, except that words must appear in the order specified within n (a number) words of each other..

 ADJ

 

The ADJ operator finds documents in which two words appear next to each other

 ADJ/n

 

This operator will enable you to find documents in which two words appear exactly within n (a number) words of each other.

 OADJ

 

This works the same as ADJ, except that words must appear in the order specified within n (a number) words of each other.

 OADJ/n

 

This one also works the same as ADJ/n, except that words must appear in the order specified within n (a number) words of each other.

 BEFORE

 

This operator will find documents in which the first term appears before the other.

 WITHIN

 W

This operator will find documents that appear within a specified distance from a keyword.

 (), {}, [], <>
(parentheses)

 

This nesting operator can be used to group expressions when building sophisticated queries. For example, (red +car) AND (porsche | audi) is a sample nested query.

 +

(plus sign)

 

Placed at the beginning of a word, this operator indicates that the word is required. For example, Gumby +Pokey will return docs that mention both Gumby and Pokey.

 -
(minus sign)

 

Placed at the beginning of a word, this operator indicates that the word must be excluded. For example, Gumby -Pokey will return docs only with Gumby.

*, ?
(wildcard)

 

 Wildcards can be used to truncate words so that results contain variations of a word. So, garden* or garden? will return results containing such variations as garden, gardening, gardeners and so on.

Return to top

Searching the Internet: Using Meta Words

 

Meta Word

Abbreviation

Description

domain:(keyword)

Restricts a search to the domain selected. Domains can be specified up to three levels deep: .com, zdnet.com, special.zdnet.com.

depth:(number)

Restricts the depth of pages retrieved.

feature:(keyword)

Limits your query to pages containing the specified feature.

linkdomain:(keyword)

Restricts a search to pages containing links to the specified domain. So linkdomain:zdnet.com will find all the pages pointing to zdnet.com.

linkext:(extension)

Restricts a search to pages containing embedded files with the specified extension. For example, linkext:ra finds pages containing RealAudio files.

newsgroup:(alt.newsgroup name)

Restricts Usenet searches to articles that have been posted to the specified newsgroup.

scriptlanguage:(language)

Searches for pages containing JavaScript or VBScript. For example, scriptlanguage:javascript (note that the language should always be lowercase) will return pages containing JavaScript.

title:(keyword)

t:

Searches for pages containing the given word in their titles between the HTML tags.

url:(keyword)

u:

Restricts searches to document only URLs.

 

In addition to the meta words listed above, there are also three date meta words that allow you to conduct focused searches specifying a range of dates. These are:

after:(day)/(month)/(year)

Restricts a search to documents created or modified after the specified date. For example: Titanic AND after: 28/1/98 will find docs with Titanic in the body which were posted or modified after September 28, 1998.

before:(day)/(month)/(year)

Restricts a search to documents created or modified before the specified date. For example, Titanic AND before 30/6/97 will find you documents posted or modified before June 30, 1997.

within:(number)/(unit)

Restricts a search to documents created or modified within the last specified time period. For example, Titanic +movies within:3/months will return docs within the last 3 months. Units can be days, months, or years.

Currently, these date meta words will only function correctly if they are used as a single term within a Boolean clause, without any pluses or minuses. So (+space -Nasa) AND within:8/months will work, however, +space -Nasa +within:3/months will not.

Return to top

Search Operator Comparison Chart

The following is a table of six popular search engines and the operators that each one supports. This list was put together using information contained on the search engines' sites as well as from hands-on testing of the operators and search engines. The search engine names are also linked back to advanced search areas of each of their respective sites (so you can read up on some of the advanced searching features that may not have been covered in this tutorial).

Operator

Alta Vista 

Excite

HotBot

InfoSeek

Lycos

Yahoo!

AND

Y

Y 

Y 

Y 

Y 

Y

OR

Y

Y

Y

Y 

Y 

Y

NOT

Y

Y

Y

Y 

Y 

Y

NEAR

Y

Y

Y

Y

Y

Y

Nesting

Y

Y

Y

Y

Y

Y

""

Y

Y

Y

Y

Y

Y

+/-

Y

Y

Y

Y

Y 

Y

*

Y 

Y

Y

Y 

Y 

Y

text:

Y 

Y 

 

(uses title

Y 

Y 

url:

Y 

Y 

 

Y 

Y 

Y 

image:

Y 

Y 

Y 

 

Y 

Y 

host:

Y 

Y 

Y 

 

Y 

 

link:

Y 

Y 

Y 

 

Y 

Y

applet:

Y

Y

 

 

Y

 

anchor:

Y

Y

 

 

Y

Y

domain:

Y

Y 

 

(uses site:)

Y

 

Natural language

Y

Y

Y

Y

Y 

Y 

specify language

Y 

 

 

 

 

 


This table was modified from information compiled by ZDNet. The complete ZDNET Search Tutorial may be found at http://www.zdnet.com/products/garage/search/search

Posted 7/6/1998

Return to top