Logo MTA

Home
Diensten overzicht
Internet mogelijkheden
Technisch tekenwerk
Document opmaak
Backup op CD-R
Hardware
Software
Nieuws
Contact met MTA


Diverse links
Zoeken op internet

About search engines

This was originally written for one of our customers. Because it is frequently asked for by many parties it is here provided AS IS and does not prentend to offer a complete picture, the matter is far too complex for that. This page is update November 29, 2005.
Any remarks or suggestions are welcomed by e-mail.

Search engines

When addressing this subject one should realise that the Internet is a dynamic, developing and growing entity. As this is written it may already be outdated or overcome, bypassed or obsolete. Nevertheless, some basics.

What are search engines and how do they work?
Search engines are basically huge databases that help you to find information on the Internet. Since it is not possible for you to know exactly where the information that you are looking for resides, one can always query these databases to find and locate information.
The bigger global engines are www.google.com, www.hotbot.com, www.lycos.com, www.altavista.com, www.infoseek.com, www.webcrawler.com, www.excite.com and the like. National, global or specific area's of interest all have their own search engines, there are quite a lot of them.

How do search engines index the Internet?
Well, basically by starting at the homepage of a site, any site, and indexing that page. All the links that are found will be traced and indexed. So, if the homepage has a link to a page with contact information, then that page will be indexed as well. If this contact page offers a link to the company that built the site, than that entire site will be indexed as well. You understand this goes everywhere. This process is called spidering a website, when a spider comes along the web and obtains information about linked pages.
Once your page or site is linked to from any other page on the Internet, than it's just a matter of time and your site will be visited by search engines...

Once an engine has indexed a site it may take 2 weeks till 6 months before it is actually shows up in the result listing depending on the search engine.
Every now and then, typically every two weeks, the search engines will revisit all the indexed pages to check if they still exist and if so if the content has changed. The index will be updated accordingly and any changes will be available instantly. This implies that it takes time to get listed in search engines, and also to get out, even though the site or pages may be non existing.

How to control search engines?
Various tools and methods are available to influence search engines in if, and how a site is indexed. One is by means of a robots.txt file in the root of the site, or META tags in the individual pages. Good reference on his matter can be found at AltaVista or at searchenginewatch.com and many other search engines offer background information on this subject.

How to get a better ranking in search engines?
In other words, "How to get your site or pages best positioned in the results to a query issued to a search engine?"
Specially here one faces the dynamic character of the Internet. There is no absolute answer to this question yet some basic guidelines are these.

  1. Title of the page
    The title of the pages should specifically reflect the content of that page. Don't name all the pages "Company website ".
  2. Links to your site
    When another website links to your site, it is seen as an endorsement, a vote, for your site. Also, it will be more frequently visited by search engines, and be better positioned in the index.
    Don't forget links inside your site to your other pages. A navigation bar should be standard on all pages, so if a search engine throws most of your pages out of it's index, then the one's that remain offer links to all other pages to be reindexed.
  3. Page content
    This may look simple and yet don't forget that search engines can't read images, javascript or flash movies. If you use images for headings and navigation buttons than be sure to add the ALT tag.

Those are the main ingredients that determine your ranking. Of course some fine-tuning can be done:

  1. Don't use frames
    Search engines are primitive creatures and technologically compatible to NN2.0 or IE3.0 and these browsers can't read framed sites. If you do use them anyway be sure to add content to the <NOFRAME> container with links to all of the important individual pages.
    Other con's to frames;
    • You can't bookmark a framed page, you'll end up bookmarking the homepage.
    • Visitors coming from search engines end up in a page that is supposed to be framed yet isn't because they are referred straight to the right page. No navigation or context is available then.
    • When having multiple browsers open at the same time, your page might turn up in the site of someone else...
    • These items can be overcome with some clever techniques utilising JavaScript and PHP.
  2. Make the content accessible
    Beware that SE's wont accept cookies, won't execute JavaScript, find frames a horde simply too high and can't submit forms. Several SE's refuse addresses that have parameters in the URL. Therefore, in case any of the before mentioned is used, navigation options have to be provided to all the pages you want to see indexed.
  3. Frequency of update
    A frequently updated site stands a better chance of not being forgotten or bypassed than those that are 'last revised Aug 15, 1985'. You may refresh the files on the webserver regularly even though the content may not be changed.
  4. Manually add your site to search engines
    Nowadays companies offers services to promote your website to the leading 400 search engines for free! Beware, the rumour goes that several search engines have started refusing sites that are added this way, even deleting them from their index as they see it as a form of spamming. Therefore if you want a site added to a specific search engine do it manually, search their site for a 'add your site' link or use our listing.
  5. Keywords META tag
    In the keywords META tag one puts keywords associated to the content, and are the words likely to be used by others trying to find your content. Don't forget to put the antithesis there to not just let people find the bad PR sites.
    You may find sites putting the same word several times in the META tag hoping for a better ranking, mind you this no longer works yet once did. If you put the same word more than twice in the keywords META that keyword will be ignored... this as an answer to sites trying to improve their ranking by keywords "sex, sex, sex, sex, sex, sex".
  6. Description META tag
    Typically these one or two sentences are shown by search engines in their result listing to a query. Be concise and simple, remember that people hate to read a lot.

Which search engines should a site be added to?
Well, this is usually overemphasised. The question "From where do you expect the majority of visitors?" should be leading in this consideration.
As guideline try this;

  • The major American/global search engines;
    People tend to grab the bigger ones and use them all the time. The top-ones are listed on our AddUrl page.
  • The major national search engines;
    See above, the Dutch most famous are listed on our AddUrl page.
  • Specific interest search engines, if applicable.
    If a site covers all aspects of dancing then it might be wise to find search engines that have specialised in that field. If the site is a 'general business site' then don't bother.

Remember that sooner or later your site will be indexed, whether you add it manually or not.

Technical details
For surfers with graphics turned off, you want both a background color and a background image. But if fonts have the same hex value, some search engines (HotBot is one) will penalize for spamming their index with hidden keywords, even with a different colored background image.
Example: if your background color is white and you use white letters in a navy table, or on a navy background image, you may trigger spam penalties that will reduce your page rankings.

META name="keywords" use up to 1,000 characters, including spaces and commas";