Newsletter and webmaster resources site   Frëe High Tech Equipment Tutorial!
  Advertise in SiteProNews SiteProNews Archives About SiteProNews SPN Privacy Statement FeedBack SiteProNews Homepage SiteProNews Image Map SEO-News Discussion Forums
  Stretch Your Budget - Advertise in SiteProNews
    QUICK LINKS
 
SEPT. 20,  ISSUE #843
Web Search

ExactSeek Links
      Add your Site
     Buy a Top 10 Listing
     Newsletter SignUp
     ExactSeek Member Login

Buy Results, Not Promises - Your Intial Deposit Matched to $100
SiteProNews Blog
SiteProNews has launched the SiteProNews Blog for webmasters. Drop by to read regular posts by two of the Web's top writers, Jim Hedger and Kim Roach.

Top SEO Tools

Try it Fr-e-e for 14 Days
SEO Tools and Services

Webmaster Tools
   Site Ranking Tool
   Meta Tag Generator
   Link Popularity Checker
   Search Engine Submitter
   Internet Tools Directory
   Site Resource Directory
 

SEO-News Forums
Join the SEO-News Forums to post comments, articles and tips or learn from SEO experts.
Forum Posts
Yahoo 198
Google 1666
SE Articles 123
Link Exchanges 329
General Discussion 157
Join the SEO-News Forums
Blog Search
   Add a Blog
   Search 9,000+ Blogs
   Grab a Blog RSS Feed
   Blog Express for RSS Feeds

ExactSeek Toolbar
Get the toolbar with spyware scanning, webpage keyword analysis, web search on multiple meta engines, popup-blocking, Alexa site ranking, word highlighting, auto-upgrade and erase browser cookies.
Download Version 2.3

Traffïc Exchanges
Get Frëe Traffïc for Your Site with these Traffïc Exchanges:


TrafficZap


TrafficSwarm


Site of the Day
BruceClay.com provides information, tips, and helpful hints on search engine optimization, search engine marketing, site analytics, and "how to" placement and ranking advice.

Does your web site qualify as a SPN Site of the Day? Webmaster resource sites can apply via email: sotd@sitepronews.com
 

App of the Day
Advanced RSS2Web 2.3.28 (2.9 MB) allows you to publish aggregated news pages or articles on your website. Download newsfeeds in RSS and XML format; reformat them according to user-defined html templates; and upload the results to your website via shared folder or ftp. Freeware for Windows 95 and above.

If you have a Webmaster App that you would like listed on the SPN site, send us an email with details to: wapps@sitepronews.com
 

Jayde Newsletters
Subscribe to SiteProNews, the Net's foremost Webmaster ezine or SEO-News, the weekly ezine for do-it-yourself website optimizers. Just enter your email address in the field below and use the Subscribe button.

HTML Newsletter
SiteProNews
SEO-News


Must Read Ebooks
SPN offers one of the best eBook libraries on the Web. Our current selection includes Commercial and over 183 Frëe eBooks.

Authors can submit eBooks to SiteProNews via email: ebooks@sitepronews.com
 

Link to SPN
Link your site to SiteProNews, the newsletter and resource site for Webmasters.

Or, Add SPN to your site with just 2 lines of Java-script code. Top content for your site without any of the work.

Visit our SPN Promotion Partners page. Some great sites have opted to support the SiteProNews newsletter.

SPN Partners
SubmitPlus - Promote your site to 110 search engines... Frëe!

Template Monster - The Web's number one website templates are available for immediate download.

PreWired.com - Providing ISPs & Publishers a Web based revenue stream!

FindMyHost.com... Review detailed Report Cards of web hosts who made the grade.

Web-Source... Your Guide to Professional Web Site Design & Development.

TheCgiSite.com - A directory of programming resources.

TechNewsletters.com - A search engine where you can review and subscribe to thousands of IT newsletters.

Frëe Alexa Toolbar
An indispensable tool for web professionals, providing Traffïc Data, Site Stats, and Contact Info for all the sites you visit!.

NewWebDirectory- A new internet web directory of professionally reviewed web sites providing both frëe and paid site submission.

FreeWebMonitoring - Monitor your web site's availability 24 hours a day, 7 days a week with ínstant email alerts and weekly web site performänce statistics.

Top 10 Exposure - Forget PPC. Get Google-Type ads for $3 - $4 per month and top 10 exposure across 230+ search engines & web directories.

 

Submit Plus
Blog Search
FindMyHost
Add Me.com
DesignerWiz
Web Position
Alexa Toolbar
SEO Company
SubmitExpress
Top SEO Tools
Website Builder
Top 10 Exposure
$100 Free-Traffic
NewWebDirectory
Website Templates
FreeWebMonitoring
Search Engine Tool
FreeWebSubmission



The Google Goal Of Indexing
100 Billion Web Pages

By Danny Wirken

Google's Goal of Quality Search

In their paper 'The Anatomy of a Large-Scale Hypertextual Web Search Engine' it is very evident that Google's goal has always been to be one of the best search engines there is in terms of the quality of the results it gives. Sergey Brin and Lawrence Page, however knew that in order to do this, Google needed to be able to store information efficiently and cost effectively and to have excellent crawling, indexing, and sorting methods or techniques. Google not only aimed to give quality results but to produce the results as fast as possible.

Editorial Note: Read Jim Hedger's blog post about Google's Matt Cutts comments on Domain Hijacking as a Blackhat Technique.

Everything You Know About SEO is Dead Wrong - Download this Frëe Eye-Opening 21 Page Report!

Google started as a high quality search engine and continues to be the best search engine today. It has managed to stay true to its original intent to be a search engine that not only crawls and indexes the web efficiently but also a search engine that produces more satisfying results in comparison to other existing search engines. To stay true to the goal of providing the best search results, Google knew right from the start that it had to be designed so that the search engine could catch up with the web's growth. According to Brin and Page "In designing Google we have considered both the rate of growth of the Web and technological changes. Google is designed to scale well to extremely large data sets. It makes efficient use of storage space to store the index". They knew that they needed much space to store an ever growing index.

Google's index size, which started out as 24 million web pages, was large for its time and has grown to around 25 billion web pages, still keeping Google ahead of its competitors. However, Google is a company that doesn't settle for just beating the competitors. They truly aim to give their users the best service there is and that means as a search engine they want to give users access to all or at least most of the quality information that is available on the web.

Google's New System for Indexing More Pages

As mentioned earlier, Google aims to give access to even more information and has been devoting time and much effort to realize this goal. It seems that the new patent entitled 'Multiple Index Based Information Retrieval System' filed by Google employee Anna Patterson might be the answer to the problem. The patent published just this May of 2006 and filed way back in January of 2005 shows that Google might actually be aiming to expand their index size to as much as a 100 billion web pages or even more.

Advertise on 1000's of Sites for Frëe!

According to the patent, conventional information retrieval systems, more commonly known as search engines, are able to index only a small part of the documents available on the Internet. According to estimates, the existing number of web pages on the Internet as of last year was around 200 billion; however, Patterson claimed that even the best search engine (that is Google) was able to index only up to 6 to 8 billion web pages.

The disparity between the number of indexed pages and existing pages clearly signaled a need for a new breed of information retrieval system. Conventional information retrieval systems just weren't capable of doing the job and just wouldn't be able to index enough web pages to give users access to a large enough percentage of the present existing information available on the web.

The Multiple Index Based Information Retrieval System, however, is up to the challenge and is Google's answer to the problem. Two characteristics of the new system makes it stand out compared to the conventional systems. One is that it has the "capability to index an extremely large number of documents, on the order of a hundred billion or more". And the other is its capability to "index multiple versions or instances of documents for archiving...enabling a user to search for documents within a specific range of dates, and allowing date or version related relevance information to be used in evaluating documents in response to a search query and in organizing search results."

Create Dynamic Talking Characters for Your Website!

With the new system developed by Patterson, Google now has the ability to expand its index size to unbelievable proportions as well as improve document analysis and processing, document annotation, and even the process of ranking according to contained and anchor phrases.

History of Google's Index Size

Google started out with an index size of around 24 million web pages in 1996. By August of 2000, Google had managed to quadruple their index size to approximately one billion web pages. In September of 2003, Google's front-page boasted an index of 3.3 billion web pages. Microdoc, however, revealed that the actual number of web pages Google had indexed during that time was already more than five billion web pages. In their article 'Google Understates the Size of Its Database', they emphasized that Google not only specialized in simplicity but also in understating their power and complexity. Google was still managing to stay ahead of its competitors and continued to surprise everyone with what they had up their sleeves.

Forget Expensive PPC Advertising

Get a Google-Type Ad with Top 10 Exposure across 230+ search engines and web directories delivering 150 Million+ Searches/Mo.

$3 - $4/Month - Quick Inclusion - World Wide Placement!
Your Keywords - No Bidding - No Click Fraud - Stats Tracking


Sign Up Today - Receive 3 Bonuses Valued at $90

As Google's index continued to grow the number in their front page grew impressively large as well before it plateaued at eight billion web pages. This was around the time that Patterson filed the new patent. Then in 2005, with controversies in index size growing, Google decided to stop counting in front of the public and simply claimed that their index size was three times larger than the nearest competitor's index size. Google also maintained that it was not just the size of indexed pages that was important but how relevant the results they returned were.

Then in September of 2005, as part of Google's 7th anniversary, Anna Patterson, the same software engineer who filed the patent on the Multiple Based Index Information Retrieval System posted an entry on Google's official blog claiming that the index size was now 1,000 times larger than the original index. This pegged their index size at around 24 billion web pages, about a fourth of Google's goal of indexing a 100 billion web pages. It seems then that Google must have started using the new system in mid 2005. With the new system in place, we can only wait and see how fast Google will reach the goal of a 100 billion web pages in its index. It's most likely though that when Google has reached that goal it will set an even higher goal to provide continuous quality service.


About The Author
Danny Wirken is co-owner of http://www.theinternetone.net an internet marketing website that primarily focuses on the many aspects, methodologies and processes that are used in internet marketing.



Printer Friendly Version of this Article


Recommended Articles and Blog News for Webmasters

Blogging: A Start-Up Guide
How To Build An Authority Site
Search Marketing and Social Media
No News is Goog News in Belgium
The Orion Algorithm and Your SEO Efforts
Social Media Optimization: Advice from the Best
9 Practical Reasons Why Web-Audio Is A Necessity
Organic SEO or Pay-Per-Click Advertising - Which Should You Choose?
Matt Cutts on Good Karma - Domain Hijacking as a Blackhat Technique

Need Content for Your Website - GoArticles.com has 250,000+ Articles
Add a RSS feed or Javascrïpt feed in seconds.


Webmaster Resource Sites & Services

Top SEO Tools - A suite of the best online submission, reporting and SEO Tools available. Sign up for a frëe 14 day trial.


Add Me! - a pioneer in search engine submission, and the most popular. They offer frëe submission and paid submission.

Google Ranking Secrets Revealed! Boost Your Google Ranking,
Get More Orders, And Make More Monëy!


Build Your Traff'ic with ABCSearch
Get $100 of FR-E-E qualified Visitors. Sign-up today and we'll match any initial deposit up to $100. Geo-targeting, full reporting and one clíck results!


Recommended Webmaster Tools & Services

Net Research Server (NRS) - a complete search engine solution for sites wanting to host entire web search, industry specific search, site search, or directory search. NRS also enables users to submit listings, create alerts, and organize bookmarks.

Select from 1000's of Quality Templates
Need a new site look? Select from thousands of professional designs for a fraction of web design costs. Get a multi-page website up in just a few hours.

Build a Business Website in Under 5 Minutes.
Over 172,000 people just like you have used Exact Websites to build professional websites, complete with web pages, photo albums, email, links and 27 other features without ever having built a website before.

WebPosition
WebPosition helps you maximize your site's search engine visibility by providing a complete SEO solution including rank reporting, keyword research, page optimization and submission. Download a frëe demo today!

Have an Opinion on Today's Article?
Post Your Comments in the SEO-News Forums
Sign Up for FR-E-E and Participate

 

  SiteProNews - The Net's most widely read Webmaster newsletter


(c) Copyright 2006 All rights reserved. Jayde Online, Inc.
Web design by
ControlV.